BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy13797
         (339 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|328792924|ref|XP_395039.4| PREDICTED: lysine-specific demethylase NO66-like [Apis mellifera]
          Length = 489

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 188/338 (55%), Positives = 242/338 (71%), Gaps = 2/338 (0%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WEK P++I R    YY  L+++  +D ILR++ I F KNIDIT YE+G R+T +  GRA
Sbjct: 69  NWEKTPVHIKRNFPKYYKLLMSTPMLDKILRESYILFTKNIDITSYENGVRETHNPIGRA 128

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           +P +VWD+Y  GCS+R LNPQTYI  L  LNA+LQE FGCF+GAN+Y TPP +QGFAPHY
Sbjct: 129 VPTVVWDYYMNGCSVRMLNPQTYIPKLHSLNATLQEFFGCFIGANSYLTPPNSQGFAPHY 188

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAFILQ+EGKK+W++Y PR  +EYLPRYSS NFSQ EIG PIL   +  GDLLY PR
Sbjct: 189 DDIEAFILQIEGKKRWRLYKPRNQNEYLPRYSSENFSQFEIGEPILDTIVNEGDLLYFPR 248

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G IHQ  T+ N HSLH+T+SVYQK +W D LEK +P AL AA   + EFR+GLPI YLRY
Sbjct: 249 GTIHQGETIDNTHSLHITLSVYQKNSWGDFLEKLLPNALTAAIKENYEFRKGLPIDYLRY 308

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
            G    +  +   D+   KE +K LL+KL++Y+D+D+  D M K  +HD LPPVLS  E 
Sbjct: 309 CGYVHSEIQNNSKDKF--KEKIKHLLSKLIDYIDIDEAADLMAKNHIHDFLPPVLSESER 366

Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           +CSV ++G +M + G V N  +I  DT+ RL R++ +R
Sbjct: 367 ECSVTQDGEKMIENGIVNNRVEIEPDTRFRLARSHCIR 404


>gi|340723554|ref|XP_003400154.1| PREDICTED: lysine-specific demethylase NO66-like [Bombus
           terrestris]
          Length = 492

 Score =  403 bits (1036), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 188/340 (55%), Positives = 242/340 (71%), Gaps = 4/340 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++WEK P+++ R    YY  L+++  +D ILR++ I F KNIDIT YE+G R+T +  GR
Sbjct: 71  NNWEKTPVHVKRNFHKYYKLLMSTPMLDKILRESYILFTKNIDITSYENGIRETHNPVGR 130

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A+P +VWD+Y  GCS+R LNPQTYI  L  LNA+LQE FGCFVGAN+Y TPP +QGFAPH
Sbjct: 131 AIPSVVWDYYMNGCSVRMLNPQTYIPKLHSLNATLQEFFGCFVGANSYLTPPNSQGFAPH 190

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAFILQ+EGKK+W++Y PR  +EYLPRYSS NFSQ EIG PIL   +  GDLLY P
Sbjct: 191 YDDIEAFILQIEGKKRWRLYKPRNENEYLPRYSSKNFSQSEIGEPILDTVVNAGDLLYFP 250

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQ  T+ N HSLH+T+SVYQK +W D LEK +P AL  A  +D EFR+GLPI YLR
Sbjct: 251 RGTIHQGETIDNTHSLHITLSVYQKNSWGDFLEKLLPNALTTAIESDSEFRKGLPIDYLR 310

Query: 241 YAGLARGKPVDIQ-ADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
             G A     DIQ + +   KE +K L  KL+EY+D+D   D M K  +HD LPPV+S  
Sbjct: 311 QCGFAYS---DIQNSAKDEFKERIKHLFNKLIEYIDIDKAADLMAKNHIHDCLPPVISEN 367

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           E +CS  ++G +M + G V N  +I  DT++RL+R++ +R
Sbjct: 368 EQECSAVQDGEKMIENGIVENRVEIEPDTRIRLLRSHCIR 407


>gi|380022474|ref|XP_003695070.1| PREDICTED: lysine-specific demethylase NO66-like [Apis florea]
          Length = 489

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 187/338 (55%), Positives = 241/338 (71%), Gaps = 2/338 (0%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WEK P++I R    YY  L+++  +D ILR++ I F KNIDIT YE+G R+T +  GRA
Sbjct: 69  NWEKTPVHIKRNFPKYYKLLMSTPMLDKILRESYILFTKNIDITSYENGIRETHNPVGRA 128

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           +P +VWD+Y  GCS+R LNPQTYI  L  LNA+LQE FGCF+GAN+Y TPP +QGFAPHY
Sbjct: 129 VPTVVWDYYMNGCSVRMLNPQTYIPKLHSLNATLQEFFGCFIGANSYLTPPNSQGFAPHY 188

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAFILQ+EGKK+W++Y PR  +EYLPRYSS NFSQ EIG PIL   +  GDLLY PR
Sbjct: 189 DDIEAFILQIEGKKRWRLYKPRNQNEYLPRYSSENFSQFEIGEPILDTIVNEGDLLYFPR 248

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G IHQ  T+ N HSLH+T+SVYQK +W D LEK +P AL  A   + EFR+GLPI YLRY
Sbjct: 249 GTIHQGETIDNTHSLHITLSVYQKNSWGDFLEKLLPNALTTAIKENYEFRKGLPIDYLRY 308

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
            G    +  +   D+   KE +K LL+KL++Y+D+D+  D M K  +HD LPPVLS  E 
Sbjct: 309 CGYVHSEIQNNSKDKF--KEKIKHLLSKLIDYIDIDEAADLMAKNHIHDFLPPVLSESER 366

Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           +CSV ++G +M + G V N  +I  DT+ RL R++ +R
Sbjct: 367 ECSVTQDGEKMIENGIVNNRVEIEPDTRFRLARSHCIR 404


>gi|307169522|gb|EFN62164.1| JmjC domain-containing protein CG2982 [Camponotus floridanus]
          Length = 447

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 187/337 (55%), Positives = 241/337 (71%), Gaps = 3/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE  P++I R   NYY  ++++  +D ILRDN I F KNIDIT Y DG R+T +  GRAL
Sbjct: 29  WESTPIHIKRNNSNYYKWIVSTSMLDKILRDNYILFTKNIDITSYVDGVRETHNPAGRAL 88

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P +VWD+Y   CS+R LNPQTYI  L  +NA+LQE FGCFVGAN Y TPP +QGFAPHYD
Sbjct: 89  PSVVWDYYLNDCSVRMLNPQTYIPQLHLVNATLQEFFGCFVGANLYLTPPNSQGFAPHYD 148

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAFILQ+EGKK+W++Y+P  ++EYL R SS NF Q EIG PIL   +  GDLLYLPRG
Sbjct: 149 DIEAFILQIEGKKRWRLYMPPSINEYLARCSSKNFDQSEIGEPILDTIVNAGDLLYLPRG 208

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQ +T+ + HSLHVTIS+YQ+++W DLLEK +P+AL+ A  TD +FRRGLP+GY RY 
Sbjct: 209 TIHQGTTIDDTHSLHVTISLYQRSSWCDLLEKVLPQALKRATETDWKFRRGLPVGYFRYT 268

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+A     +    R+A KE++KD++  L+ Y+DLD   D M K  +HD LPP+LS  E +
Sbjct: 269 GIAHS--TNKTEHRMAFKEDVKDMVTNLINYIDLDSAADLMAKDHIHDFLPPLLSKIEEE 326

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           CSV ++G  M   G V N   IT +TK+RL+R + +R
Sbjct: 327 CSVAKDGEIMIN-GIVKNRVNITLETKIRLLRIHCIR 362


>gi|350426682|ref|XP_003494511.1| PREDICTED: lysine-specific demethylase NO66-like [Bombus impatiens]
          Length = 492

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 186/340 (54%), Positives = 240/340 (70%), Gaps = 4/340 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++WEK P+++ R    YY  L+++  +D ILR++ I F KNIDIT YE+  R+T +  GR
Sbjct: 71  NNWEKTPVHVKRNFHKYYKLLMSTPMLDKILRESHILFTKNIDITSYENDVRETHNPVGR 130

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A+P +VWD+Y  GCS+R LNPQTYI  L  LNA+LQE FGCFVGAN+Y TPP +QGFAPH
Sbjct: 131 AIPSVVWDYYMNGCSVRMLNPQTYIPKLHSLNATLQEFFGCFVGANSYLTPPNSQGFAPH 190

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAFILQ+EGKK+W++Y PR  +EYLPRYSS NFSQ EIG PIL   +  GDLLY P
Sbjct: 191 YDDIEAFILQIEGKKRWRLYKPRNENEYLPRYSSKNFSQSEIGEPILDTVVNAGDLLYFP 250

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQ  T+ N HSLH+T+SVYQK +W D LEK +P AL  A   D EFR+GLPI YLR
Sbjct: 251 RGTIHQGETIDNTHSLHITLSVYQKNSWGDFLEKLLPNALTTAIEADSEFRKGLPIDYLR 310

Query: 241 YAGLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
             G A     DIQ + +   KE +K L  KL+EY+D+D   D M K  +HD LPP++S  
Sbjct: 311 QCGFAYS---DIQNNAKDEFKERIKHLFNKLIEYIDIDKAADLMAKNHIHDCLPPIISES 367

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           E +CS  ++G +M + G V N  +I  DT++RL+R++ +R
Sbjct: 368 EQECSAVQDGEKMIENGIVENRVEIEPDTRIRLLRSHCIR 407


>gi|383850995|ref|XP_003701049.1| PREDICTED: lysine-specific demethylase NO66-like [Megachile
           rotundata]
          Length = 615

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 187/340 (55%), Positives = 243/340 (71%), Gaps = 4/340 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++WEK P++I R   NYY  L+++  +D ILR++ I F KNIDIT YE+G R+T +  GR
Sbjct: 194 NNWEKTPVHIKRNFPNYYKLLMSTPMLDKILRESYIMFTKNIDITSYENGVRETHNPAGR 253

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A+P +VWD+Y  GCS+R LNPQTYI  L  LNA+LQE FGCFVGAN+Y TPP +QGFAPH
Sbjct: 254 AVPSVVWDYYMNGCSVRMLNPQTYISKLHSLNATLQEFFGCFVGANSYLTPPNSQGFAPH 313

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAFILQ+EGKK+W++Y PR   EYLPRYSS NFS+ EIG PIL   +  GDLLY P
Sbjct: 314 YDDIEAFILQVEGKKRWRLYKPRNESEYLPRYSSENFSEAEIGEPILDTIVNAGDLLYFP 373

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQ  T+ + HSLH+T+SVYQK +W D LEK +P+AL  A  TD EFR+GLP+ YLR
Sbjct: 374 RGTIHQGETLDDTHSLHITLSVYQKNSWGDYLEKLLPEALNTAIRTDSEFRKGLPLDYLR 433

Query: 241 YAGLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
           + G    +    Q++ R   K  +KDL  KL+E++D+D   D M K  ++D LPPVLS  
Sbjct: 434 HNGHVYSES---QSNGREEFKAKVKDLFTKLIEHIDIDKAADMMAKNHVYDFLPPVLSES 490

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           E +CSV ++G +M   G V N  +I  DT+VRL+R++ +R
Sbjct: 491 EQECSVVQDGEQMVANGIVKNRVEIEPDTRVRLLRSHCIR 530


>gi|322793511|gb|EFZ17039.1| hypothetical protein SINV_11691 [Solenopsis invicta]
          Length = 494

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 173/338 (51%), Positives = 231/338 (68%), Gaps = 4/338 (1%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WEK P+Y+ R    YY  ++++  +D +LR+N + F KNIDIT Y +G R+T +  GRA
Sbjct: 80  NWEKAPMYVRRDMPEYYERIVSTDLLDKVLRNNHLLFTKNIDITSYSNGVRETHNPSGRA 139

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           +P IVWD+Y   CS+R LNPQT+I  +  LNA+LQE FGCFVGAN Y TPP +QGFAPHY
Sbjct: 140 VPSIVWDYYNNKCSVRMLNPQTFIPKIHALNATLQEFFGCFVGANLYLTPPNSQGFAPHY 199

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DD+EAFILQ+EG+K+W++Y P   +E+LPRYSS NF Q EIG PIL   ++ GDLLYLPR
Sbjct: 200 DDVEAFILQIEGQKRWRLYKPLSENEFLPRYSSKNFDQSEIGEPILDTIVKAGDLLYLPR 259

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G IHQ  T+ N HSLHVT+SVYQK +W D LEK +P+AL  A   D  FR GLP+ YL Y
Sbjct: 260 GTIHQGVTLDNVHSLHVTLSVYQKNSWCDFLEKVLPQALNRAAENDSRFREGLPLNYLTY 319

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
            G A  +       +   K  +KDL+  ++E++D+D+  D M K  +HD LPP L   E 
Sbjct: 320 VGTAHNE----NNQKNIFKNKIKDLITLMIEHIDIDNAADLMAKNQIHDFLPPFLPKNER 375

Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           + +V E+G  M+  G V N  +IT +TK+RL+R + +R
Sbjct: 376 KRTVAEDGEIMTSNGVVKNYPEITLETKIRLLRWHCIR 413


>gi|156540940|ref|XP_001599259.1| PREDICTED: lysine-specific demethylase NO66-like [Nasonia
           vitripennis]
          Length = 448

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 173/339 (51%), Positives = 226/339 (66%), Gaps = 2/339 (0%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WEK PL+I R +  YY +L+T+  +D ILR N I F KNIDIT + + +R+T +  GRA
Sbjct: 25  NWEKNPLHIKRNKPKYYQSLMTTPMLDQILRKNYILFTKNIDITSFTNNKRETHNPHGRA 84

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           LP IVWD+Y  GCS+R LNPQT+I  L  LNA LQE FGCFVG+N+Y TPP +QGFAPHY
Sbjct: 85  LPSIVWDYYANGCSVRLLNPQTFIPKLHTLNAILQEFFGCFVGSNSYLTPPNSQGFAPHY 144

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAFILQ+EGKK+W++Y P+  +E LPRYSS NFSQ+++G PIL   +E GD+LY PR
Sbjct: 145 DDIEAFILQIEGKKRWRLYKPKTDNEILPRYSSLNFSQQDLGEPILNTVIEAGDILYFPR 204

Query: 182 GYIHQASTVT-NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           G IHQ  T   +EHSLH+T+SVYQ+ +W D LEK +P AL  A   D  FR+GLPI YLR
Sbjct: 205 GTIHQGDTFGFDEHSLHITLSVYQRNSWGDFLEKLIPAALNNAIKNDSRFRKGLPINYLR 264

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           Y G    +  D    R     +   LL  L + +D+D   D M    +HD LPPV +  E
Sbjct: 265 YIGSTYTENFDCPI-RKKFSNHTISLLTILSKCIDIDKAADLMAVNHIHDFLPPVFNQNE 323

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
             CS+++ G +M + G V N  +I  +T ++L R +  R
Sbjct: 324 RLCSIYDGGEKMIKNGAVINRVEIEPNTLIKLSRGHCAR 362


>gi|307209000|gb|EFN86200.1| JmjC domain-containing protein CG2982 [Harpegnathos saltator]
          Length = 621

 Score =  364 bits (935), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 172/339 (50%), Positives = 231/339 (68%), Gaps = 3/339 (0%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           +++EK P++I R + +YY +L+++  +D+ILR+  + F KNIDIT Y +G R+T +  GR
Sbjct: 183 NNFEKRPVHIQRNKPSYYLSLVSTPLLDEILREKNVLFTKNIDITSYTNGVRETYNPTGR 242

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A+P  VWD+Y   CS+R LNPQT+I  +  LNA+LQE FGCFVG+N Y TPP +QGFAPH
Sbjct: 243 AVPNAVWDYYMNKCSVRMLNPQTFIHKIYMLNATLQEYFGCFVGSNLYLTPPGSQGFAPH 302

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF+LQ+EGKK+W++Y P     YL RYSS NF + EIG PIL   L+ GDLLY P
Sbjct: 303 YDDIEAFVLQVEGKKRWRLYRPPN-KSYLARYSSRNFDESEIGEPILDTILQAGDLLYFP 361

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQA T  + HSLH+T+SVYQK +W D LEK +P  L+ A  ++  FR GLP+ YL+
Sbjct: 362 RGTIHQAETADDTHSLHITLSVYQKNSWYDFLEKLLPLTLKRAAESNYNFRTGLPLDYLK 421

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           Y G A     + QA R   +E +K     L++Y+D+D   D M K  +HD LPP L+ EE
Sbjct: 422 YVGTAHSTS-NGQA-RKIFEEKIKTFFGLLIDYIDVDSAADLMAKSHIHDFLPPALTLEE 479

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            QCSV  +G  M+Q G V    QI  DT +RL+R++ +R
Sbjct: 480 QQCSVARDGDIMTQNGNVKRCVQIESDTMIRLIRSHCIR 518


>gi|157123061|ref|XP_001653808.1| hypothetical protein AaeL_AAEL009382 [Aedes aegypti]
 gi|122105602|sp|Q16W06.1|NO66_AEDAE RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|108874534|gb|EAT38759.1| AAEL009382-PA [Aedes aegypti]
          Length = 635

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 172/339 (50%), Positives = 225/339 (66%), Gaps = 2/339 (0%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++WEK+PL I RK  +YY+NL++  KID++LR N IE+ KNID+T Y +GQR+T + +GR
Sbjct: 222 NYWEKKPLLIQRKNPSYYSNLLSRAKIDEMLRQNNIEYTKNIDVTSYREGQRETHNPDGR 281

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            LP  +W  Y EGCSIR LNPQTY+  + ++N  LQE F C  GAN Y TPP +QGFAPH
Sbjct: 282 VLPPDMWSFYEEGCSIRMLNPQTYLPGVYEMNVKLQEFFHCMTGANFYLTPPNSQGFAPH 341

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF+LQ+EG+K WK+Y PR   E L R SSPNF+QEEIGTPIL V LEPGDLLY P
Sbjct: 342 YDDIEAFVLQVEGRKHWKLYSPREPAEMLARVSSPNFTQEEIGTPILEVVLEPGDLLYFP 401

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQASTV   HSLHVT+SVYQK  W DLLE   P AL  A  T  + RRG+P+   +
Sbjct: 402 RGIIHQASTVPGHHSLHVTMSVYQKNCWADLLELFFPHALAQAAETHFDLRRGIPLNLHQ 461

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           + G+           R  +  ++K L+ K+     +D  VD++ K+  HDALPP++S  E
Sbjct: 462 HFGIVHSD--SETPARKQLISHIKSLVDKVFSEDAIDSAVDQLAKRFQHDALPPLISNTE 519

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
              +V+ +    +  G V      T+++ VRL+R N +R
Sbjct: 520 QSTTVYGSSFSHNPDGTVSLRVPFTENSTVRLLRCNVLR 558


>gi|189236258|ref|XP_974397.2| PREDICTED: similar to CG2982 CG2982-PB [Tribolium castaneum]
 gi|270005824|gb|EFA02272.1| hypothetical protein TcasGA2_TC007936 [Tribolium castaneum]
          Length = 568

 Score =  357 bits (916), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 166/338 (49%), Positives = 236/338 (69%), Gaps = 7/338 (2%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WE++PLYI R  ++YY +++ S  +D ILR+N + F +N+D+  YE+G++Q  + EGRA
Sbjct: 151 YWEQKPLYIKRGNRSYYTHILDSSSLDKILRNNSLFFTRNVDVVTYENGEKQVFNQEGRA 210

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
            P  +WD+Y  GCSIR LNPQTY   +  L A+LQE FG  VGAN Y TPP +QGFAPHY
Sbjct: 211 TPSALWDYYGNGCSIRVLNPQTYNHKVHLLLATLQEYFGTMVGANVYLTPPGSQGFAPHY 270

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF++QLEG+K WK+Y P+  D  L R+SSPNF +E++G P + +TL  G+LLY PR
Sbjct: 271 DDIEAFVVQLEGRKHWKLYQPKSED-VLARFSSPNFKREDLGEPFMELTLNAGELLYFPR 329

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G IH+  T  + HSLH+T+SVYQ+T+++DLLE  +PKAL+ A  +D+EFR+GLP+ YL+ 
Sbjct: 330 GTIHEGRTDEDSHSLHITVSVYQQTSYVDLLEHILPKALKKAADSDVEFRKGLPLNYLKD 389

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
            GLA G       +R  +   +KDL+  L+ YVD+D   D++GK+ MHD++PPVL+  E 
Sbjct: 390 FGLAAG-----SKNRKFVTSKIKDLMNSLINYVDIDSAADQLGKKFMHDSMPPVLTKSEA 444

Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           + S   +G  + + G V N  +I  DT+VRL+R   +R
Sbjct: 445 ERSSKADGPVL-KDGVVKNRVEIDLDTRVRLLRYYCIR 481


>gi|170044529|ref|XP_001849897.1| nucleolar protein 66 [Culex quinquefasciatus]
 gi|284433498|sp|B0WMG3.1|NO66_CULQU RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|167867637|gb|EDS31020.1| nucleolar protein 66 [Culex quinquefasciatus]
          Length = 648

 Score =  353 bits (905), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 170/337 (50%), Positives = 221/337 (65%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK+P  + R    YY NL++  KID++LR+N IE+ KN+D+T Y +G R+T + +GRAL
Sbjct: 237 WEKKPFLVQRNDPTYYANLLSRGKIDEMLRNNNIEYTKNLDVTSYREGVRETHNPDGRAL 296

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  VW  Y EGCSIR LNPQTY+  + ++N  LQE F C  G+N Y TPP +QGFAPHYD
Sbjct: 297 PPDVWAFYEEGCSIRMLNPQTYLPGVYEMNVKLQEFFHCMTGSNFYLTPPNSQGFAPHYD 356

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQ+EG+K WK+Y PR   E L R SSPNF+QEEIG PIL VTLEPGDLLY PRG
Sbjct: 357 DIEAFVLQVEGRKHWKLYSPRTASEVLARVSSPNFTQEEIGVPILEVTLEPGDLLYFPRG 416

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQASTV   HSLHVT+SVYQK +W DLLE  +P AL  A    LE RRG+P    ++ 
Sbjct: 417 IIHQASTVPGHHSLHVTMSVYQKNSWADLLELYLPHALSQAAENHLELRRGIPQDLHQHF 476

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+      +    R  + + +K L+ K+     +D  VD++ K+  HDALPP+L+ +E  
Sbjct: 477 GIVHSD--NETPTRKDLIKKIKSLVDKIFSEEAIDVAVDQLAKRFQHDALPPLLTDQERA 534

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            + +      +  G V   T  T+ T +RL+R N VR
Sbjct: 535 QTAYGANYAFNPDGTVPLQTAFTERTTIRLLRRNIVR 571


>gi|148234196|ref|NP_001082857.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           NO66 [Danio rerio]
 gi|284433499|sp|A3KP59.1|NO66_DANRE RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|126631921|gb|AAI34173.1| Zgc:162967 protein [Danio rerio]
          Length = 544

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 160/339 (47%), Positives = 225/339 (66%), Gaps = 2/339 (0%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+WEK+P+ I R+  +YY  L ++ + D ILR++ +++  N+D+T Y +G+R+T +  GR
Sbjct: 124 DNWEKKPILIQRQNADYYKGLFSTAEFDRILRNDDVQYGVNLDVTSYTNGKRETHNPPGR 183

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           ALP  VWD Y  GCSIR LNPQ +   + Q+ + LQE FG   GAN Y TPP  QGFAPH
Sbjct: 184 ALPYTVWDFYESGCSIRMLNPQAFSSTVWQVLSVLQEKFGSMAGANVYLTPPGTQGFAPH 243

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +DDIEAF++QLEG+K W+VY PR  DE L   SSPNFSQ+EIG P++ V LE GDLLY P
Sbjct: 244 FDDIEAFVVQLEGRKHWRVYNPRCEDEVLSLVSSPNFSQDEIGEPVMDVVLEAGDLLYFP 303

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG++HQ   + + HSLH+TIS YQ+ +W DL+ K MP AL+ A   D+EFR+GLP+ YL+
Sbjct: 304 RGFVHQGDCLPDAHSLHITISSYQRNSWGDLMLKLMPAALEVAMEEDVEFRKGLPLDYLQ 363

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           Y G+   +  D + DR     +++ L+ KLV +  +D  VD+  K  +HD LPP+L+ EE
Sbjct: 364 YMGVQNSEKEDPRRDRFMA--HIQGLMKKLVSFAPVDAAVDQKAKDFLHDCLPPLLTAEE 421

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
              SV+    R   +  +  A ++   T+++LVRA A R
Sbjct: 422 KAGSVYGAPARWGDSEALDVAVELKSQTRIKLVRAGAAR 460


>gi|71894875|ref|NP_001026367.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           NO66 [Gallus gallus]
 gi|75571463|sp|Q5ZMM1.1|NO66_CHICK RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|53127257|emb|CAG31022.1| hypothetical protein RCJMB04_1k4 [Gallus gallus]
          Length = 601

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 167/338 (49%), Positives = 216/338 (63%), Gaps = 2/338 (0%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HWE+ PL + R    YY  L ++   D ILR   + F  ++D+T Y +G R+T +  GRA
Sbjct: 182 HWERAPLLVQRGDPGYYAGLFSTADFDAILRSGDVHFGTHLDVTSYAEGVRETHNPVGRA 241

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           LP +VWD Y+ GCS+R L+PQ +   +    + LQE FG   GANTY TPP  QGFAPHY
Sbjct: 242 LPAVVWDFYQNGCSLRLLSPQAFSTTVWHFLSILQEHFGSMAGANTYLTPPGTQGFAPHY 301

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF+LQLEGKK W+VY PR   E LP++SS N +Q E+G P+L V LE GDLLY PR
Sbjct: 302 DDIEAFVLQLEGKKHWRVYGPRTSSEALPQFSSANLTQAELGEPLLEVVLEAGDLLYFPR 361

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G+IHQA  + + HSLH+T+S YQ+ +W D LEK +P ALQ A   DLE+R+GLP+  L Y
Sbjct: 362 GFIHQADCLPDAHSLHITVSSYQRNSWGDFLEKLLPAALQMALEEDLEYRQGLPMDCLGY 421

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
            G+A    VD  A R A  E ++ L+ KLV+Y  +D  +D+  K  +HD LPPVL+  E 
Sbjct: 422 MGVANSDAVD--ARRTAFVEKVQHLIKKLVDYAPIDAAMDQRAKSFLHDCLPPVLTQSEK 479

Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           Q SV+    R    G      QITKDT+VRL+    VR
Sbjct: 480 QLSVYGFPARWQDGGPRNVDIQITKDTEVRLLHHGVVR 517


>gi|327259545|ref|XP_003214597.1| PREDICTED: lysine-specific demethylase NO66-like [Anolis
           carolinensis]
          Length = 618

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 169/339 (49%), Positives = 222/339 (65%), Gaps = 4/339 (1%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WEK+PL I R   NYY +L ++ + D ILRD  ++F  N+D+T Y+DG+R+T +  GRA
Sbjct: 199 YWEKKPLLIQRHNPNYYRDLFSTAEFDAILRDQDVQFGINLDVTSYQDGKRETYNPVGRA 258

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           LP +VWD Y+ GCS+R LNPQ +   +    + LQE FG  VGANTY TP   QGFAPHY
Sbjct: 259 LPAVVWDFYQNGCSLRMLNPQAFSSTVWHFLSILQEQFGSMVGANTYLTPAGTQGFAPHY 318

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF++QLEGKK W+VY PR   E LP +SS NF Q EIG PIL   LE GD+LY PR
Sbjct: 319 DDIEAFVIQLEGKKHWRVYSPRREAEVLPPFSSRNFDQAEIGEPILETVLEAGDMLYFPR 378

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G+IHQ   + N HSLH+T+S YQ+ +W DLLEK +P ALQ A   D+E+R+GLP+ YL Y
Sbjct: 379 GFIHQGDCLPNAHSLHITVSSYQRNSWGDLLEKLLPAALQMAIEEDVEYRQGLPMDYLEY 438

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
            G+     VD +  R +  + +  L+ KL +YV +D  VD+  K  +HD LPP+L+  E 
Sbjct: 439 MGVLNSDRVDPR--RASFMQKVSRLITKLTDYVPVDAAVDQKAKCFLHDCLPPMLTESEK 496

Query: 302 QCSVFENGLRMSQTGEVYNA-TQITKDTKVRLVRANAVR 339
             SV+ +  R  + G V NA  Q+   T VRL+R   VR
Sbjct: 497 ALSVYGHPARW-EDGSVRNADVQVKGTTWVRLLRHGIVR 534


>gi|301608699|ref|XP_002933911.1| PREDICTED: lysine-specific demethylase NO66-like [Xenopus
           (Silurana) tropicalis]
          Length = 584

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 164/338 (48%), Positives = 225/338 (66%), Gaps = 4/338 (1%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE++P  I R    YY+ L ++ + D ILR++ ++F  N+D+T Y DG R+T +  GRAL
Sbjct: 165 WEQKPALIKRDNHRYYDGLFSTTEFDRILREDNVKFGVNLDVTSYTDGTRETHNPPGRAL 224

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P +VWD+++ GCS+R LNPQ++ + +  + + LQELFG  VGANTY TP   QGFAPHYD
Sbjct: 225 PLVVWDYFKNGCSLRLLNPQSFCRTVWNVLSVLQELFGSMVGANTYLTPAGTQGFAPHYD 284

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++QLEGKK W+VY PR   E+LP++SS NF   +IG PIL   LE GDLLY PRG
Sbjct: 285 DIEAFVIQLEGKKHWRVYNPRNNSEFLPQFSSVNFRDRDIGEPILETVLEAGDLLYFPRG 344

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQ   + + HSLH+TIS YQ+ +W DLLEK +P ALQ A   D+EFR+GLP+ YL Y 
Sbjct: 345 FIHQGDCLPDAHSLHITISSYQRNSWADLLEKVLPAALQVAVEEDVEFRKGLPLDYLEYM 404

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+      D + D  A  +N++ L+ KL +Y  +D  VD+  K  +HD LPPVL+P E  
Sbjct: 405 GVQHADLKDPRRD--AFIQNIQSLIKKLSDYAPVDAAVDQKAKNFLHDCLPPVLTPAEKT 462

Query: 303 CSVFENGLRMSQTGEVYNAT-QITKDTKVRLVRANAVR 339
            SV    + +  +G+V + T QI  +TK+ L+R    R
Sbjct: 463 HSVHGAPVTLD-SGKVKDDTLQIELETKICLLRGGIAR 499


>gi|346471459|gb|AEO35574.1| hypothetical protein [Amblyomma maculatum]
          Length = 532

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 167/336 (49%), Positives = 231/336 (68%), Gaps = 6/336 (1%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE++P+ + R Q+++Y  L++ +++D ILR+  + F +NID+T YE G+R+T + +GRA 
Sbjct: 114 WEQKPMLLRRHQRDFYQGLMSCEQLDRILRERSLFFTENIDLTTYEGGKRETHNPDGRAH 173

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
             +VWD +++GCS+R LNPQTY + + +L + LQE FG FVGAN Y TP  +QGFAPHYD
Sbjct: 174 AAVVWDAFQKGCSVRLLNPQTYSRAVWRLCSMLQEFFGSFVGANLYLTPAGSQGFAPHYD 233

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++QLEG+K W++Y PR   E LPR+SS NF  +E+G PIL   LEPGDLLY PRG
Sbjct: 234 DIEAFVVQLEGRKCWRLYAPRDPSEELPRFSSGNFGPDEVGNPILEAVLEPGDLLYFPRG 293

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQA T  + HSLH+T+S  Q+  W DL EK +P+A+Q A   D+E+R+ LP GYL Y 
Sbjct: 294 IIHQAYTPEDVHSLHLTLSTCQRNTWGDLFEKMIPRAVQLAIEEDVEYRKSLPRGYLNYM 353

Query: 243 GLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
           G+A     ++ +D R A  E +  L+ KLV Y  +D  VD+ GK  +H+ALPP+LS EE 
Sbjct: 354 GIANS---EVASDAREAFLEKVSHLVRKLVNYCPVDAAVDQCGKAHLHEALPPLLSQEER 410

Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANA 337
             SV  +G R  Q G+V +  ++  DT VRL R NA
Sbjct: 411 SRSV-HSGERW-QYGQVVDVQELDPDTHVRLARRNA 444


>gi|241642121|ref|XP_002409388.1| nucleolar protein, putative [Ixodes scapularis]
 gi|215501349|gb|EEC10843.1| nucleolar protein, putative [Ixodes scapularis]
          Length = 457

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 166/337 (49%), Positives = 226/337 (67%), Gaps = 4/337 (1%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE++PL + R Q  +Y +L++ +++D ILR+  + F +N+D+T Y  G R+T +  GRA 
Sbjct: 35  WEQKPLLVKRHQPGFYGDLMSCQQLDRILRERQLYFTENVDLTTYRGGVRETHNPPGRAH 94

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
             +VWD +++GCS+R LNPQTY + L +L A+LQE FG  VGAN Y TP  +QGFAPHYD
Sbjct: 95  TAVVWDSFQQGCSVRLLNPQTYSRQLWRLCATLQEFFGSMVGANLYLTPAGSQGFAPHYD 154

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR   E LPR+SS NF++EE+G P+LT  LEPGDLLY PRG
Sbjct: 155 DIEAFVLQLEGQKLWRVYAPRDPSEELPRFSSGNFTKEEVGEPLLTTLLEPGDLLYFPRG 214

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA T    HSLH+T+S  Q+  W DLLEKA+P ALQ A   D++FRR LP  Y    
Sbjct: 215 FIHQACTGETSHSLHLTLSTCQRNTWGDLLEKALPTALQLALEEDVDFRRSLPRDYPSVM 274

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+A     + +  R A    +  L  +L +++ +D   D+M K  +H+ALPP+LS +E +
Sbjct: 275 GVANADLANPR--REAFLAKMHSLFERLWDFLPVDAAADQMVKAHLHEALPPLLSQQERE 332

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           CSV  +G      G V +  ++  DT VRLVR NAVR
Sbjct: 333 CSV--HGGERWHEGRVVDVQELEPDTAVRLVRRNAVR 367


>gi|390349142|ref|XP_782527.3| PREDICTED: lysine-specific demethylase NO66-like
           [Strongylocentrotus purpuratus]
          Length = 639

 Score =  340 bits (872), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 161/325 (49%), Positives = 220/325 (67%), Gaps = 3/325 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           +E++PL++ R +  Y+ ++ +SK++ +IL++N ++F +NID+T Y DG+R+T +  GRA 
Sbjct: 199 FERKPLFLKRHKPGYFTDIFSSKELSNILKENDVQFTRNIDVTTYTDGKRETHNPTGRAQ 258

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           PQ+VWD+Y  GCS+R LNPQTY   + QL A+LQE FGCFVGAN Y TPP  QGFAPHYD
Sbjct: 259 PQVVWDYYNNGCSVRVLNPQTYSTRVWQLLAALQEFFGCFVGANIYLTPPGTQGFAPHYD 318

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEGKK WK+Y  R   E LPR+SS NF+  +IG PIL  TLEPGDLLY PRG
Sbjct: 319 DIEAFVLQLEGKKHWKLYNQRSPAEVLPRFSSSNFTDADIGQPILDTTLEPGDLLYFPRG 378

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQAST +  HSLH+TIS  QK  W DL+E  + +AL      +++FR+ LP  YL Y 
Sbjct: 379 VIHQASTPSETHSLHITISACQKNTWGDLMEHVLTRALHVVREENIDFRKSLPTDYLNYM 438

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+A  + +D +  +  MK+ + DL +KL++   +D   D+M  +  H +LPPVL   E  
Sbjct: 439 GVAH-EDLDDERRQPFMKK-IGDLFSKLLKSAPIDAAADQMSLEFFHTSLPPVLEEAEES 496

Query: 303 CSVFENGLRMSQTGEVYNATQITKD 327
           CSVF       + G+V    Q+ +D
Sbjct: 497 CSVFGESASC-RNGKVSGIVQLGED 520


>gi|432944965|ref|XP_004083470.1| PREDICTED: lysine-specific demethylase NO66-like [Oryzias latipes]
          Length = 544

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 161/337 (47%), Positives = 214/337 (63%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK+P+Y+ R+  +YY  L ++ + D ILR+  ++F  N+D+T Y +G+R+T +  GRAL
Sbjct: 127 WEKKPIYVQRRNPDYYKGLFSTAEFDRILREENVQFGVNLDVTSYTNGKRETHNPPGRAL 186

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  VWD Y  GCS+R LNPQ +   +  + + LQE FG   GAN Y TPP  QGFAPHYD
Sbjct: 187 PFTVWDFYESGCSLRLLNPQAFSHTVWNVLSVLQEQFGSMAGANVYLTPPGTQGFAPHYD 246

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++QLEGKK W+VY PR  +E LP  SSPNF Q +IG PIL V LE GDLLY PRG
Sbjct: 247 DIEAFVVQLEGKKHWRVYSPRSENEVLPVLSSPNFDQSDIGKPILDVVLEAGDLLYFPRG 306

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQ   + + HSLH+TIS +QK +W DL++K +  AL+ A   D+EFRRGLP+ YL Y 
Sbjct: 307 FIHQGKCLPDAHSLHITISSFQKNSWGDLMQKVVTAALEIAMEEDVEFRRGLPLDYLTYM 366

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+      D +  R+     ++ L+ KL  Y  +D  VD+  K  +HD LPPVL+P+E  
Sbjct: 367 GVQNSDKEDPR--RVKFLSQMEKLMKKLPNYAPVDAAVDQKAKDFLHDCLPPVLTPDEFA 424

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV    +R         A  IT  T +RL+RA   R
Sbjct: 425 TSVQGASVRWVNGRVKGTAAHITTQTLIRLLRAGCAR 461


>gi|198416710|ref|XP_002119787.1| PREDICTED: similar to Nucleolar protein 66 (hsNO66), partial [Ciona
           intestinalis]
          Length = 594

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 162/341 (47%), Positives = 224/341 (65%), Gaps = 5/341 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D WE  PL + R    Y + L ++K+++ IL +  + +  N+D+T Y++G+R+T +++GR
Sbjct: 164 DIWESRPLLVLRHCPRYADGLFSTKEMNRILNECNVRYSVNLDVTTYQNGRRETHNIDGR 223

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A   +VWD+++ GCSIR  NPQ + KP+ +L A+LQE F C VGANTY TPP  QGFAPH
Sbjct: 224 AYAPVVWDYFKNGCSIRLKNPQAFSKPVWRLCATLQEFFKCMVGANTYLTPPGTQGFAPH 283

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF+LQLEGKK+W +Y PR   E LPRYSS NF+ +EIG  I T TLE G+LLY P
Sbjct: 284 YDDIEAFVLQLEGKKEWTLYSPRSGKETLPRYSSGNFTADEIGDEIFTQTLEAGNLLYFP 343

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RGYIHQA  + + HSLHVTIS+YQ+ +W DLLEK +P  LQ A   D+EFR+GLP+ YL 
Sbjct: 344 RGYIHQAKALPDTHSLHVTISMYQRNSWGDLLEKLLPTTLQNAIIDDVEFRKGLPLDYLS 403

Query: 241 YAGLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
             G    +  D   D R A    + DL  +L + VD+D   DE G   +HDA+PP ++ +
Sbjct: 404 LFG---EQNCDKHPDRRRAFMGKISDLFRRLADKVDIDAAADEHGINFLHDAMPPYMTQD 460

Query: 300 ELQCSVFENGLRMSQTGE-VYNATQITKDTKVRLVRANAVR 339
           E   S+    + M+  G   +    I + T++R++R NA+R
Sbjct: 461 EKSRSIHGCDVGMTSPGTPTHYICDIDEKTRIRVLRKNALR 501


>gi|405958337|gb|EKC24473.1| Nucleolar protein 66 [Crassostrea gigas]
          Length = 453

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 157/338 (46%), Positives = 223/338 (65%), Gaps = 3/338 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE++PL + R    Y +   ++ ++D ILR+  I+F  N+D+T + +GQR+T +  GRA 
Sbjct: 29  WERKPLLVQRHMAQYNDGWFSTAELDKILREENIQFSTNLDVTTFTNGQRETHNPVGRAF 88

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P +VWD Y+ GCS+R LNPQTY + + +L + LQE F C VGAN Y TPP  QGFAPHYD
Sbjct: 89  PSVVWDFYQNGCSVRLLNPQTYSRNVWKLLSVLQEYFNCCVGANVYLTPPGTQGFAPHYD 148

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAFILQLEG+K W++Y PR  +E LPR SS NFS++++G PIL   L+PGDLLY PRG
Sbjct: 149 DIEAFILQLEGRKHWRLYSPRTDNEVLPRLSSGNFSEKDLGKPILDTVLDPGDLLYFPRG 208

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQ + + + HSLH+T+S YQK  W DL EK +P+ALQ A   D+EFRRGLP  Y  Y 
Sbjct: 209 TIHQGNCMEDTHSLHITVSCYQKNTWGDLFEKMVPRALQIAIDEDVEFRRGLPREYSSYM 268

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+A    +D     L +K+ ++ L+ ++++++ +D   D+M K  +HD+LPPVLS  E  
Sbjct: 269 GIANSD-MDGAPRNLFLKK-VEQLMVRMIQHLPVDSACDQMAKSFIHDSLPPVLSDAEKA 326

Query: 303 CSVFENGLRMSQTGE-VYNATQITKDTKVRLVRANAVR 339
            SV  +G    +    V    ++  DT ++++R   +R
Sbjct: 327 FSVHGSGEHWDKEKMCVVGTAEMEPDTTIKIIRKGILR 364


>gi|47224570|emb|CAG03554.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 454

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 163/339 (48%), Positives = 220/339 (64%), Gaps = 6/339 (1%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK+P+ I RK  +YY  L ++ + D ILR+  +++  N+D+T Y +G+R+T +  GRAL
Sbjct: 39  WEKKPILIQRKNPDYYKGLFSTAEFDRILREEDVQYGVNLDVTSYTNGRRETHNPPGRAL 98

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  VW  Y+ GCS+R LNPQ +   +  + + LQE FG   GAN Y TPP  QGFAPHYD
Sbjct: 99  PYNVWGFYKSGCSLRLLNPQAFSSTVWNILSILQENFGSMAGANIYLTPPGTQGFAPHYD 158

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++QLEGKK W+VY PR+ DE LP +SSPNF++ +IG PIL V LE GDLLY PRG
Sbjct: 159 DIEAFVVQLEGKKHWRVYNPRVEDEVLPVFSSPNFNEADIGKPILEVVLEAGDLLYFPRG 218

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQ   + + HSLH+TIS YQK +W DLL++ +P AL+ A   D EFRRGLP+ YL Y 
Sbjct: 219 FIHQGDCLPDAHSLHITISSYQKNSWGDLLQRMLPAALEIAVEEDTEFRRGLPLDYLTYM 278

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+      D +  R      ++ L+ KL  Y  +D  VD+  +  +HD LPP+L+P EL 
Sbjct: 279 GVQNSDKDDPR--RQQFFSQIETLMKKLTTYCPVDAAVDQKARDFLHDCLPPMLTPVELA 336

Query: 303 CSVFENGLRMS-QTGEVYNAT-QITKDTKVRLVRANAVR 339
            SV   G R   + G+V + +  I   T+VRL+RA   R
Sbjct: 337 SSV--KGARTRWEGGQVTDESLDINTQTRVRLLRAGCAR 373


>gi|348517751|ref|XP_003446396.1| PREDICTED: lysine-specific demethylase NO66-like [Oreochromis
           niloticus]
          Length = 541

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 157/338 (46%), Positives = 217/338 (64%), Gaps = 4/338 (1%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK+P+++ RK  +YY  L ++ ++D I R+  +++  N+D+T Y +G+R+T +  GR L
Sbjct: 126 WEKKPIFVQRKNPDYYKGLFSTAELDRIFREEDVQYGVNLDVTSYTNGKRETHNPPGRVL 185

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   WD Y  GCS+R LNPQ +   +  + + LQE FG   GAN Y TPP  QGFAPHYD
Sbjct: 186 PYTAWDFYESGCSLRMLNPQAFSSTVWNVLSILQEQFGSMAGANIYLTPPGTQGFAPHYD 245

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++QLEGKK+W+VY PR  DE LP  SSPNF Q +IG PIL V +E GDL+Y PRG
Sbjct: 246 DIEAFVVQLEGKKRWRVYSPRSDDEVLPVLSSPNFDQADIGKPILDVVVEAGDLIYFPRG 305

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQ + + + HSLH+TIS YQK +W DLL+K +P AL+ A   D+EFRRGLP+ YL Y 
Sbjct: 306 FIHQGNCLPDAHSLHITISSYQKNSWGDLLQKLVPAALEVAMEEDVEFRRGLPLDYLTYM 365

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+      D +  R      + +L+ +L  Y  +D  VD+  K  +HD LPP L+P+EL 
Sbjct: 366 GVQNSDKEDPR--RAKFFSRIDNLMKRLTNYAPVDAAVDQKAKDFIHDCLPPALTPDELV 423

Query: 303 CSVFENGLRMSQTGEVYN-ATQITKDTKVRLVRANAVR 339
            SV     R  + G+V +    I   T++R++RA   R
Sbjct: 424 TSVQGAPTRW-EGGQVKDVGVHIITQTQIRILRAQCAR 460


>gi|242006700|ref|XP_002424185.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507526|gb|EEB11447.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 467

 Score =  329 bits (844), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 169/344 (49%), Positives = 218/344 (63%), Gaps = 12/344 (3%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           + WEK+PLYISR    YYN L +    +  L +  + F KNID+T Y DGQR T +++G+
Sbjct: 39  NFWEKKPLYISRNNNEYYNELCSMNAFEKALSEKDMYFTKNIDVTSYIDGQRFTENLDGK 98

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A    +WD + EG SIR LNPQT+I  +  LN +LQE F CFVGAN Y TP   QGFAPH
Sbjct: 99  ATVSNIWDFFNEGKSIRLLNPQTFIPNVWLLNTNLQEFFNCFVGANMYLTPAGTQGFAPH 158

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF+LQLEG+K WKVY PR   E L R SS NF ++EIG PIL VTL+PGD+LY P
Sbjct: 159 YDDIEAFVLQLEGQKHWKVYNPRDSSEVLARESSKNFKEDEIGKPILKVTLKPGDMLYFP 218

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RGYIHQA  + + HSLH+T+S YQK +W DL EK  P ALQ A   D+EFR+GLPIGYL 
Sbjct: 219 RGYIHQAKCLPDTHSLHLTVSCYQKNSWADLFEKIFPVALQKAIFEDVEFRKGLPIGYLN 278

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           + G+   +  +    +   K N  +L  K++E   +D GVD++    +HDALPP L+ +E
Sbjct: 279 HMGMMFSEENNEFRQKFIQKFN--ELGRKVLEMCPIDAGVDQLSTNFIHDALPPCLTEDE 336

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTK-----VRLVRANAVR 339
                   G R  +  +  N   I K+T      V+L+RA + R
Sbjct: 337 ----KLYGGDRNVKLPDK-NFLHILKNTSNTNLGVKLIRATSAR 375


>gi|443692249|gb|ELT93889.1| hypothetical protein CAPTEDRAFT_153587 [Capitella teleta]
          Length = 698

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 162/359 (45%), Positives = 228/359 (63%), Gaps = 24/359 (6%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRD---------------NVIEFKKNIDITLY 47
           WEK+PL + R+  NY N   +++++D ILR+                 + F +NID+  Y
Sbjct: 246 WEKKPLMVKRRNPNYNNGWFSTQELDRILREVGAFIFEAGFYGISQEHLNFTENIDVVTY 305

Query: 48  EDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT 107
           E+G+R+T +  GRA   ++WDH++ GCS+R LNPQ+Y + + +  + LQE FGCF GAN 
Sbjct: 306 ENGKRETHNPVGRAYAPVLWDHFQNGCSVRLLNPQSYSRNVWKYLSVLQEYFGCFAGANV 365

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
           Y TPP  QGFAPHYDDIEAFILQLEGKK+W++Y PR   E L R+SS NFS+++IG PI 
Sbjct: 366 YLTPPGTQGFAPHYDDIEAFILQLEGKKRWRLYDPRNDAEKLARFSSGNFSEDDIGNPIF 425

Query: 168 TVTLEPGDLLYLPRGYIHQA------STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
            V LE GDLLY PRG IHQA       T+ ++HSLH+T+SV Q  +W DLL+K +P+AL 
Sbjct: 426 DVILEAGDLLYFPRGTIHQACHFASGCTLEDDHSLHITVSVSQLNSWGDLLQKVLPRALD 485

Query: 222 AAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
            A   D+EFR+ LP  YLRY G+A      +  +R      ++ L+ KL+ Y  +D GVD
Sbjct: 486 IAIDEDVEFRQSLPRDYLRYMGVAFSD--QVCPERHQFMRRIEQLMMKLLSYAPVDAGVD 543

Query: 282 EMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGE-VYNATQITKDTKVRLVRANAVR 339
           +MG + + D+LPPVL+  E  CS+  NG R     + +    ++  DT+V+L+R   +R
Sbjct: 544 QMGVKFITDSLPPVLTRTESACSIHGNGERWDDERQCIAGGVELQPDTEVKLIRQGCLR 602


>gi|410916727|ref|XP_003971838.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase
           NO66-like [Takifugu rubripes]
          Length = 553

 Score =  326 bits (836), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 157/337 (46%), Positives = 213/337 (63%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK+P+ + RK  +YY  L ++ + D ILR   +++  N+D+T Y +G R+T +  GRAL
Sbjct: 123 WEKKPILVQRKNPDYYKGLFSTAEFDRILRGEDVQYGVNLDVTSYTNGTRETHNPPGRAL 182

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  VW  Y+ GCS+R LNPQ +   +  + + LQE FG   GAN Y TPP  QGFAPHYD
Sbjct: 183 PYTVWGFYKSGCSLRLLNPQAFSATVWNVLSILQENFGSMAGANIYLTPPGTQGFAPHYD 242

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++QLEGKK W+VY PR+ DE LP  SSPNF + EIG PIL V LE GDLLY PRG
Sbjct: 243 DIEAFVVQLEGKKHWRVYKPRVDDEILPVLSSPNFDEVEIGEPILEVVLEAGDLLYFPRG 302

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQ   + + HSLH+TIS YQK +W DLL++ +P AL+ A   D+EFRRGLP+ YL Y 
Sbjct: 303 FIHQGDCLPDAHSLHITISSYQKNSWGDLLQRMLPAALEIAMEEDVEFRRGLPLDYLTYM 362

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+      D +  R      ++ L+ KL  Y  +D  VD+  +  +HD LPP+L+  E  
Sbjct: 363 GVQNSDLDDPR--RPKFLSQIETLMKKLASYCPVDAAVDQKARDFLHDCLPPMLTTAEQA 420

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV     R      + +++ I+  T++RL+RA   R
Sbjct: 421 SSVKGARTRWECDQVMDDSSDISTRTRIRLLRAGCAR 457


>gi|195397425|ref|XP_002057329.1| GJ17031 [Drosophila virilis]
 gi|284433508|sp|B4M7P8.1|NO66_DROVI RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|194147096|gb|EDW62815.1| GJ17031 [Drosophila virilis]
          Length = 907

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 224/340 (65%), Gaps = 4/340 (1%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WE+    + RKQ NY++ LI+ K ID++L  N +EF  NID+T Y++G R+T + +GRA
Sbjct: 488 YWERNACQVKRKQPNYFSQLISFKLIDEMLIRNHLEFTTNIDVTTYKNGMRETHNPDGRA 547

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           +P  VW  Y +GCSIR LNP TY+  ++QL A +QE F C VGAN Y TPP +QGFAPHY
Sbjct: 548 MPPTVWGFYSDGCSIRILNPSTYLIKVRQLCAMMQEFFHCLVGANVYLTPPNSQGFAPHY 607

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF+LQ+EG+K+W++Y P    + L R SS N+SQ E+G P+    LEPGD+LY PR
Sbjct: 608 DDIEAFVLQVEGRKRWRLYSPLHPSDVLARNSSGNYSQAELGEPLFDAVLEPGDILYFPR 667

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G +HQA     +HSLH+T+SVYQ+ A+ +LLE+ MP  LQ A    L  RRGLP+   ++
Sbjct: 668 GTVHQAVCDQQQHSLHITLSVYQQQAYANLLEELMPAVLQRAIKHHLSLRRGLPLHIWQH 727

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD--LDDGVDEMGKQLMHDALPPVLSPE 299
            GLA+G       D L    N K L+ + +   D  +D  VD++ K+  H+ALPPV+ PE
Sbjct: 728 LGLAKGDQKSELRDELLG--NTKRLVQQYLMPSDAQIDAAVDQLAKRFQHEALPPVVLPE 785

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           E + +VF +  + +  G+     ++T+ T VRL+RAN +R
Sbjct: 786 EHERTVFGSRSQANSHGQCLCDYELTERTSVRLLRANILR 825


>gi|449274682|gb|EMC83760.1| Nucleolar protein 66, partial [Columba livia]
          Length = 398

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 157/316 (49%), Positives = 204/316 (64%), Gaps = 2/316 (0%)

Query: 24  SKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQT 83
           +   D  LR   + F  ++D+T Y DG R+T +  GRALP +VWD Y+ GCS+R L+PQT
Sbjct: 1   TADFDAALRSGEVHFGTHLDVTSYADGVRETHNPSGRALPAVVWDFYQNGCSLRLLSPQT 60

Query: 84  YIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR 143
           +   + Q  + LQE FG   GANTY TPP  QGFAPHYDDIEAF+LQLEGKK W+VY PR
Sbjct: 61  FSPTVWQFLSVLQEHFGSMAGANTYLTPPGTQGFAPHYDDIEAFVLQLEGKKHWRVYRPR 120

Query: 144 MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
              E LP++SS N +Q E+G P+L   LE GDLLY PRG+IHQ   + + HSLH+T+S Y
Sbjct: 121 TDAEVLPQFSSANLTQAELGEPMLETVLEAGDLLYFPRGFIHQGDCLPDAHSLHITVSSY 180

Query: 204 QKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENL 263
           Q+ +W DLLEK +P ALQ A   D+E+R+GLP+ Y  Y G+A  + VD  A R A  E +
Sbjct: 181 QRNSWGDLLEKLLPAALQMALEEDVEYRQGLPMDYFGYMGVANSEAVD--ARRTAFMEKV 238

Query: 264 KDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQ 323
           + L+ KLV Y  +D  VD+  K  +HD LPPVL+  E   SV+    R    G      +
Sbjct: 239 QSLIKKLVNYAPIDAAVDQRAKSFLHDCLPPVLTESEKAQSVYGFPARWQDGGPHDVDIR 298

Query: 324 ITKDTKVRLVRANAVR 339
           ITKDT+VRL+R   +R
Sbjct: 299 ITKDTEVRLLRHGIIR 314


>gi|195163311|ref|XP_002022494.1| GL12933 [Drosophila persimilis]
 gi|284433504|sp|B4GUZ2.1|NO66_DROPE RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|194104486|gb|EDW26529.1| GL12933 [Drosophila persimilis]
          Length = 687

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 155/340 (45%), Positives = 223/340 (65%), Gaps = 6/340 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           DHWEK P  +       ++NLI+ K ID +L  N +E+  NID+T YEDG R+T++ +GR
Sbjct: 268 DHWEKSPFRVKTTTSGGFSNLISFKMIDQMLIQNHVEYTTNIDVTSYEDGVRKTLNPDGR 327

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           ALP  VW HY+ GCSIR LNP +Y+  L+QL   LQE F C VGAN Y TPP +QGFAPH
Sbjct: 328 ALPPSVWAHYQRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTPPESQGFAPH 387

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF+LQ+EGKK+W++Y P    + LPR SS N SQ E+G PI+ + L+PGDLLY P
Sbjct: 388 YDDIEAFVLQVEGKKRWRIYAP---TKELPRESSGNLSQTELGDPIMDIVLKPGDLLYFP 444

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG+IHQA T  + HSLH+T+S YQ+ ++ +L+EK MP  ++ +    L+ R+GLP+   +
Sbjct: 445 RGWIHQAITEKDSHSLHITLSAYQQQSYANLMEKLMPLVVKESVEQTLKLRKGLPLDIFQ 504

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDDGVDEMGKQLMHDALPPVLSPE 299
             G+A  +     A R  + +++++L  +LV     +D  +D++  +  H+ALPP ++P+
Sbjct: 505 NLGVANAEWKG--AHRQKLIQHIQNLAQRLVPTEGQIDRALDQLAIKFQHEALPPTIAPQ 562

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           EL+ +VF       + G      ++ + T VRL+RAN VR
Sbjct: 563 ELKRTVFGAQATADRNGHCSLDYELAEGTAVRLLRANIVR 602


>gi|194888169|ref|XP_001976869.1| GG18702 [Drosophila erecta]
 gi|284433501|sp|B3NU20.1|NO66_DROER RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|190648518|gb|EDV45796.1| GG18702 [Drosophila erecta]
          Length = 657

 Score =  320 bits (819), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 156/341 (45%), Positives = 218/341 (63%), Gaps = 4/341 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D WE     + RK  NYY+ L++ K IDDIL  N +EF  N+D+T+Y +G+R+T++ EGR
Sbjct: 236 DFWENNACVVQRKNPNYYSKLMSFKMIDDILMRNRVEFGTNLDVTIYRNGKRETLNPEGR 295

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A P +VWD Y EGCSIR LNP TY+  L+Q+ + +QE F C VGAN Y TPP +QGFAPH
Sbjct: 296 AFPSVVWDFYAEGCSIRILNPSTYLLGLRQVCSIMQEFFHCLVGANVYLTPPNSQGFAPH 355

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF++Q+EG+K+W++Y P    + L R SS N+ Q+++G PI+   LE GDLLY P
Sbjct: 356 YDDIEAFVIQVEGRKRWRLYEPPEKADQLSRTSSGNYDQKQLGEPIIDEVLEAGDLLYFP 415

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG +HQA T     SLH+T+SVYQ+ A+ +LLE  MP  L+ A    +  RRGLP+    
Sbjct: 416 RGTVHQAITEKGHFSLHITLSVYQQQAYANLLETLMPIVLKKAVKKSVALRRGLPLHTFH 475

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
             G  +        D+L   EN++ L++K  +    D+D+ VD++ K+  H+ALPP++ P
Sbjct: 476 VLGEVQRTNRCESRDQLV--ENVQKLVSKYLMPSTQDIDEAVDQLAKKFQHEALPPIILP 533

Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           EE   +VF +     Q G      +   DT VRL+RAN +R
Sbjct: 534 EEKVRTVFGSRSISDQEGNSVCDYEFDTDTSVRLLRANILR 574


>gi|312374032|gb|EFR21685.1| hypothetical protein AND_16621 [Anopheles darlingi]
          Length = 646

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 148/308 (48%), Positives = 205/308 (66%), Gaps = 2/308 (0%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WE++PL + R  ++YY  L++   ID++LR N IE+ KNIDIT Y  GQR+T + EGR 
Sbjct: 187 YWERKPLLVQRNDRHYYTGLLSRAMIDEMLRTNNIEYTKNIDITSYVGGQRETHNPEGRV 246

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           LP  +W +Y EGCS+R LNPQTY++ + +LN  LQE F C  GAN Y TPP +QGFAPHY
Sbjct: 247 LPPEMWQYYGEGCSVRMLNPQTYLRTIYELNVKLQEYFHCMTGANFYLTPPNSQGFAPHY 306

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF+LQ+EG+K+W++Y PR   E L R SS N +++E+  P++ V LE GDLLY PR
Sbjct: 307 DDIEAFVLQIEGRKRWRLYPPRRRQEILARVSSENIAEDELAAPLIDVVLEAGDLLYFPR 366

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G IHQA+TV   HSLHVT+SVYQ+ +W DL E+ +P AL  A   D +FR GLP    ++
Sbjct: 367 GCIHQAATVPGAHSLHVTLSVYQRNSWADLFEQMLPIALATAAEEDQQFRAGLPFDLHQH 426

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
            G+     V  +  RL ++  L+ +  +L     +D  VD++ K+  HDALPP++   E 
Sbjct: 427 FGIVHSDEVSSERRRLTVR--LRAMFDRLFSDSAIDAAVDQLAKRFQHDALPPMVDFTEW 484

Query: 302 QCSVFENG 309
             +V+  G
Sbjct: 485 DDTVYGRG 492


>gi|198477587|ref|XP_002136543.1| GA27865, partial [Drosophila pseudoobscura pseudoobscura]
 gi|198142830|gb|EDY71543.1| GA27865, partial [Drosophila pseudoobscura pseudoobscura]
          Length = 494

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 223/340 (65%), Gaps = 6/340 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           DHWEK P  +       ++NLI+ K ID +L  N +E+  NID+T YEDG R+T++ +GR
Sbjct: 75  DHWEKSPFRVITTTSGGFSNLISFKMIDKMLIQNHVEYTTNIDVTSYEDGVRKTLNPDGR 134

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           ALP  VW HY+ GCSIR LNP +Y+  L+QL   LQE F C VGAN Y TPP +QGFAPH
Sbjct: 135 ALPPSVWAHYQRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTPPESQGFAPH 194

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF+LQ+EGKK+W++Y P    + LPR SS N SQ E+G PI+ + L+PGDLLY P
Sbjct: 195 YDDIEAFVLQVEGKKRWRIYAP---TKELPRESSGNLSQTELGDPIMDIVLKPGDLLYFP 251

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG+IHQA T  + HSLH+T+S YQ+ ++ +L+EK MP  ++ +    L+ R+GLP+   +
Sbjct: 252 RGWIHQAITEKDSHSLHITLSAYQQQSYANLMEKLMPLVVKESVEQTLKLRKGLPLDIFQ 311

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDDGVDEMGKQLMHDALPPVLSPE 299
             G+A  +   +   +L   +++++L  +LV     +D  +D++  +  H+ALPP ++P+
Sbjct: 312 NLGVANAEWNGVHRQKLI--QHIQNLAQRLVPTEGQIDRALDQLAIKFQHEALPPTIAPQ 369

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           EL+ +VF       + G      ++ + T VRL+RAN VR
Sbjct: 370 ELKRTVFGAQATADRNGHCSLDYELAEGTAVRLLRANIVR 409


>gi|195477158|ref|XP_002100112.1| GE16340 [Drosophila yakuba]
 gi|284433510|sp|B4Q068.1|NO66_DROYA RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|194187636|gb|EDX01220.1| GE16340 [Drosophila yakuba]
          Length = 683

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 155/341 (45%), Positives = 221/341 (64%), Gaps = 4/341 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D WE     + RK  +YY+ LI+ K ID++L  + ++F  N+D+T Y++G+R+T++ EGR
Sbjct: 262 DFWEHTAFVVQRKNPHYYSKLISFKMIDEMLVRHRLDFTINVDVTTYKNGKRETLNPEGR 321

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           ALP +VW  Y EGCSIR LNP TY+  L+Q+ + +QE F C VGAN Y TPP +QGFAPH
Sbjct: 322 ALPPVVWGLYSEGCSIRILNPSTYLVGLRQVCSIMQEFFHCLVGANVYLTPPNSQGFAPH 381

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF++Q+EG+K+W++Y P    + L R SS NF QE++G PIL   LE GDLLY P
Sbjct: 382 YDDIEAFVIQVEGRKRWRLYEPPSGSDQLCRNSSSNFDQEQLGEPILDEVLEAGDLLYFP 441

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG +HQA T   +HSLH+T+SVYQ+ A+++LLEK MP  L+ A    +  RRGLP+    
Sbjct: 442 RGTVHQAITEEEQHSLHITLSVYQQQAYVNLLEKLMPIVLKKAIKQSVALRRGLPLHTFH 501

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
             G A+    +    R  + EN++ L+ K  +    D+D+ VD++ K+  H+ALPP++ P
Sbjct: 502 VLGEAQR--ANRSDSRNQLVENVQKLVTKHLMPSAQDIDEAVDQLAKKFQHEALPPIILP 559

Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           EE   +VF +     + G      +    T VRL+RAN +R
Sbjct: 560 EEQVRTVFGSRSTADEQGNAICDYEFDTKTSVRLLRANILR 600


>gi|358334326|dbj|GAA52775.1| lysine-specific demethylase NO66 [Clonorchis sinensis]
          Length = 852

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 168/381 (44%), Positives = 225/381 (59%), Gaps = 44/381 (11%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           ++EK+PL I RK  ++ N  +T+ +ID +LR   + F +++D+  YEDG+R T++  G+A
Sbjct: 246 YFEKQPLLIQRK-GSFENQWLTTYEIDTLLRQERVMFTEHLDLAAYEDGRRYTLNPPGQA 304

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
            P +VWDHY  GCS+R L PQ + + +++  + LQE FGCF+GAN Y TPP +QGFAPHY
Sbjct: 305 FPSVVWDHYNTGCSVRLLCPQVFFRNIRERLSLLQEYFGCFIGANVYLTPPGSQGFAPHY 364

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF+LQLEG K+W+VY PR   E L    SPNFSQEEIG PILT TL+PGD+LY PR
Sbjct: 365 DDIEAFVLQLEGTKRWRVYAPRDESEKLACDPSPNFSQEEIGEPILTATLKPGDVLYFPR 424

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           GYIHQA T  + HSLH+TIS YQK +W D L K +P ALQ+A  T +EFR+GLP+G+LR+
Sbjct: 425 GYIHQAETSEDAHSLHLTISTYQKHSWGDFLSKLLPIALQSAVETSVEFRKGLPVGFLRH 484

Query: 242 AGLARGKPVDIQADRLAMKENL-----KDLLAKLVEYVD--------------------- 275
            G         + D+L     +      +++A+L    D                     
Sbjct: 485 VGGFSQSISSKEKDKLGEATKVVAHVRAEVVARLRMLADQLERQASTGQPEPSSAPCQSL 544

Query: 276 -----LDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRM------------SQTGEV 318
                L    D     L+  +LPP L  EEL C V   G R             S+   V
Sbjct: 545 TAPDPLLTAADLFATDLIAQSLPPQLRDEELMCHVRSQGERWSLCTAAEDGSGGSKKPGV 604

Query: 319 YNATQITKDTKVRLVRANAVR 339
            +  ++  DTKVRLVR +AVR
Sbjct: 605 VDLVELEPDTKVRLVRWSAVR 625


>gi|347966833|ref|XP_321123.4| AGAP001938-PA [Anopheles gambiae str. PEST]
 gi|333469879|gb|EAA01582.4| AGAP001938-PA [Anopheles gambiae str. PEST]
          Length = 716

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 150/309 (48%), Positives = 202/309 (65%), Gaps = 3/309 (0%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HWE++PL + R  + YY  L++   ID++LR N IE+ KNIDIT Y DGQR+T + +GR 
Sbjct: 289 HWERKPLLVQRSDRAYYAGLLSRAMIDEMLRTNNIEYTKNIDITSYRDGQRETHNPDGRV 348

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           LP  +W  Y  GCS+R LNPQTY++ + +LN  LQE F C  GAN Y TPP +QGFAPHY
Sbjct: 349 LPPDLWHFYDAGCSVRMLNPQTYLRSVYELNVKLQEYFHCMTGANFYLTPPGSQGFAPHY 408

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-GTPILTVTLEPGDLLYLP 180
           DDIEAF+LQ+EG+K+W++Y PR   E L R SS N  Q+E+ G PI+ V LE GDLLY P
Sbjct: 409 DDIEAFVLQIEGRKRWRLYGPRQPQEVLARVSSENLRQDELAGPPIVDVVLEAGDLLYFP 468

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQA+TV   HSLH+T+SVYQ+  W DL E+ +P AL  A  T+ + R GLP+   +
Sbjct: 469 RGCIHQAATVPGAHSLHITVSVYQRNCWADLFEQMLPIALATAADTEPQLREGLPLDLHQ 528

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           + G+       I ADR  +   ++ +  KL     +D  VD++ K+  HDALPP++   E
Sbjct: 529 HFGIVHSD--GISADRRRLIVRIRAMFDKLFSDAAIDAAVDQLAKRFQHDALPPLVDFGE 586

Query: 301 LQCSVFENG 309
              +V+  G
Sbjct: 587 WDDTVYGGG 595


>gi|198477583|ref|XP_002136541.1| GA27866 [Drosophila pseudoobscura pseudoobscura]
 gi|284433505|sp|B5DUH6.1|NO66_DROPS RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|198142828|gb|EDY71541.1| GA27866 [Drosophila pseudoobscura pseudoobscura]
          Length = 946

 Score =  316 bits (810), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 153/340 (45%), Positives = 223/340 (65%), Gaps = 6/340 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           DHWEK P  +       ++NLI+ K ID +L  N +E+  NID+T YEDG R+T++ +GR
Sbjct: 527 DHWEKSPFRVITTTSGGFSNLISFKMIDKMLIQNHVEYTTNIDVTSYEDGVRKTLNPDGR 586

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           ALP  VW HY+ GCSIR LNP +Y+  L+QL   LQE F C VGAN Y TPP +QGFAPH
Sbjct: 587 ALPPSVWAHYQRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTPPESQGFAPH 646

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF+LQ+EGKK+W++Y P    + LPR SS N SQ E+G PI+ + L PGDLLY P
Sbjct: 647 YDDIEAFVLQVEGKKRWRIYAPT---KELPRESSGNLSQTELGDPIMDIVLMPGDLLYFP 703

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG+IHQA T  + HSLH+T+S YQ+ ++ +L+EK MP  ++ +    L+ R+GLP+   +
Sbjct: 704 RGWIHQAITEKDSHSLHITLSAYQQQSYANLMEKLMPLVVKESVEQTLKLRKGLPLDIFQ 763

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDDGVDEMGKQLMHDALPPVLSPE 299
             G+A  +   +   +L   +++++L  +L+     +D  +D++  +  H+ALPP ++P+
Sbjct: 764 NLGVANAEWNGVHRQKLI--QHIQNLAQRLMPTEGQIDRALDQLAIKFQHEALPPTIAPQ 821

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           EL+ +V+       +TG      ++ + T VRL+RAN VR
Sbjct: 822 ELKRTVYGAQATADKTGHCSLDYELAEGTAVRLLRANIVR 861


>gi|340370610|ref|XP_003383839.1| PREDICTED: lysine-specific demethylase NO66-like [Amphimedon
           queenslandica]
          Length = 559

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 154/337 (45%), Positives = 211/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE++P+ I R   +Y +   +S+ +D ILR+N IE   NI IT Y DG+++     GRA 
Sbjct: 162 WEQKPVMIPRNVPHYNDGWFSSRDLDRILRENHIEHLVNIRITSYADGEKEVEQSSGRAH 221

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
             +VWD + +G SI+FLNPQTY +P+  L + LQE F  FVGAN Y TP   QGFAPHYD
Sbjct: 222 APMVWDRFSDGYSIQFLNPQTYSRPMWLLTSFLQEHFSSFVGANIYLTPAGTQGFAPHYD 281

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++Q+EGKK+W++Y PR   E LPR SS +  +EEIG PIL + L PGDLLY PRG
Sbjct: 282 DIEAFVVQVEGKKRWRLYKPRNRYEILPRTSSNDLDEEEIGKPILDIILNPGDLLYFPRG 341

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQA  + + HSLH+T+S  QK  W D  E+ +P AL++A   D +FR  LP  Y  Y 
Sbjct: 342 VIHQAVALPDTHSLHITLSTAQKHTWGDFFERMLPLALKSAIEEDEDFRHSLPWDYSDYV 401

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G   G   D +  +   K +  DL +++V+Y  L    DEM    +H +LPP LS  E +
Sbjct: 402 GFVHGNSKDKRICQFFDKTH--DLFSRIVQYAPLLSAADEMAVDFLHSSLPPCLSENEKK 459

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           C ++  G  + + G+V + T IT DT +++VR  +VR
Sbjct: 460 CMIYNRGPSLEEDGKVTHQTMITLDTMIKIVRQKSVR 496


>gi|260836893|ref|XP_002613440.1| hypothetical protein BRAFLDRAFT_123918 [Branchiostoma floridae]
 gi|284433495|sp|C3XRY1.1|NO66_BRAFL RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|229298825|gb|EEN69449.1| hypothetical protein BRAFLDRAFT_123918 [Branchiostoma floridae]
          Length = 607

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 159/337 (47%), Positives = 211/337 (62%), Gaps = 25/337 (7%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK+PL + R  ++Y +   +++ +  IL +N I+F +N+D+T YE GQR+T +  GRA 
Sbjct: 211 WEKKPLLVKRHLESYNDGWFSTEDLTKILHENDIQFGRNLDVTTYEGGQRETHNPPGRAN 270

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P +VWD+Y+ GCS+R LNPQTY + + +L ++LQE F   VGAN Y TPP  QGFAPHYD
Sbjct: 271 PAVVWDYYQNGCSVRLLNPQTYSQGVWRLCSTLQEYFSSMVGANIYLTPPGTQGFAPHYD 330

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG                      NFSQEEIG  IL VTLEPGDLLY PRG
Sbjct: 331 DIEAFVLQLEG----------------------NFSQEEIGEAILDVTLEPGDLLYFPRG 368

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQAS + + HSLH+T+S  Q+  W DL+EK +P AL  A + D+EFR+ LP  YL Y 
Sbjct: 369 TIHQASALPDTHSLHITVSTCQRNTWGDLMEKLVPAALTMAFSEDVEFRQALPRDYLDYM 428

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           GLA     D +  R A  E L+ LL++LV YV +D GVD+   + M D LPPV +  E  
Sbjct: 429 GLANADLDDPR--RKAFLETLQSLLSRLVNYVPVDAGVDQKAVEFMRDCLPPVFTKNERA 486

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           CS++    R+ + G V  +  +   T V+L+R  A R
Sbjct: 487 CSIYGCRTRL-EKGRVVGSVDLKTSTPVKLIRKGAAR 522


>gi|351712535|gb|EHB15454.1| Nucleolar protein 66 [Heterocephalus glaber]
          Length = 627

 Score =  313 bits (802), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 152/337 (45%), Positives = 210/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + I R+   YY  L ++  +D ILR+  ++F +++D T Y +G+R+T++  GRAL
Sbjct: 209 WEREAVLIRRQDHAYYQGLFSTADLDSILRNEEVQFGQHLDATRYVNGRRETLNPPGRAL 268

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 269 PAAAWSLYQAGCSLRLLCPQAFSTTIWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 328

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQE++G P+L   LEPGDLLY PRG
Sbjct: 329 DIEAFVLQLEGRKLWRVYRPRVPGEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 388

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  +L Y 
Sbjct: 389 FIHQAECQNGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFLDYM 448

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 449 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 506

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 507 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 543


>gi|194764344|ref|XP_001964290.1| GF20792 [Drosophila ananassae]
 gi|284433500|sp|B3MSI4.1|NO66_DROAN RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|190619215|gb|EDV34739.1| GF20792 [Drosophila ananassae]
          Length = 843

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 154/339 (45%), Positives = 216/339 (63%), Gaps = 4/339 (1%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK    + R+   Y+  LI+ + ID++L  + +EF  NID+T Y+DG RQT++ EGRA+
Sbjct: 425 WEKNACQVQRQTPKYFAELISFEMIDEMLIRHHLEFTTNIDVTTYKDGVRQTLNPEGRAM 484

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  VW  Y +GCSIR LNP TY+  L+Q+ + LQE F C VGAN Y TPP +QGFAPHYD
Sbjct: 485 PPAVWSSYADGCSIRILNPSTYLAGLRQVCSMLQEFFHCLVGANVYLTPPNSQGFAPHYD 544

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQ+EG+K+W++Y+P    + L R+SS NF Q E+  PI    LE GD+LY PRG
Sbjct: 545 DIEAFVLQVEGRKRWRLYMPVKPTDMLARHSSGNFDQGELDEPIFDEVLEAGDVLYFPRG 604

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            +HQA T  ++HSLH+T+SVYQ+ A+ +LLE  MP  L+ A    +  RRGLP+   +  
Sbjct: 605 TVHQAITEKDQHSLHITLSVYQQQAYANLLENLMPMVLKNAVQQKMALRRGLPLHTWQNL 664

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           GLA G        RL +   ++ ++ K  L     +D  VD++ K+  H+ALPP + PEE
Sbjct: 665 GLAHGG--SEGRSRLNLISGIQQMVQKYLLPTEEQIDAAVDQLAKRFQHEALPPTILPEE 722

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
              +VF +  +    G+     +IT+ T VRL+RAN +R
Sbjct: 723 RVRTVFGSRSQTDAQGQCLCDYEITEQTSVRLLRANILR 761


>gi|153792138|ref|NP_001093172.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           NO66 [Bos taurus]
 gi|284433494|sp|A5PK74.1|NO66_BOVIN RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|148745305|gb|AAI42387.1| C10H14ORF169 protein [Bos taurus]
          Length = 667

 Score =  310 bits (795), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 148/337 (43%), Positives = 210/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+  +YY  L ++  +D ILR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 251 WEREAVLVRRQDHSYYQGLFSTAVLDSILRNEEVQFGQHLDAARYINGRRETLNPPGRAL 310

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  YR GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 311 PAAAWSLYRAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 370

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 371 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 430

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S +Q+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 431 FIHQAECQDGVHSLHLTLSTFQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 490

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 491 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 548

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 549 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 585


>gi|296482958|tpg|DAA25073.1| TPA: lysine-specific demethylase NO66 [Bos taurus]
          Length = 667

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 148/337 (43%), Positives = 210/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+  +YY  L ++  +D ILR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 251 WEREAVLVRRQDHSYYQGLFSTAVLDSILRNEEVQFGQHLDAARYINGRRETLNPPGRAL 310

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  YR GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 311 PAAAWSLYRAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 370

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 371 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 430

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S +Q+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 431 FIHQAECQDGVHSLHLTLSTFQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 490

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 491 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 548

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 549 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 585


>gi|195456912|ref|XP_002075342.1| GK15670 [Drosophila willistoni]
 gi|284433509|sp|B4NP88.1|NO66_DROWI RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|194171427|gb|EDW86328.1| GK15670 [Drosophila willistoni]
          Length = 767

 Score =  310 bits (794), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 148/345 (42%), Positives = 220/345 (63%), Gaps = 12/345 (3%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D WEK    + R    Y++ LI+ + ID ++  + +EF  NID+T Y+DG+R+T++ EGR
Sbjct: 347 DFWEKNACQVQRNAPTYFSELISFEMIDQMMLKHHLEFTTNIDVTSYKDGRRETLNPEGR 406

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A+P  VW  Y EGCSIR LNP TY+  L+ + + +QE F C VGAN Y TPP +QGFAPH
Sbjct: 407 AMPPTVWGFYGEGCSIRILNPSTYLPGLRTMCSLMQEFFHCLVGANVYLTPPNSQGFAPH 466

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +DDIEAF+LQ+EG+K+W++Y+P    + L R SS N++ +++G PI    L+PGD+LY P
Sbjct: 467 FDDIEAFVLQVEGRKRWRLYMPLQPSDVLARESSGNYTPDQLGEPIFDEVLKPGDVLYFP 526

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG +HQA T    HSLH+T+SVYQ+ A+ +LLEK MP  LQ+A    +  RRGLP+   +
Sbjct: 527 RGTVHQAITEKKHHSLHITLSVYQQQAYANLLEKLMPMVLQSAIKHSVSLRRGLPLHTWQ 586

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEY------VDLDDGVDEMGKQLMHDALPP 294
           + G+A G      A + + +  L   + ++V+         +D  VD++ K+  H+ALPP
Sbjct: 587 HLGIAHG------ATKCSSRSQLIKGIQEMVQQHLTPSENQIDAAVDQLAKRYQHEALPP 640

Query: 295 VLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            + PEE   +VF +       G+     ++T+DT +RL+RAN +R
Sbjct: 641 TILPEEKLRTVFGSRSATDAHGKCLCDYELTEDTSIRLLRANILR 685


>gi|296215462|ref|XP_002754138.1| PREDICTED: lysine-specific demethylase NO66 [Callithrix jacchus]
          Length = 631

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 148/337 (43%), Positives = 210/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 213 WEREAVLVRRQDHTYYQGLFSTADLDAMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 272

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 273 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 332

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 333 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 392

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 393 FIHQAECQNGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 452

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 453 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 510

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V   TQ+T +T+V +++    R
Sbjct: 511 LSVYGLPIRWEAGEPVNVGTQLTTETEVHMLQDGIAR 547


>gi|257743021|ref|NP_076122.2| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           NO66 [Mus musculus]
 gi|284018104|sp|Q9JJF3.2|NO66_MOUSE RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|148670807|gb|EDL02754.1| mCG4934 [Mus musculus]
 gi|187957070|gb|AAI38095.1| RIKEN cDNA 2410016O06 gene [Mus musculus]
          Length = 603

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 150/337 (44%), Positives = 208/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+ ++YY  L ++  +D +LR   ++F +++D   Y DG+R+T++  GRAL
Sbjct: 185 WEREAVLVRRQDRSYYEGLFSTADLDSMLRYEDVQFGQHLDAARYVDGRRETLNPPGRAL 244

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  YR GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 245 PAAAWSLYRAGCSLRLLCPQAFSPTVWQFLAVLQEQFGSMAGSNVYLTPPDSQGFAPHYD 304

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR   E L   SSPNFSQE++G P+L   LEPGDLLY PRG
Sbjct: 305 DIEAFVLQLEGRKLWRVYRPRDPSEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 364

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 365 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAIEENVEFRRGLPRDFMDYM 424

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 425 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 482

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV    +R      V    Q+T +T+V +++    R
Sbjct: 483 LSVHGLPVRWEAGEPVNVGAQLTTETQVHMLQDGVAR 519


>gi|66806873|ref|XP_637159.1| transcription factor jumonji, jmjC domain-containing protein
           [Dictyostelium discoideum AX4]
 gi|74853002|sp|Q54K96.1|NO66_DICDI RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66; AltName: Full=JmjC domain-containing
           protein G
 gi|60465574|gb|EAL63656.1| transcription factor jumonji, jmjC domain-containing protein
           [Dictyostelium discoideum AX4]
          Length = 514

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 154/338 (45%), Positives = 218/338 (64%), Gaps = 11/338 (3%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           ++ ++ LY+ R   N Y N  T   +D +LR+N+++F +N+D+T Y D QR T++ EGRA
Sbjct: 108 YFGQKHLYVKRNGDNIYKNFFTKDSLDKMLRNNLMKFTENVDVTNYVDFQRITLNPEGRA 167

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
            P +VW HY+EGCS+R LNPQT+   + +L ++LQ  F C VGAN Y TP  AQGFAPHY
Sbjct: 168 YPSLVWKHYKEGCSVRLLNPQTFNSNVWKLCSTLQTHFQCGVGANIYLTPAGAQGFAPHY 227

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DD++ FILQLEGKK+W++Y PR  +E LP+ SS NF+QEEIG P  TVTLE GDLLY PR
Sbjct: 228 DDVDVFILQLEGKKEWRLYKPRDANEVLPKKSSENFTQEEIGEPYFTVTLEAGDLLYFPR 287

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G IHQA + ++ HSLH+T+S Y    W DL+ K + +AL+ A    LEFR GLP  Y +Y
Sbjct: 288 GVIHQAVSPSDVHSLHITVSTYLNNTWGDLIGKVLNRALEIANEECLEFREGLPRDYTQY 347

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
            G+     V  +  R  + + +  L  KL + + +D G D+M  + + D+LPPVL+  E 
Sbjct: 348 LGVIHSDKVGDER-RKELTDKVGTLWDKLGQLLPIDIGADQMAVKYLLDSLPPVLTQLEK 406

Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           + S+ +    M          +I  +T+ RL+RA++VR
Sbjct: 407 KHSIEDETTSM----------KIKPETRFRLIRADSVR 434


>gi|126282811|ref|XP_001375764.1| PREDICTED: lysine-specific demethylase NO66-like [Monodelphis
           domestica]
          Length = 841

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 161/337 (47%), Positives = 211/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK+ + + R   +YY  L ++ ++D +LR   I+F  ++D   Y +GQR+T++  GRAL
Sbjct: 424 WEKDAVLVRRYNPDYYQGLFSTAELDSVLRREDIQFGLHLDAARYRNGQRETLNPPGRAL 483

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  YR+GCS+R L PQ +   + Q+ + LQE FG  VGAN Y TPP +QGFAPHYD
Sbjct: 484 PATAWSLYRDGCSLRLLCPQAFSAVMWQVLSVLQEHFGSMVGANAYLTPPGSQGFAPHYD 543

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEGKK W+VY PR   E LP++SSPNF  E +G P+L   LEPGDLLY PRG
Sbjct: 544 DIEAFVLQLEGKKLWRVYKPREQVEELPQFSSPNFGPEGLGKPVLQEVLEPGDLLYFPRG 603

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+TIS +Q+ +W DLLE  +P ALQAA   D+EFRRGLP  YL Y 
Sbjct: 604 FIHQAECEPGVHSLHLTISTFQRNSWGDLLEPLLPAALQAAMEEDVEFRRGLPRDYLDYM 663

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G+   +  D +  R    E L+ LL +L +Y  +D   D+  K  +HD LPPVLS +E  
Sbjct: 664 GVQHSESGDPR--RGPFLEKLQGLLVRLAQYAPVDAVADQRAKGFLHDCLPPVLSEKERA 721

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV     R           +IT +T+VRL+R    R
Sbjct: 722 LSVHGLPARWEAEEARDVGAKITTETQVRLLRHGLTR 758


>gi|195047182|ref|XP_001992288.1| GH24285 [Drosophila grimshawi]
 gi|284433502|sp|B4JMQ2.1|NO66_DROGR RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|193893129|gb|EDV91995.1| GH24285 [Drosophila grimshawi]
          Length = 723

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 149/340 (43%), Positives = 220/340 (64%), Gaps = 4/340 (1%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WE +   + RK+K+ Y++L++ + ID++L +N +EF  NID+T Y+DG RQT + +GRA
Sbjct: 301 YWESKACQVKRKRKDLYSDLVSFEMIDEMLIENHLEFTTNIDVTSYKDGVRQTHNPDGRA 360

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           +P  VW HY +GCS+R LNP TY+K L+ + A+LQE F C VGAN Y TPP +QGFAPHY
Sbjct: 361 MPPTVWGHYSDGCSVRILNPSTYLKGLRGVCAALQEHFHCLVGANVYLTPPNSQGFAPHY 420

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF+LQ+EG+K+W++Y     ++ L R SS N  Q+++  PI    LE GDLLY PR
Sbjct: 421 DDIEAFVLQVEGRKRWRLYDAPSPNDVLARTSSGNLKQQQLSKPIFDEVLEAGDLLYFPR 480

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G +HQA T    HSLH+T+SVYQ+ ++ +L+E  MP  LQ A   +L+ RRGLP+G   +
Sbjct: 481 GCVHQAVTEQQHHSLHITLSVYQQQSYANLMEALMPAVLQNAIKHNLDMRRGLPLGTWHH 540

Query: 242 AGLARG-KPVDIQADRLAMKENL-KDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
            G+  G K    ++D +   ++L    LA     +D    VD++  +  H+ALPP ++  
Sbjct: 541 LGMVHGDKKTKERSDLITHTQSLFSKYLAPTASQIDA--AVDQLAIRFQHEALPPRIASS 598

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           E + +VF +  +  + G       +T+ TK+RL+R N VR
Sbjct: 599 EKKRTVFGSRNKKDKHGNCRCDYDLTEQTKIRLLRQNIVR 638


>gi|311261319|ref|XP_003128699.1| PREDICTED: lysine-specific demethylase NO66-like [Sus scrofa]
          Length = 645

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 229 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 288

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 289 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 348

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 349 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 408

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 409 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 468

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 469 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 526

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 527 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 563


>gi|106879206|ref|NP_078920.2| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           NO66 [Homo sapiens]
 gi|284018103|sp|Q9H6W3.2|NO66_HUMAN RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=60S ribosomal
           protein L8 histidine hydroxylase; AltName: Full=Histone
           lysine demethylase NO66; AltName: Full=Myc-associated
           protein with JmjC domain; AltName: Full=Nucleolar
           protein 66; Short=hsNO66; AltName: Full=Ribosomal
           oxygenase NO66; Short=ROX
          Length = 641

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 223 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 282

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 283 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 342

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 343 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 402

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 403 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 462

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 463 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 520

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 521 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 557


>gi|7670372|dbj|BAA95038.1| unnamed protein product [Mus musculus]
          Length = 603

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 150/337 (44%), Positives = 207/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+ ++YY  L ++  +D +LR   ++F +++D   Y DG+R+T++  GRAL
Sbjct: 185 WEREAVLVRRQDRSYYEGLFSTADLDSMLRYEDVQFGQHLDAARYVDGRRETLNPPGRAL 244

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  YR GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 245 PAAAWSLYRAGCSLRLLCPQAFSPTVWQFLAVLQEQFGSMAGSNVYLTPPDSQGFAPHYD 304

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR   E L   SSPNFSQE++G P+L   LEPGDLLY PRG
Sbjct: 305 DIEAFVLQLEGRKLWRVYRPRDPSEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 364

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   + EFRRGLP  ++ Y 
Sbjct: 365 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAIEENEEFRRGLPRDFMDYM 424

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 425 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 482

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV    +R      V    Q+T +T+V +++    R
Sbjct: 483 LSVHGLPVRWEAGEPVNVGAQLTTETQVHMLQDGVAR 519


>gi|410216552|gb|JAA05495.1| chromosome 14 open reading frame 169 [Pan troglodytes]
 gi|410257830|gb|JAA16882.1| chromosome 14 open reading frame 169 [Pan troglodytes]
 gi|410299056|gb|JAA28128.1| chromosome 14 open reading frame 169 [Pan troglodytes]
          Length = 641

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 223 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 282

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 283 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 342

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 343 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 402

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 403 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 462

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 463 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 520

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 521 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 557


>gi|344273988|ref|XP_003408800.1| PREDICTED: lysine-specific demethylase NO66-like [Loxodonta
           africana]
          Length = 652

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 148/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D ILR   ++F +++D   Y +G+R+T++  GRAL
Sbjct: 234 WEREAVLVRRQDHAYYQGLFSTADLDAILRKEDVQFGQHLDAARYVNGRRETLNPPGRAL 293

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 294 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNIYLTPPNSQGFAPHYD 353

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQE++G P+L   LEPGDLLY PRG
Sbjct: 354 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 413

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           ++HQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 414 FVHQAQCQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 473

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 474 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 531

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 532 LSVYGLPIRWEAGQPVNVGAQLTTETEVHMLQDGIAR 568


>gi|431839107|gb|ELK01034.1| Nucleolar protein 66 [Pteropus alecto]
          Length = 427

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y  G+R+T++  GRAL
Sbjct: 11  WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYVSGRRETLNPPGRAL 70

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 71  PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 130

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 131 DIEAFVLQLEGRKLWRVYRPRVSTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 190

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 191 FIHQAECQDGMHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 250

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 251 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 308

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 309 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 345


>gi|10437975|dbj|BAB15138.1| unnamed protein product [Homo sapiens]
 gi|15030188|gb|AAH11350.1| Chromosome 14 open reading frame 169 [Homo sapiens]
 gi|38708274|gb|AAR27292.1| NO66 protein [Homo sapiens]
 gi|48735363|gb|AAH71954.1| Chromosome 14 open reading frame 169 [Homo sapiens]
          Length = 641

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 223 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 282

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 283 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 342

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR   E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 343 DIEAFVLQLEGRKLWRVYRPRAPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 402

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 403 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 462

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 463 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 520

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 521 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 557


>gi|444302172|pdb|4E4H|A Chain A, Crystal Structure Of Histone Demethylase No66
 gi|444302173|pdb|4E4H|B Chain B, Crystal Structure Of Histone Demethylase No66
 gi|444302174|pdb|4E4H|C Chain C, Crystal Structure Of Histone Demethylase No66
 gi|444302175|pdb|4E4H|D Chain D, Crystal Structure Of Histone Demethylase No66
          Length = 463

 Score =  306 bits (784), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 45  WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 104

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 105 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 164

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR   E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 165 DIEAFVLQLEGRKLWRVYRPRAPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 224

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 225 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 284

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 285 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 342

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 343 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 379


>gi|449504341|ref|XP_004186246.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase NO66
           [Taeniopygia guttata]
          Length = 385

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 150/293 (51%), Positives = 190/293 (64%), Gaps = 2/293 (0%)

Query: 47  YEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN 106
           Y +G R+T +  GRALP +VWD Y+ GCS+R L PQ +   +    + LQE FG   GAN
Sbjct: 11  YAEGVRETHNPLGRALPAVVWDFYQNGCSLRLLCPQAFSPTVWHFLSILQEHFGSMAGAN 70

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           TY TPP  QGFAPHYDDIEAF+LQLEGKK W+VY PR   E LP++SS N +Q E+G P+
Sbjct: 71  TYLTPPGTQGFAPHYDDIEAFVLQLEGKKHWRVYRPRTDAEVLPQFSSANLTQPELGEPV 130

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           L   LE GDLLY PRG+IHQ   + + HSLH+T+S YQ+ +W D LEK +P ALQ A   
Sbjct: 131 LETVLEAGDLLYFPRGFIHQGDCLPDAHSLHITVSSYQRNSWGDFLEKLLPAALQMALEE 190

Query: 227 DLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQ 286
           D+E+RRGLP+ YLRY G+A    VD  A R A  E ++ L+ KLV+Y  +D  VD+  K 
Sbjct: 191 DVEYRRGLPMDYLRYMGVANSDTVD--ARRTAFVEKVQSLIKKLVDYAPIDAAVDQRAKS 248

Query: 287 LMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            +HD LPP+L+  E   SV+    R    G       ITKDT+VRL+R   VR
Sbjct: 249 FLHDCLPPMLTESEKAQSVYGFPARWQDGGPCNVDILITKDTEVRLLRHGIVR 301


>gi|391345560|ref|XP_003747053.1| PREDICTED: lysine-specific demethylase NO66-like [Metaseiulus
           occidentalis]
          Length = 515

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 152/345 (44%), Positives = 221/345 (64%), Gaps = 12/345 (3%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           +++EK+P  +  K  +YY +L+++K++D +LR+  + F+KNID+T Y +G+R+T+ +EGR
Sbjct: 97  NYFEKKPFVVRHKDPSYYKHLLSTKQLDTVLRERDVHFEKNIDLTSYVNGKRETVLLEGR 156

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           ALP +VW +Y EGCS+R LNPQ+Y++ +  L  +L E F  FVGAN Y TP   QGFAPH
Sbjct: 157 ALPALVWQYYSEGCSVRLLNPQSYVEAVHTLCYTLAEHFSSFVGANVYLTPAGTQGFAPH 216

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGDLLYL 179
           +DDI+AF+LQLEGKK+WKVY PR  DE LP  SS N  ++E +  PI+  TLE GDLLY+
Sbjct: 217 WDDIDAFVLQLEGKKRWKVYEPRNEDEELPLVSSGNLDRDELVDPPIIETTLETGDLLYM 276

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
           PRG+IH+AS+  ++HSLH+TIS  Q+  W D L+  +P AL+ A  +++EFRR LP  YL
Sbjct: 277 PRGFIHEASSCEDDHSLHITISANQRNTWSDYLKILLPAALELATESNVEFRRTLPRNYL 336

Query: 240 RYAGLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
           +  G+      D++   R    E   +L+ KL++    D   D+M    MHD+LPPVL+ 
Sbjct: 337 QNMGVMHS---DLENKARQMFSEIASNLVQKLLKEAPFDPAADQMAVNYMHDSLPPVLNS 393

Query: 299 EELQCSVFENGLRMSQTGEVYNAT----QITKDTKVRLVRANAVR 339
            E + +      R     E  N       +  +T+V L+  N VR
Sbjct: 394 SEKELTA---SHRARWNCETLNGNSSFDHVNPETEVCLISKNCVR 435


>gi|379318603|pdb|4DIQ|A Chain A, Crystal Structure Of Human No66
 gi|379318604|pdb|4DIQ|B Chain B, Crystal Structure Of Human No66
          Length = 489

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 64  WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 123

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 124 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 183

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR   E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 184 DIEAFVLQLEGRKLWRVYRPRAPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 243

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 244 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 303

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 304 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 361

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 362 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 398


>gi|355693420|gb|EHH28023.1| hypothetical protein EGK_18357 [Macaca mulatta]
          Length = 529

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 111 WEREAVLVRRQDHTYYQGLFSTADLDLMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 170

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 171 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 230

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 231 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 290

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 291 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 350

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 351 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 408

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 409 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 445


>gi|383421627|gb|AFH34027.1| lysine-specific demethylase NO66 [Macaca mulatta]
 gi|387542184|gb|AFJ71719.1| lysine-specific demethylase NO66 [Macaca mulatta]
          Length = 642

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 224 WEREAVLVRRQDHTYYQGLFSTADLDLMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 283

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 284 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 343

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 344 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 403

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 404 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 463

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 464 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 521

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 522 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 558


>gi|109084252|ref|XP_001090820.1| PREDICTED: lysine-specific demethylase NO66 [Macaca mulatta]
          Length = 642

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 224 WEREAVLVRRQDHTYYQGLFSTADLDLMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 283

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 284 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 343

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 344 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 403

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 404 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 463

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 464 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 521

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 522 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 558


>gi|355766856|gb|EHH62561.1| hypothetical protein EGM_20943, partial [Macaca fascicularis]
          Length = 423

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 5   WEREAVLVRRQDHTYYQGLFSTADLDLMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 64

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 65  PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 124

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG
Sbjct: 125 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 184

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 185 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 244

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 245 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 302

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 303 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 339


>gi|426377411|ref|XP_004055459.1| PREDICTED: lysine-specific demethylase NO66 [Gorilla gorilla
           gorilla]
          Length = 686

 Score =  305 bits (780), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 146/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 268 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 327

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 328 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 387

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR+  E L   SSPN SQ+++G P+L   LEPGDLLY PRG
Sbjct: 388 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNLSQDDLGEPVLQTVLEPGDLLYFPRG 447

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 448 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 507

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 508 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 565

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 566 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 602


>gi|157820533|ref|NP_001101510.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           NO66 [Rattus norvegicus]
 gi|391359316|sp|D3ZU57.1|NO66_RAT RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|149025100|gb|EDL81467.1| similar to RIKEN cDNA 2410016O06 (predicted) [Rattus norvegicus]
          Length = 597

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 149/337 (44%), Positives = 206/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+  +YY  L ++  +D +LR   + F +++D   Y DG+R+T++  GRAL
Sbjct: 179 WEREAVLVRRQDHSYYEGLFSTSDLDWMLRYEDVHFGQHLDAARYIDGRRETLNPPGRAL 238

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 239 PAAAWSLYQAGCSLRLLCPQAFSPTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 298

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR   E L   SSPNFSQE++G P+L   LEPGDLLY PRG
Sbjct: 299 DIEAFVLQLEGRKLWRVYRPRDPSEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 358

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 359 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAMQAAIEENVEFRRGLPRDFMDYM 418

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 419 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 476

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV    +R      V    Q+T +T+V +++    R
Sbjct: 477 LSVHGLPIRWEAGEPVNVGAQLTTETQVHMLQDGIAR 513


>gi|194390104|dbj|BAG61814.1| unnamed protein product [Homo sapiens]
          Length = 429

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 146/337 (43%), Positives = 207/337 (61%), Gaps = 2/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+E + + R+   YY  L ++  +D +LR+  ++F +++D   Y +G+R+T++  GRAL
Sbjct: 11  WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 70

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P   W  Y+ GCS+R L PQ +   + Q  A LQE FG   G+N Y TPP +QGFAPHYD
Sbjct: 71  PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 130

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEG+K W+VY PR   E L   SSPNFSQ+++G P+L   L PGDLLY PRG
Sbjct: 131 DIEAFVLQLEGRKLWRVYRPRAPTEELALTSSPNFSQDDLGEPVLQTVLGPGDLLYFPRG 190

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
           +IHQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y 
Sbjct: 191 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 250

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
           G       D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E  
Sbjct: 251 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 308

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            SV+   +R      V    Q+T +T+V +++    R
Sbjct: 309 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 345


>gi|195133564|ref|XP_002011209.1| GI16410 [Drosophila mojavensis]
 gi|284433503|sp|B4L6Q5.1|NO66_DROMO RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|193907184|gb|EDW06051.1| GI16410 [Drosophila mojavensis]
          Length = 888

 Score =  303 bits (775), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 141/301 (46%), Positives = 200/301 (66%), Gaps = 4/301 (1%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WE+    + RKQ NY+  LI+ + ID++L +N +EF  NID+T Y+ G RQT++  GRA
Sbjct: 492 YWERNACQVKRKQPNYFTQLISFQMIDEMLIENQLEFTTNIDVTTYKKGVRQTLNPVGRA 551

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           +   +W +Y +GCSIR LNP TY+  L+QL +++QE F C VGAN Y TPP +QGFAPHY
Sbjct: 552 MSPAIWGYYGDGCSIRILNPSTYLPKLRQLCSTMQEFFHCLVGANVYLTPPNSQGFAPHY 611

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF++Q+EG+K+W++Y P    + L R SS N+ QEE+G P+    LE GD+LY PR
Sbjct: 612 DDIEAFVIQVEGRKRWRLYAPPHQSDVLARTSSGNYKQEELGQPLFDAVLEAGDILYFPR 671

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G +HQA T   +HSLH+T+SVYQ+ A+ +LLE  MP  L+ A    L  RRGLP+   ++
Sbjct: 672 GTVHQAVTEPKQHSLHITLSVYQQQAYANLLEVLMPSVLERAIKHHLSLRRGLPLHIWQH 731

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLV--EYVDLDDGVDEMGKQLMHDALPPVLSPE 299
            GLA+G     Q D+L    + K L+ + +      +D  VD++ K+  H+ALPP + PE
Sbjct: 732 VGLAKGGQQSEQRDQLM--NSTKQLVQRYLVPTEAQIDAAVDQLAKRFQHEALPPYIKPE 789

Query: 300 E 300
           E
Sbjct: 790 E 790


>gi|196011016|ref|XP_002115372.1| hypothetical protein TRIADDRAFT_50673 [Trichoplax adhaerens]
 gi|190582143|gb|EDV22217.1| hypothetical protein TRIADDRAFT_50673 [Trichoplax adhaerens]
          Length = 431

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 144/337 (42%), Positives = 205/337 (60%), Gaps = 3/337 (0%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE++P+   R+  +Y N L +S  +D I+R+N IE+  N+D+T YE+G R+T + EGR L
Sbjct: 15  WERKPILAQRRSSSYNNGLFSSHDLDRIVRENYIEYSVNLDVTTYENGVRETHNAEGRVL 74

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
             ++WD+Y+ GCSIR LNPQTY + L +  + LQE FG FVG N Y TPP  QGFAPH+D
Sbjct: 75  ASVMWDYYQNGCSIRMLNPQTYSESLWKFCSLLQEYFGSFVGCNMYLTPPGTQGFAPHFD 134

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEGKKKW+ Y PR   E LP YSS NF+Q EIG P     LE GD  Y PRG
Sbjct: 135 DIEAFVLQLEGKKKWRFYNPRDDSEILPEYSSGNFNQNEIGKPSFEFVLEQGDFAYFPRG 194

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQA ++ + HSLH+T+S  Q  ++    EK +P A+++A   +L  R+ LP  +  +A
Sbjct: 195 TIHQAQSLPDCHSLHITVSTCQLHSFGKYFEKLLPMAIRSAFKNNLGLRKSLPPDF--FA 252

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
            +         A R  +   +K  L  +++   +D+  D     ++HD LPP  + EE  
Sbjct: 253 NIGGIHADSKNARRKQLTTEVKQFLKDIIDDAPIDEAADLFAAGVIHDYLPPCHTQEERN 312

Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           C +F  G R + +G V +   + +D +++LVR    R
Sbjct: 313 CMIFTTGGRWNGSG-VVSCVTLNEDNRIKLVRREVAR 348


>gi|328717527|ref|XP_001950505.2| PREDICTED: lysine-specific demethylase NO66-like [Acyrthosiphon
           pisum]
          Length = 521

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 150/344 (43%), Positives = 211/344 (61%), Gaps = 8/344 (2%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           DHWEK  L++ R   NY++ L +  ++D ILR+N +++  N+DIT Y D  R+T +  GR
Sbjct: 86  DHWEKTILHVPRNSSNYFSQLFSLTELDTILRENNLQYGTNVDITSYTDNVRETHNPVGR 145

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A P +VWD+Y  GCS+R LNPQ +   + +  A+LQE FG  VG N Y TPP +QGFAPH
Sbjct: 146 AHPHVVWDYYNNGCSVRLLNPQLFAPEIYKFMANLQEYFGSLVGCNVYLTPPFSQGFAPH 205

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF++Q++G+K W+VY PR   E LPR SS NF Q+EIG PIL V L PGD LY+P
Sbjct: 206 YDDIEAFVVQVDGEKHWRVYKPRSEFETLPRTSSRNFHQDEIGEPILDVILRPGDFLYMP 265

Query: 181 RGYIHQASTV-TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
           RGYIHQA T+ T  HSLH+T S YQ+ +  D L+  +  +L  A   D+ +R GLP+GY 
Sbjct: 266 RGYIHQADTLFTETHSLHLTFSSYQQNSMYDFLQVVVNNSLNNAVKNDISYRSGLPVGYQ 325

Query: 240 RYAGLA---RGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG-KQLMHDALPPV 295
            + GL    +  P  +Q  R   +  +  L+  L  ++DLD  +D+   K    ++LPP 
Sbjct: 326 HFGGLCELEKSPPRTVQ--RQQFENQINQLIRNLKNHLDLDHAIDKFSLKNYYTNSLPPS 383

Query: 296 LSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           +   E+  +V  NG +M   G+        +  +V+L+R N  R
Sbjct: 384 ICDVEMLRTV-ANGAKMLPDGKTTGEEVNLEIAEVKLIRRNCFR 426


>gi|326433887|gb|EGD79457.1| hypothetical protein PTSG_10023 [Salpingoeca sp. ATCC 50818]
          Length = 681

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 147/340 (43%), Positives = 219/340 (64%), Gaps = 11/340 (3%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++WEK+PL + RK++ +++ L +S+ ID I+R N +++  N+D+T YEDG+R T++ +GR
Sbjct: 239 EYWEKKPLLVRRKERGHFDGLFSSEAIDAIVRQNYLKYGVNLDLTKYEDGERTTLNPDGR 298

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
             P ++W  + +GCS+R LNPQT+ +P+ +  ++LQ+ FGC VGANTY TP   QGFAPH
Sbjct: 299 VRPNVMWAMFEDGCSVRVLNPQTFHQPVWKAISALQDYFGCMVGANTYLTPKGTQGFAPH 358

Query: 121 YDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
           YDDIEAFILQLEG+K+W++Y  P  V   LPR+SSPN+ + E+G P L V L PGDLLY 
Sbjct: 359 YDDIEAFILQLEGEKRWRLYDQPEGV--RLPRHSSPNYDESELGKPYLDVVLRPGDLLYF 416

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
           PRG +HQA ++   HSLH+T+S YQ   W   ++K +P+AL  A   D +FR GLP   L
Sbjct: 417 PRGVVHQAVSLPKSHSLHLTLSTYQLFDWAQYMQKLVPQALANAIEEDADFRSGLPRNSL 476

Query: 240 RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
            Y GL++      Q ++    +  K L+AKL EY   D   D++  + +H+ +PP LS E
Sbjct: 477 DYVGLSKVDENPPQREKFI--KQAKRLMAKLPEYASFDAAADQVAVENIHNCMPPCLSHE 534

Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           +   S    G    + G V    ++  ++ +R++   AVR
Sbjct: 535 QQALS----GRLRPRRGFVPKPLKM--NSFIRIISPRAVR 568


>gi|281209361|gb|EFA83529.1| transcription factor jumonji [Polysphondylium pallidum PN500]
          Length = 519

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 149/341 (43%), Positives = 212/341 (62%), Gaps = 15/341 (4%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D++  + L + R   NYY +  T   ID +L+ N ++F +NID+T Y DG+RQT++ EGR
Sbjct: 112 DYFGVKHLVVKRGDSNYYKSFFTKDSIDKMLKSNCLKFSENIDVTNYVDGERQTLNPEGR 171

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A P  VW HYREGCSIR LNPQT+   + ++ ++LQ  F C +GAN Y TP  AQGFAPH
Sbjct: 172 AYPAQVWKHYREGCSIRLLNPQTFNANVWRVLSTLQTHFQCGMGANVYLTPAGAQGFAPH 231

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDD++ FI+QLEGKK W++Y PR   E L + SS NF QEEIG PI+ VTLE GD++Y P
Sbjct: 232 YDDVDVFIIQLEGKKVWRLYPPRSEQEVLAKKSSDNFEQEEIGEPIVEVTLEAGDVMYFP 291

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQA +  + HSLH+TIS Y    W DL+ K + +AL  A    +EFR GLP  Y +
Sbjct: 292 RGVIHQAYSPEDVHSLHITISTYMNNTWGDLIGKVLTRALDIANDEYVEFREGLPRDYTQ 351

Query: 241 YAGLARGKPVDIQAD--RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
           + G+      D   D  R  + + +  L  KL + +  D G D M  + +HD+LPPV + 
Sbjct: 352 FLGVMHS---DKAGDERRKELLDKVGTLWEKLGQCLPTDVGADLMAVKFLHDSLPPVFNQ 408

Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            E + S+ ++            A+++  +++VRL+R +++R
Sbjct: 409 LEKKHSIEDDNA----------ASELKLESRVRLIRQDSIR 439


>gi|18858091|ref|NP_572160.1| CG2982 [Drosophila melanogaster]
 gi|122129655|sp|Q7K4H4.1|NO66_DROME RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|15292145|gb|AAK93341.1| LD40453p [Drosophila melanogaster]
 gi|22831673|gb|AAF45939.2| CG2982 [Drosophila melanogaster]
 gi|220946188|gb|ACL85637.1| CG2982-PA [synthetic construct]
 gi|220955888|gb|ACL90487.1| CG2982-PA [synthetic construct]
          Length = 653

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 147/342 (42%), Positives = 211/342 (61%), Gaps = 6/342 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D WE     + R    Y+ ++I+ K +D+IL  + ++F  N+D+T Y++G+R+T++ EGR
Sbjct: 232 DFWEHTACLVQRSNPKYFQSMISFKMLDEILIRHHLDFTVNVDVTTYKNGKRETLNPEGR 291

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           ALP  VW  Y +GCSIR LNP TY+  L+Q+   LQE F C VGAN Y TPP +QGFAPH
Sbjct: 292 ALPPAVWGFYSDGCSIRLLNPSTYLIRLRQVCTVLQEFFHCKVGANLYLTPPNSQGFAPH 351

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF++Q+EG+K+W +Y P    + L R SS N+ QE++G PI+   L  GD+LY P
Sbjct: 352 YDDIEAFVIQVEGRKRWLLYEPPKKADQLARISSGNYDQEQLGKPIIDEVLSAGDVLYFP 411

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG +HQA T   +HSLH+T+SVYQ+ A+ +LLE  MP  L+ A    +  RRGLP+   +
Sbjct: 412 RGAVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSVALRRGLPLHTFQ 471

Query: 241 YAGLA-RGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLS 297
             G A +G        R  + EN++ L+    +    D+D+ VD+M K+  H+ALPP++ 
Sbjct: 472 VLGNAYKGNDC---GSRKQLVENVQKLVTNYLMPSEDDIDEAVDQMAKKFQHEALPPIVL 528

Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           P E   +V        + G      +  K T VRL+RAN +R
Sbjct: 529 PSEEVRTVHGARSDADEQGNCVCDYKFNKKTSVRLLRANILR 570


>gi|167523278|ref|XP_001745976.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775777|gb|EDQ89400.1| predicted protein [Monosiga brevicollis MX1]
          Length = 432

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 199/339 (58%), Gaps = 2/339 (0%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++WE +PL I RK +  +  L +S+++DD++R N I++  NID+  Y DG R T + EGR
Sbjct: 36  EYWETKPLLIRRKNRQRFKGLFSSQQLDDVIRSNYIKYGVNIDMARYSDGVRTTENPEGR 95

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
                +W  Y +GCSIR LNPQTY KP+ QL ++LQE F C VG NTY TPP AQGFAPH
Sbjct: 96  VHANTMWALYEDGCSIRMLNPQTYAKPVWQLISTLQEYFQCMVGCNTYLTPPGAQGFAPH 155

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEA ILQLEG K+W++Y      E LPR SS NF Q E+  PIL V L+PGD LY P
Sbjct: 156 YDDIEALILQLEGSKRWRLY-NNPTGERLPRTSSRNFDQSELSEPILDVVLQPGDFLYFP 214

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG  HQA +  +EHSLH+T+S YQ   W +  +K +P AL  A A D EFR GLP+  L 
Sbjct: 215 RGMAHQAVSTPDEHSLHITLSTYQLFDWAEYFKKLVPAALDYAIAEDAEFREGLPLQALN 274

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           + GL   +  +    R    +  K L  KL++    D   D +    +H ++P  L+ EE
Sbjct: 275 HVGLLHSE-TEGDNQRKRFMDKAKHLFQKLIDVAPYDSAADAVAVDFLHASMPSYLTSEE 333

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           L  +  +  L         ++  +     +RL+R +  R
Sbjct: 334 LALTSRQKQLAARSNPVELSSPALAPSDWIRLIRPSMCR 372


>gi|195565039|ref|XP_002106114.1| GD16684 [Drosophila simulans]
 gi|284433507|sp|B4R4H1.1|NO66_DROSI RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|194203485|gb|EDX17061.1| GD16684 [Drosophila simulans]
          Length = 847

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 213/339 (62%), Gaps = 4/339 (1%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+    + R    Y+ +LI+ K +D+IL  + ++F  N+D+T Y++G+R+T++ EGRAL
Sbjct: 428 WEQTACLVQRTNPKYFQSLISFKMLDEILIRHHLDFTVNLDVTTYKNGKRETLNPEGRAL 487

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  VW  Y EGCSIR LNP  Y+  L+++   LQE F C V AN Y TPP +QGFAPHYD
Sbjct: 488 PPAVWGFYSEGCSIRLLNPSAYLTRLREVCTVLQEFFHCKVEANMYLTPPNSQGFAPHYD 547

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++Q+EG+K+W +Y P    ++L R SS N+ QE++G PI+   L  GD+LY PRG
Sbjct: 548 DIEAFVIQVEGRKRWLLYEPPKEADHLARISSGNYDQEQLGKPIIDEVLSAGDVLYFPRG 607

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            +HQA T   +HSLH+T+SVYQ+ A+ +LLE  MP  L+ A    +  RRGLP+   +  
Sbjct: 608 TVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSVALRRGLPLHTFQVL 667

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLV--EYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           G A  K  D  + +L + EN++ L+ K +     D+D+ VD+M K+  H+ALPP++ P E
Sbjct: 668 GNAY-KANDCGSRQL-LVENVQKLVTKYLIPSEDDIDEAVDQMAKKFQHEALPPIVLPSE 725

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
              +V        + G      +  + T VRL+RAN +R
Sbjct: 726 EVRTVHGARSGADEQGNCVCDYKFNEKTSVRLLRANILR 764


>gi|195340843|ref|XP_002037022.1| GM12339 [Drosophila sechellia]
 gi|284433506|sp|B4I100.1|NO66_DROSE RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|194131138|gb|EDW53181.1| GM12339 [Drosophila sechellia]
          Length = 655

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 214/339 (63%), Gaps = 4/339 (1%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE+    + R    Y+ +LI+ K +D+IL  + ++F  N+D+T Y++G+R+T++ EGRAL
Sbjct: 236 WEQTACLVQRTNPKYFQSLISFKMLDEILIRHNLDFTVNLDVTTYKNGKRETLNPEGRAL 295

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  VW  Y EGCSIR L+   Y+  L+++   LQE F C VGAN Y TPP +QGFAPHYD
Sbjct: 296 PPAVWGFYSEGCSIRLLHASAYLTRLREVCTVLQEFFHCKVGANMYLTPPNSQGFAPHYD 355

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++Q+EG+K+W +Y P    ++L R SS N++QE++G PI+   L  GD+LY PRG
Sbjct: 356 DIEAFVIQVEGRKRWLLYDPPKEADHLARISSGNYNQEQLGKPIIDEVLSAGDVLYFPRG 415

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            +HQA T   +HSLH+T+SVYQ+ A+ +LLE  MP  L+ A    +  RRGLP+   +  
Sbjct: 416 TVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSVALRRGLPLHTFQIL 475

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           G A  K  D  + +L + EN++ L+AK  +    D+D+ VD+M K+  H+ALPP++ P E
Sbjct: 476 GNAY-KANDCGSRQL-LVENVQKLVAKYLMPSEDDIDEAVDQMAKKFQHEALPPIVLPSE 533

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
              +V          G      +  + T VRL+RAN +R
Sbjct: 534 EVRTVHGARSGADDQGNCVCDYKFNEKTSVRLLRANILR 572


>gi|326920526|ref|XP_003206522.1| PREDICTED: lysine-specific demethylase NO66-like, partial
           [Meleagris gallopavo]
          Length = 367

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 140/281 (49%), Positives = 181/281 (64%), Gaps = 2/281 (0%)

Query: 59  GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           G+ALP +VWD Y+ GCS+R LNPQ +   +    + LQE FG   GANTY TPP  QGFA
Sbjct: 5   GQALPAVVWDFYQNGCSLRLLNPQAFSTTVWHFLSILQERFGSMAGANTYLTPPGTQGFA 64

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PHYDDIEAF+LQLEGKK+W+VY PR   E LP++SS N +Q E+  P+L V LE GDLLY
Sbjct: 65  PHYDDIEAFVLQLEGKKRWRVYSPRTSSEALPQFSSANLTQAELAEPLLEVVLEAGDLLY 124

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
            PRG+IHQA  + + HSLH+T+S YQ+ +W D LEK +P ALQ A   D+E+R GLP+  
Sbjct: 125 FPRGFIHQADCLPDVHSLHITVSSYQRNSWGDFLEKLLPAALQMALEEDIEYREGLPMDC 184

Query: 239 LRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
           L Y G+A    VD  A R    E ++ L+ KLV+Y  +D  +D+  K  +HD LPPVL+ 
Sbjct: 185 LGYMGVANSDAVD--ARRTVFVEKVQHLIKKLVDYAPIDAAMDQRAKSFLHDCLPPVLTQ 242

Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            E Q SV+    +            ITKDT++RL+    VR
Sbjct: 243 SEKQLSVYGFPAQWHNGRPCNVDIPITKDTEIRLLHHGIVR 283


>gi|328865556|gb|EGG13942.1| transcription factor jumonji [Dictyostelium fasciculatum]
          Length = 524

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 206/339 (60%), Gaps = 11/339 (3%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D++  +   + R   NYY N  T   ID +L+ N ++F +NIDIT Y D +RQT++ +G+
Sbjct: 97  DYFGLQHYIVKRNDPNYYKNFFTKDSIDKMLKSNNLKFTENIDITNYVDNERQTLNPDGK 156

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A P +VW H R+GCS+R LNPQT+   + ++ + LQ  F C +GAN Y TP  AQGFAPH
Sbjct: 157 AYPAVVWKHVRDGCSLRLLNPQTFNMNVWKVLSCLQTYFQCGMGANIYLTPAGAQGFAPH 216

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDD++ FILQLEGKK W++Y PR   E LP+ SS NF Q+EIG PI+ VTLE GD++Y P
Sbjct: 217 YDDVDVFILQLEGKKVWRLYEPRSEQERLPKKSSENFEQDEIGEPIIEVTLEAGDMMYFP 276

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQA +  + HSLH+TIS      W DL  K + +AL  A    +EFR+GLP  Y  
Sbjct: 277 RGIIHQAYSPKDVHSLHITISTQMNNTWGDLFGKVITRALDIANEEAVEFRQGLPRDYTD 336

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           Y G+     V  +  R  + ++   L  KL  Y+ +D G D M  + +HD+LPP LS +E
Sbjct: 337 YLGVINSDKVGDER-RKELLDHASMLWEKLGNYLPVDIGADLMAVKYLHDSLPPFLSTQE 395

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            + ++ ++            A  +    +VRL+R +AVR
Sbjct: 396 KKHTIEDDSA----------AASLKLTQRVRLLRKDAVR 424


>gi|325188355|emb|CCA22892.1| nucleolar protein putative [Albugo laibachii Nc14]
          Length = 492

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 160/343 (46%), Positives = 209/343 (60%), Gaps = 17/343 (4%)

Query: 2   HWEKEPLYISRKQKNYYNNL-ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM--DVE 58
           +W K P+ I RK++ +YNN     +++  IL D+ +E+  ++D+  Y DG R+T   +  
Sbjct: 75  YWGKRPVVIKRKKQAFYNNCWFDEREMIRILEDHSLEYINDVDVARYIDGTRETWLPEKS 134

Query: 59  GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           G A  + VW  Y EG SIR L PQ Y   L +L + L+  FG  +GANTY TPP AQGFA
Sbjct: 135 GIASGKDVWTKYAEGWSIRLLCPQQYSDRLWRLLSILESEFGTMMGANTYLTPPRAQGFA 194

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PHYDDIEAFILQLEG+K+WKVY PR  ++ LPR SS NF Q++IG P+L V L PGDLLY
Sbjct: 195 PHYDDIEAFILQLEGRKQWKVYSPRCTEQILPRVSSKNFQQDDIGDPLLEVELGPGDLLY 254

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
           LPRGYIHQA T    HSLH+TIS  Q  A+ DL+E  +P+ALQ+A  ++L  R+ LP  Y
Sbjct: 255 LPRGYIHQAKTAKELHSLHITISAGQHNAYADLMEVLIPQALQSAIQSNLSMRKSLPRNY 314

Query: 239 LRYAGLARGKPVDIQADRLA-MKENLKDLLAKLVEYV-DLDDGVDEMGKQLMHDALPPVL 296
           L Y G+     +D  A+R   + E  K L A + E V  LD   D+M K+ + D LPP L
Sbjct: 315 LSYMGVMHSD-LDEHAERQKFISETKKQLKAIMTEAVAKLDAASDQMAKRFLMDRLPPAL 373

Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
             EE  CSV           E     +IT DT+ +L+R    R
Sbjct: 374 LDEEEDCSV-----------EACPLLKITVDTQFKLLRGEIAR 405


>gi|449685252|ref|XP_002159583.2| PREDICTED: lysine-specific demethylase NO66-like, partial [Hydra
           magnipapillata]
          Length = 424

 Score =  285 bits (728), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 123/212 (58%), Positives = 159/212 (75%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           W+K+PL+I R+Q  Y  N  ++KK+D ILR+  +++ KN+DI +Y +GQR+T++ EGRA 
Sbjct: 213 WQKKPLFIKRRQPLYNTNWFSTKKLDKILREKNVQYTKNLDIAVYRNGQRETLNPEGRAF 272

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P +VW  Y +GCSIR LNPQ + K + QL + LQE FGC VG+N Y TPP +QGFAPHYD
Sbjct: 273 PSVVWKFYEDGCSIRLLNPQIFAKSVHQLTSRLQEYFGCLVGSNVYLTPPGSQGFAPHYD 332

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF++QLEGKK WK+Y PR  +E L RYSS N  +E +G PIL   LE GD LY PRG
Sbjct: 333 DIEAFVIQLEGKKHWKLYPPRNTNEVLARYSSENMQEENLGEPILNKVLEAGDTLYFPRG 392

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
            IHQAST+ + HSLH+TIS+YQK++W D LEK
Sbjct: 393 VIHQASTLEDSHSLHITISLYQKSSWGDYLEK 424


>gi|330798349|ref|XP_003287216.1| hypothetical protein DICPUDRAFT_32138 [Dictyostelium purpureum]
 gi|325082799|gb|EGC36270.1| hypothetical protein DICPUDRAFT_32138 [Dictyostelium purpureum]
          Length = 499

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 146/332 (43%), Positives = 206/332 (62%), Gaps = 12/332 (3%)

Query: 8   LYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVW 67
           L+I R + N Y N  T   +D +LR+N ++F +N+D+T Y + +R T++ EGRA P +VW
Sbjct: 100 LHIKRSE-NVYKNFFTKDSLDKMLRNNYMKFTENVDVTNYVNEERITLNPEGRAYPSLVW 158

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAF 127
            HY+EGCS+R LNPQT+   + +L + LQ  F C VGAN Y TP  AQGFAPHYDD++ F
Sbjct: 159 KHYKEGCSVRLLNPQTFNSNVWKLCSILQGHFKCGVGANIYLTPAGAQGFAPHYDDVDVF 218

Query: 128 ILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQA 187
           ILQLEGKK+W++Y P+   E LP+ SS NF+Q+EIG P  T TLE GDLLY PRG IHQA
Sbjct: 219 ILQLEGKKEWRLYHPKDQTEVLPKKSSENFAQDEIGEPYFTCTLEAGDLLYFPRGVIHQA 278

Query: 188 STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARG 247
            +  + HSLH+T+S Y    W DL+ K + +AL+ A    LEFR GLP  Y  + G+   
Sbjct: 279 VSPPDVHSLHITVSTYLNNTWGDLIGKVLNRALEIANEDCLEFREGLPRDYSDFLGVIHS 338

Query: 248 KPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFE 307
             V  +  R  + + +  L  KL +Y+ +D   D+M  + + D+LP  L+  E + S+ +
Sbjct: 339 DKVG-EERRKELLDKVGTLWDKLGQYLPVDVAADQMHIKYLIDSLPTALNQNEKKHSIED 397

Query: 308 NGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
                       N  +I  DT+ RL+R ++VR
Sbjct: 398 ET----------NPMKIKTDTRFRLIRQDSVR 419


>gi|355731984|gb|AES10555.1| hypothetical protein [Mustela putorius furo]
          Length = 401

 Score =  283 bits (725), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 139/313 (44%), Positives = 194/313 (61%), Gaps = 2/313 (0%)

Query: 27  IDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIK 86
           +  +LR   ++F +++D   Y +G+R+T++  GRALP   W  Y+ GCS+R L PQ +  
Sbjct: 9   LRSMLRHEDVQFGQHLDAARYVNGRRETLNPPGRALPAAAWSLYQAGCSLRLLCPQAFST 68

Query: 87  PLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD 146
            + Q  A LQE FG   G+N Y TPP +QGFAPHYDDIEAF+LQLEG+K W+VY PR+  
Sbjct: 69  TVWQFLAVLQEQFGSMAGSNIYLTPPNSQGFAPHYDDIEAFVLQLEGRKLWRVYRPRVPT 128

Query: 147 EYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           E L   SSPNFSQ+++G P+L   LEPGDLLY PRG+IHQA      HSLH+T+S YQ+ 
Sbjct: 129 EELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRGFIHQAECQDGVHSLHLTLSTYQRN 188

Query: 207 AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDL 266
            W D LE  +P A+QAA   ++EFRRGLP  ++ Y G       D +  R A  E ++ L
Sbjct: 189 TWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYMGAQHSDSKDPR--RTAFMEKVRVL 246

Query: 267 LAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITK 326
           +A+L  +  +D   D+  K  +HD+LPPVL+  E   SV+   +R      V    Q+T 
Sbjct: 247 VARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERALSVYGLPIRWEAGEPVNVGAQLTT 306

Query: 327 DTKVRLVRANAVR 339
           +T+V +++    R
Sbjct: 307 ETEVHMLQDGIAR 319


>gi|332022277|gb|EGI62590.1| Nucleolar protein 66 [Acromyrmex echinatior]
          Length = 394

 Score =  283 bits (725), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 137/313 (43%), Positives = 198/313 (63%), Gaps = 4/313 (1%)

Query: 27  IDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIK 86
            ++ LR+N + F KN+DIT Y +G ++  +  GRA   IV D++R  CS+R LNPQT+I 
Sbjct: 4   FEETLRNNNVLFTKNLDITSYSNGVKEIYNPPGRAYASIVKDYFRNNCSVRMLNPQTFIP 63

Query: 87  PLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD 146
            +   NA+LQE+FGCF+GAN Y TPP +QGFAPH+DD++ FILQ+EGKK+WK+Y P    
Sbjct: 64  QICAFNANLQEIFGCFIGANIYLTPPNSQGFAPHFDDVDVFILQIEGKKRWKLYKPLKSS 123

Query: 147 EYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           ++L  Y S +F + E+  PIL   +  GDLLY+PRG IHQA  + + +SLH+TIS+ QK 
Sbjct: 124 DFLAMYPSRDFDESELREPILDKVISAGDLLYIPRGTIHQAMAL-DTYSLHLTISINQKN 182

Query: 207 AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDL 266
            W +LLEK +P+AL+ A   D  FR GLP+ YL Y G        ++  R  MK  ++ +
Sbjct: 183 TWSNLLEKLLPQALKQATKKDFRFREGLPVNYLSYIGFQYTDKDLVRKGR--MKYIIRRM 240

Query: 267 LAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITK 326
           L +L  Y D+D   D M K+ +HD LPP +   E +C++ E+   +S  G      +I  
Sbjct: 241 LGELERYFDIDQAADLMAKKHIHDFLPPNIFKSERRCTILEDDKNLSYNGPK-RCGKIKL 299

Query: 327 DTKVRLVRANAVR 339
            T++RL+R   +R
Sbjct: 300 STEIRLLRLYCMR 312


>gi|255088593|ref|XP_002506219.1| predicted protein [Micromonas sp. RCC299]
 gi|226521490|gb|ACO67477.1| predicted protein [Micromonas sp. RCC299]
          Length = 387

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 145/349 (41%), Positives = 200/349 (57%), Gaps = 18/349 (5%)

Query: 1   DHWEKEPLYISRK-QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE- 58
           D WE+ P Y+SR   K Y++ L++   ID+ LR   + +++N+D+T Y+DG R+T ++  
Sbjct: 20  DIWERRPAYVSRNAHKGYFDGLLSKADIDEWLRAGKMRYQRNVDVTSYKDGVRRTHNLND 79

Query: 59  -------------GRALPQIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVG 104
                        G A    VW  + +EGCS+R L+PQ +  PL +  A+L+  + C  G
Sbjct: 80  DGSGGVDATTGEPGFADADTVWRRFEQEGCSLRVLHPQRWRDPLWKTLAALERFWNCSTG 139

Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT 164
            N Y TP  +QGF+PHYDDI+AFILQLEGKK W+VY PR   E LPRYSSPNF Q+++G 
Sbjct: 140 CNCYLTPADSQGFSPHYDDIDAFILQLEGKKLWRVYPPRSEAEMLPRYSSPNFGQDDVGE 199

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
           P+L V LEPGDLLY+PRG +HQA+ V  +HSLHVT+S  Q   W DLLE A P AL+ A 
Sbjct: 200 PVLEVILEPGDLLYMPRGTVHQANCVPGDHSLHVTLSTNQFNTWADLLEVAFPAALRQAV 259

Query: 225 ATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG 284
           A     RR  P  YL + GL  G+   +   R  +   L +L   ++  +  D   D +G
Sbjct: 260 AEVPALRRCPPPDYLAHLGLVDGEFDAVNPRRDDLIGALVELAQCVMRRLPFDAAADHIG 319

Query: 285 KQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLV 333
            +LM   LPP   P  +   V   G   + T       +IT++   RLV
Sbjct: 320 ARLMRQRLPP--PPSHVSAPVSATGANAAATVTDETRVRITQEEGARLV 366


>gi|301098029|ref|XP_002898108.1| nucleolar protein, putative [Phytophthora infestans T30-4]
 gi|262105469|gb|EEY63521.1| nucleolar protein, putative [Phytophthora infestans T30-4]
          Length = 676

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 143/343 (41%), Positives = 202/343 (58%), Gaps = 18/343 (5%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++WE+ PL I R   +YY+   + ++ID IL+ + +E+  ++D+T Y D  R T++  G 
Sbjct: 77  NYWEQRPLAIKRNFPSYYDGWFSKQEIDRILKTHTLEYGTDVDLTKYVDDTRHTLNPPGS 136

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A  + VW HY +GCS+R L PQ +   + +L A+L++ +GC  GANTY TP   QGFAPH
Sbjct: 137 ATAKQVWKHYDDGCSVRLLCPQKFSDDVWKLLATLEDEWGCMAGANTYLTPKNTQGFAPH 196

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +DDIEAF+LQ EG K WKVY P    + L RY S NF  E++G P L V LE GDLLY P
Sbjct: 197 FDDIEAFLLQTEGCKHWKVYKPLNESDVLARYPSGNFKAEDLGKPTLEVDLEQGDLLYFP 256

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG+IHQA     +HSLH+T+S  Q+    + LE  +P+AL  A  T++E RR LP  YL 
Sbjct: 257 RGFIHQARAHKEKHSLHLTVSTGQQNTMGNFLEVLLPQALAGAINTNVELRRSLPRDYLE 316

Query: 241 YAGLA----RGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVL 296
           Y G+     +G P + QA    +K  LK +L + +    LD   D+M K  + D LPP L
Sbjct: 317 YMGVMHSDRKGDP-ERQAFANKLKGALKTVLGEAMGM--LDAASDQMAKNFLLDRLPPAL 373

Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
             EE  C+   + L+           +IT +T+++LVR    R
Sbjct: 374 EDEEENCTSDNSPLQ-----------KITVNTQLKLVRHGVAR 405


>gi|168049347|ref|XP_001777125.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671568|gb|EDQ58118.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 530

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 135/339 (39%), Positives = 190/339 (56%), Gaps = 19/339 (5%)

Query: 3   WEKEPLYISR-KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE-DGQRQTMDVEGR 60
           WEK+P  + R K +NYY  +     I+ +L ++ +++  NID+T Y+ DG R T   EG 
Sbjct: 122 WEKKPFLVRRPKNRNYYAGIFDKATIEKLLEEHELKYGLNIDVTKYDIDGGRSTFSSEGS 181

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A P  VW  Y +G S+R L+PQ +  P+  + ++ +  +G   G N Y TP  +QGF+PH
Sbjct: 182 ATPSKVWSKYADGWSVRILHPQRWCDPVFLILSAFERFWGSVAGCNAYLTPAGSQGFSPH 241

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           YDDIEAF++Q EG+K+WKVY PR   E LPR+SSPNF Q EIG PIL V LEPGD+LY+P
Sbjct: 242 YDDIEAFVIQTEGRKRWKVYKPRTPGEALPRFSSPNFEQGEIGEPILDVDLEPGDILYMP 301

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG IHQA    + HSLH+T+SV Q+  W D LE AMP+AL+ A    +  R  LP GY  
Sbjct: 302 RGTIHQAKASEDAHSLHITVSVGQRNCWGDFLEFAMPRALELASEDHILLRESLPRGYAD 361

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
           Y G+A     D    R A  + + + +A + + +  D   D++  + +   LP       
Sbjct: 362 YMGVAHSDDHD-NPQRAAFIDKIMECMAIVSQSIPWDSAADQLAVKFLQSRLP------- 413

Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
                    L            +IT  ++VRLV  +  R
Sbjct: 414 ---------LPAPANAVHGKGQKITGKSRVRLVAPDVAR 443


>gi|298708016|emb|CBJ30378.1| expressed unknown protein [Ectocarpus siliculosus]
          Length = 686

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 141/388 (36%), Positives = 205/388 (52%), Gaps = 63/388 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD----- 56
           +WE+ P  +  ++ +Y++  + +  ++  +    +++  ++D+T Y + +R T++     
Sbjct: 181 YWERNPFVVRGRRADYFSGWLETSDLEAFISGQSMQYGTDLDVTNYVNKRRVTLNPAAPA 240

Query: 57  --------------------------------------VEGRA-----LPQIVWDHYREG 73
                                                   GRA      P+ VW  +REG
Sbjct: 241 PPPEQSPSPPPKKGGHKGKGKGGPSTSGNGASGDAHLAATGRAGGAVVTPKFVWQKFREG 300

Query: 74  CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEG 133
           CS+R   PQ +  PL  L ++L+E FGC VG+N Y TPP +QGFAPH+DDIEAF+LQ+EG
Sbjct: 301 CSLRMPCPQKFSDPLHLLLSALEEEFGCMVGSNVYLTPPRSQGFAPHWDDIEAFLLQVEG 360

Query: 134 KKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNE 193
           +K+W+VY P      LPR SSPN + E++G P L V LEPGDLLYLPRG+ HQA TV +E
Sbjct: 361 RKRWRVYPPTDDQAVLPRLSSPNLTDEQVGEPALEVVLEPGDLLYLPRGWAHQAETVGDE 420

Query: 194 HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQ 253
            SLH+T+S  Q   W DL+E  +P+A+ +A   + E R GLP  YL Y G+      D +
Sbjct: 421 ASLHITVSAMQGNCWADLVEGLVPQAVASAVQANQELRSGLPRDYLEYMGVVHSDEEDKR 480

Query: 254 AD--RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLR 311
            D  +  M++ L  ++ + +E   LD   D+MGK+ + D LPPVL   E   SV E G  
Sbjct: 481 RDMFKAKMRKKLGLIVEEAIEL--LDAAADQMGKRYISDRLPPVLEESEELSSVVERG-- 536

Query: 312 MSQTGEVYNATQITKDTKVRLVRANAVR 339
                       IT  TK+RL R    R
Sbjct: 537 ---------DGAITPLTKLRLARRGCAR 555


>gi|348682869|gb|EGZ22685.1| hypothetical protein PHYSODRAFT_330429 [Phytophthora sojae]
          Length = 485

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 201/342 (58%), Gaps = 18/342 (5%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WE+ PL I R  ++YY+   +  ++D IL+ + +E+  ++D+T Y D  R T++  G A
Sbjct: 78  YWEQRPLAIKRNFRSYYDGWFSKAEVDRILKTHTLEYGADLDLTKYVDSTRHTLNPSGSA 137

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
             + VW H+ +GCS+R L PQ +   + +L A+L++ +GC  GANTY TP   QGFAPH+
Sbjct: 138 TAKEVWKHFDDGCSVRLLCPQKFSDDVWKLLATLEDEWGCMAGANTYLTPKNTQGFAPHF 197

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DDIEAF+LQ EG K WKVY P    + L RY S N+  E++G P++ V LE GDLLY PR
Sbjct: 198 DDIEAFLLQTEGCKHWKVYKPLNDSDMLARYPSGNYKPEDLGKPMIEVDLEEGDLLYFPR 257

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G+IHQA     +HSLH+T+S  Q+    + LE  +P+AL  A  T++E RR LP  YL Y
Sbjct: 258 GFIHQARAHKEKHSLHLTVSTGQQNTMGNFLEVLLPQALGGAINTNVELRRSLPRDYLDY 317

Query: 242 AGLAR----GKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
            G+      G P + QA    +K  LK +L + +    LD   D+M K  + D LPP L 
Sbjct: 318 MGVMHSDREGDP-ERQAFANKLKGALKIVLGEAMGM--LDAAADQMAKNFLVDRLPPPLE 374

Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            EE  C+   + L+           +IT +T+++L+R    R
Sbjct: 375 DEEENCTSDNSPLQ-----------KITVNTQLKLIRHGVAR 405


>gi|384253086|gb|EIE26561.1| cupin 4 [Coccomyxa subellipsoidea C-169]
          Length = 414

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 119/302 (39%), Positives = 189/302 (62%), Gaps = 11/302 (3%)

Query: 3   WEKEPLYISRKQ-KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQT---MDVE 58
           WE+  L + R +   ++ +  +   ++ +LR   I +  NID+T Y DGQ +T    +VE
Sbjct: 76  WEQRVLLVRRPECPTFFESWFSKGAMEALLRKETISYAYNIDVTKY-DGQERTNYNYNVE 134

Query: 59  GRALPQ-----IVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
             A+ +     ++W  Y +EGCS+R L+PQ +   L    A L+E+F   VG N Y TP 
Sbjct: 135 SSAVEEAVDADVLWARYNQEGCSVRLLHPQRWFDHLAATLAKLEEVFSSCVGCNVYLTPG 194

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
            ++GFAPH+DDI+AF+LQ+EG K+W+++ P   D  LPR+SSP+F+QE++  P L V L+
Sbjct: 195 DSKGFAPHFDDIDAFVLQVEGAKRWRLHQPITEDHVLPRFSSPDFAQEDLDDPFLDVVLQ 254

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
           PGD+LY+PRG IHQA  +   HSLH+T+S  Q+ +W D LE A+P+ALQ A    ++ RR
Sbjct: 255 PGDVLYMPRGTIHQAQALPGTHSLHLTVSANQRNSWADFLEVALPRALQLAAEECVDLRR 314

Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
           G+P+GY++Y G+       ++A R A +   + ++ ++++++ LD   D++  Q M   L
Sbjct: 315 GMPLGYMQYMGVPFSDDSSLEAKRAAFEATAQSMVQRVLQFLPLDGAADQLAAQFMQQRL 374

Query: 293 PP 294
           PP
Sbjct: 375 PP 376


>gi|308505402|ref|XP_003114884.1| CRE-JMJC-1 protein [Caenorhabditis remanei]
 gi|308259066|gb|EFP03019.1| CRE-JMJC-1 protein [Caenorhabditis remanei]
          Length = 759

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 139/347 (40%), Positives = 207/347 (59%), Gaps = 19/347 (5%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           ++   L + RKQ  Y+ NL ++ ++ ++L  N +E+  NI+I  Y++G R T++ +GR  
Sbjct: 336 YQSNVLVVHRKQPTYFGNLFSTARLCELLEKNHLEYGTNINIAQYKNGVRTTLNGKGRVY 395

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           PQIV  H    CS++ +NPQT+   +  +   LQE FGCFVGANTY TP  + GFAPH+D
Sbjct: 396 PQIVKQHLNNLCSVQLVNPQTFDDRIWYICEVLQEQFGCFVGANTYLTPAGSSGFAPHWD 455

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLP 180
           +I+AF+LQ+EG+K W+V+ P   DE LP  SS NF++EE+    P+    +E GD++Y+P
Sbjct: 456 EIDAFLLQVEGRKYWRVWAPESSDEELPLESSDNFTEEEMKGRVPVFEGWIEKGDVIYIP 515

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RGYIHQA T +  HSLHVTIS  ++ ++ +LLEK +P+A+ A      + RRGLPIG   
Sbjct: 516 RGYIHQARTDSKVHSLHVTISTGRQWSFANLLEKLVPEAIGALIQDQHKLRRGLPIGLFD 575

Query: 241 YAGLARGKPVDI---QADRLAMKENL--KDLLAKLVEYV---DLDDGVDEMGKQLMHDAL 292
             G+     VD+   Q D    K  +     ++KL  +V     +  VD M K+ M  AL
Sbjct: 576 MGGV-----VDLDYAQEDHFTEKFKIVVDRQMSKLRNFVADHTFESSVDSMAKEFMKQAL 630

Query: 293 PPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           PP+L+  E + SV   G  +   G+  +    T  TKV+L+R +  R
Sbjct: 631 PPLLTESEKKLSVI--GASIDLLGD--DLVDFTAKTKVKLIRKHTQR 673


>gi|71991759|ref|NP_001021644.1| Protein JMJC-1, isoform a [Caenorhabditis elegans]
 gi|74956075|sp|O01658.2|NO66_CAEEL RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|351060644|emb|CCD68363.1| Protein JMJC-1, isoform a [Caenorhabditis elegans]
          Length = 748

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 135/347 (38%), Positives = 207/347 (59%), Gaps = 19/347 (5%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           ++   L + RKQ  Y+ NL ++ ++ ++L  N +E+ +NI+I  Y++G R T++ +GRA 
Sbjct: 325 YQSNVLVVRRKQPTYFGNLFSTARLGELLEKNHLEYGRNINIAQYKNGVRTTLNGQGRAY 384

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           PQIV  H    CS++ +NPQTY   +  L   +QE FGCFVGANTY TP  + GFAPH+D
Sbjct: 385 PQIVKQHLHNMCSVQLVNPQTYDDRIWYLCEVIQEQFGCFVGANTYLTPAGSSGFAPHWD 444

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLP 180
           +I+AF+LQ+EG+K W+V+ P   +E LP  SS NF+++++    P+    +E GD++Y+P
Sbjct: 445 EIDAFLLQVEGRKYWRVWAPESAEEELPLESSDNFTEDDMKGREPVFEGWIEKGDMIYIP 504

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RGYIHQA T +  HSLHVT+S  ++ ++ +L+EK +P+A+     T  + RRGLP G   
Sbjct: 505 RGYIHQARTDSKVHSLHVTVSTGRQWSFANLMEKVVPEAIGVLTDTRHKLRRGLPTGLFD 564

Query: 241 YAGLARGKPVDI---QADRLAMKENL-----KDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
             G+     +D+   Q D    K  +       +L  LV    L+  VD + K+ M  AL
Sbjct: 565 MGGV-----IDLDYSQEDHFVEKFKMVVDRHMSMLRNLVADQLLESSVDSLAKEFMKQAL 619

Query: 293 PPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           PP L+ +E + SV   G   +  G+  +    T  TKVRL+R +  R
Sbjct: 620 PPRLTEQEKKLSVL--GSSTNLLGD--DLVDFTARTKVRLIRRHTQR 662


>gi|341896700|gb|EGT52635.1| hypothetical protein CAEBREN_31833 [Caenorhabditis brenneri]
          Length = 765

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 134/347 (38%), Positives = 207/347 (59%), Gaps = 19/347 (5%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           ++   L + RKQ NY+ NL ++ ++ ++L  N +E+  NI+I  Y++G R T++ +GRA 
Sbjct: 342 YQSNVLVVHRKQPNYFGNLFSTARLCELLEKNHLEYGTNINIAQYKNGIRTTLNGKGRAY 401

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           PQI+  H    CS++ +NPQT+   +  L   LQE+FGCFVGANTY TP  + GFAPH+D
Sbjct: 402 PQIIKQHLHNLCSVQLVNPQTFDDRIWYLCEVLQEIFGCFVGANTYLTPAGSSGFAPHWD 461

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLP 180
           +I+AF+LQ+EG+K W+V+ P   +E LP  SS NF+++++    P+    +E GD++Y+P
Sbjct: 462 EIDAFLLQVEGRKYWRVWAPESAEEELPLESSDNFTEDDMKGREPVFEGWIEKGDMIYIP 521

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RGYIHQA T +  HSLHVT+S  ++ ++ +L+EK +P+A+ A      + RRGLPIG   
Sbjct: 522 RGYIHQARTDSKVHSLHVTVSTGRQWSFANLMEKIVPEAIGALTDNRHKLRRGLPIGLFD 581

Query: 241 YAGLARGKPVDI---QADRLAMKENL-----KDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
             G+     +D+   Q D    K  +       +L  LV    L+  VD + K+ M  AL
Sbjct: 582 MGGV-----IDLDYPQEDHFVEKFKIVVDRHMSMLRNLVADHLLESSVDSLAKEFMKQAL 636

Query: 293 PPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           PP+L+ +E + SV   G+     G   +       TKVR +R +  R
Sbjct: 637 PPILTEKEKKLSVI--GVSTDLLGS--DLIDFNAKTKVRFIRKHTQR 679


>gi|341876636|gb|EGT32571.1| CBN-JMJC-1 protein [Caenorhabditis brenneri]
          Length = 528

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 133/347 (38%), Positives = 206/347 (59%), Gaps = 19/347 (5%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           ++   L + RKQ NY+ NL ++ ++ ++L  N +E+  NI+I  Y++G R T++ +GR  
Sbjct: 105 YQSNVLVVHRKQPNYFGNLFSTARLCELLEKNHLEYGTNINIAQYKNGIRTTLNGKGRVY 164

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           PQI+  H    CS++ +NPQT+   +  L   LQE+FGCFVGANTY TP  + GFAPH+D
Sbjct: 165 PQIIKQHLHNLCSVQLVNPQTFDDRIWYLCEVLQEIFGCFVGANTYLTPAGSSGFAPHWD 224

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLP 180
           +I+AF+LQ+EG+K W+V+ P   +E LP  SS NF+++++    P+    +E GD++Y+P
Sbjct: 225 EIDAFLLQVEGRKYWRVWAPETAEEELPLESSDNFTEDDMKGREPVFEGWIEKGDMIYIP 284

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RGYIHQA T +  HSLHVT+S  ++ ++ +L+EK +P+A+ A      + RRGLPIG   
Sbjct: 285 RGYIHQARTDSKVHSLHVTVSTGRQWSFANLMEKIVPEAIGALTDNRHKLRRGLPIGLFD 344

Query: 241 YAGLARGKPVDI---QADRLAMKENL-----KDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
             G+     +D+   Q D    K  +       +L  LV    L+  VD + K+ M  AL
Sbjct: 345 MGGV-----IDLDYPQEDHFVEKFKIVVDRHMSMLRNLVADHLLESSVDSLAKEFMKQAL 399

Query: 293 PPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           PP+L+ +E + SV   G+     G   +       TKVR +R +  R
Sbjct: 400 PPILTEKEKKLSVI--GVSTDLLGS--DLIDFNAKTKVRFIRKHTQR 442


>gi|312065885|ref|XP_003136006.1| hypothetical protein LOAG_00418 [Loa loa]
 gi|307768830|gb|EFO28064.1| hypothetical protein LOAG_00418 [Loa loa]
          Length = 747

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 128/343 (37%), Positives = 196/343 (57%), Gaps = 11/343 (3%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           ++K+  ++     +YY NL ++ K  DIL+ + +E+  N++I +Y++ +R T++  G+  
Sbjct: 325 FQKKVFHVRHNNPSYYGNLFSTAKFIDILQTDYVEYGTNVNIAVYKNQERSTLNGSGKVY 384

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           PQ +  + + GCSI+F NPQ++   +      LQE+FGCFVGANTY TP    GFAPH+D
Sbjct: 385 PQEIQKNIKAGCSIQFTNPQSFCDNVWYYCDLLQEVFGCFVGANTYITPANTAGFAPHWD 444

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT--LEPGDLLYLP 180
           DI+AF+LQLEG+K WK+Y     +E LPR SS NF+ +++    L     LE GD+LY+P
Sbjct: 445 DIDAFLLQLEGRKHWKIYAHTDDNEMLPRLSSDNFTDDDVADRRLVFDDWLEQGDMLYIP 504

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG+IHQ     + HSLH+T+SV +   + DLLE+ +P A+       +  RR LP GYL 
Sbjct: 505 RGFIHQGFADKDVHSLHLTVSVCRNVTYADLLERIIPLAISNFAEQHVNIRRSLPAGYLD 564

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD--LDDGVDEMGKQLMHDALPPVLSP 298
             G+        +   + +   L  + +   +YV    +  VD M ++ M  ALPPVL+ 
Sbjct: 565 MTGVLECDYPLFKTGAVKLHRFLDAVFSNFCKYVKELYEPAVDMMAREFMKTALPPVLTK 624

Query: 299 EELQCSVFENGLRMSQTGEVYNATQ--ITKDTKVRLVRANAVR 339
           EE   +V          G +Y   Q   TKDT ++L+R +  R
Sbjct: 625 EEKDMTVL-----CIADGSLYGDNQHIFTKDTPIKLLRRHGQR 662


>gi|291221577|ref|XP_002730797.1| PREDICTED: CG2982-like [Saccoglossus kowalevskii]
          Length = 530

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 126/281 (44%), Positives = 168/281 (59%), Gaps = 38/281 (13%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WEK+PL + R   +Y +   ++ ++D I+R+N +EF +NID+T Y DG+R+T + EGRA 
Sbjct: 202 WEKKPLLVKRHMASYNDGWFSTDEMDQIIRENELEFTRNIDVTSYTDGKRETHNPEGRAY 261

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P +VWD+Y+                              F  A T       QGFAPHYD
Sbjct: 262 PSVVWDYYQ-----------------------------LFNSAGT-------QGFAPHYD 285

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DIEAF+LQLEGKK WK+Y PR  DE LP+YSS NF +E++G PIL V LE GDLLY PRG
Sbjct: 286 DIEAFVLQLEGKKHWKLYSPRTEDETLPKYSSDNFKEEDVGDPILDVELEAGDLLYFPRG 345

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
            IHQA T  + HSLH+T+S  QK  W DL+EK +P+AL+ A   D+EFR+ LP  Y  Y 
Sbjct: 346 TIHQAETSGDTHSLHITVSTCQKNTWGDLMEKLVPRALELAFNEDIEFRQSLPRNYQSYM 405

Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEM 283
           G+      D +  R+A    +  L+ KLV +  +D  VD+M
Sbjct: 406 GVVNSDNDDPK--RVAFLNKIGQLMMKLVSHSPVDAAVDQM 444


>gi|303274723|ref|XP_003056677.1| expressed protein [Micromonas pusilla CCMP1545]
 gi|226461029|gb|EEH58322.1| expressed protein [Micromonas pusilla CCMP1545]
          Length = 639

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 137/353 (38%), Positives = 195/353 (55%), Gaps = 26/353 (7%)

Query: 3   WEKEPLYISRK-QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
           WE+ PLY+SR   K Y +  ++ + +D  LR   + +  N+D+T Y +G R+T ++    
Sbjct: 149 WERRPLYVSRNANKKYLDGWLSKEDVDGWLRAGKMRYGVNVDVTSYVNGVRETHNINDDD 208

Query: 61  --------ALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
                   A    VW  + E GCS+R L+PQ +   L ++ ++++  + C  G N Y TP
Sbjct: 209 ADDEEPWFADAASVWKKFDEKGCSLRILHPQRWKDALWKMLSTMERYWNCSTGCNVYLTP 268

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
           P +QGF+PH+DDI+AF+LQLEGKK W+VY PR   E LPRYSSPNF+Q+EIG P+L V L
Sbjct: 269 PNSQGFSPHFDDIDAFVLQLEGKKTWRVYPPRSEKEMLPRYSSPNFAQDEIGEPVLEVIL 328

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFR 231
           EPGD+LY+PRG +HQAS V  EHSLHVT+S  Q   W D LE A+P A++ A A     R
Sbjct: 329 EPGDVLYMPRGTVHQASCVDGEHSLHVTLSTNQFNTWADALELALPAAIRDAVAELPALR 388

Query: 232 RGLPIGYLRYAGLARGKPVDIQADRLAMKEN-----LKDLLAKLVEYVDLDDGVDEMGKQ 286
           R  P   L+  G+      D        + +     L +L + ++  + LD   D++G +
Sbjct: 389 RCPPPDMLQRLGIGASDDGDGSDGGGGDRRDAFVGALNELASAVLRRLPLDAAADQLGAR 448

Query: 287 LMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           LM   LPP   P  L      +G           A ++T D+ VRL      R
Sbjct: 449 LMKQRLPP--PPSHLATPKAASGPDA--------AKKVTMDSVVRLTAEGGAR 491


>gi|313230251|emb|CBY07955.1| unnamed protein product [Oikopleura dioica]
          Length = 266

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 113/225 (50%), Positives = 159/225 (70%), Gaps = 3/225 (1%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ-RQTMDVEG 59
           ++WEK+P+ I R + +Y ++L ++ ++ +I+++N ++F  N+D+T ++    RQT +  G
Sbjct: 38  NYWEKKPILIRRNKADYNSSLFSTSQLANIIQNNPVQFGVNLDVTSWDKQHGRQTHNEPG 97

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
           RA P  VWD Y  GCS+R LNPQTY K +  L A LQE FGCF G+N+Y TPP+  GFAP
Sbjct: 98  RAWPSKVWDLYNNGCSVRMLNPQTYSKTIWSLCAGLQEFFGCFTGSNSYLTPPSTTGFAP 157

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-GTPILTVTLEPGDLLY 178
           HYDD+E F++Q EG+K+WKV+ P   + +LPR SS N+   EI G P +   ++PGD+LY
Sbjct: 158 HYDDVEVFMVQTEGRKRWKVWKP-FDNSHLPRTSSRNYHPNEIKGEPEIYEIVQPGDVLY 216

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
           +PRG+IHQ   +  EHSLHVTIS YQK AW DLLEK +P+AL  A
Sbjct: 217 VPRGWIHQGECLCGEHSLHVTISTYQKNAWCDLLEKIVPQALCRA 261


>gi|268560752|ref|XP_002646283.1| Hypothetical protein CBG11988 [Caenorhabditis briggsae]
 gi|284433497|sp|A8XEA2.1|NO66_CAEBR RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
          Length = 776

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 133/343 (38%), Positives = 202/343 (58%), Gaps = 21/343 (6%)

Query: 8   LYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVW 67
           L + RK   YY NL ++ +  ++L  + +E+  NI+I  Y+DG R T++ +GR  PQIV 
Sbjct: 356 LVVHRKTPAYYGNLFSTTRFSELLEKHYLEYNMNINIAQYKDGIRTTLNGKGRVYPQIVK 415

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAF 127
            H    CS++ +NPQT+   +  L   LQE FGCFVGANTY TP  + GFAPH+D+I+AF
Sbjct: 416 QHLHNLCSVQLVNPQTFDDRIWYLCEILQEQFGCFVGANTYLTPAGSSGFAPHWDEIDAF 475

Query: 128 ILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI--GTPILTVTLEPGDLLYLPRGYIH 185
           +LQ+EG+K W+V+ P   +E LP  SS NF+++++    P+    +E GD++Y+PRGY H
Sbjct: 476 LLQVEGRKYWRVWAPESAEEELPLESSGNFTEDDMKGKEPVFEGWIEQGDMIYIPRGYTH 535

Query: 186 QASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
           QA T    HSLHVT+S  ++ ++ +L+EK +P+A+ A      + RRGLPIG     G+ 
Sbjct: 536 QARTDKKVHSLHVTVSTGRQWSFANLMEKVIPEAVGALTEERHKLRRGLPIGLFDMGGVV 595

Query: 246 R---------GKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVL 296
                      +   I  DR   +  L++L+A  +    LD  VD M K+ M  ALPP+L
Sbjct: 596 DLDYSQEEHFTEKFKIVVDRHMSR--LRNLVADHL----LDSSVDSMTKEFMKQALPPIL 649

Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           + +E + SV   GL  +  G+  +      +TKV+ +R +  R
Sbjct: 650 TDKEKKRSVI--GLSTNLLGD--DLIDFCANTKVKFIRKHTQR 688


>gi|402590342|gb|EJW84272.1| lysine-specific demethylase NO66 [Wuchereria bancrofti]
          Length = 776

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 126/343 (36%), Positives = 195/343 (56%), Gaps = 11/343 (3%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           ++K+ L++     NYY NL ++ K  DIL+ + +E+  N+++ +Y++ QR T++  G+  
Sbjct: 354 FQKKVLHVCHNNPNYYGNLFSTAKFIDILQTDYVEYGTNVNVAIYKNQQRSTLNGSGKVY 413

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           PQ +    + GCSI+  NPQ++   +      LQE+FGCFVGANTY TP  A GFAPH+D
Sbjct: 414 PQEIQKSIKAGCSIQLTNPQSFCDNVWYYCDLLQEVFGCFVGANTYITPANAAGFAPHWD 473

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT--LEPGDLLYLP 180
           DI+AF+LQLEG+K WK+Y P   +E LPR  S NF+  ++    L     LE GD+LY+P
Sbjct: 474 DIDAFLLQLEGRKHWKIYAPDSDNEMLPRLPSGNFTDNDVINRTLVFDDWLEQGDMLYIP 533

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RGYIHQ     + HSLH+T+SV +   + DLLE+ +P AL      ++  R+ LP  YL 
Sbjct: 534 RGYIHQGFADKDVHSLHLTVSVCRNVTYADLLERVIPPALSNFAEQNVNIRKSLPARYLD 593

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD--LDDGVDEMGKQLMHDALPPVLSP 298
             G+       ++   + +   L ++ +    Y+    +  VD M ++ M  ALPPVL+ 
Sbjct: 594 MTGVLECDYPLLKTGTVKLHRFLDNIFSNFCIYIKELSEPAVDMMAREFMKTALPPVLTK 653

Query: 299 EELQCSVFENGLRMSQTGEVYNATQ--ITKDTKVRLVRANAVR 339
           EE   +             +Y+  Q   TK+T ++L+R +  R
Sbjct: 654 EEKDMTAL-----CVAGSSLYSDKQHIFTKNTPIKLLRRHGQR 691


>gi|440794353|gb|ELR15514.1| cupin superfamily subfamily protein [Acanthamoeba castellanii str.
            Neff]
          Length = 1219

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 136/343 (39%), Positives = 197/343 (57%), Gaps = 39/343 (11%)

Query: 2    HWEKEPLYISRKQ-KNYYNNLITSKKIDDILRDNV----IEFKKNIDITLYEDGQRQTMD 56
            +WE+ PL I R+Q K+YY  LI+    DD+ R  +    I+F +N+DIT Y+DG+R+T +
Sbjct: 831  YWEQAPLLIQRRQDKDYYAGLISK---DDLTRGLLEKFDIQFTRNLDITSYQDGKRKTHN 887

Query: 57   VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
              GRA P+ V   ++EGCS+R LNP TY   +  +   LQE FG  +GAN Y TP + QG
Sbjct: 888  PAGRADPETVQQFFKEGCSVRMLNPATYCDGVWAVLEGLQEYFGYGMGANVYLTPASTQG 947

Query: 117  FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
            F+PHYDDIEAF+LQLEG K+W++Y P      LPR+SS NF Q EIG  ++ V L  GD 
Sbjct: 948  FSPHYDDIEAFVLQLEGSKRWRLYAPLDEPSTLPRFSSRNFEQSEIGAVVMDVVLHAGDF 1007

Query: 177  LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
            LY PRG+IHQ  +                  W+DLLEK +P+ ++ A   +      LP 
Sbjct: 1008 LYFPRGWIHQCCS------------------WLDLLEKMVPRGIEVA-EEEFGLISSLPR 1048

Query: 237  GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVL 296
             Y ++ G  + K   ++ +R    E+ K LL  ++  + +D   D MG Q +H+ LPP L
Sbjct: 1049 QYGQFLGTGKSKLRALREERGTFLESYKQLLEGVLAAMPVDLAADRMGIQYIHNCLPPFL 1108

Query: 297  SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
            S +E         L ++++ E     Q+T+ T+V+L+R  A R
Sbjct: 1109 SADER--------LALARSKEA----QLTETTEVKLLRREAQR 1139



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 22/128 (17%)

Query: 170 TLEPGDLLYLPRGYIHQASTV------------------TNEHSLHVTISVYQKTAWIDL 211
           TL  GDL Y+P+      + +                   N+HSLHV ++  ++T W  +
Sbjct: 391 TLLAGDLAYIPKQMAFTLTPIHSGSSTSSSSSSSSTSAEANKHSLHVLLAPSKRTTWSQM 450

Query: 212 LEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLV 271
            +  +   +Q     ++E R+ LP+GY +Y GL      D    R +  E + +L   + 
Sbjct: 451 FKDTIASGMQCLEKENIEARKTLPMGYQQYMGLVHSDKED-DPRRDSFLEQVSELFGSV- 508

Query: 272 EYVDLDDG 279
             +DLD G
Sbjct: 509 --LDLDSG 514


>gi|444511429|gb|ELV09875.1| Lysine-specific demethylase NO66, partial [Tupaia chinensis]
          Length = 365

 Score =  243 bits (620), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 125/275 (45%), Positives = 166/275 (60%), Gaps = 10/275 (3%)

Query: 73  GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGA--------NTYQTPPTAQGFAPHYDDI 124
           GCS+R L PQ +   + Q  A LQE FG   GA        N Y TPP +QGFAPHYDDI
Sbjct: 9   GCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGAHVLLKAGSNVYLTPPNSQGFAPHYDDI 68

Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
           EAF+LQLEG+K W+VY PR+  E L   SSPNFSQ+++G P+L   LEPGDLLY PRG+I
Sbjct: 69  EAFVLQLEGRKLWRVYRPRVPAEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRGFI 128

Query: 185 HQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGL 244
           HQA      HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y G 
Sbjct: 129 HQAECQDGVHSLHLTVSTYQRNTWGDFLEAILPLAVQAAMDENVEFRRGLPRDFMDYMGA 188

Query: 245 ARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCS 304
                 D +  R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E   S
Sbjct: 189 QHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERALS 246

Query: 305 VFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           V+   +R      V    Q+T +T+V +++    R
Sbjct: 247 VYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 281


>gi|170595078|ref|XP_001902238.1| LD40453p [Brugia malayi]
 gi|284433496|sp|A8QFQ3.1|NO66_BRUMA RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase NO66; AltName: Full=Histone lysine
           demethylase NO66
 gi|158590193|gb|EDP28913.1| LD40453p, putative [Brugia malayi]
          Length = 710

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 125/342 (36%), Positives = 196/342 (57%), Gaps = 9/342 (2%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           ++K+   +     NYY NL ++ K  DIL+ + +E+  N+++ +Y++ QR T++  G+  
Sbjct: 288 FQKKVFLVCHNNPNYYGNLFSTAKFIDILQTDYVEYGTNVNVAIYKNQQRSTLNGSGKVY 347

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           PQ +    + GCSI+  NPQ++   +      LQE+FGCFVGAN Y TP    GFAPH+D
Sbjct: 348 PQAIQKSIKAGCSIQLTNPQSFCDNVWYYCDLLQEVFGCFVGANIYITPANTAGFAPHWD 407

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT--LEPGDLLYLP 180
           DI+AF+LQLEG+K WK+Y P   DE LPR  S NF+  ++   +L     LE GD+LY+P
Sbjct: 408 DIDAFLLQLEGRKHWKIYAPDSDDEMLPRLPSGNFTDNDVINRMLVFDDWLEQGDMLYIP 467

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RGYIHQ     + HSLH+T+SV +   + DLLE+ +P AL      ++  R+ LP  YL 
Sbjct: 468 RGYIHQGFADKDVHSLHLTVSVCRNVTYADLLERVIPPALSNFAEQNVNIRKSLPARYLD 527

Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDD-GVDEMGKQLMHDALPPVLSP 298
             G+       ++   + +   L  + +   +Y+ +L +  VD M ++ M  ALPPVL+ 
Sbjct: 528 MTGVLECDYPLLKTGTMKLHRFLDSIFSNFCKYIKELSEPAVDMMAREFMRTALPPVLTK 587

Query: 299 EELQCSVFENGLRMSQTGEVYNATQI-TKDTKVRLVRANAVR 339
           EE   +     L ++ +    +   I TK+T ++L+R +  R
Sbjct: 588 EEKDMT----ALCVAGSSLYGDKEHIFTKNTSIKLLRRHGQR 625


>gi|307110623|gb|EFN58859.1| hypothetical protein CHLNCDRAFT_140729 [Chlorella variabilis]
          Length = 624

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 119/310 (38%), Positives = 185/310 (59%), Gaps = 19/310 (6%)

Query: 4   EKEPLYISRKQ-KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDG--QRQTMDVEGR 60
           ++ P+ ++R Q + Y++ L +   ID ++R   ++++ N+D+T Y  G  +R   +  G 
Sbjct: 165 DEAPMLVTRPQNRTYFSGLFSKDDIDRLVRGGAMQYQINVDVTSYSAGGKKRANFNYNGS 224

Query: 61  ALP-----------QIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
             P           ++VW  + +EGCS+R L+PQ +   L +L A L+    C VG N Y
Sbjct: 225 TEPDEEAVPEMADAEVVWRRFEKEGCSVRLLHPQRFCDGLWRLVARLESFLQCPVGCNAY 284

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
            TP   QGFAPH+DDI+AF+LQLEGKK+W+++ P   +  LPRYSS +FS+EE+G  ++ 
Sbjct: 285 LTPAGTQGFAPHWDDIDAFVLQLEGKKRWRLHAPTDPEHVLPRYSSRDFSEEELGECVMD 344

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL 228
           V LEPGD+LY+PRG +HQA ++ + HSLH+TIS  Q+ +W   LE+A+P+AL+AA  +  
Sbjct: 345 VVLEPGDMLYMPRGTVHQAESMPDSHSLHITISANQQRSWTVFLEEALPEALRAAAQSRR 404

Query: 229 EFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE----YVDLDDGVDEMG 284
           E RR LP  Y  Y G+   +P     +    +E      A+L+E    ++ LD   D++ 
Sbjct: 405 ELRRTLPRDYFEYMGVMHERPEGDGGEEDPRREEFTQHCAELIEAVMHHMPLDGVADQLA 464

Query: 285 KQLMHDALPP 294
            Q +   LPP
Sbjct: 465 VQFLQQRLPP 474


>gi|380797767|gb|AFE70759.1| lysine-specific demethylase NO66, partial [Macaca mulatta]
          Length = 339

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 117/250 (46%), Positives = 156/250 (62%), Gaps = 2/250 (0%)

Query: 90  QLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
           Q  A LQE FG   G+N Y TPP +QGFAPHYDDIEAF+LQLEG+K W+VY PR+  E L
Sbjct: 8   QFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYDDIEAFVLQLEGRKLWRVYRPRVPTEEL 67

Query: 150 PRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWI 209
              SSPNFSQ+++G P+L   LEPGDLLY PRG+IHQA      HSLH+T+S YQ+  W 
Sbjct: 68  ALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRGFIHQAECQDGVHSLHLTLSTYQRNTWG 127

Query: 210 DLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAK 269
           D LE  +P A+QAA   ++EFRRGLP  ++ Y G       D +  R A  E ++ L+A+
Sbjct: 128 DFLEAILPLAVQAAMEENVEFRRGLPRDFMDYMGAQHSDSKDPR--RTAFMEKVRVLVAR 185

Query: 270 LVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTK 329
           L  +  +D   D+  K  +HD+LPPVL+  E   SV+   +R      V    Q+T +T+
Sbjct: 186 LGHFAPVDAVADQRAKDFIHDSLPPVLTDRERALSVYGLPIRWEAGEPVNVGAQLTTETE 245

Query: 330 VRLVRANAVR 339
           V +++    R
Sbjct: 246 VHMLQDGIAR 255


>gi|449684642|ref|XP_002167270.2| PREDICTED: MYC-induced nuclear antigen-like [Hydra magnipapillata]
          Length = 419

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 118/310 (38%), Positives = 180/310 (58%), Gaps = 24/310 (7%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG--- 59
           WEK+PLYI R+   YY +L +   + +IL  + +EF+ ++++  Y D +++ ++ +G   
Sbjct: 60  WEKKPLYIKRENSGYYGDLFSLSSMKEILAAHELEFETDVNVCRYVDNEKELLNEDGCLT 119

Query: 60  -RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
                +++ D +    + +   PQ Y   L QL   ++  FGC VG+N Y TP  +QG A
Sbjct: 120 VDKFDKLMNDKH---ATFQLHQPQRYGTVLWQLMEKMETYFGCLVGSNVYITPKESQGLA 176

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PH DD+E FILQLEG K WK+Y P MV+  L R  + + SQ+ IG PI+ +TLEPGDLLY
Sbjct: 177 PHCDDVEVFILQLEGTKHWKLYKP-MVE--LSRDYTQDLSQDSIGEPIMELTLEPGDLLY 233

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
            PRG IHQA +V   +S H+T+S YQ     D +  A+ +A+++A   D+ FRRGLPI Y
Sbjct: 234 FPRGTIHQARSVGESYSTHITLSTYQNNTLGDFMSIAVSQAIESALENDVNFRRGLPINY 293

Query: 239 LRYAGLAR---------GKPVDIQADR-----LAMKENLKDLLAKLVEYVDLDDGVDEMG 284
           L Y G A+         GK   + ++         K+++K  L+KL++++D++   D M 
Sbjct: 294 LSYLGTAKNFSKYFDEDGKESKLSSNENNEKVKKFKDSVKKHLSKLIDHIDVNTAADMMS 353

Query: 285 KQLMHDALPP 294
              M   LPP
Sbjct: 354 YDFMASRLPP 363


>gi|344245523|gb|EGW01627.1| Nucleolar protein 66 [Cricetulus griseus]
          Length = 320

 Score =  224 bits (570), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 113/238 (47%), Positives = 148/238 (62%), Gaps = 2/238 (0%)

Query: 102 FVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE 161
             GAN Y TPP +QGFAPHYDDIEAF+LQLEG+K W+VY PR   E LP  SSPNFSQE+
Sbjct: 1   MAGANVYLTPPNSQGFAPHYDDIEAFVLQLEGRKLWRVYRPRDPSEELPLTSSPNFSQED 60

Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
           +G P+L   LEPGDLLY PRG+IHQA     EHSLH+T+S YQ+  W D LE  +P A+Q
Sbjct: 61  LGEPVLKAVLEPGDLLYFPRGFIHQAECQDEEHSLHLTLSTYQRNTWGDFLEAVVPLAVQ 120

Query: 222 AAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
           AA   ++EFRRGLP  ++ Y G       D +  R A  E ++ L+A L  +  +D   D
Sbjct: 121 AAIEENVEFRRGLPRDFMDYMGAQHSDSKDPR--RTAFMEKVRVLVAHLRHFAPVDAVAD 178

Query: 282 EMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           +  K  +HD+LPPVL+  E   SV    +  S    V    ++T +T+V +++    R
Sbjct: 179 QRAKDFIHDSLPPVLTTTEEALSVHGLPVHWSDGELVDEGARLTTETEVHMLQDGIAR 236


>gi|290985939|ref|XP_002675682.1| predicted protein [Naegleria gruberi]
 gi|284089280|gb|EFC42938.1| predicted protein [Naegleria gruberi]
          Length = 542

 Score =  223 bits (569), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 119/293 (40%), Positives = 172/293 (58%), Gaps = 8/293 (2%)

Query: 4   EKEPLYISRKQ--KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD--VEG 59
           EK  L I R +   +YY  L +  +I   L D+ + +  ++D+ LY +G+R T++   + 
Sbjct: 122 EKRVLVIRRNEVYPDYYKGLYSLDEIKKTLVDHELRYSYDLDLALYRNGRRFTLNPNKDD 181

Query: 60  RALPQIVWDHYR-EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
            A P +VWD Y  E CSIR L PQ +   L  L    +E FG   G N Y TP  +QGFA
Sbjct: 182 VADPTLVWDLYENEKCSIRMLRPQEHSDVLLSLLCHFEEYFGQGAGLNAYLTPAGSQGFA 241

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-GTPILTVTLEPGDLL 177
           PHYDDIEAF++QLEG+K WK+Y P    +YL R+SS NF+QEE+ G     + L+PGD+L
Sbjct: 242 PHYDDIEAFLIQLEGEKHWKIYRPLENQQYLDRFSSKNFTQEEVAGFECFEILLKPGDML 301

Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           Y+P+G IHQA T  ++HSLH+T+S     +W D LEKA+P ALQ A    ++ R  LP  
Sbjct: 302 YVPKGVIHQAVTSQDQHSLHITVSTSHLMSWTDYLEKALPLALQMATENHVDLRTALPKD 361

Query: 238 YLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHD 290
           +  Y G    +  D + +   +K    +++  ++  + +D   DEM K+ MHD
Sbjct: 362 FKEYMGTNIAECEDPRREEFILK--CSEMVGLVMNSIPMDQAADEMVKKFMHD 412


>gi|320164710|gb|EFW41609.1| nucleolar protein 66 [Capsaspora owczarzaki ATCC 30864]
          Length = 546

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 105/247 (42%), Positives = 146/247 (59%), Gaps = 20/247 (8%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           WE +PL + R + NYY+ + ++  + ++   + I F KN+D+TLYE+ +R T+   GR L
Sbjct: 179 WESKPLVVRRSRPNYYSEVFSTATMFELFSKHDIHFTKNVDVTLYENDERVTVSPPGRVL 238

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P I      +GCS+R LNP +Y   +  L A LQE FG   GAN Y TP  +QGFAPHYD
Sbjct: 239 PPIAQSFLDDGCSLRLLNPHSYDDRVWSLLARLQEAFGAGAGANVYLTPCESQGFAPHYD 298

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           DI+AF+LQLEG K+WK++ PR VD  LP  SS +   +++  P +TV L PGDLLY+PRG
Sbjct: 299 DIDAFVLQLEGTKEWKLHRPRPVDPVLPFISSRDIPADQLEEPFITVQLNPGDLLYMPRG 358

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ-AAGATDLEFRRGLPIGYLRY 241
            IHQ S                   W  LLEKAMP  L+ A+ +  ++ R+ +P  +L  
Sbjct: 359 TIHQNS-------------------WFHLLEKAMPAVLEHASRSGTIDIRQSVPASFLHN 399

Query: 242 AGLARGK 248
              AR +
Sbjct: 400 GASARAQ 406


>gi|440797566|gb|ELR18649.1| cupin superfamily subfamily protein [Acanthamoeba castellanii str.
           Neff]
          Length = 511

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 134/356 (37%), Positives = 197/356 (55%), Gaps = 21/356 (5%)

Query: 1   DHWEKEPLYISRKQKNYYN------NLITSKKIDDILRDN-VIEFKKNIDITLYEDGQRQ 53
           +HWEK+PL + RK++   +      +L +    + +   N  + + KN+++  YEDG+++
Sbjct: 9   EHWEKKPLLVKRKERKVGDGNRLTKDLFSFATFERLFSSNRPLFYTKNLNVCRYEDGEKK 68

Query: 54  TMDVEGR-ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
            ++ EG  A  + V   +REG + +F  PQ + + L ++ A L++ FG  VGAN Y TPP
Sbjct: 69  MLNPEGSPASLKEVRKFWREGSTFQFFQPQQHSEVLHRMIAGLEQYFGALVGANIYMTPP 128

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
            AQG APHYDD+E FILQLEGKKKWK+Y P   DE LPR  S +  Q+E G P++ + LE
Sbjct: 129 NAQGLAPHYDDVEVFILQLEGKKKWKLYSPPPSDE-LPREYSRDLDQDECGEPLMELELE 187

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
            GDLLY PRG IHQA T  +  S HVTIS YQ+  W D L  A+P A++ A   D+EFRR
Sbjct: 188 EGDLLYFPRGTIHQACTGPSA-STHVTISTYQRHTWGDFLAVALPLAIKEAIEEDVEFRR 246

Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKE-----NLKDLLAKLVEYVDLDDGVDEMGKQL 287
           GLP    +  G  +GK     +   A K+      +  LL +L  ++ L    D      
Sbjct: 247 GLPGEASQ--GKGKGKKASSGSKGQAAKQQQFESKMVSLLHRLSGHLLLHKAADITSLDF 304

Query: 288 MHDALPPV----LSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           ++  LPP      +P+E   +      + S+  +V  A  +  D+ +RL   N +R
Sbjct: 305 VNHRLPPASLCYAAPQEEAQTKKATKKKQSKEPQVQEAQVLDDDSSIRLAEPNYIR 360


>gi|301117762|ref|XP_002906609.1| nucleolar protein, putative [Phytophthora infestans T30-4]
 gi|262107958|gb|EEY66010.1| nucleolar protein, putative [Phytophthora infestans T30-4]
          Length = 506

 Score =  204 bits (520), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 114/307 (37%), Positives = 175/307 (57%), Gaps = 22/307 (7%)

Query: 1   DHWEKEPLYISRKQKN--YYNNLITSKKIDDIL--RDNVIEFKKNIDITLYEDGQRQTMD 56
           +++EK+PL++ +  K   + +NL + KK+  ++  +   + F K++ +  Y D +R+  D
Sbjct: 74  EYFEKKPLHVRKADKGALFDSNLFSRKKLLKVMEKQHRSLSFGKDLTVCRYVDSERENFD 133

Query: 57  VE---GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
            E   G A  + V      G S +F  PQ Y   L +LNA+ +++FG   G++ Y TP  
Sbjct: 134 GEDTNGHATSRQVASLLDRGYSCQFYQPQRYEDGLYELNAAFEDVFGGLAGSSAYLTPAN 193

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           +Q  APH+DD+E F++Q +G+KKWK+Y P +V+  L    S + ++++IG P + +T+E 
Sbjct: 194 SQALAPHHDDVEVFVVQTQGRKKWKLYHP-LVE--LAGEHSSDLAEDQIGEPWMELTVEE 250

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           GDLLY PRG IHQA T   E S HVTISVYQ   W + LE A+P+ ++ A  +D+ FR G
Sbjct: 251 GDLLYFPRGVIHQACTDEKEFSTHVTISVYQHNTWANFLEVALPRVIRHAFDSDVSFREG 310

Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKE---NLKDLLAKLVEYV---DLDDGVDEMGKQL 287
           LP+GYL   G          AD    KE     K L+ KL  +V   DL +  DE    +
Sbjct: 311 LPVGYLDSVG------TQFPADSAKAKEFMATCKKLVGKLAAHVSEKDLQEAADEAALDV 364

Query: 288 MHDALPP 294
           + + LPP
Sbjct: 365 LANRLPP 371


>gi|223994815|ref|XP_002287091.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220978406|gb|EED96732.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 830

 Score =  203 bits (517), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 137/413 (33%), Positives = 199/413 (48%), Gaps = 98/413 (23%)

Query: 19  NNLITSKKIDDILRDNVIEFKKNIDITLYEDG------QRQTMD-----------VEGRA 61
           N  +  K IDD++R N + +  ++++T Y D        R T+D            EG A
Sbjct: 333 NGFLHRKSIDDMIRKNKLRYGLDLNVTRYTDSLGNGTRHRITLDPPPKKRKSKTGSEGGA 392

Query: 62  L---------PQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
           +         P  VW +     C++R L P  +   +  + + L+  FGC VG+N Y TP
Sbjct: 393 VDDVEYIVANPTDVWTNVDASHCTLRLLRPHEHNDNIHSMLSLLESEFGCMVGSNAYLTP 452

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI----GTPIL 167
             +QGFAPHYDD++ FILQLEG K+W+VY P    E LPR SS +++++E+    G  +L
Sbjct: 453 LHSQGFAPHYDDVDVFILQLEGYKRWRVYAPMNKQETLPRVSSRDYTEKEVEESMGEEVL 512

Query: 168 TVTLEPGDLLYLPRGYIHQASTV------------TNEHSLHVTISVYQKTAWIDLLEKA 215
            V L PGD+LYLPRG+IHQA TV            T+ HSLH+T+S  Q   W D LE  
Sbjct: 513 DVVLVPGDVLYLPRGWIHQAETVARPSHVSKLPGITDAHSLHLTVSAMQNWCWADFLEIL 572

Query: 216 MPKALQAAGA--TDLEFRRGLPIGYLRYAG--------------------------LARG 247
           MP+AL++A A  T +  R GLP  +L Y G                          +A+G
Sbjct: 573 MPEALESASASETSISLRDGLPRNFLAYMGTMHQLDDEGGELPEGLKQVAEAYAKRVAKG 632

Query: 248 KPVDIQAD------------RLAMKENLKDLLAKLVEYVD------LDDGVDEMGKQLMH 289
              D + D              A+K+  K+   K +  V       LDD  D++GK+ + 
Sbjct: 633 ADKDEEIDDEALAEVMHRQRTAALKKQFKEEAKKRIMRVAKQAMSMLDDACDQIGKRFLS 692

Query: 290 DALPPVLSPEELQCSVFENGLRMSQTGEVYN---ATQITKDTKVRLVRANAVR 339
           D LPP L P E   +      + SQ+   ++   + +I  ++  RLVR N  R
Sbjct: 693 DRLPPALLPHEATLT------KESQSAAAHHHDASKKIWPNSLCRLVRPNIAR 739


>gi|291227952|ref|XP_002733946.1| PREDICTED: CG2982-like [Saccoglossus kowalevskii]
          Length = 513

 Score =  201 bits (511), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 116/311 (37%), Positives = 179/311 (57%), Gaps = 25/311 (8%)

Query: 1   DHWEKEPLYI---SRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           D+WEK+PL +     K+  Y+  L + K++  I + N + F ++++   Y +G+R++++ 
Sbjct: 94  DYWEKKPLILHGEGSKRAAYFKQLFSMKQLRVISKKNSVRFVEDVNCCRYVNGKRESLNE 153

Query: 58  EGRALPQIVWDHYREGC--SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
           +G A  + + +    G   +++F  PQ +   L ++ ++L+ LFGC VGAN Y TP  +Q
Sbjct: 154 DGDATVEKI-NRIFSGLKGTLQFHQPQRFQDELWKIQSNLESLFGCLVGANVYITPAESQ 212

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G APH+DD+E FI QLEG+K W++Y P +    LPR  S + SQ EIG P     L+PGD
Sbjct: 213 GLAPHHDDVEVFIFQLEGEKHWRLYKPSLD---LPRDYSRDLSQNEIGDPTHDFILKPGD 269

Query: 176 LLYLPRGYIHQASTVTNE-HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           ++Y PRG IHQA T+++  HS H+T+S YQ   W DLL   MP  L+     + EFRRG+
Sbjct: 270 VMYFPRGTIHQADTLSSSTHSTHITMSTYQHNTWGDLLSVMMPNLLEKEMDKNKEFRRGI 329

Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDL--DDGVDEMGKQLMHDAL 292
           P G L   G+++              E+LK LL +L E ++   D   +EM    + + L
Sbjct: 330 P-GTLITQGISKPN-----------GEHLKMLLHQLAESIESCDDSSANEMKLDFISNRL 377

Query: 293 PPVL-SPEELQ 302
           PP   + EELQ
Sbjct: 378 PPYHNNTEELQ 388


>gi|348688581|gb|EGZ28395.1| hypothetical protein PHYSODRAFT_473010 [Phytophthora sojae]
          Length = 550

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 110/300 (36%), Positives = 169/300 (56%), Gaps = 9/300 (3%)

Query: 1   DHWEKEPLYISRKQKN--YYNNLITSKKIDDIL-RDNVIEFKKNIDITLYEDGQRQTMDV 57
           +++EK+PL++ +  K   + + L + KK+  ++ +   + F K++ +  Y + +R+  D 
Sbjct: 74  EYFEKKPLHVRKADKGELFGSGLFSRKKMLKVMEKQQHMGFGKDLTVCRYVNSERENFDG 133

Query: 58  E---GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           E   G A  + V      G S +F  PQ Y   L +LNA+ +E+FG   GA+ Y TP  +
Sbjct: 134 EDSNGHATSRQVASLLDRGYSCQFYQPQRYADGLYELNAAFEEVFGGLAGASAYLTPANS 193

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           Q  APH+DD+E F+LQ +G+KKWK+Y P +V+  L    S + + ++IG P + +T+E G
Sbjct: 194 QALAPHHDDVEVFVLQTQGRKKWKLYRP-LVE--LAGEHSSDLAADQIGEPWMELTVEEG 250

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           DLLY PRG +HQA T   + S HVTISVYQ   W + LE A+P+ ++ A  +D+ FR+GL
Sbjct: 251 DLLYFPRGIVHQACTDDKQFSTHVTISVYQHNTWANFLEVALPRVIRHAFDSDVAFRKGL 310

Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
           P+GYL Y G              A  + L   LA  V   DL +  DE    +  + LPP
Sbjct: 311 PVGYLSYMGTQFPTDSAEAKKFAAACKKLVGQLATHVSDKDLQEAADEAALDVFANRLPP 370


>gi|219124973|ref|XP_002182765.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405559|gb|EEC45501.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 351

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 111/252 (44%), Positives = 158/252 (62%), Gaps = 12/252 (4%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT-AQGFAPHYDDI 124
           +W  + +GC+IR L P  +      L + L+  + C +GAN Y TPP+ +QGFAPHYDDI
Sbjct: 1   LWSRFDQGCTIRLLCPHKHSDSTHGLLSLLESEWTCMIGANAYLTPPSGSQGFAPHYDDI 60

Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLPRG 182
           EAF LQLEGKK+WKVY P    E LPR SS ++ + ++    P L V L+PGD+LY+PRG
Sbjct: 61  EAFCLQLEGKKRWKVYAPLQKSERLPRTSSEDYVEADLRDVEPALDVVLKPGDVLYMPRG 120

Query: 183 YIHQASTV--TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATD-LEFRRGLPIGYL 239
           +IHQA T+  T+ +SLH+T+S  Q+ AW DL+E  +P+ALQ+A + D    R+GLP G+L
Sbjct: 121 WIHQACTIDGTDGYSLHLTVSAMQQWAWADLMELLLPEALQSAASGDSTMLRQGLPRGFL 180

Query: 240 RYAGLARGKPVDIQADRLAMKENLKDLLAKLVE--YVD-LDDGVDEMGKQLMHDALPPVL 296
            Y G    +     A+ L  K   +D  A + E   +D LD   D++GK+ + D +PPVL
Sbjct: 181 NYMGAMYDQ--KDTAEILEQKAE-QDRTAAMDETGAIDMLDAACDQIGKRFLSDRVPPVL 237

Query: 297 SPEELQCSVFEN 308
           +  E   +V E+
Sbjct: 238 THLERSMTVHES 249


>gi|149605061|ref|XP_001516457.1| PREDICTED: lysine-specific demethylase NO66-like, partial
           [Ornithorhynchus anatinus]
          Length = 204

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 97/198 (48%), Positives = 124/198 (62%), Gaps = 7/198 (3%)

Query: 48  EDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT 107
           ED  R  M    R   +  W  YR GCS+R L PQ +   +  + A LQE  GC  G N 
Sbjct: 7   EDQARTLMGPGPRQ--KTAWGLYRAGCSLRLLCPQAFSAVVWHVLALLQEHLGCLAGCNA 64

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
           Y TPP AQGFAPH DD+EAF++QLEG+K+W++Y P    E LPR   P+ +   +G P+L
Sbjct: 65  YLTPPGAQGFAPHSDDVEAFVVQLEGRKRWRLYRP---AEELPRRPGPDLAPGALGRPLL 121

Query: 168 TVTLEPGDLLYLPRGYIHQAST--VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
            V L PGDLLY PRG++HQA     T +HSLH+T+S +Q+ +W DLLE  +P A +AA  
Sbjct: 122 QVVLRPGDLLYFPRGFVHQADCPAATGQHSLHLTLSAFQRNSWGDLLEPLLPAAKKAALE 181

Query: 226 TDLEFRRGLPIGYLRYAG 243
            D +FRRGLP  YL Y G
Sbjct: 182 EDADFRRGLPRDYLDYMG 199


>gi|427785655|gb|JAA58279.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 490

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 110/302 (36%), Positives = 170/302 (56%), Gaps = 22/302 (7%)

Query: 2   HWEKEPLYISRK--QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
           HWE+EP     +  Q  +++ L    +   +L++  I F K++ +  YE+G+R   +  G
Sbjct: 55  HWEQEPFVSQNRTSQAEFWSPLFGKDRFFALLKEKQIYFAKDVAVCKYEEGKRTNFERHG 114

Query: 60  RALPQIVWDHYR-EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           RA    +   +  +  +++   PQ ++  L ++   L+  FGC VG N Y TPP+AQG A
Sbjct: 115 RATETKMRKLFEGDKATLQIHQPQRWVDGLWEVLEKLECFFGCLVGCNVYITPPSAQGLA 174

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PHYDD+E FI+QLEG+K WK+Y P +    LPR  S +F+ +EIG P    TL+PGDLLY
Sbjct: 175 PHYDDVEVFIVQLEGEKCWKLYKPPV---ELPRTYSKDFNADEIGQPTHEFTLKPGDLLY 231

Query: 179 LPRGYIHQAST--VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           +PRG IHQA T   +  HS HVTIS YQ     D L+   P  + AA  + L+ RRGLP+
Sbjct: 232 MPRGTIHQALTPESSGTHSTHVTISTYQNHTIGDCLKSLAPDLIDAAQDSCLQLRRGLPM 291

Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVL 296
            +L ++ L +    D++A   A+ +++K++           D    + ++L+HD +   L
Sbjct: 292 QFLPHSRLTKA---DVEAALEAVLKHVKNI-----------DSPMPIAEELVHDFMRTRL 337

Query: 297 SP 298
            P
Sbjct: 338 PP 339


>gi|428179240|gb|EKX48112.1| hypothetical protein GUITHDRAFT_137055 [Guillardia theta CCMP2712]
          Length = 525

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 101/294 (34%), Positives = 168/294 (57%), Gaps = 8/294 (2%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDD-ILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           WE+  L++ R + N+Y N+++ K +   + R  V   + N+ +  Y+DG R+   + G +
Sbjct: 66  WEQRHLHVKRGRSNFYGNILSLKDVMRYVGRGRVSMARSNLRVCKYKDGVRRERRIRGTS 125

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
             ++     R G + +  +PQ    P+  + A L+++    VG++ Y TP  +QG APH+
Sbjct: 126 ASELKRQICR-GYTAQIFSPQHLCTPVAAVCAQLEQVLHSLVGSSLYVTPSHSQGLAPHH 184

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           DD++ F+ QL G+K+W++Y P++     P+  S +   +EIG P+L V L+PGDLLY+PR
Sbjct: 185 DDVDVFVFQLHGRKRWRLYAPKV---RWPKSYSRDMRDKEIGEPVLDVVLKPGDLLYMPR 241

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G +HQAST ++  S+H+TIS YQ  AW DLL  A+P A++ A   + E RR +P+  LR 
Sbjct: 242 GTVHQASTRSSP-SIHLTISAYQHQAWDDLLLAALPYAVRQAAEEEQELRRSIPLDTLRV 300

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD--LDDGVDEMGKQLMHDALP 293
            G+A G        R ++ E  ++L  +L E +   +D  VD M    +   +P
Sbjct: 301 LGMAHGVVNRTCERRESLLEKARNLTHRLAESMTGAVDKAVDLMAADFICGRMP 354


>gi|115647197|ref|XP_789366.2| PREDICTED: myc-induced nuclear antigen-like [Strongylocentrotus
           purpuratus]
          Length = 511

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 103/305 (33%), Positives = 176/305 (57%), Gaps = 17/305 (5%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE 58
           ++WEK+P+ IS ++K  +Y+ +L T   ++ ++ +  I F ++ ++  Y+   R++++  
Sbjct: 79  EYWEKQPIVISNREKHRDYFQSLFTRTILEGLVAEKKISFIQDCNVCRYKGEVRESLNGN 138

Query: 59  GRALPQIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
           G   P  + +   ++  +I+F  PQ + + +  L   L+  FGC VG+N Y TP  +QG 
Sbjct: 139 GIVKPTKLKELLDQDKATIQFHQPQRFQESVWNLLEKLESYFGCLVGSNIYMTPKLSQGL 198

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
           APHYDD+E F+LQLEG+K W++Y P  +   LPR  S +  Q E+G P   + L+ GDL+
Sbjct: 199 APHYDDVEVFVLQLEGEKHWRLYKPPTL---LPRDYSRDLDQSELGEPTHDIVLKAGDLM 255

Query: 178 YLPRGYIHQAST-VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           Y PRG +HQA T  T  HS H+TIS YQ+++W DLL  A+P  +Q A + ++ +R+GLP+
Sbjct: 256 YFPRGTVHQADTPSTCSHSTHLTISTYQRSSWGDLLSIALPSMIQTAISENVSYRQGLPL 315

Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDL--DDGVDEMGKQLMHDALPP 294
            ++         P + QA        + DL+++L  +++    +  +EM    + + LPP
Sbjct: 316 RFV-------SDP-NHQASSSETGRKMADLISQLSSHIESHGQEAANEMLCDFIANRLPP 367

Query: 295 VLSPE 299
               E
Sbjct: 368 FCDGE 372


>gi|346468123|gb|AEO33906.1| hypothetical protein [Amblyomma maculatum]
          Length = 497

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/254 (39%), Positives = 144/254 (56%), Gaps = 8/254 (3%)

Query: 2   HWEKEPLYI--SRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
           HWE+EPL    S  +    + L   +    +L++  I F K+I +  Y DG+R   + +G
Sbjct: 58  HWEQEPLISQNSADRTKLLSPLFGKENFFTLLKEKQIFFTKDIALCKYVDGKRTNFERQG 117

Query: 60  RALPQIVWDHYR-EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           RA        +  +  +++   PQ ++  L +L   L+  FGC VG N Y TPPTAQG A
Sbjct: 118 RATEAKARKLFENDKATMQIHQPQRWVDGLWELLEKLECFFGCLVGCNAYITPPTAQGLA 177

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PHYDD+E FI+QLEG+K WK+Y P +    LPR  S +F   EIG P    TL+P DLLY
Sbjct: 178 PHYDDVEVFIVQLEGEKCWKLYKPPI---ELPRTYSNDFDAAEIGEPSHEFTLKPXDLLY 234

Query: 179 LPRGYIHQAST--VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           +PRG IHQA T   +  HS H+TIS YQ     D L+   P  + AA  + ++ R+GLP+
Sbjct: 235 MPRGTIHQAWTPESSGTHSTHITISTYQNHTVGDGLKSLAPNLIDAALDSCVQLRKGLPL 294

Query: 237 GYLRYAGLARGKPV 250
            +L ++ L++   V
Sbjct: 295 QFLPHSSLSKASVV 308


>gi|326432547|gb|EGD78117.1| hypothetical protein PTSG_08995 [Salpingoeca sp. ATCC 50818]
          Length = 912

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 89/199 (44%), Positives = 127/199 (63%), Gaps = 9/199 (4%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           ++GC+I+F  PQ ++  LQ +  +L+  FGC VGAN Y TPP +QG APH+DDIEAF+LQ
Sbjct: 220 KDGCTIQFHQPQRFMLELQDVMTTLESTFGCLVGANVYITPPNSQGLAPHHDDIEAFVLQ 279

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
           +EG K WK+Y P    E L    SP+  +  IG PI    +  GD+LY PRG +H+A T 
Sbjct: 280 VEGSKHWKLYNPV---EQLASSYSPDLPRTAIGDPIYDGIINAGDVLYFPRGQVHEAQT- 335

Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
           T+  S+HVTIS YQ+  +  LL+KA+P  L+ A +  L+ RRGLP+  L+ +G A   P+
Sbjct: 336 TDTLSIHVTISTYQRWTYGHLLQKALPSCLEGALSRVLDLRRGLPLDLLKSSGFALSPPL 395

Query: 251 DIQADRLAMKENLKDLLAK 269
           +      A+K+ L+ LL +
Sbjct: 396 NP-----ALKDTLRALLQR 409


>gi|294881184|ref|XP_002769286.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239872564|gb|EER02004.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 521

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 168/352 (47%), Gaps = 57/352 (16%)

Query: 1   DHWEKEPLYISR-KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
           ++WEK+PL++ R   ++YY+ + T    +  L  +   F ++++    E G +   + E 
Sbjct: 66  EYWEKKPLHVRRPTARDYYSGVWTKAMAEKTLTKHECRFGESVNFARVEAGVKVMHNGEE 125

Query: 60  --RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
             +A  + +   + +G S +F+ PQ + KP   L   L+  FG   GAN+Y TP  + GF
Sbjct: 126 GEKATVEYMQGQFEDGVSCQFMQPQRFSKPCHALMERLENYFGTLWGANSYLTPANSVGF 185

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
           APHYDD+E F+LQ EG K+W++Y     D  LP   S ++++EE+  P     +E GDLL
Sbjct: 186 APHYDDVEVFMLQTEGSKRWRLYDSPDDDGPLPMEYSRDYTEEELSLPYFDEVVEQGDLL 245

Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM-PKALQAAGATDLEFRRGLPI 236
           Y+PRG +H        +S H+T+S Y   +W +LL+  + P AL  A   D+ FR G+P+
Sbjct: 246 YIPRGTVHFGCVSPEGYSHHLTVSTYYHNSWGELLQNLLIPGALAKAMKEDVGFREGVPV 305

Query: 237 GYLRYAG------------LARG-----------------------KPVD---------- 251
            + RY G            L  G                       + VD          
Sbjct: 306 NWTRYMGRLMAPVTAETAALEAGGDDDNASDDDVEVGKDGSSTEEEEEVDAKVGSVPPQG 365

Query: 252 -IQADRLAM-------KENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPV 295
             QAD  A+       K ++K L+AKL EYVD DD  D+     +    PP 
Sbjct: 366 ETQADPEALMKARKAFKAHVKGLIAKLSEYVDEDDAADQTAVDFVALRTPPA 417


>gi|145347679|ref|XP_001418290.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578519|gb|ABO96583.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 515

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 94/252 (37%), Positives = 135/252 (53%), Gaps = 24/252 (9%)

Query: 4   EKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD------- 56
           ++  + + R  + +Y + +T + +    RD    +  ++D+T Y DG R+T +       
Sbjct: 81  DRRAVCVKRGGRAHYGSWMTMEAVTRATRDGEARYGVDLDVTSYVDGVRRTHNRNSDAGR 140

Query: 57  ------------VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVG 104
                       V+  A+    +   R   S+R L+PQT      ++ A+L+  F C  G
Sbjct: 141 EADDEDDDVGELVDADAVLTRRFGEERR--SVRLLHPQTRCDATWKILATLERYFECACG 198

Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT 164
            N Y TP ++QGFAPHYDDI+AF+LQ+EG K+W+VY P   DE  PR SS NF+QEEI T
Sbjct: 199 CNVYVTPASSQGFAPHYDDIDAFVLQIEGAKRWRVYEP-FEDETHPRTSSRNFTQEEIAT 257

Query: 165 P--ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
              +    LE GD LYLPRG+IHQA   ++ HS+H T+S  Q  A  D LE A+  AL +
Sbjct: 258 QRVVFDDVLEAGDFLYLPRGWIHQAECSSSTHSVHATLSTNQSNAPADALEIALNNALAS 317

Query: 223 AGATDLEFRRGL 234
                 E RR  
Sbjct: 318 TIDGRAELRRSF 329


>gi|357621298|gb|EHJ73173.1| hypothetical protein KGM_18276 [Danaus plexippus]
          Length = 341

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 91/188 (48%), Positives = 121/188 (64%), Gaps = 6/188 (3%)

Query: 153 SSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           ++P+ S  EIG P+L V LE GD+LY PRGYIHQ  T+  EHSLHVTIS+YQK AW DLL
Sbjct: 73  ANPHLS--EIGEPVLEVVLEAGDMLYFPRGYIHQGVTIDGEHSLHVTISMYQKHAWADLL 130

Query: 213 EKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLA-MKENLKDLLAKLV 271
           EK +P ALQ A   ++EFR GLP+    + GL      D    R A M+E +K L  K+ 
Sbjct: 131 EKMIPAALQIAINENIEFREGLPLDIYDHFGLVHS---DTNTPRKAEMEEIVKRLFNKIK 187

Query: 272 EYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVR 331
           +Y+ +D+ VD+M K+   DALPPVLS  E   +VF +   M + G+V N  +I  DT++R
Sbjct: 188 DYLPIDEAVDQMNKKFQQDALPPVLSDFEKAVTVFGDSDVMIENGKVTNRVEIGLDTRIR 247

Query: 332 LVRANAVR 339
           L+R N +R
Sbjct: 248 LLRKNILR 255



 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/63 (52%), Positives = 49/63 (77%)

Query: 3  WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
          WEK+PL+I+RK+ +YY ++I++  ID++LR   I+F KNIDIT Y DG+R+T + EGRA 
Sbjct: 15 WEKKPLHIARKKPDYYKDVISTPVIDNMLRTENIQFTKNIDITSYVDGKRETHNPEGRAN 74

Query: 63 PQI 65
          P +
Sbjct: 75 PHL 77


>gi|412990663|emb|CCO18035.1| unnamed protein product [Bathycoccus prasinos]
          Length = 594

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 105/269 (39%), Positives = 157/269 (58%), Gaps = 31/269 (11%)

Query: 12  RKQ----KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQT-------MDVEGR 60
           RKQ    K+ Y +L+TS+ I+  L++  + +  N+D+T Y+DG R+T        +V   
Sbjct: 158 RKQSVTSKSGYEDLLTSEDIERWLKEKKLTYGTNVDVTSYKDGVRRTHNANDGFSNVSKN 217

Query: 61  ALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
           A  + V   + E GCS+R L+PQ +   L  L ++L+  +    G N Y TP  +QGFAP
Sbjct: 218 ADYETVAKRFEEDGCSVRVLHPQRWSDKLWALLSNLENFWQSPTGCNAYWTPANSQGFAP 277

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ------EEIGTPILTVTLEP 173
           H+DDI+AF+LQ+EG+K+W++Y  R V E  PR+SS  FS+      EE+    + V LEP
Sbjct: 278 HWDDIDAFVLQIEGRKRWRLYPTRDVSEKFPRFSSEEFSEEERKRLEEM--KYVEVILEP 335

Query: 174 GDLLYLPRGYIHQASTVT---------NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
           GD++Y+PRG IH+A  +T         +E SLH+TISV Q+  + D+LE     AL  A 
Sbjct: 336 GDVMYMPRGTIHEAFCITSEDENDGKKDESSLHLTISVNQRNTFADILEHVFSDALADAV 395

Query: 225 ATDLEFRRGLPIGYLRYAGLARGKPVDIQ 253
            T++E RR  P  +LR+  L+R    D+ 
Sbjct: 396 NTNVELRRCPPRDFLRH--LSRLDETDVN 422


>gi|407852401|gb|EKG05908.1| hypothetical protein TCSYLVIO_003010 [Trypanosoma cruzi]
          Length = 476

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 86/252 (34%), Positives = 141/252 (55%), Gaps = 10/252 (3%)

Query: 2   HWEKEPLYISRKQKNYYNNL--------ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ 53
           ++EK PL+ S     ++  +         +++ +  +  +  + +  +I+I  ++  Q++
Sbjct: 44  YFEKRPLHFSHGAATHFTEVQDGLPAVKWSTELMLQLTAEKSLSYTTDINIVRFDAVQKK 103

Query: 54  TMDV--EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
            +    EG    + +    R+G S+RFL P  YI     + A L+E+F C  G N+Y TP
Sbjct: 104 RVPFRSEGIVTEKEMKHSMRKGWSVRFLRPHEYIVENSAVLAMLEEVFACSCGLNSYWTP 163

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
             +QGFAPHYDD++ F+LQLEG+K+W++Y P    + L R+SS +++ EE+  P     L
Sbjct: 164 ANSQGFAPHYDDVDVFLLQLEGEKEWRLYDPPERVDVLSRHSSEDYNPEELPKPTQIFRL 223

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFR 231
            PGD+LY+PRG +HQ  T  + HSLHVT S  Q   W DL++ A+   ++   A  + +R
Sbjct: 224 VPGDVLYMPRGTVHQGRTYNHAHSLHVTFSANQMNTWADLMKHAVTHVVEKLAANYIHWR 283

Query: 232 RGLPIGYLRYAG 243
           R LP   L+  G
Sbjct: 284 RSLPRSLLKRLG 295


>gi|407417792|gb|EKF38086.1| hypothetical protein MOQ_001707 [Trypanosoma cruzi marinkellei]
          Length = 476

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 86/252 (34%), Positives = 143/252 (56%), Gaps = 10/252 (3%)

Query: 2   HWEKEPLYISRKQKNYYNNL--------ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ 53
           ++EK+PL+ S     ++  +         +++ +  +  +  + +  +I+I  ++  Q++
Sbjct: 44  YFEKKPLHFSHGSATHFTRVQDGIPAVKWSTELMLQLAAEKSLSYTTDINIVRFDAVQKK 103

Query: 54  --TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
             +  +EG    + +    R+G S+RFL P  YI     + A L+E+F C  G N+Y TP
Sbjct: 104 RISFRLEGIVTEKDMKHSMRKGWSVRFLRPHEYIVENSAVLAILEEIFVCSCGLNSYWTP 163

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
             +QGFAPHYDD++ F+LQLEG+K+W++Y P    + L R+SS +++ EE+  P  T  L
Sbjct: 164 ANSQGFAPHYDDVDVFLLQLEGEKEWRLYDPPEKVDVLSRHSSEDYNPEELPKPKQTFHL 223

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFR 231
            PGD+LY+PRG +HQ  T  + HSLHVT S  Q   W DL++ A+   ++   A  + +R
Sbjct: 224 VPGDVLYMPRGTVHQGQTYNHAHSLHVTFSANQMNTWADLMKHAVTHVVEKLAANYIHWR 283

Query: 232 RGLPIGYLRYAG 243
             LP   L+  G
Sbjct: 284 HSLPRALLKRLG 295


>gi|327268744|ref|XP_003219156.1| PREDICTED: MYC-induced nuclear antigen-like [Anolis carolinensis]
          Length = 470

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 106/309 (34%), Positives = 162/309 (52%), Gaps = 31/309 (10%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WEK+PL + R      +YY++L     ++ I++  +  + K+I++    +G+++  + 
Sbjct: 56  EYWEKKPLVLHRDDPTVASYYSSLFQLTDLEGIVKHGLY-YGKDINVCKSINGKKKVFNK 114

Query: 58  EGR-ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+ +  Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TPP +QG
Sbjct: 115 DGKVSYAQLKKDFDQKKATIQFHQPQRFKNELWKIQEKLECFFGSLVGSNIYITPPGSQG 174

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGD 175
             PHYDDIE FILQLEG+K W++Y P      +P     N   E+ IG P     L+PGD
Sbjct: 175 LPPHYDDIEVFILQLEGEKHWRLYKPT-----IPLAQQYNAEPEDKIGAPTHNFILKPGD 229

Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           LLY PRG IHQA+T     HS HVTIS YQ  +W D     +P  +      D+EFR+ +
Sbjct: 230 LLYFPRGTIHQAATPPGIAHSTHVTISTYQNHSWRDFFLDVVPGLILDMAKEDIEFRKSI 289

Query: 235 PIGYLRYAGLARGKP-----VDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
           P   L    L+         +   ADRL  +EN K+L +             +M K  + 
Sbjct: 290 PRQLLMKVDLSDSSRQLSGFLRCLADRL--EENGKELKS------------SDMKKDFIM 335

Query: 290 DALPPVLSP 298
           + LPP L P
Sbjct: 336 NRLPPFLGP 344


>gi|156362163|ref|XP_001625650.1| predicted protein [Nematostella vectensis]
 gi|156212493|gb|EDO33550.1| predicted protein [Nematostella vectensis]
          Length = 269

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 130/209 (62%), Gaps = 7/209 (3%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV-EG 59
           ++WEK+PL I+R+   +Y  L +   ++ +L+   I + ++I++  Y DG+++ ++  EG
Sbjct: 63  NYWEKKPLVINREDSEFYGALFSKAFLEVLLKKKEINYVEDINVCRYIDGEKEFLNEDEG 122

Query: 60  R--ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
                 +I+     +  +I+F  PQ +   L QLN +L+  FGC VGAN Y TPP AQG 
Sbjct: 123 TKATASKIMKKVKDDNATIQFHQPQRFQDTLWQLNGNLERFFGCLVGANVYITPPNAQGL 182

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
           APH+DD+E FILQLEG+K WK+Y P +  E    YS+ +  ++ IG P    TL+ GDLL
Sbjct: 183 APHHDDVEVFILQLEGEKNWKLYSPLV--ELALDYSA-DLEEDSIGKPTHEFTLKTGDLL 239

Query: 178 YLPRGYIHQASTV-TNEHSLHVTISVYQK 205
           Y PRG IHQA T+    HS H+T+S YQ+
Sbjct: 240 YFPRGTIHQAETLKCGNHSTHITLSTYQQ 268


>gi|302850897|ref|XP_002956974.1| hypothetical protein VOLCADRAFT_107490 [Volvox carteri f.
           nagariensis]
 gi|300257692|gb|EFJ41937.1| hypothetical protein VOLCADRAFT_107490 [Volvox carteri f.
           nagariensis]
          Length = 756

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 99/285 (34%), Positives = 150/285 (52%), Gaps = 43/285 (15%)

Query: 1   DHWEKEPLYI--SRKQKNYYNNLITSK---KIDDIL--------------RDNVIEFKKN 41
           ++WEK+PL+I    ++KN+++ L       K+ D L              + + + F+ +
Sbjct: 57  EYWEKKPLHIPADEERKNFFDGLFDFDELCKVSDALEAGVMLLMDQDPGPKGDALSFEDD 116

Query: 42  IDITL-----------------YEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTY 84
           +D  +                 Y DG+R+T+  E      +   H  EG +++   PQ +
Sbjct: 117 VDDQMVRAIGPLVFGRDVVAVRYVDGERETLAGEFADSDTLRTLHETEGATLQLHQPQRF 176

Query: 85  IKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM 144
           +  L +L A+L+   GC VG N Y TP   QG APH+DD+E F+ Q +G KKW++Y P +
Sbjct: 177 VNALWRLVAALEAQSGCLVGCNAYVTPGGGQGLAPHHDDVELFVCQTKGSKKWRLYKP-V 235

Query: 145 VDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQ 204
            D  LP   S +  Q+E+G PIL VT+ PGD+LY+PRG  HQA     E S H+TIS YQ
Sbjct: 236 NDFELPSKVSGDLPQDELGEPILEVTMNPGDVLYVPRGIPHQA-VAQEEGSCHLTISTYQ 294

Query: 205 KTAWIDLLEKAMPKALQAAGAT-----DLEFRRGLPIGYLRYAGL 244
           +  W DL  K +  A  A G++      LE R+GLP+G +   G+
Sbjct: 295 RWTWGDLAVKLLETARDAQGSSPASALPLELRKGLPLGLIFKHGI 339


>gi|323448506|gb|EGB04404.1| hypothetical protein AURANDRAFT_67230 [Aureococcus anophagefferens]
          Length = 450

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 127/214 (59%), Gaps = 10/214 (4%)

Query: 40  KNIDITLYEDGQRQTMDVEGRALPQIVWDHYR---EGCSIRFLNPQTYIKPLQQLNASLQ 96
           ++ D+T  E G R T+  +G  +    W       EG ++R   P    + +  L   L+
Sbjct: 19  RDADVTRVERGARTTLTRDGEPV-DAAWAKRAFETEGATVRLRCPADSNERVHALCRCLE 77

Query: 97  ELFGCFVGANTYQTPP-TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP 155
           + FG  VGAN+Y TP   AQGFA H+DD++ F+LQ EG K W+++ P   D  LPR SS 
Sbjct: 78  DEFGSTVGANSYLTPGGAAQGFAAHWDDVDVFVLQTEGSKTWQIFAPDADDLKLPRVSSQ 137

Query: 156 NFSQEEI----GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDL 211
           +F  E +    GT    VTL+PGD+LYLPRG++H+A+T     SLHVT+S ++  AWIDL
Sbjct: 138 DFDDEALDALYGTLRAEVTLDPGDVLYLPRGFVHRAATGAAP-SLHVTLSAHRANAWIDL 196

Query: 212 LEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
           LE+ +PKAL++A   D   R+ LP  Y  + G+A
Sbjct: 197 LERLLPKALESAQRGDCGLRKALPRDYAHFLGVA 230


>gi|157872976|ref|XP_001685008.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128079|emb|CAJ08208.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 624

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/284 (32%), Positives = 147/284 (51%), Gaps = 31/284 (10%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           ++G S+RFL P  +I+        +++ F C+ GAN+Y TP  +QGFAPHYDD++ F LQ
Sbjct: 262 KDGWSVRFLRPNEFIESNSAFIGCIEKEFNCYCGANSYWTPANSQGFAPHYDDVDVFFLQ 321

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
           LEG+K W +Y P    + L R+SS +++ E   TP  T+TL+ GD+LY+PRG +HQ  T 
Sbjct: 322 LEGEKLWCLYDPPEDVDVLARHSSEDYAPERFPTPKHTITLKAGDVLYMPRGTVHQGKTT 381

Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGL------ 244
              HSLH+T S  Q  +W D + +A    ++   A  LE+RR LP    +  G       
Sbjct: 382 LKTHSLHITFSANQMNSWADFMSRAAQYTVETLAANKLEWRRALPRDMPQVMGEVNNPVF 441

Query: 245 --ARGKP---VDIQADRLAMKENLKDLLAK----LVEYVDLDDGVDEMGKQLMHDALPPV 295
               G P    + Q  R  ++  +++++A+    L + V++D   D  GK  +    PP 
Sbjct: 442 RDTHGLPALSANQQDHRANLQTKVREMVAELTLLLTDEVNMDVCADVYGKDTIQKLQPP- 500

Query: 296 LSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
                            + +G    + +++  ++VRL+  N +R
Sbjct: 501 ---------------SRTYSGAAPASRKLSHASRVRLISRNCMR 529


>gi|71415552|ref|XP_809839.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70874281|gb|EAN87988.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 476

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 85/252 (33%), Positives = 140/252 (55%), Gaps = 10/252 (3%)

Query: 2   HWEKEPLYISRKQKNYYNNL--------ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ 53
           ++EK+PL+ S     ++  +         +++ +  +  +  + +  +I+I  ++  Q++
Sbjct: 44  YFEKKPLHFSHGAATHFTEVQDGLPAVKWSTELMLQLAAEKSLSYTTDINIVRFDAVQKK 103

Query: 54  TMDV--EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
            +    EG    + +    R+G S+RFL P  YI     + A L+E F C  G N+Y TP
Sbjct: 104 RVPFRSEGIVTEKEMKHSMRKGWSVRFLRPHEYIVENSAVLAMLEEAFACSCGLNSYWTP 163

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
             +QGFAPHYDD++ F+LQLEG+K+W++Y P    + L R+SS +++ EE+  P     L
Sbjct: 164 ANSQGFAPHYDDVDVFLLQLEGEKEWRLYDPPERVDVLSRHSSEDYNPEELPKPTQIFRL 223

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFR 231
            PGD+LY+PRG +HQ     + HSLHVT S  Q   W DL++ A+   ++   A  + +R
Sbjct: 224 FPGDVLYMPRGTVHQGRKYNHAHSLHVTFSANQMNTWADLMKHAVTHVVEKLAANYIHWR 283

Query: 232 RGLPIGYLRYAG 243
           R LP   L+  G
Sbjct: 284 RSLPRSLLKRLG 295


>gi|389602291|ref|XP_001567036.2| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322505387|emb|CAM42455.2| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 595

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 180/359 (50%), Gaps = 44/359 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLI---------TSKKIDDILRDNVIEFKKNIDITLYEDG-- 50
           ++EK+PL  S     Y+   +         ++K++ + ++ +   +  ++++  ++    
Sbjct: 153 YFEKKPLVASHGNCEYFARGLPGVVPPVDWSTKRMLEHIKMHPSRYGTDLNVVKFDSKLK 212

Query: 51  QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
           +R +   EG      +    + G S+RFL P  +++        +++ F C+ G N+Y T
Sbjct: 213 KRVSYRTEGLVDAAELERCMKTGWSVRFLRPHEFLESNSAFIGCMEQEFNCYCGVNSYWT 272

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P  +QGFAPHYDD++ F+LQLEG+K W++Y P    + L R+SS +++ E++ TP  T+T
Sbjct: 273 PANSQGFAPHYDDVDVFLLQLEGEKLWRLYDPPEEVDVLTRHSSEDYAPEQLPTPNHTIT 332

Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEF 230
           L+ GD+LY+PRG +HQ  T    HSLH+T S  Q  +W D + ++    ++   A  LE+
Sbjct: 333 LKAGDVLYMPRGTVHQGKTNLKTHSLHITFSANQMNSWADFMIRSAQYTIETLAANKLEW 392

Query: 231 RRGLPIGYLRYAGLARGKPV------------DIQADRLAMKENLKDLLAK----LVEYV 274
           R  LP    +  G A   PV            + Q  R ++++ L+++ A+    L +  
Sbjct: 393 RCALPRDVPQVMG-AINHPVFRNTHGLAVLSENQQERRASLQKKLREMAAEVTLLLTDET 451

Query: 275 DLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLV 333
           ++D   D  GK+ +    PP                 M+ +G   ++T +   + VRL+
Sbjct: 452 NMDVCTDVYGKETIRKMQPP----------------SMAYSGAASSSTGLDHASHVRLI 494


>gi|401426122|ref|XP_003877545.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322493791|emb|CBZ29080.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 624

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 92/282 (32%), Positives = 138/282 (48%), Gaps = 31/282 (10%)

Query: 73  GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
           G S+RFL P  +I+       S+++ F C+ G N+Y TP  +QGFAPHYDD++ F+LQLE
Sbjct: 264 GWSVRFLRPNEFIESNSAFIGSMEKEFNCYCGVNSYWTPANSQGFAPHYDDVDVFLLQLE 323

Query: 133 GKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTN 192
           G+K W +Y P    + L R+SS +++ E   TP   +TL+ GD+LY+PRG +HQ  T   
Sbjct: 324 GEKLWCLYDPPENVDVLARHSSEDYAPELFPTPKHAITLKAGDVLYMPRGTVHQGKTTRK 383

Query: 193 EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDI 252
            HSLH+T S  Q  +W D + +A    ++   A  LE+RR LP    +  G         
Sbjct: 384 MHSLHITFSANQMNSWADFMSRAAQYTVETLAANKLEWRRALPRDMPQVMGEMNNPVFRD 443

Query: 253 QADRLAMKENLKDLLAK---------------LVEYVDLDDGVDEMGKQLMHDALPPVLS 297
                A+ EN +D  AK               L + V++D   D  GK  +    PP   
Sbjct: 444 THGLAALSENQQDHRAKLQTKVREMVAELTLLLTDEVNMDVCTDVYGKDTIRKLQPP--- 500

Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
                          + +G    +T +   ++VRL+  N +R
Sbjct: 501 -------------SRTYSGTAPASTTLNHASRVRLISRNCMR 529


>gi|126336950|ref|XP_001380518.1| PREDICTED: MYC-induced nuclear antigen [Monodelphis domestica]
          Length = 474

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 103/306 (33%), Positives = 154/306 (50%), Gaps = 21/306 (6%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WE++PL I R   +   Y+ NL     + ++ R  ++ + +++++    +G+++ ++ 
Sbjct: 57  EYWEQKPLLIQRDDPSLAAYFQNLFQLADLKNLCRQGMV-YGRDMNVCRCVNGRKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
            G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 GGKVHYLQLRKDFDQKRATIQFHQPQRFKDELWKIQEKLECFFGSLVGSNVYITPSGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGD 175
             PHYDDIE FILQLEGKK W++Y P      +P     N   E+ IG P    TL+PGD
Sbjct: 176 LPPHYDDIEVFILQLEGKKHWRLYQPT-----VPLAREYNVEPEDRIGAPTHEFTLKPGD 230

Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
            LY PRG IHQA T +   HS HVTIS YQ  +W D L   +P  + A      +   G+
Sbjct: 231 FLYFPRGTIHQADTPSGLAHSTHVTISTYQNNSWGDFLLDVIPGFVFAVAKEKEKLGTGI 290

Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKEN-LKDLLAKLVEYVDLDDG--VDEMGKQLMHDA 291
           P       G+ R   V  QA   A+    L   L  L E ++        EM K  +   
Sbjct: 291 P------GGMPRAALVSQQAHDTALATTRLSGFLRTLAEQLEGSGAPRPSEMKKDFVRSR 344

Query: 292 LPPVLS 297
           LPP  S
Sbjct: 345 LPPFYS 350


>gi|167527045|ref|XP_001747855.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773604|gb|EDQ87242.1| predicted protein [Monosiga brevicollis MX1]
          Length = 595

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 121/382 (31%), Positives = 183/382 (47%), Gaps = 62/382 (16%)

Query: 2   HWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE 58
           H+E+  LYI R+ ++    Y  L +  ++  +L+   + +  ++++  Y DG+R+ +   
Sbjct: 68  HFERSFLYIPREDRDPGIVYQGLFSLDQLYTLLQRESMFYGTDLNLCRY-DGERKLVLNG 126

Query: 59  GRA----LPQIVWDHYRE---------------------------------GCSIRFLNP 81
           GR     LP I  +H                                      +++F  P
Sbjct: 127 GRNDTTDLPTINGNHSNSQRAEEQDSNDSDDSDELAEEALAADVRRRVEDLKATVQFHQP 186

Query: 82  QTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYL 141
           Q +++ L  L  S ++     VGAN Y TP  +QG APH+DD+E +ILQLEG+K W++Y 
Sbjct: 187 QRFVRALHDLLYSFEQELTTLVGANVYITPANSQGLAPHHDDVEVYILQLEGEKAWRLYE 246

Query: 142 PRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           P    E L    S +  +EE+  PI  + L PGD LYLPRG IH+AS V N+HS H+TIS
Sbjct: 247 PI---EPLAMSYSADLDREELAQPIAELVLRPGDFLYLPRGTIHEASCVGNQHSTHITIS 303

Query: 202 VYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAG-LARGKPVDIQADRLAMK 260
            +Q   +  L+ + +P+ +  A +  LE RRGLP G+LR  G L  G P    A    M+
Sbjct: 304 SHQNWNYGHLMAQTLPECITNAMSNVLELRRGLPHGFLRQHGVLHAGFPPPPSAVFHHMR 363

Query: 261 ENLK-DLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQT--GE 317
           E L+ D +   +E       VD          LPP         ++  N L  + T    
Sbjct: 364 ELLQSDQVWSEIER-HFHQAVDSFALDYTTARLPP--------PALRRNPLNPAPTRVTP 414

Query: 318 VYNATQITKDTKVRLVRANAVR 339
           V++AT     TKVRLV  + +R
Sbjct: 415 VFHAT-----TKVRLVYTDHIR 431


>gi|398019903|ref|XP_003863115.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322501347|emb|CBZ36425.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 507

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 143/281 (50%), Gaps = 25/281 (8%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           + G S+RFL P  +I+        +++ F C+ G N+Y TP  +QGFAPHYDD++ F+LQ
Sbjct: 145 KGGWSVRFLRPNEFIESNSAFIGCMEKEFNCYCGVNSYWTPANSQGFAPHYDDVDVFLLQ 204

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
           LEG+K W +Y P    + L R+SS +++ E   TP  T+TL+ GD+LY+PRG +HQ  T 
Sbjct: 205 LEGEKLWCLYDPPEDVDVLARHSSEDYAPELFPTPKHTITLKAGDVLYMPRGTVHQGKTT 264

Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
              HSLH+T S  Q  +W D + +A    ++   A  LE+RR LP    +  G     PV
Sbjct: 265 LKTHSLHITFSANQMNSWADFMSRAAQYTVEKLAANKLEWRRALPRSMPQVVGEV-NNPV 323

Query: 251 ------------DIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
                         Q  R  ++  +++++A+L   +  +  +D        D +  +  P
Sbjct: 324 FRDTHGLAALSEKQQEHRANLQTKVREMVAELTLLLTDEGNMDMCADVYAKDTIRKLQPP 383

Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
                         + +G    +T+++  ++VRL+  N +R
Sbjct: 384 SR------------TYSGAAPASTRLSHASRVRLISRNCMR 412


>gi|156378500|ref|XP_001631180.1| predicted protein [Nematostella vectensis]
 gi|156218216|gb|EDO39117.1| predicted protein [Nematostella vectensis]
          Length = 109

 Score =  159 bits (402), Expect = 2e-36,   Method: Composition-based stats.
 Identities = 66/108 (61%), Positives = 86/108 (79%)

Query: 36  IEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL 95
           ++F +N+D+T Y+DG+R+T + +GRA   +VWD Y  GCS+R LNPQ++  P+ +L + L
Sbjct: 2   VQFSRNLDVTSYKDGKRETHNPQGRAHAPVVWDFYNNGCSVRLLNPQSFSHPVWKLTSLL 61

Query: 96  QELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR 143
           QE FGCFVGANTY TPP  QGFAPHYDDIEAFI+QLEGKK W++Y PR
Sbjct: 62  QEYFGCFVGANTYLTPPGTQGFAPHYDDIEAFIIQLEGKKHWRLYNPR 109


>gi|387017158|gb|AFJ50697.1| MYC-induced nuclear antigen-like [Crotalus adamanteus]
          Length = 468

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 89/242 (36%), Positives = 137/242 (56%), Gaps = 14/242 (5%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WE++PL + R   +   YY +L     + D+++  +  + K+I+I    +G++   + 
Sbjct: 56  EYWEQKPLLLQRDDSSVASYYQSLFQLTDLKDLVKLGLY-YGKDINICRCMNGKKLIFNK 114

Query: 58  EGRALPQIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+     +  H+ ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TPP +QG
Sbjct: 115 DGKVTYAQLKKHFDQKKATIQFHQPQRFKDLLWRIQEKLECYFGSLVGSNVYITPPGSQG 174

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PHYDD+E FILQLEG+K W++Y P   +  EY    + P   +E IGTP     L+PG
Sbjct: 175 LPPHYDDVEVFILQLEGEKHWRLYRPTVPLAQEY---NAEP---EERIGTPTHDFILKPG 228

Query: 175 DLLYLPRGYIHQAST-VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           D+LY PRG IHQA T +   HS H+TIS YQ  +W D L   +P  L      D EFR+ 
Sbjct: 229 DMLYFPRGTIHQADTPLGISHSTHITISTYQNNSWRDYLLDMVPGLLLDIAKEDAEFRKN 288

Query: 234 LP 235
           +P
Sbjct: 289 IP 290


>gi|146094469|ref|XP_001467285.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134071650|emb|CAM70340.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 507

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 143/281 (50%), Gaps = 25/281 (8%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           + G S+RFL P  +I+        +++ F C+ G N+Y TP  +QGFAPHYDD++ F+LQ
Sbjct: 145 KGGWSVRFLRPNEFIESNSAFIGCMEKEFNCYCGVNSYWTPANSQGFAPHYDDVDVFLLQ 204

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
           LEG+K W +Y P    + L R+SS +++ E   TP  T+TL+ GD+LY+PRG +HQ  T 
Sbjct: 205 LEGEKLWCLYDPPEDVDVLARHSSEDYAPELFPTPKHTITLKAGDVLYMPRGTVHQGKTT 264

Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
              HSLH+T S  Q  +W D + +A    ++   A  +E+RR LP    +  G     PV
Sbjct: 265 LKTHSLHITFSANQMNSWADFMSRAAQYTVEKLAANKMEWRRALPRSMPQVVGEV-NNPV 323

Query: 251 ------------DIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
                         Q  R  ++  +++++A+L   +  +  +D        D +  +  P
Sbjct: 324 FRDTHGLAALSEKQQEHRANLQTKVREMVAELTLLLTDEGNMDMCADVYAKDTIRKLQPP 383

Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
                         + +G    +T+++  ++VRL+  N +R
Sbjct: 384 SR------------TYSGAAPASTRLSHASRVRLISRNCMR 412


>gi|442761549|gb|JAA72933.1| Putative nucleolar protein, partial [Ixodes ricinus]
          Length = 439

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 100/286 (34%), Positives = 144/286 (50%), Gaps = 26/286 (9%)

Query: 17  YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYRE--GC 74
           ++  L +      I ++  + F K++    YEDG+R T   EG +      + Y E    
Sbjct: 42  FWPQLFSKDAFFSIAKETKLYFGKDVSACKYEDGKR-TDYAEGYSAKSAKLNKYFEERKA 100

Query: 75  SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGK 134
           +++   PQ +   L ++   ++  FGC VG N Y TP  +QG APH+DD+E FI+QLEG+
Sbjct: 101 TLQVHQPQRWKDSLWEVLELMECFFGCLVGCNAYITPAGSQGLAPHHDDVEVFIVQLEGE 160

Query: 135 KKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST--VTN 192
           K WK++ P  V E L R  S +F+ EEIG P    TL PGD LY+PRG IH A      +
Sbjct: 161 KCWKLHRP--VTE-LARIYSKDFTSEEIGEPTHEFTLRPGDFLYMPRGTIHHAYVPESAD 217

Query: 193 EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDI 252
            HS H+TIS YQK    D L   +P  + +A  + +E R+GLP  +L    L        
Sbjct: 218 SHSTHITISTYQKQTVGDCLMDIVPDLISSAMDSCIELRKGLPNKFLPSCALP------- 270

Query: 253 QADRLAMKENLKDLLAKLVEYV----DLDDGVDEMGKQLMHDALPP 294
                  KE +   L  ++E+V    D     +EM +  M   LPP
Sbjct: 271 -------KETVVTALTSVLEHVKQMSDEMPSPEEMVRDFMFSRLPP 309


>gi|449485668|ref|XP_004176314.1| PREDICTED: LOW QUALITY PROTEIN: MYC-induced nuclear antigen
           [Taeniopygia guttata]
          Length = 471

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 99/304 (32%), Positives = 155/304 (50%), Gaps = 26/304 (8%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           D+WE++PL + R       YY +L     + ++    +  + +++++    +G+++ ++ 
Sbjct: 57  DYWEQKPLLVQRNDVLVAAYYQSLFQLSHLKELCSQGLY-YGRDVNVCRCVNGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           EGR    Q+  D  ++  +I+F  PQ + + L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 EGRVNYMQLKKDFDQKKATIQFHQPQRFKEELWKIQEKLECYFGSLVGSNVYITPQGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGD 175
             PHYDD+E FILQLEG+K W++Y P +     P     N   EE IG+P     L+PGD
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYKPTV-----PLAREYNVEPEERIGSPTHEFILKPGD 230

Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           LLY PRG IHQA T     +S HVTIS YQ  +W D L  A+P  +  A   D+  R  +
Sbjct: 231 LLYFPRGTIHQADTPHGISYSTHVTISTYQNNSWGDFLLDAIPGLVFDAAKEDVALRTSI 290

Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGV--DEMGKQLMHDAL 292
           P   L            +Q +     + L  LL +L + ++    +   +M K  +   L
Sbjct: 291 PRQLL------------MQVNTADCTKKLSSLLRRLADSLENSRELRSSDMKKDFIMHRL 338

Query: 293 PPVL 296
           PP L
Sbjct: 339 PPCL 342


>gi|326913124|ref|XP_003202891.1| PREDICTED: MYC-induced nuclear antigen-like [Meleagris gallopavo]
          Length = 473

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 101/306 (33%), Positives = 158/306 (51%), Gaps = 30/306 (9%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WE++PL + R       YY +L     + ++    +  + ++I+I    +G+++ ++ 
Sbjct: 57  EYWEEKPLLVQRNDPLLATYYQSLFQLSDLKELCSQGLY-YGRDINICRCVNGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           EGR    Q+  D  ++  +I+F  PQ + + L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 EGRVNYAQLKKDFDQKKATIQFHQPQRFKEELWKIQEKLECYFGSLVGSNVYITPQGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +   +L R  +   S++ IG P     L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYKPTV---HLAREYNVE-SEDRIGNPTHEFVLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     +S HVTIS YQ  +W D L  A+P  +      D+  R  +P
Sbjct: 232 LYFPRGTIHQADTPPGVSYSTHVTISTYQNNSWGDFLLDAIPGLVFNTAKEDVALRTSIP 291

Query: 236 IGYLRYAGLARGKP-----VDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHD 290
              L    +A         + + ADRL   EN ++L +             +M K  + +
Sbjct: 292 RKLLMQVDIADSTKKLSSILRMLADRL---ENSRELRSS------------DMRKDFIMN 336

Query: 291 ALPPVL 296
            LPP L
Sbjct: 337 RLPPCL 342


>gi|26327519|dbj|BAC27503.1| unnamed protein product [Mus musculus]
          Length = 465

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 151/303 (49%), Gaps = 23/303 (7%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L +   +  + +  V  + +++++    +G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSINGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GRA   Q+  D  ++  +I+F  PQ Y   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PHYDD+E FILQLEG K W++Y P   +  EY         S++ IGTP     L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTPTHDFLLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T +   +S+H+TIS YQ  +W D L  ++   +      D+  R G
Sbjct: 230 DLLYFPRGTIHQAETPSGLAYSIHLTISTYQNNSWGDCLLDSISGFVFDIAKEDVALRSG 289

Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
           +P    R   L    P D+          L D L    E +  D     M K  +   LP
Sbjct: 290 MP----RRMLLNVETPADVTRKLSGFLRTLADQLEGREELLSSD-----MKKDFVKHRLP 340

Query: 294 PVL 296
           P  
Sbjct: 341 PFF 343


>gi|281353888|gb|EFB29472.1| hypothetical protein PANDA_008768 [Ailuropoda melanoleuca]
          Length = 464

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 102/304 (33%), Positives = 152/304 (50%), Gaps = 30/304 (9%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WEK+PL I R       YY +L     +  +    +  + ++I++    +G+++ ++ 
Sbjct: 58  EFWEKKPLLIQRNDPAVATYYQSLFRLSDLKSLCSRGMY-YGRDINVCRCVNGKKKVLNK 116

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 117 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 176

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E F+LQLEG+K W++Y P +    L R  S   S++ IG P    TL+PGD 
Sbjct: 177 LPPHYDDVEVFVLQLEGEKHWRLYHPTVP---LAREYSVE-SEDRIGRPAHEFTLKPGDF 232

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      DLEFR G+P
Sbjct: 233 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDLEFRAGIP 292

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE-----MGKQLMHD 290
              L            +Q D  A+   L   L  L + +   +G  E     M K  + +
Sbjct: 293 RQLL------------LQVDATAVATRLSAFLRTLADRL---EGTKELLSADMKKDFVMN 337

Query: 291 ALPP 294
            LPP
Sbjct: 338 RLPP 341


>gi|395528782|ref|XP_003766504.1| PREDICTED: MYC-induced nuclear antigen [Sarcophilus harrisii]
          Length = 453

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 103/300 (34%), Positives = 151/300 (50%), Gaps = 22/300 (7%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WE+ PL I R   +   Y+ +L     +  + R  +  + ++I++    +G+++ ++ 
Sbjct: 44  EYWEQRPLLIQRDDPSLAAYFQSLFQLTDLKHLCRQGMY-YGRDINVCRCVNGRKKVLNK 102

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
             +    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 103 SSKVHYLQLRKDFDQKRATIQFHQPQRFKDELWKIQEKLECFFGSLVGSNVYITPSESQG 162

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGD 175
             PHYDDIE FILQLEGKK W++Y P      +P     N   E+ IG P    TL+PGD
Sbjct: 163 LPPHYDDIEVFILQLEGKKHWRLYQPT-----VPLAREYNVEPEDKIGAPTHEFTLKPGD 217

Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           LLY PRG IHQA T     HS HVTIS YQ  +W D L   +P  L      +++FR G+
Sbjct: 218 LLYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDVIPGLLFDTAKKEIKFRTGI 277

Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
           P   L     A      +  +RL+       LLA  +E    D    EM K  +   LPP
Sbjct: 278 PRQLLMVDDTA------VSTERLS---GFLRLLADQLEGTR-DLRSSEMKKDFIRSRLPP 327


>gi|256355182|ref|NP_080186.3| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           MINA [Mus musculus]
 gi|160013499|sp|Q8CD15.2|MINA_MOUSE RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase MINA; AltName: Full=Histone lysine
           demethylase MINA; AltName: Full=MYC-induced nuclear
           antigen
 gi|12841599|dbj|BAB25275.1| unnamed protein product [Mus musculus]
 gi|12850814|dbj|BAB28860.1| unnamed protein product [Mus musculus]
 gi|34849684|gb|AAH58242.1| Mina protein [Mus musculus]
 gi|54144407|dbj|BAD60965.1| Mina53 [Mus musculus]
          Length = 465

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 150/303 (49%), Gaps = 23/303 (7%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L +   +  + +  V  + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSISGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GRA   Q+  D  ++  +I+F  PQ Y   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PHYDD+E FILQLEG K W++Y P   +  EY         S++ IGTP     L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTPTHDFLLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T +   +S+H+TIS YQ  +W D L  ++   +      D+  R G
Sbjct: 230 DLLYFPRGTIHQAETPSGLAYSIHLTISTYQNNSWGDCLLDSISGFVFDIAKEDVALRSG 289

Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
           +P    R   L    P D+          L D L    E +  D     M K  +   LP
Sbjct: 290 MP----RRMLLNVETPADVTRKLSGFLRTLADQLEGREELLSSD-----MKKDFVKHRLP 340

Query: 294 PVL 296
           P  
Sbjct: 341 PFF 343


>gi|19263523|gb|AAH25109.1| Mina protein [Mus musculus]
 gi|19483936|gb|AAH23462.1| Myc induced nuclear antigen [Mus musculus]
          Length = 465

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 150/303 (49%), Gaps = 23/303 (7%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L +   +  + +  V  + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSISGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GRA   Q+  D  ++  +I+F  PQ Y   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PHYDD+E FILQLEG K W++Y P   +  EY         S++ IGTP     L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTPTHDFLLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T +   +S+H+TIS YQ  +W D L  ++   +      D+  R G
Sbjct: 230 DLLYFPRGTIHQAETPSGLAYSIHLTISTYQNNSWGDCLLDSISGFVFDIAKEDVALRSG 289

Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
           +P    R   L    P D+          L D L    E +  D     M K  +   LP
Sbjct: 290 MP----RRMLLNVETPADVTRKLSGFLRTLADQLEGREELLSSD-----MKKDFVKHRLP 340

Query: 294 PVL 296
           P  
Sbjct: 341 PFF 343


>gi|242000530|ref|XP_002434908.1| MYC-induced nuclear antigen, putative [Ixodes scapularis]
 gi|215498238|gb|EEC07732.1| MYC-induced nuclear antigen, putative [Ixodes scapularis]
          Length = 472

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 102/303 (33%), Positives = 151/303 (49%), Gaps = 30/303 (9%)

Query: 2   HWEKEPLYISRK--QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
           +WE+EP     +   + ++  L +      I ++  + F K++    YEDG+R     EG
Sbjct: 32  YWEREPFVAHDRAGMRAFWPQLFSKDAFFSIAKETKLYFGKDVSACKYEDGKRSDY-AEG 90

Query: 60  RALPQIVWDHYRE--GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
            +      + Y E    +++   PQ +   L ++   ++  FGC VG N Y TP  +QG 
Sbjct: 91  YSAKSAKLNKYFEERKATLQVHQPQRWKDSLWEVLELMERFFGCLVGCNAYITPAGSQGL 150

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
           APH+DD+  FI+QLEG+K WK++ P  V E L R  S +F+ EEIG P    TL PGD L
Sbjct: 151 APHHDDV--FIVQLEGEKCWKLHKP--VTE-LARIYSKDFTSEEIGEPTHEFTLRPGDFL 205

Query: 178 YLPRGYIHQAST--VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           Y+PRG IH A      + HS H+TIS YQK    D L    P  + +A  + +E R+GLP
Sbjct: 206 YMPRGTIHHAYVPESADSHSTHITISTYQKQTVGDCLMDIAPDLISSAMDSCIELRKGLP 265

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV----DLDDGVDEMGKQLMHDA 291
             +L    L+              KE +   L+ ++E+V    D     +EM +  M   
Sbjct: 266 NRFLPSCVLS--------------KETVVTALSSVLEHVKQMPDEMPSPEEMVRDFMFSR 311

Query: 292 LPP 294
           LPP
Sbjct: 312 LPP 314


>gi|308805368|ref|XP_003079996.1| unnamed protein product [Ostreococcus tauri]
 gi|116058453|emb|CAL53642.1| unnamed protein product [Ostreococcus tauri]
          Length = 455

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 89/190 (46%), Positives = 113/190 (59%), Gaps = 16/190 (8%)

Query: 41  NIDITLYEDGQRQTM-------DVEGRALPQIV-----WDHYRE-GCSIRFLNPQTYIKP 87
           + D+T Y DG R+T        DV   A  +IV        YRE G SIR L+PQT    
Sbjct: 63  DADVTSYVDGVRRTHNSNDDTHDVCDEASNEIVDAKAVMRAYRERGRSIRLLHPQTRHDA 122

Query: 88  LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDE 147
             ++ A+L+  F C  G N Y TP  AQGFAPHYDDI+A++LQ+EG+K+W+VY P   DE
Sbjct: 123 TWKMLATLESHFECACGCNVYVTPANAQGFAPHYDDIDAYVLQIEGEKRWRVYAPFQSDE 182

Query: 148 YLPRYSSPNFSQEEI-GTPIL-TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
            LPR SS N++QEEI G  +L    LE GD LY+PRG++HQA   +  HS+H TIS  Q 
Sbjct: 183 -LPRTSSKNYTQEEIAGLEVLFDGVLEAGDFLYIPRGFVHQAECSSRAHSVHATISTNQA 241

Query: 206 TAWIDLLEKA 215
               D  E A
Sbjct: 242 NTHADAFEIA 251


>gi|301607059|ref|XP_002933117.1| PREDICTED: myc-induced nuclear antigen-like [Xenopus (Silurana)
           tropicalis]
          Length = 460

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 99/306 (32%), Positives = 159/306 (51%), Gaps = 22/306 (7%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVE 58
           D+WE +PL +  +   +     T  ++ D+  +    I +++++++   +DG++ ++   
Sbjct: 54  DYWETKPLLLQGRDPAFAAYYQTLFRLSDLKHIAGGSIYYERDVNVFKCKDGKKISLPRH 113

Query: 59  GRALPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
           G+A    +   +  G  +I+F  PQ +   L  +   L+  FG  VG+N Y TP  +QG 
Sbjct: 114 GKATYLHLLKDFGSGKATIQFHQPQRFNDALWHIIEKLECFFGALVGSNIYITPQDSQGL 173

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
             HYDD+E FILQLEG+K+W++Y P  V      YS     +++IG+P     L+PGDLL
Sbjct: 174 PAHYDDVEVFILQLEGEKRWRLYSP--VVPLARDYSV--VPEDQIGSPTHDFVLKPGDLL 229

Query: 178 YLPRGYIHQASTVT-NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           Y PRG IHQA  +  + HS HVTIS YQ  +W D L+  +P  L  A   +++ RRG+P 
Sbjct: 230 YFPRGVIHQAQALPGSSHSTHVTISTYQNNSWSDYLQDLLPGILFDAAKENIDLRRGIP- 288

Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD--EMGKQLMHDALPP 294
              R   L+   P         + + L  LL  +V+ ++    +   EM +  M   LPP
Sbjct: 289 ---RQQMLSMDTP--------GVAQQLSSLLTTVVKGLESHRHIRSFEMLRDFMGSRLPP 337

Query: 295 VLSPEE 300
            L  +E
Sbjct: 338 FLDNKE 343


>gi|301769327|ref|XP_002920080.1| PREDICTED: MYC-induced nuclear antigen-like [Ailuropoda
           melanoleuca]
          Length = 463

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/300 (34%), Positives = 150/300 (50%), Gaps = 23/300 (7%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WEK+PL I R       YY +L     +  +    +  + ++I++    +G+++ ++ 
Sbjct: 58  EFWEKKPLLIQRNDPAVATYYQSLFRLSDLKSLCSRGMY-YGRDINVCRCVNGKKKVLNK 116

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 117 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 176

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E F+LQLEG+K W++Y P +    L R  S   S++ IG P    TL+PGD 
Sbjct: 177 LPPHYDDVEVFVLQLEGEKHWRLYHPTVP---LAREYSVE-SEDRIGRPAHEFTLKPGDF 232

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      DLEFR G+P
Sbjct: 233 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDLEFRAGIP 292

Query: 236 IGYLRYAGLARGKPVDIQADRL-AMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
              L     A        A RL A    L D L    E +  D     M K  + + LPP
Sbjct: 293 RQLLLVDATA-------VATRLSAFLRTLADRLEGTKELLSAD-----MKKDFVMNRLPP 340


>gi|410970260|ref|XP_003991604.1| PREDICTED: MYC-induced nuclear antigen [Felis catus]
          Length = 464

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 106/309 (34%), Positives = 153/309 (49%), Gaps = 40/309 (12%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WEK+PL I R       YY +L     +  +    +  + ++I++    +G+++ ++ 
Sbjct: 58  EFWEKKPLLIQRDDPAVATYYRSLFRLSDLKSLCSRGMY-YGRDINVCRCVNGKKKVLNK 116

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 117 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 176

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W +Y P +    L R  S   +++ IG P    TL+PGD 
Sbjct: 177 LPPHYDDVEVFILQLEGEKHWCLYHPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDF 232

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ  +W D L   M   +      DLEFR G+P
Sbjct: 233 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTMSGLVFDTAKEDLEFRAGIP 292

Query: 236 IGYL----------RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGK 285
              L          R +G  R       ADRL   E  K+LL+             +M K
Sbjct: 293 RQLLLQVEATAVATRLSGFLR-----TLADRL---EGTKELLSA------------DMKK 332

Query: 286 QLMHDALPP 294
             + + LPP
Sbjct: 333 DFVMNRLPP 341


>gi|23346418|ref|NP_116167.3| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           MINA isoform b [Homo sapiens]
 gi|387849375|ref|NP_001248758.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           MINA isoform b [Homo sapiens]
 gi|23200591|dbj|BAC16358.1| Mina53 form-2 [Homo sapiens]
 gi|119600278|gb|EAW79872.1| MYC induced nuclear antigen, isoform CRA_a [Homo sapiens]
 gi|119600279|gb|EAW79873.1| MYC induced nuclear antigen, isoform CRA_a [Homo sapiens]
          Length = 464

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 105/311 (33%), Positives = 157/311 (50%), Gaps = 44/311 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289

Query: 234 LP----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEM 283
           +P          +   R +G  R       ADRL   E  K+LL+             +M
Sbjct: 290 IPRQLLLVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SDM 329

Query: 284 GKQLMHDALPP 294
            K  +   LPP
Sbjct: 330 KKDFIMHRLPP 340


>gi|426341335|ref|XP_004035995.1| PREDICTED: MYC-induced nuclear antigen isoform 1 [Gorilla gorilla
           gorilla]
 gi|426341337|ref|XP_004035996.1| PREDICTED: MYC-induced nuclear antigen isoform 2 [Gorilla gorilla
           gorilla]
          Length = 465

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLIFDTAKEDVELRTG 289

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           +P           +   R +G  R       ADRL   E  K+LL+             +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329

Query: 283 MGKQLMHDALPP 294
           M K  +   LPP
Sbjct: 330 MKKDFIMHRLPP 341


>gi|296863377|pdb|2XDV|A Chain A, Crystal Structure Of The Catalytic Domain Of Flj14393
          Length = 442

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 34  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 90

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 91  NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 150

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 151 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 206

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 207 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 266

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           +P           +   R +G  R       ADRL   E  K+LL+             +
Sbjct: 267 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLSS------------D 306

Query: 283 MGKQLMHDALPP 294
           M K  +   LPP
Sbjct: 307 MKKDFIMHRLPP 318


>gi|50950028|emb|CAH10679.1| hypothetical protein [Homo sapiens]
          Length = 384

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/310 (33%), Positives = 156/310 (50%), Gaps = 45/310 (14%)

Query: 3   WEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDV 57
           WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ ++ 
Sbjct: 13  WEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVLNK 69

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 70  DGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 129

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PGDL
Sbjct: 130 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPGDL 185

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G+P
Sbjct: 186 LYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTGIP 245

Query: 236 -----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG 284
                      +   R +G  R       ADRL   E  K+LL+             +M 
Sbjct: 246 RQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLSS------------DMK 285

Query: 285 KQLMHDALPP 294
           K  +   LPP
Sbjct: 286 KDFIMHRLPP 295


>gi|449275792|gb|EMC84560.1| MYC-induced nuclear antigen [Columba livia]
          Length = 469

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 154/303 (50%), Gaps = 24/303 (7%)

Query: 1   DHWEKEPLYISRKQK---NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WE++PL I R       YY +L     + ++    +  + ++I+I    +G+++ ++ 
Sbjct: 57  EYWEQKPLLIQRNDNLMAAYYQSLFQFSNLKELCGQGLY-YGRDINICRCVNGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           EG+    Q+  D  ++  +I+F  PQ + + L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 EGKVNYMQLKKDFDQKKATIQFHQPQRFKEELWKIQEKLECYFGSLVGSNVYITPQGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLP--RMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PHYDD+E FILQLEG+K W++Y P   +  EY         S++ IG P     L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYKPTVHLAQEYNVE------SEDRIGNPTHEFILKPG 229

Query: 175 DLLYLPRGYIHQAST-VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T +   +S HVTIS YQ  +W D L  A+P  +      D+  R  
Sbjct: 230 DLLYFPRGTIHQADTPLGISYSTHVTISTYQNNSWGDFLLDAIPGLVFDTAKEDVALRTS 289

Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
           +P   L          VDI      +   L+ L  +L    +L     +M K  + + LP
Sbjct: 290 IPRQLLMQ--------VDIADSTKKLSSLLRQLADRLENTRELRSS--DMKKDFIMNRLP 339

Query: 294 PVL 296
           P L
Sbjct: 340 PYL 342


>gi|114588106|ref|XP_001139845.1| PREDICTED: MYC-induced nuclear antigen isoform 2 [Pan troglodytes]
 gi|114588108|ref|XP_001139930.1| PREDICTED: MYC-induced nuclear antigen isoform 3 [Pan troglodytes]
 gi|410226778|gb|JAA10608.1| MYC induced nuclear antigen [Pan troglodytes]
 gi|410264218|gb|JAA20075.1| MYC induced nuclear antigen [Pan troglodytes]
 gi|410306734|gb|JAA31967.1| MYC induced nuclear antigen [Pan troglodytes]
 gi|410331313|gb|JAA34603.1| MYC induced nuclear antigen [Pan troglodytes]
 gi|410331315|gb|JAA34604.1| MYC induced nuclear antigen [Pan troglodytes]
          Length = 465

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           +P           +   R +G  R       ADRL   E  K+LL+             +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329

Query: 283 MGKQLMHDALPP 294
           M K  +   LPP
Sbjct: 330 MKKDFIMHRLPP 341


>gi|15928921|gb|AAH14928.1| MYC induced nuclear antigen [Homo sapiens]
 gi|38708276|gb|AAR27293.1| NO52 protein [Homo sapiens]
          Length = 465

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           +P           +   R +G  R       ADRL   E  K+LL+             +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329

Query: 283 MGKQLMHDALPP 294
           M K  +   LPP
Sbjct: 330 MKKDFIMHRLPP 341


>gi|110227619|ref|NP_001035998.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           MINA isoform a [Homo sapiens]
 gi|110227621|ref|NP_694822.2| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           MINA isoform a [Homo sapiens]
 gi|74750624|sp|Q8IUF8.1|MINA_HUMAN RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase MINA; AltName: Full=60S ribosomal
           protein L27a histidine hydroxylase; AltName:
           Full=Histone lysine demethylase MINA; AltName:
           Full=MYC-induced nuclear antigen; AltName: Full=Mineral
           dust-induced gene protein; AltName: Full=Nucleolar
           protein 52; AltName: Full=Ribosomal oxygenase MINA;
           Short=ROX
 gi|23307412|dbj|BAC16537.1| Mina53 [Homo sapiens]
 gi|31872083|gb|AAP59421.1| MDIG [Homo sapiens]
 gi|91242970|gb|ABE28016.1| MYC induced nuclear antigen [Homo sapiens]
 gi|119600280|gb|EAW79874.1| MYC induced nuclear antigen, isoform CRA_b [Homo sapiens]
          Length = 465

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           +P           +   R +G  R       ADRL   E  K+LL+             +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329

Query: 283 MGKQLMHDALPP 294
           M K  +   LPP
Sbjct: 330 MKKDFIMHRLPP 341


>gi|417401381|gb|JAA47579.1| Hypothetical protein [Desmodus rotundus]
          Length = 464

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/237 (37%), Positives = 133/237 (56%), Gaps = 8/237 (3%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           WE++PL I R          +  ++ D+  L    + + ++I++    +G+++ ++ +G+
Sbjct: 59  WEQKPLLIQRDDPALAARYQSLFRLSDLKSLCSRGVYYGRDINVCRCVNGKKKVLNKDGK 118

Query: 61  A-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
               Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG  P
Sbjct: 119 VHFLQLRKDFDQKRATIQFHQPQRFKDDLWRIQEKLECYFGSLVGSNVYLTPAGSQGLPP 178

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
           HYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDLLY 
Sbjct: 179 HYDDVEVFILQLEGEKHWRLYHPTVP---LARDYSVE-AEDRIGRPAHEFTLKPGDLLYF 234

Query: 180 PRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           PRG IHQA T     HS HVTIS YQ  +W DLL  A+   +      DLEFR G+P
Sbjct: 235 PRGTIHQADTPPGLAHSTHVTISTYQNNSWGDLLLDAISGLVFDTAKEDLEFRAGIP 291


>gi|402858812|ref|XP_003893879.1| PREDICTED: MYC-induced nuclear antigen isoform 1 [Papio anubis]
 gi|402858814|ref|XP_003893880.1| PREDICTED: MYC-induced nuclear antigen isoform 2 [Papio anubis]
          Length = 465

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIRRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRAG 289

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           +P           +   R +G  R       ADRL   E  K+LL+             +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329

Query: 283 MGKQLMHDALPP 294
           M K  +   LPP
Sbjct: 330 MKKDFIMHRLPP 341


>gi|387762704|ref|NP_001248638.1| MYC induced nuclear antigen [Macaca mulatta]
 gi|355559267|gb|EHH15995.1| hypothetical protein EGK_11218 [Macaca mulatta]
 gi|355746351|gb|EHH50965.1| hypothetical protein EGM_10276 [Macaca fascicularis]
 gi|380817320|gb|AFE80534.1| MYC-induced nuclear antigen isoform a [Macaca mulatta]
 gi|384949894|gb|AFI38552.1| MYC-induced nuclear antigen isoform a [Macaca mulatta]
          Length = 465

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIRRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDLDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRAG 289

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           +P           +   R +G  R       ADRL   E  K+LL+             +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329

Query: 283 MGKQLMHDALPP 294
           M K  +   LPP
Sbjct: 330 MKKDFIMHRLPP 341


>gi|397502581|ref|XP_003821931.1| PREDICTED: LOW QUALITY PROTEIN: MYC-induced nuclear antigen [Pan
           paniscus]
          Length = 465

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 102/310 (32%), Positives = 153/310 (49%), Gaps = 41/310 (13%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     +  +       + +++++    +G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRGDPALATYYGSLFKLTDLKSLCSRGTY-YGRDVNVCRCVNGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G+P
Sbjct: 232 LYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTGIP 291

Query: 236 -----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG 284
                      +   R +G  R       ADRL   E  K+LL+             +M 
Sbjct: 292 RQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SDMK 331

Query: 285 KQLMHDALPP 294
           K  +   LPP
Sbjct: 332 KDFIMHRLPP 341


>gi|348555088|ref|XP_003463356.1| PREDICTED: MYC-induced nuclear antigen-like [Cavia porcellus]
          Length = 464

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 87/240 (36%), Positives = 131/240 (54%), Gaps = 10/240 (4%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WEK+PL I R       YY +L     +  +   ++  + +++++    +G+++ ++ 
Sbjct: 57  EFWEKKPLLIQRDDPALAAYYQSLFRLTDLKSLCGQSLY-YGRDVNVCRCINGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKVHFLQLRKDFDQKRTTIQFHQPQRFKDELWKIQEKLECYFGSLVGSNVYITPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S    +  +G P    TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYTPTVA---LAREYSVE-PEARLGAPTHEFTLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ  +W D L   M   +      D+E RRG+P
Sbjct: 232 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTMSGLVFDTAKDDVELRRGIP 291


>gi|23463307|ref|NP_695221.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           MINA [Rattus norvegicus]
 gi|81877998|sp|Q8CFC1.1|MINA_RAT RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase MINA; AltName: Full=Histone lysine
           demethylase MINA; AltName: Full=MYC-induced nuclear
           antigen
 gi|23200600|dbj|BAC16362.1| Mina53 [Rattus norvegicus]
 gi|56585203|gb|AAH87650.1| Myc induced nuclear antigen [Rattus norvegicus]
 gi|149060276|gb|EDM10990.1| myc induced nuclear antigen, isoform CRA_a [Rattus norvegicus]
          Length = 465

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 97/299 (32%), Positives = 152/299 (50%), Gaps = 19/299 (6%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R   +   YY +L +   +  +    +  + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPSLAAYYQSLFSLSDLRSLCSQGLY-YGRDVNVCRCIGGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKAQFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K+W++Y P +    L R  S    ++ IGTP     L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGRKRWRLYSPTVP---LAREYSVE-PEDRIGTPTHDFLLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T +   HS+H+TIS YQ  +W D L  ++   +      D+  R G+P
Sbjct: 232 LYFPRGTIHQAETPSGLAHSIHLTISTYQNNSWGDYLLDSISGLVFDIAKEDVALRTGMP 291

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
               R   +    P D+          L D L    E +  D     M K  +   LPP
Sbjct: 292 ----RRMLMNVETPADVTRKLSGFLRTLADQLEGRKELLSSD-----MKKDFVMHRLPP 341


>gi|431892471|gb|ELK02909.1| MYC-induced nuclear antigen [Pteropus alecto]
          Length = 509

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 104/302 (34%), Positives = 154/302 (50%), Gaps = 25/302 (8%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + ++I++    +G+++ +
Sbjct: 102 EFWEQKPLLIQRDDPAVATYYQSLF---KLSDLKSLCSRGMYYGRDINVCRCVNGKKKVL 158

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 159 NKDGKVHFLQLRKDFVQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 218

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PG
Sbjct: 219 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEDRIGRPAHEFTLKPG 274

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      DL FR G
Sbjct: 275 DLLYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDLAFRAG 334

Query: 234 LPIGYLRYAGLARGKPVDIQADRL-AMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
           +P        L + +   +   RL A    L D L    E +  D     M K  + + L
Sbjct: 335 IPRQL-----LLQVETTAVATRRLSAFLRTLADRLEGTSELLSAD-----MKKDFVMNRL 384

Query: 293 PP 294
           PP
Sbjct: 385 PP 386


>gi|62751815|ref|NP_001015675.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           MINA [Bos taurus]
 gi|75057883|sp|Q5EA24.1|MINA_BOVIN RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase MINA; AltName: Full=Histone lysine
           demethylase MINA; AltName: Full=MYC-induced nuclear
           antigen
 gi|59857855|gb|AAX08762.1| MYC induced nuclear antigen isoform 2 [Bos taurus]
 gi|296491583|tpg|DAA33626.1| TPA: MYC-induced nuclear antigen [Bos taurus]
          Length = 462

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 101/304 (33%), Positives = 152/304 (50%), Gaps = 30/304 (9%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     +  +     I + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPALATYYQSLFRLSDLKSLCSWG-IYYGRDVNVCRCVHGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GR    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  AQG
Sbjct: 116 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGAQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ ++W D L   +   +      D+  R G+P
Sbjct: 232 LYFPRGTIHQADTPEGLAHSTHVTISTYQSSSWGDFLLDTISGLVFDTAKADVALRAGIP 291

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE-----MGKQLMHD 290
              L            +QA+ +A+   L   L  L + +   +G  E     M K    +
Sbjct: 292 RQLL------------LQAESIAVATRLSGFLRMLADRL---EGTKELPSADMKKDFAMN 336

Query: 291 ALPP 294
            LPP
Sbjct: 337 RLPP 340


>gi|344277197|ref|XP_003410390.1| PREDICTED: MYC-induced nuclear antigen-like [Loxodonta africana]
          Length = 462

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 97/277 (35%), Positives = 145/277 (52%), Gaps = 24/277 (8%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L    ++  +    +  + ++I++    +G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPALATYYQSLFRLTELKSLCSRGLY-YGRDINVCRCVNGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKVYFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ ++W D L   +   +      D+E R GLP
Sbjct: 232 LYFPRGTIHQADTPPGLAHSTHVTISTYQNSSWGDFLLDTISGLVFDTAKEDIELRAGLP 291

Query: 236 IGYL---------RYAGLARGKPVDIQADRLAMKENL 263
              L         R +G  R     + ADRL   E L
Sbjct: 292 RQLLLVGSTTVTTRLSGFLR-----MLADRLEGTEEL 323


>gi|426217331|ref|XP_004002907.1| PREDICTED: MYC-induced nuclear antigen isoform 2 [Ovis aries]
          Length = 460

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 87/240 (36%), Positives = 132/240 (55%), Gaps = 10/240 (4%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     ++ +     I + +++++  Y  G+++  + 
Sbjct: 56  EFWEQKPLLIQRDDPALATYYQSLFRLSDLESLCSWG-IYYGRDVNVCRYVHGKKKVFNK 114

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GR    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  AQG
Sbjct: 115 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRVQEKLECYFGSLVGSNVYITPAGAQG 174

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDL
Sbjct: 175 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 230

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS H+TIS YQ+++W D L   +   +      D+  R G+P
Sbjct: 231 LYFPRGTIHQADTPEGLAHSTHLTISTYQRSSWGDFLLDTISGLVFDTAKADVALRAGIP 290


>gi|426217329|ref|XP_004002906.1| PREDICTED: MYC-induced nuclear antigen isoform 1 [Ovis aries]
          Length = 461

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/240 (36%), Positives = 132/240 (55%), Gaps = 10/240 (4%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     ++ +     I + +++++  Y  G+++  + 
Sbjct: 56  EFWEQKPLLIQRDDPALATYYQSLFRLSDLESLCSWG-IYYGRDVNVCRYVHGKKKVFNK 114

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GR    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  AQG
Sbjct: 115 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRVQEKLECYFGSLVGSNVYITPAGAQG 174

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDL
Sbjct: 175 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 230

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS H+TIS YQ+++W D L   +   +      D+  R G+P
Sbjct: 231 LYFPRGTIHQADTPEGLAHSTHLTISTYQRSSWGDFLLDTISGLVFDTAKADVALRAGIP 290


>gi|403273551|ref|XP_003928573.1| PREDICTED: MYC-induced nuclear antigen [Saimiri boliviensis
           boliviensis]
          Length = 434

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/242 (37%), Positives = 134/242 (55%), Gaps = 14/242 (5%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++     G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALAAYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVSGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P    TL+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPTHEFTLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +P  +       +E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTIPGLVFDTAKEGVELRAG 289

Query: 234 LP 235
           +P
Sbjct: 290 IP 291


>gi|197098352|ref|NP_001127580.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
           MINA [Pongo abelii]
 gi|75054795|sp|Q5R673.1|MINA_PONAB RecName: Full=Bifunctional lysine-specific demethylase and
           histidyl-hydroxylase MINA; AltName: Full=Histone lysine
           demethylase MINA; AltName: Full=MYC-induced nuclear
           antigen
 gi|55732074|emb|CAH92743.1| hypothetical protein [Pongo abelii]
          Length = 465

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+   PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQLHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +  A   D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDAAKEDVELRAG 289

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           +P           +   R +G  R       ADRL   E  K++L+             +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKEVLS------------SD 329

Query: 283 MGKQLMHDALPP 294
           M K  +   LPP
Sbjct: 330 MKKDFIMHRLPP 341


>gi|440894819|gb|ELR47165.1| MYC-induced nuclear antigen [Bos grunniens mutus]
          Length = 463

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/304 (33%), Positives = 151/304 (49%), Gaps = 30/304 (9%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     +  +     I + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPALATYYQSLFRLSDLKSLCSWG-IYYGRDVNVCRCVHGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GR    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  AQG
Sbjct: 116 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGAQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ ++W D L   +   +      D+  R G+P
Sbjct: 232 LYFPRGTIHQADTPEGLAHSTHVTISTYQSSSWGDFLLDTISGLVFDTAKADVALRAGIP 291

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE-----MGKQLMHD 290
              L            +QA+  A+   L   L  L + +   +G  E     M K    +
Sbjct: 292 RQLL------------LQAESTAVATRLSGFLRMLADRL---EGTKELPSADMKKDFAMN 336

Query: 291 ALPP 294
            LPP
Sbjct: 337 RLPP 340


>gi|14041887|dbj|BAB55024.1| unnamed protein product [Homo sapiens]
          Length = 464

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/311 (33%), Positives = 156/311 (50%), Gaps = 44/311 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  F   VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFSSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289

Query: 234 LP----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEM 283
           +P          +   R +G  R       ADRL   E  K+LL+             +M
Sbjct: 290 IPRQLLLVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SDM 329

Query: 284 GKQLMHDALPP 294
            K  +   LPP
Sbjct: 330 KKDFIMHRLPP 340


>gi|345310558|ref|XP_001519133.2| PREDICTED: MYC-induced nuclear antigen [Ornithorhynchus anatinus]
          Length = 444

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/242 (35%), Positives = 136/242 (56%), Gaps = 14/242 (5%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WE++PL I R   +   YY +L     +  +    +  + +++++    +G+++ ++ 
Sbjct: 36  EYWEQKPLLIQRDDPSIAAYYQSLFRLMDLKGLCSRGMF-YGRDVNVCRCVNGKKKVLNK 94

Query: 58  EGRA---LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
            G+    L +  +D  R   +I+F  PQ +   L ++   L+  FG  VG+N Y TPP +
Sbjct: 95  GGKVNYLLLRKDFDQKR--ATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPPGS 152

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +   +L R  +    +++IG+P    TL+PG
Sbjct: 153 QGLPPHYDDVEVFILQLEGEKHWRLYQPTV---HLAREYNVE-PEDKIGSPTHEFTLKPG 208

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +P  +      ++E R G
Sbjct: 209 DLLYFPRGTIHQADTPPGVAHSTHVTISTYQNNSWGDFLLDIIPGLVFETAKEEVELRMG 268

Query: 234 LP 235
           +P
Sbjct: 269 IP 270


>gi|119331084|ref|NP_001073191.1| MYC induced nuclear antigen [Gallus gallus]
 gi|53133678|emb|CAG32168.1| hypothetical protein RCJMB04_19e21 [Gallus gallus]
          Length = 473

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 155/301 (51%), Gaps = 20/301 (6%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WE++PL + R       YY +L     + ++    +  + ++I+I    +G+++ ++ 
Sbjct: 57  EYWEEKPLLVQRNDPLLAAYYQSLFQLSDLKELCSQGLY-YGRDINICRCVNGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           EG+    Q+  D  ++  +++F  PQ + + L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 EGKVNYAQLNKDFDQKKATMQFHQPQRFKEELWKIQEKLECYFGSLVGSNVYITPQGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +   +L R  +    ++ IG P     L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYKPTV---HLAREYNVE-PEDRIGNPTHEFVLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     +S HVTIS YQ  +W D L  A+P  + +    D+  R  +P
Sbjct: 232 LYFPRGTIHQADTPPGIPYSTHVTISTYQNNSWGDFLLDAIPGLVFSTAKEDVALRTSIP 291

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPV 295
              L          VDI      +   L+ L  +L    +L     +M K  + + LPP 
Sbjct: 292 RKLLMQ--------VDIADSTKKLSSILRMLADRLENTGELRSS--DMRKDFIMNRLPPC 341

Query: 296 L 296
           L
Sbjct: 342 L 342


>gi|348545500|ref|XP_003460218.1| PREDICTED: myc-induced nuclear antigen-like [Oreochromis niloticus]
          Length = 465

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 94/306 (30%), Positives = 158/306 (51%), Gaps = 32/306 (10%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WEK+PL++ R       YY +L     +  +     +E+ ++I++    +G+++ ++ 
Sbjct: 59  EYWEKKPLHLQRSDPGTALYYQSLFQLCDLQSLCSQG-LEYYRDINVVRCVNGKKKLLNK 117

Query: 58  EGRALPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           EGR    ++     +   ++++  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 118 EGRVKHSVLNKIMSQNKATVQYHQPQRFKDELWRIQEKLECFFGALVGSNIYITPEESQG 177

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PHYDD+E FILQLEG+K W +Y P   +  EY  +      S++ IG P   + L+ G
Sbjct: 178 LPPHYDDVEVFILQLEGQKHWHLYNPTVPLAAEYSVQ------SEDSIGIPTHDIILKAG 231

Query: 175 DLLYLPRGYIHQASTVTNEH-SLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T    + S+H+T+S YQ+ +W DLL  A P  L      ++ FR G
Sbjct: 232 DLLYFPRGTIHQARTPAGVNCSIHLTLSTYQRMSWGDLLLDAFPSLLCDRSRREVSFREG 291

Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG-----KQLM 288
           +P   L  +           ++ L   + L   L  L +  +++ G+ E+G     +  +
Sbjct: 292 MPRRLLLSS-----------SEGLHTNKRLAACLRSLAD--EMETGMQELGFTHMKRDFI 338

Query: 289 HDALPP 294
            + LPP
Sbjct: 339 MNRLPP 344


>gi|198425574|ref|XP_002131324.1| PREDICTED: similar to MYC induced nuclear antigen [Ciona
           intestinalis]
          Length = 490

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 95/296 (32%), Positives = 150/296 (50%), Gaps = 24/296 (8%)

Query: 14  QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ--RQTMDVEGRALPQIVWDHYR 71
           + +Y+N L   + + +++    +++ K+I    ++D +  R   +V G    + V   + 
Sbjct: 76  RSSYFNKLFNHEILKEVVLSKKLKYDKDICACRFDDEKKCRVNAEVHGPVTAEKVHSLFH 135

Query: 72  EG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           +   +++F  PQ +   L ++   L+  FG  VG+N Y TP  +QG APH+D++E FILQ
Sbjct: 136 DDKMTLQFHQPQRFHDELWKIQEKLESFFGSQVGSNVYMTPDGSQGLAPHHDNVEVFILQ 195

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-GTPIL-TVTLEPGDLLYLPRGYIHQAS 188
           LEG+K+WK+Y P +    LPR SS +F    + G  +L T+ ++PGD+LY PRG +HQA 
Sbjct: 196 LEGEKEWKLYSPVV---NLPRNSSSDFDDSTVKGLTLLDTIIMKPGDVLYFPRGTVHQAK 252

Query: 189 TVTNE-HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARG 247
           ++    HS H+TIS Y+   W D +   +P    AA    +  RRGLP  Y         
Sbjct: 253 SIKGTGHSTHLTISTYETQCWGDYILDFIPYLTDAAADKVVSLRRGLPRKYYN------- 305

Query: 248 KPVDIQADRLAMKENLKDLLAKLVEYVDLDDGV---DEMGKQLMHDALPPVLSPEE 300
                Q      K+NLK  L  L E +     V   D +      + LPP  S +E
Sbjct: 306 -----QTSTDGFKQNLKKALISLAESLTESTDVEMSDSIVCNFFANRLPPFGSVQE 356


>gi|332225165|ref|XP_003261750.1| PREDICTED: MYC-induced nuclear antigen [Nomascus leucogenys]
          Length = 465

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 88/242 (36%), Positives = 134/242 (55%), Gaps = 14/242 (5%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
           + WE++PL I R       YY +L    K+ D+  L    + + +++++    +G+++ +
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113

Query: 56  DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           + +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           QG  PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+    L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFLLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQ  T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQVDTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRAG 289

Query: 234 LP 235
           +P
Sbjct: 290 IP 291


>gi|291400828|ref|XP_002716795.1| PREDICTED: MYC induced nuclear antigen-like [Oryctolagus cuniculus]
          Length = 465

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 101/308 (32%), Positives = 154/308 (50%), Gaps = 41/308 (13%)

Query: 3   WEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
           WE++PL I R       YY +L     +  +    +  + +++++    +G+++ ++ +G
Sbjct: 59  WEQKPLLIQRDDPALATYYQSLFRLTDLKSLCSRGMY-YGRDVNVCRCVNGKKKVLNKDG 117

Query: 60  RA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           +A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG  
Sbjct: 118 KAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQGLP 177

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PHYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDLLY
Sbjct: 178 PHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDLLY 233

Query: 179 LPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP-- 235
            PRG IHQA T     HS HVTIS YQ  +W D L   +   +  A   D+  R G+P  
Sbjct: 234 FPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDAAKEDVALRAGIPRQ 293

Query: 236 ---------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQ 286
                    +   R +G  R     + A+RL   E  K+LL+             +M K 
Sbjct: 294 LLLQAESTTVATRRLSGFLR-----MLAERL---EGTKELLSS------------DMKKD 333

Query: 287 LMHDALPP 294
            +   LPP
Sbjct: 334 FIMHRLPP 341


>gi|456753234|gb|JAA74127.1| MYC induced nuclear antigen tv2 [Sus scrofa]
          Length = 464

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 103/309 (33%), Positives = 150/309 (48%), Gaps = 40/309 (12%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WEK+PL I R       YY +L     +  +     I + ++I++     G+++  + 
Sbjct: 57  EFWEKKPLLIQRNDPVLATYYRSLFRLSDLKSLCSRG-IYYGRDINVCRCVSGKKKVFNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
            G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 GGKVHFLQLRKDFDQKKATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYLTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   ++  IG P     L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AENRIGKPAYEFMLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS H+TIS YQ ++W D L   +   +      D+EFR G+P
Sbjct: 232 LYFPRGTIHQADTPPGLAHSTHLTISTYQSSSWGDFLLDTISGLVFDMAKADVEFRAGIP 291

Query: 236 IGYL----------RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGK 285
              L          R +G  R     + ADRL   E  K+L +             +M K
Sbjct: 292 RQLLLQVETTDVATRLSGFLR-----MLADRL---EGTKELFST------------DMKK 331

Query: 286 QLMHDALPP 294
               + LPP
Sbjct: 332 DFALNRLPP 340


>gi|81674421|gb|AAI09962.1| MYC induced nuclear antigen [Bos taurus]
          Length = 461

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 87/240 (36%), Positives = 130/240 (54%), Gaps = 10/240 (4%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     +  +     I + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPALATYYQSLFRLSDLKSLCSWG-IYYGRDVNVCRCVHGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GR    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  AQG
Sbjct: 116 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGAQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ ++W D L   +   +      D+  R G+P
Sbjct: 232 LYFPRGTIHQADTPEGLAHSTHVTISTYQSSSWGDFLLDTISGLVFDTAKADVALRAGIP 291


>gi|149731734|ref|XP_001504462.1| PREDICTED: MYC-induced nuclear antigen [Equus caballus]
          Length = 464

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 99/312 (31%), Positives = 154/312 (49%), Gaps = 45/312 (14%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     +  +    +  + ++I++    +G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPALATYYQSLFRLSDLKSLCSRGMY-YGRDINVCRCVNGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PHYDD+E  ILQLEG+K+W++Y P   +  EY         +++ IG+P    TL+PG
Sbjct: 176 LPPHYDDVEVLILQLEGEKRWRLYRPTVPLAREYGVE------AEDSIGSPAREFTLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           D LY PRG IHQA T     HS H+TIS YQ  +W D L   +   +      DLE R G
Sbjct: 230 DFLYFPRGTIHQADTPPGLAHSTHLTISTYQNNSWGDFLLDTISGLVFDTAKEDLELRAG 289

Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
           LP           +   R +G  R     + AD+L   E  K+LL+             +
Sbjct: 290 LPRQLLLQVETTAVARRRVSGFLR-----MLADKL---EGTKELLSA------------D 329

Query: 283 MGKQLMHDALPP 294
           M K  + + LPP
Sbjct: 330 MKKDFIMNRLPP 341


>gi|344254172|gb|EGW10276.1| Myc-induced nuclear antigen [Cricetulus griseus]
          Length = 826

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 86/240 (35%), Positives = 131/240 (54%), Gaps = 10/240 (4%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     ++ I    +  + K++++    +G+++ ++ 
Sbjct: 414 EFWEQKPLLIQRNDPSLATYYQSLFRLSDMNRICLQGLY-YGKDVNVCRCINGKKKVLNK 472

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
            G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 473 SGKVQFLQLRKDFDQKRATIQFHQPQRFKDELWKIQEKLECYFGSLVGSNVYMTPAGSQG 532

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S    +  IGTP     L+PGDL
Sbjct: 533 LPPHYDDVEVFILQLEGEKHWRLYSPTVP---LAREYSVE-PEARIGTPTHDFILKPGDL 588

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA+T +   HS HVTIS YQ  +W D L  ++   +      D+  R G+P
Sbjct: 589 LYFPRGTIHQANTPSGLAHSTHVTISTYQNNSWGDFLLDSLSGLVFDIAKEDVALRTGIP 648


>gi|354493583|ref|XP_003508920.1| PREDICTED: myc-induced nuclear antigen-like [Cricetulus griseus]
          Length = 439

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 86/240 (35%), Positives = 132/240 (55%), Gaps = 10/240 (4%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R   +   YY +L     ++ I    +  + K++++    +G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRNDPSLATYYQSLFRLSDMNRICLQGLY-YGKDVNVCRCINGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
            G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 SGKVQFLQLRKDFDQKRATIQFHQPQRFKDELWKIQEKLECYFGSLVGSNVYMTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S    +  IGTP     L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYSPTVP---LAREYSVE-PEARIGTPTHDFILKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA+T +   HS HVTIS YQ  +W D L  ++   +      D+  R G+P
Sbjct: 232 LYFPRGTIHQANTPSGLAHSTHVTISTYQNNSWGDFLLDSLSGLVFDIAKEDVALRTGIP 291


>gi|395821318|ref|XP_003783991.1| PREDICTED: MYC-induced nuclear antigen [Otolemur garnettii]
          Length = 465

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 86/240 (35%), Positives = 130/240 (54%), Gaps = 10/240 (4%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     +  +     I + +++++    +G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPALATYYQSLFRLTDLKSLCSRG-IYYGRDVNVCRCVNGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYLTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S    ++ IG P    TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-PEDRIGRPTYEFTLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+  R G+P
Sbjct: 232 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVALRAGIP 291


>gi|147905999|ref|NP_001086856.1| MYC induced nuclear antigen [Xenopus laevis]
 gi|50415478|gb|AAH77562.1| Mina-prov protein [Xenopus laevis]
          Length = 461

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 83/239 (34%), Positives = 132/239 (55%), Gaps = 8/239 (3%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVE 58
           D+WE + L +  +   + +   T  ++ D+  +    I +++++++    DG++  +   
Sbjct: 54  DYWETKVLLLQGRDPAFTDYFQTLFRLSDLKHIAGGGIYYERDVNVFKCRDGKKIALPRH 113

Query: 59  GRALPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
           G+A    +   +  G  +I+F  PQ +   L  +   L+  FG  VG+N Y TP  +QG 
Sbjct: 114 GKATYLHLLKDFGSGKATIQFHQPQRFNDALWHIMEKLECFFGALVGSNVYITPQDSQGL 173

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
             HYDD+E FILQLEG+K+W++Y P  V      YS     +++IG+P     L+PGDLL
Sbjct: 174 PAHYDDVEVFILQLEGEKRWRLYNP--VVPLARDYSV--VPEDQIGSPTHDFVLKPGDLL 229

Query: 178 YLPRGYIHQASTVT-NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           Y PRG IHQA  +  + HS HVTIS YQ  +W D L+  +P  L  A   +++ RRG+P
Sbjct: 230 YFPRGVIHQAQALPGSSHSTHVTISTYQNNSWSDYLQDLLPGILFDAAKANIDLRRGIP 288


>gi|351698842|gb|EHB01761.1| MYC-induced nuclear antigen [Heterocephalus glaber]
          Length = 463

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 92/277 (33%), Positives = 140/277 (50%), Gaps = 16/277 (5%)

Query: 1   DHWEKEPLYISRKQ----KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD 56
           + WEK+PL + R        YY +L     +  +    +  + +++++     G+++ ++
Sbjct: 57  EFWEKKPLLVQRHDDPALATYYQSLFRLADLKSLCSQGLY-YGRDVNVCRSVSGKKKVLN 115

Query: 57  VEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
            +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +Q
Sbjct: 116 KDGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRVQEKLEYYFGFLVGSNVYITPAGSQ 175

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PHYDD+E FILQLEG+  W++Y P   +  EY   Y +       +G P    TL+P
Sbjct: 176 GLPPHYDDVEVFILQLEGENHWRLYSPTVALAREYSVEYEA------RLGVPTREFTLKP 229

Query: 174 GDLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
           GDLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R 
Sbjct: 230 GDLLYFPRGAIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRS 289

Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMK-ENLKDLLA 268
           G+P   L    LA    +      LA + E  K+LL+
Sbjct: 290 GIPRQLLMVETLAVATRLSSFLRTLAERLEGTKELLS 326


>gi|351711026|gb|EHB13945.1| MYC-induced nuclear antigen [Heterocephalus glaber]
          Length = 463

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/242 (34%), Positives = 127/242 (52%), Gaps = 14/242 (5%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WEK+PL I R       YY +L     +  +    +  + +++++     G+++ ++ 
Sbjct: 57  EFWEKKPLLIQRDDPALATYYQSLFRLADLKSLCSQGLY-YGRDVNVCRSVSGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+    Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRVQEKLECYFGSLVGSNVYITPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PHYDD+E FILQLEG+K W++Y P   +  EY   Y +       +G P     L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYPPTVALACEYSVEYEA------RLGAPTHEFMLKPG 229

Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           DLLY PRG IHQA T     HS HVTIS YQ  +W D L   +   +      D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRSG 289

Query: 234 LP 235
           +P
Sbjct: 290 IP 291


>gi|74002431|ref|XP_535711.2| PREDICTED: MYC-induced nuclear antigen isoform 1 [Canis lupus
           familiaris]
          Length = 463

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 146/300 (48%), Gaps = 22/300 (7%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVE 58
           + WEK+PL + R             ++ D+  L    + + +++++     G+++ ++  
Sbjct: 57  EFWEKQPLVLRRDDPAVAAYCQALFRLSDLKSLCGRGLLYGRDVNVCRCVKGKKKVLNKA 116

Query: 59  GRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
           GR    Q+  D  +   +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG 
Sbjct: 117 GRVHFLQLRKDLDQRRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQGL 176

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P    TL+PGD+L
Sbjct: 177 PPHYDDVEVFILQLEGQKHWRLYHPTVP---LAREYSVE-AEERIGRPAYEFTLKPGDVL 232

Query: 178 YLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           Y PRG +HQA       HS HVTIS YQ ++W D L   +   +  A   D+  R G+P 
Sbjct: 233 YFPRGTVHQADVPPGLAHSTHVTISTYQNSSWGDCLLDTVSGLVFDAVKEDVALRAGIPR 292

Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGV--DEMGKQLMHDALPP 294
             L   G A            A +  L  LL  L + ++    +    M K  +   LPP
Sbjct: 293 QLLLVEGTA------------AARRRLGGLLRTLADRLEGTGALLAGGMKKDFVTHRLPP 340


>gi|425448177|ref|ZP_18828156.1| Cupin 4 family protein [Microcystis aeruginosa PCC 9443]
 gi|389731118|emb|CCI04793.1| Cupin 4 family protein [Microcystis aeruginosa PCC 9443]
          Length = 383

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 90/298 (30%), Positives = 157/298 (52%), Gaps = 30/298 (10%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNI-DITLYEDGQRQTMDVEG 59
           ++WEK P  ISR + +YY++L++ K ID I+R     +  N+ D+ + ++    T +  G
Sbjct: 19  EYWEKRPCLISRTEPDYYSHLLSLKDIDSIIRS----YGSNLSDVEVIKENSYFTPN--G 72

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
               Q ++  Y +G ++R        +P+  L   LQ  F   V  N Y +P  +QGF  
Sbjct: 73  VVNAQQLYQAYSQGYTLRVRKISERWQPISDLRTKLQVFFNHPVLINLYMSPKNSQGFQA 132

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE---EIGTPILTVTLEPGDL 176
           H+D  E FI+Q+EG K W++Y   +    LP  S   + ++   ++ +P+   TL  GDL
Sbjct: 133 HFDTHEVFIVQVEGSKNWRIYDSPIT---LPLISDLKYQEKLRNQLTSPVTAFTLNAGDL 189

Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           LY+PRGYIH+  T ++  S H+T++++    W DLL  A+    Q     D+ FR  LPI
Sbjct: 190 LYIPRGYIHEVFT-SDSFSTHLTVAIH-TYKWFDLLNIAVNLIAQK----DVRFRESLPI 243

Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
           G+L      +G+       +++++E  ++LL  L +   ++  V+E+ +  + +  PP
Sbjct: 244 GFLE-----QGEA------KVSLQEQFQELLQILADKSQVEFAVEELAQLFLGETPPP 290


>gi|428769416|ref|YP_007161206.1| hypothetical protein Cyan10605_1036 [Cyanobacterium aponinum PCC
           10605]
 gi|428683695|gb|AFZ53162.1| hypothetical protein Cyan10605_1036 [Cyanobacterium aponinum PCC
           10605]
          Length = 397

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 103/338 (30%), Positives = 167/338 (49%), Gaps = 48/338 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD---- 56
           ++WEK+PL ISR QKNYY +L++ + +D IL+ + +   K  +I + ++ Q    D    
Sbjct: 23  EYWEKKPLIISRNQKNYYQDLLSVEDLDSILQYSKL---KPPEIRVVKNQQDFLPDRYVK 79

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
            +G      ++  Y E  ++     Q + KPL      LQ  F   V AN Y +P  ++G
Sbjct: 80  ADGSLNLNQLYKAYYEDHTLVVNGLQNFWKPLAIYCQKLQNFFNHGVIANLYLSPKDSKG 139

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS--PNFSQEEIGTPILTVTLEPG 174
            +PHYD  + F+LQ++G K+W+V+      + +P   S  P   +  +   + TV L+PG
Sbjct: 140 LSPHYDTHDVFVLQVDGSKEWQVH---QCFQPVPLLGSFQPVIPENSLPKLLHTVCLQPG 196

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           DLLYLPRG++H A+T     SLH+T+ +Y  T W+DL    M  AL      + + R+ L
Sbjct: 197 DLLYLPRGFVHHAAT-QESFSLHLTLGIY-PTQWLDL----MVNALTMLALKNQDLRKAL 250

Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH----- 289
           PIG+L        KP  +      +KE    LL    E   +D+ ++ +  Q +H     
Sbjct: 251 PIGFL-------DKPEMLG----EIKEQFDQLLQTFAEKSSIDEAMELIYDQFIHKITPI 299

Query: 290 -----------DALPP---VLSPEELQCSVFENGLRMS 313
                      DA+ P   V+  E ++C + + G  +S
Sbjct: 300 PDGHFQQVNLVDAIAPDTLVVKREGMRCRIVKKGFSIS 337


>gi|344249714|gb|EGW05818.1| Myc-induced nuclear antigen [Cricetulus griseus]
          Length = 317

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 87/241 (36%), Positives = 129/241 (53%), Gaps = 12/241 (4%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R   +   YY +L     ++ I    +    K++++    +G++  ++ 
Sbjct: 58  EFWEQKPLLIQRNDPSLATYYQSLFRLSDMNRICLQGLY-CGKDVNVCRCINGKK-VLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
            G+    Q+  D  ++  +I+F  PQ +   L +    L+  FG  VG+N Y TP  +QG
Sbjct: 116 SGKVQFLQLRKDFDQKRATIQFHQPQRFKDELWKTQEKLECYFGSLVGSNVYMTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKK-WKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
             PHYDD+E FILQLEG+KK W +Y P +    L R  S    +  IGTP     L+PGD
Sbjct: 176 LPPHYDDVEVFILQLEGEKKHWHLYSPTVP---LAREYSVE-PKARIGTPTHDFILKPGD 231

Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           LLY PRG IHQA+T +   HS HVTIS YQ  +W D L  ++ +        D+  R G+
Sbjct: 232 LLYFPRGTIHQANTPSGLAHSTHVTISTYQNNSWGDFLLDSLSRLEFDIAKEDVALRTGI 291

Query: 235 P 235
           P
Sbjct: 292 P 292


>gi|307154631|ref|YP_003890015.1| cupin [Cyanothece sp. PCC 7822]
 gi|306984859|gb|ADN16740.1| Cupin 4 family protein [Cyanothece sp. PCC 7822]
          Length = 390

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 92/299 (30%), Positives = 150/299 (50%), Gaps = 32/299 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +WEK PL ++R   +YY+ LI+ K ID ILR   +   K+ D+ L ++         G  
Sbjct: 24  YWEKSPLLVARNHPDYYSELISLKNIDSILR---LYGPKSSDVDLIKENS--FFSAGGEV 78

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
               ++  Y  G S+         +PL  L+ +L+      VG N Y T   +QGF  H+
Sbjct: 79  DFNQIYQAYSLGYSLVMRKIHERWQPLSVLHKNLEAFLNHPVGINLYMTSKNSQGFKAHF 138

Query: 122 DDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
           D  + FILQ+EG K+WK+Y     LP + D    +Y+    +Q  + +P     L  GDL
Sbjct: 139 DTHDVFILQVEGSKQWKIYDSPITLPVISD---LKYTDKFINQ--LKSPTAEYCLNKGDL 193

Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           LY+PRGYIH+  T  N  S+H+T+ ++    W DL+  A+ K  Q     ++ FR  LP+
Sbjct: 194 LYIPRGYIHEVYT-DNSFSVHLTVGIHS-LKWFDLINSAVTKLAQK----EVRFRESLPV 247

Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPV 295
           G+LR            +A+  ++K   ++LL  L E  ++++ V+++ +  +    P V
Sbjct: 248 GFLRQE----------EAEE-SLKNQFQELLKLLAEQSEVEEAVEDIAQGFLGKMSPLV 295


>gi|23200593|dbj|BAC16359.1| Mina53 form-3 [Homo sapiens]
          Length = 280

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/209 (37%), Positives = 118/209 (56%), Gaps = 10/209 (4%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     ++ +    +  + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPALATYYGSLFKLTDLESLCSRGMY-YGRDVNVCRCVSGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
             PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P     L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPAHEFMLKPGDL 231

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQ 204
           LY PRG IHQA T     HS HVTIS YQ
Sbjct: 232 LYFPRGTIHQADTPAGLAHSTHVTISTYQ 260


>gi|41056083|ref|NP_956631.1| MYC induced nuclear antigen-like [Danio rerio]
 gi|31419411|gb|AAH53125.1| MYC induced nuclear antigen-like [Danio rerio]
          Length = 528

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 145/343 (42%), Gaps = 60/343 (17%)

Query: 3   WEKEPLYISRKQK---NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
           WE++PL + R       YY +L     +  +     +++  +I+      GQ++ ++  G
Sbjct: 77  WERQPLVLHRSDAALAGYYGSLFPLSGLRRLCARG-LQYGTDINTCRCVRGQKRLLNRAG 135

Query: 60  RA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
                 +  D   +  +I+F  PQ +   L ++   L+  FGC VG+N Y TP  AQG  
Sbjct: 136 AVDFCLLERDFLEKKATIQFHQPQRFQDELWRIQERLECFFGCLVGSNVYITPAGAQGLP 195

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-EIGTPILTVTLEPGDLL 177
           PHYDD+E  ILQLEG+K W++Y P      +P     +   E  IG P     L+ GDLL
Sbjct: 196 PHYDDVEVLILQLEGQKHWRLYEP-----TVPLAREYSLEPEGRIGAPTHDFILQAGDLL 250

Query: 178 YLPRGYIHQASTVTNE-HSLHVTISVYQKT------------------------------ 206
           Y PRG IHQA T     HS H+T+S YQ                                
Sbjct: 251 YFPRGTIHQADTPAGAGHSTHLTLSTYQNMCVCAVHNVTHTHTHTLQNVLVLHSFIVCVR 310

Query: 207 ---AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENL 263
              +W DLL   MP  +     TD E R GLP G L    ++            A+   L
Sbjct: 311 ACRSWGDLLLDLMPGCVFDRMKTDCELRTGLPRGLLTTPSISP-----------AVSHQL 359

Query: 264 KDLLAKLVEYVDLDDG---VDEMGKQLMHDALPPVLS-PEELQ 302
              L +L + VD          M +  +   LPP L  P+ LQ
Sbjct: 360 SVFLRRLADVVDQQGQSLRSSSMRRDFISHRLPPFLQDPQLLQ 402


>gi|434387984|ref|YP_007098595.1| hypothetical protein Cha6605_4118 [Chamaesiphon minutus PCC 6605]
 gi|428018974|gb|AFY95068.1| hypothetical protein Cha6605_4118 [Chamaesiphon minutus PCC 6605]
          Length = 396

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 91/292 (31%), Positives = 148/292 (50%), Gaps = 38/292 (13%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDIL--------RDNVIEFKKNI--DITLYEDG 50
           D+WE++PL I+R+   YY  L + + +D +L        +  VI+ ++ +  D  + +DG
Sbjct: 23  DYWEQKPLLIAREHPTYYQELFSKQALDSVLFFSTPKPPQLRVIKNQQEVFPDKYIKQDG 82

Query: 51  QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
           Q     +         +  Y EG ++          PL  L  +LQ  F   V AN Y +
Sbjct: 83  QLNLNQL---------YKLYDEGNTLVVNGLHQLWLPLTILCRNLQLAFNHTVIANCYAS 133

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
           P  ++G  PHYD  + F+LQ+EG K+W V+  P+ V   L     P   + ++  P+ ++
Sbjct: 134 PKQSKGLMPHYDTHDVFVLQIEGAKQWFVHEAPQPVP--LLHSDQPIIPEGKLAEPLYSI 191

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
            L+ GDLLY+PRG+IH A+T  +  SLH+TI +Y  T W DLL +A+ +          E
Sbjct: 192 YLKAGDLLYIPRGFIHHAATADSASSLHLTIGLY-ATQWFDLLIQALTQ----VSCQRAE 246

Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
           FR+ LP+GYL+ +        +I+AD   +K   + L  K  +  + D  +D
Sbjct: 247 FRQALPLGYLQRS--------EIRAD---LKAQFQSLTEKFCQEANFDAAID 287


>gi|307102455|gb|EFN50729.1| hypothetical protein CHLNCDRAFT_142546 [Chlorella variabilis]
          Length = 636

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 82/266 (30%), Positives = 136/266 (51%), Gaps = 29/266 (10%)

Query: 2   HWEKEP--LYISRKQKNYYNNLITSKKIDDILR-----DNVIEFKKNIDITLYEDGQRQT 54
            WE+EP     + +++  ++ L +  ++  + R        +EF  +++   Y +G R+T
Sbjct: 13  QWEQEPAVFRATPERRQLFDGLFSFAELQRLARICEQEGKPLEFGVDVNAARYLNGVRET 72

Query: 55  MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           ++ E  A    +   + EGC++     Q + + L +L A+L+   GC  G N Y TP   
Sbjct: 73  LNGE-VADAATINRLFAEGCTM-----QRFSEELWRLLAALERQLGCLAGCNAYLTPKGT 126

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPR------------MVDEYLPRYSSPNFSQEEI 162
           QG APHYDD+E F++Q EG K+W+++ P             +   +L    S +    +I
Sbjct: 127 QGLAPHYDDVEIFVVQTEGSKRWRLHRPPADSGASTAAGGLLARCHLANQVSGDLQPADI 186

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
           G P+L V LE  D+LY+PRG IHQA    +  S H+T+S YQ+ +  DL++ A+  AL  
Sbjct: 187 GEPVLEVVLEAADVLYIPRGTIHQAEAQMHSSS-HLTLSTYQRWSAADLVQYAVSVALAN 245

Query: 223 AGATDL---EFRRGLPIGYLRYAGLA 245
                L     + GLP+ +L  A L+
Sbjct: 246 PQLQPLLAPPLKAGLPLRWLSDASLS 271


>gi|294817685|ref|ZP_06776327.1| Cupin 4 family protein [Streptomyces clavuligerus ATCC 27064]
 gi|294322500|gb|EFG04635.1| Cupin 4 family protein [Streptomyces clavuligerus ATCC 27064]
          Length = 414

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 81/246 (32%), Positives = 131/246 (53%), Gaps = 20/246 (8%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR---QTMDV 57
           D+WEK+PL + R+  +YY + +T   +D IL  + +   ++ ++ +  +G+    Q M  
Sbjct: 35  DYWEKKPLIVHREDPHYYTDPLTFHDVDQILSTSSV---RSSELKVVVNGREIPLQEMTA 91

Query: 58  EGRALP----QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
            G   P    ++++D YR+G ++ F       +PL +L  +L   F     AN Y TP  
Sbjct: 92  SGPGGPANGLEVLYDLYRKGSTVVFKFLHERWEPLGRLCRTLAAEFSAAFQANAYLTPAA 151

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP-NFSQEEIGTPILTVTLE 172
           AQG   HYD  + F+LQ+ G K W++Y        LP  S P   ++E  G P+   TL 
Sbjct: 152 AQGLTSHYDTHDVFVLQIWGSKHWRLY---DSPAELPLQSQPFRRTKEGPGNPVREFTLN 208

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
            GD++YLPRG +H A T  ++ SLHVT+ V Q   W  +L+ A+ +A++    +D  +R 
Sbjct: 209 AGDMMYLPRGTVHDA-TAQDQASLHVTLGV-QPVLWATVLKDALDRAIE----SDPRYRS 262

Query: 233 GLPIGY 238
            LP G+
Sbjct: 263 SLPPGF 268


>gi|326446609|ref|ZP_08221343.1| Cupin 4 family protein [Streptomyces clavuligerus ATCC 27064]
          Length = 385

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 81/246 (32%), Positives = 131/246 (53%), Gaps = 20/246 (8%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR---QTMDV 57
           D+WEK+PL + R+  +YY + +T   +D IL  + +   ++ ++ +  +G+    Q M  
Sbjct: 6   DYWEKKPLIVHREDPHYYTDPLTFHDVDQILSTSSV---RSSELKVVVNGREIPLQEMTA 62

Query: 58  EGRALP----QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
            G   P    ++++D YR+G ++ F       +PL +L  +L   F     AN Y TP  
Sbjct: 63  SGPGGPANGLEVLYDLYRKGSTVVFKFLHERWEPLGRLCRTLAAEFSAAFQANAYLTPAA 122

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP-NFSQEEIGTPILTVTLE 172
           AQG   HYD  + F+LQ+ G K W++Y        LP  S P   ++E  G P+   TL 
Sbjct: 123 AQGLTSHYDTHDVFVLQIWGSKHWRLY---DSPAELPLQSQPFRRTKEGPGNPVREFTLN 179

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
            GD++YLPRG +H A T  ++ SLHVT+ V Q   W  +L+ A+ +A++    +D  +R 
Sbjct: 180 AGDMMYLPRGTVHDA-TAQDQASLHVTLGV-QPVLWATVLKDALDRAIE----SDPRYRS 233

Query: 233 GLPIGY 238
            LP G+
Sbjct: 234 SLPPGF 239


>gi|405354226|ref|ZP_11023606.1| mina protein [Chondromyces apiculatus DSM 436]
 gi|397092469|gb|EJJ23227.1| mina protein [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 392

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 88/300 (29%), Positives = 145/300 (48%), Gaps = 30/300 (10%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           +HWE++PL    +  +++  L + + +D ++R       +   +T     +   ++++  
Sbjct: 21  EHWEQKPLVTRGRAPDFFAPLFSLRDVDRVIRYQKPGPGRLDLVTAGGFVRDNFLNLDNT 80

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
           A   +V+++Y +G ++     +   +PL     +L+      V    Y TPP   G  PH
Sbjct: 81  ANINLVYENYLKGSTVILSGLEETWEPLVVFCRNLEGQLSHPVAVAVYLTPPGHHGVQPH 140

Query: 121 YDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           +D  E FILQ+EG K WKVY     LPR+   Y P       ++E +   +L   L PGD
Sbjct: 141 FDTQENFILQVEGVKHWKVYGVGQELPRVEGSYTP------VARERLPELLLETDLHPGD 194

Query: 176 LLYLPRGYIHQASTVTNEH-SLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           +LY+PRG++H+A     EH SLH+T+ V+ +T W D LE     AL A    D  FRR L
Sbjct: 195 MLYVPRGFVHEAE--AREHASLHITVDVHVRT-WRDFLE----DALAAMADRDPRFRRSL 247

Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
           P G LR  G              +++   ++L+A     V L D + +  ++L+    PP
Sbjct: 248 PPGLLRDPGAVN-----------SLESGFRELVAGFHRDVRLSDALAKHAEKLIVARPPP 296


>gi|260791084|ref|XP_002590570.1| hypothetical protein BRAFLDRAFT_123630 [Branchiostoma floridae]
 gi|229275765|gb|EEN46581.1| hypothetical protein BRAFLDRAFT_123630 [Branchiostoma floridae]
          Length = 572

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 69/176 (39%), Positives = 102/176 (57%), Gaps = 7/176 (3%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           ++WEK+PL   R        Y  L +   +  +L+ + IE+ +++++  Y  G+R++++ 
Sbjct: 197 EYWEKKPLIAKRNDAAVSEAYKALFSRDVLKKLLKKHDIEYIRDVNVCRYVSGKRESLNG 256

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
             RA   QI     +   +++F  PQ +   L QL + L+ LFGC VGAN Y TPP +QG
Sbjct: 257 TERATCKQIDKLFDQSKATLQFHQPQRFQDKLWQLCSLLECLFGCLVGANVYMTPPGSQG 316

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
            APHYDD+E FILQLEG+K W++Y P  VD  LPR  S +  Q+ IG P     LE
Sbjct: 317 LAPHYDDVEVFILQLEGRKHWRLYTP-PVD--LPRDYSRDLEQDNIGQPTHDFVLE 369


>gi|115375458|ref|ZP_01462718.1| mina protein [Stigmatella aurantiaca DW4/3-1]
 gi|310821399|ref|YP_003953757.1| cupin 4 family protein [Stigmatella aurantiaca DW4/3-1]
 gi|115367501|gb|EAU66476.1| mina protein [Stigmatella aurantiaca DW4/3-1]
 gi|309394471|gb|ADO71930.1| Cupin 4 family protein [Stigmatella aurantiaca DW4/3-1]
          Length = 383

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/298 (28%), Positives = 158/298 (53%), Gaps = 31/298 (10%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNID-ITLYEDGQRQT---MD 56
           + WE++PL + +   + ++ L +S+ +  +L     +  ++I+ + L ++G+ +    + 
Sbjct: 18  EAWERKPLVL-QGPPDRWSGLFSSRDLGRLL---TYQPPRSIEGMMLVKEGRHRDENWLS 73

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
            +G    + V   +REG +I       + +P+ +  A+++E     VG N Y TPP AQG
Sbjct: 74  PDGSPRLEQVQAAWREGYTIVINKVGQFWEPVGRFCAAVEEELHHPVGVNLYMTPPGAQG 133

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
           F  H+D ++AF+LQ+EG K W+V  P+ V   LP   +   S E +   +L   L+ GD+
Sbjct: 134 FKAHFDIMDAFVLQVEGSKVWQVRGPQ-VTLPLPDEHTAT-SSESLPPVLLEQELKRGDV 191

Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           LY+PRG++H+A T    HS+H+T+ + Q   W DL   A+     AA   D  FR+GLP 
Sbjct: 192 LYIPRGFVHEARTAQT-HSVHLTLGL-QAVTWSDLFVAAI-----AAARRDERFRKGLPP 244

Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
            +L  + +              M++  ++LLA+L  +++L   + ++ ++L+    PP
Sbjct: 245 RFLEGSAM--------------MEQTFRELLAELPRHLELGHALTQLAERLVVQKPPP 288


>gi|390478066|ref|XP_003735410.1| PREDICTED: LOW QUALITY PROTEIN: MYC-induced nuclear antigen-like
           [Callithrix jacchus]
          Length = 488

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 95/332 (28%), Positives = 151/332 (45%), Gaps = 62/332 (18%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVE 58
           + WE++PL I R          +  K+ D+  L    + + +++++     G+++ ++ +
Sbjct: 57  EFWEQKPLLIRRDDPALATYCGSLFKLTDLKSLCSQGLYYGRDVNVCRCVGGKKKVLNKD 116

Query: 59  GRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
           G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG 
Sbjct: 117 GKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQGL 176

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PHYDD+E FILQLEG+K W++Y P +    L R  S   ++E IG P+   TL+  D +
Sbjct: 177 PPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFTLKEKDHV 232

Query: 178 Y-------LPRGYIHQ-----------------ASTVTNE-HSLHVTISVYQKTAWIDLL 212
                   +P G + Q                 A T+    HS HVTIS YQ  +W D L
Sbjct: 233 VQGCACELVPSGPLQQNLVSVPKDETDFVAVAEADTLAGAGHSTHVTISXYQNNSWGDFL 292

Query: 213 EKAMPKALQAAGATDLEFRRGLP----------IGYLRYAGLARGKPVDIQADRLAMKEN 262
              +   +      D+E R G+P          +   R +G  R     + ADRL   E 
Sbjct: 293 LDTISGLVFDTAEEDVELRAGIPRQLLLVESTAVATRRLSGFLR-----MLADRL---EG 344

Query: 263 LKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
            K+LL+             +M K  +   LPP
Sbjct: 345 TKELLS------------SDMKKDFIMHRLPP 364


>gi|114570978|ref|YP_757658.1| cupin 4 family protein [Maricaulis maris MCS10]
 gi|114341440|gb|ABI66720.1| Cupin 4 family protein [Maricaulis maris MCS10]
          Length = 394

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 140/300 (46%), Gaps = 31/300 (10%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
           D+ EK+PL + R+    Y  L++  +IDDI     LR+  ++  ++      ED    T 
Sbjct: 26  DYHEKKPLIVHREDPGRYAGLLSIARIDDIVSSIDLREGALDMARSEPPVQREDYMFDTG 85

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQTYI--KPLQQLNASLQELFGCFVGANTYQTPPT 113
            V+       V + YR+G +I  + PQ ++    L +   +++ L  C V  N Y TPP 
Sbjct: 86  YVDRGG----VANQYRQGATI--ILPQLHMMDAVLGEFCRAVESLLSCHVQTNIYLTPPD 139

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
            QGF  HYDD + F++Q+EG+K W+ Y   + + Y      P+    + G P+    L+ 
Sbjct: 140 NQGFNTHYDDHDVFVMQIEGEKLWRFYETPVENPYRGEGFRPD--AHKAGEPVAEFVLKA 197

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           G+ +Y+PRG +H A T  +  SLH+T+ +  KT W DL+ +A+ +      A     R  
Sbjct: 198 GECIYVPRGLMHDAQTHGDTASLHITLGLIVKT-WADLMLEAVSEVALRTPA----MRHS 252

Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
           LP G+ R              DR   +   +D+   L   + +D  +D      +   +P
Sbjct: 253 LPPGFAR-----------PDFDRTDAEVQFRDMAEMLAREMSVDGAMDFFVDSFIRSRVP 301


>gi|83859809|ref|ZP_00953329.1| hypothetical protein OA2633_07409 [Oceanicaulis sp. HTCC2633]
 gi|83852168|gb|EAP90022.1| hypothetical protein OA2633_07409 [Oceanicaulis sp. HTCC2633]
          Length = 387

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 83/255 (32%), Positives = 122/255 (47%), Gaps = 35/255 (13%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           +E+  L+     +N + +LI+   ID IL ++           L  +G       E R L
Sbjct: 22  FEQTHLHAPGTDRNRFASLISLDAIDRILAED-----------LLREGDLSMARAEPR-L 69

Query: 63  PQIVWDH-------------YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
           P   W               Y++G ++     Q   +PL  L   L+  F C V  N Y 
Sbjct: 70  PDRAWLREDGLVDRGEVARLYQQGATLILPQLQARHRPLADLCRQLEAEFSCPVQTNIYL 129

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILT 168
           TPP AQGF  HYD+ +  +LQ+EG K+W++Y  P  V     R++   F+Q E   P   
Sbjct: 130 TPPNAQGFQTHYDNHDVLVLQVEGSKRWRLYDAPVGVPYRGERFTPGRFAQTE---PRAE 186

Query: 169 VTLEPGDLLYLPRGYIHQA-STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATD 227
           + L PGD+LY+PRG +H A +  ++E SLH+T  +  KT W D L +A+ +    A    
Sbjct: 187 LVLNPGDVLYVPRGLMHDAVNEGSDEASLHITTGLLAKT-WADFLLEAVSE----AALRT 241

Query: 228 LEFRRGLPIGYLRYA 242
            + RR LP GY R A
Sbjct: 242 PQLRRALPPGYARGA 256


>gi|405354225|ref|ZP_11023605.1| hypothetical protein A176_7050 [Chondromyces apiculatus DSM 436]
 gi|397092468|gb|EJJ23226.1| hypothetical protein A176_7050 [Myxococcus sp. (contaminant ex DSM
           436)]
          Length = 383

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 87/302 (28%), Positives = 150/302 (49%), Gaps = 39/302 (12%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNI-DITLYEDGQRQT---MD 56
           + WE++PL +       +++L  S+ +  +L        ++I D+ L  +G+ +    + 
Sbjct: 18  ETWERKPLVLP-GPPGRWSDLFASRDLGRLL---AYRTPRSIEDMVLVREGRHRDENWLA 73

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
            +G    + V   +REG ++       + +P+ +  A+++E     VGAN Y TPP AQG
Sbjct: 74  PDGSPRLEQVQAAWREGFTLVINRLGRFWEPVGRFCAAVEEELHHPVGANLYMTPPGAQG 133

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE---- 172
           F  H+D ++AF+LQ+EG K W+V  P+ V   LP    P+    E+ +P+  V +E    
Sbjct: 134 FKAHFDVMDAFVLQVEGAKVWQVRGPQRV---LP---LPDEHTTELSSPLPPVLMEHELK 187

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
            G++LY+PRG+IH+A     EHS+H+T+ + Q   W +LL  A+     A    D   RR
Sbjct: 188 SGEVLYIPRGFIHEARA-AREHSVHLTLGL-QAITWEELLRAAI-----AVARRDERLRR 240

Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
           GLP  +L                   M    + LL +L   ++LD  + ++ ++L+    
Sbjct: 241 GLPPRFLEGPA--------------TMAPMFQALLEELPRQLELDAALTQLAERLVVQKP 286

Query: 293 PP 294
           PP
Sbjct: 287 PP 288


>gi|317123668|ref|YP_004097780.1| cupin 4 family protein [Intrasporangium calvum DSM 43043]
 gi|315587756|gb|ADU47053.1| cupin 4 family protein [Intrasporangium calvum DSM 43043]
          Length = 411

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 99/314 (31%), Positives = 146/314 (46%), Gaps = 33/314 (10%)

Query: 1   DHWEKEPLY-ISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQT 54
           DHW ++PL  +       + +L + + +D++     LR   +   KN   TL +      
Sbjct: 33  DHWGRQPLVSLGADLAGGFEDLFSLEAVDELVSRRGLRTPFVRVAKNGQ-TLPDSAFTAG 91

Query: 55  MDVEGRALPQIVWDH----YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
             V      Q+  D     + +G +I          P+    A+L E  G  V AN Y T
Sbjct: 92  GGVGAGIADQVSDDKLLRLFADGATIVLQGLHRTWAPVSDFVAALAEDLGHPVQANAYVT 151

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG-----TP 165
           P  +QGF  HYD  + F+LQ+ G+K W++ LP  V  +  R       +E +G      P
Sbjct: 152 PRQSQGFNDHYDVHDVFVLQVAGEKHWRIRLP--VHLWPTRDQPWTAYRERVGQAAAEPP 209

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
           +L VTL  GD LYLPRG++H A T T E S H+T+ V+  T    L E  + +A+ A   
Sbjct: 210 LLDVTLRAGDCLYLPRGFLHSA-TATEEVSAHLTLGVHTWTR-AHLAEALVAEAMHA--L 265

Query: 226 TDLEFRRG-LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG 284
            D E  R  LP+      G+  G P  + AD +     L+++LA LVE +D D     +G
Sbjct: 266 VDAESERAPLPL------GVDVGNPAGV-ADEVT---RLREVLAGLVEGIDPDRVAARLG 315

Query: 285 KQLMHDALPPVLSP 298
           +Q    A P  LSP
Sbjct: 316 RQARAAARPGPLSP 329


>gi|359458845|ref|ZP_09247408.1| cupin 4 family protein [Acaryochloris sp. CCMEE 5410]
          Length = 405

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 74/252 (29%), Positives = 128/252 (50%), Gaps = 31/252 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM----DV 57
           +WE + LY+ R   ++Y +++    ID +L++  +    N +  L + G + ++    D 
Sbjct: 23  YWETKTLYLPRNDASFYESVLNPDDIDLLLQNKALLADYN-NFRLVDQGNKLSLEDWCDR 81

Query: 58  EGRALPQIV-----WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
             ++    +     +    +G ++        I  L+   ++L+      +  N Y TPP
Sbjct: 82  HSKSQQYFINNDRLYSLLHQGLTLTINGAHKKIPKLRHFCSALECELKFKLRTNIYITPP 141

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMV-DEYLPRYSSPNFSQEEIGTPI 166
            AQG APHYD+ + FILQ+ G K+WK+Y     LP  + D+ + R+        E+  P 
Sbjct: 142 QAQGLAPHYDEHDVFILQITGAKEWKLYHSPVELPSHIRDQSIGRH--------ELAEPE 193

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           LTV L+PGDLLY+PRG +HQA++     S+H ++ +Y   A+ +LLE+     L      
Sbjct: 194 LTVMLQPGDLLYIPRGVVHQAASQETT-SVHASLGLYPTFAY-ELLEE-----LVTIAQA 246

Query: 227 DLEFRRGLPIGY 238
           D  FR+ +P G+
Sbjct: 247 DPAFRKAIPHGF 258


>gi|144900526|emb|CAM77390.1| MYC induced nuclear antigen [Magnetospirillum gryphiswaldense
           MSR-1]
          Length = 390

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 87/289 (30%), Positives = 137/289 (47%), Gaps = 34/289 (11%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDIL-------RDNVIEFKKNIDITLYEDGQRQ 53
           ++WEK+PL + R    +Y +L++ + ID +L       RD  I   +  D    E+   +
Sbjct: 21  EYWEKKPLLVKRAAPGFYRDLLSVQAIDQVLAMPGLHRRD--IRVARGTDPLAVEEYADK 78

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
              +   +L ++  D    G +I        ++PL ++  + +++       N Y TP  
Sbjct: 79  DGFINAASLSRLFTD----GFTIILNTLNLKLRPLAEICRAFEQVLSIPCQTNIYYTPRL 134

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP-RYSSPNFSQEEIGTPILTVTLE 172
           AQGF PHYD  + F+ Q+ G+K W V         LP R         E G   +   LE
Sbjct: 135 AQGFKPHYDSHDVFVFQVAGRKHWLV---NDTPVELPLRGQGFEAGLYEPGDVTMEFDLE 191

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
           PGDLLY+PRG +H A T ++E SLH+T+     T+W     + + +A+ AA  TD+E RR
Sbjct: 192 PGDLLYIPRGVMHGART-SDEVSLHITLGAL-TTSW----AEVLLEAVAAAALTDVELRR 245

Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
            LP GY            D QA R    +    LL ++ E +D++  +D
Sbjct: 246 NLPAGYALPG-------YDAQAAR----QTFASLLNRVAENIDVESILD 283


>gi|158335076|ref|YP_001516248.1| cupin 4 family protein [Acaryochloris marina MBIC11017]
 gi|158305317|gb|ABW26934.1| cupin 4 family protein, putative [Acaryochloris marina MBIC11017]
          Length = 416

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 130/252 (51%), Gaps = 31/252 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM----DV 57
           +WE + LY+ R   ++Y +++  + ID +L++  +    N +  L + G + ++    D 
Sbjct: 34  YWETKTLYLPRNDASFYGSVLQPEDIDLLLQNKALLADYN-NFRLVDQGNKLSLEDWCDR 92

Query: 58  EGRALPQIV-----WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
             ++    +     +    +G ++        I  L+   ++L+      +  N Y TPP
Sbjct: 93  HSKSQQYFINNDKLYSLLHQGLTLTINGAHKKIPKLRHFCSALECELKFKLRTNIYITPP 152

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMV-DEYLPRYSSPNFSQEEIGTPI 166
            AQG APHYD+ + FILQ+ G+K+WK+Y     LP  + D+ + R+        ++  P 
Sbjct: 153 QAQGLAPHYDEHDVFILQITGEKEWKLYHSPVELPSHIRDQSIGRH--------KLAEPE 204

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           LTV L+PGDLLY+PRG +HQA++     S+H ++ +Y   A+ +LLE+     L      
Sbjct: 205 LTVMLQPGDLLYIPRGVVHQAAS-QETTSVHASLGLYPTFAY-ELLEE-----LVTIAQA 257

Query: 227 DLEFRRGLPIGY 238
           D  FR+ +P G+
Sbjct: 258 DPAFRKAIPHGF 269


>gi|389865125|ref|YP_006367366.1| hypothetical protein MODMU_3475 [Modestobacter marinus]
 gi|388487329|emb|CCH88887.1| Conserved protein of unknown function; putative cupin and
           clavaminate synthase domains [Modestobacter marinus]
          Length = 443

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 71/252 (28%), Positives = 116/252 (46%), Gaps = 21/252 (8%)

Query: 1   DHWEKEPLYISRKQ-KNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQ 51
           +HW + PL  + +Q  + + +L+T   +D++L     R   +   K+   +D   +   Q
Sbjct: 37  EHWARRPLLSTAEQLGSTFTDLLTLDAVDELLSTRGLRTPFLRIAKDGAVVDTKRFTSPQ 96

Query: 52  RQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
               +V  +     V   + +G ++          PL      L    G     N Y TP
Sbjct: 97  GAGAEVADQVSSDAVLRLFADGSTVVLQGLHRLWPPLIAFAGQLAADLGHPTQVNAYITP 156

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTP 165
           P+++GF+PHYD  + F+LQ+ G+K W ++ P + D   P  + P        +      P
Sbjct: 157 PSSRGFSPHYDVHDVFVLQVAGEKHWTIHEPLLAD---PLRTHPWADRAAEVAAAAERAP 213

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
           ++   L PGD LYLPRGYIH A  +  E S H+T+ V+  T W   +E A+   ++   A
Sbjct: 214 VIDTVLRPGDALYLPRGYIHSAVAL-GEISAHLTVGVHSVTRW-GAVESAL-DLVRTLAA 270

Query: 226 TDLEFRRGLPIG 237
            D   R  LP+G
Sbjct: 271 DDPALRGSLPLG 282


>gi|320167141|gb|EFW44040.1| hypothetical protein CAOG_02065 [Capsaspora owczarzaki ATCC 30864]
          Length = 522

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 81/142 (57%), Gaps = 6/142 (4%)

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP--RY 152
           L+E+FG     N Y TP  ++GF PHYD    FI QL G K W+VYLP  V   LP    
Sbjct: 113 LEEIFGVHTNGNLYSTPKFSRGFTPHYDVHSTFIFQLSGNKDWEVYLPETV---LPIGGL 169

Query: 153 SSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
            S N+S   +G P+++ +L  GDLLYLPRG++HQA T T+  SLH+T+S+    A++  +
Sbjct: 170 LSQNYSGRALGEPVMSFSLRKGDLLYLPRGWVHQA-TATDAPSLHLTVSLGHTMAYMTAV 228

Query: 213 EKAMPKALQAAGATDLEFRRGL 234
             A  +    +   D  F + +
Sbjct: 229 LDACMEVAAGSKLADSPFVQAI 250


>gi|310821398|ref|YP_003953756.1| cupin 4 family protein [Stigmatella aurantiaca DW4/3-1]
 gi|309394470|gb|ADO71929.1| Cupin 4 family protein [Stigmatella aurantiaca DW4/3-1]
          Length = 421

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 87/306 (28%), Positives = 150/306 (49%), Gaps = 41/306 (13%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKK----NIDITLYEDG--QRQT 54
           ++WE++PL    + + ++  L + + +D      VI ++K     +D+ + E G  +   
Sbjct: 41  EYWEQKPLVTRGRAQGFFAPLFSIRDVD-----RVICYQKPGPGRLDL-VTEGGFVRDNF 94

Query: 55  MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           ++++  A   +V+++Y +G ++     +   +PL      L+      V    Y TPP  
Sbjct: 95  LNLDNTANINLVYENYLKGSTVILSGLEETWEPLVVFCRKLEGQLSHPVAVAVYLTPPNH 154

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            G  PH+D  E FILQ++G K WKVY     LPR+   Y P       ++E +   +L  
Sbjct: 155 HGVQPHFDTQENFILQVDGVKHWKVYGAGQELPRVEGSYTP------VARERLPELLLET 208

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
            L PGD+LY+PRG++H+A    +  SLH+T+ V+ +T W D LE     AL A    +  
Sbjct: 209 ELHPGDMLYVPRGFVHEAEA-RDSASLHITVDVHVRT-WRDFLE----DALAAMADRNPR 262

Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
           FR+ LP       GL  G     Q     ++E  ++L+  +   V L D + +  ++L+ 
Sbjct: 263 FRKSLP------PGLLNGSHAKAQ-----LEEGFRELMEMVHREVRLSDALGKHAEKLI- 310

Query: 290 DALPPV 295
            A PP+
Sbjct: 311 VARPPL 316


>gi|89052876|ref|YP_508327.1| cupin 4 [Jannaschia sp. CCS1]
 gi|88862425|gb|ABD53302.1| Cupin 4 [Jannaschia sp. CCS1]
          Length = 392

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 133/282 (47%), Gaps = 39/282 (13%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE-- 58
           +++EK+P+ I R Q  Y+++L++  +ID ++    +   + I++T   DG     D    
Sbjct: 22  EYFEKKPMLIKRGQPGYFSDLLSYGEIDRVVSTMGLHVPE-INVT-RADGNITPADFAYE 79

Query: 59  -GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
            G+  P  V   + +G ++        +  L +   +++      V  N Y TPP  QGF
Sbjct: 80  TGQIDPVRVNQLHADGATVILSGLHERLPALARYCRAMEAAMSARVQTNIYMTPPGNQGF 139

Query: 118 APHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
            PHYD  +  +LQ+ G K+W++Y     LP + D+   R         ++G       LE
Sbjct: 140 NPHYDGHDVLVLQVAGTKEWRIYGTPVELP-LADQAFER-------GMDVGEEAQRFVLE 191

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
           PGD +Y+PRG  H A   T+E SLH+T  +  +T W D    A+ +A+ A    +   RR
Sbjct: 192 PGDAVYIPRGMAHDA-VATDETSLHITTGLMFRT-WAD----ALAEAVIAKAHREPSLRR 245

Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV 274
            LP G+           VD+        ++ KD  A+L+E V
Sbjct: 246 ALPPGFANNG-------VDL--------DDYKDTFAELIELV 272


>gi|156378502|ref|XP_001631181.1| predicted protein [Nematostella vectensis]
 gi|156218217|gb|EDO39118.1| predicted protein [Nematostella vectensis]
          Length = 218

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 56/142 (39%), Positives = 82/142 (57%), Gaps = 3/142 (2%)

Query: 198 VTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRL 257
           +T+S YQK  W D LE  +P+ALQ A   D EFR+GLP+ YL Y+G+A    V    +R 
Sbjct: 1   MTLSTYQKNTWGDFLEMLVPQALQVAFEEDPEFRQGLPLNYLNYSGVANSDTV--TKERS 58

Query: 258 AMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGE 317
                ++ L+ KL+ +  +D   D+M  +++ DALPPVL+  E   SV+ +G    + G+
Sbjct: 59  DFLRRVEKLMTKLISHAPVDAAADQMAVRMLQDALPPVLTEAERSHSVYGSGASW-EDGK 117

Query: 318 VYNATQITKDTKVRLVRANAVR 339
           + N   IT DT VRL+R    R
Sbjct: 118 IVNMMTITGDTDVRLIRRGVAR 139


>gi|152965307|ref|YP_001361091.1| cupin [Kineococcus radiotolerans SRS30216]
 gi|151359824|gb|ABS02827.1| Cupin 4 family protein [Kineococcus radiotolerans SRS30216]
          Length = 434

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 74/258 (28%), Positives = 119/258 (46%), Gaps = 34/258 (13%)

Query: 1   DHWEKEPLYI------SRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ--- 51
           +HW   PL +      +   +   ++L++   +D++L    +   +    +L +DG    
Sbjct: 37  EHWNTRPLLVRAADRAAEGGRASVHDLLSPADVDELLGPRAL---RTPFFSLVQDGTPLP 93

Query: 52  RQTMDVEGRALPQIVWD---------HYREGCSIRFLNPQTYIKPLQQLNASLQELFGCF 102
           R +      A  Q + D          +  G +I           LQ   + L    G  
Sbjct: 94  RSSYTRRAVAGNQQLADLPDTDRVAAAHAGGATIVLQALHRTWPALQTFCSQLAADLGHQ 153

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPN--FSQE 160
              N Y TPP AQGF PH+D  +  +LQ++G+K W ++ P +    LP  S P+     +
Sbjct: 154 CQVNVYVTPPGAQGFKPHHDTHDVVVLQVDGRKHWTIHPPAV---ELPLKSQPSTQLGPD 210

Query: 161 EIG--TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPK 218
            +G   P +   LEPGD LYLPRG++H A T T + S+H+T+ +   T W D+L  A+  
Sbjct: 211 PVGGRPPAIDTVLEPGDALYLPRGWLHSART-TEDRSIHLTVGLLATT-WADVLTDAV-- 266

Query: 219 ALQAAGATDLEFRRGLPI 236
              +AG  D+  RR LP+
Sbjct: 267 --ASAGVADVALRRALPL 282


>gi|397628413|gb|EJK68879.1| hypothetical protein THAOC_09906, partial [Thalassiosira oceanica]
          Length = 727

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/154 (40%), Positives = 83/154 (53%), Gaps = 21/154 (13%)

Query: 111 PPTAQG--FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-----G 163
           PPTA+     P         ++LEG K+WKVY P    E LPR SS +++++EI      
Sbjct: 337 PPTARARRLGPLPPQPARVRVRLEGYKRWKVYAPFNKRETLPRESSRDYTEKEIEDAEET 396

Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVT------------NEHSLHVTISVYQKTAWIDL 211
           +  +T  L PGD+LYLPRG++HQA TV             ++HSLH+T+S  Q   W D 
Sbjct: 397 SEAMTCVLGPGDILYLPRGWVHQAETVARPAYMPRLPGVKDDHSLHLTVSAMQNWCWADY 456

Query: 212 LEKAMPKALQAAGATD--LEFRRGLPIGYLRYAG 243
           LE  MP AL AA  +D     R GLP  +L Y G
Sbjct: 457 LEILMPGALDAAVGSDKTTALREGLPRNFLEYMG 490


>gi|324511549|gb|ADY44803.1| Lysine-specific demethylase NO66 [Ascaris suum]
          Length = 259

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 62/182 (34%), Positives = 101/182 (55%), Gaps = 13/182 (7%)

Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
           TPI    +E GDLLY+PRG+IHQ  T  + HSLH+T+SV ++TA++DLLE+ +P  + + 
Sbjct: 5   TPIFDGWIEHGDLLYIPRGFIHQGYTEASIHSLHLTVSVCRRTAFVDLLERFLPLVMGSL 64

Query: 224 GATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLD------ 277
              +L  R  LPIGYL   G+    P+  Q +  A  + + ++L K +  +  D      
Sbjct: 65  AEKNLSMRSSLPIGYLDMGGVL---PLSYQLNNKA-SDKIFNVLDKQLAALKKDALDAYE 120

Query: 278 DGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANA 337
             +D M ++ +  ALPP+L+ EE Q S    G+      +     + T +T+++ +R +A
Sbjct: 121 SAIDMMAREYLKTALPPLLTGEERQLSAI-GGVGFDILCK--KQKKFTVNTQIKFIRRHA 177

Query: 338 VR 339
            R
Sbjct: 178 QR 179


>gi|443327728|ref|ZP_21056347.1| Cupin superfamily protein [Xenococcus sp. PCC 7305]
 gi|442792718|gb|ELS02186.1| Cupin superfamily protein [Xenococcus sp. PCC 7305]
          Length = 396

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 77/281 (27%), Positives = 133/281 (47%), Gaps = 28/281 (9%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           ++EKE  YISR  +NYYN ++ ++ ID   ++  ++      + +   G         R 
Sbjct: 21  YYEKEYFYISRHDENYYNGILNTQDIDLFFQNQSLQ---TSGLRVCNKGGEIPASQWRRG 77

Query: 62  LPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           L QI     ++    +G ++   +  + I  L      L+      + +N Y TPP AQG
Sbjct: 78  LSQIADNDKLFVLLNQGNTLIINSGNSSIIKLINYCHVLENELNFRLQSNIYITPPDAQG 137

Query: 117 FAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           FA HYDD + FI+Q  G K W++Y  P  +      ++     + E+G P     ++PGD
Sbjct: 138 FAYHYDDHDVFIMQTTGTKTWRLYHSPIEIPTKKQNHAKFMKGKYELGEPTFEQEIKPGD 197

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LLY+PRG +H A T T++ S+H+T+ ++    + +LL++ +  A       +L FR+ +P
Sbjct: 198 LLYIPRGLVHDAVT-TDQASVHITLGLHPSYRF-ELLQEIVNLA-----QDNLLFRKAIP 250

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDL 276
                        P   +  + A K   K L    ++ VD+
Sbjct: 251 ------------NPFTPEYQKKAFKAEFKQLCHDFIDNVDI 279


>gi|374610812|ref|ZP_09683602.1| transcription factor jumonji jmjC domain-containing protein
           [Mycobacterium tusciae JS617]
 gi|373550228|gb|EHP76876.1| transcription factor jumonji jmjC domain-containing protein
           [Mycobacterium tusciae JS617]
          Length = 405

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 131/290 (45%), Gaps = 40/290 (13%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDI---TLYEDG 50
           ++W ++PL +SR       +++L++   +D+++     R   I   K  D+   T Y   
Sbjct: 16  EYWGRQPL-LSRHTALPRDFSDLLSPTAVDELIAERGVRAPFIRMAKTGDVLARTCYTGP 74

Query: 51  QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
                ++  +     V   +  G +I          PL      L +  G  V AN Y T
Sbjct: 75  AGFGAEMPDQVDSTKVLSEFAAGATIVLQGLHRLWPPLIDFVRGLTDDLGHPVQANAYIT 134

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI------GT 164
           PP AQGF PHYD  + F+LQ  G+K+W V+ P     + P  S P     +         
Sbjct: 135 PPDAQGFEPHYDVHDVFVLQAAGEKRWTVHEPV---HHHPLDSQPWTEHRDAIAARISDA 191

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
           PI+   L+ GD LYLPRG++H A +  N  S+H+TI V   T ++D++     +A+    
Sbjct: 192 PIIDTVLQAGDALYLPRGWVHSARS-ENTTSIHLTIGVSALT-YLDVV-----RAVVDTL 244

Query: 225 ATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV 274
           A D EFR+ LP+G            +D   DR  M      ++A+L + V
Sbjct: 245 AADDEFRKSLPMG------------IDTT-DRSEMASTTSKVMAQLADAV 281


>gi|258653233|ref|YP_003202389.1| cupin [Nakamurella multipartita DSM 44233]
 gi|258556458|gb|ACV79400.1| Cupin 4 family protein [Nakamurella multipartita DSM 44233]
          Length = 452

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 81/284 (28%), Positives = 123/284 (43%), Gaps = 34/284 (11%)

Query: 2   HWEKEPLYISRKQKNY-YNNLITSKKIDDI-----LRDNVIEFKKNIDI---TLYEDGQR 52
           +W + PL  +  + N  +++L ++  +D++     LR   +   KN  +     +  G  
Sbjct: 38  YWAQAPLLTTAAELNDDFSDLFSADSVDELVSERGLRTPFLRMAKNGSVLSSASFTRGGG 97

Query: 53  QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
               +  +     V      G ++          PL +  + L    G  V  N Y TPP
Sbjct: 98  AGATITDQVADDKVLAQLAGGATLVLQALHRTWPPLVRFGSELAAELGHPVQINAYITPP 157

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS----SPNFSQEEIGTPILT 168
             QGFA HYD  + F+LQ+ G K W+++ P + D  LP  +               P + 
Sbjct: 158 QNQGFASHYDTHDVFVLQIAGTKHWRIHEPVLPDP-LPHQTWDGRRAQVQDRAAQAPAID 216

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL 228
             L PGD LYLPRGY+H A     E S+H+TI V+  T + DL  +     L AA   D 
Sbjct: 217 ALLRPGDALYLPRGYLHSA-VAQGELSIHLTIGVHPLTGY-DLARE-----LIAAAEDDP 269

Query: 229 EFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE 272
           E RR LP+G            VD+  D  AM  +L+    +LV+
Sbjct: 270 ELRRSLPMG------------VDV-TDVDAMATHLRQAAQRLVD 300


>gi|325185748|emb|CCA20228.1| nucleolar protein putative [Albugo laibachii Nc14]
          Length = 287

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 72/136 (52%)

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           +  TLE GD+LY+PRG IHQA T     S HVTISVYQ+ +W + +E AMP+ +  A A 
Sbjct: 1   MEFTLEQGDVLYMPRGVIHQAHTDNKVFSTHVTISVYQRQSWSNFMELAMPRIIHRAFAN 60

Query: 227 DLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQ 286
           D+EFRRG+P  YL + G           D     + L   L++ +E  DL    D     
Sbjct: 61  DVEFRRGVPPQYLNFLGSQYSGSTQSVKDFCNTLKQLVSKLSENLEIQDLQHAADLASLD 120

Query: 287 LMHDALPPVLSPEELQ 302
            +   LPP  +PE  Q
Sbjct: 121 FITHRLPPPDTPESSQ 136


>gi|83717208|ref|YP_439902.1| hypothetical protein BTH_II1708 [Burkholderia thailandensis E264]
 gi|167616559|ref|ZP_02385190.1| hypothetical protein BthaB_09673 [Burkholderia thailandensis Bt4]
 gi|257143062|ref|ZP_05591324.1| hypothetical protein BthaA_28128 [Burkholderia thailandensis E264]
 gi|83651033|gb|ABC35097.1| unnamed protein product [Burkholderia thailandensis E264]
          Length = 397

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 72/244 (29%), Positives = 115/244 (47%), Gaps = 21/244 (8%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDIT--LYEDGQRQTMDVEG 59
           +W ++PL + R+  + Y  L  S++   +L       +    I   +        +  EG
Sbjct: 22  YWGRKPLIVRRQAPHLYACLPDSEEFAFLLHSLTDPERGWFSIVNGVARPPSDSLLTQEG 81

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCF-------VGANTYQTPP 112
                 V+  YR+G S+     Q   +    L   ++             +GAN Y +PP
Sbjct: 82  LLNLSEVYAAYRDGNSLLMNQVQRRHRETAMLCRRIESALSAHGIALARHIGANGYLSPP 141

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
           ++QGF  HYD  +  ILQ+EG+K W++Y   +     P  +      EE G+P     L 
Sbjct: 142 SSQGFNIHYDPHDVLILQIEGRKHWRLYGRHVAWPTQPPATP--IPPEEAGSPRREFVLS 199

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
           PG+L+Y+PRG +H A+T T+  SLH+T+S+   T W DLL +AM        + +  FRR
Sbjct: 200 PGELVYIPRGVLHDANT-TDSRSLHLTLSIETLT-WTDLLIEAM--------SDNPAFRR 249

Query: 233 GLPI 236
            LP+
Sbjct: 250 NLPV 253


>gi|148665809|gb|EDK98225.1| myc induced nuclear antigen, isoform CRA_a [Mus musculus]
          Length = 242

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 93/171 (54%), Gaps = 13/171 (7%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L +   +  + +  V  + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSISGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GRA   Q+  D  ++  +I+F  PQ Y   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTP 165
             PHYDD+E FILQLEG K W++Y P   +  EY         S++ IGTP
Sbjct: 176 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTP 220


>gi|148665811|gb|EDK98227.1| myc induced nuclear antigen, isoform CRA_c [Mus musculus]
 gi|148665812|gb|EDK98228.1| myc induced nuclear antigen, isoform CRA_c [Mus musculus]
          Length = 243

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 93/171 (54%), Gaps = 13/171 (7%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L +   +  + +  V  + +++++     G+++ ++ 
Sbjct: 58  EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSISGKKKVLNK 116

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +GRA   Q+  D  ++  +I+F  PQ Y   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 117 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 176

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTP 165
             PHYDD+E FILQLEG K W++Y P   +  EY         S++ IGTP
Sbjct: 177 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTP 221


>gi|115375463|ref|ZP_01462723.1| chromosome 14 open reading frame 169, putative [Stigmatella
           aurantiaca DW4/3-1]
 gi|115367506|gb|EAU66481.1| chromosome 14 open reading frame 169, putative [Stigmatella
           aurantiaca DW4/3-1]
          Length = 355

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 75/246 (30%), Positives = 121/246 (49%), Gaps = 29/246 (11%)

Query: 55  MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           ++++  A   +V+++Y +G ++     +   +PL      L+      V    Y TPP  
Sbjct: 29  LNLDNTANINLVYENYLKGSTVILSGLEETWEPLVVFCRKLEGQLSHPVAVAVYLTPPNH 88

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            G  PH+D  E FILQ++G K WKVY     LPR+   Y P       ++E +   +L  
Sbjct: 89  HGVQPHFDTQENFILQVDGVKHWKVYGAGQELPRVEGSYTP------VARERLPELLLET 142

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
            L PGD+LY+PRG++H+A    +  SLH+T+ V+ +T W D LE     AL A    +  
Sbjct: 143 ELHPGDMLYVPRGFVHEAEA-RDSASLHITVDVHVRT-WRDFLE----DALAAMADRNPR 196

Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
           FR+ LP       GL  G     Q     ++E  ++L+  +   V L D + +  ++L+ 
Sbjct: 197 FRKSLP------PGLLNGSHAKAQ-----LEEGFRELMEMVHREVRLSDALGKHAEKLI- 244

Query: 290 DALPPV 295
            A PP+
Sbjct: 245 VARPPL 250


>gi|375138113|ref|YP_004998762.1| hypothetical protein [Mycobacterium rhodesiae NBB3]
 gi|359818734|gb|AEV71547.1| hypothetical protein MycrhN_0917 [Mycobacterium rhodesiae NBB3]
          Length = 402

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 77/253 (30%), Positives = 121/253 (47%), Gaps = 27/253 (10%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDI---TLYEDG 50
           DHW ++PL +SR       +++L++   +D+++     R   I   K  D+   + Y   
Sbjct: 16  DHWGRKPL-LSRHTALPRDFSDLLSPAAVDELIAERGVRAPFIRMAKTGDVLARSCYTGP 74

Query: 51  QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
                ++  +     V   +  G +I          PL     +L +  G  V AN Y T
Sbjct: 75  AGFGAEMPDQVDSAKVLAEFATGATIVLQGLHRLWPPLIDFVRALTDDLGHPVQANAYIT 134

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT------ 164
           PP AQGF PHYD  + F+LQ+ G+K+W V+ P  V  + P  S P     +  T      
Sbjct: 135 PPAAQGFEPHYDVHDVFVLQVAGEKRWTVHEP--VHPH-PLDSQPWTDHRDAVTARSRDN 191

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
             +   L  GD LYLPRG++H A +  +  S+H+TI V   T ++D++     +A+    
Sbjct: 192 AAIDTVLNAGDALYLPRGWVHSARS-EDTTSIHLTIGVSALT-YLDVV-----RAVVDTL 244

Query: 225 ATDLEFRRGLPIG 237
           ATD EFR+ LP+G
Sbjct: 245 ATDDEFRKSLPMG 257


>gi|84497464|ref|ZP_00996286.1| hypothetical protein JNB_14758 [Janibacter sp. HTCC2649]
 gi|84382352|gb|EAP98234.1| hypothetical protein JNB_14758 [Janibacter sp. HTCC2649]
          Length = 396

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 137/299 (45%), Gaps = 31/299 (10%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL--RDNVIEFKKNI-DITLYEDGQRQTM 55
           D W + P +++R  +    + +L ++  +D+++  R     F +   D T   DGQ    
Sbjct: 9   DVWSRSP-HLARAGRLPRDFADLFSADAVDELVAHRGLRTPFARMARDGTTLPDGQFTLG 67

Query: 56  DVEGRALP-QIVWDH----YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
              G  +  Q+  D     +  G ++          P+Q   A L +  G  V  N Y T
Sbjct: 68  GGVGAGITDQLSEDRILREFAAGATLVLQALHRTWAPIQTFAADLSDDLGHPVQVNAYVT 127

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGT 164
           PP  QGFA HYD  + F+LQ+ G+K+W++  P +     P    P         Q     
Sbjct: 128 PPQNQGFADHYDVHDVFVLQIAGEKRWRIRPPVL---EAPLRDQPWEQHRAAVEQAATAE 184

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
           P+L  TL PGD LYLPRGY+H A+ +    S+H+T+ V+  T     L  A+   + A  
Sbjct: 185 PLLEETLRPGDCLYLPRGYLHSATALGGT-SIHLTMGVHTWTR--RHLVDALLAGIGARV 241

Query: 225 ATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEM 283
           + D+E RR LP+     AG  +   + I  D     E ++ L+AK +  + + + +D +
Sbjct: 242 SDDVEARRSLPL----RAGTGQSSELAIDDD----AEIVRSLVAKALSDLSMGELLDTL 292


>gi|349604191|gb|AEP99810.1| hypothetical protein, partial [Equus caballus]
          Length = 229

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/146 (36%), Positives = 81/146 (55%), Gaps = 2/146 (1%)

Query: 194 HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQ 253
           HSLH+T+S YQ+  W D LE  +P A+QAA   ++EFRRGLP  ++ Y G       D +
Sbjct: 4   HSLHLTLSTYQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYMGAQHSDSKDPR 63

Query: 254 ADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMS 313
             R A  E ++ L+A+L  +  +D   D+  K  +HD+LPPVL+  E   SV+   +R  
Sbjct: 64  --RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERALSVYGLPIRWE 121

Query: 314 QTGEVYNATQITKDTKVRLVRANAVR 339
               V    Q+T +T+V +++    R
Sbjct: 122 AGEPVNVGAQLTTETEVHMLQDGIAR 147


>gi|284991701|ref|YP_003410255.1| Cupin 4 family protein [Geodermatophilus obscurus DSM 43160]
 gi|284064946|gb|ADB75884.1| Cupin 4 family protein [Geodermatophilus obscurus DSM 43160]
          Length = 436

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 72/290 (24%), Positives = 130/290 (44%), Gaps = 34/290 (11%)

Query: 1   DHWEKEPLYI-SRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQ 51
           ++W + PL   + +    + +L+    +D++L     R   +   K+   +D   +    
Sbjct: 31  EYWARRPLLTRAEETGGSFADLLDLAAVDELLSRRGLRTPFLRIAKDGAVVDPKRFTTSG 90

Query: 52  RQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
               +V  +     V   + +G ++          PL +    L    G     N Y TP
Sbjct: 91  GAGAEVADQVSSDAVLRLFADGSTVVLQGLHRLWPPLIEFADQLAADLGHPTQVNAYVTP 150

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT------P 165
           P+++GF+PHYD  + F+LQ+ G+K+W+++ P + D   P  + P   +           P
Sbjct: 151 PSSRGFSPHYDVHDVFVLQVAGEKRWRIHEPVLTD---PLRTQPWNERGAAVAAAAEREP 207

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
           ++   L PGD LYLPRGY+H A T     S H+T+ ++  T W    E A+   ++    
Sbjct: 208 LIDAVLRPGDALYLPRGYLHSA-TALGAISAHLTVGIHSVTRWA-AAESAL-DLVRVLAT 264

Query: 226 TDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD 275
            D + RR LP+G            VD+ AD  A+ +++  ++  L  ++D
Sbjct: 265 EDPQLRRSLPLG------------VDL-ADPAAVADDVATVVTALKGWLD 301


>gi|383780578|ref|YP_005465144.1| hypothetical protein AMIS_54080 [Actinoplanes missouriensis 431]
 gi|381373810|dbj|BAL90628.1| hypothetical protein AMIS_54080 [Actinoplanes missouriensis 431]
          Length = 401

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 81/281 (28%), Positives = 121/281 (43%), Gaps = 29/281 (10%)

Query: 3   WEKEPLYISRKQ----KNYYNNLITSKKIDDIL-----RDNVIEFKKNIDI---TLYEDG 50
           W +EPL +SR       + + +L++   +D++L     R   +   ++ ++     +  G
Sbjct: 20  WGREPL-LSRAADLGGPDGFTDLLSPAAVDELLSRRGLRTPFLRVARDGEVLPPARFTGG 78

Query: 51  QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
                +V  + L       Y  G ++          PL      L       V  N Y T
Sbjct: 79  GGAGAEVTDQVLDDRAMHLYATGATLVLQGLHRLWPPLIAFAGELGAALNRPVQLNAYLT 138

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEIGTPIL 167
           PP +QGFA HYD  + F+LQ +G K+W V+ P + D  E  P     +  +    G P L
Sbjct: 139 PPGSQGFATHYDTHDVFVLQADGAKRWCVHPPVLPDPLERQPWGGRADEVAATAAGEPAL 198

Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATD 227
            V LEPGD LYLPRG++H A ++    SLH+T  V   T +      A+ + L A  + D
Sbjct: 199 DVVLEPGDALYLPRGWLHSARSLGG-RSLHITAGVRGLTRY------ALVEELLALASAD 251

Query: 228 LEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLA 268
              R  LP       GL    P  I  +  A    L+D LA
Sbjct: 252 PALRATLPY------GLEMSDPDAIAPELTATVAALRDWLA 286


>gi|403727844|ref|ZP_10947824.1| hypothetical protein GORHZ_164_00490 [Gordonia rhizosphera NBRC
           16068]
 gi|403203776|dbj|GAB92155.1| hypothetical protein GORHZ_164_00490 [Gordonia rhizosphera NBRC
           16068]
          Length = 399

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 77/276 (27%), Positives = 126/276 (45%), Gaps = 42/276 (15%)

Query: 2   HWEKEPLYISRKQK--NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ---RQTM- 55
           HW + P+ +SR  +    +++L++ + +D++L    +       I + ++G    R+   
Sbjct: 17  HWGRRPM-LSRSHELPRDFDDLLSGESVDELLTSRGVRIPF---IRMAKEGALVGRECFV 72

Query: 56  -------DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
                  +V  +    +V   + +G +I          P+      + +  G  V AN Y
Sbjct: 73  GPAGFGAEVADQVDSAMVLAQFADGATIVLQGLHRLWPPVIDFVRDMVDDLGHPVQANAY 132

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEI 162
            TPP+ +GF PHYD  + F+LQ+ G+K W+V+ P   D   P  S P         +   
Sbjct: 133 ITPPSNRGFDPHYDVHDVFVLQVTGRKHWRVHEPVYTD---PLPSQPWTDHRAAIVRRAD 189

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
             P++   LEPGD LYLPRG+IH A  +  E S+H+T+ V   TA  D++   + +    
Sbjct: 190 DAPVIDTILEPGDALYLPRGWIHSAQAL-GETSIHLTVGVEPLTAR-DVMAAVLDEL--- 244

Query: 223 AGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLA 258
             AT  EFR  LP+         R  P D  A  +A
Sbjct: 245 --ATVAEFRASLPM---------RADPADAIAGTIA 269


>gi|425735033|ref|ZP_18853349.1| cupin superfamily protein [Brevibacterium casei S18]
 gi|425480477|gb|EKU47643.1| cupin superfamily protein [Brevibacterium casei S18]
          Length = 402

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 73/254 (28%), Positives = 123/254 (48%), Gaps = 22/254 (8%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
           D+W +  L +SR   ++ ++L ++  +D++     LR   +   K+   TL         
Sbjct: 35  DYWGRRAL-LSRGVSDF-SDLFSADAVDELISARGLRSPFLRVAKD-GATLPTSSFTSPA 91

Query: 56  DVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
            V      Q+    +W  + +G ++     Q   +P+   +++L E  G  V AN Y TP
Sbjct: 92  GVGATIGDQLDDTRLWRQFHDGATLVLQALQRTWEPIGAFSSALSEELGNPVQANAYITP 151

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG-----TPI 166
           P  QGF  HYD  + F+LQ++G K+W ++ P + D  L R       +E +G      P 
Sbjct: 152 PQNQGFDDHYDVHDVFVLQIQGTKRWVIHEP-VFDSPL-RDQPWTDRREAVGRAAATEPA 209

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           +   LEPGD LYLPRG++H A+T   E S+H+T+ ++  T +   + + + +   A    
Sbjct: 210 IDAVLEPGDALYLPRGWLH-AATAQGEVSIHLTLGIHNWTRFA--VAEQLAQVALAHLQD 266

Query: 227 DLEFRRGLPIGYLR 240
           D   R  LP+G  R
Sbjct: 267 DPAMRASLPLGESR 280


>gi|411005423|ref|ZP_11381752.1| transcription factor jumonji jmjC domain-containing protein
           [Streptomyces globisporus C-1027]
          Length = 409

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 92/186 (49%), Gaps = 7/186 (3%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           +W  + +G ++         +P+  L + L    G  V AN Y TPP  +GF  HYD  +
Sbjct: 106 LWRAFADGATLVLQALHRTWEPVADLVSGLSTDLGHPVQANAYVTPPQNRGFDAHYDVHD 165

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPR----YSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
            F+LQ+EG K+W V+ P + D    +    + +    +   GTP L   L PGD+LYLPR
Sbjct: 166 VFVLQIEGAKRWVVHEPVLPDPLRDQPWTDHRAAVADRAAHGTPHLDTMLRPGDVLYLPR 225

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G++H A     E S+H+T+ V+  T +   L + + +A  A    D   RR LP+     
Sbjct: 226 GWLHSAQ-AQGEVSIHLTLGVHAWTRYA--LAEHLTRAALAVLGEDPAMRRSLPLAGPEG 282

Query: 242 AGLARG 247
            G A G
Sbjct: 283 PGGADG 288


>gi|302538916|ref|ZP_07291258.1| predicted protein [Streptomyces sp. C]
 gi|302447811|gb|EFL19627.1| predicted protein [Streptomyces sp. C]
          Length = 394

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 108/242 (44%), Gaps = 43/242 (17%)

Query: 72  EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
           EG ++ F   Q    PL +    L    G  V  N Y TP  +QGF  HYD  ++FI+Q+
Sbjct: 102 EGATLVFQGLQELTGPLAEFGRRLGHDLGRPVNVNAYVTPAGSQGFGDHYDTQDSFIVQI 161

Query: 132 EGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVT 191
            G K+W +  P +          P    +  G    T+TLEPGD L+LPRG++H A + T
Sbjct: 162 HGSKRWTLKDPALAQPLSHETGRPLPEDDGSGR---TLTLEPGDCLWLPRGWVHSARS-T 217

Query: 192 NEHSLHVTISVYQKT---AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGK 248
           +  S+H+TIS+Y+ T   AW  +          AA A  L  R  L   + R        
Sbjct: 218 DTASVHLTISLYEWTGHWAWTRI----------AARAPGLPGRFPLSTDFFR-------- 259

Query: 249 PVDIQADRLAMKENLKDLLAKLVEYVDLDD-----------GVDEMGKQLMHDALPPVLS 297
                 DR A ++++  L A+L E++   D           G  E    + H A   VLS
Sbjct: 260 ------DRAAAEKDMAALRAELTEWLATADDSALVDLVRAAGAPEFPSPVRHPAR-EVLS 312

Query: 298 PE 299
           PE
Sbjct: 313 PE 314


>gi|239986030|ref|ZP_04706694.1| putative cupin superfamily protein [Streptomyces roseosporus NRRL
           11379]
 gi|291442971|ref|ZP_06582361.1| cupin 4 family protein [Streptomyces roseosporus NRRL 15998]
 gi|291345918|gb|EFE72822.1| cupin 4 family protein [Streptomyces roseosporus NRRL 15998]
          Length = 407

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 89/176 (50%), Gaps = 7/176 (3%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           +W  + +G ++         +P+  L + L    G  V AN Y TPP  +GF  HYD  +
Sbjct: 104 LWRAFTDGATLVLQALHRTWEPVADLVSGLSTELGHPVQANAYVTPPQNRGFDAHYDVHD 163

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPR----YSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
            F+LQ+EG K+W ++ P + D    +    + +    +   GTP L   L PGD+LYLPR
Sbjct: 164 VFVLQIEGAKRWVIHEPVLPDPLRDQPWTDHRAAVADRAAHGTPHLDTMLRPGDVLYLPR 223

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           G++H A     E S+H+T+ V+  T +   L + + +A  A    D   RR LP+ 
Sbjct: 224 GWLHSAQ-AQGEVSIHLTLGVHAWTRYA--LAEQLTRAALATLGDDPSMRRSLPLA 276


>gi|319951046|ref|ZP_08024906.1| cupin 4 family protein [Dietzia cinnamea P4]
 gi|319435286|gb|EFV90546.1| cupin 4 family protein [Dietzia cinnamea P4]
          Length = 403

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/250 (26%), Positives = 109/250 (43%), Gaps = 25/250 (10%)

Query: 3   WEKEPLYISRKQ-KNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQRQ 53
           W + PL     Q    +++L++   +D+I+     R   +   +    +D   +      
Sbjct: 18  WGRRPLLTRSAQLPRDFSDLLSPAMVDEIIAERGVRTPFVRMAREGSLVDRACFTRAAGF 77

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
              +  +  P  +   +  G +I       +  P+      +    G  V  N Y TPP 
Sbjct: 78  GAQIADQIDPDGMLTQFAAGATIVLQGLHRFWPPIIDFVQGMNADVGHPVQTNAYITPPA 137

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPIL 167
            +GF PHYD  + F+LQ+ G K+W+V+ P     + P    P        S      P++
Sbjct: 138 NRGFDPHYDVHDVFVLQVSGAKRWRVHEPV---HHHPLPDQPWTDHREAISARAADEPVI 194

Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATD 227
              LEPGD LYLPRG+IH A  +  E S+H+T+ V   T  +D+      +A+    A+D
Sbjct: 195 DEVLEPGDCLYLPRGWIHSAEAL-GETSIHLTVGVAPFTG-VDIA-----RAVLDGLASD 247

Query: 228 LEFRRGLPIG 237
            E R  LP+G
Sbjct: 248 EELRASLPLG 257


>gi|149060277|gb|EDM10991.1| myc induced nuclear antigen, isoform CRA_b [Rattus norvegicus]
          Length = 242

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 85/146 (58%), Gaps = 5/146 (3%)

Query: 1   DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R   +   YY +L +   +  +    +  + +++++     G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPSLAAYYQSLFSLSDLRSLCSQGLY-YGRDVNVCRCIGGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+A   Q+  D  ++  +I+F  PQ +   L ++   L+  FG  VG+N Y TP  +QG
Sbjct: 116 DGKAQFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLP 142
             PHYDD+E FILQLEG+K+W++Y P
Sbjct: 176 LPPHYDDVEVFILQLEGRKRWRLYSP 201


>gi|315506227|ref|YP_004085114.1| transcription factor jumonji jmjc domain-containing protein
           [Micromonospora sp. L5]
 gi|315412846|gb|ADU10963.1| transcription factor jumonji jmjC domain-containing protein
           [Micromonospora sp. L5]
          Length = 425

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 113/252 (44%), Gaps = 23/252 (9%)

Query: 1   DHWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYE 48
           DHW + PL +SR  +    + + +L++    D++L     R   +   K+   +    + 
Sbjct: 39  DHWGRSPL-LSRAAELPNADGFTDLLSPADADELLSRRGLRTPFLRVAKDGQLVPAARFT 97

Query: 49  DGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
            G     ++  + L + V   Y +G ++           L      L       +  N Y
Sbjct: 98  GGGGAGAEIGDQVLDEKVLALYADGATLVLQGLHRTWPALVDFARDLGTALAQPLQVNAY 157

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEIGTP 165
            TP  +QGFA HYD  + F+LQ++G+K W+++ P + D  E  P     +  S    G P
Sbjct: 158 LTPAGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLADPLERQPWGGRADEVSATAEGRP 217

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
            L V LEPGD LYLPRG++H A       SLH+T+ +   T +      A+ + L A  A
Sbjct: 218 ELDVVLEPGDALYLPRGWLHSAQ-AQESSSLHLTVGIRALTRY------AVVEELLALAA 270

Query: 226 TDLEFRRGLPIG 237
            D   R  LP G
Sbjct: 271 EDARLRASLPFG 282


>gi|379735634|ref|YP_005329140.1| hypothetical protein BLASA_2203 [Blastococcus saxobsidens DD2]
 gi|378783441|emb|CCG03109.1| Conserved protein of unknown function; putative cupin and
           clavaminate synthase domains [Blastococcus saxobsidens
           DD2]
          Length = 439

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 110/248 (44%), Gaps = 15/248 (6%)

Query: 2   HWEKEPLYI-SRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQR 52
           +W + PL   +    N + +L+    +D++L     R   +   K+   +D   +     
Sbjct: 24  YWARRPLLTRAEDTGNSFTDLLDLPAVDELLSRRGLRTPFLRIAKDGAVVDAKRFTTSGG 83

Query: 53  QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
              ++  +     V   + +G ++          PL +    L    G     N Y TPP
Sbjct: 84  AGAEIADQVSSDAVLRLFADGSTVVLQGLHRLWPPLIEFADQLAADLGHPTQVNAYITPP 143

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY---SSPNFSQEEIGTPILTV 169
           +++GF+PHYD  + F+LQ+ G+K W ++ P + D    +     S   +    G P++  
Sbjct: 144 SSRGFSPHYDVHDVFVLQVAGEKHWTIHEPVLPDPLRTQVWTDRSAEVAAAAEGEPVIDA 203

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
            L PGD LYLPRGY+H A  +    S H+T+ ++  T W     +++   ++   A D  
Sbjct: 204 VLRPGDALYLPRGYLHSAKAL-GAISAHLTVGIHSVTRWA--AAESVLDLVRTLAAEDRS 260

Query: 230 FRRGLPIG 237
            R  LP+G
Sbjct: 261 LRGSLPLG 268


>gi|167578420|ref|ZP_02371294.1| hypothetical protein BthaT_09788 [Burkholderia thailandensis TXDOH]
          Length = 397

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 73/244 (29%), Positives = 117/244 (47%), Gaps = 21/244 (8%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDIT--LYEDGQRQTMDVEG 59
           +W ++PL + R+  + Y  L  S++   +L       +    I   +        +  EG
Sbjct: 22  YWGRKPLIVRRQAPHLYACLPDSEEFAFLLHSLTDPERGWFSIVNGVARPPSDSLLTQEG 81

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCF-------VGANTYQTPP 112
                 V+  YR+G S+     Q   +    L   ++             +GAN Y +PP
Sbjct: 82  LLNLSEVYAAYRDGNSLLMNQVQRRHRETAMLCRRIESALSAHGIALARHIGANGYLSPP 141

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
           ++QGF  HYD  +  ILQ+EG+K W++Y  R V  +  +  +     EE G+P     L 
Sbjct: 142 SSQGFNIHYDPHDVLILQIEGRKHWRLY-GRHV-AWPTQPPAAPIPPEEAGSPRREFVLS 199

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
           PG+L+Y+PRG +H A+T T+  SLH+T+S+   T W DLL +AM        + +  FRR
Sbjct: 200 PGELVYIPRGVLHDANT-TDSRSLHLTLSIETLT-WTDLLIEAM--------SDNPAFRR 249

Query: 233 GLPI 236
            LP+
Sbjct: 250 NLPV 253


>gi|119484518|ref|ZP_01619135.1| hypothetical protein L8106_02332 [Lyngbya sp. PCC 8106]
 gi|119457992|gb|EAW39115.1| hypothetical protein L8106_02332 [Lyngbya sp. PCC 8106]
          Length = 386

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 78/284 (27%), Positives = 136/284 (47%), Gaps = 39/284 (13%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W K+ L IS    + +++L + +  + +L  + + +    +I L + GQ          
Sbjct: 20  NWLKQALIISGYSPHKFSHLFSWQDFNTLLNFHHLTYP---EIRLAKSGQ---------T 67

Query: 62  LPQIVWDHY----REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
           LP+  +D+     ++G ++   + QT +  + +  A+L+   G     N Y + P +QGF
Sbjct: 68  LPENAYDNLIKSCQDGATVIIDSLQTRLPVIAEFTANLRNELGHRTQINAYCSFPGSQGF 127

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
           A HYD  E FILQ+ G K W+V+ P    E+       N        P +   L+PGDLL
Sbjct: 128 ACHYDSHEVFILQISGDKHWRVFSPTF--EFPLSKHRSNLLDPPTTDPYINQVLKPGDLL 185

Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           Y+PRG+ H A  V ++ SLH+T+ V  +T  ID +E  + + LQ        +R+ LP+ 
Sbjct: 186 YIPRGHWHYAVAV-DQPSLHLTLGVDCQTG-IDFVE-WLTEELQENPL----WRQSLPLL 238

Query: 238 YLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
                             R A  ++L+DL+ K +  ++ D+ +D
Sbjct: 239 N--------------STHRQACSQHLRDLIQKWIGQLETDEFID 268


>gi|391340602|ref|XP_003744628.1| PREDICTED: MYC-induced nuclear antigen-like [Metaseiulus
           occidentalis]
          Length = 509

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 83/332 (25%), Positives = 141/332 (42%), Gaps = 46/332 (13%)

Query: 1   DHWEKEPLYISRKQKN----------YYNNLIT-SKKID-DILRDNVIEFKKNIDITLYE 48
           +HW  +PL+I   + +           +  +++ S+K D +++ D    F K +D     
Sbjct: 79  NHWGSKPLHIEAPEASEELSKIFSLEVFEEIVSRSEKGDLNLIADRDFTFSKIVD----- 133

Query: 49  DGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
            G RQ +++E      +V +HY+E  +I+   PQ +   L      L+   G  V +  +
Sbjct: 134 -GDRQDLEIE-TLTKDLVKNHYKEKYAIQINQPQRFQDCLWSFIEKLECHLGSLVSSQVF 191

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPI 166
             P   +     YDD E F+LQL G   WK++  +  MV E+          +E +G  I
Sbjct: 192 LVPGDTKNQQRRYDDTEKFVLQLSGSSSWKLHELKGEMVCEF-----DDALKEENVGDQI 246

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           +   L+ GD+LY P+  I ++  VT+    H+  S +Q     +LL + +   ++     
Sbjct: 247 MAANLKAGDVLYFPKTTIFESEGVTD--GAHIVFSTFQGMHRANLLLQTIIATIEEQVPV 304

Query: 227 DLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDG----VDE 282
             E R+GLP  YL  A                   +  D L KL E V          D 
Sbjct: 305 CEELRKGLPFHYLPRANSISNA-------------DYADTLVKLAESVRSRSKPLPPADG 351

Query: 283 MGKQLMHDALPPVLSPE-ELQCSVFENGLRMS 313
           M ++ M + LPP   P+ EL+     +  ++S
Sbjct: 352 MIQEFMTNRLPPYGQPKFELESGSITSSSKIS 383


>gi|384245294|gb|EIE18789.1| hypothetical protein COCSUDRAFT_20456, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 183

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 66/103 (64%), Gaps = 5/103 (4%)

Query: 38  FKKNIDITL--YEDGQRQTMDVEGRALPQIVWDH-YREGCSIRFLNPQTYIKPLQQLNAS 94
            K+ +DI    ++DGQRQT+  E  ++ +  W   Y+EG S++   PQ +   L +L +S
Sbjct: 2   LKQGLDINFVRFKDGQRQTLQNEEGSIER--WQQLYKEGWSLQVHQPQRFCDALTRLCSS 59

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKW 137
           L+   GC VG+N Y TP  AQG APH+DD+E F++Q EG+K+W
Sbjct: 60  LENQLGCLVGSNAYLTPAGAQGLAPHHDDVEIFVVQTEGRKRW 102


>gi|392413955|ref|YP_006450560.1| hypothetical protein Mycch_0053 [Mycobacterium chubuense NBB4]
 gi|390613731|gb|AFM14881.1| hypothetical protein Mycch_0053 [Mycobacterium chubuense NBB4]
          Length = 404

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 122/255 (47%), Gaps = 33/255 (12%)

Query: 2   HWEKEPLYISRKQK--NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ---RQT-- 54
           HW ++PL +SR       +++L++   +D++L +  +   +   I + +DGQ   R+   
Sbjct: 17  HWGRQPL-LSRSDALPRDFSDLLSPDTVDELLAERGV---RAPFIRVAKDGQLVGRECYL 72

Query: 55  ------MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
                  ++  +     V + +  G SI          PL     ++ +  G  V AN Y
Sbjct: 73  GPAGFGAEIGDQVDSAKVLEQFAAGASIALQGLHRLWPPLIDFVRAMVDDLGHPVQANAY 132

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEI 162
            TPPT++GF  HYD  + F+LQ  G K W V+ P   D   P  S P        ++   
Sbjct: 133 ITPPTSRGFDAHYDVHDVFVLQAAGHKHWTVHEPVHRD---PLASQPWTEHRAAIARRVR 189

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
            +P++   L PGD LYLPRG++H A  + +  S+H+T+ V   T  +D++ +A+   L  
Sbjct: 190 DSPVIDTVLGPGDALYLPRGWVHSAQAL-DTLSIHLTVGVSPLTG-MDVV-RAVADHLAT 246

Query: 223 AGATDLEFRRGLPIG 237
            G    +FR  LP+G
Sbjct: 247 VG----DFRASLPMG 257


>gi|145594683|ref|YP_001158980.1| cupin 4 family protein [Salinispora tropica CNB-440]
 gi|145304020|gb|ABP54602.1| Cupin 4 family protein [Salinispora tropica CNB-440]
          Length = 496

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/252 (29%), Positives = 114/252 (45%), Gaps = 25/252 (9%)

Query: 2   HWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYED 49
           HW + PL +SR  +    + + +L++    DD+L     R   +   ++   +    Y  
Sbjct: 16  HWGQTPL-LSRAHELPNPSGFRDLLSPADADDLLSRRGLRTPFLRVAQDGVLVPAARYTG 74

Query: 50  GQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
           G     ++  + L + + D Y  G ++           L      L    G  +  N Y 
Sbjct: 75  GGGAGAEITDQVLDEKILDLYAGGATLVLQGLHRTWPALIDFARDLGLAVGQPLQVNAYL 134

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI----GTP 165
           TP  +QGFA HYD  + F+LQ++G K W+++ P + D  L R      + E +    G P
Sbjct: 135 TPAGSQGFATHYDTHDVFVLQVDGGKHWRIHPPVLPDP-LERQPWGGRADEVVATATGAP 193

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
            L V L PGD LYLPRG++H A+      SLH+T+ V   T +  L+E+     L A  A
Sbjct: 194 ALDVLLAPGDALYLPRGWLHSAA-AQERSSLHLTVGVRALTRYT-LVEE-----LLALAA 246

Query: 226 TDLEFRRGLPIG 237
            D   R  LP G
Sbjct: 247 EDQRLRATLPFG 258


>gi|441510904|ref|ZP_20992803.1| hypothetical protein GOACH_30_00100 [Gordonia aichiensis NBRC
           108223]
 gi|441444967|dbj|GAC50764.1| hypothetical protein GOACH_30_00100 [Gordonia aichiensis NBRC
           108223]
          Length = 416

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/254 (27%), Positives = 116/254 (45%), Gaps = 29/254 (11%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQ 53
           +HW ++PL +SR       +++L++   +D+++     R   I   K   + + +D    
Sbjct: 19  EHWGRQPL-LSRSAALPRDFDDLLSPHSVDELIAERGVRAPFIRMAKEGGL-IGKDSYLG 76

Query: 54  TMDVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
           +         Q+    V   +  G +I          P+      + +  G  V AN Y 
Sbjct: 77  SAGFGAEMPDQVDSAKVLTQFATGATIVLQGLHRLWPPVIDFVRGMVDDLGHPVQANAYI 136

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIG 163
           TPP  +GF PHYD  + F+LQ+ G+K W+V+ P   +   P  + P        +     
Sbjct: 137 TPPANRGFDPHYDVHDVFVLQISGRKHWRVHAPVHDN---PLSTQPWTDHREAIAARSAD 193

Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
           +P++   LEPGD LYLPRG+IH A  +  + S+H+T+ V   TA+ DL+       L   
Sbjct: 194 SPVIDTLLEPGDALYLPRGWIHSAEAL-GDTSIHLTVGVSAVTAY-DLVR----AVLDQL 247

Query: 224 GATDLEFRRGLPIG 237
           G  D   R  LP+G
Sbjct: 248 GTVD-ALRASLPMG 260


>gi|302867368|ref|YP_003836005.1| transcription factor jumonji jmjC domain-containing protein
           [Micromonospora aurantiaca ATCC 27029]
 gi|302570227|gb|ADL46429.1| transcription factor jumonji jmjC domain-containing protein
           [Micromonospora aurantiaca ATCC 27029]
          Length = 425

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/252 (28%), Positives = 112/252 (44%), Gaps = 23/252 (9%)

Query: 1   DHWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYE 48
           DHW + PL +SR  +    + + +L +    D++L     R   +   K+   +    + 
Sbjct: 39  DHWGRSPL-LSRAAELPNPDGFTDLFSPADADELLSRRGLRTPFLRVAKDGQLVPAARFT 97

Query: 49  DGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
            G     ++  + L + V   Y +G ++           +      L       +  N Y
Sbjct: 98  GGGGAGAEIGDQVLDEKVLALYADGATLVLQGLHRTWPAIVDFARDLGTALAQPLQVNAY 157

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEIGTP 165
            TP  +QGFA HYD  + F+LQ++G+K W+++ P + D  E  P     +  S    G P
Sbjct: 158 LTPAGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLADPLERQPWGGRADEVSATAEGRP 217

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
            L V LEPGD LYLPRG++H A       SLH+T+ +   T +      A+ + L A  A
Sbjct: 218 ELDVVLEPGDALYLPRGWLHSAQ-AQESSSLHLTVGIRALTRY------AVVEELLALAA 270

Query: 226 TDLEFRRGLPIG 237
            D   R  LP G
Sbjct: 271 EDARLRASLPFG 282


>gi|126438193|ref|YP_001073884.1| cupin 4 family protein [Mycobacterium sp. JLS]
 gi|126237993|gb|ABO01394.1| Cupin 4 family protein [Mycobacterium sp. JLS]
          Length = 404

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 133/289 (46%), Gaps = 42/289 (14%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQ 53
           ++W + PL +SR       + +L++   +D+++     R   I   K  D+ L +D    
Sbjct: 16  EYWGRRPL-LSRSGALPRDFADLLSPGMVDELIAERGVRAPFIRLAKEGDV-LAKDCYLG 73

Query: 54  TMDVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
                     Q+    V   +  G +I          P+  L   L +  G  V AN Y 
Sbjct: 74  PAGFGAEMPDQVDSAKVLTQFSAGATIVMQGLHRLWPPVIDLVRHLVDDLGHPVQANAYI 133

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIG 163
           TPP+ +GF PHYD  + F+LQ  G+K+W V+ P  V  + P  S P        ++   G
Sbjct: 134 TPPSNRGFDPHYDVHDVFVLQTAGEKRWVVHEP--VHPH-PLPSQPWTQHRDAIAERAAG 190

Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
            P++   L PGD LYLPRG++H A  + +  S+H+TI V   T  +D + +A+  AL  +
Sbjct: 191 EPVIDTVLAPGDALYLPRGWVHSAHAL-DTTSIHLTIGVSAVTG-VD-VARAVVDALADS 247

Query: 224 GATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE 272
            A    FR  LP+G         G P    ADR  +   +  ++A++VE
Sbjct: 248 AA----FRAPLPMG---------GDP----ADRDEIIAAVTKVMAQMVE 279


>gi|409356670|ref|ZP_11235057.1| cupin 4 family protein [Dietzia alimentaria 72]
          Length = 401

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 68/252 (26%), Positives = 110/252 (43%), Gaps = 25/252 (9%)

Query: 1   DHWEKEPLYI-SRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQ 51
           D+W + PL   S +    +++L++S+ +D+I+     R   +   +    +D   +    
Sbjct: 16  DYWGRRPLLTRSEELPRDFSDLLSSEMVDEIIAERGVRTPFVRMAREGTLVDRACFTRAG 75

Query: 52  RQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
                +  +  P  V   +  G +I          P+      +    G  V  N Y TP
Sbjct: 76  GFGAQMPDQIDPDGVLTQFAAGATIVLQGLHRLWPPIINFVRGMTSDVGHPVQTNAYITP 135

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE------IGTP 165
           P  +GF PHYD  + F+LQ+ G K+W+V+ P  V  + P    P     E         P
Sbjct: 136 PANRGFDPHYDAHDVFVLQVAGTKRWRVHEP--VHRH-PLSDQPWTDHREAIAARAASEP 192

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
           ++   LEPGD LYLPRG++H A     + S+H+T+ V   T         + +A   A A
Sbjct: 193 VIDAVLEPGDCLYLPRGWVHSAE-AQGKTSIHLTVGVAPFTG------HDVARAALEALA 245

Query: 226 TDLEFRRGLPIG 237
           +  E R  LP+G
Sbjct: 246 SQEELRASLPLG 257


>gi|363421145|ref|ZP_09309234.1| cupin superfamily protein [Rhodococcus pyridinivorans AK37]
 gi|359734880|gb|EHK83848.1| cupin superfamily protein [Rhodococcus pyridinivorans AK37]
          Length = 410

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/251 (28%), Positives = 112/251 (44%), Gaps = 26/251 (10%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTMDV 57
           W + PL   R     + +L + + +D++     LR   +   K  D T   D    +   
Sbjct: 46  WGRAPLLTRRA--GTFADLFSVEAVDELISRRGLRTPFLRVAK--DGTTLPDSSFTSPGG 101

Query: 58  EGRALPQ-----IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
            G  +       ++W +  +G ++         +P+ Q   +L +  G  V  N Y TPP
Sbjct: 102 VGATISDQLDDTMLWRNLADGATLVLQALHRTWEPISQFGTALSDELGHPVQVNAYITPP 161

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPI 166
             QGF+ HYD  + F++Q+EG K+W ++ P       P  + P        ++       
Sbjct: 162 RNQGFSHHYDVHDVFVVQIEGTKRWVIHEPV---HPAPLRNQPWTDHRAAVARAATEPAC 218

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           +   LEPGD LYLPRG+IH A+    E S+H+T+ ++  T    L E     AL A G  
Sbjct: 219 IDTVLEPGDCLYLPRGWIH-AAEARGEISIHLTVGIHTWTRHA-LAEHLTRAALTALG-D 275

Query: 227 DLEFRRGLPIG 237
           D E R  LP+G
Sbjct: 276 DPEMRGALPMG 286


>gi|444509721|gb|ELV09407.1| MYC-induced nuclear antigen [Tupaia chinensis]
          Length = 398

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/304 (26%), Positives = 132/304 (43%), Gaps = 70/304 (23%)

Query: 1   DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           + WE++PL I R       YY +L     +  +    +  + +++++    +G+++ ++ 
Sbjct: 57  EFWEQKPLLIQRDDPALAAYYQSLFRLTDLKSLCSRGMY-YGRDVNVCRCINGKKKVLNK 115

Query: 58  EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           +G+A   Q+  D  ++  +I+F  PQ +                                
Sbjct: 116 DGKAHFLQLRKDFDQKRATIQFHQPQRF-------------------------------- 143

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
                   + FILQLEG+K W++Y P +    L R  S   +++ IG P    TL+PGDL
Sbjct: 144 --------KVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEDRIGRPAHEFTLKPGDL 191

Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LY PRG IHQA T     HS HVTIS YQ ++W D+L   +   +      D+  R G+P
Sbjct: 192 LYFPRGTIHQADTPPGLAHSTHVTISTYQNSSWGDILLDTISGLVFDTAKEDVALRAGIP 251

Query: 236 -----IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHD 290
                +  + +A       +   ADRL   E  K+LL+             +M K  +  
Sbjct: 252 RQLLLVETIPFATRKLSGFLRTLADRL---EGTKELLS------------SDMKKDFIMH 296

Query: 291 ALPP 294
            LPP
Sbjct: 297 RLPP 300


>gi|332709429|ref|ZP_08429390.1| hypothetical protein LYNGBM3L_40470 [Moorea producens 3L]
 gi|332351688|gb|EGJ31267.1| hypothetical protein LYNGBM3L_40470 [Moorea producens 3L]
          Length = 385

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 82/325 (25%), Positives = 153/325 (47%), Gaps = 31/325 (9%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW  + L+I  + +  + +L + +K+ D+L  N  +FK   ++ L  DG+   +  E   
Sbjct: 20  HWTSKALFIPSEDERKFADLFSWQKLTDLL--NFHQFKYP-ELRLALDGK---VLEEKEN 73

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
              I W   + G ++        +  +    + +++  G     N+Y + P+ QGF+ HY
Sbjct: 74  ANLIKW--CQSGATLIIDKVHKLVPEITNFTSEVRQDLGYSAQVNSYCSWPSKQGFSCHY 131

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           D  E FILQ+EG K+W V+   +      + S+     E  G P L  TL PGD+LY+PR
Sbjct: 132 DTHEVFILQIEGNKEWYVFSDTLKYPLTEQKSASETPPE--GDPYLKCTLTPGDVLYIPR 189

Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           G+ H A  + ++ SLH+T+ ++ +T  +D L+  + +  Q        +R+ LP+ +   
Sbjct: 190 GHWHYAVAL-DQPSLHLTLGIHCQTG-VDFLDWLVGELRQNEA-----WRKSLPLRF--- 239

Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
                 +P+D   D L   +N K+ L         +  +D + K L   +L     P + 
Sbjct: 240 ----DAEPLDQSIDNLI--QNFKEYLTNSDIQERYNRYLDGLVKPLARYSL-----PAQA 288

Query: 302 QCSVFENGLRMSQTGEVYNATQITK 326
             ++F NG++   +   +   +I++
Sbjct: 289 GFNIFPNGVKTRFSNPKFKRLKISQ 313


>gi|403525442|ref|YP_006660329.1| cupin superfamily protein [Arthrobacter sp. Rue61a]
 gi|403227869|gb|AFR27291.1| putative cupin superfamily protein [Arthrobacter sp. Rue61a]
          Length = 388

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/178 (34%), Positives = 91/178 (51%), Gaps = 12/178 (6%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           +W  + +G ++         KP+   +  L    G  V AN Y TPP  +GF  HYD  +
Sbjct: 93  LWRKFADGATLVLQALHRTWKPVSSFSTQLSTELGHPVQANAYITPPQNRGFDDHYDVHD 152

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYL 179
            F+LQ+EG K+W ++ P  VD   P  S P        ++   G   +   LEPGD+LYL
Sbjct: 153 VFVLQIEGTKRWIIHEPVHVD---PLRSQPWTDRRSAVAEAAQGKAYIDTVLEPGDVLYL 209

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           PRG++H A+    + S+H+T+ V+  T     L + + +A  AA   D E RR LP+G
Sbjct: 210 PRGWLH-AAEAQGKVSIHLTLGVHSWTR--HALAEHLAQAALAALCDDPEVRRSLPLG 264


>gi|330468574|ref|YP_004406317.1| transcription factor jumonji jmjC domain-containing protein
           [Verrucosispora maris AB-18-032]
 gi|328811545|gb|AEB45717.1| transcription factor jumonji jmjC domain-containing protein
           [Verrucosispora maris AB-18-032]
          Length = 420

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 71/251 (28%), Positives = 112/251 (44%), Gaps = 23/251 (9%)

Query: 2   HWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYED 49
           HW + PL +SR  +      + +L++    D++L     R   +   K+   +    Y  
Sbjct: 36  HWGQTPL-LSRAAELPNPAGFTDLLSPDDADELLSRRGLRTPFLRVAKDGQVLPAARYTG 94

Query: 50  GQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
           G     ++  + L + +   Y  G ++          PL      L    G  +  N Y 
Sbjct: 95  GGGAGAEIGDQVLDEKILQLYAGGATLVLQGLHRNWPPLVDFTRELSLAVGQPLQVNAYL 154

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEIGTPI 166
           TPP +QGFA HYD  + F+LQ++G+K W+++ P + D  E  P     +  +   +G   
Sbjct: 155 TPPGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLPDPLERQPWGGRADEVTATALGPAA 214

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           L V LEPGD LYLPRG++H A       SLH+T+ +   T +      A+ + L A    
Sbjct: 215 LDVVLEPGDALYLPRGWLHSAQ-AQESSSLHLTVGIRALTRY------ALVEELLALATE 267

Query: 227 DLEFRRGLPIG 237
           D   R  LP G
Sbjct: 268 DRRLRATLPFG 278


>gi|108802210|ref|YP_642407.1| cupin 4 [Mycobacterium sp. MCS]
 gi|119871363|ref|YP_941315.1| cupin 4 family protein [Mycobacterium sp. KMS]
 gi|108772629|gb|ABG11351.1| Cupin 4 [Mycobacterium sp. MCS]
 gi|119697452|gb|ABL94525.1| Cupin 4 family protein [Mycobacterium sp. KMS]
          Length = 404

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 133/289 (46%), Gaps = 42/289 (14%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQ 53
           ++W + PL +SR       + +L++   +D+++     R   I   K  D+ L +D    
Sbjct: 16  EYWGRRPL-LSRSGALPRDFADLLSPGMVDELIAERGVRAPFIRLAKEGDV-LAKDCYLG 73

Query: 54  TMDVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
                     Q+    V   +  G +I          P+  L   L +  G  V AN Y 
Sbjct: 74  PAGFGAEMPDQVDSAKVLTQFSAGATIVLQGLHRLWPPVIDLVRHLVDDLGHPVQANAYI 133

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIG 163
           TPP+ +GF PHYD  + F+LQ  G+K+W V+ P  V  + P  S P        ++   G
Sbjct: 134 TPPSNRGFDPHYDVHDVFLLQTAGEKRWVVHEP--VHPH-PLPSQPWTQHRDAIAERAAG 190

Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
            P++   L PGD LYLPRG++H A  + +  S+H+TI V   T  +D + +A+  AL  +
Sbjct: 191 EPVIDTVLAPGDALYLPRGWVHSAHAL-DTTSIHLTIGVSAVTG-VD-VARAVVDALADS 247

Query: 224 GATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE 272
            A    FR  LP+G         G P    ADR  +   +  ++A++VE
Sbjct: 248 AA----FRAPLPMG---------GDP----ADRDEIIAAVTKVMAQMVE 279


>gi|377566996|ref|ZP_09796242.1| hypothetical protein GOSPT_129_00550 [Gordonia sputi NBRC 100414]
 gi|377525844|dbj|GAB41407.1| hypothetical protein GOSPT_129_00550 [Gordonia sputi NBRC 100414]
          Length = 411

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 35/257 (13%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYED---- 49
           DHW + PL +SR       +++L++ + +D+++     R   I   K   +   E+    
Sbjct: 19  DHWGRRPL-LSRSDALPRDFDDLLSPRSVDELIAERGVRAPFIRMAKEGGLVGKENYLGP 77

Query: 50  ---GQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN 106
              G      V+       V   +  G +I          P+      + +  G  V AN
Sbjct: 78  AGFGAEMPDQVDSAK----VLAQFASGATIVLQGLHRLWPPIIDFVRGMVDDLGHPVQAN 133

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP------RYSSPNFSQE 160
            Y TPP  +GF PHYD  + F+LQ+ G K W+V+ P   D          R +    S E
Sbjct: 134 AYITPPANRGFDPHYDVHDVFVLQISGSKHWRVHSPVHTDPLADQPWTDHRDAIEARSHE 193

Query: 161 EIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKAL 220
           E   P++   LEPGD LYLPRG++H A  + +  S+H+T+ V   TA+ DL+   + +  
Sbjct: 194 E---PVIDTVLEPGDALYLPRGWLHSAEALGHT-SIHLTVGVSAVTAF-DLVRTVVDQL- 247

Query: 221 QAAGATDLEFRRGLPIG 237
                T    R  LP+G
Sbjct: 248 ----GTVESLRTALPMG 260


>gi|374372801|ref|ZP_09630462.1| Cupin, JmjC-type [Niabella soli DSM 19437]
 gi|373234877|gb|EHP54669.1| Cupin, JmjC-type [Niabella soli DSM 19437]
          Length = 396

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 85/340 (25%), Positives = 145/340 (42%), Gaps = 50/340 (14%)

Query: 4   EKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-QTMDVEGRAL 62
           EK+ L+I R   +YY +++T+ ++   L    I +     + +  DG+   + +   + +
Sbjct: 23  EKDLLHIQRNNTDYYKDVLTASELSAFLDRQDIFYPS---LRIVRDGKEIPSGEYTLKGV 79

Query: 63  P------------QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
           P            +  +  + +G ++     Q Y   L      L   F   V AN Y T
Sbjct: 80  PIGHHKKDGIVNTEKAFALFNDGATLVVQAGQRYFDHLSACCMELSRRFNAPVQANLYLT 139

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P  +QGF PH+D  + F+LQL G K W +Y     ++ LP  S    S +    P+ T+ 
Sbjct: 140 PNKSQGFNPHWDTHDVFVLQLSGTKTWHLY---GFEKELPTKSQGFVSNDYSKQPLQTIQ 196

Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEF 230
           L PGD LY+PRGY+H A    +  S H+TI +   T W+    +     L      +  F
Sbjct: 197 LAPGDFLYVPRGYVHDA-VADDGISAHITIGILSFT-WVRYFAE-----LFGQLENEKTF 249

Query: 231 RRGLPIGYLRYAGLARGKPVDIQ--ADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLM 288
           R  +P        + + K   ++   D+L  ++ LK L A+           +++  Q +
Sbjct: 250 REAIPFWRSDIESIVKEKTALLKNTIDQLDAEDVLKKLTAQY----------EKIQPQAL 299

Query: 289 HDALPPVLSPEELQCS---------VFENGLRMSQTGEVY 319
           H+    +L  ++L             FE G+   Q G+VY
Sbjct: 300 HNYFGSLLQLDKLHEDSVIRLNREVFFERGM---QNGQVY 336


>gi|357415101|ref|YP_004926837.1| transcription factor jumonji jmjC domain-containing protein
           [Streptomyces flavogriseus ATCC 33331]
 gi|320012470|gb|ADW07320.1| transcription factor jumonji jmjC domain-containing protein
           [Streptomyces flavogriseus ATCC 33331]
          Length = 402

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 60/179 (33%), Positives = 88/179 (49%), Gaps = 13/179 (7%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           +W  + +G ++         +P+  L + L    G  V AN Y TPP  +GF  HYD  +
Sbjct: 106 LWRAFADGATLVLQALHRTWEPVAGLVSELSTELGHPVQANAYVTPPQNRGFDAHYDVHD 165

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-------GTPILTVTLEPGDLLY 178
            F+LQ+EG K+W V+ P + D   P    P     +         T  L   L PGD+LY
Sbjct: 166 VFVLQIEGTKRWIVHEPVLPD---PLRDQPWTDHRQAVADAAARSTAHLDTVLGPGDVLY 222

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           LPRG++H A     E S+H+T+ V+  T +   L + + +A  AA   D   RR LP+G
Sbjct: 223 LPRGWLHSAR-AQGEVSIHLTLGVHTWTRYA--LAEQLTRAALAALRDDPPMRRSLPLG 278


>gi|291449903|ref|ZP_06589293.1| cupin 4 family protein [Streptomyces albus J1074]
 gi|291352852|gb|EFE79754.1| cupin 4 family protein [Streptomyces albus J1074]
          Length = 406

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 105/218 (48%), Gaps = 16/218 (7%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
           D W +  L ++R+  ++ ++L ++  +D++     LR   +   +N   TL +       
Sbjct: 34  DFWARTAL-LNRRAGDF-SDLFSAAAVDELIARRGLRTPFLRVARN-GATLPDTAYTSPA 90

Query: 56  DVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
            V      Q+    +W  + +G ++         +P+ +  A L    G  V AN Y TP
Sbjct: 91  GVGATIADQLDDTALWREFADGATLVLQALHRTWQPVAEFCAGLGTELGHPVQANAYVTP 150

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS---PNFSQEEIGTPILT 168
           P  +GF  HYD  + F+LQ+EG K+W V+ P   D    +  +   P  ++     P L 
Sbjct: 151 PQNRGFDDHYDVHDVFVLQIEGTKRWLVHRPVHPDPLRDQPWTDRRPAVAEAAASEPYLD 210

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           V L PGD+LYLPRG++H A+      S+H+T+ ++  T
Sbjct: 211 VVLAPGDVLYLPRGWLH-AAEAQGAVSVHLTLGIHTWT 247


>gi|421741680|ref|ZP_16179863.1| hypothetical protein SM8_03506 [Streptomyces sp. SM8]
 gi|406689902|gb|EKC93740.1| hypothetical protein SM8_03506 [Streptomyces sp. SM8]
          Length = 406

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 105/218 (48%), Gaps = 16/218 (7%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
           D W +  L ++R+  ++ ++L ++  +D++     LR   +   +N   TL +       
Sbjct: 34  DFWARTAL-LNRRAGDF-SDLFSAAAVDELIARRGLRTPFLRVARN-GATLPDTAYTSPA 90

Query: 56  DVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
            V      Q+    +W  + +G ++         +P+ +  A L    G  V AN Y TP
Sbjct: 91  GVGATIADQLDDTALWREFADGATLVLQALHRTWQPVAEFCAGLGTELGHPVQANAYVTP 150

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS---PNFSQEEIGTPILT 168
           P  +GF  HYD  + F+LQ+EG K+W V+ P   D    +  +   P  ++     P L 
Sbjct: 151 PQNRGFDDHYDVHDVFVLQIEGTKRWLVHRPVHPDPLRDQPWTDRRPAVAEAAASEPYLD 210

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           V L PGD+LYLPRG++H A+      S+H+T+ ++  T
Sbjct: 211 VVLAPGDVLYLPRGWLH-AAEAQGAVSVHLTLGIHTWT 247


>gi|359146656|ref|ZP_09180124.1| cupin superfamily protein [Streptomyces sp. S4]
          Length = 406

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 105/218 (48%), Gaps = 16/218 (7%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
           D W +  L ++R+  ++ ++L ++  +D++     LR   +   +N   TL +       
Sbjct: 34  DFWARTAL-LNRRAGDF-SDLFSAAAVDELIARRGLRTPFLRVARN-GATLPDTAFTSPA 90

Query: 56  DVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
            V      Q+    +W  + +G ++         +P+ +  A L    G  V AN Y TP
Sbjct: 91  GVGATIADQLDDTALWREFADGATLVLQALHRTWQPVAEFCAGLGTELGHPVQANAYVTP 150

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS---PNFSQEEIGTPILT 168
           P  +GF  HYD  + F+LQ+EG K+W V+ P   D    +  +   P  ++     P L 
Sbjct: 151 PQNRGFDDHYDVHDVFVLQIEGTKRWLVHRPVHPDPLRDQPWTDRRPAVAEAAASEPYLD 210

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           V L PGD+LYLPRG++H A+      S+H+T+ ++  T
Sbjct: 211 VVLAPGDVLYLPRGWLH-AAEAQGAVSVHLTLGIHTWT 247


>gi|377561186|ref|ZP_09790649.1| hypothetical protein GOOTI_184_00050 [Gordonia otitidis NBRC
           100426]
 gi|377521621|dbj|GAB35814.1| hypothetical protein GOOTI_184_00050 [Gordonia otitidis NBRC
           100426]
          Length = 411

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 111/235 (47%), Gaps = 28/235 (11%)

Query: 1   DHWEKEPLYISRKQK--NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE 58
           +HW ++PL +SR       +++L++   +D+++ +  +       I + ++G     D  
Sbjct: 19  EHWGRQPL-LSRSDTLPRDFDDLLSPHSVDELISERGVRAPF---IRMAKEGGLVGKDCY 74

Query: 59  ------GRALP-QI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT 107
                 G  +P Q+    V   +  G +I          P+ +    + +  G  V AN 
Sbjct: 75  LGPAGFGAEMPDQVDSAKVLAQFAAGATIVLQGLHRLWPPMIEFVRGMVDDLGHPVQANA 134

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT--- 164
           Y TPP  +GF PHYD  + F+LQ  G+K W+V+ P   D   P  + P     +  T   
Sbjct: 135 YITPPANRGFDPHYDVHDVFVLQTSGRKHWRVHEPVHRD---PLPTQPWTDHRDAITARG 191

Query: 165 ---PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
              P++   LEPGD LYLPRG++H A  +  + S+H+TI V   T + DL+   M
Sbjct: 192 DDRPVIDTVLEPGDALYLPRGWVHSAEAL-GDTSIHLTIGVSAVTTY-DLVRAVM 244


>gi|428181856|gb|EKX50718.1| hypothetical protein GUITHDRAFT_161698 [Guillardia theta CCMP2712]
          Length = 470

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 71/245 (28%), Positives = 116/245 (47%), Gaps = 40/245 (16%)

Query: 4   EKEPLYISRKQKNYYNNLI--TSKKI----DDIL----RDNVIEFKKNIDITLYE----- 48
           E++ + I+R  +NY+N++I  +S +I    DD L    +D + E     D  L +     
Sbjct: 65  EQQVVMIARNSRNYFNDIINISSAQIMETFDDCLSVHSQDAMDEMISGEDYHLVKKVKGK 124

Query: 49  DGQRQTMDVEGRALPQ---IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGA 105
           DG+  +  V GR   +   I W   + G ++   N       L      +++  G  V  
Sbjct: 125 DGEFWSGRVPGRTHKKDDVIKW--MKRGFTLVINNADMRYPGLVGAAQRMEDALGYAVQM 182

Query: 106 NTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQE 160
           N Y TPP++QGF  H+D +   +LQLEG K W+VY     LPR   ++ P++   +    
Sbjct: 183 NLYFTPPSSQGFEVHFDPMNTLVLQLEGSKTWEVYNPIIALPRPEQKFKPKHKEIHHGSR 242

Query: 161 EIGTPILTVTLEPGDLLYLPRGYIHQASTVTN--------EHSLHVTISV-YQKTAWIDL 211
           +      T TL PG   +LPRG++H+A T ++          SLH+T+ +  Q   W  +
Sbjct: 243 Q------TFTLTPGSTFFLPRGWLHEAHTNSSSLPPRSRASASLHLTVGIDAQYVTWESV 296

Query: 212 LEKAM 216
           L  A+
Sbjct: 297 LHAAV 301


>gi|119961145|ref|YP_946193.1| cupin superfamily protein [Arthrobacter aurescens TC1]
 gi|119948004|gb|ABM06915.1| putative cupin superfamily protein [Arthrobacter aurescens TC1]
          Length = 388

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 91/178 (51%), Gaps = 12/178 (6%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           +W  + +G ++         +P+   +  L    G  V AN Y TPP  +GF  HYD  +
Sbjct: 93  LWRKFADGATLVLQALHRTWEPVSSFSTQLSTELGHPVQANAYITPPQNRGFDDHYDVHD 152

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYL 179
            F+LQ+EG K+W ++ P  VD   P  S P        ++   G   +   LEPGD+LYL
Sbjct: 153 VFVLQIEGTKRWIIHEPVHVD---PLRSQPWTDRRSAVAEAAQGKAYIDTVLEPGDVLYL 209

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           PRG++H A+    + S+H+T+ V+  T     L + + +A  AA   D E RR LP+G
Sbjct: 210 PRGWLH-AAEAQGKVSIHLTLGVHSWTR--HALAEHLAQAALAALCDDPEVRRSLPLG 264


>gi|324512664|gb|ADY45238.1| Lysine-specific demethylase NO66 [Ascaris suum]
          Length = 452

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 69/108 (63%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D +EK+ L + RK   YY NL ++KK  +IL+ + IE+  N+++ +Y+DG+R T +  G+
Sbjct: 341 DIFEKKALVVQRKNPKYYGNLYSTKKFIEILQSHYIEYGTNVNVAVYKDGERFTPNGIGK 400

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
             P+ +      G S++F+NPQT+   +  L  + QELFG FVGANT+
Sbjct: 401 VYPEEIRAQLNAGHSVQFVNPQTFCDSVWHLCDTTQELFGSFVGANTF 448


>gi|357399326|ref|YP_004911251.1| hypothetical protein SCAT_1724 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|386355365|ref|YP_006053611.1| cupin 4 family protein [Streptomyces cattleya NRRL 8057 = DSM
           46488]
 gi|337765735|emb|CCB74444.1| protein of unknown function [Streptomyces cattleya NRRL 8057 = DSM
           46488]
 gi|365805873|gb|AEW94089.1| cupin 4 family protein [Streptomyces cattleya NRRL 8057 = DSM
           46488]
          Length = 298

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 83/310 (26%), Positives = 137/310 (44%), Gaps = 54/310 (17%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDG----QRQTMDVE 58
           W  EPL   R+    +   +T  ++D ++    +  +   ++ L +DG    + +  D +
Sbjct: 17  WPHEPLRF-RRDPAEFGRWLTLAEVDTLIDAGCLAMR---NVVLLKDGRVSEKWEYADPD 72

Query: 59  GRALPQI--VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELF-------GCFVGANTYQ 109
              +P+   V +H + G S+        ++ L+ L  S+ +L+       G  V  N Y 
Sbjct: 73  DPTMPRCGAVREHIKAGGSVS-------LRELEHLKPSVADLYRALCRETGYGVHINAYL 125

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM---VDEYLPRYSSPNFSQEE----- 161
           TPP  QG   H+D     ILQ+ G+K W V+ P +   V EY   +    F+ E+     
Sbjct: 126 TPPGCQGLKYHFDPYVTLILQIHGRKTWPVHPPFVANPVQEY-GSFHLIGFTPEQRHYLA 184

Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT---AWIDLLEKAMPK 218
              P  T TL PGD+L+LPRGY+H   T  +E SLH+T++  ++T   A   L E  + +
Sbjct: 185 HTPPAATFTLAPGDVLWLPRGYVHSPYTEGSETSLHITVAFKERTYHWAAERLAEDILRR 244

Query: 219 ALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDD 278
           AL      D E R  LP             P  +  D L + E  +D L   +  +D + 
Sbjct: 245 AL-----ADPEMREELP-------------PHLVTLDTLPVVEQARDYLIGALLSLDPET 286

Query: 279 GVDEMGKQLM 288
            V  + +  +
Sbjct: 287 AVKYLQRSAL 296


>gi|397631554|gb|EJK70195.1| hypothetical protein THAOC_08467 [Thalassiosira oceanica]
          Length = 462

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 14/183 (7%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           Y +GCS+   +   +   +  L   LQ  F   V AN+Y TPP +     H DD +  ++
Sbjct: 114 YLDGCSVIVNHADHHHSKIAGLCDELQLTFP-HVYANSYLTPPRSSAVKSHSDDRDVLVI 172

Query: 130 QLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPIL---TVTLEPGDLLYLP 180
           Q+ G+K WKVY       P   +E + +   P  S+E +   I     V+LEPG +LY+P
Sbjct: 173 QVLGRKNWKVYKTVPVSFP-FANEQVGKEGRPPVSREVLDGGICFDGKVSLEPGHVLYMP 231

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
           RG++H+AST ++E S H+TI++     W   +   M + ++      +E+R  L IG   
Sbjct: 232 RGFVHEASTDSDEPSFHITIAIATHD-WC--MSTVMTEMVRERLNECIEYRIPLSIGAGA 288

Query: 241 YAG 243
           Y G
Sbjct: 289 YQG 291


>gi|377573080|ref|ZP_09802154.1| hypothetical protein MOPEL_009_00090 [Mobilicoccus pelagius NBRC
           104925]
 gi|377538115|dbj|GAB47319.1| hypothetical protein MOPEL_009_00090 [Mobilicoccus pelagius NBRC
           104925]
          Length = 407

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 60/173 (34%), Positives = 84/173 (48%), Gaps = 7/173 (4%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           + +G +I          P+ +    L    G  V AN Y TPP +QGF+ HYD  + F+L
Sbjct: 102 FADGATIVLQGLHRTWAPIAEFVRDLAADLGHPVQANAYITPPQSQGFSAHYDVHDVFVL 161

Query: 130 QLEGKKKWKVYLP----RMVDEYLPRYSSPNF-SQEEIGTPILTVTLEPGDLLYLPRGYI 184
           QL G+K+W ++ P     M DE        +  ++   G P L V L PGD LYLPRGY+
Sbjct: 162 QLSGEKRWVIHEPVLQWPMRDEPWETGGRRDLVARAAEGEPALDVVLRPGDCLYLPRGYL 221

Query: 185 HQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           H A+T     S H+T+ V+  T    L    M +AL  AG        G+ +G
Sbjct: 222 H-AATALGGVSAHLTLGVHTWTR-HHLATDVMRQALARAGDLRAPLASGVDVG 272


>gi|295840576|ref|ZP_06827509.1| cupin superfamily protein [Streptomyces sp. SPB74]
 gi|295828066|gb|EFG65806.1| cupin superfamily protein [Streptomyces sp. SPB74]
          Length = 395

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 72/253 (28%), Positives = 114/253 (45%), Gaps = 30/253 (11%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTMDV 57
           W + P  ++R+  ++ ++L +S  +D++     LR   +   K+   TL E        V
Sbjct: 31  WGRTP-SLTRRASDF-SDLFSSAAVDELVSRRGLRTPFLRVAKD-GATLPESSFTAPGGV 87

Query: 58  EGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
                 Q+    +W  + +G ++         +P+    + L    G  V AN Y TPP 
Sbjct: 88  GATIADQLDDTALWRAFADGATLVLQALHRTWEPVGTFTSRLATELGHPVQANAYVTPPQ 147

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPIL 167
            QGF  HYD  + F+LQ+ G K+W ++ P +     P  + P        ++   G P+L
Sbjct: 148 NQGFDAHYDVHDVFVLQITGTKRWLIHEPVVT---APTRAQPWTAHRAAVAEAAAGPPVL 204

Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT---AWIDLLEKAMPKALQAAG 224
              LEPGD+LYLPRG++H A       S H+T+ V+  T       L E A+     A  
Sbjct: 205 DTVLEPGDVLYLPRGWLHSA-VAQGTVSTHLTLGVHPWTRHAVAAQLAETAL-----ATL 258

Query: 225 ATDLEFRRGLPIG 237
             D E R  LP+G
Sbjct: 259 REDPELRTSLPLG 271


>gi|159037898|ref|YP_001537151.1| cupin 4 family protein [Salinispora arenicola CNS-205]
 gi|157916733|gb|ABV98160.1| Cupin 4 family protein [Salinispora arenicola CNS-205]
          Length = 432

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 57/136 (41%), Positives = 75/136 (55%), Gaps = 12/136 (8%)

Query: 106 NTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEI 162
           N Y TP  +QGFA HYD  + F+LQ++G+K W+++ P + D  E  P     +  S    
Sbjct: 163 NAYLTPAGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLPDPLERQPWGGRADEVSATAT 222

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH-SLHVTISVYQKTAWIDLLEKAMPKALQ 221
           G P L VTL PGD LYLPRG++H A     EH SLH+T+ V   T +  L+E+     L 
Sbjct: 223 GAPALDVTLAPGDALYLPRGWLHSAQ--AQEHSSLHLTVGVRALTRYT-LVEE-----LL 274

Query: 222 AAGATDLEFRRGLPIG 237
           A  A D   R  LP G
Sbjct: 275 ALAAEDQRLRATLPFG 290


>gi|81299184|ref|YP_399392.1| hypothetical protein Synpcc7942_0373 [Synechococcus elongatus PCC
           7942]
 gi|81168065|gb|ABB56405.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 428

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 81/316 (25%), Positives = 143/316 (45%), Gaps = 41/316 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W ++ +YI+     +  +L +   ++D+L  N   F+++ ++    DG+     +    
Sbjct: 20  YWAQQSVYIAGDSLRF-QSLFSWNHLNDLL--NYQTFRES-ELRFSRDGE----SLPAGD 71

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
            P +     +EG ++        +  L+ L  +L++ FG     N Y +P   QGF  HY
Sbjct: 72  NPTLWRSRLQEGATLVLNGVHHRVPALKHLATNLRQEFGYRCHINLYSSPAQQQGFDCHY 131

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---TPILTVTLEPGDLLY 178
           D  +  ILQ+EG+K+W +Y      E LP  ++   S + +     P L   L PGDLLY
Sbjct: 132 DTHDVLILQIEGEKEWLIY-----PETLPYPTADQPSYDRLPPEEPPYLQQVLSPGDLLY 186

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           +PRG+ H A       SLH+TI ++  T   W++ L++ +             +R+GLP+
Sbjct: 187 IPRGHWHYA-IAQETASLHLTIGIHTATGLDWVNWLQQQL--------RDQPHWRQGLPL 237

Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP--P 294
                AG     P       L ++ +L+ L  +L+ Y+     +D+  + L     P  P
Sbjct: 238 -----AGSCNFDP-------LKLRGHLESLRDQLITYLQEPQAIDDYLQYLSWQDQPHLP 285

Query: 295 VLSPEELQCSVFENGL 310
           +  P +L       GL
Sbjct: 286 IQLPLQLHGDPLAQGL 301


>gi|386848916|ref|YP_006266929.1| cupin 4 family protein [Actinoplanes sp. SE50/110]
 gi|359836420|gb|AEV84861.1| cupin 4 family protein [Actinoplanes sp. SE50/110]
          Length = 388

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 68/234 (29%), Positives = 110/234 (47%), Gaps = 21/234 (8%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDN--------VIEFKKNIDITLYEDGQRQ 53
           +W K+PL +SR +  +  +L++   +D++L           V +  K +    Y  G   
Sbjct: 21  YWGKQPL-LSRGRAGF-TDLLSVPAVDELLSRRGLRTPFLRVAQHGKVLPAGRYTGGGGA 78

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
             +V  +   + V   +  G ++          PL +    L       +  N Y TPP 
Sbjct: 79  GAEVGDQVHDERVMRLFAGGATLVLQGLHRLWPPLIEFTGRLGAELNRPLQVNAYLTPPG 138

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPIL 167
           +QGFA HYD  + F+LQ EG+K+W ++ P + D   P  + P        +    GTP +
Sbjct: 139 SQGFATHYDTHDVFVLQCEGRKRWIIHPPVLSD---PIENQPWGGHADEVAATAEGTPAI 195

Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
              LEPGD LYLPRG++H A T     SLH+T+ +   T +  L+E+ +  A+Q
Sbjct: 196 DTVLEPGDALYLPRGWLHSART-EQVRSLHLTVGIRSLTRYA-LVEELLAMAVQ 247


>gi|326435323|gb|EGD80893.1| hypothetical protein PTSG_01480 [Salpingoeca sp. ATCC 50818]
          Length = 518

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/211 (30%), Positives = 93/211 (44%), Gaps = 20/211 (9%)

Query: 5   KEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQ 64
           + PL+I+R   NYY  L TS      L       K      L  +G     + E   L  
Sbjct: 90  RRPLHIARDDPNYYGQLFTSSDFFRFL--TFYRHKYEAGFNLVRNGDTVFANDEPVQLGD 147

Query: 65  IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
           ++    R G +       T  + +  L+ +L+     F   N Y TP  ++GF  H D  
Sbjct: 148 VLDSIARNGSTAVMTAVHTRDEHIWALSWALERETSLFSATNIYLTPAQSRGFIFHQDVH 207

Query: 125 EAFILQLEGKKKWKVYLP------------RMVDEYLPRYSSPNFSQEE-IGTPILTVTL 171
              ILQLEGKK W+++ P                + L R+    F +EE +G      TL
Sbjct: 208 AVIILQLEGKKHWRLFTPPEERSFPWYHKQEAASQRLTRF----FKEEELVGLQEFNFTL 263

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
             GD L +PRG IH A T ++EHSLH+T+S+
Sbjct: 264 HAGDTLIVPRGVIHAAET-SDEHSLHLTVSL 293


>gi|254383725|ref|ZP_04999074.1| conserved hypothetical protein [Streptomyces sp. Mg1]
 gi|194342619|gb|EDX23585.1| conserved hypothetical protein [Streptomyces sp. Mg1]
          Length = 394

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 3/136 (2%)

Query: 72  EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
           EG S+     +    P+++  A L+  F   V  N Y +    +GF  H+DD +  +LQL
Sbjct: 98  EGASLVIDAIEQIHPPVREAAAGLERFFRTPVQVNAYASWTAEEGFGTHWDDHDVVVLQL 157

Query: 132 EGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVT 191
           EG K+WKVY P       P +      +   G PI  + L PGD+LYLPRG+ H  S   
Sbjct: 158 EGSKRWKVYGP---TRQAPAWRDVETPEVPTGDPIADIVLTPGDVLYLPRGWWHAVSADQ 214

Query: 192 NEHSLHVTISVYQKTA 207
              SLH+T  +  +T 
Sbjct: 215 GTASLHLTFGLATQTG 230


>gi|453051139|gb|EME98654.1| hypothetical protein H340_20228 [Streptomyces mobaraensis NBRC
           13819 = DSM 40847]
          Length = 391

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 116/289 (40%), Gaps = 24/289 (8%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
           R  P ++     EG S+         +P+  L    ++LF     AN Y +    +GF  
Sbjct: 83  RLHPALLHARLAEGASLALDQADELHRPVAALAEECEQLFRTRFQANAYASWTATEGFGT 142

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
           H+DD +  ++QLEG K WK+Y P       P Y      Q     PI  + L+PGDLLYL
Sbjct: 143 HWDDHDVLVVQLEGAKHWKIYGPTRPH---PLYRDTTEPQAPPADPIADLVLQPGDLLYL 199

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
           PRG+ H  +      SLH+T   +Q     DLL     + L  A     EFR  LP+   
Sbjct: 200 PRGWWHAVTADQGVCSLHLTYG-FQTHTPSDLLTFVSDQLLGHA-----EFRADLPL--- 250

Query: 240 RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD--EMGKQLMHDALPPV-- 295
             AG         +  RL ++E L D  A L  Y +  DG D   M   L H    P   
Sbjct: 251 -LAGREAQAAFVTRIGRL-VREQLAD-PALLTRYAESMDGRDTGRMTPSLPHLTAVPADP 307

Query: 296 -----LSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
                L+      +  E+G+ ++  G  Y           RLV     R
Sbjct: 308 DIRVRLTTARALLTTTEDGVHLAAAGHSYEFAPAAGPVLARLVSGGEHR 356


>gi|378718179|ref|YP_005283068.1| putative cupin superfamily protein [Gordonia polyisoprenivorans
           VH2]
 gi|375752882|gb|AFA73702.1| putative cupin superfamily protein [Gordonia polyisoprenivorans
           VH2]
          Length = 411

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 59/179 (32%), Positives = 88/179 (49%), Gaps = 18/179 (10%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           V   + +G +I          P+  L   L +  G  V AN Y TPP+ +GF PHYD  +
Sbjct: 90  VLAQFADGATIVLQGLHRLWPPVIDLVRGLVDDLGHPVQANAYVTPPSNRGFDPHYDVHD 149

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYL 179
            F+LQ+ G+K+W V+ P  +D   P  S P      +  +     P++   L  GD LYL
Sbjct: 150 VFVLQISGEKQWTVHAPVHLD---PVDSQPWTDHREDIQRRTDDPPVIDTVLRSGDALYL 206

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL-EFRRGLPIG 237
           PRG++H A  +    S+H+TI     TA +D+L   + +       TD+ E R  LP+G
Sbjct: 207 PRGWVHSARALGTT-SIHLTIGTAALTA-LDVLGAVVEQ------LTDIRELRESLPMG 257


>gi|359766385|ref|ZP_09270197.1| hypothetical protein GOPIP_038_00190 [Gordonia polyisoprenivorans
           NBRC 16320]
 gi|359316221|dbj|GAB23030.1| hypothetical protein GOPIP_038_00190 [Gordonia polyisoprenivorans
           NBRC 16320]
          Length = 352

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 86/175 (49%), Gaps = 18/175 (10%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           + +G +I          P+  L   L +  G  V AN Y TPP+ +GF PHYD  + F+L
Sbjct: 94  FADGATIVLQGLHRLWPPVIDLVRGLVDDLGHPVQANAYVTPPSNRGFDPHYDVHDVFVL 153

Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYLPRGY 183
           Q+ G+K W V+ P  +D   P  S P      +  +     P++   L  GD LYLPRG+
Sbjct: 154 QISGEKHWTVHAPVHLD---PVDSQPWTDHREDIERRTDDPPVIDTVLRSGDALYLPRGW 210

Query: 184 IHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL-EFRRGLPIG 237
           +H A  +    S+H+TI     TA +D+L   + +       TD+ E R  LP+G
Sbjct: 211 VHSARALGTT-SIHLTIGTAALTA-LDVLGAVVEQ------LTDIRELRESLPMG 257


>gi|113476595|ref|YP_722656.1| cupin 4 [Trichodesmium erythraeum IMS101]
 gi|110167643|gb|ABG52183.1| Cupin 4 [Trichodesmium erythraeum IMS101]
          Length = 385

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 66/214 (30%), Positives = 109/214 (50%), Gaps = 14/214 (6%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W K+ + IS K +  + +L + +K++ +L  + I++    D+ L  DG +   + E R
Sbjct: 19  NNWTKKAIAISNKGEKDFTDLFSWEKLNYLLNFHQIKYP---DVRLAFDG-KVLEEKENR 74

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
              Q       +G ++        I  +    + L    G     N Y +  + +GF+PH
Sbjct: 75  NFTQWC----EKGATLILDQIHRRIPEVAIFTSKLSYELGYPTQVNAYCSWSSKKGFSPH 130

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEY-LPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
           YD  + FILQ+EG K+W VY      +Y LP   S +F+  E     L+  L PGD+LY+
Sbjct: 131 YDTHDVFILQVEGNKQWYVYNDTF--KYPLPNQKSSSFTPPE-KEAYLSCILHPGDVLYI 187

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLE 213
           PRG+ H A T   E S+H+T+ ++  T  +DLLE
Sbjct: 188 PRGHWHYAVT-KEEPSIHLTLGIHSSTG-VDLLE 219


>gi|406575710|ref|ZP_11051405.1| cupin superfamily protein [Janibacter hoylei PVAS-1]
 gi|404554923|gb|EKA60430.1| cupin superfamily protein [Janibacter hoylei PVAS-1]
          Length = 414

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/175 (35%), Positives = 85/175 (48%), Gaps = 14/175 (8%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           + +G +I          P+  L   L    G  V AN Y TP   +GFA HYD  + F+L
Sbjct: 106 FADGHTIVLQGLHRTHPPVLALAQELAADLGHPVQANAYVTPAANRGFAAHYDVHDVFVL 165

Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYLPRGY 183
           Q EG K+W ++ P  V E+ P    P         Q     P L VTLEPGD+LY+PRG+
Sbjct: 166 QTEGTKRWSLHAP--VHEH-PLRDQPWQDRRSQVEQAAAAEPYLDVTLEPGDVLYVPRGW 222

Query: 184 IHQASTVTNEHSLHVTISVYQKTAW-IDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           +H A+ +    S+H+TI V+    W    L +A+    + A A   E R  LP G
Sbjct: 223 LHSATALGGV-SVHLTIGVH---VWHRHHLAEAVLDGTRRALAEQPELRTSLPPG 273


>gi|449457281|ref|XP_004146377.1| PREDICTED: uncharacterized protein LOC101213846 [Cucumis sativus]
          Length = 541

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 54/142 (38%), Positives = 80/142 (56%), Gaps = 11/142 (7%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFI 128
           ++EG +I     +   + +  ++ +L  LFG   VGAN Y TPP +QG A HYDD   F+
Sbjct: 201 FKEGYTIALRGMEFRHEKIAAISNTLASLFGQPSVGANMYLTPPGSQGLARHYDDHCVFV 260

Query: 129 LQLEGKKKWKVYLPRMVDEYLPR-YSSPNF-SQEEIGTPILT---VTLEPGDLLYLPRGY 183
            QL G K+W V+ P    +YLPR Y S  F S  ++ +P+       L  GD+LY+PRG+
Sbjct: 261 CQLAGSKQWTVFSPP--RKYLPRLYDSHEFPSCLDVESPLAVGRKFFLREGDVLYIPRGF 318

Query: 184 IHQASTVT---NEHSLHVTISV 202
           +H+A T     +  SLH+T  +
Sbjct: 319 LHEARTEIGGPDGSSLHLTFGI 340


>gi|302523562|ref|ZP_07275904.1| cupin 4 family protein [Streptomyces sp. SPB78]
 gi|302432457|gb|EFL04273.1| cupin 4 family protein [Streptomyces sp. SPB78]
          Length = 395

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 112/255 (43%), Gaps = 34/255 (13%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTMDV 57
           W + P  ++R   ++ +++ +S  +D++     LR   +   K+   TL E        V
Sbjct: 31  WGRLP-SLTRGAGDF-SDIFSSAAVDELVSRRGLRTPFLRIAKD-GATLPEPSFTAPGGV 87

Query: 58  EGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
                 Q+    +W  + +G ++         +P+    + L    G  V AN Y TPP 
Sbjct: 88  GATIADQLDDTALWRAFADGATLVLQALHRTWEPVGTFTSRLATELGHPVQANAYVTPPQ 147

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---------- 163
            QGF  HYD  + F+LQ+ G K+W ++ P +        ++P   Q   G          
Sbjct: 148 NQGFDAHYDVHDVFVLQITGTKRWLIHEPVV--------TAPTREQPWTGHRAAVAEAAS 199

Query: 164 -TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
             P+L   LEPGD+LYLPRG++H A       S+H+T+ V+  T     L   + +   A
Sbjct: 200 GAPVLDTVLEPGDVLYLPRGWLHSA-VAQGAVSVHLTLGVHPWTR--HALAAQLAETALA 256

Query: 223 AGATDLEFRRGLPIG 237
           A   D   R  LP+G
Sbjct: 257 ALREDPAMRTSLPLG 271


>gi|318060611|ref|ZP_07979334.1| putative cupin superfamily protein [Streptomyces sp. SA3_actG]
 gi|318075845|ref|ZP_07983177.1| putative cupin superfamily protein [Streptomyces sp. SA3_actF]
          Length = 395

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 22/183 (12%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           +W  + +G ++         +P+    + L    G  V AN Y TPP  QGF  HYD  +
Sbjct: 100 LWRAFADGATLVLQALHRTWEPVGTFTSRLATELGHPVQANAYVTPPQNQGFDAHYDVHD 159

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG-----------TPILTVTLEPG 174
            F+LQ+ G K+W ++ P +        ++P   Q   G            P+L   LEPG
Sbjct: 160 VFVLQITGTKRWLIHEPVV--------TAPTREQPWTGHRAAVAEAASGAPVLDTVLEPG 211

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           D+LYLPRG++H A       S+H+T+ V+  T     L   + +   AA   D   R  L
Sbjct: 212 DVLYLPRGWLHSA-VAQGAVSVHLTLGVHPWTR--HALAAQLAETALAALREDPAMRTSL 268

Query: 235 PIG 237
           P+G
Sbjct: 269 PLG 271


>gi|333022517|ref|ZP_08450581.1| putative cupin superfamily protein [Streptomyces sp. Tu6071]
 gi|332742369|gb|EGJ72810.1| putative cupin superfamily protein [Streptomyces sp. Tu6071]
          Length = 395

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/258 (27%), Positives = 114/258 (44%), Gaps = 40/258 (15%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTMDV 57
           W + P  ++R   ++ +++ +S  +D++     LR   +   K+   TL E        V
Sbjct: 31  WGRLP-SLTRGAGDF-SDIFSSAAVDELVSQRGLRTPFLRVAKD-GATLPEPSFTAPGGV 87

Query: 58  EGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
                 Q+    +W  + +G ++         +P+    + L    G  V AN Y TPP 
Sbjct: 88  GATIADQLDDTALWRAFADGATLVLQALHRTWEPVGTFTSRLATELGHPVQANAYVTPPQ 147

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---------- 163
            QGF  HYD  + F+LQ+ G K+W ++ P +        ++P   Q   G          
Sbjct: 148 NQGFDAHYDVHDVFVLQITGTKRWLIHEPVV--------TAPTREQPWTGHRAAVAEAAS 199

Query: 164 -TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA---WIDLLEKAMPKA 219
             P+L   LEPGD+LYLPRG++H A       S+H+T+ V+  T     + L E A+   
Sbjct: 200 GAPVLDTVLEPGDVLYLPRGWLHSA-VAQGAVSVHLTLGVHPWTRHALAVQLAETAL--- 255

Query: 220 LQAAGATDLEFRRGLPIG 237
             AA   D   R  LP+G
Sbjct: 256 --AALREDPAMRTSLPLG 271


>gi|443291200|ref|ZP_21030294.1| Transcription factor jumonji jmjC domain-containing protein
           [Micromonospora lupini str. Lupac 08]
 gi|385885602|emb|CCH18401.1| Transcription factor jumonji jmjC domain-containing protein
           [Micromonospora lupini str. Lupac 08]
          Length = 424

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/251 (27%), Positives = 108/251 (43%), Gaps = 23/251 (9%)

Query: 2   HWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYED 49
           HW   PL +SR  +      + +L +    D++L     R   +   K+   +    +  
Sbjct: 39  HWGHTPL-LSRAAELPDPAGFTDLFSPADADELLSRRGLRTPFLRVAKDGQLVPAARWTG 97

Query: 50  GQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
           G     ++  + L + V + Y  G ++          PL      L       +  N Y 
Sbjct: 98  GGGAGAEIGDQVLDERVLEQYAAGATLVLQGLHRIWPPLVDFARDLGLALNQPLQINAYL 157

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY---SSPNFSQEEIGTPI 166
           TP  +QGFA HYD  + F+LQ++G+K W+++ P + D    +     +   S    G   
Sbjct: 158 TPAGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLPDPLEKQTWGGRADEVSATAQGPAA 217

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           L V L PGD LYLPRG++H A    +  SLH+T+ +   T +      A+ + L A  A 
Sbjct: 218 LDVVLAPGDALYLPRGWLHSAQ-AQDASSLHLTVGIRALTRY------ALVEELLALAAE 270

Query: 227 DLEFRRGLPIG 237
           D   R  LP G
Sbjct: 271 DQRLRASLPFG 281


>gi|449525377|ref|XP_004169694.1| PREDICTED: lysine-specific demethylase NO66-like [Cucumis sativus]
          Length = 347

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 97/194 (50%), Gaps = 16/194 (8%)

Query: 18  YNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIR 77
           Y   I   + D+ L+  +  F+KN +    E      +    +       + ++EG +I 
Sbjct: 154 YQQDIRVLRTDECLKREIHFFQKNFEPCCIEGPHFLKLHDALKC-----EEAFKEGYTIA 208

Query: 78  FLNPQTYIKPLQQLNASLQELFG-CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKK 136
               +   + +  ++ +L  LFG   VGAN Y TPP +QG A HYDD   F+ QL G K+
Sbjct: 209 LRGMEFRHEKIAAISNTLASLFGQPSVGANMYLTPPGSQGLARHYDDHCVFVCQLAGSKQ 268

Query: 137 WKVYLPRMVDEYLPR-YSSPNF-SQEEIGTPILT---VTLEPGDLLYLPRGYIHQASTVT 191
           W V+ P    +YLPR Y S  F S  ++ +P+       L  GD+LY+PRG++H+A T  
Sbjct: 269 WTVFSPP--RKYLPRLYDSHEFPSCLDVESPLAVGRKFFLREGDVLYIPRGFLHEARTEI 326

Query: 192 ---NEHSLHVTISV 202
              +  SLH+T  +
Sbjct: 327 GGPDGSSLHLTFGI 340


>gi|262202804|ref|YP_003274012.1| cupin [Gordonia bronchialis DSM 43247]
 gi|262086151|gb|ACY22119.1| Cupin 4 family protein [Gordonia bronchialis DSM 43247]
          Length = 414

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 66/214 (30%), Positives = 101/214 (47%), Gaps = 13/214 (6%)

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  V   +  G +I          P+     ++ +  G  V AN Y TPP  +GF PHYD
Sbjct: 87  PAGVLREFAAGATIVLQGLHRLWPPVIDFVRAMVDDLGHPVQANAYVTPPGNRGFDPHYD 146

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI----GTPILTVTLEPGDLLY 178
             + F+LQ+ G K+W+V+ P        +  + + +Q E       P +   L PGD LY
Sbjct: 147 VHDVFVLQVAGTKRWRVHRPVHTHPLATQPWTDHRAQIERRASDDAPEIEAVLSPGDALY 206

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
           LPRG+IH A  +  + S+H+TI V   T       + +  A+ A      EFR+ LP+G 
Sbjct: 207 LPRGWIHSADAL-GDTSIHLTIGVGAVTV------RDVVAAIVAELDDCAEFRQSLPLG- 258

Query: 239 LRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE 272
           +   G  +  P+  +A   A+ E L+D  A + E
Sbjct: 259 IDLTGRDQTVPIATKA-MAAVVERLRDHAADVGE 291


>gi|357162427|ref|XP_003579407.1| PREDICTED: uncharacterized protein LOC100839961 [Brachypodium
           distachyon]
          Length = 803

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/174 (33%), Positives = 93/174 (53%), Gaps = 26/174 (14%)

Query: 46  LYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVG 104
            ++DG   T+ ++  A  +   D ++ G SI     +   + +  + ++L +LFG   VG
Sbjct: 428 FFDDG---TVFLDAAAFIEDCKDAFKSGFSIALRGMEFRSEKVAAIASALADLFGQPSVG 484

Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY------LPRMVDEYLPRYSSPNFS 158
           AN Y +PP +QG + HYDD    +LQL G KKWK++      LPR+   Y P + S +  
Sbjct: 485 ANLYFSPPRSQGLSRHYDDHCVLVLQLVGHKKWKIWPSTKSVLPRL---YEP-FDSLDGL 540

Query: 159 QEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNE----------HSLHVTISV 202
            ++ G  I    L  GD++Y+PRGYIH+A T   E          +SLH+T+++
Sbjct: 541 VDDSGGRI--EVLREGDIMYVPRGYIHEAHTAVAEEEYEVNASSNYSLHLTLAI 592


>gi|443672956|ref|ZP_21138032.1| Cupin 4 family protein [Rhodococcus sp. AW25M09]
 gi|443414441|emb|CCQ16370.1| Cupin 4 family protein [Rhodococcus sp. AW25M09]
          Length = 420

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 70/251 (27%), Positives = 115/251 (45%), Gaps = 25/251 (9%)

Query: 3   WEKEP-LYISRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQRQ 53
           W   P L  S      +++L+T   +D+++     R   +   K+   +D T +      
Sbjct: 33  WGIAPRLTRSADLPETFDDLMTLDAVDELIAERGVRTPFVRMAKDGRLVDKTHFTASGGF 92

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
             +V  +     V + +  G ++          PL      L    G     N+Y TP  
Sbjct: 93  GAEVTDQIDSGAVLESFAAGNTLVLQGLHRLWPPLIHFVGDLVRELGHPAQVNSYVTPAA 152

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV-----DEYLPRYSSPNFSQEEIGTPILT 168
           +QGF+PHYD  + F++Q+ G K+W V+ P  V       +  R S+    ++   +P L 
Sbjct: 153 SQGFSPHYDVHDVFVVQIAGTKRWIVHSPVHVHPLNNQPWSDRRSA--VEEQASNSPALD 210

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL 228
           V LEPGD+LYLPRG+IH A  +  + ++H+T+ V     + D+   A+ KAL      D+
Sbjct: 211 VVLEPGDVLYLPRGWIHSAQAL-GDTTIHLTVGVASFNDY-DIAHHAL-KALG-----DV 262

Query: 229 EFRRG-LPIGY 238
           E  R  LP G+
Sbjct: 263 ESVRAPLPAGF 273


>gi|428211992|ref|YP_007085136.1| cupin [Oscillatoria acuminata PCC 6304]
 gi|428000373|gb|AFY81216.1| Cupin superfamily protein [Oscillatoria acuminata PCC 6304]
          Length = 397

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 151/338 (44%), Gaps = 50/338 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W +  L+I  K ++ +  L    +++ +L  + I +  N+  +L  DG+         
Sbjct: 19  ENWTEIALFIPGKSRDKFAGLFGWNQLNQLLNYHFIPYP-NLRFSL--DGESLP------ 69

Query: 61  ALPQIVW-DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
           A P   W +   +G ++   +    +  L++L A ++E  G     N Y +P   QGF  
Sbjct: 70  ATPSHQWLERLHQGATLIIDSVHERVPELEKLVADVREQTGYRSQINLYCSPSDQQGFNC 129

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---TPILTVTLEPGDL 176
           HYD  +  ILQ+EG+K W V+      E +P   +   S+++I     P L   L+PGDL
Sbjct: 130 HYDTHDVLILQIEGEKDWFVF-----PETVPFPVTGMRSKDQIPPDVPPYLECRLQPGDL 184

Query: 177 LYLPRGYIHQASTVTNEH------SLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDL 228
           LY+PRG+ H A   +N        SLH+T+ +  +T   W++ L K + +  Q       
Sbjct: 185 LYIPRGHWHYAIASSNPSRDDYSPSLHLTLGIDCQTGLDWMNWLGKELQENTQ------- 237

Query: 229 EFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLM 288
            +R  L              PV    DR A++  L+ L  +L+E+++  + +      L 
Sbjct: 238 -WRASL--------------PVIPSGDRTAVQPYLEKLRDRLLEFLEDPETITGYLDDLN 282

Query: 289 H-DALP-PVLSPEELQCSVFENGLRMSQTGEVYNATQI 324
           + D  P P   P +L   +F+ GL         +  QI
Sbjct: 283 YRDRTPLPFAFPHQLGSQIFDFGLETRFIPSALHPVQI 320


>gi|444307228|ref|ZP_21142971.1| cupin superfamily protein [Arthrobacter sp. SJCon]
 gi|443480438|gb|ELT43390.1| cupin superfamily protein [Arthrobacter sp. SJCon]
          Length = 367

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 55/178 (30%), Positives = 84/178 (47%), Gaps = 12/178 (6%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           +W  + +G ++          P+   ++SL    G  V AN Y TPP  +GF  HYD  +
Sbjct: 106 LWRTFADGATLVLQALHRTWAPVADFSSSLSTELGHPVQANAYITPPQNRGFDDHYDVHD 165

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI------LTVTLEPGDLLYL 179
            F+LQ+EG K+W ++ P   D   P    P   +             +   LEPGD+LYL
Sbjct: 166 VFVLQIEGTKRWIIHEPVHTD---PLRDQPWTDRRSAVAEAAKAEAYIDTVLEPGDVLYL 222

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           PRG++H A+    E S+H+T+ ++  T +    + A           D E RR LP+G
Sbjct: 223 PRGWLH-AAQAQGEVSIHLTLGIHSWTRYALAEQLAQAALAAL--CDDPEMRRTLPLG 277


>gi|400536533|ref|ZP_10800067.1| hypothetical protein MCOL_V219156 [Mycobacterium colombiense CECT
           3035]
 gi|400329546|gb|EJO87045.1| hypothetical protein MCOL_V219156 [Mycobacterium colombiense CECT
           3035]
          Length = 338

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 78/149 (52%), Gaps = 13/149 (8%)

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  V D   EG +I     ++Y++ +  L+ S++         N Y TPP + GFAPHYD
Sbjct: 31  PARVRDGLAEGYTIVVNGVESYLRTIASLSHSIEVELDFPTRVNAYVTPPESTGFAPHYD 90

Query: 123 DIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILT----VTLEPGDLL 177
             +  +LQ++G K W V   P +    + R       +  +GT  LT    V L+PGD+L
Sbjct: 91  PHDVLVLQIQGSKTWHVSDGPAVPPHEIQR-------RRGVGTDGLTSSTDVCLQPGDVL 143

Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           YLPRG +H A T   E S+H+TI ++  T
Sbjct: 144 YLPRGQVHSAET-HAEPSVHLTIGLHAPT 171


>gi|408826623|ref|ZP_11211513.1| cupin 4 family protein [Streptomyces somaliensis DSM 40738]
          Length = 299

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 88/177 (49%), Gaps = 28/177 (15%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           Y  G +IR  N Q  I  + Q++  +Q   G     +T+ TPP  QG   H+D   A I+
Sbjct: 86  YESGHTIRLGNLQRIIPAMAQVSEDIQRETGYSNYIHTFLTPPANQGLRHHWDQQMAVIV 145

Query: 130 QLEGKKKWKVYLPRM--------------VDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           Q+ G K+W+++ P +               D+Y+P +        E   P + V L+ G 
Sbjct: 146 QIAGTKQWQLWRPPVEAPMREYNESWRVWRDDYIPTW--------EAAGPDMEVDLKAGR 197

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
            L LPRG++H    V +EHS+H+T ++ ++T  + L+EK +  A++       +FRR
Sbjct: 198 SLLLPRGWVHNPHVVVDEHSIHLTFAIRERTP-MWLVEKLVAGAIETP-----DFRR 248


>gi|296170713|ref|ZP_06852288.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC
           BAA-614]
 gi|295894702|gb|EFG74436.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC
           BAA-614]
          Length = 403

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 65/214 (30%), Positives = 100/214 (46%), Gaps = 21/214 (9%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQT----MDVE 58
           W     +I R++  Y++ L+      D L D+V    +   + L + GQ +        +
Sbjct: 34  WAIRYHHIRRRRPGYFDGLLPGPSAVDALLDHV--QPEASAVRLVKGGQDRDPATYRRAD 91

Query: 59  GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           G   P  V D    G +I     + Y++ +  ++ S++         N Y TPP + GF 
Sbjct: 92  GGLDPVAVRDGLAAGYTIVVNGVERYLRTIASMSHSIEVELNFPTRVNAYVTPPASTGFV 151

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE--IGTPILT----VTLE 172
           PHYD  +  +LQ+EG K W V     V         P+  Q    +GT  LT    V L+
Sbjct: 152 PHYDPHDVLVLQIEGSKTWHVSDAAAV--------PPHEIQRRRGVGTDGLTSSTDVDLQ 203

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           PGD+LYLPRG +H A T  +E S+H+TI ++  T
Sbjct: 204 PGDVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 236


>gi|379059516|ref|ZP_09850042.1| cupin superfamily protein [Serinicoccus profundi MCCC 1A05965]
          Length = 401

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 8/146 (5%)

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           +W  + +G ++         +P+ + ++ L +  G  V AN Y TPP  +GF  HYD  +
Sbjct: 106 LWRAFADGATLVLQALHRTWEPVAEFSSRLSDELGHPVQANAYITPPQNRGFDDHYDVHD 165

Query: 126 AFILQLEGKKKWKVYLPRMVD-----EYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            F+LQ+ G K+W ++ P   D      +  R S+   S+       L V LEPGD LYLP
Sbjct: 166 VFVLQIAGTKRWIIHEPVHQDPLRDQPWTDRRSA--VSEAAARDAYLDVVLEPGDALYLP 223

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKT 206
           RG++H A+      S+H+T+ V+  T
Sbjct: 224 RGWLH-AAQAQGAVSIHLTLGVHTWT 248


>gi|148238895|ref|YP_001224282.1| hypothetical protein SynWH7803_0559 [Synechococcus sp. WH 7803]
 gi|147847434|emb|CAK22985.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 370

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 94/204 (46%), Gaps = 26/204 (12%)

Query: 48  EDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT 107
           ED   + +  +G      V   + EG SI       +   L  L +SL E  G     N 
Sbjct: 40  EDYDPKIVANDGHLCKPYVLSRFHEGASIVVNEVHRFSSQLMDLASSLSEELGVQCVVNA 99

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
           Y TPP +   +PH+D  + F LQ+ G+K+W       VD  L   ++ +  Q     PIL
Sbjct: 100 YLTPPQSVALSPHFDSHDIFALQVVGQKQW------FVDSELSSLTTKSTFQ-----PIL 148

Query: 168 T-----------VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
           +           V ++ GD++YLPRG +H A T++ + S+H+T+ +Y    W + +  A+
Sbjct: 149 SADQASSVDFREVVMDEGDVMYLPRGCVHHARTISCQ-SMHLTVGLY-PLEWSEFIASAV 206

Query: 217 PKALQAAGATDLEFRRGLPIGYLR 240
             A  A  A  L  R  +P+G  R
Sbjct: 207 EIAASAPEARGL--RTSVPLGLKR 228


>gi|119718597|ref|YP_925562.1| cupin 4 family protein [Nocardioides sp. JS614]
 gi|119539258|gb|ABL83875.1| Cupin 4 family protein [Nocardioides sp. JS614]
          Length = 403

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 100/226 (44%), Gaps = 39/226 (17%)

Query: 19  NNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIV-----WDHYREG 73
           ++L+TS  +    R   I   K+  + L E    +   + G+ L  +V        + +G
Sbjct: 75  DHLLTSSAV----RTPSIRLAKDGAV-LPESAYTRRASLAGKPLTGLVDARKALALFDDG 129

Query: 74  CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEG 133
            ++ F     Y  PL +L A L+   G    AN Y TPP AQGFA H D  + F+ Q  G
Sbjct: 130 ATVVFQGLHRYWPPLTRLIARLELELGHPCQANAYLTPPGAQGFAVHSDSHDVFVFQTAG 189

Query: 134 KKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNE 193
            K+W+V+ P   +E L                     LEPG  +YLP G  H A+   + 
Sbjct: 190 SKRWEVHGPDGPEEVL---------------------LEPGVSMYLPTGTPH-AARAQDT 227

Query: 194 HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
            SLHVT+ + Q T W  L+E+ +      AGA        LP GYL
Sbjct: 228 VSLHVTLGINQLT-WRGLVERTV------AGALGEVADEHLPAGYL 266


>gi|298713801|emb|CBJ27173.1| mina protein [Ectocarpus siliculosus]
          Length = 631

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/278 (25%), Positives = 120/278 (43%), Gaps = 73/278 (26%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR---------- 52
           WE++P+ I  +  N+Y+ L+    + D+  ++VIE      ++L  +G+           
Sbjct: 72  WERKPMVIRGRSPNHYDRLV-EFGLGDV--ESVIE----KGVSLSGEGEEPMVLGSDWNI 124

Query: 53  ------QTMDVEGRALP------------------QIVWDHYREGCSIRFLNPQTYIKPL 88
                 Q  D  G  +P                  +     +R G S+     Q   + +
Sbjct: 125 IKRVFVQADDGPGSNVPPAPGKTHAGGADRVVSLEEARKAFHRGGFSVVINRLQRRWRGV 184

Query: 89  QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPR 143
            + + +L+ + G  V AN Y TPP +QGF  H+D ++  ++QL G K W +Y     +PR
Sbjct: 185 LRASRALEHVLGQPVNANLYMTPPQSQGFEAHFDWMDGIVVQLTGSKTWILYHEMVTMPR 244

Query: 144 MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQA---------------- 187
              ++ P+ +       E+G PI  + L PGD++Y+PRG+ H+A                
Sbjct: 245 PDLKFKPKAA-------ELGEPIAVLDLYPGDMMYIPRGWPHEAAVNGTAARPGGGRGAS 297

Query: 188 --STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
             S+     SLHVT  V  +TA     E  +  AL+ A
Sbjct: 298 PSSSTARGPSLHVTFGV--ETALSGTYESLLHHALEVA 333


>gi|386353481|ref|YP_006051728.1| oxidoreductase domain-containing protein [Streptomyces cattleya
           NRRL 8057 = DSM 46488]
 gi|365811560|gb|AEW99775.1| oxidoreductase domain-containing protein [Streptomyces cattleya
           NRRL 8057 = DSM 46488]
          Length = 594

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/219 (28%), Positives = 100/219 (45%), Gaps = 33/219 (15%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM--DVEGR 60
           WE+E L +          L T   + D  R   +      ++++ ++GQ + +  D   R
Sbjct: 26  WEREALLLHGAAVPLVGRLFT---LADFER---LACGGGAELSVVQNGQARPVVADRADR 79

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQ-------ELFGCFVGANTYQTPPT 113
              + V   YR G ++      +   P+ +L   ++        L G  V AN Y TPP 
Sbjct: 80  VPGRHVLAAYRSGATLLLSCLPSRCPPVGELCRGIEFDLLRLGLLPGDPVSANAYLTPPR 139

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           AQGF  H+DD    ++QL G K+W+V+              P     E    +++ TLE 
Sbjct: 140 AQGFGRHHDDHSVLVVQLHGSKRWEVF-------------GPGSQGPEA---VVSATLEA 183

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           GD+L +P G+ H A T T+E SLH+TI ++   +W D L
Sbjct: 184 GDVLSIPAGFPHVAHT-TDEASLHLTIGLH-TLSWADAL 220


>gi|357407530|ref|YP_004919453.1| hypothetical protein SCAT_p0161 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|337762479|emb|CCB71187.1| protein of unknown function [Streptomyces cattleya NRRL 8057 = DSM
           46488]
          Length = 596

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/219 (28%), Positives = 100/219 (45%), Gaps = 33/219 (15%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM--DVEGR 60
           WE+E L +          L T   + D  R   +      ++++ ++GQ + +  D   R
Sbjct: 28  WEREALLLHGAAVPLVGRLFT---LADFER---LACGGGAELSVVQNGQARPVVADRADR 81

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQ-------ELFGCFVGANTYQTPPT 113
              + V   YR G ++      +   P+ +L   ++        L G  V AN Y TPP 
Sbjct: 82  VPGRHVLAAYRSGATLLLSCLPSRCPPVGELCRGIEFDLLRLGLLPGDPVSANAYLTPPR 141

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           AQGF  H+DD    ++QL G K+W+V+              P     E    +++ TLE 
Sbjct: 142 AQGFGRHHDDHSVLVVQLHGSKRWEVF-------------GPGSQGPEA---VVSATLEA 185

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           GD+L +P G+ H A T T+E SLH+TI ++   +W D L
Sbjct: 186 GDVLSIPAGFPHVAHT-TDEASLHLTIGLH-TLSWADAL 222


>gi|251832992|gb|ACT22499.1| cupin [Triticum aestivum]
          Length = 462

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 79/149 (53%), Gaps = 23/149 (15%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           R G SI     +   + +  + ++L +LFG   VGAN Y +PP +QG A HYDD    + 
Sbjct: 110 RNGFSIALRGMEFRSEKVAAIASALADLFGQPSVGANIYFSPPRSQGLARHYDDHCVLVW 169

Query: 130 QLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
           QL G+KKWK++      LPR+   Y P +S      +  G       L  GD++Y+PRG+
Sbjct: 170 QLLGRKKWKIWPNTKSVLPRL---YEPFHSLDGLVDDSGGR---VEVLHEGDIMYVPRGH 223

Query: 184 IHQAST----------VTNEHSLHVTISV 202
           +H+A T          V+  +SLH+T+++
Sbjct: 224 VHEAHTDVDEGESEVNVSTNYSLHLTLAI 252


>gi|219117481|ref|XP_002179535.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409426|gb|EEC49358.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 481

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 67/223 (30%), Positives = 106/223 (47%), Gaps = 34/223 (15%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           + +GCS+   +       +  L   LQ  F   V ANTY TPP +Q    H DD + F++
Sbjct: 134 FLDGCSVVTNHADRRSPWIAALCEDLQASFP-HVYANTYLTPPGSQTVPAHADDRDVFVI 192

Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT------------PILT-VTLEPGDL 176
           QL G K WK+Y  R +    P      +S E++G             P+LT   L PGD+
Sbjct: 193 QLVGCKAWKIY--RNIPVPYP------YSHEQVGKGELEVPGQVLDGPVLTDRVLAPGDV 244

Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP- 235
           LY+PRGY+H+A  V    S HVT+++  +  W   L   +  A +A+      +R+ +P 
Sbjct: 245 LYMPRGYVHEAHAVDGGPSFHVTVALATQD-WT--LAGLVTAATEASLTQQRSYRQAVPR 301

Query: 236 -IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLD 277
             G   +  +A    VD   D+ ++++ L D    L E + ++
Sbjct: 302 CFGRRSFESIA----VD---DKQSLQKQLDDAFRILREKITVE 337


>gi|221486010|gb|EEE24280.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 462

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/151 (34%), Positives = 82/151 (54%), Gaps = 11/151 (7%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           Y EGCS+        ++ LQ +   L + +   V A +Y TPP       H DD + F+L
Sbjct: 190 YLEGCSLVINQADRTLEILQSICQHLSKKYFSHVFAVSYLTPPRTHAVKTHTDDQDVFLL 249

Query: 130 QLEGKKKWKVYLP----RMVDEYL-PRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
           Q+ G K WK++ P     + +E L  R + P+    + G P+L   L+ GD+LY+PRG+ 
Sbjct: 250 QVWGSKAWKIWTPPQILPLTEEMLGKREAFPD----DPGKPLLEFVLKEGDILYIPRGFP 305

Query: 185 HQASTVTNEHSLHVTISV-YQKTAWIDLLEK 214
           H A T T E SLH+T++V   + A++  L++
Sbjct: 306 HAAVT-TEEPSLHITLTVPTAEFAYVTCLQR 335


>gi|237834811|ref|XP_002366703.1| hypothetical protein TGME49_040840 [Toxoplasma gondii ME49]
 gi|211964367|gb|EEA99562.1| hypothetical protein TGME49_040840 [Toxoplasma gondii ME49]
 gi|221503506|gb|EEE29197.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 508

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/151 (34%), Positives = 82/151 (54%), Gaps = 11/151 (7%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           Y EGCS+        ++ LQ +   L + +   V A +Y TPP       H DD + F+L
Sbjct: 190 YLEGCSLVINQADRTLEILQSICQHLSKKYFSHVFAVSYLTPPRTHAVKTHTDDQDVFLL 249

Query: 130 QLEGKKKWKVYLP----RMVDEYL-PRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
           Q+ G K WK++ P     + +E L  R + P+    + G P+L   L+ GD+LY+PRG+ 
Sbjct: 250 QVWGSKAWKIWTPPQILPLTEEMLGKREAFPD----DPGKPLLEFVLKEGDILYIPRGFP 305

Query: 185 HQASTVTNEHSLHVTISV-YQKTAWIDLLEK 214
           H A T T E SLH+T++V   + A++  L++
Sbjct: 306 HAAVT-TEEPSLHITLTVPTAEFAYVTCLQR 335


>gi|147783372|emb|CAN72964.1| hypothetical protein VITISV_016492 [Vitis vinifera]
          Length = 590

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 75/149 (50%), Gaps = 16/149 (10%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
           D Y +G +I     +   + +  +   L  LFG   VG N Y TPP +QG A HYDD   
Sbjct: 437 DAYNKGYTIALRGMEFRFESIAAIADGLASLFGQPSVGVNLYLTPPDSQGLARHYDDHCV 496

Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSP--NFSQEEIGTPILTVT---LEPGDLLYLPR 181
           F+ QL G K+W +    +V   LPR   P  +    +IG  +   T   L  GD+LY+PR
Sbjct: 497 FVCQLFGTKQWTIVSQPIVS--LPRLYEPLDSLHSSKIGNSMAGXTQFLLREGDILYIPR 554

Query: 182 GYIHQASTV--------TNEHSLHVTISV 202
           G+ H+A TV        T   SLH+T+++
Sbjct: 555 GFPHEACTVAESXGPDETTGFSLHLTLAI 583


>gi|290954654|ref|YP_003485836.1| hypothetical protein SCAB_0241 [Streptomyces scabiei 87.22]
 gi|260644180|emb|CBG67253.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
          Length = 390

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 92/199 (46%), Gaps = 12/199 (6%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           +EG SI   +      P+ +L  +++      V AN Y +    +GF  H+DD +  I+Q
Sbjct: 94  KEGASIALDSVDELHPPIARLCEAIERELHTRVQANLYASWSATEGFGIHWDDHDTVIVQ 153

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
           L+G K+W++Y         P Y       E    P+  + L PGD+LY+PRG  H  S  
Sbjct: 154 LDGAKRWRIY---GTTRPYPLYRDIEDPGEAPTEPVADLVLWPGDVLYVPRGVWHAVSAD 210

Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
               SLHVT  +   TA  DL+     + L     T  ++RR LP   L  A   +G  V
Sbjct: 211 QGTRSLHVTCGLQTHTA-TDLMTWVSEQLL-----THEDWRRDLP---LLAAPDVQGDAV 261

Query: 251 DIQADRLAMKENLKDLLAK 269
           D    RLA   +  +LLA+
Sbjct: 262 DGMRKRLAELLDDPELLAR 280


>gi|302527532|ref|ZP_07279874.1| predicted protein [Streptomyces sp. AA4]
 gi|302436427|gb|EFL08243.1| predicted protein [Streptomyces sp. AA4]
          Length = 385

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 97/208 (46%), Gaps = 14/208 (6%)

Query: 14  QKNYYNNLITSKKIDDILRDNVIEFKK-----NIDI----TLYEDGQRQTMDVEGRALPQ 64
           +K  + +L+   ++D +LR + +EF +     + D+    T  E    +   +  R LP 
Sbjct: 24  EKGRFASLLPWPELDRVLRQHRLEFPRLRLALDGDVVPAHTYTEMVSTRRNGLVPRVLPA 83

Query: 65  IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
              +H R G ++   +    +  + +L   L+      +  N Y    T  GF  H+DD 
Sbjct: 84  PFAEHLRAGATLVLDSVNELVGEVGELATKLEHDLRERIQVNLYAGWGTTHGFDVHWDDH 143

Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
           +AF++QL G+K+W+++    V    P        +   G P     LE GD+LYLPRG+ 
Sbjct: 144 DAFVIQLSGRKRWRLH---GVTRPSPLQRDVELPERPEGEPRDDFLLEDGDVLYLPRGHW 200

Query: 185 HQASTVTNEHSLHVTISVYQKTAWIDLL 212
           H  S V  E SLH+TI     T  +DL+
Sbjct: 201 HDVSAV-GEESLHLTIGFNAATG-VDLV 226


>gi|251832983|gb|ACT22494.1| cupin [Triticum aestivum]
          Length = 466

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 79/149 (53%), Gaps = 23/149 (15%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           R G SI     +   + +  + ++L +LFG   VGAN Y +PP +QG A HYDD    + 
Sbjct: 110 RNGFSIALRGMEFRSEKVAAIASALADLFGQPSVGANIYFSPPRSQGLARHYDDHCVLVW 169

Query: 130 QLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
           QL G+KKWK++      LPR+   Y P +S      +  G       L  GD++Y+PRG+
Sbjct: 170 QLLGRKKWKIWPNTKSILPRL---YEPFHSLDGLVDDRGGR---VEVLREGDIMYVPRGH 223

Query: 184 IHQASTVTNE----------HSLHVTISV 202
           +H+A T  +E          +SLH+T+++
Sbjct: 224 VHEACTDIDEGESEVNASANYSLHLTLAI 252


>gi|297735434|emb|CBI17874.3| unnamed protein product [Vitis vinifera]
          Length = 770

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 75/149 (50%), Gaps = 16/149 (10%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
           D Y +G +I     +   + +  +   L  LFG   VG N Y TPP +QG A HYDD   
Sbjct: 416 DAYNKGYTIALRGMEFRFESIAAIADGLASLFGQPSVGVNLYLTPPDSQGLARHYDDHCV 475

Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSP--NFSQEEIGTPILTVT---LEPGDLLYLPR 181
           F+ QL G K+W +    +V   LPR   P  +    +IG  +   T   L  GD+LY+PR
Sbjct: 476 FVCQLFGTKQWTIVSQPIVS--LPRLYEPLDSLHSSKIGNSMAGRTQFLLREGDILYIPR 533

Query: 182 GYIHQASTV--------TNEHSLHVTISV 202
           G+ H+A TV        T   SLH+T+++
Sbjct: 534 GFPHEACTVAESGGPDETTGFSLHLTLAI 562


>gi|170575231|ref|XP_001893154.1| chromosome 14 open reading frame 169 [Brugia malayi]
 gi|158600986|gb|EDP38013.1| chromosome 14 open reading frame 169, putative [Brugia malayi]
          Length = 248

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 51/167 (30%), Positives = 87/167 (52%), Gaps = 7/167 (4%)

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           +LY+PRGYIHQ     + HSL +T+SV +   + DLLE+ +P AL      ++  R+ LP
Sbjct: 1   MLYIPRGYIHQGFADKDVHSLQLTVSVCRNVTYADLLERVIPPALSNFAEQNVNIRKSLP 60

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDD-GVDEMGKQLMHDALP 293
             YL   G+       ++   + +   L  + +   +Y+ +L +  VD M ++ M  ALP
Sbjct: 61  ARYLDMTGVLECDYPLLKTGTIKLHRFLDSIFSNFCKYIKELSEPDVDMMAREFMRTALP 120

Query: 294 PVLSPEELQCSVFENGLRMSQTGEVYNATQI-TKDTKVRLVRANAVR 339
           PVL+ EE   +     L ++ +    +   I TK+T ++L+R +  R
Sbjct: 121 PVLTKEEKDMT----ALCVAGSSLYGDKEHIFTKNTPIKLLRRHGQR 163


>gi|91199785|emb|CAI78141.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
           23877]
 gi|126347489|emb|CAJ89199.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
           23877]
          Length = 311

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 61/198 (30%), Positives = 93/198 (46%), Gaps = 20/198 (10%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           +EG S+   +      P+ +L  +++      V AN Y +    +GF  H+DD +  I+Q
Sbjct: 15  KEGASLALDSVDELHPPIARLCEAIERELHTRVQANLYASWSATEGFGIHWDDHDTVIVQ 74

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
           L+G K+W++Y         P Y       E    P+  + L PGD+LY+PRG  H  S  
Sbjct: 75  LDGAKRWRIY---GTTRPYPLYRDIEDPGEAPTEPVADLVLWPGDVLYVPRGVWHAVSAD 131

Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
               SLHVT  +   TA  DL+     + L     T  ++RR LP+       LA     
Sbjct: 132 QGTRSLHVTCGLQTHTA-TDLMAWVSEQLL-----THEDWRRDLPL-------LAAS--- 175

Query: 251 DIQADRL-AMKENLKDLL 267
           D+QAD +  M++ L +LL
Sbjct: 176 DVQADAVDGMRKRLAELL 193


>gi|427707810|ref|YP_007050187.1| transcription factor jumonji jmjC domain-containing protein [Nostoc
           sp. PCC 7107]
 gi|427360315|gb|AFY43037.1| transcription factor jumonji jmjC domain-containing protein [Nostoc
           sp. PCC 7107]
          Length = 386

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 62/221 (28%), Positives = 103/221 (46%), Gaps = 22/221 (9%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQTMDV 57
           W K+ ++I  +      N+ T K ++D+L     ++  + F  N   +L E   RQ    
Sbjct: 21  WTKKAIHIPAEHDQKLQNIFTWKSLNDLLNYHKLKEPDLRFSMN-GKSLSETRNRQEWS- 78

Query: 58  EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
                     D  R+G ++        +  + +L A+L+   G     N Y +P   QGF
Sbjct: 79  ----------DRLRQGATLIINGVHHRVATVAELAANLRHDIGYETHVNLYCSPAKQQGF 128

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
             HYD  +  ILQ++G+K+W VY   +  +Y   +   +  Q+    P L   L+ GDLL
Sbjct: 129 DCHYDTHDVLILQIDGEKQWFVYQETV--QYPTAHIPSSKQQQPQEPPYLECVLKAGDLL 186

Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAM 216
           Y+PRG+ H A     + SLH+TI +  +T   W++ L K +
Sbjct: 187 YIPRGHWHYA-VACEQPSLHLTIGIECQTGLDWLNWLMKDL 226


>gi|440715594|ref|ZP_20896130.1| Cupin 4 family protein [Rhodopirellula baltica SWK14]
 gi|436439428|gb|ELP32878.1| Cupin 4 family protein [Rhodopirellula baltica SWK14]
          Length = 277

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 73/137 (53%), Gaps = 4/137 (2%)

Query: 73  GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
           GCS+        +  ++ +   +++  G    AN Y TP  +Q F  H+D  + F+ Q E
Sbjct: 99  GCSLLIQAMNRRLTCVRHICNQIEKASGLLCQANGYLTPRGSQTFNLHFDTHDVFVWQQE 158

Query: 133 GKKKWKVYLPRMVD-EYLPRYSSPNFSQEEI--GTPILTVTLEPGDLLYLPRGYIHQAST 189
           G K WKV+   + +  Y  R+      +E++    PI+  TL PGD+LY+PRGY+H    
Sbjct: 159 GSKAWKVFGSEVDNPSYHHRFQVDEKRREQLVEEGPIIDTTLTPGDMLYIPRGYLH-CGQ 217

Query: 190 VTNEHSLHVTISVYQKT 206
            T + S H+T+ V++K+
Sbjct: 218 ATADDSFHLTVGVFRKS 234


>gi|329940499|ref|ZP_08289780.1| hypothetical protein SGM_5272 [Streptomyces griseoaurantiacus M045]
 gi|329300560|gb|EGG44457.1| hypothetical protein SGM_5272 [Streptomyces griseoaurantiacus M045]
          Length = 395

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 102/206 (49%), Gaps = 24/206 (11%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNA-SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           R+G S+  L+    I P     A  L+   G  V AN Y +    +GF  H+DD +  ++
Sbjct: 96  RDGASL-VLDAVEKIHPAVGAAAEGLERFLGTSVQANVYASWTEREGFGRHWDDHDVVVV 154

Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
           QL G K+W+++   M  E  P +      +E  G P+  + L PGD+LYLPRG+ H  + 
Sbjct: 155 QLHGSKRWRLW--GMTREA-PTFRDVESPEEPEGDPVADLVLSPGDVLYLPRGWWHSVTA 211

Query: 190 VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKP 249
                SLH+T  +   T   DL+   + + L+A+    L  RR +P    R+  LA    
Sbjct: 212 DQGTESLHLTFGMVPHTG-ADLMLWVVDQ-LRAS----LALRRDIP----RFGSLAE--- 258

Query: 250 VDIQADRL-AMKENLKDLLA--KLVE 272
              Q+D + A++  + D++A  +LVE
Sbjct: 259 ---QSDFIDAVRREVADMMADPRLVE 281


>gi|284028215|ref|YP_003378146.1| Cupin 4 family protein [Kribbella flavida DSM 17836]
 gi|283807508|gb|ADB29347.1| Cupin 4 family protein [Kribbella flavida DSM 17836]
          Length = 425

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 11/156 (7%)

Query: 87  PLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD 146
           PL      L    G  V  N Y TP  +QGF+ HYD  + F+LQ+ G+K W V+ P  VD
Sbjct: 136 PLVDFATQLSSEAGHPVQINAYITPAESQGFSAHYDVHDVFVLQVAGEKHWTVHEPVHVD 195

Query: 147 EYL--PRYSSPNFSQEEIGT--PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
                P         E   T  P++   L PGD LY+PRG++H A  +    S H+T+ +
Sbjct: 196 PLRNQPWTDHSKAVAEAASTEQPVVDAVLRPGDALYVPRGFLHSAKALGGV-SAHLTVGL 254

Query: 203 YQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
           +  T ++ + E      L A  A   + R  LP+G+
Sbjct: 255 HTMTRYLLVQE------LAALAANVPDLRSALPLGF 284


>gi|387876598|ref|YP_006306902.1| hypothetical protein W7S_16070 [Mycobacterium sp. MOTT36Y]
 gi|443306372|ref|ZP_21036160.1| hypothetical protein W7U_11950 [Mycobacterium sp. H4Y]
 gi|386790056|gb|AFJ36175.1| hypothetical protein W7S_16070 [Mycobacterium sp. MOTT36Y]
 gi|442767936|gb|ELR85930.1| hypothetical protein W7U_11950 [Mycobacterium sp. H4Y]
          Length = 390

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 95/212 (44%), Gaps = 20/212 (9%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           W     +I R +  Y++ L+ +   D++L     E      + L + GQ +      R  
Sbjct: 24  WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78

Query: 63  PQIVWDHYREGC----SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
             +  D  REG     +I     + Y++ +  L+ SL+         N Y TPP A GFA
Sbjct: 79  GSLDPDRAREGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFA 138

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT----PILTVTLEPG 174
           PHYD  +  +LQ+ G K W       V    P       S++ +GT        V L  G
Sbjct: 139 PHYDPHDVLVLQIRGSKTWH------VSTGAPVPPHEIESRKGVGTDGPAAATDVCLRAG 192

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           D+LYLPRG +H A T  +E S+H+TI ++  T
Sbjct: 193 DVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 223


>gi|406031451|ref|YP_006730342.1| JmjC domain-containing protein [Mycobacterium indicus pranii MTCC
           9506]
 gi|405129998|gb|AFS15253.1| JmjC domain-containing protein [Mycobacterium indicus pranii MTCC
           9506]
          Length = 391

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 92/208 (44%), Gaps = 11/208 (5%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           W     +I R +  Y++ L+ +   D++L     E      + L + GQ +      R  
Sbjct: 24  WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78

Query: 63  PQIVWDHYREGC----SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
             +  D  REG     +I     + Y++ +  L+ SL+         N Y TPP A GFA
Sbjct: 79  GSLDPDRAREGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFA 138

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PHYD  +  +LQ+ G K W V     V  +    S      +        V L  GD+LY
Sbjct: 139 PHYDPHDVLVLQIRGSKTWHVSTGAPVPPH-EIESRKGVGTDGPAAAATDVCLRAGDVLY 197

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKT 206
           LPRG +H A T  +E S+H+TI ++  T
Sbjct: 198 LPRGQVHSAET-RSEPSVHLTIGLHAPT 224


>gi|379747918|ref|YP_005338739.1| hypothetical protein OCU_31990 [Mycobacterium intracellulare ATCC
           13950]
 gi|379755222|ref|YP_005343894.1| hypothetical protein OCO_32100 [Mycobacterium intracellulare
           MOTT-02]
 gi|378800282|gb|AFC44418.1| hypothetical protein OCU_31990 [Mycobacterium intracellulare ATCC
           13950]
 gi|378805438|gb|AFC49573.1| hypothetical protein OCO_32100 [Mycobacterium intracellulare
           MOTT-02]
          Length = 390

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 95/212 (44%), Gaps = 20/212 (9%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           W     +I R +  Y++ L+ +   D++L     E      + L + GQ +      R  
Sbjct: 24  WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78

Query: 63  PQIVWDHYREGC----SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
             +  D  REG     +I     + Y++ +  L+ SL+         N Y TPP A GFA
Sbjct: 79  GSLDPDRAREGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFA 138

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT----PILTVTLEPG 174
           PHYD  +  +LQ+ G K W       V    P       S++ +GT        V L  G
Sbjct: 139 PHYDPHDVLVLQIRGSKTWH------VSTGAPVPPHEIESRKGVGTDGPAAATDVCLRAG 192

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           D+LYLPRG +H A T  +E S+H+TI ++  T
Sbjct: 193 DVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 223


>gi|333989158|ref|YP_004521772.1| hypothetical protein JDM601_0518 [Mycobacterium sp. JDM601]
 gi|333485126|gb|AEF34518.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
          Length = 409

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 63/213 (29%), Positives = 102/213 (47%), Gaps = 26/213 (12%)

Query: 3   WEKEPLYISRKQKNYYNNL---ITSKKIDDILRD--NVIEFKKNIDITLYE-----DGQR 52
           W     +I R + +Y++ L    T+ ++ + LR   + +   +  D    E     DG  
Sbjct: 36  WGAALHHIGRDRADYFDGLSGLATAGQLLESLRTEPSAVRLVRGADHLAPEQYTLADG-- 93

Query: 53  QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
            T+D+ GR     +      G +I   N + Y++P+  L   ++         N Y TPP
Sbjct: 94  -TLDL-GR-----IHAELENGYTIICNNIEKYLRPMSTLTHGVEVELNFPTHVNAYVTPP 146

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ-EEIGTPILT-VT 170
           +A GF PHYD  +  +LQ++G K W  Y     DE +P +      + +  G P  T + 
Sbjct: 147 SAVGFLPHYDHHDVLVLQIQGHKTWYFY----GDEPIPPHLMQQLHEVDPAGLPEPTSLH 202

Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
           L  GD LY+PRG +H A T T+E SLH+T+ ++
Sbjct: 203 LSAGDTLYVPRGRVHSAET-TSESSLHLTVGIH 234


>gi|170701700|ref|ZP_02892640.1| Cupin 4 family protein [Burkholderia ambifaria IOP40-10]
 gi|170133393|gb|EDT01781.1| Cupin 4 family protein [Burkholderia ambifaria IOP40-10]
          Length = 304

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 69/147 (46%), Gaps = 15/147 (10%)

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           +P  ++    +GC+I     Q Y   +  L A ++ +  C   AN Y +  +A  F  H+
Sbjct: 86  MPDFLYRALDDGCTIIIDGCQDYFPSVLSLTAEIEHILKCQSWANLYISTQSATSFGCHF 145

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI------GTPILTVTLEPGD 175
           DD +   +QL GKK+W +        Y P Y SPN   +        G+P L   L  G 
Sbjct: 146 DDHDIISVQLSGKKRWHI--------YKPTYISPNRGDKSFYLDPPTGSPDLLENLPTGS 197

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISV 202
            LYLP GY H   TV+  HS+H+T  +
Sbjct: 198 SLYLPSGYWHNVETVS-PHSMHITFGL 223


>gi|359485116|ref|XP_003633217.1| PREDICTED: uncharacterized protein LOC100852762 [Vitis vinifera]
          Length = 564

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 75/149 (50%), Gaps = 16/149 (10%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
           D Y +G +I     +   + +  +   L  LFG   VG N Y TPP +QG A HYDD   
Sbjct: 210 DAYNKGYTIALRGMEFRFESIAAIADGLASLFGQPSVGVNLYLTPPDSQGLARHYDDHCV 269

Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSP--NFSQEEIGTPILTVT---LEPGDLLYLPR 181
           F+ QL G K+W +    +V   LPR   P  +    +IG  +   T   L  GD+LY+PR
Sbjct: 270 FVCQLFGTKQWTIVSQPIVS--LPRLYEPLDSLHSSKIGNSMAGRTQFLLREGDILYIPR 327

Query: 182 GYIHQASTV--------TNEHSLHVTISV 202
           G+ H+A TV        T   SLH+T+++
Sbjct: 328 GFPHEACTVAESGGPDETTGFSLHLTLAI 356


>gi|32455630|ref|NP_862112.1| hypothetical protein [Streptomyces violaceoruber]
 gi|28797256|gb|AAO50103.1| hypothetical protein [Streptomyces violaceoruber]
          Length = 390

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 76/263 (28%), Positives = 116/263 (44%), Gaps = 39/263 (14%)

Query: 20  NLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALP-----QIVWD------ 68
            L+T   ++ IL  + +E  +   + L  DG+  T+ V G   P       VW       
Sbjct: 34  GLMTFDDLNQILATHRLEPPR---MRLSRDGE--TLLVGGYTTPVATRRHTVWHRLHPAE 88

Query: 69  -HYR--EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
            H R  EG S+   +      P+ +L  +++      V AN Y +    +GF  H+DD +
Sbjct: 89  LHTRLTEGASLALDSVDELHPPIARLCEAIERELHTRVQANLYASWSATEGFGVHWDDHD 148

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
             I+QL+G K+W++Y         P Y       E    P+  + L PGD+LY+PRG  H
Sbjct: 149 TVIVQLDGAKRWRIY---GTTRPFPLYRDIADPGEAPTEPVADLVLWPGDVLYVPRGVWH 205

Query: 186 QASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
             S      SLHVT  +   TA  DL+     + L     T  ++RR LP+         
Sbjct: 206 AVSADQGVRSLHVTCGLQTHTA-TDLMAWVSEQLL-----THEDWRRDLPL--------- 250

Query: 246 RGKPVDIQADRL-AMKENLKDLL 267
              P D+QAD +  M++ L +LL
Sbjct: 251 LAAP-DVQADAVDGMRKRLAELL 272


>gi|256824138|ref|YP_003148098.1| Cupin superfamily protein [Kytococcus sedentarius DSM 20547]
 gi|256687531|gb|ACV05333.1| Cupin superfamily protein [Kytococcus sedentarius DSM 20547]
          Length = 414

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/253 (27%), Positives = 111/253 (43%), Gaps = 25/253 (9%)

Query: 3   WEKEPLYI--SRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQTM 55
           W   P ++  + +  + + +L +   +DD+L     R   I   ++   TL E+   +  
Sbjct: 34  WGTTPRHVPATDRAGDTFTDLFSLDAVDDLLTHRGLRTPFIRMAQD-GTTLPENRFTRGG 92

Query: 56  DVEGRALPQIVWDHYRE----GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
                A  Q+  D  R     G +I          P+ +    L +  G  V  N Y TP
Sbjct: 93  GTGAGASDQVDEDRVRSLFAGGATIVLQGLHRTWPPIAEFARELGDELGHPVQVNAYITP 152

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP-NFSQEEIG------T 164
           P  QGF+ HYD  + F+LQ+ G K W ++ P +     P    P +  +E +        
Sbjct: 153 PQNQGFSAHYDVHDVFVLQVHGTKHWTLHEPVVAH---PLRDQPWDTVREAVAHRAAQDA 209

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
           P++   L PGD+LYLPRG IH A+    E S H+TI V+  T   D +  A+  A+++  
Sbjct: 210 PLIDAVLAPGDVLYLPRGTIH-AAAAQGEISAHLTIGVHTWTP--DHVTGAVLDAVRSRL 266

Query: 225 ATDLEFRRGLPIG 237
                 R  LP+G
Sbjct: 267 RDQPTVRANLPLG 279


>gi|359401171|ref|ZP_09194142.1| hypothetical protein NSU_3828 [Novosphingobium pentaromativorans
           US6-1]
 gi|357597420|gb|EHJ59167.1| hypothetical protein NSU_3828 [Novosphingobium pentaromativorans
           US6-1]
          Length = 320

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 84/163 (51%), Gaps = 7/163 (4%)

Query: 59  GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           GR+   ++ D  R+G  +R  + Q   + ++     L++LF      N Y  P    GF 
Sbjct: 92  GRSQLAVLIDQLRQGAMVRVRDLQNASQSIEHTVRQLEQLFLAKCQVNLYLAPMGGAGFP 151

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI---LTVTLEPGD 175
            H+D  + F++Q  G K W ++        LP   +P   + E   P+    ++ ++ GD
Sbjct: 152 AHFDITDGFVVQCGGTKDWTIFEDYTDQTQLPSADTP--WEPERYKPLGSGKSLVMQLGD 209

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPK 218
           +LYLPRG +H A+  T+EHSLH+TIS+ +   + D+L+  + +
Sbjct: 210 VLYLPRGVMH-AARCTDEHSLHLTISL-ESLTYSDVLQSEVRR 250


>gi|365860786|ref|ZP_09400580.1| cupin 4 family protein [Streptomyces sp. W007]
 gi|364009777|gb|EHM30723.1| cupin 4 family protein [Streptomyces sp. W007]
          Length = 275

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 63/220 (28%), Positives = 101/220 (45%), Gaps = 18/220 (8%)

Query: 13  KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD---VEGRALPQIVWDH 69
           ++    + +IT + ID  L    +      +I +  DG+    D     GR  P  +   
Sbjct: 4   REAGELDRVITPESIDHYLETGCVPAD---EIAVVSDGRALHPDRHRTAGRTDPAKLRSL 60

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           Y  G +IR  N Q  +  L  ++  +Q   G     + + TPP  QG   H+D   A IL
Sbjct: 61  YEGGHTIRLGNLQRVVPFLADVSRGIQRETGFSNYLHAFVTPPGRQGLRHHWDQQMAVIL 120

Query: 130 QLEGKKKWKVYLP------RMVDEYLPRYSSPNF-SQEEIGTPILTVTLEPGDLLYLPRG 182
           Q+ G K+W+++ P      R   E   R   P+F  Q E   P L V L PG  L LPRG
Sbjct: 121 QISGIKRWQLWRPMFPSPMRAYQESF-RVWDPDFIPQWEAAGPDLEVDLRPGQSLLLPRG 179

Query: 183 YIHQASTV-TNEHSLHVTISVYQKTA-WI--DLLEKAMPK 218
           ++H    +     S+H+T ++ ++T  W+   L+++A+ K
Sbjct: 180 WVHNPHALDAQARSIHLTFAIRERTPLWVAEKLVDRAIEK 219


>gi|254775804|ref|ZP_05217320.1| hypothetical protein MaviaA2_14200 [Mycobacterium avium subsp.
           avium ATCC 25291]
          Length = 393

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 78/306 (25%), Positives = 126/306 (41%), Gaps = 38/306 (12%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---------IEFKKNIDITLYEDGQRQ 53
           W     +I R +  Y++ L+      D L + V         ++  ++ D   Y  G   
Sbjct: 24  WATRHHHIDRCRPGYFDGLLPGPSAVDGLLEQVRPDPAAVRLVKDGEDRDPAGYRRGD-G 82

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
           T++  G        D   +G ++     + Y++ +  L+ +++         N Y TPP 
Sbjct: 83  TLNAGG------ARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFLTRVNAYVTPPH 136

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT-VTLE 172
           + GF PHYD  +  +LQ+EG K W+V     V    P+           G    T V L 
Sbjct: 137 STGFVPHYDPHDVLVLQIEGCKTWRVSDGPAVP---PQQIQSRKGVGADGPASWTDVCLR 193

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
           PGD+LYLPRG +H A T  +E S+H+T+ ++  T         +  AL A    D     
Sbjct: 194 PGDVLYLPRGQVHSART-HSEPSVHLTVGLHAPTVL-----TLVTSALHALSLRDPRVHD 247

Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDA- 291
            LP  +L           D    R  + E ++D +  L +   + DG+  M + L+    
Sbjct: 248 RLPPRHL-----------DDARVRAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVRRGR 296

Query: 292 LPPVLS 297
            PPV S
Sbjct: 297 CPPVGS 302


>gi|413919791|gb|AFW59723.1| hypothetical protein ZEAMMB73_667794 [Zea mays]
          Length = 589

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/189 (29%), Positives = 94/189 (49%), Gaps = 25/189 (13%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFI 128
           ++ G SI     +   + +  + ++L +LFG   VGAN Y +P  +QG A HYDD    +
Sbjct: 228 FKNGYSIALRGMEFRSEKVAAIASALADLFGQPSVGANIYFSPAGSQGLARHYDDHCVLV 287

Query: 129 LQLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
            QL G KKW V+      LPR+ + + P   + + S   +        L  GD++Y+PRG
Sbjct: 288 WQLLGSKKWTVWPNPKPLLPRLYEPFDPLDGNLDESSGRVE------VLHEGDMMYIPRG 341

Query: 183 YIHQAST-----VTNEHSLHVTISV-------YQKTAWIDLLEKAMPKALQAAGATDLEF 230
           Y+H+A T       N+ SLH+T+++       ++  A I L   A  + L+ + + + E 
Sbjct: 342 YVHEARTDVSGSEVNDCSLHLTLAIEVEAPFEWEGFAHIALHCWAEKQQLRGSQSVEFEG 401

Query: 231 RRGLPIGYL 239
           + G  +  +
Sbjct: 402 KAGASLSAI 410


>gi|118464331|ref|YP_882543.1| cupin superfamily protein [Mycobacterium avium 104]
 gi|118165618|gb|ABK66515.1| cupin superfamily protein [Mycobacterium avium 104]
          Length = 361

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 129/309 (41%), Gaps = 44/309 (14%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---------IEFKKNIDITLYEDGQRQ 53
           W     +I R +  Y++ L+      D L + V         ++  ++ D   Y  G   
Sbjct: 24  WATRHHHIDRCRPGYFDGLLPGPSAVDGLLEQVRPDPAAVRLVKDGEDRDPAGYRRGD-G 82

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
           T++  G        D   +G ++     + Y++ +  L+ +++         N Y TPP 
Sbjct: 83  TLNAGG------ARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFPTRVNAYVTPPH 136

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG----TPILTV 169
           + GF PHYD  +  +LQ+EG K W+      V +  P       S++ +G         V
Sbjct: 137 STGFVPHYDPHDVLVLQIEGCKTWR------VSDEPPVPPQQIQSRKGVGADGPASRTDV 190

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
            L PGD+LYLPRG +H A T  +E S+H+T+ ++  T         +  AL A    D  
Sbjct: 191 CLRPGDVLYLPRGQVHSART-HSEPSVHLTVGLHAPTVLT-----LVTSALHALSLRDPR 244

Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
               LP  +L           D    R  + E ++D +  L +   + DG+  M + L+ 
Sbjct: 245 VHDRLPPRHL-----------DDAQVRAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVR 293

Query: 290 DA-LPPVLS 297
            +  PPV S
Sbjct: 294 RSRCPPVGS 302


>gi|256379276|ref|YP_003102936.1| cupin [Actinosynnema mirum DSM 43827]
 gi|255923579|gb|ACU39090.1| Cupin 4 family protein [Actinosynnema mirum DSM 43827]
          Length = 395

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/217 (27%), Positives = 100/217 (46%), Gaps = 32/217 (14%)

Query: 14  QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ-------------RQTMDVEGR 60
           ++  + +L+   +++ +LR + +EF +   + L  DG+             R+   V  R
Sbjct: 33  ERGRFADLLPWSEVNRVLRQHRLEFPR---LRLARDGEVVPAHVYSELVDTRRAGQVP-R 88

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            LP    +  R G ++   + Q  +  +  L   L+      V  N Y       GF  H
Sbjct: 89  VLPGKFAEQMRGGATLVLDSVQELVGAVGDLAVGLEHELRERVQVNAYAGWGVTHGFDVH 148

Query: 121 YDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           +DD +A ++Q+ G+K+W+++      P + D  LP    P       G P+    LE G+
Sbjct: 149 WDDHDAIVVQVSGRKRWRIHGFTRVAPMVRDVELP--PRPE------GEPLDEFVLEAGE 200

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           +LYLPRG  H  S V  E SLH+TI V + T  +DL+
Sbjct: 201 VLYLPRGCWHDVSAV-GEESLHLTIGVNRATG-VDLV 235


>gi|379762757|ref|YP_005349154.1| hypothetical protein OCQ_33210 [Mycobacterium intracellulare
           MOTT-64]
 gi|378810699|gb|AFC54833.1| hypothetical protein OCQ_33210 [Mycobacterium intracellulare
           MOTT-64]
          Length = 390

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 96/212 (45%), Gaps = 20/212 (9%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           W     +I R +  Y++ L+ +   D++L     E      + L + GQ +      R  
Sbjct: 24  WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78

Query: 63  PQIVWDHYREGC----SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
             +  D  REG     +I     + Y++ +  L+ SL+         N Y TPP A GF+
Sbjct: 79  GSLDPDRAREGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFS 138

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT----PILTVTLEPG 174
           PHYD  +  +LQ+ G K W V     V  +         S++ +GT        V L  G
Sbjct: 139 PHYDPHDVLVLQIRGSKTWHVSTGAPVPPHEIE------SRKGVGTDGPAAATDVCLRAG 192

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           D+LYLPRG +H A T  +E S+H+TI ++  T
Sbjct: 193 DVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 223


>gi|323449141|gb|EGB05031.1| hypothetical protein AURANDRAFT_66651 [Aureococcus anophagefferens]
          Length = 450

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)

Query: 75  SIRFLNPQTYIKPLQQLNASLQELF--GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
           S+  L      +   +   ++Q +   GC V  N Y TPP AQ    HYDD   F++QL 
Sbjct: 174 SVVVLGADARFRCAAEARRAVQRVLRVGCSV--NLYATPPGAQALDAHYDDHCVFVVQLR 231

Query: 133 GKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV-- 190
           G+K+W++Y P +    L   + P   +         VTL PGD+LY+PRG  H A+    
Sbjct: 232 GRKRWRLYGPALECPTLHEATLPPPPELLRRGADHAVTLAPGDVLYVPRGVYHAATACED 291

Query: 191 TNEHSLHVTISV 202
               S HVT+ V
Sbjct: 292 AGSDSAHVTVGV 303


>gi|429197490|ref|ZP_19189384.1| cupin domain protein, PF06172 family [Streptomyces ipomoeae 91-03]
 gi|428666857|gb|EKX65986.1| cupin domain protein, PF06172 family [Streptomyces ipomoeae 91-03]
          Length = 389

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/127 (36%), Positives = 69/127 (54%), Gaps = 5/127 (3%)

Query: 86  KPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV 145
           +P ++L  +L+ +F   V  N Y +    +GF  H+DD +  ++QLEG K+W++Y P  +
Sbjct: 110 QPAERLAQALEGVFRGRVQINVYASWTATEGFGTHWDDHDVIVVQLEGAKRWRLYGPTRL 169

Query: 146 DEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
           +       +P    +E   PI  V L  GD+LYLPRG+ H A   T   SLH+T  + Q 
Sbjct: 170 NPLHRDTEAPEPPADE---PIEEVVLRAGDMLYLPRGWWH-AVAATEGRSLHLTCGL-QT 224

Query: 206 TAWIDLL 212
           T   DLL
Sbjct: 225 TTGADLL 231


>gi|254822014|ref|ZP_05227015.1| hypothetical protein MintA_18922 [Mycobacterium intracellulare ATCC
           13950]
          Length = 390

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 96/212 (45%), Gaps = 20/212 (9%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           W     +I R +  Y++ L+ +   D++L     E      + L + GQ +      R  
Sbjct: 24  WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78

Query: 63  PQIVWDHYR----EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
             +  D  R    +G +I     + Y++ +  L+ SL+         N Y TPP A GFA
Sbjct: 79  GSLDPDRARGGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFA 138

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT----PILTVTLEPG 174
           PHYD  +  +LQ+ G K W V     V  +         S++ +GT        V L  G
Sbjct: 139 PHYDPHDVLVLQIRGSKTWHVSTGAPVPPHEIE------SRKGVGTDGPAAATDVCLRAG 192

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           D+LYLPRG +H A T  +E S+H+TI ++  T
Sbjct: 193 DVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 223


>gi|41407243|ref|NP_960079.1| hypothetical protein MAP1145c [Mycobacterium avium subsp.
           paratuberculosis K-10]
 gi|440776746|ref|ZP_20955580.1| hypothetical protein D522_07778 [Mycobacterium avium subsp.
           paratuberculosis S5]
 gi|41395594|gb|AAS03462.1| hypothetical protein MAP_1145c [Mycobacterium avium subsp.
           paratuberculosis K-10]
 gi|436723035|gb|ELP46903.1| hypothetical protein D522_07778 [Mycobacterium avium subsp.
           paratuberculosis S5]
          Length = 393

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 129/309 (41%), Gaps = 44/309 (14%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---------IEFKKNIDITLYEDGQRQ 53
           W     +I R +  Y++ L+      D L + V         ++  ++ D   Y  G   
Sbjct: 24  WATRHHHIDRCRPGYFDGLLPGPSAVDGLLEQVRPDPAAVRLVKDGEDRDPAGYRRGD-G 82

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
           T++  G        D   +G ++     + Y++ +  L+ +++         N Y TPP 
Sbjct: 83  TLNAGG------ARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFPTRVNAYVTPPH 136

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG----TPILTV 169
           + GF PHYD  +  +LQ+EG K W+      V +  P       S++ +G         V
Sbjct: 137 STGFVPHYDPHDVLVLQIEGCKTWR------VSDEPPVPPQQIQSRKGVGADGPASWTDV 190

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
            L PGD+LYLPRG +H A T  +E S+H+T+ ++  T         +  AL A    D  
Sbjct: 191 CLRPGDVLYLPRGQVHSART-HSEPSVHLTVGLHAPTVL-----TLVTSALHALSLRDPR 244

Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
               LP  +L  A +           R  + E ++D +  L +   + DG+  M + L+ 
Sbjct: 245 VHDRLPPRHLDDAQV-----------RAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVR 293

Query: 290 DA-LPPVLS 297
               PPV S
Sbjct: 294 RGRCPPVGS 302


>gi|302533161|ref|ZP_07285503.1| predicted protein [Streptomyces sp. C]
 gi|302442056|gb|EFL13872.1| predicted protein [Streptomyces sp. C]
          Length = 298

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 95/195 (48%), Gaps = 15/195 (7%)

Query: 59  GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           GR  P  +   Y  G ++R  N Q  +  +  ++  +Q+  G     + + TP   QG  
Sbjct: 73  GRTDPAKLRKLYESGHTVRLGNLQRVMPLMADISHGIQQETGYSNYIHAFVTPSGEQGLR 132

Query: 119 PHYDDIEAFILQLEGKKKWKVYLP------RMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
            H+D   A I+QLEG K+W+++ P      R  +E    +        E   P L + LE
Sbjct: 133 HHWDQQMAVIVQLEGVKRWQLWKPPVEAPMREFNESWRVWKQEYIPTWEAAGPDLEIHLE 192

Query: 173 PGDLLYLPRGYIHQASTV-TNEHSLHVTISVYQKTA-WIDLLEKAMPKALQAAGATDLEF 230
            G  + LPRG++H  + + TNE S+H+T ++ ++T  WI   E+ +  A++     D EF
Sbjct: 193 AGQSMLLPRGWVHNPAVLDTNERSVHLTFAIRERTPFWI--AERLVQDAIK-----DPEF 245

Query: 231 RRGLPIGYLRYAGLA 245
           RR +    L+  GLA
Sbjct: 246 RRVILPEQLKDEGLA 260


>gi|302535269|ref|ZP_07287611.1| cupin 4 [Streptomyces sp. C]
 gi|302444164|gb|EFL15980.1| cupin 4 [Streptomyces sp. C]
          Length = 403

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 83/165 (50%), Gaps = 17/165 (10%)

Query: 59  GRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           G+A+P      + +H R+G ++       +   ++    +LQ      V AN Y T   A
Sbjct: 82  GQAIPMADMRRIGNHLRDGATLVLDQSNVFDPTMEVACRALQWWSREHVQANVYLTTNDA 141

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP---NFSQEEIGTPILTVTL 171
            GF  H+DD +  I+QL G+K+W+V   R     +P Y      N   EEI   + T T+
Sbjct: 142 SGFDLHWDDHDVVIVQLAGEKQWEV---RGASRPVPMYRDADRNNTPSEEI---VWTGTM 195

Query: 172 EPGDLLYLPRGYIHQASTVTN--EHSLHVTISVYQKTA--WIDLL 212
           + GD++++PRGY H A+ V N   HS HVT    ++TA  W+  L
Sbjct: 196 KAGDVMHIPRGYWHAATRVGNGDGHSTHVTFGFAKRTAVSWLTWL 240


>gi|417746840|ref|ZP_12395324.1| Cupin superfamily protein [Mycobacterium avium subsp.
           paratuberculosis S397]
 gi|336461609|gb|EGO40474.1| Cupin superfamily protein [Mycobacterium avium subsp.
           paratuberculosis S397]
          Length = 393

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 130/309 (42%), Gaps = 44/309 (14%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---------IEFKKNIDITLYEDGQRQ 53
           W     +I R +  Y++ L+      D L + V         ++  ++ D   Y  G   
Sbjct: 24  WATRHHHIDRCRPGYFDGLLPEPSAVDGLLEQVRPDPAAVRLVKDGEDRDPAGYRRGD-G 82

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
           T++  G        D   +G ++     + Y++ +  L+ +++         N Y TPP 
Sbjct: 83  TLNAGG------ARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFPTRVNAYVTPPH 136

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG----TPILTV 169
           + GF PHYD  +  +LQ+EG K W+V      DE  P       S++ +G         V
Sbjct: 137 STGFVPHYDPHDVLVLQIEGCKTWRVS-----DEP-PVPPQQIQSRKGVGADGPASWTDV 190

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
            L PGD+LYLPRG +H A T  +E S+H+T+ ++  T         +  AL A    D  
Sbjct: 191 CLRPGDVLYLPRGQVHSART-HSEPSVHLTVGLHAPTVL-----TLVTSALHALSLRDPR 244

Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
               LP  +L  A +           R  + E ++D +  L +   + DG+  M + L+ 
Sbjct: 245 VHDRLPPRHLDDAQV-----------RAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVR 293

Query: 290 DA-LPPVLS 297
               PPV S
Sbjct: 294 RGRCPPVGS 302


>gi|313237822|emb|CBY12955.1| unnamed protein product [Oikopleura dioica]
          Length = 520

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 116/260 (44%), Gaps = 42/260 (16%)

Query: 76  IRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKK 135
           + F  PQ +   L ++ A L+E F   VGA     P  ++   P ++  ++FI QL GK 
Sbjct: 146 LDFNQPQRFCDDLWRVIADLEETFQTLVGAKARIQPAKSKALPPVFNSTDSFIQQLNGKC 205

Query: 136 KWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHS 195
             K++    +D   P  +   F ++E+      V L PGD+LY+PRG +  ++ + NE+ 
Sbjct: 206 SVKIWKASDID-LAPEETIEEFKEDEMRKADEIVELAPGDMLYIPRGAVF-STALENENE 263

Query: 196 ----LHVTISVYQKTA-WIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
               LH+T++     A W+++     P  L          +      +LR A   +    
Sbjct: 264 SSSYLHLTMNSQNTVAQWLNV---NFPTFLD---------KVAFQAKWLREAVSTK---- 307

Query: 251 DIQADRLAMKENLKDLLAKLVEYVDLDDGV---DEMGKQLMHDALPPVLSPE-------- 299
            I+ +  A KE LK LL KL E V+ D+ V   D M K      LPP   P+        
Sbjct: 308 -IRTNS-AKKEYLKHLLEKLAESVN-DENVPEEDPMRKDFFAHRLPPYKEPKTFGKMKSF 364

Query: 300 ELQCSVFENGLRMSQTGEVY 319
           +L CSV     R+S    +Y
Sbjct: 365 DLNCSV-----RLSTKKHIY 379


>gi|409991367|ref|ZP_11274635.1| cupin 4 [Arthrospira platensis str. Paraca]
 gi|291567777|dbj|BAI90049.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409937781|gb|EKN79177.1| cupin 4 [Arthrospira platensis str. Paraca]
          Length = 387

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 66/240 (27%), Positives = 116/240 (48%), Gaps = 26/240 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W ++ + I       + NL + +K++++L    +   K+ +I L + G+          
Sbjct: 20  YWTEKAVLIPGDNHQKFANLFSWQKLNNLLNYYPL---KHPEIRLAKAGE---------T 67

Query: 62  LPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           LP+I     +    +EG ++        I+ + ++ A L+   G     N+Y + P  QG
Sbjct: 68  LPEITNNEQIIKQCQEGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPGHQG 127

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
           FA HYD  E FILQ+ G+K W+V+    +   L    S  FS  +   P +   + PGDL
Sbjct: 128 FACHYDSHEVFILQISGRKHWRVFSDTFIYP-LSENRSSQFSPPDT-QPYIDAIINPGDL 185

Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           LY+PRG+ H A  + +E SLH+T+ +  +T  ID  +    +  Q       ++R+ LP+
Sbjct: 186 LYIPRGHWHYAIAI-DEPSLHLTLGIDCQTG-IDFSDWLTSQLQQHP-----QWRKNLPL 238


>gi|168047079|ref|XP_001775999.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672657|gb|EDQ59191.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 992

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 46/110 (41%), Positives = 57/110 (51%), Gaps = 4/110 (3%)

Query: 92  NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPR 151
           N    EL    VGAN Y TPP +QG   H+DD   F+ QL G+K W VY P    E LPR
Sbjct: 521 NGLAAELGQVTVGANLYLTPPGSQGLRVHFDDHCVFVCQLRGRKGWDVYPPL---EQLPR 577

Query: 152 -YSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTI 200
            YS    S E          L+  D LY+PRG++H+A T   E ++ V I
Sbjct: 578 LYSFKTLSTEVTKDYATHFDLQEWDTLYIPRGFLHEARTECPEQTIEVQI 627


>gi|223996773|ref|XP_002288060.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977176|gb|EED95503.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 488

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 55/183 (30%), Positives = 86/183 (46%), Gaps = 20/183 (10%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           Y +GCSI   +       + +L   LQ  F   V AN Y TPP       H DD + F++
Sbjct: 131 YLDGCSIVVNHADLQSASIAKLCNDLQSSFP-HVYANAYLTPPNGFAVNAHADDRDVFVI 189

Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL---------TVTLEPGDLLYLP 180
           Q+ G KKW VY  ++  EY         S  E+   +           + L PGD++Y+P
Sbjct: 190 QVLGTKKWNVY-KKVPVEYPFENEQVGKSGREVPPSVFEGGLCFGNNVLDLGPGDVMYMP 248

Query: 181 RGYIHQAST----VTNEH--SLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           RG++H+A+T    V + H  S H+TI++     W   L   +    +   +  +++R+ L
Sbjct: 249 RGFVHEATTEILDVEDGHSPSFHITIAIATHD-W--CLSVVLADCFRKTLSEVVDYRKAL 305

Query: 235 PIG 237
           PIG
Sbjct: 306 PIG 308


>gi|326521268|dbj|BAJ96837.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 777

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 54/184 (29%), Positives = 89/184 (48%), Gaps = 26/184 (14%)

Query: 36  IEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL 95
           I     ++   ++DG   T+  +  A  +      + G SI     +   + +  + ++L
Sbjct: 393 ISTGSGVEEHFFDDG---TVFPDAAAFVEKCKGAIKNGFSIALRGMEFRSEKIAAVASAL 449

Query: 96  QELFG-CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY------LPRMVDEY 148
            +LFG   VGAN Y +PP +QG A HYDD    + QL G KKWK++      LPR+   Y
Sbjct: 450 ADLFGQPSVGANIYFSPPRSQGLARHYDDHCVLVWQLLGCKKWKIWPNTRPILPRL---Y 506

Query: 149 LPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST----------VTNEHSLHV 198
            P +S      +  G       L  GD++Y+PRG++H+A T           +  +SLH+
Sbjct: 507 EPFHSLDGLVDDSGGR---VEVLHEGDIMYVPRGHVHEAHTDLDEGESEVNASTNYSLHL 563

Query: 199 TISV 202
           T+++
Sbjct: 564 TLAI 567


>gi|404446598|ref|ZP_11011704.1| hypothetical protein MVAC_25070 [Mycobacterium vaccae ATCC 25954]
 gi|403650187|gb|EJZ05456.1| hypothetical protein MVAC_25070 [Mycobacterium vaccae ATCC 25954]
          Length = 401

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/292 (24%), Positives = 125/292 (42%), Gaps = 44/292 (15%)

Query: 1   DHWEKEPLYISRKQ-KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD--- 56
           ++W ++PL    K     + +L++ + +D+++ +  +   +   I L ++GQ    D   
Sbjct: 16  EYWGRKPLLSPAKTLPRDFVDLLSPETVDELIAERGV---RAPFIRLAKEGQVLAKDSYL 72

Query: 57  --------VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
                   +  +  P  V      G ++          PL +         G  V AN Y
Sbjct: 73  GPAGFGAEITDQVDPAKVLTELGSGATVVLQGLHRLWPPLIEFVGQAVADIGHPVQANAY 132

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEI 162
            TPP  +GF  HYD  + F+LQ+ G+K+W V+ P  +    P  + P         +   
Sbjct: 133 ITPPGNRGFDFHYDVHDVFVLQVSGRKRWIVHEPVHLH---PLPNQPWTDHRARIEERVT 189

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
             P++   L  GD LYLPRG++H A  + +  S+H+TI V   T  +D+    + +  ++
Sbjct: 190 ADPVIDAELAAGDALYLPRGWVHAAQAL-DTTSIHLTIGVSATTG-VDVARAVLDELSRS 247

Query: 223 AGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV 274
                 EFR  LP+G           PVD      AM      ++A++V ++
Sbjct: 248 D-----EFRAPLPLGI---------DPVDGN----AMAATAAKVIAEIVTHL 281


>gi|423062943|ref|ZP_17051733.1| cupin 4 family protein [Arthrospira platensis C1]
 gi|406715522|gb|EKD10676.1| cupin 4 family protein [Arthrospira platensis C1]
          Length = 387

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 65/240 (27%), Positives = 116/240 (48%), Gaps = 26/240 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W ++ + I       + +L + +K++D+L    +   K+ +I L + G+          
Sbjct: 20  YWTEKSVLIPGANHQKFADLFSWQKLNDLLNYYPL---KHPEIRLAKTGE---------T 67

Query: 62  LPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           LP+I     +    ++G ++        I+ + ++ A L+   G     N+Y + P  QG
Sbjct: 68  LPEITNNEQIIKQCQQGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPGHQG 127

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
           FA HYD  E FILQ+ G+K W+V+    +   L    S  FS  +   P +   + PGDL
Sbjct: 128 FACHYDSHEVFILQISGRKHWRVFSDTFIYP-LSENRSSQFSPPDT-QPYIDAIINPGDL 185

Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
           LY+PRG+ H A  + +E SLH+T+ +  +T  ID  +    +  Q       ++R+ LP+
Sbjct: 186 LYIPRGHWHYAIAI-DEPSLHLTLGIDCQTG-IDFSDWLTSQLQQHP-----QWRKNLPL 238


>gi|224144250|ref|XP_002325235.1| predicted protein [Populus trichocarpa]
 gi|222866669|gb|EEF03800.1| predicted protein [Populus trichocarpa]
          Length = 764

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/178 (29%), Positives = 81/178 (45%), Gaps = 22/178 (12%)

Query: 38  FKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQE 97
           F+K +D   ++      +D+          + ++EG +I     +     +  +  +L  
Sbjct: 385 FQKKVDPCCFKKLAFNNVDIMKCE------EAFKEGYTIALRGVEFRFASIAAVADALAS 438

Query: 98  LFGC-FVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY------LPRMVDEYLP 150
           LFG   VGAN Y TPP +QG A H DD   F+ QL G K+W +Y      LPR+ D +  
Sbjct: 439 LFGQPSVGANIYLTPPNSQGLARHCDDHCVFVCQLFGTKQWTIYPRPNLQLPRLYDPFDR 498

Query: 151 RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTN------EHSLHVTISV 202
            +      ++          L  GD+LY+PRG+ H+A T  +        SLHVT  V
Sbjct: 499 EHC---LGEQNSLAECRKFLLREGDILYIPRGFPHEACTHDDGSSDLARFSLHVTFGV 553


>gi|323449702|gb|EGB05588.1| hypothetical protein AURANDRAFT_66360, partial [Aureococcus
           anophagefferens]
          Length = 714

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 63/132 (47%), Gaps = 6/132 (4%)

Query: 75  SIRFLNPQTYIKPLQQLNASLQELF--GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
           S+  L      +   +   ++Q +   GC V  N Y TPP AQ    HYDD   F++QL 
Sbjct: 262 SVVVLGADARFRCAAEARRAVQRVLRVGCSV--NLYATPPGAQALDAHYDDHCVFVVQLR 319

Query: 133 GKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV-- 190
           G+K+W++Y P +    L   + P   +         V L PGD+LY+PRG  H A+    
Sbjct: 320 GRKRWRLYGPALECPTLHEATLPPPPELLRRGADHAVALAPGDVLYVPRGVYHAATACED 379

Query: 191 TNEHSLHVTISV 202
               S HVT+ V
Sbjct: 380 AGSDSAHVTVGV 391


>gi|376007834|ref|ZP_09785019.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375323810|emb|CCE20772.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 387

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 91/183 (49%), Gaps = 14/183 (7%)

Query: 59  GRALPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
           G  LP+I     +    +EG ++        I+ + ++ A L+   G     N+Y + P 
Sbjct: 65  GETLPEITNNEQIIKQCQEGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPG 124

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
            QGFA HYD  E FILQ+ G+K W+V+    +   L    S  FS  +   P +   + P
Sbjct: 125 HQGFACHYDSHEVFILQISGRKHWRVFSDTFIYP-LSENRSSQFSPPDT-QPYIDAIINP 182

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           GDLLY+PRG+ H A  + +E SLH+T+ +  +T  ID  +    +  Q       ++R+ 
Sbjct: 183 GDLLYIPRGHWHYAIAI-DEPSLHLTLGIDCQTG-IDFSDWLTSQLQQHP-----QWRKN 235

Query: 234 LPI 236
           LP+
Sbjct: 236 LPL 238


>gi|209526744|ref|ZP_03275266.1| Cupin 4 family protein [Arthrospira maxima CS-328]
 gi|209492788|gb|EDZ93121.1| Cupin 4 family protein [Arthrospira maxima CS-328]
          Length = 387

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 91/183 (49%), Gaps = 14/183 (7%)

Query: 59  GRALPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
           G  LP+I     +    +EG ++        I+ + ++ A L+   G     N+Y + P 
Sbjct: 65  GETLPEITNNEQIIKQCQEGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPG 124

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
            QGFA HYD  E FILQ+ G+K W+V+    +   L    S  FS  +   P +   + P
Sbjct: 125 HQGFACHYDSHEVFILQISGRKHWRVFSDTFIYP-LSENRSSQFSPPDT-QPYIDAIINP 182

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           GDLLY+PRG+ H A  + +E SLH+T+ +  +T  ID  +    +  Q       ++R+ 
Sbjct: 183 GDLLYIPRGHWHYAIAI-DEPSLHLTLGIDCQTG-IDFSDWLTSQLQQHP-----QWRKN 235

Query: 234 LPI 236
           LP+
Sbjct: 236 LPL 238


>gi|242077498|ref|XP_002448685.1| hypothetical protein SORBIDRAFT_06g031470 [Sorghum bicolor]
 gi|241939868|gb|EES13013.1| hypothetical protein SORBIDRAFT_06g031470 [Sorghum bicolor]
          Length = 786

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/148 (33%), Positives = 81/148 (54%), Gaps = 21/148 (14%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFI 128
           ++ G SI     +   + +  + ++L +LFG   VGAN Y +P  +QG A HYDD    +
Sbjct: 428 FKNGYSIALRGMEFRSEKVAAIASALADLFGQPSVGANIYFSPARSQGLARHYDDHCVLV 487

Query: 129 LQLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
            QL G KKW ++      LPR+ D + P   + +   + IG   + V LE GD++Y+PRG
Sbjct: 488 WQLLGSKKWMIWPNPKPLLPRLYDPFDPLDGTLD---KNIGR--VEVLLE-GDMMYVPRG 541

Query: 183 YIHQAST--------VTNEHSLHVTISV 202
           Y+H+A T           ++SLH+T+++
Sbjct: 542 YVHEARTDVGGSEVNAYADYSLHLTLAI 569


>gi|170698073|ref|ZP_02889154.1| Cupin 4 family protein [Burkholderia ambifaria IOP40-10]
 gi|170137034|gb|EDT05281.1| Cupin 4 family protein [Burkholderia ambifaria IOP40-10]
          Length = 380

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 87/188 (46%), Gaps = 15/188 (7%)

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           V  R    I++D  R G ++ F   +  +  ++ +  S+    G    AN Y      + 
Sbjct: 74  VRRRLEKGILYDELRNGTTLVFNRMELTLYKVRLICKSISRFVGEHTVANGYIAFGEEES 133

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP-RYSSPNFSQEEI-GTPILTVTLEPG 174
           F  H+D    F +Q+ G+K+W VY P      LP ++      Q E    P + VT+E G
Sbjct: 134 FGKHWDTHSVFAVQMMGRKRWLVYEPTHA---LPLKHQRSTGKQSECPAEPYMDVTIETG 190

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDLEFRR 232
           D+LYLPRG+ H A  + NE + H+ + V++ T   +I  L   +     A       FR+
Sbjct: 191 DILYLPRGWWHTAIPL-NEETFHLAVGVHESTISDYIKYLADEIIGDFDA-------FRQ 242

Query: 233 GLPIGYLR 240
            +P+G  R
Sbjct: 243 TIPLGERR 250


>gi|167034171|ref|YP_001669402.1| cupin 4 family protein [Pseudomonas putida GB-1]
 gi|166860659|gb|ABY99066.1| Cupin 4 family protein [Pseudomonas putida GB-1]
          Length = 303

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/253 (26%), Positives = 111/253 (43%), Gaps = 25/253 (9%)

Query: 19  NNLITSKKIDDILRDNVIEFKKNIDITLYEDG----------QRQTMDVEGRALPQI--- 65
           + L+   +I+++L  +++++ +   I L  DG             T+ V G   P I   
Sbjct: 32  HQLVGWGEINNLLEKDILDYPR---IRLANDGIPSERGFKGFVTYTLTVTGETSPHINRY 88

Query: 66  -VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
            +    + G ++     Q + +  QQ  + L     C  GAN Y    +   F  H+D+ 
Sbjct: 89  NLLKRLQTGSTLIIDRCQAFFERAQQAASYLSTHLRCRSGANLYCAWSSTPSFGAHFDNH 148

Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-EIGTPILTVTLEPGDLLYLPRGY 183
           +   +Q+EG K+W+VY P       P  +  +F Q    G P+L  TL PG  +Y+P GY
Sbjct: 149 DVIAVQIEGVKRWEVYAP---TRPYPLLNDKSFDQTPPAGEPMLCHTLTPGQAIYVPAGY 205

Query: 184 IHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL-PIGYLRYA 242
            H   T T E S+H++  V +    IDL+   + + L+A+         G   IG  R +
Sbjct: 206 WHNVFTET-ERSMHISFPVVRPRK-IDLIRMVLER-LEASAELREPIAHGSDAIGNARLS 262

Query: 243 GLARGKPVDIQAD 255
           G+      +I  D
Sbjct: 263 GVLYACLANIDID 275


>gi|392544642|ref|ZP_10291779.1| cupin 4 [Pseudoalteromonas piscicida JCM 20779]
          Length = 370

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 91/180 (50%), Gaps = 9/180 (5%)

Query: 49  DGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
           +  ++   ++ ++L QI+ + + +G ++     Q     L++  + L + F C V AN Y
Sbjct: 69  NASKREYQIKPQSLEQIL-ELFADGYTVVLPRAQIRNTRLKKFVSDLSKYFHCRVDANIY 127

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV-----DEYLPRYSSPNFSQEEIG 163
            TPP +QGF  H D  + FILQ EG+K+W +   R V     DE   R    +F +++  
Sbjct: 128 VTPPKSQGFKVHLDMHDVFILQTEGEKEWILSKKRPVKITSNDEKF-RVFLESFDEKKDF 186

Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
                VTL  G  LYLPRG  H  ++ T++ S+H++++++    W+  L       L  +
Sbjct: 187 INTEKVTLTSGRKLYLPRGVYHCGNS-TDKVSIHISVTLF-PLQWVGFLSDQFSNDLSHS 244


>gi|392544644|ref|ZP_10291781.1| cupin 4 family protein [Pseudoalteromonas piscicida JCM 20779]
          Length = 390

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/132 (32%), Positives = 70/132 (53%), Gaps = 9/132 (6%)

Query: 88  LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKW------KVYL 141
           L ++   + +       AN Y TPP ++GF  HYD  +  +LQ EG KKW      KV+L
Sbjct: 105 LSEICGLISDQLDVRANANIYLTPPNSKGFTAHYDMHDVLLLQTEGAKKWVVSKCDKVHL 164

Query: 142 PRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
               D   P + + +  ++ + T   T+ L+ G +LY+PRG IH+   V +  S+H+T +
Sbjct: 165 TLTEDITRPEFYTIDPEKDLVNTK--TLELKRGQVLYMPRGTIHKGEAVGDRPSIHITFN 222

Query: 202 VYQKTAWIDLLE 213
           +   T W +L+E
Sbjct: 223 IL-PTTWYNLIE 233


>gi|294629460|ref|ZP_06708020.1| cupin superfamily protein [Streptomyces sp. e14]
 gi|292832793|gb|EFF91142.1| cupin superfamily protein [Streptomyces sp. e14]
          Length = 332

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/183 (27%), Positives = 81/183 (44%), Gaps = 25/183 (13%)

Query: 52  RQTMDVEGRALPQ------------IVWDHYR--------EGCSIRFLNPQTYIKP-LQQ 90
           R  + V+G ALPQ             VW   +         G +   L+    + P +  
Sbjct: 57  RLRLAVDGEALPQHAYSVPVTTRRNTVWHRLKPAELHRHLAGGATLVLDAIDELHPGVGH 116

Query: 91  LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
           L   L+      +  N Y +    +GF  H+DD +  +LQL+G K+W++Y         P
Sbjct: 117 LARGLERHLRTGIQVNAYASWTPQEGFGVHWDDHDVLVLQLDGAKRWRIY---GATRQAP 173

Query: 151 RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWID 210
            +   +  +   G P++ + L  GD+LYLPRG+ H  +     HSLH+T  + Q T   D
Sbjct: 174 LHRDTDVPEAPPGEPLVDMVLNAGDMLYLPRGWWHAVAASEGVHSLHLTCGM-QTTTGAD 232

Query: 211 LLE 213
           LL+
Sbjct: 233 LLQ 235


>gi|320164494|gb|EFW41393.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 869

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/121 (38%), Positives = 65/121 (53%), Gaps = 6/121 (4%)

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPR-YS 153
           L+E    F   N Y +P +AQGF  H+D +++ ILQ  G K W+V  P   D  LP   +
Sbjct: 329 LEESLSTFASVNMYLSPASAQGFKAHHDLMDSVILQTAGSKVWRVCNP---DFQLPSDVN 385

Query: 154 SPNFSQEEIGTPILT-VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           S NF   E+     T + L PGD+LYLPRG IH   T+ N  S+H+T  +     W  ++
Sbjct: 386 SGNFLLSELRYMNCTDILLTPGDVLYLPRGTIHSPYTIGN-MSMHLTFGLSAAFPWQAVM 444

Query: 213 E 213
           +
Sbjct: 445 D 445


>gi|428170986|gb|EKX39906.1| hypothetical protein GUITHDRAFT_113900 [Guillardia theta CCMP2712]
          Length = 337

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 50/142 (35%), Positives = 77/142 (54%), Gaps = 13/142 (9%)

Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQ 159
            N Y TP  +Q   PH DD +  ++Q+ G K+W VY     LP   DE + + S    ++
Sbjct: 194 CNMYLTPAGSQAVHPHSDDRDVLLIQIWGTKEWLVYGSPQTLP-FSDEQVGK-SGQRLAE 251

Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH-SLHVTISV-YQKTAWIDLLEKAMP 217
             IG   L+ TL+ GD LY+PRG++H+A     EH SLH+TI++  Q   W  ++  AM 
Sbjct: 252 GTIGPVSLSATLKTGDTLYIPRGFVHEAK--AQEHGSLHITIAIPTQDFTWSGVMMDAMR 309

Query: 218 KALQAAGATDLEFRRGLPIGYL 239
           + L+       ++RR +P+  L
Sbjct: 310 QKLREEKYN--KWRRCVPLTLL 329


>gi|453049325|gb|EME96928.1| hypothetical protein H340_29042 [Streptomyces mobaraensis NBRC
           13819 = DSM 40847]
          Length = 395

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 62/206 (30%), Positives = 99/206 (48%), Gaps = 24/206 (11%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNA-SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           R+G S+  L+    I P     A  L+   G  V AN Y +    +GF  H+DD +  ++
Sbjct: 96  RDGASL-VLDAVEKIHPSVGAAAEGLERFLGTSVQANVYASWTEREGFGRHWDDHDVVVV 154

Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
           QL G K+W+++   M  E  P +      +E    P+  + + PGD+LYLPRG+ H  + 
Sbjct: 155 QLHGSKRWRLW--GMTREG-PTFRDVESPEEPESDPLADLVMSPGDVLYLPRGWWHSVTA 211

Query: 190 VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKP 249
                SLH+T  +   T   DL+   + + L+A+    L  RR +P    R+  L     
Sbjct: 212 DQGTESLHLTFGMVPHTG-ADLMLWVVDQ-LRAS----LALRRDIP----RFGSLTE--- 258

Query: 250 VDIQADRL-AMKENLKDLLA--KLVE 272
              QAD   A++  + D++A  +LVE
Sbjct: 259 ---QADFTDALRREVADMMADPRLVE 281


>gi|167538155|ref|XP_001750743.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770767|gb|EDQ84448.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2266

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 75/146 (51%), Gaps = 17/146 (11%)

Query: 86  KPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR-- 143
           +P   L   LQ LF      + Y +PP AQ  APH D  + F++Q+ G+K+W +  P+  
Sbjct: 171 RPYSILQRQLQALFTTMTNVHAYISPPNAQALAPHTDPYDVFVVQVYGQKEWTLCTPQPP 230

Query: 144 ----MVDEYLPRY---------SSPNFSQEEIG-TPILTVTLEPGDLLYLPRGYIHQAST 189
               + D +  ++            N+ + ++       +TL PGDLLY+P+G IH A+T
Sbjct: 231 GGQNLSDAHKAQWQEIAKHNIQGCTNYQEWQLAKMDCQHITLLPGDLLYIPKGVIHYATT 290

Query: 190 VTNEHSLHVTISVYQKT-AWIDLLEK 214
            +   S H+T+S+ + + +W+ L  +
Sbjct: 291 GSVTGSTHLTVSIERLSHSWMMLFHR 316


>gi|159491391|ref|XP_001703650.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158270558|gb|EDO96399.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 191

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 66/120 (55%), Gaps = 8/120 (6%)

Query: 28  DDILRD-NVIEFKKNIDITLYEDGQRQTMDV---EGRALPQIVWDHYREGCSIRFLNPQT 83
           D+++RD   + F +++    Y DG+R+ +     +  AL Q+   H     +++   PQ 
Sbjct: 12  DEMIRDIGPLVFGRDVVAVRYIDGKREEIRASFADSEALKQL---HDDARATLQLHQPQR 68

Query: 84  YIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGK-KKWKVYLP 142
           ++    +L A+L+   GC VG N Y TPP  QG APH+DD+E F+ Q +G+  +W   LP
Sbjct: 69  FVNACWRLLAALESQLGCLVGCNAYVTPPGGQGLAPHHDDVELFVCQTQGEGGRWGGRLP 128


>gi|452954391|gb|EME59794.1| Cupin 2 barrel domain-containing protein [Amycolatopsis decaplanina
           DSM 44594]
          Length = 393

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 94/210 (44%), Gaps = 19/210 (9%)

Query: 14  QKNYYNNLITSKKIDDILRDNVIEFKK--------NIDITLYED---GQRQTMDVEGRAL 62
           +   + +L+    ++ ILR + +EF +         + +  Y D    +R       RA 
Sbjct: 34  RAGRFADLLPWPALNAILRRHRLEFPRLRLALDGTPVPMESYTDLVPTRRSGTVPRLRAA 93

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P    +H R G ++     Q    P+  L  SL+      V  N Y       GF  H+D
Sbjct: 94  PFT--EHLRGGATMVLDAIQELSDPIGDLARSLEHDLRESVQVNLYAGWGVTHGFDVHWD 151

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           D +AFI+Q+ G+K+W+++ P            P    E  G PI    LE GD+LY+PRG
Sbjct: 152 DHDAFIIQVAGRKRWRMHGPTRPAPLHRDVEQP----ERPGEPIDDFILEDGDVLYVPRG 207

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           + H  + V  E SLH+T+     T  IDL+
Sbjct: 208 HWHDVTAV-GEMSLHLTLGFSPATG-IDLV 235


>gi|19387273|gb|AAL87184.1|AF480497_12 unknown [Oryza sativa Japonica Group]
 gi|38345193|emb|CAE02886.2| OSJNBa0015K02.3 [Oryza sativa Japonica Group]
 gi|38346410|emb|CAE54575.1| OSJNBa0011F23.16 [Oryza sativa Japonica Group]
          Length = 774

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
           D ++ G S+     +   + +  + +++ +LFG   VGAN Y +PP AQG A HYDD   
Sbjct: 414 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 473

Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            + QL G KKW      K+ LPR+ + + P     +   +  G   + + LE GD++Y+P
Sbjct: 474 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 527

Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
           RG++H+A T           T + SLH+T+++
Sbjct: 528 RGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAI 559


>gi|116310856|emb|CAH67798.1| OSIGBa0132E09-OSIGBa0108L24.12 [Oryza sativa Indica Group]
          Length = 774

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
           D ++ G S+     +   + +  + +++ +LFG   VGAN Y +PP AQG A HYDD   
Sbjct: 414 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 473

Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            + QL G KKW      K+ LPR+ + + P     +   +  G   + + LE GD++Y+P
Sbjct: 474 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 527

Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
           RG++H+A T           T + SLH+T+++
Sbjct: 528 RGFVHEAHTDVDVGGFEVISTVDCSLHLTLAI 559


>gi|218195745|gb|EEC78172.1| hypothetical protein OsI_17757 [Oryza sativa Indica Group]
          Length = 769

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
           D ++ G S+     +   + +  + +++ +LFG   VGAN Y +PP AQG A HYDD   
Sbjct: 409 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 468

Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            + QL G KKW      K+ LPR+ + + P     +   +  G   + + LE GD++Y+P
Sbjct: 469 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 522

Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
           RG++H+A T           T + SLH+T+++
Sbjct: 523 RGFVHEAHTDVDVGGFEVISTVDCSLHLTLAI 554


>gi|222629703|gb|EEE61835.1| hypothetical protein OsJ_16482 [Oryza sativa Japonica Group]
          Length = 769

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
           D ++ G S+     +   + +  + +++ +LFG   VGAN Y +PP AQG A HYDD   
Sbjct: 409 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 468

Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            + QL G KKW      K+ LPR+ + + P     +   +  G   + + LE GD++Y+P
Sbjct: 469 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 522

Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
           RG++H+A T           T + SLH+T+++
Sbjct: 523 RGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAI 554


>gi|427429763|ref|ZP_18919750.1| Cupin 4 family protein [Caenispirillum salinarum AK4]
 gi|425880000|gb|EKV28701.1| Cupin 4 family protein [Caenispirillum salinarum AK4]
          Length = 359

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 56/180 (31%), Positives = 81/180 (45%), Gaps = 18/180 (10%)

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P  V +  R GCS+      T    L+ + A+L++       AN Y +  + Q F  H D
Sbjct: 92  PSKVMELVRAGCSLGLNRLDTLDAGLRAVGAALEDALAGKAQANLYYSRRSNQAFGVHSD 151

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ-------EEIGTPILTVTLEPGD 175
             + F L  EG+K+W +Y     D      + P F +          G    T+ ++PGD
Sbjct: 152 SHDVFALHCEGEKEWHLYEGVDPDP----INHPAFKKLPLEERRRRAGGVAETLVMKPGD 207

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           LLY+PRG+ H A   T++  +HV   V Q    +DLL   M +A       D  FRR LP
Sbjct: 208 LLYIPRGHYHDALCRTDD-CVHVAYGVTQVIG-MDLLTLLMNRA-----PADAAFRRALP 260


>gi|297723531|ref|NP_001174129.1| Os04g0659150 [Oryza sativa Japonica Group]
 gi|255675849|dbj|BAH92857.1| Os04g0659150 [Oryza sativa Japonica Group]
          Length = 430

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
           D ++ G S+     +   + +  + +++ +LFG   VGAN Y +PP AQG A HYDD   
Sbjct: 278 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 337

Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            + QL G KKW      K+ LPR+ + + P     +   +  G   + + LE GD++Y+P
Sbjct: 338 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 391

Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
           RG++H+A T           T + SLH+T+++
Sbjct: 392 RGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAI 423


>gi|398784687|ref|ZP_10547867.1| hypothetical protein SU9_15724 [Streptomyces auratus AGR0001]
 gi|396994993|gb|EJJ06016.1| hypothetical protein SU9_15724 [Streptomyces auratus AGR0001]
          Length = 390

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 5/125 (4%)

Query: 89  QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
           + L  +L+  F   V  N Y +    +GF  H+DD +  ++QLEG K+WK+Y    +D  
Sbjct: 113 EDLATALERHFRTDVQVNLYASWTPQEGFGVHWDDHDVVVVQLEGAKRWKLYGATRIDPL 172

Query: 149 LPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAW 208
                +P   +   G P+  + L+ GD+LYLPRG+ H A   T   SLH+T  +   T  
Sbjct: 173 RVDTEAP---EPPTGAPVAEIVLQAGDMLYLPRGWWH-AVAATEGRSLHLTCGLKPTTG- 227

Query: 209 IDLLE 213
            DLL 
Sbjct: 228 ADLLS 232


>gi|404254172|ref|ZP_10958140.1| cupin 4 family protein [Sphingomonas sp. PAMC 26621]
          Length = 400

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 73/275 (26%), Positives = 115/275 (41%), Gaps = 33/275 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITL---------YEDGQR 52
           +W+++P+++         +L+T ++  ++L         N+ + L         Y DG  
Sbjct: 24  YWDRQPVHLPSPAGGR-GSLLTWERFSELLGTGGHWTAANLKLILNSRPILPDFYLDGDP 82

Query: 53  QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
                  RA P  V      G S+   + +     L++    L   F     AN Y +  
Sbjct: 83  AAPIDTRRADPARVGVFLGMGASLVADDVERIAPELRRWTTLLGTEFAALTNANVYCSFK 142

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-------EIGTP 165
             Q FA HYD  + F LQ EG+K W++Y  R  +      S+P+  +E         G  
Sbjct: 143 GVQAFATHYDLHDVFALQCEGEKVWRIYRNRAENP----VSAPHGGEEVQRQIDAARGDL 198

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
           +  V + PGD+LYLPRG  H A       SLHV+ +V        L   A+ +AL+ A  
Sbjct: 199 MAEVRMRPGDVLYLPRGCYHDA-LAQEGASLHVSFAV------APLGAMALLRALEEAAL 251

Query: 226 TDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMK 260
            +  FR      YL  A    G+P++ Q   LA +
Sbjct: 252 REPAFR-----AYLADARSDDGQPLEAQFQALARR 281


>gi|291004158|ref|ZP_06562131.1| cupin 4 [Saccharopolyspora erythraea NRRL 2338]
          Length = 374

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/173 (31%), Positives = 83/173 (47%), Gaps = 15/173 (8%)

Query: 69  HYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFI 128
           + R+GC++   +   Y   ++    +LQ      V  N Y T   + GF  H+DD +  +
Sbjct: 58  YLRDGCTLVLDDLGAYDPTMEVTCRALQWELNERVQVNAYLTTGESGGFCLHWDDHDVIV 117

Query: 129 LQLEGKKKWKVYLPRMVDEYLP--RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
           +Q+ G K W+V   R      P  R S PN +  E    + T TL  GD++ +PRG+ HQ
Sbjct: 118 VQVAGAKTWEV---RPTSRAYPMFRDSDPNLAAPE--KTLWTGTLNAGDVMLIPRGHWHQ 172

Query: 187 ASTVT--NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           A+  T  + +SLH+T    ++T  ID L     +A + A      FRR L  G
Sbjct: 173 ATRATHGDGYSLHLTFGFTRRTG-IDWLTWVADQARKTA-----TFRRDLDAG 219


>gi|76155855|gb|AAX27127.2| SJCHGC04036 protein [Schistosoma japonicum]
          Length = 430

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 62/228 (27%), Positives = 97/228 (42%), Gaps = 63/228 (27%)

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
           L V L+ GDLLY+P+GY    S    EHSLH+ IS  QK  W + L + +   ++ A   
Sbjct: 19  LLVKLKSGDLLYIPKGYTCSISHSNVEHSLHLCISFNQKYNWGNYLSELLSIMMKYAVEN 78

Query: 227 DLEFRRGLPIGYLRYAGLARGK------PVDIQADRLAMKENLKDLLAKL---------- 270
           D EF + LP+   ++ G+   K       V+ +  RL   + LK L++K+          
Sbjct: 79  DPEFNKPLPLDSFKHLGIFFNKDSSSKSAVEEKEFRLQSVQLLKRLISKIEQRQTFSIQK 138

Query: 271 --------------------------------------VEYVD-LDDGVDEMGKQLMHDA 291
                                                 +  VD L   +DE+  ++M  +
Sbjct: 139 SKSLNHQLNEDKQCSLLSSAKEIDSTKKKQPNPYDNLFIHSVDPLHLAIDEIMLKIMKKS 198

Query: 292 LPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
             P LS +EL  S+F+ G            +Q+T ++K+RLVR+ AVR
Sbjct: 199 CSPELSEDELMRSIFKQGSD--------KLSQLTVNSKIRLVRSTAVR 238


>gi|375098356|ref|ZP_09744619.1| hypothetical protein SaccyDRAFT_0024 [Saccharomonospora cyanea
           NA-134]
 gi|374659088|gb|EHR58966.1| hypothetical protein SaccyDRAFT_0024 [Saccharomonospora cyanea
           NA-134]
          Length = 386

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 82/326 (25%), Positives = 136/326 (41%), Gaps = 49/326 (15%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDN--------VIEFKKNIDITLYEDGQR-Q 53
           W K  L +     + + +L++   +D++L ++        ++   + + ++ Y    R  
Sbjct: 20  WGKRAL-VHGPTVDGFEDLLSLDGVDELLTEHSLRSPAFRLVREGRTLPLSDYTRTARIG 78

Query: 54  TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
              V G A P  +      G ++       Y  PL      L+   G     N Y TPP 
Sbjct: 79  GTPVTGVADPARILAAVEGGATLVLQGLHRYWPPLAGFFRELELSLGHPCQVNAYVTPPG 138

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           AQG  PH D  + F+LQ  G K W+V            + +P     E       V LEP
Sbjct: 139 AQGLRPHSDRHDVFVLQAFGSKSWQV------------WPAPG-EGAEADRGAEDVELEP 185

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
           G  LYLP G  H A+   ++ S H+T+ ++    W DLLE A+ + L      D  +   
Sbjct: 186 GTALYLPTGTRH-AARAQHDVSGHLTVGIH-PARWRDLLEAAVTRVLD-----DPAWTSP 238

Query: 234 LPIGYLRYAGLARGKPVDIQADRLA--MKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDA 291
           LP+ Y R            Q D +   +  +LK L A+L+   D  D V+E  ++ +  +
Sbjct: 239 LPVRYHR------------QRDSVVNELGRHLKQLGAQLM-ATDPSDPVEERVERFL-TS 284

Query: 292 LPPVLS---PEELQCSVFENGLRMSQ 314
            PP+L     + L+ S  ++  R+ +
Sbjct: 285 RPPMLRGGLSDRLRLSTMDDSARVRR 310


>gi|392544643|ref|ZP_10291780.1| cupin 4 [Pseudoalteromonas piscicida JCM 20779]
          Length = 391

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/176 (27%), Positives = 81/176 (46%), Gaps = 13/176 (7%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           Y +G +I   +       + ++ + L   F C V AN Y TPP  +GF  HYD  +  IL
Sbjct: 85  YADGHTIVLNSVHKRWPVIAKVCSELSNQFFCRVNANIYVTPPGNKGFEAHYDTHDVIIL 144

Query: 130 QLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
           Q EG K+W      ++ +  + D+    +   +  ++   +  +++ L+ G  LY+PRG 
Sbjct: 145 QTEGSKEWFLSEHERLQITHINDDLRNEFYEIDAEKDTKNS--VSIQLDAGQCLYVPRGT 202

Query: 184 IHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
           IH A    +  S+H+T S++  T W  L    M   +      D E  + +P   L
Sbjct: 203 IHSAEANNSLPSIHITFSIFPVT-WASL----MRAVILDGSLKDTELMKTVPFEVL 253


>gi|318057978|ref|ZP_07976701.1| cupin 4 family protein [Streptomyces sp. SA3_actG]
 gi|318076149|ref|ZP_07983481.1| cupin 4 family protein [Streptomyces sp. SA3_actF]
          Length = 280

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 98/223 (43%), Gaps = 27/223 (12%)

Query: 69  HYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFI 128
           ++  G +IR  N Q  +  L  ++  +Q   G     + + TPP  QG   H+D   A I
Sbjct: 65  YHEAGYTIRLGNLQRVVPFLNSVSRQIQMETGYSNYLHAFLTPPGGQGLRHHWDQQMALI 124

Query: 129 LQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE------EIGTPILTVTLEPGDLLYLPRG 182
           +Q+ G K W+++ P+          S     E      E   P  T+ LEPG  L LPRG
Sbjct: 125 VQISGTKTWELWAPKFPSPMRNYAESTRVWDETWRHEWESHGPDHTIELEPGQTLLLPRG 184

Query: 183 YIHQA-STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
           ++H   S   +E S+H+T ++ ++T  + L EK   KA+      DL             
Sbjct: 185 WVHNPHSRNASEDSIHLTFAIRERTH-LWLAEKLTAKAIGRPEFRDLVI----------- 232

Query: 242 AGLARGKPVDIQADRLAMK-ENLKDLLAKLVEYVDLDDGVDEM 283
                  P ++  D L+ + E+ + +L + +  +DL    +E+
Sbjct: 233 -------PPELHGDDLSSRLEDTRRMLVRFLASLDLSAEAEEI 268


>gi|301018918|ref|ZP_07183146.1| cupin family protein [Escherichia coli MS 196-1]
 gi|433051100|ref|ZP_20238364.1| hypothetical protein WII_04991 [Escherichia coli KTE120]
 gi|299882449|gb|EFI90660.1| cupin family protein [Escherichia coli MS 196-1]
 gi|431557901|gb|ELI31586.1| hypothetical protein WII_04991 [Escherichia coli KTE120]
          Length = 297

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/204 (26%), Positives = 93/204 (45%), Gaps = 30/204 (14%)

Query: 18  YNNLITSKKIDDILRDNVIEFKK------NI-DITLYEDGQRQTMDVEGRALPQI----V 66
           Y NLI    ++ +L  +++ + +      N  +I  Y+   R T    G   P I    +
Sbjct: 30  YKNLINWNDLNKLLEKDILHYPRVRMANDNFPEIRGYKGFIRYTYSQTGDRTPHINRHQL 89

Query: 67  WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
           +   R+G ++     Q++ + + +    L +   C   AN Y        F  H+D+ + 
Sbjct: 90  YKCLRDGATLIVDRCQSFFESVDESRLWLSKELECTCSANLYAAFTATPSFGLHFDNHDV 149

Query: 127 FILQLEGKKKWKVYLPR----MVDE----YLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
             +Q+EG KKWKVY P     + DE    YLP  +SP++             + PG  +Y
Sbjct: 150 IAVQIEGIKKWKVYNPTYSYPLEDERSFDYLPPNTSPDYE----------FDITPGQAIY 199

Query: 179 LPRGYIHQASTVTNEHSLHVTISV 202
           LP GY H  +T + +HSLH++ +V
Sbjct: 200 LPAGYWHNVTTQS-KHSLHISFTV 222


>gi|432521851|ref|ZP_19759003.1| hypothetical protein A17U_04852 [Escherichia coli KTE228]
 gi|433161660|ref|ZP_20346455.1| hypothetical protein WKU_04739 [Escherichia coli KTE177]
 gi|431043503|gb|ELD53834.1| hypothetical protein A17U_04852 [Escherichia coli KTE228]
 gi|431670119|gb|ELJ36475.1| hypothetical protein WKU_04739 [Escherichia coli KTE177]
          Length = 297

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/204 (26%), Positives = 93/204 (45%), Gaps = 30/204 (14%)

Query: 18  YNNLITSKKIDDILRDNVIEFKK------NI-DITLYEDGQRQTMDVEGRALPQI----V 66
           Y NLI    ++ +L  +++ + +      N  +I  Y+   R T    G   P I    +
Sbjct: 30  YKNLINWNDLNKLLEKDILHYPRVRMANDNFPEIRGYKGFIRYTYSQTGDRTPHINRHQL 89

Query: 67  WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
           +   R+G ++     Q++ + + +    L +   C   AN Y        F  H+D+ + 
Sbjct: 90  YKCLRDGATLIVDRCQSFFESVDESRLWLSKELECTCSANLYAAFTATPSFGLHFDNHDV 149

Query: 127 FILQLEGKKKWKVYLPR----MVDE----YLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
             +Q+EG KKWKVY P     + DE    YLP  +SP++             + PG  +Y
Sbjct: 150 IAVQIEGIKKWKVYNPTYSYPLEDERSFDYLPPNTSPDYE----------FDITPGQAIY 199

Query: 179 LPRGYIHQASTVTNEHSLHVTISV 202
           LP GY H  +T + +HSLH++ +V
Sbjct: 200 LPAGYWHNVTTQS-KHSLHISFTV 222


>gi|410503126|ref|YP_006940181.1| Cupin 4 family protein [Escherichia coli]
 gi|353234053|emb|CCE21301.1| Cupin 4 family protein [Escherichia coli]
          Length = 297

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/204 (26%), Positives = 93/204 (45%), Gaps = 30/204 (14%)

Query: 18  YNNLITSKKIDDILRDNVIEFKK------NI-DITLYEDGQRQTMDVEGRALPQI----V 66
           Y NLI    ++ +L  +++ + +      N  +I  Y+   R T    G   P I    +
Sbjct: 30  YKNLINWNDLNKLLEKDILHYPRVRMANDNFPEIRGYKGFIRYTYSQTGDRTPHINRHQL 89

Query: 67  WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
           +   R+G ++     Q++ + + +    L +   C   AN Y        F  H+D+ + 
Sbjct: 90  YKCLRDGATLIVDRCQSFFESVDESRLWLSKELECTCSANLYAAFTATPSFGLHFDNHDV 149

Query: 127 FILQLEGKKKWKVYLPR----MVDE----YLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
             +Q+EG KKWKVY P     + DE    YLP  +SP++             + PG  +Y
Sbjct: 150 IAVQIEGIKKWKVYNPTYSYPLEDERSFDYLPPNTSPDYE----------FDITPGQAIY 199

Query: 179 LPRGYIHQASTVTNEHSLHVTISV 202
           LP GY H  +T + +HSLH++ +V
Sbjct: 200 LPAGYWHNVTTQS-KHSLHISFTV 222


>gi|145220941|ref|YP_001131619.1| cupin 4 family protein [Mycobacterium gilvum PYR-GCK]
 gi|315442089|ref|YP_004074968.1| hypothetical protein Mspyr1_04210 [Mycobacterium gilvum Spyr1]
 gi|145213427|gb|ABP42831.1| Cupin 4 family protein [Mycobacterium gilvum PYR-GCK]
 gi|315260392|gb|ADT97133.1| uncharacterized conserved protein [Mycobacterium gilvum Spyr1]
          Length = 402

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 74/149 (49%), Gaps = 12/149 (8%)

Query: 99  FGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF- 157
            G  V  N Y TPP  +GF  H+D  + F+LQ+ G K+W V+ P +    LP  S   F 
Sbjct: 123 IGHPVQVNAYITPPGNRGFDFHHDVHDVFVLQIAGSKRWIVHEP-VHAHPLPDQSWTQFR 181

Query: 158 ---SQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
               +     P++   L  GD LYLPRG++H A  + +  S+H+T+ V   T  +D+   
Sbjct: 182 DLIDERVTADPVIDTALTEGDALYLPRGWVHAAQAL-DTTSIHLTVGVSATTG-VDVARA 239

Query: 215 AMPKALQAAGATDLEFRRGLPIGYLRYAG 243
            +    + AG    EFR  LP+G    AG
Sbjct: 240 VLD---ELAGID--EFRAPLPLGIDPAAG 263


>gi|167520418|ref|XP_001744548.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776879|gb|EDQ90497.1| predicted protein [Monosiga brevicollis MX1]
          Length = 710

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/179 (31%), Positives = 89/179 (49%), Gaps = 21/179 (11%)

Query: 34  NVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNA 93
           NV   K+  D    E G++   D+  RAL        REG +++F +   +   +  L  
Sbjct: 392 NVKFIKRTFDQK-QESGRKTEKDL-ARAL--------REGFTLQFYSVNYWDPNIASLAL 441

Query: 94  SLQELFGCF--VGANTYQTPP-TAQGFAPHYDDIEAFILQLEGKKKWKVY------LPRM 144
            L E  G    V AN Y TP  T+    PH D   + ++QL G K+W+++      LP  
Sbjct: 442 ELSE-HGILLPVNANLYITPGGTSVSLVPHTDYQCSLMVQLAGVKRWRLWKMPEIMLPVS 500

Query: 145 VDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH-SLHVTISV 202
            +    R +    + EE+G P + V L+PGD+LY+PRG +H  ST   +H S+H+T+ +
Sbjct: 501 ANMIRGRDTDDLVASEELGEPYMDVLLQPGDILYVPRGVLHATSTPEGDHPSMHLTVGM 559


>gi|357414603|ref|YP_004926339.1| hypothetical protein Sfla_5425 [Streptomyces flavogriseus ATCC
           33331]
 gi|320011972|gb|ADW06822.1| hypothetical protein Sfla_5425 [Streptomyces flavogriseus ATCC
           33331]
          Length = 403

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 73/143 (51%), Gaps = 8/143 (5%)

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY-- 152
           +  + G     N + T   AQ FA H+D+++ F+LQ+ G K W+V+ P   +  LP Y  
Sbjct: 120 VMRMVGETASCNLFVTFDDAQAFASHFDEVDTFVLQVLGTKSWQVHGPSE-EHPLPEYGD 178

Query: 153 SSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           S P    E +   +   TLEPGD++++PRG+ H       E SLH+T +  ++T + D L
Sbjct: 179 SDPARCPEAV---LFERTLEPGDVIHVPRGWWHTVRG-GGESSLHLTFAFTRRTGY-DWL 233

Query: 213 EKAMPKALQAAGATDLEFRRGLP 235
                +AL +    +   R G P
Sbjct: 234 RWVAYRALDSTDVRESLARAGTP 256


>gi|451336154|ref|ZP_21906715.1| hypothetical protein C791_2960 [Amycolatopsis azurea DSM 43854]
 gi|449421346|gb|EMD26778.1| hypothetical protein C791_2960 [Amycolatopsis azurea DSM 43854]
          Length = 393

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 66/230 (28%), Positives = 103/230 (44%), Gaps = 26/230 (11%)

Query: 18  YNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALP------------QI 65
           + +L+    ++ ILR + +EF +   + L  DG    M+     +P              
Sbjct: 38  FADLLPWPALNAILRQHRLEFPR---LRLALDGTPVPMESYTELVPTRRSGTVPRLQAAP 94

Query: 66  VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
             +H R+G ++     Q     +  L  SL+      V  N Y       GF  H+DD +
Sbjct: 95  FTEHLRDGATMVLDAIQELSDLIGDLARSLEHDLRESVQVNLYAGWGVTHGFDVHWDDHD 154

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
           AFI+Q+ G+K+W+++ P            P    E  G PI    LE GD+LY+PRG+ H
Sbjct: 155 AFIIQVAGRKRWRMHGPTRPAPLHRDVEQP----ERPGEPIDDFILEDGDVLYVPRGHWH 210

Query: 186 QASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
             + V  E SLH+T+     T  IDL+     K L+++ A    FR+ LP
Sbjct: 211 DVTAV-GEMSLHLTLGFSPATG-IDLVSWLADK-LRSSTA----FRQDLP 253


>gi|433608094|ref|YP_007040463.1| Cupin 2 barrel domain-containing protein [Saccharothrix espanaensis
           DSM 44229]
 gi|407885947|emb|CCH33590.1| Cupin 2 barrel domain-containing protein [Saccharothrix espanaensis
           DSM 44229]
          Length = 392

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 81/299 (27%), Positives = 130/299 (43%), Gaps = 42/299 (14%)

Query: 14  QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ-------RQTMDVEG--RALPQ 64
           +   + +L+   +++ +LR + +E  +   + L + G+        +T D     R  P 
Sbjct: 33  RAGRFADLLPWSEVNRVLRQHPLEHPR---LRLAQGGEVLPAHTYTETSDTAPVPRLSPA 89

Query: 65  IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
              +  R G ++         +P+ +L A+L+      V  N Y       GF  H+DD 
Sbjct: 90  AFTEKMRGGATLVMDAVHELFEPVGELAAALEHELRERVRVNLYAGWGVTHGFDVHWDDH 149

Query: 125 EAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
           +  I+Q+ G+K+W+++      P   D  LP    P  S E    P+    LE GD+LY+
Sbjct: 150 DVIIVQVSGRKRWRLHGSTRPAPLRRDVELP----PRPSTE----PLDDFVLEDGDVLYV 201

Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
           PRG+ H  S V  E SLH+TI  + +   +DL+   +   L+A  A    FR  LP    
Sbjct: 202 PRGHWHDVSAV-GEESLHLTIG-FNRATGVDLVSW-LADQLRAEEA----FRTDLP---- 250

Query: 240 RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
           R+A  A     + +A    ++  L DLL   V    L D  D     L H  LP   +P
Sbjct: 251 RFASRA-----EQEAHAAVLRAKLTDLLDAGVVARFLAD-RDAQAPALPHVGLPWTATP 303


>gi|302761234|ref|XP_002964039.1| hypothetical protein SELMODRAFT_405651 [Selaginella moellendorffii]
 gi|300167768|gb|EFJ34372.1| hypothetical protein SELMODRAFT_405651 [Selaginella moellendorffii]
          Length = 550

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 40/89 (44%), Positives = 52/89 (58%), Gaps = 4/89 (4%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-E 161
           +GAN Y TP  AQG   HYDD   F++QL GKK+W +     V   LPR  +P    E E
Sbjct: 258 IGANLYLTPSNAQGLGRHYDDHCVFVVQLAGKKRWSINPAATV--VLPRLYAPRLVPEAE 315

Query: 162 IGTPILTVT-LEPGDLLYLPRGYIHQAST 189
           + +  L  T L  G +LY+PRG+ H+AST
Sbjct: 316 VQSRELADTLLTEGSVLYIPRGFAHRAST 344


>gi|167534643|ref|XP_001748997.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772677|gb|EDQ86327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2159

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 71/150 (47%), Gaps = 17/150 (11%)

Query: 91  LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD---- 146
           L   L  LF   V  + Y TPP AQ   PH D  +  ++Q  G+K+W +  P+       
Sbjct: 146 LERHLATLFATTVAVHGYVTPPGAQALKPHTDPYDVLVIQTYGEKQWTICTPQPAGAQNR 205

Query: 147 --------EYLPRYSSPNFSQEE----IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH 194
                   + + R+S    +Q E           +TL+ GD+LYLP+G IH A+T  +  
Sbjct: 206 TDAEKAQLQEIVRHSIQGCTQYEAWQLAKMECQAITLKAGDVLYLPKGIIHYATTTESMG 265

Query: 195 SLHVTISVYQKT-AWIDLLEKAMPKALQAA 223
           S H+T+S+ + T +W+ L  +A    L  A
Sbjct: 266 STHITLSLERLTHSWLALFGRACGLGLDRA 295


>gi|357392116|ref|YP_004906957.1| hypothetical protein KSE_52260 [Kitasatospora setae KM-6054]
 gi|311898593|dbj|BAJ31001.1| hypothetical protein KSE_52260 [Kitasatospora setae KM-6054]
          Length = 395

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 62/122 (50%), Gaps = 4/122 (3%)

Query: 91  LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
           L   L+      V  N Y +    +GF  H+DD +  ++QL+G K+WK+Y P        
Sbjct: 118 LAKHLENWLRTAVQVNLYGSWTGTEGFGTHWDDHDVIVVQLDGTKRWKLYGPTRTAPMNR 177

Query: 151 RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWID 210
             ++P    E+   PI  + L  GD+LYLPRG+ H  +    ++SLH+T  + Q T   D
Sbjct: 178 DVAAPEQPPEQ---PIADLVLSAGDVLYLPRGWWHAVAASEGQYSLHLTCGL-QTTTGAD 233

Query: 211 LL 212
           L+
Sbjct: 234 LI 235


>gi|414344306|ref|YP_006985827.1| hypothetical protein B932_3351 [Gluconobacter oxydans H24]
 gi|411029641|gb|AFW02896.1| hypothetical protein B932_3351 [Gluconobacter oxydans H24]
          Length = 394

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 99/217 (45%), Gaps = 27/217 (12%)

Query: 4   EKEPLYISRKQKNYYNNLITSKKIDDIL--RDNVIEFKK------NIDITLYEDGQRQTM 55
             + + I  +  NYYN L +    + +   RD    F +       +D++ Y D     +
Sbjct: 41  NNDSILIQGRGNNYYNCLFSFDDAEALTNRRDLFYPFVRLYDGTSRVDVSDYTD----NL 96

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQT-YIKPLQQLNAS-------LQELFGCFVGANT 107
              G  +P  V    RE    R  +  T Y   +++++ S       L+      V  + 
Sbjct: 97  SFSGINVPFYV---SRETLRNRIASGNTAYFTGIERISTSCLALKVQLEAALIATVDLSL 153

Query: 108 YQTPPTAQGFAP-HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           + TP    G  P HYD+++  ILQ+ G+K WK++     D  L    S N S+      +
Sbjct: 154 FVTPSNTVGNLPRHYDNVDGLILQISGRKTWKLWGIAATDPLLVEKRS-NVSEMATDDNV 212

Query: 167 -LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
             T TLE GDLL++P G IH+  T T++HS+H+T+ V
Sbjct: 213 HATYTLEAGDLLFVPAGMIHETDT-TDDHSIHITVGV 248


>gi|453328219|dbj|GAC89443.1| cupin 4 family protein [Gluconobacter thailandicus NBRC 3255]
          Length = 394

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 99/217 (45%), Gaps = 27/217 (12%)

Query: 4   EKEPLYISRKQKNYYNNLITSKKIDDIL--RDNVIEFKK------NIDITLYEDGQRQTM 55
             + + I  +  NYYN L +    + +   RD    F +       +D++ Y D     +
Sbjct: 41  NNDSILIQGRGNNYYNCLFSFDDAEALTNRRDLFYPFVRLYDGTSRVDVSDYTD----NL 96

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQT-YIKPLQQLNAS-------LQELFGCFVGANT 107
              G  +P  V    RE    R  +  T Y   +++++ S       L+      V  + 
Sbjct: 97  SFSGINVPFYV---SRETLRNRIASGNTAYFTGIERISTSCLALKVQLEAALIATVDLSL 153

Query: 108 YQTPPTAQGFAP-HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           + TP    G  P HYD+++  ILQ+ G+K WK++     D  L    S N S+      +
Sbjct: 154 FVTPSNTVGNLPRHYDNVDGLILQISGRKTWKLWGIAATDPLLVEKRS-NVSEMATDDNV 212

Query: 167 -LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
             T TLE GDLL++P G IH+  T T++HS+H+T+ V
Sbjct: 213 HATYTLEAGDLLFVPAGMIHETDT-TDDHSIHITVGV 248


>gi|302768931|ref|XP_002967885.1| hypothetical protein SELMODRAFT_408768 [Selaginella moellendorffii]
 gi|300164623|gb|EFJ31232.1| hypothetical protein SELMODRAFT_408768 [Selaginella moellendorffii]
          Length = 895

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/94 (42%), Positives = 49/94 (52%), Gaps = 14/94 (14%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWK------VYLPRMVDEYLPR-YSSP 155
           +GAN Y TP  AQG   HYDD   F++QL G K+W       V LPR+   Y PR     
Sbjct: 258 IGANLYLTPSNAQGLGRHYDDHCVFVVQLAGTKRWSINPAATVVLPRL---YAPRLVPKA 314

Query: 156 NFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
                E+   +LT     G  LY+PRG+ HQAST
Sbjct: 315 EVQSRELAESVLT----EGSALYIPRGFAHQAST 344


>gi|381163845|ref|ZP_09873075.1| Cupin superfamily protein [Saccharomonospora azurea NA-128]
 gi|379255750|gb|EHY89676.1| Cupin superfamily protein [Saccharomonospora azurea NA-128]
          Length = 388

 Score = 70.9 bits (172), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 57/223 (25%), Positives = 90/223 (40%), Gaps = 24/223 (10%)

Query: 9   YISRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQT----MDVEG 59
           ++     + + +L++   +D++L     R       ++       D  R        V G
Sbjct: 24  WVHGPTADGFEDLLSLDGVDELLTQHSLRSPAFRLVRDGRTVPLSDCTRTARIGGTSVSG 83

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
            A P  V     +G ++       Y  PL      L+   G     N Y TPP AQG  P
Sbjct: 84  VADPARVLAAVEDGATLVLQGLHRYWPPLVGFCRELELELGHPCQVNAYVTPPGAQGLRP 143

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT-PILTVTLEPGDLLY 178
           H D  + F+LQ  G K W+V            + +P     + G   +  V L  G  LY
Sbjct: 144 HTDSHDVFVLQAFGSKSWQV------------WPAPAADAGDTGADGVEDVELRAGMALY 191

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
           LP G  H A+   +  S H+T+ ++  T W DL+E A+ + L+
Sbjct: 192 LPTGTRH-AARAQHVVSGHLTVGIH-PTRWRDLIESAVARVLR 232


>gi|384563987|ref|ZP_10011091.1| Cupin superfamily protein [Saccharomonospora glauca K62]
 gi|384519841|gb|EIE97036.1| Cupin superfamily protein [Saccharomonospora glauca K62]
          Length = 383

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 70/165 (42%), Gaps = 20/165 (12%)

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
           V G A P  +     +G ++       Y  PL      L+   G     N Y TPP AQG
Sbjct: 82  VTGVADPARILAAIEDGATLVLQGLHRYWPPLVDFCRELELELGHPCQVNAYVTPPGAQG 141

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT-PILTVTLEPGD 175
             PH D  + F+LQ  G K W+V+                    E GT  +  V L PG 
Sbjct: 142 LRPHVDTHDVFVLQAFGTKSWQVW-----------------RTSEAGTDDVENVELRPGT 184

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKAL 220
            LYLP G  H A+   +  S H+T+ ++  T W DL+E  + + L
Sbjct: 185 ALYLPTGTRH-AARAQHAVSGHLTVGIH-PTRWRDLIEAVVARLL 227


>gi|429767708|ref|ZP_19299895.1| cupin family protein [Brevundimonas diminuta 470-4]
 gi|429189867|gb|EKY30683.1| cupin family protein [Brevundimonas diminuta 470-4]
          Length = 377

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 59/175 (33%), Positives = 83/175 (47%), Gaps = 26/175 (14%)

Query: 72  EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
           EG S+   + Q    P+  L++ L   F    GAN Y +    Q F  H+D  + F +Q 
Sbjct: 99  EGASLIAGDVQDLTAPVAALSSVLSRTFAASAGANIYCSFKGVQAFGAHFDLTDVFAVQT 158

Query: 132 EGKKKWKVY---------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           EG+K W++Y         +P  V + + R+    F Q + G     V + PGD+LYLPRG
Sbjct: 159 EGEKLWRIYENRADAPVEMPADVGD-MRRW----FDQTK-GRLQAEVMMRPGDVLYLPRG 212

Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWI--DLLEKAMPKALQAAGATDLEFRRGLP 235
             H A    +  SLHVT S+      I  +LLE+A   A+Q     D  FR  LP
Sbjct: 213 CYHDA-LAQDAASLHVTFSIAALHGRIIFNLLEQA---AMQ-----DPAFRAYLP 258


>gi|393722730|ref|ZP_10342657.1| cupin 4 family protein [Sphingomonas sp. PAMC 26605]
          Length = 395

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 96/220 (43%), Gaps = 19/220 (8%)

Query: 46  LYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKP-LQQLNASLQELFGCFVG 104
            Y DG   TM +E R +     D +    +   +N    I P ++     L E F     
Sbjct: 71  FYLDGD-VTMPIETRRVDPARVDVFLGMGASLVVNDIERIAPEIRDWTQMLGEHFAALTN 129

Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT 164
           AN Y +    Q FA H+D  + F +Q EG+K W++Y  R  D  L         Q +I  
Sbjct: 130 ANVYCSFMGVQAFATHFDLHDVFAVQCEGEKVWRIYQNR-ADSPLVAPHGGEEVQRQIDA 188

Query: 165 ---PILT-VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKAL 220
              P+L  V + PGD+LYLPRG  H A    +  SLHVT +V   T    L      + L
Sbjct: 189 ARGPLLAEVRMRPGDVLYLPRGCYHDA-LAQDGASLHVTFAVAPLTGMTVL------RVL 241

Query: 221 QAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMK 260
           ++    D  FR      YL  A +  G+P++ Q   LA +
Sbjct: 242 ESEAQRDPRFR-----AYLADARVDAGQPLEAQLRTLATR 276


>gi|392954072|ref|ZP_10319624.1| hypothetical protein WQQ_36960 [Hydrocarboniphaga effusa AP103]
 gi|391857971|gb|EIT68501.1| hypothetical protein WQQ_36960 [Hydrocarboniphaga effusa AP103]
          Length = 383

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 67/144 (46%), Gaps = 10/144 (6%)

Query: 67  WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
           W     G S+     +     +Q+L + +          N Y +   A  F  H+D  + 
Sbjct: 84  WAQLHSGASVVLNRLEHQSSTMQRLCSDVARFCSLPTSGNAYLSIGGAGTFGRHWDTHDV 143

Query: 127 FILQLEGKKKWKVYLPR----MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           F +QL G K+W+V+ P     + D+   R   P  +Q     P+L VTL+ GDLLYLPRG
Sbjct: 144 FAVQLIGSKRWRVFEPTYALPLADQTSQRSPLPCPAQ-----PLLDVTLQAGDLLYLPRG 198

Query: 183 YIHQASTVTNEHSLHVTISVYQKT 206
           + H+     +E S H++I  Y  T
Sbjct: 199 WWHEV-LPGDEPSFHLSIGSYAPT 221


>gi|163796488|ref|ZP_02190448.1| hypothetical protein BAL199_11012 [alpha proteobacterium BAL199]
 gi|159178338|gb|EDP62882.1| hypothetical protein BAL199_11012 [alpha proteobacterium BAL199]
          Length = 398

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 66/138 (47%), Gaps = 8/138 (5%)

Query: 88  LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDE 147
           +  +    + LFG     N Y +     G+A H+D ++ F +Q+ G+K W +Y  R  + 
Sbjct: 116 IAAVTRCFERLFGTNTSCNMYCSWQQVPGYASHFDTMDVFAIQITGEKTWNIYDGRFREA 175

Query: 148 YLPRYSSP-NFSQEE----IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
                  P +F+ E+     G     +T+ PGD+LYLPRG  H A   T+  SLH++  V
Sbjct: 176 TFTAGIRPSDFTVEQHNRMRGKVAQRITMRPGDILYLPRGVYHDA-LATDSASLHLSFGV 234

Query: 203 YQKTAW--IDLLEKAMPK 218
             +  +  + +L    PK
Sbjct: 235 SPQVGFTVVGMLASEAPK 252


>gi|357479435|ref|XP_003610003.1| Lysine-specific demethylase NO66 [Medicago truncatula]
 gi|355511058|gb|AES92200.1| Lysine-specific demethylase NO66 [Medicago truncatula]
          Length = 784

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/148 (33%), Positives = 77/148 (52%), Gaps = 11/148 (7%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFI 128
           Y+EG ++     +   + +  +  +L  +FG   VGAN Y TP  +QG A H+DD   F+
Sbjct: 437 YKEGYTVALRGLEFRYQRIASIADTLALMFGQPSVGANLYLTPANSQGLARHFDDHCVFV 496

Query: 129 LQLEGKKKWKVYLPRMVDEYLPR-YSSPNFSQEEIGTPI-LTVTLEPGDLLYLPRGYIHQ 186
            Q+ G KKW V+      + LPR Y S   S  +    +     L+ GD+LY+PRG+ H+
Sbjct: 497 CQIFGSKKWTVF--SRPGQLLPRLYDSLCGSDVDCTKAVRREFFLKEGDVLYIPRGFPHE 554

Query: 187 ASTV------TNEHSLHVTISVYQKTAW 208
           A T       +   SLH+T+S+  + A+
Sbjct: 555 AYTNYGVDDGSPGFSLHLTLSIEVEPAF 582


>gi|395493847|ref|ZP_10425426.1| cupin 4 family protein [Sphingomonas sp. PAMC 26617]
          Length = 400

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 72/275 (26%), Positives = 110/275 (40%), Gaps = 33/275 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITL---------YEDGQR 52
           +W+++P+++          L+   +  ++L         N+ + L         Y DG  
Sbjct: 24  YWDRQPVHLPSSAGGR-GGLLNWDRFSELLGTGGHWTAANLKLILNSRPILPDFYLDGDP 82

Query: 53  QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
                  RA P  V      G S+   + +     L++    L   F     AN Y +  
Sbjct: 83  AAPIDTRRADPARVDVFLGMGASLVADDVERIAPELRRWTTLLGTEFAALTNANVYCSFK 142

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-------EIGTP 165
             Q FA HYD  + F LQ EG+K W++Y  R  +      S+P+  +E         G  
Sbjct: 143 GVQAFATHYDLHDVFALQCEGEKVWRIYRNRAENP----VSAPHGGEEVQRQIDAARGDL 198

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
           +  V + PGD+LYLPRG  H A       SLHV+ +V        L   A+ +AL+ A  
Sbjct: 199 MAEVRMRPGDVLYLPRGCYHDALAQAGA-SLHVSFAV------APLGAMALLRALEEAAL 251

Query: 226 TDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMK 260
            +  FR      YL  A    G P+  Q   LA +
Sbjct: 252 REPAFR-----AYLADARSDAGGPLAAQLQALASR 281


>gi|389875092|ref|YP_006374448.1| Lysine-specific demethylase [Tistrella mobilis KA081020-065]
 gi|388532272|gb|AFK57466.1| Lysine-specific demethylase [Tistrella mobilis KA081020-065]
          Length = 389

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 113/267 (42%), Gaps = 36/267 (13%)

Query: 16  NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCS 75
           + +N+L+ S+++ D   D     ++  D+   E       D++   +P+++     EG S
Sbjct: 44  STFNSLLGSERLLD---DRTRMVRRGRDLP-REMWSYLGPDMKRVVVPELLQRFCHEGLS 99

Query: 76  IRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKK 135
           I     Q  +  +  L   L++     +G N Y +      F  H+D  +  +L L G+K
Sbjct: 100 IALNQIQHAVPGIAALVKMLEQALPARIGTNVYASFGRESAFRAHHDPHDVLVLHLHGRK 159

Query: 136 KW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
           +W        + P M             + + +G       LEPGD+LYLPRG IH+AS 
Sbjct: 160 RWFGFGHQSAHPPGM-----------PLTNDRLGPVHWEQVLEPGDILYLPRGEIHRAS- 207

Query: 190 VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKP 249
           V  E SLH+T ++       DLL       L   G    +  R +P+          G  
Sbjct: 208 VEGEASLHLTNALLWPRG-SDLLHWLAGTGLAGNGLAADDLDRDIPV---------YGTA 257

Query: 250 VDIQADRLAMKENLKDLLAKLVEYVDL 276
            D++  R A  E L+  L +L + +DL
Sbjct: 258 TDLE--RCA--EELRATLHRLADSIDL 280


>gi|167523972|ref|XP_001746322.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775084|gb|EDQ88709.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2283

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 70/150 (46%), Gaps = 17/150 (11%)

Query: 91  LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD---- 146
           L   L  LF   V  + Y TP  AQ   PH D  +  ++Q  G+K+W +  P+       
Sbjct: 146 LERQLTTLFATTVAVHGYVTPTGAQALKPHTDPYDVLVIQTYGEKQWTICTPQPAGAQNR 205

Query: 147 --------EYLPRYSSPNFSQEEI----GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH 194
                   + + R+S    +Q E           +TL+ GD+LYLP+G IH A+T  +  
Sbjct: 206 TDAEKAQLQEIVRHSIQGCTQYEAWQLAKMECQAITLKAGDVLYLPKGIIHYATTTESMG 265

Query: 195 SLHVTISVYQKT-AWIDLLEKAMPKALQAA 223
           S H+T+S+ + T +W+ L  +A    L  A
Sbjct: 266 STHITLSLERLTHSWLALFGRACGLGLDRA 295


>gi|325929645|gb|ADZ45327.1| cupin 4 [Streptomyces sp. NRRL 30471]
          Length = 316

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/197 (27%), Positives = 91/197 (46%), Gaps = 24/197 (12%)

Query: 21  LITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD--VEGRALPQIVWDHY-------- 70
           L+    ++DILR NV+ F     + + +D  R  ++     R +   V + Y        
Sbjct: 49  LLDIPALEDILRWNVLRFPY---VFMSKDQARLPLETYCSNRRVFSYVQEGYLDGGKTVA 105

Query: 71  --REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFI 128
              +G S+   + + +     +   S Q L      AN + TP   Q    H+DD   FI
Sbjct: 106 ALEKGASLYLNHLEDWHPGFGRYLQSWQSLPVAKTVANAFITPAGTQAVGRHWDDSHNFI 165

Query: 129 LQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT-LEPGDLLYLPRGYIHQA 187
           LQ  G K+W++Y P       P    P+ + E++ T +L  T LEPGD++Y P  + H+A
Sbjct: 166 LQTAGSKRWRIYAP-------PPNPHPDMTAEDLKTELLLDTVLEPGDVIYFPPAFPHEA 218

Query: 188 STVTNEHSLHVTISVYQ 204
              ++  S+H+T + Y+
Sbjct: 219 EG-SDGLSVHLTFAAYE 234


>gi|290960087|ref|YP_003491269.1| hypothetical protein SCAB_57031 [Streptomyces scabiei 87.22]
 gi|260649613|emb|CBG72728.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
          Length = 407

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 66/117 (56%), Gaps = 13/117 (11%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSP-NFSQ 159
           V  N Y T   A GF  H+DD +  I+QL G+KKW+V   R     +P Y  S P N + 
Sbjct: 139 VQVNVYLTTNDAAGFPLHWDDHDVVIVQLAGEKKWEV---RTASRNVPMYRDSDPNNTAS 195

Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTV--TNEHSLHVTISVYQKT--AWIDLL 212
           +EI   I +  +  GD++++PRG+ HQA+     + +SLHVT  + ++T  +W+  L
Sbjct: 196 DEI---IWSGVMRAGDVMHIPRGHWHQATRTGSGSGNSLHVTFGITKRTGASWLAWL 249


>gi|453054469|gb|EMF01921.1| hypothetical protein H340_03854 [Streptomyces mobaraensis NBRC
           13819 = DSM 40847]
          Length = 398

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 94/189 (49%), Gaps = 24/189 (12%)

Query: 38  FKKNIDI---TLYEDGQRQTMDVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQ 90
           F+K +++     Y D    ++   G+++P +    + +  REG ++       +   ++ 
Sbjct: 62  FQKGVEVHPAVYYTD----SVSPRGQSIPMVNMRSLGNLLREGVTLILDQANVFDPTMEV 117

Query: 91  LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
              +LQ      V  N Y T   A GF  H+DD +  I+QL G+K+W+V   R     +P
Sbjct: 118 ACRALQWWSHERVQVNAYLTTNDAAGFPLHWDDHDVLIVQLAGEKEWEV---RAASRKVP 174

Query: 151 RY--SSPNFS-QEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQK 205
            Y  S PN +  +EI   I +  +  GD++++PRG+ HQAS   +    SLHVT  + ++
Sbjct: 175 MYRDSDPNNTPSDEI---IWSGVMRTGDVMHIPRGHWHQASRTGSGSGKSLHVTFGITKR 231

Query: 206 T--AWIDLL 212
           T  +W+  L
Sbjct: 232 TGASWLTWL 240


>gi|354614310|ref|ZP_09032183.1| Cupin 4 family protein [Saccharomonospora paurometabolica YIM
           90007]
 gi|353221342|gb|EHB85707.1| Cupin 4 family protein [Saccharomonospora paurometabolica YIM
           90007]
          Length = 308

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 67/132 (50%), Gaps = 6/132 (4%)

Query: 83  TYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLP 142
           T+   L+    +LQ      V  NTY T   A GF  H+DD +  I+QL G+K W+V  P
Sbjct: 8   TFDPTLEVACQALQWWSHELVQVNTYLTTNDAAGFDLHWDDHDVIIVQLGGEKGWEVRGP 67

Query: 143 RMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVT--NEHSLHVTI 200
                    +   +   +EI   + + TL  GD++++PRG+ HQA+ V   + HSLH T 
Sbjct: 68  SRPAPMYRDHERNDTPSDEI---VWSGTLCAGDVMHIPRGHWHQATRVARGDGHSLHATF 124

Query: 201 SVYQKTAWIDLL 212
              ++T  +D L
Sbjct: 125 GFVKRTG-VDWL 135


>gi|86739110|ref|YP_479510.1| cupin [Frankia sp. CcI3]
 gi|86565972|gb|ABD09781.1| Cupin 4 [Frankia sp. CcI3]
          Length = 406

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 72/147 (48%), Gaps = 9/147 (6%)

Query: 71  REGCSIRFLNPQTYIKP-LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           + GC++  L+   +  P L+    + Q      V AN Y T   A GF+ H+DD +  +L
Sbjct: 100 QSGCTL-VLDAVNHFDPTLEVACRAFQWWLRAPVQANVYLTTGDAAGFSLHWDDHDVIVL 158

Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
           QL G K+W+V  P       P Y     + E     + + T+  GD+LY+PRG+ H+AS 
Sbjct: 159 QLAGDKEWEVRGP---SRRAPMYRDAAPNTEPPKDIVWSGTVNTGDVLYIPRGHWHRASR 215

Query: 190 VT--NEHSLHVTISVYQKTA--WIDLL 212
            +  +  SLH T    ++T   W+  L
Sbjct: 216 TSRGDGFSLHATFGFTRRTGVDWLAWL 242


>gi|221201735|ref|ZP_03574773.1| cupin superfamily protein [Burkholderia multivorans CGD2M]
 gi|221207190|ref|ZP_03580200.1| cupin superfamily protein [Burkholderia multivorans CGD2]
 gi|421473482|ref|ZP_15921589.1| cupin domain protein, PF06172 family [Burkholderia multivorans ATCC
           BAA-247]
 gi|221172778|gb|EEE05215.1| cupin superfamily protein [Burkholderia multivorans CGD2]
 gi|221178551|gb|EEE10960.1| cupin superfamily protein [Burkholderia multivorans CGD2M]
 gi|400220866|gb|EJO51370.1| cupin domain protein, PF06172 family [Burkholderia multivorans ATCC
           BAA-247]
          Length = 384

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 21/174 (12%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
           R +  +V+D+ R+G ++   N       + QL+  L E  G  V ++ Y        F  
Sbjct: 78  RLIKPVVYDYLRKGATL-IANKIATEPKVNQLSRQLIEFTGRQVVSSAYLAFGERDSFRC 136

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRM--------VDEYLPRYSSPNFSQEEIGTPILTVTL 171
           H+D  + F +QL G+K+W +Y P +          +Y   Y  P+       TP + V L
Sbjct: 137 HWDTRDVFAIQLIGRKRWVLYEPSLEAPLYMQQSKDYEGLYPCPD-------TPYMDVML 189

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKA---MPKALQA 222
           E GDLLYLPRG+ H    V  E + H+    +     ID L  A   MP  L A
Sbjct: 190 EAGDLLYLPRGWWHNPLPV-GEATFHLAFGTFPAYV-IDYLSWAINRMPHLLDA 241


>gi|348171193|ref|ZP_08878087.1| cupin 4 [Saccharopolyspora spinosa NRRL 18395]
          Length = 366

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 70/156 (44%), Gaps = 9/156 (5%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           REGC++      ++   ++    +LQ      V  NTY T   A GF  H+DD +  ++Q
Sbjct: 69  REGCTLVLDTLDSFDPAMEIACQALQWWSRELVQVNTYLTTSNAAGFNLHWDDHDVVVVQ 128

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
           L G+K W+V   R      P Y     +       +   T+  GD+ ++PRGY H+A+  
Sbjct: 129 LGGEKSWEV---RGTSRVAPMYRDVERADAPSEEVLWAGTMRAGDVFHIPRGYWHRATRT 185

Query: 191 TNE-----HSLHVTISVYQKTAWIDLLEKAMPKALQ 221
                   +SLH T    ++T  +D L     ++ Q
Sbjct: 186 DRTDDRAGYSLHATFGFVKRTG-VDWLSWIADRSRQ 220


>gi|221213317|ref|ZP_03586292.1| cupin superfamily protein [Burkholderia multivorans CGD1]
 gi|221166769|gb|EED99240.1| cupin superfamily protein [Burkholderia multivorans CGD1]
          Length = 384

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 21/174 (12%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
           R +  +V+D+ R+G ++   N       + QL+  L E  G  V ++ Y        F  
Sbjct: 78  RLIKPVVYDYLRKGATL-IANKIATEPKVNQLSRQLIEFTGRQVVSSAYLAFGERDSFRC 136

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRM--------VDEYLPRYSSPNFSQEEIGTPILTVTL 171
           H+D  + F +QL G+K+W +Y P +          +Y   Y  P+       TP + V L
Sbjct: 137 HWDTRDVFAIQLIGRKRWVLYEPSLEAPLYMQQSKDYEGLYPCPD-------TPYMDVML 189

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKA---MPKALQA 222
           E GDLLYLPRG+ H    V  E + H+    +     ID L  A   MP  L A
Sbjct: 190 EAGDLLYLPRGWWHNPLPV-GEATFHLAFGTFPAYV-IDYLSWAINRMPHLLDA 241


>gi|161526044|ref|YP_001581056.1| cupin 4 family protein [Burkholderia multivorans ATCC 17616]
 gi|189349241|ref|YP_001944869.1| hypothetical protein BMULJ_00360 [Burkholderia multivorans ATCC
           17616]
 gi|160343473|gb|ABX16559.1| Cupin 4 family protein [Burkholderia multivorans ATCC 17616]
 gi|189333263|dbj|BAG42333.1| hypothetical protein BMULJ_00360 [Burkholderia multivorans ATCC
           17616]
          Length = 384

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 21/174 (12%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
           R +  +V+D+ R+G ++   N       + QL+  L E  G  V ++ Y        F  
Sbjct: 78  RLIKPVVYDYLRKGATL-IANKIATEPKVNQLSRQLIEFTGRQVVSSAYLAFGERDSFRC 136

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRM--------VDEYLPRYSSPNFSQEEIGTPILTVTL 171
           H+D  + F +QL G+K+W +Y P +          +Y   Y  P+       TP + V L
Sbjct: 137 HWDTRDVFAIQLIGRKRWVLYEPSLEAPLYMQQSKDYEGLYPCPD-------TPYMDVML 189

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKA---MPKALQA 222
           E GDLLYLPRG+ H    V  E + H+    +     ID L  A   MP  L A
Sbjct: 190 EAGDLLYLPRGWWHNPLPV-GEATFHLAFGTFPAYV-IDYLSWAINRMPHLLDA 241


>gi|357407532|ref|YP_004919455.1| hypothetical protein SCAT_p0163 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|337762481|emb|CCB71189.1| putative Unnamed protein product [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
          Length = 395

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 62/115 (53%), Gaps = 9/115 (7%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL-PRYSSPNFSQEE 161
            GA+ + TP +AQGFAPH D+  + +L +EG+K W      MV+E   P        +E 
Sbjct: 128 AGAHVFATPRSAQGFAPHRDEHGSLVLHMEGRKSW------MVEERAGPPRPGAVPPREW 181

Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
                   +L PGD+LY+P+   H A T   E SLHVT+ ++    W D+L++ +
Sbjct: 182 ARLRFREFSLGPGDVLYIPQHCGHAART-GAERSLHVTLRLF-PLRWRDVLDEVV 234


>gi|302535492|ref|ZP_07287834.1| LOW QUALITY PROTEIN: cupin 4 family protein [Streptomyces sp. C]
 gi|302444387|gb|EFL16203.1| LOW QUALITY PROTEIN: cupin 4 family protein [Streptomyces sp. C]
          Length = 300

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/207 (25%), Positives = 88/207 (42%), Gaps = 12/207 (5%)

Query: 3   WEKEPLYISRKQKNYYNNL--ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           W++EP   +        +L  + +     +LR   +E  ++  + +  D    +  V G 
Sbjct: 45  WQREPAVFTPPSSAPPFDLDELDAAFDSGLLRTPYLEMVRSNKVVVPPDAYTASRVVNGI 104

Query: 61  ALPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
                     V    R G ++       + +P  +L A L E     V A  + TP   Q
Sbjct: 105 TYSGFADRAKVVAQLRAGATLLLRCVDQWHRPTGELVARLSEELDRRVEAFFFVTPVGGQ 164

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G A H DD + F+LQ+ G K W V+    V ++ P         +E    +L   LEPG 
Sbjct: 165 GLAVHRDDADVFVLQVAGSKTWYVHDAPAVADWQPG----ELPDDEHSPRLLHTLLEPGG 220

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISV 202
           +LY+PRG+ H+A       S H+++++
Sbjct: 221 VLYVPRGFAHRAVGAAG-LSAHLSLTI 246


>gi|386353479|ref|YP_006051726.1| hypothetical protein [Streptomyces cattleya NRRL 8057 = DSM 46488]
 gi|365811558|gb|AEW99773.1| hypothetical protein SCATT_p15800 [Streptomyces cattleya NRRL 8057
           = DSM 46488]
          Length = 395

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 62/115 (53%), Gaps = 9/115 (7%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL-PRYSSPNFSQEE 161
            GA+ + TP +AQGFAPH D+  + +L +EG+K W      MV+E   P        +E 
Sbjct: 128 AGAHVFATPRSAQGFAPHRDEHGSLVLHMEGRKSW------MVEERAGPPRPGAVPPREW 181

Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
                   +L PGD+LY+P+   H A T   E SLHVT+ ++    W D+L++ +
Sbjct: 182 ARLRFREFSLGPGDVLYIPQHCGHAART-GAERSLHVTLRLF-PLRWRDVLDEVV 234


>gi|318060501|ref|ZP_07979224.1| hypothetical protein SSA3_21343 [Streptomyces sp. SA3_actG]
 gi|318077244|ref|ZP_07984576.1| hypothetical protein SSA3_11103 [Streptomyces sp. SA3_actF]
          Length = 403

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 58/121 (47%), Gaps = 8/121 (6%)

Query: 100 GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ 159
           G    AN Y       GF  H+DD E   +QL G+K W+V  P       P Y     ++
Sbjct: 127 GELTSANAYLAVGETDGFHLHWDDHEVIAVQLSGEKAWEVRGP---SRTAPMYRDAERNR 183

Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTV---TNEHSLHVTISVYQKTA--WIDLLEK 214
                 +   TL  GD++++PRGY H A+ +   ++ HSLHVT    ++T   W++ L  
Sbjct: 184 TPSEDVLWKGTLRAGDVMHIPRGYWHTATRLGSGSSGHSLHVTFGFTKRTGVTWVNFLSD 243

Query: 215 A 215
           A
Sbjct: 244 A 244


>gi|418459769|ref|ZP_13030880.1| cupin 4 family protein [Saccharomonospora azurea SZMC 14600]
 gi|359740082|gb|EHK88931.1| cupin 4 family protein [Saccharomonospora azurea SZMC 14600]
          Length = 398

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/223 (25%), Positives = 87/223 (39%), Gaps = 18/223 (8%)

Query: 9   YISRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQT----MDVEG 59
           ++     + + +L++   +D++L     R       ++       D  R        V G
Sbjct: 28  WVHGPTADGFEDLLSLDGVDELLTQHSLRSPAFRLVRDGRTVPLSDCTRTARIGGTSVSG 87

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
            A P  V     +G ++       Y  PL      L+   G     N Y TPP AQG  P
Sbjct: 88  VADPARVLAAVEDGATLVLQGLHRYWPPLVGFCRELELELGHPCQVNAYVTPPGAQGLRP 147

Query: 120 HYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           H D  + F+LQ  G K W+V+  P                  E       V L  G  LY
Sbjct: 148 HTDSHDVFVLQAFGSKSWQVWPAPAADAGDTGDTGDTGADGVE------DVELRAGMALY 201

Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
           LP G  H A+   +  S H+T+ ++  T W DL+E A+ + L+
Sbjct: 202 LPTGTRH-AARAQHVVSGHLTVGIH-PTRWRDLIESAVGRVLR 242


>gi|386840441|ref|YP_006245499.1| cupin [Streptomyces hygroscopicus subsp. jinggangensis 5008]
 gi|374100742|gb|AEY89626.1| Cupin 4 [Streptomyces hygroscopicus subsp. jinggangensis 5008]
 gi|451793733|gb|AGF63782.1| Cupin 4 [Streptomyces hygroscopicus subsp. jinggangensis TL01]
          Length = 399

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 72/146 (49%), Gaps = 7/146 (4%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           REG ++       +   ++    +LQ      V  N Y T   A+GF  H+DD +  I+Q
Sbjct: 98  REGATLILDQLNVFDPTMEVACRALQWWSHERVQVNAYLTTNDAEGFPLHWDDHDVVIVQ 157

Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
           L G+K+W+V   R     +P Y   + +       I T  L  GD++++PRG+ H+A+  
Sbjct: 158 LAGEKQWEV---RTASRKVPMYRDCDPNDTPSAETIWTGVLRTGDVMHIPRGHWHRATRS 214

Query: 191 TN--EHSLHVTISVYQKT--AWIDLL 212
            +    SLHVT  + ++T  +W+  L
Sbjct: 215 GSGPGKSLHVTFGITRRTGASWLAWL 240


>gi|397171987|ref|ZP_10495383.1| RmlC-like domain-containing protein [Alishewanella aestuarii B11]
 gi|396086329|gb|EJI83943.1| RmlC-like domain-containing protein [Alishewanella aestuarii B11]
          Length = 384

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 61/219 (27%), Positives = 104/219 (47%), Gaps = 30/219 (13%)

Query: 2   HWEKEPLYISRKQKNYYNNL----ITSKKIDDILRDNVIEFKKN---IDITLYEDGQRQT 54
           +W+K+PL I +  KN+ + L    +    ++D +   ++E +     ++   +ED  R  
Sbjct: 18  YWQKQPLLIKQGFKNFQDPLSADELAGLALEDDVESRLVECRDGNWQLETGPFEDFNRYG 77

Query: 55  MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
            D +   L Q V DH+    +   + P  +I      N  + +L   F       TP   
Sbjct: 78  -DKDWTILVQAV-DHWHPPAA-ELIEPFRFIP-----NWRIDDLMVSF------STP--G 121

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRM-VDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
            G  PH D  + FI+Q EGK+ W+V +P   + ++ P    P   Q    T  + V +EP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGMPDASLKQFCPH---PRLLQVTPFTACIDVIMEP 178

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           GD+LY+P G  H+   V+ E+SL+ ++  ++  A  DLL
Sbjct: 179 GDILYIPPGCPHEG--VSLENSLNYSVG-FRAPAQKDLL 214


>gi|160898500|ref|YP_001564082.1| cupin 4 family protein [Delftia acidovorans SPH-1]
 gi|160364084|gb|ABX35697.1| Cupin 4 family protein [Delftia acidovorans SPH-1]
          Length = 298

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/119 (35%), Positives = 55/119 (46%), Gaps = 7/119 (5%)

Query: 89  QQLNASLQELFGCFVGA----NTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM 144
           Q+L   L E    F  A    N Y        F  H+D  +   +QL GKK W+VY P  
Sbjct: 105 QELVRRLCEEVASFTNAETTANAYLAFSGEGSFGSHWDTHDVMAIQLIGKKHWRVYAPTY 164

Query: 145 VDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
               LP  +S +F       PI    LE GDLLY+PRG+ H+   +    +LHV I +Y
Sbjct: 165 KSP-LPGQTSKSFDSTCPTDPIFDGVLEAGDLLYVPRGWWHEVLPIGE--TLHVAIGIY 220


>gi|375108697|ref|ZP_09754951.1| RmlC-like domain-containing protein [Alishewanella jeotgali KCTC
           22429]
 gi|374570883|gb|EHR42012.1| RmlC-like domain-containing protein [Alishewanella jeotgali KCTC
           22429]
          Length = 384

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 60/219 (27%), Positives = 104/219 (47%), Gaps = 30/219 (13%)

Query: 2   HWEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKN---IDITLYEDGQRQT 54
           +W+K+PL I +  KN+ +    N +    ++D +   ++E +     ++   +ED  R  
Sbjct: 18  YWQKQPLLIKQGFKNFQDPLSANELAGLALEDDVESRLVECRDGNWQLETGPFEDFSRYG 77

Query: 55  MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
            D +   L Q V DH+    +   + P  +I      N  + +L   F       TP   
Sbjct: 78  -DKDWTILVQAV-DHWHPPAA-ELIEPFRFIP-----NWRIDDLMVSF------STP--G 121

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRM-VDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
            G  PH D  + FI+Q EGK+ W+V +P   + ++ P    P   Q    T  + V +EP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGMPDASLKQFCPH---PRLLQVTPFTACIDVIMEP 178

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           GD+LY+P G  H+   ++ E+SL+ ++  ++  A  DLL
Sbjct: 179 GDILYIPPGCPHEG--ISLENSLNYSVG-FRAPAQKDLL 214


>gi|386384473|ref|ZP_10069846.1| hypothetical protein STSU_15719 [Streptomyces tsukubaensis
           NRRL18488]
 gi|385668044|gb|EIF91414.1| hypothetical protein STSU_15719 [Streptomyces tsukubaensis
           NRRL18488]
          Length = 398

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 66/119 (55%), Gaps = 17/119 (14%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFS-Q 159
           V  N Y T   A GF  H+DD +  I+QL G+K W+V   R     +P Y  + PN +  
Sbjct: 130 VQVNAYLTTNEASGFPLHWDDHDVVIVQLAGEKGWEV---RGTSRSVPLYRDADPNDTPS 186

Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH----SLHVTISVYQKT--AWIDLL 212
           EEI   I +  ++ GD++++PRG+ HQA+   N H    SLHVT  + ++T  +W+  L
Sbjct: 187 EEI---IWSGVMKAGDMMHIPRGHWHQAT--RNGHGSGKSLHVTFGITKRTGASWLAWL 240


>gi|167538117|ref|XP_001750724.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770748|gb|EDQ84429.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2295

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 43/142 (30%), Positives = 63/142 (44%), Gaps = 28/142 (19%)

Query: 94  SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS 153
           +L + F   V  + Y T P  +   PH D  + F+ Q+ G K W++ +PR+ D  + + S
Sbjct: 212 ALTDAFNVPVTQHVYITGPAGRALNPHTDGGDVFVRQISGSKHWQLCVPRLQDPSVCQPS 271

Query: 154 SPNFS------------------QEEIGTPIL----------TVTLEPGDLLYLPRGYIH 185
           +P+ +                  Q E  TP             +TL PGD LYLPRG +H
Sbjct: 272 APSPTAHPCTDGARARYAEWKRDQFEGCTPYTMGQLADMDCSNITLHPGDTLYLPRGIVH 331

Query: 186 QASTVTNEHSLHVTISVYQKTA 207
            A T +   S HVT  +  K A
Sbjct: 332 HAWTDSGITSTHVTYQLQSKDA 353


>gi|296534974|ref|ZP_06897266.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
 gi|296264700|gb|EFH11033.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
          Length = 406

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 80/347 (23%), Positives = 148/347 (42%), Gaps = 46/347 (13%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------- 52
           ++++K+PL++      +  ++++ ++I+ +L    I   +++ + L  DG          
Sbjct: 20  EYYDKQPLHVQGGAAKF-ASVLSWRQINRLLDQTHIWSSQSLKLVL--DGNAVPPEQYCG 76

Query: 53  --------QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVG 104
                   Q M  E   + + V    + G S+  LN    + P   L A    L G  +G
Sbjct: 77  RATSRDNAQVMQPEAHKVAEWV----KRGASV-VLNDVDSLTP--GLTAVSGALEGAGLG 129

Query: 105 ---ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ-- 159
              AN Y +  + + F  H+D  + + +Q+EG+K W ++  R  +  +P  +  +  Q  
Sbjct: 130 KAQANVYISWQSHKAFPSHFDTHDVWAVQVEGEKTWNIWEGR-AEWPIPHPAFRSLGQAH 188

Query: 160 --EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMP 217
             +  G     VTL+ GDLLYLPRG+ H A       S+H+   V+     +DL+   + 
Sbjct: 189 HDQAKGKLRGQVTLKTGDLLYLPRGWYHDA-LAEAPSSVHIAYGVHAPLG-MDLVNILLE 246

Query: 218 KALQAAGATDLEFRRGLP----IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLV-E 272
           + L      + EFR+ LP        R+A  +R   +  +   L     + D+L K V +
Sbjct: 247 RVL-----YETEFRKPLPRQDGTAAARFALTSRAAQLGQRLSELTRDPKVMDVLGKFVAD 301

Query: 273 YVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVY 319
           Y     G D +  + +     P    EE    V + G++  + G  +
Sbjct: 302 YRYRRGGNDLLAARGLAAPASPAGGHEENAFRVVQPGVKAVRRGAEW 348


>gi|358462324|ref|ZP_09172458.1| Cupin, JmjC-type [Frankia sp. CN3]
 gi|357071887|gb|EHI81456.1| Cupin, JmjC-type [Frankia sp. CN3]
          Length = 330

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 14/117 (11%)

Query: 102 FVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFSQ 159
            V  N Y T  +  GF  H+DD +  ++QL G K W V   R     +P Y  + PN + 
Sbjct: 52  VVQVNAYLTTRSTGGFPLHWDDHDVVVVQLAGNKTWDV---RSHSRPVPMYRDAEPNSAP 108

Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKT-----AWI 209
            E    I   T+ PGD+L++PRG+ HQA+   +E   SLH+T    ++T     AWI
Sbjct: 109 SE--DVIWRGTVTPGDVLHIPRGHWHQATHAAHEEPLSLHLTFGFVRRTGVDWLAWI 163


>gi|393761583|ref|ZP_10350220.1| RmlC-like domain-containing protein [Alishewanella agri BL06]
 gi|392607593|gb|EIW90467.1| RmlC-like domain-containing protein [Alishewanella agri BL06]
          Length = 384

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 60/219 (27%), Positives = 104/219 (47%), Gaps = 30/219 (13%)

Query: 2   HWEKEPLYISRKQKNYYNNL----ITSKKIDDILRDNVIEFKKN---IDITLYEDGQRQT 54
           +W+K+PL I +  KN+ + L    +    ++D +   ++E +     ++   +ED  R  
Sbjct: 18  YWQKQPLLIKQGFKNFQDPLSADELAGLALEDDVESRLVECRDGNWQLETGPFEDFNRYG 77

Query: 55  MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
            D +   L Q V DH+    +   + P  +I      N  + +L   F       TP   
Sbjct: 78  -DKDWTILVQAV-DHWHPPAA-ELIEPFRFIP-----NWRIDDLMVSF------STP--G 121

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRM-VDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
            G  PH D  + FI+Q EGK+ W+V +P   + ++ P    P   Q    T  + V +EP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGMPDASLKQFCPH---PRLLQVTPFTACIDVIMEP 178

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           GD+LY+P G  H+   ++ E+SL+ ++  ++  A  DLL
Sbjct: 179 GDILYIPPGCPHEG--ISLENSLNYSVG-FRAPAQKDLL 214


>gi|351699836|gb|EHB02755.1| MYC-induced nuclear antigen [Heterocephalus glaber]
          Length = 361

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 54/125 (43%), Gaps = 26/125 (20%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVT 170
           +     PHYDD+E FIL LEG+K   +YLP   +  EY   Y      +  +G      T
Sbjct: 172 SVMNIGPHYDDVEVFILHLEGEKHSWLYLPTVAVAREYSVEY------EARLGVQTHEFT 225

Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEF 230
           L+ GDLLY PRG IH                  Q  +W D L   +   +      D+E 
Sbjct: 226 LKTGDLLYFPRGPIH------------------QNNSWGDFLLDTISGIVFDTAKEDVEL 267

Query: 231 RRGLP 235
           R G+P
Sbjct: 268 RSGIP 272


>gi|345851505|ref|ZP_08804478.1| hypothetical protein SZN_17147 [Streptomyces zinciresistens K42]
 gi|345637051|gb|EGX58585.1| hypothetical protein SZN_17147 [Streptomyces zinciresistens K42]
          Length = 398

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 65/117 (55%), Gaps = 13/117 (11%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFS-Q 159
           V  N Y T   A GF  H+DD +  I+QL G+K+W+V   R     +P Y  + PN +  
Sbjct: 130 VQVNAYLTTNDASGFPLHWDDHDVVIVQLAGEKEWEV---RATSRAVPMYRDADPNSTPS 186

Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKT--AWIDLL 212
           +EI   I +  +  GD++++PRG+ HQA+   +    SLHVT  + ++T  +W+  L
Sbjct: 187 DEI---IWSGLMRTGDVMHIPRGHWHQATRTGSGSGKSLHVTFGITKRTGASWLAWL 240


>gi|386837115|ref|YP_006242173.1| cupin [Streptomyces hygroscopicus subsp. jinggangensis 5008]
 gi|374097416|gb|AEY86300.1| Cupin 4 family protein [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451790473|gb|AGF60522.1| Cupin 4 family protein [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 316

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 65/131 (49%), Gaps = 5/131 (3%)

Query: 72  EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
           +G ++   +   +  P +++ A+L    G  V A  + TPP  QG   H DD +  +LQ+
Sbjct: 96  KGATLLLRSVDHWHGPTREVLAALAGELGRGVEAFFFVTPPGHQGLPLHRDDADVLVLQV 155

Query: 132 EGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVT 191
            G+K W+++      ++      P+   E     IL  TL+ G++LY+PRG+ H A   T
Sbjct: 156 AGRKDWRIHAKPDSADWRAGPVDPSAGPEP--AEILRTTLDAGEVLYVPRGFAHSA---T 210

Query: 192 NEHSLHVTISV 202
            E  L   +SV
Sbjct: 211 GERGLSAHLSV 221


>gi|401404980|ref|XP_003881940.1| Cupin 4 family protein, related [Neospora caninum Liverpool]
 gi|325116354|emb|CBZ51907.1| Cupin 4 family protein, related [Neospora caninum Liverpool]
          Length = 533

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 68/149 (45%), Gaps = 24/149 (16%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI----- 124
           Y +G S+        +  LQ +   L + +   V A +Y TPP       H DD      
Sbjct: 208 YLDGFSLVVNQADRTLGVLQSICHHLAKKYFTHVFAVSYLTPPNTHAVKTHTDDQSLLPG 267

Query: 125 ---EAFILQLEGKKKWKVYLP--------RMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
              +  ++QL G K WK++ P         M+ +  P    P       G P+L   L+ 
Sbjct: 268 SSQDVILIQLWGSKSWKIWTPPQVLPLTEEMLGKQGPFPGDP-------GEPLLDFVLKE 320

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISV 202
           GD+LY+PRG+ H A T T E SLHVT++V
Sbjct: 321 GDILYIPRGFPHAALT-TEEPSLHVTLTV 348


>gi|323457139|gb|EGB13005.1| hypothetical protein AURANDRAFT_60763 [Aureococcus anophagefferens]
          Length = 2291

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 6/104 (5%)

Query: 105  ANTYQTPPTAQ-GFAPHYDDIEAFILQLEGKKKWKVYLP----RMVDEYLPRYSSPNFSQ 159
            AN Y T P  +   +PH D  +  + Q  G+K+W+V+ P    R VD +         + 
Sbjct: 2035 ANVYLTDPGREVSVSPHTDAQDVLVFQTAGRKRWRVWRPPARRRGVDPFGRGKHGDAIAP 2094

Query: 160  EEIGTPILTVTLEPGDLLYLPRGYIHQAST-VTNEHSLHVTISV 202
             E+  P+L   LEPGD+LY+P G+ H  ST   +  S HVT+ +
Sbjct: 2095 AELDEPLLDAWLEPGDVLYVPIGFPHATSTEACDAVSCHVTLGL 2138


>gi|395774364|ref|ZP_10454879.1| hypothetical protein Saci8_31535 [Streptomyces acidiscabies 84-104]
          Length = 398

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 5/107 (4%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI 162
           V  N Y T   A GF  H+DD +  I+QL G+K+W+V   R     +P Y   + +    
Sbjct: 130 VQVNAYLTTNDAAGFPLHWDDHDVIIVQLAGEKEWEV---RSTSRKVPMYRDADPNTAPS 186

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKTA 207
              I +  +  GD++++PRG+ HQA+   +    SLHVT  + ++T 
Sbjct: 187 DQIIWSGVMRTGDVMHIPRGHWHQATRTGSGSGKSLHVTFGITKRTG 233


>gi|312881223|ref|ZP_07741021.1| hypothetical protein VIBC2010_08478 [Vibrio caribbenthicus ATCC
           BAA-2122]
 gi|309371115|gb|EFP98569.1| hypothetical protein VIBC2010_08478 [Vibrio caribbenthicus ATCC
           BAA-2122]
          Length = 376

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 24/197 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG-- 59
           HW K+P+ I    KN + + +     D++     +  + +ID     + Q +     G  
Sbjct: 15  HWHKKPIVI----KNAFIDFVDPITPDELAG---LAMESDIDSRFINNKQGRWSVTHGPL 67

Query: 60  -----RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPT 113
                 +L +  W    + C+       T + P +QL   L  +L  CF       + P 
Sbjct: 68  SEDFFTSLEESHWQLVVQACNHWHSGSATLLDPFKQLPQWLFDDLMVCF-------SAPR 120

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
             G  PH D  + FI+Q  G++ WKV   R   +Y     +    Q E   PI+   LEP
Sbjct: 121 G-GVGPHIDQYDVFIVQGSGQRHWKVG-ARDQGQYKESIQAGALRQIEGFDPIIDQVLEP 178

Query: 174 GDLLYLPRGYIHQASTV 190
           GDLLY+P G+ H+  T+
Sbjct: 179 GDLLYIPPGFPHEGDTL 195


>gi|256088456|ref|XP_002580351.1| hypothetical protein [Schistosoma mansoni]
          Length = 333

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/143 (30%), Positives = 72/143 (50%), Gaps = 11/143 (7%)

Query: 2   HWEKEPLYISR---KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE 58
           +++K PL++ R   K   + N  +  K +D       + F +++D+   +  + + ++  
Sbjct: 193 NYQKTPLHVKRSSSKCGEWLNYSLVKKLVDK----EFLFFGQHVDLIEDKTLKHEIVNPP 248

Query: 59  GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY--QTPPTAQG 116
           GRA    VW+HY +G  IRF N QT+ + L+     LQE FGC      Y   + PT   
Sbjct: 249 GRAFAGTVWEHYSQGKGIRFHNLQTFSRRLRFRVGLLQEYFGCNFTVTAYLLSSKPTELS 308

Query: 117 FAPHYDDIEAFILQLEGKKKWKV 139
           F     D+  FILQ EG +K+++
Sbjct: 309 FNQLDHDL--FILQQEGSQKFQI 329


>gi|350643695|emb|CCD58399.1| unnamed protein product [Schistosoma mansoni]
          Length = 146

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 71/142 (50%), Gaps = 11/142 (7%)

Query: 3   WEKEPLYISR---KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
           ++K PL++ R   K   + N  +  K +D       + F +++D+   +  + + ++  G
Sbjct: 7   YQKTPLHVKRSSSKCGEWLNYSLVKKLVDK----EFLFFGQHVDLIEDKTLKHEIVNPPG 62

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY--QTPPTAQGF 117
           RA    VW+HY +G  IRF N QT+ + L+     LQE FGC      Y   + PT   F
Sbjct: 63  RAFAGTVWEHYSQGKGIRFHNLQTFSRRLRFRVGLLQEYFGCNFTVTAYLLSSKPTELSF 122

Query: 118 APHYDDIEAFILQLEGKKKWKV 139
                D + FILQ EG +K+++
Sbjct: 123 NQ--LDHDLFILQQEGSQKFQI 142


>gi|290956631|ref|YP_003487813.1| hypothetical protein SCAB_21351 [Streptomyces scabiei 87.22]
 gi|260646157|emb|CBG69250.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
          Length = 396

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 96/213 (45%), Gaps = 35/213 (16%)

Query: 65  IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
           +V DH      + F +P      L+    +L    G    AN Y       GF  H+DD 
Sbjct: 103 VVLDH------VDFFDPT-----LEVACRALGWWSGELTSANAYLAVGDTDGFNLHWDDH 151

Query: 125 EAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFS-QEEIGTPILTVTLEPGDLLYLPRG 182
           +   +QL G+K W+V  P R    Y  R +  N +  EE+   +   T++ GD++++PRG
Sbjct: 152 DVIAVQLSGEKSWEVRGPSRPAPMY--RDAERNLAPSEEV---LWRGTMQAGDVMHIPRG 206

Query: 183 YIHQASTVT---NEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
           + H A+ +    + HSLHVT  + ++T   W + L          A   D +FR  L   
Sbjct: 207 FWHTATRIGSGDDGHSLHVTFGLTKRTGVTWANFLSD--------AARADEDFRTDLE-- 256

Query: 238 YLRYAGLARGKPVDIQADRLAMKENLKDLLAKL 270
             R  G A G+ +  +   LA + + K  LA+L
Sbjct: 257 --RSDGTADGEVLAAKLTDLAREYDPKRYLAEL 287


>gi|398832643|ref|ZP_10590800.1| Cupin superfamily protein [Herbaspirillum sp. YR522]
 gi|398222980|gb|EJN09336.1| Cupin superfamily protein [Herbaspirillum sp. YR522]
          Length = 393

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 4/121 (3%)

Query: 86  KPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV 145
           +PL ++   L         AN Y +     GF  H+DD +  I+Q++G+KKW+ +     
Sbjct: 112 RPLAEMTQCLACALDVRSAANAYMSFGLTSGFGVHHDDHDVIIIQIDGRKKWQFFSLPDA 171

Query: 146 DEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
           +E          ++ + G    +V +  GD++++P+G  H   T  NE SLH+TIS+   
Sbjct: 172 NEKATVADLEAPTESDRGQ---SVIIHTGDVMFIPKGTWHDV-TAMNERSLHLTISLVNP 227

Query: 206 T 206
           T
Sbjct: 228 T 228


>gi|256398046|ref|YP_003119610.1| cupin [Catenulispora acidiphila DSM 44928]
 gi|256364272|gb|ACU77769.1| Cupin 4 family protein [Catenulispora acidiphila DSM 44928]
          Length = 395

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/182 (25%), Positives = 79/182 (43%), Gaps = 15/182 (8%)

Query: 38  FKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQE 97
           F  N     ++  +R  M   GR L         EG ++       +   L+    +L  
Sbjct: 73  FLTNFVSRRHQASRRADMAAVGRIL--------NEGGTLILDTINQFDPTLEVACRALGW 124

Query: 98  LFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF 157
             G  V  N Y       GF+ H+DD +  ++Q+ G+K W+V   R     +P Y     
Sbjct: 125 WTGELVSVNAYLAVGDTAGFSTHWDDHDVLVVQVAGQKSWEV---RPASRPVPMYRDAEQ 181

Query: 158 SQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKTA--WIDLLE 213
           + E     + + T+  GD++++PRG+ H A+ V +    SLH+T  + ++T   W+  L 
Sbjct: 182 NLEAPEELLWSGTMNTGDVMHIPRGFWHAATRVGSGEGISLHLTFGITRRTGVTWVQHLA 241

Query: 214 KA 215
            A
Sbjct: 242 DA 243


>gi|157375545|ref|YP_001474145.1| cupin 4 family protein [Shewanella sediminis HAW-EB3]
 gi|157317919|gb|ABV37017.1| cupin 4 family protein [Shewanella sediminis HAW-EB3]
          Length = 380

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 91/211 (43%), Gaps = 38/211 (18%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W+K+PL I R+  N + +L+T +++  +  D ++E ++        +G+ Q       
Sbjct: 16  DYWQKKPLVI-RQGFNNFKDLLTPEEMAGLACDEIVESRR----VYKSEGEWQA------ 64

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCF----------VGANT 107
                    +    S   L    +   +Q LN  L   ++L  CF          V A +
Sbjct: 65  --------EFGPFESYEHLGESDWTLVVQALNNWLPDAEKLIQCFDFIPRWRFDDVMA-S 115

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
           Y TP    G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+
Sbjct: 116 YATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHKEFA---AHPALLHTEAFDPII 170

Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
            V L PGD+LYLP GY H   T+    S  V
Sbjct: 171 DVELLPGDILYLPPGYPHDGITLEPSMSFSV 201


>gi|428171176|gb|EKX40095.1| hypothetical protein GUITHDRAFT_143042 [Guillardia theta CCMP2712]
          Length = 422

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 50/93 (53%), Gaps = 10/93 (10%)

Query: 120 HYDDIEAFILQLEGKKKWKVYLPR------MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           H D  +   +Q +G K+W++Y+P       M D +     +   + EE+  P+L V LEP
Sbjct: 262 HTDKQDVIAIQAQGSKRWRIYMPPHPANKVMQDPFTRGKGNDVLTFEELEEPLLDVVLEP 321

Query: 174 GDLLYLPRGYIHQASTV----TNEHSLHVTISV 202
           G +LY+P GY H   TV    T E S+H+T+ V
Sbjct: 322 GQMLYVPAGYPHVTDTVNGVKTKEESIHLTLGV 354


>gi|167535487|ref|XP_001749417.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772045|gb|EDQ85702.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2781

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 44/146 (30%), Positives = 62/146 (42%), Gaps = 24/146 (16%)

Query: 94  SLQELFGCFVGANT-----YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
           SL+EL     G++T     Y +   A    PH D  +  +L L G+K W V  P      
Sbjct: 117 SLEELIQTAFGSSTVTSHVYLSGSEAHALNPHTDPYDVLVLHLHGQKHWTVCHPLDSSTE 176

Query: 149 LPRYSSPNFSQ------------------EEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
               S    +Q                  E         TL PGD+LYLP+  IH A+T 
Sbjct: 177 WADASQAQLAQLFEIQRKSVDGCTNFDIAETDKMRCEHFTLSPGDVLYLPKSTIHFATTS 236

Query: 191 TNEHSLHVTISV-YQKTAWIDLLEKA 215
            N  + H+T+S+  Q  +WIDL+ +A
Sbjct: 237 PNTTTAHITLSLERQGQSWIDLVRRA 262


>gi|389877013|ref|YP_006370578.1| cupin superfamily protein [Tistrella mobilis KA081020-065]
 gi|388527797|gb|AFK52994.1| cupin superfamily protein [Tistrella mobilis KA081020-065]
          Length = 387

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/168 (26%), Positives = 73/168 (43%), Gaps = 23/168 (13%)

Query: 44  ITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFV 103
           +T    GQR  +D + + L Q        G S+  L    +++ L  L A ++      V
Sbjct: 74  LTRLVGGQRVLLDAQLQTLCQ-------GGASLVLLGMHRHVQALAALAAMVERHVEAPV 126

Query: 104 GANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI- 162
             N Y +      F  H D  +  ++Q+ G K+W  +          R S P F  E + 
Sbjct: 127 TINAYASFTRDGAFQIHRDGHDVLVVQIAGSKRWFCH---------GRRSDPPFGGETVD 177

Query: 163 -----GTPILTVTLEPGDLLYLPRGYIHQASTV-TNEHSLHVTISVYQ 204
                G P   + LEPGDLL++P+G  H+A        SLH+T+++ +
Sbjct: 178 PRRDPGPPEAEMVLEPGDLLFVPKGDYHRAEVADAGGQSLHLTVAIQR 225


>gi|294932895|ref|XP_002780495.1| hypothetical protein Pmar_PMAR001087 [Perkinsus marinus ATCC 50983]
 gi|239890429|gb|EER12290.1| hypothetical protein Pmar_PMAR001087 [Perkinsus marinus ATCC 50983]
          Length = 256

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 44/78 (56%)

Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
           +A ILQ +G+K W VY       Y       +   + +G P+ +VTLE GDLL +PRG I
Sbjct: 30  DALILQCKGRKHWDVYDSSSSHPYPGEEVGKDMPIDNLGEPLYSVTLEEGDLLLIPRGVI 89

Query: 185 HQASTVTNEHSLHVTISV 202
           H+A     + SLH+T+ +
Sbjct: 90  HRARASQEQGSLHITVKI 107


>gi|167533989|ref|XP_001748673.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772914|gb|EDQ86560.1| predicted protein [Monosiga brevicollis MX1]
          Length = 3197

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 44/146 (30%), Positives = 61/146 (41%), Gaps = 24/146 (16%)

Query: 94  SLQELFGCFVGANT-----YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
           SL+EL     G +T     Y +   A    PH D  +  +L L G+K W V  P      
Sbjct: 141 SLEELIQTAFGTSTVTSHVYLSGSEAHALNPHTDPYDVLVLHLHGQKHWTVCHPLDSSTE 200

Query: 149 LPRYSSPNFSQ------------------EEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
               S    +Q                  E         TL PGD+LYLP+  IH A+T 
Sbjct: 201 WADASQAQLAQLFEMQRKSVDGCTNFDIAETDKMRCEHFTLSPGDVLYLPKSTIHFATTS 260

Query: 191 TNEHSLHVTISV-YQKTAWIDLLEKA 215
            N  + H+T+S+  Q  +WIDL+ +A
Sbjct: 261 PNTTTAHITLSLERQGQSWIDLVRRA 286


>gi|170727053|ref|YP_001761079.1| cupin 4 family protein [Shewanella woodyi ATCC 51908]
 gi|169812400|gb|ACA86984.1| Cupin 4 family protein [Shewanella woodyi ATCC 51908]
          Length = 374

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 3/83 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D+ + F+LQ EG ++W+V       +Y PR    N +  +   PI+ V LE GD
Sbjct: 120 GVGPHIDNYDVFLLQGEGTRRWRV---GAKGDYTPRGGDTNTALIDDFEPIIDVVLEAGD 176

Query: 176 LLYLPRGYIHQASTVTNEHSLHV 198
           +LY+P GY H A T+T   S  +
Sbjct: 177 MLYIPPGYPHCAETLTTALSYSI 199


>gi|312194488|ref|YP_004014549.1| hypothetical protein FraEuI1c_0601 [Frankia sp. EuI1c]
 gi|311225824|gb|ADP78679.1| hypothetical protein FraEuI1c_0601 [Frankia sp. EuI1c]
          Length = 412

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 69/142 (48%), Gaps = 6/142 (4%)

Query: 73  GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
           G ++     +T+   ++    +L+   G  V  N Y T  +A GF  H+DD +  I+QL 
Sbjct: 107 GVTLVLDGLETFDPIVEVATRALRWWSGELVQTNAYLTTRSADGFPLHWDDHDVLIVQLA 166

Query: 133 GKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTN 192
           G+K W V   R      P +     ++      +    L  G++L++PRGY HQA+ V +
Sbjct: 167 GEKNWDV---RGSTRSAPMFRDAVPNEVASSEAVWQGVLRAGEVLHIPRGYWHQATRVEH 223

Query: 193 EH--SLHVTISVYQKTAWIDLL 212
           +   SLH+T    ++T  +D L
Sbjct: 224 DDPVSLHLTFGFTRRTG-VDWL 244


>gi|291452955|ref|ZP_06592345.1| cupin 4 [Streptomyces albus J1074]
 gi|291355904|gb|EFE82806.1| cupin 4 [Streptomyces albus J1074]
          Length = 398

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 59/114 (51%), Gaps = 7/114 (6%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI 162
           V  N Y T   A GF  H+DD +  I+QL G+K+W+V   R      P Y   + +    
Sbjct: 130 VQVNAYLTTNDAAGFPLHWDDHDVVIVQLAGEKEWEV---RGTSRPAPMYRDADPNSTPT 186

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKT--AWIDLL 212
              + +  +  GD++++PRG+ HQA+   +    SLHVT  V ++T  +W+  L
Sbjct: 187 DEIVWSGVMRTGDVMHIPRGHWHQATRTGSGSGKSLHVTFGVTKRTGASWLAWL 240


>gi|157962362|ref|YP_001502396.1| cupin 4 family protein [Shewanella pealeana ATCC 700345]
 gi|157847362|gb|ABV87861.1| Cupin 4 family protein [Shewanella pealeana ATCC 700345]
          Length = 376

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 5/89 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G   H D+ + FI+Q EG++ W V       +Y P+ + P     E   PI
Sbjct: 113 SFATP--GGGVGAHIDNYDVFIIQGEGQRHWTV---GDKGQYAPKNNDPTTPLIEGFEPI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHS 195
           + V LE GDLLY+P G+ HQ  T+TN  S
Sbjct: 168 IDVVLEKGDLLYIPPGFPHQGQTITNAMS 196


>gi|365864906|ref|ZP_09404580.1| hypothetical protein SPW_4884 [Streptomyces sp. W007]
 gi|364005613|gb|EHM26679.1| hypothetical protein SPW_4884 [Streptomyces sp. W007]
          Length = 369

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 59/114 (51%), Gaps = 7/114 (6%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI 162
           V  N Y T   A GF  H+DD +  I+QL G+K W+V   R      P Y   + +    
Sbjct: 101 VQVNAYLTTNDAAGFPLHWDDHDVLIVQLAGEKDWEV---RGASRTAPMYRDTDANTTPS 157

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKT--AWIDLL 212
              + + T+  GD++++PRG+ HQA+   +    SLHVT  + +++  +W+  L
Sbjct: 158 EDIVWSGTMMAGDVMHIPRGHWHQATRTGSGSGKSLHVTFGITKRSGASWLAWL 211


>gi|77360634|ref|YP_340209.1| RmlC-like domain-containing protein [Pseudoalteromonas haloplanktis
           TAC125]
 gi|76875545|emb|CAI86766.1| putative enzyme with RmlC-like domain [Pseudoalteromonas
           haloplanktis TAC125]
          Length = 381

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/198 (28%), Positives = 91/198 (45%), Gaps = 27/198 (13%)

Query: 3   WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKN---IDITLYEDGQRQTM 55
           W+K+PL I +   N+ +    N +    ++D +   ++    N        +ED ++ T 
Sbjct: 19  WQKKPLLIKQGFSNFQDPLDANELAGLAMEDSIESRIVTNHNNEWQSHQGPFEDFEQLT- 77

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
           +     L Q V DH+    + + L P  +I      N  + +L        +Y TP    
Sbjct: 78  EQHSTLLVQAV-DHWHNDAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--GG 122

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
           G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E  T ++   LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPSLKQFTQNKTLLQVEAFTAVIDCILEPG 179

Query: 175 DLLYLPRGYIHQASTVTN 192
           D+LY+P G  H+   V N
Sbjct: 180 DILYIPPGCPHEGYAVEN 197


>gi|163751233|ref|ZP_02158461.1| hypothetical protein KT99_13827 [Shewanella benthica KT99]
 gi|161328947|gb|EDQ00021.1| hypothetical protein KT99_13827 [Shewanella benthica KT99]
          Length = 374

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + F+LQ EG ++WKV        Y PR    + +  +   PI
Sbjct: 113 SFATP--GGGVGPHIDNYDVFLLQGEGTRRWKV---GARGNYAPRGGDTHTALIDDFEPI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           + V LE GD+LY+P GY H+  T+T   S  +
Sbjct: 168 IDVVLEKGDMLYIPPGYPHRGETITTALSYSI 199


>gi|167525110|ref|XP_001746890.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774670|gb|EDQ88297.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1934

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 54/202 (26%), Positives = 80/202 (39%), Gaps = 32/202 (15%)

Query: 94  SLQELFGCFVGANT-----YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
           SL+EL     G +T     Y +   A    PH D  +  +L L G+K W V  P      
Sbjct: 141 SLEELIQTAFGTSTVTSHVYLSGSEAHALNPHTDPYDVLVLHLHGQKHWTVCHPLDSSPE 200

Query: 149 LPRYSSPNFSQ------------------EEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
               S    +Q                  E         TL PGD+LYLP+  IH A+T 
Sbjct: 201 WADASQAQLAQLFEIQRKSVDGCTNFDIAETDKMRCEHFTLSPGDVLYLPKSTIHFATTS 260

Query: 191 TNEHSLHVTISV-YQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKP 249
            N  + H+T+S+  Q  +WIDL    +   L +   +    R      +L  A     +P
Sbjct: 261 PNTTTAHITLSLERQGQSWIDLACTVIESTLASLETSPSGLR------WLELATFPTDEP 314

Query: 250 VDIQADRLA--MKENLKDLLAK 269
             +   RL    K++L+ L+A+
Sbjct: 315 CQMAQKRLLGFEKDSLRTLVAQ 336


>gi|374370246|ref|ZP_09628255.1| cupin 4 family protein [Cupriavidus basilensis OR16]
 gi|373098248|gb|EHP39360.1| cupin 4 family protein [Cupriavidus basilensis OR16]
          Length = 303

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 64/145 (44%), Gaps = 4/145 (2%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAF 127
           D   +G ++        ++ ++ L + + E       AN Y        F  H+D  +  
Sbjct: 80  DLLSKGATVVLNRIDARLERVRHLCSEIAEFSNSETLANGYLAFSGRGSFGAHWDTHDVM 139

Query: 128 ILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQA 187
            +QL G+K+W+VY P      L   +S  +  +    PI    LE GDLLYLPRG+ H A
Sbjct: 140 AIQLIGRKRWRVYKPTFPFP-LTGQTSKEYKAQCPAEPIFDGILEAGDLLYLPRGWWHNA 198

Query: 188 STVTNEHSLHV---TISVYQKTAWI 209
           + +     + V   T SV    +W+
Sbjct: 199 TPIGETFHIAVGMHTASVVHYVSWL 223


>gi|449688443|ref|XP_002157896.2| PREDICTED: lysine-specific demethylase NO66-like [Hydra
           magnipapillata]
          Length = 190

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 62/111 (55%), Gaps = 6/111 (5%)

Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
           FR GLPI +  + G++  +    + D     + +K L+ KL++YV++D+  DE+    M 
Sbjct: 2   FREGLPIDFSSFVGVSNSEKKCPERDTFV--KTVKKLMEKLIDYVEIDEAGDELVLDHME 59

Query: 290 DALPPVLSPEELQCSVF-ENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
           +  PP  S ++++CSVF ++G      GEV    +I   T++RLVR   VR
Sbjct: 60  EFQPPYYSSDDVECSVFSKDGF---WDGEVRRHHKIELSTELRLVRPGIVR 107


>gi|404379958|ref|ZP_10985005.1| hypothetical protein HMPREF9021_01816 [Simonsiella muelleri ATCC
           29453]
 gi|294482629|gb|EFG30319.1| hypothetical protein HMPREF9021_01816 [Simonsiella muelleri ATCC
           29453]
          Length = 384

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 80/166 (48%), Gaps = 21/166 (12%)

Query: 66  VWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG----FAP 119
           V++H R+G +I +  +N +  +        ++ +    FV A T  +   A G    F  
Sbjct: 84  VYEHLRKGATIVYNRINNEPLVD-------AIAKHIAQFVQAQTVVSGYLAFGSDASFKN 136

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFSQEEIGTPILTVTLEPGDLL 177
           H+D  + F +QL GKK W +Y P      +P Y   S + +  E   P + V LE GD+L
Sbjct: 137 HWDTRDVFAVQLMGKKHWSLYAPNFE---MPLYMQQSKDVNVPEPQHPDMEVVLEAGDIL 193

Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQ 221
           Y+PRG+ H    +  E + H+ I  +  T   +++ L K +P  ++
Sbjct: 194 YIPRGWWHSPVPMNCE-TFHLAIGTFAPTGYQYMEWLMKKVPSIVE 238


>gi|294140589|ref|YP_003556567.1| hypothetical protein SVI_1818 [Shewanella violacea DSS12]
 gi|293327058|dbj|BAJ01789.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 374

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + F+LQ EG ++WKV        Y PR    + +  +   PI
Sbjct: 113 SFATP--GGGVGPHIDNYDVFLLQGEGTRRWKV---GARGNYSPRGGDTHTALIDDFEPI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           + V LE GD+LY+P GY H+  T+T   S  +
Sbjct: 168 IDVVLEKGDMLYIPPGYPHRGETITTALSYSI 199


>gi|167534035|ref|XP_001748696.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772937|gb|EDQ86583.1| predicted protein [Monosiga brevicollis MX1]
          Length = 806

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 49/173 (28%), Positives = 79/173 (45%), Gaps = 17/173 (9%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFG-CFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           R+  S+ F   Q   +    L  +LQ +FG     A+ Y + P A+   PH D  +  ++
Sbjct: 143 RDHLSLVFRREQLDAQNYSVLETTLQAVFGSASTTAHAYLSAPGARALQPHTDPYDVIVV 202

Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
           QL G+K+W++    +     P      F  + +        +EPGD+LYLP+  IH A  
Sbjct: 203 QLAGRKRWRLCTGCL---NWPESDLTRFHCQSLW-------MEPGDVLYLPKAVIHVADA 252

Query: 190 --VTNEHSLHVTISVYQKTA-WIDLLEKA---MPKALQAAGATDLEFRRGLPI 236
                  S+H+T S+ ++ A W  LL  A    P    A+   D+E  +G P+
Sbjct: 253 PHADETTSIHLTYSLDREGATWHHLLATAACVTPATPHASAHPDVESMQGSPL 305


>gi|259908744|ref|YP_002649100.1| hypothetical protein EpC_20960 [Erwinia pyrifoliae Ep1/96]
 gi|387871629|ref|YP_005803004.1| hypothetical protein EPYR_02253 [Erwinia pyrifoliae DSM 12163]
 gi|224964366|emb|CAX55875.1| conserved uncharacterized protein YcfD [Erwinia pyrifoliae Ep1/96]
 gi|283478717|emb|CAY74633.1| Uncharacterized protein ycfD [Erwinia pyrifoliae DSM 12163]
          Length = 375

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 89/192 (46%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q     G  
Sbjct: 16  YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + +    V            G  PH
Sbjct: 67  -PFESYDHLGENNWSLLVQAVNHWHEPSAALMHPFRAIPDWRVDDLMISFSVAGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V   R + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKREMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|392555630|ref|ZP_10302767.1| RmlC-like domain-containing protein [Pseudoalteromonas undina NCIMB
           2128]
          Length = 380

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           +Y TP    G  PH D  + FI+Q EGK+ W+V LP   D  L +++ + +  Q E  T 
Sbjct: 116 SYSTP--GGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DASLKQFAQNKSLLQVEAFTA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           ++   LEPGD+LY+P G  H+   + N
Sbjct: 171 VIDCVLEPGDILYIPPGCPHEGYAIEN 197


>gi|359433689|ref|ZP_09224009.1| hypothetical protein P20652_2123 [Pseudoalteromonas sp. BSi20652]
 gi|357919668|dbj|GAA60258.1| hypothetical protein P20652_2123 [Pseudoalteromonas sp. BSi20652]
          Length = 385

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 58/198 (29%), Positives = 92/198 (46%), Gaps = 27/198 (13%)

Query: 3   WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQTM 55
           W+K+PL I +   N+ +    N +    +++ +   +I   KN   T    +ED ++ T 
Sbjct: 19  WQKKPLLIKQGFANFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT- 77

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
           D     L Q V DH+    + + L P  +I      N  + +L        +Y TP    
Sbjct: 78  DDHSTLLVQAV-DHWHSEAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--NG 122

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
           G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    ++   LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEAVIDCILEPG 179

Query: 175 DLLYLPRGYIHQASTVTN 192
           D+LY+P G  H+   V N
Sbjct: 180 DILYIPPGCPHEGYAVEN 197


>gi|417948195|ref|ZP_12591343.1| hypothetical protein VISP3789_06108 [Vibrio splendidus ATCC 33789]
 gi|342810225|gb|EGU45318.1| hypothetical protein VISP3789_06108 [Vibrio splendidus ATCC 33789]
          Length = 385

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 89/206 (43%), Gaps = 20/206 (9%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
           +H+ +G +          K  QQL N    +L  C      Y  P    G  PH D  + 
Sbjct: 88  NHWHQGAN-------ELTKAFQQLPNWLFDDLMIC------YSAPEG--GVGPHIDQYDV 132

Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
           FI+Q +GK++WKV   + V +Y     +    Q E   PI+  TLEPGD+LY+P G+ H+
Sbjct: 133 FIIQGQGKRQWKVG-AKDVGQYKETVQASALRQIEGFEPIIDETLEPGDILYIPPGFPHE 191

Query: 187 ASTVTNEHSLHVTISVYQKTAWI-DLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
            +T+    S  +     ++   I +  +  +   +      D EF+     G +R + L+
Sbjct: 192 GNTLEPSMSYSIGYRSPKEQELISNFADFVLAHDMGDVHLHDPEFKAQDSYGKIRSSDLS 251

Query: 246 RGKPVDIQADRLAMKENLKDLLAKLV 271
                D+    L   E + D +  L+
Sbjct: 252 N--LTDMLKSALEQPERVSDFMGCLL 275


>gi|385788087|ref|YP_005819196.1| hypothetical protein EJP617_26280 [Erwinia sp. Ejp617]
 gi|310767359|gb|ADP12309.1| conserved uncharacterized protein YcfD [Erwinia sp. Ejp617]
          Length = 375

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 89/192 (46%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q     G  
Sbjct: 16  YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + +    V            G  PH
Sbjct: 67  -PFESYDHLGENNWSLLVQAVNHWHEPSAALMHPFRAIPDWRVDDLMISFSVAGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V   R + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKREMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|408533826|emb|CCK32000.1| hypothetical protein BN159_7621 [Streptomyces davawensis JCM 4913]
          Length = 403

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 54/119 (45%), Gaps = 8/119 (6%)

Query: 102 FVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE 161
              AN Y       GF  H+DD E   +QL G+K W+V  P       P Y     ++  
Sbjct: 129 LTSANAYLAVGDTDGFHLHWDDHEVIAVQLSGEKSWEVRGP---SRPAPMYRDAERNRTP 185

Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTV---TNEHSLHVTISVYQKTA--WIDLLEKA 215
               +   T+  GD++++PRG+ H A+ +      HSLHVT    ++T   W + L  A
Sbjct: 186 SEEALWKGTMRAGDVMHIPRGFWHTATRLGSGAGGHSLHVTFGFTKRTGVTWANFLSDA 244


>gi|127512486|ref|YP_001093683.1| cupin 4 family protein [Shewanella loihica PV-4]
 gi|126637781|gb|ABO23424.1| Cupin 4 family protein [Shewanella loihica PV-4]
          Length = 386

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 5/92 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y TP    G  PH D+ + F+LQ EG+++WKV       +Y PR    + +  +   PI
Sbjct: 117 SYATP--GGGVGPHIDNYDVFLLQGEGRRRWKV---GAKGQYSPRGGDTHTALIDDFEPI 171

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           L V LE GD+LY+P G+ H+  T+    S  +
Sbjct: 172 LDVVLEAGDMLYIPPGFPHRGETLETALSYSI 203


>gi|392532539|ref|ZP_10279676.1| RmlC-like domain-containing protein [Pseudoalteromonas arctica A
           37-1-2]
          Length = 385

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 58/198 (29%), Positives = 93/198 (46%), Gaps = 27/198 (13%)

Query: 3   WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQTM 55
           W+K+PL I +   N+ +    N +    +++ +   +I   KN   T    +ED ++ T 
Sbjct: 19  WQKKPLLIKQGFANFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT- 77

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
           D     L Q V DH+    + + L P  +I      N  + +L        +Y TP    
Sbjct: 78  DDHSTLLVQAV-DHWHSDAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--NG 122

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
           G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    ++   LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEPFEAVIDCILEPG 179

Query: 175 DLLYLPRGYIHQASTVTN 192
           D+LY+P G  H+  +V N
Sbjct: 180 DILYIPPGCPHEGYSVEN 197


>gi|320166683|gb|EFW43582.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 908

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 46/158 (29%), Positives = 76/158 (48%), Gaps = 12/158 (7%)

Query: 72  EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
           +G ++  ++   +  PL +L A +++  G F   +   T P      P     +  ++Q 
Sbjct: 352 DGATVILISAHNFHAPLMRLVADVEQTLGVFSDVSATFTGPVDYFSRPRSPMFDLILVQT 411

Query: 132 EGKKKWKV---YLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAS 188
            G + W+V     P   D+Y       NF   E     +   L+ GD+LY+PRG I  +S
Sbjct: 412 FGSQIWRVCEPIFPLPNDKY-----DGNFMPSEALQHCVDAHLQAGDVLYIPRGTI-ISS 465

Query: 189 TVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAG 224
             T+E SLHVTIS++   A  W  +LE ++ +AL +A 
Sbjct: 466 RATDEASLHVTISLHSHDAFTWHTVLETSL-RALVSAN 502


>gi|407978129|ref|ZP_11158963.1| hypothetical protein BA1_02960 [Bacillus sp. HYC-10]
 gi|407415391|gb|EKF36992.1| hypothetical protein BA1_02960 [Bacillus sp. HYC-10]
          Length = 330

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 50/172 (29%), Positives = 78/172 (45%), Gaps = 30/172 (17%)

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVG----ANTYQTPPTAQGFA 118
           P+     Y +G ++ F     +I  L++  + L+ L G   G    A  Y    +A GF+
Sbjct: 85  PEEAMSWYEKGAALEFDFTDIFIPELRESISKLKSLLGLPEGTMAKAVLYAAKESA-GFS 143

Query: 119 PHYDDIEAFILQLEGKKKWK----------------VYLPRMVDEYLPRYSSPNFSQEEI 162
            H+D    FI QL+GKKKWK                +  P + D  L  Y +  F  E +
Sbjct: 144 AHFDAYCNFIFQLKGKKKWKLAENFNTINPLQHYELIEAPHLPDP-LKSYWNGEFPDENL 202

Query: 163 --GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
             GT ++   L+ G  L+LPRG  H  S+     +L+ T   + + AW+DL+
Sbjct: 203 SNGTELI---LDAGSFLFLPRGCWHSTSSSEETIALNFT---FGQPAWLDLI 248


>gi|297560490|ref|YP_003679464.1| cupin [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111]
 gi|296844938|gb|ADH66958.1| Cupin 4 family protein [Nocardiopsis dassonvillei subsp.
           dassonvillei DSM 43111]
          Length = 400

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 59/222 (26%), Positives = 97/222 (43%), Gaps = 30/222 (13%)

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P+ ++   REG S+   +      P+      L  L      AN Y     ++GF  H+D
Sbjct: 92  PEALYRELREGASLVLDSVDRMHPPVGAAADDLMRLVRERAQANLYLIWGGSRGFDTHWD 151

Query: 123 DIEAFILQLEGKKKWKVYLPR-----MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
           D +  I+Q+EG K W+V+ P      M ++    ++ P  +  E+   +    L PG ++
Sbjct: 152 DHDTVIVQVEGTKHWQVHGPGSRPYPMKNDVDHAHTPPRDADGELHL-VWEGVLRPGQVI 210

Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDLEFRRGLP 235
           ++PRG+ H   T T   S+H+T    + T   W D L + +          +  FRR LP
Sbjct: 211 HVPRGWWHTV-TGTGGVSMHLTFGFTRATGVEWADALVRRL--------FEEEVFRRDLP 261

Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLD 277
               R+A        D+   R   +  L   +A+L E +DLD
Sbjct: 262 ----RFAD------PDV---RRKHQRALAARMAELAEELDLD 290


>gi|171059323|ref|YP_001791672.1| cupin 4 family protein [Leptothrix cholodnii SP-6]
 gi|170776768|gb|ACB34907.1| Cupin 4 family protein [Leptothrix cholodnii SP-6]
          Length = 418

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 90/203 (44%), Gaps = 29/203 (14%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG-- 59
           HW+++PL + R+        ++  ++  +L D+ +E +    ++   +G RQT   +   
Sbjct: 57  HWQRKPLLV-RQAVPGIEPPVSRAQMFAMLEDDAVESRF---LSRQGEGDRQTWQFKRGP 112

Query: 60  ---RALPQIV---WDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPP 112
              R+LP I    W    +G ++        +   + +  A L +L   +          
Sbjct: 113 MPRRSLPAIKQPGWTVLVQGLNLHVPAAADLLNRFRFVPQARLDDLMISWASEGG----- 167

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKV-YLPRM-VDEYLPRYSSPNFSQEEIGTPILTVT 170
              G  PH+D  + F++Q+ G+++W++  LP   + E LP     NF  EE         
Sbjct: 168 ---GVGPHFDSYDVFLIQVAGRRRWRIGRLPDARLREGLPVKIIENFRHEE------EWV 218

Query: 171 LEPGDLLYLPRGYIHQASTVTNE 193
           LEPGD+LYLP G+ H    V  E
Sbjct: 219 LEPGDMLYLPPGWAHDGDAVDGE 241


>gi|148977568|ref|ZP_01814147.1| hypothetical protein VSWAT3_23164 [Vibrionales bacterium SWAT-3]
 gi|145963219|gb|EDK28486.1| hypothetical protein VSWAT3_23164 [Vibrionales bacterium SWAT-3]
          Length = 377

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 60/124 (48%), Gaps = 17/124 (13%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
           +H+ +G +          K  QQL N    +L  C      Y  P    G  PH D  + 
Sbjct: 88  NHWHQGAN-------ELTKAFQQLPNWLFDDLMIC------YSAPEG--GVGPHIDQYDV 132

Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
           FI+Q +GK++WKV   + V +Y     +    Q E   PI+  TLEPGD+LY+P G+ H+
Sbjct: 133 FIIQGQGKRQWKVG-AKDVGQYKETVQASALRQIEGFEPIIDETLEPGDILYIPPGFPHE 191

Query: 187 ASTV 190
             T+
Sbjct: 192 GDTL 195


>gi|170726550|ref|YP_001760576.1| cupin 4 family protein [Shewanella woodyi ATCC 51908]
 gi|169811897|gb|ACA86481.1| Cupin 4 family protein [Shewanella woodyi ATCC 51908]
          Length = 381

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/210 (26%), Positives = 89/210 (42%), Gaps = 36/210 (17%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K+PL I RK  + + +L++ +++  +  D ++E ++       E G+ Q       
Sbjct: 17  EYWQKKPLVI-RKGFSNFRDLLSPEEMAGLACDEMVESRR----IYKEQGEWQA------ 65

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVG---------ANTY 108
                    +    S   L    +   +Q LN  L   ++L  CF             +Y
Sbjct: 66  --------EFGPFDSYEHLGESGWTLVVQALNNWLPGAEQLIQCFDFIPRWRFDDVMVSY 117

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
            TP    G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+ 
Sbjct: 118 ATP--GGGVGPHIDLYDVFICQGSGRRRWRV---GDRGEHREFAAHPALLHTEAFEPIID 172

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           V L PGD+LYLP GY H   T+    S  V
Sbjct: 173 VELLPGDILYLPPGYPHDGVTLEESMSFSV 202


>gi|432946421|ref|ZP_20142050.1| YcfD protein [Escherichia coli KTE196]
 gi|433042626|ref|ZP_20230145.1| YcfD protein [Escherichia coli KTE117]
 gi|431461796|gb|ELH42063.1| YcfD protein [Escherichia coli KTE196]
 gi|431559026|gb|ELI32604.1| YcfD protein [Escherichia coli KTE117]
          Length = 373

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|37524851|ref|NP_928195.1| hypothetical protein plu0856 [Photorhabdus luminescens subsp.
           laumondii TTO1]
 gi|36784276|emb|CAE13151.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii
           TTO1]
          Length = 388

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 68/149 (45%), Gaps = 14/149 (9%)

Query: 88  LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDE 147
           + QL  +          ++ Y +      F PH+D  + F +QL GKK+W +Y P   D 
Sbjct: 107 IDQLAKACSSFTDSHPFSSLYLSYGEKSSFKPHWDSRDIFAIQLSGKKRWIIYKPSFPDP 166

Query: 148 -YLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
            YL +      +      P     LE GD+LYLPRG+ H    +  E ++H+++ ++   
Sbjct: 167 VYLHQSKDMENTYPCPSEPYDDFVLETGDVLYLPRGWWHNPLPL-GEETIHLSVGIFPPY 225

Query: 207 A-----W-------IDLLEKAMPKALQAA 223
           A     W       ID+  K++P++ + A
Sbjct: 226 AHEYINWLSYKITDIDIGRKSLPRSWKQA 254


>gi|359439985|ref|ZP_09229913.1| hypothetical protein P20429_0260 [Pseudoalteromonas sp. BSi20429]
 gi|358038186|dbj|GAA66162.1| hypothetical protein P20429_0260 [Pseudoalteromonas sp. BSi20429]
          Length = 385

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 58/198 (29%), Positives = 92/198 (46%), Gaps = 27/198 (13%)

Query: 3   WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQTM 55
           W+K+PL I +   N+ +    N +    +++ +   +I   KN   T    +ED ++ T 
Sbjct: 19  WQKKPLLIKQGFANFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT- 77

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
           D     L Q V DH+    + + L P  +I      N  + +L        +Y TP    
Sbjct: 78  DDHSTLLVQAV-DHWHSDAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--NG 122

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
           G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    ++   LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEPFEAVIDCILEPG 179

Query: 175 DLLYLPRGYIHQASTVTN 192
           D+LY+P G  H+   V N
Sbjct: 180 DILYIPPGCPHEGYAVEN 197


>gi|86146439|ref|ZP_01064762.1| hypothetical protein MED222_12528 [Vibrio sp. MED222]
 gi|85835702|gb|EAQ53837.1| hypothetical protein MED222_12528 [Vibrio sp. MED222]
          Length = 377

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 1/75 (1%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +GK++WKV   + V +Y     +    Q E   PI+  TLEPGD
Sbjct: 122 GVGPHIDQYDVFIIQGQGKRQWKVGT-KDVGQYKETVQASALRQIEGFEPIIDETLEPGD 180

Query: 176 LLYLPRGYIHQASTV 190
           +LY+P G+ H+ +T+
Sbjct: 181 ILYIPPGFPHEGNTL 195


>gi|260779119|ref|ZP_05888011.1| hypothetical protein VIC_004528 [Vibrio coralliilyticus ATCC
           BAA-450]
 gi|260605283|gb|EEX31578.1| hypothetical protein VIC_004528 [Vibrio coralliilyticus ATCC
           BAA-450]
          Length = 378

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/205 (26%), Positives = 88/205 (42%), Gaps = 40/205 (19%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
           +W K+P  I    K  +N+ I     D++     +  ++ +D        R   +VEG+ 
Sbjct: 16  YWHKKPTVI----KGGFNDFIDPISPDELAG---LAMEEEVD-------SRYISNVEGKW 61

Query: 61  -------------ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGAN 106
                         LP+  W    + C+   L     I+P +QL N    +L  CF    
Sbjct: 62  SAQHGPFDDSLFENLPESHWQLVVQACNHWHLGAAELIEPFKQLPNWLFDDLMVCF---- 117

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTP 165
              + P   G  PH D  + FI+Q  GK++W+V  L +   +Y     +    Q E    
Sbjct: 118 ---SAPEG-GVGPHIDQYDVFIIQGSGKRRWRVGDLDK--GQYKESIQADALRQIEGFDA 171

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTV 190
           I+   LEPGD+LY+P G+ H+ +T+
Sbjct: 172 IIDEVLEPGDILYIPPGFPHEGNTL 196


>gi|84393758|ref|ZP_00992506.1| hypothetical protein V12B01_06771 [Vibrio splendidus 12B01]
 gi|84375620|gb|EAP92519.1| hypothetical protein V12B01_06771 [Vibrio splendidus 12B01]
          Length = 377

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 1/75 (1%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +GK++WKV   + V +Y     +    Q E   PI+  TLEPGD
Sbjct: 122 GVGPHIDQYDVFIIQGQGKRQWKVG-AKDVGQYKETVQASALRQIEGFEPIIDETLEPGD 180

Query: 176 LLYLPRGYIHQASTV 190
           +LY+P G+ H+ +T+
Sbjct: 181 ILYIPPGFPHEGNTL 195


>gi|366158813|ref|ZP_09458675.1| hypothetical protein ETW09_07695 [Escherichia sp. TW09308]
 gi|432371849|ref|ZP_19614899.1| YcfD protein [Escherichia coli KTE11]
 gi|430898178|gb|ELC20313.1| YcfD protein [Escherichia coli KTE11]
          Length = 373

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMETEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|307257834|ref|ZP_07539591.1| hypothetical protein appser10_18190 [Actinobacillus
           pleuropneumoniae serovar 10 str. D13039]
 gi|306863740|gb|EFM95666.1| hypothetical protein appser10_18190 [Actinobacillus
           pleuropneumoniae serovar 10 str. D13039]
          Length = 420

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 18/117 (15%)

Query: 100 GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ 159
           GC   ++ Y    T   F  H+D  + F LQ++G+K+W +        + P +++P F  
Sbjct: 153 GCHTFSSLYIAFNTQSSFKSHWDSRDIFALQMQGRKRWII--------HAPTFNNPLFMH 204

Query: 160 EEIGTP---------ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +    P          + + LE GD+LYLPRG+ H    V  E ++H+ I ++   A
Sbjct: 205 KSKDMPEYNPNLDDVYMDIILEAGDILYLPRGWWHDPIPV-GEETVHLAIGIFPAYA 260


>gi|407071039|ref|ZP_11101877.1| hypothetical protein VcycZ_15908 [Vibrio cyclitrophicus ZF14]
          Length = 377

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 1/75 (1%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +GK++WKV   + V +Y     +    Q E   PI+  TLEPGD
Sbjct: 122 GVGPHIDQYDVFIIQGQGKRQWKVG-EKDVGQYKETVQASALRQIEGFEPIIDETLEPGD 180

Query: 176 LLYLPRGYIHQASTV 190
           +LY+P G+ H+ +T+
Sbjct: 181 ILYIPPGFPHEGNTL 195


>gi|403511542|ref|YP_006643180.1| cupin-like domain protein [Nocardiopsis alba ATCC BAA-2165]
 gi|402799400|gb|AFR06810.1| cupin-like domain protein [Nocardiopsis alba ATCC BAA-2165]
          Length = 400

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 86/194 (44%), Gaps = 13/194 (6%)

Query: 47  YEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN 106
           Y D    + D      P+ ++   R G S+   +      P+ +    L  L      AN
Sbjct: 76  YTDVGTASRDTRRIVRPESLYRELRAGASLVLDSVDRMHAPVGEAADDLMRLVRERAQAN 135

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR-----MVDEYLPRYSSPNFSQEE 161
            Y     ++GF  H+DD +  I+Q+EG K W+V+ P      M ++    ++ P  +   
Sbjct: 136 LYLIWGESRGFDTHWDDHDTMIVQVEGSKHWQVHGPGTRPFPMKNDTDHTHTPPRDADGG 195

Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
           +   +    L PG +L++PRG+ H   T T E S+H+T    + T  ID  E ++ + L 
Sbjct: 196 LHL-VWEGVLRPGQVLHVPRGWWHTV-TGTGEMSMHLTFGFTRATG-IDWAE-SLVRGLY 251

Query: 222 AAGATDLEFRRGLP 235
              A    FRR LP
Sbjct: 252 EEEA----FRRDLP 261


>gi|441504976|ref|ZP_20986968.1| hypothetical protein C942_02122 [Photobacterium sp. AK15]
 gi|441427558|gb|ELR65028.1| hypothetical protein C942_02122 [Photobacterium sp. AK15]
          Length = 385

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 64/139 (46%), Gaps = 18/139 (12%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
           +H+ EG +         + P +QL N  L +L        +Y TP    G  PH D  + 
Sbjct: 93  NHWHEGAA-------RLVTPFRQLPNWLLDDLM------ISYSTP--GGGVGPHIDQYDV 137

Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
           FI+Q  GK+ W+V  P+  D+Y  +   P   Q E    I+  TLEPGD+LY+P G+ H 
Sbjct: 138 FIIQGSGKRHWRV-GPK-SDDYEEKCRHPALRQIEDFESIINETLEPGDILYIPPGFPHD 195

Query: 187 ASTVTNEHSLHVTISVYQK 205
              +    S  V     +K
Sbjct: 196 GYAIETSMSFSVGFRSPKK 214


>gi|167521271|ref|XP_001744974.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776588|gb|EDQ90207.1| predicted protein [Monosiga brevicollis MX1]
          Length = 750

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 63/135 (46%), Gaps = 17/135 (12%)

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS- 153
           +Q+  G  +  + Y +PP+ +  +PH D  + F+ Q  G K W +  P+  D     ++ 
Sbjct: 146 IQDHLGVVISDHFYISPPSQRALSPHTDGGDVFVHQWAGSKLWTLCAPKAPDCTNCTFAD 205

Query: 154 --------------SPNFSQEEI-GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
                           +++ E++ G    T+ L+  +LLY+PRG +H A   T   S H 
Sbjct: 206 RALRAELAKSAFEGCTSYTDEQLAGMECRTLDLQENELLYIPRGVVHFARANTTGLSAHA 265

Query: 199 TISVYQKTA-WIDLL 212
           T  V   +A WIDL+
Sbjct: 266 TYQVIPPSATWIDLV 280


>gi|414069366|ref|ZP_11405360.1| hypothetical protein D172_0592 [Pseudoalteromonas sp. Bsw20308]
 gi|410808169|gb|EKS14141.1| hypothetical protein D172_0592 [Pseudoalteromonas sp. Bsw20308]
          Length = 381

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/199 (28%), Positives = 93/199 (46%), Gaps = 27/199 (13%)

Query: 2   HWEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQT 54
           +W+K+PL I +   ++ +    N +    +++ +   +I   KN   T    +ED ++ T
Sbjct: 18  YWQKKPLLIKQGFTDFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT 77

Query: 55  MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
            D     L Q V DH+    + + L P  +I      N  + +L        +Y TP   
Sbjct: 78  -DDHSTLLVQAV-DHWHSDAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--N 121

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEP 173
            G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    ++   LEP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEAVIDCILEP 178

Query: 174 GDLLYLPRGYIHQASTVTN 192
           GD+LY+P G  H+   V N
Sbjct: 179 GDILYIPPGCPHEGYAVEN 197


>gi|218709057|ref|YP_002416678.1| hypothetical protein VS_1063 [Vibrio splendidus LGP32]
 gi|218322076|emb|CAV18153.1| hypothetical protein VS_1063 [Vibrio splendidus LGP32]
          Length = 377

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 45/75 (60%), Gaps = 1/75 (1%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +GK++WKV   + + +Y     +    Q E   PI+  TLEPGD
Sbjct: 122 GVGPHIDQYDVFIIQGQGKRQWKVG-AKDIGQYKETVQASALRQIEGFDPIIDETLEPGD 180

Query: 176 LLYLPRGYIHQASTV 190
           +LY+P G+ H+ +T+
Sbjct: 181 ILYIPPGFPHEGNTL 195


>gi|359438576|ref|ZP_09228588.1| RmlC-like domain-containing protein [Pseudoalteromonas sp.
           BSi20311]
 gi|359444143|ref|ZP_09233947.1| hypothetical protein P20439_0258 [Pseudoalteromonas sp. BSi20439]
 gi|358026702|dbj|GAA64837.1| RmlC-like domain-containing protein [Pseudoalteromonas sp.
           BSi20311]
 gi|358042092|dbj|GAA70196.1| hypothetical protein P20439_0258 [Pseudoalteromonas sp. BSi20439]
          Length = 380

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           +Y TP    G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E  + 
Sbjct: 116 SYSTP--GGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKTLLQVEAFSA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           ++   LEPGD+LY+P G  H+   V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197


>gi|336311795|ref|ZP_08566754.1| hypothetical protein SOHN41_02237 [Shewanella sp. HN-41]
 gi|335864677|gb|EGM69755.1| hypothetical protein SOHN41_02237 [Shewanella sp. HN-41]
          Length = 269

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 11/88 (12%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP---RYSSPNFSQEEIG 163
           ++ TP  + G  PH D+ + FI+Q EG+++W V       +  P   R  +PN    E  
Sbjct: 3   SFATP--SGGVGPHVDNYDVFIIQGEGERRWTV------GDNTPQQRRGGNPNSPLVEDF 54

Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVT 191
           TPI+ V LE GD+LY+P G+ H   T+T
Sbjct: 55  TPIIDVVLEKGDMLYIPPGFPHCGETLT 82


>gi|315126740|ref|YP_004068743.1| RmlC-like domain-containing protein [Pseudoalteromonas sp. SM9913]
 gi|315015254|gb|ADT68592.1| RmlC-like domain-containing protein [Pseudoalteromonas sp. SM9913]
          Length = 380

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           +Y TP    G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E  + 
Sbjct: 116 SYSTP--GGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKTLLQVEAFSA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           ++   LEPGD+LY+P G  H+   V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197


>gi|269102218|ref|ZP_06154915.1| hypothetical protein VDA_001639 [Photobacterium damselae subsp.
           damselae CIP 102761]
 gi|268162116|gb|EEZ40612.1| hypothetical protein VDA_001639 [Photobacterium damselae subsp.
           damselae CIP 102761]
          Length = 377

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 82/203 (40%), Gaps = 24/203 (11%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDIT---LYEDGQR---QTM 55
           +W+K+P+ I    K++         +D I  D +       DI    +  DG     Q  
Sbjct: 15  YWQKQPVIIKGGFKDF---------VDPISADELAGLAMEEDIDSRYVARDGDEWDVQHG 65

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
            +    +P+  W    +  +         + P +QL   L +         +Y TP    
Sbjct: 66  PLSFDNMPENNWSFLVQSVNYWHEGAARLVTPFRQLPGWLLDDLMI-----SYSTP--GG 118

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI Q  GK++W+V   +  D+Y   +  P   Q +   PI+   +E GD
Sbjct: 119 GVGPHIDQYDVFITQGSGKRRWRVGPKK--DDYEEEFRHPALRQIKSFEPIIDEVIEAGD 176

Query: 176 LLYLPRGYIHQASTVTNEHSLHV 198
           +LY+P G+ H   T+    S  V
Sbjct: 177 ILYIPPGFPHDGDTIETAMSFSV 199


>gi|433197725|ref|ZP_20381643.1| YcfD protein [Escherichia coli KTE94]
 gi|431724403|gb|ELJ88328.1| YcfD protein [Escherichia coli KTE94]
          Length = 373

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHS---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|416896624|ref|ZP_11926471.1| cupin superfamily protein [Escherichia coli STEC_7v]
 gi|417112951|ref|ZP_11964871.1| cupin domain protein, PF06172 family [Escherichia coli 1.2741]
 gi|422800542|ref|ZP_16849040.1| cupin superfamily protein [Escherichia coli M863]
 gi|323967067|gb|EGB62493.1| cupin superfamily protein [Escherichia coli M863]
 gi|327253832|gb|EGE65461.1| cupin superfamily protein [Escherichia coli STEC_7v]
 gi|386142561|gb|EIG83699.1| cupin domain protein, PF06172 family [Escherichia coli 1.2741]
          Length = 373

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTASLMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|348030940|ref|YP_004873626.1| hypothetical protein GNIT_3539 [Glaciecola nitratireducens FR1064]
 gi|347948283|gb|AEP31633.1| hypothetical protein GNIT_3539 [Glaciecola nitratireducens FR1064]
          Length = 393

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 71/142 (50%), Gaps = 18/142 (12%)

Query: 65  IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP-HYDD 123
           ++  ++  G ++RF +      PL +   +L++  G  VG   + + PT    AP HYD+
Sbjct: 89  LIKQYHELGYTVRFPDVTELSPPLSEFTRALEKQIGNPVGVVVFWSAPT--NAAPVHYDE 146

Query: 124 IEAFILQLEGKKKWKVYLPRMVDEYLPRY-----SSPNFSQEEIGTPILTVTLEPGDLLY 178
           ++  ++QL G K+W  Y+      Y  ++      +P F+  +      TV + PGD +Y
Sbjct: 147 VDVIVIQLVGTKQW--YISNTPTTYPNKWKNTGEGAPPFNDYQ------TVDVAPGDFIY 198

Query: 179 LPRGYIHQASTVTNEHSLHVTI 200
           LPRG  H   + +   S+H++I
Sbjct: 199 LPRGTAHTVKSTSE--SIHLSI 218


>gi|90413199|ref|ZP_01221194.1| hypothetical protein P3TCK_16534 [Photobacterium profundum 3TCK]
 gi|90325751|gb|EAS42209.1| hypothetical protein P3TCK_16534 [Photobacterium profundum 3TCK]
          Length = 377

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 62/139 (44%), Gaps = 18/139 (12%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
           +H+ EG +         + P +QL N  L +L        +Y TP    G  PH D  + 
Sbjct: 85  NHWHEGAA-------RLVTPFRQLPNWLLDDLM------ISYSTP--GGGVGPHIDQYDV 129

Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
           FI Q  GK+ W+V   +  D+Y   +  P   Q +   PI+   +EPGD+LY+P G+ H 
Sbjct: 130 FITQGSGKRHWRVGPKK--DDYEEEFRHPELRQIKGFEPIIDEVMEPGDILYIPPGFPHD 187

Query: 187 ASTVTNEHSLHVTISVYQK 205
              +    S  V     +K
Sbjct: 188 GYAIETSMSFSVGFRSPKK 206


>gi|420336687|ref|ZP_14838263.1| cupin superfamily protein [Shigella flexneri K-315]
 gi|391262251|gb|EIQ21286.1| cupin superfamily protein [Shigella flexneri K-315]
          Length = 373

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 84/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +          + G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPSGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|432616050|ref|ZP_19852174.1| YcfD protein [Escherichia coli KTE75]
 gi|431155982|gb|ELE56723.1| YcfD protein [Escherichia coli KTE75]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETSWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|157370253|ref|YP_001478242.1| cupin 4 family protein [Serratia proteamaculans 568]
 gi|157322017|gb|ABV41114.1| Cupin 4 family protein [Serratia proteamaculans 568]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 92/202 (45%), Gaps = 34/202 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70

Query: 62  LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
              +    W       DH+ E  S         ++P +QL +  + +L   F        
Sbjct: 71  FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   
Sbjct: 118 P--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
           +EPGD+LY+P G+ H+   + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194


>gi|420352067|ref|ZP_14853226.1| cupin superfamily protein [Shigella boydii 4444-74]
 gi|391284911|gb|EIQ43499.1| cupin superfamily protein [Shigella boydii 4444-74]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLERG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|293409494|ref|ZP_06653070.1| conserved hypothetical protein [Escherichia coli B354]
 gi|291469962|gb|EFF12446.1| conserved hypothetical protein [Escherichia coli B354]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|432674129|ref|ZP_19909614.1| YcfD protein [Escherichia coli KTE142]
 gi|431216635|gb|ELF14232.1| YcfD protein [Escherichia coli KTE142]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|157155178|ref|YP_001462397.1| cupin family protein [Escherichia coli E24377A]
 gi|157160652|ref|YP_001457970.1| cupin family protein [Escherichia coli HS]
 gi|170020479|ref|YP_001725433.1| cupin 4 family protein [Escherichia coli ATCC 8739]
 gi|170680524|ref|YP_001744049.1| cupin family protein [Escherichia coli SMS-3-5]
 gi|187732292|ref|YP_001879936.1| cupin family protein [Shigella boydii CDC 3083-94]
 gi|188492699|ref|ZP_02999969.1| cupin family protein [Escherichia coli 53638]
 gi|191173069|ref|ZP_03034602.1| cupin family protein [Escherichia coli F11]
 gi|194433643|ref|ZP_03065919.1| cupin family protein [Shigella dysenteriae 1012]
 gi|194437001|ref|ZP_03069100.1| cupin family protein [Escherichia coli 101-1]
 gi|215486382|ref|YP_002328813.1| hypothetical protein E2348C_1269 [Escherichia coli O127:H6 str.
           E2348/69]
 gi|218558010|ref|YP_002390923.1| hypothetical protein ECS88_1143 [Escherichia coli S88]
 gi|218694662|ref|YP_002402329.1| hypothetical protein EC55989_1241 [Escherichia coli 55989]
 gi|218700366|ref|YP_002407995.1| hypothetical protein ECIAI39_2028 [Escherichia coli IAI39]
 gi|222155875|ref|YP_002556014.1| hypothetical protein LF82_2717 [Escherichia coli LF82]
 gi|251784657|ref|YP_002998961.1| hypothetical protein B21_01134 [Escherichia coli BL21(DE3)]
 gi|253773848|ref|YP_003036679.1| cupin [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
 gi|254161235|ref|YP_003044343.1| hypothetical protein ECB_01126 [Escherichia coli B str. REL606]
 gi|254288043|ref|YP_003053791.1| hypothetical protein ECD_01126 [Escherichia coli BL21(DE3)]
 gi|297520312|ref|ZP_06938698.1| putative cytoplasmic protein [Escherichia coli OP50]
 gi|306814022|ref|ZP_07448195.1| putative cytoplasmic protein [Escherichia coli NC101]
 gi|312966384|ref|ZP_07780606.1| cupin family protein [Escherichia coli 2362-75]
 gi|312971264|ref|ZP_07785442.1| cupin family protein [Escherichia coli 1827-70]
 gi|331657192|ref|ZP_08358154.1| YcfD protein [Escherichia coli TA206]
 gi|331662539|ref|ZP_08363462.1| YcfD protein [Escherichia coli TA143]
 gi|331667529|ref|ZP_08368393.1| YcfD protein [Escherichia coli TA271]
 gi|331682634|ref|ZP_08383253.1| YcfD protein [Escherichia coli H299]
 gi|383177783|ref|YP_005455788.1| hypothetical protein SSON53_06140 [Shigella sonnei 53G]
 gi|386598936|ref|YP_006100442.1| cupin family protein [Escherichia coli IHE3034]
 gi|386604916|ref|YP_006111216.1| putative cytoplasmic protein [Escherichia coli UM146]
 gi|386613561|ref|YP_006133227.1| hypothetical protein UMNK88_1459 [Escherichia coli UMNK88]
 gi|386618704|ref|YP_006138284.1| hypothetical protein ECNA114_1186 [Escherichia coli NA114]
 gi|386623585|ref|YP_006143313.1| hypothetical protein CE10_1209 [Escherichia coli O7:K1 str. CE10]
 gi|387616429|ref|YP_006119451.1| putative cytoplasmic protein [Escherichia coli O83:H1 str. NRG
           857C]
 gi|387829081|ref|YP_003349018.1| hypothetical protein ECSF_1028 [Escherichia coli SE15]
 gi|407468819|ref|YP_006784739.1| hypothetical protein O3O_10700 [Escherichia coli O104:H4 str.
           2009EL-2071]
 gi|407482514|ref|YP_006779663.1| hypothetical protein O3K_14920 [Escherichia coli O104:H4 str.
           2011C-3493]
 gi|410483066|ref|YP_006770612.1| hypothetical protein O3M_14895 [Escherichia coli O104:H4 str.
           2009EL-2050]
 gi|414575410|ref|ZP_11432615.1| cupin superfamily protein [Shigella sonnei 3233-85]
 gi|415812639|ref|ZP_11504776.1| cupin superfamily protein [Escherichia coli LT-68]
 gi|415841245|ref|ZP_11522350.1| cupin superfamily protein [Escherichia coli RN587/1]
 gi|415850665|ref|ZP_11527540.1| cupin superfamily protein [Shigella sonnei 53G]
 gi|415854379|ref|ZP_11530134.1| cupin family protein [Shigella flexneri 2a str. 2457T]
 gi|416285688|ref|ZP_11647837.1| hypothetical protein SGB_03455 [Shigella boydii ATCC 9905]
 gi|416301322|ref|ZP_11652924.1| hypothetical protein SGF_03422 [Shigella flexneri CDC 796-83]
 gi|416342633|ref|ZP_11676797.1| hypothetical protein ECoL_01732 [Escherichia coli EC4100B]
 gi|417083666|ref|ZP_11951711.1| putative cytoplasmic protein [Escherichia coli cloneA_i1]
 gi|417125924|ref|ZP_11973784.1| cupin domain protein, PF06172 family [Escherichia coli 97.0246]
 gi|417150102|ref|ZP_11989992.1| cupin domain protein, PF06172 family [Escherichia coli 1.2264]
 gi|417158877|ref|ZP_11996235.1| cupin domain protein, PF06172 family [Escherichia coli 99.0741]
 gi|417233323|ref|ZP_12034087.1| cupin domain protein, PF06172 family [Escherichia coli 5.0959]
 gi|417235817|ref|ZP_12034636.1| cupin domain protein, PF06172 family [Escherichia coli 9.0111]
 gi|417266580|ref|ZP_12053948.1| cupin domain protein, PF06172 family [Escherichia coli 3.3884]
 gi|417283444|ref|ZP_12070741.1| cupin domain protein, PF06172 family [Escherichia coli 3003]
 gi|417289202|ref|ZP_12076487.1| cupin domain protein, PF06172 family [Escherichia coli TW07793]
 gi|417607492|ref|ZP_12258004.1| cupin superfamily protein [Escherichia coli STEC_DG131-3]
 gi|417638472|ref|ZP_12288637.1| cupin superfamily protein [Escherichia coli TX1999]
 gi|417661726|ref|ZP_12311307.1| hypothetical protein ECAA86_01265 [Escherichia coli AA86]
 gi|417672413|ref|ZP_12321880.1| cupin superfamily protein [Shigella dysenteriae 155-74]
 gi|417682471|ref|ZP_12331827.1| cupin superfamily protein [Shigella boydii 3594-74]
 gi|417701512|ref|ZP_12350637.1| cupin superfamily protein [Shigella flexneri K-218]
 gi|417706847|ref|ZP_12355896.1| cupin superfamily protein [Shigella flexneri VA-6]
 gi|417722470|ref|ZP_12371295.1| cupin superfamily protein [Shigella flexneri K-304]
 gi|417727771|ref|ZP_12376498.1| cupin superfamily protein [Shigella flexneri K-671]
 gi|417733088|ref|ZP_12381751.1| cupin superfamily protein [Shigella flexneri 2747-71]
 gi|417738115|ref|ZP_12386709.1| cupin superfamily protein [Shigella flexneri 4343-70]
 gi|417742768|ref|ZP_12391310.1| cupin superfamily protein [Shigella flexneri 2930-71]
 gi|417755139|ref|ZP_12403228.1| cupin superfamily protein [Escherichia coli DEC2B]
 gi|417804584|ref|ZP_12451587.1| putative cytoplasmic protein [Escherichia coli O104:H4 str.
           LB226692]
 gi|417827259|ref|ZP_12473828.1| cupin superfamily protein [Shigella flexneri J1713]
 gi|417832326|ref|ZP_12478815.1| putative cytoplasmic protein [Escherichia coli O104:H4 str.
           01-09591]
 gi|417864944|ref|ZP_12509989.1| hypothetical protein C22711_1876 [Escherichia coli O104:H4 str.
           C227-11]
 gi|418255096|ref|ZP_12879548.1| cupin superfamily protein [Shigella flexneri 6603-63]
 gi|418264138|ref|ZP_12884695.1| cupin superfamily protein [Shigella sonnei str. Moseley]
 gi|418943215|ref|ZP_13496427.1| hypothetical protein T22_10377 [Escherichia coli O157:H43 str. T22]
 gi|418996302|ref|ZP_13543907.1| cupin superfamily protein [Escherichia coli DEC1A]
 gi|419001489|ref|ZP_13549035.1| cupin superfamily protein [Escherichia coli DEC1B]
 gi|419006971|ref|ZP_13554422.1| cupin superfamily protein [Escherichia coli DEC1C]
 gi|419012906|ref|ZP_13560266.1| cupin superfamily protein [Escherichia coli DEC1D]
 gi|419017783|ref|ZP_13565100.1| cupin superfamily protein [Escherichia coli DEC1E]
 gi|419023421|ref|ZP_13570658.1| cupin superfamily protein [Escherichia coli DEC2A]
 gi|419028225|ref|ZP_13575413.1| cupin superfamily protein [Escherichia coli DEC2C]
 gi|419034954|ref|ZP_13582044.1| cupin superfamily protein [Escherichia coli DEC2D]
 gi|419039028|ref|ZP_13586078.1| cupin superfamily protein [Escherichia coli DEC2E]
 gi|419169127|ref|ZP_13713521.1| cupin superfamily protein [Escherichia coli DEC7A]
 gi|419174435|ref|ZP_13718286.1| cupin superfamily protein [Escherichia coli DEC7B]
 gi|419180157|ref|ZP_13723778.1| cupin superfamily protein [Escherichia coli DEC7C]
 gi|419185669|ref|ZP_13729191.1| cupin superfamily protein [Escherichia coli DEC7D]
 gi|419190937|ref|ZP_13734403.1| cupin superfamily protein [Escherichia coli DEC7E]
 gi|419344761|ref|ZP_13886143.1| cupin superfamily protein [Escherichia coli DEC13A]
 gi|419349198|ref|ZP_13890551.1| cupin superfamily protein [Escherichia coli DEC13B]
 gi|419354359|ref|ZP_13895633.1| cupin superfamily protein [Escherichia coli DEC13C]
 gi|419359586|ref|ZP_13900811.1| cupin superfamily protein [Escherichia coli DEC13D]
 gi|419364589|ref|ZP_13905761.1| cupin superfamily protein [Escherichia coli DEC13E]
 gi|419699974|ref|ZP_14227586.1| hypothetical protein OQA_05431 [Escherichia coli SCI-07]
 gi|419915151|ref|ZP_14433519.1| putative cytoplasmic protein [Escherichia coli KD1]
 gi|419925456|ref|ZP_14443294.1| putative cytoplasmic protein [Escherichia coli 541-15]
 gi|419944902|ref|ZP_14461365.1| putative cytoplasmic protein [Escherichia coli HM605]
 gi|420319727|ref|ZP_14821570.1| cupin superfamily protein [Shigella flexneri 2850-71]
 gi|420324846|ref|ZP_14826620.1| cupin superfamily protein [Shigella flexneri CCH060]
 gi|420330541|ref|ZP_14832224.1| cupin superfamily protein [Shigella flexneri K-1770]
 gi|420341112|ref|ZP_14842618.1| cupin superfamily protein [Shigella flexneri K-404]
 gi|420347398|ref|ZP_14848798.1| cupin superfamily protein [Shigella boydii 965-58]
 gi|420357926|ref|ZP_14858928.1| cupin superfamily protein [Shigella sonnei 3226-85]
 gi|420362774|ref|ZP_14863683.1| cupin superfamily protein [Shigella sonnei 4822-66]
 gi|420384996|ref|ZP_14884364.1| cupin superfamily protein [Escherichia coli EPECa12]
 gi|421681872|ref|ZP_16121692.1| cupin superfamily protein [Shigella flexneri 1485-80]
 gi|421776461|ref|ZP_16213065.1| cupin family protein [Escherichia coli AD30]
 gi|422747842|ref|ZP_16801755.1| cupin superfamily protein [Escherichia coli H252]
 gi|422753736|ref|ZP_16807563.1| cupin superfamily protein [Escherichia coli H263]
 gi|422774987|ref|ZP_16828643.1| cupin superfamily protein [Escherichia coli H120]
 gi|422785712|ref|ZP_16838451.1| cupin superfamily protein [Escherichia coli H489]
 gi|422790175|ref|ZP_16842880.1| cupin superfamily protein [Escherichia coli TA007]
 gi|422828616|ref|ZP_16876786.1| hypothetical protein ESNG_01291 [Escherichia coli B093]
 gi|422833344|ref|ZP_16881411.1| hypothetical protein ESOG_01012 [Escherichia coli E101]
 gi|422839587|ref|ZP_16887559.1| hypothetical protein ESPG_02245 [Escherichia coli H397]
 gi|422992009|ref|ZP_16982780.1| hypothetical protein EUAG_01602 [Escherichia coli O104:H4 str.
           C227-11]
 gi|422993959|ref|ZP_16984723.1| hypothetical protein EUBG_01610 [Escherichia coli O104:H4 str.
           C236-11]
 gi|422999138|ref|ZP_16989894.1| hypothetical protein EUEG_01566 [Escherichia coli O104:H4 str.
           09-7901]
 gi|423002737|ref|ZP_16993483.1| hypothetical protein EUDG_00221 [Escherichia coli O104:H4 str.
           04-8351]
 gi|423009273|ref|ZP_17000011.1| hypothetical protein EUFG_01610 [Escherichia coli O104:H4 str.
           11-3677]
 gi|423023465|ref|ZP_17014168.1| hypothetical protein EUHG_01618 [Escherichia coli O104:H4 str.
           11-4404]
 gi|423028613|ref|ZP_17019306.1| hypothetical protein EUIG_01617 [Escherichia coli O104:H4 str.
           11-4522]
 gi|423029479|ref|ZP_17020167.1| hypothetical protein EUJG_00238 [Escherichia coli O104:H4 str.
           11-4623]
 gi|423037318|ref|ZP_17027992.1| hypothetical protein EUKG_01595 [Escherichia coli O104:H4 str.
           11-4632 C1]
 gi|423042433|ref|ZP_17033100.1| hypothetical protein EULG_01608 [Escherichia coli O104:H4 str.
           11-4632 C2]
 gi|423049124|ref|ZP_17039781.1| hypothetical protein EUMG_01612 [Escherichia coli O104:H4 str.
           11-4632 C3]
 gi|423052705|ref|ZP_17041513.1| hypothetical protein EUNG_01111 [Escherichia coli O104:H4 str.
           11-4632 C4]
 gi|423059673|ref|ZP_17048469.1| hypothetical protein EUOG_01613 [Escherichia coli O104:H4 str.
           11-4632 C5]
 gi|423709135|ref|ZP_17683513.1| hypothetical protein ESTG_03603 [Escherichia coli B799]
 gi|425277265|ref|ZP_18668566.1| hypothetical protein ECARS42123_1409 [Escherichia coli ARS4.2123]
 gi|425287956|ref|ZP_18678845.1| hypothetical protein EC3006_1448 [Escherichia coli 3006]
 gi|425299606|ref|ZP_18689602.1| hypothetical protein EC07798_1505 [Escherichia coli 07798]
 gi|425421806|ref|ZP_18802999.1| hypothetical protein EC01288_1172 [Escherichia coli 0.1288]
 gi|429723522|ref|ZP_19258404.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9450]
 gi|429723866|ref|ZP_19258741.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9990]
 gi|429773601|ref|ZP_19305614.1| YcfD protein [Escherichia coli O104:H4 str. 11-02030]
 gi|429776591|ref|ZP_19308571.1| YcfD protein [Escherichia coli O104:H4 str. 11-02033-1]
 gi|429781815|ref|ZP_19313742.1| YcfD protein [Escherichia coli O104:H4 str. 11-02092]
 gi|429787009|ref|ZP_19318900.1| YcfD protein [Escherichia coli O104:H4 str. 11-02093]
 gi|429792188|ref|ZP_19324040.1| YcfD protein [Escherichia coli O104:H4 str. 11-02281]
 gi|429797401|ref|ZP_19329206.1| YcfD protein [Escherichia coli O104:H4 str. 11-02318]
 gi|429802606|ref|ZP_19334367.1| YcfD protein [Escherichia coli O104:H4 str. 11-02913]
 gi|429808977|ref|ZP_19340689.1| YcfD protein [Escherichia coli O104:H4 str. 11-03439]
 gi|429813029|ref|ZP_19344709.1| YcfD protein [Escherichia coli O104:H4 str. 11-04080]
 gi|429818235|ref|ZP_19349871.1| YcfD protein [Escherichia coli O104:H4 str. 11-03943]
 gi|429912063|ref|ZP_19378019.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9941]
 gi|429917905|ref|ZP_19383845.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4984]
 gi|429922943|ref|ZP_19388864.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-5604]
 gi|429923794|ref|ZP_19389710.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4986]
 gi|429932687|ref|ZP_19398581.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4987]
 gi|429934290|ref|ZP_19400180.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4988]
 gi|429939952|ref|ZP_19405826.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-5603]
 gi|429947591|ref|ZP_19413446.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-6006]
 gi|429950224|ref|ZP_19416072.1| YcfD protein [Escherichia coli O104:H4 str. Ec12-0465]
 gi|429958497|ref|ZP_19424326.1| YcfD protein [Escherichia coli O104:H4 str. Ec12-0466]
 gi|432357441|ref|ZP_19600684.1| YcfD protein [Escherichia coli KTE4]
 gi|432361865|ref|ZP_19605048.1| YcfD protein [Escherichia coli KTE5]
 gi|432369180|ref|ZP_19612280.1| YcfD protein [Escherichia coli KTE10]
 gi|432376287|ref|ZP_19619293.1| YcfD protein [Escherichia coli KTE12]
 gi|432380836|ref|ZP_19623785.1| YcfD protein [Escherichia coli KTE15]
 gi|432386601|ref|ZP_19629496.1| YcfD protein [Escherichia coli KTE16]
 gi|432396982|ref|ZP_19639767.1| YcfD protein [Escherichia coli KTE25]
 gi|432405914|ref|ZP_19648634.1| YcfD protein [Escherichia coli KTE28]
 gi|432411216|ref|ZP_19653894.1| YcfD protein [Escherichia coli KTE39]
 gi|432421219|ref|ZP_19663774.1| YcfD protein [Escherichia coli KTE178]
 gi|432431260|ref|ZP_19673701.1| YcfD protein [Escherichia coli KTE187]
 gi|432435792|ref|ZP_19678186.1| YcfD protein [Escherichia coli KTE188]
 gi|432440598|ref|ZP_19682946.1| YcfD protein [Escherichia coli KTE189]
 gi|432445707|ref|ZP_19688010.1| YcfD protein [Escherichia coli KTE191]
 gi|432449396|ref|ZP_19691677.1| YcfD protein [Escherichia coli KTE193]
 gi|432456099|ref|ZP_19698293.1| YcfD protein [Escherichia coli KTE201]
 gi|432465131|ref|ZP_19707231.1| YcfD protein [Escherichia coli KTE205]
 gi|432470467|ref|ZP_19712519.1| YcfD protein [Escherichia coli KTE206]
 gi|432484814|ref|ZP_19726733.1| YcfD protein [Escherichia coli KTE212]
 gi|432488828|ref|ZP_19730712.1| YcfD protein [Escherichia coli KTE213]
 gi|432495015|ref|ZP_19736830.1| YcfD protein [Escherichia coli KTE214]
 gi|432499433|ref|ZP_19741203.1| YcfD protein [Escherichia coli KTE216]
 gi|432503853|ref|ZP_19745586.1| YcfD protein [Escherichia coli KTE220]
 gi|432513404|ref|ZP_19750637.1| YcfD protein [Escherichia coli KTE224]
 gi|432523267|ref|ZP_19760402.1| YcfD protein [Escherichia coli KTE230]
 gi|432530459|ref|ZP_19767496.1| YcfD protein [Escherichia coli KTE233]
 gi|432533351|ref|ZP_19770341.1| YcfD protein [Escherichia coli KTE234]
 gi|432542556|ref|ZP_19779411.1| YcfD protein [Escherichia coli KTE236]
 gi|432548031|ref|ZP_19784815.1| YcfD protein [Escherichia coli KTE237]
 gi|432553165|ref|ZP_19789894.1| YcfD protein [Escherichia coli KTE47]
 gi|432558184|ref|ZP_19794872.1| YcfD protein [Escherichia coli KTE49]
 gi|432568116|ref|ZP_19804637.1| YcfD protein [Escherichia coli KTE53]
 gi|432573147|ref|ZP_19809637.1| YcfD protein [Escherichia coli KTE55]
 gi|432583167|ref|ZP_19819575.1| YcfD protein [Escherichia coli KTE57]
 gi|432587375|ref|ZP_19823741.1| YcfD protein [Escherichia coli KTE58]
 gi|432592265|ref|ZP_19828592.1| YcfD protein [Escherichia coli KTE60]
 gi|432596987|ref|ZP_19833268.1| YcfD protein [Escherichia coli KTE62]
 gi|432601653|ref|ZP_19837900.1| YcfD protein [Escherichia coli KTE66]
 gi|432606976|ref|ZP_19843167.1| YcfD protein [Escherichia coli KTE67]
 gi|432610892|ref|ZP_19847059.1| YcfD protein [Escherichia coli KTE72]
 gi|432621321|ref|ZP_19857361.1| YcfD protein [Escherichia coli KTE76]
 gi|432645649|ref|ZP_19881447.1| YcfD protein [Escherichia coli KTE86]
 gi|432650564|ref|ZP_19886323.1| YcfD protein [Escherichia coli KTE87]
 gi|432654749|ref|ZP_19890465.1| YcfD protein [Escherichia coli KTE93]
 gi|432670065|ref|ZP_19905605.1| YcfD protein [Escherichia coli KTE119]
 gi|432679601|ref|ZP_19914995.1| YcfD protein [Escherichia coli KTE143]
 gi|432693917|ref|ZP_19929126.1| YcfD protein [Escherichia coli KTE162]
 gi|432698518|ref|ZP_19933683.1| YcfD protein [Escherichia coli KTE169]
 gi|432710083|ref|ZP_19945147.1| YcfD protein [Escherichia coli KTE6]
 gi|432712757|ref|ZP_19947806.1| YcfD protein [Escherichia coli KTE8]
 gi|432718161|ref|ZP_19953145.1| YcfD protein [Escherichia coli KTE9]
 gi|432722609|ref|ZP_19957532.1| YcfD protein [Escherichia coli KTE17]
 gi|432727197|ref|ZP_19962078.1| YcfD protein [Escherichia coli KTE18]
 gi|432731881|ref|ZP_19966716.1| YcfD protein [Escherichia coli KTE45]
 gi|432740883|ref|ZP_19975604.1| YcfD protein [Escherichia coli KTE23]
 gi|432745139|ref|ZP_19979834.1| YcfD protein [Escherichia coli KTE43]
 gi|432753870|ref|ZP_19988426.1| YcfD protein [Escherichia coli KTE22]
 gi|432758959|ref|ZP_19993458.1| YcfD protein [Escherichia coli KTE46]
 gi|432764493|ref|ZP_19998938.1| YcfD protein [Escherichia coli KTE48]
 gi|432778001|ref|ZP_20012250.1| YcfD protein [Escherichia coli KTE59]
 gi|432783049|ref|ZP_20017233.1| YcfD protein [Escherichia coli KTE63]
 gi|432786789|ref|ZP_20020953.1| YcfD protein [Escherichia coli KTE65]
 gi|432792395|ref|ZP_20026483.1| YcfD protein [Escherichia coli KTE78]
 gi|432798356|ref|ZP_20032380.1| YcfD protein [Escherichia coli KTE79]
 gi|432801432|ref|ZP_20035414.1| YcfD protein [Escherichia coli KTE84]
 gi|432805248|ref|ZP_20039189.1| YcfD protein [Escherichia coli KTE91]
 gi|432813303|ref|ZP_20047127.1| YcfD protein [Escherichia coli KTE101]
 gi|432814763|ref|ZP_20048553.1| YcfD protein [Escherichia coli KTE115]
 gi|432820375|ref|ZP_20054078.1| YcfD protein [Escherichia coli KTE118]
 gi|432826596|ref|ZP_20060250.1| YcfD protein [Escherichia coli KTE123]
 gi|432831099|ref|ZP_20064681.1| YcfD protein [Escherichia coli KTE135]
 gi|432834200|ref|ZP_20067742.1| YcfD protein [Escherichia coli KTE136]
 gi|432838845|ref|ZP_20072334.1| YcfD protein [Escherichia coli KTE140]
 gi|432843664|ref|ZP_20076847.1| YcfD protein [Escherichia coli KTE141]
 gi|432849578|ref|ZP_20080800.1| YcfD protein [Escherichia coli KTE144]
 gi|432860950|ref|ZP_20086034.1| YcfD protein [Escherichia coli KTE146]
 gi|432880849|ref|ZP_20097384.1| YcfD protein [Escherichia coli KTE154]
 gi|432893835|ref|ZP_20105847.1| YcfD protein [Escherichia coli KTE165]
 gi|432898025|ref|ZP_20108856.1| YcfD protein [Escherichia coli KTE192]
 gi|432903996|ref|ZP_20113267.1| YcfD protein [Escherichia coli KTE194]
 gi|432921226|ref|ZP_20124608.1| YcfD protein [Escherichia coli KTE173]
 gi|432926155|ref|ZP_20127936.1| YcfD protein [Escherichia coli KTE175]
 gi|432933686|ref|ZP_20133354.1| YcfD protein [Escherichia coli KTE184]
 gi|432937035|ref|ZP_20135727.1| YcfD protein [Escherichia coli KTE183]
 gi|432971323|ref|ZP_20160196.1| YcfD protein [Escherichia coli KTE207]
 gi|432977758|ref|ZP_20166581.1| YcfD protein [Escherichia coli KTE209]
 gi|432980571|ref|ZP_20169349.1| YcfD protein [Escherichia coli KTE211]
 gi|432984857|ref|ZP_20173586.1| YcfD protein [Escherichia coli KTE215]
 gi|432990196|ref|ZP_20178862.1| YcfD protein [Escherichia coli KTE217]
 gi|432994830|ref|ZP_20183444.1| YcfD protein [Escherichia coli KTE218]
 gi|432999357|ref|ZP_20187893.1| YcfD protein [Escherichia coli KTE223]
 gi|433004572|ref|ZP_20193010.1| YcfD protein [Escherichia coli KTE227]
 gi|433011829|ref|ZP_20200232.1| YcfD protein [Escherichia coli KTE229]
 gi|433013329|ref|ZP_20201701.1| YcfD protein [Escherichia coli KTE104]
 gi|433022970|ref|ZP_20210980.1| YcfD protein [Escherichia coli KTE106]
 gi|433028125|ref|ZP_20215992.1| YcfD protein [Escherichia coli KTE109]
 gi|433032696|ref|ZP_20220464.1| YcfD protein [Escherichia coli KTE112]
 gi|433038160|ref|ZP_20225770.1| YcfD protein [Escherichia coli KTE113]
 gi|433057409|ref|ZP_20244488.1| YcfD protein [Escherichia coli KTE124]
 gi|433072157|ref|ZP_20258847.1| YcfD protein [Escherichia coli KTE129]
 gi|433077260|ref|ZP_20263820.1| YcfD protein [Escherichia coli KTE131]
 gi|433082043|ref|ZP_20268515.1| YcfD protein [Escherichia coli KTE133]
 gi|433086712|ref|ZP_20273104.1| YcfD protein [Escherichia coli KTE137]
 gi|433095996|ref|ZP_20282205.1| YcfD protein [Escherichia coli KTE139]
 gi|433100633|ref|ZP_20286738.1| YcfD protein [Escherichia coli KTE145]
 gi|433105223|ref|ZP_20291236.1| YcfD protein [Escherichia coli KTE148]
 gi|433110422|ref|ZP_20296293.1| YcfD protein [Escherichia coli KTE150]
 gi|433114987|ref|ZP_20300798.1| YcfD protein [Escherichia coli KTE153]
 gi|433119674|ref|ZP_20305375.1| YcfD protein [Escherichia coli KTE157]
 gi|433124659|ref|ZP_20310242.1| YcfD protein [Escherichia coli KTE160]
 gi|433129508|ref|ZP_20314970.1| YcfD protein [Escherichia coli KTE163]
 gi|433134306|ref|ZP_20319673.1| YcfD protein [Escherichia coli KTE166]
 gi|433138719|ref|ZP_20323999.1| YcfD protein [Escherichia coli KTE167]
 gi|433143742|ref|ZP_20328904.1| YcfD protein [Escherichia coli KTE168]
 gi|433148614|ref|ZP_20333663.1| YcfD protein [Escherichia coli KTE174]
 gi|433153201|ref|ZP_20338165.1| YcfD protein [Escherichia coli KTE176]
 gi|433162947|ref|ZP_20347704.1| YcfD protein [Escherichia coli KTE179]
 gi|433167974|ref|ZP_20352637.1| YcfD protein [Escherichia coli KTE180]
 gi|433172951|ref|ZP_20357501.1| YcfD protein [Escherichia coli KTE232]
 gi|433182644|ref|ZP_20366935.1| YcfD protein [Escherichia coli KTE85]
 gi|433187907|ref|ZP_20372022.1| YcfD protein [Escherichia coli KTE88]
 gi|433193157|ref|ZP_20377165.1| YcfD protein [Escherichia coli KTE90]
 gi|433202661|ref|ZP_20386456.1| YcfD protein [Escherichia coli KTE95]
 gi|433207227|ref|ZP_20390920.1| YcfD protein [Escherichia coli KTE97]
 gi|433211981|ref|ZP_20395590.1| YcfD protein [Escherichia coli KTE99]
 gi|433322743|ref|ZP_20400146.1| hypothetical protein B185_004920 [Escherichia coli J96]
 gi|442599916|ref|ZP_21017618.1| FIG002776: hypothetical protein [Escherichia coli O5:K4(L):H4 str.
           ATCC 23502]
 gi|442603849|ref|ZP_21018703.1| FIG002776: hypothetical protein [Escherichia coli Nissle 1917]
 gi|443617239|ref|YP_007381095.1| hypothetical protein APECO78_09600 [Escherichia coli APEC O78]
 gi|450187641|ref|ZP_21889937.1| hypothetical protein A364_06482 [Escherichia coli SEPT362]
 gi|157066332|gb|ABV05587.1| cupin family protein [Escherichia coli HS]
 gi|157077208|gb|ABV16916.1| cupin family protein [Escherichia coli E24377A]
 gi|169755407|gb|ACA78106.1| Cupin 4 family protein [Escherichia coli ATCC 8739]
 gi|170518242|gb|ACB16420.1| cupin family protein [Escherichia coli SMS-3-5]
 gi|187429284|gb|ACD08558.1| cupin family protein [Shigella boydii CDC 3083-94]
 gi|188487898|gb|EDU63001.1| cupin family protein [Escherichia coli 53638]
 gi|190906614|gb|EDV66220.1| cupin family protein [Escherichia coli F11]
 gi|194418072|gb|EDX34165.1| cupin family protein [Shigella dysenteriae 1012]
 gi|194423984|gb|EDX39972.1| cupin family protein [Escherichia coli 101-1]
 gi|215264454|emb|CAS08817.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
 gi|218351394|emb|CAU97100.1| conserved hypothetical protein [Escherichia coli 55989]
 gi|218364779|emb|CAR02469.1| conserved hypothetical protein [Escherichia coli S88]
 gi|218370352|emb|CAR18155.1| conserved hypothetical protein [Escherichia coli IAI39]
 gi|222032880|emb|CAP75620.1| Uncharacterized protein ycfD [Escherichia coli LF82]
 gi|242376930|emb|CAQ31651.1| conserved protein [Escherichia coli BL21(DE3)]
 gi|253324892|gb|ACT29494.1| Cupin 4 family protein [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
 gi|253973136|gb|ACT38807.1| hypothetical protein ECB_01126 [Escherichia coli B str. REL606]
 gi|253977350|gb|ACT43020.1| hypothetical protein ECD_01126 [Escherichia coli BL21(DE3)]
 gi|281178238|dbj|BAI54568.1| conserved hypothetical protein [Escherichia coli SE15]
 gi|294493437|gb|ADE92193.1| cupin family protein [Escherichia coli IHE3034]
 gi|305852659|gb|EFM53107.1| putative cytoplasmic protein [Escherichia coli NC101]
 gi|307627400|gb|ADN71704.1| putative cytoplasmic protein [Escherichia coli UM146]
 gi|310336466|gb|EFQ01652.1| cupin family protein [Escherichia coli 1827-70]
 gi|312288837|gb|EFR16735.1| cupin family protein [Escherichia coli 2362-75]
 gi|312945690|gb|ADR26517.1| putative cytoplasmic protein [Escherichia coli O83:H1 str. NRG
           857C]
 gi|313650418|gb|EFS14825.1| cupin family protein [Shigella flexneri 2a str. 2457T]
 gi|320179172|gb|EFW54130.1| hypothetical protein SGB_03455 [Shigella boydii ATCC 9905]
 gi|320184385|gb|EFW59196.1| hypothetical protein SGF_03422 [Shigella flexneri CDC 796-83]
 gi|320201024|gb|EFW75608.1| hypothetical protein ECoL_01732 [Escherichia coli EC4100B]
 gi|323165636|gb|EFZ51423.1| cupin superfamily protein [Shigella sonnei 53G]
 gi|323172495|gb|EFZ58132.1| cupin superfamily protein [Escherichia coli LT-68]
 gi|323187616|gb|EFZ72923.1| cupin superfamily protein [Escherichia coli RN587/1]
 gi|323947568|gb|EGB43572.1| cupin superfamily protein [Escherichia coli H120]
 gi|323953185|gb|EGB49051.1| cupin superfamily protein [Escherichia coli H252]
 gi|323957954|gb|EGB53666.1| cupin superfamily protein [Escherichia coli H263]
 gi|323962727|gb|EGB58305.1| cupin superfamily protein [Escherichia coli H489]
 gi|323973282|gb|EGB68471.1| cupin superfamily protein [Escherichia coli TA007]
 gi|330910944|gb|EGH39454.1| hypothetical protein ECAA86_01265 [Escherichia coli AA86]
 gi|331055440|gb|EGI27449.1| YcfD protein [Escherichia coli TA206]
 gi|331060961|gb|EGI32925.1| YcfD protein [Escherichia coli TA143]
 gi|331065114|gb|EGI37009.1| YcfD protein [Escherichia coli TA271]
 gi|331080265|gb|EGI51444.1| YcfD protein [Escherichia coli H299]
 gi|332092825|gb|EGI97893.1| cupin superfamily protein [Shigella dysenteriae 155-74]
 gi|332094465|gb|EGI99514.1| cupin superfamily protein [Shigella boydii 3594-74]
 gi|332342730|gb|AEE56064.1| conserved hypothetical protein [Escherichia coli UMNK88]
 gi|332758289|gb|EGJ88612.1| cupin superfamily protein [Shigella flexneri 4343-70]
 gi|332759490|gb|EGJ89798.1| cupin superfamily protein [Shigella flexneri 2747-71]
 gi|332761067|gb|EGJ91354.1| cupin superfamily protein [Shigella flexneri K-671]
 gi|332767348|gb|EGJ97542.1| cupin superfamily protein [Shigella flexneri 2930-71]
 gi|333004939|gb|EGK24459.1| cupin superfamily protein [Shigella flexneri VA-6]
 gi|333005483|gb|EGK25001.1| cupin superfamily protein [Shigella flexneri K-218]
 gi|333019433|gb|EGK38716.1| cupin superfamily protein [Shigella flexneri K-304]
 gi|333969205|gb|AEG36010.1| Hypothetical protein ECNA114_1186 [Escherichia coli NA114]
 gi|335576073|gb|EGM62331.1| cupin superfamily protein [Shigella flexneri J1713]
 gi|340735101|gb|EGR64190.1| putative cytoplasmic protein [Escherichia coli O104:H4 str.
           01-09591]
 gi|340740826|gb|EGR75004.1| putative cytoplasmic protein [Escherichia coli O104:H4 str.
           LB226692]
 gi|341918233|gb|EGT67847.1| hypothetical protein C22711_1876 [Escherichia coli O104:H4 str.
           C227-11]
 gi|345361538|gb|EGW93697.1| cupin superfamily protein [Escherichia coli STEC_DG131-3]
 gi|345394969|gb|EGX24723.1| cupin superfamily protein [Escherichia coli TX1999]
 gi|349737323|gb|AEQ12029.1| conserved protein [Escherichia coli O7:K1 str. CE10]
 gi|354857242|gb|EHF17698.1| hypothetical protein EUAG_01602 [Escherichia coli O104:H4 str.
           C227-11]
 gi|354865034|gb|EHF25463.1| hypothetical protein EUBG_01610 [Escherichia coli O104:H4 str.
           C236-11]
 gi|354871813|gb|EHF32210.1| hypothetical protein EUDG_00221 [Escherichia coli O104:H4 str.
           04-8351]
 gi|354875315|gb|EHF35681.1| hypothetical protein EUEG_01566 [Escherichia coli O104:H4 str.
           09-7901]
 gi|354877563|gb|EHF37922.1| hypothetical protein EUHG_01618 [Escherichia coli O104:H4 str.
           11-4404]
 gi|354882387|gb|EHF42711.1| hypothetical protein EUIG_01617 [Escherichia coli O104:H4 str.
           11-4522]
 gi|354883095|gb|EHF43417.1| hypothetical protein EUFG_01610 [Escherichia coli O104:H4 str.
           11-3677]
 gi|354899396|gb|EHF59545.1| hypothetical protein EUKG_01595 [Escherichia coli O104:H4 str.
           11-4632 C1]
 gi|354901219|gb|EHF61347.1| hypothetical protein EUJG_00238 [Escherichia coli O104:H4 str.
           11-4623]
 gi|354902873|gb|EHF62986.1| hypothetical protein EULG_01608 [Escherichia coli O104:H4 str.
           11-4632 C2]
 gi|354905066|gb|EHF65151.1| hypothetical protein EUMG_01612 [Escherichia coli O104:H4 str.
           11-4632 C3]
 gi|354916293|gb|EHF76267.1| hypothetical protein EUOG_01613 [Escherichia coli O104:H4 str.
           11-4632 C5]
 gi|354921456|gb|EHF81381.1| hypothetical protein EUNG_01111 [Escherichia coli O104:H4 str.
           11-4632 C4]
 gi|355352609|gb|EHG01784.1| putative cytoplasmic protein [Escherichia coli cloneA_i1]
 gi|371608094|gb|EHN96657.1| hypothetical protein ESOG_01012 [Escherichia coli E101]
 gi|371609461|gb|EHN98000.1| hypothetical protein ESPG_02245 [Escherichia coli H397]
 gi|371612529|gb|EHO01038.1| hypothetical protein ESNG_01291 [Escherichia coli B093]
 gi|375321471|gb|EHS67304.1| hypothetical protein T22_10377 [Escherichia coli O157:H43 str. T22]
 gi|377846903|gb|EHU11908.1| cupin superfamily protein [Escherichia coli DEC1A]
 gi|377848763|gb|EHU13742.1| cupin superfamily protein [Escherichia coli DEC1C]
 gi|377851744|gb|EHU16680.1| cupin superfamily protein [Escherichia coli DEC1B]
 gi|377860538|gb|EHU25362.1| cupin superfamily protein [Escherichia coli DEC1D]
 gi|377864172|gb|EHU28969.1| cupin superfamily protein [Escherichia coli DEC1E]
 gi|377866546|gb|EHU31312.1| cupin superfamily protein [Escherichia coli DEC2A]
 gi|377877092|gb|EHU41689.1| cupin superfamily protein [Escherichia coli DEC2B]
 gi|377881271|gb|EHU45833.1| cupin superfamily protein [Escherichia coli DEC2D]
 gi|377883089|gb|EHU47618.1| cupin superfamily protein [Escherichia coli DEC2C]
 gi|377896504|gb|EHU60899.1| cupin superfamily protein [Escherichia coli DEC2E]
 gi|378018345|gb|EHV81212.1| cupin superfamily protein [Escherichia coli DEC7A]
 gi|378026078|gb|EHV88717.1| cupin superfamily protein [Escherichia coli DEC7C]
 gi|378033087|gb|EHV95668.1| cupin superfamily protein [Escherichia coli DEC7D]
 gi|378036289|gb|EHV98832.1| cupin superfamily protein [Escherichia coli DEC7B]
 gi|378041000|gb|EHW03463.1| cupin superfamily protein [Escherichia coli DEC7E]
 gi|378189189|gb|EHX49783.1| cupin superfamily protein [Escherichia coli DEC13A]
 gi|378204860|gb|EHX65276.1| cupin superfamily protein [Escherichia coli DEC13B]
 gi|378205832|gb|EHX66240.1| cupin superfamily protein [Escherichia coli DEC13C]
 gi|378207045|gb|EHX67447.1| cupin superfamily protein [Escherichia coli DEC13D]
 gi|378216410|gb|EHX76697.1| cupin superfamily protein [Escherichia coli DEC13E]
 gi|380349080|gb|EIA37356.1| hypothetical protein OQA_05431 [Escherichia coli SCI-07]
 gi|385706842|gb|EIG43880.1| hypothetical protein ESTG_03603 [Escherichia coli B799]
 gi|386145441|gb|EIG91899.1| cupin domain protein, PF06172 family [Escherichia coli 97.0246]
 gi|386160695|gb|EIH22501.1| cupin domain protein, PF06172 family [Escherichia coli 1.2264]
 gi|386175533|gb|EIH47522.1| cupin domain protein, PF06172 family [Escherichia coli 99.0741]
 gi|386203589|gb|EII08107.1| cupin domain protein, PF06172 family [Escherichia coli 5.0959]
 gi|386214770|gb|EII25178.1| cupin domain protein, PF06172 family [Escherichia coli 9.0111]
 gi|386231390|gb|EII58738.1| cupin domain protein, PF06172 family [Escherichia coli 3.3884]
 gi|386243387|gb|EII85120.1| cupin domain protein, PF06172 family [Escherichia coli 3003]
 gi|386247994|gb|EII94167.1| cupin domain protein, PF06172 family [Escherichia coli TW07793]
 gi|388384339|gb|EIL46072.1| putative cytoplasmic protein [Escherichia coli KD1]
 gi|388386333|gb|EIL47982.1| putative cytoplasmic protein [Escherichia coli 541-15]
 gi|388417281|gb|EIL77139.1| putative cytoplasmic protein [Escherichia coli HM605]
 gi|391252395|gb|EIQ11594.1| cupin superfamily protein [Shigella flexneri 2850-71]
 gi|391254959|gb|EIQ14114.1| cupin superfamily protein [Shigella flexneri CCH060]
 gi|391255804|gb|EIQ14944.1| cupin superfamily protein [Shigella flexneri K-1770]
 gi|391270959|gb|EIQ29841.1| cupin superfamily protein [Shigella flexneri K-404]
 gi|391271347|gb|EIQ30222.1| cupin superfamily protein [Shigella boydii 965-58]
 gi|391286267|gb|EIQ44814.1| cupin superfamily protein [Shigella sonnei 3226-85]
 gi|391287838|gb|EIQ46353.1| cupin superfamily protein [Shigella sonnei 3233-85]
 gi|391295587|gb|EIQ53731.1| cupin superfamily protein [Shigella sonnei 4822-66]
 gi|391307489|gb|EIQ65220.1| cupin superfamily protein [Escherichia coli EPECa12]
 gi|397899222|gb|EJL15597.1| cupin superfamily protein [Shigella flexneri 6603-63]
 gi|397902254|gb|EJL18579.1| cupin superfamily protein [Shigella sonnei str. Moseley]
 gi|404341081|gb|EJZ67493.1| cupin superfamily protein [Shigella flexneri 1485-80]
 gi|406778228|gb|AFS57652.1| hypothetical protein O3M_14895 [Escherichia coli O104:H4 str.
           2009EL-2050]
 gi|407054811|gb|AFS74862.1| hypothetical protein O3K_14920 [Escherichia coli O104:H4 str.
           2011C-3493]
 gi|407064854|gb|AFS85901.1| hypothetical protein O3O_10700 [Escherichia coli O104:H4 str.
           2009EL-2071]
 gi|408204953|gb|EKI29859.1| hypothetical protein ECARS42123_1409 [Escherichia coli ARS4.2123]
 gi|408216250|gb|EKI40581.1| hypothetical protein EC3006_1448 [Escherichia coli 3006]
 gi|408219578|gb|EKI43707.1| hypothetical protein EC07798_1505 [Escherichia coli 07798]
 gi|408346497|gb|EKJ60792.1| hypothetical protein EC01288_1172 [Escherichia coli 0.1288]
 gi|408458491|gb|EKJ82278.1| cupin family protein [Escherichia coli AD30]
 gi|429355398|gb|EKY92088.1| YcfD protein [Escherichia coli O104:H4 str. 11-02030]
 gi|429359310|gb|EKY95975.1| YcfD protein [Escherichia coli O104:H4 str. 11-02092]
 gi|429363211|gb|EKY99854.1| YcfD protein [Escherichia coli O104:H4 str. 11-02033-1]
 gi|429373558|gb|EKZ10102.1| YcfD protein [Escherichia coli O104:H4 str. 11-02093]
 gi|429377299|gb|EKZ13823.1| YcfD protein [Escherichia coli O104:H4 str. 11-02281]
 gi|429378903|gb|EKZ15410.1| YcfD protein [Escherichia coli O104:H4 str. 11-02318]
 gi|429388187|gb|EKZ24613.1| YcfD protein [Escherichia coli O104:H4 str. 11-03439]
 gi|429390050|gb|EKZ26466.1| YcfD protein [Escherichia coli O104:H4 str. 11-02913]
 gi|429393889|gb|EKZ30276.1| YcfD protein [Escherichia coli O104:H4 str. 11-03943]
 gi|429395192|gb|EKZ31560.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9450]
 gi|429403876|gb|EKZ40157.1| YcfD protein [Escherichia coli O104:H4 str. 11-04080]
 gi|429404994|gb|EKZ41261.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9990]
 gi|429408982|gb|EKZ45216.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4984]
 gi|429417057|gb|EKZ53208.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4987]
 gi|429421885|gb|EKZ58006.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4988]
 gi|429423626|gb|EKZ59734.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4986]
 gi|429425697|gb|EKZ61786.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-5603]
 gi|429432784|gb|EKZ68821.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-5604]
 gi|429442692|gb|EKZ78648.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-6006]
 gi|429447596|gb|EKZ83514.1| YcfD protein [Escherichia coli O104:H4 str. Ec12-0466]
 gi|429452251|gb|EKZ88137.1| YcfD protein [Escherichia coli O104:H4 str. Ec12-0465]
 gi|429454648|gb|EKZ90507.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9941]
 gi|430878983|gb|ELC02344.1| YcfD protein [Escherichia coli KTE4]
 gi|430887807|gb|ELC10546.1| YcfD protein [Escherichia coli KTE10]
 gi|430889325|gb|ELC11993.1| YcfD protein [Escherichia coli KTE5]
 gi|430900167|gb|ELC22188.1| YcfD protein [Escherichia coli KTE12]
 gi|430908867|gb|ELC30257.1| YcfD protein [Escherichia coli KTE16]
 gi|430910127|gb|ELC31484.1| YcfD protein [Escherichia coli KTE15]
 gi|430917302|gb|ELC38350.1| YcfD protein [Escherichia coli KTE25]
 gi|430932068|gb|ELC52502.1| YcfD protein [Escherichia coli KTE28]
 gi|430936880|gb|ELC57147.1| YcfD protein [Escherichia coli KTE39]
 gi|430946836|gb|ELC66759.1| YcfD protein [Escherichia coli KTE178]
 gi|430955055|gb|ELC73848.1| YcfD protein [Escherichia coli KTE187]
 gi|430965075|gb|ELC82517.1| YcfD protein [Escherichia coli KTE188]
 gi|430968037|gb|ELC85271.1| YcfD protein [Escherichia coli KTE189]
 gi|430974252|gb|ELC91185.1| YcfD protein [Escherichia coli KTE191]
 gi|430982528|gb|ELC99218.1| YcfD protein [Escherichia coli KTE193]
 gi|430983620|gb|ELD00276.1| YcfD protein [Escherichia coli KTE201]
 gi|430995146|gb|ELD11444.1| YcfD protein [Escherichia coli KTE205]
 gi|430999645|gb|ELD15727.1| YcfD protein [Escherichia coli KTE206]
 gi|431017402|gb|ELD30912.1| YcfD protein [Escherichia coli KTE212]
 gi|431023026|gb|ELD36286.1| YcfD protein [Escherichia coli KTE213]
 gi|431026955|gb|ELD40022.1| YcfD protein [Escherichia coli KTE214]
 gi|431031009|gb|ELD43913.1| YcfD protein [Escherichia coli KTE216]
 gi|431040721|gb|ELD51255.1| YcfD protein [Escherichia coli KTE220]
 gi|431043632|gb|ELD53949.1| YcfD protein [Escherichia coli KTE224]
 gi|431053850|gb|ELD63451.1| YcfD protein [Escherichia coli KTE230]
 gi|431056298|gb|ELD65818.1| YcfD protein [Escherichia coli KTE233]
 gi|431062471|gb|ELD71739.1| YcfD protein [Escherichia coli KTE234]
 gi|431076227|gb|ELD83732.1| YcfD protein [Escherichia coli KTE236]
 gi|431082800|gb|ELD89111.1| YcfD protein [Escherichia coli KTE237]
 gi|431085882|gb|ELD91986.1| YcfD protein [Escherichia coli KTE47]
 gi|431093690|gb|ELD99355.1| YcfD protein [Escherichia coli KTE49]
 gi|431101715|gb|ELE06625.1| YcfD protein [Escherichia coli KTE53]
 gi|431110355|gb|ELE14282.1| YcfD protein [Escherichia coli KTE55]
 gi|431118294|gb|ELE21314.1| YcfD protein [Escherichia coli KTE57]
 gi|431122636|gb|ELE25503.1| YcfD protein [Escherichia coli KTE58]
 gi|431132181|gb|ELE34197.1| YcfD protein [Escherichia coli KTE60]
 gi|431132772|gb|ELE34771.1| YcfD protein [Escherichia coli KTE62]
 gi|431139322|gb|ELE41118.1| YcfD protein [Escherichia coli KTE67]
 gi|431142587|gb|ELE44335.1| YcfD protein [Escherichia coli KTE66]
 gi|431150052|gb|ELE51110.1| YcfD protein [Escherichia coli KTE72]
 gi|431161303|gb|ELE61787.1| YcfD protein [Escherichia coli KTE76]
 gi|431182367|gb|ELE82188.1| YcfD protein [Escherichia coli KTE86]
 gi|431192303|gb|ELE91653.1| YcfD protein [Escherichia coli KTE87]
 gi|431194982|gb|ELE94196.1| YcfD protein [Escherichia coli KTE93]
 gi|431212595|gb|ELF10522.1| YcfD protein [Escherichia coli KTE119]
 gi|431223793|gb|ELF21039.1| YcfD protein [Escherichia coli KTE143]
 gi|431235275|gb|ELF30526.1| YcfD protein [Escherichia coli KTE162]
 gi|431245842|gb|ELF40121.1| YcfD protein [Escherichia coli KTE169]
 gi|431250652|gb|ELF44711.1| YcfD protein [Escherichia coli KTE6]
 gi|431258890|gb|ELF51653.1| YcfD protein [Escherichia coli KTE8]
 gi|431265264|gb|ELF56961.1| YcfD protein [Escherichia coli KTE9]
 gi|431267686|gb|ELF59203.1| YcfD protein [Escherichia coli KTE17]
 gi|431274985|gb|ELF66030.1| YcfD protein [Escherichia coli KTE18]
 gi|431276943|gb|ELF67958.1| YcfD protein [Escherichia coli KTE45]
 gi|431285474|gb|ELF76310.1| YcfD protein [Escherichia coli KTE23]
 gi|431293547|gb|ELF83840.1| YcfD protein [Escherichia coli KTE43]
 gi|431304440|gb|ELF92969.1| YcfD protein [Escherichia coli KTE22]
 gi|431310277|gb|ELF98469.1| YcfD protein [Escherichia coli KTE46]
 gi|431312336|gb|ELG00340.1| YcfD protein [Escherichia coli KTE48]
 gi|431329324|gb|ELG16622.1| YcfD protein [Escherichia coli KTE59]
 gi|431331448|gb|ELG18711.1| YcfD protein [Escherichia coli KTE63]
 gi|431340835|gb|ELG27856.1| YcfD protein [Escherichia coli KTE65]
 gi|431340973|gb|ELG27993.1| YcfD protein [Escherichia coli KTE78]
 gi|431344507|gb|ELG31445.1| YcfD protein [Escherichia coli KTE79]
 gi|431349545|gb|ELG36374.1| YcfD protein [Escherichia coli KTE84]
 gi|431355589|gb|ELG42293.1| YcfD protein [Escherichia coli KTE101]
 gi|431356860|gb|ELG43550.1| YcfD protein [Escherichia coli KTE91]
 gi|431366986|gb|ELG53483.1| YcfD protein [Escherichia coli KTE115]
 gi|431369515|gb|ELG55736.1| YcfD protein [Escherichia coli KTE118]
 gi|431373720|gb|ELG59323.1| YcfD protein [Escherichia coli KTE123]
 gi|431379445|gb|ELG64379.1| YcfD protein [Escherichia coli KTE135]
 gi|431387081|gb|ELG71034.1| YcfD protein [Escherichia coli KTE136]
 gi|431391311|gb|ELG74959.1| YcfD protein [Escherichia coli KTE140]
 gi|431396544|gb|ELG80022.1| YcfD protein [Escherichia coli KTE141]
 gi|431401578|gb|ELG84922.1| YcfD protein [Escherichia coli KTE144]
 gi|431406959|gb|ELG90178.1| YcfD protein [Escherichia coli KTE146]
 gi|431413077|gb|ELG95876.1| YcfD protein [Escherichia coli KTE154]
 gi|431424815|gb|ELH06911.1| YcfD protein [Escherichia coli KTE165]
 gi|431428752|gb|ELH10693.1| YcfD protein [Escherichia coli KTE192]
 gi|431434430|gb|ELH16080.1| YcfD protein [Escherichia coli KTE194]
 gi|431439125|gb|ELH20464.1| YcfD protein [Escherichia coli KTE173]
 gi|431446790|gb|ELH27533.1| YcfD protein [Escherichia coli KTE175]
 gi|431455328|gb|ELH35684.1| YcfD protein [Escherichia coli KTE184]
 gi|431465990|gb|ELH46070.1| YcfD protein [Escherichia coli KTE183]
 gi|431481269|gb|ELH60983.1| YcfD protein [Escherichia coli KTE209]
 gi|431484332|gb|ELH64012.1| YcfD protein [Escherichia coli KTE207]
 gi|431493466|gb|ELH73060.1| YcfD protein [Escherichia coli KTE211]
 gi|431497071|gb|ELH76649.1| YcfD protein [Escherichia coli KTE217]
 gi|431502345|gb|ELH81236.1| YcfD protein [Escherichia coli KTE215]
 gi|431509043|gb|ELH87314.1| YcfD protein [Escherichia coli KTE218]
 gi|431512730|gb|ELH90820.1| YcfD protein [Escherichia coli KTE223]
 gi|431516945|gb|ELH94543.1| YcfD protein [Escherichia coli KTE229]
 gi|431517893|gb|ELH95415.1| YcfD protein [Escherichia coli KTE227]
 gi|431534017|gb|ELI10508.1| YcfD protein [Escherichia coli KTE104]
 gi|431538979|gb|ELI14958.1| YcfD protein [Escherichia coli KTE106]
 gi|431544711|gb|ELI19526.1| YcfD protein [Escherichia coli KTE109]
 gi|431553337|gb|ELI27264.1| YcfD protein [Escherichia coli KTE113]
 gi|431558052|gb|ELI31734.1| YcfD protein [Escherichia coli KTE112]
 gi|431573019|gb|ELI45833.1| YcfD protein [Escherichia coli KTE124]
 gi|431591241|gb|ELI62240.1| YcfD protein [Escherichia coli KTE129]
 gi|431599643|gb|ELI69348.1| YcfD protein [Escherichia coli KTE131]
 gi|431604826|gb|ELI74227.1| YcfD protein [Escherichia coli KTE133]
 gi|431608415|gb|ELI77758.1| YcfD protein [Escherichia coli KTE137]
 gi|431618655|gb|ELI87623.1| YcfD protein [Escherichia coli KTE139]
 gi|431621582|gb|ELI90378.1| YcfD protein [Escherichia coli KTE145]
 gi|431630154|gb|ELI98495.1| YcfD protein [Escherichia coli KTE150]
 gi|431632371|gb|ELJ00660.1| YcfD protein [Escherichia coli KTE148]
 gi|431635837|gb|ELJ04005.1| YcfD protein [Escherichia coli KTE153]
 gi|431646918|gb|ELJ14408.1| YcfD protein [Escherichia coli KTE157]
 gi|431648653|gb|ELJ16029.1| YcfD protein [Escherichia coli KTE160]
 gi|431649917|gb|ELJ17256.1| YcfD protein [Escherichia coli KTE163]
 gi|431660618|gb|ELJ27487.1| YcfD protein [Escherichia coli KTE166]
 gi|431663778|gb|ELJ30533.1| YcfD protein [Escherichia coli KTE167]
 gi|431664761|gb|ELJ31494.1| YcfD protein [Escherichia coli KTE168]
 gi|431674456|gb|ELJ40618.1| YcfD protein [Escherichia coli KTE174]
 gi|431677434|gb|ELJ43511.1| YcfD protein [Escherichia coli KTE176]
 gi|431690865|gb|ELJ56339.1| YcfD protein [Escherichia coli KTE179]
 gi|431692537|gb|ELJ57973.1| YcfD protein [Escherichia coli KTE180]
 gi|431695333|gb|ELJ60648.1| YcfD protein [Escherichia coli KTE232]
 gi|431708098|gb|ELJ72623.1| YcfD protein [Escherichia coli KTE88]
 gi|431710340|gb|ELJ74765.1| YcfD protein [Escherichia coli KTE85]
 gi|431719225|gb|ELJ83285.1| YcfD protein [Escherichia coli KTE90]
 gi|431724759|gb|ELJ88675.1| YcfD protein [Escherichia coli KTE95]
 gi|431731939|gb|ELJ95400.1| YcfD protein [Escherichia coli KTE97]
 gi|431735586|gb|ELJ98944.1| YcfD protein [Escherichia coli KTE99]
 gi|432348796|gb|ELL43239.1| hypothetical protein B185_004920 [Escherichia coli J96]
 gi|441651133|emb|CCQ03108.1| FIG002776: hypothetical protein [Escherichia coli O5:K4(L):H4 str.
           ATCC 23502]
 gi|441715297|emb|CCQ04680.1| FIG002776: hypothetical protein [Escherichia coli Nissle 1917]
 gi|443421747|gb|AGC86651.1| hypothetical protein APECO78_09600 [Escherichia coli APEC O78]
 gi|449323433|gb|EMD13391.1| hypothetical protein A364_06482 [Escherichia coli SEPT362]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|307310095|ref|ZP_07589745.1| Cupin 4 family protein [Escherichia coli W]
 gi|378713474|ref|YP_005278367.1| transcription factor jumonji jmjC domain-containing protein
           [Escherichia coli KO11FL]
 gi|386608488|ref|YP_006123974.1| hypothetical protein ECW_m1236 [Escherichia coli W]
 gi|386701900|ref|YP_006165737.1| putative cytoplasmic protein [Escherichia coli KO11FL]
 gi|386708937|ref|YP_006172658.1| putative cytoplasmic protein [Escherichia coli W]
 gi|306909813|gb|EFN40307.1| Cupin 4 family protein [Escherichia coli W]
 gi|315060405|gb|ADT74732.1| conserved protein [Escherichia coli W]
 gi|323379035|gb|ADX51303.1| transcription factor jumonji jmjC domain-containing protein
           [Escherichia coli KO11FL]
 gi|383393427|gb|AFH18385.1| putative cytoplasmic protein [Escherichia coli KO11FL]
 gi|383404629|gb|AFH10872.1| putative cytoplasmic protein [Escherichia coli W]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|90111217|ref|NP_415646.4| cupin superfamily protein [Escherichia coli str. K-12 substr.
           MG1655]
 gi|170080779|ref|YP_001730099.1| hypothetical protein ECDH10B_1200 [Escherichia coli str. K-12
           substr. DH10B]
 gi|191168343|ref|ZP_03030135.1| cupin family protein [Escherichia coli B7A]
 gi|193071450|ref|ZP_03052363.1| cupin family protein [Escherichia coli E110019]
 gi|194429544|ref|ZP_03062065.1| cupin family protein [Escherichia coli B171]
 gi|209918385|ref|YP_002292469.1| hypothetical protein ECSE_1194 [Escherichia coli SE11]
 gi|218553706|ref|YP_002386619.1| hypothetical protein ECIAI1_1166 [Escherichia coli IAI1]
 gi|238900382|ref|YP_002926178.1| hypothetical protein BWG_0976 [Escherichia coli BW2952]
 gi|260854793|ref|YP_003228684.1| hypothetical protein ECO26_1645 [Escherichia coli O26:H11 str.
           11368]
 gi|260867557|ref|YP_003233959.1| hypothetical protein ECO111_1476 [Escherichia coli O111:H- str.
           11128]
 gi|293433416|ref|ZP_06661844.1| ycfD protein [Escherichia coli B088]
 gi|301029563|ref|ZP_07192641.1| cupin family protein [Escherichia coli MS 196-1]
 gi|386280239|ref|ZP_10057907.1| hypothetical protein ESBG_01540 [Escherichia sp. 4_1_40B]
 gi|386596044|ref|YP_006092444.1| cupin [Escherichia coli DH1]
 gi|386705150|ref|YP_006168997.1| YcfD protein [Escherichia coli P12b]
 gi|387611716|ref|YP_006114832.1| hypothetical protein ETEC_1252 [Escherichia coli ETEC H10407]
 gi|387620837|ref|YP_006128464.1| putative cytoplasmic protein [Escherichia coli DH1]
 gi|388477208|ref|YP_489396.1| hypothetical protein Y75_p1098 [Escherichia coli str. K-12 substr.
           W3110]
 gi|404374512|ref|ZP_10979724.1| hypothetical protein ESCG_03189 [Escherichia sp. 1_1_43]
 gi|415778016|ref|ZP_11489126.1| cupin family protein [Escherichia coli 3431]
 gi|415789197|ref|ZP_11494616.1| cupin family protein [Escherichia coli EPECa14]
 gi|415794696|ref|ZP_11496511.1| cupin superfamily protein [Escherichia coli E128010]
 gi|415819972|ref|ZP_11509269.1| cupin superfamily protein [Escherichia coli OK1180]
 gi|415827210|ref|ZP_11514127.1| cupin superfamily protein [Escherichia coli OK1357]
 gi|417134377|ref|ZP_11979162.1| cupin domain protein, PF06172 family [Escherichia coli 5.0588]
 gi|417137695|ref|ZP_11981485.1| cupin domain protein, PF06172 family [Escherichia coli 97.0259]
 gi|417155461|ref|ZP_11993590.1| cupin domain protein, PF06172 family [Escherichia coli 96.0497]
 gi|417172777|ref|ZP_12002810.1| cupin domain protein, PF06172 family [Escherichia coli 3.2608]
 gi|417180530|ref|ZP_12008238.1| cupin domain protein, PF06172 family [Escherichia coli 93.0624]
 gi|417200789|ref|ZP_12017582.1| cupin domain protein, PF06172 family [Escherichia coli 4.0522]
 gi|417205673|ref|ZP_12019171.1| cupin domain protein, PF06172 family [Escherichia coli JB1-95]
 gi|417225066|ref|ZP_12028357.1| cupin domain protein, PF06172 family [Escherichia coli 96.154]
 gi|417250662|ref|ZP_12042437.1| cupin domain protein, PF06172 family [Escherichia coli 4.0967]
 gi|417275044|ref|ZP_12062384.1| cupin domain protein, PF06172 family [Escherichia coli 2.4168]
 gi|417278474|ref|ZP_12065789.1| cupin domain protein, PF06172 family [Escherichia coli 3.2303]
 gi|417290206|ref|ZP_12077489.1| cupin domain protein, PF06172 family [Escherichia coli B41]
 gi|417297791|ref|ZP_12085035.1| cupin domain protein, PF06172 family [Escherichia coli 900105
           (10e)]
 gi|417307591|ref|ZP_12094458.1| hypothetical protein PPECC33_10300 [Escherichia coli PCN033]
 gi|417580413|ref|ZP_12231229.1| cupin superfamily protein [Escherichia coli STEC_B2F1]
 gi|417591055|ref|ZP_12241764.1| cupin superfamily protein [Escherichia coli 2534-86]
 gi|417601549|ref|ZP_12252127.1| cupin superfamily protein [Escherichia coli STEC_94C]
 gi|417612392|ref|ZP_12262860.1| cupin superfamily protein [Escherichia coli STEC_EH250]
 gi|417617601|ref|ZP_12268029.1| cupin superfamily protein [Escherichia coli G58-1]
 gi|417622660|ref|ZP_12272977.1| cupin superfamily protein [Escherichia coli STEC_H.1.8]
 gi|417633935|ref|ZP_12284151.1| cupin superfamily protein [Escherichia coli STEC_S1191]
 gi|417666405|ref|ZP_12315960.1| cupin superfamily protein [Escherichia coli STEC_O31]
 gi|417944685|ref|ZP_12587925.1| putative cytoplasmic protein [Escherichia coli XH140A]
 gi|417977120|ref|ZP_12617907.1| putative cytoplasmic protein [Escherichia coli XH001]
 gi|419141773|ref|ZP_13686521.1| cupin superfamily protein [Escherichia coli DEC6A]
 gi|419147512|ref|ZP_13692195.1| cupin superfamily protein [Escherichia coli DEC6B]
 gi|419153114|ref|ZP_13697695.1| cupin superfamily protein [Escherichia coli DEC6C]
 gi|419158521|ref|ZP_13703035.1| cupin superfamily protein [Escherichia coli DEC6D]
 gi|419163619|ref|ZP_13708083.1| cupin superfamily protein [Escherichia coli DEC6E]
 gi|419196334|ref|ZP_13739734.1| cupin superfamily protein [Escherichia coli DEC8A]
 gi|419202626|ref|ZP_13745834.1| cupin superfamily protein [Escherichia coli DEC8B]
 gi|419208700|ref|ZP_13751808.1| cupin superfamily protein [Escherichia coli DEC8C]
 gi|419214964|ref|ZP_13757982.1| cupin superfamily protein [Escherichia coli DEC8D]
 gi|419220647|ref|ZP_13763594.1| cupin superfamily protein [Escherichia coli DEC8E]
 gi|419226114|ref|ZP_13768985.1| cupin superfamily protein [Escherichia coli DEC9A]
 gi|419231872|ref|ZP_13774658.1| cupin superfamily protein [Escherichia coli DEC9B]
 gi|419242671|ref|ZP_13785317.1| cupin superfamily protein [Escherichia coli DEC9D]
 gi|419248396|ref|ZP_13790993.1| cupin superfamily protein [Escherichia coli DEC9E]
 gi|419254264|ref|ZP_13796792.1| cupin superfamily protein [Escherichia coli DEC10A]
 gi|419260382|ref|ZP_13802815.1| cupin superfamily protein [Escherichia coli DEC10B]
 gi|419266473|ref|ZP_13808841.1| cupin superfamily protein [Escherichia coli DEC10C]
 gi|419271920|ref|ZP_13814230.1| cupin superfamily protein [Escherichia coli DEC10D]
 gi|419277414|ref|ZP_13819675.1| cupin superfamily protein [Escherichia coli DEC10E]
 gi|419283512|ref|ZP_13825708.1| cupin superfamily protein [Escherichia coli DEC10F]
 gi|419288916|ref|ZP_13831015.1| cupin superfamily protein [Escherichia coli DEC11A]
 gi|419294200|ref|ZP_13836250.1| cupin superfamily protein [Escherichia coli DEC11B]
 gi|419299571|ref|ZP_13841580.1| cupin superfamily protein [Escherichia coli DEC11C]
 gi|419305783|ref|ZP_13847691.1| cupin superfamily protein [Escherichia coli DEC11D]
 gi|419310806|ref|ZP_13852676.1| cupin superfamily protein [Escherichia coli DEC11E]
 gi|419316124|ref|ZP_13857944.1| cupin superfamily protein [Escherichia coli DEC12A]
 gi|419322135|ref|ZP_13863858.1| cupin superfamily protein [Escherichia coli DEC12B]
 gi|419328207|ref|ZP_13869833.1| cupin superfamily protein [Escherichia coli DEC12C]
 gi|419333857|ref|ZP_13875401.1| cupin superfamily protein [Escherichia coli DEC12D]
 gi|419339081|ref|ZP_13880564.1| cupin superfamily protein [Escherichia coli DEC12E]
 gi|419369456|ref|ZP_13910582.1| cupin superfamily protein [Escherichia coli DEC14A]
 gi|419374997|ref|ZP_13916035.1| cupin superfamily protein [Escherichia coli DEC14B]
 gi|419380204|ref|ZP_13921170.1| cupin superfamily protein [Escherichia coli DEC14C]
 gi|419385551|ref|ZP_13926437.1| cupin superfamily protein [Escherichia coli DEC14D]
 gi|419390713|ref|ZP_13931540.1| cupin superfamily protein [Escherichia coli DEC15A]
 gi|419395933|ref|ZP_13936712.1| cupin superfamily protein [Escherichia coli DEC15B]
 gi|419401311|ref|ZP_13942038.1| cupin superfamily protein [Escherichia coli DEC15C]
 gi|419406432|ref|ZP_13947126.1| cupin superfamily protein [Escherichia coli DEC15D]
 gi|419412000|ref|ZP_13952663.1| cupin superfamily protein [Escherichia coli DEC15E]
 gi|419810913|ref|ZP_14335791.1| cytoplasmic protein [Escherichia coli O32:H37 str. P4]
 gi|419867399|ref|ZP_14389726.1| cytoplasmic protein [Escherichia coli O103:H25 str. CVM9340]
 gi|419869802|ref|ZP_14391983.1| cytoplasmic protein [Escherichia coli O103:H2 str. CVM9450]
 gi|419873586|ref|ZP_14395568.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9534]
 gi|419880031|ref|ZP_14401444.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9545]
 gi|419892533|ref|ZP_14412551.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9570]
 gi|419894477|ref|ZP_14414384.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9574]
 gi|419928396|ref|ZP_14446109.1| putative cytoplasmic protein [Escherichia coli 541-1]
 gi|419940688|ref|ZP_14457413.1| putative cytoplasmic protein [Escherichia coli 75]
 gi|419951807|ref|ZP_14467989.1| putative cytoplasmic protein [Escherichia coli CUMT8]
 gi|420091191|ref|ZP_14602946.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9602]
 gi|420095320|ref|ZP_14606833.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9634]
 gi|420106338|ref|ZP_14616750.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9553]
 gi|420117229|ref|ZP_14626595.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10021]
 gi|420120670|ref|ZP_14629857.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10030]
 gi|420129810|ref|ZP_14638330.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10224]
 gi|420136162|ref|ZP_14644225.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM9952]
 gi|420390738|ref|ZP_14890002.1| cupin superfamily protein [Escherichia coli EPEC C342-62]
 gi|422762806|ref|ZP_16816562.1| cupin superfamily protein [Escherichia coli E1167]
 gi|422765694|ref|ZP_16819421.1| cupin superfamily protein [Escherichia coli E1520]
 gi|422770360|ref|ZP_16824051.1| cupin superfamily protein [Escherichia coli E482]
 gi|422817530|ref|ZP_16865744.1| hypothetical protein ESMG_02056 [Escherichia coli M919]
 gi|422958689|ref|ZP_16970620.1| hypothetical protein ESQG_02115 [Escherichia coli H494]
 gi|423701973|ref|ZP_17676432.1| hypothetical protein ESSG_01504 [Escherichia coli H730]
 gi|424748811|ref|ZP_18176938.1| cytoplasmic protein [Escherichia coli O26:H11 str. CFSAN001629]
 gi|424760441|ref|ZP_18188068.1| cytoplasmic protein [Escherichia coli O111:H11 str. CFSAN001630]
 gi|424771857|ref|ZP_18198981.1| cytoplasmic protein [Escherichia coli O111:H8 str. CFSAN001632]
 gi|425114495|ref|ZP_18516313.1| hypothetical protein EC80566_1158 [Escherichia coli 8.0566]
 gi|425119205|ref|ZP_18520921.1| cupin superfamily protein [Escherichia coli 8.0569]
 gi|425272131|ref|ZP_18663601.1| hypothetical protein ECTW15901_1387 [Escherichia coli TW15901]
 gi|425282637|ref|ZP_18673725.1| hypothetical protein ECTW00353_1274 [Escherichia coli TW00353]
 gi|425378537|ref|ZP_18762771.1| hypothetical protein ECEC1865_1692 [Escherichia coli EC1865]
 gi|432391150|ref|ZP_19634008.1| YcfD protein [Escherichia coli KTE21]
 gi|432416177|ref|ZP_19658799.1| YcfD protein [Escherichia coli KTE44]
 gi|432480552|ref|ZP_19722513.1| YcfD protein [Escherichia coli KTE210]
 gi|432563189|ref|ZP_19799807.1| YcfD protein [Escherichia coli KTE51]
 gi|432579828|ref|ZP_19816258.1| YcfD protein [Escherichia coli KTE56]
 gi|432626725|ref|ZP_19862706.1| YcfD protein [Escherichia coli KTE77]
 gi|432636393|ref|ZP_19872275.1| YcfD protein [Escherichia coli KTE81]
 gi|432660348|ref|ZP_19895998.1| YcfD protein [Escherichia coli KTE111]
 gi|432684958|ref|ZP_19920266.1| YcfD protein [Escherichia coli KTE156]
 gi|432691048|ref|ZP_19926286.1| YcfD protein [Escherichia coli KTE161]
 gi|432703742|ref|ZP_19938859.1| YcfD protein [Escherichia coli KTE171]
 gi|432736623|ref|ZP_19971393.1| YcfD protein [Escherichia coli KTE42]
 gi|432749603|ref|ZP_19984215.1| YcfD protein [Escherichia coli KTE29]
 gi|432769953|ref|ZP_20004305.1| YcfD protein [Escherichia coli KTE50]
 gi|432874285|ref|ZP_20093422.1| YcfD protein [Escherichia coli KTE147]
 gi|432954419|ref|ZP_20146538.1| YcfD protein [Escherichia coli KTE197]
 gi|432960681|ref|ZP_20150801.1| YcfD protein [Escherichia coli KTE202]
 gi|432967269|ref|ZP_20156185.1| YcfD protein [Escherichia coli KTE203]
 gi|433047270|ref|ZP_20234674.1| YcfD protein [Escherichia coli KTE120]
 gi|433062354|ref|ZP_20249306.1| YcfD protein [Escherichia coli KTE125]
 gi|433091452|ref|ZP_20277744.1| YcfD protein [Escherichia coli KTE138]
 gi|442590699|ref|ZP_21009458.1| FIG002776: hypothetical protein [Escherichia coli O10:K5(L):H4 str.
           ATCC 23506]
 gi|450213320|ref|ZP_21894854.1| cytoplasmic protein [Escherichia coli O08]
 gi|450241873|ref|ZP_21899707.1| cytoplasmic protein [Escherichia coli S17]
 gi|2506643|sp|P27431.2|YCFD_ECOLI RecName: Full=50S ribosomal protein L16 arginine hydroxylase;
           AltName: Full=Ribosomal oxygenase YcfD; Short=ROX
 gi|4062695|dbj|BAA35950.1| conserved hypothetical protein [Escherichia coli str. K12 substr.
           W3110]
 gi|87081836|gb|AAC74212.2| cupin superfamily protein [Escherichia coli str. K-12 substr.
           MG1655]
 gi|169888614|gb|ACB02321.1| conserved protein [Escherichia coli str. K-12 substr. DH10B]
 gi|190901638|gb|EDV61395.1| cupin family protein [Escherichia coli B7A]
 gi|192955219|gb|EDV85709.1| cupin family protein [Escherichia coli E110019]
 gi|194412418|gb|EDX28719.1| cupin family protein [Escherichia coli B171]
 gi|209911644|dbj|BAG76718.1| conserved hypothetical protein [Escherichia coli SE11]
 gi|218360474|emb|CAQ98028.1| conserved hypothetical protein [Escherichia coli IAI1]
 gi|238860298|gb|ACR62296.1| conserved protein [Escherichia coli BW2952]
 gi|257753442|dbj|BAI24944.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368]
 gi|257763913|dbj|BAI35408.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
 gi|260449733|gb|ACX40155.1| Cupin 4 family protein [Escherichia coli DH1]
 gi|291324235|gb|EFE63657.1| ycfD protein [Escherichia coli B088]
 gi|299877558|gb|EFI85769.1| cupin family protein [Escherichia coli MS 196-1]
 gi|309701452|emb|CBJ00756.1| conserved hypothetical protein [Escherichia coli ETEC H10407]
 gi|315135760|dbj|BAJ42919.1| putative cytoplasmic protein [Escherichia coli DH1]
 gi|315616014|gb|EFU96640.1| cupin family protein [Escherichia coli 3431]
 gi|323153859|gb|EFZ40094.1| cupin family protein [Escherichia coli EPECa14]
 gi|323163672|gb|EFZ49494.1| cupin superfamily protein [Escherichia coli E128010]
 gi|323179336|gb|EFZ64906.1| cupin superfamily protein [Escherichia coli OK1180]
 gi|323185688|gb|EFZ71049.1| cupin superfamily protein [Escherichia coli OK1357]
 gi|323937868|gb|EGB34132.1| cupin superfamily protein [Escherichia coli E1520]
 gi|323942598|gb|EGB38765.1| cupin superfamily protein [Escherichia coli E482]
 gi|324117301|gb|EGC11208.1| cupin superfamily protein [Escherichia coli E1167]
 gi|338770967|gb|EGP25720.1| hypothetical protein PPECC33_10300 [Escherichia coli PCN033]
 gi|342363508|gb|EGU27615.1| putative cytoplasmic protein [Escherichia coli XH140A]
 gi|344193160|gb|EGV47243.1| putative cytoplasmic protein [Escherichia coli XH001]
 gi|345342072|gb|EGW74470.1| cupin superfamily protein [Escherichia coli STEC_B2F1]
 gi|345343290|gb|EGW75678.1| cupin superfamily protein [Escherichia coli 2534-86]
 gi|345352152|gb|EGW84402.1| cupin superfamily protein [Escherichia coli STEC_94C]
 gi|345364344|gb|EGW96469.1| cupin superfamily protein [Escherichia coli STEC_EH250]
 gi|345379738|gb|EGX11646.1| cupin superfamily protein [Escherichia coli G58-1]
 gi|345383846|gb|EGX13717.1| cupin superfamily protein [Escherichia coli STEC_H.1.8]
 gi|345389242|gb|EGX19048.1| cupin superfamily protein [Escherichia coli STEC_S1191]
 gi|359331781|dbj|BAL38228.1| conserved protein [Escherichia coli str. K-12 substr. MDS42]
 gi|371596518|gb|EHN85356.1| hypothetical protein ESQG_02115 [Escherichia coli H494]
 gi|377997429|gb|EHV60533.1| cupin superfamily protein [Escherichia coli DEC6A]
 gi|377998531|gb|EHV61622.1| cupin superfamily protein [Escherichia coli DEC6B]
 gi|378001801|gb|EHV64858.1| cupin superfamily protein [Escherichia coli DEC6C]
 gi|378011450|gb|EHV74393.1| cupin superfamily protein [Escherichia coli DEC6D]
 gi|378013192|gb|EHV76112.1| cupin superfamily protein [Escherichia coli DEC6E]
 gi|378050307|gb|EHW12636.1| cupin superfamily protein [Escherichia coli DEC8A]
 gi|378053698|gb|EHW15993.1| cupin superfamily protein [Escherichia coli DEC8B]
 gi|378058207|gb|EHW20426.1| cupin superfamily protein [Escherichia coli DEC8C]
 gi|378065705|gb|EHW27847.1| cupin superfamily protein [Escherichia coli DEC8D]
 gi|378070233|gb|EHW32314.1| cupin superfamily protein [Escherichia coli DEC8E]
 gi|378077646|gb|EHW39639.1| cupin superfamily protein [Escherichia coli DEC9A]
 gi|378080633|gb|EHW42594.1| cupin superfamily protein [Escherichia coli DEC9B]
 gi|378093317|gb|EHW55130.1| cupin superfamily protein [Escherichia coli DEC9D]
 gi|378098138|gb|EHW59880.1| cupin superfamily protein [Escherichia coli DEC9E]
 gi|378102921|gb|EHW64592.1| cupin superfamily protein [Escherichia coli DEC10A]
 gi|378110066|gb|EHW71662.1| cupin superfamily protein [Escherichia coli DEC10B]
 gi|378114650|gb|EHW76202.1| cupin superfamily protein [Escherichia coli DEC10C]
 gi|378119787|gb|EHW81275.1| cupin superfamily protein [Escherichia coli DEC10D]
 gi|378132583|gb|EHW93935.1| cupin superfamily protein [Escherichia coli DEC10E]
 gi|378133349|gb|EHW94694.1| cupin superfamily protein [Escherichia coli DEC11A]
 gi|378136145|gb|EHW97444.1| cupin superfamily protein [Escherichia coli DEC10F]
 gi|378143753|gb|EHX04938.1| cupin superfamily protein [Escherichia coli DEC11B]
 gi|378151028|gb|EHX12141.1| cupin superfamily protein [Escherichia coli DEC11D]
 gi|378153954|gb|EHX15031.1| cupin superfamily protein [Escherichia coli DEC11C]
 gi|378159404|gb|EHX20408.1| cupin superfamily protein [Escherichia coli DEC11E]
 gi|378171058|gb|EHX31930.1| cupin superfamily protein [Escherichia coli DEC12B]
 gi|378172817|gb|EHX33664.1| cupin superfamily protein [Escherichia coli DEC12A]
 gi|378174309|gb|EHX35135.1| cupin superfamily protein [Escherichia coli DEC12C]
 gi|378186070|gb|EHX46693.1| cupin superfamily protein [Escherichia coli DEC12D]
 gi|378192615|gb|EHX53170.1| cupin superfamily protein [Escherichia coli DEC12E]
 gi|378221131|gb|EHX81382.1| cupin superfamily protein [Escherichia coli DEC14A]
 gi|378223099|gb|EHX83329.1| cupin superfamily protein [Escherichia coli DEC14B]
 gi|378230753|gb|EHX90867.1| cupin superfamily protein [Escherichia coli DEC14C]
 gi|378233830|gb|EHX93913.1| cupin superfamily protein [Escherichia coli DEC14D]
 gi|378240943|gb|EHY00912.1| cupin superfamily protein [Escherichia coli DEC15A]
 gi|378248271|gb|EHY08185.1| cupin superfamily protein [Escherichia coli DEC15B]
 gi|378248965|gb|EHY08875.1| cupin superfamily protein [Escherichia coli DEC15C]
 gi|378256204|gb|EHY16056.1| cupin superfamily protein [Escherichia coli DEC15D]
 gi|378260188|gb|EHY19993.1| cupin superfamily protein [Escherichia coli DEC15E]
 gi|383103318|gb|AFG40827.1| YcfD protein [Escherichia coli P12b]
 gi|385156207|gb|EIF18205.1| cytoplasmic protein [Escherichia coli O32:H37 str. P4]
 gi|385539037|gb|EIF85879.1| hypothetical protein ESMG_02056 [Escherichia coli M919]
 gi|385711376|gb|EIG48335.1| hypothetical protein ESSG_01504 [Escherichia coli H730]
 gi|386122615|gb|EIG71224.1| hypothetical protein ESBG_01540 [Escherichia sp. 4_1_40B]
 gi|386152231|gb|EIH03520.1| cupin domain protein, PF06172 family [Escherichia coli 5.0588]
 gi|386159259|gb|EIH15592.1| cupin domain protein, PF06172 family [Escherichia coli 97.0259]
 gi|386168550|gb|EIH35066.1| cupin domain protein, PF06172 family [Escherichia coli 96.0497]
 gi|386180475|gb|EIH57949.1| cupin domain protein, PF06172 family [Escherichia coli 3.2608]
 gi|386185885|gb|EIH68611.1| cupin domain protein, PF06172 family [Escherichia coli 93.0624]
 gi|386187574|gb|EIH76392.1| cupin domain protein, PF06172 family [Escherichia coli 4.0522]
 gi|386197930|gb|EIH92124.1| cupin domain protein, PF06172 family [Escherichia coli JB1-95]
 gi|386200114|gb|EIH99105.1| cupin domain protein, PF06172 family [Escherichia coli 96.154]
 gi|386219221|gb|EII35694.1| cupin domain protein, PF06172 family [Escherichia coli 4.0967]
 gi|386233472|gb|EII65457.1| cupin domain protein, PF06172 family [Escherichia coli 2.4168]
 gi|386238727|gb|EII75662.1| cupin domain protein, PF06172 family [Escherichia coli 3.2303]
 gi|386256244|gb|EIJ05932.1| cupin domain protein, PF06172 family [Escherichia coli B41]
 gi|386259003|gb|EIJ14480.1| cupin domain protein, PF06172 family [Escherichia coli 900105
           (10e)]
 gi|388332654|gb|EIK99319.1| cytoplasmic protein [Escherichia coli O103:H25 str. CVM9340]
 gi|388341408|gb|EIL07519.1| cytoplasmic protein [Escherichia coli O103:H2 str. CVM9450]
 gi|388347533|gb|EIL13199.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9570]
 gi|388352382|gb|EIL17502.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9534]
 gi|388363916|gb|EIL27814.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9574]
 gi|388370058|gb|EIL33611.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9545]
 gi|388402359|gb|EIL62932.1| putative cytoplasmic protein [Escherichia coli 75]
 gi|388405677|gb|EIL66100.1| putative cytoplasmic protein [Escherichia coli 541-1]
 gi|388413648|gb|EIL73638.1| putative cytoplasmic protein [Escherichia coli CUMT8]
 gi|391313927|gb|EIQ71494.1| cupin superfamily protein [Escherichia coli EPEC C342-62]
 gi|394381661|gb|EJE59337.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10224]
 gi|394384048|gb|EJE61621.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9602]
 gi|394393103|gb|EJE69797.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9634]
 gi|394402398|gb|EJE78121.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10021]
 gi|394416322|gb|EJE90121.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9553]
 gi|394418855|gb|EJE92505.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM9952]
 gi|394428576|gb|EJF01107.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10030]
 gi|397785869|gb|EJK96712.1| cupin superfamily protein [Escherichia coli STEC_O31]
 gi|404292010|gb|EJZ48855.1| hypothetical protein ESCG_03189 [Escherichia sp. 1_1_43]
 gi|408195640|gb|EKI21010.1| hypothetical protein ECTW15901_1387 [Escherichia coli TW15901]
 gi|408204607|gb|EKI29551.1| hypothetical protein ECTW00353_1274 [Escherichia coli TW00353]
 gi|408301968|gb|EKJ19518.1| hypothetical protein ECEC1865_1692 [Escherichia coli EC1865]
 gi|408571483|gb|EKK47422.1| hypothetical protein EC80566_1158 [Escherichia coli 8.0566]
 gi|408572441|gb|EKK48350.1| cupin superfamily protein [Escherichia coli 8.0569]
 gi|421939747|gb|EKT97261.1| cytoplasmic protein [Escherichia coli O111:H8 str. CFSAN001632]
 gi|421943433|gb|EKU00718.1| cytoplasmic protein [Escherichia coli O26:H11 str. CFSAN001629]
 gi|421946016|gb|EKU03180.1| cytoplasmic protein [Escherichia coli O111:H11 str. CFSAN001630]
 gi|430921768|gb|ELC42592.1| YcfD protein [Escherichia coli KTE21]
 gi|430941958|gb|ELC62098.1| YcfD protein [Escherichia coli KTE44]
 gi|431009499|gb|ELD24119.1| YcfD protein [Escherichia coli KTE210]
 gi|431096703|gb|ELE02164.1| YcfD protein [Escherichia coli KTE51]
 gi|431107230|gb|ELE11416.1| YcfD protein [Escherichia coli KTE56]
 gi|431164673|gb|ELE65064.1| YcfD protein [Escherichia coli KTE77]
 gi|431173287|gb|ELE73368.1| YcfD protein [Escherichia coli KTE81]
 gi|431202220|gb|ELF00916.1| YcfD protein [Escherichia coli KTE111]
 gi|431223525|gb|ELF20772.1| YcfD protein [Escherichia coli KTE156]
 gi|431228842|gb|ELF25505.1| YcfD protein [Escherichia coli KTE161]
 gi|431245569|gb|ELF39854.1| YcfD protein [Escherichia coli KTE171]
 gi|431285008|gb|ELF75849.1| YcfD protein [Escherichia coli KTE42]
 gi|431298893|gb|ELF88517.1| YcfD protein [Escherichia coli KTE29]
 gi|431317410|gb|ELG05190.1| YcfD protein [Escherichia coli KTE50]
 gi|431404271|gb|ELG87529.1| YcfD protein [Escherichia coli KTE147]
 gi|431469717|gb|ELH49646.1| YcfD protein [Escherichia coli KTE197]
 gi|431473241|gb|ELH53075.1| YcfD protein [Escherichia coli KTE203]
 gi|431477888|gb|ELH57650.1| YcfD protein [Escherichia coli KTE202]
 gi|431569915|gb|ELI42845.1| YcfD protein [Escherichia coli KTE120]
 gi|431585416|gb|ELI57365.1| YcfD protein [Escherichia coli KTE125]
 gi|431612993|gb|ELI82198.1| YcfD protein [Escherichia coli KTE138]
 gi|441608967|emb|CCP95371.1| FIG002776: hypothetical protein [Escherichia coli O10:K5(L):H4 str.
           ATCC 23506]
 gi|449321002|gb|EMD11021.1| cytoplasmic protein [Escherichia coli O08]
 gi|449323348|gb|EMD13309.1| cytoplasmic protein [Escherichia coli S17]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|417628130|ref|ZP_12278377.1| cupin superfamily protein [Escherichia coli STEC_MHI813]
 gi|345378434|gb|EGX10365.1| cupin superfamily protein [Escherichia coli STEC_MHI813]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|218704602|ref|YP_002412121.1| hypothetical protein ECUMN_1372 [Escherichia coli UMN026]
 gi|293404483|ref|ZP_06648477.1| conserved hypothetical protein [Escherichia coli FVEC1412]
 gi|298380259|ref|ZP_06989864.1| ycfD protein [Escherichia coli FVEC1302]
 gi|419933745|ref|ZP_14450911.1| hypothetical protein EC5761_08660 [Escherichia coli 576-1]
 gi|422335730|ref|ZP_16416723.1| hypothetical protein HMPREF0986_05217 [Escherichia coli 4_1_47FAA]
 gi|432353143|ref|ZP_19596420.1| YcfD protein [Escherichia coli KTE2]
 gi|432401368|ref|ZP_19644122.1| YcfD protein [Escherichia coli KTE26]
 gi|432425424|ref|ZP_19667938.1| YcfD protein [Escherichia coli KTE181]
 gi|432460196|ref|ZP_19702352.1| YcfD protein [Escherichia coli KTE204]
 gi|432475288|ref|ZP_19717293.1| YcfD protein [Escherichia coli KTE208]
 gi|432521801|ref|ZP_19758956.1| YcfD protein [Escherichia coli KTE228]
 gi|432537234|ref|ZP_19774151.1| YcfD protein [Escherichia coli KTE235]
 gi|432630740|ref|ZP_19866683.1| YcfD protein [Escherichia coli KTE80]
 gi|432640391|ref|ZP_19876229.1| YcfD protein [Escherichia coli KTE83]
 gi|432665466|ref|ZP_19901050.1| YcfD protein [Escherichia coli KTE116]
 gi|432774304|ref|ZP_20008588.1| YcfD protein [Escherichia coli KTE54]
 gi|432885596|ref|ZP_20100117.1| YcfD protein [Escherichia coli KTE158]
 gi|432911672|ref|ZP_20117947.1| YcfD protein [Escherichia coli KTE190]
 gi|433018084|ref|ZP_20206341.1| YcfD protein [Escherichia coli KTE105]
 gi|433052499|ref|ZP_20239717.1| YcfD protein [Escherichia coli KTE122]
 gi|433067444|ref|ZP_20254262.1| YcfD protein [Escherichia coli KTE128]
 gi|433158095|ref|ZP_20342956.1| YcfD protein [Escherichia coli KTE177]
 gi|433177659|ref|ZP_20362102.1| YcfD protein [Escherichia coli KTE82]
 gi|218431699|emb|CAR12581.1| conserved hypothetical protein [Escherichia coli UMN026]
 gi|291429069|gb|EFF02094.1| conserved hypothetical protein [Escherichia coli FVEC1412]
 gi|298279957|gb|EFI21465.1| ycfD protein [Escherichia coli FVEC1302]
 gi|373243240|gb|EHP62754.1| hypothetical protein HMPREF0986_05217 [Escherichia coli 4_1_47FAA]
 gi|388410658|gb|EIL70870.1| hypothetical protein EC5761_08660 [Escherichia coli 576-1]
 gi|430876855|gb|ELC00325.1| YcfD protein [Escherichia coli KTE2]
 gi|430927966|gb|ELC48529.1| YcfD protein [Escherichia coli KTE26]
 gi|430958173|gb|ELC76770.1| YcfD protein [Escherichia coli KTE181]
 gi|430990854|gb|ELD07275.1| YcfD protein [Escherichia coli KTE204]
 gi|431007288|gb|ELD22100.1| YcfD protein [Escherichia coli KTE208]
 gi|431043944|gb|ELD54225.1| YcfD protein [Escherichia coli KTE228]
 gi|431072811|gb|ELD80562.1| YcfD protein [Escherichia coli KTE235]
 gi|431172869|gb|ELE72951.1| YcfD protein [Escherichia coli KTE80]
 gi|431183747|gb|ELE83521.1| YcfD protein [Escherichia coli KTE83]
 gi|431203101|gb|ELF01778.1| YcfD protein [Escherichia coli KTE116]
 gi|431319649|gb|ELG07319.1| YcfD protein [Escherichia coli KTE54]
 gi|431418642|gb|ELH01037.1| YcfD protein [Escherichia coli KTE158]
 gi|431443379|gb|ELH24456.1| YcfD protein [Escherichia coli KTE190]
 gi|431535398|gb|ELI11778.1| YcfD protein [Escherichia coli KTE105]
 gi|431574102|gb|ELI46886.1| YcfD protein [Escherichia coli KTE122]
 gi|431588508|gb|ELI59783.1| YcfD protein [Escherichia coli KTE128]
 gi|431680658|gb|ELJ46481.1| YcfD protein [Escherichia coli KTE177]
 gi|431708250|gb|ELJ72765.1| YcfD protein [Escherichia coli KTE82]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|417596243|ref|ZP_12246896.1| cupin superfamily protein [Escherichia coli 3030-1]
 gi|345356953|gb|EGW89152.1| cupin superfamily protein [Escherichia coli 3030-1]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLDETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|420099893|ref|ZP_14611098.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9455]
 gi|394422149|gb|EJE95541.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9455]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|419909946|ref|ZP_14428479.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10026]
 gi|388372311|gb|EIL35745.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10026]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|419903240|ref|ZP_14422333.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM9942]
 gi|388371848|gb|EIL35302.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM9942]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|156362161|ref|XP_001625649.1| predicted protein [Nematostella vectensis]
 gi|156212492|gb|EDO33549.1| predicted protein [Nematostella vectensis]
          Length = 368

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 14/98 (14%)

Query: 207 AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQAD----------R 256
           AW   +  A+  A++ A  +D++FRRGLPI ++   GL     +DIQ D           
Sbjct: 1   AWGHFISTALTDAIEKAMESDVKFRRGLPINFVSQLGLG----MDIQGDDEKAIASKKEN 56

Query: 257 LAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
            + K+ LK+LL  LV++VD ++  D+M    M   LPP
Sbjct: 57  ESFKKQLKELLIALVDHVDANETSDKMASDFMASRLPP 94


>gi|417264874|ref|ZP_12052256.1| cupin domain protein, PF06172 family [Escherichia coli 2.3916]
 gi|418302295|ref|ZP_12914089.1| cupin superfamily protein [Escherichia coli UMNF18]
 gi|339414393|gb|AEJ56065.1| cupin superfamily protein [Escherichia coli UMNF18]
 gi|386221434|gb|EII43875.1| cupin domain protein, PF06172 family [Escherichia coli 2.3916]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|417690009|ref|ZP_12339235.1| cupin superfamily protein [Shigella boydii 5216-82]
 gi|332089270|gb|EGI94376.1| cupin superfamily protein [Shigella boydii 5216-82]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|417711927|ref|ZP_12360922.1| cupin superfamily protein [Shigella flexneri K-272]
 gi|417716570|ref|ZP_12365498.1| cupin superfamily protein [Shigella flexneri K-227]
 gi|333008335|gb|EGK27809.1| cupin superfamily protein [Shigella flexneri K-272]
 gi|333019823|gb|EGK39095.1| cupin superfamily protein [Shigella flexneri K-227]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|293414419|ref|ZP_06657068.1| ycfD protein [Escherichia coli B185]
 gi|291434477|gb|EFF07450.1| ycfD protein [Escherichia coli B185]
          Length = 373

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|260843427|ref|YP_003221205.1| cytoplasmic protein [Escherichia coli O103:H2 str. 12009]
 gi|300922658|ref|ZP_07138755.1| cupin family protein [Escherichia coli MS 182-1]
 gi|300949476|ref|ZP_07163474.1| cupin family protein [Escherichia coli MS 116-1]
 gi|300955534|ref|ZP_07167896.1| cupin family protein [Escherichia coli MS 175-1]
 gi|301328497|ref|ZP_07221563.1| cupin family protein [Escherichia coli MS 78-1]
 gi|301644546|ref|ZP_07244539.1| cupin family protein [Escherichia coli MS 146-1]
 gi|309796526|ref|ZP_07690933.1| cupin family protein [Escherichia coli MS 145-7]
 gi|331641672|ref|ZP_08342807.1| YcfD protein [Escherichia coli H736]
 gi|418958539|ref|ZP_13510451.1| cupin family protein [Escherichia coli J53]
 gi|422354516|ref|ZP_16435251.1| cupin family protein [Escherichia coli MS 117-3]
 gi|195182815|dbj|BAG66391.1| predicted protein [Escherichia coli O111:H-]
 gi|257758574|dbj|BAI30071.1| putative cytoplasmic protein [Escherichia coli O103:H2 str. 12009]
 gi|300317573|gb|EFJ67357.1| cupin family protein [Escherichia coli MS 175-1]
 gi|300421007|gb|EFK04318.1| cupin family protein [Escherichia coli MS 182-1]
 gi|300451112|gb|EFK14732.1| cupin family protein [Escherichia coli MS 116-1]
 gi|300845104|gb|EFK72864.1| cupin family protein [Escherichia coli MS 78-1]
 gi|301077128|gb|EFK91934.1| cupin family protein [Escherichia coli MS 146-1]
 gi|308119838|gb|EFO57100.1| cupin family protein [Escherichia coli MS 145-7]
 gi|324017500|gb|EGB86719.1| cupin family protein [Escherichia coli MS 117-3]
 gi|331038470|gb|EGI10690.1| YcfD protein [Escherichia coli H736]
 gi|384378753|gb|EIE36632.1| cupin family protein [Escherichia coli J53]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|38703940|ref|NP_309600.2| hypothetical protein ECs1573 [Escherichia coli O157:H7 str. Sakai]
 gi|168749710|ref|ZP_02774732.1| cupin family protein [Escherichia coli O157:H7 str. EC4113]
 gi|168756467|ref|ZP_02781474.1| cupin family protein [Escherichia coli O157:H7 str. EC4401]
 gi|168762519|ref|ZP_02787526.1| cupin family protein [Escherichia coli O157:H7 str. EC4501]
 gi|168770975|ref|ZP_02795982.1| cupin family protein [Escherichia coli O157:H7 str. EC4486]
 gi|168776081|ref|ZP_02801088.1| cupin family protein [Escherichia coli O157:H7 str. EC4196]
 gi|168782445|ref|ZP_02807452.1| cupin family protein [Escherichia coli O157:H7 str. EC4076]
 gi|168787580|ref|ZP_02812587.1| cupin family protein [Escherichia coli O157:H7 str. EC869]
 gi|168800481|ref|ZP_02825488.1| cupin family protein [Escherichia coli O157:H7 str. EC508]
 gi|208809504|ref|ZP_03251841.1| cupin family protein [Escherichia coli O157:H7 str. EC4206]
 gi|208815823|ref|ZP_03257002.1| cupin family protein [Escherichia coli O157:H7 str. EC4045]
 gi|208822630|ref|ZP_03262949.1| cupin family protein [Escherichia coli O157:H7 str. EC4042]
 gi|209395797|ref|YP_002270028.1| cupin family protein [Escherichia coli O157:H7 str. EC4115]
 gi|217328000|ref|ZP_03444082.1| cupin family protein [Escherichia coli O157:H7 str. TW14588]
 gi|254792568|ref|YP_003077405.1| hypothetical protein ECSP_1488 [Escherichia coli O157:H7 str.
           TW14359]
 gi|261227211|ref|ZP_05941492.1| hypothetical protein EscherichiacoliO157_21828 [Escherichia coli
           O157:H7 str. FRIK2000]
 gi|261258783|ref|ZP_05951316.1| hypothetical protein EscherichiacoliO157EcO_23686 [Escherichia coli
           O157:H7 str. FRIK966]
 gi|387506255|ref|YP_006158511.1| hypothetical protein ECO55CA74_06765 [Escherichia coli O55:H7 str.
           RM12579]
 gi|387882072|ref|YP_006312374.1| hypothetical protein CDCO157_1505 [Escherichia coli Xuzhou21]
 gi|416314246|ref|ZP_11658585.1| cupin family protein [Escherichia coli O157:H7 str. 1044]
 gi|416323935|ref|ZP_11665166.1| hypothetical protein ECoD_05544 [Escherichia coli O157:H7 str.
           EC1212]
 gi|416328815|ref|ZP_11668403.1| hypothetical protein ECF_03346 [Escherichia coli O157:H7 str. 1125]
 gi|416782286|ref|ZP_11877723.1| hypothetical protein ECO5101_10737 [Escherichia coli O157:H7 str.
           G5101]
 gi|416793484|ref|ZP_11882645.1| hypothetical protein ECO9389_04211 [Escherichia coli O157:H- str.
           493-89]
 gi|416804751|ref|ZP_11887506.1| hypothetical protein ECO2687_20326 [Escherichia coli O157:H- str. H
           2687]
 gi|416815872|ref|ZP_11892210.1| hypothetical protein ECO7815_16663 [Escherichia coli O55:H7 str.
           3256-97]
 gi|416825690|ref|ZP_11896799.1| hypothetical protein ECO5905_20143 [Escherichia coli O55:H7 str.
           USDA 5905]
 gi|419044329|ref|ZP_13591295.1| cupin superfamily protein [Escherichia coli DEC3A]
 gi|419050255|ref|ZP_13597156.1| cupin superfamily protein [Escherichia coli DEC3B]
 gi|419056418|ref|ZP_13603252.1| cupin superfamily protein [Escherichia coli DEC3C]
 gi|419061817|ref|ZP_13608577.1| cupin superfamily protein [Escherichia coli DEC3D]
 gi|419068488|ref|ZP_13614347.1| cupin superfamily protein [Escherichia coli DEC3E]
 gi|419074609|ref|ZP_13620162.1| cupin superfamily protein [Escherichia coli DEC3F]
 gi|419079827|ref|ZP_13625303.1| cupin superfamily protein [Escherichia coli DEC4A]
 gi|419085518|ref|ZP_13630911.1| cupin superfamily protein [Escherichia coli DEC4B]
 gi|419091511|ref|ZP_13636823.1| cupin superfamily protein [Escherichia coli DEC4C]
 gi|419097540|ref|ZP_13642771.1| cupin superfamily protein [Escherichia coli DEC4D]
 gi|419103281|ref|ZP_13648439.1| cupin superfamily protein [Escherichia coli DEC4E]
 gi|419108703|ref|ZP_13653796.1| cupin superfamily protein [Escherichia coli DEC4F]
 gi|419114088|ref|ZP_13659118.1| cupin superfamily protein [Escherichia coli DEC5A]
 gi|419119729|ref|ZP_13664707.1| cupin superfamily protein [Escherichia coli DEC5B]
 gi|419125552|ref|ZP_13670447.1| cupin superfamily protein [Escherichia coli DEC5C]
 gi|419130974|ref|ZP_13675821.1| cupin superfamily protein [Escherichia coli DEC5D]
 gi|419135815|ref|ZP_13680620.1| cupin superfamily protein [Escherichia coli DEC5E]
 gi|420268773|ref|ZP_14771167.1| hypothetical protein ECPA22_1679 [Escherichia coli PA22]
 gi|420274686|ref|ZP_14777003.1| hypothetical protein ECPA40_1935 [Escherichia coli PA40]
 gi|420279256|ref|ZP_14781521.1| hypothetical protein ECTW06591_0636 [Escherichia coli TW06591]
 gi|420285914|ref|ZP_14788123.1| hypothetical protein ECTW10246_1732 [Escherichia coli TW10246]
 gi|420297365|ref|ZP_14799445.1| hypothetical protein ECTW09109_1830 [Escherichia coli TW09109]
 gi|420303446|ref|ZP_14805462.1| hypothetical protein ECTW10119_2137 [Escherichia coli TW10119]
 gi|420308899|ref|ZP_14810859.1| hypothetical protein ECEC1738_1727 [Escherichia coli EC1738]
 gi|420314430|ref|ZP_14816328.1| cupin superfamily protein [Escherichia coli EC1734]
 gi|421811581|ref|ZP_16247359.1| hypothetical protein EC80416_1389 [Escherichia coli 8.0416]
 gi|421817695|ref|ZP_16253239.1| cupin superfamily protein [Escherichia coli 10.0821]
 gi|421823270|ref|ZP_16258689.1| hypothetical protein ECFRIK920_1700 [Escherichia coli FRIK920]
 gi|421830049|ref|ZP_16265367.1| hypothetical protein ECPA7_2204 [Escherichia coli PA7]
 gi|423684669|ref|ZP_17659503.1| hypothetical protein ECPA31_1534 [Escherichia coli PA31]
 gi|424076562|ref|ZP_17813768.1| hypothetical protein ECFDA505_1662 [Escherichia coli FDA505]
 gi|424082925|ref|ZP_17819639.1| hypothetical protein ECFDA517_1890 [Escherichia coli FDA517]
 gi|424089470|ref|ZP_17825605.1| hypothetical protein ECFRIK1996_1763 [Escherichia coli FRIK1996]
 gi|424095811|ref|ZP_17831405.1| hypothetical protein ECFRIK1985_1752 [Escherichia coli FRIK1985]
 gi|424102175|ref|ZP_17837201.1| hypothetical protein ECFRIK1990_1740 [Escherichia coli FRIK1990]
 gi|424108962|ref|ZP_17843407.1| hypothetical protein EC93001_1806 [Escherichia coli 93-001]
 gi|424114682|ref|ZP_17848752.1| hypothetical protein ECPA3_1596 [Escherichia coli PA3]
 gi|424121005|ref|ZP_17854567.1| hypothetical protein ECPA5_1625 [Escherichia coli PA5]
 gi|424127180|ref|ZP_17860273.1| hypothetical protein ECPA9_1774 [Escherichia coli PA9]
 gi|424133345|ref|ZP_17866017.1| hypothetical protein ECPA10_1771 [Escherichia coli PA10]
 gi|424139945|ref|ZP_17872073.1| hypothetical protein ECPA14_1718 [Escherichia coli PA14]
 gi|424146400|ref|ZP_17877997.1| hypothetical protein ECPA15_1865 [Escherichia coli PA15]
 gi|424152439|ref|ZP_17883562.1| hypothetical protein ECPA24_1622 [Escherichia coli PA24]
 gi|424209908|ref|ZP_17888998.1| hypothetical protein ECPA25_1468 [Escherichia coli PA25]
 gi|424292474|ref|ZP_17894896.1| hypothetical protein ECPA28_1803 [Escherichia coli PA28]
 gi|424438258|ref|ZP_17900665.1| hypothetical protein ECPA32_1677 [Escherichia coli PA32]
 gi|424454891|ref|ZP_17906277.1| hypothetical protein ECPA33_1665 [Escherichia coli PA33]
 gi|424461229|ref|ZP_17911973.1| hypothetical protein ECPA39_1698 [Escherichia coli PA39]
 gi|424467689|ref|ZP_17917734.1| hypothetical protein ECPA41_1732 [Escherichia coli PA41]
 gi|424474230|ref|ZP_17923766.1| hypothetical protein ECPA42_1841 [Escherichia coli PA42]
 gi|424480077|ref|ZP_17929224.1| hypothetical protein ECTW07945_1727 [Escherichia coli TW07945]
 gi|424486186|ref|ZP_17934931.1| hypothetical protein ECTW09098_1741 [Escherichia coli TW09098]
 gi|424492467|ref|ZP_17940607.1| hypothetical protein ECTW09195_1720 [Escherichia coli TW09195]
 gi|424499415|ref|ZP_17946564.1| hypothetical protein ECEC4203_1670 [Escherichia coli EC4203]
 gi|424505551|ref|ZP_17952209.1| hypothetical protein ECEC4196_1587 [Escherichia coli EC4196]
 gi|424511941|ref|ZP_17958000.1| hypothetical protein ECTW14313_1630 [Escherichia coli TW14313]
 gi|424519405|ref|ZP_17963722.1| hypothetical protein ECTW14301_1600 [Escherichia coli TW14301]
 gi|424525311|ref|ZP_17969216.1| hypothetical protein ECEC4421_1678 [Escherichia coli EC4421]
 gi|424531483|ref|ZP_17975006.1| hypothetical protein ECEC4422_1805 [Escherichia coli EC4422]
 gi|424537455|ref|ZP_17980595.1| hypothetical protein ECEC4013_1858 [Escherichia coli EC4013]
 gi|424543416|ref|ZP_17986084.1| hypothetical protein ECEC4402_1673 [Escherichia coli EC4402]
 gi|424549725|ref|ZP_17991792.1| hypothetical protein ECEC4439_1659 [Escherichia coli EC4439]
 gi|424555953|ref|ZP_17997551.1| hypothetical protein ECEC4436_1622 [Escherichia coli EC4436]
 gi|424562293|ref|ZP_18003470.1| hypothetical protein ECEC4437_1754 [Escherichia coli EC4437]
 gi|424568371|ref|ZP_18009151.1| hypothetical protein ECEC4448_1671 [Escherichia coli EC4448]
 gi|424574517|ref|ZP_18014815.1| hypothetical protein ECEC1845_1636 [Escherichia coli EC1845]
 gi|424580420|ref|ZP_18020254.1| hypothetical protein ECEC1863_1404 [Escherichia coli EC1863]
 gi|425097136|ref|ZP_18500054.1| cupin superfamily protein [Escherichia coli 3.4870]
 gi|425103325|ref|ZP_18505837.1| cupin superfamily protein [Escherichia coli 5.2239]
 gi|425109145|ref|ZP_18511281.1| hypothetical protein EC60172_1841 [Escherichia coli 6.0172]
 gi|425124902|ref|ZP_18526338.1| cupin superfamily protein [Escherichia coli 8.0586]
 gi|425130916|ref|ZP_18531917.1| cupin superfamily protein [Escherichia coli 8.2524]
 gi|425137309|ref|ZP_18537920.1| hypothetical protein EC100833_1902 [Escherichia coli 10.0833]
 gi|425143128|ref|ZP_18543314.1| cupin superfamily protein [Escherichia coli 10.0869]
 gi|425149310|ref|ZP_18549104.1| cupin superfamily protein [Escherichia coli 88.0221]
 gi|425155095|ref|ZP_18554537.1| hypothetical protein ECPA34_1783 [Escherichia coli PA34]
 gi|425161592|ref|ZP_18560644.1| hypothetical protein ECFDA506_2127 [Escherichia coli FDA506]
 gi|425167158|ref|ZP_18565840.1| hypothetical protein ECFDA507_1716 [Escherichia coli FDA507]
 gi|425173251|ref|ZP_18571575.1| hypothetical protein ECFDA504_1691 [Escherichia coli FDA504]
 gi|425179274|ref|ZP_18577202.1| hypothetical protein ECFRIK1999_1867 [Escherichia coli FRIK1999]
 gi|425185491|ref|ZP_18582981.1| hypothetical protein ECFRIK1997_1863 [Escherichia coli FRIK1997]
 gi|425192249|ref|ZP_18589257.1| hypothetical protein ECNE1487_2017 [Escherichia coli NE1487]
 gi|425198603|ref|ZP_18595112.1| hypothetical protein ECNE037_1933 [Escherichia coli NE037]
 gi|425205241|ref|ZP_18601253.1| hypothetical protein ECFRIK2001_2141 [Escherichia coli FRIK2001]
 gi|425210937|ref|ZP_18606552.1| hypothetical protein ECPA4_1823 [Escherichia coli PA4]
 gi|425217003|ref|ZP_18612185.1| hypothetical protein ECPA23_1646 [Escherichia coli PA23]
 gi|425223593|ref|ZP_18618299.1| hypothetical protein ECPA49_1833 [Escherichia coli PA49]
 gi|425229781|ref|ZP_18624055.1| hypothetical protein ECPA45_1810 [Escherichia coli PA45]
 gi|425235939|ref|ZP_18629831.1| hypothetical protein ECTT12B_1698 [Escherichia coli TT12B]
 gi|425242079|ref|ZP_18635592.1| hypothetical protein ECMA6_1932 [Escherichia coli MA6]
 gi|425248146|ref|ZP_18641233.1| hypothetical protein EC5905_1864 [Escherichia coli 5905]
 gi|425253942|ref|ZP_18646685.1| hypothetical protein ECCB7326_1676 [Escherichia coli CB7326]
 gi|425260158|ref|ZP_18652412.1| hypothetical protein ECEC96038_1549 [Escherichia coli EC96038]
 gi|425266317|ref|ZP_18658109.1| hypothetical protein EC5412_1682 [Escherichia coli 5412]
 gi|425293811|ref|ZP_18684243.1| hypothetical protein ECPA38_1677 [Escherichia coli PA38]
 gi|425310473|ref|ZP_18699832.1| hypothetical protein ECEC1735_1710 [Escherichia coli EC1735]
 gi|425316402|ref|ZP_18705363.1| hypothetical protein ECEC1736_1604 [Escherichia coli EC1736]
 gi|425322506|ref|ZP_18711049.1| hypothetical protein ECEC1737_1611 [Escherichia coli EC1737]
 gi|425328706|ref|ZP_18716799.1| hypothetical protein ECEC1846_1640 [Escherichia coli EC1846]
 gi|425334885|ref|ZP_18722481.1| hypothetical protein ECEC1847_1644 [Escherichia coli EC1847]
 gi|425341293|ref|ZP_18728390.1| hypothetical protein ECEC1848_1828 [Escherichia coli EC1848]
 gi|425347153|ref|ZP_18733838.1| hypothetical protein ECEC1849_1617 [Escherichia coli EC1849]
 gi|425353394|ref|ZP_18739653.1| hypothetical protein ECEC1850_1807 [Escherichia coli EC1850]
 gi|425359374|ref|ZP_18745222.1| hypothetical protein ECEC1856_1643 [Escherichia coli EC1856]
 gi|425365514|ref|ZP_18750917.1| hypothetical protein ECEC1862_1640 [Escherichia coli EC1862]
 gi|425371937|ref|ZP_18756775.1| hypothetical protein ECEC1864_1808 [Escherichia coli EC1864]
 gi|425384745|ref|ZP_18768494.1| hypothetical protein ECEC1866_1481 [Escherichia coli EC1866]
 gi|425391445|ref|ZP_18774773.1| hypothetical protein ECEC1868_1825 [Escherichia coli EC1868]
 gi|425397564|ref|ZP_18780474.1| hypothetical protein ECEC1869_1800 [Escherichia coli EC1869]
 gi|425403542|ref|ZP_18786011.1| hypothetical protein ECEC1870_1493 [Escherichia coli EC1870]
 gi|425410074|ref|ZP_18792092.1| hypothetical protein ECNE098_1819 [Escherichia coli NE098]
 gi|425416429|ref|ZP_18797913.1| hypothetical protein ECFRIK523_1705 [Escherichia coli FRIK523]
 gi|425427547|ref|ZP_18808442.1| hypothetical protein EC01304_1737 [Escherichia coli 0.1304]
 gi|428946109|ref|ZP_19018620.1| cupin superfamily protein [Escherichia coli 88.1467]
 gi|428952299|ref|ZP_19024292.1| cupin superfamily protein [Escherichia coli 88.1042]
 gi|428958084|ref|ZP_19029674.1| cupin superfamily protein [Escherichia coli 89.0511]
 gi|428964599|ref|ZP_19035623.1| cupin superfamily protein [Escherichia coli 90.0091]
 gi|428976894|ref|ZP_19046972.1| cupin superfamily protein [Escherichia coli 90.2281]
 gi|428982666|ref|ZP_19052309.1| cupin superfamily protein [Escherichia coli 93.0055]
 gi|428989095|ref|ZP_19058261.1| cupin superfamily protein [Escherichia coli 93.0056]
 gi|428994838|ref|ZP_19063652.1| cupin superfamily protein [Escherichia coli 94.0618]
 gi|429001083|ref|ZP_19069448.1| cupin superfamily protein [Escherichia coli 95.0183]
 gi|429007232|ref|ZP_19074985.1| cupin superfamily protein [Escherichia coli 95.1288]
 gi|429013676|ref|ZP_19080777.1| cupin superfamily protein [Escherichia coli 95.0943]
 gi|429019589|ref|ZP_19086288.1| cupin superfamily protein [Escherichia coli 96.0428]
 gi|429025367|ref|ZP_19091682.1| cupin superfamily protein [Escherichia coli 96.0427]
 gi|429031691|ref|ZP_19097443.1| cupin superfamily protein [Escherichia coli 96.0939]
 gi|429037854|ref|ZP_19103172.1| cupin superfamily protein [Escherichia coli 96.0932]
 gi|429043773|ref|ZP_19108687.1| cupin superfamily protein [Escherichia coli 96.0107]
 gi|429049505|ref|ZP_19114140.1| cupin superfamily protein [Escherichia coli 97.0003]
 gi|429054826|ref|ZP_19119267.1| cupin superfamily protein [Escherichia coli 97.1742]
 gi|429060502|ref|ZP_19124608.1| cupin superfamily protein [Escherichia coli 97.0007]
 gi|429066270|ref|ZP_19129984.1| cupin superfamily protein [Escherichia coli 99.0672]
 gi|429072537|ref|ZP_19135860.1| hypothetical protein EC990678_1670 [Escherichia coli 99.0678]
 gi|429077898|ref|ZP_19141088.1| cupin superfamily protein [Escherichia coli 99.0713]
 gi|429825427|ref|ZP_19356738.1| cupin superfamily protein [Escherichia coli 96.0109]
 gi|429831732|ref|ZP_19362387.1| cupin superfamily protein [Escherichia coli 97.0010]
 gi|444924016|ref|ZP_21243562.1| cupin superfamily protein [Escherichia coli 09BKT078844]
 gi|444930102|ref|ZP_21249221.1| cupin superfamily protein [Escherichia coli 99.0814]
 gi|444935351|ref|ZP_21254238.1| cupin superfamily protein [Escherichia coli 99.0815]
 gi|444941000|ref|ZP_21259607.1| cupin superfamily protein [Escherichia coli 99.0816]
 gi|444946604|ref|ZP_21264987.1| cupin superfamily protein [Escherichia coli 99.0839]
 gi|444952204|ref|ZP_21270381.1| cupin superfamily protein [Escherichia coli 99.0848]
 gi|444957645|ref|ZP_21275592.1| cupin superfamily protein [Escherichia coli 99.1753]
 gi|444962920|ref|ZP_21280623.1| cupin superfamily protein [Escherichia coli 99.1775]
 gi|444968632|ref|ZP_21286075.1| cupin superfamily protein [Escherichia coli 99.1793]
 gi|444974100|ref|ZP_21291331.1| cupin superfamily protein [Escherichia coli 99.1805]
 gi|444979277|ref|ZP_21296262.1| cupin superfamily protein [Escherichia coli ATCC 700728]
 gi|444984957|ref|ZP_21301795.1| cupin superfamily protein [Escherichia coli PA11]
 gi|444990220|ref|ZP_21306931.1| cupin superfamily protein [Escherichia coli PA19]
 gi|444995427|ref|ZP_21311996.1| cupin superfamily protein [Escherichia coli PA13]
 gi|445001071|ref|ZP_21317511.1| cupin superfamily protein [Escherichia coli PA2]
 gi|445006526|ref|ZP_21322834.1| cupin superfamily protein [Escherichia coli PA47]
 gi|445011617|ref|ZP_21327783.1| cupin superfamily protein [Escherichia coli PA48]
 gi|445017578|ref|ZP_21333588.1| cupin superfamily protein [Escherichia coli PA8]
 gi|445023158|ref|ZP_21339038.1| cupin superfamily protein [Escherichia coli 7.1982]
 gi|445028101|ref|ZP_21343849.1| cupin superfamily protein [Escherichia coli 99.1781]
 gi|445033677|ref|ZP_21349270.1| cupin superfamily protein [Escherichia coli 99.1762]
 gi|445039379|ref|ZP_21354815.1| cupin superfamily protein [Escherichia coli PA35]
 gi|445044625|ref|ZP_21359938.1| cupin superfamily protein [Escherichia coli 3.4880]
 gi|445050181|ref|ZP_21365311.1| cupin superfamily protein [Escherichia coli 95.0083]
 gi|445055884|ref|ZP_21370803.1| cupin superfamily protein [Escherichia coli 99.0670]
 gi|452967568|ref|ZP_21965795.1| hypothetical protein EC4009_RS02550 [Escherichia coli O157:H7 str.
           EC4009]
 gi|187768477|gb|EDU32321.1| cupin family protein [Escherichia coli O157:H7 str. EC4196]
 gi|188016012|gb|EDU54134.1| cupin family protein [Escherichia coli O157:H7 str. EC4113]
 gi|188999926|gb|EDU68912.1| cupin family protein [Escherichia coli O157:H7 str. EC4076]
 gi|189356328|gb|EDU74747.1| cupin family protein [Escherichia coli O157:H7 str. EC4401]
 gi|189360179|gb|EDU78598.1| cupin family protein [Escherichia coli O157:H7 str. EC4486]
 gi|189367200|gb|EDU85616.1| cupin family protein [Escherichia coli O157:H7 str. EC4501]
 gi|189372572|gb|EDU90988.1| cupin family protein [Escherichia coli O157:H7 str. EC869]
 gi|189377213|gb|EDU95629.1| cupin family protein [Escherichia coli O157:H7 str. EC508]
 gi|208729305|gb|EDZ78906.1| cupin family protein [Escherichia coli O157:H7 str. EC4206]
 gi|208732471|gb|EDZ81159.1| cupin family protein [Escherichia coli O157:H7 str. EC4045]
 gi|208738115|gb|EDZ85798.1| cupin family protein [Escherichia coli O157:H7 str. EC4042]
 gi|209157197|gb|ACI34630.1| cupin family protein [Escherichia coli O157:H7 str. EC4115]
 gi|217318427|gb|EEC26853.1| cupin family protein [Escherichia coli O157:H7 str. TW14588]
 gi|254591968|gb|ACT71329.1| conserved protein [Escherichia coli O157:H7 str. TW14359]
 gi|320187837|gb|EFW62507.1| hypothetical protein ECoD_05544 [Escherichia coli O157:H7 str.
           EC1212]
 gi|320637578|gb|EFX07378.1| hypothetical protein ECO5101_10737 [Escherichia coli O157:H7 str.
           G5101]
 gi|320643138|gb|EFX12339.1| hypothetical protein ECO9389_04211 [Escherichia coli O157:H- str.
           493-89]
 gi|320648596|gb|EFX17251.1| hypothetical protein ECO2687_20326 [Escherichia coli O157:H- str. H
           2687]
 gi|320653910|gb|EFX21984.1| hypothetical protein ECO7815_16663 [Escherichia coli O55:H7 str.
           3256-97 TW 07815]
 gi|320659391|gb|EFX26960.1| hypothetical protein ECO5905_20143 [Escherichia coli O55:H7 str.
           USDA 5905]
 gi|326339383|gb|EGD63195.1| cupin family protein [Escherichia coli O157:H7 str. 1044]
 gi|326341488|gb|EGD65279.1| hypothetical protein ECF_03346 [Escherichia coli O157:H7 str. 1125]
 gi|374358249|gb|AEZ39956.1| hypothetical protein ECO55CA74_06765 [Escherichia coli O55:H7 str.
           RM12579]
 gi|377898661|gb|EHU63020.1| cupin superfamily protein [Escherichia coli DEC3A]
 gi|377899127|gb|EHU63479.1| cupin superfamily protein [Escherichia coli DEC3B]
 gi|377909709|gb|EHU73908.1| cupin superfamily protein [Escherichia coli DEC3C]
 gi|377914957|gb|EHU79071.1| cupin superfamily protein [Escherichia coli DEC3D]
 gi|377917065|gb|EHU81133.1| cupin superfamily protein [Escherichia coli DEC3E]
 gi|377928060|gb|EHU91972.1| cupin superfamily protein [Escherichia coli DEC3F]
 gi|377931609|gb|EHU95469.1| cupin superfamily protein [Escherichia coli DEC4A]
 gi|377935412|gb|EHU99210.1| cupin superfamily protein [Escherichia coli DEC4B]
 gi|377947794|gb|EHV11452.1| cupin superfamily protein [Escherichia coli DEC4C]
 gi|377947896|gb|EHV11553.1| cupin superfamily protein [Escherichia coli DEC4D]
 gi|377952349|gb|EHV15938.1| cupin superfamily protein [Escherichia coli DEC4E]
 gi|377962981|gb|EHV26432.1| cupin superfamily protein [Escherichia coli DEC4F]
 gi|377964788|gb|EHV28223.1| cupin superfamily protein [Escherichia coli DEC5A]
 gi|377971372|gb|EHV34729.1| cupin superfamily protein [Escherichia coli DEC5B]
 gi|377977817|gb|EHV41100.1| cupin superfamily protein [Escherichia coli DEC5C]
 gi|377979545|gb|EHV42822.1| cupin superfamily protein [Escherichia coli DEC5D]
 gi|377986198|gb|EHV49395.1| cupin superfamily protein [Escherichia coli DEC5E]
 gi|386795530|gb|AFJ28564.1| hypothetical protein CDCO157_1505 [Escherichia coli Xuzhou21]
 gi|390648000|gb|EIN26819.1| hypothetical protein ECFRIK1996_1763 [Escherichia coli FRIK1996]
 gi|390648505|gb|EIN27198.1| hypothetical protein ECFDA517_1890 [Escherichia coli FDA517]
 gi|390649074|gb|EIN27689.1| hypothetical protein ECFDA505_1662 [Escherichia coli FDA505]
 gi|390666784|gb|EIN43872.1| hypothetical protein EC93001_1806 [Escherichia coli 93-001]
 gi|390668814|gb|EIN45563.1| hypothetical protein ECFRIK1985_1752 [Escherichia coli FRIK1985]
 gi|390668936|gb|EIN45652.1| hypothetical protein ECFRIK1990_1740 [Escherichia coli FRIK1990]
 gi|390686437|gb|EIN61795.1| hypothetical protein ECPA3_1596 [Escherichia coli PA3]
 gi|390688004|gb|EIN63140.1| hypothetical protein ECPA5_1625 [Escherichia coli PA5]
 gi|390688275|gb|EIN63355.1| hypothetical protein ECPA9_1774 [Escherichia coli PA9]
 gi|390705250|gb|EIN79004.1| hypothetical protein ECPA10_1771 [Escherichia coli PA10]
 gi|390706005|gb|EIN79627.1| hypothetical protein ECPA15_1865 [Escherichia coli PA15]
 gi|390706776|gb|EIN80280.1| hypothetical protein ECPA14_1718 [Escherichia coli PA14]
 gi|390718103|gb|EIN90866.1| hypothetical protein ECPA22_1679 [Escherichia coli PA22]
 gi|390730188|gb|EIO02249.1| hypothetical protein ECPA24_1622 [Escherichia coli PA24]
 gi|390730471|gb|EIO02486.1| hypothetical protein ECPA25_1468 [Escherichia coli PA25]
 gi|390732152|gb|EIO03884.1| hypothetical protein ECPA28_1803 [Escherichia coli PA28]
 gi|390748799|gb|EIO19145.1| hypothetical protein ECPA32_1677 [Escherichia coli PA32]
 gi|390748914|gb|EIO19228.1| hypothetical protein ECPA31_1534 [Escherichia coli PA31]
 gi|390750571|gb|EIO20610.1| hypothetical protein ECPA33_1665 [Escherichia coli PA33]
 gi|390760387|gb|EIO29716.1| hypothetical protein ECPA40_1935 [Escherichia coli PA40]
 gi|390772971|gb|EIO41452.1| hypothetical protein ECPA41_1732 [Escherichia coli PA41]
 gi|390774229|gb|EIO42499.1| hypothetical protein ECPA42_1841 [Escherichia coli PA42]
 gi|390774908|gb|EIO43018.1| hypothetical protein ECPA39_1698 [Escherichia coli PA39]
 gi|390784966|gb|EIO52522.1| hypothetical protein ECTW06591_0636 [Escherichia coli TW06591]
 gi|390792774|gb|EIO60122.1| hypothetical protein ECTW10246_1732 [Escherichia coli TW10246]
 gi|390801524|gb|EIO68577.1| hypothetical protein ECTW07945_1727 [Escherichia coli TW07945]
 gi|390810140|gb|EIO76910.1| hypothetical protein ECTW09109_1830 [Escherichia coli TW09109]
 gi|390816501|gb|EIO82985.1| hypothetical protein ECTW09098_1741 [Escherichia coli TW09098]
 gi|390817756|gb|EIO84171.1| hypothetical protein ECTW10119_2137 [Escherichia coli TW10119]
 gi|390834069|gb|EIO99040.1| hypothetical protein ECEC4203_1670 [Escherichia coli EC4203]
 gi|390835801|gb|EIP00483.1| hypothetical protein ECTW09195_1720 [Escherichia coli TW09195]
 gi|390837326|gb|EIP01752.1| hypothetical protein ECEC4196_1587 [Escherichia coli EC4196]
 gi|390853083|gb|EIP16164.1| hypothetical protein ECTW14301_1600 [Escherichia coli TW14301]
 gi|390854280|gb|EIP17161.1| hypothetical protein ECTW14313_1630 [Escherichia coli TW14313]
 gi|390855095|gb|EIP17845.1| hypothetical protein ECEC4421_1678 [Escherichia coli EC4421]
 gi|390867539|gb|EIP29335.1| hypothetical protein ECEC4422_1805 [Escherichia coli EC4422]
 gi|390871640|gb|EIP33029.1| hypothetical protein ECEC4013_1858 [Escherichia coli EC4013]
 gi|390880448|gb|EIP41127.1| hypothetical protein ECEC4402_1673 [Escherichia coli EC4402]
 gi|390883853|gb|EIP44247.1| hypothetical protein ECEC4439_1659 [Escherichia coli EC4439]
 gi|390888463|gb|EIP48329.1| hypothetical protein ECEC4436_1622 [Escherichia coli EC4436]
 gi|390902420|gb|EIP61516.1| hypothetical protein ECEC1738_1727 [Escherichia coli EC1738]
 gi|390903202|gb|EIP62258.1| hypothetical protein ECEC4437_1754 [Escherichia coli EC4437]
 gi|390906675|gb|EIP65559.1| hypothetical protein ECEC4448_1671 [Escherichia coli EC4448]
 gi|390910434|gb|EIP69171.1| cupin superfamily protein [Escherichia coli EC1734]
 gi|390922948|gb|EIP80933.1| hypothetical protein ECEC1863_1404 [Escherichia coli EC1863]
 gi|390923960|gb|EIP81789.1| hypothetical protein ECEC1845_1636 [Escherichia coli EC1845]
 gi|408069967|gb|EKH04345.1| hypothetical protein ECPA7_2204 [Escherichia coli PA7]
 gi|408074119|gb|EKH08408.1| hypothetical protein ECFRIK920_1700 [Escherichia coli FRIK920]
 gi|408083026|gb|EKH16945.1| hypothetical protein ECPA34_1783 [Escherichia coli PA34]
 gi|408085188|gb|EKH18847.1| hypothetical protein ECFDA506_2127 [Escherichia coli FDA506]
 gi|408088693|gb|EKH22041.1| hypothetical protein ECFDA507_1716 [Escherichia coli FDA507]
 gi|408099539|gb|EKH32184.1| hypothetical protein ECFDA504_1691 [Escherichia coli FDA504]
 gi|408104889|gb|EKH37115.1| hypothetical protein ECFRIK1999_1867 [Escherichia coli FRIK1999]
 gi|408111796|gb|EKH43503.1| hypothetical protein ECFRIK1997_1863 [Escherichia coli FRIK1997]
 gi|408116673|gb|EKH47954.1| hypothetical protein ECNE1487_2017 [Escherichia coli NE1487]
 gi|408124753|gb|EKH55401.1| hypothetical protein ECNE037_1933 [Escherichia coli NE037]
 gi|408128334|gb|EKH58714.1| hypothetical protein ECFRIK2001_2141 [Escherichia coli FRIK2001]
 gi|408134363|gb|EKH64195.1| hypothetical protein ECPA4_1823 [Escherichia coli PA4]
 gi|408146833|gb|EKH75892.1| hypothetical protein ECPA23_1646 [Escherichia coli PA23]
 gi|408147600|gb|EKH76534.1| hypothetical protein ECPA49_1833 [Escherichia coli PA49]
 gi|408151867|gb|EKH80334.1| hypothetical protein ECPA45_1810 [Escherichia coli PA45]
 gi|408164269|gb|EKH92082.1| hypothetical protein ECTT12B_1698 [Escherichia coli TT12B]
 gi|408167730|gb|EKH95213.1| hypothetical protein ECMA6_1932 [Escherichia coli MA6]
 gi|408168862|gb|EKH96222.1| hypothetical protein EC5905_1864 [Escherichia coli 5905]
 gi|408183320|gb|EKI09770.1| hypothetical protein ECCB7326_1676 [Escherichia coli CB7326]
 gi|408187291|gb|EKI13261.1| hypothetical protein EC5412_1682 [Escherichia coli 5412]
 gi|408187767|gb|EKI13688.1| hypothetical protein ECEC96038_1549 [Escherichia coli EC96038]
 gi|408226532|gb|EKI50168.1| hypothetical protein ECPA38_1677 [Escherichia coli PA38]
 gi|408233748|gb|EKI56824.1| hypothetical protein ECEC1735_1710 [Escherichia coli EC1735]
 gi|408245840|gb|EKI68189.1| hypothetical protein ECEC1736_1604 [Escherichia coli EC1736]
 gi|408248704|gb|EKI70710.1| hypothetical protein ECEC1737_1611 [Escherichia coli EC1737]
 gi|408254192|gb|EKI75735.1| hypothetical protein ECEC1846_1640 [Escherichia coli EC1846]
 gi|408264409|gb|EKI85208.1| hypothetical protein ECEC1847_1644 [Escherichia coli EC1847]
 gi|408266503|gb|EKI87098.1| hypothetical protein ECEC1848_1828 [Escherichia coli EC1848]
 gi|408272547|gb|EKI92627.1| hypothetical protein ECEC1849_1617 [Escherichia coli EC1849]
 gi|408281192|gb|EKJ00624.1| hypothetical protein ECEC1850_1807 [Escherichia coli EC1850]
 gi|408283054|gb|EKJ02282.1| hypothetical protein ECEC1856_1643 [Escherichia coli EC1856]
 gi|408296802|gb|EKJ14990.1| hypothetical protein ECEC1862_1640 [Escherichia coli EC1862]
 gi|408297399|gb|EKJ15495.1| hypothetical protein ECEC1864_1808 [Escherichia coli EC1864]
 gi|408313492|gb|EKJ30117.1| hypothetical protein ECEC1868_1825 [Escherichia coli EC1868]
 gi|408313544|gb|EKJ30161.1| hypothetical protein ECEC1866_1481 [Escherichia coli EC1866]
 gi|408327838|gb|EKJ43471.1| hypothetical protein ECEC1869_1800 [Escherichia coli EC1869]
 gi|408331817|gb|EKJ46956.1| hypothetical protein ECNE098_1819 [Escherichia coli NE098]
 gi|408332241|gb|EKJ47301.1| hypothetical protein ECEC1870_1493 [Escherichia coli EC1870]
 gi|408346856|gb|EKJ61120.1| hypothetical protein ECFRIK523_1705 [Escherichia coli FRIK523]
 gi|408351818|gb|EKJ65519.1| hypothetical protein EC01304_1737 [Escherichia coli 0.1304]
 gi|408555303|gb|EKK32114.1| cupin superfamily protein [Escherichia coli 5.2239]
 gi|408555897|gb|EKK32627.1| cupin superfamily protein [Escherichia coli 3.4870]
 gi|408556083|gb|EKK32777.1| hypothetical protein EC60172_1841 [Escherichia coli 6.0172]
 gi|408581130|gb|EKK56484.1| cupin superfamily protein [Escherichia coli 8.0586]
 gi|408585815|gb|EKK60642.1| cupin superfamily protein [Escherichia coli 8.2524]
 gi|408586043|gb|EKK60827.1| hypothetical protein EC100833_1902 [Escherichia coli 10.0833]
 gi|408599403|gb|EKK73313.1| cupin superfamily protein [Escherichia coli 10.0869]
 gi|408603662|gb|EKK77294.1| cupin superfamily protein [Escherichia coli 88.0221]
 gi|408603915|gb|EKK77518.1| hypothetical protein EC80416_1389 [Escherichia coli 8.0416]
 gi|408615568|gb|EKK88760.1| cupin superfamily protein [Escherichia coli 10.0821]
 gi|427211550|gb|EKV81296.1| cupin superfamily protein [Escherichia coli 88.1042]
 gi|427213076|gb|EKV82551.1| cupin superfamily protein [Escherichia coli 88.1467]
 gi|427213573|gb|EKV82980.1| cupin superfamily protein [Escherichia coli 89.0511]
 gi|427231045|gb|EKV99122.1| cupin superfamily protein [Escherichia coli 90.0091]
 gi|427231157|gb|EKV99213.1| cupin superfamily protein [Escherichia coli 90.2281]
 gi|427248462|gb|EKW15436.1| cupin superfamily protein [Escherichia coli 93.0056]
 gi|427249833|gb|EKW16581.1| cupin superfamily protein [Escherichia coli 93.0055]
 gi|427250704|gb|EKW17360.1| cupin superfamily protein [Escherichia coli 94.0618]
 gi|427267119|gb|EKW32483.1| cupin superfamily protein [Escherichia coli 95.0943]
 gi|427267289|gb|EKW32617.1| cupin superfamily protein [Escherichia coli 95.0183]
 gi|427269155|gb|EKW34151.1| cupin superfamily protein [Escherichia coli 95.1288]
 gi|427284073|gb|EKW48199.1| cupin superfamily protein [Escherichia coli 96.0428]
 gi|427288343|gb|EKW51985.1| cupin superfamily protein [Escherichia coli 96.0427]
 gi|427289118|gb|EKW52706.1| cupin superfamily protein [Escherichia coli 96.0939]
 gi|427301083|gb|EKW63974.1| cupin superfamily protein [Escherichia coli 96.0932]
 gi|427303253|gb|EKW65982.1| cupin superfamily protein [Escherichia coli 97.0003]
 gi|427307089|gb|EKW69574.1| cupin superfamily protein [Escherichia coli 96.0107]
 gi|427318816|gb|EKW80670.1| cupin superfamily protein [Escherichia coli 97.1742]
 gi|427320255|gb|EKW82031.1| cupin superfamily protein [Escherichia coli 97.0007]
 gi|427331043|gb|EKW92288.1| cupin superfamily protein [Escherichia coli 99.0672]
 gi|427332006|gb|EKW93183.1| cupin superfamily protein [Escherichia coli 99.0713]
 gi|427332056|gb|EKW93232.1| hypothetical protein EC990678_1670 [Escherichia coli 99.0678]
 gi|429257652|gb|EKY41666.1| cupin superfamily protein [Escherichia coli 96.0109]
 gi|429259051|gb|EKY42797.1| cupin superfamily protein [Escherichia coli 97.0010]
 gi|444541225|gb|ELV20772.1| cupin superfamily protein [Escherichia coli 99.0814]
 gi|444546042|gb|ELV24820.1| cupin superfamily protein [Escherichia coli 09BKT078844]
 gi|444550103|gb|ELV28234.1| cupin superfamily protein [Escherichia coli 99.0815]
 gi|444562328|gb|ELV39398.1| cupin superfamily protein [Escherichia coli 99.0839]
 gi|444564111|gb|ELV41070.1| cupin superfamily protein [Escherichia coli 99.0816]
 gi|444568178|gb|ELV44869.1| cupin superfamily protein [Escherichia coli 99.0848]
 gi|444578740|gb|ELV54781.1| cupin superfamily protein [Escherichia coli 99.1753]
 gi|444582275|gb|ELV58076.1| cupin superfamily protein [Escherichia coli 99.1775]
 gi|444583924|gb|ELV59604.1| cupin superfamily protein [Escherichia coli 99.1793]
 gi|444597453|gb|ELV72447.1| cupin superfamily protein [Escherichia coli PA11]
 gi|444598867|gb|ELV73780.1| cupin superfamily protein [Escherichia coli ATCC 700728]
 gi|444602639|gb|ELV77378.1| cupin superfamily protein [Escherichia coli 99.1805]
 gi|444611342|gb|ELV85682.1| cupin superfamily protein [Escherichia coli PA19]
 gi|444611386|gb|ELV85722.1| cupin superfamily protein [Escherichia coli PA13]
 gi|444619883|gb|ELV93908.1| cupin superfamily protein [Escherichia coli PA2]
 gi|444629472|gb|ELW03169.1| cupin superfamily protein [Escherichia coli PA47]
 gi|444629674|gb|ELW03357.1| cupin superfamily protein [Escherichia coli PA48]
 gi|444633529|gb|ELW07050.1| cupin superfamily protein [Escherichia coli PA8]
 gi|444643469|gb|ELW16620.1| cupin superfamily protein [Escherichia coli 7.1982]
 gi|444647120|gb|ELW20101.1| cupin superfamily protein [Escherichia coli 99.1781]
 gi|444650310|gb|ELW23151.1| cupin superfamily protein [Escherichia coli 99.1762]
 gi|444659700|gb|ELW32104.1| cupin superfamily protein [Escherichia coli PA35]
 gi|444664860|gb|ELW37022.1| cupin superfamily protein [Escherichia coli 3.4880]
 gi|444670531|gb|ELW42417.1| cupin superfamily protein [Escherichia coli 95.0083]
 gi|444672728|gb|ELW44416.1| cupin superfamily protein [Escherichia coli 99.0670]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|425304659|ref|ZP_18694421.1| hypothetical protein ECN1_1102 [Escherichia coli N1]
 gi|408230791|gb|EKI54150.1| hypothetical protein ECN1_1102 [Escherichia coli N1]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|420367270|ref|ZP_14868067.1| cupin superfamily protein [Shigella flexneri 1235-66]
 gi|391323472|gb|EIQ80123.1| cupin superfamily protein [Shigella flexneri 1235-66]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPLSPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQLKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|416336251|ref|ZP_11672899.1| hypothetical protein EcoM_02298 [Escherichia coli WV_060327]
 gi|320195869|gb|EFW70494.1| hypothetical protein EcoM_02298 [Escherichia coli WV_060327]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|300896843|ref|ZP_07115338.1| cupin family protein [Escherichia coli MS 198-1]
 gi|300359325|gb|EFJ75195.1| cupin family protein [Escherichia coli MS 198-1]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|424743477|ref|ZP_18171787.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           WC-141]
 gi|422943311|gb|EKU38333.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           WC-141]
          Length = 299

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/141 (24%), Positives = 67/141 (47%), Gaps = 4/141 (2%)

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT-PPTAQ 115
           +  +    I++   ++G ++     Q +I  +  +    Q+ F C + AN Y      + 
Sbjct: 80  IHTKMRKDIIYSMIKDGGTLVIDRCQEFIPNINNICQIFQQYFRCPISANLYYCYKKDSS 139

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
            F  H+DD +   +QL G KKW +  P   +  LP + S ++ +     P + +  +   
Sbjct: 140 SFGMHFDDHDVISIQLFGDKKWSISSPTFKNP-LPHHKSFHYDKPCENFPEICIHNKING 198

Query: 176 LLYLPRGYIH--QASTVTNEH 194
           L+++PRGY H  +A++ +N H
Sbjct: 199 LVFIPRGYWHLVEATSSSNVH 219


>gi|365106156|ref|ZP_09335034.1| hypothetical protein HMPREF9428_00903 [Citrobacter freundii
           4_7_47CFAA]
 gi|395227824|ref|ZP_10406150.1| cupin family protein [Citrobacter sp. A1]
 gi|421847783|ref|ZP_16280915.1| hypothetical protein D186_22101 [Citrobacter freundii ATCC 8090 =
           MTCC 1658]
 gi|424728473|ref|ZP_18157078.1| cupin family protein [Citrobacter sp. L17]
 gi|363642555|gb|EHL81904.1| hypothetical protein HMPREF9428_00903 [Citrobacter freundii
           4_7_47CFAA]
 gi|394719152|gb|EJF24773.1| cupin family protein [Citrobacter sp. A1]
 gi|411770852|gb|EKS54590.1| hypothetical protein D186_22101 [Citrobacter freundii ATCC 8090 =
           MTCC 1658]
 gi|422896344|gb|EKU36126.1| cupin family protein [Citrobacter sp. L17]
 gi|455642176|gb|EMF21342.1| hypothetical protein H262_17086 [Citrobacter freundii GTC 09479]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPLSPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQLKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|157961821|ref|YP_001501855.1| cupin 4 family protein [Shewanella pealeana ATCC 700345]
 gi|157846821|gb|ABV87320.1| Cupin 4 family protein [Shewanella pealeana ATCC 700345]
          Length = 381

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/210 (25%), Positives = 92/210 (43%), Gaps = 36/210 (17%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K+PL I +  KN + +L++  ++  +  D ++E ++     +Y    R+  D +  
Sbjct: 16  EYWQKKPLVIRQGFKN-FQDLLSPDEMAGLACDEMVESRR-----VY----REKGDWQAE 65

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLN---ASLQELFGCFVGAN---------TY 108
             P   ++H         L  + +   +Q LN    + ++L  CF             +Y
Sbjct: 66  FGPFESYEH---------LGEKDWTLIVQALNNWVPAAEDLLKCFDFIPRWRLDDVMVSY 116

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
             P    G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+ 
Sbjct: 117 AVP--GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFDPIID 171

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           V L PGD+LYLP GY H   T+    S  V
Sbjct: 172 VELLPGDILYLPPGYPHDGVTLEPSMSFSV 201


>gi|410861551|ref|YP_006976785.1| hypothetical protein amad1_09615 [Alteromonas macleodii AltDE1]
 gi|410818813|gb|AFV85430.1| hypothetical protein amad1_09615 [Alteromonas macleodii AltDE1]
          Length = 388

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G   H D  + F++Q +GK+ W+V  P    E  P    P   Q E  TP++ V +EPGD
Sbjct: 132 GVGAHIDQYDVFLVQGKGKRHWRVGQPGDYKEVFPH---PKLRQIERFTPVIDVVVEPGD 188

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           ++Y+P G+ H   TV  E SL  ++ 
Sbjct: 189 VIYVPPGWPHDGETV--EDSLTYSVG 212


>gi|24112531|ref|NP_707041.1| hypothetical protein SF1130 [Shigella flexneri 2a str. 301]
 gi|26247349|ref|NP_753389.1| hypothetical protein c1480 [Escherichia coli CFT073]
 gi|30062659|ref|NP_836830.1| hypothetical protein S1210 [Shigella flexneri 2a str. 2457T]
 gi|74311687|ref|YP_310106.1| hypothetical protein SSON_1146 [Shigella sonnei Ss046]
 gi|82544385|ref|YP_408332.1| hypothetical protein SBO_1911 [Shigella boydii Sb227]
 gi|91210284|ref|YP_540270.1| hypothetical protein UTI89_C1257 [Escherichia coli UTI89]
 gi|227886515|ref|ZP_04004320.1| cupin family protein [Escherichia coli 83972]
 gi|237706886|ref|ZP_04537367.1| YcfD [Escherichia sp. 3_2_53FAA]
 gi|300816831|ref|ZP_07097051.1| cupin family protein [Escherichia coli MS 107-1]
 gi|300821088|ref|ZP_07101237.1| cupin family protein [Escherichia coli MS 119-7]
 gi|300906953|ref|ZP_07124625.1| cupin family protein [Escherichia coli MS 84-1]
 gi|300917901|ref|ZP_07134534.1| cupin family protein [Escherichia coli MS 115-1]
 gi|300928314|ref|ZP_07143850.1| cupin family protein [Escherichia coli MS 187-1]
 gi|300938700|ref|ZP_07153424.1| cupin family protein [Escherichia coli MS 21-1]
 gi|300975594|ref|ZP_07173073.1| cupin family protein [Escherichia coli MS 45-1]
 gi|300982387|ref|ZP_07176085.1| cupin family protein [Escherichia coli MS 200-1]
 gi|301024586|ref|ZP_07188258.1| cupin family protein [Escherichia coli MS 69-1]
 gi|301051082|ref|ZP_07197916.1| cupin family protein [Escherichia coli MS 185-1]
 gi|301306072|ref|ZP_07212150.1| cupin family protein [Escherichia coli MS 124-1]
 gi|331646387|ref|ZP_08347490.1| YcfD protein [Escherichia coli M605]
 gi|331652179|ref|ZP_08353198.1| YcfD protein [Escherichia coli M718]
 gi|331672641|ref|ZP_08373430.1| YcfD protein [Escherichia coli TA280]
 gi|331676920|ref|ZP_08377616.1| YcfD protein [Escherichia coli H591]
 gi|332279684|ref|ZP_08392097.1| conserved hypothetical protein [Shigella sp. D9]
 gi|384542756|ref|YP_005726818.1| cupin [Shigella flexneri 2002017]
 gi|386628834|ref|YP_006148554.1| hypothetical protein i02_1350 [Escherichia coli str. 'clone D i2']
 gi|386633754|ref|YP_006153473.1| hypothetical protein i14_1350 [Escherichia coli str. 'clone D i14']
 gi|386638637|ref|YP_006105435.1| cupin superfamily protein family [Escherichia coli ABU 83972]
 gi|387606654|ref|YP_006095510.1| hypothetical protein EC042_1199 [Escherichia coli 042]
 gi|415868183|ref|ZP_11539765.1| cupin family protein [Escherichia coli MS 85-1]
 gi|415881115|ref|ZP_11545351.1| putative cytoplasmic protein [Escherichia coli MS 79-10]
 gi|419804302|ref|ZP_14329462.1| cupin family protein [Escherichia coli AI27]
 gi|422358729|ref|ZP_16439378.1| cupin family protein [Escherichia coli MS 110-3]
 gi|422364209|ref|ZP_16444736.1| cupin family protein [Escherichia coli MS 153-1]
 gi|422371627|ref|ZP_16452004.1| cupin family protein [Escherichia coli MS 16-3]
 gi|422375446|ref|ZP_16455712.1| cupin family protein [Escherichia coli MS 60-1]
 gi|422382846|ref|ZP_16463001.1| cupin family protein [Escherichia coli MS 57-2]
 gi|424837586|ref|ZP_18262223.1| hypothetical protein SF5M90T_1126 [Shigella flexneri 5a str. M90T]
 gi|427804287|ref|ZP_18971354.1| hypothetical protein BN16_16971 [Escherichia coli chi7122]
 gi|427808868|ref|ZP_18975933.1| hypothetical protein BN17_09911 [Escherichia coli]
 gi|26107750|gb|AAN79949.1|AE016759_223 Hypothetical protein ycfD [Escherichia coli CFT073]
 gi|24051423|gb|AAN42748.1| conserved hypothetical protein [Shigella flexneri 2a str. 301]
 gi|30040907|gb|AAP16637.1| hypothetical protein S1210 [Shigella flexneri 2a str. 2457T]
 gi|73855164|gb|AAZ87871.1| conserved hypothetical protein [Shigella sonnei Ss046]
 gi|81245796|gb|ABB66504.1| conserved hypothetical protein [Shigella boydii Sb227]
 gi|91071858|gb|ABE06739.1| YcfD protein [Escherichia coli UTI89]
 gi|226898096|gb|EEH84355.1| YcfD [Escherichia sp. 3_2_53FAA]
 gi|227836719|gb|EEJ47185.1| cupin family protein [Escherichia coli 83972]
 gi|281600541|gb|ADA73525.1| Cupin 4 family protein [Shigella flexneri 2002017]
 gi|284920954|emb|CBG34017.1| conserved hypothetical protein [Escherichia coli 042]
 gi|300297254|gb|EFJ53639.1| cupin family protein [Escherichia coli MS 185-1]
 gi|300307224|gb|EFJ61744.1| cupin family protein [Escherichia coli MS 200-1]
 gi|300396511|gb|EFJ80049.1| cupin family protein [Escherichia coli MS 69-1]
 gi|300401300|gb|EFJ84838.1| cupin family protein [Escherichia coli MS 84-1]
 gi|300410257|gb|EFJ93795.1| cupin family protein [Escherichia coli MS 45-1]
 gi|300414891|gb|EFJ98201.1| cupin family protein [Escherichia coli MS 115-1]
 gi|300456345|gb|EFK19838.1| cupin family protein [Escherichia coli MS 21-1]
 gi|300463700|gb|EFK27193.1| cupin family protein [Escherichia coli MS 187-1]
 gi|300526387|gb|EFK47456.1| cupin family protein [Escherichia coli MS 119-7]
 gi|300530605|gb|EFK51667.1| cupin family protein [Escherichia coli MS 107-1]
 gi|300838644|gb|EFK66404.1| cupin family protein [Escherichia coli MS 124-1]
 gi|307553129|gb|ADN45904.1| cupin superfamily protein family [Escherichia coli ABU 83972]
 gi|315252618|gb|EFU32586.1| cupin family protein [Escherichia coli MS 85-1]
 gi|315287475|gb|EFU46886.1| cupin family protein [Escherichia coli MS 110-3]
 gi|315293104|gb|EFU52456.1| cupin family protein [Escherichia coli MS 153-1]
 gi|315296597|gb|EFU55892.1| cupin family protein [Escherichia coli MS 16-3]
 gi|324005955|gb|EGB75174.1| cupin family protein [Escherichia coli MS 57-2]
 gi|324013221|gb|EGB82440.1| cupin family protein [Escherichia coli MS 60-1]
 gi|331045139|gb|EGI17266.1| YcfD protein [Escherichia coli M605]
 gi|331050457|gb|EGI22515.1| YcfD protein [Escherichia coli M718]
 gi|331070284|gb|EGI41650.1| YcfD protein [Escherichia coli TA280]
 gi|331075609|gb|EGI46907.1| YcfD protein [Escherichia coli H591]
 gi|332102036|gb|EGJ05382.1| conserved hypothetical protein [Shigella sp. D9]
 gi|342926227|gb|EGU94949.1| putative cytoplasmic protein [Escherichia coli MS 79-10]
 gi|355419733|gb|AER83930.1| hypothetical protein i02_1350 [Escherichia coli str. 'clone D i2']
 gi|355424653|gb|AER88849.1| hypothetical protein i14_1350 [Escherichia coli str. 'clone D i14']
 gi|383466638|gb|EID61659.1| hypothetical protein SF5M90T_1126 [Shigella flexneri 5a str. M90T]
 gi|384472731|gb|EIE56782.1| cupin family protein [Escherichia coli AI27]
 gi|412962469|emb|CCK46383.1| hypothetical protein BN16_16971 [Escherichia coli chi7122]
 gi|412969047|emb|CCJ43674.1| hypothetical protein BN17_09911 [Escherichia coli]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|15801303|ref|NP_287320.1| hypothetical protein Z1833 [Escherichia coli O157:H7 str. EDL933]
 gi|291282146|ref|YP_003498964.1| YcfD protein [Escherichia coli O55:H7 str. CB9615]
 gi|12514754|gb|AAG55932.1|AE005327_2 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933]
 gi|13361037|dbj|BAB34996.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
 gi|209772888|gb|ACI84756.1| hypothetical protein ECs1573 [Escherichia coli]
 gi|209772890|gb|ACI84757.1| hypothetical protein ECs1573 [Escherichia coli]
 gi|209772892|gb|ACI84758.1| hypothetical protein ECs1573 [Escherichia coli]
 gi|209772894|gb|ACI84759.1| hypothetical protein ECs1573 [Escherichia coli]
 gi|209772896|gb|ACI84760.1| hypothetical protein ECs1573 [Escherichia coli]
 gi|290762019|gb|ADD55980.1| YcfD protein [Escherichia coli O55:H7 str. CB9615]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|237731109|ref|ZP_04561590.1| YcfD protein [Citrobacter sp. 30_2]
 gi|226906648|gb|EEH92566.1| YcfD protein [Citrobacter sp. 30_2]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQLKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|110805140|ref|YP_688660.1| hypothetical protein SFV_1146 [Shigella flexneri 5 str. 8401]
 gi|110614688|gb|ABF03355.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHWPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|66044707|ref|YP_234548.1| hypothetical protein Psyr_1460 [Pseudomonas syringae pv. syringae
           B728a]
 gi|63255414|gb|AAY36510.1| hypothetical protein Psyr_1460 [Pseudomonas syringae pv. syringae
           B728a]
          Length = 209

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 18/140 (12%)

Query: 72  EGCSIRFLNPQTYIKPLQQLNASLQELF-GCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           EGC++     + Y     +    L++ F  C +  N + TPP +     H DD +   +Q
Sbjct: 72  EGCTLYLRKIENYDPSFFKFKTELEKEFPNCAIQLNAFFTPPWSCALTAHADDHDLLAIQ 131

Query: 131 LEGKKKWKVYLP------RMVDEY----LPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           + GKKKW  + P      + +D +    L    + NFS+E+         LE G   ++ 
Sbjct: 132 IAGKKKWSFWRPLSEVACKGLDAHQISVLSMEHAGNFSEEQ------EHVLEAGSYFFMG 185

Query: 181 RGYIHQASTVTNEHSLHVTI 200
           +G IH+A    +E S+HV++
Sbjct: 186 KGTIHKAQCF-DEPSIHVSV 204


>gi|212556965|gb|ACJ29419.1| Transcription factor jumonji, jmjC [Shewanella piezotolerans WP3]
          Length = 381

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 91/213 (42%), Gaps = 44/213 (20%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIE----FKKNIDITLYEDGQRQTMDV 57
           +W+K+PL I +  KN+  +L++ +++  +  D ++E    FK N      ++ Q +    
Sbjct: 17  YWQKKPLVIRQGFKNF-QDLLSPEEMAGLACDELVESRRVFKSN------DEWQAEFGPF 69

Query: 58  EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVGAN-------- 106
           E           Y E      L  + +   +Q LN  L   +EL  CF            
Sbjct: 70  ES----------YEE------LGEKDWTLIVQALNNWLPEAEELIQCFDFIPRWRFDDVM 113

Query: 107 -TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP 165
            +Y TP    G  PH D  + FI Q  G+++W+V       E+    + P     E   P
Sbjct: 114 VSYATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFEP 168

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           I+ V L PGD+LYLP GY H   T+    S  V
Sbjct: 169 IIDVELLPGDILYLPPGYPHDGVTLEPSMSFSV 201


>gi|416836453|ref|ZP_11902068.1| hypothetical protein ECOSU61_02838 [Escherichia coli O157:H7 str.
           LSU-61]
 gi|320664526|gb|EFX31677.1| hypothetical protein ECOSU61_02838 [Escherichia coli O157:H7 str.
           LSU-61]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|110641305|ref|YP_669035.1| hypothetical protein ECP_1123 [Escherichia coli 536]
 gi|110342897|gb|ABG69134.1| hypothetical protein YcfD [Escherichia coli 536]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHWPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|422782050|ref|ZP_16834835.1| cupin superfamily protein [Escherichia coli TW10509]
 gi|323976501|gb|EGB71589.1| cupin superfamily protein [Escherichia coli TW10509]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPSASLMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|291085500|ref|ZP_06353238.2| cupin family protein [Citrobacter youngae ATCC 29220]
 gi|291071160|gb|EFE09269.1| cupin family protein [Citrobacter youngae ATCC 29220]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPLSPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 69

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 70  -PFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQLKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|428970617|ref|ZP_19041136.1| cupin superfamily protein [Escherichia coli 90.0039]
 gi|427231353|gb|EKV99398.1| cupin superfamily protein [Escherichia coli 90.0039]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLLISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|238791836|ref|ZP_04635473.1| hypothetical protein yinte0001_15530 [Yersinia intermedia ATCC
           29909]
 gi|238728940|gb|EEQ20457.1| hypothetical protein yinte0001_15530 [Yersinia intermedia ATCC
           29909]
          Length = 373

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 95/203 (46%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  +E+G+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEEGRWQVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P +QL +  + +L   F       
Sbjct: 71  FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRQLSDWRMDDLMVSF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + +Y P    P+  Q +    I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQYCPH---PDLLQVDPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+   + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194


>gi|117623313|ref|YP_852226.1| hypothetical protein APECO1_210 [Escherichia coli APEC O1]
 gi|115512437|gb|ABJ00512.1| conserved hypothetical protein [Escherichia coli APEC O1]
          Length = 376

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVDEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|323453858|gb|EGB09729.1| hypothetical protein AURANDRAFT_63211 [Aureococcus anophagefferens]
          Length = 390

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 7/112 (6%)

Query: 94  SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP--- 150
           ++Q+  G    AN Y TPP A    PH D    F +QL G K+W +   R  D Y+P   
Sbjct: 140 AVQDFLGLPAQANVYVTPPGASAARPHVDRHCVFAVQLAGAKRWCL---READGYVPPPA 196

Query: 151 RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
           R  +   +    G     V L  G  LY+PRG  H      +  S+HV+  V
Sbjct: 197 RERATARADAAYGGE-RAVDLAAGGALYVPRGAPHCCENAGDALSVHVSFGV 247


>gi|416266586|ref|ZP_11641656.1| hypothetical protein SDB_01875 [Shigella dysenteriae CDC 74-1112]
 gi|420380495|ref|ZP_14879960.1| cupin superfamily protein [Shigella dysenteriae 225-75]
 gi|320175634|gb|EFW50726.1| hypothetical protein SDB_01875 [Shigella dysenteriae CDC 74-1112]
 gi|391302198|gb|EIQ60067.1| cupin superfamily protein [Shigella dysenteriae 225-75]
          Length = 373

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGERLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|308049708|ref|YP_003913274.1| cupin [Ferrimonas balearica DSM 9799]
 gi|307631898|gb|ADN76200.1| Cupin 4 family protein [Ferrimonas balearica DSM 9799]
          Length = 373

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 49/85 (57%), Gaps = 5/85 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI+Q EG+++W+V  P+   +Y  R +S   +  E   PI
Sbjct: 112 SFATP--GGGVGPHIDNYDVFIIQGEGERRWQVG-PK--GDYAARPNSGGMALVEDFNPI 166

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVT 191
           +   + PGD+LY+P GY H   T+T
Sbjct: 167 IDAVMSPGDVLYIPPGYPHCGQTLT 191


>gi|218689081|ref|YP_002397293.1| hypothetical protein ECED1_1272 [Escherichia coli ED1a]
 gi|218426645|emb|CAR07473.1| conserved hypothetical protein [Escherichia coli ED1a]
          Length = 373

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLHMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|336314360|ref|ZP_08569279.1| Cupin superfamily protein [Rheinheimera sp. A13L]
 gi|335881373|gb|EGM79253.1| Cupin superfamily protein [Rheinheimera sp. A13L]
          Length = 389

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/154 (29%), Positives = 66/154 (42%), Gaps = 29/154 (18%)

Query: 95  LQELFGCFVGANTYQTPPTAQG----FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
           L +    + G  T  +   A G    F  H+D  + F+LQL G+KKW V        Y P
Sbjct: 108 LAKQIAAYCGRQTVVSSYAAFGEKDSFRNHWDTRDVFVLQLIGRKKWTV--------YPP 159

Query: 151 RYSSPNFSQEE---------IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
            + +P F Q+            TP +   LE GD+ YLPRG+ H    V  E + H+ + 
Sbjct: 160 SFENPLFMQQSKDLEQSFPCSDTPYMEFMLEAGDIFYLPRGWWHNPMPV-GEETFHLAVG 218

Query: 202 VYQKTA-----WIDLLEKAMPKALQA--AGATDL 228
            +   A     WI      +P A ++  +G  DL
Sbjct: 219 TFPPFAINYVEWIMTQLPNLPAARKSLDSGEDDL 252


>gi|288960722|ref|YP_003451062.1| hypothetical protein AZL_a09870 [Azospirillum sp. B510]
 gi|288913030|dbj|BAI74518.1| hypothetical protein AZL_a09870 [Azospirillum sp. B510]
          Length = 315

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 44/126 (34%), Positives = 65/126 (51%), Gaps = 16/126 (12%)

Query: 106 NTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYL-------PRMVD--EYLPRYSSP- 155
           N Y++PP  +GFA H DD    +L LEG+K+W +         PR  +    L R ++P 
Sbjct: 119 NAYRSPP-GKGFALHVDDHPVVVLHLEGEKRWWLSRDIGVPDPPRGFNFPPGLKRLTTPW 177

Query: 156 -NFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
             + + + GT    +TL PGDLLY+P G  H+   VT   SL + +     TA +DL+  
Sbjct: 178 GVYERPDEGT-FQEITLRPGDLLYVPAGMWHRTQAVTASLSLTMAMVA---TAPMDLVRP 233

Query: 215 AMPKAL 220
            +  AL
Sbjct: 234 RLETAL 239


>gi|307261586|ref|ZP_07543254.1| hypothetical protein appser12_11470 [Actinobacillus
           pleuropneumoniae serovar 12 str. 1096]
 gi|306868709|gb|EFN00518.1| hypothetical protein appser12_11470 [Actinobacillus
           pleuropneumoniae serovar 12 str. 1096]
          Length = 296

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 14/110 (12%)

Query: 101 CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-------EYLPRYS 153
           C + ++ Y    T + F  H+D  + F +Q++G+K+W ++ P   D       + +P Y 
Sbjct: 31  CHIFSSLYVAFNTQRSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEY- 89

Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
            PN  ++++   IL   LE GD+LYLPRG+ H    V  E ++H+ + V+
Sbjct: 90  FPN--KDDVYMDIL---LEAGDILYLPRGWWHDPIPV-GEETVHLAVGVF 133


>gi|120598590|ref|YP_963164.1| cupin 4 family protein [Shewanella sp. W3-18-1]
 gi|120558683|gb|ABM24610.1| Cupin 4 family protein [Shewanella sp. W3-18-1]
          Length = 379

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E  T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P G+ H   T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192


>gi|384258940|ref|YP_005402874.1| transcription factor jumonji jmjC domain-containing protein
           [Rahnella aquatilis HX2]
 gi|380754916|gb|AFE59307.1| transcription factor jumonji jmjC domain-containing protein
           [Rahnella aquatilis HX2]
          Length = 373

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 93/206 (45%), Gaps = 26/206 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM-----D 56
           +W+K+P+ I +   N+ + +       D L    +E + +  +  ++DG+ Q       D
Sbjct: 16  YWQKKPVVIKKAFSNFVDPISP-----DELAGLAMEPEVDSRLVSHKDGEWQASNGPFED 70

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
            +G  L +  W    +  +   L     +KP + L +  L +L        +Y  P    
Sbjct: 71  FDG--LGETGWSLLAQAVNHWHLPAAELVKPFRVLPDWRLDDLM------ISYSVP--GG 120

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +  TPI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDALPMRQFCPH---PALLHVDPFTPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|153000993|ref|YP_001366674.1| cupin 4 family protein [Shewanella baltica OS185]
 gi|151365611|gb|ABS08611.1| Cupin 4 family protein [Shewanella baltica OS185]
          Length = 379

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E  T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P G+ H   T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192


>gi|421673713|ref|ZP_16113650.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           OIFC065]
 gi|421690504|ref|ZP_16130175.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           IS-116]
 gi|424058349|ref|ZP_17795846.1| hypothetical protein W9K_02677 [Acinetobacter baumannii Ab33333]
 gi|404564776|gb|EKA69955.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           IS-116]
 gi|404665591|gb|EKB33553.1| hypothetical protein W9K_02677 [Acinetobacter baumannii Ab33333]
 gi|410385931|gb|EKP38415.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           OIFC065]
          Length = 405

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/204 (26%), Positives = 99/204 (48%), Gaps = 20/204 (9%)

Query: 15  KNYYNNLITSKKIDDI-LRDNVIE--FKKNIDITL-----YEDGQRQTMDVEGRALPQIV 66
           +N +   IT ++ID++ LR ++I+  FK N +  +     Y +  +    +  R +   +
Sbjct: 42  RNSFVPQITDQQIDEVVLRADIIQDDFKLNFEGKVLPKNKYVESFQDVGKIRHRFIKAAL 101

Query: 67  WDHYREGCSIRFLNPQTYIKPL-QQLNASLQELFGCFVGANTYQTPPTA--QGFAPHYDD 123
            D+ R G ++  +  + Y +PL    +  ++E+      ++ Y +        F  H+D 
Sbjct: 102 VDYMRRGATL--IANKLYNEPLIYNYHKQIEEIANRPALSSLYISYGNEDDSSFRAHWDT 159

Query: 124 IEAFILQLEGKKKWKVYLPRMVDEY-LPRYSSPNFS---QEEIGTPILTVTLEPGDLLYL 179
            + F+LQL G+KKW +Y P    EY LP   S +     +     P + V LE GDLLY+
Sbjct: 160 RDVFVLQLVGRKKWILYEPNF--EYPLPSQQSKDMEHVVKCPTTEPYMDVILEEGDLLYV 217

Query: 180 PRGYIHQASTVTNEHSLHVTISVY 203
           PRG+ H       E ++H+++  Y
Sbjct: 218 PRGWWHDP-VPCGERTVHLSVGTY 240


>gi|322833762|ref|YP_004213789.1| transcription factor jumonji jmjC domain-containing protein
           [Rahnella sp. Y9602]
 gi|321168963|gb|ADW74662.1| transcription factor jumonji jmjC domain-containing protein
           [Rahnella sp. Y9602]
          Length = 373

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 93/206 (45%), Gaps = 26/206 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM-----D 56
           +W+K+P+ I +   N+ + +       D L    +E + +  +  ++DG+ Q       D
Sbjct: 16  YWQKKPVVIKKAFSNFVDPISP-----DELAGLAMEPEVDSRLVSHKDGEWQASNGPFED 70

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
            +G  L +  W    +  +   L     +KP + L +  L +L        +Y  P    
Sbjct: 71  FDG--LGETGWSLLAQAVNHWHLPAAELVKPFRVLPDWRLDDLM------ISYSVP--GG 120

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +  TPI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDALPMRQFCPH---PALLHVDPFTPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|170768059|ref|ZP_02902512.1| cupin family protein [Escherichia albertii TW07627]
 gi|170122825|gb|EDS91756.1| cupin family protein [Escherichia albertii TW07627]
          Length = 373

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  +++G+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPLSPDE-LAGLAMENEVDSRLVSHQNGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVDHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V   R + ++ P     +  Q E    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKRQMKQHCPHL---DLLQVEPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|217973047|ref|YP_002357798.1| cupin [Shewanella baltica OS223]
 gi|217498182|gb|ACK46375.1| Cupin 4 family protein [Shewanella baltica OS223]
          Length = 379

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E  T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P G+ H   T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192


>gi|146293332|ref|YP_001183756.1| cupin 4 family protein [Shewanella putrefaciens CN-32]
 gi|145565022|gb|ABP75957.1| Cupin 4 family protein [Shewanella putrefaciens CN-32]
          Length = 379

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E  T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P G+ H   T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192


>gi|417586060|ref|ZP_12236833.1| cupin superfamily protein [Escherichia coli STEC_C165-02]
 gi|345339216|gb|EGW71642.1| cupin superfamily protein [Escherichia coli STEC_C165-02]
          Length = 373

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPIYPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|126174694|ref|YP_001050843.1| cupin 4 family protein [Shewanella baltica OS155]
 gi|386341451|ref|YP_006037817.1| cupin [Shewanella baltica OS117]
 gi|125997899|gb|ABN61974.1| Cupin 4 family protein [Shewanella baltica OS155]
 gi|334863852|gb|AEH14323.1| Cupin 4 family protein [Shewanella baltica OS117]
          Length = 379

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E  T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P G+ H   T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192


>gi|419237278|ref|ZP_13780013.1| cupin superfamily protein [Escherichia coli DEC9C]
 gi|378086458|gb|EHW48334.1| cupin superfamily protein [Escherichia coli DEC9C]
          Length = 296

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|238749642|ref|ZP_04611147.1| hypothetical protein yrohd0001_29600 [Yersinia rohdei ATCC 43380]
 gi|238712297|gb|EEQ04510.1| hypothetical protein yrohd0001_29600 [Yersinia rohdei ATCC 43380]
          Length = 373

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 52/203 (25%), Positives = 91/203 (44%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ + L       D L    +E + +  +  +EDG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDPLSP-----DELAGLAMENEVDSRLVSHEDGRWQVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRALSDWRMDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTPMKQHCPH---PDLLQVEPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+   + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194


>gi|373949811|ref|ZP_09609772.1| Cupin 4 family protein [Shewanella baltica OS183]
 gi|386324356|ref|YP_006020473.1| cupin [Shewanella baltica BA175]
 gi|333818501|gb|AEG11167.1| Cupin 4 family protein [Shewanella baltica BA175]
 gi|373886411|gb|EHQ15303.1| Cupin 4 family protein [Shewanella baltica OS183]
          Length = 379

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E  T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P G+ H   T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192


>gi|165976530|ref|YP_001652123.1| hypothetical protein APJL_1123 [Actinobacillus pleuropneumoniae
           serovar 3 str. JL03]
 gi|307250358|ref|ZP_07532307.1| hypothetical protein appser4_11390 [Actinobacillus pleuropneumoniae
           serovar 4 str. M62]
 gi|165876631|gb|ABY69679.1| hypothetical protein APJL_1123 [Actinobacillus pleuropneumoniae
           serovar 3 str. JL03]
 gi|306857633|gb|EFM89740.1| hypothetical protein appser4_11390 [Actinobacillus pleuropneumoniae
           serovar 4 str. M62]
          Length = 396

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 14/110 (12%)

Query: 101 CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-------EYLPRYS 153
           C + ++ Y    T + F  H+D  + F +Q++G+K+W ++ P   D       + +P Y 
Sbjct: 131 CHIFSSLYVAFNTQRSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEY- 189

Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
            PN  ++++   IL   LE GD+LYLPRG+ H    V  E ++H+ + V+
Sbjct: 190 FPN--KDDVYIDIL---LEAGDILYLPRGWWHDPIPV-GEETVHLAVGVF 233


>gi|386314007|ref|YP_006010172.1| transcription factor jumonji jmjC domain protein [Shewanella
           putrefaciens 200]
 gi|319426632|gb|ADV54706.1| transcription factor jumonji jmjC domain protein [Shewanella
           putrefaciens 200]
          Length = 379

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E  T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P G+ H   T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192


>gi|359452147|ref|ZP_09241502.1| hypothetical protein P20495_0235 [Pseudoalteromonas sp. BSi20495]
 gi|358050741|dbj|GAA77751.1| hypothetical protein P20495_0235 [Pseudoalteromonas sp. BSi20495]
          Length = 381

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 57/198 (28%), Positives = 91/198 (45%), Gaps = 27/198 (13%)

Query: 3   WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQTM 55
           W+K+PL I +   ++ +    N +    +++ +   +I   KN   T    +ED ++ T 
Sbjct: 19  WQKKPLLIKQGFTDFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT- 77

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
           D     L Q V DH+      + L P  +I      N  + +L        +Y TP    
Sbjct: 78  DDHSTLLVQAV-DHWHPDAG-QLLEPFRFIP-----NWRIDDLM------ISYSTP--NG 122

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
           G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    ++   LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEAVIDCILEPG 179

Query: 175 DLLYLPRGYIHQASTVTN 192
           D+LY+P G  H+   V N
Sbjct: 180 DILYIPPGCPHEGYAVEN 197


>gi|421746995|ref|ZP_16184747.1| hypothetical protein B551_09866 [Cupriavidus necator HPC(L)]
 gi|409774421|gb|EKN56046.1| hypothetical protein B551_09866 [Cupriavidus necator HPC(L)]
          Length = 400

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 59/129 (45%), Gaps = 22/129 (17%)

Query: 87  PLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD 146
           P+++    +    G    AN Y +  +  GF  H DD +  ++Q++G+KKW         
Sbjct: 114 PVEKFTEQMGGTLGVRSSANAYMSFGSTSGFGRHNDDHDVIVIQIDGRKKWTF------- 166

Query: 147 EYLPRYSSPNFSQEEIGTPIL---------TVTLEPGDLLYLPRGYIHQASTVTNEHSLH 197
                  SPN  +  + T            T+T+  GD+L++P+G  H   ++  E SLH
Sbjct: 167 -----LESPNGFRHALVTETSDPEALKAGETITVSRGDILFIPKGTWHDVISI-GEPSLH 220

Query: 198 VTISVYQKT 206
           +T+S+   T
Sbjct: 221 LTVSLVYPT 229


>gi|383190924|ref|YP_005201052.1| hypothetical protein Rahaq2_3092 [Rahnella aquatilis CIP 78.65 =
           ATCC 33071]
 gi|371589182|gb|AEX52912.1| hypothetical protein Rahaq2_3092 [Rahnella aquatilis CIP 78.65 =
           ATCC 33071]
          Length = 373

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 93/206 (45%), Gaps = 26/206 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM-----D 56
           +W+K+P+ I +   N+ + +       D L    +E + +  +  ++DG+ Q       D
Sbjct: 16  YWQKKPVVIKKAFSNFVDPISP-----DELAGLAMEPEVDSRLVSHKDGEWQASNGPFED 70

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
            +G  L +  W    +  +   L     +KP + L +  L +L        +Y  P    
Sbjct: 71  FDG--LGETGWSLLAQAVNHWHLPAAELVKPFRVLPDWRLDDLM------ISYSVP--GG 120

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +  TPI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDALPMRQFCPH---PALLHVDPFTPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|161505213|ref|YP_001572325.1| hypothetical protein SARI_03351 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160866560|gb|ABX23183.1| hypothetical protein SARI_03351 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 268

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 52/100 (52%), Gaps = 3/100 (3%)

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
            +   + QL+G K++ +  P+   E    YS+  +  E +G P   V L PGD LY P  
Sbjct: 144 SVNGIVGQLQGVKEFTLVAPQYALEEGKFYST--YELETLGIPFECVVLHPGDFLYFPAH 201

Query: 183 YIHQASTVTNEHS-LHVTISVYQKTAWIDLLEKAMPKALQ 221
           + HQA T+    S +H T+++Y  TA++D     MP  +Q
Sbjct: 202 WWHQAKTIECSVSFIHSTVNLYNMTAFLDDAISQMPALIQ 241


>gi|419918254|ref|ZP_14436465.1| putative cytoplasmic protein [Escherichia coli KD2]
 gi|388391875|gb|EIL53315.1| putative cytoplasmic protein [Escherichia coli KD2]
          Length = 373

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKCPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|308186444|ref|YP_003930575.1| hypothetical protein Pvag_0932 [Pantoea vagans C9-1]
 gi|308056954|gb|ADO09126.1| Uncharacterized protein ycfD [Pantoea vagans C9-1]
          Length = 374

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 87/192 (45%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    V            G  PH
Sbjct: 67  -PFESYDHLGENNWSLLVQAVNHWHEPSAALMRPFRFLPDWRVDDLMVSFAVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKVALKQHCPH---PDLLQVEPFDAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|193066274|ref|ZP_03047326.1| cupin family protein [Escherichia coli E22]
 gi|192926123|gb|EDV80765.1| cupin family protein [Escherichia coli E22]
          Length = 373

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTATLMRPFRELPDWRIDDLMISFSVPGGGVDPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|399017092|ref|ZP_10719292.1| Cupin superfamily protein [Herbaspirillum sp. CF444]
 gi|398104272|gb|EJL94419.1| Cupin superfamily protein [Herbaspirillum sp. CF444]
          Length = 391

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 64/136 (47%), Gaps = 11/136 (8%)

Query: 71  REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
           REG +         +  +  L  +L    G    AN Y +  +  GF  H DD +  ++Q
Sbjct: 98  REGVTGVLEAVNELVPSVGNLTETLGGTLGAQSTANAYVSFGSISGFGIHNDDHDVIVIQ 157

Query: 131 LEGKKKWKVYLPRMVD----EYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
           LEG+K+W+ +    VD      +    +P+ S   +      + +  GD++++P+G  H 
Sbjct: 158 LEGRKRWQ-FFASSVDAKRKATVKDIQAPSASDRGV-----ELVISSGDVMFVPKGTWHD 211

Query: 187 ASTVTNEHSLHVTISV 202
              + NE SLH+TIS+
Sbjct: 212 VIAL-NEKSLHLTISL 226


>gi|405371091|ref|ZP_11026802.1| Cupin 4 [Chondromyces apiculatus DSM 436]
 gi|397089076|gb|EJJ20012.1| Cupin 4 [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 295

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 60/239 (25%), Positives = 102/239 (42%), Gaps = 25/239 (10%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           +H+++ P +           L T + ID ++ +   +         Y  G+ +T   E R
Sbjct: 18  EHYQRRP-FTGASSAERLQRLGTWETIDFLVEETACDVLLARQGVPYPGGRPRTAK-EAR 75

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            L       + +G ++    P  +   L QL  +        +  + Y TP    GF  H
Sbjct: 76  EL-------FAQGYTLALRQPDQHHPDLAQLARAFSAELHGRINLHIYCTPAGHHGFGWH 128

Query: 121 YDDIEAFILQLEGKKKW----KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
            D  E FILQ  G+K++        P  + E +P  S    +QE+  TP+ T  L  GD 
Sbjct: 129 CDPEEVFILQTAGRKEYLLRENTLHPVPLPESVP--SGTLAAQEK--TPVETHALGAGDF 184

Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           +Y+P GY H A    ++ +L ++I +   T  +DLL+      ++A  A    +RR +P
Sbjct: 185 IYIPGGYWHMAQ--ASDEALSISIGLMPPT-LLDLLD-----GVRATLAASPVWRRRMP 235


>gi|54308332|ref|YP_129352.1| hypothetical protein PBPRA1139 [Photobacterium profundum SS9]
 gi|46912760|emb|CAG19550.1| conserved hypothetical protein [Photobacterium profundum SS9]
          Length = 379

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 4/99 (4%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y TP  + G  PH D  + FI+Q  GK+ W+V   +  D+Y      P   Q +   PI
Sbjct: 114 SYSTP--SGGVGPHIDQYDVFIIQGSGKRHWRVGPKK--DDYEEELRHPALRQIKGFEPI 169

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
           +   +EPGD+LY+P G+ H    +    S  V     +K
Sbjct: 170 IDEVMEPGDILYIPPGFPHDGYAIETSMSFSVGFRSPKK 208


>gi|307248127|ref|ZP_07530155.1| hypothetical protein appser2_11080 [Actinobacillus pleuropneumoniae
           serovar 2 str. S1536]
 gi|306855304|gb|EFM87479.1| hypothetical protein appser2_11080 [Actinobacillus pleuropneumoniae
           serovar 2 str. S1536]
          Length = 374

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 58/114 (50%), Gaps = 22/114 (19%)

Query: 101 CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-----------EYL 149
           C + ++ Y    T + F  H+D  + F +Q++G+K+W ++ P   D           EY 
Sbjct: 131 CHIFSSLYVAFNTQRSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEYF 190

Query: 150 PRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
           P       +++++   IL   LE GD+LYLPRG+ H    V  E ++H+ + V+
Sbjct: 191 P-------NKDDVYMDIL---LEAGDILYLPRGWWHDPIPV-GEETVHLAVGVF 233


>gi|161503677|ref|YP_001570789.1| hypothetical protein SARI_01761 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160865024|gb|ABX21647.1| hypothetical protein SARI_01761 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 373

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q      R 
Sbjct: 16  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQV-----RH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFDAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|218549149|ref|YP_002382940.1| hypothetical protein EFER_1802 [Escherichia fergusonii ATCC 35469]
 gi|422805306|ref|ZP_16853738.1| cupin superfamily protein [Escherichia fergusonii B253]
 gi|218356690|emb|CAQ89317.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469]
 gi|324113919|gb|EGC07893.1| cupin superfamily protein [Escherichia fergusonii B253]
          Length = 373

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 81/194 (41%), Gaps = 18/194 (9%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE--DGQRQTMDVEG 59
           HW+K P+ I R     +NN I     D++     +  +  +D  L    DG+ Q      
Sbjct: 16  HWQKRPVVIKRG----FNNFIDPISPDELAG---LAMESEVDSRLVNNLDGKWQV----- 63

Query: 60  RALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
              P   +DH  E   S+       + +P   L    +EL    +            G  
Sbjct: 64  SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY
Sbjct: 124 PHLDQYDVFIIQGTGRRRWRVGEKLPMKQHCPH---PDLLQVEPFEAIIDEELEPGDILY 180

Query: 179 LPRGYIHQASTVTN 192
           +P G+ H+   + N
Sbjct: 181 IPPGFPHEGYALEN 194


>gi|332535585|ref|ZP_08411355.1| hypothetical protein PH505_da00110 [Pseudoalteromonas haloplanktis
           ANT/505]
 gi|332035003|gb|EGI71523.1| hypothetical protein PH505_da00110 [Pseudoalteromonas haloplanktis
           ANT/505]
          Length = 367

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 47/87 (54%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           +Y TP    G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    
Sbjct: 98  SYSTP--NGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEA 152

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           ++   LEPGD+LY+P G  H+   V N
Sbjct: 153 VIDCILEPGDILYIPPGCPHEGYAVEN 179


>gi|308807571|ref|XP_003081096.1| COG2850: Uncharacterized conserved protein (ISS) [Ostreococcus
           tauri]
 gi|116059558|emb|CAL55265.1| COG2850: Uncharacterized conserved protein (ISS) [Ostreococcus
           tauri]
          Length = 380

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 52/120 (43%), Gaps = 18/120 (15%)

Query: 95  LQELFGCF----VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV-----YLPRMV 145
           L ELF CF    V        P   G  PH D  + F+LQ EG+K W V     Y P   
Sbjct: 79  LLELFSCFPRWRVSDIQASVSPNGGGVGPHSDHFDVFLLQAEGEKHWAVADNEEYWPDND 138

Query: 146 DEYLPRYSS---PNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
             ++P         F +++      + TL PGD+LYLP    H    V +   + VT+S+
Sbjct: 139 RAFVPECEIRVLKEFVEDD------SFTLVPGDMLYLPPKVAHNGVAVNSAPGVSVTLSI 192


>gi|395763450|ref|ZP_10444119.1| hypothetical protein JPAM2_17087 [Janthinobacterium lividum PAMC
           25724]
          Length = 373

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 49/198 (24%), Positives = 85/198 (42%), Gaps = 25/198 (12%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W K+PL I R+    +  L   K + ++   + +E +    +  + DG          
Sbjct: 18  DYWHKKPLLI-RQAVPGFKALFDFKALAELATLDHVESR----LVSHADGHWNMQQGPLT 72

Query: 61  ALP---QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQG 116
           +LP   Q  W    +G ++        ++  + L +A L +L   F             G
Sbjct: 73  SLPSLKQKEWTLLVQGANLHSAKADALLRQFRFLPDARLDDLMVSFA--------TDGGG 124

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PH+D  + F+LQ +GK+ W++   + +   + LP     NF+ +E        TLEPG
Sbjct: 125 VGPHFDSYDVFLLQGQGKRHWRIGAQKDLSLIDGLPLKILSNFTPDE------EFTLEPG 178

Query: 175 DLLYLPRGYIHQASTVTN 192
           D+LYLP  Y H    + +
Sbjct: 179 DMLYLPPHYAHDGVAIGD 196


>gi|386822536|ref|ZP_10109743.1| cupin [Serratia plymuthica PRI-2C]
 gi|386380610|gb|EIJ21340.1| cupin [Serratia plymuthica PRI-2C]
          Length = 373

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 91/202 (45%), Gaps = 34/202 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70

Query: 62  LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
              +    W       DH+ E  S         ++P +QL +  + +L   F        
Sbjct: 71  FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
           +EPGD+LY+P G+ H+   + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194


>gi|424816550|ref|ZP_18241701.1| hypothetical protein ECD227_1667 [Escherichia fergusonii ECD227]
 gi|325497570|gb|EGC95429.1| hypothetical protein ECD227_1667 [Escherichia fergusonii ECD227]
          Length = 376

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 81/194 (41%), Gaps = 18/194 (9%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE--DGQRQTMDVEG 59
           HW+K P+ I R     +NN I     D++     +  +  +D  L    DG+ Q      
Sbjct: 19  HWQKRPVVIKRG----FNNFIDPISPDELAG---LAMESEVDSRLVNNLDGKWQV----- 66

Query: 60  RALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
              P   +DH  E   S+       + +P   L    +EL    +            G  
Sbjct: 67  SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 126

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY
Sbjct: 127 PHLDQYDVFIIQGTGRRRWRVGEKLPMKQHCPH---PDLLQVEPFEAIIDEELEPGDILY 183

Query: 179 LPRGYIHQASTVTN 192
           +P G+ H+   + N
Sbjct: 184 IPPGFPHEGYALEN 197


>gi|320167155|gb|EFW44054.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 879

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 48/166 (28%), Positives = 77/166 (46%), Gaps = 6/166 (3%)

Query: 39  KKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQEL 98
           + +ID  L+    RQT     +A  Q     Y  G  +       +I+ + +L   L  L
Sbjct: 223 RDDIDHPLWFASLRQTGPTSQQASSQEPHATYASGAPLVIDLQSHFIRKVDRLLHYLGGL 282

Query: 99  FGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM----VDEYLPRYSS 154
            G  V    +  P  A+  AP YD     ILQ+EGK  W++Y   M      + L R  +
Sbjct: 283 TGTRVSGFAHLAPRAARASAPRYDVDNLIILQMEGKSSWRLYRQEMAYPLAAKELARRQT 342

Query: 155 PNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTI 200
             F + E+  P   V L+ GDLL++PRG+I+   ++ ++   H+T+
Sbjct: 343 --FKRVELDAPEHQVVLQAGDLLHVPRGWIYDVESLADDQDFHLTV 386


>gi|421783093|ref|ZP_16219545.1| cupin superfamily protein [Serratia plymuthica A30]
 gi|407754799|gb|EKF64930.1| cupin superfamily protein [Serratia plymuthica A30]
          Length = 373

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 91/202 (45%), Gaps = 34/202 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70

Query: 62  LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
              +    W       DH+ E  S         ++P +QL +  + +L   F        
Sbjct: 71  FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
           +EPGD+LY+P G+ H+   + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194


>gi|333926821|ref|YP_004500400.1| cupin [Serratia sp. AS12]
 gi|333931775|ref|YP_004505353.1| cupin [Serratia plymuthica AS9]
 gi|386328644|ref|YP_006024814.1| cupin [Serratia sp. AS13]
 gi|333473382|gb|AEF45092.1| Cupin 4 family protein [Serratia plymuthica AS9]
 gi|333490881|gb|AEF50043.1| Cupin 4 family protein [Serratia sp. AS12]
 gi|333960977|gb|AEG27750.1| Cupin 4 family protein [Serratia sp. AS13]
          Length = 373

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 91/202 (45%), Gaps = 34/202 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70

Query: 62  LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
              +    W       DH+ E  S         ++P +QL +  + +L   F        
Sbjct: 71  FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
           +EPGD+LY+P G+ H+   + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194


>gi|303253321|ref|ZP_07339470.1| hypothetical protein APP2_0632 [Actinobacillus pleuropneumoniae
           serovar 2 str. 4226]
 gi|302648003|gb|EFL78210.1| hypothetical protein APP2_0632 [Actinobacillus pleuropneumoniae
           serovar 2 str. 4226]
          Length = 277

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 14/110 (12%)

Query: 101 CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-------EYLPRYS 153
           C + ++ Y    T + F  H+D  + F +Q++G+K+W ++ P   D       + +P Y 
Sbjct: 120 CHIFSSLYVAFNTQRSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEY- 178

Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
            PN  ++++   IL   LE GD+LYLPRG+ H    V  E ++H+ + V+
Sbjct: 179 FPN--KDDVYMDIL---LEAGDILYLPRGWWHDPIPV-GEETVHLAVGVF 222


>gi|270261451|ref|ZP_06189724.1| hypothetical protein SOD_a06830 [Serratia odorifera 4Rx13]
 gi|270044935|gb|EFA18026.1| hypothetical protein SOD_a06830 [Serratia odorifera 4Rx13]
          Length = 373

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 91/202 (45%), Gaps = 34/202 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70

Query: 62  LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
              +    W       DH+ E  S         ++P +QL +  + +L   F        
Sbjct: 71  FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
           +EPGD+LY+P G+ H+   + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194


>gi|442611348|ref|ZP_21026054.1| FIG002776: hypothetical protein [Pseudoalteromonas luteoviolacea B
           = ATCC 29581]
 gi|441747276|emb|CCQ12116.1| FIG002776: hypothetical protein [Pseudoalteromonas luteoviolacea B
           = ATCC 29581]
          Length = 383

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR-MVDEYLPRYSSPNFSQEEIGTP 165
           +Y TP    G  PH D  + FI+Q +GK+ W+V LP   + E+    S     Q E    
Sbjct: 116 SYSTP--GGGVGPHLDQYDVFIIQGQGKRHWRVGLPNPSLREFAQNKS---LLQVEAFDA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           I+   LEPGD+LY+P G+ H+   + N
Sbjct: 171 IIDAVLEPGDILYIPPGHPHEGYAIEN 197


>gi|294140715|ref|YP_003556693.1| hypothetical protein SVI_1944 [Shewanella violacea DSS12]
 gi|293327184|dbj|BAJ01915.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 380

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 94/211 (44%), Gaps = 38/211 (18%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYED-GQRQTMDVEG 59
           D+W+K+PL I R+  N + +L+T +++  +  D ++E ++     +Y++ GQ Q      
Sbjct: 16  DYWQKKPLVI-RQGFNNFEDLLTPEEMAGLACDELVESRR-----IYKNQGQWQA----- 64

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVG---------ANT 107
              P   +D Y +      L    +   +Q LN  L   ++L  CF             +
Sbjct: 65  EFGP---FDSYED------LGETDWTLVVQALNHWLPDAEQLIQCFDFIPRWRFDDVMVS 115

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
           Y TP    G  PH D  + FI Q  G+++W+V       E+    + P     E    I+
Sbjct: 116 YATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHKEFA---AHPALLHTEAFDAII 170

Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
            V L PGD+LYLP G+ H+   +    S  V
Sbjct: 171 DVELLPGDILYLPPGFPHEGVALEASMSFSV 201


>gi|336450774|ref|ZP_08621221.1| hypothetical protein A28LD_0880 [Idiomarina sp. A28L]
 gi|336282597|gb|EGN75829.1| hypothetical protein A28LD_0880 [Idiomarina sp. A28L]
          Length = 405

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 5/84 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y TP    G  PH D  + FI+Q  G ++W+V        + P    P+  Q     PI
Sbjct: 127 SYSTP--GGGVGPHLDQYDVFIVQGMGSRRWQVGASGDFQTHCPH---PDLKQVAEFAPI 181

Query: 167 LTVTLEPGDLLYLPRGYIHQASTV 190
           +   LEPGD++Y+P G+ H   TV
Sbjct: 182 INEVLEPGDMIYIPAGFPHAGDTV 205


>gi|300716265|ref|YP_003741068.1| hypothetical protein EbC_16860 [Erwinia billingiae Eb661]
 gi|299062101|emb|CAX59217.1| conserved uncharacterized protein YcfD [Erwinia billingiae Eb661]
          Length = 375

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 95/202 (47%), Gaps = 34/202 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
            D  G      L Q V DH+ E  S   + P +++   +  +  +           ++  
Sbjct: 71  FDHLGENNWSLLVQAV-DHWHE-PSAALMRPFSFLPDWRTDDLMI-----------SFSV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   
Sbjct: 118 P--GGGVGPHFDQYDVFIIQGTGRRRWRVGDKVPLKQHCPH---PDLLQVEPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
           +EPGD+LY+P G+ H+  ++ N
Sbjct: 173 MEPGDILYIPPGFPHEGYSLEN 194


>gi|453061876|gb|EMF02872.1| cupin [Serratia marcescens VGH107]
          Length = 373

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/202 (22%), Positives = 91/202 (45%), Gaps = 34/202 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70

Query: 62  LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
              +    W       DH+ E  +         ++P +QL +  + +L   F        
Sbjct: 71  FDHLSENNWSLLVQAVDHWHEPSA-------ALMRPFRQLPDWRMDDLMISF------SV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKTPLKQHCPH---PDLLQVEPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
           +EPGD+LY+P G+ H+   + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194


>gi|16764584|ref|NP_460199.1| hypothetical protein STM1229 [Salmonella enterica subsp. enterica
           serovar Typhimurium str. LT2]
 gi|167991981|ref|ZP_02573080.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           4,[5],12:i:- str. CVM23701]
 gi|374980227|ref|ZP_09721557.1| hypothetical protein SEE_02292 [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|378444662|ref|YP_005232294.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. D23580]
 gi|378449630|ref|YP_005236989.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. 14028S]
 gi|378699122|ref|YP_005181079.1| hypothetical protein SL1344_1167 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|378983789|ref|YP_005246944.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. T000240]
 gi|378988573|ref|YP_005251737.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. UK-1]
 gi|379700400|ref|YP_005242128.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. ST4/74]
 gi|421884796|ref|ZP_16316001.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Senftenberg str. SS209]
 gi|422025378|ref|ZP_16371812.1| hypothetical protein B571_06091 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|427548949|ref|ZP_18927123.1| hypothetical protein B576_06196 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|427564604|ref|ZP_18931826.1| hypothetical protein B577_05550 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|427584451|ref|ZP_18936623.1| hypothetical protein B573_05591 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|427606865|ref|ZP_18941437.1| hypothetical protein B574_05619 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|427632052|ref|ZP_18946385.1| hypothetical protein B575_06187 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|427655221|ref|ZP_18951142.1| hypothetical protein B578_05802 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|427660557|ref|ZP_18956048.1| hypothetical protein B579_06422 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|427666409|ref|ZP_18960820.1| hypothetical protein B580_05984 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|427752095|ref|ZP_18965920.1| hypothetical protein B581_07315 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm5]
 gi|16419747|gb|AAL20158.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. LT2]
 gi|205329821|gb|EDZ16585.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           4,[5],12:i:- str. CVM23701]
 gi|261246441|emb|CBG24251.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
 gi|267993008|gb|ACY87893.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. 14028S]
 gi|301157770|emb|CBW17263.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. SL1344]
 gi|312912217|dbj|BAJ36191.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. T000240]
 gi|321223847|gb|EFX48910.1| hypothetical protein SEE_02292 [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|323129499|gb|ADX16929.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. ST4/74]
 gi|332988120|gb|AEF07103.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. UK-1]
 gi|379985510|emb|CCF88274.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Senftenberg str. SS209]
 gi|414020717|gb|EKT04291.1| hypothetical protein B571_06091 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|414020728|gb|EKT04301.1| hypothetical protein B576_06196 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|414034694|gb|EKT17612.1| hypothetical protein B577_05550 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|414035894|gb|EKT18740.1| hypothetical protein B573_05591 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|414039416|gb|EKT22087.1| hypothetical protein B574_05619 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|414049030|gb|EKT31256.1| hypothetical protein B578_05802 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|414050503|gb|EKT32674.1| hypothetical protein B575_06187 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|414055066|gb|EKT36986.1| hypothetical protein B579_06422 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|414060577|gb|EKT42086.1| hypothetical protein B580_05984 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|414066208|gb|EKT46813.1| hypothetical protein B581_07315 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm5]
          Length = 373

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|448241809|ref|YP_007405862.1| cupin superfamily protein [Serratia marcescens WW4]
 gi|445212173|gb|AGE17843.1| cupin superfamily protein [Serratia marcescens WW4]
          Length = 373

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/202 (22%), Positives = 91/202 (45%), Gaps = 34/202 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70

Query: 62  LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
              +    W       DH+ E  +         ++P +QL +  + +L   F        
Sbjct: 71  FDHLSENNWSLLVQAVDHWHEPSA-------ALMRPFRQLPDWRMDDLMISF------SV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKTPLKQHCPH---PDLLQVEPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
           +EPGD+LY+P G+ H+   + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194


>gi|427399222|ref|ZP_18890460.1| hypothetical protein HMPREF9710_00056 [Massilia timonae CCUG 45783]
 gi|425721713|gb|EKU84621.1| hypothetical protein HMPREF9710_00056 [Massilia timonae CCUG 45783]
          Length = 373

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/191 (27%), Positives = 82/191 (42%), Gaps = 21/191 (10%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM-DVEG 59
           D+W K+PL I R     +  L+  +K+ ++ R N +E +         D Q   + ++  
Sbjct: 18  DYWHKKPLLI-RGAIPGFKPLLKFEKLAEMARQNHVESRLVTQKDGNWDMQHGPLAELPA 76

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFA 118
           R  P   W    +G ++        ++  + + +A L +L   F             G  
Sbjct: 77  RTEPN--WTMLVQGVNLYDERADELLRQFRFVPDARLDDLMVSFA--------TDGAGVG 126

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
           PHYD  + F+LQ  GK++WK+   +   + E LP     NF  EE         LEPGD+
Sbjct: 127 PHYDSYDVFLLQAHGKRRWKIGSQQDLTLVEGLPLKILANFKPEE------EFVLEPGDM 180

Query: 177 LYLPRGYIHQA 187
           LYLP  Y H  
Sbjct: 181 LYLPPHYAHDG 191


>gi|383495951|ref|YP_005396640.1| hypothetical protein UMN798_1277 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
 gi|417339270|ref|ZP_12120862.1| hypothetical protein LTSEBAI_0501 [Salmonella enterica subsp.
           enterica serovar Baildon str. R6-199]
 gi|357960559|gb|EHJ84366.1| hypothetical protein LTSEBAI_0501 [Salmonella enterica subsp.
           enterica serovar Baildon str. R6-199]
 gi|380462772|gb|AFD58175.1| hypothetical protein UMN798_1277 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
          Length = 383

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 26  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 77  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 192

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 193 PGFPHEGYALEN 204


>gi|283784938|ref|YP_003364803.1| hypothetical protein ROD_12171 [Citrobacter rodentium ICC168]
 gi|282948392|emb|CBG87980.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
          Length = 373

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V   + + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKQQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H    + N
Sbjct: 183 PGFPHAGYALEN 194


>gi|407687590|ref|YP_006802763.1| hypothetical protein AMBAS45_09055 [Alteromonas macleodii str.
           'Balearic Sea AD45']
 gi|407290970|gb|AFT95282.1| hypothetical protein AMBAS45_09055 [Alteromonas macleodii str.
           'Balearic Sea AD45']
          Length = 388

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
              G   H D  + F++Q +G+++W+V  P    E  P    P   Q E   P++ V +E
Sbjct: 129 NGAGVGAHIDQYDVFLVQGKGRRRWRVGKPADYKEVFPH---PKLRQIEGFDPVIDVVVE 185

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           PGD++Y+P G+ H  +T+  E SL  ++ 
Sbjct: 186 PGDVVYVPPGWPHDGATI--EDSLTYSVG 212


>gi|149926788|ref|ZP_01915047.1| hypothetical protein LMED105_12342 [Limnobacter sp. MED105]
 gi|149824340|gb|EDM83558.1| hypothetical protein LMED105_12342 [Limnobacter sp. MED105]
          Length = 383

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 40/72 (55%), Gaps = 8/72 (11%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PHYD  + F+LQ+ GK++WK+   L + ++E LP     NF       P     LEP
Sbjct: 125 GVGPHYDSYDVFLLQMHGKRRWKIGPLLDKELEEGLPLKILKNFE------PTEEFVLEP 178

Query: 174 GDLLYLPRGYIH 185
           GD+LYLP  Y H
Sbjct: 179 GDMLYLPPNYGH 190


>gi|167517725|ref|XP_001743203.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778302|gb|EDQ91917.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2199

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 38/141 (26%), Positives = 65/141 (46%), Gaps = 21/141 (14%)

Query: 89  QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
           QQ+ A L    G  +  + Y +PP+ +  +PH D  + F+ Q  G K W + +P+  D  
Sbjct: 237 QQIQAHL----GVVMSEHFYISPPSQRALSPHTDGGDVFVHQWAGSKLWTLCVPKAPDCQ 292

Query: 149 LPRYSS---------------PNFSQEEI-GTPILTVTLEPGDLLYLPRGYIHQASTVTN 192
              ++                  ++ E++ G    T+ L+  +LLY+PRG +H A   + 
Sbjct: 293 DCTFADLALRAELAKSAFQGCTAYTDEQLAGMECRTLDLQENELLYVPRGVVHYARANST 352

Query: 193 EHSLHVTISVYQKTA-WIDLL 212
             S H T  V   +A WID++
Sbjct: 353 GLSAHATYQVVPPSATWIDMV 373


>gi|168244303|ref|ZP_02669235.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. SL486]
 gi|194450781|ref|YP_002045231.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. SL476]
 gi|386591091|ref|YP_006087491.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. B182]
 gi|419729313|ref|ZP_14256271.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41579]
 gi|419736387|ref|ZP_14263238.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41563]
 gi|419737739|ref|ZP_14264511.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41573]
 gi|419744077|ref|ZP_14270736.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41566]
 gi|419749199|ref|ZP_14275685.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41565]
 gi|421569517|ref|ZP_16015220.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00322]
 gi|421573533|ref|ZP_16019168.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00325]
 gi|421578721|ref|ZP_16024295.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00326]
 gi|421582567|ref|ZP_16028103.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00328]
 gi|194409085|gb|ACF69304.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. SL476]
 gi|205336690|gb|EDZ23454.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. SL486]
 gi|381292526|gb|EIC33729.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41563]
 gi|381296887|gb|EIC37986.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41579]
 gi|381304261|gb|EIC45268.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41573]
 gi|381310336|gb|EIC51167.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41566]
 gi|381312206|gb|EIC53014.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. 41565]
 gi|383798135|gb|AFH45217.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. B182]
 gi|402524234|gb|EJW31539.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00326]
 gi|402527359|gb|EJW34622.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00322]
 gi|402527379|gb|EJW34641.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00325]
 gi|402533034|gb|EJW40219.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. CFSAN00328]
          Length = 373

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|426405554|ref|YP_007024525.1| hypothetical protein Bdt_3583 [Bdellovibrio bacteriovorus str.
           Tiberius]
 gi|425862222|gb|AFY03258.1| hypothetical protein Bdt_3583 [Bdellovibrio bacteriovorus str.
           Tiberius]
          Length = 308

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/246 (22%), Positives = 106/246 (43%), Gaps = 40/246 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE-DGQRQTMDVEGR 60
           HW  EPL+I         ++ + +++ D L++ +   ++ +   L + D +  ++ +E  
Sbjct: 22  HWPVEPLFIP-ATPGKLQDIFSLEQMQD-LKNLISARQRKVRACLPDFDDEYSSIHLE-- 77

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN----------TYQT 110
             P      YR   ++ F + Q+    +  +  +++   G   G             Y T
Sbjct: 78  --PNDALKAYRNNMTLVFDSMQSQDSTIADILGNVRADLGLVTGGAENDLCKARSIAYAT 135

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI---- 166
           P    G   H+D    FI+Q++G K W++     VD    R+++     EE+   +    
Sbjct: 136 P-AGCGTRLHFDANANFIIQIKGTKTWRLAPNESVDFPTERFTT---GSEEMPAALEKQC 191

Query: 167 ------------LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
                       + V ++PG +L++PRGY H+  T T E SL +  +  Q T W D+  K
Sbjct: 192 HAHLIDALDEDSMEVVMKPGCVLFVPRGYWHE--TTTEEESLSLNFTFSQPT-WADVFTK 248

Query: 215 AMPKAL 220
           ++ + L
Sbjct: 249 SLQEVL 254


>gi|406596649|ref|YP_006747779.1| hypothetical protein MASE_08470 [Alteromonas macleodii ATCC 27126]
 gi|406373970|gb|AFS37225.1| hypothetical protein MASE_08470 [Alteromonas macleodii ATCC 27126]
          Length = 388

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
              G   H D  + F++Q +G+++W+V  P    E  P    P   Q E   P++ V +E
Sbjct: 129 NGAGVGAHIDQYDVFLVQGKGRRRWRVGKPADYKEVFPH---PKLRQIEGFDPVIDVVVE 185

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           PGD++Y+P G+ H  +T+  E SL  ++ 
Sbjct: 186 PGDVVYVPPGWPHDGATI--EDSLTYSVG 212


>gi|108757183|ref|YP_628426.1| cupin [Myxococcus xanthus DK 1622]
 gi|108461063|gb|ABF86248.1| cupin family protein [Myxococcus xanthus DK 1622]
          Length = 295

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 94/205 (45%), Gaps = 27/205 (13%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           + +G ++    P  +   L QL  +        +  + Y TP    GF  H D  E FIL
Sbjct: 78  FAQGYTLALRQPDLHHPDLAQLARAFSAELHGRINLHIYCTPAGHHGFGWHCDPEEVFIL 137

Query: 130 QLEGKKKW----KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
           Q  G+K +        P  + E +P  S    +QE+  TP+ T +L  GD +Y+P G+ H
Sbjct: 138 QTAGRKDYLLRENTLHPVPLPESVP--SGSLAAQEK--TPVETHSLSAGDFIYIPGGHWH 193

Query: 186 QASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
            A     E +L ++I +   T  +DLL+      ++AA A+   +RR +P        L 
Sbjct: 194 MAQ--ATEEALSISIGLMPPT-LLDLLD-----GVRAALASSPVWRRRMP-------SLG 238

Query: 246 RGKPVDIQADRLAMKENLKDLLAKL 270
           R   +D    +LA+   L+ LL++L
Sbjct: 239 RASSLD-DPSKLAL---LRTLLSEL 259


>gi|323494948|ref|ZP_08100039.1| hypothetical protein VIBR0546_21010 [Vibrio brasiliensis LMG 20546]
 gi|323310743|gb|EGA63916.1| hypothetical protein VIBR0546_21010 [Vibrio brasiliensis LMG 20546]
          Length = 377

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/196 (23%), Positives = 85/196 (43%), Gaps = 20/196 (10%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM----- 55
           +HW K+P  I     ++ +  I+ +++  +  +  ++ +        +DGQ   +     
Sbjct: 14  EHWHKKPTVIKAGFADFVDP-ISPEELAGLTMEEEVDSR----FISNQDGQWSALHGPLP 68

Query: 56  DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPTA 114
           +    +L +  W    + C+   L     I P +QL   L  +L  C      Y  P   
Sbjct: 69  ESHFESLSESHWQLIVQACNHWHLGAAELIAPFKQLPQWLFDDLMVC------YSAP--K 120

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
            G  PH D  + FI+Q  GK++W+V   +   +Y     +    Q E    I+   L PG
Sbjct: 121 GGVGPHIDQYDVFIIQGSGKRRWRVG-AKDTGQYNESIQASALRQIEGFDAIIDEVLMPG 179

Query: 175 DLLYLPRGYIHQASTV 190
           D+LY+P G+ H+ +T+
Sbjct: 180 DILYIPPGFPHEGNTI 195


>gi|410091761|ref|ZP_11288312.1| cupin 4 family protein [Pseudomonas viridiflava UASWS0038]
 gi|409760994|gb|EKN46105.1| cupin 4 family protein [Pseudomonas viridiflava UASWS0038]
          Length = 374

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 58/118 (49%), Gaps = 4/118 (3%)

Query: 88  LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDE 147
           +++L  +L  +    V AN Y        F  H+D  + F  QL G K+W +Y P  +D 
Sbjct: 104 IKRLCNALSHIANGQVVANGYLAKTGDSAFGNHWDSHDVFASQLLGNKRWLIYPPTFIDP 163

Query: 148 YLPRYSSPNFSQEEIGT-PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQ 204
            + + S  +   E+  T   + V  EPGD+LY+PRG+ H A   +     HV + V++
Sbjct: 164 IVGQKSIKHL--EDCPTEAYMDVITEPGDILYIPRGWWHCAFP-SEGPCFHVAVGVHK 218


>gi|407699915|ref|YP_006824702.1| hypothetical protein AMBLS11_08330 [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407249062|gb|AFT78247.1| hypothetical protein AMBLS11_08330 [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 388

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
              G   H D  + F++Q +G+++W+V  P    E  P    P   Q E   P++ V +E
Sbjct: 129 NGAGVGAHIDQYDVFLVQGKGQRRWRVGKPANYKEVCPH---PKLRQIEGFDPVIDVVVE 185

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           PGD++Y+P G+ H  +T+  E SL  ++ 
Sbjct: 186 PGDVVYVPPGWPHDGATI--EDSLTYSVG 212


>gi|392537664|ref|ZP_10284801.1| RmlC-like domain-containing protein [Pseudoalteromonas marina
           mano4]
          Length = 385

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           ++ TP    G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    
Sbjct: 116 SFSTP--NGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           ++   LEPGD+LY+P G  H+   V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197


>gi|261252554|ref|ZP_05945127.1| hypothetical protein VIA_002578 [Vibrio orientalis CIP 102891 =
           ATCC 33934]
 gi|417952662|ref|ZP_12595720.1| hypothetical protein VIOR3934_03262 [Vibrio orientalis CIP 102891 =
           ATCC 33934]
 gi|260935945|gb|EEX91934.1| hypothetical protein VIA_002578 [Vibrio orientalis CIP 102891 =
           ATCC 33934]
 gi|342818888|gb|EGU53739.1| hypothetical protein VIOR3934_03262 [Vibrio orientalis CIP 102891 =
           ATCC 33934]
          Length = 377

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 84/202 (41%), Gaps = 32/202 (15%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNV--IEFKKNIDITLYEDGQRQTMDVE 58
           +HW K+P  I     ++         ID I  D +  +  ++ ID     +   Q     
Sbjct: 14  EHWHKKPTVIKAGFADF---------IDPISADELAGLAMEEEIDSRFISNKDNQWSATH 64

Query: 59  G-------RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQT 110
           G        +L +  W    + C+   L     ++  +QL   L  +L  CF       +
Sbjct: 65  GPLPESHFESLDESHWQLIVQACNHWHLGSAELVQAFKQLPQWLFDDLMVCF-------S 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPNFSQEEIGTPILT 168
            P   G  PH D  + FI+Q  GK++W+V     +D  +Y     +    Q E    I+ 
Sbjct: 118 APEG-GVGPHIDQYDVFIIQGSGKRRWRV---GDIDKGQYKESIQAGALRQIEGFESIID 173

Query: 169 VTLEPGDLLYLPRGYIHQASTV 190
             LEPGD+LY+P G+ H+ +T+
Sbjct: 174 EVLEPGDILYIPPGFPHEGNTL 195


>gi|119470128|ref|ZP_01612894.1| hypothetical protein ATW7_13583 [Alteromonadales bacterium TW-7]
 gi|119446549|gb|EAW27823.1| hypothetical protein ATW7_13583 [Alteromonadales bacterium TW-7]
          Length = 385

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           ++ TP    G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    
Sbjct: 116 SFSTP--NGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           ++   LEPGD+LY+P G  H+   V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197


>gi|359448363|ref|ZP_09237901.1| RmlC-like domain-containing protein [Pseudoalteromonas sp.
           BSi20480]
 gi|358045772|dbj|GAA74150.1| RmlC-like domain-containing protein [Pseudoalteromonas sp.
           BSi20480]
          Length = 385

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           ++ TP    G  PH D  + FI+Q EGK+ W+V LP   D  L +++ +    Q E    
Sbjct: 116 SFSTP--NGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           ++   LEPGD+LY+P G  H+   V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197


>gi|117920191|ref|YP_869383.1| cupin 4 family protein [Shewanella sp. ANA-3]
 gi|117612523|gb|ABK47977.1| Cupin 4 family protein [Shewanella sp. ANA-3]
          Length = 379

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E   
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGANTPQQR-----RGGNPNSPLVEDFE 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P GY H   T+T
Sbjct: 166 PIIDVVLEKGDVLYIPPGYPHCGETLT 192


>gi|407683605|ref|YP_006798779.1| hypothetical protein AMEC673_08540 [Alteromonas macleodii str.
           'English Channel 673']
 gi|407245216|gb|AFT74402.1| hypothetical protein AMEC673_08540 [Alteromonas macleodii str.
           'English Channel 673']
          Length = 388

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
              G   H D  + F++Q +G+++W+V  P    E  P    P   Q E   P++ V +E
Sbjct: 129 NGAGVGAHIDHYDVFLVQGKGRRRWRVGKPADYKEVFPH---PKLRQIEGFDPVIDVVVE 185

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           PGD++Y+P G+ H  +T+  E SL  ++ 
Sbjct: 186 PGDVVYVPPGWPHDGATI--EDSLTYSVG 212


>gi|224010139|ref|XP_002294027.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970044|gb|EED88382.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 415

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 33/126 (26%)

Query: 97  ELFGCFVGANTYQTPPT-----------AQGFAPHYDDIEAF---------------ILQ 130
           E+ G  VG N Y TP              QGF  H+D ++AF               ++Q
Sbjct: 18  EVKGVHVGVNLYLTPEALEDEEGGGKQDRQGFEAHWDWMDAFDAPFFTDQWKSLVVIVIQ 77

Query: 131 LEGKKKWKV------YLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
           L G+K+W+V      +L     +  P      +   + G  +   T+ PGD LY+PRG++
Sbjct: 78  LSGRKRWRVAKQPTIFLSNKDQKRRPTNEEAQYFATDEGHYV-EFTMCPGDSLYIPRGFM 136

Query: 185 HQASTV 190
           H ASTV
Sbjct: 137 HNASTV 142


>gi|114047176|ref|YP_737726.1| cupin 4 family protein [Shewanella sp. MR-7]
 gi|113888618|gb|ABI42669.1| Cupin 4 family protein [Shewanella sp. MR-7]
          Length = 379

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E   
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGANTPQQR-----RGGNPNSPLVEDFE 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P GY H   T+T
Sbjct: 166 PIIDVVLEKGDVLYIPPGYPHCGETLT 192


>gi|163751716|ref|ZP_02158934.1| hypothetical protein KT99_12104 [Shewanella benthica KT99]
 gi|161328368|gb|EDP99527.1| hypothetical protein KT99_12104 [Shewanella benthica KT99]
          Length = 397

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 86/211 (40%), Gaps = 38/211 (18%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W+K+PL I +  KN + +L+T + +  +  D+++E              R+    EG 
Sbjct: 33  DYWQKKPLVIRQGFKN-FEDLLTPEDMAGLACDDMVE-------------SRRIYKTEGE 78

Query: 61  ALPQIV-WDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVG---------ANT 107
              +   +D Y        L    +   +Q LN  +    EL  CF             +
Sbjct: 79  WQAEFGPFDSYEH------LGESDWTLVVQALNNWVPKADELIQCFDFIPRWRLDDVMVS 132

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
           Y TP    G  PH D  + FI Q  G+++W+V       E+    + P     E    I+
Sbjct: 133 YATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHKEFA---AHPALLHTEAFEAII 187

Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
            V L PGD+LYLP G+ H    +    S  V
Sbjct: 188 DVELLPGDILYLPPGFPHDGVALEASMSFSV 218


>gi|304397246|ref|ZP_07379125.1| Cupin 4 family protein [Pantoea sp. aB]
 gi|304355395|gb|EFM19763.1| Cupin 4 family protein [Pantoea sp. aB]
          Length = 375

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 87/192 (45%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    V            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPSAALMRPFRFLPDWRVDDLMVSFAVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|428186247|gb|EKX55098.1| hypothetical protein GUITHDRAFT_99731 [Guillardia theta CCMP2712]
          Length = 390

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 63/286 (22%), Positives = 113/286 (39%), Gaps = 77/286 (26%)

Query: 4   EKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALP 63
           E+ PL I R    YY+  +  K ++ ++++  +E+   ID T Y +GQR T + EG    
Sbjct: 100 EQRPLLIKRNNPAYYDQWMNLKDLELLVKEG-LEWNFEIDATRYTNGQRTTHNGEG---- 154

Query: 64  QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDD 123
                 +   C     + +  +K +   N  +  LFG                    +  
Sbjct: 155 ------WTSNC-----HTELPLKLMYGANMRMVALFG--------------------FQG 183

Query: 124 IEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
           +   ++    +    VY PR   E +PRYSS N ++                        
Sbjct: 184 LTGDLIICARRPYIYVYEPRNEYEAMPRYSSRNMTE------------------------ 219

Query: 184 IHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAG 243
              A +  + HSLHVT+S   + +W D++++ +  AL  A   +++ RR + +    Y  
Sbjct: 220 ---AKSSRDTHSLHVTVSHGYRNSWYDVIKETLISALDVAVQHNVDMRRQIKLAASGYDI 276

Query: 244 LARGKPVD-----------IQADRLAMKENLKDL---LAKLVEYVD 275
              G P D            +A+++++K   + L   +A LV  VD
Sbjct: 277 STEGNPDDDVEELLAPAREAEAEKVSLKTQFRLLEGDIAVLVHSVD 322


>gi|440757851|ref|ZP_20937031.1| hypothetical protein F385_865 [Pantoea agglomerans 299R]
 gi|436428324|gb|ELP25981.1| hypothetical protein F385_865 [Pantoea agglomerans 299R]
          Length = 375

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 87/192 (45%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    V            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPSAALMRPFRFLPDWRVDDLMVSFAVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|113969941|ref|YP_733734.1| cupin 4 family protein [Shewanella sp. MR-4]
 gi|113884625|gb|ABI38677.1| Cupin 4 family protein [Shewanella sp. MR-4]
          Length = 379

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E   
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGANTPQQR-----RGGNPNSPLVEDFE 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P GY H   T+T
Sbjct: 166 PIIDVVLEKGDVLYIPPGYPHCGETLT 192


>gi|113200554|ref|YP_717716.1| gp49 [Synechococcus phage syn9]
 gi|76574453|gb|ABA47018.1| gp49 [Synechococcus phage syn9]
          Length = 236

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 2/124 (1%)

Query: 64  QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDD 123
           QIV +H   G +   +N   Y +  Q+L  + + +F      + Y     A+ F  H D 
Sbjct: 82  QIV-EHINHGHTFIIMNYAFYSRWTQELMKTFESIFAVDCAIHVYGGKEGAKSFNIHDDY 140

Query: 124 IEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
              FI+Q+EG+ +WK+Y  R +   L   ++ +  +E+     L VTL+PGD LY+P   
Sbjct: 141 PSNFIIQVEGETEWKIYKNR-ISSMLQTGTAQDTIREDNLEVDLHVTLKPGDALYIPSRA 199

Query: 184 IHQA 187
            H A
Sbjct: 200 YHCA 203


>gi|343502128|ref|ZP_08739989.1| hypothetical protein VITU9109_16393 [Vibrio tubiashii ATCC 19109]
 gi|418481137|ref|ZP_13050186.1| hypothetical protein VT1337_21872 [Vibrio tubiashii NCIMB 1337 =
           ATCC 19106]
 gi|342815551|gb|EGU50467.1| hypothetical protein VITU9109_16393 [Vibrio tubiashii ATCC 19109]
 gi|384571325|gb|EIF01862.1| hypothetical protein VT1337_21872 [Vibrio tubiashii NCIMB 1337 =
           ATCC 19106]
          Length = 377

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 60/132 (45%), Gaps = 10/132 (7%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFA 118
            +L +  W    + C+   +     I+P + L N    +L  CF       + P   G  
Sbjct: 73  ESLDESHWQLVVQACNHWHIGAAELIEPFKPLPNWLFDDLMVCF-------SAPKG-GVG 124

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PH D  + FI+Q  GK+ W+V   + V +Y     +    Q E    I+   LEPGD+LY
Sbjct: 125 PHIDQYDVFIIQGSGKRHWRVG-DKDVGQYKESIQAGALRQIEGFDAIIDEILEPGDILY 183

Query: 179 LPRGYIHQASTV 190
           +P G+ H+  T+
Sbjct: 184 IPPGFPHEGDTL 195


>gi|445421484|ref|ZP_21435886.1| cupin domain protein, PF06172 family [Acinetobacter sp. WC-743]
 gi|444757452|gb|ELW81974.1| cupin domain protein, PF06172 family [Acinetobacter sp. WC-743]
          Length = 382

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/173 (22%), Positives = 81/173 (46%), Gaps = 14/173 (8%)

Query: 57  VEGRALPQIVWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           VE + +  +++++ R+G ++ +  +  + ++  + +  AS  E      G   Y    + 
Sbjct: 75  VEYKCITSVLYEYLRKGATLVYNRIKNEPFVDNISRQVASFAEAHTITGG---YAAFSSK 131

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTL 171
             +  H+D  + + +QL G+K+W +  P       P Y        +I  P    + V L
Sbjct: 132 SSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKYFSDIKEPEEVYMDVIL 188

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
           E GD+LY+PRG+ H    + +E + H+ ++ +  T   ++  L+ A+P  L  
Sbjct: 189 EAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMQWLQNAIPSILDC 240


>gi|422030384|ref|ZP_16376586.1| hypothetical protein B572_06048, partial [Salmonella enterica
           subsp. enterica serovar Typhimurium str. STm2]
 gi|414022210|gb|EKT05701.1| hypothetical protein B572_06048, partial [Salmonella enterica
           subsp. enterica serovar Typhimurium str. STm2]
          Length = 247

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|416241522|ref|ZP_11632815.1| hypothetical protein E9S_01389 [Moraxella catarrhalis BC7]
 gi|326572066|gb|EGE22067.1| hypothetical protein E9S_01389 [Moraxella catarrhalis BC7]
          Length = 383

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 77/176 (43%), Gaps = 31/176 (17%)

Query: 60  RALPQIVWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG- 116
           R +   ++ H + G +I +  +N + ++         L +    FV A T  +   A G 
Sbjct: 78  RFIKSAIYHHMKHGATIVYNRINNEPFVD-------GLAKQISHFVQAPTVVSGYLAFGS 130

Query: 117 ---FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-------EIGTPI 166
              F  H+D  + F +QL GKK W +        Y P + SP + Q+       E   P 
Sbjct: 131 DASFKSHWDTRDVFAVQLMGKKYWAL--------YKPNFESPLYMQQSKDVDVPEPTIPD 182

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKAL 220
           + + LE GD+LY+PRG+ H    +  + + H+ I  +  T   ++  L K  P  L
Sbjct: 183 MEIVLEAGDVLYIPRGWWHNPVPMGCK-TFHLAIGTFAPTGYDYMQWLVKQTPNIL 237


>gi|82777261|ref|YP_403610.1| hypothetical protein SDY_2024 [Shigella dysenteriae Sd197]
 gi|81241409|gb|ABB62119.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
          Length = 376

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 19  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 69  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEP D+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPSDILYIP 185

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 186 PGFPHEGYALEN 197


>gi|309788198|ref|ZP_07682804.1| cupin family protein [Shigella dysenteriae 1617]
 gi|308924050|gb|EFP69551.1| cupin family protein [Shigella dysenteriae 1617]
          Length = 373

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 66  GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEP D+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPSDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|238795998|ref|ZP_04639510.1| hypothetical protein ymoll0001_23970 [Yersinia mollaretii ATCC
           43969]
 gi|238720203|gb|EEQ12007.1| hypothetical protein ymoll0001_23970 [Yersinia mollaretii ATCC
           43969]
          Length = 373

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 87/192 (45%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  +E+G+ Q     G  
Sbjct: 16  YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEEGRWQV----GHG 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    +EL    +            G  PH
Sbjct: 67  -PFESFDHLGETNWSLLVQAVDHWHEPAAALMRPFRELSDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+   +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|160875706|ref|YP_001555022.1| cupin 4 family protein [Shewanella baltica OS195]
 gi|378708906|ref|YP_005273800.1| transcription factor jumonji jmjC domain-containing protein
           [Shewanella baltica OS678]
 gi|418024724|ref|ZP_12663706.1| Cupin, JmjC-type [Shewanella baltica OS625]
 gi|160861228|gb|ABX49762.1| Cupin 4 family protein [Shewanella baltica OS195]
 gi|315267895|gb|ADT94748.1| transcription factor jumonji jmjC domain-containing protein
           [Shewanella baltica OS678]
 gi|353536010|gb|EHC05570.1| Cupin, JmjC-type [Shewanella baltica OS625]
          Length = 379

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 48/87 (55%), Gaps = 9/87 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
           ++ TP  + G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E   
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFI 165

Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
           PI+ V LE GD+LY+P G+ H   T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192


>gi|440230538|ref|YP_007344331.1| hypothetical protein D781_1857 [Serratia marcescens FGI94]
 gi|440052243|gb|AGB82146.1| hypothetical protein D781_1857 [Serratia marcescens FGI94]
          Length = 373

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 93/206 (45%), Gaps = 42/206 (20%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----VVSHQDGRWQVAHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E            ++P +QL +  + +L   F       
Sbjct: 71  YDHLGENNWSLLVQAV-DHWHEPSG-------ALMRPFRQLPDWRMDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP---RYSSPNFSQEEIGTPI 166
            P    G  PH D  + FI+Q  G+++W+V       E +P    Y  P+  Q E    I
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGTGRRRWRV------GEKVPMKQHYPHPDLLQVEPFDAI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTN 192
           +   +EPGD+LY+P G+ H+   + N
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYALEN 194


>gi|238783793|ref|ZP_04627812.1| hypothetical protein yberc0001_24710 [Yersinia bercovieri ATCC
           43970]
 gi|238715344|gb|EEQ07337.1| hypothetical protein yberc0001_24710 [Yersinia bercovieri ATCC
           43970]
          Length = 373

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 83/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L       D L    +E + +  +  +EDG+ Q     G  
Sbjct: 16  YWQKRPVILKRGFKNFIDPLSP-----DELAGLAMENEVDSRLVSHEDGRWQV----GHG 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESFDHLGETNWSLLVQAVDHWHEPAAALMRPFRALSDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+   +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVAPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|62179750|ref|YP_216167.1| hypothetical protein SC1180 [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SC-B67]
 gi|224584263|ref|YP_002638061.1| hypothetical protein SPC_2515 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|375114074|ref|ZP_09759244.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SCSA50]
 gi|62127383|gb|AAX65086.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SC-B67]
 gi|224468790|gb|ACN46620.1| hypothetical protein SPC_2515 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|322714220|gb|EFZ05791.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SCSA50]
          Length = 383

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 26  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 77  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGAGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 193 PGFPHEGYALEN 204


>gi|422971067|ref|ZP_16974579.1| hypothetical protein ESRG_01213 [Escherichia coli TA124]
 gi|371598932|gb|EHN87723.1| hypothetical protein ESRG_01213 [Escherichia coli TA124]
          Length = 373

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 14/191 (7%)

Query: 3   WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
           W+K P+ + R     +NN I     D+ L    +E + +  +  ++DG+ Q         
Sbjct: 17  WQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SHG 66

Query: 63  PQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
           P   +DH  E   S+       + +P   L    +EL    +            G  PH 
Sbjct: 67  PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126

Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
           D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P 
Sbjct: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIPP 183

Query: 182 GYIHQASTVTN 192
           G+ H+   + N
Sbjct: 184 GFPHEGYALEN 194


>gi|383933981|ref|ZP_09987424.1| hypothetical protein RNAN_0483 [Rheinheimera nanhaiensis E407-8]
 gi|383704980|dbj|GAB57515.1| hypothetical protein RNAN_0483 [Rheinheimera nanhaiensis E407-8]
          Length = 384

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 99/220 (45%), Gaps = 30/220 (13%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W+++P+ +    K + + L T+ ++  +  D  +E +   +I    D Q    + +  
Sbjct: 17  DYWQQKPVLLKGGFKQFQDPL-TADELAGLALDENVESRVVSNINQQWDMQSGPFE-DFA 74

Query: 61  ALPQIVW-------DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
           A  +  W       DH+    +   L+P  +I      N  + +L   F  A        
Sbjct: 75  AFGEQNWTLLVQAVDHWHPPAAT-LLDPFRFIP-----NWRIDDLMVSFSTAGG------ 122

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
             G  PH D  + FI+Q EGK+ W+V +P   + +  P    P   Q       + V  E
Sbjct: 123 --GVGPHLDQYDVFIIQGEGKRHWRVGMPDNSLQQLCPH---PRLLQVSPFNACIDVITE 177

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           PGD+LY+P G  H+ +++  E+SL+ ++  ++  A  DLL
Sbjct: 178 PGDILYIPPGCPHEGTSL--ENSLNYSVG-FRAPAQKDLL 214


>gi|188534104|ref|YP_001907901.1| hypothetical protein ETA_19720 [Erwinia tasmaniensis Et1/99]
 gi|188029146|emb|CAO97018.1| Conserved hypothetical protein YcfD [Erwinia tasmaniensis Et1/99]
          Length = 382

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/192 (22%), Positives = 88/192 (45%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q     G  
Sbjct: 23  YWQKRPVVLKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 73

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + +    V            G  PH
Sbjct: 74  -PFESYDHLGENNWSLLVQAVDHWHEPSAALMQPFRAIPDWRVDDLMVSFSVPGGGVGPH 132

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 133 FDQYDVFIIQGTGRRRWRVGEKLEMKQHCPH---PDLLQVDPFDAIIDEEMEPGDILYIP 189

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 190 PGFPHEGYSLEN 201


>gi|417531601|ref|ZP_12186274.1| hypothetical protein LTSEURB_2855 [Salmonella enterica subsp.
           enterica serovar Urbana str. R8-2977]
 gi|353664588|gb|EHD02966.1| hypothetical protein LTSEURB_2855 [Salmonella enterica subsp.
           enterica serovar Urbana str. R8-2977]
          Length = 362

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 26  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 77  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 193 PGFPHEGYALEN 204


>gi|168239136|ref|ZP_02664194.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Schwarzengrund str. SL480]
 gi|194736752|ref|YP_002114236.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Schwarzengrund str. CVM19633]
 gi|194712254|gb|ACF91475.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Schwarzengrund str. CVM19633]
 gi|197288150|gb|EDY27537.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Schwarzengrund str. SL480]
          Length = 373

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|56413785|ref|YP_150860.1| hypothetical protein SPA1621 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|161614536|ref|YP_001588501.1| hypothetical protein SPAB_02285 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|167553804|ref|ZP_02347549.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Saintpaul str. SARA29]
 gi|168233111|ref|ZP_02658169.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Kentucky str. CDC 191]
 gi|168467191|ref|ZP_02701033.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. SL317]
 gi|168822336|ref|ZP_02834336.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Weltevreden str. HI_N05-537]
 gi|194444751|ref|YP_002040485.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. SL254]
 gi|194472548|ref|ZP_03078532.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Kentucky str. CVM29188]
 gi|197248001|ref|YP_002146812.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Agona str. SL483]
 gi|197362708|ref|YP_002142345.1| hypothetical protein SSPA1506 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
 gi|200389594|ref|ZP_03216205.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Virchow str. SL491]
 gi|205353037|ref|YP_002226838.1| hypothetical protein SG1892 [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|207857264|ref|YP_002243915.1| hypothetical protein SEN1820 [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|238910951|ref|ZP_04654788.1| hypothetical protein SentesTe_07427 [Salmonella enterica subsp.
           enterica serovar Tennessee str. CDC07-0191]
 gi|375001687|ref|ZP_09726027.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Infantis str. SARB27]
 gi|375123866|ref|ZP_09769030.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
           serovar Gallinarum str. SG9]
 gi|378954770|ref|YP_005212257.1| hypothetical protein SPUL_1033 [Salmonella enterica subsp. enterica
           serovar Gallinarum/pullorum str. RKS5078]
 gi|409250502|ref|YP_006886312.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
           serovar Weltevreden str. 2007-60-3289-1]
 gi|416507377|ref|ZP_11735325.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. SARB31]
 gi|416523489|ref|ZP_11741166.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. ATCC BAA710]
 gi|416528531|ref|ZP_11743981.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. LQC 10]
 gi|416535577|ref|ZP_11747831.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. SARB30]
 gi|416553883|ref|ZP_11757911.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 29N]
 gi|416558001|ref|ZP_11759944.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 42N]
 gi|416571633|ref|ZP_11766867.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 4441 H]
 gi|417463691|ref|ZP_12164744.1| hypothetical protein LTSEMON_2720 [Salmonella enterica subsp.
           enterica serovar Montevideo str. S5-403]
 gi|418761292|ref|ZP_13317437.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 35185]
 gi|418768380|ref|ZP_13324430.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 35199]
 gi|418769788|ref|ZP_13325815.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21539]
 gi|418775972|ref|ZP_13331921.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 33953]
 gi|418780314|ref|ZP_13336203.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 35188]
 gi|418786493|ref|ZP_13342308.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21559]
 gi|418789828|ref|ZP_13345614.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19447]
 gi|418795683|ref|ZP_13351384.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19449]
 gi|418798421|ref|ZP_13354098.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19567]
 gi|418802448|ref|ZP_13358075.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 35202]
 gi|418808769|ref|ZP_13364322.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21550]
 gi|418812925|ref|ZP_13368446.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22513]
 gi|418816992|ref|ZP_13372480.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21538]
 gi|418820434|ref|ZP_13375867.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22425]
 gi|418827978|ref|ZP_13383071.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22462]
 gi|418832862|ref|ZP_13387796.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM N18486]
 gi|418836321|ref|ZP_13391208.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM N1543]
 gi|418839569|ref|ZP_13394403.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21554]
 gi|418848123|ref|ZP_13402862.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 37978]
 gi|418855525|ref|ZP_13410181.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19593]
 gi|419788388|ref|ZP_14314075.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. Levine 1]
 gi|419791096|ref|ZP_14316751.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. Levine 15]
 gi|421358278|ref|ZP_15808576.1| hypothetical protein SEEE3139_09514 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|421364457|ref|ZP_15814689.1| hypothetical protein SEEE0166_17642 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|421366754|ref|ZP_15816956.1| hypothetical protein SEEE0631_06178 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|421373424|ref|ZP_15823564.1| hypothetical protein SEEE0424_17039 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|421377191|ref|ZP_15827290.1| hypothetical protein SEEE3076_13193 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|421381690|ref|ZP_15831745.1| hypothetical protein SEEE4917_12943 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|421385370|ref|ZP_15835392.1| hypothetical protein SEEE6622_08759 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|421390302|ref|ZP_15840277.1| hypothetical protein SEEE6670_10822 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|421393562|ref|ZP_15843506.1| hypothetical protein SEEE6426_04514 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|421398392|ref|ZP_15848300.1| hypothetical protein SEEE6437_06656 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|421404204|ref|ZP_15854048.1| hypothetical protein SEEE7246_13130 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|421409715|ref|ZP_15859505.1| hypothetical protein SEEE7250_18232 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|421413438|ref|ZP_15863192.1| hypothetical protein SEEE1427_14151 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|421418750|ref|ZP_15868451.1| hypothetical protein SEEE2659_18236 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|421422426|ref|ZP_15872094.1| hypothetical protein SEEE1757_14019 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|421426337|ref|ZP_15875965.1| hypothetical protein SEEE5101_11002 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|421432912|ref|ZP_15882480.1| hypothetical protein SEEE8B1_21392 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|421434917|ref|ZP_15884463.1| hypothetical protein SEEE5518_08200 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|421442191|ref|ZP_15891651.1| hypothetical protein SEEE1618_22104 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|421444482|ref|ZP_15893912.1| hypothetical protein SEEE3079_10567 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|421448213|ref|ZP_15897608.1| hypothetical protein SEEE6482_06661 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|436587485|ref|ZP_20511824.1| hypothetical protein SEE22704_00397 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|436776523|ref|ZP_20521034.1| hypothetical protein SEE30663_23300 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|436798198|ref|ZP_20523343.1| hypothetical protein SEECHS44_08640 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|436809177|ref|ZP_20528557.1| hypothetical protein SEEE1882_12114 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|436815068|ref|ZP_20532619.1| hypothetical protein SEEE1884_09778 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|436844491|ref|ZP_20538249.1| hypothetical protein SEEE1594_15488 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|436854178|ref|ZP_20543812.1| hypothetical protein SEEE1566_20799 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|436857424|ref|ZP_20545944.1| hypothetical protein SEEE1580_08895 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|436864597|ref|ZP_20550564.1| hypothetical protein SEEE1543_09680 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|436873839|ref|ZP_20556563.1| hypothetical protein SEEE1441_17537 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|436877963|ref|ZP_20558818.1| hypothetical protein SEEE1810_06222 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|436888496|ref|ZP_20564825.1| hypothetical protein SEEE1558_13819 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|436895721|ref|ZP_20568477.1| hypothetical protein SEEE1018_09342 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|436901603|ref|ZP_20572513.1| hypothetical protein SEEE1010_07154 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|436912358|ref|ZP_20578187.1| hypothetical protein SEEE1729_13290 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|436922290|ref|ZP_20584515.1| hypothetical protein SEEE0895_22485 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|436926972|ref|ZP_20586798.1| hypothetical protein SEEE0899_11044 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|436936309|ref|ZP_20591749.1| hypothetical protein SEEE1457_13351 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|436943499|ref|ZP_20596445.1| hypothetical protein SEEE1747_14492 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|436951013|ref|ZP_20600068.1| hypothetical protein SEEE0968_09924 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|436961662|ref|ZP_20605036.1| hypothetical protein SEEE1444_12165 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|436970744|ref|ZP_20609137.1| hypothetical protein SEEE1445_10116 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|436982094|ref|ZP_20613590.1| hypothetical protein SEEE1559_10059 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|436994733|ref|ZP_20619001.1| hypothetical protein SEEE1565_14614 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|437001772|ref|ZP_20621051.1| hypothetical protein SEEE1808_02269 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|437020036|ref|ZP_20627187.1| hypothetical protein SEEE1811_10508 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|437033905|ref|ZP_20632789.1| hypothetical protein SEEE0956_16006 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|437045604|ref|ZP_20637902.1| hypothetical protein SEEE1455_19061 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|437053817|ref|ZP_20642616.1| hypothetical protein SEEE1575_20271 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|437058829|ref|ZP_20645676.1| hypothetical protein SEEE1725_13124 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|437070348|ref|ZP_20651526.1| hypothetical protein SEEE1745_19933 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|437076518|ref|ZP_20654881.1| hypothetical protein SEEE1791_14002 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|437081119|ref|ZP_20657571.1| hypothetical protein SEEE1795_04921 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|437091718|ref|ZP_20663318.1| hypothetical protein SEEE6709_11442 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|437101930|ref|ZP_20666379.1| hypothetical protein SEEE9058_03984 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|437122775|ref|ZP_20672579.1| hypothetical protein SEEE0816_12700 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|437131123|ref|ZP_20677253.1| hypothetical protein SEEE0819_13450 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|437138875|ref|ZP_20681357.1| hypothetical protein SEEE3072_11367 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|437145730|ref|ZP_20685637.1| hypothetical protein SEEE3089_10142 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|437156765|ref|ZP_20692301.1| hypothetical protein SEEE9163_21092 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|437159232|ref|ZP_20693746.1| hypothetical protein SEEE151_05509 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|437166103|ref|ZP_20697888.1| hypothetical protein SEEEN202_03846 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|437178744|ref|ZP_20704862.1| hypothetical protein SEEE3991_16563 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|437183878|ref|ZP_20707950.1| hypothetical protein SEEE3618_09565 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|437258707|ref|ZP_20716627.1| hypothetical protein SEEE2490_04439 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|437268276|ref|ZP_20721746.1| hypothetical protein SEEEL909_07808 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|437277114|ref|ZP_20726633.1| hypothetical protein SEEEL913_09670 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|437286194|ref|ZP_20729974.1| hypothetical protein SEEE4941_03980 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|437312193|ref|ZP_20736301.1| hypothetical protein SEEE7015_13430 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|437328129|ref|ZP_20740911.1| hypothetical protein SEEE7927_13823 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|437343646|ref|ZP_20745814.1| hypothetical protein SEEECHS4_15890 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|437404138|ref|ZP_20751996.1| hypothetical protein SEEE2217_01595 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|437443973|ref|ZP_20758139.1| hypothetical protein SEEE4018_09992 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|437460569|ref|ZP_20761523.1| hypothetical protein SEEE6211_04127 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|437481372|ref|ZP_20768904.1| hypothetical protein SEEE4441_18866 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|437494610|ref|ZP_20772553.1| hypothetical protein SEEE4647_14653 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|437503911|ref|ZP_20774925.1| hypothetical protein SEEE9845_03933 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|437524523|ref|ZP_20779437.1| hypothetical protein SEEE9317_03971 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|437561003|ref|ZP_20786287.1| hypothetical protein SEEE0116_15890 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|437577657|ref|ZP_20791006.1| hypothetical protein SEEE1117_16729 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|437586367|ref|ZP_20793328.1| hypothetical protein SEEE1392_05704 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|437601754|ref|ZP_20797986.1| hypothetical protein SEEE0268_06478 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|437612736|ref|ZP_20801362.1| hypothetical protein SEEE0316_00622 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|437627645|ref|ZP_20805920.1| hypothetical protein SEEE0436_00804 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|437658116|ref|ZP_20811447.1| hypothetical protein SEEE1319_05348 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|437669493|ref|ZP_20815525.1| hypothetical protein SEEE4481_03073 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|437688553|ref|ZP_20819861.1| hypothetical protein SEEE6297_01857 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|437709427|ref|ZP_20826095.1| hypothetical protein SEEE4220_10855 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|437728171|ref|ZP_20830437.1| hypothetical protein SEEE1616_09635 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 16-16]
 gi|437797953|ref|ZP_20837734.1| hypothetical protein SEEE2651_24346 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 76-2651]
 gi|437812427|ref|ZP_20841539.1| hypothetical protein SEEE3944_18708 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|437987534|ref|ZP_20853615.1| hypothetical protein SEEE5646_06793 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-5646]
 gi|438087107|ref|ZP_20859254.1| hypothetical protein SEEE2625_08557 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|438099794|ref|ZP_20863538.1| hypothetical protein SEEE1976_07359 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|438110668|ref|ZP_20868066.1| hypothetical protein SEEE3407_07536 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|438144679|ref|ZP_20875670.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Pullorum str. ATCC 9120]
 gi|440761753|ref|ZP_20940822.1| hypothetical protein F434_02369 [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|440767575|ref|ZP_20946551.1| hypothetical protein F514_07997 [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|440774024|ref|ZP_20952912.1| hypothetical protein F515_16533 [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
 gi|445134614|ref|ZP_21382999.1| hypothetical protein SEEG9184_003014 [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
 gi|445169176|ref|ZP_21395115.1| hypothetical protein SEE8A_002740 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|445186758|ref|ZP_21399359.1| hypothetical protein SE20037_07109 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|445228796|ref|ZP_21404833.1| hypothetical protein SEE10_016120 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|445353184|ref|ZP_21421076.1| hypothetical protein SEE13_020245 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|445363821|ref|ZP_21424744.1| hypothetical protein SEE23_012930 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|56128042|gb|AAV77548.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|161363900|gb|ABX67668.1| hypothetical protein SPAB_02285 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|194403414|gb|ACF63636.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. SL254]
 gi|194458912|gb|EDX47751.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Kentucky str. CVM29188]
 gi|195630396|gb|EDX49022.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. SL317]
 gi|197094185|emb|CAR59689.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
 gi|197211704|gb|ACH49101.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Agona str. SL483]
 gi|199602039|gb|EDZ00585.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Virchow str. SL491]
 gi|205272818|emb|CAR37744.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|205321844|gb|EDZ09683.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Saintpaul str. SARA29]
 gi|205332730|gb|EDZ19494.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Kentucky str. CDC 191]
 gi|205341198|gb|EDZ27962.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Weltevreden str. HI_N05-537]
 gi|206709067|emb|CAR33400.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|320086330|emb|CBY96103.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
           serovar Weltevreden str. 2007-60-3289-1]
 gi|326628116|gb|EGE34459.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
           serovar Gallinarum str. SG9]
 gi|353076375|gb|EHB42135.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Infantis str. SARB27]
 gi|353630978|gb|EHC78382.1| hypothetical protein LTSEMON_2720 [Salmonella enterica subsp.
           enterica serovar Montevideo str. S5-403]
 gi|357205381|gb|AET53427.1| hypothetical protein SPUL_1033 [Salmonella enterica subsp. enterica
           serovar Gallinarum/pullorum str. RKS5078]
 gi|363550225|gb|EHL34554.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. ATCC BAA710]
 gi|363553387|gb|EHL37635.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. SARB31]
 gi|363553848|gb|EHL38094.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. LQC 10]
 gi|363562069|gb|EHL46175.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 29N]
 gi|363565785|gb|EHL49809.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. SARB30]
 gi|363574163|gb|EHL58036.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 4441 H]
 gi|363577568|gb|EHL61388.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 42N]
 gi|392617572|gb|EIW99993.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. Levine 1]
 gi|392621318|gb|EIX03680.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. Levine 15]
 gi|392731650|gb|EIZ88874.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 35199]
 gi|392739234|gb|EIZ96373.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21539]
 gi|392740910|gb|EIZ98025.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 35185]
 gi|392746605|gb|EJA03611.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 33953]
 gi|392748163|gb|EJA05153.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21559]
 gi|392749364|gb|EJA06341.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 35188]
 gi|392758211|gb|EJA15086.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19449]
 gi|392760173|gb|EJA17013.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19447]
 gi|392767078|gb|EJA23850.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19567]
 gi|392774151|gb|EJA30846.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22513]
 gi|392775452|gb|EJA32144.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21550]
 gi|392777461|gb|EJA34144.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 35202]
 gi|392789160|gb|EJA45680.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21538]
 gi|392792703|gb|EJA49157.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22425]
 gi|392795311|gb|EJA51684.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22462]
 gi|392796932|gb|EJA53260.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM N18486]
 gi|392802147|gb|EJA58367.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM N1543]
 gi|392812446|gb|EJA68435.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21554]
 gi|392821583|gb|EJA77407.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19593]
 gi|392823435|gb|EJA79231.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 37978]
 gi|395983946|gb|EJH93136.1| hypothetical protein SEEE0166_17642 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|395988582|gb|EJH97738.1| hypothetical protein SEEE3139_09514 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|395989409|gb|EJH98543.1| hypothetical protein SEEE0631_06178 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|395996543|gb|EJI05588.1| hypothetical protein SEEE0424_17039 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|396000813|gb|EJI09827.1| hypothetical protein SEEE3076_13193 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|396001653|gb|EJI10665.1| hypothetical protein SEEE4917_12943 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|396014112|gb|EJI22998.1| hypothetical protein SEEE6670_10822 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|396016807|gb|EJI25674.1| hypothetical protein SEEE6622_08759 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|396017445|gb|EJI26310.1| hypothetical protein SEEE6426_04514 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|396025012|gb|EJI33796.1| hypothetical protein SEEE7250_18232 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|396027284|gb|EJI36048.1| hypothetical protein SEEE7246_13130 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|396031465|gb|EJI40192.1| hypothetical protein SEEE6437_06656 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|396038028|gb|EJI46672.1| hypothetical protein SEEE2659_18236 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|396040526|gb|EJI49150.1| hypothetical protein SEEE1427_14151 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|396041741|gb|EJI50364.1| hypothetical protein SEEE1757_14019 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|396049128|gb|EJI57671.1| hypothetical protein SEEE8B1_21392 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|396053844|gb|EJI62337.1| hypothetical protein SEEE5101_11002 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|396059298|gb|EJI67753.1| hypothetical protein SEEE5518_08200 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|396062868|gb|EJI71279.1| hypothetical protein SEEE1618_22104 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|396066913|gb|EJI75273.1| hypothetical protein SEEE3079_10567 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|396073813|gb|EJI82113.1| hypothetical protein SEEE6482_06661 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|434939014|gb|ELL45899.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Pullorum str. ATCC 9120]
 gi|434960690|gb|ELL54048.1| hypothetical protein SEECHS44_08640 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|434961012|gb|ELL54340.1| hypothetical protein SEE30663_23300 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|434966994|gb|ELL59829.1| hypothetical protein SEEE1882_12114 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|434973184|gb|ELL65572.1| hypothetical protein SEEE1884_09778 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|434979077|gb|ELL71069.1| hypothetical protein SEEE1594_15488 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|434981527|gb|ELL73406.1| hypothetical protein SEE22704_00397 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|434982981|gb|ELL74789.1| hypothetical protein SEEE1566_20799 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|434989576|gb|ELL81126.1| hypothetical protein SEEE1580_08895 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|434995632|gb|ELL86948.1| hypothetical protein SEEE1543_09680 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|434998596|gb|ELL89817.1| hypothetical protein SEEE1441_17537 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|435007900|gb|ELL98727.1| hypothetical protein SEEE1810_06222 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|435010206|gb|ELM00992.1| hypothetical protein SEEE1558_13819 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|435015610|gb|ELM06136.1| hypothetical protein SEEE1018_09342 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|435021037|gb|ELM11426.1| hypothetical protein SEEE1010_07154 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|435024608|gb|ELM14814.1| hypothetical protein SEEE0895_22485 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|435026603|gb|ELM16734.1| hypothetical protein SEEE1729_13290 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|435036813|gb|ELM26632.1| hypothetical protein SEEE0899_11044 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|435039147|gb|ELM28928.1| hypothetical protein SEEE1457_13351 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|435043698|gb|ELM33415.1| hypothetical protein SEEE1747_14492 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|435050801|gb|ELM40305.1| hypothetical protein SEEE1444_12165 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|435051480|gb|ELM40982.1| hypothetical protein SEEE0968_09924 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|435057033|gb|ELM46402.1| hypothetical protein SEEE1445_10116 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|435063964|gb|ELM53111.1| hypothetical protein SEEE1565_14614 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|435066348|gb|ELM55436.1| hypothetical protein SEEE1559_10059 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|435074643|gb|ELM63467.1| hypothetical protein SEEE1808_02269 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|435076795|gb|ELM65577.1| hypothetical protein SEEE0956_16006 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|435079673|gb|ELM68368.1| hypothetical protein SEEE1811_10508 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|435079892|gb|ELM68586.1| hypothetical protein SEEE1455_19061 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|435088831|gb|ELM77286.1| hypothetical protein SEEE1575_20271 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|435090319|gb|ELM78721.1| hypothetical protein SEEE1745_19933 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|435094642|gb|ELM82981.1| hypothetical protein SEEE1725_13124 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|435105815|gb|ELM93852.1| hypothetical protein SEEE1791_14002 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|435111738|gb|ELM99626.1| hypothetical protein SEEE1795_04921 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|435112624|gb|ELN00489.1| hypothetical protein SEEE6709_11442 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|435122831|gb|ELN10337.1| hypothetical protein SEEE0816_12700 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|435123909|gb|ELN11400.1| hypothetical protein SEEE9058_03984 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|435125097|gb|ELN12553.1| hypothetical protein SEEE0819_13450 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|435132397|gb|ELN19595.1| hypothetical protein SEEE3072_11367 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|435135372|gb|ELN22481.1| hypothetical protein SEEE9163_21092 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|435137191|gb|ELN24262.1| hypothetical protein SEEE3089_10142 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|435150165|gb|ELN36849.1| hypothetical protein SEEE151_05509 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|435152161|gb|ELN38791.1| hypothetical protein SEEE3991_16563 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|435153460|gb|ELN40068.1| hypothetical protein SEEEN202_03846 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|435162077|gb|ELN48277.1| hypothetical protein SEEE3618_09565 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|435165954|gb|ELN51956.1| hypothetical protein SEEE2490_04439 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|435173300|gb|ELN58810.1| hypothetical protein SEEEL913_09670 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|435174455|gb|ELN59897.1| hypothetical protein SEEEL909_07808 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|435183325|gb|ELN68300.1| hypothetical protein SEEE7015_13430 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|435184721|gb|ELN69642.1| hypothetical protein SEEE4941_03980 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|435191363|gb|ELN75929.1| hypothetical protein SEEE7927_13823 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|435191423|gb|ELN75980.1| hypothetical protein SEEECHS4_15890 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|435205556|gb|ELN89145.1| hypothetical protein SEEE2217_01595 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|435210673|gb|ELN93911.1| hypothetical protein SEEE4018_09992 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|435217787|gb|ELO00202.1| hypothetical protein SEEE4441_18866 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|435220861|gb|ELO03135.1| hypothetical protein SEEE6211_04127 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|435227127|gb|ELO08650.1| hypothetical protein SEEE4647_14653 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|435236438|gb|ELO17173.1| hypothetical protein SEEE9845_03933 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|435238329|gb|ELO18978.1| hypothetical protein SEEE0116_15890 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|435242599|gb|ELO22903.1| hypothetical protein SEEE1117_16729 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|435248883|gb|ELO28731.1| hypothetical protein SEEE9317_03971 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|435260325|gb|ELO39536.1| hypothetical protein SEEE1392_05704 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|435260766|gb|ELO39956.1| hypothetical protein SEEE0268_06478 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|435265309|gb|ELO44190.1| hypothetical protein SEEE0316_00622 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|435270811|gb|ELO49296.1| hypothetical protein SEEE1319_05348 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|435282100|gb|ELO59736.1| hypothetical protein SEEE4481_03073 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|435282667|gb|ELO60281.1| hypothetical protein SEEE6297_01857 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|435285798|gb|ELO63162.1| hypothetical protein SEEE0436_00804 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|435288613|gb|ELO65621.1| hypothetical protein SEEE2651_24346 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 76-2651]
 gi|435289548|gb|ELO66501.1| hypothetical protein SEEE4220_10855 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|435290743|gb|ELO67643.1| hypothetical protein SEEE1616_09635 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 16-16]
 gi|435297425|gb|ELO73707.1| hypothetical protein SEEE3944_18708 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|435318315|gb|ELO91256.1| hypothetical protein SEEE2625_08557 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|435325392|gb|ELO97257.1| hypothetical protein SEEE1976_07359 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|435331875|gb|ELP02973.1| hypothetical protein SEEE3407_07536 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|435337265|gb|ELP06901.1| hypothetical protein SEEE5646_06793 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-5646]
 gi|436413542|gb|ELP11475.1| hypothetical protein F515_16533 [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
 gi|436419484|gb|ELP17359.1| hypothetical protein F514_07997 [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|436424533|gb|ELP22304.1| hypothetical protein F434_02369 [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|444846644|gb|ELX71804.1| hypothetical protein SEEG9184_003014 [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
 gi|444863684|gb|ELX88501.1| hypothetical protein SEE8A_002740 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|444865968|gb|ELX90724.1| hypothetical protein SEE10_016120 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|444869388|gb|ELX93976.1| hypothetical protein SE20037_07109 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|444873361|gb|ELX97662.1| hypothetical protein SEE13_020245 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|444883532|gb|ELY07411.1| hypothetical protein SEE23_012930 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
          Length = 373

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|418845546|ref|ZP_13400330.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19443]
 gi|392812919|gb|EJA68895.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19443]
          Length = 373

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|417518865|ref|ZP_12181142.1| hypothetical protein LTSEUGA_2826 [Salmonella enterica subsp.
           enterica serovar Uganda str. R8-3404]
 gi|353648187|gb|EHC91141.1| hypothetical protein LTSEUGA_2826 [Salmonella enterica subsp.
           enterica serovar Uganda str. R8-3404]
          Length = 352

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|417539905|ref|ZP_12192078.1| hypothetical protein LTSEWAN_3165 [Salmonella enterica subsp.
           enterica serovar Wandsworth str. A4-580]
 gi|353663648|gb|EHD02277.1| hypothetical protein LTSEWAN_3165 [Salmonella enterica subsp.
           enterica serovar Wandsworth str. A4-580]
          Length = 383

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 26  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 77  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 193 PGFPHEGYALEN 204


>gi|123442019|ref|YP_001006002.1| hypothetical protein YE1716 [Yersinia enterocolitica subsp.
           enterocolitica 8081]
 gi|122088980|emb|CAL11791.1| conserved hypothetical protein [Yersinia enterocolitica subsp.
           enterocolitica 8081]
          Length = 373

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 93/203 (45%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  +EDG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWQVAHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMVSF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+   + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194


>gi|445292803|ref|ZP_21411158.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 436]
 gi|444883200|gb|ELY07099.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Enteritidis str. 436]
          Length = 248

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|197264610|ref|ZP_03164684.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Saintpaul str. SARA23]
 gi|197242865|gb|EDY25485.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Saintpaul str. SARA23]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|417358629|ref|ZP_12133487.1| hypothetical protein LTSEGIV_2727, partial [Salmonella enterica
           subsp. enterica serovar Give str. S5-487]
 gi|417511486|ref|ZP_12176093.1| hypothetical protein LTSESEN_3098, partial [Salmonella enterica
           subsp. enterica serovar Senftenberg str. A4-543]
 gi|353590763|gb|EHC49201.1| hypothetical protein LTSEGIV_2727, partial [Salmonella enterica
           subsp. enterica serovar Give str. S5-487]
 gi|353642904|gb|EHC87229.1| hypothetical protein LTSESEN_3098, partial [Salmonella enterica
           subsp. enterica serovar Senftenberg str. A4-543]
          Length = 351

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|336311496|ref|ZP_08566459.1| putative cytoplasmic protein [Shewanella sp. HN-41]
 gi|335864989|gb|EGM70049.1| putative cytoplasmic protein [Shewanella sp. HN-41]
          Length = 433

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/218 (24%), Positives = 95/218 (43%), Gaps = 38/218 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K+PL I R+   ++ +L++ +++  +  D ++E ++     +Y+   +   +     
Sbjct: 64  YWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR-----VYQQAGQWHAEFGP-- 115

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVGAN---------TYQ 109
                +D Y E      L  + +   +Q LN  L   + L  CF             +Y 
Sbjct: 116 -----FDSYEE------LGERDWTLIVQALNNWLPDAEALIQCFDFIPRWRFDDVMVSYA 164

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
           TP    G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+  
Sbjct: 165 TP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHREFA---AHPALLHTEAFEPIIDT 219

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
            L PGD+LY+P G+ H    +T E SL  ++     +A
Sbjct: 220 ELLPGDILYIPPGFPHDG--ITLEESLSFSVGYRTASA 255


>gi|204930767|ref|ZP_03221640.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Javiana str. GA_MM04042433]
 gi|416422178|ref|ZP_11690082.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 315996572]
 gi|416431269|ref|ZP_11695492.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-1]
 gi|416441324|ref|ZP_11701536.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-3]
 gi|416444169|ref|ZP_11703523.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-4]
 gi|416451957|ref|ZP_11708624.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 515920-1]
 gi|416459031|ref|ZP_11713540.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 515920-2]
 gi|416467760|ref|ZP_11717609.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 531954]
 gi|416479512|ref|ZP_11722321.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. NC_MB110209-0054]
 gi|416484572|ref|ZP_11724260.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. OH_2009072675]
 gi|416497407|ref|ZP_11729675.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CASC_09SCPH15965]
 gi|416541577|ref|ZP_11751064.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 19N]
 gi|416576289|ref|ZP_11768976.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 81038-01]
 gi|416585917|ref|ZP_11775217.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MD_MDA09249507]
 gi|416591001|ref|ZP_11778176.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 414877]
 gi|416599038|ref|ZP_11783389.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 366867]
 gi|416607883|ref|ZP_11788877.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 413180]
 gi|416611150|ref|ZP_11790580.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 446600]
 gi|416623335|ref|ZP_11797355.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 609458-1]
 gi|416629099|ref|ZP_11799863.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 556150-1]
 gi|416639453|ref|ZP_11804522.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 609460]
 gi|416650751|ref|ZP_11810516.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 507440-20]
 gi|416665997|ref|ZP_11817148.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB101509-0077]
 gi|416685681|ref|ZP_11825039.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB102109-0047]
 gi|416707244|ref|ZP_11832342.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB111609-0052]
 gi|416714542|ref|ZP_11837860.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 2009083312]
 gi|416717025|ref|ZP_11839306.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 2009085258]
 gi|416725223|ref|ZP_11845593.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 315731156]
 gi|416733374|ref|ZP_11850420.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2009159199]
 gi|416736079|ref|ZP_11851796.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008282]
 gi|416750056|ref|ZP_11859546.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008283]
 gi|416759823|ref|ZP_11864631.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008284]
 gi|416761883|ref|ZP_11865933.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008285]
 gi|416767266|ref|ZP_11869763.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008287]
 gi|418485944|ref|ZP_13054926.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 80959-06]
 gi|418490970|ref|ZP_13057500.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035278]
 gi|418495421|ref|ZP_13061863.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035318]
 gi|418499285|ref|ZP_13065694.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035320]
 gi|418503164|ref|ZP_13069533.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035321]
 gi|418506425|ref|ZP_13072757.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035327]
 gi|418512653|ref|ZP_13078893.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Pomona str. ATCC 10729]
 gi|418527014|ref|ZP_13092971.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008286]
 gi|452120602|ref|YP_007470850.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Javiana str. CFSAN001992]
 gi|204320226|gb|EDZ05430.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Javiana str. GA_MM04042433]
 gi|322616605|gb|EFY13514.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 315996572]
 gi|322619880|gb|EFY16754.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-1]
 gi|322622448|gb|EFY19293.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-3]
 gi|322629434|gb|EFY26211.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-4]
 gi|322632930|gb|EFY29673.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 515920-1]
 gi|322636825|gb|EFY33528.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 515920-2]
 gi|322641375|gb|EFY38014.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 531954]
 gi|322645140|gb|EFY41669.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. NC_MB110209-0054]
 gi|322652303|gb|EFY48659.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. OH_2009072675]
 gi|322655655|gb|EFY51957.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CASC_09SCPH15965]
 gi|322660960|gb|EFY57190.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 19N]
 gi|322665454|gb|EFY61642.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 81038-01]
 gi|322667569|gb|EFY63730.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MD_MDA09249507]
 gi|322673637|gb|EFY69739.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 414877]
 gi|322677563|gb|EFY73627.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 366867]
 gi|322679772|gb|EFY75811.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 413180]
 gi|322687245|gb|EFY83217.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 446600]
 gi|323193063|gb|EFZ78285.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 609458-1]
 gi|323199440|gb|EFZ84533.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 556150-1]
 gi|323203628|gb|EFZ88650.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 609460]
 gi|323217805|gb|EGA02520.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB101509-0077]
 gi|323218884|gb|EGA03395.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB102109-0047]
 gi|323229608|gb|EGA13731.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB111609-0052]
 gi|323232833|gb|EGA16929.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 2009083312]
 gi|323240131|gb|EGA24175.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 2009085258]
 gi|323242882|gb|EGA26903.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 315731156]
 gi|323246798|gb|EGA30768.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2009159199]
 gi|323254302|gb|EGA38119.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008282]
 gi|323255568|gb|EGA39327.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008283]
 gi|323259402|gb|EGA43038.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008284]
 gi|323266998|gb|EGA50483.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008285]
 gi|323272102|gb|EGA55516.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008287]
 gi|366055834|gb|EHN20169.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 80959-06]
 gi|366059277|gb|EHN23551.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035318]
 gi|366063735|gb|EHN27947.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035278]
 gi|366071820|gb|EHN35914.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035320]
 gi|366074888|gb|EHN38950.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035321]
 gi|366083145|gb|EHN47072.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CT_02035327]
 gi|366083481|gb|EHN47402.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Pomona str. ATCC 10729]
 gi|366827633|gb|EHN54531.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 507440-20]
 gi|372204483|gb|EHP18010.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008286]
 gi|451909606|gb|AGF81412.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Javiana str. CFSAN001992]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|420258884|ref|ZP_14761608.1| hypothetical protein YWA314_09054 [Yersinia enterocolitica subsp.
           enterocolitica WA-314]
 gi|404513670|gb|EKA27481.1| hypothetical protein YWA314_09054 [Yersinia enterocolitica subsp.
           enterocolitica WA-314]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 93/203 (45%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  +EDG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWQVAHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMVSF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+   + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194


>gi|418858539|ref|ZP_13413153.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19470]
 gi|418863928|ref|ZP_13418464.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19536]
 gi|392831777|gb|EJA87404.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19536]
 gi|392832510|gb|EJA88130.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 19470]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTTALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|397168193|ref|ZP_10491631.1| cupin superfamily protein [Enterobacter radicincitans DSM 16656]
 gi|396089728|gb|EJI87300.1| cupin superfamily protein [Enterobacter radicincitans DSM 16656]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 77/337 (22%), Positives = 146/337 (43%), Gaps = 57/337 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  + DG+ Q     G  
Sbjct: 16  YWQKRPVVLKRGIKNFVDP-ISPDELAGLAMENEVDSR----LVSHLDGKWQV----GHG 66

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT------YQTPPTAQ 115
            P   +DH  E      +     +    Q  A+L   F       T      +  P    
Sbjct: 67  -PFQSYDHLGENNWSLLVQ---AVNHWHQPAAALMRPFRALPDWRTDDLMISFSVP--GG 120

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKTPLKQHCPH---PDLLQVDPFEAIIDEELEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           +LY+P G+ H+  ++ N  SL+ ++  ++  +  +L+          +G  D   +R L 
Sbjct: 178 ILYIPPGFPHEGYSLEN--SLNYSVG-FRAPSGRELI----------SGFADYVLQREL- 223

Query: 236 IGYLRYAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVD-LDDGVDEMGKQLMH-- 289
            G  RY+      R  P DI  + +   + L++++  L+   +  +D + E   Q  H  
Sbjct: 224 -GSQRYSDPDVPQRAHPADILPEEV---DKLREMMLNLINQPEQFNDWLGEFISQSRHEL 279

Query: 290 DALP--PVLSPEELQCSVFE-------NGLRMSQTGE 317
           D  P  P   P+E+  ++ +        GLR+ + GE
Sbjct: 280 DVAPPEPPYQPDEIYDALQQGDKLVRLGGLRVLRIGE 316


>gi|198242002|ref|YP_002215906.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Dublin str. CT_02021853]
 gi|445142397|ref|ZP_21386083.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Dublin str. SL1438]
 gi|445158953|ref|ZP_21393237.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Dublin str. HWS51]
 gi|197936518|gb|ACH73851.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Dublin str. CT_02021853]
 gi|444845219|gb|ELX70431.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Dublin str. HWS51]
 gi|444849822|gb|ELX74931.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Dublin str. SL1438]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|212556168|gb|ACJ28622.1| Transcription factor jumonji, jmjC [Shewanella piezotolerans WP3]
          Length = 376

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 5/92 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G   H D+ + FI+Q EG++ W V       +Y  + + P     E   PI
Sbjct: 113 SFATP--HGGVGAHIDNYDVFIIQGEGQRHWTV---GEKGQYAAKNNDPTTPLIEGFEPI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           + V LE GD+LY+P GY HQ  T+T   S  +
Sbjct: 168 IDVVLEKGDMLYIPPGYPHQGQTLTTAMSYSI 199


>gi|417475684|ref|ZP_12170439.1| hypothetical protein LTSERUB_3048, partial [Salmonella enterica
           subsp. enterica serovar Rubislaw str. A4-653]
 gi|353643512|gb|EHC87680.1| hypothetical protein LTSERUB_3048, partial [Salmonella enterica
           subsp. enterica serovar Rubislaw str. A4-653]
          Length = 337

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|437836341|ref|ZP_20845630.1| hypothetical protein SEEERB17_014183 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SARB17]
 gi|435299455|gb|ELO75599.1| hypothetical protein SEEERB17_014183 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SARB17]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|284007314|emb|CBA72661.1| conserved hypothetical protein [Arsenophonus nasoniae]
          Length = 380

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 7/101 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P  A G  PH D  + FI+Q  G++ W+V     + ++ P    P+  Q E    I
Sbjct: 119 SYSVP--AGGVGPHIDQYDVFIIQGLGRRHWRVGEKMTLKQHCPH---PDLLQVEPFAAI 173

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +   LEPGD+LY+P G+ H+  ++  E SL+ ++     T+
Sbjct: 174 IDEQLEPGDILYIPPGFPHEGYSI--ETSLNYSVGFRSPTS 212


>gi|417349643|ref|ZP_12128258.1| hypothetical protein SeGA_2719 [Salmonella enterica subsp. enterica
           serovar Gaminara str. A4-567]
 gi|353572329|gb|EHC36010.1| hypothetical protein SeGA_2719 [Salmonella enterica subsp. enterica
           serovar Gaminara str. A4-567]
          Length = 366

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 26  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 77  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 193 PGFPHEGYALEN 204


>gi|417373978|ref|ZP_12143873.1| hypothetical protein LTSEINV_3007, partial [Salmonella enterica
           subsp. enterica serovar Inverness str. R8-3668]
 gi|353600966|gb|EHC56706.1| hypothetical protein LTSEINV_3007, partial [Salmonella enterica
           subsp. enterica serovar Inverness str. R8-3668]
          Length = 243

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|417366084|ref|ZP_12138503.1| hypothetical protein LTSEHVI_2748 [Salmonella enterica subsp.
           enterica serovar Hvittingfoss str. A4-620]
 gi|353592862|gb|EHC50763.1| hypothetical protein LTSEHVI_2748 [Salmonella enterica subsp.
           enterica serovar Hvittingfoss str. A4-620]
          Length = 383

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 26  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 77  -PFESYDHLGESNWSLLVQAVNHWHEPAAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 193 PGFPHEGYALEN 204


>gi|437897671|ref|ZP_20849620.1| hypothetical protein SEEE5621_13520 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 6.0562-1]
 gi|435322351|gb|ELO94641.1| hypothetical protein SEEE5621_13520 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 6.0562-1]
          Length = 290

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|168264258|ref|ZP_02686231.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Hadar str. RI_05P066]
 gi|418868700|ref|ZP_13423141.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 4176]
 gi|205347283|gb|EDZ33914.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Hadar str. RI_05P066]
 gi|392837390|gb|EJA92960.1| cupin family protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 4176]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|238789087|ref|ZP_04632876.1| hypothetical protein yfred0001_42190 [Yersinia frederiksenii ATCC
           33641]
 gi|238722851|gb|EEQ14502.1| hypothetical protein yfred0001_42190 [Yersinia frederiksenii ATCC
           33641]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 89/203 (43%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ + L       D L    +E + +  +  +EDG  Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDPLSP-----DELAGLAMENEVDSRLVSHEDGSWQVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMVSF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+   + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194


>gi|167624225|ref|YP_001674519.1| cupin 4 family protein [Shewanella halifaxensis HAW-EB4]
 gi|167354247|gb|ABZ76860.1| Cupin 4 family protein [Shewanella halifaxensis HAW-EB4]
          Length = 381

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 87/210 (41%), Gaps = 36/210 (17%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K+PL I +  KN+  +L++  ++  +  D ++E ++       E G+ Q       
Sbjct: 16  EYWQKKPLVIRQGFKNF-QDLLSPDEMAGLACDEMVESRR----VYREKGEWQA------ 64

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLN---ASLQELFGCFVGAN---------TY 108
                    +    S   L    +   +Q LN    + ++L  CF             +Y
Sbjct: 65  --------EFGPFESYENLGETDWTLIVQALNNWVPAAEDLLKCFDFIPRWRLDDVMVSY 116

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
             P    G  PH D  + FI Q  G+++W+V     +  +    + P     E   PI+ 
Sbjct: 117 AVP--GGGVGPHIDLYDVFICQGSGRRRWRV---GDIGPHKEFAAHPALLHTEAFEPIID 171

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           V L PGD+LYLP G+ H   T+    S  V
Sbjct: 172 VELLPGDILYLPPGFPHDGVTLEPSMSFSV 201


>gi|417416112|ref|ZP_12159607.1| hypothetical protein LTSEMIS_2735, partial [Salmonella enterica
           subsp. enterica serovar Mississippi str. A4-633]
 gi|353620896|gb|EHC70868.1| hypothetical protein LTSEMIS_2735, partial [Salmonella enterica
           subsp. enterica serovar Mississippi str. A4-633]
          Length = 369

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 26  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 77  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 193 PGFPHEGYALEN 204


>gi|213422268|ref|ZP_03355334.1| hypothetical protein Salmonentericaenterica_32813 [Salmonella
           enterica subsp. enterica serovar Typhi str. E01-6750]
          Length = 300

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 1   HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 51

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 52  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 110

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 111 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 167

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 168 PGFPHEGYALEN 179


>gi|289806227|ref|ZP_06536856.1| hypothetical protein Salmonellaentericaenterica_18037 [Salmonella
           enterica subsp. enterica serovar Typhi str. AG3]
          Length = 359

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|16081039|ref|NP_391867.1| hypothetical protein BSU39880 [Bacillus subtilis subsp. subtilis
           str. 168]
 gi|221311960|ref|ZP_03593807.1| hypothetical protein Bsubs1_21506 [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|221316284|ref|ZP_03598089.1| hypothetical protein BsubsN3_21417 [Bacillus subtilis subsp.
           subtilis str. NCIB 3610]
 gi|221321197|ref|ZP_03602491.1| hypothetical protein BsubsJ_21360 [Bacillus subtilis subsp.
           subtilis str. JH642]
 gi|221325480|ref|ZP_03606774.1| hypothetical protein BsubsS_21516 [Bacillus subtilis subsp.
           subtilis str. SMY]
 gi|402778153|ref|YP_006632097.1| protein YxbC [Bacillus subtilis QB928]
 gi|452913057|ref|ZP_21961685.1| hypothetical protein BS732_0971 [Bacillus subtilis MB73/2]
 gi|1176977|sp|P46327.1|YXBC_BACSU RecName: Full=Uncharacterized protein YxbC
 gi|904195|dbj|BAA21597.1| yxbC [Bacillus subtilis]
 gi|2636534|emb|CAB16024.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
           str. 168]
 gi|402483332|gb|AFQ59841.1| YxbC [Bacillus subtilis QB928]
 gi|407962835|dbj|BAM56075.1| hypothetical protein BEST7613_7144 [Bacillus subtilis BEST7613]
 gi|407966847|dbj|BAM60086.1| hypothetical protein BEST7003_3885 [Bacillus subtilis BEST7003]
 gi|452118085|gb|EME08479.1| hypothetical protein BS732_0971 [Bacillus subtilis MB73/2]
          Length = 330

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/240 (24%), Positives = 110/240 (45%), Gaps = 27/240 (11%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W  +PL ++R +   + ++   +K+  +  +NV+    N  + + +    ++  +  R
Sbjct: 24  EYWPVKPL-VARGEVERFTSIPGFEKVRTL--ENVLAIYNNPVMVVGDAVIEESEGITDR 80

Query: 61  AL--PQIVWDHYREGCSIRFLNPQTYI----KPLQQLNASLQELFGCFVGANTYQTPPTA 114
            L  P    + Y +G ++ F     +I    + +++L A L+   G    A  Y      
Sbjct: 81  FLVSPAEALEWYEKGAALEFDFTDLFIPQVRRWIEKLKAELRLPAGTSSKAIVYAAK-NG 139

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFSQEEI-----GTP-- 165
            GF  H+D     I Q++G+K WK+     V   +  Y  S   +  +++     G P  
Sbjct: 140 GGFKAHFDAYTNLIFQIQGEKTWKLAKNENVSNPMQHYDLSEAPYYPDDLQSYWKGDPPK 199

Query: 166 -----ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKAL 220
                   V L PG +LYLPRG  H  ST +++ +L + I+ + + AW+DL+  A+ K L
Sbjct: 200 EDLPDAEIVNLTPGTMLYLPRGLWH--STKSDQATLALNIT-FGQPAWLDLMLAALRKKL 256


>gi|16760104|ref|NP_455721.1| hypothetical protein STY1269 [Salmonella enterica subsp. enterica
           serovar Typhi str. CT18]
 gi|29142125|ref|NP_805467.1| hypothetical protein t1691 [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|213612643|ref|ZP_03370469.1| hypothetical protein SentesTyp_09119 [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-2068]
 gi|213864845|ref|ZP_03386964.1| hypothetical protein SentesT_33730 [Salmonella enterica subsp.
           enterica serovar Typhi str. M223]
 gi|378959857|ref|YP_005217343.1| hypothetical protein STBHUCCB_17970 [Salmonella enterica subsp.
           enterica serovar Typhi str. P-stx-12]
 gi|25357094|pir||AF0646 conserved hypothetical protein STY1269 [imported] - Salmonella
           enterica subsp. enterica serovar Typhi (strain CT18)
 gi|16502398|emb|CAD08353.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhi]
 gi|29137754|gb|AAO69316.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|374353729|gb|AEZ45490.1| hypothetical protein STBHUCCB_17970 [Salmonella enterica subsp.
           enterica serovar Typhi str. P-stx-12]
          Length = 373

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|153000863|ref|YP_001366544.1| cupin 4 family protein [Shewanella baltica OS185]
 gi|160875571|ref|YP_001554887.1| cupin 4 family protein [Shewanella baltica OS195]
 gi|378708769|ref|YP_005273663.1| transcription factor jumonji jmjC domain-containing protein
           [Shewanella baltica OS678]
 gi|418025366|ref|ZP_12664345.1| Cupin, JmjC-type [Shewanella baltica OS625]
 gi|151365481|gb|ABS08481.1| Cupin 4 family protein [Shewanella baltica OS185]
 gi|160861093|gb|ABX49627.1| Cupin 4 family protein [Shewanella baltica OS195]
 gi|315267758|gb|ADT94611.1| transcription factor jumonji jmjC domain-containing protein
           [Shewanella baltica OS678]
 gi|353535350|gb|EHC04913.1| Cupin, JmjC-type [Shewanella baltica OS625]
          Length = 385

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 32/216 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K+PL I R+   ++ +L++ +++  +  D ++E ++       + GQ Q       
Sbjct: 16  EYWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR----VYQQAGQWQ-----AE 65

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN---------TYQTP 111
             P   +DH  E       +    ++ L       + L  CF             ++ TP
Sbjct: 66  FGPFDSYDHLGE------RDWTLIVQALNNWVPDAEALIQCFDFIPRWRLDDVMVSFATP 119

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
               G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+   L
Sbjct: 120 --GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFDPIIDTEL 174

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
            PGD+LY+P G+ H    +T E SL  ++     +A
Sbjct: 175 LPGDILYIPPGFPHDG--ITLEQSLSFSVGYRTASA 208


>gi|91793181|ref|YP_562832.1| cupin 4 [Shewanella denitrificans OS217]
 gi|91715183|gb|ABE55109.1| Cupin 4 [Shewanella denitrificans OS217]
          Length = 376

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI+Q EG+++W+V       E+  R + P     E   PI
Sbjct: 113 SFATP--LGGVGPHIDNYDVFIIQGEGERRWRV---GDKGEHQRRNNDPMSPMVEDFEPI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           + V L+ GD+LY+P G+ H   T+T   S  V
Sbjct: 168 IDVVLKKGDVLYIPPGFPHSGETLTMAMSYSV 199


>gi|381404919|ref|ZP_09929603.1| hypothetical protein S7A_11740 [Pantoea sp. Sc1]
 gi|380738118|gb|EIB99181.1| hypothetical protein S7A_11740 [Pantoea sp. Sc1]
          Length = 374

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/192 (22%), Positives = 87/192 (45%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  K++ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFKDFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLGENNWSLLVQAVNHWHEPSAALMRPFRFLPDWRIDDLMVSFAVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|126174567|ref|YP_001050716.1| cupin 4 family protein [Shewanella baltica OS155]
 gi|386341321|ref|YP_006037687.1| cupin [Shewanella baltica OS117]
 gi|125997772|gb|ABN61847.1| Cupin 4 family protein [Shewanella baltica OS155]
 gi|334863722|gb|AEH14193.1| Cupin 4 family protein [Shewanella baltica OS117]
          Length = 385

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 32/216 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K+PL I R+   ++ +L++ +++  +  D ++E ++       + GQ Q       
Sbjct: 16  EYWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR----VYQQAGQWQA-----E 65

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN---------TYQTP 111
             P   +DH  E       +    ++ L       + L  CF             ++ TP
Sbjct: 66  FGPFDSYDHLGE------RDWTLIVQALNNWVPDAEALIQCFDFIPRWRLDDVMVSFATP 119

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
               G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+   L
Sbjct: 120 --GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFDPIIDTEL 174

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
            PGD+LY+P G+ H    +T E SL  ++     +A
Sbjct: 175 LPGDILYIPPGFPHDG--ITLEQSLSFSVGYRTASA 208


>gi|217973180|ref|YP_002357931.1| cupin [Shewanella baltica OS223]
 gi|217498315|gb|ACK46508.1| Cupin 4 family protein [Shewanella baltica OS223]
          Length = 385

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 32/216 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K+PL I R+   ++ +L++ +++  +  D ++E ++       + GQ Q       
Sbjct: 16  EYWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR----VYQQAGQWQA-----E 65

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN---------TYQTP 111
             P   +DH  E       +    ++ L       + L  CF             ++ TP
Sbjct: 66  FGPFDSYDHLGE------RDWTLIVQALNNWVPDAEALIQCFDFIPRWRLDDVMVSFATP 119

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
               G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+   L
Sbjct: 120 --GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFEPIIDTEL 174

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
            PGD+LY+P G+ H    +T E SL  ++     +A
Sbjct: 175 LPGDILYIPPGFPHDG--ITLEQSLSFSVGYRTASA 208


>gi|312172134|emb|CBX80391.1| Uncharacterized protein ycfD [Erwinia amylovora ATCC BAA-2158]
          Length = 375

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/192 (22%), Positives = 88/192 (45%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  K++ +  I+  ++  +  +N ++ +    +  ++DG+ Q     G  
Sbjct: 16  YWQKRPVVLKRGFKHFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + +    V            G  PH
Sbjct: 67  -PFESYDHLGENNWSLLVQAVNHWHEPSAALMHPFRAIPDWRVDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKLEMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|372274110|ref|ZP_09510146.1| hypothetical protein PSL1_03385 [Pantoea sp. SL1_M5]
 gi|390433535|ref|ZP_10222073.1| hypothetical protein PaggI_01830 [Pantoea agglomerans IG1]
          Length = 374

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/197 (22%), Positives = 89/197 (45%), Gaps = 24/197 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN------TYQTPPTAQ 115
            P   +DH  E      +     +    + +A+L   F             ++  P    
Sbjct: 66  GPFESYDHLGENNWSLLVQA---VNHWHEASAALMRPFRFLPDWRVDDLMVSFAVP--GG 120

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGAGRRRWRVGEKVPLKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+  ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194


>gi|322833624|ref|YP_004213651.1| transcription factor jumonji jmjC domain-containing protein
           [Rahnella sp. Y9602]
 gi|384258802|ref|YP_005402736.1| transcription factor jumonji jmjC domain-containing protein
           [Rahnella aquatilis HX2]
 gi|321168825|gb|ADW74524.1| transcription factor jumonji jmjC domain-containing protein
           [Rahnella sp. Y9602]
 gi|380754778|gb|AFE59169.1| transcription factor jumonji jmjC domain-containing protein
           [Rahnella aquatilis HX2]
          Length = 373

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 60/274 (21%), Positives = 115/274 (41%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +   +DG+          
Sbjct: 16  YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSNQDGKWNV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E G S+       + +P   L    + +    +            G  PH
Sbjct: 66  GPFESYDHLGETGWSVLVQAVDHWHEPSSHLMTPFRHIPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD++Y+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVAMKQHCPH---PDLLQVEPFEAIIDEELEPGDIVYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H   ++ N  SL+ ++      A      + M      +G  D    R L  G  R
Sbjct: 183 PGFPHDGYSLEN--SLNYSVGFRAPNA------REM-----VSGFADFVLSREL--GSHR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           Y+    + R  P ++Q   L   + L+ ++  L+
Sbjct: 228 YSDPELVEREHPAEVQPAEL---DKLRGMMLDLI 258


>gi|373949678|ref|ZP_09609639.1| Cupin 4 family protein [Shewanella baltica OS183]
 gi|386324489|ref|YP_006020606.1| cupin [Shewanella baltica BA175]
 gi|333818634|gb|AEG11300.1| Cupin 4 family protein [Shewanella baltica BA175]
 gi|373886278|gb|EHQ15170.1| Cupin 4 family protein [Shewanella baltica OS183]
          Length = 385

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 32/216 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K+PL I R+   ++ +L++ +++  +  D ++E ++       + GQ Q       
Sbjct: 16  EYWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR----VYQQAGQWQA-----E 65

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN---------TYQTP 111
             P   +DH  E       +    ++ L       + L  CF             ++ TP
Sbjct: 66  FGPFDSYDHLGE------RDWTLIVQALNNWVPDAEALIQCFDFIPRWRLDDVMVSFATP 119

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
               G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+   L
Sbjct: 120 --GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFDPIIDTEL 174

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
            PGD+LY+P G+ H    +T E SL  ++     +A
Sbjct: 175 LPGDILYIPPGFPHDG--ITLEQSLSFSVGYRTASA 208


>gi|213027812|ref|ZP_03342259.1| hypothetical protein Salmonelentericaenterica_37965 [Salmonella
           enterica subsp. enterica serovar Typhi str. 404ty]
          Length = 259

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|377576032|ref|ZP_09805016.1| hypothetical protein YcfD [Escherichia hermannii NBRC 105704]
 gi|377542064|dbj|GAB50181.1| hypothetical protein YcfD [Escherichia hermannii NBRC 105704]
          Length = 379

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  + DG+ Q        
Sbjct: 22  HWQKRPVVLKRGFKNFTDPL-SPDELAGLAMENEVDSR----LVSHLDGKWQV-----SH 71

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L                 G  PH
Sbjct: 72  GPFESYDHLGESNWSLLVQAVNHWHEPAAALMRPFRALPDWRTDDLMISFSVPGGGVGPH 131

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 132 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVDPFDAIIDEELEPGDILYIP 188

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  +  V          I             +G  D   +R L  G  R
Sbjct: 189 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSQR 233

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           Y      AR  P DIQ   L   E ++ ++  L+
Sbjct: 234 YTDPDVPARQYPADIQPQEL---EKVRGMMLDLI 264


>gi|119775200|ref|YP_927940.1| cupin 4 [Shewanella amazonensis SB2B]
 gi|119767700|gb|ABM00271.1| cupin 4 [Shewanella amazonensis SB2B]
          Length = 372

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI+Q EG+++W V             +SP     E   PI
Sbjct: 113 SFATPQG--GVGPHIDNYDVFIIQGEGQRRWTVGPKGNYQRRGGVTTSPLIEDFE---PI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVT 191
           + V LE GD+LY+P G+ HQ  T+T
Sbjct: 168 IDVVLEKGDVLYIPPGFPHQGETLT 192


>gi|292488010|ref|YP_003530887.1| hypothetical protein EAMY_1529 [Erwinia amylovora CFBP1430]
 gi|292899228|ref|YP_003538597.1| hypothetical protein EAM_1512 [Erwinia amylovora ATCC 49946]
 gi|428784945|ref|ZP_19002436.1| hypothetical protein EaACW_1536 [Erwinia amylovora ACW56400]
 gi|291199076|emb|CBJ46187.1| conserved hypothetical protein [Erwinia amylovora ATCC 49946]
 gi|291553434|emb|CBA20479.1| Uncharacterized protein ycfD [Erwinia amylovora CFBP1430]
 gi|426276507|gb|EKV54234.1| hypothetical protein EaACW_1536 [Erwinia amylovora ACW56400]
          Length = 375

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/192 (22%), Positives = 88/192 (45%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  K++ +  I+  ++  +  +N ++ +    +  ++DG+ Q     G  
Sbjct: 16  YWQKRPVVLKRGFKHFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + +    V            G  PH
Sbjct: 67  -PFESYDHLGENNWSLLVQAVNHWHEPSAALMNPFRAIPDWRVDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
           +D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKLEMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|56751149|ref|YP_171850.1| hypothetical protein syc1140_d [Synechococcus elongatus PCC 6301]
 gi|56686108|dbj|BAD79330.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 270

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 68/139 (48%), Gaps = 8/139 (5%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W ++ +YI+     +  +L +   ++D+L  N   F+++ ++    DG+     +    
Sbjct: 20  YWAQQSVYIAGDSLRF-QSLFSWNHLNDLL--NYQTFRES-ELRFSRDGE----SLPAGD 71

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
            P +     +EG ++        +  L+ L  +L++ FG     N Y +P   QGF  HY
Sbjct: 72  NPTLWRSRLQEGATLVLNGVHHRVPALKHLATNLRQEFGYRCHINLYSSPAQQQGFDCHY 131

Query: 122 DDIEAFILQLEGKKKWKVY 140
           D  +  ILQ+EG+K+W +Y
Sbjct: 132 DTHDVLILQIEGEKEWLIY 150


>gi|167515846|ref|XP_001742264.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778888|gb|EDQ92502.1| predicted protein [Monosiga brevicollis MX1]
          Length = 476

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 82/195 (42%), Gaps = 35/195 (17%)

Query: 87  PLQQLNASLQELF-GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV 145
           PL  L  +LQ +F       + Y +   AQ    H D  +  +LQ+ G KKW +  P  +
Sbjct: 147 PLSDLELALQTVFHTTHSTVHAYISSGGAQALPYHTDSYDVIVLQVSGVKKWTICQPPSL 206

Query: 146 DEYLP-------------RYSSP----NFSQEEI-GTPILTVTLEPGDLLYLPRGYIHQA 187
           +   P             R+++P    NF+   I       +T+  GD+LYLP+  IH A
Sbjct: 207 ETAWPNPSPADLAQLYELRHANPDGCSNFNHRTIEQLNCQNLTMLAGDMLYLPKSMIHVA 266

Query: 188 STVTNEHSLHVTISVYQKTA-WIDLLEKAMPK-----------ALQAAGATDLEFRRGLP 235
            T     S H+T S+ ++   W D++ +A              +LQ A +TD      LP
Sbjct: 267 HTKPGTVSAHLTYSLDREGGMWRDVISRACSTWRLPLCPEILGSLQRAKSTD----AALP 322

Query: 236 IGYLRYAGLARGKPV 250
              L Y  +  G PV
Sbjct: 323 WLQLAYEPVRPGAPV 337


>gi|213428095|ref|ZP_03360845.1| hypothetical protein SentesTyphi_22435 [Salmonella enterica subsp.
           enterica serovar Typhi str. E02-1180]
          Length = 253

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|167523821|ref|XP_001746247.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775518|gb|EDQ89142.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2290

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 37/123 (30%), Positives = 54/123 (43%), Gaps = 17/123 (13%)

Query: 94  SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM--------- 144
           +L+  F   V  + Y T P  +   PH D  + F+ QL G K W++ +P           
Sbjct: 172 ALRAAFDTDVTQHLYATMPGGRALDPHTDGGDVFVHQLAGHKHWEICVPTTNTTCQNCTH 231

Query: 145 -VDEYLPRYSSPNF------SQEEI-GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSL 196
                L  +   +F      S E++     L +TL  GDLLYLPR  +H A T  +  S 
Sbjct: 232 GAQALLAEFERSSFQGCTSYSYEQLQNMSCLNLTLHAGDLLYLPRALVHHAWTDNSTASY 291

Query: 197 HVT 199
           H+T
Sbjct: 292 HMT 294


>gi|303249991|ref|ZP_07336193.1| hypothetical protein APP6_1402 [Actinobacillus pleuropneumoniae
           serovar 6 str. Femo]
 gi|303253163|ref|ZP_07339312.1| hypothetical protein APP2_0468 [Actinobacillus pleuropneumoniae
           serovar 2 str. 4226]
 gi|302647845|gb|EFL78052.1| hypothetical protein APP2_0468 [Actinobacillus pleuropneumoniae
           serovar 2 str. 4226]
 gi|302651054|gb|EFL81208.1| hypothetical protein APP6_1402 [Actinobacillus pleuropneumoniae
           serovar 6 str. Femo]
          Length = 225

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 17/95 (17%)

Query: 100 GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ 159
           GC   ++ Y    T   F  H+D  + F LQ++G+K+W +        + P +++P F  
Sbjct: 128 GCHTFSSLYIAFNTQSSFKSHWDSRDIFALQMQGRKRWII--------HAPTFNNPLFMH 179

Query: 160 EEIGTP---------ILTVTLEPGDLLYLPRGYIH 185
           +    P          + + LE GD+LYLPRG+ H
Sbjct: 180 KSKDMPEYNPNLDDVYMDIILEAGDILYLPRGWWH 214


>gi|90408230|ref|ZP_01216397.1| hypothetical protein PCNPT3_11519 [Psychromonas sp. CNPT3]
 gi|90310676|gb|EAS38794.1| hypothetical protein PCNPT3_11519 [Psychromonas sp. CNPT3]
          Length = 376

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 47/196 (23%), Positives = 89/196 (45%), Gaps = 25/196 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---IEFKKNIDITLYEDGQRQTMDV- 57
           +W+K+PL I +   ++ + ++  +     + + V   + ++   D    E G   + D  
Sbjct: 15  YWQKKPLLIKQGFADFQDPIMPDEIAGLAMEEEVESRLVYRDEQDKWQAEHGPFTSFDKL 74

Query: 58  --EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTA 114
             +G +L     DH+ E       + Q  I+P + L N  L +L   +   N        
Sbjct: 75  EQKGTSLLVQAVDHWHE-------DAQQLIRPFRFLPNWRLDDLMISYSSVNG------- 120

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
            G  PH D+ + FI+Q  GK+ W+V     ++E+    +  + +  +    I+ V LEPG
Sbjct: 121 -GVGPHIDNYDVFIIQGLGKRHWRVGAKTELNEFTAHAALKHCAAFD---AIIDVELEPG 176

Query: 175 DLLYLPRGYIHQASTV 190
           D+LY+P G+ H+   +
Sbjct: 177 DILYIPVGFPHEGYAI 192


>gi|417548664|ref|ZP_12199745.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           Naval-18]
 gi|417566030|ref|ZP_12216904.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           OIFC143]
 gi|395557786|gb|EJG23787.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           OIFC143]
 gi|400388963|gb|EJP52035.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           Naval-18]
          Length = 382

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 41/175 (23%), Positives = 79/175 (45%), Gaps = 18/175 (10%)

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPP 112
           +E R +  +++++ R G ++     +  IK  +     + +  G F  A T    Y    
Sbjct: 75  IEYRCISSVLYEYLRNGATL----VRNRIKN-EPFVDQISKQIGSFSEARTLTGGYAAFS 129

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTV 169
           +   +  H+D  + + +QL G+K+W +  P       P Y     +  +I  P    + V
Sbjct: 130 SKSSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKNFTDIKEPEEIYMDV 186

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
            LE GD+LY+PRG+ H    + +E + H+ ++ +  T   ++  L+  MP  L  
Sbjct: 187 ILEAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNTMPNILDC 240


>gi|307248805|ref|ZP_07530818.1| hypothetical protein appser2_17710 [Actinobacillus pleuropneumoniae
           serovar 2 str. S1536]
 gi|307253420|ref|ZP_07535291.1| hypothetical protein appser6_19140 [Actinobacillus pleuropneumoniae
           serovar 6 str. Femo]
 gi|307262553|ref|ZP_07544192.1| hypothetical protein appser12_20870 [Actinobacillus
           pleuropneumoniae serovar 12 str. 1096]
 gi|306854732|gb|EFM86922.1| hypothetical protein appser2_17710 [Actinobacillus pleuropneumoniae
           serovar 2 str. S1536]
 gi|306859099|gb|EFM91141.1| hypothetical protein appser6_19140 [Actinobacillus pleuropneumoniae
           serovar 6 str. Femo]
 gi|306867751|gb|EFM99588.1| hypothetical protein appser12_20870 [Actinobacillus
           pleuropneumoniae serovar 12 str. 1096]
          Length = 250

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 17/95 (17%)

Query: 100 GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ 159
           GC   ++ Y    T   F  H+D  + F LQ++G+K+W +        + P +++P F  
Sbjct: 153 GCHTFSSLYIAFNTQSSFKSHWDSRDIFALQMQGRKRWII--------HAPTFNNPLFMH 204

Query: 160 EEIGTP---------ILTVTLEPGDLLYLPRGYIH 185
           +    P          + + LE GD+LYLPRG+ H
Sbjct: 205 KSKDMPEYNPNLDDVYMDIILEAGDILYLPRGWWH 239


>gi|293396540|ref|ZP_06640816.1| cupin superfamily protein [Serratia odorifera DSM 4582]
 gi|291420804|gb|EFE94057.1| cupin superfamily protein [Serratia odorifera DSM 4582]
          Length = 373

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 93/203 (45%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  S         ++P +QL +  + +L   F       
Sbjct: 71  YDHLGENNWSLLVQAV-DHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVAPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+   + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194


>gi|365834386|ref|ZP_09375832.1| cupin family protein [Hafnia alvei ATCC 51873]
 gi|364569163|gb|EHM46786.1| cupin family protein [Hafnia alvei ATCC 51873]
          Length = 373

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 84/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVILKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   SI       +  P   L    ++L                 G  PH
Sbjct: 67  -PFESYDHLGENNWSILVQAVDHWHVPSSALMRPFRKLPDWRTDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKTQLRQHSPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|268589244|ref|ZP_06123465.1| cupin family protein [Providencia rettgeri DSM 1131]
 gi|291315428|gb|EFE55881.1| cupin family protein [Providencia rettgeri DSM 1131]
          Length = 374

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 52/209 (24%), Positives = 95/209 (45%), Gaps = 32/209 (15%)

Query: 2   HWEKEPLYISRKQKNYYNNL----ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           HW+K PL I +   N+ + +    +    ++D +   ++  K+ I    +  G  ++ D 
Sbjct: 16  HWQKRPLLIKKGFANFIDPISPDELAGLAMEDEVDSRLVSHKEGIWQVGH--GPFESYDH 73

Query: 58  EGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPP 112
            G      L Q V DH+    S         +KP + L +  + +L        +Y  P 
Sbjct: 74  LGEENWSILVQAV-DHWHHPSS-------ALMKPFRVLSDWRMDDLM------ISYSVP- 118

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
              G  PH D  + FI+Q EG+++W+V     + ++ P    P+  Q +    I+   +E
Sbjct: 119 -GGGVGPHLDQYDVFIIQGEGRRRWRVGEKIPMKQHCPH---PDLLQVQPFDAIIDEEME 174

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           PGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 175 PGDILYIPPGFPHEGYAI--EPSLNYSVG 201


>gi|221134859|ref|ZP_03561162.1| putative enzyme with RmlC-like domain [Glaciecola sp. HTCC2999]
          Length = 381

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 4/85 (4%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G   H D  + FI+Q +G ++W+V    M  +Y   Y  P   Q +   PI
Sbjct: 114 SYSQP--GAGVGAHIDQYDVFIVQGKGTRRWQVGDKSM--KYAKYYPHPKLQQIDEFEPI 169

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVT 191
           + V L PGD+LY+P G+ H+  ++T
Sbjct: 170 IDVELLPGDILYIPPGFPHKGQSIT 194


>gi|384126558|ref|YP_005509172.1| hypothetical protein YPD8_2100 [Yersinia pestis D182038]
 gi|262366222|gb|ACY62779.1| hypothetical protein YPD8_2100 [Yersinia pestis D182038]
          Length = 373

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/203 (23%), Positives = 94/203 (46%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  +EDG+        ++
Sbjct: 16  YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWHVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+  ++ N
Sbjct: 172 EMEPGDILYIPPGFPHEGYSLEN 194


>gi|261339429|ref|ZP_05967287.1| cupin family protein [Enterobacter cancerogenus ATCC 35316]
 gi|288318235|gb|EFC57173.1| cupin family protein [Enterobacter cancerogenus ATCC 35316]
          Length = 373

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 61/274 (22%), Positives = 116/274 (42%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGISNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++             +A       +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGF-----------RAPSGREMISGFADYVLQREL--GSYR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           Y+     AR  P DI  + L   + L+ ++  L+
Sbjct: 228 YSDPDVPAREHPADILPEEL---DKLRGMMLDLI 258


>gi|22125686|ref|NP_669109.1| hypothetical protein y1792 [Yersinia pestis KIM10+]
 gi|45441573|ref|NP_993112.1| hypothetical protein YP_1762 [Yersinia pestis biovar Microtus str.
           91001]
 gi|108807887|ref|YP_651803.1| hypothetical protein YPA_1892 [Yersinia pestis Antiqua]
 gi|108812161|ref|YP_647928.1| hypothetical protein YPN_1999 [Yersinia pestis Nepal516]
 gi|145599098|ref|YP_001163174.1| hypothetical protein YPDSF_1816 [Yersinia pestis Pestoides F]
 gi|21958601|gb|AAM85360.1|AE013782_1 hypothetical protein y1792 [Yersinia pestis KIM10+]
 gi|45436434|gb|AAS61989.1| conserved hypothetical protein [Yersinia pestis biovar Microtus
           str. 91001]
 gi|108775809|gb|ABG18328.1| hypothetical protein YPN_1999 [Yersinia pestis Nepal516]
 gi|108779800|gb|ABG13858.1| hypothetical protein YPA_1892 [Yersinia pestis Antiqua]
 gi|145210794|gb|ABP40201.1| hypothetical protein YPDSF_1816 [Yersinia pestis Pestoides F]
          Length = 391

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/203 (23%), Positives = 94/203 (46%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  +EDG+        ++
Sbjct: 34  YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWHVSHGPFES 88

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 89  FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMISF------S 134

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 135 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 189

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+  ++ N
Sbjct: 190 EMEPGDILYIPPGFPHEGYSLEN 212


>gi|338531259|ref|YP_004664593.1| cupin family protein [Myxococcus fulvus HW-1]
 gi|337257355|gb|AEI63515.1| cupin family protein [Myxococcus fulvus HW-1]
          Length = 295

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 101/239 (42%), Gaps = 25/239 (10%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           +H+++ P +           L T + ID ++ +   +         Y  G+R T     R
Sbjct: 18  EHYQRRP-FTGASAAERLQRLGTWETIDFLVEETACDVLLARQGVPY-PGERPTTAKAAR 75

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            L       + +G ++    P  +   L QL  +        +  + Y TP    GF  H
Sbjct: 76  EL-------FAQGYTLALRQPDLHHPELAQLARAFSSELHGRINLHIYCTPAGHHGFGWH 128

Query: 121 YDDIEAFILQLEGKKKW----KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
            D  E FILQ  G+K++        P  + E +P  S    +QE+  TP+ T +L  G  
Sbjct: 129 CDPEEVFILQTTGRKEYLLRENTLHPVPLPESVP--SGSLAAQEK--TPVETHSLGAGGF 184

Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           +Y+P GY H A     E +L +++ +   T  +DLL+      ++A  A+   +RR +P
Sbjct: 185 IYIPGGYWHMAQ--APEEALSISMGLMPPTL-LDLLD-----GVRATLASSPVWRRRMP 235


>gi|167526437|ref|XP_001747552.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773998|gb|EDQ87632.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2336

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 63/140 (45%), Gaps = 19/140 (13%)

Query: 91  LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
           L + ++   G     + Y TP  AQ   PH D  +  ++Q+  +K W + LP+  +  + 
Sbjct: 138 LESLIESELGIDATVHAYFTPANAQTLEPHTDPYDVVVVQVANQKHWTLCLPQTDNATVS 197

Query: 151 RYSSPNFSQEEI------GTPILT-----------VTLEPGDLLYLPRGYIHQASTVTNE 193
              +     +EI      G    T           VTL  GD +YLP+G IH A T T+ 
Sbjct: 198 LSEADRAQLQEIKRSHLDGCTTYTMSMLQPMICRNVTLHQGDSMYLPKGVIHYAVT-TDT 256

Query: 194 HSLHVTISVYQKT-AWIDLL 212
            S H+TI + +    W+D+L
Sbjct: 257 PSAHLTIGLSRTGRTWLDVL 276


>gi|24374177|ref|NP_718220.1| cupin 4 family protein [Shewanella oneidensis MR-1]
 gi|24348687|gb|AAN55664.1| cupin 4 family protein [Shewanella oneidensis MR-1]
          Length = 380

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH D+ + FI+Q EG+++W V    P+       R  +PN    E   PI+ + LE 
Sbjct: 120 GVGPHIDNYDVFIIQGEGQRRWTVGDNTPQQC-----RGGNPNSPLVEDFDPIIDMVLEK 174

Query: 174 GDLLYLPRGYIHQASTVT 191
           GD+LY+P G+ H   T+T
Sbjct: 175 GDMLYIPPGFPHCGETLT 192


>gi|72162256|ref|YP_289913.1| hypothetical protein Tfu_1857 [Thermobifida fusca YX]
 gi|71915988|gb|AAZ55890.1| hypothetical protein Tfu_1857 [Thermobifida fusca YX]
          Length = 395

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 66/145 (45%), Gaps = 3/145 (2%)

Query: 63  PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
           P+ ++   R+G S+          P++     L  L    V  N Y     + GF  H+D
Sbjct: 90  PEALYAQLRQGASLVLDGIDRIHPPIRAAADDLMRLVHERVQVNLYLIWGDSHGFNTHWD 149

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
           D + FI+Q+ G K W+V+  +    Y P     + S +     +   T+  G++L++PRG
Sbjct: 150 DHDTFIVQVAGTKHWQVH-GQGTRPY-PMKEDIDHSHQPPEGTVWEGTVRAGEVLHVPRG 207

Query: 183 YIHQASTVTNEHSLHVTISVYQKTA 207
           + H   T T + S+H+T    + T 
Sbjct: 208 WWHTV-TGTGDVSMHLTFGFTRATG 231


>gi|392548135|ref|ZP_10295272.1| RmlC-like domain-containing protein [Pseudoalteromonas rubra ATCC
           29570]
          Length = 384

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           ++ TP    G  PH D  + FI+Q +GK++W+V +P   D  L +++ + +  Q E    
Sbjct: 116 SFSTP--GGGVGPHLDQYDVFIIQGQGKRRWRVGMP---DASLKQFAQNKSLLQVEQFEA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
            +   LEPGD+LY+P G  H+   V N
Sbjct: 171 CIDAELEPGDILYIPPGCPHEGYAVEN 197


>gi|383190790|ref|YP_005200918.1| hypothetical protein Rahaq2_2952 [Rahnella aquatilis CIP 78.65 =
           ATCC 33071]
 gi|371589048|gb|AEX52778.1| hypothetical protein Rahaq2_2952 [Rahnella aquatilis CIP 78.65 =
           ATCC 33071]
          Length = 373

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 46/207 (22%), Positives = 90/207 (43%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +   +DG+          
Sbjct: 16  YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSNQDGKWNV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E G S+       + +P   L    + +    +            G  PH
Sbjct: 66  GPFESYDHLGETGWSVLVQAVDHWHEPSSHLMTPFRHIPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   LEPGD++Y+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKIAMKQHCPH---PDLLQVEPFDAIIDEELEPGDIVYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H   ++ N  SL+ ++      A
Sbjct: 183 PGFPHDGYSLEN--SLNYSVGFRAPNA 207


>gi|170023954|ref|YP_001720459.1| cupin 4 family protein [Yersinia pseudotuberculosis YPIII]
 gi|169750488|gb|ACA68006.1| Cupin 4 family protein [Yersinia pseudotuberculosis YPIII]
          Length = 373

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 48/203 (23%), Positives = 94/203 (46%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  +EDG+        ++
Sbjct: 16  YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWHVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+  ++ N
Sbjct: 172 EMEPGDILYIPPGFPHEGYSLEN 194


>gi|51596758|ref|YP_070949.1| hypothetical protein YPTB2436 [Yersinia pseudotuberculosis IP
           32953]
 gi|149366404|ref|ZP_01888438.1| hypothetical protein YPE_1629 [Yersinia pestis CA88-4125]
 gi|162418272|ref|YP_001607239.1| hypothetical protein YpAngola_A2857 [Yersinia pestis Angola]
 gi|165925645|ref|ZP_02221477.1| cupin family protein [Yersinia pestis biovar Orientalis str.
           F1991016]
 gi|165938468|ref|ZP_02227025.1| cupin family protein [Yersinia pestis biovar Orientalis str. IP275]
 gi|166008219|ref|ZP_02229117.1| cupin family protein [Yersinia pestis biovar Antiqua str. E1979001]
 gi|166211021|ref|ZP_02237056.1| cupin family protein [Yersinia pestis biovar Antiqua str.
           B42003004]
 gi|167401779|ref|ZP_02307270.1| cupin family protein [Yersinia pestis biovar Antiqua str.
           UG05-0454]
 gi|167421724|ref|ZP_02313477.1| cupin family protein [Yersinia pestis biovar Orientalis str.
           MG05-1020]
 gi|167426338|ref|ZP_02318091.1| cupin family protein [Yersinia pestis biovar Mediaevalis str.
           K1973002]
 gi|167469316|ref|ZP_02334020.1| hypothetical protein YpesF_15829 [Yersinia pestis FV-1]
 gi|186895826|ref|YP_001872938.1| cupin 4 family protein [Yersinia pseudotuberculosis PB1/+]
 gi|218928768|ref|YP_002346643.1| hypothetical protein YPO1632 [Yersinia pestis CO92]
 gi|229841619|ref|ZP_04461777.1| hypothetical protein YPH_4010 [Yersinia pestis biovar Orientalis
           str. PEXU2]
 gi|229843736|ref|ZP_04463879.1| hypothetical protein YPF_2132 [Yersinia pestis biovar Orientalis
           str. India 195]
 gi|229894480|ref|ZP_04509662.1| hypothetical protein YPS_2256 [Yersinia pestis Pestoides A]
 gi|229902482|ref|ZP_04517601.1| hypothetical protein YP516_2225 [Yersinia pestis Nepal516]
 gi|270490338|ref|ZP_06207412.1| cupin family protein [Yersinia pestis KIM D27]
 gi|294504170|ref|YP_003568232.1| hypothetical protein YPZ3_2060 [Yersinia pestis Z176003]
 gi|384122691|ref|YP_005505311.1| hypothetical protein YPD4_2101 [Yersinia pestis D106004]
 gi|384140479|ref|YP_005523181.1| hypothetical protein A1122_17720 [Yersinia pestis A1122]
 gi|384414271|ref|YP_005623633.1| hypothetical protein YPC_1734 [Yersinia pestis biovar Medievalis
           str. Harbin 35]
 gi|420546393|ref|ZP_15044386.1| cupin superfamily protein [Yersinia pestis PY-01]
 gi|420551710|ref|ZP_15049140.1| cupin superfamily protein [Yersinia pestis PY-02]
 gi|420557260|ref|ZP_15054032.1| cupin superfamily protein [Yersinia pestis PY-03]
 gi|420562800|ref|ZP_15058912.1| cupin superfamily protein [Yersinia pestis PY-04]
 gi|420567813|ref|ZP_15063454.1| cupin superfamily protein [Yersinia pestis PY-05]
 gi|420573506|ref|ZP_15068615.1| cupin superfamily protein [Yersinia pestis PY-06]
 gi|420578792|ref|ZP_15073409.1| cupin superfamily protein [Yersinia pestis PY-07]
 gi|420584154|ref|ZP_15078279.1| cupin superfamily protein [Yersinia pestis PY-08]
 gi|420589318|ref|ZP_15082928.1| cupin superfamily protein [Yersinia pestis PY-09]
 gi|420594626|ref|ZP_15087706.1| cupin superfamily protein [Yersinia pestis PY-10]
 gi|420600320|ref|ZP_15092793.1| cupin superfamily protein [Yersinia pestis PY-11]
 gi|420605779|ref|ZP_15097685.1| cupin superfamily protein [Yersinia pestis PY-12]
 gi|420611143|ref|ZP_15102530.1| cupin superfamily protein [Yersinia pestis PY-13]
 gi|420616463|ref|ZP_15107222.1| cupin superfamily protein [Yersinia pestis PY-14]
 gi|420621861|ref|ZP_15112006.1| cupin superfamily protein [Yersinia pestis PY-15]
 gi|420626880|ref|ZP_15116559.1| cupin superfamily protein [Yersinia pestis PY-16]
 gi|420632086|ref|ZP_15121252.1| cupin superfamily protein [Yersinia pestis PY-19]
 gi|420637202|ref|ZP_15125839.1| cupin superfamily protein [Yersinia pestis PY-25]
 gi|420642732|ref|ZP_15130847.1| cupin superfamily protein [Yersinia pestis PY-29]
 gi|420647938|ref|ZP_15135593.1| cupin superfamily protein [Yersinia pestis PY-32]
 gi|420653572|ref|ZP_15140655.1| cupin superfamily protein [Yersinia pestis PY-34]
 gi|420659092|ref|ZP_15145617.1| cupin superfamily protein [Yersinia pestis PY-36]
 gi|420664398|ref|ZP_15150364.1| cupin superfamily protein [Yersinia pestis PY-42]
 gi|420669342|ref|ZP_15154852.1| cupin superfamily protein [Yersinia pestis PY-45]
 gi|420674667|ref|ZP_15159700.1| cupin superfamily protein [Yersinia pestis PY-46]
 gi|420680226|ref|ZP_15164730.1| cupin superfamily protein [Yersinia pestis PY-47]
 gi|420685501|ref|ZP_15169450.1| cupin superfamily protein [Yersinia pestis PY-48]
 gi|420690685|ref|ZP_15174034.1| cupin superfamily protein [Yersinia pestis PY-52]
 gi|420696479|ref|ZP_15179106.1| cupin superfamily protein [Yersinia pestis PY-53]
 gi|420701969|ref|ZP_15183723.1| cupin superfamily protein [Yersinia pestis PY-54]
 gi|420707842|ref|ZP_15188598.1| cupin superfamily protein [Yersinia pestis PY-55]
 gi|420713175|ref|ZP_15193376.1| cupin superfamily protein [Yersinia pestis PY-56]
 gi|420718601|ref|ZP_15198116.1| cupin superfamily protein [Yersinia pestis PY-58]
 gi|420724164|ref|ZP_15202922.1| cupin superfamily protein [Yersinia pestis PY-59]
 gi|420729761|ref|ZP_15207931.1| cupin superfamily protein [Yersinia pestis PY-60]
 gi|420734818|ref|ZP_15212500.1| cupin superfamily protein [Yersinia pestis PY-61]
 gi|420740282|ref|ZP_15217421.1| cupin superfamily protein [Yersinia pestis PY-63]
 gi|420745748|ref|ZP_15222170.1| cupin superfamily protein [Yersinia pestis PY-64]
 gi|420751427|ref|ZP_15227086.1| cupin superfamily protein [Yersinia pestis PY-65]
 gi|420756820|ref|ZP_15231677.1| cupin superfamily protein [Yersinia pestis PY-66]
 gi|420762542|ref|ZP_15236426.1| cupin superfamily protein [Yersinia pestis PY-71]
 gi|420767801|ref|ZP_15241170.1| cupin superfamily protein [Yersinia pestis PY-72]
 gi|420772776|ref|ZP_15245644.1| cupin superfamily protein [Yersinia pestis PY-76]
 gi|420778224|ref|ZP_15250489.1| cupin superfamily protein [Yersinia pestis PY-88]
 gi|420789062|ref|ZP_15260036.1| cupin superfamily protein [Yersinia pestis PY-90]
 gi|420794543|ref|ZP_15264977.1| cupin superfamily protein [Yersinia pestis PY-91]
 gi|420799657|ref|ZP_15269576.1| cupin superfamily protein [Yersinia pestis PY-92]
 gi|420805005|ref|ZP_15274395.1| cupin superfamily protein [Yersinia pestis PY-93]
 gi|420810292|ref|ZP_15279170.1| cupin superfamily protein [Yersinia pestis PY-94]
 gi|420815964|ref|ZP_15284267.1| cupin superfamily protein [Yersinia pestis PY-95]
 gi|420821152|ref|ZP_15288950.1| cupin superfamily protein [Yersinia pestis PY-96]
 gi|420826242|ref|ZP_15293508.1| cupin superfamily protein [Yersinia pestis PY-98]
 gi|420831992|ref|ZP_15298708.1| cupin superfamily protein [Yersinia pestis PY-99]
 gi|420836835|ref|ZP_15303075.1| cupin superfamily protein [Yersinia pestis PY-100]
 gi|420841993|ref|ZP_15307749.1| cupin superfamily protein [Yersinia pestis PY-101]
 gi|420847632|ref|ZP_15312832.1| cupin superfamily protein [Yersinia pestis PY-102]
 gi|420853075|ref|ZP_15317580.1| cupin superfamily protein [Yersinia pestis PY-103]
 gi|420858557|ref|ZP_15322281.1| cupin superfamily protein [Yersinia pestis PY-113]
 gi|421763099|ref|ZP_16199896.1| hypothetical protein INS_08445 [Yersinia pestis INS]
 gi|51590040|emb|CAH21674.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP
           32953]
 gi|115347379|emb|CAL20277.1| conserved hypothetical protein [Yersinia pestis CO92]
 gi|149290778|gb|EDM40853.1| hypothetical protein YPE_1629 [Yersinia pestis CA88-4125]
 gi|162351087|gb|ABX85035.1| cupin family protein [Yersinia pestis Angola]
 gi|165913583|gb|EDR32203.1| cupin family protein [Yersinia pestis biovar Orientalis str. IP275]
 gi|165922754|gb|EDR39905.1| cupin family protein [Yersinia pestis biovar Orientalis str.
           F1991016]
 gi|165992601|gb|EDR44902.1| cupin family protein [Yersinia pestis biovar Antiqua str. E1979001]
 gi|166208201|gb|EDR52681.1| cupin family protein [Yersinia pestis biovar Antiqua str.
           B42003004]
 gi|166960209|gb|EDR56230.1| cupin family protein [Yersinia pestis biovar Orientalis str.
           MG05-1020]
 gi|167048884|gb|EDR60292.1| cupin family protein [Yersinia pestis biovar Antiqua str.
           UG05-0454]
 gi|167054693|gb|EDR64497.1| cupin family protein [Yersinia pestis biovar Mediaevalis str.
           K1973002]
 gi|186698852|gb|ACC89481.1| Cupin 4 family protein [Yersinia pseudotuberculosis PB1/+]
 gi|229680528|gb|EEO76625.1| hypothetical protein YP516_2225 [Yersinia pestis Nepal516]
 gi|229689344|gb|EEO81407.1| hypothetical protein YPF_2132 [Yersinia pestis biovar Orientalis
           str. India 195]
 gi|229694082|gb|EEO84130.1| hypothetical protein YPH_4010 [Yersinia pestis biovar Orientalis
           str. PEXU2]
 gi|229703499|gb|EEO90516.1| hypothetical protein YPS_2256 [Yersinia pestis Pestoides A]
 gi|262362287|gb|ACY59008.1| hypothetical protein YPD4_2101 [Yersinia pestis D106004]
 gi|270338842|gb|EFA49619.1| cupin family protein [Yersinia pestis KIM D27]
 gi|294354629|gb|ADE64970.1| hypothetical protein YPZ3_2060 [Yersinia pestis Z176003]
 gi|320014775|gb|ADV98346.1| hypothetical protein YPC_1734 [Yersinia pestis biovar Medievalis
           str. Harbin 35]
 gi|342855608|gb|AEL74161.1| hypothetical protein A1122_17720 [Yersinia pestis A1122]
 gi|391427956|gb|EIQ89991.1| cupin superfamily protein [Yersinia pestis PY-01]
 gi|391429382|gb|EIQ91241.1| cupin superfamily protein [Yersinia pestis PY-02]
 gi|391430794|gb|EIQ92463.1| cupin superfamily protein [Yersinia pestis PY-03]
 gi|391443452|gb|EIR03767.1| cupin superfamily protein [Yersinia pestis PY-04]
 gi|391444756|gb|EIR04947.1| cupin superfamily protein [Yersinia pestis PY-05]
 gi|391447828|gb|EIR07702.1| cupin superfamily protein [Yersinia pestis PY-06]
 gi|391460328|gb|EIR19043.1| cupin superfamily protein [Yersinia pestis PY-07]
 gi|391461292|gb|EIR19909.1| cupin superfamily protein [Yersinia pestis PY-08]
 gi|391463290|gb|EIR21709.1| cupin superfamily protein [Yersinia pestis PY-09]
 gi|391476420|gb|EIR33538.1| cupin superfamily protein [Yersinia pestis PY-10]
 gi|391478148|gb|EIR35099.1| cupin superfamily protein [Yersinia pestis PY-11]
 gi|391478275|gb|EIR35211.1| cupin superfamily protein [Yersinia pestis PY-12]
 gi|391492316|gb|EIR47797.1| cupin superfamily protein [Yersinia pestis PY-13]
 gi|391493193|gb|EIR48571.1| cupin superfamily protein [Yersinia pestis PY-15]
 gi|391495564|gb|EIR50646.1| cupin superfamily protein [Yersinia pestis PY-14]
 gi|391508331|gb|EIR62082.1| cupin superfamily protein [Yersinia pestis PY-19]
 gi|391508362|gb|EIR62111.1| cupin superfamily protein [Yersinia pestis PY-16]
 gi|391513157|gb|EIR66406.1| cupin superfamily protein [Yersinia pestis PY-25]
 gi|391523516|gb|EIR75820.1| cupin superfamily protein [Yersinia pestis PY-29]
 gi|391526248|gb|EIR78299.1| cupin superfamily protein [Yersinia pestis PY-34]
 gi|391527050|gb|EIR79008.1| cupin superfamily protein [Yersinia pestis PY-32]
 gi|391539357|gb|EIR90086.1| cupin superfamily protein [Yersinia pestis PY-36]
 gi|391541914|gb|EIR92423.1| cupin superfamily protein [Yersinia pestis PY-42]
 gi|391543151|gb|EIR93509.1| cupin superfamily protein [Yersinia pestis PY-45]
 gi|391557050|gb|EIS06080.1| cupin superfamily protein [Yersinia pestis PY-46]
 gi|391557449|gb|EIS06441.1| cupin superfamily protein [Yersinia pestis PY-47]
 gi|391558755|gb|EIS07609.1| cupin superfamily protein [Yersinia pestis PY-48]
 gi|391572334|gb|EIS19578.1| cupin superfamily protein [Yersinia pestis PY-52]
 gi|391572861|gb|EIS20032.1| cupin superfamily protein [Yersinia pestis PY-53]
 gi|391582579|gb|EIS28325.1| cupin superfamily protein [Yersinia pestis PY-54]
 gi|391584753|gb|EIS30243.1| cupin superfamily protein [Yersinia pestis PY-55]
 gi|391588022|gb|EIS33108.1| cupin superfamily protein [Yersinia pestis PY-56]
 gi|391601031|gb|EIS44492.1| cupin superfamily protein [Yersinia pestis PY-58]
 gi|391601620|gb|EIS45026.1| cupin superfamily protein [Yersinia pestis PY-60]
 gi|391603261|gb|EIS46467.1| cupin superfamily protein [Yersinia pestis PY-59]
 gi|391615958|gb|EIS57673.1| cupin superfamily protein [Yersinia pestis PY-61]
 gi|391616713|gb|EIS58334.1| cupin superfamily protein [Yersinia pestis PY-63]
 gi|391622502|gb|EIS63412.1| cupin superfamily protein [Yersinia pestis PY-64]
 gi|391627973|gb|EIS68120.1| cupin superfamily protein [Yersinia pestis PY-65]
 gi|391639210|gb|EIS77928.1| cupin superfamily protein [Yersinia pestis PY-71]
 gi|391639541|gb|EIS78208.1| cupin superfamily protein [Yersinia pestis PY-66]
 gi|391641459|gb|EIS79868.1| cupin superfamily protein [Yersinia pestis PY-72]
 gi|391651269|gb|EIS88463.1| cupin superfamily protein [Yersinia pestis PY-76]
 gi|391656529|gb|EIS93154.1| cupin superfamily protein [Yersinia pestis PY-88]
 gi|391664478|gb|EIT00201.1| cupin superfamily protein [Yersinia pestis PY-90]
 gi|391671297|gb|EIT06257.1| cupin superfamily protein [Yersinia pestis PY-91]
 gi|391682105|gb|EIT16011.1| cupin superfamily protein [Yersinia pestis PY-93]
 gi|391683556|gb|EIT17318.1| cupin superfamily protein [Yersinia pestis PY-92]
 gi|391684343|gb|EIT18032.1| cupin superfamily protein [Yersinia pestis PY-94]
 gi|391695963|gb|EIT28495.1| cupin superfamily protein [Yersinia pestis PY-95]
 gi|391699345|gb|EIT31546.1| cupin superfamily protein [Yersinia pestis PY-96]
 gi|391700775|gb|EIT32845.1| cupin superfamily protein [Yersinia pestis PY-98]
 gi|391709817|gb|EIT40956.1| cupin superfamily protein [Yersinia pestis PY-99]
 gi|391716764|gb|EIT47191.1| cupin superfamily protein [Yersinia pestis PY-100]
 gi|391717447|gb|EIT47808.1| cupin superfamily protein [Yersinia pestis PY-101]
 gi|391728262|gb|EIT57393.1| cupin superfamily protein [Yersinia pestis PY-102]
 gi|391731096|gb|EIT59840.1| cupin superfamily protein [Yersinia pestis PY-103]
 gi|391735673|gb|EIT63789.1| cupin superfamily protein [Yersinia pestis PY-113]
 gi|411177305|gb|EKS47320.1| hypothetical protein INS_08445 [Yersinia pestis INS]
          Length = 373

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 91/203 (44%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
           +W+K P+ + R  KN+ + L       D L    +E + +  +  +EDG+        ++
Sbjct: 16  YWQKRPVILKRGFKNFIDPLSP-----DELAGLAMENEVDSRLVSHEDGRWHVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+  ++ N
Sbjct: 172 EMEPGDILYIPPGFPHEGYSLEN 194


>gi|310639709|ref|YP_003944467.1| hypothetical protein [Paenibacillus polymyxa SC2]
 gi|386038909|ref|YP_005957863.1| hypothetical protein PPM_0219 [Paenibacillus polymyxa M1]
 gi|309244659|gb|ADO54226.1| Uncharacterized protein yxbC [Paenibacillus polymyxa SC2]
 gi|343094947|emb|CCC83156.1| uncharacterized protein yxbC [Paenibacillus polymyxa M1]
          Length = 333

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 65/164 (39%), Gaps = 24/164 (14%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ---GFAPHYDDI 124
           D Y  G ++ F     ++  L     SL++      G+        A+   GF  H+D  
Sbjct: 93  DWYERGAALEFDFSDMFLPQLHSWMDSLRKELSLPSGSGIKAIVYAAKNGGGFKAHFDAY 152

Query: 125 EAFILQLEGKKKWKVYL----------------PRMVDEYLPRYSSPNFSQEEIGTPILT 168
             FI  L+G K WK+                  P + DE    +   +   +  G  I  
Sbjct: 153 TNFIFHLQGTKTWKLLANENVTNPIQHYDLAEKPYIPDELATYWKGEHPQTDLPGADI-- 210

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           V L PG  LYLPRG  H  S+     SL++T S     AW++LL
Sbjct: 211 VNLLPGSFLYLPRGIWHSTSSTEETLSLNITFS---HPAWLELL 251


>gi|145350199|ref|XP_001419502.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579734|gb|ABO97795.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 392

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 18/118 (15%)

Query: 97  ELFGCF----VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV-----YLPRMVDE 147
           ELF CF    +        P   G  PH D  + F+LQ EG+K W V     Y P     
Sbjct: 92  ELFNCFPRWRISDIQASVSPDGGGVGPHSDHFDVFLLQAEGEKVWAVADNEEYWPDNDAA 151

Query: 148 YLPRYSS---PNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
           ++P        +F +++      + TL PGD+LYLP    H      ++  + VT+S+
Sbjct: 152 FVPECEIRVLKSFVEDD------SFTLVPGDMLYLPPKIAHNGVATNSKPGVSVTLSI 203


>gi|334122432|ref|ZP_08496470.1| cupin superfamily protein [Enterobacter hormaechei ATCC 49162]
 gi|333392039|gb|EGK63146.1| cupin superfamily protein [Enterobacter hormaechei ATCC 49162]
          Length = 373

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 61/274 (22%), Positives = 116/274 (42%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGISNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGESNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++             +A       +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGF-----------RAPSVREMISGFADYVLQREL--GSYR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           Y+     AR  P DI  + L   + L+ ++  L+
Sbjct: 228 YSDPDVPARAHPADILPEEL---DKLRGMMLDLI 258


>gi|398797523|ref|ZP_10556844.1| hypothetical protein PMI17_00519 [Pantoea sp. GM01]
 gi|398102729|gb|EJL92907.1| hypothetical protein PMI17_00519 [Pantoea sp. GM01]
          Length = 373

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPLKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+  ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194


>gi|302852113|ref|XP_002957578.1| hypothetical protein VOLCADRAFT_98689 [Volvox carteri f.
           nagariensis]
 gi|300257095|gb|EFJ41348.1| hypothetical protein VOLCADRAFT_98689 [Volvox carteri f.
           nagariensis]
          Length = 1183

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-EYLPRYS 153
           ++ L G   GAN Y +PP  QG A HYDD    +LQL+G K+W +  PR++  + LP   
Sbjct: 485 MEWLLGLPAGANVYGSPPGRQGLAAHYDDHCVLVLQLQGAKEWLLQPPRLLPAQLLPLTY 544

Query: 154 SPNFSQEEIG 163
            P      +G
Sbjct: 545 CPRLPLGPLG 554


>gi|332850080|ref|ZP_08432467.1| cupin family protein [Acinetobacter baumannii 6013150]
 gi|332871476|ref|ZP_08439989.1| cupin family protein [Acinetobacter baumannii 6013113]
 gi|332730929|gb|EGJ62235.1| cupin family protein [Acinetobacter baumannii 6013150]
 gi|332731453|gb|EGJ62744.1| cupin family protein [Acinetobacter baumannii 6013113]
          Length = 382

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 39/173 (22%), Positives = 80/173 (46%), Gaps = 14/173 (8%)

Query: 57  VEGRALPQIVWDHYREGCSI--RFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           +E R +  + +++ R G ++    ++ + ++  + +  A+  E     VG   Y    + 
Sbjct: 75  IEYRCITSVFYEYLRNGATLVRNRISNEPFVDQISKQIATFAEA-RTLVGG--YAAFSSK 131

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTL 171
             +  H+D  + + +QL G+K+W +  P       P Y     +  +I  P    + V L
Sbjct: 132 SSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKNFPDIKEPEEIYMDVIL 188

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
           E GD+LY+PRG+ H    + +E + H+ ++ +  T   ++  L+  MP  L  
Sbjct: 189 EAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNTMPNILDC 240


>gi|378579358|ref|ZP_09828026.1| cupin superfamily protein [Pantoea stewartii subsp. stewartii
           DC283]
 gi|377818075|gb|EHU01163.1| cupin superfamily protein [Pantoea stewartii subsp. stewartii
           DC283]
          Length = 374

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+  ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194


>gi|114563434|ref|YP_750947.1| cupin 4 family protein [Shewanella frigidimarina NCIMB 400]
 gi|114334727|gb|ABI72109.1| Cupin 4 family protein [Shewanella frigidimarina NCIMB 400]
          Length = 380

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 5/85 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI+Q +G+++WKV       ++  R  +PN    +   PI
Sbjct: 113 SFATP--GGGVGPHIDNYDVFIIQGDGERRWKV---GDKGQHKRRGDNPNSPLVDDFEPI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVT 191
           + V L+ GD+LY+P GY H   T++
Sbjct: 168 IDVVLQKGDVLYIPPGYPHCGETLS 192


>gi|423113827|ref|ZP_17101518.1| hypothetical protein HMPREF9689_01575 [Klebsiella oxytoca 10-5245]
 gi|376387472|gb|EHT00182.1| hypothetical protein HMPREF9689_01575 [Klebsiella oxytoca 10-5245]
          Length = 373

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 115/274 (41%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++  Y+     +L           +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           YA     +R  P DI    L   + L+D++  L+
Sbjct: 228 YADPDVPSREHPADIMPVEL---DRLRDMMLGLI 258


>gi|291617063|ref|YP_003519805.1| hypothetical protein PANA_1510 [Pantoea ananatis LMG 20103]
 gi|378767674|ref|YP_005196143.1| cupin 4 family protein [Pantoea ananatis LMG 5342]
 gi|386015450|ref|YP_005933731.1| transcription factor YcfD [Pantoea ananatis AJ13355]
 gi|386079829|ref|YP_005993354.1| transcription factor YcfD [Pantoea ananatis PA13]
 gi|291152093|gb|ADD76677.1| YcfD [Pantoea ananatis LMG 20103]
 gi|327393513|dbj|BAK10935.1| transcription factor YcfD [Pantoea ananatis AJ13355]
 gi|354989010|gb|AER33134.1| transcription factor YcfD [Pantoea ananatis PA13]
 gi|365187156|emb|CCF10106.1| cupin 4 family protein [Pantoea ananatis LMG 5342]
          Length = 374

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPLKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+  ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194


>gi|326794623|ref|YP_004312443.1| hypothetical protein Marme_1334 [Marinomonas mediterranea MMB-1]
 gi|326545387|gb|ADZ90607.1| Cupin, JmjC-type [Marinomonas mediterranea MMB-1]
          Length = 397

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 40/76 (52%)

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PHYD  + F++Q+ GK++W+V  P   ++     +  +       TP +   LEPGD+L
Sbjct: 132 GPHYDQYDVFLVQMSGKRRWQVAEPTEYEDSAVEGTQLHILDNFPSTPSIECDLEPGDIL 191

Query: 178 YLPRGYIHQASTVTNE 193
           YLP  + H    + +E
Sbjct: 192 YLPPNFAHNGRALNSE 207


>gi|317047740|ref|YP_004115388.1| cupin [Pantoea sp. At-9b]
 gi|316949357|gb|ADU68832.1| Cupin 4 family protein [Pantoea sp. At-9b]
          Length = 373

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+  ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194


>gi|167526435|ref|XP_001747551.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773997|gb|EDQ87631.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2348

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 63/140 (45%), Gaps = 19/140 (13%)

Query: 91  LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
           L + ++   G     + Y TP  AQ   PH D  +  ++Q+  +K W + LP+  +  + 
Sbjct: 152 LESLIESELGIDATVHAYFTPANAQTLEPHTDPYDVVVVQVANQKHWTLCLPQTDNATVS 211

Query: 151 RYSSPNFSQEEI------GTPILT-----------VTLEPGDLLYLPRGYIHQASTVTNE 193
              +     +EI      G    T           VTL  GD +YLP+G IH A T T+ 
Sbjct: 212 LSEADRAQLQEIKRSHLDGCTTYTMSMLQPMICRNVTLHQGDSMYLPKGVIHYAVT-TDT 270

Query: 194 HSLHVTISVYQKT-AWIDLL 212
            S H+TI + +    W+D+L
Sbjct: 271 PSAHLTIGLSRTGRTWLDVL 290


>gi|407644844|ref|YP_006808603.1| hypothetical protein O3I_018350 [Nocardia brasiliensis ATCC 700358]
 gi|407307728|gb|AFU01629.1| hypothetical protein O3I_018350 [Nocardia brasiliensis ATCC 700358]
          Length = 396

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 49/97 (50%), Gaps = 12/97 (12%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKV----YLPRMVDEY--------LPRYSSPNFSQEEIG 163
           G   H+D I+   +QL G+K W++    + P  +D +        L R  +     EEI 
Sbjct: 132 GTRAHFDAIDVITIQLTGRKTWRIAPNAFAPAPLDGWTTLETVSPLLRNYASALPPEEIP 191

Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTI 200
             +    LEPG +L+LPRGY H+ ++  +  SLH+ +
Sbjct: 192 DGVTEHVLEPGSVLHLPRGYWHETNSAGDSISLHLAL 228


>gi|398791557|ref|ZP_10552277.1| hypothetical protein PMI39_00853 [Pantoea sp. YR343]
 gi|398214702|gb|EJN01273.1| hypothetical protein PMI39_00853 [Pantoea sp. YR343]
          Length = 373

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPLKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+  ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194


>gi|238758067|ref|ZP_04619248.1| hypothetical protein yaldo0001_9460 [Yersinia aldovae ATCC 35236]
 gi|238703821|gb|EEP96357.1| hypothetical protein yaldo0001_9460 [Yersinia aldovae ATCC 35236]
          Length = 373

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 88/201 (43%), Gaps = 16/201 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K+P+ +    KN + N +     D+ L    +E + +  +  + +GQ Q  +     
Sbjct: 16  YWQKQPVVL----KNAFQNFVDPITPDE-LAGLAMEAEVDSRLVSHVNGQWQASNG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E G S+       +  P  +L    + L    +            G  PH
Sbjct: 67  -PFENFDHLGETGWSLLAQAVNHWHSPSAELVCPFRVLPDWRLDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G ++W+V     + ++ P    P     E  TPI+   LEPGD+LY+P
Sbjct: 126 IDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTIS 201
            G+ H     T+E +L+ ++ 
Sbjct: 183 PGFPHDG--FTHETALNYSVG 201


>gi|42525019|ref|NP_970399.1| hypothetical protein Bd3682 [Bdellovibrio bacteriovorus HD100]
 gi|39577230|emb|CAE81053.1| hypothetical protein Bd3682 [Bdellovibrio bacteriovorus HD100]
          Length = 308

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 54/246 (21%), Positives = 105/246 (42%), Gaps = 40/246 (16%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE-DGQRQTMDVEGR 60
           HW  EPL+I         ++   +++ D L++ +   ++ +   L + D +  ++ +E  
Sbjct: 22  HWPVEPLFIP-ATPGKLQDIFALEQMQD-LKNLISARQRKVRACLPDFDDEYSSIHLE-- 77

Query: 61  ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN----------TYQT 110
             P      YR   ++ F + Q+    +  +  +++   G   G             Y T
Sbjct: 78  --PGDALKAYRNNMTLVFDSMQSQDSTIADMLGNVRADLGLVTGGAENDLCKARSIAYAT 135

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI---- 166
           P    G   H+D    FI+Q++G K W++     V+    R+++     EE+   +    
Sbjct: 136 P-AGCGTRLHFDANANFIIQIKGTKTWRLAPNESVEFPTERFTT---GSEEMPAALEKQC 191

Query: 167 ------------LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
                       + V ++PG +L++PRGY H+  T T E SL +  +  Q T W D+  K
Sbjct: 192 HAHLIDALDEDSMKVVMKPGCVLFVPRGYWHE--TTTEEESLSLNFTFSQPT-WADVFTK 248

Query: 215 AMPKAL 220
           ++ + L
Sbjct: 249 SLQEVL 254


>gi|300312570|ref|YP_003776662.1| hypothetical protein Hsero_3273 [Herbaspirillum seropedicae SmR1]
 gi|300075355|gb|ADJ64754.1| conserved hypothetical protein [Herbaspirillum seropedicae SmR1]
          Length = 386

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 78/324 (24%), Positives = 127/324 (39%), Gaps = 54/324 (16%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W K+PL I      +   L   +  D   RD+V    ++  IT  E G         +
Sbjct: 31  DYWHKKPLLIRNAIPGFSAPLPRQELFDLSRRDDV----ESRLITHREGGWDMQHGPFNK 86

Query: 61  ALP--QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
             P  Q  W    +G ++        ++  + + +A L +L   +             G 
Sbjct: 87  LPPLKQKAWTLLVQGVNLHHPAADALLRQFRFVPDARLDDLMISYAS--------DGGGV 138

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
            PH+D  + F+LQ  G+++W++       ++E +P     +F  EE         LEPGD
Sbjct: 139 GPHFDSYDVFLLQAHGQRRWRISAQDDLTLEEGVPLKILSHFQAEE------EFVLEPGD 192

Query: 176 LLYLPRGYIHQASTVTN--EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR- 232
           +LYLP  Y H    V     +S+      YQ+                  G + LEF   
Sbjct: 193 MLYLPPHYAHDGVAVGECMTYSIGFRAPTYQEL-----------------GESFLEFMMD 235

Query: 233 --GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHD 290
              LP  Y         +P ++QAD +A    ++D L K + + D DD    +G+ L  +
Sbjct: 236 TIDLPGRYADPDLKTSARPAELQADMIA---RIRDELNK-IRFTD-DDITIFLGQHL-SE 289

Query: 291 ALPPVL---SPEELQCSVFENGLR 311
             P V     P+EL  + F  G +
Sbjct: 290 PKPSVFFEAPPQELSFARFFQGAK 313


>gi|422022984|ref|ZP_16369490.1| hypothetical protein OO7_10582 [Providencia sneebia DSM 19967]
 gi|414094714|gb|EKT56378.1| hypothetical protein OO7_10582 [Providencia sneebia DSM 19967]
          Length = 374

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q EG+++W+V     + ++ P    P+  Q +    I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGEGRRRWRVGEKHPMKQHCPH---PDLLQVQPFDAI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNYSVG 201


>gi|375260293|ref|YP_005019463.1| putative enzyme [Klebsiella oxytoca KCTC 1686]
 gi|397657374|ref|YP_006498076.1| hypothetical protein A225_2356 [Klebsiella oxytoca E718]
 gi|423102531|ref|ZP_17090233.1| hypothetical protein HMPREF9686_01137 [Klebsiella oxytoca 10-5242]
 gi|365909771|gb|AEX05224.1| putative enzyme [Klebsiella oxytoca KCTC 1686]
 gi|376388007|gb|EHT00708.1| hypothetical protein HMPREF9686_01137 [Klebsiella oxytoca 10-5242]
 gi|394345830|gb|AFN31951.1| hypothetical protein A225_2356 [Klebsiella oxytoca E718]
          Length = 373

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 115/274 (41%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++  Y+     +L           +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           YA     +R  P DI    L   + L+D++  L+
Sbjct: 228 YADPDVPSREHPADILPAEL---DRLRDMMLGLI 258


>gi|343513942|ref|ZP_08751030.1| hypothetical protein VIBRN418_07521 [Vibrio sp. N418]
 gi|342801072|gb|EGU36563.1| hypothetical protein VIBRN418_07521 [Vibrio sp. N418]
          Length = 379

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 46/197 (23%), Positives = 85/197 (43%), Gaps = 24/197 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG-- 59
           +W+K+P+ I    KN + + +     ++I     +  ++ ID     +   Q     G  
Sbjct: 15  YWQKKPVVI----KNAFKDFVDPISAEEIAG---LSLEEMIDSRFVSNKDSQWTAQHGPF 67

Query: 60  -----RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPT 113
                  LP+       + C+         I+P +QL   L  ++  C+   +       
Sbjct: 68  EESFFDDLPETHCQLIVQACNHWHSETAQLIEPFKQLPQWLFDDVMVCYSAPHG------ 121

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
             G  PH D  + FI+Q +GK++W+V   + + +Y          Q E    I+  TL+P
Sbjct: 122 --GVGPHIDQYDVFIIQGQGKREWRVG-AKDIGQYQEVIQGGALRQIEGFDSIIDETLQP 178

Query: 174 GDLLYLPRGYIHQASTV 190
           GD+LY+P G+ H+ +T+
Sbjct: 179 GDMLYIPPGFPHEGTTL 195


>gi|419957457|ref|ZP_14473523.1| cupin [Enterobacter cloacae subsp. cloacae GS1]
 gi|388607615|gb|EIM36819.1| cupin [Enterobacter cloacae subsp. cloacae GS1]
          Length = 373

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 61/274 (22%), Positives = 116/274 (42%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGISNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++             +A       +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGF-----------RAPSGREMISGFADYVLQREL--GSYR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           Y+     AR  P DI  + L   + L+ ++  L+
Sbjct: 228 YSDPDVPAREHPADILPEEL---DKLRGMMLDLI 258


>gi|402842981|ref|ZP_10891384.1| cupin domain protein, PF06172 family [Klebsiella sp. OBRC7]
 gi|402278367|gb|EJU27431.1| cupin domain protein, PF06172 family [Klebsiella sp. OBRC7]
          Length = 373

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 115/274 (41%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++  Y+     +L           +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           YA     +R  P DI    L   + L+D++  L+
Sbjct: 228 YADPDVPSREHPADILPAEL---DRLRDMMLGLI 258


>gi|226330501|ref|ZP_03806019.1| hypothetical protein PROPEN_04419 [Proteus penneri ATCC 35198]
 gi|225201296|gb|EEG83650.1| cupin family protein [Proteus penneri ATCC 35198]
          Length = 373

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q +G+++W+V     + ++ P    P+  Q +    I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGQGRRRWRVGEKFPMKQHCPH---PDLLQVDPFEAI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +   LEPGD+LY+P G+ H+   +  E SL+ ++      A
Sbjct: 169 IDEELEPGDILYIPPGFPHEGYAI--EESLNYSVGFRSPNA 207


>gi|213648227|ref|ZP_03378280.1| hypothetical protein SentesTy_13599 [Salmonella enterica subsp.
           enterica serovar Typhi str. J185]
          Length = 294

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|89074234|ref|ZP_01160724.1| hypothetical protein SKA34_13465 [Photobacterium sp. SKA34]
 gi|89049938|gb|EAR55472.1| hypothetical protein SKA34_13465 [Photobacterium sp. SKA34]
          Length = 377

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 58/131 (44%), Gaps = 16/131 (12%)

Query: 68  DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAF 127
           +H+ EG +         + P +QL   L +         +Y TP    G  PH D  + F
Sbjct: 85  NHWHEGAA-------QLVTPFRQLPGWLFDDLMI-----SYSTP--GGGVGPHIDQYDVF 130

Query: 128 ILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQA 187
           I+Q  GK+ W+V   R  D+Y   +      Q +   PI+   L+ GD+LY+P G+ H  
Sbjct: 131 IVQGSGKRHWRVGEKR--DDYEEEFRHSALRQIKGFEPIIDEVLDTGDILYIPPGFPHDG 188

Query: 188 STVTNEHSLHV 198
            +V    S  V
Sbjct: 189 YSVETSMSFSV 199


>gi|88860119|ref|ZP_01134758.1| putative enzyme with RmlC-like domain [Pseudoalteromonas tunicata
           D2]
 gi|88818113|gb|EAR27929.1| putative enzyme with RmlC-like domain [Pseudoalteromonas tunicata
           D2]
          Length = 382

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 89/199 (44%), Gaps = 27/199 (13%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFK---KNIDITLYEDGQRQTMDV- 57
           +W+K+PL I +  +N+  + I+  ++  + ++  IE +      D      G  +  D+ 
Sbjct: 18  YWQKKPLLIKQGFRNF-QDPISPDELAGLAQEECIESRIITNTNDKWKAHHGPFEEFDLL 76

Query: 58  ---EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
                  L Q V DH+    S + L P  +I      N  + +L   F       TP   
Sbjct: 77  TATHSTLLVQAV-DHWHPD-SAQLLEPFRFIP-----NWRIDDLMISF------STP--G 121

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
            G  PH D  + FI+Q EGK+ W+V L  + ++E +P     +  Q      ++   LEP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGLAEQSLEEVIP---CKDLLQITAFEAVIDAVLEP 178

Query: 174 GDLLYLPRGYIHQASTVTN 192
           GD+LY+P G  H+   V N
Sbjct: 179 GDILYIPPGCPHEGYAVEN 197


>gi|415949261|ref|ZP_11556860.1| Cupin 4 family protein [Herbaspirillum frisingense GSF30]
 gi|407757773|gb|EKF67697.1| Cupin 4 family protein [Herbaspirillum frisingense GSF30]
          Length = 373

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 127/321 (39%), Gaps = 48/321 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W K+PL I R     ++  +  +++ D+ R   +E +    IT  EDG         +
Sbjct: 18  DYWHKKPLLI-RNAIPGFSPPLPRQELFDLARHEDVESRL---ITHREDGWDMQHGPFNK 73

Query: 61  ALP--QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
             P  Q  W    +G ++        ++  + + +A L +L        +Y T     G 
Sbjct: 74  LPPLKQKAWTLLVQGVNLHHPAADALLRQFRFVPDARLDDLM------ISYAT--DGGGV 125

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
            PH+D  + F+LQ  G+++W++       ++E +P     +F  EE         LEPGD
Sbjct: 126 GPHFDSYDVFLLQAHGQRRWRISAQNDLTLEEGVPLKILSHFQPEE------EFILEPGD 179

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR--- 232
           +LYLP  Y H    V    +  V                    A Q  G + LEF     
Sbjct: 180 MLYLPPHYAHDGVAVGECMTYSVGFRA---------------PAYQELGESFLEFMMDTI 224

Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
            LP  Y         +P ++QAD   M   ++D L K + + D DD    +G+ L     
Sbjct: 225 DLPGRYADPDLKTSARPAELQAD---MITRIRDELNK-IRFTD-DDITIFLGEHLSEPKP 279

Query: 293 PPVLSP--EELQCSVFENGLR 311
                P  EE+  + F  G +
Sbjct: 280 SVFFEPRGEEISFARFFQGAK 300


>gi|354613741|ref|ZP_09031647.1| Cupin 4 family protein [Saccharomonospora paurometabolica YIM
           90007]
 gi|353221894|gb|EHB86226.1| Cupin 4 family protein [Saccharomonospora paurometabolica YIM
           90007]
          Length = 392

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 10/116 (8%)

Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFSQEE 161
           V  NTY T     GF  H+DD    I+Q+ G K+W+V  P R+   Y  R + PN   E 
Sbjct: 130 VQVNTYLTTGNTAGFDLHWDDHPTVIVQIAGSKQWEVRGPSRLAPLY--RDTGPN--TEP 185

Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH---SLHVTISVYQKTA--WIDLL 212
               + +  L  G+++ +PRG+ H A+   ++    SLH+T  +   T   WI  L
Sbjct: 186 PDEVVWSGALRAGEVISIPRGWWHSATRTDHDDDRLSLHLTFGMTPHTGIDWISWL 241


>gi|343512857|ref|ZP_08749972.1| hypothetical protein VIS19158_15204 [Vibrio scophthalmi LMG 19158]
 gi|342794297|gb|EGU30070.1| hypothetical protein VIS19158_15204 [Vibrio scophthalmi LMG 19158]
          Length = 379

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 46/197 (23%), Positives = 85/197 (43%), Gaps = 24/197 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG-- 59
           +W+K+P+ I    KN + + +     ++I     +  ++ ID     +   Q     G  
Sbjct: 15  YWQKKPVVI----KNAFKDFVDPISAEEIAG---LSLEEMIDSRFVTNKDSQWTAQHGPF 67

Query: 60  -----RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPT 113
                  LP+       + C+         I+P +QL   L  ++  C+   +       
Sbjct: 68  EESFFDDLPETHCQLIVQACNHWHSETAQLIEPFKQLPQWLFDDVMVCYSAPHG------ 121

Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
             G  PH D  + FI+Q +GK++W+V   + + +Y          Q E    I+  TL+P
Sbjct: 122 --GVGPHIDQYDVFIIQGQGKREWRVG-AKDIGQYQEVIQGGALRQIEGFDSIIDETLQP 178

Query: 174 GDLLYLPRGYIHQASTV 190
           GD+LY+P G+ H+ +T+
Sbjct: 179 GDMLYIPPGFPHEGTTL 195


>gi|317491837|ref|ZP_07950272.1| cupin superfamily protein [Enterobacteriaceae bacterium 9_2_54FAA]
 gi|316920271|gb|EFV41595.1| cupin superfamily protein [Enterobacteriaceae bacterium 9_2_54FAA]
          Length = 373

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 84/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVILKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   SI       +  P   L    ++L                 G  PH
Sbjct: 67  -PFESYDHLGENNWSILVQAVDHWHVPSSALMRPFRKLPDWRTDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKTPLRQHSPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|119944720|ref|YP_942400.1| cupin 4 family protein [Psychromonas ingrahamii 37]
 gi|119863324|gb|ABM02801.1| Cupin 4 family protein [Psychromonas ingrahamii 37]
          Length = 375

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 52/96 (54%), Gaps = 9/96 (9%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPN-FSQEEIGTP 165
           +Y TP    G  PH D+ + FI+Q  GK+ W+V         LP +++ +     E    
Sbjct: 113 SYSTP--KGGVGPHIDNYDVFIIQGLGKRHWRVGDKGA----LPEFAAHDALKHCESFDA 166

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           I+ V LEPGD+LY+P GY H+  ++  E SL+ +I 
Sbjct: 167 IIDVELEPGDILYIPAGYPHEGYSI--ETSLNYSIG 200


>gi|307131932|ref|YP_003883948.1| hypothetical protein Dda3937_02402 [Dickeya dadantii 3937]
 gi|306529461|gb|ADM99391.1| hypothetical protein Dda3937_02402 [Dickeya dadantii 3937]
          Length = 381

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V    ++ ++ P    P     +  TPI+   LEPGD
Sbjct: 129 GVGPHIDQYDVFIIQGMGRRRWRVGDKLLMRQFCPH---PALLHVDPFTPIIDEELEPGD 185

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 186 ILYIPPGFPHDGFT 199


>gi|332162076|ref|YP_004298653.1| hypothetical protein YE105_C2454 [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|386308002|ref|YP_006004058.1| hypothetical protein [Yersinia enterocolitica subsp. palearctica
           Y11]
 gi|418240503|ref|ZP_12867042.1| hypothetical protein IOK_03636 [Yersinia enterocolitica subsp.
           palearctica PhRBD_Ye1]
 gi|433549953|ref|ZP_20505997.1| FIG002776: hypothetical protein [Yersinia enterocolitica IP 10393]
 gi|318606124|emb|CBY27622.1| hypothetical protein Y11_05591 [Yersinia enterocolitica subsp.
           palearctica Y11]
 gi|325666306|gb|ADZ42950.1| hypothetical protein YE105_C2454 [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|330860075|emb|CBX70401.1| uncharacterized protein ycfD [Yersinia enterocolitica W22703]
 gi|351780134|gb|EHB22219.1| hypothetical protein IOK_03636 [Yersinia enterocolitica subsp.
           palearctica PhRBD_Ye1]
 gi|431789088|emb|CCO69037.1| FIG002776: hypothetical protein [Yersinia enterocolitica IP 10393]
          Length = 373

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 90/203 (44%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  K++ + L       D L    +E + +  +  +EDG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKHFIDPLSP-----DELAGLAMENEVDSRLVSHEDGRWQVAHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMVSF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+   + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194


>gi|424799207|ref|ZP_18224749.1| FIG002776: hypothetical protein [Cronobacter sakazakii 696]
 gi|423234928|emb|CCK06619.1| FIG002776: hypothetical protein [Cronobacter sakazakii 696]
          Length = 373

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 81/342 (23%), Positives = 139/342 (40%), Gaps = 48/342 (14%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+    DV    
Sbjct: 16  YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGK---WDVSHG- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  +  V          I             +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSQR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
           Y+     AR  P DI     A  E L+D++  L+   D D+  +  G+ +         S
Sbjct: 228 YSDPDVPARECPADIVP---AEVEKLRDMMLDLIR--DPDNFNEWFGEFITQ-------S 275

Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
             EL  S  E      Q  E+Y+A +   DT VRL     +R
Sbjct: 276 RHELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313


>gi|238785736|ref|ZP_04629710.1| hypothetical protein yberc0001_15190 [Yersinia bercovieri ATCC
           43970]
 gi|238713376|gb|EEQ05414.1| hypothetical protein yberc0001_15190 [Yersinia bercovieri ATCC
           43970]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     E  TPI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|237808290|ref|YP_002892730.1| cupin [Tolumonas auensis DSM 9187]
 gi|237500551|gb|ACQ93144.1| Cupin 4 family protein [Tolumonas auensis DSM 9187]
          Length = 381

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 5/92 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G ++W+V      ++++   ++P     E   PI
Sbjct: 113 SYSVP--GGGVGPHIDQYDVFIIQGSGSRRWRVGADTPAEQFV---ATPGLLHVEQFEPI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           + V L+ GD+LY+P G+ H    +T   S  +
Sbjct: 168 IDVELQSGDILYIPPGFPHDGYAITEAMSYSI 199


>gi|425067746|ref|ZP_18470862.1| hypothetical protein HMPREF1311_00902 [Proteus mirabilis WGLW6]
 gi|425072904|ref|ZP_18476010.1| hypothetical protein HMPREF1310_02343 [Proteus mirabilis WGLW4]
 gi|404596678|gb|EKA97198.1| hypothetical protein HMPREF1310_02343 [Proteus mirabilis WGLW4]
 gi|404600946|gb|EKB01371.1| hypothetical protein HMPREF1311_00902 [Proteus mirabilis WGLW6]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q +G+++W+V     + ++ P    P+  Q +    I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHCPH---PDLLQVDPFDAI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +   LEPGD+LY+P G+ H+   +  E SL+ ++      A
Sbjct: 169 IDEELEPGDILYIPPGFPHEGYAI--EESLNYSVGFRAPNA 207


>gi|417555762|ref|ZP_12206830.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           Naval-81]
 gi|421457572|ref|ZP_15906907.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           IS-123]
 gi|400206662|gb|EJO37635.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           IS-123]
 gi|400389792|gb|EJP56840.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
           Naval-81]
          Length = 382

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/178 (21%), Positives = 80/178 (44%), Gaps = 24/178 (13%)

Query: 57  VEGRALPQIVWDHYREGCSI--RFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           +E R +  +++++ R G ++    +  + ++  + +  A+  E     VG   Y    + 
Sbjct: 75  IEYRCITSVLYEYLRNGATLVRNRIKNEPFVDQISKQIANFAEA-RTLVGG--YAAFSSK 131

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP--------I 166
             +  H+D  + + +QL G+K+W +  P         +  P F Q+    P         
Sbjct: 132 SSYKSHWDTRDVYAVQLLGRKRWILRKPN--------FEFPLFMQQTKNFPDIKEPEEIY 183

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
           + V LE GD+LY+PRG+ H    + +E + H+ ++ +  T   ++  L+  MP  L  
Sbjct: 184 MDVILEAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNIMPSILDC 240


>gi|422009020|ref|ZP_16356003.1| hypothetical protein OOC_13169 [Providencia rettgeri Dmel1]
 gi|414092838|gb|EKT54510.1| hypothetical protein OOC_13169 [Providencia rettgeri Dmel1]
          Length = 374

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 52/209 (24%), Positives = 94/209 (44%), Gaps = 32/209 (15%)

Query: 2   HWEKEPLYISRKQKNYYNNL----ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
           HW+K PL I +   N+ + +    +    ++D +   ++  K+ I    +  G  ++ D 
Sbjct: 16  HWQKRPLLIKKGFANFIDPISPDELAGLAMEDEVDSRLVSHKEGIWQVGH--GPFESYDH 73

Query: 58  EGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPP 112
            G      L Q V DH+    S         +KP + L +  + +L        +Y  P 
Sbjct: 74  LGEENWSILVQAV-DHWHHPSS-------ALMKPFRVLSDWRMDDLM------ISYSVP- 118

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
              G  PH D  + FI+Q EG++ W+V     + ++ P    P+  Q +    I+   +E
Sbjct: 119 -GGGVGPHLDQYDVFIIQGEGRRHWRVGEKIPMKQHCPH---PDLLQVQPFDAIIDEEME 174

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           PGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 175 PGDILYIPPGFPHEGYAI--EPSLNYSVG 201


>gi|197284768|ref|YP_002150640.1| hypothetical protein PMI0883 [Proteus mirabilis HI4320]
 gi|227357774|ref|ZP_03842123.1| cupin 4 family protein [Proteus mirabilis ATCC 29906]
 gi|194682255|emb|CAR41987.1| conserved hypothetical protein [Proteus mirabilis HI4320]
 gi|227162103|gb|EEI47117.1| cupin 4 family protein [Proteus mirabilis ATCC 29906]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q +G+++W+V     + ++ P    P+  Q +    I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHCPH---PDLLQVDPFDAI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +   LEPGD+LY+P G+ H+   +  E SL+ ++      A
Sbjct: 169 IDEELEPGDILYIPPGFPHEGYAI--EESLNYSVGFRAPNA 207


>gi|399021420|ref|ZP_10723526.1| hypothetical protein PMI16_04492 [Herbaspirillum sp. CF444]
 gi|398091873|gb|EJL82299.1| hypothetical protein PMI16_04492 [Herbaspirillum sp. CF444]
          Length = 374

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/198 (24%), Positives = 87/198 (43%), Gaps = 29/198 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD---- 56
           D+W K+PL I R+    +  L++ + + D+   + +E +      + + G+   MD    
Sbjct: 19  DYWHKKPLLI-RQAIPGFTPLLSRQALFDMASRDDVESR-----LIQQHGEHWKMDHGPL 72

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
            +  AL Q  W    +G ++        ++  + + +A L +L   +             
Sbjct: 73  SKLPALKQKEWTMLVQGVNLHDDAADELLRRFRFIPDARLDDLMISYAS--------DGG 124

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR---MVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
           G  PH+D  + F+LQ  G+++W++   +   +VD  +P     NF  E+         LE
Sbjct: 125 GVGPHFDSYDVFLLQAHGRRRWRISAQKDLSLVDG-MPLKILKNFQSEQ------EFVLE 177

Query: 173 PGDLLYLPRGYIHQASTV 190
           PGD+LYLP  Y H    V
Sbjct: 178 PGDMLYLPPHYAHDGVAV 195


>gi|238799321|ref|ZP_04642763.1| hypothetical protein ymoll0001_40500 [Yersinia mollaretii ATCC
           43969]
 gi|238716837|gb|EEQ08711.1| hypothetical protein ymoll0001_40500 [Yersinia mollaretii ATCC
           43969]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     E  TPI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|417792259|ref|ZP_12439641.1| hypothetical protein CSE899_17031 [Cronobacter sakazakii E899]
 gi|429115173|ref|ZP_19176091.1| FIG002776: hypothetical protein [Cronobacter sakazakii 701]
 gi|449308617|ref|YP_007440973.1| cupin family protein [Cronobacter sakazakii SP291]
 gi|333953663|gb|EGL71583.1| hypothetical protein CSE899_17031 [Cronobacter sakazakii E899]
 gi|426318302|emb|CCK02204.1| FIG002776: hypothetical protein [Cronobacter sakazakii 701]
 gi|449098650|gb|AGE86684.1| cupin family protein [Cronobacter sakazakii SP291]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 81/342 (23%), Positives = 139/342 (40%), Gaps = 48/342 (14%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+    DV    
Sbjct: 16  YWQKRPVVLKRGIKNFIDPL-SPDELAGLAMENEVDSR----LVSHQDGK---WDVSHG- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  +  V          I             +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSHR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
           Y+     AR  P DI     A  E L+D++  L+   D D+  +  G+ +         S
Sbjct: 228 YSDPDVPARECPADIVP---AEVEKLRDMMLDLIR--DPDNFNEWFGEFITQ-------S 275

Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
             EL  S  E      Q  E+Y+A +   DT VRL     +R
Sbjct: 276 RHELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313


>gi|423119781|ref|ZP_17107465.1| hypothetical protein HMPREF9690_01787 [Klebsiella oxytoca 10-5246]
 gi|376397477|gb|EHT10109.1| hypothetical protein HMPREF9690_01787 [Klebsiella oxytoca 10-5246]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 115/274 (41%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + +       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFVDPISP-----DELAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLSENNWSLLVQAVNNWHEPTAALMQPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVDPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++  Y+     +L           +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           YA     +R  P DI    L   + L+D++  L+
Sbjct: 228 YADPDVPSRDHPADILPAEL---DRLRDMMLGLI 258


>gi|422013902|ref|ZP_16360518.1| hypothetical protein OOA_04067 [Providencia burhodogranariea DSM
           19968]
 gi|414101924|gb|EKT63520.1| hypothetical protein OOA_04067 [Providencia burhodogranariea DSM
           19968]
          Length = 374

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 50/95 (52%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q EG++ W+V     + ++ P    P+  Q E    I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGEGRRHWRVGEKVPMKQHCPH---PDLLQVEPFDAI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNYSVG 201


>gi|423107855|ref|ZP_17095550.1| hypothetical protein HMPREF9687_01101 [Klebsiella oxytoca 10-5243]
 gi|376386588|gb|EHS99299.1| hypothetical protein HMPREF9687_01101 [Klebsiella oxytoca 10-5243]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 115/274 (41%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++  Y+     +L           +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           YA     +R  P DI    L   + L+D++  L+
Sbjct: 228 YADPDVPSREHPADILPVEL---DRLRDMMLGLI 258


>gi|421728564|ref|ZP_16167717.1| putative enzyme [Klebsiella oxytoca M5al]
 gi|423128639|ref|ZP_17116318.1| hypothetical protein HMPREF9694_05330 [Klebsiella oxytoca 10-5250]
 gi|376393121|gb|EHT05782.1| hypothetical protein HMPREF9694_05330 [Klebsiella oxytoca 10-5250]
 gi|410370728|gb|EKP25456.1| putative enzyme [Klebsiella oxytoca M5al]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|238754406|ref|ZP_04615762.1| hypothetical protein yruck0001_18110 [Yersinia ruckeri ATCC 29473]
 gi|238707439|gb|EEP99800.1| hypothetical protein yruck0001_18110 [Yersinia ruckeri ATCC 29473]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 90/203 (44%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R  KN+ + +       D L    +E + +  +  ++DG+ Q       +
Sbjct: 16  YWQKRPVILKRGFKNFIDPISP-----DELAGLAMESEVDSRLVSHQDGRWQVGHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L        +Y 
Sbjct: 71  YDHLGETNWSLLVQAV-DHWHEPSA-------ALMRPFRALSDWRMDDLM------ISYS 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH D  + FI+Q  G+++W+V     + ++ P    PN  Q      I+  
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGSGRRQWRVGEKVPMKQHCPH---PNLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+   + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194


>gi|156934387|ref|YP_001438303.1| hypothetical protein ESA_02218 [Cronobacter sakazakii ATCC BAA-894]
 gi|389841361|ref|YP_006343445.1| cupin family protein [Cronobacter sakazakii ES15]
 gi|156532641|gb|ABU77467.1| hypothetical protein ESA_02218 [Cronobacter sakazakii ATCC BAA-894]
 gi|387851837|gb|AFJ99934.1| cupin family protein [Cronobacter sakazakii ES15]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 81/342 (23%), Positives = 139/342 (40%), Gaps = 48/342 (14%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+    DV    
Sbjct: 16  YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGK---WDVSHG- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  +  V          I             +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSHR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
           Y+     AR  P DI     A  E L+D++  L+   D D+  +  G+ +         S
Sbjct: 228 YSDPDVPARECPADIVP---AEVEKLRDMMLDLIR--DPDNFNEWFGEFITQ-------S 275

Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
             EL  S  E      Q  E+Y+A +   DT VRL     +R
Sbjct: 276 RHELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313


>gi|429119199|ref|ZP_19179929.1| FIG002776: hypothetical protein [Cronobacter sakazakii 680]
 gi|426326310|emb|CCK10666.1| FIG002776: hypothetical protein [Cronobacter sakazakii 680]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 81/342 (23%), Positives = 139/342 (40%), Gaps = 48/342 (14%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+    DV    
Sbjct: 16  YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGK---WDVSHG- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  +  V          I             +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSHR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
           Y+     AR  P DI     A  E L+D++  L+   D D+  +  G+ +         S
Sbjct: 228 YSDPDVPARECPADIVP---AEVEKLRDMMLDLIR--DPDNFNEWFGEFITQ-------S 275

Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
             EL  S  E      Q  E+Y+A +   DT VRL     +R
Sbjct: 276 RHELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313


>gi|188026123|ref|ZP_02960948.2| hypothetical protein PROSTU_02934 [Providencia stuartii ATCC 25827]
 gi|188021702|gb|EDU59742.1| cupin family protein [Providencia stuartii ATCC 25827]
          Length = 392

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q EG+++W+V     + ++ P    P+  Q +    I
Sbjct: 132 SYSVP--GGGVGPHLDQYDVFIIQGEGRRRWRVGEKVPMKQHCPH---PDLLQVQPFDAI 186

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 187 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNYSVG 219


>gi|153947083|ref|YP_001400583.1| cupin family protein [Yersinia pseudotuberculosis IP 31758]
 gi|152958578|gb|ABS46039.1| cupin family protein [Yersinia pseudotuberculosis IP 31758]
          Length = 373

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 94/203 (46%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  +E+G+        ++
Sbjct: 16  YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHENGRWHVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P + L +  + +L   F       
Sbjct: 71  FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRSLSDWRMDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+LY+P G+ H+  ++ N
Sbjct: 172 EMEPGDILYIPPGFPHEGYSLEN 194


>gi|392308693|ref|ZP_10271227.1| RmlC-like domain-containing protein [Pseudoalteromonas citrea NCIMB
           1889]
          Length = 384

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           +Y TP    G  PH D  + FI+Q +GK+ W+V LP   D  L +++ + +  Q E    
Sbjct: 116 SYSTP--HGGVGPHLDQYDVFIIQGQGKRHWRVGLP---DANLKQFAQNKSLLQVEQFDA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
            +   LEPGD+LY+P G  H+   V N
Sbjct: 171 EIDAILEPGDILYIPPGCPHEGYAVEN 197


>gi|209695412|ref|YP_002263341.1| hypothetical protein VSAL_I1952 [Aliivibrio salmonicida LFI1238]
 gi|208009364|emb|CAQ79637.1| conserved hypothetical protein [Aliivibrio salmonicida LFI1238]
          Length = 390

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/195 (25%), Positives = 88/195 (45%), Gaps = 26/195 (13%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIE--FKKNIDITL-YEDG--QRQTM 55
           ++W+K+P+ I    +N+  + +T +++  +  +N ++  F  N++     E G    +  
Sbjct: 14  EYWQKKPVIIKGGFENF-QDPVTPEELAGLTLENEVDSGFVSNLNGEWKAEHGPLTEELF 72

Query: 56  DVEGRALPQIV---WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
           D  G     I+    +H+ EG +  F       KP +Q+   L +         +Y  P 
Sbjct: 73  DTLGETNWSIIVQAANHWHEGAAELF-------KPFKQMPNWLFDDVMI-----SYSVPD 120

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
              G  PH D  + FI+Q +GK+ W+V     + EY   +      Q     PI+   LE
Sbjct: 121 G--GVGPHIDQYDVFIIQGQGKRHWRV---GDIGEYEEEHRHGTLKQITGFDPIIDQILE 175

Query: 173 PGDLLYLPRGYIHQA 187
           PGD+LY+P G+ H  
Sbjct: 176 PGDILYIPPGFPHDG 190


>gi|345298830|ref|YP_004828188.1| cupin [Enterobacter asburiae LF7a]
 gi|345092767|gb|AEN64403.1| Cupin 4 family protein [Enterobacter asburiae LF7a]
          Length = 373

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 61/274 (22%), Positives = 115/274 (41%), Gaps = 35/274 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGISNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  SL+ ++             +A       +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGF-----------RAPSGREMISGFADYVLQREL--GSYR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
           Y+     AR  P DI    L   + L+ ++  L+
Sbjct: 228 YSDPDVPAREHPADILPQEL---DKLRGMMLDLI 258


>gi|323456451|gb|EGB12318.1| hypothetical protein AURANDRAFT_61265 [Aureococcus anophagefferens]
          Length = 390

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/127 (31%), Positives = 57/127 (44%), Gaps = 12/127 (9%)

Query: 88  LQQLNASLQELFGCFVGANTYQTPPTAQGFAP-HYDDIEAFILQLEGKKKWKVYLPRMVD 146
           +  L+ +  + FG     N Y T   A+  AP H D     ++Q  G K+W+V+ P   D
Sbjct: 38  VAALSLAAVDAFGLPACVNLYATGAGARVSAPPHTDTQRVLVMQCHGNKRWRVWRPPRPD 97

Query: 147 EYL---PRYSSPNFSQEEIG----TPILTVTLEPGDLLYLPRGYIHQAST----VTNEHS 195
                 P          E+G    TP   V L PGD LY+P G+ H   T    V +E S
Sbjct: 98  RRPGADPLARGKGDDVLELGELEETPAADVVLTPGDCLYVPAGWPHTTDTLECAVDDEPS 157

Query: 196 LHVTISV 202
           +H+T+ V
Sbjct: 158 VHLTLGV 164


>gi|386741704|ref|YP_006214883.1| hypothetical protein S70_01480 [Providencia stuartii MRSN 2154]
 gi|384478397|gb|AFH92192.1| hypothetical protein S70_01480 [Providencia stuartii MRSN 2154]
          Length = 374

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q EG+++W+V     + ++ P    P+  Q +    I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGEGRRRWRVGEKVPMKQHCPH---PDLLQVQPFDAI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNYSVG 201


>gi|219120835|ref|XP_002185649.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582498|gb|ACI65119.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 626

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 19/107 (17%)

Query: 119 PHYDDIEAFILQLEGKKKWKVYLP--------------RMVDEYLPRYSSPNFSQEEIGT 164
           PH D  +  ++Q  G+K WKVY P                 D+ LP Y      + ++G 
Sbjct: 246 PHTDKQDVAVVQTSGRKHWKVYSPPNPAMKPTVDIFARGKGDDSLPLY----ILESDLGC 301

Query: 165 PIL-TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWID 210
            +L   TL PGD++++P  + H  STVT + S H   +    T  ID
Sbjct: 302 QLLLETTLNPGDVMFVPAAFPHTTSTVTEDDSTHADKTSIHLTLGID 348


>gi|193078048|gb|ABO12976.2| hypothetical protein A1S_2559 [Acinetobacter baumannii ATCC 17978]
          Length = 382

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/173 (22%), Positives = 80/173 (46%), Gaps = 14/173 (8%)

Query: 57  VEGRALPQIVWDHYREGCSI--RFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           +E R +  +++ + R G ++    ++ + ++  + +  A+  E     VG   Y    + 
Sbjct: 75  IEYRCITSVLYKYLRNGATLVRNRISNEPFVDQISKQIATFAEA-RTLVGG--YAAFSSK 131

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTL 171
             +  H+D  + + +QL G+K+W +  P       P Y     +  +I  P    + V L
Sbjct: 132 SSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKNFPDIKEPEEIYMDVIL 188

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
           E GD+LY+PRG+ H    + +E + H+ ++ +  T   ++  L+  MP  L  
Sbjct: 189 EAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNIMPGILDC 240


>gi|386313839|ref|YP_006010004.1| transcription factor jumonji jmjC domain protein [Shewanella
           putrefaciens 200]
 gi|319426464|gb|ADV54538.1| transcription factor jumonji jmjC domain protein [Shewanella
           putrefaciens 200]
          Length = 386

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/218 (23%), Positives = 94/218 (43%), Gaps = 38/218 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K+PL I R+    + NL++ +++  +  D ++E ++     +Y+   +   +     
Sbjct: 17  YWQKKPLVI-RQGFKQFQNLVSPEELAGLAMDELVESRR-----VYQQAGQWHAEFG--- 67

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVGAN---------TYQ 109
                +D Y +      L  + +   +Q LN  +   + L  CF             +Y 
Sbjct: 68  ----PFDSYEK------LGERDWTLIVQALNNWVPDAEALIQCFDFIPRWRFDDVMVSYA 117

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
           TP    G  PH D  + FI Q  G+++W+V       E+    + P     E   PI+  
Sbjct: 118 TP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHREFA---AHPALLHTEAFEPIIDT 172

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
            L PGD+LY+P G+ H    +T E SL  ++     +A
Sbjct: 173 ELLPGDILYIPPGFPHDG--ITLEESLSFSVGYRTASA 208


>gi|395232008|ref|ZP_10410260.1| cupin [Enterobacter sp. Ag1]
 gi|394733512|gb|EJF33129.1| cupin [Enterobacter sp. Ag1]
          Length = 373

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/192 (22%), Positives = 82/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + +       D L    +E + +  +  ++ G+ Q        
Sbjct: 16  YWQKRPVVLKRGFKNFIDPISP-----DELAGLAMESEVDSRLVSHQGGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFQSYDHLGENNWSLLVQAVNNWHEPSSALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVAMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|354597768|ref|ZP_09015785.1| Cupin 4 family protein [Brenneria sp. EniD312]
 gi|353675703|gb|EHD21736.1| Cupin 4 family protein [Brenneria sp. EniD312]
          Length = 373

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/218 (25%), Positives = 95/218 (43%), Gaps = 38/218 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K+P+ I R     +N  I     D+ L    +E + +  +  + DG+ Q       +
Sbjct: 16  YWQKQPVIIKRG----FNRFIDPISPDE-LAGLALENEVDSRLVSHNDGRWQVSHGPFDS 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E            ++P +QL +  L +L   F       
Sbjct: 71  FDHLGENNWSLLVQAV-DHWHEPS-------LALMRPFRQLPDWRLDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+  
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
            +EPGD+LY+P G+ H   ++ N  SL+ ++      A
Sbjct: 172 EMEPGDILYIPPGFPHDGYSLEN--SLNYSVGFRAPNA 207


>gi|146310156|ref|YP_001175230.1| cupin 4 family protein [Enterobacter sp. 638]
 gi|145317032|gb|ABP59179.1| Cupin 4 family protein [Enterobacter sp. 638]
          Length = 373

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G ++W+V     + ++ P    P     +  TPI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFTPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   LEPGD+LY+P G+ H     T+E +L+ ++ 
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201


>gi|429090897|ref|ZP_19153599.1| FIG002776: hypothetical protein [Cronobacter dublinensis 1210]
 gi|426744552|emb|CCJ79712.1| FIG002776: hypothetical protein [Cronobacter dublinensis 1210]
          Length = 373

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/198 (21%), Positives = 88/198 (44%), Gaps = 14/198 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ +        
Sbjct: 16  YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFQSYDHLGENNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHV 198
            G+ H+  ++ N  +  V
Sbjct: 183 PGFPHEGYSLENSMNYSV 200


>gi|238758416|ref|ZP_04619593.1| hypothetical protein yaldo0001_18150 [Yersinia aldovae ATCC 35236]
 gi|238703320|gb|EEP95860.1| hypothetical protein yaldo0001_18150 [Yersinia aldovae ATCC 35236]
          Length = 373

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 92/203 (45%), Gaps = 36/203 (17%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
           +W+K P+ + R   N+ + L +  ++  +  +N ++ +    +  +EDG+ Q       +
Sbjct: 16  YWQKRPVILKRGFANFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWQVAHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G      L Q V DH+ E  +         ++P ++L +  + +L   F       
Sbjct: 71  FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRKLSDWRMDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
            +EPGD+ Y+P G+ H+   + N
Sbjct: 172 EMEPGDIFYIPPGFPHEGYALEN 194


>gi|261346016|ref|ZP_05973660.1| cupin family protein [Providencia rustigianii DSM 4541]
 gi|282565901|gb|EFB71436.1| cupin family protein [Providencia rustigianii DSM 4541]
          Length = 374

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/202 (25%), Positives = 85/202 (42%), Gaps = 16/202 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K PL I    K  +   I     D+ L    +E + +  +  ++DG  Q     G  
Sbjct: 16  HWQKRPLLI----KQGFTRFIDPISPDE-LAGLAMEDEVDSRLVSHKDGNWQV--AHG-- 66

Query: 62  LPQIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH   E  SI       +  P  QL    + L    +            G  PH
Sbjct: 67  -PFKSYDHLGDENWSILVQAVDHWHHPSAQLMKPFRVLSDWRMDDLMISYSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q EG+++W+V     + ++ P    P+  Q      I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGEGRRRWRVGEKIPMKQHCPH---PDLLQVHPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISV 202
            G+ H+   +  E SL+ ++  
Sbjct: 183 PGFPHEGYAI--EPSLNFSVGF 202


>gi|389700105|ref|ZP_10185156.1| hypothetical protein LepocDRAFT_00002390 [Leptothrix ochracea L12]
 gi|388591250|gb|EIM31508.1| hypothetical protein LepocDRAFT_00002390 [Leptothrix ochracea L12]
          Length = 361

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 12/82 (14%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL----PRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F++Q+ G ++W++   RM D  L    P     NF  EE         L
Sbjct: 115 GVGPHFDSYDVFLIQVHGCRRWRI--GRMDDAVLLPNLPVKIIANFRFEE------EWLL 166

Query: 172 EPGDLLYLPRGYIHQASTVTNE 193
           EPGD+LYLP G+ H    V  E
Sbjct: 167 EPGDMLYLPPGWAHDGDAVGGE 188


>gi|343507743|ref|ZP_08745127.1| hypothetical protein VII00023_15076 [Vibrio ichthyoenteri ATCC
           700023]
 gi|342796556|gb|EGU32231.1| hypothetical protein VII00023_15076 [Vibrio ichthyoenteri ATCC
           700023]
          Length = 379

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 42/75 (56%), Gaps = 1/75 (1%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +GK++WKV   + V +Y          Q E    I+  TL PGD
Sbjct: 122 GVGPHIDQYDVFIVQGQGKRQWKVG-AKDVGQYQEVIQGGALRQIEGFDAIIDDTLMPGD 180

Query: 176 LLYLPRGYIHQASTV 190
           +LY+P G+ H+ +T+
Sbjct: 181 MLYIPPGFPHEGTTL 195


>gi|386348868|ref|YP_006047116.1| cupin region [Rhodospirillum rubrum F11]
 gi|346717304|gb|AEO47319.1| cupin region [Rhodospirillum rubrum F11]
          Length = 262

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 60/125 (48%), Gaps = 11/125 (8%)

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPR-YS 153
           L++  G  V  +   +  T  G  PHYDD +  I+Q+ G+K WK+    +   +  R  S
Sbjct: 77  LEQALGAVVTIDAIGSFGTVCGLPPHYDDRDLIIVQVAGRKHWKILGTPVEGPWRKRTMS 136

Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLE 213
            P+   +E         ++ GD+L++P G  HQ   V  E SLH+  ++  +    DLL+
Sbjct: 137 VPDTVTDEF-------VMQGGDMLFVPAGLYHQC--VPLEPSLHLG-ALITRPCGADLLK 186

Query: 214 KAMPK 218
              P+
Sbjct: 187 MVQPR 191


>gi|354723018|ref|ZP_09037233.1| Cupin 4 family protein [Enterobacter mori LMG 25706]
          Length = 373

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|295096186|emb|CBK85276.1| Uncharacterized conserved protein [Enterobacter cloacae subsp.
           cloacae NCTC 9394]
          Length = 373

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 74/159 (46%), Gaps = 24/159 (15%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           +LY+P G+ H+  ++ N  SL+ ++             +A       +G  D   +R L 
Sbjct: 178 ILYIPPGFPHEGYSLEN--SLNYSVGF-----------RAPSGREMISGFADYVLQREL- 223

Query: 236 IGYLRYAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
            G  RY+     AR  P DI  + L   + L+ ++  L+
Sbjct: 224 -GSYRYSDPDVPAREHPADILPEEL---DKLRGMMLDLI 258


>gi|444378829|ref|ZP_21178019.1| hypothetical protein D515_2835 [Enterovibrio sp. AK16]
 gi|443677171|gb|ELT83862.1| hypothetical protein D515_2835 [Enterovibrio sp. AK16]
          Length = 377

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 3/75 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI Q  GK++W+V       EY          Q E   PI+   LEPGD
Sbjct: 120 GVGPHIDQYDVFICQGMGKRRWRV---GDKGEYKDANHHTGLRQIEGFDPIIDEILEPGD 176

Query: 176 LLYLPRGYIHQASTV 190
           +LY+P G+ H+ +T+
Sbjct: 177 ILYIPPGFPHEGNTI 191


>gi|429083173|ref|ZP_19146219.1| FIG002776: hypothetical protein [Cronobacter condimenti 1330]
 gi|426547988|emb|CCJ72260.1| FIG002776: hypothetical protein [Cronobacter condimenti 1330]
          Length = 373

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/200 (22%), Positives = 84/200 (42%), Gaps = 14/200 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L       D L    +E + +  +  ++DG+ +        
Sbjct: 16  YWQKRPVVLKRGIKNFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWEV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFQSYDHLSENNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKLPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTI 200
            G+ H+  ++ N  +  V  
Sbjct: 183 PGFPHEGYSLENSMNYSVGF 202


>gi|262274518|ref|ZP_06052329.1| hypothetical protein VHA_001495 [Grimontia hollisae CIP 101886]
 gi|262221081|gb|EEY72395.1| hypothetical protein VHA_001495 [Grimontia hollisae CIP 101886]
          Length = 377

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 3/75 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI Q  GK++W+V       EY          Q E   PI+   LEPGD
Sbjct: 120 GVGPHIDQYDVFICQGMGKRRWRV---GDKGEYKDANHHTGLRQIEGFDPIIDEILEPGD 176

Query: 176 LLYLPRGYIHQASTV 190
           +LY+P G+ H+ +T+
Sbjct: 177 ILYIPPGFPHEGNTI 191


>gi|296102852|ref|YP_003612998.1| cupin 4 family protein [Enterobacter cloacae subsp. cloacae ATCC
           13047]
 gi|295057311|gb|ADF62049.1| cupin 4 family protein [Enterobacter cloacae subsp. cloacae ATCC
           13047]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFDGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|126642594|ref|YP_001085578.1| hypothetical protein A1S_2559 [Acinetobacter baumannii ATCC 17978]
          Length = 328

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/173 (22%), Positives = 80/173 (46%), Gaps = 14/173 (8%)

Query: 57  VEGRALPQIVWDHYREGCSI--RFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
           +E R +  +++ + R G ++    ++ + ++  + +  A+  E     VG   Y    + 
Sbjct: 21  IEYRCITSVLYKYLRNGATLVRNRISNEPFVDQISKQIATFAEA-RTLVGG--YAAFSSK 77

Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTL 171
             +  H+D  + + +QL G+K+W +  P       P Y     +  +I  P    + V L
Sbjct: 78  SSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKNFPDIKEPEEIYMDVIL 134

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
           E GD+LY+PRG+ H    + +E + H+ ++ +  T   ++  L+  MP  L  
Sbjct: 135 EAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNIMPGILDC 186


>gi|90578715|ref|ZP_01234525.1| hypothetical protein VAS14_03398 [Photobacterium angustum S14]
 gi|90439548|gb|EAS64729.1| hypothetical protein VAS14_03398 [Photobacterium angustum S14]
          Length = 377

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 4/92 (4%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y TP    G  PH D  + FI+Q  GK+ W+V   R  D+Y   +      Q +   PI
Sbjct: 112 SYSTP--GGGVGPHIDQYDVFIVQGSGKRHWRVGEKR--DDYEEEFRHSALRQIKGFEPI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           +   LE GD+LY+P  + H   +V    S  V
Sbjct: 168 IDEVLETGDILYIPPSFPHDGYSVETSMSFSV 199


>gi|146311297|ref|YP_001176371.1| cupin 4 family protein [Enterobacter sp. 638]
 gi|145318173|gb|ABP60320.1| Cupin 4 family protein [Enterobacter sp. 638]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGISNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMHPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|392978593|ref|YP_006477181.1| cupin 4 family protein [Enterobacter cloacae subsp. dissolvens SDM]
 gi|392324526|gb|AFM59479.1| cupin 4 family protein [Enterobacter cloacae subsp. dissolvens SDM]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFDGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|417951601|ref|ZP_12594697.1| hypothetical protein VISP3789_08223 [Vibrio splendidus ATCC 33789]
 gi|342804385|gb|EGU39703.1| hypothetical protein VISP3789_08223 [Vibrio splendidus ATCC 33789]
          Length = 265

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 54/128 (42%), Gaps = 25/128 (19%)

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS 154
           L EL  C   AN Y +  +  GF  H DD    + QL G+K+W                 
Sbjct: 117 LSELLHCESSANLYVSYGSVSGFGYHKDDHHVLVKQLAGQKRW----------------- 159

Query: 155 PNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
                 + G     + L  GDLL++P+G  H   + +++ S HVT S+   T   D L+ 
Sbjct: 160 ------QFGESQDDIVLNEGDLLFVPKGVAHNPVSESHK-SFHVTYSIVTPT-LSDFLKW 211

Query: 215 AMPKALQA 222
           A   A QA
Sbjct: 212 ACLPAHQA 219


>gi|336249790|ref|YP_004593500.1| cupin [Enterobacter aerogenes KCTC 2190]
 gi|444352000|ref|YP_007388144.1| FIG002776: hypothetical protein [Enterobacter aerogenes EA1509E]
 gi|334735846|gb|AEG98221.1| cupin [Enterobacter aerogenes KCTC 2190]
 gi|443902830|emb|CCG30604.1| FIG002776: hypothetical protein [Enterobacter aerogenes EA1509E]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + +       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPISP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNNWHEPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|153949056|ref|YP_001401948.1| cupin family protein [Yersinia pseudotuberculosis IP 31758]
 gi|152960551|gb|ABS48012.1| cupin family protein [Yersinia pseudotuberculosis IP 31758]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E + + ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETAFNYSVG 201


>gi|395235964|ref|ZP_10414164.1| hypothetical protein A936_19833 [Enterobacter sp. Ag1]
 gi|394729270|gb|EJF29266.1| hypothetical protein A936_19833 [Enterobacter sp. Ag1]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D+ + FI+Q  G ++W+V     + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDNYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|365970021|ref|YP_004951582.1| protein YcfD [Enterobacter cloacae EcWSU1]
 gi|365748934|gb|AEW73161.1| YcfD [Enterobacter cloacae EcWSU1]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 82/192 (42%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFSNFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|170025286|ref|YP_001721791.1| cupin 4 family protein [Yersinia pseudotuberculosis YPIII]
 gi|169751820|gb|ACA69338.1| Cupin 4 family protein [Yersinia pseudotuberculosis YPIII]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E + + ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETAFNYSVG 201


>gi|51595407|ref|YP_069598.1| hypothetical protein YPTB1059 [Yersinia pseudotuberculosis IP
           32953]
 gi|186894433|ref|YP_001871545.1| cupin 4 family protein [Yersinia pseudotuberculosis PB1/+]
 gi|51588689|emb|CAH20299.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP
           32953]
 gi|186697459|gb|ACC88088.1| Cupin 4 family protein [Yersinia pseudotuberculosis PB1/+]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E + + ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETAFNYSVG 201


>gi|378978170|ref|YP_005226311.1| hypothetical protein KPHS_20110 [Klebsiella pneumoniae subsp.
           pneumoniae HS11286]
 gi|419971737|ref|ZP_14487167.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH1]
 gi|419980908|ref|ZP_14496189.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH2]
 gi|419986030|ref|ZP_14501166.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH4]
 gi|419989732|ref|ZP_14504707.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH5]
 gi|419994848|ref|ZP_14509656.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH6]
 gi|420004050|ref|ZP_14518691.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH7]
 gi|420009672|ref|ZP_14524153.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH8]
 gi|420014645|ref|ZP_14528950.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH9]
 gi|420018901|ref|ZP_14533096.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH10]
 gi|420026743|ref|ZP_14540743.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH11]
 gi|420031526|ref|ZP_14545347.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH12]
 gi|420038306|ref|ZP_14551954.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH14]
 gi|420044346|ref|ZP_14557827.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH16]
 gi|420050150|ref|ZP_14563452.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH17]
 gi|420055659|ref|ZP_14568824.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH18]
 gi|420061310|ref|ZP_14574300.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH19]
 gi|420066084|ref|ZP_14578886.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH20]
 gi|420070819|ref|ZP_14583469.1| hypothetical protein KPNIH21_06765 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH21]
 gi|420078931|ref|ZP_14591383.1| hypothetical protein KPNIH22_18232 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH22]
 gi|420083076|ref|ZP_14595364.1| hypothetical protein KPNIH23_10251 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH23]
 gi|421908479|ref|ZP_16338320.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST258-K26BO]
 gi|421916787|ref|ZP_16346353.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST258-K28BO]
 gi|428150986|ref|ZP_18998739.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST512-K30BO]
 gi|428939574|ref|ZP_19012680.1| hypothetical protein MTE2_08535 [Klebsiella pneumoniae VA360]
 gi|364517581|gb|AEW60709.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae HS11286]
 gi|397344617|gb|EJJ37749.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH2]
 gi|397349397|gb|EJJ42491.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH4]
 gi|397352675|gb|EJJ45753.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH1]
 gi|397367497|gb|EJJ60108.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH7]
 gi|397368621|gb|EJJ61226.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH6]
 gi|397369231|gb|EJJ61833.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH5]
 gi|397377661|gb|EJJ69887.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH9]
 gi|397380630|gb|EJJ72809.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH8]
 gi|397389562|gb|EJJ81495.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH10]
 gi|397394775|gb|EJJ86496.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH11]
 gi|397400395|gb|EJJ92040.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH12]
 gi|397404459|gb|EJJ95967.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH14]
 gi|397411745|gb|EJK02994.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH17]
 gi|397412113|gb|EJK03353.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH16]
 gi|397421512|gb|EJK12524.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH18]
 gi|397430354|gb|EJK21049.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH20]
 gi|397431179|gb|EJK21859.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH19]
 gi|397441323|gb|EJK31703.1| hypothetical protein KPNIH21_06765 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH21]
 gi|397444237|gb|EJK34520.1| hypothetical protein KPNIH22_18232 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH22]
 gi|397451596|gb|EJK41678.1| hypothetical protein KPNIH23_10251 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH23]
 gi|410117614|emb|CCM80945.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST258-K26BO]
 gi|410120897|emb|CCM88978.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST258-K28BO]
 gi|426303771|gb|EKV65932.1| hypothetical protein MTE2_08535 [Klebsiella pneumoniae VA360]
 gi|427539047|emb|CCM94877.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST512-K30BO]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|429108397|ref|ZP_19170266.1| FIG002776: hypothetical protein [Cronobacter malonaticus 681]
 gi|426295120|emb|CCJ96379.1| FIG002776: hypothetical protein [Cronobacter malonaticus 681]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 77/343 (22%), Positives = 137/343 (39%), Gaps = 50/343 (14%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ +        
Sbjct: 16  YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
            G+ H+  ++ N  +  V          I             +G  D   +R L  G  R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSQR 227

Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV-EYVDLDDGVDEMGKQLMHDALPPVL 296
           Y+     AR  P DI     A  + L+D++  L+ +  + ++   E   Q  H       
Sbjct: 228 YSDPDVPARECPADIVP---AEVDKLRDMMLDLIRDPANFNEWFGEFITQSRH------- 277

Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
              EL  S  E      Q  E+Y+A +   DT VRL     +R
Sbjct: 278 ---ELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313


>gi|206580898|ref|YP_002239240.1| cupin family protein [Klebsiella pneumoniae 342]
 gi|288936099|ref|YP_003440158.1| cupin [Klebsiella variicola At-22]
 gi|206569956|gb|ACI11732.1| cupin family protein [Klebsiella pneumoniae 342]
 gi|288890808|gb|ADC59126.1| Cupin 4 family protein [Klebsiella variicola At-22]
          Length = 373

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|401676209|ref|ZP_10808195.1| cupin 4 family protein [Enterobacter sp. SST3]
 gi|400216695|gb|EJO47595.1| cupin 4 family protein [Enterobacter sp. SST3]
          Length = 373

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|262043159|ref|ZP_06016297.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259039547|gb|EEW40680.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 373

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|290510849|ref|ZP_06550219.1| ycfD [Klebsiella sp. 1_1_55]
 gi|289777565|gb|EFD85563.1| ycfD [Klebsiella sp. 1_1_55]
          Length = 373

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|238894175|ref|YP_002918909.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae
           NTUH-K2044]
 gi|365139668|ref|ZP_09346015.1| hypothetical protein HMPREF1024_02046 [Klebsiella sp. 4_1_44FAA]
 gi|386034318|ref|YP_005954231.1| hypothetical protein KPN2242_08750 [Klebsiella pneumoniae KCTC
           2242]
 gi|402781343|ref|YP_006636889.1| hypothetical protein A79E_3094 [Klebsiella pneumoniae subsp.
           pneumoniae 1084]
 gi|424830114|ref|ZP_18254842.1| cupin family protein [Klebsiella pneumoniae subsp. pneumoniae Ecl8]
 gi|425077232|ref|ZP_18480335.1| hypothetical protein HMPREF1305_03145 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW1]
 gi|425080959|ref|ZP_18484056.1| hypothetical protein HMPREF1306_01707 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW2]
 gi|425087865|ref|ZP_18490958.1| hypothetical protein HMPREF1307_03314 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW3]
 gi|428933734|ref|ZP_19007278.1| hypothetical protein MTE1_13144 [Klebsiella pneumoniae JHCK1]
 gi|449059308|ref|ZP_21737005.1| hypothetical protein G057_15496 [Klebsiella pneumoniae hvKP1]
 gi|238546491|dbj|BAH62842.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae
           NTUH-K2044]
 gi|339761446|gb|AEJ97666.1| putative enzyme [Klebsiella pneumoniae KCTC 2242]
 gi|363654281|gb|EHL93196.1| hypothetical protein HMPREF1024_02046 [Klebsiella sp. 4_1_44FAA]
 gi|402542233|gb|AFQ66382.1| hypothetical protein A79E_3094 [Klebsiella pneumoniae subsp.
           pneumoniae 1084]
 gi|405592941|gb|EKB66393.1| hypothetical protein HMPREF1305_03145 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW1]
 gi|405604589|gb|EKB77710.1| hypothetical protein HMPREF1307_03314 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW3]
 gi|405604769|gb|EKB77876.1| hypothetical protein HMPREF1306_01707 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW2]
 gi|414707539|emb|CCN29243.1| cupin family protein [Klebsiella pneumoniae subsp. pneumoniae Ecl8]
 gi|426304522|gb|EKV66664.1| hypothetical protein MTE1_13144 [Klebsiella pneumoniae JHCK1]
 gi|448874884|gb|EMB09914.1| hypothetical protein G057_15496 [Klebsiella pneumoniae hvKP1]
          Length = 373

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|83592160|ref|YP_425912.1| cupin region [Rhodospirillum rubrum ATCC 11170]
 gi|83575074|gb|ABC21625.1| Cupin region [Rhodospirillum rubrum ATCC 11170]
          Length = 301

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 49/183 (26%), Positives = 79/183 (43%), Gaps = 28/183 (15%)

Query: 95  LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPR-YS 153
           L++  G  V  +   +  T  G  PHYDD +  I+Q+ G+K WK+    +   +  R  S
Sbjct: 116 LEQALGAVVTIDAIGSFGTVCGLPPHYDDRDLIIVQVAGRKHWKILGTPVEGPWRKRTMS 175

Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLE 213
            P+   +E         ++ GD+L++P G  HQ   V  E SLH+   + +     DLL+
Sbjct: 176 VPDTVTDEF-------VMQGGDMLFVPAGLYHQC--VPLEPSLHLGALITRPCG-ADLLK 225

Query: 214 KAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEY 273
              P+       TD E    L +G             D + D       LK+ L +LV+ 
Sbjct: 226 MVQPR----WETTDPELAARLYVG-------------DGETDLQQQDARLKEALIRLVQD 268

Query: 274 VDL 276
           +D+
Sbjct: 269 MDV 271


>gi|401763245|ref|YP_006578252.1| cupin [Enterobacter cloacae subsp. cloacae ENHKU01]
 gi|400174779|gb|AFP69628.1| cupin [Enterobacter cloacae subsp. cloacae ENHKU01]
          Length = 373

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  I+  ++  +  +N ++ +    +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|197336245|ref|YP_002156409.1| cupin family protein [Vibrio fischeri MJ11]
 gi|197317735|gb|ACH67182.1| cupin family protein [Vibrio fischeri MJ11]
          Length = 382

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 48/199 (24%), Positives = 84/199 (42%), Gaps = 38/199 (19%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG- 59
           ++W+K+P+ I    +N+  + +T +++  +  +N      ++D     +   Q     G 
Sbjct: 14  EYWQKKPVIIKGGFENF-QDPVTPEELAGLTLEN------DVDSRFISNANNQWKAEHGP 66

Query: 60  ------RALPQIVW-------DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN 106
                   L +  W       +H+ EG +  F       KP +Q+   L +         
Sbjct: 67  LSEELYETLGETNWSIIVQAANHWHEGAAELF-------KPFKQMPNWLFDDIMI----- 114

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q +GK+ W+V     + EY   +      Q     PI
Sbjct: 115 SYSVP--HGGVGPHIDQYDVFIIQGQGKRHWRV---GDIGEYQEEHRHSALKQITGFEPI 169

Query: 167 LTVTLEPGDLLYLPRGYIH 185
           +   LEPGD+LY+P G+ H
Sbjct: 170 IDQILEPGDILYIPPGFPH 188


>gi|260597497|ref|YP_003210068.1| hypothetical protein CTU_17050 [Cronobacter turicensis z3032]
 gi|260216674|emb|CBA30020.1| Uncharacterized protein ycfD [Cronobacter turicensis z3032]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/198 (21%), Positives = 87/198 (43%), Gaps = 14/198 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ +        
Sbjct: 16  YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHV 198
            G+ H+  ++ N  +  V
Sbjct: 183 PGFPHEGYSLENSMNYSV 200


>gi|423139625|ref|ZP_17127263.1| cupin family protein [Salmonella enterica subsp. houtenae str. ATCC
           BAA-1581]
 gi|379052179|gb|EHY70070.1| cupin family protein [Salmonella enterica subsp. houtenae str. ATCC
           BAA-1581]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 80/192 (41%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ + R   N+ + L       D L    +E + +  +  ++D + Q        
Sbjct: 16  HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDDKWQV-----NH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+   + N
Sbjct: 183 PGFPHEGYALEN 194


>gi|424933998|ref|ZP_18352370.1| Putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KpQ3]
 gi|407808185|gb|EKF79436.1| Putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KpQ3]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|253688251|ref|YP_003017441.1| Cupin 4 family protein [Pectobacterium carotovorum subsp.
           carotovorum PC1]
 gi|251754829|gb|ACT12905.1| Cupin 4 family protein [Pectobacterium carotovorum subsp.
           carotovorum PC1]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 98/219 (44%), Gaps = 38/219 (17%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ------- 53
           ++W+K PL I    K  + N I     D+ L    +E + +  +  +++G+ Q       
Sbjct: 15  NYWQKRPLLI----KGGFTNFIDPISPDE-LAGLALENEVDSRLVSHQNGRWQVSHGPFE 69

Query: 54  TMDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTY 108
           + D  G +    L Q V DH+ E  S         ++P +QL +    +L   F      
Sbjct: 70  SYDHLGESNWSLLVQAV-DHWHEPSS-------ALMQPFRQLPDWRTDDLMISF------ 115

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
             P    G  PH D  + FI+Q  G+++W++     + ++ P    P+  Q E    I+ 
Sbjct: 116 SVP--GGGVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVEPFEAIID 170

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
             LEPGD++Y+P G+ H+  ++ N  SL+ ++      A
Sbjct: 171 EELEPGDIIYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207


>gi|429090415|ref|ZP_19153147.1| FIG002776: hypothetical protein [Cronobacter universalis NCTC 9529]
 gi|426510218|emb|CCK18259.1| FIG002776: hypothetical protein [Cronobacter universalis NCTC 9529]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/198 (21%), Positives = 87/198 (43%), Gaps = 14/198 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ +        
Sbjct: 16  YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHV 198
            G+ H+  ++ N  +  V
Sbjct: 183 PGFPHEGYSLENSMNYSV 200


>gi|330016193|ref|ZP_08308423.1| cupin family protein [Klebsiella sp. MS 92-3]
 gi|328529478|gb|EGF56383.1| cupin family protein [Klebsiella sp. MS 92-3]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|429104140|ref|ZP_19166114.1| FIG002776: hypothetical protein [Cronobacter turicensis 564]
 gi|426290789|emb|CCJ92227.1| FIG002776: hypothetical protein [Cronobacter turicensis 564]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/198 (21%), Positives = 87/198 (43%), Gaps = 14/198 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R  KN+ + L +  ++  +  +N ++ +    +  ++DG+ +        
Sbjct: 16  YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q      ++   LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHV 198
            G+ H+  ++ N  +  V
Sbjct: 183 PGFPHEGYSLENSMNYSV 200


>gi|425091021|ref|ZP_18494106.1| hypothetical protein HMPREF1308_01281 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW5]
 gi|405613178|gb|EKB85926.1| hypothetical protein HMPREF1308_01281 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW5]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + L       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDVLYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|421479034|ref|ZP_15926752.1| cupin domain protein, PF06172 family [Burkholderia multivorans CF2]
 gi|400223580|gb|EJO53869.1| cupin domain protein, PF06172 family [Burkholderia multivorans CF2]
          Length = 382

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/133 (27%), Positives = 64/133 (48%), Gaps = 12/133 (9%)

Query: 102 FVGANTYQTPPTAQG----FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF 157
           F G  T  +   A G    +  H+D  + F +QL+G+K+W VY P   D  L  + S  F
Sbjct: 116 FTGRQTVTSAYIAFGETDSYRAHWDSRDVFAVQLKGRKRWVVYKPTF-DAPLFMHQS-KF 173

Query: 158 SQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLL 212
            ++    P    +   LE GD+ Y+PRG+ H  + +  E ++H+ I  +   A  ++  +
Sbjct: 174 IEDRYPCPAEPHMDFVLEEGDVFYIPRGWWHNVTPL-GEPTVHLAIGTFPAFAVDYVKWV 232

Query: 213 EKAMPKALQAAGA 225
            + +P  + A  A
Sbjct: 233 FRELPDCIYARMA 245


>gi|423686362|ref|ZP_17661170.1| hypothetical protein VFSR5_1673 [Vibrio fischeri SR5]
 gi|371494430|gb|EHN70028.1| hypothetical protein VFSR5_1673 [Vibrio fischeri SR5]
          Length = 382

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 78/195 (40%), Gaps = 26/195 (13%)

Query: 1   DHWEKEPLYISRKQKNYYN--------NLITSKKIDDILRDNVIEFKKNIDITLYEDGQR 52
           ++W+K+P+ I    +N+ +         L     +D     N     K     L ED   
Sbjct: 14  EYWQKKPVIIKGGLENFQDPVTPEELAGLTLENDVDSRFISNANNEWKAEHGPLSEDLYE 73

Query: 53  QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
              +     + Q   +H+ EG +  F       KP +Q+   L +         +Y  P 
Sbjct: 74  TLGETNWSIIVQAA-NHWHEGAAELF-------KPFKQMPNWLFDDIMI-----SYSVP- 119

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
              G  PH D  + FI+Q +GK+ W+V     + EY   +      Q     PI+   LE
Sbjct: 120 -HGGVGPHIDQYDVFIIQGQGKRHWRV---GDIGEYQEEHRHSALKQITGFDPIIDQILE 175

Query: 173 PGDLLYLPRGYIHQA 187
           PGD+LY+P G+ H  
Sbjct: 176 PGDILYIPPGFPHDG 190


>gi|421494322|ref|ZP_15941672.1| hypothetical protein MU9_2843 [Morganella morganii subsp. morganii
           KT]
 gi|455739854|ref|YP_007506120.1| hypothetical protein MU9_2702 [Morganella morganii subsp. morganii
           KT]
 gi|400191489|gb|EJO24635.1| hypothetical protein MU9_2843 [Morganella morganii subsp. morganii
           KT]
 gi|455421417|gb|AGG31747.1| hypothetical protein MU9_2702 [Morganella morganii subsp. morganii
           KT]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 49/212 (23%), Positives = 87/212 (41%), Gaps = 26/212 (12%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKI-----DDILRDNVIEFKKNIDITLYEDGQRQTMD 56
           HW+K PL I    +N+ + +   +       D+I    V  F    D+     G  Q  D
Sbjct: 16  HWQKRPLLIKGGFRNFRDTISPDELAGLAMEDEIDSRIVTRFSGKWDV---RHGPFQEFD 72

Query: 57  VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
                L +  W    +            +KP + L +  + +L        +Y  P    
Sbjct: 73  ----GLGEKDWSILVQAVDHWHFPSAALMKPFRCLSDWRMDDLM------ISYSVP--GG 120

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +G+++W+V     + ++ P    P+  Q +    I+   + PGD
Sbjct: 121 GVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHAPH---PDLLQVDPFDAIIDDEMVPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +LY+P G+ H+   +  E SL+ ++      A
Sbjct: 178 ILYIPPGFPHEGYAI--EESLNYSVGFRAPNA 207


>gi|251788906|ref|YP_003003627.1| Cupin 4 family protein [Dickeya zeae Ech1591]
 gi|247537527|gb|ACT06148.1| Cupin 4 family protein [Dickeya zeae Ech1591]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     +  TPI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFTPIIDEELEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--LTFETALNYSVG 201


>gi|238762219|ref|ZP_04623191.1| hypothetical protein ykris0001_1360 [Yersinia kristensenii ATCC
           33638]
 gi|238699566|gb|EEP92311.1| hypothetical protein ykris0001_1360 [Yersinia kristensenii ATCC
           33638]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH+D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+   +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 178 ILYIPPGFPHEGYALEN 194


>gi|418043322|ref|ZP_12681489.1| cupin family protein [Escherichia coli W26]
 gi|420371391|ref|ZP_14871808.1| cupin superfamily protein [Shigella flexneri 1235-66]
 gi|383473740|gb|EID65752.1| cupin family protein [Escherichia coli W26]
 gi|391319303|gb|EIQ76334.1| cupin superfamily protein [Shigella flexneri 1235-66]
          Length = 352

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 100 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGD 156

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 157 ILYIPPGFPHEGYALEN 173


>gi|416179413|ref|ZP_11610979.1| cupin family protein [Neisseria meningitidis M6190]
 gi|416193015|ref|ZP_11617036.1| cupin family protein [Neisseria meningitidis ES14902]
 gi|325131698|gb|EGC54403.1| cupin family protein [Neisseria meningitidis M6190]
 gi|325137714|gb|EGC60291.1| cupin family protein [Neisseria meningitidis ES14902]
          Length = 309

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/165 (22%), Positives = 78/165 (47%), Gaps = 15/165 (9%)

Query: 60  RALPQIVWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
           R +  ++++H + G S+ +  +N + +      ++  +  L G     + Y    + + +
Sbjct: 7   RFIKSVIYEHMKNGASLVYNHINNEPFS---DHISRQVARLAGAHTIVSGYLAFGSDESY 63

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLEPG 174
             H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE G
Sbjct: 64  KNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILEAG 119

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
           D+LY+PRG+ H+   +  E + H+ +  +      +++ L K  P
Sbjct: 120 DVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 163


>gi|226330187|ref|ZP_03805705.1| hypothetical protein PROPEN_04100 [Proteus penneri ATCC 35198]
 gi|225200982|gb|EEG83336.1| cupin family protein [Proteus penneri ATCC 35198]
          Length = 329

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 5/92 (5%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D+ + FI+Q  G+++W+V     + ++ P    P     +   PI
Sbjct: 70  SYSVP--GGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFCPH---PALLHVDPFEPI 124

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
           + V + PGD+LY+P G+ H   T  +  +  V
Sbjct: 125 IDVEMAPGDILYIPPGFPHDGFTFEDTMNYSV 156


>gi|238762898|ref|ZP_04623866.1| hypothetical protein ykris0001_32090 [Yersinia kristensenii ATCC
           33638]
 gi|238698909|gb|EEP91658.1| hypothetical protein ykris0001_32090 [Yersinia kristensenii ATCC
           33638]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V   + + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKQPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|212712836|ref|ZP_03320964.1| hypothetical protein PROVALCAL_03933 [Providencia alcalifaciens DSM
           30120]
 gi|422017728|ref|ZP_16364291.1| hypothetical protein OO9_03473 [Providencia alcalifaciens Dmel2]
 gi|212684528|gb|EEB44056.1| hypothetical protein PROVALCAL_03933 [Providencia alcalifaciens DSM
           30120]
 gi|414105321|gb|EKT66880.1| hypothetical protein OO9_03473 [Providencia alcalifaciens Dmel2]
          Length = 374

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q EG++ W+V     + ++ P    P+  Q +    I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGEGRRHWRVGEKIPMKQHSPH---PDLLQVQPFDAI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +   +EPGD+LY+P G+ H+   +  E SL+ ++      A
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNFSVGFRAPNA 207


>gi|195941247|ref|ZP_03086629.1| hypothetical protein EscherichcoliO157_33462, partial [Escherichia
           coli O157:H7 str. EC4024]
          Length = 274

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 38  GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGD 94

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 95  ILYIPPGFPHEGYALEN 111


>gi|167525589|ref|XP_001747129.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774424|gb|EDQ88053.1| predicted protein [Monosiga brevicollis MX1]
          Length = 4167

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 22/123 (17%)

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---- 163
           Y + P A+   PH D  + F  QL G K W +      D     ++   ++Q+E      
Sbjct: 180 YLSGPNAKALPPHTDRNDIFSFQLAGTKHWTICTNPPPDT---SWTPAEWAQQEENEASR 236

Query: 164 -------TPILTVTLE-------PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWI 209
                  +PI    ++       PGD +YLP+G +H A T  +E SLH T+      +W+
Sbjct: 237 TEGCRNPSPIEVTKMQCEHHVVHPGDWIYLPKGTVHFAQT-NDEMSLHATVGFRPSFSWL 295

Query: 210 DLL 212
           + L
Sbjct: 296 NRL 298


>gi|429118195|ref|ZP_19179113.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 701]
 gi|426321324|emb|CCK05226.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 701]
          Length = 345

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G+++W+V     + ++ P    P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGAKLPMRQFCPH---PALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   LEPGD+LY+P G+ H     T+E +L+ ++ 
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201


>gi|311279967|ref|YP_003942198.1| transcription factor jumonji jmjC domain-containing protein
           [Enterobacter cloacae SCF1]
 gi|308749162|gb|ADO48914.1| transcription factor jumonji jmjC domain-containing protein
           [Enterobacter cloacae SCF1]
          Length = 373

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 84/192 (43%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ +  IT  ++  +  +N ++ +    +  + DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFNNFVDP-ITPDELAGLAMENEVDSR----LVSHLDGKWQV-----SH 65

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 66  GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMVSFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKAPLKQHCPH---PDLLQVDPFDAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTN 192
            G+ H+  ++ N
Sbjct: 183 PGFPHEGYSLEN 194


>gi|271501440|ref|YP_003334465.1| Cupin 4 family protein [Dickeya dadantii Ech586]
 gi|270344995|gb|ACZ77760.1| Cupin 4 family protein [Dickeya dadantii Ech586]
          Length = 379

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     +  TPI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFTPIIEEELEPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|238752808|ref|ZP_04614275.1| hypothetical protein yrohd0001_5950 [Yersinia rohdei ATCC 43380]
 gi|238708953|gb|EEQ01204.1| hypothetical protein yrohd0001_5950 [Yersinia rohdei ATCC 43380]
          Length = 374

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLSPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|242238851|ref|YP_002987032.1| cupin [Dickeya dadantii Ech703]
 gi|242130908|gb|ACS85210.1| Cupin 4 family protein [Dickeya dadantii Ech703]
          Length = 374

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +  TPI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFTPIIDADLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T++ +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHQTALNYSVG 201


>gi|375119387|ref|ZP_09764554.1| Transcription factor jumonji/aspartyl beta-hydroxylase [Salmonella
           enterica subsp. enterica serovar Dublin str. SD3246]
 gi|417334366|ref|ZP_12117602.1| hypothetical protein LTSEALA_2829 [Salmonella enterica subsp.
           enterica serovar Alachua str. R6-377]
 gi|326623654|gb|EGE29999.1| Transcription factor jumonji/aspartyl beta-hydroxylase [Salmonella
           enterica subsp. enterica serovar Dublin str. SD3246]
 gi|353575328|gb|EHC38094.1| hypothetical protein LTSEALA_2829 [Salmonella enterica subsp.
           enterica serovar Alachua str. R6-377]
          Length = 352

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 100 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGD 156

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 157 ILYIPPGFPHEGYALEN 173


>gi|121634298|ref|YP_974543.1| hypothetical protein NMC0436 [Neisseria meningitidis FAM18]
 gi|120866004|emb|CAM09741.1| hypothetical protein NMC0436 [Neisseria meningitidis FAM18]
          Length = 384

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/165 (22%), Positives = 78/165 (47%), Gaps = 15/165 (9%)

Query: 60  RALPQIVWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
           R +  ++++H + G S+ +  +N + +      ++  +  L G     + Y    + + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINNEPFS---DHISRQVARLAGAHTIVSGYLAFGSDESY 138

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLEPG 174
             H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE G
Sbjct: 139 KNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILEAG 194

Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
           D+LY+PRG+ H+   +  E + H+ +  +      +++ L K  P
Sbjct: 195 DVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 238


>gi|445338422|ref|ZP_21416088.1| hypothetical protein SEE18569_023278, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 18569]
 gi|444872804|gb|ELX97120.1| hypothetical protein SEE18569_023278, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 18569]
          Length = 330

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 78  GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGD 134

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 135 ILYIPPGFPHEGYALEN 151


>gi|449306517|ref|YP_007443057.1| hypothetical protein CSSP291_20968 [Cronobacter sakazakii SP291]
 gi|449100674|gb|AGE88707.1| hypothetical protein CSSP291_20968 [Cronobacter sakazakii SP291]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G+++W+V     + ++ P    P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGAKLPMRQFCPH---PALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   LEPGD+LY+P G+ H     T+E +L+ ++ 
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201


>gi|238791619|ref|ZP_04635257.1| hypothetical protein yinte0001_27760 [Yersinia intermedia ATCC
           29909]
 gi|238729235|gb|EEQ20751.1| hypothetical protein yinte0001_27760 [Yersinia intermedia ATCC
           29909]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|429107487|ref|ZP_19169356.1| FIG00553994: hypothetical protein [Cronobacter malonaticus 681]
 gi|426294210|emb|CCJ95469.1| FIG00553994: hypothetical protein [Cronobacter malonaticus 681]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   LEPGD+LY+P G++H     T+E +L+ ++ 
Sbjct: 169 IDEDLEPGDILYIPPGFLHDG--FTHETALNYSVG 201


>gi|251789544|ref|YP_003004265.1| Cupin 4 family protein [Dickeya zeae Ech1591]
 gi|247538165|gb|ACT06786.1| Cupin 4 family protein [Dickeya zeae Ech1591]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   L PGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVEPFEAIIDEELAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +LY+P G+ H+  ++ N  SL+ ++     +A
Sbjct: 178 ILYIPPGFPHEGYSLEN--SLNYSVGFRAPSA 207


>gi|383813869|ref|ZP_09969292.1| transcription factor jumonji jmjC domain-containing protein
           [Serratia sp. M24T3]
 gi|383297067|gb|EIC85378.1| transcription factor jumonji jmjC domain-containing protein
           [Serratia sp. M24T3]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 85/194 (43%), Gaps = 18/194 (9%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNV--IEFKKNIDITLYEDGQRQTMDVEG 59
           +W+K P+ + R  KN+         ID I  D +  +  +  +D  L    Q +  DV  
Sbjct: 16  YWQKRPVLLKRGFKNF---------IDPISPDELAGLAMENEVDSRLVSHDQGR-WDV-- 63

Query: 60  RALPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
           +  P   +DH  E   SI       + +P  +L    +++    +            G  
Sbjct: 64  KHGPFESYDHLGENNWSILVQAVDHWHEPSARLMRPFRQIPDWRMDDLMISFSVPGGGVG 123

Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
           PH D  + FI+Q  G+++W+V     + ++ P    P+  Q E    I+   +EPGD++Y
Sbjct: 124 PHLDQYDVFIIQGTGRRRWRVGEKIPMKQHTPH---PDLLQVEPFDAIIDEEMEPGDIIY 180

Query: 179 LPRGYIHQASTVTN 192
           +P G+ H+  ++ N
Sbjct: 181 IPPGFPHEGYSLEN 194


>gi|410636856|ref|ZP_11347447.1| hypothetical protein GLIP_2024 [Glaciecola lipolytica E3]
 gi|410143662|dbj|GAC14652.1| hypothetical protein GLIP_2024 [Glaciecola lipolytica E3]
          Length = 384

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 37/72 (51%), Gaps = 3/72 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V   +  D   P   + N  Q     PIL   LEPGD
Sbjct: 121 GVGPHVDQYDVFIIQGRGTRRWQVGENKHHDTIQP---AKNLRQISGFDPILDEILEPGD 177

Query: 176 LLYLPRGYIHQA 187
           L+Y+P G+ H  
Sbjct: 178 LIYIPPGFPHNG 189


>gi|213161977|ref|ZP_03347687.1| hypothetical protein Salmoneentericaenterica_19042 [Salmonella
           enterica subsp. enterica serovar Typhi str. E00-7866]
          Length = 272

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 20  GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGD 76

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 77  ILYIPPGFPHEGYALEN 93


>gi|323452841|gb|EGB08714.1| hypothetical protein AURANDRAFT_63985 [Aureococcus anophagefferens]
          Length = 4512

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 42/137 (30%), Positives = 59/137 (43%), Gaps = 15/137 (10%)

Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEI 162
           A+ Y + P A   A H D  +  + Q+ G K W V+    V ++     Y +   S  E+
Sbjct: 180 AHVYVSAPGAAALANHTDLNDVVVFQIAGAKAWHVH---AVAKHGTCATYDASEMSGLEV 236

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
                   L PGDLL+LP+G +H A     E S H+T+ +    A       A+      
Sbjct: 237 CDDF---ELGPGDLLFLPQGAVHSARATEAEGSTHLTLGLAGTRA-------AVCDEQAP 286

Query: 223 AGATDLEFRRGLPIGYL 239
           AG    EFRR L  G L
Sbjct: 287 AGDLMAEFRRQLDEGDL 303


>gi|37526699|ref|NP_930043.1| hypothetical protein plu2809 [Photorhabdus luminescens subsp.
           laumondii TTO1]
 gi|36786131|emb|CAE15183.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii
           TTO1]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 49/89 (55%), Gaps = 11/89 (12%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP---RYSSPNFSQEEIGTPILTVTLE 172
           G  PH D  + FI+Q  G+++W+V       E LP   R   P+  Q +  + ++   +E
Sbjct: 121 GVGPHLDQYDVFIIQGMGRRRWRV------GEKLPMEQRCPHPDLLQVDPFSAVIDEEME 174

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           PGD+LY+P G+ H+  ++  E SL+ ++ 
Sbjct: 175 PGDILYIPPGFPHEGYSI--EDSLNYSVG 201


>gi|336248485|ref|YP_004592195.1| cupin [Enterobacter aerogenes KCTC 2190]
 gi|334734541|gb|AEG96916.1| cupin [Enterobacter aerogenes KCTC 2190]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    VT+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201


>gi|261821424|ref|YP_003259530.1| cupin [Pectobacterium wasabiae WPP163]
 gi|261605437|gb|ACX87923.1| Cupin 4 family protein [Pectobacterium wasabiae WPP163]
 gi|385871663|gb|AFI90183.1| Cupin 4 family protein [Pectobacterium sp. SCC3193]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 98/219 (44%), Gaps = 38/219 (17%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ------- 53
           ++W+K PL I    K  + N I     D+ L    +E + +  +  +++G+ Q       
Sbjct: 15  NYWQKRPLLI----KGGFTNFIDPISPDE-LAGLALENEVDSRLVSHQNGRWQVSHGPFE 69

Query: 54  TMDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTY 108
           + D  G +    L Q V DH+ E  S         ++P +QL +    +L   F      
Sbjct: 70  SYDHLGESNWSLLVQAV-DHWHEPSS-------ALMQPFRQLPDWRTDDLMISF------ 115

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
             P    G  PH D  + FI+Q  G+++W++     + ++ P    P+  Q +    I+ 
Sbjct: 116 SVP--GGGVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFEAIID 170

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
             LEPGD++Y+P G+ H+  ++ N  SL+ ++      A
Sbjct: 171 AELEPGDIVYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207


>gi|82702272|ref|YP_411838.1| hypothetical protein Nmul_A1143 [Nitrosospira multiformis ATCC
           25196]
 gi|82410337|gb|ABB74446.1| Cupin region [Nitrosospira multiformis ATCC 25196]
          Length = 415

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 50/219 (22%), Positives = 88/219 (40%), Gaps = 33/219 (15%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV--- 57
           DHW+K+PL I RK    ++ L+ + ++              ID+   ED Q + +     
Sbjct: 43  DHWQKKPLLI-RKALPDFSGLLDANEL--------------IDLACQEDAQSRLVTRRNG 87

Query: 58  -----EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
                 G   P+      ++G ++   +   ++   ++L      +    +        P
Sbjct: 88  RWEVRHGPFAPRAFARLPQKGWTLLVQDVNHFLPAARELLLKFNFIPHSRLDDLMVSYAP 147

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVY--LPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
              G  PH+D  + F+LQ  G+++W++     R +    P     +F  E+         
Sbjct: 148 EDGGVGPHFDSYDVFLLQGTGRRRWRISGQKDRTLVAAAPLKILQDFRPEQ------EWV 201

Query: 171 LEPGDLLYLPRGYIHQASTV--TNEHSLHVTISVYQKTA 207
           LEPGD+LYLP GY H    V     +S+      YQ+ A
Sbjct: 202 LEPGDMLYLPPGYAHDGVAVEPCMTYSIGFRAPTYQELA 240


>gi|444353388|ref|YP_007389532.1| FIG00732091: hypothetical protein [Enterobacter aerogenes EA1509E]
 gi|443904218|emb|CCG31992.1| FIG00732091: hypothetical protein [Enterobacter aerogenes EA1509E]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    VT+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201


>gi|307131193|ref|YP_003883209.1| hypothetical protein Dda3937_02090 [Dickeya dadantii 3937]
 gi|306528722|gb|ADM98652.1| hypothetical protein Dda3937_02090 [Dickeya dadantii 3937]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 124/288 (43%), Gaps = 55/288 (19%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFK------KNIDITLYEDGQRQTM 55
           HW+K+P+ I    +++ +  IT  ++  +  +N ++ +         D++    G  ++ 
Sbjct: 16  HWQKKPVIIKGGFRHFIDP-ITPDELAGLAMENEVDSRLVSHQGGRWDVS---HGPFESY 71

Query: 56  DVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
           D  G +    L Q V DH+ E  +         ++P +QL +  + +L   F        
Sbjct: 72  DHLGESNWSLLVQAV-DHWHEPSA-------ALMRPFRQLPDWRIDDLMISF------SV 117

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+   
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVAPFDAIIDEE 172

Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEF 230
           LEPGD+LY+P  + H+  ++ N  SL+ ++             +A       +G  D   
Sbjct: 173 LEPGDILYIPPSFPHEGYSLEN--SLNYSVGF-----------RAPSSRELVSGFADYVL 219

Query: 231 RRGLPIGYLRYAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVD 275
            R L  G  RY+     AR  P DI    L   + L+ ++  LV++ D
Sbjct: 220 SREL--GGQRYSDPDIPARPHPADILPQEL---DKLQHMMLDLVQHKD 262


>gi|242239497|ref|YP_002987678.1| cupin [Dickeya dadantii Ech703]
 gi|242131554|gb|ACS85856.1| Cupin 4 family protein [Dickeya dadantii Ech703]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 86/198 (43%), Gaps = 14/198 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           HW+K P+ I +    ++ + IT  ++  +  +N ++ +    +  ++DG+    DV    
Sbjct: 16  HWQKSPVII-KGGFTHFIDPITPDELAGLAMENEVDSR----LVSHQDGR---WDVSHG- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFDSYDHLGETNWSLLVQAVHHWHEPSAALMRPFRHLPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   L PGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGAQVPMKQHCPH---PDLLQVDPFEAIIDEALSPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHV 198
             + H+  ++ N  S  V
Sbjct: 183 PAFPHEGYSLENSMSFSV 200


>gi|407788798|ref|ZP_11135902.1| hypothetical protein B3C1_00890 [Gallaecimonas xiamenensis 3-C-1]
 gi|407208042|gb|EKE77972.1| hypothetical protein B3C1_00890 [Gallaecimonas xiamenensis 3-C-1]
          Length = 373

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 4/77 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ-EEIGTPILTVTLEPG 174
           G  PH D  + FI+Q EGK++W+V   + +    P    P+  Q E   T ++   + PG
Sbjct: 120 GVGPHVDQYDVFIIQGEGKRRWRVGDKQPLKSKTPH---PSLLQVEPFETALIDAEMAPG 176

Query: 175 DLLYLPRGYIHQASTVT 191
           D+LY+P G+ H+   +T
Sbjct: 177 DILYIPPGFPHEGYAIT 193


>gi|410087354|ref|ZP_11284058.1| hypothetical protein C790_1453 [Morganella morganii SC01]
 gi|409766089|gb|EKN50185.1| hypothetical protein C790_1453 [Morganella morganii SC01]
          Length = 373

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 90/216 (41%), Gaps = 34/216 (15%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKI-----DDILRDNVIEFKKNIDITLYEDGQRQTMD 56
           HW+K PL I    +N+ + +   +       D+I    V  F    D+     G  Q  D
Sbjct: 16  HWQKRPLLIKGGFRNFRDTISPDELAGLAMEDEIDSRIVTRFSGKWDV---RHGPFQEFD 72

Query: 57  VEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTP 111
             G      L Q V DH+              +KP + L +  + +L        +Y  P
Sbjct: 73  RLGEKDWSILVQAV-DHWH-------FPSAALMKPFRCLSDWRMDDLM------ISYSVP 118

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
               G  PH D  + FI+Q +G+++W+V     + ++ P    P+  Q +    I+   +
Sbjct: 119 --GGGVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHAPH---PDLLQVDPFDAIIDDEM 173

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
            PGD+LY+P G+ H+   +  E SL+ ++      A
Sbjct: 174 VPGDILYIPPGFPHEGYAI--EESLNYSVGFRAPNA 207


>gi|387129600|ref|YP_006292490.1| hypothetical protein Q7C_632 [Methylophaga sp. JAM7]
 gi|386270889|gb|AFJ01803.1| hypothetical protein Q7C_632 [Methylophaga sp. JAM7]
          Length = 391

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 57/119 (47%), Gaps = 12/119 (10%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-EYL---PRYSSPNFSQEEI 162
           +Y  P  + G   H D+ + F+LQ EG ++W++    + D E+L   P    P+FS    
Sbjct: 121 SYAVPGGSVG--AHVDNYDVFLLQAEGVRQWQIEDQPLTDPEWLAGCPLRILPSFS---- 174

Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
             P  +  L PGD+LYLP G  H     T   +L V       + W++ L + + ++ Q
Sbjct: 175 --PCQSWELHPGDMLYLPPGIAHHGVAKTPCLTLSVGFRAASASQWLESLSEQLSQSPQ 231


>gi|365849052|ref|ZP_09389523.1| cupin family protein [Yokenella regensburgei ATCC 43003]
 gi|364569696|gb|EHM47318.1| cupin family protein [Yokenella regensburgei ATCC 43003]
          Length = 373

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 94/228 (41%), Gaps = 39/228 (17%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKIPMKQHCPH---PDLLQVDPFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
           +LY+P G+ H+  ++ N  +  V          I             +G  D   +R L 
Sbjct: 178 ILYIPPGFPHEGYSLENSLNYSVGFRAPNGRELI-------------SGFADYVLQREL- 223

Query: 236 IGYLRYAG---LARGKPVDI-QADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDA 291
            G LRY      +R  P DI   +R  ++  + DL+ +   +   D  + E   Q  H+ 
Sbjct: 224 -GSLRYTDPDVPSREHPADILPVERDRLRAMMVDLMNQPEHF---DQWLGEFITQSRHEL 279

Query: 292 LPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
               +SP E             Q  E+Y+A Q   D  VRL     VR
Sbjct: 280 ---DISPPEPPY----------QPDEIYDALQ-QGDKLVRLGGLRVVR 313


>gi|348029548|ref|YP_004872234.1| enzyme with RmlC-like domain [Glaciecola nitratireducens FR1064]
 gi|347946891|gb|AEP30241.1| putative enzyme with RmlC-like domain [Glaciecola nitratireducens
           FR1064]
          Length = 384

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D+ + FI+Q +G ++W+V LP    E +P    P   Q +  +PI+   L PGD
Sbjct: 120 GVGPHSDEYDVFIIQGKGSRRWQVGLPGNYTESVPH---PLLKQIDGFSPIIDEILLPGD 176

Query: 176 LLYLPRGYIHQASTVTN 192
           ++Y+P  + H    +++
Sbjct: 177 VVYIPPKHPHNGVALSD 193


>gi|152969688|ref|YP_001334797.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae MGH 78578]
 gi|419762761|ref|ZP_14289007.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae DSM 30104]
 gi|150954537|gb|ABR76567.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae MGH 78578]
 gi|397744256|gb|EJK91468.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae DSM 30104]
          Length = 373

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 46/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K P+ + R   N+ + +       D L    +E + +  +  ++DG+ Q        
Sbjct: 16  YWQKRPVVLKRGFANFIDPISP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66

Query: 62  LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       + +P   L    + L    +            G  PH
Sbjct: 67  -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
            G+ H+  ++ N  SL+ ++      A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207


>gi|423195891|ref|ZP_17182474.1| hypothetical protein HMPREF1171_00506 [Aeromonas hydrophila SSU]
 gi|404632692|gb|EKB29294.1| hypothetical protein HMPREF1171_00506 [Aeromonas hydrophila SSU]
          Length = 376

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI Q +GK+ W+V   + ++E+    +       E    I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           + V +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200


>gi|421079716|ref|ZP_15540654.1| Protein ycfD [Pectobacterium wasabiae CFBP 3304]
 gi|401705802|gb|EJS95987.1| Protein ycfD [Pectobacterium wasabiae CFBP 3304]
          Length = 373

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W++     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFEAIIDAELEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           ++Y+P G+ H+  ++ N  SL+ ++      A
Sbjct: 178 IVYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207


>gi|117618540|ref|YP_855951.1| cupin superfamily protein [Aeromonas hydrophila subsp. hydrophila
           ATCC 7966]
 gi|117559947|gb|ABK36895.1| cupin superfamily protein [Aeromonas hydrophila subsp. hydrophila
           ATCC 7966]
          Length = 376

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI Q +GK+ W+V   + ++E+    +       E    I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           + V +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200


>gi|120598908|ref|YP_963482.1| cupin 4 family protein [Shewanella sp. W3-18-1]
 gi|146293006|ref|YP_001183430.1| cupin 4 family protein [Shewanella putrefaciens CN-32]
 gi|120559001|gb|ABM24928.1| Cupin 4 family protein [Shewanella sp. W3-18-1]
 gi|145564696|gb|ABP75631.1| Cupin 4 family protein [Shewanella putrefaciens CN-32]
          Length = 386

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 7/101 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y TP    G  PH D  + FI Q  G+++W+V       E+    + P     E   PI
Sbjct: 115 SYATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHREFA---AHPALLHTEAFEPI 169

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +   L PGD+LY+P G+ H    +T E SL  ++     +A
Sbjct: 170 IDTELLPGDILYIPPGFPHDG--ITLEESLSFSVGYRTASA 208


>gi|323497695|ref|ZP_08102711.1| hypothetical protein VISI1226_13616 [Vibrio sinaloensis DSM 21326]
 gi|323317278|gb|EGA70273.1| hypothetical protein VISI1226_13616 [Vibrio sinaloensis DSM 21326]
          Length = 376

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 3/76 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
           G  PH D  + FI+Q  GK++W+V  L R   +Y     +    Q E    I+   LEPG
Sbjct: 122 GVGPHIDQYDVFIIQGSGKRRWRVGDLDR--GQYQESIQASALRQIEGFDAIIDEILEPG 179

Query: 175 DLLYLPRGYIHQASTV 190
           D+LY+P G+ H+ +T+
Sbjct: 180 DILYIPPGFPHEGNTL 195


>gi|423106188|ref|ZP_17093889.1| hypothetical protein HMPREF9686_04793 [Klebsiella oxytoca 10-5242]
 gi|376378396|gb|EHS91156.1| hypothetical protein HMPREF9686_04793 [Klebsiella oxytoca 10-5242]
          Length = 374

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|334703818|ref|ZP_08519684.1| cupin superfamily protein [Aeromonas caviae Ae398]
          Length = 376

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI Q +GK+ W+V   + ++E+    +       E    I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           + V +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200


>gi|254670287|emb|CBA05598.1| myc induced nuclear antigen [Neisseria meningitidis alpha153]
          Length = 309

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 77/167 (46%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 7   RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHISRQVARFAGAHTIVSGYLAFGSDE 61

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 62  SYKNHWDTRDVYAIQLFGKKRWQLTAP---DVPMPLYMQQT-KDTDISIPEHIDMDIILE 117

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H+ +  +      +++ L K  P
Sbjct: 118 AGDVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 163


>gi|423127147|ref|ZP_17114826.1| hypothetical protein HMPREF9694_03838 [Klebsiella oxytoca 10-5250]
 gi|376396006|gb|EHT08651.1| hypothetical protein HMPREF9694_03838 [Klebsiella oxytoca 10-5250]
          Length = 374

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/204 (22%), Positives = 89/204 (43%), Gaps = 22/204 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
           +W+K+P+ + +   N+ + +       D L    +E + +  +   ++GQ Q  +     
Sbjct: 16  YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70

Query: 61  --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
              L +  W    +  +   +     ++P + L +  L +L   F        P    G 
Sbjct: 71  FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGDIL 179

Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
           Y+P G+ H    +T+E +L+ ++ 
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201


>gi|421724900|ref|ZP_16164104.1| hypothetical protein KOXM_04676 [Klebsiella oxytoca M5al]
 gi|410374272|gb|EKP28949.1| hypothetical protein KOXM_04676 [Klebsiella oxytoca M5al]
          Length = 374

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/204 (22%), Positives = 89/204 (43%), Gaps = 22/204 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
           +W+K+P+ + +   N+ + +       D L    +E + +  +   ++GQ Q  +     
Sbjct: 16  YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70

Query: 61  --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
              L +  W    +  +   +     ++P + L +  L +L   F        P    G 
Sbjct: 71  FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGDIL 179

Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
           Y+P G+ H    +T+E +L+ ++ 
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201


>gi|423117695|ref|ZP_17105386.1| hypothetical protein HMPREF9689_05443 [Klebsiella oxytoca 10-5245]
 gi|376375825|gb|EHS88611.1| hypothetical protein HMPREF9689_05443 [Klebsiella oxytoca 10-5245]
          Length = 374

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/204 (22%), Positives = 89/204 (43%), Gaps = 22/204 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
           +W+K+P+ + +   N+ + +       D L    +E + +  +   ++GQ Q  +     
Sbjct: 16  YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70

Query: 61  --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
              L +  W    +  +   +     ++P + L +  L +L   F        P    G 
Sbjct: 71  FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGDIL 179

Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
           Y+P G+ H    +T+E +L+ ++ 
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201


>gi|403377316|gb|EJY88650.1| hypothetical protein OXYTRI_00132 [Oxytricha trifallax]
          Length = 624

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 53/219 (24%), Positives = 93/219 (42%), Gaps = 39/219 (17%)

Query: 19  NNLITSKKIDDILRDNV----IEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGC 74
           +N+I+ K IDD L+  V    I++   I++  Y +G++ +M++        + D  +   
Sbjct: 173 DNIISKKMIDDYLKMQVPQGNIKYSDQIELFKYVNGKKFSMNLGESVDSDYLEDQLQNQS 232

Query: 75  SIRFLN-PQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEG 133
           +I  LN  Q +   L ++ +   ELF   +         T Q F  HY         L  
Sbjct: 233 TILQLNKAQEFSISLAKVMSLSDELFASDI---------TCQVFLQHY---------LTQ 274

Query: 134 KKKWKVYLPRM----VDEYLPRYSSP----NFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
            ++ K  L  M     D+Y+           F+ ++  TP+  V L+ GD  Y+PR   +
Sbjct: 275 DRQKKNQLSEMQQNEFDQYIFMIDGSVGLRFFNNQDEKTPVFKVKLQKGDFAYIPRNVKY 334

Query: 186 QASTVTNE--------HSLHVTISVYQKTAWIDLLEKAM 216
           Q  T   E        + L++T+S   +  W  L++K M
Sbjct: 335 QTKTEIVEDDQTSEQNYCLYITLSHQSQQTWFSLIQKQM 373


>gi|50121374|ref|YP_050541.1| hypothetical protein ECA2447 [Pectobacterium atrosepticum SCRI1043]
 gi|49611900|emb|CAG75349.1| conserved hypothetical protein [Pectobacterium atrosepticum
           SCRI1043]
          Length = 373

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 98/219 (44%), Gaps = 38/219 (17%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ------- 53
           ++W+K PL I    K  + + I     D+ L    +E + +  +  ++DG+ Q       
Sbjct: 15  NYWQKRPLLI----KGGFTHFIDPISPDE-LAGLALENEVDSRLVSHQDGRWQVSHGPFE 69

Query: 54  TMDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTY 108
           + D  G +    L Q V DH+ E  S         ++P +QL +    +L   F      
Sbjct: 70  SYDHLGESNWSLLVQAV-DHWHEPSS-------ALMQPFRQLPDWRTDDLMISF------ 115

Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
             P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+ 
Sbjct: 116 SVP--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKIPMKQHCPH---PDLLQVDPFEAIID 170

Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
             LEPGD++Y+P G+ H+  ++ N  SL+ ++      A
Sbjct: 171 EELEPGDIVYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207


>gi|402844686|ref|ZP_10893037.1| cupin domain protein, PF06172 family [Klebsiella sp. OBRC7]
 gi|402273419|gb|EJU22620.1| cupin domain protein, PF06172 family [Klebsiella sp. OBRC7]
          Length = 374

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|329119755|ref|ZP_08248434.1| cupin family protein [Neisseria bacilliformis ATCC BAA-1200]
 gi|327464150|gb|EGF10456.1| cupin family protein [Neisseria bacilliformis ATCC BAA-1200]
          Length = 426

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 42/174 (24%), Positives = 80/174 (45%), Gaps = 24/174 (13%)

Query: 56  DVEGRALPQIVWDHYREGCSIRF--LNPQTYIKPL-QQLNA-----SLQELFGCFVGANT 107
           ++  R +  ++++H R G S+ +  +N + +   + +Q+ A     ++   + CF    +
Sbjct: 116 NIRHRFIKPVIYEHMRNGASLIYNHINNEPFSDGIARQIGALAQAHTVTSAYLCFGSDES 175

Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFSQEEIGTP 165
           Y+          H+D  + + +QL GKK+W +  P       P Y   + + +  E  T 
Sbjct: 176 YKN---------HWDTRDVYAVQLVGKKRWYLSAPNFPQ---PLYMQQTKDLNVTEPDTV 223

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKA 219
              + LE GD+LY+PRG+ H    +  E S H+ +  +    + D LE  M K 
Sbjct: 224 DYDIVLEAGDVLYIPRGWWHNPIPMGCE-SFHLAVGTFPPNGY-DYLEWLMWKG 275


>gi|421497434|ref|ZP_15944603.1| hypothetical protein B224_001958 [Aeromonas media WS]
 gi|407183554|gb|EKE57442.1| hypothetical protein B224_001958 [Aeromonas media WS]
          Length = 376

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 52/95 (54%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI Q +GK+ W+V   + ++E+    +  +    E    I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAALLHCEPFEA---I 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           + V +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200


>gi|109898698|ref|YP_661953.1| cupin 4 [Pseudoalteromonas atlantica T6c]
 gi|109700979|gb|ABG40899.1| Cupin 4 [Pseudoalteromonas atlantica T6c]
          Length = 389

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 3/75 (4%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
           T  G  PH D  + FI+Q +G ++W+        EY   +  P+ SQ +  TPI+   L 
Sbjct: 118 TGAGVGPHLDQYDVFIIQGKGSRRWQA---GKRGEYSTYHPHPDLSQIQGFTPIIDEVLH 174

Query: 173 PGDLLYLPRGYIHQA 187
            GD+LY+P G  H  
Sbjct: 175 SGDMLYIPAGCPHNG 189


>gi|385870105|gb|AFI88625.1| Cupin 4 family protein [Pectobacterium sp. SCC3193]
          Length = 373

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 49/201 (24%), Positives = 88/201 (43%), Gaps = 16/201 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K+P+ +    KN + N +     D+ L    +E + +  +  ++DGQ Q  +     
Sbjct: 16  YWQKKPVVL----KNAFPNFVDPITPDE-LAGLAMEAEVDSRLVSHKDGQWQASNG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E G S+       +  P  +L    + L    +            G  PH
Sbjct: 67  -PFEHFDHLGETGWSLLAQAVNHWHAPSAELVRPFRVLPDWRLDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G ++W+V     + ++    S P     +   PI+   LEPGD+LY+P
Sbjct: 126 IDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTIS 201
            G+ H     T+E +L+ ++ 
Sbjct: 183 PGFPHDG--FTHETALNYSVG 201


>gi|423111729|ref|ZP_17099423.1| hypothetical protein HMPREF9687_04974 [Klebsiella oxytoca 10-5243]
 gi|376375827|gb|EHS88612.1| hypothetical protein HMPREF9687_04974 [Klebsiella oxytoca 10-5243]
          Length = 374

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|261819811|ref|YP_003257917.1| cupin [Pectobacterium wasabiae WPP163]
 gi|261603824|gb|ACX86310.1| Cupin 4 family protein [Pectobacterium wasabiae WPP163]
          Length = 373

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 49/201 (24%), Positives = 88/201 (43%), Gaps = 16/201 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K+P+ +    KN + N +     D+ L    +E + +  +  ++DGQ Q  +     
Sbjct: 16  YWQKKPVVL----KNAFPNFVDPITPDE-LAGLAMEAEVDSRLVSHKDGQWQASNG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E G S+       +  P  +L    + L    +            G  PH
Sbjct: 67  -PFEHFDHLGETGWSLLAQAVNHWHAPSAELVRPFRVLPDWRLDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G ++W+V     + ++    S P     +   PI+   LEPGD+LY+P
Sbjct: 126 IDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGDILYIP 182

Query: 181 RGYIHQASTVTNEHSLHVTIS 201
            G+ H     T+E +L+ ++ 
Sbjct: 183 PGFPHDG--FTHETALNYSVG 201


>gi|195467841|ref|XP_002076034.1| GK10507 [Drosophila willistoni]
 gi|194172119|gb|EDW87020.1| GK10507 [Drosophila willistoni]
          Length = 158

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 39/64 (60%)

Query: 276 LDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRA 335
           +D  VD++ K+  H+ALPP + PEE   +VF +       G+     ++T+DT +RL+RA
Sbjct: 13  IDAAVDQLAKRYQHEALPPTILPEEKLRTVFGSRSATDAHGKCLCDYELTEDTSIRLLRA 72

Query: 336 NAVR 339
           N +R
Sbjct: 73  NILR 76


>gi|83645143|ref|YP_433578.1| hypothetical protein HCH_02330 [Hahella chejuensis KCTC 2396]
 gi|83633186|gb|ABC29153.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
          Length = 405

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 42/93 (45%), Gaps = 14/93 (15%)

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTP 165
           P      PH+D  + F++Q  G ++W++  P   D+  PR          NF Q E    
Sbjct: 122 PPGGSVGPHFDHYDVFLIQATGARRWRIGPP--CDDQSPRVDGTPLRILQNFEQTE---- 175

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
                LEPGD LYLP GY H     T+  +L V
Sbjct: 176 --EWVLEPGDALYLPPGYAHYGVAETSCITLSV 206


>gi|424802359|ref|ZP_18227901.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 696]
 gi|423238080|emb|CCK09771.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 696]
          Length = 373

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 7/96 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G+++W+V     + ++ P    P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
           +   LEPGD+LY+P G+ H     T+E +L+ ++  
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVGF 202


>gi|59712199|ref|YP_204975.1| hypothetical protein VF_1592 [Vibrio fischeri ES114]
 gi|59480300|gb|AAW86087.1| conserved protein [Vibrio fischeri ES114]
          Length = 394

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 3/72 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +GK+ W+V     + EY   +      Q     PI+   LEPGD
Sbjct: 134 GVGPHIDQYDVFIIQGQGKRHWRV---GDIGEYQEEHRHSALKQITGFEPIIDQILEPGD 190

Query: 176 LLYLPRGYIHQA 187
           +LY+P G+ H  
Sbjct: 191 ILYIPPGFPHDG 202


>gi|385329034|ref|YP_005883337.1| hypothetical protein NMBB_2028 [Neisseria meningitidis alpha710]
 gi|385341362|ref|YP_005895233.1| cupin family protein [Neisseria meningitidis M01-240149]
 gi|385856660|ref|YP_005903172.1| cupin family protein [Neisseria meningitidis NZ-05/33]
 gi|416174072|ref|ZP_11609101.1| cupin family protein [Neisseria meningitidis OX99.30304]
 gi|416188964|ref|ZP_11615017.1| cupin family protein [Neisseria meningitidis M0579]
 gi|433465770|ref|ZP_20423241.1| cupin-like domain protein [Neisseria meningitidis NM422]
 gi|433491143|ref|ZP_20448256.1| cupin-like domain protein [Neisseria meningitidis NM418]
 gi|308389886|gb|ADO32206.1| hypothetical protein NMBB_2028 [Neisseria meningitidis alpha710]
 gi|325129586|gb|EGC52408.1| cupin family protein [Neisseria meningitidis OX99.30304]
 gi|325135654|gb|EGC58270.1| cupin family protein [Neisseria meningitidis M0579]
 gi|325201568|gb|ADY97022.1| cupin family protein [Neisseria meningitidis M01-240149]
 gi|325207549|gb|ADZ03001.1| cupin family protein [Neisseria meningitidis NZ-05/33]
 gi|432200866|gb|ELK56954.1| cupin-like domain protein [Neisseria meningitidis NM422]
 gi|432225702|gb|ELK81442.1| cupin-like domain protein [Neisseria meningitidis NM418]
          Length = 384

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 77/167 (46%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHISRQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H+ +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 238


>gi|377579221|ref|ZP_09808192.1| hypothetical protein YcfD [Escherichia hermannii NBRC 105704]
 gi|377539505|dbj|GAB53357.1| hypothetical protein YcfD [Escherichia hermannii NBRC 105704]
          Length = 373

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 47/204 (23%), Positives = 91/204 (44%), Gaps = 22/204 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD---VE 58
           +W+K+P+ +    KN + N +     D+ L    +E + +  +  +++GQ Q  +    +
Sbjct: 16  YWQKQPVVL----KNAFPNFVDPITPDE-LAGLAMEPEVDSRLVSHKNGQWQAANGPFED 70

Query: 59  GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
              L +  W    +  +   +     ++P + L +  L +L   F        P    G 
Sbjct: 71  FDNLGETGWSLLAQAVNHWHMPSAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGDIL 179

Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
           Y+P G+ H     T+E +L+ ++ 
Sbjct: 180 YIPPGFPHDG--FTHETALNYSVG 201


>gi|157368749|ref|YP_001476738.1| cupin 4 family protein [Serratia proteamaculans 568]
 gi|157320513|gb|ABV39610.1| Cupin 4 family protein [Serratia proteamaculans 568]
          Length = 373

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D+ + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDNYDVFIIQGMGSRRWRVGDALPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|444912786|ref|ZP_21232946.1| hypothetical protein D187_04882 [Cystobacter fuscus DSM 2262]
 gi|444716710|gb|ELW57553.1| hypothetical protein D187_04882 [Cystobacter fuscus DSM 2262]
          Length = 441

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 52/200 (26%), Positives = 82/200 (41%), Gaps = 35/200 (17%)

Query: 89  QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
           +Q+N +    F C++        P  +G   HYD   A I+Q+ G K+W     R+    
Sbjct: 122 RQMNLATPVRFNCYLS-------PDGEGLDTHYDARHATIIQISGSKRW--LYSRLPATN 172

Query: 149 LP------------RYSSPNFSQEEIGTP------ILTVTLEPGDLLYLPRGYIHQASTV 190
            P            R+   N  ++++  P         V LEPGDLLYLP G  H A   
Sbjct: 173 YPLANALVQKDGQIRHGEHNNRRQKLDVPTPDESQFEEVLLEPGDLLYLPPGTWHNAKAS 232

Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
               +L++          I +L+   P+ L+   +  LE+R  LP   L         P+
Sbjct: 233 GESLALNMAFET------IGMLQVLFPE-LERMLSEKLEWRAILPATPLEQTRPGE-VPL 284

Query: 251 DIQADRLAMKENLKDLLAKL 270
           +++    A    L+D+L KL
Sbjct: 285 EVEQFITARLGELRDMLGKL 304


>gi|409407054|ref|ZP_11255505.1| hypothetical protein GWL_26590 [Herbaspirillum sp. GW103]
 gi|386432805|gb|EIJ45631.1| hypothetical protein GWL_26590 [Herbaspirillum sp. GW103]
          Length = 378

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 89/212 (41%), Gaps = 25/212 (11%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W K+PL I R     +   +  +++ D+ R   +E +    IT   DG         +
Sbjct: 21  DYWHKKPLLI-RNALPGFTAPLPRRELFDLARREEVESRL---ITHRSDGWDMQHGPFNK 76

Query: 61  ALP--QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
             P  Q  W    +G ++        ++  + L +A L +L        +Y T     G 
Sbjct: 77  LPPLKQKAWTLLVQGVNLHHPAADALLRQFRFLPDARLDDLM------ISYAT--DGGGV 128

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
            PH+D  + F+LQ  G+++W++       ++E +P     +F  E+         LEPGD
Sbjct: 129 GPHFDSYDVFLLQAHGQRRWRISAQDDLTLEEGVPLKILSHFQPEQ------EFVLEPGD 182

Query: 176 LLYLPRGYIHQASTVTN--EHSLHVTISVYQK 205
           +LYLP  Y H    V     +S+      YQ+
Sbjct: 183 MLYLPPQYAHDGVAVGECMTYSIGFRAPAYQE 214


>gi|156936656|ref|YP_001440570.1| hypothetical protein ESA_pESA3p05537 [Cronobacter sakazakii ATCC
           BAA-894]
 gi|156534910|gb|ABU79734.1| hypothetical protein ESA_pESA3p05537 [Cronobacter sakazakii ATCC
           BAA-894]
          Length = 373

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G+++W+V     + ++ P    P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   LEPGD+LY+P G+ H     T+E +L+ ++ 
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201


>gi|429118537|ref|ZP_19179298.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 680]
 gi|426327009|emb|CCK10035.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 680]
          Length = 373

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G+++W+V     + ++ P    P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   LEPGD+LY+P G+ H     T+E +L+ ++ 
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201


>gi|333893393|ref|YP_004467268.1| hypothetical protein ambt_09700 [Alteromonas sp. SN2]
 gi|332993411|gb|AEF03466.1| hypothetical protein ambt_09700 [Alteromonas sp. SN2]
          Length = 388

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 50/109 (45%), Gaps = 12/109 (11%)

Query: 85  IKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR 143
           ++P + L N  L +L   F   N         G   H D  + F++Q +G+++W+V  P 
Sbjct: 103 LEPFRFLPNWRLDDLMVSFATKNA--------GVGAHIDQYDVFLVQGKGQRRWRVGKPG 154

Query: 144 MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTN 192
              E  P    P   Q E   PI+   L PGD+LY+P G+ H    + +
Sbjct: 155 EYKEVFPH---PKLRQIEGFDPIIDEVLNPGDVLYIPPGWPHDGKALED 200


>gi|238790022|ref|ZP_04633800.1| hypothetical protein yfred0001_18980 [Yersinia frederiksenii ATCC
           33641]
 gi|238721835|gb|EEQ13497.1| hypothetical protein yfred0001_18980 [Yersinia frederiksenii ATCC
           33641]
          Length = 373

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|416207366|ref|ZP_11620906.1| cupin family protein [Neisseria meningitidis 961-5945]
 gi|254674054|emb|CBA09838.1| myc induced nuclear antigen [Neisseria meningitidis alpha275]
 gi|325141714|gb|EGC64172.1| cupin family protein [Neisseria meningitidis 961-5945]
          Length = 309

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 77/167 (46%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 7   RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 61

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 62  SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 117

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H+ +  +      +++ L K  P
Sbjct: 118 AGDVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 163


>gi|440288006|ref|YP_007340771.1| hypothetical protein D782_2626 [Enterobacteriaceae bacterium strain
           FGI 57]
 gi|440047528|gb|AGB78586.1| hypothetical protein D782_2626 [Enterobacteriaceae bacterium strain
           FGI 57]
          Length = 373

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVNPFEGIIDEELEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+  ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194


>gi|384083452|ref|ZP_09994627.1| hypothetical protein gproHI_09076 [gamma proteobacterium HIMB30]
          Length = 377

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 2/71 (2%)

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
           HYD    F++QL G+++W++  PR  DE  P  ++   +  E   PI T    PGD+LYL
Sbjct: 129 HYDQFSVFLIQLNGRRQWEIG-PRATDET-PMVANQPIALVEPQEPITTWIANPGDVLYL 186

Query: 180 PRGYIHQASTV 190
           P   IH   ++
Sbjct: 187 PPNMIHHGVSL 197


>gi|445498591|ref|ZP_21465446.1| cupin 4 family [Janthinobacterium sp. HH01]
 gi|444788586|gb|ELX10134.1| cupin 4 family [Janthinobacterium sp. HH01]
          Length = 394

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 47/198 (23%), Positives = 81/198 (40%), Gaps = 25/198 (12%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W K+PL I R     +  L++ + +  +   N  E +   ++    DGQ         
Sbjct: 39  DYWHKKPLLI-RNAIPGFKPLLSMEALTKLASTNHAESRLVTNV----DGQWHMQHGPLE 93

Query: 61  ALP---QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQG 116
           +LP   Q  W    +G ++        ++  + + +A L +L   F             G
Sbjct: 94  SLPPMDQKEWTMLVQGVNLFDAKADALLRQFRFIPDARLDDLMVSFA--------TDGGG 145

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PH+D  + F+LQ +G+++W +   +   + E LP     NF   E         L PG
Sbjct: 146 VGPHFDSYDVFLLQAQGQRRWSIGPQKDLTLVEGLPLKILANFKPNE------EFVLNPG 199

Query: 175 DLLYLPRGYIHQASTVTN 192
           D+LYLP  Y H    + +
Sbjct: 200 DMLYLPPHYAHDGIAIGD 217


>gi|304388287|ref|ZP_07370403.1| cupin family protein [Neisseria meningitidis ATCC 13091]
 gi|304337718|gb|EFM03871.1| cupin family protein [Neisseria meningitidis ATCC 13091]
          Length = 384

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 77/167 (46%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H+ +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 238


>gi|123443237|ref|YP_001007211.1| hypothetical protein YE3022 [Yersinia enterocolitica subsp.
           enterocolitica 8081]
 gi|122090198|emb|CAL13061.1| conserved hypothetical protein [Yersinia enterocolitica subsp.
           enterocolitica 8081]
          Length = 373

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     E  TPI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|385324749|ref|YP_005879188.1| hypothetical protein NMV_1952 [Neisseria meningitidis 8013]
 gi|433510027|ref|ZP_20466884.1| cupin-like domain protein [Neisseria meningitidis 12888]
 gi|261393136|emb|CAX50744.1| conserved hypothetical protein [Neisseria meningitidis 8013]
 gi|432244965|gb|ELL00444.1| cupin-like domain protein [Neisseria meningitidis 12888]
          Length = 387

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)

Query: 94  SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
           S+ +    F  A T  +   A G +P    H+D  + F +QL GKK W V  P      +
Sbjct: 107 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 163

Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           P Y         I  P    + V LE GD+LY+PRG+ H    +  E + H+ I  +   
Sbjct: 164 PLYMQQAKDMPHITPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 222

Query: 207 A--WIDLLEKAMP 217
              +++ L K +P
Sbjct: 223 GYNYMEWLMKKIP 235


>gi|422018084|ref|ZP_16364643.1| cupin 4 family protein [Providencia alcalifaciens Dmel2]
 gi|414105209|gb|EKT66772.1| cupin 4 family protein [Providencia alcalifaciens Dmel2]
          Length = 373

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 8/115 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D+ + FI+Q  G+++W+V     + ++    + P     E   PI
Sbjct: 114 SYSVP--GGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFC---AHPALLHVEPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
           + V + PGD+LY+P G+ H   T   E +++ ++     T   DLL      AL+
Sbjct: 169 IDVEMVPGDILYIPPGFPHDGFTF--EDTMNYSVGFRGPTG-RDLLSSFADYALE 220


>gi|317494130|ref|ZP_07952546.1| cupin superfamily protein [Enterobacteriaceae bacterium 9_2_54FAA]
 gi|316917903|gb|EFV39246.1| cupin superfamily protein [Enterobacteriaceae bacterium 9_2_54FAA]
          Length = 373

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPSGFPHDG--FTHETALNYSVG 201


>gi|398802956|ref|ZP_10562131.1| hypothetical protein PMI15_00876 [Polaromonas sp. CF318]
 gi|398097782|gb|EJL88084.1| hypothetical protein PMI15_00876 [Polaromonas sp. CF318]
          Length = 400

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 79/191 (41%), Gaps = 25/191 (13%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD-VEGR 60
           HW+K+PL + R+    +  L++   +  +     +E +    I     G R        R
Sbjct: 22  HWQKKPLLV-RQALPGFQPLLSRAALFKLAASEQVESRL---IVQQAKGWRMKQGPFAPR 77

Query: 61  ALP---QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQG 116
           +LP   Q  W    +G  +        ++  + + +A L +L   F  A          G
Sbjct: 78  SLPPLSQKAWTLLVQGVDLHDAGAHELLQRFRFVPDARLDDLMISFATAGG--------G 129

Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
             PH+D  + F+LQ  G+++WK+   +   + E +P     NF  E+         LE G
Sbjct: 130 VGPHFDSYDVFLLQASGRRRWKIGRQKDLSLQEGVPLKILQNFEPEQ------EFVLEAG 183

Query: 175 DLLYLPRGYIH 185
           D+LYLP  Y H
Sbjct: 184 DMLYLPPRYAH 194


>gi|398835112|ref|ZP_10592489.1| hypothetical protein PMI40_02498 [Herbaspirillum sp. YR522]
 gi|398217303|gb|EJN03821.1| hypothetical protein PMI40_02498 [Herbaspirillum sp. YR522]
          Length = 436

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 84/337 (24%), Positives = 136/337 (40%), Gaps = 49/337 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           D+W K+PL I +    +   L      D   RD+V    ++  IT   DG         +
Sbjct: 82  DYWHKKPLLIRQAIPAFQPPLSRQALFDLAGRDDV----ESRLITHRSDGWDMQHGPLAK 137

Query: 61  --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
             AL Q  W    +G ++        ++  + + +A L +L   F             G 
Sbjct: 138 LPALKQKAWTLLVQGVNLHDDAADALLRQFRFIPDARLDDLMISFAT--------DGGGV 189

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
            PH+D  + F+LQ  G+++W++       + E +P     +F  E+         LEPGD
Sbjct: 190 GPHFDSYDVFLLQAHGQRRWRISAQTDLSLKEDVPLKILRHFEPEQ------EFVLEPGD 243

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISV-YQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
           +LYLP  Y H    V       +T S+ ++  A  +L E  +   L    + DL  R   
Sbjct: 244 MLYLPPQYAHDGVAVGE----CMTYSIGFRAPAHQELGEAFLEYMLD---SIDLPGRFAD 296

Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL-- 292
           P   L+    A  KP ++  D   M   L+D L K + + D DD    +G+ L    L  
Sbjct: 297 P--ELK----ATSKPAELPVD---MVRRLRDELNK-IRFTD-DDITIFLGEYLSEPKLSV 345

Query: 293 ----PPVLSPEELQCSVFENGLRMS-QTGEVYNATQI 324
               PP +S         + GLR+  +T  +Y A+ +
Sbjct: 346 TFDAPPEISFARFFQEARKRGLRLHRKTQMLYRASHV 382


>gi|385851891|ref|YP_005898406.1| cupin family protein [Neisseria meningitidis M04-240196]
 gi|416177792|ref|ZP_11610161.1| cupin family protein [Neisseria meningitidis M6190]
 gi|416191669|ref|ZP_11616167.1| cupin family protein [Neisseria meningitidis ES14902]
 gi|433494502|ref|ZP_20451572.1| cupin-like domain protein [Neisseria meningitidis NM762]
 gi|433502709|ref|ZP_20459674.1| cupin-like domain protein [Neisseria meningitidis NM126]
 gi|325132362|gb|EGC55055.1| cupin family protein [Neisseria meningitidis M6190]
 gi|325138455|gb|EGC61021.1| cupin family protein [Neisseria meningitidis ES14902]
 gi|325206714|gb|ADZ02167.1| cupin family protein [Neisseria meningitidis M04-240196]
 gi|432231176|gb|ELK86846.1| cupin-like domain protein [Neisseria meningitidis NM762]
 gi|432240805|gb|ELK96336.1| cupin-like domain protein [Neisseria meningitidis NM126]
          Length = 387

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)

Query: 94  SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
           S+ +    F  A T  +   A G +P    H+D  + F +QL GKK W V  P      +
Sbjct: 107 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 163

Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           P Y         I  P    + V LE GD+LY+PRG+ H    +  E + H+ I  +   
Sbjct: 164 PLYMQQAKDMPHITPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 222

Query: 207 A--WIDLLEKAMP 217
              +++ L K +P
Sbjct: 223 GYNYMEWLMKKIP 235


>gi|212711786|ref|ZP_03319914.1| hypothetical protein PROVALCAL_02861 [Providencia alcalifaciens DSM
           30120]
 gi|212685308|gb|EEB44836.1| hypothetical protein PROVALCAL_02861 [Providencia alcalifaciens DSM
           30120]
          Length = 373

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 8/115 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D+ + FI+Q  G+++W+V     + ++    + P     E   PI
Sbjct: 114 SYSVP--GGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFC---AHPALLHVEPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
           + V + PGD+LY+P G+ H   T   E +++ ++     T   DLL      AL+
Sbjct: 169 IDVEMVPGDILYIPPGFPHDGFTF--EDTMNYSVGFRGPTG-RDLLSSFADYALE 220


>gi|387889678|ref|YP_006319976.1| cupin superfamily protein family [Escherichia blattae DSM 4481]
 gi|386924511|gb|AFJ47465.1| cupin superfamily protein family [Escherichia blattae DSM 4481]
          Length = 392

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD
Sbjct: 140 GVGPHLDQYDVFIIQGTGRRRWRVGEKLPLKQHCPH---PDLLQVDPFEAIIDEEMEPGD 196

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 197 ILYIPPGFPHEGYALEN 213


>gi|254506132|ref|ZP_05118276.1| Cupin superfamily protein [Vibrio parahaemolyticus 16]
 gi|219550950|gb|EED27931.1| Cupin superfamily protein [Vibrio parahaemolyticus 16]
          Length = 375

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 13/126 (10%)

Query: 67  WDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
           W    + C+   L     + P + L   L  +L  C      Y  P    G  PH D  +
Sbjct: 80  WQLIVQACNHWHLGAAELVAPFKALPQWLFDDLMVC------YSAPQG--GVGPHIDQYD 131

Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
            FI+Q  GK++W+V      DE   + S      Q E    I+   LEPGD+LY+P G+ 
Sbjct: 132 VFIIQGSGKRRWRV---GAADEGQYQESIQGALRQIESFDAIIDEVLEPGDILYIPPGFP 188

Query: 185 HQASTV 190
           H+ +T+
Sbjct: 189 HEGNTI 194


>gi|403358106|gb|EJY78689.1| hypothetical protein OXYTRI_24149 [Oxytricha trifallax]
          Length = 624

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 53/219 (24%), Positives = 93/219 (42%), Gaps = 39/219 (17%)

Query: 19  NNLITSKKIDDILRDNV----IEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGC 74
           +N+I+ K IDD L+  V    I++   I++  Y +G++ +M++        + D  +   
Sbjct: 173 DNIISKKIIDDYLKMQVPQGNIKYSDQIELFKYVNGKKFSMNLGESVDSDYLEDQLQNQS 232

Query: 75  SIRFLN-PQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEG 133
           +I  LN  Q +   L ++ +   ELF   +         T Q F  HY         L  
Sbjct: 233 TILQLNKAQEFSISLAKVMSLSDELFASDI---------TCQVFLQHY---------LTQ 274

Query: 134 KKKWKVYLPRM----VDEYLPRYSSP----NFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
            ++ K  L  M     D+Y+           F+ ++  TP+  V L+ GD  Y+PR   +
Sbjct: 275 DRQKKNQLSEMQQNEFDQYIFMIDGSVGLRFFNNQDEKTPVFKVKLQKGDFAYIPRNVKY 334

Query: 186 QASTVTNE--------HSLHVTISVYQKTAWIDLLEKAM 216
           Q  T   E        + L++T+S   +  W  L++K M
Sbjct: 335 QTKTEIVEDDQTSEQNYCLYITLSHQSQQTWFSLIQKQM 373


>gi|416214530|ref|ZP_11622926.1| cupin family protein [Neisseria meningitidis M01-240013]
 gi|325143802|gb|EGC66118.1| cupin family protein [Neisseria meningitidis M01-240013]
 gi|389606480|emb|CCA45393.1| Myc-induced nuclear antigen [Neisseria meningitidis alpha522]
          Length = 381

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)

Query: 94  SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
           S+ +    F  A T  +   A G +P    H+D  + F +QL GKK W V  P      +
Sbjct: 101 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 157

Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           P Y         I  P    + V LE GD+LY+PRG+ H    +  E + H+ I  +   
Sbjct: 158 PLYMQQAKDMPHITPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 216

Query: 207 A--WIDLLEKAMP 217
              +++ L K +P
Sbjct: 217 GYNYMEWLMKKIP 229


>gi|403058364|ref|YP_006646581.1| cupin 4 family protein [Pectobacterium carotovorum subsp.
           carotovorum PCC21]
 gi|402805690|gb|AFR03328.1| cupin 4 family protein [Pectobacterium carotovorum subsp.
           carotovorum PCC21]
          Length = 373

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W++     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFDAIIDEELEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           ++Y+P G+ H+  ++ N  SL+ ++      A
Sbjct: 178 IIYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207


>gi|340786871|ref|YP_004752336.1| hypothetical protein CFU_1681 [Collimonas fungivorans Ter331]
 gi|340552138|gb|AEK61513.1| hypothetical protein CFU_1681 [Collimonas fungivorans Ter331]
          Length = 394

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 81/203 (39%), Gaps = 39/203 (19%)

Query: 1   DHWEKEPLYISRKQKNYY-----NNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM 55
           D+W K+PL I +    +      + L    + DD+    V  FK+          Q Q  
Sbjct: 39  DYWHKKPLLIRQALPGFIPLLSPDALFELARRDDVESRLVTHFKQ----------QWQVS 88

Query: 56  DVEGRALPQIV---WDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTP 111
           +     LP      W    +G ++        ++  + + +A L +L   +   N     
Sbjct: 89  NGPASELPSTAQKDWTLLVQGVNLHDDGADALLRQFRFIPDARLDDLMISYATDNG---- 144

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS----PNFSQEEIGTPIL 167
               G  PH+D  + F+LQ  G+++WK+   +  D  L   S+     NF  E+      
Sbjct: 145 ----GVGPHFDSYDVFLLQAHGQRRWKIGATQ--DLTLVEGSALKILKNFKPEQ------ 192

Query: 168 TVTLEPGDLLYLPRGYIHQASTV 190
              LEPGD+LYLP  Y H  + +
Sbjct: 193 EFILEPGDMLYLPPQYAHDGTAI 215


>gi|227111568|ref|ZP_03825224.1| hypothetical protein PcarbP_01325 [Pectobacterium carotovorum
           subsp. brasiliensis PBR1692]
          Length = 373

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W++     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFDAIIDEELEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           ++Y+P G+ H+  ++ N  SL+ ++      A
Sbjct: 178 IIYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207


>gi|416214473|ref|ZP_11622908.1| cupin family protein [Neisseria meningitidis M01-240013]
 gi|325143871|gb|EGC66186.1| cupin family protein [Neisseria meningitidis M01-240013]
          Length = 384

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHISRQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|294951730|ref|XP_002787109.1| hypothetical protein Pmar_PMAR001518 [Perkinsus marinus ATCC 50983]
 gi|239901734|gb|EER18905.1| hypothetical protein Pmar_PMAR001518 [Perkinsus marinus ATCC 50983]
          Length = 58

 Score = 50.8 bits (120), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/46 (45%), Positives = 32/46 (69%)

Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
           + +G P+ +VTLE GDLL +PRG IH+A     + SLH+T+ V ++
Sbjct: 8   DNLGEPLYSVTLEEGDLLLIPRGVIHRARASQEQGSLHITVKVLKR 53


>gi|50119371|ref|YP_048538.1| hypothetical protein ECA0420 [Pectobacterium atrosepticum SCRI1043]
 gi|49609897|emb|CAG73335.1| conserved hypothetical protein [Pectobacterium atrosepticum
           SCRI1043]
          Length = 373

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 88/204 (43%), Gaps = 22/204 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K+P+ +    KN + N +     D+ L    +E + +  +  Y++GQ Q  +     
Sbjct: 16  YWQKKPVVL----KNAFPNFVDPITPDE-LAGLAMEAEVDSRLVSYKNGQWQASNGPFEH 70

Query: 62  ---LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
              L +  W    +  +         + P + L +  L +L   F        P    G 
Sbjct: 71  FDNLGETGWSLLAQAVNHWHAPSAELVSPFRVLPDWRLDDLMISF------SVP--GGGV 122

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGDIL 179

Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
           Y+P G+ H     T+E +L+ ++ 
Sbjct: 180 YIPPGFPHDG--FTHETALNYSVG 201


>gi|414593044|ref|ZP_11442692.1| hypothetical protein YcfD [Escherichia blattae NBRC 105725]
 gi|403195877|dbj|GAB80344.1| hypothetical protein YcfD [Escherichia blattae NBRC 105725]
          Length = 373

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLPLKQHCPH---PDLLQVDPFEAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 178 ILYIPPGFPHEGYALEN 194


>gi|357387437|ref|YP_004902276.1| hypothetical protein KSE_04740 [Kitasatospora setae KM-6054]
 gi|311893912|dbj|BAJ26320.1| hypothetical protein KSE_04740 [Kitasatospora setae KM-6054]
          Length = 316

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/196 (23%), Positives = 77/196 (39%), Gaps = 13/196 (6%)

Query: 29  DILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPL 88
           +++RD  +   +    T    G+R++    G A P  +     EG ++       +   +
Sbjct: 54  EMVRDGAVVEPELFCATRTVAGRRES----GWADPARISALLAEGATLLLPQLDQWHAGV 109

Query: 89  QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
           + +   +    G    A  + T    +G   H DD +  ++QL G K W V+ P     +
Sbjct: 110 RAMATDIALHLGRRTEAFCFTTGAGRRGLDVHRDDADVLVVQLAGAKDWTVHEPPADGHW 169

Query: 149 LPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAW 208
            P  +         G   L   L PG++LY+PRG  H   T T    L V +S   + A 
Sbjct: 170 RPGLAPAP------GPVALRTALTPGEILYVPRGAPH---TATGHRGLSVHLSFTIREAT 220

Query: 209 IDLLEKAMPKALQAAG 224
              L  A+ + L   G
Sbjct: 221 TGRLHAALGEFLTTPG 236


>gi|421545099|ref|ZP_15991164.1| hypothetical protein NMEN140_1667 [Neisseria meningitidis NM140]
 gi|421547149|ref|ZP_15993187.1| hypothetical protein NMEN183_1679 [Neisseria meningitidis NM183]
 gi|421548120|ref|ZP_15994148.1| hypothetical protein NMEN2781_0562 [Neisseria meningitidis NM2781]
 gi|421553356|ref|ZP_15999319.1| hypothetical protein NMEN576_1705 [Neisseria meningitidis NM576]
 gi|402321512|gb|EJU56985.1| hypothetical protein NMEN183_1679 [Neisseria meningitidis NM183]
 gi|402321782|gb|EJU57254.1| hypothetical protein NMEN140_1667 [Neisseria meningitidis NM140]
 gi|402326842|gb|EJU62240.1| hypothetical protein NMEN2781_0562 [Neisseria meningitidis NM2781]
 gi|402328264|gb|EJU63641.1| hypothetical protein NMEN576_1705 [Neisseria meningitidis NM576]
          Length = 322

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)

Query: 94  SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
           S+ +    F  A T  +   A G +P    H+D  + F +QL GKK W V  P      +
Sbjct: 107 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 163

Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           P Y         I  P    + V LE GD+LY+PRG+ H    +  E + H+ I  +   
Sbjct: 164 PLYMQQAKDMPHITPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 222

Query: 207 A--WIDLLEKAMP 217
              +++ L K +P
Sbjct: 223 GYNYMEWLMKKIP 235


>gi|271500463|ref|YP_003333488.1| Cupin 4 family protein [Dickeya dadantii Ech586]
 gi|270344018|gb|ACZ76783.1| Cupin 4 family protein [Dickeya dadantii Ech586]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/289 (22%), Positives = 125/289 (43%), Gaps = 57/289 (19%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
           HW+K+P+ I     ++ +  I+  ++  +  +N ++ +    +  + DG+        ++
Sbjct: 16  HWQKKPVIIKAGFPHFLDP-ISPDELAGLAMENEVDSR----LVSHRDGRWDVSHGPFES 70

Query: 55  MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
            D  G +    L Q V DH+ E  +         ++P +QL +  + +L   F       
Sbjct: 71  YDHLGESNWSLLVQAV-DHWHEPSA-------ALMRPFRQLPDWRIDDLMISF------S 116

Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
            P    G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q      I+  
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVAPFDAIIDE 171

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
            + PGD+LY+P G+ H+  ++ N  SL+ ++             +A       +G  D  
Sbjct: 172 EMAPGDILYIPPGFPHEGYSLEN--SLNYSVGF-----------RAPSSRELVSGFADYV 218

Query: 230 FRRGLPIGYLRYAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVD 275
             R L  G  RY+     AR  P D+    L   + L+ ++ +LV++ D
Sbjct: 219 LSREL--GGQRYSDPDIPARAHPADVLPQEL---DKLQHMMLELVQHKD 262


>gi|227327069|ref|ZP_03831093.1| hypothetical protein PcarcW_07019 [Pectobacterium carotovorum
           subsp. carotovorum WPP14]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W++     + ++ P    P+  Q +    I+   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFDAIIDEELEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           ++Y+P G+ H+  ++ N  SL+ ++      A
Sbjct: 178 IVYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207


>gi|238920217|ref|YP_002933732.1| Cupin superfamily protein [Edwardsiella ictaluri 93-146]
 gi|238869786|gb|ACR69497.1| Cupin superfamily protein [Edwardsiella ictaluri 93-146]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/193 (23%), Positives = 78/193 (40%), Gaps = 14/193 (7%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K PL I    K++ + L       D L    +E + +  +   +DGQ Q       
Sbjct: 15  NYWQKRPLVIRHGFKDFIDPLSP-----DELAGLAMEQEVDSRLVCRDDGQWQVSHG--- 66

Query: 61  ALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
             P   +DH  E   S+       +  P   L    + L    +            G  P
Sbjct: 67  --PFTSYDHLGETNWSLLVQAVDHWHTPAAALMRPFRHLPDWRLDDLMISFSVPGGGVGP 124

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
           H D  + FI+Q  G+++W+V       ++ P    P+  Q +    I+ + L PGD+LY+
Sbjct: 125 HIDQYDVFIIQGSGRRRWRVGEKGQRRQHCPH---PDLLQVDPFDAIIDIELSPGDILYI 181

Query: 180 PRGYIHQASTVTN 192
           P  + H+   + N
Sbjct: 182 PPHFPHEGYALEN 194


>gi|374287949|ref|YP_005035034.1| hypothetical protein BMS_1188 [Bacteriovorax marinus SJ]
 gi|301166490|emb|CBW26066.1| conserved hypothetical protein [Bacteriovorax marinus SJ]
          Length = 304

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 23/165 (13%)

Query: 69  HYREGCSIRFLNPQTYIKPLQQLNASLQELFGCF------VGANTYQTPPTAQGFAPHYD 122
           +Y +G  + F +   + + +Q     L  L G F      +  N     P   G A H+D
Sbjct: 83  YYDQGMGLLFNDLNLFDEKMQD---DLNRLCGDFGVSRATMSRNLVYATPKGGGTATHFD 139

Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT-----------VTL 171
                +LQL+G+K W +     V   L R++    S  E+ + I +             L
Sbjct: 140 QNINLVLQLKGRKSWWLKENECVRNPLTRHTLGTISDPELSSYIDSPFPDSIKYDEEYEL 199

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
             G LL++P+G+ H+     +  +L+ T S+    +W+D+L  A+
Sbjct: 200 SEGSLLFVPKGHWHKTHAHDDSLALNFTFSL---PSWVDILSMAL 241


>gi|254493031|ref|ZP_05106202.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291]
 gi|226512071|gb|EEH61416.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291]
          Length = 384

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|269139406|ref|YP_003296107.1| cupin 4 family protein [Edwardsiella tarda EIB202]
 gi|387867989|ref|YP_005699458.1| hypothetical protein ETAF_1860 [Edwardsiella tarda FL6-60]
 gi|267985067|gb|ACY84896.1| cupin 4 family protein [Edwardsiella tarda EIB202]
 gi|304559302|gb|ADM41966.1| hypothetical protein ETAF_1860 [Edwardsiella tarda FL6-60]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 77/193 (39%), Gaps = 14/193 (7%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
           ++W+K PL I    KN+ + L       D L    +E + +  +   +D Q Q       
Sbjct: 15  NYWQKRPLVIRHGFKNFIDPLSP-----DELAGLAMEQEVDSRLVCRDDAQWQVSHG--- 66

Query: 61  ALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
             P   +DH  E   S+       +  P   L    + L    +            G  P
Sbjct: 67  --PFTSYDHLGETNWSLLVQAVDHWHTPAAALMRPFRRLPDWRLDDLMISFSVPGGGVGP 124

Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
           H D  + FI+Q  G+++W+V       ++ P    P+  Q +    I+ V L PGD+LY+
Sbjct: 125 HIDQYDVFIIQGSGRRRWRVGEKGQRRQHCPH---PDLLQVDPFDAIIDVELSPGDILYI 181

Query: 180 PRGYIHQASTVTN 192
           P  + H+   + N
Sbjct: 182 PPHFPHEGYALEN 194


>gi|410087896|ref|ZP_11284597.1| hypothetical protein C790_2055 [Morganella morganii SC01]
 gi|421493865|ref|ZP_15941219.1| hypothetical protein MU9_2389 [Morganella morganii subsp. morganii
           KT]
 gi|455739298|ref|YP_007505564.1| hypothetical protein MU9_2145 [Morganella morganii subsp. morganii
           KT]
 gi|400191909|gb|EJO25051.1| hypothetical protein MU9_2389 [Morganella morganii subsp. morganii
           KT]
 gi|409765890|gb|EKN49993.1| hypothetical protein C790_2055 [Morganella morganii SC01]
 gi|455420861|gb|AGG31191.1| hypothetical protein MU9_2145 [Morganella morganii subsp. morganii
           KT]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 56/115 (48%), Gaps = 8/115 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D+ + FI+Q  G+++W+V     + ++    + P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFC---AHPALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
           + V + PGD+LY+P G+ H   T   E +++ ++     T   DLL      AL+
Sbjct: 169 IDVEMAPGDILYIPPGFPHDGFTF--EDTMNYSVGFRGPTG-RDLLSSFADYALE 220


>gi|385855805|ref|YP_005902318.1| cupin family protein [Neisseria meningitidis M01-240355]
 gi|325204746|gb|ADZ00200.1| cupin family protein [Neisseria meningitidis M01-240355]
          Length = 384

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|429112863|ref|ZP_19174633.1| FIG00553994: hypothetical protein [Cronobacter malonaticus 507]
 gi|426314020|emb|CCK00746.1| FIG00553994: hypothetical protein [Cronobacter malonaticus 507]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   LEPGD+LY+P G+ H     T+E +L+ ++ 
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201


>gi|59800581|ref|YP_207293.1| hypothetical protein NGO0121 [Neisseria gonorrhoeae FA 1090]
 gi|268598285|ref|ZP_06132452.1| conserved hypothetical protein [Neisseria gonorrhoeae MS11]
 gi|268685890|ref|ZP_06152752.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-93-1035]
 gi|59717476|gb|AAW88881.1| conserved hypothetical protein [Neisseria gonorrhoeae FA 1090]
 gi|268582416|gb|EEZ47092.1| conserved hypothetical protein [Neisseria gonorrhoeae MS11]
 gi|268626174|gb|EEZ58574.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-93-1035]
          Length = 384

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|253989212|ref|YP_003040568.1| conserved hypothetical protein [Photorhabdus asymbiotica]
 gi|253780662|emb|CAQ83824.1| conserved hypothetical protein [Photorhabdus asymbiotica]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +    I+   +EPGD
Sbjct: 121 GVGPHLDQYDVFIIQGMGRRRWRVGEKVPMKQHCPH---PDLLQVDSFDAIIDEEMEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H+   +  E SL+ ++ 
Sbjct: 178 VLYIPPGFPHEGYAI--EDSLNYSVG 201


>gi|206578392|ref|YP_002240682.1| cupin family protein [Klebsiella pneumoniae 342]
 gi|288937381|ref|YP_003441440.1| cupin [Klebsiella variicola At-22]
 gi|206567450|gb|ACI09226.1| cupin family protein [Klebsiella pneumoniae 342]
 gi|288892090|gb|ADC60408.1| Cupin 4 family protein [Klebsiella variicola At-22]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    VT+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201


>gi|290512111|ref|ZP_06551478.1| ycfD [Klebsiella sp. 1_1_55]
 gi|289775106|gb|EFD83107.1| ycfD [Klebsiella sp. 1_1_55]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    VT+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201


>gi|422008727|ref|ZP_16355711.1| cupin 4 family protein [Providencia rettgeri Dmel1]
 gi|414095200|gb|EKT56863.1| cupin 4 family protein [Providencia rettgeri Dmel1]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 8/115 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D+ + FI+Q  G+++W+V     + ++    S P     +   PI
Sbjct: 114 SYSVP--EGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFC---SHPALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
           + V + PGD+LY+P G+ H   T   E +++ ++     T   DLL      AL+
Sbjct: 169 IDVEMAPGDILYIPPGFPHDGFTF--EDTMNYSVGFRGPTG-RDLLSSFADYALE 220


>gi|411010101|ref|ZP_11386430.1| cupin superfamily protein [Aeromonas aquariorum AAK1]
          Length = 376

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI Q +GK+ W+V     ++E+    +       E    I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAAPLNEFAAHAA---LLHCEPFEAI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           + V +EPGD+LY+P G+ H+   +  E SL+ ++ 
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200


>gi|253686795|ref|YP_003015985.1| Cupin 4 family protein [Pectobacterium carotovorum subsp.
           carotovorum PC1]
 gi|251753373|gb|ACT11449.1| Cupin 4 family protein [Pectobacterium carotovorum subsp.
           carotovorum PC1]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|374619859|ref|ZP_09692393.1| hypothetical protein OMB55_00009960 [gamma proteobacterium HIMB55]
 gi|374303086|gb|EHQ57270.1| hypothetical protein OMB55_00009960 [gamma proteobacterium HIMB55]
          Length = 324

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 39/75 (52%), Gaps = 2/75 (2%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH+D  + F+LQ  G+++WK+  P+  D   P     +    +   P  T  L+PGD
Sbjct: 77  GVGPHFDQYDVFLLQGAGQRRWKIG-PK-CDANTPTVRESDLRLIDEFEPTETFVLQPGD 134

Query: 176 LLYLPRGYIHQASTV 190
           +LY+P G+ H    V
Sbjct: 135 VLYVPPGFAHWGEAV 149


>gi|365838672|ref|ZP_09380010.1| cupin family protein [Hafnia alvei ATCC 51873]
 gi|364559465|gb|EHM37448.1| cupin family protein [Hafnia alvei ATCC 51873]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|397605458|gb|EJK59020.1| hypothetical protein THAOC_20816 [Thalassiosira oceanica]
          Length = 382

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 76/166 (45%), Gaps = 15/166 (9%)

Query: 38  FKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQE 97
           ++ N  I+L   G R+ +D       Q++      G  +  ++P  Y+  L+ +N    E
Sbjct: 229 YETNDWISLRIRGSREQLDKSTMCYEQMLEYLDGGGSVVIPISPSDYLHELK-MNVE-NE 286

Query: 98  LFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF 157
           L       N Y + P+A     HYD     +LQL G+K+W +      D           
Sbjct: 287 LDIADTSMNVYHSGPSAVALNIHYDAYPVIVLQLSGEKEWIIQNDAFGD----------- 335

Query: 158 SQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHS-LHVTISV 202
           S +++ T    +T++ GD+LY+P+G  H A+T    +S  HVTI +
Sbjct: 336 SIKDVNT-WKNITMKEGDVLYIPKGIYHAATTKEGFNSTTHVTIGL 380


>gi|397655706|ref|YP_006496408.1| hypothetical protein A225_0637 [Klebsiella oxytoca E718]
 gi|394344379|gb|AFN30500.1| hypothetical protein A225_0637 [Klebsiella oxytoca E718]
          Length = 374

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/204 (21%), Positives = 89/204 (43%), Gaps = 22/204 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
           +W+K+P+ + +   N+ + +       D L    +E + +  +   ++GQ Q  +     
Sbjct: 16  YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70

Query: 61  --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
              L +  W    +  +   +     ++P + L +  L +L   F        P    G 
Sbjct: 71  FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGDIL 179

Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
           Y+P G+ H    +T+E +L+ ++ 
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201


>gi|375258733|ref|YP_005017903.1| hypothetical protein KOX_09690 [Klebsiella oxytoca KCTC 1686]
 gi|365908211|gb|AEX03664.1| hypothetical protein KOX_09690 [Klebsiella oxytoca KCTC 1686]
          Length = 374

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/204 (21%), Positives = 89/204 (43%), Gaps = 22/204 (10%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
           +W+K+P+ + +   N+ + +       D L    +E + +  +   ++GQ Q  +     
Sbjct: 16  YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70

Query: 61  --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
              L +  W    +  +   +     ++P + L +  L +L   F        P    G 
Sbjct: 71  FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122

Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
            PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGDIL 179

Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
           Y+P G+ H    +T+E +L+ ++ 
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201


>gi|365846400|ref|ZP_09386904.1| cupin family protein [Yokenella regensburgei ATCC 43003]
 gi|364574118|gb|EHM51591.1| cupin family protein [Yokenella regensburgei ATCC 43003]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|326430106|gb|EGD75676.1| hypothetical protein PTSG_07795 [Salpingoeca sp. ATCC 50818]
          Length = 394

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 37/68 (54%)

Query: 70  YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
           Y +GC+I       Y +PL  L   +Q+  G    AN Y TP  +QGF  H D+ +  I+
Sbjct: 280 YLDGCTIVCNIVPAYWQPLAGLMHGVQQETGLAYMANMYLTPRASQGFVEHTDNKDGLIV 339

Query: 130 QLEGKKKW 137
           Q +G+K+W
Sbjct: 340 QTDGRKEW 347


>gi|429085228|ref|ZP_19148206.1| FIG00553994: hypothetical protein [Cronobacter condimenti 1330]
 gi|426545560|emb|CCJ74247.1| FIG00553994: hypothetical protein [Cronobacter condimenti 1330]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|15677626|ref|NP_274785.1| hypothetical protein NMB1786 [Neisseria meningitidis MC58]
 gi|385853846|ref|YP_005900360.1| cupin family protein [Neisseria meningitidis H44/76]
 gi|416198383|ref|ZP_11618931.1| cupin family protein [Neisseria meningitidis CU385]
 gi|427827696|ref|ZP_18994720.1| cupin family protein [Neisseria meningitidis H44/76]
 gi|433488954|ref|ZP_20446104.1| cupin-like domain protein [Neisseria meningitidis M13255]
 gi|433505610|ref|ZP_20462543.1| cupin-like domain protein [Neisseria meningitidis 9506]
 gi|433507776|ref|ZP_20464674.1| cupin-like domain protein [Neisseria meningitidis 9757]
 gi|7227040|gb|AAF42125.1| hypothetical protein NMB1786 [Neisseria meningitidis MC58]
 gi|316984382|gb|EFV63355.1| cupin family protein [Neisseria meningitidis H44/76]
 gi|325139670|gb|EGC62208.1| cupin family protein [Neisseria meningitidis CU385]
 gi|325200850|gb|ADY96305.1| cupin family protein [Neisseria meningitidis H44/76]
 gi|389606457|emb|CCA45370.1| nucleolar protein 66 [Neisseria meningitidis alpha522]
 gi|432221283|gb|ELK77094.1| cupin-like domain protein [Neisseria meningitidis M13255]
 gi|432239330|gb|ELK94884.1| cupin-like domain protein [Neisseria meningitidis 9757]
 gi|432239497|gb|ELK95049.1| cupin-like domain protein [Neisseria meningitidis 9506]
          Length = 384

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDPRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|397163773|ref|ZP_10487231.1| cupin superfamily protein [Enterobacter radicincitans DSM 16656]
 gi|396094328|gb|EJI91880.1| cupin superfamily protein [Enterobacter radicincitans DSM 16656]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           +Y  P    G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   LEPGD+LY+P G+ H     T+E + + +I 
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHEAAFNYSIG 201


>gi|429089661|ref|ZP_19152393.1| FIG00553994: hypothetical protein [Cronobacter universalis NCTC
           9529]
 gi|426509464|emb|CCK17505.1| FIG00553994: hypothetical protein [Cronobacter universalis NCTC
           9529]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|323451275|gb|EGB07153.1| hypothetical protein AURANDRAFT_71864 [Aureococcus anophagefferens]
          Length = 494

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 119 PHYDDIEAFILQLEGKKKWKVYLP-----RMVDEYLPRYSSPN-FSQEEIGTPILTVTLE 172
           PH D  E   LQ  G+++W+ + P       +   L     P     E +G P L V +E
Sbjct: 138 PHTDFQEVVCLQCTGRQRWRAWRPPKPLGDALGAALALGKGPEPLRPEALGAPALDVVVE 197

Query: 173 PGDLLYLPRGYIHQASTV 190
           PGD++Y+P G+ H   T+
Sbjct: 198 PGDVVYVPAGWPHATDTL 215


>gi|421541069|ref|ZP_15987199.1| cupin family protein [Neisseria meningitidis 93004]
 gi|421541878|ref|ZP_15987991.1| cupin family protein [Neisseria meningitidis NM255]
 gi|421568263|ref|ZP_16013989.1| cupin family protein [Neisseria meningitidis NM3001]
 gi|402315862|gb|EJU51416.1| cupin family protein [Neisseria meningitidis 93004]
 gi|402318901|gb|EJU54416.1| cupin family protein [Neisseria meningitidis NM255]
 gi|402341991|gb|EJU77162.1| cupin family protein [Neisseria meningitidis NM3001]
          Length = 384

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDPRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|421545082|ref|ZP_15991147.1| cupin family protein [Neisseria meningitidis NM140]
 gi|421547132|ref|ZP_15993170.1| cupin family protein [Neisseria meningitidis NM183]
 gi|421548101|ref|ZP_15994129.1| cupin family protein [Neisseria meningitidis NM2781]
 gi|421553339|ref|ZP_15999302.1| cupin family protein [Neisseria meningitidis NM576]
 gi|402321495|gb|EJU56968.1| cupin family protein [Neisseria meningitidis NM183]
 gi|402321765|gb|EJU57237.1| cupin family protein [Neisseria meningitidis NM140]
 gi|402326823|gb|EJU62221.1| cupin family protein [Neisseria meningitidis NM2781]
 gi|402328247|gb|EJU63624.1| cupin family protein [Neisseria meningitidis NM576]
          Length = 384

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDPRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|421541860|ref|ZP_15987973.1| hypothetical protein NMEN255_0488 [Neisseria meningitidis NM255]
 gi|402318883|gb|EJU54398.1| hypothetical protein NMEN255_0488 [Neisseria meningitidis NM255]
          Length = 387

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)

Query: 94  SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
           S+ +    F  A T  +   A G +P    H+D  + F +QL GKK W V  P      +
Sbjct: 107 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 163

Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
           P Y         I  P    + V LE GD+LY+PRG+ H    +  E + H+ I  +   
Sbjct: 164 PLYMQQAKDMPHIIPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 222

Query: 207 A--WIDLLEKAMP 217
              +++ L K +P
Sbjct: 223 GYNYMEWLMKKIP 235


>gi|336316193|ref|ZP_08571093.1| hypothetical protein Rhein_2495 [Rheinheimera sp. A13L]
 gi|335879315|gb|EGM77214.1| hypothetical protein Rhein_2495 [Rheinheimera sp. A13L]
          Length = 384

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 96/221 (43%), Gaps = 32/221 (14%)

Query: 1   DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFK--------KNIDITLYEDGQR 52
           D+W+K+PL I    + + +  I+  ++  +  +  IE +         +++   +ED   
Sbjct: 17  DYWQKKPLLIKAGFREFIDP-ISPDELAGLAGEEEIESRIVSNKDAAWDMETGPFEDFSH 75

Query: 53  QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
             +  EG  L     DH+    S   + P  +I      N  + +L   F  A       
Sbjct: 76  --LGEEGWTLLVQAVDHWHPQ-SAALIEPFRFIP-----NWRIDDLMVSFSTAGG----- 122

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
              G  PH D  + FI+Q +GK+ W+V +P   + ++ P    P   Q       + V  
Sbjct: 123 ---GVGPHLDQYDVFIIQGQGKRHWRVGMPDSSLKQHCPH---PRLLQVTPFVDCINVIT 176

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
           EPGD+LY+P G  H    ++ + SL+ ++  ++  A  DLL
Sbjct: 177 EPGDILYIPPGCPHDG--ISLDPSLNYSVG-FRAPAQKDLL 214


>gi|433467078|ref|ZP_20424535.1| cupin-like domain protein [Neisseria meningitidis 87255]
 gi|432203654|gb|ELK59705.1| cupin-like domain protein [Neisseria meningitidis 87255]
          Length = 382

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 58/130 (44%), Gaps = 23/130 (17%)

Query: 102 FVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF 157
           F  A T  +   A G +P    H+D  + F +QL GKK W V  P         +  P +
Sbjct: 115 FAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPN--------FDMPLY 166

Query: 158 SQEEIGTPILT--------VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA-- 207
            Q+    P +T        V LE GD+LY+PRG+ H    +  E + H+ I  +      
Sbjct: 167 MQQAKDMPHITPSKTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYN 225

Query: 208 WIDLLEKAMP 217
           +++ L K +P
Sbjct: 226 YMEWLMKKIP 235


>gi|262045121|ref|ZP_06018150.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259037513|gb|EEW38755.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|268681420|ref|ZP_06148282.1| conserved hypothetical protein [Neisseria gonorrhoeae PID332]
 gi|268621704|gb|EEZ54104.1| conserved hypothetical protein [Neisseria gonorrhoeae PID332]
          Length = 384

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDARDMYTIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|421913072|ref|ZP_16342771.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST258-K26BO]
 gi|410113050|emb|CCM85396.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST258-K26BO]
          Length = 352

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 100 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 156

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 157 ILYIPPGFPHDG--ITHETALNYSVG 180


>gi|365144966|ref|ZP_09348937.1| hypothetical protein HMPREF1024_04968 [Klebsiella sp. 4_1_44FAA]
 gi|363647464|gb|EHL86681.1| hypothetical protein HMPREF1024_04968 [Klebsiella sp. 4_1_44FAA]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|429103335|ref|ZP_19165309.1| FIG00553994: hypothetical protein [Cronobacter turicensis 564]
 gi|426289984|emb|CCJ91422.1| FIG00553994: hypothetical protein [Cronobacter turicensis 564]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|260424469|ref|YP_003212670.1| Uncharacterized protein ycfD [Cronobacter turicensis z3032]
 gi|260219277|emb|CBA34631.1| Uncharacterized protein ycfD [Cronobacter turicensis z3032]
          Length = 384

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 132 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 188

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 189 ILYIPPGFPHDG--FTHETALNYSVG 212


>gi|152973214|ref|YP_001338360.1| hypothetical protein KPN_04743 [Klebsiella pneumoniae subsp.
           pneumoniae MGH 78578]
 gi|330002904|ref|ZP_08304454.1| cupin family protein [Klebsiella sp. MS 92-3]
 gi|378976749|ref|YP_005224890.1| hypothetical protein KPHS_05900 [Klebsiella pneumoniae subsp.
           pneumoniae HS11286]
 gi|386032946|ref|YP_005952859.1| hypothetical protein KPN2242_01850 [Klebsiella pneumoniae KCTC
           2242]
 gi|419976365|ref|ZP_14491764.1| hypothetical protein KPNIH1_23509 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH1]
 gi|419981995|ref|ZP_14497263.1| hypothetical protein KPNIH2_22949 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH2]
 gi|419988292|ref|ZP_14503389.1| hypothetical protein KPNIH4_25477 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH4]
 gi|419994116|ref|ZP_14509037.1| hypothetical protein KPNIH5_25666 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH5]
 gi|419999323|ref|ZP_14514099.1| hypothetical protein KPNIH6_22751 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH6]
 gi|420005743|ref|ZP_14520355.1| hypothetical protein KPNIH7_26067 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH7]
 gi|420011425|ref|ZP_14525875.1| hypothetical protein KPNIH8_25481 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH8]
 gi|420017040|ref|ZP_14531324.1| hypothetical protein KPNIH9_24516 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH9]
 gi|420022977|ref|ZP_14537129.1| hypothetical protein KPNIH10_25849 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH10]
 gi|420028688|ref|ZP_14542656.1| hypothetical protein KPNIH11_25289 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH11]
 gi|420040236|ref|ZP_14553848.1| hypothetical protein KPNIH14_26001 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH14]
 gi|420045955|ref|ZP_14559409.1| hypothetical protein KPNIH16_25704 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH16]
 gi|420051785|ref|ZP_14565059.1| hypothetical protein KPNIH17_26058 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH17]
 gi|420057416|ref|ZP_14570552.1| hypothetical protein KPNIH18_25796 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH18]
 gi|420062991|ref|ZP_14575943.1| hypothetical protein KPNIH19_25601 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH19]
 gi|420068714|ref|ZP_14581486.1| hypothetical protein KPNIH20_25177 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH20]
 gi|420073754|ref|ZP_14586376.1| hypothetical protein KPNIH21_21462 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH21]
 gi|420079549|ref|ZP_14591993.1| hypothetical protein KPNIH22_21338 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH22]
 gi|420085553|ref|ZP_14597773.1| hypothetical protein KPNIH23_22666 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH23]
 gi|421917563|ref|ZP_16347115.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST258-K28BO]
 gi|424828746|ref|ZP_18253474.1| cupin family protein [Klebsiella pneumoniae subsp. pneumoniae Ecl8]
 gi|424935278|ref|ZP_18353650.1| Hypothetical protein B819_26218 [Klebsiella pneumoniae subsp.
           pneumoniae KpQ3]
 gi|425079666|ref|ZP_18482763.1| hypothetical protein HMPREF1306_00379 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW2]
 gi|428151251|ref|ZP_18998991.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST512-K30BO]
 gi|428935656|ref|ZP_19009118.1| hypothetical protein MTE1_22506 [Klebsiella pneumoniae JHCK1]
 gi|428942224|ref|ZP_19015231.1| hypothetical protein MTE2_21456 [Klebsiella pneumoniae VA360]
 gi|150958063|gb|ABR80093.1| hypothetical protein KPN_04743 [Klebsiella pneumoniae subsp.
           pneumoniae MGH 78578]
 gi|328537141|gb|EGF63413.1| cupin family protein [Klebsiella sp. MS 92-3]
 gi|339760074|gb|AEJ96294.1| hypothetical protein KPN2242_01850 [Klebsiella pneumoniae KCTC
           2242]
 gi|364516160|gb|AEW59288.1| hypothetical protein KPHS_05900 [Klebsiella pneumoniae subsp.
           pneumoniae HS11286]
 gi|397340844|gb|EJJ34037.1| hypothetical protein KPNIH1_23509 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH1]
 gi|397341148|gb|EJJ34333.1| hypothetical protein KPNIH4_25477 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH4]
 gi|397341569|gb|EJJ34745.1| hypothetical protein KPNIH2_22949 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH2]
 gi|397356985|gb|EJJ49770.1| hypothetical protein KPNIH5_25666 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH5]
 gi|397358285|gb|EJJ51008.1| hypothetical protein KPNIH6_22751 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH6]
 gi|397360460|gb|EJJ53138.1| hypothetical protein KPNIH7_26067 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH7]
 gi|397374189|gb|EJJ66542.1| hypothetical protein KPNIH9_24516 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH9]
 gi|397375655|gb|EJJ67938.1| hypothetical protein KPNIH8_25481 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH8]
 gi|397382054|gb|EJJ74218.1| hypothetical protein KPNIH10_25849 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH10]
 gi|397391272|gb|EJJ83137.1| hypothetical protein KPNIH11_25289 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH11]
 gi|397399719|gb|EJJ91370.1| hypothetical protein KPNIH14_26001 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH14]
 gi|397408451|gb|EJJ99814.1| hypothetical protein KPNIH17_26058 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH17]
 gi|397408650|gb|EJK00005.1| hypothetical protein KPNIH16_25704 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH16]
 gi|397419496|gb|EJK10644.1| hypothetical protein KPNIH18_25796 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH18]
 gi|397426008|gb|EJK16859.1| hypothetical protein KPNIH20_25177 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH20]
 gi|397426460|gb|EJK17278.1| hypothetical protein KPNIH19_25601 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH19]
 gi|397437640|gb|EJK28197.1| hypothetical protein KPNIH21_21462 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH21]
 gi|397443636|gb|EJK33947.1| hypothetical protein KPNIH22_21338 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH22]
 gi|397449067|gb|EJK39217.1| hypothetical protein KPNIH23_22666 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH23]
 gi|405608178|gb|EKB81130.1| hypothetical protein HMPREF1306_00379 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW2]
 gi|407809465|gb|EKF80716.1| Hypothetical protein B819_26218 [Klebsiella pneumoniae subsp.
           pneumoniae KpQ3]
 gi|410120109|emb|CCM89740.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST258-K28BO]
 gi|414706160|emb|CCN27864.1| cupin family protein [Klebsiella pneumoniae subsp. pneumoniae Ecl8]
 gi|426299055|gb|EKV61416.1| hypothetical protein MTE2_21456 [Klebsiella pneumoniae VA360]
 gi|426300027|gb|EKV62331.1| hypothetical protein MTE1_22506 [Klebsiella pneumoniae JHCK1]
 gi|427538802|emb|CCM95129.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
           pneumoniae ST512-K30BO]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|420034244|ref|ZP_14548034.1| hypothetical protein KPNIH12_24353 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH12]
 gi|397392915|gb|EJJ84689.1| hypothetical protein KPNIH12_24353 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH12]
          Length = 329

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 77  GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 133

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 134 ILYIPPGFPHDG--ITHETALNYSVG 157


>gi|238892875|ref|YP_002917609.1| hypothetical protein KP1_0694 [Klebsiella pneumoniae subsp.
           pneumoniae NTUH-K2044]
 gi|402782628|ref|YP_006638174.1| hypothetical protein A79E_4413 [Klebsiella pneumoniae subsp.
           pneumoniae 1084]
 gi|238545191|dbj|BAH61542.1| hypothetical protein KP1_0694 [Klebsiella pneumoniae subsp.
           pneumoniae NTUH-K2044]
 gi|402543478|gb|AFQ67627.1| hypothetical protein A79E_4413 [Klebsiella pneumoniae subsp.
           pneumoniae 1084]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|449060447|ref|ZP_21738104.1| hypothetical protein G057_21039 [Klebsiella pneumoniae hvKP1]
 gi|448873837|gb|EMB08907.1| hypothetical protein G057_21039 [Klebsiella pneumoniae hvKP1]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|425094501|ref|ZP_18497584.1| hypothetical protein HMPREF1308_04820 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW5]
 gi|405609832|gb|EKB82673.1| hypothetical protein HMPREF1308_04820 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW5]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|425078578|ref|ZP_18481681.1| hypothetical protein HMPREF1305_04517 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW1]
 gi|425089210|ref|ZP_18492303.1| hypothetical protein HMPREF1307_04688 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW3]
 gi|405589793|gb|EKB63346.1| hypothetical protein HMPREF1305_04517 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW1]
 gi|405599651|gb|EKB72826.1| hypothetical protein HMPREF1307_04688 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW3]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    +T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201


>gi|332528291|ref|ZP_08404296.1| Cupin, JmjC-type [Hylemonella gracilis ATCC 19624]
 gi|332042311|gb|EGI78632.1| Cupin, JmjC-type [Hylemonella gracilis ATCC 19624]
          Length = 395

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 8/80 (10%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ  GK+ WK+   +   +   +P     NF  E+         LEP
Sbjct: 136 GVGPHFDSYDVFLLQAHGKRHWKIGSQQDLKLKAGVPLKILSNFHPEQ------EFVLEP 189

Query: 174 GDLLYLPRGYIHQASTVTNE 193
           GD+LYLP  Y H    V  E
Sbjct: 190 GDMLYLPPQYAHDGVAVEGE 209


>gi|422293369|gb|EKU20669.1| cupin 4 family protein, partial [Nannochloropsis gaditana CCMP526]
 gi|422293749|gb|EKU21049.1| cupin 4 family protein, partial [Nannochloropsis gaditana CCMP526]
          Length = 120

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 51/107 (47%), Gaps = 19/107 (17%)

Query: 132 EGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
           EG K+W++Y      PR   ++ P       S  +IG P +   LEPGDLLYLP G IH+
Sbjct: 1   EGSKRWRLYDALVEQPRPDMQFKP-------SLHDIGEPYIDFVLEPGDLLYLPSGTIHE 53

Query: 187 ASTVTN----EHSLHVTISVYQKT--AWIDLLEKAMPKALQAAGATD 227
           A    +      SLH+T+ +      +W  LL + +   L    A D
Sbjct: 54  ALNENDGGLENSSLHLTVGIETTVLGSWESLLLEVL-TVLSGGSAED 99


>gi|73540810|ref|YP_295330.1| cupin region [Ralstonia eutropha JMP134]
 gi|72118223|gb|AAZ60486.1| Cupin region [Ralstonia eutropha JMP134]
          Length = 402

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 8/72 (11%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ+ G+++W++     +D    +P     +FS EE         LEP
Sbjct: 144 GVGPHFDSYDVFLLQVSGRRRWRISSQTKLDLVPDMPLKILADFSAEE------EWVLEP 197

Query: 174 GDLLYLPRGYIH 185
           GD+LYLP  Y H
Sbjct: 198 GDMLYLPPQYAH 209


>gi|294636786|ref|ZP_06715124.1| cupin family protein [Edwardsiella tarda ATCC 23685]
 gi|451965002|ref|ZP_21918263.1| hypothetical protein ET1_06_00600 [Edwardsiella tarda NBRC 105688]
 gi|291090001|gb|EFE22562.1| cupin family protein [Edwardsiella tarda ATCC 23685]
 gi|451316120|dbj|GAC63625.1| hypothetical protein ET1_06_00600 [Edwardsiella tarda NBRC 105688]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 76/192 (39%), Gaps = 14/192 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K PL I    KN+ + L       D L    +E + +  +  +E+G  Q        
Sbjct: 16  YWQKRPLVIRHGFKNFIDPLSP-----DELAGLAMEQEVDSRLVCHEEGAWQVSHG---- 66

Query: 62  LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E   S+       +  P   L    + L    V            G  PH
Sbjct: 67  -PFTSYDHLGETNWSLLVQAVDHWHSPAAALMRPFRHLPDWRVDDLMISFSVPGGGVGPH 125

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D  + FI+Q  G+++W+V       ++ P    P+  Q +    I+ V L  GD+LY+P
Sbjct: 126 IDQYDVFIIQGSGRRRWRVGEKGQRRQHCPH---PDLLQVDPFEAIIDVELSAGDILYIP 182

Query: 181 RGYIHQASTVTN 192
             + H+   + N
Sbjct: 183 PHFPHEGYALEN 194


>gi|293397680|ref|ZP_06641886.1| hypothetical protein NGNG_00666 [Neisseria gonorrhoeae F62]
 gi|291611626|gb|EFF40695.1| hypothetical protein NGNG_00666 [Neisseria gonorrhoeae F62]
          Length = 384

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|423118424|ref|ZP_17106108.1| hypothetical protein HMPREF9690_00430 [Klebsiella oxytoca 10-5246]
 gi|376401866|gb|EHT14471.1| hypothetical protein HMPREF9690_00430 [Klebsiella oxytoca 10-5246]
          Length = 373

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H    VT+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201


>gi|240015877|ref|ZP_04722417.1| hypothetical protein NgonFA_01712 [Neisseria gonorrhoeae FA6140]
          Length = 384

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|440288846|ref|YP_007341611.1| hypothetical protein D782_3497 [Enterobacteriaceae bacterium strain
           FGI 57]
 gi|440048368|gb|AGB79426.1| hypothetical protein D782_3497 [Enterobacteriaceae bacterium strain
           FGI 57]
          Length = 373

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|194097751|ref|YP_002000792.1| hypothetical protein NGK_0167 [Neisseria gonorrhoeae NCCP11945]
 gi|193933041|gb|ACF28865.1| Conserved hypothetical protein [Neisseria gonorrhoeae NCCP11945]
          Length = 384

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|240013436|ref|ZP_04720349.1| hypothetical protein NgonD_02099 [Neisseria gonorrhoeae DGI18]
 gi|240120507|ref|ZP_04733469.1| hypothetical protein NgonPI_01784 [Neisseria gonorrhoeae PID24-1]
          Length = 384

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|290475833|ref|YP_003468725.1| RmlC-like domain-containing protein [Xenorhabdus bovienii SS-2004]
 gi|289175158|emb|CBJ81961.1| putative enzyme with RmlC-like domain [Xenorhabdus bovienii
           SS-2004]
          Length = 373

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
           +  G  PH D  + FI+Q +G+++W+V     + ++ P    P+  Q +    I+   + 
Sbjct: 118 SGGGVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHCPH---PDLLQVDPFEAIIDEEML 174

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           PGD+LY+P G+ H+   +  E SL+ ++      A
Sbjct: 175 PGDILYIPPGFPHEGYAI--EDSLNYSVGFRAPNA 207


>gi|268594108|ref|ZP_06128275.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02]
 gi|268596160|ref|ZP_06130327.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19]
 gi|268600636|ref|ZP_06134803.1| conserved hypothetical protein [Neisseria gonorrhoeae PID18]
 gi|268602949|ref|ZP_06137116.1| conserved hypothetical protein [Neisseria gonorrhoeae PID1]
 gi|268683587|ref|ZP_06150449.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-92-679]
 gi|291044550|ref|ZP_06570259.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2]
 gi|385334994|ref|YP_005888941.1| hypothetical protein NGTW08_0102 [Neisseria gonorrhoeae
           TCDC-NG08107]
 gi|268547497|gb|EEZ42915.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02]
 gi|268549948|gb|EEZ44967.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19]
 gi|268584767|gb|EEZ49443.1| conserved hypothetical protein [Neisseria gonorrhoeae PID18]
 gi|268587080|gb|EEZ51756.1| conserved hypothetical protein [Neisseria gonorrhoeae PID1]
 gi|268623871|gb|EEZ56271.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-92-679]
 gi|291011444|gb|EFE03440.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2]
 gi|317163537|gb|ADV07078.1| hypothetical protein NGTW08_0102 [Neisseria gonorrhoeae
           TCDC-NG08107]
          Length = 384

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)

Query: 60  RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
           R +  ++++H + G S+ + +        +  +  +      F GA+T    Y    + +
Sbjct: 82  RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
            +  H+D  + + +QL GKK+W++  P   D  +P Y        +I  P    + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192

Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
            GD+LY+PRG+ H+   +  E + H  +  +      +++ L K  P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238


>gi|94310017|ref|YP_583227.1| hypothetical protein Rmet_1072 [Cupriavidus metallidurans CH34]
 gi|93353869|gb|ABF07958.1| conserved hypothetical protein [Cupriavidus metallidurans CH34]
          Length = 375

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 8/72 (11%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ+ G+++W++     +D    LP     +FS E+         LEP
Sbjct: 117 GVGPHFDSYDVFLLQVSGRRRWRISSQTKLDLIPDLPLKILSDFSAEQ------EWVLEP 170

Query: 174 GDLLYLPRGYIH 185
           GD+LYLP  Y H
Sbjct: 171 GDMLYLPPQYAH 182


>gi|330828855|ref|YP_004391807.1| Cupin superfamily protein [Aeromonas veronii B565]
 gi|406677935|ref|ZP_11085115.1| hypothetical protein HMPREF1170_03323 [Aeromonas veronii AMC35]
 gi|423210459|ref|ZP_17197013.1| hypothetical protein HMPREF1169_02531 [Aeromonas veronii AER397]
 gi|328803991|gb|AEB49190.1| Cupin superfamily protein [Aeromonas veronii B565]
 gi|404616347|gb|EKB13305.1| hypothetical protein HMPREF1169_02531 [Aeromonas veronii AER397]
 gi|404623742|gb|EKB20592.1| hypothetical protein HMPREF1170_03323 [Aeromonas veronii AMC35]
          Length = 376

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 50/95 (52%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI Q  GK+ W+V   + ++E+    +       E    I
Sbjct: 113 SFSTPHA--GVGPHIDNYDVFITQGLGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           + V +EPGD+LY+P G+ H+   +  E S++ ++ 
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSMNFSVG 200


>gi|423200591|ref|ZP_17187171.1| hypothetical protein HMPREF1167_00754 [Aeromonas veronii AER39]
 gi|404619999|gb|EKB16903.1| hypothetical protein HMPREF1167_00754 [Aeromonas veronii AER39]
          Length = 376

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 50/95 (52%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI Q  GK+ W+V   + ++E+    +       E    I
Sbjct: 113 SFSTPHA--GVGPHIDNYDVFITQGLGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           + V +EPGD+LY+P G+ H+   +  E S++ ++ 
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSMNFSVG 200


>gi|227114632|ref|ZP_03828288.1| hypothetical protein PcarbP_16803 [Pectobacterium carotovorum
           subsp. brasiliensis PBR1692]
          Length = 373

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++    S P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDENLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|430809300|ref|ZP_19436415.1| hypothetical protein D769_23578 [Cupriavidus sp. HMR-1]
 gi|429498260|gb|EKZ96772.1| hypothetical protein D769_23578 [Cupriavidus sp. HMR-1]
          Length = 402

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 8/72 (11%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ+ G+++W++     +D    LP     +FS E+         LEP
Sbjct: 144 GVGPHFDSYDVFLLQVSGRRRWRISSQTKLDLIPDLPLKILSDFSAEQ------EWVLEP 197

Query: 174 GDLLYLPRGYIH 185
           GD+LYLP  Y H
Sbjct: 198 GDMLYLPPQYAH 209


>gi|339999064|ref|YP_004729947.1| hypothetical protein SBG_1065 [Salmonella bongori NCTC 12419]
 gi|339512425|emb|CCC30161.1| conserved hypothetical protein [Salmonella bongori NCTC 12419]
          Length = 373

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P+  Q +     +   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLPMRQHCPH---PDLLQVDPFEAFIDEELEPGD 177

Query: 176 LLYLPRGYIHQASTVTN 192
           +LY+P G+ H+   + N
Sbjct: 178 ILYIPPGFPHEGYALEN 194


>gi|423205804|ref|ZP_17192360.1| hypothetical protein HMPREF1168_01995 [Aeromonas veronii AMC34]
 gi|404623195|gb|EKB20047.1| hypothetical protein HMPREF1168_01995 [Aeromonas veronii AMC34]
          Length = 376

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 47/201 (23%), Positives = 88/201 (43%), Gaps = 16/201 (7%)

Query: 2   HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
           +W+K PL I    K++  + I+  ++  +  + V+E +    +T + DG+      E   
Sbjct: 15  YWQKRPLLIKGGFKDF-QDPISPDELAGLAMEEVVESRL---VTRF-DGKW-----EAAH 64

Query: 62  LPQIVWDHYREGCSIRFLNPQTYIKP-LQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
            P   +DH  E      +    +  P + +L    Q + G               G  PH
Sbjct: 65  GPFESYDHLGEENWTVLVQACNHWAPEVNELAVPFQFIPGWRFDDVMVSFSTPHGGVGPH 124

Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
            D+ + FI Q  GK+ W+V   + ++E+    +       E    I+ V +EPGD+LY+P
Sbjct: 125 IDNYDVFITQGLGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAIIDVIMEPGDILYIP 181

Query: 181 RGYIHQASTVTNEHSLHVTIS 201
            G+ H+   +  E S++ ++ 
Sbjct: 182 PGFPHEGYAI--EPSMNFSVG 200


>gi|393759367|ref|ZP_10348183.1| hypothetical protein QWA_09601 [Alcaligenes faecalis subsp.
           faecalis NCIB 8687]
 gi|393162595|gb|EJC62653.1| hypothetical protein QWA_09601 [Alcaligenes faecalis subsp.
           faecalis NCIB 8687]
          Length = 396

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 9/112 (8%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ  GK+ W++   + +     LP     NF  EE        TLEP
Sbjct: 127 GVGPHFDSYDVFLLQAHGKRHWRISQQKDLSLVPDLPLKILRNFQVEE------EFTLEP 180

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTI-SVYQKTAWIDLLEKAMPKALQAAG 224
           GD+LYLP    H    + +  ++ +   S  Q T    L++ A  +   A+G
Sbjct: 181 GDMLYLPPHVAHDGIAIGDCMTISIGFRSPTQATLARGLMDAASDQLAAASG 232


>gi|374370915|ref|ZP_09628904.1| cupin region [Cupriavidus basilensis OR16]
 gi|373097472|gb|EHP38604.1| cupin region [Cupriavidus basilensis OR16]
          Length = 402

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 8/72 (11%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ+ G+++W++     +D    LP     +F+ EE         LEP
Sbjct: 144 GVGPHFDSYDVFLLQVSGRRRWRISSQTRLDLVPDLPLKILSDFTAEE------EWVLEP 197

Query: 174 GDLLYLPRGYIH 185
           GD+LYLP  Y H
Sbjct: 198 GDMLYLPPQYAH 209


>gi|410641503|ref|ZP_11352023.1| cupin 4 protein [Glaciecola chathamensis S18K6]
 gi|410139036|dbj|GAC10210.1| cupin 4 protein [Glaciecola chathamensis S18K6]
          Length = 389

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 3/72 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +G ++W+        EY   Y  P+ SQ     PI+   L+PGD
Sbjct: 121 GVGPHLDQYDVFIIQGKGSRRWQA---GDKGEYPTFYPHPDLSQITQFEPIVDEVLQPGD 177

Query: 176 LLYLPRGYIHQA 187
           ++Y+P G  H  
Sbjct: 178 MIYIPAGCPHNG 189


>gi|395009353|ref|ZP_10392897.1| hypothetical protein PMI14_05695 [Acidovorax sp. CF316]
 gi|394312610|gb|EJE49743.1| hypothetical protein PMI14_05695 [Acidovorax sp. CF316]
          Length = 379

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ +G+++W++   +   + E +P      F  EE         LEP
Sbjct: 129 GVGPHFDSYDVFLLQAQGRRRWRIGRQKDLALKEGIPLKVLAEFEPEE------EFVLEP 182

Query: 174 GDLLYLPRGYIH 185
           GD+LYLP  Y H
Sbjct: 183 GDMLYLPPRYAH 194


>gi|241764335|ref|ZP_04762363.1| Cupin 4 family protein [Acidovorax delafieldii 2AN]
 gi|241366291|gb|EER60834.1| Cupin 4 family protein [Acidovorax delafieldii 2AN]
          Length = 379

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 12/74 (16%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL----PRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F+LQ  G+++W++   R  D+ L    P     +F  E+         L
Sbjct: 129 GVGPHFDSYDVFLLQAHGRRRWRIG--RQKDKRLRGDIPLKVLAHFEPEQ------EFVL 180

Query: 172 EPGDLLYLPRGYIH 185
           EPGD+LYLP GY H
Sbjct: 181 EPGDMLYLPPGYAH 194


>gi|300723606|ref|YP_003712911.1| hypothetical protein XNC1_2711 [Xenorhabdus nematophila ATCC 19061]
 gi|297630128|emb|CBJ90765.1| putative enzyme with RmlC-like domain [Xenorhabdus nematophila ATCC
           19061]
          Length = 373

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +G+++W+V     + ++ P    P+  Q +    I+   + PGD
Sbjct: 121 GVGPHLDQYDVFIIQGKGRRRWRVGEKIPMKQHCPH---PDLLQVDPFEAIIDEEMLPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
           +LY+P G+ H+  ++  E SL+ ++      A
Sbjct: 178 ILYIPPGFPHEGYSI--EDSLNYSVGFRAPNA 207


>gi|332160836|ref|YP_004297413.1| hypothetical protein YE105_C1214 [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|325665066|gb|ADZ41710.1| hypothetical protein YE105_C1214 [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|330863362|emb|CBX73484.1| uncharacterized protein ycfD [Yersinia enterocolitica W22703]
          Length = 373

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     E   PI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFAPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|386309388|ref|YP_006005444.1| hypothetical protein [Yersinia enterocolitica subsp. palearctica
           Y11]
 gi|418240617|ref|ZP_12867155.1| hypothetical protein IOK_04211 [Yersinia enterocolitica subsp.
           palearctica PhRBD_Ye1]
 gi|433550467|ref|ZP_20506511.1| Putative cytoplasmic protein [Yersinia enterocolitica IP 10393]
 gi|318604738|emb|CBY26236.1| hypothetical protein Y11_19451 [Yersinia enterocolitica subsp.
           palearctica Y11]
 gi|351780030|gb|EHB22119.1| hypothetical protein IOK_04211 [Yersinia enterocolitica subsp.
           palearctica PhRBD_Ye1]
 gi|431789602|emb|CCO69551.1| Putative cytoplasmic protein [Yersinia enterocolitica IP 10393]
          Length = 373

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++ P    P     E   PI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFAPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQAST 189
           +LY+P G+ H   T
Sbjct: 178 ILYIPPGFPHDGFT 191


>gi|410615581|ref|ZP_11326600.1| RmlC-like domain-containing protein [Glaciecola psychrophila 170]
 gi|410164994|dbj|GAC40489.1| RmlC-like domain-containing protein [Glaciecola psychrophila 170]
          Length = 382

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/136 (27%), Positives = 61/136 (44%), Gaps = 8/136 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q +G ++W+V LP   D         + SQ     PI+   LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGKGSRRWQVGLPGDFDTV---NLHKDLSQISGFEPIIDQILEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISV---YQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
           ++Y+P  + H    V  E  L+ ++      Q+     L + A+   L     TD E   
Sbjct: 178 IIYIPANHPHNG--VALEECLNYSVGFRAPSQQEMLSSLADFAIDNNLFTQRYTDEEIAH 235

Query: 233 GLPIGYLRYAGLARGK 248
               G +++  ++R K
Sbjct: 236 REFSGEIKHREVSRFK 251


>gi|149376376|ref|ZP_01894139.1| Transcription factor jumonji, jmjC [Marinobacter algicola DG893]
 gi|149359390|gb|EDM47851.1| Transcription factor jumonji, jmjC [Marinobacter algicola DG893]
          Length = 383

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 43/104 (41%), Gaps = 22/104 (21%)

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL---- 167
           P      PHYD  + F+LQ EG ++W        D   PR           GTP+     
Sbjct: 121 PKGGSVGPHYDQYDVFLLQAEGHRRWT--FGGHCDHTSPRVD---------GTPLRILSS 169

Query: 168 -----TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
                TVTL PGD+LYLP G  H    V  +  + ++I     T
Sbjct: 170 WEGEETVTLAPGDMLYLPPGVGHHG--VAEDDCITLSIGFRAPT 211


>gi|444916193|ref|ZP_21236313.1| hypothetical protein D187_08781 [Cystobacter fuscus DSM 2262]
 gi|444712515|gb|ELW53437.1| hypothetical protein D187_08781 [Cystobacter fuscus DSM 2262]
          Length = 455

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 53/130 (40%), Gaps = 38/130 (29%)

Query: 99  FGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR--------------- 143
           F   VG N Y +  +  GF  HYD   A  LQ+EG+K W  Y PR               
Sbjct: 144 FSGRVGFNCYLSA-SGGGFYTHYDARTAMSLQIEGEKTWS-YAPRPALAFPQRNASYALN 201

Query: 144 --------MVDEYLPRYSSPNFSQEEIGTPI-----LTVTLEPGDLLYLPRGYIHQASTV 190
                   +VD ++P          E   PI     L+ TL PGDLL LP G  H A   
Sbjct: 202 GGAVQASKIVDAFVP------IEPWETVIPIPENEFLSETLRPGDLLSLPAGTWHAAK-- 253

Query: 191 TNEHSLHVTI 200
             EHSL + +
Sbjct: 254 AGEHSLALNL 263


>gi|254674227|emb|CBA10011.1| conserved hypothetical protein [Neisseria meningitidis alpha275]
          Length = 387

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 19/115 (16%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT---- 168
           ++  +  H+D  + F +QL GKK W +  P         +  P + Q+    P +T    
Sbjct: 130 SSSSYRNHWDTRDVFAVQLIGKKHWTISAPN--------FDMPLYMQQAKDMPHITPSKT 181

Query: 169 ----VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
               V LE GD+LY+PRG+ H    +  E + H+ I  +      +++ L K +P
Sbjct: 182 VDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYNYMEWLMKKIP 235


>gi|254673854|emb|CBA09627.1| myc induced nuclear antigen [Neisseria meningitidis alpha275]
          Length = 382

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 19/115 (16%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT---- 168
           ++  +  H+D  + F +QL GKK W +  P         +  P + Q+    P +T    
Sbjct: 130 SSSSYRNHWDTRDVFAVQLIGKKHWTISAPN--------FDMPLYMQQAKDMPHITPSKT 181

Query: 169 ----VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
               V LE GD+LY+PRG+ H    +  E + H+ I  +      +++ L K +P
Sbjct: 182 VDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYNYMEWLMKKIP 235


>gi|385339905|ref|YP_005893777.1| cupin family protein [Neisseria meningitidis G2136]
 gi|325198149|gb|ADY93605.1| cupin family protein [Neisseria meningitidis G2136]
          Length = 387

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 19/115 (16%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT---- 168
           ++  +  H+D  + F +QL GKK W +  P         +  P + Q+    P +T    
Sbjct: 130 SSSSYRNHWDTRDVFAVQLIGKKHWTISAPN--------FDMPLYMQQAKDMPHITPSKT 181

Query: 169 ----VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
               V LE GD+LY+PRG+ H    +  E + H+ I  +      +++ L K +P
Sbjct: 182 VDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYNYMEWLMKKIP 235


>gi|421561812|ref|ZP_16007649.1| cupin-like domain protein [Neisseria meningitidis NM2657]
 gi|254670307|emb|CBA05658.1| conserved hypothetical protein [Neisseria meningitidis alpha153]
 gi|402336197|gb|EJU71458.1| cupin-like domain protein [Neisseria meningitidis NM2657]
          Length = 382

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 19/115 (16%)

Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT---- 168
           ++  +  H+D  + F +QL GKK W +  P         +  P + Q+    P +T    
Sbjct: 130 SSSSYRNHWDTRDVFAVQLIGKKHWTISAPN--------FDMPLYMQQAKDMPHITPSKT 181

Query: 169 ----VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
               V LE GD+LY+PRG+ H    +  E + H+ I  +      +++ L K +P
Sbjct: 182 VDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYNYMEWLMKKIP 235


>gi|429094757|ref|ZP_19157281.1| FIG00553994: hypothetical protein [Cronobacter dublinensis 1210]
 gi|426740151|emb|CCJ83394.1| FIG00553994: hypothetical protein [Cronobacter dublinensis 1210]
          Length = 373

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     +   PI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGNKLPMRQFCPH---PALLHVDPFEPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|421080900|ref|ZP_15541816.1| YcfD [Pectobacterium wasabiae CFBP 3304]
 gi|401704462|gb|EJS94669.1| YcfD [Pectobacterium wasabiae CFBP 3304]
          Length = 373

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++    S P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|227327182|ref|ZP_03831206.1| hypothetical protein PcarcW_07626 [Pectobacterium carotovorum
           subsp. carotovorum WPP14]
          Length = 373

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++    S P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|319792853|ref|YP_004154493.1| transcription factor jumonji jmjc domain-containing protein
           [Variovorax paradoxus EPS]
 gi|315595316|gb|ADU36382.1| transcription factor jumonji jmjC domain-containing protein
           [Variovorax paradoxus EPS]
          Length = 380

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 57/118 (48%), Gaps = 15/118 (12%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL----PRYSSPNFSQEEIGTPILTVTL 171
           G   H+D  + F+LQ +G+++W +   +  D  L    P     NF  EE      T  L
Sbjct: 129 GVGAHFDSYDVFLLQAQGRRRWSIG--KQGDLRLQPGVPLKILENFEPEE------TFVL 180

Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
           EPGD+LYLP  Y H+   V ++    +T S+  +++ +  L   +   +  A + DLE
Sbjct: 181 EPGDMLYLPPKYAHEGVAVGDD---CMTFSIGLRSSALGELGADLLARMAQAYSEDLE 235


>gi|403056864|ref|YP_006645081.1| hypothetical protein PCC21_004250 [Pectobacterium carotovorum
           subsp. carotovorum PCC21]
 gi|402804190|gb|AFR01828.1| hypothetical protein PCC21_004250 [Pectobacterium carotovorum
           subsp. carotovorum PCC21]
          Length = 373

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G ++W+V     + ++    S P     +   PI+   LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|409200144|ref|ZP_11228347.1| RmlC-like domain-containing protein [Pseudoalteromonas flavipulchra
           JG1]
          Length = 395

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 6/87 (6%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
           ++ TP    G  PH D  + FI+Q +GK+ W+V      D  L +++ + +  Q E    
Sbjct: 116 SFSTP--GGGVGPHLDQYDVFIIQGQGKRHWRV---GERDNSLKQFARNKSLLQVEQFEA 170

Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
           ++   LEPGD+LY+P    H+   V N
Sbjct: 171 VIDAVLEPGDILYIPPACPHEGYAVEN 197


>gi|145298396|ref|YP_001141237.1| hypothetical protein ASA_1388 [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|418362390|ref|ZP_12963024.1| hypothetical protein IYQ_18781 [Aeromonas salmonicida subsp.
           salmonicida 01-B526]
 gi|142851168|gb|ABO89489.1| conserved hypothetical protein [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|356686400|gb|EHI51003.1| hypothetical protein IYQ_18781 [Aeromonas salmonicida subsp.
           salmonicida 01-B526]
          Length = 376

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 51/95 (53%), Gaps = 7/95 (7%)

Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
           ++ TP    G  PH D+ + FI Q +GK+ W+V   + ++E+    +  +    E    I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLEEFAAHAALLHCEPFEA---I 167

Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
           +   +EPGD+LY+P G+ H+   +  E S++ ++ 
Sbjct: 168 IDAIMEPGDILYIPPGFPHEGYAI--EPSMNFSVG 200


>gi|88798694|ref|ZP_01114277.1| hypothetical protein MED297_15859 [Reinekea blandensis MED297]
 gi|88778457|gb|EAR09649.1| hypothetical protein MED297_15859 [Reinekea blandensis MED297]
          Length = 381

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 47/89 (52%), Gaps = 14/89 (15%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVD------EYLPRYSSPNFSQEEIGTPILTV 169
           G  PH+D  + F++QL G+++WK+   +M D      E +P      F +++        
Sbjct: 134 GVGPHFDQYDVFLVQLAGQREWKI--GQMCDEDSDLVENIPVKVLSAFEEQD------AW 185

Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
            L+PGD+LYLP G  H  +++ +  +L V
Sbjct: 186 VLDPGDVLYLPPGVAHWGTSLGDSMTLSV 214


>gi|326435845|gb|EGD81415.1| hypothetical protein PTSG_11840 [Salpingoeca sp. ATCC 50818]
          Length = 8549

 Score = 48.9 bits (115), Expect = 0.003,   Method: Composition-based stats.
 Identities = 42/174 (24%), Positives = 69/174 (39%), Gaps = 50/174 (28%)

Query: 88   LQQLNASLQELFGCFVGANTY-QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV- 145
            ++ L   ++++FG +  A+ Y  T   ++   PH D  +  ++QL+G K+W   +P    
Sbjct: 4888 VELLRQGVRDVFGTYSTAHVYYATQDESRALLPHTDPYDVLVIQLQGSKRWTTCVPDAYV 4947

Query: 146  ----------------------------DEYLPRYSSPNFSQ-EEI-------------- 162
                                         E L  +++    Q +EI              
Sbjct: 4948 SDSNDDDDDSANERSNANSTAADRPPSEAEVLAGFNAAQLGQLQEIRSQRQQGCTGYQDS 5007

Query: 163  ---GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK-TAWIDLL 212
               G      TL  GD +Y+P+G IH A T   + S H+TIS+ +K  AW D L
Sbjct: 5008 MLRGMRCTNFTLREGDTMYMPKGIIHFALT-EEQGSSHITISLERKGLAWADAL 5060


>gi|429099281|ref|ZP_19161387.1| hypothetical protein BN133_3877 [Cronobacter dublinensis 582]
 gi|426285621|emb|CCJ87500.1| hypothetical protein BN133_3877 [Cronobacter dublinensis 582]
          Length = 373

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
           G  PH D  + FI+Q  G+++W+V     + ++ P    P     +   PI+   L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLAPGD 177

Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
           +LY+P G+ H     T+E +L+ ++ 
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201


>gi|167539693|ref|XP_001751194.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770146|gb|EDQ83994.1| predicted protein [Monosiga brevicollis MX1]
          Length = 254

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 49/105 (46%), Gaps = 20/105 (19%)

Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
           P T     PH D+    ILQL+G+K W V     VD           S EE        T
Sbjct: 61  PATQPAINPHTDNYHILILQLQGEKHWLVC---QVDNAE--------SCEEF-------T 102

Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKA 215
           L PGD+L+LPR   H A T TN  S+H TI  +Q     DL+E A
Sbjct: 103 LYPGDVLFLPRRAGHVAWT-TNVTSVHATIG-FQGVDCGDLVEAA 145


>gi|300691871|ref|YP_003752866.1| hypothetical protein RPSI07_2227 [Ralstonia solanacearum PSI07]
 gi|299078931|emb|CBJ51591.1| conserved protein of unknown function [Ralstonia solanacearum
           PSI07]
          Length = 409

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F+LQ  G+++W++     L  + D  LP     NF+ EE         L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQNDLTLVPD--LPLKILANFTPEE------EFVL 201

Query: 172 EPGDLLYLPRGYIH 185
           EPGD+LYLP  Y H
Sbjct: 202 EPGDMLYLPPHYAH 215


>gi|387813958|ref|YP_005429441.1| hypothetical protein MARHY1541 [Marinobacter hydrocarbonoclasticus
           ATCC 49840]
 gi|381338971|emb|CCG95018.1| conserved hypothetical protein, putative enzyme with RmlC-like
           domain [Marinobacter hydrocarbonoclasticus ATCC 49840]
          Length = 383

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 37/85 (43%), Gaps = 20/85 (23%)

Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL---- 167
           PT     PHYD  + F+LQ +G ++W        D   PR           GTP+     
Sbjct: 121 PTGGSVGPHYDMYDVFLLQAQGHRRWT--FGGHCDHRSPRVE---------GTPLRILSS 169

Query: 168 -----TVTLEPGDLLYLPRGYIHQA 187
                TVTL PGD+LYLP G  H  
Sbjct: 170 WDGEETVTLAPGDMLYLPPGVGHHG 194


>gi|344167058|emb|CCA79251.1| conserved hypothetical protein [blood disease bacterium R229]
          Length = 409

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F+LQ  G+++W++     L  + D  LP     NF+ EE         L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQNDLTLVPD--LPLKILANFTPEE------EFVL 201

Query: 172 EPGDLLYLPRGYIH 185
           EPGD+LYLP  Y H
Sbjct: 202 EPGDMLYLPPHYAH 215


>gi|300704481|ref|YP_003746084.1| hypothetical protein RCFBP_20285 [Ralstonia solanacearum CFBP2957]
 gi|299072145|emb|CBJ43477.1| conserved protein of unknown function [Ralstonia solanacearum
           CFBP2957]
          Length = 409

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F+LQ  G+++W++     L  + D  LP     NF+ EE         L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQDDLTLVPD--LPLKILANFTPEE------EFVL 201

Query: 172 EPGDLLYLPRGYIH 185
           EPGD+LYLP  Y H
Sbjct: 202 EPGDMLYLPPHYAH 215


>gi|424776979|ref|ZP_18203954.1| hypothetical protein C660_09304 [Alcaligenes sp. HPC1271]
 gi|422888019|gb|EKU30413.1| hypothetical protein C660_09304 [Alcaligenes sp. HPC1271]
          Length = 396

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 9/112 (8%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ  GK+ W++   + +     LP     NF  EE        TLEP
Sbjct: 127 GVGPHFDSYDVFLLQAHGKRHWRISQQKDLSLVPDLPLKILRNFQVEE------EFTLEP 180

Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTI-SVYQKTAWIDLLEKAMPKALQAAG 224
           GD+LYLP    H    + +  ++ +   S  Q T    L++ A  +   A+G
Sbjct: 181 GDMLYLPPHVAHDGIAMGDCMTISIGFRSPTQATLARGLMDAASDQLAAASG 232


>gi|207743511|ref|YP_002259903.1| hypothetical protein RSIPO_01690 [Ralstonia solanacearum IPO1609]
 gi|206594909|emb|CAQ61836.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609]
          Length = 385

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F+LQ  G+++W++     L  + D  LP     NF+ EE         L
Sbjct: 126 GVGPHFDSYDVFLLQAHGRRRWRISSQDDLTLVPD--LPLKILANFTPEE------EFVL 177

Query: 172 EPGDLLYLPRGYIH 185
           EPGD+LYLP  Y H
Sbjct: 178 EPGDMLYLPPHYAH 191


>gi|344174288|emb|CCA86078.1| conserved hypothetical protein [Ralstonia syzygii R24]
          Length = 385

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F+LQ  G+++W++     L  + D  LP     NF+ EE         L
Sbjct: 126 GVGPHFDSYDVFLLQAHGRRRWRISSQNDLTLVPD--LPLKILANFTPEE------EFVL 177

Query: 172 EPGDLLYLPRGYIH 185
           EPGD+LYLP  Y H
Sbjct: 178 EPGDMLYLPPHYAH 191


>gi|83748242|ref|ZP_00945268.1| Hypothetical cytosolic protein [Ralstonia solanacearum UW551]
 gi|83725083|gb|EAP72235.1| Hypothetical cytosolic protein [Ralstonia solanacearum UW551]
          Length = 409

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F+LQ  G+++W++     L  + D  LP     NF+ EE         L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQDDLTLVPD--LPLKILANFTPEE------EFVL 201

Query: 172 EPGDLLYLPRGYIH 185
           EPGD+LYLP  Y H
Sbjct: 202 EPGDMLYLPPHYAH 215


>gi|390568842|ref|ZP_10249134.1| cupin 4 family protein [Burkholderia terrae BS001]
 gi|389939191|gb|EIN01028.1| cupin 4 family protein [Burkholderia terrae BS001]
          Length = 380

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 37/72 (51%), Gaps = 8/72 (11%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ+ GK++W++   R   +   LP     NF  EE         LEP
Sbjct: 121 GVGPHFDSYDVFLLQVHGKRRWRIGAQRDLSLQPGLPLKVLQNFEPEE------EWVLEP 174

Query: 174 GDLLYLPRGYIH 185
           GD+LYLP    H
Sbjct: 175 GDMLYLPPHIAH 186


>gi|386333884|ref|YP_006030055.1| hypothetical protein RSPO_c02223 [Ralstonia solanacearum Po82]
 gi|334196334|gb|AEG69519.1| Hypothetical cytosolic protein [Ralstonia solanacearum Po82]
          Length = 409

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
           G  PH+D  + F+LQ  G+++W++     L  + D  LP     NF+ EE         L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQDDLTLVPD--LPLKILANFTPEE------EFVL 201

Query: 172 EPGDLLYLPRGYIH 185
           EPGD+LYLP  Y H
Sbjct: 202 EPGDMLYLPPHYAH 215


>gi|406974719|gb|EKD97706.1| hypothetical protein ACD_23C00766G0001 [uncultured bacterium]
          Length = 379

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 8/72 (11%)

Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPNFSQEEIGTPILTVTLEP 173
           G  PH+D  + F+LQ  G+++W++   + +   + +P      F  E+         LEP
Sbjct: 129 GVGPHFDSYDVFLLQAHGRRRWRIGRQKNLSLRDDIPLKVLAQFEPEQ------EYVLEP 182

Query: 174 GDLLYLPRGYIH 185
           GD+LYLP GY H
Sbjct: 183 GDMLYLPPGYAH 194


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.137    0.404 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,530,763,211
Number of Sequences: 23463169
Number of extensions: 244271924
Number of successful extensions: 609459
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 484
Number of HSP's successfully gapped in prelim test: 1173
Number of HSP's that attempted gapping in prelim test: 606758
Number of HSP's gapped (non-prelim): 1919
length of query: 339
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 196
effective length of database: 9,003,962,200
effective search space: 1764776591200
effective search space used: 1764776591200
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)