BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy13797
(339 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|328792924|ref|XP_395039.4| PREDICTED: lysine-specific demethylase NO66-like [Apis mellifera]
Length = 489
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/338 (55%), Positives = 242/338 (71%), Gaps = 2/338 (0%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WEK P++I R YY L+++ +D ILR++ I F KNIDIT YE+G R+T + GRA
Sbjct: 69 NWEKTPVHIKRNFPKYYKLLMSTPMLDKILRESYILFTKNIDITSYENGVRETHNPIGRA 128
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
+P +VWD+Y GCS+R LNPQTYI L LNA+LQE FGCF+GAN+Y TPP +QGFAPHY
Sbjct: 129 VPTVVWDYYMNGCSVRMLNPQTYIPKLHSLNATLQEFFGCFIGANSYLTPPNSQGFAPHY 188
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAFILQ+EGKK+W++Y PR +EYLPRYSS NFSQ EIG PIL + GDLLY PR
Sbjct: 189 DDIEAFILQIEGKKRWRLYKPRNQNEYLPRYSSENFSQFEIGEPILDTIVNEGDLLYFPR 248
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G IHQ T+ N HSLH+T+SVYQK +W D LEK +P AL AA + EFR+GLPI YLRY
Sbjct: 249 GTIHQGETIDNTHSLHITLSVYQKNSWGDFLEKLLPNALTAAIKENYEFRKGLPIDYLRY 308
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
G + + D+ KE +K LL+KL++Y+D+D+ D M K +HD LPPVLS E
Sbjct: 309 CGYVHSEIQNNSKDKF--KEKIKHLLSKLIDYIDIDEAADLMAKNHIHDFLPPVLSESER 366
Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+CSV ++G +M + G V N +I DT+ RL R++ +R
Sbjct: 367 ECSVTQDGEKMIENGIVNNRVEIEPDTRFRLARSHCIR 404
>gi|340723554|ref|XP_003400154.1| PREDICTED: lysine-specific demethylase NO66-like [Bombus
terrestris]
Length = 492
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 188/340 (55%), Positives = 242/340 (71%), Gaps = 4/340 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++WEK P+++ R YY L+++ +D ILR++ I F KNIDIT YE+G R+T + GR
Sbjct: 71 NNWEKTPVHVKRNFHKYYKLLMSTPMLDKILRESYILFTKNIDITSYENGIRETHNPVGR 130
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A+P +VWD+Y GCS+R LNPQTYI L LNA+LQE FGCFVGAN+Y TPP +QGFAPH
Sbjct: 131 AIPSVVWDYYMNGCSVRMLNPQTYIPKLHSLNATLQEFFGCFVGANSYLTPPNSQGFAPH 190
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAFILQ+EGKK+W++Y PR +EYLPRYSS NFSQ EIG PIL + GDLLY P
Sbjct: 191 YDDIEAFILQIEGKKRWRLYKPRNENEYLPRYSSKNFSQSEIGEPILDTVVNAGDLLYFP 250
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQ T+ N HSLH+T+SVYQK +W D LEK +P AL A +D EFR+GLPI YLR
Sbjct: 251 RGTIHQGETIDNTHSLHITLSVYQKNSWGDFLEKLLPNALTTAIESDSEFRKGLPIDYLR 310
Query: 241 YAGLARGKPVDIQ-ADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
G A DIQ + + KE +K L KL+EY+D+D D M K +HD LPPV+S
Sbjct: 311 QCGFAYS---DIQNSAKDEFKERIKHLFNKLIEYIDIDKAADLMAKNHIHDCLPPVISEN 367
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
E +CS ++G +M + G V N +I DT++RL+R++ +R
Sbjct: 368 EQECSAVQDGEKMIENGIVENRVEIEPDTRIRLLRSHCIR 407
>gi|380022474|ref|XP_003695070.1| PREDICTED: lysine-specific demethylase NO66-like [Apis florea]
Length = 489
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 187/338 (55%), Positives = 241/338 (71%), Gaps = 2/338 (0%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WEK P++I R YY L+++ +D ILR++ I F KNIDIT YE+G R+T + GRA
Sbjct: 69 NWEKTPVHIKRNFPKYYKLLMSTPMLDKILRESYILFTKNIDITSYENGIRETHNPVGRA 128
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
+P +VWD+Y GCS+R LNPQTYI L LNA+LQE FGCF+GAN+Y TPP +QGFAPHY
Sbjct: 129 VPTVVWDYYMNGCSVRMLNPQTYIPKLHSLNATLQEFFGCFIGANSYLTPPNSQGFAPHY 188
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAFILQ+EGKK+W++Y PR +EYLPRYSS NFSQ EIG PIL + GDLLY PR
Sbjct: 189 DDIEAFILQIEGKKRWRLYKPRNQNEYLPRYSSENFSQFEIGEPILDTIVNEGDLLYFPR 248
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G IHQ T+ N HSLH+T+SVYQK +W D LEK +P AL A + EFR+GLPI YLRY
Sbjct: 249 GTIHQGETIDNTHSLHITLSVYQKNSWGDFLEKLLPNALTTAIKENYEFRKGLPIDYLRY 308
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
G + + D+ KE +K LL+KL++Y+D+D+ D M K +HD LPPVLS E
Sbjct: 309 CGYVHSEIQNNSKDKF--KEKIKHLLSKLIDYIDIDEAADLMAKNHIHDFLPPVLSESER 366
Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+CSV ++G +M + G V N +I DT+ RL R++ +R
Sbjct: 367 ECSVTQDGEKMIENGIVNNRVEIEPDTRFRLARSHCIR 404
>gi|307169522|gb|EFN62164.1| JmjC domain-containing protein CG2982 [Camponotus floridanus]
Length = 447
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 187/337 (55%), Positives = 241/337 (71%), Gaps = 3/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE P++I R NYY ++++ +D ILRDN I F KNIDIT Y DG R+T + GRAL
Sbjct: 29 WESTPIHIKRNNSNYYKWIVSTSMLDKILRDNYILFTKNIDITSYVDGVRETHNPAGRAL 88
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P +VWD+Y CS+R LNPQTYI L +NA+LQE FGCFVGAN Y TPP +QGFAPHYD
Sbjct: 89 PSVVWDYYLNDCSVRMLNPQTYIPQLHLVNATLQEFFGCFVGANLYLTPPNSQGFAPHYD 148
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAFILQ+EGKK+W++Y+P ++EYL R SS NF Q EIG PIL + GDLLYLPRG
Sbjct: 149 DIEAFILQIEGKKRWRLYMPPSINEYLARCSSKNFDQSEIGEPILDTIVNAGDLLYLPRG 208
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQ +T+ + HSLHVTIS+YQ+++W DLLEK +P+AL+ A TD +FRRGLP+GY RY
Sbjct: 209 TIHQGTTIDDTHSLHVTISLYQRSSWCDLLEKVLPQALKRATETDWKFRRGLPVGYFRYT 268
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+A + R+A KE++KD++ L+ Y+DLD D M K +HD LPP+LS E +
Sbjct: 269 GIAHS--TNKTEHRMAFKEDVKDMVTNLINYIDLDSAADLMAKDHIHDFLPPLLSKIEEE 326
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
CSV ++G M G V N IT +TK+RL+R + +R
Sbjct: 327 CSVAKDGEIMIN-GIVKNRVNITLETKIRLLRIHCIR 362
>gi|350426682|ref|XP_003494511.1| PREDICTED: lysine-specific demethylase NO66-like [Bombus impatiens]
Length = 492
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 186/340 (54%), Positives = 240/340 (70%), Gaps = 4/340 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++WEK P+++ R YY L+++ +D ILR++ I F KNIDIT YE+ R+T + GR
Sbjct: 71 NNWEKTPVHVKRNFHKYYKLLMSTPMLDKILRESHILFTKNIDITSYENDVRETHNPVGR 130
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A+P +VWD+Y GCS+R LNPQTYI L LNA+LQE FGCFVGAN+Y TPP +QGFAPH
Sbjct: 131 AIPSVVWDYYMNGCSVRMLNPQTYIPKLHSLNATLQEFFGCFVGANSYLTPPNSQGFAPH 190
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAFILQ+EGKK+W++Y PR +EYLPRYSS NFSQ EIG PIL + GDLLY P
Sbjct: 191 YDDIEAFILQIEGKKRWRLYKPRNENEYLPRYSSKNFSQSEIGEPILDTVVNAGDLLYFP 250
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQ T+ N HSLH+T+SVYQK +W D LEK +P AL A D EFR+GLPI YLR
Sbjct: 251 RGTIHQGETIDNTHSLHITLSVYQKNSWGDFLEKLLPNALTTAIEADSEFRKGLPIDYLR 310
Query: 241 YAGLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
G A DIQ + + KE +K L KL+EY+D+D D M K +HD LPP++S
Sbjct: 311 QCGFAYS---DIQNNAKDEFKERIKHLFNKLIEYIDIDKAADLMAKNHIHDCLPPIISES 367
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
E +CS ++G +M + G V N +I DT++RL+R++ +R
Sbjct: 368 EQECSAVQDGEKMIENGIVENRVEIEPDTRIRLLRSHCIR 407
>gi|383850995|ref|XP_003701049.1| PREDICTED: lysine-specific demethylase NO66-like [Megachile
rotundata]
Length = 615
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 187/340 (55%), Positives = 243/340 (71%), Gaps = 4/340 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++WEK P++I R NYY L+++ +D ILR++ I F KNIDIT YE+G R+T + GR
Sbjct: 194 NNWEKTPVHIKRNFPNYYKLLMSTPMLDKILRESYIMFTKNIDITSYENGVRETHNPAGR 253
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A+P +VWD+Y GCS+R LNPQTYI L LNA+LQE FGCFVGAN+Y TPP +QGFAPH
Sbjct: 254 AVPSVVWDYYMNGCSVRMLNPQTYISKLHSLNATLQEFFGCFVGANSYLTPPNSQGFAPH 313
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAFILQ+EGKK+W++Y PR EYLPRYSS NFS+ EIG PIL + GDLLY P
Sbjct: 314 YDDIEAFILQVEGKKRWRLYKPRNESEYLPRYSSENFSEAEIGEPILDTIVNAGDLLYFP 373
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQ T+ + HSLH+T+SVYQK +W D LEK +P+AL A TD EFR+GLP+ YLR
Sbjct: 374 RGTIHQGETLDDTHSLHITLSVYQKNSWGDYLEKLLPEALNTAIRTDSEFRKGLPLDYLR 433
Query: 241 YAGLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
+ G + Q++ R K +KDL KL+E++D+D D M K ++D LPPVLS
Sbjct: 434 HNGHVYSES---QSNGREEFKAKVKDLFTKLIEHIDIDKAADMMAKNHVYDFLPPVLSES 490
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
E +CSV ++G +M G V N +I DT+VRL+R++ +R
Sbjct: 491 EQECSVVQDGEQMVANGIVKNRVEIEPDTRVRLLRSHCIR 530
>gi|322793511|gb|EFZ17039.1| hypothetical protein SINV_11691 [Solenopsis invicta]
Length = 494
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 173/338 (51%), Positives = 231/338 (68%), Gaps = 4/338 (1%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WEK P+Y+ R YY ++++ +D +LR+N + F KNIDIT Y +G R+T + GRA
Sbjct: 80 NWEKAPMYVRRDMPEYYERIVSTDLLDKVLRNNHLLFTKNIDITSYSNGVRETHNPSGRA 139
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
+P IVWD+Y CS+R LNPQT+I + LNA+LQE FGCFVGAN Y TPP +QGFAPHY
Sbjct: 140 VPSIVWDYYNNKCSVRMLNPQTFIPKIHALNATLQEFFGCFVGANLYLTPPNSQGFAPHY 199
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DD+EAFILQ+EG+K+W++Y P +E+LPRYSS NF Q EIG PIL ++ GDLLYLPR
Sbjct: 200 DDVEAFILQIEGQKRWRLYKPLSENEFLPRYSSKNFDQSEIGEPILDTIVKAGDLLYLPR 259
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G IHQ T+ N HSLHVT+SVYQK +W D LEK +P+AL A D FR GLP+ YL Y
Sbjct: 260 GTIHQGVTLDNVHSLHVTLSVYQKNSWCDFLEKVLPQALNRAAENDSRFREGLPLNYLTY 319
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
G A + + K +KDL+ ++E++D+D+ D M K +HD LPP L E
Sbjct: 320 VGTAHNE----NNQKNIFKNKIKDLITLMIEHIDIDNAADLMAKNQIHDFLPPFLPKNER 375
Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ +V E+G M+ G V N +IT +TK+RL+R + +R
Sbjct: 376 KRTVAEDGEIMTSNGVVKNYPEITLETKIRLLRWHCIR 413
>gi|156540940|ref|XP_001599259.1| PREDICTED: lysine-specific demethylase NO66-like [Nasonia
vitripennis]
Length = 448
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 173/339 (51%), Positives = 226/339 (66%), Gaps = 2/339 (0%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WEK PL+I R + YY +L+T+ +D ILR N I F KNIDIT + + +R+T + GRA
Sbjct: 25 NWEKNPLHIKRNKPKYYQSLMTTPMLDQILRKNYILFTKNIDITSFTNNKRETHNPHGRA 84
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
LP IVWD+Y GCS+R LNPQT+I L LNA LQE FGCFVG+N+Y TPP +QGFAPHY
Sbjct: 85 LPSIVWDYYANGCSVRLLNPQTFIPKLHTLNAILQEFFGCFVGSNSYLTPPNSQGFAPHY 144
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAFILQ+EGKK+W++Y P+ +E LPRYSS NFSQ+++G PIL +E GD+LY PR
Sbjct: 145 DDIEAFILQIEGKKRWRLYKPKTDNEILPRYSSLNFSQQDLGEPILNTVIEAGDILYFPR 204
Query: 182 GYIHQASTVT-NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G IHQ T +EHSLH+T+SVYQ+ +W D LEK +P AL A D FR+GLPI YLR
Sbjct: 205 GTIHQGDTFGFDEHSLHITLSVYQRNSWGDFLEKLIPAALNNAIKNDSRFRKGLPINYLR 264
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
Y G + D R + LL L + +D+D D M +HD LPPV + E
Sbjct: 265 YIGSTYTENFDCPI-RKKFSNHTISLLTILSKCIDIDKAADLMAVNHIHDFLPPVFNQNE 323
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
CS+++ G +M + G V N +I +T ++L R + R
Sbjct: 324 RLCSIYDGGEKMIKNGAVINRVEIEPNTLIKLSRGHCAR 362
>gi|307209000|gb|EFN86200.1| JmjC domain-containing protein CG2982 [Harpegnathos saltator]
Length = 621
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 172/339 (50%), Positives = 231/339 (68%), Gaps = 3/339 (0%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
+++EK P++I R + +YY +L+++ +D+ILR+ + F KNIDIT Y +G R+T + GR
Sbjct: 183 NNFEKRPVHIQRNKPSYYLSLVSTPLLDEILREKNVLFTKNIDITSYTNGVRETYNPTGR 242
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A+P VWD+Y CS+R LNPQT+I + LNA+LQE FGCFVG+N Y TPP +QGFAPH
Sbjct: 243 AVPNAVWDYYMNKCSVRMLNPQTFIHKIYMLNATLQEYFGCFVGSNLYLTPPGSQGFAPH 302
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF+LQ+EGKK+W++Y P YL RYSS NF + EIG PIL L+ GDLLY P
Sbjct: 303 YDDIEAFVLQVEGKKRWRLYRPPN-KSYLARYSSRNFDESEIGEPILDTILQAGDLLYFP 361
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQA T + HSLH+T+SVYQK +W D LEK +P L+ A ++ FR GLP+ YL+
Sbjct: 362 RGTIHQAETADDTHSLHITLSVYQKNSWYDFLEKLLPLTLKRAAESNYNFRTGLPLDYLK 421
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
Y G A + QA R +E +K L++Y+D+D D M K +HD LPP L+ EE
Sbjct: 422 YVGTAHSTS-NGQA-RKIFEEKIKTFFGLLIDYIDVDSAADLMAKSHIHDFLPPALTLEE 479
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
QCSV +G M+Q G V QI DT +RL+R++ +R
Sbjct: 480 QQCSVARDGDIMTQNGNVKRCVQIESDTMIRLIRSHCIR 518
>gi|157123061|ref|XP_001653808.1| hypothetical protein AaeL_AAEL009382 [Aedes aegypti]
gi|122105602|sp|Q16W06.1|NO66_AEDAE RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|108874534|gb|EAT38759.1| AAEL009382-PA [Aedes aegypti]
Length = 635
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 172/339 (50%), Positives = 225/339 (66%), Gaps = 2/339 (0%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++WEK+PL I RK +YY+NL++ KID++LR N IE+ KNID+T Y +GQR+T + +GR
Sbjct: 222 NYWEKKPLLIQRKNPSYYSNLLSRAKIDEMLRQNNIEYTKNIDVTSYREGQRETHNPDGR 281
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
LP +W Y EGCSIR LNPQTY+ + ++N LQE F C GAN Y TPP +QGFAPH
Sbjct: 282 VLPPDMWSFYEEGCSIRMLNPQTYLPGVYEMNVKLQEFFHCMTGANFYLTPPNSQGFAPH 341
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF+LQ+EG+K WK+Y PR E L R SSPNF+QEEIGTPIL V LEPGDLLY P
Sbjct: 342 YDDIEAFVLQVEGRKHWKLYSPREPAEMLARVSSPNFTQEEIGTPILEVVLEPGDLLYFP 401
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQASTV HSLHVT+SVYQK W DLLE P AL A T + RRG+P+ +
Sbjct: 402 RGIIHQASTVPGHHSLHVTMSVYQKNCWADLLELFFPHALAQAAETHFDLRRGIPLNLHQ 461
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
+ G+ R + ++K L+ K+ +D VD++ K+ HDALPP++S E
Sbjct: 462 HFGIVHSD--SETPARKQLISHIKSLVDKVFSEDAIDSAVDQLAKRFQHDALPPLISNTE 519
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+V+ + + G V T+++ VRL+R N +R
Sbjct: 520 QSTTVYGSSFSHNPDGTVSLRVPFTENSTVRLLRCNVLR 558
>gi|189236258|ref|XP_974397.2| PREDICTED: similar to CG2982 CG2982-PB [Tribolium castaneum]
gi|270005824|gb|EFA02272.1| hypothetical protein TcasGA2_TC007936 [Tribolium castaneum]
Length = 568
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 166/338 (49%), Positives = 236/338 (69%), Gaps = 7/338 (2%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WE++PLYI R ++YY +++ S +D ILR+N + F +N+D+ YE+G++Q + EGRA
Sbjct: 151 YWEQKPLYIKRGNRSYYTHILDSSSLDKILRNNSLFFTRNVDVVTYENGEKQVFNQEGRA 210
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
P +WD+Y GCSIR LNPQTY + L A+LQE FG VGAN Y TPP +QGFAPHY
Sbjct: 211 TPSALWDYYGNGCSIRVLNPQTYNHKVHLLLATLQEYFGTMVGANVYLTPPGSQGFAPHY 270
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF++QLEG+K WK+Y P+ D L R+SSPNF +E++G P + +TL G+LLY PR
Sbjct: 271 DDIEAFVVQLEGRKHWKLYQPKSED-VLARFSSPNFKREDLGEPFMELTLNAGELLYFPR 329
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G IH+ T + HSLH+T+SVYQ+T+++DLLE +PKAL+ A +D+EFR+GLP+ YL+
Sbjct: 330 GTIHEGRTDEDSHSLHITVSVYQQTSYVDLLEHILPKALKKAADSDVEFRKGLPLNYLKD 389
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
GLA G +R + +KDL+ L+ YVD+D D++GK+ MHD++PPVL+ E
Sbjct: 390 FGLAAG-----SKNRKFVTSKIKDLMNSLINYVDIDSAADQLGKKFMHDSMPPVLTKSEA 444
Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ S +G + + G V N +I DT+VRL+R +R
Sbjct: 445 ERSSKADGPVL-KDGVVKNRVEIDLDTRVRLLRYYCIR 481
>gi|170044529|ref|XP_001849897.1| nucleolar protein 66 [Culex quinquefasciatus]
gi|284433498|sp|B0WMG3.1|NO66_CULQU RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|167867637|gb|EDS31020.1| nucleolar protein 66 [Culex quinquefasciatus]
Length = 648
Score = 353 bits (905), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 170/337 (50%), Positives = 221/337 (65%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+P + R YY NL++ KID++LR+N IE+ KN+D+T Y +G R+T + +GRAL
Sbjct: 237 WEKKPFLVQRNDPTYYANLLSRGKIDEMLRNNNIEYTKNLDVTSYREGVRETHNPDGRAL 296
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P VW Y EGCSIR LNPQTY+ + ++N LQE F C G+N Y TPP +QGFAPHYD
Sbjct: 297 PPDVWAFYEEGCSIRMLNPQTYLPGVYEMNVKLQEFFHCMTGSNFYLTPPNSQGFAPHYD 356
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQ+EG+K WK+Y PR E L R SSPNF+QEEIG PIL VTLEPGDLLY PRG
Sbjct: 357 DIEAFVLQVEGRKHWKLYSPRTASEVLARVSSPNFTQEEIGVPILEVTLEPGDLLYFPRG 416
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQASTV HSLHVT+SVYQK +W DLLE +P AL A LE RRG+P ++
Sbjct: 417 IIHQASTVPGHHSLHVTMSVYQKNSWADLLELYLPHALSQAAENHLELRRGIPQDLHQHF 476
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+ + R + + +K L+ K+ +D VD++ K+ HDALPP+L+ +E
Sbjct: 477 GIVHSD--NETPTRKDLIKKIKSLVDKIFSEEAIDVAVDQLAKRFQHDALPPLLTDQERA 534
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ + + G V T T+ T +RL+R N VR
Sbjct: 535 QTAYGANYAFNPDGTVPLQTAFTERTTIRLLRRNIVR 571
>gi|148234196|ref|NP_001082857.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
NO66 [Danio rerio]
gi|284433499|sp|A3KP59.1|NO66_DANRE RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|126631921|gb|AAI34173.1| Zgc:162967 protein [Danio rerio]
Length = 544
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 160/339 (47%), Positives = 225/339 (66%), Gaps = 2/339 (0%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+WEK+P+ I R+ +YY L ++ + D ILR++ +++ N+D+T Y +G+R+T + GR
Sbjct: 124 DNWEKKPILIQRQNADYYKGLFSTAEFDRILRNDDVQYGVNLDVTSYTNGKRETHNPPGR 183
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
ALP VWD Y GCSIR LNPQ + + Q+ + LQE FG GAN Y TPP QGFAPH
Sbjct: 184 ALPYTVWDFYESGCSIRMLNPQAFSSTVWQVLSVLQEKFGSMAGANVYLTPPGTQGFAPH 243
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+DDIEAF++QLEG+K W+VY PR DE L SSPNFSQ+EIG P++ V LE GDLLY P
Sbjct: 244 FDDIEAFVVQLEGRKHWRVYNPRCEDEVLSLVSSPNFSQDEIGEPVMDVVLEAGDLLYFP 303
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG++HQ + + HSLH+TIS YQ+ +W DL+ K MP AL+ A D+EFR+GLP+ YL+
Sbjct: 304 RGFVHQGDCLPDAHSLHITISSYQRNSWGDLMLKLMPAALEVAMEEDVEFRKGLPLDYLQ 363
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
Y G+ + D + DR +++ L+ KLV + +D VD+ K +HD LPP+L+ EE
Sbjct: 364 YMGVQNSEKEDPRRDRFMA--HIQGLMKKLVSFAPVDAAVDQKAKDFLHDCLPPLLTAEE 421
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ R + + A ++ T+++LVRA A R
Sbjct: 422 KAGSVYGAPARWGDSEALDVAVELKSQTRIKLVRAGAAR 460
>gi|71894875|ref|NP_001026367.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
NO66 [Gallus gallus]
gi|75571463|sp|Q5ZMM1.1|NO66_CHICK RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|53127257|emb|CAG31022.1| hypothetical protein RCJMB04_1k4 [Gallus gallus]
Length = 601
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 167/338 (49%), Positives = 216/338 (63%), Gaps = 2/338 (0%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HWE+ PL + R YY L ++ D ILR + F ++D+T Y +G R+T + GRA
Sbjct: 182 HWERAPLLVQRGDPGYYAGLFSTADFDAILRSGDVHFGTHLDVTSYAEGVRETHNPVGRA 241
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
LP +VWD Y+ GCS+R L+PQ + + + LQE FG GANTY TPP QGFAPHY
Sbjct: 242 LPAVVWDFYQNGCSLRLLSPQAFSTTVWHFLSILQEHFGSMAGANTYLTPPGTQGFAPHY 301
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF+LQLEGKK W+VY PR E LP++SS N +Q E+G P+L V LE GDLLY PR
Sbjct: 302 DDIEAFVLQLEGKKHWRVYGPRTSSEALPQFSSANLTQAELGEPLLEVVLEAGDLLYFPR 361
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G+IHQA + + HSLH+T+S YQ+ +W D LEK +P ALQ A DLE+R+GLP+ L Y
Sbjct: 362 GFIHQADCLPDAHSLHITVSSYQRNSWGDFLEKLLPAALQMALEEDLEYRQGLPMDCLGY 421
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
G+A VD A R A E ++ L+ KLV+Y +D +D+ K +HD LPPVL+ E
Sbjct: 422 MGVANSDAVD--ARRTAFVEKVQHLIKKLVDYAPIDAAMDQRAKSFLHDCLPPVLTQSEK 479
Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
Q SV+ R G QITKDT+VRL+ VR
Sbjct: 480 QLSVYGFPARWQDGGPRNVDIQITKDTEVRLLHHGVVR 517
>gi|327259545|ref|XP_003214597.1| PREDICTED: lysine-specific demethylase NO66-like [Anolis
carolinensis]
Length = 618
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 169/339 (49%), Positives = 222/339 (65%), Gaps = 4/339 (1%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WEK+PL I R NYY +L ++ + D ILRD ++F N+D+T Y+DG+R+T + GRA
Sbjct: 199 YWEKKPLLIQRHNPNYYRDLFSTAEFDAILRDQDVQFGINLDVTSYQDGKRETYNPVGRA 258
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
LP +VWD Y+ GCS+R LNPQ + + + LQE FG VGANTY TP QGFAPHY
Sbjct: 259 LPAVVWDFYQNGCSLRMLNPQAFSSTVWHFLSILQEQFGSMVGANTYLTPAGTQGFAPHY 318
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF++QLEGKK W+VY PR E LP +SS NF Q EIG PIL LE GD+LY PR
Sbjct: 319 DDIEAFVIQLEGKKHWRVYSPRREAEVLPPFSSRNFDQAEIGEPILETVLEAGDMLYFPR 378
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G+IHQ + N HSLH+T+S YQ+ +W DLLEK +P ALQ A D+E+R+GLP+ YL Y
Sbjct: 379 GFIHQGDCLPNAHSLHITVSSYQRNSWGDLLEKLLPAALQMAIEEDVEYRQGLPMDYLEY 438
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
G+ VD + R + + + L+ KL +YV +D VD+ K +HD LPP+L+ E
Sbjct: 439 MGVLNSDRVDPR--RASFMQKVSRLITKLTDYVPVDAAVDQKAKCFLHDCLPPMLTESEK 496
Query: 302 QCSVFENGLRMSQTGEVYNA-TQITKDTKVRLVRANAVR 339
SV+ + R + G V NA Q+ T VRL+R VR
Sbjct: 497 ALSVYGHPARW-EDGSVRNADVQVKGTTWVRLLRHGIVR 534
>gi|301608699|ref|XP_002933911.1| PREDICTED: lysine-specific demethylase NO66-like [Xenopus
(Silurana) tropicalis]
Length = 584
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 164/338 (48%), Positives = 225/338 (66%), Gaps = 4/338 (1%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE++P I R YY+ L ++ + D ILR++ ++F N+D+T Y DG R+T + GRAL
Sbjct: 165 WEQKPALIKRDNHRYYDGLFSTTEFDRILREDNVKFGVNLDVTSYTDGTRETHNPPGRAL 224
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P +VWD+++ GCS+R LNPQ++ + + + + LQELFG VGANTY TP QGFAPHYD
Sbjct: 225 PLVVWDYFKNGCSLRLLNPQSFCRTVWNVLSVLQELFGSMVGANTYLTPAGTQGFAPHYD 284
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++QLEGKK W+VY PR E+LP++SS NF +IG PIL LE GDLLY PRG
Sbjct: 285 DIEAFVIQLEGKKHWRVYNPRNNSEFLPQFSSVNFRDRDIGEPILETVLEAGDLLYFPRG 344
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQ + + HSLH+TIS YQ+ +W DLLEK +P ALQ A D+EFR+GLP+ YL Y
Sbjct: 345 FIHQGDCLPDAHSLHITISSYQRNSWADLLEKVLPAALQVAVEEDVEFRKGLPLDYLEYM 404
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+ D + D A +N++ L+ KL +Y +D VD+ K +HD LPPVL+P E
Sbjct: 405 GVQHADLKDPRRD--AFIQNIQSLIKKLSDYAPVDAAVDQKAKNFLHDCLPPVLTPAEKT 462
Query: 303 CSVFENGLRMSQTGEVYNAT-QITKDTKVRLVRANAVR 339
SV + + +G+V + T QI +TK+ L+R R
Sbjct: 463 HSVHGAPVTLD-SGKVKDDTLQIELETKICLLRGGIAR 499
>gi|346471459|gb|AEO35574.1| hypothetical protein [Amblyomma maculatum]
Length = 532
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 167/336 (49%), Positives = 231/336 (68%), Gaps = 6/336 (1%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE++P+ + R Q+++Y L++ +++D ILR+ + F +NID+T YE G+R+T + +GRA
Sbjct: 114 WEQKPMLLRRHQRDFYQGLMSCEQLDRILRERSLFFTENIDLTTYEGGKRETHNPDGRAH 173
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
+VWD +++GCS+R LNPQTY + + +L + LQE FG FVGAN Y TP +QGFAPHYD
Sbjct: 174 AAVVWDAFQKGCSVRLLNPQTYSRAVWRLCSMLQEFFGSFVGANLYLTPAGSQGFAPHYD 233
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++QLEG+K W++Y PR E LPR+SS NF +E+G PIL LEPGDLLY PRG
Sbjct: 234 DIEAFVVQLEGRKCWRLYAPRDPSEELPRFSSGNFGPDEVGNPILEAVLEPGDLLYFPRG 293
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQA T + HSLH+T+S Q+ W DL EK +P+A+Q A D+E+R+ LP GYL Y
Sbjct: 294 IIHQAYTPEDVHSLHLTLSTCQRNTWGDLFEKMIPRAVQLAIEEDVEYRKSLPRGYLNYM 353
Query: 243 GLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
G+A ++ +D R A E + L+ KLV Y +D VD+ GK +H+ALPP+LS EE
Sbjct: 354 GIANS---EVASDAREAFLEKVSHLVRKLVNYCPVDAAVDQCGKAHLHEALPPLLSQEER 410
Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANA 337
SV +G R Q G+V + ++ DT VRL R NA
Sbjct: 411 SRSV-HSGERW-QYGQVVDVQELDPDTHVRLARRNA 444
>gi|241642121|ref|XP_002409388.1| nucleolar protein, putative [Ixodes scapularis]
gi|215501349|gb|EEC10843.1| nucleolar protein, putative [Ixodes scapularis]
Length = 457
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 166/337 (49%), Positives = 226/337 (67%), Gaps = 4/337 (1%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE++PL + R Q +Y +L++ +++D ILR+ + F +N+D+T Y G R+T + GRA
Sbjct: 35 WEQKPLLVKRHQPGFYGDLMSCQQLDRILRERQLYFTENVDLTTYRGGVRETHNPPGRAH 94
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
+VWD +++GCS+R LNPQTY + L +L A+LQE FG VGAN Y TP +QGFAPHYD
Sbjct: 95 TAVVWDSFQQGCSVRLLNPQTYSRQLWRLCATLQEFFGSMVGANLYLTPAGSQGFAPHYD 154
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR E LPR+SS NF++EE+G P+LT LEPGDLLY PRG
Sbjct: 155 DIEAFVLQLEGQKLWRVYAPRDPSEELPRFSSGNFTKEEVGEPLLTTLLEPGDLLYFPRG 214
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA T HSLH+T+S Q+ W DLLEKA+P ALQ A D++FRR LP Y
Sbjct: 215 FIHQACTGETSHSLHLTLSTCQRNTWGDLLEKALPTALQLALEEDVDFRRSLPRDYPSVM 274
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+A + + R A + L +L +++ +D D+M K +H+ALPP+LS +E +
Sbjct: 275 GVANADLANPR--REAFLAKMHSLFERLWDFLPVDAAADQMVKAHLHEALPPLLSQQERE 332
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
CSV +G G V + ++ DT VRLVR NAVR
Sbjct: 333 CSV--HGGERWHEGRVVDVQELEPDTAVRLVRRNAVR 367
>gi|390349142|ref|XP_782527.3| PREDICTED: lysine-specific demethylase NO66-like
[Strongylocentrotus purpuratus]
Length = 639
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 161/325 (49%), Positives = 220/325 (67%), Gaps = 3/325 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
+E++PL++ R + Y+ ++ +SK++ +IL++N ++F +NID+T Y DG+R+T + GRA
Sbjct: 199 FERKPLFLKRHKPGYFTDIFSSKELSNILKENDVQFTRNIDVTTYTDGKRETHNPTGRAQ 258
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
PQ+VWD+Y GCS+R LNPQTY + QL A+LQE FGCFVGAN Y TPP QGFAPHYD
Sbjct: 259 PQVVWDYYNNGCSVRVLNPQTYSTRVWQLLAALQEFFGCFVGANIYLTPPGTQGFAPHYD 318
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEGKK WK+Y R E LPR+SS NF+ +IG PIL TLEPGDLLY PRG
Sbjct: 319 DIEAFVLQLEGKKHWKLYNQRSPAEVLPRFSSSNFTDADIGQPILDTTLEPGDLLYFPRG 378
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQAST + HSLH+TIS QK W DL+E + +AL +++FR+ LP YL Y
Sbjct: 379 VIHQASTPSETHSLHITISACQKNTWGDLMEHVLTRALHVVREENIDFRKSLPTDYLNYM 438
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+A + +D + + MK+ + DL +KL++ +D D+M + H +LPPVL E
Sbjct: 439 GVAH-EDLDDERRQPFMKK-IGDLFSKLLKSAPIDAAADQMSLEFFHTSLPPVLEEAEES 496
Query: 303 CSVFENGLRMSQTGEVYNATQITKD 327
CSVF + G+V Q+ +D
Sbjct: 497 CSVFGESASC-RNGKVSGIVQLGED 520
>gi|432944965|ref|XP_004083470.1| PREDICTED: lysine-specific demethylase NO66-like [Oryzias latipes]
Length = 544
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 161/337 (47%), Positives = 214/337 (63%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+P+Y+ R+ +YY L ++ + D ILR+ ++F N+D+T Y +G+R+T + GRAL
Sbjct: 127 WEKKPIYVQRRNPDYYKGLFSTAEFDRILREENVQFGVNLDVTSYTNGKRETHNPPGRAL 186
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P VWD Y GCS+R LNPQ + + + + LQE FG GAN Y TPP QGFAPHYD
Sbjct: 187 PFTVWDFYESGCSLRLLNPQAFSHTVWNVLSVLQEQFGSMAGANVYLTPPGTQGFAPHYD 246
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++QLEGKK W+VY PR +E LP SSPNF Q +IG PIL V LE GDLLY PRG
Sbjct: 247 DIEAFVVQLEGKKHWRVYSPRSENEVLPVLSSPNFDQSDIGKPILDVVLEAGDLLYFPRG 306
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQ + + HSLH+TIS +QK +W DL++K + AL+ A D+EFRRGLP+ YL Y
Sbjct: 307 FIHQGKCLPDAHSLHITISSFQKNSWGDLMQKVVTAALEIAMEEDVEFRRGLPLDYLTYM 366
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+ D + R+ ++ L+ KL Y +D VD+ K +HD LPPVL+P+E
Sbjct: 367 GVQNSDKEDPR--RVKFLSQMEKLMKKLPNYAPVDAAVDQKAKDFLHDCLPPVLTPDEFA 424
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV +R A IT T +RL+RA R
Sbjct: 425 TSVQGASVRWVNGRVKGTAAHITTQTLIRLLRAGCAR 461
>gi|198416710|ref|XP_002119787.1| PREDICTED: similar to Nucleolar protein 66 (hsNO66), partial [Ciona
intestinalis]
Length = 594
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 162/341 (47%), Positives = 224/341 (65%), Gaps = 5/341 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D WE PL + R Y + L ++K+++ IL + + + N+D+T Y++G+R+T +++GR
Sbjct: 164 DIWESRPLLVLRHCPRYADGLFSTKEMNRILNECNVRYSVNLDVTTYQNGRRETHNIDGR 223
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A +VWD+++ GCSIR NPQ + KP+ +L A+LQE F C VGANTY TPP QGFAPH
Sbjct: 224 AYAPVVWDYFKNGCSIRLKNPQAFSKPVWRLCATLQEFFKCMVGANTYLTPPGTQGFAPH 283
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF+LQLEGKK+W +Y PR E LPRYSS NF+ +EIG I T TLE G+LLY P
Sbjct: 284 YDDIEAFVLQLEGKKEWTLYSPRSGKETLPRYSSGNFTADEIGDEIFTQTLEAGNLLYFP 343
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RGYIHQA + + HSLHVTIS+YQ+ +W DLLEK +P LQ A D+EFR+GLP+ YL
Sbjct: 344 RGYIHQAKALPDTHSLHVTISMYQRNSWGDLLEKLLPTTLQNAIIDDVEFRKGLPLDYLS 403
Query: 241 YAGLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
G + D D R A + DL +L + VD+D DE G +HDA+PP ++ +
Sbjct: 404 LFG---EQNCDKHPDRRRAFMGKISDLFRRLADKVDIDAAADEHGINFLHDAMPPYMTQD 460
Query: 300 ELQCSVFENGLRMSQTGE-VYNATQITKDTKVRLVRANAVR 339
E S+ + M+ G + I + T++R++R NA+R
Sbjct: 461 EKSRSIHGCDVGMTSPGTPTHYICDIDEKTRIRVLRKNALR 501
>gi|405958337|gb|EKC24473.1| Nucleolar protein 66 [Crassostrea gigas]
Length = 453
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 157/338 (46%), Positives = 223/338 (65%), Gaps = 3/338 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE++PL + R Y + ++ ++D ILR+ I+F N+D+T + +GQR+T + GRA
Sbjct: 29 WERKPLLVQRHMAQYNDGWFSTAELDKILREENIQFSTNLDVTTFTNGQRETHNPVGRAF 88
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P +VWD Y+ GCS+R LNPQTY + + +L + LQE F C VGAN Y TPP QGFAPHYD
Sbjct: 89 PSVVWDFYQNGCSVRLLNPQTYSRNVWKLLSVLQEYFNCCVGANVYLTPPGTQGFAPHYD 148
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAFILQLEG+K W++Y PR +E LPR SS NFS++++G PIL L+PGDLLY PRG
Sbjct: 149 DIEAFILQLEGRKHWRLYSPRTDNEVLPRLSSGNFSEKDLGKPILDTVLDPGDLLYFPRG 208
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQ + + + HSLH+T+S YQK W DL EK +P+ALQ A D+EFRRGLP Y Y
Sbjct: 209 TIHQGNCMEDTHSLHITVSCYQKNTWGDLFEKMVPRALQIAIDEDVEFRRGLPREYSSYM 268
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+A +D L +K+ ++ L+ ++++++ +D D+M K +HD+LPPVLS E
Sbjct: 269 GIANSD-MDGAPRNLFLKK-VEQLMVRMIQHLPVDSACDQMAKSFIHDSLPPVLSDAEKA 326
Query: 303 CSVFENGLRMSQTGE-VYNATQITKDTKVRLVRANAVR 339
SV +G + V ++ DT ++++R +R
Sbjct: 327 FSVHGSGEHWDKEKMCVVGTAEMEPDTTIKIIRKGILR 364
>gi|47224570|emb|CAG03554.1| unnamed protein product [Tetraodon nigroviridis]
Length = 454
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 163/339 (48%), Positives = 220/339 (64%), Gaps = 6/339 (1%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+P+ I RK +YY L ++ + D ILR+ +++ N+D+T Y +G+R+T + GRAL
Sbjct: 39 WEKKPILIQRKNPDYYKGLFSTAEFDRILREEDVQYGVNLDVTSYTNGRRETHNPPGRAL 98
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P VW Y+ GCS+R LNPQ + + + + LQE FG GAN Y TPP QGFAPHYD
Sbjct: 99 PYNVWGFYKSGCSLRLLNPQAFSSTVWNILSILQENFGSMAGANIYLTPPGTQGFAPHYD 158
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++QLEGKK W+VY PR+ DE LP +SSPNF++ +IG PIL V LE GDLLY PRG
Sbjct: 159 DIEAFVVQLEGKKHWRVYNPRVEDEVLPVFSSPNFNEADIGKPILEVVLEAGDLLYFPRG 218
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQ + + HSLH+TIS YQK +W DLL++ +P AL+ A D EFRRGLP+ YL Y
Sbjct: 219 FIHQGDCLPDAHSLHITISSYQKNSWGDLLQRMLPAALEIAVEEDTEFRRGLPLDYLTYM 278
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+ D + R ++ L+ KL Y +D VD+ + +HD LPP+L+P EL
Sbjct: 279 GVQNSDKDDPR--RQQFFSQIETLMKKLTTYCPVDAAVDQKARDFLHDCLPPMLTPVELA 336
Query: 303 CSVFENGLRMS-QTGEVYNAT-QITKDTKVRLVRANAVR 339
SV G R + G+V + + I T+VRL+RA R
Sbjct: 337 SSV--KGARTRWEGGQVTDESLDINTQTRVRLLRAGCAR 373
>gi|348517751|ref|XP_003446396.1| PREDICTED: lysine-specific demethylase NO66-like [Oreochromis
niloticus]
Length = 541
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 157/338 (46%), Positives = 217/338 (64%), Gaps = 4/338 (1%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+P+++ RK +YY L ++ ++D I R+ +++ N+D+T Y +G+R+T + GR L
Sbjct: 126 WEKKPIFVQRKNPDYYKGLFSTAELDRIFREEDVQYGVNLDVTSYTNGKRETHNPPGRVL 185
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P WD Y GCS+R LNPQ + + + + LQE FG GAN Y TPP QGFAPHYD
Sbjct: 186 PYTAWDFYESGCSLRMLNPQAFSSTVWNVLSILQEQFGSMAGANIYLTPPGTQGFAPHYD 245
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++QLEGKK+W+VY PR DE LP SSPNF Q +IG PIL V +E GDL+Y PRG
Sbjct: 246 DIEAFVVQLEGKKRWRVYSPRSDDEVLPVLSSPNFDQADIGKPILDVVVEAGDLIYFPRG 305
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQ + + + HSLH+TIS YQK +W DLL+K +P AL+ A D+EFRRGLP+ YL Y
Sbjct: 306 FIHQGNCLPDAHSLHITISSYQKNSWGDLLQKLVPAALEVAMEEDVEFRRGLPLDYLTYM 365
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+ D + R + +L+ +L Y +D VD+ K +HD LPP L+P+EL
Sbjct: 366 GVQNSDKEDPR--RAKFFSRIDNLMKRLTNYAPVDAAVDQKAKDFIHDCLPPALTPDELV 423
Query: 303 CSVFENGLRMSQTGEVYN-ATQITKDTKVRLVRANAVR 339
SV R + G+V + I T++R++RA R
Sbjct: 424 TSVQGAPTRW-EGGQVKDVGVHIITQTQIRILRAQCAR 460
>gi|242006700|ref|XP_002424185.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507526|gb|EEB11447.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 467
Score = 329 bits (844), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 169/344 (49%), Positives = 218/344 (63%), Gaps = 12/344 (3%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
+ WEK+PLYISR YYN L + + L + + F KNID+T Y DGQR T +++G+
Sbjct: 39 NFWEKKPLYISRNNNEYYNELCSMNAFEKALSEKDMYFTKNIDVTSYIDGQRFTENLDGK 98
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A +WD + EG SIR LNPQT+I + LN +LQE F CFVGAN Y TP QGFAPH
Sbjct: 99 ATVSNIWDFFNEGKSIRLLNPQTFIPNVWLLNTNLQEFFNCFVGANMYLTPAGTQGFAPH 158
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF+LQLEG+K WKVY PR E L R SS NF ++EIG PIL VTL+PGD+LY P
Sbjct: 159 YDDIEAFVLQLEGQKHWKVYNPRDSSEVLARESSKNFKEDEIGKPILKVTLKPGDMLYFP 218
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RGYIHQA + + HSLH+T+S YQK +W DL EK P ALQ A D+EFR+GLPIGYL
Sbjct: 219 RGYIHQAKCLPDTHSLHLTVSCYQKNSWADLFEKIFPVALQKAIFEDVEFRKGLPIGYLN 278
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
+ G+ + + + K N +L K++E +D GVD++ +HDALPP L+ +E
Sbjct: 279 HMGMMFSEENNEFRQKFIQKFN--ELGRKVLEMCPIDAGVDQLSTNFIHDALPPCLTEDE 336
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTK-----VRLVRANAVR 339
G R + + N I K+T V+L+RA + R
Sbjct: 337 ----KLYGGDRNVKLPDK-NFLHILKNTSNTNLGVKLIRATSAR 375
>gi|443692249|gb|ELT93889.1| hypothetical protein CAPTEDRAFT_153587 [Capitella teleta]
Length = 698
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 162/359 (45%), Positives = 228/359 (63%), Gaps = 24/359 (6%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRD---------------NVIEFKKNIDITLY 47
WEK+PL + R+ NY N +++++D ILR+ + F +NID+ Y
Sbjct: 246 WEKKPLMVKRRNPNYNNGWFSTQELDRILREVGAFIFEAGFYGISQEHLNFTENIDVVTY 305
Query: 48 EDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT 107
E+G+R+T + GRA ++WDH++ GCS+R LNPQ+Y + + + + LQE FGCF GAN
Sbjct: 306 ENGKRETHNPVGRAYAPVLWDHFQNGCSVRLLNPQSYSRNVWKYLSVLQEYFGCFAGANV 365
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
Y TPP QGFAPHYDDIEAFILQLEGKK+W++Y PR E L R+SS NFS+++IG PI
Sbjct: 366 YLTPPGTQGFAPHYDDIEAFILQLEGKKRWRLYDPRNDAEKLARFSSGNFSEDDIGNPIF 425
Query: 168 TVTLEPGDLLYLPRGYIHQA------STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
V LE GDLLY PRG IHQA T+ ++HSLH+T+SV Q +W DLL+K +P+AL
Sbjct: 426 DVILEAGDLLYFPRGTIHQACHFASGCTLEDDHSLHITVSVSQLNSWGDLLQKVLPRALD 485
Query: 222 AAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
A D+EFR+ LP YLRY G+A + +R ++ L+ KL+ Y +D GVD
Sbjct: 486 IAIDEDVEFRQSLPRDYLRYMGVAFSD--QVCPERHQFMRRIEQLMMKLLSYAPVDAGVD 543
Query: 282 EMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGE-VYNATQITKDTKVRLVRANAVR 339
+MG + + D+LPPVL+ E CS+ NG R + + ++ DT+V+L+R +R
Sbjct: 544 QMGVKFITDSLPPVLTRTESACSIHGNGERWDDERQCIAGGVELQPDTEVKLIRQGCLR 602
>gi|410916727|ref|XP_003971838.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase
NO66-like [Takifugu rubripes]
Length = 553
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 157/337 (46%), Positives = 213/337 (63%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+P+ + RK +YY L ++ + D ILR +++ N+D+T Y +G R+T + GRAL
Sbjct: 123 WEKKPILVQRKNPDYYKGLFSTAEFDRILRGEDVQYGVNLDVTSYTNGTRETHNPPGRAL 182
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P VW Y+ GCS+R LNPQ + + + + LQE FG GAN Y TPP QGFAPHYD
Sbjct: 183 PYTVWGFYKSGCSLRLLNPQAFSATVWNVLSILQENFGSMAGANIYLTPPGTQGFAPHYD 242
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++QLEGKK W+VY PR+ DE LP SSPNF + EIG PIL V LE GDLLY PRG
Sbjct: 243 DIEAFVVQLEGKKHWRVYKPRVDDEILPVLSSPNFDEVEIGEPILEVVLEAGDLLYFPRG 302
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQ + + HSLH+TIS YQK +W DLL++ +P AL+ A D+EFRRGLP+ YL Y
Sbjct: 303 FIHQGDCLPDAHSLHITISSYQKNSWGDLLQRMLPAALEIAMEEDVEFRRGLPLDYLTYM 362
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+ D + R ++ L+ KL Y +D VD+ + +HD LPP+L+ E
Sbjct: 363 GVQNSDLDDPR--RPKFLSQIETLMKKLASYCPVDAAVDQKARDFLHDCLPPMLTTAEQA 420
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV R + +++ I+ T++RL+RA R
Sbjct: 421 SSVKGARTRWECDQVMDDSSDISTRTRIRLLRAGCAR 457
>gi|195397425|ref|XP_002057329.1| GJ17031 [Drosophila virilis]
gi|284433508|sp|B4M7P8.1|NO66_DROVI RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|194147096|gb|EDW62815.1| GJ17031 [Drosophila virilis]
Length = 907
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 224/340 (65%), Gaps = 4/340 (1%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WE+ + RKQ NY++ LI+ K ID++L N +EF NID+T Y++G R+T + +GRA
Sbjct: 488 YWERNACQVKRKQPNYFSQLISFKLIDEMLIRNHLEFTTNIDVTTYKNGMRETHNPDGRA 547
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
+P VW Y +GCSIR LNP TY+ ++QL A +QE F C VGAN Y TPP +QGFAPHY
Sbjct: 548 MPPTVWGFYSDGCSIRILNPSTYLIKVRQLCAMMQEFFHCLVGANVYLTPPNSQGFAPHY 607
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF+LQ+EG+K+W++Y P + L R SS N+SQ E+G P+ LEPGD+LY PR
Sbjct: 608 DDIEAFVLQVEGRKRWRLYSPLHPSDVLARNSSGNYSQAELGEPLFDAVLEPGDILYFPR 667
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G +HQA +HSLH+T+SVYQ+ A+ +LLE+ MP LQ A L RRGLP+ ++
Sbjct: 668 GTVHQAVCDQQQHSLHITLSVYQQQAYANLLEELMPAVLQRAIKHHLSLRRGLPLHIWQH 727
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD--LDDGVDEMGKQLMHDALPPVLSPE 299
GLA+G D L N K L+ + + D +D VD++ K+ H+ALPPV+ PE
Sbjct: 728 LGLAKGDQKSELRDELLG--NTKRLVQQYLMPSDAQIDAAVDQLAKRFQHEALPPVVLPE 785
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
E + +VF + + + G+ ++T+ T VRL+RAN +R
Sbjct: 786 EHERTVFGSRSQANSHGQCLCDYELTERTSVRLLRANILR 825
>gi|449274682|gb|EMC83760.1| Nucleolar protein 66, partial [Columba livia]
Length = 398
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 157/316 (49%), Positives = 204/316 (64%), Gaps = 2/316 (0%)
Query: 24 SKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQT 83
+ D LR + F ++D+T Y DG R+T + GRALP +VWD Y+ GCS+R L+PQT
Sbjct: 1 TADFDAALRSGEVHFGTHLDVTSYADGVRETHNPSGRALPAVVWDFYQNGCSLRLLSPQT 60
Query: 84 YIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR 143
+ + Q + LQE FG GANTY TPP QGFAPHYDDIEAF+LQLEGKK W+VY PR
Sbjct: 61 FSPTVWQFLSVLQEHFGSMAGANTYLTPPGTQGFAPHYDDIEAFVLQLEGKKHWRVYRPR 120
Query: 144 MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
E LP++SS N +Q E+G P+L LE GDLLY PRG+IHQ + + HSLH+T+S Y
Sbjct: 121 TDAEVLPQFSSANLTQAELGEPMLETVLEAGDLLYFPRGFIHQGDCLPDAHSLHITVSSY 180
Query: 204 QKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENL 263
Q+ +W DLLEK +P ALQ A D+E+R+GLP+ Y Y G+A + VD A R A E +
Sbjct: 181 QRNSWGDLLEKLLPAALQMALEEDVEYRQGLPMDYFGYMGVANSEAVD--ARRTAFMEKV 238
Query: 264 KDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQ 323
+ L+ KLV Y +D VD+ K +HD LPPVL+ E SV+ R G +
Sbjct: 239 QSLIKKLVNYAPIDAAVDQRAKSFLHDCLPPVLTESEKAQSVYGFPARWQDGGPHDVDIR 298
Query: 324 ITKDTKVRLVRANAVR 339
ITKDT+VRL+R +R
Sbjct: 299 ITKDTEVRLLRHGIIR 314
>gi|195163311|ref|XP_002022494.1| GL12933 [Drosophila persimilis]
gi|284433504|sp|B4GUZ2.1|NO66_DROPE RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|194104486|gb|EDW26529.1| GL12933 [Drosophila persimilis]
Length = 687
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 155/340 (45%), Positives = 223/340 (65%), Gaps = 6/340 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
DHWEK P + ++NLI+ K ID +L N +E+ NID+T YEDG R+T++ +GR
Sbjct: 268 DHWEKSPFRVKTTTSGGFSNLISFKMIDQMLIQNHVEYTTNIDVTSYEDGVRKTLNPDGR 327
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
ALP VW HY+ GCSIR LNP +Y+ L+QL LQE F C VGAN Y TPP +QGFAPH
Sbjct: 328 ALPPSVWAHYQRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTPPESQGFAPH 387
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF+LQ+EGKK+W++Y P + LPR SS N SQ E+G PI+ + L+PGDLLY P
Sbjct: 388 YDDIEAFVLQVEGKKRWRIYAP---TKELPRESSGNLSQTELGDPIMDIVLKPGDLLYFP 444
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG+IHQA T + HSLH+T+S YQ+ ++ +L+EK MP ++ + L+ R+GLP+ +
Sbjct: 445 RGWIHQAITEKDSHSLHITLSAYQQQSYANLMEKLMPLVVKESVEQTLKLRKGLPLDIFQ 504
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDDGVDEMGKQLMHDALPPVLSPE 299
G+A + A R + +++++L +LV +D +D++ + H+ALPP ++P+
Sbjct: 505 NLGVANAEWKG--AHRQKLIQHIQNLAQRLVPTEGQIDRALDQLAIKFQHEALPPTIAPQ 562
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EL+ +VF + G ++ + T VRL+RAN VR
Sbjct: 563 ELKRTVFGAQATADRNGHCSLDYELAEGTAVRLLRANIVR 602
>gi|194888169|ref|XP_001976869.1| GG18702 [Drosophila erecta]
gi|284433501|sp|B3NU20.1|NO66_DROER RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|190648518|gb|EDV45796.1| GG18702 [Drosophila erecta]
Length = 657
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 156/341 (45%), Positives = 218/341 (63%), Gaps = 4/341 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D WE + RK NYY+ L++ K IDDIL N +EF N+D+T+Y +G+R+T++ EGR
Sbjct: 236 DFWENNACVVQRKNPNYYSKLMSFKMIDDILMRNRVEFGTNLDVTIYRNGKRETLNPEGR 295
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A P +VWD Y EGCSIR LNP TY+ L+Q+ + +QE F C VGAN Y TPP +QGFAPH
Sbjct: 296 AFPSVVWDFYAEGCSIRILNPSTYLLGLRQVCSIMQEFFHCLVGANVYLTPPNSQGFAPH 355
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF++Q+EG+K+W++Y P + L R SS N+ Q+++G PI+ LE GDLLY P
Sbjct: 356 YDDIEAFVIQVEGRKRWRLYEPPEKADQLSRTSSGNYDQKQLGEPIIDEVLEAGDLLYFP 415
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG +HQA T SLH+T+SVYQ+ A+ +LLE MP L+ A + RRGLP+
Sbjct: 416 RGTVHQAITEKGHFSLHITLSVYQQQAYANLLETLMPIVLKKAVKKSVALRRGLPLHTFH 475
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
G + D+L EN++ L++K + D+D+ VD++ K+ H+ALPP++ P
Sbjct: 476 VLGEVQRTNRCESRDQLV--ENVQKLVSKYLMPSTQDIDEAVDQLAKKFQHEALPPIILP 533
Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EE +VF + Q G + DT VRL+RAN +R
Sbjct: 534 EEKVRTVFGSRSISDQEGNSVCDYEFDTDTSVRLLRANILR 574
>gi|312374032|gb|EFR21685.1| hypothetical protein AND_16621 [Anopheles darlingi]
Length = 646
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 148/308 (48%), Positives = 205/308 (66%), Gaps = 2/308 (0%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WE++PL + R ++YY L++ ID++LR N IE+ KNIDIT Y GQR+T + EGR
Sbjct: 187 YWERKPLLVQRNDRHYYTGLLSRAMIDEMLRTNNIEYTKNIDITSYVGGQRETHNPEGRV 246
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
LP +W +Y EGCS+R LNPQTY++ + +LN LQE F C GAN Y TPP +QGFAPHY
Sbjct: 247 LPPEMWQYYGEGCSVRMLNPQTYLRTIYELNVKLQEYFHCMTGANFYLTPPNSQGFAPHY 306
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF+LQ+EG+K+W++Y PR E L R SS N +++E+ P++ V LE GDLLY PR
Sbjct: 307 DDIEAFVLQIEGRKRWRLYPPRRRQEILARVSSENIAEDELAAPLIDVVLEAGDLLYFPR 366
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G IHQA+TV HSLHVT+SVYQ+ +W DL E+ +P AL A D +FR GLP ++
Sbjct: 367 GCIHQAATVPGAHSLHVTLSVYQRNSWADLFEQMLPIALATAAEEDQQFRAGLPFDLHQH 426
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
G+ V + RL ++ L+ + +L +D VD++ K+ HDALPP++ E
Sbjct: 427 FGIVHSDEVSSERRRLTVR--LRAMFDRLFSDSAIDAAVDQLAKRFQHDALPPMVDFTEW 484
Query: 302 QCSVFENG 309
+V+ G
Sbjct: 485 DDTVYGRG 492
>gi|198477587|ref|XP_002136543.1| GA27865, partial [Drosophila pseudoobscura pseudoobscura]
gi|198142830|gb|EDY71543.1| GA27865, partial [Drosophila pseudoobscura pseudoobscura]
Length = 494
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 223/340 (65%), Gaps = 6/340 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
DHWEK P + ++NLI+ K ID +L N +E+ NID+T YEDG R+T++ +GR
Sbjct: 75 DHWEKSPFRVITTTSGGFSNLISFKMIDKMLIQNHVEYTTNIDVTSYEDGVRKTLNPDGR 134
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
ALP VW HY+ GCSIR LNP +Y+ L+QL LQE F C VGAN Y TPP +QGFAPH
Sbjct: 135 ALPPSVWAHYQRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTPPESQGFAPH 194
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF+LQ+EGKK+W++Y P + LPR SS N SQ E+G PI+ + L+PGDLLY P
Sbjct: 195 YDDIEAFVLQVEGKKRWRIYAP---TKELPRESSGNLSQTELGDPIMDIVLKPGDLLYFP 251
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG+IHQA T + HSLH+T+S YQ+ ++ +L+EK MP ++ + L+ R+GLP+ +
Sbjct: 252 RGWIHQAITEKDSHSLHITLSAYQQQSYANLMEKLMPLVVKESVEQTLKLRKGLPLDIFQ 311
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDDGVDEMGKQLMHDALPPVLSPE 299
G+A + + +L +++++L +LV +D +D++ + H+ALPP ++P+
Sbjct: 312 NLGVANAEWNGVHRQKLI--QHIQNLAQRLVPTEGQIDRALDQLAIKFQHEALPPTIAPQ 369
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EL+ +VF + G ++ + T VRL+RAN VR
Sbjct: 370 ELKRTVFGAQATADRNGHCSLDYELAEGTAVRLLRANIVR 409
>gi|195477158|ref|XP_002100112.1| GE16340 [Drosophila yakuba]
gi|284433510|sp|B4Q068.1|NO66_DROYA RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|194187636|gb|EDX01220.1| GE16340 [Drosophila yakuba]
Length = 683
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 155/341 (45%), Positives = 221/341 (64%), Gaps = 4/341 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D WE + RK +YY+ LI+ K ID++L + ++F N+D+T Y++G+R+T++ EGR
Sbjct: 262 DFWEHTAFVVQRKNPHYYSKLISFKMIDEMLVRHRLDFTINVDVTTYKNGKRETLNPEGR 321
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
ALP +VW Y EGCSIR LNP TY+ L+Q+ + +QE F C VGAN Y TPP +QGFAPH
Sbjct: 322 ALPPVVWGLYSEGCSIRILNPSTYLVGLRQVCSIMQEFFHCLVGANVYLTPPNSQGFAPH 381
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF++Q+EG+K+W++Y P + L R SS NF QE++G PIL LE GDLLY P
Sbjct: 382 YDDIEAFVIQVEGRKRWRLYEPPSGSDQLCRNSSSNFDQEQLGEPILDEVLEAGDLLYFP 441
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG +HQA T +HSLH+T+SVYQ+ A+++LLEK MP L+ A + RRGLP+
Sbjct: 442 RGTVHQAITEEEQHSLHITLSVYQQQAYVNLLEKLMPIVLKKAIKQSVALRRGLPLHTFH 501
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
G A+ + R + EN++ L+ K + D+D+ VD++ K+ H+ALPP++ P
Sbjct: 502 VLGEAQR--ANRSDSRNQLVENVQKLVTKHLMPSAQDIDEAVDQLAKKFQHEALPPIILP 559
Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EE +VF + + G + T VRL+RAN +R
Sbjct: 560 EEQVRTVFGSRSTADEQGNAICDYEFDTKTSVRLLRANILR 600
>gi|358334326|dbj|GAA52775.1| lysine-specific demethylase NO66 [Clonorchis sinensis]
Length = 852
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 168/381 (44%), Positives = 225/381 (59%), Gaps = 44/381 (11%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
++EK+PL I RK ++ N +T+ +ID +LR + F +++D+ YEDG+R T++ G+A
Sbjct: 246 YFEKQPLLIQRK-GSFENQWLTTYEIDTLLRQERVMFTEHLDLAAYEDGRRYTLNPPGQA 304
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
P +VWDHY GCS+R L PQ + + +++ + LQE FGCF+GAN Y TPP +QGFAPHY
Sbjct: 305 FPSVVWDHYNTGCSVRLLCPQVFFRNIRERLSLLQEYFGCFIGANVYLTPPGSQGFAPHY 364
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF+LQLEG K+W+VY PR E L SPNFSQEEIG PILT TL+PGD+LY PR
Sbjct: 365 DDIEAFVLQLEGTKRWRVYAPRDESEKLACDPSPNFSQEEIGEPILTATLKPGDVLYFPR 424
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
GYIHQA T + HSLH+TIS YQK +W D L K +P ALQ+A T +EFR+GLP+G+LR+
Sbjct: 425 GYIHQAETSEDAHSLHLTISTYQKHSWGDFLSKLLPIALQSAVETSVEFRKGLPVGFLRH 484
Query: 242 AGLARGKPVDIQADRLAMKENL-----KDLLAKLVEYVD--------------------- 275
G + D+L + +++A+L D
Sbjct: 485 VGGFSQSISSKEKDKLGEATKVVAHVRAEVVARLRMLADQLERQASTGQPEPSSAPCQSL 544
Query: 276 -----LDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRM------------SQTGEV 318
L D L+ +LPP L EEL C V G R S+ V
Sbjct: 545 TAPDPLLTAADLFATDLIAQSLPPQLRDEELMCHVRSQGERWSLCTAAEDGSGGSKKPGV 604
Query: 319 YNATQITKDTKVRLVRANAVR 339
+ ++ DTKVRLVR +AVR
Sbjct: 605 VDLVELEPDTKVRLVRWSAVR 625
>gi|347966833|ref|XP_321123.4| AGAP001938-PA [Anopheles gambiae str. PEST]
gi|333469879|gb|EAA01582.4| AGAP001938-PA [Anopheles gambiae str. PEST]
Length = 716
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 150/309 (48%), Positives = 202/309 (65%), Gaps = 3/309 (0%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HWE++PL + R + YY L++ ID++LR N IE+ KNIDIT Y DGQR+T + +GR
Sbjct: 289 HWERKPLLVQRSDRAYYAGLLSRAMIDEMLRTNNIEYTKNIDITSYRDGQRETHNPDGRV 348
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
LP +W Y GCS+R LNPQTY++ + +LN LQE F C GAN Y TPP +QGFAPHY
Sbjct: 349 LPPDLWHFYDAGCSVRMLNPQTYLRSVYELNVKLQEYFHCMTGANFYLTPPGSQGFAPHY 408
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-GTPILTVTLEPGDLLYLP 180
DDIEAF+LQ+EG+K+W++Y PR E L R SS N Q+E+ G PI+ V LE GDLLY P
Sbjct: 409 DDIEAFVLQIEGRKRWRLYGPRQPQEVLARVSSENLRQDELAGPPIVDVVLEAGDLLYFP 468
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQA+TV HSLH+T+SVYQ+ W DL E+ +P AL A T+ + R GLP+ +
Sbjct: 469 RGCIHQAATVPGAHSLHITVSVYQRNCWADLFEQMLPIALATAADTEPQLREGLPLDLHQ 528
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
+ G+ I ADR + ++ + KL +D VD++ K+ HDALPP++ E
Sbjct: 529 HFGIVHSD--GISADRRRLIVRIRAMFDKLFSDAAIDAAVDQLAKRFQHDALPPLVDFGE 586
Query: 301 LQCSVFENG 309
+V+ G
Sbjct: 587 WDDTVYGGG 595
>gi|198477583|ref|XP_002136541.1| GA27866 [Drosophila pseudoobscura pseudoobscura]
gi|284433505|sp|B5DUH6.1|NO66_DROPS RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|198142828|gb|EDY71541.1| GA27866 [Drosophila pseudoobscura pseudoobscura]
Length = 946
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 153/340 (45%), Positives = 223/340 (65%), Gaps = 6/340 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
DHWEK P + ++NLI+ K ID +L N +E+ NID+T YEDG R+T++ +GR
Sbjct: 527 DHWEKSPFRVITTTSGGFSNLISFKMIDKMLIQNHVEYTTNIDVTSYEDGVRKTLNPDGR 586
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
ALP VW HY+ GCSIR LNP +Y+ L+QL LQE F C VGAN Y TPP +QGFAPH
Sbjct: 587 ALPPSVWAHYQRGCSIRILNPSSYLVQLRQLCVKLQEFFHCLVGANVYLTPPESQGFAPH 646
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF+LQ+EGKK+W++Y P + LPR SS N SQ E+G PI+ + L PGDLLY P
Sbjct: 647 YDDIEAFVLQVEGKKRWRIYAPT---KELPRESSGNLSQTELGDPIMDIVLMPGDLLYFP 703
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG+IHQA T + HSLH+T+S YQ+ ++ +L+EK MP ++ + L+ R+GLP+ +
Sbjct: 704 RGWIHQAITEKDSHSLHITLSAYQQQSYANLMEKLMPLVVKESVEQTLKLRKGLPLDIFQ 763
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDDGVDEMGKQLMHDALPPVLSPE 299
G+A + + +L +++++L +L+ +D +D++ + H+ALPP ++P+
Sbjct: 764 NLGVANAEWNGVHRQKLI--QHIQNLAQRLMPTEGQIDRALDQLAIKFQHEALPPTIAPQ 821
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EL+ +V+ +TG ++ + T VRL+RAN VR
Sbjct: 822 ELKRTVYGAQATADKTGHCSLDYELAEGTAVRLLRANIVR 861
>gi|340370610|ref|XP_003383839.1| PREDICTED: lysine-specific demethylase NO66-like [Amphimedon
queenslandica]
Length = 559
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 154/337 (45%), Positives = 211/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE++P+ I R +Y + +S+ +D ILR+N IE NI IT Y DG+++ GRA
Sbjct: 162 WEQKPVMIPRNVPHYNDGWFSSRDLDRILRENHIEHLVNIRITSYADGEKEVEQSSGRAH 221
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
+VWD + +G SI+FLNPQTY +P+ L + LQE F FVGAN Y TP QGFAPHYD
Sbjct: 222 APMVWDRFSDGYSIQFLNPQTYSRPMWLLTSFLQEHFSSFVGANIYLTPAGTQGFAPHYD 281
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++Q+EGKK+W++Y PR E LPR SS + +EEIG PIL + L PGDLLY PRG
Sbjct: 282 DIEAFVVQVEGKKRWRLYKPRNRYEILPRTSSNDLDEEEIGKPILDIILNPGDLLYFPRG 341
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQA + + HSLH+T+S QK W D E+ +P AL++A D +FR LP Y Y
Sbjct: 342 VIHQAVALPDTHSLHITLSTAQKHTWGDFFERMLPLALKSAIEEDEDFRHSLPWDYSDYV 401
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G G D + + K + DL +++V+Y L DEM +H +LPP LS E +
Sbjct: 402 GFVHGNSKDKRICQFFDKTH--DLFSRIVQYAPLLSAADEMAVDFLHSSLPPCLSENEKK 459
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
C ++ G + + G+V + T IT DT +++VR +VR
Sbjct: 460 CMIYNRGPSLEEDGKVTHQTMITLDTMIKIVRQKSVR 496
>gi|260836893|ref|XP_002613440.1| hypothetical protein BRAFLDRAFT_123918 [Branchiostoma floridae]
gi|284433495|sp|C3XRY1.1|NO66_BRAFL RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|229298825|gb|EEN69449.1| hypothetical protein BRAFLDRAFT_123918 [Branchiostoma floridae]
Length = 607
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 159/337 (47%), Positives = 211/337 (62%), Gaps = 25/337 (7%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+PL + R ++Y + +++ + IL +N I+F +N+D+T YE GQR+T + GRA
Sbjct: 211 WEKKPLLVKRHLESYNDGWFSTEDLTKILHENDIQFGRNLDVTTYEGGQRETHNPPGRAN 270
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P +VWD+Y+ GCS+R LNPQTY + + +L ++LQE F VGAN Y TPP QGFAPHYD
Sbjct: 271 PAVVWDYYQNGCSVRLLNPQTYSQGVWRLCSTLQEYFSSMVGANIYLTPPGTQGFAPHYD 330
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG NFSQEEIG IL VTLEPGDLLY PRG
Sbjct: 331 DIEAFVLQLEG----------------------NFSQEEIGEAILDVTLEPGDLLYFPRG 368
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQAS + + HSLH+T+S Q+ W DL+EK +P AL A + D+EFR+ LP YL Y
Sbjct: 369 TIHQASALPDTHSLHITVSTCQRNTWGDLMEKLVPAALTMAFSEDVEFRQALPRDYLDYM 428
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
GLA D + R A E L+ LL++LV YV +D GVD+ + M D LPPV + E
Sbjct: 429 GLANADLDDPR--RKAFLETLQSLLSRLVNYVPVDAGVDQKAVEFMRDCLPPVFTKNERA 486
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
CS++ R+ + G V + + T V+L+R A R
Sbjct: 487 CSIYGCRTRL-EKGRVVGSVDLKTSTPVKLIRKGAAR 522
>gi|351712535|gb|EHB15454.1| Nucleolar protein 66 [Heterocephalus glaber]
Length = 627
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 210/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + I R+ YY L ++ +D ILR+ ++F +++D T Y +G+R+T++ GRAL
Sbjct: 209 WEREAVLIRRQDHAYYQGLFSTADLDSILRNEEVQFGQHLDATRYVNGRRETLNPPGRAL 268
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 269 PAAAWSLYQAGCSLRLLCPQAFSTTIWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 328
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQE++G P+L LEPGDLLY PRG
Sbjct: 329 DIEAFVLQLEGRKLWRVYRPRVPGEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 388
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP +L Y
Sbjct: 389 FIHQAECQNGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFLDYM 448
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 449 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 506
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 507 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 543
>gi|194764344|ref|XP_001964290.1| GF20792 [Drosophila ananassae]
gi|284433500|sp|B3MSI4.1|NO66_DROAN RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|190619215|gb|EDV34739.1| GF20792 [Drosophila ananassae]
Length = 843
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 154/339 (45%), Positives = 216/339 (63%), Gaps = 4/339 (1%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK + R+ Y+ LI+ + ID++L + +EF NID+T Y+DG RQT++ EGRA+
Sbjct: 425 WEKNACQVQRQTPKYFAELISFEMIDEMLIRHHLEFTTNIDVTTYKDGVRQTLNPEGRAM 484
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P VW Y +GCSIR LNP TY+ L+Q+ + LQE F C VGAN Y TPP +QGFAPHYD
Sbjct: 485 PPAVWSSYADGCSIRILNPSTYLAGLRQVCSMLQEFFHCLVGANVYLTPPNSQGFAPHYD 544
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQ+EG+K+W++Y+P + L R+SS NF Q E+ PI LE GD+LY PRG
Sbjct: 545 DIEAFVLQVEGRKRWRLYMPVKPTDMLARHSSGNFDQGELDEPIFDEVLEAGDVLYFPRG 604
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+HQA T ++HSLH+T+SVYQ+ A+ +LLE MP L+ A + RRGLP+ +
Sbjct: 605 TVHQAITEKDQHSLHITLSVYQQQAYANLLENLMPMVLKNAVQQKMALRRGLPLHTWQNL 664
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
GLA G RL + ++ ++ K L +D VD++ K+ H+ALPP + PEE
Sbjct: 665 GLAHGG--SEGRSRLNLISGIQQMVQKYLLPTEEQIDAAVDQLAKRFQHEALPPTILPEE 722
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+VF + + G+ +IT+ T VRL+RAN +R
Sbjct: 723 RVRTVFGSRSQTDAQGQCLCDYEITEQTSVRLLRANILR 761
>gi|153792138|ref|NP_001093172.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
NO66 [Bos taurus]
gi|284433494|sp|A5PK74.1|NO66_BOVIN RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|148745305|gb|AAI42387.1| C10H14ORF169 protein [Bos taurus]
Length = 667
Score = 310 bits (795), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 148/337 (43%), Positives = 210/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ +YY L ++ +D ILR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 251 WEREAVLVRRQDHSYYQGLFSTAVLDSILRNEEVQFGQHLDAARYINGRRETLNPPGRAL 310
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W YR GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 311 PAAAWSLYRAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 370
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 371 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 430
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S +Q+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 431 FIHQAECQDGVHSLHLTLSTFQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 490
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 491 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 548
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 549 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 585
>gi|296482958|tpg|DAA25073.1| TPA: lysine-specific demethylase NO66 [Bos taurus]
Length = 667
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 148/337 (43%), Positives = 210/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ +YY L ++ +D ILR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 251 WEREAVLVRRQDHSYYQGLFSTAVLDSILRNEEVQFGQHLDAARYINGRRETLNPPGRAL 310
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W YR GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 311 PAAAWSLYRAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 370
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 371 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 430
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S +Q+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 431 FIHQAECQDGVHSLHLTLSTFQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 490
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 491 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 548
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 549 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 585
>gi|195456912|ref|XP_002075342.1| GK15670 [Drosophila willistoni]
gi|284433509|sp|B4NP88.1|NO66_DROWI RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|194171427|gb|EDW86328.1| GK15670 [Drosophila willistoni]
Length = 767
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 148/345 (42%), Positives = 220/345 (63%), Gaps = 12/345 (3%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D WEK + R Y++ LI+ + ID ++ + +EF NID+T Y+DG+R+T++ EGR
Sbjct: 347 DFWEKNACQVQRNAPTYFSELISFEMIDQMMLKHHLEFTTNIDVTSYKDGRRETLNPEGR 406
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A+P VW Y EGCSIR LNP TY+ L+ + + +QE F C VGAN Y TPP +QGFAPH
Sbjct: 407 AMPPTVWGFYGEGCSIRILNPSTYLPGLRTMCSLMQEFFHCLVGANVYLTPPNSQGFAPH 466
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+DDIEAF+LQ+EG+K+W++Y+P + L R SS N++ +++G PI L+PGD+LY P
Sbjct: 467 FDDIEAFVLQVEGRKRWRLYMPLQPSDVLARESSGNYTPDQLGEPIFDEVLKPGDVLYFP 526
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG +HQA T HSLH+T+SVYQ+ A+ +LLEK MP LQ+A + RRGLP+ +
Sbjct: 527 RGTVHQAITEKKHHSLHITLSVYQQQAYANLLEKLMPMVLQSAIKHSVSLRRGLPLHTWQ 586
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEY------VDLDDGVDEMGKQLMHDALPP 294
+ G+A G A + + + L + ++V+ +D VD++ K+ H+ALPP
Sbjct: 587 HLGIAHG------ATKCSSRSQLIKGIQEMVQQHLTPSENQIDAAVDQLAKRYQHEALPP 640
Query: 295 VLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ PEE +VF + G+ ++T+DT +RL+RAN +R
Sbjct: 641 TILPEEKLRTVFGSRSATDAHGKCLCDYELTEDTSIRLLRANILR 685
>gi|296215462|ref|XP_002754138.1| PREDICTED: lysine-specific demethylase NO66 [Callithrix jacchus]
Length = 631
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 148/337 (43%), Positives = 210/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 213 WEREAVLVRRQDHTYYQGLFSTADLDAMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 272
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 273 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 332
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 333 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 392
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 393 FIHQAECQNGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 452
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 453 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 510
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V TQ+T +T+V +++ R
Sbjct: 511 LSVYGLPIRWEAGEPVNVGTQLTTETEVHMLQDGIAR 547
>gi|257743021|ref|NP_076122.2| bifunctional lysine-specific demethylase and histidyl-hydroxylase
NO66 [Mus musculus]
gi|284018104|sp|Q9JJF3.2|NO66_MOUSE RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|148670807|gb|EDL02754.1| mCG4934 [Mus musculus]
gi|187957070|gb|AAI38095.1| RIKEN cDNA 2410016O06 gene [Mus musculus]
Length = 603
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 150/337 (44%), Positives = 208/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ ++YY L ++ +D +LR ++F +++D Y DG+R+T++ GRAL
Sbjct: 185 WEREAVLVRRQDRSYYEGLFSTADLDSMLRYEDVQFGQHLDAARYVDGRRETLNPPGRAL 244
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W YR GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 245 PAAAWSLYRAGCSLRLLCPQAFSPTVWQFLAVLQEQFGSMAGSNVYLTPPDSQGFAPHYD 304
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR E L SSPNFSQE++G P+L LEPGDLLY PRG
Sbjct: 305 DIEAFVLQLEGRKLWRVYRPRDPSEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 364
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 365 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAIEENVEFRRGLPRDFMDYM 424
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 425 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 482
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV +R V Q+T +T+V +++ R
Sbjct: 483 LSVHGLPVRWEAGEPVNVGAQLTTETQVHMLQDGVAR 519
>gi|66806873|ref|XP_637159.1| transcription factor jumonji, jmjC domain-containing protein
[Dictyostelium discoideum AX4]
gi|74853002|sp|Q54K96.1|NO66_DICDI RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66; AltName: Full=JmjC domain-containing
protein G
gi|60465574|gb|EAL63656.1| transcription factor jumonji, jmjC domain-containing protein
[Dictyostelium discoideum AX4]
Length = 514
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 154/338 (45%), Positives = 218/338 (64%), Gaps = 11/338 (3%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
++ ++ LY+ R N Y N T +D +LR+N+++F +N+D+T Y D QR T++ EGRA
Sbjct: 108 YFGQKHLYVKRNGDNIYKNFFTKDSLDKMLRNNLMKFTENVDVTNYVDFQRITLNPEGRA 167
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
P +VW HY+EGCS+R LNPQT+ + +L ++LQ F C VGAN Y TP AQGFAPHY
Sbjct: 168 YPSLVWKHYKEGCSVRLLNPQTFNSNVWKLCSTLQTHFQCGVGANIYLTPAGAQGFAPHY 227
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DD++ FILQLEGKK+W++Y PR +E LP+ SS NF+QEEIG P TVTLE GDLLY PR
Sbjct: 228 DDVDVFILQLEGKKEWRLYKPRDANEVLPKKSSENFTQEEIGEPYFTVTLEAGDLLYFPR 287
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G IHQA + ++ HSLH+T+S Y W DL+ K + +AL+ A LEFR GLP Y +Y
Sbjct: 288 GVIHQAVSPSDVHSLHITVSTYLNNTWGDLIGKVLNRALEIANEECLEFREGLPRDYTQY 347
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
G+ V + R + + + L KL + + +D G D+M + + D+LPPVL+ E
Sbjct: 348 LGVIHSDKVGDER-RKELTDKVGTLWDKLGQLLPIDIGADQMAVKYLLDSLPPVLTQLEK 406
Query: 302 QCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ S+ + M +I +T+ RL+RA++VR
Sbjct: 407 KHSIEDETTSM----------KIKPETRFRLIRADSVR 434
>gi|126282811|ref|XP_001375764.1| PREDICTED: lysine-specific demethylase NO66-like [Monodelphis
domestica]
Length = 841
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/337 (47%), Positives = 211/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+ + + R +YY L ++ ++D +LR I+F ++D Y +GQR+T++ GRAL
Sbjct: 424 WEKDAVLVRRYNPDYYQGLFSTAELDSVLRREDIQFGLHLDAARYRNGQRETLNPPGRAL 483
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W YR+GCS+R L PQ + + Q+ + LQE FG VGAN Y TPP +QGFAPHYD
Sbjct: 484 PATAWSLYRDGCSLRLLCPQAFSAVMWQVLSVLQEHFGSMVGANAYLTPPGSQGFAPHYD 543
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEGKK W+VY PR E LP++SSPNF E +G P+L LEPGDLLY PRG
Sbjct: 544 DIEAFVLQLEGKKLWRVYKPREQVEELPQFSSPNFGPEGLGKPVLQEVLEPGDLLYFPRG 603
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+TIS +Q+ +W DLLE +P ALQAA D+EFRRGLP YL Y
Sbjct: 604 FIHQAECEPGVHSLHLTISTFQRNSWGDLLEPLLPAALQAAMEEDVEFRRGLPRDYLDYM 663
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G+ + D + R E L+ LL +L +Y +D D+ K +HD LPPVLS +E
Sbjct: 664 GVQHSESGDPR--RGPFLEKLQGLLVRLAQYAPVDAVADQRAKGFLHDCLPPVLSEKERA 721
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV R +IT +T+VRL+R R
Sbjct: 722 LSVHGLPARWEAEEARDVGAKITTETQVRLLRHGLTR 758
>gi|195047182|ref|XP_001992288.1| GH24285 [Drosophila grimshawi]
gi|284433502|sp|B4JMQ2.1|NO66_DROGR RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|193893129|gb|EDV91995.1| GH24285 [Drosophila grimshawi]
Length = 723
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 220/340 (64%), Gaps = 4/340 (1%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WE + + RK+K+ Y++L++ + ID++L +N +EF NID+T Y+DG RQT + +GRA
Sbjct: 301 YWESKACQVKRKRKDLYSDLVSFEMIDEMLIENHLEFTTNIDVTSYKDGVRQTHNPDGRA 360
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
+P VW HY +GCS+R LNP TY+K L+ + A+LQE F C VGAN Y TPP +QGFAPHY
Sbjct: 361 MPPTVWGHYSDGCSVRILNPSTYLKGLRGVCAALQEHFHCLVGANVYLTPPNSQGFAPHY 420
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF+LQ+EG+K+W++Y ++ L R SS N Q+++ PI LE GDLLY PR
Sbjct: 421 DDIEAFVLQVEGRKRWRLYDAPSPNDVLARTSSGNLKQQQLSKPIFDEVLEAGDLLYFPR 480
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G +HQA T HSLH+T+SVYQ+ ++ +L+E MP LQ A +L+ RRGLP+G +
Sbjct: 481 GCVHQAVTEQQHHSLHITLSVYQQQSYANLMEALMPAVLQNAIKHNLDMRRGLPLGTWHH 540
Query: 242 AGLARG-KPVDIQADRLAMKENL-KDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
G+ G K ++D + ++L LA +D VD++ + H+ALPP ++
Sbjct: 541 LGMVHGDKKTKERSDLITHTQSLFSKYLAPTASQIDA--AVDQLAIRFQHEALPPRIASS 598
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
E + +VF + + + G +T+ TK+RL+R N VR
Sbjct: 599 EKKRTVFGSRNKKDKHGNCRCDYDLTEQTKIRLLRQNIVR 638
>gi|311261319|ref|XP_003128699.1| PREDICTED: lysine-specific demethylase NO66-like [Sus scrofa]
Length = 645
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 229 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 288
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 289 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 348
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 349 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 408
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 409 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 468
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 469 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 526
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 527 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 563
>gi|106879206|ref|NP_078920.2| bifunctional lysine-specific demethylase and histidyl-hydroxylase
NO66 [Homo sapiens]
gi|284018103|sp|Q9H6W3.2|NO66_HUMAN RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=60S ribosomal
protein L8 histidine hydroxylase; AltName: Full=Histone
lysine demethylase NO66; AltName: Full=Myc-associated
protein with JmjC domain; AltName: Full=Nucleolar
protein 66; Short=hsNO66; AltName: Full=Ribosomal
oxygenase NO66; Short=ROX
Length = 641
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 223 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 282
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 283 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 342
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 343 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 402
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 403 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 462
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 463 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 520
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 521 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 557
>gi|7670372|dbj|BAA95038.1| unnamed protein product [Mus musculus]
Length = 603
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 150/337 (44%), Positives = 207/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ ++YY L ++ +D +LR ++F +++D Y DG+R+T++ GRAL
Sbjct: 185 WEREAVLVRRQDRSYYEGLFSTADLDSMLRYEDVQFGQHLDAARYVDGRRETLNPPGRAL 244
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W YR GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 245 PAAAWSLYRAGCSLRLLCPQAFSPTVWQFLAVLQEQFGSMAGSNVYLTPPDSQGFAPHYD 304
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR E L SSPNFSQE++G P+L LEPGDLLY PRG
Sbjct: 305 DIEAFVLQLEGRKLWRVYRPRDPSEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 364
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA + EFRRGLP ++ Y
Sbjct: 365 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAIEENEEFRRGLPRDFMDYM 424
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 425 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 482
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV +R V Q+T +T+V +++ R
Sbjct: 483 LSVHGLPVRWEAGEPVNVGAQLTTETQVHMLQDGVAR 519
>gi|410216552|gb|JAA05495.1| chromosome 14 open reading frame 169 [Pan troglodytes]
gi|410257830|gb|JAA16882.1| chromosome 14 open reading frame 169 [Pan troglodytes]
gi|410299056|gb|JAA28128.1| chromosome 14 open reading frame 169 [Pan troglodytes]
Length = 641
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 223 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 282
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 283 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 342
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 343 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 402
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 403 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 462
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 463 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 520
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 521 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 557
>gi|344273988|ref|XP_003408800.1| PREDICTED: lysine-specific demethylase NO66-like [Loxodonta
africana]
Length = 652
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 148/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D ILR ++F +++D Y +G+R+T++ GRAL
Sbjct: 234 WEREAVLVRRQDHAYYQGLFSTADLDAILRKEDVQFGQHLDAARYVNGRRETLNPPGRAL 293
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 294 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNIYLTPPNSQGFAPHYD 353
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQE++G P+L LEPGDLLY PRG
Sbjct: 354 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 413
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
++HQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 414 FVHQAQCQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 473
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 474 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 531
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 532 LSVYGLPIRWEAGQPVNVGAQLTTETEVHMLQDGIAR 568
>gi|431839107|gb|ELK01034.1| Nucleolar protein 66 [Pteropus alecto]
Length = 427
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y G+R+T++ GRAL
Sbjct: 11 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYVSGRRETLNPPGRAL 70
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 71 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 130
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 131 DIEAFVLQLEGRKLWRVYRPRVSTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 190
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 191 FIHQAECQDGMHSLHLTLSTYQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYM 250
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 251 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 308
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 309 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 345
>gi|10437975|dbj|BAB15138.1| unnamed protein product [Homo sapiens]
gi|15030188|gb|AAH11350.1| Chromosome 14 open reading frame 169 [Homo sapiens]
gi|38708274|gb|AAR27292.1| NO66 protein [Homo sapiens]
gi|48735363|gb|AAH71954.1| Chromosome 14 open reading frame 169 [Homo sapiens]
Length = 641
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 223 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 282
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 283 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 342
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 343 DIEAFVLQLEGRKLWRVYRPRAPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 402
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 403 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 462
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 463 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 520
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 521 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 557
>gi|444302172|pdb|4E4H|A Chain A, Crystal Structure Of Histone Demethylase No66
gi|444302173|pdb|4E4H|B Chain B, Crystal Structure Of Histone Demethylase No66
gi|444302174|pdb|4E4H|C Chain C, Crystal Structure Of Histone Demethylase No66
gi|444302175|pdb|4E4H|D Chain D, Crystal Structure Of Histone Demethylase No66
Length = 463
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 45 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 104
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 105 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 164
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 165 DIEAFVLQLEGRKLWRVYRPRAPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 224
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 225 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 284
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 285 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 342
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 343 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 379
>gi|449504341|ref|XP_004186246.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase NO66
[Taeniopygia guttata]
Length = 385
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 150/293 (51%), Positives = 190/293 (64%), Gaps = 2/293 (0%)
Query: 47 YEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN 106
Y +G R+T + GRALP +VWD Y+ GCS+R L PQ + + + LQE FG GAN
Sbjct: 11 YAEGVRETHNPLGRALPAVVWDFYQNGCSLRLLCPQAFSPTVWHFLSILQEHFGSMAGAN 70
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
TY TPP QGFAPHYDDIEAF+LQLEGKK W+VY PR E LP++SS N +Q E+G P+
Sbjct: 71 TYLTPPGTQGFAPHYDDIEAFVLQLEGKKHWRVYRPRTDAEVLPQFSSANLTQPELGEPV 130
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
L LE GDLLY PRG+IHQ + + HSLH+T+S YQ+ +W D LEK +P ALQ A
Sbjct: 131 LETVLEAGDLLYFPRGFIHQGDCLPDAHSLHITVSSYQRNSWGDFLEKLLPAALQMALEE 190
Query: 227 DLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQ 286
D+E+RRGLP+ YLRY G+A VD A R A E ++ L+ KLV+Y +D VD+ K
Sbjct: 191 DVEYRRGLPMDYLRYMGVANSDTVD--ARRTAFVEKVQSLIKKLVDYAPIDAAVDQRAKS 248
Query: 287 LMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+HD LPP+L+ E SV+ R G ITKDT+VRL+R VR
Sbjct: 249 FLHDCLPPMLTESEKAQSVYGFPARWQDGGPCNVDILITKDTEVRLLRHGIVR 301
>gi|391345560|ref|XP_003747053.1| PREDICTED: lysine-specific demethylase NO66-like [Metaseiulus
occidentalis]
Length = 515
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/345 (44%), Positives = 221/345 (64%), Gaps = 12/345 (3%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
+++EK+P + K +YY +L+++K++D +LR+ + F+KNID+T Y +G+R+T+ +EGR
Sbjct: 97 NYFEKKPFVVRHKDPSYYKHLLSTKQLDTVLRERDVHFEKNIDLTSYVNGKRETVLLEGR 156
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
ALP +VW +Y EGCS+R LNPQ+Y++ + L +L E F FVGAN Y TP QGFAPH
Sbjct: 157 ALPALVWQYYSEGCSVRLLNPQSYVEAVHTLCYTLAEHFSSFVGANVYLTPAGTQGFAPH 216
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGDLLYL 179
+DDI+AF+LQLEGKK+WKVY PR DE LP SS N ++E + PI+ TLE GDLLY+
Sbjct: 217 WDDIDAFVLQLEGKKRWKVYEPRNEDEELPLVSSGNLDRDELVDPPIIETTLETGDLLYM 276
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
PRG+IH+AS+ ++HSLH+TIS Q+ W D L+ +P AL+ A +++EFRR LP YL
Sbjct: 277 PRGFIHEASSCEDDHSLHITISANQRNTWSDYLKILLPAALELATESNVEFRRTLPRNYL 336
Query: 240 RYAGLARGKPVDIQAD-RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
+ G+ D++ R E +L+ KL++ D D+M MHD+LPPVL+
Sbjct: 337 QNMGVMHS---DLENKARQMFSEIASNLVQKLLKEAPFDPAADQMAVNYMHDSLPPVLNS 393
Query: 299 EELQCSVFENGLRMSQTGEVYNAT----QITKDTKVRLVRANAVR 339
E + + R E N + +T+V L+ N VR
Sbjct: 394 SEKELTA---SHRARWNCETLNGNSSFDHVNPETEVCLISKNCVR 435
>gi|379318603|pdb|4DIQ|A Chain A, Crystal Structure Of Human No66
gi|379318604|pdb|4DIQ|B Chain B, Crystal Structure Of Human No66
Length = 489
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 64 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 123
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 124 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 183
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 184 DIEAFVLQLEGRKLWRVYRPRAPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 243
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 244 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 303
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 304 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 361
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 362 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 398
>gi|355693420|gb|EHH28023.1| hypothetical protein EGK_18357 [Macaca mulatta]
Length = 529
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 111 WEREAVLVRRQDHTYYQGLFSTADLDLMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 170
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 171 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 230
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 231 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 290
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 291 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 350
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 351 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 408
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 409 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 445
>gi|383421627|gb|AFH34027.1| lysine-specific demethylase NO66 [Macaca mulatta]
gi|387542184|gb|AFJ71719.1| lysine-specific demethylase NO66 [Macaca mulatta]
Length = 642
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 224 WEREAVLVRRQDHTYYQGLFSTADLDLMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 283
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 284 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 343
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 344 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 403
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 404 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 463
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 464 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 521
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 522 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 558
>gi|109084252|ref|XP_001090820.1| PREDICTED: lysine-specific demethylase NO66 [Macaca mulatta]
Length = 642
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 224 WEREAVLVRRQDHTYYQGLFSTADLDLMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 283
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 284 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 343
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 344 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 403
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 404 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 463
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 464 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 521
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 522 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 558
>gi|355766856|gb|EHH62561.1| hypothetical protein EGM_20943, partial [Macaca fascicularis]
Length = 423
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 209/337 (62%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 5 WEREAVLVRRQDHTYYQGLFSTADLDLMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 64
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 65 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 124
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG
Sbjct: 125 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRG 184
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 185 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 244
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 245 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 302
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 303 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 339
>gi|426377411|ref|XP_004055459.1| PREDICTED: lysine-specific demethylase NO66 [Gorilla gorilla
gorilla]
Length = 686
Score = 305 bits (780), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 208/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 268 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 327
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 328 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 387
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR+ E L SSPN SQ+++G P+L LEPGDLLY PRG
Sbjct: 388 DIEAFVLQLEGRKLWRVYRPRVPTEELALTSSPNLSQDDLGEPVLQTVLEPGDLLYFPRG 447
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 448 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 507
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 508 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 565
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 566 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 602
>gi|157820533|ref|NP_001101510.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
NO66 [Rattus norvegicus]
gi|391359316|sp|D3ZU57.1|NO66_RAT RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|149025100|gb|EDL81467.1| similar to RIKEN cDNA 2410016O06 (predicted) [Rattus norvegicus]
Length = 597
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 149/337 (44%), Positives = 206/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ +YY L ++ +D +LR + F +++D Y DG+R+T++ GRAL
Sbjct: 179 WEREAVLVRRQDHSYYEGLFSTSDLDWMLRYEDVHFGQHLDAARYIDGRRETLNPPGRAL 238
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 239 PAAAWSLYQAGCSLRLLCPQAFSPTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 298
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR E L SSPNFSQE++G P+L LEPGDLLY PRG
Sbjct: 299 DIEAFVLQLEGRKLWRVYRPRDPSEELALTSSPNFSQEDLGEPVLQTVLEPGDLLYFPRG 358
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 359 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAVLPLAMQAAIEENVEFRRGLPRDFMDYM 418
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 419 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 476
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV +R V Q+T +T+V +++ R
Sbjct: 477 LSVHGLPIRWEAGEPVNVGAQLTTETQVHMLQDGIAR 513
>gi|194390104|dbj|BAG61814.1| unnamed protein product [Homo sapiens]
Length = 429
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 207/337 (61%), Gaps = 2/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+E + + R+ YY L ++ +D +LR+ ++F +++D Y +G+R+T++ GRAL
Sbjct: 11 WEREAVLVRRQDHTYYQGLFSTADLDSMLRNEEVQFGQHLDAARYINGRRETLNPPGRAL 70
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P W Y+ GCS+R L PQ + + Q A LQE FG G+N Y TPP +QGFAPHYD
Sbjct: 71 PAAAWSLYQAGCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYD 130
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEG+K W+VY PR E L SSPNFSQ+++G P+L L PGDLLY PRG
Sbjct: 131 DIEAFVLQLEGRKLWRVYRPRAPTEELALTSSPNFSQDDLGEPVLQTVLGPGDLLYFPRG 190
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+IHQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y
Sbjct: 191 FIHQAECQDGVHSLHLTLSTYQRNTWGDFLEAILPLAVQAAMEENVEFRRGLPRDFMDYM 250
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
G D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E
Sbjct: 251 GAQHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERA 308
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
SV+ +R V Q+T +T+V +++ R
Sbjct: 309 LSVYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 345
>gi|195133564|ref|XP_002011209.1| GI16410 [Drosophila mojavensis]
gi|284433503|sp|B4L6Q5.1|NO66_DROMO RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|193907184|gb|EDW06051.1| GI16410 [Drosophila mojavensis]
Length = 888
Score = 303 bits (775), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 200/301 (66%), Gaps = 4/301 (1%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WE+ + RKQ NY+ LI+ + ID++L +N +EF NID+T Y+ G RQT++ GRA
Sbjct: 492 YWERNACQVKRKQPNYFTQLISFQMIDEMLIENQLEFTTNIDVTTYKKGVRQTLNPVGRA 551
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
+ +W +Y +GCSIR LNP TY+ L+QL +++QE F C VGAN Y TPP +QGFAPHY
Sbjct: 552 MSPAIWGYYGDGCSIRILNPSTYLPKLRQLCSTMQEFFHCLVGANVYLTPPNSQGFAPHY 611
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF++Q+EG+K+W++Y P + L R SS N+ QEE+G P+ LE GD+LY PR
Sbjct: 612 DDIEAFVIQVEGRKRWRLYAPPHQSDVLARTSSGNYKQEELGQPLFDAVLEAGDILYFPR 671
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G +HQA T +HSLH+T+SVYQ+ A+ +LLE MP L+ A L RRGLP+ ++
Sbjct: 672 GTVHQAVTEPKQHSLHITLSVYQQQAYANLLEVLMPSVLERAIKHHLSLRRGLPLHIWQH 731
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLV--EYVDLDDGVDEMGKQLMHDALPPVLSPE 299
GLA+G Q D+L + K L+ + + +D VD++ K+ H+ALPP + PE
Sbjct: 732 VGLAKGGQQSEQRDQLM--NSTKQLVQRYLVPTEAQIDAAVDQLAKRFQHEALPPYIKPE 789
Query: 300 E 300
E
Sbjct: 790 E 790
>gi|196011016|ref|XP_002115372.1| hypothetical protein TRIADDRAFT_50673 [Trichoplax adhaerens]
gi|190582143|gb|EDV22217.1| hypothetical protein TRIADDRAFT_50673 [Trichoplax adhaerens]
Length = 431
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 205/337 (60%), Gaps = 3/337 (0%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE++P+ R+ +Y N L +S +D I+R+N IE+ N+D+T YE+G R+T + EGR L
Sbjct: 15 WERKPILAQRRSSSYNNGLFSSHDLDRIVRENYIEYSVNLDVTTYENGVRETHNAEGRVL 74
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
++WD+Y+ GCSIR LNPQTY + L + + LQE FG FVG N Y TPP QGFAPH+D
Sbjct: 75 ASVMWDYYQNGCSIRMLNPQTYSESLWKFCSLLQEYFGSFVGCNMYLTPPGTQGFAPHFD 134
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEGKKKW+ Y PR E LP YSS NF+Q EIG P LE GD Y PRG
Sbjct: 135 DIEAFVLQLEGKKKWRFYNPRDDSEILPEYSSGNFNQNEIGKPSFEFVLEQGDFAYFPRG 194
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQA ++ + HSLH+T+S Q ++ EK +P A+++A +L R+ LP + +A
Sbjct: 195 TIHQAQSLPDCHSLHITVSTCQLHSFGKYFEKLLPMAIRSAFKNNLGLRKSLPPDF--FA 252
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQ 302
+ A R + +K L +++ +D+ D ++HD LPP + EE
Sbjct: 253 NIGGIHADSKNARRKQLTTEVKQFLKDIIDDAPIDEAADLFAAGVIHDYLPPCHTQEERN 312
Query: 303 CSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
C +F G R + +G V + + +D +++LVR R
Sbjct: 313 CMIFTTGGRWNGSG-VVSCVTLNEDNRIKLVRREVAR 348
>gi|328717527|ref|XP_001950505.2| PREDICTED: lysine-specific demethylase NO66-like [Acyrthosiphon
pisum]
Length = 521
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 211/344 (61%), Gaps = 8/344 (2%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
DHWEK L++ R NY++ L + ++D ILR+N +++ N+DIT Y D R+T + GR
Sbjct: 86 DHWEKTILHVPRNSSNYFSQLFSLTELDTILRENNLQYGTNVDITSYTDNVRETHNPVGR 145
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A P +VWD+Y GCS+R LNPQ + + + A+LQE FG VG N Y TPP +QGFAPH
Sbjct: 146 AHPHVVWDYYNNGCSVRLLNPQLFAPEIYKFMANLQEYFGSLVGCNVYLTPPFSQGFAPH 205
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF++Q++G+K W+VY PR E LPR SS NF Q+EIG PIL V L PGD LY+P
Sbjct: 206 YDDIEAFVVQVDGEKHWRVYKPRSEFETLPRTSSRNFHQDEIGEPILDVILRPGDFLYMP 265
Query: 181 RGYIHQASTV-TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
RGYIHQA T+ T HSLH+T S YQ+ + D L+ + +L A D+ +R GLP+GY
Sbjct: 266 RGYIHQADTLFTETHSLHLTFSSYQQNSMYDFLQVVVNNSLNNAVKNDISYRSGLPVGYQ 325
Query: 240 RYAGLA---RGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG-KQLMHDALPPV 295
+ GL + P +Q R + + L+ L ++DLD +D+ K ++LPP
Sbjct: 326 HFGGLCELEKSPPRTVQ--RQQFENQINQLIRNLKNHLDLDHAIDKFSLKNYYTNSLPPS 383
Query: 296 LSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ E+ +V NG +M G+ + +V+L+R N R
Sbjct: 384 ICDVEMLRTV-ANGAKMLPDGKTTGEEVNLEIAEVKLIRRNCFR 426
>gi|326433887|gb|EGD79457.1| hypothetical protein PTSG_10023 [Salpingoeca sp. ATCC 50818]
Length = 681
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 219/340 (64%), Gaps = 11/340 (3%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++WEK+PL + RK++ +++ L +S+ ID I+R N +++ N+D+T YEDG+R T++ +GR
Sbjct: 239 EYWEKKPLLVRRKERGHFDGLFSSEAIDAIVRQNYLKYGVNLDLTKYEDGERTTLNPDGR 298
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P ++W + +GCS+R LNPQT+ +P+ + ++LQ+ FGC VGANTY TP QGFAPH
Sbjct: 299 VRPNVMWAMFEDGCSVRVLNPQTFHQPVWKAISALQDYFGCMVGANTYLTPKGTQGFAPH 358
Query: 121 YDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
YDDIEAFILQLEG+K+W++Y P V LPR+SSPN+ + E+G P L V L PGDLLY
Sbjct: 359 YDDIEAFILQLEGEKRWRLYDQPEGV--RLPRHSSPNYDESELGKPYLDVVLRPGDLLYF 416
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
PRG +HQA ++ HSLH+T+S YQ W ++K +P+AL A D +FR GLP L
Sbjct: 417 PRGVVHQAVSLPKSHSLHLTLSTYQLFDWAQYMQKLVPQALANAIEEDADFRSGLPRNSL 476
Query: 240 RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPE 299
Y GL++ Q ++ + K L+AKL EY D D++ + +H+ +PP LS E
Sbjct: 477 DYVGLSKVDENPPQREKFI--KQAKRLMAKLPEYASFDAAADQVAVENIHNCMPPCLSHE 534
Query: 300 ELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ S G + G V ++ ++ +R++ AVR
Sbjct: 535 QQALS----GRLRPRRGFVPKPLKM--NSFIRIISPRAVR 568
>gi|281209361|gb|EFA83529.1| transcription factor jumonji [Polysphondylium pallidum PN500]
Length = 519
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 212/341 (62%), Gaps = 15/341 (4%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D++ + L + R NYY + T ID +L+ N ++F +NID+T Y DG+RQT++ EGR
Sbjct: 112 DYFGVKHLVVKRGDSNYYKSFFTKDSIDKMLKSNCLKFSENIDVTNYVDGERQTLNPEGR 171
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A P VW HYREGCSIR LNPQT+ + ++ ++LQ F C +GAN Y TP AQGFAPH
Sbjct: 172 AYPAQVWKHYREGCSIRLLNPQTFNANVWRVLSTLQTHFQCGMGANVYLTPAGAQGFAPH 231
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDD++ FI+QLEGKK W++Y PR E L + SS NF QEEIG PI+ VTLE GD++Y P
Sbjct: 232 YDDVDVFIIQLEGKKVWRLYPPRSEQEVLAKKSSDNFEQEEIGEPIVEVTLEAGDVMYFP 291
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQA + + HSLH+TIS Y W DL+ K + +AL A +EFR GLP Y +
Sbjct: 292 RGVIHQAYSPEDVHSLHITISTYMNNTWGDLIGKVLTRALDIANDEYVEFREGLPRDYTQ 351
Query: 241 YAGLARGKPVDIQAD--RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
+ G+ D D R + + + L KL + + D G D M + +HD+LPPV +
Sbjct: 352 FLGVMHS---DKAGDERRKELLDKVGTLWEKLGQCLPTDVGADLMAVKFLHDSLPPVFNQ 408
Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
E + S+ ++ A+++ +++VRL+R +++R
Sbjct: 409 LEKKHSIEDDNA----------ASELKLESRVRLIRQDSIR 439
>gi|18858091|ref|NP_572160.1| CG2982 [Drosophila melanogaster]
gi|122129655|sp|Q7K4H4.1|NO66_DROME RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|15292145|gb|AAK93341.1| LD40453p [Drosophila melanogaster]
gi|22831673|gb|AAF45939.2| CG2982 [Drosophila melanogaster]
gi|220946188|gb|ACL85637.1| CG2982-PA [synthetic construct]
gi|220955888|gb|ACL90487.1| CG2982-PA [synthetic construct]
Length = 653
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 211/342 (61%), Gaps = 6/342 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D WE + R Y+ ++I+ K +D+IL + ++F N+D+T Y++G+R+T++ EGR
Sbjct: 232 DFWEHTACLVQRSNPKYFQSMISFKMLDEILIRHHLDFTVNVDVTTYKNGKRETLNPEGR 291
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
ALP VW Y +GCSIR LNP TY+ L+Q+ LQE F C VGAN Y TPP +QGFAPH
Sbjct: 292 ALPPAVWGFYSDGCSIRLLNPSTYLIRLRQVCTVLQEFFHCKVGANLYLTPPNSQGFAPH 351
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF++Q+EG+K+W +Y P + L R SS N+ QE++G PI+ L GD+LY P
Sbjct: 352 YDDIEAFVIQVEGRKRWLLYEPPKKADQLARISSGNYDQEQLGKPIIDEVLSAGDVLYFP 411
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG +HQA T +HSLH+T+SVYQ+ A+ +LLE MP L+ A + RRGLP+ +
Sbjct: 412 RGAVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSVALRRGLPLHTFQ 471
Query: 241 YAGLA-RGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLS 297
G A +G R + EN++ L+ + D+D+ VD+M K+ H+ALPP++
Sbjct: 472 VLGNAYKGNDC---GSRKQLVENVQKLVTNYLMPSEDDIDEAVDQMAKKFQHEALPPIVL 528
Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
P E +V + G + K T VRL+RAN +R
Sbjct: 529 PSEEVRTVHGARSDADEQGNCVCDYKFNKKTSVRLLRANILR 570
>gi|167523278|ref|XP_001745976.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775777|gb|EDQ89400.1| predicted protein [Monosiga brevicollis MX1]
Length = 432
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 199/339 (58%), Gaps = 2/339 (0%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++WE +PL I RK + + L +S+++DD++R N I++ NID+ Y DG R T + EGR
Sbjct: 36 EYWETKPLLIRRKNRQRFKGLFSSQQLDDVIRSNYIKYGVNIDMARYSDGVRTTENPEGR 95
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
+W Y +GCSIR LNPQTY KP+ QL ++LQE F C VG NTY TPP AQGFAPH
Sbjct: 96 VHANTMWALYEDGCSIRMLNPQTYAKPVWQLISTLQEYFQCMVGCNTYLTPPGAQGFAPH 155
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEA ILQLEG K+W++Y E LPR SS NF Q E+ PIL V L+PGD LY P
Sbjct: 156 YDDIEALILQLEGSKRWRLY-NNPTGERLPRTSSRNFDQSELSEPILDVVLQPGDFLYFP 214
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG HQA + +EHSLH+T+S YQ W + +K +P AL A A D EFR GLP+ L
Sbjct: 215 RGMAHQAVSTPDEHSLHITLSTYQLFDWAEYFKKLVPAALDYAIAEDAEFREGLPLQALN 274
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
+ GL + + R + K L KL++ D D + +H ++P L+ EE
Sbjct: 275 HVGLLHSE-TEGDNQRKRFMDKAKHLFQKLIDVAPYDSAADAVAVDFLHASMPSYLTSEE 333
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
L + + L ++ + +RL+R + R
Sbjct: 334 LALTSRQKQLAARSNPVELSSPALAPSDWIRLIRPSMCR 372
>gi|195565039|ref|XP_002106114.1| GD16684 [Drosophila simulans]
gi|284433507|sp|B4R4H1.1|NO66_DROSI RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|194203485|gb|EDX17061.1| GD16684 [Drosophila simulans]
Length = 847
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 213/339 (62%), Gaps = 4/339 (1%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+ + R Y+ +LI+ K +D+IL + ++F N+D+T Y++G+R+T++ EGRAL
Sbjct: 428 WEQTACLVQRTNPKYFQSLISFKMLDEILIRHHLDFTVNLDVTTYKNGKRETLNPEGRAL 487
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P VW Y EGCSIR LNP Y+ L+++ LQE F C V AN Y TPP +QGFAPHYD
Sbjct: 488 PPAVWGFYSEGCSIRLLNPSAYLTRLREVCTVLQEFFHCKVEANMYLTPPNSQGFAPHYD 547
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++Q+EG+K+W +Y P ++L R SS N+ QE++G PI+ L GD+LY PRG
Sbjct: 548 DIEAFVIQVEGRKRWLLYEPPKEADHLARISSGNYDQEQLGKPIIDEVLSAGDVLYFPRG 607
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+HQA T +HSLH+T+SVYQ+ A+ +LLE MP L+ A + RRGLP+ +
Sbjct: 608 TVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSVALRRGLPLHTFQVL 667
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLV--EYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
G A K D + +L + EN++ L+ K + D+D+ VD+M K+ H+ALPP++ P E
Sbjct: 668 GNAY-KANDCGSRQL-LVENVQKLVTKYLIPSEDDIDEAVDQMAKKFQHEALPPIVLPSE 725
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+V + G + + T VRL+RAN +R
Sbjct: 726 EVRTVHGARSGADEQGNCVCDYKFNEKTSVRLLRANILR 764
>gi|195340843|ref|XP_002037022.1| GM12339 [Drosophila sechellia]
gi|284433506|sp|B4I100.1|NO66_DROSE RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|194131138|gb|EDW53181.1| GM12339 [Drosophila sechellia]
Length = 655
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 214/339 (63%), Gaps = 4/339 (1%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE+ + R Y+ +LI+ K +D+IL + ++F N+D+T Y++G+R+T++ EGRAL
Sbjct: 236 WEQTACLVQRTNPKYFQSLISFKMLDEILIRHNLDFTVNLDVTTYKNGKRETLNPEGRAL 295
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P VW Y EGCSIR L+ Y+ L+++ LQE F C VGAN Y TPP +QGFAPHYD
Sbjct: 296 PPAVWGFYSEGCSIRLLHASAYLTRLREVCTVLQEFFHCKVGANMYLTPPNSQGFAPHYD 355
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++Q+EG+K+W +Y P ++L R SS N++QE++G PI+ L GD+LY PRG
Sbjct: 356 DIEAFVIQVEGRKRWLLYDPPKEADHLARISSGNYNQEQLGKPIIDEVLSAGDVLYFPRG 415
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
+HQA T +HSLH+T+SVYQ+ A+ +LLE MP L+ A + RRGLP+ +
Sbjct: 416 TVHQAITEEQQHSLHITLSVYQQQAYANLLETLMPMVLKKAVDRSVALRRGLPLHTFQIL 475
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAK--LVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
G A K D + +L + EN++ L+AK + D+D+ VD+M K+ H+ALPP++ P E
Sbjct: 476 GNAY-KANDCGSRQL-LVENVQKLVAKYLMPSEDDIDEAVDQMAKKFQHEALPPIVLPSE 533
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+V G + + T VRL+RAN +R
Sbjct: 534 EVRTVHGARSGADDQGNCVCDYKFNEKTSVRLLRANILR 572
>gi|326920526|ref|XP_003206522.1| PREDICTED: lysine-specific demethylase NO66-like, partial
[Meleagris gallopavo]
Length = 367
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 140/281 (49%), Positives = 181/281 (64%), Gaps = 2/281 (0%)
Query: 59 GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
G+ALP +VWD Y+ GCS+R LNPQ + + + LQE FG GANTY TPP QGFA
Sbjct: 5 GQALPAVVWDFYQNGCSLRLLNPQAFSTTVWHFLSILQERFGSMAGANTYLTPPGTQGFA 64
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PHYDDIEAF+LQLEGKK+W+VY PR E LP++SS N +Q E+ P+L V LE GDLLY
Sbjct: 65 PHYDDIEAFVLQLEGKKRWRVYSPRTSSEALPQFSSANLTQAELAEPLLEVVLEAGDLLY 124
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
PRG+IHQA + + HSLH+T+S YQ+ +W D LEK +P ALQ A D+E+R GLP+
Sbjct: 125 FPRGFIHQADCLPDVHSLHITVSSYQRNSWGDFLEKLLPAALQMALEEDIEYREGLPMDC 184
Query: 239 LRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
L Y G+A VD A R E ++ L+ KLV+Y +D +D+ K +HD LPPVL+
Sbjct: 185 LGYMGVANSDAVD--ARRTVFVEKVQHLIKKLVDYAPIDAAMDQRAKSFLHDCLPPVLTQ 242
Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
E Q SV+ + ITKDT++RL+ VR
Sbjct: 243 SEKQLSVYGFPAQWHNGRPCNVDIPITKDTEIRLLHHGIVR 283
>gi|328865556|gb|EGG13942.1| transcription factor jumonji [Dictyostelium fasciculatum]
Length = 524
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 206/339 (60%), Gaps = 11/339 (3%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D++ + + R NYY N T ID +L+ N ++F +NIDIT Y D +RQT++ +G+
Sbjct: 97 DYFGLQHYIVKRNDPNYYKNFFTKDSIDKMLKSNNLKFTENIDITNYVDNERQTLNPDGK 156
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A P +VW H R+GCS+R LNPQT+ + ++ + LQ F C +GAN Y TP AQGFAPH
Sbjct: 157 AYPAVVWKHVRDGCSLRLLNPQTFNMNVWKVLSCLQTYFQCGMGANIYLTPAGAQGFAPH 216
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDD++ FILQLEGKK W++Y PR E LP+ SS NF Q+EIG PI+ VTLE GD++Y P
Sbjct: 217 YDDVDVFILQLEGKKVWRLYEPRSEQERLPKKSSENFEQDEIGEPIIEVTLEAGDMMYFP 276
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQA + + HSLH+TIS W DL K + +AL A +EFR+GLP Y
Sbjct: 277 RGIIHQAYSPKDVHSLHITISTQMNNTWGDLFGKVITRALDIANEEAVEFRQGLPRDYTD 336
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
Y G+ V + R + ++ L KL Y+ +D G D M + +HD+LPP LS +E
Sbjct: 337 YLGVINSDKVGDER-RKELLDHASMLWEKLGNYLPVDIGADLMAVKYLHDSLPPFLSTQE 395
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ ++ ++ A + +VRL+R +AVR
Sbjct: 396 KKHTIEDDSA----------AASLKLTQRVRLLRKDAVR 424
>gi|325188355|emb|CCA22892.1| nucleolar protein putative [Albugo laibachii Nc14]
Length = 492
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 160/343 (46%), Positives = 209/343 (60%), Gaps = 17/343 (4%)
Query: 2 HWEKEPLYISRKQKNYYNNL-ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM--DVE 58
+W K P+ I RK++ +YNN +++ IL D+ +E+ ++D+ Y DG R+T +
Sbjct: 75 YWGKRPVVIKRKKQAFYNNCWFDEREMIRILEDHSLEYINDVDVARYIDGTRETWLPEKS 134
Query: 59 GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
G A + VW Y EG SIR L PQ Y L +L + L+ FG +GANTY TPP AQGFA
Sbjct: 135 GIASGKDVWTKYAEGWSIRLLCPQQYSDRLWRLLSILESEFGTMMGANTYLTPPRAQGFA 194
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PHYDDIEAFILQLEG+K+WKVY PR ++ LPR SS NF Q++IG P+L V L PGDLLY
Sbjct: 195 PHYDDIEAFILQLEGRKQWKVYSPRCTEQILPRVSSKNFQQDDIGDPLLEVELGPGDLLY 254
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
LPRGYIHQA T HSLH+TIS Q A+ DL+E +P+ALQ+A ++L R+ LP Y
Sbjct: 255 LPRGYIHQAKTAKELHSLHITISAGQHNAYADLMEVLIPQALQSAIQSNLSMRKSLPRNY 314
Query: 239 LRYAGLARGKPVDIQADRLA-MKENLKDLLAKLVEYV-DLDDGVDEMGKQLMHDALPPVL 296
L Y G+ +D A+R + E K L A + E V LD D+M K+ + D LPP L
Sbjct: 315 LSYMGVMHSD-LDEHAERQKFISETKKQLKAIMTEAVAKLDAASDQMAKRFLMDRLPPAL 373
Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EE CSV E +IT DT+ +L+R R
Sbjct: 374 LDEEEDCSV-----------EACPLLKITVDTQFKLLRGEIAR 405
>gi|449685252|ref|XP_002159583.2| PREDICTED: lysine-specific demethylase NO66-like, partial [Hydra
magnipapillata]
Length = 424
Score = 285 bits (728), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 123/212 (58%), Positives = 159/212 (75%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
W+K+PL+I R+Q Y N ++KK+D ILR+ +++ KN+DI +Y +GQR+T++ EGRA
Sbjct: 213 WQKKPLFIKRRQPLYNTNWFSTKKLDKILREKNVQYTKNLDIAVYRNGQRETLNPEGRAF 272
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P +VW Y +GCSIR LNPQ + K + QL + LQE FGC VG+N Y TPP +QGFAPHYD
Sbjct: 273 PSVVWKFYEDGCSIRLLNPQIFAKSVHQLTSRLQEYFGCLVGSNVYLTPPGSQGFAPHYD 332
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF++QLEGKK WK+Y PR +E L RYSS N +E +G PIL LE GD LY PRG
Sbjct: 333 DIEAFVIQLEGKKHWKLYPPRNTNEVLARYSSENMQEENLGEPILNKVLEAGDTLYFPRG 392
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
IHQAST+ + HSLH+TIS+YQK++W D LEK
Sbjct: 393 VIHQASTLEDSHSLHITISLYQKSSWGDYLEK 424
>gi|330798349|ref|XP_003287216.1| hypothetical protein DICPUDRAFT_32138 [Dictyostelium purpureum]
gi|325082799|gb|EGC36270.1| hypothetical protein DICPUDRAFT_32138 [Dictyostelium purpureum]
Length = 499
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 146/332 (43%), Positives = 206/332 (62%), Gaps = 12/332 (3%)
Query: 8 LYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVW 67
L+I R + N Y N T +D +LR+N ++F +N+D+T Y + +R T++ EGRA P +VW
Sbjct: 100 LHIKRSE-NVYKNFFTKDSLDKMLRNNYMKFTENVDVTNYVNEERITLNPEGRAYPSLVW 158
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAF 127
HY+EGCS+R LNPQT+ + +L + LQ F C VGAN Y TP AQGFAPHYDD++ F
Sbjct: 159 KHYKEGCSVRLLNPQTFNSNVWKLCSILQGHFKCGVGANIYLTPAGAQGFAPHYDDVDVF 218
Query: 128 ILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQA 187
ILQLEGKK+W++Y P+ E LP+ SS NF+Q+EIG P T TLE GDLLY PRG IHQA
Sbjct: 219 ILQLEGKKEWRLYHPKDQTEVLPKKSSENFAQDEIGEPYFTCTLEAGDLLYFPRGVIHQA 278
Query: 188 STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARG 247
+ + HSLH+T+S Y W DL+ K + +AL+ A LEFR GLP Y + G+
Sbjct: 279 VSPPDVHSLHITVSTYLNNTWGDLIGKVLNRALEIANEDCLEFREGLPRDYSDFLGVIHS 338
Query: 248 KPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFE 307
V + R + + + L KL +Y+ +D D+M + + D+LP L+ E + S+ +
Sbjct: 339 DKVG-EERRKELLDKVGTLWDKLGQYLPVDVAADQMHIKYLIDSLPTALNQNEKKHSIED 397
Query: 308 NGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
N +I DT+ RL+R ++VR
Sbjct: 398 ET----------NPMKIKTDTRFRLIRQDSVR 419
>gi|355731984|gb|AES10555.1| hypothetical protein [Mustela putorius furo]
Length = 401
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 194/313 (61%), Gaps = 2/313 (0%)
Query: 27 IDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIK 86
+ +LR ++F +++D Y +G+R+T++ GRALP W Y+ GCS+R L PQ +
Sbjct: 9 LRSMLRHEDVQFGQHLDAARYVNGRRETLNPPGRALPAAAWSLYQAGCSLRLLCPQAFST 68
Query: 87 PLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD 146
+ Q A LQE FG G+N Y TPP +QGFAPHYDDIEAF+LQLEG+K W+VY PR+
Sbjct: 69 TVWQFLAVLQEQFGSMAGSNIYLTPPNSQGFAPHYDDIEAFVLQLEGRKLWRVYRPRVPT 128
Query: 147 EYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
E L SSPNFSQ+++G P+L LEPGDLLY PRG+IHQA HSLH+T+S YQ+
Sbjct: 129 EELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRGFIHQAECQDGVHSLHLTLSTYQRN 188
Query: 207 AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDL 266
W D LE +P A+QAA ++EFRRGLP ++ Y G D + R A E ++ L
Sbjct: 189 TWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYMGAQHSDSKDPR--RTAFMEKVRVL 246
Query: 267 LAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITK 326
+A+L + +D D+ K +HD+LPPVL+ E SV+ +R V Q+T
Sbjct: 247 VARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERALSVYGLPIRWEAGEPVNVGAQLTT 306
Query: 327 DTKVRLVRANAVR 339
+T+V +++ R
Sbjct: 307 ETEVHMLQDGIAR 319
>gi|332022277|gb|EGI62590.1| Nucleolar protein 66 [Acromyrmex echinatior]
Length = 394
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 198/313 (63%), Gaps = 4/313 (1%)
Query: 27 IDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIK 86
++ LR+N + F KN+DIT Y +G ++ + GRA IV D++R CS+R LNPQT+I
Sbjct: 4 FEETLRNNNVLFTKNLDITSYSNGVKEIYNPPGRAYASIVKDYFRNNCSVRMLNPQTFIP 63
Query: 87 PLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD 146
+ NA+LQE+FGCF+GAN Y TPP +QGFAPH+DD++ FILQ+EGKK+WK+Y P
Sbjct: 64 QICAFNANLQEIFGCFIGANIYLTPPNSQGFAPHFDDVDVFILQIEGKKRWKLYKPLKSS 123
Query: 147 EYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
++L Y S +F + E+ PIL + GDLLY+PRG IHQA + + +SLH+TIS+ QK
Sbjct: 124 DFLAMYPSRDFDESELREPILDKVISAGDLLYIPRGTIHQAMAL-DTYSLHLTISINQKN 182
Query: 207 AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDL 266
W +LLEK +P+AL+ A D FR GLP+ YL Y G ++ R MK ++ +
Sbjct: 183 TWSNLLEKLLPQALKQATKKDFRFREGLPVNYLSYIGFQYTDKDLVRKGR--MKYIIRRM 240
Query: 267 LAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITK 326
L +L Y D+D D M K+ +HD LPP + E +C++ E+ +S G +I
Sbjct: 241 LGELERYFDIDQAADLMAKKHIHDFLPPNIFKSERRCTILEDDKNLSYNGPK-RCGKIKL 299
Query: 327 DTKVRLVRANAVR 339
T++RL+R +R
Sbjct: 300 STEIRLLRLYCMR 312
>gi|255088593|ref|XP_002506219.1| predicted protein [Micromonas sp. RCC299]
gi|226521490|gb|ACO67477.1| predicted protein [Micromonas sp. RCC299]
Length = 387
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 145/349 (41%), Positives = 200/349 (57%), Gaps = 18/349 (5%)
Query: 1 DHWEKEPLYISRK-QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE- 58
D WE+ P Y+SR K Y++ L++ ID+ LR + +++N+D+T Y+DG R+T ++
Sbjct: 20 DIWERRPAYVSRNAHKGYFDGLLSKADIDEWLRAGKMRYQRNVDVTSYKDGVRRTHNLND 79
Query: 59 -------------GRALPQIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVG 104
G A VW + +EGCS+R L+PQ + PL + A+L+ + C G
Sbjct: 80 DGSGGVDATTGEPGFADADTVWRRFEQEGCSLRVLHPQRWRDPLWKTLAALERFWNCSTG 139
Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT 164
N Y TP +QGF+PHYDDI+AFILQLEGKK W+VY PR E LPRYSSPNF Q+++G
Sbjct: 140 CNCYLTPADSQGFSPHYDDIDAFILQLEGKKLWRVYPPRSEAEMLPRYSSPNFGQDDVGE 199
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
P+L V LEPGDLLY+PRG +HQA+ V +HSLHVT+S Q W DLLE A P AL+ A
Sbjct: 200 PVLEVILEPGDLLYMPRGTVHQANCVPGDHSLHVTLSTNQFNTWADLLEVAFPAALRQAV 259
Query: 225 ATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG 284
A RR P YL + GL G+ + R + L +L ++ + D D +G
Sbjct: 260 AEVPALRRCPPPDYLAHLGLVDGEFDAVNPRRDDLIGALVELAQCVMRRLPFDAAADHIG 319
Query: 285 KQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLV 333
+LM LPP P + V G + T +IT++ RLV
Sbjct: 320 ARLMRQRLPP--PPSHVSAPVSATGANAAATVTDETRVRITQEEGARLV 366
>gi|301098029|ref|XP_002898108.1| nucleolar protein, putative [Phytophthora infestans T30-4]
gi|262105469|gb|EEY63521.1| nucleolar protein, putative [Phytophthora infestans T30-4]
Length = 676
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 202/343 (58%), Gaps = 18/343 (5%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++WE+ PL I R +YY+ + ++ID IL+ + +E+ ++D+T Y D R T++ G
Sbjct: 77 NYWEQRPLAIKRNFPSYYDGWFSKQEIDRILKTHTLEYGTDVDLTKYVDDTRHTLNPPGS 136
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A + VW HY +GCS+R L PQ + + +L A+L++ +GC GANTY TP QGFAPH
Sbjct: 137 ATAKQVWKHYDDGCSVRLLCPQKFSDDVWKLLATLEDEWGCMAGANTYLTPKNTQGFAPH 196
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+DDIEAF+LQ EG K WKVY P + L RY S NF E++G P L V LE GDLLY P
Sbjct: 197 FDDIEAFLLQTEGCKHWKVYKPLNESDVLARYPSGNFKAEDLGKPTLEVDLEQGDLLYFP 256
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG+IHQA +HSLH+T+S Q+ + LE +P+AL A T++E RR LP YL
Sbjct: 257 RGFIHQARAHKEKHSLHLTVSTGQQNTMGNFLEVLLPQALAGAINTNVELRRSLPRDYLE 316
Query: 241 YAGLA----RGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVL 296
Y G+ +G P + QA +K LK +L + + LD D+M K + D LPP L
Sbjct: 317 YMGVMHSDRKGDP-ERQAFANKLKGALKTVLGEAMGM--LDAASDQMAKNFLLDRLPPAL 373
Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EE C+ + L+ +IT +T+++LVR R
Sbjct: 374 EDEEENCTSDNSPLQ-----------KITVNTQLKLVRHGVAR 405
>gi|168049347|ref|XP_001777125.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671568|gb|EDQ58118.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 530
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 135/339 (39%), Positives = 190/339 (56%), Gaps = 19/339 (5%)
Query: 3 WEKEPLYISR-KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE-DGQRQTMDVEGR 60
WEK+P + R K +NYY + I+ +L ++ +++ NID+T Y+ DG R T EG
Sbjct: 122 WEKKPFLVRRPKNRNYYAGIFDKATIEKLLEEHELKYGLNIDVTKYDIDGGRSTFSSEGS 181
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A P VW Y +G S+R L+PQ + P+ + ++ + +G G N Y TP +QGF+PH
Sbjct: 182 ATPSKVWSKYADGWSVRILHPQRWCDPVFLILSAFERFWGSVAGCNAYLTPAGSQGFSPH 241
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
YDDIEAF++Q EG+K+WKVY PR E LPR+SSPNF Q EIG PIL V LEPGD+LY+P
Sbjct: 242 YDDIEAFVIQTEGRKRWKVYKPRTPGEALPRFSSPNFEQGEIGEPILDVDLEPGDILYMP 301
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG IHQA + HSLH+T+SV Q+ W D LE AMP+AL+ A + R LP GY
Sbjct: 302 RGTIHQAKASEDAHSLHITVSVGQRNCWGDFLEFAMPRALELASEDHILLRESLPRGYAD 361
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEE 300
Y G+A D R A + + + +A + + + D D++ + + LP
Sbjct: 362 YMGVAHSDDHD-NPQRAAFIDKIMECMAIVSQSIPWDSAADQLAVKFLQSRLP------- 413
Query: 301 LQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
L +IT ++VRLV + R
Sbjct: 414 ---------LPAPANAVHGKGQKITGKSRVRLVAPDVAR 443
>gi|298708016|emb|CBJ30378.1| expressed unknown protein [Ectocarpus siliculosus]
Length = 686
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 141/388 (36%), Positives = 205/388 (52%), Gaps = 63/388 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD----- 56
+WE+ P + ++ +Y++ + + ++ + +++ ++D+T Y + +R T++
Sbjct: 181 YWERNPFVVRGRRADYFSGWLETSDLEAFISGQSMQYGTDLDVTNYVNKRRVTLNPAAPA 240
Query: 57 --------------------------------------VEGRA-----LPQIVWDHYREG 73
GRA P+ VW +REG
Sbjct: 241 PPPEQSPSPPPKKGGHKGKGKGGPSTSGNGASGDAHLAATGRAGGAVVTPKFVWQKFREG 300
Query: 74 CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEG 133
CS+R PQ + PL L ++L+E FGC VG+N Y TPP +QGFAPH+DDIEAF+LQ+EG
Sbjct: 301 CSLRMPCPQKFSDPLHLLLSALEEEFGCMVGSNVYLTPPRSQGFAPHWDDIEAFLLQVEG 360
Query: 134 KKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNE 193
+K+W+VY P LPR SSPN + E++G P L V LEPGDLLYLPRG+ HQA TV +E
Sbjct: 361 RKRWRVYPPTDDQAVLPRLSSPNLTDEQVGEPALEVVLEPGDLLYLPRGWAHQAETVGDE 420
Query: 194 HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQ 253
SLH+T+S Q W DL+E +P+A+ +A + E R GLP YL Y G+ D +
Sbjct: 421 ASLHITVSAMQGNCWADLVEGLVPQAVASAVQANQELRSGLPRDYLEYMGVVHSDEEDKR 480
Query: 254 AD--RLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLR 311
D + M++ L ++ + +E LD D+MGK+ + D LPPVL E SV E G
Sbjct: 481 RDMFKAKMRKKLGLIVEEAIEL--LDAAADQMGKRYISDRLPPVLEESEELSSVVERG-- 536
Query: 312 MSQTGEVYNATQITKDTKVRLVRANAVR 339
IT TK+RL R R
Sbjct: 537 ---------DGAITPLTKLRLARRGCAR 555
>gi|348682869|gb|EGZ22685.1| hypothetical protein PHYSODRAFT_330429 [Phytophthora sojae]
Length = 485
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 201/342 (58%), Gaps = 18/342 (5%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WE+ PL I R ++YY+ + ++D IL+ + +E+ ++D+T Y D R T++ G A
Sbjct: 78 YWEQRPLAIKRNFRSYYDGWFSKAEVDRILKTHTLEYGADLDLTKYVDSTRHTLNPSGSA 137
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
+ VW H+ +GCS+R L PQ + + +L A+L++ +GC GANTY TP QGFAPH+
Sbjct: 138 TAKEVWKHFDDGCSVRLLCPQKFSDDVWKLLATLEDEWGCMAGANTYLTPKNTQGFAPHF 197
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DDIEAF+LQ EG K WKVY P + L RY S N+ E++G P++ V LE GDLLY PR
Sbjct: 198 DDIEAFLLQTEGCKHWKVYKPLNDSDMLARYPSGNYKPEDLGKPMIEVDLEEGDLLYFPR 257
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G+IHQA +HSLH+T+S Q+ + LE +P+AL A T++E RR LP YL Y
Sbjct: 258 GFIHQARAHKEKHSLHLTVSTGQQNTMGNFLEVLLPQALGGAINTNVELRRSLPRDYLDY 317
Query: 242 AGLAR----GKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
G+ G P + QA +K LK +L + + LD D+M K + D LPP L
Sbjct: 318 MGVMHSDREGDP-ERQAFANKLKGALKIVLGEAMGM--LDAAADQMAKNFLVDRLPPPLE 374
Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EE C+ + L+ +IT +T+++L+R R
Sbjct: 375 DEEENCTSDNSPLQ-----------KITVNTQLKLIRHGVAR 405
>gi|384253086|gb|EIE26561.1| cupin 4 [Coccomyxa subellipsoidea C-169]
Length = 414
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 119/302 (39%), Positives = 189/302 (62%), Gaps = 11/302 (3%)
Query: 3 WEKEPLYISRKQ-KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQT---MDVE 58
WE+ L + R + ++ + + ++ +LR I + NID+T Y DGQ +T +VE
Sbjct: 76 WEQRVLLVRRPECPTFFESWFSKGAMEALLRKETISYAYNIDVTKY-DGQERTNYNYNVE 134
Query: 59 GRALPQ-----IVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
A+ + ++W Y +EGCS+R L+PQ + L A L+E+F VG N Y TP
Sbjct: 135 SSAVEEAVDADVLWARYNQEGCSVRLLHPQRWFDHLAATLAKLEEVFSSCVGCNVYLTPG 194
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
++GFAPH+DDI+AF+LQ+EG K+W+++ P D LPR+SSP+F+QE++ P L V L+
Sbjct: 195 DSKGFAPHFDDIDAFVLQVEGAKRWRLHQPITEDHVLPRFSSPDFAQEDLDDPFLDVVLQ 254
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
PGD+LY+PRG IHQA + HSLH+T+S Q+ +W D LE A+P+ALQ A ++ RR
Sbjct: 255 PGDVLYMPRGTIHQAQALPGTHSLHLTVSANQRNSWADFLEVALPRALQLAAEECVDLRR 314
Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
G+P+GY++Y G+ ++A R A + + ++ ++++++ LD D++ Q M L
Sbjct: 315 GMPLGYMQYMGVPFSDDSSLEAKRAAFEATAQSMVQRVLQFLPLDGAADQLAAQFMQQRL 374
Query: 293 PP 294
PP
Sbjct: 375 PP 376
>gi|308505402|ref|XP_003114884.1| CRE-JMJC-1 protein [Caenorhabditis remanei]
gi|308259066|gb|EFP03019.1| CRE-JMJC-1 protein [Caenorhabditis remanei]
Length = 759
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 139/347 (40%), Positives = 207/347 (59%), Gaps = 19/347 (5%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
++ L + RKQ Y+ NL ++ ++ ++L N +E+ NI+I Y++G R T++ +GR
Sbjct: 336 YQSNVLVVHRKQPTYFGNLFSTARLCELLEKNHLEYGTNINIAQYKNGVRTTLNGKGRVY 395
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
PQIV H CS++ +NPQT+ + + LQE FGCFVGANTY TP + GFAPH+D
Sbjct: 396 PQIVKQHLNNLCSVQLVNPQTFDDRIWYICEVLQEQFGCFVGANTYLTPAGSSGFAPHWD 455
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLP 180
+I+AF+LQ+EG+K W+V+ P DE LP SS NF++EE+ P+ +E GD++Y+P
Sbjct: 456 EIDAFLLQVEGRKYWRVWAPESSDEELPLESSDNFTEEEMKGRVPVFEGWIEKGDVIYIP 515
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RGYIHQA T + HSLHVTIS ++ ++ +LLEK +P+A+ A + RRGLPIG
Sbjct: 516 RGYIHQARTDSKVHSLHVTISTGRQWSFANLLEKLVPEAIGALIQDQHKLRRGLPIGLFD 575
Query: 241 YAGLARGKPVDI---QADRLAMKENL--KDLLAKLVEYV---DLDDGVDEMGKQLMHDAL 292
G+ VD+ Q D K + ++KL +V + VD M K+ M AL
Sbjct: 576 MGGV-----VDLDYAQEDHFTEKFKIVVDRQMSKLRNFVADHTFESSVDSMAKEFMKQAL 630
Query: 293 PPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
PP+L+ E + SV G + G+ + T TKV+L+R + R
Sbjct: 631 PPLLTESEKKLSVI--GASIDLLGD--DLVDFTAKTKVKLIRKHTQR 673
>gi|71991759|ref|NP_001021644.1| Protein JMJC-1, isoform a [Caenorhabditis elegans]
gi|74956075|sp|O01658.2|NO66_CAEEL RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|351060644|emb|CCD68363.1| Protein JMJC-1, isoform a [Caenorhabditis elegans]
Length = 748
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 135/347 (38%), Positives = 207/347 (59%), Gaps = 19/347 (5%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
++ L + RKQ Y+ NL ++ ++ ++L N +E+ +NI+I Y++G R T++ +GRA
Sbjct: 325 YQSNVLVVRRKQPTYFGNLFSTARLGELLEKNHLEYGRNINIAQYKNGVRTTLNGQGRAY 384
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
PQIV H CS++ +NPQTY + L +QE FGCFVGANTY TP + GFAPH+D
Sbjct: 385 PQIVKQHLHNMCSVQLVNPQTYDDRIWYLCEVIQEQFGCFVGANTYLTPAGSSGFAPHWD 444
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLP 180
+I+AF+LQ+EG+K W+V+ P +E LP SS NF+++++ P+ +E GD++Y+P
Sbjct: 445 EIDAFLLQVEGRKYWRVWAPESAEEELPLESSDNFTEDDMKGREPVFEGWIEKGDMIYIP 504
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RGYIHQA T + HSLHVT+S ++ ++ +L+EK +P+A+ T + RRGLP G
Sbjct: 505 RGYIHQARTDSKVHSLHVTVSTGRQWSFANLMEKVVPEAIGVLTDTRHKLRRGLPTGLFD 564
Query: 241 YAGLARGKPVDI---QADRLAMKENL-----KDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
G+ +D+ Q D K + +L LV L+ VD + K+ M AL
Sbjct: 565 MGGV-----IDLDYSQEDHFVEKFKMVVDRHMSMLRNLVADQLLESSVDSLAKEFMKQAL 619
Query: 293 PPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
PP L+ +E + SV G + G+ + T TKVRL+R + R
Sbjct: 620 PPRLTEQEKKLSVL--GSSTNLLGD--DLVDFTARTKVRLIRRHTQR 662
>gi|341896700|gb|EGT52635.1| hypothetical protein CAEBREN_31833 [Caenorhabditis brenneri]
Length = 765
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 134/347 (38%), Positives = 207/347 (59%), Gaps = 19/347 (5%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
++ L + RKQ NY+ NL ++ ++ ++L N +E+ NI+I Y++G R T++ +GRA
Sbjct: 342 YQSNVLVVHRKQPNYFGNLFSTARLCELLEKNHLEYGTNINIAQYKNGIRTTLNGKGRAY 401
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
PQI+ H CS++ +NPQT+ + L LQE+FGCFVGANTY TP + GFAPH+D
Sbjct: 402 PQIIKQHLHNLCSVQLVNPQTFDDRIWYLCEVLQEIFGCFVGANTYLTPAGSSGFAPHWD 461
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLP 180
+I+AF+LQ+EG+K W+V+ P +E LP SS NF+++++ P+ +E GD++Y+P
Sbjct: 462 EIDAFLLQVEGRKYWRVWAPESAEEELPLESSDNFTEDDMKGREPVFEGWIEKGDMIYIP 521
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RGYIHQA T + HSLHVT+S ++ ++ +L+EK +P+A+ A + RRGLPIG
Sbjct: 522 RGYIHQARTDSKVHSLHVTVSTGRQWSFANLMEKIVPEAIGALTDNRHKLRRGLPIGLFD 581
Query: 241 YAGLARGKPVDI---QADRLAMKENL-----KDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
G+ +D+ Q D K + +L LV L+ VD + K+ M AL
Sbjct: 582 MGGV-----IDLDYPQEDHFVEKFKIVVDRHMSMLRNLVADHLLESSVDSLAKEFMKQAL 636
Query: 293 PPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
PP+L+ +E + SV G+ G + TKVR +R + R
Sbjct: 637 PPILTEKEKKLSVI--GVSTDLLGS--DLIDFNAKTKVRFIRKHTQR 679
>gi|341876636|gb|EGT32571.1| CBN-JMJC-1 protein [Caenorhabditis brenneri]
Length = 528
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 133/347 (38%), Positives = 206/347 (59%), Gaps = 19/347 (5%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
++ L + RKQ NY+ NL ++ ++ ++L N +E+ NI+I Y++G R T++ +GR
Sbjct: 105 YQSNVLVVHRKQPNYFGNLFSTARLCELLEKNHLEYGTNINIAQYKNGIRTTLNGKGRVY 164
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
PQI+ H CS++ +NPQT+ + L LQE+FGCFVGANTY TP + GFAPH+D
Sbjct: 165 PQIIKQHLHNLCSVQLVNPQTFDDRIWYLCEVLQEIFGCFVGANTYLTPAGSSGFAPHWD 224
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLP 180
+I+AF+LQ+EG+K W+V+ P +E LP SS NF+++++ P+ +E GD++Y+P
Sbjct: 225 EIDAFLLQVEGRKYWRVWAPETAEEELPLESSDNFTEDDMKGREPVFEGWIEKGDMIYIP 284
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RGYIHQA T + HSLHVT+S ++ ++ +L+EK +P+A+ A + RRGLPIG
Sbjct: 285 RGYIHQARTDSKVHSLHVTVSTGRQWSFANLMEKIVPEAIGALTDNRHKLRRGLPIGLFD 344
Query: 241 YAGLARGKPVDI---QADRLAMKENL-----KDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
G+ +D+ Q D K + +L LV L+ VD + K+ M AL
Sbjct: 345 MGGV-----IDLDYPQEDHFVEKFKIVVDRHMSMLRNLVADHLLESSVDSLAKEFMKQAL 399
Query: 293 PPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
PP+L+ +E + SV G+ G + TKVR +R + R
Sbjct: 400 PPILTEKEKKLSVI--GVSTDLLGS--DLIDFNAKTKVRFIRKHTQR 442
>gi|312065885|ref|XP_003136006.1| hypothetical protein LOAG_00418 [Loa loa]
gi|307768830|gb|EFO28064.1| hypothetical protein LOAG_00418 [Loa loa]
Length = 747
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 196/343 (57%), Gaps = 11/343 (3%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
++K+ ++ +YY NL ++ K DIL+ + +E+ N++I +Y++ +R T++ G+
Sbjct: 325 FQKKVFHVRHNNPSYYGNLFSTAKFIDILQTDYVEYGTNVNIAVYKNQERSTLNGSGKVY 384
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
PQ + + + GCSI+F NPQ++ + LQE+FGCFVGANTY TP GFAPH+D
Sbjct: 385 PQEIQKNIKAGCSIQFTNPQSFCDNVWYYCDLLQEVFGCFVGANTYITPANTAGFAPHWD 444
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT--LEPGDLLYLP 180
DI+AF+LQLEG+K WK+Y +E LPR SS NF+ +++ L LE GD+LY+P
Sbjct: 445 DIDAFLLQLEGRKHWKIYAHTDDNEMLPRLSSDNFTDDDVADRRLVFDDWLEQGDMLYIP 504
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG+IHQ + HSLH+T+SV + + DLLE+ +P A+ + RR LP GYL
Sbjct: 505 RGFIHQGFADKDVHSLHLTVSVCRNVTYADLLERIIPLAISNFAEQHVNIRRSLPAGYLD 564
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD--LDDGVDEMGKQLMHDALPPVLSP 298
G+ + + + L + + +YV + VD M ++ M ALPPVL+
Sbjct: 565 MTGVLECDYPLFKTGAVKLHRFLDAVFSNFCKYVKELYEPAVDMMAREFMKTALPPVLTK 624
Query: 299 EELQCSVFENGLRMSQTGEVYNATQ--ITKDTKVRLVRANAVR 339
EE +V G +Y Q TKDT ++L+R + R
Sbjct: 625 EEKDMTVL-----CIADGSLYGDNQHIFTKDTPIKLLRRHGQR 662
>gi|291221577|ref|XP_002730797.1| PREDICTED: CG2982-like [Saccoglossus kowalevskii]
Length = 530
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 126/281 (44%), Positives = 168/281 (59%), Gaps = 38/281 (13%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+PL + R +Y + ++ ++D I+R+N +EF +NID+T Y DG+R+T + EGRA
Sbjct: 202 WEKKPLLVKRHMASYNDGWFSTDEMDQIIRENELEFTRNIDVTSYTDGKRETHNPEGRAY 261
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P +VWD+Y+ F A T QGFAPHYD
Sbjct: 262 PSVVWDYYQ-----------------------------LFNSAGT-------QGFAPHYD 285
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DIEAF+LQLEGKK WK+Y PR DE LP+YSS NF +E++G PIL V LE GDLLY PRG
Sbjct: 286 DIEAFVLQLEGKKHWKLYSPRTEDETLPKYSSDNFKEEDVGDPILDVELEAGDLLYFPRG 345
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYA 242
IHQA T + HSLH+T+S QK W DL+EK +P+AL+ A D+EFR+ LP Y Y
Sbjct: 346 TIHQAETSGDTHSLHITVSTCQKNTWGDLMEKLVPRALELAFNEDIEFRQSLPRNYQSYM 405
Query: 243 GLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEM 283
G+ D + R+A + L+ KLV + +D VD+M
Sbjct: 406 GVVNSDNDDPK--RVAFLNKIGQLMMKLVSHSPVDAAVDQM 444
>gi|303274723|ref|XP_003056677.1| expressed protein [Micromonas pusilla CCMP1545]
gi|226461029|gb|EEH58322.1| expressed protein [Micromonas pusilla CCMP1545]
Length = 639
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 137/353 (38%), Positives = 195/353 (55%), Gaps = 26/353 (7%)
Query: 3 WEKEPLYISRK-QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
WE+ PLY+SR K Y + ++ + +D LR + + N+D+T Y +G R+T ++
Sbjct: 149 WERRPLYVSRNANKKYLDGWLSKEDVDGWLRAGKMRYGVNVDVTSYVNGVRETHNINDDD 208
Query: 61 --------ALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
A VW + E GCS+R L+PQ + L ++ ++++ + C G N Y TP
Sbjct: 209 ADDEEPWFADAASVWKKFDEKGCSLRILHPQRWKDALWKMLSTMERYWNCSTGCNVYLTP 268
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
P +QGF+PH+DDI+AF+LQLEGKK W+VY PR E LPRYSSPNF+Q+EIG P+L V L
Sbjct: 269 PNSQGFSPHFDDIDAFVLQLEGKKTWRVYPPRSEKEMLPRYSSPNFAQDEIGEPVLEVIL 328
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFR 231
EPGD+LY+PRG +HQAS V EHSLHVT+S Q W D LE A+P A++ A A R
Sbjct: 329 EPGDVLYMPRGTVHQASCVDGEHSLHVTLSTNQFNTWADALELALPAAIRDAVAELPALR 388
Query: 232 RGLPIGYLRYAGLARGKPVDIQADRLAMKEN-----LKDLLAKLVEYVDLDDGVDEMGKQ 286
R P L+ G+ D + + L +L + ++ + LD D++G +
Sbjct: 389 RCPPPDMLQRLGIGASDDGDGSDGGGGDRRDAFVGALNELASAVLRRLPLDAAADQLGAR 448
Query: 287 LMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
LM LPP P L +G A ++T D+ VRL R
Sbjct: 449 LMKQRLPP--PPSHLATPKAASGPDA--------AKKVTMDSVVRLTAEGGAR 491
>gi|313230251|emb|CBY07955.1| unnamed protein product [Oikopleura dioica]
Length = 266
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 113/225 (50%), Positives = 159/225 (70%), Gaps = 3/225 (1%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ-RQTMDVEG 59
++WEK+P+ I R + +Y ++L ++ ++ +I+++N ++F N+D+T ++ RQT + G
Sbjct: 38 NYWEKKPILIRRNKADYNSSLFSTSQLANIIQNNPVQFGVNLDVTSWDKQHGRQTHNEPG 97
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
RA P VWD Y GCS+R LNPQTY K + L A LQE FGCF G+N+Y TPP+ GFAP
Sbjct: 98 RAWPSKVWDLYNNGCSVRMLNPQTYSKTIWSLCAGLQEFFGCFTGSNSYLTPPSTTGFAP 157
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-GTPILTVTLEPGDLLY 178
HYDD+E F++Q EG+K+WKV+ P + +LPR SS N+ EI G P + ++PGD+LY
Sbjct: 158 HYDDVEVFMVQTEGRKRWKVWKP-FDNSHLPRTSSRNYHPNEIKGEPEIYEIVQPGDVLY 216
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
+PRG+IHQ + EHSLHVTIS YQK AW DLLEK +P+AL A
Sbjct: 217 VPRGWIHQGECLCGEHSLHVTISTYQKNAWCDLLEKIVPQALCRA 261
>gi|268560752|ref|XP_002646283.1| Hypothetical protein CBG11988 [Caenorhabditis briggsae]
gi|284433497|sp|A8XEA2.1|NO66_CAEBR RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
Length = 776
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 202/343 (58%), Gaps = 21/343 (6%)
Query: 8 LYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVW 67
L + RK YY NL ++ + ++L + +E+ NI+I Y+DG R T++ +GR PQIV
Sbjct: 356 LVVHRKTPAYYGNLFSTTRFSELLEKHYLEYNMNINIAQYKDGIRTTLNGKGRVYPQIVK 415
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAF 127
H CS++ +NPQT+ + L LQE FGCFVGANTY TP + GFAPH+D+I+AF
Sbjct: 416 QHLHNLCSVQLVNPQTFDDRIWYLCEILQEQFGCFVGANTYLTPAGSSGFAPHWDEIDAF 475
Query: 128 ILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI--GTPILTVTLEPGDLLYLPRGYIH 185
+LQ+EG+K W+V+ P +E LP SS NF+++++ P+ +E GD++Y+PRGY H
Sbjct: 476 LLQVEGRKYWRVWAPESAEEELPLESSGNFTEDDMKGKEPVFEGWIEQGDMIYIPRGYTH 535
Query: 186 QASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
QA T HSLHVT+S ++ ++ +L+EK +P+A+ A + RRGLPIG G+
Sbjct: 536 QARTDKKVHSLHVTVSTGRQWSFANLMEKVIPEAVGALTEERHKLRRGLPIGLFDMGGVV 595
Query: 246 R---------GKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVL 296
+ I DR + L++L+A + LD VD M K+ M ALPP+L
Sbjct: 596 DLDYSQEEHFTEKFKIVVDRHMSR--LRNLVADHL----LDSSVDSMTKEFMKQALPPIL 649
Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ +E + SV GL + G+ + +TKV+ +R + R
Sbjct: 650 TDKEKKRSVI--GLSTNLLGD--DLIDFCANTKVKFIRKHTQR 688
>gi|402590342|gb|EJW84272.1| lysine-specific demethylase NO66 [Wuchereria bancrofti]
Length = 776
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 126/343 (36%), Positives = 195/343 (56%), Gaps = 11/343 (3%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
++K+ L++ NYY NL ++ K DIL+ + +E+ N+++ +Y++ QR T++ G+
Sbjct: 354 FQKKVLHVCHNNPNYYGNLFSTAKFIDILQTDYVEYGTNVNVAIYKNQQRSTLNGSGKVY 413
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
PQ + + GCSI+ NPQ++ + LQE+FGCFVGANTY TP A GFAPH+D
Sbjct: 414 PQEIQKSIKAGCSIQLTNPQSFCDNVWYYCDLLQEVFGCFVGANTYITPANAAGFAPHWD 473
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT--LEPGDLLYLP 180
DI+AF+LQLEG+K WK+Y P +E LPR S NF+ ++ L LE GD+LY+P
Sbjct: 474 DIDAFLLQLEGRKHWKIYAPDSDNEMLPRLPSGNFTDNDVINRTLVFDDWLEQGDMLYIP 533
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RGYIHQ + HSLH+T+SV + + DLLE+ +P AL ++ R+ LP YL
Sbjct: 534 RGYIHQGFADKDVHSLHLTVSVCRNVTYADLLERVIPPALSNFAEQNVNIRKSLPARYLD 593
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD--LDDGVDEMGKQLMHDALPPVLSP 298
G+ ++ + + L ++ + Y+ + VD M ++ M ALPPVL+
Sbjct: 594 MTGVLECDYPLLKTGTVKLHRFLDNIFSNFCIYIKELSEPAVDMMAREFMKTALPPVLTK 653
Query: 299 EELQCSVFENGLRMSQTGEVYNATQ--ITKDTKVRLVRANAVR 339
EE + +Y+ Q TK+T ++L+R + R
Sbjct: 654 EEKDMTAL-----CVAGSSLYSDKQHIFTKNTPIKLLRRHGQR 691
>gi|440794353|gb|ELR15514.1| cupin superfamily subfamily protein [Acanthamoeba castellanii str.
Neff]
Length = 1219
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 197/343 (57%), Gaps = 39/343 (11%)
Query: 2 HWEKEPLYISRKQ-KNYYNNLITSKKIDDILRDNV----IEFKKNIDITLYEDGQRQTMD 56
+WE+ PL I R+Q K+YY LI+ DD+ R + I+F +N+DIT Y+DG+R+T +
Sbjct: 831 YWEQAPLLIQRRQDKDYYAGLISK---DDLTRGLLEKFDIQFTRNLDITSYQDGKRKTHN 887
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
GRA P+ V ++EGCS+R LNP TY + + LQE FG +GAN Y TP + QG
Sbjct: 888 PAGRADPETVQQFFKEGCSVRMLNPATYCDGVWAVLEGLQEYFGYGMGANVYLTPASTQG 947
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
F+PHYDDIEAF+LQLEG K+W++Y P LPR+SS NF Q EIG ++ V L GD
Sbjct: 948 FSPHYDDIEAFVLQLEGSKRWRLYAPLDEPSTLPRFSSRNFEQSEIGAVVMDVVLHAGDF 1007
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
LY PRG+IHQ + W+DLLEK +P+ ++ A + LP
Sbjct: 1008 LYFPRGWIHQCCS------------------WLDLLEKMVPRGIEVA-EEEFGLISSLPR 1048
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVL 296
Y ++ G + K ++ +R E+ K LL ++ + +D D MG Q +H+ LPP L
Sbjct: 1049 QYGQFLGTGKSKLRALREERGTFLESYKQLLEGVLAAMPVDLAADRMGIQYIHNCLPPFL 1108
Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
S +E L ++++ E Q+T+ T+V+L+R A R
Sbjct: 1109 SADER--------LALARSKEA----QLTETTEVKLLRREAQR 1139
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 22/128 (17%)
Query: 170 TLEPGDLLYLPRGYIHQASTV------------------TNEHSLHVTISVYQKTAWIDL 211
TL GDL Y+P+ + + N+HSLHV ++ ++T W +
Sbjct: 391 TLLAGDLAYIPKQMAFTLTPIHSGSSTSSSSSSSSTSAEANKHSLHVLLAPSKRTTWSQM 450
Query: 212 LEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLV 271
+ + +Q ++E R+ LP+GY +Y GL D R + E + +L +
Sbjct: 451 FKDTIASGMQCLEKENIEARKTLPMGYQQYMGLVHSDKED-DPRRDSFLEQVSELFGSV- 508
Query: 272 EYVDLDDG 279
+DLD G
Sbjct: 509 --LDLDSG 514
>gi|444511429|gb|ELV09875.1| Lysine-specific demethylase NO66, partial [Tupaia chinensis]
Length = 365
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 125/275 (45%), Positives = 166/275 (60%), Gaps = 10/275 (3%)
Query: 73 GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGA--------NTYQTPPTAQGFAPHYDDI 124
GCS+R L PQ + + Q A LQE FG GA N Y TPP +QGFAPHYDDI
Sbjct: 9 GCSLRLLCPQAFSTTVWQFLAVLQEQFGSMAGAHVLLKAGSNVYLTPPNSQGFAPHYDDI 68
Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
EAF+LQLEG+K W+VY PR+ E L SSPNFSQ+++G P+L LEPGDLLY PRG+I
Sbjct: 69 EAFVLQLEGRKLWRVYRPRVPAEELALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRGFI 128
Query: 185 HQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGL 244
HQA HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y G
Sbjct: 129 HQAECQDGVHSLHLTVSTYQRNTWGDFLEAILPLAVQAAMDENVEFRRGLPRDFMDYMGA 188
Query: 245 ARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCS 304
D + R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E S
Sbjct: 189 QHSDSKDPR--RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERALS 246
Query: 305 VFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
V+ +R V Q+T +T+V +++ R
Sbjct: 247 VYGLPIRWEAGEPVNVGAQLTTETEVHMLQDGIAR 281
>gi|170595078|ref|XP_001902238.1| LD40453p [Brugia malayi]
gi|284433496|sp|A8QFQ3.1|NO66_BRUMA RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase NO66; AltName: Full=Histone lysine
demethylase NO66
gi|158590193|gb|EDP28913.1| LD40453p, putative [Brugia malayi]
Length = 710
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 196/342 (57%), Gaps = 9/342 (2%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
++K+ + NYY NL ++ K DIL+ + +E+ N+++ +Y++ QR T++ G+
Sbjct: 288 FQKKVFLVCHNNPNYYGNLFSTAKFIDILQTDYVEYGTNVNVAIYKNQQRSTLNGSGKVY 347
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
PQ + + GCSI+ NPQ++ + LQE+FGCFVGAN Y TP GFAPH+D
Sbjct: 348 PQAIQKSIKAGCSIQLTNPQSFCDNVWYYCDLLQEVFGCFVGANIYITPANTAGFAPHWD 407
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT--LEPGDLLYLP 180
DI+AF+LQLEG+K WK+Y P DE LPR S NF+ ++ +L LE GD+LY+P
Sbjct: 408 DIDAFLLQLEGRKHWKIYAPDSDDEMLPRLPSGNFTDNDVINRMLVFDDWLEQGDMLYIP 467
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RGYIHQ + HSLH+T+SV + + DLLE+ +P AL ++ R+ LP YL
Sbjct: 468 RGYIHQGFADKDVHSLHLTVSVCRNVTYADLLERVIPPALSNFAEQNVNIRKSLPARYLD 527
Query: 241 YAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDD-GVDEMGKQLMHDALPPVLSP 298
G+ ++ + + L + + +Y+ +L + VD M ++ M ALPPVL+
Sbjct: 528 MTGVLECDYPLLKTGTMKLHRFLDSIFSNFCKYIKELSEPAVDMMAREFMRTALPPVLTK 587
Query: 299 EELQCSVFENGLRMSQTGEVYNATQI-TKDTKVRLVRANAVR 339
EE + L ++ + + I TK+T ++L+R + R
Sbjct: 588 EEKDMT----ALCVAGSSLYGDKEHIFTKNTSIKLLRRHGQR 625
>gi|307110623|gb|EFN58859.1| hypothetical protein CHLNCDRAFT_140729 [Chlorella variabilis]
Length = 624
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 119/310 (38%), Positives = 185/310 (59%), Gaps = 19/310 (6%)
Query: 4 EKEPLYISRKQ-KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDG--QRQTMDVEGR 60
++ P+ ++R Q + Y++ L + ID ++R ++++ N+D+T Y G +R + G
Sbjct: 165 DEAPMLVTRPQNRTYFSGLFSKDDIDRLVRGGAMQYQINVDVTSYSAGGKKRANFNYNGS 224
Query: 61 ALP-----------QIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
P ++VW + +EGCS+R L+PQ + L +L A L+ C VG N Y
Sbjct: 225 TEPDEEAVPEMADAEVVWRRFEKEGCSVRLLHPQRFCDGLWRLVARLESFLQCPVGCNAY 284
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
TP QGFAPH+DDI+AF+LQLEGKK+W+++ P + LPRYSS +FS+EE+G ++
Sbjct: 285 LTPAGTQGFAPHWDDIDAFVLQLEGKKRWRLHAPTDPEHVLPRYSSRDFSEEELGECVMD 344
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL 228
V LEPGD+LY+PRG +HQA ++ + HSLH+TIS Q+ +W LE+A+P+AL+AA +
Sbjct: 345 VVLEPGDMLYMPRGTVHQAESMPDSHSLHITISANQQRSWTVFLEEALPEALRAAAQSRR 404
Query: 229 EFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE----YVDLDDGVDEMG 284
E RR LP Y Y G+ +P + +E A+L+E ++ LD D++
Sbjct: 405 ELRRTLPRDYFEYMGVMHERPEGDGGEEDPRREEFTQHCAELIEAVMHHMPLDGVADQLA 464
Query: 285 KQLMHDALPP 294
Q + LPP
Sbjct: 465 VQFLQQRLPP 474
>gi|380797767|gb|AFE70759.1| lysine-specific demethylase NO66, partial [Macaca mulatta]
Length = 339
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/250 (46%), Positives = 156/250 (62%), Gaps = 2/250 (0%)
Query: 90 QLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
Q A LQE FG G+N Y TPP +QGFAPHYDDIEAF+LQLEG+K W+VY PR+ E L
Sbjct: 8 QFLAVLQEQFGSMAGSNVYLTPPNSQGFAPHYDDIEAFVLQLEGRKLWRVYRPRVPTEEL 67
Query: 150 PRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWI 209
SSPNFSQ+++G P+L LEPGDLLY PRG+IHQA HSLH+T+S YQ+ W
Sbjct: 68 ALTSSPNFSQDDLGEPVLQTVLEPGDLLYFPRGFIHQAECQDGVHSLHLTLSTYQRNTWG 127
Query: 210 DLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAK 269
D LE +P A+QAA ++EFRRGLP ++ Y G D + R A E ++ L+A+
Sbjct: 128 DFLEAILPLAVQAAMEENVEFRRGLPRDFMDYMGAQHSDSKDPR--RTAFMEKVRVLVAR 185
Query: 270 LVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTK 329
L + +D D+ K +HD+LPPVL+ E SV+ +R V Q+T +T+
Sbjct: 186 LGHFAPVDAVADQRAKDFIHDSLPPVLTDRERALSVYGLPIRWEAGEPVNVGAQLTTETE 245
Query: 330 VRLVRANAVR 339
V +++ R
Sbjct: 246 VHMLQDGIAR 255
>gi|449684642|ref|XP_002167270.2| PREDICTED: MYC-induced nuclear antigen-like [Hydra magnipapillata]
Length = 419
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 180/310 (58%), Gaps = 24/310 (7%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG--- 59
WEK+PLYI R+ YY +L + + +IL + +EF+ ++++ Y D +++ ++ +G
Sbjct: 60 WEKKPLYIKRENSGYYGDLFSLSSMKEILAAHELEFETDVNVCRYVDNEKELLNEDGCLT 119
Query: 60 -RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+++ D + + + PQ Y L QL ++ FGC VG+N Y TP +QG A
Sbjct: 120 VDKFDKLMNDKH---ATFQLHQPQRYGTVLWQLMEKMETYFGCLVGSNVYITPKESQGLA 176
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PH DD+E FILQLEG K WK+Y P MV+ L R + + SQ+ IG PI+ +TLEPGDLLY
Sbjct: 177 PHCDDVEVFILQLEGTKHWKLYKP-MVE--LSRDYTQDLSQDSIGEPIMELTLEPGDLLY 233
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
PRG IHQA +V +S H+T+S YQ D + A+ +A+++A D+ FRRGLPI Y
Sbjct: 234 FPRGTIHQARSVGESYSTHITLSTYQNNTLGDFMSIAVSQAIESALENDVNFRRGLPINY 293
Query: 239 LRYAGLAR---------GKPVDIQADR-----LAMKENLKDLLAKLVEYVDLDDGVDEMG 284
L Y G A+ GK + ++ K+++K L+KL++++D++ D M
Sbjct: 294 LSYLGTAKNFSKYFDEDGKESKLSSNENNEKVKKFKDSVKKHLSKLIDHIDVNTAADMMS 353
Query: 285 KQLMHDALPP 294
M LPP
Sbjct: 354 YDFMASRLPP 363
>gi|344245523|gb|EGW01627.1| Nucleolar protein 66 [Cricetulus griseus]
Length = 320
Score = 224 bits (570), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 113/238 (47%), Positives = 148/238 (62%), Gaps = 2/238 (0%)
Query: 102 FVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE 161
GAN Y TPP +QGFAPHYDDIEAF+LQLEG+K W+VY PR E LP SSPNFSQE+
Sbjct: 1 MAGANVYLTPPNSQGFAPHYDDIEAFVLQLEGRKLWRVYRPRDPSEELPLTSSPNFSQED 60
Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
+G P+L LEPGDLLY PRG+IHQA EHSLH+T+S YQ+ W D LE +P A+Q
Sbjct: 61 LGEPVLKAVLEPGDLLYFPRGFIHQAECQDEEHSLHLTLSTYQRNTWGDFLEAVVPLAVQ 120
Query: 222 AAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
AA ++EFRRGLP ++ Y G D + R A E ++ L+A L + +D D
Sbjct: 121 AAIEENVEFRRGLPRDFMDYMGAQHSDSKDPR--RTAFMEKVRVLVAHLRHFAPVDAVAD 178
Query: 282 EMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ K +HD+LPPVL+ E SV + S V ++T +T+V +++ R
Sbjct: 179 QRAKDFIHDSLPPVLTTTEEALSVHGLPVHWSDGELVDEGARLTTETEVHMLQDGIAR 236
>gi|290985939|ref|XP_002675682.1| predicted protein [Naegleria gruberi]
gi|284089280|gb|EFC42938.1| predicted protein [Naegleria gruberi]
Length = 542
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 172/293 (58%), Gaps = 8/293 (2%)
Query: 4 EKEPLYISRKQ--KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD--VEG 59
EK L I R + +YY L + +I L D+ + + ++D+ LY +G+R T++ +
Sbjct: 122 EKRVLVIRRNEVYPDYYKGLYSLDEIKKTLVDHELRYSYDLDLALYRNGRRFTLNPNKDD 181
Query: 60 RALPQIVWDHYR-EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
A P +VWD Y E CSIR L PQ + L L +E FG G N Y TP +QGFA
Sbjct: 182 VADPTLVWDLYENEKCSIRMLRPQEHSDVLLSLLCHFEEYFGQGAGLNAYLTPAGSQGFA 241
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-GTPILTVTLEPGDLL 177
PHYDDIEAF++QLEG+K WK+Y P +YL R+SS NF+QEE+ G + L+PGD+L
Sbjct: 242 PHYDDIEAFLIQLEGEKHWKIYRPLENQQYLDRFSSKNFTQEEVAGFECFEILLKPGDML 301
Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
Y+P+G IHQA T ++HSLH+T+S +W D LEKA+P ALQ A ++ R LP
Sbjct: 302 YVPKGVIHQAVTSQDQHSLHITVSTSHLMSWTDYLEKALPLALQMATENHVDLRTALPKD 361
Query: 238 YLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHD 290
+ Y G + D + + +K +++ ++ + +D DEM K+ MHD
Sbjct: 362 FKEYMGTNIAECEDPRREEFILK--CSEMVGLVMNSIPMDQAADEMVKKFMHD 412
>gi|320164710|gb|EFW41609.1| nucleolar protein 66 [Capsaspora owczarzaki ATCC 30864]
Length = 546
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/247 (42%), Positives = 146/247 (59%), Gaps = 20/247 (8%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WE +PL + R + NYY+ + ++ + ++ + I F KN+D+TLYE+ +R T+ GR L
Sbjct: 179 WESKPLVVRRSRPNYYSEVFSTATMFELFSKHDIHFTKNVDVTLYENDERVTVSPPGRVL 238
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P I +GCS+R LNP +Y + L A LQE FG GAN Y TP +QGFAPHYD
Sbjct: 239 PPIAQSFLDDGCSLRLLNPHSYDDRVWSLLARLQEAFGAGAGANVYLTPCESQGFAPHYD 298
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
DI+AF+LQLEG K+WK++ PR VD LP SS + +++ P +TV L PGDLLY+PRG
Sbjct: 299 DIDAFVLQLEGTKEWKLHRPRPVDPVLPFISSRDIPADQLEEPFITVQLNPGDLLYMPRG 358
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ-AAGATDLEFRRGLPIGYLRY 241
IHQ S W LLEKAMP L+ A+ + ++ R+ +P +L
Sbjct: 359 TIHQNS-------------------WFHLLEKAMPAVLEHASRSGTIDIRQSVPASFLHN 399
Query: 242 AGLARGK 248
AR +
Sbjct: 400 GASARAQ 406
>gi|440797566|gb|ELR18649.1| cupin superfamily subfamily protein [Acanthamoeba castellanii str.
Neff]
Length = 511
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 134/356 (37%), Positives = 197/356 (55%), Gaps = 21/356 (5%)
Query: 1 DHWEKEPLYISRKQKNYYN------NLITSKKIDDILRDN-VIEFKKNIDITLYEDGQRQ 53
+HWEK+PL + RK++ + +L + + + N + + KN+++ YEDG+++
Sbjct: 9 EHWEKKPLLVKRKERKVGDGNRLTKDLFSFATFERLFSSNRPLFYTKNLNVCRYEDGEKK 68
Query: 54 TMDVEGR-ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
++ EG A + V +REG + +F PQ + + L ++ A L++ FG VGAN Y TPP
Sbjct: 69 MLNPEGSPASLKEVRKFWREGSTFQFFQPQQHSEVLHRMIAGLEQYFGALVGANIYMTPP 128
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
AQG APHYDD+E FILQLEGKKKWK+Y P DE LPR S + Q+E G P++ + LE
Sbjct: 129 NAQGLAPHYDDVEVFILQLEGKKKWKLYSPPPSDE-LPREYSRDLDQDECGEPLMELELE 187
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
GDLLY PRG IHQA T + S HVTIS YQ+ W D L A+P A++ A D+EFRR
Sbjct: 188 EGDLLYFPRGTIHQACTGPSA-STHVTISTYQRHTWGDFLAVALPLAIKEAIEEDVEFRR 246
Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKE-----NLKDLLAKLVEYVDLDDGVDEMGKQL 287
GLP + G +GK + A K+ + LL +L ++ L D
Sbjct: 247 GLPGEASQ--GKGKGKKASSGSKGQAAKQQQFESKMVSLLHRLSGHLLLHKAADITSLDF 304
Query: 288 MHDALPPV----LSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
++ LPP +P+E + + S+ +V A + D+ +RL N +R
Sbjct: 305 VNHRLPPASLCYAAPQEEAQTKKATKKKQSKEPQVQEAQVLDDDSSIRLAEPNYIR 360
>gi|301117762|ref|XP_002906609.1| nucleolar protein, putative [Phytophthora infestans T30-4]
gi|262107958|gb|EEY66010.1| nucleolar protein, putative [Phytophthora infestans T30-4]
Length = 506
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 114/307 (37%), Positives = 175/307 (57%), Gaps = 22/307 (7%)
Query: 1 DHWEKEPLYISRKQKN--YYNNLITSKKIDDIL--RDNVIEFKKNIDITLYEDGQRQTMD 56
+++EK+PL++ + K + +NL + KK+ ++ + + F K++ + Y D +R+ D
Sbjct: 74 EYFEKKPLHVRKADKGALFDSNLFSRKKLLKVMEKQHRSLSFGKDLTVCRYVDSERENFD 133
Query: 57 VE---GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
E G A + V G S +F PQ Y L +LNA+ +++FG G++ Y TP
Sbjct: 134 GEDTNGHATSRQVASLLDRGYSCQFYQPQRYEDGLYELNAAFEDVFGGLAGSSAYLTPAN 193
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
+Q APH+DD+E F++Q +G+KKWK+Y P +V+ L S + ++++IG P + +T+E
Sbjct: 194 SQALAPHHDDVEVFVVQTQGRKKWKLYHP-LVE--LAGEHSSDLAEDQIGEPWMELTVEE 250
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
GDLLY PRG IHQA T E S HVTISVYQ W + LE A+P+ ++ A +D+ FR G
Sbjct: 251 GDLLYFPRGVIHQACTDEKEFSTHVTISVYQHNTWANFLEVALPRVIRHAFDSDVSFREG 310
Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKE---NLKDLLAKLVEYV---DLDDGVDEMGKQL 287
LP+GYL G AD KE K L+ KL +V DL + DE +
Sbjct: 311 LPVGYLDSVG------TQFPADSAKAKEFMATCKKLVGKLAAHVSEKDLQEAADEAALDV 364
Query: 288 MHDALPP 294
+ + LPP
Sbjct: 365 LANRLPP 371
>gi|223994815|ref|XP_002287091.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220978406|gb|EED96732.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 830
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 137/413 (33%), Positives = 199/413 (48%), Gaps = 98/413 (23%)
Query: 19 NNLITSKKIDDILRDNVIEFKKNIDITLYEDG------QRQTMD-----------VEGRA 61
N + K IDD++R N + + ++++T Y D R T+D EG A
Sbjct: 333 NGFLHRKSIDDMIRKNKLRYGLDLNVTRYTDSLGNGTRHRITLDPPPKKRKSKTGSEGGA 392
Query: 62 L---------PQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
+ P VW + C++R L P + + + + L+ FGC VG+N Y TP
Sbjct: 393 VDDVEYIVANPTDVWTNVDASHCTLRLLRPHEHNDNIHSMLSLLESEFGCMVGSNAYLTP 452
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI----GTPIL 167
+QGFAPHYDD++ FILQLEG K+W+VY P E LPR SS +++++E+ G +L
Sbjct: 453 LHSQGFAPHYDDVDVFILQLEGYKRWRVYAPMNKQETLPRVSSRDYTEKEVEESMGEEVL 512
Query: 168 TVTLEPGDLLYLPRGYIHQASTV------------TNEHSLHVTISVYQKTAWIDLLEKA 215
V L PGD+LYLPRG+IHQA TV T+ HSLH+T+S Q W D LE
Sbjct: 513 DVVLVPGDVLYLPRGWIHQAETVARPSHVSKLPGITDAHSLHLTVSAMQNWCWADFLEIL 572
Query: 216 MPKALQAAGA--TDLEFRRGLPIGYLRYAG--------------------------LARG 247
MP+AL++A A T + R GLP +L Y G +A+G
Sbjct: 573 MPEALESASASETSISLRDGLPRNFLAYMGTMHQLDDEGGELPEGLKQVAEAYAKRVAKG 632
Query: 248 KPVDIQAD------------RLAMKENLKDLLAKLVEYVD------LDDGVDEMGKQLMH 289
D + D A+K+ K+ K + V LDD D++GK+ +
Sbjct: 633 ADKDEEIDDEALAEVMHRQRTAALKKQFKEEAKKRIMRVAKQAMSMLDDACDQIGKRFLS 692
Query: 290 DALPPVLSPEELQCSVFENGLRMSQTGEVYN---ATQITKDTKVRLVRANAVR 339
D LPP L P E + + SQ+ ++ + +I ++ RLVR N R
Sbjct: 693 DRLPPALLPHEATLT------KESQSAAAHHHDASKKIWPNSLCRLVRPNIAR 739
>gi|291227952|ref|XP_002733946.1| PREDICTED: CG2982-like [Saccoglossus kowalevskii]
Length = 513
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 179/311 (57%), Gaps = 25/311 (8%)
Query: 1 DHWEKEPLYI---SRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
D+WEK+PL + K+ Y+ L + K++ I + N + F ++++ Y +G+R++++
Sbjct: 94 DYWEKKPLILHGEGSKRAAYFKQLFSMKQLRVISKKNSVRFVEDVNCCRYVNGKRESLNE 153
Query: 58 EGRALPQIVWDHYREGC--SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
+G A + + + G +++F PQ + L ++ ++L+ LFGC VGAN Y TP +Q
Sbjct: 154 DGDATVEKI-NRIFSGLKGTLQFHQPQRFQDELWKIQSNLESLFGCLVGANVYITPAESQ 212
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G APH+DD+E FI QLEG+K W++Y P + LPR S + SQ EIG P L+PGD
Sbjct: 213 GLAPHHDDVEVFIFQLEGEKHWRLYKPSLD---LPRDYSRDLSQNEIGDPTHDFILKPGD 269
Query: 176 LLYLPRGYIHQASTVTNE-HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
++Y PRG IHQA T+++ HS H+T+S YQ W DLL MP L+ + EFRRG+
Sbjct: 270 VMYFPRGTIHQADTLSSSTHSTHITMSTYQHNTWGDLLSVMMPNLLEKEMDKNKEFRRGI 329
Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDL--DDGVDEMGKQLMHDAL 292
P G L G+++ E+LK LL +L E ++ D +EM + + L
Sbjct: 330 P-GTLITQGISKPN-----------GEHLKMLLHQLAESIESCDDSSANEMKLDFISNRL 377
Query: 293 PPVL-SPEELQ 302
PP + EELQ
Sbjct: 378 PPYHNNTEELQ 388
>gi|348688581|gb|EGZ28395.1| hypothetical protein PHYSODRAFT_473010 [Phytophthora sojae]
Length = 550
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 110/300 (36%), Positives = 169/300 (56%), Gaps = 9/300 (3%)
Query: 1 DHWEKEPLYISRKQKN--YYNNLITSKKIDDIL-RDNVIEFKKNIDITLYEDGQRQTMDV 57
+++EK+PL++ + K + + L + KK+ ++ + + F K++ + Y + +R+ D
Sbjct: 74 EYFEKKPLHVRKADKGELFGSGLFSRKKMLKVMEKQQHMGFGKDLTVCRYVNSERENFDG 133
Query: 58 E---GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
E G A + V G S +F PQ Y L +LNA+ +E+FG GA+ Y TP +
Sbjct: 134 EDSNGHATSRQVASLLDRGYSCQFYQPQRYADGLYELNAAFEEVFGGLAGASAYLTPANS 193
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
Q APH+DD+E F+LQ +G+KKWK+Y P +V+ L S + + ++IG P + +T+E G
Sbjct: 194 QALAPHHDDVEVFVLQTQGRKKWKLYRP-LVE--LAGEHSSDLAADQIGEPWMELTVEEG 250
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
DLLY PRG +HQA T + S HVTISVYQ W + LE A+P+ ++ A +D+ FR+GL
Sbjct: 251 DLLYFPRGIVHQACTDDKQFSTHVTISVYQHNTWANFLEVALPRVIRHAFDSDVAFRKGL 310
Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
P+GYL Y G A + L LA V DL + DE + + LPP
Sbjct: 311 PVGYLSYMGTQFPTDSAEAKKFAAACKKLVGQLATHVSDKDLQEAADEAALDVFANRLPP 370
>gi|219124973|ref|XP_002182765.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405559|gb|EEC45501.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 351
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 111/252 (44%), Positives = 158/252 (62%), Gaps = 12/252 (4%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT-AQGFAPHYDDI 124
+W + +GC+IR L P + L + L+ + C +GAN Y TPP+ +QGFAPHYDDI
Sbjct: 1 LWSRFDQGCTIRLLCPHKHSDSTHGLLSLLESEWTCMIGANAYLTPPSGSQGFAPHYDDI 60
Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG--TPILTVTLEPGDLLYLPRG 182
EAF LQLEGKK+WKVY P E LPR SS ++ + ++ P L V L+PGD+LY+PRG
Sbjct: 61 EAFCLQLEGKKRWKVYAPLQKSERLPRTSSEDYVEADLRDVEPALDVVLKPGDVLYMPRG 120
Query: 183 YIHQASTV--TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATD-LEFRRGLPIGYL 239
+IHQA T+ T+ +SLH+T+S Q+ AW DL+E +P+ALQ+A + D R+GLP G+L
Sbjct: 121 WIHQACTIDGTDGYSLHLTVSAMQQWAWADLMELLLPEALQSAASGDSTMLRQGLPRGFL 180
Query: 240 RYAGLARGKPVDIQADRLAMKENLKDLLAKLVE--YVD-LDDGVDEMGKQLMHDALPPVL 296
Y G + A+ L K +D A + E +D LD D++GK+ + D +PPVL
Sbjct: 181 NYMGAMYDQ--KDTAEILEQKAE-QDRTAAMDETGAIDMLDAACDQIGKRFLSDRVPPVL 237
Query: 297 SPEELQCSVFEN 308
+ E +V E+
Sbjct: 238 THLERSMTVHES 249
>gi|149605061|ref|XP_001516457.1| PREDICTED: lysine-specific demethylase NO66-like, partial
[Ornithorhynchus anatinus]
Length = 204
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 97/198 (48%), Positives = 124/198 (62%), Gaps = 7/198 (3%)
Query: 48 EDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT 107
ED R M R + W YR GCS+R L PQ + + + A LQE GC G N
Sbjct: 7 EDQARTLMGPGPRQ--KTAWGLYRAGCSLRLLCPQAFSAVVWHVLALLQEHLGCLAGCNA 64
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
Y TPP AQGFAPH DD+EAF++QLEG+K+W++Y P E LPR P+ + +G P+L
Sbjct: 65 YLTPPGAQGFAPHSDDVEAFVVQLEGRKRWRLYRP---AEELPRRPGPDLAPGALGRPLL 121
Query: 168 TVTLEPGDLLYLPRGYIHQAST--VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
V L PGDLLY PRG++HQA T +HSLH+T+S +Q+ +W DLLE +P A +AA
Sbjct: 122 QVVLRPGDLLYFPRGFVHQADCPAATGQHSLHLTLSAFQRNSWGDLLEPLLPAAKKAALE 181
Query: 226 TDLEFRRGLPIGYLRYAG 243
D +FRRGLP YL Y G
Sbjct: 182 EDADFRRGLPRDYLDYMG 199
>gi|427785655|gb|JAA58279.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 490
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 170/302 (56%), Gaps = 22/302 (7%)
Query: 2 HWEKEPLYISRK--QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
HWE+EP + Q +++ L + +L++ I F K++ + YE+G+R + G
Sbjct: 55 HWEQEPFVSQNRTSQAEFWSPLFGKDRFFALLKEKQIYFAKDVAVCKYEEGKRTNFERHG 114
Query: 60 RALPQIVWDHYR-EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
RA + + + +++ PQ ++ L ++ L+ FGC VG N Y TPP+AQG A
Sbjct: 115 RATETKMRKLFEGDKATLQIHQPQRWVDGLWEVLEKLECFFGCLVGCNVYITPPSAQGLA 174
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PHYDD+E FI+QLEG+K WK+Y P + LPR S +F+ +EIG P TL+PGDLLY
Sbjct: 175 PHYDDVEVFIVQLEGEKCWKLYKPPV---ELPRTYSKDFNADEIGQPTHEFTLKPGDLLY 231
Query: 179 LPRGYIHQAST--VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
+PRG IHQA T + HS HVTIS YQ D L+ P + AA + L+ RRGLP+
Sbjct: 232 MPRGTIHQALTPESSGTHSTHVTISTYQNHTIGDCLKSLAPDLIDAAQDSCLQLRRGLPM 291
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVL 296
+L ++ L + D++A A+ +++K++ D + ++L+HD + L
Sbjct: 292 QFLPHSRLTKA---DVEAALEAVLKHVKNI-----------DSPMPIAEELVHDFMRTRL 337
Query: 297 SP 298
P
Sbjct: 338 PP 339
>gi|428179240|gb|EKX48112.1| hypothetical protein GUITHDRAFT_137055 [Guillardia theta CCMP2712]
Length = 525
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 101/294 (34%), Positives = 168/294 (57%), Gaps = 8/294 (2%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDD-ILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
WE+ L++ R + N+Y N+++ K + + R V + N+ + Y+DG R+ + G +
Sbjct: 66 WEQRHLHVKRGRSNFYGNILSLKDVMRYVGRGRVSMARSNLRVCKYKDGVRRERRIRGTS 125
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
++ R G + + +PQ P+ + A L+++ VG++ Y TP +QG APH+
Sbjct: 126 ASELKRQICR-GYTAQIFSPQHLCTPVAAVCAQLEQVLHSLVGSSLYVTPSHSQGLAPHH 184
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
DD++ F+ QL G+K+W++Y P++ P+ S + +EIG P+L V L+PGDLLY+PR
Sbjct: 185 DDVDVFVFQLHGRKRWRLYAPKV---RWPKSYSRDMRDKEIGEPVLDVVLKPGDLLYMPR 241
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G +HQAST ++ S+H+TIS YQ AW DLL A+P A++ A + E RR +P+ LR
Sbjct: 242 GTVHQASTRSSP-SIHLTISAYQHQAWDDLLLAALPYAVRQAAEEEQELRRSIPLDTLRV 300
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD--LDDGVDEMGKQLMHDALP 293
G+A G R ++ E ++L +L E + +D VD M + +P
Sbjct: 301 LGMAHGVVNRTCERRESLLEKARNLTHRLAESMTGAVDKAVDLMAADFICGRMP 354
>gi|115647197|ref|XP_789366.2| PREDICTED: myc-induced nuclear antigen-like [Strongylocentrotus
purpuratus]
Length = 511
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/305 (33%), Positives = 176/305 (57%), Gaps = 17/305 (5%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE 58
++WEK+P+ IS ++K +Y+ +L T ++ ++ + I F ++ ++ Y+ R++++
Sbjct: 79 EYWEKQPIVISNREKHRDYFQSLFTRTILEGLVAEKKISFIQDCNVCRYKGEVRESLNGN 138
Query: 59 GRALPQIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
G P + + ++ +I+F PQ + + + L L+ FGC VG+N Y TP +QG
Sbjct: 139 GIVKPTKLKELLDQDKATIQFHQPQRFQESVWNLLEKLESYFGCLVGSNIYMTPKLSQGL 198
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
APHYDD+E F+LQLEG+K W++Y P + LPR S + Q E+G P + L+ GDL+
Sbjct: 199 APHYDDVEVFVLQLEGEKHWRLYKPPTL---LPRDYSRDLDQSELGEPTHDIVLKAGDLM 255
Query: 178 YLPRGYIHQAST-VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
Y PRG +HQA T T HS H+TIS YQ+++W DLL A+P +Q A + ++ +R+GLP+
Sbjct: 256 YFPRGTVHQADTPSTCSHSTHLTISTYQRSSWGDLLSIALPSMIQTAISENVSYRQGLPL 315
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDL--DDGVDEMGKQLMHDALPP 294
++ P + QA + DL+++L +++ + +EM + + LPP
Sbjct: 316 RFV-------SDP-NHQASSSETGRKMADLISQLSSHIESHGQEAANEMLCDFIANRLPP 367
Query: 295 VLSPE 299
E
Sbjct: 368 FCDGE 372
>gi|346468123|gb|AEO33906.1| hypothetical protein [Amblyomma maculatum]
Length = 497
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/254 (39%), Positives = 144/254 (56%), Gaps = 8/254 (3%)
Query: 2 HWEKEPLYI--SRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
HWE+EPL S + + L + +L++ I F K+I + Y DG+R + +G
Sbjct: 58 HWEQEPLISQNSADRTKLLSPLFGKENFFTLLKEKQIFFTKDIALCKYVDGKRTNFERQG 117
Query: 60 RALPQIVWDHYR-EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
RA + + +++ PQ ++ L +L L+ FGC VG N Y TPPTAQG A
Sbjct: 118 RATEAKARKLFENDKATMQIHQPQRWVDGLWELLEKLECFFGCLVGCNAYITPPTAQGLA 177
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PHYDD+E FI+QLEG+K WK+Y P + LPR S +F EIG P TL+P DLLY
Sbjct: 178 PHYDDVEVFIVQLEGEKCWKLYKPPI---ELPRTYSNDFDAAEIGEPSHEFTLKPXDLLY 234
Query: 179 LPRGYIHQAST--VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
+PRG IHQA T + HS H+TIS YQ D L+ P + AA + ++ R+GLP+
Sbjct: 235 MPRGTIHQAWTPESSGTHSTHITISTYQNHTVGDGLKSLAPNLIDAALDSCVQLRKGLPL 294
Query: 237 GYLRYAGLARGKPV 250
+L ++ L++ V
Sbjct: 295 QFLPHSSLSKASVV 308
>gi|326432547|gb|EGD78117.1| hypothetical protein PTSG_08995 [Salpingoeca sp. ATCC 50818]
Length = 912
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 127/199 (63%), Gaps = 9/199 (4%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
++GC+I+F PQ ++ LQ + +L+ FGC VGAN Y TPP +QG APH+DDIEAF+LQ
Sbjct: 220 KDGCTIQFHQPQRFMLELQDVMTTLESTFGCLVGANVYITPPNSQGLAPHHDDIEAFVLQ 279
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
+EG K WK+Y P E L SP+ + IG PI + GD+LY PRG +H+A T
Sbjct: 280 VEGSKHWKLYNPV---EQLASSYSPDLPRTAIGDPIYDGIINAGDVLYFPRGQVHEAQT- 335
Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
T+ S+HVTIS YQ+ + LL+KA+P L+ A + L+ RRGLP+ L+ +G A P+
Sbjct: 336 TDTLSIHVTISTYQRWTYGHLLQKALPSCLEGALSRVLDLRRGLPLDLLKSSGFALSPPL 395
Query: 251 DIQADRLAMKENLKDLLAK 269
+ A+K+ L+ LL +
Sbjct: 396 NP-----ALKDTLRALLQR 409
>gi|294881184|ref|XP_002769286.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239872564|gb|EER02004.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 521
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 168/352 (47%), Gaps = 57/352 (16%)
Query: 1 DHWEKEPLYISR-KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
++WEK+PL++ R ++YY+ + T + L + F ++++ E G + + E
Sbjct: 66 EYWEKKPLHVRRPTARDYYSGVWTKAMAEKTLTKHECRFGESVNFARVEAGVKVMHNGEE 125
Query: 60 --RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
+A + + + +G S +F+ PQ + KP L L+ FG GAN+Y TP + GF
Sbjct: 126 GEKATVEYMQGQFEDGVSCQFMQPQRFSKPCHALMERLENYFGTLWGANSYLTPANSVGF 185
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
APHYDD+E F+LQ EG K+W++Y D LP S ++++EE+ P +E GDLL
Sbjct: 186 APHYDDVEVFMLQTEGSKRWRLYDSPDDDGPLPMEYSRDYTEEELSLPYFDEVVEQGDLL 245
Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM-PKALQAAGATDLEFRRGLPI 236
Y+PRG +H +S H+T+S Y +W +LL+ + P AL A D+ FR G+P+
Sbjct: 246 YIPRGTVHFGCVSPEGYSHHLTVSTYYHNSWGELLQNLLIPGALAKAMKEDVGFREGVPV 305
Query: 237 GYLRYAG------------LARG-----------------------KPVD---------- 251
+ RY G L G + VD
Sbjct: 306 NWTRYMGRLMAPVTAETAALEAGGDDDNASDDDVEVGKDGSSTEEEEEVDAKVGSVPPQG 365
Query: 252 -IQADRLAM-------KENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPV 295
QAD A+ K ++K L+AKL EYVD DD D+ + PP
Sbjct: 366 ETQADPEALMKARKAFKAHVKGLIAKLSEYVDEDDAADQTAVDFVALRTPPA 417
>gi|145347679|ref|XP_001418290.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578519|gb|ABO96583.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 94/252 (37%), Positives = 135/252 (53%), Gaps = 24/252 (9%)
Query: 4 EKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD------- 56
++ + + R + +Y + +T + + RD + ++D+T Y DG R+T +
Sbjct: 81 DRRAVCVKRGGRAHYGSWMTMEAVTRATRDGEARYGVDLDVTSYVDGVRRTHNRNSDAGR 140
Query: 57 ------------VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVG 104
V+ A+ + R S+R L+PQT ++ A+L+ F C G
Sbjct: 141 EADDEDDDVGELVDADAVLTRRFGEERR--SVRLLHPQTRCDATWKILATLERYFECACG 198
Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT 164
N Y TP ++QGFAPHYDDI+AF+LQ+EG K+W+VY P DE PR SS NF+QEEI T
Sbjct: 199 CNVYVTPASSQGFAPHYDDIDAFVLQIEGAKRWRVYEP-FEDETHPRTSSRNFTQEEIAT 257
Query: 165 P--ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
+ LE GD LYLPRG+IHQA ++ HS+H T+S Q A D LE A+ AL +
Sbjct: 258 QRVVFDDVLEAGDFLYLPRGWIHQAECSSSTHSVHATLSTNQSNAPADALEIALNNALAS 317
Query: 223 AGATDLEFRRGL 234
E RR
Sbjct: 318 TIDGRAELRRSF 329
>gi|357621298|gb|EHJ73173.1| hypothetical protein KGM_18276 [Danaus plexippus]
Length = 341
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 121/188 (64%), Gaps = 6/188 (3%)
Query: 153 SSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
++P+ S EIG P+L V LE GD+LY PRGYIHQ T+ EHSLHVTIS+YQK AW DLL
Sbjct: 73 ANPHLS--EIGEPVLEVVLEAGDMLYFPRGYIHQGVTIDGEHSLHVTISMYQKHAWADLL 130
Query: 213 EKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLA-MKENLKDLLAKLV 271
EK +P ALQ A ++EFR GLP+ + GL D R A M+E +K L K+
Sbjct: 131 EKMIPAALQIAINENIEFREGLPLDIYDHFGLVHS---DTNTPRKAEMEEIVKRLFNKIK 187
Query: 272 EYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVR 331
+Y+ +D+ VD+M K+ DALPPVLS E +VF + M + G+V N +I DT++R
Sbjct: 188 DYLPIDEAVDQMNKKFQQDALPPVLSDFEKAVTVFGDSDVMIENGKVTNRVEIGLDTRIR 247
Query: 332 LVRANAVR 339
L+R N +R
Sbjct: 248 LLRKNILR 255
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/63 (52%), Positives = 49/63 (77%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
WEK+PL+I+RK+ +YY ++I++ ID++LR I+F KNIDIT Y DG+R+T + EGRA
Sbjct: 15 WEKKPLHIARKKPDYYKDVISTPVIDNMLRTENIQFTKNIDITSYVDGKRETHNPEGRAN 74
Query: 63 PQI 65
P +
Sbjct: 75 PHL 77
>gi|412990663|emb|CCO18035.1| unnamed protein product [Bathycoccus prasinos]
Length = 594
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 105/269 (39%), Positives = 157/269 (58%), Gaps = 31/269 (11%)
Query: 12 RKQ----KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQT-------MDVEGR 60
RKQ K+ Y +L+TS+ I+ L++ + + N+D+T Y+DG R+T +V
Sbjct: 158 RKQSVTSKSGYEDLLTSEDIERWLKEKKLTYGTNVDVTSYKDGVRRTHNANDGFSNVSKN 217
Query: 61 ALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
A + V + E GCS+R L+PQ + L L ++L+ + G N Y TP +QGFAP
Sbjct: 218 ADYETVAKRFEEDGCSVRVLHPQRWSDKLWALLSNLENFWQSPTGCNAYWTPANSQGFAP 277
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ------EEIGTPILTVTLEP 173
H+DDI+AF+LQ+EG+K+W++Y R V E PR+SS FS+ EE+ + V LEP
Sbjct: 278 HWDDIDAFVLQIEGRKRWRLYPTRDVSEKFPRFSSEEFSEEERKRLEEM--KYVEVILEP 335
Query: 174 GDLLYLPRGYIHQASTVT---------NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
GD++Y+PRG IH+A +T +E SLH+TISV Q+ + D+LE AL A
Sbjct: 336 GDVMYMPRGTIHEAFCITSEDENDGKKDESSLHLTISVNQRNTFADILEHVFSDALADAV 395
Query: 225 ATDLEFRRGLPIGYLRYAGLARGKPVDIQ 253
T++E RR P +LR+ L+R D+
Sbjct: 396 NTNVELRRCPPRDFLRH--LSRLDETDVN 422
>gi|407852401|gb|EKG05908.1| hypothetical protein TCSYLVIO_003010 [Trypanosoma cruzi]
Length = 476
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 86/252 (34%), Positives = 141/252 (55%), Gaps = 10/252 (3%)
Query: 2 HWEKEPLYISRKQKNYYNNL--------ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ 53
++EK PL+ S ++ + +++ + + + + + +I+I ++ Q++
Sbjct: 44 YFEKRPLHFSHGAATHFTEVQDGLPAVKWSTELMLQLTAEKSLSYTTDINIVRFDAVQKK 103
Query: 54 TMDV--EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
+ EG + + R+G S+RFL P YI + A L+E+F C G N+Y TP
Sbjct: 104 RVPFRSEGIVTEKEMKHSMRKGWSVRFLRPHEYIVENSAVLAMLEEVFACSCGLNSYWTP 163
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
+QGFAPHYDD++ F+LQLEG+K+W++Y P + L R+SS +++ EE+ P L
Sbjct: 164 ANSQGFAPHYDDVDVFLLQLEGEKEWRLYDPPERVDVLSRHSSEDYNPEELPKPTQIFRL 223
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFR 231
PGD+LY+PRG +HQ T + HSLHVT S Q W DL++ A+ ++ A + +R
Sbjct: 224 VPGDVLYMPRGTVHQGRTYNHAHSLHVTFSANQMNTWADLMKHAVTHVVEKLAANYIHWR 283
Query: 232 RGLPIGYLRYAG 243
R LP L+ G
Sbjct: 284 RSLPRSLLKRLG 295
>gi|407417792|gb|EKF38086.1| hypothetical protein MOQ_001707 [Trypanosoma cruzi marinkellei]
Length = 476
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 86/252 (34%), Positives = 143/252 (56%), Gaps = 10/252 (3%)
Query: 2 HWEKEPLYISRKQKNYYNNL--------ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ 53
++EK+PL+ S ++ + +++ + + + + + +I+I ++ Q++
Sbjct: 44 YFEKKPLHFSHGSATHFTRVQDGIPAVKWSTELMLQLAAEKSLSYTTDINIVRFDAVQKK 103
Query: 54 --TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
+ +EG + + R+G S+RFL P YI + A L+E+F C G N+Y TP
Sbjct: 104 RISFRLEGIVTEKDMKHSMRKGWSVRFLRPHEYIVENSAVLAILEEIFVCSCGLNSYWTP 163
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
+QGFAPHYDD++ F+LQLEG+K+W++Y P + L R+SS +++ EE+ P T L
Sbjct: 164 ANSQGFAPHYDDVDVFLLQLEGEKEWRLYDPPEKVDVLSRHSSEDYNPEELPKPKQTFHL 223
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFR 231
PGD+LY+PRG +HQ T + HSLHVT S Q W DL++ A+ ++ A + +R
Sbjct: 224 VPGDVLYMPRGTVHQGQTYNHAHSLHVTFSANQMNTWADLMKHAVTHVVEKLAANYIHWR 283
Query: 232 RGLPIGYLRYAG 243
LP L+ G
Sbjct: 284 HSLPRALLKRLG 295
>gi|327268744|ref|XP_003219156.1| PREDICTED: MYC-induced nuclear antigen-like [Anolis carolinensis]
Length = 470
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 106/309 (34%), Positives = 162/309 (52%), Gaps = 31/309 (10%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WEK+PL + R +YY++L ++ I++ + + K+I++ +G+++ +
Sbjct: 56 EYWEKKPLVLHRDDPTVASYYSSLFQLTDLEGIVKHGLY-YGKDINVCKSINGKKKVFNK 114
Query: 58 EGR-ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ + Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TPP +QG
Sbjct: 115 DGKVSYAQLKKDFDQKKATIQFHQPQRFKNELWKIQEKLECFFGSLVGSNIYITPPGSQG 174
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGD 175
PHYDDIE FILQLEG+K W++Y P +P N E+ IG P L+PGD
Sbjct: 175 LPPHYDDIEVFILQLEGEKHWRLYKPT-----IPLAQQYNAEPEDKIGAPTHNFILKPGD 229
Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
LLY PRG IHQA+T HS HVTIS YQ +W D +P + D+EFR+ +
Sbjct: 230 LLYFPRGTIHQAATPPGIAHSTHVTISTYQNHSWRDFFLDVVPGLILDMAKEDIEFRKSI 289
Query: 235 PIGYLRYAGLARGKP-----VDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
P L L+ + ADRL +EN K+L + +M K +
Sbjct: 290 PRQLLMKVDLSDSSRQLSGFLRCLADRL--EENGKELKS------------SDMKKDFIM 335
Query: 290 DALPPVLSP 298
+ LPP L P
Sbjct: 336 NRLPPFLGP 344
>gi|156362163|ref|XP_001625650.1| predicted protein [Nematostella vectensis]
gi|156212493|gb|EDO33550.1| predicted protein [Nematostella vectensis]
Length = 269
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 87/209 (41%), Positives = 130/209 (62%), Gaps = 7/209 (3%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV-EG 59
++WEK+PL I+R+ +Y L + ++ +L+ I + ++I++ Y DG+++ ++ EG
Sbjct: 63 NYWEKKPLVINREDSEFYGALFSKAFLEVLLKKKEINYVEDINVCRYIDGEKEFLNEDEG 122
Query: 60 R--ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
+I+ + +I+F PQ + L QLN +L+ FGC VGAN Y TPP AQG
Sbjct: 123 TKATASKIMKKVKDDNATIQFHQPQRFQDTLWQLNGNLERFFGCLVGANVYITPPNAQGL 182
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
APH+DD+E FILQLEG+K WK+Y P + E YS+ + ++ IG P TL+ GDLL
Sbjct: 183 APHHDDVEVFILQLEGEKNWKLYSPLV--ELALDYSA-DLEEDSIGKPTHEFTLKTGDLL 239
Query: 178 YLPRGYIHQASTV-TNEHSLHVTISVYQK 205
Y PRG IHQA T+ HS H+T+S YQ+
Sbjct: 240 YFPRGTIHQAETLKCGNHSTHITLSTYQQ 268
>gi|302850897|ref|XP_002956974.1| hypothetical protein VOLCADRAFT_107490 [Volvox carteri f.
nagariensis]
gi|300257692|gb|EFJ41937.1| hypothetical protein VOLCADRAFT_107490 [Volvox carteri f.
nagariensis]
Length = 756
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/285 (34%), Positives = 150/285 (52%), Gaps = 43/285 (15%)
Query: 1 DHWEKEPLYI--SRKQKNYYNNLITSK---KIDDIL--------------RDNVIEFKKN 41
++WEK+PL+I ++KN+++ L K+ D L + + + F+ +
Sbjct: 57 EYWEKKPLHIPADEERKNFFDGLFDFDELCKVSDALEAGVMLLMDQDPGPKGDALSFEDD 116
Query: 42 IDITL-----------------YEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTY 84
+D + Y DG+R+T+ E + H EG +++ PQ +
Sbjct: 117 VDDQMVRAIGPLVFGRDVVAVRYVDGERETLAGEFADSDTLRTLHETEGATLQLHQPQRF 176
Query: 85 IKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM 144
+ L +L A+L+ GC VG N Y TP QG APH+DD+E F+ Q +G KKW++Y P +
Sbjct: 177 VNALWRLVAALEAQSGCLVGCNAYVTPGGGQGLAPHHDDVELFVCQTKGSKKWRLYKP-V 235
Query: 145 VDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQ 204
D LP S + Q+E+G PIL VT+ PGD+LY+PRG HQA E S H+TIS YQ
Sbjct: 236 NDFELPSKVSGDLPQDELGEPILEVTMNPGDVLYVPRGIPHQA-VAQEEGSCHLTISTYQ 294
Query: 205 KTAWIDLLEKAMPKALQAAGAT-----DLEFRRGLPIGYLRYAGL 244
+ W DL K + A A G++ LE R+GLP+G + G+
Sbjct: 295 RWTWGDLAVKLLETARDAQGSSPASALPLELRKGLPLGLIFKHGI 339
>gi|323448506|gb|EGB04404.1| hypothetical protein AURANDRAFT_67230 [Aureococcus anophagefferens]
Length = 450
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 127/214 (59%), Gaps = 10/214 (4%)
Query: 40 KNIDITLYEDGQRQTMDVEGRALPQIVWDHYR---EGCSIRFLNPQTYIKPLQQLNASLQ 96
++ D+T E G R T+ +G + W EG ++R P + + L L+
Sbjct: 19 RDADVTRVERGARTTLTRDGEPV-DAAWAKRAFETEGATVRLRCPADSNERVHALCRCLE 77
Query: 97 ELFGCFVGANTYQTPP-TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP 155
+ FG VGAN+Y TP AQGFA H+DD++ F+LQ EG K W+++ P D LPR SS
Sbjct: 78 DEFGSTVGANSYLTPGGAAQGFAAHWDDVDVFVLQTEGSKTWQIFAPDADDLKLPRVSSQ 137
Query: 156 NFSQEEI----GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDL 211
+F E + GT VTL+PGD+LYLPRG++H+A+T SLHVT+S ++ AWIDL
Sbjct: 138 DFDDEALDALYGTLRAEVTLDPGDVLYLPRGFVHRAATGAAP-SLHVTLSAHRANAWIDL 196
Query: 212 LEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
LE+ +PKAL++A D R+ LP Y + G+A
Sbjct: 197 LERLLPKALESAQRGDCGLRKALPRDYAHFLGVA 230
>gi|157872976|ref|XP_001685008.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128079|emb|CAJ08208.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 624
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/284 (32%), Positives = 147/284 (51%), Gaps = 31/284 (10%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
++G S+RFL P +I+ +++ F C+ GAN+Y TP +QGFAPHYDD++ F LQ
Sbjct: 262 KDGWSVRFLRPNEFIESNSAFIGCIEKEFNCYCGANSYWTPANSQGFAPHYDDVDVFFLQ 321
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
LEG+K W +Y P + L R+SS +++ E TP T+TL+ GD+LY+PRG +HQ T
Sbjct: 322 LEGEKLWCLYDPPEDVDVLARHSSEDYAPERFPTPKHTITLKAGDVLYMPRGTVHQGKTT 381
Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGL------ 244
HSLH+T S Q +W D + +A ++ A LE+RR LP + G
Sbjct: 382 LKTHSLHITFSANQMNSWADFMSRAAQYTVETLAANKLEWRRALPRDMPQVMGEVNNPVF 441
Query: 245 --ARGKP---VDIQADRLAMKENLKDLLAK----LVEYVDLDDGVDEMGKQLMHDALPPV 295
G P + Q R ++ +++++A+ L + V++D D GK + PP
Sbjct: 442 RDTHGLPALSANQQDHRANLQTKVREMVAELTLLLTDEVNMDVCADVYGKDTIQKLQPP- 500
Query: 296 LSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ +G + +++ ++VRL+ N +R
Sbjct: 501 ---------------SRTYSGAAPASRKLSHASRVRLISRNCMR 529
>gi|71415552|ref|XP_809839.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70874281|gb|EAN87988.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 476
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 85/252 (33%), Positives = 140/252 (55%), Gaps = 10/252 (3%)
Query: 2 HWEKEPLYISRKQKNYYNNL--------ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ 53
++EK+PL+ S ++ + +++ + + + + + +I+I ++ Q++
Sbjct: 44 YFEKKPLHFSHGAATHFTEVQDGLPAVKWSTELMLQLAAEKSLSYTTDINIVRFDAVQKK 103
Query: 54 TMDV--EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
+ EG + + R+G S+RFL P YI + A L+E F C G N+Y TP
Sbjct: 104 RVPFRSEGIVTEKEMKHSMRKGWSVRFLRPHEYIVENSAVLAMLEEAFACSCGLNSYWTP 163
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
+QGFAPHYDD++ F+LQLEG+K+W++Y P + L R+SS +++ EE+ P L
Sbjct: 164 ANSQGFAPHYDDVDVFLLQLEGEKEWRLYDPPERVDVLSRHSSEDYNPEELPKPTQIFRL 223
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFR 231
PGD+LY+PRG +HQ + HSLHVT S Q W DL++ A+ ++ A + +R
Sbjct: 224 FPGDVLYMPRGTVHQGRKYNHAHSLHVTFSANQMNTWADLMKHAVTHVVEKLAANYIHWR 283
Query: 232 RGLPIGYLRYAG 243
R LP L+ G
Sbjct: 284 RSLPRSLLKRLG 295
>gi|389602291|ref|XP_001567036.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505387|emb|CAM42455.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 595
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 180/359 (50%), Gaps = 44/359 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLI---------TSKKIDDILRDNVIEFKKNIDITLYEDG-- 50
++EK+PL S Y+ + ++K++ + ++ + + ++++ ++
Sbjct: 153 YFEKKPLVASHGNCEYFARGLPGVVPPVDWSTKRMLEHIKMHPSRYGTDLNVVKFDSKLK 212
Query: 51 QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
+R + EG + + G S+RFL P +++ +++ F C+ G N+Y T
Sbjct: 213 KRVSYRTEGLVDAAELERCMKTGWSVRFLRPHEFLESNSAFIGCMEQEFNCYCGVNSYWT 272
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P +QGFAPHYDD++ F+LQLEG+K W++Y P + L R+SS +++ E++ TP T+T
Sbjct: 273 PANSQGFAPHYDDVDVFLLQLEGEKLWRLYDPPEEVDVLTRHSSEDYAPEQLPTPNHTIT 332
Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEF 230
L+ GD+LY+PRG +HQ T HSLH+T S Q +W D + ++ ++ A LE+
Sbjct: 333 LKAGDVLYMPRGTVHQGKTNLKTHSLHITFSANQMNSWADFMIRSAQYTIETLAANKLEW 392
Query: 231 RRGLPIGYLRYAGLARGKPV------------DIQADRLAMKENLKDLLAK----LVEYV 274
R LP + G A PV + Q R ++++ L+++ A+ L +
Sbjct: 393 RCALPRDVPQVMG-AINHPVFRNTHGLAVLSENQQERRASLQKKLREMAAEVTLLLTDET 451
Query: 275 DLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLV 333
++D D GK+ + PP M+ +G ++T + + VRL+
Sbjct: 452 NMDVCTDVYGKETIRKMQPP----------------SMAYSGAASSSTGLDHASHVRLI 494
>gi|401426122|ref|XP_003877545.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322493791|emb|CBZ29080.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 624
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 138/282 (48%), Gaps = 31/282 (10%)
Query: 73 GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
G S+RFL P +I+ S+++ F C+ G N+Y TP +QGFAPHYDD++ F+LQLE
Sbjct: 264 GWSVRFLRPNEFIESNSAFIGSMEKEFNCYCGVNSYWTPANSQGFAPHYDDVDVFLLQLE 323
Query: 133 GKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTN 192
G+K W +Y P + L R+SS +++ E TP +TL+ GD+LY+PRG +HQ T
Sbjct: 324 GEKLWCLYDPPENVDVLARHSSEDYAPELFPTPKHAITLKAGDVLYMPRGTVHQGKTTRK 383
Query: 193 EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDI 252
HSLH+T S Q +W D + +A ++ A LE+RR LP + G
Sbjct: 384 MHSLHITFSANQMNSWADFMSRAAQYTVETLAANKLEWRRALPRDMPQVMGEMNNPVFRD 443
Query: 253 QADRLAMKENLKDLLAK---------------LVEYVDLDDGVDEMGKQLMHDALPPVLS 297
A+ EN +D AK L + V++D D GK + PP
Sbjct: 444 THGLAALSENQQDHRAKLQTKVREMVAELTLLLTDEVNMDVCTDVYGKDTIRKLQPP--- 500
Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ +G +T + ++VRL+ N +R
Sbjct: 501 -------------SRTYSGTAPASTTLNHASRVRLISRNCMR 529
>gi|126336950|ref|XP_001380518.1| PREDICTED: MYC-induced nuclear antigen [Monodelphis domestica]
Length = 474
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 154/306 (50%), Gaps = 21/306 (6%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WE++PL I R + Y+ NL + ++ R ++ + +++++ +G+++ ++
Sbjct: 57 EYWEQKPLLIQRDDPSLAAYFQNLFQLADLKNLCRQGMV-YGRDMNVCRCVNGRKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 GGKVHYLQLRKDFDQKRATIQFHQPQRFKDELWKIQEKLECFFGSLVGSNVYITPSGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGD 175
PHYDDIE FILQLEGKK W++Y P +P N E+ IG P TL+PGD
Sbjct: 176 LPPHYDDIEVFILQLEGKKHWRLYQPT-----VPLAREYNVEPEDRIGAPTHEFTLKPGD 230
Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
LY PRG IHQA T + HS HVTIS YQ +W D L +P + A + G+
Sbjct: 231 FLYFPRGTIHQADTPSGLAHSTHVTISTYQNNSWGDFLLDVIPGFVFAVAKEKEKLGTGI 290
Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKEN-LKDLLAKLVEYVDLDDG--VDEMGKQLMHDA 291
P G+ R V QA A+ L L L E ++ EM K +
Sbjct: 291 P------GGMPRAALVSQQAHDTALATTRLSGFLRTLAEQLEGSGAPRPSEMKKDFVRSR 344
Query: 292 LPPVLS 297
LPP S
Sbjct: 345 LPPFYS 350
>gi|167527045|ref|XP_001747855.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773604|gb|EDQ87242.1| predicted protein [Monosiga brevicollis MX1]
Length = 595
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 121/382 (31%), Positives = 183/382 (47%), Gaps = 62/382 (16%)
Query: 2 HWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE 58
H+E+ LYI R+ ++ Y L + ++ +L+ + + ++++ Y DG+R+ +
Sbjct: 68 HFERSFLYIPREDRDPGIVYQGLFSLDQLYTLLQRESMFYGTDLNLCRY-DGERKLVLNG 126
Query: 59 GRA----LPQIVWDHYRE---------------------------------GCSIRFLNP 81
GR LP I +H +++F P
Sbjct: 127 GRNDTTDLPTINGNHSNSQRAEEQDSNDSDDSDELAEEALAADVRRRVEDLKATVQFHQP 186
Query: 82 QTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYL 141
Q +++ L L S ++ VGAN Y TP +QG APH+DD+E +ILQLEG+K W++Y
Sbjct: 187 QRFVRALHDLLYSFEQELTTLVGANVYITPANSQGLAPHHDDVEVYILQLEGEKAWRLYE 246
Query: 142 PRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
P E L S + +EE+ PI + L PGD LYLPRG IH+AS V N+HS H+TIS
Sbjct: 247 PI---EPLAMSYSADLDREELAQPIAELVLRPGDFLYLPRGTIHEASCVGNQHSTHITIS 303
Query: 202 VYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAG-LARGKPVDIQADRLAMK 260
+Q + L+ + +P+ + A + LE RRGLP G+LR G L G P A M+
Sbjct: 304 SHQNWNYGHLMAQTLPECITNAMSNVLELRRGLPHGFLRQHGVLHAGFPPPPSAVFHHMR 363
Query: 261 ENLK-DLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQT--GE 317
E L+ D + +E VD LPP ++ N L + T
Sbjct: 364 ELLQSDQVWSEIER-HFHQAVDSFALDYTTARLPP--------PALRRNPLNPAPTRVTP 414
Query: 318 VYNATQITKDTKVRLVRANAVR 339
V++AT TKVRLV + +R
Sbjct: 415 VFHAT-----TKVRLVYTDHIR 431
>gi|398019903|ref|XP_003863115.1| hypothetical protein, conserved [Leishmania donovani]
gi|322501347|emb|CBZ36425.1| hypothetical protein, conserved [Leishmania donovani]
Length = 507
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 143/281 (50%), Gaps = 25/281 (8%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
+ G S+RFL P +I+ +++ F C+ G N+Y TP +QGFAPHYDD++ F+LQ
Sbjct: 145 KGGWSVRFLRPNEFIESNSAFIGCMEKEFNCYCGVNSYWTPANSQGFAPHYDDVDVFLLQ 204
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
LEG+K W +Y P + L R+SS +++ E TP T+TL+ GD+LY+PRG +HQ T
Sbjct: 205 LEGEKLWCLYDPPEDVDVLARHSSEDYAPELFPTPKHTITLKAGDVLYMPRGTVHQGKTT 264
Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
HSLH+T S Q +W D + +A ++ A LE+RR LP + G PV
Sbjct: 265 LKTHSLHITFSANQMNSWADFMSRAAQYTVEKLAANKLEWRRALPRSMPQVVGEV-NNPV 323
Query: 251 ------------DIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
Q R ++ +++++A+L + + +D D + + P
Sbjct: 324 FRDTHGLAALSEKQQEHRANLQTKVREMVAELTLLLTDEGNMDMCADVYAKDTIRKLQPP 383
Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ +G +T+++ ++VRL+ N +R
Sbjct: 384 SR------------TYSGAAPASTRLSHASRVRLISRNCMR 412
>gi|156378500|ref|XP_001631180.1| predicted protein [Nematostella vectensis]
gi|156218216|gb|EDO39117.1| predicted protein [Nematostella vectensis]
Length = 109
Score = 159 bits (402), Expect = 2e-36, Method: Composition-based stats.
Identities = 66/108 (61%), Positives = 86/108 (79%)
Query: 36 IEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL 95
++F +N+D+T Y+DG+R+T + +GRA +VWD Y GCS+R LNPQ++ P+ +L + L
Sbjct: 2 VQFSRNLDVTSYKDGKRETHNPQGRAHAPVVWDFYNNGCSVRLLNPQSFSHPVWKLTSLL 61
Query: 96 QELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR 143
QE FGCFVGANTY TPP QGFAPHYDDIEAFI+QLEGKK W++Y PR
Sbjct: 62 QEYFGCFVGANTYLTPPGTQGFAPHYDDIEAFIIQLEGKKHWRLYNPR 109
>gi|387017158|gb|AFJ50697.1| MYC-induced nuclear antigen-like [Crotalus adamanteus]
Length = 468
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 137/242 (56%), Gaps = 14/242 (5%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WE++PL + R + YY +L + D+++ + + K+I+I +G++ +
Sbjct: 56 EYWEQKPLLLQRDDSSVASYYQSLFQLTDLKDLVKLGLY-YGKDINICRCMNGKKLIFNK 114
Query: 58 EGRALPQIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ + H+ ++ +I+F PQ + L ++ L+ FG VG+N Y TPP +QG
Sbjct: 115 DGKVTYAQLKKHFDQKKATIQFHQPQRFKDLLWRIQEKLECYFGSLVGSNVYITPPGSQG 174
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PHYDD+E FILQLEG+K W++Y P + EY + P +E IGTP L+PG
Sbjct: 175 LPPHYDDVEVFILQLEGEKHWRLYRPTVPLAQEY---NAEP---EERIGTPTHDFILKPG 228
Query: 175 DLLYLPRGYIHQAST-VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
D+LY PRG IHQA T + HS H+TIS YQ +W D L +P L D EFR+
Sbjct: 229 DMLYFPRGTIHQADTPLGISHSTHITISTYQNNSWRDYLLDMVPGLLLDIAKEDAEFRKN 288
Query: 234 LP 235
+P
Sbjct: 289 IP 290
>gi|146094469|ref|XP_001467285.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134071650|emb|CAM70340.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 507
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 143/281 (50%), Gaps = 25/281 (8%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
+ G S+RFL P +I+ +++ F C+ G N+Y TP +QGFAPHYDD++ F+LQ
Sbjct: 145 KGGWSVRFLRPNEFIESNSAFIGCMEKEFNCYCGVNSYWTPANSQGFAPHYDDVDVFLLQ 204
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
LEG+K W +Y P + L R+SS +++ E TP T+TL+ GD+LY+PRG +HQ T
Sbjct: 205 LEGEKLWCLYDPPEDVDVLARHSSEDYAPELFPTPKHTITLKAGDVLYMPRGTVHQGKTT 264
Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
HSLH+T S Q +W D + +A ++ A +E+RR LP + G PV
Sbjct: 265 LKTHSLHITFSANQMNSWADFMSRAAQYTVEKLAANKMEWRRALPRSMPQVVGEV-NNPV 323
Query: 251 ------------DIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
Q R ++ +++++A+L + + +D D + + P
Sbjct: 324 FRDTHGLAALSEKQQEHRANLQTKVREMVAELTLLLTDEGNMDMCADVYAKDTIRKLQPP 383
Query: 299 EELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ +G +T+++ ++VRL+ N +R
Sbjct: 384 SR------------TYSGAAPASTRLSHASRVRLISRNCMR 412
>gi|442761549|gb|JAA72933.1| Putative nucleolar protein, partial [Ixodes ricinus]
Length = 439
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 100/286 (34%), Positives = 144/286 (50%), Gaps = 26/286 (9%)
Query: 17 YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYRE--GC 74
++ L + I ++ + F K++ YEDG+R T EG + + Y E
Sbjct: 42 FWPQLFSKDAFFSIAKETKLYFGKDVSACKYEDGKR-TDYAEGYSAKSAKLNKYFEERKA 100
Query: 75 SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGK 134
+++ PQ + L ++ ++ FGC VG N Y TP +QG APH+DD+E FI+QLEG+
Sbjct: 101 TLQVHQPQRWKDSLWEVLELMECFFGCLVGCNAYITPAGSQGLAPHHDDVEVFIVQLEGE 160
Query: 135 KKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST--VTN 192
K WK++ P V E L R S +F+ EEIG P TL PGD LY+PRG IH A +
Sbjct: 161 KCWKLHRP--VTE-LARIYSKDFTSEEIGEPTHEFTLRPGDFLYMPRGTIHHAYVPESAD 217
Query: 193 EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDI 252
HS H+TIS YQK D L +P + +A + +E R+GLP +L L
Sbjct: 218 SHSTHITISTYQKQTVGDCLMDIVPDLISSAMDSCIELRKGLPNKFLPSCALP------- 270
Query: 253 QADRLAMKENLKDLLAKLVEYV----DLDDGVDEMGKQLMHDALPP 294
KE + L ++E+V D +EM + M LPP
Sbjct: 271 -------KETVVTALTSVLEHVKQMSDEMPSPEEMVRDFMFSRLPP 309
>gi|449485668|ref|XP_004176314.1| PREDICTED: LOW QUALITY PROTEIN: MYC-induced nuclear antigen
[Taeniopygia guttata]
Length = 471
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 155/304 (50%), Gaps = 26/304 (8%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
D+WE++PL + R YY +L + ++ + + +++++ +G+++ ++
Sbjct: 57 DYWEQKPLLVQRNDVLVAAYYQSLFQLSHLKELCSQGLY-YGRDVNVCRCVNGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
EGR Q+ D ++ +I+F PQ + + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 EGRVNYMQLKKDFDQKKATIQFHQPQRFKEELWKIQEKLECYFGSLVGSNVYITPQGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGD 175
PHYDD+E FILQLEG+K W++Y P + P N EE IG+P L+PGD
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYKPTV-----PLAREYNVEPEERIGSPTHEFILKPGD 230
Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
LLY PRG IHQA T +S HVTIS YQ +W D L A+P + A D+ R +
Sbjct: 231 LLYFPRGTIHQADTPHGISYSTHVTISTYQNNSWGDFLLDAIPGLVFDAAKEDVALRTSI 290
Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGV--DEMGKQLMHDAL 292
P L +Q + + L LL +L + ++ + +M K + L
Sbjct: 291 PRQLL------------MQVNTADCTKKLSSLLRRLADSLENSRELRSSDMKKDFIMHRL 338
Query: 293 PPVL 296
PP L
Sbjct: 339 PPCL 342
>gi|326913124|ref|XP_003202891.1| PREDICTED: MYC-induced nuclear antigen-like [Meleagris gallopavo]
Length = 473
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 101/306 (33%), Positives = 158/306 (51%), Gaps = 30/306 (9%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WE++PL + R YY +L + ++ + + ++I+I +G+++ ++
Sbjct: 57 EYWEEKPLLVQRNDPLLATYYQSLFQLSDLKELCSQGLY-YGRDINICRCVNGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
EGR Q+ D ++ +I+F PQ + + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 EGRVNYAQLKKDFDQKKATIQFHQPQRFKEELWKIQEKLECYFGSLVGSNVYITPQGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + +L R + S++ IG P L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYKPTV---HLAREYNVE-SEDRIGNPTHEFVLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T +S HVTIS YQ +W D L A+P + D+ R +P
Sbjct: 232 LYFPRGTIHQADTPPGVSYSTHVTISTYQNNSWGDFLLDAIPGLVFNTAKEDVALRTSIP 291
Query: 236 IGYLRYAGLARGKP-----VDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHD 290
L +A + + ADRL EN ++L + +M K + +
Sbjct: 292 RKLLMQVDIADSTKKLSSILRMLADRL---ENSRELRSS------------DMRKDFIMN 336
Query: 291 ALPPVL 296
LPP L
Sbjct: 337 RLPPCL 342
>gi|26327519|dbj|BAC27503.1| unnamed protein product [Mus musculus]
Length = 465
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 151/303 (49%), Gaps = 23/303 (7%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + + + V + +++++ +G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSINGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GRA Q+ D ++ +I+F PQ Y L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PHYDD+E FILQLEG K W++Y P + EY S++ IGTP L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTPTHDFLLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T + +S+H+TIS YQ +W D L ++ + D+ R G
Sbjct: 230 DLLYFPRGTIHQAETPSGLAYSIHLTISTYQNNSWGDCLLDSISGFVFDIAKEDVALRSG 289
Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
+P R L P D+ L D L E + D M K + LP
Sbjct: 290 MP----RRMLLNVETPADVTRKLSGFLRTLADQLEGREELLSSD-----MKKDFVKHRLP 340
Query: 294 PVL 296
P
Sbjct: 341 PFF 343
>gi|281353888|gb|EFB29472.1| hypothetical protein PANDA_008768 [Ailuropoda melanoleuca]
Length = 464
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 152/304 (50%), Gaps = 30/304 (9%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WEK+PL I R YY +L + + + + ++I++ +G+++ ++
Sbjct: 58 EFWEKKPLLIQRNDPAVATYYQSLFRLSDLKSLCSRGMY-YGRDINVCRCVNGKKKVLNK 116
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 117 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 176
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E F+LQLEG+K W++Y P + L R S S++ IG P TL+PGD
Sbjct: 177 LPPHYDDVEVFVLQLEGEKHWRLYHPTVP---LAREYSVE-SEDRIGRPAHEFTLKPGDF 232
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ +W D L + + DLEFR G+P
Sbjct: 233 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDLEFRAGIP 292
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE-----MGKQLMHD 290
L +Q D A+ L L L + + +G E M K + +
Sbjct: 293 RQLL------------LQVDATAVATRLSAFLRTLADRL---EGTKELLSADMKKDFVMN 337
Query: 291 ALPP 294
LPP
Sbjct: 338 RLPP 341
>gi|395528782|ref|XP_003766504.1| PREDICTED: MYC-induced nuclear antigen [Sarcophilus harrisii]
Length = 453
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 103/300 (34%), Positives = 151/300 (50%), Gaps = 22/300 (7%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WE+ PL I R + Y+ +L + + R + + ++I++ +G+++ ++
Sbjct: 44 EYWEQRPLLIQRDDPSLAAYFQSLFQLTDLKHLCRQGMY-YGRDINVCRCVNGRKKVLNK 102
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 103 SSKVHYLQLRKDFDQKRATIQFHQPQRFKDELWKIQEKLECFFGSLVGSNVYITPSESQG 162
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE-IGTPILTVTLEPGD 175
PHYDDIE FILQLEGKK W++Y P +P N E+ IG P TL+PGD
Sbjct: 163 LPPHYDDIEVFILQLEGKKHWRLYQPT-----VPLAREYNVEPEDKIGAPTHEFTLKPGD 217
Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
LLY PRG IHQA T HS HVTIS YQ +W D L +P L +++FR G+
Sbjct: 218 LLYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDVIPGLLFDTAKKEIKFRTGI 277
Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
P L A + +RL+ LLA +E D EM K + LPP
Sbjct: 278 PRQLLMVDDTA------VSTERLS---GFLRLLADQLEGTR-DLRSSEMKKDFIRSRLPP 327
>gi|256355182|ref|NP_080186.3| bifunctional lysine-specific demethylase and histidyl-hydroxylase
MINA [Mus musculus]
gi|160013499|sp|Q8CD15.2|MINA_MOUSE RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase MINA; AltName: Full=Histone lysine
demethylase MINA; AltName: Full=MYC-induced nuclear
antigen
gi|12841599|dbj|BAB25275.1| unnamed protein product [Mus musculus]
gi|12850814|dbj|BAB28860.1| unnamed protein product [Mus musculus]
gi|34849684|gb|AAH58242.1| Mina protein [Mus musculus]
gi|54144407|dbj|BAD60965.1| Mina53 [Mus musculus]
Length = 465
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 150/303 (49%), Gaps = 23/303 (7%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + + + V + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSISGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GRA Q+ D ++ +I+F PQ Y L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PHYDD+E FILQLEG K W++Y P + EY S++ IGTP L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTPTHDFLLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T + +S+H+TIS YQ +W D L ++ + D+ R G
Sbjct: 230 DLLYFPRGTIHQAETPSGLAYSIHLTISTYQNNSWGDCLLDSISGFVFDIAKEDVALRSG 289
Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
+P R L P D+ L D L E + D M K + LP
Sbjct: 290 MP----RRMLLNVETPADVTRKLSGFLRTLADQLEGREELLSSD-----MKKDFVKHRLP 340
Query: 294 PVL 296
P
Sbjct: 341 PFF 343
>gi|19263523|gb|AAH25109.1| Mina protein [Mus musculus]
gi|19483936|gb|AAH23462.1| Myc induced nuclear antigen [Mus musculus]
Length = 465
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 150/303 (49%), Gaps = 23/303 (7%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + + + V + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSISGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GRA Q+ D ++ +I+F PQ Y L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PHYDD+E FILQLEG K W++Y P + EY S++ IGTP L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTPTHDFLLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T + +S+H+TIS YQ +W D L ++ + D+ R G
Sbjct: 230 DLLYFPRGTIHQAETPSGLAYSIHLTISTYQNNSWGDCLLDSISGFVFDIAKEDVALRSG 289
Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
+P R L P D+ L D L E + D M K + LP
Sbjct: 290 MP----RRMLLNVETPADVTRKLSGFLRTLADQLEGREELLSSD-----MKKDFVKHRLP 340
Query: 294 PVL 296
P
Sbjct: 341 PFF 343
>gi|242000530|ref|XP_002434908.1| MYC-induced nuclear antigen, putative [Ixodes scapularis]
gi|215498238|gb|EEC07732.1| MYC-induced nuclear antigen, putative [Ixodes scapularis]
Length = 472
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 102/303 (33%), Positives = 151/303 (49%), Gaps = 30/303 (9%)
Query: 2 HWEKEPLYISRK--QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
+WE+EP + + ++ L + I ++ + F K++ YEDG+R EG
Sbjct: 32 YWEREPFVAHDRAGMRAFWPQLFSKDAFFSIAKETKLYFGKDVSACKYEDGKRSDY-AEG 90
Query: 60 RALPQIVWDHYRE--GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
+ + Y E +++ PQ + L ++ ++ FGC VG N Y TP +QG
Sbjct: 91 YSAKSAKLNKYFEERKATLQVHQPQRWKDSLWEVLELMERFFGCLVGCNAYITPAGSQGL 150
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
APH+DD+ FI+QLEG+K WK++ P V E L R S +F+ EEIG P TL PGD L
Sbjct: 151 APHHDDV--FIVQLEGEKCWKLHKP--VTE-LARIYSKDFTSEEIGEPTHEFTLRPGDFL 205
Query: 178 YLPRGYIHQAST--VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
Y+PRG IH A + HS H+TIS YQK D L P + +A + +E R+GLP
Sbjct: 206 YMPRGTIHHAYVPESADSHSTHITISTYQKQTVGDCLMDIAPDLISSAMDSCIELRKGLP 265
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV----DLDDGVDEMGKQLMHDA 291
+L L+ KE + L+ ++E+V D +EM + M
Sbjct: 266 NRFLPSCVLS--------------KETVVTALSSVLEHVKQMPDEMPSPEEMVRDFMFSR 311
Query: 292 LPP 294
LPP
Sbjct: 312 LPP 314
>gi|308805368|ref|XP_003079996.1| unnamed protein product [Ostreococcus tauri]
gi|116058453|emb|CAL53642.1| unnamed protein product [Ostreococcus tauri]
Length = 455
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/190 (46%), Positives = 113/190 (59%), Gaps = 16/190 (8%)
Query: 41 NIDITLYEDGQRQTM-------DVEGRALPQIV-----WDHYRE-GCSIRFLNPQTYIKP 87
+ D+T Y DG R+T DV A +IV YRE G SIR L+PQT
Sbjct: 63 DADVTSYVDGVRRTHNSNDDTHDVCDEASNEIVDAKAVMRAYRERGRSIRLLHPQTRHDA 122
Query: 88 LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDE 147
++ A+L+ F C G N Y TP AQGFAPHYDDI+A++LQ+EG+K+W+VY P DE
Sbjct: 123 TWKMLATLESHFECACGCNVYVTPANAQGFAPHYDDIDAYVLQIEGEKRWRVYAPFQSDE 182
Query: 148 YLPRYSSPNFSQEEI-GTPIL-TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
LPR SS N++QEEI G +L LE GD LY+PRG++HQA + HS+H TIS Q
Sbjct: 183 -LPRTSSKNYTQEEIAGLEVLFDGVLEAGDFLYIPRGFVHQAECSSRAHSVHATISTNQA 241
Query: 206 TAWIDLLEKA 215
D E A
Sbjct: 242 NTHADAFEIA 251
>gi|301607059|ref|XP_002933117.1| PREDICTED: myc-induced nuclear antigen-like [Xenopus (Silurana)
tropicalis]
Length = 460
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 99/306 (32%), Positives = 159/306 (51%), Gaps = 22/306 (7%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVE 58
D+WE +PL + + + T ++ D+ + I +++++++ +DG++ ++
Sbjct: 54 DYWETKPLLLQGRDPAFAAYYQTLFRLSDLKHIAGGSIYYERDVNVFKCKDGKKISLPRH 113
Query: 59 GRALPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
G+A + + G +I+F PQ + L + L+ FG VG+N Y TP +QG
Sbjct: 114 GKATYLHLLKDFGSGKATIQFHQPQRFNDALWHIIEKLECFFGALVGSNIYITPQDSQGL 173
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
HYDD+E FILQLEG+K+W++Y P V YS +++IG+P L+PGDLL
Sbjct: 174 PAHYDDVEVFILQLEGEKRWRLYSP--VVPLARDYSV--VPEDQIGSPTHDFVLKPGDLL 229
Query: 178 YLPRGYIHQASTVT-NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
Y PRG IHQA + + HS HVTIS YQ +W D L+ +P L A +++ RRG+P
Sbjct: 230 YFPRGVIHQAQALPGSSHSTHVTISTYQNNSWSDYLQDLLPGILFDAAKENIDLRRGIP- 288
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD--EMGKQLMHDALPP 294
R L+ P + + L LL +V+ ++ + EM + M LPP
Sbjct: 289 ---RQQMLSMDTP--------GVAQQLSSLLTTVVKGLESHRHIRSFEMLRDFMGSRLPP 337
Query: 295 VLSPEE 300
L +E
Sbjct: 338 FLDNKE 343
>gi|301769327|ref|XP_002920080.1| PREDICTED: MYC-induced nuclear antigen-like [Ailuropoda
melanoleuca]
Length = 463
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/300 (34%), Positives = 150/300 (50%), Gaps = 23/300 (7%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WEK+PL I R YY +L + + + + ++I++ +G+++ ++
Sbjct: 58 EFWEKKPLLIQRNDPAVATYYQSLFRLSDLKSLCSRGMY-YGRDINVCRCVNGKKKVLNK 116
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 117 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 176
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E F+LQLEG+K W++Y P + L R S S++ IG P TL+PGD
Sbjct: 177 LPPHYDDVEVFVLQLEGEKHWRLYHPTVP---LAREYSVE-SEDRIGRPAHEFTLKPGDF 232
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ +W D L + + DLEFR G+P
Sbjct: 233 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDLEFRAGIP 292
Query: 236 IGYLRYAGLARGKPVDIQADRL-AMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
L A A RL A L D L E + D M K + + LPP
Sbjct: 293 RQLLLVDATA-------VATRLSAFLRTLADRLEGTKELLSAD-----MKKDFVMNRLPP 340
>gi|410970260|ref|XP_003991604.1| PREDICTED: MYC-induced nuclear antigen [Felis catus]
Length = 464
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/309 (34%), Positives = 153/309 (49%), Gaps = 40/309 (12%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WEK+PL I R YY +L + + + + ++I++ +G+++ ++
Sbjct: 58 EFWEKKPLLIQRDDPAVATYYRSLFRLSDLKSLCSRGMY-YGRDINVCRCVNGKKKVLNK 116
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 117 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 176
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W +Y P + L R S +++ IG P TL+PGD
Sbjct: 177 LPPHYDDVEVFILQLEGEKHWCLYHPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDF 232
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ +W D L M + DLEFR G+P
Sbjct: 233 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTMSGLVFDTAKEDLEFRAGIP 292
Query: 236 IGYL----------RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGK 285
L R +G R ADRL E K+LL+ +M K
Sbjct: 293 RQLLLQVEATAVATRLSGFLR-----TLADRL---EGTKELLSA------------DMKK 332
Query: 286 QLMHDALPP 294
+ + LPP
Sbjct: 333 DFVMNRLPP 341
>gi|23346418|ref|NP_116167.3| bifunctional lysine-specific demethylase and histidyl-hydroxylase
MINA isoform b [Homo sapiens]
gi|387849375|ref|NP_001248758.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
MINA isoform b [Homo sapiens]
gi|23200591|dbj|BAC16358.1| Mina53 form-2 [Homo sapiens]
gi|119600278|gb|EAW79872.1| MYC induced nuclear antigen, isoform CRA_a [Homo sapiens]
gi|119600279|gb|EAW79873.1| MYC induced nuclear antigen, isoform CRA_a [Homo sapiens]
Length = 464
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/311 (33%), Positives = 157/311 (50%), Gaps = 44/311 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289
Query: 234 LP----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEM 283
+P + R +G R ADRL E K+LL+ +M
Sbjct: 290 IPRQLLLVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SDM 329
Query: 284 GKQLMHDALPP 294
K + LPP
Sbjct: 330 KKDFIMHRLPP 340
>gi|426341335|ref|XP_004035995.1| PREDICTED: MYC-induced nuclear antigen isoform 1 [Gorilla gorilla
gorilla]
gi|426341337|ref|XP_004035996.1| PREDICTED: MYC-induced nuclear antigen isoform 2 [Gorilla gorilla
gorilla]
Length = 465
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLIFDTAKEDVELRTG 289
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
+P + R +G R ADRL E K+LL+ +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329
Query: 283 MGKQLMHDALPP 294
M K + LPP
Sbjct: 330 MKKDFIMHRLPP 341
>gi|296863377|pdb|2XDV|A Chain A, Crystal Structure Of The Catalytic Domain Of Flj14393
Length = 442
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 34 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 90
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 91 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 150
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 151 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 206
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 207 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 266
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
+P + R +G R ADRL E K+LL+ +
Sbjct: 267 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLSS------------D 306
Query: 283 MGKQLMHDALPP 294
M K + LPP
Sbjct: 307 MKKDFIMHRLPP 318
>gi|50950028|emb|CAH10679.1| hypothetical protein [Homo sapiens]
Length = 384
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 156/310 (50%), Gaps = 45/310 (14%)
Query: 3 WEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDV 57
WE++PL I R YY +L K+ D+ L + + +++++ +G+++ ++
Sbjct: 13 WEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVLNK 69
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 70 DGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 129
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PGDL
Sbjct: 130 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPGDL 185
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G+P
Sbjct: 186 LYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTGIP 245
Query: 236 -----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG 284
+ R +G R ADRL E K+LL+ +M
Sbjct: 246 RQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLSS------------DMK 285
Query: 285 KQLMHDALPP 294
K + LPP
Sbjct: 286 KDFIMHRLPP 295
>gi|449275792|gb|EMC84560.1| MYC-induced nuclear antigen [Columba livia]
Length = 469
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 154/303 (50%), Gaps = 24/303 (7%)
Query: 1 DHWEKEPLYISRKQK---NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WE++PL I R YY +L + ++ + + ++I+I +G+++ ++
Sbjct: 57 EYWEQKPLLIQRNDNLMAAYYQSLFQFSNLKELCGQGLY-YGRDINICRCVNGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
EG+ Q+ D ++ +I+F PQ + + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 EGKVNYMQLKKDFDQKKATIQFHQPQRFKEELWKIQEKLECYFGSLVGSNVYITPQGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLP--RMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PHYDD+E FILQLEG+K W++Y P + EY S++ IG P L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYKPTVHLAQEYNVE------SEDRIGNPTHEFILKPG 229
Query: 175 DLLYLPRGYIHQAST-VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T + +S HVTIS YQ +W D L A+P + D+ R
Sbjct: 230 DLLYFPRGTIHQADTPLGISYSTHVTISTYQNNSWGDFLLDAIPGLVFDTAKEDVALRTS 289
Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
+P L VDI + L+ L +L +L +M K + + LP
Sbjct: 290 IPRQLLMQ--------VDIADSTKKLSSLLRQLADRLENTRELRSS--DMKKDFIMNRLP 339
Query: 294 PVL 296
P L
Sbjct: 340 PYL 342
>gi|114588106|ref|XP_001139845.1| PREDICTED: MYC-induced nuclear antigen isoform 2 [Pan troglodytes]
gi|114588108|ref|XP_001139930.1| PREDICTED: MYC-induced nuclear antigen isoform 3 [Pan troglodytes]
gi|410226778|gb|JAA10608.1| MYC induced nuclear antigen [Pan troglodytes]
gi|410264218|gb|JAA20075.1| MYC induced nuclear antigen [Pan troglodytes]
gi|410306734|gb|JAA31967.1| MYC induced nuclear antigen [Pan troglodytes]
gi|410331313|gb|JAA34603.1| MYC induced nuclear antigen [Pan troglodytes]
gi|410331315|gb|JAA34604.1| MYC induced nuclear antigen [Pan troglodytes]
Length = 465
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
+P + R +G R ADRL E K+LL+ +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329
Query: 283 MGKQLMHDALPP 294
M K + LPP
Sbjct: 330 MKKDFIMHRLPP 341
>gi|15928921|gb|AAH14928.1| MYC induced nuclear antigen [Homo sapiens]
gi|38708276|gb|AAR27293.1| NO52 protein [Homo sapiens]
Length = 465
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
+P + R +G R ADRL E K+LL+ +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329
Query: 283 MGKQLMHDALPP 294
M K + LPP
Sbjct: 330 MKKDFIMHRLPP 341
>gi|110227619|ref|NP_001035998.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
MINA isoform a [Homo sapiens]
gi|110227621|ref|NP_694822.2| bifunctional lysine-specific demethylase and histidyl-hydroxylase
MINA isoform a [Homo sapiens]
gi|74750624|sp|Q8IUF8.1|MINA_HUMAN RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase MINA; AltName: Full=60S ribosomal
protein L27a histidine hydroxylase; AltName:
Full=Histone lysine demethylase MINA; AltName:
Full=MYC-induced nuclear antigen; AltName: Full=Mineral
dust-induced gene protein; AltName: Full=Nucleolar
protein 52; AltName: Full=Ribosomal oxygenase MINA;
Short=ROX
gi|23307412|dbj|BAC16537.1| Mina53 [Homo sapiens]
gi|31872083|gb|AAP59421.1| MDIG [Homo sapiens]
gi|91242970|gb|ABE28016.1| MYC induced nuclear antigen [Homo sapiens]
gi|119600280|gb|EAW79874.1| MYC induced nuclear antigen, isoform CRA_b [Homo sapiens]
Length = 465
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
+P + R +G R ADRL E K+LL+ +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329
Query: 283 MGKQLMHDALPP 294
M K + LPP
Sbjct: 330 MKKDFIMHRLPP 341
>gi|417401381|gb|JAA47579.1| Hypothetical protein [Desmodus rotundus]
Length = 464
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/237 (37%), Positives = 133/237 (56%), Gaps = 8/237 (3%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
WE++PL I R + ++ D+ L + + ++I++ +G+++ ++ +G+
Sbjct: 59 WEQKPLLIQRDDPALAARYQSLFRLSDLKSLCSRGVYYGRDINVCRCVNGKKKVLNKDGK 118
Query: 61 A-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG P
Sbjct: 119 VHFLQLRKDFDQKRATIQFHQPQRFKDDLWRIQEKLECYFGSLVGSNVYLTPAGSQGLPP 178
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
HYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PGDLLY
Sbjct: 179 HYDDVEVFILQLEGEKHWRLYHPTVP---LARDYSVE-AEDRIGRPAHEFTLKPGDLLYF 234
Query: 180 PRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
PRG IHQA T HS HVTIS YQ +W DLL A+ + DLEFR G+P
Sbjct: 235 PRGTIHQADTPPGLAHSTHVTISTYQNNSWGDLLLDAISGLVFDTAKEDLEFRAGIP 291
>gi|402858812|ref|XP_003893879.1| PREDICTED: MYC-induced nuclear antigen isoform 1 [Papio anubis]
gi|402858814|ref|XP_003893880.1| PREDICTED: MYC-induced nuclear antigen isoform 2 [Papio anubis]
Length = 465
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIRRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRAG 289
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
+P + R +G R ADRL E K+LL+ +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329
Query: 283 MGKQLMHDALPP 294
M K + LPP
Sbjct: 330 MKKDFIMHRLPP 341
>gi|387762704|ref|NP_001248638.1| MYC induced nuclear antigen [Macaca mulatta]
gi|355559267|gb|EHH15995.1| hypothetical protein EGK_11218 [Macaca mulatta]
gi|355746351|gb|EHH50965.1| hypothetical protein EGM_10276 [Macaca fascicularis]
gi|380817320|gb|AFE80534.1| MYC-induced nuclear antigen isoform a [Macaca mulatta]
gi|384949894|gb|AFI38552.1| MYC-induced nuclear antigen isoform a [Macaca mulatta]
Length = 465
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIRRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDLDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRAG 289
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
+P + R +G R ADRL E K+LL+ +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SD 329
Query: 283 MGKQLMHDALPP 294
M K + LPP
Sbjct: 330 MKKDFIMHRLPP 341
>gi|397502581|ref|XP_003821931.1| PREDICTED: LOW QUALITY PROTEIN: MYC-induced nuclear antigen [Pan
paniscus]
Length = 465
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 153/310 (49%), Gaps = 41/310 (13%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + + +++++ +G+++ ++
Sbjct: 57 EFWEQKPLLIQRGDPALATYYGSLFKLTDLKSLCSRGTY-YGRDVNVCRCVNGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G+P
Sbjct: 232 LYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTGIP 291
Query: 236 -----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG 284
+ R +G R ADRL E K+LL+ +M
Sbjct: 292 RQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SDMK 331
Query: 285 KQLMHDALPP 294
K + LPP
Sbjct: 332 KDFIMHRLPP 341
>gi|348555088|ref|XP_003463356.1| PREDICTED: MYC-induced nuclear antigen-like [Cavia porcellus]
Length = 464
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 87/240 (36%), Positives = 131/240 (54%), Gaps = 10/240 (4%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WEK+PL I R YY +L + + ++ + +++++ +G+++ ++
Sbjct: 57 EFWEKKPLLIQRDDPALAAYYQSLFRLTDLKSLCGQSLY-YGRDVNVCRCINGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKVHFLQLRKDFDQKRTTIQFHQPQRFKDELWKIQEKLECYFGSLVGSNVYITPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S + +G P TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYTPTVA---LAREYSVE-PEARLGAPTHEFTLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ +W D L M + D+E RRG+P
Sbjct: 232 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTMSGLVFDTAKDDVELRRGIP 291
>gi|23463307|ref|NP_695221.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
MINA [Rattus norvegicus]
gi|81877998|sp|Q8CFC1.1|MINA_RAT RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase MINA; AltName: Full=Histone lysine
demethylase MINA; AltName: Full=MYC-induced nuclear
antigen
gi|23200600|dbj|BAC16362.1| Mina53 [Rattus norvegicus]
gi|56585203|gb|AAH87650.1| Myc induced nuclear antigen [Rattus norvegicus]
gi|149060276|gb|EDM10990.1| myc induced nuclear antigen, isoform CRA_a [Rattus norvegicus]
Length = 465
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 152/299 (50%), Gaps = 19/299 (6%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R + YY +L + + + + + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPSLAAYYQSLFSLSDLRSLCSQGLY-YGRDVNVCRCIGGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKAQFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K+W++Y P + L R S ++ IGTP L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGRKRWRLYSPTVP---LAREYSVE-PEDRIGTPTHDFLLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T + HS+H+TIS YQ +W D L ++ + D+ R G+P
Sbjct: 232 LYFPRGTIHQAETPSGLAHSIHLTISTYQNNSWGDYLLDSISGLVFDIAKEDVALRTGMP 291
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
R + P D+ L D L E + D M K + LPP
Sbjct: 292 ----RRMLMNVETPADVTRKLSGFLRTLADQLEGRKELLSSD-----MKKDFVMHRLPP 341
>gi|431892471|gb|ELK02909.1| MYC-induced nuclear antigen [Pteropus alecto]
Length = 509
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 104/302 (34%), Positives = 154/302 (50%), Gaps = 25/302 (8%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + ++I++ +G+++ +
Sbjct: 102 EFWEQKPLLIQRDDPAVATYYQSLF---KLSDLKSLCSRGMYYGRDINVCRCVNGKKKVL 158
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 159 NKDGKVHFLQLRKDFVQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 218
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PG
Sbjct: 219 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEDRIGRPAHEFTLKPG 274
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + DL FR G
Sbjct: 275 DLLYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDLAFRAG 334
Query: 234 LPIGYLRYAGLARGKPVDIQADRL-AMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
+P L + + + RL A L D L E + D M K + + L
Sbjct: 335 IPRQL-----LLQVETTAVATRRLSAFLRTLADRLEGTSELLSAD-----MKKDFVMNRL 384
Query: 293 PP 294
PP
Sbjct: 385 PP 386
>gi|62751815|ref|NP_001015675.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
MINA [Bos taurus]
gi|75057883|sp|Q5EA24.1|MINA_BOVIN RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase MINA; AltName: Full=Histone lysine
demethylase MINA; AltName: Full=MYC-induced nuclear
antigen
gi|59857855|gb|AAX08762.1| MYC induced nuclear antigen isoform 2 [Bos taurus]
gi|296491583|tpg|DAA33626.1| TPA: MYC-induced nuclear antigen [Bos taurus]
Length = 462
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 101/304 (33%), Positives = 152/304 (50%), Gaps = 30/304 (9%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + I + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPALATYYQSLFRLSDLKSLCSWG-IYYGRDVNVCRCVHGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GR Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP AQG
Sbjct: 116 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGAQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ ++W D L + + D+ R G+P
Sbjct: 232 LYFPRGTIHQADTPEGLAHSTHVTISTYQSSSWGDFLLDTISGLVFDTAKADVALRAGIP 291
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE-----MGKQLMHD 290
L +QA+ +A+ L L L + + +G E M K +
Sbjct: 292 RQLL------------LQAESIAVATRLSGFLRMLADRL---EGTKELPSADMKKDFAMN 336
Query: 291 ALPP 294
LPP
Sbjct: 337 RLPP 340
>gi|344277197|ref|XP_003410390.1| PREDICTED: MYC-induced nuclear antigen-like [Loxodonta africana]
Length = 462
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 97/277 (35%), Positives = 145/277 (52%), Gaps = 24/277 (8%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L ++ + + + ++I++ +G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPALATYYQSLFRLTELKSLCSRGLY-YGRDINVCRCVNGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKVYFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ ++W D L + + D+E R GLP
Sbjct: 232 LYFPRGTIHQADTPPGLAHSTHVTISTYQNSSWGDFLLDTISGLVFDTAKEDIELRAGLP 291
Query: 236 IGYL---------RYAGLARGKPVDIQADRLAMKENL 263
L R +G R + ADRL E L
Sbjct: 292 RQLLLVGSTTVTTRLSGFLR-----MLADRLEGTEEL 323
>gi|426217331|ref|XP_004002907.1| PREDICTED: MYC-induced nuclear antigen isoform 2 [Ovis aries]
Length = 460
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 87/240 (36%), Positives = 132/240 (55%), Gaps = 10/240 (4%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L ++ + I + +++++ Y G+++ +
Sbjct: 56 EFWEQKPLLIQRDDPALATYYQSLFRLSDLESLCSWG-IYYGRDVNVCRYVHGKKKVFNK 114
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GR Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP AQG
Sbjct: 115 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRVQEKLECYFGSLVGSNVYITPAGAQG 174
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PGDL
Sbjct: 175 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 230
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS H+TIS YQ+++W D L + + D+ R G+P
Sbjct: 231 LYFPRGTIHQADTPEGLAHSTHLTISTYQRSSWGDFLLDTISGLVFDTAKADVALRAGIP 290
>gi|426217329|ref|XP_004002906.1| PREDICTED: MYC-induced nuclear antigen isoform 1 [Ovis aries]
Length = 461
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/240 (36%), Positives = 132/240 (55%), Gaps = 10/240 (4%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L ++ + I + +++++ Y G+++ +
Sbjct: 56 EFWEQKPLLIQRDDPALATYYQSLFRLSDLESLCSWG-IYYGRDVNVCRYVHGKKKVFNK 114
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GR Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP AQG
Sbjct: 115 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRVQEKLECYFGSLVGSNVYITPAGAQG 174
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PGDL
Sbjct: 175 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 230
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS H+TIS YQ+++W D L + + D+ R G+P
Sbjct: 231 LYFPRGTIHQADTPEGLAHSTHLTISTYQRSSWGDFLLDTISGLVFDTAKADVALRAGIP 290
>gi|403273551|ref|XP_003928573.1| PREDICTED: MYC-induced nuclear antigen [Saimiri boliviensis
boliviensis]
Length = 434
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/242 (37%), Positives = 134/242 (55%), Gaps = 14/242 (5%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALAAYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVSGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P TL+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPTHEFTLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L +P + +E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTIPGLVFDTAKEGVELRAG 289
Query: 234 LP 235
+P
Sbjct: 290 IP 291
>gi|197098352|ref|NP_001127580.1| bifunctional lysine-specific demethylase and histidyl-hydroxylase
MINA [Pongo abelii]
gi|75054795|sp|Q5R673.1|MINA_PONAB RecName: Full=Bifunctional lysine-specific demethylase and
histidyl-hydroxylase MINA; AltName: Full=Histone lysine
demethylase MINA; AltName: Full=MYC-induced nuclear
antigen
gi|55732074|emb|CAH92743.1| hypothetical protein [Pongo abelii]
Length = 465
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 157/312 (50%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+ PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQLHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + A D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDAAKEDVELRAG 289
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
+P + R +G R ADRL E K++L+ +
Sbjct: 290 IPRQLLLQVESTTVATRRLSGFLR-----TLADRL---EGTKEVLS------------SD 329
Query: 283 MGKQLMHDALPP 294
M K + LPP
Sbjct: 330 MKKDFIMHRLPP 341
>gi|440894819|gb|ELR47165.1| MYC-induced nuclear antigen [Bos grunniens mutus]
Length = 463
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/304 (33%), Positives = 151/304 (49%), Gaps = 30/304 (9%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + I + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPALATYYQSLFRLSDLKSLCSWG-IYYGRDVNVCRCVHGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GR Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP AQG
Sbjct: 116 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGAQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ ++W D L + + D+ R G+P
Sbjct: 232 LYFPRGTIHQADTPEGLAHSTHVTISTYQSSSWGDFLLDTISGLVFDTAKADVALRAGIP 291
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE-----MGKQLMHD 290
L +QA+ A+ L L L + + +G E M K +
Sbjct: 292 RQLL------------LQAESTAVATRLSGFLRMLADRL---EGTKELPSADMKKDFAMN 336
Query: 291 ALPP 294
LPP
Sbjct: 337 RLPP 340
>gi|14041887|dbj|BAB55024.1| unnamed protein product [Homo sapiens]
Length = 464
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/311 (33%), Positives = 156/311 (50%), Gaps = 44/311 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ F VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFSSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRTG 289
Query: 234 LP----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEM 283
+P + R +G R ADRL E K+LL+ +M
Sbjct: 290 IPRQLLLVESTTVATRRLSGFLR-----TLADRL---EGTKELLS------------SDM 329
Query: 284 GKQLMHDALPP 294
K + LPP
Sbjct: 330 KKDFIMHRLPP 340
>gi|345310558|ref|XP_001519133.2| PREDICTED: MYC-induced nuclear antigen [Ornithorhynchus anatinus]
Length = 444
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/242 (35%), Positives = 136/242 (56%), Gaps = 14/242 (5%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WE++PL I R + YY +L + + + + +++++ +G+++ ++
Sbjct: 36 EYWEQKPLLIQRDDPSIAAYYQSLFRLMDLKGLCSRGMF-YGRDVNVCRCVNGKKKVLNK 94
Query: 58 EGRA---LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
G+ L + +D R +I+F PQ + L ++ L+ FG VG+N Y TPP +
Sbjct: 95 GGKVNYLLLRKDFDQKR--ATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPPGS 152
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + +L R + +++IG+P TL+PG
Sbjct: 153 QGLPPHYDDVEVFILQLEGEKHWRLYQPTV---HLAREYNVE-PEDKIGSPTHEFTLKPG 208
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L +P + ++E R G
Sbjct: 209 DLLYFPRGTIHQADTPPGVAHSTHVTISTYQNNSWGDFLLDIIPGLVFETAKEEVELRMG 268
Query: 234 LP 235
+P
Sbjct: 269 IP 270
>gi|119331084|ref|NP_001073191.1| MYC induced nuclear antigen [Gallus gallus]
gi|53133678|emb|CAG32168.1| hypothetical protein RCJMB04_19e21 [Gallus gallus]
Length = 473
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 155/301 (51%), Gaps = 20/301 (6%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WE++PL + R YY +L + ++ + + ++I+I +G+++ ++
Sbjct: 57 EYWEEKPLLVQRNDPLLAAYYQSLFQLSDLKELCSQGLY-YGRDINICRCVNGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
EG+ Q+ D ++ +++F PQ + + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 EGKVNYAQLNKDFDQKKATMQFHQPQRFKEELWKIQEKLECYFGSLVGSNVYITPQGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + +L R + ++ IG P L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYKPTV---HLAREYNVE-PEDRIGNPTHEFVLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T +S HVTIS YQ +W D L A+P + + D+ R +P
Sbjct: 232 LYFPRGTIHQADTPPGIPYSTHVTISTYQNNSWGDFLLDAIPGLVFSTAKEDVALRTSIP 291
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPV 295
L VDI + L+ L +L +L +M K + + LPP
Sbjct: 292 RKLLMQ--------VDIADSTKKLSSILRMLADRLENTGELRSS--DMRKDFIMNRLPPC 341
Query: 296 L 296
L
Sbjct: 342 L 342
>gi|348545500|ref|XP_003460218.1| PREDICTED: myc-induced nuclear antigen-like [Oreochromis niloticus]
Length = 465
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 94/306 (30%), Positives = 158/306 (51%), Gaps = 32/306 (10%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WEK+PL++ R YY +L + + +E+ ++I++ +G+++ ++
Sbjct: 59 EYWEKKPLHLQRSDPGTALYYQSLFQLCDLQSLCSQG-LEYYRDINVVRCVNGKKKLLNK 117
Query: 58 EGRALPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
EGR ++ + ++++ PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 118 EGRVKHSVLNKIMSQNKATVQYHQPQRFKDELWRIQEKLECFFGALVGSNIYITPEESQG 177
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PHYDD+E FILQLEG+K W +Y P + EY + S++ IG P + L+ G
Sbjct: 178 LPPHYDDVEVFILQLEGQKHWHLYNPTVPLAAEYSVQ------SEDSIGIPTHDIILKAG 231
Query: 175 DLLYLPRGYIHQASTVTNEH-SLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T + S+H+T+S YQ+ +W DLL A P L ++ FR G
Sbjct: 232 DLLYFPRGTIHQARTPAGVNCSIHLTLSTYQRMSWGDLLLDAFPSLLCDRSRREVSFREG 291
Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG-----KQLM 288
+P L + ++ L + L L L + +++ G+ E+G + +
Sbjct: 292 MPRRLLLSS-----------SEGLHTNKRLAACLRSLAD--EMETGMQELGFTHMKRDFI 338
Query: 289 HDALPP 294
+ LPP
Sbjct: 339 MNRLPP 344
>gi|198425574|ref|XP_002131324.1| PREDICTED: similar to MYC induced nuclear antigen [Ciona
intestinalis]
Length = 490
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 150/296 (50%), Gaps = 24/296 (8%)
Query: 14 QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ--RQTMDVEGRALPQIVWDHYR 71
+ +Y+N L + + +++ +++ K+I ++D + R +V G + V +
Sbjct: 76 RSSYFNKLFNHEILKEVVLSKKLKYDKDICACRFDDEKKCRVNAEVHGPVTAEKVHSLFH 135
Query: 72 EG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
+ +++F PQ + L ++ L+ FG VG+N Y TP +QG APH+D++E FILQ
Sbjct: 136 DDKMTLQFHQPQRFHDELWKIQEKLESFFGSQVGSNVYMTPDGSQGLAPHHDNVEVFILQ 195
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-GTPIL-TVTLEPGDLLYLPRGYIHQAS 188
LEG+K+WK+Y P + LPR SS +F + G +L T+ ++PGD+LY PRG +HQA
Sbjct: 196 LEGEKEWKLYSPVV---NLPRNSSSDFDDSTVKGLTLLDTIIMKPGDVLYFPRGTVHQAK 252
Query: 189 TVTNE-HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARG 247
++ HS H+TIS Y+ W D + +P AA + RRGLP Y
Sbjct: 253 SIKGTGHSTHLTISTYETQCWGDYILDFIPYLTDAAADKVVSLRRGLPRKYYN------- 305
Query: 248 KPVDIQADRLAMKENLKDLLAKLVEYVDLDDGV---DEMGKQLMHDALPPVLSPEE 300
Q K+NLK L L E + V D + + LPP S +E
Sbjct: 306 -----QTSTDGFKQNLKKALISLAESLTESTDVEMSDSIVCNFFANRLPPFGSVQE 356
>gi|332225165|ref|XP_003261750.1| PREDICTED: MYC-induced nuclear antigen [Nomascus leucogenys]
Length = 465
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 134/242 (55%), Gaps = 14/242 (5%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTM 55
+ WE++PL I R YY +L K+ D+ L + + +++++ +G+++ +
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLF---KLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVL 113
Query: 56 DVEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+ +G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +
Sbjct: 114 NKDGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGS 173
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
QG PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ L+PG
Sbjct: 174 QGLPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFLLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQ T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQVDTPAGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRAG 289
Query: 234 LP 235
+P
Sbjct: 290 IP 291
>gi|291400828|ref|XP_002716795.1| PREDICTED: MYC induced nuclear antigen-like [Oryctolagus cuniculus]
Length = 465
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 101/308 (32%), Positives = 154/308 (50%), Gaps = 41/308 (13%)
Query: 3 WEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
WE++PL I R YY +L + + + + +++++ +G+++ ++ +G
Sbjct: 59 WEQKPLLIQRDDPALATYYQSLFRLTDLKSLCSRGMY-YGRDVNVCRCVNGKKKVLNKDG 117
Query: 60 RA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 118 KAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQGLP 177
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PHYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PGDLLY
Sbjct: 178 PHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDLLY 233
Query: 179 LPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP-- 235
PRG IHQA T HS HVTIS YQ +W D L + + A D+ R G+P
Sbjct: 234 FPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDAAKEDVALRAGIPRQ 293
Query: 236 ---------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQ 286
+ R +G R + A+RL E K+LL+ +M K
Sbjct: 294 LLLQAESTTVATRRLSGFLR-----MLAERL---EGTKELLSS------------DMKKD 333
Query: 287 LMHDALPP 294
+ LPP
Sbjct: 334 FIMHRLPP 341
>gi|456753234|gb|JAA74127.1| MYC induced nuclear antigen tv2 [Sus scrofa]
Length = 464
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 103/309 (33%), Positives = 150/309 (48%), Gaps = 40/309 (12%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WEK+PL I R YY +L + + I + ++I++ G+++ +
Sbjct: 57 EFWEKKPLLIQRNDPVLATYYRSLFRLSDLKSLCSRG-IYYGRDINVCRCVSGKKKVFNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 GGKVHFLQLRKDFDQKKATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYLTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S ++ IG P L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AENRIGKPAYEFMLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS H+TIS YQ ++W D L + + D+EFR G+P
Sbjct: 232 LYFPRGTIHQADTPPGLAHSTHLTISTYQSSSWGDFLLDTISGLVFDMAKADVEFRAGIP 291
Query: 236 IGYL----------RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGK 285
L R +G R + ADRL E K+L + +M K
Sbjct: 292 RQLLLQVETTDVATRLSGFLR-----MLADRL---EGTKELFST------------DMKK 331
Query: 286 QLMHDALPP 294
+ LPP
Sbjct: 332 DFALNRLPP 340
>gi|81674421|gb|AAI09962.1| MYC induced nuclear antigen [Bos taurus]
Length = 461
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 87/240 (36%), Positives = 130/240 (54%), Gaps = 10/240 (4%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + I + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPALATYYQSLFRLSDLKSLCSWG-IYYGRDVNVCRCVHGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GR Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP AQG
Sbjct: 116 DGRVHFLQLRQDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGAQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S +++ IG P TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYQPTVP---LAREYSVE-AEDRIGRPTHEFTLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ ++W D L + + D+ R G+P
Sbjct: 232 LYFPRGTIHQADTPEGLAHSTHVTISTYQSSSWGDFLLDTISGLVFDTAKADVALRAGIP 291
>gi|149731734|ref|XP_001504462.1| PREDICTED: MYC-induced nuclear antigen [Equus caballus]
Length = 464
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 154/312 (49%), Gaps = 45/312 (14%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + + + ++I++ +G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPALATYYQSLFRLSDLKSLCSRGMY-YGRDINVCRCVNGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PHYDD+E ILQLEG+K+W++Y P + EY +++ IG+P TL+PG
Sbjct: 176 LPPHYDDVEVLILQLEGEKRWRLYRPTVPLAREYGVE------AEDSIGSPAREFTLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
D LY PRG IHQA T HS H+TIS YQ +W D L + + DLE R G
Sbjct: 230 DFLYFPRGTIHQADTPPGLAHSTHLTISTYQNNSWGDFLLDTISGLVFDTAKEDLELRAG 289
Query: 234 LP-----------IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDE 282
LP + R +G R + AD+L E K+LL+ +
Sbjct: 290 LPRQLLLQVETTAVARRRVSGFLR-----MLADKL---EGTKELLSA------------D 329
Query: 283 MGKQLMHDALPP 294
M K + + LPP
Sbjct: 330 MKKDFIMNRLPP 341
>gi|344254172|gb|EGW10276.1| Myc-induced nuclear antigen [Cricetulus griseus]
Length = 826
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 86/240 (35%), Positives = 131/240 (54%), Gaps = 10/240 (4%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L ++ I + + K++++ +G+++ ++
Sbjct: 414 EFWEQKPLLIQRNDPSLATYYQSLFRLSDMNRICLQGLY-YGKDVNVCRCINGKKKVLNK 472
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 473 SGKVQFLQLRKDFDQKRATIQFHQPQRFKDELWKIQEKLECYFGSLVGSNVYMTPAGSQG 532
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S + IGTP L+PGDL
Sbjct: 533 LPPHYDDVEVFILQLEGEKHWRLYSPTVP---LAREYSVE-PEARIGTPTHDFILKPGDL 588
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA+T + HS HVTIS YQ +W D L ++ + D+ R G+P
Sbjct: 589 LYFPRGTIHQANTPSGLAHSTHVTISTYQNNSWGDFLLDSLSGLVFDIAKEDVALRTGIP 648
>gi|354493583|ref|XP_003508920.1| PREDICTED: myc-induced nuclear antigen-like [Cricetulus griseus]
Length = 439
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 86/240 (35%), Positives = 132/240 (55%), Gaps = 10/240 (4%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R + YY +L ++ I + + K++++ +G+++ ++
Sbjct: 57 EFWEQKPLLIQRNDPSLATYYQSLFRLSDMNRICLQGLY-YGKDVNVCRCINGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 SGKVQFLQLRKDFDQKRATIQFHQPQRFKDELWKIQEKLECYFGSLVGSNVYMTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S + IGTP L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYSPTVP---LAREYSVE-PEARIGTPTHDFILKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA+T + HS HVTIS YQ +W D L ++ + D+ R G+P
Sbjct: 232 LYFPRGTIHQANTPSGLAHSTHVTISTYQNNSWGDFLLDSLSGLVFDIAKEDVALRTGIP 291
>gi|395821318|ref|XP_003783991.1| PREDICTED: MYC-induced nuclear antigen [Otolemur garnettii]
Length = 465
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 86/240 (35%), Positives = 130/240 (54%), Gaps = 10/240 (4%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + I + +++++ +G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPALATYYQSLFRLTDLKSLCSRG-IYYGRDVNVCRCVNGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYLTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S ++ IG P TL+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-PEDRIGRPTYEFTLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ +W D L + + D+ R G+P
Sbjct: 232 LYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVALRAGIP 291
>gi|147905999|ref|NP_001086856.1| MYC induced nuclear antigen [Xenopus laevis]
gi|50415478|gb|AAH77562.1| Mina-prov protein [Xenopus laevis]
Length = 461
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/239 (34%), Positives = 132/239 (55%), Gaps = 8/239 (3%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVE 58
D+WE + L + + + + T ++ D+ + I +++++++ DG++ +
Sbjct: 54 DYWETKVLLLQGRDPAFTDYFQTLFRLSDLKHIAGGGIYYERDVNVFKCRDGKKIALPRH 113
Query: 59 GRALPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
G+A + + G +I+F PQ + L + L+ FG VG+N Y TP +QG
Sbjct: 114 GKATYLHLLKDFGSGKATIQFHQPQRFNDALWHIMEKLECFFGALVGSNVYITPQDSQGL 173
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
HYDD+E FILQLEG+K+W++Y P V YS +++IG+P L+PGDLL
Sbjct: 174 PAHYDDVEVFILQLEGEKRWRLYNP--VVPLARDYSV--VPEDQIGSPTHDFVLKPGDLL 229
Query: 178 YLPRGYIHQASTVT-NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
Y PRG IHQA + + HS HVTIS YQ +W D L+ +P L A +++ RRG+P
Sbjct: 230 YFPRGVIHQAQALPGSSHSTHVTISTYQNNSWSDYLQDLLPGILFDAAKANIDLRRGIP 288
>gi|351698842|gb|EHB01761.1| MYC-induced nuclear antigen [Heterocephalus glaber]
Length = 463
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 92/277 (33%), Positives = 140/277 (50%), Gaps = 16/277 (5%)
Query: 1 DHWEKEPLYISRKQ----KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD 56
+ WEK+PL + R YY +L + + + + +++++ G+++ ++
Sbjct: 57 EFWEKKPLLVQRHDDPALATYYQSLFRLADLKSLCSQGLY-YGRDVNVCRSVSGKKKVLN 115
Query: 57 VEGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
+G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +Q
Sbjct: 116 KDGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRVQEKLEYYFGFLVGSNVYITPAGSQ 175
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PHYDD+E FILQLEG+ W++Y P + EY Y + +G P TL+P
Sbjct: 176 GLPPHYDDVEVFILQLEGENHWRLYSPTVALAREYSVEYEA------RLGVPTREFTLKP 229
Query: 174 GDLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
GDLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R
Sbjct: 230 GDLLYFPRGAIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRS 289
Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMK-ENLKDLLA 268
G+P L LA + LA + E K+LL+
Sbjct: 290 GIPRQLLMVETLAVATRLSSFLRTLAERLEGTKELLS 326
>gi|351711026|gb|EHB13945.1| MYC-induced nuclear antigen [Heterocephalus glaber]
Length = 463
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 127/242 (52%), Gaps = 14/242 (5%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WEK+PL I R YY +L + + + + +++++ G+++ ++
Sbjct: 57 EFWEKKPLLIQRDDPALATYYQSLFRLADLKSLCSQGLY-YGRDVNVCRSVSGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+ Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKVHFLQLRKDFDQKRATIQFHQPQRFKDELWRVQEKLECYFGSLVGSNVYITPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PHYDD+E FILQLEG+K W++Y P + EY Y + +G P L+PG
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYPPTVALACEYSVEYEA------RLGAPTHEFMLKPG 229
Query: 175 DLLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
DLLY PRG IHQA T HS HVTIS YQ +W D L + + D+E R G
Sbjct: 230 DLLYFPRGTIHQADTPPGLAHSTHVTISTYQNNSWGDFLLDTISGLVFDTAKEDVELRSG 289
Query: 234 LP 235
+P
Sbjct: 290 IP 291
>gi|74002431|ref|XP_535711.2| PREDICTED: MYC-induced nuclear antigen isoform 1 [Canis lupus
familiaris]
Length = 463
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 146/300 (48%), Gaps = 22/300 (7%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVE 58
+ WEK+PL + R ++ D+ L + + +++++ G+++ ++
Sbjct: 57 EFWEKQPLVLRRDDPAVAAYCQALFRLSDLKSLCGRGLLYGRDVNVCRCVKGKKKVLNKA 116
Query: 59 GRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
GR Q+ D + +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 117 GRVHFLQLRKDLDQRRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQGL 176
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P TL+PGD+L
Sbjct: 177 PPHYDDVEVFILQLEGQKHWRLYHPTVP---LAREYSVE-AEERIGRPAYEFTLKPGDVL 232
Query: 178 YLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
Y PRG +HQA HS HVTIS YQ ++W D L + + A D+ R G+P
Sbjct: 233 YFPRGTVHQADVPPGLAHSTHVTISTYQNSSWGDCLLDTVSGLVFDAVKEDVALRAGIPR 292
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGV--DEMGKQLMHDALPP 294
L G A A + L LL L + ++ + M K + LPP
Sbjct: 293 QLLLVEGTA------------AARRRLGGLLRTLADRLEGTGALLAGGMKKDFVTHRLPP 340
>gi|425448177|ref|ZP_18828156.1| Cupin 4 family protein [Microcystis aeruginosa PCC 9443]
gi|389731118|emb|CCI04793.1| Cupin 4 family protein [Microcystis aeruginosa PCC 9443]
Length = 383
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 157/298 (52%), Gaps = 30/298 (10%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNI-DITLYEDGQRQTMDVEG 59
++WEK P ISR + +YY++L++ K ID I+R + N+ D+ + ++ T + G
Sbjct: 19 EYWEKRPCLISRTEPDYYSHLLSLKDIDSIIRS----YGSNLSDVEVIKENSYFTPN--G 72
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
Q ++ Y +G ++R +P+ L LQ F V N Y +P +QGF
Sbjct: 73 VVNAQQLYQAYSQGYTLRVRKISERWQPISDLRTKLQVFFNHPVLINLYMSPKNSQGFQA 132
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE---EIGTPILTVTLEPGDL 176
H+D E FI+Q+EG K W++Y + LP S + ++ ++ +P+ TL GDL
Sbjct: 133 HFDTHEVFIVQVEGSKNWRIYDSPIT---LPLISDLKYQEKLRNQLTSPVTAFTLNAGDL 189
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
LY+PRGYIH+ T ++ S H+T++++ W DLL A+ Q D+ FR LPI
Sbjct: 190 LYIPRGYIHEVFT-SDSFSTHLTVAIH-TYKWFDLLNIAVNLIAQK----DVRFRESLPI 243
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
G+L +G+ +++++E ++LL L + ++ V+E+ + + + PP
Sbjct: 244 GFLE-----QGEA------KVSLQEQFQELLQILADKSQVEFAVEELAQLFLGETPPP 290
>gi|428769416|ref|YP_007161206.1| hypothetical protein Cyan10605_1036 [Cyanobacterium aponinum PCC
10605]
gi|428683695|gb|AFZ53162.1| hypothetical protein Cyan10605_1036 [Cyanobacterium aponinum PCC
10605]
Length = 397
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/338 (30%), Positives = 167/338 (49%), Gaps = 48/338 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD---- 56
++WEK+PL ISR QKNYY +L++ + +D IL+ + + K +I + ++ Q D
Sbjct: 23 EYWEKKPLIISRNQKNYYQDLLSVEDLDSILQYSKL---KPPEIRVVKNQQDFLPDRYVK 79
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G ++ Y E ++ Q + KPL LQ F V AN Y +P ++G
Sbjct: 80 ADGSLNLNQLYKAYYEDHTLVVNGLQNFWKPLAIYCQKLQNFFNHGVIANLYLSPKDSKG 139
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS--PNFSQEEIGTPILTVTLEPG 174
+PHYD + F+LQ++G K+W+V+ + +P S P + + + TV L+PG
Sbjct: 140 LSPHYDTHDVFVLQVDGSKEWQVH---QCFQPVPLLGSFQPVIPENSLPKLLHTVCLQPG 196
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
DLLYLPRG++H A+T SLH+T+ +Y T W+DL M AL + + R+ L
Sbjct: 197 DLLYLPRGFVHHAAT-QESFSLHLTLGIY-PTQWLDL----MVNALTMLALKNQDLRKAL 250
Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH----- 289
PIG+L KP + +KE LL E +D+ ++ + Q +H
Sbjct: 251 PIGFL-------DKPEMLG----EIKEQFDQLLQTFAEKSSIDEAMELIYDQFIHKITPI 299
Query: 290 -----------DALPP---VLSPEELQCSVFENGLRMS 313
DA+ P V+ E ++C + + G +S
Sbjct: 300 PDGHFQQVNLVDAIAPDTLVVKREGMRCRIVKKGFSIS 337
>gi|344249714|gb|EGW05818.1| Myc-induced nuclear antigen [Cricetulus griseus]
Length = 317
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 129/241 (53%), Gaps = 12/241 (4%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R + YY +L ++ I + K++++ +G++ ++
Sbjct: 58 EFWEQKPLLIQRNDPSLATYYQSLFRLSDMNRICLQGLY-CGKDVNVCRCINGKK-VLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
G+ Q+ D ++ +I+F PQ + L + L+ FG VG+N Y TP +QG
Sbjct: 116 SGKVQFLQLRKDFDQKRATIQFHQPQRFKDELWKTQEKLECYFGSLVGSNVYMTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKK-WKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
PHYDD+E FILQLEG+KK W +Y P + L R S + IGTP L+PGD
Sbjct: 176 LPPHYDDVEVFILQLEGEKKHWHLYSPTVP---LAREYSVE-PKARIGTPTHDFILKPGD 231
Query: 176 LLYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
LLY PRG IHQA+T + HS HVTIS YQ +W D L ++ + D+ R G+
Sbjct: 232 LLYFPRGTIHQANTPSGLAHSTHVTISTYQNNSWGDFLLDSLSRLEFDIAKEDVALRTGI 291
Query: 235 P 235
P
Sbjct: 292 P 292
>gi|307154631|ref|YP_003890015.1| cupin [Cyanothece sp. PCC 7822]
gi|306984859|gb|ADN16740.1| Cupin 4 family protein [Cyanothece sp. PCC 7822]
Length = 390
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 150/299 (50%), Gaps = 32/299 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+WEK PL ++R +YY+ LI+ K ID ILR + K+ D+ L ++ G
Sbjct: 24 YWEKSPLLVARNHPDYYSELISLKNIDSILR---LYGPKSSDVDLIKENS--FFSAGGEV 78
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
++ Y G S+ +PL L+ +L+ VG N Y T +QGF H+
Sbjct: 79 DFNQIYQAYSLGYSLVMRKIHERWQPLSVLHKNLEAFLNHPVGINLYMTSKNSQGFKAHF 138
Query: 122 DDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
D + FILQ+EG K+WK+Y LP + D +Y+ +Q + +P L GDL
Sbjct: 139 DTHDVFILQVEGSKQWKIYDSPITLPVISD---LKYTDKFINQ--LKSPTAEYCLNKGDL 193
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
LY+PRGYIH+ T N S+H+T+ ++ W DL+ A+ K Q ++ FR LP+
Sbjct: 194 LYIPRGYIHEVYT-DNSFSVHLTVGIHS-LKWFDLINSAVTKLAQK----EVRFRESLPV 247
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPV 295
G+LR +A+ ++K ++LL L E ++++ V+++ + + P V
Sbjct: 248 GFLRQE----------EAEE-SLKNQFQELLKLLAEQSEVEEAVEDIAQGFLGKMSPLV 295
>gi|23200593|dbj|BAC16359.1| Mina53 form-3 [Homo sapiens]
Length = 280
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/209 (37%), Positives = 118/209 (56%), Gaps = 10/209 (4%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L ++ + + + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPALATYYGSLFKLTDLESLCSRGMY-YGRDVNVCRCVSGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P L+PGDL
Sbjct: 176 LPPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPAHEFMLKPGDL 231
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQ 204
LY PRG IHQA T HS HVTIS YQ
Sbjct: 232 LYFPRGTIHQADTPAGLAHSTHVTISTYQ 260
>gi|41056083|ref|NP_956631.1| MYC induced nuclear antigen-like [Danio rerio]
gi|31419411|gb|AAH53125.1| MYC induced nuclear antigen-like [Danio rerio]
Length = 528
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 145/343 (42%), Gaps = 60/343 (17%)
Query: 3 WEKEPLYISRKQK---NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
WE++PL + R YY +L + + +++ +I+ GQ++ ++ G
Sbjct: 77 WERQPLVLHRSDAALAGYYGSLFPLSGLRRLCARG-LQYGTDINTCRCVRGQKRLLNRAG 135
Query: 60 RA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+ D + +I+F PQ + L ++ L+ FGC VG+N Y TP AQG
Sbjct: 136 AVDFCLLERDFLEKKATIQFHQPQRFQDELWRIQERLECFFGCLVGSNVYITPAGAQGLP 195
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-EIGTPILTVTLEPGDLL 177
PHYDD+E ILQLEG+K W++Y P +P + E IG P L+ GDLL
Sbjct: 196 PHYDDVEVLILQLEGQKHWRLYEP-----TVPLAREYSLEPEGRIGAPTHDFILQAGDLL 250
Query: 178 YLPRGYIHQASTVTNE-HSLHVTISVYQKT------------------------------ 206
Y PRG IHQA T HS H+T+S YQ
Sbjct: 251 YFPRGTIHQADTPAGAGHSTHLTLSTYQNMCVCAVHNVTHTHTHTLQNVLVLHSFIVCVR 310
Query: 207 ---AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENL 263
+W DLL MP + TD E R GLP G L ++ A+ L
Sbjct: 311 ACRSWGDLLLDLMPGCVFDRMKTDCELRTGLPRGLLTTPSISP-----------AVSHQL 359
Query: 264 KDLLAKLVEYVDLDDG---VDEMGKQLMHDALPPVLS-PEELQ 302
L +L + VD M + + LPP L P+ LQ
Sbjct: 360 SVFLRRLADVVDQQGQSLRSSSMRRDFISHRLPPFLQDPQLLQ 402
>gi|434387984|ref|YP_007098595.1| hypothetical protein Cha6605_4118 [Chamaesiphon minutus PCC 6605]
gi|428018974|gb|AFY95068.1| hypothetical protein Cha6605_4118 [Chamaesiphon minutus PCC 6605]
Length = 396
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 148/292 (50%), Gaps = 38/292 (13%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDIL--------RDNVIEFKKNI--DITLYEDG 50
D+WE++PL I+R+ YY L + + +D +L + VI+ ++ + D + +DG
Sbjct: 23 DYWEQKPLLIAREHPTYYQELFSKQALDSVLFFSTPKPPQLRVIKNQQEVFPDKYIKQDG 82
Query: 51 QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
Q + + Y EG ++ PL L +LQ F V AN Y +
Sbjct: 83 QLNLNQL---------YKLYDEGNTLVVNGLHQLWLPLTILCRNLQLAFNHTVIANCYAS 133
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P ++G PHYD + F+LQ+EG K+W V+ P+ V L P + ++ P+ ++
Sbjct: 134 PKQSKGLMPHYDTHDVFVLQIEGAKQWFVHEAPQPVP--LLHSDQPIIPEGKLAEPLYSI 191
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
L+ GDLLY+PRG+IH A+T + SLH+TI +Y T W DLL +A+ + E
Sbjct: 192 YLKAGDLLYIPRGFIHHAATADSASSLHLTIGLY-ATQWFDLLIQALTQ----VSCQRAE 246
Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
FR+ LP+GYL+ + +I+AD +K + L K + + D +D
Sbjct: 247 FRQALPLGYLQRS--------EIRAD---LKAQFQSLTEKFCQEANFDAAID 287
>gi|307102455|gb|EFN50729.1| hypothetical protein CHLNCDRAFT_142546 [Chlorella variabilis]
Length = 636
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 136/266 (51%), Gaps = 29/266 (10%)
Query: 2 HWEKEP--LYISRKQKNYYNNLITSKKIDDILR-----DNVIEFKKNIDITLYEDGQRQT 54
WE+EP + +++ ++ L + ++ + R +EF +++ Y +G R+T
Sbjct: 13 QWEQEPAVFRATPERRQLFDGLFSFAELQRLARICEQEGKPLEFGVDVNAARYLNGVRET 72
Query: 55 MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
++ E A + + EGC++ Q + + L +L A+L+ GC G N Y TP
Sbjct: 73 LNGE-VADAATINRLFAEGCTM-----QRFSEELWRLLAALERQLGCLAGCNAYLTPKGT 126
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPR------------MVDEYLPRYSSPNFSQEEI 162
QG APHYDD+E F++Q EG K+W+++ P + +L S + +I
Sbjct: 127 QGLAPHYDDVEIFVVQTEGSKRWRLHRPPADSGASTAAGGLLARCHLANQVSGDLQPADI 186
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
G P+L V LE D+LY+PRG IHQA + S H+T+S YQ+ + DL++ A+ AL
Sbjct: 187 GEPVLEVVLEAADVLYIPRGTIHQAEAQMHSSS-HLTLSTYQRWSAADLVQYAVSVALAN 245
Query: 223 AGATDL---EFRRGLPIGYLRYAGLA 245
L + GLP+ +L A L+
Sbjct: 246 PQLQPLLAPPLKAGLPLRWLSDASLS 271
>gi|294817685|ref|ZP_06776327.1| Cupin 4 family protein [Streptomyces clavuligerus ATCC 27064]
gi|294322500|gb|EFG04635.1| Cupin 4 family protein [Streptomyces clavuligerus ATCC 27064]
Length = 414
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 81/246 (32%), Positives = 131/246 (53%), Gaps = 20/246 (8%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR---QTMDV 57
D+WEK+PL + R+ +YY + +T +D IL + + ++ ++ + +G+ Q M
Sbjct: 35 DYWEKKPLIVHREDPHYYTDPLTFHDVDQILSTSSV---RSSELKVVVNGREIPLQEMTA 91
Query: 58 EGRALP----QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
G P ++++D YR+G ++ F +PL +L +L F AN Y TP
Sbjct: 92 SGPGGPANGLEVLYDLYRKGSTVVFKFLHERWEPLGRLCRTLAAEFSAAFQANAYLTPAA 151
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP-NFSQEEIGTPILTVTLE 172
AQG HYD + F+LQ+ G K W++Y LP S P ++E G P+ TL
Sbjct: 152 AQGLTSHYDTHDVFVLQIWGSKHWRLY---DSPAELPLQSQPFRRTKEGPGNPVREFTLN 208
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
GD++YLPRG +H A T ++ SLHVT+ V Q W +L+ A+ +A++ +D +R
Sbjct: 209 AGDMMYLPRGTVHDA-TAQDQASLHVTLGV-QPVLWATVLKDALDRAIE----SDPRYRS 262
Query: 233 GLPIGY 238
LP G+
Sbjct: 263 SLPPGF 268
>gi|326446609|ref|ZP_08221343.1| Cupin 4 family protein [Streptomyces clavuligerus ATCC 27064]
Length = 385
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 81/246 (32%), Positives = 131/246 (53%), Gaps = 20/246 (8%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR---QTMDV 57
D+WEK+PL + R+ +YY + +T +D IL + + ++ ++ + +G+ Q M
Sbjct: 6 DYWEKKPLIVHREDPHYYTDPLTFHDVDQILSTSSV---RSSELKVVVNGREIPLQEMTA 62
Query: 58 EGRALP----QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
G P ++++D YR+G ++ F +PL +L +L F AN Y TP
Sbjct: 63 SGPGGPANGLEVLYDLYRKGSTVVFKFLHERWEPLGRLCRTLAAEFSAAFQANAYLTPAA 122
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP-NFSQEEIGTPILTVTLE 172
AQG HYD + F+LQ+ G K W++Y LP S P ++E G P+ TL
Sbjct: 123 AQGLTSHYDTHDVFVLQIWGSKHWRLY---DSPAELPLQSQPFRRTKEGPGNPVREFTLN 179
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
GD++YLPRG +H A T ++ SLHVT+ V Q W +L+ A+ +A++ +D +R
Sbjct: 180 AGDMMYLPRGTVHDA-TAQDQASLHVTLGV-QPVLWATVLKDALDRAIE----SDPRYRS 233
Query: 233 GLPIGY 238
LP G+
Sbjct: 234 SLPPGF 239
>gi|405354226|ref|ZP_11023606.1| mina protein [Chondromyces apiculatus DSM 436]
gi|397092469|gb|EJJ23227.1| mina protein [Myxococcus sp. (contaminant ex DSM 436)]
Length = 392
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 145/300 (48%), Gaps = 30/300 (10%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
+HWE++PL + +++ L + + +D ++R + +T + ++++
Sbjct: 21 EHWEQKPLVTRGRAPDFFAPLFSLRDVDRVIRYQKPGPGRLDLVTAGGFVRDNFLNLDNT 80
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
A +V+++Y +G ++ + +PL +L+ V Y TPP G PH
Sbjct: 81 ANINLVYENYLKGSTVILSGLEETWEPLVVFCRNLEGQLSHPVAVAVYLTPPGHHGVQPH 140
Query: 121 YDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
+D E FILQ+EG K WKVY LPR+ Y P ++E + +L L PGD
Sbjct: 141 FDTQENFILQVEGVKHWKVYGVGQELPRVEGSYTP------VARERLPELLLETDLHPGD 194
Query: 176 LLYLPRGYIHQASTVTNEH-SLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
+LY+PRG++H+A EH SLH+T+ V+ +T W D LE AL A D FRR L
Sbjct: 195 MLYVPRGFVHEAE--AREHASLHITVDVHVRT-WRDFLE----DALAAMADRDPRFRRSL 247
Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
P G LR G +++ ++L+A V L D + + ++L+ PP
Sbjct: 248 PPGLLRDPGAVN-----------SLESGFRELVAGFHRDVRLSDALAKHAEKLIVARPPP 296
>gi|260791084|ref|XP_002590570.1| hypothetical protein BRAFLDRAFT_123630 [Branchiostoma floridae]
gi|229275765|gb|EEN46581.1| hypothetical protein BRAFLDRAFT_123630 [Branchiostoma floridae]
Length = 572
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 69/176 (39%), Positives = 102/176 (57%), Gaps = 7/176 (3%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
++WEK+PL R Y L + + +L+ + IE+ +++++ Y G+R++++
Sbjct: 197 EYWEKKPLIAKRNDAAVSEAYKALFSRDVLKKLLKKHDIEYIRDVNVCRYVSGKRESLNG 256
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
RA QI + +++F PQ + L QL + L+ LFGC VGAN Y TPP +QG
Sbjct: 257 TERATCKQIDKLFDQSKATLQFHQPQRFQDKLWQLCSLLECLFGCLVGANVYMTPPGSQG 316
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
APHYDD+E FILQLEG+K W++Y P VD LPR S + Q+ IG P LE
Sbjct: 317 LAPHYDDVEVFILQLEGRKHWRLYTP-PVD--LPRDYSRDLEQDNIGQPTHDFVLE 369
>gi|115375458|ref|ZP_01462718.1| mina protein [Stigmatella aurantiaca DW4/3-1]
gi|310821399|ref|YP_003953757.1| cupin 4 family protein [Stigmatella aurantiaca DW4/3-1]
gi|115367501|gb|EAU66476.1| mina protein [Stigmatella aurantiaca DW4/3-1]
gi|309394471|gb|ADO71930.1| Cupin 4 family protein [Stigmatella aurantiaca DW4/3-1]
Length = 383
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 158/298 (53%), Gaps = 31/298 (10%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNID-ITLYEDGQRQT---MD 56
+ WE++PL + + + ++ L +S+ + +L + ++I+ + L ++G+ + +
Sbjct: 18 EAWERKPLVL-QGPPDRWSGLFSSRDLGRLL---TYQPPRSIEGMMLVKEGRHRDENWLS 73
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G + V +REG +I + +P+ + A+++E VG N Y TPP AQG
Sbjct: 74 PDGSPRLEQVQAAWREGYTIVINKVGQFWEPVGRFCAAVEEELHHPVGVNLYMTPPGAQG 133
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
F H+D ++AF+LQ+EG K W+V P+ V LP + S E + +L L+ GD+
Sbjct: 134 FKAHFDIMDAFVLQVEGSKVWQVRGPQ-VTLPLPDEHTAT-SSESLPPVLLEQELKRGDV 191
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
LY+PRG++H+A T HS+H+T+ + Q W DL A+ AA D FR+GLP
Sbjct: 192 LYIPRGFVHEARTAQT-HSVHLTLGL-QAVTWSDLFVAAI-----AAARRDERFRKGLPP 244
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
+L + + M++ ++LLA+L +++L + ++ ++L+ PP
Sbjct: 245 RFLEGSAM--------------MEQTFRELLAELPRHLELGHALTQLAERLVVQKPPP 288
>gi|390478066|ref|XP_003735410.1| PREDICTED: LOW QUALITY PROTEIN: MYC-induced nuclear antigen-like
[Callithrix jacchus]
Length = 488
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/332 (28%), Positives = 151/332 (45%), Gaps = 62/332 (18%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI--LRDNVIEFKKNIDITLYEDGQRQTMDVE 58
+ WE++PL I R + K+ D+ L + + +++++ G+++ ++ +
Sbjct: 57 EFWEQKPLLIRRDDPALATYCGSLFKLTDLKSLCSQGLYYGRDVNVCRCVGGKKKVLNKD 116
Query: 59 GRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 117 GKAHFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYITPAGSQGL 176
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PHYDD+E FILQLEG+K W++Y P + L R S ++E IG P+ TL+ D +
Sbjct: 177 PPHYDDVEVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEERIGRPVHEFTLKEKDHV 232
Query: 178 Y-------LPRGYIHQ-----------------ASTVTNE-HSLHVTISVYQKTAWIDLL 212
+P G + Q A T+ HS HVTIS YQ +W D L
Sbjct: 233 VQGCACELVPSGPLQQNLVSVPKDETDFVAVAEADTLAGAGHSTHVTISXYQNNSWGDFL 292
Query: 213 EKAMPKALQAAGATDLEFRRGLP----------IGYLRYAGLARGKPVDIQADRLAMKEN 262
+ + D+E R G+P + R +G R + ADRL E
Sbjct: 293 LDTISGLVFDTAEEDVELRAGIPRQLLLVESTAVATRRLSGFLR-----MLADRL---EG 344
Query: 263 LKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
K+LL+ +M K + LPP
Sbjct: 345 TKELLS------------SDMKKDFIMHRLPP 364
>gi|114570978|ref|YP_757658.1| cupin 4 family protein [Maricaulis maris MCS10]
gi|114341440|gb|ABI66720.1| Cupin 4 family protein [Maricaulis maris MCS10]
Length = 394
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 140/300 (46%), Gaps = 31/300 (10%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
D+ EK+PL + R+ Y L++ +IDDI LR+ ++ ++ ED T
Sbjct: 26 DYHEKKPLIVHREDPGRYAGLLSIARIDDIVSSIDLREGALDMARSEPPVQREDYMFDTG 85
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQTYI--KPLQQLNASLQELFGCFVGANTYQTPPT 113
V+ V + YR+G +I + PQ ++ L + +++ L C V N Y TPP
Sbjct: 86 YVDRGG----VANQYRQGATI--ILPQLHMMDAVLGEFCRAVESLLSCHVQTNIYLTPPD 139
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
QGF HYDD + F++Q+EG+K W+ Y + + Y P+ + G P+ L+
Sbjct: 140 NQGFNTHYDDHDVFVMQIEGEKLWRFYETPVENPYRGEGFRPD--AHKAGEPVAEFVLKA 197
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
G+ +Y+PRG +H A T + SLH+T+ + KT W DL+ +A+ + A R
Sbjct: 198 GECIYVPRGLMHDAQTHGDTASLHITLGLIVKT-WADLMLEAVSEVALRTPA----MRHS 252
Query: 234 LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP 293
LP G+ R DR + +D+ L + +D +D + +P
Sbjct: 253 LPPGFAR-----------PDFDRTDAEVQFRDMAEMLAREMSVDGAMDFFVDSFIRSRVP 301
>gi|83859809|ref|ZP_00953329.1| hypothetical protein OA2633_07409 [Oceanicaulis sp. HTCC2633]
gi|83852168|gb|EAP90022.1| hypothetical protein OA2633_07409 [Oceanicaulis sp. HTCC2633]
Length = 387
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/255 (32%), Positives = 122/255 (47%), Gaps = 35/255 (13%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
+E+ L+ +N + +LI+ ID IL ++ L +G E R L
Sbjct: 22 FEQTHLHAPGTDRNRFASLISLDAIDRILAED-----------LLREGDLSMARAEPR-L 69
Query: 63 PQIVWDH-------------YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
P W Y++G ++ Q +PL L L+ F C V N Y
Sbjct: 70 PDRAWLREDGLVDRGEVARLYQQGATLILPQLQARHRPLADLCRQLEAEFSCPVQTNIYL 129
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILT 168
TPP AQGF HYD+ + +LQ+EG K+W++Y P V R++ F+Q E P
Sbjct: 130 TPPNAQGFQTHYDNHDVLVLQVEGSKRWRLYDAPVGVPYRGERFTPGRFAQTE---PRAE 186
Query: 169 VTLEPGDLLYLPRGYIHQA-STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATD 227
+ L PGD+LY+PRG +H A + ++E SLH+T + KT W D L +A+ + A
Sbjct: 187 LVLNPGDVLYVPRGLMHDAVNEGSDEASLHITTGLLAKT-WADFLLEAVSE----AALRT 241
Query: 228 LEFRRGLPIGYLRYA 242
+ RR LP GY R A
Sbjct: 242 PQLRRALPPGYARGA 256
>gi|405354225|ref|ZP_11023605.1| hypothetical protein A176_7050 [Chondromyces apiculatus DSM 436]
gi|397092468|gb|EJJ23226.1| hypothetical protein A176_7050 [Myxococcus sp. (contaminant ex DSM
436)]
Length = 383
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 150/302 (49%), Gaps = 39/302 (12%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNI-DITLYEDGQRQT---MD 56
+ WE++PL + +++L S+ + +L ++I D+ L +G+ + +
Sbjct: 18 ETWERKPLVLP-GPPGRWSDLFASRDLGRLL---AYRTPRSIEDMVLVREGRHRDENWLA 73
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G + V +REG ++ + +P+ + A+++E VGAN Y TPP AQG
Sbjct: 74 PDGSPRLEQVQAAWREGFTLVINRLGRFWEPVGRFCAAVEEELHHPVGANLYMTPPGAQG 133
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE---- 172
F H+D ++AF+LQ+EG K W+V P+ V LP P+ E+ +P+ V +E
Sbjct: 134 FKAHFDVMDAFVLQVEGAKVWQVRGPQRV---LP---LPDEHTTELSSPLPPVLMEHELK 187
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
G++LY+PRG+IH+A EHS+H+T+ + Q W +LL A+ A D RR
Sbjct: 188 SGEVLYIPRGFIHEARA-AREHSVHLTLGL-QAITWEELLRAAI-----AVARRDERLRR 240
Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
GLP +L M + LL +L ++LD + ++ ++L+
Sbjct: 241 GLPPRFLEGPA--------------TMAPMFQALLEELPRQLELDAALTQLAERLVVQKP 286
Query: 293 PP 294
PP
Sbjct: 287 PP 288
>gi|317123668|ref|YP_004097780.1| cupin 4 family protein [Intrasporangium calvum DSM 43043]
gi|315587756|gb|ADU47053.1| cupin 4 family protein [Intrasporangium calvum DSM 43043]
Length = 411
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 99/314 (31%), Positives = 146/314 (46%), Gaps = 33/314 (10%)
Query: 1 DHWEKEPLY-ISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQT 54
DHW ++PL + + +L + + +D++ LR + KN TL +
Sbjct: 33 DHWGRQPLVSLGADLAGGFEDLFSLEAVDELVSRRGLRTPFVRVAKNGQ-TLPDSAFTAG 91
Query: 55 MDVEGRALPQIVWDH----YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
V Q+ D + +G +I P+ A+L E G V AN Y T
Sbjct: 92 GGVGAGIADQVSDDKLLRLFADGATIVLQGLHRTWAPVSDFVAALAEDLGHPVQANAYVT 151
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG-----TP 165
P +QGF HYD + F+LQ+ G+K W++ LP V + R +E +G P
Sbjct: 152 PRQSQGFNDHYDVHDVFVLQVAGEKHWRIRLP--VHLWPTRDQPWTAYRERVGQAAAEPP 209
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
+L VTL GD LYLPRG++H A T T E S H+T+ V+ T L E + +A+ A
Sbjct: 210 LLDVTLRAGDCLYLPRGFLHSA-TATEEVSAHLTLGVHTWTR-AHLAEALVAEAMHA--L 265
Query: 226 TDLEFRRG-LPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMG 284
D E R LP+ G+ G P + AD + L+++LA LVE +D D +G
Sbjct: 266 VDAESERAPLPL------GVDVGNPAGV-ADEVT---RLREVLAGLVEGIDPDRVAARLG 315
Query: 285 KQLMHDALPPVLSP 298
+Q A P LSP
Sbjct: 316 RQARAAARPGPLSP 329
>gi|359458845|ref|ZP_09247408.1| cupin 4 family protein [Acaryochloris sp. CCMEE 5410]
Length = 405
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 128/252 (50%), Gaps = 31/252 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM----DV 57
+WE + LY+ R ++Y +++ ID +L++ + N + L + G + ++ D
Sbjct: 23 YWETKTLYLPRNDASFYESVLNPDDIDLLLQNKALLADYN-NFRLVDQGNKLSLEDWCDR 81
Query: 58 EGRALPQIV-----WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
++ + + +G ++ I L+ ++L+ + N Y TPP
Sbjct: 82 HSKSQQYFINNDRLYSLLHQGLTLTINGAHKKIPKLRHFCSALECELKFKLRTNIYITPP 141
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMV-DEYLPRYSSPNFSQEEIGTPI 166
AQG APHYD+ + FILQ+ G K+WK+Y LP + D+ + R+ E+ P
Sbjct: 142 QAQGLAPHYDEHDVFILQITGAKEWKLYHSPVELPSHIRDQSIGRH--------ELAEPE 193
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
LTV L+PGDLLY+PRG +HQA++ S+H ++ +Y A+ +LLE+ L
Sbjct: 194 LTVMLQPGDLLYIPRGVVHQAASQETT-SVHASLGLYPTFAY-ELLEE-----LVTIAQA 246
Query: 227 DLEFRRGLPIGY 238
D FR+ +P G+
Sbjct: 247 DPAFRKAIPHGF 258
>gi|144900526|emb|CAM77390.1| MYC induced nuclear antigen [Magnetospirillum gryphiswaldense
MSR-1]
Length = 390
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 137/289 (47%), Gaps = 34/289 (11%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDIL-------RDNVIEFKKNIDITLYEDGQRQ 53
++WEK+PL + R +Y +L++ + ID +L RD I + D E+ +
Sbjct: 21 EYWEKKPLLVKRAAPGFYRDLLSVQAIDQVLAMPGLHRRD--IRVARGTDPLAVEEYADK 78
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
+ +L ++ D G +I ++PL ++ + +++ N Y TP
Sbjct: 79 DGFINAASLSRLFTD----GFTIILNTLNLKLRPLAEICRAFEQVLSIPCQTNIYYTPRL 134
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP-RYSSPNFSQEEIGTPILTVTLE 172
AQGF PHYD + F+ Q+ G+K W V LP R E G + LE
Sbjct: 135 AQGFKPHYDSHDVFVFQVAGRKHWLV---NDTPVELPLRGQGFEAGLYEPGDVTMEFDLE 191
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
PGDLLY+PRG +H A T ++E SLH+T+ T+W + + +A+ AA TD+E RR
Sbjct: 192 PGDLLYIPRGVMHGART-SDEVSLHITLGAL-TTSW----AEVLLEAVAAAALTDVELRR 245
Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
LP GY D QA R + LL ++ E +D++ +D
Sbjct: 246 NLPAGYALPG-------YDAQAAR----QTFASLLNRVAENIDVESILD 283
>gi|158335076|ref|YP_001516248.1| cupin 4 family protein [Acaryochloris marina MBIC11017]
gi|158305317|gb|ABW26934.1| cupin 4 family protein, putative [Acaryochloris marina MBIC11017]
Length = 416
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 130/252 (51%), Gaps = 31/252 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM----DV 57
+WE + LY+ R ++Y +++ + ID +L++ + N + L + G + ++ D
Sbjct: 34 YWETKTLYLPRNDASFYGSVLQPEDIDLLLQNKALLADYN-NFRLVDQGNKLSLEDWCDR 92
Query: 58 EGRALPQIV-----WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
++ + + +G ++ I L+ ++L+ + N Y TPP
Sbjct: 93 HSKSQQYFINNDKLYSLLHQGLTLTINGAHKKIPKLRHFCSALECELKFKLRTNIYITPP 152
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMV-DEYLPRYSSPNFSQEEIGTPI 166
AQG APHYD+ + FILQ+ G+K+WK+Y LP + D+ + R+ ++ P
Sbjct: 153 QAQGLAPHYDEHDVFILQITGEKEWKLYHSPVELPSHIRDQSIGRH--------KLAEPE 204
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
LTV L+PGDLLY+PRG +HQA++ S+H ++ +Y A+ +LLE+ L
Sbjct: 205 LTVMLQPGDLLYIPRGVVHQAAS-QETTSVHASLGLYPTFAY-ELLEE-----LVTIAQA 257
Query: 227 DLEFRRGLPIGY 238
D FR+ +P G+
Sbjct: 258 DPAFRKAIPHGF 269
>gi|389865125|ref|YP_006367366.1| hypothetical protein MODMU_3475 [Modestobacter marinus]
gi|388487329|emb|CCH88887.1| Conserved protein of unknown function; putative cupin and
clavaminate synthase domains [Modestobacter marinus]
Length = 443
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 116/252 (46%), Gaps = 21/252 (8%)
Query: 1 DHWEKEPLYISRKQ-KNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQ 51
+HW + PL + +Q + + +L+T +D++L R + K+ +D + Q
Sbjct: 37 EHWARRPLLSTAEQLGSTFTDLLTLDAVDELLSTRGLRTPFLRIAKDGAVVDTKRFTSPQ 96
Query: 52 RQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
+V + V + +G ++ PL L G N Y TP
Sbjct: 97 GAGAEVADQVSSDAVLRLFADGSTVVLQGLHRLWPPLIAFAGQLAADLGHPTQVNAYITP 156
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTP 165
P+++GF+PHYD + F+LQ+ G+K W ++ P + D P + P + P
Sbjct: 157 PSSRGFSPHYDVHDVFVLQVAGEKHWTIHEPLLAD---PLRTHPWADRAAEVAAAAERAP 213
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
++ L PGD LYLPRGYIH A + E S H+T+ V+ T W +E A+ ++ A
Sbjct: 214 VIDTVLRPGDALYLPRGYIHSAVAL-GEISAHLTVGVHSVTRW-GAVESAL-DLVRTLAA 270
Query: 226 TDLEFRRGLPIG 237
D R LP+G
Sbjct: 271 DDPALRGSLPLG 282
>gi|320167141|gb|EFW44040.1| hypothetical protein CAOG_02065 [Capsaspora owczarzaki ATCC 30864]
Length = 522
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 81/142 (57%), Gaps = 6/142 (4%)
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP--RY 152
L+E+FG N Y TP ++GF PHYD FI QL G K W+VYLP V LP
Sbjct: 113 LEEIFGVHTNGNLYSTPKFSRGFTPHYDVHSTFIFQLSGNKDWEVYLPETV---LPIGGL 169
Query: 153 SSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
S N+S +G P+++ +L GDLLYLPRG++HQA T T+ SLH+T+S+ A++ +
Sbjct: 170 LSQNYSGRALGEPVMSFSLRKGDLLYLPRGWVHQA-TATDAPSLHLTVSLGHTMAYMTAV 228
Query: 213 EKAMPKALQAAGATDLEFRRGL 234
A + + D F + +
Sbjct: 229 LDACMEVAAGSKLADSPFVQAI 250
>gi|310821398|ref|YP_003953756.1| cupin 4 family protein [Stigmatella aurantiaca DW4/3-1]
gi|309394470|gb|ADO71929.1| Cupin 4 family protein [Stigmatella aurantiaca DW4/3-1]
Length = 421
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 150/306 (49%), Gaps = 41/306 (13%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKK----NIDITLYEDG--QRQT 54
++WE++PL + + ++ L + + +D VI ++K +D+ + E G +
Sbjct: 41 EYWEQKPLVTRGRAQGFFAPLFSIRDVD-----RVICYQKPGPGRLDL-VTEGGFVRDNF 94
Query: 55 MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
++++ A +V+++Y +G ++ + +PL L+ V Y TPP
Sbjct: 95 LNLDNTANINLVYENYLKGSTVILSGLEETWEPLVVFCRKLEGQLSHPVAVAVYLTPPNH 154
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
G PH+D E FILQ++G K WKVY LPR+ Y P ++E + +L
Sbjct: 155 HGVQPHFDTQENFILQVDGVKHWKVYGAGQELPRVEGSYTP------VARERLPELLLET 208
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
L PGD+LY+PRG++H+A + SLH+T+ V+ +T W D LE AL A +
Sbjct: 209 ELHPGDMLYVPRGFVHEAEA-RDSASLHITVDVHVRT-WRDFLE----DALAAMADRNPR 262
Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
FR+ LP GL G Q ++E ++L+ + V L D + + ++L+
Sbjct: 263 FRKSLP------PGLLNGSHAKAQ-----LEEGFRELMEMVHREVRLSDALGKHAEKLI- 310
Query: 290 DALPPV 295
A PP+
Sbjct: 311 VARPPL 316
>gi|89052876|ref|YP_508327.1| cupin 4 [Jannaschia sp. CCS1]
gi|88862425|gb|ABD53302.1| Cupin 4 [Jannaschia sp. CCS1]
Length = 392
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 133/282 (47%), Gaps = 39/282 (13%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE-- 58
+++EK+P+ I R Q Y+++L++ +ID ++ + + I++T DG D
Sbjct: 22 EYFEKKPMLIKRGQPGYFSDLLSYGEIDRVVSTMGLHVPE-INVT-RADGNITPADFAYE 79
Query: 59 -GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
G+ P V + +G ++ + L + +++ V N Y TPP QGF
Sbjct: 80 TGQIDPVRVNQLHADGATVILSGLHERLPALARYCRAMEAAMSARVQTNIYMTPPGNQGF 139
Query: 118 APHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
PHYD + +LQ+ G K+W++Y LP + D+ R ++G LE
Sbjct: 140 NPHYDGHDVLVLQVAGTKEWRIYGTPVELP-LADQAFER-------GMDVGEEAQRFVLE 191
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
PGD +Y+PRG H A T+E SLH+T + +T W D A+ +A+ A + RR
Sbjct: 192 PGDAVYIPRGMAHDA-VATDETSLHITTGLMFRT-WAD----ALAEAVIAKAHREPSLRR 245
Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV 274
LP G+ VD+ ++ KD A+L+E V
Sbjct: 246 ALPPGFANNG-------VDL--------DDYKDTFAELIELV 272
>gi|156378502|ref|XP_001631181.1| predicted protein [Nematostella vectensis]
gi|156218217|gb|EDO39118.1| predicted protein [Nematostella vectensis]
Length = 218
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 82/142 (57%), Gaps = 3/142 (2%)
Query: 198 VTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRL 257
+T+S YQK W D LE +P+ALQ A D EFR+GLP+ YL Y+G+A V +R
Sbjct: 1 MTLSTYQKNTWGDFLEMLVPQALQVAFEEDPEFRQGLPLNYLNYSGVANSDTV--TKERS 58
Query: 258 AMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGE 317
++ L+ KL+ + +D D+M +++ DALPPVL+ E SV+ +G + G+
Sbjct: 59 DFLRRVEKLMTKLISHAPVDAAADQMAVRMLQDALPPVLTEAERSHSVYGSGASW-EDGK 117
Query: 318 VYNATQITKDTKVRLVRANAVR 339
+ N IT DT VRL+R R
Sbjct: 118 IVNMMTITGDTDVRLIRRGVAR 139
>gi|152965307|ref|YP_001361091.1| cupin [Kineococcus radiotolerans SRS30216]
gi|151359824|gb|ABS02827.1| Cupin 4 family protein [Kineococcus radiotolerans SRS30216]
Length = 434
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 74/258 (28%), Positives = 119/258 (46%), Gaps = 34/258 (13%)
Query: 1 DHWEKEPLYI------SRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ--- 51
+HW PL + + + ++L++ +D++L + + +L +DG
Sbjct: 37 EHWNTRPLLVRAADRAAEGGRASVHDLLSPADVDELLGPRAL---RTPFFSLVQDGTPLP 93
Query: 52 RQTMDVEGRALPQIVWD---------HYREGCSIRFLNPQTYIKPLQQLNASLQELFGCF 102
R + A Q + D + G +I LQ + L G
Sbjct: 94 RSSYTRRAVAGNQQLADLPDTDRVAAAHAGGATIVLQALHRTWPALQTFCSQLAADLGHQ 153
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPN--FSQE 160
N Y TPP AQGF PH+D + +LQ++G+K W ++ P + LP S P+ +
Sbjct: 154 CQVNVYVTPPGAQGFKPHHDTHDVVVLQVDGRKHWTIHPPAV---ELPLKSQPSTQLGPD 210
Query: 161 EIG--TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPK 218
+G P + LEPGD LYLPRG++H A T T + S+H+T+ + T W D+L A+
Sbjct: 211 PVGGRPPAIDTVLEPGDALYLPRGWLHSART-TEDRSIHLTVGLLATT-WADVLTDAV-- 266
Query: 219 ALQAAGATDLEFRRGLPI 236
+AG D+ RR LP+
Sbjct: 267 --ASAGVADVALRRALPL 282
>gi|397628413|gb|EJK68879.1| hypothetical protein THAOC_09906, partial [Thalassiosira oceanica]
Length = 727
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 83/154 (53%), Gaps = 21/154 (13%)
Query: 111 PPTAQG--FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-----G 163
PPTA+ P ++LEG K+WKVY P E LPR SS +++++EI
Sbjct: 337 PPTARARRLGPLPPQPARVRVRLEGYKRWKVYAPFNKRETLPRESSRDYTEKEIEDAEET 396
Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVT------------NEHSLHVTISVYQKTAWIDL 211
+ +T L PGD+LYLPRG++HQA TV ++HSLH+T+S Q W D
Sbjct: 397 SEAMTCVLGPGDILYLPRGWVHQAETVARPAYMPRLPGVKDDHSLHLTVSAMQNWCWADY 456
Query: 212 LEKAMPKALQAAGATD--LEFRRGLPIGYLRYAG 243
LE MP AL AA +D R GLP +L Y G
Sbjct: 457 LEILMPGALDAAVGSDKTTALREGLPRNFLEYMG 490
>gi|324511549|gb|ADY44803.1| Lysine-specific demethylase NO66 [Ascaris suum]
Length = 259
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 101/182 (55%), Gaps = 13/182 (7%)
Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
TPI +E GDLLY+PRG+IHQ T + HSLH+T+SV ++TA++DLLE+ +P + +
Sbjct: 5 TPIFDGWIEHGDLLYIPRGFIHQGYTEASIHSLHLTVSVCRRTAFVDLLERFLPLVMGSL 64
Query: 224 GATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLD------ 277
+L R LPIGYL G+ P+ Q + A + + ++L K + + D
Sbjct: 65 AEKNLSMRSSLPIGYLDMGGVL---PLSYQLNNKA-SDKIFNVLDKQLAALKKDALDAYE 120
Query: 278 DGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANA 337
+D M ++ + ALPP+L+ EE Q S G+ + + T +T+++ +R +A
Sbjct: 121 SAIDMMAREYLKTALPPLLTGEERQLSAI-GGVGFDILCK--KQKKFTVNTQIKFIRRHA 177
Query: 338 VR 339
R
Sbjct: 178 QR 179
>gi|443327728|ref|ZP_21056347.1| Cupin superfamily protein [Xenococcus sp. PCC 7305]
gi|442792718|gb|ELS02186.1| Cupin superfamily protein [Xenococcus sp. PCC 7305]
Length = 396
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 133/281 (47%), Gaps = 28/281 (9%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
++EKE YISR +NYYN ++ ++ ID ++ ++ + + G R
Sbjct: 21 YYEKEYFYISRHDENYYNGILNTQDIDLFFQNQSLQ---TSGLRVCNKGGEIPASQWRRG 77
Query: 62 LPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
L QI ++ +G ++ + + I L L+ + +N Y TPP AQG
Sbjct: 78 LSQIADNDKLFVLLNQGNTLIINSGNSSIIKLINYCHVLENELNFRLQSNIYITPPDAQG 137
Query: 117 FAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
FA HYDD + FI+Q G K W++Y P + ++ + E+G P ++PGD
Sbjct: 138 FAYHYDDHDVFIMQTTGTKTWRLYHSPIEIPTKKQNHAKFMKGKYELGEPTFEQEIKPGD 197
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LLY+PRG +H A T T++ S+H+T+ ++ + +LL++ + A +L FR+ +P
Sbjct: 198 LLYIPRGLVHDAVT-TDQASVHITLGLHPSYRF-ELLQEIVNLA-----QDNLLFRKAIP 250
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDL 276
P + + A K K L ++ VD+
Sbjct: 251 ------------NPFTPEYQKKAFKAEFKQLCHDFIDNVDI 279
>gi|374610812|ref|ZP_09683602.1| transcription factor jumonji jmjC domain-containing protein
[Mycobacterium tusciae JS617]
gi|373550228|gb|EHP76876.1| transcription factor jumonji jmjC domain-containing protein
[Mycobacterium tusciae JS617]
Length = 405
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 131/290 (45%), Gaps = 40/290 (13%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDI---TLYEDG 50
++W ++PL +SR +++L++ +D+++ R I K D+ T Y
Sbjct: 16 EYWGRQPL-LSRHTALPRDFSDLLSPTAVDELIAERGVRAPFIRMAKTGDVLARTCYTGP 74
Query: 51 QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
++ + V + G +I PL L + G V AN Y T
Sbjct: 75 AGFGAEMPDQVDSTKVLSEFAAGATIVLQGLHRLWPPLIDFVRGLTDDLGHPVQANAYIT 134
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI------GT 164
PP AQGF PHYD + F+LQ G+K+W V+ P + P S P +
Sbjct: 135 PPDAQGFEPHYDVHDVFVLQAAGEKRWTVHEPV---HHHPLDSQPWTEHRDAIAARISDA 191
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
PI+ L+ GD LYLPRG++H A + N S+H+TI V T ++D++ +A+
Sbjct: 192 PIIDTVLQAGDALYLPRGWVHSARS-ENTTSIHLTIGVSALT-YLDVV-----RAVVDTL 244
Query: 225 ATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV 274
A D EFR+ LP+G +D DR M ++A+L + V
Sbjct: 245 AADDEFRKSLPMG------------IDTT-DRSEMASTTSKVMAQLADAV 281
>gi|258653233|ref|YP_003202389.1| cupin [Nakamurella multipartita DSM 44233]
gi|258556458|gb|ACV79400.1| Cupin 4 family protein [Nakamurella multipartita DSM 44233]
Length = 452
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 123/284 (43%), Gaps = 34/284 (11%)
Query: 2 HWEKEPLYISRKQKNY-YNNLITSKKIDDI-----LRDNVIEFKKNIDI---TLYEDGQR 52
+W + PL + + N +++L ++ +D++ LR + KN + + G
Sbjct: 38 YWAQAPLLTTAAELNDDFSDLFSADSVDELVSERGLRTPFLRMAKNGSVLSSASFTRGGG 97
Query: 53 QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
+ + V G ++ PL + + L G V N Y TPP
Sbjct: 98 AGATITDQVADDKVLAQLAGGATLVLQALHRTWPPLVRFGSELAAELGHPVQINAYITPP 157
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS----SPNFSQEEIGTPILT 168
QGFA HYD + F+LQ+ G K W+++ P + D LP + P +
Sbjct: 158 QNQGFASHYDTHDVFVLQIAGTKHWRIHEPVLPDP-LPHQTWDGRRAQVQDRAAQAPAID 216
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL 228
L PGD LYLPRGY+H A E S+H+TI V+ T + DL + L AA D
Sbjct: 217 ALLRPGDALYLPRGYLHSA-VAQGELSIHLTIGVHPLTGY-DLARE-----LIAAAEDDP 269
Query: 229 EFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE 272
E RR LP+G VD+ D AM +L+ +LV+
Sbjct: 270 ELRRSLPMG------------VDV-TDVDAMATHLRQAAQRLVD 300
>gi|325185748|emb|CCA20228.1| nucleolar protein putative [Albugo laibachii Nc14]
Length = 287
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 72/136 (52%)
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
+ TLE GD+LY+PRG IHQA T S HVTISVYQ+ +W + +E AMP+ + A A
Sbjct: 1 MEFTLEQGDVLYMPRGVIHQAHTDNKVFSTHVTISVYQRQSWSNFMELAMPRIIHRAFAN 60
Query: 227 DLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQ 286
D+EFRRG+P YL + G D + L L++ +E DL D
Sbjct: 61 DVEFRRGVPPQYLNFLGSQYSGSTQSVKDFCNTLKQLVSKLSENLEIQDLQHAADLASLD 120
Query: 287 LMHDALPPVLSPEELQ 302
+ LPP +PE Q
Sbjct: 121 FITHRLPPPDTPESSQ 136
>gi|83717208|ref|YP_439902.1| hypothetical protein BTH_II1708 [Burkholderia thailandensis E264]
gi|167616559|ref|ZP_02385190.1| hypothetical protein BthaB_09673 [Burkholderia thailandensis Bt4]
gi|257143062|ref|ZP_05591324.1| hypothetical protein BthaA_28128 [Burkholderia thailandensis E264]
gi|83651033|gb|ABC35097.1| unnamed protein product [Burkholderia thailandensis E264]
Length = 397
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 115/244 (47%), Gaps = 21/244 (8%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDIT--LYEDGQRQTMDVEG 59
+W ++PL + R+ + Y L S++ +L + I + + EG
Sbjct: 22 YWGRKPLIVRRQAPHLYACLPDSEEFAFLLHSLTDPERGWFSIVNGVARPPSDSLLTQEG 81
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCF-------VGANTYQTPP 112
V+ YR+G S+ Q + L ++ +GAN Y +PP
Sbjct: 82 LLNLSEVYAAYRDGNSLLMNQVQRRHRETAMLCRRIESALSAHGIALARHIGANGYLSPP 141
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
++QGF HYD + ILQ+EG+K W++Y + P + EE G+P L
Sbjct: 142 SSQGFNIHYDPHDVLILQIEGRKHWRLYGRHVAWPTQPPATP--IPPEEAGSPRREFVLS 199
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
PG+L+Y+PRG +H A+T T+ SLH+T+S+ T W DLL +AM + + FRR
Sbjct: 200 PGELVYIPRGVLHDANT-TDSRSLHLTLSIETLT-WTDLLIEAM--------SDNPAFRR 249
Query: 233 GLPI 236
LP+
Sbjct: 250 NLPV 253
>gi|148665809|gb|EDK98225.1| myc induced nuclear antigen, isoform CRA_a [Mus musculus]
Length = 242
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 93/171 (54%), Gaps = 13/171 (7%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + + + V + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSISGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GRA Q+ D ++ +I+F PQ Y L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTP 165
PHYDD+E FILQLEG K W++Y P + EY S++ IGTP
Sbjct: 176 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTP 220
>gi|148665811|gb|EDK98227.1| myc induced nuclear antigen, isoform CRA_c [Mus musculus]
gi|148665812|gb|EDK98228.1| myc induced nuclear antigen, isoform CRA_c [Mus musculus]
Length = 243
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 93/171 (54%), Gaps = 13/171 (7%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + + + V + +++++ G+++ ++
Sbjct: 58 EFWEQKPLLIQRDDPVLAKYYQSLFSLSDLKRLCKKGVY-YGRDVNVCRSISGKKKVLNK 116
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+GRA Q+ D ++ +I+F PQ Y L ++ L+ FG VG+N Y TP +QG
Sbjct: 117 DGRAHFLQLRKDFDQKRATIQFHQPQRYKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 176
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTP 165
PHYDD+E FILQLEG K W++Y P + EY S++ IGTP
Sbjct: 177 LPPHYDDVEVFILQLEGTKHWRLYSPTVPLAHEYSVE------SEDRIGTP 221
>gi|115375463|ref|ZP_01462723.1| chromosome 14 open reading frame 169, putative [Stigmatella
aurantiaca DW4/3-1]
gi|115367506|gb|EAU66481.1| chromosome 14 open reading frame 169, putative [Stigmatella
aurantiaca DW4/3-1]
Length = 355
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 121/246 (49%), Gaps = 29/246 (11%)
Query: 55 MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
++++ A +V+++Y +G ++ + +PL L+ V Y TPP
Sbjct: 29 LNLDNTANINLVYENYLKGSTVILSGLEETWEPLVVFCRKLEGQLSHPVAVAVYLTPPNH 88
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
G PH+D E FILQ++G K WKVY LPR+ Y P ++E + +L
Sbjct: 89 HGVQPHFDTQENFILQVDGVKHWKVYGAGQELPRVEGSYTP------VARERLPELLLET 142
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
L PGD+LY+PRG++H+A + SLH+T+ V+ +T W D LE AL A +
Sbjct: 143 ELHPGDMLYVPRGFVHEAEA-RDSASLHITVDVHVRT-WRDFLE----DALAAMADRNPR 196
Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
FR+ LP GL G Q ++E ++L+ + V L D + + ++L+
Sbjct: 197 FRKSLP------PGLLNGSHAKAQ-----LEEGFRELMEMVHREVRLSDALGKHAEKLI- 244
Query: 290 DALPPV 295
A PP+
Sbjct: 245 VARPPL 250
>gi|375138113|ref|YP_004998762.1| hypothetical protein [Mycobacterium rhodesiae NBB3]
gi|359818734|gb|AEV71547.1| hypothetical protein MycrhN_0917 [Mycobacterium rhodesiae NBB3]
Length = 402
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 121/253 (47%), Gaps = 27/253 (10%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDI---TLYEDG 50
DHW ++PL +SR +++L++ +D+++ R I K D+ + Y
Sbjct: 16 DHWGRKPL-LSRHTALPRDFSDLLSPAAVDELIAERGVRAPFIRMAKTGDVLARSCYTGP 74
Query: 51 QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
++ + V + G +I PL +L + G V AN Y T
Sbjct: 75 AGFGAEMPDQVDSAKVLAEFATGATIVLQGLHRLWPPLIDFVRALTDDLGHPVQANAYIT 134
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT------ 164
PP AQGF PHYD + F+LQ+ G+K+W V+ P V + P S P + T
Sbjct: 135 PPAAQGFEPHYDVHDVFVLQVAGEKRWTVHEP--VHPH-PLDSQPWTDHRDAVTARSRDN 191
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
+ L GD LYLPRG++H A + + S+H+TI V T ++D++ +A+
Sbjct: 192 AAIDTVLNAGDALYLPRGWVHSARS-EDTTSIHLTIGVSALT-YLDVV-----RAVVDTL 244
Query: 225 ATDLEFRRGLPIG 237
ATD EFR+ LP+G
Sbjct: 245 ATDDEFRKSLPMG 257
>gi|84497464|ref|ZP_00996286.1| hypothetical protein JNB_14758 [Janibacter sp. HTCC2649]
gi|84382352|gb|EAP98234.1| hypothetical protein JNB_14758 [Janibacter sp. HTCC2649]
Length = 396
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 137/299 (45%), Gaps = 31/299 (10%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL--RDNVIEFKKNI-DITLYEDGQRQTM 55
D W + P +++R + + +L ++ +D+++ R F + D T DGQ
Sbjct: 9 DVWSRSP-HLARAGRLPRDFADLFSADAVDELVAHRGLRTPFARMARDGTTLPDGQFTLG 67
Query: 56 DVEGRALP-QIVWDH----YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
G + Q+ D + G ++ P+Q A L + G V N Y T
Sbjct: 68 GGVGAGITDQLSEDRILREFAAGATLVLQALHRTWAPIQTFAADLSDDLGHPVQVNAYVT 127
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGT 164
PP QGFA HYD + F+LQ+ G+K+W++ P + P P Q
Sbjct: 128 PPQNQGFADHYDVHDVFVLQIAGEKRWRIRPPVL---EAPLRDQPWEQHRAAVEQAATAE 184
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
P+L TL PGD LYLPRGY+H A+ + S+H+T+ V+ T L A+ + A
Sbjct: 185 PLLEETLRPGDCLYLPRGYLHSATALGGT-SIHLTMGVHTWTR--RHLVDALLAGIGARV 241
Query: 225 ATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEM 283
+ D+E RR LP+ AG + + I D E ++ L+AK + + + + +D +
Sbjct: 242 SDDVEARRSLPL----RAGTGQSSELAIDDD----AEIVRSLVAKALSDLSMGELLDTL 292
>gi|349604191|gb|AEP99810.1| hypothetical protein, partial [Equus caballus]
Length = 229
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 81/146 (55%), Gaps = 2/146 (1%)
Query: 194 HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQ 253
HSLH+T+S YQ+ W D LE +P A+QAA ++EFRRGLP ++ Y G D +
Sbjct: 4 HSLHLTLSTYQRNTWGDFLEAVLPLAVQAAMEENVEFRRGLPRDFMDYMGAQHSDSKDPR 63
Query: 254 ADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMS 313
R A E ++ L+A+L + +D D+ K +HD+LPPVL+ E SV+ +R
Sbjct: 64 --RTAFMEKVRVLVARLGHFAPVDAVADQRAKDFIHDSLPPVLTDRERALSVYGLPIRWE 121
Query: 314 QTGEVYNATQITKDTKVRLVRANAVR 339
V Q+T +T+V +++ R
Sbjct: 122 AGEPVNVGAQLTTETEVHMLQDGIAR 147
>gi|284991701|ref|YP_003410255.1| Cupin 4 family protein [Geodermatophilus obscurus DSM 43160]
gi|284064946|gb|ADB75884.1| Cupin 4 family protein [Geodermatophilus obscurus DSM 43160]
Length = 436
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 72/290 (24%), Positives = 130/290 (44%), Gaps = 34/290 (11%)
Query: 1 DHWEKEPLYI-SRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQ 51
++W + PL + + + +L+ +D++L R + K+ +D +
Sbjct: 31 EYWARRPLLTRAEETGGSFADLLDLAAVDELLSRRGLRTPFLRIAKDGAVVDPKRFTTSG 90
Query: 52 RQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
+V + V + +G ++ PL + L G N Y TP
Sbjct: 91 GAGAEVADQVSSDAVLRLFADGSTVVLQGLHRLWPPLIEFADQLAADLGHPTQVNAYVTP 150
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT------P 165
P+++GF+PHYD + F+LQ+ G+K+W+++ P + D P + P + P
Sbjct: 151 PSSRGFSPHYDVHDVFVLQVAGEKRWRIHEPVLTD---PLRTQPWNERGAAVAAAAEREP 207
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
++ L PGD LYLPRGY+H A T S H+T+ ++ T W E A+ ++
Sbjct: 208 LIDAVLRPGDALYLPRGYLHSA-TALGAISAHLTVGIHSVTRWA-AAESAL-DLVRVLAT 264
Query: 226 TDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVD 275
D + RR LP+G VD+ AD A+ +++ ++ L ++D
Sbjct: 265 EDPQLRRSLPLG------------VDL-ADPAAVADDVATVVTALKGWLD 301
>gi|383780578|ref|YP_005465144.1| hypothetical protein AMIS_54080 [Actinoplanes missouriensis 431]
gi|381373810|dbj|BAL90628.1| hypothetical protein AMIS_54080 [Actinoplanes missouriensis 431]
Length = 401
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 121/281 (43%), Gaps = 29/281 (10%)
Query: 3 WEKEPLYISRKQ----KNYYNNLITSKKIDDIL-----RDNVIEFKKNIDI---TLYEDG 50
W +EPL +SR + + +L++ +D++L R + ++ ++ + G
Sbjct: 20 WGREPL-LSRAADLGGPDGFTDLLSPAAVDELLSRRGLRTPFLRVARDGEVLPPARFTGG 78
Query: 51 QRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
+V + L Y G ++ PL L V N Y T
Sbjct: 79 GGAGAEVTDQVLDDRAMHLYATGATLVLQGLHRLWPPLIAFAGELGAALNRPVQLNAYLT 138
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEIGTPIL 167
PP +QGFA HYD + F+LQ +G K+W V+ P + D E P + + G P L
Sbjct: 139 PPGSQGFATHYDTHDVFVLQADGAKRWCVHPPVLPDPLERQPWGGRADEVAATAAGEPAL 198
Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATD 227
V LEPGD LYLPRG++H A ++ SLH+T V T + A+ + L A + D
Sbjct: 199 DVVLEPGDALYLPRGWLHSARSLGG-RSLHITAGVRGLTRY------ALVEELLALASAD 251
Query: 228 LEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLA 268
R LP GL P I + A L+D LA
Sbjct: 252 PALRATLPY------GLEMSDPDAIAPELTATVAALRDWLA 286
>gi|403727844|ref|ZP_10947824.1| hypothetical protein GORHZ_164_00490 [Gordonia rhizosphera NBRC
16068]
gi|403203776|dbj|GAB92155.1| hypothetical protein GORHZ_164_00490 [Gordonia rhizosphera NBRC
16068]
Length = 399
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 126/276 (45%), Gaps = 42/276 (15%)
Query: 2 HWEKEPLYISRKQK--NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ---RQTM- 55
HW + P+ +SR + +++L++ + +D++L + I + ++G R+
Sbjct: 17 HWGRRPM-LSRSHELPRDFDDLLSGESVDELLTSRGVRIPF---IRMAKEGALVGRECFV 72
Query: 56 -------DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
+V + +V + +G +I P+ + + G V AN Y
Sbjct: 73 GPAGFGAEVADQVDSAMVLAQFADGATIVLQGLHRLWPPVIDFVRDMVDDLGHPVQANAY 132
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEI 162
TPP+ +GF PHYD + F+LQ+ G+K W+V+ P D P S P +
Sbjct: 133 ITPPSNRGFDPHYDVHDVFVLQVTGRKHWRVHEPVYTD---PLPSQPWTDHRAAIVRRAD 189
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
P++ LEPGD LYLPRG+IH A + E S+H+T+ V TA D++ + +
Sbjct: 190 DAPVIDTILEPGDALYLPRGWIHSAQAL-GETSIHLTVGVEPLTAR-DVMAAVLDEL--- 244
Query: 223 AGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLA 258
AT EFR LP+ R P D A +A
Sbjct: 245 --ATVAEFRASLPM---------RADPADAIAGTIA 269
>gi|425735033|ref|ZP_18853349.1| cupin superfamily protein [Brevibacterium casei S18]
gi|425480477|gb|EKU47643.1| cupin superfamily protein [Brevibacterium casei S18]
Length = 402
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/254 (28%), Positives = 123/254 (48%), Gaps = 22/254 (8%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
D+W + L +SR ++ ++L ++ +D++ LR + K+ TL
Sbjct: 35 DYWGRRAL-LSRGVSDF-SDLFSADAVDELISARGLRSPFLRVAKD-GATLPTSSFTSPA 91
Query: 56 DVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
V Q+ +W + +G ++ Q +P+ +++L E G V AN Y TP
Sbjct: 92 GVGATIGDQLDDTRLWRQFHDGATLVLQALQRTWEPIGAFSSALSEELGNPVQANAYITP 151
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG-----TPI 166
P QGF HYD + F+LQ++G K+W ++ P + D L R +E +G P
Sbjct: 152 PQNQGFDDHYDVHDVFVLQIQGTKRWVIHEP-VFDSPL-RDQPWTDRREAVGRAAATEPA 209
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
+ LEPGD LYLPRG++H A+T E S+H+T+ ++ T + + + + + A
Sbjct: 210 IDAVLEPGDALYLPRGWLH-AATAQGEVSIHLTLGIHNWTRFA--VAEQLAQVALAHLQD 266
Query: 227 DLEFRRGLPIGYLR 240
D R LP+G R
Sbjct: 267 DPAMRASLPLGESR 280
>gi|411005423|ref|ZP_11381752.1| transcription factor jumonji jmjC domain-containing protein
[Streptomyces globisporus C-1027]
Length = 409
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 92/186 (49%), Gaps = 7/186 (3%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+W + +G ++ +P+ L + L G V AN Y TPP +GF HYD +
Sbjct: 106 LWRAFADGATLVLQALHRTWEPVADLVSGLSTDLGHPVQANAYVTPPQNRGFDAHYDVHD 165
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPR----YSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
F+LQ+EG K+W V+ P + D + + + + GTP L L PGD+LYLPR
Sbjct: 166 VFVLQIEGAKRWVVHEPVLPDPLRDQPWTDHRAAVADRAAHGTPHLDTMLRPGDVLYLPR 225
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G++H A E S+H+T+ V+ T + L + + +A A D RR LP+
Sbjct: 226 GWLHSAQ-AQGEVSIHLTLGVHAWTRYA--LAEHLTRAALAVLGEDPAMRRSLPLAGPEG 282
Query: 242 AGLARG 247
G A G
Sbjct: 283 PGGADG 288
>gi|302538916|ref|ZP_07291258.1| predicted protein [Streptomyces sp. C]
gi|302447811|gb|EFL19627.1| predicted protein [Streptomyces sp. C]
Length = 394
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 76/242 (31%), Positives = 108/242 (44%), Gaps = 43/242 (17%)
Query: 72 EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
EG ++ F Q PL + L G V N Y TP +QGF HYD ++FI+Q+
Sbjct: 102 EGATLVFQGLQELTGPLAEFGRRLGHDLGRPVNVNAYVTPAGSQGFGDHYDTQDSFIVQI 161
Query: 132 EGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVT 191
G K+W + P + P + G T+TLEPGD L+LPRG++H A + T
Sbjct: 162 HGSKRWTLKDPALAQPLSHETGRPLPEDDGSGR---TLTLEPGDCLWLPRGWVHSARS-T 217
Query: 192 NEHSLHVTISVYQKT---AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGK 248
+ S+H+TIS+Y+ T AW + AA A L R L + R
Sbjct: 218 DTASVHLTISLYEWTGHWAWTRI----------AARAPGLPGRFPLSTDFFR-------- 259
Query: 249 PVDIQADRLAMKENLKDLLAKLVEYVDLDD-----------GVDEMGKQLMHDALPPVLS 297
DR A ++++ L A+L E++ D G E + H A VLS
Sbjct: 260 ------DRAAAEKDMAALRAELTEWLATADDSALVDLVRAAGAPEFPSPVRHPAR-EVLS 312
Query: 298 PE 299
PE
Sbjct: 313 PE 314
>gi|239986030|ref|ZP_04706694.1| putative cupin superfamily protein [Streptomyces roseosporus NRRL
11379]
gi|291442971|ref|ZP_06582361.1| cupin 4 family protein [Streptomyces roseosporus NRRL 15998]
gi|291345918|gb|EFE72822.1| cupin 4 family protein [Streptomyces roseosporus NRRL 15998]
Length = 407
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 89/176 (50%), Gaps = 7/176 (3%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+W + +G ++ +P+ L + L G V AN Y TPP +GF HYD +
Sbjct: 104 LWRAFTDGATLVLQALHRTWEPVADLVSGLSTELGHPVQANAYVTPPQNRGFDAHYDVHD 163
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPR----YSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
F+LQ+EG K+W ++ P + D + + + + GTP L L PGD+LYLPR
Sbjct: 164 VFVLQIEGAKRWVIHEPVLPDPLRDQPWTDHRAAVADRAAHGTPHLDTMLRPGDVLYLPR 223
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
G++H A E S+H+T+ V+ T + L + + +A A D RR LP+
Sbjct: 224 GWLHSAQ-AQGEVSIHLTLGVHAWTRYA--LAEQLTRAALATLGDDPSMRRSLPLA 276
>gi|319951046|ref|ZP_08024906.1| cupin 4 family protein [Dietzia cinnamea P4]
gi|319435286|gb|EFV90546.1| cupin 4 family protein [Dietzia cinnamea P4]
Length = 403
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 109/250 (43%), Gaps = 25/250 (10%)
Query: 3 WEKEPLYISRKQ-KNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQRQ 53
W + PL Q +++L++ +D+I+ R + + +D +
Sbjct: 18 WGRRPLLTRSAQLPRDFSDLLSPAMVDEIIAERGVRTPFVRMAREGSLVDRACFTRAAGF 77
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
+ + P + + G +I + P+ + G V N Y TPP
Sbjct: 78 GAQIADQIDPDGMLTQFAAGATIVLQGLHRFWPPIIDFVQGMNADVGHPVQTNAYITPPA 137
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPIL 167
+GF PHYD + F+LQ+ G K+W+V+ P + P P S P++
Sbjct: 138 NRGFDPHYDVHDVFVLQVSGAKRWRVHEPV---HHHPLPDQPWTDHREAISARAADEPVI 194
Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATD 227
LEPGD LYLPRG+IH A + E S+H+T+ V T +D+ +A+ A+D
Sbjct: 195 DEVLEPGDCLYLPRGWIHSAEAL-GETSIHLTVGVAPFTG-VDIA-----RAVLDGLASD 247
Query: 228 LEFRRGLPIG 237
E R LP+G
Sbjct: 248 EELRASLPLG 257
>gi|149060277|gb|EDM10991.1| myc induced nuclear antigen, isoform CRA_b [Rattus norvegicus]
Length = 242
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 85/146 (58%), Gaps = 5/146 (3%)
Query: 1 DHWEKEPLYISRKQKN---YYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R + YY +L + + + + + +++++ G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPSLAAYYQSLFSLSDLRSLCSQGLY-YGRDVNVCRCIGGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+A Q+ D ++ +I+F PQ + L ++ L+ FG VG+N Y TP +QG
Sbjct: 116 DGKAQFLQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYMTPAGSQG 175
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLP 142
PHYDD+E FILQLEG+K+W++Y P
Sbjct: 176 LPPHYDDVEVFILQLEGRKRWRLYSP 201
>gi|315506227|ref|YP_004085114.1| transcription factor jumonji jmjc domain-containing protein
[Micromonospora sp. L5]
gi|315412846|gb|ADU10963.1| transcription factor jumonji jmjC domain-containing protein
[Micromonospora sp. L5]
Length = 425
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 113/252 (44%), Gaps = 23/252 (9%)
Query: 1 DHWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYE 48
DHW + PL +SR + + + +L++ D++L R + K+ + +
Sbjct: 39 DHWGRSPL-LSRAAELPNADGFTDLLSPADADELLSRRGLRTPFLRVAKDGQLVPAARFT 97
Query: 49 DGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
G ++ + L + V Y +G ++ L L + N Y
Sbjct: 98 GGGGAGAEIGDQVLDEKVLALYADGATLVLQGLHRTWPALVDFARDLGTALAQPLQVNAY 157
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEIGTP 165
TP +QGFA HYD + F+LQ++G+K W+++ P + D E P + S G P
Sbjct: 158 LTPAGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLADPLERQPWGGRADEVSATAEGRP 217
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
L V LEPGD LYLPRG++H A SLH+T+ + T + A+ + L A A
Sbjct: 218 ELDVVLEPGDALYLPRGWLHSAQ-AQESSSLHLTVGIRALTRY------AVVEELLALAA 270
Query: 226 TDLEFRRGLPIG 237
D R LP G
Sbjct: 271 EDARLRASLPFG 282
>gi|379735634|ref|YP_005329140.1| hypothetical protein BLASA_2203 [Blastococcus saxobsidens DD2]
gi|378783441|emb|CCG03109.1| Conserved protein of unknown function; putative cupin and
clavaminate synthase domains [Blastococcus saxobsidens
DD2]
Length = 439
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 110/248 (44%), Gaps = 15/248 (6%)
Query: 2 HWEKEPLYI-SRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQR 52
+W + PL + N + +L+ +D++L R + K+ +D +
Sbjct: 24 YWARRPLLTRAEDTGNSFTDLLDLPAVDELLSRRGLRTPFLRIAKDGAVVDAKRFTTSGG 83
Query: 53 QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
++ + V + +G ++ PL + L G N Y TPP
Sbjct: 84 AGAEIADQVSSDAVLRLFADGSTVVLQGLHRLWPPLIEFADQLAADLGHPTQVNAYITPP 143
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY---SSPNFSQEEIGTPILTV 169
+++GF+PHYD + F+LQ+ G+K W ++ P + D + S + G P++
Sbjct: 144 SSRGFSPHYDVHDVFVLQVAGEKHWTIHEPVLPDPLRTQVWTDRSAEVAAAAEGEPVIDA 203
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
L PGD LYLPRGY+H A + S H+T+ ++ T W +++ ++ A D
Sbjct: 204 VLRPGDALYLPRGYLHSAKAL-GAISAHLTVGIHSVTRWA--AAESVLDLVRTLAAEDRS 260
Query: 230 FRRGLPIG 237
R LP+G
Sbjct: 261 LRGSLPLG 268
>gi|167578420|ref|ZP_02371294.1| hypothetical protein BthaT_09788 [Burkholderia thailandensis TXDOH]
Length = 397
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 73/244 (29%), Positives = 117/244 (47%), Gaps = 21/244 (8%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDIT--LYEDGQRQTMDVEG 59
+W ++PL + R+ + Y L S++ +L + I + + EG
Sbjct: 22 YWGRKPLIVRRQAPHLYACLPDSEEFAFLLHSLTDPERGWFSIVNGVARPPSDSLLTQEG 81
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCF-------VGANTYQTPP 112
V+ YR+G S+ Q + L ++ +GAN Y +PP
Sbjct: 82 LLNLSEVYAAYRDGNSLLMNQVQRRHRETAMLCRRIESALSAHGIALARHIGANGYLSPP 141
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
++QGF HYD + ILQ+EG+K W++Y R V + + + EE G+P L
Sbjct: 142 SSQGFNIHYDPHDVLILQIEGRKHWRLY-GRHV-AWPTQPPAAPIPPEEAGSPRREFVLS 199
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
PG+L+Y+PRG +H A+T T+ SLH+T+S+ T W DLL +AM + + FRR
Sbjct: 200 PGELVYIPRGVLHDANT-TDSRSLHLTLSIETLT-WTDLLIEAM--------SDNPAFRR 249
Query: 233 GLPI 236
LP+
Sbjct: 250 NLPV 253
>gi|119484518|ref|ZP_01619135.1| hypothetical protein L8106_02332 [Lyngbya sp. PCC 8106]
gi|119457992|gb|EAW39115.1| hypothetical protein L8106_02332 [Lyngbya sp. PCC 8106]
Length = 386
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 136/284 (47%), Gaps = 39/284 (13%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W K+ L IS + +++L + + + +L + + + +I L + GQ
Sbjct: 20 NWLKQALIISGYSPHKFSHLFSWQDFNTLLNFHHLTYP---EIRLAKSGQ---------T 67
Query: 62 LPQIVWDHY----REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
LP+ +D+ ++G ++ + QT + + + A+L+ G N Y + P +QGF
Sbjct: 68 LPENAYDNLIKSCQDGATVIIDSLQTRLPVIAEFTANLRNELGHRTQINAYCSFPGSQGF 127
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
A HYD E FILQ+ G K W+V+ P E+ N P + L+PGDLL
Sbjct: 128 ACHYDSHEVFILQISGDKHWRVFSPTF--EFPLSKHRSNLLDPPTTDPYINQVLKPGDLL 185
Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
Y+PRG+ H A V ++ SLH+T+ V +T ID +E + + LQ +R+ LP+
Sbjct: 186 YIPRGHWHYAVAV-DQPSLHLTLGVDCQTG-IDFVE-WLTEELQENPL----WRQSLPLL 238
Query: 238 YLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD 281
R A ++L+DL+ K + ++ D+ +D
Sbjct: 239 N--------------STHRQACSQHLRDLIQKWIGQLETDEFID 268
>gi|391340602|ref|XP_003744628.1| PREDICTED: MYC-induced nuclear antigen-like [Metaseiulus
occidentalis]
Length = 509
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 141/332 (42%), Gaps = 46/332 (13%)
Query: 1 DHWEKEPLYISRKQKN----------YYNNLIT-SKKID-DILRDNVIEFKKNIDITLYE 48
+HW +PL+I + + + +++ S+K D +++ D F K +D
Sbjct: 79 NHWGSKPLHIEAPEASEELSKIFSLEVFEEIVSRSEKGDLNLIADRDFTFSKIVD----- 133
Query: 49 DGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
G RQ +++E +V +HY+E +I+ PQ + L L+ G V + +
Sbjct: 134 -GDRQDLEIE-TLTKDLVKNHYKEKYAIQINQPQRFQDCLWSFIEKLECHLGSLVSSQVF 191
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPI 166
P + YDD E F+LQL G WK++ + MV E+ +E +G I
Sbjct: 192 LVPGDTKNQQRRYDDTEKFVLQLSGSSSWKLHELKGEMVCEF-----DDALKEENVGDQI 246
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
+ L+ GD+LY P+ I ++ VT+ H+ S +Q +LL + + ++
Sbjct: 247 MAANLKAGDVLYFPKTTIFESEGVTD--GAHIVFSTFQGMHRANLLLQTIIATIEEQVPV 304
Query: 227 DLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDG----VDE 282
E R+GLP YL A + D L KL E V D
Sbjct: 305 CEELRKGLPFHYLPRANSISNA-------------DYADTLVKLAESVRSRSKPLPPADG 351
Query: 283 MGKQLMHDALPPVLSPE-ELQCSVFENGLRMS 313
M ++ M + LPP P+ EL+ + ++S
Sbjct: 352 MIQEFMTNRLPPYGQPKFELESGSITSSSKIS 383
>gi|384245294|gb|EIE18789.1| hypothetical protein COCSUDRAFT_20456, partial [Coccomyxa
subellipsoidea C-169]
Length = 183
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 66/103 (64%), Gaps = 5/103 (4%)
Query: 38 FKKNIDITL--YEDGQRQTMDVEGRALPQIVWDH-YREGCSIRFLNPQTYIKPLQQLNAS 94
K+ +DI ++DGQRQT+ E ++ + W Y+EG S++ PQ + L +L +S
Sbjct: 2 LKQGLDINFVRFKDGQRQTLQNEEGSIER--WQQLYKEGWSLQVHQPQRFCDALTRLCSS 59
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKW 137
L+ GC VG+N Y TP AQG APH+DD+E F++Q EG+K+W
Sbjct: 60 LENQLGCLVGSNAYLTPAGAQGLAPHHDDVEIFVVQTEGRKRW 102
>gi|392413955|ref|YP_006450560.1| hypothetical protein Mycch_0053 [Mycobacterium chubuense NBB4]
gi|390613731|gb|AFM14881.1| hypothetical protein Mycch_0053 [Mycobacterium chubuense NBB4]
Length = 404
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 122/255 (47%), Gaps = 33/255 (12%)
Query: 2 HWEKEPLYISRKQK--NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ---RQT-- 54
HW ++PL +SR +++L++ +D++L + + + I + +DGQ R+
Sbjct: 17 HWGRQPL-LSRSDALPRDFSDLLSPDTVDELLAERGV---RAPFIRVAKDGQLVGRECYL 72
Query: 55 ------MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
++ + V + + G SI PL ++ + G V AN Y
Sbjct: 73 GPAGFGAEIGDQVDSAKVLEQFAAGASIALQGLHRLWPPLIDFVRAMVDDLGHPVQANAY 132
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEI 162
TPPT++GF HYD + F+LQ G K W V+ P D P S P ++
Sbjct: 133 ITPPTSRGFDAHYDVHDVFVLQAAGHKHWTVHEPVHRD---PLASQPWTEHRAAIARRVR 189
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
+P++ L PGD LYLPRG++H A + + S+H+T+ V T +D++ +A+ L
Sbjct: 190 DSPVIDTVLGPGDALYLPRGWVHSAQAL-DTLSIHLTVGVSPLTG-MDVV-RAVADHLAT 246
Query: 223 AGATDLEFRRGLPIG 237
G +FR LP+G
Sbjct: 247 VG----DFRASLPMG 257
>gi|145594683|ref|YP_001158980.1| cupin 4 family protein [Salinispora tropica CNB-440]
gi|145304020|gb|ABP54602.1| Cupin 4 family protein [Salinispora tropica CNB-440]
Length = 496
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 114/252 (45%), Gaps = 25/252 (9%)
Query: 2 HWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYED 49
HW + PL +SR + + + +L++ DD+L R + ++ + Y
Sbjct: 16 HWGQTPL-LSRAHELPNPSGFRDLLSPADADDLLSRRGLRTPFLRVAQDGVLVPAARYTG 74
Query: 50 GQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
G ++ + L + + D Y G ++ L L G + N Y
Sbjct: 75 GGGAGAEITDQVLDEKILDLYAGGATLVLQGLHRTWPALIDFARDLGLAVGQPLQVNAYL 134
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI----GTP 165
TP +QGFA HYD + F+LQ++G K W+++ P + D L R + E + G P
Sbjct: 135 TPAGSQGFATHYDTHDVFVLQVDGGKHWRIHPPVLPDP-LERQPWGGRADEVVATATGAP 193
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
L V L PGD LYLPRG++H A+ SLH+T+ V T + L+E+ L A A
Sbjct: 194 ALDVLLAPGDALYLPRGWLHSAA-AQERSSLHLTVGVRALTRYT-LVEE-----LLALAA 246
Query: 226 TDLEFRRGLPIG 237
D R LP G
Sbjct: 247 EDQRLRATLPFG 258
>gi|441510904|ref|ZP_20992803.1| hypothetical protein GOACH_30_00100 [Gordonia aichiensis NBRC
108223]
gi|441444967|dbj|GAC50764.1| hypothetical protein GOACH_30_00100 [Gordonia aichiensis NBRC
108223]
Length = 416
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 116/254 (45%), Gaps = 29/254 (11%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQ 53
+HW ++PL +SR +++L++ +D+++ R I K + + +D
Sbjct: 19 EHWGRQPL-LSRSAALPRDFDDLLSPHSVDELIAERGVRAPFIRMAKEGGL-IGKDSYLG 76
Query: 54 TMDVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
+ Q+ V + G +I P+ + + G V AN Y
Sbjct: 77 SAGFGAEMPDQVDSAKVLTQFATGATIVLQGLHRLWPPVIDFVRGMVDDLGHPVQANAYI 136
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIG 163
TPP +GF PHYD + F+LQ+ G+K W+V+ P + P + P +
Sbjct: 137 TPPANRGFDPHYDVHDVFVLQISGRKHWRVHAPVHDN---PLSTQPWTDHREAIAARSAD 193
Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
+P++ LEPGD LYLPRG+IH A + + S+H+T+ V TA+ DL+ L
Sbjct: 194 SPVIDTLLEPGDALYLPRGWIHSAEAL-GDTSIHLTVGVSAVTAY-DLVR----AVLDQL 247
Query: 224 GATDLEFRRGLPIG 237
G D R LP+G
Sbjct: 248 GTVD-ALRASLPMG 260
>gi|302867368|ref|YP_003836005.1| transcription factor jumonji jmjC domain-containing protein
[Micromonospora aurantiaca ATCC 27029]
gi|302570227|gb|ADL46429.1| transcription factor jumonji jmjC domain-containing protein
[Micromonospora aurantiaca ATCC 27029]
Length = 425
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 112/252 (44%), Gaps = 23/252 (9%)
Query: 1 DHWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYE 48
DHW + PL +SR + + + +L + D++L R + K+ + +
Sbjct: 39 DHWGRSPL-LSRAAELPNPDGFTDLFSPADADELLSRRGLRTPFLRVAKDGQLVPAARFT 97
Query: 49 DGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
G ++ + L + V Y +G ++ + L + N Y
Sbjct: 98 GGGGAGAEIGDQVLDEKVLALYADGATLVLQGLHRTWPAIVDFARDLGTALAQPLQVNAY 157
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEIGTP 165
TP +QGFA HYD + F+LQ++G+K W+++ P + D E P + S G P
Sbjct: 158 LTPAGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLADPLERQPWGGRADEVSATAEGRP 217
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
L V LEPGD LYLPRG++H A SLH+T+ + T + A+ + L A A
Sbjct: 218 ELDVVLEPGDALYLPRGWLHSAQ-AQESSSLHLTVGIRALTRY------AVVEELLALAA 270
Query: 226 TDLEFRRGLPIG 237
D R LP G
Sbjct: 271 EDARLRASLPFG 282
>gi|126438193|ref|YP_001073884.1| cupin 4 family protein [Mycobacterium sp. JLS]
gi|126237993|gb|ABO01394.1| Cupin 4 family protein [Mycobacterium sp. JLS]
Length = 404
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 133/289 (46%), Gaps = 42/289 (14%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQ 53
++W + PL +SR + +L++ +D+++ R I K D+ L +D
Sbjct: 16 EYWGRRPL-LSRSGALPRDFADLLSPGMVDELIAERGVRAPFIRLAKEGDV-LAKDCYLG 73
Query: 54 TMDVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
Q+ V + G +I P+ L L + G V AN Y
Sbjct: 74 PAGFGAEMPDQVDSAKVLTQFSAGATIVMQGLHRLWPPVIDLVRHLVDDLGHPVQANAYI 133
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIG 163
TPP+ +GF PHYD + F+LQ G+K+W V+ P V + P S P ++ G
Sbjct: 134 TPPSNRGFDPHYDVHDVFVLQTAGEKRWVVHEP--VHPH-PLPSQPWTQHRDAIAERAAG 190
Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
P++ L PGD LYLPRG++H A + + S+H+TI V T +D + +A+ AL +
Sbjct: 191 EPVIDTVLAPGDALYLPRGWVHSAHAL-DTTSIHLTIGVSAVTG-VD-VARAVVDALADS 247
Query: 224 GATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE 272
A FR LP+G G P ADR + + ++A++VE
Sbjct: 248 AA----FRAPLPMG---------GDP----ADRDEIIAAVTKVMAQMVE 279
>gi|409356670|ref|ZP_11235057.1| cupin 4 family protein [Dietzia alimentaria 72]
Length = 401
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/252 (26%), Positives = 110/252 (43%), Gaps = 25/252 (9%)
Query: 1 DHWEKEPLYI-SRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQ 51
D+W + PL S + +++L++S+ +D+I+ R + + +D +
Sbjct: 16 DYWGRRPLLTRSEELPRDFSDLLSSEMVDEIIAERGVRTPFVRMAREGTLVDRACFTRAG 75
Query: 52 RQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
+ + P V + G +I P+ + G V N Y TP
Sbjct: 76 GFGAQMPDQIDPDGVLTQFAAGATIVLQGLHRLWPPIINFVRGMTSDVGHPVQTNAYITP 135
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE------IGTP 165
P +GF PHYD + F+LQ+ G K+W+V+ P V + P P E P
Sbjct: 136 PANRGFDPHYDAHDVFVLQVAGTKRWRVHEP--VHRH-PLSDQPWTDHREAIAARAASEP 192
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
++ LEPGD LYLPRG++H A + S+H+T+ V T + +A A A
Sbjct: 193 VIDAVLEPGDCLYLPRGWVHSAE-AQGKTSIHLTVGVAPFTG------HDVARAALEALA 245
Query: 226 TDLEFRRGLPIG 237
+ E R LP+G
Sbjct: 246 SQEELRASLPLG 257
>gi|363421145|ref|ZP_09309234.1| cupin superfamily protein [Rhodococcus pyridinivorans AK37]
gi|359734880|gb|EHK83848.1| cupin superfamily protein [Rhodococcus pyridinivorans AK37]
Length = 410
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 112/251 (44%), Gaps = 26/251 (10%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTMDV 57
W + PL R + +L + + +D++ LR + K D T D +
Sbjct: 46 WGRAPLLTRRA--GTFADLFSVEAVDELISRRGLRTPFLRVAK--DGTTLPDSSFTSPGG 101
Query: 58 EGRALPQ-----IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
G + ++W + +G ++ +P+ Q +L + G V N Y TPP
Sbjct: 102 VGATISDQLDDTMLWRNLADGATLVLQALHRTWEPISQFGTALSDELGHPVQVNAYITPP 161
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPI 166
QGF+ HYD + F++Q+EG K+W ++ P P + P ++
Sbjct: 162 RNQGFSHHYDVHDVFVVQIEGTKRWVIHEPV---HPAPLRNQPWTDHRAAVARAATEPAC 218
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
+ LEPGD LYLPRG+IH A+ E S+H+T+ ++ T L E AL A G
Sbjct: 219 IDTVLEPGDCLYLPRGWIH-AAEARGEISIHLTVGIHTWTRHA-LAEHLTRAALTALG-D 275
Query: 227 DLEFRRGLPIG 237
D E R LP+G
Sbjct: 276 DPEMRGALPMG 286
>gi|444509721|gb|ELV09407.1| MYC-induced nuclear antigen [Tupaia chinensis]
Length = 398
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 132/304 (43%), Gaps = 70/304 (23%)
Query: 1 DHWEKEPLYISRKQ---KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
+ WE++PL I R YY +L + + + + +++++ +G+++ ++
Sbjct: 57 EFWEQKPLLIQRDDPALAAYYQSLFRLTDLKSLCSRGMY-YGRDVNVCRCINGKKKVLNK 115
Query: 58 EGRA-LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
+G+A Q+ D ++ +I+F PQ +
Sbjct: 116 DGKAHFLQLRKDFDQKRATIQFHQPQRF-------------------------------- 143
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
+ FILQLEG+K W++Y P + L R S +++ IG P TL+PGDL
Sbjct: 144 --------KVFILQLEGEKHWRLYHPTVP---LAREYSVE-AEDRIGRPAHEFTLKPGDL 191
Query: 177 LYLPRGYIHQASTVTN-EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LY PRG IHQA T HS HVTIS YQ ++W D+L + + D+ R G+P
Sbjct: 192 LYFPRGTIHQADTPPGLAHSTHVTISTYQNSSWGDILLDTISGLVFDTAKEDVALRAGIP 251
Query: 236 -----IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHD 290
+ + +A + ADRL E K+LL+ +M K +
Sbjct: 252 RQLLLVETIPFATRKLSGFLRTLADRL---EGTKELLS------------SDMKKDFIMH 296
Query: 291 ALPP 294
LPP
Sbjct: 297 RLPP 300
>gi|332709429|ref|ZP_08429390.1| hypothetical protein LYNGBM3L_40470 [Moorea producens 3L]
gi|332351688|gb|EGJ31267.1| hypothetical protein LYNGBM3L_40470 [Moorea producens 3L]
Length = 385
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 153/325 (47%), Gaps = 31/325 (9%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW + L+I + + + +L + +K+ D+L N +FK ++ L DG+ + E
Sbjct: 20 HWTSKALFIPSEDERKFADLFSWQKLTDLL--NFHQFKYP-ELRLALDGK---VLEEKEN 73
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
I W + G ++ + + + +++ G N+Y + P+ QGF+ HY
Sbjct: 74 ANLIKW--CQSGATLIIDKVHKLVPEITNFTSEVRQDLGYSAQVNSYCSWPSKQGFSCHY 131
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
D E FILQ+EG K+W V+ + + S+ E G P L TL PGD+LY+PR
Sbjct: 132 DTHEVFILQIEGNKEWYVFSDTLKYPLTEQKSASETPPE--GDPYLKCTLTPGDVLYIPR 189
Query: 182 GYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
G+ H A + ++ SLH+T+ ++ +T +D L+ + + Q +R+ LP+ +
Sbjct: 190 GHWHYAVAL-DQPSLHLTLGIHCQTG-VDFLDWLVGELRQNEA-----WRKSLPLRF--- 239
Query: 242 AGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSPEEL 301
+P+D D L +N K+ L + +D + K L +L P +
Sbjct: 240 ----DAEPLDQSIDNLI--QNFKEYLTNSDIQERYNRYLDGLVKPLARYSL-----PAQA 288
Query: 302 QCSVFENGLRMSQTGEVYNATQITK 326
++F NG++ + + +I++
Sbjct: 289 GFNIFPNGVKTRFSNPKFKRLKISQ 313
>gi|403525442|ref|YP_006660329.1| cupin superfamily protein [Arthrobacter sp. Rue61a]
gi|403227869|gb|AFR27291.1| putative cupin superfamily protein [Arthrobacter sp. Rue61a]
Length = 388
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 91/178 (51%), Gaps = 12/178 (6%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+W + +G ++ KP+ + L G V AN Y TPP +GF HYD +
Sbjct: 93 LWRKFADGATLVLQALHRTWKPVSSFSTQLSTELGHPVQANAYITPPQNRGFDDHYDVHD 152
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYL 179
F+LQ+EG K+W ++ P VD P S P ++ G + LEPGD+LYL
Sbjct: 153 VFVLQIEGTKRWIIHEPVHVD---PLRSQPWTDRRSAVAEAAQGKAYIDTVLEPGDVLYL 209
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
PRG++H A+ + S+H+T+ V+ T L + + +A AA D E RR LP+G
Sbjct: 210 PRGWLH-AAEAQGKVSIHLTLGVHSWTR--HALAEHLAQAALAALCDDPEVRRSLPLG 264
>gi|330468574|ref|YP_004406317.1| transcription factor jumonji jmjC domain-containing protein
[Verrucosispora maris AB-18-032]
gi|328811545|gb|AEB45717.1| transcription factor jumonji jmjC domain-containing protein
[Verrucosispora maris AB-18-032]
Length = 420
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 112/251 (44%), Gaps = 23/251 (9%)
Query: 2 HWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYED 49
HW + PL +SR + + +L++ D++L R + K+ + Y
Sbjct: 36 HWGQTPL-LSRAAELPNPAGFTDLLSPDDADELLSRRGLRTPFLRVAKDGQVLPAARYTG 94
Query: 50 GQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
G ++ + L + + Y G ++ PL L G + N Y
Sbjct: 95 GGGAGAEIGDQVLDEKILQLYAGGATLVLQGLHRNWPPLVDFTRELSLAVGQPLQVNAYL 154
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEIGTPI 166
TPP +QGFA HYD + F+LQ++G+K W+++ P + D E P + + +G
Sbjct: 155 TPPGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLPDPLERQPWGGRADEVTATALGPAA 214
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
L V LEPGD LYLPRG++H A SLH+T+ + T + A+ + L A
Sbjct: 215 LDVVLEPGDALYLPRGWLHSAQ-AQESSSLHLTVGIRALTRY------ALVEELLALATE 267
Query: 227 DLEFRRGLPIG 237
D R LP G
Sbjct: 268 DRRLRATLPFG 278
>gi|108802210|ref|YP_642407.1| cupin 4 [Mycobacterium sp. MCS]
gi|119871363|ref|YP_941315.1| cupin 4 family protein [Mycobacterium sp. KMS]
gi|108772629|gb|ABG11351.1| Cupin 4 [Mycobacterium sp. MCS]
gi|119697452|gb|ABL94525.1| Cupin 4 family protein [Mycobacterium sp. KMS]
Length = 404
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 133/289 (46%), Gaps = 42/289 (14%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQ 53
++W + PL +SR + +L++ +D+++ R I K D+ L +D
Sbjct: 16 EYWGRRPL-LSRSGALPRDFADLLSPGMVDELIAERGVRAPFIRLAKEGDV-LAKDCYLG 73
Query: 54 TMDVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
Q+ V + G +I P+ L L + G V AN Y
Sbjct: 74 PAGFGAEMPDQVDSAKVLTQFSAGATIVLQGLHRLWPPVIDLVRHLVDDLGHPVQANAYI 133
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIG 163
TPP+ +GF PHYD + F+LQ G+K+W V+ P V + P S P ++ G
Sbjct: 134 TPPSNRGFDPHYDVHDVFLLQTAGEKRWVVHEP--VHPH-PLPSQPWTQHRDAIAERAAG 190
Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
P++ L PGD LYLPRG++H A + + S+H+TI V T +D + +A+ AL +
Sbjct: 191 EPVIDTVLAPGDALYLPRGWVHSAHAL-DTTSIHLTIGVSAVTG-VD-VARAVVDALADS 247
Query: 224 GATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE 272
A FR LP+G G P ADR + + ++A++VE
Sbjct: 248 AA----FRAPLPMG---------GDP----ADRDEIIAAVTKVMAQMVE 279
>gi|377566996|ref|ZP_09796242.1| hypothetical protein GOSPT_129_00550 [Gordonia sputi NBRC 100414]
gi|377525844|dbj|GAB41407.1| hypothetical protein GOSPT_129_00550 [Gordonia sputi NBRC 100414]
Length = 411
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 35/257 (13%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYED---- 49
DHW + PL +SR +++L++ + +D+++ R I K + E+
Sbjct: 19 DHWGRRPL-LSRSDALPRDFDDLLSPRSVDELIAERGVRAPFIRMAKEGGLVGKENYLGP 77
Query: 50 ---GQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN 106
G V+ V + G +I P+ + + G V AN
Sbjct: 78 AGFGAEMPDQVDSAK----VLAQFASGATIVLQGLHRLWPPIIDFVRGMVDDLGHPVQAN 133
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP------RYSSPNFSQE 160
Y TPP +GF PHYD + F+LQ+ G K W+V+ P D R + S E
Sbjct: 134 AYITPPANRGFDPHYDVHDVFVLQISGSKHWRVHSPVHTDPLADQPWTDHRDAIEARSHE 193
Query: 161 EIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKAL 220
E P++ LEPGD LYLPRG++H A + + S+H+T+ V TA+ DL+ + +
Sbjct: 194 E---PVIDTVLEPGDALYLPRGWLHSAEALGHT-SIHLTVGVSAVTAF-DLVRTVVDQL- 247
Query: 221 QAAGATDLEFRRGLPIG 237
T R LP+G
Sbjct: 248 ----GTVESLRTALPMG 260
>gi|374372801|ref|ZP_09630462.1| Cupin, JmjC-type [Niabella soli DSM 19437]
gi|373234877|gb|EHP54669.1| Cupin, JmjC-type [Niabella soli DSM 19437]
Length = 396
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/340 (25%), Positives = 145/340 (42%), Gaps = 50/340 (14%)
Query: 4 EKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-QTMDVEGRAL 62
EK+ L+I R +YY +++T+ ++ L I + + + DG+ + + + +
Sbjct: 23 EKDLLHIQRNNTDYYKDVLTASELSAFLDRQDIFYPS---LRIVRDGKEIPSGEYTLKGV 79
Query: 63 P------------QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
P + + + +G ++ Q Y L L F V AN Y T
Sbjct: 80 PIGHHKKDGIVNTEKAFALFNDGATLVVQAGQRYFDHLSACCMELSRRFNAPVQANLYLT 139
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P +QGF PH+D + F+LQL G K W +Y ++ LP S S + P+ T+
Sbjct: 140 PNKSQGFNPHWDTHDVFVLQLSGTKTWHLY---GFEKELPTKSQGFVSNDYSKQPLQTIQ 196
Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEF 230
L PGD LY+PRGY+H A + S H+TI + T W+ + L + F
Sbjct: 197 LAPGDFLYVPRGYVHDA-VADDGISAHITIGILSFT-WVRYFAE-----LFGQLENEKTF 249
Query: 231 RRGLPIGYLRYAGLARGKPVDIQ--ADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLM 288
R +P + + K ++ D+L ++ LK L A+ +++ Q +
Sbjct: 250 REAIPFWRSDIESIVKEKTALLKNTIDQLDAEDVLKKLTAQY----------EKIQPQAL 299
Query: 289 HDALPPVLSPEELQCS---------VFENGLRMSQTGEVY 319
H+ +L ++L FE G+ Q G+VY
Sbjct: 300 HNYFGSLLQLDKLHEDSVIRLNREVFFERGM---QNGQVY 336
>gi|357415101|ref|YP_004926837.1| transcription factor jumonji jmjC domain-containing protein
[Streptomyces flavogriseus ATCC 33331]
gi|320012470|gb|ADW07320.1| transcription factor jumonji jmjC domain-containing protein
[Streptomyces flavogriseus ATCC 33331]
Length = 402
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 88/179 (49%), Gaps = 13/179 (7%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+W + +G ++ +P+ L + L G V AN Y TPP +GF HYD +
Sbjct: 106 LWRAFADGATLVLQALHRTWEPVAGLVSELSTELGHPVQANAYVTPPQNRGFDAHYDVHD 165
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI-------GTPILTVTLEPGDLLY 178
F+LQ+EG K+W V+ P + D P P + T L L PGD+LY
Sbjct: 166 VFVLQIEGTKRWIVHEPVLPD---PLRDQPWTDHRQAVADAAARSTAHLDTVLGPGDVLY 222
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
LPRG++H A E S+H+T+ V+ T + L + + +A AA D RR LP+G
Sbjct: 223 LPRGWLHSAR-AQGEVSIHLTLGVHTWTRYA--LAEQLTRAALAALRDDPPMRRSLPLG 278
>gi|291449903|ref|ZP_06589293.1| cupin 4 family protein [Streptomyces albus J1074]
gi|291352852|gb|EFE79754.1| cupin 4 family protein [Streptomyces albus J1074]
Length = 406
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 105/218 (48%), Gaps = 16/218 (7%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
D W + L ++R+ ++ ++L ++ +D++ LR + +N TL +
Sbjct: 34 DFWARTAL-LNRRAGDF-SDLFSAAAVDELIARRGLRTPFLRVARN-GATLPDTAYTSPA 90
Query: 56 DVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
V Q+ +W + +G ++ +P+ + A L G V AN Y TP
Sbjct: 91 GVGATIADQLDDTALWREFADGATLVLQALHRTWQPVAEFCAGLGTELGHPVQANAYVTP 150
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS---PNFSQEEIGTPILT 168
P +GF HYD + F+LQ+EG K+W V+ P D + + P ++ P L
Sbjct: 151 PQNRGFDDHYDVHDVFVLQIEGTKRWLVHRPVHPDPLRDQPWTDRRPAVAEAAASEPYLD 210
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
V L PGD+LYLPRG++H A+ S+H+T+ ++ T
Sbjct: 211 VVLAPGDVLYLPRGWLH-AAEAQGAVSVHLTLGIHTWT 247
>gi|421741680|ref|ZP_16179863.1| hypothetical protein SM8_03506 [Streptomyces sp. SM8]
gi|406689902|gb|EKC93740.1| hypothetical protein SM8_03506 [Streptomyces sp. SM8]
Length = 406
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 105/218 (48%), Gaps = 16/218 (7%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
D W + L ++R+ ++ ++L ++ +D++ LR + +N TL +
Sbjct: 34 DFWARTAL-LNRRAGDF-SDLFSAAAVDELIARRGLRTPFLRVARN-GATLPDTAYTSPA 90
Query: 56 DVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
V Q+ +W + +G ++ +P+ + A L G V AN Y TP
Sbjct: 91 GVGATIADQLDDTALWREFADGATLVLQALHRTWQPVAEFCAGLGTELGHPVQANAYVTP 150
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS---PNFSQEEIGTPILT 168
P +GF HYD + F+LQ+EG K+W V+ P D + + P ++ P L
Sbjct: 151 PQNRGFDDHYDVHDVFVLQIEGTKRWLVHRPVHPDPLRDQPWTDRRPAVAEAAASEPYLD 210
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
V L PGD+LYLPRG++H A+ S+H+T+ ++ T
Sbjct: 211 VVLAPGDVLYLPRGWLH-AAEAQGAVSVHLTLGIHTWT 247
>gi|359146656|ref|ZP_09180124.1| cupin superfamily protein [Streptomyces sp. S4]
Length = 406
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 105/218 (48%), Gaps = 16/218 (7%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTM 55
D W + L ++R+ ++ ++L ++ +D++ LR + +N TL +
Sbjct: 34 DFWARTAL-LNRRAGDF-SDLFSAAAVDELIARRGLRTPFLRVARN-GATLPDTAFTSPA 90
Query: 56 DVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
V Q+ +W + +G ++ +P+ + A L G V AN Y TP
Sbjct: 91 GVGATIADQLDDTALWREFADGATLVLQALHRTWQPVAEFCAGLGTELGHPVQANAYVTP 150
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS---PNFSQEEIGTPILT 168
P +GF HYD + F+LQ+EG K+W V+ P D + + P ++ P L
Sbjct: 151 PQNRGFDDHYDVHDVFVLQIEGTKRWLVHRPVHPDPLRDQPWTDRRPAVAEAAASEPYLD 210
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
V L PGD+LYLPRG++H A+ S+H+T+ ++ T
Sbjct: 211 VVLAPGDVLYLPRGWLH-AAEAQGAVSVHLTLGIHTWT 247
>gi|377561186|ref|ZP_09790649.1| hypothetical protein GOOTI_184_00050 [Gordonia otitidis NBRC
100426]
gi|377521621|dbj|GAB35814.1| hypothetical protein GOOTI_184_00050 [Gordonia otitidis NBRC
100426]
Length = 411
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 111/235 (47%), Gaps = 28/235 (11%)
Query: 1 DHWEKEPLYISRKQK--NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE 58
+HW ++PL +SR +++L++ +D+++ + + I + ++G D
Sbjct: 19 EHWGRQPL-LSRSDTLPRDFDDLLSPHSVDELISERGVRAPF---IRMAKEGGLVGKDCY 74
Query: 59 ------GRALP-QI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT 107
G +P Q+ V + G +I P+ + + + G V AN
Sbjct: 75 LGPAGFGAEMPDQVDSAKVLAQFAAGATIVLQGLHRLWPPMIEFVRGMVDDLGHPVQANA 134
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT--- 164
Y TPP +GF PHYD + F+LQ G+K W+V+ P D P + P + T
Sbjct: 135 YITPPANRGFDPHYDVHDVFVLQTSGRKHWRVHEPVHRD---PLPTQPWTDHRDAITARG 191
Query: 165 ---PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
P++ LEPGD LYLPRG++H A + + S+H+TI V T + DL+ M
Sbjct: 192 DDRPVIDTVLEPGDALYLPRGWVHSAEAL-GDTSIHLTIGVSAVTTY-DLVRAVM 244
>gi|428181856|gb|EKX50718.1| hypothetical protein GUITHDRAFT_161698 [Guillardia theta CCMP2712]
Length = 470
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 116/245 (47%), Gaps = 40/245 (16%)
Query: 4 EKEPLYISRKQKNYYNNLI--TSKKI----DDIL----RDNVIEFKKNIDITLYE----- 48
E++ + I+R +NY+N++I +S +I DD L +D + E D L +
Sbjct: 65 EQQVVMIARNSRNYFNDIINISSAQIMETFDDCLSVHSQDAMDEMISGEDYHLVKKVKGK 124
Query: 49 DGQRQTMDVEGRALPQ---IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGA 105
DG+ + V GR + I W + G ++ N L +++ G V
Sbjct: 125 DGEFWSGRVPGRTHKKDDVIKW--MKRGFTLVINNADMRYPGLVGAAQRMEDALGYAVQM 182
Query: 106 NTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQE 160
N Y TPP++QGF H+D + +LQLEG K W+VY LPR ++ P++ +
Sbjct: 183 NLYFTPPSSQGFEVHFDPMNTLVLQLEGSKTWEVYNPIIALPRPEQKFKPKHKEIHHGSR 242
Query: 161 EIGTPILTVTLEPGDLLYLPRGYIHQASTVTN--------EHSLHVTISV-YQKTAWIDL 211
+ T TL PG +LPRG++H+A T ++ SLH+T+ + Q W +
Sbjct: 243 Q------TFTLTPGSTFFLPRGWLHEAHTNSSSLPPRSRASASLHLTVGIDAQYVTWESV 296
Query: 212 LEKAM 216
L A+
Sbjct: 297 LHAAV 301
>gi|119961145|ref|YP_946193.1| cupin superfamily protein [Arthrobacter aurescens TC1]
gi|119948004|gb|ABM06915.1| putative cupin superfamily protein [Arthrobacter aurescens TC1]
Length = 388
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 91/178 (51%), Gaps = 12/178 (6%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+W + +G ++ +P+ + L G V AN Y TPP +GF HYD +
Sbjct: 93 LWRKFADGATLVLQALHRTWEPVSSFSTQLSTELGHPVQANAYITPPQNRGFDDHYDVHD 152
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYL 179
F+LQ+EG K+W ++ P VD P S P ++ G + LEPGD+LYL
Sbjct: 153 VFVLQIEGTKRWIIHEPVHVD---PLRSQPWTDRRSAVAEAAQGKAYIDTVLEPGDVLYL 209
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
PRG++H A+ + S+H+T+ V+ T L + + +A AA D E RR LP+G
Sbjct: 210 PRGWLH-AAEAQGKVSIHLTLGVHSWTR--HALAEHLAQAALAALCDDPEVRRSLPLG 264
>gi|324512664|gb|ADY45238.1| Lysine-specific demethylase NO66 [Ascaris suum]
Length = 452
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 69/108 (63%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D +EK+ L + RK YY NL ++KK +IL+ + IE+ N+++ +Y+DG+R T + G+
Sbjct: 341 DIFEKKALVVQRKNPKYYGNLYSTKKFIEILQSHYIEYGTNVNVAVYKDGERFTPNGIGK 400
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
P+ + G S++F+NPQT+ + L + QELFG FVGANT+
Sbjct: 401 VYPEEIRAQLNAGHSVQFVNPQTFCDSVWHLCDTTQELFGSFVGANTF 448
>gi|357399326|ref|YP_004911251.1| hypothetical protein SCAT_1724 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|386355365|ref|YP_006053611.1| cupin 4 family protein [Streptomyces cattleya NRRL 8057 = DSM
46488]
gi|337765735|emb|CCB74444.1| protein of unknown function [Streptomyces cattleya NRRL 8057 = DSM
46488]
gi|365805873|gb|AEW94089.1| cupin 4 family protein [Streptomyces cattleya NRRL 8057 = DSM
46488]
Length = 298
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 137/310 (44%), Gaps = 54/310 (17%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDG----QRQTMDVE 58
W EPL R+ + +T ++D ++ + + ++ L +DG + + D +
Sbjct: 17 WPHEPLRF-RRDPAEFGRWLTLAEVDTLIDAGCLAMR---NVVLLKDGRVSEKWEYADPD 72
Query: 59 GRALPQI--VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELF-------GCFVGANTYQ 109
+P+ V +H + G S+ ++ L+ L S+ +L+ G V N Y
Sbjct: 73 DPTMPRCGAVREHIKAGGSVS-------LRELEHLKPSVADLYRALCRETGYGVHINAYL 125
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM---VDEYLPRYSSPNFSQEE----- 161
TPP QG H+D ILQ+ G+K W V+ P + V EY + F+ E+
Sbjct: 126 TPPGCQGLKYHFDPYVTLILQIHGRKTWPVHPPFVANPVQEY-GSFHLIGFTPEQRHYLA 184
Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT---AWIDLLEKAMPK 218
P T TL PGD+L+LPRGY+H T +E SLH+T++ ++T A L E + +
Sbjct: 185 HTPPAATFTLAPGDVLWLPRGYVHSPYTEGSETSLHITVAFKERTYHWAAERLAEDILRR 244
Query: 219 ALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDD 278
AL D E R LP P + D L + E +D L + +D +
Sbjct: 245 AL-----ADPEMREELP-------------PHLVTLDTLPVVEQARDYLIGALLSLDPET 286
Query: 279 GVDEMGKQLM 288
V + + +
Sbjct: 287 AVKYLQRSAL 296
>gi|397631554|gb|EJK70195.1| hypothetical protein THAOC_08467 [Thalassiosira oceanica]
Length = 462
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 14/183 (7%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
Y +GCS+ + + + L LQ F V AN+Y TPP + H DD + ++
Sbjct: 114 YLDGCSVIVNHADHHHSKIAGLCDELQLTFP-HVYANSYLTPPRSSAVKSHSDDRDVLVI 172
Query: 130 QLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPIL---TVTLEPGDLLYLP 180
Q+ G+K WKVY P +E + + P S+E + I V+LEPG +LY+P
Sbjct: 173 QVLGRKNWKVYKTVPVSFP-FANEQVGKEGRPPVSREVLDGGICFDGKVSLEPGHVLYMP 231
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
RG++H+AST ++E S H+TI++ W + M + ++ +E+R L IG
Sbjct: 232 RGFVHEASTDSDEPSFHITIAIATHD-WC--MSTVMTEMVRERLNECIEYRIPLSIGAGA 288
Query: 241 YAG 243
Y G
Sbjct: 289 YQG 291
>gi|377573080|ref|ZP_09802154.1| hypothetical protein MOPEL_009_00090 [Mobilicoccus pelagius NBRC
104925]
gi|377538115|dbj|GAB47319.1| hypothetical protein MOPEL_009_00090 [Mobilicoccus pelagius NBRC
104925]
Length = 407
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 84/173 (48%), Gaps = 7/173 (4%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
+ +G +I P+ + L G V AN Y TPP +QGF+ HYD + F+L
Sbjct: 102 FADGATIVLQGLHRTWAPIAEFVRDLAADLGHPVQANAYITPPQSQGFSAHYDVHDVFVL 161
Query: 130 QLEGKKKWKVYLP----RMVDEYLPRYSSPNF-SQEEIGTPILTVTLEPGDLLYLPRGYI 184
QL G+K+W ++ P M DE + ++ G P L V L PGD LYLPRGY+
Sbjct: 162 QLSGEKRWVIHEPVLQWPMRDEPWETGGRRDLVARAAEGEPALDVVLRPGDCLYLPRGYL 221
Query: 185 HQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
H A+T S H+T+ V+ T L M +AL AG G+ +G
Sbjct: 222 H-AATALGGVSAHLTLGVHTWTR-HHLATDVMRQALARAGDLRAPLASGVDVG 272
>gi|295840576|ref|ZP_06827509.1| cupin superfamily protein [Streptomyces sp. SPB74]
gi|295828066|gb|EFG65806.1| cupin superfamily protein [Streptomyces sp. SPB74]
Length = 395
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 72/253 (28%), Positives = 114/253 (45%), Gaps = 30/253 (11%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTMDV 57
W + P ++R+ ++ ++L +S +D++ LR + K+ TL E V
Sbjct: 31 WGRTP-SLTRRASDF-SDLFSSAAVDELVSRRGLRTPFLRVAKD-GATLPESSFTAPGGV 87
Query: 58 EGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
Q+ +W + +G ++ +P+ + L G V AN Y TPP
Sbjct: 88 GATIADQLDDTALWRAFADGATLVLQALHRTWEPVGTFTSRLATELGHPVQANAYVTPPQ 147
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPIL 167
QGF HYD + F+LQ+ G K+W ++ P + P + P ++ G P+L
Sbjct: 148 NQGFDAHYDVHDVFVLQITGTKRWLIHEPVVT---APTRAQPWTAHRAAVAEAAAGPPVL 204
Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT---AWIDLLEKAMPKALQAAG 224
LEPGD+LYLPRG++H A S H+T+ V+ T L E A+ A
Sbjct: 205 DTVLEPGDVLYLPRGWLHSA-VAQGTVSTHLTLGVHPWTRHAVAAQLAETAL-----ATL 258
Query: 225 ATDLEFRRGLPIG 237
D E R LP+G
Sbjct: 259 REDPELRTSLPLG 271
>gi|159037898|ref|YP_001537151.1| cupin 4 family protein [Salinispora arenicola CNS-205]
gi|157916733|gb|ABV98160.1| Cupin 4 family protein [Salinispora arenicola CNS-205]
Length = 432
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/136 (41%), Positives = 75/136 (55%), Gaps = 12/136 (8%)
Query: 106 NTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPN-FSQEEI 162
N Y TP +QGFA HYD + F+LQ++G+K W+++ P + D E P + S
Sbjct: 163 NAYLTPAGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLPDPLERQPWGGRADEVSATAT 222
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH-SLHVTISVYQKTAWIDLLEKAMPKALQ 221
G P L VTL PGD LYLPRG++H A EH SLH+T+ V T + L+E+ L
Sbjct: 223 GAPALDVTLAPGDALYLPRGWLHSAQ--AQEHSSLHLTVGVRALTRYT-LVEE-----LL 274
Query: 222 AAGATDLEFRRGLPIG 237
A A D R LP G
Sbjct: 275 ALAAEDQRLRATLPFG 290
>gi|81299184|ref|YP_399392.1| hypothetical protein Synpcc7942_0373 [Synechococcus elongatus PCC
7942]
gi|81168065|gb|ABB56405.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 428
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 143/316 (45%), Gaps = 41/316 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W ++ +YI+ + +L + ++D+L N F+++ ++ DG+ +
Sbjct: 20 YWAQQSVYIAGDSLRF-QSLFSWNHLNDLL--NYQTFRES-ELRFSRDGE----SLPAGD 71
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
P + +EG ++ + L+ L +L++ FG N Y +P QGF HY
Sbjct: 72 NPTLWRSRLQEGATLVLNGVHHRVPALKHLATNLRQEFGYRCHINLYSSPAQQQGFDCHY 131
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---TPILTVTLEPGDLLY 178
D + ILQ+EG+K+W +Y E LP ++ S + + P L L PGDLLY
Sbjct: 132 DTHDVLILQIEGEKEWLIY-----PETLPYPTADQPSYDRLPPEEPPYLQQVLSPGDLLY 186
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDLEFRRGLPI 236
+PRG+ H A SLH+TI ++ T W++ L++ + +R+GLP+
Sbjct: 187 IPRGHWHYA-IAQETASLHLTIGIHTATGLDWVNWLQQQL--------RDQPHWRQGLPL 237
Query: 237 GYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALP--P 294
AG P L ++ +L+ L +L+ Y+ +D+ + L P P
Sbjct: 238 -----AGSCNFDP-------LKLRGHLESLRDQLITYLQEPQAIDDYLQYLSWQDQPHLP 285
Query: 295 VLSPEELQCSVFENGL 310
+ P +L GL
Sbjct: 286 IQLPLQLHGDPLAQGL 301
>gi|386848916|ref|YP_006266929.1| cupin 4 family protein [Actinoplanes sp. SE50/110]
gi|359836420|gb|AEV84861.1| cupin 4 family protein [Actinoplanes sp. SE50/110]
Length = 388
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 110/234 (47%), Gaps = 21/234 (8%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDN--------VIEFKKNIDITLYEDGQRQ 53
+W K+PL +SR + + +L++ +D++L V + K + Y G
Sbjct: 21 YWGKQPL-LSRGRAGF-TDLLSVPAVDELLSRRGLRTPFLRVAQHGKVLPAGRYTGGGGA 78
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
+V + + V + G ++ PL + L + N Y TPP
Sbjct: 79 GAEVGDQVHDERVMRLFAGGATLVLQGLHRLWPPLIEFTGRLGAELNRPLQVNAYLTPPG 138
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPIL 167
+QGFA HYD + F+LQ EG+K+W ++ P + D P + P + GTP +
Sbjct: 139 SQGFATHYDTHDVFVLQCEGRKRWIIHPPVLSD---PIENQPWGGHADEVAATAEGTPAI 195
Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
LEPGD LYLPRG++H A T SLH+T+ + T + L+E+ + A+Q
Sbjct: 196 DTVLEPGDALYLPRGWLHSART-EQVRSLHLTVGIRSLTRYA-LVEELLAMAVQ 247
>gi|326435323|gb|EGD80893.1| hypothetical protein PTSG_01480 [Salpingoeca sp. ATCC 50818]
Length = 518
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 93/211 (44%), Gaps = 20/211 (9%)
Query: 5 KEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQ 64
+ PL+I+R NYY L TS L K L +G + E L
Sbjct: 90 RRPLHIARDDPNYYGQLFTSSDFFRFL--TFYRHKYEAGFNLVRNGDTVFANDEPVQLGD 147
Query: 65 IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
++ R G + T + + L+ +L+ F N Y TP ++GF H D
Sbjct: 148 VLDSIARNGSTAVMTAVHTRDEHIWALSWALERETSLFSATNIYLTPAQSRGFIFHQDVH 207
Query: 125 EAFILQLEGKKKWKVYLP------------RMVDEYLPRYSSPNFSQEE-IGTPILTVTL 171
ILQLEGKK W+++ P + L R+ F +EE +G TL
Sbjct: 208 AVIILQLEGKKHWRLFTPPEERSFPWYHKQEAASQRLTRF----FKEEELVGLQEFNFTL 263
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
GD L +PRG IH A T ++EHSLH+T+S+
Sbjct: 264 HAGDTLIVPRGVIHAAET-SDEHSLHLTVSL 293
>gi|254383725|ref|ZP_04999074.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194342619|gb|EDX23585.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length = 394
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 3/136 (2%)
Query: 72 EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
EG S+ + P+++ A L+ F V N Y + +GF H+DD + +LQL
Sbjct: 98 EGASLVIDAIEQIHPPVREAAAGLERFFRTPVQVNAYASWTAEEGFGTHWDDHDVVVLQL 157
Query: 132 EGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVT 191
EG K+WKVY P P + + G PI + L PGD+LYLPRG+ H S
Sbjct: 158 EGSKRWKVYGP---TRQAPAWRDVETPEVPTGDPIADIVLTPGDVLYLPRGWWHAVSADQ 214
Query: 192 NEHSLHVTISVYQKTA 207
SLH+T + +T
Sbjct: 215 GTASLHLTFGLATQTG 230
>gi|453051139|gb|EME98654.1| hypothetical protein H340_20228 [Streptomyces mobaraensis NBRC
13819 = DSM 40847]
Length = 391
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 116/289 (40%), Gaps = 24/289 (8%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
R P ++ EG S+ +P+ L ++LF AN Y + +GF
Sbjct: 83 RLHPALLHARLAEGASLALDQADELHRPVAALAEECEQLFRTRFQANAYASWTATEGFGT 142
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
H+DD + ++QLEG K WK+Y P P Y Q PI + L+PGDLLYL
Sbjct: 143 HWDDHDVLVVQLEGAKHWKIYGPTRPH---PLYRDTTEPQAPPADPIADLVLQPGDLLYL 199
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
PRG+ H + SLH+T +Q DLL + L A EFR LP+
Sbjct: 200 PRGWWHAVTADQGVCSLHLTYG-FQTHTPSDLLTFVSDQLLGHA-----EFRADLPL--- 250
Query: 240 RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVD--EMGKQLMHDALPPV-- 295
AG + RL ++E L D A L Y + DG D M L H P
Sbjct: 251 -LAGREAQAAFVTRIGRL-VREQLAD-PALLTRYAESMDGRDTGRMTPSLPHLTAVPADP 307
Query: 296 -----LSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
L+ + E+G+ ++ G Y RLV R
Sbjct: 308 DIRVRLTTARALLTTTEDGVHLAAAGHSYEFAPAAGPVLARLVSGGEHR 356
>gi|378718179|ref|YP_005283068.1| putative cupin superfamily protein [Gordonia polyisoprenivorans
VH2]
gi|375752882|gb|AFA73702.1| putative cupin superfamily protein [Gordonia polyisoprenivorans
VH2]
Length = 411
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 88/179 (49%), Gaps = 18/179 (10%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
V + +G +I P+ L L + G V AN Y TPP+ +GF PHYD +
Sbjct: 90 VLAQFADGATIVLQGLHRLWPPVIDLVRGLVDDLGHPVQANAYVTPPSNRGFDPHYDVHD 149
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYL 179
F+LQ+ G+K+W V+ P +D P S P + + P++ L GD LYL
Sbjct: 150 VFVLQISGEKQWTVHAPVHLD---PVDSQPWTDHREDIQRRTDDPPVIDTVLRSGDALYL 206
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL-EFRRGLPIG 237
PRG++H A + S+H+TI TA +D+L + + TD+ E R LP+G
Sbjct: 207 PRGWVHSARALGTT-SIHLTIGTAALTA-LDVLGAVVEQ------LTDIRELRESLPMG 257
>gi|359766385|ref|ZP_09270197.1| hypothetical protein GOPIP_038_00190 [Gordonia polyisoprenivorans
NBRC 16320]
gi|359316221|dbj|GAB23030.1| hypothetical protein GOPIP_038_00190 [Gordonia polyisoprenivorans
NBRC 16320]
Length = 352
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 86/175 (49%), Gaps = 18/175 (10%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
+ +G +I P+ L L + G V AN Y TPP+ +GF PHYD + F+L
Sbjct: 94 FADGATIVLQGLHRLWPPVIDLVRGLVDDLGHPVQANAYVTPPSNRGFDPHYDVHDVFVL 153
Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYLPRGY 183
Q+ G+K W V+ P +D P S P + + P++ L GD LYLPRG+
Sbjct: 154 QISGEKHWTVHAPVHLD---PVDSQPWTDHREDIERRTDDPPVIDTVLRSGDALYLPRGW 210
Query: 184 IHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL-EFRRGLPIG 237
+H A + S+H+TI TA +D+L + + TD+ E R LP+G
Sbjct: 211 VHSARALGTT-SIHLTIGTAALTA-LDVLGAVVEQ------LTDIRELRESLPMG 257
>gi|113476595|ref|YP_722656.1| cupin 4 [Trichodesmium erythraeum IMS101]
gi|110167643|gb|ABG52183.1| Cupin 4 [Trichodesmium erythraeum IMS101]
Length = 385
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 109/214 (50%), Gaps = 14/214 (6%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W K+ + IS K + + +L + +K++ +L + I++ D+ L DG + + E R
Sbjct: 19 NNWTKKAIAISNKGEKDFTDLFSWEKLNYLLNFHQIKYP---DVRLAFDG-KVLEEKENR 74
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
Q +G ++ I + + L G N Y + + +GF+PH
Sbjct: 75 NFTQWC----EKGATLILDQIHRRIPEVAIFTSKLSYELGYPTQVNAYCSWSSKKGFSPH 130
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEY-LPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
YD + FILQ+EG K+W VY +Y LP S +F+ E L+ L PGD+LY+
Sbjct: 131 YDTHDVFILQVEGNKQWYVYNDTF--KYPLPNQKSSSFTPPE-KEAYLSCILHPGDVLYI 187
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLE 213
PRG+ H A T E S+H+T+ ++ T +DLLE
Sbjct: 188 PRGHWHYAVT-KEEPSIHLTLGIHSSTG-VDLLE 219
>gi|406575710|ref|ZP_11051405.1| cupin superfamily protein [Janibacter hoylei PVAS-1]
gi|404554923|gb|EKA60430.1| cupin superfamily protein [Janibacter hoylei PVAS-1]
Length = 414
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 85/175 (48%), Gaps = 14/175 (8%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
+ +G +I P+ L L G V AN Y TP +GFA HYD + F+L
Sbjct: 106 FADGHTIVLQGLHRTHPPVLALAQELAADLGHPVQANAYVTPAANRGFAAHYDVHDVFVL 165
Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTPILTVTLEPGDLLYLPRGY 183
Q EG K+W ++ P V E+ P P Q P L VTLEPGD+LY+PRG+
Sbjct: 166 QTEGTKRWSLHAP--VHEH-PLRDQPWQDRRSQVEQAAAAEPYLDVTLEPGDVLYVPRGW 222
Query: 184 IHQASTVTNEHSLHVTISVYQKTAW-IDLLEKAMPKALQAAGATDLEFRRGLPIG 237
+H A+ + S+H+TI V+ W L +A+ + A A E R LP G
Sbjct: 223 LHSATALGGV-SVHLTIGVH---VWHRHHLAEAVLDGTRRALAEQPELRTSLPPG 273
>gi|449457281|ref|XP_004146377.1| PREDICTED: uncharacterized protein LOC101213846 [Cucumis sativus]
Length = 541
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 80/142 (56%), Gaps = 11/142 (7%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFI 128
++EG +I + + + ++ +L LFG VGAN Y TPP +QG A HYDD F+
Sbjct: 201 FKEGYTIALRGMEFRHEKIAAISNTLASLFGQPSVGANMYLTPPGSQGLARHYDDHCVFV 260
Query: 129 LQLEGKKKWKVYLPRMVDEYLPR-YSSPNF-SQEEIGTPILT---VTLEPGDLLYLPRGY 183
QL G K+W V+ P +YLPR Y S F S ++ +P+ L GD+LY+PRG+
Sbjct: 261 CQLAGSKQWTVFSPP--RKYLPRLYDSHEFPSCLDVESPLAVGRKFFLREGDVLYIPRGF 318
Query: 184 IHQASTVT---NEHSLHVTISV 202
+H+A T + SLH+T +
Sbjct: 319 LHEARTEIGGPDGSSLHLTFGI 340
>gi|302523562|ref|ZP_07275904.1| cupin 4 family protein [Streptomyces sp. SPB78]
gi|302432457|gb|EFL04273.1| cupin 4 family protein [Streptomyces sp. SPB78]
Length = 395
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 112/255 (43%), Gaps = 34/255 (13%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTMDV 57
W + P ++R ++ +++ +S +D++ LR + K+ TL E V
Sbjct: 31 WGRLP-SLTRGAGDF-SDIFSSAAVDELVSRRGLRTPFLRIAKD-GATLPEPSFTAPGGV 87
Query: 58 EGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
Q+ +W + +G ++ +P+ + L G V AN Y TPP
Sbjct: 88 GATIADQLDDTALWRAFADGATLVLQALHRTWEPVGTFTSRLATELGHPVQANAYVTPPQ 147
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---------- 163
QGF HYD + F+LQ+ G K+W ++ P + ++P Q G
Sbjct: 148 NQGFDAHYDVHDVFVLQITGTKRWLIHEPVV--------TAPTREQPWTGHRAAVAEAAS 199
Query: 164 -TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
P+L LEPGD+LYLPRG++H A S+H+T+ V+ T L + + A
Sbjct: 200 GAPVLDTVLEPGDVLYLPRGWLHSA-VAQGAVSVHLTLGVHPWTR--HALAAQLAETALA 256
Query: 223 AGATDLEFRRGLPIG 237
A D R LP+G
Sbjct: 257 ALREDPAMRTSLPLG 271
>gi|318060611|ref|ZP_07979334.1| putative cupin superfamily protein [Streptomyces sp. SA3_actG]
gi|318075845|ref|ZP_07983177.1| putative cupin superfamily protein [Streptomyces sp. SA3_actF]
Length = 395
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 22/183 (12%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+W + +G ++ +P+ + L G V AN Y TPP QGF HYD +
Sbjct: 100 LWRAFADGATLVLQALHRTWEPVGTFTSRLATELGHPVQANAYVTPPQNQGFDAHYDVHD 159
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG-----------TPILTVTLEPG 174
F+LQ+ G K+W ++ P + ++P Q G P+L LEPG
Sbjct: 160 VFVLQITGTKRWLIHEPVV--------TAPTREQPWTGHRAAVAEAASGAPVLDTVLEPG 211
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
D+LYLPRG++H A S+H+T+ V+ T L + + AA D R L
Sbjct: 212 DVLYLPRGWLHSA-VAQGAVSVHLTLGVHPWTR--HALAAQLAETALAALREDPAMRTSL 268
Query: 235 PIG 237
P+G
Sbjct: 269 PLG 271
>gi|333022517|ref|ZP_08450581.1| putative cupin superfamily protein [Streptomyces sp. Tu6071]
gi|332742369|gb|EGJ72810.1| putative cupin superfamily protein [Streptomyces sp. Tu6071]
Length = 395
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 114/258 (44%), Gaps = 40/258 (15%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDI-----LRDNVIEFKKNIDITLYEDGQRQTMDV 57
W + P ++R ++ +++ +S +D++ LR + K+ TL E V
Sbjct: 31 WGRLP-SLTRGAGDF-SDIFSSAAVDELVSQRGLRTPFLRVAKD-GATLPEPSFTAPGGV 87
Query: 58 EGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
Q+ +W + +G ++ +P+ + L G V AN Y TPP
Sbjct: 88 GATIADQLDDTALWRAFADGATLVLQALHRTWEPVGTFTSRLATELGHPVQANAYVTPPQ 147
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---------- 163
QGF HYD + F+LQ+ G K+W ++ P + ++P Q G
Sbjct: 148 NQGFDAHYDVHDVFVLQITGTKRWLIHEPVV--------TAPTREQPWTGHRAAVAEAAS 199
Query: 164 -TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA---WIDLLEKAMPKA 219
P+L LEPGD+LYLPRG++H A S+H+T+ V+ T + L E A+
Sbjct: 200 GAPVLDTVLEPGDVLYLPRGWLHSA-VAQGAVSVHLTLGVHPWTRHALAVQLAETAL--- 255
Query: 220 LQAAGATDLEFRRGLPIG 237
AA D R LP+G
Sbjct: 256 --AALREDPAMRTSLPLG 271
>gi|443291200|ref|ZP_21030294.1| Transcription factor jumonji jmjC domain-containing protein
[Micromonospora lupini str. Lupac 08]
gi|385885602|emb|CCH18401.1| Transcription factor jumonji jmjC domain-containing protein
[Micromonospora lupini str. Lupac 08]
Length = 424
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 108/251 (43%), Gaps = 23/251 (9%)
Query: 2 HWEKEPLYISRKQK----NYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYED 49
HW PL +SR + + +L + D++L R + K+ + +
Sbjct: 39 HWGHTPL-LSRAAELPDPAGFTDLFSPADADELLSRRGLRTPFLRVAKDGQLVPAARWTG 97
Query: 50 GQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQ 109
G ++ + L + V + Y G ++ PL L + N Y
Sbjct: 98 GGGAGAEIGDQVLDERVLEQYAAGATLVLQGLHRIWPPLVDFARDLGLALNQPLQINAYL 157
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY---SSPNFSQEEIGTPI 166
TP +QGFA HYD + F+LQ++G+K W+++ P + D + + S G
Sbjct: 158 TPAGSQGFATHYDTHDVFVLQVDGRKHWRIHPPVLPDPLEKQTWGGRADEVSATAQGPAA 217
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
L V L PGD LYLPRG++H A + SLH+T+ + T + A+ + L A A
Sbjct: 218 LDVVLAPGDALYLPRGWLHSAQ-AQDASSLHLTVGIRALTRY------ALVEELLALAAE 270
Query: 227 DLEFRRGLPIG 237
D R LP G
Sbjct: 271 DQRLRASLPFG 281
>gi|449525377|ref|XP_004169694.1| PREDICTED: lysine-specific demethylase NO66-like [Cucumis sativus]
Length = 347
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 97/194 (50%), Gaps = 16/194 (8%)
Query: 18 YNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIR 77
Y I + D+ L+ + F+KN + E + + + ++EG +I
Sbjct: 154 YQQDIRVLRTDECLKREIHFFQKNFEPCCIEGPHFLKLHDALKC-----EEAFKEGYTIA 208
Query: 78 FLNPQTYIKPLQQLNASLQELFG-CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKK 136
+ + + ++ +L LFG VGAN Y TPP +QG A HYDD F+ QL G K+
Sbjct: 209 LRGMEFRHEKIAAISNTLASLFGQPSVGANMYLTPPGSQGLARHYDDHCVFVCQLAGSKQ 268
Query: 137 WKVYLPRMVDEYLPR-YSSPNF-SQEEIGTPILT---VTLEPGDLLYLPRGYIHQASTVT 191
W V+ P +YLPR Y S F S ++ +P+ L GD+LY+PRG++H+A T
Sbjct: 269 WTVFSPP--RKYLPRLYDSHEFPSCLDVESPLAVGRKFFLREGDVLYIPRGFLHEARTEI 326
Query: 192 ---NEHSLHVTISV 202
+ SLH+T +
Sbjct: 327 GGPDGSSLHLTFGI 340
>gi|262202804|ref|YP_003274012.1| cupin [Gordonia bronchialis DSM 43247]
gi|262086151|gb|ACY22119.1| Cupin 4 family protein [Gordonia bronchialis DSM 43247]
Length = 414
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 101/214 (47%), Gaps = 13/214 (6%)
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P V + G +I P+ ++ + G V AN Y TPP +GF PHYD
Sbjct: 87 PAGVLREFAAGATIVLQGLHRLWPPVIDFVRAMVDDLGHPVQANAYVTPPGNRGFDPHYD 146
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI----GTPILTVTLEPGDLLY 178
+ F+LQ+ G K+W+V+ P + + + +Q E P + L PGD LY
Sbjct: 147 VHDVFVLQVAGTKRWRVHRPVHTHPLATQPWTDHRAQIERRASDDAPEIEAVLSPGDALY 206
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
LPRG+IH A + + S+H+TI V T + + A+ A EFR+ LP+G
Sbjct: 207 LPRGWIHSADAL-GDTSIHLTIGVGAVTV------RDVVAAIVAELDDCAEFRQSLPLG- 258
Query: 239 LRYAGLARGKPVDIQADRLAMKENLKDLLAKLVE 272
+ G + P+ +A A+ E L+D A + E
Sbjct: 259 IDLTGRDQTVPIATKA-MAAVVERLRDHAADVGE 291
>gi|357162427|ref|XP_003579407.1| PREDICTED: uncharacterized protein LOC100839961 [Brachypodium
distachyon]
Length = 803
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/174 (33%), Positives = 93/174 (53%), Gaps = 26/174 (14%)
Query: 46 LYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVG 104
++DG T+ ++ A + D ++ G SI + + + + ++L +LFG VG
Sbjct: 428 FFDDG---TVFLDAAAFIEDCKDAFKSGFSIALRGMEFRSEKVAAIASALADLFGQPSVG 484
Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY------LPRMVDEYLPRYSSPNFS 158
AN Y +PP +QG + HYDD +LQL G KKWK++ LPR+ Y P + S +
Sbjct: 485 ANLYFSPPRSQGLSRHYDDHCVLVLQLVGHKKWKIWPSTKSVLPRL---YEP-FDSLDGL 540
Query: 159 QEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNE----------HSLHVTISV 202
++ G I L GD++Y+PRGYIH+A T E +SLH+T+++
Sbjct: 541 VDDSGGRI--EVLREGDIMYVPRGYIHEAHTAVAEEEYEVNASSNYSLHLTLAI 592
>gi|443672956|ref|ZP_21138032.1| Cupin 4 family protein [Rhodococcus sp. AW25M09]
gi|443414441|emb|CCQ16370.1| Cupin 4 family protein [Rhodococcus sp. AW25M09]
Length = 420
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 115/251 (45%), Gaps = 25/251 (9%)
Query: 3 WEKEP-LYISRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKN---IDITLYEDGQRQ 53
W P L S +++L+T +D+++ R + K+ +D T +
Sbjct: 33 WGIAPRLTRSADLPETFDDLMTLDAVDELIAERGVRTPFVRMAKDGRLVDKTHFTASGGF 92
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
+V + V + + G ++ PL L G N+Y TP
Sbjct: 93 GAEVTDQIDSGAVLESFAAGNTLVLQGLHRLWPPLIHFVGDLVRELGHPAQVNSYVTPAA 152
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV-----DEYLPRYSSPNFSQEEIGTPILT 168
+QGF+PHYD + F++Q+ G K+W V+ P V + R S+ ++ +P L
Sbjct: 153 SQGFSPHYDVHDVFVVQIAGTKRWIVHSPVHVHPLNNQPWSDRRSA--VEEQASNSPALD 210
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDL 228
V LEPGD+LYLPRG+IH A + + ++H+T+ V + D+ A+ KAL D+
Sbjct: 211 VVLEPGDVLYLPRGWIHSAQAL-GDTTIHLTVGVASFNDY-DIAHHAL-KALG-----DV 262
Query: 229 EFRRG-LPIGY 238
E R LP G+
Sbjct: 263 ESVRAPLPAGF 273
>gi|428211992|ref|YP_007085136.1| cupin [Oscillatoria acuminata PCC 6304]
gi|428000373|gb|AFY81216.1| Cupin superfamily protein [Oscillatoria acuminata PCC 6304]
Length = 397
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 151/338 (44%), Gaps = 50/338 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W + L+I K ++ + L +++ +L + I + N+ +L DG+
Sbjct: 19 ENWTEIALFIPGKSRDKFAGLFGWNQLNQLLNYHFIPYP-NLRFSL--DGESLP------ 69
Query: 61 ALPQIVW-DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
A P W + +G ++ + + L++L A ++E G N Y +P QGF
Sbjct: 70 ATPSHQWLERLHQGATLIIDSVHERVPELEKLVADVREQTGYRSQINLYCSPSDQQGFNC 129
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---TPILTVTLEPGDL 176
HYD + ILQ+EG+K W V+ E +P + S+++I P L L+PGDL
Sbjct: 130 HYDTHDVLILQIEGEKDWFVF-----PETVPFPVTGMRSKDQIPPDVPPYLECRLQPGDL 184
Query: 177 LYLPRGYIHQASTVTNEH------SLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDL 228
LY+PRG+ H A +N SLH+T+ + +T W++ L K + + Q
Sbjct: 185 LYIPRGHWHYAIASSNPSRDDYSPSLHLTLGIDCQTGLDWMNWLGKELQENTQ------- 237
Query: 229 EFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLM 288
+R L PV DR A++ L+ L +L+E+++ + + L
Sbjct: 238 -WRASL--------------PVIPSGDRTAVQPYLEKLRDRLLEFLEDPETITGYLDDLN 282
Query: 289 H-DALP-PVLSPEELQCSVFENGLRMSQTGEVYNATQI 324
+ D P P P +L +F+ GL + QI
Sbjct: 283 YRDRTPLPFAFPHQLGSQIFDFGLETRFIPSALHPVQI 320
>gi|444307228|ref|ZP_21142971.1| cupin superfamily protein [Arthrobacter sp. SJCon]
gi|443480438|gb|ELT43390.1| cupin superfamily protein [Arthrobacter sp. SJCon]
Length = 367
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 84/178 (47%), Gaps = 12/178 (6%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+W + +G ++ P+ ++SL G V AN Y TPP +GF HYD +
Sbjct: 106 LWRTFADGATLVLQALHRTWAPVADFSSSLSTELGHPVQANAYITPPQNRGFDDHYDVHD 165
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI------LTVTLEPGDLLYL 179
F+LQ+EG K+W ++ P D P P + + LEPGD+LYL
Sbjct: 166 VFVLQIEGTKRWIIHEPVHTD---PLRDQPWTDRRSAVAEAAKAEAYIDTVLEPGDVLYL 222
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
PRG++H A+ E S+H+T+ ++ T + + A D E RR LP+G
Sbjct: 223 PRGWLH-AAQAQGEVSIHLTLGIHSWTRYALAEQLAQAALAAL--CDDPEMRRTLPLG 277
>gi|400536533|ref|ZP_10800067.1| hypothetical protein MCOL_V219156 [Mycobacterium colombiense CECT
3035]
gi|400329546|gb|EJO87045.1| hypothetical protein MCOL_V219156 [Mycobacterium colombiense CECT
3035]
Length = 338
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 78/149 (52%), Gaps = 13/149 (8%)
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P V D EG +I ++Y++ + L+ S++ N Y TPP + GFAPHYD
Sbjct: 31 PARVRDGLAEGYTIVVNGVESYLRTIASLSHSIEVELDFPTRVNAYVTPPESTGFAPHYD 90
Query: 123 DIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILT----VTLEPGDLL 177
+ +LQ++G K W V P + + R + +GT LT V L+PGD+L
Sbjct: 91 PHDVLVLQIQGSKTWHVSDGPAVPPHEIQR-------RRGVGTDGLTSSTDVCLQPGDVL 143
Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKT 206
YLPRG +H A T E S+H+TI ++ T
Sbjct: 144 YLPRGQVHSAET-HAEPSVHLTIGLHAPT 171
>gi|408826623|ref|ZP_11211513.1| cupin 4 family protein [Streptomyces somaliensis DSM 40738]
Length = 299
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 88/177 (49%), Gaps = 28/177 (15%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
Y G +IR N Q I + Q++ +Q G +T+ TPP QG H+D A I+
Sbjct: 86 YESGHTIRLGNLQRIIPAMAQVSEDIQRETGYSNYIHTFLTPPANQGLRHHWDQQMAVIV 145
Query: 130 QLEGKKKWKVYLPRM--------------VDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
Q+ G K+W+++ P + D+Y+P + E P + V L+ G
Sbjct: 146 QIAGTKQWQLWRPPVEAPMREYNESWRVWRDDYIPTW--------EAAGPDMEVDLKAGR 197
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
L LPRG++H V +EHS+H+T ++ ++T + L+EK + A++ +FRR
Sbjct: 198 SLLLPRGWVHNPHVVVDEHSIHLTFAIRERTP-MWLVEKLVAGAIETP-----DFRR 248
>gi|296170713|ref|ZP_06852288.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC
BAA-614]
gi|295894702|gb|EFG74436.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC
BAA-614]
Length = 403
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 65/214 (30%), Positives = 100/214 (46%), Gaps = 21/214 (9%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQT----MDVE 58
W +I R++ Y++ L+ D L D+V + + L + GQ + +
Sbjct: 34 WAIRYHHIRRRRPGYFDGLLPGPSAVDALLDHV--QPEASAVRLVKGGQDRDPATYRRAD 91
Query: 59 GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
G P V D G +I + Y++ + ++ S++ N Y TPP + GF
Sbjct: 92 GGLDPVAVRDGLAAGYTIVVNGVERYLRTIASMSHSIEVELNFPTRVNAYVTPPASTGFV 151
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE--IGTPILT----VTLE 172
PHYD + +LQ+EG K W V V P+ Q +GT LT V L+
Sbjct: 152 PHYDPHDVLVLQIEGSKTWHVSDAAAV--------PPHEIQRRRGVGTDGLTSSTDVDLQ 203
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
PGD+LYLPRG +H A T +E S+H+TI ++ T
Sbjct: 204 PGDVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 236
>gi|379059516|ref|ZP_09850042.1| cupin superfamily protein [Serinicoccus profundi MCCC 1A05965]
Length = 401
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 8/146 (5%)
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+W + +G ++ +P+ + ++ L + G V AN Y TPP +GF HYD +
Sbjct: 106 LWRAFADGATLVLQALHRTWEPVAEFSSRLSDELGHPVQANAYITPPQNRGFDDHYDVHD 165
Query: 126 AFILQLEGKKKWKVYLPRMVD-----EYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
F+LQ+ G K+W ++ P D + R S+ S+ L V LEPGD LYLP
Sbjct: 166 VFVLQIAGTKRWIIHEPVHQDPLRDQPWTDRRSA--VSEAAARDAYLDVVLEPGDALYLP 223
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKT 206
RG++H A+ S+H+T+ V+ T
Sbjct: 224 RGWLH-AAQAQGAVSIHLTLGVHTWT 248
>gi|148238895|ref|YP_001224282.1| hypothetical protein SynWH7803_0559 [Synechococcus sp. WH 7803]
gi|147847434|emb|CAK22985.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 370
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 94/204 (46%), Gaps = 26/204 (12%)
Query: 48 EDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT 107
ED + + +G V + EG SI + L L +SL E G N
Sbjct: 40 EDYDPKIVANDGHLCKPYVLSRFHEGASIVVNEVHRFSSQLMDLASSLSEELGVQCVVNA 99
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
Y TPP + +PH+D + F LQ+ G+K+W VD L ++ + Q PIL
Sbjct: 100 YLTPPQSVALSPHFDSHDIFALQVVGQKQW------FVDSELSSLTTKSTFQ-----PIL 148
Query: 168 T-----------VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
+ V ++ GD++YLPRG +H A T++ + S+H+T+ +Y W + + A+
Sbjct: 149 SADQASSVDFREVVMDEGDVMYLPRGCVHHARTISCQ-SMHLTVGLY-PLEWSEFIASAV 206
Query: 217 PKALQAAGATDLEFRRGLPIGYLR 240
A A A L R +P+G R
Sbjct: 207 EIAASAPEARGL--RTSVPLGLKR 228
>gi|119718597|ref|YP_925562.1| cupin 4 family protein [Nocardioides sp. JS614]
gi|119539258|gb|ABL83875.1| Cupin 4 family protein [Nocardioides sp. JS614]
Length = 403
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 100/226 (44%), Gaps = 39/226 (17%)
Query: 19 NNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIV-----WDHYREG 73
++L+TS + R I K+ + L E + + G+ L +V + +G
Sbjct: 75 DHLLTSSAV----RTPSIRLAKDGAV-LPESAYTRRASLAGKPLTGLVDARKALALFDDG 129
Query: 74 CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEG 133
++ F Y PL +L A L+ G AN Y TPP AQGFA H D + F+ Q G
Sbjct: 130 ATVVFQGLHRYWPPLTRLIARLELELGHPCQANAYLTPPGAQGFAVHSDSHDVFVFQTAG 189
Query: 134 KKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNE 193
K+W+V+ P +E L LEPG +YLP G H A+ +
Sbjct: 190 SKRWEVHGPDGPEEVL---------------------LEPGVSMYLPTGTPH-AARAQDT 227
Query: 194 HSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
SLHVT+ + Q T W L+E+ + AGA LP GYL
Sbjct: 228 VSLHVTLGINQLT-WRGLVERTV------AGALGEVADEHLPAGYL 266
>gi|298713801|emb|CBJ27173.1| mina protein [Ectocarpus siliculosus]
Length = 631
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 120/278 (43%), Gaps = 73/278 (26%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR---------- 52
WE++P+ I + N+Y+ L+ + D+ ++VIE ++L +G+
Sbjct: 72 WERKPMVIRGRSPNHYDRLV-EFGLGDV--ESVIE----KGVSLSGEGEEPMVLGSDWNI 124
Query: 53 ------QTMDVEGRALP------------------QIVWDHYREGCSIRFLNPQTYIKPL 88
Q D G +P + +R G S+ Q + +
Sbjct: 125 IKRVFVQADDGPGSNVPPAPGKTHAGGADRVVSLEEARKAFHRGGFSVVINRLQRRWRGV 184
Query: 89 QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPR 143
+ + +L+ + G V AN Y TPP +QGF H+D ++ ++QL G K W +Y +PR
Sbjct: 185 LRASRALEHVLGQPVNANLYMTPPQSQGFEAHFDWMDGIVVQLTGSKTWILYHEMVTMPR 244
Query: 144 MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQA---------------- 187
++ P+ + E+G PI + L PGD++Y+PRG+ H+A
Sbjct: 245 PDLKFKPKAA-------ELGEPIAVLDLYPGDMMYIPRGWPHEAAVNGTAARPGGGRGAS 297
Query: 188 --STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
S+ SLHVT V +TA E + AL+ A
Sbjct: 298 PSSSTARGPSLHVTFGV--ETALSGTYESLLHHALEVA 333
>gi|386353481|ref|YP_006051728.1| oxidoreductase domain-containing protein [Streptomyces cattleya
NRRL 8057 = DSM 46488]
gi|365811560|gb|AEW99775.1| oxidoreductase domain-containing protein [Streptomyces cattleya
NRRL 8057 = DSM 46488]
Length = 594
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 100/219 (45%), Gaps = 33/219 (15%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM--DVEGR 60
WE+E L + L T + D R + ++++ ++GQ + + D R
Sbjct: 26 WEREALLLHGAAVPLVGRLFT---LADFER---LACGGGAELSVVQNGQARPVVADRADR 79
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQ-------ELFGCFVGANTYQTPPT 113
+ V YR G ++ + P+ +L ++ L G V AN Y TPP
Sbjct: 80 VPGRHVLAAYRSGATLLLSCLPSRCPPVGELCRGIEFDLLRLGLLPGDPVSANAYLTPPR 139
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
AQGF H+DD ++QL G K+W+V+ P E +++ TLE
Sbjct: 140 AQGFGRHHDDHSVLVVQLHGSKRWEVF-------------GPGSQGPEA---VVSATLEA 183
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
GD+L +P G+ H A T T+E SLH+TI ++ +W D L
Sbjct: 184 GDVLSIPAGFPHVAHT-TDEASLHLTIGLH-TLSWADAL 220
>gi|357407530|ref|YP_004919453.1| hypothetical protein SCAT_p0161 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|337762479|emb|CCB71187.1| protein of unknown function [Streptomyces cattleya NRRL 8057 = DSM
46488]
Length = 596
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 100/219 (45%), Gaps = 33/219 (15%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM--DVEGR 60
WE+E L + L T + D R + ++++ ++GQ + + D R
Sbjct: 28 WEREALLLHGAAVPLVGRLFT---LADFER---LACGGGAELSVVQNGQARPVVADRADR 81
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQ-------ELFGCFVGANTYQTPPT 113
+ V YR G ++ + P+ +L ++ L G V AN Y TPP
Sbjct: 82 VPGRHVLAAYRSGATLLLSCLPSRCPPVGELCRGIEFDLLRLGLLPGDPVSANAYLTPPR 141
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
AQGF H+DD ++QL G K+W+V+ P E +++ TLE
Sbjct: 142 AQGFGRHHDDHSVLVVQLHGSKRWEVF-------------GPGSQGPEA---VVSATLEA 185
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
GD+L +P G+ H A T T+E SLH+TI ++ +W D L
Sbjct: 186 GDVLSIPAGFPHVAHT-TDEASLHLTIGLH-TLSWADAL 222
>gi|251832992|gb|ACT22499.1| cupin [Triticum aestivum]
Length = 462
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 79/149 (53%), Gaps = 23/149 (15%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFIL 129
R G SI + + + + ++L +LFG VGAN Y +PP +QG A HYDD +
Sbjct: 110 RNGFSIALRGMEFRSEKVAAIASALADLFGQPSVGANIYFSPPRSQGLARHYDDHCVLVW 169
Query: 130 QLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
QL G+KKWK++ LPR+ Y P +S + G L GD++Y+PRG+
Sbjct: 170 QLLGRKKWKIWPNTKSVLPRL---YEPFHSLDGLVDDSGGR---VEVLHEGDIMYVPRGH 223
Query: 184 IHQAST----------VTNEHSLHVTISV 202
+H+A T V+ +SLH+T+++
Sbjct: 224 VHEAHTDVDEGESEVNVSTNYSLHLTLAI 252
>gi|219117481|ref|XP_002179535.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409426|gb|EEC49358.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 481
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/223 (30%), Positives = 106/223 (47%), Gaps = 34/223 (15%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
+ +GCS+ + + L LQ F V ANTY TPP +Q H DD + F++
Sbjct: 134 FLDGCSVVTNHADRRSPWIAALCEDLQASFP-HVYANTYLTPPGSQTVPAHADDRDVFVI 192
Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT------------PILT-VTLEPGDL 176
QL G K WK+Y R + P +S E++G P+LT L PGD+
Sbjct: 193 QLVGCKAWKIY--RNIPVPYP------YSHEQVGKGELEVPGQVLDGPVLTDRVLAPGDV 244
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP- 235
LY+PRGY+H+A V S HVT+++ + W L + A +A+ +R+ +P
Sbjct: 245 LYMPRGYVHEAHAVDGGPSFHVTVALATQD-WT--LAGLVTAATEASLTQQRSYRQAVPR 301
Query: 236 -IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLD 277
G + +A VD D+ ++++ L D L E + ++
Sbjct: 302 CFGRRSFESIA----VD---DKQSLQKQLDDAFRILREKITVE 337
>gi|221486010|gb|EEE24280.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 462
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 82/151 (54%), Gaps = 11/151 (7%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
Y EGCS+ ++ LQ + L + + V A +Y TPP H DD + F+L
Sbjct: 190 YLEGCSLVINQADRTLEILQSICQHLSKKYFSHVFAVSYLTPPRTHAVKTHTDDQDVFLL 249
Query: 130 QLEGKKKWKVYLP----RMVDEYL-PRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
Q+ G K WK++ P + +E L R + P+ + G P+L L+ GD+LY+PRG+
Sbjct: 250 QVWGSKAWKIWTPPQILPLTEEMLGKREAFPD----DPGKPLLEFVLKEGDILYIPRGFP 305
Query: 185 HQASTVTNEHSLHVTISV-YQKTAWIDLLEK 214
H A T T E SLH+T++V + A++ L++
Sbjct: 306 HAAVT-TEEPSLHITLTVPTAEFAYVTCLQR 335
>gi|237834811|ref|XP_002366703.1| hypothetical protein TGME49_040840 [Toxoplasma gondii ME49]
gi|211964367|gb|EEA99562.1| hypothetical protein TGME49_040840 [Toxoplasma gondii ME49]
gi|221503506|gb|EEE29197.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 508
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 82/151 (54%), Gaps = 11/151 (7%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
Y EGCS+ ++ LQ + L + + V A +Y TPP H DD + F+L
Sbjct: 190 YLEGCSLVINQADRTLEILQSICQHLSKKYFSHVFAVSYLTPPRTHAVKTHTDDQDVFLL 249
Query: 130 QLEGKKKWKVYLP----RMVDEYL-PRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
Q+ G K WK++ P + +E L R + P+ + G P+L L+ GD+LY+PRG+
Sbjct: 250 QVWGSKAWKIWTPPQILPLTEEMLGKREAFPD----DPGKPLLEFVLKEGDILYIPRGFP 305
Query: 185 HQASTVTNEHSLHVTISV-YQKTAWIDLLEK 214
H A T T E SLH+T++V + A++ L++
Sbjct: 306 HAAVT-TEEPSLHITLTVPTAEFAYVTCLQR 335
>gi|147783372|emb|CAN72964.1| hypothetical protein VITISV_016492 [Vitis vinifera]
Length = 590
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 75/149 (50%), Gaps = 16/149 (10%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
D Y +G +I + + + + L LFG VG N Y TPP +QG A HYDD
Sbjct: 437 DAYNKGYTIALRGMEFRFESIAAIADGLASLFGQPSVGVNLYLTPPDSQGLARHYDDHCV 496
Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSP--NFSQEEIGTPILTVT---LEPGDLLYLPR 181
F+ QL G K+W + +V LPR P + +IG + T L GD+LY+PR
Sbjct: 497 FVCQLFGTKQWTIVSQPIVS--LPRLYEPLDSLHSSKIGNSMAGXTQFLLREGDILYIPR 554
Query: 182 GYIHQASTV--------TNEHSLHVTISV 202
G+ H+A TV T SLH+T+++
Sbjct: 555 GFPHEACTVAESXGPDETTGFSLHLTLAI 583
>gi|290954654|ref|YP_003485836.1| hypothetical protein SCAB_0241 [Streptomyces scabiei 87.22]
gi|260644180|emb|CBG67253.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length = 390
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 92/199 (46%), Gaps = 12/199 (6%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
+EG SI + P+ +L +++ V AN Y + +GF H+DD + I+Q
Sbjct: 94 KEGASIALDSVDELHPPIARLCEAIERELHTRVQANLYASWSATEGFGIHWDDHDTVIVQ 153
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
L+G K+W++Y P Y E P+ + L PGD+LY+PRG H S
Sbjct: 154 LDGAKRWRIY---GTTRPYPLYRDIEDPGEAPTEPVADLVLWPGDVLYVPRGVWHAVSAD 210
Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
SLHVT + TA DL+ + L T ++RR LP L A +G V
Sbjct: 211 QGTRSLHVTCGLQTHTA-TDLMTWVSEQLL-----THEDWRRDLP---LLAAPDVQGDAV 261
Query: 251 DIQADRLAMKENLKDLLAK 269
D RLA + +LLA+
Sbjct: 262 DGMRKRLAELLDDPELLAR 280
>gi|302527532|ref|ZP_07279874.1| predicted protein [Streptomyces sp. AA4]
gi|302436427|gb|EFL08243.1| predicted protein [Streptomyces sp. AA4]
Length = 385
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 97/208 (46%), Gaps = 14/208 (6%)
Query: 14 QKNYYNNLITSKKIDDILRDNVIEFKK-----NIDI----TLYEDGQRQTMDVEGRALPQ 64
+K + +L+ ++D +LR + +EF + + D+ T E + + R LP
Sbjct: 24 EKGRFASLLPWPELDRVLRQHRLEFPRLRLALDGDVVPAHTYTEMVSTRRNGLVPRVLPA 83
Query: 65 IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
+H R G ++ + + + +L L+ + N Y T GF H+DD
Sbjct: 84 PFAEHLRAGATLVLDSVNELVGEVGELATKLEHDLRERIQVNLYAGWGTTHGFDVHWDDH 143
Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
+AF++QL G+K+W+++ V P + G P LE GD+LYLPRG+
Sbjct: 144 DAFVIQLSGRKRWRLH---GVTRPSPLQRDVELPERPEGEPRDDFLLEDGDVLYLPRGHW 200
Query: 185 HQASTVTNEHSLHVTISVYQKTAWIDLL 212
H S V E SLH+TI T +DL+
Sbjct: 201 HDVSAV-GEESLHLTIGFNAATG-VDLV 226
>gi|251832983|gb|ACT22494.1| cupin [Triticum aestivum]
Length = 466
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 79/149 (53%), Gaps = 23/149 (15%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFIL 129
R G SI + + + + ++L +LFG VGAN Y +PP +QG A HYDD +
Sbjct: 110 RNGFSIALRGMEFRSEKVAAIASALADLFGQPSVGANIYFSPPRSQGLARHYDDHCVLVW 169
Query: 130 QLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
QL G+KKWK++ LPR+ Y P +S + G L GD++Y+PRG+
Sbjct: 170 QLLGRKKWKIWPNTKSILPRL---YEPFHSLDGLVDDRGGR---VEVLREGDIMYVPRGH 223
Query: 184 IHQASTVTNE----------HSLHVTISV 202
+H+A T +E +SLH+T+++
Sbjct: 224 VHEACTDIDEGESEVNASANYSLHLTLAI 252
>gi|297735434|emb|CBI17874.3| unnamed protein product [Vitis vinifera]
Length = 770
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 75/149 (50%), Gaps = 16/149 (10%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
D Y +G +I + + + + L LFG VG N Y TPP +QG A HYDD
Sbjct: 416 DAYNKGYTIALRGMEFRFESIAAIADGLASLFGQPSVGVNLYLTPPDSQGLARHYDDHCV 475
Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSP--NFSQEEIGTPILTVT---LEPGDLLYLPR 181
F+ QL G K+W + +V LPR P + +IG + T L GD+LY+PR
Sbjct: 476 FVCQLFGTKQWTIVSQPIVS--LPRLYEPLDSLHSSKIGNSMAGRTQFLLREGDILYIPR 533
Query: 182 GYIHQASTV--------TNEHSLHVTISV 202
G+ H+A TV T SLH+T+++
Sbjct: 534 GFPHEACTVAESGGPDETTGFSLHLTLAI 562
>gi|170575231|ref|XP_001893154.1| chromosome 14 open reading frame 169 [Brugia malayi]
gi|158600986|gb|EDP38013.1| chromosome 14 open reading frame 169, putative [Brugia malayi]
Length = 248
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 87/167 (52%), Gaps = 7/167 (4%)
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
+LY+PRGYIHQ + HSL +T+SV + + DLLE+ +P AL ++ R+ LP
Sbjct: 1 MLYIPRGYIHQGFADKDVHSLQLTVSVCRNVTYADLLERVIPPALSNFAEQNVNIRKSLP 60
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV-DLDD-GVDEMGKQLMHDALP 293
YL G+ ++ + + L + + +Y+ +L + VD M ++ M ALP
Sbjct: 61 ARYLDMTGVLECDYPLLKTGTIKLHRFLDSIFSNFCKYIKELSEPDVDMMAREFMRTALP 120
Query: 294 PVLSPEELQCSVFENGLRMSQTGEVYNATQI-TKDTKVRLVRANAVR 339
PVL+ EE + L ++ + + I TK+T ++L+R + R
Sbjct: 121 PVLTKEEKDMT----ALCVAGSSLYGDKEHIFTKNTPIKLLRRHGQR 163
>gi|91199785|emb|CAI78141.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
23877]
gi|126347489|emb|CAJ89199.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
23877]
Length = 311
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 93/198 (46%), Gaps = 20/198 (10%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
+EG S+ + P+ +L +++ V AN Y + +GF H+DD + I+Q
Sbjct: 15 KEGASLALDSVDELHPPIARLCEAIERELHTRVQANLYASWSATEGFGIHWDDHDTVIVQ 74
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
L+G K+W++Y P Y E P+ + L PGD+LY+PRG H S
Sbjct: 75 LDGAKRWRIY---GTTRPYPLYRDIEDPGEAPTEPVADLVLWPGDVLYVPRGVWHAVSAD 131
Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
SLHVT + TA DL+ + L T ++RR LP+ LA
Sbjct: 132 QGTRSLHVTCGLQTHTA-TDLMAWVSEQLL-----THEDWRRDLPL-------LAAS--- 175
Query: 251 DIQADRL-AMKENLKDLL 267
D+QAD + M++ L +LL
Sbjct: 176 DVQADAVDGMRKRLAELL 193
>gi|427707810|ref|YP_007050187.1| transcription factor jumonji jmjC domain-containing protein [Nostoc
sp. PCC 7107]
gi|427360315|gb|AFY43037.1| transcription factor jumonji jmjC domain-containing protein [Nostoc
sp. PCC 7107]
Length = 386
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 103/221 (46%), Gaps = 22/221 (9%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQTMDV 57
W K+ ++I + N+ T K ++D+L ++ + F N +L E RQ
Sbjct: 21 WTKKAIHIPAEHDQKLQNIFTWKSLNDLLNYHKLKEPDLRFSMN-GKSLSETRNRQEWS- 78
Query: 58 EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
D R+G ++ + + +L A+L+ G N Y +P QGF
Sbjct: 79 ----------DRLRQGATLIINGVHHRVATVAELAANLRHDIGYETHVNLYCSPAKQQGF 128
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
HYD + ILQ++G+K+W VY + +Y + + Q+ P L L+ GDLL
Sbjct: 129 DCHYDTHDVLILQIDGEKQWFVYQETV--QYPTAHIPSSKQQQPQEPPYLECVLKAGDLL 186
Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAM 216
Y+PRG+ H A + SLH+TI + +T W++ L K +
Sbjct: 187 YIPRGHWHYA-VACEQPSLHLTIGIECQTGLDWLNWLMKDL 226
>gi|440715594|ref|ZP_20896130.1| Cupin 4 family protein [Rhodopirellula baltica SWK14]
gi|436439428|gb|ELP32878.1| Cupin 4 family protein [Rhodopirellula baltica SWK14]
Length = 277
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 73/137 (53%), Gaps = 4/137 (2%)
Query: 73 GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
GCS+ + ++ + +++ G AN Y TP +Q F H+D + F+ Q E
Sbjct: 99 GCSLLIQAMNRRLTCVRHICNQIEKASGLLCQANGYLTPRGSQTFNLHFDTHDVFVWQQE 158
Query: 133 GKKKWKVYLPRMVD-EYLPRYSSPNFSQEEI--GTPILTVTLEPGDLLYLPRGYIHQAST 189
G K WKV+ + + Y R+ +E++ PI+ TL PGD+LY+PRGY+H
Sbjct: 159 GSKAWKVFGSEVDNPSYHHRFQVDEKRREQLVEEGPIIDTTLTPGDMLYIPRGYLH-CGQ 217
Query: 190 VTNEHSLHVTISVYQKT 206
T + S H+T+ V++K+
Sbjct: 218 ATADDSFHLTVGVFRKS 234
>gi|329940499|ref|ZP_08289780.1| hypothetical protein SGM_5272 [Streptomyces griseoaurantiacus M045]
gi|329300560|gb|EGG44457.1| hypothetical protein SGM_5272 [Streptomyces griseoaurantiacus M045]
Length = 395
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 102/206 (49%), Gaps = 24/206 (11%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNA-SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
R+G S+ L+ I P A L+ G V AN Y + +GF H+DD + ++
Sbjct: 96 RDGASL-VLDAVEKIHPAVGAAAEGLERFLGTSVQANVYASWTEREGFGRHWDDHDVVVV 154
Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
QL G K+W+++ M E P + +E G P+ + L PGD+LYLPRG+ H +
Sbjct: 155 QLHGSKRWRLW--GMTREA-PTFRDVESPEEPEGDPVADLVLSPGDVLYLPRGWWHSVTA 211
Query: 190 VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKP 249
SLH+T + T DL+ + + L+A+ L RR +P R+ LA
Sbjct: 212 DQGTESLHLTFGMVPHTG-ADLMLWVVDQ-LRAS----LALRRDIP----RFGSLAE--- 258
Query: 250 VDIQADRL-AMKENLKDLLA--KLVE 272
Q+D + A++ + D++A +LVE
Sbjct: 259 ---QSDFIDAVRREVADMMADPRLVE 281
>gi|284028215|ref|YP_003378146.1| Cupin 4 family protein [Kribbella flavida DSM 17836]
gi|283807508|gb|ADB29347.1| Cupin 4 family protein [Kribbella flavida DSM 17836]
Length = 425
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 11/156 (7%)
Query: 87 PLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD 146
PL L G V N Y TP +QGF+ HYD + F+LQ+ G+K W V+ P VD
Sbjct: 136 PLVDFATQLSSEAGHPVQINAYITPAESQGFSAHYDVHDVFVLQVAGEKHWTVHEPVHVD 195
Query: 147 EYL--PRYSSPNFSQEEIGT--PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
P E T P++ L PGD LY+PRG++H A + S H+T+ +
Sbjct: 196 PLRNQPWTDHSKAVAEAASTEQPVVDAVLRPGDALYVPRGFLHSAKALGGV-SAHLTVGL 254
Query: 203 YQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGY 238
+ T ++ + E L A A + R LP+G+
Sbjct: 255 HTMTRYLLVQE------LAALAANVPDLRSALPLGF 284
>gi|387876598|ref|YP_006306902.1| hypothetical protein W7S_16070 [Mycobacterium sp. MOTT36Y]
gi|443306372|ref|ZP_21036160.1| hypothetical protein W7U_11950 [Mycobacterium sp. H4Y]
gi|386790056|gb|AFJ36175.1| hypothetical protein W7S_16070 [Mycobacterium sp. MOTT36Y]
gi|442767936|gb|ELR85930.1| hypothetical protein W7U_11950 [Mycobacterium sp. H4Y]
Length = 390
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 95/212 (44%), Gaps = 20/212 (9%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
W +I R + Y++ L+ + D++L E + L + GQ + R
Sbjct: 24 WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78
Query: 63 PQIVWDHYREGC----SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+ D REG +I + Y++ + L+ SL+ N Y TPP A GFA
Sbjct: 79 GSLDPDRAREGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFA 138
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT----PILTVTLEPG 174
PHYD + +LQ+ G K W V P S++ +GT V L G
Sbjct: 139 PHYDPHDVLVLQIRGSKTWH------VSTGAPVPPHEIESRKGVGTDGPAAATDVCLRAG 192
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
D+LYLPRG +H A T +E S+H+TI ++ T
Sbjct: 193 DVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 223
>gi|406031451|ref|YP_006730342.1| JmjC domain-containing protein [Mycobacterium indicus pranii MTCC
9506]
gi|405129998|gb|AFS15253.1| JmjC domain-containing protein [Mycobacterium indicus pranii MTCC
9506]
Length = 391
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 92/208 (44%), Gaps = 11/208 (5%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
W +I R + Y++ L+ + D++L E + L + GQ + R
Sbjct: 24 WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78
Query: 63 PQIVWDHYREGC----SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+ D REG +I + Y++ + L+ SL+ N Y TPP A GFA
Sbjct: 79 GSLDPDRAREGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFA 138
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PHYD + +LQ+ G K W V V + S + V L GD+LY
Sbjct: 139 PHYDPHDVLVLQIRGSKTWHVSTGAPVPPH-EIESRKGVGTDGPAAAATDVCLRAGDVLY 197
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKT 206
LPRG +H A T +E S+H+TI ++ T
Sbjct: 198 LPRGQVHSAET-RSEPSVHLTIGLHAPT 224
>gi|379747918|ref|YP_005338739.1| hypothetical protein OCU_31990 [Mycobacterium intracellulare ATCC
13950]
gi|379755222|ref|YP_005343894.1| hypothetical protein OCO_32100 [Mycobacterium intracellulare
MOTT-02]
gi|378800282|gb|AFC44418.1| hypothetical protein OCU_31990 [Mycobacterium intracellulare ATCC
13950]
gi|378805438|gb|AFC49573.1| hypothetical protein OCO_32100 [Mycobacterium intracellulare
MOTT-02]
Length = 390
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 95/212 (44%), Gaps = 20/212 (9%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
W +I R + Y++ L+ + D++L E + L + GQ + R
Sbjct: 24 WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78
Query: 63 PQIVWDHYREGC----SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+ D REG +I + Y++ + L+ SL+ N Y TPP A GFA
Sbjct: 79 GSLDPDRAREGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFA 138
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT----PILTVTLEPG 174
PHYD + +LQ+ G K W V P S++ +GT V L G
Sbjct: 139 PHYDPHDVLVLQIRGSKTWH------VSTGAPVPPHEIESRKGVGTDGPAAATDVCLRAG 192
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
D+LYLPRG +H A T +E S+H+TI ++ T
Sbjct: 193 DVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 223
>gi|333989158|ref|YP_004521772.1| hypothetical protein JDM601_0518 [Mycobacterium sp. JDM601]
gi|333485126|gb|AEF34518.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length = 409
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 102/213 (47%), Gaps = 26/213 (12%)
Query: 3 WEKEPLYISRKQKNYYNNL---ITSKKIDDILRD--NVIEFKKNIDITLYE-----DGQR 52
W +I R + +Y++ L T+ ++ + LR + + + D E DG
Sbjct: 36 WGAALHHIGRDRADYFDGLSGLATAGQLLESLRTEPSAVRLVRGADHLAPEQYTLADG-- 93
Query: 53 QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
T+D+ GR + G +I N + Y++P+ L ++ N Y TPP
Sbjct: 94 -TLDL-GR-----IHAELENGYTIICNNIEKYLRPMSTLTHGVEVELNFPTHVNAYVTPP 146
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ-EEIGTPILT-VT 170
+A GF PHYD + +LQ++G K W Y DE +P + + + G P T +
Sbjct: 147 SAVGFLPHYDHHDVLVLQIQGHKTWYFY----GDEPIPPHLMQQLHEVDPAGLPEPTSLH 202
Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
L GD LY+PRG +H A T T+E SLH+T+ ++
Sbjct: 203 LSAGDTLYVPRGRVHSAET-TSESSLHLTVGIH 234
>gi|170701700|ref|ZP_02892640.1| Cupin 4 family protein [Burkholderia ambifaria IOP40-10]
gi|170133393|gb|EDT01781.1| Cupin 4 family protein [Burkholderia ambifaria IOP40-10]
Length = 304
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 69/147 (46%), Gaps = 15/147 (10%)
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
+P ++ +GC+I Q Y + L A ++ + C AN Y + +A F H+
Sbjct: 86 MPDFLYRALDDGCTIIIDGCQDYFPSVLSLTAEIEHILKCQSWANLYISTQSATSFGCHF 145
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI------GTPILTVTLEPGD 175
DD + +QL GKK+W + Y P Y SPN + G+P L L G
Sbjct: 146 DDHDIISVQLSGKKRWHI--------YKPTYISPNRGDKSFYLDPPTGSPDLLENLPTGS 197
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISV 202
LYLP GY H TV+ HS+H+T +
Sbjct: 198 SLYLPSGYWHNVETVS-PHSMHITFGL 223
>gi|359485116|ref|XP_003633217.1| PREDICTED: uncharacterized protein LOC100852762 [Vitis vinifera]
Length = 564
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 75/149 (50%), Gaps = 16/149 (10%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
D Y +G +I + + + + L LFG VG N Y TPP +QG A HYDD
Sbjct: 210 DAYNKGYTIALRGMEFRFESIAAIADGLASLFGQPSVGVNLYLTPPDSQGLARHYDDHCV 269
Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSP--NFSQEEIGTPILTVT---LEPGDLLYLPR 181
F+ QL G K+W + +V LPR P + +IG + T L GD+LY+PR
Sbjct: 270 FVCQLFGTKQWTIVSQPIVS--LPRLYEPLDSLHSSKIGNSMAGRTQFLLREGDILYIPR 327
Query: 182 GYIHQASTV--------TNEHSLHVTISV 202
G+ H+A TV T SLH+T+++
Sbjct: 328 GFPHEACTVAESGGPDETTGFSLHLTLAI 356
>gi|32455630|ref|NP_862112.1| hypothetical protein [Streptomyces violaceoruber]
gi|28797256|gb|AAO50103.1| hypothetical protein [Streptomyces violaceoruber]
Length = 390
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 116/263 (44%), Gaps = 39/263 (14%)
Query: 20 NLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALP-----QIVWD------ 68
L+T ++ IL + +E + + L DG+ T+ V G P VW
Sbjct: 34 GLMTFDDLNQILATHRLEPPR---MRLSRDGE--TLLVGGYTTPVATRRHTVWHRLHPAE 88
Query: 69 -HYR--EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
H R EG S+ + P+ +L +++ V AN Y + +GF H+DD +
Sbjct: 89 LHTRLTEGASLALDSVDELHPPIARLCEAIERELHTRVQANLYASWSATEGFGVHWDDHD 148
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
I+QL+G K+W++Y P Y E P+ + L PGD+LY+PRG H
Sbjct: 149 TVIVQLDGAKRWRIY---GTTRPFPLYRDIADPGEAPTEPVADLVLWPGDVLYVPRGVWH 205
Query: 186 QASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
S SLHVT + TA DL+ + L T ++RR LP+
Sbjct: 206 AVSADQGVRSLHVTCGLQTHTA-TDLMAWVSEQLL-----THEDWRRDLPL--------- 250
Query: 246 RGKPVDIQADRL-AMKENLKDLL 267
P D+QAD + M++ L +LL
Sbjct: 251 LAAP-DVQADAVDGMRKRLAELL 272
>gi|256824138|ref|YP_003148098.1| Cupin superfamily protein [Kytococcus sedentarius DSM 20547]
gi|256687531|gb|ACV05333.1| Cupin superfamily protein [Kytococcus sedentarius DSM 20547]
Length = 414
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/253 (27%), Positives = 111/253 (43%), Gaps = 25/253 (9%)
Query: 3 WEKEPLYI--SRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQTM 55
W P ++ + + + + +L + +DD+L R I ++ TL E+ +
Sbjct: 34 WGTTPRHVPATDRAGDTFTDLFSLDAVDDLLTHRGLRTPFIRMAQD-GTTLPENRFTRGG 92
Query: 56 DVEGRALPQIVWDHYRE----GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTP 111
A Q+ D R G +I P+ + L + G V N Y TP
Sbjct: 93 GTGAGASDQVDEDRVRSLFAGGATIVLQGLHRTWPPIAEFARELGDELGHPVQVNAYITP 152
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP-NFSQEEIG------T 164
P QGF+ HYD + F+LQ+ G K W ++ P + P P + +E +
Sbjct: 153 PQNQGFSAHYDVHDVFVLQVHGTKHWTLHEPVVAH---PLRDQPWDTVREAVAHRAAQDA 209
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAG 224
P++ L PGD+LYLPRG IH A+ E S H+TI V+ T D + A+ A+++
Sbjct: 210 PLIDAVLAPGDVLYLPRGTIH-AAAAQGEISAHLTIGVHTWTP--DHVTGAVLDAVRSRL 266
Query: 225 ATDLEFRRGLPIG 237
R LP+G
Sbjct: 267 RDQPTVRANLPLG 279
>gi|359401171|ref|ZP_09194142.1| hypothetical protein NSU_3828 [Novosphingobium pentaromativorans
US6-1]
gi|357597420|gb|EHJ59167.1| hypothetical protein NSU_3828 [Novosphingobium pentaromativorans
US6-1]
Length = 320
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 84/163 (51%), Gaps = 7/163 (4%)
Query: 59 GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
GR+ ++ D R+G +R + Q + ++ L++LF N Y P GF
Sbjct: 92 GRSQLAVLIDQLRQGAMVRVRDLQNASQSIEHTVRQLEQLFLAKCQVNLYLAPMGGAGFP 151
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI---LTVTLEPGD 175
H+D + F++Q G K W ++ LP +P + E P+ ++ ++ GD
Sbjct: 152 AHFDITDGFVVQCGGTKDWTIFEDYTDQTQLPSADTP--WEPERYKPLGSGKSLVMQLGD 209
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPK 218
+LYLPRG +H A+ T+EHSLH+TIS+ + + D+L+ + +
Sbjct: 210 VLYLPRGVMH-AARCTDEHSLHLTISL-ESLTYSDVLQSEVRR 250
>gi|365860786|ref|ZP_09400580.1| cupin 4 family protein [Streptomyces sp. W007]
gi|364009777|gb|EHM30723.1| cupin 4 family protein [Streptomyces sp. W007]
Length = 275
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 101/220 (45%), Gaps = 18/220 (8%)
Query: 13 KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD---VEGRALPQIVWDH 69
++ + +IT + ID L + +I + DG+ D GR P +
Sbjct: 4 REAGELDRVITPESIDHYLETGCVPAD---EIAVVSDGRALHPDRHRTAGRTDPAKLRSL 60
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
Y G +IR N Q + L ++ +Q G + + TPP QG H+D A IL
Sbjct: 61 YEGGHTIRLGNLQRVVPFLADVSRGIQRETGFSNYLHAFVTPPGRQGLRHHWDQQMAVIL 120
Query: 130 QLEGKKKWKVYLP------RMVDEYLPRYSSPNF-SQEEIGTPILTVTLEPGDLLYLPRG 182
Q+ G K+W+++ P R E R P+F Q E P L V L PG L LPRG
Sbjct: 121 QISGIKRWQLWRPMFPSPMRAYQESF-RVWDPDFIPQWEAAGPDLEVDLRPGQSLLLPRG 179
Query: 183 YIHQASTV-TNEHSLHVTISVYQKTA-WI--DLLEKAMPK 218
++H + S+H+T ++ ++T W+ L+++A+ K
Sbjct: 180 WVHNPHALDAQARSIHLTFAIRERTPLWVAEKLVDRAIEK 219
>gi|254775804|ref|ZP_05217320.1| hypothetical protein MaviaA2_14200 [Mycobacterium avium subsp.
avium ATCC 25291]
Length = 393
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 126/306 (41%), Gaps = 38/306 (12%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---------IEFKKNIDITLYEDGQRQ 53
W +I R + Y++ L+ D L + V ++ ++ D Y G
Sbjct: 24 WATRHHHIDRCRPGYFDGLLPGPSAVDGLLEQVRPDPAAVRLVKDGEDRDPAGYRRGD-G 82
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
T++ G D +G ++ + Y++ + L+ +++ N Y TPP
Sbjct: 83 TLNAGG------ARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFLTRVNAYVTPPH 136
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT-VTLE 172
+ GF PHYD + +LQ+EG K W+V V P+ G T V L
Sbjct: 137 STGFVPHYDPHDVLVLQIEGCKTWRVSDGPAVP---PQQIQSRKGVGADGPASWTDVCLR 193
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
PGD+LYLPRG +H A T +E S+H+T+ ++ T + AL A D
Sbjct: 194 PGDVLYLPRGQVHSART-HSEPSVHLTVGLHAPTVL-----TLVTSALHALSLRDPRVHD 247
Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDA- 291
LP +L D R + E ++D + L + + DG+ M + L+
Sbjct: 248 RLPPRHL-----------DDARVRAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVRRGR 296
Query: 292 LPPVLS 297
PPV S
Sbjct: 297 CPPVGS 302
>gi|413919791|gb|AFW59723.1| hypothetical protein ZEAMMB73_667794 [Zea mays]
Length = 589
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 94/189 (49%), Gaps = 25/189 (13%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFI 128
++ G SI + + + + ++L +LFG VGAN Y +P +QG A HYDD +
Sbjct: 228 FKNGYSIALRGMEFRSEKVAAIASALADLFGQPSVGANIYFSPAGSQGLARHYDDHCVLV 287
Query: 129 LQLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
QL G KKW V+ LPR+ + + P + + S + L GD++Y+PRG
Sbjct: 288 WQLLGSKKWTVWPNPKPLLPRLYEPFDPLDGNLDESSGRVE------VLHEGDMMYIPRG 341
Query: 183 YIHQAST-----VTNEHSLHVTISV-------YQKTAWIDLLEKAMPKALQAAGATDLEF 230
Y+H+A T N+ SLH+T+++ ++ A I L A + L+ + + + E
Sbjct: 342 YVHEARTDVSGSEVNDCSLHLTLAIEVEAPFEWEGFAHIALHCWAEKQQLRGSQSVEFEG 401
Query: 231 RRGLPIGYL 239
+ G + +
Sbjct: 402 KAGASLSAI 410
>gi|118464331|ref|YP_882543.1| cupin superfamily protein [Mycobacterium avium 104]
gi|118165618|gb|ABK66515.1| cupin superfamily protein [Mycobacterium avium 104]
Length = 361
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 129/309 (41%), Gaps = 44/309 (14%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---------IEFKKNIDITLYEDGQRQ 53
W +I R + Y++ L+ D L + V ++ ++ D Y G
Sbjct: 24 WATRHHHIDRCRPGYFDGLLPGPSAVDGLLEQVRPDPAAVRLVKDGEDRDPAGYRRGD-G 82
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
T++ G D +G ++ + Y++ + L+ +++ N Y TPP
Sbjct: 83 TLNAGG------ARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFPTRVNAYVTPPH 136
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG----TPILTV 169
+ GF PHYD + +LQ+EG K W+ V + P S++ +G V
Sbjct: 137 STGFVPHYDPHDVLVLQIEGCKTWR------VSDEPPVPPQQIQSRKGVGADGPASRTDV 190
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
L PGD+LYLPRG +H A T +E S+H+T+ ++ T + AL A D
Sbjct: 191 CLRPGDVLYLPRGQVHSART-HSEPSVHLTVGLHAPTVLT-----LVTSALHALSLRDPR 244
Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
LP +L D R + E ++D + L + + DG+ M + L+
Sbjct: 245 VHDRLPPRHL-----------DDAQVRAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVR 293
Query: 290 DA-LPPVLS 297
+ PPV S
Sbjct: 294 RSRCPPVGS 302
>gi|256379276|ref|YP_003102936.1| cupin [Actinosynnema mirum DSM 43827]
gi|255923579|gb|ACU39090.1| Cupin 4 family protein [Actinosynnema mirum DSM 43827]
Length = 395
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 100/217 (46%), Gaps = 32/217 (14%)
Query: 14 QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ-------------RQTMDVEGR 60
++ + +L+ +++ +LR + +EF + + L DG+ R+ V R
Sbjct: 33 ERGRFADLLPWSEVNRVLRQHRLEFPR---LRLARDGEVVPAHVYSELVDTRRAGQVP-R 88
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
LP + R G ++ + Q + + L L+ V N Y GF H
Sbjct: 89 VLPGKFAEQMRGGATLVLDSVQELVGAVGDLAVGLEHELRERVQVNAYAGWGVTHGFDVH 148
Query: 121 YDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
+DD +A ++Q+ G+K+W+++ P + D LP P G P+ LE G+
Sbjct: 149 WDDHDAIVVQVSGRKRWRIHGFTRVAPMVRDVELP--PRPE------GEPLDEFVLEAGE 200
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
+LYLPRG H S V E SLH+TI V + T +DL+
Sbjct: 201 VLYLPRGCWHDVSAV-GEESLHLTIGVNRATG-VDLV 235
>gi|379762757|ref|YP_005349154.1| hypothetical protein OCQ_33210 [Mycobacterium intracellulare
MOTT-64]
gi|378810699|gb|AFC54833.1| hypothetical protein OCQ_33210 [Mycobacterium intracellulare
MOTT-64]
Length = 390
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 96/212 (45%), Gaps = 20/212 (9%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
W +I R + Y++ L+ + D++L E + L + GQ + R
Sbjct: 24 WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78
Query: 63 PQIVWDHYREGC----SIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+ D REG +I + Y++ + L+ SL+ N Y TPP A GF+
Sbjct: 79 GSLDPDRAREGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFS 138
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT----PILTVTLEPG 174
PHYD + +LQ+ G K W V V + S++ +GT V L G
Sbjct: 139 PHYDPHDVLVLQIRGSKTWHVSTGAPVPPHEIE------SRKGVGTDGPAAATDVCLRAG 192
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
D+LYLPRG +H A T +E S+H+TI ++ T
Sbjct: 193 DVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 223
>gi|323449141|gb|EGB05031.1| hypothetical protein AURANDRAFT_66651 [Aureococcus anophagefferens]
Length = 450
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 75 SIRFLNPQTYIKPLQQLNASLQELF--GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
S+ L + + ++Q + GC V N Y TPP AQ HYDD F++QL
Sbjct: 174 SVVVLGADARFRCAAEARRAVQRVLRVGCSV--NLYATPPGAQALDAHYDDHCVFVVQLR 231
Query: 133 GKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV-- 190
G+K+W++Y P + L + P + VTL PGD+LY+PRG H A+
Sbjct: 232 GRKRWRLYGPALECPTLHEATLPPPPELLRRGADHAVTLAPGDVLYVPRGVYHAATACED 291
Query: 191 TNEHSLHVTISV 202
S HVT+ V
Sbjct: 292 AGSDSAHVTVGV 303
>gi|429197490|ref|ZP_19189384.1| cupin domain protein, PF06172 family [Streptomyces ipomoeae 91-03]
gi|428666857|gb|EKX65986.1| cupin domain protein, PF06172 family [Streptomyces ipomoeae 91-03]
Length = 389
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 69/127 (54%), Gaps = 5/127 (3%)
Query: 86 KPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV 145
+P ++L +L+ +F V N Y + +GF H+DD + ++QLEG K+W++Y P +
Sbjct: 110 QPAERLAQALEGVFRGRVQINVYASWTATEGFGTHWDDHDVIVVQLEGAKRWRLYGPTRL 169
Query: 146 DEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
+ +P +E PI V L GD+LYLPRG+ H A T SLH+T + Q
Sbjct: 170 NPLHRDTEAPEPPADE---PIEEVVLRAGDMLYLPRGWWH-AVAATEGRSLHLTCGL-QT 224
Query: 206 TAWIDLL 212
T DLL
Sbjct: 225 TTGADLL 231
>gi|254822014|ref|ZP_05227015.1| hypothetical protein MintA_18922 [Mycobacterium intracellulare ATCC
13950]
Length = 390
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 96/212 (45%), Gaps = 20/212 (9%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
W +I R + Y++ L+ + D++L E + L + GQ + R
Sbjct: 24 WAIRHHHIQRGEPGYFDGLLPAA--DELLEQMRAEPSA---VRLVKGGQDRDPASYRRGD 78
Query: 63 PQIVWDHYR----EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+ D R +G +I + Y++ + L+ SL+ N Y TPP A GFA
Sbjct: 79 GSLDPDRARGGLADGYTIVLNGLERYLRTVATLSHSLEVELNFPTRVNAYVTPPDATGFA 138
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT----PILTVTLEPG 174
PHYD + +LQ+ G K W V V + S++ +GT V L G
Sbjct: 139 PHYDPHDVLVLQIRGSKTWHVSTGAPVPPHEIE------SRKGVGTDGPAAATDVCLRAG 192
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
D+LYLPRG +H A T +E S+H+TI ++ T
Sbjct: 193 DVLYLPRGQVHSAET-RSEPSVHLTIGLHAPT 223
>gi|41407243|ref|NP_960079.1| hypothetical protein MAP1145c [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|440776746|ref|ZP_20955580.1| hypothetical protein D522_07778 [Mycobacterium avium subsp.
paratuberculosis S5]
gi|41395594|gb|AAS03462.1| hypothetical protein MAP_1145c [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|436723035|gb|ELP46903.1| hypothetical protein D522_07778 [Mycobacterium avium subsp.
paratuberculosis S5]
Length = 393
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 129/309 (41%), Gaps = 44/309 (14%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---------IEFKKNIDITLYEDGQRQ 53
W +I R + Y++ L+ D L + V ++ ++ D Y G
Sbjct: 24 WATRHHHIDRCRPGYFDGLLPGPSAVDGLLEQVRPDPAAVRLVKDGEDRDPAGYRRGD-G 82
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
T++ G D +G ++ + Y++ + L+ +++ N Y TPP
Sbjct: 83 TLNAGG------ARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFPTRVNAYVTPPH 136
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG----TPILTV 169
+ GF PHYD + +LQ+EG K W+ V + P S++ +G V
Sbjct: 137 STGFVPHYDPHDVLVLQIEGCKTWR------VSDEPPVPPQQIQSRKGVGADGPASWTDV 190
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
L PGD+LYLPRG +H A T +E S+H+T+ ++ T + AL A D
Sbjct: 191 CLRPGDVLYLPRGQVHSART-HSEPSVHLTVGLHAPTVL-----TLVTSALHALSLRDPR 244
Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
LP +L A + R + E ++D + L + + DG+ M + L+
Sbjct: 245 VHDRLPPRHLDDAQV-----------RAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVR 293
Query: 290 DA-LPPVLS 297
PPV S
Sbjct: 294 RGRCPPVGS 302
>gi|302533161|ref|ZP_07285503.1| predicted protein [Streptomyces sp. C]
gi|302442056|gb|EFL13872.1| predicted protein [Streptomyces sp. C]
Length = 298
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 95/195 (48%), Gaps = 15/195 (7%)
Query: 59 GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
GR P + Y G ++R N Q + + ++ +Q+ G + + TP QG
Sbjct: 73 GRTDPAKLRKLYESGHTVRLGNLQRVMPLMADISHGIQQETGYSNYIHAFVTPSGEQGLR 132
Query: 119 PHYDDIEAFILQLEGKKKWKVYLP------RMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
H+D A I+QLEG K+W+++ P R +E + E P L + LE
Sbjct: 133 HHWDQQMAVIVQLEGVKRWQLWKPPVEAPMREFNESWRVWKQEYIPTWEAAGPDLEIHLE 192
Query: 173 PGDLLYLPRGYIHQASTV-TNEHSLHVTISVYQKTA-WIDLLEKAMPKALQAAGATDLEF 230
G + LPRG++H + + TNE S+H+T ++ ++T WI E+ + A++ D EF
Sbjct: 193 AGQSMLLPRGWVHNPAVLDTNERSVHLTFAIRERTPFWI--AERLVQDAIK-----DPEF 245
Query: 231 RRGLPIGYLRYAGLA 245
RR + L+ GLA
Sbjct: 246 RRVILPEQLKDEGLA 260
>gi|302535269|ref|ZP_07287611.1| cupin 4 [Streptomyces sp. C]
gi|302444164|gb|EFL15980.1| cupin 4 [Streptomyces sp. C]
Length = 403
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 83/165 (50%), Gaps = 17/165 (10%)
Query: 59 GRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
G+A+P + +H R+G ++ + ++ +LQ V AN Y T A
Sbjct: 82 GQAIPMADMRRIGNHLRDGATLVLDQSNVFDPTMEVACRALQWWSREHVQANVYLTTNDA 141
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP---NFSQEEIGTPILTVTL 171
GF H+DD + I+QL G+K+W+V R +P Y N EEI + T T+
Sbjct: 142 SGFDLHWDDHDVVIVQLAGEKQWEV---RGASRPVPMYRDADRNNTPSEEI---VWTGTM 195
Query: 172 EPGDLLYLPRGYIHQASTVTN--EHSLHVTISVYQKTA--WIDLL 212
+ GD++++PRGY H A+ V N HS HVT ++TA W+ L
Sbjct: 196 KAGDVMHIPRGYWHAATRVGNGDGHSTHVTFGFAKRTAVSWLTWL 240
>gi|417746840|ref|ZP_12395324.1| Cupin superfamily protein [Mycobacterium avium subsp.
paratuberculosis S397]
gi|336461609|gb|EGO40474.1| Cupin superfamily protein [Mycobacterium avium subsp.
paratuberculosis S397]
Length = 393
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 130/309 (42%), Gaps = 44/309 (14%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---------IEFKKNIDITLYEDGQRQ 53
W +I R + Y++ L+ D L + V ++ ++ D Y G
Sbjct: 24 WATRHHHIDRCRPGYFDGLLPEPSAVDGLLEQVRPDPAAVRLVKDGEDRDPAGYRRGD-G 82
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
T++ G D +G ++ + Y++ + L+ +++ N Y TPP
Sbjct: 83 TLNAGG------ARDGLADGYTLVLNGLERYLRTVASLSHAIEVELNFPTRVNAYVTPPH 136
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG----TPILTV 169
+ GF PHYD + +LQ+EG K W+V DE P S++ +G V
Sbjct: 137 STGFVPHYDPHDVLVLQIEGCKTWRVS-----DEP-PVPPQQIQSRKGVGADGPASWTDV 190
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
L PGD+LYLPRG +H A T +E S+H+T+ ++ T + AL A D
Sbjct: 191 CLRPGDVLYLPRGQVHSART-HSEPSVHLTVGLHAPTVL-----TLVTSALHALSLRDPR 244
Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
LP +L A + R + E ++D + L + + DG+ M + L+
Sbjct: 245 VHDRLPPRHLDDAQV-----------RAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVR 293
Query: 290 DA-LPPVLS 297
PPV S
Sbjct: 294 RGRCPPVGS 302
>gi|313237822|emb|CBY12955.1| unnamed protein product [Oikopleura dioica]
Length = 520
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 74/260 (28%), Positives = 116/260 (44%), Gaps = 42/260 (16%)
Query: 76 IRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKK 135
+ F PQ + L ++ A L+E F VGA P ++ P ++ ++FI QL GK
Sbjct: 146 LDFNQPQRFCDDLWRVIADLEETFQTLVGAKARIQPAKSKALPPVFNSTDSFIQQLNGKC 205
Query: 136 KWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHS 195
K++ +D P + F ++E+ V L PGD+LY+PRG + ++ + NE+
Sbjct: 206 SVKIWKASDID-LAPEETIEEFKEDEMRKADEIVELAPGDMLYIPRGAVF-STALENENE 263
Query: 196 ----LHVTISVYQKTA-WIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
LH+T++ A W+++ P L + +LR A +
Sbjct: 264 SSSYLHLTMNSQNTVAQWLNV---NFPTFLD---------KVAFQAKWLREAVSTK---- 307
Query: 251 DIQADRLAMKENLKDLLAKLVEYVDLDDGV---DEMGKQLMHDALPPVLSPE-------- 299
I+ + A KE LK LL KL E V+ D+ V D M K LPP P+
Sbjct: 308 -IRTNS-AKKEYLKHLLEKLAESVN-DENVPEEDPMRKDFFAHRLPPYKEPKTFGKMKSF 364
Query: 300 ELQCSVFENGLRMSQTGEVY 319
+L CSV R+S +Y
Sbjct: 365 DLNCSV-----RLSTKKHIY 379
>gi|409991367|ref|ZP_11274635.1| cupin 4 [Arthrospira platensis str. Paraca]
gi|291567777|dbj|BAI90049.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409937781|gb|EKN79177.1| cupin 4 [Arthrospira platensis str. Paraca]
Length = 387
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 116/240 (48%), Gaps = 26/240 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W ++ + I + NL + +K++++L + K+ +I L + G+
Sbjct: 20 YWTEKAVLIPGDNHQKFANLFSWQKLNNLLNYYPL---KHPEIRLAKAGE---------T 67
Query: 62 LPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
LP+I + +EG ++ I+ + ++ A L+ G N+Y + P QG
Sbjct: 68 LPEITNNEQIIKQCQEGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPGHQG 127
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
FA HYD E FILQ+ G+K W+V+ + L S FS + P + + PGDL
Sbjct: 128 FACHYDSHEVFILQISGRKHWRVFSDTFIYP-LSENRSSQFSPPDT-QPYIDAIINPGDL 185
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
LY+PRG+ H A + +E SLH+T+ + +T ID + + Q ++R+ LP+
Sbjct: 186 LYIPRGHWHYAIAI-DEPSLHLTLGIDCQTG-IDFSDWLTSQLQQHP-----QWRKNLPL 238
>gi|168047079|ref|XP_001775999.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672657|gb|EDQ59191.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 992
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 57/110 (51%), Gaps = 4/110 (3%)
Query: 92 NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPR 151
N EL VGAN Y TPP +QG H+DD F+ QL G+K W VY P E LPR
Sbjct: 521 NGLAAELGQVTVGANLYLTPPGSQGLRVHFDDHCVFVCQLRGRKGWDVYPPL---EQLPR 577
Query: 152 -YSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTI 200
YS S E L+ D LY+PRG++H+A T E ++ V I
Sbjct: 578 LYSFKTLSTEVTKDYATHFDLQEWDTLYIPRGFLHEARTECPEQTIEVQI 627
>gi|223996773|ref|XP_002288060.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977176|gb|EED95503.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 488
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 86/183 (46%), Gaps = 20/183 (10%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
Y +GCSI + + +L LQ F V AN Y TPP H DD + F++
Sbjct: 131 YLDGCSIVVNHADLQSASIAKLCNDLQSSFP-HVYANAYLTPPNGFAVNAHADDRDVFVI 189
Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL---------TVTLEPGDLLYLP 180
Q+ G KKW VY ++ EY S E+ + + L PGD++Y+P
Sbjct: 190 QVLGTKKWNVY-KKVPVEYPFENEQVGKSGREVPPSVFEGGLCFGNNVLDLGPGDVMYMP 248
Query: 181 RGYIHQAST----VTNEH--SLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
RG++H+A+T V + H S H+TI++ W L + + + +++R+ L
Sbjct: 249 RGFVHEATTEILDVEDGHSPSFHITIAIATHD-W--CLSVVLADCFRKTLSEVVDYRKAL 305
Query: 235 PIG 237
PIG
Sbjct: 306 PIG 308
>gi|326521268|dbj|BAJ96837.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 777
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 89/184 (48%), Gaps = 26/184 (14%)
Query: 36 IEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL 95
I ++ ++DG T+ + A + + G SI + + + + ++L
Sbjct: 393 ISTGSGVEEHFFDDG---TVFPDAAAFVEKCKGAIKNGFSIALRGMEFRSEKIAAVASAL 449
Query: 96 QELFG-CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY------LPRMVDEY 148
+LFG VGAN Y +PP +QG A HYDD + QL G KKWK++ LPR+ Y
Sbjct: 450 ADLFGQPSVGANIYFSPPRSQGLARHYDDHCVLVWQLLGCKKWKIWPNTRPILPRL---Y 506
Query: 149 LPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST----------VTNEHSLHV 198
P +S + G L GD++Y+PRG++H+A T + +SLH+
Sbjct: 507 EPFHSLDGLVDDSGGR---VEVLHEGDIMYVPRGHVHEAHTDLDEGESEVNASTNYSLHL 563
Query: 199 TISV 202
T+++
Sbjct: 564 TLAI 567
>gi|404446598|ref|ZP_11011704.1| hypothetical protein MVAC_25070 [Mycobacterium vaccae ATCC 25954]
gi|403650187|gb|EJZ05456.1| hypothetical protein MVAC_25070 [Mycobacterium vaccae ATCC 25954]
Length = 401
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/292 (24%), Positives = 125/292 (42%), Gaps = 44/292 (15%)
Query: 1 DHWEKEPLYISRKQ-KNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD--- 56
++W ++PL K + +L++ + +D+++ + + + I L ++GQ D
Sbjct: 16 EYWGRKPLLSPAKTLPRDFVDLLSPETVDELIAERGV---RAPFIRLAKEGQVLAKDSYL 72
Query: 57 --------VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
+ + P V G ++ PL + G V AN Y
Sbjct: 73 GPAGFGAEITDQVDPAKVLTELGSGATVVLQGLHRLWPPLIEFVGQAVADIGHPVQANAY 132
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEI 162
TPP +GF HYD + F+LQ+ G+K+W V+ P + P + P +
Sbjct: 133 ITPPGNRGFDFHYDVHDVFVLQVSGRKRWIVHEPVHLH---PLPNQPWTDHRARIEERVT 189
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
P++ L GD LYLPRG++H A + + S+H+TI V T +D+ + + ++
Sbjct: 190 ADPVIDAELAAGDALYLPRGWVHAAQAL-DTTSIHLTIGVSATTG-VDVARAVLDELSRS 247
Query: 223 AGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYV 274
EFR LP+G PVD AM ++A++V ++
Sbjct: 248 D-----EFRAPLPLGI---------DPVDGN----AMAATAAKVIAEIVTHL 281
>gi|423062943|ref|ZP_17051733.1| cupin 4 family protein [Arthrospira platensis C1]
gi|406715522|gb|EKD10676.1| cupin 4 family protein [Arthrospira platensis C1]
Length = 387
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 65/240 (27%), Positives = 116/240 (48%), Gaps = 26/240 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W ++ + I + +L + +K++D+L + K+ +I L + G+
Sbjct: 20 YWTEKSVLIPGANHQKFADLFSWQKLNDLLNYYPL---KHPEIRLAKTGE---------T 67
Query: 62 LPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
LP+I + ++G ++ I+ + ++ A L+ G N+Y + P QG
Sbjct: 68 LPEITNNEQIIKQCQQGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPGHQG 127
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
FA HYD E FILQ+ G+K W+V+ + L S FS + P + + PGDL
Sbjct: 128 FACHYDSHEVFILQISGRKHWRVFSDTFIYP-LSENRSSQFSPPDT-QPYIDAIINPGDL 185
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPI 236
LY+PRG+ H A + +E SLH+T+ + +T ID + + Q ++R+ LP+
Sbjct: 186 LYIPRGHWHYAIAI-DEPSLHLTLGIDCQTG-IDFSDWLTSQLQQHP-----QWRKNLPL 238
>gi|224144250|ref|XP_002325235.1| predicted protein [Populus trichocarpa]
gi|222866669|gb|EEF03800.1| predicted protein [Populus trichocarpa]
Length = 764
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 81/178 (45%), Gaps = 22/178 (12%)
Query: 38 FKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQE 97
F+K +D ++ +D+ + ++EG +I + + + +L
Sbjct: 385 FQKKVDPCCFKKLAFNNVDIMKCE------EAFKEGYTIALRGVEFRFASIAAVADALAS 438
Query: 98 LFGC-FVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY------LPRMVDEYLP 150
LFG VGAN Y TPP +QG A H DD F+ QL G K+W +Y LPR+ D +
Sbjct: 439 LFGQPSVGANIYLTPPNSQGLARHCDDHCVFVCQLFGTKQWTIYPRPNLQLPRLYDPFDR 498
Query: 151 RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTN------EHSLHVTISV 202
+ ++ L GD+LY+PRG+ H+A T + SLHVT V
Sbjct: 499 EHC---LGEQNSLAECRKFLLREGDILYIPRGFPHEACTHDDGSSDLARFSLHVTFGV 553
>gi|323449702|gb|EGB05588.1| hypothetical protein AURANDRAFT_66360, partial [Aureococcus
anophagefferens]
Length = 714
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
Query: 75 SIRFLNPQTYIKPLQQLNASLQELF--GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
S+ L + + ++Q + GC V N Y TPP AQ HYDD F++QL
Sbjct: 262 SVVVLGADARFRCAAEARRAVQRVLRVGCSV--NLYATPPGAQALDAHYDDHCVFVVQLR 319
Query: 133 GKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV-- 190
G+K+W++Y P + L + P + V L PGD+LY+PRG H A+
Sbjct: 320 GRKRWRLYGPALECPTLHEATLPPPPELLRRGADHAVALAPGDVLYVPRGVYHAATACED 379
Query: 191 TNEHSLHVTISV 202
S HVT+ V
Sbjct: 380 AGSDSAHVTVGV 391
>gi|376007834|ref|ZP_09785019.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375323810|emb|CCE20772.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 387
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 91/183 (49%), Gaps = 14/183 (7%)
Query: 59 GRALPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
G LP+I + +EG ++ I+ + ++ A L+ G N+Y + P
Sbjct: 65 GETLPEITNNEQIIKQCQEGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPG 124
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
QGFA HYD E FILQ+ G+K W+V+ + L S FS + P + + P
Sbjct: 125 HQGFACHYDSHEVFILQISGRKHWRVFSDTFIYP-LSENRSSQFSPPDT-QPYIDAIINP 182
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
GDLLY+PRG+ H A + +E SLH+T+ + +T ID + + Q ++R+
Sbjct: 183 GDLLYIPRGHWHYAIAI-DEPSLHLTLGIDCQTG-IDFSDWLTSQLQQHP-----QWRKN 235
Query: 234 LPI 236
LP+
Sbjct: 236 LPL 238
>gi|209526744|ref|ZP_03275266.1| Cupin 4 family protein [Arthrospira maxima CS-328]
gi|209492788|gb|EDZ93121.1| Cupin 4 family protein [Arthrospira maxima CS-328]
Length = 387
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 91/183 (49%), Gaps = 14/183 (7%)
Query: 59 GRALPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
G LP+I + +EG ++ I+ + ++ A L+ G N+Y + P
Sbjct: 65 GETLPEITNNEQIIKQCQEGATLIIDRLHEKIEAIAKMVALLRIEIGHRSQVNSYCSFPG 124
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
QGFA HYD E FILQ+ G+K W+V+ + L S FS + P + + P
Sbjct: 125 HQGFACHYDSHEVFILQISGRKHWRVFSDTFIYP-LSENRSSQFSPPDT-QPYIDAIINP 182
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
GDLLY+PRG+ H A + +E SLH+T+ + +T ID + + Q ++R+
Sbjct: 183 GDLLYIPRGHWHYAIAI-DEPSLHLTLGIDCQTG-IDFSDWLTSQLQQHP-----QWRKN 235
Query: 234 LPI 236
LP+
Sbjct: 236 LPL 238
>gi|242077498|ref|XP_002448685.1| hypothetical protein SORBIDRAFT_06g031470 [Sorghum bicolor]
gi|241939868|gb|EES13013.1| hypothetical protein SORBIDRAFT_06g031470 [Sorghum bicolor]
Length = 786
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 81/148 (54%), Gaps = 21/148 (14%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFI 128
++ G SI + + + + ++L +LFG VGAN Y +P +QG A HYDD +
Sbjct: 428 FKNGYSIALRGMEFRSEKVAAIASALADLFGQPSVGANIYFSPARSQGLARHYDDHCVLV 487
Query: 129 LQLEGKKKWKVY------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
QL G KKW ++ LPR+ D + P + + + IG + V LE GD++Y+PRG
Sbjct: 488 WQLLGSKKWMIWPNPKPLLPRLYDPFDPLDGTLD---KNIGR--VEVLLE-GDMMYVPRG 541
Query: 183 YIHQAST--------VTNEHSLHVTISV 202
Y+H+A T ++SLH+T+++
Sbjct: 542 YVHEARTDVGGSEVNAYADYSLHLTLAI 569
>gi|170698073|ref|ZP_02889154.1| Cupin 4 family protein [Burkholderia ambifaria IOP40-10]
gi|170137034|gb|EDT05281.1| Cupin 4 family protein [Burkholderia ambifaria IOP40-10]
Length = 380
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 87/188 (46%), Gaps = 15/188 (7%)
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
V R I++D R G ++ F + + ++ + S+ G AN Y +
Sbjct: 74 VRRRLEKGILYDELRNGTTLVFNRMELTLYKVRLICKSISRFVGEHTVANGYIAFGEEES 133
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP-RYSSPNFSQEEI-GTPILTVTLEPG 174
F H+D F +Q+ G+K+W VY P LP ++ Q E P + VT+E G
Sbjct: 134 FGKHWDTHSVFAVQMMGRKRWLVYEPTHA---LPLKHQRSTGKQSECPAEPYMDVTIETG 190
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDLEFRR 232
D+LYLPRG+ H A + NE + H+ + V++ T +I L + A FR+
Sbjct: 191 DILYLPRGWWHTAIPL-NEETFHLAVGVHESTISDYIKYLADEIIGDFDA-------FRQ 242
Query: 233 GLPIGYLR 240
+P+G R
Sbjct: 243 TIPLGERR 250
>gi|167034171|ref|YP_001669402.1| cupin 4 family protein [Pseudomonas putida GB-1]
gi|166860659|gb|ABY99066.1| Cupin 4 family protein [Pseudomonas putida GB-1]
Length = 303
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 111/253 (43%), Gaps = 25/253 (9%)
Query: 19 NNLITSKKIDDILRDNVIEFKKNIDITLYEDG----------QRQTMDVEGRALPQI--- 65
+ L+ +I+++L +++++ + I L DG T+ V G P I
Sbjct: 32 HQLVGWGEINNLLEKDILDYPR---IRLANDGIPSERGFKGFVTYTLTVTGETSPHINRY 88
Query: 66 -VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
+ + G ++ Q + + QQ + L C GAN Y + F H+D+
Sbjct: 89 NLLKRLQTGSTLIIDRCQAFFERAQQAASYLSTHLRCRSGANLYCAWSSTPSFGAHFDNH 148
Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-EIGTPILTVTLEPGDLLYLPRGY 183
+ +Q+EG K+W+VY P P + +F Q G P+L TL PG +Y+P GY
Sbjct: 149 DVIAVQIEGVKRWEVYAP---TRPYPLLNDKSFDQTPPAGEPMLCHTLTPGQAIYVPAGY 205
Query: 184 IHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGL-PIGYLRYA 242
H T T E S+H++ V + IDL+ + + L+A+ G IG R +
Sbjct: 206 WHNVFTET-ERSMHISFPVVRPRK-IDLIRMVLER-LEASAELREPIAHGSDAIGNARLS 262
Query: 243 GLARGKPVDIQAD 255
G+ +I D
Sbjct: 263 GVLYACLANIDID 275
>gi|392544642|ref|ZP_10291779.1| cupin 4 [Pseudoalteromonas piscicida JCM 20779]
Length = 370
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 91/180 (50%), Gaps = 9/180 (5%)
Query: 49 DGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY 108
+ ++ ++ ++L QI+ + + +G ++ Q L++ + L + F C V AN Y
Sbjct: 69 NASKREYQIKPQSLEQIL-ELFADGYTVVLPRAQIRNTRLKKFVSDLSKYFHCRVDANIY 127
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV-----DEYLPRYSSPNFSQEEIG 163
TPP +QGF H D + FILQ EG+K+W + R V DE R +F +++
Sbjct: 128 VTPPKSQGFKVHLDMHDVFILQTEGEKEWILSKKRPVKITSNDEKF-RVFLESFDEKKDF 186
Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAA 223
VTL G LYLPRG H ++ T++ S+H++++++ W+ L L +
Sbjct: 187 INTEKVTLTSGRKLYLPRGVYHCGNS-TDKVSIHISVTLF-PLQWVGFLSDQFSNDLSHS 244
>gi|392544644|ref|ZP_10291781.1| cupin 4 family protein [Pseudoalteromonas piscicida JCM 20779]
Length = 390
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 70/132 (53%), Gaps = 9/132 (6%)
Query: 88 LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKW------KVYL 141
L ++ + + AN Y TPP ++GF HYD + +LQ EG KKW KV+L
Sbjct: 105 LSEICGLISDQLDVRANANIYLTPPNSKGFTAHYDMHDVLLLQTEGAKKWVVSKCDKVHL 164
Query: 142 PRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
D P + + + ++ + T T+ L+ G +LY+PRG IH+ V + S+H+T +
Sbjct: 165 TLTEDITRPEFYTIDPEKDLVNTK--TLELKRGQVLYMPRGTIHKGEAVGDRPSIHITFN 222
Query: 202 VYQKTAWIDLLE 213
+ T W +L+E
Sbjct: 223 IL-PTTWYNLIE 233
>gi|294629460|ref|ZP_06708020.1| cupin superfamily protein [Streptomyces sp. e14]
gi|292832793|gb|EFF91142.1| cupin superfamily protein [Streptomyces sp. e14]
Length = 332
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 81/183 (44%), Gaps = 25/183 (13%)
Query: 52 RQTMDVEGRALPQ------------IVWDHYR--------EGCSIRFLNPQTYIKP-LQQ 90
R + V+G ALPQ VW + G + L+ + P +
Sbjct: 57 RLRLAVDGEALPQHAYSVPVTTRRNTVWHRLKPAELHRHLAGGATLVLDAIDELHPGVGH 116
Query: 91 LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
L L+ + N Y + +GF H+DD + +LQL+G K+W++Y P
Sbjct: 117 LARGLERHLRTGIQVNAYASWTPQEGFGVHWDDHDVLVLQLDGAKRWRIY---GATRQAP 173
Query: 151 RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWID 210
+ + + G P++ + L GD+LYLPRG+ H + HSLH+T + Q T D
Sbjct: 174 LHRDTDVPEAPPGEPLVDMVLNAGDMLYLPRGWWHAVAASEGVHSLHLTCGM-QTTTGAD 232
Query: 211 LLE 213
LL+
Sbjct: 233 LLQ 235
>gi|320164494|gb|EFW41393.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 869
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 65/121 (53%), Gaps = 6/121 (4%)
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPR-YS 153
L+E F N Y +P +AQGF H+D +++ ILQ G K W+V P D LP +
Sbjct: 329 LEESLSTFASVNMYLSPASAQGFKAHHDLMDSVILQTAGSKVWRVCNP---DFQLPSDVN 385
Query: 154 SPNFSQEEIGTPILT-VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
S NF E+ T + L PGD+LYLPRG IH T+ N S+H+T + W ++
Sbjct: 386 SGNFLLSELRYMNCTDILLTPGDVLYLPRGTIHSPYTIGN-MSMHLTFGLSAAFPWQAVM 444
Query: 213 E 213
+
Sbjct: 445 D 445
>gi|428170986|gb|EKX39906.1| hypothetical protein GUITHDRAFT_113900 [Guillardia theta CCMP2712]
Length = 337
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 77/142 (54%), Gaps = 13/142 (9%)
Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQ 159
N Y TP +Q PH DD + ++Q+ G K+W VY LP DE + + S ++
Sbjct: 194 CNMYLTPAGSQAVHPHSDDRDVLLIQIWGTKEWLVYGSPQTLP-FSDEQVGK-SGQRLAE 251
Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH-SLHVTISV-YQKTAWIDLLEKAMP 217
IG L+ TL+ GD LY+PRG++H+A EH SLH+TI++ Q W ++ AM
Sbjct: 252 GTIGPVSLSATLKTGDTLYIPRGFVHEAK--AQEHGSLHITIAIPTQDFTWSGVMMDAMR 309
Query: 218 KALQAAGATDLEFRRGLPIGYL 239
+ L+ ++RR +P+ L
Sbjct: 310 QKLREEKYN--KWRRCVPLTLL 329
>gi|453049325|gb|EME96928.1| hypothetical protein H340_29042 [Streptomyces mobaraensis NBRC
13819 = DSM 40847]
Length = 395
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 99/206 (48%), Gaps = 24/206 (11%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNA-SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
R+G S+ L+ I P A L+ G V AN Y + +GF H+DD + ++
Sbjct: 96 RDGASL-VLDAVEKIHPSVGAAAEGLERFLGTSVQANVYASWTEREGFGRHWDDHDVVVV 154
Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
QL G K+W+++ M E P + +E P+ + + PGD+LYLPRG+ H +
Sbjct: 155 QLHGSKRWRLW--GMTREG-PTFRDVESPEEPESDPLADLVMSPGDVLYLPRGWWHSVTA 211
Query: 190 VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKP 249
SLH+T + T DL+ + + L+A+ L RR +P R+ L
Sbjct: 212 DQGTESLHLTFGMVPHTG-ADLMLWVVDQ-LRAS----LALRRDIP----RFGSLTE--- 258
Query: 250 VDIQADRL-AMKENLKDLLA--KLVE 272
QAD A++ + D++A +LVE
Sbjct: 259 ---QADFTDALRREVADMMADPRLVE 281
>gi|167538155|ref|XP_001750743.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770767|gb|EDQ84448.1| predicted protein [Monosiga brevicollis MX1]
Length = 2266
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 75/146 (51%), Gaps = 17/146 (11%)
Query: 86 KPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR-- 143
+P L LQ LF + Y +PP AQ APH D + F++Q+ G+K+W + P+
Sbjct: 171 RPYSILQRQLQALFTTMTNVHAYISPPNAQALAPHTDPYDVFVVQVYGQKEWTLCTPQPP 230
Query: 144 ----MVDEYLPRY---------SSPNFSQEEIG-TPILTVTLEPGDLLYLPRGYIHQAST 189
+ D + ++ N+ + ++ +TL PGDLLY+P+G IH A+T
Sbjct: 231 GGQNLSDAHKAQWQEIAKHNIQGCTNYQEWQLAKMDCQHITLLPGDLLYIPKGVIHYATT 290
Query: 190 VTNEHSLHVTISVYQKT-AWIDLLEK 214
+ S H+T+S+ + + +W+ L +
Sbjct: 291 GSVTGSTHLTVSIERLSHSWMMLFHR 316
>gi|159491391|ref|XP_001703650.1| predicted protein [Chlamydomonas reinhardtii]
gi|158270558|gb|EDO96399.1| predicted protein [Chlamydomonas reinhardtii]
Length = 191
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 66/120 (55%), Gaps = 8/120 (6%)
Query: 28 DDILRD-NVIEFKKNIDITLYEDGQRQTMDV---EGRALPQIVWDHYREGCSIRFLNPQT 83
D+++RD + F +++ Y DG+R+ + + AL Q+ H +++ PQ
Sbjct: 12 DEMIRDIGPLVFGRDVVAVRYIDGKREEIRASFADSEALKQL---HDDARATLQLHQPQR 68
Query: 84 YIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGK-KKWKVYLP 142
++ +L A+L+ GC VG N Y TPP QG APH+DD+E F+ Q +G+ +W LP
Sbjct: 69 FVNACWRLLAALESQLGCLVGCNAYVTPPGGQGLAPHHDDVELFVCQTQGEGGRWGGRLP 128
>gi|452954391|gb|EME59794.1| Cupin 2 barrel domain-containing protein [Amycolatopsis decaplanina
DSM 44594]
Length = 393
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 94/210 (44%), Gaps = 19/210 (9%)
Query: 14 QKNYYNNLITSKKIDDILRDNVIEFKK--------NIDITLYED---GQRQTMDVEGRAL 62
+ + +L+ ++ ILR + +EF + + + Y D +R RA
Sbjct: 34 RAGRFADLLPWPALNAILRRHRLEFPRLRLALDGTPVPMESYTDLVPTRRSGTVPRLRAA 93
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P +H R G ++ Q P+ L SL+ V N Y GF H+D
Sbjct: 94 PFT--EHLRGGATMVLDAIQELSDPIGDLARSLEHDLRESVQVNLYAGWGVTHGFDVHWD 151
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
D +AFI+Q+ G+K+W+++ P P E G PI LE GD+LY+PRG
Sbjct: 152 DHDAFIIQVAGRKRWRMHGPTRPAPLHRDVEQP----ERPGEPIDDFILEDGDVLYVPRG 207
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
+ H + V E SLH+T+ T IDL+
Sbjct: 208 HWHDVTAV-GEMSLHLTLGFSPATG-IDLV 235
>gi|19387273|gb|AAL87184.1|AF480497_12 unknown [Oryza sativa Japonica Group]
gi|38345193|emb|CAE02886.2| OSJNBa0015K02.3 [Oryza sativa Japonica Group]
gi|38346410|emb|CAE54575.1| OSJNBa0011F23.16 [Oryza sativa Japonica Group]
Length = 774
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
D ++ G S+ + + + + +++ +LFG VGAN Y +PP AQG A HYDD
Sbjct: 414 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 473
Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+ QL G KKW K+ LPR+ + + P + + G + + LE GD++Y+P
Sbjct: 474 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 527
Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
RG++H+A T T + SLH+T+++
Sbjct: 528 RGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAI 559
>gi|116310856|emb|CAH67798.1| OSIGBa0132E09-OSIGBa0108L24.12 [Oryza sativa Indica Group]
Length = 774
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
D ++ G S+ + + + + +++ +LFG VGAN Y +PP AQG A HYDD
Sbjct: 414 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 473
Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+ QL G KKW K+ LPR+ + + P + + G + + LE GD++Y+P
Sbjct: 474 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 527
Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
RG++H+A T T + SLH+T+++
Sbjct: 528 RGFVHEAHTDVDVGGFEVISTVDCSLHLTLAI 559
>gi|218195745|gb|EEC78172.1| hypothetical protein OsI_17757 [Oryza sativa Indica Group]
Length = 769
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
D ++ G S+ + + + + +++ +LFG VGAN Y +PP AQG A HYDD
Sbjct: 409 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 468
Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+ QL G KKW K+ LPR+ + + P + + G + + LE GD++Y+P
Sbjct: 469 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 522
Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
RG++H+A T T + SLH+T+++
Sbjct: 523 RGFVHEAHTDVDVGGFEVISTVDCSLHLTLAI 554
>gi|222629703|gb|EEE61835.1| hypothetical protein OsJ_16482 [Oryza sativa Japonica Group]
Length = 769
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
D ++ G S+ + + + + +++ +LFG VGAN Y +PP AQG A HYDD
Sbjct: 409 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 468
Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+ QL G KKW K+ LPR+ + + P + + G + + LE GD++Y+P
Sbjct: 469 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 522
Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
RG++H+A T T + SLH+T+++
Sbjct: 523 RGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAI 554
>gi|427429763|ref|ZP_18919750.1| Cupin 4 family protein [Caenispirillum salinarum AK4]
gi|425880000|gb|EKV28701.1| Cupin 4 family protein [Caenispirillum salinarum AK4]
Length = 359
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 81/180 (45%), Gaps = 18/180 (10%)
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P V + R GCS+ T L+ + A+L++ AN Y + + Q F H D
Sbjct: 92 PSKVMELVRAGCSLGLNRLDTLDAGLRAVGAALEDALAGKAQANLYYSRRSNQAFGVHSD 151
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ-------EEIGTPILTVTLEPGD 175
+ F L EG+K+W +Y D + P F + G T+ ++PGD
Sbjct: 152 SHDVFALHCEGEKEWHLYEGVDPDP----INHPAFKKLPLEERRRRAGGVAETLVMKPGD 207
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
LLY+PRG+ H A T++ +HV V Q +DLL M +A D FRR LP
Sbjct: 208 LLYIPRGHYHDALCRTDD-CVHVAYGVTQVIG-MDLLTLLMNRA-----PADAAFRRALP 260
>gi|297723531|ref|NP_001174129.1| Os04g0659150 [Oryza sativa Japonica Group]
gi|255675849|dbj|BAH92857.1| Os04g0659150 [Oryza sativa Japonica Group]
Length = 430
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 81/152 (53%), Gaps = 23/152 (15%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEA 126
D ++ G S+ + + + + +++ +LFG VGAN Y +PP AQG A HYDD
Sbjct: 278 DAFKNGFSVALRGMEFRSEKIAAIASAVADLFGQPSVGANIYFSPPRAQGLARHYDDHCV 337
Query: 127 FILQLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+ QL G KKW K+ LPR+ + + P + + G + + LE GD++Y+P
Sbjct: 338 LVWQLLGCKKWMIWPDTKLLLPRLYEPFEPL---DDLVDDCGGR--MEILLE-GDIMYVP 391
Query: 181 RGYIHQAST----------VTNEHSLHVTISV 202
RG++H+A T T + SLH+T+++
Sbjct: 392 RGFVHEAHTDVDVGGFEVNSTVDCSLHLTLAI 423
>gi|398784687|ref|ZP_10547867.1| hypothetical protein SU9_15724 [Streptomyces auratus AGR0001]
gi|396994993|gb|EJJ06016.1| hypothetical protein SU9_15724 [Streptomyces auratus AGR0001]
Length = 390
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 5/125 (4%)
Query: 89 QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
+ L +L+ F V N Y + +GF H+DD + ++QLEG K+WK+Y +D
Sbjct: 113 EDLATALERHFRTDVQVNLYASWTPQEGFGVHWDDHDVVVVQLEGAKRWKLYGATRIDPL 172
Query: 149 LPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAW 208
+P + G P+ + L+ GD+LYLPRG+ H A T SLH+T + T
Sbjct: 173 RVDTEAP---EPPTGAPVAEIVLQAGDMLYLPRGWWH-AVAATEGRSLHLTCGLKPTTG- 227
Query: 209 IDLLE 213
DLL
Sbjct: 228 ADLLS 232
>gi|404254172|ref|ZP_10958140.1| cupin 4 family protein [Sphingomonas sp. PAMC 26621]
Length = 400
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 115/275 (41%), Gaps = 33/275 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITL---------YEDGQR 52
+W+++P+++ +L+T ++ ++L N+ + L Y DG
Sbjct: 24 YWDRQPVHLPSPAGGR-GSLLTWERFSELLGTGGHWTAANLKLILNSRPILPDFYLDGDP 82
Query: 53 QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
RA P V G S+ + + L++ L F AN Y +
Sbjct: 83 AAPIDTRRADPARVGVFLGMGASLVADDVERIAPELRRWTTLLGTEFAALTNANVYCSFK 142
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-------EIGTP 165
Q FA HYD + F LQ EG+K W++Y R + S+P+ +E G
Sbjct: 143 GVQAFATHYDLHDVFALQCEGEKVWRIYRNRAENP----VSAPHGGEEVQRQIDAARGDL 198
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
+ V + PGD+LYLPRG H A SLHV+ +V L A+ +AL+ A
Sbjct: 199 MAEVRMRPGDVLYLPRGCYHDA-LAQEGASLHVSFAV------APLGAMALLRALEEAAL 251
Query: 226 TDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMK 260
+ FR YL A G+P++ Q LA +
Sbjct: 252 REPAFR-----AYLADARSDDGQPLEAQFQALARR 281
>gi|291004158|ref|ZP_06562131.1| cupin 4 [Saccharopolyspora erythraea NRRL 2338]
Length = 374
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/173 (31%), Positives = 83/173 (47%), Gaps = 15/173 (8%)
Query: 69 HYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFI 128
+ R+GC++ + Y ++ +LQ V N Y T + GF H+DD + +
Sbjct: 58 YLRDGCTLVLDDLGAYDPTMEVTCRALQWELNERVQVNAYLTTGESGGFCLHWDDHDVIV 117
Query: 129 LQLEGKKKWKVYLPRMVDEYLP--RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
+Q+ G K W+V R P R S PN + E + T TL GD++ +PRG+ HQ
Sbjct: 118 VQVAGAKTWEV---RPTSRAYPMFRDSDPNLAAPE--KTLWTGTLNAGDVMLIPRGHWHQ 172
Query: 187 ASTVT--NEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
A+ T + +SLH+T ++T ID L +A + A FRR L G
Sbjct: 173 ATRATHGDGYSLHLTFGFTRRTG-IDWLTWVADQARKTA-----TFRRDLDAG 219
>gi|76155855|gb|AAX27127.2| SJCHGC04036 protein [Schistosoma japonicum]
Length = 430
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/228 (27%), Positives = 97/228 (42%), Gaps = 63/228 (27%)
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGAT 226
L V L+ GDLLY+P+GY S EHSLH+ IS QK W + L + + ++ A
Sbjct: 19 LLVKLKSGDLLYIPKGYTCSISHSNVEHSLHLCISFNQKYNWGNYLSELLSIMMKYAVEN 78
Query: 227 DLEFRRGLPIGYLRYAGLARGK------PVDIQADRLAMKENLKDLLAKL---------- 270
D EF + LP+ ++ G+ K V+ + RL + LK L++K+
Sbjct: 79 DPEFNKPLPLDSFKHLGIFFNKDSSSKSAVEEKEFRLQSVQLLKRLISKIEQRQTFSIQK 138
Query: 271 --------------------------------------VEYVD-LDDGVDEMGKQLMHDA 291
+ VD L +DE+ ++M +
Sbjct: 139 SKSLNHQLNEDKQCSLLSSAKEIDSTKKKQPNPYDNLFIHSVDPLHLAIDEIMLKIMKKS 198
Query: 292 LPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
P LS +EL S+F+ G +Q+T ++K+RLVR+ AVR
Sbjct: 199 CSPELSEDELMRSIFKQGSD--------KLSQLTVNSKIRLVRSTAVR 238
>gi|375098356|ref|ZP_09744619.1| hypothetical protein SaccyDRAFT_0024 [Saccharomonospora cyanea
NA-134]
gi|374659088|gb|EHR58966.1| hypothetical protein SaccyDRAFT_0024 [Saccharomonospora cyanea
NA-134]
Length = 386
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 136/326 (41%), Gaps = 49/326 (15%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDN--------VIEFKKNIDITLYEDGQR-Q 53
W K L + + + +L++ +D++L ++ ++ + + ++ Y R
Sbjct: 20 WGKRAL-VHGPTVDGFEDLLSLDGVDELLTEHSLRSPAFRLVREGRTLPLSDYTRTARIG 78
Query: 54 TMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
V G A P + G ++ Y PL L+ G N Y TPP
Sbjct: 79 GTPVTGVADPARILAAVEGGATLVLQGLHRYWPPLAGFFRELELSLGHPCQVNAYVTPPG 138
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
AQG PH D + F+LQ G K W+V + +P E V LEP
Sbjct: 139 AQGLRPHSDRHDVFVLQAFGSKSWQV------------WPAPG-EGAEADRGAEDVELEP 185
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRG 233
G LYLP G H A+ ++ S H+T+ ++ W DLLE A+ + L D +
Sbjct: 186 GTALYLPTGTRH-AARAQHDVSGHLTVGIH-PARWRDLLEAAVTRVLD-----DPAWTSP 238
Query: 234 LPIGYLRYAGLARGKPVDIQADRLA--MKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDA 291
LP+ Y R Q D + + +LK L A+L+ D D V+E ++ + +
Sbjct: 239 LPVRYHR------------QRDSVVNELGRHLKQLGAQLM-ATDPSDPVEERVERFL-TS 284
Query: 292 LPPVLS---PEELQCSVFENGLRMSQ 314
PP+L + L+ S ++ R+ +
Sbjct: 285 RPPMLRGGLSDRLRLSTMDDSARVRR 310
>gi|392544643|ref|ZP_10291780.1| cupin 4 [Pseudoalteromonas piscicida JCM 20779]
Length = 391
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 81/176 (46%), Gaps = 13/176 (7%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
Y +G +I + + ++ + L F C V AN Y TPP +GF HYD + IL
Sbjct: 85 YADGHTIVLNSVHKRWPVIAKVCSELSNQFFCRVNANIYVTPPGNKGFEAHYDTHDVIIL 144
Query: 130 QLEGKKKW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
Q EG K+W ++ + + D+ + + ++ + +++ L+ G LY+PRG
Sbjct: 145 QTEGSKEWFLSEHERLQITHINDDLRNEFYEIDAEKDTKNS--VSIQLDAGQCLYVPRGT 202
Query: 184 IHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
IH A + S+H+T S++ T W L M + D E + +P L
Sbjct: 203 IHSAEANNSLPSIHITFSIFPVT-WASL----MRAVILDGSLKDTELMKTVPFEVL 253
>gi|318057978|ref|ZP_07976701.1| cupin 4 family protein [Streptomyces sp. SA3_actG]
gi|318076149|ref|ZP_07983481.1| cupin 4 family protein [Streptomyces sp. SA3_actF]
Length = 280
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 98/223 (43%), Gaps = 27/223 (12%)
Query: 69 HYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFI 128
++ G +IR N Q + L ++ +Q G + + TPP QG H+D A I
Sbjct: 65 YHEAGYTIRLGNLQRVVPFLNSVSRQIQMETGYSNYLHAFLTPPGGQGLRHHWDQQMALI 124
Query: 129 LQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE------EIGTPILTVTLEPGDLLYLPRG 182
+Q+ G K W+++ P+ S E E P T+ LEPG L LPRG
Sbjct: 125 VQISGTKTWELWAPKFPSPMRNYAESTRVWDETWRHEWESHGPDHTIELEPGQTLLLPRG 184
Query: 183 YIHQA-STVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRY 241
++H S +E S+H+T ++ ++T + L EK KA+ DL
Sbjct: 185 WVHNPHSRNASEDSIHLTFAIRERTH-LWLAEKLTAKAIGRPEFRDLVI----------- 232
Query: 242 AGLARGKPVDIQADRLAMK-ENLKDLLAKLVEYVDLDDGVDEM 283
P ++ D L+ + E+ + +L + + +DL +E+
Sbjct: 233 -------PPELHGDDLSSRLEDTRRMLVRFLASLDLSAEAEEI 268
>gi|301018918|ref|ZP_07183146.1| cupin family protein [Escherichia coli MS 196-1]
gi|433051100|ref|ZP_20238364.1| hypothetical protein WII_04991 [Escherichia coli KTE120]
gi|299882449|gb|EFI90660.1| cupin family protein [Escherichia coli MS 196-1]
gi|431557901|gb|ELI31586.1| hypothetical protein WII_04991 [Escherichia coli KTE120]
Length = 297
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 93/204 (45%), Gaps = 30/204 (14%)
Query: 18 YNNLITSKKIDDILRDNVIEFKK------NI-DITLYEDGQRQTMDVEGRALPQI----V 66
Y NLI ++ +L +++ + + N +I Y+ R T G P I +
Sbjct: 30 YKNLINWNDLNKLLEKDILHYPRVRMANDNFPEIRGYKGFIRYTYSQTGDRTPHINRHQL 89
Query: 67 WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
+ R+G ++ Q++ + + + L + C AN Y F H+D+ +
Sbjct: 90 YKCLRDGATLIVDRCQSFFESVDESRLWLSKELECTCSANLYAAFTATPSFGLHFDNHDV 149
Query: 127 FILQLEGKKKWKVYLPR----MVDE----YLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
+Q+EG KKWKVY P + DE YLP +SP++ + PG +Y
Sbjct: 150 IAVQIEGIKKWKVYNPTYSYPLEDERSFDYLPPNTSPDYE----------FDITPGQAIY 199
Query: 179 LPRGYIHQASTVTNEHSLHVTISV 202
LP GY H +T + +HSLH++ +V
Sbjct: 200 LPAGYWHNVTTQS-KHSLHISFTV 222
>gi|432521851|ref|ZP_19759003.1| hypothetical protein A17U_04852 [Escherichia coli KTE228]
gi|433161660|ref|ZP_20346455.1| hypothetical protein WKU_04739 [Escherichia coli KTE177]
gi|431043503|gb|ELD53834.1| hypothetical protein A17U_04852 [Escherichia coli KTE228]
gi|431670119|gb|ELJ36475.1| hypothetical protein WKU_04739 [Escherichia coli KTE177]
Length = 297
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 93/204 (45%), Gaps = 30/204 (14%)
Query: 18 YNNLITSKKIDDILRDNVIEFKK------NI-DITLYEDGQRQTMDVEGRALPQI----V 66
Y NLI ++ +L +++ + + N +I Y+ R T G P I +
Sbjct: 30 YKNLINWNDLNKLLEKDILHYPRVRMANDNFPEIRGYKGFIRYTYSQTGDRTPHINRHQL 89
Query: 67 WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
+ R+G ++ Q++ + + + L + C AN Y F H+D+ +
Sbjct: 90 YKCLRDGATLIVDRCQSFFESVDESRLWLSKELECTCSANLYAAFTATPSFGLHFDNHDV 149
Query: 127 FILQLEGKKKWKVYLPR----MVDE----YLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
+Q+EG KKWKVY P + DE YLP +SP++ + PG +Y
Sbjct: 150 IAVQIEGIKKWKVYNPTYSYPLEDERSFDYLPPNTSPDYE----------FDITPGQAIY 199
Query: 179 LPRGYIHQASTVTNEHSLHVTISV 202
LP GY H +T + +HSLH++ +V
Sbjct: 200 LPAGYWHNVTTQS-KHSLHISFTV 222
>gi|410503126|ref|YP_006940181.1| Cupin 4 family protein [Escherichia coli]
gi|353234053|emb|CCE21301.1| Cupin 4 family protein [Escherichia coli]
Length = 297
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 93/204 (45%), Gaps = 30/204 (14%)
Query: 18 YNNLITSKKIDDILRDNVIEFKK------NI-DITLYEDGQRQTMDVEGRALPQI----V 66
Y NLI ++ +L +++ + + N +I Y+ R T G P I +
Sbjct: 30 YKNLINWNDLNKLLEKDILHYPRVRMANDNFPEIRGYKGFIRYTYSQTGDRTPHINRHQL 89
Query: 67 WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
+ R+G ++ Q++ + + + L + C AN Y F H+D+ +
Sbjct: 90 YKCLRDGATLIVDRCQSFFESVDESRLWLSKELECTCSANLYAAFTATPSFGLHFDNHDV 149
Query: 127 FILQLEGKKKWKVYLPR----MVDE----YLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
+Q+EG KKWKVY P + DE YLP +SP++ + PG +Y
Sbjct: 150 IAVQIEGIKKWKVYNPTYSYPLEDERSFDYLPPNTSPDYE----------FDITPGQAIY 199
Query: 179 LPRGYIHQASTVTNEHSLHVTISV 202
LP GY H +T + +HSLH++ +V
Sbjct: 200 LPAGYWHNVTTQS-KHSLHISFTV 222
>gi|145220941|ref|YP_001131619.1| cupin 4 family protein [Mycobacterium gilvum PYR-GCK]
gi|315442089|ref|YP_004074968.1| hypothetical protein Mspyr1_04210 [Mycobacterium gilvum Spyr1]
gi|145213427|gb|ABP42831.1| Cupin 4 family protein [Mycobacterium gilvum PYR-GCK]
gi|315260392|gb|ADT97133.1| uncharacterized conserved protein [Mycobacterium gilvum Spyr1]
Length = 402
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 74/149 (49%), Gaps = 12/149 (8%)
Query: 99 FGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF- 157
G V N Y TPP +GF H+D + F+LQ+ G K+W V+ P + LP S F
Sbjct: 123 IGHPVQVNAYITPPGNRGFDFHHDVHDVFVLQIAGSKRWIVHEP-VHAHPLPDQSWTQFR 181
Query: 158 ---SQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
+ P++ L GD LYLPRG++H A + + S+H+T+ V T +D+
Sbjct: 182 DLIDERVTADPVIDTALTEGDALYLPRGWVHAAQAL-DTTSIHLTVGVSATTG-VDVARA 239
Query: 215 AMPKALQAAGATDLEFRRGLPIGYLRYAG 243
+ + AG EFR LP+G AG
Sbjct: 240 VLD---ELAGID--EFRAPLPLGIDPAAG 263
>gi|167520418|ref|XP_001744548.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776879|gb|EDQ90497.1| predicted protein [Monosiga brevicollis MX1]
Length = 710
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 89/179 (49%), Gaps = 21/179 (11%)
Query: 34 NVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNA 93
NV K+ D E G++ D+ RAL REG +++F + + + L
Sbjct: 392 NVKFIKRTFDQK-QESGRKTEKDL-ARAL--------REGFTLQFYSVNYWDPNIASLAL 441
Query: 94 SLQELFGCF--VGANTYQTPP-TAQGFAPHYDDIEAFILQLEGKKKWKVY------LPRM 144
L E G V AN Y TP T+ PH D + ++QL G K+W+++ LP
Sbjct: 442 ELSE-HGILLPVNANLYITPGGTSVSLVPHTDYQCSLMVQLAGVKRWRLWKMPEIMLPVS 500
Query: 145 VDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH-SLHVTISV 202
+ R + + EE+G P + V L+PGD+LY+PRG +H ST +H S+H+T+ +
Sbjct: 501 ANMIRGRDTDDLVASEELGEPYMDVLLQPGDILYVPRGVLHATSTPEGDHPSMHLTVGM 559
>gi|357414603|ref|YP_004926339.1| hypothetical protein Sfla_5425 [Streptomyces flavogriseus ATCC
33331]
gi|320011972|gb|ADW06822.1| hypothetical protein Sfla_5425 [Streptomyces flavogriseus ATCC
33331]
Length = 403
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 73/143 (51%), Gaps = 8/143 (5%)
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY-- 152
+ + G N + T AQ FA H+D+++ F+LQ+ G K W+V+ P + LP Y
Sbjct: 120 VMRMVGETASCNLFVTFDDAQAFASHFDEVDTFVLQVLGTKSWQVHGPSE-EHPLPEYGD 178
Query: 153 SSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
S P E + + TLEPGD++++PRG+ H E SLH+T + ++T + D L
Sbjct: 179 SDPARCPEAV---LFERTLEPGDVIHVPRGWWHTVRG-GGESSLHLTFAFTRRTGY-DWL 233
Query: 213 EKAMPKALQAAGATDLEFRRGLP 235
+AL + + R G P
Sbjct: 234 RWVAYRALDSTDVRESLARAGTP 256
>gi|451336154|ref|ZP_21906715.1| hypothetical protein C791_2960 [Amycolatopsis azurea DSM 43854]
gi|449421346|gb|EMD26778.1| hypothetical protein C791_2960 [Amycolatopsis azurea DSM 43854]
Length = 393
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 103/230 (44%), Gaps = 26/230 (11%)
Query: 18 YNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALP------------QI 65
+ +L+ ++ ILR + +EF + + L DG M+ +P
Sbjct: 38 FADLLPWPALNAILRQHRLEFPR---LRLALDGTPVPMESYTELVPTRRSGTVPRLQAAP 94
Query: 66 VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
+H R+G ++ Q + L SL+ V N Y GF H+DD +
Sbjct: 95 FTEHLRDGATMVLDAIQELSDLIGDLARSLEHDLRESVQVNLYAGWGVTHGFDVHWDDHD 154
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
AFI+Q+ G+K+W+++ P P E G PI LE GD+LY+PRG+ H
Sbjct: 155 AFIIQVAGRKRWRMHGPTRPAPLHRDVEQP----ERPGEPIDDFILEDGDVLYVPRGHWH 210
Query: 186 QASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
+ V E SLH+T+ T IDL+ K L+++ A FR+ LP
Sbjct: 211 DVTAV-GEMSLHLTLGFSPATG-IDLVSWLADK-LRSSTA----FRQDLP 253
>gi|433608094|ref|YP_007040463.1| Cupin 2 barrel domain-containing protein [Saccharothrix espanaensis
DSM 44229]
gi|407885947|emb|CCH33590.1| Cupin 2 barrel domain-containing protein [Saccharothrix espanaensis
DSM 44229]
Length = 392
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 130/299 (43%), Gaps = 42/299 (14%)
Query: 14 QKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQ-------RQTMDVEG--RALPQ 64
+ + +L+ +++ +LR + +E + + L + G+ +T D R P
Sbjct: 33 RAGRFADLLPWSEVNRVLRQHPLEHPR---LRLAQGGEVLPAHTYTETSDTAPVPRLSPA 89
Query: 65 IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
+ R G ++ +P+ +L A+L+ V N Y GF H+DD
Sbjct: 90 AFTEKMRGGATLVMDAVHELFEPVGELAAALEHELRERVRVNLYAGWGVTHGFDVHWDDH 149
Query: 125 EAFILQLEGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
+ I+Q+ G+K+W+++ P D LP P S E P+ LE GD+LY+
Sbjct: 150 DVIIVQVSGRKRWRLHGSTRPAPLRRDVELP----PRPSTE----PLDDFVLEDGDVLYV 201
Query: 180 PRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYL 239
PRG+ H S V E SLH+TI + + +DL+ + L+A A FR LP
Sbjct: 202 PRGHWHDVSAV-GEESLHLTIG-FNRATGVDLVSW-LADQLRAEEA----FRTDLP---- 250
Query: 240 RYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLSP 298
R+A A + +A ++ L DLL V L D D L H LP +P
Sbjct: 251 RFASRA-----EQEAHAAVLRAKLTDLLDAGVVARFLAD-RDAQAPALPHVGLPWTATP 303
>gi|302761234|ref|XP_002964039.1| hypothetical protein SELMODRAFT_405651 [Selaginella moellendorffii]
gi|300167768|gb|EFJ34372.1| hypothetical protein SELMODRAFT_405651 [Selaginella moellendorffii]
Length = 550
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/89 (44%), Positives = 52/89 (58%), Gaps = 4/89 (4%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-E 161
+GAN Y TP AQG HYDD F++QL GKK+W + V LPR +P E E
Sbjct: 258 IGANLYLTPSNAQGLGRHYDDHCVFVVQLAGKKRWSINPAATV--VLPRLYAPRLVPEAE 315
Query: 162 IGTPILTVT-LEPGDLLYLPRGYIHQAST 189
+ + L T L G +LY+PRG+ H+AST
Sbjct: 316 VQSRELADTLLTEGSVLYIPRGFAHRAST 344
>gi|167534643|ref|XP_001748997.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772677|gb|EDQ86327.1| predicted protein [Monosiga brevicollis MX1]
Length = 2159
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 71/150 (47%), Gaps = 17/150 (11%)
Query: 91 LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD---- 146
L L LF V + Y TPP AQ PH D + ++Q G+K+W + P+
Sbjct: 146 LERHLATLFATTVAVHGYVTPPGAQALKPHTDPYDVLVIQTYGEKQWTICTPQPAGAQNR 205
Query: 147 --------EYLPRYSSPNFSQEE----IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH 194
+ + R+S +Q E +TL+ GD+LYLP+G IH A+T +
Sbjct: 206 TDAEKAQLQEIVRHSIQGCTQYEAWQLAKMECQAITLKAGDVLYLPKGIIHYATTTESMG 265
Query: 195 SLHVTISVYQKT-AWIDLLEKAMPKALQAA 223
S H+T+S+ + T +W+ L +A L A
Sbjct: 266 STHITLSLERLTHSWLALFGRACGLGLDRA 295
>gi|357392116|ref|YP_004906957.1| hypothetical protein KSE_52260 [Kitasatospora setae KM-6054]
gi|311898593|dbj|BAJ31001.1| hypothetical protein KSE_52260 [Kitasatospora setae KM-6054]
Length = 395
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 62/122 (50%), Gaps = 4/122 (3%)
Query: 91 LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
L L+ V N Y + +GF H+DD + ++QL+G K+WK+Y P
Sbjct: 118 LAKHLENWLRTAVQVNLYGSWTGTEGFGTHWDDHDVIVVQLDGTKRWKLYGPTRTAPMNR 177
Query: 151 RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWID 210
++P E+ PI + L GD+LYLPRG+ H + ++SLH+T + Q T D
Sbjct: 178 DVAAPEQPPEQ---PIADLVLSAGDVLYLPRGWWHAVAASEGQYSLHLTCGL-QTTTGAD 233
Query: 211 LL 212
L+
Sbjct: 234 LI 235
>gi|414344306|ref|YP_006985827.1| hypothetical protein B932_3351 [Gluconobacter oxydans H24]
gi|411029641|gb|AFW02896.1| hypothetical protein B932_3351 [Gluconobacter oxydans H24]
Length = 394
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 99/217 (45%), Gaps = 27/217 (12%)
Query: 4 EKEPLYISRKQKNYYNNLITSKKIDDIL--RDNVIEFKK------NIDITLYEDGQRQTM 55
+ + I + NYYN L + + + RD F + +D++ Y D +
Sbjct: 41 NNDSILIQGRGNNYYNCLFSFDDAEALTNRRDLFYPFVRLYDGTSRVDVSDYTD----NL 96
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQT-YIKPLQQLNAS-------LQELFGCFVGANT 107
G +P V RE R + T Y +++++ S L+ V +
Sbjct: 97 SFSGINVPFYV---SRETLRNRIASGNTAYFTGIERISTSCLALKVQLEAALIATVDLSL 153
Query: 108 YQTPPTAQGFAP-HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+ TP G P HYD+++ ILQ+ G+K WK++ D L S N S+ +
Sbjct: 154 FVTPSNTVGNLPRHYDNVDGLILQISGRKTWKLWGIAATDPLLVEKRS-NVSEMATDDNV 212
Query: 167 -LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
T TLE GDLL++P G IH+ T T++HS+H+T+ V
Sbjct: 213 HATYTLEAGDLLFVPAGMIHETDT-TDDHSIHITVGV 248
>gi|453328219|dbj|GAC89443.1| cupin 4 family protein [Gluconobacter thailandicus NBRC 3255]
Length = 394
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 99/217 (45%), Gaps = 27/217 (12%)
Query: 4 EKEPLYISRKQKNYYNNLITSKKIDDIL--RDNVIEFKK------NIDITLYEDGQRQTM 55
+ + I + NYYN L + + + RD F + +D++ Y D +
Sbjct: 41 NNDSILIQGRGNNYYNCLFSFDDAEALTNRRDLFYPFVRLYDGTSRVDVSDYTD----NL 96
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQT-YIKPLQQLNAS-------LQELFGCFVGANT 107
G +P V RE R + T Y +++++ S L+ V +
Sbjct: 97 SFSGINVPFYV---SRETLRNRIASGNTAYFTGIERISTSCLALKVQLEAALIATVDLSL 153
Query: 108 YQTPPTAQGFAP-HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+ TP G P HYD+++ ILQ+ G+K WK++ D L S N S+ +
Sbjct: 154 FVTPSNTVGNLPRHYDNVDGLILQISGRKTWKLWGIAATDPLLVEKRS-NVSEMATDDNV 212
Query: 167 -LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
T TLE GDLL++P G IH+ T T++HS+H+T+ V
Sbjct: 213 HATYTLEAGDLLFVPAGMIHETDT-TDDHSIHITVGV 248
>gi|302768931|ref|XP_002967885.1| hypothetical protein SELMODRAFT_408768 [Selaginella moellendorffii]
gi|300164623|gb|EFJ31232.1| hypothetical protein SELMODRAFT_408768 [Selaginella moellendorffii]
Length = 895
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 49/94 (52%), Gaps = 14/94 (14%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWK------VYLPRMVDEYLPR-YSSP 155
+GAN Y TP AQG HYDD F++QL G K+W V LPR+ Y PR
Sbjct: 258 IGANLYLTPSNAQGLGRHYDDHCVFVVQLAGTKRWSINPAATVVLPRL---YAPRLVPKA 314
Query: 156 NFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
E+ +LT G LY+PRG+ HQAST
Sbjct: 315 EVQSRELAESVLT----EGSALYIPRGFAHQAST 344
>gi|381163845|ref|ZP_09873075.1| Cupin superfamily protein [Saccharomonospora azurea NA-128]
gi|379255750|gb|EHY89676.1| Cupin superfamily protein [Saccharomonospora azurea NA-128]
Length = 388
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 57/223 (25%), Positives = 90/223 (40%), Gaps = 24/223 (10%)
Query: 9 YISRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQT----MDVEG 59
++ + + +L++ +D++L R ++ D R V G
Sbjct: 24 WVHGPTADGFEDLLSLDGVDELLTQHSLRSPAFRLVRDGRTVPLSDCTRTARIGGTSVSG 83
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
A P V +G ++ Y PL L+ G N Y TPP AQG P
Sbjct: 84 VADPARVLAAVEDGATLVLQGLHRYWPPLVGFCRELELELGHPCQVNAYVTPPGAQGLRP 143
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT-PILTVTLEPGDLLY 178
H D + F+LQ G K W+V + +P + G + V L G LY
Sbjct: 144 HTDSHDVFVLQAFGSKSWQV------------WPAPAADAGDTGADGVEDVELRAGMALY 191
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
LP G H A+ + S H+T+ ++ T W DL+E A+ + L+
Sbjct: 192 LPTGTRH-AARAQHVVSGHLTVGIH-PTRWRDLIESAVARVLR 232
>gi|384563987|ref|ZP_10011091.1| Cupin superfamily protein [Saccharomonospora glauca K62]
gi|384519841|gb|EIE97036.1| Cupin superfamily protein [Saccharomonospora glauca K62]
Length = 383
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 70/165 (42%), Gaps = 20/165 (12%)
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG 116
V G A P + +G ++ Y PL L+ G N Y TPP AQG
Sbjct: 82 VTGVADPARILAAIEDGATLVLQGLHRYWPPLVDFCRELELELGHPCQVNAYVTPPGAQG 141
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT-PILTVTLEPGD 175
PH D + F+LQ G K W+V+ E GT + V L PG
Sbjct: 142 LRPHVDTHDVFVLQAFGTKSWQVW-----------------RTSEAGTDDVENVELRPGT 184
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKAL 220
LYLP G H A+ + S H+T+ ++ T W DL+E + + L
Sbjct: 185 ALYLPTGTRH-AARAQHAVSGHLTVGIH-PTRWRDLIEAVVARLL 227
>gi|429767708|ref|ZP_19299895.1| cupin family protein [Brevundimonas diminuta 470-4]
gi|429189867|gb|EKY30683.1| cupin family protein [Brevundimonas diminuta 470-4]
Length = 377
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 83/175 (47%), Gaps = 26/175 (14%)
Query: 72 EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
EG S+ + Q P+ L++ L F GAN Y + Q F H+D + F +Q
Sbjct: 99 EGASLIAGDVQDLTAPVAALSSVLSRTFAASAGANIYCSFKGVQAFGAHFDLTDVFAVQT 158
Query: 132 EGKKKWKVY---------LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
EG+K W++Y +P V + + R+ F Q + G V + PGD+LYLPRG
Sbjct: 159 EGEKLWRIYENRADAPVEMPADVGD-MRRW----FDQTK-GRLQAEVMMRPGDVLYLPRG 212
Query: 183 YIHQASTVTNEHSLHVTISVYQKTAWI--DLLEKAMPKALQAAGATDLEFRRGLP 235
H A + SLHVT S+ I +LLE+A A+Q D FR LP
Sbjct: 213 CYHDA-LAQDAASLHVTFSIAALHGRIIFNLLEQA---AMQ-----DPAFRAYLP 258
>gi|393722730|ref|ZP_10342657.1| cupin 4 family protein [Sphingomonas sp. PAMC 26605]
Length = 395
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 96/220 (43%), Gaps = 19/220 (8%)
Query: 46 LYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKP-LQQLNASLQELFGCFVG 104
Y DG TM +E R + D + + +N I P ++ L E F
Sbjct: 71 FYLDGD-VTMPIETRRVDPARVDVFLGMGASLVVNDIERIAPEIRDWTQMLGEHFAALTN 129
Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGT 164
AN Y + Q FA H+D + F +Q EG+K W++Y R D L Q +I
Sbjct: 130 ANVYCSFMGVQAFATHFDLHDVFAVQCEGEKVWRIYQNR-ADSPLVAPHGGEEVQRQIDA 188
Query: 165 ---PILT-VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKAL 220
P+L V + PGD+LYLPRG H A + SLHVT +V T L + L
Sbjct: 189 ARGPLLAEVRMRPGDVLYLPRGCYHDA-LAQDGASLHVTFAVAPLTGMTVL------RVL 241
Query: 221 QAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMK 260
++ D FR YL A + G+P++ Q LA +
Sbjct: 242 ESEAQRDPRFR-----AYLADARVDAGQPLEAQLRTLATR 276
>gi|392954072|ref|ZP_10319624.1| hypothetical protein WQQ_36960 [Hydrocarboniphaga effusa AP103]
gi|391857971|gb|EIT68501.1| hypothetical protein WQQ_36960 [Hydrocarboniphaga effusa AP103]
Length = 383
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 67/144 (46%), Gaps = 10/144 (6%)
Query: 67 WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
W G S+ + +Q+L + + N Y + A F H+D +
Sbjct: 84 WAQLHSGASVVLNRLEHQSSTMQRLCSDVARFCSLPTSGNAYLSIGGAGTFGRHWDTHDV 143
Query: 127 FILQLEGKKKWKVYLPR----MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
F +QL G K+W+V+ P + D+ R P +Q P+L VTL+ GDLLYLPRG
Sbjct: 144 FAVQLIGSKRWRVFEPTYALPLADQTSQRSPLPCPAQ-----PLLDVTLQAGDLLYLPRG 198
Query: 183 YIHQASTVTNEHSLHVTISVYQKT 206
+ H+ +E S H++I Y T
Sbjct: 199 WWHEV-LPGDEPSFHLSIGSYAPT 221
>gi|163796488|ref|ZP_02190448.1| hypothetical protein BAL199_11012 [alpha proteobacterium BAL199]
gi|159178338|gb|EDP62882.1| hypothetical protein BAL199_11012 [alpha proteobacterium BAL199]
Length = 398
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 66/138 (47%), Gaps = 8/138 (5%)
Query: 88 LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDE 147
+ + + LFG N Y + G+A H+D ++ F +Q+ G+K W +Y R +
Sbjct: 116 IAAVTRCFERLFGTNTSCNMYCSWQQVPGYASHFDTMDVFAIQITGEKTWNIYDGRFREA 175
Query: 148 YLPRYSSP-NFSQEE----IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
P +F+ E+ G +T+ PGD+LYLPRG H A T+ SLH++ V
Sbjct: 176 TFTAGIRPSDFTVEQHNRMRGKVAQRITMRPGDILYLPRGVYHDA-LATDSASLHLSFGV 234
Query: 203 YQKTAW--IDLLEKAMPK 218
+ + + +L PK
Sbjct: 235 SPQVGFTVVGMLASEAPK 252
>gi|357479435|ref|XP_003610003.1| Lysine-specific demethylase NO66 [Medicago truncatula]
gi|355511058|gb|AES92200.1| Lysine-specific demethylase NO66 [Medicago truncatula]
Length = 784
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 77/148 (52%), Gaps = 11/148 (7%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGC-FVGANTYQTPPTAQGFAPHYDDIEAFI 128
Y+EG ++ + + + + +L +FG VGAN Y TP +QG A H+DD F+
Sbjct: 437 YKEGYTVALRGLEFRYQRIASIADTLALMFGQPSVGANLYLTPANSQGLARHFDDHCVFV 496
Query: 129 LQLEGKKKWKVYLPRMVDEYLPR-YSSPNFSQEEIGTPI-LTVTLEPGDLLYLPRGYIHQ 186
Q+ G KKW V+ + LPR Y S S + + L+ GD+LY+PRG+ H+
Sbjct: 497 CQIFGSKKWTVF--SRPGQLLPRLYDSLCGSDVDCTKAVRREFFLKEGDVLYIPRGFPHE 554
Query: 187 ASTV------TNEHSLHVTISVYQKTAW 208
A T + SLH+T+S+ + A+
Sbjct: 555 AYTNYGVDDGSPGFSLHLTLSIEVEPAF 582
>gi|395493847|ref|ZP_10425426.1| cupin 4 family protein [Sphingomonas sp. PAMC 26617]
Length = 400
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 110/275 (40%), Gaps = 33/275 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITL---------YEDGQR 52
+W+++P+++ L+ + ++L N+ + L Y DG
Sbjct: 24 YWDRQPVHLPSSAGGR-GGLLNWDRFSELLGTGGHWTAANLKLILNSRPILPDFYLDGDP 82
Query: 53 QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
RA P V G S+ + + L++ L F AN Y +
Sbjct: 83 AAPIDTRRADPARVDVFLGMGASLVADDVERIAPELRRWTTLLGTEFAALTNANVYCSFK 142
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-------EIGTP 165
Q FA HYD + F LQ EG+K W++Y R + S+P+ +E G
Sbjct: 143 GVQAFATHYDLHDVFALQCEGEKVWRIYRNRAENP----VSAPHGGEEVQRQIDAARGDL 198
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGA 225
+ V + PGD+LYLPRG H A SLHV+ +V L A+ +AL+ A
Sbjct: 199 MAEVRMRPGDVLYLPRGCYHDALAQAGA-SLHVSFAV------APLGAMALLRALEEAAL 251
Query: 226 TDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMK 260
+ FR YL A G P+ Q LA +
Sbjct: 252 REPAFR-----AYLADARSDAGGPLAAQLQALASR 281
>gi|389875092|ref|YP_006374448.1| Lysine-specific demethylase [Tistrella mobilis KA081020-065]
gi|388532272|gb|AFK57466.1| Lysine-specific demethylase [Tistrella mobilis KA081020-065]
Length = 389
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 113/267 (42%), Gaps = 36/267 (13%)
Query: 16 NYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCS 75
+ +N+L+ S+++ D D ++ D+ E D++ +P+++ EG S
Sbjct: 44 STFNSLLGSERLLD---DRTRMVRRGRDLP-REMWSYLGPDMKRVVVPELLQRFCHEGLS 99
Query: 76 IRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKK 135
I Q + + L L++ +G N Y + F H+D + +L L G+K
Sbjct: 100 IALNQIQHAVPGIAALVKMLEQALPARIGTNVYASFGRESAFRAHHDPHDVLVLHLHGRK 159
Query: 136 KW------KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
+W + P M + + +G LEPGD+LYLPRG IH+AS
Sbjct: 160 RWFGFGHQSAHPPGM-----------PLTNDRLGPVHWEQVLEPGDILYLPRGEIHRAS- 207
Query: 190 VTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKP 249
V E SLH+T ++ DLL L G + R +P+ G
Sbjct: 208 VEGEASLHLTNALLWPRG-SDLLHWLAGTGLAGNGLAADDLDRDIPV---------YGTA 257
Query: 250 VDIQADRLAMKENLKDLLAKLVEYVDL 276
D++ R A E L+ L +L + +DL
Sbjct: 258 TDLE--RCA--EELRATLHRLADSIDL 280
>gi|167523972|ref|XP_001746322.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775084|gb|EDQ88709.1| predicted protein [Monosiga brevicollis MX1]
Length = 2283
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 70/150 (46%), Gaps = 17/150 (11%)
Query: 91 LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD---- 146
L L LF V + Y TP AQ PH D + ++Q G+K+W + P+
Sbjct: 146 LERQLTTLFATTVAVHGYVTPTGAQALKPHTDPYDVLVIQTYGEKQWTICTPQPAGAQNR 205
Query: 147 --------EYLPRYSSPNFSQEEI----GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH 194
+ + R+S +Q E +TL+ GD+LYLP+G IH A+T +
Sbjct: 206 TDAEKAQLQEIVRHSIQGCTQYEAWQLAKMECQAITLKAGDVLYLPKGIIHYATTTESMG 265
Query: 195 SLHVTISVYQKT-AWIDLLEKAMPKALQAA 223
S H+T+S+ + T +W+ L +A L A
Sbjct: 266 STHITLSLERLTHSWLALFGRACGLGLDRA 295
>gi|325929645|gb|ADZ45327.1| cupin 4 [Streptomyces sp. NRRL 30471]
Length = 316
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 91/197 (46%), Gaps = 24/197 (12%)
Query: 21 LITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD--VEGRALPQIVWDHY-------- 70
L+ ++DILR NV+ F + + +D R ++ R + V + Y
Sbjct: 49 LLDIPALEDILRWNVLRFPY---VFMSKDQARLPLETYCSNRRVFSYVQEGYLDGGKTVA 105
Query: 71 --REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFI 128
+G S+ + + + + S Q L AN + TP Q H+DD FI
Sbjct: 106 ALEKGASLYLNHLEDWHPGFGRYLQSWQSLPVAKTVANAFITPAGTQAVGRHWDDSHNFI 165
Query: 129 LQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT-LEPGDLLYLPRGYIHQA 187
LQ G K+W++Y P P P+ + E++ T +L T LEPGD++Y P + H+A
Sbjct: 166 LQTAGSKRWRIYAP-------PPNPHPDMTAEDLKTELLLDTVLEPGDVIYFPPAFPHEA 218
Query: 188 STVTNEHSLHVTISVYQ 204
++ S+H+T + Y+
Sbjct: 219 EG-SDGLSVHLTFAAYE 234
>gi|290960087|ref|YP_003491269.1| hypothetical protein SCAB_57031 [Streptomyces scabiei 87.22]
gi|260649613|emb|CBG72728.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length = 407
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 66/117 (56%), Gaps = 13/117 (11%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSP-NFSQ 159
V N Y T A GF H+DD + I+QL G+KKW+V R +P Y S P N +
Sbjct: 139 VQVNVYLTTNDAAGFPLHWDDHDVVIVQLAGEKKWEV---RTASRNVPMYRDSDPNNTAS 195
Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTV--TNEHSLHVTISVYQKT--AWIDLL 212
+EI I + + GD++++PRG+ HQA+ + +SLHVT + ++T +W+ L
Sbjct: 196 DEI---IWSGVMRAGDVMHIPRGHWHQATRTGSGSGNSLHVTFGITKRTGASWLAWL 249
>gi|453054469|gb|EMF01921.1| hypothetical protein H340_03854 [Streptomyces mobaraensis NBRC
13819 = DSM 40847]
Length = 398
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 94/189 (49%), Gaps = 24/189 (12%)
Query: 38 FKKNIDI---TLYEDGQRQTMDVEGRALPQI----VWDHYREGCSIRFLNPQTYIKPLQQ 90
F+K +++ Y D ++ G+++P + + + REG ++ + ++
Sbjct: 62 FQKGVEVHPAVYYTD----SVSPRGQSIPMVNMRSLGNLLREGVTLILDQANVFDPTMEV 117
Query: 91 LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
+LQ V N Y T A GF H+DD + I+QL G+K+W+V R +P
Sbjct: 118 ACRALQWWSHERVQVNAYLTTNDAAGFPLHWDDHDVLIVQLAGEKEWEV---RAASRKVP 174
Query: 151 RY--SSPNFS-QEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQK 205
Y S PN + +EI I + + GD++++PRG+ HQAS + SLHVT + ++
Sbjct: 175 MYRDSDPNNTPSDEI---IWSGVMRTGDVMHIPRGHWHQASRTGSGSGKSLHVTFGITKR 231
Query: 206 T--AWIDLL 212
T +W+ L
Sbjct: 232 TGASWLTWL 240
>gi|354614310|ref|ZP_09032183.1| Cupin 4 family protein [Saccharomonospora paurometabolica YIM
90007]
gi|353221342|gb|EHB85707.1| Cupin 4 family protein [Saccharomonospora paurometabolica YIM
90007]
Length = 308
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 67/132 (50%), Gaps = 6/132 (4%)
Query: 83 TYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLP 142
T+ L+ +LQ V NTY T A GF H+DD + I+QL G+K W+V P
Sbjct: 8 TFDPTLEVACQALQWWSHELVQVNTYLTTNDAAGFDLHWDDHDVIIVQLGGEKGWEVRGP 67
Query: 143 RMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVT--NEHSLHVTI 200
+ + +EI + + TL GD++++PRG+ HQA+ V + HSLH T
Sbjct: 68 SRPAPMYRDHERNDTPSDEI---VWSGTLCAGDVMHIPRGHWHQATRVARGDGHSLHATF 124
Query: 201 SVYQKTAWIDLL 212
++T +D L
Sbjct: 125 GFVKRTG-VDWL 135
>gi|86739110|ref|YP_479510.1| cupin [Frankia sp. CcI3]
gi|86565972|gb|ABD09781.1| Cupin 4 [Frankia sp. CcI3]
Length = 406
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 72/147 (48%), Gaps = 9/147 (6%)
Query: 71 REGCSIRFLNPQTYIKP-LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
+ GC++ L+ + P L+ + Q V AN Y T A GF+ H+DD + +L
Sbjct: 100 QSGCTL-VLDAVNHFDPTLEVACRAFQWWLRAPVQANVYLTTGDAAGFSLHWDDHDVIVL 158
Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
QL G K+W+V P P Y + E + + T+ GD+LY+PRG+ H+AS
Sbjct: 159 QLAGDKEWEVRGP---SRRAPMYRDAAPNTEPPKDIVWSGTVNTGDVLYIPRGHWHRASR 215
Query: 190 VT--NEHSLHVTISVYQKTA--WIDLL 212
+ + SLH T ++T W+ L
Sbjct: 216 TSRGDGFSLHATFGFTRRTGVDWLAWL 242
>gi|221201735|ref|ZP_03574773.1| cupin superfamily protein [Burkholderia multivorans CGD2M]
gi|221207190|ref|ZP_03580200.1| cupin superfamily protein [Burkholderia multivorans CGD2]
gi|421473482|ref|ZP_15921589.1| cupin domain protein, PF06172 family [Burkholderia multivorans ATCC
BAA-247]
gi|221172778|gb|EEE05215.1| cupin superfamily protein [Burkholderia multivorans CGD2]
gi|221178551|gb|EEE10960.1| cupin superfamily protein [Burkholderia multivorans CGD2M]
gi|400220866|gb|EJO51370.1| cupin domain protein, PF06172 family [Burkholderia multivorans ATCC
BAA-247]
Length = 384
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 21/174 (12%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
R + +V+D+ R+G ++ N + QL+ L E G V ++ Y F
Sbjct: 78 RLIKPVVYDYLRKGATL-IANKIATEPKVNQLSRQLIEFTGRQVVSSAYLAFGERDSFRC 136
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRM--------VDEYLPRYSSPNFSQEEIGTPILTVTL 171
H+D + F +QL G+K+W +Y P + +Y Y P+ TP + V L
Sbjct: 137 HWDTRDVFAIQLIGRKRWVLYEPSLEAPLYMQQSKDYEGLYPCPD-------TPYMDVML 189
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKA---MPKALQA 222
E GDLLYLPRG+ H V E + H+ + ID L A MP L A
Sbjct: 190 EAGDLLYLPRGWWHNPLPV-GEATFHLAFGTFPAYV-IDYLSWAINRMPHLLDA 241
>gi|348171193|ref|ZP_08878087.1| cupin 4 [Saccharopolyspora spinosa NRRL 18395]
Length = 366
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 70/156 (44%), Gaps = 9/156 (5%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
REGC++ ++ ++ +LQ V NTY T A GF H+DD + ++Q
Sbjct: 69 REGCTLVLDTLDSFDPAMEIACQALQWWSRELVQVNTYLTTSNAAGFNLHWDDHDVVVVQ 128
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
L G+K W+V R P Y + + T+ GD+ ++PRGY H+A+
Sbjct: 129 LGGEKSWEV---RGTSRVAPMYRDVERADAPSEEVLWAGTMRAGDVFHIPRGYWHRATRT 185
Query: 191 TNE-----HSLHVTISVYQKTAWIDLLEKAMPKALQ 221
+SLH T ++T +D L ++ Q
Sbjct: 186 DRTDDRAGYSLHATFGFVKRTG-VDWLSWIADRSRQ 220
>gi|221213317|ref|ZP_03586292.1| cupin superfamily protein [Burkholderia multivorans CGD1]
gi|221166769|gb|EED99240.1| cupin superfamily protein [Burkholderia multivorans CGD1]
Length = 384
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 21/174 (12%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
R + +V+D+ R+G ++ N + QL+ L E G V ++ Y F
Sbjct: 78 RLIKPVVYDYLRKGATL-IANKIATEPKVNQLSRQLIEFTGRQVVSSAYLAFGERDSFRC 136
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRM--------VDEYLPRYSSPNFSQEEIGTPILTVTL 171
H+D + F +QL G+K+W +Y P + +Y Y P+ TP + V L
Sbjct: 137 HWDTRDVFAIQLIGRKRWVLYEPSLEAPLYMQQSKDYEGLYPCPD-------TPYMDVML 189
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKA---MPKALQA 222
E GDLLYLPRG+ H V E + H+ + ID L A MP L A
Sbjct: 190 EAGDLLYLPRGWWHNPLPV-GEATFHLAFGTFPAYV-IDYLSWAINRMPHLLDA 241
>gi|161526044|ref|YP_001581056.1| cupin 4 family protein [Burkholderia multivorans ATCC 17616]
gi|189349241|ref|YP_001944869.1| hypothetical protein BMULJ_00360 [Burkholderia multivorans ATCC
17616]
gi|160343473|gb|ABX16559.1| Cupin 4 family protein [Burkholderia multivorans ATCC 17616]
gi|189333263|dbj|BAG42333.1| hypothetical protein BMULJ_00360 [Burkholderia multivorans ATCC
17616]
Length = 384
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 21/174 (12%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
R + +V+D+ R+G ++ N + QL+ L E G V ++ Y F
Sbjct: 78 RLIKPVVYDYLRKGATL-IANKIATEPKVNQLSRQLIEFTGRQVVSSAYLAFGERDSFRC 136
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRM--------VDEYLPRYSSPNFSQEEIGTPILTVTL 171
H+D + F +QL G+K+W +Y P + +Y Y P+ TP + V L
Sbjct: 137 HWDTRDVFAIQLIGRKRWVLYEPSLEAPLYMQQSKDYEGLYPCPD-------TPYMDVML 189
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKA---MPKALQA 222
E GDLLYLPRG+ H V E + H+ + ID L A MP L A
Sbjct: 190 EAGDLLYLPRGWWHNPLPV-GEATFHLAFGTFPAYV-IDYLSWAINRMPHLLDA 241
>gi|357407532|ref|YP_004919455.1| hypothetical protein SCAT_p0163 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|337762481|emb|CCB71189.1| putative Unnamed protein product [Streptomyces cattleya NRRL 8057 =
DSM 46488]
Length = 395
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 62/115 (53%), Gaps = 9/115 (7%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL-PRYSSPNFSQEE 161
GA+ + TP +AQGFAPH D+ + +L +EG+K W MV+E P +E
Sbjct: 128 AGAHVFATPRSAQGFAPHRDEHGSLVLHMEGRKSW------MVEERAGPPRPGAVPPREW 181
Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
+L PGD+LY+P+ H A T E SLHVT+ ++ W D+L++ +
Sbjct: 182 ARLRFREFSLGPGDVLYIPQHCGHAART-GAERSLHVTLRLF-PLRWRDVLDEVV 234
>gi|302535492|ref|ZP_07287834.1| LOW QUALITY PROTEIN: cupin 4 family protein [Streptomyces sp. C]
gi|302444387|gb|EFL16203.1| LOW QUALITY PROTEIN: cupin 4 family protein [Streptomyces sp. C]
Length = 300
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 88/207 (42%), Gaps = 12/207 (5%)
Query: 3 WEKEPLYISRKQKNYYNNL--ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
W++EP + +L + + +LR +E ++ + + D + V G
Sbjct: 45 WQREPAVFTPPSSAPPFDLDELDAAFDSGLLRTPYLEMVRSNKVVVPPDAYTASRVVNGI 104
Query: 61 ALPQI-----VWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
V R G ++ + +P +L A L E V A + TP Q
Sbjct: 105 TYSGFADRAKVVAQLRAGATLLLRCVDQWHRPTGELVARLSEELDRRVEAFFFVTPVGGQ 164
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G A H DD + F+LQ+ G K W V+ V ++ P +E +L LEPG
Sbjct: 165 GLAVHRDDADVFVLQVAGSKTWYVHDAPAVADWQPG----ELPDDEHSPRLLHTLLEPGG 220
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISV 202
+LY+PRG+ H+A S H+++++
Sbjct: 221 VLYVPRGFAHRAVGAAG-LSAHLSLTI 246
>gi|386353479|ref|YP_006051726.1| hypothetical protein [Streptomyces cattleya NRRL 8057 = DSM 46488]
gi|365811558|gb|AEW99773.1| hypothetical protein SCATT_p15800 [Streptomyces cattleya NRRL 8057
= DSM 46488]
Length = 395
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 62/115 (53%), Gaps = 9/115 (7%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL-PRYSSPNFSQEE 161
GA+ + TP +AQGFAPH D+ + +L +EG+K W MV+E P +E
Sbjct: 128 AGAHVFATPRSAQGFAPHRDEHGSLVLHMEGRKSW------MVEERAGPPRPGAVPPREW 181
Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
+L PGD+LY+P+ H A T E SLHVT+ ++ W D+L++ +
Sbjct: 182 ARLRFREFSLGPGDVLYIPQHCGHAART-GAERSLHVTLRLF-PLRWRDVLDEVV 234
>gi|318060501|ref|ZP_07979224.1| hypothetical protein SSA3_21343 [Streptomyces sp. SA3_actG]
gi|318077244|ref|ZP_07984576.1| hypothetical protein SSA3_11103 [Streptomyces sp. SA3_actF]
Length = 403
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 58/121 (47%), Gaps = 8/121 (6%)
Query: 100 GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ 159
G AN Y GF H+DD E +QL G+K W+V P P Y ++
Sbjct: 127 GELTSANAYLAVGETDGFHLHWDDHEVIAVQLSGEKAWEVRGP---SRTAPMYRDAERNR 183
Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTV---TNEHSLHVTISVYQKTA--WIDLLEK 214
+ TL GD++++PRGY H A+ + ++ HSLHVT ++T W++ L
Sbjct: 184 TPSEDVLWKGTLRAGDVMHIPRGYWHTATRLGSGSSGHSLHVTFGFTKRTGVTWVNFLSD 243
Query: 215 A 215
A
Sbjct: 244 A 244
>gi|418459769|ref|ZP_13030880.1| cupin 4 family protein [Saccharomonospora azurea SZMC 14600]
gi|359740082|gb|EHK88931.1| cupin 4 family protein [Saccharomonospora azurea SZMC 14600]
Length = 398
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/223 (25%), Positives = 87/223 (39%), Gaps = 18/223 (8%)
Query: 9 YISRKQKNYYNNLITSKKIDDIL-----RDNVIEFKKNIDITLYEDGQRQT----MDVEG 59
++ + + +L++ +D++L R ++ D R V G
Sbjct: 28 WVHGPTADGFEDLLSLDGVDELLTQHSLRSPAFRLVRDGRTVPLSDCTRTARIGGTSVSG 87
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
A P V +G ++ Y PL L+ G N Y TPP AQG P
Sbjct: 88 VADPARVLAAVEDGATLVLQGLHRYWPPLVGFCRELELELGHPCQVNAYVTPPGAQGLRP 147
Query: 120 HYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
H D + F+LQ G K W+V+ P E V L G LY
Sbjct: 148 HTDSHDVFVLQAFGSKSWQVWPAPAADAGDTGDTGDTGADGVE------DVELRAGMALY 201
Query: 179 LPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
LP G H A+ + S H+T+ ++ T W DL+E A+ + L+
Sbjct: 202 LPTGTRH-AARAQHVVSGHLTVGIH-PTRWRDLIESAVGRVLR 242
>gi|386840441|ref|YP_006245499.1| cupin [Streptomyces hygroscopicus subsp. jinggangensis 5008]
gi|374100742|gb|AEY89626.1| Cupin 4 [Streptomyces hygroscopicus subsp. jinggangensis 5008]
gi|451793733|gb|AGF63782.1| Cupin 4 [Streptomyces hygroscopicus subsp. jinggangensis TL01]
Length = 399
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 72/146 (49%), Gaps = 7/146 (4%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
REG ++ + ++ +LQ V N Y T A+GF H+DD + I+Q
Sbjct: 98 REGATLILDQLNVFDPTMEVACRALQWWSHERVQVNAYLTTNDAEGFPLHWDDHDVVIVQ 157
Query: 131 LEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
L G+K+W+V R +P Y + + I T L GD++++PRG+ H+A+
Sbjct: 158 LAGEKQWEV---RTASRKVPMYRDCDPNDTPSAETIWTGVLRTGDVMHIPRGHWHRATRS 214
Query: 191 TN--EHSLHVTISVYQKT--AWIDLL 212
+ SLHVT + ++T +W+ L
Sbjct: 215 GSGPGKSLHVTFGITRRTGASWLAWL 240
>gi|397171987|ref|ZP_10495383.1| RmlC-like domain-containing protein [Alishewanella aestuarii B11]
gi|396086329|gb|EJI83943.1| RmlC-like domain-containing protein [Alishewanella aestuarii B11]
Length = 384
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 104/219 (47%), Gaps = 30/219 (13%)
Query: 2 HWEKEPLYISRKQKNYYNNL----ITSKKIDDILRDNVIEFKKN---IDITLYEDGQRQT 54
+W+K+PL I + KN+ + L + ++D + ++E + ++ +ED R
Sbjct: 18 YWQKQPLLIKQGFKNFQDPLSADELAGLALEDDVESRLVECRDGNWQLETGPFEDFNRYG 77
Query: 55 MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
D + L Q V DH+ + + P +I N + +L F TP
Sbjct: 78 -DKDWTILVQAV-DHWHPPAA-ELIEPFRFIP-----NWRIDDLMVSF------STP--G 121
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRM-VDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH D + FI+Q EGK+ W+V +P + ++ P P Q T + V +EP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGMPDASLKQFCPH---PRLLQVTPFTACIDVIMEP 178
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
GD+LY+P G H+ V+ E+SL+ ++ ++ A DLL
Sbjct: 179 GDILYIPPGCPHEG--VSLENSLNYSVG-FRAPAQKDLL 214
>gi|160898500|ref|YP_001564082.1| cupin 4 family protein [Delftia acidovorans SPH-1]
gi|160364084|gb|ABX35697.1| Cupin 4 family protein [Delftia acidovorans SPH-1]
Length = 298
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 55/119 (46%), Gaps = 7/119 (5%)
Query: 89 QQLNASLQELFGCFVGA----NTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM 144
Q+L L E F A N Y F H+D + +QL GKK W+VY P
Sbjct: 105 QELVRRLCEEVASFTNAETTANAYLAFSGEGSFGSHWDTHDVMAIQLIGKKHWRVYAPTY 164
Query: 145 VDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
LP +S +F PI LE GDLLY+PRG+ H+ + +LHV I +Y
Sbjct: 165 KSP-LPGQTSKSFDSTCPTDPIFDGVLEAGDLLYVPRGWWHEVLPIGE--TLHVAIGIY 220
>gi|375108697|ref|ZP_09754951.1| RmlC-like domain-containing protein [Alishewanella jeotgali KCTC
22429]
gi|374570883|gb|EHR42012.1| RmlC-like domain-containing protein [Alishewanella jeotgali KCTC
22429]
Length = 384
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 104/219 (47%), Gaps = 30/219 (13%)
Query: 2 HWEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKN---IDITLYEDGQRQT 54
+W+K+PL I + KN+ + N + ++D + ++E + ++ +ED R
Sbjct: 18 YWQKQPLLIKQGFKNFQDPLSANELAGLALEDDVESRLVECRDGNWQLETGPFEDFSRYG 77
Query: 55 MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
D + L Q V DH+ + + P +I N + +L F TP
Sbjct: 78 -DKDWTILVQAV-DHWHPPAA-ELIEPFRFIP-----NWRIDDLMVSF------STP--G 121
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRM-VDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH D + FI+Q EGK+ W+V +P + ++ P P Q T + V +EP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGMPDASLKQFCPH---PRLLQVTPFTACIDVIMEP 178
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
GD+LY+P G H+ ++ E+SL+ ++ ++ A DLL
Sbjct: 179 GDILYIPPGCPHEG--ISLENSLNYSVG-FRAPAQKDLL 214
>gi|386384473|ref|ZP_10069846.1| hypothetical protein STSU_15719 [Streptomyces tsukubaensis
NRRL18488]
gi|385668044|gb|EIF91414.1| hypothetical protein STSU_15719 [Streptomyces tsukubaensis
NRRL18488]
Length = 398
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 66/119 (55%), Gaps = 17/119 (14%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFS-Q 159
V N Y T A GF H+DD + I+QL G+K W+V R +P Y + PN +
Sbjct: 130 VQVNAYLTTNEASGFPLHWDDHDVVIVQLAGEKGWEV---RGTSRSVPLYRDADPNDTPS 186
Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH----SLHVTISVYQKT--AWIDLL 212
EEI I + ++ GD++++PRG+ HQA+ N H SLHVT + ++T +W+ L
Sbjct: 187 EEI---IWSGVMKAGDMMHIPRGHWHQAT--RNGHGSGKSLHVTFGITKRTGASWLAWL 240
>gi|167538117|ref|XP_001750724.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770748|gb|EDQ84429.1| predicted protein [Monosiga brevicollis MX1]
Length = 2295
Score = 65.1 bits (157), Expect = 5e-08, Method: Composition-based stats.
Identities = 43/142 (30%), Positives = 63/142 (44%), Gaps = 28/142 (19%)
Query: 94 SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS 153
+L + F V + Y T P + PH D + F+ Q+ G K W++ +PR+ D + + S
Sbjct: 212 ALTDAFNVPVTQHVYITGPAGRALNPHTDGGDVFVRQISGSKHWQLCVPRLQDPSVCQPS 271
Query: 154 SPNFS------------------QEEIGTPIL----------TVTLEPGDLLYLPRGYIH 185
+P+ + Q E TP +TL PGD LYLPRG +H
Sbjct: 272 APSPTAHPCTDGARARYAEWKRDQFEGCTPYTMGQLADMDCSNITLHPGDTLYLPRGIVH 331
Query: 186 QASTVTNEHSLHVTISVYQKTA 207
A T + S HVT + K A
Sbjct: 332 HAWTDSGITSTHVTYQLQSKDA 353
>gi|296534974|ref|ZP_06897266.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
gi|296264700|gb|EFH11033.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
Length = 406
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 80/347 (23%), Positives = 148/347 (42%), Gaps = 46/347 (13%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------- 52
++++K+PL++ + ++++ ++I+ +L I +++ + L DG
Sbjct: 20 EYYDKQPLHVQGGAAKF-ASVLSWRQINRLLDQTHIWSSQSLKLVL--DGNAVPPEQYCG 76
Query: 53 --------QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVG 104
Q M E + + V + G S+ LN + P L A L G +G
Sbjct: 77 RATSRDNAQVMQPEAHKVAEWV----KRGASV-VLNDVDSLTP--GLTAVSGALEGAGLG 129
Query: 105 ---ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ-- 159
AN Y + + + F H+D + + +Q+EG+K W ++ R + +P + + Q
Sbjct: 130 KAQANVYISWQSHKAFPSHFDTHDVWAVQVEGEKTWNIWEGR-AEWPIPHPAFRSLGQAH 188
Query: 160 --EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMP 217
+ G VTL+ GDLLYLPRG+ H A S+H+ V+ +DL+ +
Sbjct: 189 HDQAKGKLRGQVTLKTGDLLYLPRGWYHDA-LAEAPSSVHIAYGVHAPLG-MDLVNILLE 246
Query: 218 KALQAAGATDLEFRRGLP----IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLV-E 272
+ L + EFR+ LP R+A +R + + L + D+L K V +
Sbjct: 247 RVL-----YETEFRKPLPRQDGTAAARFALTSRAAQLGQRLSELTRDPKVMDVLGKFVAD 301
Query: 273 YVDLDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVY 319
Y G D + + + P EE V + G++ + G +
Sbjct: 302 YRYRRGGNDLLAARGLAAPASPAGGHEENAFRVVQPGVKAVRRGAEW 348
>gi|358462324|ref|ZP_09172458.1| Cupin, JmjC-type [Frankia sp. CN3]
gi|357071887|gb|EHI81456.1| Cupin, JmjC-type [Frankia sp. CN3]
Length = 330
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 14/117 (11%)
Query: 102 FVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFSQ 159
V N Y T + GF H+DD + ++QL G K W V R +P Y + PN +
Sbjct: 52 VVQVNAYLTTRSTGGFPLHWDDHDVVVVQLAGNKTWDV---RSHSRPVPMYRDAEPNSAP 108
Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKT-----AWI 209
E I T+ PGD+L++PRG+ HQA+ +E SLH+T ++T AWI
Sbjct: 109 SE--DVIWRGTVTPGDVLHIPRGHWHQATHAAHEEPLSLHLTFGFVRRTGVDWLAWI 163
>gi|393761583|ref|ZP_10350220.1| RmlC-like domain-containing protein [Alishewanella agri BL06]
gi|392607593|gb|EIW90467.1| RmlC-like domain-containing protein [Alishewanella agri BL06]
Length = 384
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 104/219 (47%), Gaps = 30/219 (13%)
Query: 2 HWEKEPLYISRKQKNYYNNL----ITSKKIDDILRDNVIEFKKN---IDITLYEDGQRQT 54
+W+K+PL I + KN+ + L + ++D + ++E + ++ +ED R
Sbjct: 18 YWQKQPLLIKQGFKNFQDPLSADELAGLALEDDVESRLVECRDGNWQLETGPFEDFNRYG 77
Query: 55 MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
D + L Q V DH+ + + P +I N + +L F TP
Sbjct: 78 -DKDWTILVQAV-DHWHPPAA-ELIEPFRFIP-----NWRIDDLMVSF------STP--G 121
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRM-VDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH D + FI+Q EGK+ W+V +P + ++ P P Q T + V +EP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGMPDASLKQFCPH---PRLLQVTPFTACIDVIMEP 178
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
GD+LY+P G H+ ++ E+SL+ ++ ++ A DLL
Sbjct: 179 GDILYIPPGCPHEG--ISLENSLNYSVG-FRAPAQKDLL 214
>gi|351699836|gb|EHB02755.1| MYC-induced nuclear antigen [Heterocephalus glaber]
Length = 361
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 54/125 (43%), Gaps = 26/125 (20%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVT 170
+ PHYDD+E FIL LEG+K +YLP + EY Y + +G T
Sbjct: 172 SVMNIGPHYDDVEVFILHLEGEKHSWLYLPTVAVAREYSVEY------EARLGVQTHEFT 225
Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEF 230
L+ GDLLY PRG IH Q +W D L + + D+E
Sbjct: 226 LKTGDLLYFPRGPIH------------------QNNSWGDFLLDTISGIVFDTAKEDVEL 267
Query: 231 RRGLP 235
R G+P
Sbjct: 268 RSGIP 272
>gi|345851505|ref|ZP_08804478.1| hypothetical protein SZN_17147 [Streptomyces zinciresistens K42]
gi|345637051|gb|EGX58585.1| hypothetical protein SZN_17147 [Streptomyces zinciresistens K42]
Length = 398
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 65/117 (55%), Gaps = 13/117 (11%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFS-Q 159
V N Y T A GF H+DD + I+QL G+K+W+V R +P Y + PN +
Sbjct: 130 VQVNAYLTTNDASGFPLHWDDHDVVIVQLAGEKEWEV---RATSRAVPMYRDADPNSTPS 186
Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKT--AWIDLL 212
+EI I + + GD++++PRG+ HQA+ + SLHVT + ++T +W+ L
Sbjct: 187 DEI---IWSGLMRTGDVMHIPRGHWHQATRTGSGSGKSLHVTFGITKRTGASWLAWL 240
>gi|386837115|ref|YP_006242173.1| cupin [Streptomyces hygroscopicus subsp. jinggangensis 5008]
gi|374097416|gb|AEY86300.1| Cupin 4 family protein [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451790473|gb|AGF60522.1| Cupin 4 family protein [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 316
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 65/131 (49%), Gaps = 5/131 (3%)
Query: 72 EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
+G ++ + + P +++ A+L G V A + TPP QG H DD + +LQ+
Sbjct: 96 KGATLLLRSVDHWHGPTREVLAALAGELGRGVEAFFFVTPPGHQGLPLHRDDADVLVLQV 155
Query: 132 EGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVT 191
G+K W+++ ++ P+ E IL TL+ G++LY+PRG+ H A T
Sbjct: 156 AGRKDWRIHAKPDSADWRAGPVDPSAGPEP--AEILRTTLDAGEVLYVPRGFAHSA---T 210
Query: 192 NEHSLHVTISV 202
E L +SV
Sbjct: 211 GERGLSAHLSV 221
>gi|401404980|ref|XP_003881940.1| Cupin 4 family protein, related [Neospora caninum Liverpool]
gi|325116354|emb|CBZ51907.1| Cupin 4 family protein, related [Neospora caninum Liverpool]
Length = 533
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 68/149 (45%), Gaps = 24/149 (16%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI----- 124
Y +G S+ + LQ + L + + V A +Y TPP H DD
Sbjct: 208 YLDGFSLVVNQADRTLGVLQSICHHLAKKYFTHVFAVSYLTPPNTHAVKTHTDDQSLLPG 267
Query: 125 ---EAFILQLEGKKKWKVYLP--------RMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
+ ++QL G K WK++ P M+ + P P G P+L L+
Sbjct: 268 SSQDVILIQLWGSKSWKIWTPPQVLPLTEEMLGKQGPFPGDP-------GEPLLDFVLKE 320
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTISV 202
GD+LY+PRG+ H A T T E SLHVT++V
Sbjct: 321 GDILYIPRGFPHAALT-TEEPSLHVTLTV 348
>gi|323457139|gb|EGB13005.1| hypothetical protein AURANDRAFT_60763 [Aureococcus anophagefferens]
Length = 2291
Score = 63.9 bits (154), Expect = 9e-08, Method: Composition-based stats.
Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 6/104 (5%)
Query: 105 ANTYQTPPTAQ-GFAPHYDDIEAFILQLEGKKKWKVYLP----RMVDEYLPRYSSPNFSQ 159
AN Y T P + +PH D + + Q G+K+W+V+ P R VD + +
Sbjct: 2035 ANVYLTDPGREVSVSPHTDAQDVLVFQTAGRKRWRVWRPPARRRGVDPFGRGKHGDAIAP 2094
Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQAST-VTNEHSLHVTISV 202
E+ P+L LEPGD+LY+P G+ H ST + S HVT+ +
Sbjct: 2095 AELDEPLLDAWLEPGDVLYVPIGFPHATSTEACDAVSCHVTLGL 2138
>gi|395774364|ref|ZP_10454879.1| hypothetical protein Saci8_31535 [Streptomyces acidiscabies 84-104]
Length = 398
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 5/107 (4%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI 162
V N Y T A GF H+DD + I+QL G+K+W+V R +P Y + +
Sbjct: 130 VQVNAYLTTNDAAGFPLHWDDHDVIIVQLAGEKEWEV---RSTSRKVPMYRDADPNTAPS 186
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKTA 207
I + + GD++++PRG+ HQA+ + SLHVT + ++T
Sbjct: 187 DQIIWSGVMRTGDVMHIPRGHWHQATRTGSGSGKSLHVTFGITKRTG 233
>gi|312881223|ref|ZP_07741021.1| hypothetical protein VIBC2010_08478 [Vibrio caribbenthicus ATCC
BAA-2122]
gi|309371115|gb|EFP98569.1| hypothetical protein VIBC2010_08478 [Vibrio caribbenthicus ATCC
BAA-2122]
Length = 376
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 24/197 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG-- 59
HW K+P+ I KN + + + D++ + + +ID + Q + G
Sbjct: 15 HWHKKPIVI----KNAFIDFVDPITPDELAG---LAMESDIDSRFINNKQGRWSVTHGPL 67
Query: 60 -----RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPT 113
+L + W + C+ T + P +QL L +L CF + P
Sbjct: 68 SEDFFTSLEESHWQLVVQACNHWHSGSATLLDPFKQLPQWLFDDLMVCF-------SAPR 120
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH D + FI+Q G++ WKV R +Y + Q E PI+ LEP
Sbjct: 121 G-GVGPHIDQYDVFIVQGSGQRHWKVG-ARDQGQYKESIQAGALRQIEGFDPIIDQVLEP 178
Query: 174 GDLLYLPRGYIHQASTV 190
GDLLY+P G+ H+ T+
Sbjct: 179 GDLLYIPPGFPHEGDTL 195
>gi|256088456|ref|XP_002580351.1| hypothetical protein [Schistosoma mansoni]
Length = 333
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 72/143 (50%), Gaps = 11/143 (7%)
Query: 2 HWEKEPLYISR---KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVE 58
+++K PL++ R K + N + K +D + F +++D+ + + + ++
Sbjct: 193 NYQKTPLHVKRSSSKCGEWLNYSLVKKLVDK----EFLFFGQHVDLIEDKTLKHEIVNPP 248
Query: 59 GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY--QTPPTAQG 116
GRA VW+HY +G IRF N QT+ + L+ LQE FGC Y + PT
Sbjct: 249 GRAFAGTVWEHYSQGKGIRFHNLQTFSRRLRFRVGLLQEYFGCNFTVTAYLLSSKPTELS 308
Query: 117 FAPHYDDIEAFILQLEGKKKWKV 139
F D+ FILQ EG +K+++
Sbjct: 309 FNQLDHDL--FILQQEGSQKFQI 329
>gi|350643695|emb|CCD58399.1| unnamed protein product [Schistosoma mansoni]
Length = 146
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 71/142 (50%), Gaps = 11/142 (7%)
Query: 3 WEKEPLYISR---KQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG 59
++K PL++ R K + N + K +D + F +++D+ + + + ++ G
Sbjct: 7 YQKTPLHVKRSSSKCGEWLNYSLVKKLVDK----EFLFFGQHVDLIEDKTLKHEIVNPPG 62
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTY--QTPPTAQGF 117
RA VW+HY +G IRF N QT+ + L+ LQE FGC Y + PT F
Sbjct: 63 RAFAGTVWEHYSQGKGIRFHNLQTFSRRLRFRVGLLQEYFGCNFTVTAYLLSSKPTELSF 122
Query: 118 APHYDDIEAFILQLEGKKKWKV 139
D + FILQ EG +K+++
Sbjct: 123 NQ--LDHDLFILQQEGSQKFQI 142
>gi|290956631|ref|YP_003487813.1| hypothetical protein SCAB_21351 [Streptomyces scabiei 87.22]
gi|260646157|emb|CBG69250.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length = 396
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 96/213 (45%), Gaps = 35/213 (16%)
Query: 65 IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDI 124
+V DH + F +P L+ +L G AN Y GF H+DD
Sbjct: 103 VVLDH------VDFFDPT-----LEVACRALGWWSGELTSANAYLAVGDTDGFNLHWDDH 151
Query: 125 EAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFS-QEEIGTPILTVTLEPGDLLYLPRG 182
+ +QL G+K W+V P R Y R + N + EE+ + T++ GD++++PRG
Sbjct: 152 DVIAVQLSGEKSWEVRGPSRPAPMY--RDAERNLAPSEEV---LWRGTMQAGDVMHIPRG 206
Query: 183 YIHQASTVT---NEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDLEFRRGLPIG 237
+ H A+ + + HSLHVT + ++T W + L A D +FR L
Sbjct: 207 FWHTATRIGSGDDGHSLHVTFGLTKRTGVTWANFLSD--------AARADEDFRTDLE-- 256
Query: 238 YLRYAGLARGKPVDIQADRLAMKENLKDLLAKL 270
R G A G+ + + LA + + K LA+L
Sbjct: 257 --RSDGTADGEVLAAKLTDLAREYDPKRYLAEL 287
>gi|398832643|ref|ZP_10590800.1| Cupin superfamily protein [Herbaspirillum sp. YR522]
gi|398222980|gb|EJN09336.1| Cupin superfamily protein [Herbaspirillum sp. YR522]
Length = 393
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 4/121 (3%)
Query: 86 KPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV 145
+PL ++ L AN Y + GF H+DD + I+Q++G+KKW+ +
Sbjct: 112 RPLAEMTQCLACALDVRSAANAYMSFGLTSGFGVHHDDHDVIIIQIDGRKKWQFFSLPDA 171
Query: 146 DEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
+E ++ + G +V + GD++++P+G H T NE SLH+TIS+
Sbjct: 172 NEKATVADLEAPTESDRGQ---SVIIHTGDVMFIPKGTWHDV-TAMNERSLHLTISLVNP 227
Query: 206 T 206
T
Sbjct: 228 T 228
>gi|256398046|ref|YP_003119610.1| cupin [Catenulispora acidiphila DSM 44928]
gi|256364272|gb|ACU77769.1| Cupin 4 family protein [Catenulispora acidiphila DSM 44928]
Length = 395
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 79/182 (43%), Gaps = 15/182 (8%)
Query: 38 FKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQE 97
F N ++ +R M GR L EG ++ + L+ +L
Sbjct: 73 FLTNFVSRRHQASRRADMAAVGRIL--------NEGGTLILDTINQFDPTLEVACRALGW 124
Query: 98 LFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF 157
G V N Y GF+ H+DD + ++Q+ G+K W+V R +P Y
Sbjct: 125 WTGELVSVNAYLAVGDTAGFSTHWDDHDVLVVQVAGQKSWEV---RPASRPVPMYRDAEQ 181
Query: 158 SQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKTA--WIDLLE 213
+ E + + T+ GD++++PRG+ H A+ V + SLH+T + ++T W+ L
Sbjct: 182 NLEAPEELLWSGTMNTGDVMHIPRGFWHAATRVGSGEGISLHLTFGITRRTGVTWVQHLA 241
Query: 214 KA 215
A
Sbjct: 242 DA 243
>gi|157375545|ref|YP_001474145.1| cupin 4 family protein [Shewanella sediminis HAW-EB3]
gi|157317919|gb|ABV37017.1| cupin 4 family protein [Shewanella sediminis HAW-EB3]
Length = 380
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 91/211 (43%), Gaps = 38/211 (18%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W+K+PL I R+ N + +L+T +++ + D ++E ++ +G+ Q
Sbjct: 16 DYWQKKPLVI-RQGFNNFKDLLTPEEMAGLACDEIVESRR----VYKSEGEWQA------ 64
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCF----------VGANT 107
+ S L + +Q LN L ++L CF V A +
Sbjct: 65 --------EFGPFESYEHLGESDWTLVVQALNNWLPDAEKLIQCFDFIPRWRFDDVMA-S 115
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
Y TP G PH D + FI Q G+++W+V E+ + P E PI+
Sbjct: 116 YATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHKEFA---AHPALLHTEAFDPII 170
Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
V L PGD+LYLP GY H T+ S V
Sbjct: 171 DVELLPGDILYLPPGYPHDGITLEPSMSFSV 201
>gi|428171176|gb|EKX40095.1| hypothetical protein GUITHDRAFT_143042 [Guillardia theta CCMP2712]
Length = 422
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 50/93 (53%), Gaps = 10/93 (10%)
Query: 120 HYDDIEAFILQLEGKKKWKVYLPR------MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
H D + +Q +G K+W++Y+P M D + + + EE+ P+L V LEP
Sbjct: 262 HTDKQDVIAIQAQGSKRWRIYMPPHPANKVMQDPFTRGKGNDVLTFEELEEPLLDVVLEP 321
Query: 174 GDLLYLPRGYIHQASTV----TNEHSLHVTISV 202
G +LY+P GY H TV T E S+H+T+ V
Sbjct: 322 GQMLYVPAGYPHVTDTVNGVKTKEESIHLTLGV 354
>gi|167535487|ref|XP_001749417.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772045|gb|EDQ85702.1| predicted protein [Monosiga brevicollis MX1]
Length = 2781
Score = 62.8 bits (151), Expect = 2e-07, Method: Composition-based stats.
Identities = 44/146 (30%), Positives = 62/146 (42%), Gaps = 24/146 (16%)
Query: 94 SLQELFGCFVGANT-----YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
SL+EL G++T Y + A PH D + +L L G+K W V P
Sbjct: 117 SLEELIQTAFGSSTVTSHVYLSGSEAHALNPHTDPYDVLVLHLHGQKHWTVCHPLDSSTE 176
Query: 149 LPRYSSPNFSQ------------------EEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
S +Q E TL PGD+LYLP+ IH A+T
Sbjct: 177 WADASQAQLAQLFEIQRKSVDGCTNFDIAETDKMRCEHFTLSPGDVLYLPKSTIHFATTS 236
Query: 191 TNEHSLHVTISV-YQKTAWIDLLEKA 215
N + H+T+S+ Q +WIDL+ +A
Sbjct: 237 PNTTTAHITLSLERQGQSWIDLVRRA 262
>gi|389877013|ref|YP_006370578.1| cupin superfamily protein [Tistrella mobilis KA081020-065]
gi|388527797|gb|AFK52994.1| cupin superfamily protein [Tistrella mobilis KA081020-065]
Length = 387
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 73/168 (43%), Gaps = 23/168 (13%)
Query: 44 ITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFV 103
+T GQR +D + + L Q G S+ L +++ L L A ++ V
Sbjct: 74 LTRLVGGQRVLLDAQLQTLCQ-------GGASLVLLGMHRHVQALAALAAMVERHVEAPV 126
Query: 104 GANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI- 162
N Y + F H D + ++Q+ G K+W + R S P F E +
Sbjct: 127 TINAYASFTRDGAFQIHRDGHDVLVVQIAGSKRWFCH---------GRRSDPPFGGETVD 177
Query: 163 -----GTPILTVTLEPGDLLYLPRGYIHQASTV-TNEHSLHVTISVYQ 204
G P + LEPGDLL++P+G H+A SLH+T+++ +
Sbjct: 178 PRRDPGPPEAEMVLEPGDLLFVPKGDYHRAEVADAGGQSLHLTVAIQR 225
>gi|294932895|ref|XP_002780495.1| hypothetical protein Pmar_PMAR001087 [Perkinsus marinus ATCC 50983]
gi|239890429|gb|EER12290.1| hypothetical protein Pmar_PMAR001087 [Perkinsus marinus ATCC 50983]
Length = 256
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 44/78 (56%)
Query: 125 EAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
+A ILQ +G+K W VY Y + + +G P+ +VTLE GDLL +PRG I
Sbjct: 30 DALILQCKGRKHWDVYDSSSSHPYPGEEVGKDMPIDNLGEPLYSVTLEEGDLLLIPRGVI 89
Query: 185 HQASTVTNEHSLHVTISV 202
H+A + SLH+T+ +
Sbjct: 90 HRARASQEQGSLHITVKI 107
>gi|167533989|ref|XP_001748673.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772914|gb|EDQ86560.1| predicted protein [Monosiga brevicollis MX1]
Length = 3197
Score = 62.4 bits (150), Expect = 3e-07, Method: Composition-based stats.
Identities = 44/146 (30%), Positives = 61/146 (41%), Gaps = 24/146 (16%)
Query: 94 SLQELFGCFVGANT-----YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
SL+EL G +T Y + A PH D + +L L G+K W V P
Sbjct: 141 SLEELIQTAFGTSTVTSHVYLSGSEAHALNPHTDPYDVLVLHLHGQKHWTVCHPLDSSTE 200
Query: 149 LPRYSSPNFSQ------------------EEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
S +Q E TL PGD+LYLP+ IH A+T
Sbjct: 201 WADASQAQLAQLFEMQRKSVDGCTNFDIAETDKMRCEHFTLSPGDVLYLPKSTIHFATTS 260
Query: 191 TNEHSLHVTISV-YQKTAWIDLLEKA 215
N + H+T+S+ Q +WIDL+ +A
Sbjct: 261 PNTTTAHITLSLERQGQSWIDLVRRA 286
>gi|170727053|ref|YP_001761079.1| cupin 4 family protein [Shewanella woodyi ATCC 51908]
gi|169812400|gb|ACA86984.1| Cupin 4 family protein [Shewanella woodyi ATCC 51908]
Length = 374
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 3/83 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D+ + F+LQ EG ++W+V +Y PR N + + PI+ V LE GD
Sbjct: 120 GVGPHIDNYDVFLLQGEGTRRWRV---GAKGDYTPRGGDTNTALIDDFEPIIDVVLEAGD 176
Query: 176 LLYLPRGYIHQASTVTNEHSLHV 198
+LY+P GY H A T+T S +
Sbjct: 177 MLYIPPGYPHCAETLTTALSYSI 199
>gi|312194488|ref|YP_004014549.1| hypothetical protein FraEuI1c_0601 [Frankia sp. EuI1c]
gi|311225824|gb|ADP78679.1| hypothetical protein FraEuI1c_0601 [Frankia sp. EuI1c]
Length = 412
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 69/142 (48%), Gaps = 6/142 (4%)
Query: 73 GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLE 132
G ++ +T+ ++ +L+ G V N Y T +A GF H+DD + I+QL
Sbjct: 107 GVTLVLDGLETFDPIVEVATRALRWWSGELVQTNAYLTTRSADGFPLHWDDHDVLIVQLA 166
Query: 133 GKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTN 192
G+K W V R P + ++ + L G++L++PRGY HQA+ V +
Sbjct: 167 GEKNWDV---RGSTRSAPMFRDAVPNEVASSEAVWQGVLRAGEVLHIPRGYWHQATRVEH 223
Query: 193 EH--SLHVTISVYQKTAWIDLL 212
+ SLH+T ++T +D L
Sbjct: 224 DDPVSLHLTFGFTRRTG-VDWL 244
>gi|291452955|ref|ZP_06592345.1| cupin 4 [Streptomyces albus J1074]
gi|291355904|gb|EFE82806.1| cupin 4 [Streptomyces albus J1074]
Length = 398
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 59/114 (51%), Gaps = 7/114 (6%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI 162
V N Y T A GF H+DD + I+QL G+K+W+V R P Y + +
Sbjct: 130 VQVNAYLTTNDAAGFPLHWDDHDVVIVQLAGEKEWEV---RGTSRPAPMYRDADPNSTPT 186
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKT--AWIDLL 212
+ + + GD++++PRG+ HQA+ + SLHVT V ++T +W+ L
Sbjct: 187 DEIVWSGVMRTGDVMHIPRGHWHQATRTGSGSGKSLHVTFGVTKRTGASWLAWL 240
>gi|157962362|ref|YP_001502396.1| cupin 4 family protein [Shewanella pealeana ATCC 700345]
gi|157847362|gb|ABV87861.1| Cupin 4 family protein [Shewanella pealeana ATCC 700345]
Length = 376
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G H D+ + FI+Q EG++ W V +Y P+ + P E PI
Sbjct: 113 SFATP--GGGVGAHIDNYDVFIIQGEGQRHWTV---GDKGQYAPKNNDPTTPLIEGFEPI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHS 195
+ V LE GDLLY+P G+ HQ T+TN S
Sbjct: 168 IDVVLEKGDLLYIPPGFPHQGQTITNAMS 196
>gi|365864906|ref|ZP_09404580.1| hypothetical protein SPW_4884 [Streptomyces sp. W007]
gi|364005613|gb|EHM26679.1| hypothetical protein SPW_4884 [Streptomyces sp. W007]
Length = 369
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 59/114 (51%), Gaps = 7/114 (6%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEI 162
V N Y T A GF H+DD + I+QL G+K W+V R P Y + +
Sbjct: 101 VQVNAYLTTNDAAGFPLHWDDHDVLIVQLAGEKDWEV---RGASRTAPMYRDTDANTTPS 157
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEH--SLHVTISVYQKT--AWIDLL 212
+ + T+ GD++++PRG+ HQA+ + SLHVT + +++ +W+ L
Sbjct: 158 EDIVWSGTMMAGDVMHIPRGHWHQATRTGSGSGKSLHVTFGITKRSGASWLAWL 211
>gi|77360634|ref|YP_340209.1| RmlC-like domain-containing protein [Pseudoalteromonas haloplanktis
TAC125]
gi|76875545|emb|CAI86766.1| putative enzyme with RmlC-like domain [Pseudoalteromonas
haloplanktis TAC125]
Length = 381
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 91/198 (45%), Gaps = 27/198 (13%)
Query: 3 WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKN---IDITLYEDGQRQTM 55
W+K+PL I + N+ + N + ++D + ++ N +ED ++ T
Sbjct: 19 WQKKPLLIKQGFSNFQDPLDANELAGLAMEDSIESRIVTNHNNEWQSHQGPFEDFEQLT- 77
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
+ L Q V DH+ + + L P +I N + +L +Y TP
Sbjct: 78 EQHSTLLVQAV-DHWHNDAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--GG 122
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
G PH D + FI+Q EGK+ W+V LP D L +++ + Q E T ++ LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPSLKQFTQNKTLLQVEAFTAVIDCILEPG 179
Query: 175 DLLYLPRGYIHQASTVTN 192
D+LY+P G H+ V N
Sbjct: 180 DILYIPPGCPHEGYAVEN 197
>gi|163751233|ref|ZP_02158461.1| hypothetical protein KT99_13827 [Shewanella benthica KT99]
gi|161328947|gb|EDQ00021.1| hypothetical protein KT99_13827 [Shewanella benthica KT99]
Length = 374
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + F+LQ EG ++WKV Y PR + + + PI
Sbjct: 113 SFATP--GGGVGPHIDNYDVFLLQGEGTRRWKV---GARGNYAPRGGDTHTALIDDFEPI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
+ V LE GD+LY+P GY H+ T+T S +
Sbjct: 168 IDVVLEKGDMLYIPPGYPHRGETITTALSYSI 199
>gi|167525110|ref|XP_001746890.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774670|gb|EDQ88297.1| predicted protein [Monosiga brevicollis MX1]
Length = 1934
Score = 61.6 bits (148), Expect = 5e-07, Method: Composition-based stats.
Identities = 54/202 (26%), Positives = 80/202 (39%), Gaps = 32/202 (15%)
Query: 94 SLQELFGCFVGANT-----YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
SL+EL G +T Y + A PH D + +L L G+K W V P
Sbjct: 141 SLEELIQTAFGTSTVTSHVYLSGSEAHALNPHTDPYDVLVLHLHGQKHWTVCHPLDSSPE 200
Query: 149 LPRYSSPNFSQ------------------EEIGTPILTVTLEPGDLLYLPRGYIHQASTV 190
S +Q E TL PGD+LYLP+ IH A+T
Sbjct: 201 WADASQAQLAQLFEIQRKSVDGCTNFDIAETDKMRCEHFTLSPGDVLYLPKSTIHFATTS 260
Query: 191 TNEHSLHVTISV-YQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKP 249
N + H+T+S+ Q +WIDL + L + + R +L A +P
Sbjct: 261 PNTTTAHITLSLERQGQSWIDLACTVIESTLASLETSPSGLR------WLELATFPTDEP 314
Query: 250 VDIQADRLA--MKENLKDLLAK 269
+ RL K++L+ L+A+
Sbjct: 315 CQMAQKRLLGFEKDSLRTLVAQ 336
>gi|374370246|ref|ZP_09628255.1| cupin 4 family protein [Cupriavidus basilensis OR16]
gi|373098248|gb|EHP39360.1| cupin 4 family protein [Cupriavidus basilensis OR16]
Length = 303
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 64/145 (44%), Gaps = 4/145 (2%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAF 127
D +G ++ ++ ++ L + + E AN Y F H+D +
Sbjct: 80 DLLSKGATVVLNRIDARLERVRHLCSEIAEFSNSETLANGYLAFSGRGSFGAHWDTHDVM 139
Query: 128 ILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQA 187
+QL G+K+W+VY P L +S + + PI LE GDLLYLPRG+ H A
Sbjct: 140 AIQLIGRKRWRVYKPTFPFP-LTGQTSKEYKAQCPAEPIFDGILEAGDLLYLPRGWWHNA 198
Query: 188 STVTNEHSLHV---TISVYQKTAWI 209
+ + + V T SV +W+
Sbjct: 199 TPIGETFHIAVGMHTASVVHYVSWL 223
>gi|449688443|ref|XP_002157896.2| PREDICTED: lysine-specific demethylase NO66-like [Hydra
magnipapillata]
Length = 190
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 62/111 (55%), Gaps = 6/111 (5%)
Query: 230 FRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMH 289
FR GLPI + + G++ + + D + +K L+ KL++YV++D+ DE+ M
Sbjct: 2 FREGLPIDFSSFVGVSNSEKKCPERDTFV--KTVKKLMEKLIDYVEIDEAGDELVLDHME 59
Query: 290 DALPPVLSPEELQCSVF-ENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+ PP S ++++CSVF ++G GEV +I T++RLVR VR
Sbjct: 60 EFQPPYYSSDDVECSVFSKDGF---WDGEVRRHHKIELSTELRLVRPGIVR 107
>gi|404379958|ref|ZP_10985005.1| hypothetical protein HMPREF9021_01816 [Simonsiella muelleri ATCC
29453]
gi|294482629|gb|EFG30319.1| hypothetical protein HMPREF9021_01816 [Simonsiella muelleri ATCC
29453]
Length = 384
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 80/166 (48%), Gaps = 21/166 (12%)
Query: 66 VWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG----FAP 119
V++H R+G +I + +N + + ++ + FV A T + A G F
Sbjct: 84 VYEHLRKGATIVYNRINNEPLVD-------AIAKHIAQFVQAQTVVSGYLAFGSDASFKN 136
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFSQEEIGTPILTVTLEPGDLL 177
H+D + F +QL GKK W +Y P +P Y S + + E P + V LE GD+L
Sbjct: 137 HWDTRDVFAVQLMGKKHWSLYAPNFE---MPLYMQQSKDVNVPEPQHPDMEVVLEAGDIL 193
Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQ 221
Y+PRG+ H + E + H+ I + T +++ L K +P ++
Sbjct: 194 YIPRGWWHSPVPMNCE-TFHLAIGTFAPTGYQYMEWLMKKVPSIVE 238
>gi|294140589|ref|YP_003556567.1| hypothetical protein SVI_1818 [Shewanella violacea DSS12]
gi|293327058|dbj|BAJ01789.1| conserved hypothetical protein [Shewanella violacea DSS12]
Length = 374
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + F+LQ EG ++WKV Y PR + + + PI
Sbjct: 113 SFATP--GGGVGPHIDNYDVFLLQGEGTRRWKV---GARGNYSPRGGDTHTALIDDFEPI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
+ V LE GD+LY+P GY H+ T+T S +
Sbjct: 168 IDVVLEKGDMLYIPPGYPHRGETITTALSYSI 199
>gi|167534035|ref|XP_001748696.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772937|gb|EDQ86583.1| predicted protein [Monosiga brevicollis MX1]
Length = 806
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 49/173 (28%), Positives = 79/173 (45%), Gaps = 17/173 (9%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFG-CFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
R+ S+ F Q + L +LQ +FG A+ Y + P A+ PH D + ++
Sbjct: 143 RDHLSLVFRREQLDAQNYSVLETTLQAVFGSASTTAHAYLSAPGARALQPHTDPYDVIVV 202
Query: 130 QLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAST 189
QL G+K+W++ + P F + + +EPGD+LYLP+ IH A
Sbjct: 203 QLAGRKRWRLCTGCL---NWPESDLTRFHCQSLW-------MEPGDVLYLPKAVIHVADA 252
Query: 190 --VTNEHSLHVTISVYQKTA-WIDLLEKA---MPKALQAAGATDLEFRRGLPI 236
S+H+T S+ ++ A W LL A P A+ D+E +G P+
Sbjct: 253 PHADETTSIHLTYSLDREGATWHHLLATAACVTPATPHASAHPDVESMQGSPL 305
>gi|259908744|ref|YP_002649100.1| hypothetical protein EpC_20960 [Erwinia pyrifoliae Ep1/96]
gi|387871629|ref|YP_005803004.1| hypothetical protein EPYR_02253 [Erwinia pyrifoliae DSM 12163]
gi|224964366|emb|CAX55875.1| conserved uncharacterized protein YcfD [Erwinia pyrifoliae Ep1/96]
gi|283478717|emb|CAY74633.1| Uncharacterized protein ycfD [Erwinia pyrifoliae DSM 12163]
Length = 375
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 89/192 (46%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q G
Sbjct: 16 YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + + V G PH
Sbjct: 67 -PFESYDHLGENNWSLLVQAVNHWHEPSAALMHPFRAIPDWRVDDLMISFSVAGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V R + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKREMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|392555630|ref|ZP_10302767.1| RmlC-like domain-containing protein [Pseudoalteromonas undina NCIMB
2128]
Length = 380
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
+Y TP G PH D + FI+Q EGK+ W+V LP D L +++ + + Q E T
Sbjct: 116 SYSTP--GGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DASLKQFAQNKSLLQVEAFTA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ LEPGD+LY+P G H+ + N
Sbjct: 171 VIDCVLEPGDILYIPPGCPHEGYAIEN 197
>gi|359433689|ref|ZP_09224009.1| hypothetical protein P20652_2123 [Pseudoalteromonas sp. BSi20652]
gi|357919668|dbj|GAA60258.1| hypothetical protein P20652_2123 [Pseudoalteromonas sp. BSi20652]
Length = 385
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 58/198 (29%), Positives = 92/198 (46%), Gaps = 27/198 (13%)
Query: 3 WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQTM 55
W+K+PL I + N+ + N + +++ + +I KN T +ED ++ T
Sbjct: 19 WQKKPLLIKQGFANFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT- 77
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
D L Q V DH+ + + L P +I N + +L +Y TP
Sbjct: 78 DDHSTLLVQAV-DHWHSEAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--NG 122
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
G PH D + FI+Q EGK+ W+V LP D L +++ + Q E ++ LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEAVIDCILEPG 179
Query: 175 DLLYLPRGYIHQASTVTN 192
D+LY+P G H+ V N
Sbjct: 180 DILYIPPGCPHEGYAVEN 197
>gi|417948195|ref|ZP_12591343.1| hypothetical protein VISP3789_06108 [Vibrio splendidus ATCC 33789]
gi|342810225|gb|EGU45318.1| hypothetical protein VISP3789_06108 [Vibrio splendidus ATCC 33789]
Length = 385
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 89/206 (43%), Gaps = 20/206 (9%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
+H+ +G + K QQL N +L C Y P G PH D +
Sbjct: 88 NHWHQGAN-------ELTKAFQQLPNWLFDDLMIC------YSAPEG--GVGPHIDQYDV 132
Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
FI+Q +GK++WKV + V +Y + Q E PI+ TLEPGD+LY+P G+ H+
Sbjct: 133 FIIQGQGKRQWKVG-AKDVGQYKETVQASALRQIEGFEPIIDETLEPGDILYIPPGFPHE 191
Query: 187 ASTVTNEHSLHVTISVYQKTAWI-DLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
+T+ S + ++ I + + + + D EF+ G +R + L+
Sbjct: 192 GNTLEPSMSYSIGYRSPKEQELISNFADFVLAHDMGDVHLHDPEFKAQDSYGKIRSSDLS 251
Query: 246 RGKPVDIQADRLAMKENLKDLLAKLV 271
D+ L E + D + L+
Sbjct: 252 N--LTDMLKSALEQPERVSDFMGCLL 275
>gi|385788087|ref|YP_005819196.1| hypothetical protein EJP617_26280 [Erwinia sp. Ejp617]
gi|310767359|gb|ADP12309.1| conserved uncharacterized protein YcfD [Erwinia sp. Ejp617]
Length = 375
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 89/192 (46%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q G
Sbjct: 16 YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + + V G PH
Sbjct: 67 -PFESYDHLGENNWSLLVQAVNHWHEPSAALMHPFRAIPDWRVDDLMISFSVAGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V R + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKREMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|408533826|emb|CCK32000.1| hypothetical protein BN159_7621 [Streptomyces davawensis JCM 4913]
Length = 403
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 54/119 (45%), Gaps = 8/119 (6%)
Query: 102 FVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEE 161
AN Y GF H+DD E +QL G+K W+V P P Y ++
Sbjct: 129 LTSANAYLAVGDTDGFHLHWDDHEVIAVQLSGEKSWEVRGP---SRPAPMYRDAERNRTP 185
Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTV---TNEHSLHVTISVYQKTA--WIDLLEKA 215
+ T+ GD++++PRG+ H A+ + HSLHVT ++T W + L A
Sbjct: 186 SEEALWKGTMRAGDVMHIPRGFWHTATRLGSGAGGHSLHVTFGFTKRTGVTWANFLSDA 244
>gi|127512486|ref|YP_001093683.1| cupin 4 family protein [Shewanella loihica PV-4]
gi|126637781|gb|ABO23424.1| Cupin 4 family protein [Shewanella loihica PV-4]
Length = 386
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y TP G PH D+ + F+LQ EG+++WKV +Y PR + + + PI
Sbjct: 117 SYATP--GGGVGPHIDNYDVFLLQGEGRRRWKV---GAKGQYSPRGGDTHTALIDDFEPI 171
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
L V LE GD+LY+P G+ H+ T+ S +
Sbjct: 172 LDVVLEAGDMLYIPPGFPHRGETLETALSYSI 203
>gi|392532539|ref|ZP_10279676.1| RmlC-like domain-containing protein [Pseudoalteromonas arctica A
37-1-2]
Length = 385
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 58/198 (29%), Positives = 93/198 (46%), Gaps = 27/198 (13%)
Query: 3 WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQTM 55
W+K+PL I + N+ + N + +++ + +I KN T +ED ++ T
Sbjct: 19 WQKKPLLIKQGFANFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT- 77
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
D L Q V DH+ + + L P +I N + +L +Y TP
Sbjct: 78 DDHSTLLVQAV-DHWHSDAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--NG 122
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
G PH D + FI+Q EGK+ W+V LP D L +++ + Q E ++ LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEPFEAVIDCILEPG 179
Query: 175 DLLYLPRGYIHQASTVTN 192
D+LY+P G H+ +V N
Sbjct: 180 DILYIPPGCPHEGYSVEN 197
>gi|320166683|gb|EFW43582.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 908
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 76/158 (48%), Gaps = 12/158 (7%)
Query: 72 EGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQL 131
+G ++ ++ + PL +L A +++ G F + T P P + ++Q
Sbjct: 352 DGATVILISAHNFHAPLMRLVADVEQTLGVFSDVSATFTGPVDYFSRPRSPMFDLILVQT 411
Query: 132 EGKKKWKV---YLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQAS 188
G + W+V P D+Y NF E + L+ GD+LY+PRG I +S
Sbjct: 412 FGSQIWRVCEPIFPLPNDKY-----DGNFMPSEALQHCVDAHLQAGDVLYIPRGTI-ISS 465
Query: 189 TVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAG 224
T+E SLHVTIS++ A W +LE ++ +AL +A
Sbjct: 466 RATDEASLHVTISLHSHDAFTWHTVLETSL-RALVSAN 502
>gi|407978129|ref|ZP_11158963.1| hypothetical protein BA1_02960 [Bacillus sp. HYC-10]
gi|407415391|gb|EKF36992.1| hypothetical protein BA1_02960 [Bacillus sp. HYC-10]
Length = 330
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 78/172 (45%), Gaps = 30/172 (17%)
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVG----ANTYQTPPTAQGFA 118
P+ Y +G ++ F +I L++ + L+ L G G A Y +A GF+
Sbjct: 85 PEEAMSWYEKGAALEFDFTDIFIPELRESISKLKSLLGLPEGTMAKAVLYAAKESA-GFS 143
Query: 119 PHYDDIEAFILQLEGKKKWK----------------VYLPRMVDEYLPRYSSPNFSQEEI 162
H+D FI QL+GKKKWK + P + D L Y + F E +
Sbjct: 144 AHFDAYCNFIFQLKGKKKWKLAENFNTINPLQHYELIEAPHLPDP-LKSYWNGEFPDENL 202
Query: 163 --GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
GT ++ L+ G L+LPRG H S+ +L+ T + + AW+DL+
Sbjct: 203 SNGTELI---LDAGSFLFLPRGCWHSTSSSEETIALNFT---FGQPAWLDLI 248
>gi|297560490|ref|YP_003679464.1| cupin [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111]
gi|296844938|gb|ADH66958.1| Cupin 4 family protein [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
Length = 400
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 97/222 (43%), Gaps = 30/222 (13%)
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P+ ++ REG S+ + P+ L L AN Y ++GF H+D
Sbjct: 92 PEALYRELREGASLVLDSVDRMHPPVGAAADDLMRLVRERAQANLYLIWGGSRGFDTHWD 151
Query: 123 DIEAFILQLEGKKKWKVYLPR-----MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
D + I+Q+EG K W+V+ P M ++ ++ P + E+ + L PG ++
Sbjct: 152 DHDTVIVQVEGTKHWQVHGPGSRPYPMKNDVDHAHTPPRDADGELHL-VWEGVLRPGQVI 210
Query: 178 YLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQAAGATDLEFRRGLP 235
++PRG+ H T T S+H+T + T W D L + + + FRR LP
Sbjct: 211 HVPRGWWHTV-TGTGGVSMHLTFGFTRATGVEWADALVRRL--------FEEEVFRRDLP 261
Query: 236 IGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLD 277
R+A D+ R + L +A+L E +DLD
Sbjct: 262 ----RFAD------PDV---RRKHQRALAARMAELAEELDLD 290
>gi|171059323|ref|YP_001791672.1| cupin 4 family protein [Leptothrix cholodnii SP-6]
gi|170776768|gb|ACB34907.1| Cupin 4 family protein [Leptothrix cholodnii SP-6]
Length = 418
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 90/203 (44%), Gaps = 29/203 (14%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG-- 59
HW+++PL + R+ ++ ++ +L D+ +E + ++ +G RQT +
Sbjct: 57 HWQRKPLLV-RQAVPGIEPPVSRAQMFAMLEDDAVESRF---LSRQGEGDRQTWQFKRGP 112
Query: 60 ---RALPQIV---WDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPP 112
R+LP I W +G ++ + + + A L +L +
Sbjct: 113 MPRRSLPAIKQPGWTVLVQGLNLHVPAAADLLNRFRFVPQARLDDLMISWASEGG----- 167
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKV-YLPRM-VDEYLPRYSSPNFSQEEIGTPILTVT 170
G PH+D + F++Q+ G+++W++ LP + E LP NF EE
Sbjct: 168 ---GVGPHFDSYDVFLIQVAGRRRWRIGRLPDARLREGLPVKIIENFRHEE------EWV 218
Query: 171 LEPGDLLYLPRGYIHQASTVTNE 193
LEPGD+LYLP G+ H V E
Sbjct: 219 LEPGDMLYLPPGWAHDGDAVDGE 241
>gi|148977568|ref|ZP_01814147.1| hypothetical protein VSWAT3_23164 [Vibrionales bacterium SWAT-3]
gi|145963219|gb|EDK28486.1| hypothetical protein VSWAT3_23164 [Vibrionales bacterium SWAT-3]
Length = 377
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 60/124 (48%), Gaps = 17/124 (13%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
+H+ +G + K QQL N +L C Y P G PH D +
Sbjct: 88 NHWHQGAN-------ELTKAFQQLPNWLFDDLMIC------YSAPEG--GVGPHIDQYDV 132
Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
FI+Q +GK++WKV + V +Y + Q E PI+ TLEPGD+LY+P G+ H+
Sbjct: 133 FIIQGQGKRQWKVG-AKDVGQYKETVQASALRQIEGFEPIIDETLEPGDILYIPPGFPHE 191
Query: 187 ASTV 190
T+
Sbjct: 192 GDTL 195
>gi|170726550|ref|YP_001760576.1| cupin 4 family protein [Shewanella woodyi ATCC 51908]
gi|169811897|gb|ACA86481.1| Cupin 4 family protein [Shewanella woodyi ATCC 51908]
Length = 381
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 89/210 (42%), Gaps = 36/210 (17%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K+PL I RK + + +L++ +++ + D ++E ++ E G+ Q
Sbjct: 17 EYWQKKPLVI-RKGFSNFRDLLSPEEMAGLACDEMVESRR----IYKEQGEWQA------ 65
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVG---------ANTY 108
+ S L + +Q LN L ++L CF +Y
Sbjct: 66 --------EFGPFDSYEHLGESGWTLVVQALNNWLPGAEQLIQCFDFIPRWRFDDVMVSY 117
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
TP G PH D + FI Q G+++W+V E+ + P E PI+
Sbjct: 118 ATP--GGGVGPHIDLYDVFICQGSGRRRWRV---GDRGEHREFAAHPALLHTEAFEPIID 172
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
V L PGD+LYLP GY H T+ S V
Sbjct: 173 VELLPGDILYLPPGYPHDGVTLEESMSFSV 202
>gi|432946421|ref|ZP_20142050.1| YcfD protein [Escherichia coli KTE196]
gi|433042626|ref|ZP_20230145.1| YcfD protein [Escherichia coli KTE117]
gi|431461796|gb|ELH42063.1| YcfD protein [Escherichia coli KTE196]
gi|431559026|gb|ELI32604.1| YcfD protein [Escherichia coli KTE117]
Length = 373
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|37524851|ref|NP_928195.1| hypothetical protein plu0856 [Photorhabdus luminescens subsp.
laumondii TTO1]
gi|36784276|emb|CAE13151.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii
TTO1]
Length = 388
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 68/149 (45%), Gaps = 14/149 (9%)
Query: 88 LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDE 147
+ QL + ++ Y + F PH+D + F +QL GKK+W +Y P D
Sbjct: 107 IDQLAKACSSFTDSHPFSSLYLSYGEKSSFKPHWDSRDIFAIQLSGKKRWIIYKPSFPDP 166
Query: 148 -YLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
YL + + P LE GD+LYLPRG+ H + E ++H+++ ++
Sbjct: 167 VYLHQSKDMENTYPCPSEPYDDFVLETGDVLYLPRGWWHNPLPL-GEETIHLSVGIFPPY 225
Query: 207 A-----W-------IDLLEKAMPKALQAA 223
A W ID+ K++P++ + A
Sbjct: 226 AHEYINWLSYKITDIDIGRKSLPRSWKQA 254
>gi|359439985|ref|ZP_09229913.1| hypothetical protein P20429_0260 [Pseudoalteromonas sp. BSi20429]
gi|358038186|dbj|GAA66162.1| hypothetical protein P20429_0260 [Pseudoalteromonas sp. BSi20429]
Length = 385
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/198 (29%), Positives = 92/198 (46%), Gaps = 27/198 (13%)
Query: 3 WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQTM 55
W+K+PL I + N+ + N + +++ + +I KN T +ED ++ T
Sbjct: 19 WQKKPLLIKQGFANFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT- 77
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
D L Q V DH+ + + L P +I N + +L +Y TP
Sbjct: 78 DDHSTLLVQAV-DHWHSDAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--NG 122
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
G PH D + FI+Q EGK+ W+V LP D L +++ + Q E ++ LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEPFEAVIDCILEPG 179
Query: 175 DLLYLPRGYIHQASTVTN 192
D+LY+P G H+ V N
Sbjct: 180 DILYIPPGCPHEGYAVEN 197
>gi|86146439|ref|ZP_01064762.1| hypothetical protein MED222_12528 [Vibrio sp. MED222]
gi|85835702|gb|EAQ53837.1| hypothetical protein MED222_12528 [Vibrio sp. MED222]
Length = 377
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 1/75 (1%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +GK++WKV + V +Y + Q E PI+ TLEPGD
Sbjct: 122 GVGPHIDQYDVFIIQGQGKRQWKVGT-KDVGQYKETVQASALRQIEGFEPIIDETLEPGD 180
Query: 176 LLYLPRGYIHQASTV 190
+LY+P G+ H+ +T+
Sbjct: 181 ILYIPPGFPHEGNTL 195
>gi|260779119|ref|ZP_05888011.1| hypothetical protein VIC_004528 [Vibrio coralliilyticus ATCC
BAA-450]
gi|260605283|gb|EEX31578.1| hypothetical protein VIC_004528 [Vibrio coralliilyticus ATCC
BAA-450]
Length = 378
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 88/205 (42%), Gaps = 40/205 (19%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
+W K+P I K +N+ I D++ + ++ +D R +VEG+
Sbjct: 16 YWHKKPTVI----KGGFNDFIDPISPDELAG---LAMEEEVD-------SRYISNVEGKW 61
Query: 61 -------------ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGAN 106
LP+ W + C+ L I+P +QL N +L CF
Sbjct: 62 SAQHGPFDDSLFENLPESHWQLVVQACNHWHLGAAELIEPFKQLPNWLFDDLMVCF---- 117
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTP 165
+ P G PH D + FI+Q GK++W+V L + +Y + Q E
Sbjct: 118 ---SAPEG-GVGPHIDQYDVFIIQGSGKRRWRVGDLDK--GQYKESIQADALRQIEGFDA 171
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTV 190
I+ LEPGD+LY+P G+ H+ +T+
Sbjct: 172 IIDEVLEPGDILYIPPGFPHEGNTL 196
>gi|84393758|ref|ZP_00992506.1| hypothetical protein V12B01_06771 [Vibrio splendidus 12B01]
gi|84375620|gb|EAP92519.1| hypothetical protein V12B01_06771 [Vibrio splendidus 12B01]
Length = 377
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 1/75 (1%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +GK++WKV + V +Y + Q E PI+ TLEPGD
Sbjct: 122 GVGPHIDQYDVFIIQGQGKRQWKVG-AKDVGQYKETVQASALRQIEGFEPIIDETLEPGD 180
Query: 176 LLYLPRGYIHQASTV 190
+LY+P G+ H+ +T+
Sbjct: 181 ILYIPPGFPHEGNTL 195
>gi|366158813|ref|ZP_09458675.1| hypothetical protein ETW09_07695 [Escherichia sp. TW09308]
gi|432371849|ref|ZP_19614899.1| YcfD protein [Escherichia coli KTE11]
gi|430898178|gb|ELC20313.1| YcfD protein [Escherichia coli KTE11]
Length = 373
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMETEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|307257834|ref|ZP_07539591.1| hypothetical protein appser10_18190 [Actinobacillus
pleuropneumoniae serovar 10 str. D13039]
gi|306863740|gb|EFM95666.1| hypothetical protein appser10_18190 [Actinobacillus
pleuropneumoniae serovar 10 str. D13039]
Length = 420
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 18/117 (15%)
Query: 100 GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ 159
GC ++ Y T F H+D + F LQ++G+K+W + + P +++P F
Sbjct: 153 GCHTFSSLYIAFNTQSSFKSHWDSRDIFALQMQGRKRWII--------HAPTFNNPLFMH 204
Query: 160 EEIGTP---------ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+ P + + LE GD+LYLPRG+ H V E ++H+ I ++ A
Sbjct: 205 KSKDMPEYNPNLDDVYMDIILEAGDILYLPRGWWHDPIPV-GEETVHLAIGIFPAYA 260
>gi|407071039|ref|ZP_11101877.1| hypothetical protein VcycZ_15908 [Vibrio cyclitrophicus ZF14]
Length = 377
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 1/75 (1%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +GK++WKV + V +Y + Q E PI+ TLEPGD
Sbjct: 122 GVGPHIDQYDVFIIQGQGKRQWKVG-EKDVGQYKETVQASALRQIEGFEPIIDETLEPGD 180
Query: 176 LLYLPRGYIHQASTV 190
+LY+P G+ H+ +T+
Sbjct: 181 ILYIPPGFPHEGNTL 195
>gi|403511542|ref|YP_006643180.1| cupin-like domain protein [Nocardiopsis alba ATCC BAA-2165]
gi|402799400|gb|AFR06810.1| cupin-like domain protein [Nocardiopsis alba ATCC BAA-2165]
Length = 400
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 86/194 (44%), Gaps = 13/194 (6%)
Query: 47 YEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN 106
Y D + D P+ ++ R G S+ + P+ + L L AN
Sbjct: 76 YTDVGTASRDTRRIVRPESLYRELRAGASLVLDSVDRMHAPVGEAADDLMRLVRERAQAN 135
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR-----MVDEYLPRYSSPNFSQEE 161
Y ++GF H+DD + I+Q+EG K W+V+ P M ++ ++ P +
Sbjct: 136 LYLIWGESRGFDTHWDDHDTMIVQVEGSKHWQVHGPGTRPFPMKNDTDHTHTPPRDADGG 195
Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
+ + L PG +L++PRG+ H T T E S+H+T + T ID E ++ + L
Sbjct: 196 LHL-VWEGVLRPGQVLHVPRGWWHTV-TGTGEMSMHLTFGFTRATG-IDWAE-SLVRGLY 251
Query: 222 AAGATDLEFRRGLP 235
A FRR LP
Sbjct: 252 EEEA----FRRDLP 261
>gi|441504976|ref|ZP_20986968.1| hypothetical protein C942_02122 [Photobacterium sp. AK15]
gi|441427558|gb|ELR65028.1| hypothetical protein C942_02122 [Photobacterium sp. AK15]
Length = 385
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 64/139 (46%), Gaps = 18/139 (12%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
+H+ EG + + P +QL N L +L +Y TP G PH D +
Sbjct: 93 NHWHEGAA-------RLVTPFRQLPNWLLDDLM------ISYSTP--GGGVGPHIDQYDV 137
Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
FI+Q GK+ W+V P+ D+Y + P Q E I+ TLEPGD+LY+P G+ H
Sbjct: 138 FIIQGSGKRHWRV-GPK-SDDYEEKCRHPALRQIEDFESIINETLEPGDILYIPPGFPHD 195
Query: 187 ASTVTNEHSLHVTISVYQK 205
+ S V +K
Sbjct: 196 GYAIETSMSFSVGFRSPKK 214
>gi|167521271|ref|XP_001744974.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776588|gb|EDQ90207.1| predicted protein [Monosiga brevicollis MX1]
Length = 750
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 63/135 (46%), Gaps = 17/135 (12%)
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS- 153
+Q+ G + + Y +PP+ + +PH D + F+ Q G K W + P+ D ++
Sbjct: 146 IQDHLGVVISDHFYISPPSQRALSPHTDGGDVFVHQWAGSKLWTLCAPKAPDCTNCTFAD 205
Query: 154 --------------SPNFSQEEI-GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
+++ E++ G T+ L+ +LLY+PRG +H A T S H
Sbjct: 206 RALRAELAKSAFEGCTSYTDEQLAGMECRTLDLQENELLYIPRGVVHFARANTTGLSAHA 265
Query: 199 TISVYQKTA-WIDLL 212
T V +A WIDL+
Sbjct: 266 TYQVIPPSATWIDLV 280
>gi|414069366|ref|ZP_11405360.1| hypothetical protein D172_0592 [Pseudoalteromonas sp. Bsw20308]
gi|410808169|gb|EKS14141.1| hypothetical protein D172_0592 [Pseudoalteromonas sp. Bsw20308]
Length = 381
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 93/199 (46%), Gaps = 27/199 (13%)
Query: 2 HWEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQT 54
+W+K+PL I + ++ + N + +++ + +I KN T +ED ++ T
Sbjct: 18 YWQKKPLLIKQGFTDFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT 77
Query: 55 MDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
D L Q V DH+ + + L P +I N + +L +Y TP
Sbjct: 78 -DDHSTLLVQAV-DHWHSDAA-QLLEPFRFIP-----NWRIDDLMI------SYSTP--N 121
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEP 173
G PH D + FI+Q EGK+ W+V LP D L +++ + Q E ++ LEP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEAVIDCILEP 178
Query: 174 GDLLYLPRGYIHQASTVTN 192
GD+LY+P G H+ V N
Sbjct: 179 GDILYIPPGCPHEGYAVEN 197
>gi|218709057|ref|YP_002416678.1| hypothetical protein VS_1063 [Vibrio splendidus LGP32]
gi|218322076|emb|CAV18153.1| hypothetical protein VS_1063 [Vibrio splendidus LGP32]
Length = 377
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 45/75 (60%), Gaps = 1/75 (1%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +GK++WKV + + +Y + Q E PI+ TLEPGD
Sbjct: 122 GVGPHIDQYDVFIIQGQGKRQWKVG-AKDIGQYKETVQASALRQIEGFDPIIDETLEPGD 180
Query: 176 LLYLPRGYIHQASTV 190
+LY+P G+ H+ +T+
Sbjct: 181 ILYIPPGFPHEGNTL 195
>gi|359438576|ref|ZP_09228588.1| RmlC-like domain-containing protein [Pseudoalteromonas sp.
BSi20311]
gi|359444143|ref|ZP_09233947.1| hypothetical protein P20439_0258 [Pseudoalteromonas sp. BSi20439]
gi|358026702|dbj|GAA64837.1| RmlC-like domain-containing protein [Pseudoalteromonas sp.
BSi20311]
gi|358042092|dbj|GAA70196.1| hypothetical protein P20439_0258 [Pseudoalteromonas sp. BSi20439]
Length = 380
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
+Y TP G PH D + FI+Q EGK+ W+V LP D L +++ + Q E +
Sbjct: 116 SYSTP--GGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKTLLQVEAFSA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ LEPGD+LY+P G H+ V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197
>gi|336311795|ref|ZP_08566754.1| hypothetical protein SOHN41_02237 [Shewanella sp. HN-41]
gi|335864677|gb|EGM69755.1| hypothetical protein SOHN41_02237 [Shewanella sp. HN-41]
Length = 269
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 11/88 (12%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP---RYSSPNFSQEEIG 163
++ TP + G PH D+ + FI+Q EG+++W V + P R +PN E
Sbjct: 3 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTV------GDNTPQQRRGGNPNSPLVEDF 54
Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVT 191
TPI+ V LE GD+LY+P G+ H T+T
Sbjct: 55 TPIIDVVLEKGDMLYIPPGFPHCGETLT 82
>gi|315126740|ref|YP_004068743.1| RmlC-like domain-containing protein [Pseudoalteromonas sp. SM9913]
gi|315015254|gb|ADT68592.1| RmlC-like domain-containing protein [Pseudoalteromonas sp. SM9913]
Length = 380
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
+Y TP G PH D + FI+Q EGK+ W+V LP D L +++ + Q E +
Sbjct: 116 SYSTP--GGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKTLLQVEAFSA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ LEPGD+LY+P G H+ V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197
>gi|269102218|ref|ZP_06154915.1| hypothetical protein VDA_001639 [Photobacterium damselae subsp.
damselae CIP 102761]
gi|268162116|gb|EEZ40612.1| hypothetical protein VDA_001639 [Photobacterium damselae subsp.
damselae CIP 102761]
Length = 377
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 82/203 (40%), Gaps = 24/203 (11%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDIT---LYEDGQR---QTM 55
+W+K+P+ I K++ +D I D + DI + DG Q
Sbjct: 15 YWQKQPVIIKGGFKDF---------VDPISADELAGLAMEEDIDSRYVARDGDEWDVQHG 65
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
+ +P+ W + + + P +QL L + +Y TP
Sbjct: 66 PLSFDNMPENNWSFLVQSVNYWHEGAARLVTPFRQLPGWLLDDLMI-----SYSTP--GG 118
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI Q GK++W+V + D+Y + P Q + PI+ +E GD
Sbjct: 119 GVGPHIDQYDVFITQGSGKRRWRVGPKK--DDYEEEFRHPALRQIKSFEPIIDEVIEAGD 176
Query: 176 LLYLPRGYIHQASTVTNEHSLHV 198
+LY+P G+ H T+ S V
Sbjct: 177 ILYIPPGFPHDGDTIETAMSFSV 199
>gi|433197725|ref|ZP_20381643.1| YcfD protein [Escherichia coli KTE94]
gi|431724403|gb|ELJ88328.1| YcfD protein [Escherichia coli KTE94]
Length = 373
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHS---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|416896624|ref|ZP_11926471.1| cupin superfamily protein [Escherichia coli STEC_7v]
gi|417112951|ref|ZP_11964871.1| cupin domain protein, PF06172 family [Escherichia coli 1.2741]
gi|422800542|ref|ZP_16849040.1| cupin superfamily protein [Escherichia coli M863]
gi|323967067|gb|EGB62493.1| cupin superfamily protein [Escherichia coli M863]
gi|327253832|gb|EGE65461.1| cupin superfamily protein [Escherichia coli STEC_7v]
gi|386142561|gb|EIG83699.1| cupin domain protein, PF06172 family [Escherichia coli 1.2741]
Length = 373
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTASLMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|348030940|ref|YP_004873626.1| hypothetical protein GNIT_3539 [Glaciecola nitratireducens FR1064]
gi|347948283|gb|AEP31633.1| hypothetical protein GNIT_3539 [Glaciecola nitratireducens FR1064]
Length = 393
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 71/142 (50%), Gaps = 18/142 (12%)
Query: 65 IVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP-HYDD 123
++ ++ G ++RF + PL + +L++ G VG + + PT AP HYD+
Sbjct: 89 LIKQYHELGYTVRFPDVTELSPPLSEFTRALEKQIGNPVGVVVFWSAPT--NAAPVHYDE 146
Query: 124 IEAFILQLEGKKKWKVYLPRMVDEYLPRY-----SSPNFSQEEIGTPILTVTLEPGDLLY 178
++ ++QL G K+W Y+ Y ++ +P F+ + TV + PGD +Y
Sbjct: 147 VDVIVIQLVGTKQW--YISNTPTTYPNKWKNTGEGAPPFNDYQ------TVDVAPGDFIY 198
Query: 179 LPRGYIHQASTVTNEHSLHVTI 200
LPRG H + + S+H++I
Sbjct: 199 LPRGTAHTVKSTSE--SIHLSI 218
>gi|90413199|ref|ZP_01221194.1| hypothetical protein P3TCK_16534 [Photobacterium profundum 3TCK]
gi|90325751|gb|EAS42209.1| hypothetical protein P3TCK_16534 [Photobacterium profundum 3TCK]
Length = 377
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 62/139 (44%), Gaps = 18/139 (12%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEA 126
+H+ EG + + P +QL N L +L +Y TP G PH D +
Sbjct: 85 NHWHEGAA-------RLVTPFRQLPNWLLDDLM------ISYSTP--GGGVGPHIDQYDV 129
Query: 127 FILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
FI Q GK+ W+V + D+Y + P Q + PI+ +EPGD+LY+P G+ H
Sbjct: 130 FITQGSGKRHWRVGPKK--DDYEEEFRHPELRQIKGFEPIIDEVMEPGDILYIPPGFPHD 187
Query: 187 ASTVTNEHSLHVTISVYQK 205
+ S V +K
Sbjct: 188 GYAIETSMSFSVGFRSPKK 206
>gi|420336687|ref|ZP_14838263.1| cupin superfamily protein [Shigella flexneri K-315]
gi|391262251|gb|EIQ21286.1| cupin superfamily protein [Shigella flexneri K-315]
Length = 373
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 84/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPSGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|432616050|ref|ZP_19852174.1| YcfD protein [Escherichia coli KTE75]
gi|431155982|gb|ELE56723.1| YcfD protein [Escherichia coli KTE75]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETSWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|157370253|ref|YP_001478242.1| cupin 4 family protein [Serratia proteamaculans 568]
gi|157322017|gb|ABV41114.1| Cupin 4 family protein [Serratia proteamaculans 568]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 92/202 (45%), Gaps = 34/202 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70
Query: 62 LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
+ W DH+ E S ++P +QL + + +L F
Sbjct: 71 FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 118 P--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194
>gi|420352067|ref|ZP_14853226.1| cupin superfamily protein [Shigella boydii 4444-74]
gi|391284911|gb|EIQ43499.1| cupin superfamily protein [Shigella boydii 4444-74]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLERG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|293409494|ref|ZP_06653070.1| conserved hypothetical protein [Escherichia coli B354]
gi|291469962|gb|EFF12446.1| conserved hypothetical protein [Escherichia coli B354]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|432674129|ref|ZP_19909614.1| YcfD protein [Escherichia coli KTE142]
gi|431216635|gb|ELF14232.1| YcfD protein [Escherichia coli KTE142]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|157155178|ref|YP_001462397.1| cupin family protein [Escherichia coli E24377A]
gi|157160652|ref|YP_001457970.1| cupin family protein [Escherichia coli HS]
gi|170020479|ref|YP_001725433.1| cupin 4 family protein [Escherichia coli ATCC 8739]
gi|170680524|ref|YP_001744049.1| cupin family protein [Escherichia coli SMS-3-5]
gi|187732292|ref|YP_001879936.1| cupin family protein [Shigella boydii CDC 3083-94]
gi|188492699|ref|ZP_02999969.1| cupin family protein [Escherichia coli 53638]
gi|191173069|ref|ZP_03034602.1| cupin family protein [Escherichia coli F11]
gi|194433643|ref|ZP_03065919.1| cupin family protein [Shigella dysenteriae 1012]
gi|194437001|ref|ZP_03069100.1| cupin family protein [Escherichia coli 101-1]
gi|215486382|ref|YP_002328813.1| hypothetical protein E2348C_1269 [Escherichia coli O127:H6 str.
E2348/69]
gi|218558010|ref|YP_002390923.1| hypothetical protein ECS88_1143 [Escherichia coli S88]
gi|218694662|ref|YP_002402329.1| hypothetical protein EC55989_1241 [Escherichia coli 55989]
gi|218700366|ref|YP_002407995.1| hypothetical protein ECIAI39_2028 [Escherichia coli IAI39]
gi|222155875|ref|YP_002556014.1| hypothetical protein LF82_2717 [Escherichia coli LF82]
gi|251784657|ref|YP_002998961.1| hypothetical protein B21_01134 [Escherichia coli BL21(DE3)]
gi|253773848|ref|YP_003036679.1| cupin [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
gi|254161235|ref|YP_003044343.1| hypothetical protein ECB_01126 [Escherichia coli B str. REL606]
gi|254288043|ref|YP_003053791.1| hypothetical protein ECD_01126 [Escherichia coli BL21(DE3)]
gi|297520312|ref|ZP_06938698.1| putative cytoplasmic protein [Escherichia coli OP50]
gi|306814022|ref|ZP_07448195.1| putative cytoplasmic protein [Escherichia coli NC101]
gi|312966384|ref|ZP_07780606.1| cupin family protein [Escherichia coli 2362-75]
gi|312971264|ref|ZP_07785442.1| cupin family protein [Escherichia coli 1827-70]
gi|331657192|ref|ZP_08358154.1| YcfD protein [Escherichia coli TA206]
gi|331662539|ref|ZP_08363462.1| YcfD protein [Escherichia coli TA143]
gi|331667529|ref|ZP_08368393.1| YcfD protein [Escherichia coli TA271]
gi|331682634|ref|ZP_08383253.1| YcfD protein [Escherichia coli H299]
gi|383177783|ref|YP_005455788.1| hypothetical protein SSON53_06140 [Shigella sonnei 53G]
gi|386598936|ref|YP_006100442.1| cupin family protein [Escherichia coli IHE3034]
gi|386604916|ref|YP_006111216.1| putative cytoplasmic protein [Escherichia coli UM146]
gi|386613561|ref|YP_006133227.1| hypothetical protein UMNK88_1459 [Escherichia coli UMNK88]
gi|386618704|ref|YP_006138284.1| hypothetical protein ECNA114_1186 [Escherichia coli NA114]
gi|386623585|ref|YP_006143313.1| hypothetical protein CE10_1209 [Escherichia coli O7:K1 str. CE10]
gi|387616429|ref|YP_006119451.1| putative cytoplasmic protein [Escherichia coli O83:H1 str. NRG
857C]
gi|387829081|ref|YP_003349018.1| hypothetical protein ECSF_1028 [Escherichia coli SE15]
gi|407468819|ref|YP_006784739.1| hypothetical protein O3O_10700 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|407482514|ref|YP_006779663.1| hypothetical protein O3K_14920 [Escherichia coli O104:H4 str.
2011C-3493]
gi|410483066|ref|YP_006770612.1| hypothetical protein O3M_14895 [Escherichia coli O104:H4 str.
2009EL-2050]
gi|414575410|ref|ZP_11432615.1| cupin superfamily protein [Shigella sonnei 3233-85]
gi|415812639|ref|ZP_11504776.1| cupin superfamily protein [Escherichia coli LT-68]
gi|415841245|ref|ZP_11522350.1| cupin superfamily protein [Escherichia coli RN587/1]
gi|415850665|ref|ZP_11527540.1| cupin superfamily protein [Shigella sonnei 53G]
gi|415854379|ref|ZP_11530134.1| cupin family protein [Shigella flexneri 2a str. 2457T]
gi|416285688|ref|ZP_11647837.1| hypothetical protein SGB_03455 [Shigella boydii ATCC 9905]
gi|416301322|ref|ZP_11652924.1| hypothetical protein SGF_03422 [Shigella flexneri CDC 796-83]
gi|416342633|ref|ZP_11676797.1| hypothetical protein ECoL_01732 [Escherichia coli EC4100B]
gi|417083666|ref|ZP_11951711.1| putative cytoplasmic protein [Escherichia coli cloneA_i1]
gi|417125924|ref|ZP_11973784.1| cupin domain protein, PF06172 family [Escherichia coli 97.0246]
gi|417150102|ref|ZP_11989992.1| cupin domain protein, PF06172 family [Escherichia coli 1.2264]
gi|417158877|ref|ZP_11996235.1| cupin domain protein, PF06172 family [Escherichia coli 99.0741]
gi|417233323|ref|ZP_12034087.1| cupin domain protein, PF06172 family [Escherichia coli 5.0959]
gi|417235817|ref|ZP_12034636.1| cupin domain protein, PF06172 family [Escherichia coli 9.0111]
gi|417266580|ref|ZP_12053948.1| cupin domain protein, PF06172 family [Escherichia coli 3.3884]
gi|417283444|ref|ZP_12070741.1| cupin domain protein, PF06172 family [Escherichia coli 3003]
gi|417289202|ref|ZP_12076487.1| cupin domain protein, PF06172 family [Escherichia coli TW07793]
gi|417607492|ref|ZP_12258004.1| cupin superfamily protein [Escherichia coli STEC_DG131-3]
gi|417638472|ref|ZP_12288637.1| cupin superfamily protein [Escherichia coli TX1999]
gi|417661726|ref|ZP_12311307.1| hypothetical protein ECAA86_01265 [Escherichia coli AA86]
gi|417672413|ref|ZP_12321880.1| cupin superfamily protein [Shigella dysenteriae 155-74]
gi|417682471|ref|ZP_12331827.1| cupin superfamily protein [Shigella boydii 3594-74]
gi|417701512|ref|ZP_12350637.1| cupin superfamily protein [Shigella flexneri K-218]
gi|417706847|ref|ZP_12355896.1| cupin superfamily protein [Shigella flexneri VA-6]
gi|417722470|ref|ZP_12371295.1| cupin superfamily protein [Shigella flexneri K-304]
gi|417727771|ref|ZP_12376498.1| cupin superfamily protein [Shigella flexneri K-671]
gi|417733088|ref|ZP_12381751.1| cupin superfamily protein [Shigella flexneri 2747-71]
gi|417738115|ref|ZP_12386709.1| cupin superfamily protein [Shigella flexneri 4343-70]
gi|417742768|ref|ZP_12391310.1| cupin superfamily protein [Shigella flexneri 2930-71]
gi|417755139|ref|ZP_12403228.1| cupin superfamily protein [Escherichia coli DEC2B]
gi|417804584|ref|ZP_12451587.1| putative cytoplasmic protein [Escherichia coli O104:H4 str.
LB226692]
gi|417827259|ref|ZP_12473828.1| cupin superfamily protein [Shigella flexneri J1713]
gi|417832326|ref|ZP_12478815.1| putative cytoplasmic protein [Escherichia coli O104:H4 str.
01-09591]
gi|417864944|ref|ZP_12509989.1| hypothetical protein C22711_1876 [Escherichia coli O104:H4 str.
C227-11]
gi|418255096|ref|ZP_12879548.1| cupin superfamily protein [Shigella flexneri 6603-63]
gi|418264138|ref|ZP_12884695.1| cupin superfamily protein [Shigella sonnei str. Moseley]
gi|418943215|ref|ZP_13496427.1| hypothetical protein T22_10377 [Escherichia coli O157:H43 str. T22]
gi|418996302|ref|ZP_13543907.1| cupin superfamily protein [Escherichia coli DEC1A]
gi|419001489|ref|ZP_13549035.1| cupin superfamily protein [Escherichia coli DEC1B]
gi|419006971|ref|ZP_13554422.1| cupin superfamily protein [Escherichia coli DEC1C]
gi|419012906|ref|ZP_13560266.1| cupin superfamily protein [Escherichia coli DEC1D]
gi|419017783|ref|ZP_13565100.1| cupin superfamily protein [Escherichia coli DEC1E]
gi|419023421|ref|ZP_13570658.1| cupin superfamily protein [Escherichia coli DEC2A]
gi|419028225|ref|ZP_13575413.1| cupin superfamily protein [Escherichia coli DEC2C]
gi|419034954|ref|ZP_13582044.1| cupin superfamily protein [Escherichia coli DEC2D]
gi|419039028|ref|ZP_13586078.1| cupin superfamily protein [Escherichia coli DEC2E]
gi|419169127|ref|ZP_13713521.1| cupin superfamily protein [Escherichia coli DEC7A]
gi|419174435|ref|ZP_13718286.1| cupin superfamily protein [Escherichia coli DEC7B]
gi|419180157|ref|ZP_13723778.1| cupin superfamily protein [Escherichia coli DEC7C]
gi|419185669|ref|ZP_13729191.1| cupin superfamily protein [Escherichia coli DEC7D]
gi|419190937|ref|ZP_13734403.1| cupin superfamily protein [Escherichia coli DEC7E]
gi|419344761|ref|ZP_13886143.1| cupin superfamily protein [Escherichia coli DEC13A]
gi|419349198|ref|ZP_13890551.1| cupin superfamily protein [Escherichia coli DEC13B]
gi|419354359|ref|ZP_13895633.1| cupin superfamily protein [Escherichia coli DEC13C]
gi|419359586|ref|ZP_13900811.1| cupin superfamily protein [Escherichia coli DEC13D]
gi|419364589|ref|ZP_13905761.1| cupin superfamily protein [Escherichia coli DEC13E]
gi|419699974|ref|ZP_14227586.1| hypothetical protein OQA_05431 [Escherichia coli SCI-07]
gi|419915151|ref|ZP_14433519.1| putative cytoplasmic protein [Escherichia coli KD1]
gi|419925456|ref|ZP_14443294.1| putative cytoplasmic protein [Escherichia coli 541-15]
gi|419944902|ref|ZP_14461365.1| putative cytoplasmic protein [Escherichia coli HM605]
gi|420319727|ref|ZP_14821570.1| cupin superfamily protein [Shigella flexneri 2850-71]
gi|420324846|ref|ZP_14826620.1| cupin superfamily protein [Shigella flexneri CCH060]
gi|420330541|ref|ZP_14832224.1| cupin superfamily protein [Shigella flexneri K-1770]
gi|420341112|ref|ZP_14842618.1| cupin superfamily protein [Shigella flexneri K-404]
gi|420347398|ref|ZP_14848798.1| cupin superfamily protein [Shigella boydii 965-58]
gi|420357926|ref|ZP_14858928.1| cupin superfamily protein [Shigella sonnei 3226-85]
gi|420362774|ref|ZP_14863683.1| cupin superfamily protein [Shigella sonnei 4822-66]
gi|420384996|ref|ZP_14884364.1| cupin superfamily protein [Escherichia coli EPECa12]
gi|421681872|ref|ZP_16121692.1| cupin superfamily protein [Shigella flexneri 1485-80]
gi|421776461|ref|ZP_16213065.1| cupin family protein [Escherichia coli AD30]
gi|422747842|ref|ZP_16801755.1| cupin superfamily protein [Escherichia coli H252]
gi|422753736|ref|ZP_16807563.1| cupin superfamily protein [Escherichia coli H263]
gi|422774987|ref|ZP_16828643.1| cupin superfamily protein [Escherichia coli H120]
gi|422785712|ref|ZP_16838451.1| cupin superfamily protein [Escherichia coli H489]
gi|422790175|ref|ZP_16842880.1| cupin superfamily protein [Escherichia coli TA007]
gi|422828616|ref|ZP_16876786.1| hypothetical protein ESNG_01291 [Escherichia coli B093]
gi|422833344|ref|ZP_16881411.1| hypothetical protein ESOG_01012 [Escherichia coli E101]
gi|422839587|ref|ZP_16887559.1| hypothetical protein ESPG_02245 [Escherichia coli H397]
gi|422992009|ref|ZP_16982780.1| hypothetical protein EUAG_01602 [Escherichia coli O104:H4 str.
C227-11]
gi|422993959|ref|ZP_16984723.1| hypothetical protein EUBG_01610 [Escherichia coli O104:H4 str.
C236-11]
gi|422999138|ref|ZP_16989894.1| hypothetical protein EUEG_01566 [Escherichia coli O104:H4 str.
09-7901]
gi|423002737|ref|ZP_16993483.1| hypothetical protein EUDG_00221 [Escherichia coli O104:H4 str.
04-8351]
gi|423009273|ref|ZP_17000011.1| hypothetical protein EUFG_01610 [Escherichia coli O104:H4 str.
11-3677]
gi|423023465|ref|ZP_17014168.1| hypothetical protein EUHG_01618 [Escherichia coli O104:H4 str.
11-4404]
gi|423028613|ref|ZP_17019306.1| hypothetical protein EUIG_01617 [Escherichia coli O104:H4 str.
11-4522]
gi|423029479|ref|ZP_17020167.1| hypothetical protein EUJG_00238 [Escherichia coli O104:H4 str.
11-4623]
gi|423037318|ref|ZP_17027992.1| hypothetical protein EUKG_01595 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|423042433|ref|ZP_17033100.1| hypothetical protein EULG_01608 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|423049124|ref|ZP_17039781.1| hypothetical protein EUMG_01612 [Escherichia coli O104:H4 str.
11-4632 C3]
gi|423052705|ref|ZP_17041513.1| hypothetical protein EUNG_01111 [Escherichia coli O104:H4 str.
11-4632 C4]
gi|423059673|ref|ZP_17048469.1| hypothetical protein EUOG_01613 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|423709135|ref|ZP_17683513.1| hypothetical protein ESTG_03603 [Escherichia coli B799]
gi|425277265|ref|ZP_18668566.1| hypothetical protein ECARS42123_1409 [Escherichia coli ARS4.2123]
gi|425287956|ref|ZP_18678845.1| hypothetical protein EC3006_1448 [Escherichia coli 3006]
gi|425299606|ref|ZP_18689602.1| hypothetical protein EC07798_1505 [Escherichia coli 07798]
gi|425421806|ref|ZP_18802999.1| hypothetical protein EC01288_1172 [Escherichia coli 0.1288]
gi|429723522|ref|ZP_19258404.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9450]
gi|429723866|ref|ZP_19258741.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9990]
gi|429773601|ref|ZP_19305614.1| YcfD protein [Escherichia coli O104:H4 str. 11-02030]
gi|429776591|ref|ZP_19308571.1| YcfD protein [Escherichia coli O104:H4 str. 11-02033-1]
gi|429781815|ref|ZP_19313742.1| YcfD protein [Escherichia coli O104:H4 str. 11-02092]
gi|429787009|ref|ZP_19318900.1| YcfD protein [Escherichia coli O104:H4 str. 11-02093]
gi|429792188|ref|ZP_19324040.1| YcfD protein [Escherichia coli O104:H4 str. 11-02281]
gi|429797401|ref|ZP_19329206.1| YcfD protein [Escherichia coli O104:H4 str. 11-02318]
gi|429802606|ref|ZP_19334367.1| YcfD protein [Escherichia coli O104:H4 str. 11-02913]
gi|429808977|ref|ZP_19340689.1| YcfD protein [Escherichia coli O104:H4 str. 11-03439]
gi|429813029|ref|ZP_19344709.1| YcfD protein [Escherichia coli O104:H4 str. 11-04080]
gi|429818235|ref|ZP_19349871.1| YcfD protein [Escherichia coli O104:H4 str. 11-03943]
gi|429912063|ref|ZP_19378019.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9941]
gi|429917905|ref|ZP_19383845.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4984]
gi|429922943|ref|ZP_19388864.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-5604]
gi|429923794|ref|ZP_19389710.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4986]
gi|429932687|ref|ZP_19398581.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4987]
gi|429934290|ref|ZP_19400180.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4988]
gi|429939952|ref|ZP_19405826.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-5603]
gi|429947591|ref|ZP_19413446.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-6006]
gi|429950224|ref|ZP_19416072.1| YcfD protein [Escherichia coli O104:H4 str. Ec12-0465]
gi|429958497|ref|ZP_19424326.1| YcfD protein [Escherichia coli O104:H4 str. Ec12-0466]
gi|432357441|ref|ZP_19600684.1| YcfD protein [Escherichia coli KTE4]
gi|432361865|ref|ZP_19605048.1| YcfD protein [Escherichia coli KTE5]
gi|432369180|ref|ZP_19612280.1| YcfD protein [Escherichia coli KTE10]
gi|432376287|ref|ZP_19619293.1| YcfD protein [Escherichia coli KTE12]
gi|432380836|ref|ZP_19623785.1| YcfD protein [Escherichia coli KTE15]
gi|432386601|ref|ZP_19629496.1| YcfD protein [Escherichia coli KTE16]
gi|432396982|ref|ZP_19639767.1| YcfD protein [Escherichia coli KTE25]
gi|432405914|ref|ZP_19648634.1| YcfD protein [Escherichia coli KTE28]
gi|432411216|ref|ZP_19653894.1| YcfD protein [Escherichia coli KTE39]
gi|432421219|ref|ZP_19663774.1| YcfD protein [Escherichia coli KTE178]
gi|432431260|ref|ZP_19673701.1| YcfD protein [Escherichia coli KTE187]
gi|432435792|ref|ZP_19678186.1| YcfD protein [Escherichia coli KTE188]
gi|432440598|ref|ZP_19682946.1| YcfD protein [Escherichia coli KTE189]
gi|432445707|ref|ZP_19688010.1| YcfD protein [Escherichia coli KTE191]
gi|432449396|ref|ZP_19691677.1| YcfD protein [Escherichia coli KTE193]
gi|432456099|ref|ZP_19698293.1| YcfD protein [Escherichia coli KTE201]
gi|432465131|ref|ZP_19707231.1| YcfD protein [Escherichia coli KTE205]
gi|432470467|ref|ZP_19712519.1| YcfD protein [Escherichia coli KTE206]
gi|432484814|ref|ZP_19726733.1| YcfD protein [Escherichia coli KTE212]
gi|432488828|ref|ZP_19730712.1| YcfD protein [Escherichia coli KTE213]
gi|432495015|ref|ZP_19736830.1| YcfD protein [Escherichia coli KTE214]
gi|432499433|ref|ZP_19741203.1| YcfD protein [Escherichia coli KTE216]
gi|432503853|ref|ZP_19745586.1| YcfD protein [Escherichia coli KTE220]
gi|432513404|ref|ZP_19750637.1| YcfD protein [Escherichia coli KTE224]
gi|432523267|ref|ZP_19760402.1| YcfD protein [Escherichia coli KTE230]
gi|432530459|ref|ZP_19767496.1| YcfD protein [Escherichia coli KTE233]
gi|432533351|ref|ZP_19770341.1| YcfD protein [Escherichia coli KTE234]
gi|432542556|ref|ZP_19779411.1| YcfD protein [Escherichia coli KTE236]
gi|432548031|ref|ZP_19784815.1| YcfD protein [Escherichia coli KTE237]
gi|432553165|ref|ZP_19789894.1| YcfD protein [Escherichia coli KTE47]
gi|432558184|ref|ZP_19794872.1| YcfD protein [Escherichia coli KTE49]
gi|432568116|ref|ZP_19804637.1| YcfD protein [Escherichia coli KTE53]
gi|432573147|ref|ZP_19809637.1| YcfD protein [Escherichia coli KTE55]
gi|432583167|ref|ZP_19819575.1| YcfD protein [Escherichia coli KTE57]
gi|432587375|ref|ZP_19823741.1| YcfD protein [Escherichia coli KTE58]
gi|432592265|ref|ZP_19828592.1| YcfD protein [Escherichia coli KTE60]
gi|432596987|ref|ZP_19833268.1| YcfD protein [Escherichia coli KTE62]
gi|432601653|ref|ZP_19837900.1| YcfD protein [Escherichia coli KTE66]
gi|432606976|ref|ZP_19843167.1| YcfD protein [Escherichia coli KTE67]
gi|432610892|ref|ZP_19847059.1| YcfD protein [Escherichia coli KTE72]
gi|432621321|ref|ZP_19857361.1| YcfD protein [Escherichia coli KTE76]
gi|432645649|ref|ZP_19881447.1| YcfD protein [Escherichia coli KTE86]
gi|432650564|ref|ZP_19886323.1| YcfD protein [Escherichia coli KTE87]
gi|432654749|ref|ZP_19890465.1| YcfD protein [Escherichia coli KTE93]
gi|432670065|ref|ZP_19905605.1| YcfD protein [Escherichia coli KTE119]
gi|432679601|ref|ZP_19914995.1| YcfD protein [Escherichia coli KTE143]
gi|432693917|ref|ZP_19929126.1| YcfD protein [Escherichia coli KTE162]
gi|432698518|ref|ZP_19933683.1| YcfD protein [Escherichia coli KTE169]
gi|432710083|ref|ZP_19945147.1| YcfD protein [Escherichia coli KTE6]
gi|432712757|ref|ZP_19947806.1| YcfD protein [Escherichia coli KTE8]
gi|432718161|ref|ZP_19953145.1| YcfD protein [Escherichia coli KTE9]
gi|432722609|ref|ZP_19957532.1| YcfD protein [Escherichia coli KTE17]
gi|432727197|ref|ZP_19962078.1| YcfD protein [Escherichia coli KTE18]
gi|432731881|ref|ZP_19966716.1| YcfD protein [Escherichia coli KTE45]
gi|432740883|ref|ZP_19975604.1| YcfD protein [Escherichia coli KTE23]
gi|432745139|ref|ZP_19979834.1| YcfD protein [Escherichia coli KTE43]
gi|432753870|ref|ZP_19988426.1| YcfD protein [Escherichia coli KTE22]
gi|432758959|ref|ZP_19993458.1| YcfD protein [Escherichia coli KTE46]
gi|432764493|ref|ZP_19998938.1| YcfD protein [Escherichia coli KTE48]
gi|432778001|ref|ZP_20012250.1| YcfD protein [Escherichia coli KTE59]
gi|432783049|ref|ZP_20017233.1| YcfD protein [Escherichia coli KTE63]
gi|432786789|ref|ZP_20020953.1| YcfD protein [Escherichia coli KTE65]
gi|432792395|ref|ZP_20026483.1| YcfD protein [Escherichia coli KTE78]
gi|432798356|ref|ZP_20032380.1| YcfD protein [Escherichia coli KTE79]
gi|432801432|ref|ZP_20035414.1| YcfD protein [Escherichia coli KTE84]
gi|432805248|ref|ZP_20039189.1| YcfD protein [Escherichia coli KTE91]
gi|432813303|ref|ZP_20047127.1| YcfD protein [Escherichia coli KTE101]
gi|432814763|ref|ZP_20048553.1| YcfD protein [Escherichia coli KTE115]
gi|432820375|ref|ZP_20054078.1| YcfD protein [Escherichia coli KTE118]
gi|432826596|ref|ZP_20060250.1| YcfD protein [Escherichia coli KTE123]
gi|432831099|ref|ZP_20064681.1| YcfD protein [Escherichia coli KTE135]
gi|432834200|ref|ZP_20067742.1| YcfD protein [Escherichia coli KTE136]
gi|432838845|ref|ZP_20072334.1| YcfD protein [Escherichia coli KTE140]
gi|432843664|ref|ZP_20076847.1| YcfD protein [Escherichia coli KTE141]
gi|432849578|ref|ZP_20080800.1| YcfD protein [Escherichia coli KTE144]
gi|432860950|ref|ZP_20086034.1| YcfD protein [Escherichia coli KTE146]
gi|432880849|ref|ZP_20097384.1| YcfD protein [Escherichia coli KTE154]
gi|432893835|ref|ZP_20105847.1| YcfD protein [Escherichia coli KTE165]
gi|432898025|ref|ZP_20108856.1| YcfD protein [Escherichia coli KTE192]
gi|432903996|ref|ZP_20113267.1| YcfD protein [Escherichia coli KTE194]
gi|432921226|ref|ZP_20124608.1| YcfD protein [Escherichia coli KTE173]
gi|432926155|ref|ZP_20127936.1| YcfD protein [Escherichia coli KTE175]
gi|432933686|ref|ZP_20133354.1| YcfD protein [Escherichia coli KTE184]
gi|432937035|ref|ZP_20135727.1| YcfD protein [Escherichia coli KTE183]
gi|432971323|ref|ZP_20160196.1| YcfD protein [Escherichia coli KTE207]
gi|432977758|ref|ZP_20166581.1| YcfD protein [Escherichia coli KTE209]
gi|432980571|ref|ZP_20169349.1| YcfD protein [Escherichia coli KTE211]
gi|432984857|ref|ZP_20173586.1| YcfD protein [Escherichia coli KTE215]
gi|432990196|ref|ZP_20178862.1| YcfD protein [Escherichia coli KTE217]
gi|432994830|ref|ZP_20183444.1| YcfD protein [Escherichia coli KTE218]
gi|432999357|ref|ZP_20187893.1| YcfD protein [Escherichia coli KTE223]
gi|433004572|ref|ZP_20193010.1| YcfD protein [Escherichia coli KTE227]
gi|433011829|ref|ZP_20200232.1| YcfD protein [Escherichia coli KTE229]
gi|433013329|ref|ZP_20201701.1| YcfD protein [Escherichia coli KTE104]
gi|433022970|ref|ZP_20210980.1| YcfD protein [Escherichia coli KTE106]
gi|433028125|ref|ZP_20215992.1| YcfD protein [Escherichia coli KTE109]
gi|433032696|ref|ZP_20220464.1| YcfD protein [Escherichia coli KTE112]
gi|433038160|ref|ZP_20225770.1| YcfD protein [Escherichia coli KTE113]
gi|433057409|ref|ZP_20244488.1| YcfD protein [Escherichia coli KTE124]
gi|433072157|ref|ZP_20258847.1| YcfD protein [Escherichia coli KTE129]
gi|433077260|ref|ZP_20263820.1| YcfD protein [Escherichia coli KTE131]
gi|433082043|ref|ZP_20268515.1| YcfD protein [Escherichia coli KTE133]
gi|433086712|ref|ZP_20273104.1| YcfD protein [Escherichia coli KTE137]
gi|433095996|ref|ZP_20282205.1| YcfD protein [Escherichia coli KTE139]
gi|433100633|ref|ZP_20286738.1| YcfD protein [Escherichia coli KTE145]
gi|433105223|ref|ZP_20291236.1| YcfD protein [Escherichia coli KTE148]
gi|433110422|ref|ZP_20296293.1| YcfD protein [Escherichia coli KTE150]
gi|433114987|ref|ZP_20300798.1| YcfD protein [Escherichia coli KTE153]
gi|433119674|ref|ZP_20305375.1| YcfD protein [Escherichia coli KTE157]
gi|433124659|ref|ZP_20310242.1| YcfD protein [Escherichia coli KTE160]
gi|433129508|ref|ZP_20314970.1| YcfD protein [Escherichia coli KTE163]
gi|433134306|ref|ZP_20319673.1| YcfD protein [Escherichia coli KTE166]
gi|433138719|ref|ZP_20323999.1| YcfD protein [Escherichia coli KTE167]
gi|433143742|ref|ZP_20328904.1| YcfD protein [Escherichia coli KTE168]
gi|433148614|ref|ZP_20333663.1| YcfD protein [Escherichia coli KTE174]
gi|433153201|ref|ZP_20338165.1| YcfD protein [Escherichia coli KTE176]
gi|433162947|ref|ZP_20347704.1| YcfD protein [Escherichia coli KTE179]
gi|433167974|ref|ZP_20352637.1| YcfD protein [Escherichia coli KTE180]
gi|433172951|ref|ZP_20357501.1| YcfD protein [Escherichia coli KTE232]
gi|433182644|ref|ZP_20366935.1| YcfD protein [Escherichia coli KTE85]
gi|433187907|ref|ZP_20372022.1| YcfD protein [Escherichia coli KTE88]
gi|433193157|ref|ZP_20377165.1| YcfD protein [Escherichia coli KTE90]
gi|433202661|ref|ZP_20386456.1| YcfD protein [Escherichia coli KTE95]
gi|433207227|ref|ZP_20390920.1| YcfD protein [Escherichia coli KTE97]
gi|433211981|ref|ZP_20395590.1| YcfD protein [Escherichia coli KTE99]
gi|433322743|ref|ZP_20400146.1| hypothetical protein B185_004920 [Escherichia coli J96]
gi|442599916|ref|ZP_21017618.1| FIG002776: hypothetical protein [Escherichia coli O5:K4(L):H4 str.
ATCC 23502]
gi|442603849|ref|ZP_21018703.1| FIG002776: hypothetical protein [Escherichia coli Nissle 1917]
gi|443617239|ref|YP_007381095.1| hypothetical protein APECO78_09600 [Escherichia coli APEC O78]
gi|450187641|ref|ZP_21889937.1| hypothetical protein A364_06482 [Escherichia coli SEPT362]
gi|157066332|gb|ABV05587.1| cupin family protein [Escherichia coli HS]
gi|157077208|gb|ABV16916.1| cupin family protein [Escherichia coli E24377A]
gi|169755407|gb|ACA78106.1| Cupin 4 family protein [Escherichia coli ATCC 8739]
gi|170518242|gb|ACB16420.1| cupin family protein [Escherichia coli SMS-3-5]
gi|187429284|gb|ACD08558.1| cupin family protein [Shigella boydii CDC 3083-94]
gi|188487898|gb|EDU63001.1| cupin family protein [Escherichia coli 53638]
gi|190906614|gb|EDV66220.1| cupin family protein [Escherichia coli F11]
gi|194418072|gb|EDX34165.1| cupin family protein [Shigella dysenteriae 1012]
gi|194423984|gb|EDX39972.1| cupin family protein [Escherichia coli 101-1]
gi|215264454|emb|CAS08817.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
gi|218351394|emb|CAU97100.1| conserved hypothetical protein [Escherichia coli 55989]
gi|218364779|emb|CAR02469.1| conserved hypothetical protein [Escherichia coli S88]
gi|218370352|emb|CAR18155.1| conserved hypothetical protein [Escherichia coli IAI39]
gi|222032880|emb|CAP75620.1| Uncharacterized protein ycfD [Escherichia coli LF82]
gi|242376930|emb|CAQ31651.1| conserved protein [Escherichia coli BL21(DE3)]
gi|253324892|gb|ACT29494.1| Cupin 4 family protein [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
gi|253973136|gb|ACT38807.1| hypothetical protein ECB_01126 [Escherichia coli B str. REL606]
gi|253977350|gb|ACT43020.1| hypothetical protein ECD_01126 [Escherichia coli BL21(DE3)]
gi|281178238|dbj|BAI54568.1| conserved hypothetical protein [Escherichia coli SE15]
gi|294493437|gb|ADE92193.1| cupin family protein [Escherichia coli IHE3034]
gi|305852659|gb|EFM53107.1| putative cytoplasmic protein [Escherichia coli NC101]
gi|307627400|gb|ADN71704.1| putative cytoplasmic protein [Escherichia coli UM146]
gi|310336466|gb|EFQ01652.1| cupin family protein [Escherichia coli 1827-70]
gi|312288837|gb|EFR16735.1| cupin family protein [Escherichia coli 2362-75]
gi|312945690|gb|ADR26517.1| putative cytoplasmic protein [Escherichia coli O83:H1 str. NRG
857C]
gi|313650418|gb|EFS14825.1| cupin family protein [Shigella flexneri 2a str. 2457T]
gi|320179172|gb|EFW54130.1| hypothetical protein SGB_03455 [Shigella boydii ATCC 9905]
gi|320184385|gb|EFW59196.1| hypothetical protein SGF_03422 [Shigella flexneri CDC 796-83]
gi|320201024|gb|EFW75608.1| hypothetical protein ECoL_01732 [Escherichia coli EC4100B]
gi|323165636|gb|EFZ51423.1| cupin superfamily protein [Shigella sonnei 53G]
gi|323172495|gb|EFZ58132.1| cupin superfamily protein [Escherichia coli LT-68]
gi|323187616|gb|EFZ72923.1| cupin superfamily protein [Escherichia coli RN587/1]
gi|323947568|gb|EGB43572.1| cupin superfamily protein [Escherichia coli H120]
gi|323953185|gb|EGB49051.1| cupin superfamily protein [Escherichia coli H252]
gi|323957954|gb|EGB53666.1| cupin superfamily protein [Escherichia coli H263]
gi|323962727|gb|EGB58305.1| cupin superfamily protein [Escherichia coli H489]
gi|323973282|gb|EGB68471.1| cupin superfamily protein [Escherichia coli TA007]
gi|330910944|gb|EGH39454.1| hypothetical protein ECAA86_01265 [Escherichia coli AA86]
gi|331055440|gb|EGI27449.1| YcfD protein [Escherichia coli TA206]
gi|331060961|gb|EGI32925.1| YcfD protein [Escherichia coli TA143]
gi|331065114|gb|EGI37009.1| YcfD protein [Escherichia coli TA271]
gi|331080265|gb|EGI51444.1| YcfD protein [Escherichia coli H299]
gi|332092825|gb|EGI97893.1| cupin superfamily protein [Shigella dysenteriae 155-74]
gi|332094465|gb|EGI99514.1| cupin superfamily protein [Shigella boydii 3594-74]
gi|332342730|gb|AEE56064.1| conserved hypothetical protein [Escherichia coli UMNK88]
gi|332758289|gb|EGJ88612.1| cupin superfamily protein [Shigella flexneri 4343-70]
gi|332759490|gb|EGJ89798.1| cupin superfamily protein [Shigella flexneri 2747-71]
gi|332761067|gb|EGJ91354.1| cupin superfamily protein [Shigella flexneri K-671]
gi|332767348|gb|EGJ97542.1| cupin superfamily protein [Shigella flexneri 2930-71]
gi|333004939|gb|EGK24459.1| cupin superfamily protein [Shigella flexneri VA-6]
gi|333005483|gb|EGK25001.1| cupin superfamily protein [Shigella flexneri K-218]
gi|333019433|gb|EGK38716.1| cupin superfamily protein [Shigella flexneri K-304]
gi|333969205|gb|AEG36010.1| Hypothetical protein ECNA114_1186 [Escherichia coli NA114]
gi|335576073|gb|EGM62331.1| cupin superfamily protein [Shigella flexneri J1713]
gi|340735101|gb|EGR64190.1| putative cytoplasmic protein [Escherichia coli O104:H4 str.
01-09591]
gi|340740826|gb|EGR75004.1| putative cytoplasmic protein [Escherichia coli O104:H4 str.
LB226692]
gi|341918233|gb|EGT67847.1| hypothetical protein C22711_1876 [Escherichia coli O104:H4 str.
C227-11]
gi|345361538|gb|EGW93697.1| cupin superfamily protein [Escherichia coli STEC_DG131-3]
gi|345394969|gb|EGX24723.1| cupin superfamily protein [Escherichia coli TX1999]
gi|349737323|gb|AEQ12029.1| conserved protein [Escherichia coli O7:K1 str. CE10]
gi|354857242|gb|EHF17698.1| hypothetical protein EUAG_01602 [Escherichia coli O104:H4 str.
C227-11]
gi|354865034|gb|EHF25463.1| hypothetical protein EUBG_01610 [Escherichia coli O104:H4 str.
C236-11]
gi|354871813|gb|EHF32210.1| hypothetical protein EUDG_00221 [Escherichia coli O104:H4 str.
04-8351]
gi|354875315|gb|EHF35681.1| hypothetical protein EUEG_01566 [Escherichia coli O104:H4 str.
09-7901]
gi|354877563|gb|EHF37922.1| hypothetical protein EUHG_01618 [Escherichia coli O104:H4 str.
11-4404]
gi|354882387|gb|EHF42711.1| hypothetical protein EUIG_01617 [Escherichia coli O104:H4 str.
11-4522]
gi|354883095|gb|EHF43417.1| hypothetical protein EUFG_01610 [Escherichia coli O104:H4 str.
11-3677]
gi|354899396|gb|EHF59545.1| hypothetical protein EUKG_01595 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|354901219|gb|EHF61347.1| hypothetical protein EUJG_00238 [Escherichia coli O104:H4 str.
11-4623]
gi|354902873|gb|EHF62986.1| hypothetical protein EULG_01608 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|354905066|gb|EHF65151.1| hypothetical protein EUMG_01612 [Escherichia coli O104:H4 str.
11-4632 C3]
gi|354916293|gb|EHF76267.1| hypothetical protein EUOG_01613 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|354921456|gb|EHF81381.1| hypothetical protein EUNG_01111 [Escherichia coli O104:H4 str.
11-4632 C4]
gi|355352609|gb|EHG01784.1| putative cytoplasmic protein [Escherichia coli cloneA_i1]
gi|371608094|gb|EHN96657.1| hypothetical protein ESOG_01012 [Escherichia coli E101]
gi|371609461|gb|EHN98000.1| hypothetical protein ESPG_02245 [Escherichia coli H397]
gi|371612529|gb|EHO01038.1| hypothetical protein ESNG_01291 [Escherichia coli B093]
gi|375321471|gb|EHS67304.1| hypothetical protein T22_10377 [Escherichia coli O157:H43 str. T22]
gi|377846903|gb|EHU11908.1| cupin superfamily protein [Escherichia coli DEC1A]
gi|377848763|gb|EHU13742.1| cupin superfamily protein [Escherichia coli DEC1C]
gi|377851744|gb|EHU16680.1| cupin superfamily protein [Escherichia coli DEC1B]
gi|377860538|gb|EHU25362.1| cupin superfamily protein [Escherichia coli DEC1D]
gi|377864172|gb|EHU28969.1| cupin superfamily protein [Escherichia coli DEC1E]
gi|377866546|gb|EHU31312.1| cupin superfamily protein [Escherichia coli DEC2A]
gi|377877092|gb|EHU41689.1| cupin superfamily protein [Escherichia coli DEC2B]
gi|377881271|gb|EHU45833.1| cupin superfamily protein [Escherichia coli DEC2D]
gi|377883089|gb|EHU47618.1| cupin superfamily protein [Escherichia coli DEC2C]
gi|377896504|gb|EHU60899.1| cupin superfamily protein [Escherichia coli DEC2E]
gi|378018345|gb|EHV81212.1| cupin superfamily protein [Escherichia coli DEC7A]
gi|378026078|gb|EHV88717.1| cupin superfamily protein [Escherichia coli DEC7C]
gi|378033087|gb|EHV95668.1| cupin superfamily protein [Escherichia coli DEC7D]
gi|378036289|gb|EHV98832.1| cupin superfamily protein [Escherichia coli DEC7B]
gi|378041000|gb|EHW03463.1| cupin superfamily protein [Escherichia coli DEC7E]
gi|378189189|gb|EHX49783.1| cupin superfamily protein [Escherichia coli DEC13A]
gi|378204860|gb|EHX65276.1| cupin superfamily protein [Escherichia coli DEC13B]
gi|378205832|gb|EHX66240.1| cupin superfamily protein [Escherichia coli DEC13C]
gi|378207045|gb|EHX67447.1| cupin superfamily protein [Escherichia coli DEC13D]
gi|378216410|gb|EHX76697.1| cupin superfamily protein [Escherichia coli DEC13E]
gi|380349080|gb|EIA37356.1| hypothetical protein OQA_05431 [Escherichia coli SCI-07]
gi|385706842|gb|EIG43880.1| hypothetical protein ESTG_03603 [Escherichia coli B799]
gi|386145441|gb|EIG91899.1| cupin domain protein, PF06172 family [Escherichia coli 97.0246]
gi|386160695|gb|EIH22501.1| cupin domain protein, PF06172 family [Escherichia coli 1.2264]
gi|386175533|gb|EIH47522.1| cupin domain protein, PF06172 family [Escherichia coli 99.0741]
gi|386203589|gb|EII08107.1| cupin domain protein, PF06172 family [Escherichia coli 5.0959]
gi|386214770|gb|EII25178.1| cupin domain protein, PF06172 family [Escherichia coli 9.0111]
gi|386231390|gb|EII58738.1| cupin domain protein, PF06172 family [Escherichia coli 3.3884]
gi|386243387|gb|EII85120.1| cupin domain protein, PF06172 family [Escherichia coli 3003]
gi|386247994|gb|EII94167.1| cupin domain protein, PF06172 family [Escherichia coli TW07793]
gi|388384339|gb|EIL46072.1| putative cytoplasmic protein [Escherichia coli KD1]
gi|388386333|gb|EIL47982.1| putative cytoplasmic protein [Escherichia coli 541-15]
gi|388417281|gb|EIL77139.1| putative cytoplasmic protein [Escherichia coli HM605]
gi|391252395|gb|EIQ11594.1| cupin superfamily protein [Shigella flexneri 2850-71]
gi|391254959|gb|EIQ14114.1| cupin superfamily protein [Shigella flexneri CCH060]
gi|391255804|gb|EIQ14944.1| cupin superfamily protein [Shigella flexneri K-1770]
gi|391270959|gb|EIQ29841.1| cupin superfamily protein [Shigella flexneri K-404]
gi|391271347|gb|EIQ30222.1| cupin superfamily protein [Shigella boydii 965-58]
gi|391286267|gb|EIQ44814.1| cupin superfamily protein [Shigella sonnei 3226-85]
gi|391287838|gb|EIQ46353.1| cupin superfamily protein [Shigella sonnei 3233-85]
gi|391295587|gb|EIQ53731.1| cupin superfamily protein [Shigella sonnei 4822-66]
gi|391307489|gb|EIQ65220.1| cupin superfamily protein [Escherichia coli EPECa12]
gi|397899222|gb|EJL15597.1| cupin superfamily protein [Shigella flexneri 6603-63]
gi|397902254|gb|EJL18579.1| cupin superfamily protein [Shigella sonnei str. Moseley]
gi|404341081|gb|EJZ67493.1| cupin superfamily protein [Shigella flexneri 1485-80]
gi|406778228|gb|AFS57652.1| hypothetical protein O3M_14895 [Escherichia coli O104:H4 str.
2009EL-2050]
gi|407054811|gb|AFS74862.1| hypothetical protein O3K_14920 [Escherichia coli O104:H4 str.
2011C-3493]
gi|407064854|gb|AFS85901.1| hypothetical protein O3O_10700 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|408204953|gb|EKI29859.1| hypothetical protein ECARS42123_1409 [Escherichia coli ARS4.2123]
gi|408216250|gb|EKI40581.1| hypothetical protein EC3006_1448 [Escherichia coli 3006]
gi|408219578|gb|EKI43707.1| hypothetical protein EC07798_1505 [Escherichia coli 07798]
gi|408346497|gb|EKJ60792.1| hypothetical protein EC01288_1172 [Escherichia coli 0.1288]
gi|408458491|gb|EKJ82278.1| cupin family protein [Escherichia coli AD30]
gi|429355398|gb|EKY92088.1| YcfD protein [Escherichia coli O104:H4 str. 11-02030]
gi|429359310|gb|EKY95975.1| YcfD protein [Escherichia coli O104:H4 str. 11-02092]
gi|429363211|gb|EKY99854.1| YcfD protein [Escherichia coli O104:H4 str. 11-02033-1]
gi|429373558|gb|EKZ10102.1| YcfD protein [Escherichia coli O104:H4 str. 11-02093]
gi|429377299|gb|EKZ13823.1| YcfD protein [Escherichia coli O104:H4 str. 11-02281]
gi|429378903|gb|EKZ15410.1| YcfD protein [Escherichia coli O104:H4 str. 11-02318]
gi|429388187|gb|EKZ24613.1| YcfD protein [Escherichia coli O104:H4 str. 11-03439]
gi|429390050|gb|EKZ26466.1| YcfD protein [Escherichia coli O104:H4 str. 11-02913]
gi|429393889|gb|EKZ30276.1| YcfD protein [Escherichia coli O104:H4 str. 11-03943]
gi|429395192|gb|EKZ31560.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9450]
gi|429403876|gb|EKZ40157.1| YcfD protein [Escherichia coli O104:H4 str. 11-04080]
gi|429404994|gb|EKZ41261.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9990]
gi|429408982|gb|EKZ45216.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4984]
gi|429417057|gb|EKZ53208.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4987]
gi|429421885|gb|EKZ58006.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4988]
gi|429423626|gb|EKZ59734.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-4986]
gi|429425697|gb|EKZ61786.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-5603]
gi|429432784|gb|EKZ68821.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-5604]
gi|429442692|gb|EKZ78648.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-6006]
gi|429447596|gb|EKZ83514.1| YcfD protein [Escherichia coli O104:H4 str. Ec12-0466]
gi|429452251|gb|EKZ88137.1| YcfD protein [Escherichia coli O104:H4 str. Ec12-0465]
gi|429454648|gb|EKZ90507.1| YcfD protein [Escherichia coli O104:H4 str. Ec11-9941]
gi|430878983|gb|ELC02344.1| YcfD protein [Escherichia coli KTE4]
gi|430887807|gb|ELC10546.1| YcfD protein [Escherichia coli KTE10]
gi|430889325|gb|ELC11993.1| YcfD protein [Escherichia coli KTE5]
gi|430900167|gb|ELC22188.1| YcfD protein [Escherichia coli KTE12]
gi|430908867|gb|ELC30257.1| YcfD protein [Escherichia coli KTE16]
gi|430910127|gb|ELC31484.1| YcfD protein [Escherichia coli KTE15]
gi|430917302|gb|ELC38350.1| YcfD protein [Escherichia coli KTE25]
gi|430932068|gb|ELC52502.1| YcfD protein [Escherichia coli KTE28]
gi|430936880|gb|ELC57147.1| YcfD protein [Escherichia coli KTE39]
gi|430946836|gb|ELC66759.1| YcfD protein [Escherichia coli KTE178]
gi|430955055|gb|ELC73848.1| YcfD protein [Escherichia coli KTE187]
gi|430965075|gb|ELC82517.1| YcfD protein [Escherichia coli KTE188]
gi|430968037|gb|ELC85271.1| YcfD protein [Escherichia coli KTE189]
gi|430974252|gb|ELC91185.1| YcfD protein [Escherichia coli KTE191]
gi|430982528|gb|ELC99218.1| YcfD protein [Escherichia coli KTE193]
gi|430983620|gb|ELD00276.1| YcfD protein [Escherichia coli KTE201]
gi|430995146|gb|ELD11444.1| YcfD protein [Escherichia coli KTE205]
gi|430999645|gb|ELD15727.1| YcfD protein [Escherichia coli KTE206]
gi|431017402|gb|ELD30912.1| YcfD protein [Escherichia coli KTE212]
gi|431023026|gb|ELD36286.1| YcfD protein [Escherichia coli KTE213]
gi|431026955|gb|ELD40022.1| YcfD protein [Escherichia coli KTE214]
gi|431031009|gb|ELD43913.1| YcfD protein [Escherichia coli KTE216]
gi|431040721|gb|ELD51255.1| YcfD protein [Escherichia coli KTE220]
gi|431043632|gb|ELD53949.1| YcfD protein [Escherichia coli KTE224]
gi|431053850|gb|ELD63451.1| YcfD protein [Escherichia coli KTE230]
gi|431056298|gb|ELD65818.1| YcfD protein [Escherichia coli KTE233]
gi|431062471|gb|ELD71739.1| YcfD protein [Escherichia coli KTE234]
gi|431076227|gb|ELD83732.1| YcfD protein [Escherichia coli KTE236]
gi|431082800|gb|ELD89111.1| YcfD protein [Escherichia coli KTE237]
gi|431085882|gb|ELD91986.1| YcfD protein [Escherichia coli KTE47]
gi|431093690|gb|ELD99355.1| YcfD protein [Escherichia coli KTE49]
gi|431101715|gb|ELE06625.1| YcfD protein [Escherichia coli KTE53]
gi|431110355|gb|ELE14282.1| YcfD protein [Escherichia coli KTE55]
gi|431118294|gb|ELE21314.1| YcfD protein [Escherichia coli KTE57]
gi|431122636|gb|ELE25503.1| YcfD protein [Escherichia coli KTE58]
gi|431132181|gb|ELE34197.1| YcfD protein [Escherichia coli KTE60]
gi|431132772|gb|ELE34771.1| YcfD protein [Escherichia coli KTE62]
gi|431139322|gb|ELE41118.1| YcfD protein [Escherichia coli KTE67]
gi|431142587|gb|ELE44335.1| YcfD protein [Escherichia coli KTE66]
gi|431150052|gb|ELE51110.1| YcfD protein [Escherichia coli KTE72]
gi|431161303|gb|ELE61787.1| YcfD protein [Escherichia coli KTE76]
gi|431182367|gb|ELE82188.1| YcfD protein [Escherichia coli KTE86]
gi|431192303|gb|ELE91653.1| YcfD protein [Escherichia coli KTE87]
gi|431194982|gb|ELE94196.1| YcfD protein [Escherichia coli KTE93]
gi|431212595|gb|ELF10522.1| YcfD protein [Escherichia coli KTE119]
gi|431223793|gb|ELF21039.1| YcfD protein [Escherichia coli KTE143]
gi|431235275|gb|ELF30526.1| YcfD protein [Escherichia coli KTE162]
gi|431245842|gb|ELF40121.1| YcfD protein [Escherichia coli KTE169]
gi|431250652|gb|ELF44711.1| YcfD protein [Escherichia coli KTE6]
gi|431258890|gb|ELF51653.1| YcfD protein [Escherichia coli KTE8]
gi|431265264|gb|ELF56961.1| YcfD protein [Escherichia coli KTE9]
gi|431267686|gb|ELF59203.1| YcfD protein [Escherichia coli KTE17]
gi|431274985|gb|ELF66030.1| YcfD protein [Escherichia coli KTE18]
gi|431276943|gb|ELF67958.1| YcfD protein [Escherichia coli KTE45]
gi|431285474|gb|ELF76310.1| YcfD protein [Escherichia coli KTE23]
gi|431293547|gb|ELF83840.1| YcfD protein [Escherichia coli KTE43]
gi|431304440|gb|ELF92969.1| YcfD protein [Escherichia coli KTE22]
gi|431310277|gb|ELF98469.1| YcfD protein [Escherichia coli KTE46]
gi|431312336|gb|ELG00340.1| YcfD protein [Escherichia coli KTE48]
gi|431329324|gb|ELG16622.1| YcfD protein [Escherichia coli KTE59]
gi|431331448|gb|ELG18711.1| YcfD protein [Escherichia coli KTE63]
gi|431340835|gb|ELG27856.1| YcfD protein [Escherichia coli KTE65]
gi|431340973|gb|ELG27993.1| YcfD protein [Escherichia coli KTE78]
gi|431344507|gb|ELG31445.1| YcfD protein [Escherichia coli KTE79]
gi|431349545|gb|ELG36374.1| YcfD protein [Escherichia coli KTE84]
gi|431355589|gb|ELG42293.1| YcfD protein [Escherichia coli KTE101]
gi|431356860|gb|ELG43550.1| YcfD protein [Escherichia coli KTE91]
gi|431366986|gb|ELG53483.1| YcfD protein [Escherichia coli KTE115]
gi|431369515|gb|ELG55736.1| YcfD protein [Escherichia coli KTE118]
gi|431373720|gb|ELG59323.1| YcfD protein [Escherichia coli KTE123]
gi|431379445|gb|ELG64379.1| YcfD protein [Escherichia coli KTE135]
gi|431387081|gb|ELG71034.1| YcfD protein [Escherichia coli KTE136]
gi|431391311|gb|ELG74959.1| YcfD protein [Escherichia coli KTE140]
gi|431396544|gb|ELG80022.1| YcfD protein [Escherichia coli KTE141]
gi|431401578|gb|ELG84922.1| YcfD protein [Escherichia coli KTE144]
gi|431406959|gb|ELG90178.1| YcfD protein [Escherichia coli KTE146]
gi|431413077|gb|ELG95876.1| YcfD protein [Escherichia coli KTE154]
gi|431424815|gb|ELH06911.1| YcfD protein [Escherichia coli KTE165]
gi|431428752|gb|ELH10693.1| YcfD protein [Escherichia coli KTE192]
gi|431434430|gb|ELH16080.1| YcfD protein [Escherichia coli KTE194]
gi|431439125|gb|ELH20464.1| YcfD protein [Escherichia coli KTE173]
gi|431446790|gb|ELH27533.1| YcfD protein [Escherichia coli KTE175]
gi|431455328|gb|ELH35684.1| YcfD protein [Escherichia coli KTE184]
gi|431465990|gb|ELH46070.1| YcfD protein [Escherichia coli KTE183]
gi|431481269|gb|ELH60983.1| YcfD protein [Escherichia coli KTE209]
gi|431484332|gb|ELH64012.1| YcfD protein [Escherichia coli KTE207]
gi|431493466|gb|ELH73060.1| YcfD protein [Escherichia coli KTE211]
gi|431497071|gb|ELH76649.1| YcfD protein [Escherichia coli KTE217]
gi|431502345|gb|ELH81236.1| YcfD protein [Escherichia coli KTE215]
gi|431509043|gb|ELH87314.1| YcfD protein [Escherichia coli KTE218]
gi|431512730|gb|ELH90820.1| YcfD protein [Escherichia coli KTE223]
gi|431516945|gb|ELH94543.1| YcfD protein [Escherichia coli KTE229]
gi|431517893|gb|ELH95415.1| YcfD protein [Escherichia coli KTE227]
gi|431534017|gb|ELI10508.1| YcfD protein [Escherichia coli KTE104]
gi|431538979|gb|ELI14958.1| YcfD protein [Escherichia coli KTE106]
gi|431544711|gb|ELI19526.1| YcfD protein [Escherichia coli KTE109]
gi|431553337|gb|ELI27264.1| YcfD protein [Escherichia coli KTE113]
gi|431558052|gb|ELI31734.1| YcfD protein [Escherichia coli KTE112]
gi|431573019|gb|ELI45833.1| YcfD protein [Escherichia coli KTE124]
gi|431591241|gb|ELI62240.1| YcfD protein [Escherichia coli KTE129]
gi|431599643|gb|ELI69348.1| YcfD protein [Escherichia coli KTE131]
gi|431604826|gb|ELI74227.1| YcfD protein [Escherichia coli KTE133]
gi|431608415|gb|ELI77758.1| YcfD protein [Escherichia coli KTE137]
gi|431618655|gb|ELI87623.1| YcfD protein [Escherichia coli KTE139]
gi|431621582|gb|ELI90378.1| YcfD protein [Escherichia coli KTE145]
gi|431630154|gb|ELI98495.1| YcfD protein [Escherichia coli KTE150]
gi|431632371|gb|ELJ00660.1| YcfD protein [Escherichia coli KTE148]
gi|431635837|gb|ELJ04005.1| YcfD protein [Escherichia coli KTE153]
gi|431646918|gb|ELJ14408.1| YcfD protein [Escherichia coli KTE157]
gi|431648653|gb|ELJ16029.1| YcfD protein [Escherichia coli KTE160]
gi|431649917|gb|ELJ17256.1| YcfD protein [Escherichia coli KTE163]
gi|431660618|gb|ELJ27487.1| YcfD protein [Escherichia coli KTE166]
gi|431663778|gb|ELJ30533.1| YcfD protein [Escherichia coli KTE167]
gi|431664761|gb|ELJ31494.1| YcfD protein [Escherichia coli KTE168]
gi|431674456|gb|ELJ40618.1| YcfD protein [Escherichia coli KTE174]
gi|431677434|gb|ELJ43511.1| YcfD protein [Escherichia coli KTE176]
gi|431690865|gb|ELJ56339.1| YcfD protein [Escherichia coli KTE179]
gi|431692537|gb|ELJ57973.1| YcfD protein [Escherichia coli KTE180]
gi|431695333|gb|ELJ60648.1| YcfD protein [Escherichia coli KTE232]
gi|431708098|gb|ELJ72623.1| YcfD protein [Escherichia coli KTE88]
gi|431710340|gb|ELJ74765.1| YcfD protein [Escherichia coli KTE85]
gi|431719225|gb|ELJ83285.1| YcfD protein [Escherichia coli KTE90]
gi|431724759|gb|ELJ88675.1| YcfD protein [Escherichia coli KTE95]
gi|431731939|gb|ELJ95400.1| YcfD protein [Escherichia coli KTE97]
gi|431735586|gb|ELJ98944.1| YcfD protein [Escherichia coli KTE99]
gi|432348796|gb|ELL43239.1| hypothetical protein B185_004920 [Escherichia coli J96]
gi|441651133|emb|CCQ03108.1| FIG002776: hypothetical protein [Escherichia coli O5:K4(L):H4 str.
ATCC 23502]
gi|441715297|emb|CCQ04680.1| FIG002776: hypothetical protein [Escherichia coli Nissle 1917]
gi|443421747|gb|AGC86651.1| hypothetical protein APECO78_09600 [Escherichia coli APEC O78]
gi|449323433|gb|EMD13391.1| hypothetical protein A364_06482 [Escherichia coli SEPT362]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|307310095|ref|ZP_07589745.1| Cupin 4 family protein [Escherichia coli W]
gi|378713474|ref|YP_005278367.1| transcription factor jumonji jmjC domain-containing protein
[Escherichia coli KO11FL]
gi|386608488|ref|YP_006123974.1| hypothetical protein ECW_m1236 [Escherichia coli W]
gi|386701900|ref|YP_006165737.1| putative cytoplasmic protein [Escherichia coli KO11FL]
gi|386708937|ref|YP_006172658.1| putative cytoplasmic protein [Escherichia coli W]
gi|306909813|gb|EFN40307.1| Cupin 4 family protein [Escherichia coli W]
gi|315060405|gb|ADT74732.1| conserved protein [Escherichia coli W]
gi|323379035|gb|ADX51303.1| transcription factor jumonji jmjC domain-containing protein
[Escherichia coli KO11FL]
gi|383393427|gb|AFH18385.1| putative cytoplasmic protein [Escherichia coli KO11FL]
gi|383404629|gb|AFH10872.1| putative cytoplasmic protein [Escherichia coli W]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|90111217|ref|NP_415646.4| cupin superfamily protein [Escherichia coli str. K-12 substr.
MG1655]
gi|170080779|ref|YP_001730099.1| hypothetical protein ECDH10B_1200 [Escherichia coli str. K-12
substr. DH10B]
gi|191168343|ref|ZP_03030135.1| cupin family protein [Escherichia coli B7A]
gi|193071450|ref|ZP_03052363.1| cupin family protein [Escherichia coli E110019]
gi|194429544|ref|ZP_03062065.1| cupin family protein [Escherichia coli B171]
gi|209918385|ref|YP_002292469.1| hypothetical protein ECSE_1194 [Escherichia coli SE11]
gi|218553706|ref|YP_002386619.1| hypothetical protein ECIAI1_1166 [Escherichia coli IAI1]
gi|238900382|ref|YP_002926178.1| hypothetical protein BWG_0976 [Escherichia coli BW2952]
gi|260854793|ref|YP_003228684.1| hypothetical protein ECO26_1645 [Escherichia coli O26:H11 str.
11368]
gi|260867557|ref|YP_003233959.1| hypothetical protein ECO111_1476 [Escherichia coli O111:H- str.
11128]
gi|293433416|ref|ZP_06661844.1| ycfD protein [Escherichia coli B088]
gi|301029563|ref|ZP_07192641.1| cupin family protein [Escherichia coli MS 196-1]
gi|386280239|ref|ZP_10057907.1| hypothetical protein ESBG_01540 [Escherichia sp. 4_1_40B]
gi|386596044|ref|YP_006092444.1| cupin [Escherichia coli DH1]
gi|386705150|ref|YP_006168997.1| YcfD protein [Escherichia coli P12b]
gi|387611716|ref|YP_006114832.1| hypothetical protein ETEC_1252 [Escherichia coli ETEC H10407]
gi|387620837|ref|YP_006128464.1| putative cytoplasmic protein [Escherichia coli DH1]
gi|388477208|ref|YP_489396.1| hypothetical protein Y75_p1098 [Escherichia coli str. K-12 substr.
W3110]
gi|404374512|ref|ZP_10979724.1| hypothetical protein ESCG_03189 [Escherichia sp. 1_1_43]
gi|415778016|ref|ZP_11489126.1| cupin family protein [Escherichia coli 3431]
gi|415789197|ref|ZP_11494616.1| cupin family protein [Escherichia coli EPECa14]
gi|415794696|ref|ZP_11496511.1| cupin superfamily protein [Escherichia coli E128010]
gi|415819972|ref|ZP_11509269.1| cupin superfamily protein [Escherichia coli OK1180]
gi|415827210|ref|ZP_11514127.1| cupin superfamily protein [Escherichia coli OK1357]
gi|417134377|ref|ZP_11979162.1| cupin domain protein, PF06172 family [Escherichia coli 5.0588]
gi|417137695|ref|ZP_11981485.1| cupin domain protein, PF06172 family [Escherichia coli 97.0259]
gi|417155461|ref|ZP_11993590.1| cupin domain protein, PF06172 family [Escherichia coli 96.0497]
gi|417172777|ref|ZP_12002810.1| cupin domain protein, PF06172 family [Escherichia coli 3.2608]
gi|417180530|ref|ZP_12008238.1| cupin domain protein, PF06172 family [Escherichia coli 93.0624]
gi|417200789|ref|ZP_12017582.1| cupin domain protein, PF06172 family [Escherichia coli 4.0522]
gi|417205673|ref|ZP_12019171.1| cupin domain protein, PF06172 family [Escherichia coli JB1-95]
gi|417225066|ref|ZP_12028357.1| cupin domain protein, PF06172 family [Escherichia coli 96.154]
gi|417250662|ref|ZP_12042437.1| cupin domain protein, PF06172 family [Escherichia coli 4.0967]
gi|417275044|ref|ZP_12062384.1| cupin domain protein, PF06172 family [Escherichia coli 2.4168]
gi|417278474|ref|ZP_12065789.1| cupin domain protein, PF06172 family [Escherichia coli 3.2303]
gi|417290206|ref|ZP_12077489.1| cupin domain protein, PF06172 family [Escherichia coli B41]
gi|417297791|ref|ZP_12085035.1| cupin domain protein, PF06172 family [Escherichia coli 900105
(10e)]
gi|417307591|ref|ZP_12094458.1| hypothetical protein PPECC33_10300 [Escherichia coli PCN033]
gi|417580413|ref|ZP_12231229.1| cupin superfamily protein [Escherichia coli STEC_B2F1]
gi|417591055|ref|ZP_12241764.1| cupin superfamily protein [Escherichia coli 2534-86]
gi|417601549|ref|ZP_12252127.1| cupin superfamily protein [Escherichia coli STEC_94C]
gi|417612392|ref|ZP_12262860.1| cupin superfamily protein [Escherichia coli STEC_EH250]
gi|417617601|ref|ZP_12268029.1| cupin superfamily protein [Escherichia coli G58-1]
gi|417622660|ref|ZP_12272977.1| cupin superfamily protein [Escherichia coli STEC_H.1.8]
gi|417633935|ref|ZP_12284151.1| cupin superfamily protein [Escherichia coli STEC_S1191]
gi|417666405|ref|ZP_12315960.1| cupin superfamily protein [Escherichia coli STEC_O31]
gi|417944685|ref|ZP_12587925.1| putative cytoplasmic protein [Escherichia coli XH140A]
gi|417977120|ref|ZP_12617907.1| putative cytoplasmic protein [Escherichia coli XH001]
gi|419141773|ref|ZP_13686521.1| cupin superfamily protein [Escherichia coli DEC6A]
gi|419147512|ref|ZP_13692195.1| cupin superfamily protein [Escherichia coli DEC6B]
gi|419153114|ref|ZP_13697695.1| cupin superfamily protein [Escherichia coli DEC6C]
gi|419158521|ref|ZP_13703035.1| cupin superfamily protein [Escherichia coli DEC6D]
gi|419163619|ref|ZP_13708083.1| cupin superfamily protein [Escherichia coli DEC6E]
gi|419196334|ref|ZP_13739734.1| cupin superfamily protein [Escherichia coli DEC8A]
gi|419202626|ref|ZP_13745834.1| cupin superfamily protein [Escherichia coli DEC8B]
gi|419208700|ref|ZP_13751808.1| cupin superfamily protein [Escherichia coli DEC8C]
gi|419214964|ref|ZP_13757982.1| cupin superfamily protein [Escherichia coli DEC8D]
gi|419220647|ref|ZP_13763594.1| cupin superfamily protein [Escherichia coli DEC8E]
gi|419226114|ref|ZP_13768985.1| cupin superfamily protein [Escherichia coli DEC9A]
gi|419231872|ref|ZP_13774658.1| cupin superfamily protein [Escherichia coli DEC9B]
gi|419242671|ref|ZP_13785317.1| cupin superfamily protein [Escherichia coli DEC9D]
gi|419248396|ref|ZP_13790993.1| cupin superfamily protein [Escherichia coli DEC9E]
gi|419254264|ref|ZP_13796792.1| cupin superfamily protein [Escherichia coli DEC10A]
gi|419260382|ref|ZP_13802815.1| cupin superfamily protein [Escherichia coli DEC10B]
gi|419266473|ref|ZP_13808841.1| cupin superfamily protein [Escherichia coli DEC10C]
gi|419271920|ref|ZP_13814230.1| cupin superfamily protein [Escherichia coli DEC10D]
gi|419277414|ref|ZP_13819675.1| cupin superfamily protein [Escherichia coli DEC10E]
gi|419283512|ref|ZP_13825708.1| cupin superfamily protein [Escherichia coli DEC10F]
gi|419288916|ref|ZP_13831015.1| cupin superfamily protein [Escherichia coli DEC11A]
gi|419294200|ref|ZP_13836250.1| cupin superfamily protein [Escherichia coli DEC11B]
gi|419299571|ref|ZP_13841580.1| cupin superfamily protein [Escherichia coli DEC11C]
gi|419305783|ref|ZP_13847691.1| cupin superfamily protein [Escherichia coli DEC11D]
gi|419310806|ref|ZP_13852676.1| cupin superfamily protein [Escherichia coli DEC11E]
gi|419316124|ref|ZP_13857944.1| cupin superfamily protein [Escherichia coli DEC12A]
gi|419322135|ref|ZP_13863858.1| cupin superfamily protein [Escherichia coli DEC12B]
gi|419328207|ref|ZP_13869833.1| cupin superfamily protein [Escherichia coli DEC12C]
gi|419333857|ref|ZP_13875401.1| cupin superfamily protein [Escherichia coli DEC12D]
gi|419339081|ref|ZP_13880564.1| cupin superfamily protein [Escherichia coli DEC12E]
gi|419369456|ref|ZP_13910582.1| cupin superfamily protein [Escherichia coli DEC14A]
gi|419374997|ref|ZP_13916035.1| cupin superfamily protein [Escherichia coli DEC14B]
gi|419380204|ref|ZP_13921170.1| cupin superfamily protein [Escherichia coli DEC14C]
gi|419385551|ref|ZP_13926437.1| cupin superfamily protein [Escherichia coli DEC14D]
gi|419390713|ref|ZP_13931540.1| cupin superfamily protein [Escherichia coli DEC15A]
gi|419395933|ref|ZP_13936712.1| cupin superfamily protein [Escherichia coli DEC15B]
gi|419401311|ref|ZP_13942038.1| cupin superfamily protein [Escherichia coli DEC15C]
gi|419406432|ref|ZP_13947126.1| cupin superfamily protein [Escherichia coli DEC15D]
gi|419412000|ref|ZP_13952663.1| cupin superfamily protein [Escherichia coli DEC15E]
gi|419810913|ref|ZP_14335791.1| cytoplasmic protein [Escherichia coli O32:H37 str. P4]
gi|419867399|ref|ZP_14389726.1| cytoplasmic protein [Escherichia coli O103:H25 str. CVM9340]
gi|419869802|ref|ZP_14391983.1| cytoplasmic protein [Escherichia coli O103:H2 str. CVM9450]
gi|419873586|ref|ZP_14395568.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9534]
gi|419880031|ref|ZP_14401444.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9545]
gi|419892533|ref|ZP_14412551.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9570]
gi|419894477|ref|ZP_14414384.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9574]
gi|419928396|ref|ZP_14446109.1| putative cytoplasmic protein [Escherichia coli 541-1]
gi|419940688|ref|ZP_14457413.1| putative cytoplasmic protein [Escherichia coli 75]
gi|419951807|ref|ZP_14467989.1| putative cytoplasmic protein [Escherichia coli CUMT8]
gi|420091191|ref|ZP_14602946.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9602]
gi|420095320|ref|ZP_14606833.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9634]
gi|420106338|ref|ZP_14616750.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9553]
gi|420117229|ref|ZP_14626595.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10021]
gi|420120670|ref|ZP_14629857.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10030]
gi|420129810|ref|ZP_14638330.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10224]
gi|420136162|ref|ZP_14644225.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM9952]
gi|420390738|ref|ZP_14890002.1| cupin superfamily protein [Escherichia coli EPEC C342-62]
gi|422762806|ref|ZP_16816562.1| cupin superfamily protein [Escherichia coli E1167]
gi|422765694|ref|ZP_16819421.1| cupin superfamily protein [Escherichia coli E1520]
gi|422770360|ref|ZP_16824051.1| cupin superfamily protein [Escherichia coli E482]
gi|422817530|ref|ZP_16865744.1| hypothetical protein ESMG_02056 [Escherichia coli M919]
gi|422958689|ref|ZP_16970620.1| hypothetical protein ESQG_02115 [Escherichia coli H494]
gi|423701973|ref|ZP_17676432.1| hypothetical protein ESSG_01504 [Escherichia coli H730]
gi|424748811|ref|ZP_18176938.1| cytoplasmic protein [Escherichia coli O26:H11 str. CFSAN001629]
gi|424760441|ref|ZP_18188068.1| cytoplasmic protein [Escherichia coli O111:H11 str. CFSAN001630]
gi|424771857|ref|ZP_18198981.1| cytoplasmic protein [Escherichia coli O111:H8 str. CFSAN001632]
gi|425114495|ref|ZP_18516313.1| hypothetical protein EC80566_1158 [Escherichia coli 8.0566]
gi|425119205|ref|ZP_18520921.1| cupin superfamily protein [Escherichia coli 8.0569]
gi|425272131|ref|ZP_18663601.1| hypothetical protein ECTW15901_1387 [Escherichia coli TW15901]
gi|425282637|ref|ZP_18673725.1| hypothetical protein ECTW00353_1274 [Escherichia coli TW00353]
gi|425378537|ref|ZP_18762771.1| hypothetical protein ECEC1865_1692 [Escherichia coli EC1865]
gi|432391150|ref|ZP_19634008.1| YcfD protein [Escherichia coli KTE21]
gi|432416177|ref|ZP_19658799.1| YcfD protein [Escherichia coli KTE44]
gi|432480552|ref|ZP_19722513.1| YcfD protein [Escherichia coli KTE210]
gi|432563189|ref|ZP_19799807.1| YcfD protein [Escherichia coli KTE51]
gi|432579828|ref|ZP_19816258.1| YcfD protein [Escherichia coli KTE56]
gi|432626725|ref|ZP_19862706.1| YcfD protein [Escherichia coli KTE77]
gi|432636393|ref|ZP_19872275.1| YcfD protein [Escherichia coli KTE81]
gi|432660348|ref|ZP_19895998.1| YcfD protein [Escherichia coli KTE111]
gi|432684958|ref|ZP_19920266.1| YcfD protein [Escherichia coli KTE156]
gi|432691048|ref|ZP_19926286.1| YcfD protein [Escherichia coli KTE161]
gi|432703742|ref|ZP_19938859.1| YcfD protein [Escherichia coli KTE171]
gi|432736623|ref|ZP_19971393.1| YcfD protein [Escherichia coli KTE42]
gi|432749603|ref|ZP_19984215.1| YcfD protein [Escherichia coli KTE29]
gi|432769953|ref|ZP_20004305.1| YcfD protein [Escherichia coli KTE50]
gi|432874285|ref|ZP_20093422.1| YcfD protein [Escherichia coli KTE147]
gi|432954419|ref|ZP_20146538.1| YcfD protein [Escherichia coli KTE197]
gi|432960681|ref|ZP_20150801.1| YcfD protein [Escherichia coli KTE202]
gi|432967269|ref|ZP_20156185.1| YcfD protein [Escherichia coli KTE203]
gi|433047270|ref|ZP_20234674.1| YcfD protein [Escherichia coli KTE120]
gi|433062354|ref|ZP_20249306.1| YcfD protein [Escherichia coli KTE125]
gi|433091452|ref|ZP_20277744.1| YcfD protein [Escherichia coli KTE138]
gi|442590699|ref|ZP_21009458.1| FIG002776: hypothetical protein [Escherichia coli O10:K5(L):H4 str.
ATCC 23506]
gi|450213320|ref|ZP_21894854.1| cytoplasmic protein [Escherichia coli O08]
gi|450241873|ref|ZP_21899707.1| cytoplasmic protein [Escherichia coli S17]
gi|2506643|sp|P27431.2|YCFD_ECOLI RecName: Full=50S ribosomal protein L16 arginine hydroxylase;
AltName: Full=Ribosomal oxygenase YcfD; Short=ROX
gi|4062695|dbj|BAA35950.1| conserved hypothetical protein [Escherichia coli str. K12 substr.
W3110]
gi|87081836|gb|AAC74212.2| cupin superfamily protein [Escherichia coli str. K-12 substr.
MG1655]
gi|169888614|gb|ACB02321.1| conserved protein [Escherichia coli str. K-12 substr. DH10B]
gi|190901638|gb|EDV61395.1| cupin family protein [Escherichia coli B7A]
gi|192955219|gb|EDV85709.1| cupin family protein [Escherichia coli E110019]
gi|194412418|gb|EDX28719.1| cupin family protein [Escherichia coli B171]
gi|209911644|dbj|BAG76718.1| conserved hypothetical protein [Escherichia coli SE11]
gi|218360474|emb|CAQ98028.1| conserved hypothetical protein [Escherichia coli IAI1]
gi|238860298|gb|ACR62296.1| conserved protein [Escherichia coli BW2952]
gi|257753442|dbj|BAI24944.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368]
gi|257763913|dbj|BAI35408.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
gi|260449733|gb|ACX40155.1| Cupin 4 family protein [Escherichia coli DH1]
gi|291324235|gb|EFE63657.1| ycfD protein [Escherichia coli B088]
gi|299877558|gb|EFI85769.1| cupin family protein [Escherichia coli MS 196-1]
gi|309701452|emb|CBJ00756.1| conserved hypothetical protein [Escherichia coli ETEC H10407]
gi|315135760|dbj|BAJ42919.1| putative cytoplasmic protein [Escherichia coli DH1]
gi|315616014|gb|EFU96640.1| cupin family protein [Escherichia coli 3431]
gi|323153859|gb|EFZ40094.1| cupin family protein [Escherichia coli EPECa14]
gi|323163672|gb|EFZ49494.1| cupin superfamily protein [Escherichia coli E128010]
gi|323179336|gb|EFZ64906.1| cupin superfamily protein [Escherichia coli OK1180]
gi|323185688|gb|EFZ71049.1| cupin superfamily protein [Escherichia coli OK1357]
gi|323937868|gb|EGB34132.1| cupin superfamily protein [Escherichia coli E1520]
gi|323942598|gb|EGB38765.1| cupin superfamily protein [Escherichia coli E482]
gi|324117301|gb|EGC11208.1| cupin superfamily protein [Escherichia coli E1167]
gi|338770967|gb|EGP25720.1| hypothetical protein PPECC33_10300 [Escherichia coli PCN033]
gi|342363508|gb|EGU27615.1| putative cytoplasmic protein [Escherichia coli XH140A]
gi|344193160|gb|EGV47243.1| putative cytoplasmic protein [Escherichia coli XH001]
gi|345342072|gb|EGW74470.1| cupin superfamily protein [Escherichia coli STEC_B2F1]
gi|345343290|gb|EGW75678.1| cupin superfamily protein [Escherichia coli 2534-86]
gi|345352152|gb|EGW84402.1| cupin superfamily protein [Escherichia coli STEC_94C]
gi|345364344|gb|EGW96469.1| cupin superfamily protein [Escherichia coli STEC_EH250]
gi|345379738|gb|EGX11646.1| cupin superfamily protein [Escherichia coli G58-1]
gi|345383846|gb|EGX13717.1| cupin superfamily protein [Escherichia coli STEC_H.1.8]
gi|345389242|gb|EGX19048.1| cupin superfamily protein [Escherichia coli STEC_S1191]
gi|359331781|dbj|BAL38228.1| conserved protein [Escherichia coli str. K-12 substr. MDS42]
gi|371596518|gb|EHN85356.1| hypothetical protein ESQG_02115 [Escherichia coli H494]
gi|377997429|gb|EHV60533.1| cupin superfamily protein [Escherichia coli DEC6A]
gi|377998531|gb|EHV61622.1| cupin superfamily protein [Escherichia coli DEC6B]
gi|378001801|gb|EHV64858.1| cupin superfamily protein [Escherichia coli DEC6C]
gi|378011450|gb|EHV74393.1| cupin superfamily protein [Escherichia coli DEC6D]
gi|378013192|gb|EHV76112.1| cupin superfamily protein [Escherichia coli DEC6E]
gi|378050307|gb|EHW12636.1| cupin superfamily protein [Escherichia coli DEC8A]
gi|378053698|gb|EHW15993.1| cupin superfamily protein [Escherichia coli DEC8B]
gi|378058207|gb|EHW20426.1| cupin superfamily protein [Escherichia coli DEC8C]
gi|378065705|gb|EHW27847.1| cupin superfamily protein [Escherichia coli DEC8D]
gi|378070233|gb|EHW32314.1| cupin superfamily protein [Escherichia coli DEC8E]
gi|378077646|gb|EHW39639.1| cupin superfamily protein [Escherichia coli DEC9A]
gi|378080633|gb|EHW42594.1| cupin superfamily protein [Escherichia coli DEC9B]
gi|378093317|gb|EHW55130.1| cupin superfamily protein [Escherichia coli DEC9D]
gi|378098138|gb|EHW59880.1| cupin superfamily protein [Escherichia coli DEC9E]
gi|378102921|gb|EHW64592.1| cupin superfamily protein [Escherichia coli DEC10A]
gi|378110066|gb|EHW71662.1| cupin superfamily protein [Escherichia coli DEC10B]
gi|378114650|gb|EHW76202.1| cupin superfamily protein [Escherichia coli DEC10C]
gi|378119787|gb|EHW81275.1| cupin superfamily protein [Escherichia coli DEC10D]
gi|378132583|gb|EHW93935.1| cupin superfamily protein [Escherichia coli DEC10E]
gi|378133349|gb|EHW94694.1| cupin superfamily protein [Escherichia coli DEC11A]
gi|378136145|gb|EHW97444.1| cupin superfamily protein [Escherichia coli DEC10F]
gi|378143753|gb|EHX04938.1| cupin superfamily protein [Escherichia coli DEC11B]
gi|378151028|gb|EHX12141.1| cupin superfamily protein [Escherichia coli DEC11D]
gi|378153954|gb|EHX15031.1| cupin superfamily protein [Escherichia coli DEC11C]
gi|378159404|gb|EHX20408.1| cupin superfamily protein [Escherichia coli DEC11E]
gi|378171058|gb|EHX31930.1| cupin superfamily protein [Escherichia coli DEC12B]
gi|378172817|gb|EHX33664.1| cupin superfamily protein [Escherichia coli DEC12A]
gi|378174309|gb|EHX35135.1| cupin superfamily protein [Escherichia coli DEC12C]
gi|378186070|gb|EHX46693.1| cupin superfamily protein [Escherichia coli DEC12D]
gi|378192615|gb|EHX53170.1| cupin superfamily protein [Escherichia coli DEC12E]
gi|378221131|gb|EHX81382.1| cupin superfamily protein [Escherichia coli DEC14A]
gi|378223099|gb|EHX83329.1| cupin superfamily protein [Escherichia coli DEC14B]
gi|378230753|gb|EHX90867.1| cupin superfamily protein [Escherichia coli DEC14C]
gi|378233830|gb|EHX93913.1| cupin superfamily protein [Escherichia coli DEC14D]
gi|378240943|gb|EHY00912.1| cupin superfamily protein [Escherichia coli DEC15A]
gi|378248271|gb|EHY08185.1| cupin superfamily protein [Escherichia coli DEC15B]
gi|378248965|gb|EHY08875.1| cupin superfamily protein [Escherichia coli DEC15C]
gi|378256204|gb|EHY16056.1| cupin superfamily protein [Escherichia coli DEC15D]
gi|378260188|gb|EHY19993.1| cupin superfamily protein [Escherichia coli DEC15E]
gi|383103318|gb|AFG40827.1| YcfD protein [Escherichia coli P12b]
gi|385156207|gb|EIF18205.1| cytoplasmic protein [Escherichia coli O32:H37 str. P4]
gi|385539037|gb|EIF85879.1| hypothetical protein ESMG_02056 [Escherichia coli M919]
gi|385711376|gb|EIG48335.1| hypothetical protein ESSG_01504 [Escherichia coli H730]
gi|386122615|gb|EIG71224.1| hypothetical protein ESBG_01540 [Escherichia sp. 4_1_40B]
gi|386152231|gb|EIH03520.1| cupin domain protein, PF06172 family [Escherichia coli 5.0588]
gi|386159259|gb|EIH15592.1| cupin domain protein, PF06172 family [Escherichia coli 97.0259]
gi|386168550|gb|EIH35066.1| cupin domain protein, PF06172 family [Escherichia coli 96.0497]
gi|386180475|gb|EIH57949.1| cupin domain protein, PF06172 family [Escherichia coli 3.2608]
gi|386185885|gb|EIH68611.1| cupin domain protein, PF06172 family [Escherichia coli 93.0624]
gi|386187574|gb|EIH76392.1| cupin domain protein, PF06172 family [Escherichia coli 4.0522]
gi|386197930|gb|EIH92124.1| cupin domain protein, PF06172 family [Escherichia coli JB1-95]
gi|386200114|gb|EIH99105.1| cupin domain protein, PF06172 family [Escherichia coli 96.154]
gi|386219221|gb|EII35694.1| cupin domain protein, PF06172 family [Escherichia coli 4.0967]
gi|386233472|gb|EII65457.1| cupin domain protein, PF06172 family [Escherichia coli 2.4168]
gi|386238727|gb|EII75662.1| cupin domain protein, PF06172 family [Escherichia coli 3.2303]
gi|386256244|gb|EIJ05932.1| cupin domain protein, PF06172 family [Escherichia coli B41]
gi|386259003|gb|EIJ14480.1| cupin domain protein, PF06172 family [Escherichia coli 900105
(10e)]
gi|388332654|gb|EIK99319.1| cytoplasmic protein [Escherichia coli O103:H25 str. CVM9340]
gi|388341408|gb|EIL07519.1| cytoplasmic protein [Escherichia coli O103:H2 str. CVM9450]
gi|388347533|gb|EIL13199.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9570]
gi|388352382|gb|EIL17502.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9534]
gi|388363916|gb|EIL27814.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9574]
gi|388370058|gb|EIL33611.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9545]
gi|388402359|gb|EIL62932.1| putative cytoplasmic protein [Escherichia coli 75]
gi|388405677|gb|EIL66100.1| putative cytoplasmic protein [Escherichia coli 541-1]
gi|388413648|gb|EIL73638.1| putative cytoplasmic protein [Escherichia coli CUMT8]
gi|391313927|gb|EIQ71494.1| cupin superfamily protein [Escherichia coli EPEC C342-62]
gi|394381661|gb|EJE59337.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10224]
gi|394384048|gb|EJE61621.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9602]
gi|394393103|gb|EJE69797.1| cytoplasmic protein [Escherichia coli O111:H8 str. CVM9634]
gi|394402398|gb|EJE78121.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10021]
gi|394416322|gb|EJE90121.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9553]
gi|394418855|gb|EJE92505.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM9952]
gi|394428576|gb|EJF01107.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10030]
gi|397785869|gb|EJK96712.1| cupin superfamily protein [Escherichia coli STEC_O31]
gi|404292010|gb|EJZ48855.1| hypothetical protein ESCG_03189 [Escherichia sp. 1_1_43]
gi|408195640|gb|EKI21010.1| hypothetical protein ECTW15901_1387 [Escherichia coli TW15901]
gi|408204607|gb|EKI29551.1| hypothetical protein ECTW00353_1274 [Escherichia coli TW00353]
gi|408301968|gb|EKJ19518.1| hypothetical protein ECEC1865_1692 [Escherichia coli EC1865]
gi|408571483|gb|EKK47422.1| hypothetical protein EC80566_1158 [Escherichia coli 8.0566]
gi|408572441|gb|EKK48350.1| cupin superfamily protein [Escherichia coli 8.0569]
gi|421939747|gb|EKT97261.1| cytoplasmic protein [Escherichia coli O111:H8 str. CFSAN001632]
gi|421943433|gb|EKU00718.1| cytoplasmic protein [Escherichia coli O26:H11 str. CFSAN001629]
gi|421946016|gb|EKU03180.1| cytoplasmic protein [Escherichia coli O111:H11 str. CFSAN001630]
gi|430921768|gb|ELC42592.1| YcfD protein [Escherichia coli KTE21]
gi|430941958|gb|ELC62098.1| YcfD protein [Escherichia coli KTE44]
gi|431009499|gb|ELD24119.1| YcfD protein [Escherichia coli KTE210]
gi|431096703|gb|ELE02164.1| YcfD protein [Escherichia coli KTE51]
gi|431107230|gb|ELE11416.1| YcfD protein [Escherichia coli KTE56]
gi|431164673|gb|ELE65064.1| YcfD protein [Escherichia coli KTE77]
gi|431173287|gb|ELE73368.1| YcfD protein [Escherichia coli KTE81]
gi|431202220|gb|ELF00916.1| YcfD protein [Escherichia coli KTE111]
gi|431223525|gb|ELF20772.1| YcfD protein [Escherichia coli KTE156]
gi|431228842|gb|ELF25505.1| YcfD protein [Escherichia coli KTE161]
gi|431245569|gb|ELF39854.1| YcfD protein [Escherichia coli KTE171]
gi|431285008|gb|ELF75849.1| YcfD protein [Escherichia coli KTE42]
gi|431298893|gb|ELF88517.1| YcfD protein [Escherichia coli KTE29]
gi|431317410|gb|ELG05190.1| YcfD protein [Escherichia coli KTE50]
gi|431404271|gb|ELG87529.1| YcfD protein [Escherichia coli KTE147]
gi|431469717|gb|ELH49646.1| YcfD protein [Escherichia coli KTE197]
gi|431473241|gb|ELH53075.1| YcfD protein [Escherichia coli KTE203]
gi|431477888|gb|ELH57650.1| YcfD protein [Escherichia coli KTE202]
gi|431569915|gb|ELI42845.1| YcfD protein [Escherichia coli KTE120]
gi|431585416|gb|ELI57365.1| YcfD protein [Escherichia coli KTE125]
gi|431612993|gb|ELI82198.1| YcfD protein [Escherichia coli KTE138]
gi|441608967|emb|CCP95371.1| FIG002776: hypothetical protein [Escherichia coli O10:K5(L):H4 str.
ATCC 23506]
gi|449321002|gb|EMD11021.1| cytoplasmic protein [Escherichia coli O08]
gi|449323348|gb|EMD13309.1| cytoplasmic protein [Escherichia coli S17]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|417628130|ref|ZP_12278377.1| cupin superfamily protein [Escherichia coli STEC_MHI813]
gi|345378434|gb|EGX10365.1| cupin superfamily protein [Escherichia coli STEC_MHI813]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|218704602|ref|YP_002412121.1| hypothetical protein ECUMN_1372 [Escherichia coli UMN026]
gi|293404483|ref|ZP_06648477.1| conserved hypothetical protein [Escherichia coli FVEC1412]
gi|298380259|ref|ZP_06989864.1| ycfD protein [Escherichia coli FVEC1302]
gi|419933745|ref|ZP_14450911.1| hypothetical protein EC5761_08660 [Escherichia coli 576-1]
gi|422335730|ref|ZP_16416723.1| hypothetical protein HMPREF0986_05217 [Escherichia coli 4_1_47FAA]
gi|432353143|ref|ZP_19596420.1| YcfD protein [Escherichia coli KTE2]
gi|432401368|ref|ZP_19644122.1| YcfD protein [Escherichia coli KTE26]
gi|432425424|ref|ZP_19667938.1| YcfD protein [Escherichia coli KTE181]
gi|432460196|ref|ZP_19702352.1| YcfD protein [Escherichia coli KTE204]
gi|432475288|ref|ZP_19717293.1| YcfD protein [Escherichia coli KTE208]
gi|432521801|ref|ZP_19758956.1| YcfD protein [Escherichia coli KTE228]
gi|432537234|ref|ZP_19774151.1| YcfD protein [Escherichia coli KTE235]
gi|432630740|ref|ZP_19866683.1| YcfD protein [Escherichia coli KTE80]
gi|432640391|ref|ZP_19876229.1| YcfD protein [Escherichia coli KTE83]
gi|432665466|ref|ZP_19901050.1| YcfD protein [Escherichia coli KTE116]
gi|432774304|ref|ZP_20008588.1| YcfD protein [Escherichia coli KTE54]
gi|432885596|ref|ZP_20100117.1| YcfD protein [Escherichia coli KTE158]
gi|432911672|ref|ZP_20117947.1| YcfD protein [Escherichia coli KTE190]
gi|433018084|ref|ZP_20206341.1| YcfD protein [Escherichia coli KTE105]
gi|433052499|ref|ZP_20239717.1| YcfD protein [Escherichia coli KTE122]
gi|433067444|ref|ZP_20254262.1| YcfD protein [Escherichia coli KTE128]
gi|433158095|ref|ZP_20342956.1| YcfD protein [Escherichia coli KTE177]
gi|433177659|ref|ZP_20362102.1| YcfD protein [Escherichia coli KTE82]
gi|218431699|emb|CAR12581.1| conserved hypothetical protein [Escherichia coli UMN026]
gi|291429069|gb|EFF02094.1| conserved hypothetical protein [Escherichia coli FVEC1412]
gi|298279957|gb|EFI21465.1| ycfD protein [Escherichia coli FVEC1302]
gi|373243240|gb|EHP62754.1| hypothetical protein HMPREF0986_05217 [Escherichia coli 4_1_47FAA]
gi|388410658|gb|EIL70870.1| hypothetical protein EC5761_08660 [Escherichia coli 576-1]
gi|430876855|gb|ELC00325.1| YcfD protein [Escherichia coli KTE2]
gi|430927966|gb|ELC48529.1| YcfD protein [Escherichia coli KTE26]
gi|430958173|gb|ELC76770.1| YcfD protein [Escherichia coli KTE181]
gi|430990854|gb|ELD07275.1| YcfD protein [Escherichia coli KTE204]
gi|431007288|gb|ELD22100.1| YcfD protein [Escherichia coli KTE208]
gi|431043944|gb|ELD54225.1| YcfD protein [Escherichia coli KTE228]
gi|431072811|gb|ELD80562.1| YcfD protein [Escherichia coli KTE235]
gi|431172869|gb|ELE72951.1| YcfD protein [Escherichia coli KTE80]
gi|431183747|gb|ELE83521.1| YcfD protein [Escherichia coli KTE83]
gi|431203101|gb|ELF01778.1| YcfD protein [Escherichia coli KTE116]
gi|431319649|gb|ELG07319.1| YcfD protein [Escherichia coli KTE54]
gi|431418642|gb|ELH01037.1| YcfD protein [Escherichia coli KTE158]
gi|431443379|gb|ELH24456.1| YcfD protein [Escherichia coli KTE190]
gi|431535398|gb|ELI11778.1| YcfD protein [Escherichia coli KTE105]
gi|431574102|gb|ELI46886.1| YcfD protein [Escherichia coli KTE122]
gi|431588508|gb|ELI59783.1| YcfD protein [Escherichia coli KTE128]
gi|431680658|gb|ELJ46481.1| YcfD protein [Escherichia coli KTE177]
gi|431708250|gb|ELJ72765.1| YcfD protein [Escherichia coli KTE82]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|417596243|ref|ZP_12246896.1| cupin superfamily protein [Escherichia coli 3030-1]
gi|345356953|gb|EGW89152.1| cupin superfamily protein [Escherichia coli 3030-1]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLDETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|420099893|ref|ZP_14611098.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9455]
gi|394422149|gb|EJE95541.1| cytoplasmic protein [Escherichia coli O111:H11 str. CVM9455]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|419909946|ref|ZP_14428479.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10026]
gi|388372311|gb|EIL35745.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM10026]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|419903240|ref|ZP_14422333.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM9942]
gi|388371848|gb|EIL35302.1| cytoplasmic protein [Escherichia coli O26:H11 str. CVM9942]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|156362161|ref|XP_001625649.1| predicted protein [Nematostella vectensis]
gi|156212492|gb|EDO33549.1| predicted protein [Nematostella vectensis]
Length = 368
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 14/98 (14%)
Query: 207 AWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQAD----------R 256
AW + A+ A++ A +D++FRRGLPI ++ GL +DIQ D
Sbjct: 1 AWGHFISTALTDAIEKAMESDVKFRRGLPINFVSQLGLG----MDIQGDDEKAIASKKEN 56
Query: 257 LAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPP 294
+ K+ LK+LL LV++VD ++ D+M M LPP
Sbjct: 57 ESFKKQLKELLIALVDHVDANETSDKMASDFMASRLPP 94
>gi|417264874|ref|ZP_12052256.1| cupin domain protein, PF06172 family [Escherichia coli 2.3916]
gi|418302295|ref|ZP_12914089.1| cupin superfamily protein [Escherichia coli UMNF18]
gi|339414393|gb|AEJ56065.1| cupin superfamily protein [Escherichia coli UMNF18]
gi|386221434|gb|EII43875.1| cupin domain protein, PF06172 family [Escherichia coli 2.3916]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|417690009|ref|ZP_12339235.1| cupin superfamily protein [Shigella boydii 5216-82]
gi|332089270|gb|EGI94376.1| cupin superfamily protein [Shigella boydii 5216-82]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|417711927|ref|ZP_12360922.1| cupin superfamily protein [Shigella flexneri K-272]
gi|417716570|ref|ZP_12365498.1| cupin superfamily protein [Shigella flexneri K-227]
gi|333008335|gb|EGK27809.1| cupin superfamily protein [Shigella flexneri K-272]
gi|333019823|gb|EGK39095.1| cupin superfamily protein [Shigella flexneri K-227]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|293414419|ref|ZP_06657068.1| ycfD protein [Escherichia coli B185]
gi|291434477|gb|EFF07450.1| ycfD protein [Escherichia coli B185]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|260843427|ref|YP_003221205.1| cytoplasmic protein [Escherichia coli O103:H2 str. 12009]
gi|300922658|ref|ZP_07138755.1| cupin family protein [Escherichia coli MS 182-1]
gi|300949476|ref|ZP_07163474.1| cupin family protein [Escherichia coli MS 116-1]
gi|300955534|ref|ZP_07167896.1| cupin family protein [Escherichia coli MS 175-1]
gi|301328497|ref|ZP_07221563.1| cupin family protein [Escherichia coli MS 78-1]
gi|301644546|ref|ZP_07244539.1| cupin family protein [Escherichia coli MS 146-1]
gi|309796526|ref|ZP_07690933.1| cupin family protein [Escherichia coli MS 145-7]
gi|331641672|ref|ZP_08342807.1| YcfD protein [Escherichia coli H736]
gi|418958539|ref|ZP_13510451.1| cupin family protein [Escherichia coli J53]
gi|422354516|ref|ZP_16435251.1| cupin family protein [Escherichia coli MS 117-3]
gi|195182815|dbj|BAG66391.1| predicted protein [Escherichia coli O111:H-]
gi|257758574|dbj|BAI30071.1| putative cytoplasmic protein [Escherichia coli O103:H2 str. 12009]
gi|300317573|gb|EFJ67357.1| cupin family protein [Escherichia coli MS 175-1]
gi|300421007|gb|EFK04318.1| cupin family protein [Escherichia coli MS 182-1]
gi|300451112|gb|EFK14732.1| cupin family protein [Escherichia coli MS 116-1]
gi|300845104|gb|EFK72864.1| cupin family protein [Escherichia coli MS 78-1]
gi|301077128|gb|EFK91934.1| cupin family protein [Escherichia coli MS 146-1]
gi|308119838|gb|EFO57100.1| cupin family protein [Escherichia coli MS 145-7]
gi|324017500|gb|EGB86719.1| cupin family protein [Escherichia coli MS 117-3]
gi|331038470|gb|EGI10690.1| YcfD protein [Escherichia coli H736]
gi|384378753|gb|EIE36632.1| cupin family protein [Escherichia coli J53]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|38703940|ref|NP_309600.2| hypothetical protein ECs1573 [Escherichia coli O157:H7 str. Sakai]
gi|168749710|ref|ZP_02774732.1| cupin family protein [Escherichia coli O157:H7 str. EC4113]
gi|168756467|ref|ZP_02781474.1| cupin family protein [Escherichia coli O157:H7 str. EC4401]
gi|168762519|ref|ZP_02787526.1| cupin family protein [Escherichia coli O157:H7 str. EC4501]
gi|168770975|ref|ZP_02795982.1| cupin family protein [Escherichia coli O157:H7 str. EC4486]
gi|168776081|ref|ZP_02801088.1| cupin family protein [Escherichia coli O157:H7 str. EC4196]
gi|168782445|ref|ZP_02807452.1| cupin family protein [Escherichia coli O157:H7 str. EC4076]
gi|168787580|ref|ZP_02812587.1| cupin family protein [Escherichia coli O157:H7 str. EC869]
gi|168800481|ref|ZP_02825488.1| cupin family protein [Escherichia coli O157:H7 str. EC508]
gi|208809504|ref|ZP_03251841.1| cupin family protein [Escherichia coli O157:H7 str. EC4206]
gi|208815823|ref|ZP_03257002.1| cupin family protein [Escherichia coli O157:H7 str. EC4045]
gi|208822630|ref|ZP_03262949.1| cupin family protein [Escherichia coli O157:H7 str. EC4042]
gi|209395797|ref|YP_002270028.1| cupin family protein [Escherichia coli O157:H7 str. EC4115]
gi|217328000|ref|ZP_03444082.1| cupin family protein [Escherichia coli O157:H7 str. TW14588]
gi|254792568|ref|YP_003077405.1| hypothetical protein ECSP_1488 [Escherichia coli O157:H7 str.
TW14359]
gi|261227211|ref|ZP_05941492.1| hypothetical protein EscherichiacoliO157_21828 [Escherichia coli
O157:H7 str. FRIK2000]
gi|261258783|ref|ZP_05951316.1| hypothetical protein EscherichiacoliO157EcO_23686 [Escherichia coli
O157:H7 str. FRIK966]
gi|387506255|ref|YP_006158511.1| hypothetical protein ECO55CA74_06765 [Escherichia coli O55:H7 str.
RM12579]
gi|387882072|ref|YP_006312374.1| hypothetical protein CDCO157_1505 [Escherichia coli Xuzhou21]
gi|416314246|ref|ZP_11658585.1| cupin family protein [Escherichia coli O157:H7 str. 1044]
gi|416323935|ref|ZP_11665166.1| hypothetical protein ECoD_05544 [Escherichia coli O157:H7 str.
EC1212]
gi|416328815|ref|ZP_11668403.1| hypothetical protein ECF_03346 [Escherichia coli O157:H7 str. 1125]
gi|416782286|ref|ZP_11877723.1| hypothetical protein ECO5101_10737 [Escherichia coli O157:H7 str.
G5101]
gi|416793484|ref|ZP_11882645.1| hypothetical protein ECO9389_04211 [Escherichia coli O157:H- str.
493-89]
gi|416804751|ref|ZP_11887506.1| hypothetical protein ECO2687_20326 [Escherichia coli O157:H- str. H
2687]
gi|416815872|ref|ZP_11892210.1| hypothetical protein ECO7815_16663 [Escherichia coli O55:H7 str.
3256-97]
gi|416825690|ref|ZP_11896799.1| hypothetical protein ECO5905_20143 [Escherichia coli O55:H7 str.
USDA 5905]
gi|419044329|ref|ZP_13591295.1| cupin superfamily protein [Escherichia coli DEC3A]
gi|419050255|ref|ZP_13597156.1| cupin superfamily protein [Escherichia coli DEC3B]
gi|419056418|ref|ZP_13603252.1| cupin superfamily protein [Escherichia coli DEC3C]
gi|419061817|ref|ZP_13608577.1| cupin superfamily protein [Escherichia coli DEC3D]
gi|419068488|ref|ZP_13614347.1| cupin superfamily protein [Escherichia coli DEC3E]
gi|419074609|ref|ZP_13620162.1| cupin superfamily protein [Escherichia coli DEC3F]
gi|419079827|ref|ZP_13625303.1| cupin superfamily protein [Escherichia coli DEC4A]
gi|419085518|ref|ZP_13630911.1| cupin superfamily protein [Escherichia coli DEC4B]
gi|419091511|ref|ZP_13636823.1| cupin superfamily protein [Escherichia coli DEC4C]
gi|419097540|ref|ZP_13642771.1| cupin superfamily protein [Escherichia coli DEC4D]
gi|419103281|ref|ZP_13648439.1| cupin superfamily protein [Escherichia coli DEC4E]
gi|419108703|ref|ZP_13653796.1| cupin superfamily protein [Escherichia coli DEC4F]
gi|419114088|ref|ZP_13659118.1| cupin superfamily protein [Escherichia coli DEC5A]
gi|419119729|ref|ZP_13664707.1| cupin superfamily protein [Escherichia coli DEC5B]
gi|419125552|ref|ZP_13670447.1| cupin superfamily protein [Escherichia coli DEC5C]
gi|419130974|ref|ZP_13675821.1| cupin superfamily protein [Escherichia coli DEC5D]
gi|419135815|ref|ZP_13680620.1| cupin superfamily protein [Escherichia coli DEC5E]
gi|420268773|ref|ZP_14771167.1| hypothetical protein ECPA22_1679 [Escherichia coli PA22]
gi|420274686|ref|ZP_14777003.1| hypothetical protein ECPA40_1935 [Escherichia coli PA40]
gi|420279256|ref|ZP_14781521.1| hypothetical protein ECTW06591_0636 [Escherichia coli TW06591]
gi|420285914|ref|ZP_14788123.1| hypothetical protein ECTW10246_1732 [Escherichia coli TW10246]
gi|420297365|ref|ZP_14799445.1| hypothetical protein ECTW09109_1830 [Escherichia coli TW09109]
gi|420303446|ref|ZP_14805462.1| hypothetical protein ECTW10119_2137 [Escherichia coli TW10119]
gi|420308899|ref|ZP_14810859.1| hypothetical protein ECEC1738_1727 [Escherichia coli EC1738]
gi|420314430|ref|ZP_14816328.1| cupin superfamily protein [Escherichia coli EC1734]
gi|421811581|ref|ZP_16247359.1| hypothetical protein EC80416_1389 [Escherichia coli 8.0416]
gi|421817695|ref|ZP_16253239.1| cupin superfamily protein [Escherichia coli 10.0821]
gi|421823270|ref|ZP_16258689.1| hypothetical protein ECFRIK920_1700 [Escherichia coli FRIK920]
gi|421830049|ref|ZP_16265367.1| hypothetical protein ECPA7_2204 [Escherichia coli PA7]
gi|423684669|ref|ZP_17659503.1| hypothetical protein ECPA31_1534 [Escherichia coli PA31]
gi|424076562|ref|ZP_17813768.1| hypothetical protein ECFDA505_1662 [Escherichia coli FDA505]
gi|424082925|ref|ZP_17819639.1| hypothetical protein ECFDA517_1890 [Escherichia coli FDA517]
gi|424089470|ref|ZP_17825605.1| hypothetical protein ECFRIK1996_1763 [Escherichia coli FRIK1996]
gi|424095811|ref|ZP_17831405.1| hypothetical protein ECFRIK1985_1752 [Escherichia coli FRIK1985]
gi|424102175|ref|ZP_17837201.1| hypothetical protein ECFRIK1990_1740 [Escherichia coli FRIK1990]
gi|424108962|ref|ZP_17843407.1| hypothetical protein EC93001_1806 [Escherichia coli 93-001]
gi|424114682|ref|ZP_17848752.1| hypothetical protein ECPA3_1596 [Escherichia coli PA3]
gi|424121005|ref|ZP_17854567.1| hypothetical protein ECPA5_1625 [Escherichia coli PA5]
gi|424127180|ref|ZP_17860273.1| hypothetical protein ECPA9_1774 [Escherichia coli PA9]
gi|424133345|ref|ZP_17866017.1| hypothetical protein ECPA10_1771 [Escherichia coli PA10]
gi|424139945|ref|ZP_17872073.1| hypothetical protein ECPA14_1718 [Escherichia coli PA14]
gi|424146400|ref|ZP_17877997.1| hypothetical protein ECPA15_1865 [Escherichia coli PA15]
gi|424152439|ref|ZP_17883562.1| hypothetical protein ECPA24_1622 [Escherichia coli PA24]
gi|424209908|ref|ZP_17888998.1| hypothetical protein ECPA25_1468 [Escherichia coli PA25]
gi|424292474|ref|ZP_17894896.1| hypothetical protein ECPA28_1803 [Escherichia coli PA28]
gi|424438258|ref|ZP_17900665.1| hypothetical protein ECPA32_1677 [Escherichia coli PA32]
gi|424454891|ref|ZP_17906277.1| hypothetical protein ECPA33_1665 [Escherichia coli PA33]
gi|424461229|ref|ZP_17911973.1| hypothetical protein ECPA39_1698 [Escherichia coli PA39]
gi|424467689|ref|ZP_17917734.1| hypothetical protein ECPA41_1732 [Escherichia coli PA41]
gi|424474230|ref|ZP_17923766.1| hypothetical protein ECPA42_1841 [Escherichia coli PA42]
gi|424480077|ref|ZP_17929224.1| hypothetical protein ECTW07945_1727 [Escherichia coli TW07945]
gi|424486186|ref|ZP_17934931.1| hypothetical protein ECTW09098_1741 [Escherichia coli TW09098]
gi|424492467|ref|ZP_17940607.1| hypothetical protein ECTW09195_1720 [Escherichia coli TW09195]
gi|424499415|ref|ZP_17946564.1| hypothetical protein ECEC4203_1670 [Escherichia coli EC4203]
gi|424505551|ref|ZP_17952209.1| hypothetical protein ECEC4196_1587 [Escherichia coli EC4196]
gi|424511941|ref|ZP_17958000.1| hypothetical protein ECTW14313_1630 [Escherichia coli TW14313]
gi|424519405|ref|ZP_17963722.1| hypothetical protein ECTW14301_1600 [Escherichia coli TW14301]
gi|424525311|ref|ZP_17969216.1| hypothetical protein ECEC4421_1678 [Escherichia coli EC4421]
gi|424531483|ref|ZP_17975006.1| hypothetical protein ECEC4422_1805 [Escherichia coli EC4422]
gi|424537455|ref|ZP_17980595.1| hypothetical protein ECEC4013_1858 [Escherichia coli EC4013]
gi|424543416|ref|ZP_17986084.1| hypothetical protein ECEC4402_1673 [Escherichia coli EC4402]
gi|424549725|ref|ZP_17991792.1| hypothetical protein ECEC4439_1659 [Escherichia coli EC4439]
gi|424555953|ref|ZP_17997551.1| hypothetical protein ECEC4436_1622 [Escherichia coli EC4436]
gi|424562293|ref|ZP_18003470.1| hypothetical protein ECEC4437_1754 [Escherichia coli EC4437]
gi|424568371|ref|ZP_18009151.1| hypothetical protein ECEC4448_1671 [Escherichia coli EC4448]
gi|424574517|ref|ZP_18014815.1| hypothetical protein ECEC1845_1636 [Escherichia coli EC1845]
gi|424580420|ref|ZP_18020254.1| hypothetical protein ECEC1863_1404 [Escherichia coli EC1863]
gi|425097136|ref|ZP_18500054.1| cupin superfamily protein [Escherichia coli 3.4870]
gi|425103325|ref|ZP_18505837.1| cupin superfamily protein [Escherichia coli 5.2239]
gi|425109145|ref|ZP_18511281.1| hypothetical protein EC60172_1841 [Escherichia coli 6.0172]
gi|425124902|ref|ZP_18526338.1| cupin superfamily protein [Escherichia coli 8.0586]
gi|425130916|ref|ZP_18531917.1| cupin superfamily protein [Escherichia coli 8.2524]
gi|425137309|ref|ZP_18537920.1| hypothetical protein EC100833_1902 [Escherichia coli 10.0833]
gi|425143128|ref|ZP_18543314.1| cupin superfamily protein [Escherichia coli 10.0869]
gi|425149310|ref|ZP_18549104.1| cupin superfamily protein [Escherichia coli 88.0221]
gi|425155095|ref|ZP_18554537.1| hypothetical protein ECPA34_1783 [Escherichia coli PA34]
gi|425161592|ref|ZP_18560644.1| hypothetical protein ECFDA506_2127 [Escherichia coli FDA506]
gi|425167158|ref|ZP_18565840.1| hypothetical protein ECFDA507_1716 [Escherichia coli FDA507]
gi|425173251|ref|ZP_18571575.1| hypothetical protein ECFDA504_1691 [Escherichia coli FDA504]
gi|425179274|ref|ZP_18577202.1| hypothetical protein ECFRIK1999_1867 [Escherichia coli FRIK1999]
gi|425185491|ref|ZP_18582981.1| hypothetical protein ECFRIK1997_1863 [Escherichia coli FRIK1997]
gi|425192249|ref|ZP_18589257.1| hypothetical protein ECNE1487_2017 [Escherichia coli NE1487]
gi|425198603|ref|ZP_18595112.1| hypothetical protein ECNE037_1933 [Escherichia coli NE037]
gi|425205241|ref|ZP_18601253.1| hypothetical protein ECFRIK2001_2141 [Escherichia coli FRIK2001]
gi|425210937|ref|ZP_18606552.1| hypothetical protein ECPA4_1823 [Escherichia coli PA4]
gi|425217003|ref|ZP_18612185.1| hypothetical protein ECPA23_1646 [Escherichia coli PA23]
gi|425223593|ref|ZP_18618299.1| hypothetical protein ECPA49_1833 [Escherichia coli PA49]
gi|425229781|ref|ZP_18624055.1| hypothetical protein ECPA45_1810 [Escherichia coli PA45]
gi|425235939|ref|ZP_18629831.1| hypothetical protein ECTT12B_1698 [Escherichia coli TT12B]
gi|425242079|ref|ZP_18635592.1| hypothetical protein ECMA6_1932 [Escherichia coli MA6]
gi|425248146|ref|ZP_18641233.1| hypothetical protein EC5905_1864 [Escherichia coli 5905]
gi|425253942|ref|ZP_18646685.1| hypothetical protein ECCB7326_1676 [Escherichia coli CB7326]
gi|425260158|ref|ZP_18652412.1| hypothetical protein ECEC96038_1549 [Escherichia coli EC96038]
gi|425266317|ref|ZP_18658109.1| hypothetical protein EC5412_1682 [Escherichia coli 5412]
gi|425293811|ref|ZP_18684243.1| hypothetical protein ECPA38_1677 [Escherichia coli PA38]
gi|425310473|ref|ZP_18699832.1| hypothetical protein ECEC1735_1710 [Escherichia coli EC1735]
gi|425316402|ref|ZP_18705363.1| hypothetical protein ECEC1736_1604 [Escherichia coli EC1736]
gi|425322506|ref|ZP_18711049.1| hypothetical protein ECEC1737_1611 [Escherichia coli EC1737]
gi|425328706|ref|ZP_18716799.1| hypothetical protein ECEC1846_1640 [Escherichia coli EC1846]
gi|425334885|ref|ZP_18722481.1| hypothetical protein ECEC1847_1644 [Escherichia coli EC1847]
gi|425341293|ref|ZP_18728390.1| hypothetical protein ECEC1848_1828 [Escherichia coli EC1848]
gi|425347153|ref|ZP_18733838.1| hypothetical protein ECEC1849_1617 [Escherichia coli EC1849]
gi|425353394|ref|ZP_18739653.1| hypothetical protein ECEC1850_1807 [Escherichia coli EC1850]
gi|425359374|ref|ZP_18745222.1| hypothetical protein ECEC1856_1643 [Escherichia coli EC1856]
gi|425365514|ref|ZP_18750917.1| hypothetical protein ECEC1862_1640 [Escherichia coli EC1862]
gi|425371937|ref|ZP_18756775.1| hypothetical protein ECEC1864_1808 [Escherichia coli EC1864]
gi|425384745|ref|ZP_18768494.1| hypothetical protein ECEC1866_1481 [Escherichia coli EC1866]
gi|425391445|ref|ZP_18774773.1| hypothetical protein ECEC1868_1825 [Escherichia coli EC1868]
gi|425397564|ref|ZP_18780474.1| hypothetical protein ECEC1869_1800 [Escherichia coli EC1869]
gi|425403542|ref|ZP_18786011.1| hypothetical protein ECEC1870_1493 [Escherichia coli EC1870]
gi|425410074|ref|ZP_18792092.1| hypothetical protein ECNE098_1819 [Escherichia coli NE098]
gi|425416429|ref|ZP_18797913.1| hypothetical protein ECFRIK523_1705 [Escherichia coli FRIK523]
gi|425427547|ref|ZP_18808442.1| hypothetical protein EC01304_1737 [Escherichia coli 0.1304]
gi|428946109|ref|ZP_19018620.1| cupin superfamily protein [Escherichia coli 88.1467]
gi|428952299|ref|ZP_19024292.1| cupin superfamily protein [Escherichia coli 88.1042]
gi|428958084|ref|ZP_19029674.1| cupin superfamily protein [Escherichia coli 89.0511]
gi|428964599|ref|ZP_19035623.1| cupin superfamily protein [Escherichia coli 90.0091]
gi|428976894|ref|ZP_19046972.1| cupin superfamily protein [Escherichia coli 90.2281]
gi|428982666|ref|ZP_19052309.1| cupin superfamily protein [Escherichia coli 93.0055]
gi|428989095|ref|ZP_19058261.1| cupin superfamily protein [Escherichia coli 93.0056]
gi|428994838|ref|ZP_19063652.1| cupin superfamily protein [Escherichia coli 94.0618]
gi|429001083|ref|ZP_19069448.1| cupin superfamily protein [Escherichia coli 95.0183]
gi|429007232|ref|ZP_19074985.1| cupin superfamily protein [Escherichia coli 95.1288]
gi|429013676|ref|ZP_19080777.1| cupin superfamily protein [Escherichia coli 95.0943]
gi|429019589|ref|ZP_19086288.1| cupin superfamily protein [Escherichia coli 96.0428]
gi|429025367|ref|ZP_19091682.1| cupin superfamily protein [Escherichia coli 96.0427]
gi|429031691|ref|ZP_19097443.1| cupin superfamily protein [Escherichia coli 96.0939]
gi|429037854|ref|ZP_19103172.1| cupin superfamily protein [Escherichia coli 96.0932]
gi|429043773|ref|ZP_19108687.1| cupin superfamily protein [Escherichia coli 96.0107]
gi|429049505|ref|ZP_19114140.1| cupin superfamily protein [Escherichia coli 97.0003]
gi|429054826|ref|ZP_19119267.1| cupin superfamily protein [Escherichia coli 97.1742]
gi|429060502|ref|ZP_19124608.1| cupin superfamily protein [Escherichia coli 97.0007]
gi|429066270|ref|ZP_19129984.1| cupin superfamily protein [Escherichia coli 99.0672]
gi|429072537|ref|ZP_19135860.1| hypothetical protein EC990678_1670 [Escherichia coli 99.0678]
gi|429077898|ref|ZP_19141088.1| cupin superfamily protein [Escherichia coli 99.0713]
gi|429825427|ref|ZP_19356738.1| cupin superfamily protein [Escherichia coli 96.0109]
gi|429831732|ref|ZP_19362387.1| cupin superfamily protein [Escherichia coli 97.0010]
gi|444924016|ref|ZP_21243562.1| cupin superfamily protein [Escherichia coli 09BKT078844]
gi|444930102|ref|ZP_21249221.1| cupin superfamily protein [Escherichia coli 99.0814]
gi|444935351|ref|ZP_21254238.1| cupin superfamily protein [Escherichia coli 99.0815]
gi|444941000|ref|ZP_21259607.1| cupin superfamily protein [Escherichia coli 99.0816]
gi|444946604|ref|ZP_21264987.1| cupin superfamily protein [Escherichia coli 99.0839]
gi|444952204|ref|ZP_21270381.1| cupin superfamily protein [Escherichia coli 99.0848]
gi|444957645|ref|ZP_21275592.1| cupin superfamily protein [Escherichia coli 99.1753]
gi|444962920|ref|ZP_21280623.1| cupin superfamily protein [Escherichia coli 99.1775]
gi|444968632|ref|ZP_21286075.1| cupin superfamily protein [Escherichia coli 99.1793]
gi|444974100|ref|ZP_21291331.1| cupin superfamily protein [Escherichia coli 99.1805]
gi|444979277|ref|ZP_21296262.1| cupin superfamily protein [Escherichia coli ATCC 700728]
gi|444984957|ref|ZP_21301795.1| cupin superfamily protein [Escherichia coli PA11]
gi|444990220|ref|ZP_21306931.1| cupin superfamily protein [Escherichia coli PA19]
gi|444995427|ref|ZP_21311996.1| cupin superfamily protein [Escherichia coli PA13]
gi|445001071|ref|ZP_21317511.1| cupin superfamily protein [Escherichia coli PA2]
gi|445006526|ref|ZP_21322834.1| cupin superfamily protein [Escherichia coli PA47]
gi|445011617|ref|ZP_21327783.1| cupin superfamily protein [Escherichia coli PA48]
gi|445017578|ref|ZP_21333588.1| cupin superfamily protein [Escherichia coli PA8]
gi|445023158|ref|ZP_21339038.1| cupin superfamily protein [Escherichia coli 7.1982]
gi|445028101|ref|ZP_21343849.1| cupin superfamily protein [Escherichia coli 99.1781]
gi|445033677|ref|ZP_21349270.1| cupin superfamily protein [Escherichia coli 99.1762]
gi|445039379|ref|ZP_21354815.1| cupin superfamily protein [Escherichia coli PA35]
gi|445044625|ref|ZP_21359938.1| cupin superfamily protein [Escherichia coli 3.4880]
gi|445050181|ref|ZP_21365311.1| cupin superfamily protein [Escherichia coli 95.0083]
gi|445055884|ref|ZP_21370803.1| cupin superfamily protein [Escherichia coli 99.0670]
gi|452967568|ref|ZP_21965795.1| hypothetical protein EC4009_RS02550 [Escherichia coli O157:H7 str.
EC4009]
gi|187768477|gb|EDU32321.1| cupin family protein [Escherichia coli O157:H7 str. EC4196]
gi|188016012|gb|EDU54134.1| cupin family protein [Escherichia coli O157:H7 str. EC4113]
gi|188999926|gb|EDU68912.1| cupin family protein [Escherichia coli O157:H7 str. EC4076]
gi|189356328|gb|EDU74747.1| cupin family protein [Escherichia coli O157:H7 str. EC4401]
gi|189360179|gb|EDU78598.1| cupin family protein [Escherichia coli O157:H7 str. EC4486]
gi|189367200|gb|EDU85616.1| cupin family protein [Escherichia coli O157:H7 str. EC4501]
gi|189372572|gb|EDU90988.1| cupin family protein [Escherichia coli O157:H7 str. EC869]
gi|189377213|gb|EDU95629.1| cupin family protein [Escherichia coli O157:H7 str. EC508]
gi|208729305|gb|EDZ78906.1| cupin family protein [Escherichia coli O157:H7 str. EC4206]
gi|208732471|gb|EDZ81159.1| cupin family protein [Escherichia coli O157:H7 str. EC4045]
gi|208738115|gb|EDZ85798.1| cupin family protein [Escherichia coli O157:H7 str. EC4042]
gi|209157197|gb|ACI34630.1| cupin family protein [Escherichia coli O157:H7 str. EC4115]
gi|217318427|gb|EEC26853.1| cupin family protein [Escherichia coli O157:H7 str. TW14588]
gi|254591968|gb|ACT71329.1| conserved protein [Escherichia coli O157:H7 str. TW14359]
gi|320187837|gb|EFW62507.1| hypothetical protein ECoD_05544 [Escherichia coli O157:H7 str.
EC1212]
gi|320637578|gb|EFX07378.1| hypothetical protein ECO5101_10737 [Escherichia coli O157:H7 str.
G5101]
gi|320643138|gb|EFX12339.1| hypothetical protein ECO9389_04211 [Escherichia coli O157:H- str.
493-89]
gi|320648596|gb|EFX17251.1| hypothetical protein ECO2687_20326 [Escherichia coli O157:H- str. H
2687]
gi|320653910|gb|EFX21984.1| hypothetical protein ECO7815_16663 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
gi|320659391|gb|EFX26960.1| hypothetical protein ECO5905_20143 [Escherichia coli O55:H7 str.
USDA 5905]
gi|326339383|gb|EGD63195.1| cupin family protein [Escherichia coli O157:H7 str. 1044]
gi|326341488|gb|EGD65279.1| hypothetical protein ECF_03346 [Escherichia coli O157:H7 str. 1125]
gi|374358249|gb|AEZ39956.1| hypothetical protein ECO55CA74_06765 [Escherichia coli O55:H7 str.
RM12579]
gi|377898661|gb|EHU63020.1| cupin superfamily protein [Escherichia coli DEC3A]
gi|377899127|gb|EHU63479.1| cupin superfamily protein [Escherichia coli DEC3B]
gi|377909709|gb|EHU73908.1| cupin superfamily protein [Escherichia coli DEC3C]
gi|377914957|gb|EHU79071.1| cupin superfamily protein [Escherichia coli DEC3D]
gi|377917065|gb|EHU81133.1| cupin superfamily protein [Escherichia coli DEC3E]
gi|377928060|gb|EHU91972.1| cupin superfamily protein [Escherichia coli DEC3F]
gi|377931609|gb|EHU95469.1| cupin superfamily protein [Escherichia coli DEC4A]
gi|377935412|gb|EHU99210.1| cupin superfamily protein [Escherichia coli DEC4B]
gi|377947794|gb|EHV11452.1| cupin superfamily protein [Escherichia coli DEC4C]
gi|377947896|gb|EHV11553.1| cupin superfamily protein [Escherichia coli DEC4D]
gi|377952349|gb|EHV15938.1| cupin superfamily protein [Escherichia coli DEC4E]
gi|377962981|gb|EHV26432.1| cupin superfamily protein [Escherichia coli DEC4F]
gi|377964788|gb|EHV28223.1| cupin superfamily protein [Escherichia coli DEC5A]
gi|377971372|gb|EHV34729.1| cupin superfamily protein [Escherichia coli DEC5B]
gi|377977817|gb|EHV41100.1| cupin superfamily protein [Escherichia coli DEC5C]
gi|377979545|gb|EHV42822.1| cupin superfamily protein [Escherichia coli DEC5D]
gi|377986198|gb|EHV49395.1| cupin superfamily protein [Escherichia coli DEC5E]
gi|386795530|gb|AFJ28564.1| hypothetical protein CDCO157_1505 [Escherichia coli Xuzhou21]
gi|390648000|gb|EIN26819.1| hypothetical protein ECFRIK1996_1763 [Escherichia coli FRIK1996]
gi|390648505|gb|EIN27198.1| hypothetical protein ECFDA517_1890 [Escherichia coli FDA517]
gi|390649074|gb|EIN27689.1| hypothetical protein ECFDA505_1662 [Escherichia coli FDA505]
gi|390666784|gb|EIN43872.1| hypothetical protein EC93001_1806 [Escherichia coli 93-001]
gi|390668814|gb|EIN45563.1| hypothetical protein ECFRIK1985_1752 [Escherichia coli FRIK1985]
gi|390668936|gb|EIN45652.1| hypothetical protein ECFRIK1990_1740 [Escherichia coli FRIK1990]
gi|390686437|gb|EIN61795.1| hypothetical protein ECPA3_1596 [Escherichia coli PA3]
gi|390688004|gb|EIN63140.1| hypothetical protein ECPA5_1625 [Escherichia coli PA5]
gi|390688275|gb|EIN63355.1| hypothetical protein ECPA9_1774 [Escherichia coli PA9]
gi|390705250|gb|EIN79004.1| hypothetical protein ECPA10_1771 [Escherichia coli PA10]
gi|390706005|gb|EIN79627.1| hypothetical protein ECPA15_1865 [Escherichia coli PA15]
gi|390706776|gb|EIN80280.1| hypothetical protein ECPA14_1718 [Escherichia coli PA14]
gi|390718103|gb|EIN90866.1| hypothetical protein ECPA22_1679 [Escherichia coli PA22]
gi|390730188|gb|EIO02249.1| hypothetical protein ECPA24_1622 [Escherichia coli PA24]
gi|390730471|gb|EIO02486.1| hypothetical protein ECPA25_1468 [Escherichia coli PA25]
gi|390732152|gb|EIO03884.1| hypothetical protein ECPA28_1803 [Escherichia coli PA28]
gi|390748799|gb|EIO19145.1| hypothetical protein ECPA32_1677 [Escherichia coli PA32]
gi|390748914|gb|EIO19228.1| hypothetical protein ECPA31_1534 [Escherichia coli PA31]
gi|390750571|gb|EIO20610.1| hypothetical protein ECPA33_1665 [Escherichia coli PA33]
gi|390760387|gb|EIO29716.1| hypothetical protein ECPA40_1935 [Escherichia coli PA40]
gi|390772971|gb|EIO41452.1| hypothetical protein ECPA41_1732 [Escherichia coli PA41]
gi|390774229|gb|EIO42499.1| hypothetical protein ECPA42_1841 [Escherichia coli PA42]
gi|390774908|gb|EIO43018.1| hypothetical protein ECPA39_1698 [Escherichia coli PA39]
gi|390784966|gb|EIO52522.1| hypothetical protein ECTW06591_0636 [Escherichia coli TW06591]
gi|390792774|gb|EIO60122.1| hypothetical protein ECTW10246_1732 [Escherichia coli TW10246]
gi|390801524|gb|EIO68577.1| hypothetical protein ECTW07945_1727 [Escherichia coli TW07945]
gi|390810140|gb|EIO76910.1| hypothetical protein ECTW09109_1830 [Escherichia coli TW09109]
gi|390816501|gb|EIO82985.1| hypothetical protein ECTW09098_1741 [Escherichia coli TW09098]
gi|390817756|gb|EIO84171.1| hypothetical protein ECTW10119_2137 [Escherichia coli TW10119]
gi|390834069|gb|EIO99040.1| hypothetical protein ECEC4203_1670 [Escherichia coli EC4203]
gi|390835801|gb|EIP00483.1| hypothetical protein ECTW09195_1720 [Escherichia coli TW09195]
gi|390837326|gb|EIP01752.1| hypothetical protein ECEC4196_1587 [Escherichia coli EC4196]
gi|390853083|gb|EIP16164.1| hypothetical protein ECTW14301_1600 [Escherichia coli TW14301]
gi|390854280|gb|EIP17161.1| hypothetical protein ECTW14313_1630 [Escherichia coli TW14313]
gi|390855095|gb|EIP17845.1| hypothetical protein ECEC4421_1678 [Escherichia coli EC4421]
gi|390867539|gb|EIP29335.1| hypothetical protein ECEC4422_1805 [Escherichia coli EC4422]
gi|390871640|gb|EIP33029.1| hypothetical protein ECEC4013_1858 [Escherichia coli EC4013]
gi|390880448|gb|EIP41127.1| hypothetical protein ECEC4402_1673 [Escherichia coli EC4402]
gi|390883853|gb|EIP44247.1| hypothetical protein ECEC4439_1659 [Escherichia coli EC4439]
gi|390888463|gb|EIP48329.1| hypothetical protein ECEC4436_1622 [Escherichia coli EC4436]
gi|390902420|gb|EIP61516.1| hypothetical protein ECEC1738_1727 [Escherichia coli EC1738]
gi|390903202|gb|EIP62258.1| hypothetical protein ECEC4437_1754 [Escherichia coli EC4437]
gi|390906675|gb|EIP65559.1| hypothetical protein ECEC4448_1671 [Escherichia coli EC4448]
gi|390910434|gb|EIP69171.1| cupin superfamily protein [Escherichia coli EC1734]
gi|390922948|gb|EIP80933.1| hypothetical protein ECEC1863_1404 [Escherichia coli EC1863]
gi|390923960|gb|EIP81789.1| hypothetical protein ECEC1845_1636 [Escherichia coli EC1845]
gi|408069967|gb|EKH04345.1| hypothetical protein ECPA7_2204 [Escherichia coli PA7]
gi|408074119|gb|EKH08408.1| hypothetical protein ECFRIK920_1700 [Escherichia coli FRIK920]
gi|408083026|gb|EKH16945.1| hypothetical protein ECPA34_1783 [Escherichia coli PA34]
gi|408085188|gb|EKH18847.1| hypothetical protein ECFDA506_2127 [Escherichia coli FDA506]
gi|408088693|gb|EKH22041.1| hypothetical protein ECFDA507_1716 [Escherichia coli FDA507]
gi|408099539|gb|EKH32184.1| hypothetical protein ECFDA504_1691 [Escherichia coli FDA504]
gi|408104889|gb|EKH37115.1| hypothetical protein ECFRIK1999_1867 [Escherichia coli FRIK1999]
gi|408111796|gb|EKH43503.1| hypothetical protein ECFRIK1997_1863 [Escherichia coli FRIK1997]
gi|408116673|gb|EKH47954.1| hypothetical protein ECNE1487_2017 [Escherichia coli NE1487]
gi|408124753|gb|EKH55401.1| hypothetical protein ECNE037_1933 [Escherichia coli NE037]
gi|408128334|gb|EKH58714.1| hypothetical protein ECFRIK2001_2141 [Escherichia coli FRIK2001]
gi|408134363|gb|EKH64195.1| hypothetical protein ECPA4_1823 [Escherichia coli PA4]
gi|408146833|gb|EKH75892.1| hypothetical protein ECPA23_1646 [Escherichia coli PA23]
gi|408147600|gb|EKH76534.1| hypothetical protein ECPA49_1833 [Escherichia coli PA49]
gi|408151867|gb|EKH80334.1| hypothetical protein ECPA45_1810 [Escherichia coli PA45]
gi|408164269|gb|EKH92082.1| hypothetical protein ECTT12B_1698 [Escherichia coli TT12B]
gi|408167730|gb|EKH95213.1| hypothetical protein ECMA6_1932 [Escherichia coli MA6]
gi|408168862|gb|EKH96222.1| hypothetical protein EC5905_1864 [Escherichia coli 5905]
gi|408183320|gb|EKI09770.1| hypothetical protein ECCB7326_1676 [Escherichia coli CB7326]
gi|408187291|gb|EKI13261.1| hypothetical protein EC5412_1682 [Escherichia coli 5412]
gi|408187767|gb|EKI13688.1| hypothetical protein ECEC96038_1549 [Escherichia coli EC96038]
gi|408226532|gb|EKI50168.1| hypothetical protein ECPA38_1677 [Escherichia coli PA38]
gi|408233748|gb|EKI56824.1| hypothetical protein ECEC1735_1710 [Escherichia coli EC1735]
gi|408245840|gb|EKI68189.1| hypothetical protein ECEC1736_1604 [Escherichia coli EC1736]
gi|408248704|gb|EKI70710.1| hypothetical protein ECEC1737_1611 [Escherichia coli EC1737]
gi|408254192|gb|EKI75735.1| hypothetical protein ECEC1846_1640 [Escherichia coli EC1846]
gi|408264409|gb|EKI85208.1| hypothetical protein ECEC1847_1644 [Escherichia coli EC1847]
gi|408266503|gb|EKI87098.1| hypothetical protein ECEC1848_1828 [Escherichia coli EC1848]
gi|408272547|gb|EKI92627.1| hypothetical protein ECEC1849_1617 [Escherichia coli EC1849]
gi|408281192|gb|EKJ00624.1| hypothetical protein ECEC1850_1807 [Escherichia coli EC1850]
gi|408283054|gb|EKJ02282.1| hypothetical protein ECEC1856_1643 [Escherichia coli EC1856]
gi|408296802|gb|EKJ14990.1| hypothetical protein ECEC1862_1640 [Escherichia coli EC1862]
gi|408297399|gb|EKJ15495.1| hypothetical protein ECEC1864_1808 [Escherichia coli EC1864]
gi|408313492|gb|EKJ30117.1| hypothetical protein ECEC1868_1825 [Escherichia coli EC1868]
gi|408313544|gb|EKJ30161.1| hypothetical protein ECEC1866_1481 [Escherichia coli EC1866]
gi|408327838|gb|EKJ43471.1| hypothetical protein ECEC1869_1800 [Escherichia coli EC1869]
gi|408331817|gb|EKJ46956.1| hypothetical protein ECNE098_1819 [Escherichia coli NE098]
gi|408332241|gb|EKJ47301.1| hypothetical protein ECEC1870_1493 [Escherichia coli EC1870]
gi|408346856|gb|EKJ61120.1| hypothetical protein ECFRIK523_1705 [Escherichia coli FRIK523]
gi|408351818|gb|EKJ65519.1| hypothetical protein EC01304_1737 [Escherichia coli 0.1304]
gi|408555303|gb|EKK32114.1| cupin superfamily protein [Escherichia coli 5.2239]
gi|408555897|gb|EKK32627.1| cupin superfamily protein [Escherichia coli 3.4870]
gi|408556083|gb|EKK32777.1| hypothetical protein EC60172_1841 [Escherichia coli 6.0172]
gi|408581130|gb|EKK56484.1| cupin superfamily protein [Escherichia coli 8.0586]
gi|408585815|gb|EKK60642.1| cupin superfamily protein [Escherichia coli 8.2524]
gi|408586043|gb|EKK60827.1| hypothetical protein EC100833_1902 [Escherichia coli 10.0833]
gi|408599403|gb|EKK73313.1| cupin superfamily protein [Escherichia coli 10.0869]
gi|408603662|gb|EKK77294.1| cupin superfamily protein [Escherichia coli 88.0221]
gi|408603915|gb|EKK77518.1| hypothetical protein EC80416_1389 [Escherichia coli 8.0416]
gi|408615568|gb|EKK88760.1| cupin superfamily protein [Escherichia coli 10.0821]
gi|427211550|gb|EKV81296.1| cupin superfamily protein [Escherichia coli 88.1042]
gi|427213076|gb|EKV82551.1| cupin superfamily protein [Escherichia coli 88.1467]
gi|427213573|gb|EKV82980.1| cupin superfamily protein [Escherichia coli 89.0511]
gi|427231045|gb|EKV99122.1| cupin superfamily protein [Escherichia coli 90.0091]
gi|427231157|gb|EKV99213.1| cupin superfamily protein [Escherichia coli 90.2281]
gi|427248462|gb|EKW15436.1| cupin superfamily protein [Escherichia coli 93.0056]
gi|427249833|gb|EKW16581.1| cupin superfamily protein [Escherichia coli 93.0055]
gi|427250704|gb|EKW17360.1| cupin superfamily protein [Escherichia coli 94.0618]
gi|427267119|gb|EKW32483.1| cupin superfamily protein [Escherichia coli 95.0943]
gi|427267289|gb|EKW32617.1| cupin superfamily protein [Escherichia coli 95.0183]
gi|427269155|gb|EKW34151.1| cupin superfamily protein [Escherichia coli 95.1288]
gi|427284073|gb|EKW48199.1| cupin superfamily protein [Escherichia coli 96.0428]
gi|427288343|gb|EKW51985.1| cupin superfamily protein [Escherichia coli 96.0427]
gi|427289118|gb|EKW52706.1| cupin superfamily protein [Escherichia coli 96.0939]
gi|427301083|gb|EKW63974.1| cupin superfamily protein [Escherichia coli 96.0932]
gi|427303253|gb|EKW65982.1| cupin superfamily protein [Escherichia coli 97.0003]
gi|427307089|gb|EKW69574.1| cupin superfamily protein [Escherichia coli 96.0107]
gi|427318816|gb|EKW80670.1| cupin superfamily protein [Escherichia coli 97.1742]
gi|427320255|gb|EKW82031.1| cupin superfamily protein [Escherichia coli 97.0007]
gi|427331043|gb|EKW92288.1| cupin superfamily protein [Escherichia coli 99.0672]
gi|427332006|gb|EKW93183.1| cupin superfamily protein [Escherichia coli 99.0713]
gi|427332056|gb|EKW93232.1| hypothetical protein EC990678_1670 [Escherichia coli 99.0678]
gi|429257652|gb|EKY41666.1| cupin superfamily protein [Escherichia coli 96.0109]
gi|429259051|gb|EKY42797.1| cupin superfamily protein [Escherichia coli 97.0010]
gi|444541225|gb|ELV20772.1| cupin superfamily protein [Escherichia coli 99.0814]
gi|444546042|gb|ELV24820.1| cupin superfamily protein [Escherichia coli 09BKT078844]
gi|444550103|gb|ELV28234.1| cupin superfamily protein [Escherichia coli 99.0815]
gi|444562328|gb|ELV39398.1| cupin superfamily protein [Escherichia coli 99.0839]
gi|444564111|gb|ELV41070.1| cupin superfamily protein [Escherichia coli 99.0816]
gi|444568178|gb|ELV44869.1| cupin superfamily protein [Escherichia coli 99.0848]
gi|444578740|gb|ELV54781.1| cupin superfamily protein [Escherichia coli 99.1753]
gi|444582275|gb|ELV58076.1| cupin superfamily protein [Escherichia coli 99.1775]
gi|444583924|gb|ELV59604.1| cupin superfamily protein [Escherichia coli 99.1793]
gi|444597453|gb|ELV72447.1| cupin superfamily protein [Escherichia coli PA11]
gi|444598867|gb|ELV73780.1| cupin superfamily protein [Escherichia coli ATCC 700728]
gi|444602639|gb|ELV77378.1| cupin superfamily protein [Escherichia coli 99.1805]
gi|444611342|gb|ELV85682.1| cupin superfamily protein [Escherichia coli PA19]
gi|444611386|gb|ELV85722.1| cupin superfamily protein [Escherichia coli PA13]
gi|444619883|gb|ELV93908.1| cupin superfamily protein [Escherichia coli PA2]
gi|444629472|gb|ELW03169.1| cupin superfamily protein [Escherichia coli PA47]
gi|444629674|gb|ELW03357.1| cupin superfamily protein [Escherichia coli PA48]
gi|444633529|gb|ELW07050.1| cupin superfamily protein [Escherichia coli PA8]
gi|444643469|gb|ELW16620.1| cupin superfamily protein [Escherichia coli 7.1982]
gi|444647120|gb|ELW20101.1| cupin superfamily protein [Escherichia coli 99.1781]
gi|444650310|gb|ELW23151.1| cupin superfamily protein [Escherichia coli 99.1762]
gi|444659700|gb|ELW32104.1| cupin superfamily protein [Escherichia coli PA35]
gi|444664860|gb|ELW37022.1| cupin superfamily protein [Escherichia coli 3.4880]
gi|444670531|gb|ELW42417.1| cupin superfamily protein [Escherichia coli 95.0083]
gi|444672728|gb|ELW44416.1| cupin superfamily protein [Escherichia coli 99.0670]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|425304659|ref|ZP_18694421.1| hypothetical protein ECN1_1102 [Escherichia coli N1]
gi|408230791|gb|EKI54150.1| hypothetical protein ECN1_1102 [Escherichia coli N1]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|420367270|ref|ZP_14868067.1| cupin superfamily protein [Shigella flexneri 1235-66]
gi|391323472|gb|EIQ80123.1| cupin superfamily protein [Shigella flexneri 1235-66]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPLSPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQLKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|416336251|ref|ZP_11672899.1| hypothetical protein EcoM_02298 [Escherichia coli WV_060327]
gi|320195869|gb|EFW70494.1| hypothetical protein EcoM_02298 [Escherichia coli WV_060327]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|300896843|ref|ZP_07115338.1| cupin family protein [Escherichia coli MS 198-1]
gi|300359325|gb|EFJ75195.1| cupin family protein [Escherichia coli MS 198-1]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|424743477|ref|ZP_18171787.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
WC-141]
gi|422943311|gb|EKU38333.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
WC-141]
Length = 299
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/141 (24%), Positives = 67/141 (47%), Gaps = 4/141 (2%)
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT-PPTAQ 115
+ + I++ ++G ++ Q +I + + Q+ F C + AN Y +
Sbjct: 80 IHTKMRKDIIYSMIKDGGTLVIDRCQEFIPNINNICQIFQQYFRCPISANLYYCYKKDSS 139
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
F H+DD + +QL G KKW + P + LP + S ++ + P + + +
Sbjct: 140 SFGMHFDDHDVISIQLFGDKKWSISSPTFKNP-LPHHKSFHYDKPCENFPEICIHNKING 198
Query: 176 LLYLPRGYIH--QASTVTNEH 194
L+++PRGY H +A++ +N H
Sbjct: 199 LVFIPRGYWHLVEATSSSNVH 219
>gi|365106156|ref|ZP_09335034.1| hypothetical protein HMPREF9428_00903 [Citrobacter freundii
4_7_47CFAA]
gi|395227824|ref|ZP_10406150.1| cupin family protein [Citrobacter sp. A1]
gi|421847783|ref|ZP_16280915.1| hypothetical protein D186_22101 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|424728473|ref|ZP_18157078.1| cupin family protein [Citrobacter sp. L17]
gi|363642555|gb|EHL81904.1| hypothetical protein HMPREF9428_00903 [Citrobacter freundii
4_7_47CFAA]
gi|394719152|gb|EJF24773.1| cupin family protein [Citrobacter sp. A1]
gi|411770852|gb|EKS54590.1| hypothetical protein D186_22101 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|422896344|gb|EKU36126.1| cupin family protein [Citrobacter sp. L17]
gi|455642176|gb|EMF21342.1| hypothetical protein H262_17086 [Citrobacter freundii GTC 09479]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPLSPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQLKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|157961821|ref|YP_001501855.1| cupin 4 family protein [Shewanella pealeana ATCC 700345]
gi|157846821|gb|ABV87320.1| Cupin 4 family protein [Shewanella pealeana ATCC 700345]
Length = 381
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 92/210 (43%), Gaps = 36/210 (17%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K+PL I + KN + +L++ ++ + D ++E ++ +Y R+ D +
Sbjct: 16 EYWQKKPLVIRQGFKN-FQDLLSPDEMAGLACDEMVESRR-----VY----REKGDWQAE 65
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLN---ASLQELFGCFVGAN---------TY 108
P ++H L + + +Q LN + ++L CF +Y
Sbjct: 66 FGPFESYEH---------LGEKDWTLIVQALNNWVPAAEDLLKCFDFIPRWRLDDVMVSY 116
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
P G PH D + FI Q G+++W+V E+ + P E PI+
Sbjct: 117 AVP--GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFDPIID 171
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
V L PGD+LYLP GY H T+ S V
Sbjct: 172 VELLPGDILYLPPGYPHDGVTLEPSMSFSV 201
>gi|410861551|ref|YP_006976785.1| hypothetical protein amad1_09615 [Alteromonas macleodii AltDE1]
gi|410818813|gb|AFV85430.1| hypothetical protein amad1_09615 [Alteromonas macleodii AltDE1]
Length = 388
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G H D + F++Q +GK+ W+V P E P P Q E TP++ V +EPGD
Sbjct: 132 GVGAHIDQYDVFLVQGKGKRHWRVGQPGDYKEVFPH---PKLRQIERFTPVIDVVVEPGD 188
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
++Y+P G+ H TV E SL ++
Sbjct: 189 VIYVPPGWPHDGETV--EDSLTYSVG 212
>gi|24112531|ref|NP_707041.1| hypothetical protein SF1130 [Shigella flexneri 2a str. 301]
gi|26247349|ref|NP_753389.1| hypothetical protein c1480 [Escherichia coli CFT073]
gi|30062659|ref|NP_836830.1| hypothetical protein S1210 [Shigella flexneri 2a str. 2457T]
gi|74311687|ref|YP_310106.1| hypothetical protein SSON_1146 [Shigella sonnei Ss046]
gi|82544385|ref|YP_408332.1| hypothetical protein SBO_1911 [Shigella boydii Sb227]
gi|91210284|ref|YP_540270.1| hypothetical protein UTI89_C1257 [Escherichia coli UTI89]
gi|227886515|ref|ZP_04004320.1| cupin family protein [Escherichia coli 83972]
gi|237706886|ref|ZP_04537367.1| YcfD [Escherichia sp. 3_2_53FAA]
gi|300816831|ref|ZP_07097051.1| cupin family protein [Escherichia coli MS 107-1]
gi|300821088|ref|ZP_07101237.1| cupin family protein [Escherichia coli MS 119-7]
gi|300906953|ref|ZP_07124625.1| cupin family protein [Escherichia coli MS 84-1]
gi|300917901|ref|ZP_07134534.1| cupin family protein [Escherichia coli MS 115-1]
gi|300928314|ref|ZP_07143850.1| cupin family protein [Escherichia coli MS 187-1]
gi|300938700|ref|ZP_07153424.1| cupin family protein [Escherichia coli MS 21-1]
gi|300975594|ref|ZP_07173073.1| cupin family protein [Escherichia coli MS 45-1]
gi|300982387|ref|ZP_07176085.1| cupin family protein [Escherichia coli MS 200-1]
gi|301024586|ref|ZP_07188258.1| cupin family protein [Escherichia coli MS 69-1]
gi|301051082|ref|ZP_07197916.1| cupin family protein [Escherichia coli MS 185-1]
gi|301306072|ref|ZP_07212150.1| cupin family protein [Escherichia coli MS 124-1]
gi|331646387|ref|ZP_08347490.1| YcfD protein [Escherichia coli M605]
gi|331652179|ref|ZP_08353198.1| YcfD protein [Escherichia coli M718]
gi|331672641|ref|ZP_08373430.1| YcfD protein [Escherichia coli TA280]
gi|331676920|ref|ZP_08377616.1| YcfD protein [Escherichia coli H591]
gi|332279684|ref|ZP_08392097.1| conserved hypothetical protein [Shigella sp. D9]
gi|384542756|ref|YP_005726818.1| cupin [Shigella flexneri 2002017]
gi|386628834|ref|YP_006148554.1| hypothetical protein i02_1350 [Escherichia coli str. 'clone D i2']
gi|386633754|ref|YP_006153473.1| hypothetical protein i14_1350 [Escherichia coli str. 'clone D i14']
gi|386638637|ref|YP_006105435.1| cupin superfamily protein family [Escherichia coli ABU 83972]
gi|387606654|ref|YP_006095510.1| hypothetical protein EC042_1199 [Escherichia coli 042]
gi|415868183|ref|ZP_11539765.1| cupin family protein [Escherichia coli MS 85-1]
gi|415881115|ref|ZP_11545351.1| putative cytoplasmic protein [Escherichia coli MS 79-10]
gi|419804302|ref|ZP_14329462.1| cupin family protein [Escherichia coli AI27]
gi|422358729|ref|ZP_16439378.1| cupin family protein [Escherichia coli MS 110-3]
gi|422364209|ref|ZP_16444736.1| cupin family protein [Escherichia coli MS 153-1]
gi|422371627|ref|ZP_16452004.1| cupin family protein [Escherichia coli MS 16-3]
gi|422375446|ref|ZP_16455712.1| cupin family protein [Escherichia coli MS 60-1]
gi|422382846|ref|ZP_16463001.1| cupin family protein [Escherichia coli MS 57-2]
gi|424837586|ref|ZP_18262223.1| hypothetical protein SF5M90T_1126 [Shigella flexneri 5a str. M90T]
gi|427804287|ref|ZP_18971354.1| hypothetical protein BN16_16971 [Escherichia coli chi7122]
gi|427808868|ref|ZP_18975933.1| hypothetical protein BN17_09911 [Escherichia coli]
gi|26107750|gb|AAN79949.1|AE016759_223 Hypothetical protein ycfD [Escherichia coli CFT073]
gi|24051423|gb|AAN42748.1| conserved hypothetical protein [Shigella flexneri 2a str. 301]
gi|30040907|gb|AAP16637.1| hypothetical protein S1210 [Shigella flexneri 2a str. 2457T]
gi|73855164|gb|AAZ87871.1| conserved hypothetical protein [Shigella sonnei Ss046]
gi|81245796|gb|ABB66504.1| conserved hypothetical protein [Shigella boydii Sb227]
gi|91071858|gb|ABE06739.1| YcfD protein [Escherichia coli UTI89]
gi|226898096|gb|EEH84355.1| YcfD [Escherichia sp. 3_2_53FAA]
gi|227836719|gb|EEJ47185.1| cupin family protein [Escherichia coli 83972]
gi|281600541|gb|ADA73525.1| Cupin 4 family protein [Shigella flexneri 2002017]
gi|284920954|emb|CBG34017.1| conserved hypothetical protein [Escherichia coli 042]
gi|300297254|gb|EFJ53639.1| cupin family protein [Escherichia coli MS 185-1]
gi|300307224|gb|EFJ61744.1| cupin family protein [Escherichia coli MS 200-1]
gi|300396511|gb|EFJ80049.1| cupin family protein [Escherichia coli MS 69-1]
gi|300401300|gb|EFJ84838.1| cupin family protein [Escherichia coli MS 84-1]
gi|300410257|gb|EFJ93795.1| cupin family protein [Escherichia coli MS 45-1]
gi|300414891|gb|EFJ98201.1| cupin family protein [Escherichia coli MS 115-1]
gi|300456345|gb|EFK19838.1| cupin family protein [Escherichia coli MS 21-1]
gi|300463700|gb|EFK27193.1| cupin family protein [Escherichia coli MS 187-1]
gi|300526387|gb|EFK47456.1| cupin family protein [Escherichia coli MS 119-7]
gi|300530605|gb|EFK51667.1| cupin family protein [Escherichia coli MS 107-1]
gi|300838644|gb|EFK66404.1| cupin family protein [Escherichia coli MS 124-1]
gi|307553129|gb|ADN45904.1| cupin superfamily protein family [Escherichia coli ABU 83972]
gi|315252618|gb|EFU32586.1| cupin family protein [Escherichia coli MS 85-1]
gi|315287475|gb|EFU46886.1| cupin family protein [Escherichia coli MS 110-3]
gi|315293104|gb|EFU52456.1| cupin family protein [Escherichia coli MS 153-1]
gi|315296597|gb|EFU55892.1| cupin family protein [Escherichia coli MS 16-3]
gi|324005955|gb|EGB75174.1| cupin family protein [Escherichia coli MS 57-2]
gi|324013221|gb|EGB82440.1| cupin family protein [Escherichia coli MS 60-1]
gi|331045139|gb|EGI17266.1| YcfD protein [Escherichia coli M605]
gi|331050457|gb|EGI22515.1| YcfD protein [Escherichia coli M718]
gi|331070284|gb|EGI41650.1| YcfD protein [Escherichia coli TA280]
gi|331075609|gb|EGI46907.1| YcfD protein [Escherichia coli H591]
gi|332102036|gb|EGJ05382.1| conserved hypothetical protein [Shigella sp. D9]
gi|342926227|gb|EGU94949.1| putative cytoplasmic protein [Escherichia coli MS 79-10]
gi|355419733|gb|AER83930.1| hypothetical protein i02_1350 [Escherichia coli str. 'clone D i2']
gi|355424653|gb|AER88849.1| hypothetical protein i14_1350 [Escherichia coli str. 'clone D i14']
gi|383466638|gb|EID61659.1| hypothetical protein SF5M90T_1126 [Shigella flexneri 5a str. M90T]
gi|384472731|gb|EIE56782.1| cupin family protein [Escherichia coli AI27]
gi|412962469|emb|CCK46383.1| hypothetical protein BN16_16971 [Escherichia coli chi7122]
gi|412969047|emb|CCJ43674.1| hypothetical protein BN17_09911 [Escherichia coli]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|15801303|ref|NP_287320.1| hypothetical protein Z1833 [Escherichia coli O157:H7 str. EDL933]
gi|291282146|ref|YP_003498964.1| YcfD protein [Escherichia coli O55:H7 str. CB9615]
gi|12514754|gb|AAG55932.1|AE005327_2 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933]
gi|13361037|dbj|BAB34996.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|209772888|gb|ACI84756.1| hypothetical protein ECs1573 [Escherichia coli]
gi|209772890|gb|ACI84757.1| hypothetical protein ECs1573 [Escherichia coli]
gi|209772892|gb|ACI84758.1| hypothetical protein ECs1573 [Escherichia coli]
gi|209772894|gb|ACI84759.1| hypothetical protein ECs1573 [Escherichia coli]
gi|209772896|gb|ACI84760.1| hypothetical protein ECs1573 [Escherichia coli]
gi|290762019|gb|ADD55980.1| YcfD protein [Escherichia coli O55:H7 str. CB9615]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|237731109|ref|ZP_04561590.1| YcfD protein [Citrobacter sp. 30_2]
gi|226906648|gb|EEH92566.1| YcfD protein [Citrobacter sp. 30_2]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQLKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|110805140|ref|YP_688660.1| hypothetical protein SFV_1146 [Shigella flexneri 5 str. 8401]
gi|110614688|gb|ABF03355.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHWPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|66044707|ref|YP_234548.1| hypothetical protein Psyr_1460 [Pseudomonas syringae pv. syringae
B728a]
gi|63255414|gb|AAY36510.1| hypothetical protein Psyr_1460 [Pseudomonas syringae pv. syringae
B728a]
Length = 209
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 18/140 (12%)
Query: 72 EGCSIRFLNPQTYIKPLQQLNASLQELF-GCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
EGC++ + Y + L++ F C + N + TPP + H DD + +Q
Sbjct: 72 EGCTLYLRKIENYDPSFFKFKTELEKEFPNCAIQLNAFFTPPWSCALTAHADDHDLLAIQ 131
Query: 131 LEGKKKWKVYLP------RMVDEY----LPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+ GKKKW + P + +D + L + NFS+E+ LE G ++
Sbjct: 132 IAGKKKWSFWRPLSEVACKGLDAHQISVLSMEHAGNFSEEQ------EHVLEAGSYFFMG 185
Query: 181 RGYIHQASTVTNEHSLHVTI 200
+G IH+A +E S+HV++
Sbjct: 186 KGTIHKAQCF-DEPSIHVSV 204
>gi|212556965|gb|ACJ29419.1| Transcription factor jumonji, jmjC [Shewanella piezotolerans WP3]
Length = 381
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 91/213 (42%), Gaps = 44/213 (20%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIE----FKKNIDITLYEDGQRQTMDV 57
+W+K+PL I + KN+ +L++ +++ + D ++E FK N ++ Q +
Sbjct: 17 YWQKKPLVIRQGFKNF-QDLLSPEEMAGLACDELVESRRVFKSN------DEWQAEFGPF 69
Query: 58 EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVGAN-------- 106
E Y E L + + +Q LN L +EL CF
Sbjct: 70 ES----------YEE------LGEKDWTLIVQALNNWLPEAEELIQCFDFIPRWRFDDVM 113
Query: 107 -TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP 165
+Y TP G PH D + FI Q G+++W+V E+ + P E P
Sbjct: 114 VSYATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFEP 168
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
I+ V L PGD+LYLP GY H T+ S V
Sbjct: 169 IIDVELLPGDILYLPPGYPHDGVTLEPSMSFSV 201
>gi|416836453|ref|ZP_11902068.1| hypothetical protein ECOSU61_02838 [Escherichia coli O157:H7 str.
LSU-61]
gi|320664526|gb|EFX31677.1| hypothetical protein ECOSU61_02838 [Escherichia coli O157:H7 str.
LSU-61]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|110641305|ref|YP_669035.1| hypothetical protein ECP_1123 [Escherichia coli 536]
gi|110342897|gb|ABG69134.1| hypothetical protein YcfD [Escherichia coli 536]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHWPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|422782050|ref|ZP_16834835.1| cupin superfamily protein [Escherichia coli TW10509]
gi|323976501|gb|EGB71589.1| cupin superfamily protein [Escherichia coli TW10509]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPSASLMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|291085500|ref|ZP_06353238.2| cupin family protein [Citrobacter youngae ATCC 29220]
gi|291071160|gb|EFE09269.1| cupin family protein [Citrobacter youngae ATCC 29220]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPLSPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 69
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 70 -PFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQLKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|428970617|ref|ZP_19041136.1| cupin superfamily protein [Escherichia coli 90.0039]
gi|427231353|gb|EKV99398.1| cupin superfamily protein [Escherichia coli 90.0039]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLLISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|238791836|ref|ZP_04635473.1| hypothetical protein yinte0001_15530 [Yersinia intermedia ATCC
29909]
gi|238728940|gb|EEQ20457.1| hypothetical protein yinte0001_15530 [Yersinia intermedia ATCC
29909]
Length = 373
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 95/203 (46%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + L + ++ + +N ++ + + +E+G+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEEGRWQVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P +QL + + +L F
Sbjct: 71 FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRQLSDWRMDDLMVSF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + +Y P P+ Q + I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQYCPH---PDLLQVDPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194
>gi|117623313|ref|YP_852226.1| hypothetical protein APECO1_210 [Escherichia coli APEC O1]
gi|115512437|gb|ABJ00512.1| conserved hypothetical protein [Escherichia coli APEC O1]
Length = 376
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVDEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|323453858|gb|EGB09729.1| hypothetical protein AURANDRAFT_63211 [Aureococcus anophagefferens]
Length = 390
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 7/112 (6%)
Query: 94 SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP--- 150
++Q+ G AN Y TPP A PH D F +QL G K+W + R D Y+P
Sbjct: 140 AVQDFLGLPAQANVYVTPPGASAARPHVDRHCVFAVQLAGAKRWCL---READGYVPPPA 196
Query: 151 RYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
R + + G V L G LY+PRG H + S+HV+ V
Sbjct: 197 RERATARADAAYGGE-RAVDLAAGGALYVPRGAPHCCENAGDALSVHVSFGV 247
>gi|416266586|ref|ZP_11641656.1| hypothetical protein SDB_01875 [Shigella dysenteriae CDC 74-1112]
gi|420380495|ref|ZP_14879960.1| cupin superfamily protein [Shigella dysenteriae 225-75]
gi|320175634|gb|EFW50726.1| hypothetical protein SDB_01875 [Shigella dysenteriae CDC 74-1112]
gi|391302198|gb|EIQ60067.1| cupin superfamily protein [Shigella dysenteriae 225-75]
Length = 373
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGERLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|308049708|ref|YP_003913274.1| cupin [Ferrimonas balearica DSM 9799]
gi|307631898|gb|ADN76200.1| Cupin 4 family protein [Ferrimonas balearica DSM 9799]
Length = 373
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 49/85 (57%), Gaps = 5/85 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI+Q EG+++W+V P+ +Y R +S + E PI
Sbjct: 112 SFATP--GGGVGPHIDNYDVFIIQGEGERRWQVG-PK--GDYAARPNSGGMALVEDFNPI 166
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVT 191
+ + PGD+LY+P GY H T+T
Sbjct: 167 IDAVMSPGDVLYIPPGYPHCGQTLT 191
>gi|218689081|ref|YP_002397293.1| hypothetical protein ECED1_1272 [Escherichia coli ED1a]
gi|218426645|emb|CAR07473.1| conserved hypothetical protein [Escherichia coli ED1a]
Length = 373
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLHMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|336314360|ref|ZP_08569279.1| Cupin superfamily protein [Rheinheimera sp. A13L]
gi|335881373|gb|EGM79253.1| Cupin superfamily protein [Rheinheimera sp. A13L]
Length = 389
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 66/154 (42%), Gaps = 29/154 (18%)
Query: 95 LQELFGCFVGANTYQTPPTAQG----FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
L + + G T + A G F H+D + F+LQL G+KKW V Y P
Sbjct: 108 LAKQIAAYCGRQTVVSSYAAFGEKDSFRNHWDTRDVFVLQLIGRKKWTV--------YPP 159
Query: 151 RYSSPNFSQEE---------IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ +P F Q+ TP + LE GD+ YLPRG+ H V E + H+ +
Sbjct: 160 SFENPLFMQQSKDLEQSFPCSDTPYMEFMLEAGDIFYLPRGWWHNPMPV-GEETFHLAVG 218
Query: 202 VYQKTA-----WIDLLEKAMPKALQA--AGATDL 228
+ A WI +P A ++ +G DL
Sbjct: 219 TFPPFAINYVEWIMTQLPNLPAARKSLDSGEDDL 252
>gi|288960722|ref|YP_003451062.1| hypothetical protein AZL_a09870 [Azospirillum sp. B510]
gi|288913030|dbj|BAI74518.1| hypothetical protein AZL_a09870 [Azospirillum sp. B510]
Length = 315
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 65/126 (51%), Gaps = 16/126 (12%)
Query: 106 NTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYL-------PRMVD--EYLPRYSSP- 155
N Y++PP +GFA H DD +L LEG+K+W + PR + L R ++P
Sbjct: 119 NAYRSPP-GKGFALHVDDHPVVVLHLEGEKRWWLSRDIGVPDPPRGFNFPPGLKRLTTPW 177
Query: 156 -NFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
+ + + GT +TL PGDLLY+P G H+ VT SL + + TA +DL+
Sbjct: 178 GVYERPDEGT-FQEITLRPGDLLYVPAGMWHRTQAVTASLSLTMAMVA---TAPMDLVRP 233
Query: 215 AMPKAL 220
+ AL
Sbjct: 234 RLETAL 239
>gi|307261586|ref|ZP_07543254.1| hypothetical protein appser12_11470 [Actinobacillus
pleuropneumoniae serovar 12 str. 1096]
gi|306868709|gb|EFN00518.1| hypothetical protein appser12_11470 [Actinobacillus
pleuropneumoniae serovar 12 str. 1096]
Length = 296
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 14/110 (12%)
Query: 101 CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-------EYLPRYS 153
C + ++ Y T + F H+D + F +Q++G+K+W ++ P D + +P Y
Sbjct: 31 CHIFSSLYVAFNTQRSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEY- 89
Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
PN ++++ IL LE GD+LYLPRG+ H V E ++H+ + V+
Sbjct: 90 FPN--KDDVYMDIL---LEAGDILYLPRGWWHDPIPV-GEETVHLAVGVF 133
>gi|120598590|ref|YP_963164.1| cupin 4 family protein [Shewanella sp. W3-18-1]
gi|120558683|gb|ABM24610.1| Cupin 4 family protein [Shewanella sp. W3-18-1]
Length = 379
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P G+ H T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192
>gi|384258940|ref|YP_005402874.1| transcription factor jumonji jmjC domain-containing protein
[Rahnella aquatilis HX2]
gi|380754916|gb|AFE59307.1| transcription factor jumonji jmjC domain-containing protein
[Rahnella aquatilis HX2]
Length = 373
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 93/206 (45%), Gaps = 26/206 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM-----D 56
+W+K+P+ I + N+ + + D L +E + + + ++DG+ Q D
Sbjct: 16 YWQKKPVVIKKAFSNFVDPISP-----DELAGLAMEPEVDSRLVSHKDGEWQASNGPFED 70
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
+G L + W + + L +KP + L + L +L +Y P
Sbjct: 71 FDG--LGETGWSLLAQAVNHWHLPAAELVKPFRVLPDWRLDDLM------ISYSVP--GG 120
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + TPI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDALPMRQFCPH---PALLHVDPFTPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|153000993|ref|YP_001366674.1| cupin 4 family protein [Shewanella baltica OS185]
gi|151365611|gb|ABS08611.1| Cupin 4 family protein [Shewanella baltica OS185]
Length = 379
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P G+ H T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192
>gi|421673713|ref|ZP_16113650.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
OIFC065]
gi|421690504|ref|ZP_16130175.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
IS-116]
gi|424058349|ref|ZP_17795846.1| hypothetical protein W9K_02677 [Acinetobacter baumannii Ab33333]
gi|404564776|gb|EKA69955.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
IS-116]
gi|404665591|gb|EKB33553.1| hypothetical protein W9K_02677 [Acinetobacter baumannii Ab33333]
gi|410385931|gb|EKP38415.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
OIFC065]
Length = 405
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 99/204 (48%), Gaps = 20/204 (9%)
Query: 15 KNYYNNLITSKKIDDI-LRDNVIE--FKKNIDITL-----YEDGQRQTMDVEGRALPQIV 66
+N + IT ++ID++ LR ++I+ FK N + + Y + + + R + +
Sbjct: 42 RNSFVPQITDQQIDEVVLRADIIQDDFKLNFEGKVLPKNKYVESFQDVGKIRHRFIKAAL 101
Query: 67 WDHYREGCSIRFLNPQTYIKPL-QQLNASLQELFGCFVGANTYQTPPTA--QGFAPHYDD 123
D+ R G ++ + + Y +PL + ++E+ ++ Y + F H+D
Sbjct: 102 VDYMRRGATL--IANKLYNEPLIYNYHKQIEEIANRPALSSLYISYGNEDDSSFRAHWDT 159
Query: 124 IEAFILQLEGKKKWKVYLPRMVDEY-LPRYSSPNFS---QEEIGTPILTVTLEPGDLLYL 179
+ F+LQL G+KKW +Y P EY LP S + + P + V LE GDLLY+
Sbjct: 160 RDVFVLQLVGRKKWILYEPNF--EYPLPSQQSKDMEHVVKCPTTEPYMDVILEEGDLLYV 217
Query: 180 PRGYIHQASTVTNEHSLHVTISVY 203
PRG+ H E ++H+++ Y
Sbjct: 218 PRGWWHDP-VPCGERTVHLSVGTY 240
>gi|322833762|ref|YP_004213789.1| transcription factor jumonji jmjC domain-containing protein
[Rahnella sp. Y9602]
gi|321168963|gb|ADW74662.1| transcription factor jumonji jmjC domain-containing protein
[Rahnella sp. Y9602]
Length = 373
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 93/206 (45%), Gaps = 26/206 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM-----D 56
+W+K+P+ I + N+ + + D L +E + + + ++DG+ Q D
Sbjct: 16 YWQKKPVVIKKAFSNFVDPISP-----DELAGLAMEPEVDSRLVSHKDGEWQASNGPFED 70
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
+G L + W + + L +KP + L + L +L +Y P
Sbjct: 71 FDG--LGETGWSLLAQAVNHWHLPAAELVKPFRVLPDWRLDDLM------ISYSVP--GG 120
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + TPI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDALPMRQFCPH---PALLHVDPFTPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|170768059|ref|ZP_02902512.1| cupin family protein [Escherichia albertii TW07627]
gi|170122825|gb|EDS91756.1| cupin family protein [Escherichia albertii TW07627]
Length = 373
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + +++G+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPLSPDE-LAGLAMENEVDSRLVSHQNGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVDHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V R + ++ P + Q E I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKRQMKQHCPHL---DLLQVEPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|217973047|ref|YP_002357798.1| cupin [Shewanella baltica OS223]
gi|217498182|gb|ACK46375.1| Cupin 4 family protein [Shewanella baltica OS223]
Length = 379
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P G+ H T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192
>gi|146293332|ref|YP_001183756.1| cupin 4 family protein [Shewanella putrefaciens CN-32]
gi|145565022|gb|ABP75957.1| Cupin 4 family protein [Shewanella putrefaciens CN-32]
Length = 379
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P G+ H T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192
>gi|417586060|ref|ZP_12236833.1| cupin superfamily protein [Escherichia coli STEC_C165-02]
gi|345339216|gb|EGW71642.1| cupin superfamily protein [Escherichia coli STEC_C165-02]
Length = 373
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPIYPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|126174694|ref|YP_001050843.1| cupin 4 family protein [Shewanella baltica OS155]
gi|386341451|ref|YP_006037817.1| cupin [Shewanella baltica OS117]
gi|125997899|gb|ABN61974.1| Cupin 4 family protein [Shewanella baltica OS155]
gi|334863852|gb|AEH14323.1| Cupin 4 family protein [Shewanella baltica OS117]
Length = 379
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P G+ H T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192
>gi|419237278|ref|ZP_13780013.1| cupin superfamily protein [Escherichia coli DEC9C]
gi|378086458|gb|EHW48334.1| cupin superfamily protein [Escherichia coli DEC9C]
Length = 296
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|238749642|ref|ZP_04611147.1| hypothetical protein yrohd0001_29600 [Yersinia rohdei ATCC 43380]
gi|238712297|gb|EEQ04510.1| hypothetical protein yrohd0001_29600 [Yersinia rohdei ATCC 43380]
Length = 373
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 91/203 (44%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + L D L +E + + + +EDG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDPLSP-----DELAGLAMENEVDSRLVSHEDGRWQVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRALSDWRMDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTPMKQHCPH---PDLLQVEPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194
>gi|373949811|ref|ZP_09609772.1| Cupin 4 family protein [Shewanella baltica OS183]
gi|386324356|ref|YP_006020473.1| cupin [Shewanella baltica BA175]
gi|333818501|gb|AEG11167.1| Cupin 4 family protein [Shewanella baltica BA175]
gi|373886411|gb|EHQ15303.1| Cupin 4 family protein [Shewanella baltica OS183]
Length = 379
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P G+ H T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192
>gi|165976530|ref|YP_001652123.1| hypothetical protein APJL_1123 [Actinobacillus pleuropneumoniae
serovar 3 str. JL03]
gi|307250358|ref|ZP_07532307.1| hypothetical protein appser4_11390 [Actinobacillus pleuropneumoniae
serovar 4 str. M62]
gi|165876631|gb|ABY69679.1| hypothetical protein APJL_1123 [Actinobacillus pleuropneumoniae
serovar 3 str. JL03]
gi|306857633|gb|EFM89740.1| hypothetical protein appser4_11390 [Actinobacillus pleuropneumoniae
serovar 4 str. M62]
Length = 396
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 14/110 (12%)
Query: 101 CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-------EYLPRYS 153
C + ++ Y T + F H+D + F +Q++G+K+W ++ P D + +P Y
Sbjct: 131 CHIFSSLYVAFNTQRSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEY- 189
Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
PN ++++ IL LE GD+LYLPRG+ H V E ++H+ + V+
Sbjct: 190 FPN--KDDVYIDIL---LEAGDILYLPRGWWHDPIPV-GEETVHLAVGVF 233
>gi|386314007|ref|YP_006010172.1| transcription factor jumonji jmjC domain protein [Shewanella
putrefaciens 200]
gi|319426632|gb|ADV54706.1| transcription factor jumonji jmjC domain protein [Shewanella
putrefaciens 200]
Length = 379
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E T
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFT 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P G+ H T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192
>gi|359452147|ref|ZP_09241502.1| hypothetical protein P20495_0235 [Pseudoalteromonas sp. BSi20495]
gi|358050741|dbj|GAA77751.1| hypothetical protein P20495_0235 [Pseudoalteromonas sp. BSi20495]
Length = 381
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 91/198 (45%), Gaps = 27/198 (13%)
Query: 3 WEKEPLYISRKQKNYYN----NLITSKKIDDILRDNVIEFKKNIDITL---YEDGQRQTM 55
W+K+PL I + ++ + N + +++ + +I KN T +ED ++ T
Sbjct: 19 WQKKPLLIKQGFTDFQDPLDANELAGLAMEESIESRIITNHKNDWQTHQGPFEDFEQLT- 77
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ 115
D L Q V DH+ + L P +I N + +L +Y TP
Sbjct: 78 DDHSTLLVQAV-DHWHPDAG-QLLEPFRFIP-----NWRIDDLM------ISYSTP--NG 122
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPG 174
G PH D + FI+Q EGK+ W+V LP D L +++ + Q E ++ LEPG
Sbjct: 123 GVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEAVIDCILEPG 179
Query: 175 DLLYLPRGYIHQASTVTN 192
D+LY+P G H+ V N
Sbjct: 180 DILYIPPGCPHEGYAVEN 197
>gi|421746995|ref|ZP_16184747.1| hypothetical protein B551_09866 [Cupriavidus necator HPC(L)]
gi|409774421|gb|EKN56046.1| hypothetical protein B551_09866 [Cupriavidus necator HPC(L)]
Length = 400
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 59/129 (45%), Gaps = 22/129 (17%)
Query: 87 PLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD 146
P+++ + G AN Y + + GF H DD + ++Q++G+KKW
Sbjct: 114 PVEKFTEQMGGTLGVRSSANAYMSFGSTSGFGRHNDDHDVIVIQIDGRKKWTF------- 166
Query: 147 EYLPRYSSPNFSQEEIGTPIL---------TVTLEPGDLLYLPRGYIHQASTVTNEHSLH 197
SPN + + T T+T+ GD+L++P+G H ++ E SLH
Sbjct: 167 -----LESPNGFRHALVTETSDPEALKAGETITVSRGDILFIPKGTWHDVISI-GEPSLH 220
Query: 198 VTISVYQKT 206
+T+S+ T
Sbjct: 221 LTVSLVYPT 229
>gi|383190924|ref|YP_005201052.1| hypothetical protein Rahaq2_3092 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
gi|371589182|gb|AEX52912.1| hypothetical protein Rahaq2_3092 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
Length = 373
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 93/206 (45%), Gaps = 26/206 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM-----D 56
+W+K+P+ I + N+ + + D L +E + + + ++DG+ Q D
Sbjct: 16 YWQKKPVVIKKAFSNFVDPISP-----DELAGLAMEPEVDSRLVSHKDGEWQASNGPFED 70
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
+G L + W + + L +KP + L + L +L +Y P
Sbjct: 71 FDG--LGETGWSLLAQAVNHWHLPAAELVKPFRVLPDWRLDDLM------ISYSVP--GG 120
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + TPI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDALPMRQFCPH---PALLHVDPFTPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|161505213|ref|YP_001572325.1| hypothetical protein SARI_03351 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160866560|gb|ABX23183.1| hypothetical protein SARI_03351 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 268
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 52/100 (52%), Gaps = 3/100 (3%)
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
+ + QL+G K++ + P+ E YS+ + E +G P V L PGD LY P
Sbjct: 144 SVNGIVGQLQGVKEFTLVAPQYALEEGKFYST--YELETLGIPFECVVLHPGDFLYFPAH 201
Query: 183 YIHQASTVTNEHS-LHVTISVYQKTAWIDLLEKAMPKALQ 221
+ HQA T+ S +H T+++Y TA++D MP +Q
Sbjct: 202 WWHQAKTIECSVSFIHSTVNLYNMTAFLDDAISQMPALIQ 241
>gi|419918254|ref|ZP_14436465.1| putative cytoplasmic protein [Escherichia coli KD2]
gi|388391875|gb|EIL53315.1| putative cytoplasmic protein [Escherichia coli KD2]
Length = 373
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKCPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|308186444|ref|YP_003930575.1| hypothetical protein Pvag_0932 [Pantoea vagans C9-1]
gi|308056954|gb|ADO09126.1| Uncharacterized protein ycfD [Pantoea vagans C9-1]
Length = 374
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 87/192 (45%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L V G PH
Sbjct: 67 -PFESYDHLGENNWSLLVQAVNHWHEPSAALMRPFRFLPDWRVDDLMVSFAVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKVALKQHCPH---PDLLQVEPFDAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|193066274|ref|ZP_03047326.1| cupin family protein [Escherichia coli E22]
gi|192926123|gb|EDV80765.1| cupin family protein [Escherichia coli E22]
Length = 373
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTATLMRPFRELPDWRIDDLMISFSVPGGGVDPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|399017092|ref|ZP_10719292.1| Cupin superfamily protein [Herbaspirillum sp. CF444]
gi|398104272|gb|EJL94419.1| Cupin superfamily protein [Herbaspirillum sp. CF444]
Length = 391
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 64/136 (47%), Gaps = 11/136 (8%)
Query: 71 REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQ 130
REG + + + L +L G AN Y + + GF H DD + ++Q
Sbjct: 98 REGVTGVLEAVNELVPSVGNLTETLGGTLGAQSTANAYVSFGSISGFGIHNDDHDVIVIQ 157
Query: 131 LEGKKKWKVYLPRMVD----EYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
LEG+K+W+ + VD + +P+ S + + + GD++++P+G H
Sbjct: 158 LEGRKRWQ-FFASSVDAKRKATVKDIQAPSASDRGV-----ELVISSGDVMFVPKGTWHD 211
Query: 187 ASTVTNEHSLHVTISV 202
+ NE SLH+TIS+
Sbjct: 212 VIAL-NEKSLHLTISL 226
>gi|405371091|ref|ZP_11026802.1| Cupin 4 [Chondromyces apiculatus DSM 436]
gi|397089076|gb|EJJ20012.1| Cupin 4 [Myxococcus sp. (contaminant ex DSM 436)]
Length = 295
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 102/239 (42%), Gaps = 25/239 (10%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
+H+++ P + L T + ID ++ + + Y G+ +T E R
Sbjct: 18 EHYQRRP-FTGASSAERLQRLGTWETIDFLVEETACDVLLARQGVPYPGGRPRTAK-EAR 75
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
L + +G ++ P + L QL + + + Y TP GF H
Sbjct: 76 EL-------FAQGYTLALRQPDQHHPDLAQLARAFSAELHGRINLHIYCTPAGHHGFGWH 128
Query: 121 YDDIEAFILQLEGKKKW----KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
D E FILQ G+K++ P + E +P S +QE+ TP+ T L GD
Sbjct: 129 CDPEEVFILQTAGRKEYLLRENTLHPVPLPESVP--SGTLAAQEK--TPVETHALGAGDF 184
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
+Y+P GY H A ++ +L ++I + T +DLL+ ++A A +RR +P
Sbjct: 185 IYIPGGYWHMAQ--ASDEALSISIGLMPPT-LLDLLD-----GVRATLAASPVWRRRMP 235
>gi|54308332|ref|YP_129352.1| hypothetical protein PBPRA1139 [Photobacterium profundum SS9]
gi|46912760|emb|CAG19550.1| conserved hypothetical protein [Photobacterium profundum SS9]
Length = 379
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 4/99 (4%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y TP + G PH D + FI+Q GK+ W+V + D+Y P Q + PI
Sbjct: 114 SYSTP--SGGVGPHIDQYDVFIIQGSGKRHWRVGPKK--DDYEEELRHPALRQIKGFEPI 169
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
+ +EPGD+LY+P G+ H + S V +K
Sbjct: 170 IDEVMEPGDILYIPPGFPHDGYAIETSMSFSVGFRSPKK 208
>gi|307248127|ref|ZP_07530155.1| hypothetical protein appser2_11080 [Actinobacillus pleuropneumoniae
serovar 2 str. S1536]
gi|306855304|gb|EFM87479.1| hypothetical protein appser2_11080 [Actinobacillus pleuropneumoniae
serovar 2 str. S1536]
Length = 374
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 58/114 (50%), Gaps = 22/114 (19%)
Query: 101 CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-----------EYL 149
C + ++ Y T + F H+D + F +Q++G+K+W ++ P D EY
Sbjct: 131 CHIFSSLYVAFNTQRSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEYF 190
Query: 150 PRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
P +++++ IL LE GD+LYLPRG+ H V E ++H+ + V+
Sbjct: 191 P-------NKDDVYMDIL---LEAGDILYLPRGWWHDPIPV-GEETVHLAVGVF 233
>gi|161503677|ref|YP_001570789.1| hypothetical protein SARI_01761 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160865024|gb|ABX21647.1| hypothetical protein SARI_01761 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 373
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q R
Sbjct: 16 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQV-----RH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFDAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|218549149|ref|YP_002382940.1| hypothetical protein EFER_1802 [Escherichia fergusonii ATCC 35469]
gi|422805306|ref|ZP_16853738.1| cupin superfamily protein [Escherichia fergusonii B253]
gi|218356690|emb|CAQ89317.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469]
gi|324113919|gb|EGC07893.1| cupin superfamily protein [Escherichia fergusonii B253]
Length = 373
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 81/194 (41%), Gaps = 18/194 (9%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE--DGQRQTMDVEG 59
HW+K P+ I R +NN I D++ + + +D L DG+ Q
Sbjct: 16 HWQKRPVVIKRG----FNNFIDPISPDELAG---LAMESEVDSRLVNNLDGKWQV----- 63
Query: 60 RALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
P +DH E S+ + +P L +EL + G
Sbjct: 64 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 123
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PH D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY
Sbjct: 124 PHLDQYDVFIIQGTGRRRWRVGEKLPMKQHCPH---PDLLQVEPFEAIIDEELEPGDILY 180
Query: 179 LPRGYIHQASTVTN 192
+P G+ H+ + N
Sbjct: 181 IPPGFPHEGYALEN 194
>gi|332535585|ref|ZP_08411355.1| hypothetical protein PH505_da00110 [Pseudoalteromonas haloplanktis
ANT/505]
gi|332035003|gb|EGI71523.1| hypothetical protein PH505_da00110 [Pseudoalteromonas haloplanktis
ANT/505]
Length = 367
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 47/87 (54%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
+Y TP G PH D + FI+Q EGK+ W+V LP D L +++ + Q E
Sbjct: 98 SYSTP--NGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEA 152
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ LEPGD+LY+P G H+ V N
Sbjct: 153 VIDCILEPGDILYIPPGCPHEGYAVEN 179
>gi|308807571|ref|XP_003081096.1| COG2850: Uncharacterized conserved protein (ISS) [Ostreococcus
tauri]
gi|116059558|emb|CAL55265.1| COG2850: Uncharacterized conserved protein (ISS) [Ostreococcus
tauri]
Length = 380
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 52/120 (43%), Gaps = 18/120 (15%)
Query: 95 LQELFGCF----VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV-----YLPRMV 145
L ELF CF V P G PH D + F+LQ EG+K W V Y P
Sbjct: 79 LLELFSCFPRWRVSDIQASVSPNGGGVGPHSDHFDVFLLQAEGEKHWAVADNEEYWPDND 138
Query: 146 DEYLPRYSS---PNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
++P F +++ + TL PGD+LYLP H V + + VT+S+
Sbjct: 139 RAFVPECEIRVLKEFVEDD------SFTLVPGDMLYLPPKVAHNGVAVNSAPGVSVTLSI 192
>gi|395763450|ref|ZP_10444119.1| hypothetical protein JPAM2_17087 [Janthinobacterium lividum PAMC
25724]
Length = 373
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 49/198 (24%), Positives = 85/198 (42%), Gaps = 25/198 (12%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W K+PL I R+ + L K + ++ + +E + + + DG
Sbjct: 18 DYWHKKPLLI-RQAVPGFKALFDFKALAELATLDHVESR----LVSHADGHWNMQQGPLT 72
Query: 61 ALP---QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQG 116
+LP Q W +G ++ ++ + L +A L +L F G
Sbjct: 73 SLPSLKQKEWTLLVQGANLHSAKADALLRQFRFLPDARLDDLMVSFA--------TDGGG 124
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PH+D + F+LQ +GK+ W++ + + + LP NF+ +E TLEPG
Sbjct: 125 VGPHFDSYDVFLLQGQGKRHWRIGAQKDLSLIDGLPLKILSNFTPDE------EFTLEPG 178
Query: 175 DLLYLPRGYIHQASTVTN 192
D+LYLP Y H + +
Sbjct: 179 DMLYLPPHYAHDGVAIGD 196
>gi|386822536|ref|ZP_10109743.1| cupin [Serratia plymuthica PRI-2C]
gi|386380610|gb|EIJ21340.1| cupin [Serratia plymuthica PRI-2C]
Length = 373
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 91/202 (45%), Gaps = 34/202 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70
Query: 62 LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
+ W DH+ E S ++P +QL + + +L F
Sbjct: 71 FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194
>gi|424816550|ref|ZP_18241701.1| hypothetical protein ECD227_1667 [Escherichia fergusonii ECD227]
gi|325497570|gb|EGC95429.1| hypothetical protein ECD227_1667 [Escherichia fergusonii ECD227]
Length = 376
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 81/194 (41%), Gaps = 18/194 (9%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE--DGQRQTMDVEG 59
HW+K P+ I R +NN I D++ + + +D L DG+ Q
Sbjct: 19 HWQKRPVVIKRG----FNNFIDPISPDELAG---LAMESEVDSRLVNNLDGKWQV----- 66
Query: 60 RALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
P +DH E S+ + +P L +EL + G
Sbjct: 67 SHGPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVG 126
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PH D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY
Sbjct: 127 PHLDQYDVFIIQGTGRRRWRVGEKLPMKQHCPH---PDLLQVEPFEAIIDEELEPGDILY 183
Query: 179 LPRGYIHQASTVTN 192
+P G+ H+ + N
Sbjct: 184 IPPGFPHEGYALEN 197
>gi|320167155|gb|EFW44054.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 879
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 77/166 (46%), Gaps = 6/166 (3%)
Query: 39 KKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQEL 98
+ +ID L+ RQT +A Q Y G + +I+ + +L L L
Sbjct: 223 RDDIDHPLWFASLRQTGPTSQQASSQEPHATYASGAPLVIDLQSHFIRKVDRLLHYLGGL 282
Query: 99 FGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM----VDEYLPRYSS 154
G V + P A+ AP YD ILQ+EGK W++Y M + L R +
Sbjct: 283 TGTRVSGFAHLAPRAARASAPRYDVDNLIILQMEGKSSWRLYRQEMAYPLAAKELARRQT 342
Query: 155 PNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTI 200
F + E+ P V L+ GDLL++PRG+I+ ++ ++ H+T+
Sbjct: 343 --FKRVELDAPEHQVVLQAGDLLHVPRGWIYDVESLADDQDFHLTV 386
>gi|421783093|ref|ZP_16219545.1| cupin superfamily protein [Serratia plymuthica A30]
gi|407754799|gb|EKF64930.1| cupin superfamily protein [Serratia plymuthica A30]
Length = 373
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 91/202 (45%), Gaps = 34/202 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70
Query: 62 LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
+ W DH+ E S ++P +QL + + +L F
Sbjct: 71 FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194
>gi|333926821|ref|YP_004500400.1| cupin [Serratia sp. AS12]
gi|333931775|ref|YP_004505353.1| cupin [Serratia plymuthica AS9]
gi|386328644|ref|YP_006024814.1| cupin [Serratia sp. AS13]
gi|333473382|gb|AEF45092.1| Cupin 4 family protein [Serratia plymuthica AS9]
gi|333490881|gb|AEF50043.1| Cupin 4 family protein [Serratia sp. AS12]
gi|333960977|gb|AEG27750.1| Cupin 4 family protein [Serratia sp. AS13]
Length = 373
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 91/202 (45%), Gaps = 34/202 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70
Query: 62 LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
+ W DH+ E S ++P +QL + + +L F
Sbjct: 71 FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194
>gi|303253321|ref|ZP_07339470.1| hypothetical protein APP2_0632 [Actinobacillus pleuropneumoniae
serovar 2 str. 4226]
gi|302648003|gb|EFL78210.1| hypothetical protein APP2_0632 [Actinobacillus pleuropneumoniae
serovar 2 str. 4226]
Length = 277
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 60/110 (54%), Gaps = 14/110 (12%)
Query: 101 CFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-------EYLPRYS 153
C + ++ Y T + F H+D + F +Q++G+K+W ++ P D + +P Y
Sbjct: 120 CHIFSSLYVAFNTQRSFKIHWDSRDIFAIQMQGRKRWIIHSPTFKDPLFMHRSKDMPEY- 178
Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVY 203
PN ++++ IL LE GD+LYLPRG+ H V E ++H+ + V+
Sbjct: 179 FPN--KDDVYMDIL---LEAGDILYLPRGWWHDPIPV-GEETVHLAVGVF 222
>gi|270261451|ref|ZP_06189724.1| hypothetical protein SOD_a06830 [Serratia odorifera 4Rx13]
gi|270044935|gb|EFA18026.1| hypothetical protein SOD_a06830 [Serratia odorifera 4Rx13]
Length = 373
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 91/202 (45%), Gaps = 34/202 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70
Query: 62 LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
+ W DH+ E S ++P +QL + + +L F
Sbjct: 71 FDHLSENNWSLLVQAVDHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------SV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194
>gi|442611348|ref|ZP_21026054.1| FIG002776: hypothetical protein [Pseudoalteromonas luteoviolacea B
= ATCC 29581]
gi|441747276|emb|CCQ12116.1| FIG002776: hypothetical protein [Pseudoalteromonas luteoviolacea B
= ATCC 29581]
Length = 383
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR-MVDEYLPRYSSPNFSQEEIGTP 165
+Y TP G PH D + FI+Q +GK+ W+V LP + E+ S Q E
Sbjct: 116 SYSTP--GGGVGPHLDQYDVFIIQGQGKRHWRVGLPNPSLREFAQNKS---LLQVEAFDA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
I+ LEPGD+LY+P G+ H+ + N
Sbjct: 171 IIDAVLEPGDILYIPPGHPHEGYAIEN 197
>gi|294140715|ref|YP_003556693.1| hypothetical protein SVI_1944 [Shewanella violacea DSS12]
gi|293327184|dbj|BAJ01915.1| conserved hypothetical protein [Shewanella violacea DSS12]
Length = 380
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 94/211 (44%), Gaps = 38/211 (18%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYED-GQRQTMDVEG 59
D+W+K+PL I R+ N + +L+T +++ + D ++E ++ +Y++ GQ Q
Sbjct: 16 DYWQKKPLVI-RQGFNNFEDLLTPEEMAGLACDELVESRR-----IYKNQGQWQA----- 64
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVG---------ANT 107
P +D Y + L + +Q LN L ++L CF +
Sbjct: 65 EFGP---FDSYED------LGETDWTLVVQALNHWLPDAEQLIQCFDFIPRWRFDDVMVS 115
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
Y TP G PH D + FI Q G+++W+V E+ + P E I+
Sbjct: 116 YATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHKEFA---AHPALLHTEAFDAII 170
Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
V L PGD+LYLP G+ H+ + S V
Sbjct: 171 DVELLPGDILYLPPGFPHEGVALEASMSFSV 201
>gi|336450774|ref|ZP_08621221.1| hypothetical protein A28LD_0880 [Idiomarina sp. A28L]
gi|336282597|gb|EGN75829.1| hypothetical protein A28LD_0880 [Idiomarina sp. A28L]
Length = 405
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y TP G PH D + FI+Q G ++W+V + P P+ Q PI
Sbjct: 127 SYSTP--GGGVGPHLDQYDVFIVQGMGSRRWQVGASGDFQTHCPH---PDLKQVAEFAPI 181
Query: 167 LTVTLEPGDLLYLPRGYIHQASTV 190
+ LEPGD++Y+P G+ H TV
Sbjct: 182 INEVLEPGDMIYIPAGFPHAGDTV 205
>gi|300716265|ref|YP_003741068.1| hypothetical protein EbC_16860 [Erwinia billingiae Eb661]
gi|299062101|emb|CAX59217.1| conserved uncharacterized protein YcfD [Erwinia billingiae Eb661]
Length = 375
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 95/202 (47%), Gaps = 34/202 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQT 110
D G L Q V DH+ E S + P +++ + + + ++
Sbjct: 71 FDHLGENNWSLLVQAV-DHWHE-PSAALMRPFSFLPDWRTDDLMI-----------SFSV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 118 P--GGGVGPHFDQYDVFIIQGTGRRRWRVGDKVPLKQHCPH---PDLLQVEPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ ++ N
Sbjct: 173 MEPGDILYIPPGFPHEGYSLEN 194
>gi|453061876|gb|EMF02872.1| cupin [Serratia marcescens VGH107]
Length = 373
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/202 (22%), Positives = 91/202 (45%), Gaps = 34/202 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70
Query: 62 LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
+ W DH+ E + ++P +QL + + +L F
Sbjct: 71 FDHLSENNWSLLVQAVDHWHEPSA-------ALMRPFRQLPDWRMDDLMISF------SV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKTPLKQHCPH---PDLLQVEPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194
>gi|16764584|ref|NP_460199.1| hypothetical protein STM1229 [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|167991981|ref|ZP_02573080.1| cupin family protein [Salmonella enterica subsp. enterica serovar
4,[5],12:i:- str. CVM23701]
gi|374980227|ref|ZP_09721557.1| hypothetical protein SEE_02292 [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|378444662|ref|YP_005232294.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. D23580]
gi|378449630|ref|YP_005236989.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|378699122|ref|YP_005181079.1| hypothetical protein SL1344_1167 [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|378983789|ref|YP_005246944.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|378988573|ref|YP_005251737.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|379700400|ref|YP_005242128.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|421884796|ref|ZP_16316001.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Senftenberg str. SS209]
gi|422025378|ref|ZP_16371812.1| hypothetical protein B571_06091 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|427548949|ref|ZP_18927123.1| hypothetical protein B576_06196 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|427564604|ref|ZP_18931826.1| hypothetical protein B577_05550 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|427584451|ref|ZP_18936623.1| hypothetical protein B573_05591 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|427606865|ref|ZP_18941437.1| hypothetical protein B574_05619 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|427632052|ref|ZP_18946385.1| hypothetical protein B575_06187 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|427655221|ref|ZP_18951142.1| hypothetical protein B578_05802 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|427660557|ref|ZP_18956048.1| hypothetical protein B579_06422 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|427666409|ref|ZP_18960820.1| hypothetical protein B580_05984 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|427752095|ref|ZP_18965920.1| hypothetical protein B581_07315 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
gi|16419747|gb|AAL20158.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|205329821|gb|EDZ16585.1| cupin family protein [Salmonella enterica subsp. enterica serovar
4,[5],12:i:- str. CVM23701]
gi|261246441|emb|CBG24251.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|267993008|gb|ACY87893.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|301157770|emb|CBW17263.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. SL1344]
gi|312912217|dbj|BAJ36191.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|321223847|gb|EFX48910.1| hypothetical protein SEE_02292 [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|323129499|gb|ADX16929.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|332988120|gb|AEF07103.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|379985510|emb|CCF88274.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Senftenberg str. SS209]
gi|414020717|gb|EKT04291.1| hypothetical protein B571_06091 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|414020728|gb|EKT04301.1| hypothetical protein B576_06196 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|414034694|gb|EKT17612.1| hypothetical protein B577_05550 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|414035894|gb|EKT18740.1| hypothetical protein B573_05591 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|414039416|gb|EKT22087.1| hypothetical protein B574_05619 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|414049030|gb|EKT31256.1| hypothetical protein B578_05802 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|414050503|gb|EKT32674.1| hypothetical protein B575_06187 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|414055066|gb|EKT36986.1| hypothetical protein B579_06422 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|414060577|gb|EKT42086.1| hypothetical protein B580_05984 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|414066208|gb|EKT46813.1| hypothetical protein B581_07315 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
Length = 373
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|448241809|ref|YP_007405862.1| cupin superfamily protein [Serratia marcescens WW4]
gi|445212173|gb|AGE17843.1| cupin superfamily protein [Serratia marcescens WW4]
Length = 373
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/202 (22%), Positives = 91/202 (45%), Gaps = 34/202 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70
Query: 62 LPQIV---W-------DHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
+ W DH+ E + ++P +QL + + +L F
Sbjct: 71 FDHLSENNWSLLVQAVDHWHEPSA-------ALMRPFRQLPDWRMDDLMISF------SV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKTPLKQHCPH---PDLLQVEPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 173 MEPGDILYIPPGFPHEGYALEN 194
>gi|427399222|ref|ZP_18890460.1| hypothetical protein HMPREF9710_00056 [Massilia timonae CCUG 45783]
gi|425721713|gb|EKU84621.1| hypothetical protein HMPREF9710_00056 [Massilia timonae CCUG 45783]
Length = 373
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/191 (27%), Positives = 82/191 (42%), Gaps = 21/191 (10%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM-DVEG 59
D+W K+PL I R + L+ +K+ ++ R N +E + D Q + ++
Sbjct: 18 DYWHKKPLLI-RGAIPGFKPLLKFEKLAEMARQNHVESRLVTQKDGNWDMQHGPLAELPA 76
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFA 118
R P W +G ++ ++ + + +A L +L F G
Sbjct: 77 RTEPN--WTMLVQGVNLYDERADELLRQFRFVPDARLDDLMVSFA--------TDGAGVG 126
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
PHYD + F+LQ GK++WK+ + + E LP NF EE LEPGD+
Sbjct: 127 PHYDSYDVFLLQAHGKRRWKIGSQQDLTLVEGLPLKILANFKPEE------EFVLEPGDM 180
Query: 177 LYLPRGYIHQA 187
LYLP Y H
Sbjct: 181 LYLPPHYAHDG 191
>gi|383495951|ref|YP_005396640.1| hypothetical protein UMN798_1277 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|417339270|ref|ZP_12120862.1| hypothetical protein LTSEBAI_0501 [Salmonella enterica subsp.
enterica serovar Baildon str. R6-199]
gi|357960559|gb|EHJ84366.1| hypothetical protein LTSEBAI_0501 [Salmonella enterica subsp.
enterica serovar Baildon str. R6-199]
gi|380462772|gb|AFD58175.1| hypothetical protein UMN798_1277 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
Length = 383
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 26 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 77 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 192
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 193 PGFPHEGYALEN 204
>gi|283784938|ref|YP_003364803.1| hypothetical protein ROD_12171 [Citrobacter rodentium ICC168]
gi|282948392|emb|CBG87980.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
Length = 373
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKQQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H + N
Sbjct: 183 PGFPHAGYALEN 194
>gi|407687590|ref|YP_006802763.1| hypothetical protein AMBAS45_09055 [Alteromonas macleodii str.
'Balearic Sea AD45']
gi|407290970|gb|AFT95282.1| hypothetical protein AMBAS45_09055 [Alteromonas macleodii str.
'Balearic Sea AD45']
Length = 388
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G H D + F++Q +G+++W+V P E P P Q E P++ V +E
Sbjct: 129 NGAGVGAHIDQYDVFLVQGKGRRRWRVGKPADYKEVFPH---PKLRQIEGFDPVIDVVVE 185
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
PGD++Y+P G+ H +T+ E SL ++
Sbjct: 186 PGDVVYVPPGWPHDGATI--EDSLTYSVG 212
>gi|149926788|ref|ZP_01915047.1| hypothetical protein LMED105_12342 [Limnobacter sp. MED105]
gi|149824340|gb|EDM83558.1| hypothetical protein LMED105_12342 [Limnobacter sp. MED105]
Length = 383
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 40/72 (55%), Gaps = 8/72 (11%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PHYD + F+LQ+ GK++WK+ L + ++E LP NF P LEP
Sbjct: 125 GVGPHYDSYDVFLLQMHGKRRWKIGPLLDKELEEGLPLKILKNFE------PTEEFVLEP 178
Query: 174 GDLLYLPRGYIH 185
GD+LYLP Y H
Sbjct: 179 GDMLYLPPNYGH 190
>gi|167517725|ref|XP_001743203.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778302|gb|EDQ91917.1| predicted protein [Monosiga brevicollis MX1]
Length = 2199
Score = 57.4 bits (137), Expect = 1e-05, Method: Composition-based stats.
Identities = 38/141 (26%), Positives = 65/141 (46%), Gaps = 21/141 (14%)
Query: 89 QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
QQ+ A L G + + Y +PP+ + +PH D + F+ Q G K W + +P+ D
Sbjct: 237 QQIQAHL----GVVMSEHFYISPPSQRALSPHTDGGDVFVHQWAGSKLWTLCVPKAPDCQ 292
Query: 149 LPRYSS---------------PNFSQEEI-GTPILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ ++ E++ G T+ L+ +LLY+PRG +H A +
Sbjct: 293 DCTFADLALRAELAKSAFQGCTAYTDEQLAGMECRTLDLQENELLYVPRGVVHYARANST 352
Query: 193 EHSLHVTISVYQKTA-WIDLL 212
S H T V +A WID++
Sbjct: 353 GLSAHATYQVVPPSATWIDMV 373
>gi|168244303|ref|ZP_02669235.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL486]
gi|194450781|ref|YP_002045231.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL476]
gi|386591091|ref|YP_006087491.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. B182]
gi|419729313|ref|ZP_14256271.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41579]
gi|419736387|ref|ZP_14263238.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41563]
gi|419737739|ref|ZP_14264511.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41573]
gi|419744077|ref|ZP_14270736.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41566]
gi|419749199|ref|ZP_14275685.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41565]
gi|421569517|ref|ZP_16015220.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00322]
gi|421573533|ref|ZP_16019168.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00325]
gi|421578721|ref|ZP_16024295.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00326]
gi|421582567|ref|ZP_16028103.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00328]
gi|194409085|gb|ACF69304.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL476]
gi|205336690|gb|EDZ23454.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL486]
gi|381292526|gb|EIC33729.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41563]
gi|381296887|gb|EIC37986.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41579]
gi|381304261|gb|EIC45268.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41573]
gi|381310336|gb|EIC51167.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41566]
gi|381312206|gb|EIC53014.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Heidelberg str. 41565]
gi|383798135|gb|AFH45217.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. B182]
gi|402524234|gb|EJW31539.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00326]
gi|402527359|gb|EJW34622.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00322]
gi|402527379|gb|EJW34641.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00325]
gi|402533034|gb|EJW40219.1| JmjC Cupin hydroxylase [Salmonella enterica subsp. enterica serovar
Heidelberg str. CFSAN00328]
Length = 373
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|426405554|ref|YP_007024525.1| hypothetical protein Bdt_3583 [Bdellovibrio bacteriovorus str.
Tiberius]
gi|425862222|gb|AFY03258.1| hypothetical protein Bdt_3583 [Bdellovibrio bacteriovorus str.
Tiberius]
Length = 308
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 106/246 (43%), Gaps = 40/246 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE-DGQRQTMDVEGR 60
HW EPL+I ++ + +++ D L++ + ++ + L + D + ++ +E
Sbjct: 22 HWPVEPLFIP-ATPGKLQDIFSLEQMQD-LKNLISARQRKVRACLPDFDDEYSSIHLE-- 77
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN----------TYQT 110
P YR ++ F + Q+ + + +++ G G Y T
Sbjct: 78 --PNDALKAYRNNMTLVFDSMQSQDSTIADILGNVRADLGLVTGGAENDLCKARSIAYAT 135
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI---- 166
P G H+D FI+Q++G K W++ VD R+++ EE+ +
Sbjct: 136 P-AGCGTRLHFDANANFIIQIKGTKTWRLAPNESVDFPTERFTT---GSEEMPAALEKQC 191
Query: 167 ------------LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
+ V ++PG +L++PRGY H+ T T E SL + + Q T W D+ K
Sbjct: 192 HAHLIDALDEDSMEVVMKPGCVLFVPRGYWHE--TTTEEESLSLNFTFSQPT-WADVFTK 248
Query: 215 AMPKAL 220
++ + L
Sbjct: 249 SLQEVL 254
>gi|406596649|ref|YP_006747779.1| hypothetical protein MASE_08470 [Alteromonas macleodii ATCC 27126]
gi|406373970|gb|AFS37225.1| hypothetical protein MASE_08470 [Alteromonas macleodii ATCC 27126]
Length = 388
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G H D + F++Q +G+++W+V P E P P Q E P++ V +E
Sbjct: 129 NGAGVGAHIDQYDVFLVQGKGRRRWRVGKPADYKEVFPH---PKLRQIEGFDPVIDVVVE 185
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
PGD++Y+P G+ H +T+ E SL ++
Sbjct: 186 PGDVVYVPPGWPHDGATI--EDSLTYSVG 212
>gi|108757183|ref|YP_628426.1| cupin [Myxococcus xanthus DK 1622]
gi|108461063|gb|ABF86248.1| cupin family protein [Myxococcus xanthus DK 1622]
Length = 295
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 94/205 (45%), Gaps = 27/205 (13%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
+ +G ++ P + L QL + + + Y TP GF H D E FIL
Sbjct: 78 FAQGYTLALRQPDLHHPDLAQLARAFSAELHGRINLHIYCTPAGHHGFGWHCDPEEVFIL 137
Query: 130 QLEGKKKW----KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
Q G+K + P + E +P S +QE+ TP+ T +L GD +Y+P G+ H
Sbjct: 138 QTAGRKDYLLRENTLHPVPLPESVP--SGSLAAQEK--TPVETHSLSAGDFIYIPGGHWH 193
Query: 186 QASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLA 245
A E +L ++I + T +DLL+ ++AA A+ +RR +P L
Sbjct: 194 MAQ--ATEEALSISIGLMPPT-LLDLLD-----GVRAALASSPVWRRRMP-------SLG 238
Query: 246 RGKPVDIQADRLAMKENLKDLLAKL 270
R +D +LA+ L+ LL++L
Sbjct: 239 RASSLD-DPSKLAL---LRTLLSEL 259
>gi|323494948|ref|ZP_08100039.1| hypothetical protein VIBR0546_21010 [Vibrio brasiliensis LMG 20546]
gi|323310743|gb|EGA63916.1| hypothetical protein VIBR0546_21010 [Vibrio brasiliensis LMG 20546]
Length = 377
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 85/196 (43%), Gaps = 20/196 (10%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM----- 55
+HW K+P I ++ + I+ +++ + + ++ + +DGQ +
Sbjct: 14 EHWHKKPTVIKAGFADFVDP-ISPEELAGLTMEEEVDSR----FISNQDGQWSALHGPLP 68
Query: 56 DVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPTA 114
+ +L + W + C+ L I P +QL L +L C Y P
Sbjct: 69 ESHFESLSESHWQLIVQACNHWHLGAAELIAPFKQLPQWLFDDLMVC------YSAP--K 120
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
G PH D + FI+Q GK++W+V + +Y + Q E I+ L PG
Sbjct: 121 GGVGPHIDQYDVFIIQGSGKRRWRVG-AKDTGQYNESIQASALRQIEGFDAIIDEVLMPG 179
Query: 175 DLLYLPRGYIHQASTV 190
D+LY+P G+ H+ +T+
Sbjct: 180 DILYIPPGFPHEGNTI 195
>gi|410091761|ref|ZP_11288312.1| cupin 4 family protein [Pseudomonas viridiflava UASWS0038]
gi|409760994|gb|EKN46105.1| cupin 4 family protein [Pseudomonas viridiflava UASWS0038]
Length = 374
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 58/118 (49%), Gaps = 4/118 (3%)
Query: 88 LQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDE 147
+++L +L + V AN Y F H+D + F QL G K+W +Y P +D
Sbjct: 104 IKRLCNALSHIANGQVVANGYLAKTGDSAFGNHWDSHDVFASQLLGNKRWLIYPPTFIDP 163
Query: 148 YLPRYSSPNFSQEEIGT-PILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQ 204
+ + S + E+ T + V EPGD+LY+PRG+ H A + HV + V++
Sbjct: 164 IVGQKSIKHL--EDCPTEAYMDVITEPGDILYIPRGWWHCAFP-SEGPCFHVAVGVHK 218
>gi|407699915|ref|YP_006824702.1| hypothetical protein AMBLS11_08330 [Alteromonas macleodii str.
'Black Sea 11']
gi|407249062|gb|AFT78247.1| hypothetical protein AMBLS11_08330 [Alteromonas macleodii str.
'Black Sea 11']
Length = 388
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G H D + F++Q +G+++W+V P E P P Q E P++ V +E
Sbjct: 129 NGAGVGAHIDQYDVFLVQGKGQRRWRVGKPANYKEVCPH---PKLRQIEGFDPVIDVVVE 185
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
PGD++Y+P G+ H +T+ E SL ++
Sbjct: 186 PGDVVYVPPGWPHDGATI--EDSLTYSVG 212
>gi|392537664|ref|ZP_10284801.1| RmlC-like domain-containing protein [Pseudoalteromonas marina
mano4]
Length = 385
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
++ TP G PH D + FI+Q EGK+ W+V LP D L +++ + Q E
Sbjct: 116 SFSTP--NGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ LEPGD+LY+P G H+ V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197
>gi|261252554|ref|ZP_05945127.1| hypothetical protein VIA_002578 [Vibrio orientalis CIP 102891 =
ATCC 33934]
gi|417952662|ref|ZP_12595720.1| hypothetical protein VIOR3934_03262 [Vibrio orientalis CIP 102891 =
ATCC 33934]
gi|260935945|gb|EEX91934.1| hypothetical protein VIA_002578 [Vibrio orientalis CIP 102891 =
ATCC 33934]
gi|342818888|gb|EGU53739.1| hypothetical protein VIOR3934_03262 [Vibrio orientalis CIP 102891 =
ATCC 33934]
Length = 377
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 84/202 (41%), Gaps = 32/202 (15%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNV--IEFKKNIDITLYEDGQRQTMDVE 58
+HW K+P I ++ ID I D + + ++ ID + Q
Sbjct: 14 EHWHKKPTVIKAGFADF---------IDPISADELAGLAMEEEIDSRFISNKDNQWSATH 64
Query: 59 G-------RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQT 110
G +L + W + C+ L ++ +QL L +L CF +
Sbjct: 65 GPLPESHFESLDESHWQLIVQACNHWHLGSAELVQAFKQLPQWLFDDLMVCF-------S 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPNFSQEEIGTPILT 168
P G PH D + FI+Q GK++W+V +D +Y + Q E I+
Sbjct: 118 APEG-GVGPHIDQYDVFIIQGSGKRRWRV---GDIDKGQYKESIQAGALRQIEGFESIID 173
Query: 169 VTLEPGDLLYLPRGYIHQASTV 190
LEPGD+LY+P G+ H+ +T+
Sbjct: 174 EVLEPGDILYIPPGFPHEGNTL 195
>gi|119470128|ref|ZP_01612894.1| hypothetical protein ATW7_13583 [Alteromonadales bacterium TW-7]
gi|119446549|gb|EAW27823.1| hypothetical protein ATW7_13583 [Alteromonadales bacterium TW-7]
Length = 385
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
++ TP G PH D + FI+Q EGK+ W+V LP D L +++ + Q E
Sbjct: 116 SFSTP--NGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ LEPGD+LY+P G H+ V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197
>gi|359448363|ref|ZP_09237901.1| RmlC-like domain-containing protein [Pseudoalteromonas sp.
BSi20480]
gi|358045772|dbj|GAA74150.1| RmlC-like domain-containing protein [Pseudoalteromonas sp.
BSi20480]
Length = 385
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
++ TP G PH D + FI+Q EGK+ W+V LP D L +++ + Q E
Sbjct: 116 SFSTP--NGGVGPHLDQYDVFIIQGEGKRHWRVGLP---DPTLKQFAQNKKLLQVEAFEA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ LEPGD+LY+P G H+ V N
Sbjct: 171 VIDCILEPGDILYIPPGCPHEGYAVEN 197
>gi|117920191|ref|YP_869383.1| cupin 4 family protein [Shewanella sp. ANA-3]
gi|117612523|gb|ABK47977.1| Cupin 4 family protein [Shewanella sp. ANA-3]
Length = 379
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGANTPQQR-----RGGNPNSPLVEDFE 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P GY H T+T
Sbjct: 166 PIIDVVLEKGDVLYIPPGYPHCGETLT 192
>gi|407683605|ref|YP_006798779.1| hypothetical protein AMEC673_08540 [Alteromonas macleodii str.
'English Channel 673']
gi|407245216|gb|AFT74402.1| hypothetical protein AMEC673_08540 [Alteromonas macleodii str.
'English Channel 673']
Length = 388
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G H D + F++Q +G+++W+V P E P P Q E P++ V +E
Sbjct: 129 NGAGVGAHIDHYDVFLVQGKGRRRWRVGKPADYKEVFPH---PKLRQIEGFDPVIDVVVE 185
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
PGD++Y+P G+ H +T+ E SL ++
Sbjct: 186 PGDVVYVPPGWPHDGATI--EDSLTYSVG 212
>gi|224010139|ref|XP_002294027.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970044|gb|EED88382.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 415
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 33/126 (26%)
Query: 97 ELFGCFVGANTYQTPPT-----------AQGFAPHYDDIEAF---------------ILQ 130
E+ G VG N Y TP QGF H+D ++AF ++Q
Sbjct: 18 EVKGVHVGVNLYLTPEALEDEEGGGKQDRQGFEAHWDWMDAFDAPFFTDQWKSLVVIVIQ 77
Query: 131 LEGKKKWKV------YLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
L G+K+W+V +L + P + + G + T+ PGD LY+PRG++
Sbjct: 78 LSGRKRWRVAKQPTIFLSNKDQKRRPTNEEAQYFATDEGHYV-EFTMCPGDSLYIPRGFM 136
Query: 185 HQASTV 190
H ASTV
Sbjct: 137 HNASTV 142
>gi|114047176|ref|YP_737726.1| cupin 4 family protein [Shewanella sp. MR-7]
gi|113888618|gb|ABI42669.1| Cupin 4 family protein [Shewanella sp. MR-7]
Length = 379
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGANTPQQR-----RGGNPNSPLVEDFE 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P GY H T+T
Sbjct: 166 PIIDVVLEKGDVLYIPPGYPHCGETLT 192
>gi|163751716|ref|ZP_02158934.1| hypothetical protein KT99_12104 [Shewanella benthica KT99]
gi|161328368|gb|EDP99527.1| hypothetical protein KT99_12104 [Shewanella benthica KT99]
Length = 397
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 86/211 (40%), Gaps = 38/211 (18%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W+K+PL I + KN + +L+T + + + D+++E R+ EG
Sbjct: 33 DYWQKKPLVIRQGFKN-FEDLLTPEDMAGLACDDMVE-------------SRRIYKTEGE 78
Query: 61 ALPQIV-WDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVG---------ANT 107
+ +D Y L + +Q LN + EL CF +
Sbjct: 79 WQAEFGPFDSYEH------LGESDWTLVVQALNNWVPKADELIQCFDFIPRWRLDDVMVS 132
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL 167
Y TP G PH D + FI Q G+++W+V E+ + P E I+
Sbjct: 133 YATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHKEFA---AHPALLHTEAFEAII 187
Query: 168 TVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
V L PGD+LYLP G+ H + S V
Sbjct: 188 DVELLPGDILYLPPGFPHDGVALEASMSFSV 218
>gi|304397246|ref|ZP_07379125.1| Cupin 4 family protein [Pantoea sp. aB]
gi|304355395|gb|EFM19763.1| Cupin 4 family protein [Pantoea sp. aB]
Length = 375
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 87/192 (45%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L V G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPSAALMRPFRFLPDWRVDDLMVSFAVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|428186247|gb|EKX55098.1| hypothetical protein GUITHDRAFT_99731 [Guillardia theta CCMP2712]
Length = 390
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 113/286 (39%), Gaps = 77/286 (26%)
Query: 4 EKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALP 63
E+ PL I R YY+ + K ++ ++++ +E+ ID T Y +GQR T + EG
Sbjct: 100 EQRPLLIKRNNPAYYDQWMNLKDLELLVKEG-LEWNFEIDATRYTNGQRTTHNGEG---- 154
Query: 64 QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDD 123
+ C + + +K + N + LFG +
Sbjct: 155 ------WTSNC-----HTELPLKLMYGANMRMVALFG--------------------FQG 183
Query: 124 IEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
+ ++ + VY PR E +PRYSS N ++
Sbjct: 184 LTGDLIICARRPYIYVYEPRNEYEAMPRYSSRNMTE------------------------ 219
Query: 184 IHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAG 243
A + + HSLHVT+S + +W D++++ + AL A +++ RR + + Y
Sbjct: 220 ---AKSSRDTHSLHVTVSHGYRNSWYDVIKETLISALDVAVQHNVDMRRQIKLAASGYDI 276
Query: 244 LARGKPVD-----------IQADRLAMKENLKDL---LAKLVEYVD 275
G P D +A+++++K + L +A LV VD
Sbjct: 277 STEGNPDDDVEELLAPAREAEAEKVSLKTQFRLLEGDIAVLVHSVD 322
>gi|440757851|ref|ZP_20937031.1| hypothetical protein F385_865 [Pantoea agglomerans 299R]
gi|436428324|gb|ELP25981.1| hypothetical protein F385_865 [Pantoea agglomerans 299R]
Length = 375
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 87/192 (45%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L V G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPSAALMRPFRFLPDWRVDDLMVSFAVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|113969941|ref|YP_733734.1| cupin 4 family protein [Shewanella sp. MR-4]
gi|113884625|gb|ABI38677.1| Cupin 4 family protein [Shewanella sp. MR-4]
Length = 379
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGANTPQQR-----RGGNPNSPLVEDFE 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P GY H T+T
Sbjct: 166 PIIDVVLEKGDVLYIPPGYPHCGETLT 192
>gi|113200554|ref|YP_717716.1| gp49 [Synechococcus phage syn9]
gi|76574453|gb|ABA47018.1| gp49 [Synechococcus phage syn9]
Length = 236
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 2/124 (1%)
Query: 64 QIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDD 123
QIV +H G + +N Y + Q+L + + +F + Y A+ F H D
Sbjct: 82 QIV-EHINHGHTFIIMNYAFYSRWTQELMKTFESIFAVDCAIHVYGGKEGAKSFNIHDDY 140
Query: 124 IEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGY 183
FI+Q+EG+ +WK+Y R + L ++ + +E+ L VTL+PGD LY+P
Sbjct: 141 PSNFIIQVEGETEWKIYKNR-ISSMLQTGTAQDTIREDNLEVDLHVTLKPGDALYIPSRA 199
Query: 184 IHQA 187
H A
Sbjct: 200 YHCA 203
>gi|343502128|ref|ZP_08739989.1| hypothetical protein VITU9109_16393 [Vibrio tubiashii ATCC 19109]
gi|418481137|ref|ZP_13050186.1| hypothetical protein VT1337_21872 [Vibrio tubiashii NCIMB 1337 =
ATCC 19106]
gi|342815551|gb|EGU50467.1| hypothetical protein VITU9109_16393 [Vibrio tubiashii ATCC 19109]
gi|384571325|gb|EIF01862.1| hypothetical protein VT1337_21872 [Vibrio tubiashii NCIMB 1337 =
ATCC 19106]
Length = 377
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 60/132 (45%), Gaps = 10/132 (7%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFA 118
+L + W + C+ + I+P + L N +L CF + P G
Sbjct: 73 ESLDESHWQLVVQACNHWHIGAAELIEPFKPLPNWLFDDLMVCF-------SAPKG-GVG 124
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PH D + FI+Q GK+ W+V + V +Y + Q E I+ LEPGD+LY
Sbjct: 125 PHIDQYDVFIIQGSGKRHWRVG-DKDVGQYKESIQAGALRQIEGFDAIIDEILEPGDILY 183
Query: 179 LPRGYIHQASTV 190
+P G+ H+ T+
Sbjct: 184 IPPGFPHEGDTL 195
>gi|445421484|ref|ZP_21435886.1| cupin domain protein, PF06172 family [Acinetobacter sp. WC-743]
gi|444757452|gb|ELW81974.1| cupin domain protein, PF06172 family [Acinetobacter sp. WC-743]
Length = 382
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/173 (22%), Positives = 81/173 (46%), Gaps = 14/173 (8%)
Query: 57 VEGRALPQIVWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
VE + + +++++ R+G ++ + + + ++ + + AS E G Y +
Sbjct: 75 VEYKCITSVLYEYLRKGATLVYNRIKNEPFVDNISRQVASFAEAHTITGG---YAAFSSK 131
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTL 171
+ H+D + + +QL G+K+W + P P Y +I P + V L
Sbjct: 132 SSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKYFSDIKEPEEVYMDVIL 188
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
E GD+LY+PRG+ H + +E + H+ ++ + T ++ L+ A+P L
Sbjct: 189 EAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMQWLQNAIPSILDC 240
>gi|422030384|ref|ZP_16376586.1| hypothetical protein B572_06048, partial [Salmonella enterica
subsp. enterica serovar Typhimurium str. STm2]
gi|414022210|gb|EKT05701.1| hypothetical protein B572_06048, partial [Salmonella enterica
subsp. enterica serovar Typhimurium str. STm2]
Length = 247
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|416241522|ref|ZP_11632815.1| hypothetical protein E9S_01389 [Moraxella catarrhalis BC7]
gi|326572066|gb|EGE22067.1| hypothetical protein E9S_01389 [Moraxella catarrhalis BC7]
Length = 383
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 77/176 (43%), Gaps = 31/176 (17%)
Query: 60 RALPQIVWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQG- 116
R + ++ H + G +I + +N + ++ L + FV A T + A G
Sbjct: 78 RFIKSAIYHHMKHGATIVYNRINNEPFVD-------GLAKQISHFVQAPTVVSGYLAFGS 130
Query: 117 ---FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQE-------EIGTPI 166
F H+D + F +QL GKK W + Y P + SP + Q+ E P
Sbjct: 131 DASFKSHWDTRDVFAVQLMGKKYWAL--------YKPNFESPLYMQQSKDVDVPEPTIPD 182
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKAL 220
+ + LE GD+LY+PRG+ H + + + H+ I + T ++ L K P L
Sbjct: 183 MEIVLEAGDVLYIPRGWWHNPVPMGCK-TFHLAIGTFAPTGYDYMQWLVKQTPNIL 237
>gi|82777261|ref|YP_403610.1| hypothetical protein SDY_2024 [Shigella dysenteriae Sd197]
gi|81241409|gb|ABB62119.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
Length = 376
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 19 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 68
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 69 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 128
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEP D+LY+P
Sbjct: 129 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPSDILYIP 185
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 186 PGFPHEGYALEN 197
>gi|309788198|ref|ZP_07682804.1| cupin family protein [Shigella dysenteriae 1617]
gi|308924050|gb|EFP69551.1| cupin family protein [Shigella dysenteriae 1617]
Length = 373
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 66 GPFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEP D+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPSDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|238795998|ref|ZP_04639510.1| hypothetical protein ymoll0001_23970 [Yersinia mollaretii ATCC
43969]
gi|238720203|gb|EEQ12007.1| hypothetical protein ymoll0001_23970 [Yersinia mollaretii ATCC
43969]
Length = 373
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 87/192 (45%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + +E+G+ Q G
Sbjct: 16 YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEEGRWQV----GHG 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 -PFESFDHLGETNWSLLVQAVDHWHEPAAALMRPFRELSDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q I+ +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|160875706|ref|YP_001555022.1| cupin 4 family protein [Shewanella baltica OS195]
gi|378708906|ref|YP_005273800.1| transcription factor jumonji jmjC domain-containing protein
[Shewanella baltica OS678]
gi|418024724|ref|ZP_12663706.1| Cupin, JmjC-type [Shewanella baltica OS625]
gi|160861228|gb|ABX49762.1| Cupin 4 family protein [Shewanella baltica OS195]
gi|315267895|gb|ADT94748.1| transcription factor jumonji jmjC domain-containing protein
[Shewanella baltica OS678]
gi|353536010|gb|EHC05570.1| Cupin, JmjC-type [Shewanella baltica OS625]
Length = 379
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 48/87 (55%), Gaps = 9/87 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGT 164
++ TP + G PH D+ + FI+Q EG+++W V P+ R +PN E
Sbjct: 113 SFATP--SGGVGPHVDNYDVFIIQGEGERRWTVGDNTPQQR-----RGGNPNSPLVEDFI 165
Query: 165 PILTVTLEPGDLLYLPRGYIHQASTVT 191
PI+ V LE GD+LY+P G+ H T+T
Sbjct: 166 PIIDVVLEKGDMLYIPPGFPHCGETLT 192
>gi|440230538|ref|YP_007344331.1| hypothetical protein D781_1857 [Serratia marcescens FGI94]
gi|440052243|gb|AGB82146.1| hypothetical protein D781_1857 [Serratia marcescens FGI94]
Length = 373
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 93/206 (45%), Gaps = 42/206 (20%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----VVSHQDGRWQVAHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E ++P +QL + + +L F
Sbjct: 71 YDHLGENNWSLLVQAV-DHWHEPSG-------ALMRPFRQLPDWRMDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP---RYSSPNFSQEEIGTPI 166
P G PH D + FI+Q G+++W+V E +P Y P+ Q E I
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGTGRRRWRV------GEKVPMKQHYPHPDLLQVEPFDAI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTN 192
+ +EPGD+LY+P G+ H+ + N
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYALEN 194
>gi|238783793|ref|ZP_04627812.1| hypothetical protein yberc0001_24710 [Yersinia bercovieri ATCC
43970]
gi|238715344|gb|EEQ07337.1| hypothetical protein yberc0001_24710 [Yersinia bercovieri ATCC
43970]
Length = 373
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 83/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L D L +E + + + +EDG+ Q G
Sbjct: 16 YWQKRPVILKRGFKNFIDPLSP-----DELAGLAMENEVDSRLVSHEDGRWQV----GHG 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESFDHLGETNWSLLVQAVDHWHEPAAALMRPFRALSDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q I+ +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVAPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|62179750|ref|YP_216167.1| hypothetical protein SC1180 [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
gi|224584263|ref|YP_002638061.1| hypothetical protein SPC_2515 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|375114074|ref|ZP_09759244.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SCSA50]
gi|62127383|gb|AAX65086.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
gi|224468790|gb|ACN46620.1| hypothetical protein SPC_2515 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|322714220|gb|EFZ05791.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SCSA50]
Length = 383
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 26 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 77 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGAGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 193 PGFPHEGYALEN 204
>gi|422971067|ref|ZP_16974579.1| hypothetical protein ESRG_01213 [Escherichia coli TA124]
gi|371598932|gb|EHN87723.1| hypothetical protein ESRG_01213 [Escherichia coli TA124]
Length = 373
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 14/191 (7%)
Query: 3 WEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRAL 62
W+K P+ + R +NN I D+ L +E + + + ++DG+ Q
Sbjct: 17 WQKRPVVLKRG----FNNFIDPISPDE-LAGLAMESEVDSRLVSHQDGKWQV-----SHG 66
Query: 63 PQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
P +DH E S+ + +P L +EL + G PH
Sbjct: 67 PFESYDHLGETNWSLLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHL 126
Query: 122 DDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPR 181
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 127 DQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIPP 183
Query: 182 GYIHQASTVTN 192
G+ H+ + N
Sbjct: 184 GFPHEGYALEN 194
>gi|383933981|ref|ZP_09987424.1| hypothetical protein RNAN_0483 [Rheinheimera nanhaiensis E407-8]
gi|383704980|dbj|GAB57515.1| hypothetical protein RNAN_0483 [Rheinheimera nanhaiensis E407-8]
Length = 384
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 99/220 (45%), Gaps = 30/220 (13%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W+++P+ + K + + L T+ ++ + D +E + +I D Q + +
Sbjct: 17 DYWQQKPVLLKGGFKQFQDPL-TADELAGLALDENVESRVVSNINQQWDMQSGPFE-DFA 74
Query: 61 ALPQIVW-------DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPT 113
A + W DH+ + L+P +I N + +L F A
Sbjct: 75 AFGEQNWTLLVQAVDHWHPPAAT-LLDPFRFIP-----NWRIDDLMVSFSTAGG------ 122
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G PH D + FI+Q EGK+ W+V +P + + P P Q + V E
Sbjct: 123 --GVGPHLDQYDVFIIQGEGKRHWRVGMPDNSLQQLCPH---PRLLQVSPFNACIDVITE 177
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
PGD+LY+P G H+ +++ E+SL+ ++ ++ A DLL
Sbjct: 178 PGDILYIPPGCPHEGTSL--ENSLNYSVG-FRAPAQKDLL 214
>gi|188534104|ref|YP_001907901.1| hypothetical protein ETA_19720 [Erwinia tasmaniensis Et1/99]
gi|188029146|emb|CAO97018.1| Conserved hypothetical protein YcfD [Erwinia tasmaniensis Et1/99]
Length = 382
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/192 (22%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q G
Sbjct: 23 YWQKRPVVLKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 73
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + + V G PH
Sbjct: 74 -PFESYDHLGENNWSLLVQAVDHWHEPSAALMQPFRAIPDWRVDDLMVSFSVPGGGVGPH 132
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 133 FDQYDVFIIQGTGRRRWRVGEKLEMKQHCPH---PDLLQVDPFDAIIDEEMEPGDILYIP 189
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 190 PGFPHEGYSLEN 201
>gi|417531601|ref|ZP_12186274.1| hypothetical protein LTSEURB_2855 [Salmonella enterica subsp.
enterica serovar Urbana str. R8-2977]
gi|353664588|gb|EHD02966.1| hypothetical protein LTSEURB_2855 [Salmonella enterica subsp.
enterica serovar Urbana str. R8-2977]
Length = 362
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 26 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 77 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 193 PGFPHEGYALEN 204
>gi|168239136|ref|ZP_02664194.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. SL480]
gi|194736752|ref|YP_002114236.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. CVM19633]
gi|194712254|gb|ACF91475.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. CVM19633]
gi|197288150|gb|EDY27537.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. SL480]
Length = 373
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|56413785|ref|YP_150860.1| hypothetical protein SPA1621 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|161614536|ref|YP_001588501.1| hypothetical protein SPAB_02285 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|167553804|ref|ZP_02347549.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Saintpaul str. SARA29]
gi|168233111|ref|ZP_02658169.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Kentucky str. CDC 191]
gi|168467191|ref|ZP_02701033.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. SL317]
gi|168822336|ref|ZP_02834336.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Weltevreden str. HI_N05-537]
gi|194444751|ref|YP_002040485.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. SL254]
gi|194472548|ref|ZP_03078532.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Kentucky str. CVM29188]
gi|197248001|ref|YP_002146812.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Agona str. SL483]
gi|197362708|ref|YP_002142345.1| hypothetical protein SSPA1506 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gi|200389594|ref|ZP_03216205.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Virchow str. SL491]
gi|205353037|ref|YP_002226838.1| hypothetical protein SG1892 [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|207857264|ref|YP_002243915.1| hypothetical protein SEN1820 [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|238910951|ref|ZP_04654788.1| hypothetical protein SentesTe_07427 [Salmonella enterica subsp.
enterica serovar Tennessee str. CDC07-0191]
gi|375001687|ref|ZP_09726027.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Infantis str. SARB27]
gi|375123866|ref|ZP_09769030.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|378954770|ref|YP_005212257.1| hypothetical protein SPUL_1033 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|409250502|ref|YP_006886312.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
serovar Weltevreden str. 2007-60-3289-1]
gi|416507377|ref|ZP_11735325.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. SARB31]
gi|416523489|ref|ZP_11741166.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. ATCC BAA710]
gi|416528531|ref|ZP_11743981.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. LQC 10]
gi|416535577|ref|ZP_11747831.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. SARB30]
gi|416553883|ref|ZP_11757911.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 29N]
gi|416558001|ref|ZP_11759944.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 42N]
gi|416571633|ref|ZP_11766867.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 4441 H]
gi|417463691|ref|ZP_12164744.1| hypothetical protein LTSEMON_2720 [Salmonella enterica subsp.
enterica serovar Montevideo str. S5-403]
gi|418761292|ref|ZP_13317437.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 35185]
gi|418768380|ref|ZP_13324430.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 35199]
gi|418769788|ref|ZP_13325815.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21539]
gi|418775972|ref|ZP_13331921.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 33953]
gi|418780314|ref|ZP_13336203.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 35188]
gi|418786493|ref|ZP_13342308.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21559]
gi|418789828|ref|ZP_13345614.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19447]
gi|418795683|ref|ZP_13351384.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19449]
gi|418798421|ref|ZP_13354098.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19567]
gi|418802448|ref|ZP_13358075.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 35202]
gi|418808769|ref|ZP_13364322.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21550]
gi|418812925|ref|ZP_13368446.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22513]
gi|418816992|ref|ZP_13372480.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21538]
gi|418820434|ref|ZP_13375867.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22425]
gi|418827978|ref|ZP_13383071.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22462]
gi|418832862|ref|ZP_13387796.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM N18486]
gi|418836321|ref|ZP_13391208.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM N1543]
gi|418839569|ref|ZP_13394403.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21554]
gi|418848123|ref|ZP_13402862.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 37978]
gi|418855525|ref|ZP_13410181.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19593]
gi|419788388|ref|ZP_14314075.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. Levine 1]
gi|419791096|ref|ZP_14316751.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. Levine 15]
gi|421358278|ref|ZP_15808576.1| hypothetical protein SEEE3139_09514 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|421364457|ref|ZP_15814689.1| hypothetical protein SEEE0166_17642 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|421366754|ref|ZP_15816956.1| hypothetical protein SEEE0631_06178 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|421373424|ref|ZP_15823564.1| hypothetical protein SEEE0424_17039 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|421377191|ref|ZP_15827290.1| hypothetical protein SEEE3076_13193 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|421381690|ref|ZP_15831745.1| hypothetical protein SEEE4917_12943 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|421385370|ref|ZP_15835392.1| hypothetical protein SEEE6622_08759 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|421390302|ref|ZP_15840277.1| hypothetical protein SEEE6670_10822 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|421393562|ref|ZP_15843506.1| hypothetical protein SEEE6426_04514 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|421398392|ref|ZP_15848300.1| hypothetical protein SEEE6437_06656 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|421404204|ref|ZP_15854048.1| hypothetical protein SEEE7246_13130 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|421409715|ref|ZP_15859505.1| hypothetical protein SEEE7250_18232 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|421413438|ref|ZP_15863192.1| hypothetical protein SEEE1427_14151 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|421418750|ref|ZP_15868451.1| hypothetical protein SEEE2659_18236 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|421422426|ref|ZP_15872094.1| hypothetical protein SEEE1757_14019 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|421426337|ref|ZP_15875965.1| hypothetical protein SEEE5101_11002 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|421432912|ref|ZP_15882480.1| hypothetical protein SEEE8B1_21392 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|421434917|ref|ZP_15884463.1| hypothetical protein SEEE5518_08200 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|421442191|ref|ZP_15891651.1| hypothetical protein SEEE1618_22104 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|421444482|ref|ZP_15893912.1| hypothetical protein SEEE3079_10567 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|421448213|ref|ZP_15897608.1| hypothetical protein SEEE6482_06661 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|436587485|ref|ZP_20511824.1| hypothetical protein SEE22704_00397 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|436776523|ref|ZP_20521034.1| hypothetical protein SEE30663_23300 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|436798198|ref|ZP_20523343.1| hypothetical protein SEECHS44_08640 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|436809177|ref|ZP_20528557.1| hypothetical protein SEEE1882_12114 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|436815068|ref|ZP_20532619.1| hypothetical protein SEEE1884_09778 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|436844491|ref|ZP_20538249.1| hypothetical protein SEEE1594_15488 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|436854178|ref|ZP_20543812.1| hypothetical protein SEEE1566_20799 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|436857424|ref|ZP_20545944.1| hypothetical protein SEEE1580_08895 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|436864597|ref|ZP_20550564.1| hypothetical protein SEEE1543_09680 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|436873839|ref|ZP_20556563.1| hypothetical protein SEEE1441_17537 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|436877963|ref|ZP_20558818.1| hypothetical protein SEEE1810_06222 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|436888496|ref|ZP_20564825.1| hypothetical protein SEEE1558_13819 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|436895721|ref|ZP_20568477.1| hypothetical protein SEEE1018_09342 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|436901603|ref|ZP_20572513.1| hypothetical protein SEEE1010_07154 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|436912358|ref|ZP_20578187.1| hypothetical protein SEEE1729_13290 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|436922290|ref|ZP_20584515.1| hypothetical protein SEEE0895_22485 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|436926972|ref|ZP_20586798.1| hypothetical protein SEEE0899_11044 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|436936309|ref|ZP_20591749.1| hypothetical protein SEEE1457_13351 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|436943499|ref|ZP_20596445.1| hypothetical protein SEEE1747_14492 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|436951013|ref|ZP_20600068.1| hypothetical protein SEEE0968_09924 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|436961662|ref|ZP_20605036.1| hypothetical protein SEEE1444_12165 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|436970744|ref|ZP_20609137.1| hypothetical protein SEEE1445_10116 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|436982094|ref|ZP_20613590.1| hypothetical protein SEEE1559_10059 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|436994733|ref|ZP_20619001.1| hypothetical protein SEEE1565_14614 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|437001772|ref|ZP_20621051.1| hypothetical protein SEEE1808_02269 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|437020036|ref|ZP_20627187.1| hypothetical protein SEEE1811_10508 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|437033905|ref|ZP_20632789.1| hypothetical protein SEEE0956_16006 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|437045604|ref|ZP_20637902.1| hypothetical protein SEEE1455_19061 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|437053817|ref|ZP_20642616.1| hypothetical protein SEEE1575_20271 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|437058829|ref|ZP_20645676.1| hypothetical protein SEEE1725_13124 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|437070348|ref|ZP_20651526.1| hypothetical protein SEEE1745_19933 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|437076518|ref|ZP_20654881.1| hypothetical protein SEEE1791_14002 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|437081119|ref|ZP_20657571.1| hypothetical protein SEEE1795_04921 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|437091718|ref|ZP_20663318.1| hypothetical protein SEEE6709_11442 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|437101930|ref|ZP_20666379.1| hypothetical protein SEEE9058_03984 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|437122775|ref|ZP_20672579.1| hypothetical protein SEEE0816_12700 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|437131123|ref|ZP_20677253.1| hypothetical protein SEEE0819_13450 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|437138875|ref|ZP_20681357.1| hypothetical protein SEEE3072_11367 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|437145730|ref|ZP_20685637.1| hypothetical protein SEEE3089_10142 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|437156765|ref|ZP_20692301.1| hypothetical protein SEEE9163_21092 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|437159232|ref|ZP_20693746.1| hypothetical protein SEEE151_05509 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|437166103|ref|ZP_20697888.1| hypothetical protein SEEEN202_03846 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|437178744|ref|ZP_20704862.1| hypothetical protein SEEE3991_16563 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|437183878|ref|ZP_20707950.1| hypothetical protein SEEE3618_09565 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|437258707|ref|ZP_20716627.1| hypothetical protein SEEE2490_04439 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|437268276|ref|ZP_20721746.1| hypothetical protein SEEEL909_07808 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|437277114|ref|ZP_20726633.1| hypothetical protein SEEEL913_09670 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|437286194|ref|ZP_20729974.1| hypothetical protein SEEE4941_03980 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|437312193|ref|ZP_20736301.1| hypothetical protein SEEE7015_13430 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|437328129|ref|ZP_20740911.1| hypothetical protein SEEE7927_13823 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|437343646|ref|ZP_20745814.1| hypothetical protein SEEECHS4_15890 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|437404138|ref|ZP_20751996.1| hypothetical protein SEEE2217_01595 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|437443973|ref|ZP_20758139.1| hypothetical protein SEEE4018_09992 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|437460569|ref|ZP_20761523.1| hypothetical protein SEEE6211_04127 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|437481372|ref|ZP_20768904.1| hypothetical protein SEEE4441_18866 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|437494610|ref|ZP_20772553.1| hypothetical protein SEEE4647_14653 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|437503911|ref|ZP_20774925.1| hypothetical protein SEEE9845_03933 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|437524523|ref|ZP_20779437.1| hypothetical protein SEEE9317_03971 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|437561003|ref|ZP_20786287.1| hypothetical protein SEEE0116_15890 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|437577657|ref|ZP_20791006.1| hypothetical protein SEEE1117_16729 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|437586367|ref|ZP_20793328.1| hypothetical protein SEEE1392_05704 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|437601754|ref|ZP_20797986.1| hypothetical protein SEEE0268_06478 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|437612736|ref|ZP_20801362.1| hypothetical protein SEEE0316_00622 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|437627645|ref|ZP_20805920.1| hypothetical protein SEEE0436_00804 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|437658116|ref|ZP_20811447.1| hypothetical protein SEEE1319_05348 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|437669493|ref|ZP_20815525.1| hypothetical protein SEEE4481_03073 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|437688553|ref|ZP_20819861.1| hypothetical protein SEEE6297_01857 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|437709427|ref|ZP_20826095.1| hypothetical protein SEEE4220_10855 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|437728171|ref|ZP_20830437.1| hypothetical protein SEEE1616_09635 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|437797953|ref|ZP_20837734.1| hypothetical protein SEEE2651_24346 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|437812427|ref|ZP_20841539.1| hypothetical protein SEEE3944_18708 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|437987534|ref|ZP_20853615.1| hypothetical protein SEEE5646_06793 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
gi|438087107|ref|ZP_20859254.1| hypothetical protein SEEE2625_08557 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|438099794|ref|ZP_20863538.1| hypothetical protein SEEE1976_07359 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|438110668|ref|ZP_20868066.1| hypothetical protein SEEE3407_07536 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|438144679|ref|ZP_20875670.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Pullorum str. ATCC 9120]
gi|440761753|ref|ZP_20940822.1| hypothetical protein F434_02369 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|440767575|ref|ZP_20946551.1| hypothetical protein F514_07997 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|440774024|ref|ZP_20952912.1| hypothetical protein F515_16533 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|445134614|ref|ZP_21382999.1| hypothetical protein SEEG9184_003014 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|445169176|ref|ZP_21395115.1| hypothetical protein SEE8A_002740 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|445186758|ref|ZP_21399359.1| hypothetical protein SE20037_07109 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|445228796|ref|ZP_21404833.1| hypothetical protein SEE10_016120 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|445353184|ref|ZP_21421076.1| hypothetical protein SEE13_020245 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|445363821|ref|ZP_21424744.1| hypothetical protein SEE23_012930 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|56128042|gb|AAV77548.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|161363900|gb|ABX67668.1| hypothetical protein SPAB_02285 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|194403414|gb|ACF63636.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. SL254]
gi|194458912|gb|EDX47751.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Kentucky str. CVM29188]
gi|195630396|gb|EDX49022.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. SL317]
gi|197094185|emb|CAR59689.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gi|197211704|gb|ACH49101.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Agona str. SL483]
gi|199602039|gb|EDZ00585.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Virchow str. SL491]
gi|205272818|emb|CAR37744.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|205321844|gb|EDZ09683.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Saintpaul str. SARA29]
gi|205332730|gb|EDZ19494.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Kentucky str. CDC 191]
gi|205341198|gb|EDZ27962.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Weltevreden str. HI_N05-537]
gi|206709067|emb|CAR33400.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|320086330|emb|CBY96103.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
serovar Weltevreden str. 2007-60-3289-1]
gi|326628116|gb|EGE34459.1| Uncharacterized protein ycfD [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|353076375|gb|EHB42135.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Infantis str. SARB27]
gi|353630978|gb|EHC78382.1| hypothetical protein LTSEMON_2720 [Salmonella enterica subsp.
enterica serovar Montevideo str. S5-403]
gi|357205381|gb|AET53427.1| hypothetical protein SPUL_1033 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|363550225|gb|EHL34554.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. ATCC BAA710]
gi|363553387|gb|EHL37635.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. SARB31]
gi|363553848|gb|EHL38094.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. LQC 10]
gi|363562069|gb|EHL46175.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 29N]
gi|363565785|gb|EHL49809.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. SARB30]
gi|363574163|gb|EHL58036.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 4441 H]
gi|363577568|gb|EHL61388.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 42N]
gi|392617572|gb|EIW99993.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. Levine 1]
gi|392621318|gb|EIX03680.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. Levine 15]
gi|392731650|gb|EIZ88874.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 35199]
gi|392739234|gb|EIZ96373.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21539]
gi|392740910|gb|EIZ98025.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 35185]
gi|392746605|gb|EJA03611.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 33953]
gi|392748163|gb|EJA05153.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21559]
gi|392749364|gb|EJA06341.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 35188]
gi|392758211|gb|EJA15086.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19449]
gi|392760173|gb|EJA17013.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19447]
gi|392767078|gb|EJA23850.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19567]
gi|392774151|gb|EJA30846.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22513]
gi|392775452|gb|EJA32144.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21550]
gi|392777461|gb|EJA34144.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 35202]
gi|392789160|gb|EJA45680.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21538]
gi|392792703|gb|EJA49157.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22425]
gi|392795311|gb|EJA51684.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22462]
gi|392796932|gb|EJA53260.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM N18486]
gi|392802147|gb|EJA58367.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM N1543]
gi|392812446|gb|EJA68435.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21554]
gi|392821583|gb|EJA77407.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19593]
gi|392823435|gb|EJA79231.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 37978]
gi|395983946|gb|EJH93136.1| hypothetical protein SEEE0166_17642 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|395988582|gb|EJH97738.1| hypothetical protein SEEE3139_09514 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|395989409|gb|EJH98543.1| hypothetical protein SEEE0631_06178 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|395996543|gb|EJI05588.1| hypothetical protein SEEE0424_17039 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|396000813|gb|EJI09827.1| hypothetical protein SEEE3076_13193 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|396001653|gb|EJI10665.1| hypothetical protein SEEE4917_12943 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|396014112|gb|EJI22998.1| hypothetical protein SEEE6670_10822 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|396016807|gb|EJI25674.1| hypothetical protein SEEE6622_08759 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|396017445|gb|EJI26310.1| hypothetical protein SEEE6426_04514 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|396025012|gb|EJI33796.1| hypothetical protein SEEE7250_18232 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|396027284|gb|EJI36048.1| hypothetical protein SEEE7246_13130 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|396031465|gb|EJI40192.1| hypothetical protein SEEE6437_06656 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|396038028|gb|EJI46672.1| hypothetical protein SEEE2659_18236 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|396040526|gb|EJI49150.1| hypothetical protein SEEE1427_14151 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|396041741|gb|EJI50364.1| hypothetical protein SEEE1757_14019 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|396049128|gb|EJI57671.1| hypothetical protein SEEE8B1_21392 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|396053844|gb|EJI62337.1| hypothetical protein SEEE5101_11002 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|396059298|gb|EJI67753.1| hypothetical protein SEEE5518_08200 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|396062868|gb|EJI71279.1| hypothetical protein SEEE1618_22104 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|396066913|gb|EJI75273.1| hypothetical protein SEEE3079_10567 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|396073813|gb|EJI82113.1| hypothetical protein SEEE6482_06661 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|434939014|gb|ELL45899.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Pullorum str. ATCC 9120]
gi|434960690|gb|ELL54048.1| hypothetical protein SEECHS44_08640 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|434961012|gb|ELL54340.1| hypothetical protein SEE30663_23300 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|434966994|gb|ELL59829.1| hypothetical protein SEEE1882_12114 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|434973184|gb|ELL65572.1| hypothetical protein SEEE1884_09778 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|434979077|gb|ELL71069.1| hypothetical protein SEEE1594_15488 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|434981527|gb|ELL73406.1| hypothetical protein SEE22704_00397 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|434982981|gb|ELL74789.1| hypothetical protein SEEE1566_20799 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|434989576|gb|ELL81126.1| hypothetical protein SEEE1580_08895 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|434995632|gb|ELL86948.1| hypothetical protein SEEE1543_09680 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|434998596|gb|ELL89817.1| hypothetical protein SEEE1441_17537 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|435007900|gb|ELL98727.1| hypothetical protein SEEE1810_06222 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|435010206|gb|ELM00992.1| hypothetical protein SEEE1558_13819 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|435015610|gb|ELM06136.1| hypothetical protein SEEE1018_09342 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|435021037|gb|ELM11426.1| hypothetical protein SEEE1010_07154 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|435024608|gb|ELM14814.1| hypothetical protein SEEE0895_22485 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|435026603|gb|ELM16734.1| hypothetical protein SEEE1729_13290 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|435036813|gb|ELM26632.1| hypothetical protein SEEE0899_11044 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|435039147|gb|ELM28928.1| hypothetical protein SEEE1457_13351 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|435043698|gb|ELM33415.1| hypothetical protein SEEE1747_14492 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|435050801|gb|ELM40305.1| hypothetical protein SEEE1444_12165 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|435051480|gb|ELM40982.1| hypothetical protein SEEE0968_09924 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|435057033|gb|ELM46402.1| hypothetical protein SEEE1445_10116 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|435063964|gb|ELM53111.1| hypothetical protein SEEE1565_14614 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|435066348|gb|ELM55436.1| hypothetical protein SEEE1559_10059 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|435074643|gb|ELM63467.1| hypothetical protein SEEE1808_02269 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|435076795|gb|ELM65577.1| hypothetical protein SEEE0956_16006 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|435079673|gb|ELM68368.1| hypothetical protein SEEE1811_10508 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|435079892|gb|ELM68586.1| hypothetical protein SEEE1455_19061 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|435088831|gb|ELM77286.1| hypothetical protein SEEE1575_20271 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|435090319|gb|ELM78721.1| hypothetical protein SEEE1745_19933 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|435094642|gb|ELM82981.1| hypothetical protein SEEE1725_13124 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|435105815|gb|ELM93852.1| hypothetical protein SEEE1791_14002 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|435111738|gb|ELM99626.1| hypothetical protein SEEE1795_04921 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|435112624|gb|ELN00489.1| hypothetical protein SEEE6709_11442 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|435122831|gb|ELN10337.1| hypothetical protein SEEE0816_12700 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|435123909|gb|ELN11400.1| hypothetical protein SEEE9058_03984 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|435125097|gb|ELN12553.1| hypothetical protein SEEE0819_13450 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|435132397|gb|ELN19595.1| hypothetical protein SEEE3072_11367 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|435135372|gb|ELN22481.1| hypothetical protein SEEE9163_21092 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|435137191|gb|ELN24262.1| hypothetical protein SEEE3089_10142 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|435150165|gb|ELN36849.1| hypothetical protein SEEE151_05509 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|435152161|gb|ELN38791.1| hypothetical protein SEEE3991_16563 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|435153460|gb|ELN40068.1| hypothetical protein SEEEN202_03846 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|435162077|gb|ELN48277.1| hypothetical protein SEEE3618_09565 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|435165954|gb|ELN51956.1| hypothetical protein SEEE2490_04439 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|435173300|gb|ELN58810.1| hypothetical protein SEEEL913_09670 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|435174455|gb|ELN59897.1| hypothetical protein SEEEL909_07808 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|435183325|gb|ELN68300.1| hypothetical protein SEEE7015_13430 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|435184721|gb|ELN69642.1| hypothetical protein SEEE4941_03980 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|435191363|gb|ELN75929.1| hypothetical protein SEEE7927_13823 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|435191423|gb|ELN75980.1| hypothetical protein SEEECHS4_15890 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|435205556|gb|ELN89145.1| hypothetical protein SEEE2217_01595 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|435210673|gb|ELN93911.1| hypothetical protein SEEE4018_09992 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|435217787|gb|ELO00202.1| hypothetical protein SEEE4441_18866 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|435220861|gb|ELO03135.1| hypothetical protein SEEE6211_04127 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|435227127|gb|ELO08650.1| hypothetical protein SEEE4647_14653 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|435236438|gb|ELO17173.1| hypothetical protein SEEE9845_03933 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|435238329|gb|ELO18978.1| hypothetical protein SEEE0116_15890 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|435242599|gb|ELO22903.1| hypothetical protein SEEE1117_16729 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|435248883|gb|ELO28731.1| hypothetical protein SEEE9317_03971 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|435260325|gb|ELO39536.1| hypothetical protein SEEE1392_05704 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|435260766|gb|ELO39956.1| hypothetical protein SEEE0268_06478 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|435265309|gb|ELO44190.1| hypothetical protein SEEE0316_00622 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|435270811|gb|ELO49296.1| hypothetical protein SEEE1319_05348 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|435282100|gb|ELO59736.1| hypothetical protein SEEE4481_03073 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|435282667|gb|ELO60281.1| hypothetical protein SEEE6297_01857 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|435285798|gb|ELO63162.1| hypothetical protein SEEE0436_00804 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|435288613|gb|ELO65621.1| hypothetical protein SEEE2651_24346 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|435289548|gb|ELO66501.1| hypothetical protein SEEE4220_10855 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|435290743|gb|ELO67643.1| hypothetical protein SEEE1616_09635 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|435297425|gb|ELO73707.1| hypothetical protein SEEE3944_18708 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|435318315|gb|ELO91256.1| hypothetical protein SEEE2625_08557 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|435325392|gb|ELO97257.1| hypothetical protein SEEE1976_07359 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|435331875|gb|ELP02973.1| hypothetical protein SEEE3407_07536 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|435337265|gb|ELP06901.1| hypothetical protein SEEE5646_06793 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
gi|436413542|gb|ELP11475.1| hypothetical protein F515_16533 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|436419484|gb|ELP17359.1| hypothetical protein F514_07997 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|436424533|gb|ELP22304.1| hypothetical protein F434_02369 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|444846644|gb|ELX71804.1| hypothetical protein SEEG9184_003014 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|444863684|gb|ELX88501.1| hypothetical protein SEE8A_002740 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|444865968|gb|ELX90724.1| hypothetical protein SEE10_016120 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|444869388|gb|ELX93976.1| hypothetical protein SE20037_07109 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|444873361|gb|ELX97662.1| hypothetical protein SEE13_020245 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|444883532|gb|ELY07411.1| hypothetical protein SEE23_012930 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
Length = 373
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|418845546|ref|ZP_13400330.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19443]
gi|392812919|gb|EJA68895.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19443]
Length = 373
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|417518865|ref|ZP_12181142.1| hypothetical protein LTSEUGA_2826 [Salmonella enterica subsp.
enterica serovar Uganda str. R8-3404]
gi|353648187|gb|EHC91141.1| hypothetical protein LTSEUGA_2826 [Salmonella enterica subsp.
enterica serovar Uganda str. R8-3404]
Length = 352
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|417539905|ref|ZP_12192078.1| hypothetical protein LTSEWAN_3165 [Salmonella enterica subsp.
enterica serovar Wandsworth str. A4-580]
gi|353663648|gb|EHD02277.1| hypothetical protein LTSEWAN_3165 [Salmonella enterica subsp.
enterica serovar Wandsworth str. A4-580]
Length = 383
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 26 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 77 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 193 PGFPHEGYALEN 204
>gi|123442019|ref|YP_001006002.1| hypothetical protein YE1716 [Yersinia enterocolitica subsp.
enterocolitica 8081]
gi|122088980|emb|CAL11791.1| conserved hypothetical protein [Yersinia enterocolitica subsp.
enterocolitica 8081]
Length = 373
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 93/203 (45%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + L + ++ + +N ++ + + +EDG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWQVAHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMVSF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194
>gi|445292803|ref|ZP_21411158.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Enteritidis str. 436]
gi|444883200|gb|ELY07099.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Enteritidis str. 436]
Length = 248
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|197264610|ref|ZP_03164684.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Saintpaul str. SARA23]
gi|197242865|gb|EDY25485.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Saintpaul str. SARA23]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|417358629|ref|ZP_12133487.1| hypothetical protein LTSEGIV_2727, partial [Salmonella enterica
subsp. enterica serovar Give str. S5-487]
gi|417511486|ref|ZP_12176093.1| hypothetical protein LTSESEN_3098, partial [Salmonella enterica
subsp. enterica serovar Senftenberg str. A4-543]
gi|353590763|gb|EHC49201.1| hypothetical protein LTSEGIV_2727, partial [Salmonella enterica
subsp. enterica serovar Give str. S5-487]
gi|353642904|gb|EHC87229.1| hypothetical protein LTSESEN_3098, partial [Salmonella enterica
subsp. enterica serovar Senftenberg str. A4-543]
Length = 351
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|336311496|ref|ZP_08566459.1| putative cytoplasmic protein [Shewanella sp. HN-41]
gi|335864989|gb|EGM70049.1| putative cytoplasmic protein [Shewanella sp. HN-41]
Length = 433
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 95/218 (43%), Gaps = 38/218 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K+PL I R+ ++ +L++ +++ + D ++E ++ +Y+ + +
Sbjct: 64 YWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR-----VYQQAGQWHAEFGP-- 115
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVGAN---------TYQ 109
+D Y E L + + +Q LN L + L CF +Y
Sbjct: 116 -----FDSYEE------LGERDWTLIVQALNNWLPDAEALIQCFDFIPRWRFDDVMVSYA 164
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
TP G PH D + FI Q G+++W+V E+ + P E PI+
Sbjct: 165 TP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHREFA---AHPALLHTEAFEPIIDT 219
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
L PGD+LY+P G+ H +T E SL ++ +A
Sbjct: 220 ELLPGDILYIPPGFPHDG--ITLEESLSFSVGYRTASA 255
>gi|204930767|ref|ZP_03221640.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Javiana str. GA_MM04042433]
gi|416422178|ref|ZP_11690082.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 315996572]
gi|416431269|ref|ZP_11695492.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 495297-1]
gi|416441324|ref|ZP_11701536.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 495297-3]
gi|416444169|ref|ZP_11703523.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 495297-4]
gi|416451957|ref|ZP_11708624.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 515920-1]
gi|416459031|ref|ZP_11713540.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 515920-2]
gi|416467760|ref|ZP_11717609.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 531954]
gi|416479512|ref|ZP_11722321.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. NC_MB110209-0054]
gi|416484572|ref|ZP_11724260.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. OH_2009072675]
gi|416497407|ref|ZP_11729675.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CASC_09SCPH15965]
gi|416541577|ref|ZP_11751064.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 19N]
gi|416576289|ref|ZP_11768976.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 81038-01]
gi|416585917|ref|ZP_11775217.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. MD_MDA09249507]
gi|416591001|ref|ZP_11778176.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 414877]
gi|416599038|ref|ZP_11783389.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 366867]
gi|416607883|ref|ZP_11788877.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 413180]
gi|416611150|ref|ZP_11790580.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 446600]
gi|416623335|ref|ZP_11797355.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 609458-1]
gi|416629099|ref|ZP_11799863.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 556150-1]
gi|416639453|ref|ZP_11804522.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 609460]
gi|416650751|ref|ZP_11810516.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 507440-20]
gi|416665997|ref|ZP_11817148.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. MB101509-0077]
gi|416685681|ref|ZP_11825039.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. MB102109-0047]
gi|416707244|ref|ZP_11832342.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. MB111609-0052]
gi|416714542|ref|ZP_11837860.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 2009083312]
gi|416717025|ref|ZP_11839306.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 2009085258]
gi|416725223|ref|ZP_11845593.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 315731156]
gi|416733374|ref|ZP_11850420.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2009159199]
gi|416736079|ref|ZP_11851796.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008282]
gi|416750056|ref|ZP_11859546.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008283]
gi|416759823|ref|ZP_11864631.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008284]
gi|416761883|ref|ZP_11865933.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008285]
gi|416767266|ref|ZP_11869763.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008287]
gi|418485944|ref|ZP_13054926.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 80959-06]
gi|418490970|ref|ZP_13057500.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035278]
gi|418495421|ref|ZP_13061863.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035318]
gi|418499285|ref|ZP_13065694.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035320]
gi|418503164|ref|ZP_13069533.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035321]
gi|418506425|ref|ZP_13072757.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035327]
gi|418512653|ref|ZP_13078893.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Pomona str. ATCC 10729]
gi|418527014|ref|ZP_13092971.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008286]
gi|452120602|ref|YP_007470850.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Javiana str. CFSAN001992]
gi|204320226|gb|EDZ05430.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Javiana str. GA_MM04042433]
gi|322616605|gb|EFY13514.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 315996572]
gi|322619880|gb|EFY16754.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 495297-1]
gi|322622448|gb|EFY19293.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 495297-3]
gi|322629434|gb|EFY26211.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 495297-4]
gi|322632930|gb|EFY29673.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 515920-1]
gi|322636825|gb|EFY33528.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 515920-2]
gi|322641375|gb|EFY38014.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 531954]
gi|322645140|gb|EFY41669.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. NC_MB110209-0054]
gi|322652303|gb|EFY48659.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. OH_2009072675]
gi|322655655|gb|EFY51957.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CASC_09SCPH15965]
gi|322660960|gb|EFY57190.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 19N]
gi|322665454|gb|EFY61642.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 81038-01]
gi|322667569|gb|EFY63730.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. MD_MDA09249507]
gi|322673637|gb|EFY69739.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 414877]
gi|322677563|gb|EFY73627.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 366867]
gi|322679772|gb|EFY75811.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 413180]
gi|322687245|gb|EFY83217.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 446600]
gi|323193063|gb|EFZ78285.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 609458-1]
gi|323199440|gb|EFZ84533.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 556150-1]
gi|323203628|gb|EFZ88650.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 609460]
gi|323217805|gb|EGA02520.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. MB101509-0077]
gi|323218884|gb|EGA03395.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. MB102109-0047]
gi|323229608|gb|EGA13731.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. MB111609-0052]
gi|323232833|gb|EGA16929.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 2009083312]
gi|323240131|gb|EGA24175.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 2009085258]
gi|323242882|gb|EGA26903.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 315731156]
gi|323246798|gb|EGA30768.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2009159199]
gi|323254302|gb|EGA38119.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008282]
gi|323255568|gb|EGA39327.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008283]
gi|323259402|gb|EGA43038.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008284]
gi|323266998|gb|EGA50483.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008285]
gi|323272102|gb|EGA55516.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008287]
gi|366055834|gb|EHN20169.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 80959-06]
gi|366059277|gb|EHN23551.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035318]
gi|366063735|gb|EHN27947.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035278]
gi|366071820|gb|EHN35914.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035320]
gi|366074888|gb|EHN38950.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035321]
gi|366083145|gb|EHN47072.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. CT_02035327]
gi|366083481|gb|EHN47402.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Pomona str. ATCC 10729]
gi|366827633|gb|EHN54531.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. 507440-20]
gi|372204483|gb|EHP18010.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Montevideo str. IA_2010008286]
gi|451909606|gb|AGF81412.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Javiana str. CFSAN001992]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|420258884|ref|ZP_14761608.1| hypothetical protein YWA314_09054 [Yersinia enterocolitica subsp.
enterocolitica WA-314]
gi|404513670|gb|EKA27481.1| hypothetical protein YWA314_09054 [Yersinia enterocolitica subsp.
enterocolitica WA-314]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 93/203 (45%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + L + ++ + +N ++ + + +EDG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWQVAHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMVSF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194
>gi|418858539|ref|ZP_13413153.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19470]
gi|418863928|ref|ZP_13418464.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19536]
gi|392831777|gb|EJA87404.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19536]
gi|392832510|gb|EJA88130.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 19470]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTTALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|397168193|ref|ZP_10491631.1| cupin superfamily protein [Enterobacter radicincitans DSM 16656]
gi|396089728|gb|EJI87300.1| cupin superfamily protein [Enterobacter radicincitans DSM 16656]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 146/337 (43%), Gaps = 57/337 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + + DG+ Q G
Sbjct: 16 YWQKRPVVLKRGIKNFVDP-ISPDELAGLAMENEVDSR----LVSHLDGKWQV----GHG 66
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT------YQTPPTAQ 115
P +DH E + + Q A+L F T + P
Sbjct: 67 -PFQSYDHLGENNWSLLVQ---AVNHWHQPAAALMRPFRALPDWRTDDLMISFSVP--GG 120
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKTPLKQHCPH---PDLLQVDPFEAIIDEELEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
+LY+P G+ H+ ++ N SL+ ++ ++ + +L+ +G D +R L
Sbjct: 178 ILYIPPGFPHEGYSLEN--SLNYSVG-FRAPSGRELI----------SGFADYVLQREL- 223
Query: 236 IGYLRYAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVD-LDDGVDEMGKQLMH-- 289
G RY+ R P DI + + + L++++ L+ + +D + E Q H
Sbjct: 224 -GSQRYSDPDVPQRAHPADILPEEV---DKLREMMLNLINQPEQFNDWLGEFISQSRHEL 279
Query: 290 DALP--PVLSPEELQCSVFE-------NGLRMSQTGE 317
D P P P+E+ ++ + GLR+ + GE
Sbjct: 280 DVAPPEPPYQPDEIYDALQQGDKLVRLGGLRVLRIGE 316
>gi|198242002|ref|YP_002215906.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Dublin str. CT_02021853]
gi|445142397|ref|ZP_21386083.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Dublin str. SL1438]
gi|445158953|ref|ZP_21393237.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Dublin str. HWS51]
gi|197936518|gb|ACH73851.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Dublin str. CT_02021853]
gi|444845219|gb|ELX70431.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Dublin str. HWS51]
gi|444849822|gb|ELX74931.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Dublin str. SL1438]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|212556168|gb|ACJ28622.1| Transcription factor jumonji, jmjC [Shewanella piezotolerans WP3]
Length = 376
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 5/92 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G H D+ + FI+Q EG++ W V +Y + + P E PI
Sbjct: 113 SFATP--HGGVGAHIDNYDVFIIQGEGQRHWTV---GEKGQYAAKNNDPTTPLIEGFEPI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
+ V LE GD+LY+P GY HQ T+T S +
Sbjct: 168 IDVVLEKGDMLYIPPGYPHQGQTLTTAMSYSI 199
>gi|417475684|ref|ZP_12170439.1| hypothetical protein LTSERUB_3048, partial [Salmonella enterica
subsp. enterica serovar Rubislaw str. A4-653]
gi|353643512|gb|EHC87680.1| hypothetical protein LTSERUB_3048, partial [Salmonella enterica
subsp. enterica serovar Rubislaw str. A4-653]
Length = 337
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|437836341|ref|ZP_20845630.1| hypothetical protein SEEERB17_014183 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
gi|435299455|gb|ELO75599.1| hypothetical protein SEEERB17_014183 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|284007314|emb|CBA72661.1| conserved hypothetical protein [Arsenophonus nasoniae]
Length = 380
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P A G PH D + FI+Q G++ W+V + ++ P P+ Q E I
Sbjct: 119 SYSVP--AGGVGPHIDQYDVFIIQGLGRRHWRVGEKMTLKQHCPH---PDLLQVEPFAAI 173
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+ LEPGD+LY+P G+ H+ ++ E SL+ ++ T+
Sbjct: 174 IDEQLEPGDILYIPPGFPHEGYSI--ETSLNYSVGFRSPTS 212
>gi|417349643|ref|ZP_12128258.1| hypothetical protein SeGA_2719 [Salmonella enterica subsp. enterica
serovar Gaminara str. A4-567]
gi|353572329|gb|EHC36010.1| hypothetical protein SeGA_2719 [Salmonella enterica subsp. enterica
serovar Gaminara str. A4-567]
Length = 366
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 26 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 77 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 193 PGFPHEGYALEN 204
>gi|417373978|ref|ZP_12143873.1| hypothetical protein LTSEINV_3007, partial [Salmonella enterica
subsp. enterica serovar Inverness str. R8-3668]
gi|353600966|gb|EHC56706.1| hypothetical protein LTSEINV_3007, partial [Salmonella enterica
subsp. enterica serovar Inverness str. R8-3668]
Length = 243
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|417366084|ref|ZP_12138503.1| hypothetical protein LTSEHVI_2748 [Salmonella enterica subsp.
enterica serovar Hvittingfoss str. A4-620]
gi|353592862|gb|EHC50763.1| hypothetical protein LTSEHVI_2748 [Salmonella enterica subsp.
enterica serovar Hvittingfoss str. A4-620]
Length = 383
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 26 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 77 -PFESYDHLGESNWSLLVQAVNHWHEPAAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 193 PGFPHEGYALEN 204
>gi|437897671|ref|ZP_20849620.1| hypothetical protein SEEE5621_13520 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|435322351|gb|ELO94641.1| hypothetical protein SEEE5621_13520 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
Length = 290
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFSNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|168264258|ref|ZP_02686231.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Hadar str. RI_05P066]
gi|418868700|ref|ZP_13423141.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 4176]
gi|205347283|gb|EDZ33914.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Hadar str. RI_05P066]
gi|392837390|gb|EJA92960.1| cupin family protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 4176]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|238789087|ref|ZP_04632876.1| hypothetical protein yfred0001_42190 [Yersinia frederiksenii ATCC
33641]
gi|238722851|gb|EEQ14502.1| hypothetical protein yfred0001_42190 [Yersinia frederiksenii ATCC
33641]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 89/203 (43%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + L D L +E + + + +EDG Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDPLSP-----DELAGLAMENEVDSRLVSHEDGSWQVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMVSF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194
>gi|167624225|ref|YP_001674519.1| cupin 4 family protein [Shewanella halifaxensis HAW-EB4]
gi|167354247|gb|ABZ76860.1| Cupin 4 family protein [Shewanella halifaxensis HAW-EB4]
Length = 381
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 87/210 (41%), Gaps = 36/210 (17%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K+PL I + KN+ +L++ ++ + D ++E ++ E G+ Q
Sbjct: 16 EYWQKKPLVIRQGFKNF-QDLLSPDEMAGLACDEMVESRR----VYREKGEWQA------ 64
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLN---ASLQELFGCFVGAN---------TY 108
+ S L + +Q LN + ++L CF +Y
Sbjct: 65 --------EFGPFESYENLGETDWTLIVQALNNWVPAAEDLLKCFDFIPRWRLDDVMVSY 116
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
P G PH D + FI Q G+++W+V + + + P E PI+
Sbjct: 117 AVP--GGGVGPHIDLYDVFICQGSGRRRWRV---GDIGPHKEFAAHPALLHTEAFEPIID 171
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
V L PGD+LYLP G+ H T+ S V
Sbjct: 172 VELLPGDILYLPPGFPHDGVTLEPSMSFSV 201
>gi|417416112|ref|ZP_12159607.1| hypothetical protein LTSEMIS_2735, partial [Salmonella enterica
subsp. enterica serovar Mississippi str. A4-633]
gi|353620896|gb|EHC70868.1| hypothetical protein LTSEMIS_2735, partial [Salmonella enterica
subsp. enterica serovar Mississippi str. A4-633]
Length = 369
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 26 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 76
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 77 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 135
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 136 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 192
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 193 PGFPHEGYALEN 204
>gi|213422268|ref|ZP_03355334.1| hypothetical protein Salmonentericaenterica_32813 [Salmonella
enterica subsp. enterica serovar Typhi str. E01-6750]
Length = 300
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 1 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 51
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 52 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 110
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 111 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 167
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 168 PGFPHEGYALEN 179
>gi|289806227|ref|ZP_06536856.1| hypothetical protein Salmonellaentericaenterica_18037 [Salmonella
enterica subsp. enterica serovar Typhi str. AG3]
Length = 359
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|16081039|ref|NP_391867.1| hypothetical protein BSU39880 [Bacillus subtilis subsp. subtilis
str. 168]
gi|221311960|ref|ZP_03593807.1| hypothetical protein Bsubs1_21506 [Bacillus subtilis subsp.
subtilis str. 168]
gi|221316284|ref|ZP_03598089.1| hypothetical protein BsubsN3_21417 [Bacillus subtilis subsp.
subtilis str. NCIB 3610]
gi|221321197|ref|ZP_03602491.1| hypothetical protein BsubsJ_21360 [Bacillus subtilis subsp.
subtilis str. JH642]
gi|221325480|ref|ZP_03606774.1| hypothetical protein BsubsS_21516 [Bacillus subtilis subsp.
subtilis str. SMY]
gi|402778153|ref|YP_006632097.1| protein YxbC [Bacillus subtilis QB928]
gi|452913057|ref|ZP_21961685.1| hypothetical protein BS732_0971 [Bacillus subtilis MB73/2]
gi|1176977|sp|P46327.1|YXBC_BACSU RecName: Full=Uncharacterized protein YxbC
gi|904195|dbj|BAA21597.1| yxbC [Bacillus subtilis]
gi|2636534|emb|CAB16024.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
str. 168]
gi|402483332|gb|AFQ59841.1| YxbC [Bacillus subtilis QB928]
gi|407962835|dbj|BAM56075.1| hypothetical protein BEST7613_7144 [Bacillus subtilis BEST7613]
gi|407966847|dbj|BAM60086.1| hypothetical protein BEST7003_3885 [Bacillus subtilis BEST7003]
gi|452118085|gb|EME08479.1| hypothetical protein BS732_0971 [Bacillus subtilis MB73/2]
Length = 330
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 110/240 (45%), Gaps = 27/240 (11%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W +PL ++R + + ++ +K+ + +NV+ N + + + ++ + R
Sbjct: 24 EYWPVKPL-VARGEVERFTSIPGFEKVRTL--ENVLAIYNNPVMVVGDAVIEESEGITDR 80
Query: 61 AL--PQIVWDHYREGCSIRFLNPQTYI----KPLQQLNASLQELFGCFVGANTYQTPPTA 114
L P + Y +G ++ F +I + +++L A L+ G A Y
Sbjct: 81 FLVSPAEALEWYEKGAALEFDFTDLFIPQVRRWIEKLKAELRLPAGTSSKAIVYAAK-NG 139
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFSQEEI-----GTP-- 165
GF H+D I Q++G+K WK+ V + Y S + +++ G P
Sbjct: 140 GGFKAHFDAYTNLIFQIQGEKTWKLAKNENVSNPMQHYDLSEAPYYPDDLQSYWKGDPPK 199
Query: 166 -----ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKAL 220
V L PG +LYLPRG H ST +++ +L + I+ + + AW+DL+ A+ K L
Sbjct: 200 EDLPDAEIVNLTPGTMLYLPRGLWH--STKSDQATLALNIT-FGQPAWLDLMLAALRKKL 256
>gi|16760104|ref|NP_455721.1| hypothetical protein STY1269 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29142125|ref|NP_805467.1| hypothetical protein t1691 [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|213612643|ref|ZP_03370469.1| hypothetical protein SentesTyp_09119 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-2068]
gi|213864845|ref|ZP_03386964.1| hypothetical protein SentesT_33730 [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
gi|378959857|ref|YP_005217343.1| hypothetical protein STBHUCCB_17970 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
gi|25357094|pir||AF0646 conserved hypothetical protein STY1269 [imported] - Salmonella
enterica subsp. enterica serovar Typhi (strain CT18)
gi|16502398|emb|CAD08353.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi]
gi|29137754|gb|AAO69316.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|374353729|gb|AEZ45490.1| hypothetical protein STBHUCCB_17970 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
Length = 373
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|153000863|ref|YP_001366544.1| cupin 4 family protein [Shewanella baltica OS185]
gi|160875571|ref|YP_001554887.1| cupin 4 family protein [Shewanella baltica OS195]
gi|378708769|ref|YP_005273663.1| transcription factor jumonji jmjC domain-containing protein
[Shewanella baltica OS678]
gi|418025366|ref|ZP_12664345.1| Cupin, JmjC-type [Shewanella baltica OS625]
gi|151365481|gb|ABS08481.1| Cupin 4 family protein [Shewanella baltica OS185]
gi|160861093|gb|ABX49627.1| Cupin 4 family protein [Shewanella baltica OS195]
gi|315267758|gb|ADT94611.1| transcription factor jumonji jmjC domain-containing protein
[Shewanella baltica OS678]
gi|353535350|gb|EHC04913.1| Cupin, JmjC-type [Shewanella baltica OS625]
Length = 385
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 32/216 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K+PL I R+ ++ +L++ +++ + D ++E ++ + GQ Q
Sbjct: 16 EYWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR----VYQQAGQWQ-----AE 65
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN---------TYQTP 111
P +DH E + ++ L + L CF ++ TP
Sbjct: 66 FGPFDSYDHLGE------RDWTLIVQALNNWVPDAEALIQCFDFIPRWRLDDVMVSFATP 119
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH D + FI Q G+++W+V E+ + P E PI+ L
Sbjct: 120 --GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFDPIIDTEL 174
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
PGD+LY+P G+ H +T E SL ++ +A
Sbjct: 175 LPGDILYIPPGFPHDG--ITLEQSLSFSVGYRTASA 208
>gi|91793181|ref|YP_562832.1| cupin 4 [Shewanella denitrificans OS217]
gi|91715183|gb|ABE55109.1| Cupin 4 [Shewanella denitrificans OS217]
Length = 376
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI+Q EG+++W+V E+ R + P E PI
Sbjct: 113 SFATP--LGGVGPHIDNYDVFIIQGEGERRWRV---GDKGEHQRRNNDPMSPMVEDFEPI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
+ V L+ GD+LY+P G+ H T+T S V
Sbjct: 168 IDVVLKKGDVLYIPPGFPHSGETLTMAMSYSV 199
>gi|381404919|ref|ZP_09929603.1| hypothetical protein S7A_11740 [Pantoea sp. Sc1]
gi|380738118|gb|EIB99181.1| hypothetical protein S7A_11740 [Pantoea sp. Sc1]
Length = 374
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/192 (22%), Positives = 87/192 (45%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R K++ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFKDFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLGENNWSLLVQAVNHWHEPSAALMRPFRFLPDWRIDDLMVSFAVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|126174567|ref|YP_001050716.1| cupin 4 family protein [Shewanella baltica OS155]
gi|386341321|ref|YP_006037687.1| cupin [Shewanella baltica OS117]
gi|125997772|gb|ABN61847.1| Cupin 4 family protein [Shewanella baltica OS155]
gi|334863722|gb|AEH14193.1| Cupin 4 family protein [Shewanella baltica OS117]
Length = 385
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 32/216 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K+PL I R+ ++ +L++ +++ + D ++E ++ + GQ Q
Sbjct: 16 EYWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR----VYQQAGQWQA-----E 65
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN---------TYQTP 111
P +DH E + ++ L + L CF ++ TP
Sbjct: 66 FGPFDSYDHLGE------RDWTLIVQALNNWVPDAEALIQCFDFIPRWRLDDVMVSFATP 119
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH D + FI Q G+++W+V E+ + P E PI+ L
Sbjct: 120 --GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFDPIIDTEL 174
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
PGD+LY+P G+ H +T E SL ++ +A
Sbjct: 175 LPGDILYIPPGFPHDG--ITLEQSLSFSVGYRTASA 208
>gi|217973180|ref|YP_002357931.1| cupin [Shewanella baltica OS223]
gi|217498315|gb|ACK46508.1| Cupin 4 family protein [Shewanella baltica OS223]
Length = 385
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 32/216 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K+PL I R+ ++ +L++ +++ + D ++E ++ + GQ Q
Sbjct: 16 EYWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR----VYQQAGQWQA-----E 65
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN---------TYQTP 111
P +DH E + ++ L + L CF ++ TP
Sbjct: 66 FGPFDSYDHLGE------RDWTLIVQALNNWVPDAEALIQCFDFIPRWRLDDVMVSFATP 119
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH D + FI Q G+++W+V E+ + P E PI+ L
Sbjct: 120 --GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFEPIIDTEL 174
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
PGD+LY+P G+ H +T E SL ++ +A
Sbjct: 175 LPGDILYIPPGFPHDG--ITLEQSLSFSVGYRTASA 208
>gi|312172134|emb|CBX80391.1| Uncharacterized protein ycfD [Erwinia amylovora ATCC BAA-2158]
Length = 375
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/192 (22%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R K++ + I+ ++ + +N ++ + + ++DG+ Q G
Sbjct: 16 YWQKRPVVLKRGFKHFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + + V G PH
Sbjct: 67 -PFESYDHLGENNWSLLVQAVNHWHEPSAALMHPFRAIPDWRVDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKLEMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|372274110|ref|ZP_09510146.1| hypothetical protein PSL1_03385 [Pantoea sp. SL1_M5]
gi|390433535|ref|ZP_10222073.1| hypothetical protein PaggI_01830 [Pantoea agglomerans IG1]
Length = 374
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/197 (22%), Positives = 89/197 (45%), Gaps = 24/197 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN------TYQTPPTAQ 115
P +DH E + + + +A+L F ++ P
Sbjct: 66 GPFESYDHLGENNWSLLVQA---VNHWHEASAALMRPFRFLPDWRVDDLMVSFAVP--GG 120
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGAGRRRWRVGEKVPLKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194
>gi|322833624|ref|YP_004213651.1| transcription factor jumonji jmjC domain-containing protein
[Rahnella sp. Y9602]
gi|384258802|ref|YP_005402736.1| transcription factor jumonji jmjC domain-containing protein
[Rahnella aquatilis HX2]
gi|321168825|gb|ADW74524.1| transcription factor jumonji jmjC domain-containing protein
[Rahnella sp. Y9602]
gi|380754778|gb|AFE59169.1| transcription factor jumonji jmjC domain-containing protein
[Rahnella aquatilis HX2]
Length = 373
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/274 (21%), Positives = 115/274 (41%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + +DG+
Sbjct: 16 YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSNQDGKWNV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E G S+ + +P L + + + G PH
Sbjct: 66 GPFESYDHLGETGWSVLVQAVDHWHEPSSHLMTPFRHIPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD++Y+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVAMKQHCPH---PDLLQVEPFEAIIDEELEPGDIVYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H ++ N SL+ ++ A + M +G D R L G R
Sbjct: 183 PGFPHDGYSLEN--SLNYSVGFRAPNA------REM-----VSGFADFVLSREL--GSHR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
Y+ + R P ++Q L + L+ ++ L+
Sbjct: 228 YSDPELVEREHPAEVQPAEL---DKLRGMMLDLI 258
>gi|373949678|ref|ZP_09609639.1| Cupin 4 family protein [Shewanella baltica OS183]
gi|386324489|ref|YP_006020606.1| cupin [Shewanella baltica BA175]
gi|333818634|gb|AEG11300.1| Cupin 4 family protein [Shewanella baltica BA175]
gi|373886278|gb|EHQ15170.1| Cupin 4 family protein [Shewanella baltica OS183]
Length = 385
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 92/216 (42%), Gaps = 32/216 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K+PL I R+ ++ +L++ +++ + D ++E ++ + GQ Q
Sbjct: 16 EYWQKKPLVI-RQGFKHFQDLVSPEELAGLAMDELVESRR----VYQQAGQWQA-----E 65
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN---------TYQTP 111
P +DH E + ++ L + L CF ++ TP
Sbjct: 66 FGPFDSYDHLGE------RDWTLIVQALNNWVPDAEALIQCFDFIPRWRLDDVMVSFATP 119
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH D + FI Q G+++W+V E+ + P E PI+ L
Sbjct: 120 --GGGVGPHIDLYDVFICQGSGRRRWRVGDLGPHKEFA---AHPALLHTEAFDPIIDTEL 174
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
PGD+LY+P G+ H +T E SL ++ +A
Sbjct: 175 LPGDILYIPPGFPHDG--ITLEQSLSFSVGYRTASA 208
>gi|213027812|ref|ZP_03342259.1| hypothetical protein Salmonelentericaenterica_37965 [Salmonella
enterica subsp. enterica serovar Typhi str. 404ty]
Length = 259
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|377576032|ref|ZP_09805016.1| hypothetical protein YcfD [Escherichia hermannii NBRC 105704]
gi|377542064|dbj|GAB50181.1| hypothetical protein YcfD [Escherichia hermannii NBRC 105704]
Length = 379
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R KN+ + L + ++ + +N ++ + + + DG+ Q
Sbjct: 22 HWQKRPVVLKRGFKNFTDPL-SPDELAGLAMENEVDSR----LVSHLDGKWQV-----SH 71
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L G PH
Sbjct: 72 GPFESYDHLGESNWSLLVQAVNHWHEPAAALMRPFRALPDWRTDDLMISFSVPGGGVGPH 131
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 132 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVDPFDAIIDEELEPGDILYIP 188
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N + V I +G D +R L G R
Sbjct: 189 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSQR 233
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
Y AR P DIQ L E ++ ++ L+
Sbjct: 234 YTDPDVPARQYPADIQPQEL---EKVRGMMLDLI 264
>gi|119775200|ref|YP_927940.1| cupin 4 [Shewanella amazonensis SB2B]
gi|119767700|gb|ABM00271.1| cupin 4 [Shewanella amazonensis SB2B]
Length = 372
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI+Q EG+++W V +SP E PI
Sbjct: 113 SFATPQG--GVGPHIDNYDVFIIQGEGQRRWTVGPKGNYQRRGGVTTSPLIEDFE---PI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVT 191
+ V LE GD+LY+P G+ HQ T+T
Sbjct: 168 IDVVLEKGDVLYIPPGFPHQGETLT 192
>gi|292488010|ref|YP_003530887.1| hypothetical protein EAMY_1529 [Erwinia amylovora CFBP1430]
gi|292899228|ref|YP_003538597.1| hypothetical protein EAM_1512 [Erwinia amylovora ATCC 49946]
gi|428784945|ref|ZP_19002436.1| hypothetical protein EaACW_1536 [Erwinia amylovora ACW56400]
gi|291199076|emb|CBJ46187.1| conserved hypothetical protein [Erwinia amylovora ATCC 49946]
gi|291553434|emb|CBA20479.1| Uncharacterized protein ycfD [Erwinia amylovora CFBP1430]
gi|426276507|gb|EKV54234.1| hypothetical protein EaACW_1536 [Erwinia amylovora ACW56400]
Length = 375
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/192 (22%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R K++ + I+ ++ + +N ++ + + ++DG+ Q G
Sbjct: 16 YWQKRPVVLKRGFKHFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV----GHG 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + + V G PH
Sbjct: 67 -PFESYDHLGENNWSLLVQAVNHWHEPSAALMNPFRAIPDWRVDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 FDQYDVFIIQGTGRRRWRVGEKLEMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|56751149|ref|YP_171850.1| hypothetical protein syc1140_d [Synechococcus elongatus PCC 6301]
gi|56686108|dbj|BAD79330.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 270
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 68/139 (48%), Gaps = 8/139 (5%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W ++ +YI+ + +L + ++D+L N F+++ ++ DG+ +
Sbjct: 20 YWAQQSVYIAGDSLRF-QSLFSWNHLNDLL--NYQTFRES-ELRFSRDGE----SLPAGD 71
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHY 121
P + +EG ++ + L+ L +L++ FG N Y +P QGF HY
Sbjct: 72 NPTLWRSRLQEGATLVLNGVHHRVPALKHLATNLRQEFGYRCHINLYSSPAQQQGFDCHY 131
Query: 122 DDIEAFILQLEGKKKWKVY 140
D + ILQ+EG+K+W +Y
Sbjct: 132 DTHDVLILQIEGEKEWLIY 150
>gi|167515846|ref|XP_001742264.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778888|gb|EDQ92502.1| predicted protein [Monosiga brevicollis MX1]
Length = 476
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 82/195 (42%), Gaps = 35/195 (17%)
Query: 87 PLQQLNASLQELF-GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV 145
PL L +LQ +F + Y + AQ H D + +LQ+ G KKW + P +
Sbjct: 147 PLSDLELALQTVFHTTHSTVHAYISSGGAQALPYHTDSYDVIVLQVSGVKKWTICQPPSL 206
Query: 146 DEYLP-------------RYSSP----NFSQEEI-GTPILTVTLEPGDLLYLPRGYIHQA 187
+ P R+++P NF+ I +T+ GD+LYLP+ IH A
Sbjct: 207 ETAWPNPSPADLAQLYELRHANPDGCSNFNHRTIEQLNCQNLTMLAGDMLYLPKSMIHVA 266
Query: 188 STVTNEHSLHVTISVYQKTA-WIDLLEKAMPK-----------ALQAAGATDLEFRRGLP 235
T S H+T S+ ++ W D++ +A +LQ A +TD LP
Sbjct: 267 HTKPGTVSAHLTYSLDREGGMWRDVISRACSTWRLPLCPEILGSLQRAKSTD----AALP 322
Query: 236 IGYLRYAGLARGKPV 250
L Y + G PV
Sbjct: 323 WLQLAYEPVRPGAPV 337
>gi|213428095|ref|ZP_03360845.1| hypothetical protein SentesTyphi_22435 [Salmonella enterica subsp.
enterica serovar Typhi str. E02-1180]
Length = 253
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|167523821|ref|XP_001746247.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775518|gb|EDQ89142.1| predicted protein [Monosiga brevicollis MX1]
Length = 2290
Score = 55.1 bits (131), Expect = 4e-05, Method: Composition-based stats.
Identities = 37/123 (30%), Positives = 54/123 (43%), Gaps = 17/123 (13%)
Query: 94 SLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRM--------- 144
+L+ F V + Y T P + PH D + F+ QL G K W++ +P
Sbjct: 172 ALRAAFDTDVTQHLYATMPGGRALDPHTDGGDVFVHQLAGHKHWEICVPTTNTTCQNCTH 231
Query: 145 -VDEYLPRYSSPNF------SQEEI-GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSL 196
L + +F S E++ L +TL GDLLYLPR +H A T + S
Sbjct: 232 GAQALLAEFERSSFQGCTSYSYEQLQNMSCLNLTLHAGDLLYLPRALVHHAWTDNSTASY 291
Query: 197 HVT 199
H+T
Sbjct: 292 HMT 294
>gi|303249991|ref|ZP_07336193.1| hypothetical protein APP6_1402 [Actinobacillus pleuropneumoniae
serovar 6 str. Femo]
gi|303253163|ref|ZP_07339312.1| hypothetical protein APP2_0468 [Actinobacillus pleuropneumoniae
serovar 2 str. 4226]
gi|302647845|gb|EFL78052.1| hypothetical protein APP2_0468 [Actinobacillus pleuropneumoniae
serovar 2 str. 4226]
gi|302651054|gb|EFL81208.1| hypothetical protein APP6_1402 [Actinobacillus pleuropneumoniae
serovar 6 str. Femo]
Length = 225
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 17/95 (17%)
Query: 100 GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ 159
GC ++ Y T F H+D + F LQ++G+K+W + + P +++P F
Sbjct: 128 GCHTFSSLYIAFNTQSSFKSHWDSRDIFALQMQGRKRWII--------HAPTFNNPLFMH 179
Query: 160 EEIGTP---------ILTVTLEPGDLLYLPRGYIH 185
+ P + + LE GD+LYLPRG+ H
Sbjct: 180 KSKDMPEYNPNLDDVYMDIILEAGDILYLPRGWWH 214
>gi|90408230|ref|ZP_01216397.1| hypothetical protein PCNPT3_11519 [Psychromonas sp. CNPT3]
gi|90310676|gb|EAS38794.1| hypothetical protein PCNPT3_11519 [Psychromonas sp. CNPT3]
Length = 376
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 89/196 (45%), Gaps = 25/196 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNV---IEFKKNIDITLYEDGQRQTMDV- 57
+W+K+PL I + ++ + ++ + + + V + ++ D E G + D
Sbjct: 15 YWQKKPLLIKQGFADFQDPIMPDEIAGLAMEEEVESRLVYRDEQDKWQAEHGPFTSFDKL 74
Query: 58 --EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTA 114
+G +L DH+ E + Q I+P + L N L +L + N
Sbjct: 75 EQKGTSLLVQAVDHWHE-------DAQQLIRPFRFLPNWRLDDLMISYSSVNG------- 120
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
G PH D+ + FI+Q GK+ W+V ++E+ + + + + I+ V LEPG
Sbjct: 121 -GVGPHIDNYDVFIIQGLGKRHWRVGAKTELNEFTAHAALKHCAAFD---AIIDVELEPG 176
Query: 175 DLLYLPRGYIHQASTV 190
D+LY+P G+ H+ +
Sbjct: 177 DILYIPVGFPHEGYAI 192
>gi|417548664|ref|ZP_12199745.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
Naval-18]
gi|417566030|ref|ZP_12216904.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
OIFC143]
gi|395557786|gb|EJG23787.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
OIFC143]
gi|400388963|gb|EJP52035.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
Naval-18]
Length = 382
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 79/175 (45%), Gaps = 18/175 (10%)
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPP 112
+E R + +++++ R G ++ + IK + + + G F A T Y
Sbjct: 75 IEYRCISSVLYEYLRNGATL----VRNRIKN-EPFVDQISKQIGSFSEARTLTGGYAAFS 129
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTV 169
+ + H+D + + +QL G+K+W + P P Y + +I P + V
Sbjct: 130 SKSSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKNFTDIKEPEEIYMDV 186
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
LE GD+LY+PRG+ H + +E + H+ ++ + T ++ L+ MP L
Sbjct: 187 ILEAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNTMPNILDC 240
>gi|307248805|ref|ZP_07530818.1| hypothetical protein appser2_17710 [Actinobacillus pleuropneumoniae
serovar 2 str. S1536]
gi|307253420|ref|ZP_07535291.1| hypothetical protein appser6_19140 [Actinobacillus pleuropneumoniae
serovar 6 str. Femo]
gi|307262553|ref|ZP_07544192.1| hypothetical protein appser12_20870 [Actinobacillus
pleuropneumoniae serovar 12 str. 1096]
gi|306854732|gb|EFM86922.1| hypothetical protein appser2_17710 [Actinobacillus pleuropneumoniae
serovar 2 str. S1536]
gi|306859099|gb|EFM91141.1| hypothetical protein appser6_19140 [Actinobacillus pleuropneumoniae
serovar 6 str. Femo]
gi|306867751|gb|EFM99588.1| hypothetical protein appser12_20870 [Actinobacillus
pleuropneumoniae serovar 12 str. 1096]
Length = 250
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 17/95 (17%)
Query: 100 GCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ 159
GC ++ Y T F H+D + F LQ++G+K+W + + P +++P F
Sbjct: 153 GCHTFSSLYIAFNTQSSFKSHWDSRDIFALQMQGRKRWII--------HAPTFNNPLFMH 204
Query: 160 EEIGTP---------ILTVTLEPGDLLYLPRGYIH 185
+ P + + LE GD+LYLPRG+ H
Sbjct: 205 KSKDMPEYNPNLDDVYMDIILEAGDILYLPRGWWH 239
>gi|293396540|ref|ZP_06640816.1| cupin superfamily protein [Serratia odorifera DSM 4582]
gi|291420804|gb|EFE94057.1| cupin superfamily protein [Serratia odorifera DSM 4582]
Length = 373
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 93/203 (45%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E S ++P +QL + + +L F
Sbjct: 71 YDHLGENNWSLLVQAV-DHWHEPSS-------ALMRPFRQLPDWRMDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVAPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194
>gi|365834386|ref|ZP_09375832.1| cupin family protein [Hafnia alvei ATCC 51873]
gi|364569163|gb|EHM46786.1| cupin family protein [Hafnia alvei ATCC 51873]
Length = 373
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 84/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVILKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E SI + P L ++L G PH
Sbjct: 67 -PFESYDHLGENNWSILVQAVDHWHVPSSALMRPFRKLPDWRTDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKTQLRQHSPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|268589244|ref|ZP_06123465.1| cupin family protein [Providencia rettgeri DSM 1131]
gi|291315428|gb|EFE55881.1| cupin family protein [Providencia rettgeri DSM 1131]
Length = 374
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 95/209 (45%), Gaps = 32/209 (15%)
Query: 2 HWEKEPLYISRKQKNYYNNL----ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
HW+K PL I + N+ + + + ++D + ++ K+ I + G ++ D
Sbjct: 16 HWQKRPLLIKKGFANFIDPISPDELAGLAMEDEVDSRLVSHKEGIWQVGH--GPFESYDH 73
Query: 58 EGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPP 112
G L Q V DH+ S +KP + L + + +L +Y P
Sbjct: 74 LGEENWSILVQAV-DHWHHPSS-------ALMKPFRVLSDWRMDDLM------ISYSVP- 118
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G PH D + FI+Q EG+++W+V + ++ P P+ Q + I+ +E
Sbjct: 119 -GGGVGPHLDQYDVFIIQGEGRRRWRVGEKIPMKQHCPH---PDLLQVQPFDAIIDEEME 174
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
PGD+LY+P G+ H+ + E SL+ ++
Sbjct: 175 PGDILYIPPGFPHEGYAI--EPSLNYSVG 201
>gi|221134859|ref|ZP_03561162.1| putative enzyme with RmlC-like domain [Glaciecola sp. HTCC2999]
Length = 381
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 4/85 (4%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G H D + FI+Q +G ++W+V M +Y Y P Q + PI
Sbjct: 114 SYSQP--GAGVGAHIDQYDVFIVQGKGTRRWQVGDKSM--KYAKYYPHPKLQQIDEFEPI 169
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVT 191
+ V L PGD+LY+P G+ H+ ++T
Sbjct: 170 IDVELLPGDILYIPPGFPHKGQSIT 194
>gi|384126558|ref|YP_005509172.1| hypothetical protein YPD8_2100 [Yersinia pestis D182038]
gi|262366222|gb|ACY62779.1| hypothetical protein YPD8_2100 [Yersinia pestis D182038]
Length = 373
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 94/203 (46%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
+W+K P+ + R KN+ + L + ++ + +N ++ + + +EDG+ ++
Sbjct: 16 YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWHVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ ++ N
Sbjct: 172 EMEPGDILYIPPGFPHEGYSLEN 194
>gi|261339429|ref|ZP_05967287.1| cupin family protein [Enterobacter cancerogenus ATCC 35316]
gi|288318235|gb|EFC57173.1| cupin family protein [Enterobacter cancerogenus ATCC 35316]
Length = 373
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 61/274 (22%), Positives = 116/274 (42%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGISNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ +A +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGF-----------RAPSGREMISGFADYVLQREL--GSYR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
Y+ AR P DI + L + L+ ++ L+
Sbjct: 228 YSDPDVPAREHPADILPEEL---DKLRGMMLDLI 258
>gi|22125686|ref|NP_669109.1| hypothetical protein y1792 [Yersinia pestis KIM10+]
gi|45441573|ref|NP_993112.1| hypothetical protein YP_1762 [Yersinia pestis biovar Microtus str.
91001]
gi|108807887|ref|YP_651803.1| hypothetical protein YPA_1892 [Yersinia pestis Antiqua]
gi|108812161|ref|YP_647928.1| hypothetical protein YPN_1999 [Yersinia pestis Nepal516]
gi|145599098|ref|YP_001163174.1| hypothetical protein YPDSF_1816 [Yersinia pestis Pestoides F]
gi|21958601|gb|AAM85360.1|AE013782_1 hypothetical protein y1792 [Yersinia pestis KIM10+]
gi|45436434|gb|AAS61989.1| conserved hypothetical protein [Yersinia pestis biovar Microtus
str. 91001]
gi|108775809|gb|ABG18328.1| hypothetical protein YPN_1999 [Yersinia pestis Nepal516]
gi|108779800|gb|ABG13858.1| hypothetical protein YPA_1892 [Yersinia pestis Antiqua]
gi|145210794|gb|ABP40201.1| hypothetical protein YPDSF_1816 [Yersinia pestis Pestoides F]
Length = 391
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 94/203 (46%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
+W+K P+ + R KN+ + L + ++ + +N ++ + + +EDG+ ++
Sbjct: 34 YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWHVSHGPFES 88
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 89 FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMISF------S 134
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 135 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 189
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ ++ N
Sbjct: 190 EMEPGDILYIPPGFPHEGYSLEN 212
>gi|338531259|ref|YP_004664593.1| cupin family protein [Myxococcus fulvus HW-1]
gi|337257355|gb|AEI63515.1| cupin family protein [Myxococcus fulvus HW-1]
Length = 295
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 101/239 (42%), Gaps = 25/239 (10%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
+H+++ P + L T + ID ++ + + Y G+R T R
Sbjct: 18 EHYQRRP-FTGASAAERLQRLGTWETIDFLVEETACDVLLARQGVPY-PGERPTTAKAAR 75
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
L + +G ++ P + L QL + + + Y TP GF H
Sbjct: 76 EL-------FAQGYTLALRQPDLHHPELAQLARAFSSELHGRINLHIYCTPAGHHGFGWH 128
Query: 121 YDDIEAFILQLEGKKKW----KVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDL 176
D E FILQ G+K++ P + E +P S +QE+ TP+ T +L G
Sbjct: 129 CDPEEVFILQTTGRKEYLLRENTLHPVPLPESVP--SGSLAAQEK--TPVETHSLGAGGF 184
Query: 177 LYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
+Y+P GY H A E +L +++ + T +DLL+ ++A A+ +RR +P
Sbjct: 185 IYIPGGYWHMAQ--APEEALSISMGLMPPTL-LDLLD-----GVRATLASSPVWRRRMP 235
>gi|167526437|ref|XP_001747552.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773998|gb|EDQ87632.1| predicted protein [Monosiga brevicollis MX1]
Length = 2336
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 63/140 (45%), Gaps = 19/140 (13%)
Query: 91 LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
L + ++ G + Y TP AQ PH D + ++Q+ +K W + LP+ + +
Sbjct: 138 LESLIESELGIDATVHAYFTPANAQTLEPHTDPYDVVVVQVANQKHWTLCLPQTDNATVS 197
Query: 151 RYSSPNFSQEEI------GTPILT-----------VTLEPGDLLYLPRGYIHQASTVTNE 193
+ +EI G T VTL GD +YLP+G IH A T T+
Sbjct: 198 LSEADRAQLQEIKRSHLDGCTTYTMSMLQPMICRNVTLHQGDSMYLPKGVIHYAVT-TDT 256
Query: 194 HSLHVTISVYQKT-AWIDLL 212
S H+TI + + W+D+L
Sbjct: 257 PSAHLTIGLSRTGRTWLDVL 276
>gi|24374177|ref|NP_718220.1| cupin 4 family protein [Shewanella oneidensis MR-1]
gi|24348687|gb|AAN55664.1| cupin 4 family protein [Shewanella oneidensis MR-1]
Length = 380
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 7/78 (8%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKV--YLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH D+ + FI+Q EG+++W V P+ R +PN E PI+ + LE
Sbjct: 120 GVGPHIDNYDVFIIQGEGQRRWTVGDNTPQQC-----RGGNPNSPLVEDFDPIIDMVLEK 174
Query: 174 GDLLYLPRGYIHQASTVT 191
GD+LY+P G+ H T+T
Sbjct: 175 GDMLYIPPGFPHCGETLT 192
>gi|72162256|ref|YP_289913.1| hypothetical protein Tfu_1857 [Thermobifida fusca YX]
gi|71915988|gb|AAZ55890.1| hypothetical protein Tfu_1857 [Thermobifida fusca YX]
Length = 395
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 66/145 (45%), Gaps = 3/145 (2%)
Query: 63 PQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYD 122
P+ ++ R+G S+ P++ L L V N Y + GF H+D
Sbjct: 90 PEALYAQLRQGASLVLDGIDRIHPPIRAAADDLMRLVHERVQVNLYLIWGDSHGFNTHWD 149
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRG 182
D + FI+Q+ G K W+V+ + Y P + S + + T+ G++L++PRG
Sbjct: 150 DHDTFIVQVAGTKHWQVH-GQGTRPY-PMKEDIDHSHQPPEGTVWEGTVRAGEVLHVPRG 207
Query: 183 YIHQASTVTNEHSLHVTISVYQKTA 207
+ H T T + S+H+T + T
Sbjct: 208 WWHTV-TGTGDVSMHLTFGFTRATG 231
>gi|392548135|ref|ZP_10295272.1| RmlC-like domain-containing protein [Pseudoalteromonas rubra ATCC
29570]
Length = 384
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
++ TP G PH D + FI+Q +GK++W+V +P D L +++ + + Q E
Sbjct: 116 SFSTP--GGGVGPHLDQYDVFIIQGQGKRRWRVGMP---DASLKQFAQNKSLLQVEQFEA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
+ LEPGD+LY+P G H+ V N
Sbjct: 171 CIDAELEPGDILYIPPGCPHEGYAVEN 197
>gi|383190790|ref|YP_005200918.1| hypothetical protein Rahaq2_2952 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
gi|371589048|gb|AEX52778.1| hypothetical protein Rahaq2_2952 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
Length = 373
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 46/207 (22%), Positives = 90/207 (43%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + +DG+
Sbjct: 16 YWQKRPVVLKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSNQDGKWNV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E G S+ + +P L + + + G PH
Sbjct: 66 GPFESYDHLGETGWSVLVQAVDHWHEPSSHLMTPFRHIPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ LEPGD++Y+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKIAMKQHCPH---PDLLQVEPFDAIIDEELEPGDIVYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H ++ N SL+ ++ A
Sbjct: 183 PGFPHDGYSLEN--SLNYSVGFRAPNA 207
>gi|170023954|ref|YP_001720459.1| cupin 4 family protein [Yersinia pseudotuberculosis YPIII]
gi|169750488|gb|ACA68006.1| Cupin 4 family protein [Yersinia pseudotuberculosis YPIII]
Length = 373
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 94/203 (46%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
+W+K P+ + R KN+ + L + ++ + +N ++ + + +EDG+ ++
Sbjct: 16 YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWHVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ ++ N
Sbjct: 172 EMEPGDILYIPPGFPHEGYSLEN 194
>gi|51596758|ref|YP_070949.1| hypothetical protein YPTB2436 [Yersinia pseudotuberculosis IP
32953]
gi|149366404|ref|ZP_01888438.1| hypothetical protein YPE_1629 [Yersinia pestis CA88-4125]
gi|162418272|ref|YP_001607239.1| hypothetical protein YpAngola_A2857 [Yersinia pestis Angola]
gi|165925645|ref|ZP_02221477.1| cupin family protein [Yersinia pestis biovar Orientalis str.
F1991016]
gi|165938468|ref|ZP_02227025.1| cupin family protein [Yersinia pestis biovar Orientalis str. IP275]
gi|166008219|ref|ZP_02229117.1| cupin family protein [Yersinia pestis biovar Antiqua str. E1979001]
gi|166211021|ref|ZP_02237056.1| cupin family protein [Yersinia pestis biovar Antiqua str.
B42003004]
gi|167401779|ref|ZP_02307270.1| cupin family protein [Yersinia pestis biovar Antiqua str.
UG05-0454]
gi|167421724|ref|ZP_02313477.1| cupin family protein [Yersinia pestis biovar Orientalis str.
MG05-1020]
gi|167426338|ref|ZP_02318091.1| cupin family protein [Yersinia pestis biovar Mediaevalis str.
K1973002]
gi|167469316|ref|ZP_02334020.1| hypothetical protein YpesF_15829 [Yersinia pestis FV-1]
gi|186895826|ref|YP_001872938.1| cupin 4 family protein [Yersinia pseudotuberculosis PB1/+]
gi|218928768|ref|YP_002346643.1| hypothetical protein YPO1632 [Yersinia pestis CO92]
gi|229841619|ref|ZP_04461777.1| hypothetical protein YPH_4010 [Yersinia pestis biovar Orientalis
str. PEXU2]
gi|229843736|ref|ZP_04463879.1| hypothetical protein YPF_2132 [Yersinia pestis biovar Orientalis
str. India 195]
gi|229894480|ref|ZP_04509662.1| hypothetical protein YPS_2256 [Yersinia pestis Pestoides A]
gi|229902482|ref|ZP_04517601.1| hypothetical protein YP516_2225 [Yersinia pestis Nepal516]
gi|270490338|ref|ZP_06207412.1| cupin family protein [Yersinia pestis KIM D27]
gi|294504170|ref|YP_003568232.1| hypothetical protein YPZ3_2060 [Yersinia pestis Z176003]
gi|384122691|ref|YP_005505311.1| hypothetical protein YPD4_2101 [Yersinia pestis D106004]
gi|384140479|ref|YP_005523181.1| hypothetical protein A1122_17720 [Yersinia pestis A1122]
gi|384414271|ref|YP_005623633.1| hypothetical protein YPC_1734 [Yersinia pestis biovar Medievalis
str. Harbin 35]
gi|420546393|ref|ZP_15044386.1| cupin superfamily protein [Yersinia pestis PY-01]
gi|420551710|ref|ZP_15049140.1| cupin superfamily protein [Yersinia pestis PY-02]
gi|420557260|ref|ZP_15054032.1| cupin superfamily protein [Yersinia pestis PY-03]
gi|420562800|ref|ZP_15058912.1| cupin superfamily protein [Yersinia pestis PY-04]
gi|420567813|ref|ZP_15063454.1| cupin superfamily protein [Yersinia pestis PY-05]
gi|420573506|ref|ZP_15068615.1| cupin superfamily protein [Yersinia pestis PY-06]
gi|420578792|ref|ZP_15073409.1| cupin superfamily protein [Yersinia pestis PY-07]
gi|420584154|ref|ZP_15078279.1| cupin superfamily protein [Yersinia pestis PY-08]
gi|420589318|ref|ZP_15082928.1| cupin superfamily protein [Yersinia pestis PY-09]
gi|420594626|ref|ZP_15087706.1| cupin superfamily protein [Yersinia pestis PY-10]
gi|420600320|ref|ZP_15092793.1| cupin superfamily protein [Yersinia pestis PY-11]
gi|420605779|ref|ZP_15097685.1| cupin superfamily protein [Yersinia pestis PY-12]
gi|420611143|ref|ZP_15102530.1| cupin superfamily protein [Yersinia pestis PY-13]
gi|420616463|ref|ZP_15107222.1| cupin superfamily protein [Yersinia pestis PY-14]
gi|420621861|ref|ZP_15112006.1| cupin superfamily protein [Yersinia pestis PY-15]
gi|420626880|ref|ZP_15116559.1| cupin superfamily protein [Yersinia pestis PY-16]
gi|420632086|ref|ZP_15121252.1| cupin superfamily protein [Yersinia pestis PY-19]
gi|420637202|ref|ZP_15125839.1| cupin superfamily protein [Yersinia pestis PY-25]
gi|420642732|ref|ZP_15130847.1| cupin superfamily protein [Yersinia pestis PY-29]
gi|420647938|ref|ZP_15135593.1| cupin superfamily protein [Yersinia pestis PY-32]
gi|420653572|ref|ZP_15140655.1| cupin superfamily protein [Yersinia pestis PY-34]
gi|420659092|ref|ZP_15145617.1| cupin superfamily protein [Yersinia pestis PY-36]
gi|420664398|ref|ZP_15150364.1| cupin superfamily protein [Yersinia pestis PY-42]
gi|420669342|ref|ZP_15154852.1| cupin superfamily protein [Yersinia pestis PY-45]
gi|420674667|ref|ZP_15159700.1| cupin superfamily protein [Yersinia pestis PY-46]
gi|420680226|ref|ZP_15164730.1| cupin superfamily protein [Yersinia pestis PY-47]
gi|420685501|ref|ZP_15169450.1| cupin superfamily protein [Yersinia pestis PY-48]
gi|420690685|ref|ZP_15174034.1| cupin superfamily protein [Yersinia pestis PY-52]
gi|420696479|ref|ZP_15179106.1| cupin superfamily protein [Yersinia pestis PY-53]
gi|420701969|ref|ZP_15183723.1| cupin superfamily protein [Yersinia pestis PY-54]
gi|420707842|ref|ZP_15188598.1| cupin superfamily protein [Yersinia pestis PY-55]
gi|420713175|ref|ZP_15193376.1| cupin superfamily protein [Yersinia pestis PY-56]
gi|420718601|ref|ZP_15198116.1| cupin superfamily protein [Yersinia pestis PY-58]
gi|420724164|ref|ZP_15202922.1| cupin superfamily protein [Yersinia pestis PY-59]
gi|420729761|ref|ZP_15207931.1| cupin superfamily protein [Yersinia pestis PY-60]
gi|420734818|ref|ZP_15212500.1| cupin superfamily protein [Yersinia pestis PY-61]
gi|420740282|ref|ZP_15217421.1| cupin superfamily protein [Yersinia pestis PY-63]
gi|420745748|ref|ZP_15222170.1| cupin superfamily protein [Yersinia pestis PY-64]
gi|420751427|ref|ZP_15227086.1| cupin superfamily protein [Yersinia pestis PY-65]
gi|420756820|ref|ZP_15231677.1| cupin superfamily protein [Yersinia pestis PY-66]
gi|420762542|ref|ZP_15236426.1| cupin superfamily protein [Yersinia pestis PY-71]
gi|420767801|ref|ZP_15241170.1| cupin superfamily protein [Yersinia pestis PY-72]
gi|420772776|ref|ZP_15245644.1| cupin superfamily protein [Yersinia pestis PY-76]
gi|420778224|ref|ZP_15250489.1| cupin superfamily protein [Yersinia pestis PY-88]
gi|420789062|ref|ZP_15260036.1| cupin superfamily protein [Yersinia pestis PY-90]
gi|420794543|ref|ZP_15264977.1| cupin superfamily protein [Yersinia pestis PY-91]
gi|420799657|ref|ZP_15269576.1| cupin superfamily protein [Yersinia pestis PY-92]
gi|420805005|ref|ZP_15274395.1| cupin superfamily protein [Yersinia pestis PY-93]
gi|420810292|ref|ZP_15279170.1| cupin superfamily protein [Yersinia pestis PY-94]
gi|420815964|ref|ZP_15284267.1| cupin superfamily protein [Yersinia pestis PY-95]
gi|420821152|ref|ZP_15288950.1| cupin superfamily protein [Yersinia pestis PY-96]
gi|420826242|ref|ZP_15293508.1| cupin superfamily protein [Yersinia pestis PY-98]
gi|420831992|ref|ZP_15298708.1| cupin superfamily protein [Yersinia pestis PY-99]
gi|420836835|ref|ZP_15303075.1| cupin superfamily protein [Yersinia pestis PY-100]
gi|420841993|ref|ZP_15307749.1| cupin superfamily protein [Yersinia pestis PY-101]
gi|420847632|ref|ZP_15312832.1| cupin superfamily protein [Yersinia pestis PY-102]
gi|420853075|ref|ZP_15317580.1| cupin superfamily protein [Yersinia pestis PY-103]
gi|420858557|ref|ZP_15322281.1| cupin superfamily protein [Yersinia pestis PY-113]
gi|421763099|ref|ZP_16199896.1| hypothetical protein INS_08445 [Yersinia pestis INS]
gi|51590040|emb|CAH21674.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP
32953]
gi|115347379|emb|CAL20277.1| conserved hypothetical protein [Yersinia pestis CO92]
gi|149290778|gb|EDM40853.1| hypothetical protein YPE_1629 [Yersinia pestis CA88-4125]
gi|162351087|gb|ABX85035.1| cupin family protein [Yersinia pestis Angola]
gi|165913583|gb|EDR32203.1| cupin family protein [Yersinia pestis biovar Orientalis str. IP275]
gi|165922754|gb|EDR39905.1| cupin family protein [Yersinia pestis biovar Orientalis str.
F1991016]
gi|165992601|gb|EDR44902.1| cupin family protein [Yersinia pestis biovar Antiqua str. E1979001]
gi|166208201|gb|EDR52681.1| cupin family protein [Yersinia pestis biovar Antiqua str.
B42003004]
gi|166960209|gb|EDR56230.1| cupin family protein [Yersinia pestis biovar Orientalis str.
MG05-1020]
gi|167048884|gb|EDR60292.1| cupin family protein [Yersinia pestis biovar Antiqua str.
UG05-0454]
gi|167054693|gb|EDR64497.1| cupin family protein [Yersinia pestis biovar Mediaevalis str.
K1973002]
gi|186698852|gb|ACC89481.1| Cupin 4 family protein [Yersinia pseudotuberculosis PB1/+]
gi|229680528|gb|EEO76625.1| hypothetical protein YP516_2225 [Yersinia pestis Nepal516]
gi|229689344|gb|EEO81407.1| hypothetical protein YPF_2132 [Yersinia pestis biovar Orientalis
str. India 195]
gi|229694082|gb|EEO84130.1| hypothetical protein YPH_4010 [Yersinia pestis biovar Orientalis
str. PEXU2]
gi|229703499|gb|EEO90516.1| hypothetical protein YPS_2256 [Yersinia pestis Pestoides A]
gi|262362287|gb|ACY59008.1| hypothetical protein YPD4_2101 [Yersinia pestis D106004]
gi|270338842|gb|EFA49619.1| cupin family protein [Yersinia pestis KIM D27]
gi|294354629|gb|ADE64970.1| hypothetical protein YPZ3_2060 [Yersinia pestis Z176003]
gi|320014775|gb|ADV98346.1| hypothetical protein YPC_1734 [Yersinia pestis biovar Medievalis
str. Harbin 35]
gi|342855608|gb|AEL74161.1| hypothetical protein A1122_17720 [Yersinia pestis A1122]
gi|391427956|gb|EIQ89991.1| cupin superfamily protein [Yersinia pestis PY-01]
gi|391429382|gb|EIQ91241.1| cupin superfamily protein [Yersinia pestis PY-02]
gi|391430794|gb|EIQ92463.1| cupin superfamily protein [Yersinia pestis PY-03]
gi|391443452|gb|EIR03767.1| cupin superfamily protein [Yersinia pestis PY-04]
gi|391444756|gb|EIR04947.1| cupin superfamily protein [Yersinia pestis PY-05]
gi|391447828|gb|EIR07702.1| cupin superfamily protein [Yersinia pestis PY-06]
gi|391460328|gb|EIR19043.1| cupin superfamily protein [Yersinia pestis PY-07]
gi|391461292|gb|EIR19909.1| cupin superfamily protein [Yersinia pestis PY-08]
gi|391463290|gb|EIR21709.1| cupin superfamily protein [Yersinia pestis PY-09]
gi|391476420|gb|EIR33538.1| cupin superfamily protein [Yersinia pestis PY-10]
gi|391478148|gb|EIR35099.1| cupin superfamily protein [Yersinia pestis PY-11]
gi|391478275|gb|EIR35211.1| cupin superfamily protein [Yersinia pestis PY-12]
gi|391492316|gb|EIR47797.1| cupin superfamily protein [Yersinia pestis PY-13]
gi|391493193|gb|EIR48571.1| cupin superfamily protein [Yersinia pestis PY-15]
gi|391495564|gb|EIR50646.1| cupin superfamily protein [Yersinia pestis PY-14]
gi|391508331|gb|EIR62082.1| cupin superfamily protein [Yersinia pestis PY-19]
gi|391508362|gb|EIR62111.1| cupin superfamily protein [Yersinia pestis PY-16]
gi|391513157|gb|EIR66406.1| cupin superfamily protein [Yersinia pestis PY-25]
gi|391523516|gb|EIR75820.1| cupin superfamily protein [Yersinia pestis PY-29]
gi|391526248|gb|EIR78299.1| cupin superfamily protein [Yersinia pestis PY-34]
gi|391527050|gb|EIR79008.1| cupin superfamily protein [Yersinia pestis PY-32]
gi|391539357|gb|EIR90086.1| cupin superfamily protein [Yersinia pestis PY-36]
gi|391541914|gb|EIR92423.1| cupin superfamily protein [Yersinia pestis PY-42]
gi|391543151|gb|EIR93509.1| cupin superfamily protein [Yersinia pestis PY-45]
gi|391557050|gb|EIS06080.1| cupin superfamily protein [Yersinia pestis PY-46]
gi|391557449|gb|EIS06441.1| cupin superfamily protein [Yersinia pestis PY-47]
gi|391558755|gb|EIS07609.1| cupin superfamily protein [Yersinia pestis PY-48]
gi|391572334|gb|EIS19578.1| cupin superfamily protein [Yersinia pestis PY-52]
gi|391572861|gb|EIS20032.1| cupin superfamily protein [Yersinia pestis PY-53]
gi|391582579|gb|EIS28325.1| cupin superfamily protein [Yersinia pestis PY-54]
gi|391584753|gb|EIS30243.1| cupin superfamily protein [Yersinia pestis PY-55]
gi|391588022|gb|EIS33108.1| cupin superfamily protein [Yersinia pestis PY-56]
gi|391601031|gb|EIS44492.1| cupin superfamily protein [Yersinia pestis PY-58]
gi|391601620|gb|EIS45026.1| cupin superfamily protein [Yersinia pestis PY-60]
gi|391603261|gb|EIS46467.1| cupin superfamily protein [Yersinia pestis PY-59]
gi|391615958|gb|EIS57673.1| cupin superfamily protein [Yersinia pestis PY-61]
gi|391616713|gb|EIS58334.1| cupin superfamily protein [Yersinia pestis PY-63]
gi|391622502|gb|EIS63412.1| cupin superfamily protein [Yersinia pestis PY-64]
gi|391627973|gb|EIS68120.1| cupin superfamily protein [Yersinia pestis PY-65]
gi|391639210|gb|EIS77928.1| cupin superfamily protein [Yersinia pestis PY-71]
gi|391639541|gb|EIS78208.1| cupin superfamily protein [Yersinia pestis PY-66]
gi|391641459|gb|EIS79868.1| cupin superfamily protein [Yersinia pestis PY-72]
gi|391651269|gb|EIS88463.1| cupin superfamily protein [Yersinia pestis PY-76]
gi|391656529|gb|EIS93154.1| cupin superfamily protein [Yersinia pestis PY-88]
gi|391664478|gb|EIT00201.1| cupin superfamily protein [Yersinia pestis PY-90]
gi|391671297|gb|EIT06257.1| cupin superfamily protein [Yersinia pestis PY-91]
gi|391682105|gb|EIT16011.1| cupin superfamily protein [Yersinia pestis PY-93]
gi|391683556|gb|EIT17318.1| cupin superfamily protein [Yersinia pestis PY-92]
gi|391684343|gb|EIT18032.1| cupin superfamily protein [Yersinia pestis PY-94]
gi|391695963|gb|EIT28495.1| cupin superfamily protein [Yersinia pestis PY-95]
gi|391699345|gb|EIT31546.1| cupin superfamily protein [Yersinia pestis PY-96]
gi|391700775|gb|EIT32845.1| cupin superfamily protein [Yersinia pestis PY-98]
gi|391709817|gb|EIT40956.1| cupin superfamily protein [Yersinia pestis PY-99]
gi|391716764|gb|EIT47191.1| cupin superfamily protein [Yersinia pestis PY-100]
gi|391717447|gb|EIT47808.1| cupin superfamily protein [Yersinia pestis PY-101]
gi|391728262|gb|EIT57393.1| cupin superfamily protein [Yersinia pestis PY-102]
gi|391731096|gb|EIT59840.1| cupin superfamily protein [Yersinia pestis PY-103]
gi|391735673|gb|EIT63789.1| cupin superfamily protein [Yersinia pestis PY-113]
gi|411177305|gb|EKS47320.1| hypothetical protein INS_08445 [Yersinia pestis INS]
Length = 373
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 91/203 (44%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
+W+K P+ + R KN+ + L D L +E + + + +EDG+ ++
Sbjct: 16 YWQKRPVILKRGFKNFIDPLSP-----DELAGLAMENEVDSRLVSHEDGRWHVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ ++ N
Sbjct: 172 EMEPGDILYIPPGFPHEGYSLEN 194
>gi|310639709|ref|YP_003944467.1| hypothetical protein [Paenibacillus polymyxa SC2]
gi|386038909|ref|YP_005957863.1| hypothetical protein PPM_0219 [Paenibacillus polymyxa M1]
gi|309244659|gb|ADO54226.1| Uncharacterized protein yxbC [Paenibacillus polymyxa SC2]
gi|343094947|emb|CCC83156.1| uncharacterized protein yxbC [Paenibacillus polymyxa M1]
Length = 333
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 65/164 (39%), Gaps = 24/164 (14%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQ---GFAPHYDDI 124
D Y G ++ F ++ L SL++ G+ A+ GF H+D
Sbjct: 93 DWYERGAALEFDFSDMFLPQLHSWMDSLRKELSLPSGSGIKAIVYAAKNGGGFKAHFDAY 152
Query: 125 EAFILQLEGKKKWKVYL----------------PRMVDEYLPRYSSPNFSQEEIGTPILT 168
FI L+G K WK+ P + DE + + + G I
Sbjct: 153 TNFIFHLQGTKTWKLLANENVTNPIQHYDLAEKPYIPDELATYWKGEHPQTDLPGADI-- 210
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
V L PG LYLPRG H S+ SL++T S AW++LL
Sbjct: 211 VNLLPGSFLYLPRGIWHSTSSTEETLSLNITFS---HPAWLELL 251
>gi|145350199|ref|XP_001419502.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579734|gb|ABO97795.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 392
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 18/118 (15%)
Query: 97 ELFGCF----VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKV-----YLPRMVDE 147
ELF CF + P G PH D + F+LQ EG+K W V Y P
Sbjct: 92 ELFNCFPRWRISDIQASVSPDGGGVGPHSDHFDVFLLQAEGEKVWAVADNEEYWPDNDAA 151
Query: 148 YLPRYSS---PNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
++P +F +++ + TL PGD+LYLP H ++ + VT+S+
Sbjct: 152 FVPECEIRVLKSFVEDD------SFTLVPGDMLYLPPKIAHNGVATNSKPGVSVTLSI 203
>gi|334122432|ref|ZP_08496470.1| cupin superfamily protein [Enterobacter hormaechei ATCC 49162]
gi|333392039|gb|EGK63146.1| cupin superfamily protein [Enterobacter hormaechei ATCC 49162]
Length = 373
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 61/274 (22%), Positives = 116/274 (42%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGISNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGESNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ +A +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGF-----------RAPSVREMISGFADYVLQREL--GSYR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
Y+ AR P DI + L + L+ ++ L+
Sbjct: 228 YSDPDVPARAHPADILPEEL---DKLRGMMLDLI 258
>gi|398797523|ref|ZP_10556844.1| hypothetical protein PMI17_00519 [Pantoea sp. GM01]
gi|398102729|gb|EJL92907.1| hypothetical protein PMI17_00519 [Pantoea sp. GM01]
Length = 373
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPLKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194
>gi|302852113|ref|XP_002957578.1| hypothetical protein VOLCADRAFT_98689 [Volvox carteri f.
nagariensis]
gi|300257095|gb|EFJ41348.1| hypothetical protein VOLCADRAFT_98689 [Volvox carteri f.
nagariensis]
Length = 1183
Score = 54.7 bits (130), Expect = 6e-05, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-EYLPRYS 153
++ L G GAN Y +PP QG A HYDD +LQL+G K+W + PR++ + LP
Sbjct: 485 MEWLLGLPAGANVYGSPPGRQGLAAHYDDHCVLVLQLQGAKEWLLQPPRLLPAQLLPLTY 544
Query: 154 SPNFSQEEIG 163
P +G
Sbjct: 545 CPRLPLGPLG 554
>gi|332850080|ref|ZP_08432467.1| cupin family protein [Acinetobacter baumannii 6013150]
gi|332871476|ref|ZP_08439989.1| cupin family protein [Acinetobacter baumannii 6013113]
gi|332730929|gb|EGJ62235.1| cupin family protein [Acinetobacter baumannii 6013150]
gi|332731453|gb|EGJ62744.1| cupin family protein [Acinetobacter baumannii 6013113]
Length = 382
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/173 (22%), Positives = 80/173 (46%), Gaps = 14/173 (8%)
Query: 57 VEGRALPQIVWDHYREGCSI--RFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+E R + + +++ R G ++ ++ + ++ + + A+ E VG Y +
Sbjct: 75 IEYRCITSVFYEYLRNGATLVRNRISNEPFVDQISKQIATFAEA-RTLVGG--YAAFSSK 131
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTL 171
+ H+D + + +QL G+K+W + P P Y + +I P + V L
Sbjct: 132 SSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKNFPDIKEPEEIYMDVIL 188
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
E GD+LY+PRG+ H + +E + H+ ++ + T ++ L+ MP L
Sbjct: 189 EAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNTMPNILDC 240
>gi|378579358|ref|ZP_09828026.1| cupin superfamily protein [Pantoea stewartii subsp. stewartii
DC283]
gi|377818075|gb|EHU01163.1| cupin superfamily protein [Pantoea stewartii subsp. stewartii
DC283]
Length = 374
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194
>gi|114563434|ref|YP_750947.1| cupin 4 family protein [Shewanella frigidimarina NCIMB 400]
gi|114334727|gb|ABI72109.1| Cupin 4 family protein [Shewanella frigidimarina NCIMB 400]
Length = 380
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 5/85 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI+Q +G+++WKV ++ R +PN + PI
Sbjct: 113 SFATP--GGGVGPHIDNYDVFIIQGDGERRWKV---GDKGQHKRRGDNPNSPLVDDFEPI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVT 191
+ V L+ GD+LY+P GY H T++
Sbjct: 168 IDVVLQKGDVLYIPPGYPHCGETLS 192
>gi|423113827|ref|ZP_17101518.1| hypothetical protein HMPREF9689_01575 [Klebsiella oxytoca 10-5245]
gi|376387472|gb|EHT00182.1| hypothetical protein HMPREF9689_01575 [Klebsiella oxytoca 10-5245]
Length = 373
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 115/274 (41%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ Y+ +L +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
YA +R P DI L + L+D++ L+
Sbjct: 228 YADPDVPSREHPADIMPVEL---DRLRDMMLGLI 258
>gi|291617063|ref|YP_003519805.1| hypothetical protein PANA_1510 [Pantoea ananatis LMG 20103]
gi|378767674|ref|YP_005196143.1| cupin 4 family protein [Pantoea ananatis LMG 5342]
gi|386015450|ref|YP_005933731.1| transcription factor YcfD [Pantoea ananatis AJ13355]
gi|386079829|ref|YP_005993354.1| transcription factor YcfD [Pantoea ananatis PA13]
gi|291152093|gb|ADD76677.1| YcfD [Pantoea ananatis LMG 20103]
gi|327393513|dbj|BAK10935.1| transcription factor YcfD [Pantoea ananatis AJ13355]
gi|354989010|gb|AER33134.1| transcription factor YcfD [Pantoea ananatis PA13]
gi|365187156|emb|CCF10106.1| cupin 4 family protein [Pantoea ananatis LMG 5342]
Length = 374
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPLKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194
>gi|326794623|ref|YP_004312443.1| hypothetical protein Marme_1334 [Marinomonas mediterranea MMB-1]
gi|326545387|gb|ADZ90607.1| Cupin, JmjC-type [Marinomonas mediterranea MMB-1]
Length = 397
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%)
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PHYD + F++Q+ GK++W+V P ++ + + TP + LEPGD+L
Sbjct: 132 GPHYDQYDVFLVQMSGKRRWQVAEPTEYEDSAVEGTQLHILDNFPSTPSIECDLEPGDIL 191
Query: 178 YLPRGYIHQASTVTNE 193
YLP + H + +E
Sbjct: 192 YLPPNFAHNGRALNSE 207
>gi|317047740|ref|YP_004115388.1| cupin [Pantoea sp. At-9b]
gi|316949357|gb|ADU68832.1| Cupin 4 family protein [Pantoea sp. At-9b]
Length = 373
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194
>gi|167526435|ref|XP_001747551.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773997|gb|EDQ87631.1| predicted protein [Monosiga brevicollis MX1]
Length = 2348
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 63/140 (45%), Gaps = 19/140 (13%)
Query: 91 LNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP 150
L + ++ G + Y TP AQ PH D + ++Q+ +K W + LP+ + +
Sbjct: 152 LESLIESELGIDATVHAYFTPANAQTLEPHTDPYDVVVVQVANQKHWTLCLPQTDNATVS 211
Query: 151 RYSSPNFSQEEI------GTPILT-----------VTLEPGDLLYLPRGYIHQASTVTNE 193
+ +EI G T VTL GD +YLP+G IH A T T+
Sbjct: 212 LSEADRAQLQEIKRSHLDGCTTYTMSMLQPMICRNVTLHQGDSMYLPKGVIHYAVT-TDT 270
Query: 194 HSLHVTISVYQKT-AWIDLL 212
S H+TI + + W+D+L
Sbjct: 271 PSAHLTIGLSRTGRTWLDVL 290
>gi|407644844|ref|YP_006808603.1| hypothetical protein O3I_018350 [Nocardia brasiliensis ATCC 700358]
gi|407307728|gb|AFU01629.1| hypothetical protein O3I_018350 [Nocardia brasiliensis ATCC 700358]
Length = 396
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 49/97 (50%), Gaps = 12/97 (12%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKV----YLPRMVDEY--------LPRYSSPNFSQEEIG 163
G H+D I+ +QL G+K W++ + P +D + L R + EEI
Sbjct: 132 GTRAHFDAIDVITIQLTGRKTWRIAPNAFAPAPLDGWTTLETVSPLLRNYASALPPEEIP 191
Query: 164 TPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTI 200
+ LEPG +L+LPRGY H+ ++ + SLH+ +
Sbjct: 192 DGVTEHVLEPGSVLHLPRGYWHETNSAGDSISLHLAL 228
>gi|398791557|ref|ZP_10552277.1| hypothetical protein PMI39_00853 [Pantoea sp. YR343]
gi|398214702|gb|EJN01273.1| hypothetical protein PMI39_00853 [Pantoea sp. YR343]
Length = 373
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH+D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKVPLKQHCPH---PDLLQVEPFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194
>gi|238758067|ref|ZP_04619248.1| hypothetical protein yaldo0001_9460 [Yersinia aldovae ATCC 35236]
gi|238703821|gb|EEP96357.1| hypothetical protein yaldo0001_9460 [Yersinia aldovae ATCC 35236]
Length = 373
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 88/201 (43%), Gaps = 16/201 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K+P+ + KN + N + D+ L +E + + + + +GQ Q +
Sbjct: 16 YWQKQPVVL----KNAFQNFVDPITPDE-LAGLAMEAEVDSRLVSHVNGQWQASNG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E G S+ + P +L + L + G PH
Sbjct: 67 -PFENFDHLGETGWSLLAQAVNHWHSPSAELVCPFRVLPDWRLDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G ++W+V + ++ P P E TPI+ LEPGD+LY+P
Sbjct: 126 IDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTIS 201
G+ H T+E +L+ ++
Sbjct: 183 PGFPHDG--FTHETALNYSVG 201
>gi|42525019|ref|NP_970399.1| hypothetical protein Bd3682 [Bdellovibrio bacteriovorus HD100]
gi|39577230|emb|CAE81053.1| hypothetical protein Bd3682 [Bdellovibrio bacteriovorus HD100]
Length = 308
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 54/246 (21%), Positives = 105/246 (42%), Gaps = 40/246 (16%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYE-DGQRQTMDVEGR 60
HW EPL+I ++ +++ D L++ + ++ + L + D + ++ +E
Sbjct: 22 HWPVEPLFIP-ATPGKLQDIFALEQMQD-LKNLISARQRKVRACLPDFDDEYSSIHLE-- 77
Query: 61 ALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN----------TYQT 110
P YR ++ F + Q+ + + +++ G G Y T
Sbjct: 78 --PGDALKAYRNNMTLVFDSMQSQDSTIADMLGNVRADLGLVTGGAENDLCKARSIAYAT 135
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI---- 166
P G H+D FI+Q++G K W++ V+ R+++ EE+ +
Sbjct: 136 P-AGCGTRLHFDANANFIIQIKGTKTWRLAPNESVEFPTERFTT---GSEEMPAALEKQC 191
Query: 167 ------------LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
+ V ++PG +L++PRGY H+ T T E SL + + Q T W D+ K
Sbjct: 192 HAHLIDALDEDSMKVVMKPGCVLFVPRGYWHE--TTTEEESLSLNFTFSQPT-WADVFTK 248
Query: 215 AMPKAL 220
++ + L
Sbjct: 249 SLQEVL 254
>gi|300312570|ref|YP_003776662.1| hypothetical protein Hsero_3273 [Herbaspirillum seropedicae SmR1]
gi|300075355|gb|ADJ64754.1| conserved hypothetical protein [Herbaspirillum seropedicae SmR1]
Length = 386
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 78/324 (24%), Positives = 127/324 (39%), Gaps = 54/324 (16%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W K+PL I + L + D RD+V ++ IT E G +
Sbjct: 31 DYWHKKPLLIRNAIPGFSAPLPRQELFDLSRRDDV----ESRLITHREGGWDMQHGPFNK 86
Query: 61 ALP--QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
P Q W +G ++ ++ + + +A L +L + G
Sbjct: 87 LPPLKQKAWTLLVQGVNLHHPAADALLRQFRFVPDARLDDLMISYAS--------DGGGV 138
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
PH+D + F+LQ G+++W++ ++E +P +F EE LEPGD
Sbjct: 139 GPHFDSYDVFLLQAHGQRRWRISAQDDLTLEEGVPLKILSHFQAEE------EFVLEPGD 192
Query: 176 LLYLPRGYIHQASTVTN--EHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR- 232
+LYLP Y H V +S+ YQ+ G + LEF
Sbjct: 193 MLYLPPHYAHDGVAVGECMTYSIGFRAPTYQEL-----------------GESFLEFMMD 235
Query: 233 --GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHD 290
LP Y +P ++QAD +A ++D L K + + D DD +G+ L +
Sbjct: 236 TIDLPGRYADPDLKTSARPAELQADMIA---RIRDELNK-IRFTD-DDITIFLGQHL-SE 289
Query: 291 ALPPVL---SPEELQCSVFENGLR 311
P V P+EL + F G +
Sbjct: 290 PKPSVFFEAPPQELSFARFFQGAK 313
>gi|422022984|ref|ZP_16369490.1| hypothetical protein OO7_10582 [Providencia sneebia DSM 19967]
gi|414094714|gb|EKT56378.1| hypothetical protein OO7_10582 [Providencia sneebia DSM 19967]
Length = 374
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q EG+++W+V + ++ P P+ Q + I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGEGRRRWRVGEKHPMKQHCPH---PDLLQVQPFDAI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNYSVG 201
>gi|375260293|ref|YP_005019463.1| putative enzyme [Klebsiella oxytoca KCTC 1686]
gi|397657374|ref|YP_006498076.1| hypothetical protein A225_2356 [Klebsiella oxytoca E718]
gi|423102531|ref|ZP_17090233.1| hypothetical protein HMPREF9686_01137 [Klebsiella oxytoca 10-5242]
gi|365909771|gb|AEX05224.1| putative enzyme [Klebsiella oxytoca KCTC 1686]
gi|376388007|gb|EHT00708.1| hypothetical protein HMPREF9686_01137 [Klebsiella oxytoca 10-5242]
gi|394345830|gb|AFN31951.1| hypothetical protein A225_2356 [Klebsiella oxytoca E718]
Length = 373
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 115/274 (41%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ Y+ +L +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
YA +R P DI L + L+D++ L+
Sbjct: 228 YADPDVPSREHPADILPAEL---DRLRDMMLGLI 258
>gi|343513942|ref|ZP_08751030.1| hypothetical protein VIBRN418_07521 [Vibrio sp. N418]
gi|342801072|gb|EGU36563.1| hypothetical protein VIBRN418_07521 [Vibrio sp. N418]
Length = 379
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 85/197 (43%), Gaps = 24/197 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG-- 59
+W+K+P+ I KN + + + ++I + ++ ID + Q G
Sbjct: 15 YWQKKPVVI----KNAFKDFVDPISAEEIAG---LSLEEMIDSRFVSNKDSQWTAQHGPF 67
Query: 60 -----RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPT 113
LP+ + C+ I+P +QL L ++ C+ +
Sbjct: 68 EESFFDDLPETHCQLIVQACNHWHSETAQLIEPFKQLPQWLFDDVMVCYSAPHG------ 121
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH D + FI+Q +GK++W+V + + +Y Q E I+ TL+P
Sbjct: 122 --GVGPHIDQYDVFIIQGQGKREWRVG-AKDIGQYQEVIQGGALRQIEGFDSIIDETLQP 178
Query: 174 GDLLYLPRGYIHQASTV 190
GD+LY+P G+ H+ +T+
Sbjct: 179 GDMLYIPPGFPHEGTTL 195
>gi|419957457|ref|ZP_14473523.1| cupin [Enterobacter cloacae subsp. cloacae GS1]
gi|388607615|gb|EIM36819.1| cupin [Enterobacter cloacae subsp. cloacae GS1]
Length = 373
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 61/274 (22%), Positives = 116/274 (42%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGISNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ +A +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGF-----------RAPSGREMISGFADYVLQREL--GSYR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
Y+ AR P DI + L + L+ ++ L+
Sbjct: 228 YSDPDVPAREHPADILPEEL---DKLRGMMLDLI 258
>gi|402842981|ref|ZP_10891384.1| cupin domain protein, PF06172 family [Klebsiella sp. OBRC7]
gi|402278367|gb|EJU27431.1| cupin domain protein, PF06172 family [Klebsiella sp. OBRC7]
Length = 373
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 115/274 (41%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ Y+ +L +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
YA +R P DI L + L+D++ L+
Sbjct: 228 YADPDVPSREHPADILPAEL---DRLRDMMLGLI 258
>gi|226330501|ref|ZP_03806019.1| hypothetical protein PROPEN_04419 [Proteus penneri ATCC 35198]
gi|225201296|gb|EEG83650.1| cupin family protein [Proteus penneri ATCC 35198]
Length = 373
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q +G+++W+V + ++ P P+ Q + I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGQGRRRWRVGEKFPMKQHCPH---PDLLQVDPFEAI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+ LEPGD+LY+P G+ H+ + E SL+ ++ A
Sbjct: 169 IDEELEPGDILYIPPGFPHEGYAI--EESLNYSVGFRSPNA 207
>gi|213648227|ref|ZP_03378280.1| hypothetical protein SentesTy_13599 [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
Length = 294
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|89074234|ref|ZP_01160724.1| hypothetical protein SKA34_13465 [Photobacterium sp. SKA34]
gi|89049938|gb|EAR55472.1| hypothetical protein SKA34_13465 [Photobacterium sp. SKA34]
Length = 377
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 58/131 (44%), Gaps = 16/131 (12%)
Query: 68 DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAF 127
+H+ EG + + P +QL L + +Y TP G PH D + F
Sbjct: 85 NHWHEGAA-------QLVTPFRQLPGWLFDDLMI-----SYSTP--GGGVGPHIDQYDVF 130
Query: 128 ILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQA 187
I+Q GK+ W+V R D+Y + Q + PI+ L+ GD+LY+P G+ H
Sbjct: 131 IVQGSGKRHWRVGEKR--DDYEEEFRHSALRQIKGFEPIIDEVLDTGDILYIPPGFPHDG 188
Query: 188 STVTNEHSLHV 198
+V S V
Sbjct: 189 YSVETSMSFSV 199
>gi|88860119|ref|ZP_01134758.1| putative enzyme with RmlC-like domain [Pseudoalteromonas tunicata
D2]
gi|88818113|gb|EAR27929.1| putative enzyme with RmlC-like domain [Pseudoalteromonas tunicata
D2]
Length = 382
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 89/199 (44%), Gaps = 27/199 (13%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFK---KNIDITLYEDGQRQTMDV- 57
+W+K+PL I + +N+ + I+ ++ + ++ IE + D G + D+
Sbjct: 18 YWQKKPLLIKQGFRNF-QDPISPDELAGLAQEECIESRIITNTNDKWKAHHGPFEEFDLL 76
Query: 58 ---EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
L Q V DH+ S + L P +I N + +L F TP
Sbjct: 77 TATHSTLLVQAV-DHWHPD-SAQLLEPFRFIP-----NWRIDDLMISF------STP--G 121
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH D + FI+Q EGK+ W+V L + ++E +P + Q ++ LEP
Sbjct: 122 GGVGPHLDQYDVFIIQGEGKRHWRVGLAEQSLEEVIP---CKDLLQITAFEAVIDAVLEP 178
Query: 174 GDLLYLPRGYIHQASTVTN 192
GD+LY+P G H+ V N
Sbjct: 179 GDILYIPPGCPHEGYAVEN 197
>gi|415949261|ref|ZP_11556860.1| Cupin 4 family protein [Herbaspirillum frisingense GSF30]
gi|407757773|gb|EKF67697.1| Cupin 4 family protein [Herbaspirillum frisingense GSF30]
Length = 373
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 127/321 (39%), Gaps = 48/321 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W K+PL I R ++ + +++ D+ R +E + IT EDG +
Sbjct: 18 DYWHKKPLLI-RNAIPGFSPPLPRQELFDLARHEDVESRL---ITHREDGWDMQHGPFNK 73
Query: 61 ALP--QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
P Q W +G ++ ++ + + +A L +L +Y T G
Sbjct: 74 LPPLKQKAWTLLVQGVNLHHPAADALLRQFRFVPDARLDDLM------ISYAT--DGGGV 125
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
PH+D + F+LQ G+++W++ ++E +P +F EE LEPGD
Sbjct: 126 GPHFDSYDVFLLQAHGQRRWRISAQNDLTLEEGVPLKILSHFQPEE------EFILEPGD 179
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRR--- 232
+LYLP Y H V + V A Q G + LEF
Sbjct: 180 MLYLPPHYAHDGVAVGECMTYSVGFRA---------------PAYQELGESFLEFMMDTI 224
Query: 233 GLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL 292
LP Y +P ++QAD M ++D L K + + D DD +G+ L
Sbjct: 225 DLPGRYADPDLKTSARPAELQAD---MITRIRDELNK-IRFTD-DDITIFLGEHLSEPKP 279
Query: 293 PPVLSP--EELQCSVFENGLR 311
P EE+ + F G +
Sbjct: 280 SVFFEPRGEEISFARFFQGAK 300
>gi|354613741|ref|ZP_09031647.1| Cupin 4 family protein [Saccharomonospora paurometabolica YIM
90007]
gi|353221894|gb|EHB86226.1| Cupin 4 family protein [Saccharomonospora paurometabolica YIM
90007]
Length = 392
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 10/116 (8%)
Query: 103 VGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFSQEE 161
V NTY T GF H+DD I+Q+ G K+W+V P R+ Y R + PN E
Sbjct: 130 VQVNTYLTTGNTAGFDLHWDDHPTVIVQIAGSKQWEVRGPSRLAPLY--RDTGPN--TEP 185
Query: 162 IGTPILTVTLEPGDLLYLPRGYIHQASTVTNEH---SLHVTISVYQKTA--WIDLL 212
+ + L G+++ +PRG+ H A+ ++ SLH+T + T WI L
Sbjct: 186 PDEVVWSGALRAGEVISIPRGWWHSATRTDHDDDRLSLHLTFGMTPHTGIDWISWL 241
>gi|343512857|ref|ZP_08749972.1| hypothetical protein VIS19158_15204 [Vibrio scophthalmi LMG 19158]
gi|342794297|gb|EGU30070.1| hypothetical protein VIS19158_15204 [Vibrio scophthalmi LMG 19158]
Length = 379
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 85/197 (43%), Gaps = 24/197 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG-- 59
+W+K+P+ I KN + + + ++I + ++ ID + Q G
Sbjct: 15 YWQKKPVVI----KNAFKDFVDPISAEEIAG---LSLEEMIDSRFVTNKDSQWTAQHGPF 67
Query: 60 -----RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPT 113
LP+ + C+ I+P +QL L ++ C+ +
Sbjct: 68 EESFFDDLPETHCQLIVQACNHWHSETAQLIEPFKQLPQWLFDDVMVCYSAPHG------ 121
Query: 114 AQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH D + FI+Q +GK++W+V + + +Y Q E I+ TL+P
Sbjct: 122 --GVGPHIDQYDVFIIQGQGKREWRVG-AKDIGQYQEVIQGGALRQIEGFDSIIDETLQP 178
Query: 174 GDLLYLPRGYIHQASTV 190
GD+LY+P G+ H+ +T+
Sbjct: 179 GDMLYIPPGFPHEGTTL 195
>gi|317491837|ref|ZP_07950272.1| cupin superfamily protein [Enterobacteriaceae bacterium 9_2_54FAA]
gi|316920271|gb|EFV41595.1| cupin superfamily protein [Enterobacteriaceae bacterium 9_2_54FAA]
Length = 373
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 84/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVILKRGFKNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGRWQVAHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E SI + P L ++L G PH
Sbjct: 67 -PFESYDHLGENNWSILVQAVDHWHVPSSALMRPFRKLPDWRTDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKTPLRQHSPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|119944720|ref|YP_942400.1| cupin 4 family protein [Psychromonas ingrahamii 37]
gi|119863324|gb|ABM02801.1| Cupin 4 family protein [Psychromonas ingrahamii 37]
Length = 375
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 52/96 (54%), Gaps = 9/96 (9%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPN-FSQEEIGTP 165
+Y TP G PH D+ + FI+Q GK+ W+V LP +++ + E
Sbjct: 113 SYSTP--KGGVGPHIDNYDVFIIQGLGKRHWRVGDKGA----LPEFAAHDALKHCESFDA 166
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
I+ V LEPGD+LY+P GY H+ ++ E SL+ +I
Sbjct: 167 IIDVELEPGDILYIPAGYPHEGYSI--ETSLNYSIG 200
>gi|307131932|ref|YP_003883948.1| hypothetical protein Dda3937_02402 [Dickeya dadantii 3937]
gi|306529461|gb|ADM99391.1| hypothetical protein Dda3937_02402 [Dickeya dadantii 3937]
Length = 381
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V ++ ++ P P + TPI+ LEPGD
Sbjct: 129 GVGPHIDQYDVFIIQGMGRRRWRVGDKLLMRQFCPH---PALLHVDPFTPIIDEELEPGD 185
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 186 ILYIPPGFPHDGFT 199
>gi|332162076|ref|YP_004298653.1| hypothetical protein YE105_C2454 [Yersinia enterocolitica subsp.
palearctica 105.5R(r)]
gi|386308002|ref|YP_006004058.1| hypothetical protein [Yersinia enterocolitica subsp. palearctica
Y11]
gi|418240503|ref|ZP_12867042.1| hypothetical protein IOK_03636 [Yersinia enterocolitica subsp.
palearctica PhRBD_Ye1]
gi|433549953|ref|ZP_20505997.1| FIG002776: hypothetical protein [Yersinia enterocolitica IP 10393]
gi|318606124|emb|CBY27622.1| hypothetical protein Y11_05591 [Yersinia enterocolitica subsp.
palearctica Y11]
gi|325666306|gb|ADZ42950.1| hypothetical protein YE105_C2454 [Yersinia enterocolitica subsp.
palearctica 105.5R(r)]
gi|330860075|emb|CBX70401.1| uncharacterized protein ycfD [Yersinia enterocolitica W22703]
gi|351780134|gb|EHB22219.1| hypothetical protein IOK_03636 [Yersinia enterocolitica subsp.
palearctica PhRBD_Ye1]
gi|431789088|emb|CCO69037.1| FIG002776: hypothetical protein [Yersinia enterocolitica IP 10393]
Length = 373
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 90/203 (44%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R K++ + L D L +E + + + +EDG+ Q +
Sbjct: 16 YWQKRPVILKRGFKHFIDPLSP-----DELAGLAMENEVDSRLVSHEDGRWQVAHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGETNWSLLVQAV-DHWHEPAA-------ALMRPFRPLSDWRMDDLMVSF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194
>gi|424799207|ref|ZP_18224749.1| FIG002776: hypothetical protein [Cronobacter sakazakii 696]
gi|423234928|emb|CCK06619.1| FIG002776: hypothetical protein [Cronobacter sakazakii 696]
Length = 373
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 139/342 (40%), Gaps = 48/342 (14%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ DV
Sbjct: 16 YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGK---WDVSHG- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N + V I +G D +R L G R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSQR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
Y+ AR P DI A E L+D++ L+ D D+ + G+ + S
Sbjct: 228 YSDPDVPARECPADIVP---AEVEKLRDMMLDLIR--DPDNFNEWFGEFITQ-------S 275
Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EL S E Q E+Y+A + DT VRL +R
Sbjct: 276 RHELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313
>gi|238785736|ref|ZP_04629710.1| hypothetical protein yberc0001_15190 [Yersinia bercovieri ATCC
43970]
gi|238713376|gb|EEQ05414.1| hypothetical protein yberc0001_15190 [Yersinia bercovieri ATCC
43970]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P E TPI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|237808290|ref|YP_002892730.1| cupin [Tolumonas auensis DSM 9187]
gi|237500551|gb|ACQ93144.1| Cupin 4 family protein [Tolumonas auensis DSM 9187]
Length = 381
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G ++W+V ++++ ++P E PI
Sbjct: 113 SYSVP--GGGVGPHIDQYDVFIIQGSGSRRWRVGADTPAEQFV---ATPGLLHVEQFEPI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
+ V L+ GD+LY+P G+ H +T S +
Sbjct: 168 IDVELQSGDILYIPPGFPHDGYAITEAMSYSI 199
>gi|425067746|ref|ZP_18470862.1| hypothetical protein HMPREF1311_00902 [Proteus mirabilis WGLW6]
gi|425072904|ref|ZP_18476010.1| hypothetical protein HMPREF1310_02343 [Proteus mirabilis WGLW4]
gi|404596678|gb|EKA97198.1| hypothetical protein HMPREF1310_02343 [Proteus mirabilis WGLW4]
gi|404600946|gb|EKB01371.1| hypothetical protein HMPREF1311_00902 [Proteus mirabilis WGLW6]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q +G+++W+V + ++ P P+ Q + I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHCPH---PDLLQVDPFDAI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+ LEPGD+LY+P G+ H+ + E SL+ ++ A
Sbjct: 169 IDEELEPGDILYIPPGFPHEGYAI--EESLNYSVGFRAPNA 207
>gi|417555762|ref|ZP_12206830.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
Naval-81]
gi|421457572|ref|ZP_15906907.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
IS-123]
gi|400206662|gb|EJO37635.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
IS-123]
gi|400389792|gb|EJP56840.1| cupin domain protein, PF06172 family [Acinetobacter baumannii
Naval-81]
Length = 382
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/178 (21%), Positives = 80/178 (44%), Gaps = 24/178 (13%)
Query: 57 VEGRALPQIVWDHYREGCSI--RFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+E R + +++++ R G ++ + + ++ + + A+ E VG Y +
Sbjct: 75 IEYRCITSVLYEYLRNGATLVRNRIKNEPFVDQISKQIANFAEA-RTLVGG--YAAFSSK 131
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP--------I 166
+ H+D + + +QL G+K+W + P + P F Q+ P
Sbjct: 132 SSYKSHWDTRDVYAVQLLGRKRWILRKPN--------FEFPLFMQQTKNFPDIKEPEEIY 183
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
+ V LE GD+LY+PRG+ H + +E + H+ ++ + T ++ L+ MP L
Sbjct: 184 MDVILEAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNIMPSILDC 240
>gi|422009020|ref|ZP_16356003.1| hypothetical protein OOC_13169 [Providencia rettgeri Dmel1]
gi|414092838|gb|EKT54510.1| hypothetical protein OOC_13169 [Providencia rettgeri Dmel1]
Length = 374
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 94/209 (44%), Gaps = 32/209 (15%)
Query: 2 HWEKEPLYISRKQKNYYNNL----ITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV 57
HW+K PL I + N+ + + + ++D + ++ K+ I + G ++ D
Sbjct: 16 HWQKRPLLIKKGFANFIDPISPDELAGLAMEDEVDSRLVSHKEGIWQVGH--GPFESYDH 73
Query: 58 EGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPP 112
G L Q V DH+ S +KP + L + + +L +Y P
Sbjct: 74 LGEENWSILVQAV-DHWHHPSS-------ALMKPFRVLSDWRMDDLM------ISYSVP- 118
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G PH D + FI+Q EG++ W+V + ++ P P+ Q + I+ +E
Sbjct: 119 -GGGVGPHLDQYDVFIIQGEGRRHWRVGEKIPMKQHCPH---PDLLQVQPFDAIIDEEME 174
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
PGD+LY+P G+ H+ + E SL+ ++
Sbjct: 175 PGDILYIPPGFPHEGYAI--EPSLNYSVG 201
>gi|197284768|ref|YP_002150640.1| hypothetical protein PMI0883 [Proteus mirabilis HI4320]
gi|227357774|ref|ZP_03842123.1| cupin 4 family protein [Proteus mirabilis ATCC 29906]
gi|194682255|emb|CAR41987.1| conserved hypothetical protein [Proteus mirabilis HI4320]
gi|227162103|gb|EEI47117.1| cupin 4 family protein [Proteus mirabilis ATCC 29906]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q +G+++W+V + ++ P P+ Q + I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHCPH---PDLLQVDPFDAI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+ LEPGD+LY+P G+ H+ + E SL+ ++ A
Sbjct: 169 IDEELEPGDILYIPPGFPHEGYAI--EESLNYSVGFRAPNA 207
>gi|399021420|ref|ZP_10723526.1| hypothetical protein PMI16_04492 [Herbaspirillum sp. CF444]
gi|398091873|gb|EJL82299.1| hypothetical protein PMI16_04492 [Herbaspirillum sp. CF444]
Length = 374
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 87/198 (43%), Gaps = 29/198 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD---- 56
D+W K+PL I R+ + L++ + + D+ + +E + + + G+ MD
Sbjct: 19 DYWHKKPLLI-RQAIPGFTPLLSRQALFDMASRDDVESR-----LIQQHGEHWKMDHGPL 72
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
+ AL Q W +G ++ ++ + + +A L +L +
Sbjct: 73 SKLPALKQKEWTMLVQGVNLHDDAADELLRRFRFIPDARLDDLMISYAS--------DGG 124
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR---MVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G PH+D + F+LQ G+++W++ + +VD +P NF E+ LE
Sbjct: 125 GVGPHFDSYDVFLLQAHGRRRWRISAQKDLSLVDG-MPLKILKNFQSEQ------EFVLE 177
Query: 173 PGDLLYLPRGYIHQASTV 190
PGD+LYLP Y H V
Sbjct: 178 PGDMLYLPPHYAHDGVAV 195
>gi|238799321|ref|ZP_04642763.1| hypothetical protein ymoll0001_40500 [Yersinia mollaretii ATCC
43969]
gi|238716837|gb|EEQ08711.1| hypothetical protein ymoll0001_40500 [Yersinia mollaretii ATCC
43969]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P E TPI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|417792259|ref|ZP_12439641.1| hypothetical protein CSE899_17031 [Cronobacter sakazakii E899]
gi|429115173|ref|ZP_19176091.1| FIG002776: hypothetical protein [Cronobacter sakazakii 701]
gi|449308617|ref|YP_007440973.1| cupin family protein [Cronobacter sakazakii SP291]
gi|333953663|gb|EGL71583.1| hypothetical protein CSE899_17031 [Cronobacter sakazakii E899]
gi|426318302|emb|CCK02204.1| FIG002776: hypothetical protein [Cronobacter sakazakii 701]
gi|449098650|gb|AGE86684.1| cupin family protein [Cronobacter sakazakii SP291]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 139/342 (40%), Gaps = 48/342 (14%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ DV
Sbjct: 16 YWQKRPVVLKRGIKNFIDPL-SPDELAGLAMENEVDSR----LVSHQDGK---WDVSHG- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N + V I +G D +R L G R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSHR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
Y+ AR P DI A E L+D++ L+ D D+ + G+ + S
Sbjct: 228 YSDPDVPARECPADIVP---AEVEKLRDMMLDLIR--DPDNFNEWFGEFITQ-------S 275
Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EL S E Q E+Y+A + DT VRL +R
Sbjct: 276 RHELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313
>gi|423119781|ref|ZP_17107465.1| hypothetical protein HMPREF9690_01787 [Klebsiella oxytoca 10-5246]
gi|376397477|gb|EHT10109.1| hypothetical protein HMPREF9690_01787 [Klebsiella oxytoca 10-5246]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 115/274 (41%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + + D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFVDPISP-----DELAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLSENNWSLLVQAVNNWHEPTAALMQPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVDPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ Y+ +L +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
YA +R P DI L + L+D++ L+
Sbjct: 228 YADPDVPSRDHPADILPAEL---DRLRDMMLGLI 258
>gi|422013902|ref|ZP_16360518.1| hypothetical protein OOA_04067 [Providencia burhodogranariea DSM
19968]
gi|414101924|gb|EKT63520.1| hypothetical protein OOA_04067 [Providencia burhodogranariea DSM
19968]
Length = 374
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 50/95 (52%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q EG++ W+V + ++ P P+ Q E I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGEGRRHWRVGEKVPMKQHCPH---PDLLQVEPFDAI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNYSVG 201
>gi|423107855|ref|ZP_17095550.1| hypothetical protein HMPREF9687_01101 [Klebsiella oxytoca 10-5243]
gi|376386588|gb|EHS99299.1| hypothetical protein HMPREF9687_01101 [Klebsiella oxytoca 10-5243]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 115/274 (41%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ Y+ +L +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVG-YRAPNARELF----------SGFADYILQREL--GSQR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
YA +R P DI L + L+D++ L+
Sbjct: 228 YADPDVPSREHPADILPVEL---DRLRDMMLGLI 258
>gi|421728564|ref|ZP_16167717.1| putative enzyme [Klebsiella oxytoca M5al]
gi|423128639|ref|ZP_17116318.1| hypothetical protein HMPREF9694_05330 [Klebsiella oxytoca 10-5250]
gi|376393121|gb|EHT05782.1| hypothetical protein HMPREF9694_05330 [Klebsiella oxytoca 10-5250]
gi|410370728|gb|EKP25456.1| putative enzyme [Klebsiella oxytoca M5al]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNNWHQPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|238754406|ref|ZP_04615762.1| hypothetical protein yruck0001_18110 [Yersinia ruckeri ATCC 29473]
gi|238707439|gb|EEP99800.1| hypothetical protein yruck0001_18110 [Yersinia ruckeri ATCC 29473]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 90/203 (44%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R KN+ + + D L +E + + + ++DG+ Q +
Sbjct: 16 YWQKRPVILKRGFKNFIDPISP-----DELAGLAMESEVDSRLVSHQDGRWQVGHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L +Y
Sbjct: 71 YDHLGETNWSLLVQAV-DHWHEPSA-------ALMRPFRALSDWRMDDLM------ISYS 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH D + FI+Q G+++W+V + ++ P PN Q I+
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGSGRRQWRVGEKVPMKQHCPH---PNLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ + N
Sbjct: 172 EMEPGDILYIPPGFPHEGYALEN 194
>gi|156934387|ref|YP_001438303.1| hypothetical protein ESA_02218 [Cronobacter sakazakii ATCC BAA-894]
gi|389841361|ref|YP_006343445.1| cupin family protein [Cronobacter sakazakii ES15]
gi|156532641|gb|ABU77467.1| hypothetical protein ESA_02218 [Cronobacter sakazakii ATCC BAA-894]
gi|387851837|gb|AFJ99934.1| cupin family protein [Cronobacter sakazakii ES15]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 139/342 (40%), Gaps = 48/342 (14%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ DV
Sbjct: 16 YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGK---WDVSHG- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N + V I +G D +R L G R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSHR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
Y+ AR P DI A E L+D++ L+ D D+ + G+ + S
Sbjct: 228 YSDPDVPARECPADIVP---AEVEKLRDMMLDLIR--DPDNFNEWFGEFITQ-------S 275
Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EL S E Q E+Y+A + DT VRL +R
Sbjct: 276 RHELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313
>gi|429119199|ref|ZP_19179929.1| FIG002776: hypothetical protein [Cronobacter sakazakii 680]
gi|426326310|emb|CCK10666.1| FIG002776: hypothetical protein [Cronobacter sakazakii 680]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 139/342 (40%), Gaps = 48/342 (14%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ DV
Sbjct: 16 YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGK---WDVSHG- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N + V I +G D +R L G R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSHR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDALPPVLS 297
Y+ AR P DI A E L+D++ L+ D D+ + G+ + S
Sbjct: 228 YSDPDVPARECPADIVP---AEVEKLRDMMLDLIR--DPDNFNEWFGEFITQ-------S 275
Query: 298 PEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EL S E Q E+Y+A + DT VRL +R
Sbjct: 276 RHELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313
>gi|188026123|ref|ZP_02960948.2| hypothetical protein PROSTU_02934 [Providencia stuartii ATCC 25827]
gi|188021702|gb|EDU59742.1| cupin family protein [Providencia stuartii ATCC 25827]
Length = 392
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q EG+++W+V + ++ P P+ Q + I
Sbjct: 132 SYSVP--GGGVGPHLDQYDVFIIQGEGRRRWRVGEKVPMKQHCPH---PDLLQVQPFDAI 186
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 187 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNYSVG 219
>gi|153947083|ref|YP_001400583.1| cupin family protein [Yersinia pseudotuberculosis IP 31758]
gi|152958578|gb|ABS46039.1| cupin family protein [Yersinia pseudotuberculosis IP 31758]
Length = 373
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 94/203 (46%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
+W+K P+ + R KN+ + L + ++ + +N ++ + + +E+G+ ++
Sbjct: 16 YWQKRPVILKRGFKNFIDPL-SPDELAGLAMENEVDSR----LVSHENGRWHVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P + L + + +L F
Sbjct: 71 FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRSLSDWRMDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+LY+P G+ H+ ++ N
Sbjct: 172 EMEPGDILYIPPGFPHEGYSLEN 194
>gi|392308693|ref|ZP_10271227.1| RmlC-like domain-containing protein [Pseudoalteromonas citrea NCIMB
1889]
Length = 384
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
+Y TP G PH D + FI+Q +GK+ W+V LP D L +++ + + Q E
Sbjct: 116 SYSTP--HGGVGPHLDQYDVFIIQGQGKRHWRVGLP---DANLKQFAQNKSLLQVEQFDA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
+ LEPGD+LY+P G H+ V N
Sbjct: 171 EIDAILEPGDILYIPPGCPHEGYAVEN 197
>gi|209695412|ref|YP_002263341.1| hypothetical protein VSAL_I1952 [Aliivibrio salmonicida LFI1238]
gi|208009364|emb|CAQ79637.1| conserved hypothetical protein [Aliivibrio salmonicida LFI1238]
Length = 390
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 88/195 (45%), Gaps = 26/195 (13%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIE--FKKNIDITL-YEDG--QRQTM 55
++W+K+P+ I +N+ + +T +++ + +N ++ F N++ E G +
Sbjct: 14 EYWQKKPVIIKGGFENF-QDPVTPEELAGLTLENEVDSGFVSNLNGEWKAEHGPLTEELF 72
Query: 56 DVEGRALPQIV---WDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
D G I+ +H+ EG + F KP +Q+ L + +Y P
Sbjct: 73 DTLGETNWSIIVQAANHWHEGAAELF-------KPFKQMPNWLFDDVMI-----SYSVPD 120
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G PH D + FI+Q +GK+ W+V + EY + Q PI+ LE
Sbjct: 121 G--GVGPHIDQYDVFIIQGQGKRHWRV---GDIGEYEEEHRHGTLKQITGFDPIIDQILE 175
Query: 173 PGDLLYLPRGYIHQA 187
PGD+LY+P G+ H
Sbjct: 176 PGDILYIPPGFPHDG 190
>gi|345298830|ref|YP_004828188.1| cupin [Enterobacter asburiae LF7a]
gi|345092767|gb|AEN64403.1| Cupin 4 family protein [Enterobacter asburiae LF7a]
Length = 373
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/274 (22%), Positives = 115/274 (41%), Gaps = 35/274 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGISNFIDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N SL+ ++ +A +G D +R L G R
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGF-----------RAPSGREMISGFADYVLQREL--GSYR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
Y+ AR P DI L + L+ ++ L+
Sbjct: 228 YSDPDVPAREHPADILPQEL---DKLRGMMLDLI 258
>gi|323456451|gb|EGB12318.1| hypothetical protein AURANDRAFT_61265 [Aureococcus anophagefferens]
Length = 390
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 57/127 (44%), Gaps = 12/127 (9%)
Query: 88 LQQLNASLQELFGCFVGANTYQTPPTAQGFAP-HYDDIEAFILQLEGKKKWKVYLPRMVD 146
+ L+ + + FG N Y T A+ AP H D ++Q G K+W+V+ P D
Sbjct: 38 VAALSLAAVDAFGLPACVNLYATGAGARVSAPPHTDTQRVLVMQCHGNKRWRVWRPPRPD 97
Query: 147 EYL---PRYSSPNFSQEEIG----TPILTVTLEPGDLLYLPRGYIHQAST----VTNEHS 195
P E+G TP V L PGD LY+P G+ H T V +E S
Sbjct: 98 RRPGADPLARGKGDDVLELGELEETPAADVVLTPGDCLYVPAGWPHTTDTLECAVDDEPS 157
Query: 196 LHVTISV 202
+H+T+ V
Sbjct: 158 VHLTLGV 164
>gi|386741704|ref|YP_006214883.1| hypothetical protein S70_01480 [Providencia stuartii MRSN 2154]
gi|384478397|gb|AFH92192.1| hypothetical protein S70_01480 [Providencia stuartii MRSN 2154]
Length = 374
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q EG+++W+V + ++ P P+ Q + I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGEGRRRWRVGEKVPMKQHCPH---PDLLQVQPFDAI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNYSVG 201
>gi|219120835|ref|XP_002185649.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582498|gb|ACI65119.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 626
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 19/107 (17%)
Query: 119 PHYDDIEAFILQLEGKKKWKVYLP--------------RMVDEYLPRYSSPNFSQEEIGT 164
PH D + ++Q G+K WKVY P D+ LP Y + ++G
Sbjct: 246 PHTDKQDVAVVQTSGRKHWKVYSPPNPAMKPTVDIFARGKGDDSLPLY----ILESDLGC 301
Query: 165 PIL-TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWID 210
+L TL PGD++++P + H STVT + S H + T ID
Sbjct: 302 QLLLETTLNPGDVMFVPAAFPHTTSTVTEDDSTHADKTSIHLTLGID 348
>gi|193078048|gb|ABO12976.2| hypothetical protein A1S_2559 [Acinetobacter baumannii ATCC 17978]
Length = 382
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/173 (22%), Positives = 80/173 (46%), Gaps = 14/173 (8%)
Query: 57 VEGRALPQIVWDHYREGCSI--RFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+E R + +++ + R G ++ ++ + ++ + + A+ E VG Y +
Sbjct: 75 IEYRCITSVLYKYLRNGATLVRNRISNEPFVDQISKQIATFAEA-RTLVGG--YAAFSSK 131
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTL 171
+ H+D + + +QL G+K+W + P P Y + +I P + V L
Sbjct: 132 SSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKNFPDIKEPEEIYMDVIL 188
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
E GD+LY+PRG+ H + +E + H+ ++ + T ++ L+ MP L
Sbjct: 189 EAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNIMPGILDC 240
>gi|386313839|ref|YP_006010004.1| transcription factor jumonji jmjC domain protein [Shewanella
putrefaciens 200]
gi|319426464|gb|ADV54538.1| transcription factor jumonji jmjC domain protein [Shewanella
putrefaciens 200]
Length = 386
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/218 (23%), Positives = 94/218 (43%), Gaps = 38/218 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K+PL I R+ + NL++ +++ + D ++E ++ +Y+ + +
Sbjct: 17 YWQKKPLVI-RQGFKQFQNLVSPEELAGLAMDELVESRR-----VYQQAGQWHAEFG--- 67
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASL---QELFGCFVGAN---------TYQ 109
+D Y + L + + +Q LN + + L CF +Y
Sbjct: 68 ----PFDSYEK------LGERDWTLIVQALNNWVPDAEALIQCFDFIPRWRFDDVMVSYA 117
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
TP G PH D + FI Q G+++W+V E+ + P E PI+
Sbjct: 118 TP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHREFA---AHPALLHTEAFEPIIDT 172
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
L PGD+LY+P G+ H +T E SL ++ +A
Sbjct: 173 ELLPGDILYIPPGFPHDG--ITLEESLSFSVGYRTASA 208
>gi|395232008|ref|ZP_10410260.1| cupin [Enterobacter sp. Ag1]
gi|394733512|gb|EJF33129.1| cupin [Enterobacter sp. Ag1]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/192 (22%), Positives = 82/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + + D L +E + + + ++ G+ Q
Sbjct: 16 YWQKRPVVLKRGFKNFIDPISP-----DELAGLAMESEVDSRLVSHQGGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFQSYDHLGENNWSLLVQAVNNWHEPSSALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVAMKQHCPH---PDLLQVEPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|354597768|ref|ZP_09015785.1| Cupin 4 family protein [Brenneria sp. EniD312]
gi|353675703|gb|EHD21736.1| Cupin 4 family protein [Brenneria sp. EniD312]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 95/218 (43%), Gaps = 38/218 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K+P+ I R +N I D+ L +E + + + + DG+ Q +
Sbjct: 16 YWQKQPVIIKRG----FNRFIDPISPDE-LAGLALENEVDSRLVSHNDGRWQVSHGPFDS 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E ++P +QL + L +L F
Sbjct: 71 FDHLGENNWSLLVQAV-DHWHEPS-------LALMRPFRQLPDWRLDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH D + FI+Q G+++W+V + ++ P P+ Q E I+
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVEPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+EPGD+LY+P G+ H ++ N SL+ ++ A
Sbjct: 172 EMEPGDILYIPPGFPHDGYSLEN--SLNYSVGFRAPNA 207
>gi|146310156|ref|YP_001175230.1| cupin 4 family protein [Enterobacter sp. 638]
gi|145317032|gb|ABP59179.1| Cupin 4 family protein [Enterobacter sp. 638]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 49/95 (51%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G ++W+V + ++ P P + TPI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFTPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ LEPGD+LY+P G+ H T+E +L+ ++
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201
>gi|429090897|ref|ZP_19153599.1| FIG002776: hypothetical protein [Cronobacter dublinensis 1210]
gi|426744552|emb|CCJ79712.1| FIG002776: hypothetical protein [Cronobacter dublinensis 1210]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/198 (21%), Positives = 88/198 (44%), Gaps = 14/198 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ +
Sbjct: 16 YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFQSYDHLGENNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHV 198
G+ H+ ++ N + V
Sbjct: 183 PGFPHEGYSLENSMNYSV 200
>gi|238758416|ref|ZP_04619593.1| hypothetical protein yaldo0001_18150 [Yersinia aldovae ATCC 35236]
gi|238703320|gb|EEP95860.1| hypothetical protein yaldo0001_18150 [Yersinia aldovae ATCC 35236]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 92/203 (45%), Gaps = 36/203 (17%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ-------T 54
+W+K P+ + R N+ + L + ++ + +N ++ + + +EDG+ Q +
Sbjct: 16 YWQKRPVILKRGFANFIDPL-SPDELAGLAMENEVDSR----LVSHEDGRWQVAHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G L Q V DH+ E + ++P ++L + + +L F
Sbjct: 71 FDHLGENNWSLLVQAV-DHWHEPAA-------ALMRPFRKLSDWRMDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH+D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHFDQYDVFIIQGSGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTN 192
+EPGD+ Y+P G+ H+ + N
Sbjct: 172 EMEPGDIFYIPPGFPHEGYALEN 194
>gi|261346016|ref|ZP_05973660.1| cupin family protein [Providencia rustigianii DSM 4541]
gi|282565901|gb|EFB71436.1| cupin family protein [Providencia rustigianii DSM 4541]
Length = 374
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 85/202 (42%), Gaps = 16/202 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K PL I K + I D+ L +E + + + ++DG Q G
Sbjct: 16 HWQKRPLLI----KQGFTRFIDPISPDE-LAGLAMEDEVDSRLVSHKDGNWQV--AHG-- 66
Query: 62 LPQIVWDHY-REGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E SI + P QL + L + G PH
Sbjct: 67 -PFKSYDHLGDENWSILVQAVDHWHHPSAQLMKPFRVLSDWRMDDLMISYSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q EG+++W+V + ++ P P+ Q I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGEGRRRWRVGEKIPMKQHCPH---PDLLQVHPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISV 202
G+ H+ + E SL+ ++
Sbjct: 183 PGFPHEGYAI--EPSLNFSVGF 202
>gi|389700105|ref|ZP_10185156.1| hypothetical protein LepocDRAFT_00002390 [Leptothrix ochracea L12]
gi|388591250|gb|EIM31508.1| hypothetical protein LepocDRAFT_00002390 [Leptothrix ochracea L12]
Length = 361
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 12/82 (14%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL----PRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F++Q+ G ++W++ RM D L P NF EE L
Sbjct: 115 GVGPHFDSYDVFLIQVHGCRRWRI--GRMDDAVLLPNLPVKIIANFRFEE------EWLL 166
Query: 172 EPGDLLYLPRGYIHQASTVTNE 193
EPGD+LYLP G+ H V E
Sbjct: 167 EPGDMLYLPPGWAHDGDAVGGE 188
>gi|343507743|ref|ZP_08745127.1| hypothetical protein VII00023_15076 [Vibrio ichthyoenteri ATCC
700023]
gi|342796556|gb|EGU32231.1| hypothetical protein VII00023_15076 [Vibrio ichthyoenteri ATCC
700023]
Length = 379
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +GK++WKV + V +Y Q E I+ TL PGD
Sbjct: 122 GVGPHIDQYDVFIVQGQGKRQWKVG-AKDVGQYQEVIQGGALRQIEGFDAIIDDTLMPGD 180
Query: 176 LLYLPRGYIHQASTV 190
+LY+P G+ H+ +T+
Sbjct: 181 MLYIPPGFPHEGTTL 195
>gi|386348868|ref|YP_006047116.1| cupin region [Rhodospirillum rubrum F11]
gi|346717304|gb|AEO47319.1| cupin region [Rhodospirillum rubrum F11]
Length = 262
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 60/125 (48%), Gaps = 11/125 (8%)
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPR-YS 153
L++ G V + + T G PHYDD + I+Q+ G+K WK+ + + R S
Sbjct: 77 LEQALGAVVTIDAIGSFGTVCGLPPHYDDRDLIIVQVAGRKHWKILGTPVEGPWRKRTMS 136
Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLE 213
P+ +E ++ GD+L++P G HQ V E SLH+ ++ + DLL+
Sbjct: 137 VPDTVTDEF-------VMQGGDMLFVPAGLYHQC--VPLEPSLHLG-ALITRPCGADLLK 186
Query: 214 KAMPK 218
P+
Sbjct: 187 MVQPR 191
>gi|354723018|ref|ZP_09037233.1| Cupin 4 family protein [Enterobacter mori LMG 25706]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNNWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|295096186|emb|CBK85276.1| Uncharacterized conserved protein [Enterobacter cloacae subsp.
cloacae NCTC 9394]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 74/159 (46%), Gaps = 24/159 (15%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
+LY+P G+ H+ ++ N SL+ ++ +A +G D +R L
Sbjct: 178 ILYIPPGFPHEGYSLEN--SLNYSVGF-----------RAPSGREMISGFADYVLQREL- 223
Query: 236 IGYLRYAG---LARGKPVDIQADRLAMKENLKDLLAKLV 271
G RY+ AR P DI + L + L+ ++ L+
Sbjct: 224 -GSYRYSDPDVPAREHPADILPEEL---DKLRGMMLDLI 258
>gi|444378829|ref|ZP_21178019.1| hypothetical protein D515_2835 [Enterovibrio sp. AK16]
gi|443677171|gb|ELT83862.1| hypothetical protein D515_2835 [Enterovibrio sp. AK16]
Length = 377
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI Q GK++W+V EY Q E PI+ LEPGD
Sbjct: 120 GVGPHIDQYDVFICQGMGKRRWRV---GDKGEYKDANHHTGLRQIEGFDPIIDEILEPGD 176
Query: 176 LLYLPRGYIHQASTV 190
+LY+P G+ H+ +T+
Sbjct: 177 ILYIPPGFPHEGNTI 191
>gi|429083173|ref|ZP_19146219.1| FIG002776: hypothetical protein [Cronobacter condimenti 1330]
gi|426547988|emb|CCJ72260.1| FIG002776: hypothetical protein [Cronobacter condimenti 1330]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/200 (22%), Positives = 84/200 (42%), Gaps = 14/200 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L D L +E + + + ++DG+ +
Sbjct: 16 YWQKRPVVLKRGIKNFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWEV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFQSYDHLSENNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKLPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTI 200
G+ H+ ++ N + V
Sbjct: 183 PGFPHEGYSLENSMNYSVGF 202
>gi|262274518|ref|ZP_06052329.1| hypothetical protein VHA_001495 [Grimontia hollisae CIP 101886]
gi|262221081|gb|EEY72395.1| hypothetical protein VHA_001495 [Grimontia hollisae CIP 101886]
Length = 377
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI Q GK++W+V EY Q E PI+ LEPGD
Sbjct: 120 GVGPHIDQYDVFICQGMGKRRWRV---GDKGEYKDANHHTGLRQIEGFDPIIDEILEPGD 176
Query: 176 LLYLPRGYIHQASTV 190
+LY+P G+ H+ +T+
Sbjct: 177 ILYIPPGFPHEGNTI 191
>gi|296102852|ref|YP_003612998.1| cupin 4 family protein [Enterobacter cloacae subsp. cloacae ATCC
13047]
gi|295057311|gb|ADF62049.1| cupin 4 family protein [Enterobacter cloacae subsp. cloacae ATCC
13047]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFDGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|126642594|ref|YP_001085578.1| hypothetical protein A1S_2559 [Acinetobacter baumannii ATCC 17978]
Length = 328
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/173 (22%), Positives = 80/173 (46%), Gaps = 14/173 (8%)
Query: 57 VEGRALPQIVWDHYREGCSI--RFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTA 114
+E R + +++ + R G ++ ++ + ++ + + A+ E VG Y +
Sbjct: 21 IEYRCITSVLYKYLRNGATLVRNRISNEPFVDQISKQIATFAEA-RTLVGG--YAAFSSK 77
Query: 115 QGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTL 171
+ H+D + + +QL G+K+W + P P Y + +I P + V L
Sbjct: 78 SSYKSHWDTRDVYAVQLLGRKRWILRKPNFE---FPLYMQQTKNFPDIKEPEEIYMDVIL 134
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMPKALQA 222
E GD+LY+PRG+ H + +E + H+ ++ + T ++ L+ MP L
Sbjct: 135 EAGDILYIPRGWWHDPLPL-DEETFHLAVATFAPTGFEYMRWLQNIMPGILDC 186
>gi|90578715|ref|ZP_01234525.1| hypothetical protein VAS14_03398 [Photobacterium angustum S14]
gi|90439548|gb|EAS64729.1| hypothetical protein VAS14_03398 [Photobacterium angustum S14]
Length = 377
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 4/92 (4%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y TP G PH D + FI+Q GK+ W+V R D+Y + Q + PI
Sbjct: 112 SYSTP--GGGVGPHIDQYDVFIVQGSGKRHWRVGEKR--DDYEEEFRHSALRQIKGFEPI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
+ LE GD+LY+P + H +V S V
Sbjct: 168 IDEVLETGDILYIPPSFPHDGYSVETSMSFSV 199
>gi|146311297|ref|YP_001176371.1| cupin 4 family protein [Enterobacter sp. 638]
gi|145318173|gb|ABP60320.1| Cupin 4 family protein [Enterobacter sp. 638]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGISNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMHPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|392978593|ref|YP_006477181.1| cupin 4 family protein [Enterobacter cloacae subsp. dissolvens SDM]
gi|392324526|gb|AFM59479.1| cupin 4 family protein [Enterobacter cloacae subsp. dissolvens SDM]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFDGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|417951601|ref|ZP_12594697.1| hypothetical protein VISP3789_08223 [Vibrio splendidus ATCC 33789]
gi|342804385|gb|EGU39703.1| hypothetical protein VISP3789_08223 [Vibrio splendidus ATCC 33789]
Length = 265
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 54/128 (42%), Gaps = 25/128 (19%)
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS 154
L EL C AN Y + + GF H DD + QL G+K+W
Sbjct: 117 LSELLHCESSANLYVSYGSVSGFGYHKDDHHVLVKQLAGQKRW----------------- 159
Query: 155 PNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEK 214
+ G + L GDLL++P+G H + +++ S HVT S+ T D L+
Sbjct: 160 ------QFGESQDDIVLNEGDLLFVPKGVAHNPVSESHK-SFHVTYSIVTPT-LSDFLKW 211
Query: 215 AMPKALQA 222
A A QA
Sbjct: 212 ACLPAHQA 219
>gi|336249790|ref|YP_004593500.1| cupin [Enterobacter aerogenes KCTC 2190]
gi|444352000|ref|YP_007388144.1| FIG002776: hypothetical protein [Enterobacter aerogenes EA1509E]
gi|334735846|gb|AEG98221.1| cupin [Enterobacter aerogenes KCTC 2190]
gi|443902830|emb|CCG30604.1| FIG002776: hypothetical protein [Enterobacter aerogenes EA1509E]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + + D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPISP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNNWHEPTAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|153949056|ref|YP_001401948.1| cupin family protein [Yersinia pseudotuberculosis IP 31758]
gi|152960551|gb|ABS48012.1| cupin family protein [Yersinia pseudotuberculosis IP 31758]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E + + ++
Sbjct: 178 ILYIPPGFPHDG--ITHETAFNYSVG 201
>gi|395235964|ref|ZP_10414164.1| hypothetical protein A936_19833 [Enterobacter sp. Ag1]
gi|394729270|gb|EJF29266.1| hypothetical protein A936_19833 [Enterobacter sp. Ag1]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D+ + FI+Q G ++W+V + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDNYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|365970021|ref|YP_004951582.1| protein YcfD [Enterobacter cloacae EcWSU1]
gi|365748934|gb|AEW73161.1| YcfD [Enterobacter cloacae EcWSU1]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 82/192 (42%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFSNFVDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|170025286|ref|YP_001721791.1| cupin 4 family protein [Yersinia pseudotuberculosis YPIII]
gi|169751820|gb|ACA69338.1| Cupin 4 family protein [Yersinia pseudotuberculosis YPIII]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E + + ++
Sbjct: 178 ILYIPPGFPHDG--ITHETAFNYSVG 201
>gi|51595407|ref|YP_069598.1| hypothetical protein YPTB1059 [Yersinia pseudotuberculosis IP
32953]
gi|186894433|ref|YP_001871545.1| cupin 4 family protein [Yersinia pseudotuberculosis PB1/+]
gi|51588689|emb|CAH20299.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP
32953]
gi|186697459|gb|ACC88088.1| Cupin 4 family protein [Yersinia pseudotuberculosis PB1/+]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E + + ++
Sbjct: 178 ILYIPPGFPHDG--ITHETAFNYSVG 201
>gi|378978170|ref|YP_005226311.1| hypothetical protein KPHS_20110 [Klebsiella pneumoniae subsp.
pneumoniae HS11286]
gi|419971737|ref|ZP_14487167.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH1]
gi|419980908|ref|ZP_14496189.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH2]
gi|419986030|ref|ZP_14501166.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH4]
gi|419989732|ref|ZP_14504707.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH5]
gi|419994848|ref|ZP_14509656.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH6]
gi|420004050|ref|ZP_14518691.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH7]
gi|420009672|ref|ZP_14524153.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH8]
gi|420014645|ref|ZP_14528950.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH9]
gi|420018901|ref|ZP_14533096.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH10]
gi|420026743|ref|ZP_14540743.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH11]
gi|420031526|ref|ZP_14545347.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH12]
gi|420038306|ref|ZP_14551954.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH14]
gi|420044346|ref|ZP_14557827.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH16]
gi|420050150|ref|ZP_14563452.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH17]
gi|420055659|ref|ZP_14568824.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH18]
gi|420061310|ref|ZP_14574300.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH19]
gi|420066084|ref|ZP_14578886.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH20]
gi|420070819|ref|ZP_14583469.1| hypothetical protein KPNIH21_06765 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH21]
gi|420078931|ref|ZP_14591383.1| hypothetical protein KPNIH22_18232 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH22]
gi|420083076|ref|ZP_14595364.1| hypothetical protein KPNIH23_10251 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH23]
gi|421908479|ref|ZP_16338320.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST258-K26BO]
gi|421916787|ref|ZP_16346353.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST258-K28BO]
gi|428150986|ref|ZP_18998739.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST512-K30BO]
gi|428939574|ref|ZP_19012680.1| hypothetical protein MTE2_08535 [Klebsiella pneumoniae VA360]
gi|364517581|gb|AEW60709.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae HS11286]
gi|397344617|gb|EJJ37749.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH2]
gi|397349397|gb|EJJ42491.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH4]
gi|397352675|gb|EJJ45753.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH1]
gi|397367497|gb|EJJ60108.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH7]
gi|397368621|gb|EJJ61226.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH6]
gi|397369231|gb|EJJ61833.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH5]
gi|397377661|gb|EJJ69887.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH9]
gi|397380630|gb|EJJ72809.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH8]
gi|397389562|gb|EJJ81495.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH10]
gi|397394775|gb|EJJ86496.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH11]
gi|397400395|gb|EJJ92040.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH12]
gi|397404459|gb|EJJ95967.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH14]
gi|397411745|gb|EJK02994.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH17]
gi|397412113|gb|EJK03353.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH16]
gi|397421512|gb|EJK12524.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH18]
gi|397430354|gb|EJK21049.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH20]
gi|397431179|gb|EJK21859.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KPNIH19]
gi|397441323|gb|EJK31703.1| hypothetical protein KPNIH21_06765 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH21]
gi|397444237|gb|EJK34520.1| hypothetical protein KPNIH22_18232 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH22]
gi|397451596|gb|EJK41678.1| hypothetical protein KPNIH23_10251 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH23]
gi|410117614|emb|CCM80945.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST258-K26BO]
gi|410120897|emb|CCM88978.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST258-K28BO]
gi|426303771|gb|EKV65932.1| hypothetical protein MTE2_08535 [Klebsiella pneumoniae VA360]
gi|427539047|emb|CCM94877.1| FIG002776: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST512-K30BO]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|429108397|ref|ZP_19170266.1| FIG002776: hypothetical protein [Cronobacter malonaticus 681]
gi|426295120|emb|CCJ96379.1| FIG002776: hypothetical protein [Cronobacter malonaticus 681]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/343 (22%), Positives = 137/343 (39%), Gaps = 50/343 (14%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ +
Sbjct: 16 YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLR 240
G+ H+ ++ N + V I +G D +R L G R
Sbjct: 183 PGFPHEGYSLENSMNYSVGFRAPSGRELI-------------SGFADYVLQREL--GSQR 227
Query: 241 YAG---LARGKPVDIQADRLAMKENLKDLLAKLV-EYVDLDDGVDEMGKQLMHDALPPVL 296
Y+ AR P DI A + L+D++ L+ + + ++ E Q H
Sbjct: 228 YSDPDVPARECPADIVP---AEVDKLRDMMLDLIRDPANFNEWFGEFITQSRH------- 277
Query: 297 SPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
EL S E Q E+Y+A + DT VRL +R
Sbjct: 278 ---ELDVSPPEPPY---QPDEIYDALK-QGDTLVRLGGLRVLR 313
>gi|206580898|ref|YP_002239240.1| cupin family protein [Klebsiella pneumoniae 342]
gi|288936099|ref|YP_003440158.1| cupin [Klebsiella variicola At-22]
gi|206569956|gb|ACI11732.1| cupin family protein [Klebsiella pneumoniae 342]
gi|288890808|gb|ADC59126.1| Cupin 4 family protein [Klebsiella variicola At-22]
Length = 373
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|401676209|ref|ZP_10808195.1| cupin 4 family protein [Enterobacter sp. SST3]
gi|400216695|gb|EJO47595.1| cupin 4 family protein [Enterobacter sp. SST3]
Length = 373
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|262043159|ref|ZP_06016297.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
rhinoscleromatis ATCC 13884]
gi|259039547|gb|EEW40680.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
rhinoscleromatis ATCC 13884]
Length = 373
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|290510849|ref|ZP_06550219.1| ycfD [Klebsiella sp. 1_1_55]
gi|289777565|gb|EFD85563.1| ycfD [Klebsiella sp. 1_1_55]
Length = 373
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|238894175|ref|YP_002918909.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae
NTUH-K2044]
gi|365139668|ref|ZP_09346015.1| hypothetical protein HMPREF1024_02046 [Klebsiella sp. 4_1_44FAA]
gi|386034318|ref|YP_005954231.1| hypothetical protein KPN2242_08750 [Klebsiella pneumoniae KCTC
2242]
gi|402781343|ref|YP_006636889.1| hypothetical protein A79E_3094 [Klebsiella pneumoniae subsp.
pneumoniae 1084]
gi|424830114|ref|ZP_18254842.1| cupin family protein [Klebsiella pneumoniae subsp. pneumoniae Ecl8]
gi|425077232|ref|ZP_18480335.1| hypothetical protein HMPREF1305_03145 [Klebsiella pneumoniae subsp.
pneumoniae WGLW1]
gi|425080959|ref|ZP_18484056.1| hypothetical protein HMPREF1306_01707 [Klebsiella pneumoniae subsp.
pneumoniae WGLW2]
gi|425087865|ref|ZP_18490958.1| hypothetical protein HMPREF1307_03314 [Klebsiella pneumoniae subsp.
pneumoniae WGLW3]
gi|428933734|ref|ZP_19007278.1| hypothetical protein MTE1_13144 [Klebsiella pneumoniae JHCK1]
gi|449059308|ref|ZP_21737005.1| hypothetical protein G057_15496 [Klebsiella pneumoniae hvKP1]
gi|238546491|dbj|BAH62842.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae
NTUH-K2044]
gi|339761446|gb|AEJ97666.1| putative enzyme [Klebsiella pneumoniae KCTC 2242]
gi|363654281|gb|EHL93196.1| hypothetical protein HMPREF1024_02046 [Klebsiella sp. 4_1_44FAA]
gi|402542233|gb|AFQ66382.1| hypothetical protein A79E_3094 [Klebsiella pneumoniae subsp.
pneumoniae 1084]
gi|405592941|gb|EKB66393.1| hypothetical protein HMPREF1305_03145 [Klebsiella pneumoniae subsp.
pneumoniae WGLW1]
gi|405604589|gb|EKB77710.1| hypothetical protein HMPREF1307_03314 [Klebsiella pneumoniae subsp.
pneumoniae WGLW3]
gi|405604769|gb|EKB77876.1| hypothetical protein HMPREF1306_01707 [Klebsiella pneumoniae subsp.
pneumoniae WGLW2]
gi|414707539|emb|CCN29243.1| cupin family protein [Klebsiella pneumoniae subsp. pneumoniae Ecl8]
gi|426304522|gb|EKV66664.1| hypothetical protein MTE1_13144 [Klebsiella pneumoniae JHCK1]
gi|448874884|gb|EMB09914.1| hypothetical protein G057_15496 [Klebsiella pneumoniae hvKP1]
Length = 373
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|83592160|ref|YP_425912.1| cupin region [Rhodospirillum rubrum ATCC 11170]
gi|83575074|gb|ABC21625.1| Cupin region [Rhodospirillum rubrum ATCC 11170]
Length = 301
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 79/183 (43%), Gaps = 28/183 (15%)
Query: 95 LQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPR-YS 153
L++ G V + + T G PHYDD + I+Q+ G+K WK+ + + R S
Sbjct: 116 LEQALGAVVTIDAIGSFGTVCGLPPHYDDRDLIIVQVAGRKHWKILGTPVEGPWRKRTMS 175
Query: 154 SPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLE 213
P+ +E ++ GD+L++P G HQ V E SLH+ + + DLL+
Sbjct: 176 VPDTVTDEF-------VMQGGDMLFVPAGLYHQC--VPLEPSLHLGALITRPCG-ADLLK 225
Query: 214 KAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEY 273
P+ TD E L +G D + D LK+ L +LV+
Sbjct: 226 MVQPR----WETTDPELAARLYVG-------------DGETDLQQQDARLKEALIRLVQD 268
Query: 274 VDL 276
+D+
Sbjct: 269 MDV 271
>gi|401763245|ref|YP_006578252.1| cupin [Enterobacter cloacae subsp. cloacae ENHKU01]
gi|400174779|gb|AFP69628.1| cupin [Enterobacter cloacae subsp. cloacae ENHKU01]
Length = 373
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + I+ ++ + +N ++ + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFSNFVDP-ISPDELAGLAMENEVDSR----LVSHQDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEGIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|197336245|ref|YP_002156409.1| cupin family protein [Vibrio fischeri MJ11]
gi|197317735|gb|ACH67182.1| cupin family protein [Vibrio fischeri MJ11]
Length = 382
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/199 (24%), Positives = 84/199 (42%), Gaps = 38/199 (19%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEG- 59
++W+K+P+ I +N+ + +T +++ + +N ++D + Q G
Sbjct: 14 EYWQKKPVIIKGGFENF-QDPVTPEELAGLTLEN------DVDSRFISNANNQWKAEHGP 66
Query: 60 ------RALPQIVW-------DHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGAN 106
L + W +H+ EG + F KP +Q+ L +
Sbjct: 67 LSEELYETLGETNWSIIVQAANHWHEGAAELF-------KPFKQMPNWLFDDIMI----- 114
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q +GK+ W+V + EY + Q PI
Sbjct: 115 SYSVP--HGGVGPHIDQYDVFIIQGQGKRHWRV---GDIGEYQEEHRHSALKQITGFEPI 169
Query: 167 LTVTLEPGDLLYLPRGYIH 185
+ LEPGD+LY+P G+ H
Sbjct: 170 IDQILEPGDILYIPPGFPH 188
>gi|260597497|ref|YP_003210068.1| hypothetical protein CTU_17050 [Cronobacter turicensis z3032]
gi|260216674|emb|CBA30020.1| Uncharacterized protein ycfD [Cronobacter turicensis z3032]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/198 (21%), Positives = 87/198 (43%), Gaps = 14/198 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ +
Sbjct: 16 YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHV 198
G+ H+ ++ N + V
Sbjct: 183 PGFPHEGYSLENSMNYSV 200
>gi|423139625|ref|ZP_17127263.1| cupin family protein [Salmonella enterica subsp. houtenae str. ATCC
BAA-1581]
gi|379052179|gb|EHY70070.1| cupin family protein [Salmonella enterica subsp. houtenae str. ATCC
BAA-1581]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 80/192 (41%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ + R N+ + L D L +E + + + ++D + Q
Sbjct: 16 HWQKRPVVLKRGFTNFIDPLSP-----DELAGLAMESEIDSRLVSHQDDKWQV-----NH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGESNWSLLVQAVNHWHEPTAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ + N
Sbjct: 183 PGFPHEGYALEN 194
>gi|424933998|ref|ZP_18352370.1| Putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KpQ3]
gi|407808185|gb|EKF79436.1| Putative enzyme [Klebsiella pneumoniae subsp. pneumoniae KpQ3]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|253688251|ref|YP_003017441.1| Cupin 4 family protein [Pectobacterium carotovorum subsp.
carotovorum PC1]
gi|251754829|gb|ACT12905.1| Cupin 4 family protein [Pectobacterium carotovorum subsp.
carotovorum PC1]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 98/219 (44%), Gaps = 38/219 (17%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ------- 53
++W+K PL I K + N I D+ L +E + + + +++G+ Q
Sbjct: 15 NYWQKRPLLI----KGGFTNFIDPISPDE-LAGLALENEVDSRLVSHQNGRWQVSHGPFE 69
Query: 54 TMDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTY 108
+ D G + L Q V DH+ E S ++P +QL + +L F
Sbjct: 70 SYDHLGESNWSLLVQAV-DHWHEPSS-------ALMQPFRQLPDWRTDDLMISF------ 115
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
P G PH D + FI+Q G+++W++ + ++ P P+ Q E I+
Sbjct: 116 SVP--GGGVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVEPFEAIID 170
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
LEPGD++Y+P G+ H+ ++ N SL+ ++ A
Sbjct: 171 EELEPGDIIYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207
>gi|429090415|ref|ZP_19153147.1| FIG002776: hypothetical protein [Cronobacter universalis NCTC 9529]
gi|426510218|emb|CCK18259.1| FIG002776: hypothetical protein [Cronobacter universalis NCTC 9529]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/198 (21%), Positives = 87/198 (43%), Gaps = 14/198 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ +
Sbjct: 16 YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHV 198
G+ H+ ++ N + V
Sbjct: 183 PGFPHEGYSLENSMNYSV 200
>gi|330016193|ref|ZP_08308423.1| cupin family protein [Klebsiella sp. MS 92-3]
gi|328529478|gb|EGF56383.1| cupin family protein [Klebsiella sp. MS 92-3]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|429104140|ref|ZP_19166114.1| FIG002776: hypothetical protein [Cronobacter turicensis 564]
gi|426290789|emb|CCJ92227.1| FIG002776: hypothetical protein [Cronobacter turicensis 564]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/198 (21%), Positives = 87/198 (43%), Gaps = 14/198 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R KN+ + L + ++ + +N ++ + + ++DG+ +
Sbjct: 16 YWQKRPVVLKRGIKNFVDPL-SPDELAGLAMENEVDSR----LVSHQDGKWEV-----SH 65
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFQSYDHLGETNWSLLVQAVNNWHEPSAALMRPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q ++ LEPGD+LY+P
Sbjct: 126 LDQYDVFIIQGVGRRRWRVGEKVPMKQHCPH---PDLLQVAPFEALIDEELEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHV 198
G+ H+ ++ N + V
Sbjct: 183 PGFPHEGYSLENSMNYSV 200
>gi|425091021|ref|ZP_18494106.1| hypothetical protein HMPREF1308_01281 [Klebsiella pneumoniae subsp.
pneumoniae WGLW5]
gi|405613178|gb|EKB85926.1| hypothetical protein HMPREF1308_01281 [Klebsiella pneumoniae subsp.
pneumoniae WGLW5]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + L D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPLSP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDVLYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|421479034|ref|ZP_15926752.1| cupin domain protein, PF06172 family [Burkholderia multivorans CF2]
gi|400223580|gb|EJO53869.1| cupin domain protein, PF06172 family [Burkholderia multivorans CF2]
Length = 382
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 64/133 (48%), Gaps = 12/133 (9%)
Query: 102 FVGANTYQTPPTAQG----FAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF 157
F G T + A G + H+D + F +QL+G+K+W VY P D L + S F
Sbjct: 116 FTGRQTVTSAYIAFGETDSYRAHWDSRDVFAVQLKGRKRWVVYKPTF-DAPLFMHQS-KF 173
Query: 158 SQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLL 212
++ P + LE GD+ Y+PRG+ H + + E ++H+ I + A ++ +
Sbjct: 174 IEDRYPCPAEPHMDFVLEEGDVFYIPRGWWHNVTPL-GEPTVHLAIGTFPAFAVDYVKWV 232
Query: 213 EKAMPKALQAAGA 225
+ +P + A A
Sbjct: 233 FRELPDCIYARMA 245
>gi|423686362|ref|ZP_17661170.1| hypothetical protein VFSR5_1673 [Vibrio fischeri SR5]
gi|371494430|gb|EHN70028.1| hypothetical protein VFSR5_1673 [Vibrio fischeri SR5]
Length = 382
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 78/195 (40%), Gaps = 26/195 (13%)
Query: 1 DHWEKEPLYISRKQKNYYN--------NLITSKKIDDILRDNVIEFKKNIDITLYEDGQR 52
++W+K+P+ I +N+ + L +D N K L ED
Sbjct: 14 EYWQKKPVIIKGGLENFQDPVTPEELAGLTLENDVDSRFISNANNEWKAEHGPLSEDLYE 73
Query: 53 QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
+ + Q +H+ EG + F KP +Q+ L + +Y P
Sbjct: 74 TLGETNWSIIVQAA-NHWHEGAAELF-------KPFKQMPNWLFDDIMI-----SYSVP- 119
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
G PH D + FI+Q +GK+ W+V + EY + Q PI+ LE
Sbjct: 120 -HGGVGPHIDQYDVFIIQGQGKRHWRV---GDIGEYQEEHRHSALKQITGFDPIIDQILE 175
Query: 173 PGDLLYLPRGYIHQA 187
PGD+LY+P G+ H
Sbjct: 176 PGDILYIPPGFPHDG 190
>gi|421494322|ref|ZP_15941672.1| hypothetical protein MU9_2843 [Morganella morganii subsp. morganii
KT]
gi|455739854|ref|YP_007506120.1| hypothetical protein MU9_2702 [Morganella morganii subsp. morganii
KT]
gi|400191489|gb|EJO24635.1| hypothetical protein MU9_2843 [Morganella morganii subsp. morganii
KT]
gi|455421417|gb|AGG31747.1| hypothetical protein MU9_2702 [Morganella morganii subsp. morganii
KT]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 87/212 (41%), Gaps = 26/212 (12%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKI-----DDILRDNVIEFKKNIDITLYEDGQRQTMD 56
HW+K PL I +N+ + + + D+I V F D+ G Q D
Sbjct: 16 HWQKRPLLIKGGFRNFRDTISPDELAGLAMEDEIDSRIVTRFSGKWDV---RHGPFQEFD 72
Query: 57 VEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQ 115
L + W + +KP + L + + +L +Y P
Sbjct: 73 ----GLGEKDWSILVQAVDHWHFPSAALMKPFRCLSDWRMDDLM------ISYSVP--GG 120
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +G+++W+V + ++ P P+ Q + I+ + PGD
Sbjct: 121 GVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHAPH---PDLLQVDPFDAIIDDEMVPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+LY+P G+ H+ + E SL+ ++ A
Sbjct: 178 ILYIPPGFPHEGYAI--EESLNYSVGFRAPNA 207
>gi|251788906|ref|YP_003003627.1| Cupin 4 family protein [Dickeya zeae Ech1591]
gi|247537527|gb|ACT06148.1| Cupin 4 family protein [Dickeya zeae Ech1591]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P + TPI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFTPIIDEELEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--LTFETALNYSVG 201
>gi|238762219|ref|ZP_04623191.1| hypothetical protein ykris0001_1360 [Yersinia kristensenii ATCC
33638]
gi|238699566|gb|EEP92311.1| hypothetical protein ykris0001_1360 [Yersinia kristensenii ATCC
33638]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH+D + FI+Q G+++W+V + ++ P P+ Q I+ +EPGD
Sbjct: 121 GVGPHFDQYDVFIIQGTGRRRWRVGEKTEMKQHCPH---PDLLQVGPFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 178 ILYIPPGFPHEGYALEN 194
>gi|418043322|ref|ZP_12681489.1| cupin family protein [Escherichia coli W26]
gi|420371391|ref|ZP_14871808.1| cupin superfamily protein [Shigella flexneri 1235-66]
gi|383473740|gb|EID65752.1| cupin family protein [Escherichia coli W26]
gi|391319303|gb|EIQ76334.1| cupin superfamily protein [Shigella flexneri 1235-66]
Length = 352
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD
Sbjct: 100 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGD 156
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 157 ILYIPPGFPHEGYALEN 173
>gi|416179413|ref|ZP_11610979.1| cupin family protein [Neisseria meningitidis M6190]
gi|416193015|ref|ZP_11617036.1| cupin family protein [Neisseria meningitidis ES14902]
gi|325131698|gb|EGC54403.1| cupin family protein [Neisseria meningitidis M6190]
gi|325137714|gb|EGC60291.1| cupin family protein [Neisseria meningitidis ES14902]
Length = 309
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/165 (22%), Positives = 78/165 (47%), Gaps = 15/165 (9%)
Query: 60 RALPQIVWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
R + ++++H + G S+ + +N + + ++ + L G + Y + + +
Sbjct: 7 RFIKSVIYEHMKNGASLVYNHINNEPFS---DHISRQVARLAGAHTIVSGYLAFGSDESY 63
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLEPG 174
H+D + + +QL GKK+W++ P D +P Y +I P + + LE G
Sbjct: 64 KNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILEAG 119
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
D+LY+PRG+ H+ + E + H+ + + +++ L K P
Sbjct: 120 DVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 163
>gi|226330187|ref|ZP_03805705.1| hypothetical protein PROPEN_04100 [Proteus penneri ATCC 35198]
gi|225200982|gb|EEG83336.1| cupin family protein [Proteus penneri ATCC 35198]
Length = 329
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D+ + FI+Q G+++W+V + ++ P P + PI
Sbjct: 70 SYSVP--GGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFCPH---PALLHVDPFEPI 124
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
+ V + PGD+LY+P G+ H T + + V
Sbjct: 125 IDVEMAPGDILYIPPGFPHDGFTFEDTMNYSV 156
>gi|238762898|ref|ZP_04623866.1| hypothetical protein ykris0001_32090 [Yersinia kristensenii ATCC
33638]
gi|238698909|gb|EEP91658.1| hypothetical protein ykris0001_32090 [Yersinia kristensenii ATCC
33638]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKQPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|212712836|ref|ZP_03320964.1| hypothetical protein PROVALCAL_03933 [Providencia alcalifaciens DSM
30120]
gi|422017728|ref|ZP_16364291.1| hypothetical protein OO9_03473 [Providencia alcalifaciens Dmel2]
gi|212684528|gb|EEB44056.1| hypothetical protein PROVALCAL_03933 [Providencia alcalifaciens DSM
30120]
gi|414105321|gb|EKT66880.1| hypothetical protein OO9_03473 [Providencia alcalifaciens Dmel2]
Length = 374
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q EG++ W+V + ++ P P+ Q + I
Sbjct: 114 SYSVP--GGGVGPHLDQYDVFIIQGEGRRHWRVGEKIPMKQHSPH---PDLLQVQPFDAI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+ +EPGD+LY+P G+ H+ + E SL+ ++ A
Sbjct: 169 IDEEMEPGDILYIPPGFPHEGYAI--EPSLNFSVGFRAPNA 207
>gi|195941247|ref|ZP_03086629.1| hypothetical protein EscherichcoliO157_33462, partial [Escherichia
coli O157:H7 str. EC4024]
Length = 274
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD
Sbjct: 38 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPH---PDLLQVDPFEAIIDEELEPGD 94
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 95 ILYIPPGFPHEGYALEN 111
>gi|167525589|ref|XP_001747129.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774424|gb|EDQ88053.1| predicted protein [Monosiga brevicollis MX1]
Length = 4167
Score = 52.4 bits (124), Expect = 3e-04, Method: Composition-based stats.
Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 22/123 (17%)
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIG---- 163
Y + P A+ PH D + F QL G K W + D ++ ++Q+E
Sbjct: 180 YLSGPNAKALPPHTDRNDIFSFQLAGTKHWTICTNPPPDT---SWTPAEWAQQEENEASR 236
Query: 164 -------TPILTVTLE-------PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWI 209
+PI ++ PGD +YLP+G +H A T +E SLH T+ +W+
Sbjct: 237 TEGCRNPSPIEVTKMQCEHHVVHPGDWIYLPKGTVHFAQT-NDEMSLHATVGFRPSFSWL 295
Query: 210 DLL 212
+ L
Sbjct: 296 NRL 298
>gi|429118195|ref|ZP_19179113.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 701]
gi|426321324|emb|CCK05226.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 701]
Length = 345
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G+++W+V + ++ P P + PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGAKLPMRQFCPH---PALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ LEPGD+LY+P G+ H T+E +L+ ++
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201
>gi|311279967|ref|YP_003942198.1| transcription factor jumonji jmjC domain-containing protein
[Enterobacter cloacae SCF1]
gi|308749162|gb|ADO48914.1| transcription factor jumonji jmjC domain-containing protein
[Enterobacter cloacae SCF1]
Length = 373
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 84/192 (43%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + IT ++ + +N ++ + + + DG+ Q
Sbjct: 16 YWQKRPVVLKRGFNNFVDP-ITPDELAGLAMENEVDSR----LVSHLDGKWQV-----SH 65
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 66 GPFESYDHLGENNWSLLVQAVNHWHEPTAALMRPFRALPDWRMDDLMVSFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKAPLKQHCPH---PDLLQVDPFDAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTN 192
G+ H+ ++ N
Sbjct: 183 PGFPHEGYSLEN 194
>gi|271501440|ref|YP_003334465.1| Cupin 4 family protein [Dickeya dadantii Ech586]
gi|270344995|gb|ACZ77760.1| Cupin 4 family protein [Dickeya dadantii Ech586]
Length = 379
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P + TPI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFTPIIEEELEPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|238752808|ref|ZP_04614275.1| hypothetical protein yrohd0001_5950 [Yersinia rohdei ATCC 43380]
gi|238708953|gb|EEQ01204.1| hypothetical protein yrohd0001_5950 [Yersinia rohdei ATCC 43380]
Length = 374
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLSPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|242238851|ref|YP_002987032.1| cupin [Dickeya dadantii Ech703]
gi|242130908|gb|ACS85210.1| Cupin 4 family protein [Dickeya dadantii Ech703]
Length = 374
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + TPI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFTPIIDADLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T++ +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHQTALNYSVG 201
>gi|375119387|ref|ZP_09764554.1| Transcription factor jumonji/aspartyl beta-hydroxylase [Salmonella
enterica subsp. enterica serovar Dublin str. SD3246]
gi|417334366|ref|ZP_12117602.1| hypothetical protein LTSEALA_2829 [Salmonella enterica subsp.
enterica serovar Alachua str. R6-377]
gi|326623654|gb|EGE29999.1| Transcription factor jumonji/aspartyl beta-hydroxylase [Salmonella
enterica subsp. enterica serovar Dublin str. SD3246]
gi|353575328|gb|EHC38094.1| hypothetical protein LTSEALA_2829 [Salmonella enterica subsp.
enterica serovar Alachua str. R6-377]
Length = 352
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD
Sbjct: 100 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGD 156
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 157 ILYIPPGFPHEGYALEN 173
>gi|121634298|ref|YP_974543.1| hypothetical protein NMC0436 [Neisseria meningitidis FAM18]
gi|120866004|emb|CAM09741.1| hypothetical protein NMC0436 [Neisseria meningitidis FAM18]
Length = 384
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/165 (22%), Positives = 78/165 (47%), Gaps = 15/165 (9%)
Query: 60 RALPQIVWDHYREGCSIRF--LNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGF 117
R + ++++H + G S+ + +N + + ++ + L G + Y + + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINNEPFS---DHISRQVARLAGAHTIVSGYLAFGSDESY 138
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLEPG 174
H+D + + +QL GKK+W++ P D +P Y +I P + + LE G
Sbjct: 139 KNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILEAG 194
Query: 175 DLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
D+LY+PRG+ H+ + E + H+ + + +++ L K P
Sbjct: 195 DVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 238
>gi|445338422|ref|ZP_21416088.1| hypothetical protein SEE18569_023278, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 18569]
gi|444872804|gb|ELX97120.1| hypothetical protein SEE18569_023278, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 18569]
Length = 330
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD
Sbjct: 78 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGD 134
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 135 ILYIPPGFPHEGYALEN 151
>gi|449306517|ref|YP_007443057.1| hypothetical protein CSSP291_20968 [Cronobacter sakazakii SP291]
gi|449100674|gb|AGE88707.1| hypothetical protein CSSP291_20968 [Cronobacter sakazakii SP291]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G+++W+V + ++ P P + PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGAKLPMRQFCPH---PALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ LEPGD+LY+P G+ H T+E +L+ ++
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201
>gi|238791619|ref|ZP_04635257.1| hypothetical protein yinte0001_27760 [Yersinia intermedia ATCC
29909]
gi|238729235|gb|EEQ20751.1| hypothetical protein yinte0001_27760 [Yersinia intermedia ATCC
29909]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|429107487|ref|ZP_19169356.1| FIG00553994: hypothetical protein [Cronobacter malonaticus 681]
gi|426294210|emb|CCJ95469.1| FIG00553994: hypothetical protein [Cronobacter malonaticus 681]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G ++W+V + ++ P P + PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ LEPGD+LY+P G++H T+E +L+ ++
Sbjct: 169 IDEDLEPGDILYIPPGFLHDG--FTHETALNYSVG 201
>gi|251789544|ref|YP_003004265.1| Cupin 4 family protein [Dickeya zeae Ech1591]
gi|247538165|gb|ACT06786.1| Cupin 4 family protein [Dickeya zeae Ech1591]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q E I+ L PGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVEPFEAIIDEELAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+LY+P G+ H+ ++ N SL+ ++ +A
Sbjct: 178 ILYIPPGFPHEGYSLEN--SLNYSVGFRAPSA 207
>gi|383813869|ref|ZP_09969292.1| transcription factor jumonji jmjC domain-containing protein
[Serratia sp. M24T3]
gi|383297067|gb|EIC85378.1| transcription factor jumonji jmjC domain-containing protein
[Serratia sp. M24T3]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 85/194 (43%), Gaps = 18/194 (9%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNV--IEFKKNIDITLYEDGQRQTMDVEG 59
+W+K P+ + R KN+ ID I D + + + +D L Q + DV
Sbjct: 16 YWQKRPVLLKRGFKNF---------IDPISPDELAGLAMENEVDSRLVSHDQGR-WDV-- 63
Query: 60 RALPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFA 118
+ P +DH E SI + +P +L +++ + G
Sbjct: 64 KHGPFESYDHLGENNWSILVQAVDHWHEPSARLMRPFRQIPDWRMDDLMISFSVPGGGVG 123
Query: 119 PHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLY 178
PH D + FI+Q G+++W+V + ++ P P+ Q E I+ +EPGD++Y
Sbjct: 124 PHLDQYDVFIIQGTGRRRWRVGEKIPMKQHTPH---PDLLQVEPFDAIIDEEMEPGDIIY 180
Query: 179 LPRGYIHQASTVTN 192
+P G+ H+ ++ N
Sbjct: 181 IPPGFPHEGYSLEN 194
>gi|410636856|ref|ZP_11347447.1| hypothetical protein GLIP_2024 [Glaciecola lipolytica E3]
gi|410143662|dbj|GAC14652.1| hypothetical protein GLIP_2024 [Glaciecola lipolytica E3]
Length = 384
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + D P + N Q PIL LEPGD
Sbjct: 121 GVGPHVDQYDVFIIQGRGTRRWQVGENKHHDTIQP---AKNLRQISGFDPILDEILEPGD 177
Query: 176 LLYLPRGYIHQA 187
L+Y+P G+ H
Sbjct: 178 LIYIPPGFPHNG 189
>gi|213161977|ref|ZP_03347687.1| hypothetical protein Salmoneentericaenterica_19042 [Salmonella
enterica subsp. enterica serovar Typhi str. E00-7866]
Length = 272
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ LEPGD
Sbjct: 20 GVGPHLDQYDVFIIQGTGRRRWRVGEKLQMRQHCPH---PDLLQVDPFEAIIDEELEPGD 76
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 77 ILYIPPGFPHEGYALEN 93
>gi|323452841|gb|EGB08714.1| hypothetical protein AURANDRAFT_63985 [Aureococcus anophagefferens]
Length = 4512
Score = 52.0 bits (123), Expect = 4e-04, Method: Composition-based stats.
Identities = 42/137 (30%), Positives = 59/137 (43%), Gaps = 15/137 (10%)
Query: 105 ANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEI 162
A+ Y + P A A H D + + Q+ G K W V+ V ++ Y + S E+
Sbjct: 180 AHVYVSAPGAAALANHTDLNDVVVFQIAGAKAWHVH---AVAKHGTCATYDASEMSGLEV 236
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQA 222
L PGDLL+LP+G +H A E S H+T+ + A A+
Sbjct: 237 CDDF---ELGPGDLLFLPQGAVHSARATEAEGSTHLTLGLAGTRA-------AVCDEQAP 286
Query: 223 AGATDLEFRRGLPIGYL 239
AG EFRR L G L
Sbjct: 287 AGDLMAEFRRQLDEGDL 303
>gi|37526699|ref|NP_930043.1| hypothetical protein plu2809 [Photorhabdus luminescens subsp.
laumondii TTO1]
gi|36786131|emb|CAE15183.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii
TTO1]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 49/89 (55%), Gaps = 11/89 (12%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLP---RYSSPNFSQEEIGTPILTVTLE 172
G PH D + FI+Q G+++W+V E LP R P+ Q + + ++ +E
Sbjct: 121 GVGPHLDQYDVFIIQGMGRRRWRV------GEKLPMEQRCPHPDLLQVDPFSAVIDEEME 174
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
PGD+LY+P G+ H+ ++ E SL+ ++
Sbjct: 175 PGDILYIPPGFPHEGYSI--EDSLNYSVG 201
>gi|336248485|ref|YP_004592195.1| cupin [Enterobacter aerogenes KCTC 2190]
gi|334734541|gb|AEG96916.1| cupin [Enterobacter aerogenes KCTC 2190]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H VT+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201
>gi|261821424|ref|YP_003259530.1| cupin [Pectobacterium wasabiae WPP163]
gi|261605437|gb|ACX87923.1| Cupin 4 family protein [Pectobacterium wasabiae WPP163]
gi|385871663|gb|AFI90183.1| Cupin 4 family protein [Pectobacterium sp. SCC3193]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 98/219 (44%), Gaps = 38/219 (17%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ------- 53
++W+K PL I K + N I D+ L +E + + + +++G+ Q
Sbjct: 15 NYWQKRPLLI----KGGFTNFIDPISPDE-LAGLALENEVDSRLVSHQNGRWQVSHGPFE 69
Query: 54 TMDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTY 108
+ D G + L Q V DH+ E S ++P +QL + +L F
Sbjct: 70 SYDHLGESNWSLLVQAV-DHWHEPSS-------ALMQPFRQLPDWRTDDLMISF------ 115
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
P G PH D + FI+Q G+++W++ + ++ P P+ Q + I+
Sbjct: 116 SVP--GGGVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFEAIID 170
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
LEPGD++Y+P G+ H+ ++ N SL+ ++ A
Sbjct: 171 AELEPGDIVYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207
>gi|82702272|ref|YP_411838.1| hypothetical protein Nmul_A1143 [Nitrosospira multiformis ATCC
25196]
gi|82410337|gb|ABB74446.1| Cupin region [Nitrosospira multiformis ATCC 25196]
Length = 415
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/219 (22%), Positives = 88/219 (40%), Gaps = 33/219 (15%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDV--- 57
DHW+K+PL I RK ++ L+ + ++ ID+ ED Q + +
Sbjct: 43 DHWQKKPLLI-RKALPDFSGLLDANEL--------------IDLACQEDAQSRLVTRRNG 87
Query: 58 -----EGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
G P+ ++G ++ + ++ ++L + + P
Sbjct: 88 RWEVRHGPFAPRAFARLPQKGWTLLVQDVNHFLPAARELLLKFNFIPHSRLDDLMVSYAP 147
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVY--LPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
G PH+D + F+LQ G+++W++ R + P +F E+
Sbjct: 148 EDGGVGPHFDSYDVFLLQGTGRRRWRISGQKDRTLVAAAPLKILQDFRPEQ------EWV 201
Query: 171 LEPGDLLYLPRGYIHQASTV--TNEHSLHVTISVYQKTA 207
LEPGD+LYLP GY H V +S+ YQ+ A
Sbjct: 202 LEPGDMLYLPPGYAHDGVAVEPCMTYSIGFRAPTYQELA 240
>gi|444353388|ref|YP_007389532.1| FIG00732091: hypothetical protein [Enterobacter aerogenes EA1509E]
gi|443904218|emb|CCG31992.1| FIG00732091: hypothetical protein [Enterobacter aerogenes EA1509E]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H VT+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201
>gi|307131193|ref|YP_003883209.1| hypothetical protein Dda3937_02090 [Dickeya dadantii 3937]
gi|306528722|gb|ADM98652.1| hypothetical protein Dda3937_02090 [Dickeya dadantii 3937]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 124/288 (43%), Gaps = 55/288 (19%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFK------KNIDITLYEDGQRQTM 55
HW+K+P+ I +++ + IT ++ + +N ++ + D++ G ++
Sbjct: 16 HWQKKPVIIKGGFRHFIDP-ITPDELAGLAMENEVDSRLVSHQGGRWDVS---HGPFESY 71
Query: 56 DVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQT 110
D G + L Q V DH+ E + ++P +QL + + +L F
Sbjct: 72 DHLGESNWSLLVQAV-DHWHEPSA-------ALMRPFRQLPDWRIDDLMISF------SV 117
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P G PH D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 118 P--GGGVGPHLDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVAPFDAIIDEE 172
Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEF 230
LEPGD+LY+P + H+ ++ N SL+ ++ +A +G D
Sbjct: 173 LEPGDILYIPPSFPHEGYSLEN--SLNYSVGF-----------RAPSSRELVSGFADYVL 219
Query: 231 RRGLPIGYLRYAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVD 275
R L G RY+ AR P DI L + L+ ++ LV++ D
Sbjct: 220 SREL--GGQRYSDPDIPARPHPADILPQEL---DKLQHMMLDLVQHKD 262
>gi|242239497|ref|YP_002987678.1| cupin [Dickeya dadantii Ech703]
gi|242131554|gb|ACS85856.1| Cupin 4 family protein [Dickeya dadantii Ech703]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 86/198 (43%), Gaps = 14/198 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
HW+K P+ I + ++ + IT ++ + +N ++ + + ++DG+ DV
Sbjct: 16 HWQKSPVII-KGGFTHFIDPITPDELAGLAMENEVDSR----LVSHQDGR---WDVSHG- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFDSYDHLGETNWSLLVQAVHHWHEPSAALMRPFRHLPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ L PGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGAQVPMKQHCPH---PDLLQVDPFEAIIDEALSPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHV 198
+ H+ ++ N S V
Sbjct: 183 PAFPHEGYSLENSMSFSV 200
>gi|407788798|ref|ZP_11135902.1| hypothetical protein B3C1_00890 [Gallaecimonas xiamenensis 3-C-1]
gi|407208042|gb|EKE77972.1| hypothetical protein B3C1_00890 [Gallaecimonas xiamenensis 3-C-1]
Length = 373
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 4/77 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQ-EEIGTPILTVTLEPG 174
G PH D + FI+Q EGK++W+V + + P P+ Q E T ++ + PG
Sbjct: 120 GVGPHVDQYDVFIIQGEGKRRWRVGDKQPLKSKTPH---PSLLQVEPFETALIDAEMAPG 176
Query: 175 DLLYLPRGYIHQASTVT 191
D+LY+P G+ H+ +T
Sbjct: 177 DILYIPPGFPHEGYAIT 193
>gi|410087354|ref|ZP_11284058.1| hypothetical protein C790_1453 [Morganella morganii SC01]
gi|409766089|gb|EKN50185.1| hypothetical protein C790_1453 [Morganella morganii SC01]
Length = 373
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 90/216 (41%), Gaps = 34/216 (15%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKI-----DDILRDNVIEFKKNIDITLYEDGQRQTMD 56
HW+K PL I +N+ + + + D+I V F D+ G Q D
Sbjct: 16 HWQKRPLLIKGGFRNFRDTISPDELAGLAMEDEIDSRIVTRFSGKWDV---RHGPFQEFD 72
Query: 57 VEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTP 111
G L Q V DH+ +KP + L + + +L +Y P
Sbjct: 73 RLGEKDWSILVQAV-DHWH-------FPSAALMKPFRCLSDWRMDDLM------ISYSVP 118
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH D + FI+Q +G+++W+V + ++ P P+ Q + I+ +
Sbjct: 119 --GGGVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHAPH---PDLLQVDPFDAIIDDEM 173
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
PGD+LY+P G+ H+ + E SL+ ++ A
Sbjct: 174 VPGDILYIPPGFPHEGYAI--EESLNYSVGFRAPNA 207
>gi|387129600|ref|YP_006292490.1| hypothetical protein Q7C_632 [Methylophaga sp. JAM7]
gi|386270889|gb|AFJ01803.1| hypothetical protein Q7C_632 [Methylophaga sp. JAM7]
Length = 391
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 57/119 (47%), Gaps = 12/119 (10%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVD-EYL---PRYSSPNFSQEEI 162
+Y P + G H D+ + F+LQ EG ++W++ + D E+L P P+FS
Sbjct: 121 SYAVPGGSVG--AHVDNYDVFLLQAEGVRQWQIEDQPLTDPEWLAGCPLRILPSFS---- 174
Query: 163 GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
P + L PGD+LYLP G H T +L V + W++ L + + ++ Q
Sbjct: 175 --PCQSWELHPGDMLYLPPGIAHHGVAKTPCLTLSVGFRAASASQWLESLSEQLSQSPQ 231
>gi|365849052|ref|ZP_09389523.1| cupin family protein [Yokenella regensburgei ATCC 43003]
gi|364569696|gb|EHM47318.1| cupin family protein [Yokenella regensburgei ATCC 43003]
Length = 373
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 94/228 (41%), Gaps = 39/228 (17%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKIPMKQHCPH---PDLLQVDPFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLP 235
+LY+P G+ H+ ++ N + V I +G D +R L
Sbjct: 178 ILYIPPGFPHEGYSLENSLNYSVGFRAPNGRELI-------------SGFADYVLQREL- 223
Query: 236 IGYLRYAG---LARGKPVDI-QADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDA 291
G LRY +R P DI +R ++ + DL+ + + D + E Q H+
Sbjct: 224 -GSLRYTDPDVPSREHPADILPVERDRLRAMMVDLMNQPEHF---DQWLGEFITQSRHEL 279
Query: 292 LPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRANAVR 339
+SP E Q E+Y+A Q D VRL VR
Sbjct: 280 ---DISPPEPPY----------QPDEIYDALQ-QGDKLVRLGGLRVVR 313
>gi|348029548|ref|YP_004872234.1| enzyme with RmlC-like domain [Glaciecola nitratireducens FR1064]
gi|347946891|gb|AEP30241.1| putative enzyme with RmlC-like domain [Glaciecola nitratireducens
FR1064]
Length = 384
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D+ + FI+Q +G ++W+V LP E +P P Q + +PI+ L PGD
Sbjct: 120 GVGPHSDEYDVFIIQGKGSRRWQVGLPGNYTESVPH---PLLKQIDGFSPIIDEILLPGD 176
Query: 176 LLYLPRGYIHQASTVTN 192
++Y+P + H +++
Sbjct: 177 VVYIPPKHPHNGVALSD 193
>gi|152969688|ref|YP_001334797.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae MGH 78578]
gi|419762761|ref|ZP_14289007.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae DSM 30104]
gi|150954537|gb|ABR76567.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae MGH 78578]
gi|397744256|gb|EJK91468.1| putative enzyme [Klebsiella pneumoniae subsp. pneumoniae DSM 30104]
Length = 373
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 46/207 (22%), Positives = 88/207 (42%), Gaps = 16/207 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K P+ + R N+ + + D L +E + + + ++DG+ Q
Sbjct: 16 YWQKRPVVLKRGFANFIDPISP-----DELAGLAMESEVDSRLVSHQDGKWQVSHG---- 66
Query: 62 LPQIVWDHYREG-CSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + +P L + L + G PH
Sbjct: 67 -PFESYDHLSENNWSLLVQAVNHWHEPSAALMHPFRALPDWRIDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD+LY+P
Sbjct: 126 LDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVDPFEAIIDEEMEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTISVYQKTA 207
G+ H+ ++ N SL+ ++ A
Sbjct: 183 PGFPHEGYSLEN--SLNYSVGYRAPNA 207
>gi|423195891|ref|ZP_17182474.1| hypothetical protein HMPREF1171_00506 [Aeromonas hydrophila SSU]
gi|404632692|gb|EKB29294.1| hypothetical protein HMPREF1171_00506 [Aeromonas hydrophila SSU]
Length = 376
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI Q +GK+ W+V + ++E+ + E I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ V +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200
>gi|421079716|ref|ZP_15540654.1| Protein ycfD [Pectobacterium wasabiae CFBP 3304]
gi|401705802|gb|EJS95987.1| Protein ycfD [Pectobacterium wasabiae CFBP 3304]
Length = 373
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W++ + ++ P P+ Q + I+ LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFEAIIDAELEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
++Y+P G+ H+ ++ N SL+ ++ A
Sbjct: 178 IVYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207
>gi|117618540|ref|YP_855951.1| cupin superfamily protein [Aeromonas hydrophila subsp. hydrophila
ATCC 7966]
gi|117559947|gb|ABK36895.1| cupin superfamily protein [Aeromonas hydrophila subsp. hydrophila
ATCC 7966]
Length = 376
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI Q +GK+ W+V + ++E+ + E I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ V +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200
>gi|120598908|ref|YP_963482.1| cupin 4 family protein [Shewanella sp. W3-18-1]
gi|146293006|ref|YP_001183430.1| cupin 4 family protein [Shewanella putrefaciens CN-32]
gi|120559001|gb|ABM24928.1| Cupin 4 family protein [Shewanella sp. W3-18-1]
gi|145564696|gb|ABP75631.1| Cupin 4 family protein [Shewanella putrefaciens CN-32]
Length = 386
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 7/101 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y TP G PH D + FI Q G+++W+V E+ + P E PI
Sbjct: 115 SYATP--GGGVGPHIDLYDVFICQGSGRRRWRVGDRGPHREFA---AHPALLHTEAFEPI 169
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+ L PGD+LY+P G+ H +T E SL ++ +A
Sbjct: 170 IDTELLPGDILYIPPGFPHDG--ITLEESLSFSVGYRTASA 208
>gi|323497695|ref|ZP_08102711.1| hypothetical protein VISI1226_13616 [Vibrio sinaloensis DSM 21326]
gi|323317278|gb|EGA70273.1| hypothetical protein VISI1226_13616 [Vibrio sinaloensis DSM 21326]
Length = 376
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 3/76 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY-LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
G PH D + FI+Q GK++W+V L R +Y + Q E I+ LEPG
Sbjct: 122 GVGPHIDQYDVFIIQGSGKRRWRVGDLDR--GQYQESIQASALRQIEGFDAIIDEILEPG 179
Query: 175 DLLYLPRGYIHQASTV 190
D+LY+P G+ H+ +T+
Sbjct: 180 DILYIPPGFPHEGNTL 195
>gi|423106188|ref|ZP_17093889.1| hypothetical protein HMPREF9686_04793 [Klebsiella oxytoca 10-5242]
gi|376378396|gb|EHS91156.1| hypothetical protein HMPREF9686_04793 [Klebsiella oxytoca 10-5242]
Length = 374
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|334703818|ref|ZP_08519684.1| cupin superfamily protein [Aeromonas caviae Ae398]
Length = 376
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI Q +GK+ W+V + ++E+ + E I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ V +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200
>gi|254670287|emb|CBA05598.1| myc induced nuclear antigen [Neisseria meningitidis alpha153]
Length = 309
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 77/167 (46%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 7 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHISRQVARFAGAHTIVSGYLAFGSDE 61
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 62 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DVPMPLYMQQT-KDTDISIPEHIDMDIILE 117
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H+ + + +++ L K P
Sbjct: 118 AGDVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 163
>gi|423127147|ref|ZP_17114826.1| hypothetical protein HMPREF9694_03838 [Klebsiella oxytoca 10-5250]
gi|376396006|gb|EHT08651.1| hypothetical protein HMPREF9694_03838 [Klebsiella oxytoca 10-5250]
Length = 374
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/204 (22%), Positives = 89/204 (43%), Gaps = 22/204 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
+W+K+P+ + + N+ + + D L +E + + + ++GQ Q +
Sbjct: 16 YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70
Query: 61 --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
L + W + + + ++P + L + L +L F P G
Sbjct: 71 FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGDIL 179
Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
Y+P G+ H +T+E +L+ ++
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201
>gi|421724900|ref|ZP_16164104.1| hypothetical protein KOXM_04676 [Klebsiella oxytoca M5al]
gi|410374272|gb|EKP28949.1| hypothetical protein KOXM_04676 [Klebsiella oxytoca M5al]
Length = 374
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/204 (22%), Positives = 89/204 (43%), Gaps = 22/204 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
+W+K+P+ + + N+ + + D L +E + + + ++GQ Q +
Sbjct: 16 YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70
Query: 61 --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
L + W + + + ++P + L + L +L F P G
Sbjct: 71 FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGDIL 179
Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
Y+P G+ H +T+E +L+ ++
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201
>gi|423117695|ref|ZP_17105386.1| hypothetical protein HMPREF9689_05443 [Klebsiella oxytoca 10-5245]
gi|376375825|gb|EHS88611.1| hypothetical protein HMPREF9689_05443 [Klebsiella oxytoca 10-5245]
Length = 374
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/204 (22%), Positives = 89/204 (43%), Gaps = 22/204 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
+W+K+P+ + + N+ + + D L +E + + + ++GQ Q +
Sbjct: 16 YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70
Query: 61 --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
L + W + + + ++P + L + L +L F P G
Sbjct: 71 FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGDIL 179
Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
Y+P G+ H +T+E +L+ ++
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201
>gi|403377316|gb|EJY88650.1| hypothetical protein OXYTRI_00132 [Oxytricha trifallax]
Length = 624
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 93/219 (42%), Gaps = 39/219 (17%)
Query: 19 NNLITSKKIDDILRDNV----IEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGC 74
+N+I+ K IDD L+ V I++ I++ Y +G++ +M++ + D +
Sbjct: 173 DNIISKKMIDDYLKMQVPQGNIKYSDQIELFKYVNGKKFSMNLGESVDSDYLEDQLQNQS 232
Query: 75 SIRFLN-PQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEG 133
+I LN Q + L ++ + ELF + T Q F HY L
Sbjct: 233 TILQLNKAQEFSISLAKVMSLSDELFASDI---------TCQVFLQHY---------LTQ 274
Query: 134 KKKWKVYLPRM----VDEYLPRYSSP----NFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
++ K L M D+Y+ F+ ++ TP+ V L+ GD Y+PR +
Sbjct: 275 DRQKKNQLSEMQQNEFDQYIFMIDGSVGLRFFNNQDEKTPVFKVKLQKGDFAYIPRNVKY 334
Query: 186 QASTVTNE--------HSLHVTISVYQKTAWIDLLEKAM 216
Q T E + L++T+S + W L++K M
Sbjct: 335 QTKTEIVEDDQTSEQNYCLYITLSHQSQQTWFSLIQKQM 373
>gi|50121374|ref|YP_050541.1| hypothetical protein ECA2447 [Pectobacterium atrosepticum SCRI1043]
gi|49611900|emb|CAG75349.1| conserved hypothetical protein [Pectobacterium atrosepticum
SCRI1043]
Length = 373
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 98/219 (44%), Gaps = 38/219 (17%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQ------- 53
++W+K PL I K + + I D+ L +E + + + ++DG+ Q
Sbjct: 15 NYWQKRPLLI----KGGFTHFIDPISPDE-LAGLALENEVDSRLVSHQDGRWQVSHGPFE 69
Query: 54 TMDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTY 108
+ D G + L Q V DH+ E S ++P +QL + +L F
Sbjct: 70 SYDHLGESNWSLLVQAV-DHWHEPSS-------ALMQPFRQLPDWRTDDLMISF------ 115
Query: 109 QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT 168
P G PH D + FI+Q G+++W+V + ++ P P+ Q + I+
Sbjct: 116 SVP--GGGVGPHLDQYDVFIIQGTGRRRWRVGEKIPMKQHCPH---PDLLQVDPFEAIID 170
Query: 169 VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
LEPGD++Y+P G+ H+ ++ N SL+ ++ A
Sbjct: 171 EELEPGDIVYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207
>gi|402844686|ref|ZP_10893037.1| cupin domain protein, PF06172 family [Klebsiella sp. OBRC7]
gi|402273419|gb|EJU22620.1| cupin domain protein, PF06172 family [Klebsiella sp. OBRC7]
Length = 374
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|329119755|ref|ZP_08248434.1| cupin family protein [Neisseria bacilliformis ATCC BAA-1200]
gi|327464150|gb|EGF10456.1| cupin family protein [Neisseria bacilliformis ATCC BAA-1200]
Length = 426
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/174 (24%), Positives = 80/174 (45%), Gaps = 24/174 (13%)
Query: 56 DVEGRALPQIVWDHYREGCSIRF--LNPQTYIKPL-QQLNA-----SLQELFGCFVGANT 107
++ R + ++++H R G S+ + +N + + + +Q+ A ++ + CF +
Sbjct: 116 NIRHRFIKPVIYEHMRNGASLIYNHINNEPFSDGIARQIGALAQAHTVTSAYLCFGSDES 175
Query: 108 YQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRY--SSPNFSQEEIGTP 165
Y+ H+D + + +QL GKK+W + P P Y + + + E T
Sbjct: 176 YKN---------HWDTRDVYAVQLVGKKRWYLSAPNFPQ---PLYMQQTKDLNVTEPDTV 223
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKA 219
+ LE GD+LY+PRG+ H + E S H+ + + + D LE M K
Sbjct: 224 DYDIVLEAGDVLYIPRGWWHNPIPMGCE-SFHLAVGTFPPNGY-DYLEWLMWKG 275
>gi|421497434|ref|ZP_15944603.1| hypothetical protein B224_001958 [Aeromonas media WS]
gi|407183554|gb|EKE57442.1| hypothetical protein B224_001958 [Aeromonas media WS]
Length = 376
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 52/95 (54%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI Q +GK+ W+V + ++E+ + + E I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLNEFAAHAALLHCEPFEA---I 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ V +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200
>gi|109898698|ref|YP_661953.1| cupin 4 [Pseudoalteromonas atlantica T6c]
gi|109700979|gb|ABG40899.1| Cupin 4 [Pseudoalteromonas atlantica T6c]
Length = 389
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 3/75 (4%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
T G PH D + FI+Q +G ++W+ EY + P+ SQ + TPI+ L
Sbjct: 118 TGAGVGPHLDQYDVFIIQGKGSRRWQA---GKRGEYSTYHPHPDLSQIQGFTPIIDEVLH 174
Query: 173 PGDLLYLPRGYIHQA 187
GD+LY+P G H
Sbjct: 175 SGDMLYIPAGCPHNG 189
>gi|385870105|gb|AFI88625.1| Cupin 4 family protein [Pectobacterium sp. SCC3193]
Length = 373
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/201 (24%), Positives = 88/201 (43%), Gaps = 16/201 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K+P+ + KN + N + D+ L +E + + + ++DGQ Q +
Sbjct: 16 YWQKKPVVL----KNAFPNFVDPITPDE-LAGLAMEAEVDSRLVSHKDGQWQASNG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E G S+ + P +L + L + G PH
Sbjct: 67 -PFEHFDHLGETGWSLLAQAVNHWHAPSAELVRPFRVLPDWRLDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G ++W+V + ++ S P + PI+ LEPGD+LY+P
Sbjct: 126 IDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTIS 201
G+ H T+E +L+ ++
Sbjct: 183 PGFPHDG--FTHETALNYSVG 201
>gi|423111729|ref|ZP_17099423.1| hypothetical protein HMPREF9687_04974 [Klebsiella oxytoca 10-5243]
gi|376375827|gb|EHS88612.1| hypothetical protein HMPREF9687_04974 [Klebsiella oxytoca 10-5243]
Length = 374
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|261819811|ref|YP_003257917.1| cupin [Pectobacterium wasabiae WPP163]
gi|261603824|gb|ACX86310.1| Cupin 4 family protein [Pectobacterium wasabiae WPP163]
Length = 373
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/201 (24%), Positives = 88/201 (43%), Gaps = 16/201 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K+P+ + KN + N + D+ L +E + + + ++DGQ Q +
Sbjct: 16 YWQKKPVVL----KNAFPNFVDPITPDE-LAGLAMEAEVDSRLVSHKDGQWQASNG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E G S+ + P +L + L + G PH
Sbjct: 67 -PFEHFDHLGETGWSLLAQAVNHWHAPSAELVRPFRVLPDWRLDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G ++W+V + ++ S P + PI+ LEPGD+LY+P
Sbjct: 126 IDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGDILYIP 182
Query: 181 RGYIHQASTVTNEHSLHVTIS 201
G+ H T+E +L+ ++
Sbjct: 183 PGFPHDG--FTHETALNYSVG 201
>gi|195467841|ref|XP_002076034.1| GK10507 [Drosophila willistoni]
gi|194172119|gb|EDW87020.1| GK10507 [Drosophila willistoni]
Length = 158
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 39/64 (60%)
Query: 276 LDDGVDEMGKQLMHDALPPVLSPEELQCSVFENGLRMSQTGEVYNATQITKDTKVRLVRA 335
+D VD++ K+ H+ALPP + PEE +VF + G+ ++T+DT +RL+RA
Sbjct: 13 IDAAVDQLAKRYQHEALPPTILPEEKLRTVFGSRSATDAHGKCLCDYELTEDTSIRLLRA 72
Query: 336 NAVR 339
N +R
Sbjct: 73 NILR 76
>gi|83645143|ref|YP_433578.1| hypothetical protein HCH_02330 [Hahella chejuensis KCTC 2396]
gi|83633186|gb|ABC29153.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
Length = 405
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 42/93 (45%), Gaps = 14/93 (15%)
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSP------NFSQEEIGTP 165
P PH+D + F++Q G ++W++ P D+ PR NF Q E
Sbjct: 122 PPGGSVGPHFDHYDVFLIQATGARRWRIGPP--CDDQSPRVDGTPLRILQNFEQTE---- 175
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
LEPGD LYLP GY H T+ +L V
Sbjct: 176 --EWVLEPGDALYLPPGYAHYGVAETSCITLSV 206
>gi|424802359|ref|ZP_18227901.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 696]
gi|423238080|emb|CCK09771.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 696]
Length = 373
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G+++W+V + ++ P P + PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISV 202
+ LEPGD+LY+P G+ H T+E +L+ ++
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVGF 202
>gi|59712199|ref|YP_204975.1| hypothetical protein VF_1592 [Vibrio fischeri ES114]
gi|59480300|gb|AAW86087.1| conserved protein [Vibrio fischeri ES114]
Length = 394
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +GK+ W+V + EY + Q PI+ LEPGD
Sbjct: 134 GVGPHIDQYDVFIIQGQGKRHWRV---GDIGEYQEEHRHSALKQITGFEPIIDQILEPGD 190
Query: 176 LLYLPRGYIHQA 187
+LY+P G+ H
Sbjct: 191 ILYIPPGFPHDG 202
>gi|385329034|ref|YP_005883337.1| hypothetical protein NMBB_2028 [Neisseria meningitidis alpha710]
gi|385341362|ref|YP_005895233.1| cupin family protein [Neisseria meningitidis M01-240149]
gi|385856660|ref|YP_005903172.1| cupin family protein [Neisseria meningitidis NZ-05/33]
gi|416174072|ref|ZP_11609101.1| cupin family protein [Neisseria meningitidis OX99.30304]
gi|416188964|ref|ZP_11615017.1| cupin family protein [Neisseria meningitidis M0579]
gi|433465770|ref|ZP_20423241.1| cupin-like domain protein [Neisseria meningitidis NM422]
gi|433491143|ref|ZP_20448256.1| cupin-like domain protein [Neisseria meningitidis NM418]
gi|308389886|gb|ADO32206.1| hypothetical protein NMBB_2028 [Neisseria meningitidis alpha710]
gi|325129586|gb|EGC52408.1| cupin family protein [Neisseria meningitidis OX99.30304]
gi|325135654|gb|EGC58270.1| cupin family protein [Neisseria meningitidis M0579]
gi|325201568|gb|ADY97022.1| cupin family protein [Neisseria meningitidis M01-240149]
gi|325207549|gb|ADZ03001.1| cupin family protein [Neisseria meningitidis NZ-05/33]
gi|432200866|gb|ELK56954.1| cupin-like domain protein [Neisseria meningitidis NM422]
gi|432225702|gb|ELK81442.1| cupin-like domain protein [Neisseria meningitidis NM418]
Length = 384
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 77/167 (46%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHISRQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H+ + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 238
>gi|377579221|ref|ZP_09808192.1| hypothetical protein YcfD [Escherichia hermannii NBRC 105704]
gi|377539505|dbj|GAB53357.1| hypothetical protein YcfD [Escherichia hermannii NBRC 105704]
Length = 373
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 47/204 (23%), Positives = 91/204 (44%), Gaps = 22/204 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD---VE 58
+W+K+P+ + KN + N + D+ L +E + + + +++GQ Q + +
Sbjct: 16 YWQKQPVVL----KNAFPNFVDPITPDE-LAGLAMEPEVDSRLVSHKNGQWQAANGPFED 70
Query: 59 GRALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
L + W + + + ++P + L + L +L F P G
Sbjct: 71 FDNLGETGWSLLAQAVNHWHMPSAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGDIL 179
Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
Y+P G+ H T+E +L+ ++
Sbjct: 180 YIPPGFPHDG--FTHETALNYSVG 201
>gi|157368749|ref|YP_001476738.1| cupin 4 family protein [Serratia proteamaculans 568]
gi|157320513|gb|ABV39610.1| Cupin 4 family protein [Serratia proteamaculans 568]
Length = 373
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D+ + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDNYDVFIIQGMGSRRWRVGDALPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|444912786|ref|ZP_21232946.1| hypothetical protein D187_04882 [Cystobacter fuscus DSM 2262]
gi|444716710|gb|ELW57553.1| hypothetical protein D187_04882 [Cystobacter fuscus DSM 2262]
Length = 441
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 82/200 (41%), Gaps = 35/200 (17%)
Query: 89 QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
+Q+N + F C++ P +G HYD A I+Q+ G K+W R+
Sbjct: 122 RQMNLATPVRFNCYLS-------PDGEGLDTHYDARHATIIQISGSKRW--LYSRLPATN 172
Query: 149 LP------------RYSSPNFSQEEIGTP------ILTVTLEPGDLLYLPRGYIHQASTV 190
P R+ N ++++ P V LEPGDLLYLP G H A
Sbjct: 173 YPLANALVQKDGQIRHGEHNNRRQKLDVPTPDESQFEEVLLEPGDLLYLPPGTWHNAKAS 232
Query: 191 TNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLEFRRGLPIGYLRYAGLARGKPV 250
+L++ I +L+ P+ L+ + LE+R LP L P+
Sbjct: 233 GESLALNMAFET------IGMLQVLFPE-LERMLSEKLEWRAILPATPLEQTRPGE-VPL 284
Query: 251 DIQADRLAMKENLKDLLAKL 270
+++ A L+D+L KL
Sbjct: 285 EVEQFITARLGELRDMLGKL 304
>gi|409407054|ref|ZP_11255505.1| hypothetical protein GWL_26590 [Herbaspirillum sp. GW103]
gi|386432805|gb|EIJ45631.1| hypothetical protein GWL_26590 [Herbaspirillum sp. GW103]
Length = 378
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 89/212 (41%), Gaps = 25/212 (11%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W K+PL I R + + +++ D+ R +E + IT DG +
Sbjct: 21 DYWHKKPLLI-RNALPGFTAPLPRRELFDLARREEVESRL---ITHRSDGWDMQHGPFNK 76
Query: 61 ALP--QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
P Q W +G ++ ++ + L +A L +L +Y T G
Sbjct: 77 LPPLKQKAWTLLVQGVNLHHPAADALLRQFRFLPDARLDDLM------ISYAT--DGGGV 128
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
PH+D + F+LQ G+++W++ ++E +P +F E+ LEPGD
Sbjct: 129 GPHFDSYDVFLLQAHGQRRWRISAQDDLTLEEGVPLKILSHFQPEQ------EFVLEPGD 182
Query: 176 LLYLPRGYIHQASTVTN--EHSLHVTISVYQK 205
+LYLP Y H V +S+ YQ+
Sbjct: 183 MLYLPPQYAHDGVAVGECMTYSIGFRAPAYQE 214
>gi|156936656|ref|YP_001440570.1| hypothetical protein ESA_pESA3p05537 [Cronobacter sakazakii ATCC
BAA-894]
gi|156534910|gb|ABU79734.1| hypothetical protein ESA_pESA3p05537 [Cronobacter sakazakii ATCC
BAA-894]
Length = 373
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G+++W+V + ++ P P + PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ LEPGD+LY+P G+ H T+E +L+ ++
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201
>gi|429118537|ref|ZP_19179298.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 680]
gi|426327009|emb|CCK10035.1| FIG00553994: hypothetical protein [Cronobacter sakazakii 680]
Length = 373
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G+++W+V + ++ P P + PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ LEPGD+LY+P G+ H T+E +L+ ++
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201
>gi|333893393|ref|YP_004467268.1| hypothetical protein ambt_09700 [Alteromonas sp. SN2]
gi|332993411|gb|AEF03466.1| hypothetical protein ambt_09700 [Alteromonas sp. SN2]
Length = 388
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 50/109 (45%), Gaps = 12/109 (11%)
Query: 85 IKPLQQL-NASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR 143
++P + L N L +L F N G H D + F++Q +G+++W+V P
Sbjct: 103 LEPFRFLPNWRLDDLMVSFATKNA--------GVGAHIDQYDVFLVQGKGQRRWRVGKPG 154
Query: 144 MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTN 192
E P P Q E PI+ L PGD+LY+P G+ H + +
Sbjct: 155 EYKEVFPH---PKLRQIEGFDPIIDEVLNPGDVLYIPPGWPHDGKALED 200
>gi|238790022|ref|ZP_04633800.1| hypothetical protein yfred0001_18980 [Yersinia frederiksenii ATCC
33641]
gi|238721835|gb|EEQ13497.1| hypothetical protein yfred0001_18980 [Yersinia frederiksenii ATCC
33641]
Length = 373
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|416207366|ref|ZP_11620906.1| cupin family protein [Neisseria meningitidis 961-5945]
gi|254674054|emb|CBA09838.1| myc induced nuclear antigen [Neisseria meningitidis alpha275]
gi|325141714|gb|EGC64172.1| cupin family protein [Neisseria meningitidis 961-5945]
Length = 309
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 77/167 (46%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 7 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 61
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 62 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 117
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H+ + + +++ L K P
Sbjct: 118 AGDVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 163
>gi|440288006|ref|YP_007340771.1| hypothetical protein D782_2626 [Enterobacteriaceae bacterium strain
FGI 57]
gi|440047528|gb|AGB78586.1| hypothetical protein D782_2626 [Enterobacteriaceae bacterium strain
FGI 57]
Length = 373
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q I+ LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKVPMKQHCPH---PDLLQVNPFEGIIDEELEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ ++ N
Sbjct: 178 ILYIPPGFPHEGYSLEN 194
>gi|384083452|ref|ZP_09994627.1| hypothetical protein gproHI_09076 [gamma proteobacterium HIMB30]
Length = 377
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 2/71 (2%)
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
HYD F++QL G+++W++ PR DE P ++ + E PI T PGD+LYL
Sbjct: 129 HYDQFSVFLIQLNGRRQWEIG-PRATDET-PMVANQPIALVEPQEPITTWIANPGDVLYL 186
Query: 180 PRGYIHQASTV 190
P IH ++
Sbjct: 187 PPNMIHHGVSL 197
>gi|445498591|ref|ZP_21465446.1| cupin 4 family [Janthinobacterium sp. HH01]
gi|444788586|gb|ELX10134.1| cupin 4 family [Janthinobacterium sp. HH01]
Length = 394
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 47/198 (23%), Positives = 81/198 (40%), Gaps = 25/198 (12%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W K+PL I R + L++ + + + N E + ++ DGQ
Sbjct: 39 DYWHKKPLLI-RNAIPGFKPLLSMEALTKLASTNHAESRLVTNV----DGQWHMQHGPLE 93
Query: 61 ALP---QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQG 116
+LP Q W +G ++ ++ + + +A L +L F G
Sbjct: 94 SLPPMDQKEWTMLVQGVNLFDAKADALLRQFRFIPDARLDDLMVSFA--------TDGGG 145
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PH+D + F+LQ +G+++W + + + E LP NF E L PG
Sbjct: 146 VGPHFDSYDVFLLQAQGQRRWSIGPQKDLTLVEGLPLKILANFKPNE------EFVLNPG 199
Query: 175 DLLYLPRGYIHQASTVTN 192
D+LYLP Y H + +
Sbjct: 200 DMLYLPPHYAHDGIAIGD 217
>gi|304388287|ref|ZP_07370403.1| cupin family protein [Neisseria meningitidis ATCC 13091]
gi|304337718|gb|EFM03871.1| cupin family protein [Neisseria meningitidis ATCC 13091]
Length = 384
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 77/167 (46%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H+ + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHLAVGTFPPNGYNYLEWLMKKFP 238
>gi|123443237|ref|YP_001007211.1| hypothetical protein YE3022 [Yersinia enterocolitica subsp.
enterocolitica 8081]
gi|122090198|emb|CAL13061.1| conserved hypothetical protein [Yersinia enterocolitica subsp.
enterocolitica 8081]
Length = 373
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P E TPI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFTPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|385324749|ref|YP_005879188.1| hypothetical protein NMV_1952 [Neisseria meningitidis 8013]
gi|433510027|ref|ZP_20466884.1| cupin-like domain protein [Neisseria meningitidis 12888]
gi|261393136|emb|CAX50744.1| conserved hypothetical protein [Neisseria meningitidis 8013]
gi|432244965|gb|ELL00444.1| cupin-like domain protein [Neisseria meningitidis 12888]
Length = 387
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)
Query: 94 SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
S+ + F A T + A G +P H+D + F +QL GKK W V P +
Sbjct: 107 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 163
Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
P Y I P + V LE GD+LY+PRG+ H + E + H+ I +
Sbjct: 164 PLYMQQAKDMPHITPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 222
Query: 207 A--WIDLLEKAMP 217
+++ L K +P
Sbjct: 223 GYNYMEWLMKKIP 235
>gi|422018084|ref|ZP_16364643.1| cupin 4 family protein [Providencia alcalifaciens Dmel2]
gi|414105209|gb|EKT66772.1| cupin 4 family protein [Providencia alcalifaciens Dmel2]
Length = 373
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 8/115 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D+ + FI+Q G+++W+V + ++ + P E PI
Sbjct: 114 SYSVP--GGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFC---AHPALLHVEPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
+ V + PGD+LY+P G+ H T E +++ ++ T DLL AL+
Sbjct: 169 IDVEMVPGDILYIPPGFPHDGFTF--EDTMNYSVGFRGPTG-RDLLSSFADYALE 220
>gi|317494130|ref|ZP_07952546.1| cupin superfamily protein [Enterobacteriaceae bacterium 9_2_54FAA]
gi|316917903|gb|EFV39246.1| cupin superfamily protein [Enterobacteriaceae bacterium 9_2_54FAA]
Length = 373
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPSGFPHDG--FTHETALNYSVG 201
>gi|398802956|ref|ZP_10562131.1| hypothetical protein PMI15_00876 [Polaromonas sp. CF318]
gi|398097782|gb|EJL88084.1| hypothetical protein PMI15_00876 [Polaromonas sp. CF318]
Length = 400
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 79/191 (41%), Gaps = 25/191 (13%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMD-VEGR 60
HW+K+PL + R+ + L++ + + +E + I G R R
Sbjct: 22 HWQKKPLLV-RQALPGFQPLLSRAALFKLAASEQVESRL---IVQQAKGWRMKQGPFAPR 77
Query: 61 ALP---QIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQG 116
+LP Q W +G + ++ + + +A L +L F A G
Sbjct: 78 SLPPLSQKAWTLLVQGVDLHDAGAHELLQRFRFVPDARLDDLMISFATAGG--------G 129
Query: 117 FAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPG 174
PH+D + F+LQ G+++WK+ + + E +P NF E+ LE G
Sbjct: 130 VGPHFDSYDVFLLQASGRRRWKIGRQKDLSLQEGVPLKILQNFEPEQ------EFVLEAG 183
Query: 175 DLLYLPRGYIH 185
D+LYLP Y H
Sbjct: 184 DMLYLPPRYAH 194
>gi|398835112|ref|ZP_10592489.1| hypothetical protein PMI40_02498 [Herbaspirillum sp. YR522]
gi|398217303|gb|EJN03821.1| hypothetical protein PMI40_02498 [Herbaspirillum sp. YR522]
Length = 436
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 136/337 (40%), Gaps = 49/337 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
D+W K+PL I + + L D RD+V ++ IT DG +
Sbjct: 82 DYWHKKPLLIRQAIPAFQPPLSRQALFDLAGRDDV----ESRLITHRSDGWDMQHGPLAK 137
Query: 61 --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
AL Q W +G ++ ++ + + +A L +L F G
Sbjct: 138 LPALKQKAWTLLVQGVNLHDDAADALLRQFRFIPDARLDDLMISFAT--------DGGGV 189
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
PH+D + F+LQ G+++W++ + E +P +F E+ LEPGD
Sbjct: 190 GPHFDSYDVFLLQAHGQRRWRISAQTDLSLKEDVPLKILRHFEPEQ------EFVLEPGD 243
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISV-YQKTAWIDLLEKAMPKALQAAGATDLEFRRGL 234
+LYLP Y H V +T S+ ++ A +L E + L + DL R
Sbjct: 244 MLYLPPQYAHDGVAVGE----CMTYSIGFRAPAHQELGEAFLEYMLD---SIDLPGRFAD 296
Query: 235 PIGYLRYAGLARGKPVDIQADRLAMKENLKDLLAKLVEYVDLDDGVDEMGKQLMHDAL-- 292
P L+ A KP ++ D M L+D L K + + D DD +G+ L L
Sbjct: 297 P--ELK----ATSKPAELPVD---MVRRLRDELNK-IRFTD-DDITIFLGEYLSEPKLSV 345
Query: 293 ----PPVLSPEELQCSVFENGLRMS-QTGEVYNATQI 324
PP +S + GLR+ +T +Y A+ +
Sbjct: 346 TFDAPPEISFARFFQEARKRGLRLHRKTQMLYRASHV 382
>gi|385851891|ref|YP_005898406.1| cupin family protein [Neisseria meningitidis M04-240196]
gi|416177792|ref|ZP_11610161.1| cupin family protein [Neisseria meningitidis M6190]
gi|416191669|ref|ZP_11616167.1| cupin family protein [Neisseria meningitidis ES14902]
gi|433494502|ref|ZP_20451572.1| cupin-like domain protein [Neisseria meningitidis NM762]
gi|433502709|ref|ZP_20459674.1| cupin-like domain protein [Neisseria meningitidis NM126]
gi|325132362|gb|EGC55055.1| cupin family protein [Neisseria meningitidis M6190]
gi|325138455|gb|EGC61021.1| cupin family protein [Neisseria meningitidis ES14902]
gi|325206714|gb|ADZ02167.1| cupin family protein [Neisseria meningitidis M04-240196]
gi|432231176|gb|ELK86846.1| cupin-like domain protein [Neisseria meningitidis NM762]
gi|432240805|gb|ELK96336.1| cupin-like domain protein [Neisseria meningitidis NM126]
Length = 387
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)
Query: 94 SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
S+ + F A T + A G +P H+D + F +QL GKK W V P +
Sbjct: 107 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 163
Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
P Y I P + V LE GD+LY+PRG+ H + E + H+ I +
Sbjct: 164 PLYMQQAKDMPHITPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 222
Query: 207 A--WIDLLEKAMP 217
+++ L K +P
Sbjct: 223 GYNYMEWLMKKIP 235
>gi|212711786|ref|ZP_03319914.1| hypothetical protein PROVALCAL_02861 [Providencia alcalifaciens DSM
30120]
gi|212685308|gb|EEB44836.1| hypothetical protein PROVALCAL_02861 [Providencia alcalifaciens DSM
30120]
Length = 373
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 8/115 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D+ + FI+Q G+++W+V + ++ + P E PI
Sbjct: 114 SYSVP--GGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFC---AHPALLHVEPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
+ V + PGD+LY+P G+ H T E +++ ++ T DLL AL+
Sbjct: 169 IDVEMVPGDILYIPPGFPHDGFTF--EDTMNYSVGFRGPTG-RDLLSSFADYALE 220
>gi|387889678|ref|YP_006319976.1| cupin superfamily protein family [Escherichia blattae DSM 4481]
gi|386924511|gb|AFJ47465.1| cupin superfamily protein family [Escherichia blattae DSM 4481]
Length = 392
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD
Sbjct: 140 GVGPHLDQYDVFIIQGTGRRRWRVGEKLPLKQHCPH---PDLLQVDPFEAIIDEEMEPGD 196
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 197 ILYIPPGFPHEGYALEN 213
>gi|254506132|ref|ZP_05118276.1| Cupin superfamily protein [Vibrio parahaemolyticus 16]
gi|219550950|gb|EED27931.1| Cupin superfamily protein [Vibrio parahaemolyticus 16]
Length = 375
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 13/126 (10%)
Query: 67 WDHYREGCSIRFLNPQTYIKPLQQLNASL-QELFGCFVGANTYQTPPTAQGFAPHYDDIE 125
W + C+ L + P + L L +L C Y P G PH D +
Sbjct: 80 WQLIVQACNHWHLGAAELVAPFKALPQWLFDDLMVC------YSAPQG--GVGPHIDQYD 131
Query: 126 AFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTPILTVTLEPGDLLYLPRGYI 184
FI+Q GK++W+V DE + S Q E I+ LEPGD+LY+P G+
Sbjct: 132 VFIIQGSGKRRWRV---GAADEGQYQESIQGALRQIESFDAIIDEVLEPGDILYIPPGFP 188
Query: 185 HQASTV 190
H+ +T+
Sbjct: 189 HEGNTI 194
>gi|403358106|gb|EJY78689.1| hypothetical protein OXYTRI_24149 [Oxytricha trifallax]
Length = 624
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 93/219 (42%), Gaps = 39/219 (17%)
Query: 19 NNLITSKKIDDILRDNV----IEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGC 74
+N+I+ K IDD L+ V I++ I++ Y +G++ +M++ + D +
Sbjct: 173 DNIISKKIIDDYLKMQVPQGNIKYSDQIELFKYVNGKKFSMNLGESVDSDYLEDQLQNQS 232
Query: 75 SIRFLN-PQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEG 133
+I LN Q + L ++ + ELF + T Q F HY L
Sbjct: 233 TILQLNKAQEFSISLAKVMSLSDELFASDI---------TCQVFLQHY---------LTQ 274
Query: 134 KKKWKVYLPRM----VDEYLPRYSSP----NFSQEEIGTPILTVTLEPGDLLYLPRGYIH 185
++ K L M D+Y+ F+ ++ TP+ V L+ GD Y+PR +
Sbjct: 275 DRQKKNQLSEMQQNEFDQYIFMIDGSVGLRFFNNQDEKTPVFKVKLQKGDFAYIPRNVKY 334
Query: 186 QASTVTNE--------HSLHVTISVYQKTAWIDLLEKAM 216
Q T E + L++T+S + W L++K M
Sbjct: 335 QTKTEIVEDDQTSEQNYCLYITLSHQSQQTWFSLIQKQM 373
>gi|416214530|ref|ZP_11622926.1| cupin family protein [Neisseria meningitidis M01-240013]
gi|325143802|gb|EGC66118.1| cupin family protein [Neisseria meningitidis M01-240013]
gi|389606480|emb|CCA45393.1| Myc-induced nuclear antigen [Neisseria meningitidis alpha522]
Length = 381
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)
Query: 94 SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
S+ + F A T + A G +P H+D + F +QL GKK W V P +
Sbjct: 101 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 157
Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
P Y I P + V LE GD+LY+PRG+ H + E + H+ I +
Sbjct: 158 PLYMQQAKDMPHITPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 216
Query: 207 A--WIDLLEKAMP 217
+++ L K +P
Sbjct: 217 GYNYMEWLMKKIP 229
>gi|403058364|ref|YP_006646581.1| cupin 4 family protein [Pectobacterium carotovorum subsp.
carotovorum PCC21]
gi|402805690|gb|AFR03328.1| cupin 4 family protein [Pectobacterium carotovorum subsp.
carotovorum PCC21]
Length = 373
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W++ + ++ P P+ Q + I+ LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFDAIIDEELEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
++Y+P G+ H+ ++ N SL+ ++ A
Sbjct: 178 IIYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207
>gi|340786871|ref|YP_004752336.1| hypothetical protein CFU_1681 [Collimonas fungivorans Ter331]
gi|340552138|gb|AEK61513.1| hypothetical protein CFU_1681 [Collimonas fungivorans Ter331]
Length = 394
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 81/203 (39%), Gaps = 39/203 (19%)
Query: 1 DHWEKEPLYISRKQKNYY-----NNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTM 55
D+W K+PL I + + + L + DD+ V FK+ Q Q
Sbjct: 39 DYWHKKPLLIRQALPGFIPLLSPDALFELARRDDVESRLVTHFKQ----------QWQVS 88
Query: 56 DVEGRALPQIV---WDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTP 111
+ LP W +G ++ ++ + + +A L +L + N
Sbjct: 89 NGPASELPSTAQKDWTLLVQGVNLHDDGADALLRQFRFIPDARLDDLMISYATDNG---- 144
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSS----PNFSQEEIGTPIL 167
G PH+D + F+LQ G+++WK+ + D L S+ NF E+
Sbjct: 145 ----GVGPHFDSYDVFLLQAHGQRRWKIGATQ--DLTLVEGSALKILKNFKPEQ------ 192
Query: 168 TVTLEPGDLLYLPRGYIHQASTV 190
LEPGD+LYLP Y H + +
Sbjct: 193 EFILEPGDMLYLPPQYAHDGTAI 215
>gi|227111568|ref|ZP_03825224.1| hypothetical protein PcarbP_01325 [Pectobacterium carotovorum
subsp. brasiliensis PBR1692]
Length = 373
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W++ + ++ P P+ Q + I+ LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFDAIIDEELEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
++Y+P G+ H+ ++ N SL+ ++ A
Sbjct: 178 IIYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207
>gi|416214473|ref|ZP_11622908.1| cupin family protein [Neisseria meningitidis M01-240013]
gi|325143871|gb|EGC66186.1| cupin family protein [Neisseria meningitidis M01-240013]
Length = 384
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHISRQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|294951730|ref|XP_002787109.1| hypothetical protein Pmar_PMAR001518 [Perkinsus marinus ATCC 50983]
gi|239901734|gb|EER18905.1| hypothetical protein Pmar_PMAR001518 [Perkinsus marinus ATCC 50983]
Length = 58
Score = 50.8 bits (120), Expect = 0.001, Method: Composition-based stats.
Identities = 21/46 (45%), Positives = 32/46 (69%)
Query: 160 EEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK 205
+ +G P+ +VTLE GDLL +PRG IH+A + SLH+T+ V ++
Sbjct: 8 DNLGEPLYSVTLEEGDLLLIPRGVIHRARASQEQGSLHITVKVLKR 53
>gi|50119371|ref|YP_048538.1| hypothetical protein ECA0420 [Pectobacterium atrosepticum SCRI1043]
gi|49609897|emb|CAG73335.1| conserved hypothetical protein [Pectobacterium atrosepticum
SCRI1043]
Length = 373
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 88/204 (43%), Gaps = 22/204 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K+P+ + KN + N + D+ L +E + + + Y++GQ Q +
Sbjct: 16 YWQKKPVVL----KNAFPNFVDPITPDE-LAGLAMEAEVDSRLVSYKNGQWQASNGPFEH 70
Query: 62 ---LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
L + W + + + P + L + L +L F P G
Sbjct: 71 FDNLGETGWSLLAQAVNHWHAPSAELVSPFRVLPDWRLDDLMISF------SVP--GGGV 122
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGDIL 179
Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
Y+P G+ H T+E +L+ ++
Sbjct: 180 YIPPGFPHDG--FTHETALNYSVG 201
>gi|414593044|ref|ZP_11442692.1| hypothetical protein YcfD [Escherichia blattae NBRC 105725]
gi|403195877|dbj|GAB80344.1| hypothetical protein YcfD [Escherichia blattae NBRC 105725]
Length = 373
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLPLKQHCPH---PDLLQVDPFEAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 178 ILYIPPGFPHEGYALEN 194
>gi|357387437|ref|YP_004902276.1| hypothetical protein KSE_04740 [Kitasatospora setae KM-6054]
gi|311893912|dbj|BAJ26320.1| hypothetical protein KSE_04740 [Kitasatospora setae KM-6054]
Length = 316
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 77/196 (39%), Gaps = 13/196 (6%)
Query: 29 DILRDNVIEFKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPL 88
+++RD + + T G+R++ G A P + EG ++ + +
Sbjct: 54 EMVRDGAVVEPELFCATRTVAGRRES----GWADPARISALLAEGATLLLPQLDQWHAGV 109
Query: 89 QQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY 148
+ + + G A + T +G H DD + ++QL G K W V+ P +
Sbjct: 110 RAMATDIALHLGRRTEAFCFTTGAGRRGLDVHRDDADVLVVQLAGAKDWTVHEPPADGHW 169
Query: 149 LPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAW 208
P + G L L PG++LY+PRG H T T L V +S + A
Sbjct: 170 RPGLAPAP------GPVALRTALTPGEILYVPRGAPH---TATGHRGLSVHLSFTIREAT 220
Query: 209 IDLLEKAMPKALQAAG 224
L A+ + L G
Sbjct: 221 TGRLHAALGEFLTTPG 236
>gi|421545099|ref|ZP_15991164.1| hypothetical protein NMEN140_1667 [Neisseria meningitidis NM140]
gi|421547149|ref|ZP_15993187.1| hypothetical protein NMEN183_1679 [Neisseria meningitidis NM183]
gi|421548120|ref|ZP_15994148.1| hypothetical protein NMEN2781_0562 [Neisseria meningitidis NM2781]
gi|421553356|ref|ZP_15999319.1| hypothetical protein NMEN576_1705 [Neisseria meningitidis NM576]
gi|402321512|gb|EJU56985.1| hypothetical protein NMEN183_1679 [Neisseria meningitidis NM183]
gi|402321782|gb|EJU57254.1| hypothetical protein NMEN140_1667 [Neisseria meningitidis NM140]
gi|402326842|gb|EJU62240.1| hypothetical protein NMEN2781_0562 [Neisseria meningitidis NM2781]
gi|402328264|gb|EJU63641.1| hypothetical protein NMEN576_1705 [Neisseria meningitidis NM576]
Length = 322
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)
Query: 94 SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
S+ + F A T + A G +P H+D + F +QL GKK W V P +
Sbjct: 107 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 163
Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
P Y I P + V LE GD+LY+PRG+ H + E + H+ I +
Sbjct: 164 PLYMQQAKDMPHITPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 222
Query: 207 A--WIDLLEKAMP 217
+++ L K +P
Sbjct: 223 GYNYMEWLMKKIP 235
>gi|271500463|ref|YP_003333488.1| Cupin 4 family protein [Dickeya dadantii Ech586]
gi|270344018|gb|ACZ76783.1| Cupin 4 family protein [Dickeya dadantii Ech586]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/289 (22%), Positives = 125/289 (43%), Gaps = 57/289 (19%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQR-------QT 54
HW+K+P+ I ++ + I+ ++ + +N ++ + + + DG+ ++
Sbjct: 16 HWQKKPVIIKAGFPHFLDP-ISPDELAGLAMENEVDSR----LVSHRDGRWDVSHGPFES 70
Query: 55 MDVEGRA----LPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQ 109
D G + L Q V DH+ E + ++P +QL + + +L F
Sbjct: 71 YDHLGESNWSLLVQAV-DHWHEPSA-------ALMRPFRQLPDWRIDDLMISF------S 116
Query: 110 TPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTV 169
P G PH D + FI+Q G+++W+V + ++ P P+ Q I+
Sbjct: 117 VP--GGGVGPHLDQYDVFIIQGTGRRRWRVGDKVPMKQHCPH---PDLLQVAPFDAIIDE 171
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
+ PGD+LY+P G+ H+ ++ N SL+ ++ +A +G D
Sbjct: 172 EMAPGDILYIPPGFPHEGYSLEN--SLNYSVGF-----------RAPSSRELVSGFADYV 218
Query: 230 FRRGLPIGYLRYAG---LARGKPVDIQADRLAMKENLKDLLAKLVEYVD 275
R L G RY+ AR P D+ L + L+ ++ +LV++ D
Sbjct: 219 LSREL--GGQRYSDPDIPARAHPADVLPQEL---DKLQHMMLELVQHKD 262
>gi|227327069|ref|ZP_03831093.1| hypothetical protein PcarcW_07019 [Pectobacterium carotovorum
subsp. carotovorum WPP14]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W++ + ++ P P+ Q + I+ LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRIGEKIPMKQHCPH---PDLLQVDPFDAIIDEELEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
++Y+P G+ H+ ++ N SL+ ++ A
Sbjct: 178 IVYIPPGFPHEGYSLEN--SLNYSVGFRAPNA 207
>gi|238920217|ref|YP_002933732.1| Cupin superfamily protein [Edwardsiella ictaluri 93-146]
gi|238869786|gb|ACR69497.1| Cupin superfamily protein [Edwardsiella ictaluri 93-146]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 78/193 (40%), Gaps = 14/193 (7%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K PL I K++ + L D L +E + + + +DGQ Q
Sbjct: 15 NYWQKRPLVIRHGFKDFIDPLSP-----DELAGLAMEQEVDSRLVCRDDGQWQVSHG--- 66
Query: 61 ALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
P +DH E S+ + P L + L + G P
Sbjct: 67 --PFTSYDHLGETNWSLLVQAVDHWHTPAAALMRPFRHLPDWRLDDLMISFSVPGGGVGP 124
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
H D + FI+Q G+++W+V ++ P P+ Q + I+ + L PGD+LY+
Sbjct: 125 HIDQYDVFIIQGSGRRRWRVGEKGQRRQHCPH---PDLLQVDPFDAIIDIELSPGDILYI 181
Query: 180 PRGYIHQASTVTN 192
P + H+ + N
Sbjct: 182 PPHFPHEGYALEN 194
>gi|374287949|ref|YP_005035034.1| hypothetical protein BMS_1188 [Bacteriovorax marinus SJ]
gi|301166490|emb|CBW26066.1| conserved hypothetical protein [Bacteriovorax marinus SJ]
Length = 304
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 71/165 (43%), Gaps = 23/165 (13%)
Query: 69 HYREGCSIRFLNPQTYIKPLQQLNASLQELFGCF------VGANTYQTPPTAQGFAPHYD 122
+Y +G + F + + + +Q L L G F + N P G A H+D
Sbjct: 83 YYDQGMGLLFNDLNLFDEKMQD---DLNRLCGDFGVSRATMSRNLVYATPKGGGTATHFD 139
Query: 123 DIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT-----------VTL 171
+LQL+G+K W + V L R++ S E+ + I + L
Sbjct: 140 QNINLVLQLKGRKSWWLKENECVRNPLTRHTLGTISDPELSSYIDSPFPDSIKYDEEYEL 199
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAM 216
G LL++P+G+ H+ + +L+ T S+ +W+D+L A+
Sbjct: 200 SEGSLLFVPKGHWHKTHAHDDSLALNFTFSL---PSWVDILSMAL 241
>gi|254493031|ref|ZP_05106202.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291]
gi|226512071|gb|EEH61416.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291]
Length = 384
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|269139406|ref|YP_003296107.1| cupin 4 family protein [Edwardsiella tarda EIB202]
gi|387867989|ref|YP_005699458.1| hypothetical protein ETAF_1860 [Edwardsiella tarda FL6-60]
gi|267985067|gb|ACY84896.1| cupin 4 family protein [Edwardsiella tarda EIB202]
gi|304559302|gb|ADM41966.1| hypothetical protein ETAF_1860 [Edwardsiella tarda FL6-60]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 77/193 (39%), Gaps = 14/193 (7%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR 60
++W+K PL I KN+ + L D L +E + + + +D Q Q
Sbjct: 15 NYWQKRPLVIRHGFKNFIDPLSP-----DELAGLAMEQEVDSRLVCRDDAQWQVSHG--- 66
Query: 61 ALPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAP 119
P +DH E S+ + P L + L + G P
Sbjct: 67 --PFTSYDHLGETNWSLLVQAVDHWHTPAAALMRPFRRLPDWRLDDLMISFSVPGGGVGP 124
Query: 120 HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYL 179
H D + FI+Q G+++W+V ++ P P+ Q + I+ V L PGD+LY+
Sbjct: 125 HIDQYDVFIIQGSGRRRWRVGEKGQRRQHCPH---PDLLQVDPFDAIIDVELSPGDILYI 181
Query: 180 PRGYIHQASTVTN 192
P + H+ + N
Sbjct: 182 PPHFPHEGYALEN 194
>gi|410087896|ref|ZP_11284597.1| hypothetical protein C790_2055 [Morganella morganii SC01]
gi|421493865|ref|ZP_15941219.1| hypothetical protein MU9_2389 [Morganella morganii subsp. morganii
KT]
gi|455739298|ref|YP_007505564.1| hypothetical protein MU9_2145 [Morganella morganii subsp. morganii
KT]
gi|400191909|gb|EJO25051.1| hypothetical protein MU9_2389 [Morganella morganii subsp. morganii
KT]
gi|409765890|gb|EKN49993.1| hypothetical protein C790_2055 [Morganella morganii SC01]
gi|455420861|gb|AGG31191.1| hypothetical protein MU9_2145 [Morganella morganii subsp. morganii
KT]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 56/115 (48%), Gaps = 8/115 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D+ + FI+Q G+++W+V + ++ + P + PI
Sbjct: 114 SYSVP--GGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFC---AHPALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
+ V + PGD+LY+P G+ H T E +++ ++ T DLL AL+
Sbjct: 169 IDVEMAPGDILYIPPGFPHDGFTF--EDTMNYSVGFRGPTG-RDLLSSFADYALE 220
>gi|385855805|ref|YP_005902318.1| cupin family protein [Neisseria meningitidis M01-240355]
gi|325204746|gb|ADZ00200.1| cupin family protein [Neisseria meningitidis M01-240355]
Length = 384
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|429112863|ref|ZP_19174633.1| FIG00553994: hypothetical protein [Cronobacter malonaticus 507]
gi|426314020|emb|CCK00746.1| FIG00553994: hypothetical protein [Cronobacter malonaticus 507]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G ++W+V + ++ P P + PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ LEPGD+LY+P G+ H T+E +L+ ++
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHETALNYSVG 201
>gi|59800581|ref|YP_207293.1| hypothetical protein NGO0121 [Neisseria gonorrhoeae FA 1090]
gi|268598285|ref|ZP_06132452.1| conserved hypothetical protein [Neisseria gonorrhoeae MS11]
gi|268685890|ref|ZP_06152752.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-93-1035]
gi|59717476|gb|AAW88881.1| conserved hypothetical protein [Neisseria gonorrhoeae FA 1090]
gi|268582416|gb|EEZ47092.1| conserved hypothetical protein [Neisseria gonorrhoeae MS11]
gi|268626174|gb|EEZ58574.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-93-1035]
Length = 384
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDTRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|253989212|ref|YP_003040568.1| conserved hypothetical protein [Photorhabdus asymbiotica]
gi|253780662|emb|CAQ83824.1| conserved hypothetical protein [Photorhabdus asymbiotica]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + I+ +EPGD
Sbjct: 121 GVGPHLDQYDVFIIQGMGRRRWRVGEKVPMKQHCPH---PDLLQVDSFDAIIDEEMEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H+ + E SL+ ++
Sbjct: 178 VLYIPPGFPHEGYAI--EDSLNYSVG 201
>gi|206578392|ref|YP_002240682.1| cupin family protein [Klebsiella pneumoniae 342]
gi|288937381|ref|YP_003441440.1| cupin [Klebsiella variicola At-22]
gi|206567450|gb|ACI09226.1| cupin family protein [Klebsiella pneumoniae 342]
gi|288892090|gb|ADC60408.1| Cupin 4 family protein [Klebsiella variicola At-22]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H VT+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201
>gi|290512111|ref|ZP_06551478.1| ycfD [Klebsiella sp. 1_1_55]
gi|289775106|gb|EFD83107.1| ycfD [Klebsiella sp. 1_1_55]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H VT+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201
>gi|422008727|ref|ZP_16355711.1| cupin 4 family protein [Providencia rettgeri Dmel1]
gi|414095200|gb|EKT56863.1| cupin 4 family protein [Providencia rettgeri Dmel1]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 8/115 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D+ + FI+Q G+++W+V + ++ S P + PI
Sbjct: 114 SYSVP--EGGVGPHIDNYDVFIIQGMGRRRWRVGDALPMRQFC---SHPALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQ 221
+ V + PGD+LY+P G+ H T E +++ ++ T DLL AL+
Sbjct: 169 IDVEMAPGDILYIPPGFPHDGFTF--EDTMNYSVGFRGPTG-RDLLSSFADYALE 220
>gi|411010101|ref|ZP_11386430.1| cupin superfamily protein [Aeromonas aquariorum AAK1]
Length = 376
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI Q +GK+ W+V ++E+ + E I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAAPLNEFAAHAA---LLHCEPFEAI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ V +EPGD+LY+P G+ H+ + E SL+ ++
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSLNFSVG 200
>gi|253686795|ref|YP_003015985.1| Cupin 4 family protein [Pectobacterium carotovorum subsp.
carotovorum PC1]
gi|251753373|gb|ACT11449.1| Cupin 4 family protein [Pectobacterium carotovorum subsp.
carotovorum PC1]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|374619859|ref|ZP_09692393.1| hypothetical protein OMB55_00009960 [gamma proteobacterium HIMB55]
gi|374303086|gb|EHQ57270.1| hypothetical protein OMB55_00009960 [gamma proteobacterium HIMB55]
Length = 324
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 39/75 (52%), Gaps = 2/75 (2%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH+D + F+LQ G+++WK+ P+ D P + + P T L+PGD
Sbjct: 77 GVGPHFDQYDVFLLQGAGQRRWKIG-PK-CDANTPTVRESDLRLIDEFEPTETFVLQPGD 134
Query: 176 LLYLPRGYIHQASTV 190
+LY+P G+ H V
Sbjct: 135 VLYVPPGFAHWGEAV 149
>gi|365838672|ref|ZP_09380010.1| cupin family protein [Hafnia alvei ATCC 51873]
gi|364559465|gb|EHM37448.1| cupin family protein [Hafnia alvei ATCC 51873]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|397605458|gb|EJK59020.1| hypothetical protein THAOC_20816 [Thalassiosira oceanica]
Length = 382
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 76/166 (45%), Gaps = 15/166 (9%)
Query: 38 FKKNIDITLYEDGQRQTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQE 97
++ N I+L G R+ +D Q++ G + ++P Y+ L+ +N E
Sbjct: 229 YETNDWISLRIRGSREQLDKSTMCYEQMLEYLDGGGSVVIPISPSDYLHELK-MNVE-NE 286
Query: 98 LFGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF 157
L N Y + P+A HYD +LQL G+K+W + D
Sbjct: 287 LDIADTSMNVYHSGPSAVALNIHYDAYPVIVLQLSGEKEWIIQNDAFGD----------- 335
Query: 158 SQEEIGTPILTVTLEPGDLLYLPRGYIHQASTVTNEHS-LHVTISV 202
S +++ T +T++ GD+LY+P+G H A+T +S HVTI +
Sbjct: 336 SIKDVNT-WKNITMKEGDVLYIPKGIYHAATTKEGFNSTTHVTIGL 380
>gi|397655706|ref|YP_006496408.1| hypothetical protein A225_0637 [Klebsiella oxytoca E718]
gi|394344379|gb|AFN30500.1| hypothetical protein A225_0637 [Klebsiella oxytoca E718]
Length = 374
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/204 (21%), Positives = 89/204 (43%), Gaps = 22/204 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
+W+K+P+ + + N+ + + D L +E + + + ++GQ Q +
Sbjct: 16 YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70
Query: 61 --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
L + W + + + ++P + L + L +L F P G
Sbjct: 71 FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGDIL 179
Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
Y+P G+ H +T+E +L+ ++
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201
>gi|375258733|ref|YP_005017903.1| hypothetical protein KOX_09690 [Klebsiella oxytoca KCTC 1686]
gi|365908211|gb|AEX03664.1| hypothetical protein KOX_09690 [Klebsiella oxytoca KCTC 1686]
Length = 374
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/204 (21%), Positives = 89/204 (43%), Gaps = 22/204 (10%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGR- 60
+W+K+P+ + + N+ + + D L +E + + + ++GQ Q +
Sbjct: 16 YWQKQPVVLKKAFANFVDPITP-----DELAGLAMEPEVDSRLVSLKNGQWQASNGPFEH 70
Query: 61 --ALPQIVWDHYREGCSIRFLNPQTYIKPLQQL-NASLQELFGCFVGANTYQTPPTAQGF 117
L + W + + + ++P + L + L +L F P G
Sbjct: 71 FDGLGETGWSLLAQAVNHWHMPAAELVRPFRVLPDWRLDDLMISF------SVP--GGGV 122
Query: 118 APHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLL 177
PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD+L
Sbjct: 123 GPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGDIL 179
Query: 178 YLPRGYIHQASTVTNEHSLHVTIS 201
Y+P G+ H +T+E +L+ ++
Sbjct: 180 YIPPGFPHDG--ITHETALNYSVG 201
>gi|365846400|ref|ZP_09386904.1| cupin family protein [Yokenella regensburgei ATCC 43003]
gi|364574118|gb|EHM51591.1| cupin family protein [Yokenella regensburgei ATCC 43003]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|326430106|gb|EGD75676.1| hypothetical protein PTSG_07795 [Salpingoeca sp. ATCC 50818]
Length = 394
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 37/68 (54%)
Query: 70 YREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPHYDDIEAFIL 129
Y +GC+I Y +PL L +Q+ G AN Y TP +QGF H D+ + I+
Sbjct: 280 YLDGCTIVCNIVPAYWQPLAGLMHGVQQETGLAYMANMYLTPRASQGFVEHTDNKDGLIV 339
Query: 130 QLEGKKKW 137
Q +G+K+W
Sbjct: 340 QTDGRKEW 347
>gi|429085228|ref|ZP_19148206.1| FIG00553994: hypothetical protein [Cronobacter condimenti 1330]
gi|426545560|emb|CCJ74247.1| FIG00553994: hypothetical protein [Cronobacter condimenti 1330]
Length = 373
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|15677626|ref|NP_274785.1| hypothetical protein NMB1786 [Neisseria meningitidis MC58]
gi|385853846|ref|YP_005900360.1| cupin family protein [Neisseria meningitidis H44/76]
gi|416198383|ref|ZP_11618931.1| cupin family protein [Neisseria meningitidis CU385]
gi|427827696|ref|ZP_18994720.1| cupin family protein [Neisseria meningitidis H44/76]
gi|433488954|ref|ZP_20446104.1| cupin-like domain protein [Neisseria meningitidis M13255]
gi|433505610|ref|ZP_20462543.1| cupin-like domain protein [Neisseria meningitidis 9506]
gi|433507776|ref|ZP_20464674.1| cupin-like domain protein [Neisseria meningitidis 9757]
gi|7227040|gb|AAF42125.1| hypothetical protein NMB1786 [Neisseria meningitidis MC58]
gi|316984382|gb|EFV63355.1| cupin family protein [Neisseria meningitidis H44/76]
gi|325139670|gb|EGC62208.1| cupin family protein [Neisseria meningitidis CU385]
gi|325200850|gb|ADY96305.1| cupin family protein [Neisseria meningitidis H44/76]
gi|389606457|emb|CCA45370.1| nucleolar protein 66 [Neisseria meningitidis alpha522]
gi|432221283|gb|ELK77094.1| cupin-like domain protein [Neisseria meningitidis M13255]
gi|432239330|gb|ELK94884.1| cupin-like domain protein [Neisseria meningitidis 9757]
gi|432239497|gb|ELK95049.1| cupin-like domain protein [Neisseria meningitidis 9506]
Length = 384
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDPRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|397163773|ref|ZP_10487231.1| cupin superfamily protein [Enterobacter radicincitans DSM 16656]
gi|396094328|gb|EJI91880.1| cupin superfamily protein [Enterobacter radicincitans DSM 16656]
Length = 373
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
+Y P G PH D + FI+Q G ++W+V + ++ P P + PI
Sbjct: 114 SYSVP--GGGVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPI 168
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ LEPGD+LY+P G+ H T+E + + +I
Sbjct: 169 IDEDLEPGDILYIPPGFPHDG--FTHEAAFNYSIG 201
>gi|429089661|ref|ZP_19152393.1| FIG00553994: hypothetical protein [Cronobacter universalis NCTC
9529]
gi|426509464|emb|CCK17505.1| FIG00553994: hypothetical protein [Cronobacter universalis NCTC
9529]
Length = 373
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|323451275|gb|EGB07153.1| hypothetical protein AURANDRAFT_71864 [Aureococcus anophagefferens]
Length = 494
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 119 PHYDDIEAFILQLEGKKKWKVYLP-----RMVDEYLPRYSSPN-FSQEEIGTPILTVTLE 172
PH D E LQ G+++W+ + P + L P E +G P L V +E
Sbjct: 138 PHTDFQEVVCLQCTGRQRWRAWRPPKPLGDALGAALALGKGPEPLRPEALGAPALDVVVE 197
Query: 173 PGDLLYLPRGYIHQASTV 190
PGD++Y+P G+ H T+
Sbjct: 198 PGDVVYVPAGWPHATDTL 215
>gi|421541069|ref|ZP_15987199.1| cupin family protein [Neisseria meningitidis 93004]
gi|421541878|ref|ZP_15987991.1| cupin family protein [Neisseria meningitidis NM255]
gi|421568263|ref|ZP_16013989.1| cupin family protein [Neisseria meningitidis NM3001]
gi|402315862|gb|EJU51416.1| cupin family protein [Neisseria meningitidis 93004]
gi|402318901|gb|EJU54416.1| cupin family protein [Neisseria meningitidis NM255]
gi|402341991|gb|EJU77162.1| cupin family protein [Neisseria meningitidis NM3001]
Length = 384
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDPRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|421545082|ref|ZP_15991147.1| cupin family protein [Neisseria meningitidis NM140]
gi|421547132|ref|ZP_15993170.1| cupin family protein [Neisseria meningitidis NM183]
gi|421548101|ref|ZP_15994129.1| cupin family protein [Neisseria meningitidis NM2781]
gi|421553339|ref|ZP_15999302.1| cupin family protein [Neisseria meningitidis NM576]
gi|402321495|gb|EJU56968.1| cupin family protein [Neisseria meningitidis NM183]
gi|402321765|gb|EJU57237.1| cupin family protein [Neisseria meningitidis NM140]
gi|402326823|gb|EJU62221.1| cupin family protein [Neisseria meningitidis NM2781]
gi|402328247|gb|EJU63624.1| cupin family protein [Neisseria meningitidis NM576]
Length = 384
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDPRDVYAIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|421541860|ref|ZP_15987973.1| hypothetical protein NMEN255_0488 [Neisseria meningitidis NM255]
gi|402318883|gb|EJU54398.1| hypothetical protein NMEN255_0488 [Neisseria meningitidis NM255]
Length = 387
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 59/133 (44%), Gaps = 13/133 (9%)
Query: 94 SLQELFGCFVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYL 149
S+ + F A T + A G +P H+D + F +QL GKK W V P +
Sbjct: 107 SIAKQIAQFAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPNFD---M 163
Query: 150 PRYSSPNFSQEEIGTPI---LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
P Y I P + V LE GD+LY+PRG+ H + E + H+ I +
Sbjct: 164 PLYMQQAKDMPHIIPPTTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPN 222
Query: 207 A--WIDLLEKAMP 217
+++ L K +P
Sbjct: 223 GYNYMEWLMKKIP 235
>gi|336316193|ref|ZP_08571093.1| hypothetical protein Rhein_2495 [Rheinheimera sp. A13L]
gi|335879315|gb|EGM77214.1| hypothetical protein Rhein_2495 [Rheinheimera sp. A13L]
Length = 384
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 96/221 (43%), Gaps = 32/221 (14%)
Query: 1 DHWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFK--------KNIDITLYEDGQR 52
D+W+K+PL I + + + I+ ++ + + IE + +++ +ED
Sbjct: 17 DYWQKKPLLIKAGFREFIDP-ISPDELAGLAGEEEIESRIVSNKDAAWDMETGPFEDFSH 75
Query: 53 QTMDVEGRALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPP 112
+ EG L DH+ S + P +I N + +L F A
Sbjct: 76 --LGEEGWTLLVQAVDHWHPQ-SAALIEPFRFIP-----NWRIDDLMVSFSTAGG----- 122
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLP-RMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH D + FI+Q +GK+ W+V +P + ++ P P Q + V
Sbjct: 123 ---GVGPHLDQYDVFIIQGQGKRHWRVGMPDSSLKQHCPH---PRLLQVTPFVDCINVIT 176
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLL 212
EPGD+LY+P G H ++ + SL+ ++ ++ A DLL
Sbjct: 177 EPGDILYIPPGCPHDG--ISLDPSLNYSVG-FRAPAQKDLL 214
>gi|433467078|ref|ZP_20424535.1| cupin-like domain protein [Neisseria meningitidis 87255]
gi|432203654|gb|ELK59705.1| cupin-like domain protein [Neisseria meningitidis 87255]
Length = 382
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 58/130 (44%), Gaps = 23/130 (17%)
Query: 102 FVGANTYQTPPTAQGFAP----HYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNF 157
F A T + A G +P H+D + F +QL GKK W V P + P +
Sbjct: 115 FAQAQTVVSGYLAFGSSPSYRNHWDTRDVFAVQLIGKKHWTVSAPN--------FDMPLY 166
Query: 158 SQEEIGTPILT--------VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA-- 207
Q+ P +T V LE GD+LY+PRG+ H + E + H+ I +
Sbjct: 167 MQQAKDMPHITPSKTVDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYN 225
Query: 208 WIDLLEKAMP 217
+++ L K +P
Sbjct: 226 YMEWLMKKIP 235
>gi|262045121|ref|ZP_06018150.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
rhinoscleromatis ATCC 13884]
gi|259037513|gb|EEW38755.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
rhinoscleromatis ATCC 13884]
Length = 373
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|268681420|ref|ZP_06148282.1| conserved hypothetical protein [Neisseria gonorrhoeae PID332]
gi|268621704|gb|EEZ54104.1| conserved hypothetical protein [Neisseria gonorrhoeae PID332]
Length = 384
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDARDMYTIQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|421913072|ref|ZP_16342771.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST258-K26BO]
gi|410113050|emb|CCM85396.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST258-K26BO]
Length = 352
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 100 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 156
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 157 ILYIPPGFPHDG--ITHETALNYSVG 180
>gi|365144966|ref|ZP_09348937.1| hypothetical protein HMPREF1024_04968 [Klebsiella sp. 4_1_44FAA]
gi|363647464|gb|EHL86681.1| hypothetical protein HMPREF1024_04968 [Klebsiella sp. 4_1_44FAA]
Length = 373
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|429103335|ref|ZP_19165309.1| FIG00553994: hypothetical protein [Cronobacter turicensis 564]
gi|426289984|emb|CCJ91422.1| FIG00553994: hypothetical protein [Cronobacter turicensis 564]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|260424469|ref|YP_003212670.1| Uncharacterized protein ycfD [Cronobacter turicensis z3032]
gi|260219277|emb|CBA34631.1| Uncharacterized protein ycfD [Cronobacter turicensis z3032]
Length = 384
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 132 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 188
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 189 ILYIPPGFPHDG--FTHETALNYSVG 212
>gi|152973214|ref|YP_001338360.1| hypothetical protein KPN_04743 [Klebsiella pneumoniae subsp.
pneumoniae MGH 78578]
gi|330002904|ref|ZP_08304454.1| cupin family protein [Klebsiella sp. MS 92-3]
gi|378976749|ref|YP_005224890.1| hypothetical protein KPHS_05900 [Klebsiella pneumoniae subsp.
pneumoniae HS11286]
gi|386032946|ref|YP_005952859.1| hypothetical protein KPN2242_01850 [Klebsiella pneumoniae KCTC
2242]
gi|419976365|ref|ZP_14491764.1| hypothetical protein KPNIH1_23509 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH1]
gi|419981995|ref|ZP_14497263.1| hypothetical protein KPNIH2_22949 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH2]
gi|419988292|ref|ZP_14503389.1| hypothetical protein KPNIH4_25477 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH4]
gi|419994116|ref|ZP_14509037.1| hypothetical protein KPNIH5_25666 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH5]
gi|419999323|ref|ZP_14514099.1| hypothetical protein KPNIH6_22751 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH6]
gi|420005743|ref|ZP_14520355.1| hypothetical protein KPNIH7_26067 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH7]
gi|420011425|ref|ZP_14525875.1| hypothetical protein KPNIH8_25481 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH8]
gi|420017040|ref|ZP_14531324.1| hypothetical protein KPNIH9_24516 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH9]
gi|420022977|ref|ZP_14537129.1| hypothetical protein KPNIH10_25849 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH10]
gi|420028688|ref|ZP_14542656.1| hypothetical protein KPNIH11_25289 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH11]
gi|420040236|ref|ZP_14553848.1| hypothetical protein KPNIH14_26001 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH14]
gi|420045955|ref|ZP_14559409.1| hypothetical protein KPNIH16_25704 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH16]
gi|420051785|ref|ZP_14565059.1| hypothetical protein KPNIH17_26058 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH17]
gi|420057416|ref|ZP_14570552.1| hypothetical protein KPNIH18_25796 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH18]
gi|420062991|ref|ZP_14575943.1| hypothetical protein KPNIH19_25601 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH19]
gi|420068714|ref|ZP_14581486.1| hypothetical protein KPNIH20_25177 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH20]
gi|420073754|ref|ZP_14586376.1| hypothetical protein KPNIH21_21462 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH21]
gi|420079549|ref|ZP_14591993.1| hypothetical protein KPNIH22_21338 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH22]
gi|420085553|ref|ZP_14597773.1| hypothetical protein KPNIH23_22666 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH23]
gi|421917563|ref|ZP_16347115.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST258-K28BO]
gi|424828746|ref|ZP_18253474.1| cupin family protein [Klebsiella pneumoniae subsp. pneumoniae Ecl8]
gi|424935278|ref|ZP_18353650.1| Hypothetical protein B819_26218 [Klebsiella pneumoniae subsp.
pneumoniae KpQ3]
gi|425079666|ref|ZP_18482763.1| hypothetical protein HMPREF1306_00379 [Klebsiella pneumoniae subsp.
pneumoniae WGLW2]
gi|428151251|ref|ZP_18998991.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST512-K30BO]
gi|428935656|ref|ZP_19009118.1| hypothetical protein MTE1_22506 [Klebsiella pneumoniae JHCK1]
gi|428942224|ref|ZP_19015231.1| hypothetical protein MTE2_21456 [Klebsiella pneumoniae VA360]
gi|150958063|gb|ABR80093.1| hypothetical protein KPN_04743 [Klebsiella pneumoniae subsp.
pneumoniae MGH 78578]
gi|328537141|gb|EGF63413.1| cupin family protein [Klebsiella sp. MS 92-3]
gi|339760074|gb|AEJ96294.1| hypothetical protein KPN2242_01850 [Klebsiella pneumoniae KCTC
2242]
gi|364516160|gb|AEW59288.1| hypothetical protein KPHS_05900 [Klebsiella pneumoniae subsp.
pneumoniae HS11286]
gi|397340844|gb|EJJ34037.1| hypothetical protein KPNIH1_23509 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH1]
gi|397341148|gb|EJJ34333.1| hypothetical protein KPNIH4_25477 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH4]
gi|397341569|gb|EJJ34745.1| hypothetical protein KPNIH2_22949 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH2]
gi|397356985|gb|EJJ49770.1| hypothetical protein KPNIH5_25666 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH5]
gi|397358285|gb|EJJ51008.1| hypothetical protein KPNIH6_22751 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH6]
gi|397360460|gb|EJJ53138.1| hypothetical protein KPNIH7_26067 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH7]
gi|397374189|gb|EJJ66542.1| hypothetical protein KPNIH9_24516 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH9]
gi|397375655|gb|EJJ67938.1| hypothetical protein KPNIH8_25481 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH8]
gi|397382054|gb|EJJ74218.1| hypothetical protein KPNIH10_25849 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH10]
gi|397391272|gb|EJJ83137.1| hypothetical protein KPNIH11_25289 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH11]
gi|397399719|gb|EJJ91370.1| hypothetical protein KPNIH14_26001 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH14]
gi|397408451|gb|EJJ99814.1| hypothetical protein KPNIH17_26058 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH17]
gi|397408650|gb|EJK00005.1| hypothetical protein KPNIH16_25704 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH16]
gi|397419496|gb|EJK10644.1| hypothetical protein KPNIH18_25796 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH18]
gi|397426008|gb|EJK16859.1| hypothetical protein KPNIH20_25177 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH20]
gi|397426460|gb|EJK17278.1| hypothetical protein KPNIH19_25601 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH19]
gi|397437640|gb|EJK28197.1| hypothetical protein KPNIH21_21462 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH21]
gi|397443636|gb|EJK33947.1| hypothetical protein KPNIH22_21338 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH22]
gi|397449067|gb|EJK39217.1| hypothetical protein KPNIH23_22666 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH23]
gi|405608178|gb|EKB81130.1| hypothetical protein HMPREF1306_00379 [Klebsiella pneumoniae subsp.
pneumoniae WGLW2]
gi|407809465|gb|EKF80716.1| Hypothetical protein B819_26218 [Klebsiella pneumoniae subsp.
pneumoniae KpQ3]
gi|410120109|emb|CCM89740.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST258-K28BO]
gi|414706160|emb|CCN27864.1| cupin family protein [Klebsiella pneumoniae subsp. pneumoniae Ecl8]
gi|426299055|gb|EKV61416.1| hypothetical protein MTE2_21456 [Klebsiella pneumoniae VA360]
gi|426300027|gb|EKV62331.1| hypothetical protein MTE1_22506 [Klebsiella pneumoniae JHCK1]
gi|427538802|emb|CCM95129.1| FIG00732091: hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae ST512-K30BO]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|420034244|ref|ZP_14548034.1| hypothetical protein KPNIH12_24353 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH12]
gi|397392915|gb|EJJ84689.1| hypothetical protein KPNIH12_24353 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH12]
Length = 329
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 77 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 133
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 134 ILYIPPGFPHDG--ITHETALNYSVG 157
>gi|238892875|ref|YP_002917609.1| hypothetical protein KP1_0694 [Klebsiella pneumoniae subsp.
pneumoniae NTUH-K2044]
gi|402782628|ref|YP_006638174.1| hypothetical protein A79E_4413 [Klebsiella pneumoniae subsp.
pneumoniae 1084]
gi|238545191|dbj|BAH61542.1| hypothetical protein KP1_0694 [Klebsiella pneumoniae subsp.
pneumoniae NTUH-K2044]
gi|402543478|gb|AFQ67627.1| hypothetical protein A79E_4413 [Klebsiella pneumoniae subsp.
pneumoniae 1084]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|449060447|ref|ZP_21738104.1| hypothetical protein G057_21039 [Klebsiella pneumoniae hvKP1]
gi|448873837|gb|EMB08907.1| hypothetical protein G057_21039 [Klebsiella pneumoniae hvKP1]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|425094501|ref|ZP_18497584.1| hypothetical protein HMPREF1308_04820 [Klebsiella pneumoniae subsp.
pneumoniae WGLW5]
gi|405609832|gb|EKB82673.1| hypothetical protein HMPREF1308_04820 [Klebsiella pneumoniae subsp.
pneumoniae WGLW5]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|425078578|ref|ZP_18481681.1| hypothetical protein HMPREF1305_04517 [Klebsiella pneumoniae subsp.
pneumoniae WGLW1]
gi|425089210|ref|ZP_18492303.1| hypothetical protein HMPREF1307_04688 [Klebsiella pneumoniae subsp.
pneumoniae WGLW3]
gi|405589793|gb|EKB63346.1| hypothetical protein HMPREF1305_04517 [Klebsiella pneumoniae subsp.
pneumoniae WGLW1]
gi|405599651|gb|EKB72826.1| hypothetical protein HMPREF1307_04688 [Klebsiella pneumoniae subsp.
pneumoniae WGLW3]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L+PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLQPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H +T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--ITHETALNYSVG 201
>gi|332528291|ref|ZP_08404296.1| Cupin, JmjC-type [Hylemonella gracilis ATCC 19624]
gi|332042311|gb|EGI78632.1| Cupin, JmjC-type [Hylemonella gracilis ATCC 19624]
Length = 395
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 8/80 (10%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ GK+ WK+ + + +P NF E+ LEP
Sbjct: 136 GVGPHFDSYDVFLLQAHGKRHWKIGSQQDLKLKAGVPLKILSNFHPEQ------EFVLEP 189
Query: 174 GDLLYLPRGYIHQASTVTNE 193
GD+LYLP Y H V E
Sbjct: 190 GDMLYLPPQYAHDGVAVEGE 209
>gi|422293369|gb|EKU20669.1| cupin 4 family protein, partial [Nannochloropsis gaditana CCMP526]
gi|422293749|gb|EKU21049.1| cupin 4 family protein, partial [Nannochloropsis gaditana CCMP526]
Length = 120
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 51/107 (47%), Gaps = 19/107 (17%)
Query: 132 EGKKKWKVY-----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLPRGYIHQ 186
EG K+W++Y PR ++ P S +IG P + LEPGDLLYLP G IH+
Sbjct: 1 EGSKRWRLYDALVEQPRPDMQFKP-------SLHDIGEPYIDFVLEPGDLLYLPSGTIHE 53
Query: 187 ASTVTN----EHSLHVTISVYQKT--AWIDLLEKAMPKALQAAGATD 227
A + SLH+T+ + +W LL + + L A D
Sbjct: 54 ALNENDGGLENSSLHLTVGIETTVLGSWESLLLEVL-TVLSGGSAED 99
>gi|73540810|ref|YP_295330.1| cupin region [Ralstonia eutropha JMP134]
gi|72118223|gb|AAZ60486.1| Cupin region [Ralstonia eutropha JMP134]
Length = 402
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 8/72 (11%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ+ G+++W++ +D +P +FS EE LEP
Sbjct: 144 GVGPHFDSYDVFLLQVSGRRRWRISSQTKLDLVPDMPLKILADFSAEE------EWVLEP 197
Query: 174 GDLLYLPRGYIH 185
GD+LYLP Y H
Sbjct: 198 GDMLYLPPQYAH 209
>gi|294636786|ref|ZP_06715124.1| cupin family protein [Edwardsiella tarda ATCC 23685]
gi|451965002|ref|ZP_21918263.1| hypothetical protein ET1_06_00600 [Edwardsiella tarda NBRC 105688]
gi|291090001|gb|EFE22562.1| cupin family protein [Edwardsiella tarda ATCC 23685]
gi|451316120|dbj|GAC63625.1| hypothetical protein ET1_06_00600 [Edwardsiella tarda NBRC 105688]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 76/192 (39%), Gaps = 14/192 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K PL I KN+ + L D L +E + + + +E+G Q
Sbjct: 16 YWQKRPLVIRHGFKNFIDPLSP-----DELAGLAMEQEVDSRLVCHEEGAWQVSHG---- 66
Query: 62 LPQIVWDHYRE-GCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E S+ + P L + L V G PH
Sbjct: 67 -PFTSYDHLGETNWSLLVQAVDHWHSPAAALMRPFRHLPDWRVDDLMISFSVPGGGVGPH 125
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D + FI+Q G+++W+V ++ P P+ Q + I+ V L GD+LY+P
Sbjct: 126 IDQYDVFIIQGSGRRRWRVGEKGQRRQHCPH---PDLLQVDPFEAIIDVELSAGDILYIP 182
Query: 181 RGYIHQASTVTN 192
+ H+ + N
Sbjct: 183 PHFPHEGYALEN 194
>gi|293397680|ref|ZP_06641886.1| hypothetical protein NGNG_00666 [Neisseria gonorrhoeae F62]
gi|291611626|gb|EFF40695.1| hypothetical protein NGNG_00666 [Neisseria gonorrhoeae F62]
Length = 384
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|423118424|ref|ZP_17106108.1| hypothetical protein HMPREF9690_00430 [Klebsiella oxytoca 10-5246]
gi|376401866|gb|EHT14471.1| hypothetical protein HMPREF9690_00430 [Klebsiella oxytoca 10-5246]
Length = 373
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFPPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H VT+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--VTHETALNYSVG 201
>gi|240015877|ref|ZP_04722417.1| hypothetical protein NgonFA_01712 [Neisseria gonorrhoeae FA6140]
Length = 384
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|440288846|ref|YP_007341611.1| hypothetical protein D782_3497 [Enterobacteriaceae bacterium strain
FGI 57]
gi|440048368|gb|AGB79426.1| hypothetical protein D782_3497 [Enterobacteriaceae bacterium strain
FGI 57]
Length = 373
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|194097751|ref|YP_002000792.1| hypothetical protein NGK_0167 [Neisseria gonorrhoeae NCCP11945]
gi|193933041|gb|ACF28865.1| Conserved hypothetical protein [Neisseria gonorrhoeae NCCP11945]
Length = 384
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|240013436|ref|ZP_04720349.1| hypothetical protein NgonD_02099 [Neisseria gonorrhoeae DGI18]
gi|240120507|ref|ZP_04733469.1| hypothetical protein NgonPI_01784 [Neisseria gonorrhoeae PID24-1]
Length = 384
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|290475833|ref|YP_003468725.1| RmlC-like domain-containing protein [Xenorhabdus bovienii SS-2004]
gi|289175158|emb|CBJ81961.1| putative enzyme with RmlC-like domain [Xenorhabdus bovienii
SS-2004]
Length = 373
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLE 172
+ G PH D + FI+Q +G+++W+V + ++ P P+ Q + I+ +
Sbjct: 118 SGGGVGPHLDQYDVFIIQGQGRRRWRVGEKIPMKQHCPH---PDLLQVDPFEAIIDEEML 174
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
PGD+LY+P G+ H+ + E SL+ ++ A
Sbjct: 175 PGDILYIPPGFPHEGYAI--EDSLNYSVGFRAPNA 207
>gi|268594108|ref|ZP_06128275.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02]
gi|268596160|ref|ZP_06130327.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19]
gi|268600636|ref|ZP_06134803.1| conserved hypothetical protein [Neisseria gonorrhoeae PID18]
gi|268602949|ref|ZP_06137116.1| conserved hypothetical protein [Neisseria gonorrhoeae PID1]
gi|268683587|ref|ZP_06150449.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-92-679]
gi|291044550|ref|ZP_06570259.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2]
gi|385334994|ref|YP_005888941.1| hypothetical protein NGTW08_0102 [Neisseria gonorrhoeae
TCDC-NG08107]
gi|268547497|gb|EEZ42915.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02]
gi|268549948|gb|EEZ44967.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19]
gi|268584767|gb|EEZ49443.1| conserved hypothetical protein [Neisseria gonorrhoeae PID18]
gi|268587080|gb|EEZ51756.1| conserved hypothetical protein [Neisseria gonorrhoeae PID1]
gi|268623871|gb|EEZ56271.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-92-679]
gi|291011444|gb|EFE03440.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2]
gi|317163537|gb|ADV07078.1| hypothetical protein NGTW08_0102 [Neisseria gonorrhoeae
TCDC-NG08107]
Length = 384
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 19/167 (11%)
Query: 60 RALPQIVWDHYREGCSIRFLNPQTYIKPLQQLNASLQELFGCFVGANT----YQTPPTAQ 115
R + ++++H + G S+ + + + + + F GA+T Y + +
Sbjct: 82 RFIKSVIYEHMKNGASLVYNHINN-----EPFSDHIARQVARFAGAHTIVSGYLAFGSDE 136
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTP---ILTVTLE 172
+ H+D + + +QL GKK+W++ P D +P Y +I P + + LE
Sbjct: 137 SYKNHWDARDMYAVQLFGKKRWQLTAP---DFPMPLYMQQT-KDTDISIPEHIDMDIILE 192
Query: 173 PGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
GD+LY+PRG+ H+ + E + H + + +++ L K P
Sbjct: 193 AGDVLYIPRGWWHRPIPLGCE-TFHFAVGTFPPNGYNYLEWLMKKFP 238
>gi|94310017|ref|YP_583227.1| hypothetical protein Rmet_1072 [Cupriavidus metallidurans CH34]
gi|93353869|gb|ABF07958.1| conserved hypothetical protein [Cupriavidus metallidurans CH34]
Length = 375
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 8/72 (11%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ+ G+++W++ +D LP +FS E+ LEP
Sbjct: 117 GVGPHFDSYDVFLLQVSGRRRWRISSQTKLDLIPDLPLKILSDFSAEQ------EWVLEP 170
Query: 174 GDLLYLPRGYIH 185
GD+LYLP Y H
Sbjct: 171 GDMLYLPPQYAH 182
>gi|330828855|ref|YP_004391807.1| Cupin superfamily protein [Aeromonas veronii B565]
gi|406677935|ref|ZP_11085115.1| hypothetical protein HMPREF1170_03323 [Aeromonas veronii AMC35]
gi|423210459|ref|ZP_17197013.1| hypothetical protein HMPREF1169_02531 [Aeromonas veronii AER397]
gi|328803991|gb|AEB49190.1| Cupin superfamily protein [Aeromonas veronii B565]
gi|404616347|gb|EKB13305.1| hypothetical protein HMPREF1169_02531 [Aeromonas veronii AER397]
gi|404623742|gb|EKB20592.1| hypothetical protein HMPREF1170_03323 [Aeromonas veronii AMC35]
Length = 376
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 50/95 (52%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI Q GK+ W+V + ++E+ + E I
Sbjct: 113 SFSTPHA--GVGPHIDNYDVFITQGLGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ V +EPGD+LY+P G+ H+ + E S++ ++
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSMNFSVG 200
>gi|423200591|ref|ZP_17187171.1| hypothetical protein HMPREF1167_00754 [Aeromonas veronii AER39]
gi|404619999|gb|EKB16903.1| hypothetical protein HMPREF1167_00754 [Aeromonas veronii AER39]
Length = 376
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 50/95 (52%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI Q GK+ W+V + ++E+ + E I
Sbjct: 113 SFSTPHA--GVGPHIDNYDVFITQGLGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAI 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ V +EPGD+LY+P G+ H+ + E S++ ++
Sbjct: 168 IDVIMEPGDILYIPPGFPHEGYAI--EPSMNFSVG 200
>gi|227114632|ref|ZP_03828288.1| hypothetical protein PcarbP_16803 [Pectobacterium carotovorum
subsp. brasiliensis PBR1692]
Length = 373
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ S P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDENLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|430809300|ref|ZP_19436415.1| hypothetical protein D769_23578 [Cupriavidus sp. HMR-1]
gi|429498260|gb|EKZ96772.1| hypothetical protein D769_23578 [Cupriavidus sp. HMR-1]
Length = 402
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 8/72 (11%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ+ G+++W++ +D LP +FS E+ LEP
Sbjct: 144 GVGPHFDSYDVFLLQVSGRRRWRISSQTKLDLIPDLPLKILSDFSAEQ------EWVLEP 197
Query: 174 GDLLYLPRGYIH 185
GD+LYLP Y H
Sbjct: 198 GDMLYLPPQYAH 209
>gi|339999064|ref|YP_004729947.1| hypothetical protein SBG_1065 [Salmonella bongori NCTC 12419]
gi|339512425|emb|CCC30161.1| conserved hypothetical protein [Salmonella bongori NCTC 12419]
Length = 373
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P+ Q + + LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGTGRRRWRVGEKLPMRQHCPH---PDLLQVDPFEAFIDEELEPGD 177
Query: 176 LLYLPRGYIHQASTVTN 192
+LY+P G+ H+ + N
Sbjct: 178 ILYIPPGFPHEGYALEN 194
>gi|423205804|ref|ZP_17192360.1| hypothetical protein HMPREF1168_01995 [Aeromonas veronii AMC34]
gi|404623195|gb|EKB20047.1| hypothetical protein HMPREF1168_01995 [Aeromonas veronii AMC34]
Length = 376
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 88/201 (43%), Gaps = 16/201 (7%)
Query: 2 HWEKEPLYISRKQKNYYNNLITSKKIDDILRDNVIEFKKNIDITLYEDGQRQTMDVEGRA 61
+W+K PL I K++ + I+ ++ + + V+E + +T + DG+ E
Sbjct: 15 YWQKRPLLIKGGFKDF-QDPISPDELAGLAMEEVVESRL---VTRF-DGKW-----EAAH 64
Query: 62 LPQIVWDHYREGCSIRFLNPQTYIKP-LQQLNASLQELFGCFVGANTYQTPPTAQGFAPH 120
P +DH E + + P + +L Q + G G PH
Sbjct: 65 GPFESYDHLGEENWTVLVQACNHWAPEVNELAVPFQFIPGWRFDDVMVSFSTPHGGVGPH 124
Query: 121 YDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGDLLYLP 180
D+ + FI Q GK+ W+V + ++E+ + E I+ V +EPGD+LY+P
Sbjct: 125 IDNYDVFITQGLGKRHWRVGDAKPLNEFAAHAA---LLHCEPFEAIIDVIMEPGDILYIP 181
Query: 181 RGYIHQASTVTNEHSLHVTIS 201
G+ H+ + E S++ ++
Sbjct: 182 PGFPHEGYAI--EPSMNFSVG 200
>gi|393759367|ref|ZP_10348183.1| hypothetical protein QWA_09601 [Alcaligenes faecalis subsp.
faecalis NCIB 8687]
gi|393162595|gb|EJC62653.1| hypothetical protein QWA_09601 [Alcaligenes faecalis subsp.
faecalis NCIB 8687]
Length = 396
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 9/112 (8%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ GK+ W++ + + LP NF EE TLEP
Sbjct: 127 GVGPHFDSYDVFLLQAHGKRHWRISQQKDLSLVPDLPLKILRNFQVEE------EFTLEP 180
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTI-SVYQKTAWIDLLEKAMPKALQAAG 224
GD+LYLP H + + ++ + S Q T L++ A + A+G
Sbjct: 181 GDMLYLPPHVAHDGIAIGDCMTISIGFRSPTQATLARGLMDAASDQLAAASG 232
>gi|374370915|ref|ZP_09628904.1| cupin region [Cupriavidus basilensis OR16]
gi|373097472|gb|EHP38604.1| cupin region [Cupriavidus basilensis OR16]
Length = 402
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 8/72 (11%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ+ G+++W++ +D LP +F+ EE LEP
Sbjct: 144 GVGPHFDSYDVFLLQVSGRRRWRISSQTRLDLVPDLPLKILSDFTAEE------EWVLEP 197
Query: 174 GDLLYLPRGYIH 185
GD+LYLP Y H
Sbjct: 198 GDMLYLPPQYAH 209
>gi|410641503|ref|ZP_11352023.1| cupin 4 protein [Glaciecola chathamensis S18K6]
gi|410139036|dbj|GAC10210.1| cupin 4 protein [Glaciecola chathamensis S18K6]
Length = 389
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +G ++W+ EY Y P+ SQ PI+ L+PGD
Sbjct: 121 GVGPHLDQYDVFIIQGKGSRRWQA---GDKGEYPTFYPHPDLSQITQFEPIVDEVLQPGD 177
Query: 176 LLYLPRGYIHQA 187
++Y+P G H
Sbjct: 178 MIYIPAGCPHNG 189
>gi|395009353|ref|ZP_10392897.1| hypothetical protein PMI14_05695 [Acidovorax sp. CF316]
gi|394312610|gb|EJE49743.1| hypothetical protein PMI14_05695 [Acidovorax sp. CF316]
Length = 379
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ +G+++W++ + + E +P F EE LEP
Sbjct: 129 GVGPHFDSYDVFLLQAQGRRRWRIGRQKDLALKEGIPLKVLAEFEPEE------EFVLEP 182
Query: 174 GDLLYLPRGYIH 185
GD+LYLP Y H
Sbjct: 183 GDMLYLPPRYAH 194
>gi|241764335|ref|ZP_04762363.1| Cupin 4 family protein [Acidovorax delafieldii 2AN]
gi|241366291|gb|EER60834.1| Cupin 4 family protein [Acidovorax delafieldii 2AN]
Length = 379
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 12/74 (16%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL----PRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F+LQ G+++W++ R D+ L P +F E+ L
Sbjct: 129 GVGPHFDSYDVFLLQAHGRRRWRIG--RQKDKRLRGDIPLKVLAHFEPEQ------EFVL 180
Query: 172 EPGDLLYLPRGYIH 185
EPGD+LYLP GY H
Sbjct: 181 EPGDMLYLPPGYAH 194
>gi|300723606|ref|YP_003712911.1| hypothetical protein XNC1_2711 [Xenorhabdus nematophila ATCC 19061]
gi|297630128|emb|CBJ90765.1| putative enzyme with RmlC-like domain [Xenorhabdus nematophila ATCC
19061]
Length = 373
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +G+++W+V + ++ P P+ Q + I+ + PGD
Sbjct: 121 GVGPHLDQYDVFIIQGKGRRRWRVGEKIPMKQHCPH---PDLLQVDPFEAIIDEEMLPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISVYQKTA 207
+LY+P G+ H+ ++ E SL+ ++ A
Sbjct: 178 ILYIPPGFPHEGYSI--EDSLNYSVGFRAPNA 207
>gi|332160836|ref|YP_004297413.1| hypothetical protein YE105_C1214 [Yersinia enterocolitica subsp.
palearctica 105.5R(r)]
gi|325665066|gb|ADZ41710.1| hypothetical protein YE105_C1214 [Yersinia enterocolitica subsp.
palearctica 105.5R(r)]
gi|330863362|emb|CBX73484.1| uncharacterized protein ycfD [Yersinia enterocolitica W22703]
Length = 373
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P E PI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFAPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|386309388|ref|YP_006005444.1| hypothetical protein [Yersinia enterocolitica subsp. palearctica
Y11]
gi|418240617|ref|ZP_12867155.1| hypothetical protein IOK_04211 [Yersinia enterocolitica subsp.
palearctica PhRBD_Ye1]
gi|433550467|ref|ZP_20506511.1| Putative cytoplasmic protein [Yersinia enterocolitica IP 10393]
gi|318604738|emb|CBY26236.1| hypothetical protein Y11_19451 [Yersinia enterocolitica subsp.
palearctica Y11]
gi|351780030|gb|EHB22119.1| hypothetical protein IOK_04211 [Yersinia enterocolitica subsp.
palearctica PhRBD_Ye1]
gi|431789602|emb|CCO69551.1| Putative cytoplasmic protein [Yersinia enterocolitica IP 10393]
Length = 373
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ P P E PI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPLKQFCPH---PALLHVEPFAPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQAST 189
+LY+P G+ H T
Sbjct: 178 ILYIPPGFPHDGFT 191
>gi|410615581|ref|ZP_11326600.1| RmlC-like domain-containing protein [Glaciecola psychrophila 170]
gi|410164994|dbj|GAC40489.1| RmlC-like domain-containing protein [Glaciecola psychrophila 170]
Length = 382
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 61/136 (44%), Gaps = 8/136 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q +G ++W+V LP D + SQ PI+ LEPGD
Sbjct: 121 GVGPHLDQYDVFIIQGKGSRRWQVGLPGDFDTV---NLHKDLSQISGFEPIIDQILEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTISV---YQKTAWIDLLEKAMPKALQAAGATDLEFRR 232
++Y+P + H V E L+ ++ Q+ L + A+ L TD E
Sbjct: 178 IIYIPANHPHNG--VALEECLNYSVGFRAPSQQEMLSSLADFAIDNNLFTQRYTDEEIAH 235
Query: 233 GLPIGYLRYAGLARGK 248
G +++ ++R K
Sbjct: 236 REFSGEIKHREVSRFK 251
>gi|149376376|ref|ZP_01894139.1| Transcription factor jumonji, jmjC [Marinobacter algicola DG893]
gi|149359390|gb|EDM47851.1| Transcription factor jumonji, jmjC [Marinobacter algicola DG893]
Length = 383
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 43/104 (41%), Gaps = 22/104 (21%)
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL---- 167
P PHYD + F+LQ EG ++W D PR GTP+
Sbjct: 121 PKGGSVGPHYDQYDVFLLQAEGHRRWT--FGGHCDHTSPRVD---------GTPLRILSS 169
Query: 168 -----TVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKT 206
TVTL PGD+LYLP G H V + + ++I T
Sbjct: 170 WEGEETVTLAPGDMLYLPPGVGHHG--VAEDDCITLSIGFRAPT 211
>gi|444916193|ref|ZP_21236313.1| hypothetical protein D187_08781 [Cystobacter fuscus DSM 2262]
gi|444712515|gb|ELW53437.1| hypothetical protein D187_08781 [Cystobacter fuscus DSM 2262]
Length = 455
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 53/130 (40%), Gaps = 38/130 (29%)
Query: 99 FGCFVGANTYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPR--------------- 143
F VG N Y + + GF HYD A LQ+EG+K W Y PR
Sbjct: 144 FSGRVGFNCYLSA-SGGGFYTHYDARTAMSLQIEGEKTWS-YAPRPALAFPQRNASYALN 201
Query: 144 --------MVDEYLPRYSSPNFSQEEIGTPI-----LTVTLEPGDLLYLPRGYIHQASTV 190
+VD ++P E PI L+ TL PGDLL LP G H A
Sbjct: 202 GGAVQASKIVDAFVP------IEPWETVIPIPENEFLSETLRPGDLLSLPAGTWHAAK-- 253
Query: 191 TNEHSLHVTI 200
EHSL + +
Sbjct: 254 AGEHSLALNL 263
>gi|254674227|emb|CBA10011.1| conserved hypothetical protein [Neisseria meningitidis alpha275]
Length = 387
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 19/115 (16%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT---- 168
++ + H+D + F +QL GKK W + P + P + Q+ P +T
Sbjct: 130 SSSSYRNHWDTRDVFAVQLIGKKHWTISAPN--------FDMPLYMQQAKDMPHITPSKT 181
Query: 169 ----VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
V LE GD+LY+PRG+ H + E + H+ I + +++ L K +P
Sbjct: 182 VDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYNYMEWLMKKIP 235
>gi|254673854|emb|CBA09627.1| myc induced nuclear antigen [Neisseria meningitidis alpha275]
Length = 382
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 19/115 (16%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT---- 168
++ + H+D + F +QL GKK W + P + P + Q+ P +T
Sbjct: 130 SSSSYRNHWDTRDVFAVQLIGKKHWTISAPN--------FDMPLYMQQAKDMPHITPSKT 181
Query: 169 ----VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
V LE GD+LY+PRG+ H + E + H+ I + +++ L K +P
Sbjct: 182 VDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYNYMEWLMKKIP 235
>gi|385339905|ref|YP_005893777.1| cupin family protein [Neisseria meningitidis G2136]
gi|325198149|gb|ADY93605.1| cupin family protein [Neisseria meningitidis G2136]
Length = 387
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 19/115 (16%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT---- 168
++ + H+D + F +QL GKK W + P + P + Q+ P +T
Sbjct: 130 SSSSYRNHWDTRDVFAVQLIGKKHWTISAPN--------FDMPLYMQQAKDMPHITPSKT 181
Query: 169 ----VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
V LE GD+LY+PRG+ H + E + H+ I + +++ L K +P
Sbjct: 182 VDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYNYMEWLMKKIP 235
>gi|421561812|ref|ZP_16007649.1| cupin-like domain protein [Neisseria meningitidis NM2657]
gi|254670307|emb|CBA05658.1| conserved hypothetical protein [Neisseria meningitidis alpha153]
gi|402336197|gb|EJU71458.1| cupin-like domain protein [Neisseria meningitidis NM2657]
Length = 382
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 19/115 (16%)
Query: 113 TAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILT---- 168
++ + H+D + F +QL GKK W + P + P + Q+ P +T
Sbjct: 130 SSSSYRNHWDTRDVFAVQLIGKKHWTISAPN--------FDMPLYMQQAKDMPHITPSKT 181
Query: 169 ----VTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTA--WIDLLEKAMP 217
V LE GD+LY+PRG+ H + E + H+ I + +++ L K +P
Sbjct: 182 VDMEVILEAGDILYIPRGWWHNPMPMNCE-TFHLAIGTFPPNGYNYMEWLMKKIP 235
>gi|429094757|ref|ZP_19157281.1| FIG00553994: hypothetical protein [Cronobacter dublinensis 1210]
gi|426740151|emb|CCJ83394.1| FIG00553994: hypothetical protein [Cronobacter dublinensis 1210]
Length = 373
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P + PI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGNKLPMRQFCPH---PALLHVDPFEPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|421080900|ref|ZP_15541816.1| YcfD [Pectobacterium wasabiae CFBP 3304]
gi|401704462|gb|EJS94669.1| YcfD [Pectobacterium wasabiae CFBP 3304]
Length = 373
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ S P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|227327182|ref|ZP_03831206.1| hypothetical protein PcarcW_07626 [Pectobacterium carotovorum
subsp. carotovorum WPP14]
Length = 373
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ S P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|319792853|ref|YP_004154493.1| transcription factor jumonji jmjc domain-containing protein
[Variovorax paradoxus EPS]
gi|315595316|gb|ADU36382.1| transcription factor jumonji jmjC domain-containing protein
[Variovorax paradoxus EPS]
Length = 380
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 57/118 (48%), Gaps = 15/118 (12%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYL----PRYSSPNFSQEEIGTPILTVTL 171
G H+D + F+LQ +G+++W + + D L P NF EE T L
Sbjct: 129 GVGAHFDSYDVFLLQAQGRRRWSIG--KQGDLRLQPGVPLKILENFEPEE------TFVL 180
Query: 172 EPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKAMPKALQAAGATDLE 229
EPGD+LYLP Y H+ V ++ +T S+ +++ + L + + A + DLE
Sbjct: 181 EPGDMLYLPPKYAHEGVAVGDD---CMTFSIGLRSSALGELGADLLARMAQAYSEDLE 235
>gi|403056864|ref|YP_006645081.1| hypothetical protein PCC21_004250 [Pectobacterium carotovorum
subsp. carotovorum PCC21]
gi|402804190|gb|AFR01828.1| hypothetical protein PCC21_004250 [Pectobacterium carotovorum
subsp. carotovorum PCC21]
Length = 373
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G ++W+V + ++ S P + PI+ LEPGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGSRRWRVGDKLPMRQFC---SHPALLHVDPFPPIIDEDLEPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|409200144|ref|ZP_11228347.1| RmlC-like domain-containing protein [Pseudoalteromonas flavipulchra
JG1]
Length = 395
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 6/87 (6%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYS-SPNFSQEEIGTP 165
++ TP G PH D + FI+Q +GK+ W+V D L +++ + + Q E
Sbjct: 116 SFSTP--GGGVGPHLDQYDVFIIQGQGKRHWRV---GERDNSLKQFARNKSLLQVEQFEA 170
Query: 166 ILTVTLEPGDLLYLPRGYIHQASTVTN 192
++ LEPGD+LY+P H+ V N
Sbjct: 171 VIDAVLEPGDILYIPPACPHEGYAVEN 197
>gi|145298396|ref|YP_001141237.1| hypothetical protein ASA_1388 [Aeromonas salmonicida subsp.
salmonicida A449]
gi|418362390|ref|ZP_12963024.1| hypothetical protein IYQ_18781 [Aeromonas salmonicida subsp.
salmonicida 01-B526]
gi|142851168|gb|ABO89489.1| conserved hypothetical protein [Aeromonas salmonicida subsp.
salmonicida A449]
gi|356686400|gb|EHI51003.1| hypothetical protein IYQ_18781 [Aeromonas salmonicida subsp.
salmonicida 01-B526]
Length = 376
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 107 TYQTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPI 166
++ TP G PH D+ + FI Q +GK+ W+V + ++E+ + + E I
Sbjct: 113 SFSTP--HGGVGPHIDNYDVFITQGQGKRHWRVGDAKPLEEFAAHAALLHCEPFEA---I 167
Query: 167 LTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTIS 201
+ +EPGD+LY+P G+ H+ + E S++ ++
Sbjct: 168 IDAIMEPGDILYIPPGFPHEGYAI--EPSMNFSVG 200
>gi|88798694|ref|ZP_01114277.1| hypothetical protein MED297_15859 [Reinekea blandensis MED297]
gi|88778457|gb|EAR09649.1| hypothetical protein MED297_15859 [Reinekea blandensis MED297]
Length = 381
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 47/89 (52%), Gaps = 14/89 (15%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVD------EYLPRYSSPNFSQEEIGTPILTV 169
G PH+D + F++QL G+++WK+ +M D E +P F +++
Sbjct: 134 GVGPHFDQYDVFLVQLAGQREWKI--GQMCDEDSDLVENIPVKVLSAFEEQD------AW 185
Query: 170 TLEPGDLLYLPRGYIHQASTVTNEHSLHV 198
L+PGD+LYLP G H +++ + +L V
Sbjct: 186 VLDPGDVLYLPPGVAHWGTSLGDSMTLSV 214
>gi|326435845|gb|EGD81415.1| hypothetical protein PTSG_11840 [Salpingoeca sp. ATCC 50818]
Length = 8549
Score = 48.9 bits (115), Expect = 0.003, Method: Composition-based stats.
Identities = 42/174 (24%), Positives = 69/174 (39%), Gaps = 50/174 (28%)
Query: 88 LQQLNASLQELFGCFVGANTY-QTPPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMV- 145
++ L ++++FG + A+ Y T ++ PH D + ++QL+G K+W +P
Sbjct: 4888 VELLRQGVRDVFGTYSTAHVYYATQDESRALLPHTDPYDVLVIQLQGSKRWTTCVPDAYV 4947
Query: 146 ----------------------------DEYLPRYSSPNFSQ-EEI-------------- 162
E L +++ Q +EI
Sbjct: 4948 SDSNDDDDDSANERSNANSTAADRPPSEAEVLAGFNAAQLGQLQEIRSQRQQGCTGYQDS 5007
Query: 163 ---GTPILTVTLEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQK-TAWIDLL 212
G TL GD +Y+P+G IH A T + S H+TIS+ +K AW D L
Sbjct: 5008 MLRGMRCTNFTLREGDTMYMPKGIIHFALT-EEQGSSHITISLERKGLAWADAL 5060
>gi|429099281|ref|ZP_19161387.1| hypothetical protein BN133_3877 [Cronobacter dublinensis 582]
gi|426285621|emb|CCJ87500.1| hypothetical protein BN133_3877 [Cronobacter dublinensis 582]
Length = 373
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVTLEPGD 175
G PH D + FI+Q G+++W+V + ++ P P + PI+ L PGD
Sbjct: 121 GVGPHIDQYDVFIIQGMGRRRWRVGDKLPMRQFCPH---PALLHVDPFEPIIDEDLAPGD 177
Query: 176 LLYLPRGYIHQASTVTNEHSLHVTIS 201
+LY+P G+ H T+E +L+ ++
Sbjct: 178 ILYIPPGFPHDG--FTHETALNYSVG 201
>gi|167539693|ref|XP_001751194.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770146|gb|EDQ83994.1| predicted protein [Monosiga brevicollis MX1]
Length = 254
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 49/105 (46%), Gaps = 20/105 (19%)
Query: 111 PPTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPILTVT 170
P T PH D+ ILQL+G+K W V VD S EE T
Sbjct: 61 PATQPAINPHTDNYHILILQLQGEKHWLVC---QVDNAE--------SCEEF-------T 102
Query: 171 LEPGDLLYLPRGYIHQASTVTNEHSLHVTISVYQKTAWIDLLEKA 215
L PGD+L+LPR H A T TN S+H TI +Q DL+E A
Sbjct: 103 LYPGDVLFLPRRAGHVAWT-TNVTSVHATIG-FQGVDCGDLVEAA 145
>gi|300691871|ref|YP_003752866.1| hypothetical protein RPSI07_2227 [Ralstonia solanacearum PSI07]
gi|299078931|emb|CBJ51591.1| conserved protein of unknown function [Ralstonia solanacearum
PSI07]
Length = 409
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F+LQ G+++W++ L + D LP NF+ EE L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQNDLTLVPD--LPLKILANFTPEE------EFVL 201
Query: 172 EPGDLLYLPRGYIH 185
EPGD+LYLP Y H
Sbjct: 202 EPGDMLYLPPHYAH 215
>gi|387813958|ref|YP_005429441.1| hypothetical protein MARHY1541 [Marinobacter hydrocarbonoclasticus
ATCC 49840]
gi|381338971|emb|CCG95018.1| conserved hypothetical protein, putative enzyme with RmlC-like
domain [Marinobacter hydrocarbonoclasticus ATCC 49840]
Length = 383
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 37/85 (43%), Gaps = 20/85 (23%)
Query: 112 PTAQGFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEYLPRYSSPNFSQEEIGTPIL---- 167
PT PHYD + F+LQ +G ++W D PR GTP+
Sbjct: 121 PTGGSVGPHYDMYDVFLLQAQGHRRWT--FGGHCDHRSPRVE---------GTPLRILSS 169
Query: 168 -----TVTLEPGDLLYLPRGYIHQA 187
TVTL PGD+LYLP G H
Sbjct: 170 WDGEETVTLAPGDMLYLPPGVGHHG 194
>gi|344167058|emb|CCA79251.1| conserved hypothetical protein [blood disease bacterium R229]
Length = 409
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F+LQ G+++W++ L + D LP NF+ EE L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQNDLTLVPD--LPLKILANFTPEE------EFVL 201
Query: 172 EPGDLLYLPRGYIH 185
EPGD+LYLP Y H
Sbjct: 202 EPGDMLYLPPHYAH 215
>gi|300704481|ref|YP_003746084.1| hypothetical protein RCFBP_20285 [Ralstonia solanacearum CFBP2957]
gi|299072145|emb|CBJ43477.1| conserved protein of unknown function [Ralstonia solanacearum
CFBP2957]
Length = 409
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F+LQ G+++W++ L + D LP NF+ EE L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQDDLTLVPD--LPLKILANFTPEE------EFVL 201
Query: 172 EPGDLLYLPRGYIH 185
EPGD+LYLP Y H
Sbjct: 202 EPGDMLYLPPHYAH 215
>gi|424776979|ref|ZP_18203954.1| hypothetical protein C660_09304 [Alcaligenes sp. HPC1271]
gi|422888019|gb|EKU30413.1| hypothetical protein C660_09304 [Alcaligenes sp. HPC1271]
Length = 396
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 9/112 (8%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVDEY--LPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ GK+ W++ + + LP NF EE TLEP
Sbjct: 127 GVGPHFDSYDVFLLQAHGKRHWRISQQKDLSLVPDLPLKILRNFQVEE------EFTLEP 180
Query: 174 GDLLYLPRGYIHQASTVTNEHSLHVTI-SVYQKTAWIDLLEKAMPKALQAAG 224
GD+LYLP H + + ++ + S Q T L++ A + A+G
Sbjct: 181 GDMLYLPPHVAHDGIAMGDCMTISIGFRSPTQATLARGLMDAASDQLAAASG 232
>gi|207743511|ref|YP_002259903.1| hypothetical protein RSIPO_01690 [Ralstonia solanacearum IPO1609]
gi|206594909|emb|CAQ61836.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609]
Length = 385
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F+LQ G+++W++ L + D LP NF+ EE L
Sbjct: 126 GVGPHFDSYDVFLLQAHGRRRWRISSQDDLTLVPD--LPLKILANFTPEE------EFVL 177
Query: 172 EPGDLLYLPRGYIH 185
EPGD+LYLP Y H
Sbjct: 178 EPGDMLYLPPHYAH 191
>gi|344174288|emb|CCA86078.1| conserved hypothetical protein [Ralstonia syzygii R24]
Length = 385
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F+LQ G+++W++ L + D LP NF+ EE L
Sbjct: 126 GVGPHFDSYDVFLLQAHGRRRWRISSQNDLTLVPD--LPLKILANFTPEE------EFVL 177
Query: 172 EPGDLLYLPRGYIH 185
EPGD+LYLP Y H
Sbjct: 178 EPGDMLYLPPHYAH 191
>gi|83748242|ref|ZP_00945268.1| Hypothetical cytosolic protein [Ralstonia solanacearum UW551]
gi|83725083|gb|EAP72235.1| Hypothetical cytosolic protein [Ralstonia solanacearum UW551]
Length = 409
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F+LQ G+++W++ L + D LP NF+ EE L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQDDLTLVPD--LPLKILANFTPEE------EFVL 201
Query: 172 EPGDLLYLPRGYIH 185
EPGD+LYLP Y H
Sbjct: 202 EPGDMLYLPPHYAH 215
>gi|390568842|ref|ZP_10249134.1| cupin 4 family protein [Burkholderia terrae BS001]
gi|389939191|gb|EIN01028.1| cupin 4 family protein [Burkholderia terrae BS001]
Length = 380
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 37/72 (51%), Gaps = 8/72 (11%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPR--MVDEYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ+ GK++W++ R + LP NF EE LEP
Sbjct: 121 GVGPHFDSYDVFLLQVHGKRRWRIGAQRDLSLQPGLPLKVLQNFEPEE------EWVLEP 174
Query: 174 GDLLYLPRGYIH 185
GD+LYLP H
Sbjct: 175 GDMLYLPPHIAH 186
>gi|386333884|ref|YP_006030055.1| hypothetical protein RSPO_c02223 [Ralstonia solanacearum Po82]
gi|334196334|gb|AEG69519.1| Hypothetical cytosolic protein [Ralstonia solanacearum Po82]
Length = 409
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 12/74 (16%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVY----LPRMVDEYLPRYSSPNFSQEEIGTPILTVTL 171
G PH+D + F+LQ G+++W++ L + D LP NF+ EE L
Sbjct: 150 GVGPHFDSYDVFLLQAHGRRRWRISSQDDLTLVPD--LPLKILANFTPEE------EFVL 201
Query: 172 EPGDLLYLPRGYIH 185
EPGD+LYLP Y H
Sbjct: 202 EPGDMLYLPPHYAH 215
>gi|406974719|gb|EKD97706.1| hypothetical protein ACD_23C00766G0001 [uncultured bacterium]
Length = 379
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 116 GFAPHYDDIEAFILQLEGKKKWKVYLPRMVD--EYLPRYSSPNFSQEEIGTPILTVTLEP 173
G PH+D + F+LQ G+++W++ + + + +P F E+ LEP
Sbjct: 129 GVGPHFDSYDVFLLQAHGRRRWRIGRQKNLSLRDDIPLKVLAQFEPEQ------EYVLEP 182
Query: 174 GDLLYLPRGYIH 185
GD+LYLP GY H
Sbjct: 183 GDMLYLPPGYAH 194
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.137 0.404
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,530,763,211
Number of Sequences: 23463169
Number of extensions: 244271924
Number of successful extensions: 609459
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 484
Number of HSP's successfully gapped in prelim test: 1173
Number of HSP's that attempted gapping in prelim test: 606758
Number of HSP's gapped (non-prelim): 1919
length of query: 339
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 196
effective length of database: 9,003,962,200
effective search space: 1764776591200
effective search space used: 1764776591200
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)