RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17306
(904 letters)
>gnl|CDD|193551 cd05676, M20_dipept_like_CNDP, M20 cytosolic nonspecific
dipeptidases including anserinase and serum carnosinase.
Peptidase M20 family, CNDP (cytosolic nonspecific
dipeptidase) subfamily including anserinase
(Xaa-methyl-His dipeptidase, EC 3.4.13.5), 'serum'
carnosinase (beta-alanyl-L-histidine dipeptidase; EC
3.4.13.20), and some uncharacterized proteins. Two
genes, CN1 and CN2, coding for proteins that degrade
carnosine (beta-alanyl-L-histidine) and homocarnosine
(gamma-aminobutyric acid-L-histidine), two naturally
occurring dipeptides with potential neuroprotective and
neurotransmitter functions, have been identified. CN1
encodes for serum carnosinase and has narrow substrate
specificity for Xaa-His dipeptides, where Xaa can be
beta-alanine (carnosine), N-methyl beta-alanine,
alanine, glycine and gamma-aminobutyric acid
(homocarnosine). CN2 corresponds to the cytosolic
nonspecific dipeptidase (CNDP; EC 3.4.13.18) and is not
limited to Xaa-His dipeptides. CNDP requires Mn(2+) for
full activity and does not hydrolyze homocarnosine.
Anserinase is a dipeptidase that mainly catalyzes the
hydrolysis of N-alpha-acetylhistidine.
Length = 466
Score = 375 bits (964), Expect = e-121
Identities = 155/322 (48%), Positives = 189/322 (58%), Gaps = 62/322 (19%)
Query: 302 KKEDGWNTEPFVLDFERRKALWRRGALVMMKRPVL------------------------- 336
K EDGW+T+PF L E+ L+ RG+ K PVL
Sbjct: 99 KLEDGWDTDPFTLT-EKDGKLYGRGS-TDDKGPVLGWLNAIEAYQKLGQDLPVNLKFCFE 156
Query: 337 --------GLASAIES-------------ISAN----QEK----------CAVNIKIECA 361
GL IE+ IS N +K C I++E A
Sbjct: 157 GMEESGSEGLDELIEARKDTFFSDVDYVCISDNYWLGTKKPCLTYGLRGICYFFIEVEGA 216
Query: 362 SKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQFYEKIDFD 421
KDLHSG+FGGSVHE MTDLI +M LV+ +GKILIP IY V PL + E + YEKIDFD
Sbjct: 217 DKDLHSGVFGGSVHEPMTDLIALMSSLVDSDGKILIPGIYDLVAPLTEEEWELYEKIDFD 276
Query: 422 TEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPGKVVGKFSIR 481
E++R I +L +K ++LM RWRYPSLS+HGIEGAFS PG KTVIP KV+GKFSIR
Sbjct: 277 MEEYREDIGVRRLLYDNKEELLMHRWRYPSLSIHGIEGAFSEPGAKTVIPAKVIGKFSIR 336
Query: 482 IVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPEHPNYVAAARATKY 541
+VPN P+ VEK V DYL +++ SPNK K Y+ GK W +P+HPNY A +ATK
Sbjct: 337 LVPNMDPEQVEKLVTDYLEKVFAELGSPNKLKVYMGHGGKPWVADPDHPNYKAGRKATKR 396
Query: 542 VYNVEPDLTREGGSIPITLTFE 563
V+ VEPDLTREGGSIPITLTF+
Sbjct: 397 VFGVEPDLTREGGSIPITLTFQ 418
>gnl|CDD|193555 cd05680, M20_dipept_like_3, Uncharacterized M20 Dipeptidases.
Peptidase M20 family, unknown dipeptidase-like subfamily
3 (inferred by homology to be dipeptidases). M20
dipeptidases include a large variety of bacterial
enzymes including cytosolic nonspecific dipeptidase
(CNDP), Xaa-methyl-His dipeptidase (anserinase),and
canosinase. These dipeptidases have been shown to act on
a wide range of dipeptides, but not larger peptides. For
example, anserinase mainly catalyzes the hydrolysis of
N-alpha-acetylhistidine while carnosinase degrades
beta-alanyl-L-histidine.
Length = 437
Score = 176 bits (449), Expect = 7e-48
Identities = 72/217 (33%), Positives = 105/217 (48%), Gaps = 13/217 (5%)
Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTE 411
+ I + A +DLHSGL+GG+V + L ++ L + +G+I IP Y DV PL E
Sbjct: 184 AYLEITVRGADRDLHSGLYGGAVPNPIHALARLLASLHDEDGRITIPGFYDDVRPLSAEE 243
Query: 412 EQFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIP 471
+ + FD E F + L L W P+L ++GI G ++G G KTVIP
Sbjct: 244 RAAWAALPFDEEAFLAELGVSALAGEAGYTTLERLWARPTLEVNGIWGGYTGEGSKTVIP 303
Query: 472 GKVVGKFSIRIVPNQTPQ----CVEKYVLDYLNELWKARNSPN-KFKAYLLDSGKSWRTN 526
+ K S+R+VP+Q P +E +V +A P + L G+ +
Sbjct: 304 AEAHAKISMRLVPDQDPDKILELLEAHV--------RAHTPPGVTVEVTRLHGGRPYLVP 355
Query: 527 PEHPNYVAAARATKYVYNVEPDLTREGGSIPITLTFE 563
P+HP AA RA + + P REGGSIPI F+
Sbjct: 356 PDHPALQAARRALEEAFGKPPVFIREGGSIPIVGLFK 392
>gnl|CDD|193552 cd05677, M20_dipept_like_DUG2_type, M20 Defective in Utilization of
Glutathione-type peptidases containing WD repeats.
Peptidase M20 family, Defective in Utilization of
Glutathione (DUG2) subfamily. DUG2-type proteins are
metallopeptidases containing WD repeats at the
N-terminus. DUG2 proteins are involved in the
alternative pathway of glutathione (GSH) degradation.
GSH, the major low-molecular-weight thiol compound in
most eukaryotic cells, is normally degraded through the
gamma-glutamyl cycle initiated by gamma-glutamyl
transpeptidase. However, a novel pathway for the
degradation of GSH has been characterized; it requires
the participation of three genes identified in
Saccharomyces cerevisiae as "defective in utilization of
glutathione" genes including DUG1, DUG2, and DUG3. DUG1
encodes a probable di- or tri-peptidase identified as
M20 metallopeptidase, DUG2 gene encodes a protein with a
metallopeptidase domain and a large N-terminal WD40
repeat region, while DUG3 encodes a protein with a
glutamine amidotransferase domain. Although dipeptides
and tripeptides with a normal peptide bond, such as
cys-gly or glu-cys-gly, can be hydrolyzed by the DUG1
protein, the presence of an unusual peptide bond, like
in GSH, requires the participation of the DUG2 and DUG3
proteins as well. These three proteins form a GSH
degradosomal complex.
Length = 434
Score = 161 bits (410), Expect = 7e-43
Identities = 71/203 (34%), Positives = 109/203 (53%), Gaps = 12/203 (5%)
Query: 355 NIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQF 414
I + DLHSG+ GG++ E DLI ++G+L +P+G+ILIP Y V+P+ + EE
Sbjct: 193 TISVSSDEPDLHSGVDGGALREPTADLILLLGKLQDPDGRILIPGFYDPVKPITEEEEAR 252
Query: 415 YEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPGKV 474
+ I A+ T L+++WR PSL++H + SGPG TVIP +
Sbjct: 253 FRDI------ASIALIPLPTTVDS----LIAKWRKPSLTIHSVN--VSGPGNSTVIPRRA 300
Query: 475 VGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPEHPNYVA 534
SIR+VP+Q + +++ + DYL E + SPN +L+ + W +P++P +
Sbjct: 301 SASVSIRLVPDQDLETIKQDLKDYLRECFAELKSPNHLDIEVLNEAEPWLGDPDNPAFQT 360
Query: 535 AARATKYVYNVEPDLTREGGSIP 557
A + VEP REGGSIP
Sbjct: 361 LREAVTDAWGVEPLYIREGGSIP 383
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 145 bits (368), Expect = 6e-41
Identities = 53/108 (49%), Positives = 69/108 (63%), Gaps = 9/108 (8%)
Query: 805 AVDAYFKEALRTSEPKA---PKAPRPPKQPIVQDFQFFPPRLFEILDQEIYYFRKTVGYK 861
+VD Y+K+ L T KA PKAPR PKQ VQD QFFPPRL E+ ++E Y++K +GYK
Sbjct: 1 SVDNYYKDVLNTGGRKASKKPKAPRAPKQVNVQDHQFFPPRLKELQEKEQLYYKKKIGYK 60
Query: 862 VPKN------PELGSDATKAQKEEQKKIDESEPLTEEELAEKEELLTQ 903
VP + E + +K EQ+KID +EPLTEEE EK+ LL +
Sbjct: 61 VPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108
>gnl|CDD|193513 cd03893, M20_Dipept_like, M20 Dipeptidases. Peptidase M20 family,
dipeptidase-like subfamily. This group contains a large
variety of enzymes, including cytosolic nonspecific
dipeptidase (CNDP), Xaa-methyl-His dipeptidase
(anserinase), canosinase, DUG2 type proteins, as well as
many proteins inferred by homology to be dipeptidases.
These enzymes have been shown to act on a wide range of
dipeptides, but not larger peptides. For example,
anserinase mainly catalyzes the hydrolysis of
N-alpha-acetylhistidine while carnosinase degrades
beta-alanyl-L-histidine. Substrates of CNDP are varied
and not limited to Xaa-His dipeptides. DUG2 proteins
contain a metallopeptidase domain and a large N-terminal
WD40 repeat region, and are involved in the alternative
pathway of glutathione degradation.
Length = 428
Score = 155 bits (394), Expect = 9e-41
Identities = 56/215 (26%), Positives = 97/215 (45%), Gaps = 18/215 (8%)
Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTE 411
V + + +HSG+FGG+ +A+T L+ ++ L + +G++ +P +Y L
Sbjct: 184 ANVTVTVRTLDHAVHSGMFGGAAPDALTALVRLLATLHDEDGRVAVPGLY---SDLPWEG 240
Query: 412 EQFYEKIDFDTEDFRT---AIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKT 468
+ E E+FR +D +L A + W P+L++ GI+ A G
Sbjct: 241 VDYPE------EEFRAEAGVLDGVEL--AGTGSIADRLWAKPALTVIGID-APPVAGASN 291
Query: 469 VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPE 528
+P K S+R+VP Q P+ ++ + +L K D+G+ +R + +
Sbjct: 292 AVPPSARAKLSLRLVPGQDPEEAQEALEAHLE---AHAPWGAKVTVTREDAGEPFRADTD 348
Query: 529 HPNYVAAARATKYVYNVEPDLTREGGSIPITLTFE 563
P Y AA A + VEP L GGSIP+ +
Sbjct: 349 GPAYQAARAALATAWGVEPVLIGSGGSIPLISDLQ 383
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 140 bits (355), Expect = 5e-34
Identities = 67/180 (37%), Positives = 101/180 (56%), Gaps = 17/180 (9%)
Query: 729 ITDEDIDVILERCEAKTEELNKKFEQLGESSLRDFTLDAPTQSVYKFEGEDYREKQKI-- 786
ITDEDID I+ + E T EL+ K ++ E +++ F +D +Y F+ ED +++ K+
Sbjct: 655 ITDEDIDRIIAKGEEATAELDAKMKKFTEDAIK-FKMDDTA-ELYDFDDEDDKDENKLDF 712
Query: 787 --VPIGHWIEPPKRERKANYAVDAYFKEALRTSEPKAPKAPRPPKQPIVQDFQFF-PPRL 843
+ +WI+PPKRERK NY+ YFK+A+R P PK PR P+ P + DFQFF RL
Sbjct: 713 KKIVSDNWIDPPKRERKRNYSESEYFKQAMRQGAPAKPKEPRIPRMPQLHDFQFFNVQRL 772
Query: 844 FEILDQEIYYFRKTVGYKVPKNPELGSDATKAQKEEQKKIDESEPLTEEELAEKEELLTQ 903
E+ ++E V Y + + + T E + + +PLT EE EKE+LL +
Sbjct: 773 TELYEKE-------VRYLMQAHQKGQLKDTIDVAEPE---EPGDPLTAEEQEEKEQLLEE 822
>gnl|CDD|169276 PRK08201, PRK08201, hypothetical protein; Provisional.
Length = 456
Score = 131 bits (330), Expect = 2e-32
Identities = 71/212 (33%), Positives = 106/212 (50%), Gaps = 5/212 (2%)
Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTE 411
A+ I + A DLHSGL+GG+V A+ L+ ++ L + +G + + Y V PL E
Sbjct: 200 AALEIDVRGAKGDLHSGLYGGAVPNALHALVQLLASLHDEHGTVAVEGFYDGVRPLTPEE 259
Query: 412 EQFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIP 471
+ + + FD E + + +L + L W P+L L+G+ G F G G KTVIP
Sbjct: 260 REEFAALGFDEEKLKRELGVDELFGEEGYTALERTWARPTLELNGVYGGFQGEGTKTVIP 319
Query: 472 GKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPN-KFKAYLLDSGKSWRTNPEHP 530
+ K + R+VP+Q PQ +LD + +A + D G ++ +HP
Sbjct: 320 AEAHAKITCRLVPDQDPQ----EILDLIEAHLQAHTPAGVRVTIRRFDKGPAFVAPIDHP 375
Query: 531 NYVAAARATKYVYNVEPDLTREGGSIPITLTF 562
AAARA + VY E TR GGSIP+ TF
Sbjct: 376 AIQAAARAYEAVYGTEAAFTRMGGSIPVVETF 407
>gnl|CDD|235802 PRK06446, PRK06446, hypothetical protein; Provisional.
Length = 436
Score = 116 bits (293), Expect = 1e-27
Identities = 64/248 (25%), Positives = 106/248 (42%), Gaps = 27/248 (10%)
Query: 308 NTEPFVLDF-ERRKA---LWRRGALVMMKRP--VLGLASAIESISANQEKCAVNIKIECA 361
N E F+ + KA + L RP VLG+ + V + +
Sbjct: 142 NLEDFIEKNKNKLKADSVIMEGAGLDPKGRPQIVLGVKGLL----------YVELVLRTG 191
Query: 362 SKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQFYEKIDFD 421
+KDLHS V DL+ ++ LV+ G++LIP Y DV L + E + +K D D
Sbjct: 192 TKDLHS-SNAPIVRNPAWDLVKLLSTLVDGEGRVLIPGFYDDVRELTEEERELLKKYDID 250
Query: 422 TEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPGKVVGKFSIR 481
E+ R A+ +L +D+ ++ + P+ ++ G ++G G KT++P + K R
Sbjct: 251 VEELRKALGFKELKYSDREKIAEALLTEPTCNIDGFYSGYTGKGSKTIVPSRAFAKLDFR 310
Query: 482 IVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSG--KSWRTNPEHPNYVAAARAT 539
+VPNQ P + + + +L F ++ G RT+ A +
Sbjct: 311 LVPNQDPYKIFELLKKHL--------QKVGFNGEIIVHGFEYPVRTSVNSKVVKAMIESA 362
Query: 540 KYVYNVEP 547
K VY EP
Sbjct: 363 KRVYGTEP 370
>gnl|CDD|193556 cd05681, M20_dipept_like_4, Uncharacterized M20 Dipeptidases.
Peptidase M20 family, unknown dipeptidase-like subfamily
3 (inferred by homology to be dipeptidases). M20
dipeptidases include a large variety of bacterial
enzymes including cytosolic nonspecific dipeptidase
(CNDP), Xaa-methyl-His dipeptidase (anserinase),and
canosinase. These dipeptidases have been shown to act on
a wide range of dipeptides, but not larger peptides. For
example, anserinase mainly catalyzes the hydrolysis of
N-alpha-acetylhistidine while carnosinase degrades
beta-alanyl-L-histidine.
Length = 430
Score = 115 bits (291), Expect = 2e-27
Identities = 54/200 (27%), Positives = 92/200 (46%), Gaps = 9/200 (4%)
Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTE 411
V + ++ AS DLHS V L+ + L + G+ILIP Y DV P + E
Sbjct: 180 LYVELSVKGASADLHSSY-AAVVDNPAWRLVQALASLRDEGGRILIPGFYDDVRPPTEEE 238
Query: 412 EQFYEKIDFDTEDFRTA--IDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTV 469
++ +K+ FD E + A + P L L + P+ +++G+ ++G G KT+
Sbjct: 239 KELIDKLPFDEEALKKAFGLKRP-LDGLTGRDALEALVFEPTCNINGLYSGYTGEGSKTI 297
Query: 470 IPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPEH 529
+P + K R+VP+Q P+ V + +L+ +N + + L RT+ +
Sbjct: 298 LPAEATAKLDFRLVPDQDPEEVLDLLRAHLD-----KNGFSDIEVTYLLGEPPARTDLDA 352
Query: 530 PNYVAAARATKYVYNVEPDL 549
P AA K VY +P +
Sbjct: 353 PLVQAALDTAKKVYGTDPVV 372
>gnl|CDD|236379 PRK09104, PRK09104, hypothetical protein; Validated.
Length = 464
Score = 97.3 bits (243), Expect = 4e-21
Identities = 60/208 (28%), Positives = 92/208 (44%), Gaps = 13/208 (6%)
Query: 356 IKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQFY 415
+ I A +DLHSGLFGG+ + L I+ L + G++ +P Y VE L +
Sbjct: 212 VTITAADRDLHSGLFGGAAANPIRVLTRILAGLHDETGRVTLPGFYDGVEELPPEILAQW 271
Query: 416 EKIDFDTEDFRTAID--HPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPGK 473
+ + F E F + P K ++ VL W P+ ++GI G ++G G KTVIP +
Sbjct: 272 KALGFTAEAFLGPVGLSIPAGEK-GRS-VLEQIWSRPTCEINGIWGGYTGEGFKTVIPAE 329
Query: 474 VVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPN---KFKAYLLDSGKSWRTNPEHP 530
K S R+V Q P + + Y+ +AR + +F + + + P
Sbjct: 330 ASAKVSFRLVGGQDPAKIREAFRAYV----RARLPADCSVEFHDH--GGSPAIALPYDSP 383
Query: 531 NYVAAARATKYVYNVEPDLTREGGSIPI 558
AA A + L GGSIPI
Sbjct: 384 ALAAAKAALSDEWGKPAVLIGSGGSIPI 411
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 83.6 bits (207), Expect = 2e-19
Identities = 47/111 (42%), Positives = 60/111 (54%), Gaps = 11/111 (9%)
Query: 586 EAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA 645
EA NLGN+Y + G EALE Y A+ L PD D Y NLAAA G E+A++
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALE----- 55
Query: 646 LQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPN 696
Y KA+E PD A A+ NLG + G+ A+ +EKA+ LDPN
Sbjct: 56 ------DYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100
Score = 82.0 bits (203), Expect = 7e-19
Identities = 39/91 (42%), Positives = 51/91 (56%)
Query: 77 EAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA 136
EA NLGN+Y + G EALE Y A+ L PD D Y NLAAA G E+A++ Y A
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60
Query: 137 LQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
L+ +PD +LG LG+ +EA + Y
Sbjct: 61 LELDPDNAKAYYNLGLAYYKLGKYEEALEAY 91
Score = 70.1 bits (172), Expect = 1e-14
Identities = 34/99 (34%), Positives = 57/99 (57%)
Query: 44 VLLLLSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAV 103
LL L +++++ D++ + A++ +P A+AY NL Y + G+ +EALE+Y A+
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 104 RLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
L PD Y NL A G E+A++AY AL+ +P+
Sbjct: 62 ELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100
Score = 67.4 bits (165), Expect = 1e-13
Identities = 31/87 (35%), Positives = 49/87 (56%)
Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
D++ + A++ +P A+AY NL Y + G+ +EALE+Y A+ L PD Y N
Sbjct: 14 GDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYN 73
Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPS 651
L A G E+A++AY AL+ +P+
Sbjct: 74 LGLAYYKLGKYEEALEAYEKALELDPN 100
Score = 65.1 bits (159), Expect = 7e-13
Identities = 30/74 (40%), Positives = 44/74 (59%)
Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 285
A++ +P A+AY NL Y + G+ +EALE+Y A+ L PD Y NL A G E
Sbjct: 26 ALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYE 85
Query: 286 QAVQAYVTALQYNP 299
+A++AY AL+ +P
Sbjct: 86 EALEAYEKALELDP 99
Score = 62.4 bits (152), Expect = 6e-12
Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 11/96 (11%)
Query: 622 YINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIW 681
+NL GD ++A++ Y KA+E PD A A+ NL + G+
Sbjct: 3 LLNLGNLYYKLGDYDEALE-----------YYEKALELDPDNADAYYNLAAAYYKLGKYE 51
Query: 682 LAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRA 717
A+ +EKA+ LDP+ AY NLG + ++ A
Sbjct: 52 EALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEA 87
Score = 62.4 bits (152), Expect = 6e-12
Identities = 31/65 (47%), Positives = 39/65 (60%)
Query: 235 EAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA 294
EA NLGN+Y + G EALE Y A+ L PD D Y NLAAA G E+A++ Y A
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60
Query: 295 LQYNP 299
L+ +P
Sbjct: 61 LELDP 65
Score = 48.9 bits (117), Expect = 4e-07
Identities = 18/52 (34%), Positives = 29/52 (55%)
Query: 666 AWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRA 717
A NLG ++ G+ A+ ++EKA+ LDP+ DAY NL + ++ A
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEA 53
Score = 45.1 bits (107), Expect = 8e-06
Identities = 35/154 (22%), Positives = 56/154 (36%), Gaps = 56/154 (36%)
Query: 113 YINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSD 172
+NL GD ++A++ Y AL+ +PD D Y +
Sbjct: 3 LLNLGNLYYKLGDYDEALEYYEKALELDPD--------------------NADAYY---N 39
Query: 173 LGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPL 232
L LG+ +EA + + A++ +P
Sbjct: 40 LAAAYYKLGKYEEALEDYEK---------------------------------ALELDPD 66
Query: 233 LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 266
A+AY NLG Y + G+ +EALE Y A+ L P+
Sbjct: 67 NAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 76.7 bits (189), Expect = 3e-14
Identities = 69/287 (24%), Positives = 115/287 (40%), Gaps = 40/287 (13%)
Query: 11 LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIK 70
LL LA +Y A + + +NT + L+ + ++ + + + K
Sbjct: 638 LLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQK 697
Query: 71 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 130
Q+P A + G++Y + A++ YR A++ P I L AL+A+G+ +AV
Sbjct: 698 QHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPS-SQNAIKLHRALLASGNTAEAV 756
Query: 131 QAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLH 190
+ L+ +P+ +R+ L L A D+A K+
Sbjct: 757 KTLEAWLKTHPNDAVLRTALAELYLAQKDYDKA----------------------IKHYQ 794
Query: 191 TENIKPVTMKVQ--NAIVCN-----YGGRK-PTTLESAHFSTLAIKQNPLLAEAYSNLGN 242
T +K NA+V N Y K P LE A A+K P + LG
Sbjct: 795 T------VVKKAPDNAVVLNNLAWLYLELKDPRALEYA---ERALKLAPNIPAILDTLGW 845
Query: 243 VYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 289
+ E+G+ AL R AV + P+ +LA AL+A G +A +
Sbjct: 846 LLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALALLATGRKAEARK 892
Score = 72.8 bits (179), Expect = 5e-13
Identities = 64/330 (19%), Positives = 114/330 (34%), Gaps = 58/330 (17%)
Query: 22 GDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSN 81
G + A + ++ L LL + +K+A + A + +P A A +
Sbjct: 343 GRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQ 402
Query: 82 LGNVYKERGQLQEALENYRHAVRLKPDFID------------------------------ 111
LG +G EA+ + A +L P+
Sbjct: 403 LGISKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQP 462
Query: 112 ----GYINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
+ L A + GD+ +A +A+ AL PD + ++L + G D+A +
Sbjct: 463 DNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRF 522
Query: 168 --CVRSDLGNL--LKALGRLDEAKN--------------LHTENIKPVTMKVQNAIVCNY 209
+ D NL + AL L L+ + I+P Q +
Sbjct: 523 EKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQ 582
Query: 210 GGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID 269
+ L A A +P EA+ LG G L +A+ +++ + L+PD
Sbjct: 583 LKKALAILNEA---ADAAPDSP---EAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSAL 636
Query: 270 GYINLAAALVAAGDMEQAVQAYVTALQYNP 299
+ LA A + +A+ + AL+ P
Sbjct: 637 ALLLLADAYAVMKNYAKAITSLKRALELKP 666
Score = 71.7 bits (176), Expect = 1e-12
Identities = 68/300 (22%), Positives = 120/300 (40%), Gaps = 45/300 (15%)
Query: 11 LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIK 70
LA + Q G+ + A + ++ + N +L L+ ++ + +++ + A +
Sbjct: 502 AANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAE 561
Query: 71 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 130
NP E L Y +GQL++AL A PD + ++ L A +AAGD+ +AV
Sbjct: 562 LNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAV 621
Query: 131 QAYVTALQYNPD----------LYCVRSDLGNLLKALGR-LDEAKDLYCVRSDLGNLLKA 179
++ L PD Y V + + +L R L+ D + L LL A
Sbjct: 622 SSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLA 681
Query: 180 LGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSN 239
R + AK + L+ KQ+P A +
Sbjct: 682 AKRTESAKKIAKS------------------------LQ---------KQHPKAALGFEL 708
Query: 240 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
G++Y + A++ YR A++ P I L AL+A+G+ +AV+ L+ +P
Sbjct: 709 EGDLYLRQKDYPAAIQAYRKALKRAPS-SQNAIKLHRALLASGNTAEAVKTLEAWLKTHP 767
Score = 65.9 bits (161), Expect = 8e-11
Identities = 57/244 (23%), Positives = 90/244 (36%), Gaps = 54/244 (22%)
Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 127
A++++P AEA LG +Y G A + R A+ L LA A + G +
Sbjct: 48 ALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYPKNQVLPLLARAYLLQGKFQ 107
Query: 128 QAVQAYVTALQYNPD-----LYCVRSDLGNLLKALGRLDEAKDLYC-----------VRS 171
Q + + D L +R G LG+L+ A+ Y +
Sbjct: 108 QVL-DELPGKTLLDDEGAAELLALR---GLAYLGLGQLELAQKSYEQALAIDPRSLYAKL 163
Query: 172 DLGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNP 231
L L A R DEA+ L E + +P
Sbjct: 164 GLAQLALAENRFDEARALIDE---------------------------------VLTADP 190
Query: 232 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 291
+A G++ G ++ AL YR A+ L+P+ I + LA L+ AG+ E+A + +
Sbjct: 191 GNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEA-EKH 249
Query: 292 VTAL 295
AL
Sbjct: 250 ADAL 253
Score = 64.3 bits (157), Expect = 2e-10
Identities = 74/325 (22%), Positives = 127/325 (39%), Gaps = 46/325 (14%)
Query: 4 IAQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAH 63
+A LG+G LELA + YE A + + L L+ + + D++
Sbjct: 133 LAYLGLGQLELAQK-----SYEQALA-------IDPRSLYAKLGLAQLALAENRFDEARA 180
Query: 64 FSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAA 123
+ +P +A G++ G ++ AL YR A+ L+P+ I + LA L+ A
Sbjct: 181 LIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEA 240
Query: 124 GDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKAL-----GRLDEAKDLY--CVRSD---- 172
G+ E+A + L+ P+ S L + LKAL ++A++ ++S
Sbjct: 241 GEFEEAEKHADALLKKAPN-----SPLAHYLKALVDFQKKNYEDARETLQDALKSAPEYL 295
Query: 173 ------------LGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESA 220
LGNL +A L++ + + + + TL
Sbjct: 296 PALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRVDEAIATLSP- 354
Query: 221 HFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVA 280
A+ +P A S LG Y G ++A E A L P+ L + ++
Sbjct: 355 -----ALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLGISKLS 409
Query: 281 AGDMEQAVQAYVTALQYNPLLKKED 305
GD +A+ TA Q +P L + D
Sbjct: 410 QGDPSEAIADLETAAQLDPELGRAD 434
Score = 61.3 bits (149), Expect = 2e-09
Identities = 48/210 (22%), Positives = 79/210 (37%), Gaps = 22/210 (10%)
Query: 503 WKARNSPNKFKAYLLDSGKSWRTNPEHPNYVAAARATKYVYNVEPDLTREGGSIPITLTF 562
A P+ A LL + NY A + K ++PD T +
Sbjct: 627 LLALQ-PDSALALLLLADAYAVMK----NYAKAITSLKRALELKPDNTE---AQIGLAQL 678
Query: 563 ECRKLDKSAHFSTLAI--KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID 620
+ KQ+P A + G++Y + A++ YR A++ P
Sbjct: 679 LLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPS-SQ 737
Query: 621 GYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEI 680
I L AL+A+G+ +AV+ +LK T P+ AV + L ++ AQ +
Sbjct: 738 NAIKLHRALLASGNTAEAVKTLEA--------WLK---THPNDAVLRTALAELYLAQKDY 786
Query: 681 WLAIHHFEKAVSLDPNFLDAYINLGNVLKE 710
AI H++ V P+ NL + E
Sbjct: 787 DKAIKHYQTVVKKAPDNAVVLNNLAWLYLE 816
Score = 61.3 bits (149), Expect = 2e-09
Identities = 69/317 (21%), Positives = 125/317 (39%), Gaps = 29/317 (9%)
Query: 2 NDIAQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKS 61
N A+ +G+ +L+ GD A + + LLL + + + DK+
Sbjct: 396 NAAARTQLGISKLS-----QGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKA 450
Query: 62 AHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALV 121
+ K+ P A ++ LG +Y +G L +A E + A+ ++PDF NLA +
Sbjct: 451 LAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDI 510
Query: 122 AAGDMEQAVQAYVTALQYNP----------DLYCVR---SDLGNLLKALGRLD---EAKD 165
G+ + A+Q + L +P LY + L+ L+
Sbjct: 511 QEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPA 570
Query: 166 LYCVRSDL--GNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFS 223
L + L G L KAL L+EA + ++ + M + + + ++ +
Sbjct: 571 LALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKL--- 627
Query: 224 TLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 283
+ P A A L + Y +A+ + + A+ LKPD + I LA L+AA
Sbjct: 628 ---LALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKR 684
Query: 284 MEQAVQAYVTALQYNPL 300
E A + + + +P
Sbjct: 685 TESAKKIAKSLQKQHPK 701
Score = 60.9 bits (148), Expect = 3e-09
Identities = 34/137 (24%), Positives = 57/137 (41%), Gaps = 11/137 (8%)
Query: 581 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 640
+ AE + G Y GQL+ A ++Y A+ + P + + LA +A ++A
Sbjct: 121 DEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARA 180
Query: 641 AYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDA 700
L +P A G + + G I LA+ + KA++L PN +
Sbjct: 181 LIDEVLTADP-----------GNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAV 229
Query: 701 YINLGNVLKEARIFDRA 717
+ L +L EA F+ A
Sbjct: 230 LLALATILIEAGEFEEA 246
Score = 59.7 bits (145), Expect = 5e-09
Identities = 39/194 (20%), Positives = 70/194 (36%), Gaps = 30/194 (15%)
Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
+ D++ + +P +A G++ G ++ AL YR A+ L+P+ I +
Sbjct: 173 NRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLA 232
Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPS----CYLKAIE--TRPDFAVAWSNL-------- 670
LA L+ AG+ E+A + L+ P+ YLKA+ + ++ A L
Sbjct: 233 LATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKALVDFQKKNYEDARETLQDALKSAP 292
Query: 671 ---------GCVFNAQGEIWLAIHHFEKAVSLDPNFLDA-------YINLGNVLKEARIF 714
G G + A + + + PN A + LG V +
Sbjct: 293 EYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRVDEAIATL 352
Query: 715 DRANTLFYVSSYFM 728
A L +
Sbjct: 353 SPALGLDPDDPAAL 366
Score = 58.9 bits (143), Expect = 9e-09
Identities = 58/281 (20%), Positives = 104/281 (37%), Gaps = 21/281 (7%)
Query: 11 LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIK 70
+L LA + G+ E A + L L+ + +L K+ A
Sbjct: 536 ILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAAD 595
Query: 71 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 130
P EA+ LG G L +A+ +++ + L+PD + LA A + +A+
Sbjct: 596 AAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAI 655
Query: 131 QAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY-----------CVRSDLGNLLKA 179
+ AL+ PD + L LL A R + AK + G+L
Sbjct: 656 TSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLR 715
Query: 180 LGRLDEAKNLHTENIK-----PVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLA 234
A + + +K +K+ A++ + G E+ +K +P A
Sbjct: 716 QKDYPAAIQAYRKALKRAPSSQNAIKLHRALLAS--GNTA---EAVKTLEAWLKTHPNDA 770
Query: 235 EAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLA 275
+ L +Y + +A+++Y+ V+ PD NLA
Sbjct: 771 VLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLA 811
Score = 55.5 bits (134), Expect = 1e-07
Identities = 58/280 (20%), Positives = 101/280 (36%), Gaps = 23/280 (8%)
Query: 21 AGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKS-AHFSTLAIKQNPLLAEAY 79
GDY +AE+ + VL LL+ + K + + + AE
Sbjct: 69 LGDYAAAEKELRKALSLGYPKNQVLPLLARAYLLQGKFQQVLDELPGKTLLDDEGAAELL 128
Query: 80 SNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQY 139
+ G Y GQL+ A ++Y A+ + P + + LA +A ++A L
Sbjct: 129 ALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTA 188
Query: 140 NPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLHTENIKPVTM 199
+P G+LL +LG ++ A Y +A L NI +
Sbjct: 189 DPGNVDALLLKGDLLLSLGNIELALAAY----------------RKAIALRPNNIAVLLA 232
Query: 200 KVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRH 259
I ++ L N LA L V ++ ++A E +
Sbjct: 233 LATILIEAGEFEEAEKHADAL----LKKAPNSPLAHYLKAL--VDFQKKNYEDARETLQD 286
Query: 260 AVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
A++ P+++ + A+ G++EQA Q L+Y P
Sbjct: 287 ALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAP 326
Score = 51.2 bits (123), Expect = 2e-06
Identities = 47/203 (23%), Positives = 79/203 (38%), Gaps = 30/203 (14%)
Query: 562 FECRKLDKSAHFSTL--AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI 619
R ++ + L A + NP E L Y +GQL++AL A PD
Sbjct: 542 LYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSP 601
Query: 620 DGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGE 679
+ ++ L A +AAGD+ +AV S + K + +PD A+A L +
Sbjct: 602 EAWLMLGRAQLAAGDLNKAV-----------SSFKKLLALQPDSALALLLLADAYAVMKN 650
Query: 680 IWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRANTLFYVSSYFMITDEDIDVILE 739
AI ++A+ L P+ +A I L +L A+ + A + +
Sbjct: 651 YAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIA-----------------K 693
Query: 740 RCEAKTEELNKKFEQLGESSLRD 762
+ + + FE G+ LR
Sbjct: 694 SLQKQHPKAALGFELEGDLYLRQ 716
Score = 48.9 bits (117), Expect = 1e-05
Identities = 39/153 (25%), Positives = 72/153 (47%), Gaps = 14/153 (9%)
Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
+K+A + A + +P A A + LG +G EA+ + A +L P+ +
Sbjct: 377 GDFEKAAEYLAKATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELGRADLL 436
Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETR-PDFAVAWSNLGCVFNAQGEIWLA 683
L + + +G ++A+ A K +E + PD A + LG ++ +G++ A
Sbjct: 437 LILSYLRSGQFDKALAA------------AKKLEKKQPDNASLHNLLGAIYLGKGDLAKA 484
Query: 684 IHHFEKAVSLDPNFLDAYINLGNV-LKEARIFD 715
FEKA+S++P+F A NL + ++E D
Sbjct: 485 REAFEKALSIEPDFFPAAANLARIDIQEGNPDD 517
Score = 47.0 bits (112), Expect = 5e-05
Identities = 50/250 (20%), Positives = 90/250 (36%), Gaps = 34/250 (13%)
Query: 44 VLLLLSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAV 103
+L L + +L+ + A+ +P A L + + EA +
Sbjct: 127 LLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVL 186
Query: 104 RLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEA 163
P +D + L++ G++E A+ AY A+ P+ V L +L G +EA
Sbjct: 187 TADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEA 246
Query: 164 -KDLYCVR-----SDLGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTL 217
K + S L + LKAL +D K ++A TL
Sbjct: 247 EKHADALLKKAPNSPLAHYLKAL--VDFQKK-----------NYEDAR---------ETL 284
Query: 218 ESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA 277
+ A+K P A G + G L++A + ++ P+ LA+
Sbjct: 285 QD------ALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASI 338
Query: 278 LVAAGDMEQA 287
+ G +++A
Sbjct: 339 QLRLGRVDEA 348
Score = 41.2 bits (97), Expect = 0.003
Identities = 31/142 (21%), Positives = 53/142 (37%), Gaps = 12/142 (8%)
Query: 579 KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 638
K +P A + L +Y + +A+++Y+ V+ PD NLA + D
Sbjct: 764 KTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLAWLYLELKDPR-- 821
Query: 639 VQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFL 698
AL+Y +A++ P+ LG + +GE A+ KAV++ P
Sbjct: 822 ------ALEYA----ERALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAA 871
Query: 699 DAYINLGNVLKEARIFDRANTL 720
+L L A
Sbjct: 872 AIRYHLALALLATGRKAEARKE 893
Score = 41.2 bits (97), Expect = 0.003
Identities = 35/153 (22%), Positives = 61/153 (39%), Gaps = 1/153 (0%)
Query: 14 LAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQNP 73
L +G+ A + + N+ + L+ ++ + DK+ +K+ P
Sbjct: 742 LHRALLASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAP 801
Query: 74 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 133
A +NL +Y E ALE A++L P+ L LV G+ ++A+
Sbjct: 802 DNAVVLNNLAWLYLELKD-PRALEYAERALKLAPNIPAILDTLGWLLVEKGEADRALPLL 860
Query: 134 VTALQYNPDLYCVRSDLGNLLKALGRLDEAKDL 166
A+ P+ +R L L A GR EA+
Sbjct: 861 RKAVNIAPEAAAIRYHLALALLATGRKAEARKE 893
Score = 39.7 bits (93), Expect = 0.007
Identities = 39/162 (24%), Positives = 71/162 (43%), Gaps = 12/162 (7%)
Query: 544 NVEPDLTREGGSIPITLTFECRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQE 603
++P+L R + ++ + DK+ + K+ P A ++ LG +Y +G L +
Sbjct: 425 QLDPELGRADLLLILSY-LRSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAK 483
Query: 604 ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDF 663
A E + A+ ++PDF NLA + G+ + A+Q + L +P L+AI
Sbjct: 484 AREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPK-NLRAILALAGL 542
Query: 664 AVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLG 705
+ N + WL EKA L+P ++ + L
Sbjct: 543 YLRTGN-----EEEAVAWL-----EKAAELNPQEIEPALALA 574
Score = 38.1 bits (89), Expect = 0.026
Identities = 32/150 (21%), Positives = 56/150 (37%), Gaps = 24/150 (16%)
Query: 582 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQA 641
P EA+ LG G L +A+ +++ + L+PD + LA A + +A+ +
Sbjct: 598 PDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITS 657
Query: 642 YVTALQYNPSCYLKAI-----------------------ETRPDFAVAWSNLGCVFNAQG 678
AL+ P I + P A+ + G ++ Q
Sbjct: 658 LKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQK 717
Query: 679 EIWLAIHHFEKAVSLDPNFLDAYINLGNVL 708
+ AI + KA+ P+ +A I L L
Sbjct: 718 DYPAAIQAYRKALKRAPSSQNA-IKLHRAL 746
Score = 37.4 bits (87), Expect = 0.039
Identities = 20/64 (31%), Positives = 31/64 (48%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
A+K P + LG + E+G+ AL R AV + P+ +LA AL+A G
Sbjct: 829 ALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALALLATGRKA 888
Query: 637 QAVQ 640
+A +
Sbjct: 889 EARK 892
Score = 37.0 bits (86), Expect = 0.055
Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 2/79 (2%)
Query: 230 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 289
+ AE + G Y GQL+ A ++Y A+ + P + + LA +A ++A
Sbjct: 121 DEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARA 180
Query: 290 AYVTALQYNPLLKKEDGWN 308
L +P D
Sbjct: 181 LIDEVLTADP--GNVDALL 197
Score = 36.6 bits (85), Expect = 0.072
Identities = 33/131 (25%), Positives = 52/131 (39%), Gaps = 23/131 (17%)
Query: 598 RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP------- 650
+ ++A E + A++ P+++ + A+ G++EQA Q L+Y P
Sbjct: 274 KKNYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARR 333
Query: 651 ---SCYLK-------------AIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLD 694
S L+ A+ PD A S LG + A G+ A + KA LD
Sbjct: 334 LLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELD 393
Query: 695 PNFLDAYINLG 705
P A LG
Sbjct: 394 PENAAARTQLG 404
Score = 35.1 bits (81), Expect = 0.22
Identities = 27/132 (20%), Positives = 46/132 (34%), Gaps = 11/132 (8%)
Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
+L+ + A+ +P A L + + EA + P +D +
Sbjct: 139 GQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTADPGNVDALLL 198
Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAI 684
L++ G++E A+ AY KAI RP+ L + GE A
Sbjct: 199 KGDLLLSLGNIELALAAYR-----------KAIALRPNNIAVLLALATILIEAGEFEEAE 247
Query: 685 HHFEKAVSLDPN 696
H + + PN
Sbjct: 248 KHADALLKKAPN 259
Score = 34.7 bits (80), Expect = 0.25
Identities = 30/160 (18%), Positives = 57/160 (35%), Gaps = 24/160 (15%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 644
E Y ++ + + A+ ++A++ P+ + L +A GD A +
Sbjct: 22 PEELIEAAKSYLQKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAEKELRK 81
Query: 645 ALQYNPS----------CYL------KAIET--------RPDFAVAWSNLGCVFNAQGEI 680
AL YL + ++ A + G + G++
Sbjct: 82 ALSLGYPKNQVLPLLARAYLLQGKFQQVLDELPGKTLLDDEGAAELLALRGLAYLGLGQL 141
Query: 681 WLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRANTL 720
LA +E+A+++DP L A + L + FD A L
Sbjct: 142 ELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARAL 181
>gnl|CDD|193553 cd05678, M20_dipept_like_1, Uncharacterized M20 Dipeptidases.
Peptidase M20 family, unknown dipeptidase-like subfamily
1 (inferred by homology to be dipeptidases). M20
dipeptidases include a large variety of bacterial
enzymes including cytosolic nonspecific dipeptidase
(CNDP), Xaa-methyl-His dipeptidase (anserinase),and
canosinase. These dipeptidases have been shown to act on
a wide range of dipeptides, but not larger peptides. For
example, anserinase mainly catalyzes the hydrolysis of
N-alpha-acetylhistidine while carnosinase degrades
beta-alanyl-L-histidine.
Length = 466
Score = 75.0 bits (185), Expect = 5e-14
Identities = 52/226 (23%), Positives = 93/226 (41%), Gaps = 31/226 (13%)
Query: 354 VNIKIECASKDLHSGLFGGSV-HEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEE 412
+ + + A LHSG +G + A L ++ + + +G++LIP Y D PL + E+
Sbjct: 205 ITLTVYGAKVPLHSGHYGNYAPNPAFR-LARLLASMKDDDGRVLIPGYY-DGIPLSEEEK 262
Query: 413 QFYEKIDFDTEDFRTAIDHPKLTKADKT----QVLMSRWRYPSLSLHGIEGAFSGPGGKT 468
+ + D E R L + +K Q + +YPSL++ G++ ++G +T
Sbjct: 263 KALAAVPDDEESLR---KRLGLAQTEKVGRNYQEAL---QYPSLNIRGLQSGWTGDKART 316
Query: 469 VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYL--------------NELWKARNSPNKFKA 514
+IP + IR+VP P + + ++ E P
Sbjct: 317 IIPSTATAEIDIRLVPETDPDRLLDLLKKHIEAQGYHVVDGEPTAEER---LAHPKIATF 373
Query: 515 YLLDSGKSWRTNPEHPNYVAAARATKYVYNVEPDLTRE-GGSIPIT 559
S ++RT+ + P A + EP R GGS+PI+
Sbjct: 374 TRTGSYNAFRTDLDSPIGKWLRAALTKAFGEEPIQIRTMGGSVPIS 419
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 61.9 bits (151), Expect = 4e-12
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 75 LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDM-EQAVQAY 133
AEA NLGN + G EA+E Y A+ L PD + Y NLA A + G E+A++
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61
Query: 134 VTALQYNP 141
AL+ +P
Sbjct: 62 EKALELDP 69
Score = 61.9 bits (151), Expect = 4e-12
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 233 LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDM-EQAVQAY 291
AEA NLGN + G EA+E Y A+ L PD + Y NLA A + G E+A++
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61
Query: 292 VTALQYNP 299
AL+ +P
Sbjct: 62 EKALELDP 69
Score = 61.9 bits (151), Expect = 4e-12
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 584 LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDM-EQAVQAY 642
AEA NLGN + G EA+E Y A+ L PD + Y NLA A + G E+A++
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61
Query: 643 VTALQYNP 650
AL+ +P
Sbjct: 62 EKALELDP 69
Score = 50.4 bits (121), Expect = 4e-08
Identities = 25/75 (33%), Positives = 36/75 (48%), Gaps = 12/75 (16%)
Query: 622 YINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIW 681
NL AL GD ++A++ Y KA+E PD A A+ NL + G+ +
Sbjct: 6 LKNLGNALFKLGDYDEAIE-----------AYEKALELDPDNAEAYYNLALAYLKLGKDY 54
Query: 682 -LAIHHFEKAVSLDP 695
A+ EKA+ LDP
Sbjct: 55 EEALEDLEKALELDP 69
Score = 45.0 bits (107), Expect = 4e-06
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 1/65 (1%)
Query: 44 VLLLLSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERG-QLQEALENYRHA 102
L L + F+ D++ A++ +P AEAY NL Y + G +EALE+ A
Sbjct: 5 ALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKA 64
Query: 103 VRLKP 107
+ L P
Sbjct: 65 LELDP 69
Score = 42.3 bits (100), Expect = 3e-05
Identities = 19/56 (33%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 562 FECRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERG-QLQEALENYRHAVRLKP 616
F+ D++ A++ +P AEAY NL Y + G +EALE+ A+ L P
Sbjct: 14 FKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELDP 69
Score = 41.1 bits (97), Expect = 9e-05
Identities = 18/47 (38%), Positives = 24/47 (51%)
Query: 664 AVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVLKE 710
A A NLG G+ AI +EKA+ LDP+ +AY NL +
Sbjct: 3 AEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLK 49
Score = 38.4 bits (90), Expect = 8e-04
Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 224 TLAIKQNPLLAEAYSNLGNVYKERG-QLQEALENYRHAVRLKP 265
A++ +P AEAY NL Y + G +EALE+ A+ L P
Sbjct: 27 EKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELDP 69
Score = 31.9 bits (73), Expect = 0.14
Identities = 18/56 (32%), Positives = 27/56 (48%), Gaps = 1/56 (1%)
Query: 113 YINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALG-RLDEAKDLY 167
NL AL GD ++A++AY AL+ +PD +L LG +EA +
Sbjct: 6 LKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61
>gnl|CDD|236127 PRK07907, PRK07907, hypothetical protein; Provisional.
Length = 449
Score = 66.9 bits (164), Expect = 2e-11
Identities = 46/208 (22%), Positives = 85/208 (40%), Gaps = 32/208 (15%)
Query: 365 LHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQFYEKIDFDTED 424
+HSG FGG+ +A+T L+ ++ L + +G + + G + + +D+D E
Sbjct: 214 VHSGQFGGAAPDALTALVRLLATLHDEDGNVAVD---------GLDATEPWLGVDYDEER 264
Query: 425 FRTA---IDHPKLTKADKTQVLMSRWRYPSLSLHGI-----EGAFSGPGGKTVIPGKVVG 476
FR +D +L + W P++++ GI GA + +P
Sbjct: 265 FRADAGVLDGVELIGTGSVADRL--WAKPAITVIGIDAPPVAGASN------ALPPSARA 316
Query: 477 KFSIRIVPNQTPQCVEKYVLDYL--NELWKARNSPNKFKAYLLDSGKSWRTNPEHPNYVA 534
+ S+R+ P Q + ++ +L + W A D+G+ + + P Y A
Sbjct: 317 RLSLRVAPGQDAAEAQDALVAHLEAHAPWGAH-----VTVERGDAGQPFAADASGPAYDA 371
Query: 535 AARATKYVYNVEPDLTREGGSIPITLTF 562
A A + + +P GGSIP
Sbjct: 372 ARAAMREAWGKDPVDMGMGGSIPFIAEL 399
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 57.9 bits (138), Expect = 7e-09
Identities = 68/243 (27%), Positives = 104/243 (42%), Gaps = 35/243 (14%)
Query: 67 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVR--LKPDFIDGYINLAAALVAAG 124
L + N LA L + G+L+EALE A+ L P+ + +NL L A G
Sbjct: 50 LELLPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALG 109
Query: 125 DMEQAVQAYVTALQYNPDLYCVRS--------DLGNLLKALGRLDEAKDLYCVRSDLGNL 176
E+A++ AL +PD + +LG+ +AL ++A +L ++L
Sbjct: 110 KYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEA 169
Query: 177 LKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEA 236
L ALG L EA + E ++ AEA
Sbjct: 170 LLALGALLEALGRYEEALE-------------------------LLEKALKLNPDDDAEA 204
Query: 237 YSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
NLG +Y + G+ +EALE Y A+ L PD + NLA L+ G E+A++A AL+
Sbjct: 205 LLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALE 264
Query: 297 YNP 299
+P
Sbjct: 265 LDP 267
Score = 56.0 bits (133), Expect = 3e-08
Identities = 50/184 (27%), Positives = 78/184 (42%), Gaps = 5/184 (2%)
Query: 5 AQLGIGLLELAHREYQAGDYESAERHCMQ-LWRQETNNTGVLLLLSSIHFQCRKLDKSAH 63
L LL L G YE A + L + LL ++ +++
Sbjct: 92 PNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALE 151
Query: 64 FSTLAIKQNP---LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD-FIDGYINLAAA 119
A++ +P LAEA LG + + G+ +EALE A++L PD + +NL
Sbjct: 152 LYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLL 211
Query: 120 LVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKA 179
+ G E+A++ Y AL+ +PD +L LL LGR +EA + +L L
Sbjct: 212 YLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLYN 271
Query: 180 LGRL 183
LG
Sbjct: 272 LGLA 275
Score = 51.0 bits (120), Expect = 1e-06
Identities = 47/144 (32%), Positives = 71/144 (49%), Gaps = 10/144 (6%)
Query: 576 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLA-AALVAAGD 634
L ++ P LAEA NLG + + G+ +EALE A+ L PD LA AL GD
Sbjct: 86 LELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGD 145
Query: 635 MEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLD 694
E+A++ Y AL+ +P + A A LG + A G A+ EKA+ L+
Sbjct: 146 YEEALELYEKALELDPELN--------ELAEALLALGALLEALGRYEEALELLEKALKLN 197
Query: 695 PN-FLDAYINLGNVLKEARIFDRA 717
P+ +A +NLG + + ++ A
Sbjct: 198 PDDDAEALLNLGLLYLKLGKYEEA 221
Score = 49.1 bits (115), Expect = 4e-06
Identities = 50/145 (34%), Positives = 73/145 (50%), Gaps = 14/145 (9%)
Query: 576 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKP---DFIDGYINLAAALVAA 632
LA+ +P LAEA LG E G +EALE Y A+ L P + + + L A L A
Sbjct: 122 LALDPDPDLAEALLALG-ALYELGDYEEALELYEKALELDPELNELAEALLALGALLEAL 180
Query: 633 GDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVS 692
G E+A++ AL+ NP D A A NLG ++ G+ A+ ++EKA+
Sbjct: 181 GRYEEALELLEKALKLNPD----------DDAEALLNLGLLYLKLGKYEEALEYYEKALE 230
Query: 693 LDPNFLDAYINLGNVLKEARIFDRA 717
LDP+ +A NL +L E ++ A
Sbjct: 231 LDPDNAEALYNLALLLLELGRYEEA 255
Score = 48.3 bits (113), Expect = 9e-06
Identities = 49/187 (26%), Positives = 81/187 (43%), Gaps = 15/187 (8%)
Query: 531 NYVAAARATKYVYNVEPDLTREGGSIPITLTFECRKLDKSAHFSTLAIKQNP---LLAEA 587
Y A + ++PD + + +E +++ A++ +P LAEA
Sbjct: 110 KYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEA 169
Query: 588 YSNLGNVYKERGQLQEALENYRHAVRLKPD-FIDGYINLAAALVAAGDMEQAVQAYVTAL 646
LG + + G+ +EALE A++L PD + +NL + G E+A++
Sbjct: 170 LLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALE------ 223
Query: 647 QYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGN 706
Y KA+E PD A A NL + G A+ EKA+ LDP+ + + L
Sbjct: 224 -----YYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLYNLGLALLL 278
Query: 707 VLKEARI 713
+L EA
Sbjct: 279 LLAEALE 285
Score = 36.4 bits (82), Expect = 0.051
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 225 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLA-AALVAAGD 283
L ++ P LAEA NLG + + G+ +EALE A+ L PD LA AL GD
Sbjct: 86 LELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGD 145
Query: 284 MEQAVQAYVTALQYNPLLKKED 305
E+A++ Y AL+ +P L +
Sbjct: 146 YEEALELYEKALELDPELNELA 167
Score = 30.2 bits (66), Expect = 5.3
Identities = 46/160 (28%), Positives = 70/160 (43%), Gaps = 13/160 (8%)
Query: 566 KLDKSAHFSTLAIKQNPLLAEAYSNLGN-VYKERGQLQEALENYRHAVRLKP--DFIDGY 622
L + A+ + A A G + + G+L EALE A+ L P D
Sbjct: 3 DLLLALAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLL 62
Query: 623 INLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWL 682
+ LA AL+ G +E+A++ AL+ E P+ A A NLG + A G+
Sbjct: 63 LLLALALLKLGRLEEALELLEKALEL---------ELLPNLAEALLNLGLLLEALGKYEE 113
Query: 683 AIHHFEKAVSLDPNFLDAYINLG-NVLKEARIFDRANTLF 721
A+ EKA++LDP+ A L L E ++ A L+
Sbjct: 114 ALELLEKALALDPDPDLAEALLALGALYELGDYEEALELY 153
>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
PilW. Members of this family are designated PilF in ref
(PMID:8973346) and PilW in ref (PMID:15612916). This
outer membrane protein is required both for pilus
stability and for pilus function such as adherence to
human cells. Members of this family contain copies of
the TPR (tetratricopeptide repeat) domain.
Length = 234
Score = 55.8 bits (135), Expect = 2e-08
Identities = 49/221 (22%), Positives = 88/221 (39%), Gaps = 21/221 (9%)
Query: 76 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 135
A+ L Y E+G L+ A EN A+ PD Y+ LA G++E+A ++
Sbjct: 31 AKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRR 90
Query: 136 ALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLHTENIK 195
AL NP+ V ++ G L G+ ++A + +A+ L+ + +
Sbjct: 91 ALTLNPNNGDVLNNYGTFLCQQGKYEQA---------MQQFEQAI-----EDPLYPQPAR 136
Query: 196 PVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALE 255
+ NA +C + ++ + T A++ +P E+ L +Y RGQ ++A
Sbjct: 137 SLE----NAGLCAL--KAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARA 190
Query: 256 NYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
+ + + L + A A Q Y LQ
Sbjct: 191 YLERYQQTYNQTAES-LWLGIRIARALGDVAAAQRYGAQLQ 230
Score = 55.4 bits (134), Expect = 2e-08
Identities = 38/140 (27%), Positives = 58/140 (41%), Gaps = 15/140 (10%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 644
A+ L Y E+G L+ A EN A+ PD Y+ LA G++E+A ++
Sbjct: 31 AKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRR 90
Query: 645 ALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF---LDAY 701
AL N P+ +N G QG+ A+ FE+A+ DP + +
Sbjct: 91 ALTLN-----------PNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIE-DPLYPQPARSL 138
Query: 702 INLGNVLKEARIFDRANTLF 721
N G +A FD+A
Sbjct: 139 ENAGLCALKAGDFDKAEKYL 158
Score = 52.3 bits (126), Expect = 2e-07
Identities = 34/131 (25%), Positives = 60/131 (45%), Gaps = 15/131 (11%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
A++ +P AY L Y++ G+L++A +++R A+ L P+ D N L G E
Sbjct: 57 ALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYE 116
Query: 637 QAVQAYVTALQYNPSCYLKAIETRPDF---AVAWSNLGCVFNAQGEIWLAIHHFEKAVSL 693
QA+Q + +AIE P + A + N G G+ A + +A+ +
Sbjct: 117 QAMQQ-----------FEQAIED-PLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQI 164
Query: 694 DPNFLDAYINL 704
DP ++ + L
Sbjct: 165 DPQRPESLLEL 175
Score = 44.6 bits (106), Expect = 1e-04
Identities = 22/66 (33%), Positives = 32/66 (48%)
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 293
A+ L Y E+G L+ A EN A+ PD Y+ LA G++E+A ++
Sbjct: 31 AKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRR 90
Query: 294 ALQYNP 299
AL NP
Sbjct: 91 ALTLNP 96
Score = 42.3 bits (100), Expect = 5e-04
Identities = 27/98 (27%), Positives = 45/98 (45%), Gaps = 2/98 (2%)
Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 127
A++ +P AY L Y++ G+L++A +++R A+ L P+ D N L G E
Sbjct: 57 ALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYE 116
Query: 128 QAVQAYVTAL--QYNPDLYCVRSDLGNLLKALGRLDEA 163
QA+Q + A+ P + G G D+A
Sbjct: 117 QAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKA 154
Score = 37.7 bits (88), Expect = 0.014
Identities = 23/110 (20%), Positives = 42/110 (38%), Gaps = 36/110 (32%)
Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD----------FI------- 268
A++ +P AY L Y++ G+L++A +++R A+ L P+ F+
Sbjct: 57 ALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYE 116
Query: 269 -------------------DGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
N + AGD ++A + ALQ +P
Sbjct: 117 QAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDP 166
>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 250
Score = 55.1 bits (133), Expect = 3e-08
Identities = 38/145 (26%), Positives = 64/145 (44%), Gaps = 9/145 (6%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
A++ +P A+ + Y++ G+ A E+YR A+ L P+ D N A L A G E
Sbjct: 61 ALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPE 120
Query: 637 QAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPN 696
+A+Q + AL +P+ + NLG G+ A + ++A+ LDP
Sbjct: 121 EAMQQFERALA-DPAYGEPSD--------TLENLGLCALKAGQFDQAEEYLKRALELDPQ 171
Query: 697 FLDAYINLGNVLKEARIFDRANTLF 721
F A + L + +A + A
Sbjct: 172 FPPALLELARLHYKAGDYAPARLYL 196
Score = 45.5 bits (108), Expect = 5e-05
Identities = 48/224 (21%), Positives = 84/224 (37%), Gaps = 27/224 (12%)
Query: 76 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 135
A+A L Y ++G +A +N A+ P + ++ A G+ + A ++Y
Sbjct: 35 AKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRK 94
Query: 136 ALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLHTENIK 195
AL P+ V ++ G L A GR +E A+ + + A
Sbjct: 95 ALSLAPNNGDVLNNYGAFLCAQGRPEE----------------AMQQFERALADPAYGEP 138
Query: 196 PVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEA-- 253
T+ +N +C + E + A++ +P A L ++ + G A
Sbjct: 139 SDTL--ENLGLCALKAGQFDQAE--EYLKRALELDPQFPPALLELARLHYKAGDYAPARL 194
Query: 254 -LENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
LE Y+ + + + I +A L GD A Q Y LQ
Sbjct: 195 YLERYQQRGGAQAESLLLGIRIAKRL---GD-RAAAQRYQAQLQ 234
Score = 39.3 bits (92), Expect = 0.006
Identities = 16/66 (24%), Positives = 29/66 (43%)
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 293
A+A L Y ++G +A +N A+ P + ++ A G+ + A ++Y
Sbjct: 35 AKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRK 94
Query: 294 ALQYNP 299
AL P
Sbjct: 95 ALSLAP 100
Score = 38.2 bits (89), Expect = 0.012
Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 6/104 (5%)
Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 127
A++ +P A+ + Y++ G+ A E+YR A+ L P+ D N A L A G E
Sbjct: 61 ALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPE 120
Query: 128 QAVQAYVTALQYNPDLYCVRSDLGNL----LKALGRLDEAKDLY 167
+A+Q + AL +P L NL LKA G+ D+A++
Sbjct: 121 EAMQQFERALA-DPAYGEPSDTLENLGLCALKA-GQFDQAEEYL 162
Score = 35.5 bits (82), Expect = 0.080
Identities = 35/177 (19%), Positives = 67/177 (37%), Gaps = 4/177 (2%)
Query: 3 DIAQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSA 62
D + L+LA Q GDY A+++ + + + L+ + + + + D +
Sbjct: 30 DRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLAD 89
Query: 63 HFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF---IDGYINLAAA 119
A+ P + +N G +G+ +EA++ + A+ P + D NL
Sbjct: 90 ESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFERAL-ADPAYGEPSDTLENLGLC 148
Query: 120 LVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNL 176
+ AG +QA + AL+ +P +L L G A+ G
Sbjct: 149 ALKAGQFDQAEEYLKRALELDPQFPPALLELARLHYKAGDYAPARLYLERYQQRGGA 205
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 50.1 bits (120), Expect = 6e-08
Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 7/74 (9%)
Query: 72 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLK-------PDFIDGYINLAAALVAAG 124
+P LA A +NL V + G EALE A+ L P+ NLA +A G
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60
Query: 125 DMEQAVQAYVTALQ 138
D ++A++ AL
Sbjct: 61 DYDEALEYLEKALA 74
Score = 50.1 bits (120), Expect = 6e-08
Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 7/74 (9%)
Query: 230 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLK-------PDFIDGYINLAAALVAAG 282
+P LA A +NL V + G EALE A+ L P+ NLA +A G
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60
Query: 283 DMEQAVQAYVTALQ 296
D ++A++ AL
Sbjct: 61 DYDEALEYLEKALA 74
Score = 50.1 bits (120), Expect = 6e-08
Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 7/74 (9%)
Query: 581 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLK-------PDFIDGYINLAAALVAAG 633
+P LA A +NL V + G EALE A+ L P+ NLA +A G
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60
Query: 634 DMEQAVQAYVTALQ 647
D ++A++ AL
Sbjct: 61 DYDEALEYLEKALA 74
Score = 44.7 bits (106), Expect = 6e-06
Identities = 24/81 (29%), Positives = 39/81 (48%), Gaps = 4/81 (4%)
Query: 616 PDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFN 675
PD NLA L GD ++A++ AL+ + E P+ A A +NL ++
Sbjct: 2 PDLAAALNNLALVLRRLGDYDEALELLEKALEL----ARELGEDHPETARALNNLARLYL 57
Query: 676 AQGEIWLAIHHFEKAVSLDPN 696
A G+ A+ + EKA++L
Sbjct: 58 ALGDYDEALEYLEKALALREA 78
Score = 38.1 bits (89), Expect = 0.001
Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 7/72 (9%)
Query: 44 VLLLLSSIHFQCRKLDKSAHFSTLAIKQ-------NPLLAEAYSNLGNVYKERGQLQEAL 96
L L+ + + D++ A++ +P A A +NL +Y G EAL
Sbjct: 7 ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66
Query: 97 ENYRHAVRLKPD 108
E A+ L+
Sbjct: 67 EYLEKALALREA 78
Score = 37.3 bits (87), Expect = 0.002
Identities = 24/98 (24%), Positives = 35/98 (35%), Gaps = 26/98 (26%)
Query: 169 VRSDLGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIK 228
++L +L+ LG DEA L + LE A +
Sbjct: 7 ALNNLALVLRRLGDYDEALELLEK-----------------------ALELAR---ELGE 40
Query: 229 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 266
+P A A +NL +Y G EALE A+ L+
Sbjct: 41 DHPETARALNNLARLYLALGDYDEALEYLEKALALREA 78
Score = 37.0 bits (86), Expect = 0.003
Identities = 13/36 (36%), Positives = 17/36 (47%)
Query: 582 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 617
P A A +NL +Y G EALE A+ L+
Sbjct: 43 PETARALNNLARLYLALGDYDEALEYLEKALALREA 78
Score = 32.7 bits (75), Expect = 0.096
Identities = 19/65 (29%), Positives = 25/65 (38%), Gaps = 7/65 (10%)
Query: 660 RPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSL-------DPNFLDAYINLGNVLKEAR 712
PD A A +NL V G+ A+ EKA+ L P A NL +
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60
Query: 713 IFDRA 717
+D A
Sbjct: 61 DYDEA 65
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 55.9 bits (135), Expect = 7e-08
Identities = 33/153 (21%), Positives = 55/153 (35%), Gaps = 10/153 (6%)
Query: 15 AHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQNPL 74
A + Y AG Y+ A + L + +N L L I + K ++ A+ +P
Sbjct: 313 ALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPN 372
Query: 75 LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYV 134
NL + G+ QEA+ + P+ +G+ LA A G+ +A+ A
Sbjct: 373 SPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLARA 432
Query: 135 TALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
G L +A+ L A
Sbjct: 433 EGYALA----------GRLEQAIIFLMRASQQV 455
Score = 42.4 bits (100), Expect = 0.001
Identities = 26/104 (25%), Positives = 42/104 (40%), Gaps = 14/104 (13%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
A NP E G++ E + +EA+E + A+ L P+ +NLA AL+ G +
Sbjct: 335 AQPDNPYYLEL---AGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQ 391
Query: 637 QAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEI 680
+A++ N YL P+ W L + G
Sbjct: 392 EAIR------ILNR--YLFND---PEDPNGWDLLAQAYAELGNR 424
Score = 40.1 bits (94), Expect = 0.004
Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 13/88 (14%)
Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 285
A NP E G++ E + +EA+E + A+ L P+ +NLA AL+ G +
Sbjct: 335 AQPDNPYYLEL---AGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQ 391
Query: 286 QAVQAYVTALQYNPLLKKE----DGWNT 309
+A++ N L + +GW+
Sbjct: 392 EAIR------ILNRYLFNDPEDPNGWDL 413
Score = 37.8 bits (88), Expect = 0.023
Identities = 16/73 (21%), Positives = 28/73 (38%)
Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 285
A+ +P NL + G+ QEA+ + P+ +G+ LA A G+
Sbjct: 366 ALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRA 425
Query: 286 QAVQAYVTALQYN 298
+A+ A
Sbjct: 426 EALLARAEGYALA 438
Score = 37.8 bits (88), Expect = 0.023
Identities = 16/73 (21%), Positives = 28/73 (38%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
A+ +P NL + G+ QEA+ + P+ +G+ LA A G+
Sbjct: 366 ALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRA 425
Query: 637 QAVQAYVTALQYN 649
+A+ A
Sbjct: 426 EALLARAEGYALA 438
>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
repeats [Intracellular trafficking and secretion].
Length = 257
Score = 49.0 bits (117), Expect = 3e-06
Identities = 41/191 (21%), Positives = 70/191 (36%), Gaps = 13/191 (6%)
Query: 12 LELAHREYQAGDYESAERHCMQLWRQETNNTGVLLL--LSSIHFQCRKLDKSAHFSTLAI 69
+ + + L N L + L++ + D S +
Sbjct: 34 AKATGAPESSLAMRQTQGAAAALGAAVLRNPEDLSIAKLATALYLRGDADSSLAVLQKSA 93
Query: 70 KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 129
P E + G G EA+ R A RL P + + L AAL G ++A
Sbjct: 94 IAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEA 153
Query: 130 VQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY-----------CVRSDLGNLLK 178
+AY AL+ P+ + ++LG L G L++A+ L VR +L ++
Sbjct: 154 RRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALVVG 213
Query: 179 ALGRLDEAKNL 189
G EA+++
Sbjct: 214 LQGDFREAEDI 224
Score = 40.5 bits (95), Expect = 0.003
Identities = 42/211 (19%), Positives = 63/211 (29%), Gaps = 45/211 (21%)
Query: 88 ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVR 147
Q Q A AV P+ + LA AL GD + ++ + P +
Sbjct: 45 AMRQTQGAAAALGAAVLRNPEDLSIA-KLATALYLRGDADSSLAVLQKSAIAYPKDRELL 103
Query: 148 SDLGNLLKALGRLDEAKDLYC-----------VRSDLGNLLKALGRLDEAKNLHTENIKP 196
+ G G EA + + LG L LGR DEA+ + + ++
Sbjct: 104 AAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALE- 162
Query: 197 VTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALEN 256
LA + +NLG RG L++A
Sbjct: 163 ----------------------------LAPNE----PSIANNLGMSLLLRGDLEDAETL 190
Query: 257 YRHAVRLKPDFIDGYINLAAALVAAGDMEQA 287
A NLA + GD +A
Sbjct: 191 LLPAYLSPAADSRVRQNLALVVGLQGDFREA 221
Score = 38.6 bits (90), Expect = 0.009
Identities = 31/123 (25%), Positives = 44/123 (35%), Gaps = 11/123 (8%)
Query: 598 RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAI 657
G EA+ R A RL P + + L AAL G ++A +AY AL+ L
Sbjct: 113 NGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALE------LAPN 166
Query: 658 ETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRA 717
E +NLG +G++ A A NL V+ F A
Sbjct: 167 E-----PSIANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALVVGLQGDFREA 221
Query: 718 NTL 720
+
Sbjct: 222 EDI 224
Score = 38.6 bits (90), Expect = 0.009
Identities = 19/53 (35%), Positives = 26/53 (49%)
Query: 247 RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
G EA+ R A RL P + + L AAL G ++A +AY AL+ P
Sbjct: 113 NGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAP 165
Score = 37.8 bits (88), Expect = 0.016
Identities = 28/125 (22%), Positives = 41/125 (32%), Gaps = 12/125 (9%)
Query: 597 ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKA 656
Q Q A AV P+ + LA AL GD + ++ + P
Sbjct: 45 AMRQTQGAAAALGAAVLRNPEDLSIA-KLATALYLRGDADSSLAVLQKSAIAYPK----- 98
Query: 657 IETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDR 716
D + + G G A+ KA L P +A+ LG L + FD
Sbjct: 99 -----DRELL-AAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDE 152
Query: 717 ANTLF 721
A +
Sbjct: 153 ARRAY 157
Score = 37.4 bits (87), Expect = 0.021
Identities = 31/156 (19%), Positives = 52/156 (33%), Gaps = 34/156 (21%)
Query: 11 LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIK 70
L + + G++ A + R +
Sbjct: 103 LAAQGKNQIRNGNFGEAVSVLRKAARLAPTD----------------------------- 133
Query: 71 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 130
EA++ LG + G+ EA YR A+ L P+ NL +L+ GD+E A
Sbjct: 134 -----WEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAE 188
Query: 131 QAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDL 166
+ A VR +L ++ G EA+D+
Sbjct: 189 TLLLPAYLSPAADSRVRQNLALVVGLQGDFREAEDI 224
Score = 32.4 bits (74), Expect = 0.93
Identities = 19/55 (34%), Positives = 28/55 (50%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 639
EA++ LG + G+ EA YR A+ L P+ NL +L+ GD+E A
Sbjct: 134 WEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAE 188
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 43.9 bits (105), Expect = 4e-06
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 76 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 109
A+A NLGN Y + G+ EALE Y A+ L P+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 43.9 bits (105), Expect = 4e-06
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 267
A+A NLGN Y + G+ EALE Y A+ L P+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 43.9 bits (105), Expect = 4e-06
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 618
A+A NLGN Y + G+ EALE Y A+ L P+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 40.5 bits (96), Expect = 7e-05
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 664 AVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF 697
A A NLG + G+ A+ ++EKA+ L+PN
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 32.8 bits (76), Expect = 0.031
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 110 IDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
NL A + G ++A++ Y AL+ NP+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 32.4 bits (75), Expect = 0.045
Identities = 10/34 (29%), Positives = 17/34 (50%)
Query: 619 IDGYINLAAALVAAGDMEQAVQAYVTALQYNPSC 652
NL A + G ++A++ Y AL+ NP+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 31.6 bits (73), Expect = 0.082
Identities = 10/32 (31%), Positives = 16/32 (50%)
Query: 268 IDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
NL A + G ++A++ Y AL+ NP
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNP 32
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 42.8 bits (102), Expect = 1e-05
Identities = 17/34 (50%), Positives = 20/34 (58%)
Query: 76 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 109
AEA NLGN Y + G EALE Y A+ L P+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 42.8 bits (102), Expect = 1e-05
Identities = 17/34 (50%), Positives = 20/34 (58%)
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 267
AEA NLGN Y + G EALE Y A+ L P+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 42.8 bits (102), Expect = 1e-05
Identities = 17/34 (50%), Positives = 20/34 (58%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 618
AEA NLGN Y + G EALE Y A+ L P+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 39.0 bits (92), Expect = 2e-04
Identities = 14/34 (41%), Positives = 20/34 (58%)
Query: 664 AVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF 697
A A NLG + G+ A+ ++EKA+ LDPN
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 31.3 bits (72), Expect = 0.096
Identities = 10/33 (30%), Positives = 19/33 (57%)
Query: 110 IDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
+ NL A + GD ++A++ Y AL+ +P+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
Score = 31.3 bits (72), Expect = 0.097
Identities = 10/34 (29%), Positives = 19/34 (55%)
Query: 619 IDGYINLAAALVAAGDMEQAVQAYVTALQYNPSC 652
+ NL A + GD ++A++ Y AL+ +P+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 30.5 bits (70), Expect = 0.19
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 268 IDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
+ NL A + GD ++A++ Y AL+ +P
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDP 32
>gnl|CDD|193559 cd08011, M20_ArgE_DapE_like6, M20 Peptidases with similarity to
acetylornithine deacetylases and
succinyl-diaminopimelate desuccinylases. Peptidase M20
family, Uncharacterized protein subfamily with
similarity to acetylornithine
deacetylase/succinyl-diaminopimelate desuccinylase
(ArgE/DapE) subfamily. This group includes the
hypothetical protein ygeY from Escherichia coli, a
putative deacetylase, but many in this subfamily are
classified as unassigned peptidases. ArgE/DapE enzymes
catalyze analogous reactions and share a common
activator, the metal ion (usually Co2+ or Zn2+). ArgE
catalyzes a broad range of substrates, including
N-acetylornithine, alpha-N-acetylmethionine and
alpha-N-formylmethionine, while DapE catalyzes the
hydrolysis of N-succinyl-L,L-diaminopimelate (L,L-SDAP)
to L,L-diaminopimelate and succinate. Proteins in this
subfamily are mostly archaeal, and have been inferred by
homology as being related to both ArgE and DapE.
Length = 390
Score = 46.6 bits (111), Expect = 4e-05
Identities = 61/266 (22%), Positives = 105/266 (39%), Gaps = 34/266 (12%)
Query: 305 DGWNTEPFVLDFERRKALWRRGALVMMKRPVLGLASAIESISA-NQEKCAVNIKIECA-S 362
DGW+ +PF + ++ RGA MK G+A+AI ++ A + N +E A
Sbjct: 87 DGWSVDPFKP-VVKDGRIYGRGA-SDMKG---GIAAAIAALEALAEAGVPPNGPVEAAFV 141
Query: 363 KDLHSGLFGGSVHEAMTDLIYIMGQLV---EPNGK--ILIPH---IYKDVEPLGDTE--- 411
D SG G+ + + + I V EP+G I I H ++ +V G
Sbjct: 142 PDEESGGETGTGY--LVEEGGIRPDYVIIAEPSGLDNIWIGHRGIVWGEVRVKGKQAHGS 199
Query: 412 -----EQFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWR--YPSLSLHGIEGAFSGP 464
+EK + + A+ + K++ R P+++L G G
Sbjct: 200 TPWLGINAFEKASVIALELQEAL--KPRLSSRKSKYEYEDERTANPTVTLGGEAE---GT 254
Query: 465 GGKTVIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWR 524
G ++PG R++P + + V K ++D L + ++F+ L +
Sbjct: 255 GKTNIVPGYFRFSIDRRLIPEENLEEVRKEIVDLLERAEEGI--KHRFEVKSLMIVSAEF 312
Query: 525 TNPEHPNYVAAARATKYVYNVEPDLT 550
T P+ A A + V VEP T
Sbjct: 313 TPPDSSVVEALREAIREVLGVEPRKT 338
>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3;
Provisional.
Length = 172
Score = 43.9 bits (104), Expect = 8e-05
Identities = 39/130 (30%), Positives = 55/130 (42%), Gaps = 35/130 (26%)
Query: 599 GQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYN---PSC 652
G+ EALENY A++L+ D D YI N+ + G+ ++A++ Y AL+ N PS
Sbjct: 49 GEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSA 108
Query: 653 YLKAI-------------ETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLD 699
L I D A A F+ E W ++A+ L PN
Sbjct: 109 -LNNIAVIYHKRGEKAEEAGDQDEAEAL------FDKAAEYW------KQAIRLAPN--- 152
Query: 700 AYINLGNVLK 709
YI N LK
Sbjct: 153 NYIEAQNWLK 162
Score = 41.6 bits (98), Expect = 4e-04
Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 10/88 (11%)
Query: 90 GQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYNPDLYCV 146
G+ EALENY A++L+ D D YI N+ + G+ ++A++ Y AL+ NP
Sbjct: 49 GEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSA 108
Query: 147 RSDLGNLLKALGR-------LDEAKDLY 167
+++ + G DEA+ L+
Sbjct: 109 LNNIAVIYHKRGEKAEEAGDQDEAEALF 136
Score = 40.8 bits (96), Expect = 0.001
Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 248 GQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYNP 299
G+ EALENY A++L+ D D YI N+ + G+ ++A++ Y AL+ NP
Sbjct: 49 GEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNP 103
>gnl|CDD|183140 PRK11447, PRK11447, cellulose synthase subunit BcsC; Provisional.
Length = 1157
Score = 46.2 bits (110), Expect = 9e-05
Identities = 55/253 (21%), Positives = 92/253 (36%), Gaps = 42/253 (16%)
Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDG-----------Y--- 113
A++ NP +EA LG Y ++G A+ + A+ L P + Y
Sbjct: 295 AVRANPKDSEALGALGQAYSQQGDRARAVAQFEKALALDPHSSNRDKWESLLKVNRYWLL 354
Query: 114 INLAAALVAAGDMEQAVQAYVTALQYNP-DLYCVRSDLGNLLKALGRLDEAKDLY--CVR 170
I A + A ++ QA + Y A Q + D Y V LG++ A A+ Y +R
Sbjct: 355 IQQGDAALKANNLAQAERLYQQARQVDNTDSYAVLG-LGDVAMARKDYAAAERYYQQALR 413
Query: 171 SDLGN--LLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIK 228
D GN ++ L L ++ K I +L ++ ++
Sbjct: 414 MDPGNTNAVRGLANLYRQQS---------PEKALAFI---------ASLSASQRRSIDDI 455
Query: 229 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN--LAAALVAAGDMEQ 286
+ L + + + +G+ +A E R + L P ++ LA L AG Q
Sbjct: 456 ERSLQNDRLAQQAEALENQGKWAQAAELQRQRLALDPG--SVWLTYRLAQDLRQAGQRSQ 513
Query: 287 AVQAYVTALQYNP 299
A Q P
Sbjct: 514 ADALMRRLAQQKP 526
Score = 44.7 bits (106), Expect = 2e-04
Identities = 64/297 (21%), Positives = 110/297 (37%), Gaps = 35/297 (11%)
Query: 5 AQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDK-SAH 63
A LG+G + +A DY +AER+ Q R + NT + L+++ ++ + +K A
Sbjct: 387 AVLGLGDVAMAR-----KDYAAAERYYQQALRMDPGNTNAVRGLANL-YRQQSPEKALAF 440
Query: 64 FSTLAIKQ--------NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 115
++L+ Q L + + + +G+ +A E R + L P ++
Sbjct: 441 IASLSASQRRSIDDIERSLQNDRLAQQAEALENQGKWAQAAELQRQRLALDPG--SVWLT 498
Query: 116 --LAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDL 173
LA L AG QA Q P+ G L R A
Sbjct: 499 YRLAQDLRQAGQRSQADALMRRLAQQKPNDPEQVYAYGLYLSGSDRDRAA---------- 548
Query: 174 GNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLL 233
L L L A+ NI+ + ++Q+ V R + + A L ++Q P
Sbjct: 549 ---LAHLNTLPRAQ--WNSNIQELAQRLQSDQVLETANRLRDSGKEAEAEAL-LRQQPPS 602
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQA 290
L + ++RG A Y+ + +P D + L +A GD+ A
Sbjct: 603 TRIDLTLADWAQQRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQGDLAAARAQ 659
Score = 34.3 bits (79), Expect = 0.34
Identities = 30/126 (23%), Positives = 53/126 (42%), Gaps = 11/126 (8%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF--IDGYINLAA-----AL 629
A++ NP +EA LG Y ++G A+ + A+ L P D + +L L
Sbjct: 295 AVRANPKDSEALGALGQAYSQQGDRARAVAQFEKALALDPHSSNRDKWESLLKVNRYWLL 354
Query: 630 VAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEK 689
+ GD A + Y +A + + A LG V A+ + A ++++
Sbjct: 355 IQQGDAALKANNLAQAERL----YQQARQVDNTDSYAVLGLGDVAMARKDYAAAERYYQQ 410
Query: 690 AVSLDP 695
A+ +DP
Sbjct: 411 ALRMDP 416
Score = 33.9 bits (78), Expect = 0.51
Identities = 18/81 (22%), Positives = 28/81 (34%), Gaps = 2/81 (2%)
Query: 88 ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQY-NPDLYCV 146
+RG A Y+ + +P D + L +A GD+ A N L
Sbjct: 615 QRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQGDLAAARAQLAKLPATANDSLNTQ 674
Query: 147 RSDLGNLLKALGRLDEAKDLY 167
R + ALG A+ +
Sbjct: 675 RR-VALAWAALGDTAAAQRTF 694
Score = 31.2 bits (71), Expect = 3.3
Identities = 15/56 (26%), Positives = 21/56 (37%), Gaps = 4/56 (7%)
Query: 597 ERGQLQEALENYRHAVRLKPDFIDGYIN--LAAALVAAGDMEQAVQAYVTALQYNP 650
+G+ +A E R + L P ++ LA L AG QA Q P
Sbjct: 473 NQGKWAQAAELQRQRLALDPG--SVWLTYRLAQDLRQAGQRSQADALMRRLAQQKP 526
Score = 30.8 bits (70), Expect = 4.4
Identities = 19/75 (25%), Positives = 32/75 (42%), Gaps = 4/75 (5%)
Query: 597 ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKA 656
+RG A Y+ + +P D + L +A GD+ A L P+ +
Sbjct: 615 QRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQGDLAAARA----QLAKLPATANDS 670
Query: 657 IETRPDFAVAWSNLG 671
+ T+ A+AW+ LG
Sbjct: 671 LNTQRRVALAWAALG 685
>gnl|CDD|223697 COG0624, ArgE, Acetylornithine deacetylase/Succinyl-diaminopimelate
desuccinylase and related deacylases [Amino acid
transport and metabolism].
Length = 409
Score = 44.8 bits (106), Expect = 1e-04
Identities = 20/150 (13%), Positives = 39/150 (26%), Gaps = 43/150 (28%)
Query: 354 VNIKIECASKDLHSG-LFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEE 412
+ +++ K H+ + I + +L+E G +
Sbjct: 198 LWLEVTVKGKAGHASTTPPDLGRNPIHAAIEALAELIEELGDLAGEGFD----------- 246
Query: 413 QFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPG 472
L L L G + +G VIPG
Sbjct: 247 -----------------GPLGLN--------------VGLILAGPGASVNGGDKVNVIPG 275
Query: 473 KVVGKFSIRIVPNQTPQCVEKYVLDYLNEL 502
+ IR++P + V + + L +
Sbjct: 276 EAEATVDIRLLPGEDLDDVLEELEAELRAI 305
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 39.0 bits (92), Expect = 2e-04
Identities = 16/34 (47%), Positives = 20/34 (58%)
Query: 76 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 109
AEA NLG Y + G +EALE Y A+ L P+
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
Score = 39.0 bits (92), Expect = 2e-04
Identities = 16/34 (47%), Positives = 20/34 (58%)
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 267
AEA NLG Y + G +EALE Y A+ L P+
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
Score = 39.0 bits (92), Expect = 2e-04
Identities = 16/34 (47%), Positives = 20/34 (58%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 618
AEA NLG Y + G +EALE Y A+ L P+
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
Score = 30.2 bits (69), Expect = 0.30
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 111 DGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
+ NL A GD E+A++AY AL+ +P+
Sbjct: 2 EALYNLGLAYYKLGDYEEALEAYEKALELDPN 33
Score = 29.0 bits (66), Expect = 0.64
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 269 DGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
+ NL A GD E+A++AY AL+ +P
Sbjct: 2 EALYNLGLAYYKLGDYEEALEAYEKALELDP 32
Score = 29.0 bits (66), Expect = 0.64
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 620 DGYINLAAALVAAGDMEQAVQAYVTALQYNP 650
+ NL A GD E+A++AY AL+ +P
Sbjct: 2 EALYNLGLAYYKLGDYEEALEAYEKALELDP 32
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 44.6 bits (105), Expect = 2e-04
Identities = 49/230 (21%), Positives = 93/230 (40%), Gaps = 28/230 (12%)
Query: 70 KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 129
K A A + G +G+ EAL + ++ L P YI A+ + GD ++A
Sbjct: 325 KLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKA 384
Query: 130 VQAYVTALQYN---PDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEA 186
+ + AL+ N PD+Y R+ L + +K G +A Y ++
Sbjct: 385 EEDFDKALKLNSEDPDIYYHRAQL-HFIK--GEFAQAGKDY----------------QKS 425
Query: 187 KNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKE 246
+L + I +Q + G +++ A F K P + Y+ G + +
Sbjct: 426 IDLDPDFIFS---HIQLGVTQYKEGSIASSM--ATFRR-CKKNFPEAPDVYNYYGELLLD 479
Query: 247 RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
+ + EA+E + A+ L+ + Y+N+ + A + Q Q ++ A
Sbjct: 480 QNKFDEAIEKFDTAIELEKETKPMYMNVLPLINKALALFQWKQDFIEAEN 529
Score = 41.5 bits (97), Expect = 0.002
Identities = 34/143 (23%), Positives = 60/143 (41%), Gaps = 11/143 (7%)
Query: 579 KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 638
K A A + G +G+ EAL + ++ L P YI A+ + GD ++A
Sbjct: 325 KLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKA 384
Query: 639 VQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFL 698
+ + AL+ N PD + L + +GE A ++K++ LDP+F+
Sbjct: 385 EEDFDKALKLNSE--------DPDIYYHRAQLHFI---KGEFAQAGKDYQKSIDLDPDFI 433
Query: 699 DAYINLGNVLKEARIFDRANTLF 721
++I LG + + F
Sbjct: 434 FSHIQLGVTQYKEGSIASSMATF 456
Score = 40.7 bits (95), Expect = 0.003
Identities = 26/88 (29%), Positives = 38/88 (43%), Gaps = 1/88 (1%)
Query: 76 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 135
A GN +A++ Y A+ KPD + Y N AA A GD E+ V+
Sbjct: 127 AAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPV-YYSNRAACHNALGDWEKVVEDTTA 185
Query: 136 ALQYNPDLYCVRSDLGNLLKALGRLDEA 163
AL+ +PD + N LG+ +A
Sbjct: 186 ALELDPDYSKALNRRANAYDGLGKYADA 213
Score = 40.0 bits (93), Expect = 0.005
Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 5/87 (5%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 644
A GN +A++ Y A+ KPD + Y N AA A GD E+ V+
Sbjct: 127 AAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPV-YYSNRAACHNALGDWEKVVEDTTA 185
Query: 645 ALQYNPSCYLKAIETRPDFAVAWSNLG 671
AL+ +P Y KA+ R A A+ LG
Sbjct: 186 ALELDPD-YSKALNRR---ANAYDGLG 208
Score = 39.6 bits (92), Expect = 0.008
Identities = 45/190 (23%), Positives = 81/190 (42%), Gaps = 13/190 (6%)
Query: 530 PNYVAAARATKYVYNVEPDLTREGGSIPITLTFECRKLDKSAHFSTL--AIKQNPLLAEA 587
+Y AARA + ++ +E ++ + TF+C K + L +I+ +P + ++
Sbjct: 308 ESYEEAARAFEKALDLGKLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQS 367
Query: 588 YSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 647
Y ++ E G +A E++ A++L + D Y + A G+ QA + Y
Sbjct: 368 YIKRASMNLELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQ---- 423
Query: 648 YNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNV 707
K+I+ PDF + LG +G I ++ F + P D Y G +
Sbjct: 424 -------KSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGEL 476
Query: 708 LKEARIFDRA 717
L + FD A
Sbjct: 477 LLDQNKFDEA 486
Score = 36.9 bits (85), Expect = 0.056
Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 1/66 (1%)
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 293
A GN +A++ Y A+ KPD + Y N AA A GD E+ V+
Sbjct: 127 AAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPV-YYSNRAACHNALGDWEKVVEDTTA 185
Query: 294 ALQYNP 299
AL+ +P
Sbjct: 186 ALELDP 191
Score = 35.0 bits (80), Expect = 0.18
Identities = 27/122 (22%), Positives = 54/122 (44%)
Query: 59 DKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAA 118
DK+ A+K N + Y + ++ +G+ +A ++Y+ ++ L PDFI +I L
Sbjct: 382 DKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGV 441
Query: 119 ALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLK 178
G + ++ + + P+ V + G LL + DEA + + +L K
Sbjct: 442 TQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEKETK 501
Query: 179 AL 180
+
Sbjct: 502 PM 503
Score = 35.0 bits (80), Expect = 0.20
Identities = 27/134 (20%), Positives = 57/134 (42%), Gaps = 4/134 (2%)
Query: 560 LTFECRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI 619
++ + S K P + Y+ G + ++ + EA+E + A+ L+ +
Sbjct: 442 TQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEKETK 501
Query: 620 DGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGE 679
Y+N+ + A + Q Q ++ A KA+ P+ +A + + + QG+
Sbjct: 502 PMYMNVLPLINKALALFQWKQDFIEAENLCE----KALIIDPECDIAVATMAQLLLQQGD 557
Query: 680 IWLAIHHFEKAVSL 693
+ A+ FE+A L
Sbjct: 558 VDEALKLFERAAEL 571
Score = 33.0 bits (75), Expect = 0.75
Identities = 30/151 (19%), Positives = 63/151 (41%), Gaps = 11/151 (7%)
Query: 48 LSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKP 107
L ++ + S K P + Y+ G + ++ + EA+E + A+ L+
Sbjct: 439 LGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEK 498
Query: 108 DFIDGYINLAAALVAAGDMEQAVQAYV-------TALQYNPDLYCVRSDLGNLLKALGRL 160
+ Y+N+ + A + Q Q ++ AL +P+ + + LL G +
Sbjct: 499 ETKPMYMNVLPLINKALALFQWKQDFIEAENLCEKALIIDPECDIAVATMAQLLLQQGDV 558
Query: 161 DEAKDLYCVRSDL----GNLLKALGRLDEAK 187
DEA L+ ++L G L++A+ + +
Sbjct: 559 DEALKLFERAAELARTEGELVQAISYAEATR 589
Score = 33.0 bits (75), Expect = 0.81
Identities = 54/267 (20%), Positives = 101/267 (37%), Gaps = 28/267 (10%)
Query: 53 FQCRKLDKSAHFSTLAI--KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI 110
++ + +K+ + AI K +P+ YSN + G ++ +E+ A+ L PD+
Sbjct: 138 YRNKDFNKAIKLYSKAIECKPDPVY---YSNRAACHNALGDWEKVVEDTTAALELDPDYS 194
Query: 111 DGYINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSD--LGNLLKALGRLDEAKDLYC 168
A A G A+ +TA +S + LLK +AK++
Sbjct: 195 KALNRRANAYDGLGKYADALLD-LTASCIIDGFRNEQSAQAVERLLKKFA-ESKAKEILE 252
Query: 169 VR-------SDLGNLLKAL------GRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPT 215
+ + +GN L++ L+++ L E + +
Sbjct: 253 TKPENLPSVTFVGNYLQSFRPKPRPAGLEDSNELDEETGNGQLQLGLKSPESKAD---ES 309
Query: 216 TLESAHFSTLAIKQNPL---LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYI 272
E+A A+ L A A + G +G+ EAL + ++ L P YI
Sbjct: 310 YEEAARAFEKALDLGKLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYI 369
Query: 273 NLAAALVAAGDMEQAVQAYVTALQYNP 299
A+ + GD ++A + + AL+ N
Sbjct: 370 KRASMNLELGDPDKAEEDFDKALKLNS 396
Score = 32.7 bits (74), Expect = 1.2
Identities = 27/124 (21%), Positives = 53/124 (42%), Gaps = 18/124 (14%)
Query: 568 DKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAA 627
DK+ A+K N + Y + ++ +G+ +A ++Y+ ++ L PDFI +I L
Sbjct: 382 DKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGV 441
Query: 628 ALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHF 687
G + ++ + + P PD V+N GE+ L + F
Sbjct: 442 TQYKEGSIASSMATFRRCKKNFPEA--------PD----------VYNYYGELLLDQNKF 483
Query: 688 EKAV 691
++A+
Sbjct: 484 DEAI 487
Score = 31.9 bits (72), Expect = 1.8
Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 5/75 (6%)
Query: 626 AAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIH 685
AA L G+ + + A++ Y KAIE +PD V +SN NA G+ +
Sbjct: 127 AAKLKEKGNKAYRNKDFNKAIKL----YSKAIECKPD-PVYYSNRAACHNALGDWEKVVE 181
Query: 686 HFEKAVSLDPNFLDA 700
A+ LDP++ A
Sbjct: 182 DTTAALELDPDYSKA 196
>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat.
Length = 34
Score = 37.8 bits (89), Expect = 5e-04
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 76 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 109
A AY LG +Y + G +EA E Y A+ L P+
Sbjct: 1 ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34
Score = 37.8 bits (89), Expect = 5e-04
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 267
A AY LG +Y + G +EA E Y A+ L P+
Sbjct: 1 ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34
Score = 37.8 bits (89), Expect = 5e-04
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 618
A AY LG +Y + G +EA E Y A+ L P+
Sbjct: 1 ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34
Score = 27.1 bits (61), Expect = 3.4
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 666 AWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF 697
A+ LG ++ G+ A ++EKA+ LDPN
Sbjct: 3 AYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34
Score = 26.7 bits (60), Expect = 4.8
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 113 YINLAAALVAAGDMEQAVQAYVTALQYNPD 142
Y L + GD E+A + Y AL+ +P+
Sbjct: 4 YYLLGQIYLQLGDYEEAKEYYEKALELDPN 33
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 38.0 bits (89), Expect = 9e-04
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 108
A+ + PL AEA LG +G+L EA R A+ PD
Sbjct: 23 ALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63
Score = 38.0 bits (89), Expect = 9e-04
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 266
A+ + PL AEA LG +G+L EA R A+ PD
Sbjct: 23 ALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63
Score = 38.0 bits (89), Expect = 9e-04
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 617
A+ + PL AEA LG +G+L EA R A+ PD
Sbjct: 23 ALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63
Score = 36.9 bits (86), Expect = 0.002
Identities = 16/61 (26%), Positives = 23/61 (37%)
Query: 82 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 141
L G EAL A+ P + + L AL+ G + +A AL +P
Sbjct: 3 LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADP 62
Query: 142 D 142
D
Sbjct: 63 D 63
Score = 36.9 bits (86), Expect = 0.003
Identities = 14/57 (24%), Positives = 20/57 (35%)
Query: 240 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
L G EAL A+ P + + L AL+ G + +A AL
Sbjct: 3 LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALA 59
Score = 36.9 bits (86), Expect = 0.003
Identities = 14/57 (24%), Positives = 20/57 (35%)
Query: 591 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 647
L G EAL A+ P + + L AL+ G + +A AL
Sbjct: 3 LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALA 59
Score = 36.1 bits (84), Expect = 0.004
Identities = 20/52 (38%), Positives = 24/52 (46%)
Query: 116 LAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
LA A + AGD ++A+ A AL P LG L GRL EA L
Sbjct: 3 LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALL 54
Score = 36.1 bits (84), Expect = 0.004
Identities = 21/72 (29%), Positives = 29/72 (40%), Gaps = 11/72 (15%)
Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAI 684
LA A + AGD ++A+ A AL P A A LG QG + A
Sbjct: 3 LARAALRAGDYDEALAALEAALARYP-----------LAAEALLLLGEALLRQGRLAEAA 51
Query: 685 HHFEKAVSLDPN 696
A++ DP+
Sbjct: 52 ALLRAALAADPD 63
Score = 27.3 bits (61), Expect = 7.1
Identities = 10/39 (25%), Positives = 15/39 (38%)
Query: 683 AIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRANTLF 721
A+ E A++ P +A + LG L A L
Sbjct: 16 ALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALL 54
>gnl|CDD|219515 pfam07687, M20_dimer, Peptidase dimerisation domain. This domain
consists of 4 beta strands and two alpha helices which
make up the dimerisation surface of members of the M20
family of peptidases. This family includes a range of
zinc metallopeptidases belonging to several families in
the peptidase classification. Family M20 are Glutamate
carboxypeptidases. Peptidase family M25 contains X-His
dipeptidases.
Length = 107
Score = 38.9 bits (91), Expect = 0.001
Identities = 15/56 (26%), Positives = 30/56 (53%), Gaps = 4/56 (7%)
Query: 447 WRYPSLSLHGIEGAFSGPGGKTVIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNEL 502
+ +L++ GIEG + + VIP + KF IR++P + + + K + + L +
Sbjct: 51 FPRTTLNITGIEGGTA----RNVIPAEAEAKFDIRLLPGEDLEELLKEIEEILEKE 102
>gnl|CDD|193557 cd05682, M20_dipept_like_5, Uncharacterized M20 Dipeptidases.
Peptidase M20 family, unknown dipeptidase-like subfamily
3 (inferred by homology to be dipeptidases). M20
dipeptidases include a large variety of bacterial
enzymes including cytosolic nonspecific dipeptidase
(CNDP), Xaa-methyl-His dipeptidase (anserinase),and
canosinase. These dipeptidases have been shown to act on
a wide range of dipeptides, but not larger peptides. For
example, anserinase mainly catalyzes the hydrolysis of
N-alpha-acetylhistidine while carnosinase degrades
beta-alanyl-L-histidine.
Length = 451
Score = 41.5 bits (98), Expect = 0.002
Identities = 53/228 (23%), Positives = 91/228 (39%), Gaps = 44/228 (19%)
Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVE-----PNGKIL-------IP- 398
++ ++ + +HSG G V + I+ QL++ G+IL IP
Sbjct: 194 LGGDLTVQVLDEGVHSGDASGIVPSSFR----ILRQLLDRIEDPETGEILPDELHCEIPA 249
Query: 399 HIYKD----VEPLGDTEEQFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSR-WRYPSLS 453
H Y+ E LG+ +++ F + + D ++L++R WR P+LS
Sbjct: 250 HRYEQAKKAAEILGEP---VFDRFPF--------LSGVRPVTTDPVELLLNRTWR-PALS 297
Query: 454 LHGIEGAFSGPGGKTVIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPN--K 511
+ G +G S V+ + K S+R+ P E L +L + P K
Sbjct: 298 VTGADGLPSVATAGNVLRPETTLKLSLRLPPT---VDAEA-ASAALKKLLTT-DPPYGAK 352
Query: 512 FKAYLLDSGKSWRTNPEHPNYVAAA--RATKYVYNVEPDLTREGGSIP 557
+G W P ++A A +A++ + P EGGSIP
Sbjct: 353 VTFEPDGAGSGWNAPALAP-WLAKALNKASQDFFGKPPAYMGEGGSIP 399
>gnl|CDD|176974 CHL00033, ycf3, photosystem I assembly protein Ycf3.
Length = 168
Score = 39.6 bits (93), Expect = 0.002
Identities = 36/127 (28%), Positives = 57/127 (44%), Gaps = 27/127 (21%)
Query: 598 RGQLQEALENYRHAVRLKPDFID-GYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKA 656
G+ EAL+NY A+RL+ D D YI L+ + E + AL+Y Y +A
Sbjct: 48 EGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGE-----HTKALEY----YFQA 98
Query: 657 IETRPDFAVAWSNLGCVFNAQGE------------IWL--AIHHFEKAVSLDPNFLDAYI 702
+E P A +N+ + + +GE W A ++++A++L P YI
Sbjct: 99 LERNPFLPQALNNMAVICHYRGEQAIEQGDSEIAEAWFDQAAEYWKQAIALAPG---NYI 155
Query: 703 NLGNVLK 709
N LK
Sbjct: 156 EAQNWLK 162
Score = 37.3 bits (87), Expect = 0.013
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 247 RGQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYNPLL 301
G+ EAL+NY A+RL+ D D YI N+ + G+ +A++ Y AL+ NP L
Sbjct: 48 EGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFL 105
Score = 36.5 bits (85), Expect = 0.020
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 89 RGQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYNPDL 143
G+ EAL+NY A+RL+ D D YI N+ + G+ +A++ Y AL+ NP L
Sbjct: 48 EGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFL 105
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 36.0 bits (84), Expect = 0.002
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEALE 97
A++ +P AEAY NL + GQ EAL+
Sbjct: 5 ALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 36.0 bits (84), Expect = 0.002
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALE 255
A++ +P AEAY NL + GQ EAL+
Sbjct: 5 ALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 36.0 bits (84), Expect = 0.002
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALE 606
A++ +P AEAY NL + GQ EAL+
Sbjct: 5 ALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 32.6 bits (75), Expect = 0.046
Identities = 11/33 (33%), Positives = 17/33 (51%)
Query: 652 CYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAI 684
Y KA+E P+ A A+ NL + G+ A+
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEAL 33
Score = 32.2 bits (74), Expect = 0.049
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 99 YRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 131
Y A+ L P+ + Y NLA L+ G ++A+Q
Sbjct: 2 YEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 32.2 bits (74), Expect = 0.049
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 257 YRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 289
Y A+ L P+ + Y NLA L+ G ++A+Q
Sbjct: 2 YEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 32.2 bits (74), Expect = 0.049
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 608 YRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 640
Y A+ L P+ + Y NLA L+ G ++A+Q
Sbjct: 2 YEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 31.8 bits (73), Expect = 0.079
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 686 HFEKAVSLDPNFLDAYINLGNVLKEARIFDRA 717
+EKA+ LDPN +AY NL +L +D A
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEA 32
Score = 29.9 bits (68), Expect = 0.32
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 132 AYVTALQYNPDLYCVRSDLGNLLKALGRLDEA 163
Y AL+ +P+ +L LL LG+ DEA
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEA 32
>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
Length = 389
Score = 40.2 bits (95), Expect = 0.005
Identities = 25/89 (28%), Positives = 34/89 (38%), Gaps = 19/89 (21%)
Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAI 684
LA +A GD++ A AL +P C A LG + AQG+ AI
Sbjct: 186 LAQQALARGDLDAARALLKKALAADPQC-----------VRASILLGDLALAQGDYAAAI 234
Query: 685 HHFEKAVSLDPNFL--------DAYINLG 705
E+ DP +L + Y LG
Sbjct: 235 EALERVEEQDPEYLSEVLPKLMECYQALG 263
Score = 36.3 bits (85), Expect = 0.058
Identities = 42/166 (25%), Positives = 61/166 (36%), Gaps = 33/166 (19%)
Query: 4 IAQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAH 63
Q + L EL +AG + AE +QL + G L L I+ Q + K
Sbjct: 103 REQRLLALQELGQDYLKAGLLDRAEELFLQLVDEGDFAEGALQQLLEIYQQEKDWQK--- 159
Query: 64 FSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAA 123
AI +AE LG L+ + ++ Y LA +A
Sbjct: 160 ----AID----VAERLEKLG-----GDSLRVEIAHF-------------YCELAQQALAR 193
Query: 124 GDMEQAVQAYVTALQYNPDLYCVRS--DLGNLLKALGRLDEAKDLY 167
GD++ A AL +P CVR+ LG+L A G A +
Sbjct: 194 GDLDAARALLKKALAADPQ--CVRASILLGDLALAQGDYAAAIEAL 237
>gnl|CDD|205553 pfam13374, TPR_10, Tetratricopeptide repeat.
Length = 42
Score = 35.2 bits (82), Expect = 0.005
Identities = 8/31 (25%), Positives = 16/31 (51%)
Query: 75 LAEAYSNLGNVYKERGQLQEALENYRHAVRL 105
A + +NL + +G+ +EA E A+ +
Sbjct: 1 TAASLNNLARALRAQGRYEEAEELLEEALAI 31
Score = 35.2 bits (82), Expect = 0.005
Identities = 8/31 (25%), Positives = 16/31 (51%)
Query: 233 LAEAYSNLGNVYKERGQLQEALENYRHAVRL 263
A + +NL + +G+ +EA E A+ +
Sbjct: 1 TAASLNNLARALRAQGRYEEAEELLEEALAI 31
Score = 35.2 bits (82), Expect = 0.005
Identities = 8/31 (25%), Positives = 16/31 (51%)
Query: 584 LAEAYSNLGNVYKERGQLQEALENYRHAVRL 614
A + +NL + +G+ +EA E A+ +
Sbjct: 1 TAASLNNLARALRAQGRYEEAEELLEEALAI 31
Score = 27.2 bits (61), Expect = 4.3
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 170 RSDLGNLLKALGRLDEAKNLHTE 192
++L L+A GR +EA+ L E
Sbjct: 5 LNNLARALRAQGRYEEAEELLEE 27
>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
subunit; Provisional.
Length = 987
Score = 39.5 bits (92), Expect = 0.009
Identities = 31/165 (18%), Positives = 65/165 (39%), Gaps = 7/165 (4%)
Query: 9 IGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLA 68
LL A+ AG+ + +R Q ++ + + L + + + + + + T +
Sbjct: 543 EDLLAAANTAQAAGNGAARDRWLQQAEQRGLGDNALYWWLHAQRYIPGQPELALNDLTRS 602
Query: 69 IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQ 128
+ P A AY +Y++R + A+ + R A+ L+P+ + L AL +GD+ Q
Sbjct: 603 LNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQ 661
Query: 129 AVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDL 173
+ + A + PD + L D+ +
Sbjct: 662 SREMLERAHKGLPDDPALIRQLAY------VNQRLDDMAATQHYA 700
Score = 34.5 bits (79), Expect = 0.32
Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 4/79 (5%)
Query: 211 GRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDG 270
G+ L T ++ P A AY +Y++R + A+ + R A+ L+P+ +
Sbjct: 590 GQPELALN---DLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNY 645
Query: 271 YINLAAALVAAGDMEQAVQ 289
L AL +GD+ Q+ +
Sbjct: 646 QAALGYALWDSGDIAQSRE 664
Score = 34.1 bits (78), Expect = 0.44
Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 573 FSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAA 632
T ++ P A AY +Y++R + A+ + R A+ L+P+ + L AL +
Sbjct: 598 DLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDS 656
Query: 633 GDMEQAVQ 640
GD+ Q+ +
Sbjct: 657 GDIAQSRE 664
>gnl|CDD|234406 TIGR03939, PGA_TPR_OMP, poly-beta-1,6 N-acetyl-D-glucosamine export
porin PgaA. Members of this protein family are the
poly-beta-1,6 N-acetyl-D-glucosamine (PGA) export porin
PgaA of Gram-negative bacteria. There is no counterpart
in the poly-beta-1,6 N-acetyl-D-glucosamine biosynthesis
systems of Gram-positive bacteria such as Staphylococcus
epidermidis. The PGA polysaccharide adhesin is a
critical determinant of biofilm formation. The conserved
C-terminal domain of this outer membrane protein is
preceded by a variable number of TPR repeats.
Length = 800
Score = 39.3 bits (92), Expect = 0.010
Identities = 33/125 (26%), Positives = 53/125 (42%), Gaps = 13/125 (10%)
Query: 581 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 640
L A L Y+ Q +ALE YR ++ P+ DG + LA L AG +A++
Sbjct: 82 VGLPARVLEALAKAYRNEKQWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDAEALK 141
Query: 641 AYVTALQYNPSCYLKAIETRPDFAVAW-SNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLD 699
YLK R A L V A + A+ +++A++L+P+ +
Sbjct: 142 ------------YLKEYVARFPTDAARYEALAYVLRAAEDHLDALQAWQQALTLEPDNPE 189
Query: 700 AYINL 704
A + L
Sbjct: 190 AALEL 194
Score = 34.6 bits (80), Expect = 0.29
Identities = 31/121 (25%), Positives = 45/121 (37%), Gaps = 8/121 (6%)
Query: 72 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 131
L A L Y+ Q +ALE YR ++ P+ DG + LA L AG +A++
Sbjct: 82 VGLPARVLEALAKAYRNEKQWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDAEALK 141
Query: 132 A-------YVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLG-NLLKALGRL 183
+ T L V + L AL +A L + L + L RL
Sbjct: 142 YLKEYVARFPTDAARYEALAYVLRAAEDHLDALQAWQQALTLEPDNPEAALELYRILTRL 201
Query: 184 D 184
Sbjct: 202 G 202
>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
modification, protein turnover, chaperones].
Length = 287
Score = 38.5 bits (90), Expect = 0.011
Identities = 30/98 (30%), Positives = 42/98 (42%), Gaps = 3/98 (3%)
Query: 69 IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME- 127
++QNP AE + LG Y G+ +AL YR+A+RL D + + LA AL +
Sbjct: 149 LQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQM 208
Query: 128 --QAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEA 163
+A AL +P S L G EA
Sbjct: 209 TAKARALLRQALALDPANIRALSLLAFAAFEQGDYAEA 246
Score = 38.1 bits (89), Expect = 0.013
Identities = 34/121 (28%), Positives = 51/121 (42%), Gaps = 12/121 (9%)
Query: 578 IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD--M 635
++QNP AE + LG Y G+ +AL YR+A+RL D + + LA AL M
Sbjct: 149 LQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQM 208
Query: 636 EQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDP 695
+A L+ +A+ P A S L QG+ A ++ + L P
Sbjct: 209 TAKARA---LLR-------QALALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLP 258
Query: 696 N 696
Sbjct: 259 A 259
Score = 37.7 bits (88), Expect = 0.020
Identities = 21/57 (36%), Positives = 30/57 (52%)
Query: 227 IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 283
++QNP AE + LG Y G+ +AL YR+A+RL D + + LA AL
Sbjct: 149 LQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAG 205
Score = 33.1 bits (76), Expect = 0.49
Identities = 23/121 (19%), Positives = 39/121 (32%), Gaps = 14/121 (11%)
Query: 600 QLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIET 659
+++ + ++ P +G+ L A +A G A+ AY AL+
Sbjct: 137 EMEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALR-----------L 185
Query: 660 RPDFAVAWSNLGCVFNAQGE---IWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDR 716
D L Q A +A++LDP + A L E +
Sbjct: 186 AGDNPEILLGLAEALYYQAGQQMTAKARALLRQALALDPANIRALSLLAFAAFEQGDYAE 245
Query: 717 A 717
A
Sbjct: 246 A 246
Score = 30.4 bits (69), Expect = 3.4
Identities = 12/52 (23%), Positives = 23/52 (44%)
Query: 91 QLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
+++ + ++ P +G+ L A +A G A+ AY AL+ D
Sbjct: 137 EMEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGD 188
>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
[Carbohydrate transport and metabolism].
Length = 389
Score = 38.5 bits (90), Expect = 0.013
Identities = 39/163 (23%), Positives = 67/163 (41%), Gaps = 14/163 (8%)
Query: 11 LLELAHREYQ-AGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAI 69
L +L R+Y AG + AE QL + G L L +I+ R+ +K+ + +
Sbjct: 110 LQQLG-RDYMAAGLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAIDVAERLV 168
Query: 70 K-----QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAG 124
K +A+ Y L + A E + A++ + I L +A G
Sbjct: 169 KLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKG 228
Query: 125 DMEQAVQAYVTALQYNPDLYCVRSDLGNLLK----ALGRLDEA 163
D ++AV+A L+ NP+ S++ +L LG+ E
Sbjct: 229 DYQKAVEALERVLEQNPEYL---SEVLEMLYECYAQLGKPAEG 268
Score = 35.8 bits (83), Expect = 0.082
Identities = 48/219 (21%), Positives = 81/219 (36%), Gaps = 44/219 (20%)
Query: 586 EAYSNLGNVYKERGQLQEALENYRHAVRLKPD--------FIDGYINLAAALVAAGDMEQ 637
A L N+Y+ + ++A++ V+L F Y LA +A+ D+++
Sbjct: 142 GALQQLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQF---YCELAQQALASSDVDR 198
Query: 638 AVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF 697
A + ALQ + C +I LG V A+G+ A+ E+ + +P +
Sbjct: 199 ARELLKKALQADKKCVRASII-----------LGRVELAKGDYQKAVEALERVLEQNPEY 247
Query: 698 L--------DAYINLGNVLKEARIFDRANTLFY--VSSYFMITD--EDIDVILERCEAKT 745
L + Y LG E F R + M+ D E + I T
Sbjct: 248 LSEVLEMLYECYAQLGK-PAEGLNFLRRAMETNTGADAELMLADLIELQEGIDAAQAYLT 306
Query: 746 EELNKKFEQLGESSLRDFTLDAPTQSVYKFEGEDYREKQ 784
+L +K G L D+ L + E+ R K+
Sbjct: 307 RQLRRKPTMRGFHRLMDYHLA---------DAEEGRAKE 336
Score = 32.4 bits (74), Expect = 1.0
Identities = 62/284 (21%), Positives = 98/284 (34%), Gaps = 69/284 (24%)
Query: 27 AERHCMQLWRQETNN------TGVLLLLSSIHFQCRKLDKSA-HFSTLAIKQNPLLAEAY 79
R Q + + N G+ LLS + DK+ F + +++P EA+
Sbjct: 20 GRRSAQQDKQDQANRLSRDYVKGLNFLLS------NQPDKAVDLFLEML-QEDPETFEAH 72
Query: 80 SNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQY 139
LGN+++ RG+ +D I + L+ + D+ T Q
Sbjct: 73 LTLGNLFRSRGE------------------VDRAIRIHQTLLESPDL--------TFEQR 106
Query: 140 NPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLHTENIKPVTM 199
L LG A G LD A+D++ N L G E NI T
Sbjct: 107 LLALQ----QLGRDYMAAGLLDRAEDIF-------NQLVDEGEFAEGALQQLLNIYQATR 155
Query: 200 KVQNAIVCNYG----GRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALE 255
+ + AI G + +E A F Y L + A E
Sbjct: 156 EWEKAIDVAERLVKLGGQTYRVEIAQF--------------YCELAQQALASSDVDRARE 201
Query: 256 NYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
+ A++ + I L +A GD ++AV+A L+ NP
Sbjct: 202 LLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNP 245
Score = 31.6 bits (72), Expect = 1.9
Identities = 17/72 (23%), Positives = 29/72 (40%)
Query: 579 KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 638
+A+ Y L + A E + A++ + I L +A GD ++A
Sbjct: 174 TYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKA 233
Query: 639 VQAYVTALQYNP 650
V+A L+ NP
Sbjct: 234 VEALERVLEQNP 245
>gnl|CDD|227310 COG4976, COG4976, Predicted methyltransferase (contains TPR repeat)
[General function prediction only].
Length = 287
Score = 35.9 bits (83), Expect = 0.081
Identities = 15/48 (31%), Positives = 24/48 (50%)
Query: 95 ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
A E Y A+ L P++ G+ L AG+ + A AY L+ +P+
Sbjct: 14 AAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPE 61
Score = 34.4 bits (79), Expect = 0.23
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 253 ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
A E Y A+ L P++ G+ L AG+ + A AY L+ +P
Sbjct: 14 AAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDP 60
Score = 34.4 bits (79), Expect = 0.23
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 604 ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 650
A E Y A+ L P++ G+ L AG+ + A AY L+ +P
Sbjct: 14 AAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDP 60
Score = 29.0 bits (65), Expect = 9.8
Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 11/70 (15%)
Query: 626 AAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIH 685
A L +GD E A + Y +A+E P++A W LG GE A
Sbjct: 2 AYMLAESGDAEAAAELY-----------NQALELAPEWAAGWFRLGEYTEKAGEFDAAAA 50
Query: 686 HFEKAVSLDP 695
+E+ + LDP
Sbjct: 51 AYEEVLELDP 60
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 32.1 bits (73), Expect = 0.085
Identities = 14/44 (31%), Positives = 17/44 (38%)
Query: 76 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA 119
A L G L EAL R A+ L PD + + LA
Sbjct: 1 PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
Score = 32.1 bits (73), Expect = 0.085
Identities = 14/44 (31%), Positives = 17/44 (38%)
Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA 277
A L G L EAL R A+ L PD + + LA
Sbjct: 1 PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
Score = 32.1 bits (73), Expect = 0.085
Identities = 14/44 (31%), Positives = 17/44 (38%)
Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA 628
A L G L EAL R A+ L PD + + LA
Sbjct: 1 PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat.
Length = 36
Score = 30.7 bits (70), Expect = 0.19
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 78 AYSNLGNVYKERGQLQEALENYRHAVRLKPD 108
A SNLG +Y++ G ++A+ Y A+ L D
Sbjct: 1 ALSNLGRLYRKLGDYEKAISLYERALALAKD 31
Score = 30.7 bits (70), Expect = 0.19
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 236 AYSNLGNVYKERGQLQEALENYRHAVRLKPD 266
A SNLG +Y++ G ++A+ Y A+ L D
Sbjct: 1 ALSNLGRLYRKLGDYEKAISLYERALALAKD 31
Score = 30.7 bits (70), Expect = 0.19
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 587 AYSNLGNVYKERGQLQEALENYRHAVRLKPD 617
A SNLG +Y++ G ++A+ Y A+ L D
Sbjct: 1 ALSNLGRLYRKLGDYEKAISLYERALALAKD 31
Score = 26.4 bits (59), Expect = 7.1
Identities = 9/26 (34%), Positives = 11/26 (42%)
Query: 113 YINLAAALVAAGDMEQAVQAYVTALQ 138
NL GD E+A+ Y AL
Sbjct: 2 LSNLGRLYRKLGDYEKAISLYERALA 27
Score = 26.4 bits (59), Expect = 7.1
Identities = 9/26 (34%), Positives = 11/26 (42%)
Query: 271 YINLAAALVAAGDMEQAVQAYVTALQ 296
NL GD E+A+ Y AL
Sbjct: 2 LSNLGRLYRKLGDYEKAISLYERALA 27
Score = 26.4 bits (59), Expect = 7.1
Identities = 9/26 (34%), Positives = 11/26 (42%)
Query: 622 YINLAAALVAAGDMEQAVQAYVTALQ 647
NL GD E+A+ Y AL
Sbjct: 2 LSNLGRLYRKLGDYEKAISLYERALA 27
>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 400
Score = 34.7 bits (80), Expect = 0.19
Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%)
Query: 14 LAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQNP 73
A R + GD++ A+ +++ + L+ L K+A +KQ+P
Sbjct: 269 YAERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEK---WLKQHP 325
Query: 74 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 133
S LG + + +A E A++L+P D Y LA AL G+ E+A Q
Sbjct: 326 EDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASD-YAELADALDQLGEPEEAEQVR 384
Query: 134 VTAL 137
AL
Sbjct: 385 REAL 388
Score = 34.7 bits (80), Expect = 0.21
Identities = 63/296 (21%), Positives = 98/296 (33%), Gaps = 35/296 (11%)
Query: 9 IGLLELAHREYQAGDYESAERHCMQLWRQETN-NTGVLLLLSSIHFQCRKLDKSAHFSTL 67
+ L A Q GD + A R+ + + V L + + R +
Sbjct: 119 LAYLLAAEAAQQRGDEDRANRYLAEAAELAGDDTLAVELTRARLLLNRRDYPAARENVDQ 178
Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEA---LENYRHAVRLKPDFIDGYINLAAALVAAG 124
++ P E Y G Q L R A L A
Sbjct: 179 LLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLRKAGLLSD-------------EEAA 225
Query: 125 DMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRL-DEAKDLYCVRSDLGNLLKALGRL 183
+EQ QA+ LQ D + L + ++ + L LG
Sbjct: 226 RLEQ--QAWEGLLQQARD----DNGSEGLKTWWKNQPRKLRNDPELVVAYAERLIRLGDH 279
Query: 184 DEAKNLHTENIKPVTMKVQNAIVCNYGGR----KPTTLESAHFSTLAIKQNPLLAEAYSN 239
DEA+ + + +K + + +C R P L A + +KQ+P S
Sbjct: 280 DEAQEIIEDALK----RQWDPRLCRLIPRLRPGDPEPLIKA--AEKWLKQHPEDPLLLST 333
Query: 240 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTAL 295
LG + + +A E A++L+P D Y LA AL G+ E+A Q AL
Sbjct: 334 LGRLALKNKLWGKASEALEAALKLRPSASD-YAELADALDQLGEPEEAEQVRREAL 388
Score = 32.3 bits (74), Expect = 1.2
Identities = 26/82 (31%), Positives = 37/82 (45%), Gaps = 4/82 (4%)
Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
L K+A +KQ+P S LG + + +A E A++L+P D Y
Sbjct: 311 EPLIKAAEK---WLKQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASD-YAE 366
Query: 625 LAAALVAAGDMEQAVQAYVTAL 646
LA AL G+ E+A Q AL
Sbjct: 367 LADALDQLGEPEEAEQVRREAL 388
Score = 30.0 bits (68), Expect = 5.8
Identities = 60/277 (21%), Positives = 90/277 (32%), Gaps = 66/277 (23%)
Query: 11 LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQ-------CRKLDKSAH 63
L A DY +A + QL + VL L + + L K
Sbjct: 156 ELTRARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLR- 214
Query: 64 FSTLAIKQNPLLAEAYSNLGNVYKERGQLQEA-----LENYRHAVRLKPDFIDGYINLAA 118
K L E + L G LQ+A E + + +P + L
Sbjct: 215 ------KAGLLSDEEAARLEQ-QAWEGLLQQARDDNGSEGLKTWWKNQPRKLRNDPELVV 267
Query: 119 ALVA----AGDMEQAVQAYVTAL--QYNPDLY----CVR-SDLGNLLKAL-GRLDEAKDL 166
A GD ++A + AL Q++P L +R D L+KA L + +
Sbjct: 268 AYAERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEKWLKQHPED 327
Query: 167 YCVRSDLGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLA 226
LL LGRL L G+ LE+ A
Sbjct: 328 PL-------LLSTLGRLALKNKLW--------------------GKASEALEA------A 354
Query: 227 IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRL 263
+K P A Y+ L + + G+ +EA + R A+ L
Sbjct: 355 LKLRP-SASDYAELADALDQLGEPEEAEQVRREALLL 390
>gnl|CDD|236323 PRK08651, PRK08651, succinyl-diaminopimelate desuccinylase;
Reviewed.
Length = 394
Score = 34.2 bits (79), Expect = 0.26
Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 7/94 (7%)
Query: 464 PGGKT-VIPGKVVGKFSI--RIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSG 520
G KT ++PG FSI R++P +T + V + L+E+ A + + +
Sbjct: 250 GGTKTNIVPGYC--AFSIDRRLIPEETAEEVRDELEALLDEV--APELGIEVEFEITPFS 305
Query: 521 KSWRTNPEHPNYVAAARATKYVYNVEPDLTREGG 554
+++ T+P+ A A + V VEP T G
Sbjct: 306 EAFVTDPDSELVKALREAIREVLGVEPKKTISLG 339
>gnl|CDD|215573 PLN03098, LPA1, LOW PSII ACCUMULATION1; Provisional.
Length = 453
Score = 34.5 bits (79), Expect = 0.28
Identities = 26/89 (29%), Positives = 38/89 (42%), Gaps = 10/89 (11%)
Query: 582 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID---GYINLAAALVAAGDMEQA 638
AE NLG +G++++AL + A+ L P+ + Y N A + ++A
Sbjct: 72 VKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKA 131
Query: 639 VQAYVTAL-QYNPSCYLK--AIETRPDFA 664
TAL YN LK I PD A
Sbjct: 132 ADCLRTALRDYN----LKFSTILNDPDLA 156
Score = 34.1 bits (78), Expect = 0.35
Identities = 24/89 (26%), Positives = 37/89 (41%), Gaps = 13/89 (14%)
Query: 73 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID---GYINLAAALVAAGDMEQA 129
AE NLG +G++++AL + A+ L P+ + Y N A + ++A
Sbjct: 72 VKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKA 131
Query: 130 VQAYVTAL-QYN---------PDLYCVRS 148
TAL YN PDL R+
Sbjct: 132 ADCLRTALRDYNLKFSTILNDPDLAPFRA 160
Score = 34.1 bits (78), Expect = 0.36
Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 4/72 (5%)
Query: 231 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID---GYINLAAALVAAGDMEQA 287
AE NLG +G++++AL + A+ L P+ + Y N A + ++A
Sbjct: 72 VKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKA 131
Query: 288 VQAYVTAL-QYN 298
TAL YN
Sbjct: 132 ADCLRTALRDYN 143
Score = 31.8 bits (72), Expect = 1.9
Identities = 34/123 (27%), Positives = 48/123 (39%), Gaps = 16/123 (13%)
Query: 669 NLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVL----------KEARIFDRAN 718
NLG ++G + A+ FE A+ L+PN +A N K A A
Sbjct: 80 NLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTAL 139
Query: 719 TLFYVSSYFMITDEDIDVILERCEAKT--EELNKKFEQLGESSLRDFTL----DAPTQSV 772
+ + ++ D D+ E K EE K E +G S RD L AP + V
Sbjct: 140 RDYNLKFSTILNDPDLAPFRASPEFKELQEEARKGGEDIGSSFRRDLKLISEVQAPFRGV 199
Query: 773 YKF 775
KF
Sbjct: 200 RKF 202
>gnl|CDD|226428 COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Posttranslational modification, protein
turnover, chaperones].
Length = 620
Score = 34.4 bits (79), Expect = 0.32
Identities = 32/160 (20%), Positives = 50/160 (31%), Gaps = 11/160 (6%)
Query: 2 NDIAQ-LGIGLLELAHREYQAGDYESAERHCMQLWRQ-ETNNTGVLL--LLSSIHFQCRK 57
N +A L + L + A ++ + + L N +LL LS +
Sbjct: 23 NALAGHLSLLDLAIVLEWLNAEGLQALAIYALLLGIAINDVNPELLLAAFLSILLAPLAD 82
Query: 58 LDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRH-AVRLKPDFIDG---- 112
+ + + NP A NL + G AL + A L PD +
Sbjct: 83 STLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHL 142
Query: 113 --YINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDL 150
+ L L G +A A A+ P V L
Sbjct: 143 IRFYQLGRYLKLLGRTAEAELALERAVDLLPKYPRVLGAL 182
Score = 34.0 bits (78), Expect = 0.45
Identities = 26/115 (22%), Positives = 33/115 (28%), Gaps = 18/115 (15%)
Query: 567 LDKSAHFSTLA-------IKQNPLLAEAYSNLGNVYKERGQLQEALENYRH-AVRLKPDF 618
L STLA + NP A NL + G AL + A L PD
Sbjct: 76 LLAPLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDN 135
Query: 619 IDG------YINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAW 667
+ + L L G +A A A+ P K A
Sbjct: 136 AEFLGHLIRFYQLGRYLKLLGRTAEAELALERAVDLLP----KYPRVLGALMTAR 186
Score = 31.3 bits (71), Expect = 2.7
Identities = 19/82 (23%), Positives = 27/82 (32%), Gaps = 7/82 (8%)
Query: 225 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRH-AVRLKPDFIDG------YINLAAA 277
+ + NP A NL + G AL + A L PD + + L
Sbjct: 92 IPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHLIRFYQLGRY 151
Query: 278 LVAAGDMEQAVQAYVTALQYNP 299
L G +A A A+ P
Sbjct: 152 LKLLGRTAEAELALERAVDLLP 173
>gnl|CDD|225660 COG3118, COG3118, Thioredoxin domain-containing protein
[Posttranslational modification, protein turnover,
chaperones].
Length = 304
Score = 33.9 bits (78), Expect = 0.33
Identities = 14/51 (27%), Positives = 22/51 (43%)
Query: 117 AAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
A L+ A D +A ALQ P+ + L L A G ++ A+ +
Sbjct: 141 AKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAIL 191
Score = 32.7 bits (75), Expect = 0.71
Identities = 25/110 (22%), Positives = 39/110 (35%), Gaps = 11/110 (10%)
Query: 88 ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT-----------A 136
E EA + A++ P+ + + LA L+AAGD+E A
Sbjct: 146 EAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHG 205
Query: 137 LQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEA 186
LQ +L + + RL D L + L +GR + A
Sbjct: 206 LQAQIELLEQAAATPEIQDLQRRLAADPDDVEAALALADQLHLVGRNEAA 255
>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF. Members of
this protein family are the product of one of seven
genes regularly clustered in operons to encode the
proteins of the tol-pal system, which is critical for
maintaining the integrity of the bacterial outer
membrane. The gene for this periplasmic protein has been
designated orf2 and ybgF. All members of the seed
alignment were from unique tol-pal gene regions from
completed bacterial genomes. The architecture of this
protein is a signal sequence, a low-complexity region
usually rich in Asn and Gln, a well-conserved region
with tandem repeats that resemble the tetratricopeptide
(TPR) repeat, involved in protein-protein interaction.
Length = 117
Score = 32.3 bits (74), Expect = 0.34
Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 6/71 (8%)
Query: 69 IKQNP---LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI---DGYINLAAALVA 122
+K+ P A+ LG Y +G +A + + V+ P D + L +L
Sbjct: 27 LKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQE 86
Query: 123 AGDMEQAVQAY 133
GD E+A
Sbjct: 87 LGDKEKAKATL 97
Score = 32.3 bits (74), Expect = 0.34
Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 6/71 (8%)
Query: 227 IKQNP---LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI---DGYINLAAALVA 280
+K+ P A+ LG Y +G +A + + V+ P D + L +L
Sbjct: 27 LKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQE 86
Query: 281 AGDMEQAVQAY 291
GD E+A
Sbjct: 87 LGDKEKAKATL 97
Score = 32.3 bits (74), Expect = 0.34
Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 6/71 (8%)
Query: 578 IKQNP---LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI---DGYINLAAALVA 631
+K+ P A+ LG Y +G +A + + V+ P D + L +L
Sbjct: 27 LKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQE 86
Query: 632 AGDMEQAVQAY 642
GD E+A
Sbjct: 87 LGDKEKAKATL 97
>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
subunit NrfG; Provisional.
Length = 198
Score = 33.0 bits (75), Expect = 0.49
Identities = 18/53 (33%), Positives = 27/53 (50%)
Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 120
I+ NP +E ++ LG Y R AL YR A++L+ + + Y LA L
Sbjct: 65 KIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVL 117
Score = 33.0 bits (75), Expect = 0.49
Identities = 18/53 (33%), Positives = 27/53 (50%)
Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 278
I+ NP +E ++ LG Y R AL YR A++L+ + + Y LA L
Sbjct: 65 KIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVL 117
Score = 33.0 bits (75), Expect = 0.49
Identities = 18/53 (33%), Positives = 27/53 (50%)
Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 629
I+ NP +E ++ LG Y R AL YR A++L+ + + Y LA L
Sbjct: 65 KIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVL 117
>gnl|CDD|233013 TIGR00540, TPR_hemY_coli, heme biosynthesis-associated TPR protein.
Members of this protein family are uncharacterized
tetratricopeptide repeat (TPR) proteins invariably found
in heme biosynthesis gene clusters. The absence of any
invariant residues other than Ala argues against this
protein serving as an enzyme per se. The gene symbol
hemY assigned in E. coli is unfortunate in that an
unrelated protein, protoporphyrinogen oxidase (HemG in
E. coli) is designated HemY in Bacillus subtilis
[Unknown function, General].
Length = 367
Score = 33.4 bits (77), Expect = 0.57
Identities = 44/180 (24%), Positives = 69/180 (38%), Gaps = 24/180 (13%)
Query: 20 QAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQ----------CRKLDKSAHFSTLAI 69
DYE+A L Q +T VL L + + L K+ S
Sbjct: 146 DQRDYEAALAALDSLQAQAPRHTAVLRLALRAYQRSGNWDALLKLLPALRKAKALSPEEA 205
Query: 70 KQNPLLAEAYSNLGNVYKERGQ--LQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 127
+ L +AY L + +E L+ + A R +P+ A AL+ GD +
Sbjct: 206 AR--LEQQAYIGLLDEAREEDADALKTWWKQLPRAERQEPELAVAA---AEALIQLGDHD 260
Query: 128 QAVQAYVTAL--QYNPDLY--CVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRL 183
+A + AL +++P+L R G+ + R ++ L D LL ALGRL
Sbjct: 261 EAEKLIEEALKKEWDPELLRLYGRLQPGDPSPLIKRAEKW--LKKHPDD-ALLLLALGRL 317
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 33.4 bits (77), Expect = 0.60
Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 4/78 (5%)
Query: 56 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 115
DK+ + AI+ P L E Y + K G LQ+A E A L D D YIN
Sbjct: 207 GSRDKALEYIDKAIEHTPTLVELYMTKARILKHAGDLQKAAEAMEEARSL--DLADRYIN 264
Query: 116 LAAA--LVAAGDMEQAVQ 131
A ++ A + E+A +
Sbjct: 265 SKCAKYMLRANENEEAEK 282
Score = 33.4 bits (77), Expect = 0.60
Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 4/78 (5%)
Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
DK+ + AI+ P L E Y + K G LQ+A E A L D D YIN
Sbjct: 207 GSRDKALEYIDKAIEHTPTLVELYMTKARILKHAGDLQKAAEAMEEARSL--DLADRYIN 264
Query: 625 LAAA--LVAAGDMEQAVQ 640
A ++ A + E+A +
Sbjct: 265 SKCAKYMLRANENEEAEK 282
Score = 30.7 bits (70), Expect = 4.5
Identities = 23/66 (34%), Positives = 31/66 (46%), Gaps = 4/66 (6%)
Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA--LVAAGD 283
AI+ P L E Y + K G LQ+A E A L D D YIN A ++ A +
Sbjct: 219 AIEHTPTLVELYMTKARILKHAGDLQKAAEAMEEARSL--DLADRYINSKCAKYMLRANE 276
Query: 284 MEQAVQ 289
E+A +
Sbjct: 277 NEEAEK 282
>gnl|CDD|185096 PRK15174, PRK15174, Vi polysaccharide export protein VexE;
Provisional.
Length = 656
Score = 33.2 bits (75), Expect = 0.74
Identities = 27/137 (19%), Positives = 47/137 (34%), Gaps = 9/137 (6%)
Query: 56 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 115
KL + H+ A++ N + + GQ ++A+ + ++ PD
Sbjct: 265 AKLQAAEHWRH-ALQFNSDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPYVRAM 323
Query: 116 LAAALVAAGDMEQAVQAYVTALQYNPD-LYCVRSDLGNLLKALGRLDEAKDL---YCVRS 171
A AL G A +V + R LL+A G+ EA+ + Y
Sbjct: 324 YARALRQVGQYTAASDEFVQLAREKGVTSKWNRYAAAALLQA-GKTSEAESVFEHYIQAR 382
Query: 172 DL---GNLLKALGRLDE 185
+ + L LD
Sbjct: 383 ASHLPQSFEEGLLALDG 399
>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
Found in the DNRI/REDD/AFSR family of regulators. This
region of AFSR along with the C terminal region is
capable of independently directing actinorhodin
production. This family contains TPR repeats.
Length = 145
Score = 31.5 bits (72), Expect = 0.90
Identities = 22/75 (29%), Positives = 30/75 (40%), Gaps = 3/75 (4%)
Query: 232 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 291
L EA L G+ +EAL + L P + L AL AG +A++AY
Sbjct: 59 LRLEALEALAEALLALGRHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAY 118
Query: 292 VTALQYNPLLKKEDG 306
+ LL E G
Sbjct: 119 RRLRR---LLADELG 130
Score = 30.3 bits (69), Expect = 2.3
Identities = 18/60 (30%), Positives = 25/60 (41%)
Query: 74 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 133
L EA L G+ +EAL + L P + L AL AG +A++AY
Sbjct: 59 LRLEALEALAEALLALGRHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAY 118
Score = 30.3 bits (69), Expect = 2.3
Identities = 18/60 (30%), Positives = 25/60 (41%)
Query: 583 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 642
L EA L G+ +EAL + L P + L AL AG +A++AY
Sbjct: 59 LRLEALEALAEALLALGRHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAY 118
Score = 29.2 bits (66), Expect = 5.4
Identities = 25/94 (26%), Positives = 34/94 (36%), Gaps = 21/94 (22%)
Query: 92 LQEALENYR---------------HAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA 136
L+ AL YR RL+ ++ LA AL+A G E+A+
Sbjct: 28 LEAALALYRGPLLADVPDEDWAEAERERLRELRLEALEALAEALLALGRHEEALALLERL 87
Query: 137 LQYNP---DLYCVRSDLGNLLKALGRLDEAKDLY 167
L +P L+ L L GR EA Y
Sbjct: 88 LALDPLRERLHR---LLMRALYRAGRRAEALRAY 118
>gnl|CDD|182695 PRK10747, PRK10747, putative protoheme IX biogenesis protein;
Provisional.
Length = 398
Score = 32.8 bits (75), Expect = 0.91
Identities = 29/81 (35%), Positives = 39/81 (48%), Gaps = 11/81 (13%)
Query: 69 IKQ---NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 125
IKQ PLL +S LG + + G+ QEA +R A++ +PD D Y LA AL
Sbjct: 321 IKQHGDTPLL---WSTLGQLLMKHGEWQEASLAFRAALKQRPDAYD-YAWLADALDRLHK 376
Query: 126 MEQAVQ----AYVTALQYNPD 142
E+A + LQ NP
Sbjct: 377 PEEAAAMRRDGLMLTLQNNPP 397
Score = 32.0 bits (73), Expect = 1.7
Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 7/55 (12%)
Query: 227 IKQ---NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 278
IKQ PLL +S LG + + G+ QEA +R A++ +PD D Y LA AL
Sbjct: 321 IKQHGDTPLL---WSTLGQLLMKHGEWQEASLAFRAALKQRPDAYD-YAWLADAL 371
Score = 32.0 bits (73), Expect = 1.7
Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 7/55 (12%)
Query: 578 IKQ---NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 629
IKQ PLL +S LG + + G+ QEA +R A++ +PD D Y LA AL
Sbjct: 321 IKQHGDTPLL---WSTLGQLLMKHGEWQEASLAFRAALKQRPDAYD-YAWLADAL 371
>gnl|CDD|236644 PRK10049, pgaA, outer membrane protein PgaA; Provisional.
Length = 765
Score = 32.8 bits (75), Expect = 0.93
Identities = 26/142 (18%), Positives = 47/142 (33%), Gaps = 18/142 (12%)
Query: 582 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQA 641
L A Y+ + Y+ Q Q +L ++ A+ L+P D L L AG ++A+
Sbjct: 46 QLPARGYAAVAVAYRNLKQWQNSLTLWQKALSLEPQNDDYQRGLILTLADAGQYDEALVK 105
Query: 642 YVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSL---DPNFL 698
K + + L V+ G W + +A+ +
Sbjct: 106 ------------AKQLVSGAPDKANLLALAYVYKRAGRHWDELRAMTQALPRAPQTQQYP 153
Query: 699 DAYINLGNVLKEARIFDRANTL 720
Y+ L+ R+ A
Sbjct: 154 TEYV---QALRNNRLSAPALGA 172
Score = 32.0 bits (73), Expect = 1.6
Identities = 34/162 (20%), Positives = 53/162 (32%), Gaps = 42/162 (25%)
Query: 73 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID---GYI-------------NL 116
L A Y+ + Y+ Q Q +L ++ A+ L+P D G I
Sbjct: 46 QLPARGYAAVAVAYRNLKQWQNSLTLWQKALSLEPQNDDYQRGLILTLADAGQYDEALVK 105
Query: 117 AAALVA-----------------AGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLK---- 155
A LV+ AG ++A AL P ++ L+
Sbjct: 106 AKQLVSGAPDKANLLALAYVYKRAGRHWDELRAMTQALPRAPQTQQYPTEYVQALRNNRL 165
Query: 156 ---ALGRLDEAKDLYCVRSDL-GNLLKALGRLDEAKNLHTEN 193
ALG +D+A + DL + L RL +E
Sbjct: 166 SAPALGAIDDANLTPAEKRDLEADAAAELVRLSFMPT-RSEK 206
Score = 30.1 bits (68), Expect = 6.2
Identities = 16/52 (30%), Positives = 21/52 (40%), Gaps = 1/52 (1%)
Query: 248 GQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
GQ EAL + V PD + + LA AG ++A AL P
Sbjct: 97 GQYDEALVKAKQLVSGAPDKAN-LLALAYVYKRAGRHWDELRAMTQALPRAP 147
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 31.6 bits (72), Expect = 1.2
Identities = 26/113 (23%), Positives = 39/113 (34%), Gaps = 27/113 (23%)
Query: 796 PKRERKANYAVDAYFKEALRTSE----PKAPKAPRPPKQPIVQ----DFQFFPPRLFEIL 847
P + SE P P+ P+ Q +FFP
Sbjct: 63 PSEKGGKYLLAALTIDRFYHISESVKIPAIYDKVVVPRPPVPQPKGLKMRFFP------- 115
Query: 848 DQEIYYFRKTVGYKVPKNP--ELGSDATKAQKEEQKKIDESEPLTEEELAEKE 898
GY P P ELGS++ ++KE K+++ + EEE EK+
Sbjct: 116 ----------TGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKK 158
>gnl|CDD|164812 PHA00450, PHA00450, host dGTPase inhibitor.
Length = 85
Score = 29.4 bits (66), Expect = 1.9
Identities = 16/44 (36%), Positives = 22/44 (50%), Gaps = 6/44 (13%)
Query: 712 RIFDRANTLFYVSSYFMITDEDIDVILERCEAKTEELNKKFEQL 755
R+ DR+ L S F DED+ +TE LNK ++QL
Sbjct: 44 RVEDRSGHLI-ASRTFEHHDEDV-----LYNMQTEWLNKMYDQL 81
>gnl|CDD|216562 pfam01546, Peptidase_M20, Peptidase family M20/M25/M40. This
family includes a range of zinc metallopeptidases
belonging to several families in the peptidase
classification. Family M20 are Glutamate
carboxypeptidases. Peptidase family M25 contains X-His
dipeptidases.
Length = 310
Score = 31.4 bits (71), Expect = 2.0
Identities = 15/108 (13%), Positives = 28/108 (25%), Gaps = 6/108 (5%)
Query: 456 GIEGAFSGPGGKTVIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAY 515
GI G G + + R ++ + + + + + A
Sbjct: 164 GIGTVGGGGGSNNNVIPEAAFLRGRRRTLDEELRALVEEEEEAIAAGAAAAG----VVEE 219
Query: 516 LLDSGKSWRTNPEHPNYVAAAR--ATKYVYNVEPDLTREGGSIPITLT 561
D + P VAA A + EP+ + G
Sbjct: 220 EEDYRPPYPVTVNDPALVAALEEAAKELGLGPEPEPSGGGEDAAFFAE 267
>gnl|CDD|233446 TIGR01513, NAPRTase_put, putative nicotinate
phosphoribosyltransferase. A deep split separates two
related families of proteins, one of which includes
experimentally characterized examples of nicotinate
phosphoribosyltransferase, an the first enzyme of NAD
salvage biosynthesis. This model represents the other
family. Members have a different (longer) spacing of
several key motifs and have an additional C-terminal
domain of up to 100 residues. One argument suggesting
that this family represents the same enzyme is that no
species has a member of both families. Another is that
the gene encoding this protein is located near other NAD
salvage biosynthesis genes in Nostoc and in at least
four different Gram-positive bacteria. NAD and NADP are
ubiquitous in life. Most members of this family are
Gram-positive bacteria. An additional set of mutually
closely related archaeal sequences score between the
trusted and noise cutoffs [Biosynthesis of cofactors,
prosthetic groups, and carriers, Pyridine nucleotides].
Length = 443
Score = 31.6 bits (72), Expect = 2.2
Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 10/81 (12%)
Query: 116 LAAALVAA-GDMEQAVQAYVTALQYNP----DLY-CVRSDLGNLLKALGRLDEAKDLYCV 169
+A + V + + A +AY D Y +RS L N + L E + V
Sbjct: 193 MAHSFVMSFDNELAAFRAYAKLYPKATVLLVDTYDTLRSGLPNAIAVAKELGEQGKVVGV 252
Query: 170 RSDLGNLL----KALGRLDEA 186
R D G+LL +A +LD A
Sbjct: 253 RIDSGDLLYLSKQARKQLDAA 273
>gnl|CDD|193570 cd08659, M20_ArgE_DapE_like, Peptidase M20 acetylornithine
deacetylase/succinyl-diaminopimelate desuccinylase
(ArgE/DapE)-like. Peptidase M20 acetylornithine
deacetylase/succinyl-diaminopimelate desuccinylase
(ArgE/DapE) like family of enzymes catalyze analogous
reactions and share a common activator, the metal ion
(usually Co2+ or Zn2+). ArgE catalyzes a broad range of
substrates, including N-acetylornithine,
alpha-N-acetylmethionine and alpha-N-formylmethionine,
while DapE catalyzes the hydrolysis of
N-succinyl-L,L-diaminopimelate (L,L-SDAP) to
L,L-diaminopimelate and succinate. Proteins in this
family are mostly bacterial and have been inferred by
homology as being related to both, ArgE and DapE. This
family also includes N-acetyl-L-citrulline deacetylase
(ACDase; acetylcitrulline deacetylase), a unique, novel
enzyme found in Xanthomonas campestris, a plant
pathogen, in which N-acetyl-L-ornithine is the substrate
for transcarbamoylation reaction, and the product is
N-acetyl-L-citrulline. Thus, in the arginine
biosynthesis pathway, ACDase subsequently catalyzes the
hydrolysis of N-acetyl-L-citrulline to acetate and
L-citrulline.
Length = 365
Score = 31.0 bits (71), Expect = 2.6
Identities = 28/108 (25%), Positives = 43/108 (39%), Gaps = 12/108 (11%)
Query: 450 PSLSLHGIEGAFSGPGGKT-VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNS 508
P+L++ I+G G + V+P + + IR+VP + P VL L L
Sbjct: 221 PTLNVGVIKG-----GEQVNVVPDEATLELDIRLVPGEDPD----EVLAELEALLAQVPP 271
Query: 509 PNKFKAYLLDSGKSWRTNPEHPNYV-AAARATKYVYNVEPDLTREGGS 555
P + L T+P+ P V A A A V P + G+
Sbjct: 272 PADVEVDLSVPPPPVVTDPDSP-LVQALAAAIADVTGRPPKVRGVPGA 318
>gnl|CDD|225523 COG2976, COG2976, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 207
Score = 30.5 bits (69), Expect = 3.0
Identities = 26/103 (25%), Positives = 40/103 (38%), Gaps = 6/103 (5%)
Query: 10 GLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLL---LSSIHFQCRKLDKSAHFST 66
LELA E +A + + AE Q Q + L L+ + Q +K D + T
Sbjct: 91 AALELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAA--LKT 148
Query: 67 LAIKQNPLLAEAYSNL-GNVYKERGQLQEALENYRHAVRLKPD 108
L + A + L G++ +G QEA Y A+
Sbjct: 149 LDTIKEESWAAIVAELRGDILLAKGDKQEARAAYEKALESDAS 191
>gnl|CDD|222667 pfam14308, DnaJ-X, X-domain of DnaJ-containing. IN certain plant
and yeast proteins, the DnaJ-1 proteins have a
three-domain structure. The x-domain lies between the
N-terminal DnaJ and the C-terminal Z domains. The exact
function is not known.
Length = 204
Score = 30.6 bits (70), Expect = 3.0
Identities = 15/34 (44%), Positives = 20/34 (58%)
Query: 869 GSDATKAQKEEQKKIDESEPLTEEELAEKEELLT 902
DA +E +K ++ LTEEELAE EE +T
Sbjct: 98 ALDAQSTMEEMKKLEEKGGELTEEELAEMEEKVT 131
>gnl|CDD|226414 COG3898, COG3898, Uncharacterized membrane-bound protein [Function
unknown].
Length = 531
Score = 31.0 bits (70), Expect = 3.1
Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 2/80 (2%)
Query: 95 ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA--LQYNPDLYCVRSDLGN 152
A ++ A +L PD + + A AL G++ + + TA + +PD+ + +
Sbjct: 248 ARDDALEANKLAPDLVPAAVVAARALFRDGNLRKGSKILETAWKAEPHPDIALLYVRARS 307
Query: 153 LLKALGRLDEAKDLYCVRSD 172
AL RL AK L ++ +
Sbjct: 308 GDTALDRLKRAKKLESLKPN 327
>gnl|CDD|215341 PLN02635, PLN02635, disproportionating enzyme.
Length = 538
Score = 30.9 bits (70), Expect = 3.2
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 9/71 (12%)
Query: 213 KPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYI 272
+P + FS +A ++PL+A+A L G+L+E LE++R P+ I ++
Sbjct: 117 EPVPVGKVDFSAVAELKDPLIAKAAERL---LLSDGELKEELEDFRK----DPE-ISSWL 168
Query: 273 NLAAALVAAGD 283
AAL AA D
Sbjct: 169 E-DAALFAAID 178
>gnl|CDD|224643 COG1729, COG1729, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 262
Score = 30.5 bits (69), Expect = 3.2
Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 3/62 (4%)
Query: 71 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD---FIDGYINLAAALVAAGDME 127
+ AY LG +G ++A + V+ P D + L +L G+ +
Sbjct: 173 NSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTD 232
Query: 128 QA 129
+A
Sbjct: 233 EA 234
Score = 30.5 bits (69), Expect = 3.2
Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 3/62 (4%)
Query: 229 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD---FIDGYINLAAALVAAGDME 285
+ AY LG +G ++A + V+ P D + L +L G+ +
Sbjct: 173 NSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTD 232
Query: 286 QA 287
+A
Sbjct: 233 EA 234
Score = 30.5 bits (69), Expect = 3.2
Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 3/62 (4%)
Query: 580 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD---FIDGYINLAAALVAAGDME 636
+ AY LG +G ++A + V+ P D + L +L G+ +
Sbjct: 173 NSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTD 232
Query: 637 QA 638
+A
Sbjct: 233 EA 234
>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
Length = 296
Score = 30.6 bits (70), Expect = 3.5
Identities = 19/66 (28%), Positives = 29/66 (43%)
Query: 66 TLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 125
+ A+ P +A+AY+ LG + G A E + + L P + Y+N AL G
Sbjct: 88 SQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGR 147
Query: 126 MEQAVQ 131
E A
Sbjct: 148 YELAQD 153
Score = 30.6 bits (70), Expect = 3.5
Identities = 19/66 (28%), Positives = 29/66 (43%)
Query: 224 TLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 283
+ A+ P +A+AY+ LG + G A E + + L P + Y+N AL G
Sbjct: 88 SQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGR 147
Query: 284 MEQAVQ 289
E A
Sbjct: 148 YELAQD 153
Score = 30.6 bits (70), Expect = 3.5
Identities = 19/66 (28%), Positives = 29/66 (43%)
Query: 575 TLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 634
+ A+ P +A+AY+ LG + G A E + + L P + Y+N AL G
Sbjct: 88 SQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGR 147
Query: 635 MEQAVQ 640
E A
Sbjct: 148 YELAQD 153
>gnl|CDD|233632 TIGR01910, DapE-ArgE, acetylornithine deacetylase or
succinyl-diaminopimelate desuccinylase. This group of
sequences contains annotations for both acetylornithine
deacetylase and succinyl-diaminopimelate desuccinylase,
but does not contain any members with experimental
characterization. Bacillus, Staphylococcus and
Sulfolobus species contain multiple hits to this
subfamily and each may have a separate activity.
Determining which is which must await further laboratory
research [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 375
Score = 30.8 bits (70), Expect = 3.7
Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 1/79 (1%)
Query: 469 VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPE 528
+P +RI+P + V++ + D + L K+ + + + SG + P+
Sbjct: 248 SVPDYCEFSIDVRIIPEENLDEVKQIIEDVVKALSKSDGWLYENEPVVKWSGPNETP-PD 306
Query: 529 HPNYVAAARATKYVYNVEP 547
A K V +EP
Sbjct: 307 SRLVKALEAIIKKVRGIEP 325
>gnl|CDD|204480 pfam10414, CysG_dimeriser, Sirohaem synthase dimerisation region.
Bacterial sulfur metabolism depends on the
iron-containing porphinoid sirohaem. CysG,
S-adenosyl-L-methionine (SAM)-dependent
bis-methyltransferase, dehydrogenase and ferrochelatase,
synthesises sirohaem from uroporphyrinogen III via
reactions which encompass two branchpoint intermediates
in tetrapyrrole biosynthesis, diverting flux first from
protoporphyrin IX biosynthesis and then from cobalamin
(vitamin B12) biosynthesis. CysG is a dimer of two
structurally similar protomers held together
asymmetrically through a number of salt-bridges across
complementary residues in the CysG_dimeriser region to
produce a series of active sites, accounting for CysG's
multifunctionality, catalyzing four diverse reactions:
two SAM-dependent methylations, NAD+-dependent
tetrapyrrole dehydrogenation and metal chelation. The
CysG_dimeriser region holding the two protomers together
is of 74 residues.
Length = 60
Score = 27.5 bits (62), Expect = 4.0
Identities = 20/60 (33%), Positives = 24/60 (40%), Gaps = 14/60 (23%)
Query: 90 GQLQEALENYRHAV--RLKP---------DFIDGYINLAAALVAAGDMEQAVQAYVTALQ 138
G+L +R V RL F DG A LV AGD E+A A AL+
Sbjct: 4 GRLAALAGRFRDRVKQRLPDVAERRRFWERFFDGPF---AELVLAGDEEEAEAALEQALE 60
Score = 27.5 bits (62), Expect = 4.0
Identities = 20/60 (33%), Positives = 24/60 (40%), Gaps = 14/60 (23%)
Query: 248 GQLQEALENYRHAV--RLKP---------DFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
G+L +R V RL F DG A LV AGD E+A A AL+
Sbjct: 4 GRLAALAGRFRDRVKQRLPDVAERRRFWERFFDGPF---AELVLAGDEEEAEAALEQALE 60
Score = 27.5 bits (62), Expect = 4.0
Identities = 20/60 (33%), Positives = 24/60 (40%), Gaps = 14/60 (23%)
Query: 599 GQLQEALENYRHAV--RLKP---------DFIDGYINLAAALVAAGDMEQAVQAYVTALQ 647
G+L +R V RL F DG A LV AGD E+A A AL+
Sbjct: 4 GRLAALAGRFRDRVKQRLPDVAERRRFWERFFDGPF---AELVLAGDEEEAEAALEQALE 60
>gnl|CDD|237265 PRK13009, PRK13009, succinyl-diaminopimelate desuccinylase;
Reviewed.
Length = 375
Score = 30.4 bits (70), Expect = 4.5
Identities = 23/90 (25%), Positives = 40/90 (44%), Gaps = 15/90 (16%)
Query: 469 VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLD---SGKSWRT 525
VIPG++ +F+ R T + ++ V L ++ + Y L+ SG+ + T
Sbjct: 245 VIPGELEAQFNFRFSTEHTAESLKARVEAIL-----DKHGLD----YTLEWTLSGEPFLT 295
Query: 526 NPEHPNYVAA-ARATKYVYNVEPDLTREGG 554
P V A A + V + P+L+ GG
Sbjct: 296 PPG--KLVDAVVAAIEAVTGITPELSTSGG 323
>gnl|CDD|237851 PRK14894, PRK14894, glycyl-tRNA synthetase; Provisional.
Length = 539
Score = 30.4 bits (68), Expect = 4.6
Identities = 28/98 (28%), Positives = 40/98 (40%), Gaps = 15/98 (15%)
Query: 87 KERGQLQEAL-ENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT-------ALQ 138
K RGQ Q + E+YR +RLKP LA VA +++ + V LQ
Sbjct: 409 KLRGQAQPLIDEHYRTVLRLKP-------RLAPIKVAVFPLKRNHEGLVATAKAVRRQLQ 461
Query: 139 YNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNL 176
+ V D G + K R DE +C+ D +
Sbjct: 462 VGGRMRTVYDDTGAIGKLYRRQDEIGTPFCITVDFDTI 499
>gnl|CDD|226036 COG3505, VirD4, Type IV secretory pathway, VirD4 components
[Intracellular trafficking and secretion].
Length = 596
Score = 30.4 bits (69), Expect = 4.9
Identities = 13/47 (27%), Positives = 17/47 (36%), Gaps = 1/47 (2%)
Query: 795 PPKRERKANYAVDAYFKEALRTSEPKAPKAPRPPKQPIVQDFQFFPP 841
PP + K Y D FK+ L+ EP+ P P P
Sbjct: 540 PPIKCGKLRYFKDPKFKDRLKPVEPELEAGP-KLGAPAKASTDAAPS 585
>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
chaperone LcrH/SycD. Genes in this family are found in
type III secretion operons. LcrH, from Yersinia is
believed to have a regulatory function in the
low-calcium response of the secretion system. The same
protein is also known as SycD (SYC = Specific Yop
Chaperone) for its chaperone role. In Pseudomonas, where
the homolog is known as PcrH, the chaperone role has
been demonstrated and the regulatory role appears to be
absent. ScyD/LcrH contains three central
tetratricopeptide-like repeats that are predicted to
fold into an all-alpha-helical array.
Length = 135
Score = 28.8 bits (65), Expect = 6.3
Identities = 21/87 (24%), Positives = 40/87 (45%)
Query: 52 HFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID 111
+Q + D++ L +P + + L + + +EA++ Y A L PD
Sbjct: 27 LYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPR 86
Query: 112 GYINLAAALVAAGDMEQAVQAYVTALQ 138
Y + A L+A G+ E A++A A++
Sbjct: 87 PYFHAAECLLALGEPESALKALDLAIE 113
Score = 28.8 bits (65), Expect = 7.0
Identities = 20/96 (20%), Positives = 39/96 (40%), Gaps = 2/96 (2%)
Query: 14 LAHREYQAGDYESAERHCMQLWRQ-ETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQN 72
LA+ YQ G Y+ A QL + N+ L L++ ++ +++ LA +
Sbjct: 23 LAYNLYQQGRYDEA-LKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALD 81
Query: 73 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 108
P Y + G+ + AL+ A+ + +
Sbjct: 82 PDDPRPYFHAAECLLALGEPESALKALDLAIEICGE 117
>gnl|CDD|233451 TIGR01531, glyc_debranch, glycogen debranching enzymye. glycogen
debranching enzyme possesses two different catalytic
activities; oligo-1,4-->1,4-glucantransferase (EC
2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site
directed mutagenesis studies in S. cerevisiae indicate
that the transferase and glucosidase activities are
independent and located in different regions of the
polypeptide chain. Proteins in this model belong to the
larger alpha-amylase family. The model covers eukaryotic
proteins with a seed composed of human, nematode and
yeast sequences. Yeast seed sequence is well
characterized. The model is quite rigorous; either query
sequence yields large bit score or it fails to hit the
model altogether. There doesn't appear to be any middle
ground [Energy metabolism, Biosynthesis and degradation
of polysaccharides].
Length = 1464
Score = 30.2 bits (68), Expect = 6.3
Identities = 20/111 (18%), Positives = 34/111 (30%), Gaps = 8/111 (7%)
Query: 18 EYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHF----QCRKLDKSAHFSTLAIKQNP 73
E+ D + A W QE++ + S + R+ + +F T N
Sbjct: 286 EFYQVDVQKAVNDFKAHWTQESSYVTNNIKDQSSDIIQDPEYRRFGVTVNFETALRIFNR 345
Query: 74 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAG 124
+ K L AL +RL I++A + G
Sbjct: 346 HNGDLKLEEDRGEKCSSSLATALNILNENLRL----YRYDIDVALEQLLGG 392
>gnl|CDD|236148 PRK08074, PRK08074, bifunctional ATP-dependent DNA helicase/DNA
polymerase III subunit epsilon; Validated.
Length = 928
Score = 30.3 bits (69), Expect = 6.7
Identities = 24/107 (22%), Positives = 39/107 (36%), Gaps = 20/107 (18%)
Query: 685 HHFEKAVS------LDPNFLDAYINLGNVLKEARIFDRANTLFYVSSYFM-ITDEDIDVI 737
HHFE+A S ++ L+E + + LF S + D+D
Sbjct: 465 HHFEEAASRHLGEQFSYMSFQLLLSRLGTLEEDGLLSKLAKLFKKSDQASRSSFRDLDES 524
Query: 738 LERCEAKTEELNKKFEQLGESSLRDFTLDAPTQS-----VYKFEGED 779
L+ + + +EL F LR F L Q +Y++ E
Sbjct: 525 LKELKFEADEL---F-----QMLRSFVLKRKKQEQNGRLIYRYNTES 563
>gnl|CDD|226862 COG4455, ImpE, Protein of avirulence locus involved in
temperature-dependent protein secretion [General
function prediction only].
Length = 273
Score = 29.4 bits (66), Expect = 7.7
Identities = 17/61 (27%), Positives = 23/61 (37%), Gaps = 1/61 (1%)
Query: 599 GQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIE 658
LQ+A+ R V+ KP G L L AGD E+A+ A +P
Sbjct: 15 NSLQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAGDWEKALAQLNLAATLSPQ-DTVGAS 73
Query: 659 T 659
Sbjct: 74 L 74
>gnl|CDD|215883 pfam00374, NiFeSe_Hases, Nickel-dependent hydrogenase.
Length = 502
Score = 29.8 bits (67), Expect = 8.0
Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 5/63 (7%)
Query: 616 PDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFN 675
D++D L A A ++ Q+V + + Y KA++ R V LG N
Sbjct: 76 LDWVDVTSALKADPKKASELAQSVSPWPKS-----PGYFKAVQNRLKKFVESGQLGIFTN 130
Query: 676 AQG 678
A
Sbjct: 131 AYW 133
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.135 0.395
Gapped
Lambda K H
0.267 0.0733 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,250,961
Number of extensions: 4783156
Number of successful extensions: 6385
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6130
Number of HSP's successfully gapped: 329
Length of query: 904
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 798
Effective length of database: 6,236,078
Effective search space: 4976390244
Effective search space used: 4976390244
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.4 bits)