RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4538
(343 letters)
>gnl|CDD|223540 COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational
modification, protein turnover, chaperones].
Length = 494
Score = 205 bits (523), Expect = 5e-62
Identities = 100/219 (45%), Positives = 132/219 (60%), Gaps = 13/219 (5%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL+GPPGTGKTL+ K VA + ++ F + S L SKW GE EK +R LF A PSI
Sbjct: 279 VLLYGPPGTGKTLLAKAVALESRSRFISVKGSELLSKWVGESEKNIRELFEKARKLAPSI 338
Query: 61 IFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAAR 120
IFIDEIDSL R E+ + RR+ + L LDG + VLVI ATNRP +LD A
Sbjct: 339 IFIDEIDSLASGRGPSEDGSGRRVVGQLLTELDGIEKAEG--VLVIAATNRPDDLDPALL 396
Query: 121 R--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVD-DIEEVGRMTTDFSGADMASLCR 177
R R + +Y+PLPD + R EI LR+ L D D+EE+ +T +SGAD+A+L R
Sbjct: 397 RPGRFDRLIYVPLPDLEERLEIFKIHLRDKKPPLAEDVDLEELAEITEGYSGADIAALVR 456
Query: 178 EASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPSV 216
EA+L +R R +++DDF DALK ++PSV
Sbjct: 457 EAALEALRE--------ARRREVTLDDFLDALKKIKPSV 487
Score = 118 bits (296), Expect = 1e-29
Identities = 91/238 (38%), Positives = 125/238 (52%), Gaps = 19/238 (7%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL GPPGTGKTL+ + +A A F I+ + SK+ GE E +R LF A PSI
Sbjct: 21 VLLHGPPGTGKTLLARALA-NEGAEFLSINGPEILSKYVGESELRLRELFEEAEKLAPSI 79
Query: 61 IFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAAR 120
IFIDEID+L +RS + E RR+ + L +DG L V+VIGATNRP LD A R
Sbjct: 80 IFIDEIDALAPKRSSDQGEVERRVVAQLLALMDG---LKRGQVIVIGATNRPDGLDPAKR 136
Query: 121 R--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCRE 178
R R + + + LPDE R EI+ R + + + T SGAD+ +L +E
Sbjct: 137 RPGRFDREIEVNLPDEAGRLEILQIHTRLMFLGPPG-TGKTLAARTVGKSGADLGALAKE 195
Query: 179 ASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPSVCQADFVHQPSIIFIDEIDSL 236
A+L +R + + V + DDF +ALK V PS ++F DE +L
Sbjct: 196 AALRELRRAIDLVGEYIGV---TEDDFEEALKKVLPS---------RGVLFEDEDVTL 241
Score = 88.7 bits (220), Expect = 2e-19
Identities = 39/81 (48%), Positives = 48/81 (59%), Gaps = 4/81 (4%)
Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 282
PSIIFIDEIDSL R E+ + RR+ + L LDG + VLVI ATNRP +L
Sbjct: 334 LAPSIIFIDEIDSLASGRGPSEDGSGRRVVGQLLTELDGIEKAEG--VLVIAATNRPDDL 391
Query: 283 DEAARR--RLVKRLYIPLPDE 301
D A R R + +Y+PLPD
Sbjct: 392 DPALLRPGRFDRLIYVPLPDL 412
Score = 53.3 bits (128), Expect = 7e-08
Identities = 44/120 (36%), Positives = 59/120 (49%), Gaps = 11/120 (9%)
Query: 192 IDALDVRPISIDDFRDALKTVRPS------VCQADFVHQPSIIFIDEIDSLLCQRSDQEN 245
+ +SI+ K V S + + PSIIFIDEID+L +RS +
Sbjct: 38 LANEGAEFLSINGPEILSKYVGESELRLRELFEEAEKLAPSIIFIDEIDALAPKRSSDQG 97
Query: 246 ETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAARR--RLVKRLYIPLPDEQK 303
E RR+ + L +DG L V+VIGATNRP LD A RR R + + + LPDE
Sbjct: 98 EVERRVVAQLLALMDG---LKRGQVIVIGATNRPDGLDPAKRRPGRFDREIEVNLPDEAG 154
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 165 bits (421), Expect = 7e-48
Identities = 88/221 (39%), Positives = 123/221 (55%), Gaps = 25/221 (11%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL+GPPGTGKTL+ K VA Q ATF + S L K+ GEG ++VR LF +A PSI
Sbjct: 188 VLLYGPPGTGKTLLAKAVANQTDATFIRVVGSELVQKYIGEGARLVRELFELAREKAPSI 247
Query: 61 IFIDEIDSLLCQRSDQ----ENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELD 116
IFIDEID++ +R D + E R + E L LDG + V VI ATNRP LD
Sbjct: 248 IFIDEIDAIGAKRFDSGTSGDREVQRTM-LELLNQLDGFDPRGN--VKVIMATNRPDILD 304
Query: 117 EAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVD---DIEEVGRMTTDFSGAD 171
A R R +++ PLPDE+ R EI L+ T + + D+E + R+T FSGAD
Sbjct: 305 PALLRPGRFDRKIEFPLPDEEGRAEI----LKIHTRKMNLADDVDLELLARLTEGFSGAD 360
Query: 172 MASLCREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTV 212
+ ++C EA + +R ++++DF A++ V
Sbjct: 361 LKAICTEAGMFAIRE---------RRDEVTMEDFLKAVEKV 392
Score = 65.7 bits (161), Expect = 6e-12
Identities = 37/86 (43%), Positives = 47/86 (54%), Gaps = 9/86 (10%)
Query: 223 HQPSIIFIDEIDSLLCQRSDQ----ENETSRRLKTEFLISLDGASTLDDDLVLVIGATNR 278
PSIIFIDEID++ +R D + E R + E L LDG + V VI ATNR
Sbjct: 243 KAPSIIFIDEIDAIGAKRFDSGTSGDREVQRTM-LELLNQLDGFDPRGN--VKVIMATNR 299
Query: 279 PQELDEAARR--RLVKRLYIPLPDEQ 302
P LD A R R +++ PLPDE+
Sbjct: 300 PDILDPALLRPGRFDRKIEFPLPDEE 325
>gnl|CDD|215649 pfam00004, AAA, ATPase family associated with various cellular
activities (AAA). AAA family proteins often perform
chaperone-like functions that assist in the assembly,
operation, or disassembly of protein complexes.
Length = 131
Score = 152 bits (386), Expect = 6e-46
Identities = 62/132 (46%), Positives = 77/132 (58%), Gaps = 2/132 (1%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL+GPPGTGKT + K VA + A F IS S L SK+ GE EK +R LF A P +
Sbjct: 1 LLLYGPPGTGKTTLAKAVAKELGAPFIEISGSELVSKYVGESEKRLRELFEAAKKLAPCV 60
Query: 61 IFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAAR 120
IFIDEID+L R + SRR+ + L LDG T V+VI ATNRP +LD A
Sbjct: 61 IFIDEIDALAGSRGSGGDSESRRVVNQLLTELDGF-TSSLSKVIVIAATNRPDKLDPALL 119
Query: 121 R-RLVKRLYIPL 131
R R + + PL
Sbjct: 120 RGRFDRIIEFPL 131
Score = 85.7 bits (213), Expect = 1e-20
Identities = 40/126 (31%), Positives = 57/126 (45%), Gaps = 12/126 (9%)
Query: 174 SLCREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPSVCQADFVHQPSIIFIDEI 233
++ +E P I S + + V S R+ + + P +IFIDEI
Sbjct: 17 AVAKELGA-PFIEISGSELVSKYVG-ESEKRLRELFEAAK--------KLAPCVIFIDEI 66
Query: 234 DSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAARR-RLVK 292
D+L R + SRR+ + L LDG T V+VI ATNRP +LD A R R +
Sbjct: 67 DALAGSRGSGGDSESRRVVNQLLTELDGF-TSSLSKVIVIAATNRPDKLDPALLRGRFDR 125
Query: 293 RLYIPL 298
+ PL
Sbjct: 126 IIEFPL 131
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 159 bits (405), Expect = 1e-45
Identities = 83/218 (38%), Positives = 121/218 (55%), Gaps = 17/218 (7%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL+GPPGTGKTL+ K VA + ATF + S L K+ GEG ++VR LF +A PSI
Sbjct: 168 VLLYGPPGTGKTLLAKAVAHETNATFIRVVGSELVQKFIGEGARLVRELFELAREKAPSI 227
Query: 61 IFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 117
IFIDEID++ +R+D R R + L +DG + V +I ATNR LD
Sbjct: 228 IFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLAEMDGFDPRGN--VKIIAATNRIDILDP 285
Query: 118 AARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASL 175
A R R + + +PLPDE+ R EI+ R + + V D+EE+ +T SGAD+ ++
Sbjct: 286 AILRPGRFDRIIEVPLPDEEGRLEILKIHTRKMNLADDV-DLEELAELTEGASGADLKAI 344
Query: 176 CREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVR 213
C EA + +R D ++++DF A++ V
Sbjct: 345 CTEAGMFAIRD---------DRTEVTMEDFLKAIEKVM 373
Score = 57.9 bits (141), Expect = 2e-09
Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 7/85 (8%)
Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRP 279
PSIIFIDEID++ +R+D R R + L +DG + V +I ATNR
Sbjct: 223 KAPSIIFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLAEMDGFDPRGN--VKIIAATNRI 280
Query: 280 QELDEAARR--RLVKRLYIPLPDEQ 302
LD A R R + + +PLPDE+
Sbjct: 281 DILDPAILRPGRFDRIIEVPLPDEE 305
>gnl|CDD|233328 TIGR01243, CDC48, AAA family ATPase, CDC48 subfamily. This
subfamily of the AAA family ATPases includes two members
each from three archaeal species. It also includes yeast
CDC48 (cell division control protein 48) and the human
ortholog, transitional endoplasmic reticulum ATPase
(valosin-containing protein). These proteins in
eukaryotes are involved in the budding and transfer of
membrane from the transitional endoplasmic reticulum to
the Golgi apparatus.
Length = 733
Score = 163 bits (413), Expect = 7e-45
Identities = 92/243 (37%), Positives = 141/243 (58%), Gaps = 15/243 (6%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL+GPPGTGKTL+ K VA + A F I+ + SK+YGE E+ +R +F A + PSI
Sbjct: 215 VLLYGPPGTGKTLLAKAVANEAGAYFISINGPEIMSKYYGESEERLREIFKEAEENAPSI 274
Query: 61 IFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAAR 120
IFIDEID++ +R + E +R+ + L +DG V+VIGATNRP LD A R
Sbjct: 275 IFIDEIDAIAPKREEVTGEVEKRVVAQLLTLMDGLK--GRGRVIVIGATNRPDALDPALR 332
Query: 121 R--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCRE 178
R R + + I +PD++AR EI+ RN+ + V D++++ +T F GAD+A+L +E
Sbjct: 333 RPGRFDREIVIRVPDKRARKEILKVHTRNMPLAEDV-DLDKLAEVTHGFVGADLAALAKE 391
Query: 179 ASLGPVRS-IDLSRID---------ALDVRPISIDDFRDALKTVRPSVCQADFVHQPSII 228
A++ +R I +I+ L +++ DF +ALK V PS + V P++
Sbjct: 392 AAMAALRRFIREGKINFEAEEIPAEVLKELKVTMKDFMEALKMVEPSAIREVLVEVPNVR 451
Query: 229 FID 231
+ D
Sbjct: 452 WSD 454
Score = 156 bits (396), Expect = 1e-42
Identities = 87/232 (37%), Positives = 132/232 (56%), Gaps = 15/232 (6%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LLFGPPGTGKTL+ K VA + A F + + SKW GE EK +R +F A P+I
Sbjct: 490 VLLFGPPGTGKTLLAKAVATESGANFIAVRGPEILSKWVGESEKAIREIFRKARQAAPAI 549
Query: 61 IFIDEIDSLLCQRSDQENET-SRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAA 119
IF DEID++ R + + + + R+ + L +DG L + V+VI ATNRP LD A
Sbjct: 550 IFFDEIDAIAPARGARFDTSVTDRIVNQLLTEMDGIQELSN--VVVIAATNRPDILDPAL 607
Query: 120 RR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCR 177
R R + + +P PDE+AR EI R++ + V D+EE+ MT ++GAD+ ++CR
Sbjct: 608 LRPGRFDRLILVPPPDEEARKEIFKIHTRSMPLAEDV-DLEELAEMTEGYTGADIEAVCR 666
Query: 178 EASLGPVRS---------IDLSRIDALDVRPISIDDFRDALKTVRPSVCQAD 220
EA++ +R +++ + L + + F +ALK V+PSV + D
Sbjct: 667 EAAMAALRESIGSPAKEKLEVGEEEFLKDLKVEMRHFLEALKKVKPSVSKED 718
Score = 61.8 bits (150), Expect = 2e-10
Identities = 34/80 (42%), Positives = 48/80 (60%), Gaps = 4/80 (5%)
Query: 225 PSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 284
PSIIFIDEID++ +R + E +R+ + L +DG V+VIGATNRP LD
Sbjct: 272 PSIIFIDEIDAIAPKREEVTGEVEKRVVAQLLTLMDGLK--GRGRVIVIGATNRPDALDP 329
Query: 285 AARR--RLVKRLYIPLPDEQ 302
A RR R + + I +PD++
Sbjct: 330 ALRRPGRFDREIVIRVPDKR 349
Score = 58.4 bits (141), Expect = 2e-09
Identities = 31/81 (38%), Positives = 47/81 (58%), Gaps = 5/81 (6%)
Query: 225 PSIIFIDEIDSLLCQRSDQENET-SRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELD 283
P+IIF DEID++ R + + + + R+ + L +DG L + V+VI ATNRP LD
Sbjct: 547 PAIIFFDEIDAIAPARGARFDTSVTDRIVNQLLTEMDGIQELSN--VVVIAATNRPDILD 604
Query: 284 EAARR--RLVKRLYIPLPDEQ 302
A R R + + +P PDE+
Sbjct: 605 PALLRPGRFDRLILVPPPDEE 625
>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family. Many
proteins may score above the trusted cutoff because an
internal.
Length = 364
Score = 145 bits (367), Expect = 2e-40
Identities = 84/217 (38%), Positives = 121/217 (55%), Gaps = 17/217 (7%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL+GPPGTGKTL+ K VA + ATF + S L K+ GEG ++VR +F +A PSI
Sbjct: 159 VLLYGPPGTGKTLLAKAVAHETNATFIRVVGSELVRKYIGEGARLVREIFELAKEKAPSI 218
Query: 61 IFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 117
IFIDEID++ +R+D R R + L LDG + V VI ATNRP LD
Sbjct: 219 IFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLAELDGFDPRGN--VKVIAATNRPDILDP 276
Query: 118 AARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASL 175
A R R + + +PLPD + R EI+ R + + V D+E + +MT SGAD+ ++
Sbjct: 277 ALLRPGRFDRIIEVPLPDFEGRLEILKIHTRKMKLAEDV-DLEAIAKMTEGASGADLKAI 335
Query: 176 CREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTV 212
C EA + +R + +++DDF A++ V
Sbjct: 336 CTEAGMFAIR--EERDY-------VTMDDFIKAVEKV 363
Score = 57.5 bits (139), Expect = 3e-09
Identities = 35/83 (42%), Positives = 45/83 (54%), Gaps = 7/83 (8%)
Query: 225 PSIIFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRPQE 281
PSIIFIDEID++ +R+D R R + L LDG + V VI ATNRP
Sbjct: 216 PSIIFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLAELDGFDPRGN--VKVIAATNRPDI 273
Query: 282 LDEAARR--RLVKRLYIPLPDEQ 302
LD A R R + + +PLPD +
Sbjct: 274 LDPALLRPGRFDRIIEVPLPDFE 296
>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
HflB(FtsH) is a pleiotropic protein required for correct
cell division in bacteria. It has ATP-dependent zinc
metalloprotease activity. It was formerly designated
cell division protein FtsH [Cellular processes, Cell
division, Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 495
Score = 132 bits (335), Expect = 7e-35
Identities = 82/206 (39%), Positives = 113/206 (54%), Gaps = 17/206 (8%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL GPPGTGKTL+ K VA + FF IS S + G G VR LF A + P I
Sbjct: 91 VLLVGPPGTGKTLLAKAVAGEAGVPFFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPCI 150
Query: 61 IFIDEIDSLLCQR-------SDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQ 113
IFIDEID++ QR +D+ +T +L L+ +DG T + V+VI ATNRP
Sbjct: 151 IFIDEIDAVGRQRGAGLGGGNDEREQTLNQL----LVEMDGFGT--NTGVIVIAATNRPD 204
Query: 114 ELDEAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGAD 171
LD A R R +++ + LPD + R EI+ +N + V D++ V R T FSGAD
Sbjct: 205 VLDPALLRPGRFDRQVVVDLPDIKGREEILKVHAKNKKLAPDV-DLKAVARRTPGFSGAD 263
Query: 172 MASLCREASLGPVRSIDLSRIDALDV 197
+A+L EA+L R + + I D+
Sbjct: 264 LANLLNEAALLAARK-NKTEITMNDI 288
Score = 52.3 bits (126), Expect = 2e-07
Identities = 34/87 (39%), Positives = 49/87 (56%), Gaps = 15/87 (17%)
Query: 223 HQPSIIFIDEIDSLLCQR-------SDQENETSRRLKTEFLISLDGASTLDDDLVLVIGA 275
+ P IIFIDEID++ QR +D+ +T +L L+ +DG T + V+VI A
Sbjct: 146 NAPCIIFIDEIDAVGRQRGAGLGGGNDEREQTLNQL----LVEMDGFGT--NTGVIVIAA 199
Query: 276 TNRPQELDEAARR--RLVKRLYIPLPD 300
TNRP LD A R R +++ + LPD
Sbjct: 200 TNRPDVLDPALLRPGRFDRQVVVDLPD 226
>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
protein; Provisional.
Length = 438
Score = 130 bits (328), Expect = 2e-34
Identities = 84/227 (37%), Positives = 116/227 (51%), Gaps = 33/227 (14%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
++L+GPPGTGKTL+ K VA + ATF + S L K+ G+G K+VR LF VA + PSI
Sbjct: 220 VILYGPPGTGKTLLAKAVANETSATFLRVVGSELIQKYLGDGPKLVRELFRVAEENAPSI 279
Query: 61 IFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 117
+FIDEID++ +R D + + R E L LDG + D V VI ATNR + LD
Sbjct: 280 VFIDEIDAIGTKRYDATSGGEKEIQRTMLELLNQLDGFDSRGD--VKVIMATNRIESLDP 337
Query: 118 AARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASL 175
A R R+ +++ P PDE+ + I +T + V D+EE + SGAD+ ++
Sbjct: 338 ALIRPGRIDRKIEFPNPDEKTKRRIFEIHTSKMTLAEDV-DLEEFIMAKDELSGADIKAI 396
Query: 176 CREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPSVCQADFV 222
C EA L AL+ R V QADF
Sbjct: 397 CTEAGL-------------------------LALRERRMKVTQADFR 418
Score = 55.5 bits (134), Expect = 1e-08
Identities = 34/85 (40%), Positives = 48/85 (56%), Gaps = 7/85 (8%)
Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRP 279
+ PSI+FIDEID++ +R D + + R E L LDG + D V VI ATNR
Sbjct: 275 NAPSIVFIDEIDAIGTKRYDATSGGEKEIQRTMLELLNQLDGFDSRGD--VKVIMATNRI 332
Query: 280 QELDEAARR--RLVKRLYIPLPDEQ 302
+ LD A R R+ +++ P PDE+
Sbjct: 333 ESLDPALIRPGRIDRKIEFPNPDEK 357
>gnl|CDD|240423 PTZ00454, PTZ00454, 26S protease regulatory subunit 6B-like
protein; Provisional.
Length = 398
Score = 124 bits (314), Expect = 1e-32
Identities = 76/191 (39%), Positives = 106/191 (55%), Gaps = 10/191 (5%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL+GPPGTGKT++ K VA ATF + S K+ GEG +MVR +F +A + PSI
Sbjct: 182 VLLYGPPGTGKTMLAKAVAHHTTATFIRVVGSEFVQKYLGEGPRMVRDVFRLARENAPSI 241
Query: 61 IFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDL-VLVIGATNRPQELD 116
IFIDE+DS+ +R D + R R+ E L +DG D V VI ATNR LD
Sbjct: 242 IFIDEVDSIATKRFDAQTGADREVQRILLELLNQMDG---FDQTTNVKVIMATNRADTLD 298
Query: 117 EAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMAS 174
A R RL +++ PLPD + + I + + S V D+E+ S AD+A+
Sbjct: 299 PALLRPGRLDRKIEFPLPDRRQKRLIFQTITSKMNLSEEV-DLEDFVSRPEKISAADIAA 357
Query: 175 LCREASLGPVR 185
+C+EA + VR
Sbjct: 358 ICQEAGMQAVR 368
Score = 55.5 bits (134), Expect = 1e-08
Identities = 36/85 (42%), Positives = 47/85 (55%), Gaps = 9/85 (10%)
Query: 225 PSIIFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDL-VLVIGATNRPQ 280
PSIIFIDE+DS+ +R D + R R+ E L +DG D V VI ATNR
Sbjct: 239 PSIIFIDEVDSIATKRFDAQTGADREVQRILLELLNQMDG---FDQTTNVKVIMATNRAD 295
Query: 281 ELDEAARR--RLVKRLYIPLPDEQK 303
LD A R RL +++ PLPD ++
Sbjct: 296 TLDPALLRPGRLDRKIEFPLPDRRQ 320
>gnl|CDD|223541 COG0465, HflB, ATP-dependent Zn proteases [Posttranslational
modification, protein turnover, chaperones].
Length = 596
Score = 114 bits (288), Expect = 3e-28
Identities = 78/214 (36%), Positives = 115/214 (53%), Gaps = 19/214 (8%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY-GEGEKMVRALFAVASVHQPS 59
+LL GPPGTGKTL+ K VA + FF IS S + + G G VR LF A + P
Sbjct: 186 VLLVGPPGTGKTLLAKAVAGEAGVPFFSISGSD-FVEMFVGVGASRVRDLFEQAKKNAPC 244
Query: 60 IIFIDEIDSLLCQRSDQE---NETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELD 116
IIFIDEID++ QR N+ + + L+ +DG ++ V+VI ATNRP LD
Sbjct: 245 IIFIDEIDAVGRQRGAGLGGGNDEREQTLNQLLVEMDGFGG--NEGVIVIAATNRPDVLD 302
Query: 117 EAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMAS 174
A R R +++ + LPD + R +I+ +N + V D++++ R T FSGAD+A+
Sbjct: 303 PALLRPGRFDRQILVELPDIKGREQILKVHAKNKPLAEDV-DLKKIARGTPGFSGADLAN 361
Query: 175 LCREASLGPVRSIDLSRIDALDVRPISIDDFRDA 208
L EA+L R + + I++ D +A
Sbjct: 362 LLNEAALLAARR---------NKKEITMRDIEEA 386
Score = 52.3 bits (126), Expect = 2e-07
Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 7/83 (8%)
Query: 223 HQPSIIFIDEIDSLLCQRSDQE---NETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP 279
+ P IIFIDEID++ QR N+ + + L+ +DG ++ V+VI ATNRP
Sbjct: 241 NAPCIIFIDEIDAVGRQRGAGLGGGNDEREQTLNQLLVEMDGFGG--NEGVIVIAATNRP 298
Query: 280 QELDEAARR--RLVKRLYIPLPD 300
LD A R R +++ + LPD
Sbjct: 299 DVLDPALLRPGRFDRQILVELPD 321
>gnl|CDD|182683 PRK10733, hflB, ATP-dependent metalloprotease; Reviewed.
Length = 644
Score = 112 bits (282), Expect = 2e-27
Identities = 72/194 (37%), Positives = 106/194 (54%), Gaps = 16/194 (8%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+L+ GPPGTGKTL+ K +A + K FF IS S + G G VR +F A P I
Sbjct: 188 VLMVGPPGTGKTLLAKAIAGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKAAPCI 247
Query: 61 IFIDEIDSLLCQRS-------DQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQ 113
IFIDEID++ QR D+ +T ++ L+ +DG ++ ++VI ATNRP
Sbjct: 248 IFIDEIDAVGRQRGAGLGGGHDEREQTLNQM----LVEMDGFE--GNEGIIVIAATNRPD 301
Query: 114 ELDEAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGAD 171
LD A R R +++ + LPD + R +I+ +R + + + D + R T FSGAD
Sbjct: 302 VLDPALLRPGRFDRQVVVGLPDVRGREQILKVHMRRVPLAPDI-DAAIIARGTPGFSGAD 360
Query: 172 MASLCREASLGPVR 185
+A+L EA+L R
Sbjct: 361 LANLVNEAALFAAR 374
Score = 44.3 bits (104), Expect = 6e-05
Identities = 31/85 (36%), Positives = 47/85 (55%), Gaps = 15/85 (17%)
Query: 225 PSIIFIDEIDSLLCQRS-------DQENETSRRLKTEFLISLDGASTLDDDLVLVIGATN 277
P IIFIDEID++ QR D+ +T ++ L+ +DG ++ ++VI ATN
Sbjct: 245 PCIIFIDEIDAVGRQRGAGLGGGHDEREQTLNQM----LVEMDGFE--GNEGIIVIAATN 298
Query: 278 RPQELDEAARR--RLVKRLYIPLPD 300
RP LD A R R +++ + LPD
Sbjct: 299 RPDVLDPALLRPGRFDRQVVVGLPD 323
>gnl|CDD|224144 COG1223, COG1223, Predicted ATPase (AAA+ superfamily) [General
function prediction only].
Length = 368
Score = 109 bits (275), Expect = 2e-27
Identities = 71/221 (32%), Positives = 107/221 (48%), Gaps = 22/221 (9%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+L +GPPGTGKT++ K +A + K + A+ L + G+G + + L+ A P I
Sbjct: 154 VLFYGPPGTGKTMMAKALANEAKVPLLLVKATELIGEHVGDGARRIHELYERARKAAPCI 213
Query: 61 IFIDEIDSLLCQRSDQE-----NETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 115
+FIDE+D++ R QE +E L TE LDG +++ V+ I ATNRP+ L
Sbjct: 214 VFIDELDAIALDRRYQELRGDVSEIVNALLTE----LDG--IKENEGVVTIAATNRPELL 267
Query: 116 DEAARRRLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVD-DIEEVGRMTTDFSGADMAS 174
D A R R + + LP+++ R EI+ + L VD D+ + T SG D+
Sbjct: 268 DPAIRSRFEEEIEFKLPNDEERLEILEYYAK--KFPLPVDADLRYLAAKTKGMSGRDI-- 323
Query: 175 LCREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPS 215
+E L L R A D + +D ALK R
Sbjct: 324 --KEKVLKTA----LHRAIAEDREKVEREDIEKALKKERKR 358
Score = 50.2 bits (120), Expect = 7e-07
Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 11/84 (13%)
Query: 225 PSIIFIDEIDSLLCQRSDQE-----NETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP 279
P I+FIDE+D++ R QE +E L TE LDG +++ V+ I ATNRP
Sbjct: 211 PCIVFIDELDAIALDRRYQELRGDVSEIVNALLTE----LDG--IKENEGVVTIAATNRP 264
Query: 280 QELDEAARRRLVKRLYIPLPDEQK 303
+ LD A R R + + LP++++
Sbjct: 265 ELLDPAIRSRFEEEIEFKLPNDEE 288
>gnl|CDD|214386 CHL00176, ftsH, cell division protein; Validated.
Length = 638
Score = 107 bits (270), Expect = 8e-26
Identities = 76/203 (37%), Positives = 106/203 (52%), Gaps = 12/203 (5%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL GPPGTGKTL+ K +A + + FF IS S + G G VR LF A + P I
Sbjct: 219 VLLVGPPGTGKTLLAKAIAGEAEVPFFSISGSEFVEMFVGVGAARVRDLFKKAKENSPCI 278
Query: 61 IFIDEIDSLLCQRSDQ---ENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 117
+FIDEID++ QR N+ + + L +DG V+VI ATNR LD
Sbjct: 279 VFIDEIDAVGRQRGAGIGGGNDEREQTLNQLLTEMDGFKGNKG--VIVIAATNRVDILDA 336
Query: 118 AARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASL 175
A R R +++ + LPD + R +I+ RN S V +E + R T FSGAD+A+L
Sbjct: 337 ALLRPGRFDRQITVSLPDREGRLDILKVHARNKKLSPDV-SLELIARRTPGFSGADLANL 395
Query: 176 CREASLGPVR----SIDLSRIDA 194
EA++ R +I + ID
Sbjct: 396 LNEAAILTARRKKATITMKEIDT 418
Score = 45.0 bits (107), Expect = 4e-05
Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 7/81 (8%)
Query: 225 PSIIFIDEIDSLLCQRSDQ---ENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQE 281
P I+FIDEID++ QR N+ + + L +DG V+VI ATNR
Sbjct: 276 PCIVFIDEIDAVGRQRGAGIGGGNDEREQTLNQLLTEMDGFKGNKG--VIVIAATNRVDI 333
Query: 282 LDEAARR--RLVKRLYIPLPD 300
LD A R R +++ + LPD
Sbjct: 334 LDAALLRPGRFDRQITVSLPD 354
>gnl|CDD|200312 TIGR03689, pup_AAA, proteasome ATPase. In the Actinobacteria, as
shown for Mycobacterium tuberculosis, some proteins are
modified by ligation between an epsilon-amino group of a
lysine side chain and the C-terminal carboxylate of the
ubiquitin-like protein Pup. This modification leads to
protein degradation by the archaeal-like proteasome
found in the Actinobacteria. Members of this protein
family belong to the AAA family of ATPases and tend to
be clustered with the genes for Pup, the Pup ligase
PafA, and structural components of the proteasome. This
protein forms hexameric rings with ATPase activity
[Protein fate, Degradation of proteins, peptides, and
glycopeptides].
Length = 512
Score = 92.1 bits (229), Expect = 1e-20
Identities = 63/166 (37%), Positives = 85/166 (51%), Gaps = 27/166 (16%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQC----------KATFFCISASTLTSKWYGEGEKMVRALF 50
+LL+GPPG GKTLI K VA K+ F I L +K+ GE E+ +R +F
Sbjct: 219 VLLYGPPGCGKTLIAKAVANSLAARIGAEGGGKSYFLNIKGPELLNKYVGETERQIRLIF 278
Query: 51 A----VASVHQPSIIFIDEIDSLLCQR-----SDQENETSRRLKTEFLISLDGASTLDDD 101
AS +P I+F DE+DSL R SD E +L E +DG +LD+
Sbjct: 279 QRAREKASEGRPVIVFFDEMDSLFRTRGSGVSSDVETTVVPQLLAE----IDGVESLDN- 333
Query: 102 LVLVIGATNRPQELDEAARR--RLVKRLYIPLPDEQARCEIVTKLL 145
V+VIGA+NR +D A R RL ++ I PD +A +I K L
Sbjct: 334 -VIVIGASNREDMIDPAILRPGRLDVKIRIERPDAEAAADIFAKYL 378
Score = 46.2 bits (110), Expect = 2e-05
Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 13/85 (15%)
Query: 225 PSIIFIDEIDSLLCQR-----SDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP 279
P I+F DE+DSL R SD E +L E +DG +LD+ V+VIGA+NR
Sbjct: 290 PVIVFFDEMDSLFRTRGSGVSSDVETTVVPQLLAE----IDGVESLDN--VIVIGASNRE 343
Query: 280 QELDEAARR--RLVKRLYIPLPDEQ 302
+D A R RL ++ I PD +
Sbjct: 344 DMIDPAILRPGRLDVKIRIERPDAE 368
>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
cellular Activities) superfamily represents an ancient
group of ATPases belonging to the ASCE (for additional
strand, catalytic E) division of the P-loop NTPase fold.
The ASCE division also includes ABC, RecA-like,
VirD4-like, PilT-like, and SF1/2 helicases. Members of
the AAA+ ATPases function as molecular chaperons, ATPase
subunits of proteases, helicases, or nucleic-acid
stimulated ATPases. The AAA+ proteins contain several
distinct features in addition to the conserved
alpha-beta-alpha core domain structure and the Walker A
and B motifs of the P-loop NTPases.
Length = 151
Score = 86.4 bits (214), Expect = 1e-20
Identities = 50/139 (35%), Positives = 67/139 (48%), Gaps = 17/139 (12%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQC---KATFFCISASTLTSKWYGEGEK---MVRALFAVAS 54
+LL+GPPGTGKT + + +A + A F ++AS L +VR LF +A
Sbjct: 22 LLLYGPPGTGKTTLARAIANELFRPGAPFLYLNASDLLEGLVVAELFGHFLVRLLFELAE 81
Query: 55 VHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP-- 112
+P ++FIDEIDSL L D ++ V VIGATNRP
Sbjct: 82 KAKPGVLFIDEIDSL-------SRGAQNALLRVLETLNDLRIDREN--VRVIGATNRPLL 132
Query: 113 QELDEAARRRLVKRLYIPL 131
+LD A RL R+ IPL
Sbjct: 133 GDLDRALYDRLDIRIVIPL 151
Score = 44.4 bits (105), Expect = 1e-05
Identities = 29/78 (37%), Positives = 36/78 (46%), Gaps = 11/78 (14%)
Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP--Q 280
+P ++FIDEIDSL L D ++ V VIGATNRP
Sbjct: 83 AKPGVLFIDEIDSL-------SRGAQNALLRVLETLNDLRIDREN--VRVIGATNRPLLG 133
Query: 281 ELDEAARRRLVKRLYIPL 298
+LD A RL R+ IPL
Sbjct: 134 DLDRALYDRLDIRIVIPL 151
>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
activities. AAA - ATPases associated with a variety of
cellular activities. This profile/alignment only detects
a fraction of this vast family. The poorly conserved
N-terminal helix is missing from the alignment.
Length = 148
Score = 67.4 bits (164), Expect = 1e-13
Identities = 38/151 (25%), Positives = 61/151 (40%), Gaps = 25/151 (16%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFC---ISASTLTS--------------KWYGEGE 43
IL+ GPPG+GKT + + +A + I + K G GE
Sbjct: 5 ILIVGPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKKASGSGE 64
Query: 44 KMVRALFAVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLV 103
+R A+A +P ++ +DEI SLL + L+ L+ + +
Sbjct: 65 LRLRLALALARKLKPDVLILDEITSLLDAEQEALLLLLEELRLLLLLKSEKN-------L 117
Query: 104 LVIGATNRPQELDEAA-RRRLVKRLYIPLPD 133
VI TN ++L A RRR +R+ + L
Sbjct: 118 TVILTTNDEKDLGPALLRRRFDRRIVLLLIL 148
Score = 39.3 bits (91), Expect = 6e-04
Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 8/79 (10%)
Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 282
+P ++ +DEI SLL + L+ L+ + + VI TN ++L
Sbjct: 77 LKPDVLILDEITSLLDAEQEALLLLLEELRLLLLLKSEKN-------LTVILTTNDEKDL 129
Query: 283 DEAA-RRRLVKRLYIPLPD 300
A RRR +R+ + L
Sbjct: 130 GPALLRRRFDRRIVLLLIL 148
>gnl|CDD|177094 CHL00195, ycf46, Ycf46; Provisional.
Length = 489
Score = 51.2 bits (123), Expect = 4e-07
Identities = 47/175 (26%), Positives = 78/175 (44%), Gaps = 8/175 (4%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL G GTGK+L K +A + + L GE E +R + +A P I
Sbjct: 262 LLLVGIQGTGKSLTAKAIANDWQLPLLRLDVGKLFGGIVGESESRMRQMIRIAEALSPCI 321
Query: 61 IFIDEIDSLLCQ-RSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELD-EA 118
++IDEID S ++ T+ R+ F+ L + V V+ N L E
Sbjct: 322 LWIDEIDKAFSNSESKGDSGTTNRVLATFITWLSEKKS----PVFVVATANNIDLLPLEI 377
Query: 119 ARR-RLVKRLYIPLPDEQARCEIVTKLLRNI-TNSLTVDDIEEVGRMTTDFSGAD 171
R+ R + ++ LP + R +I L+ S DI+++ +++ FSGA+
Sbjct: 378 LRKGRFDEIFFLDLPSLEEREKIFKIHLQKFRPKSWKKYDIKKLSKLSNKFSGAE 432
>gnl|CDD|237355 PRK13342, PRK13342, recombination factor protein RarA; Reviewed.
Length = 413
Score = 50.5 bits (122), Expect = 6e-07
Identities = 43/171 (25%), Positives = 64/171 (37%), Gaps = 69/171 (40%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVA----SVH 56
++L+GPPGTGKT + + +A A F +SA T G K +R + A S
Sbjct: 39 MILWGPPGTGKTTLARIIAGATDAPFEALSAVT-------SGVKDLREVIEEARQRRSAG 91
Query: 57 QPSIIFIDEI--------DSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGA 108
+ +I+FIDEI D+LL ++D + +IGA
Sbjct: 92 RRTILFIDEIHRFNKAQQDALL-------------------------PHVEDGTITLIGA 126
Query: 109 T--------NRP----------QELDEAARRRLVKR-------LYIPLPDE 134
T N + L E +L+KR + L DE
Sbjct: 127 TTENPSFEVNPALLSRAQVFELKPLSEEDIEQLLKRALEDKERGLVELDDE 177
>gnl|CDD|232949 TIGR00382, clpX, endopeptidase Clp ATP-binding regulatory subunit
(clpX). A member of the ATP-dependent proteases, ClpX
has ATP-dependent chaperone activity and is required for
specific ATP-dependent proteolytic activities expressed
by ClpPX. The gene is also found to be involved in
stress tolerance in Bacillus subtilis and is essential
for the efficient acquisition of genes specifying type
IA and IB restriction [Protein fate, Protein folding and
stabilization, Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 413
Score = 46.3 bits (110), Expect = 1e-05
Identities = 30/73 (41%), Positives = 38/73 (52%), Gaps = 6/73 (8%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY-GEG-EKMVRALFAVA----S 54
ILL GP G+GKTL+ + +A F A+TLT Y GE E ++ L A
Sbjct: 119 ILLIGPTGSGKTLLAQTLARILNVPFAIADATTLTEAGYVGEDVENILLKLLQAADYDVE 178
Query: 55 VHQPSIIFIDEID 67
Q II+IDEID
Sbjct: 179 KAQKGIIYIDEID 191
>gnl|CDD|225165 COG2256, MGS1, ATPase related to the helicase subunit of the
Holliday junction resolvase [DNA replication,
recombination, and repair].
Length = 436
Score = 42.6 bits (101), Expect = 2e-04
Identities = 30/111 (27%), Positives = 52/111 (46%), Gaps = 28/111 (25%)
Query: 3 LFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVA----SVHQP 58
L+GPPGTGKT + + +A A F +SA T G K +R + A + +
Sbjct: 53 LWGPPGTGKTTLARLIAGTTNAAFEALSAVT-------SGVKDLREIIEEARKNRLLGRR 105
Query: 59 SIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGAT 109
+I+F+DEI N+ + + L+ +++ +++IGAT
Sbjct: 106 TILFLDEIHRF--------NKA----QQDALLPH-----VENGTIILIGAT 139
>gnl|CDD|235422 PRK05342, clpX, ATP-dependent protease ATP-binding subunit ClpX;
Provisional.
Length = 412
Score = 42.1 bits (100), Expect = 3e-04
Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 16/78 (20%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATF----FCIS-ASTLT-SKWYGEG-EKMVRALFAVA 53
ILL GP G+GKTL+ AQ A F I+ A+TLT + + GE E ++ L A
Sbjct: 111 ILLIGPTGSGKTLL-----AQTLARILDVPFAIADATTLTEAGYVGEDVENILLKLLQAA 165
Query: 54 --SVH--QPSIIFIDEID 67
V Q I++IDEID
Sbjct: 166 DYDVEKAQRGIVYIDEID 183
>gnl|CDD|223786 COG0714, COG0714, MoxR-like ATPases [General function prediction
only].
Length = 329
Score = 41.3 bits (97), Expect = 4e-04
Identities = 44/182 (24%), Positives = 72/182 (39%), Gaps = 32/182 (17%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCIS------ASTLTSKW--------YGEGEKMV 46
+LL GPPG GKTL+ + +A F I S L + GE +
Sbjct: 46 VLLEGPPGVGKTLLARALARALGLPFVRIQCTPDLLPSDLLGTYAYAALLLEPGEFRFVP 105
Query: 47 RALFAVASVHQPSIIFIDEIDSLLCQRSDQENETS--RRLKTEFLISLDGASTLD-DDLV 103
LFA V I+ +DEI+ R+ E + + L+ E +++ G +T+
Sbjct: 106 GPLFAAVRV----ILLLDEIN-----RAPPEVQNALLEALE-ERQVTVPGLTTIRLPPPF 155
Query: 104 LVIGATNRP-----QELDEAARRRLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIE 158
+VI N L EA R + R+Y+ PD + I+ + + ++
Sbjct: 156 IVIATQNPGEYEGTYPLPEALLDRFLLRIYVDYPDSEEEERIILARVGGVDELDLESLVK 215
Query: 159 EV 160
V
Sbjct: 216 PV 217
>gnl|CDD|224140 COG1219, ClpX, ATP-dependent protease Clp, ATPase subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 408
Score = 41.0 bits (97), Expect = 6e-04
Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 8/74 (10%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY-GEG-----EKMVRAL-FAVA 53
ILL GP G+GKTL+ + +A F A+TLT Y GE K+++A + V
Sbjct: 100 ILLIGPTGSGKTLLAQTLAKILNVPFAIADATTLTEAGYVGEDVENILLKLLQAADYDVE 159
Query: 54 SVHQPSIIFIDEID 67
+ II+IDEID
Sbjct: 160 RA-ERGIIYIDEID 172
>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
nucleotide-binding domain. ABC transporters are a large
family of proteins involved in the transport of a wide
variety of different compounds, like sugars, ions,
peptides, and more complex organic molecules. The
nucleotide-binding domain shows the highest similarity
between all members of the family. ABC transporters are
a subset of nucleotide hydrolases that contain a
signature motif, Q-loop, and H-loop/switch region, in
addition to, the Walker A motif/P-loop and Walker B
motif commonly found in a number of ATP- and GTP-binding
and hydrolyzing proteins.
Length = 157
Score = 38.0 bits (89), Expect = 0.002
Identities = 29/139 (20%), Positives = 50/139 (35%), Gaps = 29/139 (20%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQC-----KATFFCISASTLTSKWYGE----------GEKM 45
+ L GP G+GK+ + + +A + + L + G++
Sbjct: 28 VALVGPNGSGKSTLLRAIAGLLKPTSGEILIDGKDIAKLPLEELRRRIGYVPQLSGGQRQ 87
Query: 46 VRALFAVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLV 105
R A A + P ++ +DE S L D + RL + T V
Sbjct: 88 -RVALARALLLNPDLLLLDEPTSGL----DPASR--ERLLELLRELAEEGRT-------V 133
Query: 106 IGATNRPQELDEAARRRLV 124
I T+ P+ + AA R +V
Sbjct: 134 IIVTHDPELAELAADRVIV 152
Score = 27.6 bits (62), Expect = 6.1
Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 13/71 (18%)
Query: 221 FVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQ 280
+ P ++ +DE S L D + RL + T VI T+ P+
Sbjct: 95 LLLNPDLLLLDEPTSGL----DPASR--ERLLELLRELAEEGRT-------VIIVTHDPE 141
Query: 281 ELDEAARRRLV 291
+ AA R +V
Sbjct: 142 LAELAADRVIV 152
>gnl|CDD|131948 TIGR02902, spore_lonB, ATP-dependent protease LonB. Members of
this protein are LonB, a paralog of the ATP-dependent
protease La (LonA, TIGR00763). LonB proteins are found
strictly, and almost universally, in endospore-forming
bacteria. This protease was shown, in Bacillus subtilis,
to be expressed specifically in the forespore, during
sporulation, under control of sigma(F). The lonB gene,
despite location immediately upstream of lonA, was shown
to be monocistronic. LonB appears able to act on
sigma(H) for post-translation control, but lonB mutation
did not produce an obvious sporulation defect under the
conditions tested. Note that additional paralogs of LonA
and LonB occur in the Clostridium lineage and this model
selects only one per species as the protein that
corresponds to LonB in B. subtilis [Protein fate,
Degradation of proteins, peptides, and glycopeptides,
Cellular processes, Sporulation and germination].
Length = 531
Score = 37.8 bits (88), Expect = 0.007
Identities = 57/245 (23%), Positives = 99/245 (40%), Gaps = 56/245 (22%)
Query: 1 ILLFGPPGTGKTLIGKCV---AAQC-------KATFFCISASTLTSKWYGEGEKMVRAL- 49
++++GPPG GKT + V A + A F I A+T G + ++ ++
Sbjct: 89 VIIYGPPGVGKTAAARLVLEEAKKNPASPFKEGAAFVEIDATTARFDERGIADPLIGSVH 148
Query: 50 ---------FAVASVHQPS----------IIFIDEIDSLLCQRSDQENETSRRLKTEFLI 90
+A + QP ++FIDEI L Q N+ + L+ +
Sbjct: 149 DPIYQGAGPLGIAGIPQPKPGAVTRAHGGVLFIDEIGEL---HPVQMNKLLKVLEDR-KV 204
Query: 91 SLDGASTLDDDLVL-----------------VIGATNR-PQELDEAARRRLVKRLYIPLP 132
LD A ++ + +IGAT R P+E+ A R R V+ + PL
Sbjct: 205 FLDSAYYNSENPNIPSHIHDIFQNGLPADFRLIGATTRNPEEIPPALRSRCVEIFFRPLL 264
Query: 133 DEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCREASLGPVRSIDLSRI 192
DE+ + EI I +L +E + + + +G + ++ + A+ G RI
Sbjct: 265 DEEIK-EIAKNAAEKIGINLEKHALELIVKYAS--NGREAVNIVQLAA-GIALGEGRKRI 320
Query: 193 DALDV 197
A D+
Sbjct: 321 LAEDI 325
>gnl|CDD|188437 TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA. This
model represents the AAA family ATPase, EccA, of the
actinobacterial flavor of type VII secretion systems.
Species such as Mycobacterium tuberculosis have several
instances of this system per genome, designated EccA1,
EccA2, etc [Protein fate, Protein and peptide secretion
and trafficking].
Length = 557
Score = 37.9 bits (88), Expect = 0.008
Identities = 40/148 (27%), Positives = 62/148 (41%), Gaps = 21/148 (14%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQC-------KATFFCISASTLTSKWYGEGEKMVRALFAVA 53
+L GPPGTGKT I + VA K +S + L ++ GE E + A
Sbjct: 315 MLFAGPPGTGKTTIARVVAKIYCGLGVLRKPLVREVSRADLIGQYIGESEAKTNEIIDSA 374
Query: 54 SVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQ 113
++F+DE +L+ Q++ L ++ D D ++VIGA R
Sbjct: 375 ---LGGVLFLDEAYTLVETGYGQKDPFGLEAIDTLLARMEN----DRDRLVVIGAGYRKD 427
Query: 114 -----ELDEAARRRLVKRLYIP--LPDE 134
E++E R R + + P PDE
Sbjct: 428 LDKFLEVNEGLRSRFTRVIEFPSYSPDE 455
>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily). This Pfam
entry includes some of the AAA proteins not detected by
the pfam00004 model.
Length = 135
Score = 35.0 bits (81), Expect = 0.018
Identities = 31/137 (22%), Positives = 45/137 (32%), Gaps = 17/137 (12%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQ-CKATFFCI------SASTLTSKW-YGEGEKMVRALFAV 52
+LL GPPGTGK+ + + +AA F + + L + G V
Sbjct: 2 VLLVGPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPLV 61
Query: 53 ASVHQPSIIFIDEIDSLLCQRSDQENETSRRL--KTEFLISLDGASTLDDDLVLVIGATN 110
+ + I +DEI+ D N L + L D +I N
Sbjct: 62 RAAREGEIAVLDEINRA---NPDVLNSLLSLLDERRLLLPEGGELVKAAPDGFRLIATMN 118
Query: 111 RP----QELDEAARRRL 123
EL A R R
Sbjct: 119 PLDRGLNELSPALRSRF 135
>gnl|CDD|129721 TIGR00635, ruvB, Holliday junction DNA helicase, RuvB subunit.
All proteins in this family for which functions are
known are 5'-3' DNA helicases that, as part of a
complex with RuvA homologs serve as a 5'-3' Holliday
junction helicase. RuvA specifically binds Holliday
junctions as a sandwich of two tetramers and maintains
the configuration of the junction. It forms a complex
with two hexameric rings of RuvB, the subunit that
contains helicase activity. The complex drives
ATP-dependent branch migration of the Holliday junction
recombination intermediate. The endonuclease RuvC
resolves junctions [DNA metabolism, DNA replication,
recombination, and repair].
Length = 305
Score = 35.7 bits (83), Expect = 0.022
Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 8/69 (11%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LL+GPPG GKT + +A + S L EK + ++ + +
Sbjct: 33 LLLYGPPGLGKTTLAHIIANEMGVNLKITSGPAL--------EKPGDLAAILTNLEEGDV 84
Query: 61 IFIDEIDSL 69
+FIDEI L
Sbjct: 85 LFIDEIHRL 93
>gnl|CDD|234619 PRK00080, ruvB, Holliday junction DNA helicase RuvB; Reviewed.
Length = 328
Score = 35.5 bits (83), Expect = 0.031
Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 10/70 (14%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQP-S 59
+LL+GPPG GKT + +A + S L EK L A+ + +
Sbjct: 54 VLLYGPPGLGKTTLANIIANEMGVNIRITSGPAL--------EK-PGDLAAILTNLEEGD 104
Query: 60 IIFIDEIDSL 69
++FIDEI L
Sbjct: 105 VLFIDEIHRL 114
>gnl|CDD|225164 COG2255, RuvB, Holliday junction resolvasome, helicase subunit [DNA
replication, recombination, and repair].
Length = 332
Score = 35.2 bits (82), Expect = 0.042
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 8/66 (12%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
+LLFGPPG GKT + +A + S L EK + ++ + +
Sbjct: 55 VLLFGPPGLGKTTLAHIIANELGVNLKITSGPAL--------EKPGDLAAILTNLEEGDV 106
Query: 61 IFIDEI 66
+FIDEI
Sbjct: 107 LFIDEI 112
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 35.0 bits (81), Expect = 0.061
Identities = 22/83 (26%), Positives = 33/83 (39%), Gaps = 20/83 (24%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGE--------------KMV 46
+ L GPPG GKT +GK +A F S + E E +++
Sbjct: 350 LCLVGPPGVGKTSLGKSIAKALNRKFVRFSLGGVRD----EAEIRGHRRTYVGAMPGRII 405
Query: 47 RALFAVASVHQPSIIFIDEIDSL 69
+ L A P + +DEID +
Sbjct: 406 QGL-KKAKTKNP-LFLLDEIDKI 426
>gnl|CDD|238979 cd02021, GntK, Gluconate kinase (GntK) catalyzes the phosphoryl
transfer from ATP to gluconate. The resulting product
gluconate-6-phoshate is an important precursor of
gluconate metabolism. GntK acts as a dimmer composed of
two identical subunits.
Length = 150
Score = 33.4 bits (77), Expect = 0.065
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFF 27
I++ G G+GK+ +GK +A + A F
Sbjct: 2 IVVMGVSGSGKSTVGKALAERLGAPFI 28
>gnl|CDD|219536 pfam07724, AAA_2, AAA domain (Cdc48 subfamily). This Pfam entry
includes some of the AAA proteins not detected by the
pfam00004 model.
Length = 168
Score = 33.7 bits (78), Expect = 0.069
Identities = 21/84 (25%), Positives = 30/84 (35%), Gaps = 21/84 (25%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQ---CKATFFCISASTLTSKW--------------YGEGE 43
L GP G GKT + K +A + I S + Y EG
Sbjct: 6 FLFLGPTGVGKTELAKALAELLFGDERALIRIDMSEYMEEHSVSRLIGAPPGYVGYEEGG 65
Query: 44 KMVRALFAVASVHQPSIIFIDEID 67
++ A+ SI+ IDEI+
Sbjct: 66 QLTEAV----RRKPYSIVLIDEIE 85
>gnl|CDD|222104 pfam13401, AAA_22, AAA domain.
Length = 124
Score = 32.6 bits (75), Expect = 0.089
Identities = 16/85 (18%), Positives = 30/85 (35%), Gaps = 16/85 (18%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQ-CKATFFCISASTLTS---------------KWYGEGEK 44
+L G G+GKT + + +A Q + A +L + G +
Sbjct: 7 GVLTGESGSGKTTLLRRLARQLPNRRVVYVEAPSLGTPKDLLRKILRALGLPLSGGTTAE 66
Query: 45 MVRALFAVASVHQPSIIFIDEIDSL 69
++ A+ ++ IDE L
Sbjct: 67 LLEAILDALKRRGRPLLIIDEAQHL 91
>gnl|CDD|144608 pfam01078, Mg_chelatase, Magnesium chelatase, subunit ChlI.
Magnesium-chelatase is a three-component enzyme that
catalyzes the insertion of Mg2+ into protoporphyrin IX.
This is the first unique step in the synthesis of
(bacterio)chlorophyll. Due to this, it is thought that
Mg-chelatase has an important role in channelling
inter- mediates into the (bacterio)chlorophyll branch
in response to conditions suitable for photosynthetic
growth. ChlI and BchD have molecular weight between
38-42 kDa.
Length = 207
Score = 33.3 bits (77), Expect = 0.10
Identities = 9/17 (52%), Positives = 14/17 (82%)
Query: 1 ILLFGPPGTGKTLIGKC 17
+L+ GPPG+GKT++ K
Sbjct: 25 LLMIGPPGSGKTMLAKR 41
>gnl|CDD|129465 TIGR00368, TIGR00368, Mg chelatase-related protein. The N-terminal
end matches very strongly a pfam Mg_chelatase domain
[Unknown function, General].
Length = 499
Score = 33.7 bits (77), Expect = 0.13
Identities = 10/18 (55%), Positives = 15/18 (83%)
Query: 1 ILLFGPPGTGKTLIGKCV 18
+LLFGPPG+GKT++ +
Sbjct: 214 LLLFGPPGSGKTMLASRL 231
>gnl|CDD|238260 cd00464, SK, Shikimate kinase (SK) is the fifth enzyme in the
shikimate pathway, a seven-step biosynthetic pathway
which converts erythrose-4-phosphate to chorismic acid,
found in bacteria, fungi and plants. Chorismic acid is
a important intermediate in the synthesis of aromatic
compounds, such as aromatic amino acids, p-aminobenzoic
acid, folate and ubiquinone. Shikimate kinase catalyses
the phosphorylation of the 3-hydroxyl group of shikimic
acid using ATP.
Length = 154
Score = 32.5 bits (75), Expect = 0.15
Identities = 10/27 (37%), Positives = 14/27 (51%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFF 27
I+L G G GKT +G+ +A F
Sbjct: 2 IVLIGMMGAGKTTVGRLLAKALGLPFV 28
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 33.7 bits (78), Expect = 0.17
Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 6/33 (18%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKAT---FFCIS 30
+ L GPPG GKT +GK +A KA F IS
Sbjct: 353 LCLVGPPGVGKTSLGKSIA---KALGRKFVRIS 382
>gnl|CDD|204202 pfam09336, Vps4_C, Vps4 C terminal oligomerisation domain. This
domain is found at the C terminal of ATPase proteins
involved in vacuolar sorting. It forms an alpha helix
structure and is required for oligomerisation.
Length = 62
Score = 29.9 bits (68), Expect = 0.24
Identities = 9/22 (40%), Positives = 15/22 (68%)
Query: 199 PISIDDFRDALKTVRPSVCQAD 220
P+++ DF AL T +P+V + D
Sbjct: 29 PLTMKDFIKALATTKPTVSKDD 50
>gnl|CDD|222165 pfam13481, AAA_25, AAA domain. This AAA domain is found in a wide
variety of presumed DNA repair proteins.
Length = 154
Score = 31.9 bits (73), Expect = 0.26
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 10/80 (12%)
Query: 1 ILLFGPPGTGKT--LIGKCVAAQCKATFFCISASTLTSKWYGEGE--------KMVRALF 50
LL G PGTGK+ + A F Y +GE ++
Sbjct: 36 TLLAGAPGTGKSTLALDLAAAVATGRPFLGPFPVEPGRVLYLDGEDSEAGLRRRLRALGE 95
Query: 51 AVASVHQPSIIFIDEIDSLL 70
A+ + P ++ ID + SLL
Sbjct: 96 ALEEIEGPDLVVIDPLASLL 115
>gnl|CDD|213527 TIGR00390, hslU, ATP-dependent protease HslVU, ATPase subunit.
This model represents the ATPase subunit of HslVU, while
the proteasome-related peptidase subunit is HslV.
Residues 54-61 of the model contain a P-loop ATP-binding
motif. Cys-287 of E. coli (position 308 in the seed
alignment), studied in MEDLINE:98389714, is Ser in other
members of the seed alignment [Protein fate, Protein
folding and stabilization].
Length = 441
Score = 32.5 bits (74), Expect = 0.31
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY--GEGEKMVRALFAVA 53
IL+ GP G GKT I + +A A F + A+ T Y + E MVR L +A
Sbjct: 50 ILMIGPTGVGKTEIARRLAKLANAPFIKVEATKFTEVGYVGRDVESMVRDLVDIA 104
Score = 28.6 bits (64), Expect = 5.6
Identities = 33/121 (27%), Positives = 50/121 (41%), Gaps = 31/121 (25%)
Query: 51 AVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGAST-------LDDDLV 103
A+ V Q IIFIDEID + + + SR L+ + ST + D +
Sbjct: 241 AIDRVEQSGIIFIDEIDKIAKKGESSGADVSREGVQRDLLPIVEGSTVNTKYGSVKTDHI 300
Query: 104 LVI--GATN--RPQELDEAARRRLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEE 159
L I GA + +P +L +P+ Q R I +L +LT+DD E
Sbjct: 301 LFIAAGAFHLAKPSDL---------------IPELQGRFPIRVEL-----KALTIDDFER 340
Query: 160 V 160
+
Sbjct: 341 I 341
>gnl|CDD|187579 cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha
subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical
(a) SDRs. This subgroup of extended SDR-like proteins
are atypical SDRs. They have a glycine-rich
NAD(P)-binding motif similar to the typical SDRs,
GXXGXXG, and have the YXXXK active site motif (though
not the other residues of the SDR tetrad). Members
identified include NDUFA9 (mitochondrial) and putative
nucleoside-diphosphate-sugar epimerase. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Atypical SDRs include
biliverdin IX beta reductase (BVR-B,aka flavin
reductase), NMRa (a negative transcriptional regulator
of various fungi), progesterone 5-beta-reductase like
proteins, phenylcoumaran benzylic ether and
pinoresinol-lariciresinol reductases, phenylpropene
synthases, eugenol synthase, triphenylmethane reductase,
isoflavone reductases, and others. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold, an NAD(P)(H)-binding region, and a structurally
diverse C-terminal region. Sequence identity between
different SDR enzymes is typically in the 15-30% range;
they catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. In addition to the
Rossmann fold core region typical of all SDRs, extended
SDRs have a less conserved C-terminal extension of
approximately 100 amino acids, and typically have a
TGXXGXXG cofactor binding motif. Complex (multidomain)
SDRs such as ketoreductase domains of fatty acid
synthase have a GGXGXXG NAD(P)-binding motif and an
altered active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 273
Score = 32.2 bits (74), Expect = 0.32
Identities = 14/32 (43%), Positives = 20/32 (62%), Gaps = 2/32 (6%)
Query: 34 LTSKWYGEGEKMVRALFAVASVHQPSIIFIDE 65
L SK EGE+ VR F A++ +PS++F E
Sbjct: 127 LRSK--AEGEEAVREAFPEATIVRPSVVFGRE 156
>gnl|CDD|222866 PHA02544, 44, clamp loader, small subunit; Provisional.
Length = 316
Score = 32.3 bits (74), Expect = 0.37
Identities = 10/32 (31%), Positives = 16/32 (50%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISAS 32
+L PGTGKT + K + + A ++ S
Sbjct: 46 LLHSPSPGTGKTTVAKALCNEVGAEVLFVNGS 77
>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
recombination, and repair].
Length = 254
Score = 32.0 bits (73), Expect = 0.38
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 11/73 (15%)
Query: 1 ILLFGPPGTGKTL----IGKCVAAQCKATFFCISASTLTSKW---YGEGEKMVRALFAVA 53
++L GPPG GKT IG + + F I+A L SK + EG + L +
Sbjct: 108 LVLLGPPGVGKTHLAIAIGNELLKAGISVLF-ITAPDLLSKLKAAFDEGRLEEKLLRELK 166
Query: 54 SVHQPSIIFIDEI 66
V ++ ID+I
Sbjct: 167 KV---DLLIIDDI 176
>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
Length = 268
Score = 31.5 bits (72), Expect = 0.55
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 2 LLFGPPGTGKTLIGKCVA 19
LL+G GTGKT + C+A
Sbjct: 118 LLWGSVGTGKTYLAACIA 135
>gnl|CDD|237354 PRK13341, PRK13341, recombination factor protein RarA/unknown
domain fusion protein; Reviewed.
Length = 725
Score = 31.9 bits (73), Expect = 0.58
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 16/72 (22%)
Query: 2 LLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQP--- 58
+L+GPPG GKT + + +A +A F ++A L G K +RA V +
Sbjct: 56 ILYGPPGVGKTTLARIIANHTRAHFSSLNA-VLA------GVKDLRA--EVDRAKERLER 106
Query: 59 ----SIIFIDEI 66
+I+FIDE+
Sbjct: 107 HGKRTILFIDEV 118
>gnl|CDD|224142 COG1221, PspF, Transcriptional regulators containing an AAA-type
ATPase domain and a DNA-binding domain [Transcription /
Signal transduction mechanisms].
Length = 403
Score = 31.2 bits (71), Expect = 0.70
Identities = 18/83 (21%), Positives = 30/83 (36%), Gaps = 21/83 (25%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQC-------KATFFC--ISASTLTSKWYGEGEKMVRALFA 51
+L+ G GTGK L + + A F C S + ++ +G + F
Sbjct: 104 VLIIGETGTGKELFARLIHALSARRAEAPFIAFNCAAYSENLQEAELFG----HEKGAFT 159
Query: 52 VASVHQPSII--------FIDEI 66
A + + F+DEI
Sbjct: 160 GAQGGKAGLFEQANGGTLFLDEI 182
>gnl|CDD|236828 PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit;
Provisional.
Length = 758
Score = 31.3 bits (71), Expect = 0.81
Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 12/81 (14%)
Query: 2 LLFGPPGTGKTLIGKCVA------------AQCKATFFCISASTLTSKWYGEGEKMVRAL 49
LL G G GKT I + +A A C I + +K+ G+ EK +AL
Sbjct: 211 LLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKAL 270
Query: 50 FAVASVHQPSIIFIDEIDSLL 70
SI+FIDEI +++
Sbjct: 271 LKQLEQDTNSILFIDEIHTII 291
>gnl|CDD|233959 TIGR02639, ClpA, ATP-dependent Clp protease ATP-binding subunit
clpA. [Protein fate, Degradation of proteins, peptides,
and glycopeptides].
Length = 730
Score = 31.1 bits (71), Expect = 0.82
Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 12/77 (15%)
Query: 2 LLFGPPGTGKTLI----------GKCVAAQCKATFFCISASTLT--SKWYGEGEKMVRAL 49
LL G PG GKT I GK A + + TL +K+ G+ E+ ++A+
Sbjct: 206 LLVGEPGVGKTAIVEGLALRIAEGKVPERLKNAKIYSLDMGTLLAGTKYRGDFEERLKAV 265
Query: 50 FAVASVHQPSIIFIDEI 66
+ +I+FIDEI
Sbjct: 266 VSEIEKEPNAILFIDEI 282
>gnl|CDD|233355 TIGR01313, therm_gnt_kin, carbohydrate kinase, thermoresistant
glucokinase family. This model represents a subfamily
of proteins that includes thermoresistant and
thermosensitve isozymes of gluconate kinase
(gluconokinase) in E. coli and other related proteins;
members of this family are often named by similarity to
the thermostable isozyme. These proteins show homology
to shikimate kinases and adenylate kinases but not to
gluconate kinases from the FGGY family of carbohydrate
kinases.
Length = 163
Score = 30.1 bits (68), Expect = 0.97
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATF 26
+L G G+GK+ I +A + A F
Sbjct: 1 FVLMGVAGSGKSTIASALAHRLGAKF 26
>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
Members of this protein family resemble the widely
distributed ATP-dependent protease La, also called Lon
and LonA. It resembles even more closely LonB, which is
a LonA paralog found in genomes if and only if the
species is capable of endospore formation (as in
Bacillus subtilis, Clostridium tetani, and select other
members of the Firmicutes) and expressed specifically in
the forespore compartment. Members of this family are
restricted to a subset of spore-forming species, and are
very likely to participate in the program of endospore
formation. We propose the designation LonC [Protein
fate, Degradation of proteins, peptides, and
glycopeptides, Cellular processes, Sporulation and
germination].
Length = 615
Score = 30.9 bits (70), Expect = 1.1
Identities = 56/219 (25%), Positives = 80/219 (36%), Gaps = 75/219 (34%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCK----------ATFFCISASTLTSKWYGEGEKMVRALF 50
I+L+GPPG GKT + + K A F + +TL +W + ++ L
Sbjct: 178 IILYGPPGVGKTTAARLALEEAKKLKHTPFAEDAPFVEVDGTTL--RW--DPREVTNPL- 232
Query: 51 AVASVHQP---------------------------SIIFIDEI---DSLLCQR------- 73
+ SVH P ++FIDEI D LL +
Sbjct: 233 -LGSVHDPIYQGARRDLAETGVPEPKTGLVTDAHGGVLFIDEIGELDPLLQNKLLKVLED 291
Query: 74 -----------SDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNR-PQELDEAARR 121
D N + +K F +GA D VL IGAT R P+E++ A R
Sbjct: 292 KRVEFSSSYYDPDDPN-VPKYIKKLF---EEGAPA---DFVL-IGATTRDPEEINPALRS 343
Query: 122 RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEV 160
R + + PL E IV I N +EE+
Sbjct: 344 RCAEVFFEPLTPEDIA-LIVLNAAEKI-NVHLAAGVEEL 380
>gnl|CDD|213190 cd03223, ABCD_peroxisomal_ALDP, ATP-binding cassette domain of
peroxisomal transporter, subfamily D. Peroxisomal
ATP-binding cassette transporter (Pat) is involved in
the import of very long-chain fatty acids (VLCFA) into
the peroxisome. The peroxisomal membrane forms a
permeability barrier for a wide variety of metabolites
required for and formed during fatty acid
beta-oxidation. To communicate with the cytoplasm and
mitochondria, peroxisomes need dedicated proteins to
transport such hydrophilic molecules across their
membranes. X-linked adrenoleukodystrophy (X-ALD) is
caused by mutations in the ALD gene, which encodes ALDP
(adrenoleukodystrophy protein ), a peroxisomal integral
membrane protein that is a member of the ATP-binding
cassette (ABC) transporter protein family. The disease
is characterized by a striking and unpredictable
variation in phenotypic expression. Phenotypes include
the rapidly progressive childhood cerebral form (CCALD),
the milder adult form, adrenomyeloneuropathy (AMN), and
variants without neurologic involvement (i.e.
asymptomatic).
Length = 166
Score = 29.8 bits (68), Expect = 1.1
Identities = 26/107 (24%), Positives = 41/107 (38%), Gaps = 37/107 (34%)
Query: 1 ILLFGPPGTGKTLI------------GKCVAAQCKATFFC-----ISASTLTSK----WY 39
+L+ GP GTGK+ + G+ + + F + TL + W
Sbjct: 30 LLITGPSGTGKSSLFRALAGLWPWGSGRIGMPEGEDLLFLPQRPYLPLGTLREQLIYPWD 89
Query: 40 ---GEGEK----MVRALFAVASVHQPSIIFIDEIDSLLCQRSDQENE 79
GE+ R L H+P +F+DE S L D+E+E
Sbjct: 90 DVLSGGEQQRLAFARLLL-----HKPKFVFLDEATSAL----DEESE 127
>gnl|CDD|223272 COG0194, Gmk, Guanylate kinase [Nucleotide transport and
metabolism].
Length = 191
Score = 30.2 bits (69), Expect = 1.1
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISAST 33
I+L GP G GK+ + K + F +SA+T
Sbjct: 7 IVLSGPSGVGKSTLVKALLED-DKLRFSVSATT 38
>gnl|CDD|238540 cd01120, RecA-like_NTPases, RecA-like NTPases. This family
includes the NTP binding domain of F1 and V1 H+ATPases,
DnaB and related helicases as well as bacterial RecA
and related eukaryotic and archaeal recombinases. This
group also includes bacterial conjugation proteins and
related DNA transfer proteins involved in type II and
type IV secretion.
Length = 165
Score = 29.8 bits (67), Expect = 1.2
Identities = 21/84 (25%), Positives = 31/84 (36%), Gaps = 9/84 (10%)
Query: 1 ILLFGPPGTGKT-----LIGKCVAAQCKATFFCI--SASTLTSKWYGEGEK--MVRALFA 51
IL+FGP G+GKT L K + I LT + GE K + +
Sbjct: 2 ILVFGPTGSGKTTLALQLALNIATKGGKVVYVDIEEEIEELTERLIGESLKGALDNLIIV 61
Query: 52 VASVHQPSIIFIDEIDSLLCQRSD 75
A+ P+ + L +R
Sbjct: 62 FATADDPAAARLLSKAERLRERGG 85
>gnl|CDD|215651 pfam00006, ATP-synt_ab, ATP synthase alpha/beta family,
nucleotide-binding domain. This family includes the
ATP synthase alpha and beta subunits, the ATP synthase
associated with flagella and the termination factor
Rho.
Length = 213
Score = 30.1 bits (69), Expect = 1.2
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCI 29
I +FG GTGKT++ +A KA +
Sbjct: 18 IGIFGGSGTGKTVLLGMIARNAKADVVEV 46
>gnl|CDD|222163 pfam13479, AAA_24, AAA domain. This AAA domain is found in a wide
variety of presumed phage proteins.
Length = 201
Score = 29.9 bits (68), Expect = 1.2
Identities = 15/52 (28%), Positives = 23/52 (44%), Gaps = 8/52 (15%)
Query: 186 SIDLSRIDALDVRPISIDDFRDALKTVRPSVCQADFVHQPSIIFIDEIDSLL 237
+ LSR D +D+R S D RD L + + + + + ID I L
Sbjct: 37 TKVLSRGDNVDIR--SWQDLRDFLDELA------EDLAEYDTLVIDTITKLE 80
Score = 29.9 bits (68), Expect = 1.5
Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 9/77 (11%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEG------EKMVRALFAVAS 54
+L++GPPG GKT + K K F + A T G+ + + L +A
Sbjct: 6 VLIYGPPGIGKTSLAK--TLPPKTLFLDLDAGTTKVLSRGDNVDIRSWQDLRDFLDELAE 63
Query: 55 VHQP-SIIFIDEIDSLL 70
+ ID I L
Sbjct: 64 DLAEYDTLVIDTITKLE 80
>gnl|CDD|223679 COG0606, COG0606, Predicted ATPase with chaperone activity
[Posttranslational modification, protein turnover,
chaperones].
Length = 490
Score = 30.3 bits (69), Expect = 1.5
Identities = 10/17 (58%), Positives = 13/17 (76%)
Query: 1 ILLFGPPGTGKTLIGKC 17
+LL GPPGTGKT++
Sbjct: 201 LLLVGPPGTGKTMLASR 217
>gnl|CDD|216186 pfam00910, RNA_helicase, RNA helicase. This family includes RNA
helicases thought to be involved in duplex unwinding
during viral RNA replication. Members of this family
are found in a variety of single stranded RNA viruses.
Length = 105
Score = 28.7 bits (65), Expect = 1.6
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 1 ILLFGPPGTGKTLIGK 16
I L+GPPG GK+ + K
Sbjct: 1 IWLYGPPGCGKSTLAK 16
>gnl|CDD|224141 COG1220, HslU, ATP-dependent protease HslVU (ClpYQ), ATPase subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 444
Score = 30.3 bits (69), Expect = 1.7
Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY--GEGEKMVRALFAVA 53
IL+ GP G GKT I + +A A F + A+ T Y + E ++R L +A
Sbjct: 53 ILMIGPTGVGKTEIARRLAKLAGAPFIKVEATKFTEVGYVGRDVESIIRDLVEIA 107
Score = 28.8 bits (65), Expect = 5.2
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
Query: 51 AVASVHQPSIIFIDEIDSLLCQRSD-QENETSR 82
A+ + Q I+FIDEID + +R + SR
Sbjct: 244 AIDAAEQNGIVFIDEIDK-IAKRGGSGGPDVSR 275
>gnl|CDD|235364 PRK05201, hslU, ATP-dependent protease ATP-binding subunit HslU;
Provisional.
Length = 443
Score = 30.0 bits (69), Expect = 1.7
Identities = 10/17 (58%), Positives = 12/17 (70%)
Query: 51 AVASVHQPSIIFIDEID 67
A+ V Q I+FIDEID
Sbjct: 243 AIERVEQNGIVFIDEID 259
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 30.3 bits (69), Expect = 1.8
Identities = 9/12 (75%), Positives = 11/12 (91%)
Query: 1 ILLFGPPGTGKT 12
+LL+GPPG GKT
Sbjct: 42 LLLYGPPGVGKT 53
>gnl|CDD|235335 PRK05057, aroK, shikimate kinase I; Reviewed.
Length = 172
Score = 29.3 bits (66), Expect = 1.9
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFF 27
I L GP G GK+ IG+ +A Q F+
Sbjct: 7 IFLVGPMGAGKSTIGRQLAQQLNMEFY 33
>gnl|CDD|182120 PRK09862, PRK09862, putative ATP-dependent protease; Provisional.
Length = 506
Score = 29.9 bits (67), Expect = 2.3
Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 11/49 (22%)
Query: 1 ILLFGPPGTGKTLIGKCV-----------AAQCKATFFCISASTLTSKW 38
+LL GPPGTGKT++ + A + A ++A ++ +W
Sbjct: 213 LLLIGPPGTGKTMLASRINGLLPDLSNEEALESAAILSLVNAESVQKQW 261
>gnl|CDD|221912 pfam13086, AAA_11, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins.
Length = 220
Score = 29.2 bits (66), Expect = 2.4
Identities = 9/11 (81%), Positives = 10/11 (90%)
Query: 2 LLFGPPGTGKT 12
L+ GPPGTGKT
Sbjct: 21 LIQGPPGTGKT 31
>gnl|CDD|116340 pfam07726, AAA_3, ATPase family associated with various cellular
activities (AAA). This Pfam entry includes some of the
AAA proteins not detected by the pfam00004 model.
Length = 131
Score = 28.7 bits (65), Expect = 2.4
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 1 ILLFGPPGTGKTLIGKCVA 19
+LL G PG KTL+ + +A
Sbjct: 2 VLLEGVPGLAKTLLARTLA 20
>gnl|CDD|222036 pfam13304, AAA_21, AAA domain.
Length = 256
Score = 29.3 bits (65), Expect = 2.6
Identities = 11/40 (27%), Positives = 22/40 (55%), Gaps = 2/40 (5%)
Query: 33 TLTSKWYGEGEKMVRALFA--VASVHQPSIIFIDEIDSLL 70
L K +G K + AL ++++ + S++ IDE ++ L
Sbjct: 184 KLLIKGISDGTKRLLALLLALLSALPKGSLLLIDEPENGL 223
>gnl|CDD|203260 pfam05496, RuvB_N, Holliday junction DNA helicase ruvB N-terminus.
The RuvB protein makes up part of the RuvABC revolvasome
which catalyzes the resolution of Holliday junctions
that arise during genetic recombination and DNA repair.
Branch migration is catalyzed by the RuvB protein that
is targeted to the Holliday junction by the structure
specific RuvA protein. This family contains the
N-terminal region of the protein.
Length = 231
Score = 29.0 bits (66), Expect = 2.8
Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 18/71 (25%)
Query: 1 ILLFGPPGTGKTLIGKCVA----AQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVH 56
+LL+GPPG GKT + +A + T S L EK L A+ +
Sbjct: 53 VLLYGPPGLGKTTLANIIANEMGVNIRIT----SGPAL--------EK-PGDLAAILTNL 99
Query: 57 QP-SIIFIDEI 66
+P ++FIDEI
Sbjct: 100 EPGDVLFIDEI 110
>gnl|CDD|222000 pfam13238, AAA_18, AAA domain.
Length = 128
Score = 28.2 bits (63), Expect = 2.8
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 1 ILLFGPPGTGKTLIGK 16
IL+ G PG+GKT + K
Sbjct: 1 ILITGTPGSGKTTLAK 16
>gnl|CDD|226583 COG4098, comFA, Superfamily II DNA/RNA helicase required for DNA
uptake (late competence protein) [DNA replication,
recombination, and repair].
Length = 441
Score = 29.4 bits (66), Expect = 2.9
Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 11/58 (18%)
Query: 18 VAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQP-------SIIFIDEIDS 68
+ + K F L YG+ + RA VA+ HQ ++ IDE+D+
Sbjct: 160 LYPRLKQAFSNCDIDLL----YGDSDSYFRAPLVVATTHQLLRFKQAFDLLIIDEVDA 213
>gnl|CDD|224145 COG1224, TIP49, DNA helicase TIP49, TBP-interacting protein
[Transcription].
Length = 450
Score = 29.2 bits (66), Expect = 3.1
Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQC-KATFFC-ISASTLTSKWYGEGEKMVRAL 49
IL+ GPPGTGKT + +A + + F IS S + S + E + +AL
Sbjct: 68 ILIVGPPGTGKTALAMGIARELGEDVPFVAISGSEIYSLEVKKTEALTQAL 118
>gnl|CDD|238713 cd01428, ADK, Adenylate kinase (ADK) catalyzes the reversible
phosphoryl transfer from adenosine triphosphates (ATP)
to adenosine monophosphates (AMP) and to yield
adenosine diphosphates (ADP). This enzyme is required
for the biosynthesis of ADP and is essential for
homeostasis of adenosine phosphates.
Length = 194
Score = 28.4 bits (64), Expect = 4.0
Identities = 9/16 (56%), Positives = 11/16 (68%)
Query: 1 ILLFGPPGTGKTLIGK 16
ILL GPPG+GK +
Sbjct: 2 ILLLGPPGSGKGTQAE 17
>gnl|CDD|187652 cd08948, 5beta-POR_like_SDR_a, progesterone 5-beta-reductase-like
proteins (5beta-POR), atypical (a) SDRs. 5beta-POR
catalyzes the reduction of progesterone to
5beta-pregnane-3,20-dione in Digitalis plants. This
subgroup of atypical-extended SDRs, shares the structure
of an extended SDR, but has a different glycine-rich
nucleotide binding motif (GXXGXXG) and lacks the YXXXK
active site motif of classical and extended SDRs.
Tyr-179 and Lys 147 are present in the active site, but
not in the usual SDR configuration. Given these
differences, it has been proposed that this subfamily
represents a new SDR class. Other atypical SDRs include
biliverdin IX beta reductase (BVR-B,aka flavin
reductase), NMRa (a negative transcriptional regulator
of various fungi), phenylcoumaran benzylic ether and
pinoresinol-lariciresinol reductases, phenylpropene
synthases, eugenol synthase, triphenylmethane reductase,
isoflavone reductases, and others. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold, an NAD(P)(H)-binding region, and a structurally
diverse C-terminal region. Sequence identity between
different SDR enzymes is typically in the 15-30% range;
they catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. In addition to the
Rossmann fold core region typical of all SDRs, extended
SDRs have a less conserved C-terminal extension of
approximately 100 amino acids, and typically have a
TGXXGXXG cofactor binding motif. Complex (multidomain)
SDRs such as ketoreductase domains of fatty acid
synthase have a GGXGXXG NAD(P)-binding motif and an
altered active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 308
Score = 28.8 bits (65), Expect = 4.0
Identities = 18/71 (25%), Positives = 24/71 (33%), Gaps = 13/71 (18%)
Query: 99 DDDLVLVIGA--TNRPQELDEAARRRLVK------RLYIPLPDEQARCEIVTKLLRNITN 150
D LV IG + + A L YI PDE E+ +LRN
Sbjct: 42 DPRLVEHIGIDLLDPADTVLRAKLPGLEDVTHVFYAAYIERPDEAELVEVNGAMLRN--- 98
Query: 151 SLTVDDIEEVG 161
+D +E
Sbjct: 99 --FLDALEPAS 107
>gnl|CDD|184195 PRK13635, cbiO, cobalt transporter ATP-binding subunit;
Provisional.
Length = 279
Score = 28.8 bits (65), Expect = 4.3
Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 17/65 (26%)
Query: 57 QPSIIFIDEIDSLLCQRSDQEN-ETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 115
QP II +DE S+L R +E ET R+LK + I+ V+ T+ +L
Sbjct: 158 QPDIIILDEATSMLDPRGRREVLETVRQLKEQKGIT-------------VLSITH---DL 201
Query: 116 DEAAR 120
DEAA+
Sbjct: 202 DEAAQ 206
Score = 28.8 bits (65), Expect = 4.3
Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 17/65 (26%)
Query: 224 QPSIIFIDEIDSLLCQRSDQEN-ETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 282
QP II +DE S+L R +E ET R+LK + I+ V+ T+ +L
Sbjct: 158 QPDIIILDEATSMLDPRGRREVLETVRQLKEQKGIT-------------VLSITH---DL 201
Query: 283 DEAAR 287
DEAA+
Sbjct: 202 DEAAQ 206
>gnl|CDD|190347 pfam02562, PhoH, PhoH-like protein. PhoH is a cytoplasmic
protein and predicted ATPase that is induced by
phosphate starvation.
Length = 205
Score = 28.2 bits (64), Expect = 4.5
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQ 21
+ GP GTGKT + A
Sbjct: 22 VFGIGPAGTGKTYLAVAAAVD 42
>gnl|CDD|233120 TIGR00764, lon_rel, lon-related putative ATP-dependent protease.
This model represents a set of proteins with extensive
C-terminal homology to the ATP-dependent protease La,
product of the lon gene of E. coli. The model is based
on a seed alignment containing only archaeal members,
but several bacterial proteins match the model well.
Because several species, including Thermotoga maritima
and Treponema pallidum, contain both a close homolog of
the lon protease and nearly full-length homolog of the
members of this family, we suggest there may also be a
functional division between the two families. Members
of this family from Pyrococcus horikoshii and
Pyrococcus abyssi each contain a predicted intein
[Protein fate, Degradation of proteins, peptides, and
glycopeptides].
Length = 608
Score = 28.7 bits (64), Expect = 4.8
Identities = 9/19 (47%), Positives = 14/19 (73%)
Query: 1 ILLFGPPGTGKTLIGKCVA 19
+LL G PG GK+++ K +A
Sbjct: 40 VLLIGEPGVGKSMLAKAMA 58
>gnl|CDD|225804 COG3265, GntK, Gluconate kinase [Carbohydrate transport and
metabolism].
Length = 161
Score = 28.0 bits (63), Expect = 4.8
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 5 GPPGTGKTLIGKCVAAQCKATF 26
G G+GK+ +G +A + A F
Sbjct: 2 GVSGSGKSTVGSALAERLGAKF 23
>gnl|CDD|178613 PLN03055, PLN03055, AMP deaminase; Provisional.
Length = 602
Score = 28.7 bits (64), Expect = 4.8
Identities = 19/69 (27%), Positives = 30/69 (43%), Gaps = 2/69 (2%)
Query: 128 YIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCREASLGPVRSI 187
Y P P Q +V ++ +++ V TT F+ DM + R SLG VR+
Sbjct: 53 YEPEPPSQHVFRMVDGVMHVYAPDDAKEELFPVPDATTFFT--DMHRILRIVSLGNVRTF 110
Query: 188 DLSRIDALD 196
R+ L+
Sbjct: 111 CHHRLKLLE 119
>gnl|CDD|213188 cd03221, ABCF_EF-3, ATP-binding cassette domain of elongation
factor 3, subfamily F. Elongation factor 3 (EF-3) is a
cytosolic protein required by fungal ribosomes for in
vitro protein synthesis and for in vivo growth. EF-3
stimulates the binding of the EF-1: GTP: aa-tRNA
ternary complex to the ribosomal A site by facilitated
release of the deacylated tRNA from the E site. The
reaction requires ATP hydrolysis. EF-3 contains two ATP
nucleotide binding sequence (NBS) motifs. NBSI is
sufficient for the intrinsic ATPase activity. NBSII is
essential for the ribosome-stimulated functions.
Length = 144
Score = 27.8 bits (63), Expect = 4.9
Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 5/65 (7%)
Query: 5 GPPGTGKTLIGKCVAAQCKATF-FCISASTLTSKWYGE---GEKMVRALFAVASVHQPSI 60
G G GK+ + K +A + + ST+ ++ + GEKM R A + P++
Sbjct: 33 GRNGAGKSTLLKLIAGELEPDEGIVTWGSTVKIGYFEQLSGGEKM-RLALAKLLLENPNL 91
Query: 61 IFIDE 65
+ +DE
Sbjct: 92 LLLDE 96
>gnl|CDD|147949 pfam06068, TIP49, TIP49 C-terminus. This family consists of the
C-terminal region of several eukaryotic and archaeal
RuvB-like 1 (Pontin or TIP49a) and RuvB-like 2 (Reptin
or TIP49b) proteins. The N-terminal domain contains the
pfam00004 domain. In zebrafish, the liebeskummer (lik)
mutation, causes development of hyperplastic embryonic
hearts. lik encodes Reptin, a component of a
DNA-stimulated ATPase complex. Beta-catenin and Pontin,
a DNA-stimulated ATPase that is often part of complexes
with Reptin, are in the same genetic pathways. The
Reptin/Pontin ratio serves to regulate heart growth
during development, at least in part via the
beta-catenin pathway. TBP-interacting protein 49
(TIP49) was originally identified as a TBP-binding
protein, and two related proteins are encoded by
individual genes, tip49a and b. Although the function
of this gene family has not been elucidated, they are
supposed to play a critical role in nuclear events
because they interact with various kinds of nuclear
factors and have DNA helicase activities.TIP49a has
been suggested to act as an autoantigen in some
patients with autoimmune diseases.
Length = 395
Score = 28.8 bits (65), Expect = 5.0
Identities = 16/38 (42%), Positives = 24/38 (63%), Gaps = 2/38 (5%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQC-KATFFC-ISASTLTS 36
+L+ GPPGTGKT + ++ + + T FC IS S + S
Sbjct: 53 VLIAGPPGTGKTALAIAISKELGEDTPFCPISGSEVYS 90
>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
Length = 329
Score = 28.7 bits (65), Expect = 5.0
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 1 ILLFGPPGTGKTLIGKCVA 19
+L +G GTGKT + C+A
Sbjct: 186 LLFYGNTGTGKTFLSNCIA 204
>gnl|CDD|227629 COG5317, COG5317, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 175
Score = 27.9 bits (62), Expect = 5.2
Identities = 27/105 (25%), Positives = 38/105 (36%), Gaps = 23/105 (21%)
Query: 236 LLCQRSDQENETSRR--LKTEFLISLDGASTLDD---------DLVLVIGATNRPQELDE 284
LL QR+ E +R L + + LD S D ++ + R Q
Sbjct: 79 LLLQRAVNSGEMTREQVLSEKSKVRLDTPSADRDAPGWNELPESFRDLVERSLRLQAR-- 136
Query: 285 AARRRLVKRLYIPLPDEQKVP-GSNPTMDKIDQENETSRRLKTEF 328
RRL + +Y +VP G NP +ID L T F
Sbjct: 137 --VRRLDREIYGESEATDEVPRGRNPVQAQID-------LLTTAF 172
>gnl|CDD|223775 COG0703, AroK, Shikimate kinase [Amino acid transport and
metabolism].
Length = 172
Score = 27.9 bits (63), Expect = 5.5
Identities = 10/27 (37%), Positives = 14/27 (51%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFF 27
I+L G G GK+ IG+ +A F
Sbjct: 5 IVLIGFMGAGKSTIGRALAKALNLPFI 31
>gnl|CDD|237495 PRK13765, PRK13765, ATP-dependent protease Lon; Provisional.
Length = 637
Score = 28.8 bits (65), Expect = 5.6
Identities = 8/19 (42%), Positives = 15/19 (78%)
Query: 1 ILLFGPPGTGKTLIGKCVA 19
+++ G PGTGK+++ K +A
Sbjct: 53 VMMIGSPGTGKSMLAKAMA 71
>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
protein. [Transport and binding proteins, Other].
Length = 1394
Score = 28.9 bits (65), Expect = 5.6
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 52 VASVHQPSIIFIDEIDSLL 70
+ ++HQPS I +E D LL
Sbjct: 956 LCTIHQPSAILFEEFDRLL 974
Score = 28.2 bits (63), Expect = 8.1
Identities = 9/16 (56%), Positives = 11/16 (68%)
Query: 222 VHQPSIIFIDEIDSLL 237
+HQPS I +E D LL
Sbjct: 959 IHQPSAILFEEFDRLL 974
>gnl|CDD|234063 TIGR02928, TIGR02928, orc1/cdc6 family replication initiation
protein. Members of this protein family are found
exclusively in the archaea. This set of DNA binding
proteins shows homology to the origin recognition
complex subunit 1/cell division control protein 6
family in eukaryotes. Several members may be found in
genome and interact with each other [DNA metabolism,
DNA replication, recombination, and repair].
Length = 365
Score = 28.4 bits (64), Expect = 6.0
Identities = 8/18 (44%), Positives = 12/18 (66%)
Query: 1 ILLFGPPGTGKTLIGKCV 18
+ ++G GTGKT + K V
Sbjct: 43 VFIYGKTGTGKTAVTKYV 60
>gnl|CDD|238687 cd01393, recA_like, RecA is a bacterial enzyme which has roles
in homologous recombination, DNA repair, and the
induction of the SOS response. RecA couples ATP
hydrolysis to DNA strand exchange. While prokaryotes
have a single RecA protein, eukaryotes have multiple
RecA homologs such as Rad51, DMC1 and Rad55/57.
Archaea have the RecA-like homologs radA and radB.
Length = 226
Score = 27.9 bits (63), Expect = 6.1
Identities = 7/20 (35%), Positives = 12/20 (60%)
Query: 3 LFGPPGTGKTLIGKCVAAQC 22
+FG G+GKT + +A +
Sbjct: 24 IFGEFGSGKTQLCLQLAVEA 43
>gnl|CDD|224054 COG1131, CcmA, ABC-type multidrug transport system, ATPase
component [Defense mechanisms].
Length = 293
Score = 28.0 bits (63), Expect = 6.2
Identities = 9/24 (37%), Positives = 14/24 (58%), Gaps = 1/24 (4%)
Query: 42 GEKMVRALFAVASVHQPSIIFIDE 65
G K R A+A +H P ++ +DE
Sbjct: 140 GMKQ-RLSIALALLHDPELLILDE 162
>gnl|CDD|219158 pfam06745, KaiC, KaiC. This family represents a conserved region
within bacterial and archaeal proteins, most of which
are hypothetical. More than one copy is sometimes found
in each protein. This family includes KaiC, which is
one of the Kai proteins among which direct
protein-protein association may be a critical process
in the generation of circadian rhythms in
cyanobacteria.
Length = 231
Score = 28.0 bits (63), Expect = 6.3
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 1 ILLFGPPGTGKTLIGK 16
+L+ G PGTGKT+ G
Sbjct: 22 VLITGGPGTGKTIFGL 37
>gnl|CDD|238544 cd01124, KaiC, KaiC is a circadian clock protein primarily found
in cyanobacteria KaiC is a RecA-like ATPase, having
both Walker A and Walker B motifs. A related protein is
found in archaea.
Length = 187
Score = 27.7 bits (62), Expect = 6.4
Identities = 9/15 (60%), Positives = 9/15 (60%)
Query: 1 ILLFGPPGTGKTLIG 15
LL G PGTGKT
Sbjct: 2 TLLSGGPGTGKTTFA 16
>gnl|CDD|221983 pfam13207, AAA_17, AAA domain.
Length = 114
Score = 26.9 bits (59), Expect = 6.7
Identities = 10/40 (25%), Positives = 17/40 (42%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYG 40
IL+ GPPG+GK+ + K +A + +
Sbjct: 2 ILITGPPGSGKSTLAKKLAEKLGIPVISLDDLLREEGLAE 41
>gnl|CDD|214364 CHL00100, ilvH, acetohydroxyacid synthase small subunit.
Length = 174
Score = 27.8 bits (62), Expect = 6.8
Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 12/77 (15%)
Query: 124 VKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCREASLGP 183
+ R+ + +P + E +TK L + N L V DI ++ + RE L
Sbjct: 43 ISRITMVVPGDDRTIEQLTKQLYKLVNILKVQDI------------TNIPCVERELMLIK 90
Query: 184 VRSIDLSRIDALDVRPI 200
+ +R + L++ I
Sbjct: 91 INVNSQTRPEILEIAQI 107
>gnl|CDD|173927 cd07766, DHQ_Fe-ADH, Dehydroquinate synthase-like (DHQ-like) and
iron-containing alcohol dehydrogenases (Fe-ADH).
Dehydroquinate synthase-like. This superfamily divides
into two subgroups: the dehydroquinate synthase-like,
and a large metal-containing alcohol dehydrogenases
(ADH), known as iron-containing alcohol dehydrogenases.
Dehydroquinate synthase (DHQS) catalyzes the conversion
of 3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP) to
dehydroquinate (DHQ) in the second step of the shikimate
pathway. This pathway involves seven sequential
enzymatic steps in the conversion of erythrose
4-phosphate and phosphoenolpyruvate into chorismate for
subsequent synthesis of aromatic compounds.
Dehydroquinate synthase-like group includes
dehydroquinate synthase, 2-deoxy-scyllo-inosose
synthase, and 2-epi-5-epi-valiolone synthase. The
alcohol dehydrogenases in this superfamily contain a
dehydroquinate synthase-like protein structural fold and
mostly contain iron. They are distinct from other
alcohol dehydrogenases which contains different protein
domains. There are several distinct families of alcohol
dehydrogenases: Zinc-containing long-chain alcohol
dehydrogenases; insect-type, or short-chain alcohol
dehydrogenases; iron-containing alcohol dehydrogenases,
and others. The iron-containing family has a Rossmann
fold-like topology that resembles the fold of the
zinc-dependent alcohol dehydrogenases, but lacks
sequence homology, and differs in strand arrangement.
ADH catalyzes the reversible oxidation of alcohol to
acetaldehyde with the simultaneous reduction of NAD(P)+
to NAD(P)H.
Length = 332
Score = 28.1 bits (63), Expect = 6.8
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 301 EQKVPGSNPTMDKIDQENETSRRLKTEFLISLDGASTLD 339
G NPT +++ + E +R + + +I++ G STLD
Sbjct: 54 IFDGVGPNPTFEEVKEAVERARAAEVDAVIAVGGGSTLD 92
>gnl|CDD|213197 cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the
drug resistance transporter and related proteins,
subfamily A. This family of ATP-binding proteins
belongs to a multi-subunit transporter involved in drug
resistance (BcrA and DrrA), nodulation, lipid transport,
and lantibiotic immunity. In bacteria and archaea, these
transporters usually include an ATP-binding protein and
one or two integral membrane proteins. Eukaryotic
systems of the ABCA subfamily display ABC domains that
are quite similar to this family. The ATP-binding domain
shows the highest similarity between all members of the
ABC transporter family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 173
Score = 27.7 bits (63), Expect = 7.0
Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 1/24 (4%)
Query: 42 GEKMVRALFAVASVHQPSIIFIDE 65
G K R A A +H P ++ +DE
Sbjct: 99 GMKQ-RLALAQALLHDPELLILDE 121
>gnl|CDD|216270 pfam01057, Parvo_NS1, Parvovirus non-structural protein NS1. This
family also contains the NS2 protein. Parvoviruses
encode two non-structural proteins, NS1 and NS2. The
mRNA for NS2 contains the coding sequence for the first
87 amino acids of NS1, then by an alternative splicing
mechanism mRNA from a different reading frame, encoding
the last 78 amino acids, makes up the full length of the
NS2 mRNA. NS1, is the major non-structural protein. It
is essential for DNA replication. It is an 83-kDa
nuclear phosphoprotein. It has DNA helicase and ATPase
activity.
Length = 271
Score = 28.1 bits (63), Expect = 7.3
Identities = 6/14 (42%), Positives = 9/14 (64%)
Query: 1 ILLFGPPGTGKTLI 14
+ +GP TGKT +
Sbjct: 116 VWFYGPASTGKTNL 129
>gnl|CDD|234654 PRK00131, aroK, shikimate kinase; Reviewed.
Length = 175
Score = 27.5 bits (62), Expect = 7.4
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 1 ILLFGPPGTGKTLIGKCVAAQCKATFF 27
I+L G G GK+ IG+ +A + F
Sbjct: 7 IVLIGFMGAGKSTIGRLLAKRLGYDFI 33
>gnl|CDD|216285 pfam01075, Glyco_transf_9, Glycosyltransferase family 9
(heptosyltransferase). Members of this family belong to
glycosyltransferase family 9. Lipopolysaccharide is a
major component of the outer leaflet of the outer
membrane in Gram-negative bacteria. It is composed of
three domains; lipid A, Core oligosaccharide and the
O-antigen. All of these enzymes transfer heptose to the
lipopolysaccharide core.
Length = 249
Score = 27.7 bits (62), Expect = 7.5
Identities = 21/98 (21%), Positives = 30/98 (30%), Gaps = 20/98 (20%)
Query: 27 FCISASTLTSKWYGEGEKMVRALFAVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKT 86
FC AS +W E L ++ L D+E E ++RL
Sbjct: 112 FCPGASRPAKRWPAEH---FAELAQALQERGWQVV-------LFGGAEDREEEIAKRLAA 161
Query: 87 EFLISLDGASTLDDDLVLVIGATNRPQELDEAARRRLV 124
L V ++G T+ Q A LV
Sbjct: 162 G----------LRTPCVNLLGKTSLEQAAALLAGADLV 189
>gnl|CDD|236783 PRK10875, recD, exonuclease V subunit alpha; Provisional.
Length = 615
Score = 28.0 bits (63), Expect = 8.0
Identities = 13/35 (37%), Positives = 18/35 (51%), Gaps = 3/35 (8%)
Query: 273 IGATNRPQELDEAARRRLVKRLYIPLPDE---QKV 304
N E+DEA R+ + L+ P+ DE QKV
Sbjct: 125 FNEVNHAIEVDEALLRQTLDALFGPVTDEVDWQKV 159
>gnl|CDD|234763 PRK00440, rfc, replication factor C small subunit; Reviewed.
Length = 319
Score = 27.5 bits (62), Expect = 9.5
Identities = 11/12 (91%), Positives = 11/12 (91%), Gaps = 1/12 (8%)
Query: 2 LLF-GPPGTGKT 12
LLF GPPGTGKT
Sbjct: 41 LLFAGPPGTGKT 52
>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
Length = 985
Score = 27.8 bits (62), Expect = 9.8
Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 4/42 (9%)
Query: 47 RALFAVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEF 88
ALF + S+ FID+I S +NE +LK F
Sbjct: 375 EALFDKRGIKALSLFFIDKIASYR----SYDNELPAKLKGLF 412
>gnl|CDD|214940 smart00962, SRP54, SRP54-type protein, GTPase domain. This entry
represents the GTPase domain of the 54 kDa SRP54
component, a GTP-binding protein that interacts with
the signal sequence when it emerges from the ribosome.
SRP54 of the signal recognition particle has a
three-domain structure: an N-terminal helical bundle
domain, a GTPase domain, and the M-domain that binds
the 7s RNA and also binds the signal sequence. The
extreme C-terminal region is glycine-rich and lower in
complexity and poorly conserved between species. The
GTPase domain is evolutionary related to P-loop NTPase
domains found in a variety of other proteins.
Length = 197
Score = 27.4 bits (62), Expect = 9.8
Identities = 14/24 (58%), Positives = 16/24 (66%), Gaps = 2/24 (8%)
Query: 1 ILLFGPPGTGKT-LIGKCVAAQCK 23
ILL GP G GKT I K +AA+ K
Sbjct: 4 ILLVGPNGVGKTTTIAK-LAARLK 26
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.385
Gapped
Lambda K H
0.267 0.0889 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,819,087
Number of extensions: 1774297
Number of successful extensions: 2679
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2587
Number of HSP's successfully gapped: 186
Length of query: 343
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 245
Effective length of database: 6,590,910
Effective search space: 1614772950
Effective search space used: 1614772950
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.2 bits)