RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3808
(330 letters)
>gnl|CDD|223540 COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational
modification, protein turnover, chaperones].
Length = 494
Score = 204 bits (521), Expect = 6e-62
Identities = 124/351 (35%), Positives = 173/351 (49%), Gaps = 36/351 (10%)
Query: 1 MTTFDKGINIAIKATEADKNKNYEEALKLYTH-------SVQYFLHALKYEITSAEGKQS 53
M +G I I AT + A + ++ L+
Sbjct: 111 MDGLKRGQVIVIGATNRPDGL--DPAKRRPGRFDREIEVNLPDEAGRLEILQIHTRLMFL 168
Query: 54 IREKCDEYLARAE--KLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPDKKKMQANL 111
+ LA K L K+ R D E +ED D ++ +
Sbjct: 169 GPPGTGKTLAARTVGKSGADLGALAKEAALRELRRAIDLVGE-YIGVTED-DFEEALKKV 226
Query: 112 EGA--IVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-PWKGILLFGPPG 168
+ ++ E +V D+ GLE AKE LKEA+ P+K P+LF + P KG+LL+GPPG
Sbjct: 227 LPSRGVLFEDEDVTLDDIGGLEEAKEELKEAIETPLKRPELFRKLGLRPPKGVLLYGPPG 286
Query: 169 TGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDE-- 226
TGK+ LAKAVA E+ S F SV S+L+SKW+GESEK ++ LFE AR PSIIFIDE
Sbjct: 287 TGKTLLAKAVALESR-SRFISVKGSELLSKWVGESEKNIRELFEKARKLAPSIIFIDEID 345
Query: 227 ------------RIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRR--RFEK 272
R + L L +G+LV+ ATN P LD A+ R RF++
Sbjct: 346 SLASGRGPSEDGSGRRVVGQLLTEL--DGIEKAEGVLVIAATNRPDDLDPALLRPGRFDR 403
Query: 273 RIYIALPELNARLFMFRHHLGNTRNTIS-DENLKELAKRTEGYSGADISTL 322
IY+ LP+L RL +F+ HL + + ++ D +L+ELA+ TEGYSGADI+ L
Sbjct: 404 LIYVPLPDLEERLEIFKIHLRDKKPPLAEDVDLEELAEITEGYSGADIAAL 454
Score = 81.8 bits (202), Expect = 3e-17
Identities = 70/179 (39%), Positives = 97/179 (54%), Gaps = 16/179 (8%)
Query: 159 KGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHR 218
KG+LL GPPGTGK+ LA+A+A E F S++ +++SK++GESE ++ LFE A
Sbjct: 19 KGVLLHGPPGTGKTLLARALANEGAE--FLSINGPEILSKYVGESELRLRELFEEAEKLA 76
Query: 219 PSIIFIDERI----DRGISNALFLL--LSGVGNDMDG-----ILVLGATNIPWVLDAAIR 267
PSIIFIDE R ++ + MDG ++V+GATN P LD A R
Sbjct: 77 PSIIFIDEIDALAPKRSSDQGEVERRVVAQLLALMDGLKRGQVIVIGATNRPDGLDPAKR 136
Query: 268 R--RFEKRIYIALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTLER 324
R RF++ I + LP+ RL + + H K LA RT G SGAD+ L +
Sbjct: 137 RPGRFDREIEVNLPDEAGRLEILQIHT-RLMFLGPPGTGKTLAARTVGKSGADLGALAK 194
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 192 bits (490), Expect = 2e-58
Identities = 108/250 (43%), Positives = 144/250 (57%), Gaps = 43/250 (17%)
Query: 97 DDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV 156
+DP + M+ V+E PNV + D+ GLE ++EAV LP+K P+LF + V
Sbjct: 110 PSEKDPRVQAME-------VIESPNVTYEDIGGLEEQIREVREAVELPLKKPELF--EEV 160
Query: 157 ---PWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFEL 213
P KG+LL+GPPGTGK+ LAKAVA E N +TF V S+LV K++GE +LV+ LFEL
Sbjct: 161 GIEPPKGVLLYGPPGTGKTLLAKAVAHETN-ATFIRVVGSELVQKFIGEGARLVRELFEL 219
Query: 214 ARAHRPSIIFIDE-------RIDRGIS------NALFLLLSGVGNDMDG------ILVLG 254
AR PSIIFIDE R D G S L LL+ +MDG + ++
Sbjct: 220 AREKAPSIIFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLA----EMDGFDPRGNVKIIA 275
Query: 255 ATNIPWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTR--NTISDENLKELAKR 310
ATN +LD AI R RF++ I + LP+ RL + + H TR N D +L+ELA+
Sbjct: 276 ATNRIDILDPAILRPGRFDRIIEVPLPDEEGRLEILKIH---TRKMNLADDVDLEELAEL 332
Query: 311 TEGYSGADIS 320
TEG SGAD+
Sbjct: 333 TEGASGADLK 342
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 178 bits (453), Expect = 1e-52
Identities = 100/249 (40%), Positives = 141/249 (56%), Gaps = 43/249 (17%)
Query: 97 DDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV 156
DP M+ V EKP+V + D+ GL+ + ++E V LP+K P+LF + +
Sbjct: 130 PPEVDPRVSVME-------VEEKPDVTYEDIGGLDEQIQEIREVVELPLKNPELF--EEL 180
Query: 157 ---PWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFEL 213
P KG+LL+GPPGTGK+ LAKAVA + + +TF V S+LV K++GE +LV+ LFEL
Sbjct: 181 GIDPPKGVLLYGPPGTGKTLLAKAVANQTD-ATFIRVVGSELVQKYIGEGARLVRELFEL 239
Query: 214 ARAHRPSIIFIDE-------RI------DRGISNALFLLLSGVGNDMDG------ILVLG 254
AR PSIIFIDE R DR + + LL N +DG + V+
Sbjct: 240 AREKAPSIIFIDEIDAIGAKRFDSGTSGDREVQRTMLELL----NQLDGFDPRGNVKVIM 295
Query: 255 ATNIPWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTR--NTISDENLKELAKR 310
ATN P +LD A+ R RF+++I LP+ R + + H TR N D +L+ LA+
Sbjct: 296 ATNRPDILDPALLRPGRFDRKIEFPLPDEEGRAEILKIH---TRKMNLADDVDLELLARL 352
Query: 311 TEGYSGADI 319
TEG+SGAD+
Sbjct: 353 TEGFSGADL 361
>gnl|CDD|233328 TIGR01243, CDC48, AAA family ATPase, CDC48 subfamily. This
subfamily of the AAA family ATPases includes two members
each from three archaeal species. It also includes yeast
CDC48 (cell division control protein 48) and the human
ortholog, transitional endoplasmic reticulum ATPase
(valosin-containing protein). These proteins in
eukaryotes are involved in the budding and transfer of
membrane from the transitional endoplasmic reticulum to
the Golgi apparatus.
Length = 733
Score = 178 bits (452), Expect = 2e-50
Identities = 103/257 (40%), Positives = 152/257 (59%), Gaps = 33/257 (12%)
Query: 87 TKDDKKESDEDDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIK 146
T D E+ K ++ + +++E PNV+W+D+ GLE K+ L+EAV P+K
Sbjct: 424 TMKDFMEA---------LKMVEPSAIREVLVEVPNVRWSDIGGLEEVKQELREAVEWPLK 474
Query: 147 FPQLFT--GKRVPWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESE 204
P++F G R P KG+LLFGPPGTGK+ LAKAVATE+ + F +V +++SKW+GESE
Sbjct: 475 HPEIFEKMGIRPP-KGVLLFGPPGTGKTLLAKAVATES-GANFIAVRGPEILSKWVGESE 532
Query: 205 KLVKTLFELARAHRPSIIFIDERID-----RG----------ISNALFLLLSGVGNDMDG 249
K ++ +F AR P+IIF DE ID RG I N L + G+ ++
Sbjct: 533 KAIREIFRKARQAAPAIIFFDE-IDAIAPARGARFDTSVTDRIVNQLLTEMDGI-QELSN 590
Query: 250 ILVLGATNIPWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTRNTISDENLKEL 307
++V+ ATN P +LD A+ R RF++ I + P+ AR +F+ H + D +L+EL
Sbjct: 591 VVVIAATNRPDILDPALLRPGRFDRLILVPPPDEEARKEIFKIHT-RSMPLAEDVDLEEL 649
Query: 308 AKRTEGYSGADISTLER 324
A+ TEGY+GADI + R
Sbjct: 650 AEMTEGYTGADIEAVCR 666
Score = 149 bits (378), Expect = 3e-40
Identities = 87/221 (39%), Positives = 134/221 (60%), Gaps = 19/221 (8%)
Query: 120 PNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-PWKGILLFGPPGTGKSYLAKAV 178
P V + D+ GL+ AKE ++E V LP+K P+LF + P KG+LL+GPPGTGK+ LAKAV
Sbjct: 173 PKVTYEDIGGLKEAKEKIREMVELPMKHPELFEHLGIEPPKGVLLYGPPGTGKTLLAKAV 232
Query: 179 ATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDE------------ 226
A EA + F S++ +++SK+ GESE+ ++ +F+ A + PSIIFIDE
Sbjct: 233 ANEA-GAYFISINGPEIMSKYYGESEERLREIFKEAEENAPSIIFIDEIDAIAPKREEVT 291
Query: 227 -RIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRR--RFEKRIYIALPELNA 283
+++ + L L+ G+ ++V+GATN P LD A+RR RF++ I I +P+ A
Sbjct: 292 GEVEKRVVAQLLTLMDGL-KGRGRVIVIGATNRPDALDPALRRPGRFDREIVIRVPDKRA 350
Query: 284 RLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTLER 324
R + + H N D +L +LA+ T G+ GAD++ L +
Sbjct: 351 RKEILKVHTRN-MPLAEDVDLDKLAEVTHGFVGADLAALAK 390
>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family. Many
proteins may score above the trusted cutoff because an
internal.
Length = 364
Score = 168 bits (427), Expect = 2e-49
Identities = 104/245 (42%), Positives = 143/245 (58%), Gaps = 39/245 (15%)
Query: 99 SEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-P 157
S+DP K M+ V E+PNV + D+ GLE ++EAV LP+K P+LF + P
Sbjct: 103 SKDPLVKGME-------VEERPNVSYEDIGGLEEQIREIREAVELPLKHPELFEEVGIEP 155
Query: 158 WKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAH 217
KG+LL+GPPGTGK+ LAKAVA E N +TF V S+LV K++GE +LV+ +FELA+
Sbjct: 156 PKGVLLYGPPGTGKTLLAKAVAHETN-ATFIRVVGSELVRKYIGEGARLVREIFELAKEK 214
Query: 218 RPSIIFIDE-------RI------DRGISNALFLLLSGVGNDMDG------ILVLGATNI 258
PSIIFIDE R DR + L LL+ ++DG + V+ ATN
Sbjct: 215 APSIIFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLA----ELDGFDPRGNVKVIAATNR 270
Query: 259 PWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTR--NTISDENLKELAKRTEGY 314
P +LD A+ R RF++ I + LP+ RL + + H TR D +L+ +AK TEG
Sbjct: 271 PDILDPALLRPGRFDRIIEVPLPDFEGRLEILKIH---TRKMKLAEDVDLEAIAKMTEGA 327
Query: 315 SGADI 319
SGAD+
Sbjct: 328 SGADL 332
>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
HflB(FtsH) is a pleiotropic protein required for correct
cell division in bacteria. It has ATP-dependent zinc
metalloprotease activity. It was formerly designated
cell division protein FtsH [Cellular processes, Cell
division, Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 495
Score = 167 bits (426), Expect = 4e-48
Identities = 97/226 (42%), Positives = 133/226 (58%), Gaps = 27/226 (11%)
Query: 118 EKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFT--GKRVPWKGILLFGPPGTGKSYLA 175
EKP V + DVAG++ AKE L E V +K P FT G ++P KG+LL GPPGTGK+ LA
Sbjct: 48 EKPKVTFKDVAGIDEAKEELMEIVDF-LKNPSKFTKLGAKIP-KGVLLVGPPGTGKTLLA 105
Query: 176 KAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERID-----R 230
KAVA EA FFS+S SD V ++G V+ LFE A+ + P IIFIDE ID R
Sbjct: 106 KAVAGEAGVP-FFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPCIIFIDE-IDAVGRQR 163
Query: 231 GIS------------NALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRR--RFEKRIYI 276
G N L + + G G + G++V+ ATN P VLD A+ R RF++++ +
Sbjct: 164 GAGLGGGNDEREQTLNQLLVEMDGFGTN-TGVIVIAATNRPDVLDPALLRPGRFDRQVVV 222
Query: 277 ALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
LP++ R + + H N D +LK +A+RT G+SGAD++ L
Sbjct: 223 DLPDIKGREEILKVHAKNK-KLAPDVDLKAVARRTPGFSGADLANL 267
>gnl|CDD|215649 pfam00004, AAA, ATPase family associated with various cellular
activities (AAA). AAA family proteins often perform
chaperone-like functions that assist in the assembly,
operation, or disassembly of protein complexes.
Length = 131
Score = 139 bits (353), Expect = 4e-41
Identities = 57/131 (43%), Positives = 78/131 (59%), Gaps = 17/131 (12%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPS 220
+LL+GPPGTGK+ LAKAVA E + F +S S+LVSK++GESEK ++ LFE A+ P
Sbjct: 1 LLLYGPPGTGKTTLAKAVAKELG-APFIEISGSELVSKYVGESEKRLRELFEAAKKLAPC 59
Query: 221 IIFIDERID--------------RGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAI 266
+IFIDE ID R + N L L G + + ++V+ ATN P LD A+
Sbjct: 60 VIFIDE-IDALAGSRGSGGDSESRRVVNQLLTELDGFTSSLSKVIVIAATNRPDKLDPAL 118
Query: 267 RR-RFEKRIYI 276
R RF++ I
Sbjct: 119 LRGRFDRIIEF 129
>gnl|CDD|240423 PTZ00454, PTZ00454, 26S protease regulatory subunit 6B-like
protein; Provisional.
Length = 398
Score = 138 bits (349), Expect = 1e-37
Identities = 85/227 (37%), Positives = 128/227 (56%), Gaps = 28/227 (12%)
Query: 118 EKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-PWKGILLFGPPGTGKSYLAK 176
EKP+V ++D+ GL+ K+ ++EAV LP+ P+L+ + P +G+LL+GPPGTGK+ LAK
Sbjct: 138 EKPDVTYSDIGGLDIQKQEIREAVELPLTCPELYEQIGIDPPRGVLLYGPPGTGKTMLAK 197
Query: 177 AVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDE-------RI- 228
AVA +TF V S+ V K+LGE ++V+ +F LAR + PSIIFIDE R
Sbjct: 198 AVAHHTT-ATFIRVVGSEFVQKYLGEGPRMVRDVFRLARENAPSIIFIDEVDSIATKRFD 256
Query: 229 -----DRGISNALFLLLSGVGNDMDG------ILVLGATNIPWVLDAAIRR--RFEKRIY 275
DR + L LL N MDG + V+ ATN LD A+ R R +++I
Sbjct: 257 AQTGADREVQRILLELL----NQMDGFDQTTNVKVIMATNRADTLDPALLRPGRLDRKIE 312
Query: 276 IALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
LP+ + +F+ + + N + +L++ R E S ADI+ +
Sbjct: 313 FPLPDRRQKRLIFQ-TITSKMNLSEEVDLEDFVSRPEKISAADIAAI 358
>gnl|CDD|223541 COG0465, HflB, ATP-dependent Zn proteases [Posttranslational
modification, protein turnover, chaperones].
Length = 596
Score = 138 bits (349), Expect = 1e-36
Identities = 93/232 (40%), Positives = 132/232 (56%), Gaps = 39/232 (16%)
Query: 118 EKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFT--GKRVPWKGILLFGPPGTGKSYLA 175
++ V + DVAG++ AKE L E ++ +K P+ + G ++P KG+LL GPPGTGK+ LA
Sbjct: 143 DQVKVTFADVAGVDEAKEELSE-LVDFLKNPKKYQALGAKIP-KGVLLVGPPGTGKTLLA 200
Query: 176 KAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERID-----R 230
KAVA EA FFS+S SD V ++G V+ LFE A+ + P IIFIDE ID R
Sbjct: 201 KAVAGEAGVP-FFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPCIIFIDE-IDAVGRQR 258
Query: 231 GISNALFLLLSGVGND------------MDG------ILVLGATNIPWVLDAAIRR--RF 270
G G GND MDG ++V+ ATN P VLD A+ R RF
Sbjct: 259 GAG-------LGGGNDEREQTLNQLLVEMDGFGGNEGVIVIAATNRPDVLDPALLRPGRF 311
Query: 271 EKRIYIALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
+++I + LP++ R + + H N D +LK++A+ T G+SGAD++ L
Sbjct: 312 DRQILVELPDIKGREQILKVHAKNKPLA-EDVDLKKIARGTPGFSGADLANL 362
>gnl|CDD|214386 CHL00176, ftsH, cell division protein; Validated.
Length = 638
Score = 138 bits (349), Expect = 2e-36
Identities = 90/226 (39%), Positives = 132/226 (58%), Gaps = 43/226 (19%)
Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFT--GKRVPWKGILLFGPPGTGKSYLAKAVATEAN 183
D+AG+E AKE +E V +K P+ FT G ++P KG+LL GPPGTGK+ LAKA+A EA
Sbjct: 184 DIAGIEEAKEEFEEVVSF-LKKPERFTAVGAKIP-KGVLLVGPPGTGKTLLAKAIAGEA- 240
Query: 184 NSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERID-----RGISNALFL 238
FFS+S S+ V ++G V+ LF+ A+ + P I+FIDE ID RG
Sbjct: 241 EVPFFSISGSEFVEMFVGVGAARVRDLFKKAKENSPCIVFIDE-IDAVGRQRG------- 292
Query: 239 LLSGVG--ND------------MDG------ILVLGATNIPWVLDAAIRR--RFEKRIYI 276
+G+G ND MDG ++V+ ATN +LDAA+ R RF+++I +
Sbjct: 293 --AGIGGGNDEREQTLNQLLTEMDGFKGNKGVIVIAATNRVDILDAALLRPGRFDRQITV 350
Query: 277 ALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
+LP+ RL + + H N + D +L+ +A+RT G+SGAD++ L
Sbjct: 351 SLPDREGRLDILKVHARN-KKLSPDVSLELIARRTPGFSGADLANL 395
>gnl|CDD|224144 COG1223, COG1223, Predicted ATPase (AAA+ superfamily) [General
function prediction only].
Length = 368
Score = 132 bits (334), Expect = 8e-36
Identities = 75/218 (34%), Positives = 115/218 (52%), Gaps = 22/218 (10%)
Query: 117 MEKPNVKWTDVAGLEAAKEALKEAVILP-IKFPQLFTGKRVPWKGILLFGPPGTGKSYLA 175
++ DV G E AK + +I+ ++ P+ F G P K +L +GPPGTGK+ +A
Sbjct: 113 EIISDITLDDVIGQEEAKRKCR--LIMEYLENPERF-GDWAP-KNVLFYGPPGTGKTMMA 168
Query: 176 KAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDE----RIDRG 231
KA+A EA V +++L+ + +G+ + + L+E AR P I+FIDE +DR
Sbjct: 169 KALANEAK-VPLLLVKATELIGEHVGDGARRIHELYERARKAAPCIVFIDELDAIALDRR 227
Query: 232 ----------ISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRRRFEKRIYIALPEL 281
I NAL L G+ + +G++ + ATN P +LD AIR RFE+ I LP
Sbjct: 228 YQELRGDVSEIVNALLTELDGI-KENEGVVTIAATNRPELLDPAIRSRFEEEIEFKLPND 286
Query: 282 NARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADI 319
RL + ++ D +L+ LA +T+G SG DI
Sbjct: 287 EERLEILEYYA-KKFPLPVDADLRYLAAKTKGMSGRDI 323
>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
protein; Provisional.
Length = 438
Score = 121 bits (306), Expect = 2e-31
Identities = 87/247 (35%), Positives = 132/247 (53%), Gaps = 39/247 (15%)
Query: 97 DDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLF--TGK 154
D DP M+ V + P + D+ GLE + +KEAV LP+ P+L+ G
Sbjct: 162 LDEVDPLVSVMK-------VDKAPLESYADIGGLEQQIQEIKEAVELPLTHPELYDDIGI 214
Query: 155 RVPWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELA 214
+ P KG++L+GPPGTGK+ LAKAVA E + +TF V S+L+ K+LG+ KLV+ LF +A
Sbjct: 215 KPP-KGVILYGPPGTGKTLLAKAVANETS-ATFLRVVGSELIQKYLGDGPKLVRELFRVA 272
Query: 215 RAHRPSIIFIDE-------RID------RGISNALFLLLSGVGNDMDG------ILVLGA 255
+ PSI+FIDE R D + I + LL N +DG + V+ A
Sbjct: 273 EENAPSIVFIDEIDAIGTKRYDATSGGEKEIQRTMLELL----NQLDGFDSRGDVKVIMA 328
Query: 256 TNIPWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTRNTISDE-NLKELAKRTE 312
TN LD A+ R R +++I P+ + +F H ++ T++++ +L+E +
Sbjct: 329 TNRIESLDPALIRPGRIDRKIEFPNPDEKTKRRIFEIH--TSKMTLAEDVDLEEFIMAKD 386
Query: 313 GYSGADI 319
SGADI
Sbjct: 387 ELSGADI 393
>gnl|CDD|182683 PRK10733, hflB, ATP-dependent metalloprotease; Reviewed.
Length = 644
Score = 115 bits (289), Expect = 2e-28
Identities = 84/222 (37%), Positives = 125/222 (56%), Gaps = 31/222 (13%)
Query: 124 WTDVAGLEAAKEALKEAVIL---PIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVAT 180
+ DVAG + AKE + E V P +F +L G ++P KG+L+ GPPGTGK+ LAKA+A
Sbjct: 151 FADVAGCDEAKEEVAELVEYLREPSRFQKL--GGKIP-KGVLMVGPPGTGKTLLAKAIAG 207
Query: 181 EANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERID-----RGIS-- 233
EA FF++S SD V ++G V+ +FE A+ P IIFIDE ID RG
Sbjct: 208 EA-KVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKAAPCIIFIDE-IDAVGRQRGAGLG 265
Query: 234 ----------NALFLLLSGV-GNDMDGILVLGATNIPWVLDAAIRR--RFEKRIYIALPE 280
N + + + G GN +GI+V+ ATN P VLD A+ R RF++++ + LP+
Sbjct: 266 GGHDEREQTLNQMLVEMDGFEGN--EGIIVIAATNRPDVLDPALLRPGRFDRQVVVGLPD 323
Query: 281 LNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
+ R + + H+ D + +A+ T G+SGAD++ L
Sbjct: 324 VRGREQILKVHMRRV-PLAPDIDAAIIARGTPGFSGADLANL 364
>gnl|CDD|239141 cd02678, MIT_VPS4, MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This sub-family
of MIT domains is found in intracellular protein
transport proteins of the AAA-ATPase family. The
molecular function of the MIT domain is unclear.
Length = 75
Score = 101 bits (255), Expect = 2e-27
Identities = 43/73 (58%), Positives = 53/73 (72%), Gaps = 1/73 (1%)
Query: 5 DKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLAR 64
K I + KA E D NYEEAL+LY H+++YF+HALKYE + + K+SIR KC EYL R
Sbjct: 4 QKAIELVKKAIEEDNAGNYEEALRLYQHALEYFMHALKYE-KNPKSKESIRAKCTEYLDR 62
Query: 65 AEKLKEYLKKGKK 77
AEKLKEYL K +K
Sbjct: 63 AEKLKEYLAKKEK 75
>gnl|CDD|200312 TIGR03689, pup_AAA, proteasome ATPase. In the Actinobacteria, as
shown for Mycobacterium tuberculosis, some proteins are
modified by ligation between an epsilon-amino group of a
lysine side chain and the C-terminal carboxylate of the
ubiquitin-like protein Pup. This modification leads to
protein degradation by the archaeal-like proteasome
found in the Actinobacteria. Members of this protein
family belong to the AAA family of ATPases and tend to
be clustered with the genes for Pup, the Pup ligase
PafA, and structural components of the proteasome. This
protein forms hexameric rings with ATPase activity
[Protein fate, Degradation of proteins, peptides, and
glycopeptides].
Length = 512
Score = 105 bits (264), Expect = 3e-25
Identities = 67/213 (31%), Positives = 112/213 (52%), Gaps = 41/213 (19%)
Query: 115 IVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-PWKGILLFGPPGTGKSY 173
++ E P+V + D+ GL + E +++AV LP P+L+ + P KG+LL+GPPG GK+
Sbjct: 172 VLEEVPDVTYADIGGLGSQIEQIRDAVELPFLHPELYREYGLKPPKGVLLYGPPGCGKTL 231
Query: 174 LAKAVATE---------ANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAH----RPS 220
+AKAVA S F ++ +L++K++GE+E+ ++ +F+ AR RP
Sbjct: 232 IAKAVANSLAARIGAEGGGKSYFLNIKGPELLNKYVGETERQIRLIFQRAREKASEGRPV 291
Query: 221 IIFIDERIDRGISNALFLLL-SGVGNDM------------------DGILVLGATNIPWV 261
I+F DE ++LF SGV +D+ D ++V+GA+N +
Sbjct: 292 IVFFDEM------DSLFRTRGSGVSSDVETTVVPQLLAEIDGVESLDNVIVIGASNREDM 345
Query: 262 LDAAIRR--RFEKRIYIALPELNARLFMFRHHL 292
+D AI R R + +I I P+ A +F +L
Sbjct: 346 IDPAILRPGRLDVKIRIERPDAEAAADIFAKYL 378
>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
cellular Activities) superfamily represents an ancient
group of ATPases belonging to the ASCE (for additional
strand, catalytic E) division of the P-loop NTPase fold.
The ASCE division also includes ABC, RecA-like,
VirD4-like, PilT-like, and SF1/2 helicases. Members of
the AAA+ ATPases function as molecular chaperons, ATPase
subunits of proteases, helicases, or nucleic-acid
stimulated ATPases. The AAA+ proteins contain several
distinct features in addition to the conserved
alpha-beta-alpha core domain structure and the Walker A
and B motifs of the P-loop NTPases.
Length = 151
Score = 95.7 bits (238), Expect = 4e-24
Identities = 62/163 (38%), Positives = 89/163 (54%), Gaps = 24/163 (14%)
Query: 128 AGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEA--NNS 185
G E A EAL+EA+ LP P K +LL+GPPGTGK+ LA+A+A E +
Sbjct: 1 VGQEEAIEALREALELP------------PPKNLLLYGPPGTGKTTLARAIANELFRPGA 48
Query: 186 TFFSVSSSDLVSKWLGESEK---LVKTLFELARAHRPSIIFIDE--RIDRGISNALFLLL 240
F +++SDL+ + LV+ LFELA +P ++FIDE + RG NAL +L
Sbjct: 49 PFLYLNASDLLEGLVVAELFGHFLVRLLFELAEKAKPGVLFIDEIDSLSRGAQNALLRVL 108
Query: 241 SGVGNDM---DGILVLGATNIPWV--LDAAIRRRFEKRIYIAL 278
+ + + + V+GATN P + LD A+ R + RI I L
Sbjct: 109 ETLNDLRIDRENVRVIGATNRPLLGDLDRALYDRLDIRIVIPL 151
>gnl|CDD|146707 pfam04212, MIT, MIT (microtubule interacting and transport)
domain. The MIT domain forms an asymmetric three-helix
bundle and binds ESCRT-III (endosomal sorting complexes
required for transport) substrates.
Length = 69
Score = 78.4 bits (194), Expect = 1e-18
Identities = 33/69 (47%), Positives = 48/69 (69%), Gaps = 1/69 (1%)
Query: 4 FDKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLA 63
+K + + KA EAD+ NYEEAL+LY +++Y L ALKYE + ++++R+K EYL
Sbjct: 2 LEKALELVKKAVEADEAGNYEEALELYKEAIEYLLQALKYE-PDPKRREALRQKIAEYLD 60
Query: 64 RAEKLKEYL 72
RAE+LKE L
Sbjct: 61 RAEELKELL 69
>gnl|CDD|239121 cd02656, MIT, MIT: domain contained within Microtubule
Interacting and Trafficking molecules. The MIT domain
is found in sorting nexins, the nuclear thiol protease
PalBH, the AAA protein spastin and archaebacterial
proteins with similar domain architecture, vacuolar
sorting proteins and others. The molecular function of
the MIT domain is unclear.
Length = 75
Score = 75.8 bits (187), Expect = 1e-17
Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 1/73 (1%)
Query: 5 DKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLAR 64
+ + +A + D++ NYEEAL+LY ++ Y L ALK E + ++ +R+K EYL R
Sbjct: 4 QQAKELIKQAVKEDEDGNYEEALELYKEALDYLLQALKAEKEP-KLRKLLRKKVKEYLDR 62
Query: 65 AEKLKEYLKKGKK 77
AE LKE LKK K+
Sbjct: 63 AEFLKELLKKQKQ 75
>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
activities. AAA - ATPases associated with a variety of
cellular activities. This profile/alignment only detects
a fraction of this vast family. The poorly conserved
N-terminal helix is missing from the alignment.
Length = 148
Score = 73.2 bits (179), Expect = 7e-16
Identities = 41/148 (27%), Positives = 64/148 (43%), Gaps = 24/148 (16%)
Query: 157 PWKGILLFGPPGTGKSYLAKAVATEAN--NSTFFSVSSSDL--------------VSKWL 200
P + IL+ GPPG+GK+ LA+A+A E + D+ K
Sbjct: 1 PGEVILIVGPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKKAS 60
Query: 201 GESEKLVKTLFELARAHRPSIIFIDE--RIDRGISNALFLLLSGVGNDM-----DGILVL 253
G E ++ LAR +P ++ +DE + AL LLL + + + V+
Sbjct: 61 GSGELRLRLALALARKLKPDVLILDEITSLLDAEQEALLLLLEELRLLLLLKSEKNLTVI 120
Query: 254 GATNIPWVLD-AAIRRRFEKRIYIALPE 280
TN L A +RRRF++RI + L
Sbjct: 121 LTTNDEKDLGPALLRRRFDRRIVLLLIL 148
>gnl|CDD|177094 CHL00195, ycf46, Ycf46; Provisional.
Length = 489
Score = 75.4 bits (186), Expect = 4e-15
Identities = 66/222 (29%), Positives = 103/222 (46%), Gaps = 31/222 (13%)
Query: 120 PNVKWTDVAGLEAAKEALKEAVILPIKFPQLFT------GKRVPWKGILLFGPPGTGKSY 173
N K +D+ GL+ K+ LK K F+ G P +G+LL G GTGKS
Sbjct: 223 VNEKISDIGGLDNLKDWLK-------KRSTSFSKQASNYGLPTP-RGLLLVGIQGTGKSL 274
Query: 174 LAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERIDRGIS 233
AKA+A + + L +GESE ++ + +A A P I++IDE ID+ S
Sbjct: 275 TAKAIANDW-QLPLLRLDVGKLFGGIVGESESRMRQMIRIAEALSPCILWIDE-IDKAFS 332
Query: 234 NALFLLLSGVGNDMDGIL-------------VLGATNIPWVLDAAIRR-RFEKRIYIALP 279
N+ SG N + V A NI + +R+ RF++ ++ LP
Sbjct: 333 NSESKGDSGTTNRVLATFITWLSEKKSPVFVVATANNIDLLPLEILRKGRFDEIFFLDLP 392
Query: 280 ELNARLFMFRHHLGNTR-NTISDENLKELAKRTEGYSGADIS 320
L R +F+ HL R + ++K+L+K + +SGA+I
Sbjct: 393 SLEEREKIFKIHLQKFRPKSWKKYDIKKLSKLSNKFSGAEIE 434
>gnl|CDD|197854 smart00745, MIT, Microtubule Interacting and Trafficking molecule
domain.
Length = 77
Score = 68.1 bits (167), Expect = 8e-15
Identities = 26/72 (36%), Positives = 45/72 (62%), Gaps = 1/72 (1%)
Query: 2 TTFDKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEY 61
K + KA +AD+ NYEEAL+LY +++Y L +K E + ++ +++++ K EY
Sbjct: 3 DYLSKAKELISKALKADEAGNYEEALELYKKAIEYLLEGIKVE-SDSKRREALKAKAAEY 61
Query: 62 LARAEKLKEYLK 73
L RAE++K+ L
Sbjct: 62 LDRAEEIKKSLL 73
>gnl|CDD|239147 cd02684, MIT_2, MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This sub-family
of MIT domains is found in proteins with an n-terminal
serine/threonine kinase domain. The molecular function
of the MIT domain is unclear.
Length = 75
Score = 56.7 bits (137), Expect = 1e-10
Identities = 26/70 (37%), Positives = 47/70 (67%), Gaps = 1/70 (1%)
Query: 3 TFDKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYL 62
+ +K I + ++A + D+ + AL LY ++QYF+ AL YE T A+ K+++R+K +Y+
Sbjct: 2 SLEKAIALVVQAVKKDQRGDAAAALSLYCSALQYFVPALHYE-TDAQRKEALRQKVLQYV 60
Query: 63 ARAEKLKEYL 72
+RAE+LK +
Sbjct: 61 SRAEELKALI 70
>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily). This Pfam
entry includes some of the AAA proteins not detected by
the pfam00004 model.
Length = 135
Score = 54.6 bits (132), Expect = 2e-09
Identities = 41/136 (30%), Positives = 54/136 (39%), Gaps = 26/136 (19%)
Query: 160 GILLFGPPGTGKSYLAKAVATEANNSTFFSV------SSSDLVSKWLGESEKLVKTLFEL 213
G+LL GPPGTGKS LA+ +A +N F V + DL + + L
Sbjct: 1 GVLLVGPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPL 60
Query: 214 ARAHRPS-IIFIDERIDRGISNAL-----------FLLLSG---VGNDMDGILVLGATNI 258
RA R I +DE I+R + L LL G V DG ++ N
Sbjct: 61 VRAAREGEIAVLDE-INRANPDVLNSLLSLLDERRLLLPEGGELVKAAPDGFRLIATMNP 119
Query: 259 P----WVLDAAIRRRF 270
L A+R RF
Sbjct: 120 LDRGLNELSPALRSRF 135
>gnl|CDD|239146 cd02683, MIT_1, MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This sub-family
of MIT domains is found in proteins with unknown
function, co-occuring with an as yet undescribed
domain. The molecular function of the MIT domain is
unclear.
Length = 77
Score = 45.5 bits (108), Expect = 9e-07
Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 1/71 (1%)
Query: 6 KGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARA 65
+ +A E D+ ++EAL Y + + LK A+ K+++R+K EY+ RA
Sbjct: 5 AAKEVLKRAVELDQEGRFQEALVCYQEGIDLLMQVLKGTKDEAK-KKNLRQKISEYMDRA 63
Query: 66 EKLKEYLKKGK 76
E +K+ L + K
Sbjct: 64 EAIKKRLDQEK 74
>gnl|CDD|188437 TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA. This
model represents the AAA family ATPase, EccA, of the
actinobacterial flavor of type VII secretion systems.
Species such as Mycobacterium tuberculosis have several
instances of this system per genome, designated EccA1,
EccA2, etc [Protein fate, Protein and peptide secretion
and trafficking].
Length = 557
Score = 47.9 bits (114), Expect = 4e-06
Identities = 54/219 (24%), Positives = 91/219 (41%), Gaps = 38/219 (17%)
Query: 84 ESRTK--DDKKESDEDDSEDPDKKKMQANL--EGAIVMEKPNVKWTDVAGLEAAKE---A 136
E+RT D + DP + +A L E + + GLE K A
Sbjct: 238 EARTDPWDPSSAPSRAEFVDPAAAERKAKLLAEAEAELAE-------QIGLERVKRQVAA 290
Query: 137 LKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVA------TEANNSTFFSV 190
LK + + + + +L GPPGTGK+ +A+ VA V
Sbjct: 291 LKSSTAMALARAERGLPVAQTSNHMLFAGPPGTGKTTIARVVAKIYCGLGVLRKPLVREV 350
Query: 191 SSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDER---IDRGISN-------ALFLLL 240
S +DL+ +++GESE + + A ++F+DE ++ G A+ LL
Sbjct: 351 SRADLIGQYIGESEAKTNEIIDSALG---GVLFLDEAYTLVETGYGQKDPFGLEAIDTLL 407
Query: 241 SGVGNDMDGILVLGA---TNIPWVLDA--AIRRRFEKRI 274
+ + ND D ++V+GA ++ L+ +R RF + I
Sbjct: 408 ARMENDRDRLVVIGAGYRKDLDKFLEVNEGLRSRFTRVI 446
>gnl|CDD|225165 COG2256, MGS1, ATPase related to the helicase subunit of the
Holliday junction resolvase [DNA replication,
recombination, and repair].
Length = 436
Score = 47.2 bits (113), Expect = 6e-06
Identities = 47/184 (25%), Positives = 79/184 (42%), Gaps = 45/184 (24%)
Query: 163 LFGPPGTGKSYLAKAVATEAN------NSTFFSVSSSDLVSKWLGESEKLVKTLFELARA 216
L+GPPGTGK+ LA+ +A N ++ V K ++ + E AR
Sbjct: 53 LWGPPGTGKTTLARLIAGTTNAAFEALSAVTSGV--------------KDLREIIEEARK 98
Query: 217 HRPS----IIFIDE--RIDRGISNALFLLLSGVGNDMDGILVLGAT--NIPWVLDAAIRR 268
+R I+F+DE R ++ +AL L V N I+++GAT N + L+ A+
Sbjct: 99 NRLLGRRTILFLDEIHRFNKAQQDAL---LPHVENGT--IILIGATTENPSFELNPALLS 153
Query: 269 RFEKRIYIALPELNARLFMFR---------HHLGNTRNTISDENLKELAKRTEGYSGADI 319
R R++ L L++ LG + +E L L + + G + +
Sbjct: 154 R--ARVFE-LKPLSSEDIKKLLKRALLDEERGLGGQIIVLDEEALDYLVRLSNGDARRAL 210
Query: 320 STLE 323
+ LE
Sbjct: 211 NLLE 214
>gnl|CDD|237355 PRK13342, PRK13342, recombination factor protein RarA; Reviewed.
Length = 413
Score = 47.0 bits (113), Expect = 7e-06
Identities = 41/164 (25%), Positives = 73/164 (44%), Gaps = 54/164 (32%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN------NSTFFSVSSSDLVSKWLGESEKLVKTLFELA 214
++L+GPPGTGK+ LA+ +A + ++ V K ++ + E A
Sbjct: 39 MILWGPPGTGKTTLARIIAGATDAPFEALSAVTSGV--------------KDLREVIEEA 84
Query: 215 RAHRPS----IIFIDE--RIDRGISNALFLLLSGVGNDMDGILVLGAT--NIPWVLDAAI 266
R R + I+FIDE R ++ +A LL V + I ++GAT N + ++ A+
Sbjct: 85 RQRRSAGRRTILFIDEIHRFNKAQQDA---LLPHVEDGT--ITLIGATTENPSFEVNPAL 139
Query: 267 RRRFEKRIYIALPELNARLFMFRHHLGNTRNTISDENLKELAKR 310
R A++F + S+E++++L KR
Sbjct: 140 LSR-------------AQVFELKPL--------SEEDIEQLLKR 162
>gnl|CDD|223786 COG0714, COG0714, MoxR-like ATPases [General function prediction
only].
Length = 329
Score = 45.5 bits (108), Expect = 2e-05
Identities = 41/165 (24%), Positives = 58/165 (35%), Gaps = 30/165 (18%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVS------SSDLVSKWLGESEKLVKTLFE-- 212
+LL GPPG GK+ LA+A+A A F + SDL+ + + L F
Sbjct: 46 VLLEGPPGVGKTLLARALA-RALGLPFVRIQCTPDLLPSDLLGTYAYAALLLEPGEFRFV 104
Query: 213 ---LARAHRPSIIFIDE--RIDRGISNALFLLLS-------GVGND--MDGILVLGATNI 258
L A R I+ +DE R + NAL L G+ +V+ N
Sbjct: 105 PGPLFAAVRV-ILLLDEINRAPPEVQNALLEALEERQVTVPGLTTIRLPPPFIVIATQNP 163
Query: 259 PWV-----LDAAIRRRFEKRIYIA-LPELNARLFMFRHHLGNTRN 297
L A+ RF RIY+ + G
Sbjct: 164 GEYEGTYPLPEALLDRFLLRIYVDYPDSEEEERIILARVGGVDEL 208
>gnl|CDD|219536 pfam07724, AAA_2, AAA domain (Cdc48 subfamily). This Pfam entry
includes some of the AAA proteins not detected by the
pfam00004 model.
Length = 168
Score = 44.1 bits (105), Expect = 2e-05
Identities = 29/96 (30%), Positives = 39/96 (40%), Gaps = 14/96 (14%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNS--TFFSVSSSDL-----VSKWLGESEKLVK----- 208
L GP G GK+ LAKA+A + S+ VS+ +G V
Sbjct: 6 FLFLGPTGVGKTELAKALAELLFGDERALIRIDMSEYMEEHSVSRLIGAPPGYVGYEEGG 65
Query: 209 TLFELARAHRPSIIFIDE--RIDRGISNALFLLLSG 242
L E R SI+ IDE + G+ N L +L G
Sbjct: 66 QLTEAVRRKPYSIVLIDEIEKAHPGVQNDLLQILEG 101
>gnl|CDD|222104 pfam13401, AAA_22, AAA domain.
Length = 124
Score = 43.0 bits (102), Expect = 2e-05
Identities = 17/81 (20%), Positives = 34/81 (41%), Gaps = 15/81 (18%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSV------SSSDLVSKWLGE---------SEK 205
+L G G+GK+ L + +A + N V + DL+ K L + +
Sbjct: 7 GVLTGESGSGKTTLLRRLARQLPNRRVVYVEAPSLGTPKDLLRKILRALGLPLSGGTTAE 66
Query: 206 LVKTLFELARAHRPSIIFIDE 226
L++ + + + ++ IDE
Sbjct: 67 LLEAILDALKRRGRPLLIIDE 87
>gnl|CDD|224145 COG1224, TIP49, DNA helicase TIP49, TBP-interacting protein
[Transcription].
Length = 450
Score = 43.1 bits (102), Expect = 1e-04
Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 8/83 (9%)
Query: 129 GLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEANNST-F 187
G E A+EA VI+ ++ ++ +GIL+ GPPGTGK+ LA +A E F
Sbjct: 43 GQEEAREAA--GVIV-----KMIKQGKMAGRGILIVGPPGTGKTALAMGIARELGEDVPF 95
Query: 188 FSVSSSDLVSKWLGESEKLVKTL 210
++S S++ S + ++E L + L
Sbjct: 96 VAISGSEIYSLEVKKTEALTQAL 118
>gnl|CDD|147949 pfam06068, TIP49, TIP49 C-terminus. This family consists of the
C-terminal region of several eukaryotic and archaeal
RuvB-like 1 (Pontin or TIP49a) and RuvB-like 2 (Reptin
or TIP49b) proteins. The N-terminal domain contains the
pfam00004 domain. In zebrafish, the liebeskummer (lik)
mutation, causes development of hyperplastic embryonic
hearts. lik encodes Reptin, a component of a
DNA-stimulated ATPase complex. Beta-catenin and Pontin,
a DNA-stimulated ATPase that is often part of complexes
with Reptin, are in the same genetic pathways. The
Reptin/Pontin ratio serves to regulate heart growth
during development, at least in part via the
beta-catenin pathway. TBP-interacting protein 49 (TIP49)
was originally identified as a TBP-binding protein, and
two related proteins are encoded by individual genes,
tip49a and b. Although the function of this gene family
has not been elucidated, they are supposed to play a
critical role in nuclear events because they interact
with various kinds of nuclear factors and have DNA
helicase activities.TIP49a has been suggested to act as
an autoantigen in some patients with autoimmune
diseases.
Length = 395
Score = 43.1 bits (102), Expect = 1e-04
Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 8/79 (10%)
Query: 129 GLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEANNST-F 187
G E A+EA VI+ ++ ++ + +L+ GPPGTGK+ LA A++ E T F
Sbjct: 28 GQEKAREAA--GVIV-----EMIKEGKIAGRAVLIAGPPGTGKTALAIAISKELGEDTPF 80
Query: 188 FSVSSSDLVSKWLGESEKL 206
+S S++ S + ++E L
Sbjct: 81 CPISGSEVYSLEMKKTEAL 99
>gnl|CDD|221983 pfam13207, AAA_17, AAA domain.
Length = 114
Score = 39.2 bits (91), Expect = 3e-04
Identities = 26/124 (20%), Positives = 46/124 (37%), Gaps = 21/124 (16%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPS 220
IL+ GPPG+GKS LAK +A + +S DL+ + L EL
Sbjct: 2 ILITGPPGSGKSTLAKKLAEKLG---IPVISLDDLL---------REEGLAELDDGELDD 49
Query: 221 IIFIDERIDRGISNALFL--LLSGVGNDMDGILVLGATNIPW-------VLDAAIRRRFE 271
I E ++ + ++ GV + + A + + ++RR +
Sbjct: 50 IDIDLELLEEILDELAKQEWVIDGVRESTLELRLEEADLVVFLDLPLPACRFRLLKRRLQ 109
Query: 272 KRIY 275
+
Sbjct: 110 RGRG 113
>gnl|CDD|223679 COG0606, COG0606, Predicted ATPase with chaperone activity
[Posttranslational modification, protein turnover,
chaperones].
Length = 490
Score = 41.5 bits (98), Expect = 4e-04
Identities = 21/51 (41%), Positives = 24/51 (47%), Gaps = 14/51 (27%)
Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAK 176
DV G E AK AL+ A G +LL GPPGTGK+ LA
Sbjct: 180 DVKGQEQAKRALEIAA----------AGG----HNLLLVGPPGTGKTMLAS 216
>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
recombination, and repair].
Length = 254
Score = 40.5 bits (95), Expect = 7e-04
Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 14/93 (15%)
Query: 159 KGILLFGPPGTGKSYLAKAVATEANN---STFFSVSSSDLVSKW---LGESEKLVKTLFE 212
+ ++L GPPG GK++LA A+ E S F +++ DL+SK E K L E
Sbjct: 106 ENLVLLGPPGVGKTHLAIAIGNELLKAGISVLF-ITAPDLLSKLKAAFDEGRLEEKLLRE 164
Query: 213 LARAHRPSIIFIDE----RIDRGISNALFLLLS 241
L + ++ ID+ + ++ LF L+S
Sbjct: 165 LKKV---DLLIIDDIGYEPFSQEEADLLFQLIS 194
>gnl|CDD|144608 pfam01078, Mg_chelatase, Magnesium chelatase, subunit ChlI.
Magnesium-chelatase is a three-component enzyme that
catalyzes the insertion of Mg2+ into protoporphyrin IX.
This is the first unique step in the synthesis of
(bacterio)chlorophyll. Due to this, it is thought that
Mg-chelatase has an important role in channelling inter-
mediates into the (bacterio)chlorophyll branch in
response to conditions suitable for photosynthetic
growth. ChlI and BchD have molecular weight between
38-42 kDa.
Length = 207
Score = 39.8 bits (94), Expect = 7e-04
Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 20/60 (33%)
Query: 124 WTDVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKG---ILLFGPPGTGKSYLAKAVAT 180
DV G E AK AL+ A G +L+ GPPG+GK+ LAK +
Sbjct: 2 LADVKGQEQAKRALEIAAA-----------------GGHNLLMIGPPGSGKTMLAKRLPG 44
>gnl|CDD|214396 CHL00206, ycf2, Ycf2; Provisional.
Length = 2281
Score = 41.0 bits (96), Expect = 8e-04
Identities = 41/163 (25%), Positives = 62/163 (38%), Gaps = 57/163 (34%)
Query: 157 PWKGILLFGPPGTGKSYLAKAVAT------------------------------------ 180
P +GIL+ G GTG+SYL K +AT
Sbjct: 1629 PSRGILVIGSIGTGRSYLVKYLATNSYVPFITVFLNKFLDNKPKGFLIDDIDIDDSDDID 1688
Query: 181 ---------EANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERIDRG 231
+ T + + D++ K + FELA+A P II+I D
Sbjct: 1689 DSDDIDRDLDTELLTMMNALTMDMMPK---IDRFYITLQFELAKAMSPCIIWIPNIHDLN 1745
Query: 232 I--SNALFL--LLSGVGNDMD-----GILVLGATNIPWVLDAA 265
+ SN L L L++ + D + ILV+ +T+IP +D A
Sbjct: 1746 VNESNYLSLGLLVNSLSRDCERCSTRNILVIASTHIPQKVDPA 1788
>gnl|CDD|110677 pfam01695, IstB_IS21, IstB-like ATP binding protein. This protein
contains an ATP/GTP binding P-loop motif. It is found
associated with IS21 family insertion sequences. The
function of this protein is unknown, but it may perform
a transposase function.
Length = 178
Score = 38.4 bits (90), Expect = 0.002
Identities = 29/92 (31%), Positives = 49/92 (53%), Gaps = 15/92 (16%)
Query: 160 GILLFGPPGTGKSYLAKAVATEA---NNSTFFSVSSSDLVSKWLGESE---KLVKTLFEL 213
+LL GPPG GK++LA A+ +A S F + DLV + L + +L +TL L
Sbjct: 49 NLLLLGPPGVGKTHLACALGHQACRAGYSVLF-TRTPDLVEQ-LKRARGDGRLARTLQRL 106
Query: 214 ARAHRPSIIFIDE----RIDRGISNALFLLLS 241
A+A ++ +D+ + + ++ LF L+S
Sbjct: 107 AKA---DLLILDDIGYLPLSQEAAHLLFELIS 135
>gnl|CDD|223637 COG0563, Adk, Adenylate kinase and related kinases [Nucleotide
transport and metabolism].
Length = 178
Score = 38.0 bits (89), Expect = 0.002
Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 8/87 (9%)
Query: 159 KGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHR 218
IL+ GPPG GKS LAK + A + + D++ + E +L + + +
Sbjct: 1 MRILILGPPGAGKSTLAKKL---AKKLGLPHLDTGDILRAAIAERTELGEEIKKYIDKGE 57
Query: 219 --PS---IIFIDERIDRGISNALFLLL 240
P + ER+D A F+L
Sbjct: 58 LVPDEIVNGLVKERLDEADCKAGFILD 84
>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
Length = 268
Score = 38.1 bits (89), Expect = 0.003
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 160 GILLFGPPGTGKSYLAKAVATE 181
G+LL+G GTGK+YLA +A E
Sbjct: 116 GLLLWGSVGTGKTYLAACIANE 137
>gnl|CDD|216186 pfam00910, RNA_helicase, RNA helicase. This family includes RNA
helicases thought to be involved in duplex unwinding
during viral RNA replication. Members of this family are
found in a variety of single stranded RNA viruses.
Length = 105
Score = 36.1 bits (84), Expect = 0.004
Identities = 14/33 (42%), Positives = 16/33 (48%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSS 193
I L+GPPG GKS LAK +A S
Sbjct: 1 IWLYGPPGCGKSTLAKYLARALLKHLGLPKKDS 33
>gnl|CDD|136670 PRK08181, PRK08181, transposase; Validated.
Length = 269
Score = 36.8 bits (85), Expect = 0.011
Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 7/87 (8%)
Query: 161 ILLFGPPGTGKSYLAKAV--ATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHR 218
+LLFGPPG GKS+LA A+ A N ++DLV K L + + ++ +A+ +
Sbjct: 109 LLLFGPPGGGKSHLAAAIGLALIENGWRVLFTRTTDLVQK-LQVARRELQLESAIAKLDK 167
Query: 219 PSIIFIDERI----DRGISNALFLLLS 241
++ +D+ D+ ++ LF L+S
Sbjct: 168 FDLLILDDLAYVTKDQAETSVLFELIS 194
>gnl|CDD|237495 PRK13765, PRK13765, ATP-dependent protease Lon; Provisional.
Length = 637
Score = 36.5 bits (85), Expect = 0.015
Identities = 26/85 (30%), Positives = 40/85 (47%), Gaps = 14/85 (16%)
Query: 95 DEDDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGK 154
+E++ ED + + E +E P V G E A E +K+A +
Sbjct: 1 EEEEDEDEEDLLGGLDFETTSDIEVPERLIDQVIGQEHAVEVIKKAAK-----------Q 49
Query: 155 RVPWKGILLFGPPGTGKSYLAKAVA 179
R + +++ G PGTGKS LAKA+A
Sbjct: 50 R---RHVMMIGSPGTGKSMLAKAMA 71
>gnl|CDD|234619 PRK00080, ruvB, Holliday junction DNA helicase RuvB; Reviewed.
Length = 328
Score = 35.5 bits (83), Expect = 0.027
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 161 ILLFGPPGTGKSYLAKAVATE 181
+LL+GPPG GK+ LA +A E
Sbjct: 54 VLLYGPPGLGKTTLANIIANE 74
>gnl|CDD|225164 COG2255, RuvB, Holliday junction resolvasome, helicase subunit [DNA
replication, recombination, and repair].
Length = 332
Score = 35.6 bits (83), Expect = 0.028
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
+LLFGPPG GK+ LA +A E
Sbjct: 55 VLLFGPPGLGKTTLAHIIANELG 77
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 35.7 bits (83), Expect = 0.029
Identities = 44/192 (22%), Positives = 82/192 (42%), Gaps = 39/192 (20%)
Query: 35 QYFLHALKYEITSAEGKQSIREKCDEYLARAEK---LKEYLKKGKKQPVKDGESRTKDDK 91
+ L LK E+ + + I +K +E + + ++ L+E LK KK+ + + + + +K
Sbjct: 192 KKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEK 251
Query: 92 KESDEDDSEDPD--KKKMQANLEGAIVMEKP---------------NVKWTDV----AGL 130
+ ++ + P+ KK ++ L ++E ++ W L
Sbjct: 252 LKEKLEELKLPEEVKKVIEKELTKLSLLEPSSSEFTVTRNYLDWLTDLPWGKYSKENLDL 311
Query: 131 EAAKEAL----------KEAVILPIKFPQLFTGKRVPWKGILLF-GPPGTGKSYLAKAVA 179
+ AKE L KE ++ + +L + P IL GPPG GK+ L K++A
Sbjct: 312 KRAKEILDEDHYGLKKVKERILEYLAVQKLRGKMKGP---ILCLVGPPGVGKTSLGKSIA 368
Query: 180 TEANNSTFFSVS 191
+A N F S
Sbjct: 369 -KALNRKFVRFS 379
>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
nucleotide-binding domain. ABC transporters are a large
family of proteins involved in the transport of a wide
variety of different compounds, like sugars, ions,
peptides, and more complex organic molecules. The
nucleotide-binding domain shows the highest similarity
between all members of the family. ABC transporters are
a subset of nucleotide hydrolases that contain a
signature motif, Q-loop, and H-loop/switch region, in
addition to, the Walker A motif/P-loop and Walker B
motif commonly found in a number of ATP- and GTP-binding
and hydrolyzing proteins.
Length = 157
Score = 34.5 bits (80), Expect = 0.030
Identities = 20/84 (23%), Positives = 34/84 (40%), Gaps = 23/84 (27%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNS----TFFSVSSSDLVSKWL------------GESE 204
+ L GP G+GKS L +A+A + + L + L G+ +
Sbjct: 28 VALVGPNGSGKSTLLRAIAGLLKPTSGEILIDGKDIAKLPLEELRRRIGYVPQLSGGQRQ 87
Query: 205 KLVKTLFELARA--HRPSIIFIDE 226
++ LARA P ++ +DE
Sbjct: 88 RV-----ALARALLLNPDLLLLDE 106
>gnl|CDD|225806 COG3267, ExeA, Type II secretory pathway, component ExeA (predicted
ATPase) [Intracellular trafficking and secretion].
Length = 269
Score = 35.1 bits (81), Expect = 0.031
Identities = 41/189 (21%), Positives = 73/189 (38%), Gaps = 33/189 (17%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSD-----------LVSKWLGESEKLVKT 209
+ + G G+GK+ L +A+ N V +V+ + + V
Sbjct: 54 LAVTGEVGSGKTVLRRALLASLNEDQVAVVVIDKPTLSDATLLEAIVADLESQPKVNVNA 113
Query: 210 LFE---------LARAHRPSIIFIDERIDRGIS--NALFLLLSGV--GNDMDGILVLGAT 256
+ E + + RP ++ +DE D S AL LL + + + I+++G
Sbjct: 114 VLEQIDRELAALVKKGKRPVVLMVDEAHDLNDSALEALRLLTNLEEDSSKLLSIVLIGQP 173
Query: 257 NIPWVLDAAIRRRFEKR--IYIALPELN---ARLFMFRHHL---GNTRNTISDENLKELA 308
+ L + R E+R I I LP L L++ RH L G SD+ L +
Sbjct: 174 KLRPRLRLPVLRELEQRIDIRIELPPLTEAETGLYL-RHRLEGAGLPEPLFSDDALLLIH 232
Query: 309 KRTEGYSGA 317
+ ++G
Sbjct: 233 EASQGIPRL 241
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 35.7 bits (83), Expect = 0.033
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 11/76 (14%)
Query: 162 LLFGPPGTGKSYLAKAVATE-AN--------NSTFFSVSSSDLVS--KWLGESEKLVKTL 210
+L G PG GK+ + + +A N + +S+ LV+ K+ GE E+ +K +
Sbjct: 195 VLVGEPGVGKTAIVEGLAQRIVNGDVPESLKDKRIYSLDLGSLVAGAKYRGEFEERLKAV 254
Query: 211 FELARAHRPSIIFIDE 226
+ + I+FIDE
Sbjct: 255 LKEVEKSKNVILFIDE 270
Score = 28.0 bits (63), Expect = 9.2
Identities = 54/212 (25%), Positives = 83/212 (39%), Gaps = 43/212 (20%)
Query: 38 LHALKYEITSAEGKQSIREKCDEYLARAEKLKEYLKKGKKQPVKDGESRTKDDKKESDED 97
L L+ E + E +Q +EK KL + + K K+ + + E K+ + E DED
Sbjct: 410 LAQLEIEKEALEREQDEKEK---------KLIDEIIKLKEGRIPELE---KELEAEVDED 457
Query: 98 DSED--------PDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQ 149
D + P K ++ E ++ V G + A EA+ +A+
Sbjct: 458 DIAEVVARWTGIPVAKLLED--EKEKLLNLERRLKKRVIGQDEAVEAVSDAIRR------ 509
Query: 150 LFTGKRVPWK--GILLF-GPPGTGKSYLAKAVATEANNST--FFSVSSSDL-----VSKW 199
G P + G LF GP G GK+ LAKA+A + S+ VS+
Sbjct: 510 ARAGLGDPNRPIGSFLFLGPTGVGKTELAKALAEALFGDEQALIRIDMSEYMEKHSVSRL 569
Query: 200 LGESEKLV-----KTLFELARAHRPSIIFIDE 226
+G V L E R S+I +DE
Sbjct: 570 IGAPPGYVGYEEGGQLTEAVRRKPYSVILLDE 601
>gnl|CDD|239140 cd02677, MIT_SNX15, MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This MIT domain
sub-family is found in sorting nexin 15 and related
proteins. The molecular function of the MIT domain is
unclear.
Length = 75
Score = 32.7 bits (75), Expect = 0.035
Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 13 KATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKL 68
A E ++ +YE A + Y V L ++ + +S E +++++ K EYL RAE++
Sbjct: 12 LALEKEEEGDYEAAFEFYRAGVDLLLKGVQGD-SSPERREAVKRKIAEYLKRAEEI 66
>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II
[Transcription / DNA replication, recombination, and
repair].
Length = 442
Score = 35.1 bits (81), Expect = 0.049
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 166 PPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEK 205
P G GK+ +A E ST V + +L+ +W +K
Sbjct: 63 PTGAGKTVVAAEAIAELKRSTLVLVPTKELLDQWAEALKK 102
>gnl|CDD|234203 TIGR03420, DnaA_homol_Hda, DnaA regulatory inactivator Hda.
Members of this protein family are Hda (Homologous to
DnaA). These proteins are about half the length of DnaA
and homologous over length of Hda. In the model species
Escherichia coli, the initiation of DNA replication
requires DnaA bound to ATP rather than ADP; Hda helps
facilitate the conversion of DnaA-ATP to DnaA-ADP [DNA
metabolism, DNA replication, recombination, and repair].
Length = 226
Score = 34.1 bits (79), Expect = 0.055
Identities = 30/129 (23%), Positives = 49/129 (37%), Gaps = 25/129 (19%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPS 220
+ L+G G+GKS+L +A A S+ + L E + + E +
Sbjct: 41 LYLWGESGSGKSHLLQAACAAAEERG----KSAIYLP--LAELAQADPEVLEGLE--QAD 92
Query: 221 IIFID--ERID--RGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRRRFEKRIYI 276
++ +D E I ALF L + V + G L++ P L +
Sbjct: 93 LVCLDDVEAIAGQPEWQEALFHLYNRV-REAGGRLLIAGRAAPAQLP------------L 139
Query: 277 ALPELNARL 285
LP+L RL
Sbjct: 140 RLPDLRTRL 148
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 34.9 bits (81), Expect = 0.060
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 8/58 (13%)
Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEAN 183
DV G E AKE L+E + + K P K +LL+GPPG GK+ LA A+A +
Sbjct: 15 DVVGNEKAKEQLREWIE---SWL-----KGKPKKALLLYGPPGVGKTSLAHALANDYG 64
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 34.8 bits (81), Expect = 0.070
Identities = 26/67 (38%), Positives = 33/67 (49%), Gaps = 15/67 (22%)
Query: 129 GLEAAKEALKE--AVILPIKFPQLFTGKRVPWKG--ILLFGPPGTGKSYLAKAVATEANN 184
GLE KE + E AV Q T K KG + L GPPG GK+ L K++A +A
Sbjct: 327 GLEKVKERILEYLAV-------QKLTKKL---KGPILCLVGPPGVGKTSLGKSIA-KALG 375
Query: 185 STFFSVS 191
F +S
Sbjct: 376 RKFVRIS 382
>gnl|CDD|237354 PRK13341, PRK13341, recombination factor protein RarA/unknown
domain fusion protein; Reviewed.
Length = 725
Score = 34.6 bits (80), Expect = 0.071
Identities = 34/97 (35%), Positives = 47/97 (48%), Gaps = 10/97 (10%)
Query: 162 LLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFE-LARAHRPS 220
+L+GPPG GK+ LA+ +A N T SS + V + + V E L R + +
Sbjct: 56 ILYGPPGVGKTTLARIIA----NHTRAHFSSLNAVLAGVKDLRAEVDRAKERLERHGKRT 111
Query: 221 IIFIDERIDRGISNALFLLLSGVGNDMDGILVL-GAT 256
I+FIDE + R LL V N G + L GAT
Sbjct: 112 ILFIDE-VHRFNKAQQDALLPWVEN---GTITLIGAT 144
>gnl|CDD|180607 PRK06526, PRK06526, transposase; Provisional.
Length = 254
Score = 34.1 bits (78), Expect = 0.072
Identities = 23/89 (25%), Positives = 45/89 (50%), Gaps = 9/89 (10%)
Query: 161 ILLFGPPGTGKSYLAKAV---ATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAH 217
++ GPPGTGK++LA + A +A + F+ +++ V++ L + + EL +
Sbjct: 101 VVFLGPPGTGKTHLAIGLGIRACQAGHRVLFA-TAAQWVAR-LAAAHHAGRLQAELVKLG 158
Query: 218 RPSIIFIDE----RIDRGISNALFLLLSG 242
R ++ +DE + +N F L+S
Sbjct: 159 RYPLLIVDEVGYIPFEPEAANLFFQLVSS 187
>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the AAA
superfamily.
Length = 154
Score = 33.3 bits (76), Expect = 0.076
Identities = 27/122 (22%), Positives = 42/122 (34%), Gaps = 37/122 (30%)
Query: 161 ILLFGPPGTGKSYLAKAVAT------------EANNSTFFSVSSSDLVSKWLGESEK--- 205
+LL GP GTGK+ L + + E N FS + +L+ + L E
Sbjct: 27 VLLTGPSGTGKTSLLRELLEGLLVAAGKCDQAERNPPYAFSQALRELLRQLLRELAAELL 86
Query: 206 --------------------LVKTLFELARAHRPSIIFIDE--RIDRGISNALFLLLSGV 243
LV+ L L RP ++ +D+ D + L LL +
Sbjct: 87 LLREALLAALGAELIEGLQDLVELLERLLARARPLVLVLDDLQWADEESLDLLAALLRRL 146
Query: 244 GN 245
Sbjct: 147 ER 148
>gnl|CDD|216270 pfam01057, Parvo_NS1, Parvovirus non-structural protein NS1. This
family also contains the NS2 protein. Parvoviruses
encode two non-structural proteins, NS1 and NS2. The
mRNA for NS2 contains the coding sequence for the first
87 amino acids of NS1, then by an alternative splicing
mechanism mRNA from a different reading frame, encoding
the last 78 amino acids, makes up the full length of the
NS2 mRNA. NS1, is the major non-structural protein. It
is essential for DNA replication. It is an 83-kDa
nuclear phosphoprotein. It has DNA helicase and ATPase
activity.
Length = 271
Score = 34.2 bits (79), Expect = 0.077
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 161 ILLFGPPGTGKSYLAKAVAT 180
+ +GP TGK+ LA+A+A
Sbjct: 116 VWFYGPASTGKTNLAQAIAH 135
>gnl|CDD|224160 COG1239, ChlI, Mg-chelatase subunit ChlI [Coenzyme metabolism].
Length = 423
Score = 34.3 bits (79), Expect = 0.087
Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 8/46 (17%)
Query: 135 EALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVAT 180
+ LK A+ L PQ+ G L+ G GT KS LA+A+A
Sbjct: 23 DPLKLALGLNAVDPQI--------GGALIAGEKGTAKSTLARALAD 60
>gnl|CDD|239143 cd02680, MIT_calpain7_2, MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This sub-family
of MIT domains is found in the nuclear thiol protease
PalBH. The molecular function of the MIT domain is
unclear.
Length = 75
Score = 31.5 bits (71), Expect = 0.10
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 13 KATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKEYL 72
+A + D+ N EEA++LYT +V+ ++ E + +++ + L RAE LKE +
Sbjct: 12 QAFDEDEKGNAEEAIELYTEAVELCINT-SNETMDQALQTKLKQLARQALDRAEALKESM 70
Query: 73 KK 74
K
Sbjct: 71 SK 72
>gnl|CDD|238026 cd00071, GMPK, Guanosine monophosphate kinase (GMPK, EC 2.7.4.8),
also known as guanylate kinase (GKase), catalyzes the
reversible phosphoryl transfer from adenosine
triphosphate (ATP) to guanosine monophosphate (GMP) to
yield adenosine diphosphate (ADP) and guanosine
diphosphate (GDP). It plays an essential role in the
biosynthesis of guanosine triphosphate (GTP). This
enzyme is also important for the activation of some
antiviral and anticancer agents, such as acyclovir,
ganciclovir, carbovir, and thiopurines.
Length = 137
Score = 32.5 bits (75), Expect = 0.10
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVS 191
I+L GP G GKS L K + E + + FSVS
Sbjct: 2 IVLSGPSGVGKSTLLKRLLEEFDPNFGFSVS 32
>gnl|CDD|129721 TIGR00635, ruvB, Holliday junction DNA helicase, RuvB subunit. All
proteins in this family for which functions are known
are 5'-3' DNA helicases that, as part of a complex with
RuvA homologs serve as a 5'-3' Holliday junction
helicase. RuvA specifically binds Holliday junctions as
a sandwich of two tetramers and maintains the
configuration of the junction. It forms a complex with
two hexameric rings of RuvB, the subunit that contains
helicase activity. The complex drives ATP-dependent
branch migration of the Holliday junction recombination
intermediate. The endonuclease RuvC resolves junctions
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 305
Score = 33.8 bits (78), Expect = 0.11
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
+LL+GPPG GK+ LA +A E
Sbjct: 33 LLLYGPPGLGKTTLAHIIANEMG 55
>gnl|CDD|181681 PRK09183, PRK09183, transposase/IS protein; Provisional.
Length = 259
Score = 33.5 bits (77), Expect = 0.11
Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 20/87 (22%)
Query: 161 ILLFGPPGTGKSYLAKAV---ATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAH 217
I+L GP G GK++LA A+ A A F +++DL L +L+ A
Sbjct: 105 IVLLGPSGVGKTHLAIALGYEAVRAGIKVRF-TTAADL--------------LLQLSTAQ 149
Query: 218 RPSIIFIDERIDRGISNALFLLLSGVG 244
R + RG+ L++ +G
Sbjct: 150 RQG--RYKTTLQRGVMAPRLLIIDEIG 174
>gnl|CDD|235422 PRK05342, clpX, ATP-dependent protease ATP-binding subunit ClpX;
Provisional.
Length = 412
Score = 33.6 bits (78), Expect = 0.11
Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 38/90 (42%)
Query: 161 ILLFGPPGTGKSYLAK----------AVA-----TEANNSTFFSVSSSDLVSKWLGES-E 204
ILL GP G+GK+ LA+ A+A TEA ++GE E
Sbjct: 111 ILLIGPTGSGKTLLAQTLARILDVPFAIADATTLTEA---------------GYVGEDVE 155
Query: 205 K-LVKTL----FELARAHRPSIIFIDERID 229
L+K L +++ +A R I++IDE ID
Sbjct: 156 NILLKLLQAADYDVEKAQR-GIVYIDE-ID 183
>gnl|CDD|184311 PRK13764, PRK13764, ATPase; Provisional.
Length = 602
Score = 33.7 bits (78), Expect = 0.13
Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 18/51 (35%)
Query: 160 GILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTL 210
GIL+ G PG GKS A+A+A + + K+VKT+
Sbjct: 259 GILIAGAPGAGKSTFAQALAE------------------FYADMGKIVKTM 291
>gnl|CDD|190347 pfam02562, PhoH, PhoH-like protein. PhoH is a cytoplasmic protein
and predicted ATPase that is induced by phosphate
starvation.
Length = 205
Score = 32.8 bits (76), Expect = 0.14
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 161 ILLFGPPGTGKSYLAKAVATEA 182
+ GP GTGK+YLA A A +A
Sbjct: 22 VFGIGPAGTGKTYLAVAAAVDA 43
>gnl|CDD|237090 PRK12402, PRK12402, replication factor C small subunit 2; Reviewed.
Length = 337
Score = 33.4 bits (77), Expect = 0.14
Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 18/72 (25%)
Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEA--- 182
D+ G + E L AV P P L L+ GPPG+GK+ +A+A E
Sbjct: 16 DILGQDEVVERLSRAVDSPN-LPHL-----------LVQGPPGSGKTAAVRALARELYGD 63
Query: 183 ---NNSTFFSVS 191
NN T F+V+
Sbjct: 64 PWENNFTEFNVA 75
>gnl|CDD|221955 pfam13173, AAA_14, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily.
Length = 127
Score = 31.8 bits (73), Expect = 0.15
Identities = 14/66 (21%), Positives = 25/66 (37%), Gaps = 2/66 (3%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPS 220
I++ GP GK+ L E + ++ D L L++ +L
Sbjct: 5 IVITGPRQVGKTTLLLQFLKELLSENILYINLDDPRFLRL-ALLDLLEEFLKLLLP-GKK 62
Query: 221 IIFIDE 226
+F+DE
Sbjct: 63 YLFLDE 68
>gnl|CDD|203260 pfam05496, RuvB_N, Holliday junction DNA helicase ruvB N-terminus.
The RuvB protein makes up part of the RuvABC revolvasome
which catalyzes the resolution of Holliday junctions
that arise during genetic recombination and DNA repair.
Branch migration is catalyzed by the RuvB protein that
is targeted to the Holliday junction by the structure
specific RuvA protein. This family contains the
N-terminal region of the protein.
Length = 231
Score = 32.4 bits (75), Expect = 0.24
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 161 ILLFGPPGTGKSYLAKAVATE 181
+LL+GPPG GK+ LA +A E
Sbjct: 53 VLLYGPPGLGKTTLANIIANE 73
>gnl|CDD|222866 PHA02544, 44, clamp loader, small subunit; Provisional.
Length = 316
Score = 32.7 bits (75), Expect = 0.24
Identities = 22/61 (36%), Positives = 30/61 (49%), Gaps = 8/61 (13%)
Query: 139 EAVILPIKFPQLFTG----KRVPWKGILLFGP-PGTGKSYLAKAVATEANNSTFFSVSSS 193
+ ILP + F R+P +LL P PGTGK+ +AKA+ E F V+ S
Sbjct: 21 DECILPAADKETFKSIVKKGRIP--NMLLHSPSPGTGKTTVAKALCNEVGAEVLF-VNGS 77
Query: 194 D 194
D
Sbjct: 78 D 78
>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
Members of this protein family resemble the widely
distributed ATP-dependent protease La, also called Lon
and LonA. It resembles even more closely LonB, which is
a LonA paralog found in genomes if and only if the
species is capable of endospore formation (as in
Bacillus subtilis, Clostridium tetani, and select other
members of the Firmicutes) and expressed specifically in
the forespore compartment. Members of this family are
restricted to a subset of spore-forming species, and are
very likely to participate in the program of endospore
formation. We propose the designation LonC [Protein
fate, Degradation of proteins, peptides, and
glycopeptides, Cellular processes, Sporulation and
germination].
Length = 615
Score = 32.9 bits (75), Expect = 0.28
Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 22/126 (17%)
Query: 66 EKLKEYLKKGKKQPVKD-GESRTKDDKKESDEDDSEDPDKKKMQANLEGAIVMEKPNVKW 124
E+ +YL++ + Q +K+ K+ + E +KKK+ + + + +P +
Sbjct: 100 ERQNKYLEEIRLQVLKEEKGPENSSTLKKLER--LEKLEKKKLHKSAQSLL---RPR-AF 153
Query: 125 TDVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEA-- 182
+++ G E A +AL V P FPQ I+L+GPPG GK+ A+ EA
Sbjct: 154 SEIVGQERAIKALLAKVASP--FPQ----------HIILYGPPGVGKTTAARLALEEAKK 201
Query: 183 -NNSTF 187
++ F
Sbjct: 202 LKHTPF 207
>gnl|CDD|222307 pfam13671, AAA_33, AAA domain. This family of domains contain only
a P-loop motif, that is characteristic of the AAA
superfamily. Many of the proteins in this family are
just short fragments so there is no Walker B motif.
Length = 143
Score = 31.5 bits (72), Expect = 0.30
Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 20/78 (25%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLV------------SKWLGESEKLVK 208
IL+ G PG+GKS A+ + E V SSD + S + S ++ +
Sbjct: 2 ILMVGLPGSGKSTFARRLLRELG----AVVLSSDTLRKRLRGDGPPDISYYARASGRVYQ 57
Query: 209 TLFELA----RAHRPSII 222
L ELA RA RP I+
Sbjct: 58 RLLELAREALRAGRPVIV 75
>gnl|CDD|236353 PRK08939, PRK08939, primosomal protein DnaI; Reviewed.
Length = 306
Score = 32.1 bits (74), Expect = 0.32
Identities = 45/182 (24%), Positives = 73/182 (40%), Gaps = 30/182 (16%)
Query: 15 TEADKNKNYEEALK-LYTH-SVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKE-- 70
D K YE+ + + VQ FL + E+T +S+ K EY+ + +K
Sbjct: 13 KRQDFRKRYEKLKQEVLADPDVQAFLQEHQEELTQEMIDRSL-SKLYEYIDQRKKCDRGC 71
Query: 71 --YLKKGKKQPVKDGES------RTKDDKKESDEDDSEDPDKKKMQANLEGAIVMEKP-- 120
Y K + + V +G T + + D++K +I M K
Sbjct: 72 PNYAKGYQPKLVMNGGYIDVSYLPTPEKIEA---------DEEKAIKKRIQSIYMPKDLL 122
Query: 121 NVKWTDVAGLEAA-KEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVA 179
D+ + +AL A+ +P G++V KG+ L+G G GKSYL A+A
Sbjct: 123 QASLADIDLDDRDRLDALMAALDFLEAYPP---GEKV--KGLYLYGDFGVGKSYLLAAIA 177
Query: 180 TE 181
E
Sbjct: 178 NE 179
>gnl|CDD|215947 pfam00493, MCM, MCM2/3/5 family.
Length = 327
Score = 32.1 bits (74), Expect = 0.33
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 14/68 (20%)
Query: 135 EALKEAVILPIKFPQLF--TGKRVPWKG-------ILLFGPPGTGKSYLAKAVATEANNS 185
E +K+A++L QLF K +P +LL G PGT KS L K VA A +
Sbjct: 30 EDVKKAILL-----QLFGGVKKNLPDGTRLRGDINVLLVGDPGTAKSQLLKYVAKLAPRA 84
Query: 186 TFFSVSSS 193
+ S S
Sbjct: 85 VYTSGKGS 92
>gnl|CDD|224140 COG1219, ClpX, ATP-dependent protease Clp, ATPase subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 408
Score = 32.2 bits (74), Expect = 0.33
Identities = 28/91 (30%), Positives = 40/91 (43%), Gaps = 38/91 (41%)
Query: 161 ILLFGPPGTGKSYLAKAVA---------------TEANNSTFFSVSSSDLVSKWLGES-- 203
ILL GP G+GK+ LA+ +A TEA ++GE
Sbjct: 100 ILLIGPTGSGKTLLAQTLAKILNVPFAIADATTLTEAG---------------YVGEDVE 144
Query: 204 ---EKLV-KTLFELARAHRPSIIFIDERIDR 230
KL+ +++ RA R II+IDE ID+
Sbjct: 145 NILLKLLQAADYDVERAER-GIIYIDE-IDK 173
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 32.5 bits (74), Expect = 0.34
Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 4/91 (4%)
Query: 23 YEEALKL--YTHSVQYFLHAL--KYEITSAEGKQSIREKCDEYLARAEKLKEYLKKGKKQ 78
++EA+ + L AL YE + S ++ + +KL++Y KK +
Sbjct: 329 FKEAINIARNLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEV 388
Query: 79 PVKDGESRTKDDKKESDEDDSEDPDKKKMQA 109
+ S + K + ++ E+ +KK+ +
Sbjct: 389 KDETDASEEAEAKAKEEKLKQEENEKKQKEQ 419
>gnl|CDD|224768 COG1855, COG1855, ATPase (PilT family) [General function prediction
only].
Length = 604
Score = 32.3 bits (74), Expect = 0.34
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 159 KGILLFGPPGTGKSYLAKAVAT 180
+GIL+ G PG GKS A+A+A
Sbjct: 264 EGILIAGAPGAGKSTFAQALAE 285
>gnl|CDD|233120 TIGR00764, lon_rel, lon-related putative ATP-dependent protease.
This model represents a set of proteins with extensive
C-terminal homology to the ATP-dependent protease La,
product of the lon gene of E. coli. The model is based
on a seed alignment containing only archaeal members,
but several bacterial proteins match the model well.
Because several species, including Thermotoga maritima
and Treponema pallidum, contain both a close homolog of
the lon protease and nearly full-length homolog of the
members of this family, we suggest there may also be a
functional division between the two families. Members of
this family from Pyrococcus horikoshii and Pyrococcus
abyssi each contain a predicted intein [Protein fate,
Degradation of proteins, peptides, and glycopeptides].
Length = 608
Score = 32.5 bits (74), Expect = 0.35
Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 14/53 (26%)
Query: 127 VAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVA 179
V G E A E +K+A + +LL G PG GKS LAKA+A
Sbjct: 20 VIGQEEAVEIIKKAA--------------KQKRNVLLIGEPGVGKSMLAKAMA 58
>gnl|CDD|232949 TIGR00382, clpX, endopeptidase Clp ATP-binding regulatory subunit
(clpX). A member of the ATP-dependent proteases, ClpX
has ATP-dependent chaperone activity and is required for
specific ATP-dependent proteolytic activities expressed
by ClpPX. The gene is also found to be involved in
stress tolerance in Bacillus subtilis and is essential
for the efficient acquisition of genes specifying type
IA and IB restriction [Protein fate, Protein folding and
stabilization, Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 413
Score = 32.0 bits (73), Expect = 0.39
Identities = 25/76 (32%), Positives = 43/76 (56%), Gaps = 8/76 (10%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGES-EKLVKTLFELA----- 214
ILL GP G+GK+ LA+ +A N + +++ + ++GE E ++ L + A
Sbjct: 119 ILLIGPTGSGKTLLAQTLARILNVPFAIADATTLTEAGYVGEDVENILLKLLQAADYDVE 178
Query: 215 RAHRPSIIFIDERIDR 230
+A + II+IDE ID+
Sbjct: 179 KAQK-GIIYIDE-IDK 192
>gnl|CDD|235878 PRK06851, PRK06851, hypothetical protein; Provisional.
Length = 367
Score = 31.9 bits (73), Expect = 0.39
Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 3/37 (8%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTF---FSVSSSD 194
+L G PGTGKS L K + E + F SSD
Sbjct: 33 FILKGGPGTGKSTLMKKIGEEFLEKGYDVEFLHCSSD 69
>gnl|CDD|224142 COG1221, PspF, Transcriptional regulators containing an AAA-type
ATPase domain and a DNA-binding domain [Transcription /
Signal transduction mechanisms].
Length = 403
Score = 32.0 bits (73), Expect = 0.41
Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 12/83 (14%)
Query: 161 ILLFGPPGTGKSYLAKAV---ATEANNSTFFSVSSSDLVSKWLGESEKL--VKTLFELAR 215
+L+ G GTGK A+ + + + F + + + S+ L E+E K F A+
Sbjct: 104 VLIIGETGTGKELFARLIHALSARRAEAPFIAFNCAAY-SENLQEAELFGHEKGAFTGAQ 162
Query: 216 AHRPSIIFIDERIDRGISNALFL 238
+ + E+ + G LFL
Sbjct: 163 GGKAGLF---EQANGGT---LFL 179
>gnl|CDD|239144 cd02681, MIT_calpain7_1, MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This sub-family
of MIT domains is found in the nuclear thiol protease
PalBH. The molecular function of the MIT domain is
unclear.
Length = 76
Score = 29.5 bits (66), Expect = 0.42
Identities = 17/63 (26%), Positives = 33/63 (52%)
Query: 14 ATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKEYLK 73
A + D+ Y EA+ Y + Q ++A + ++I+EK +EYL RA+ L + ++
Sbjct: 13 AVQRDQEGRYSEAVFYYKEAAQLLIYAEMAGTLNDSHLKTIQEKSNEYLDRAQALHQLVQ 72
Query: 74 KGK 76
+
Sbjct: 73 GQQ 75
>gnl|CDD|219014 pfam06414, Zeta_toxin, Zeta toxin. This family consists of several
bacterial zeta toxin proteins. Zeta toxin is thought to
be part of a postregulational killing system in
bacteria. It relies on antitoxin/toxin systems that
secure stable inheritance of low and medium copy number
plasmids during cell division and kill cells that have
lost the plasmid.
Length = 191
Score = 31.5 bits (72), Expect = 0.45
Identities = 19/86 (22%), Positives = 29/86 (33%), Gaps = 24/86 (27%)
Query: 159 KGILLFGPPGTGKSYLAKAVATEANNS-----------TFF----------SVSSSDLVS 197
+LL G PG GK+ LA+A+ E T+ +S+L
Sbjct: 13 VAVLLGGQPGAGKTELARALLEELGGGNVVRIDPDELRTYHPDYDELQKADPKDASELTQ 72
Query: 198 KWLGESEKLVKTLFELARAHRPSIIF 223
V+ L + A +II
Sbjct: 73 PDASR---WVEKLIDYAIERGYNIIL 95
>gnl|CDD|224616 COG1702, PhoH, Phosphate starvation-inducible protein PhoH,
predicted ATPase [Signal transduction mechanisms].
Length = 348
Score = 31.9 bits (73), Expect = 0.45
Identities = 12/18 (66%), Positives = 14/18 (77%)
Query: 165 GPPGTGKSYLAKAVATEA 182
GP GTGK+YLA A A +A
Sbjct: 150 GPAGTGKTYLAVAKAVDA 167
>gnl|CDD|218685 pfam05673, DUF815, Protein of unknown function (DUF815). This
family consists of several bacterial proteins of unknown
function.
Length = 248
Score = 31.7 bits (73), Expect = 0.46
Identities = 30/106 (28%), Positives = 39/106 (36%), Gaps = 18/106 (16%)
Query: 122 VKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATE 181
V D+ G++ KEAL Q G P +LL+G GTGKS L KA+ E
Sbjct: 25 VDLDDLVGIDRQKEALLRNTE------QFLAGL--PANNVLLWGARGTGKSSLVKALLNE 76
Query: 182 ANNST--FFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFID 225
+ V DL L + L I+F D
Sbjct: 77 YADQGLRLIEVDKDDLGD--------LPDIVDLLRGRPYRFILFCD 114
>gnl|CDD|183521 PRK12422, PRK12422, chromosomal replication initiation protein;
Provisional.
Length = 445
Score = 31.7 bits (72), Expect = 0.48
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 5/42 (11%)
Query: 143 LPIKFPQLFTG-----KRVPWKGILLFGPPGTGKSYLAKAVA 179
LP + Q FT K P+ I LFGP G+GK++L +A
Sbjct: 121 LPHRILQEFTKVSEQGKGFPFNPIYLFGPEGSGKTHLMQAAV 162
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 31.8 bits (73), Expect = 0.50
Identities = 29/128 (22%), Positives = 39/128 (30%), Gaps = 20/128 (15%)
Query: 34 VQYFLHALKYEI--TSAEGKQSIREKC----------DEYLARAEKLK---EYLKKGKKQ 78
VQY+ A K EI E K++ K E AR + K E K
Sbjct: 428 VQYYRQA-KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKD 486
Query: 79 PVKDGESRTKDDKKESDE----DDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAK 134
V +R K K + + PD + A E + A K
Sbjct: 487 AVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPK 546
Query: 135 EALKEAVI 142
+A A I
Sbjct: 547 KAAVAAAI 554
>gnl|CDD|223666 COG0593, DnaA, ATPase involved in DNA replication initiation [DNA
replication, recombination, and repair].
Length = 408
Score = 31.5 bits (72), Expect = 0.62
Identities = 11/42 (26%), Positives = 22/42 (52%), Gaps = 4/42 (9%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN----NSTFFSVSSSDLVSK 198
+ ++G G GK++L +A+ EA N+ ++S D +
Sbjct: 116 LFIYGGVGLGKTHLLQAIGNEALANGPNARVVYLTSEDFTND 157
>gnl|CDD|239145 cd02682, MIT_AAA_Arch, MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This sub-family
of MIT domains is found in mostly archaebacterial
AAA-ATPases. The molecular function of the MIT domain
is unclear.
Length = 75
Score = 29.0 bits (65), Expect = 0.65
Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Query: 11 AIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKE 70
AI A +A+K N E+A+ Y +++ +K S + + +EY R E L++
Sbjct: 10 AINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDS-PTRLIYEQMINEYKRRIEVLEK 68
>gnl|CDD|222194 pfam13521, AAA_28, AAA domain.
Length = 162
Score = 30.6 bits (70), Expect = 0.65
Identities = 9/19 (47%), Positives = 14/19 (73%)
Query: 161 ILLFGPPGTGKSYLAKAVA 179
I++ G P TGK+ L +A+A
Sbjct: 2 IVITGGPSTGKTTLLEALA 20
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 30.4 bits (69), Expect = 0.71
Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 1/56 (1%)
Query: 60 EYLARAEKLKEYLKKGKK-QPVKDGESRTKDDKKESDEDDSEDPDKKKMQANLEGA 114
EY + + + K KK KD + KDDK E ++ + + + +
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132
Score = 29.7 bits (67), Expect = 1.6
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 1/61 (1%)
Query: 53 SIREKCDEYLARAEKL-KEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPDKKKMQANL 111
++K E EK+ KEY +K K + K + KD K+ +D +D +KK +
Sbjct: 59 EAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Query: 112 E 112
E
Sbjct: 119 E 119
>gnl|CDD|222163 pfam13479, AAA_24, AAA domain. This AAA domain is found in a wide
variety of presumed phage proteins.
Length = 201
Score = 30.7 bits (70), Expect = 0.72
Identities = 11/20 (55%), Positives = 15/20 (75%), Gaps = 1/20 (5%)
Query: 157 PWKGILLFGPPGTGKSYLAK 176
K +L++GPPG GK+ LAK
Sbjct: 3 NIK-VLIYGPPGIGKTSLAK 21
>gnl|CDD|213788 TIGR03263, guanyl_kin, guanylate kinase. Members of this family
are the enzyme guanylate kinase, also called GMP kinase.
This enzyme transfers a phosphate from ATP to GMP,
yielding ADP and GDP [Purines, pyrimidines, nucleosides,
and nucleotides, Nucleotide and nucleoside
interconversions].
Length = 179
Score = 30.5 bits (70), Expect = 0.74
Identities = 14/32 (43%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSS 192
I++ GP G GKS L KA+ E + + FS+S+
Sbjct: 3 IVISGPSGAGKSTLVKAL-LEEDPNLKFSISA 33
>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
Length = 329
Score = 31.0 bits (71), Expect = 0.74
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 159 KGILLFGPPGTGKSYLAKAVATE 181
+ +L +G GTGK++L+ +A E
Sbjct: 184 ENLLFYGNTGTGKTFLSNCIAKE 206
>gnl|CDD|222000 pfam13238, AAA_18, AAA domain.
Length = 128
Score = 29.7 bits (67), Expect = 0.84
Identities = 17/52 (32%), Positives = 24/52 (46%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFE 212
IL+ G PG+GK+ LAK +A + + LV + E K L E
Sbjct: 1 ILITGTPGSGKTTLAKELAERLGDVLRDLAKENGLVLELDEEITDESKRLDE 52
>gnl|CDD|129465 TIGR00368, TIGR00368, Mg chelatase-related protein. The N-terminal
end matches very strongly a pfam Mg_chelatase domain
[Unknown function, General].
Length = 499
Score = 31.0 bits (70), Expect = 0.85
Identities = 23/73 (31%), Positives = 31/73 (42%), Gaps = 19/73 (26%)
Query: 104 KKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILL 163
K K N I ++ D+ G + AK AL+ A G +LL
Sbjct: 176 KPKSIINKSYIIDLD-----LKDIKGQQHAKRALEIAA----------AGGH----NLLL 216
Query: 164 FGPPGTGKSYLAK 176
FGPPG+GK+ LA
Sbjct: 217 FGPPGSGKTMLAS 229
>gnl|CDD|222165 pfam13481, AAA_25, AAA domain. This AAA domain is found in a wide
variety of presumed DNA repair proteins.
Length = 154
Score = 29.9 bits (68), Expect = 0.86
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 161 ILLFGPPGTGKSYLAKAVA 179
LL G PGTGKS LA +A
Sbjct: 36 TLLAGAPGTGKSTLALDLA 54
>gnl|CDD|236828 PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit;
Provisional.
Length = 758
Score = 31.0 bits (70), Expect = 0.94
Identities = 23/76 (30%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 162 LLFGPPGTGKSYLAKAVATE---------ANNSTFFSVSSSDLV--SKWLGESEKLVKTL 210
LL G G GK+ +A+ +A + T +S+ L+ +K+ G+ EK K L
Sbjct: 211 LLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKAL 270
Query: 211 FELARAHRPSIIFIDE 226
+ SI+FIDE
Sbjct: 271 LKQLEQDTNSILFIDE 286
>gnl|CDD|234763 PRK00440, rfc, replication factor C small subunit; Reviewed.
Length = 319
Score = 30.6 bits (70), Expect = 1.0
Identities = 22/57 (38%), Positives = 27/57 (47%), Gaps = 14/57 (24%)
Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLF-GPPGTGKSYLAKAVATE 181
++ G E E LK V K +P LLF GPPGTGK+ A A+A E
Sbjct: 18 EIVGQEEIVERLKSYV----------KEKNMPH---LLFAGPPGTGKTTAALALARE 61
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 31.2 bits (71), Expect = 1.0
Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 51 KQSIREKCDEYLARAEKLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPDKKKMQAN 110
+SI ++ + + E+L+E L++ + ++D ESR D KKE DE +++ + ++
Sbjct: 849 IKSIEKEIENLNGKKEELEEELEELEAA-LRDLESRLGDLKKERDELEAQLRELERKIEE 907
Query: 111 LEGAIVMEKPNVKWTDVAGLEAAKEALKE 139
LE I ++ + LEA +E L E
Sbjct: 908 LEAQIEKKRKRLSELKAK-LEALEEELSE 935
>gnl|CDD|233369 TIGR01351, adk, adenylate kinase. Adenylate kinase (EC 2.7.4.3)
converts ATP + AMP to ADP + ADP, that is, uses ATP as a
phosphate donor for AMP. Most members of this family are
known or believed to be adenylate kinase. However, some
members accept other nucleotide triphosphates as donors,
may be unable to use ATP, and may fail to complement
adenylate kinase mutants. An example of a
nucleoside-triphosphate--adenylate kinase (EC 2.7.4.10)
is a GTP:AMP phosphotransferase. This family is
designated subfamily rather than equivalog for this
reason [Purines, pyrimidines, nucleosides, and
nucleotides, Nucleotide and nucleoside
interconversions].
Length = 210
Score = 30.3 bits (69), Expect = 1.0
Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 3/35 (8%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDL 195
++L GPPG+GK AK +A + +S+ DL
Sbjct: 2 LVLLGPPGSGKGTQAKRIAEKYG---LPHISTGDL 33
>gnl|CDD|233959 TIGR02639, ClpA, ATP-dependent Clp protease ATP-binding subunit
clpA. [Protein fate, Degradation of proteins, peptides,
and glycopeptides].
Length = 730
Score = 30.8 bits (70), Expect = 1.1
Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 11/76 (14%)
Query: 162 LLFGPPGTGKSYLA---------KAVATEANNSTFFSVSSSDLV--SKWLGESEKLVKTL 210
LL G PG GK+ + V N+ +S+ L+ +K+ G+ E+ +K +
Sbjct: 206 LLVGEPGVGKTAIVEGLALRIAEGKVPERLKNAKIYSLDMGTLLAGTKYRGDFEERLKAV 265
Query: 211 FELARAHRPSIIFIDE 226
+I+FIDE
Sbjct: 266 VSEIEKEPNAILFIDE 281
>gnl|CDD|201353 pfam00625, Guanylate_kin, Guanylate kinase.
Length = 183
Score = 30.0 bits (68), Expect = 1.1
Identities = 14/33 (42%), Positives = 20/33 (60%)
Query: 159 KGILLFGPPGTGKSYLAKAVATEANNSTFFSVS 191
+ I+L GP G GKS++ KA+ E +SVS
Sbjct: 3 RPIVLSGPSGVGKSHIKKALLDEYPEKFGYSVS 35
>gnl|CDD|238540 cd01120, RecA-like_NTPases, RecA-like NTPases. This family includes
the NTP binding domain of F1 and V1 H+ATPases, DnaB and
related helicases as well as bacterial RecA and related
eukaryotic and archaeal recombinases. This group also
includes bacterial conjugation proteins and related DNA
transfer proteins involved in type II and type IV
secretion.
Length = 165
Score = 29.8 bits (67), Expect = 1.3
Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 8/69 (11%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTF------FSVSSSDLVSKWLGESEK--LVKTLFE 212
IL+FGP G+GK+ LA +A +L + +GES K L +
Sbjct: 2 ILVFGPTGSGKTTLALQLALNIATKGGKVVYVDIEEEIEELTERLIGESLKGALDNLIIV 61
Query: 213 LARAHRPSI 221
A A P+
Sbjct: 62 FATADDPAA 70
>gnl|CDD|216506 pfam01443, Viral_helicase1, Viral (Superfamily 1) RNA helicase.
Helicase activity for this family has been demonstrated
and NTPase activity. This helicase has multiple roles at
different stages of viral RNA replication, as dissected
by mutational analysis.
Length = 226
Score = 30.0 bits (68), Expect = 1.3
Identities = 15/70 (21%), Positives = 27/70 (38%), Gaps = 7/70 (10%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKL----VKTLFELARA 216
I++ G PG GKS L + + +++L ++ + L V T
Sbjct: 1 IVVHGVPGCGKSTLIRKLLRTDLTV---IRPTAELRTEGKPDLPNLNVRTVDTFLMALLK 57
Query: 217 HRPSIIFIDE 226
I+ +DE
Sbjct: 58 PTGKILILDE 67
>gnl|CDD|214361 CHL00095, clpC, Clp protease ATP binding subunit.
Length = 821
Score = 30.8 bits (70), Expect = 1.3
Identities = 48/193 (24%), Positives = 77/193 (39%), Gaps = 54/193 (27%)
Query: 161 ILLFGPPGTGKSYLAKAVA---------TEANNSTFFSVSSSDLV--SKWLGESEKLVKT 209
+L G PG GK+ +A+ +A + ++ L+ +K+ GE E+ +K
Sbjct: 203 PILIGEPGVGKTAIAEGLAQRIVNRDVPDILEDKLVITLDIGLLLAGTKYRGEFEERLKR 262
Query: 210 LFELARAHRPSIIFIDERIDRGISNALFLLLSGVGN-----DMDGIL----------VLG 254
+F+ + + I+ IDE L G G D IL +G
Sbjct: 263 IFDEIQENNNIILVIDE----------VHTLIGAGAAEGAIDAANILKPALARGELQCIG 312
Query: 255 ATNIPWV-----LDAAIRRRFEKRIYIALP---ELNARLFMFR-----HHLGNTRNTISD 301
AT + D A+ RRF+ +Y+ P E LF R HH +ISD
Sbjct: 313 ATTLDEYRKHIEKDPALERRFQP-VYVGEPSVEETIEILFGLRSRYEKHH----NLSISD 367
Query: 302 ENLKELAKRTEGY 314
+ L+ AK ++ Y
Sbjct: 368 KALEAAAKLSDQY 380
>gnl|CDD|187579 cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha
subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical
(a) SDRs. This subgroup of extended SDR-like proteins
are atypical SDRs. They have a glycine-rich
NAD(P)-binding motif similar to the typical SDRs,
GXXGXXG, and have the YXXXK active site motif (though
not the other residues of the SDR tetrad). Members
identified include NDUFA9 (mitochondrial) and putative
nucleoside-diphosphate-sugar epimerase. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Atypical SDRs include
biliverdin IX beta reductase (BVR-B,aka flavin
reductase), NMRa (a negative transcriptional regulator
of various fungi), progesterone 5-beta-reductase like
proteins, phenylcoumaran benzylic ether and
pinoresinol-lariciresinol reductases, phenylpropene
synthases, eugenol synthase, triphenylmethane reductase,
isoflavone reductases, and others. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold, an NAD(P)(H)-binding region, and a structurally
diverse C-terminal region. Sequence identity between
different SDR enzymes is typically in the 15-30% range;
they catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. In addition to the
Rossmann fold core region typical of all SDRs, extended
SDRs have a less conserved C-terminal extension of
approximately 100 amino acids, and typically have a
TGXXGXXG cofactor binding motif. Complex (multidomain)
SDRs such as ketoreductase domains of fatty acid
synthase have a GGXGXXG NAD(P)-binding motif and an
altered active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 273
Score = 30.3 bits (69), Expect = 1.4
Identities = 23/69 (33%), Positives = 30/69 (43%), Gaps = 14/69 (20%)
Query: 174 LAKAVATEANNSTFFSVS---------SSDLVSKWLGESEKLVKTLFELARAHRPSIIFI 224
LAKA EA +S S L SK E E+ V+ F A RPS++F
Sbjct: 98 LAKAAK-EAGVERLIHISALGADANSPSKYLRSK--AEGEEAVREAFPEATIVRPSVVFG 154
Query: 225 DERIDRGIS 233
E DR ++
Sbjct: 155 RE--DRFLN 161
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 30.2 bits (69), Expect = 1.4
Identities = 10/23 (43%), Positives = 15/23 (65%), Gaps = 1/23 (4%)
Query: 154 KRVPWKG-ILLFGPPGTGKSYLA 175
K +P K ++++GPP TGKS
Sbjct: 429 KGIPKKNCLVIYGPPDTGKSMFC 451
>gnl|CDD|183114 PRK11388, PRK11388, DNA-binding transcriptional regulator DhaR;
Provisional.
Length = 638
Score = 30.4 bits (69), Expect = 1.4
Identities = 47/203 (23%), Positives = 80/203 (39%), Gaps = 56/203 (27%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNST--FFSV-----SSSDLVSKWLG----ESEKLVKT 209
+LL G G GK+ LA+A+ E+ + + +V L ++LG +SE +
Sbjct: 351 VLLCGEEGVGKALLAQAIHNESERAAGPYIAVNCQLYPDEALAEEFLGSDRTDSENGRLS 410
Query: 210 LFELARAHRPSIIFID--ERIDRGISNALFLLL-SGVGNDMDG-------ILVLGATNIP 259
FELA +F++ E + + +AL +L +GV +D + V+ T
Sbjct: 411 KFELAHG---GTLFLEKVEYLSPELQSALLQVLKTGVITRLDSRRLIPVDVRVIATTTAD 467
Query: 260 WVLDAAI-RRRFEKRIY-------IALPELNARL----FMFRHHLG------NTRNTISD 301
L + + RF +++Y I +P L R + + L +TR I D
Sbjct: 468 --LAMLVEQNRFSRQLYYALHAFEITIPPLRMRREDIPALVNNKLRSLEKRFSTRLKIDD 525
Query: 302 E------------NLKELAKRTE 312
+ N EL E
Sbjct: 526 DALARLVSYRWPGNDFELRSVIE 548
>gnl|CDD|236442 PRK09270, PRK09270, nucleoside triphosphate hydrolase
domain-containing protein; Reviewed.
Length = 229
Score = 29.9 bits (68), Expect = 1.6
Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 12/56 (21%)
Query: 131 EAAKEALKEAVILPIKFPQLFTGKRVPWKGILLF--GPPGTGKSYLAKAVATEANN 184
K L+ L Q +R ++ GPPG GKS LA+ +
Sbjct: 14 AVHKPLLRRLAAL-----QAEPQRR-----TIVGIAGPPGAGKSTLAEFLEALLQQ 59
>gnl|CDD|148114 pfam06309, Torsin, Torsin. This family consists of several
eukaryotic torsin proteins. Torsion dystonia is an
autosomal dominant movement disorder characterized by
involuntary, repetitive muscle contractions and twisted
postures. The most severe early-onset form of dystonia
has been linked to mutations in the human DYT1 (TOR1A)
gene encoding a protein termed torsinA. While causative
genetic alterations have been identified, the function
of torsin proteins and the molecular mechanism
underlying dystonia remain unknown. Phylogenetic
analysis of the torsin protein family indicates these
proteins share distant sequence similarity with the
large and diverse family of (pfam00004) proteins. It has
been suggested that torsins play a role in effectively
managing protein folding and that possible breakdown in
a neuroprotective mechanism that is, in part, mediated
by torsins may be responsible for the neuronal
dysfunction associated with dystonia.
Length = 127
Score = 28.9 bits (65), Expect = 1.6
Identities = 9/37 (24%), Positives = 19/37 (51%), Gaps = 4/37 (10%)
Query: 165 GPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLG 201
G GTGK+++A+ +A ++ + S V ++
Sbjct: 60 GWTGTGKNFVAEIIA----DNLYRDGLRSPYVHHFVA 92
>gnl|CDD|131948 TIGR02902, spore_lonB, ATP-dependent protease LonB. Members of
this protein are LonB, a paralog of the ATP-dependent
protease La (LonA, TIGR00763). LonB proteins are found
strictly, and almost universally, in endospore-forming
bacteria. This protease was shown, in Bacillus subtilis,
to be expressed specifically in the forespore, during
sporulation, under control of sigma(F). The lonB gene,
despite location immediately upstream of lonA, was shown
to be monocistronic. LonB appears able to act on
sigma(H) for post-translation control, but lonB mutation
did not produce an obvious sporulation defect under the
conditions tested. Note that additional paralogs of LonA
and LonB occur in the Clostridium lineage and this model
selects only one per species as the protein that
corresponds to LonB in B. subtilis [Protein fate,
Degradation of proteins, peptides, and glycopeptides,
Cellular processes, Sporulation and germination].
Length = 531
Score = 30.1 bits (68), Expect = 1.6
Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 12/62 (19%)
Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEANNS 185
++ G E +ALK A+ P PQ ++++GPPG GK+ A+ V EA +
Sbjct: 66 EIIGQEEGIKALKAALCGP--NPQ----------HVIIYGPPGVGKTAAARLVLEEAKKN 113
Query: 186 TF 187
Sbjct: 114 PA 115
>gnl|CDD|172992 PRK14526, PRK14526, adenylate kinase; Provisional.
Length = 211
Score = 29.8 bits (67), Expect = 1.6
Identities = 13/36 (36%), Positives = 23/36 (63%), Gaps = 3/36 (8%)
Query: 160 GILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDL 195
++ GPPG+GK +AK ++ E N ++ +S+ DL
Sbjct: 2 KLVFLGPPGSGKGTIAKILSNELN---YYHISTGDL 34
>gnl|CDD|215651 pfam00006, ATP-synt_ab, ATP synthase alpha/beta family,
nucleotide-binding domain. This family includes the ATP
synthase alpha and beta subunits, the ATP synthase
associated with flagella and the termination factor Rho.
Length = 213
Score = 29.7 bits (68), Expect = 1.7
Identities = 9/19 (47%), Positives = 12/19 (63%)
Query: 161 ILLFGPPGTGKSYLAKAVA 179
I +FG GTGK+ L +A
Sbjct: 18 IGIFGGSGTGKTVLLGMIA 36
>gnl|CDD|214631 smart00350, MCM, minichromosome maintenance proteins.
Length = 509
Score = 29.9 bits (68), Expect = 2.0
Identities = 21/53 (39%), Positives = 26/53 (49%), Gaps = 14/53 (26%)
Query: 135 EALKEAVILPIKFPQLFTG--KRVPWKG-------ILLFGPPGTGKSYLAKAV 178
E +K+A++L LF G K +P ILL G PGT KS L K V
Sbjct: 209 EDIKKAILL-----LLFGGVHKNLPDGMKIRGDINILLCGDPGTAKSQLLKYV 256
>gnl|CDD|238979 cd02021, GntK, Gluconate kinase (GntK) catalyzes the phosphoryl
transfer from ATP to gluconate. The resulting product
gluconate-6-phoshate is an important precursor of
gluconate metabolism. GntK acts as a dimmer composed of
two identical subunits.
Length = 150
Score = 29.1 bits (66), Expect = 2.0
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
I++ G G+GKS + KA+A
Sbjct: 2 IVVMGVSGSGKSTVGKALAERLG 24
>gnl|CDD|165222 PHA02896, PHA02896, A-type inclusion like protein; Provisional.
Length = 616
Score = 30.0 bits (67), Expect = 2.0
Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 10/69 (14%)
Query: 52 QSIREKCDEYLARAEKLKEY--------LKKGKKQPVKDGESRTKDDKKESDEDDSEDPD 103
+SIR + + YL + E Y +K ++P D + +DK +SD+D D D
Sbjct: 498 KSIRNRRNGYLLKDEDRYYYDRGHRNLGAEKDPEKPHPDKDEEYPEDKDKSDDDRLNDAD 557
Query: 104 K--KKMQAN 110
K +K++A+
Sbjct: 558 KAMRKLEAD 566
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.1 bits (67), Expect = 2.1
Identities = 25/100 (25%), Positives = 43/100 (43%), Gaps = 4/100 (4%)
Query: 13 KATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLK--E 70
A EA+ + EA + + A K + + ++K +E +A++LK
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414
Query: 71 YLKKGKKQPVKDGESRTKDD--KKESDEDDSEDPDKKKMQ 108
KK + K E + K D KK+++E D KKK +
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454
Score = 29.3 bits (65), Expect = 3.3
Identities = 24/95 (25%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 14 ATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKEYLK 73
A +A+++K E K + ALK E A+ + +++K E +AE+LK+ +
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA-E 1725
Query: 74 KGKKQPVKDGESRTKDDKKESDEDDSEDPDKKKMQ 108
+ K ++ + ++DKK+++E ++ +KKK+
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIA 1760
>gnl|CDD|226997 COG4650, RtcR, Sigma54-dependent transcription regulator containing
an AAA-type ATPase domain and a DNA-binding domain
[Transcription / Signal transduction mechanisms].
Length = 531
Score = 29.9 bits (67), Expect = 2.3
Identities = 11/18 (61%), Positives = 14/18 (77%)
Query: 161 ILLFGPPGTGKSYLAKAV 178
ILL GP G GKS+LA+ +
Sbjct: 211 ILLNGPTGAGKSFLARRI 228
>gnl|CDD|180215 PRK05707, PRK05707, DNA polymerase III subunit delta'; Validated.
Length = 328
Score = 29.7 bits (67), Expect = 2.3
Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 149 QLFTGKRVPWKGILLFGPPGTGKSYLAKAVA 179
QL R P LL GP G GK LA+ +A
Sbjct: 14 QLAGRGRHP-HAYLLHGPAGIGKRALAERLA 43
>gnl|CDD|116340 pfam07726, AAA_3, ATPase family associated with various cellular
activities (AAA). This Pfam entry includes some of the
AAA proteins not detected by the pfam00004 model.
Length = 131
Score = 28.7 bits (65), Expect = 2.4
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 161 ILLFGPPGTGKSYLAKAVA 179
+LL G PG K+ LA+ +A
Sbjct: 2 VLLEGVPGLAKTLLARTLA 20
>gnl|CDD|223272 COG0194, Gmk, Guanylate kinase [Nucleotide transport and
metabolism].
Length = 191
Score = 29.0 bits (66), Expect = 2.4
Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 2/32 (6%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSS 192
I+L GP G GKS L KA+ ++ FSVS+
Sbjct: 7 IVLSGPSGVGKSTLVKAL--LEDDKLRFSVSA 36
>gnl|CDD|184150 PRK13569, PRK13569, anthranilate synthase component I; Provisional.
Length = 506
Score = 29.6 bits (67), Expect = 2.5
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 11/71 (15%)
Query: 15 TEADKNKNYEEA---LKLYTHS-------VQYFLHALKYEITSAEGKQSIREKCDEYLAR 64
TE +K + Y+EA ++ + L A S EG S EK +++L
Sbjct: 178 TEEEKIEKYKEAQAEIETLIEKLARRKAEKELLLPADSERTVSFEGVTSNYEK-EQFLRD 236
Query: 65 AEKLKEYLKKG 75
EK+KEY+K G
Sbjct: 237 VEKIKEYIKAG 247
>gnl|CDD|233443 TIGR01509, HAD-SF-IA-v3, haloacid dehalogenase superfamily,
subfamily IA, variant 3 with third motif having DD or
ED. This model represents part of one structural
subfamily of the Haloacid Dehalogenase (HAD) superfamily
of aspartate-nucleophile hydrolases. The superfamily is
defined by the presence of three short catalytic motifs.
The subfamilies are defined based on the location and
the observed or predicted fold of a so-called "capping
domain", or the absence of such a domain. Subfamily I
consists of sequences in which the capping domain is
found in between the first and second catalytic motifs.
Subfamily II consists of sequences in which the capping
domain is found between the second and third motifs.
Subfamily III sequences have no capping domain in either
of these positions.The Subfamily IA and IB capping
domains are predicted by PSI-PRED to consist of an alpha
helical bundle. Subfamily I encompasses such a wide
region of sequence space (the sequences are highly
divergent) that representing it with a single model is
impossible, resulting in an overly broad description
which allows in many unrelated sequences. Subfamily IA
and IB are separated based on an aparrent phylogenetic
bifurcation. Subfamily IA is still too broad to model,
but cannot be further subdivided into large chunks based
on phylogenetic trees. Of the three motifs defining the
HAD superfamily, the third has three variant forms : (1)
hhhhsDxxx(x)D, (2) hhhhssxxx(x)D and (3) hhhhDDxxx(x)s
where _s_ refers to a small amino acid and _h_ to a
hydrophobic one. All three of these variants are found
in subfamily IA. Individual models were made based on
seeds exhibiting only one of the variants each. Variant
3 (this model) is found in the enzymes
beta-phosphoglucomutase (TIGR01990) and
deoxyglucose-6-phosphatase, while many other enzymes of
subfamily IA exhibit this variant as well as variant 1
(TIGR01549). These three variant models were created
withthe knowledge that there will be overlap among them
- this is by design and serves the purpose of
eliminating the overlap with models of more distantly
relatedHAD subfamilies caused by an overly broad single
model [Unknown function, Enzymes of unknown
specificity].
Length = 177
Score = 28.9 bits (65), Expect = 2.6
Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 8/68 (11%)
Query: 246 DMDGILVLGATNIPWVLDAAIRRRFEKRIYI---ALPELNARLFMFRHHLGNTRNTISDE 302
D+DG+LV A I ++D A + + EL R + ++ T+S E
Sbjct: 5 DLDGVLVDTAHAIAKLVDRAEFPLVPDELGVSEVGSLELALRRWKAKYGR-----TMSAE 59
Query: 303 NLKELAKR 310
+ + L K+
Sbjct: 60 DAQLLYKQ 67
>gnl|CDD|223718 COG0645, COG0645, Predicted kinase [General function prediction
only].
Length = 170
Score = 28.9 bits (65), Expect = 2.6
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 159 KGILLFGPPGTGKSYLAKAVATE 181
+ +L+ G PG+GKS LA+ +A
Sbjct: 2 RLVLVGGLPGSGKSTLARGLAEL 24
>gnl|CDD|218292 pfam04851, ResIII, Type III restriction enzyme, res subunit.
Length = 100
Score = 28.0 bits (63), Expect = 2.8
Identities = 12/46 (26%), Positives = 18/46 (39%), Gaps = 3/46 (6%)
Query: 158 WKGILLFGPPGTGKSYLAKAVATEANN---STFFSVSSSDLVSKWL 200
K L+ G+GK+ A A+ F V DL+ + L
Sbjct: 18 KKRGLIVMATGSGKTLTAAALIARLAKGKKKVLFVVPRKDLLEQAL 63
>gnl|CDD|223401 COG0324, MiaA, tRNA delta(2)-isopentenylpyrophosphate transferase
[Translation, ribosomal structure and biogenesis].
Length = 308
Score = 29.1 bits (66), Expect = 2.9
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
I++ GP +GK+ LA A+A
Sbjct: 6 IVIAGPTASGKTALAIALAKRLG 28
>gnl|CDD|238713 cd01428, ADK, Adenylate kinase (ADK) catalyzes the reversible
phosphoryl transfer from adenosine triphosphates (ATP)
to adenosine monophosphates (AMP) and to yield adenosine
diphosphates (ADP). This enzyme is required for the
biosynthesis of ADP and is essential for homeostasis of
adenosine phosphates.
Length = 194
Score = 28.7 bits (65), Expect = 2.9
Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 3/36 (8%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLV 196
ILL GPPG+GK A+ +A + +S+ DL+
Sbjct: 2 ILLLGPPGSGKGTQAERLAKKYG---LPHISTGDLL 34
>gnl|CDD|215967 pfam00519, PPV_E1_C, Papillomavirus helicase. This protein is a
DNA helicase that is required for initiation of viral
DNA replication. This protein forms a complex with the
E2 protein pfam00508.
Length = 432
Score = 29.4 bits (67), Expect = 3.1
Identities = 7/15 (46%), Positives = 11/15 (73%)
Query: 161 ILLFGPPGTGKSYLA 175
++++GPP TGKS
Sbjct: 265 LVIYGPPDTGKSMFC 279
>gnl|CDD|215758 pfam00158, Sigma54_activat, Sigma-54 interaction domain.
Length = 168
Score = 28.5 bits (65), Expect = 3.4
Identities = 8/18 (44%), Positives = 12/18 (66%)
Query: 161 ILLFGPPGTGKSYLAKAV 178
+L+ G GTGK A+A+
Sbjct: 25 VLITGESGTGKELFARAI 42
>gnl|CDD|237464 PRK13670, PRK13670, hypothetical protein; Provisional.
Length = 388
Score = 29.0 bits (66), Expect = 3.5
Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 17/83 (20%)
Query: 7 GINIAIKATEADKNKNYEEALKL-----YTHS------VQYFLHALKYEITSAEGKQSIR 55
G+ IK K +++E ++L YT++ L+A K ++ IR
Sbjct: 268 GLENRIKK-AIKKASSFDEFVELVKTKRYTYTRIQRLLTYILLNATKEDVEKTSLPPYIR 326
Query: 56 EKCDEYLARAEKLKEYLKKGKKQ 78
L EK ++YLKK KK+
Sbjct: 327 -----VLGFTEKGQQYLKKIKKK 344
>gnl|CDD|234063 TIGR02928, TIGR02928, orc1/cdc6 family replication initiation
protein. Members of this protein family are found
exclusively in the archaea. This set of DNA binding
proteins shows homology to the origin recognition
complex subunit 1/cell division control protein 6 family
in eukaryotes. Several members may be found in genome
and interact with each other [DNA metabolism, DNA
replication, recombination, and repair].
Length = 365
Score = 29.1 bits (66), Expect = 3.6
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNST 186
+ ++G GTGK+ + K V E +
Sbjct: 43 VFIYGKTGTGKTAVTKYVMKELEEAA 68
>gnl|CDD|182120 PRK09862, PRK09862, putative ATP-dependent protease; Provisional.
Length = 506
Score = 29.2 bits (65), Expect = 3.7
Identities = 11/15 (73%), Positives = 13/15 (86%)
Query: 161 ILLFGPPGTGKSYLA 175
+LL GPPGTGK+ LA
Sbjct: 213 LLLIGPPGTGKTMLA 227
>gnl|CDD|215300 PLN02545, PLN02545, 3-hydroxybutyryl-CoA dehydrogenase.
Length = 295
Score = 28.9 bits (65), Expect = 3.8
Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 6/59 (10%)
Query: 171 KSYLAKAVATEA-----NNSTFFSVSSSDLVSKWLGESEKLVKTLF-ELARAHRPSIIF 223
K +++ A + + +D + + + ESE L K LF EL R +PS I
Sbjct: 57 KGKMSQEEADATLGRIRCTTNLEELRDADFIIEAIVESEDLKKKLFSELDRICKPSAIL 115
>gnl|CDD|183088 PRK11331, PRK11331, 5-methylcytosine-specific restriction enzyme
subunit McrB; Provisional.
Length = 459
Score = 28.9 bits (64), Expect = 3.8
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 159 KGILLFGPPGTGKSYLAKAVA 179
K I+L GPPG GK+++A+ +A
Sbjct: 195 KNIILQGPPGVGKTFVARRLA 215
>gnl|CDD|202726 pfam03691, UPF0167, Uncharacterized protein family (UPF0167). The
proteins in this family are about 200 amino acids long
and each contain 3 CXXC motifs.
Length = 177
Score = 28.5 bits (64), Expect = 3.9
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 298 TISDENLKELAKRTEGYSG 316
+ +E +KE+ +RT GYS
Sbjct: 79 GVDEELVKEVLERTPGYSS 97
>gnl|CDD|219468 pfam07565, Band_3_cyto, Band 3 cytoplasmic domain. This family
contains the cytoplasmic domain of the Band 3 anion
exchange proteins that exchange Cl-/HCO3-. Band 3
constitutes the most abundant polypeptide in the red
blood cell membrane, comprising 25% of the total
membrane protein. The cytoplasmic domain of band 3
functions primarily as an anchoring site for other
membrane-associated proteins. Included among the protein
ligands of cdb3 are ankyrin, protein 4.2, protein 4.1,
glyceraldehyde-3-phosphate dehydrogenase (GAPDH),
phosphofructokinase, aldolase, hemoglobin, hemichromes,
and the protein tyrosine kinase (p72syk).
Length = 250
Score = 28.7 bits (65), Expect = 4.0
Identities = 23/107 (21%), Positives = 40/107 (37%), Gaps = 29/107 (27%)
Query: 86 RTKDDKKESDEDDSEDPDKKKMQANLEGAIV-------MEKPN---VKWTDVAGLEAAKE 135
+K+ E + V +E+P V+ ++ LE+ E
Sbjct: 119 LNSQSGSLPSMSKFNSTLMRKIPPGAEASNVLVGEVDFLEQPVLAFVRLSEAVVLESVLE 178
Query: 136 ALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSY--LAKAVAT 180
V +P++F LF +L GP G+G Y + +A+AT
Sbjct: 179 -----VPVPVRF--LF----------VLLGPSGSGLDYHEIGRAIAT 208
>gnl|CDD|221803 pfam12846, AAA_10, AAA-like domain. This family of domains contain
a P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins.
Length = 316
Score = 28.9 bits (65), Expect = 4.1
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 161 ILLFGPPGTGKSYLAKAVA 179
+L+ GP G+GKS L K +A
Sbjct: 4 MLIVGPSGSGKSTLLKLLA 22
>gnl|CDD|206680 cd01893, Miro1, Mitochondrial Rho family 1 (Miro1), N-terminal.
Miro1 subfamily. Miro (mitochondrial Rho) proteins have
tandem GTP-binding domains separated by a linker region
containing putative calcium-binding EF hand motifs.
Genes encoding Miro-like proteins were found in several
eukaryotic organisms. This CD represents the N-terminal
GTPase domain of Miro proteins. These atypical Rho
GTPases have roles in mitochondrial homeostasis and
apoptosis. Most Rho proteins contain a lipid
modification site at the C-terminus; however, Miro is
one of few Rho subfamilies that lack this feature.
Length = 168
Score = 28.1 bits (63), Expect = 4.4
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
I+L G G GKS L ++ +E
Sbjct: 5 IVLIGDEGVGKSSLIMSLVSEEF 27
>gnl|CDD|218719 pfam05729, NACHT, NACHT domain. This NTPase domain is found in
apoptosis proteins as well as those involved in MHC
transcription activation. This family is closely related
to pfam00931.
Length = 165
Score = 28.0 bits (63), Expect = 4.8
Identities = 16/88 (18%), Positives = 27/88 (30%), Gaps = 14/88 (15%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNST---------FFSVSSSDLVSKWLGESEKLVKTLF 211
++L G G+GK+ L + +A F + ++ L
Sbjct: 3 VILQGEAGSGKTTLLQKIALLWAQGKLPQDFDFVFFLPCRELSRSGE-ASLADLLFSQWP 61
Query: 212 ELARAHRPSIIFIDERIDRGISNALFLL 239
E A I E +R L +L
Sbjct: 62 EPAAPVSEVWAVILELPER----LLLIL 85
>gnl|CDD|235825 PRK06547, PRK06547, hypothetical protein; Provisional.
Length = 172
Score = 28.2 bits (63), Expect = 4.8
Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 6/61 (9%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLG---ESEKLVKTLFELARAH 217
+L+ G G+GK+ LA A+A F V DL W G SE + + + + R
Sbjct: 18 VLIDGRSGSGKTTLAGALAART---GFQLVHLDDLYPGWHGLAAASEHVAEAVLDEGRPG 74
Query: 218 R 218
R
Sbjct: 75 R 75
>gnl|CDD|222258 pfam13604, AAA_30, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. There is a Walker A and
Walker B.
Length = 195
Score = 28.3 bits (64), Expect = 4.8
Identities = 8/19 (42%), Positives = 11/19 (57%)
Query: 161 ILLFGPPGTGKSYLAKAVA 179
++ GP GTGK+ KA
Sbjct: 21 AVVQGPAGTGKTTSLKAAR 39
>gnl|CDD|213965 TIGR04352, HprK_rel_A, HprK-related kinase A. A number of protein
families resemble HPr kinase (see TIGR00679) but do not
belong to that system. They include this family, which
appears instead to be the marker for a different type of
gene neighborhood, in which one of the conserved
neighboring proteins resembles (but is distinct from)
PqqD.
Length = 280
Score = 28.4 bits (64), Expect = 4.9
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 159 KGILLFGPPGTGKSYLAKAVA 179
+ ++L PPG+GKS L A+A
Sbjct: 109 RALILPAPPGSGKSTLCAALA 129
>gnl|CDD|238978 cd02020, CMPK, Cytidine monophosphate kinase (CMPK) catalyzes the
reversible phosphorylation of cytidine monophosphate
(CMP) to produce cytidine diphosphate (CDP), using ATP
as the preferred phosphoryl donor.
Length = 147
Score = 27.8 bits (63), Expect = 5.0
Identities = 9/15 (60%), Positives = 12/15 (80%)
Query: 165 GPPGTGKSYLAKAVA 179
GP G+GKS +AK +A
Sbjct: 6 GPAGSGKSTVAKLLA 20
>gnl|CDD|214432 MTH00144, ND2, NADH dehydrogenase subunit 2; Provisional.
Length = 328
Score = 28.3 bits (64), Expect = 5.0
Identities = 7/17 (41%), Positives = 10/17 (58%)
Query: 185 STFFSVSSSDLVSKWLG 201
T S+SSS + W+G
Sbjct: 16 GTLLSLSSSHWLGVWVG 32
>gnl|CDD|224292 COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General
function prediction only].
Length = 398
Score = 28.5 bits (64), Expect = 5.1
Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 7/68 (10%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRP- 219
IL+ GP GK+ L K + + ++ DL + +L+ L
Sbjct: 40 ILILGPRQVGKTTLLKLLIKGLLEEIIY-INFDDLRLDRI----ELLDLLRAYIELKERE 94
Query: 220 -SIIFIDE 226
S IF+DE
Sbjct: 95 KSYIFLDE 102
>gnl|CDD|197284 cd09188, PLDc_FAM83H_N, N-terminal phospholipase D-like domain of
the uncharacterized protein, Family with sequence
similarity 83H. N-terminal phospholipase D (PLD)-like
domain of the protein, Family with sequence similarity
83H (FAM83H) on chromosome 8q24.3, which localizes in
the intracellular environment and is associated with
vesicles, can be regulated by kinases, and plays
important roles during ameloblast differentiation and
enamel matrix calcification. The gene encoding protein
FAM83H is the first gene involved in the etiology of
amelogenesis imperfecta (AI), that encodes a
non-secreted protein due to the absence of a signal
peptide. Defects in gene FAM83H cause autosomal dominant
hypocalcified amelogenesis imperfecta (ADHCAI). Since
the N-terminal PLD-like domain of FAM83H shows only
trace similarity to the PLD catalytic domain and lacks
the functionally important histidine residue, FAM83H may
share a similar three-dimensional fold with PLD enzymes,
but is most unlikely to carry PLD activity.
Length = 265
Score = 28.3 bits (63), Expect = 5.2
Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 12/81 (14%)
Query: 207 VKTLFELARAHRPSIIFIDERIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAI 266
V TL + PSI E R I +A + + MD + + +L+AA
Sbjct: 106 VTTLVQPPPPDNPSI---KEEARRMIRSAQQV----IAVVMD--IFTDVDILSELLEAAA 156
Query: 267 RRRFEKRIYIALPELNARLFM 287
RR +YI L E+NA+LF+
Sbjct: 157 RR---VPVYILLDEMNAQLFL 174
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 28.1 bits (63), Expect = 5.2
Identities = 15/42 (35%), Positives = 20/42 (47%)
Query: 65 AEKLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPDKKK 106
AE +E K+ KK+ E + K DKKE + KKK
Sbjct: 147 AEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
>gnl|CDD|218528 pfam05272, VirE, Virulence-associated protein E. This family
contains several bacterial virulence-associated protein
E like proteins. These proteins contain a P-loop motif.
Length = 198
Score = 28.1 bits (63), Expect = 5.3
Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 4/50 (8%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSK----WLGESEKL 206
++L G G+GKS K + E + S D K W+ E +L
Sbjct: 55 LILQGAQGSGKSTFLKKLGGEWFTDSIRSFEGKDAYEKLQGVWIVEIAEL 104
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 28.7 bits (64), Expect = 5.3
Identities = 19/101 (18%), Positives = 34/101 (33%), Gaps = 14/101 (13%)
Query: 56 EKCDEYLARAEKLKEYLKKGK------------KQPVKDGESRTKDDKKESDEDDSEDPD 103
+K D+ A+AE+ +E L++ +Q K + K E E
Sbjct: 1166 DKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYG 1225
Query: 104 KKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILP 144
M+ +V KP + AA + +E +
Sbjct: 1226 SSAMETENVAEVV--KPKGRAGAKKKAPAAAKEKEEEDEIL 1264
>gnl|CDD|238899 cd01918, HprK_C, HprK/P, the bifunctional histidine-containing
protein kinase/phosphatase, controls the phosphorylation
state of the phosphocarrier protein HPr and regulates
the utilization of carbon sources by gram-positive
bacteria. It catalyzes both the ATP-dependent
phosphorylation of Ser-46 of HPr and its
dephosphorylation by phosphorolysis. The latter reaction
uses inorganic phosphate as substrate and produces
pyrophosphate. Phosphoenolpyruvate carboxykinase (PEPCK)
and the C-terminal catalytic domain of HprK/P are
structurally similar with conserved active site residues
suggesting these two phosphotransferases have related
functions. The HprK/P N-terminal domain is structurally
similar to the N-terminal domains of the MurE and MurF
amino acid ligases.
Length = 149
Score = 27.6 bits (62), Expect = 5.3
Identities = 10/17 (58%), Positives = 12/17 (70%)
Query: 159 KGILLFGPPGTGKSYLA 175
G+L+ GP G GKS LA
Sbjct: 15 IGVLITGPSGIGKSELA 31
>gnl|CDD|222074 pfam13362, Toprim_3, Toprim domain. The toprim domain is found in
a wide variety of enzymes involved in nucleic acid
manipulation.
Length = 95
Score = 26.9 bits (60), Expect = 5.5
Identities = 15/48 (31%), Positives = 18/48 (37%), Gaps = 6/48 (12%)
Query: 231 GISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRRRFEKRIYIAL 278
GI AL L + VLGA N+ V R R+ IA
Sbjct: 7 GIETAL--SLRRALPGAPVVAVLGANNLKAVAPPERGR----RVIIAA 48
>gnl|CDD|224391 COG1474, CDC6, Cdc6-related protein, AAA superfamily ATPase [DNA
replication, recombination, and repair /
Posttranslational modification, protein turnover,
chaperones].
Length = 366
Score = 28.5 bits (64), Expect = 5.6
Identities = 26/132 (19%), Positives = 50/132 (37%), Gaps = 25/132 (18%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNS---------------TFFSVSSSDLVSKWLGESE- 204
I+++GP GTGK+ K V E S T + V S +++K
Sbjct: 45 IIIYGPTGTGKTATVKFVMEELEESSANVEVVYINCLELRTPYQVLSK-ILNKLGKVPLT 103
Query: 205 -----KLVKTLFELARAHRPSIIFIDERIDRGISNALFLL--LSGVGNDMD-GILVLGAT 256
+++K L++ ++I I + +D + +L L + + ++ +
Sbjct: 104 GDSSLEILKRLYDNLSKKGKTVIVILDEVDALVDKDGEVLYSLLRAPGENKVKVSIIAVS 163
Query: 257 NIPWVLDAAIRR 268
N LD R
Sbjct: 164 NDDKFLDYLDPR 175
>gnl|CDD|225713 COG3172, NadR, Predicted ATPase/kinase involved in NAD metabolism
[Coenzyme metabolism].
Length = 187
Score = 27.8 bits (62), Expect = 5.6
Identities = 18/78 (23%), Positives = 30/78 (38%), Gaps = 12/78 (15%)
Query: 159 KGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLF------- 211
K + + G +GKS L +A N+T + V + LG E L + +
Sbjct: 9 KTVAILGGESSGKSTLVNKLA-NIFNTTSAWEYGREYVFEHLGGDEALQYSDYAKIALGQ 67
Query: 212 ----ELARAHRPSIIFID 225
+ A + + FID
Sbjct: 68 AAYEDAAVRYANKVAFID 85
>gnl|CDD|107157 PHA02244, PHA02244, ATPase-like protein.
Length = 383
Score = 28.5 bits (63), Expect = 5.7
Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 4/88 (4%)
Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSD--LVSKWLGESEKLVKTLFELARAHR 218
+ L G G+GK+++A+ +A + +F + D + ++ + K +T F A +
Sbjct: 122 VFLKGGAGSGKNHIAEQIAEALDLDFYFMNAIMDEFELKGFIDANGKFHETPFYEAFK-K 180
Query: 219 PSIIFIDERIDRGISNALFLLLSGVGND 246
+ FIDE ID I AL ++ S + N
Sbjct: 181 GGLFFIDE-IDASIPEALIIINSAIANK 207
>gnl|CDD|183452 PRK12337, PRK12337, 2-phosphoglycerate kinase; Provisional.
Length = 475
Score = 28.2 bits (63), Expect = 6.1
Identities = 10/19 (52%), Positives = 13/19 (68%)
Query: 161 ILLFGPPGTGKSYLAKAVA 179
+L+ G G GKS LA A+A
Sbjct: 258 VLIGGVSGVGKSVLASALA 276
>gnl|CDD|226126 COG3598, RepA, RecA-family ATPase [DNA replication, recombination,
and repair].
Length = 402
Score = 28.3 bits (63), Expect = 6.2
Identities = 34/134 (25%), Positives = 53/134 (39%), Gaps = 15/134 (11%)
Query: 148 PQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLV 207
PQL +L+G G GK+ L + + +L + E K++
Sbjct: 79 PQLIDEFFRKGYVSILYGDSGVGKTTLLLYLCIAL-------AAGKNLFGNKVKEPGKVL 131
Query: 208 KTLFELARAHRPSIIFIDERIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIR 267
EL +R I+ ER+ + + L + V N MD V GA + VL +
Sbjct: 132 YVSLEL---YREDIL---ERL-EPVRARMGLSPADVRN-MDLTDVSGAADESDVLSPKLY 183
Query: 268 RRFEKRIYIALPEL 281
RRFEK + P+
Sbjct: 184 RRFEKILEQKRPDF 197
>gnl|CDD|179615 PRK03634, PRK03634, rhamnulose-1-phosphate aldolase; Provisional.
Length = 274
Score = 28.0 bits (63), Expect = 6.4
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 283 ARLFMFRHHLGNTRNTISDENLKELAKR 310
A +++ +G + TI+DE L L +R
Sbjct: 235 AEIYVKVLSMGGMKQTITDEELIALGER 262
>gnl|CDD|236134 PRK07940, PRK07940, DNA polymerase III subunit delta'; Validated.
Length = 394
Score = 28.3 bits (64), Expect = 6.5
Identities = 18/61 (29%), Positives = 23/61 (37%), Gaps = 10/61 (16%)
Query: 124 WTDVAGLEAAKEALKEAVILPIKFPQLFTGKRVP----WKGILLFGPPGTGKSYLAKAVA 179
W D+ G EA L+ A W L GPPG+G+S A+A A
Sbjct: 4 WDDLVGQEAVVAELRAAARAA---RADVAAAGSGMTHAW---LFTGPPGSGRSVAARAFA 57
Query: 180 T 180
Sbjct: 58 A 58
>gnl|CDD|221912 pfam13086, AAA_11, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins.
Length = 220
Score = 27.7 bits (62), Expect = 6.6
Identities = 8/11 (72%), Positives = 10/11 (90%)
Query: 162 LLFGPPGTGKS 172
L+ GPPGTGK+
Sbjct: 21 LIQGPPGTGKT 31
>gnl|CDD|238260 cd00464, SK, Shikimate kinase (SK) is the fifth enzyme in the
shikimate pathway, a seven-step biosynthetic pathway
which converts erythrose-4-phosphate to chorismic acid,
found in bacteria, fungi and plants. Chorismic acid is a
important intermediate in the synthesis of aromatic
compounds, such as aromatic amino acids, p-aminobenzoic
acid, folate and ubiquinone. Shikimate kinase catalyses
the phosphorylation of the 3-hydroxyl group of shikimic
acid using ATP.
Length = 154
Score = 27.5 bits (62), Expect = 6.6
Identities = 7/19 (36%), Positives = 12/19 (63%)
Query: 161 ILLFGPPGTGKSYLAKAVA 179
I+L G G GK+ + + +A
Sbjct: 2 IVLIGMMGAGKTTVGRLLA 20
>gnl|CDD|236347 PRK08903, PRK08903, DnaA regulatory inactivator Hda; Validated.
Length = 227
Score = 28.0 bits (63), Expect = 6.7
Identities = 24/109 (22%), Positives = 41/109 (37%), Gaps = 22/109 (20%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN----NSTFFSVSSSDLVSKWLGESEKLVKTLFELARA 216
L+G G+G+S+L +A+ +A+ N+ + +S L
Sbjct: 45 FYLWGEAGSGRSHLLQALVADASYGGRNARYLDAASPLL----------------AFDFD 88
Query: 217 HRPSIIFID--ERIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLD 263
+ +D ER+D ALF L + V G L++ P L
Sbjct: 89 PEAELYAVDDVERLDDAQQIALFNLFNRVRAHGQGALLVAGPAAPLALP 137
>gnl|CDD|213191 cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of
branched-chain amino acid transporter. LivF (TM1139) is
part of the LIV-I bacterial ABC-type two-component
transport system that imports neutral, branched-chain
amino acids. The E. coli branched-chain amino acid
transporter comprises a heterodimer of ABC transporters
(LivF and LivG), a heterodimer of six-helix TM domains
(LivM and LivH), and one of two alternative soluble
periplasmic substrate binding proteins (LivK or LivJ).
ABC transporters are a large family of proteins involved
in the transport of a wide variety of different
compounds, like sugars, ions, peptides, and more complex
organic molecules.
Length = 222
Score = 27.8 bits (63), Expect = 6.8
Identities = 30/104 (28%), Positives = 37/104 (35%), Gaps = 27/104 (25%)
Query: 206 LVKTLFELARAHRPSIIF----I-----DERIDRGIS-----NALFLLLSGVGNDMDGIL 251
L+KT+ L SI F I ER GI +F L+ N L
Sbjct: 42 LLKTIMGLLPPRSGSIRFDGRDITGLPPHERARAGIGYVPEGRRIFPELTVEEN-----L 96
Query: 252 VLGATNIPWVLDAAIRRRFEKRIYIALPELNARLFMFRHHLGNT 295
+LGA A + R E R+Y P L R R L T
Sbjct: 97 LLGAYARR---RAKRKARLE-RVYELFPRLKER----RKQLAGT 132
>gnl|CDD|227034 COG4690, PepD, Dipeptidase [Amino acid transport and metabolism].
Length = 464
Score = 28.2 bits (63), Expect = 6.9
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 282 NARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGAD 318
F H+ N + I E+LK+ AK+ + D
Sbjct: 182 APNQFRIDHYEFNDEDYICSEDLKDFAKKQGLWDPFD 218
>gnl|CDD|178593 PLN03021, PLN03021, Low-temperature-induced protein; Provisional.
Length = 619
Score = 28.4 bits (62), Expect = 6.9
Identities = 14/38 (36%), Positives = 20/38 (52%)
Query: 66 EKLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPD 103
+K+KE KK K K G D + + DE D +DP+
Sbjct: 38 KKVKEKAKKIKNSLTKHGNGHDHDVEDDDDEYDEQDPE 75
>gnl|CDD|223775 COG0703, AroK, Shikimate kinase [Amino acid transport and
metabolism].
Length = 172
Score = 27.6 bits (62), Expect = 7.2
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
I+L G G GKS + +A+A N
Sbjct: 5 IVLIGFMGAGKSTIGRALAKALN 27
>gnl|CDD|213244 cd03277, ABC_SMC5_euk, ATP-binding cassette domain of eukaryotic
SMC5 proteins. The structural maintenance of
chromosomes (SMC) proteins are large (approximately 110
to 170 kDa), and each is arranged into five
recognizable domains. Amino-acid sequence homology of
SMC proteins between species is largely confined to the
amino- and carboxy-terminal globular domains. The
amino-terminal domain contains a 'Walker A'
nucleotide-binding domain (GxxGxGKS/T, in the
single-letter amino-acid code), which by mutational
studies has been shown to be essential in several
proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B'
motif, and a motif with homology to the signature
sequence of the ATP-binding cassette (ABC) family of
ATPases. The sequence homology within the
carboxy-terminal domain is relatively high within the
SMC1-SMC4 group, whereas SMC5 and SMC6 show some
divergence in both of these sequences. In eukaryotic
cells, the proteins are found as heterodimers of SMC1
paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
(formerly known as Rad18).
Length = 213
Score = 27.6 bits (62), Expect = 7.5
Identities = 7/19 (36%), Positives = 14/19 (73%)
Query: 60 EYLARAEKLKEYLKKGKKQ 78
+ L RA+K+ E++K+G +
Sbjct: 52 KLLGRAKKVGEFVKRGCDE 70
>gnl|CDD|219839 pfam08433, KTI12, Chromatin associated protein KTI12. This is a
family of chromatin associated proteins which interact
with the Elongator complex, a component of the
elongating form of RNA polymerase II. The Elongator
complex has histone acetyltransferase activity.
Length = 266
Score = 27.6 bits (62), Expect = 7.9
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 161 ILLFGPPGTGKSYLAKAVATE 181
I+L G P +GKS AK +A
Sbjct: 2 IILTGLPSSGKSTRAKELAKY 22
>gnl|CDD|233087 TIGR00678, holB, DNA polymerase III, delta' subunit. This model
describes the N-terminal half of the delta' subunit of
DNA polymerase III. Delta' is homologous to the gamma
and tau subunits, which form an outgroup for
phylogenetic comparison. The gamma/tau branch of the
tree is much more tighly conserved than the delta'
branch, and some members of that branch score more
highly against this model than some proteins
classisified as delta'. The noise cutoff is set to
detect weakly scoring delta' subunits rather than to
exclude gamma/tau subunits. At position 126-127 of the
seed alignment, this family lacks the HM motif of
gamma/tau; at 132 it has a near-invariant A vs. an
invariant F in gamma/tau [DNA metabolism, DNA
replication, recombination, and repair].
Length = 188
Score = 27.6 bits (62), Expect = 8.1
Identities = 11/19 (57%), Positives = 12/19 (63%), Gaps = 1/19 (5%)
Query: 162 LLF-GPPGTGKSYLAKAVA 179
LF GP G GK LA A+A
Sbjct: 17 YLFAGPEGVGKELLALALA 35
>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin,
subunit D2 [Chromatin structure and dynamics / Cell
division and chromosome partitioning].
Length = 1128
Score = 28.0 bits (62), Expect = 8.3
Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 7/78 (8%)
Query: 41 LKYEITSAEGKQSIREKCDEYLARAEKLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSE 100
+++ S G Q ++ L AE LK G ++ + DGE KE ++D+ +
Sbjct: 415 MRHPFASEHGSQLRLTLWEKNLGSAENQLNSLKSGLQETLCDGE-------KEVEQDEGQ 467
Query: 101 DPDKKKMQANLEGAIVME 118
+ + N +E
Sbjct: 468 CRTELEGSFNKSAESRIE 485
>gnl|CDD|215749 pfam00148, Oxidored_nitro, Nitrogenase component 1 type
Oxidoreductase.
Length = 397
Score = 28.0 bits (63), Expect = 8.3
Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 3/36 (8%)
Query: 189 SVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFI 224
++ D+V G EKL + + E+ + ++P IF+
Sbjct: 48 DLTEKDVV---FGGEEKLEEAIREIDKRYKPKAIFV 80
>gnl|CDD|226372 COG3854, SpoIIIAA, ncharacterized protein conserved in bacteria
[Function unknown].
Length = 308
Score = 27.9 bits (62), Expect = 8.4
Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 4/31 (12%)
Query: 158 WKGILLFGPPGTGKSYL----AKAVATEANN 184
W L+ GPP GK+ L A+ ++ N
Sbjct: 137 WLNTLIIGPPQVGKTTLLRDIARLLSDGINQ 167
>gnl|CDD|234579 PRK00023, cmk, cytidylate kinase; Provisional.
Length = 225
Score = 27.4 bits (62), Expect = 8.6
Identities = 8/17 (47%), Positives = 12/17 (70%)
Query: 165 GPPGTGKSYLAKAVATE 181
GP G+GK +AK +A +
Sbjct: 11 GPAGSGKGTVAKILAKK 27
>gnl|CDD|225327 COG2607, COG2607, Predicted ATPase (AAA+ superfamily) [General
function prediction only].
Length = 287
Score = 27.7 bits (62), Expect = 8.9
Identities = 12/21 (57%), Positives = 15/21 (71%)
Query: 161 ILLFGPPGTGKSYLAKAVATE 181
+LL+G GTGKS L KA+ E
Sbjct: 88 VLLWGARGTGKSSLVKALLNE 108
>gnl|CDD|236022 PRK07471, PRK07471, DNA polymerase III subunit delta'; Validated.
Length = 365
Score = 27.7 bits (62), Expect = 9.5
Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 15/57 (26%)
Query: 125 TDVAGLEAAKEALKEAVILPIKFPQLFTGKRVP--WKGILLFGPPGTGKSYLAKAVA 179
T + G AA+ AL +A + R+ W L+ GP G GK+ LA +A
Sbjct: 19 TALFGHAAAEAALLDA----------YRSGRLHHAW---LIGGPQGIGKATLAYRMA 62
>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
Length = 857
Score = 27.9 bits (62), Expect = 9.7
Identities = 39/154 (25%), Positives = 66/154 (42%), Gaps = 31/154 (20%)
Query: 162 LLFGPPGTGKSYLAKAVATEA---------NNSTFFSVSSSDLV--SKWLGE-SEKLVKT 209
+L G PG GK+ + + +A ++ LV +K+ GE E+L
Sbjct: 203 VLIGEPGVGKTAIVEGLAQRIINGEVPEGLKGRRVLALDMGALVAGAKYRGEFEERLKGV 262
Query: 210 LFELARAHRPSIIFIDE---RIDRGISNALFLLLSGVGNDMDGILV------LGATNIPW 260
L +LA+ I+FIDE + G ++ GN + L +GAT +
Sbjct: 263 LNDLAKQEGNVILFIDELHTMVGAGKADGAM----DAGNMLKPALARGELHCVGATTLDE 318
Query: 261 VL-----DAAIRRRFEKRIYIALPELNARLFMFR 289
DAA+ RRF+K +++A P + + + R
Sbjct: 319 YRQYIEKDAALERRFQK-VFVAEPSVEDTIAILR 351
>gnl|CDD|224162 COG1241, MCM2, Predicted ATPase involved in replication control,
Cdc46/Mcm family [DNA replication, recombination, and
repair].
Length = 682
Score = 27.7 bits (62), Expect = 10.0
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 14/54 (25%)
Query: 135 EALKEAVILPIKFPQLF--TGKRVPWKG-------ILLFGPPGTGKSYLAKAVA 179
E +K+A++L QLF K +P ILL G PGT KS L K VA
Sbjct: 292 EDVKKAILL-----QLFGGVKKNLPDGTRIRGDIHILLVGDPGTAKSQLLKYVA 340
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.133 0.377
Gapped
Lambda K H
0.267 0.0740 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,148,804
Number of extensions: 1715775
Number of successful extensions: 3567
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3410
Number of HSP's successfully gapped: 304
Length of query: 330
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 233
Effective length of database: 6,635,264
Effective search space: 1546016512
Effective search space used: 1546016512
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.5 bits)