RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy568
(1614 letters)
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 830 bits (2145), Expect = 0.0
Identities = 349/1616 (21%), Positives = 587/1616 (36%), Gaps = 140/1616 (8%)
Query: 21 EGNEEGGPWMEKVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLLKALERYEGVEGVAHII 80
E + G + + + + + K + Y ++
Sbjct: 1160 ESQQVSGLDVAQFVSFLRSVDTGAFHAEYFRVFLCKIKHYTDACD----YLWHVKSPYVK 1215
Query: 81 DPKAISKEALYGVLDPNTREWTDGLFTHILRRIIDNVRGEIS-KRQWIIFDGDVDPEWVE 139
D E F R +++ ++ + + +VE
Sbjct: 1216 KK----------YFDA-DMELR-QFFLMFNR---EDMEARLADSKMEYEVE-----RYVE 1255
Query: 140 NLNSVLDDNKLLTLPNGERLSLPPNIRIMFEVQDLKYATLATVSRCGMIWFSEDVLSTEM 199
+ + KL GE +++ + V RC + + S LST+
Sbjct: 1256 KTKAEVSSLKLELSSVGEG-------QVVVSNLG---SIGDKVGRCLVEYDSISRLSTKG 1305
Query: 200 IFENYLSRLRNIALDDIDDDSSLLITVDATGKAPDDVLSPALTLQQDVASILSTHFAPDG 259
+F + L LD+ D S + +
Sbjct: 1306 VFLDELGD-TKRYLDECLDFFSC------------FEEVQKEIDELSMVFCAD------- 1345
Query: 260 LVVRALDYAMQQEHIMD-------FTRLRALGSLFSMLNQGVRNVLQYNHSHSDFPLSQD 312
AL ++ HI+ A SL + L
Sbjct: 1346 ----ALRFSADLYHIVKERRFSGVLAGSDASESLGGKSIELAAI------------LEHK 1389
Query: 313 VVERYIPRILVYSLLWSFAGDGKLKMRSDFGNFLRSVTTITLPATSS----DIVDFEVNI 368
+ + R + L GD + F L S I+ F ++
Sbjct: 1390 DLIVEMKRGINDVLKLRIFGDKCRESTPRFYLISDGDLIKDLNERSDYEEMLIMMFNISA 1449
Query: 369 KNGEWVPWSNKVPQIEVETQKVA-ASDVVVPTLDTVRHESLLYTWLAEHKPLVLCGPPGS 427
+ E+ ++V +VV+PT DT +S L + + CGPPGS
Sbjct: 1450 VITNNGS----IAGFELRGERVMLRKEVVIPTSDTGFVDSFSNEALNTLRSYIYCGPPGS 1505
Query: 428 GKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNGVILSPIQLGKWL 487
GK M + +LR+ EV NFS+ T L + EY V L P + K L
Sbjct: 1506 GKEMLMCPSLRSELITEVKYFNFSTCTMTPSKLSVLERETEYYPNTGVVRLYPKPVVKDL 1565
Query: 488 VLFCDEINLPDMDKYATQRVISFLRQLIEQRGFYRPADKQWVSLERIQCVGACNPPTDPG 547
VLFCDEINLP +Y VI FLR L+E++GF+ WV++ I GACNP TD G
Sbjct: 1566 VLFCDEINLPYGFEYYPPTVIVFLRPLVERQGFWSSIAVSWVTICGIILYGACNPGTDEG 1625
Query: 548 RKPLSHRFLRHVPVIYVDYPGETSLKQIYGTFSRAMLRLIPPLRGYADALTNAMVELYLA 607
R RF+R ++ YP SL+ IY ++ +A VELYL+
Sbjct: 1626 RVKYYERFIRKPVFVFCCYPELASLRNIYEAVLMGSYLCFDEFNRLSEETMSASVELYLS 1685
Query: 608 SQEKFTQDMQPHYVYSPREMTRWVRGICEAIRPLESLTVEGLVRLWAHEALRLFQDRLVN 667
S++K +Q +Y Y PRE+TR +R I L+ W EA+R DRLV
Sbjct: 1686 SKDKTKFFLQMNYGYKPRELTRSLRAIFGYAETRIDTPDVSLIIDWYCEAIREKIDRLVQ 1745
Query: 668 DVERQWTNENIDAVAMKYFSNIDKEVLA-RPILYSNWLSKNYVPVGTTELREYVQARLKV 726
E + +++ ++ + + I +S L + +L +V+ K+
Sbjct: 1746 QKESSTSRQDLYDFGLRAIREMIAGHIGEAEITFSMILFFGMACLLKKDLAVFVEEVRKI 1805
Query: 727 FYEEELDVQLVLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVF 786
F LDV+ V + + L H+LR R GH +L GV G FV ++N ++
Sbjct: 1806 FGSSHLDVEAVAYKDALLHILRSRRGLLVVGGHGVLKGVLIRGACDAREFVCWLNPRNMR 1865
Query: 787 QIRAHNKYTGADFDEDLRTVLRRSGCKNEKIAFLLDESNVLESGFLERMNTLLANGEIPG 846
+I H DF + L+ R + + ES +ES FLE N LL N
Sbjct: 1866 EIFGHRDELTGDFRDSLKVQDLRRNIHGGRECLFIFESIPVESSFLEDFNPLLDNNRFLC 1925
Query: 847 LFEGDEYTTLMTQCKEGAQREGLMLDSNEELYKWFTQQVMKNLHVVFTMNPSSEGLKDRA 906
LF G+E + + + L D+ L + F + +NL VVF+ S +
Sbjct: 1926 LFSGNERIRIPENLRFVFESTSLEKDTEATLTRVFLVYMEENLPVVFSACCSQDTSVLAG 1985
Query: 907 ATSPALFNRCVLNWFGDWSDTALYQVAKEFTSKIDLDGPQNWKAPDFFPSVCSLVSTTPS 966
SPAL NRC +++ W + Q A + G + + L +
Sbjct: 1986 IRSPALKNRCFIDFKKLWDTEEMSQYANSVETLSRDGGRVFFINGE-------LGVGKGA 2038
Query: 967 --HRDSVINACVYVHQTLHKANARLSKRGSRTMAITPRHYLDFINHFVKLYREKCSELEE 1024
+A V +S ++ + ++ + + EL+
Sbjct: 2039 LISEVFGDDAVVI-----EGRGFEISMIEG-SLGESKIKFIGGLKVYDARCVIYIEELDC 2092
Query: 1025 QQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQ 1084
++L G+ K E + E+++ L+ L K + A+ L ++ E E + V
Sbjct: 2093 TNVNLVEGVRKYNEYGRGMGELKEQLSNTVVILGVKEKNADDALSGTPGERLEREVKSVF 2152
Query: 1085 SQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSV 1144
+ + + E+ +++ VM+ + +PAV++A V +IKK L E+RS PP
Sbjct: 2153 VEAPRDMLFLLEEEVRKRKGSVMKFKSSKKPAVLEAVLFVYKIKKASLREIRSFIRPPGD 2212
Query: 1145 VKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITD-EVREKMHSRYLSN 1203
+ + +E +C LLG A W + + R++FI I + E+ D E R +R S+
Sbjct: 2213 LCIEMEDVCDLLGFEAKIWFGEQQSLRRDDFIRIIG-KYPDEIEFDLEARRFREARECSD 2271
Query: 1204 PDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETK 1263
P ++ NRAS ACGP+ +W + + + + +L+ PLR E K ++ +A + + K
Sbjct: 2272 PSFTGSILNRASKACGPLKRWLVRECNRSKVLEVKIPLREEEKRIDGEAFLVEDRLTLGK 2331
Query: 1264 DLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQA 1323
L + L + Y+ I + D+D V
Sbjct: 2332 GLSSDLMTFKLRRRS------------------------YYSLDILRVHGKIADMDTVHK 2367
Query: 1324 KVERSMALLKSLGIERERWEATSETFRSQMATIIGDVLLSSAYLAYAGYFDQHYRQSLFS 1383
V RS+ + + L E W M + GD SS Y G R F
Sbjct: 2368 DVLRSIFVSEILINEDSEWGGVFSEVPKLMVELDGDGHPSSCLHPYIGTLGFLCRAIEF- 2426
Query: 1384 TWNSHLIAAGIQFR-PEIA---LTEYLSSPDERLRWQGNALPSDHLCTENA-IMLRRFNR 1438
I +FR EI E + + S EN+ I +
Sbjct: 2427 --GMSFIRISKEFRDKEIRRRQFITEGVQKIEDFKEEAC---STDYGLENSRIRKDLQDL 2481
Query: 1439 YPLIIDPSGQATEFILKEFESRKITKTSFLDDAFRKNLESALRFGNPLLVQDVENYDTIL 1498
++ DPS + + ++ +K SF + F L A R G+ ++ D E D +
Sbjct: 2482 TAVLNDPSSKIVTSQRQMYDEKKAILGSFREMEFAFGLSQARREGSDKIIGDAEALDEEI 2541
Query: 1499 NPVLNRELRRTGGRVLITLGDQDIDISPTFVIFLSTRDPTVEFPPDICSRVTFVNFTVTR 1558
++ E + V + + +I S +F + + + ++ V F
Sbjct: 2542 GRLIKEEFKSNLSEVKVMINPPEIVRSTVEAVFWLSEGRSGDMGSIEWKQLIQVMFVSKV 2601
Query: 1559 SSLQSQCLNRVLKAERPDIDTKRSDLLKLQGEFHLRLRHLEKSLLGALNESKGKLL 1614
+++ + + K + L L+ L L L + LL L S +
Sbjct: 2602 LGCETEIPDALEKLVSGPLFVHEKALNALKACGSLFLWVLARYLLAKLMLSISNME 2657
>gnl|CDD|193256 pfam12780, AAA_8, P-loop containing dynein motor region D4. The 380
kDa motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules, of
which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This particular family is the D4 ATP-binding region of
the motor.
Length = 268
Score = 156 bits (396), Expect = 2e-42
Identities = 86/283 (30%), Positives = 141/283 (49%), Gaps = 25/283 (8%)
Query: 734 VQLVLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNK 793
+ LVLF++ + HV RI+RI P+G+ LL+GV G+GK +LSR A+++ L VFQI
Sbjct: 8 MNLVLFEDAMAHVCRINRILESPRGNALLVGVGGSGKQSLSRLAAYISSLDVFQITLRKG 67
Query: 794 YTGADFDEDLRTVLRRSGCKNEKIAFLLDESNVLESGFLERMNTLLANGEIPGLFEGDEY 853
Y D DL ++G KN FL+ ++ V + FL +N LLA+GEIP LF DE
Sbjct: 68 YGIPDLKIDLAAQCIKAGVKNVPTVFLMTDAQVADEQFLVLINDLLASGEIPDLFMDDEV 127
Query: 854 TTLMTQCKEGAQREGLMLDSNEELYKWFTQQVMKNLHVVFTMNPSSEGLKDRAATSPALF 913
+++ + + G + D+ E +K+F +V + L V+ +P L+ R+ PA+
Sbjct: 128 ENIISSVRNEVKSLG-LNDTRENCWKFFIDRVRRQLKVILCFSPVGSVLRVRSRKFPAVV 186
Query: 914 NRCVLNWFGDWSDTALYQVAKEFTSKIDLDGPQNWKAPDFFPSVCSLVSTTPSHRDSVIN 973
N ++WF +W AL V+ F + + + + FF +
Sbjct: 187 NCTAIDWFHEWPQDALVSVSARFLEETEGIPWEVKASISFFMA----------------- 229
Query: 974 ACVYVHQTLHKANARLSKRGSRTMAITPRHYLDFINHFVKLYR 1016
YVH ++++ + R TP+ +L+ I KLY+
Sbjct: 230 ---YVHTSVNEMSRVYLANERRYNYTTPKSFLEQI----KLYQ 265
>gnl|CDD|193253 pfam12777, MT, Microtubule-binding stalk of dynein motor. the 380
kDa motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules, of
which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This family is the region between D4 and D5 and is the
two predicted alpha-helical coiled coil segments that
form the stalk supporting the ATP-sensitive microtubule
binding component.
Length = 344
Score = 142 bits (358), Expect = 1e-36
Identities = 100/368 (27%), Positives = 173/368 (47%), Gaps = 33/368 (8%)
Query: 1029 LNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDI 1088
L GL K+ T QV++++ LA++ EL+ KNE A+ ++ + + ++ K K +
Sbjct: 3 LENGLMKLQSTASQVDDLKAKLAIQEVELKQKNEDADKLIQVVGVETEKVSKEKAIADQE 62
Query: 1089 QAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLA 1148
+ ++ E+ +K+ EDLA+ EPA++ AQ+A+ + K L EL+S +PP V
Sbjct: 63 EVKVAVINKEVTEKQKDCEEDLAKAEPALLAAQEALNTLNKNNLTELKSFGSPPDAVVNV 122
Query: 1149 LESICLLLGENAT-----DWKAIRAVVMR-ENFINSIVSNFNTEMITDEVREKMHSRYLS 1202
++ +L WKA + ++ + + F++S++ NF+ E I + K YL
Sbjct: 123 TAAVMVLTAPGGKIPKDRSWKAAKVMMGKVDTFLDSLI-NFDKENIHENCL-KAIKPYLG 180
Query: 1203 NPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEET 1262
+P++ E S A + W I + + ++ VEP R L+ + + + K
Sbjct: 181 DPEFDPEFIRSKSTAAAGLCSWVINIVRFYEVYCDVEPKRQALEKANAELAAAQEKLARI 240
Query: 1263 KDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQ 1322
K I +L ++A+ + + A A +K Q A+AT L N
Sbjct: 241 KAKIAELNANLANLTAAFEK--ATADKLK-------------CQQEAEATNRTILLAN-- 283
Query: 1323 AKVERSMALLKSLGIERERWEATSETFRSQMATIIGDVLLSSAYLAYAGYFDQHYRQSLF 1382
L+ L E RW E F+SQ T+ GDVLL +A+++Y GYF + YR L
Sbjct: 284 -------RLVGGLASENVRWAEAVENFKSQEVTLCGDVLLITAFVSYVGYFTKKYRNELM 336
Query: 1383 S-TWNSHL 1389
W +L
Sbjct: 337 ERFWIPYL 344
>gnl|CDD|193257 pfam12781, AAA_9, ATP-binding dynein motor region D5. The 380 kDa
motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules, of
which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This particular family is the D5 ATP-binding region of
the motor, but has lost its P-loop.
Length = 228
Score = 129 bits (325), Expect = 2e-33
Identities = 71/205 (34%), Positives = 119/205 (58%), Gaps = 12/205 (5%)
Query: 1415 WQGNALPSDHLCTENAIMLRRFNRYPLIIDPSGQATEFILKEFESR----KITKTSFLDD 1470
W LPSD + TENA +L R+PL++DP Q ++I ++ S +I + +LD
Sbjct: 11 WNNEGLPSDRMSTENATILGNCERWPLMVDPQLQGIKWIKNKYGSDLRVIRIGQKGYLD- 69
Query: 1471 AFRKNLESALRFGNPLLVQDV-ENYDTILNPVLNRELRRTGGRVLITLGDQDIDISPTFV 1529
+E A+ G+ +L++++ E+ D +L+P+L R + G I +GD++++ +P F
Sbjct: 70 ----VIEQAISEGDTVLIENIEESIDPVLDPLLGRNTIKKGR--YIKIGDKEVEYNPKFR 123
Query: 1530 IFLSTRDPTVEFPPDICSRVTFVNFTVTRSSLQSQCLNRVLKAERPDIDTKRSDLLKLQG 1589
+ L T+ + P++ ++ T +NFTVTR L+ Q L V+ ERPD++ +SDL K Q
Sbjct: 124 LILHTKLANPHYKPEMQAQCTLINFTVTRDGLEDQLLAAVVAKERPDLEQLKSDLTKQQN 183
Query: 1590 EFHLRLRHLEKSLLGALNESKGKLL 1614
EF + L+ LE SLL L+ + G L
Sbjct: 184 EFKIVLKELEDSLLSRLSSASGNFL 208
>gnl|CDD|193251 pfam12775, AAA_7, P-loop containing dynein motor region D3. the
380 kDa motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules,
of which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This particular family is the D3 and is an ATP binding
site.
Length = 272
Score = 93.8 bits (233), Expect = 1e-20
Identities = 75/277 (27%), Positives = 132/277 (47%), Gaps = 31/277 (11%)
Query: 395 VVVPTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL-PDMEVV-SLNFSS 452
+V T +T+R + + +P++L G G+GK++ + L +L PD +V ++ F+
Sbjct: 12 CLVHTSETIRVRYFMDLLMERQRPVMLVGNAGTGKSVLVGDKLASLDPDAYLVKNVPFNY 71
Query: 453 ATTPELLLKTFDHYCEYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLR 512
TT +L + E + N P K LV F D++N+P++D Y T + + +R
Sbjct: 72 YTTSAMLQAVLEKPLEKKAGRN---YGPPGTKK-LVYFIDDMNMPEVDAYGTVQPHTLIR 127
Query: 513 QLIEQRGFYRPADKQWVSLERI---QCVGACNPPTDPGRKPLSHRFLRHVPVIYVDYPGE 569
Q ++ +Y D+ ++L+ I Q V NP G ++ R RH V + +PG+
Sbjct: 128 QHMDYGHWY---DRSKLTLKEIHNCQYVSCMNPTA--GSFTINSRLQRHFCVFALSFPGQ 182
Query: 570 TSLKQIY----------GTFSRAMLRLIPPLRGYADALTNAMVELYLASQEKFTQDMQPH 619
+L IY G+ S A+ + PPL A AL + +L + KF H
Sbjct: 183 DALSTIYNIILTQHLALGSVSAALQKSSPPLVDAALALHQKIATTFLPTAIKF------H 236
Query: 620 YVYSPREMTRWVRGICEAIRPLESLTVEGLVRLWAHE 656
YV++ R+++ +G+ + + L+RLW HE
Sbjct: 237 YVFNLRDLSNIFQGLLFSSVECLKSPWD-LIRLWLHE 272
>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily). This Pfam
entry includes some of the AAA proteins not detected by
the pfam00004 model.
Length = 135
Score = 67.7 bits (166), Expect = 3e-13
Identities = 34/151 (22%), Positives = 59/151 (39%), Gaps = 26/151 (17%)
Query: 45 GLMMVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTR--EWT 102
G+++VGP G+GKS + L AL + ++E L G + + W
Sbjct: 1 GVLLVGPPGTGKSELAERLAAALSNRPVFYVQLT----RDTTEEDLKGRRNIDPGGASWV 56
Query: 103 DGLFTHILRRIIDNVRGEISKRQWIIFDGDVD---PEWVENLNSVLDDNKLLTLPNGERL 159
DG R I +++ P+ + +L S+LD+ +LL GE +
Sbjct: 57 DGPLVRAAR------------EGEIAVLDEINRANPDVLNSLLSLLDERRLLLPEGGELV 104
Query: 160 SLPP-NIRIMFEV----QDLKYATLATVSRC 185
P R++ + + L + A SR
Sbjct: 105 KAAPDGFRLIATMNPLDRGLNELSPALRSRF 135
Score = 44.2 bits (105), Expect = 5e-05
Identities = 33/142 (23%), Positives = 56/142 (39%), Gaps = 11/142 (7%)
Query: 418 PLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNGVI 477
++L GPPG+GK+ AL + V + + TT E L + +G +
Sbjct: 1 GVLLVGPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPL 60
Query: 478 LSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQRGFYRPADKQWVSLER--IQ 535
+ + G+ + DEIN + D V++ L L+++R P + V +
Sbjct: 61 VRAAREGE--IAVLDEINRANPD------VLNSLLSLLDERRLLLPEGGELVKAAPDGFR 112
Query: 536 CVGACNPPTDPGRKPLSHRFLR 557
+ NP D G LS
Sbjct: 113 LIATMNPL-DRGLNELSPALRS 133
Score = 41.9 bits (99), Expect = 4e-04
Identities = 37/169 (21%), Positives = 59/169 (34%), Gaps = 45/169 (26%)
Query: 759 HLLLIGVSGAGKTTLS-RFVAFMNGLSVFQIRAHNKYTGADF----------DEDLRTVL 807
+LL+G G GK+ L+ R A ++ VF ++ T D + L
Sbjct: 1 GVLLVGPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPL 60
Query: 808 RRSGCKNEKIAFLLDESNVLESGFLERMNTLLANGEIPGLFEGDEYTTLMTQCKEGAQRE 867
R+ + +IA LDE N L + +LL +
Sbjct: 61 VRAA-REGEIAV-LDEINRANPDVLNSLLSLLDERRL----------------------- 95
Query: 868 GLMLDSNEELYKWFTQQVMKNLHVVFTMNPSSEGLKDRAATSPALFNRC 916
L+ + E + ++ TMNP GL + SPAL +R
Sbjct: 96 -LLPEGGELVKA-----APDGFRLIATMNPLDRGLNE---LSPALRSRF 135
>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
cellular Activities) superfamily represents an ancient
group of ATPases belonging to the ASCE (for additional
strand, catalytic E) division of the P-loop NTPase fold.
The ASCE division also includes ABC, RecA-like,
VirD4-like, PilT-like, and SF1/2 helicases. Members of
the AAA+ ATPases function as molecular chaperons, ATPase
subunits of proteases, helicases, or nucleic-acid
stimulated ATPases. The AAA+ proteins contain several
distinct features in addition to the conserved
alpha-beta-alpha core domain structure and the Walker A
and B motifs of the P-loop NTPases.
Length = 151
Score = 67.9 bits (166), Expect = 5e-13
Identities = 36/167 (21%), Positives = 58/167 (34%), Gaps = 21/167 (12%)
Query: 398 PTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL--PDMEVVSLNFSSATT 455
+ L K L+L GPPG+GKT + L P + LN S
Sbjct: 1 VGQEEAIEALREALELPPPKNLLLYGPPGTGKTTLARAIANELFRPGAPFLYLNASDLLE 60
Query: 456 PELLLKTFDHYCEYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLI 515
++ + F H+ + K VLF DEI+ + + A ++ L L
Sbjct: 61 GLVVAELFGHFLVRLLF------ELAEKAKPGVLFIDEID--SLSRGAQNALLRVLETLN 112
Query: 516 EQRGFYRPADKQWVSLERIQCVGACNPPTDPG-RKPLSHRFLRHVPV 561
+ R E ++ +GA N P + L R + +
Sbjct: 113 DLRIDR----------ENVRVIGATNRPLLGDLDRALYDRLDIRIVI 149
Score = 47.1 bits (112), Expect = 8e-06
Identities = 26/98 (26%), Positives = 38/98 (38%), Gaps = 7/98 (7%)
Query: 749 IDRIFRQPQGHLLLIGVSGAGKTTLSRFVA---FMNGLSVFQIRAHNKYTGADFDEDLRT 805
+ + P +LLL G G GKTTL+R +A F G + A + G E
Sbjct: 11 REALELPPPKNLLLYGPPGTGKTTLARAIANELFRPGAPFLYLNASDLLEGLVVAELFGH 70
Query: 806 VLRRS----GCKNEKIAFLLDESNVLESGFLERMNTLL 839
L R K + +DE + L G + +L
Sbjct: 71 FLVRLLFELAEKAKPGVLFIDEIDSLSRGAQNALLRVL 108
Score = 29.4 bits (66), Expect = 7.6
Identities = 10/40 (25%), Positives = 18/40 (45%)
Query: 32 KVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLLKALERYE 71
+ L+ L++ GP G+GK+T + + L R
Sbjct: 8 EALREALELPPPKNLLLYGPPGTGKTTLARAIANELFRPG 47
>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
activities. AAA - ATPases associated with a variety of
cellular activities. This profile/alignment only detects
a fraction of this vast family. The poorly conserved
N-terminal helix is missing from the alignment.
Length = 148
Score = 54.3 bits (130), Expect = 3e-08
Identities = 29/153 (18%), Positives = 49/153 (32%), Gaps = 17/153 (11%)
Query: 416 HKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNG 475
+ +++ GPPGSGKT + R L + E +L K +G
Sbjct: 2 GEVILIVGPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKKASG 61
Query: 476 -------VILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQRGFYRPADKQW 528
+ L+ + K VL DEI + ++ +
Sbjct: 62 SGELRLRLALALARKLKPDVLILDEITSLLDAE----------QEALLLLLEELRLLLLL 111
Query: 529 VSLERIQCVGACNPPTDPGRKPLSHRFLRHVPV 561
S + + + N D G L RF R + +
Sbjct: 112 KSEKNLTVILTTNDEKDLGPALLRRRFDRRIVL 144
Score = 35.4 bits (81), Expect = 0.068
Identities = 36/163 (22%), Positives = 54/163 (33%), Gaps = 19/163 (11%)
Query: 756 PQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGCKNE 815
P +L++G G+GKTTL+R +A G + Y + +
Sbjct: 1 PGEVILIVGPPGSGKTTLARALARELGPPGGGV----IYIDGEDILEEVLDQLLLIIVGG 56
Query: 816 KIAFLLDESNVLESGFLERMNTLLANGEIPGLFEGDEYTTLMTQCKEGAQREGLMLDSNE 875
K A E + L + L DE T+L+ + +L E
Sbjct: 57 KKASGSGELRL------RLALALARKLKPDVLIL-DEITSLLD------AEQEALLLLLE 103
Query: 876 ELYKWFTQQVMKNLHVVFTMNPSSEGLKDRAATSPALFNRCVL 918
EL + KNL V+ T N + A R VL
Sbjct: 104 ELRLLLLLKSEKNLTVILTTNDEKD--LGPALLRRRFDRRIVL 144
Score = 35.4 bits (81), Expect = 0.091
Identities = 26/90 (28%), Positives = 36/90 (40%), Gaps = 8/90 (8%)
Query: 43 NHGLMMVGPSGSGKSTAWKVLLKALERYEGVEGV-AHIIDPKAISKEALYGVLDPNTREW 101
+++VGP GSGK+T L +AL R G G ID + I +E L +L
Sbjct: 2 GEVILIVGPPGSGKTT----LARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGK 57
Query: 102 TDGLFT-HILRRIIDNVRGEISKRQWIIFD 130
LR + R K +I D
Sbjct: 58 KASGSGELRLRLALALARK--LKPDVLILD 85
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 52.0 bits (125), Expect = 3e-06
Identities = 60/331 (18%), Positives = 135/331 (40%), Gaps = 34/331 (10%)
Query: 1015 YREKCSELEEQQLHLNV--------GLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL 1066
Y+E +EL E +L L V L ++ E +++ EE + L + QEL+ K E L
Sbjct: 215 YKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRL 274
Query: 1067 KLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQ---- 1122
++ E+ ++ +E +K + + +E+Q + ++ + L ++E + + +
Sbjct: 275 EVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
Query: 1123 ------AVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIR-AVVMRENF 1175
++E ++ EL S+ ++ LE + L E + +R V E
Sbjct: 335 LAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQ 394
Query: 1176 INSIVSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADML 1235
I S+ + E E R + +K A + ++
Sbjct: 395 IASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAEL---------------KELQ 439
Query: 1236 KKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLD 1295
++E L EL+ L+ + + EE ++ + + E+++ + + E AQL A+ +++ +
Sbjct: 440 AELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQE 499
Query: 1296 NVQAKFYEYAQLIAQATAIKTDLDNVQAKVE 1326
N++ L+ + + L + +
Sbjct: 500 NLEGFSEGVKALLKNQSGLSGILGVLSELIS 530
Score = 46.6 bits (111), Expect = 1e-04
Identities = 64/326 (19%), Positives = 131/326 (40%), Gaps = 28/326 (8%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
EK ELEE+ L L ++ + +E++EE + L + +EL + A L + +
Sbjct: 683 EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV 742
Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
++ E+R Q E+E + E+ ++ E+LA+ E A ++ +A E K++L L
Sbjct: 743 EQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAE-AEIEELEAQIEQLKEELKAL 801
Query: 1136 RSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREK 1195
R + L + LL E A + R + + + E +E+ E
Sbjct: 802 REALD-------ELRAELTLLNEEAAN-LRERLESLERRIAATERRLEDLEEQIEELSED 853
Query: 1196 MHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASEN 1255
+ S + +++ + L L E++
Sbjct: 854 IESLAAEIEELEELIEE------------LESELEALLNERASLEEALALLRSELEELSE 901
Query: 1256 KAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQL-----IAQ 1310
+ + E+K ++L + + +++ AQL + ++ +DN+Q + E L A
Sbjct: 902 ELRELESK--RSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEAL 959
Query: 1311 ATAIKTDLDNVQAKVERSMALLKSLG 1336
I+ D + + +++R +K LG
Sbjct: 960 ENKIEDDEEEARRRLKRLENKIKELG 985
Score = 40.8 bits (96), Expect = 0.006
Identities = 21/96 (21%), Positives = 49/96 (51%), Gaps = 3/96 (3%)
Query: 1008 INHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLK 1067
+ ++ RE+ + LE Q L L ++ ++++ E L K +EL+ + E+ +
Sbjct: 300 LEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAE 359
Query: 1068 LKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKR 1103
L+E+ + +E E R ++++ ++E ++AQ
Sbjct: 360 LEELEAELEELESR---LEELEEQLETLRSKVAQLE 392
Score = 40.0 bits (94), Expect = 0.013
Identities = 53/320 (16%), Positives = 122/320 (38%), Gaps = 41/320 (12%)
Query: 1036 IAETVEQVE--EMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQ------- 1086
+ E Q++ E Q A + +EL+++ L L + ++ E ++Q +
Sbjct: 195 LNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEE 254
Query: 1087 --DIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAV-------------KEIKKQQ 1131
++ AE+++ ++ + R+ V E ++E + Q+ + K+I +++
Sbjct: 255 LEELTAELQELEEKLEELRLEVSELEEEIE----ELQKELYALANEISRLEQQKQILRER 310
Query: 1132 LVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDE 1191
L L LES L E + + + E + S+ + +
Sbjct: 311 LANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEE--LESLEAELEEL---EA 365
Query: 1192 VREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKK-----VEPLRLELK 1246
E++ SR L + E + +I + + E L+ E++
Sbjct: 366 ELEELESR-LEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIE 424
Query: 1247 SLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQ 1306
L + E + K E + + +LE+ + ++E +L ++ +L+ + +
Sbjct: 425 ELLKKLEEAELK--ELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAER 482
Query: 1307 LIAQATAIKTDLDNVQAKVE 1326
+AQ A L+ +Q +E
Sbjct: 483 ELAQLQARLDSLERLQENLE 502
Score = 38.9 bits (91), Expect = 0.026
Identities = 28/116 (24%), Positives = 57/116 (49%), Gaps = 6/116 (5%)
Query: 1242 RLELKSLEVQASENKAKGE--ETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQA 1299
L L+ E++ + + E + I++LE+ ++ A L Q ++ L+ +++
Sbjct: 271 ELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELES 330
Query: 1300 KFYEYAQLIAQATA----IKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRS 1351
K E A+ +A+ +K +L++++A++E A L+ L E E ET RS
Sbjct: 331 KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRS 386
Score = 33.9 bits (78), Expect = 0.96
Identities = 15/105 (14%), Positives = 49/105 (46%), Gaps = 3/105 (2%)
Query: 1243 LELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFY 1302
L L+ L + E + + +E ++ + +L + +++ +L + + ++ +++ +Q + Y
Sbjct: 232 LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291
Query: 1303 EYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSE 1347
A I+ ++ ++ ++ L+ L + E E+ +
Sbjct: 292 ALANEIS---RLEQQKQILRERLANLERQLEELEAQLEELESKLD 333
Score = 30.8 bits (70), Expect = 7.2
Identities = 24/127 (18%), Positives = 52/127 (40%), Gaps = 11/127 (8%)
Query: 1242 RLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKF 1301
R E++ LE + E + K E + + +L K + ++E QL + + + ++
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735
Query: 1302 YE-------YAQLIAQATAIKTDLDN----VQAKVERSMALLKSLGIERERWEATSETFR 1350
+ IAQ + T+L+ ++ ++E + L E E EA E +
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795
Query: 1351 SQMATII 1357
++ +
Sbjct: 796 EELKALR 802
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 51.6 bits (124), Expect = 4e-06
Identities = 56/351 (15%), Positives = 140/351 (39%), Gaps = 65/351 (18%)
Query: 1015 YREKCSELEEQQL-----HLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKN---EAANL 1066
Y+E +EL E +L L ++ E E++ +++ L +EL+ E
Sbjct: 215 YQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKS 274
Query: 1067 KLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKE 1126
+L+E+ ++ +E ++ ++ ++ E+E + + ++ + +L ++E + + ++ ++
Sbjct: 275 ELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEA 334
Query: 1127 IKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTE 1186
+K++ LE LL E + S + E
Sbjct: 335 LKEE------------------LEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE 376
Query: 1187 MITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELK 1246
+ + +RE++ ++ +A+I ++E L+ E++
Sbjct: 377 LF-EALREELAE-----------------------LEAELAEIR-----NELEELKREIE 407
Query: 1247 SLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQ 1306
SLE + + E+ K+ + +LE + + E +L +L+ ++ + +
Sbjct: 408 SLEERLERLSERLEDLKEELKELEAELEELQTELEEL-------NEELEELEEQ---LEE 457
Query: 1307 LIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRSQMATII 1357
L + ++ +L +Q +++R L SL +R EA + A +
Sbjct: 458 LRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLE 508
Score = 46.2 bits (110), Expect = 2e-04
Identities = 45/310 (14%), Positives = 118/310 (38%), Gaps = 54/310 (17%)
Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLA---VKSQELQSKNEAANLKLKEMIKDQQ 1076
EL+E + L ++ + E+++ ++ L +EL+ + E +L+E+ ++
Sbjct: 667 RELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELA 726
Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELR 1136
E+ Q Q E+E++ E+ ++ + E L ++E + ++A+ ++K++
Sbjct: 727 ALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEE------ 780
Query: 1137 SMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREKM 1196
+E + + + + + E+ + E R +
Sbjct: 781 ------------IEELEEKRQALQ---EELEELEEELEEAERRLDALERELESLEQRRER 825
Query: 1197 HSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENK 1256
+ + + ++ +K++ L EL+ LE + E K
Sbjct: 826 LEQEIEE-----------------------LEEEIEELEEKLDELEEELEELEKELEELK 862
Query: 1257 AKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKT 1316
+ EE + +LE + ++E +L + L ++++ E + I +
Sbjct: 863 EELEELEAEKEELEDELKELEEEKEELEEE-------LRELESELAELKEEIEKLRERLE 915
Query: 1317 DLDNVQAKVE 1326
+L+ ++E
Sbjct: 916 ELEAKLERLE 925
Score = 42.8 bits (101), Expect = 0.002
Identities = 51/339 (15%), Positives = 133/339 (39%), Gaps = 26/339 (7%)
Query: 1035 KIAETVEQVEEMQKSLAVKSQELQS-KNEAANLKLKEM-IKDQQEAEKRKVQSQDIQAEI 1092
++ E E++ E++ L +EL+S KNE +L+ ++ Q E +R+++ +
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727
Query: 1093 EKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALESI 1152
++ +E Q R+ +E+ + ++ Q E +++L L +K +E +
Sbjct: 728 LEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA---KLKEEIEEL 784
Query: 1153 CLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREKMHSRYLSNPDYSYEKAN 1212
+ + + + E+ + E R + + + + E+
Sbjct: 785 EEKRQALQ---EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELE 841
Query: 1213 RASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGE-------ETKDL 1265
+ + ++ +++E L E + LE + E + + E E +
Sbjct: 842 EKLDEL--EEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESE 899
Query: 1266 ITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKV 1325
+ +L++ I ++ +L A+ ++ +L ++ + E + ++T+L+ ++
Sbjct: 900 LAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDT-----LETELEREIERL 954
Query: 1326 ERSMALLKSLGIER-ERWEATSETFR---SQMATIIGDV 1360
E + L + + E +E E + SQ +
Sbjct: 955 EEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAK 993
Score = 41.6 bits (98), Expect = 0.004
Identities = 44/252 (17%), Positives = 95/252 (37%), Gaps = 38/252 (15%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
+E+ ELEE+ L + +E +E+ ++ L + +EL+ + E KL E+ ++
Sbjct: 792 QEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEEL 851
Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
+E EK + ++ E+E + E+ + + E+ ++E + + + + E+ K+++ +L
Sbjct: 852 EELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAEL-KEEIEKL 910
Query: 1136 RSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREK 1195
R LE L E + + + E+ +
Sbjct: 911 RERLEELEAKLERLEVELPELEEELEE--------------------EYEDTLETELERE 950
Query: 1196 MHSRYLSNPDYSYEKANRASMACGPMVKWAIAQIS-----YADMLKKVEPLRLELKSLEV 1250
+ E+ A GP+ AI + Y ++ + E L + L
Sbjct: 951 I------------ERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLE 998
Query: 1251 QASENKAKGEET 1262
E + E
Sbjct: 999 VIEELDKEKRER 1010
Score = 41.6 bits (98), Expect = 0.004
Identities = 59/318 (18%), Positives = 136/318 (42%), Gaps = 51/318 (16%)
Query: 1015 YREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAAN------LKL 1068
+E ELEE + L+ ++ E E++EE +K + EL+ E L+L
Sbjct: 234 LKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLEL 293
Query: 1069 KEMIKD--------QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDA 1120
KE I++ ++ E+ + + ++++ +E+ +I + E+L + E + +
Sbjct: 294 KEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALK----EELEERETLLEEL 349
Query: 1121 QQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIV 1180
+Q + E+++ EL S + LE + L E + + N + +
Sbjct: 350 EQLLAELEE-AKEELEEKL---SALLEELEELFEALREELAE--LEAELAEIRNELEEL- 402
Query: 1181 SNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEP 1240
E+ + E+ R S ++ ++ ++ ++E
Sbjct: 403 ---KREI-----------------ESLEERLERLSE----RLEDLKEEL--KELEAELEE 436
Query: 1241 LRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAK 1300
L+ EL+ L + E + + EE +D + +LE+ +A ++E +L + ++++ LD ++A+
Sbjct: 437 LQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAE 496
Query: 1301 FYEYAQLIAQATAIKTDL 1318
+ A A+++ L
Sbjct: 497 QRASQGVRAVLEALESGL 514
Score = 34.7 bits (80), Expect = 0.53
Identities = 27/126 (21%), Positives = 57/126 (45%), Gaps = 2/126 (1%)
Query: 1008 INHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLK 1067
I+ + E +ELEE + L KI E++EE + L Q L EA +
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE-E 362
Query: 1068 LKEMIKDQQEAEKRKVQS-QDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKE 1126
L+E + E + ++ ++ AE+E + EI + + ++ +E + + +++
Sbjct: 363 LEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLED 422
Query: 1127 IKKQQL 1132
+K++
Sbjct: 423 LKEELK 428
Score = 34.3 bits (79), Expect = 0.79
Identities = 59/354 (16%), Positives = 127/354 (35%), Gaps = 54/354 (15%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
+ + ELEE+ L L ++ E +E++EE +SL +L+ + E K Q
Sbjct: 736 QSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKR------Q 789
Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
E+ + +++ E+E+ + + + E +Q ++E++++
Sbjct: 790 ALQEELE----ELEEELEEAERRLDALERELESLEQRRE----RLEQEIEELEEEI---- 837
Query: 1136 RSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREK 1195
++ L+ + L E K + + + + E+ E ++
Sbjct: 838 -------EELEEKLDELEEELEELE---KELEELKEELEELEAEKEELEDELKELEEEKE 887
Query: 1196 MHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASEN 1255
L + + + + ++ K+E L +EL LE + E
Sbjct: 888 ELEEELRELESELAELKE---------EIEKLRERLEELEAKLERLEVELPELEEELEE- 937
Query: 1256 KAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIK 1315
E L T+LE+ I ++E L + + V+ Y +L +Q ++
Sbjct: 938 ----EYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEE---RYEELKSQREDLE 990
Query: 1316 TDLDNVQAKVERSMALLKSLGIER--ERWEATSETFRSQMATIIGDVLLSSAYL 1367
+ + +E L ER E ++ +E F + G +A L
Sbjct: 991 EAKEKLLEVIEE----LDKEKRERFKETFDKINENFSEIFKELFGG---GTAEL 1037
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 51.2 bits (122), Expect = 6e-06
Identities = 82/384 (21%), Positives = 140/384 (36%), Gaps = 68/384 (17%)
Query: 416 HKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYC-------- 467
KP++L G PG GKT +L++AL +++ +N S T L F
Sbjct: 1543 GKPILLEGSPGVGKT-SLITALARKTGKKLIRINLSEQTD---LCDLFGSDLPVEEGGEF 1598
Query: 468 EYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQRG--FYRPAD 525
+ P L ++ G W++L DEINL A+Q V+ L ++ R + D
Sbjct: 1599 RWMDAP---FLHAMRDGGWVLL--DEINL------ASQSVLEGLNACLDHRREAYIPELD 1647
Query: 526 KQWVSLERIQCVGACNPP-TDPGRKPLSHRFLRHVPVIYVDYPGETSLKQIYGTFSRAML 584
K + + A NP GRK L FL V+ +D + I + M
Sbjct: 1648 KTFDVHPNFRVFAAQNPQDQGGGRKGLPKSFLNRFSVVKMDGLTTDDITHI----ANKMY 1703
Query: 585 RLIPPLRGYADALTNAMVELYLASQEKFTQDMQPHYVYSPREMTRWVR-----GICEAIR 639
+ + ++ + F P + ++ R+ RW+ G E +
Sbjct: 1704 PQVNEDWRLKIIKFMSRLQDNIEKDISFGSFGSP-WEFNLRDTLRWLILLNQVGTLEDVD 1762
Query: 640 PLESLTVEGLVRLWAHEALRLFQDRLVNDVERQWTNENIDAVAMKYFSNIDKEVLARPIL 699
+ + + R+ E + +++ + F + V R
Sbjct: 1763 TSDFIDESVVRRMRTVEDRVRTCELFKEVFGDYEPRTIGFSLSSQCF-KVGHSVTVRMKE 1821
Query: 700 YSNWLSKNYVPVGTTELREYVQARLKVFYEEELDVQLVLFDEVLDHVLR-IDRIFRQPQG 758
L ++V L QL +VL+ V+R I+ P
Sbjct: 1822 RRPRLDDSFVL---------------------LHSQL----QVLESVMRCINM--NWP-- 1852
Query: 759 HLLLIGVSGAGKTTLSRFVAFMNG 782
L+L+G +G GKT+L RF+A + G
Sbjct: 1853 -LILVGDTGVGKTSLLRFLASIFG 1875
Score = 41.1 bits (96), Expect = 0.006
Identities = 38/137 (27%), Positives = 63/137 (45%), Gaps = 16/137 (11%)
Query: 397 VPTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTP 456
V TL+T+ + L P+ L G GSGK+ L+S L +V ++ T
Sbjct: 130 VGTLETLDRLAKEAVDLKIAVPIYLEGGRGSGKSF-LISELCDEGGQRIVEIHLREITDA 188
Query: 457 ELLLKTFDHY----CEYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLR 512
++L+ T+ E+ K GV++ + G W +LF +DK A V+S+L
Sbjct: 189 KVLIGTYTSPKPGDFEWMK---GVLIEAVVSGDW-ILF------KRIDK-APHGVLSYLL 237
Query: 513 QLIEQRGFYRPADKQWV 529
L+E+R P+ + V
Sbjct: 238 TLLEKRRLLIPSRGETV 254
Score = 36.1 bits (83), Expect = 0.23
Identities = 106/501 (21%), Positives = 178/501 (35%), Gaps = 105/501 (20%)
Query: 475 GVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFL------RQLIEQRGFYRPADKQW 528
G+++ ++ G+WL+L DEINL + T +I L L+ + G RP
Sbjct: 627 GILVKCMRNGEWLLL--DEINLASEE---TLELIDGLLGKKESGILLSESGDLRPIRIH- 680
Query: 529 VSLERIQCVGACNPPTDPGRKPLSHRFLRHVPVIYVDYPGETSLKQIYGTFSRAMLRLIP 588
+ G NP TD G++ L F IYV P TSLK + + + R
Sbjct: 681 ---PGFRLFGCMNPGTDVGKRKLEGSFRSRFTEIYVHSP-STSLKDLLSIIHKYIGRYEI 736
Query: 589 PLRGYADALTNAMVELYLASQEKF-------TQDMQPHYVYSPREMTRWVRGICEAIRPL 641
+ ++ + ELYL ++ +PH S R + R + +
Sbjct: 737 S----EERVSRKIAELYLEAKSLSLGKELVDGAVQEPH--VSGRTLVRTLNYVTNIGE-- 788
Query: 642 ESLTVEGLVRLWAHEALRLFQDRLVNDVERQWTNENID---AVAMKYFSNIDKEVLARPI 698
+ GL R +E + L++ V + ++ + I+K+ P
Sbjct: 789 ----IYGLRRS-LYEGFCMSFLTLLDFVSESILVSLLSKFTLGLLESQNAIEKQKPKVP- 842
Query: 699 LYSNWLSKNYVPVGTTELREYVQARLKVFYEEELDVQLVLFDEVLDHVLRIDRIFRQPQG 758
+Y+ Y + EE+ ++ V + L R
Sbjct: 843 ------DHSYI-----AFCHYWKHGGSFPVEEQ--EHYIITPFVQKNYLNTMRAASLSNF 889
Query: 759 HLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHN-----KYTGADFDEDLRTVLRRSGCK 813
LL+ G + +GKT++ ++A G +I H +Y G +D ++ + G
Sbjct: 890 PLLIQGPTSSGKTSMILYLARETGHKFVRINNHEHTDLQEYIGTYVTDDDGSLSFKEGVL 949
Query: 814 NEKIA----FLLDESNVLESGFLERMNTLLANGEIPGLFEGDEYTTLMTQCKEGAQREGL 869
E + +LDE N+ + LE +N L
Sbjct: 950 VEALRRGYWIVLDELNLAPTDVLEALNRL------------------------------- 978
Query: 870 MLDSNEELYKWFTQQVMK---NLHVVFTMNPSSEGLKDRAATSPALFNRCVLNWFGDWSD 926
LD N EL+ TQ+V+ N + T NP G R S A NR + F D +
Sbjct: 979 -LDDNRELFIPETQEVVVPHPNFRLFATQNPPG-GYGGRKGLSRAFRNRFLEMHFDDIPE 1036
Query: 927 TALY-------QVAKEFTSKI 940
L ++A + KI
Sbjct: 1037 DELEEILHGRCEIAPSYAKKI 1057
Score = 32.3 bits (73), Expect = 2.7
Identities = 44/165 (26%), Positives = 75/165 (45%), Gaps = 19/165 (11%)
Query: 418 PLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTF----DHYCEYRKTP 473
PL++ GP SGKT +++ L + V +N T + + T+ D +++
Sbjct: 890 PLLIQGPTSSGKT-SMILYLARETGHKFVRINNHEHTDLQEYIGTYVTDDDGSLSFKE-- 946
Query: 474 NGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLI-EQRGFYRPADKQWV-SL 531
GV++ ++ G W+VL DE+NL D V+ L +L+ + R + P ++ V
Sbjct: 947 -GVLVEALRRGYWIVL--DELNLAPTD------VLEALNRLLDDNRELFIPETQEVVVPH 997
Query: 532 ERIQCVGACNPPTD-PGRKPLSHRFLRHVPVIYVDYPGETSLKQI 575
+ NPP GRK LS F ++ D E L++I
Sbjct: 998 PNFRLFATQNPPGGYGGRKGLSRAFRNRFLEMHFDDIPEDELEEI 1042
>gnl|CDD|222104 pfam13401, AAA_22, AAA domain.
Length = 124
Score = 42.3 bits (100), Expect = 2e-04
Identities = 29/105 (27%), Positives = 40/105 (38%), Gaps = 14/105 (13%)
Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKT----FDHYCEYRKTPN 474
VL G GSGKT L R LP+ VV + S TP+ LL+ T
Sbjct: 7 GVLTGESGSGKTTLLRRLARQLPNRRVVYVEAPSLGTPKDLLRKILRALGLPLSGGTTAE 66
Query: 475 GV--ILSPIQLGKWLVLFCDEI-NLPDMDKYATQRVISFLRQLIE 516
+ IL ++ +L DE +L + LR L +
Sbjct: 67 LLEAILDALKRRGRPLLIIDEAQHLSL-------EALEELRDLYD 104
Score = 36.1 bits (84), Expect = 0.029
Identities = 21/64 (32%), Positives = 28/64 (43%), Gaps = 4/64 (6%)
Query: 754 RQPQGHLLLIGVSGAGKTTLSR-FVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGC 812
R+ G +L G SG+GKTTL R + V + A + T D LR +LR G
Sbjct: 1 RRGAGIGVLTGESGSGKTTLLRRLARQLPNRRVVYVEAPSLGTPKDL---LRKILRALGL 57
Query: 813 KNEK 816
Sbjct: 58 PLSG 61
Score = 29.2 bits (66), Expect = 7.0
Identities = 20/83 (24%), Positives = 29/83 (34%), Gaps = 18/83 (21%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVL-----DPNTREWTD 103
G SGSGK+T + L + L V ++ P + + L + P + T
Sbjct: 10 TGESGSGKTTLLRRLARQLPNRR----VVYVEAPSLGTPKDLLRKILRALGLPLSGGTTA 65
Query: 104 GLFTHILRR---------IIDNV 117
L IL IID
Sbjct: 66 ELLEAILDALKRRGRPLLIIDEA 88
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 45.5 bits (108), Expect = 2e-04
Identities = 33/165 (20%), Positives = 68/165 (41%), Gaps = 23/165 (13%)
Query: 1038 ETVEQVEEMQKSLAVKSQ-------ELQSKNEAANLK-----LKEM-IKDQQEAEKRKVQ 1084
+TVE EE++KSL Q +L +K + K +K+ IK+ E +K K +
Sbjct: 161 KTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEE 220
Query: 1085 SQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSV 1144
++ E + ++++ ++ Q + + A + E+KK +L+ + N ++
Sbjct: 221 FDKLKKEGKA-----DKQKIKSAKNDLQNDKKQLKAD--LAELKKAPQNDLKRLENKYAI 273
Query: 1145 VKLALESIC-LLLGENATDW--KAIRAVVMRENFINSIVSNFNTE 1186
L++ L G + K ++ E +N E
Sbjct: 274 KSGDLKNFAVDLFGPEIRKYLQKFLKYYDQAEPLLNKSKKEPKEE 318
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 44.2 bits (105), Expect = 6e-04
Identities = 16/67 (23%), Positives = 30/67 (44%)
Query: 1260 EETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLD 1319
EE ++LI QLE+ I Y+ + + + +I + +L A A++ L+
Sbjct: 278 EEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILE 337
Query: 1320 NVQAKVE 1326
K+E
Sbjct: 338 KNLQKLE 344
Score = 32.7 bits (75), Expect = 2.0
Identities = 19/89 (21%), Positives = 42/89 (47%), Gaps = 3/89 (3%)
Query: 1251 QASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQ 1310
+ +NKAK + L+ +L++ I +Y+ E L +++ ++ ++A E L +
Sbjct: 388 KKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEA---EIKALEKE 444
Query: 1311 ATAIKTDLDNVQAKVERSMALLKSLGIER 1339
++ L N++ + LLK+ G
Sbjct: 445 IKELEKQLTNIEPTADEINKLLKAYGFGN 473
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 43.1 bits (102), Expect = 0.001
Identities = 32/168 (19%), Positives = 63/168 (37%), Gaps = 36/168 (21%)
Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
+ K + LE+Q E E ++ ++ L QEL+ + EA KLK
Sbjct: 210 LARLESKEAALEKQLRLEF-----EREKEELRKKYEEKL---RQELERQAEAHEQKLKNE 261
Query: 1072 IKDQQEAEKRKVQSQDIQAEIEKQ------TVEIAQKRVFVMEDLAQVEPAVMDA----- 1120
+ Q E ++ +++I+ ++E++ + R+ +E D
Sbjct: 262 LA-LQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENHKVQ 320
Query: 1121 --QQAVKEIK--------------KQQLVELRSMANPPSVVKLALESI 1152
AV+ +K ++L L+ +A +V AL S+
Sbjct: 321 QLWLAVEALKSALKSGSAGSPRPLVKELDALKELAKDDELVDAALASL 368
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 42.8 bits (101), Expect = 0.002
Identities = 55/367 (14%), Positives = 127/367 (34%), Gaps = 46/367 (12%)
Query: 1002 RHYLDFINHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKN 1061
+ +L + L+E + L ++ + +E+ E K L QEL+ +
Sbjct: 343 SELEELAEEKNELAKLLEERLKELEERL----EELEKELEKALERLKQLEEAIQELKEEL 398
Query: 1062 EAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQ 1121
+ L+E+ Q+E E+ + + ++++ E+E+ EI + + Q
Sbjct: 399 AELSAALEEI---QEELEELEKELEELERELEELEEEIKKL----------------EEQ 439
Query: 1122 QAVKEIKKQQLVELRSMANPPSVVKLAL----ESICLLLGENATDWKAIR-AVVMRENFI 1176
E K+ + EL V L E L L E + + E +
Sbjct: 440 INQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAEL 499
Query: 1177 NSIVSNFNTEMITDE----VREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYA 1232
+ E+ E ++ + EK + ++
Sbjct: 500 REEIEELEKELRELEEELIELLELEEALKEELEEKLEKLEN-----------LLEELEEL 548
Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKT 1292
+++ L+ EL+ LE + E K E L+ ++ + ++ +L + ++
Sbjct: 549 KEKLQLQQLKEELRQLEDRLQELKEL-LEELRLLRTRKEELEELRERLKELKKKLKELEE 607
Query: 1293 DLDNVQA--KFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFR 1350
L ++ + E ++ + + +L++ K+ L + L E E E
Sbjct: 608 RLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667
Query: 1351 SQMATII 1357
+++ +
Sbjct: 668 AEIRREL 674
Score = 41.7 bits (98), Expect = 0.004
Identities = 68/350 (19%), Positives = 130/350 (37%), Gaps = 74/350 (21%)
Query: 1010 HFVKLYREKCSELE--EQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLK 1067
H +L ELE E++L ++ E +E++E+ + L + EL EA +
Sbjct: 471 HEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEE 530
Query: 1068 LKEMIKD------QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQ 1121
L+E ++ + E K K+Q Q ++ E+ ++ + + E L +
Sbjct: 531 LEEKLEKLENLLEELEELKEKLQLQQLKEELR----QLEDRLQELKELL---------EE 577
Query: 1122 QAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVS 1181
+ +K++L ELR L E K + R + + ++
Sbjct: 578 LRLLRTRKEELEELRER-----------------LKELKKKLKELEE---RLSQLEELLQ 617
Query: 1182 NFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPL 1241
+ +E+ E S EK N A + L+++E
Sbjct: 618 SLELSEAENELEEA-EEELESE----LEKLN----------LQAELEELLQAALEELEEK 662
Query: 1242 RLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKF 1301
EL++ E++ + + EE QLE+ +E QL + ++ +L+ + K
Sbjct: 663 VEELEA-EIRRELQRIENEE------QLEE----KLEELEQLEEELEQLREELEELLKKL 711
Query: 1302 YEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRS 1351
E QLI + L++ +A++E L+ L E E E
Sbjct: 712 GEIEQLIEE-------LESRKAELEELKKELEKLEKALELLEELREKLGK 754
Score = 40.5 bits (95), Expect = 0.008
Identities = 55/381 (14%), Positives = 139/381 (36%), Gaps = 55/381 (14%)
Query: 982 LHKANARLSKRGSRTMAI--TPRHYLDFINHFVKLYREKCSELEEQQLHLNVGLGKIAET 1039
L +A L + + A + L+ + ++ + ELEE+ L + ++
Sbjct: 387 LEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESK 446
Query: 1040 VEQVEEMQKSLA---VKSQELQ--SKNEAANLK------LKEMIKDQQEAEKRKVQSQDI 1088
+ E+ + V QEL + E L L+E + ++E + + + +++
Sbjct: 447 ELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEEL 506
Query: 1089 QAEI---EKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVE-LRSMANPPSV 1144
+ E+ E++ +E+ + + E+L + + + + ++E+K++ ++ L+
Sbjct: 507 EKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLED 566
Query: 1145 VKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNT-EMITDEVREKMHSRYLSN 1203
L+ LL + +RE + + E ++ E + S
Sbjct: 567 RLQELKE--LLEELRLLRTRKEELEELRER-LKELKKKLKELEERLSQLEELLQSL---E 620
Query: 1204 PDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETK 1263
+ + A E L EL+ L +QA + +
Sbjct: 621 LSEAENELEEAE-----------------------EELESELEKLNLQAELEELL----Q 653
Query: 1264 DLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQA 1323
+ +LE+ + + E + + + + + ++ E QL + ++ +L+ +
Sbjct: 654 AALEELEEKVEELEAEIRRELQRIENEEQLEEKLE----ELEQLEEELEQLREELEELLK 709
Query: 1324 KVERSMALLKSLGIERERWEA 1344
K+ L++ L + E
Sbjct: 710 KLGEIEQLIEELESRKAELEE 730
Score = 35.5 bits (82), Expect = 0.33
Identities = 63/369 (17%), Positives = 146/369 (39%), Gaps = 45/369 (12%)
Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI 1072
+ E E+EE + L GL + E +E++ E KSL + ++L+ K E +L+E+
Sbjct: 291 ERLEELEREIEELEEELE-GLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELA 349
Query: 1073 KDQ----QEAEKRKVQSQDIQAEIEKQTV---------EIAQKRVFVMEDLAQVEPAVMD 1119
+++ + E+R + ++ E+EK+ E A + + E+LA++ A+ +
Sbjct: 350 EEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQE--LKEELAELSAALEE 407
Query: 1120 AQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSI 1179
Q+ ++E++K +L EL ++ E I L + I + +E I +
Sbjct: 408 IQEELEELEK-ELEELE------RELEELEEEIKKLE-------EQINQLESKELMIAEL 453
Query: 1180 VSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVE 1239
+ + + H + L + ++ + ++ + +
Sbjct: 454 AGAGEKCPVCGQELPEEHEKELLELYELELEELEEELS-----REKEEAELREEIEELEK 508
Query: 1240 PLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQA 1299
LR + L +A EE ++ + +LE + ++ +L Q +
Sbjct: 509 ELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELR------ 562
Query: 1300 KFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRSQMATIIGD 1359
+ + + + +L ++ + E L + L +++ + E SQ+ ++
Sbjct: 563 ---QLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERL-SQLEELLQS 618
Query: 1360 VLLSSAYLA 1368
+ LS A
Sbjct: 619 LELSEAENE 627
Score = 31.7 bits (72), Expect = 4.5
Identities = 56/351 (15%), Positives = 126/351 (35%), Gaps = 28/351 (7%)
Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
++ E+ +ELEE++ L ++ E E K + +EL+ E K++ +
Sbjct: 234 IEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERL 293
Query: 1072 ---IK----DQQEAEKRKVQSQDIQAEIEKQTVEIAQ---------KRVFVMEDLAQVEP 1115
+ ++E E + ++++ +EK + K +E+LA+ +
Sbjct: 294 EELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN 353
Query: 1116 AVMDAQQAVKEIKKQQLVELRSMANPPSVVKLAL-ESICLLLGENATDWKAIRAVVMREN 1174
+ + + +++L EL L E+I L E A A+ +
Sbjct: 354 ELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELE 413
Query: 1175 FINSIVSNFNTEM--ITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYA 1232
+ + E+ + +E+++ + P V +
Sbjct: 414 ELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCP-VCGQELPEEHE 472
Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKT 1292
L E LEL+ LE + S K E +L ++E+ ++ +LI +
Sbjct: 473 KELL--ELYELELEELEEELSREK----EEAELREEIEELEKELRELEEELIELLELEEA 526
Query: 1293 DLDNVQAKFYEYAQLIAQATAIKTDL--DNVQAKVERSMALLKSLGIERER 1341
+ ++ K + L+ + +K L ++ ++ + L+ L E
Sbjct: 527 LKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577
Score = 31.3 bits (71), Expect = 5.4
Identities = 21/117 (17%), Positives = 60/117 (51%), Gaps = 9/117 (7%)
Query: 1015 YREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD 1074
+ SEL E L L + + ++++EE+Q+ + +EL+ + EA L+E + +
Sbjct: 190 LEGQLSELLEDIEDLLEALEEELKELKKLEEIQE--EQEEEELEQEIEA----LEERLAE 243
Query: 1075 QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
+E ++R ++++A + + + E+L ++E + + ++ ++ +++ +
Sbjct: 244 LEEEKER---LEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELE 297
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 41.4 bits (98), Expect = 0.004
Identities = 28/111 (25%), Positives = 51/111 (45%), Gaps = 4/111 (3%)
Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM--IKDQQEAEKRKVQSQDIQAEIE 1093
I E E++ E+ SL +EL+ K E A LKE +K++ E +K K+Q + +E
Sbjct: 511 IGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE-EDKLLE 569
Query: 1094 KQTVEIAQKRVFVMEDLAQVEPAVMDAQQA-VKEIKKQQLVELRSMANPPS 1143
+ E Q ++ ++ + Q+ +K +L+E R N +
Sbjct: 570 EAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKAN 620
Score = 31.3 bits (72), Expect = 4.8
Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 16/81 (19%)
Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKT 1292
+++ +E L EL+ +A E +A +E + L +LE+ ++E +L+ +A
Sbjct: 520 ELIASLEELERELEQ---KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEK--- 573
Query: 1293 DLDNVQAKFYEYAQLIAQATA 1313
E Q I +A
Sbjct: 574 ----------EAQQAIKEAKK 584
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and C-terminal
domains of this protein are well conserved, but the
central hinge region is skewed in composition and highly
divergent [Cellular processes, Cell division, DNA
metabolism, Chromosome-associated proteins].
Length = 1164
Score = 41.6 bits (98), Expect = 0.004
Identities = 50/324 (15%), Positives = 122/324 (37%), Gaps = 43/324 (13%)
Query: 1032 GLGKIAETVEQVEEMQKSLAV----------KSQELQSKNEAANLKLKEMIKDQQEAEKR 1081
++ E++E +++ L+ + EL + A+ K+ E+ K+ ++ E+
Sbjct: 672 EPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731
Query: 1082 KVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANP 1141
+ + ++ E+E+ + Q+ V +L ++E A ++ + ++ L +L + +
Sbjct: 732 EEKLKERLEELEEDLSSLEQEIENVKSELKELE-ARIEELEEDLHKLEEALNDLEARLSH 790
Query: 1142 PSVVKLALESICLLLGENATDWKAIRAVVMR-ENFINSIVSNFNTEMITDEVREKMHSRY 1200
+ ++ E + V R E + I N + E EK
Sbjct: 791 SRIPEIQAEL------------SKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQEL 838
Query: 1201 LSNPDY--SYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAK 1258
K+ + K + + ++ + L L L+ + E +A+
Sbjct: 839 QEQRIDLKEQIKSIEKEIENLNGKKEELEEE-LEELEAALRDLESRLGDLKKERDELEAQ 897
Query: 1259 GEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKT-- 1316
E + I +LE I + ++L A+ +A E +++ +
Sbjct: 898 LRELERKIEELEAQIEKKRKRLSELKAKL----------EALEEELSEIEDPKGEDEEIP 947
Query: 1317 ----DLDNVQAKVERSMALLKSLG 1336
L++VQA+++R +++L
Sbjct: 948 EEELSLEDVQAELQRVEEEIRALE 971
Score = 40.1 bits (94), Expect = 0.012
Identities = 53/338 (15%), Positives = 128/338 (37%), Gaps = 44/338 (13%)
Query: 1021 ELEEQQLHLNVGLGKIAETVEQVEEMQ--KSLAVKSQELQSKNEAANLKLKEMIKDQQEA 1078
ELEE + ++ I E +Q+E ++ + A + Q L + K+ E
Sbjct: 178 ELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALER 237
Query: 1079 EKRKVQSQ--DIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELR 1136
+K ++ Q ++ E+EK T EI++ ++E ++ K+IK E
Sbjct: 238 QKEAIERQLASLEEELEKLTEEISELE-------KRLEEIEQLLEELNKKIKDLGEEEQL 290
Query: 1137 SMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREKM 1196
+ ++ + S+ E + K E +E+ ++
Sbjct: 291 RVKEKIGELEAEIASL-----ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREI 345
Query: 1197 HSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENK 1256
E+ R + YA++ +++E LR EL+ ++ + +E +
Sbjct: 346 EE----------ERKRRDKL-----------TEEYAELKEELEDLRAELEEVDKEFAETR 384
Query: 1257 AKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKT 1316
+ ++ ++ + +L++ I K E +L + + +L ++ A A + A+ ++
Sbjct: 385 DELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAI---AGIEAKINELEE 441
Query: 1317 DLDNVQAKVERS----MALLKSLGIERERWEATSETFR 1350
+ ++ ++++ L L + E +
Sbjct: 442 EKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYD 479
Score = 39.3 bits (92), Expect = 0.018
Identities = 56/305 (18%), Positives = 112/305 (36%), Gaps = 40/305 (13%)
Query: 1015 YREKCSELEEQQLHLNVGLGKIAET--------VEQVEEMQKSLAVKSQELQSKNEAANL 1066
Y+ E E + + + + E + +EE + L + EL+ + E
Sbjct: 213 YQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQ 272
Query: 1067 KLKEM---IKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQA 1123
L+E+ IKD E E+ + ++ +I + EIA + E ++E A +
Sbjct: 273 LLEELNKKIKDLGEEEQLR-----VKEKIGELEAEIASLERSIAEKERELEDAEERLAKL 327
Query: 1124 VKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNF 1183
EI K E+ + + + + E + + +RA + +
Sbjct: 328 EAEIDKLLA-EIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVD-----KEFA 381
Query: 1184 NTEMITDEVREKM------HSRYLSNPDYSYEKANRASMACGPM---VKWAIAQISYADM 1234
T + REK+ + D E+ R S + + A+I ++
Sbjct: 382 ETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKI--NEL 439
Query: 1235 LKKVEPLRLELKSLEVQASENKAKGEE-------TKDLITQLEKSIASYKDEYAQLIAQA 1287
++ E LE+K E + + A + K+ ++EK ++ + E A+ AQA
Sbjct: 440 EEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQA 499
Query: 1288 TAIKT 1292
A +
Sbjct: 500 RASEE 504
Score = 35.4 bits (82), Expect = 0.29
Identities = 25/118 (21%), Positives = 52/118 (44%), Gaps = 12/118 (10%)
Query: 1227 AQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQ 1286
+D +K+ + E++ LE + + K + EE ++ ++ LE+ I + K E +L A+
Sbjct: 707 LSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR 766
Query: 1287 ATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEA 1344
++ DL ++ + ++ L +++ A L L E R EA
Sbjct: 767 IEELEEDLHKLEEALND----------LEARLS--HSRIPEIQAELSKLEEEVSRIEA 812
Score = 33.9 bits (78), Expect = 0.92
Identities = 23/128 (17%), Positives = 61/128 (47%), Gaps = 17/128 (13%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
E+ S +E + + L ++ E +E+ + QELQ + + + I+ +
Sbjct: 804 EEEVSRIEARLREIEQKLNRLTLEKEYLEK-------EIQELQEQRIDLKEQ-IKSIEKE 855
Query: 1076 QE---AEKRKVQSQ--DIQAEI---EKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEI 1127
E +K +++ + +++A + E + ++ ++R + L ++E + + + + E
Sbjct: 856 IENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQI-EK 914
Query: 1128 KKQQLVEL 1135
K+++L EL
Sbjct: 915 KRKRLSEL 922
Score = 32.7 bits (75), Expect = 1.9
Identities = 30/143 (20%), Positives = 68/143 (47%), Gaps = 10/143 (6%)
Query: 1010 HFVKLY-REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKL 1068
K Y ++ EL+EQ++ L +I +++E + +EL+ EAA L
Sbjct: 825 TLEKEYLEKEIQELQEQRIDLKE---QIKSIEKEIENLNGKKEELEEELEEL-EAALRDL 880
Query: 1069 KEMIKD-QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEI 1127
+ + D ++E ++ + Q ++++ +IE+ +I +KR + E A++E ++ + EI
Sbjct: 881 ESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE----ALEEELSEI 936
Query: 1128 KKQQLVELRSMANPPSVVKLALE 1150
+ + + S+ + E
Sbjct: 937 EDPKGEDEEIPEEELSLEDVQAE 959
>gnl|CDD|223786 COG0714, COG0714, MoxR-like ATPases [General function prediction
only].
Length = 329
Score = 40.5 bits (95), Expect = 0.005
Identities = 61/274 (22%), Positives = 96/274 (35%), Gaps = 30/274 (10%)
Query: 393 SDVVVPTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSS 452
VVV + + L L ++L GPPG GKT+ + RAL + V + +
Sbjct: 23 EKVVVGDEEVIE---LALLALLAGGHVLLEGPPGVGKTLLARALARAL-GLPFVRIQCTP 78
Query: 453 ATTPELLLKTFD---HYCEYRKTPNGVILS-PIQLGKWLVLFCDEINLPDMDKYATQRVI 508
P LL T+ E + + P+ ++L DEIN A V
Sbjct: 79 DLLPSDLLGTYAYAALLLEPGEF---RFVPGPLFAAVRVILLLDEINR------APPEVQ 129
Query: 509 SFLRQLIEQRGFYRPADKQWVSLERIQCVGA-CNPPTDPGRKPLSHRFLRHVPV-IYVDY 566
+ L + +E+R P + L V A NP G PL L + IYVDY
Sbjct: 130 NALLEALEERQVTVP-GLTTIRLPPPFIVIATQNPGEYEGTYPLPEALLDRFLLRIYVDY 188
Query: 567 PGETSLKQIYGTFSRAMLRLIPPLRGYADALTNAMVELYLASQEKFTQDMQPHYVYSPRE 626
P ++I +R L + E L Q++ + V E
Sbjct: 189 PDSEEEERII--LARVGGVDELDLESLVKPVL--SDEELLRLQKEVKK------VPVSDE 238
Query: 627 MTRWVRGICEAIRPLESLTVEGLVRLWAHEALRL 660
+ ++ + A+R + + R L
Sbjct: 239 VIDYIVTLVAALREAPDVALGASPRASLALLAAL 272
Score = 37.0 bits (86), Expect = 0.068
Identities = 31/123 (25%), Positives = 49/123 (39%), Gaps = 19/123 (15%)
Query: 729 EEELDVQLVLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQI 788
EL+ +V +EV++ L GH+LL G G GKT L+R +A GL +I
Sbjct: 19 RSELEKVVVGDEEVIELALLALLA----GGHVLLEGPPGVGKTLLARALARALGLPFVRI 74
Query: 789 RAHNKYTGADFD--EDLRTVLRRSGCK---------NEKIAFLLDESNV----LESGFLE 833
+ +D +L G ++ LLDE N +++ LE
Sbjct: 75 QCTPDLLPSDLLGTYAYAALLLEPGEFRFVPGPLFAAVRVILLLDEINRAPPEVQNALLE 134
Query: 834 RMN 836
+
Sbjct: 135 ALE 137
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein TolA;
Provisional.
Length = 387
Score = 40.6 bits (95), Expect = 0.006
Identities = 26/115 (22%), Positives = 48/115 (41%), Gaps = 12/115 (10%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
+E+ +LE+++L E +Q EE K ++ Q + E A K K +
Sbjct: 100 QERLKQLEKERL-------AAQEQKKQAEEAAK----QAALKQKQAEEAAAKAAAAAKAK 148
Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
EAE ++ + +A E + A+ + A+ + A +A E KK+
Sbjct: 149 AEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE-AKKKAEAEAAAKAAAEAKKK 202
Score = 38.2 bits (89), Expect = 0.031
Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 5/96 (5%)
Query: 1044 EEMQKSLAVKSQELQSKNEAANLKLKEMIKD---QQEAEKRKVQSQDIQAEIEKQTVEIA 1100
E+ +K +++ELQ K A +LK++ K+ QE +K+ ++ A +KQ E A
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138
Query: 1101 QKRVFVMEDLAQVEPAVMD--AQQAVKEIKKQQLVE 1134
K + A+ E A++A E KK+ E
Sbjct: 139 AKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAE 174
Score = 33.6 bits (77), Expect = 0.81
Identities = 28/126 (22%), Positives = 49/126 (38%), Gaps = 13/126 (10%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
R+K + + ++L AE + ++ LA + Q+ Q EAA K +
Sbjct: 81 RKKKEQQQAEELQQKQ----AAEQERLKQLEKERLAAQEQKKQ-AEEAAKQAAL---KQK 132
Query: 1076 QEAEKRKVQSQD--IQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLV 1133
Q E + +AE E + A K+ A+ + A++A E KK+
Sbjct: 133 QAEEAAAKAAAAAKAKAEAEAKRAAAAAKKA---AAEAKKKAEAEAAKKAAAEAKKKAEA 189
Query: 1134 ELRSMA 1139
E + A
Sbjct: 190 EAAAKA 195
Score = 30.5 bits (69), Expect = 8.1
Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 11/80 (13%)
Query: 1055 QELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVE 1114
Q+ Q K+ + ++ + QQ E + Q AE E+ ++ ++R+ E Q E
Sbjct: 67 QQQQQKSAKRAEEQRKKKEQQQAEELQ----QKQAAEQERLK-QLEKERLAAQEQKKQAE 121
Query: 1115 PAVMDAQQAVKEIKKQQLVE 1134
+ A + KQ+ E
Sbjct: 122 ------EAAKQAALKQKQAE 135
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 40.6 bits (96), Expect = 0.007
Identities = 26/126 (20%), Positives = 56/126 (44%), Gaps = 14/126 (11%)
Query: 1222 VKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYA 1281
+K + + + +E + EL L+ +N+ + EE KD +L K++ + Y
Sbjct: 103 IKEIEELL--DTIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLLAKSFSYG 160
Query: 1282 QLIAQATAIKTDLDNVQAKFYEYAQL-----IAQAT----AIKTDLDNVQAKVERSMALL 1332
+ ++ LD ++ +F ++ +L +A ++ + D ++ K+E LL
Sbjct: 161 PAL---DELEKQLDELEEEFEQFVELTESGDYLEAREVLLKLEEETDALEQKMEEIPPLL 217
Query: 1333 KSLGIE 1338
K L E
Sbjct: 218 KELQNE 223
Score = 31.8 bits (73), Expect = 3.8
Identities = 23/100 (23%), Positives = 53/100 (53%), Gaps = 8/100 (8%)
Query: 1038 ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQE-----AEKRKVQS--QDIQA 1090
+ + +++ +Q+S + EL++ E +L+E+ + AEK+ S Q+
Sbjct: 324 QLLLELDRLQQSYTLNEDELETVRELEK-QLEELEAQYDQLVERIAEKKVAYSELQEELE 382
Query: 1091 EIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
EIEKQ EI +++ + E L + ++A++ ++E +++
Sbjct: 383 EIEKQLEEIEKEQEELSESLQGLRKDELEAREKLQEYRQK 422
Score = 30.6 bits (70), Expect = 9.0
Identities = 26/125 (20%), Positives = 54/125 (43%), Gaps = 26/125 (20%)
Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAE 1079
SEL+E+ +I + +E++E+ Q+ L+ Q L+ A KL+E + E
Sbjct: 375 SELQEELE-------EIEKQLEEIEKEQEELSESLQGLRKDELEAREKLQEYRQKLHEI- 426
Query: 1080 KRKVQSQDI--------------QAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVK 1125
KR ++ ++ EIE E+ + + ++ V + +A V+
Sbjct: 427 KRYLEKSNLPGLPEDYLEYFFVVSDEIEALADELNEVPI----NMDAVNRQLEEATDDVE 482
Query: 1126 EIKKQ 1130
+K++
Sbjct: 483 TLKEK 487
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 38.8 bits (91), Expect = 0.007
Identities = 17/102 (16%), Positives = 41/102 (40%), Gaps = 8/102 (7%)
Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD------QQEAEKRKVQSQDI 1088
KIA+ + + E +++ E + + E A + E+I+ Q E + +++
Sbjct: 41 KIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEEL 100
Query: 1089 QAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
+ E EI ++ +E+L A +++ +
Sbjct: 101 ERIKEAAEAEIEAEKERALEELRAE--VAELAVAIAEKLLGK 140
Score = 34.6 bits (80), Expect = 0.16
Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 7/71 (9%)
Query: 1069 KEMIKDQ-QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEI 1127
+ I D EAE+ K ++Q + AE E++ +E A+++ + + A+ E A+Q +EI
Sbjct: 39 QAKIADDLAEAERLKEEAQALLAEYEQE-LEEAREQASEIIEQAKKE-----AEQIAEEI 92
Query: 1128 KKQQLVELRSM 1138
K + EL +
Sbjct: 93 KAEAEEELERI 103
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 37.6 bits (88), Expect = 0.014
Identities = 20/95 (21%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQS--QDIQAEIEKQTVE 1098
++ +++Q L K +ELQ + + + + ++ ++A+++++Q Q++Q + + E
Sbjct: 42 KEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQE 101
Query: 1099 IAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLV 1133
+ QK+ E L + + A + V + K LV
Sbjct: 102 LQQKQ---QELLQPIYDKIDKAIKEVAKEKGYDLV 133
Score = 29.5 bits (67), Expect = 7.0
Identities = 19/96 (19%), Positives = 48/96 (50%), Gaps = 12/96 (12%)
Query: 1046 MQKSLAVKS--QELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKR 1103
+ +S A K+ ++L+ + + +L++ K+ Q+ E Q +Q + + E + +
Sbjct: 27 LSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEE------QKLQKQAATLSEEARKAK 80
Query: 1104 VFVMEDLAQVEPAVMDAQQAV-KEIKKQQLVELRSM 1138
++L Q + + QQA +E++++Q L+ +
Sbjct: 81 ---QQELQQKQQELQQKQQAAQQELQQKQQELLQPI 113
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked with
mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of it
enhances mitochondrial movement. The activity appears to
be mediated through binding the mitochondria to the actin
intermediate filaments (IFs).
Length = 349
Score = 39.1 bits (92), Expect = 0.016
Identities = 24/101 (23%), Positives = 52/101 (51%), Gaps = 5/101 (4%)
Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIK-DQQEAEKRKVQSQDIQAEIEKQTVEI 1099
EQ+EE ++ L + E +++ E K E + +Q+ AEKR+++ + + E+E+Q E
Sbjct: 241 EQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEK 300
Query: 1100 AQKRVFVMEDLAQVEPAVMDA----QQAVKEIKKQQLVELR 1136
++R E+ + + + Q ++E +++ L E
Sbjct: 301 EERRAAEREEELEEGERLREEEAERQARIEEERQRLLKEHA 341
Score = 33.3 bits (77), Expect = 1.0
Identities = 20/97 (20%), Positives = 50/97 (51%), Gaps = 1/97 (1%)
Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAE-KRKVQSQDIQAEIEKQTVEI 1099
Q+EE ++ A + +E + +E + + + +++E E KRK + ++ +A +++Q E
Sbjct: 24 AQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEER 83
Query: 1100 AQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELR 1136
++R E+ Q + + + ++E + + E R
Sbjct: 84 EKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKR 120
Score = 32.2 bits (74), Expect = 2.5
Identities = 16/60 (26%), Positives = 31/60 (51%)
Query: 1044 EEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKR 1103
EE +K K + +++ E K + + +++ E E + + + QAE E+ E A+KR
Sbjct: 223 EEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKR 282
Score = 31.8 bits (73), Expect = 3.1
Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 2/105 (1%)
Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQ 1095
+A Q EE + + + + KE K+++EAEKR+ Q Q++Q E+Q
Sbjct: 184 VARLRAQQEEAEDEREELDELRADLYQEEYER-KERQKEKEEAEKRRRQKQELQRAREEQ 242
Query: 1096 TVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMAN 1140
+E ++R+ + E M +QA E +Q+ E R M
Sbjct: 243 -IEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKR 286
>gnl|CDD|144608 pfam01078, Mg_chelatase, Magnesium chelatase, subunit ChlI.
Magnesium-chelatase is a three-component enzyme that
catalyzes the insertion of Mg2+ into protoporphyrin IX.
This is the first unique step in the synthesis of
(bacterio)chlorophyll. Due to this, it is thought that
Mg-chelatase has an important role in channelling inter-
mediates into the (bacterio)chlorophyll branch in
response to conditions suitable for photosynthetic
growth. ChlI and BchD have molecular weight between
38-42 kDa.
Length = 207
Score = 38.3 bits (90), Expect = 0.016
Identities = 44/156 (28%), Positives = 57/156 (36%), Gaps = 54/156 (34%)
Query: 419 LVLCGPPGSGKTMTLLSALRA-LPDME---------VVSLNFSSATTPELLLKTFDHYCE 468
L++ GPPGSGKTM L L LP + + S+ + + F
Sbjct: 25 LLMIGPPGSGKTM-LAKRLPGILPPLTEQEALEVTAIHSVAGLGGDGGLIRRRPFRA--- 80
Query: 469 YRKTPN-----------GVILSP--IQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLI 515
P+ G I P I L VLF DE LP+ + RV+ LRQ +
Sbjct: 81 ----PHHSASAAALVGGGSIPRPGEISLAHNGVLFLDE--LPEFSR----RVLESLRQPL 130
Query: 516 E------QRGFYR---PADKQWVSLERIQCVGACNP 542
E R + PA R Q V A NP
Sbjct: 131 EDGEITISRARAKVTFPA--------RFQLVAAMNP 158
Score = 31.3 bits (72), Expect = 3.1
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 43 NHGLMMVGPSGSGKST 58
H L+M+GP GSGK+
Sbjct: 22 GHNLLMIGPPGSGKTM 37
>gnl|CDD|178532 PLN02946, PLN02946, cysteine-tRNA ligase.
Length = 557
Score = 39.1 bits (91), Expect = 0.018
Identities = 43/169 (25%), Positives = 73/169 (43%), Gaps = 32/169 (18%)
Query: 977 YVHQTLHKANARLSKRGSR-TMAITPRHYLDFINHFVKLYREKCSELEEQQLHLNVGLGK 1035
Y++QTLH L + S P L+ IN F + S+ LH V L
Sbjct: 375 YIYQTLHDCEESLQQHDSTFEKDSVPPDTLNCINKFHDEFVTSMSD----DLHTPVALAA 430
Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD----------------QQEAE 1079
++E ++ + ++ + K QE + ++ AA L++ I+D QQ E
Sbjct: 431 LSEPLKTINDLLHTRKGKKQEKRLESLAA---LEKKIRDVLSVLGLMPTSYSEALQQLRE 487
Query: 1080 K----RKVQSQDIQAEIEKQTVEIAQKRV----FVMEDLAQVEPAVMDA 1120
K K+ + + +IE++TV K + +DLA V A+MD+
Sbjct: 488 KALRRAKLTEEQVLQKIEERTVARKNKEYEKSDAIRKDLAAVGIALMDS 536
>gnl|CDD|238026 cd00071, GMPK, Guanosine monophosphate kinase (GMPK, EC 2.7.4.8),
also known as guanylate kinase (GKase), catalyzes the
reversible phosphoryl transfer from adenosine
triphosphate (ATP) to guanosine monophosphate (GMP) to
yield adenosine diphosphate (ADP) and guanosine
diphosphate (GDP). It plays an essential role in the
biosynthesis of guanosine triphosphate (GTP). This
enzyme is also important for the activation of some
antiviral and anticancer agents, such as acyclovir,
ganciclovir, carbovir, and thiopurines.
Length = 137
Score = 37.1 bits (87), Expect = 0.018
Identities = 17/39 (43%), Positives = 20/39 (51%), Gaps = 5/39 (12%)
Query: 418 PLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFS-SATT 455
+VL GP G GK+ TLL L D + FS S TT
Sbjct: 1 LIVLSGPSGVGKS-TLLKRLLEEFDP---NFGFSVSHTT 35
Score = 29.4 bits (67), Expect = 7.9
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 46 LMMVGPSGSGKSTAWKVLLK 65
+++ GPSG GKST K LL+
Sbjct: 2 IVLSGPSGVGKSTLLKRLLE 21
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 38.5 bits (90), Expect = 0.019
Identities = 50/272 (18%), Positives = 102/272 (37%), Gaps = 52/272 (19%)
Query: 1052 VKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLA 1111
V + L K + + KL E+ K+++ + + + + +IE EI K + +++
Sbjct: 24 VFAALLSDKIQNQDSKLSELQKEKKNIQN---EIESLDNQIE----EIQSKIDELQKEID 76
Query: 1112 QVEPAVMDAQQAVKEIKK-----QQLVE--LRSM---ANPPSVVKLALESICLLLGENAT 1161
Q + + Q+ + E+K+ Q+L++ R+M S + + L S
Sbjct: 77 QSKAEIKKLQKEIAELKENIVERQELLKKRARAMQVNGTATSYIDVILNS---------- 126
Query: 1162 DWKAIRAVVMRENFINSIVSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPM 1221
K+ ++ R I+ IV + +++ + +K + L + E
Sbjct: 127 --KSFSDLISRVTAISVIVD-ADKKILEQQKEDK---KSLEEKQAALEDKLE-------- 172
Query: 1222 VKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYA 1281
Q L + + E +L + +A K + + +K++A A
Sbjct: 173 -TLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAALEE-QKALAE---AAA 227
Query: 1282 QLIAQATAIKTDLDNVQAKFYEYAQLIAQATA 1313
A+ A A+ E A L A ATA
Sbjct: 228 AEAAKQEAAA----KAAAQ--EQAALQAAATA 253
Score = 35.5 bits (82), Expect = 0.17
Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 7/92 (7%)
Query: 1040 VEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEI 1099
+EQ +E +KSL K L+ K E ++ Q E E + +AE +
Sbjct: 150 LEQQKEDKKSLEEKQAALEDKLE-------TLVALQNELETQLNSLNSQKAEKNALIAAL 202
Query: 1100 AQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
A K + + A +E A+ A E KQ+
Sbjct: 203 AAKEASALGEKAALEEQKALAEAAAAEAAKQE 234
Score = 32.4 bits (74), Expect = 1.7
Identities = 24/106 (22%), Positives = 37/106 (34%), Gaps = 3/106 (2%)
Query: 1021 ELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEK 1080
LEE+Q L L + ++E SL + E + A K + ++ E+
Sbjct: 159 SLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAALEE 218
Query: 1081 RKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKE 1126
+K ++ AE KQ E A A AV E
Sbjct: 219 QKALAEAAAAEAAKQEAAAKAAA---QEQAALQAAATAAQPSAVTE 261
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 39.6 bits (93), Expect = 0.020
Identities = 63/343 (18%), Positives = 123/343 (35%), Gaps = 69/343 (20%)
Query: 1014 LYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIK 1073
LY + EQ + E ++ E+ ++ + Q+ Q+ ++ NL +
Sbjct: 295 LYTSRRQLAAEQYRLV--------EMARELAELNEAESDLEQDYQAASDHLNLVQTALR- 345
Query: 1074 DQQEAEKRKVQSQD---IQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
QQE +R + + E + + VE A E + E A++ V E+K Q
Sbjct: 346 -QQEKIERYQADLEELEERLEEQNEVVEEAD------EQQEENEARAEAAEEEVDELKSQ 398
Query: 1131 ------QLVELRSMANPPSVVKLALESICLLLG------ENATDWKAIRAVVMRENFINS 1178
L ++ A ALE L G +NA DW + +E
Sbjct: 399 LADYQQALDVQQTRAIQYQQAVQALERAKQLCGLPDLTADNAEDW--LEEFQAKE----- 451
Query: 1179 IVSNFNTEMITDEVRE-----KMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQ----- 1228
+ T+E+ + S + +Y+ + + W +A+
Sbjct: 452 -------QEATEELLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRR 504
Query: 1229 -ISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEK-SIASYKDE--YAQLI 1284
+ ++++ LR+ L LE Q + + E L+ + K + DE QL
Sbjct: 505 LREQRHLAEQLQQLRMRLSELE-QRLRQQQRAER---LLAEFCKRLGKNLDDEDELEQLQ 560
Query: 1285 AQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVER 1327
+ A L ++ + A++ L+ +QA+++R
Sbjct: 561 EELEARLESLSESV------SEARERRMALRQQLEQLQARIQR 597
>gnl|CDD|222307 pfam13671, AAA_33, AAA domain. This family of domains contain only
a P-loop motif, that is characteristic of the AAA
superfamily. Many of the proteins in this family are
just short fragments so there is no Walker B motif.
Length = 143
Score = 36.9 bits (86), Expect = 0.025
Identities = 10/31 (32%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLN 449
+++ G PGSGK+ T L V+S +
Sbjct: 2 ILMVGLPGSGKS-TFARRLLRELGAVVLSSD 31
Score = 33.4 bits (77), Expect = 0.30
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 46 LMMVGPSGSGKSTAWKVLLKAL 67
++MVG GSGKST + LL+ L
Sbjct: 2 ILMVGLPGSGKSTFARRLLREL 23
Score = 30.0 bits (68), Expect = 5.6
Identities = 7/16 (43%), Positives = 14/16 (87%)
Query: 760 LLLIGVSGAGKTTLSR 775
+L++G+ G+GK+T +R
Sbjct: 2 ILMVGLPGSGKSTFAR 17
>gnl|CDD|221983 pfam13207, AAA_17, AAA domain.
Length = 114
Score = 36.1 bits (83), Expect = 0.029
Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 2/82 (2%)
Query: 761 LLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTG--ADFDEDLRTVLRRSGCKNEKIA 818
L+ G G+GK+TL++ +A G+ V + + G D +L + E +
Sbjct: 3 LITGPPGSGKSTLAKKLAEKLGIPVISLDDLLREEGLAELDDGELDDIDIDLELLEEILD 62
Query: 819 FLLDESNVLESGFLERMNTLLA 840
L + V++ + L
Sbjct: 63 ELAKQEWVIDGVRESTLELRLE 84
Score = 35.3 bits (81), Expect = 0.053
Identities = 22/82 (26%), Positives = 37/82 (45%), Gaps = 13/82 (15%)
Query: 47 MMVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTREWTDGLF 106
++ GP GSGKST L K L G +I + +E LD + D
Sbjct: 3 LITGPPGSGKST----LAKKLAEKLG----IPVISLDDLLREEGLAELDDGELDDID--- 51
Query: 107 THILRRIIDNVRGEISKRQWII 128
I +++ + E++K++W+I
Sbjct: 52 --IDLELLEEILDELAKQEWVI 71
Score = 30.7 bits (69), Expect = 1.8
Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 1/29 (3%)
Query: 420 VLCGPPGSGKTMTLLSALRALPDMEVVSL 448
++ GPPGSGK+ TL L + V+SL
Sbjct: 3 LITGPPGSGKS-TLAKKLAEKLGIPVISL 30
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of the
C-terminal region.
Length = 546
Score = 38.4 bits (89), Expect = 0.030
Identities = 31/167 (18%), Positives = 66/167 (39%), Gaps = 26/167 (15%)
Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQEL----------QSKNEAANL 1066
S++ + L L E E++EE K S+ + +++N+ L
Sbjct: 164 GDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRIL 223
Query: 1067 KLKE----MIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQ 1122
+L++ + + +QE ++ +QDI+AE+E+ E+ Q+ + + Q +
Sbjct: 224 ELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQR---LKKMTIQRRDEETERID 280
Query: 1123 AVKEIK--KQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIR 1167
E + + L L+ + + LLGE ++R
Sbjct: 281 LQLENEQLHEDLRTLQERLE-------SSQQKAGLLGEELASLGSLR 320
>gnl|CDD|215649 pfam00004, AAA, ATPase family associated with various cellular
activities (AAA). AAA family proteins often perform
chaperone-like functions that assist in the assembly,
operation, or disassembly of protein complexes.
Length = 131
Score = 36.0 bits (84), Expect = 0.037
Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 5/67 (7%)
Query: 760 LLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAH---NKYTGADFDEDLRTVLRRSGCKNEK 816
LLL G G GKTTL++ VA G +I +KY G + ++ LR + +
Sbjct: 1 LLLYGPPGTGKTTLAKAVAKELGAPFIEISGSELVSKYVG-ESEKRLRELFEAAKKLAPC 59
Query: 817 IAFLLDE 823
+ +DE
Sbjct: 60 V-IFIDE 65
Score = 35.3 bits (82), Expect = 0.067
Identities = 32/159 (20%), Positives = 54/159 (33%), Gaps = 39/159 (24%)
Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSAT------TPELLLKTFDHYCEYRKT 472
L+L GPPG+GKT + + L + ++ S + + L + F+ +
Sbjct: 1 LLLYGPPGTGKTTLAKAVAKEL-GAPFIEISGSELVSKYVGESEKRLRELFEAAKKLAPC 59
Query: 473 PNGVILSPIQLGKWLVLFCDEI-----NLPDMDKYATQRVISFLRQLIEQRGFYRPADKQ 527
V+F DEI + ++RV++ L + D
Sbjct: 60 ---------------VIFIDEIDALAGSRGSGGDSESRRVVNQLLTEL---------DGF 95
Query: 528 WVSLERIQCVGACNPPTDPGRKPLSHRFLRHVPVIYVDY 566
SL ++ + A N P L RF R I
Sbjct: 96 TSSLSKVIVIAATNRPDKLDPALLRGRFDRI---IEFPL 131
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 38.6 bits (89), Expect = 0.038
Identities = 52/277 (18%), Positives = 106/277 (38%), Gaps = 27/277 (9%)
Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLA--VKSQELQSKNEAANLKLKEMIKD 1074
E+ + EE + L K AE ++ EE++K A K E K E N E K
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKK 1737
Query: 1075 QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVE 1134
+ E +K+K + E +K+ + ++ E++ + + AV++ + ++ K++ V+
Sbjct: 1738 EAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
Query: 1135 LR---SMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDE 1191
+ N ++++ E ++ + AI+ V +N F
Sbjct: 1798 KKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKN 1857
Query: 1192 VREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQ 1251
D++ EK + +I AD ++K ++ +E +
Sbjct: 1858 NENGEDGN--KEADFNKEKDLKEDD---------EEEIEEADEIEK-----IDKDDIERE 1901
Query: 1252 ASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQAT 1288
N G+ + +L+ KDEY + A+ T
Sbjct: 1902 IPNNNMAGKNNDIIDDKLD------KDEYIKRDAEET 1932
Score = 37.0 bits (85), Expect = 0.10
Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 1/105 (0%)
Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEK 1094
K AE ++ +E +K+ K + ++K +A K + K + E K+K + +AE K
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKAD-EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK 1342
Query: 1095 QTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMA 1139
+ E A+ D A+ +A + KE K++ + A
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387
Score = 37.0 bits (85), Expect = 0.11
Identities = 26/123 (21%), Positives = 54/123 (43%), Gaps = 9/123 (7%)
Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKS---LAVKSQELQSKNEAANLKLK 1069
K+ E+ + EE++ + K AE ++ EE++K+ +K+ E K E K +
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678
Query: 1070 EMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKK 1129
E K E +++K + E + E +K+ E+ + E + + +IK
Sbjct: 1679 EAKK--AEEDEKKAAEALKKEAEEAKKAEELKKK--EAEEKKKAEE--LKKAEEENKIKA 1732
Query: 1130 QQL 1132
++
Sbjct: 1733 EEA 1735
Score = 34.3 bits (78), Expect = 0.70
Identities = 24/110 (21%), Positives = 45/110 (40%), Gaps = 10/110 (9%)
Query: 1033 LGKIAETVEQVEEMQKSL--AVKSQELQSKNE----AANLKLKEMIKDQQEAEKRKVQSQ 1086
K AE ++ +E +K A K+ E + K E A K K + + K+K +
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482
Query: 1087 DIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELR 1136
E +K+ E +K ++ + A A +A K + ++ E +
Sbjct: 1483 KKADEAKKKAEEAKKK----ADEAKKAAEAKKKADEAKKAEEAKKADEAK 1528
Score = 33.6 bits (76), Expect = 1.2
Identities = 26/103 (25%), Positives = 50/103 (48%), Gaps = 10/103 (9%)
Query: 1035 KIAETVEQVEEMQK---SLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAE 1091
K E ++ EE +K + K++E + EAA + E D+ EA + K ++ + + E
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE-AEAAADEAEAAEEKAEAAEKKKE 1374
Query: 1092 IEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVK---EIKKQQ 1131
K+ + A+K+ E+ + + A A++ K E+KK
Sbjct: 1375 EAKKKADAAKKKA---EEKKKADEAKKKAEEDKKKADELKKAA 1414
Score = 30.9 bits (69), Expect = 7.1
Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 12/103 (11%)
Query: 1033 LGKIAETVEQVEEMQK---SLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQ 1089
K AE ++ +E +K K+ EL+ K AA K E K + EK+K +
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELK-KAAAAKKKADEAKKKAE--EKKKADEAKKK 1439
Query: 1090 AEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVK--EIKKQ 1130
AE E + + A+K+ E+ + E A A++A K E KK+
Sbjct: 1440 AE-EAKKADEAKKKA---EEAKKAEEAKKKAEEAKKADEAKKK 1478
Score = 30.5 bits (68), Expect = 9.8
Identities = 22/104 (21%), Positives = 48/104 (46%), Gaps = 2/104 (1%)
Query: 1037 AETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQT 1096
A+ +++ EE +K+ K E + K + A K +E + + K+K + +A+ K+
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKKAEE--AKKADEAKKKAEEAKKKADAAKKK 1337
Query: 1097 VEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMAN 1140
E A+K + A+ +A + E +++ E + A+
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
>gnl|CDD|213195 cd03228, ABCC_MRP_Like, ATP-binding cassette domain of multidrug
resistance protein-like transporters. The MRP
(Multidrug Resistance Protein)-like transporters are
involved in drug, peptide, and lipid export. They
belong to the subfamily C of the ATP-binding cassette
(ABC) superfamily of transport proteins. The ABCC
subfamily contains transporters with a diverse
functional spectrum that includes ion transport, cell
surface receptor, and toxin secretion activities. The
MRP-like family, similar to all ABC proteins, have a
common four-domain core structure constituted by two
membrane-spanning domains, each composed of six
transmembrane (TM) helices, and two nucleotide-binding
domains (NBD). ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 171
Score = 36.6 bits (86), Expect = 0.043
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
VGPSGSGKST K+LL+ + G
Sbjct: 34 VGPSGSGKSTLLKLLLRLYDPTSG 57
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of proteins
contain a band 4.1 domain (pfam00373), at their amino
terminus. This family represents the rest of these
proteins.
Length = 244
Score = 37.4 bits (87), Expect = 0.044
Identities = 29/148 (19%), Positives = 65/148 (43%), Gaps = 6/148 (4%)
Query: 1021 ELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQS---KNEAANLKLKEMI-KDQQ 1076
ELEE+ + + + + +E+ EE L K ++ + E +L+E + ++
Sbjct: 9 ELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEE 68
Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQ--QAVKEIKKQQLVE 1134
EA + + + ++AE+++ T E+A+ + A+ + + Q E +Q+L+E
Sbjct: 69 EAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLE 128
Query: 1135 LRSMANPPSVVKLALESICLLLGENATD 1162
+ P V + L +N +
Sbjct: 129 AAAAPTAPPHVAAPVNGEQLEPDDNGEE 156
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the autophagic
process and in the vacuolar protein sorting pathway.
Apg14p may be a component specifically required for the
function of Apg6/Vps30p through the autophagic pathway.
There are 17 auto-phagosomal component proteins which are
categorized into six functional units, one of which is
the AS-PI3K complex (Vps30/Atg6 and Atg14). The AS-PI3K
complex and the Atg2-Atg18 complex are essential for
nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol 3-phosphate
(PtdIns(3)P) at the pre-autophagosomal structure (PAS).
The localisation of this complex at the PAS is controlled
by Atg14. Autophagy mediates the cellular response to
nutrient deprivation, protein aggregation, and pathogen
invasion in humans, and malfunction of autophagy has been
implicated in multiple human diseases including cancer.
This effect seems to be mediated through direct
interaction of the human Atg14 with Beclin 1 in the human
phosphatidylinositol 3-kinase class III complex.
Length = 307
Score = 37.3 bits (87), Expect = 0.048
Identities = 40/220 (18%), Positives = 80/220 (36%), Gaps = 33/220 (15%)
Query: 958 CSL--VSTTPSHRDSVINACVYVHQT----LHKANARLSKRGSRTMAITPRHYLDFINHF 1011
C + S P + +Y + L N L ++ +
Sbjct: 2 CPICHRSRRPFYCAHCARNSLYELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADL 61
Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
+KL + E Q +I++ E++E+ ++ + + L + +
Sbjct: 62 LKLEVARKKERLNQIRA------RISQLKEEIEQKRERIEELKRALAQRRSDLSSAS--- 112
Query: 1072 IKDQQEAEKRKVQSQDIQAEIEK-------QTVEIAQKRVFVMEDLAQVEPAVM------ 1118
Q ++R Q + +Q EI++ +A+KR F+ +LA++ P
Sbjct: 113 ---YQLEKRRASQLEKLQDEIKRTRSKLNALHSLLAEKRSFLCRELAKLFPLRRVIRGRK 169
Query: 1119 -DAQQAVKEIKKQQLVELRSM-ANPPSVVKLALESICLLL 1156
D+ I LV+LR + + PP V+ +L + LL
Sbjct: 170 GDSSSEPYTIGGVPLVDLRDLNSLPPEVISASLGYMAQLL 209
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 37.9 bits (89), Expect = 0.049
Identities = 24/123 (19%), Positives = 58/123 (47%), Gaps = 14/123 (11%)
Query: 1222 VKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYA 1281
+ + + + + +E + EL+ L +N+ + E+ KDL +L KS+ + + +
Sbjct: 107 INEIESLLDLIE--EDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANRFSFG 164
Query: 1282 QLIAQATAIKTDLDNVQAKFYEYAQLIA-----QATAI----KTDLDNVQAKVERSMALL 1332
+ + ++ L+N++ +F ++ +L +A I + +L ++ +E LL
Sbjct: 165 PALDE---LEKQLENLEEEFSQFVELTESGDYVEAREILDQLEEELAALEQIMEEIPELL 221
Query: 1333 KSL 1335
K L
Sbjct: 222 KEL 224
Score = 32.9 bits (76), Expect = 1.6
Identities = 21/113 (18%), Positives = 51/113 (45%), Gaps = 17/113 (15%)
Query: 1035 KIAETVEQVEEMQKSLAVKSQELQ-----SKNEAANLK-----LKEMIKDQQE-----AE 1079
+ + +E +E K L + ++ +++E +++ L+ + K E AE
Sbjct: 314 TLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAE 373
Query: 1080 KRKVQS--QDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
+ S Q+ EI KQ EI +++ + E L + ++A++ ++ + +
Sbjct: 374 QEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNK 426
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 37.3 bits (87), Expect = 0.053
Identities = 24/134 (17%), Positives = 63/134 (47%), Gaps = 19/134 (14%)
Query: 1009 NHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKL 1068
H ++ YR K ++E++ E EQVE++++ +++E+ + E ++
Sbjct: 19 THEIQKYRFKVLSIKEKE-------RLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEI 71
Query: 1069 KEMIKDQ--QEAEKRKVQS----QDIQAEIEKQTVEIAQKRVFVMEDLAQ------VEPA 1116
E +++ +E K+ ++ + +Q +IE++ E ++ ++E+ E
Sbjct: 72 VEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEKG 131
Query: 1117 VMDAQQAVKEIKKQ 1130
+ + V+E+ +Q
Sbjct: 132 REEGLEEVRELIEQ 145
Score = 31.9 bits (73), Expect = 2.6
Identities = 26/135 (19%), Positives = 53/135 (39%), Gaps = 18/135 (13%)
Query: 1020 SELEEQQLHLNVGL-GKIAETVEQVEEMQKSLAVKSQEL--QSKNEAANLKLKEMIKDQQ 1076
E +E+ L I + Q+E Q+ + + L ++K E ++ ++
Sbjct: 77 EEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEKGREEGL 136
Query: 1077 E------------AEKRKVQSQDIQAEIEKQTVEIA---QKRVFVMEDLAQVEPAVMDAQ 1121
E EK + ++I E++ VE+A K+V E A+ +
Sbjct: 137 EEVRELIEQLNKIIEKLIKKREEILESSEEEIVELALDIAKKVIKEISENSKEIALALVK 196
Query: 1122 QAVKEIKKQQLVELR 1136
+ +KE+K + +R
Sbjct: 197 ELLKEVKDATDITIR 211
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 35.6 bits (83), Expect = 0.055
Identities = 19/102 (18%), Positives = 47/102 (46%), Gaps = 6/102 (5%)
Query: 1035 KIAETVEQ-VEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQS--QDIQAE 1091
+ +E+ ++ Q L +ELQ E + + +E +++++Q Q+ Q +
Sbjct: 18 AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRK 77
Query: 1092 IEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLV 1133
+K ++ +++ E+L ++ + A + V + K LV
Sbjct: 78 QQKLQQDLQKRQ---QEELQKILDKINKAIKEVAKKKGYDLV 116
>gnl|CDD|221955 pfam13173, AAA_14, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily.
Length = 127
Score = 35.2 bits (82), Expect = 0.056
Identities = 25/111 (22%), Positives = 36/111 (32%), Gaps = 24/111 (21%)
Query: 415 EHKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPN 474
+K +V+ GP GKT LL L+ L ++ +N L L P
Sbjct: 1 NNKIIVITGPRQVGKTTLLLQFLKELLSENILYINLDDPRFLRLALLDLLEEFLKLLLPG 60
Query: 475 GVILSPIQLGKWLVLFCDEINLPDMDKYATQRV---ISFLRQLIEQRGFYR 522
LF DEI Q+V L++L + R
Sbjct: 61 KKY-----------LFLDEI----------QKVPDWEDALKRLYDDGRNLR 90
>gnl|CDD|236694 PRK10436, PRK10436, hypothetical protein; Provisional.
Length = 462
Score = 37.2 bits (87), Expect = 0.079
Identities = 13/18 (72%), Positives = 15/18 (83%)
Query: 423 GPPGSGKTMTLLSALRAL 440
GP GSGKT+TL SAL+ L
Sbjct: 225 GPTGSGKTVTLYSALQTL 242
>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
Length = 174
Score = 35.7 bits (83), Expect = 0.081
Identities = 30/106 (28%), Positives = 46/106 (43%), Gaps = 22/106 (20%)
Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQD----IQA 1090
I + +++ EE + A E Q K A QQEAE+ + ++ I+A
Sbjct: 54 AILQALKEAEERLRQAAQALAEAQQKLAQA----------QQEAERIRADAKARAEAIRA 103
Query: 1091 EIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQA-VKEIKKQQLVEL 1135
EIEKQ +E M L Q A + A+Q V +++ EL
Sbjct: 104 EIEKQAIE-------DMARLKQTAAADLSAEQERVIAQLRREAAEL 142
>gnl|CDD|129465 TIGR00368, TIGR00368, Mg chelatase-related protein. The N-terminal
end matches very strongly a pfam Mg_chelatase domain
[Unknown function, General].
Length = 499
Score = 37.1 bits (86), Expect = 0.085
Identities = 40/154 (25%), Positives = 55/154 (35%), Gaps = 23/154 (14%)
Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYR--KTPN-- 474
L+L GPPGSGKTM LP + ++ + + R ++P+
Sbjct: 214 LLLFGPPGSGKTMLASRLQGILPPLTNEEAIETARIWSLVGKLIDRKQIKQRPFRSPHHS 273
Query: 475 ---------GVILSP--IQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQRGFYRP 523
G I P I L VLF DE LP+ + V+ LR+ IE
Sbjct: 274 ASKPALVGGGPIPLPGEISLAHNGVLFLDE--LPEFKR----SVLDALREPIEDGSISIS 327
Query: 524 ADKQWVSL-ERIQCVGACNP-PTDPGRKPLSHRF 555
+ R Q V A NP P +H
Sbjct: 328 RASAKIFYPARFQLVAAMNPCPCGHYGGKNTHCR 361
>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the AAA
superfamily.
Length = 154
Score = 35.3 bits (81), Expect = 0.085
Identities = 19/70 (27%), Positives = 27/70 (38%), Gaps = 5/70 (7%)
Query: 740 DEVLDHVLR-IDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGAD 798
+E L+ +L + R +LL G SG GKT+L R + GL V +
Sbjct: 6 EEELERLLDALRRARSGGPPSVLLTGPSGTGKTSLLR--ELLEGLLV--AAGKCDQAERN 61
Query: 799 FDEDLRTVLR 808
LR
Sbjct: 62 PPYAFSQALR 71
>gnl|CDD|227900 COG5613, COG5613, Uncharacterized conserved protein [Function
unknown].
Length = 400
Score = 36.9 bits (85), Expect = 0.086
Identities = 19/107 (17%), Positives = 37/107 (34%), Gaps = 4/107 (3%)
Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
Y + + QQ + L K+ VE +++ K L + + +++
Sbjct: 99 SYPYLSQVNLASPQQFEIE--LAKLLSLVESAQKVSKLLNSNEIYMDQNTKKMENNQQKI 156
Query: 1072 IKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVM 1118
K QE E +K + ++I V ++ A V A
Sbjct: 157 KK--QEEEAKKAEKSGQASKIFGWVSAFGSLIVALIMVAAGVGAAAG 201
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile protein
found in all eukaryote cell types. This family consists
of the coiled-coil myosin heavy chain tail region. The
coiled-coil is composed of the tail from two molecules of
myosin. These can then assemble into the macromolecular
thick filament. The coiled-coil region provides the
structural backbone the thick filament.
Length = 859
Score = 37.3 bits (87), Expect = 0.090
Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 11/89 (12%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVE-------QVEEMQKSLAVKSQELQSKNEAANLKL 1068
+ ELEE + LN + ++ E E +E+ + L + ++LQ + E AN
Sbjct: 313 ALRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAA 372
Query: 1069 KEMIKDQQEAEK----RKVQSQDIQAEIE 1093
E+ K Q+ +K K + ++QAE++
Sbjct: 373 SELEKKQKNFDKILAEWKRKVDELQAELD 401
Score = 35.0 bits (81), Expect = 0.36
Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 7/67 (10%)
Query: 1037 AETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQ--DIQAEIEK 1094
A E++EE++K L K EL+ EAAN K D E K ++QS+ D+Q E+E+
Sbjct: 313 ALRAEELEELKKKLNQKISELEEAAEAANAKC-----DSLEKTKSRLQSELEDLQIELER 367
Query: 1095 QTVEIAQ 1101
++
Sbjct: 368 ANAAASE 374
Score = 32.7 bits (75), Expect = 2.0
Identities = 54/308 (17%), Positives = 119/308 (38%), Gaps = 62/308 (20%)
Query: 1033 LGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM--IKDQQEAEKRKVQSQDIQA 1090
L ++ + VE + K+L + +L + + E+ + + EAEK ++Q+ +A
Sbjct: 421 LEELKDQVEALRRENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEA 480
Query: 1091 EIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALE 1150
E L E V+ AQ + +I+ + +E R
Sbjct: 481 E----------------AALELEESKVLRAQVELSQIRSE--IERR-------------- 508
Query: 1151 SICLLLGENATDWKAIRAVVMRENFINSIVSNFNTE--MITDEVR--EKMHSRYLSNPDY 1206
L E +++ R R I S+ + E + R +K+ ++ +
Sbjct: 509 -----LAEKEEEFENTRKNHQRA--IESLQATLEAEAKGKAEASRLKKKLEGD-INELEI 560
Query: 1207 SYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLI 1266
+ + AN+A+ A+ K V+ + ++K L+ Q E + E+ ++ +
Sbjct: 561 ALDHANKAN----------------AEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQL 604
Query: 1267 TQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVE 1326
E+ + + E +L + + + + E ++ + + TA + L + K+E
Sbjct: 605 AVAERRATALEAELEELRSALEQAERARKQAETELAEASERVNELTAQNSSLIAQKRKLE 664
Query: 1327 RSMALLKS 1334
+A L+S
Sbjct: 665 GELAALQS 672
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have LPXTG
C-terminal anchoring motifs and a substantial number have
the KxYKxGKxW putative sorting signal at the N-terminus.
The tetracycline resistance plasmid pCF10 in Enterococcus
faecalis promotes conjugal plasmid transfer in response
to sex pheromones, but PgrA/Sec10 encoded by that
plasmid, a member of this family, specifically inhibits
the ability of cells to receive homologous plasmids. The
phenomenon is called surface exclusion.
Length = 356
Score = 36.6 bits (85), Expect = 0.090
Identities = 13/65 (20%), Positives = 25/65 (38%), Gaps = 3/65 (4%)
Query: 1262 TKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNV 1321
T + + L+ ++A+ + + A + L Q YA A + +L N
Sbjct: 252 TPNSLKALQAALATAQADLAAAQTALNTAQAALTTAQT---AYAAAQAALATAQKELANA 308
Query: 1322 QAKVE 1326
QA+
Sbjct: 309 QAQAL 313
>gnl|CDD|237355 PRK13342, PRK13342, recombination factor protein RarA; Reviewed.
Length = 413
Score = 36.6 bits (86), Expect = 0.092
Identities = 21/90 (23%), Positives = 35/90 (38%), Gaps = 15/90 (16%)
Query: 407 SLLYTWLAEHKP--LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFD 464
L + + ++L GPPG+GKT TL + D +L+ ++ + L + +
Sbjct: 25 KPLRRMIEAGRLSSMILWGPPGTGKT-TLARIIAGATDAPFEALSAVTSGVKD-LREVIE 82
Query: 465 HYCEYRKTPNGVILSPIQLGKWLVLFCDEI 494
+ R I LF DEI
Sbjct: 83 EARQRRSAGRRTI-----------LFIDEI 101
Score = 31.6 bits (73), Expect = 3.9
Identities = 28/104 (26%), Positives = 41/104 (39%), Gaps = 33/104 (31%)
Query: 739 FDEVL--DHVLRIDRIFRQP--QGHL---LLIGVSGAGKTTLSRFVAFMNGLSVFQIRAH 791
DEV+ +H+L + R+ G L +L G G GKTTL+R +A
Sbjct: 11 LDEVVGQEHLLGPGKPLRRMIEAGRLSSMILWGPPGTGKTTLARIIA------------- 57
Query: 792 NKYTGADFDE---------DLRTVL---RRSGCKNEKIAFLLDE 823
T A F+ DLR V+ R+ + +DE
Sbjct: 58 -GATDAPFEALSAVTSGVKDLREVIEEARQRRSAGRRTILFIDE 100
>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit. This
model represents the MFP (membrane fusion protein)
component of the RND family of transporters. RND refers
to Resistance, Nodulation, and cell Division. It is, in
part, a subfamily of pfam00529 (Pfam release 7.5) but
hits substantial numbers of proteins missed by that
model. The related HlyD secretion protein, for which
pfam00529 is named, is outside the scope of this model.
Attributed functions imply outward transport. These
functions include nodulation, acriflavin resistance,
heavy metal efflux, and multidrug resistance proteins.
Most members of this family are found in Gram-negative
bacteria. The proposed function of MFP proteins is to
bring the inner and outer membranes together and enable
transport to the outside of the outer membrane. Note,
however, that a few members of this family are found in
Gram-positive bacteria, where there is no outer membrane
[Transport and binding proteins, Unknown substrate].
Length = 322
Score = 36.5 bits (85), Expect = 0.092
Identities = 20/97 (20%), Positives = 37/97 (38%), Gaps = 6/97 (6%)
Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKT 1292
+KK + L L + Q + A + QLE + S+ + +L+ + +
Sbjct: 46 QKVKKGQVL-ARLDDDDYQLALQAALAQ-LAAAEAQLELAQRSF-ERAERLVKRNAVSQA 102
Query: 1293 DLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSM 1329
DLD+ +A A A K L + Q + +
Sbjct: 103 DLDDAKA---AVEAAQADLEAAKASLASAQLNLRYTE 136
Score = 33.8 bits (78), Expect = 0.59
Identities = 20/91 (21%), Positives = 41/91 (45%), Gaps = 6/91 (6%)
Query: 1245 LKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEY 1304
+ + V+ + KG ++ +L+ Y+ +AQ A + L+ Q F
Sbjct: 37 ITKISVREGQKVKKG----QVLARLDD--DDYQLALQAALAQLAAAEAQLELAQRSFERA 90
Query: 1305 AQLIAQATAIKTDLDNVQAKVERSMALLKSL 1335
+L+ + + DLD+ +A VE + A L++
Sbjct: 91 ERLVKRNAVSQADLDDAKAAVEAAQADLEAA 121
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a number
of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and mature
spermatozoa, together with the detection of Gas8 also in
cilia at the apical surfaces of epithelial cells lining
the pulmonary bronchi and Fallopian tubes suggests that
the Gas8 protein may have a role in the functioning of
motile cellular appendages. Gas8 is a microtubule-binding
protein localised to regions of dynein regulation in
mammalian cells.
Length = 201
Score = 36.0 bits (84), Expect = 0.093
Identities = 26/118 (22%), Positives = 48/118 (40%), Gaps = 18/118 (15%)
Query: 1235 LKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDL 1294
++VE LR +LK ++K + K + +LEK + + K E L + ++ +
Sbjct: 68 EEEVEELRKKLK----DYEKDKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERER 123
Query: 1295 DNVQAKF----YEYAQLIAQATAIKTDLDNV--QAKVERSMALLKSLGIERERWEATS 1346
D + KF + Q KT L N+ + K+E L+ + A +
Sbjct: 124 DELYDKFEAAIQDVQQ--------KTGLKNLLLEQKLEALNEELEKKEAQLNEVLAAA 173
>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory protein.
This family represents an accessory protein that works
with the bacteriocin maturation and ABC transport
secretion protein described by TIGR01193 [Transport and
binding proteins, Other].
Length = 457
Score = 36.6 bits (85), Expect = 0.10
Identities = 22/129 (17%), Positives = 46/129 (35%), Gaps = 5/129 (3%)
Query: 1230 SYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATA 1289
+ L +VE L E + ++ E+TK + Q +Y Q + A +
Sbjct: 141 LFNGYLAQVESLTSETQQQNDKSQTQNEAAEKTKAQLDQQISKTDQKLQDY-QALKNAIS 199
Query: 1290 IKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETF 1349
T + N Y AQ + Q K +L ++ + ++ + + ++
Sbjct: 200 NGTKVANFNPYQSLYENYQAQLKSASDKDQKNQVKS----TILATIQQQIDQLQKSIASY 255
Query: 1350 RSQMATIIG 1358
+ Q A +
Sbjct: 256 QVQKAGLTK 264
Score = 36.2 bits (84), Expect = 0.14
Identities = 25/96 (26%), Positives = 43/96 (44%), Gaps = 4/96 (4%)
Query: 1242 RLELKSLEVQASENKAKGEETKDL---ITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQ 1298
+ +LKS + +N+ K + I QL+KSIASY+ + A L + N +
Sbjct: 218 QAQLKSASDKDQKNQVKSTILATIQQQIDQLQKSIASYQVQKAGLTKSTASNYASSQNSK 277
Query: 1299 AKFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKS 1334
+ Q +A+ TDL+ ++E + LK
Sbjct: 278 LAQLK-EQQLAKVKQEITDLNQKLLELESKIKSLKE 312
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 36.3 bits (85), Expect = 0.13
Identities = 31/132 (23%), Positives = 63/132 (47%), Gaps = 21/132 (15%)
Query: 1004 YLDFINHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEA 1063
N F K RE+ +EL++ + + + Q EE +L K + L+ + E
Sbjct: 66 IHKLRNEFEKELRERRNELQKLE-----------KRLLQKEE---NLDRKLELLEKREEE 111
Query: 1064 ANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA-----QKRVFVMEDLAQVEPAVM 1118
K KE+ + QQE EK++ + +++ E ++ I+ + + ++E + E A
Sbjct: 112 LEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKV--EEEARH 169
Query: 1119 DAQQAVKEIKKQ 1130
+A +KEI+++
Sbjct: 170 EAAVLIKEIEEE 181
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA binding.
This is the middle region of a family of TATA element
modulatory factor 1 proteins conserved in eukaryotes that
contains at its N-terminal section a number of leucine
zippers that could potentially form coiled coil
structures. The whole proteins bind to the TATA element
of some RNA polymerase II promoters and repress their
activity. by competing with the binding of TATA binding
protein. TMFs are evolutionarily conserved golgins that
bind Rab6, a ubiquitous ras-like GTP-binding Golgi
protein, and contribute to Golgi organisation in animal
and plant cells.
Length = 74
Score = 33.1 bits (76), Expect = 0.13
Identities = 15/69 (21%), Positives = 33/69 (47%), Gaps = 7/69 (10%)
Query: 1033 LGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEI 1092
K+AE EQ+ ++ + ++L K N +K++ +E EK + +++ ++
Sbjct: 4 EKKLAEKDEQIAQLMEEG----EKLSKKELKHNNTIKKLRAKNKELEK---EIAELKKKL 56
Query: 1093 EKQTVEIAQ 1101
EK E+
Sbjct: 57 EKLEKELEN 65
>gnl|CDD|223703 COG0630, VirB11, Type IV secretory pathway, VirB11 components, and
related ATPases involved in archaeal flagella
biosynthesis [Cell motility and secretion /
Intracellular trafficking and secretion].
Length = 312
Score = 36.2 bits (84), Expect = 0.13
Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 401 DTVRHESLLYTWLA--EHKPLVLCGPPGSGKTMTLLSALRALP 441
T+ E Y WLA K +++CG SGKT L + L +P
Sbjct: 126 GTISPEQAAYLWLAIEARKSIIICGGTASGKTTLLNALLDFIP 168
>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
Length = 147
Score = 34.7 bits (81), Expect = 0.14
Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 2/42 (4%)
Query: 1063 AANLKLKEMIKDQQEA--EKRKVQSQDIQAEIEKQTVEIAQK 1102
NLK E + + EA + +++ + ++E TV I K
Sbjct: 41 KGNLKQLEARRAELEAKAAEELAEAEALAEKLEGLTVTIKAK 82
Score = 31.6 bits (73), Expect = 1.6
Identities = 20/85 (23%), Positives = 36/85 (42%), Gaps = 16/85 (18%)
Query: 1032 GLGKIA--ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRK------- 1082
G A ++Q+E + L K+ E EA L E ++ K K
Sbjct: 34 GKAVRATKGNLKQLEARRAELEAKAAEE--LAEA--EALAEKLEGLTVTIKAKAGEDGKL 89
Query: 1083 ---VQSQDIQAEIEKQTVEIAQKRV 1104
V ++DI ++KQ +EI ++++
Sbjct: 90 FGSVTTKDIAEALKKQGIEIDKRKI 114
>gnl|CDD|131368 TIGR02315, ABC_phnC, phosphonate ABC transporter, ATP-binding
protein. Phosphonates are a class of
phosphorus-containing organic compound with a stable
direct C-P bond rather than a C-O-P linkage. A number of
bacterial species have operons, typically about 14 genes
in size, with genes for ATP-dependent transport of
phosphonates, degradation, and regulation of the
expression of the system. Members of this protein family
are the ATP-binding cassette component of tripartite ABC
transporters of phosphonates [Transport and binding
proteins, Anions].
Length = 243
Score = 35.4 bits (82), Expect = 0.15
Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 4/80 (5%)
Query: 762 LIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGCKNEKIAFLL 821
+IG SGAGK+TL R + + S I + LR + RR G + L+
Sbjct: 33 IIGPSGAGKSTLLRCINRLVEPSSGSILLEGTDITKLRGKKLRKLRRRIGMIFQHYN-LI 91
Query: 822 DESNVLE---SGFLERMNTL 838
+ VLE G L T
Sbjct: 92 ERLTVLENVLHGRLGYKPTW 111
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 36.1 bits (84), Expect = 0.16
Identities = 21/46 (45%), Positives = 24/46 (52%), Gaps = 9/46 (19%)
Query: 411 TWLA--EHKPLVLCGPPGSGKTMTLLSALRALP-DM--EVVSLNFS 451
+WL K L+L GPPG GKT S AL D EV+ LN S
Sbjct: 32 SWLKGKPKKALLLYGPPGVGKT----SLAHALANDYGWEVIELNAS 73
>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
recombination, and repair].
Length = 254
Score = 35.5 bits (82), Expect = 0.16
Identities = 32/115 (27%), Positives = 44/115 (38%), Gaps = 21/115 (18%)
Query: 409 LYTWLAEHKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSS--ATTPEL---LLKTF 463
L + + LVL GPPG GKT L A+ E++ S T P+L L F
Sbjct: 98 LVEFFERGENLVLLGPPGVGKT-HLAIAI----GNELLKAGISVLFITAPDLLSKLKAAF 152
Query: 464 DHYCEYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQR 518
D +L +L K +L D+I + L QLI +R
Sbjct: 153 D-----EGRLEEKLLR--ELKKVDLLIIDDIGYEPFSQEEA----DLLFQLISRR 196
>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO. This family
contains the bacterial type III secretion protein YscO,
which is approximately 150 residues long. YscO has been
shown to be required for high-level expression and
secretion of the anti-host proteins V antigen and Yops in
Yersinia pestis.
Length = 152
Score = 34.3 bits (79), Expect = 0.17
Identities = 20/95 (21%), Positives = 38/95 (40%), Gaps = 11/95 (11%)
Query: 1005 LDFINHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAA 1064
L+ V L RE + LE+Q L + + +E + + + QE + E
Sbjct: 69 LEKWQQQVGLLRENEASLEQQ-------LAEAKQRLEAERQRLRQARQQLQEARKAQE-- 119
Query: 1065 NLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEI 1099
K E+ + +Q + + Q + + E +TV
Sbjct: 120 --KFAELARQEQAEAQAQRQYLEELEQEEFRTVSR 152
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 35.2 bits (81), Expect = 0.18
Identities = 35/144 (24%), Positives = 63/144 (43%), Gaps = 33/144 (22%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEA------ANL-KL 1068
EK ELEE++ E + +M + + +E +++ +A N+ K+
Sbjct: 84 AEKLEELEEEER----------EWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKM 133
Query: 1069 KEMIKDQQEAEKRKVQSQDIQAEIEKQT--VEIAQKRV-FVM--------EDLAQVEPAV 1117
+MI D A+KRK + Q +A E++ V A++ + + E L Q E
Sbjct: 134 PQMIAD-WRAQKRK-REQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKE- 190
Query: 1118 MDAQQAVKEIKKQQLVELRSMANP 1141
++ VKE K+++ E R A
Sbjct: 191 --EKKKVKEAKRREKEEKRMAALV 212
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
the myosin-like domain [Function unknown].
Length = 499
Score = 35.8 bits (82), Expect = 0.19
Identities = 25/160 (15%), Positives = 56/160 (35%), Gaps = 21/160 (13%)
Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAE 1079
+ L +Q L L +AE Q+E +SL ++LQ+ ++ ++ + E
Sbjct: 140 ARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIE 199
Query: 1080 K---------------------RKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVM 1118
+ R +Q I+++ +I+QK + Q+
Sbjct: 200 QEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERER 259
Query: 1119 DAQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGE 1158
Q+ + + + A + V+L ++ G+
Sbjct: 260 QLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAATQRGQ 299
Score = 35.0 bits (80), Expect = 0.37
Identities = 25/125 (20%), Positives = 54/125 (43%), Gaps = 5/125 (4%)
Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQ 1076
+ L+ Q L ++ V ++ + ++Q L ++ AA + +E+ +
Sbjct: 165 AQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAA 224
Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQKRVFVME---DLAQVEPAVMDAQQAVKEIKK--QQ 1131
A++ Q A+I ++ +IA + + E L ++E A +Q V +++ Q
Sbjct: 225 AAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEAYYQA 284
Query: 1132 LVELR 1136
V LR
Sbjct: 285 YVRLR 289
Score = 33.8 bits (77), Expect = 0.81
Identities = 16/99 (16%), Positives = 39/99 (39%), Gaps = 6/99 (6%)
Query: 1038 ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTV 1097
V Q+++++ L EL + E + E +K + + + + ++
Sbjct: 71 SGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQERE----AVRQELA 126
Query: 1098 EIAQKRVFVMEDLAQVEPAVMDAQQAVKEI--KKQQLVE 1134
Q ++LA++ D Q +K + +++QL
Sbjct: 127 AARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEA 165
Score = 30.8 bits (69), Expect = 6.9
Identities = 30/153 (19%), Positives = 61/153 (39%), Gaps = 37/153 (24%)
Query: 1233 DMLKKVEPLRLELKSLEVQ-----------ASENKAKGEETKDLITQLE---KSIASYKD 1278
D+ ++ LR EL + + + SE + +E + + +L +++A +
Sbjct: 78 DIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQ 137
Query: 1279 EYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKT-----------------DLDNV 1321
E A+L QA ++T L + + QL AQA +++ DL
Sbjct: 138 ELARLTKQAQDLQTRLKTLAE---QRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLR 194
Query: 1322 QAKVERSMALLKSLGIERERWEATSETFRSQMA 1354
A++E+ ++L +A +E + A
Sbjct: 195 SAQIEQEA---QNLATRANAAQARTEELARRAA 224
>gnl|CDD|233914 TIGR02538, type_IV_pilB, type IV-A pilus assembly ATPase PilB.
This model describes a protein of type IV pilus
biogenesis designated PilB in Pseudomonas aeruginosa but
PilF in Neisseria gonorrhoeae; the more common usage,
reflected here, is PilB. This protein is an ATPase
involved in protein export for pilin assembly and is
closely related to GspE (TIGR02533) of type II
secretion, also called the main terminal branch of the
general secretion pathway. Note that type IV pilus
systems are often divided into type IV-A and IV-B, with
the latter group including bundle-forming pilus,
mannose-sensitive hemagglutinin, etc. Members of this
family are found in type IV-A systems [Cell envelope,
Surface structures, Protein fate, Protein and peptide
secretion and trafficking].
Length = 564
Score = 36.1 bits (84), Expect = 0.19
Identities = 23/46 (50%), Positives = 28/46 (60%), Gaps = 6/46 (13%)
Query: 414 AEHKP--LVL-CGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTP 456
A HKP +VL GP GSGKT++L +AL L EV N S+A P
Sbjct: 311 AIHKPQGMVLVTGPTGSGKTVSLYTALNILNTEEV---NISTAEDP 353
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino acids
in length. This family contains a P-loop motif suggesting
it is a nucleotide binding protein. It may be involved in
replication.
Length = 1198
Score = 36.2 bits (84), Expect = 0.19
Identities = 62/301 (20%), Positives = 105/301 (34%), Gaps = 72/301 (23%)
Query: 1016 REKCSELEEQ--QLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIK 1073
K + EEQ Q + + K AE + Q L + Q LQ++ ++ KL+ I
Sbjct: 620 VAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDL--QRLQNEQQSLKDKLELAIA 677
Query: 1074 DQQEAEKRKVQSQDIQAE-IEKQ---------------TVEIAQKRVFVMEDL------- 1110
++++ + +++ D Q + + +Q E K V +L
Sbjct: 678 ERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQLAQL 737
Query: 1111 -AQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAV 1169
A +E A A+ +KE+KKQ EL S+ P+ VK
Sbjct: 738 SAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVK--------------------ELK 777
Query: 1170 VMRENFINSIVSNFNTEMITDEVREK---MHSRYLSNPDYSYEKANRASMACGPMVKWAI 1226
E +I + EVRE M +L E+ N
Sbjct: 778 RQIEELETTIE---RIAVRRPEVREYRAFMQETWLHRDSLREERPNL------------- 821
Query: 1227 AQISYADMLKKVEPLRLELKSLEVQASENKAKGEE----TKDLITQLEKSIASYKDEYAQ 1282
I ++ E L+ EL L + K E+ + + QL++ + +DE Q
Sbjct: 822 -AIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDEMRQ 880
Query: 1283 L 1283
L
Sbjct: 881 L 881
>gnl|CDD|222000 pfam13238, AAA_18, AAA domain.
Length = 128
Score = 34.0 bits (78), Expect = 0.20
Identities = 13/48 (27%), Positives = 21/48 (43%)
Query: 761 LLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLR 808
L+ G G+GKTTL++ +A G + + N +E R
Sbjct: 2 LITGTPGSGKTTLAKELAERLGDVLRDLAKENGLVLELDEEITDESKR 49
Score = 28.9 bits (65), Expect = 9.5
Identities = 19/86 (22%), Positives = 32/86 (37%), Gaps = 13/86 (15%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTREWTDGLFTH 108
G GSGK+T K L + L + + L LD + + L
Sbjct: 4 TGTPGSGKTTLAKELAERL----------GDVLRDLAKENGLVLELDEEITDESKRLDED 53
Query: 109 ILRRIIDNVRGEISKR---QWIIFDG 131
L +++D + I + + +I DG
Sbjct: 54 KLAKLLDKLEKIIEELAEGENVIIDG 79
>gnl|CDD|236982 PRK11784, PRK11784, tRNA 2-selenouridine synthase; Provisional.
Length = 345
Score = 35.6 bits (83), Expect = 0.21
Identities = 14/32 (43%), Positives = 16/32 (50%), Gaps = 1/32 (3%)
Query: 409 LYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL 440
L A+ +VL G GSGKT LL AL
Sbjct: 134 LEEAPAQFPLVVLGGNTGSGKT-ELLQALANA 164
>gnl|CDD|151277 pfam10828, DUF2570, Protein of unknown function (DUF2570). This is a
family of proteins with unknown function.
Length = 110
Score = 33.3 bits (76), Expect = 0.23
Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 10/70 (14%)
Query: 1062 EAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQ 1121
A N + I+ QQ+A +R E E+Q VE QK ++E +
Sbjct: 31 RAENKAQAQTIEQQQKANQRLT----DALEQERQAVEEQQKIA------NEIERKAEENA 80
Query: 1122 QAVKEIKKQQ 1131
+ VK I KQQ
Sbjct: 81 EEVKTILKQQ 90
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 35.8 bits (83), Expect = 0.24
Identities = 29/121 (23%), Positives = 63/121 (52%), Gaps = 6/121 (4%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
+E+ ELE++ L K+ E + ++EE + L + +EL+ K + +LKE ++
Sbjct: 237 KEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK-ELKEKAEEY 295
Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
+ + + D EIEK+ + ++ + E + ++E + ++ ++E+KK +L EL
Sbjct: 296 IKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELE----EKEERLEELKK-KLKEL 350
Query: 1136 R 1136
Sbjct: 351 E 351
Score = 33.9 bits (78), Expect = 0.87
Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQ 1076
EK LEE + L ++ E E+ E +++ A K + + K L +++ K+ +
Sbjct: 335 EKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELE 394
Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQ 1101
E EK K ++I+ EI K T I +
Sbjct: 395 ELEKAK---EEIEEEISKITARIGE 416
Score = 31.6 bits (72), Expect = 4.8
Identities = 57/317 (17%), Positives = 130/317 (41%), Gaps = 61/317 (19%)
Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL---KL 1068
+K E+ +L E L +I + + ++EE + + +EL+ K E KL
Sbjct: 288 LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKL 347
Query: 1069 KEMIKDQQEAEKRKVQSQDIQA------------------EIEKQTVEIAQKRVFVMEDL 1110
KE+ K +E E+R ++ +A ++EK+ E+ + + + E++
Sbjct: 348 KELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407
Query: 1111 AQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVV 1170
+++ + + ++ +KE+KK + EL+ + C + G T+
Sbjct: 408 SKITARIGELKKEIKELKK-AIEELKKA-----------KGKCPVCGRELTEEH------ 449
Query: 1171 MRENFINSI---VSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIA 1227
R+ + + E+ E +E+ + L + +K +
Sbjct: 450 -RKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESEL------------- 495
Query: 1228 QISYADMLKKVEPLRLELKSLEVQASENKAKG-EETKDLITQLEKSIASYKDEY---AQL 1283
I ++ ++++ L +LK ++ E KA+ E+ K+ + +L+ I S K E +L
Sbjct: 496 -IKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEEL 554
Query: 1284 IAQATAIKTDLDNVQAK 1300
+ ++ LD ++ +
Sbjct: 555 KKKLAELEKKLDELEEE 571
>gnl|CDD|222005 pfam13245, AAA_19, Part of AAA domain.
Length = 73
Score = 32.3 bits (74), Expect = 0.24
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 414 AEHKPLVLCGPPGSGKTMTLLSALRAL 440
+ V+ G PG+GKT T + + L
Sbjct: 8 SGRSLFVVDGGPGTGKTATAAAIIARL 34
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 35.0 bits (81), Expect = 0.26
Identities = 54/333 (16%), Positives = 121/333 (36%), Gaps = 78/333 (23%)
Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQ 1076
+L+E +L ++ E +E+++E + L ++ EL K + N K++E+ + Q
Sbjct: 3 AMLDKLDELELKRK----QLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQ 58
Query: 1077 E-AEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
E E+R EI ++ E+ +KR + L + +++ EL
Sbjct: 59 ELREERD--------EINEEVQELKEKRDEINAKL---------------QELRKEYREL 95
Query: 1136 RSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREK 1195
+ N ++ +++S+ E I + T ++T E +
Sbjct: 96 KEKRNEFNLGGRSIKSL--------------------EREIERLEKKQQTSVLTPEEERE 135
Query: 1196 MHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKV-EPLRLELKSLEVQASE 1254
+ + I ++ K +LK L+ + E
Sbjct: 136 LVQK--------------------------IKELRKELEDAKKALEENEKLKELKAEIDE 169
Query: 1255 NKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAI 1314
K K E + I +L Y +E +L +A ++ + D + +F E + +
Sbjct: 170 LKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVEL---SKKIDEL 226
Query: 1315 KTDLDNVQAKVERSMALLKSLGIERERWEATSE 1347
+ N+Q ++ +K+L + + + +
Sbjct: 227 HEEFRNLQNELRELEKKIKALRAKEKAAKRREK 259
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome partitioning].
Length = 1480
Score = 35.6 bits (82), Expect = 0.26
Identities = 77/439 (17%), Positives = 157/439 (35%), Gaps = 103/439 (23%)
Query: 1014 LYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIK 1073
LY + EQ H+++ + + ++ + L + Q+ ++ NL + +
Sbjct: 295 LYTSRQQLAAEQYRHVDM-----SRELAELNGAEGDL---EADYQAASDHLNLVQTAL-R 345
Query: 1074 DQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPA----------VMDAQQA 1123
Q++ E+ + +++ +E+Q + + E+ A+ E A + D QQA
Sbjct: 346 QQEKIERYQADLEELTIRLEEQNEVVEEANERQEENEARAEAAELEVDELKSQLADYQQA 405
Query: 1124 V-----KEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINS 1178
+ + I+ QQ + A L + L ++A +W E
Sbjct: 406 LDVQQTRAIQYQQAIAALERAK-------ELCHLPDLTADSAEEWLETFQAKEEE----- 453
Query: 1179 IVSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVK---WAIAQ------I 1229
++++ E + M S + +Y+ + G + + W +A+
Sbjct: 454 ----ATEKLLSLEQKMSMAQAAHSQFEQAYQLVVAIA---GELARSEAWDVARELLREGP 506
Query: 1230 SYADMLKKVEPLRLELKSLEVQASE-----------NKAKG-----EETKDLITQLEKSI 1273
+ ++V+PLR+ L LE + + K +G EE + L +LE I
Sbjct: 507 DQRHLAEQVQPLRMRLSELEQRLRQQQSAERLLADFCKRQGKNLDAEELEALHQELEALI 566
Query: 1274 ASYKDEYAQLIAQATAIKTDLDNVQAKFYEYA--------------QLIAQATAIKTDLD 1319
S D + Q A++ + + +Q++ QL Q+ TD
Sbjct: 567 ESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQRAPVWLAAQNALEQLSEQSGEEFTDSQ 626
Query: 1320 NVQAKVERSMALLKSLGIERERWEATSETFRSQ-----------------MATIIGDVLL 1362
+V +++ + + +ER+ A + +A G VLL
Sbjct: 627 DVTEYMQQLLEREREATVERDELGARKNALDEEIERLSQPGGSEDQRLNALAERFGGVLL 686
Query: 1363 SSAY----LAYAGYFDQHY 1377
S Y + A YF Y
Sbjct: 687 SEIYDDVTIEDAPYFSALY 705
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 35.3 bits (81), Expect = 0.27
Identities = 17/86 (19%), Positives = 44/86 (51%), Gaps = 1/86 (1%)
Query: 1044 EEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA-QK 1102
++ +K ++EL+ K A +LK++ K++ +A++++ Q+++ + + + + + Q
Sbjct: 79 QQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQA 138
Query: 1103 RVFVMEDLAQVEPAVMDAQQAVKEIK 1128
R E + E A A ++K
Sbjct: 139 RKAAAEQKKKAEAAKAKAAAEAAKLK 164
Score = 33.4 bits (76), Expect = 1.1
Identities = 25/119 (21%), Positives = 47/119 (39%), Gaps = 1/119 (0%)
Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI 1072
+L +E+ E+Q+ E +Q EE + A + ++ +A +
Sbjct: 105 QLEKERLKAQEQQKQAEEAEKQAQLEQKQQ-EEQARKAAAEQKKKAEAAKAKAAAEAAKL 163
Query: 1073 KDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
K EA+K+ ++ E + + A K+ E A E A +A+ K KK +
Sbjct: 164 KAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222
Score = 32.2 bits (73), Expect = 2.4
Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 1042 QVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
EE++ A + QE + E LK +E K +EAEK+ Q Q E ++ +
Sbjct: 88 VAEELKPKQAAE-QERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146
Query: 1102 KR 1103
K+
Sbjct: 147 KK 148
Score = 30.3 bits (68), Expect = 8.1
Identities = 20/102 (19%), Positives = 41/102 (40%), Gaps = 1/102 (0%)
Query: 1034 GKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIE 1093
G + + +++ Q S Q+ + K E +LK +QE K+ + + E +
Sbjct: 58 GAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQ 117
Query: 1094 KQTVEIA-QKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVE 1134
KQ E Q ++ + Q A + ++ + K + E
Sbjct: 118 KQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAE 159
Score = 30.3 bits (68), Expect = 9.1
Identities = 23/103 (22%), Positives = 42/103 (40%), Gaps = 10/103 (9%)
Query: 1038 ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIE--KQ 1095
E ++Q+E+ + +K+QE Q + E A + + K Q+E ++ Q +AE K
Sbjct: 101 ERLKQLEKER----LKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKA 156
Query: 1096 TVEIAQKRVFV----MEDLAQVEPAVMDAQQAVKEIKKQQLVE 1134
E A+ + + A A+ KK+ E
Sbjct: 157 AAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAE 199
>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b. This model
describes the F1/F0 ATP synthase b subunit in bacteria
only. Scoring just below the trusted cutoff are the
N-terminal domains of Mycobacterial b/delta fusion
proteins and a subunit from an archaeon, Methanosarcina
barkeri, in which the ATP synthase homolog differs in
architecture and is not experimentally confirmed. This
model helps resolve b from the related b' subunit. Within
the family is an example from a sodium-translocating
rather than proton-translocating ATP synthase [Energy
metabolism, ATP-proton motive force interconversion].
Length = 147
Score = 33.5 bits (77), Expect = 0.30
Identities = 24/96 (25%), Positives = 45/96 (46%), Gaps = 11/96 (11%)
Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEK 1094
KIA+ + E +K A+ ++ Q + A + +E+I + A KR + I E +
Sbjct: 30 KIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEII---ENANKRGSE---ILEEAKA 83
Query: 1095 QTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
+ E +K ++ A+ E QA +E++KQ
Sbjct: 84 EAREEREK----IKAQARAEIEAEKE-QAREELRKQ 114
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 35.0 bits (80), Expect = 0.32
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA 1100
+QV E QK K+Q KN+ LK K+ +A K +S+ + E E + +E
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKD-----HKAFDLKQESKASEKEAEDKELEAQ 327
Query: 1101 QKRVFVMEDLAQVEPAV 1117
+KR V EDL + +P V
Sbjct: 328 KKREPVAEDLQKTKPQV 344
Score = 35.0 bits (80), Expect = 0.34
Identities = 19/89 (21%), Positives = 40/89 (44%), Gaps = 9/89 (10%)
Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA 1100
E++++ Q Q+ + A+ + E+ + QQEA+ + D + E + V
Sbjct: 220 EELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLP-KPADTSSPKEDKQVAEN 278
Query: 1101 QKRVFVMEDLAQVEPAVMDAQQAVKEIKK 1129
QKR ++E A ++ ++ +E K
Sbjct: 279 QKR--------EIEKAQIEIKKNDEEALK 299
Score = 31.9 bits (72), Expect = 3.2
Identities = 20/85 (23%), Positives = 38/85 (44%), Gaps = 5/85 (5%)
Query: 1052 VKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQ--AEIEKQTVEIAQKRVFVMED 1109
+K +E Q + A +E+ K Q +A+K + ++ Q A+ ++ V Q+ +
Sbjct: 203 LKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262
Query: 1110 LAQVEPAVMD---AQQAVKEIKKQQ 1131
A D A+ +EI+K Q
Sbjct: 263 PADTSSPKEDKQVAENQKREIEKAQ 287
>gnl|CDD|222165 pfam13481, AAA_25, AAA domain. This AAA domain is found in a wide
variety of presumed DNA repair proteins.
Length = 154
Score = 33.4 bits (77), Expect = 0.37
Identities = 14/60 (23%), Positives = 20/60 (33%), Gaps = 5/60 (8%)
Query: 381 PQIEVETQKVAASDVVVPTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL 440
E+ D P L L+ L +L G PG+GK+ L A+
Sbjct: 3 EPSELLPSSAEDLDAPPPPLRW-----LVKGLLPRGGLTLLAGAPGTGKSTLALDLAAAV 57
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 34.6 bits (80), Expect = 0.38
Identities = 18/129 (13%), Positives = 54/129 (41%), Gaps = 12/129 (9%)
Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQEL-QSKNEAANL------KLK 1069
E L+E L L + ++ + + +L + ++L Q ++E + + K
Sbjct: 151 ENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAK 210
Query: 1070 EMIKDQQEAEKRKVQSQ-DIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIK 1128
E +K + KV+ +++ E+++ +I +++ + +A++ +++ +
Sbjct: 211 EKLKKLLQEIMIKVKKLEELEEELQELESKIEDLT----NKKSELNTEIAEAEKKLEQCR 266
Query: 1129 KQQLVELRS 1137
E+
Sbjct: 267 GFTFKEIEK 275
Score = 30.4 bits (69), Expect = 8.1
Identities = 12/73 (16%), Positives = 32/73 (43%), Gaps = 8/73 (10%)
Query: 1262 TKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKF----YEYAQLIAQATAIK-- 1315
+ L L++++ K++Y L+ + + + ++ + E QL ++
Sbjct: 142 LEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDC 201
Query: 1316 --TDLDNVQAKVE 1326
T+LD + K++
Sbjct: 202 DPTELDRAKEKLK 214
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 35.1 bits (82), Expect = 0.38
Identities = 21/96 (21%), Positives = 42/96 (43%), Gaps = 7/96 (7%)
Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQ 1095
+ E ++E+ L K EL + EA +LKE+ K+ ++ + + + ++ Q
Sbjct: 696 LNEQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQLKAKLAAAA--AGDLLAQ 753
Query: 1096 TVEIAQKRVFVMEDLAQVEPAVMDA-QQAVKEIKKQ 1130
E+ +V AQVE A + ++K +
Sbjct: 754 AKEVNGVKVLA----AQVEGVDAKALRTLADDLKDK 785
>gnl|CDD|224046 COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component
[Inorganic ion transport and metabolism].
Length = 254
Score = 34.5 bits (80), Expect = 0.39
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
+GP+G+GKST K +L L+ G
Sbjct: 36 IGPNGAGKSTLLKAILGLLKPSSG 59
>gnl|CDD|221803 pfam12846, AAA_10, AAA-like domain. This family of domains
contain a P-loop motif that is characteristic of the
AAA superfamily. Many of the proteins in this family
are conjugative transfer proteins.
Length = 316
Score = 34.3 bits (79), Expect = 0.44
Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 3/41 (7%)
Query: 43 NHGLMMVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPK 83
N +++VGPSGSGKST K+L L G + +IDPK
Sbjct: 1 NPNMLIVGPSGSGKSTLLKLLALRLLAR-GGRVI--VIDPK 38
>gnl|CDD|213223 cd03256, ABC_PhnC_transporter, ATP-binding cassette domain of the
binding protein-dependent phosphonate transport system.
Phosphonates are a class of organophosphorus compounds
characterized by a chemically stable
carbon-to-phosphorus (C-P) bond. Phosphonates are
widespread among naturally occurring compounds in all
kingdoms of wildlife, but only prokaryotic
microorganisms are able to cleave this bond. Certain
bacteria such as E. coli can use alkylphosphonates as a
phosphorus source. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 241
Score = 34.1 bits (79), Expect = 0.45
Identities = 31/94 (32%), Positives = 45/94 (47%), Gaps = 15/94 (15%)
Query: 762 LIGVSGAGKTTL----SRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGCKNEKI 817
LIG SGAGK+TL + V +G + NK G LR + R+ G ++
Sbjct: 32 LIGPSGAGKSTLLRCLNGLVEPTSGSVLIDGTDINKLKGK----ALRQLRRQIGMIFQQF 87
Query: 818 AFLLDESNVLE---SGFLERMNTLLANGEIPGLF 848
L++ +VLE SG L R +T + + GLF
Sbjct: 88 N-LIERLSVLENVLSGRLGRRSTWRS---LFGLF 117
Score = 31.4 bits (72), Expect = 2.8
Identities = 16/44 (36%), Positives = 22/44 (50%), Gaps = 7/44 (15%)
Query: 41 NLNHGLM--MVGPSGSGKSTAWKVLLKALERY-EGVEGVAHIID 81
++N G ++GPSG+GKST LL+ L E G I
Sbjct: 23 SINPGEFVALIGPSGAGKST----LLRCLNGLVEPTSGSVLIDG 62
>gnl|CDD|238550 cd01130, VirB11-like_ATPase, Type IV secretory pathway component
VirB11, and related ATPases. The homohexamer, VirB11 is
one of eleven Vir proteins, which are required for
T-pilus biogenesis and virulence in the transfer of
T-DNA from the Ti (tumor-inducing) plasmid of bacterial
to plant cells. The pilus is a fibrous cell surface
organelle, which mediates adhesion between bacteria
during conjugative transfer or between bacteria and host
eukaryotic cells during infection. VirB11- related
ATPases include the archaeal flagella biosynthesis
protein and the pilus assembly proteins CpaF/TadA and
TrbB. This alignment contains the C-terminal domain,
which is the ATPase.
Length = 186
Score = 33.8 bits (78), Expect = 0.45
Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 9/55 (16%)
Query: 410 YTWLA--EHKPLVLCGPPGSGKTMTLLSALRAL--PDMEVVSLNFSSATTPELLL 460
Y WLA K +++ G GSGKT TLL+AL A PD ++++ T EL L
Sbjct: 17 YLWLAVEARKNILISGGTGSGKT-TLLNALLAFIPPDERIITIE----DTAELQL 66
>gnl|CDD|236686 PRK10416, PRK10416, signal recognition particle-docking protein
FtsY; Provisional.
Length = 318
Score = 34.3 bits (80), Expect = 0.46
Identities = 15/64 (23%), Positives = 36/64 (56%), Gaps = 4/64 (6%)
Query: 713 TTELREYVQARLKVF-YEEELDVQLVLFDEVLDHVLRIDRIFRQPQGHL---LLIGVSGA 768
T E+ E ++ R+K ++ +++ +L +E+ + + +++ + L++GV+G
Sbjct: 66 TEEIIEELRERVKRKNLKDPEELKELLKEELAEILEPVEKPLNIEEKKPFVILVVGVNGV 125
Query: 769 GKTT 772
GKTT
Sbjct: 126 GKTT 129
>gnl|CDD|213788 TIGR03263, guanyl_kin, guanylate kinase. Members of this family
are the enzyme guanylate kinase, also called GMP kinase.
This enzyme transfers a phosphate from ATP to GMP,
yielding ADP and GDP [Purines, pyrimidines, nucleosides,
and nucleotides, Nucleotide and nucleoside
interconversions].
Length = 179
Score = 33.6 bits (78), Expect = 0.46
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 6/40 (15%)
Query: 417 KPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFS-SATT 455
+V+ GP G+GK+ TL+ AL E +L FS SATT
Sbjct: 1 LLIVISGPSGAGKS-TLVKALLE----EDPNLKFSISATT 35
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This family
consists of several eukaryotic proteins of unknown
function.
Length = 536
Score = 34.7 bits (80), Expect = 0.47
Identities = 22/144 (15%), Positives = 61/144 (42%), Gaps = 12/144 (8%)
Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLK-EMIKDQQEAEKRKVQSQDIQ---AE 1091
+ + S ++++++ AA EM ++ ++R+ + + +Q E
Sbjct: 222 LLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMDTEEDRTKEREAELEALQEQIDE 281
Query: 1092 IEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRS---MANPPSVVKLA 1148
+E E+ + + + QV + +Q +E++++ ++ R+ + + + V
Sbjct: 282 LESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVA-K 340
Query: 1149 LESICLLLGEN----ATDWKAIRA 1168
L+++ + E A W+A R
Sbjct: 341 LQALVVASSERLLELAQQWEAHRT 364
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents the
N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 32.2 bits (74), Expect = 0.48
Identities = 13/71 (18%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 1034 GKIAETVEQVEEM---QKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQA 1090
G V+++ E+ ++ L V+ +ELQ++ + ++ + K +++AE + ++++
Sbjct: 22 GVDVLDVDELLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKD 81
Query: 1091 EIEKQTVEIAQ 1101
E++ E+ +
Sbjct: 82 ELKALEAELRE 92
>gnl|CDD|223637 COG0563, Adk, Adenylate kinase and related kinases [Nucleotide
transport and metabolism].
Length = 178
Score = 33.4 bits (77), Expect = 0.51
Identities = 9/19 (47%), Positives = 16/19 (84%)
Query: 760 LLLIGVSGAGKTTLSRFVA 778
+L++G GAGK+TL++ +A
Sbjct: 3 ILILGPPGAGKSTLAKKLA 21
>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family. This family includes PspA a
protein that suppresses sigma54-dependent transcription.
The PspA protein, a negative regulator of the Escherichia
coli phage shock psp operon, is produced when virulence
factors are exported through secretins in many
Gram-negative pathogenic bacteria and its homologue in
plants, VIPP1, plays a critical role in thylakoid
biogenesis, essential for photosynthesis. Activation of
transcription by the enhancer-dependent bacterial
sigma(54) containing RNA polymerase occurs through ATP
hydrolysis-driven protein conformational changes enabled
by activator proteins that belong to the large AAA(+)
mechanochemical protein family. It has been shown that
PspA directly and specifically acts upon and binds to the
AAA(+) domain of the PspF transcription activator.
Length = 220
Score = 33.9 bits (78), Expect = 0.51
Identities = 25/112 (22%), Positives = 47/112 (41%), Gaps = 10/112 (8%)
Query: 1236 KKVEPLRLELKSLEVQASENKAKGEE--TKDLITQ---LEKSIASYKDEYAQLIAQATAI 1290
+K+E + + K LE +A KG E ++ + + LEK + + + Q + +
Sbjct: 58 RKLEEQKEQAKKLENKARAALTKGNEELAREALAEIATLEKQAEALETQLTQQRSAVEQL 117
Query: 1291 KTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQA--KVERSMALLKSLGIERE 1340
+ L ++ K QL A+ TA+K L +A V S+ +
Sbjct: 118 RKQLAALETK---IQQLKAKKTALKARLKAAKAQEAVNTSLGSASTESATDS 166
>gnl|CDD|206350 pfam14182, YgaB, YgaB-like protein. The YgaB-like protein family
includes the B. subtilis YgaB protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are
approximately 90 amino acids in length.
Length = 79
Score = 31.5 bits (72), Expect = 0.52
Identities = 22/70 (31%), Positives = 35/70 (50%), Gaps = 14/70 (20%)
Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQS---------QDIQAE 1091
EQ++ M K L LQS+ E KE+++ +++AE +Q ++IQ
Sbjct: 8 EQMKTMDKLL-----FLQSELERCQEIEKELLELEKKAELESIQQEISRMRKELKEIQEL 62
Query: 1092 IEKQTVEIAQ 1101
EKQT E+ Q
Sbjct: 63 FEKQTEEVIQ 72
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 34.2 bits (79), Expect = 0.53
Identities = 11/26 (42%), Positives = 15/26 (57%)
Query: 46 LMMVGPSGSGKSTAWKVLLKALERYE 71
L++V SG KS K+ +K LE E
Sbjct: 42 LLVVAESGERKSAVDKLAMKPLEEIE 67
>gnl|CDD|177744 PLN00135, PLN00135, malate dehydrogenase.
Length = 309
Score = 34.0 bits (78), Expect = 0.53
Identities = 27/94 (28%), Positives = 46/94 (48%), Gaps = 20/94 (21%)
Query: 235 DVLSPALTLQQDVASILSTHFAPD-------------GLVVRALDYAMQQEHIMDFTRL- 280
DV+S +++ + AS L H APD L+++ ++ +++I TRL
Sbjct: 78 DVMSKNVSIYKSQASALEKHAAPDCKVLVVANPANTNALILKEFAPSIPEKNITCLTRLD 137
Query: 281 --RALGSLFSMLN---QGVRNVLQY-NHSHSDFP 308
RALG + L V+NV+ + NHS + +P
Sbjct: 138 HNRALGQISERLGVPVSDVKNVIIWGNHSSTQYP 171
>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
Length = 156
Score = 32.8 bits (76), Expect = 0.59
Identities = 29/108 (26%), Positives = 50/108 (46%), Gaps = 24/108 (22%)
Query: 1035 KIAETVEQVEEMQKSLA-----VKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQS-QDI 1088
KIA+ + E +K L ++Q +++ EAA E+I ++A+KR Q ++
Sbjct: 39 KIADGLAAAERAKKELELAQAKYEAQLAEARAEAA-----EII---EQAKKRAAQIIEEA 90
Query: 1089 QAE--------IEKQTVEIAQKRVFVMEDL-AQVEP-AVMDAQQAVKE 1126
+AE + EI Q+R E+L QV AV A++ +
Sbjct: 91 KAEAEAEAARIKAQAQAEIEQERKRAREELRKQVADLAVAGAEKILGR 138
>gnl|CDD|234033 TIGR02857, CydD, thiol reductant ABC exporter, CydD subunit. The
gene pair cydCD encodes an ABC-family transporter in
which each gene contains an N-terminal membrane-spanning
domain (pfam00664) and a C-terminal ATP-binding domain
(pfam00005). In E. coli these genes were discovered as
mutants which caused the terminal heme-copper oxidase
complex cytochrome bd to fail to assemble. Recent work
has shown that the transporter is involved in export of
redox-active thiol compounds such as cysteine and
glutathione. The linkage to assembly of the cytochrome
bd complex is further supported by the conserved operon
structure found outside the gammaproteobacteria
(cydABCD) containing both the transporter and oxidase
genes components. The genes used as the seed members for
this model are all either found in the
gammproteobacterial context or the CydABCD context. All
members of this family scoring above trusted at the time
of its creation were from genomes which encode a
cytochrome bd complex. Unfortunately, the gene symbol
nomenclature adopted based on this operon in B. subtilis
assigns cydC to the third gene in the operon where this
gene is actually homologous to the E. coli cydD gene. We
have chosen to name all homologs in this family in
accordance with the precedence of publication of the E.
coli name, CydD.
Length = 529
Score = 34.2 bits (79), Expect = 0.59
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEGV 73
VGPSG+GKST +LL ++ EG
Sbjct: 354 VGPSGAGKSTLLNLLLGFVDPTEGS 378
Score = 30.3 bits (69), Expect = 9.8
Identities = 10/19 (52%), Positives = 13/19 (68%)
Query: 755 QPQGHLLLIGVSGAGKTTL 773
P + L+G SGAGK+TL
Sbjct: 346 PPGERVALVGPSGAGKSTL 364
>gnl|CDD|217429 pfam03215, Rad17, Rad17 cell cycle checkpoint protein.
Length = 490
Score = 34.2 bits (78), Expect = 0.66
Identities = 24/62 (38%), Positives = 32/62 (51%), Gaps = 3/62 (4%)
Query: 6 EKIKEVCREEFLVCGEGNEEGGPWMEKVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLLK 65
EK K R+E + + E W++ V + SN L++ GPSG GKST KVL K
Sbjct: 11 EKYKPRRRDELAIHKKKIAEVDHWLKAV---FLESNKQLILLLTGPSGCGKSTTVKVLSK 67
Query: 66 AL 67
L
Sbjct: 68 EL 69
Score = 30.4 bits (68), Expect = 9.0
Identities = 18/71 (25%), Positives = 30/71 (42%), Gaps = 14/71 (19%)
Query: 368 IKNGEWVPWS-----NKVPQIEVETQKVAASDVVVPTLDTVRHESLLYTWLAEHKPLVLC 422
+K E W+ + ++ + +K+A V L V ES + L+L
Sbjct: 1 MKADEHELWTEKYKPRRRDELAIHKKKIAE---VDHWLKAVFLES------NKQLILLLT 51
Query: 423 GPPGSGKTMTL 433
GP G GK+ T+
Sbjct: 52 GPSGCGKSTTV 62
>gnl|CDD|225165 COG2256, MGS1, ATPase related to the helicase subunit of the
Holliday junction resolvase [DNA replication,
recombination, and repair].
Length = 436
Score = 33.7 bits (78), Expect = 0.74
Identities = 22/76 (28%), Positives = 36/76 (47%), Gaps = 13/76 (17%)
Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNGVIL 478
++L GPPG+GKT TL + + +L+ ++ + L + + E RK
Sbjct: 51 MILWGPPGTGKT-TLARLIAGTTNAAFEALSAVTSGVKD-LREIIE---EARKN------ 99
Query: 479 SPIQLGKWLVLFCDEI 494
LG+ +LF DEI
Sbjct: 100 --RLLGRRTILFLDEI 113
>gnl|CDD|106979 PHA00671, PHA00671, hypothetical protein.
Length = 135
Score = 32.3 bits (72), Expect = 0.75
Identities = 28/115 (24%), Positives = 55/115 (47%), Gaps = 10/115 (8%)
Query: 1054 SQELQSKNEAANLKLKEMIKDQ--QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLA 1111
S + +K E A ++ + Q Q+A++ K +S + E +KQT E Q++ +M+
Sbjct: 19 SMYMSNKQEKAQRSAQQQAQQQAQQQADQAKAESDRLTQEYQKQT-EAYQQQAKIMQQ-- 75
Query: 1112 QVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAI 1166
++D+QQ+ ++Q R+++ S K +S +L G D A+
Sbjct: 76 ----QLLDSQQSFNRANQKQPNSQRALSAVGSAAKSG-QSGTMLTGPQGVDQSAL 125
>gnl|CDD|129721 TIGR00635, ruvB, Holliday junction DNA helicase, RuvB subunit. All
proteins in this family for which functions are known
are 5'-3' DNA helicases that, as part of a complex with
RuvA homologs serve as a 5'-3' Holliday junction
helicase. RuvA specifically binds Holliday junctions as
a sandwich of two tetramers and maintains the
configuration of the junction. It forms a complex with
two hexameric rings of RuvB, the subunit that contains
helicase activity. The complex drives ATP-dependent
branch migration of the Holliday junction recombination
intermediate. The endonuclease RuvC resolves junctions
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 305
Score = 33.4 bits (77), Expect = 0.76
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 741 EVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAF 779
E L + ++ ++ HLLL G G GKTTL+ +A
Sbjct: 14 EQLQLFIEAAKMRQEALDHLLLYGPPGLGKTTLAHIIAN 52
>gnl|CDD|213190 cd03223, ABCD_peroxisomal_ALDP, ATP-binding cassette domain of
peroxisomal transporter, subfamily D. Peroxisomal
ATP-binding cassette transporter (Pat) is involved in
the import of very long-chain fatty acids (VLCFA) into
the peroxisome. The peroxisomal membrane forms a
permeability barrier for a wide variety of metabolites
required for and formed during fatty acid
beta-oxidation. To communicate with the cytoplasm and
mitochondria, peroxisomes need dedicated proteins to
transport such hydrophilic molecules across their
membranes. X-linked adrenoleukodystrophy (X-ALD) is
caused by mutations in the ALD gene, which encodes ALDP
(adrenoleukodystrophy protein ), a peroxisomal integral
membrane protein that is a member of the ATP-binding
cassette (ABC) transporter protein family. The disease
is characterized by a striking and unpredictable
variation in phenotypic expression. Phenotypes include
the rapidly progressive childhood cerebral form (CCALD),
the milder adult form, adrenomyeloneuropathy (AMN), and
variants without neurologic involvement (i.e.
asymptomatic).
Length = 166
Score = 32.9 bits (76), Expect = 0.76
Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 744 DHVLRIDRIFRQPQG-HLLLIGVSGAGKTTLSRFVA 778
VL D F G LL+ G SG GK++L R +A
Sbjct: 13 GRVLLKDLSFEIKPGDRLLITGPSGTGKSSLFRALA 48
Score = 29.4 bits (67), Expect = 8.4
Identities = 11/22 (50%), Positives = 16/22 (72%), Gaps = 4/22 (18%)
Query: 46 LMMVGPSGSGKSTAWKVLLKAL 67
L++ GPSG+GKS+ L +AL
Sbjct: 30 LLITGPSGTGKSS----LFRAL 47
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil domain
of unknwon function.
Length = 126
Score = 31.8 bits (73), Expect = 0.82
Identities = 16/105 (15%), Positives = 51/105 (48%), Gaps = 5/105 (4%)
Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI 1072
+ + + L++++ L ++ E++ + ++ K K + + A + K
Sbjct: 21 EEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRA---EKKAEEEKKLRK 77
Query: 1073 KDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVF--VMEDLAQVEP 1115
+ ++E ++ K + ++++AEIEK ++ + + + +E + + P
Sbjct: 78 EKEEEIKELKAELEELKAEIEKLEEKLEEYQPYEEFLESVVERSP 122
Score = 31.8 bits (73), Expect = 0.90
Identities = 21/99 (21%), Positives = 46/99 (46%), Gaps = 12/99 (12%)
Query: 1042 QVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
++EE+Q +L K +E + + E + +E+ K ++E ++ + + +K E
Sbjct: 8 EMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLI-------KFDKFLKENEA 60
Query: 1102 KRVFVMEDLAQVEPAVMDAQQAVKEIK--KQQLVELRSM 1138
KR + + ++ +EIK K +L EL++
Sbjct: 61 KRRRAEKKAEE---EKKLRKEKEEEIKELKAELEELKAE 96
>gnl|CDD|223256 COG0178, UvrA, Excinuclease ATPase subunit [DNA replication,
recombination, and repair].
Length = 935
Score = 34.1 bits (79), Expect = 0.82
Identities = 29/82 (35%), Positives = 34/82 (41%), Gaps = 26/82 (31%)
Query: 45 GLMMV--GPSGSGKST-AWKVLLKALER-----------YEGVEGVAHI-----IDPKAI 85
G+ G SGSGKST L+ AL R Y+ +EG+ HI ID I
Sbjct: 627 GVFTCVTGVSGSGKSTLINDTLVPALARHLNGTKEEPGPYKKIEGLEHIDKVIDIDQSPI 686
Query: 86 -----SKEALY-GVLDPNTREW 101
S A Y GV D RE
Sbjct: 687 GRTPRSNPATYTGVFDD-IREL 707
>gnl|CDD|223540 COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational
modification, protein turnover, chaperones].
Length = 494
Score = 33.6 bits (77), Expect = 0.84
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 754 RQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRA 790
+P +LL G G GKT L++ VA + ++
Sbjct: 273 LRPPKGVLLYGPPGTGKTLLAKAVALESRSRFISVKG 309
>gnl|CDD|225164 COG2255, RuvB, Holliday junction resolvasome, helicase subunit [DNA
replication, recombination, and repair].
Length = 332
Score = 33.7 bits (78), Expect = 0.86
Identities = 21/67 (31%), Positives = 30/67 (44%), Gaps = 22/67 (32%)
Query: 716 LREYV-QARLKVFYEEELDVQL---VLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKT 771
L E++ Q ++K E+L + + E LDHVL L G G GKT
Sbjct: 25 LDEFIGQEKVK----EQLQIFIKAAKKRGEALDHVL--------------LFGPPGLGKT 66
Query: 772 TLSRFVA 778
TL+ +A
Sbjct: 67 TLAHIIA 73
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 34.1 bits (78), Expect = 0.86
Identities = 30/140 (21%), Positives = 63/140 (45%), Gaps = 5/140 (3%)
Query: 1006 DFINHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAV----KSQELQSKN 1061
+ NH + Y EK S LEE+ + + + I E + +++ ++ L KS +K
Sbjct: 465 EKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKI 524
Query: 1062 EAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEI-AQKRVFVMEDLAQVEPAVMDA 1120
E+A L+++ E + + + ++I+ + +E KR + LA + ++
Sbjct: 525 ESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIET 584
Query: 1121 QQAVKEIKKQQLVELRSMAN 1140
++ K+QL +L S
Sbjct: 585 NRSRSNEIKKQLNDLESRLQ 604
>gnl|CDD|234814 PRK00698, tmk, thymidylate kinase; Validated.
Length = 205
Score = 32.8 bits (76), Expect = 0.87
Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 21/75 (28%)
Query: 50 GPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTREWTDGLFTHI 109
G G+GKST ++L + LE D V TRE
Sbjct: 10 GIDGAGKSTQIELLKELLE--------QQGRDV----------VF---TREPGGTPLGEK 48
Query: 110 LRRIIDNVRGEISKR 124
LR ++ + E+ +
Sbjct: 49 LRELLLDPNEEMDDK 63
>gnl|CDD|223436 COG0359, RplI, Ribosomal protein L9 [Translation, ribosomal structure
and biogenesis].
Length = 148
Score = 32.2 bits (74), Expect = 0.87
Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 15/79 (18%)
Query: 1032 GLGKIAETVEQVEEMQ------KSLAVKSQELQSKNEAANLKLKEMIKDQQEAEK--RKV 1083
GLGK + VE + K LAV + + NLKL E + + E +
Sbjct: 11 GLGKKGDIVEVKDGYARNFLIPKGLAVPATK-------GNLKLLEARRAKLEKKAAEELA 63
Query: 1084 QSQDIQAEIEKQTVEIAQK 1102
+++ ++ ++E +TVEIA K
Sbjct: 64 EAEALKEKLEGKTVEIAVK 82
Score = 29.5 bits (67), Expect = 8.0
Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 16/85 (18%)
Query: 1032 GLGKIA--ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRK------- 1082
GL A ++ +E + L + + + EA LKE ++ + K
Sbjct: 34 GLAVPATKGNLKLLEARRAKL--EKKAAEELAEA--EALKEKLEGKTVEIAVKAGEDGKL 89
Query: 1083 ---VQSQDIQAEIEKQTVEIAQKRV 1104
V S+DI ++ ++ ++++
Sbjct: 90 FGSVTSKDIAEALKAAGFKLDKRKI 114
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner membrane
complex of itself with TolQ and TolR to the outer
membrane complex of TolB and OprL (also called Pal). Most
of the length of the protein consists of low-complexity
sequence that may differ in both length and composition
from one species to another, complicating efforts to
discriminate TolA (the most divergent gene in the tol-pal
system) from paralogs such as TonB. Selection of members
of the seed alignment and criteria for setting scoring
cutoffs are based largely conserved operon struction.
//The Tol-Pal complex is required for maintaining outer
membrane integrity. Also involved in transport (uptake)
of colicins and filamentous DNA, and implicated in
pathogenesis. Transport is energized by the proton motive
force. TolA is an inner membrane protein that interacts
with periplasmic TolB and with outer membrane porins
ompC, phoE and lamB [Transport and binding proteins,
Other, Cellular processes, Pathogenesis].
Length = 346
Score = 33.7 bits (77), Expect = 0.88
Identities = 23/123 (18%), Positives = 47/123 (38%), Gaps = 8/123 (6%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI--- 1072
++ E E+Q+ ++ + + +++ Q + + +A K K+
Sbjct: 74 EQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133
Query: 1073 -KDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
K + EAEK+ + QAE E + A+ + E + E +A E K +
Sbjct: 134 AKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEA----EAKAKAEAKAKA 189
Query: 1132 LVE 1134
E
Sbjct: 190 KAE 192
Score = 32.1 bits (73), Expect = 2.5
Identities = 21/109 (19%), Positives = 42/109 (38%), Gaps = 7/109 (6%)
Query: 1033 LGKIAETVEQVEEMQKSLAVKSQEL-QSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAE 1091
K+ + E+ E+ + + + +EL Q K E Q E ++++ + +
Sbjct: 70 QKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQA 129
Query: 1092 IEKQTVEIAQKRVFVMEDLA-QVEP-----AVMDAQQAVKEIKKQQLVE 1134
E + A+ E+ Q E A +A++ E KK+ E
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAE 178
>gnl|CDD|238549 cd01129, PulE-GspE, PulE/GspE The type II secretory pathway is the
main terminal branch of the general secretory pathway
(GSP). It is responsible for the export the majority of
Gram-negative bacterial exoenzymes and toxins. PulE is a
cytoplasmic protein of the GSP, which contains an ATP
binding site and a tetracysteine motif. This subgroup
also includes PillB and HofB.
Length = 264
Score = 33.0 bits (76), Expect = 0.93
Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%)
Query: 27 GPWMEKVLQLYQISNLNHGLMMV-GPSGSGKSTAWKVLLKALERYE 71
G E + ++ HG+++V GP+GSGK+T L L E
Sbjct: 63 GLKPENLEIFRKLLEKPHGIILVTGPTGSGKTTTLYSALSELNTPE 108
>gnl|CDD|153364 cd07680, F-BAR_PACSIN1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons 1 (PACSIN1). F-BAR
domains are dimerization modules that bind and bend
membranes and are found in proteins involved in membrane
dynamics and actin reorganization. Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins,
also called Synaptic dynamin-associated proteins
(Syndapins), act as regulators of cytoskeletal and
membrane dynamics. Vetebrates harbor three isoforms with
distinct expression patterns and specific functions.
PACSIN 1 or Syndapin I is expressed specifically in the
brain and is localized in neurites and synaptic boutons.
It binds the brain-specific proteins dynamin I,
synaptojanin, synapsin I, and neural Wiskott-Aldrich
syndrome protein (nWASP), and functions as a link between
the cytoskeletal machinery and synaptic vesicle
endocytosis. PACSIN 1 interacts with huntingtin and may
be implicated in the neuropathology of Huntington's
disease. It contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 258
Score = 33.1 bits (75), Expect = 0.94
Identities = 35/140 (25%), Positives = 66/140 (47%), Gaps = 13/140 (9%)
Query: 1009 NHFVKLYREKCSELEEQQLHLNV--GLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL 1066
N+ + EK ++ H + G + E + + QK A K +EL++ +A +L
Sbjct: 85 NNLLNEDLEKVKNWQKDAYHKQIMGGFKETKEAEDGFRKAQKPWAKKMKELEAAKKAYHL 144
Query: 1067 KLK-EMIKDQQEAEKRKVQS------QDIQAEIE--KQTVEIAQKRV-FVMEDLAQVEPA 1116
K E + +EA + QS + +Q +++ KQ V+ Q++ V++D+ + P
Sbjct: 145 ACKEEKLAMTREANSKAEQSVTPEQQKKLQDKVDKCKQDVQKTQEKYEKVLDDVGKTTPQ 204
Query: 1117 VMDAQQAVKEIKKQQLVELR 1136
M+ + V E + QQ E R
Sbjct: 205 YMENMEQVFE-QCQQFEEKR 223
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 33.4 bits (77), Expect = 0.98
Identities = 25/112 (22%), Positives = 53/112 (47%), Gaps = 14/112 (12%)
Query: 1235 LKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDL 1294
L+K++ EL+ + + +E K E K+ I+ ++S+ + + +A +K +
Sbjct: 315 LEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITL-------VDKAKKVKAAI 367
Query: 1295 DNVQAKFYEYAQLIAQATAIKTDLDN-----VQAKVERSMA--LLKSLGIER 1339
+ +QA+F + A+ +A+ + V+ K R + LLK GI+
Sbjct: 368 EELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKYHRGIVTDLLKDSGIKA 419
>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
(TACC). This family contains the proteins TACC 1, 2 and
3 the genes for which are found concentrated in the
centrosomes of eukaryotic and may play a conserved role
in organising centrosomal microtubules. The human TACC
proteins have been linked to cancer and TACC2 has been
identified as a possible tumour suppressor (AZU-1). The
functional homologue (Alp7) in Schizosaccharomyces pombe
has been shown to be required for organisation of bipolar
spindles.
Length = 207
Score = 32.8 bits (75), Expect = 0.99
Identities = 21/107 (19%), Positives = 52/107 (48%), Gaps = 3/107 (2%)
Query: 1228 QISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQA 1287
+ S DM ++ ++ E+ S E+++ E K K E+ + ++ K + ++ AQ+I +
Sbjct: 1 KYSQKDMDAALKKIKREVISSELESEELKKKYEKLRSENLEMGKIVDEFEKTIAQMI-EE 59
Query: 1288 TAIKTDL--DNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMALL 1332
+ + +L +Q E Q A +++T ++ + E+ ++
Sbjct: 60 SQKQKELSKRELQEVLAEKDQAYADLNSLETSFSDLFKRYEKYKEVI 106
>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus. This family contains the N
terminal region of several LUC7 protein homologues and
only contains eukaryotic proteins. LUC7 has been shown to
be a U1 snRNA associated protein with a role in splice
site recognition. The family also contains human and
mouse LUC7 like (LUC7L) proteins and human cisplatin
resistance-associated overexpressed protein (CROP).
Length = 252
Score = 33.2 bits (76), Expect = 1.0
Identities = 32/163 (19%), Positives = 61/163 (37%), Gaps = 26/163 (15%)
Query: 974 ACVYVHQTLHKANARLSKRGSRTMAITPRHYLDFINHFVKLYREKCSELEEQQLHLNVGL 1033
C VH KA+ + + L+ + FV K ++
Sbjct: 55 PCPKVHDLKLKADYERASKSQDYFPYE-VEALEILERFVHDCDRK----------IDKAK 103
Query: 1034 GKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKV-QSQDIQAEI 1092
++ T E E+ + + K++EL +E L E + E+ KV ++ + E+
Sbjct: 104 QRLELTQE--EQTKIAADSKAEELAELDEEIGKLLAEA---EALGEEGKVDEAMKLMKEV 158
Query: 1093 EKQTVEIAQKRVFVMEDLAQVE-PAVMDAQQAVKEIKKQQLVE 1134
E+ A+K+ D + P+ AQQ +K ++ E
Sbjct: 159 EEL---KAKKKELEDSDEVRNAAPSSAQAQQ-----QKLRVCE 193
>gnl|CDD|234719 PRK00300, gmk, guanylate kinase; Provisional.
Length = 205
Score = 32.8 bits (76), Expect = 1.0
Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 6/38 (15%)
Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFS-SATT 455
+VL GP G+GK+ TL+ AL +L S SATT
Sbjct: 8 IVLSGPSGAGKS-TLVKALLERDP----NLQLSVSATT 40
>gnl|CDD|153042 pfam12608, DUF3773, Protein of unknown function (DUF3773). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are approximately 110 amino
acids in length.
Length = 101
Score = 31.4 bits (71), Expect = 1.1
Identities = 10/50 (20%), Positives = 20/50 (40%), Gaps = 3/50 (6%)
Query: 192 EDVLSTEMIFENYLSRLRNIALDDIDDDSSLLITVDATGKAPDDVLSPAL 241
+ E ++ R ++ D++ D+ LL A + D + S L
Sbjct: 6 MLDILAEEAGADFYLDGREVSYDEVFSDTGLL---PALARRADQLASLCL 52
>gnl|CDD|227321 COG4988, CydD, ABC-type transport system involved in cytochrome bd
biosynthesis, ATPase and permease components [Energy
production and conversion / Posttranslational
modification, protein turnover, chaperones].
Length = 559
Score = 33.4 bits (77), Expect = 1.1
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
VG SG+GKST +LL L +G
Sbjct: 353 VGASGAGKSTLLNLLLGFLAPTQG 376
>gnl|CDD|184375 PRK13891, PRK13891, conjugal transfer protein TrbE; Provisional.
Length = 852
Score = 33.6 bits (77), Expect = 1.1
Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 39 ISNLNHGLMMVGPSGSGKSTAWKVLLKALERYEG 72
+ +L H M GP+G+GKST ++ L RY G
Sbjct: 485 VRDLGHTFMF-GPTGAGKSTHLGIIAAQLRRYAG 517
>gnl|CDD|216584 pfam01580, FtsK_SpoIIIE, FtsK/SpoIIIE family. FtsK has extensive
sequence similarity to wide variety of proteins from
prokaryotes and plasmids, termed the FtsK/SpoIIIE
family. This domain contains a putative ATP binding
P-loop motif. It is found in the FtsK cell division
protein from E. coli and the stage III sporulation
protein E SpoIIIE, which has roles in regulation of
prespore specific gene expression in B. subtilis. A
mutation in FtsK causes a temperature sensitive block
in cell division and it is involved in peptidoglycan
synthesis or modification. The SpoIIIE protein is
implicated in intercellular chromosomal DNA transfer.
Length = 201
Score = 32.4 bits (74), Expect = 1.2
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 2/58 (3%)
Query: 27 GPWMEKVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLLKALERYEGVEGVA-HIIDPK 83
G + + + + H L+ G +GSGKST L+ +L E V ++IDPK
Sbjct: 23 GKDISGNPVVADLVKMPHLLI-AGATGSGKSTFLNTLILSLAARHSPEEVRLYLIDPK 79
>gnl|CDD|233328 TIGR01243, CDC48, AAA family ATPase, CDC48 subfamily. This
subfamily of the AAA family ATPases includes two members
each from three archaeal species. It also includes yeast
CDC48 (cell division control protein 48) and the human
ortholog, transitional endoplasmic reticulum ATPase
(valosin-containing protein). These proteins in
eukaryotes are involved in the budding and transfer of
membrane from the transitional endoplasmic reticulum to
the Golgi apparatus.
Length = 733
Score = 33.3 bits (76), Expect = 1.3
Identities = 49/176 (27%), Positives = 73/176 (41%), Gaps = 34/176 (19%)
Query: 417 KPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNGV 476
K ++L GPPG+GK TLL+ A NF + PE+L K E K +
Sbjct: 488 KGVLLFGPPGTGK--TLLAKAVATES----GANFIAVRGPEILSKWVG---ESEKAIREI 538
Query: 477 ILSPIQLGKWLVLFCDEINL------PDMDKYATQRVISFLRQLIEQRGFYRPADKQWVS 530
Q ++ F DEI+ D T R+++ L L E G ++
Sbjct: 539 FRKARQAAPAIIFF-DEIDAIAPARGARFDTSVTDRIVNQL--LTEMDGIQELSN----- 590
Query: 531 LERIQCVGACNPP--TDPG-RKPLSHRFLRHVPVIYVDYPGETSLKQIYGTFSRAM 583
+ + A N P DP +P RF R +I V P E + K+I+ +R+M
Sbjct: 591 ---VVVIAATNRPDILDPALLRP--GRFDR---LILVPPPDEEARKEIFKIHTRSM 638
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists of
several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 33.4 bits (76), Expect = 1.3
Identities = 64/354 (18%), Positives = 132/354 (37%), Gaps = 46/354 (12%)
Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVK--SQELQSKNEAANLKLKE---- 1070
++ ELE + L K + EE K L K ++ L+ +NE + +
Sbjct: 107 DRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESI 166
Query: 1071 -MIKDQQEAEKRKVQSQDIQ-----AEIE--KQTVEIAQKRVFVMEDLAQVEPAVMDAQQ 1122
IK+ + + Q+ D + +E+E ++ +E QK + E Q A
Sbjct: 167 SRIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELAEAEKKLQ-SLTSEQASS 225
Query: 1123 AVKEIKKQQL-VELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMREN-FINSIV 1180
A +K + L EL+ VVK E + + + + A + EN + S+
Sbjct: 226 ADNSVKIKHLEEELKRYEQDAEVVKSMKEQLLQI-----PELERELAALREENRKLRSM- 279
Query: 1181 SNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEP 1240
+ E++ +E+ + +R K A ++ + +++
Sbjct: 280 -KEDNELLKEEL---------------EDLQSRLERFEKMREKLADLELEKEKLENELK- 322
Query: 1241 LRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAK 1300
KSL N ++ I L+ K++ + + A ++T L +Q
Sbjct: 323 ---SWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQL- 378
Query: 1301 FYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRSQMA 1354
E + +++ +K L+ ++A V R L + ER+ A ++ ++
Sbjct: 379 --ERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKELT 430
>gnl|CDD|234734 PRK00349, uvrA, excinuclease ABC subunit A; Reviewed.
Length = 943
Score = 33.5 bits (78), Expect = 1.3
Identities = 26/75 (34%), Positives = 33/75 (44%), Gaps = 24/75 (32%)
Query: 50 GPSGSGKST-----AWKVLLKALER-------YEGVEGVAHI-----IDPKAI-----SK 87
G SGSGKST +K L + L ++ +EG+ H+ ID I S
Sbjct: 642 GVSGSGKSTLINETLYKALARKLNGAKKVPGKHKEIEGLEHLDKVIDIDQSPIGRTPRSN 701
Query: 88 EALY-GVLDPNTREW 101
A Y GV DP RE
Sbjct: 702 PATYTGVFDP-IREL 715
>gnl|CDD|235290 PRK04351, PRK04351, hypothetical protein; Provisional.
Length = 149
Score = 31.7 bits (73), Expect = 1.3
Identities = 12/24 (50%), Positives = 15/24 (62%), Gaps = 2/24 (8%)
Query: 1503 NRELRRTGGRVLITLGDQDIDISP 1526
N+ LR TGGR L L D I+ +P
Sbjct: 29 NKRLRTTGGRYL--LKDHHIEFNP 50
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is thought
to translocate protons through membrane (inner membrane
in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1) subunits.
This domain should not be confused with the ab CF(1)
proteins (in the head of the ATP synthase) which are
found in pfam00006.
Length = 132
Score = 31.5 bits (72), Expect = 1.4
Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 15/98 (15%)
Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD-QQEAEKRKVQSQDIQAEIE 1093
KIA +++ EE K A E + + A + E+I + ++EA+K K ++I AE +
Sbjct: 34 KIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLK---EEILAEAQ 90
Query: 1094 KQ-TVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
K + R ++ Q + +QA+ E+++Q
Sbjct: 91 KDAERLLESAR----AEIEQEK------EQALAELRQQ 118
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 33.1 bits (76), Expect = 1.4
Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 2/47 (4%)
Query: 728 YEEELDVQLVLFDEVLDHVLR--IDRIFRQPQGHLLLIGVSGAGKTT 772
++E L F E L +L I+ + + + + L+G +G GKTT
Sbjct: 172 MKDESYEDLRYFSEKLRKLLLSLIENLIVEQKRVIALVGPTGVGKTT 218
>gnl|CDD|213200 cd03233, ABCG_PDR_domain1, First domain of the pleiotropic drug
resistance-like subfamily G of ATP-binding cassette
transporters. The pleiotropic drug resistance (PDR) is
a well-described phenomenon occurring in fungi and
shares several similarities with processes in bacteria
and higher eukaryotes. This PDR subfamily represents
domain I of its (ABC-IM)2 organization. ABC
transporters are a large family of proteins involved in
the transport of a wide variety of different compounds
including sugars, ions, peptides, and more complex
organic molecules. The nucleotide-binding domain shows
the highest similarity between all members of the
family. ABC transporters are a subset of nucleotide
hydrolases that contain a signature motif, Q-loop, and
H-loop/switch region, in addition to, the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 202
Score = 32.2 bits (74), Expect = 1.4
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 46 LMMVGPSGSGKSTAWKVLLKALERYEGVEG 75
++++G GSG ST K L E VEG
Sbjct: 36 VLVLGRPGSGCSTLLKALANRTEGNVSVEG 65
>gnl|CDD|145928 pfam03040, CemA, CemA family. Members of this family are probable
integral membrane proteins. Their molecular function is
unknown. CemA proteins are found in the inner envelope
membrane of chloroplasts but not in the thylakoid
membrane. A cyanobacterial member of this family has been
implicated in CO2 transport, but is probably not a CO2
transporter itself. They are predicted to be haem-binding
however this has not been proven experimentally.
Length = 230
Score = 32.5 bits (75), Expect = 1.4
Identities = 14/58 (24%), Positives = 27/58 (46%), Gaps = 10/58 (17%)
Query: 1044 EEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
++ K QE + L+L E+IK+ Q+++ EI K+ +E+A+
Sbjct: 51 SIQEERALEKLQEFEEL-----LELDELIKESPP-----TSLQELRIEIHKKAIELAK 98
>gnl|CDD|235450 PRK05416, PRK05416, glmZ(sRNA)-inactivating NTPase; Provisional.
Length = 288
Score = 32.4 bits (75), Expect = 1.6
Identities = 22/71 (30%), Positives = 31/71 (43%), Gaps = 17/71 (23%)
Query: 50 GPSGSGKSTAWKVLLKALE--RYEGVEG---------VAHIIDPKAISKEALYGVLDPNT 98
G SG+GKS A L+ALE Y V+ V + I K A+ V+D +
Sbjct: 13 GLSGAGKSVA----LRALEDLGYYCVDNLPPSLLPKLVELLAQSGGIRKVAV--VIDVRS 66
Query: 99 REWTDGLFTHI 109
R + D L +
Sbjct: 67 RPFFDDLPEAL 77
>gnl|CDD|223272 COG0194, Gmk, Guanylate kinase [Nucleotide transport and
metabolism].
Length = 191
Score = 32.1 bits (74), Expect = 1.7
Identities = 19/40 (47%), Positives = 22/40 (55%), Gaps = 7/40 (17%)
Query: 417 KPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFS-SATT 455
+VL GP G GK+ TL+ AL E L FS SATT
Sbjct: 5 LLIVLSGPSGVGKS-TLVKALL-----EDDKLRFSVSATT 38
>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
permease components [Defense mechanisms].
Length = 567
Score = 32.8 bits (75), Expect = 1.7
Identities = 14/27 (51%), Positives = 19/27 (70%)
Query: 46 LMMVGPSGSGKSTAWKVLLKALERYEG 72
+ +VGPSGSGKST K+LL+ + G
Sbjct: 358 VAIVGPSGSGKSTLIKLLLRLYDPTSG 384
>gnl|CDD|238899 cd01918, HprK_C, HprK/P, the bifunctional histidine-containing
protein kinase/phosphatase, controls the
phosphorylation state of the phosphocarrier protein HPr
and regulates the utilization of carbon sources by
gram-positive bacteria. It catalyzes both the
ATP-dependent phosphorylation of Ser-46 of HPr and its
dephosphorylation by phosphorolysis. The latter
reaction uses inorganic phosphate as substrate and
produces pyrophosphate. Phosphoenolpyruvate
carboxykinase (PEPCK) and the C-terminal catalytic
domain of HprK/P are structurally similar with
conserved active site residues suggesting these two
phosphotransferases have related functions. The HprK/P
N-terminal domain is structurally similar to the
N-terminal domains of the MurE and MurF amino acid
ligases.
Length = 149
Score = 31.4 bits (72), Expect = 1.8
Identities = 11/25 (44%), Positives = 14/25 (56%), Gaps = 6/25 (24%)
Query: 44 HGLMMVGPSGSGKSTAWKVLLKALE 68
G+++ GPSG GKS ALE
Sbjct: 15 IGVLITGPSGIGKSEL------ALE 33
>gnl|CDD|213184 cd03217, ABC_FeS_Assembly, ABC-type transport system involved in
Fe-S cluster assembly, ATPase component. Biosynthesis
of iron-sulfur clusters (Fe-S) depends on multi-protein
systems. The SUF system of E. coli and Erwinia
chrysanthemi is important for Fe-S biogenesis under
stressful conditions. The SUF system is made of six
proteins: SufC is an atypical cytoplasmic ABC-ATPase,
which forms a complex with SufB and SufD; SufA plays
the role of a scaffold protein for assembly of
iron-sulfur clusters and delivery to target proteins;
SufS is a cysteine desulfurase which mobilizes the
sulfur atom from cysteine and provides it to the
cluster; SufE has no associated function yet.
Length = 200
Score = 31.7 bits (73), Expect = 1.8
Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 9/35 (25%)
Query: 44 HGLMMVGPSGSGKSTAWKVLLKAL---ERYEGVEG 75
H LM GP+GSGKST L K + +YE EG
Sbjct: 29 HALM--GPNGSGKST----LAKTIMGHPKYEVTEG 57
>gnl|CDD|226164 COG3638, COG3638, ABC-type phosphate/phosphonate transport system,
ATPase component [Inorganic ion transport and
metabolism].
Length = 258
Score = 32.2 bits (74), Expect = 1.8
Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 5/90 (5%)
Query: 753 FRQPQG-HLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSG 811
QG + +IG SGAGK+TL R + + + +I + ++LR + R G
Sbjct: 25 LEINQGEMVAIIGPSGAGKSTLLRSLNGLVDPTSGEILFNGVQITKLKGKELRKLRRDIG 84
Query: 812 CKNEKIAFLLDES---NVLESGFLERMNTL 838
++ + S NVL G L +T
Sbjct: 85 MIFQQFNLVPRLSVLENVL-LGRLGYTSTW 113
Score = 31.5 bits (72), Expect = 3.2
Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 6/49 (12%)
Query: 41 NLNHGLM--MVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISK 87
+N G M ++GPSG+GKST LL++L + + I+K
Sbjct: 26 EINQGEMVAIIGPSGAGKST----LLRSLNGLVDPTSGEILFNGVQITK 70
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
chromosome partitioning].
Length = 420
Score = 32.8 bits (75), Expect = 1.8
Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 7/89 (7%)
Query: 1244 ELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYE 1303
+LK ++ + + + K E +D +LEK + S + E A L AQ DL ++ +
Sbjct: 39 QLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQ--- 95
Query: 1304 YAQLIAQATAIKTDLDNVQAKVERSMALL 1332
IA A L+ + + R +A
Sbjct: 96 ----IADLNARLNALEVQEREQRRRLAEQ 120
Score = 30.8 bits (70), Expect = 5.6
Identities = 21/91 (23%), Positives = 35/91 (38%), Gaps = 8/91 (8%)
Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA 1100
EQ + K LA +E + N E+ DQ++ E+ + ++ EI A
Sbjct: 193 EQRAQQAK-LAQLLEERKKTLAQLN---SELSADQKKLEELRANESRLKNEIASAEAAAA 248
Query: 1101 QKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
+ R E A E A A+ A + +
Sbjct: 249 KAR----EAAAAAEAAAARARAAEAKRTGET 275
>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
nucleotide-binding domain. ABC transporters are a
large family of proteins involved in the transport of a
wide variety of different compounds, like sugars, ions,
peptides, and more complex organic molecules. The
nucleotide-binding domain shows the highest similarity
between all members of the family. ABC transporters are
a subset of nucleotide hydrolases that contain a
signature motif, Q-loop, and H-loop/switch region, in
addition to, the Walker A motif/P-loop and Walker B
motif commonly found in a number of ATP- and
GTP-binding and hydrolyzing proteins.
Length = 157
Score = 31.4 bits (72), Expect = 1.8
Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 4/42 (9%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEAL 90
VGP+GSGKST LL+A+ +ID K I+K L
Sbjct: 31 VGPNGSGKST----LLRAIAGLLKPTSGEILIDGKDIAKLPL 68
Score = 29.9 bits (68), Expect = 6.2
Identities = 14/46 (30%), Positives = 24/46 (52%)
Query: 759 HLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLR 804
+ L+G +G+GK+TL R +A + + +I K E+LR
Sbjct: 27 IVALVGPNGSGKSTLLRAIAGLLKPTSGEILIDGKDIAKLPLEELR 72
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 31.2 bits (71), Expect = 1.9
Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 9/81 (11%)
Query: 1060 KNEAANLKLKEMIKDQQEAEKRKVQ---SQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPA 1116
K L+ ++ ++EAEK K + AE EK+ E+ +K+ + +
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEKKIM 125
Query: 1117 VMD------AQQAVKEIKKQQ 1131
D Q+ EI ++Q
Sbjct: 126 FADTSSLSPEQRQYYEIMQKQ 146
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different model
[Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 32.3 bits (74), Expect = 2.0
Identities = 19/102 (18%), Positives = 44/102 (43%), Gaps = 15/102 (14%)
Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSK-NEAA------- 1064
KL R ++E + + E + +++ +K L + +ELQ+K NE +
Sbjct: 5 KLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAK 64
Query: 1065 -------NLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEI 1099
KE+ + ++E + + ++AE++ + + I
Sbjct: 65 GQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSI 106
>gnl|CDD|110677 pfam01695, IstB_IS21, IstB-like ATP binding protein. This protein
contains an ATP/GTP binding P-loop motif. It is found
associated with IS21 family insertion sequences. The
function of this protein is unknown, but it may perform
a transposase function.
Length = 178
Score = 31.5 bits (72), Expect = 2.1
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 412 WLAEHKPLVLCGPPGSGKT 430
W+ + + L+L GPPG GKT
Sbjct: 43 WIEQAENLLLLGPPGVGKT 61
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 32.5 bits (74), Expect = 2.2
Identities = 14/54 (25%), Positives = 22/54 (40%)
Query: 1525 SPTFVIFLSTRDPTVEFPPDICSRVTFVNFTVTRSSLQSQCLNRVLKAERPDID 1578
P V+F+ + P I SR NF +S + L + K E+ I+
Sbjct: 146 PPKHVVFIFATTEFQKIPLTIISRCQRYNFKKLNNSELQELLKSIAKKEKIKIE 199
>gnl|CDD|237917 PRK15134, PRK15134, microcin C ABC transporter ATP-binding protein
YejF; Provisional.
Length = 529
Score = 32.4 bits (74), Expect = 2.3
Identities = 16/28 (57%), Positives = 18/28 (64%)
Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVV 446
L L G GSGK++T LS LR LP VV
Sbjct: 38 LALVGESGSGKSVTALSILRLLPSPPVV 65
>gnl|CDD|183133 PRK11432, fbpC, ferric transporter ATP-binding subunit;
Provisional.
Length = 351
Score = 32.0 bits (73), Expect = 2.3
Identities = 13/24 (54%), Positives = 17/24 (70%), Gaps = 1/24 (4%)
Query: 756 PQGHLL-LIGVSGAGKTTLSRFVA 778
QG ++ L+G SG GKTT+ R VA
Sbjct: 30 KQGTMVTLLGPSGCGKTTVLRLVA 53
>gnl|CDD|132313 TIGR03269, met_CoM_red_A2, methyl coenzyme M reductase system,
component A2. The enzyme that catalyzes the final step
in methanogenesis, methyl coenzyme M reductase, contains
alpha, beta, and gamma chains. In older literature, the
complex of alpha, beta, and gamma chains was termed
component C, while this single chain protein was termed
methyl coenzyme M reductase system component A2 [Energy
metabolism, Methanogenesis].
Length = 520
Score = 32.5 bits (74), Expect = 2.4
Identities = 11/17 (64%), Positives = 15/17 (88%)
Query: 762 LIGVSGAGKTTLSRFVA 778
++G SGAGKTTLS+ +A
Sbjct: 315 IVGTSGAGKTTLSKIIA 331
>gnl|CDD|234357 TIGR03797, NHLM_micro_ABC2, NHLM bacteriocin system ABC
transporter, ATP-binding protein. Members of this
protein family are ABC transporter ATP-binding subunits,
part of a three-gene putative bacteriocin transport
operon. The other subunits include another ATP-binding
subunit (TIGR03796), which has an N-terminal leader
sequence cleavage domain, and an HlyD homolog
(TIGR03794). In a number of genomes, members of protein
families related to nitrile hydratase alpha subunit or
to nif11 have undergone paralogous family expansions,
with members possessing a putative bacteriocin cleavage
region ending with a classic Gly-Gly motif. Those sets
of putative bacteriocins, members of this protein family
and its partners TIGR03794 and TIGR03796, and
cyclodehydratase/docking scaffold fusion proteins of
thiazole/oxazole biosynthesis frequently show correlated
species distribution and co-clustering within many of
those genomes [Transport and binding proteins, Amino
acids, peptides and amines, Cellular processes,
Biosynthesis of natural products].
Length = 686
Score = 32.2 bits (74), Expect = 2.4
Identities = 12/16 (75%), Positives = 14/16 (87%)
Query: 49 VGPSGSGKSTAWKVLL 64
VGPSGSGKST ++LL
Sbjct: 485 VGPSGSGKSTLLRLLL 500
>gnl|CDD|224059 COG1136, SalX, ABC-type antimicrobial peptide transport system,
ATPase component [Defense mechanisms].
Length = 226
Score = 31.7 bits (73), Expect = 2.5
Identities = 10/13 (76%), Positives = 12/13 (92%)
Query: 46 LMMVGPSGSGKST 58
+ +VGPSGSGKST
Sbjct: 34 VAIVGPSGSGKST 46
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 32.3 bits (74), Expect = 2.5
Identities = 17/99 (17%), Positives = 33/99 (33%), Gaps = 4/99 (4%)
Query: 1008 INHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL- 1066
+ E EE +L E ++ E+Q++ ++KNE
Sbjct: 819 LKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFK 878
Query: 1067 ---KLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQK 1102
K + ++ + V S D + EK T + +
Sbjct: 879 NDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKD 917
>gnl|CDD|223455 COG0378, HypB, Ni2+-binding GTPase involved in regulation of
expression and maturation of urease and hydrogenase
[Posttranslational modification, protein turnover,
chaperones / Transcription].
Length = 202
Score = 31.4 bits (72), Expect = 2.6
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 421 LCGPPGSGKTMTLLSALRALPD 442
+ GPPGSGKT + LRAL D
Sbjct: 18 VGGPPGSGKTALIEKTLRALKD 39
>gnl|CDD|233265 TIGR01085, murE, UDP-N-acetylmuramyl-tripeptide synthetase. Most
members of this family are EC 6.3.2.13,
UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,
6-diaminopimelate ligase. An exception is Staphylococcus
aureus, in which diaminopimelate is replaced by lysine
in the peptidoglycan and MurE is EC 6.3.2.7. The
Mycobacteria, part of the closest neighboring branch
outside of the low-GC Gram-positive bacteria, use
diaminopimelate. A close homolog, scoring just below the
trusted cutoff, is found (with introns) in Arabidopsis
thaliana. Its role is unknown [Cell envelope,
Biosynthesis and degradation of murein sacculus and
peptidoglycan].
Length = 464
Score = 32.3 bits (74), Expect = 2.6
Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 7/60 (11%)
Query: 726 VFYEEELDVQLVLFDEV----LDHVLRI--DRIFRQPQGHLLLIGVSGA-GKTTLSRFVA 778
V E ++D + + L H L + P L +IGV+G GKTT + +A
Sbjct: 45 VVVERDVDFYVAPVPVIIVPDLRHALSSLAAAFYGHPSKKLKVIGVTGTNGKTTTTSLIA 104
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 32.2 bits (74), Expect = 2.6
Identities = 30/145 (20%), Positives = 54/145 (37%), Gaps = 32/145 (22%)
Query: 1219 GPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKD 1278
GP V + + ++V L+ +L+ + ++++A E + + LE A ++
Sbjct: 132 GPFVPPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEE 191
Query: 1279 EYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGI- 1337
+ +L AQ QL +A T + Q + E + K L +
Sbjct: 192 KQQELEAQ-----------------LEQLQEKAAE--TSQERKQKRKEITDQAAKRLELS 232
Query: 1338 ERER------------WEATSETFR 1350
E E WEA S+T R
Sbjct: 233 EEETRILIDQQLRKAGWEADSKTLR 257
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 32.0 bits (73), Expect = 2.6
Identities = 14/39 (35%), Positives = 26/39 (66%), Gaps = 2/39 (5%)
Query: 38 QISNLNHGLMMVGPSGSGKSTAWKVLLKAL--ERYEGVE 74
+ ++++H + GP G+GK+T ++L K+L E +GVE
Sbjct: 31 KKNSISHAYIFAGPRGTGKTTVARILAKSLNCENRKGVE 69
>gnl|CDD|131721 TIGR02673, FtsE, cell division ATP-binding protein FtsE. This
model describes FtsE, a member of the ABC transporter
ATP-binding protein family. This protein, and its
permease partner FtsX, localize to the division site.
In a number of species, the ftsEX gene pair is located
next to FtsY, the signal recognition particle-docking
protein [Cellular processes, Cell division].
Length = 214
Score = 31.4 bits (72), Expect = 2.6
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 46 LMMVGPSGSGKSTAWKVLLKALERYEG 72
L + GPSG+GK+T K+L AL G
Sbjct: 31 LFLTGPSGAGKTTLLKLLYGALTPSRG 57
>gnl|CDD|224048 COG1123, COG1123, ATPase components of various ABC-type transport
systems, contain duplicated ATPase [General function
prediction only].
Length = 539
Score = 32.2 bits (74), Expect = 2.8
Identities = 10/17 (58%), Positives = 15/17 (88%)
Query: 762 LIGVSGAGKTTLSRFVA 778
L+G SG+GK+TL+R +A
Sbjct: 322 LVGESGSGKSTLARILA 338
Score = 31.4 bits (72), Expect = 5.0
Identities = 12/26 (46%), Positives = 14/26 (53%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEGVE 74
VG SGSGKST ++L L G
Sbjct: 323 VGESGSGKSTLARILAGLLPPSSGSI 348
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827). This
family consists of several plant proteins of unknown
function. Several sequences in this family are described
as being "myosin heavy chain-like".
Length = 484
Score = 31.9 bits (72), Expect = 2.9
Identities = 66/337 (19%), Positives = 130/337 (38%), Gaps = 35/337 (10%)
Query: 1023 EEQQLHL-NVGLGKIAETVEQVEEMQKSLAVKSQELQS-KNEAANLKLKEMIKDQQEAEK 1080
+E +L L K+ + +E V E +K +K EL+S K +LKLK ++AEK
Sbjct: 34 KETELMLAQEEANKLKKELE-VAEKEKLQVLK--ELESTKRTVEDLKLK-----LEKAEK 85
Query: 1081 RKVQSQDIQAEIEKQTVEIAQKRVFVME------DLAQVEPAVMDAQQAVKEIKKQQLVE 1134
+ Q+ KQ E+A+ R +E ++ + A + +E++K +
Sbjct: 86 EEQQA--------KQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEY 137
Query: 1135 LRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVRE 1194
+ + +K A E+IC A+ + + + I S E E
Sbjct: 138 DALVEERDAALKRAEEAIC------ASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEE 191
Query: 1195 KMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASE 1254
+ L A ++ + L+K+ EL+SL+ + S
Sbjct: 192 ERIGAALEKDQDRETYEKELKEA-EKELERLKQDLDPEKDLEKLAEASAELESLQKEISI 250
Query: 1255 NKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAI 1314
+ K+L + + ++ +E L ++K +L+ + + E + +A
Sbjct: 251 MASVASVLKEL-EEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAEEA 309
Query: 1315 KTDL--DNVQAKVERSM-ALLKSLGIERERWEATSET 1348
+ L + + K E+ A K + I + +A+SE
Sbjct: 310 ASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEA 346
Score = 30.3 bits (68), Expect = 9.2
Identities = 35/142 (24%), Positives = 60/142 (42%), Gaps = 12/142 (8%)
Query: 1019 CSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKL--------KE 1070
ELEE++ L K E E ++ L E +SK +++ E
Sbjct: 286 KQELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSE 345
Query: 1071 MIKDQQEAEKRKVQSQDIQAEIE--KQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIK 1128
+ ++EAE + + + ++ E E K +E A+ R+ + E A A+ EIK
Sbjct: 346 AEEARKEAEAAREELRKLKEEAEQTKAALETAELRLEAALK--EAEAAKAAEALALAEIK 403
Query: 1129 KQQLVELRSMANPPSVVKLALE 1150
Q E + A+ P + L+LE
Sbjct: 404 ALQESEESAKADSPRKITLSLE 425
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members of
this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded and
aggregated proteins [Protein fate, Protein folding and
stabilization].
Length = 852
Score = 32.2 bits (74), Expect = 3.0
Identities = 23/86 (26%), Positives = 41/86 (47%), Gaps = 20/86 (23%)
Query: 1241 LRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAK 1300
L +E ++L+ K K E +K+ + LEK +A ++EYA L Q A K + +Q
Sbjct: 417 LEIEREALK------KEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQ-- 468
Query: 1301 FYEYAQLIAQATAIKTDLDNVQAKVE 1326
IK +++ V+ ++E
Sbjct: 469 ------------QIKEEIEQVRLELE 482
>gnl|CDD|233062 TIGR00630, uvra, excinuclease ABC, A subunit. This family is a
member of the ABC transporter superfamily of proteins of
which all members for which functions are known except
the UvrA proteins are involved in the transport of
material through membranes. UvrA orthologs are involved
in the recognition of DNA damage as a step in nucleotide
excision repair. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 925
Score = 32.3 bits (74), Expect = 3.0
Identities = 26/77 (33%), Positives = 29/77 (37%), Gaps = 25/77 (32%)
Query: 45 GLMMV--GPSGSGKST-AWKVLLKALER-----------YEGVEGVAHI-----IDPKAI 85
GL G SGSGKST L AL Y +EG+ H+ ID I
Sbjct: 634 GLFTCITGVSGSGKSTLINDTLYPALANRLNGAKTVPGRYTSIEGLEHLDKVIHIDQSPI 693
Query: 86 -----SKEALY-GVLDP 96
S A Y GV D
Sbjct: 694 GRTPRSNPATYTGVFDE 710
>gnl|CDD|234132 TIGR03167, tRNA_sel_U_synt, tRNA 2-selenouridine synthase. The
Escherichia coli YbbB protein was shown to encode a
selenophosphate-dependent tRNA 2-selenouridine synthase,
essential for modification of some tRNAs to replace a
sulfur atom with selenium. This enzyme works with SelD,
the selenium donor protein, which also acts in
selenocysteine incorporation. Although the members of
this protein family show a fairly deep split, sequences
from both sides of the split are supported by
co-occurence with, and often proximity to, the selD gene
[Protein synthesis, tRNA and rRNA base modification].
Length = 311
Score = 31.8 bits (73), Expect = 3.1
Identities = 13/32 (40%), Positives = 14/32 (43%), Gaps = 1/32 (3%)
Query: 409 LYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL 440
L +VL G GSGKT LL AL
Sbjct: 120 LEELPQPFPLIVLGGMTGSGKT-ELLHALANA 150
>gnl|CDD|213233 cd03266, ABC_NatA_sodium_exporter, ATP-binding cassette domain of
the Na+ transporter. NatA is the ATPase component of a
bacterial ABC-type Na+ transport system called NatAB,
which catalyzes ATP-dependent electrogenic Na+ extrusion
without mechanically coupled proton or K+ uptake. NatB
possess six putative membrane spanning regions at its
C-terminus. In B. subtilis, NatAB is inducible by agents
such as ethanol and protonophores, which lower the
proton-motive force across the membrane. The closest
sequence similarity to NatA is exhibited by DrrA of the
two-component daunorubicin- and doxorubicin-efflux
system. Hence, the functional NatAB is presumably
assembled with two copies of a single ATP-binding
protein and a single integral membrane protein.
Length = 218
Score = 31.2 bits (71), Expect = 3.1
Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 2/42 (4%)
Query: 739 FDEVLDHVLRIDRI-FRQPQGHLL-LIGVSGAGKTTLSRFVA 778
F +V V +D + F G + L+G +GAGKTT R +A
Sbjct: 11 FRDVKKTVQAVDGVSFTVKPGEVTGLLGPNGAGKTTTLRMLA 52
>gnl|CDD|225982 COG3451, VirB4, Type IV secretory pathway, VirB4 components
[Intracellular trafficking and secretion].
Length = 796
Score = 32.0 bits (73), Expect = 3.2
Identities = 14/63 (22%), Positives = 22/63 (34%), Gaps = 11/63 (17%)
Query: 21 EGNEEGGPWMEKVLQLYQISNLNHGL-----------MMVGPSGSGKSTAWKVLLKALER 69
G G PW E + L +++GP+G+GK+ LL +
Sbjct: 403 LGPFRGNPWGEPLTVLKTEDGSPFYFNFHVGEDVGHTLIIGPTGAGKTVLLSFLLAQALK 462
Query: 70 YEG 72
Y
Sbjct: 463 YGN 465
>gnl|CDD|219014 pfam06414, Zeta_toxin, Zeta toxin. This family consists of several
bacterial zeta toxin proteins. Zeta toxin is thought to
be part of a postregulational killing system in
bacteria. It relies on antitoxin/toxin systems that
secure stable inheritance of low and medium copy number
plasmids during cell division and kill cells that have
lost the plasmid.
Length = 191
Score = 31.1 bits (71), Expect = 3.2
Identities = 18/96 (18%), Positives = 30/96 (31%), Gaps = 27/96 (28%)
Query: 48 MVGPSGSGKSTAWKVLLKALERYEGVEGVAHII-D------PKAISKEALYG-----VLD 95
+ G G+GK+ + LL+ L V I D P + +
Sbjct: 17 LGGQPGAGKTELARALLEELGG----GNVVRIDPDELRTYHPDYDELQKADPKDASELTQ 72
Query: 96 PNTREWTDGLFTHILRRIIDNVRGEISKRQWIIFDG 131
P+ W + L + + R II +G
Sbjct: 73 PDASRWVEKLIDYAIER-----------GYNIILEG 97
Score = 30.3 bits (69), Expect = 5.9
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLN 449
++L G PG+GKT + L L VV ++
Sbjct: 15 VLLGGQPGAGKTELARALLEELGGGNVVRID 45
>gnl|CDD|213227 cd03260, ABC_PstB_phosphate_transporter, ATP-binding cassette
domain of the phosphate transport system. Phosphate
uptake is of fundamental importance in the cell
physiology of bacteria because phosphate is required as
a nutrient. The Pst system of E. coli comprises four
distinct subunits encoded by the pstS, pstA, pstB, and
pstC genes. The PstS protein is a phosphate-binding
protein located in the periplasmic space. PstA and PstC
are hydrophobic and they form the transmembrane portion
of the Pst system. PstB is the catalytic subunit, which
couples the energy of ATP hydrolysis to the import of
phosphate across cellular membranes through the Pst
system, often referred as ABC-protein. PstB belongs to
one of the largest superfamilies of proteins
characterized by a highly conserved adenosine
triphosphate (ATP) binding cassette (ABC), which is
also a nucleotide binding domain (NBD).
Length = 227
Score = 31.4 bits (72), Expect = 3.3
Identities = 12/21 (57%), Positives = 14/21 (66%), Gaps = 4/21 (19%)
Query: 49 VGPSGSGKSTAWKVLLKALER 69
+GPSG GKST LL+ L R
Sbjct: 32 IGPSGCGKST----LLRLLNR 48
Score = 30.2 bits (69), Expect = 6.5
Identities = 15/45 (33%), Positives = 19/45 (42%), Gaps = 6/45 (13%)
Query: 740 DEVLDHV-LRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGL 783
L + L I LIG SG GK+TL R + +N L
Sbjct: 13 KHALKDISLDI-----PKGEITALIGPSGCGKSTLLRLLNRLNDL 52
>gnl|CDD|216246 pfam01017, STAT_alpha, STAT protein, all-alpha domain. STAT proteins
(Signal Transducers and Activators of Transcription) are
a family of transcription factors that are specifically
activated to regulate gene transcription when cells
encounter cytokines and growth factors. STAT proteins
also include an SH2 domain pfam00017.
Length = 182
Score = 31.1 bits (71), Expect = 3.4
Identities = 17/123 (13%), Positives = 44/123 (35%), Gaps = 22/123 (17%)
Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQ 1076
EK E++ + L + + ++Q+E++Q + + LQS E
Sbjct: 2 EKQLEIDSKVEDLRNSVQDTEQDIKQLEDLQDEFDFRYKTLQSLIET-----------PA 50
Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEI--------K 1128
K + + +++ ++ QKR + + + + + + K
Sbjct: 51 NGTSLKEHLKQEKLTLQQMLNKLDQKRKELADKHQET---LGLLEALQNALLDEELIEWK 107
Query: 1129 KQQ 1131
++Q
Sbjct: 108 RRQ 110
>gnl|CDD|132050 TIGR03005, ectoine_ehuA, ectoine/hydroxyectoine ABC transporter,
ATP-binding protein. Members of this family are the
ATP-binding protein of a conserved four gene ABC
transporter operon found next to ectoine unilization
operons and ectoine biosynthesis operons. Ectoine is a
compatible solute that protects enzymes from high
osmolarity. It is released by some species in response
to hypoosmotic shock, and it is taken up by a number of
bacteria as a compatible solute or for consumption.
This family shows strong sequence similiarity to a
number of amino acid ABC transporter ATP-binding
proteins.
Length = 252
Score = 31.3 bits (71), Expect = 3.4
Identities = 12/29 (41%), Positives = 19/29 (65%), Gaps = 1/29 (3%)
Query: 44 HGLMMVGPSGSGKSTAWKVLLKALERYEG 72
+ ++GPSGSGKST ++L+ LE +
Sbjct: 27 EKVALIGPSGSGKSTILRILM-TLEPIDE 54
>gnl|CDD|226967 COG4608, AppF, ABC-type oligopeptide transport system, ATPase
component [Amino acid transport and metabolism].
Length = 268
Score = 31.4 bits (72), Expect = 3.5
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 48 MVGPSGSGKSTAWKVLLKALERYEG 72
+VG SG GKST +++L E G
Sbjct: 44 LVGESGCGKSTLGRLILGLEEPTSG 68
>gnl|CDD|224051 COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase
component [Amino acid transport and metabolism].
Length = 240
Score = 31.3 bits (72), Expect = 3.5
Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 4/23 (17%)
Query: 49 VGPSGSGKSTAWKVLLKALERYE 71
+GPSGSGKST LL+ L E
Sbjct: 34 IGPSGSGKST----LLRCLNGLE 52
Score = 30.9 bits (71), Expect = 4.8
Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 16/77 (20%)
Query: 741 EVLDHV-LRIDRIFRQPQGH-LLLIGVSGAGKTTLSRFVAFMNGLSVFQ---IRAHNKYT 795
EVL + L +++ G +++IG SG+GK+TL R +NGL I +
Sbjct: 16 EVLKGISLSVEK------GEVVVIIGPSGSGKSTLLR---CLNGLEEPDSGSITVDGEDV 66
Query: 796 GADFDEDLRTVLRRSGC 812
G +D+ + R+ G
Sbjct: 67 GD--KKDILKLRRKVGM 81
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number of
Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 31.6 bits (72), Expect = 3.7
Identities = 21/123 (17%), Positives = 48/123 (39%), Gaps = 19/123 (15%)
Query: 1237 KVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSI--------------ASYKDEYAQ 1282
++E L +L +L ++ ++ TK I QLE+ Y Q
Sbjct: 255 RIEALEKQLDALRLRYTDKHPDVIATKREIAQLEEQKEEEGSAKNGGPERGEIANPVYQQ 314
Query: 1283 LIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMA-LLKSLGIERER 1341
L + + ++ +++A+ E I + +++ L + VE + L + + +
Sbjct: 315 LQIELAEAEAEIASLEARVAELTARIER---LESLLRTIPE-VEAELTQLNRDYEVNKSN 370
Query: 1342 WEA 1344
+E
Sbjct: 371 YEQ 373
>gnl|CDD|213224 cd03257, ABC_NikE_OppD_transporters, ATP-binding cassette domain
of nickel/oligopeptides specific transporters. The ABC
transporter subfamily specific for the transport of
dipeptides, oligopeptides (OppD), and nickel (NikDE).
The NikABCDE system of E. coli belongs to this family
and is composed of the periplasmic binding protein
NikA, two integral membrane components (NikB and NikC),
and two ATPase (NikD and NikE). The NikABCDE
transporter is synthesized under anaerobic conditions
to meet the increased demand for nickel resulting from
hydrogenase synthesis. The molecular mechanism of
nickel uptake in many bacteria and most archaea is not
known. Many other members of this ABC family are also
involved in the uptake of dipeptides and oligopeptides.
The oligopeptide transport system (Opp) is a
five-component ABC transport composed of a
membrane-anchored substrate binding proteins (SRP),
OppA, two transmembrane proteins, OppB and OppC, and
two ATP-binding domains, OppD and OppF.
Length = 228
Score = 30.9 bits (71), Expect = 3.7
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 48 MVGPSGSGKSTAWKVLLKALERYEG 72
+VG SGSGKST + +L L+ G
Sbjct: 36 LVGESGSGKSTLARAILGLLKPTSG 60
>gnl|CDD|213229 cd03262, ABC_HisP_GlnQ, ATP-binding cassette domain of the
histidine and glutamine transporters. HisP and GlnQ
are the ATP-binding components of the bacterial
periplasmic histidine and glutamine permeases,
respectively. Histidine permease is a multi-subunit
complex containing the HisQ and HisM integral membrane
subunits and two copies of HisP. HisP has properties
intermediate between those of integral and peripheral
membrane proteins and is accessible from both sides of
the membrane, presumably by its interaction with HisQ
and HisM. The two HisP subunits form a homodimer within
the complex. The domain structure of the amino acid
uptake systems is typical for prokaryotic extracellular
solute binding protein-dependent uptake systems. All of
the amino acid uptake systems also have at least one,
and in a few cases, two extracellular solute binding
proteins located in the periplasm of Gram-negative
bacteria, or attached to the cell membrane of
Gram-positive bacteria. The best-studied member of the
PAAT (polar amino acid transport) family is the HisJQMP
system of S. typhimurium, where HisJ is the
extracellular solute binding proteins and HisP is the
ABC protein.
Length = 213
Score = 31.0 bits (71), Expect = 3.8
Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 4/41 (9%)
Query: 48 MVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKE 88
++GPSGSGKST LL+ + E + IID ++ +
Sbjct: 31 IIGPSGSGKST----LLRCINLLEEPDSGTIIIDGLKLTDD 67
>gnl|CDD|224985 COG2074, COG2074, 2-phosphoglycerate kinase [Carbohydrate transport
and metabolism].
Length = 299
Score = 31.2 bits (71), Expect = 3.9
Identities = 25/101 (24%), Positives = 38/101 (37%), Gaps = 25/101 (24%)
Query: 711 VGTTELREYVQARLKVFYEEELDVQLVLFDEVLDHVLRIDRIFRQPQGHL--LLIGVSGA 768
V E+RE Q L+ E L+ R R+ + L L+ G SG
Sbjct: 53 VTKDEIREVYQKLLEKGDPEVAKRYLLW------------RRIRKMKRPLIILIGGASGV 100
Query: 769 GKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRR 809
GK+T++ +A G+ D +R VLR+
Sbjct: 101 GKSTIAGELARRLGIRSV--------ISTDS---IREVLRK 130
>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein. This family includes FlxA from
E. coli. The expression of FlxA is regulated by the FliA
sigma factor, a transcription factor specific for class 3
flagellar operons. However FlxA is not required for
flagellar function or formation.
Length = 106
Score = 29.7 bits (67), Expect = 3.9
Identities = 14/62 (22%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 1042 QVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
Q++ +QK + +++L+ + + +L +E K QQ ++ Q Q +QA+I + + ++
Sbjct: 20 QIKSLQKQIQSLTKQLKELSSSEDLTAEEKQKQQQLIQQ---QIQMLQAQIAQLQQQQSE 76
Query: 1102 KR 1103
+
Sbjct: 77 EA 78
>gnl|CDD|225363 COG2804, PulE, Type II secretory pathway, ATPase PulE/Tfp pilus
assembly pathway, ATPase PilB [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 500
Score = 31.5 bits (72), Expect = 3.9
Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 21/92 (22%)
Query: 735 QLVLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTL------SRFVAFMNGLSV--- 785
+L + L R+ R+ +PQG +L+ G +G+GKTT N +++
Sbjct: 239 KLGMSPFQLA---RLLRLLNRPQGLILVTGPTGSGKTTTLYAALSELNTPERNIITIEDP 295
Query: 786 --------FQIRAHNKYTGADFDEDLRTVLRR 809
Q++ + K G F LR +LR+
Sbjct: 296 VEYQLPGINQVQVNPK-IGLTFARALRAILRQ 326
>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
trafficking and secretion].
Length = 340
Score = 31.5 bits (72), Expect = 3.9
Identities = 21/67 (31%), Positives = 33/67 (49%), Gaps = 9/67 (13%)
Query: 713 TTELREYVQARLKVFYEEELDVQLVLFD---EVLDHVLRIDRIFRQPQGH----LLLIGV 765
ELR+ + K+ EE V+ L + E+L V ++D P+ +L +GV
Sbjct: 90 IEELRKREGKKKKIKDEE--TVKEALREALIEILRPVDKVDLPLEIPKEKKPFVILFVGV 147
Query: 766 SGAGKTT 772
+G GKTT
Sbjct: 148 NGVGKTT 154
>gnl|CDD|226447 COG3937, COG3937, Uncharacterized conserved protein [Function
unknown].
Length = 108
Score = 29.7 bits (67), Expect = 4.0
Identities = 27/104 (25%), Positives = 50/104 (48%), Gaps = 16/104 (15%)
Query: 1032 GLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAE 1091
G G AET E+V+++ L VK EL + EA + ++++ +EA Q E
Sbjct: 15 GAGLAAETAEKVQKLVDEL-VKKGEL-NAEEAKRF-VDDLLRQAKEA----------QGE 61
Query: 1092 IEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
+E++ I +K ++ DL + MD + ++Q+ +L
Sbjct: 62 LEEK---IPRKIEEMLSDLEVARQSEMDELTERVDALERQVADL 102
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 31.7 bits (72), Expect = 4.0
Identities = 16/113 (14%), Positives = 49/113 (43%), Gaps = 8/113 (7%)
Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIE-KQTVEI 1099
E++ + + L+ +EL+ KNE LKE K ++E E+ + ++ + + + E
Sbjct: 511 EEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEA 570
Query: 1100 AQKRVFVMEDLAQV-------EPAVMDAQQAVKEIKKQQLVELRSMANPPSVV 1145
+ + +++ + + ++++++ K + + + P +
Sbjct: 571 QEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQ 623
>gnl|CDD|237421 PRK13539, PRK13539, cytochrome c biogenesis protein CcmA;
Provisional.
Length = 207
Score = 31.0 bits (71), Expect = 4.2
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 755 QPQGHLLLIGVSGAGKTTLSRFVA 778
L+L G +G+GKTTL R +A
Sbjct: 26 AAGEALVLTGPNGSGKTTLLRLIA 49
>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
of proteins involved in ATP-dependent RNA or DNA
unwinding. This domain contains the ATP-binding region.
Length = 144
Score = 30.4 bits (69), Expect = 4.3
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 419 LVLCGPPGSGKTMTLLSAL 437
++L P GSGKT+ L +
Sbjct: 3 VLLAAPTGSGKTLAALLPI 21
>gnl|CDD|234401 TIGR03925, T7SS_EccC_b, type VII secretion protein EccCb. This
model represents the C-terminal domain or EccCb subunit
of the type VII secretion protein EccC as found in the
Actinobacteria. Type VII secretion is defined more
broadly as including secretion systems for ESAT-6-like
proteins in the Firmicutes as well as in the
Actinobacteria, but this family does not show close
homologs in the Firmicutes [Protein fate, Protein and
peptide secretion and trafficking].
Length = 566
Score = 31.5 bits (72), Expect = 4.4
Identities = 12/53 (22%), Positives = 25/53 (47%), Gaps = 5/53 (9%)
Query: 46 LMMVGPSGSGKSTAWKVLLKAL-ERYEGVEGVAHIIDPKAISKEALYGVLDPN 97
L++ G S SGK+T + + + + RY + ++D + +L G +
Sbjct: 366 LLVFGDSESGKTTLLRAIARGITRRYSPQQARLVLVDY----RRSLLGAVPEE 414
>gnl|CDD|234277 TIGR03604, TOMM_cyclo_SagD, thiazole/oxazole-forming peptide
maturase, SagD family component. Members of this
protein family include enzymes related to SagD,
previously referred to as a scaffold or docking protein
involved in the biosynthesis of streptolysin S in
Streptococcus pyogenes from the protoxin polypeptide
(product of the sagA gene). Newer evidence describes an
enzymatic activity, an ATP-dependent cyclodehydration
reaction, previously ascribed to the SagC component.
This protein family serves as a marker for widely
distributed prokaryotic systems for making a general
class of heterocycle-containing bacteriocins.
Length = 377
Score = 31.1 bits (71), Expect = 4.5
Identities = 11/59 (18%), Positives = 25/59 (42%), Gaps = 12/59 (20%)
Query: 575 IYGTFSRAMLRLIPPLRGYADALTNAMVELYLASQEKFTQDMQPHYVYSPREMTRWVRG 633
++ +++ R + PL + L+ S+E++ + P + P +WV G
Sbjct: 33 VFASYAELGERALNPLS----------LGLH--SEEQYAEPGFPFRPFDPDTPIQWVPG 79
>gnl|CDD|213220 cd03253, ABCC_ATM1_transporter, ATP-binding cassette domain of
iron-sulfur clusters transporter, subfamily C. ATM1 is
an ABC transporter that is expressed in the
mitochondria. Although the specific function of ATM1 is
unknown, its disruption results in the accumulation of
excess mitochondrial iron, loss of mitochondrial
cytochromes, oxidative damage to mitochondrial DNA, and
decreased levels of cytosolic heme proteins. ABC
transporters are a large family of proteins involved in
the transport of a wide variety of different compounds,
like sugars, ions, peptides, and more complex organic
molecules. The nucleotide binding domain shows the
highest similarity between all members of the family.
ABC transporters are a subset of nucleotide hydrolases
that contain a signature motif, Q-loop, and
H-loop/switch region, in addition to, the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 236
Score = 31.0 bits (71), Expect = 4.5
Identities = 11/17 (64%), Positives = 14/17 (82%)
Query: 49 VGPSGSGKSTAWKVLLK 65
VGPSGSGKST ++L +
Sbjct: 33 VGPSGSGKSTILRLLFR 49
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 31.2 bits (71), Expect = 4.6
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 10/66 (15%)
Query: 1037 AETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEK--RKVQSQDIQAEIE- 1093
AE V+ E MQK V+ ++L + EAA K E + Q+E +K ++VQ + +Q +
Sbjct: 108 AEMVKSDEHMQK---VR-KKLLA--EAAAKKASEEARKQRELKKFGKQVQVEKLQERAKE 161
Query: 1094 -KQTVE 1098
K +E
Sbjct: 162 KKDMLE 167
>gnl|CDD|224045 COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport
systems, ATPase components [Inorganic ion transport and
metabolism / Coenzyme metabolism].
Length = 258
Score = 31.0 bits (71), Expect = 4.6
Identities = 12/22 (54%), Positives = 14/22 (63%), Gaps = 4/22 (18%)
Query: 49 VGPSGSGKSTAWKVLLKALERY 70
+GP+GSGKST LLK L
Sbjct: 34 LGPNGSGKST----LLKCLAGL 51
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 31.3 bits (71), Expect = 4.6
Identities = 29/147 (19%), Positives = 63/147 (42%), Gaps = 39/147 (26%)
Query: 1016 REKCSELEEQQLHL----------------------NVGLGKIAETVEQVEEMQKSLAVK 1053
+EK LEE+ HL N + ++ E +Q+ + + LA K
Sbjct: 167 QEKLKLLEEENEHLRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLSEELAKK 226
Query: 1054 SQELQSKNEAANLKLKEMIKDQQE-----AEKRKVQ-----SQDIQAEIEKQTVEIAQKR 1103
+++L+ + E L +++ Q++ E ++Q ++D Q +++ + E+ K
Sbjct: 227 TEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQELQDKY 286
Query: 1104 VFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
ME L +AQ+ +K ++ +
Sbjct: 287 AECMEML-------HEAQEELKNLRNK 306
>gnl|CDD|213192 cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding
cassette component of cobalt transport system. Domain
I of the ABC component of a cobalt transport family
found in bacteria, archaea, and eukaryota. The
transition metal cobalt is an essential component of
many enzymes and must be transported into cells in
appropriate amounts when needed. This ABC transport
system of the CbiMNQO family is involved in cobalt
transport in association with the cobalamin (vitamin
B12) biosynthetic pathways. Most of cobalt (Cbi)
transport systems possess a separate CbiN component,
the cobalt-binding periplasmic protein, and they are
encoded by the conserved gene cluster cbiMNQO. Both the
CbiM and CbiQ proteins are integral cytoplasmic
membrane proteins, and the CbiO protein has the linker
peptide and the Walker A and B motifs commonly found in
the ATPase components of the ABC-type transport
systems.
Length = 211
Score = 30.5 bits (70), Expect = 4.8
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 7/35 (20%)
Query: 36 LYQIS-NLNHG--LMMVGPSGSGKSTAWKVLLKAL 67
L IS + G +++VGP+GSGKST LL+ L
Sbjct: 17 LDDISLTIKKGEFVLIVGPNGSGKST----LLRLL 47
Score = 30.5 bits (70), Expect = 5.3
Identities = 15/43 (34%), Positives = 24/43 (55%), Gaps = 9/43 (20%)
Query: 742 VLDHV-LRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGL 783
LD + L I + +L++G +G+GK+TL R +NGL
Sbjct: 16 ALDDISLTI-----KKGEFVLIVGPNGSGKSTLLR---LLNGL 50
>gnl|CDD|215558 PLN03073, PLN03073, ABC transporter F family; Provisional.
Length = 718
Score = 31.4 bits (71), Expect = 4.8
Identities = 20/53 (37%), Positives = 26/53 (49%), Gaps = 14/53 (26%)
Query: 26 GGPWMEKVLQLYQISNLNHG------LMMVGPSGSGKSTAWKVLLKALERYEG 72
GGP + K NLN G + MVGP+G GKST K++ L+ G
Sbjct: 520 GGPLLFK--------NLNFGIDLDSRIAMVGPNGIGKSTILKLISGELQPSSG 564
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This family
includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 31.2 bits (69), Expect = 4.9
Identities = 30/118 (25%), Positives = 63/118 (53%), Gaps = 11/118 (9%)
Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKL-------KEMIKDQQEAEKRKVQSQD 1087
++ E +QVE+++KS K+Q+ +K+ A K+ KE + + EA+KR Q ++
Sbjct: 59 ELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEE 118
Query: 1088 -IQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSV 1144
Q + E++ ++ Q++ DL +++ A ++AVK K ++ + + + P V
Sbjct: 119 HKQWKAEQERIQKEQEK--KEADLKKLQ-AEKKKEKAVKAEKAEKAEKTKKASTPAPV 173
>gnl|CDD|233193 TIGR00929, VirB4_CagE, type IV secretion/conjugal transfer ATPase,
VirB4 family. Type IV secretion systems are found in
Gram-negative pathogens. They export proteins, DNA, or
complexes in different systems and are related to
plasmid conjugation systems. This model represents
related ATPases that include VirB4 in Agrobacterium
tumefaciens (DNA export) CagE in Helicobacter pylori
(protein export) and plasmid TraB (conjugation).
Length = 785
Score = 31.5 bits (72), Expect = 4.9
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 758 GHLLLIGVSGAGKTTLSRFVA 778
GH L+ G +G+GKTTL F+
Sbjct: 435 GHTLIFGPTGSGKTTLLNFLL 455
>gnl|CDD|172734 PRK14246, PRK14246, phosphate ABC transporter ATP-binding
protein; Provisional.
Length = 257
Score = 30.8 bits (69), Expect = 4.9
Identities = 14/24 (58%), Positives = 19/24 (79%)
Query: 48 MVGPSGSGKSTAWKVLLKALERYE 71
++GPSGSGKST KVL + +E Y+
Sbjct: 41 IMGPSGSGKSTLLKVLNRLIEIYD 64
>gnl|CDD|223473 COG0396, sufC, Cysteine desulfurase activator ATPase
[Posttranslational modification, protein turnover,
chaperones].
Length = 251
Score = 30.6 bits (70), Expect = 5.0
Identities = 14/32 (43%), Positives = 19/32 (59%), Gaps = 3/32 (9%)
Query: 44 HGLMMVGPSGSGKSTAWKVLLKALERYEGVEG 75
H +M GP+GSGKST ++ +YE EG
Sbjct: 33 HAIM--GPNGSGKSTLAYTIM-GHPKYEVTEG 61
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 29.9 bits (68), Expect = 5.1
Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 6/97 (6%)
Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI 1072
LY+E EL+E+ L K+ E E E +QK+L V + + A + + +I
Sbjct: 36 ALYKEN-EELKEEIERLE---EKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELII 91
Query: 1073 KD-QQEAEKRKVQSQDIQAEIEKQTVEI-AQKRVFVM 1107
K+ + +AE+ + + ++ + E+ Q R+F
Sbjct: 92 KEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFRT 128
>gnl|CDD|213196 cd03229, ABC_Class3, ATP-binding cassette domain of the binding
protein-dependent transport systems. This class is
comprised of all BPD (Binding Protein Dependent)
systems that are largely represented in archaea and
eubacteria and are primarily involved in scavenging
solutes from the environment. ABC transporters are a
large family of proteins involved in the transport of a
wide variety of different compounds, like sugars, ions,
peptides, and more complex organic molecules. The
nucleotide binding domain shows the highest similarity
between all members of the family. ABC transporters are
a subset of nucleotide hydrolases that contain a
signature motif, Q-loop, and H-loop/switch region, in
addition to, the Walker A motif/P-loop and Walker B
motif commonly found in a number of ATP- and
GTP-binding and hydrolyzing proteins.
Length = 178
Score = 30.2 bits (69), Expect = 5.1
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
+GPSGSGKST + + E G
Sbjct: 32 LGPSGSGKSTLLRCIAGLEEPDSG 55
>gnl|CDD|237422 PRK13548, hmuV, hemin importer ATP-binding subunit; Provisional.
Length = 258
Score = 30.9 bits (71), Expect = 5.2
Identities = 11/19 (57%), Positives = 15/19 (78%), Gaps = 4/19 (21%)
Query: 49 VGPSGSGKSTAWKVLLKAL 67
+GP+G+GKST LL+AL
Sbjct: 34 LGPNGAGKST----LLRAL 48
>gnl|CDD|178596 PLN03025, PLN03025, replication factor C subunit; Provisional.
Length = 319
Score = 30.9 bits (70), Expect = 5.5
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 419 LVLCGPPGSGKTMTLLSALRAL 440
L+L GPPG+GKT ++L+ L
Sbjct: 37 LILSGPPGTGKTTSILALAHEL 58
>gnl|CDD|213216 cd03249, ABC_MTABC3_MDL1_MDL2, ATP-binding cassette domain of a
mitochondrial protein MTABC3 and related proteins.
MTABC3 (also known as ABCB6) is a mitochondrial
ATP-binding cassette protein involved in iron
homeostasis and one of four ABC transporters expressed
in the mitochondrial inner membrane, the other three
being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast
MDL1 (multidrug resistance-like protein 1) and MDL2
(multidrug resistance-like protein 2) transporters are
also included in this CD. MDL1 is an ATP-dependent
permease that acts as a high-copy suppressor of ATM1
and is thought to have a role in resistance to
oxidative stress. Interestingly, subfamily B is more
closely related to the carboxyl-terminal component of
subfamily C than the two halves of ABCC molecules are
with one another.
Length = 238
Score = 30.6 bits (70), Expect = 5.5
Identities = 11/22 (50%), Positives = 14/22 (63%), Gaps = 4/22 (18%)
Query: 49 VGPSGSGKSTAWKVLLKALERY 70
VG SG GKST ++ LER+
Sbjct: 35 VGSSGCGKST----VVSLLERF 52
>gnl|CDD|236442 PRK09270, PRK09270, nucleoside triphosphate hydrolase
domain-containing protein; Reviewed.
Length = 229
Score = 30.7 bits (70), Expect = 5.6
Identities = 11/20 (55%), Positives = 14/20 (70%), Gaps = 1/20 (5%)
Query: 421 LCGPPGSGKTMTLLSALRAL 440
+ GPPG+GK+ TL L AL
Sbjct: 38 IAGPPGAGKS-TLAEFLEAL 56
>gnl|CDD|116340 pfam07726, AAA_3, ATPase family associated with various cellular
activities (AAA). This Pfam entry includes some of the
AAA proteins not detected by the pfam00004 model.
Length = 131
Score = 29.8 bits (68), Expect = 5.6
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 759 HLLLIGVSGAGKTTLSRFVA 778
H+LL GV G KT L+R +A
Sbjct: 1 HVLLEGVPGLAKTLLARTLA 20
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 31.0 bits (70), Expect = 5.6
Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 8/109 (7%)
Query: 1033 LGKIAETVEQ-VEEMQKSLAVKSQELQSKNEA-ANLKLKEMIKDQQEAEKRKVQSQDIQA 1090
+G I E +E ++S + + + Q++ EA A + E I Q EA + +
Sbjct: 324 VGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAE 383
Query: 1091 EIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMA 1139
E+ + A E Q + + +A K + Q E+++ A
Sbjct: 384 RAEQAALVAAA------EAAEQEQVEIAVRAEAAKAEAEAQAAEIKAEA 426
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized. This
protein is found in bacteria and eukaryotes. Proteins in
this family are about 260 amino acids in length.
Length = 250
Score = 30.7 bits (70), Expect = 5.6
Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 8/83 (9%)
Query: 1036 IAETVEQV-EEMQKSL--AVKSQELQSKNEAANLKLKEMIKD-QQEAEKRKVQSQDIQAE 1091
A +++Q Q+++ A +SQ+ K +L I+ +E E +V + +Q
Sbjct: 20 AAASLDQALAAAQQTVAAAAQSQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRL 79
Query: 1092 IEKQTVEIAQKRVFVMEDLAQVE 1114
+ Q EIA + + + Q+E
Sbjct: 80 VANQQQEIAS----LQQQIEQIE 98
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 31.2 bits (71), Expect = 5.7
Identities = 24/118 (20%), Positives = 59/118 (50%), Gaps = 10/118 (8%)
Query: 1015 YREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD 1074
RE+ ELE + L + ++ E +E+ E++ + + + L+ + E E+I +
Sbjct: 473 DRERVEELEAELEDLEEEVEEVEERLERAEDLVE-AEDRIERLEERREDLE----ELIAE 527
Query: 1075 QQE-AEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
++E E+++ ++++++ + E +KR E A+ E +A++ V E+ +
Sbjct: 528 RRETIEEKRERAEELRERAAELEAEAEEKR----EAAAEAEEEAEEAREEVAELNSKL 581
>gnl|CDD|223521 COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport
system, ATPase component [Amino acid transport and
metabolism / Inorganic ion transport and metabolism].
Length = 316
Score = 31.0 bits (71), Expect = 5.8
Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 2/33 (6%)
Query: 42 LNHG--LMMVGPSGSGKSTAWKVLLKALERYEG 72
L G L +VG SGSGKS K ++ L +
Sbjct: 28 LKKGEILGIVGESGSGKSVLAKAIMGLLPKPNA 60
>gnl|CDD|233878 TIGR02467, CbiE, precorrin-6y C5,15-methyltransferase
(decarboxylating), CbiE subunit. This model recognizes
the CbiE methylase which is responsible, in part (along
with CbiT), for methylating precorrin-6y (or
cobalt-precorrin-6y) at both the 5 and 15 positions as
well as the concomitant decarbozylation at C-12. In many
organisms, this protein is fused to the CbiT subunit. The
fused protein, when found in organisms catalyzing the
oxidative version of the cobalamin biosynthesis pathway,
is called CobL.
Length = 204
Score = 30.4 bits (69), Expect = 5.8
Identities = 10/29 (34%), Positives = 13/29 (44%)
Query: 1388 HLIAAGIQFRPEIALTEYLSSPDERLRWQ 1416
LI GI E+ + E L DER+
Sbjct: 158 ELIELGIGGSYELTVGENLGYEDERITEG 186
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 31.2 bits (71), Expect = 5.8
Identities = 46/258 (17%), Positives = 84/258 (32%), Gaps = 56/258 (21%)
Query: 1034 GKIAETVEQVEEMQKSLAVKSQELQSKNE---------AANLKLKEMIKDQQEAEKRKVQ 1084
G + E EQ+E K + + +++ L + K Q + E++ Q
Sbjct: 258 GDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQ 317
Query: 1085 SQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSV 1144
+Q Q E+E++ E+ + + LA VKEI + +A
Sbjct: 318 NQLTQEELEERDKELLE----SLPKLALPAE-------HVKEIAAE-------LAEIDKP 359
Query: 1145 VKLALESICLLLGENATDWK-AIRAVVMRENFINSIVSNFNTEMIT--DEVREKMHSRYL 1201
E L G T + I+ V S + E+ EV +K+
Sbjct: 360 ATTDSEIPHRLSGSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKIS---- 415
Query: 1202 SNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEE 1261
+ P A +L+++ + EL E + E + E
Sbjct: 416 TIPSEE----------------------QIAQLLEELGEAQNELFRSEAEIEELLRQLET 453
Query: 1262 TKDLITQLEKSIASYKDE 1279
K+ I L K++ +
Sbjct: 454 LKEAIEALRKTLDEKTKQ 471
>gnl|CDD|222194 pfam13521, AAA_28, AAA domain.
Length = 162
Score = 29.8 bits (68), Expect = 5.9
Identities = 12/21 (57%), Positives = 14/21 (66%), Gaps = 1/21 (4%)
Query: 420 VLCGPPGSGKTMTLLSALRAL 440
V+ G P +GKT TLL AL A
Sbjct: 3 VITGGPSTGKT-TLLEALAAR 22
>gnl|CDD|162242 TIGR01187, potA, spermidine/putrescine ABC transporter ATP-binding
subunit. This model describes spermidine/putrescine ABC
transporter, ATP binding subunit in bacteria and its
equivalents in archaea. This transport system belong to
the larger ATP-Binding Cassette (ABC) transporter
superfamily. The characteristic feature of these
transporter is the obligatory coupling of ATP hydrolysis
to substrate translocation. The minimal configuration of
bacterial ABC transport system: an ATPase or ATP binding
subunit; An integral membrane protein; a hydrophilic
polypetpide, which likely functions as substrate binding
protein. Polyamines like spermidine and putrescine play
vital role in cell proliferation, differentiation, and
ion homeostasis. The concentration of polyamines within
the cell are regulated by biosynthesis, degradation and
transport (uptake and efflux included) [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 325
Score = 30.9 bits (70), Expect = 5.9
Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 10/48 (20%)
Query: 762 LIGVSGAGKTTLSRFVA----------FMNGLSVFQIRAHNKYTGADF 799
L+G SG GKTTL R +A ++G V + H ++ F
Sbjct: 1 LLGPSGCGKTTLLRLLAGFEQPDSGSIMLDGEDVTNVPPHLRHINMVF 48
>gnl|CDD|224042 COG1117, PstB, ABC-type phosphate transport system, ATPase
component [Inorganic ion transport and metabolism].
Length = 253
Score = 30.6 bits (70), Expect = 6.0
Identities = 12/22 (54%), Positives = 15/22 (68%), Gaps = 4/22 (18%)
Query: 48 MVGPSGSGKSTAWKVLLKALER 69
++GPSG GKST LL+ L R
Sbjct: 38 LIGPSGCGKST----LLRCLNR 55
>gnl|CDD|213264 cd03297, ABC_ModC_molybdenum_transporter, ATP-binding cassette
domain of the molybdenum transport system. ModC is an
ABC-type transporter and the ATPase component of a
molybdate transport system that also includes the
periplasmic binding protein ModA and the membrane
protein ModB. ABC transporters are a large family of
proteins involved in the transport of a wide variety of
different compounds, like sugars, ions, peptides and
more complex organic molecules. The nucleotide binding
domain shows the highest similarity between all members
of the family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 214
Score = 30.3 bits (69), Expect = 6.2
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 2/38 (5%)
Query: 741 EVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVA 778
+ D L+ID F + + G SGAGK+TL R +A
Sbjct: 9 RLPDFTLKID--FDLNEEVTGIFGASGAGKSTLLRCIA 44
>gnl|CDD|213222 cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of
the transporters involved in export of lipoprotein and
macrolide, and cell division protein. This family is
comprised of MJ0796 ATP-binding cassette,
macrolide-specific ABC-type efflux carrier (MacAB), and
proteins involved in cell division (FtsE), and release
of lipoproteins from the cytoplasmic membrane (LolCDE).
They are clustered together phylogenetically. MacAB is
an exporter that confers resistance to macrolides,
while the LolCDE system is not a transporter at all. An
FtsE null mutants showed filamentous growth and
appeared viable on high salt medium only, indicating a
role for FtsE in cell division and/or salt transport.
The LolCDE complex catalyzes the release of
lipoproteins from the cytoplasmic membrane prior to
their targeting to the outer membrane.
Length = 218
Score = 30.5 bits (70), Expect = 6.2
Identities = 10/13 (76%), Positives = 12/13 (92%)
Query: 46 LMMVGPSGSGKST 58
+ +VGPSGSGKST
Sbjct: 33 VAIVGPSGSGKST 45
Score = 29.8 bits (68), Expect = 8.7
Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 8/35 (22%)
Query: 741 EVLDHV-LRIDRIFRQPQG-HLLLIGVSGAGKTTL 773
+ L V L I++ G + ++G SG+GK+TL
Sbjct: 18 QALKGVSLSIEK------GEFVAIVGPSGSGKSTL 46
>gnl|CDD|179157 PRK00889, PRK00889, adenylylsulfate kinase; Provisional.
Length = 175
Score = 30.0 bits (68), Expect = 6.3
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 762 LIGVSGAGKTTLSRFVA 778
G+SGAGKTT++R +A
Sbjct: 9 FTGLSGAGKTTIARALA 25
>gnl|CDD|224043 COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase
component [Inorganic ion transport and metabolism].
Length = 345
Score = 30.8 bits (70), Expect = 6.4
Identities = 13/24 (54%), Positives = 17/24 (70%), Gaps = 1/24 (4%)
Query: 756 PQGHLL-LIGVSGAGKTTLSRFVA 778
G L+ L+G SGAGK+TL R +A
Sbjct: 26 KSGELVALLGPSGAGKSTLLRIIA 49
>gnl|CDD|222036 pfam13304, AAA_21, AAA domain.
Length = 256
Score = 30.5 bits (68), Expect = 6.5
Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 1/33 (3%)
Query: 420 VLCGPPGSGKTMTLLSALRALPDMEVVSLNFSS 452
V+ GP GSGK+ LL AL L + + L
Sbjct: 3 VIIGPNGSGKS-NLLKALALLLLLLSLGLTLDR 34
>gnl|CDD|234015 TIGR02788, VirB11, P-type DNA transfer ATPase VirB11. The VirB11
protein is found in the vir locus of Agrobacterium Ti
plasmids where it is involved in the type IV secretion
system for DNA transfer. VirB11 is believed to be an
ATPase. VirB11 is a homolog of the P-like conjugation
system TrbB protein and the Flp pilus sytem protein
TadA.
Length = 308
Score = 30.8 bits (70), Expect = 6.7
Identities = 15/46 (32%), Positives = 22/46 (47%)
Query: 399 TLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRALPDME 444
LD + L +A K +++ G GSGKT L S + +P E
Sbjct: 127 LLDAGDIKEFLRLAIASRKNIIISGGTGSGKTTFLKSLVDEIPKDE 172
>gnl|CDD|225183 COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain
an N-terminal double-glycine peptidase domain [Defense
mechanisms].
Length = 709
Score = 31.0 bits (71), Expect = 6.7
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
VG SGSGKST K+LL + +G
Sbjct: 505 VGRSGSGKSTLLKLLLGLYKPQQG 528
>gnl|CDD|218640 pfam05565, Sipho_Gp157, Siphovirus Gp157. This family contains both
viral and bacterial proteins which are related to the
Gp157 protein of the Streptococcus thermophilus SFi
bacteriophages. It is thought that bacteria possessing
the gene coding for this protein have an increased
resistance to the bacteriophage.
Length = 162
Score = 29.9 bits (68), Expect = 6.7
Identities = 16/64 (25%), Positives = 26/64 (40%), Gaps = 2/64 (3%)
Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDE-YAQLIAQA-TAI 1290
+ KVE + +K+LE KA+ + + +E + KD + A I
Sbjct: 37 EFEDKVENIAKVIKNLEADIEAIKAEIKRLAERKKSIENKVKRLKDYLEEAMEATGIKKI 96
Query: 1291 KTDL 1294
KT L
Sbjct: 97 KTPL 100
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 30.0 bits (68), Expect = 6.7
Identities = 14/86 (16%), Positives = 42/86 (48%)
Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
RE+ EL+++ L L ++ E + ++++ + L + L+++ E +L + +
Sbjct: 65 RERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLS 124
Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
A + ++++++ E+ + E
Sbjct: 125 ANAIELDEENRELREELAELKQENEA 150
>gnl|CDD|213202 cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the
metal-type transporters. This family includes
transporters involved in the uptake of various metallic
cations such as iron, manganese, and zinc. The ATPases
of this group of transporters are very similar to
members of iron-siderophore uptake family suggesting
that they share a common ancestor. The best
characterized metal-type ABC transporters are the
YfeABCD system of Y. pestis, the SitABCD system of
Salmonella enterica serovar Typhimurium, and the
SitABCD transporter of Shigella flexneri. Moreover
other uncharacterized homologs of these metal-type
transporters are mainly found in pathogens like
Haemophilus or enteroinvasive E. coli isolates.
Length = 213
Score = 30.2 bits (69), Expect = 6.7
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
VGP+G+GKST K +L L+ G
Sbjct: 31 VGPNGAGKSTLLKAILGLLKPTSG 54
>gnl|CDD|223873 COG0802, COG0802, Predicted ATPase or kinase [General function
prediction only].
Length = 149
Score = 29.5 bits (67), Expect = 6.8
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 760 LLLIGVSGAGKTTLSRFVA 778
+LL G GAGKTTL R +A
Sbjct: 28 VLLSGDLGAGKTTLVRGIA 46
>gnl|CDD|226969 COG4618, ArpD, ABC-type protease/lipase transport system, ATPase
and permease components [General function prediction
only].
Length = 580
Score = 30.7 bits (70), Expect = 6.9
Identities = 11/20 (55%), Positives = 16/20 (80%)
Query: 45 GLMMVGPSGSGKSTAWKVLL 64
L ++GPSGSGKST ++L+
Sbjct: 364 ALGIIGPSGSGKSTLARLLV 383
>gnl|CDD|226359 COG3839, MalK, ABC-type sugar transport systems, ATPase
components [Carbohydrate transport and metabolism].
Length = 338
Score = 30.7 bits (70), Expect = 7.0
Identities = 10/20 (50%), Positives = 14/20 (70%), Gaps = 4/20 (20%)
Query: 48 MVGPSGSGKSTAWKVLLKAL 67
++GPSG GKST LL+ +
Sbjct: 34 LLGPSGCGKST----LLRMI 49
>gnl|CDD|213180 cd03213, ABCG_EPDR, Eye pigment and drug resistance transporter
subfamily G of the ATP-binding cassette superfamily.
ABCG transporters are involved in eye pigment (EP)
precursor transport, regulation of lipid-trafficking
mechanisms, and pleiotropic drug resistance (DR). DR is
a well-described phenomenon occurring in fungi and
shares several similarities with processes in bacteria
and higher eukaryotes. Compared to other members of the
ABC transporter subfamilies, the ABCG transporter family
is composed of proteins that have an ATP-binding
cassette domain at the N-terminus and a TM
(transmembrane) domain at the C-terminus.
Length = 194
Score = 30.2 bits (69), Expect = 7.0
Identities = 12/22 (54%), Positives = 14/22 (63%), Gaps = 1/22 (4%)
Query: 758 GHLLLI-GVSGAGKTTLSRFVA 778
G L I G SGAGK+TL +A
Sbjct: 35 GELTAIMGPSGAGKSTLLNALA 56
>gnl|CDD|188099 TIGR00972, 3a0107s01c2, phosphate ABC transporter, ATP-binding
protein. This model represents the ATP-binding protein
of a family of ABC transporters for inorganic
phosphate. In the model species Escherichia coli, a
constitutive transporter for inorganic phosphate, with
low affinity, is also present. The high affinity
transporter that includes this polypeptide is induced
when extracellular phosphate concentrations are low.
The proteins most similar to the members of this family
but not included appear to be amino acid transporters
[Transport and binding proteins, Anions].
Length = 247
Score = 30.3 bits (69), Expect = 7.1
Identities = 15/30 (50%), Positives = 20/30 (66%), Gaps = 5/30 (16%)
Query: 48 MVGPSGSGKSTAWKVLLKALER-YEGVEGV 76
++GPSG GKST LL++L R + V GV
Sbjct: 32 LIGPSGCGKST----LLRSLNRMNDLVPGV 57
>gnl|CDD|226646 COG4178, COG4178, ABC-type uncharacterized transport system,
permease and ATPase components [General function
prediction only].
Length = 604
Score = 30.7 bits (70), Expect = 7.2
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 755 QPQGHLLLIGVSGAGKTTLSRFVA 778
+P LL+ G SGAGKT+L R +A
Sbjct: 417 RPGERLLITGESGAGKTSLLRALA 440
>gnl|CDD|129690 TIGR00602, rad24, checkpoint protein rad24. All proteins in this
family for which functions are known are involved in DNA
damage tolerance (likely cell cycle checkpoints).This
family is based on the phylogenomic analysis of JA Eisen
(1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 637
Score = 30.7 bits (69), Expect = 7.2
Identities = 23/63 (36%), Positives = 33/63 (52%), Gaps = 5/63 (7%)
Query: 6 EKIKEVCREEFLVCGEGNEEGGPWME-KVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLL 64
EK K + E V + EE W++ +VL+ + L++ GPSG GKST K+L
Sbjct: 76 EKYKPETQHELAVHKKKIEEVETWLKAQVLE----NAPKRILLITGPSGCGKSTTIKILS 131
Query: 65 KAL 67
K L
Sbjct: 132 KEL 134
>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
HflB(FtsH) is a pleiotropic protein required for correct
cell division in bacteria. It has ATP-dependent zinc
metalloprotease activity. It was formerly designated
cell division protein FtsH [Cellular processes, Cell
division, Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 495
Score = 30.7 bits (70), Expect = 7.3
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 7/48 (14%)
Query: 754 RQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDE 801
+ P+G +LL+G G GKT L++ VA G+ F I +G+DF E
Sbjct: 86 KIPKG-VLLVGPPGTGKTLLAKAVAGEAGVPFFSI------SGSDFVE 126
>gnl|CDD|203260 pfam05496, RuvB_N, Holliday junction DNA helicase ruvB N-terminus.
The RuvB protein makes up part of the RuvABC revolvasome
which catalyzes the resolution of Holliday junctions
that arise during genetic recombination and DNA repair.
Branch migration is catalyzed by the RuvB protein that
is targeted to the Holliday junction by the structure
specific RuvA protein. This family contains the
N-terminal region of the protein.
Length = 231
Score = 30.1 bits (69), Expect = 7.4
Identities = 23/71 (32%), Positives = 29/71 (40%), Gaps = 30/71 (42%)
Query: 716 LREYV-QARLKVFYEEELDVQLVLF-------DEVLDHVLRIDRIFRQPQGHLLLIGVSG 767
L EY+ Q ++K E L + F E LDHVL L G G
Sbjct: 23 LDEYIGQEKVK----ENLKI----FIEAAKKRGEALDHVL--------------LYGPPG 60
Query: 768 AGKTTLSRFVA 778
GKTTL+ +A
Sbjct: 61 LGKTTLANIIA 71
>gnl|CDD|181906 PRK09493, glnQ, glutamine ABC transporter ATP-binding protein;
Reviewed.
Length = 240
Score = 30.1 bits (68), Expect = 7.6
Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 16/53 (30%)
Query: 48 MVGPSGSGKSTAWKVLLK---ALERYEG----VEGVAHIIDPKA----ISKEA 89
++GPSGSGKST LL+ LE V+G+ + DPK I +EA
Sbjct: 32 IIGPSGSGKST----LLRCINKLEEITSGDLIVDGL-KVNDPKVDERLIRQEA 79
>gnl|CDD|213199 cd03232, ABCG_PDR_domain2, Second domain of the pleiotropic drug
resistance-like (PDR) subfamily G of ATP-binding
cassette transporters. The pleiotropic drug resistance
(PDR) is a well-described phenomenon occurring in fungi
and shares several similarities with processes in
bacteria and higher eukaryotes. This PDR subfamily
represents domain I of its (ABC-IM)2 organization. ABC
transporters are a large family of proteins involved in
the transport of a wide variety of different compounds
including sugars, ions, peptides, and more complex
organic molecules. The nucleotide binding domain shows
the highest similarity between all members of the
family. ABC transporters are a subset of nucleotide
hydrolases that contain a signature motif, Q-loop, and
H-loop/switch region, in addition to, the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing proteins.
Length = 192
Score = 29.9 bits (68), Expect = 7.7
Identities = 10/12 (83%), Positives = 11/12 (91%)
Query: 762 LIGVSGAGKTTL 773
L+G SGAGKTTL
Sbjct: 38 LMGESGAGKTTL 49
>gnl|CDD|223203 COG0125, Tmk, Thymidylate kinase [Nucleotide transport and
metabolism].
Length = 208
Score = 30.0 bits (68), Expect = 7.8
Identities = 40/244 (16%), Positives = 76/244 (31%), Gaps = 86/244 (35%)
Query: 50 GPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTREWTDGLFTHI 109
G G+GK+T ++L + LE G++ V TRE
Sbjct: 10 GIDGAGKTTQAELLKERLEE-RGIKVVL--------------------TREPGGTPIGEK 48
Query: 110 LRRIIDNVRGEISKR-------------------------QWIIFD-------------G 131
+R ++ N ++S + + +I D
Sbjct: 49 IRELLLNGEEKLSPKAEALLFAADRAQHLEEVIKPALKEGKVVICDRYVDSSLAYQGGGR 108
Query: 132 DVDPEWVENLNSVLDDNKLLTLPNGERLSLPPNIRIMFEVQDLKYATLATVSRCGMIWFS 191
+D +WV LN L P++ + +V + A L + + G
Sbjct: 109 GLDLDWVLALNEFA------------PGGLKPDLTLYLDV-PPEVA-LERIRKRGE---L 151
Query: 192 EDVLSTEMIFENYLSRLRNIALDDIDDDSSLLITVDATGKAPDDVLSPALTLQQDVASIL 251
D E +L ++R L+ +I +DA+ ++V +++ IL
Sbjct: 152 RDRFEKEDD--EFLEKVREGYLELAAKFPERIIVIDASRP-LEEV-------HEEILKIL 201
Query: 252 STHF 255
Sbjct: 202 KERL 205
>gnl|CDD|215414 PLN02772, PLN02772, guanylate kinase.
Length = 398
Score = 30.6 bits (69), Expect = 7.8
Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 7/41 (17%)
Query: 417 KPLVLCGPPGSGKTMTLLSAL-RALPDMEVVSLNFS-SATT 455
KP+V+ GP G GK TL+S L + P M FS S TT
Sbjct: 136 KPIVISGPSGVGKG-TLISMLMKEFPSM----FGFSVSHTT 171
>gnl|CDD|132302 TIGR03258, PhnT, 2-aminoethylphosphonate ABC transport system,
ATP-binding component PhnT. This ATP-binding component
of an ABC transport system is found in Salmonella and
Burkholderia lineages in the vicinity of enzymes for the
breakdown of 2-aminoethylphosphonate.
Length = 362
Score = 30.3 bits (68), Expect = 7.8
Identities = 13/19 (68%), Positives = 14/19 (73%)
Query: 760 LLLIGVSGAGKTTLSRFVA 778
L LIG SG GKTTL R +A
Sbjct: 34 LALIGKSGCGKTTLLRAIA 52
>gnl|CDD|179118 PRK00771, PRK00771, signal recognition particle protein Srp54;
Provisional.
Length = 437
Score = 30.6 bits (70), Expect = 7.8
Identities = 25/73 (34%), Positives = 38/73 (52%), Gaps = 19/73 (26%)
Query: 709 VPVGTTELREYVQARLKVFYEEELDVQLV-LFDEVLDHVLRIDRIFRQPQGHLLLIGVSG 767
P G T RE+V +K+ YEE LV L E + + +PQ ++L+G+ G
Sbjct: 60 PPKGLTP-REHV---IKIVYEE-----LVKLLGEETE----PLVLPLKPQ-TIMLVGLQG 105
Query: 768 AGKTT----LSRF 776
+GKTT L+R+
Sbjct: 106 SGKTTTAAKLARY 118
>gnl|CDD|129124 TIGR00012, L29, ribosomal protein L29. This model describes a
ribosomal large subunit protein, called L29 in
prokaryotic (50S) large subunits and L35 in eukaryotic
(60S) large subunits [Protein synthesis, Ribosomal
proteins: synthesis and modification].
Length = 55
Score = 27.6 bits (62), Expect = 8.0
Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 3/49 (6%)
Query: 1226 IAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIA 1274
+ + S ++ KK++ L+ EL L Q + I Q+ + IA
Sbjct: 2 LREKSKEELAKKLDELKKELFELRFQKA---TGQLAKPHRIRQVRRDIA 47
>gnl|CDD|224041 COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport
system, ATPase component [Inorganic ion transport and
metabolism].
Length = 248
Score = 30.2 bits (69), Expect = 8.0
Identities = 10/22 (45%), Positives = 15/22 (68%), Gaps = 4/22 (18%)
Query: 46 LMMVGPSGSGKSTAWKVLLKAL 67
+ ++GPSG GKST LL+ +
Sbjct: 32 VAILGPSGCGKST----LLRLI 49
>gnl|CDD|184430 PRK13975, PRK13975, thymidylate kinase; Provisional.
Length = 196
Score = 29.7 bits (67), Expect = 8.0
Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 7/58 (12%)
Query: 764 GVSGAGKTTLSRFVA-FMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGCKNEKIAFL 820
G+ G+GKTT ++ +A +N + T + +R +L S C E +A L
Sbjct: 9 GIDGSGKTTQAKLLAEKLNAFWTCEP------TDGKIGKLIREILSGSKCDKETLALL 60
>gnl|CDD|224058 COG1135, AbcC, ABC-type metal ion transport system, ATPase
component [Inorganic ion transport and metabolism].
Length = 339
Score = 30.3 bits (69), Expect = 8.1
Identities = 12/23 (52%), Positives = 15/23 (65%), Gaps = 3/23 (13%)
Query: 762 LIGVSGAGKTTLSRFVAFMNGLS 784
+IG SGAGK+TL R + N L
Sbjct: 37 IIGYSGAGKSTLLRLI---NLLE 56
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is involved
in determining the site of synaptic vesicle fusion. The
C-terminus is a PDZ-binding motif that binds directly to
RIM (a small G protein Rab-3A effector). The family also
contains four coiled-coil domains.
Length = 774
Score = 30.8 bits (69), Expect = 8.1
Identities = 23/129 (17%), Positives = 57/129 (44%), Gaps = 14/129 (10%)
Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
+ L R+ +L+ + L L E ++++ EM +S + ++ + + E
Sbjct: 130 LFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDS--------ER 181
Query: 1072 IKDQQEAEKRKVQSQDIQAEIEKQTVEIAQK--RVFVMEDLAQVEPAVMDAQQAVKEIKK 1129
+ +EAE ++ + + + EK+ + + ++ + ME + A +A Q + + K
Sbjct: 182 LGRIREAEDQQSHLEVLLEQKEKEHMMLREEIHQKLQMER----DDAKTEASQKLIDEKD 237
Query: 1130 QQLVELRSM 1138
++ E M
Sbjct: 238 TKIKEFEKM 246
>gnl|CDD|213260 cd03293, ABC_NrtD_SsuB_transporters, ATP-binding cassette domain
of the nitrate and sulfonate transporters. NrtD and
SsuB are the ATP-binding subunits of the bacterial
ABC-type nitrate and sulfonate transport systems,
respectively. ABC transporters are a large family of
proteins involved in the transport of a wide variety of
different compounds, like sugars, ions, peptides, and
more complex organic molecules. The nucleotide binding
domain shows the highest similarity between all members
of the family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 220
Score = 30.1 bits (69), Expect = 8.2
Identities = 11/22 (50%), Positives = 15/22 (68%), Gaps = 4/22 (18%)
Query: 46 LMMVGPSGSGKSTAWKVLLKAL 67
+ +VGPSG GKST LL+ +
Sbjct: 33 VALVGPSGCGKST----LLRII 50
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 30.8 bits (69), Expect = 8.2
Identities = 25/140 (17%), Positives = 60/140 (42%), Gaps = 9/140 (6%)
Query: 1003 HYLDFINHFVKLYREKCSELEEQQLHLNVGLGK-------IAETVEQVEEMQKSLAVKSQ 1055
H LD + ++L R+ + +EQ HL + I +++ ++ ++ L S
Sbjct: 836 HELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELST 895
Query: 1056 ELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVM-EDLAQVE 1114
E+QS ++ + EK + + +++ + E + AQ +V + E + +
Sbjct: 896 EVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSN-KKAQDKVNDIKEKVKNIH 954
Query: 1115 PAVMDAQQAVKEIKKQQLVE 1134
+ D + +++ K L +
Sbjct: 955 GYMKDIENKIQDGKDDYLKQ 974
>gnl|CDD|213213 cd03246, ABCC_Protease_Secretion, ATP-binding cassette domain of
PrtD, subfamily C. This family represents the ABC
component of the protease secretion system PrtD, a
60-kDa integral membrane protein sharing 37% identity
with HlyB, the ABC component of the alpha-hemolysin
secretion pathway, in the C-terminal domain. They
export degradative enzymes by using a type I protein
secretion system and lack an N-terminal signal peptide,
but contain a C-terminal secretion signal. The Type I
secretion apparatus is made up of three components, an
ABC transporter, a membrane fusion protein (MFP), and
an outer membrane protein (OMP). For the HlyA
transporter complex, HlyB (ABC transporter) and HlyD
(MFP) reside in the inner membrane of E. coli. The OMP
component is TolC, which is thought to interact with
the MFP to form a continuous channel across the
periplasm from the cytoplasm to the exterior. HlyB
belongs to the family of ABC transporters, which are
ubiquitous, ATP-dependent transmembrane pumps or
channels. The spectrum of transport substrates ranges
from inorganic ions, nutrients such as amino acids,
sugars, or peptides, hydrophobic drugs, to large
polypeptides, such as HlyA.
Length = 173
Score = 29.5 bits (67), Expect = 8.2
Identities = 10/16 (62%), Positives = 14/16 (87%)
Query: 49 VGPSGSGKSTAWKVLL 64
+GPSGSGKST +++L
Sbjct: 34 IGPSGSGKSTLARLIL 49
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 30.6 bits (69), Expect = 8.3
Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 6/88 (6%)
Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQE---LQSKNEAANLKLK-EMIKDQ 1075
EL+E++L L + E+VEE+ +A S++ L+ A L+ + E +K+
Sbjct: 769 RELKEEELALLEEAIDALD--EEVEELHAQVAALSRQIAQLEGGGTVAELRQRRESLKED 826
Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKR 1103
E + RK S + ++ ++ + + ++R
Sbjct: 827 LEEKARKWASLRLAVQVLEEALRLFKER 854
>gnl|CDD|237870 PRK14964, PRK14964, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 491
Score = 30.5 bits (69), Expect = 8.3
Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 8/42 (19%)
Query: 741 EVLDHVLR----IDRIFRQPQGHLLLIGVSGAGKTTLSRFVA 778
+VL +LR +++I PQ +LL+G SG GKTT +R ++
Sbjct: 19 DVLVRILRNAFTLNKI---PQS-ILLVGASGVGKTTCARIIS 56
>gnl|CDD|238260 cd00464, SK, Shikimate kinase (SK) is the fifth enzyme in the
shikimate pathway, a seven-step biosynthetic pathway
which converts erythrose-4-phosphate to chorismic acid,
found in bacteria, fungi and plants. Chorismic acid is a
important intermediate in the synthesis of aromatic
compounds, such as aromatic amino acids, p-aminobenzoic
acid, folate and ubiquinone. Shikimate kinase catalyses
the phosphorylation of the 3-hydroxyl group of shikimic
acid using ATP.
Length = 154
Score = 29.4 bits (67), Expect = 8.4
Identities = 11/17 (64%), Positives = 14/17 (82%)
Query: 762 LIGVSGAGKTTLSRFVA 778
LIG+ GAGKTT+ R +A
Sbjct: 4 LIGMMGAGKTTVGRLLA 20
>gnl|CDD|172735 PRK14247, PRK14247, phosphate ABC transporter ATP-binding
protein; Provisional.
Length = 250
Score = 30.3 bits (68), Expect = 8.4
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 48 MVGPSGSGKSTAWKVLLKALERY 70
++GPSGSGKST +V + +E Y
Sbjct: 34 LMGPSGSGKSTLLRVFNRLIELY 56
>gnl|CDD|107346 cd06351, PBP1_iGluR_N_LIVBP_like, N-terminal
leucine/isoleucine/valine-binding protein (LIVBP)-like
domain of the NMDA, AMPA, and kainate receptor subtypes
of ionotropic glutamate receptors (iGluRs). N-terminal
leucine/isoleucine/valine-binding protein (LIVBP)-like
domain of the NMDA, AMPA, and kainate receptor subtypes
of ionotropic glutamate receptors (iGluRs). While this
N-terminal domain belongs to the periplasmic-binding
fold type I superfamily, the glutamate-binding domain of
the iGluR is structurally homologous to the
periplasmic-binding fold type II. The LIVBP-like domain
of iGluRs is thought to play a role in the initial
assembly of iGluR subunits, but it is not well
understood how this domain is arranged and functions in
intact iGluR. Glutamate mediates the majority of
excitatory synaptic transmission in the central nervous
system via two broad classes of ionotropic receptors
characterized by their response to glutamate agonists:
N-methyl-aspartate (NMDA) and non-NMDA receptors. NMDA
receptors have intrinsically slow kinetics, are highly
permeable to Ca2+, and are blocked by extracellular Mg2+
in a voltage-dependent manner. On the other hand,
non-NMDA receptors have faster kinetics, are weakly
permeable to Ca2+, and are not blocked by extracellular
Mg2+. While non-NMDA receptors typically mediate
excitatory synaptic responses at resting membrane
potentials, NMDA receptors contribute to several forms
of synaptic plasticity and are suggested to play an
important role in the development of synaptic pathways.
Length = 328
Score = 30.2 bits (68), Expect = 8.5
Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 3/48 (6%)
Query: 558 HVPVIYVDYPGETSLKQIYGTFSRAMLRLIPPLRGYADALTNAMVELY 605
+P I + E + + + L+L P L ADAL + +
Sbjct: 87 EIPHISISGGSEGLSDKEESSTT---LQLYPSLEDLADALLDLLEYYN 131
>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family. Many
proteins may score above the trusted cutoff because an
internal.
Length = 364
Score = 30.5 bits (69), Expect = 8.5
Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 8/66 (12%)
Query: 720 VQARLKVFYEE--ELDVQLVLFDEVLDHVLRIDRIFRQ-----PQGHLLLIGVSGAGKTT 772
V+ R V YE+ L+ Q+ E ++ L+ +F + P+G +LL G G GKT
Sbjct: 113 VEERPNVSYEDIGGLEEQIREIREAVELPLKHPELFEEVGIEPPKG-VLLYGPPGTGKTL 171
Query: 773 LSRFVA 778
L++ VA
Sbjct: 172 LAKAVA 177
>gnl|CDD|221816 pfam12866, DUF3823, Protein of unknown function (DUF3823). This is
a family of uncharacterized proteins from Bacteroidetes.
It has characteristic DN and DR sequence-motifs. The
function is not known.
Length = 220
Score = 29.9 bits (68), Expect = 8.6
Identities = 22/68 (32%), Positives = 26/68 (38%), Gaps = 14/68 (20%)
Query: 330 FAGDGKLKMRSDFGNFLRSVTTITLPATSSDIVDFEVN----IKNGEWVPWSNKVPQIEV 385
F GD KL + G + V TI + + DFEV IKN I
Sbjct: 75 FDGDYKLVLVRGNGPWEYPVDTIDVEVKGNTTQDFEVTPYYRIKN----------ASISK 124
Query: 386 ETQKVAAS 393
E KV AS
Sbjct: 125 EGNKVTAS 132
>gnl|CDD|173809 cd07483, Peptidases_S8_Subtilisin_Novo-like, Peptidase S8 family
domain in Subtilisin_Novo-like proteins. Subtilisins are
a group of alkaline proteinases originating from
different strains of Bacillus subtilis. Novo is one of
the strains that produced enzymes belonging to this
group. The enzymes obtained from the Novo and BPN'
strains are identical. The Carlsburg and Novo
subtilisins are thought to have arisen from a common
ancestral protein. They have similar peptidase and
esterase activities, pH profiles, catalyze
transesterification reactions, and are both inhibited by
diispropyl fluorophosphate, though they differ in 85
positions in the amino acid sequence. Members of the
peptidases S8 and S35 clan include endopeptidases,
exopeptidases and also a tripeptidyl-peptidase. The S8
family has an Asp/His/Ser catalytic triad similar to that
found in trypsin-like proteases, but do not share their
three-dimensional structure and are not homologous to
trypsin. The S53 family contains a catalytic triad
Glu/Asp/Ser with an additional acidic residue Asp in the
oxyanion hole, similar to that of subtilisin.. The
stability of these enzymes may be enhanced by calcium,
some members have been shown to bind up to 4 ions via
binding sites with different affinity. Some members of
this clan contain disulfide bonds. These enzymes can be
intra- and extracellular, some function at extreme
temperatures and pH values.
Length = 291
Score = 30.4 bits (69), Expect = 8.6
Identities = 20/57 (35%), Positives = 27/57 (47%), Gaps = 3/57 (5%)
Query: 1177 NSIVSNF-NTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQI-SY 1231
N++V+NF N +V Y + PD YE + SMA P+V A I SY
Sbjct: 218 NNLVANFSNYGKKNVDVFAPGERIYSTTPDNEYETDSGTSMAA-PVVSGVAALIWSY 273
>gnl|CDD|213197 cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the
drug resistance transporter and related proteins,
subfamily A. This family of ATP-binding proteins
belongs to a multi-subunit transporter involved in drug
resistance (BcrA and DrrA), nodulation, lipid
transport, and lantibiotic immunity. In bacteria and
archaea, these transporters usually include an
ATP-binding protein and one or two integral membrane
proteins. Eukaryotic systems of the ABCA subfamily
display ABC domains that are quite similar to this
family. The ATP-binding domain shows the highest
similarity between all members of the ABC transporter
family. ABC transporters are a subset of nucleotide
hydrolases that contain a signature motif, Q-loop, and
H-loop/switch region, in addition to, the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 173
Score = 29.7 bits (68), Expect = 8.6
Identities = 15/48 (31%), Positives = 27/48 (56%), Gaps = 5/48 (10%)
Query: 44 HGLMMVGPSGSGKSTAWKVLLKALERYEG---VEGVAHIIDPKAISKE 88
+GL +GP+G+GK+T K++L L+ G V G +P+ + +
Sbjct: 29 YGL--LGPNGAGKTTLIKIILGLLKPDSGEIKVLGKDIKKEPEEVKRR 74
>gnl|CDD|184585 PRK14239, PRK14239, phosphate transporter ATP-binding protein;
Provisional.
Length = 252
Score = 30.1 bits (68), Expect = 8.7
Identities = 12/22 (54%), Positives = 17/22 (77%), Gaps = 4/22 (18%)
Query: 48 MVGPSGSGKSTAWKVLLKALER 69
++GPSGSGKST LL+++ R
Sbjct: 36 LIGPSGSGKST----LLRSINR 53
>gnl|CDD|221912 pfam13086, AAA_11, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins.
Length = 220
Score = 30.0 bits (68), Expect = 8.8
Identities = 9/18 (50%), Positives = 14/18 (77%)
Query: 423 GPPGSGKTMTLLSALRAL 440
GPPG+GKT T++ ++ L
Sbjct: 24 GPPGTGKTTTIVEIIKQL 41
>gnl|CDD|225438 COG2884, FtsE, Predicted ATPase involved in cell division [Cell
division and chromosome partitioning].
Length = 223
Score = 29.9 bits (68), Expect = 9.0
Identities = 9/20 (45%), Positives = 12/20 (60%)
Query: 49 VGPSGSGKSTAWKVLLKALE 68
GPSG+GKST K++
Sbjct: 34 TGPSGAGKSTLLKLIYGEER 53
>gnl|CDD|218528 pfam05272, VirE, Virulence-associated protein E. This family
contains several bacterial virulence-associated protein
E like proteins. These proteins contain a P-loop motif.
Length = 198
Score = 29.7 bits (67), Expect = 9.0
Identities = 11/20 (55%), Positives = 13/20 (65%)
Query: 44 HGLMMVGPSGSGKSTAWKVL 63
H L++ G GSGKST K L
Sbjct: 53 HVLILQGAQGSGKSTFLKKL 72
>gnl|CDD|182778 PRK10851, PRK10851, sulfate/thiosulfate transporter subunit;
Provisional.
Length = 353
Score = 30.4 bits (69), Expect = 9.1
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 762 LIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNK 793
L+G SG+GKTTL R +A + + IR H
Sbjct: 33 LLGPSGSGKTTLLRIIAGLEHQTSGHIRFHGT 64
>gnl|CDD|224260 COG1341, COG1341, Predicted GTPase or GTP-binding protein [General
function prediction only].
Length = 398
Score = 30.4 bits (69), Expect = 9.1
Identities = 17/73 (23%), Positives = 29/73 (39%), Gaps = 9/73 (12%)
Query: 714 TELREYVQARLKVFYEEELDVQLV--------LFDEVLDHVLRIDRIFRQPQGHLLLIGV 765
LRE L V QLV +E+ D + G ++++G
Sbjct: 23 LGLREVEGGALSVARVRTTYHQLVEVPEDRSEPLEEIADTWESKSESAGKV-GVVMVVGP 81
Query: 766 SGAGKTTLSRFVA 778
+GK+TL+ ++A
Sbjct: 82 VDSGKSTLTTYLA 94
>gnl|CDD|206649 cd01850, CDC_Septin, CDC/Septin GTPase family. Septins are a
conserved family of GTP-binding proteins associated
with diverse processes in dividing and non-dividing
cells. They were first discovered in the budding yeast
S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and
CDC12) required for normal bud morphology. Septins are
also present in metazoan cells, where they are required
for cytokinesis in some systems, and implicated in a
variety of other processes involving organization of
the cell cortex and exocytosis. In humans, 12 septin
genes generate dozens of polypeptides, many of which
comprise heterooligomeric complexes. Since septin
mutants are commonly defective in cytokinesis and
formation of the neck formation of the neck
filaments/septin rings, septins have been considered to
be the primary constituents of the neck filaments.
Septins belong to the GTPase superfamily for their
conserved GTPase motifs and enzymatic activities.
Length = 275
Score = 30.2 bits (69), Expect = 9.1
Identities = 9/13 (69%), Positives = 11/13 (84%)
Query: 46 LMMVGPSGSGKST 58
+M+VG SG GKST
Sbjct: 7 IMVVGESGLGKST 19
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 30.4 bits (69), Expect = 9.2
Identities = 24/104 (23%), Positives = 50/104 (48%), Gaps = 6/104 (5%)
Query: 1034 GKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIE 1093
K+ Q+EE +K+L+ + EL+ + + +L + + RK+ + + AE+E
Sbjct: 98 EKLDNLENQLEEREKALSARELELEELEKQLDNELYR-VAGLTPEQARKLLLKLLDAELE 156
Query: 1094 KQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRS 1137
++ +RV +E+ A +E A AQ + + ++ E S
Sbjct: 157 EE----KAQRVKKIEEEADLE-AERKAQNILAQAMQRIASETAS 195
>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component
of iron-siderophores, vitamin B12 and hemin
transporters and related proteins. ABC transporters,
involved in the uptake of siderophores, heme, and
vitamin B12, are widely conserved in bacteria and
archaea. Only very few species lack representatives of
the siderophore family transporters. The E. coli BtuCD
protein is an ABC transporter mediating vitamin B12
uptake. The two ATP-binding cassettes (BtuD) are in
close contact with each other, as are the two
membrane-spanning subunits (BtuC); this arrangement is
distinct from that observed for the E. coli lipid
flippase MsbA. The BtuC subunits provide 20
transmembrane helices grouped around a translocation
pathway that is closed to the cytoplasm by a gate
region, whereas the dimer arrangement of the BtuD
subunits resembles the ATP-bound form of the Rad50 DNA
repair enzyme. A prominent cytoplasmic loop of BtuC
forms the contact region with the ATP-binding cassette
and represent a conserved motif among the ABC
transporters.
Length = 180
Score = 29.7 bits (68), Expect = 9.2
Identities = 11/19 (57%), Positives = 14/19 (73%), Gaps = 4/19 (21%)
Query: 49 VGPSGSGKSTAWKVLLKAL 67
+GP+G+GKST LLK L
Sbjct: 31 LGPNGAGKST----LLKTL 45
>gnl|CDD|234041 TIGR02868, CydC, thiol reductant ABC exporter, CydC subunit. The
gene pair cydCD encodes an ABC-family transporter in
which each gene contains an N-terminal membrane-spanning
domain (pfam00664) and a C-terminal ATP-binding domain
(pfam00005). In E. coli these genes were discovered as
mutants which caused the terminal heme-copper oxidase
complex cytochrome bd to fail to assemble. Recent work
has shown that the transporter is involved in export of
redox-active thiol compounds such as cysteine and
glutathione. The linkage to assembly of the cytochrome
bd complex is further supported by the conserved operon
structure found outside the gammaproteobacteria
(cydABCD) containing both the transporter and oxidase
genes components. The genes used as the seed members for
this model are all either found in the
gammproteobacterial context or the CydABCD context. All
members of this family scoring above trusted at the time
of its creation were from genomes which encode a
cytochrome bd complex.
Length = 530
Score = 30.4 bits (69), Expect = 9.2
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
+GPSGSGKST L L+ +G
Sbjct: 367 LGPSGSGKSTLLATLAGLLDPLQG 390
>gnl|CDD|153370 cd07686, F-BAR_Fer, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Fer (Fes related) tyrosine
kinase. F-BAR domains are dimerization modules that bind
and bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization. Fer (Fes
related) is a cytoplasmic (or nonreceptor) tyrosine
kinase expressed in a wide variety of tissues, and is
found to reside in both the cytoplasm and the nucleus. It
plays important roles in neuronal polarization and
neurite development, cytoskeletal reorganization, cell
migration, growth factor signaling, and the regulation of
cell-cell interactions mediated by adherens junctions and
focal adhesions. Fer kinase also regulates cell cycle
progression in malignant cells. It contains an N-terminal
F-BAR domain, an SH2 domain, and a C-terminal catalytic
kinase domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 234
Score = 30.0 bits (67), Expect = 9.4
Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 1040 VEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQ-SQDIQAEIEKQT 1096
V+Q E++ K + ++EL N +L MIKD+Q+ +K + Q I+AE+ K T
Sbjct: 71 VQQTEQLSKIMKTHAEEL---NSGPLHRLTMMIKDKQQVKKSYIGVHQQIEAEMYKVT 125
>gnl|CDD|227584 COG5259, RSC8, RSC chromatin remodeling complex subunit RSC8
[Chromatin structure and dynamics / Transcription].
Length = 531
Score = 30.2 bits (68), Expect = 9.4
Identities = 23/133 (17%), Positives = 49/133 (36%), Gaps = 14/133 (10%)
Query: 1005 LDFINHFVKLYREKCSELEEQQLH------LNVGLGKIAETVEQVEEMQKSLAVKSQELQ 1058
LD KL EE+++ + L K+ + ++E++KS +++ QEL
Sbjct: 401 LDSGKEKAKLQATN----EERKMERLRNVLIQAQLEKLKMKLGHLKELEKSTSLERQELD 456
Query: 1059 SKNEAANLKLKE---MIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRV-FVMEDLAQVE 1114
+ L +E I + + + K + E + + + + V+
Sbjct: 457 ANLLLRRLNAEEKLFAIDEVLSKCLNLISDDQGKNPDNKPSTETGENHLGQEISNETDVK 516
Query: 1115 PAVMDAQQAVKEI 1127
P +A Q +
Sbjct: 517 PISKEAPQFYRYW 529
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes the
bacterial secreted proteins PopB, PepB, YopB and EspD
which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 30.1 bits (68), Expect = 9.5
Identities = 10/50 (20%), Positives = 21/50 (42%)
Query: 1241 LRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAI 1290
+ L+ +A + +A + + QL+ I K+E+ +L Q
Sbjct: 232 TGAAVAVLQKEAGKLEADLALLRAEMEQLQAVIKQLKEEFKKLQEQFQQA 281
>gnl|CDD|214345 CHL00043, cemA, envelope membrane protein.
Length = 261
Score = 29.8 bits (68), Expect = 9.6
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 10/55 (18%)
Query: 1047 QKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
+K K EL+ L L EMIK+ E Q ++ I K+T+++A+
Sbjct: 85 EKRALEKFIELEEL-----LLLDEMIKEYPE-----THLQKLRIGIHKETIQLAK 129
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD and
GSK-3beta in intact cells.
Length = 711
Score = 30.6 bits (69), Expect = 9.6
Identities = 28/100 (28%), Positives = 54/100 (54%), Gaps = 8/100 (8%)
Query: 1018 KCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL---KLKEMIKD 1074
K + ++ L L L ++ + V+ + L+ SQEL+ +NE L +L++ IK+
Sbjct: 28 KEAYYLQRILELQAELKQLRAELSNVQAENERLSSLSQELKEENEMLELQRGRLRDEIKE 87
Query: 1075 QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVE 1114
+ E R +Q +E+E++ + + QK+V V+ +QVE
Sbjct: 88 YKFREARLLQD---YSELEEENISL-QKQVSVLRQ-SQVE 122
>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 424
Score = 30.2 bits (69), Expect = 9.7
Identities = 22/128 (17%), Positives = 42/128 (32%), Gaps = 18/128 (14%)
Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLK--------LKEM 1071
S Q L E + + +SK L LK +
Sbjct: 86 SRTNSQDALLQAENALPEWKKELEKPSEPKEEEPKAAAESKVVQKELDELRDELKELKNL 145
Query: 1072 IKDQQEAEKRKVQSQDIQAEIEKQTV------EIAQKRVFVMEDLAQVEPAVMD-AQQAV 1124
++DQ ++ + AE+ K+ EIA+K +++ L + P A + +
Sbjct: 146 LEDQLSGLRQVERIPPEFAELYKRLKRSGLSPEIAEK---LLKLLLEHMPPRERTAWRYL 202
Query: 1125 KEIKKQQL 1132
E+ +
Sbjct: 203 LELLANMI 210
>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division and
chromosome partitioning].
Length = 212
Score = 29.7 bits (67), Expect = 9.7
Identities = 17/136 (12%), Positives = 52/136 (38%), Gaps = 16/136 (11%)
Query: 1232 ADMLKKVEPLRLELKSLEVQASENKAK------GEETKDLI---TQLEKSIASYKDEYAQ 1282
++ ++++ S +Q +E +A+ E D++ + + + A
Sbjct: 54 DELKEELKEAADAEDSQAIQQAETEAEELKQAAEAEADDILKRASAQAQRVFGKAQYKAD 113
Query: 1283 LIAQATAIKTDLDNVQAKF-YEYAQLIAQATA----IKTD--LDNVQAKVERSMALLKSL 1335
+ I +L++++ + E +L + A K + + +++ V+ A +
Sbjct: 114 RYLEDAKIAQELEDLKRQAQVERQRLRSDIEAQLASAKQEDWDEILRSTVDEVEAANEEA 173
Query: 1336 GIERERWEATSETFRS 1351
+ +A ++ R
Sbjct: 174 ERLADAAQADADRLRD 189
>gnl|CDD|181939 PRK09544, znuC, high-affinity zinc transporter ATPase; Reviewed.
Length = 251
Score = 30.1 bits (68), Expect = 9.7
Identities = 17/41 (41%), Positives = 25/41 (60%), Gaps = 3/41 (7%)
Query: 36 LYQIS-NLNHG--LMMVGPSGSGKSTAWKVLLKALERYEGV 73
L +S L G L ++GP+G+GKST +V+L + EGV
Sbjct: 20 LSDVSLELKPGKILTLLGPNGAGKSTLVRVVLGLVAPDEGV 60
>gnl|CDD|184697 PRK14475, PRK14475, F0F1 ATP synthase subunit B; Provisional.
Length = 167
Score = 29.5 bits (66), Expect = 9.8
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 5/66 (7%)
Query: 1037 AETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQ--AEIEK 1094
E Q M LA + + A KL+E IK + E +RK+ + Q A+++
Sbjct: 72 EEAERQAAAM---LAAAKADARRMEAEAKEKLEEQIKRRAEMAERKIAQAEAQAAADVKA 128
Query: 1095 QTVEIA 1100
V++A
Sbjct: 129 AAVDLA 134
>gnl|CDD|213262 cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of
the osmoprotectant transporter. OpuCA is a the ATP
binding component of a bacterial solute transporter
that serves a protective role to cells growing in a
hyperosmolar environment. ABC (ATP-binding cassette)
transporter nucleotide-binding domain; ABC transporters
are a large family of proteins involved in the
transport of a wide variety of different compounds,
like sugars, ions, peptides, and more complex organic
molecules. The nucleotide binding domain shows the
highest similarity between all members of the family.
ABC transporters are a subset of nucleotide hydrolases
that contain a signature motif, Q-loop, and
H-loop/switch region, in addition, to the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 242
Score = 30.0 bits (68), Expect = 9.9
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 46 LMMVGPSGSGKSTAWKVLLKALERYEG 72
L+++GPSGSGK+T K++ + +E G
Sbjct: 30 LVLIGPSGSGKTTTMKMINRLIEPTSG 56
>gnl|CDD|227320 COG4987, CydC, ABC-type transport system involved in cytochrome bd
biosynthesis, fused ATPase and permease components
[Energy production and conversion / Posttranslational
modification, protein turnover, chaperones].
Length = 573
Score = 30.4 bits (69), Expect = 9.9
Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 8/43 (18%)
Query: 49 VGPSGSGKSTAWKVLLKALE------RYEGVEGVAHIIDPKAI 85
+G SGSGKST ++L A + GVE +A +D +A+
Sbjct: 370 LGRSGSGKSTLLQLLAGAWDPQQGSITLNGVE-IAS-LDEQAL 410
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the synaptonemal
complex. Synaptonemal complexes are structures that are
formed between homologous chromosomes during meiotic
prophase.
Length = 787
Score = 30.5 bits (68), Expect = 9.9
Identities = 43/183 (23%), Positives = 78/183 (42%), Gaps = 24/183 (13%)
Query: 1000 TPRHYLDFINHFVKL---YREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKS------- 1049
T + Y+D N+ K+ + E + E +L ++ L + E ++ +EE K
Sbjct: 181 TRQVYMDLNNNIEKMILAFEELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDKEK 240
Query: 1050 ----LAVKSQELQSKNEAANLKLKEMI-KDQQEAEKRKVQSQDIQAEIEKQ---TVEIAQ 1101
L ++ E ++K + L+E K Q EK K+Q ++++ IEK+ T E+
Sbjct: 241 QVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELED 300
Query: 1102 KRVFVMEDLAQVEPAVMDAQQAVKEI------KKQQLVELRSMANPPSVVKLALESICLL 1155
++ + ++ + D Q A K I K+ Q+ EL S V E+
Sbjct: 301 IKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCS 360
Query: 1156 LGE 1158
L E
Sbjct: 361 LEE 363
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.393
Gapped
Lambda K H
0.267 0.0889 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 83,718,427
Number of extensions: 8537761
Number of successful extensions: 12083
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11705
Number of HSP's successfully gapped: 642
Length of query: 1614
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1504
Effective length of database: 6,058,662
Effective search space: 9112227648
Effective search space used: 9112227648
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 66 (28.9 bits)