RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11935
(1334 letters)
>gnl|CDD|236387 PRK09130, PRK09130, NADH dehydrogenase subunit G; Validated.
Length = 687
Score = 785 bits (2029), Expect = 0.0
Identities = 306/640 (47%), Positives = 377/640 (58%), Gaps = 119/640 (18%)
Query: 790 PVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFG 849
PVA+CAMPV +G + TN+ M ++AREGVMEFLL+NHPLDCPICDQGGECDLQDQ+MA+G
Sbjct: 59 PVASCAMPVGEGMVIFTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAYG 118
Query: 850 ---------------------------------------------NDLGTTGRGSDMQVG 864
+LG GRG DM++
Sbjct: 119 VDTSRYHENKRAVEDKYMGPLVKTVMTRCIHCTRCVRFATEVAGVPELGAIGRGEDMEIT 178
Query: 865 TYVEKLFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNIIVSTRT 924
TY+E+ SELSGNVIDLCPVGALTSKPY+FTARPWE +KTES+DV+DAVGSNI V TR
Sbjct: 179 TYLEQALTSELSGNVIDLCPVGALTSKPYAFTARPWELKKTESIDVMDAVGSNIRVDTRG 238
Query: 925 GEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQRLLTPFVRNCDGQLVATEWEDALIAVA 984
EV+R+LPR+NE++NEEW++DK RF++DGLKRQRL P+VR +G+LV W++A A+A
Sbjct: 239 REVMRILPRVNEEVNEEWISDKTRFSWDGLKRQRLDRPYVRK-NGKLVPASWDEAFAAIA 297
Query: 985 QKLQ---TSEVAGVVGSLADAEAMVALKDLLNKLGSEDL---YTEYAFPLEGAGTDLRAN 1038
K++ ++A + G LAD E+M ALKDL+ KLGS +L LRA+
Sbjct: 298 AKIKGTPGEKIAAIAGDLADVESMFALKDLMQKLGSSNLDCRQDGAKLD-----PSLRAS 352
Query: 1039 YLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIGPKVDLRYDYEH 1098
YL N IAG EEAD ILLIG NPRFEAP+ NARIRK + +A IG + DL Y YE+
Sbjct: 353 YLFNTTIAGIEEADAILLIGANPRFEAPVLNARIRKRWRAGGFKIAVIGEQADLTYPYEY 412
Query: 1099 LGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADI---SDGAAVLALVQQLAAKVTCES 1155
LG D + LASG H F+ L AAK+P+I+VG +DGAAVLAL +LA KV
Sbjct: 413 LGAGPDTLADLASGKHEFADVLKAAKRPMIIVGQGALARADGAAVLALAAKLAEKVGAVR 472
Query: 1156 DVPCDWKVLNILQKAASQVAALDIGYKPGT------SAIREKPPKVLFLLGADEGSISRD 1209
D W N+L AAS+V LD+G+ PG + VL+LLGADE IS+
Sbjct: 473 D---GWNGFNVLHTAASRVGGLDLGFVPGEGGKDAAEMLESGALDVLYLLGADEIDISK- 528
Query: 1210 DVGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLARE 1269
GK F+IYQGHHGD GA AD ILPGAAYTEK TYVNTEGR Q AV PPG A+E
Sbjct: 529 --GKSAFVIYQGHHGDRGAHRADVILPGAAYTEKSGTYVNTEGRVQLANRAVFPPGEAKE 586
Query: 1270 DWKIIRALSE--------------------------------------------GVSSNL 1285
DW I+RALS+ L
Sbjct: 587 DWAILRALSDVLGKTLPYDSLAQLRAKLAAAYPHFAAIDQITPSKDAKDLAALASKKGKL 646
Query: 1286 SSEPVDVKQKNLEDYYMTDPISRASGTMAKCIQAVHKQKQ 1325
S P K D+Y+T+PI+RAS TMA+C +
Sbjct: 647 SKAPFTSPVK---DFYLTNPIARASATMAECSALASGRAL 683
Score = 162 bits (413), Expect = 3e-41
Identities = 55/119 (46%), Positives = 69/119 (57%), Gaps = 37/119 (31%)
Query: 669 IDDKKVMVDPGTTVLQ-------------------------------------PVAACAM 691
+D K++ V G T+LQ PVA+CAM
Sbjct: 6 VDGKEIEVPDGYTLLQACEAAGAEIPRFCYHERLSIAGNCRMCLVEVKGGPPKPVASCAM 65
Query: 692 PVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSRF 750
PV +G + TN+ M ++AREGVMEFLL+NHPLDCPICDQGGECDLQDQ+MA+G D SR+
Sbjct: 66 PVGEGMVIFTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAYGVDTSRY 124
>gnl|CDD|233662 TIGR01973, NuoG, NADH-quinone oxidoreductase, chain G. This model
represents the G subunit (one of 14: A->N) of the
NADH-quinone oxidoreductase complex I which generally
couples NADH and ubiquinone oxidation/reduction in
bacteria and mammalian mitochondria while translocating
protons, but may act on NADPH and/or plastoquinone in
cyanobacteria and plant chloroplasts. This model excludes
related subunits from formate dehydrogenase complexes
[Energy metabolism, Electron transport].
Length = 603
Score = 622 bits (1607), Expect = 0.0
Identities = 286/667 (42%), Positives = 361/667 (54%), Gaps = 120/667 (17%)
Query: 667 VFIDDKKVMVDPGTTVLQ-------------------------------------PVAAC 689
+FID K++ V GTTVLQ PVA+C
Sbjct: 1 IFIDGKELEVPKGTTVLQACLSAGIEIPRFCYHEKLSIAGNCRMCLVEVEKFPDKPVASC 60
Query: 690 AMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSR 749
A PV G ++ TNSE ++AREGVMEFLL+NHPLDCPICDQGGECDLQDQ++ +GSDRSR
Sbjct: 61 ATPVTDGMKISTNSEKVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAVMYGSDRSR 120
Query: 750 FTDIDFSGKSGTYTRVTRQRSWGLIPSRVFLQSINRHWIGPVAACAMPVMKGWRVKTNSE 809
F R +++ ++GP+ +KT E
Sbjct: 121 F--------------------------REKKRTVENKYLGPL------------IKT--E 140
Query: 810 MTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGNDLGTTGRGSDMQVGTYVEK 869
MTR C C +A DLG GRG+++++GTY K
Sbjct: 141 MTR--------------------CIHCTRCVRFANEVAGVEDLGVIGRGNNVEIGTYEGK 180
Query: 870 LFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNIIVSTRTGEVLR 929
SELSGN+ID+CPVGALTSKPY+F ARPWE + T S+ V D+VG NI V R GE++R
Sbjct: 181 TLESELSGNLIDICPVGALTSKPYAFKARPWELKSTPSICVHDSVGCNIRVDERNGEIMR 240
Query: 930 VLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVRNCDGQLVATEWEDALIAVAQKLQ 988
+LPR N++INEEWL DKGRF YDGL RQ RL P +RN +G L+ W +AL A+KL+
Sbjct: 241 ILPRENDEINEEWLCDKGRFGYDGLNRQDRLTKPLLRNQEGNLLEVSWAEALAIAAEKLK 300
Query: 989 TSE-VAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGTDLRANYLLNNKIAG 1047
S + G+ G + E + ALK L+ KLGSE+ + DLRANYL N +A
Sbjct: 301 ASSRIGGIAGPRSSLEELFALKKLVRKLGSENFDLRIRNYEFES-ADLRANYLFNTTLAD 359
Query: 1048 AEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIGP-KVDLRYD-----YEHLGE 1101
EEADL+LL+G + R EAPL N R+RK VA IG K +L Y H G
Sbjct: 360 IEEADLVLLVGADLRQEAPLLNLRLRKAVKKGGAKVALIGIEKWNLTYPANTNLVFHPGL 419
Query: 1102 SADLIKQLASGSHP-FSKKLSAAKKPLIVVGADI---SDGAAVLALVQQLAAKVTCESDV 1157
S + +ASG+H + L AAKKPLI+VG DGAA+++ AKV
Sbjct: 420 SPKKLDDIASGAHSDIAAALKAAKKPLIIVGDSAYSHLDGAALISAAAN-IAKVIK--VR 476
Query: 1158 PCDWKVLNILQKAASQVAALDIGYKPGT--SAIREKPPKVLFLLGADEG-----SISRDD 1210
+W LNIL A+ V LD+G + +A+ LFLLGAD +
Sbjct: 477 RKEWNGLNILSSGANSVGLLDLGGESTGLDAALNLGAADALFLLGADLERALDKTARDAL 536
Query: 1211 VGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLARED 1270
D F+IYQGHHG A AD ILPGAA+TEK TYVN EGRAQ+ AV PPG ARED
Sbjct: 537 SKADAFVIYQGHHGTETAEKADVILPGAAFTEKSGTYVNLEGRAQRFEQAVKPPGEARED 596
Query: 1271 WKIIRAL 1277
W+I+RAL
Sbjct: 597 WRILRAL 603
>gnl|CDD|239174 cd02773, MopB_Res-Cmplx1_Nad11, MopB_Res_Cmplx1_Nad11: The second
domain of the Nad11/75-kDa subunit of the NADH-quinone
oxidoreductase/respiratory complex I/NADH
dehydrogenase-1(NDH-1) of eukaryotes and the Nqo3/G
subunit of alphaproteobacteria NDH-1. The NADH-quinone
oxidoreductase is the first energy-transducting complex
in the respiratory chains of many prokaryotes and
eukaryotes. Mitochondrial complex I and its bacterial
counterpart, NDH-1, function as a redox pump that uses
the redox energy to translocate H+ ions across the
membrane, resulting in a significant contribution to
energy production. The nad11 gene codes for the largest
(75 kDa) subunit of the mitochondrial NADH:ubiquinone
oxidoreductase, it constitutes the electron input part of
the enzyme, or the so-called NADH dehydrogenase fragment.
In Paracoccus denitrificans, this subunit is encoded by
the nqo3 gene, and is part of the 14 distinct subunits
constituting the 'minimal' functional enzyme. The
Nad11/Nqo3 subunit is made of two domains: the first
contains three binding sites for FeS clusters (the fer2
domain), the second domain (this CD), is of unknown
function or, as postulated, has lost an ancestral formate
dehydrogenase activity that became redundant during the
evolution of the complex I enzyme. Although only
vestigial sequence evidence remains of a molybdopterin
binding site, this protein domain belongs to the
molybdopterin_binding (MopB) superfamily of proteins.
Length = 375
Score = 611 bits (1577), Expect = 0.0
Identities = 235/381 (61%), Positives = 278/381 (72%), Gaps = 15/381 (3%)
Query: 906 ESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQRLLTPFVR 965
ES+DVLDAVGSNI V TR GEV+R+LPRLNEDINEEW++DK RFAYDGLKRQRL P++R
Sbjct: 1 ESIDVLDAVGSNIRVDTRGGEVMRILPRLNEDINEEWISDKTRFAYDGLKRQRLDKPYIR 60
Query: 966 NCDGQLVATEWEDALIAVAQKLQT---SEVAGVVGSLADAEAMVALKDLLNKLGSEDLYT 1022
+G+L WE+AL A+A+ L+ E+A + G LAD E+MVALKDLLNKLGSE+L
Sbjct: 61 K-NGKLKPATWEEALAAIAKALKGVKPDEIAAIAGDLADVESMVALKDLLNKLGSENLAC 119
Query: 1023 EYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELD 1082
E P DLR+NYL N IAG EEAD +LL+GTNPRFEAP+ NARIRK +L L
Sbjct: 120 EQDGP--DLPADLRSNYLFNTTIAGIEEADAVLLVGTNPRFEAPVLNARIRKAWLHGGLK 177
Query: 1083 VAYIGPKVDLRYDYEHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADI---SDGAA 1139
V IGP VDL YDY+HLG A ++ +ASG HPFSK L AKKP+I+VG+ DGAA
Sbjct: 178 VGVIGPPVDLTYDYDHLGTDAKTLQDIASGKHPFSKALKDAKKPMIIVGSGALARKDGAA 237
Query: 1140 VLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVAALDIGYKPGTSAIREK-PPKVLFL 1198
+LA V +LA K V W N+L +AAS+V ALD+G+ PG AIR+ PPKVL+L
Sbjct: 238 ILAAVAKLAKKN---GVVREGWNGFNVLHRAASRVGALDLGFVPGAGAIRKSGPPKVLYL 294
Query: 1199 LGADEGSISRDDVGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTL 1258
LGADE I + KD F++YQGHHGD GA IAD ILPGAAYTEK TYVNTEGR QQT
Sbjct: 295 LGADEIDI--TPIPKDAFVVYQGHHGDRGAQIADVILPGAAYTEKSGTYVNTEGRVQQTR 352
Query: 1259 TAVTPPGLAREDWKIIRALSE 1279
AV+PPG AREDWKI+RALSE
Sbjct: 353 KAVSPPGDAREDWKILRALSE 373
>gnl|CDD|223965 COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75
kD subunit (chain G) [Energy production and conversion].
Length = 693
Score = 469 bits (1208), Expect = e-149
Identities = 236/640 (36%), Positives = 314/640 (49%), Gaps = 115/640 (17%)
Query: 790 PVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFG 849
VA+CA PV G + TNSE ++AREGVMEFLL+NHPLDCP+CD+GGEC+LQD ++ +G
Sbjct: 58 LVASCATPVTDGMVISTNSEEVKKAREGVMEFLLINHPLDCPVCDKGGECELQDLAVKYG 117
Query: 850 ND---------------------------------------------LGTTGRGSDMQVG 864
LG RG + ++G
Sbjct: 118 VSHSRYRETKRTHRDKDLGPLVKYDMNRCILCTRCVRFCKEIAGTHELGVIKRGENSEIG 177
Query: 865 TYVEKLFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNIIVSTRT 924
TY+++ SEL GN +D+CPVGALTSKP++FTAR WE +KT S+ V AVG NI V R
Sbjct: 178 TYLDQPLESELCGNCVDVCPVGALTSKPFAFTARKWELKKTPSICVHCAVGCNIRVDERY 237
Query: 925 GEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQRLLTPFVRNCDGQLVATEWEDALIAVA 984
GEV R+LPR NE +NEEWL DKGRFAYDGL QRL P +R G+LV W +A A+A
Sbjct: 238 GEVRRILPRYNEVVNEEWLCDKGRFAYDGLNLQRLDRPKIRV-GGRLVEASWLEANEAIA 296
Query: 985 QKL---QTSEVAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGTDL---RAN 1038
Q L + +V + A E + ALK+L +LGS + E A D RA
Sbjct: 297 QALALIKPEKVGAIASPRASVEELFALKELAGELGSSN----IDHRQEDARLDPKVARAG 352
Query: 1039 YLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIGPKVDLRYDY-- 1096
YL N IA E AD +L+IG N R EAP+ RIRK L VA IG + Y
Sbjct: 353 YLYNPTIAEIESADAVLVIGANLRQEAPVLALRIRKAVKGKGLPVAVIGGVAEWLYALLL 412
Query: 1097 -EHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADI---SDGAAVLALVQQLAAKVT 1152
LG L+ +LA G+ + + A++ LI G + AA+LAL ++LA ++
Sbjct: 413 SILLGAGIALLDELALGAEAATAAVKKAERELIEKGKGALAGAKKAAILALAEKLADEL- 471
Query: 1153 CESDVPCDWKVLNILQKAASQVAALDIGYKPG-----TSAIREKPPK-VLFLLGADEGSI 1206
W +L +AAS+V AL +G+ PG + + L LLG DE
Sbjct: 472 --GAAEARWN-GVVLHEAASRVNALGLGFLPGLSGEDAALMLGPADANALLLLGIDE-EA 527
Query: 1207 SRDDVGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGL 1266
D F++Y HHGD GA +AD +LP A++TEK TYVN EGR Q+ A+ P G
Sbjct: 528 DEADEHAK-FVVYSDHHGDAGAEVADVVLPAASFTEKSGTYVNLEGRVQRFNQALRPGGD 586
Query: 1267 AREDWKIIRALS--------------------------EGVSSNLSSEPVDVK------- 1293
REDW+++ AL+ + S E
Sbjct: 587 EREDWRVLHALASELGLKLDFDQLDAVRAALAAAHPGFALIDSAAKGEREPPAGEGILAN 646
Query: 1294 --------QKNLEDYYMTDPISRASGTMAKCIQAVHKQKQ 1325
+D+Y+ P +R S AK +Q Q
Sbjct: 647 RANISVAEPLQPQDFYLAMPGARNSPQAAKKLQLEVGQAV 686
Score = 121 bits (306), Expect = 4e-28
Identities = 49/121 (40%), Positives = 64/121 (52%), Gaps = 36/121 (29%)
Query: 667 VFIDDKKVMVDPGTTVLQ------------------------------------PVAACA 690
+ ID K++ V G TVLQ VA+CA
Sbjct: 4 ITIDGKEIEVPEGETVLQAAREAGIDIPTFCYHPRLSIAGACRMCLVEVEGAPKLVASCA 63
Query: 691 MPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSRF 750
PV G + TNSE ++AREGVMEFLL+NHPLDCP+CD+GGEC+LQD ++ +G SR+
Sbjct: 64 TPVTDGMVISTNSEEVKKAREGVMEFLLINHPLDCPVCDKGGECELQDLAVKYGVSHSRY 123
Query: 751 T 751
Sbjct: 124 R 124
>gnl|CDD|239169 cd02768, MopB_NADH-Q-OR-NuoG2, MopB_NADH-Q-OR-NuoG2: The
NuoG/Nad11/75-kDa subunit (second domain) of the
NADH-quinone oxidoreductase (NADH-Q-OR)/respiratory
complex I/NADH dehydrogenase-1 (NDH-1). The NADH-Q-OR is
the first energy-transducting complex in the respiratory
chains of many prokaryotes and eukaryotes. Mitochondrial
complex I and its bacterial counterpart, NDH-1, function
as a redox pump that uses the redox energy to translocate
H+ ions across the membrane, resulting in a significant
contribution to energy production. The atomic structure
of complex I is not known and the mechanisms of electron
transfer and proton pumping are not established. The
nad11 gene codes for the largest (75-kDa) subunit of the
mitochondrial NADH:ubiquinone oxidoreductase, it
constitutes the electron input part of the enzyme, or the
so-called NADH dehydrogenase fragment. In Escherichia
coli, this subunit is encoded by the nuoG gene, and is
part of the 14 distinct subunits constituting the
'minimal' functional enzyme. The nad11 gene is
nuclear-encoded in animals, plants, and fungi, but is
still encoded in the mitochondrial genome of some
protists. The Nad11/NuoG subunit is made of two domains:
the first contains three binding sites for FeS clusters
(the fer2 domain), the second domain (this CD), is of
unknown function or, as postulated, has lost an ancestral
formate dehydrogenase activity that became redundant
during the evolution of the complex I enzyme. Although
only vestigial sequence evidence remains of a
molybdopterin binding site, this protein domain family
belongs to the molybdopterin_binding (MopB) superfamily
of proteins. Bacterial type II NADH-quinone
oxidoreductases and NQR-type sodium-motive NADH-quinone
oxidoreductases are not homologs of this domain family.
Length = 386
Score = 435 bits (1120), Expect = e-141
Identities = 186/394 (47%), Positives = 230/394 (58%), Gaps = 30/394 (7%)
Query: 906 ESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLK-RQRLLTPFV 964
ES+DV DA+GSNI V R GEV+R+LPR NE INEEW++DKGRF YDGL RQRL P +
Sbjct: 1 ESIDVHDALGSNIRVDVRGGEVMRILPRENEAINEEWISDKGRFGYDGLNSRQRLTQPLI 60
Query: 965 RNCDGQLVATEWEDALIAVAQKLQTS---EVAGVVGSLADAEAMVALKDLLNKLGSEDLY 1021
+ G+LV WE+AL VA+ L+ ++ G+ G AD E++ LK LLNKLGS ++
Sbjct: 61 KK-GGKLVPVSWEEALKTVAEGLKAVKGDKIGGIAGPRADLESLFLLKKLLNKLGSNNID 119
Query: 1022 TEYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNEL 1081
A LR NYL N IA EEAD +LLIG+N R EAPL NAR+RK
Sbjct: 120 HRLRQSDLPADNRLRGNYLFNTSIAEIEEADAVLLIGSNLRKEAPLLNARLRKAVKKKGA 179
Query: 1082 DVAYIGPK-----VDLRYDYEHLGESADLIKQLASGSH--PFSKKLSAAKKPLIVVGADI 1134
+A IGPK DL Y LG S + +A G H PF+K L AKKPLI++G+
Sbjct: 180 KIAVIGPKDTDLIADLTYPVSPLGASLATLLDIAEGKHLKPFAKSLKKAKKPLIILGSSA 239
Query: 1135 --SDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVAA--LDIGYKPGTSAIRE 1190
DGAA+L + LAAK+ + W LN+L +++ LD G +
Sbjct: 240 LRKDGAAILKALANLAAKLGTGAG---LWNGLNVLNSVGARLGGAGLDAGLALLEPGKAK 296
Query: 1191 KPPKVLFLLGADEGSISRDDVG-----KDCFIIYQGHHGDHGASIADAILPGAAYTEKQS 1245
L LLG DE S D F++YQGHHGD GA AD ILP AA+TEK
Sbjct: 297 -----LLLLGEDELDRSNPPAAVALAAADAFVVYQGHHGDTGAQ-ADVILPAAAFTEKSG 350
Query: 1246 TYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
TYVNTEGR Q+ AV+PPG AREDWKI+RALS
Sbjct: 351 TYVNTEGRVQRFKKAVSPPGDAREDWKILRALSN 384
>gnl|CDD|236386 PRK09129, PRK09129, NADH dehydrogenase subunit G; Validated.
Length = 776
Score = 381 bits (982), Expect = e-116
Identities = 204/712 (28%), Positives = 296/712 (41%), Gaps = 185/712 (25%)
Query: 666 EVFIDDKKVMVDPGTTVLQ------------------------------------PVAAC 689
E+ ID KKV V G+ V++ P+ AC
Sbjct: 3 EIEIDGKKVEVPEGSMVIEAADKAGIYIPRFCYHKKLSIAANCRMCLVEVEKAPKPLPAC 62
Query: 690 AMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSR 749
A PV G +V T SE +A++ VMEFLL+NHPLDCPICDQGGEC LQD ++ +G SR
Sbjct: 63 ATPVTDGMKVFTRSEKALKAQKSVMEFLLINHPLDCPICDQGGECQLQDLAVGYGRSTSR 122
Query: 750 FTDIDFSGKSGTYTRVTRQRSWGLIPSRVFLQSINRHWIGPVAACAMPVMKGWRVKTNSE 809
+T+ +R VF + +GP+ + +E
Sbjct: 123 YTE--------------EKRV-------VFDKD-----LGPLIS--------------TE 142
Query: 810 MTR--------RARE---GVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGNDLGTTGRG 858
MTR R + GVME G GRG
Sbjct: 143 MTRCIHCTRCVRFGQEIAGVMEL-------------------------------GMMGRG 171
Query: 859 SDMQVGTYVEKLFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNI 918
++ TYV K SELSGN+IDLCPVGALTSKP+ ++AR WE + +SV D++GSN+
Sbjct: 172 EHSEITTYVGKTVDSELSGNMIDLCPVGALTSKPFRYSARTWELSRRKSVSPHDSLGSNL 231
Query: 919 IVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVRNCDGQLVATEWE 977
+V + V+RV+PR NE +NE W++D+ RF+Y+GL + RL P ++ GQ +WE
Sbjct: 232 VVHVKNNRVMRVVPRENEAVNECWISDRDRFSYEGLNSEDRLTKPMIKQ-GGQWKEVDWE 290
Query: 978 DALIAVAQKLQ-------TSEVAGVVGSLADAEAMVALKDLLNKLGSEDL---YTEYAFP 1027
AL VA+ L+ ++ + + E + L+ L LGS ++ + F
Sbjct: 291 TALEYVAEGLKGIIEDHGADQIGALASPHSTLEELYLLQKLARGLGSGNIDHRLRQQDFR 350
Query: 1028 LEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIG 1087
+ A L IA D +L++G+N R E PL AR+R+ N ++ I
Sbjct: 351 DDAAAPGAPW---LGMPIAELSNLDAVLVVGSNLRKEHPLLAARLRQAAK-NGAKLSAIN 406
Query: 1088 PKV-DLRYDYEH---------LGESADLIKQLA---SGSHPFS--------------KKL 1120
P D + A + +A + P + + L
Sbjct: 407 PVDDDFLFPVAQRIIVAPSAWADALAGVAAAVAAAKGVALPEALAKVLAAAAARAIAQSL 466
Query: 1121 SAAKKPLIVVGADI---SDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVAAL 1177
+ ++ I++G A + AL Q +A L L +AA+ V A
Sbjct: 467 ANGERAAILLGNLAVNHPQAATLRALAQWIAKLTGA---------TLGFLTEAANSVGAH 517
Query: 1178 DIGYKPGTS-----AIREKPPKVLFLLGADEGSISRDDV-------GKDCFIIYQGHHGD 1225
G PG A+ +P K LL + D + +
Sbjct: 518 LAGALPGKGGLNAAAMLAQPRKAYLLLNVEPELDCADPAQARAALNQAEFVVALSAFASK 577
Query: 1226 HGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRAL 1277
AD +LP A +TE T+VN EGR Q V P G AR WK++R L
Sbjct: 578 ATLDYADVLLPIAPFTETSGTFVNAEGRVQSFKGVVRPLGEARPAWKVLRVL 629
>gnl|CDD|215888 pfam00384, Molybdopterin, Molybdopterin oxidoreductase.
Length = 358
Score = 344 bits (885), Expect = e-108
Identities = 164/357 (45%), Positives = 203/357 (56%), Gaps = 35/357 (9%)
Query: 958 RLLTPFVRNCDGQLVATEWEDALIAVAQKLQ---------TSEVAGVVGSLADAEAMVAL 1008
RL P VR +G+ V W++AL +A+KL+ + G G L D E++ AL
Sbjct: 1 RLKYPMVRRGEGKFVRVSWDEALDLIAKKLKRIIKKYGPDAIAINGGSGGLTDVESLYAL 60
Query: 1009 KDLLNKLGSEDLYTE-----YAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRF 1063
K LLN+LGS++ TE G+DLR+NYL NN IA E ADLILLIGTNPR
Sbjct: 61 KKLLNRLGSKNGNTEDHNGDLCTAAAAFGSDLRSNYLFNNSIADIENADLILLIGTNPRE 120
Query: 1064 EAPLFNARIRKGYLTNELDVAYIGPKVDLRYDYEHLGESAD-LIKQLASGSHPFSKKLSA 1122
EAP+ NARIRK L + V IGP++D Y EHLG + +G+H F K+L
Sbjct: 121 EAPILNARIRKAALKGKAKVIVIGPRLDETYADEHLGIKPGTDLALALAGAHVFIKELKK 180
Query: 1123 AK----KPLIVVGADI---SDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVA 1175
K KP+I+VGA + +DG A+ + LA W LNILQ AAS V
Sbjct: 181 DKDFAPKPIIIVGAGVLQRADGEAIFRAIANLADLTGNIGRPGGGWNGLNILQGAASPVG 240
Query: 1176 ALDIGYKPG------TSAIREKPPKVLFLLG-------ADEGSISRDDVGKDCFIIYQGH 1222
ALD+G PG +AI++ KVL+LLG ADE + + D F++Y GH
Sbjct: 241 ALDLGLVPGIKSVEMINAIKKGGIKVLYLLGNNPFVTHADENRVVKALEKLDLFVVYDGH 300
Query: 1223 HGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
HGD A AD ILP AAYTEK TYVNTEGR Q T AV PPG AREDWKI+RALSE
Sbjct: 301 HGDKTAKYADVILPAAAYTEKNGTYVNTEGRVQSTKQAVPPPGEAREDWKILRALSE 357
>gnl|CDD|239175 cd02774, MopB_Res-Cmplx1_Nad11-M, MopB_Res_Cmplx1_Nad11_M:
Mitochondrial-encoded NADH-quinone
oxidoreductase/respiratory complex I, the second domain
of the Nad11/75-kDa subunit of some protists.
NADH-quinone oxidoreductase is the first
energy-transducting complex in the respiratory chain and
functions as a redox pump that uses the redox energy to
translocate H+ ions across the membrane, resulting in a
significant contribution to energy production. The nad11
gene codes for the largest (75-kDa) subunit of the
mitochondrial NADH-quinone oxidoreductase, it constitutes
the electron input part of the enzyme, or the so-called
NADH dehydrogenase fragment. The Nad11 subunit is made of
two domains: the first contains three binding sites for
FeS clusters (the fer2 domain), the second domain (this
CD), is of unknown function or, as postulated, has lost
an ancestral formate dehydrogenase activity that became
redundant during the evolution of the complex I enzyme.
Although only vestigial sequence evidence remains of a
molybdopterin binding site, this protein domain belongs
to the molybdopterin_binding (MopB) superfamily of
proteins.
Length = 366
Score = 255 bits (654), Expect = 8e-76
Identities = 120/375 (32%), Positives = 197/375 (52%), Gaps = 16/375 (4%)
Query: 906 ESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQRLLTPFVR 965
ES+DVLD++GSNI V + E+LR+LP++N+++NEEW++DK RF+YD LK QR+ TP ++
Sbjct: 1 ESIDVLDSLGSNIRVDIKGNEILRILPKINDELNEEWISDKIRFSYDSLKYQRIKTPLLK 60
Query: 966 NCDGQLVATEWEDALIAVAQKLQT---SEVAGVVGSLADAEAMVALKDLLNKLGSEDLYT 1022
+ + W+ A + + + S++ ++GS D E + K LLNKLGS + +
Sbjct: 61 LSNNSFLEIGWKTAFKFLNKFILLKKFSKLNFIIGSKIDLETLFYYKKLLNKLGSLNTNS 120
Query: 1023 EYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELD 1082
NYL NN + +++DL LLIG+N R E+P+ N R+R Y
Sbjct: 121 NNFLENNNYFNLDLENYLFNNSLKNLDKSDLCLLIGSNLRVESPILNIRLRNRYNKGNKK 180
Query: 1083 VAYIGPKVDLRYDYEHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADISDGAAVLA 1142
+ IG K D Y +H+G S + + ++ G H F K+L +KKPLI++G+ S
Sbjct: 181 IFVIGNKFDTTYPSKHIGLSLNTLLKILEGKHLFCKQLKKSKKPLIIIGSSFSLRKNYSF 240
Query: 1143 LVQQLAAKVTCESDVPCDWKVLNILQKAASQVAALDIGYKPGTSAIREKPPKVLFLLGAD 1202
++ +L S+ N L ++ + L I K ++ +K L+ +
Sbjct: 241 IISKLK---NFSSNNE---NNFNFLNIISNSLYYLGI-KKFNSN--NKKNLSNLYYIKET 291
Query: 1203 EGSISRDDVGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVT 1262
K+ F+IYQGHH + A+ ++ ILP + EK++ Y+N EG Q+T ++
Sbjct: 292 NFQK----FNKNNFVIYQGHHFLNLANNSNLILPSKTFFEKEALYLNLEGILQKTKKILS 347
Query: 1263 PPGLAREDWKIIRAL 1277
+ D II +L
Sbjct: 348 FKENIKSDDNIIFSL 362
>gnl|CDD|238218 cd00368, Molybdopterin-Binding, Molybdopterin-Binding (MopB) domain
of the MopB superfamily of proteins, a large, diverse,
heterogeneous superfamily of enzymes that, in general,
bind molybdopterin as a cofactor. The MopB domain is
found in a wide variety of molybdenum- and
tungsten-containing enzymes, including formate
dehydrogenase-H (Fdh-H) and -N (Fdh-N), several forms of
nitrate reductase (Nap, Nas, NarG), dimethylsulfoxide
reductase (DMSOR), thiosulfate reductase,
formylmethanofuran dehydrogenase, and arsenite oxidase.
Molybdenum is present in most of these enzymes in the
form of molybdopterin, a modified pterin ring with a
dithiolene side chain, which is responsible for ligating
the Mo. In many bacterial and archaeal species,
molybdopterin is in the form of a dinucleotide, with two
molybdopterin dinucleotide units per molybdenum. These
proteins can function as monomers, heterodimers, or
heterotrimers, depending on the protein and organism.
Also included in the MopB superfamily is the
eukaryotic/eubacterial protein domain family of the
75-kDa subunit/Nad11/NuoG (second domain) of respiratory
complex 1/NADH-quinone oxidoreductase which is postulated
to have lost an ancestral formate dehydrogenase activity
and only vestigial sequence evidence remains of a
molybdopterin binding site.
Length = 374
Score = 219 bits (559), Expect = 6e-63
Identities = 111/396 (28%), Positives = 159/396 (40%), Gaps = 46/396 (11%)
Query: 906 ESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLK-RQRLLTPFV 964
SV VG I+V + G+V+R+ N +NE L DKGR DGL RL P +
Sbjct: 1 PSVCPFCGVGCGILVYVKDGKVVRIEGDPNHPVNEGRLCDKGRAGLDGLYSPDRLKYPLI 60
Query: 965 R-NCDGQLVATEWEDALIAVAQKLQ-------TSEVAGVVGSLADAEAMVALKDLLNKLG 1016
R G+ V W++AL +A+KL+ +A G A E L+ LL LG
Sbjct: 61 RVGGRGKFVPISWDEALDEIAEKLKEIREKYGPDAIAFYGGGGASNEEAYLLQKLLRALG 120
Query: 1017 SEDLYTEYAFPLEGAGTDLRA--NYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRK 1074
S ++ + A L+A N +A E ADLILL G+NP P+ AR+R+
Sbjct: 121 SNNVDSHARLCHASAVAALKAFGGGAPTNTLADIENADLILLWGSNPAETHPVLAARLRR 180
Query: 1075 GYLTNELDVAYIGPKVDLRYDYEHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADI 1134
+ I P AD + G+ L+ A+ + G
Sbjct: 181 AKK-RGAKLIVIDP------RRTETAAKADEWLPIRPGTDAA---LALAEWAAEITGVP- 229
Query: 1135 SDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQK---------AASQVAAL--DIGYKP 1183
A + + + A L + A + +AAL +IG
Sbjct: 230 ---AETIRALAREFAAA-----KRAVILWGMGLTQHTNGTQNVRAIANLAALTGNIGRPG 281
Query: 1184 GTSAIREKPPKVLFLLGADEGSISRDDVGKDCFIIYQGHHGDHGASIADAILPGAAYTEK 1243
G P + D + D ++ + A AD +LP A Y EK
Sbjct: 282 GGLGPGGNP----LVSAPDANRVRAALKKLDFVVVIDIFMTETAA-YADVVLPAATYLEK 336
Query: 1244 QSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
+ TY NTEGR Q AV PPG AR DW+I+R L++
Sbjct: 337 EGTYTNTEGRVQLFRQAVEPPGEARSDWEILRELAK 372
>gnl|CDD|236118 PRK07860, PRK07860, NADH dehydrogenase subunit G; Validated.
Length = 797
Score = 209 bits (534), Expect = 6e-56
Identities = 121/369 (32%), Positives = 158/369 (42%), Gaps = 85/369 (23%)
Query: 790 PVAACAMPVMKGWRVKT--NSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMA 847
P A+C V G VKT S + +A+ GVME LL+NHPLDCP+CD+GGEC L Q+ A
Sbjct: 61 PQASCTTTVTDGMVVKTQLTSPVADKAQHGVMELLLINHPLDCPVCDKGGECPL--QNQA 118
Query: 848 FGNDLGTT------------------------------------------------GRGS 859
N + RG+
Sbjct: 119 MSNGRAESRFTDVKRTFPKPINISTQVLLDRERCVLCARCTRFSDQIAGDPFIDLQERGA 178
Query: 860 DMQVGTYVEKLFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNII 919
QVG Y + F S SGN + +CPVGALT Y F ARP++ T SV A G
Sbjct: 179 LQQVGIYEGEPFQSYFSGNTVQICPVGALTGAAYRFRARPFDLVSTPSVCEHCASGCAQR 238
Query: 920 VSTRTGEVLRVLPRLNEDINEEWLADKGRFAYD-GLKRQRLLTPFVRNCDGQLVATEWED 978
R G+VLR L + ++NEEW DKGR+A+ + R+ TP VR+ DG+L W +
Sbjct: 239 TDHRRGKVLRRLAGDDPEVNEEWNCDKGRWAFTYATQPDRITTPLVRDEDGELEPASWSE 298
Query: 979 ALIAVAQKLQTSEVAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGT---DL 1035
AL A+ L + G VG L V ED Y F GT D
Sbjct: 299 ALAVAARGL--AAARGRVGVLVGGRLTV-----------EDAYAYAKFARVALGTNDIDF 345
Query: 1036 RAN-------YLLNNKIAGA---------EEADLILLIGTNPRFEAPLFNARIRKGYLTN 1079
RA L ++AG E+A +LL+G P E+P+ R+RK +
Sbjct: 346 RARPHSAEEADFLAARVAGRGLGVTYADLEKAPAVLLVGFEPEEESPIVFLRLRKAARKH 405
Query: 1080 ELDVAYIGP 1088
L V I P
Sbjct: 406 GLKVYSIAP 414
Score = 116 bits (292), Expect = 3e-26
Identities = 39/71 (54%), Positives = 50/71 (70%), Gaps = 2/71 (2%)
Query: 685 PVAACAMPVMKGWRVKT--NSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMA 742
P A+C V G VKT S + +A+ GVME LL+NHPLDCP+CD+GGEC LQ+Q+M+
Sbjct: 61 PQASCTTTVTDGMVVKTQLTSPVADKAQHGVMELLLINHPLDCPVCDKGGECPLQNQAMS 120
Query: 743 FGSDRSRFTDI 753
G SRFTD+
Sbjct: 121 NGRAESRFTDV 131
Score = 34.2 bits (79), Expect = 0.53
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 1231 ADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALS 1278
AD +LP A EK T++N EGR + A+ G A D +++ AL+
Sbjct: 596 ADVVLPVAPVAEKAGTFLNWEGRLRPFEAALRTTG-ALSDLRVLDALA 642
>gnl|CDD|239173 cd02772, MopB_NDH-1_NuoG2, MopB_NDH-1_NuoG2: The second domain of the
NuoG subunit of the NADH-quinone oxidoreductase/NADH
dehydrogenase-1 (NDH-1), found in beta- and
gammaproteobacteria. The NDH-1 is the first
energy-transducting complex in the respiratory chain and
functions as a redox pump that uses the redox energy to
translocate H+ ions across the membrane, resulting in a
significant contribution to energy production. In
Escherichia coli NDH-1, the largest subunit is encoded by
the nuoG gene, and is part of the 14 distinct subunits
constituting the functional enzyme. The NuoG subunit is
made of two domains: the first contains three binding
sites for FeS clusters (the fer2 domain), the second
domain (this CD), is of unknown function or, as
postulated, has lost an ancestral formate dehydrogenase
activity that became redundant during the evolution of
the complex I enzyme. Although only vestigial sequence
evidence remains of a molybdopterin binding site, this
protein domain belongs to the molybdopterin_binding
(MopB) superfamily of proteins.
Length = 414
Score = 184 bits (469), Expect = 1e-50
Identities = 118/423 (27%), Positives = 181/423 (42%), Gaps = 66/423 (15%)
Query: 907 SVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVR 965
SV DA+GSN++V + +V+RV+PR NE INE WL+D+ RF+Y+GL + RL P ++
Sbjct: 2 SVSPHDALGSNLVVHVKNNKVMRVVPRENEAINECWLSDRDRFSYEGLNSEDRLTKPMIK 61
Query: 966 NCDGQLVATEWEDALIAVAQKLQT---SEVAGVVGSLADA----EAMVALKDLLNKLGSE 1018
DGQ +WE AL VA+ L A +G+LA E + L+ L LGS+
Sbjct: 62 K-DGQWQEVDWETALEYVAEGLSAIIKKHGADQIGALASPHSTLEELYLLQKLARGLGSD 120
Query: 1019 DLYT---EYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKG 1075
++ + F + L IA E D +L+IG+N R E PL R+R+
Sbjct: 121 NIDHRLRQSDFRDDAK---ASGAPWLGMPIAEISELDRVLVIGSNLRKEHPLLAQRLRQA 177
Query: 1076 YLTNELDVAYIGP-KVDLRYDYEH------------LGESADLIKQLASGSHPFSKK--- 1119
++ I P D + L + A + + + P
Sbjct: 178 V-KKGAKLSAINPADDDFLFPLSGKAIVAPSALANALAQVAKALAEEKGLAVPDEDAKVE 236
Query: 1120 -----------LSAAKKPLIVVGADI---SDGAAVLALVQQLAAKVTCESDVPCDWKVLN 1165
L +A++ + +G A + AL Q++ AK+T + L
Sbjct: 237 ASEEARKIAASLVSAERAAVFLGNLAQNHPQAATLRALAQEI-AKLTGAT--------LG 287
Query: 1166 ILQKAASQVAALDIGYKPGTS----AIREKPPKVLFLLGA------DEGSISRDDVGKDC 1215
+L + A+ V A G P A+ E+P K LL + + + +
Sbjct: 288 VLGEGANSVGAYLAGALPHGGLNAAAMLEQPRKAYLLLNVEPELDCANPAQALAALNQAE 347
Query: 1216 FIIYQGHHGDHGA-SIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKII 1274
F++ AD +LP A +TE T+VN EGR Q V P G AR WK++
Sbjct: 348 FVVALSAFASAALLDYADVLLPIAPFTETSGTFVNLEGRVQSFKGVVKPLGEARPAWKVL 407
Query: 1275 RAL 1277
R L
Sbjct: 408 RVL 410
>gnl|CDD|236170 PRK08166, PRK08166, NADH dehydrogenase subunit G; Validated.
Length = 791
Score = 180 bits (459), Expect = 1e-46
Identities = 139/563 (24%), Positives = 210/563 (37%), Gaps = 97/563 (17%)
Query: 791 VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQD------- 843
V +C P G + + + R V+E+L+ NHP DCP+C++GG C LQD
Sbjct: 64 VMSCMTPATDGTFISIDDPEAKAFRASVVEWLMTNHPHDCPVCEEGGNCHLQDMTVMTGH 123
Query: 844 --------------------------------------QSMAFGNDLGTTGRGSDMQVGT 865
+ A G DLG G ++ G
Sbjct: 124 SFRRYRFTKRTHRNQDLGPFISHEMNRCIACYRCVRYYKDYAGGTDLGVYGAHDNVYFGR 183
Query: 866 YVEKLFLSELSGNVIDLCPVGALTSKPYSF-TARPWETRKTESVDVLDAVGSNIIVSTRT 924
+ SE SGN++++CP G T K +S R W+ + S+ +VG NI R
Sbjct: 184 PEDGTLESEFSGNLVEVCPTGVFTDKTHSERYNRKWDMQFAPSICQHCSVGCNISPGERY 243
Query: 925 GEVLRVLPRLNEDINEEWLADKGRFAYDGLKR-QRLLTPFVRNCDGQLVATEWEDALIAV 983
GE+ R+ R N +N +L D+GRF Y + R P R D + + AL
Sbjct: 244 GELRRIENRYNGAVNGYFLCDRGRFGYGYVNLKDRPRQPLQRRGDD-FITLNADQALQGA 302
Query: 984 AQKL-QTSEVAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGTDLRANYLLN 1042
A L Q +V G+ A E+ AL++L +G+E+ YT A E L L
Sbjct: 303 ADILRQAKKVIGIGSPRASLESNFALREL---VGAENFYTGIAAG-EQERLQLALKVLRE 358
Query: 1043 NKI-----AGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIGPKVDLRYD-- 1095
I E D +L++G + A +R+ ++A KV
Sbjct: 359 GGIYTPSLREIESYDAVLVLGEDLTQTAARVALAVRQAVKGKAREMA-AAQKVADWQIAA 417
Query: 1096 YEHLGESADLIKQLASGSHP-FSKKLSAAKKPLIVVGADISDGAAVLALVQQLAAKVTCE 1154
++G+ A P F L+ AKKPLI+ G A + A A
Sbjct: 418 VRNIGQRA---------KSPLFITNLAGAKKPLIISGTSAGSPAIIEAAANVAKALKGRG 468
Query: 1155 SDVPCDWKVLNILQKAASQVAALDIGYKPGT--SAIREKPPKVLFLLGAD---EGSISRD 1209
+DV + + +A S AL G + + +L D +R
Sbjct: 469 ADV----GITLVAPEANSMGLALLGGGSLEEALEELESGRADAVIVLENDLYRHAPAARV 524
Query: 1210 DV---GKDCFIIYQGHHGDHGASI----ADAILPGAAYTEKQSTYVNTEGRAQQTLTAVT 1262
D I+ DH + A +LP A++ E T VN EGRAQ+
Sbjct: 525 DAALAKAPLVIVL-----DHQRTATMEKAHLVLPAASFAESDGTLVNNEGRAQRFFQVYD 579
Query: 1263 PPG-----LAREDWKIIRALSEG 1280
P + E W+ + +L
Sbjct: 580 PAYYDSKTVMLESWRWLHSLHST 602
Score = 77.3 bits (191), Expect = 3e-14
Identities = 30/124 (24%), Positives = 48/124 (38%), Gaps = 41/124 (33%)
Query: 668 FIDDKKVMVDPGTTVLQ---------P----------VAAC---AM-------------- 691
+D K+ V+ +L+ P V AC A+
Sbjct: 5 HVDGKEYEVNGADNLLEACLSLGIDIPYFCWHPALGSVGACRQCAVKQYQNPEDTRGRLV 64
Query: 692 -----PVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSD 746
P G + + + R V+E+L+ NHP DCP+C++GG C LQD ++ G
Sbjct: 65 MSCMTPATDGTFISIDDPEAKAFRASVVEWLMTNHPHDCPVCEEGGNCHLQDMTVMTGHS 124
Query: 747 RSRF 750
R+
Sbjct: 125 FRRY 128
>gnl|CDD|239172 cd02771, MopB_NDH-1_NuoG2-N7, MopB_NDH-1_NuoG2-N7: The second domain
of the NuoG subunit (with a [4Fe-4S] cluster, N7) of the
NADH-quinone oxidoreductase/NADH dehydrogenase-1 (NDH-1)
found in various bacteria. The NDH-1 is the first
energy-transducting complex in the respiratory chain and
functions as a redox pump that uses the redox energy to
translocate H+ ions across the membrane, resulting in a
significant contribution to energy production. In
Escherichia coli NDH-1, the largest subunit is encoded by
the nuoG gene, and is part of the 14 distinct subunits
constituting the functional enzyme. The NuoG subunit is
made of two domains: the first contains three binding
sites for FeS clusters (the fer2 domain), the second
domain (this CD), is of unknown function or, as
postulated, has lost an ancestral formate dehydrogenase
activity that became redundant during the evolution of
the complex I enzyme. Unique to this group, compared to
the other prokaryotic and eukaryotic groups in this
domain protein family (NADH-Q-OR-NuoG2), is an N-terminal
[4Fe-4S] cluster (N7/N1c) present in the second domain.
Although only vestigial sequence evidence remains of a
molybdopterin binding site, this protein domain belongs
to the molybdopterin_binding (MopB) superfamily of
proteins.
Length = 472
Score = 131 bits (331), Expect = 5e-32
Identities = 109/431 (25%), Positives = 164/431 (38%), Gaps = 73/431 (16%)
Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLK-RQRLLTPFVRNCDGQL 971
+VG NI + R GE+ RV R N +N +L D+GRF Y + R RL P +R G L
Sbjct: 8 SVGCNISLGERYGELRRVENRYNGAVNHYFLCDRGRFGYGYVNSRDRLTQPLIRR-GGTL 66
Query: 972 VATEWEDALIAVAQKLQTSE--VAGVVGSLADAEAMVALKDLLNK-LGSEDLYTEYAFPL 1028
V W +AL A +L+ ++ V G+ A E+ AL+ L+ LG+ ++
Sbjct: 67 VPVSWNEALDVAAARLKEAKDKVGGIGSPRASNESNYALQKLVGAVLGTNNVDHR---AR 123
Query: 1029 EGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKG------------- 1075
LR + + E AD +L++G + AP +R+
Sbjct: 124 RLIAEILRNGPIYIPSLRDIESADAVLVLGEDLTQTAPRIALALRQAARRKAVELAALSG 183
Query: 1076 --------------------YLTNELDVAY--IGPK--VDLRYDYEHLGES-ADLIKQLA 1110
++ N L I + LG + A + A
Sbjct: 184 IPKWQDAAVRNIAQGAKSPLFIVNALATRLDDIAAESIRASPGGQARLGAALARAVDASA 243
Query: 1111 SGSHPFSKK---------LSAAKKPLIVVGADISDGAAVLALVQQLAAKVTCESDVPCDW 1161
+G + K L+ AKKPLIV G + A AK
Sbjct: 244 AGVSGLAPKEKAARIAARLTGAKKPLIVSGTLSGSLELIKAAAN--LAKALKRRGENAGL 301
Query: 1162 KVLNILQKAASQVAAL--DIGYKPGTS------AIREKPPKVLFLLGAD-EGSISRDDV- 1211
+ +++ S L +PG A+ + L +LG D S V
Sbjct: 302 TLA--VEEGNSPGLLLLGGHVTEPGLDLDGALAALEDGSADALIVLGNDLYRSAPERRVE 359
Query: 1212 ---GKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAV-TPPGLA 1267
F++ H A AD +LP A++ EK T+VN EGRAQ+ A P G A
Sbjct: 360 AALDAAEFVVVLDHFLTETAERADVVLPAASFAEKSGTFVNYEGRAQRFFKAYDDPAGDA 419
Query: 1268 REDWKIIRALS 1278
R DW+ + AL+
Sbjct: 420 RSDWRWLHALA 430
>gnl|CDD|225918 COG3383, COG3383, Uncharacterized anaerobic dehydrogenase [General
function prediction only].
Length = 978
Score = 116 bits (293), Expect = 3e-26
Identities = 95/393 (24%), Positives = 144/393 (36%), Gaps = 118/393 (30%)
Query: 791 VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQG-GECDLQDQSMAFG 849
V +C+ PV G V+TNSE + AR M+ +L NHPL C +CD G+C+L + MA G
Sbjct: 61 VRSCSTPVEDGMVVRTNSERVKEARREAMDRILSNHPLYCTVCDNNNGDCELHNMVMALG 120
Query: 850 ND-----------------------------------------------LGTTGRGSDMQ 862
L RG D +
Sbjct: 121 LTEQRYPYEEKNPPYPKDESNPFYIYDPNQCILCGRCVEACQEVQVNEALTIDWRGEDPR 180
Query: 863 VGTYVEKLFLSELS----GNVIDLCPVGALTSKPYSFTAR-------------------- 898
V + + ++E S G + +CPV AL K A
Sbjct: 181 V-IWDNDVPINESSCVSCGACVTVCPVNALMEKSMLGEAGYLTGINEDTLEPMIDLVKKV 239
Query: 899 -PW-----------------ETRKTESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINE 940
P +KT++V VG + V T+ E+L+V P N
Sbjct: 240 EPEYGPIFAISEIEAAMRETRIKKTKTVCTYCGVGCSFEVWTKGREILKVQPDPEGPANR 299
Query: 941 EWLADKGRFAYDGLK-RQRLLTPFVRNCDGQLVATEWEDALIAVAQKL-QTSEVAG--VV 996
KG+F +D + R R+ P +R + WE+AL VA +L + E G +
Sbjct: 300 ISTCVKGKFGWDFVNSRDRITKPLIRE-GDRFREVSWEEALDLVASRLREIKEKYGGDSI 358
Query: 997 GSLADAEAMVALKDLLNKL-------GSEDLYTEYA--------FPLEGAGTDLRANYLL 1041
G +A ++ L+ KL + D + Y F G+G D
Sbjct: 359 GFIASSKCTNEENYLMQKLARQVFGTNNVDNCSRYCQSPATDGLFRTVGSGAD------- 411
Query: 1042 NNKIAGAEEADLILLIGTNPRFEAPLFNARIRK 1074
+ I E ADL+L+IG NP P+ +R+++
Sbjct: 412 SGSIEDVEGADLVLIIGANPTEGHPVLASRLKR 444
Score = 73.6 bits (181), Expect = 5e-13
Identities = 37/121 (30%), Positives = 53/121 (43%), Gaps = 35/121 (28%)
Query: 667 VFIDDKKVMVDPGTTVLQ---------P-------------------------VAACAMP 692
V ID + + V+ GTT+L+ P V +C+ P
Sbjct: 8 VTIDGRSIEVEEGTTILRAANRNGIEIPHICYHESLGPIGSCDTCLVEIDGKLVRSCSTP 67
Query: 693 VMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQG-GECDLQDQSMAFGSDRSRFT 751
V G V+TNSE + AR M+ +L NHPL C +CD G+C+L + MA G R+
Sbjct: 68 VEDGMVVRTNSERVKEARREAMDRILSNHPLYCTVCDNNNGDCELHNMVMALGLTEQRYP 127
Query: 752 D 752
Sbjct: 128 Y 128
Score = 58.6 bits (142), Expect = 2e-08
Identities = 21/52 (40%), Positives = 32/52 (61%)
Query: 1228 ASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
A+ AD +LP +A EK T+ NTE R Q+ + P G +R DW+II+ ++
Sbjct: 690 ANYADVVLPASASLEKDGTFTNTERRIQRLYKVLEPLGDSRPDWEIIQEVAN 741
>gnl|CDD|204520 pfam10588, NADH-G_4Fe-4S_3, NADH-ubiquinone oxidoreductase-G
iron-sulfur binding region.
Length = 41
Score = 78.7 bits (195), Expect = 3e-18
Identities = 23/41 (56%), Positives = 29/41 (70%)
Query: 710 REGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSRF 750
R+ ++E LL NHPLDCP CD+ G C+LQD + G D SRF
Sbjct: 1 RKTILELLLANHPLDCPTCDKNGNCELQDLAYELGVDESRF 41
Score = 72.1 bits (178), Expect = 6e-16
Identities = 20/37 (54%), Positives = 26/37 (70%)
Query: 815 REGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGND 851
R+ ++E LL NHPLDCP CD+ G C+LQD + G D
Sbjct: 1 RKTILELLLANHPLDCPTCDKNGNCELQDLAYELGVD 37
>gnl|CDD|214918 smart00929, NADH-G_4Fe-4S_3, NADH-ubiquinone oxidoreductase-G
iron-sulfur binding region.
Length = 41
Score = 74.5 bits (184), Expect = 1e-16
Identities = 22/41 (53%), Positives = 30/41 (73%)
Query: 710 REGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSRF 750
R+ ++E LL NHPLDCP+CD+ GEC+LQD + G D R+
Sbjct: 1 RKTILELLLANHPLDCPVCDKNGECELQDLAYELGVDEQRY 41
Score = 68.4 bits (168), Expect = 1e-14
Identities = 20/35 (57%), Positives = 27/35 (77%)
Query: 815 REGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFG 849
R+ ++E LL NHPLDCP+CD+ GEC+LQD + G
Sbjct: 1 RKTILELLLANHPLDCPVCDKNGECELQDLAYELG 35
>gnl|CDD|140326 PTZ00305, PTZ00305, NADH:ubiquinone oxidoreductase; Provisional.
Length = 297
Score = 77.8 bits (191), Expect = 3e-15
Identities = 37/79 (46%), Positives = 51/79 (64%), Gaps = 3/79 (3%)
Query: 674 VMVDPGTTVLQPVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGE 733
V VD GT L V +CA + G + T+S + R AREG +E +L+NHP DCPIC+Q
Sbjct: 118 VQVD-GTQNL--VVSCATVALPGMSIITDSRLVRDAREGNVELILINHPNDCPICEQATN 174
Query: 734 CDLQDQSMAFGSDRSRFTD 752
CDLQ+ SM +G+D R+ +
Sbjct: 175 CDLQNVSMNYGTDIPRYKE 193
Score = 75.5 bits (185), Expect = 2e-14
Identities = 30/62 (48%), Positives = 42/62 (67%)
Query: 791 VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGN 850
V +CA + G + T+S + R AREG +E +L+NHP DCPIC+Q CDLQ+ SM +G
Sbjct: 127 VVSCATVALPGMSIITDSRLVRDAREGNVELILINHPNDCPICEQATNCDLQNVSMNYGT 186
Query: 851 DL 852
D+
Sbjct: 187 DI 188
>gnl|CDD|239154 cd02753, MopB_Formate-Dh-H, Formate dehydrogenase H (Formate-Dh-H)
catalyzes the reversible oxidation of formate to CO2 with
the release of a proton and two electrons. It is a
component of the anaerobic formate hydrogen lyase
complex. The E. coli formate dehydrogenase H (Fdh-H) is a
monomer composed of a single polypeptide chain with a Mo
active site region and a [4Fe-4S] center. Members of the
MopB_Formate-Dh-H CD belong to the molybdopterin_binding
(MopB) superfamily of proteins.
Length = 512
Score = 64.5 bits (158), Expect = 2e-10
Identities = 45/198 (22%), Positives = 73/198 (36%), Gaps = 61/198 (30%)
Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLK-RQRLLTPFVRNCDGQL 971
VG + + + +++ V P +N L KGRF +D + + RL P +R +G+
Sbjct: 8 GVGCGLELWVKDNKIVGVEPVKGHPVNRGKLCVKGRFGFDFVNSKDRLTKPLIRK-NGKF 66
Query: 972 VATEWEDALIAVAQKLQT-------SEVAGVVGS-------------------------- 998
V W++AL VA +L+ +A +
Sbjct: 67 VEASWDEALSLVASRLKEIKDKYGPDAIAFFGSAKCTNEENYLFQKLARAVGGTNNVDHC 126
Query: 999 --LADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGTDLRANYLLNNKIAGAEEADLILL 1056
L + + L + L GA T N IA EEAD+IL+
Sbjct: 127 ARLCHSPTVAGLAETLG---------------SGAMT---------NSIADIEEADVILV 162
Query: 1057 IGTNPRFEAPLFNARIRK 1074
IG+N P+ RI++
Sbjct: 163 IGSNTTEAHPVIARRIKR 180
Score = 63.0 bits (154), Expect = 7e-10
Identities = 26/52 (50%), Positives = 35/52 (67%)
Query: 1228 ASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
A +AD +LP A++ EK T+ NTE R Q+ AV PPG AR DW+II+ L+
Sbjct: 386 AELADVVLPAASFAEKDGTFTNTERRVQRVRKAVEPPGEARPDWEIIQELAN 437
>gnl|CDD|204199 pfam09326, DUF1982, Domain of unknown function (DUF1982). Members of
this family of functionally uncharacterized domains are
found in the C-terminal region of various prokaryotic
NADH dehydrogenases.
Length = 48
Score = 55.0 bits (133), Expect = 9e-10
Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 5/47 (10%)
Query: 1270 DWKIIRALSEGVSSNLSSEPVDVKQKNLEDYYMTDPISRASGTMAKC 1316
W + + G L P + ++D+Y+TDPISRAS TMA+C
Sbjct: 7 GWTALALSAAGGK--LGKAPF---RSPIKDFYLTDPISRASPTMAEC 48
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 59.3 bits (144), Expect = 1e-08
Identities = 46/268 (17%), Positives = 119/268 (44%), Gaps = 16/268 (5%)
Query: 2 KAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQL 61
K ++E+++L + S++ L E + QL ++ +E++ LA +L+ +
Sbjct: 685 KLEEELKSLKNELRSLEDLLEE-LRRQLEELERQLEELKRELAALEEELEQLQSRLEELE 743
Query: 62 NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQ 121
++ ++E++ L + ++ EL + L KE+++E + + L E +
Sbjct: 744 EELEELEEELEELQERLEELEEELES-LEEALAKL-KEEIEELEEKRQ--ALQEELEELE 799
Query: 122 KEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLD-FKAKCEVY 180
+E E L++ E+ +++E ++++++ E+ + LD + + E
Sbjct: 800 EELEEAERRLDALERELESLEQRRE-----RLEQEIEELEEEIEELEEKLDELEEELEEL 854
Query: 181 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH 240
E E ++LK EL + + + +E LKE +++E + +++ +L+ + + ++
Sbjct: 855 EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELR-----ELESELAELKEEIEK 909
Query: 241 LHTSKESLATRQPSEQSLIDSTTDTNCP 268
L E L + + + +
Sbjct: 910 LRERLEELEAKLERLEVELPELEEELEE 937
Score = 58.2 bits (141), Expect = 3e-08
Identities = 51/277 (18%), Positives = 125/277 (45%), Gaps = 17/277 (6%)
Query: 375 LEAEVKDSAGPMTSFLESSVTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTETQR 434
L+ ++ + +G +T + S + + L L+ EK+E ++L+ E +
Sbjct: 641 LDGDLVEPSGSITG-GSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRS 699
Query: 435 SRKLEEELQDTRQTCE---EVVRKEVSIVT--VTLFLERLRSSHSKRSKNADSLVRLQKA 489
L EEL+ + E E +++E++ + + RL + + + L LQ+
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQER 759
Query: 490 QDEIQTLLAKSASIKTELREFYQTQLNN--NVKEKLKEFQTQLDSAELTLHREFQQKEKT 547
+E++ L +L+E + ++E+L+E + +L+ AE L ++ E
Sbjct: 760 LEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESL 819
Query: 548 I--VENAAKQLKQISEKPGATKLMLD-FKAKCEVYEAENKKLKAELTQRDIKLKETNHLL 604
E ++++++ E+ + LD + + E E E ++LK EL + + + +E L
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDEL 879
Query: 605 KEESKKRELMAKKVESV------MQSQLQQAIHMITS 635
KE +++E + +++ + ++ ++++ +
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916
Score = 48.9 bits (117), Expect = 2e-05
Identities = 44/198 (22%), Positives = 91/198 (45%), Gaps = 26/198 (13%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
KE+IE LE Q + +LEEEL++ + + LER S +R
Sbjct: 778 KEEIEELEEKRQALQEELEELEEELEEAERRLDA--------------LERELESLEQRR 823
Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
+ + +++ ++EI+ L K ++ EL E + +KE+L+E + + + E L
Sbjct: 824 ERLEQ--EIEELEEEIEELEEKLDELEEELEEL--EKELEELKEELEELEAEKEELEDEL 879
Query: 538 --HREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDI 595
E +++ + + +L ++ E+ + + + E EA+ ++L+ EL + +
Sbjct: 880 KELEEEKEELEEELRELESELAELKEE------IEKLRERLEELEAKLERLEVELPELEE 933
Query: 596 KLKETNHLLKEESKKREL 613
+L+E E +RE+
Sbjct: 934 ELEEEYEDTLETELEREI 951
Score = 47.0 bits (112), Expect = 8e-05
Identities = 43/226 (19%), Positives = 104/226 (46%), Gaps = 26/226 (11%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
+++IE L+ L+ + +L+EEL + ++ EE+ + + ERL ++
Sbjct: 266 EKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR------ERLEELENELE 319
Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
+ + L L++ + ++ L + ++ EL QL ++E +E + +L + L
Sbjct: 320 ELEERLEELKEKIEALKEELEERETLLEEL-----EQLLAELEEAKEELEEKLSALLEEL 374
Query: 538 HREFQQKEKTI-------------VENAAKQLKQISEKPGATKLML-DFKAKCEVYEAEN 583
F+ + + +E ++++ + E+ L D K + + EAE
Sbjct: 375 EELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAEL 434
Query: 584 KKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 629
++L+ EL + + +L+E L+E + + + +++ + Q +LQ+
Sbjct: 435 EELQTELEELNEELEELEEQLEELRDRLKELERELAEL-QEELQRL 479
Score = 45.5 bits (108), Expect = 2e-04
Identities = 50/271 (18%), Positives = 121/271 (44%), Gaps = 21/271 (7%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
++ + E++ L + + ++ EL E Q +L ++A+ EI+ L ++ ++ EL E +
Sbjct: 235 KELRKELEELEEELSRLEEELEE-LQEEL---EEAEKEIEELKSELEELREELEELQEEL 290
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN------NVKEKLKEFQTQLDSAELT 114
L + ++EI+ L + + ++ E + +L +KEK++ + +L T
Sbjct: 291 L----ELKEEIEELEGEISLLRER-LEELENELEELEERLEELKEKIEALKEEL-EERET 344
Query: 115 LHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQI-SEKPGATKLMLDF 173
L E +Q AK+ + + ++ + + E ++L ++ +E + +
Sbjct: 345 LLEELEQLLA--ELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEEL 402
Query: 174 KAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQ--SQLQQAI 231
K + E E ++L L +LKE L+E + E + +++E + + +L+ +
Sbjct: 403 KREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRL 462
Query: 232 HMITSDVTHLHTSKESLATRQPSEQSLIDST 262
+ ++ L + L S ++ +D
Sbjct: 463 KELERELAELQEELQRLEKELSSLEARLDRL 493
Score = 44.7 bits (106), Expect = 3e-04
Identities = 44/215 (20%), Positives = 97/215 (45%), Gaps = 27/215 (12%)
Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
L KE+IE LE + +R +LE EL++ + EE+ K + E L
Sbjct: 290 LLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEA------LKEELEERE 343
Query: 474 SKRSKNADSLVRLQKAQDEIQTLL--------AKSASIKTELREFYQTQLNNNVKEKLKE 525
+ + L L++A++E++ L +++ EL E ++ +L+E
Sbjct: 344 TLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAEL--EAELAEIRNELEE 401
Query: 526 FQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKK 585
+ +++S E L ++ +E+ ++LK++ + + + + + E E ++
Sbjct: 402 LKREIESLEERL-----ERLSERLEDLKEELKELEAE------LEELQTELEELNEELEE 450
Query: 586 LKAELTQRDIKLKETNHLLKEESKKRELMAKKVES 620
L+ +L + +LKE L E ++ + + K++ S
Sbjct: 451 LEEQLEELRDRLKELERELAELQEELQRLEKELSS 485
Score = 42.0 bits (99), Expect = 0.003
Identities = 48/225 (21%), Positives = 92/225 (40%), Gaps = 15/225 (6%)
Query: 413 TLKISKEKIESLERNLQTETQR----SRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLER 468
LK E++ LQ E +LE E+ R+ EE+ + + ER
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEEL------EER 324
Query: 469 LRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNN--NVKEKLKEF 526
L K + L + +E++ LLA+ K EL E L + E L+E
Sbjct: 325 LEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREE 384
Query: 527 QTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKL 586
+L++ + E ++ ++ I + + + + + +A+ E + E ++L
Sbjct: 385 LAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEEL 444
Query: 587 KAELTQRDIKLKETNHLLKE-ESKKRELMAK--KVESVMQSQLQQ 628
EL + + +L+E LKE E + EL + ++E + S +
Sbjct: 445 NEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEAR 489
Score = 37.0 bits (86), Expect = 0.080
Identities = 37/201 (18%), Positives = 82/201 (40%), Gaps = 25/201 (12%)
Query: 434 RSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEI 493
R + E +L+ T + E + LE L K + A+ R Q+ + E+
Sbjct: 173 RKEEAERKLERTEENLER----------LEDLLEELEKQLEKLERQAEKAERYQELKAEL 222
Query: 494 Q----TLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHR---EFQQKEK 546
+ LL + E + +L + ++E+L+E Q +L+ AE + E ++ +
Sbjct: 223 RELELALLLAKLKELRKELEELEEEL-SRLEEELEELQEELEEAEKEIEELKSELEELRE 281
Query: 547 TIVENAA------KQLKQISEKPGATKLMLDFKAK-CEVYEAENKKLKAELTQRDIKLKE 599
+ E ++++++ + + L+ E E ++LK ++ +L+E
Sbjct: 282 ELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEE 341
Query: 600 TNHLLKEESKKRELMAKKVES 620
LL+E + + + E
Sbjct: 342 RETLLEELEQLLAELEEAKEE 362
Score = 31.6 bits (72), Expect = 3.6
Identities = 34/178 (19%), Positives = 73/178 (41%), Gaps = 22/178 (12%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQ----KAQDEIQTL---LAKSASIKTEL 53
++ ++EI+ L K ++ EL E + +L + + + E + L L + K EL
Sbjct: 831 EELEEEIEELEEKLDELEEELEE-LEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889
Query: 54 REFYQTQLNN-------NQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQT 106
E + + +K ++ ++ L AK ++ EL E + E +
Sbjct: 890 EEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELER 949
Query: 107 QLDSAELTLHREFQQKEKTIVENAAKQLKQISE--NEFKQKEKTIVENAAKQLKQISE 162
+++ E E + + A ++ +++ E E K + + + E K L+ I E
Sbjct: 950 EIERLE-----EEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEE 1002
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 58.9 bits (143), Expect = 2e-08
Identities = 21/65 (32%), Positives = 35/65 (53%)
Query: 686 VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGS 745
V +C +G + TN+ R +M+ VNHPL+C +CD+ GEC+LQ+ + G
Sbjct: 57 VYSCNTKAKEGMNILTNTPNLMDERNAIMQTYDVNHPLECGVCDKSGECELQNFTHEMGV 116
Query: 746 DRSRF 750
+ +
Sbjct: 117 NHQPY 121
Score = 58.6 bits (142), Expect = 2e-08
Identities = 20/53 (37%), Positives = 31/53 (58%)
Query: 791 VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQD 843
V +C +G + TN+ R +M+ VNHPL+C +CD+ GEC+LQ+
Sbjct: 57 VYSCNTKAKEGMNILTNTPNLMDERNAIMQTYDVNHPLECGVCDKSGECELQN 109
Score = 40.5 bits (95), Expect = 0.007
Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 3/79 (3%)
Query: 877 GNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNIIVSTRTGEVLRVLP---R 933
G I +CPVGAL+S + +T+ WE +K + + I + +L R
Sbjct: 210 GECIAVCPVGALSSSDFQYTSNAWELKKIPATCPHCSDCCLIYYDVKHSSILNQESKIYR 269
Query: 934 LNEDINEEWLADKGRFAYD 952
++ D L GRFA+D
Sbjct: 270 VSNDFYFNPLCGAGRFAFD 288
>gnl|CDD|239155 cd02754, MopB_Nitrate-R-NapA-like, Nitrate reductases, NapA
(Nitrate-R-NapA), NasA, and NarB catalyze the reduction
of nitrate to nitrite. Monomeric Nas is located in the
cytoplasm and participates in nitrogen assimilation.
Dimeric Nap is located in the periplasm and is coupled to
quinol oxidation via a membrane-anchored tetraheme
cytochrome. Members of the MopB_Nitrate-R-NapA CD belong
to the molybdopterin_binding (MopB) superfamily of
proteins.
Length = 565
Score = 58.0 bits (141), Expect = 3e-08
Identities = 22/52 (42%), Positives = 30/52 (57%)
Query: 1228 ASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
A AD +LP A++ EK+ T N+E R AV PPG AR DW I+ ++
Sbjct: 432 AEYADLVLPAASWGEKEGTMTNSERRVSLLRAAVEPPGEARPDWWILADVAR 483
Score = 46.8 bits (112), Expect = 6e-05
Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 1/79 (1%)
Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVRNCDGQL 971
VG + + + G+V+ V +N L KG + L RL P +R G+L
Sbjct: 8 GVGCGVEIGVKDGKVVAVRGDPEHPVNRGRLCIKGLNLHKTLNGPERLTRPLLRRNGGEL 67
Query: 972 VATEWEDALIAVAQKLQTS 990
V W++AL +A++ +
Sbjct: 68 VPVSWDEALDLIAERFKAI 86
>gnl|CDD|181037 PRK07569, PRK07569, bidirectional hydrogenase complex protein HoxU;
Validated.
Length = 234
Score = 55.4 bits (134), Expect = 3e-08
Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 4/69 (5%)
Query: 685 PVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLV--NHPLDCPICDQGGECDLQDQSMA 742
+ AC PV +G V+TN+ + R ++E L NH C +C G C+LQD ++
Sbjct: 60 LLPACVTPVAEGMVVQTNTPRLQEYRRMIVELLFAEGNHV--CAVCVANGNCELQDLAIE 117
Query: 743 FGSDRSRFT 751
G D RF
Sbjct: 118 VGMDHVRFP 126
Score = 49.3 bits (118), Expect = 4e-06
Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 4/64 (6%)
Query: 790 PVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLV--NHPLDCPICDQGGECDLQDQSMA 847
+ AC PV +G V+TN+ + R ++E L NH C +C G C+LQD ++
Sbjct: 60 LLPACVTPVAEGMVVQTNTPRLQEYRRMIVELLFAEGNHV--CAVCVANGNCELQDLAIE 117
Query: 848 FGND 851
G D
Sbjct: 118 VGMD 121
>gnl|CDD|130652 TIGR01591, Fdh-alpha, formate dehydrogenase, alpha subunit,
archaeal-type. This model describes a subset of formate
dehydrogenase alpha chains found mainly archaea but also
in alpha and gamma proteobacteria and a small number of
gram positive bacteria. The alpha chain contains domains
for molybdopterin dinucleotide binding and molybdopterin
oxidoreductase (pfam01568 and pfam00384, respectively).
The holo-enzyme also contains beta and gamma subunits.
The enzyme catalyzes the oxidation of formate (produced
from pyruvate during anaerobic growth) to carbon dioxide
with the concomitant release of two electrons and two
protons. The enzyme's purpose is to allow growth on
formate in some circumstances and, in the case of FdhH in
gamma proteobacteria, to pass electrons to hydrogenase
(by which process acid is neutralized). This model is
well-defined, with only a single fragmentary sequence
falling between trusted and noise. The alpha subunit of a
version of nitrate reductase is closely related.
Length = 671
Score = 55.2 bits (133), Expect = 2e-07
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 16/116 (13%)
Query: 1178 DIGYKPGTS------AIREKPPKVLFLLGADEGSISRDDVGK--------DCFIIYQGHH 1223
+ +PG A + K L+++G D S + K + ++ Q
Sbjct: 363 KLPAEPGLRIPEMIDAAADGDVKALYIMGEDP-LQSDPNTSKVRKALEKLELLVV-QDIF 420
Query: 1224 GDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
A AD +LP AA+ EK+ T+ N E R Q+ AV P G ++ DW+II+ L+
Sbjct: 421 MTETAKYADVVLPAAAWLEKEGTFTNAERRIQRFFKAVEPKGESKPDWEIIQELAN 476
Score = 48.6 bits (116), Expect = 2e-05
Identities = 39/174 (22%), Positives = 76/174 (43%), Gaps = 13/174 (7%)
Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVRNCDGQL 971
VG ++ + + G+++RV P N L KG FA++ + + RL TP +R +
Sbjct: 7 GVGCSLNLVVKDGKIVRVEPYQGHKANRGHLCVKGYFAWEFINSKDRLTTPLIRE-GDKF 65
Query: 972 VATEWEDALIAVAQKLQT-------SEVAGVVGS-LADAEAMVALKDLLNKLGSEDLYTE 1023
W++A+ +A+KL+ + + S + E + K +G+ ++
Sbjct: 66 REVSWDEAISYIAEKLKEIKEKYGPDSIGFIGSSRGTNEENYLLQKLARAVIGTNNVDNC 125
Query: 1024 YAFPLEGAGTDLRANY---LLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRK 1074
+ L+ ++N I+ E ADLI++IG NP P+ ++
Sbjct: 126 ARVCHGPSVAGLKQTVGIGAMSNTISEIENADLIVIIGYNPAESHPVVAQYLKN 179
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 48.5 bits (116), Expect = 1e-05
Identities = 37/166 (22%), Positives = 73/166 (43%), Gaps = 16/166 (9%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
QK +D++Q+ +K+ +++L + +N+ ++E + + + EL E +Q+
Sbjct: 56 QKEKDKLQSEHSKAILARSKLESLCRELQKHNKTLKEENKARSDEEEEKRKELTEKFQST 115
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
L + Q +E + + K + N ++EKLKE Q + E + +
Sbjct: 116 LKDIQDQMEE-----HSNPNHK-------LCEENEELREKLKELIEQYELREQHFEKLLK 163
Query: 121 QKEKTIVENAAK----QLKQISENEFKQKEKTIVENAAKQLKQISE 162
K+ N AK Q + E E ++EK I+ A Q+ + E
Sbjct: 164 TKDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKE 209
Score = 42.7 bits (101), Expect = 8e-04
Identities = 43/210 (20%), Positives = 74/210 (35%), Gaps = 63/210 (30%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
+ K+ESL R LQ ++ L+EE + R EE RKE++
Sbjct: 73 RSKLESLCRELQKH---NKTLKEENK-ARSDEEEEKRKELTE------------------ 110
Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
+ Q +IQ + + ++ +L E N ++EKLKE Q + E
Sbjct: 111 -------KFQSTLKDIQDQMEEHSNPNHKLCE-----ENEELREKLKELIEQYELREQHF 158
Query: 538 HREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKL 597
+ + K+ N AK E ++ + E +
Sbjct: 159 EKLLKTKDLEQQLNDAK------------------------LEQAQEQAEQEQEKHK--- 191
Query: 598 KETNHLLKEESKKRELMAKKVESVMQSQLQ 627
+E LLK + K+ E +++QL
Sbjct: 192 REKEILLK--EAAQVSTLKETEKELRAQLN 219
>gnl|CDD|223321 COG0243, BisC, Anaerobic dehydrogenases, typically
selenocysteine-containing [Energy production and
conversion].
Length = 765
Score = 47.5 bits (113), Expect = 5e-05
Identities = 31/119 (26%), Positives = 45/119 (37%), Gaps = 18/119 (15%)
Query: 1175 AALDIGYKPGTSAIREKPP--KVLFLLG-----ADEGSISRDDVG--KDCFIIYQGHH-G 1224
AA G A+ E P K LF+ G + + D F++
Sbjct: 413 AAAKPGLSTADRALLEGPYPIKALFVYGGNPVVSAPDDRNVKKALLRDDEFVVVIDIFMT 472
Query: 1225 DHGASIADAILPGAAYTEKQSTYVNTEGRA----QQTLTAVTPPGLAREDWKIIRALSE 1279
+ A AD +LP + EK Y N ++ V PPG AR D+ II L++
Sbjct: 473 ET-AKYADIVLPATTFLEKDDIYTNAGRSYVQLRRK---VVEPPGEARPDYWIIIELAK 527
Score = 35.2 bits (81), Expect = 0.26
Identities = 36/151 (23%), Positives = 49/151 (32%), Gaps = 59/151 (39%)
Query: 946 KGRFAYDGL-KRQRLLTPFVRN----CDGQLVATEWEDALIAVAQKLQTSEVAGVVGSLA 1000
KG + + RL P +R +G+ V W++AL +A KL
Sbjct: 84 KGARGRERVYSPDRLKYPLLRRVGKRGEGKFVRISWDEALDLIAAKL------------- 130
Query: 1001 DAEAMVALKDLLNKLGSEDLYTEYAFPLEG---AGTDLRANYLLNN-------------- 1043
L ++ GS L T E AG RA NN
Sbjct: 131 -------LPRIIGFYGSGALTTGN----EAGYLAGKLARAFLGGNNIDHCGRYCHAAAAA 179
Query: 1044 ---KIAGA----------EEADLILLIGTNP 1061
G+ E ADLI+L G+NP
Sbjct: 180 GLPYTFGSGAATGSYPDIENADLIVLWGSNP 210
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 47.1 bits (112), Expect = 7e-05
Identities = 78/571 (13%), Positives = 214/571 (37%), Gaps = 62/571 (10%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
+ + E + L + +++ L E L ++ +E++ L + S++ E + + +
Sbjct: 221 IQEEQEEEELEQEIEALEERLAE-----LEEEKERLEELKARLLEIESLELEALKIREEE 275
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAE--LTLHRE 118
L ++ +E++ + + ++ E+ E +E+L+ + L+ E L +
Sbjct: 276 LRELERLLEELEEKIERLEELEREIEEL---------EEELEGLRALLEELEELLEKLKS 326
Query: 119 FQQKEKTIVE--NAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAK 176
+++ + + E + + E + K + E + +++ E + L+ +
Sbjct: 327 LEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQ 386
Query: 177 CEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITS 236
E ++LK EL + L+E L+E K+ E +L++ + +
Sbjct: 387 ---LEEAIQELKEELAELSAALEEIQEELEELEKELE------------ELERELEELEE 431
Query: 237 DVTHLHTSKESLATRQPSEQSLIDSTTDTNCPVTQQYVPSSNQQYAPSSNQQYSIPTSNI 296
++ L L +++ L + CPV Q +P +++ Y + +
Sbjct: 432 EIKKLEEQINQLESKELMIAELAGA--GEKCPVCGQELPEEHEKELLE---LYELELEEL 486
Query: 297 NQQYSMFSADSTHMNIVERDIPNSADDMRKYIEMQYSMFSADSTHMNIVERDIPNSADDM 356
++ S + +I ++R+ E + + +E + +++
Sbjct: 487 EEELSREK----EEAELREEIEELEKELRELEEELIELLELEEALKEELEEKL-EKLENL 541
Query: 357 RKYIEMLLEKPPGNPVSDLEAEVKDSAGPMTSFLESSVTSDIFNEPALYTKTVRSVTLKI 416
+ +E L EK + L+ E++ + E + + +R ++
Sbjct: 542 LEELEELKEKL---QLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKEL 598
Query: 417 SK--EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHS 474
K +++E L+ Q S +L E + + EE + LE+L
Sbjct: 599 KKKLKELEERLSQLEELLQ-SLELSEAENELEEAEEE----------LESELEKLNLQAE 647
Query: 475 KRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLKEFQTQLD-- 531
+L L++ +E++ + + + + ++E+L++ + +L+
Sbjct: 648 LEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEEL 707
Query: 532 SAELTLHREFQQKEKTIVENAAKQLKQISEK 562
+L + ++ ++ + K++ +
Sbjct: 708 LKKLGEIEQLIEELESRKAELEELKKELEKL 738
Score = 46.3 bits (110), Expect = 1e-04
Identities = 43/217 (19%), Positives = 96/217 (44%), Gaps = 15/217 (6%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVT-----VTLFLERLRSS 472
E++E L L++ +R KLEE+L+ EE+ ++ + + ERL
Sbjct: 314 LEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEEL 373
Query: 473 HSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNN--NVKEKLKEFQTQL 530
+ K + L +L++A E++ LA+ ++ E++E + ++ +L+E + ++
Sbjct: 374 EKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEI 433
Query: 531 DSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAEL 590
E +++ E +L EK L + + E+ E +L+ EL
Sbjct: 434 KKLEEQINQLES------KELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELE-EL 486
Query: 591 TQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQ 627
+ + ++ L+EE ++ E +++E + L+
Sbjct: 487 EEE-LSREKEEAELREEIEELEKELRELEEELIELLE 522
Score = 44.4 bits (105), Expect = 5e-04
Identities = 49/255 (19%), Positives = 97/255 (38%), Gaps = 41/255 (16%)
Query: 401 EPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIV 460
E AL ++ KE++ L L+ + +LE+EL++ + EE+ +
Sbjct: 378 EKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEI---- 433
Query: 461 TVTLFLERLRSSHSKRSKNA----------------------DSLVRLQKAQDEIQTLLA 498
LE + + L+ L + + E
Sbjct: 434 ---KKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEEL 490
Query: 499 KSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTI--VENAAKQL 556
+ ELRE + ++++L+E + +L ++ E+ + +EN ++L
Sbjct: 491 SREKEEAELREEIE-----ELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEEL 545
Query: 557 KQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ---RDIKLKETNHLLKEESKKREL 613
+++ EK +L + + E E K+L EL R +L+E LKE KK +
Sbjct: 546 EELKEKLQLQQL-KEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLK- 603
Query: 614 MAKKVESVMQSQLQQ 628
++ S ++ LQ
Sbjct: 604 ELEERLSQLEELLQS 618
Score = 39.4 bits (92), Expect = 0.014
Identities = 38/212 (17%), Positives = 100/212 (47%), Gaps = 7/212 (3%)
Query: 407 KTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTR--QTCEEVVRKEVSIVTVTL 464
+ L++ + E LE L+ +LEE + + Q EE+ + E + +
Sbjct: 514 EEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKE 573
Query: 465 FLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 524
LE LR +++ + + RL++ + +++ L + + ++ L+ ++ N ++E +
Sbjct: 574 LLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEE 633
Query: 525 EFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENK 584
E +++L+ L ++ + +E ++++++ + ++ + + E E +
Sbjct: 634 ELESELEKLNLQAE--LEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELE 691
Query: 585 KLKAELTQRDIKLKETNHLLKEESKKRELMAK 616
+L+ EL Q +L+E LLK+ + +L+ +
Sbjct: 692 QLEEELEQLREELEE---LLKKLGEIEQLIEE 720
Score = 39.0 bits (91), Expect = 0.018
Identities = 43/211 (20%), Positives = 90/211 (42%), Gaps = 11/211 (5%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
K KIE LE L + L E L++ E K++ + E L
Sbjct: 184 KAKIEELEGQLSELLEDIEDLLEALEE-----ELKELKKLEEIQEEQEEEELEQEIEALE 238
Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
+ L ++ +E++ L + S++ E + + +L ++ L+E + +++ E L
Sbjct: 239 ERLAELEEEKERLEELKARLLEIESLELEALKIREEELRE-LERLLEELEEKIERLE-EL 296
Query: 538 HREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKL 597
RE ++ E E + E + + + + E E + +KL++EL + +
Sbjct: 297 EREIEELE----EELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEK 352
Query: 598 KETNHLLKEESKKRELMAKKVESVMQSQLQQ 628
E LL+E K+ E +++E ++ L++
Sbjct: 353 NELAKLLEERLKELEERLEELEKELEKALER 383
Score = 30.9 bits (70), Expect = 6.1
Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 23/173 (13%)
Query: 84 ELREFYQTQLNNNVKEKL----KEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISE 139
E F L + KE+ + F + L +E ++ K +E QL ++ E
Sbjct: 144 EFDAF----LKSKPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLE 199
Query: 140 NEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLK 199
+ + +E K+LK++ E + + + E E E + L+ +R +L+
Sbjct: 200 D--IEDLLEALEEELKELKKLEE--------IQEEQEEEELEQEIEALE----ERLAELE 245
Query: 200 ETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATRQ 252
E L EE K R L + +E ++ + + + L E L +
Sbjct: 246 EEKERL-EELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELE 297
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 47.0 bits (111), Expect = 7e-05
Identities = 58/240 (24%), Positives = 108/240 (45%), Gaps = 18/240 (7%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
QKA +E + K+ TE +E +LN + A + T + EL Q +
Sbjct: 312 QKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQR 371
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
L N+ D+++ + + +EL E T+ NN + +L+E + L E L E +
Sbjct: 372 LEKNE---DQLKIITMELQKKSSELEEM--TKFKNNKEVELEELKKILAEDE-KLLDEKK 425
Query: 121 QKEKTIVENAAKQLKQISENEFKQKE-----------KTIVENAAKQLKQISEKPGATKL 169
Q EK E K+ + I + ++KE KT E+ K+++ + + KL
Sbjct: 426 QFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKL 485
Query: 170 M-LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQ 228
++ A C+ ENK+L E + ++LK+ + K+ E M K++E++ + ++
Sbjct: 486 KNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMN 545
Score = 40.1 bits (93), Expect = 0.009
Identities = 54/244 (22%), Positives = 104/244 (42%), Gaps = 29/244 (11%)
Query: 412 VTLKISKEKIESLERNLQTETQRSRKLEEELQ------DTRQTCEEVVRKEVSIVTVTLF 465
++L+ S ++LE +LQ T+ +L EE + + + V E T +L
Sbjct: 303 MSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSL- 361
Query: 466 LERLRSSHSKRSKNADSL----VRLQKAQDEIQTLLAKSASIKTELREF----------- 510
E LR+ + KN D L + LQK E++ + + + EL E
Sbjct: 362 EELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLL 421
Query: 511 YQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKE------KTIVENAAKQLKQISEKPG 564
+ + + E+LK + +L +E E KT E+ K+++ + +
Sbjct: 422 DEKKQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELE 481
Query: 565 ATKLM-LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQ 623
KL ++ A C+ ENK+L E + ++LK+ + K+ E M K++E++ +
Sbjct: 482 KEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEE 541
Query: 624 SQLQ 627
++
Sbjct: 542 KEMN 545
Score = 32.8 bits (74), Expect = 1.5
Identities = 46/219 (21%), Positives = 99/219 (45%), Gaps = 19/219 (8%)
Query: 8 QTLLAKSASIKTELR--EFYQTQLNNN-QKAQDEIQTLLAKSASIKTELREFYQTQLNNN 64
+ L + +KTEL + +L + K E + L +++ + EL++ +Q + N
Sbjct: 467 EHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKK-HQEDIINC 525
Query: 65 QKAQD----EIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
+K ++ +I+ L K +++ EL + + + K K +++ ++ + +
Sbjct: 526 KKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKK 585
Query: 121 QKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 180
+K+ I+EN LK+ EN+ K E+ EN A + K +E + Y
Sbjct: 586 EKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAE-----------NKQLNAY 634
Query: 181 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKV 219
E + KL+ EL K +E ++E + +++ +K+
Sbjct: 635 EIKVNKLELELASAKQKFEEIIDNYQKEIEDKKISEEKL 673
Score = 31.2 bits (70), Expect = 4.2
Identities = 45/188 (23%), Positives = 81/188 (43%), Gaps = 23/188 (12%)
Query: 415 KISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVV---RKEVSIVTVTLFLERLRS 471
K K+K + + L + KLE EL +Q EE++ +KE+ ++ E+L
Sbjct: 618 KALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKEIEDKKIS--EEKLLE 675
Query: 472 SHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLD 531
K AD V+LQK EI + L E ++ Q + ++E+
Sbjct: 676 EVEKAKAIADEAVKLQK---EIDKRCQHKIAEMVALMEKHKHQYDKIIEER--------- 723
Query: 532 SAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELT 591
+EL L++ +Q++ + +L I + +L K + E+ + E +KLK E
Sbjct: 724 DSELGLYKNKEQEQSSAKAALEIELSNIKAE------LLSLKKQLEIEKEEKEKLKMEAK 777
Query: 592 QRDIKLKE 599
+ LK+
Sbjct: 778 ENTAILKD 785
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 45.8 bits (109), Expect = 2e-04
Identities = 38/176 (21%), Positives = 74/176 (42%), Gaps = 17/176 (9%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
+IE LER ++ E +R KL EE + ++ E +R E+ V R
Sbjct: 335 LAEIEELEREIEEERKRRDKLTEEYAELKEE-LEDLRAELEEVDKE--FAETRDELKDYR 391
Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLKEFQTQLDSAELT 536
+ +L+K + EI L + ++ EL+ + + N ++ +L+ +
Sbjct: 392 E------KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445
Query: 537 LHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ 592
E +++E + E A L + ++ + D K + + E E KL+ EL +
Sbjct: 446 KALEIKKQEWKL-EQLAADLSKYEQE------LYDLKEEYDRVEKELSKLQRELAE 494
Score = 40.4 bits (95), Expect = 0.007
Identities = 41/240 (17%), Positives = 90/240 (37%), Gaps = 35/240 (14%)
Query: 408 TVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCE--EVVRKEVS------- 458
+ + + E++ER L + + KL EE+ + + E E + +E++
Sbjct: 226 YELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLG 285
Query: 459 ----------IVTVTLFLERLRSSHS-KRSKNADSLVRLQKAQDEIQTLLAKSASIKTEL 507
I + + L S + K + D+ RL K + EI LLA+ ++ E+
Sbjct: 286 EEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREI 345
Query: 508 REFY--QTQLNNNVKEKLKEFQT------QLDSAELTLHREFQQKEKTIV-------ENA 552
E + +L E +E + ++D E + + + E
Sbjct: 346 EEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELK 405
Query: 553 AKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRE 612
+ + E ++ + D A EA+ +L+ E + +++K+ L++ +
Sbjct: 406 RELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465
Score = 39.7 bits (93), Expect = 0.014
Identities = 35/201 (17%), Positives = 91/201 (45%), Gaps = 14/201 (6%)
Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
L+ +E + SLE+ ++ ++LE +++ + + + + ++ + L R+
Sbjct: 739 LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHK-LEEALNDLEARLSHSRIPEIQ 797
Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSA 533
++ SK + + R++ EI+ L + L + Y + ++E+ + + Q+ S
Sbjct: 798 AELSKLEEEVSRIEARLREIEQKLNRL-----TLEKEYLEKEIQELQEQRIDLKEQIKSI 852
Query: 534 ELTLHREFQQKEK--TIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELT 591
E + +KE+ +E L+ + + G D K + + EA+ ++L+ ++
Sbjct: 853 EKEIENLNGKKEELEEELEELEAALRDLESRLG------DLKKERDELEAQLRELERKIE 906
Query: 592 QRDIKLKETNHLLKEESKKRE 612
+ + ++++ L E K E
Sbjct: 907 ELEAQIEKKRKRLSELKAKLE 927
Score = 39.3 bits (92), Expect = 0.018
Identities = 51/273 (18%), Positives = 113/273 (41%), Gaps = 33/273 (12%)
Query: 375 LEAEVKDSAGPMT--SFLESSVTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTET 432
LE E+ + +G MT S +EPA L+ +E++E L+R L +
Sbjct: 644 LEGELFEKSGAMTGGSRAPRGGILFSRSEPA---------ELQRLRERLEGLKRELSSLQ 694
Query: 433 QRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK-RSKNADSLVRLQKAQD 491
R++E L + Q + RK I + +E+L K + + + L +
Sbjct: 695 SELRRIENRLDELSQELSDASRK---IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQ 751
Query: 492 EIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAEL-TLHREFQQKEKTIVE 550
EI+ + ++ ++ + E + L + ++E L + + +L + + + E + E+ +
Sbjct: 752 EIENVKSELKELEARIEE-LEEDL-HKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSR 809
Query: 551 ------------NAAKQLKQISEKPGATKL--MLDFKAKCEVYEAENKKLKAELTQRDIK 596
N K+ EK +D K + + E E + L + + + +
Sbjct: 810 IEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEE 869
Query: 597 LKETNHLLKEESKKRELMAKKVESVMQSQLQQA 629
L+E L++ + + K+ + +++QL++
Sbjct: 870 LEELEAALRDLESRLGDLKKERDE-LEAQLREL 901
Score = 37.7 bits (88), Expect = 0.054
Identities = 46/250 (18%), Positives = 97/250 (38%), Gaps = 38/250 (15%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
Q+ ++ ++ L + +S+++ELR + DE+ L AS K E Q
Sbjct: 677 QRLRERLEGLKRELSSLQSELRRI--------ENRLDELSQEL-SDASRKIGEIEKEIEQ 727
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLH---- 116
L ++ E L + S + E NVK +LKE + +++ E LH
Sbjct: 728 LEQEEEKLKERLEELEEDLSSLEQEIE--------NVKSELKELEARIEELEEDLHKLEE 779
Query: 117 -------REFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKL 169
R + I +K +++S E + +E N K+ EK
Sbjct: 780 ALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQ 839
Query: 170 ML---------DFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVE 220
+ + E + ++L+ EL + + L++ L + K+R+ + ++
Sbjct: 840 EQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL- 898
Query: 221 SVMQSQLQQA 230
++ ++++
Sbjct: 899 RELERKIEEL 908
Score = 33.1 bits (76), Expect = 1.4
Identities = 29/154 (18%), Positives = 65/154 (42%), Gaps = 8/154 (5%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFY--QTQLNNN-QKAQDEIQTLLAKSASIKTELREFY 57
K + EI LLA+ ++ E+ E + +L + ++E++ L A+ + E E
Sbjct: 325 AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETR 384
Query: 58 QTQLNNNQK---AQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLKEFQTQLDSAEL 113
+ +K + EI L + ++ EL+ + + N ++ +L+ +
Sbjct: 385 DELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKE 444
Query: 114 TLHREFQQKEKTIVENAAKQLKQISENEFKQKEK 147
E +++E + E A L + + + KE+
Sbjct: 445 DKALEIKKQEWKL-EQLAADLSKYEQELYDLKEE 477
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 45.8 bits (109), Expect = 2e-04
Identities = 39/222 (17%), Positives = 95/222 (42%), Gaps = 26/222 (11%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVS----IVTVTLFLERLRSSH 473
K ++ LE L R +L EEL++ ++ +E + + + LE LR
Sbjct: 219 KAELRELELALLV--LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEV 276
Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSA 533
S+ + +++ Q E+ L + + ++ + + +E+L + QL+
Sbjct: 277 SELEE------EIEELQKELYALANEISRLEQQKQIL---------RERLANLERQLEEL 321
Query: 534 ELTLHREFQQKE--KTIVENAAKQLKQISEKPGATKLMLD-FKAKCEVYEAENKKLKAEL 590
E L + + + ++L+++ E+ + + L+ +A+ E E+ ++L+ +L
Sbjct: 322 EAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381
Query: 591 TQRDIKLKETNHLLKEESKKRELMAKKVES--VMQSQLQQAI 630
K+ + + + + E + ++E + +LQQ I
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEI 423
Score = 45.4 bits (108), Expect = 2e-04
Identities = 53/229 (23%), Positives = 94/229 (41%), Gaps = 26/229 (11%)
Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSS 472
++ +EKI LE+ L + +LEEEL+ R+ EE+ R+ I + L RL +
Sbjct: 685 KIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQ---ISALRKDLARLEAE 741
Query: 473 HSKRSKNADSL-VRLQKAQDEIQTLLAKSASIKTELREF-----YQTQLNNNVKEKLKEF 526
+ + L L + + EI+ L + + EL E +KE+LK
Sbjct: 742 VEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKAL 801
Query: 527 QTQLDS--AELT-LHREFQQKE-------------KTIVENAAKQLKQISEK-PGATKLM 569
+ LD AELT L+ E + +E+ +Q++++SE +
Sbjct: 802 REALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEI 861
Query: 570 LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKV 618
+ + E E+E + L E + L L+E S++ + K
Sbjct: 862 EELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKR 910
Score = 38.1 bits (89), Expect = 0.035
Identities = 38/242 (15%), Positives = 97/242 (40%), Gaps = 14/242 (5%)
Query: 22 REFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASI 81
E Q +L ++ +E+ L + ELR +++ ++ +E+Q L A+
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELR----LEVSELEEEIEELQKELYALANE 296
Query: 82 KTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKE--KTIVENAAKQLKQISE 139
+ L + Q +E+L + QL+ E L + + + ++L+++ E
Sbjct: 297 ISRLEQQKQIL-----RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351
Query: 140 NEFKQKEKTIVENAAKQLKQISEKPGA-TKLMLDFKAKCEVYEAENKKLKAELTQRDIKL 198
+ + +E +L+++ + + + ++K E + L E+ + + +L
Sbjct: 352 ELESLEAE--LEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL 409
Query: 199 KETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATRQPSEQSL 258
+ + ++ E + KK+E +LQ + + ++ L E L +
Sbjct: 410 ERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
Query: 259 ID 260
++
Sbjct: 470 LE 471
Score = 31.6 bits (72), Expect = 3.5
Identities = 33/180 (18%), Positives = 64/180 (35%), Gaps = 30/180 (16%)
Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQ------DTRQTCEEVVRKEVSIVTVTL-- 464
++ E IESL ++ + +LE EL+ + + ++R E+ ++ L
Sbjct: 846 QIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905
Query: 465 ----FLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAK-SASIKTELREFYQTQLN--- 516
E R R K A +RL+ + I L + S L E +
Sbjct: 906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIED 965
Query: 517 --NNVKEKLKEFQTQLDS------------AELTLHREFQQKEKTIVENAAKQLKQISEK 562
+ +LK + ++ EL +F +K + A + L++ E+
Sbjct: 966 DEEEARRRLKRLENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEE 1025
>gnl|CDD|132519 TIGR03479, DMSO_red_II_alp, DMSO reductase family type II enzyme,
molybdopterin subunit. This model represents the
molybdopterin subunit, typically called the alpha
subunit, of various proteins that also contain an
iron-sulfur subunit and a heme b subunit. The group
includes two distinct but very closely related
periplasmic proteins of anaerobic respiration, selenate
reductase and chlorate reductase. Other members of this
family include dimethyl sulphide dehydrogenase,
ethylbenzene dehydrogenase, and an archaeal respiratory
nitrate reductase. This alpha subunit has a twin-arginine
translocation (TAT) signal for Sec-independent
translocation across the plasma membrane.
Length = 912
Score = 44.0 bits (104), Expect = 6e-04
Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 13/64 (20%)
Query: 1225 DHGASIADAILPGAAYTEK-------QSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRAL 1277
D A AD +LP A + EK ++N R V P G ++ DW+I L
Sbjct: 585 DSTAMYADIVLPAAWHYEKHDLRTTSGHRFINFFDRP------VKPMGESKTDWQIFALL 638
Query: 1278 SEGV 1281
++ +
Sbjct: 639 AKKI 642
>gnl|CDD|130767 TIGR01706, NAPA, periplasmic nitrate reductase, large subunit. This
model represents the large subunit of a family of nitrate
reductases found in proteobacteria which are localized to
the periplasm. This subunit binds molybdopterin and
contains a twin-arginine motif at the N-terminus. The
protein associates with NapB, a soluble heme-containing
protein and NapC, a membrane-bound cytochrome c. The
periplasmic nitrate reductases are not involved in the
assimilation of nitrogen, and are not directly involved
in the formation of electrochemical gradients (i.e.
respiration) either. Rather, the purpose of this enzyme
is either dissimilatory (i.e. to dispose of excess
reductive equivalents) or indirectly respiratory by
virtue of the consumption of electrons derived from NADH
via the proton translocating NADH dehydrogenase. The
enzymes from Alicagenes eutrophus and Paracoccus
pantotrophus have been characterized. In E. coli (as well
as other organisms) this gene is part of a large nitrate
reduction operon (napFDAGHBC) [Energy metabolism,
Aerobic, Energy metabolism, Electron transport, Central
intermediary metabolism, Nitrogen metabolism].
Length = 830
Score = 43.7 bits (103), Expect = 8e-04
Identities = 24/62 (38%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 1214 DCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLARED-WK 1272
D FI+ + A AD ILP A + EK+ Y N E R Q V PG AR D W+
Sbjct: 504 DNFIVVSDAYPTVTALAADLILPSAMWVEKEGAYGNAERRTQVWHQQVLAPGEARSDLWQ 563
Query: 1273 II 1274
++
Sbjct: 564 LV 565
>gnl|CDD|197994 smart00926, Molybdop_Fe4S4, Molybdopterin oxidoreductase Fe4S4
domain. The molybdopterin oxidoreductase Fe4S4 domain
is found in a number of reductase/dehydrogenase
families, which include the periplasmic nitrate
reductase precursor and the formate dehydrogenase alpha
chain.
Length = 55
Score = 38.0 bits (89), Expect = 0.001
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYD 952
VG ++V + G V+RV + +N L KGR +
Sbjct: 12 GVGCGLLVEVKDGRVVRVRGDPDHPVNRGRLCPKGRAGLE 51
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 41.9 bits (98), Expect = 0.003
Identities = 40/216 (18%), Positives = 76/216 (35%), Gaps = 17/216 (7%)
Query: 436 RKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQT 495
RK +E L+ + E + + + + + K + +K + E +
Sbjct: 170 RKKKERLKKLIEETENLAELIIDL---EELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226
Query: 496 LLAKSASIKT-ELREFYQTQLNNNVKEKLKEFQTQLDSAEL-----------TLHREFQQ 543
LL E + Q L + +E Q E+ ++ Q+
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQE 286
Query: 544 KEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHL 603
+E ++ ++LK K K+ + K K E E KKL+ EL + +++E
Sbjct: 287 EELKLLAKEEEELKSELLKLERRKVDDEEKLKES--EKELKKLEKELKKEKEEIEELEKE 344
Query: 604 LKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH 639
LKE KRE ++ E + + Q +
Sbjct: 345 LKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKK 380
Score = 41.1 bits (96), Expect = 0.004
Identities = 46/256 (17%), Positives = 95/256 (37%), Gaps = 18/256 (7%)
Query: 4 QDEIQTLLAKSASIKTELREF-----YQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQ 58
+++ + + E+ E + + K E LA+ EL+
Sbjct: 142 GGKVEIIAMMKPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQEL 201
Query: 59 TQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQL--NNNVKEKLKEFQTQLDSAE---L 113
+KA + Q K EL E L +E++ Q L +
Sbjct: 202 KLKEQAKKALEYYQLKE------KLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIE 255
Query: 114 TLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDF 173
+ +E +++E+ + + + ++ E + +++E ++ ++LK K K+ +
Sbjct: 256 SSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEE 315
Query: 174 KAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHM 233
K K E E KKL+ EL + +++E LKE KRE ++ E + + Q +
Sbjct: 316 KLKES--EKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLE 373
Query: 234 ITSDVTHLHTSKESLA 249
S+ +
Sbjct: 374 EELLAKKKLESERLSS 389
Score = 39.2 bits (91), Expect = 0.018
Identities = 39/215 (18%), Positives = 83/215 (38%), Gaps = 12/215 (5%)
Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
L+ +E + + + + E + + EEEL+ + EE+ + + + + E
Sbjct: 261 LEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 320
Query: 474 SKRSKNADS-LVRLQKAQDEIQTLLAKSASIKTELRE------FYQTQLNNNVKEKLKEF 526
K K + L + ++ +E++ L K IK E E + ++E+L
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKEL-KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAK 379
Query: 527 QTQLDSAELTLHREFQQK--EKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENK 584
+ + + +++ K E AK L ++SE+ L+ + K + E +
Sbjct: 380 KKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQ--EEDLLKEEKKEELKIVEELE 437
Query: 585 KLKAELTQRDIKLKETNHLLKEESKKRELMAKKVE 619
+ + + KE + K +L KK E
Sbjct: 438 ESLETKQGKLTEEKEELEKQALKLLKDKLELKKSE 472
Score = 37.6 bits (87), Expect = 0.048
Identities = 37/268 (13%), Positives = 83/268 (30%), Gaps = 15/268 (5%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
+K++ + + + + + + + K IK EL++ +
Sbjct: 661 EKSELKASLSELTKELLAEQELQEKAESELAKNEILRRQEEIKKKEQRIKEELKKLKLEK 720
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
E Q + E + + ++ +E+ K + + E
Sbjct: 721 EELLADKVQEAQDKIN-------EELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELS-- 771
Query: 121 QKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 180
KEK + E K K E K+++ E + L++ ++ + E
Sbjct: 772 LKEKELAEEEEKTEKL-KVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEK 830
Query: 181 EAE-----NKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMIT 235
E E + + +E L+EE K EL+ + + + + Q+ +
Sbjct: 831 IKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELE 890
Query: 236 SDVTHLHTSKESLATRQPSEQSLIDSTT 263
S K+ L + L +
Sbjct: 891 SKEEKEKEEKKELEEESQKDNLLEEKEN 918
>gnl|CDD|239151 cd02750, MopB_Nitrate-R-NarG-like, Respiratory nitrate reductase A
(NarGHI), alpha chain (NarG) and related proteins. Under
anaerobic conditions in the presence of nitrate, E. coli
synthesizes the cytoplasmic membrane-bound quinol-nitrate
oxidoreductase (NarGHI), which reduces nitrate to nitrite
and forms part of a redox loop generating a proton-motive
force. Found in prokaryotes and some archaea, NarGHI
usually functions as a heterotrimer. The alpha chain
contains the molybdenum cofactor-containing Mo-bisMGD
catalytic subunit. Members of the
MopB_Nitrate-R-NarG-like CD belong to the
molybdopterin_binding (MopB) superfamily of proteins.
Length = 461
Score = 41.2 bits (97), Expect = 0.004
Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 13/61 (21%)
Query: 1225 DHGASIADAILPGAAYTEKQS-------TYVNTEGRAQQTLTAVTPPGLAREDWKIIRAL 1277
D A +D +LP A + EK +++ A V P A+ DW+I +AL
Sbjct: 374 DSTALYSDIVLPAATWYEKHDLSTTDMHPFIHPFSPA------VDPLWEAKSDWEIFKAL 427
Query: 1278 S 1278
+
Sbjct: 428 A 428
>gnl|CDD|237416 PRK13532, PRK13532, nitrate reductase catalytic subunit; Provisional.
Length = 830
Score = 41.0 bits (97), Expect = 0.004
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 1228 ASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLARED-WKII 1274
A AD ILP A + EK+ Y N E R Q V PG A+ D W+++
Sbjct: 518 ALAADLILPTAMWVEKEGAYGNAERRTQFWRQQVKAPGEAKSDLWQLV 565
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 40.8 bits (95), Expect = 0.007
Identities = 104/646 (16%), Positives = 244/646 (37%), Gaps = 88/646 (13%)
Query: 7 IQTLLAKSASIKTELREFYQTQLNNNQKAQ--DEIQTLLAKSASIKTELREFYQTQLNNN 64
+TL + S++ E + + +Q+ + + T + + + + +
Sbjct: 496 TETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIK 555
Query: 65 QKAQDEIQTLLAKSASIKTELREFYQTQLN--NNVKEKLKEFQTQLDSAELTLHREFQQK 122
+ DE+ +LL + K +L ++ ++ N +++L + +L S E +
Sbjct: 556 SRHSDELTSLLGYFPN-KKQLEDWLHSKSKEINQTRDRLAKLNKELASLE-QNKNHINNE 613
Query: 123 EKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEK----PGATKLM-------- 170
++ E + ++ + Q E++ +E +++++ S++ GAT +
Sbjct: 614 LESKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLT 673
Query: 171 -----------LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKV 219
F+ + E+ E L+++L KLK T LK++ K+R+ M
Sbjct: 674 DENQSCCPVCQRVFQTEAELQEF-ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLA 732
Query: 220 E---SVMQ------SQLQQAIHMITSDVTHLHTSKES----LATRQPSEQSLIDSTTDTN 266
S++ +L+ + + D+ L E L T P E+S TD
Sbjct: 733 PGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVT 792
Query: 267 CPVTQQYVPSSNQQYAPSSNQQYSIPTSNINQQYSMFSADSTHMNIVERDIPNSADDMRK 326
+ +++ + Q + S++++ + + + + + RK
Sbjct: 793 --IMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRK 850
Query: 327 YIEMQYSMFSADSTHMNIVERD---IPNSADDMRKYIEMLLEKPPGNPVSDLEAEVKDSA 383
I+ Q + N ++ + I + +++ E L+E V L E+KD+
Sbjct: 851 LIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVEL--STEVQSLIREIKDAK 908
Query: 384 G---PMTSFLESSVTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEE 440
P+ +FLE E + +K + K +++K+ ++ ++ + +E
Sbjct: 909 EQDSPLETFLEKDQQE---KEELISSKETSN---KKAQDKVNDIKEKVKNIHGYMKDIEN 962
Query: 441 ELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKS 500
++QD + + + +KE + TV LE K +++ ++ + +I T +
Sbjct: 963 KIQDGKD--DYLKQKETELNTVNAQLEECEKHQEKINED------MRLMRQDIDTQKIQE 1014
Query: 501 ASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELT----LHREFQQKEKTIVENAAKQL 556
++ L + V+E+LK+ ++ ++ H++ ++ I N L
Sbjct: 1015 RWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLAL 1074
Query: 557 KQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNH 602
+ YE E K K EL + + E +
Sbjct: 1075 GRQKG-----------------YEKEIKHFKKELREPQFRDAEEKY 1103
Score = 36.9 bits (85), Expect = 0.081
Identities = 113/607 (18%), Positives = 214/607 (35%), Gaps = 89/607 (14%)
Query: 53 LREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEK--LKEFQTQLDS 110
LR+ QTQ Q+ Q E++ L + E+R+ ++ + +K ++ +LD
Sbjct: 191 LRQVRQTQGQKVQEHQMELK-YLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELD- 248
Query: 111 AELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKP-GATKL 169
+ + K I N +K +K +E + + K +E +L+ EK T
Sbjct: 249 -------PLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDE 301
Query: 170 MLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVE-SVMQSQLQ 228
L+ ++Y + ++ + + +E L KE +R L +K E V Q +LQ
Sbjct: 302 QLN-----DLYHNHQRTVREKERELVDCQRELEKLNKE---RRLLNQEKTELLVEQGRLQ 353
Query: 229 QAIHMITSDVTHLHTSKESLATRQPSEQSLIDSTTDTNCPVTQQYVPSSNQQYAPSSNQQ 288
+ + +SLATR + ++ V + A ++ Q
Sbjct: 354 LQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQL 413
Query: 289 YSIPTSNINQQYSMFSADSTHMNIVERDIPNSADDMRKYIE----MQYSMFSADSTHMNI 344
+ S + + R I + + K E + + + + I
Sbjct: 414 CADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRI 473
Query: 345 VERDIPNSADDMRKYIEMLLEKPPGNPVSDLEAEVKDSAGPMTSFLESSVTSDIFNEPAL 404
+E D ++RK L + + L+ EVK D E
Sbjct: 474 LELD-----QELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLN 528
Query: 405 YTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTL 464
+ T R+ ++K+K++ E+ + +++ S +L L + L
Sbjct: 529 HHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNK---------KQLEDWL 579
Query: 465 FLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 524
HSK + + RL K E+ +L Q N++ +L+
Sbjct: 580 --------HSKSKEINQTRDRLAKLNKELASL----------------EQNKNHINNELE 615
Query: 525 EFQTQLDSAELTLH-----REFQQKEKTIVENAAKQLKQISEKPGATKLM---------- 569
+ QL S E L ++ + + + E K KQ + GAT +
Sbjct: 616 SKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDE 675
Query: 570 ---------LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVES 620
F+ + E+ E L+++L KLK T LK++ K+R+ M
Sbjct: 676 NQSCCPVCQRVFQTEAELQEF-ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAP- 733
Query: 621 VMQSQLQ 627
QS +
Sbjct: 734 GRQSIID 740
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 40.4 bits (95), Expect = 0.007
Identities = 37/208 (17%), Positives = 73/208 (35%), Gaps = 20/208 (9%)
Query: 14 SASIKTELREFYQTQLNNNQKA-QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQ 72
+ K EL + + + ++ I + EL T+ N + + +++
Sbjct: 265 TEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKEN-SEFKLDVE 323
Query: 73 TLLAKSASIKTELREFYQTQLNN------------NVKEKLKEFQTQLDSAE--LTLHRE 118
L A +++ L E +L ++ + ++ +D+ + H E
Sbjct: 324 ELKALLEALEEIL-EKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNE 382
Query: 119 FQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCE 178
K A K+L E K+ + K I+ K +A+ +
Sbjct: 383 KIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIK---QLEAEIK 439
Query: 179 VYEAENKKLKAELTQRDIKLKETNHLLK 206
E E K+L+ +LT + E N LLK
Sbjct: 440 ALEKEIKELEKQLTNIEPTADEINKLLK 467
Score = 37.3 bits (87), Expect = 0.063
Identities = 40/235 (17%), Positives = 84/235 (35%), Gaps = 37/235 (15%)
Query: 415 KISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHS 474
I++E+ LE + E Q +L E+L++ E + K + + L E+ S
Sbjct: 263 TITEERKAELEAHFDEEYQ---ELIEQLEELIDKYESHIEKALEELESILDTEKENSEFK 319
Query: 475 KRSKNADSLVR-LQKAQDEIQTLLA---KSASIKTELREFYQTQLNNNVKEKLKEFQTQL 530
+ +L+ L++ ++ L K S EL ++ + ++ +
Sbjct: 320 LDVEELKALLEALEEILEKNLQKLEEKLKDPSTSIEL---------ESITDLIESINDII 370
Query: 531 DSAE--LTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKK--- 585
D+ + H E K A K+L +L D A + + K
Sbjct: 371 DAINELIREHNEKIDNLKKEKNKAKKKLWLH----LVAELKEDIDAYQKEKKGLEKAINS 426
Query: 586 LKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHL 640
L+ E+ Q + ++K +KE +++L + ++ I +
Sbjct: 427 LEKEIKQLEAEIKALEKEIKE--LEKQL----------TNIEPTADEINKLLKAY 469
Score = 35.4 bits (82), Expect = 0.25
Identities = 43/225 (19%), Positives = 95/225 (42%), Gaps = 21/225 (9%)
Query: 2 KAQDEIQTLLAKSASIKTELREFYQ---TQLNNNQKAQDEIQTLLAKSASIKT--ELREF 56
K D+ LAK + L E + + N +K E+++++ ++S+ + EL+
Sbjct: 131 KFLDKAWKKLAKK--YDSNLSEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEELKAK 188
Query: 57 YQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAE---- 112
+T ++N+ + + I E E + + + + E L +++
Sbjct: 189 IKTLFSSNKPELALLTLSVIDFDEI--EQAEILEKSIIGSSDVPISELINNLGNSDWVKE 246
Query: 113 -LTLHRE-----FQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGA 166
L H E F Q+ TI E +L+ + E+++ + + E K I +
Sbjct: 247 GLEYHEEGDTCPFCQQ--TITEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEE 304
Query: 167 TKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKK 211
+ +LD + + ++ + ++LKA L + L++ L+E+ K
Sbjct: 305 LESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEKLKD 349
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 40.3 bits (95), Expect = 0.007
Identities = 32/155 (20%), Positives = 60/155 (38%), Gaps = 18/155 (11%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
Q AQ+++ ++ S++T+ E Q L N + +I+ LL LR +
Sbjct: 138 QNAQNDLAEYNSQLVSLQTQP-ERAQAALYANSQRLQQIRNLLKGGKVGGKALRPSQRVL 196
Query: 61 LNNNQK---AQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHR 117
L Q AQ+++Q + T+L++ Q Q + L +L+ L
Sbjct: 197 LQAEQALLNAQNDLQRKSLEGN---TQLQDLLQKQR-----DYLTARIQRLEHQLQLLQE 248
Query: 118 EFQQKEKTIVENAAKQLK------QISENEFKQKE 146
K T+ E ++ + +I N +E
Sbjct: 249 AINSKRLTLSEKTVQEAQSQDEAARIQANPLVAQE 283
Score = 36.0 bits (84), Expect = 0.19
Identities = 46/216 (21%), Positives = 83/216 (38%), Gaps = 54/216 (25%)
Query: 417 SKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKR 476
KE+ E L++ L + R+ + EL+ + +E R+ +S TL L +L S ++
Sbjct: 78 QKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETLS----TLSLRQLESRLAQT 133
Query: 477 SKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELT 536
L +LQ AQ+++ Y +QL + QTQ + A
Sbjct: 134 ------LDQLQNAQNDLAE---------------YNSQLVSL--------QTQPERA--- 161
Query: 537 LHREFQQKEKTIVENAAKQLKQI-----SEKPGATKLMLDFKAKCEVYEAENKKLKAELT 591
Q +++L+QI K G L + +AE L A+
Sbjct: 162 ------QAALY---ANSQRLQQIRNLLKGGKVGGKALRPSQRVL---LQAEQALLNAQND 209
Query: 592 QRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQ 627
+ L+ L K+R+ + ++ ++ QLQ
Sbjct: 210 LQRKSLEGNTQLQDLLQKQRDYLTARI-QRLEHQLQ 244
>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
This family represents the C-terminal region of
merozoite surface protein 1 (MSP1) which are found in a
number of Plasmodium species. MSP-1 is a 200-kDa protein
expressed on the surface of the P. vivax merozoite.
MSP-1 of Plasmodium species is synthesised as a
high-molecular-weight precursor and then processed into
several fragments. At the time of red cell invasion by
the merozoite, only the 19-kDa C-terminal fragment
(MSP-119), which contains two epidermal growth
factor-like domains, remains on the surface. Antibodies
against MSP-119 inhibit merozoite entry into red cells,
and immunisation with MSP-119 protects monkeys from
challenging infections. Hence, MSP-119 is considered a
promising vaccine candidate.
Length = 574
Score = 40.3 bits (94), Expect = 0.008
Identities = 39/184 (21%), Positives = 70/184 (38%), Gaps = 44/184 (23%)
Query: 51 TELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLD- 109
EL +FY++ L D+ + + T L E L + + Q D
Sbjct: 17 FELEKFYESYLQQIDGYNDDFIQFIKSKKELITALTETKVNALYLEIAHLKELLQHSYDR 76
Query: 110 --SAELTLHREFQQKEKTIVENAAKQLKQISE--------------------------NE 141
+L L R + +KE+ + + Q+K+++ N+
Sbjct: 77 YYKYKLKLERLYNKKEQ--IGQSKMQIKKLTLLKERLEKRKNSLNNPFYVLSNFSNFFNK 134
Query: 142 FKQKEKTIVENAAKQLKQISEKPGATKLMLD-FKAKCEVYEAENKKLKAELTQRDIKLKE 200
++ EK VEN K T ++L +KA+ + Y E LK L++ I+ +E
Sbjct: 135 KREAEKQEVENTLKN----------TDILLKYYKARVKYYTGEAVPLKT-LSEVSIQ-RE 182
Query: 201 TNHL 204
N+L
Sbjct: 183 DNYL 186
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 39.5 bits (92), Expect = 0.012
Identities = 86/430 (20%), Positives = 156/430 (36%), Gaps = 61/430 (14%)
Query: 204 LLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATRQPSEQSLIDSTT 263
L KEE+ + K+ L ++ + + S + Q E +
Sbjct: 201 LPKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGK 260
Query: 264 DTNCPVTQQYVPSSNQQYAPSSNQQYSIPTSNINQQY---SMFSADSTHMNIVERDIPNS 320
D + Q S S + + T + + + S S+ ST+ + +S
Sbjct: 261 DHHHSHNHQ-HHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSS 319
Query: 321 ADDMRKYIEMQYSMFSADSTHMNIVERDIPNSADDMRKYIEMLLEKPPGNPVSDLEAEVK 380
A S S + N ++ K P + VSD E++ K
Sbjct: 320 AAAG-----------SIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQK 368
Query: 381 DSAGPMTSFLESSVTSDIF--NEPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKL 438
++ + +S + N ++ + E++E + LQ E Q++R+
Sbjct: 369 RASKSSSGARDSKKDASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQARQN 428
Query: 439 EEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLA 498
E EL R ++S++T S + K+ L +L+K D +QT L
Sbjct: 429 ESEL-----------RNQISLLT----------SLERSLKS--DLGQLKKENDMLQTKLN 465
Query: 499 KSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQ--- 555
S K + ++ Q+ ++++LK +AE L E +K K E A +
Sbjct: 466 SMVSAKQKDKQSMQS-----MEKRLKSEADSRVNAEKQLAEE--KKRKKEEEETAARAAA 518
Query: 556 LKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMA 615
S + A L K + E E KKL + D+KLKE + E K+ + +
Sbjct: 519 QAAASREECAESL----KQAKQDLEMEIKKL-----EHDLKLKEEECRMLE--KEAQELR 567
Query: 616 KKVESVMQSQ 625
K ES +++
Sbjct: 568 KYQESEKETE 577
Score = 33.7 bits (77), Expect = 0.83
Identities = 55/222 (24%), Positives = 92/222 (41%), Gaps = 31/222 (13%)
Query: 30 NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTEL---- 85
N I+ L ++ EL Q N + +++I L + S+K++L
Sbjct: 399 ENKISTPSAIERLEQDIKKLQAEL----QQARQNESELRNQISLLTSLERSLKSDLGQLK 454
Query: 86 --REFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFK 143
+ QT+LN+ V K K+ Q+ + S E L E + E KQL + E + K
Sbjct: 455 KENDMLQTKLNSMVSAKQKDKQS-MQSMEKRLKSE--ADSRVNAE---KQLAE--EKKRK 506
Query: 144 QKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNH 203
++E+ AA Q E+ + K + E E KKL + D+KLKE
Sbjct: 507 KEEEETAARAAAQAAASREECAES-----LKQAKQDLEMEIKKL-----EHDLKLKEEEC 556
Query: 204 LLKEESKKRELMAKKVESVMQSQ-LQQAIHMITSDVTHLHTS 244
+ E K+ + + K ES +++ L A+ + L S
Sbjct: 557 RMLE--KEAQELRKYQESEKETEVLMSALQAMQDKNLMLENS 596
>gnl|CDD|131740 TIGR02693, arsenite_ox_L, arsenite oxidase, large subunit. This
model represents the large subunit of an arsenite oxidase
complex. The small subunit is a Rieske protein. Homologs
to both large and small subunits that score in the gray
zone between the set trusted and noise bit score cutoffs
for the respective models are found in Aeropyrum pernix
K1 and in Sulfolobus tokodaii str. 7. This enzyme acts in
energy metabolim by arsenite oxidation, rather than
detoxification by reduction of arsenate to arsenite prior
to export [Energy metabolism, Electron transport].
Length = 806
Score = 39.5 bits (92), Expect = 0.012
Identities = 33/132 (25%), Positives = 46/132 (34%), Gaps = 25/132 (18%)
Query: 949 FAYDGLKRQRLLTPFVRNCDGQLVATEWEDALIAVAQKLQTSEVAGVVGSLADAEAMVAL 1008
++ D + RL P +R D Q AT W+DAL +A + + + V
Sbjct: 106 WSLDRGTQDRLTYPLLRVGD-QFQATSWDDALTLMALL-----TKKIRDRDGNDDIAVKC 159
Query: 1009 KD-------LLNKLGSEDLYTEY------------AFPLEGAGTDLRANYLLNNKIAGAE 1049
D N G+ L+ A+ E GT LNN A
Sbjct: 160 FDHGGAGGGFENTWGAGKLFFAALSVKHIRIHNRPAYNSEVHGTREMGVGELNNTYEDAR 219
Query: 1050 EADLILLIGTNP 1061
AD I+L G N
Sbjct: 220 LADTIVLWGANS 231
Score = 31.0 bits (70), Expect = 5.1
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 8/80 (10%)
Query: 1231 ADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSEGVSSNLSSE-- 1288
A ILP A + E T +N E R + + PPG A+ D I ++ ++ +E
Sbjct: 523 AHLILPAAGWGEMNLTSMNGERRMRLYEKFMDPPGEAKPDCLIAAWVANTIAELYRAEGK 582
Query: 1289 PVDVKQ------KNLEDYYM 1302
+ K+ K ED ++
Sbjct: 583 FEEAKKFEGFDWKTEEDAFL 602
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 39.7 bits (92), Expect = 0.013
Identities = 69/342 (20%), Positives = 144/342 (42%), Gaps = 30/342 (8%)
Query: 20 ELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSA 79
E ++ + ++ K +E + + A+ A K E + +L ++ + +++ L K A
Sbjct: 1585 EAKKAEEARIEEVMKLYEEEKKMKAEEAK-KAEEAKIKAEELKKAEEEKKKVEQLKKKEA 1643
Query: 80 SIKTELREFYQTQLNNNVK---EKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQ 136
K + E + + N +K E K + + + E E ++K ++ A++ K+
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Query: 137 ISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDI 196
E + K+ E+ A++LK+ E+ KA+ EAE K KAE ++D
Sbjct: 1704 AEELKKKEAEEK---KKAEELKKAEEEN-------KIKAEEAKKEAEEDKKKAEEAKKDE 1753
Query: 197 KLK-ETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH----LHTSKESLATR 251
+ K + HL KEE KK E + K+ E+V++ +L + +V + + ++
Sbjct: 1754 EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEG 1813
Query: 252 QPSEQSLIDSTTDTNCPVTQQYVPSSNQQYAPSSN-QQYSIPTSNINQQYSMFSADSTHM 310
+I+ + + ++ S N Q + +++ +N N + AD
Sbjct: 1814 GKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFN-- 1871
Query: 311 NIVERDIPNSADDMRKYIEMQYSMFSADSTHMNIVERDIPNS 352
++ DD + E + + +ER+IPN+
Sbjct: 1872 ----KEKDLKEDDEEEIEEADEI----EKIDKDDIEREIPNN 1905
Score = 35.9 bits (82), Expect = 0.21
Identities = 52/228 (22%), Positives = 102/228 (44%), Gaps = 20/228 (8%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
K+K E ++ + + RK EE + EEV++ + E + + +
Sbjct: 1563 KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA--EEAKKAEEAKI 1620
Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVK---------------EK 522
K A+ L + ++ + +++ L K A K + E + + N +K E+
Sbjct: 1621 K-AEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679
Query: 523 LKEFQTQLDSAELTLHREFQQKEKT-IVENAAKQLKQISEKPGATKLMLDFKAKCEVYEA 581
K+ + A L +E ++ +K ++ + K+ +E+ + KA+ EA
Sbjct: 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEA 1739
Query: 582 ENKKLKAELTQRDIKLK-ETNHLLKEESKKRELMAKKVESVMQSQLQQ 628
E K KAE ++D + K + HL KEE KK E + K+ E+V++ +L +
Sbjct: 1740 EEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDE 1787
Score = 35.1 bits (80), Expect = 0.33
Identities = 24/111 (21%), Positives = 48/111 (43%), Gaps = 1/111 (0%)
Query: 117 REFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLD-FKA 175
++ ++K+K A++ K+ E + K +E + A K+ ++ +K A K + K
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343
Query: 176 KCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQ 226
E +AE + E + K + +E KK + KK E ++
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394
Score = 31.6 bits (71), Expect = 3.9
Identities = 41/226 (18%), Positives = 91/226 (40%), Gaps = 5/226 (2%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
K E A++A+ + E E ++ + K A K + E +
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKA 1400
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
+ +KA + L K+A+ K + E + ++ K+ + A+ + +
Sbjct: 1401 EEDKKKADE-----LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455
Query: 121 QKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 180
K+ + A++ K+ E + K +E + A K+ ++ +K K + K K +
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515
Query: 181 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQ 226
+ + KA+ ++ + K+ + K E KK+ KK E + +++
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAE 1561
Score = 30.9 bits (69), Expect = 6.7
Identities = 42/192 (21%), Positives = 79/192 (41%), Gaps = 14/192 (7%)
Query: 36 QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN 95
+D + A+ E+R+F + ++ + + Q I+ A+ A EL++ + + +
Sbjct: 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKAD---ELKKAEEKKKAD 1293
Query: 96 NVK--EKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKE--KTIVE 151
K E+ K+ AE + +K+ + A K+ +E K E K E
Sbjct: 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
Query: 152 NAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLK---EE 208
AA + + EK A + K + +A+ K KAE ++ + K+ K +E
Sbjct: 1354 AAADEAEAAEEKAEAAEK----KKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409
Query: 209 SKKRELMAKKVE 220
KK KK +
Sbjct: 1410 LKKAAAAKKKAD 1421
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 39.3 bits (92), Expect = 0.015
Identities = 45/211 (21%), Positives = 93/211 (44%), Gaps = 30/211 (14%)
Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSS 472
L+ KE+IE LE+ L++ RKLEE++++ + EE+ ++ + L+ L+
Sbjct: 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELK-- 289
Query: 473 HSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDS 532
+ A+ ++L + +E + E R + N ++E++KE +
Sbjct: 290 -----EKAEEYIKLSEFYEEY-----LDELREIEKRLSRLEEEINGIEERIKELEE--KE 337
Query: 533 AELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ 592
L E ++K K + K++ E +L + KAK + E ++LK LT
Sbjct: 338 ERL---EELKKKLKEL-------EKRLEELEERHELYEEAKAK----KEELERLKKRLTG 383
Query: 593 RDIKLKETNHLLKEESKKRELMAKKVESVMQ 623
++ L+E K +E + +++ +
Sbjct: 384 LTP--EKLEKELEELEKAKEEIEEEISKITA 412
Score = 36.2 bits (84), Expect = 0.14
Identities = 45/205 (21%), Positives = 94/205 (45%), Gaps = 19/205 (9%)
Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSS 472
+LK EK+E L++ L ++ +LEEEL + + EE+ + S+ + L+ L
Sbjct: 543 SLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFE--SVEELEERLKELEPF 600
Query: 473 HSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELRE-FYQTQLNNNVKEKLKEFQTQLD 531
+++ + L+ A+ E++ + ++ EL + F + E+L++ +L+
Sbjct: 601 YNE-------YLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRK---ELE 650
Query: 532 SAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELT 591
E +++ ++E + + L+ E G + + + + E + +KLK EL
Sbjct: 651 ELE----KKYSEEEYEELRE--EYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELE 704
Query: 592 QRDIKLKETNHLLKEESKKRELMAK 616
+R+ KE L K + EL K
Sbjct: 705 EREKAKKELEKLEKALERVEELREK 729
Score = 33.9 bits (78), Expect = 0.83
Identities = 47/224 (20%), Positives = 99/224 (44%), Gaps = 22/224 (9%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
+K +E + L K +K E++ + +L ++ + ++ L K ++ EL E +
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLKEFQTQLDSAELTLHREF 119
++ +E++ L EL FY L + +++L+ + +L E L + F
Sbjct: 580 EELGFESVEELEERLK-------ELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAF 632
Query: 120 Q--QKEKTIVENAAKQLKQI----SENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDF 173
+ + + +E K+L+++ SE E+++ + + L+ E G + +
Sbjct: 633 EELAETEKRLEELRKELEELEKKYSEEEYEELRE-------EYLELSRELAGLRAELEEL 685
Query: 174 KAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAK 217
+ + E + +KLK EL +R+ KE L K + EL K
Sbjct: 686 EKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREK 729
Score = 31.6 bits (72), Expect = 4.0
Identities = 44/225 (19%), Positives = 98/225 (43%), Gaps = 24/225 (10%)
Query: 407 KTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFL 466
K + K + E++E + ++ KL+ E+ + +E+ + E +
Sbjct: 506 KELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEI---KSLKKELEKLEELKKKLAELE 562
Query: 467 ERLRSSHSKRSKNADSLVRLQ-KAQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLK 524
++L + ++ L L ++ +E++ L EL FY L + +++L+
Sbjct: 563 KKLDELEEELAELLKELEELGFESVEELEERLK-------ELEPFYNEYLELKDAEKELE 615
Query: 525 EFQTQLDSAELTLHREFQ--QKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY--- 579
+ +L E L + F+ + + +E K+L+++ +K +++ E Y
Sbjct: 616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEE----EYEELREEYLEL 671
Query: 580 EAENKKLKAELTQRDIKLKETNHL---LKEESKKRELMAKKVESV 621
E L+AEL + + + +E LKEE ++RE K++E +
Sbjct: 672 SRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKL 716
Score = 30.4 bits (69), Expect = 9.3
Identities = 37/165 (22%), Positives = 73/165 (44%), Gaps = 16/165 (9%)
Query: 20 ELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSA 79
EL FY + + A+ E++ + ++ EL KA +E+ +
Sbjct: 596 ELEPFYN-EYLELKDAEKELEREEKELKKLEEEL-----------DKAFEELAETEKRLE 643
Query: 80 SIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEK---TIVENAAKQLKQ 136
++ EL E + + + E+L+E +L L E ++ EK I + K ++
Sbjct: 644 ELRKELEEL-EKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEE 702
Query: 137 ISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYE 181
+ E E +KE +E A ++++++ EK K +L +A +V E
Sbjct: 703 LEEREKAKKELEKLEKALERVEELREKVKKYKALLKERALSKVGE 747
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 38.5 bits (90), Expect = 0.022
Identities = 32/225 (14%), Positives = 79/225 (35%), Gaps = 33/225 (14%)
Query: 406 TKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLF 465
SV ++K + Q + R L EE+ + V
Sbjct: 58 RDPQISVVSSVTKLSKGPKDEPKQVKIPRQSGLSEEVAEAEAKATSV------------- 104
Query: 466 LERLRSSHSKRSKNADSLV-RLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 524
++ + K+ LV L++ +E+ A ++ + F +
Sbjct: 105 -----AAEATTPKSIQELVEALEELLEELLKETASDPVVQELVSIFNDLIDSIKEDNLKD 159
Query: 525 EFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENK 584
+ ++ + SA+ L + +K + ++L++ ++ + E+ +
Sbjct: 160 DLESLIASAKEELD-QLSKKLAELKAEEEEELERALKE-----------KREELLSKLEE 207
Query: 585 KLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 629
+L A L + + L E +++E + KK E ++ +L++
Sbjct: 208 ELLARLESK--EAALEKQLRLEFEREKEELRKKYEEKLRQELERQ 250
Score = 35.4 bits (82), Expect = 0.23
Identities = 22/134 (16%), Positives = 62/134 (46%), Gaps = 4/134 (2%)
Query: 31 NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSAS-IKTELREFY 89
+D++++L+A K EL + + + ++E++ L + + ++L E
Sbjct: 153 KEDNLKDDLESLIAS---AKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEEL 209
Query: 90 QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
+L + K+ + + + + L +++++K + +E A+ +Q +NE + +
Sbjct: 210 LARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIEL 269
Query: 150 VENAAKQLKQISEK 163
K++K+ E+
Sbjct: 270 QREFNKEIKEKVEE 283
Score = 30.8 bits (70), Expect = 5.5
Identities = 29/163 (17%), Positives = 62/163 (38%), Gaps = 16/163 (9%)
Query: 405 YTKTVRSVTLKISKE---KIESLERNLQTETQRSR---------KLEEELQDTRQTCEEV 452
+ + + ++ K +LE+ L+ E +R + KL +EL+ + E+
Sbjct: 198 REELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQK 257
Query: 453 VRKEVSIVTVTLFLERLRSSHSKRSKNADS-LVRLQKAQDEIQTLLAKSASIKTELREFY 511
++ E+++ + L E + K + + L +L + ++ L S E +
Sbjct: 258 LKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENH 317
Query: 512 QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAK 554
+ Q E LK + L S R ++ + E A
Sbjct: 318 KVQQLWLAVEALK---SALKSGSAGSPRPLVKELDALKELAKD 357
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 38.9 bits (91), Expect = 0.023
Identities = 34/151 (22%), Positives = 60/151 (39%), Gaps = 24/151 (15%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
+E+++ E LQ+ + ++ EE+L EE R E + ++ +
Sbjct: 606 RERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAE---------------AEARTA 650
Query: 478 -KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLN------NNVKEKLKEFQTQL 530
K A + LQ+ Q+E Q+L K E ++ +TQL + E+ + F L
Sbjct: 651 LKQAR--LDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEAL 708
Query: 531 DSAELTLHREFQQKEKTIVENAAKQLKQISE 561
L E K + + QL Q+S
Sbjct: 709 KDDFRELRTERLAKWQVVEGELDNQLAQLSA 739
Score = 34.7 bits (80), Expect = 0.48
Identities = 32/164 (19%), Positives = 66/164 (40%), Gaps = 18/164 (10%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
Q+A++ +Q+ +AK + +L Q N + Q + A + +
Sbjct: 610 QQAEEALQSAVAKQKQAEEQL-----VQANAELEEQKRAEAE----ARTALKQARLDLQR 660
Query: 61 LNNNQKA-QDEIQTLLAKSA-SIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHRE 118
L N Q++ +D+++ +A+ +T+LR+ + E+ + F L L E
Sbjct: 661 LQNEQQSLKDKLELAIAERKQQAETQLRQLDAQL--KQLLEQQQAFLEALKDDFRELRTE 718
Query: 119 FQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISE 162
K + + QL Q+S E + A +LK++ +
Sbjct: 719 RLAKWQVVEGELDNQLAQLS----AAIEAARTQAKA-RLKELKK 757
Score = 30.8 bits (70), Expect = 5.8
Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 13/128 (10%)
Query: 419 EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSK 478
R +L++ L T EE+ + E++ + E + + +
Sbjct: 435 AGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQ 494
Query: 479 NADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLH 538
L +L+K +DE L + + L + +++ L E + QL +L
Sbjct: 495 LQSELRQLRKRRDEALEALQR---AERRLLQ---------LRQALDELELQLSPQAGSLL 542
Query: 539 REFQQKEK 546
F + E
Sbjct: 543 -HFLRNEA 549
Score = 30.4 bits (69), Expect = 7.8
Identities = 33/154 (21%), Positives = 58/154 (37%), Gaps = 29/154 (18%)
Query: 484 VRLQKAQDEIQTLLAKSASIKTELREFYQTQLN---NNVKEKLKEFQTQLDSAELTLHRE 540
RLQ+A++ +Q+ +AK + +L Q N K E +T L A L L R
Sbjct: 607 ERLQQAEEALQSAVAKQKQAEEQL-----VQANAELEEQKRAEAEARTALKQARLDLQRL 661
Query: 541 FQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKET 600
+++ +L K A E + ++L A+L Q + +
Sbjct: 662 QNEQQS---LKDKLELAIAERKQQA--------------ETQLRQLDAQLKQLLEQQQAF 704
Query: 601 NHLLKEESKKRELMAKK----VESVMQSQLQQAI 630
LK++ ++ VE + +QL Q
Sbjct: 705 LEALKDDFRELRTERLAKWQVVEGELDNQLAQLS 738
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 37.9 bits (89), Expect = 0.035
Identities = 50/242 (20%), Positives = 92/242 (38%), Gaps = 60/242 (24%)
Query: 31 NNQKAQDEIQTL--------LAKSASIKT--ELREFYQTQLNNNQKAQDEIQTLLAKSAS 80
N++ ++ I TL AK K +L +F + N++ E+
Sbjct: 279 ENEEIEERIDTLYDILEKEVKAKKFVEKNIDKLTDFLEHAREQNKQLLLELD-------- 330
Query: 81 IKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAA--------- 131
L++ Y LN + E ++E + QL+ E ++ Q + I E
Sbjct: 331 ---RLQQSYT--LNEDELETVRELEKQLEELE----AQYDQLVERIAEKKVAYSELQEEL 381
Query: 132 ----KQLKQISEN--EFKQKEKTI--VENAAKQ-LKQISEKPGATKLML----------D 172
KQL++I + E + + + E A++ L++ +K K L D
Sbjct: 382 EEIEKQLEEIEKEQEELSESLQGLRKDELEAREKLQEYRQKLHEIKRYLEKSNLPGLPED 441
Query: 173 FKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQS-----QL 227
+ V E + L EL + I + N L+E + E + +K E ++ + QL
Sbjct: 442 YLEYFFVVSDEIEALADELNEVPINMDAVNRQLEEATDDVETLKEKTEELVDNATLAEQL 501
Query: 228 QQ 229
Q
Sbjct: 502 IQ 503
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 37.7 bits (87), Expect = 0.040
Identities = 42/172 (24%), Positives = 71/172 (41%), Gaps = 23/172 (13%)
Query: 101 LKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQI 160
LKE ++Q D+ +E K++ + A ++ +N KQ+++ +Q +
Sbjct: 203 LKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEV---RQKQQEAKN 259
Query: 161 SEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIK------LKETNHL---LKEESKK 211
KP T K + AEN+K + E Q +IK LK +H LK+ESK
Sbjct: 260 LPKPADTS-----SPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKA 314
Query: 212 RELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATRQPSEQSLIDSTT 263
E A+ E Q + + + D+ E+ T + IDS+
Sbjct: 315 SEKEAEDKELEAQKKREP----VAEDLQKTKPQVEAQPTSL--NEDAIDSSN 360
Score = 34.2 bits (78), Expect = 0.56
Identities = 37/206 (17%), Positives = 77/206 (37%), Gaps = 26/206 (12%)
Query: 466 LERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKE 525
+E LR + K + L++ + + A+ + + ++ +
Sbjct: 183 VEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQ-----------IDADK 231
Query: 526 FQTQLDSAELTLH------REFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 579
Q + D A+ R+ QQ+ K + + A + ++ + KA+ E+
Sbjct: 232 AQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIK 291
Query: 580 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH 639
+ + + LKA+ + +E+ KE K KK E V LQ+ + + T
Sbjct: 292 KNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV-AEDLQKTKPQVEAQPTS 350
Query: 640 LH------VLPC--VRVLSPVKTLST 657
L+ P ++V+ P+ LS
Sbjct: 351 LNEDAIDSSNPVYGLKVVDPITNLSE 376
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 37.4 bits (87), Expect = 0.067
Identities = 76/592 (12%), Positives = 176/592 (29%), Gaps = 75/592 (12%)
Query: 47 ASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQT 106
I+ +R Y + Q + I+ + + Y+ + +K +
Sbjct: 749 TRIQRAIRGRY-LRRRYLQALKR-IKKIQVIQHGFRLRRLVDYELKWRLFIKLQ------ 800
Query: 107 QLDSAELTLHREFQ-QKEKTIVENAA-----KQLKQISENEFKQKEKTIVENAAKQLKQI 160
L R+ I++ K+L++ E EF K + +++ +
Sbjct: 801 --PLLSLLGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGR----- 853
Query: 161 SEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVE 220
S K +L + + + +L + I +K + L K + + E +++
Sbjct: 854 SLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSL-KLVNLELESEIIELK 912
Query: 221 SVMQSQLQQAIHMITSDVTHLHTSKESLATRQPSEQSLIDSTTDTNCPVTQQYVPSSNQQ 280
+ S L + + T + L K+ L E I+
Sbjct: 913 KSLSSDLIENLEFKTELIARL---KKLLNNIDLEEGPSIEYVKL---------------- 953
Query: 281 YAPSSNQQYSIPTSNINQQYSMFSADSTHMNIVERDIPNSADDMRKY------IEMQYSM 334
+ S + + + I+ R+ + +++ + + QY
Sbjct: 954 ---PELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGA 1010
Query: 335 FSADSTHMNIVERDIPNSADDMRKYIEMLLEKPPGNPVSDLE----AEVKDSAGPMTSFL 390
+ + + ++ + E P+ L+ E +
Sbjct: 1011 LQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENNQLQARYKALK 1070
Query: 391 ESSVTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCE 450
S + ++ ++ ++ I+ + +E RNL + + ++ E
Sbjct: 1071 LRRENSLLDDKQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQE 1130
Query: 451 EVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREF 510
K +S + + L K S + + ++ L + +
Sbjct: 1131 IS--KFLSQL-----VNTLEPVFQKLSVL-QLELDGLFWEANLEALPSPPPFAALSEKRL 1182
Query: 511 YQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLML 570
YQ + EK K ++++ + L F K + K K ISE T+
Sbjct: 1183 YQ---SALYDEKSKLSSSEVNDLKNELIALF-SKIFSGWPRGDKLKKLISEGWVPTEYS- 1237
Query: 571 DFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVM 622
+ N K T + ++ LL + K+E +
Sbjct: 1238 ------TSLKGFNNLNKKFDTPASMSNEKLLSLLNSIDNLLS--SYKLEEEV 1281
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 35.8 bits (83), Expect = 0.075
Identities = 22/87 (25%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 26 QTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTEL 85
Q +L Q+ E+Q LA+ EL++ T ++ Q E+ + SA+ EL
Sbjct: 72 QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI-EL 130
Query: 86 REFYQTQLNNNVKEKLKEFQTQLDSAE 112
E N ++E+L E + + ++ E
Sbjct: 131 DE-----ENRELREELAELKQENEALE 152
Score = 33.1 bits (76), Expect = 0.66
Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 1/82 (1%)
Query: 5 DEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNN 64
E+Q LA+ EL++ T ++ Q E+ + SA+ EL E +
Sbjct: 83 AELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI-ELDEENRELREEL 141
Query: 65 QKAQDEIQTLLAKSASIKTELR 86
+ + E + L A++ ++ +
Sbjct: 142 AELKQENEALEAENERLQENEQ 163
Score = 31.5 bits (72), Expect = 2.0
Identities = 29/135 (21%), Positives = 54/135 (40%), Gaps = 34/135 (25%)
Query: 484 VRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQ 543
RL + Q E+ L + A ++ +L E Q N +K++L + +L+ + L R
Sbjct: 66 ERLPELQQELAELQEELAELQEQLAE--LQQENQELKQELSTLEAELERLQKELAR---- 119
Query: 544 KEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAE---LTQRDIKLKET 600
+KQ+S + EN++L+ E L Q + L+
Sbjct: 120 ------------IKQLSANAIEL-------------DEENRELREELAELKQENEALEAE 154
Query: 601 NHLLKEESKKRELMA 615
N L+E ++R +
Sbjct: 155 NERLQENEQRRWFLY 169
Score = 30.7 bits (70), Expect = 3.7
Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 4/90 (4%)
Query: 1 QKAQDEIQTLLAKSASIKT---ELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFY 57
+ Q E+ L + A ++ EL++ Q + E++ L + A IK +L
Sbjct: 69 PELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIK-QLSANA 127
Query: 58 QTQLNNNQKAQDEIQTLLAKSASIKTELRE 87
N++ ++E+ L ++ +++ E
Sbjct: 128 IELDEENRELREELAELKQENEALEAENER 157
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 37.1 bits (87), Expect = 0.081
Identities = 21/102 (20%), Positives = 44/102 (43%), Gaps = 10/102 (9%)
Query: 71 IQTLLAKSASIKTELREFYQ-----TQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKT 125
I +L ++ + E +L ++EK ++ Q + D +E QQ K
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581
Query: 126 IVENAAKQLKQISENEFK----QKEKTIVENAAKQLKQISEK 163
+ A + +K++ + + K ++E A K+L + +EK
Sbjct: 582 AKKEADEIIKELRQLQKGGYASVKAHELIE-ARKRLNKANEK 622
Score = 35.2 bits (82), Expect = 0.26
Identities = 42/203 (20%), Positives = 83/203 (40%), Gaps = 30/203 (14%)
Query: 421 IESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNA 480
I SLE + Q++ + E L++ + EE+ K E+L+ K + A
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEK----------KEKLQEEEDKLLEEA 571
Query: 481 DSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHRE 540
+ ++AQ I+ ++ I ELR+ + + +L E + +L+ A ++
Sbjct: 572 E-----KEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKK 626
Query: 541 FQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRD--IKLK 598
+++++ KQ K G L K EV + K E + +K+K
Sbjct: 627 KKKQKE----------KQEELKVGDEVKYLSLGQKGEVLSIPDDK---EAIVQAGIMKMK 673
Query: 599 ETNHLLKEESKKRELMAKKVESV 621
L++ K ++ KK ++V
Sbjct: 674 VPLSDLEKIQKPKKKKKKKPKTV 696
Score = 30.6 bits (70), Expect = 8.2
Identities = 22/126 (17%), Positives = 56/126 (44%), Gaps = 4/126 (3%)
Query: 4 QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN 63
++++ L+A ++ EL + + ++A+ + L K ++ E + +
Sbjct: 515 KEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA--- 571
Query: 64 NQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKE 123
++AQ I+ ++ I ELR+ + + +L E + +L+ A ++ ++K+
Sbjct: 572 EKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKK-KKKQ 630
Query: 124 KTIVEN 129
K E
Sbjct: 631 KEKQEE 636
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in
the lipid bilayer. The MscS proteins are heptamers of
three transmembrane subunits with seven converging M3
domains, and this MscS_porin is towards the N-terminal
of the molecules. The high concentration of negative
charges at the extracellular entrance of the pore helps
select the cations for efflux.
Length = 239
Score = 36.1 bits (84), Expect = 0.082
Identities = 29/148 (19%), Positives = 60/148 (40%), Gaps = 22/148 (14%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
Q+ Q+++Q ++ ++T Q QL+ ++ EI+ L + T L + +T
Sbjct: 95 QELQEQLQQENSQLIELQTRPERA-QQQLSEARRRLQEIRNRLQALSPGGTPLAQAQRTL 153
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
L A+ A++K ++ E L+NN +++L Q L +
Sbjct: 154 LQ-------------AELAALKAQIEELELELLSNNNRQELLRLQRDLLKKRIERLEAEL 200
Query: 121 Q--------KEKTIVENAAKQLKQISEN 140
Q K E A ++ ++++E
Sbjct: 201 QALQNAINRKRLAESEQAVEEAERLAEQ 228
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 36.3 bits (84), Expect = 0.12
Identities = 29/164 (17%), Positives = 63/164 (38%), Gaps = 6/164 (3%)
Query: 3 AQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQ-DEIQTLLAKSASIKTELREFYQTQL 61
+ E A+ S K + + QL N +K Q + + + + +
Sbjct: 47 FEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALE 106
Query: 62 NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQ 121
N + E++ L + + EL++ Q EK +E + + ++ ++
Sbjct: 107 LLNLEKDKELELLEKELDELSKELQKQLQNTAEI--IEKKRENNKNEERLKFENEKKLEE 164
Query: 122 KEKTIVENAAKQLKQISEN-EFKQKEKTI--VENAAKQLKQISE 162
+ E +QL + + + EFK+ E+ K+LK+ +E
Sbjct: 165 SLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLKRRAE 208
Score = 36.3 bits (84), Expect = 0.12
Identities = 23/148 (15%), Positives = 58/148 (39%), Gaps = 12/148 (8%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNN----NQKAQDEIQTLLAKSASIKTELREF 56
+ E + L + + +L N + E++ L + + EL++
Sbjct: 74 INQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQ 133
Query: 57 YQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLH 116
Q +K ++ + L+ + +L +++ + ++F+ QL A L L
Sbjct: 134 LQNTAEIIEKKRENNKNEE--------RLKFENEKKLEESLELEREKFEEQLHEANLDLE 185
Query: 117 REFQQKEKTIVENAAKQLKQISENEFKQ 144
+ ++++ K+LK+ +E +Q
Sbjct: 186 FKENEEQRESKWAILKKLKRRAELGSQQ 213
Score = 34.4 bits (79), Expect = 0.38
Identities = 31/172 (18%), Positives = 62/172 (36%), Gaps = 7/172 (4%)
Query: 63 NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQK 122
Q+ Q I L + E R Y++ + + ++ Q + + + +Q
Sbjct: 32 IEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQF 91
Query: 123 EKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEA 182
E + + AK N K KE +E K+L ++S++ L A+ +
Sbjct: 92 ELALQDEIAKLEALELLNLEKDKE---LELLEKELDELSKELQKQ---LQNTAEIIEKKR 145
Query: 183 ENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMI 234
EN K E + + + K L E K E + + + + ++
Sbjct: 146 ENNK-NEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRESK 196
>gnl|CDD|223107 COG0028, IlvB, Thiamine pyrophosphate-requiring enzymes [acetolactate
synthase, pyruvate dehydrogenase (cytochrome), glyoxylate
carboligase, phosphonopyruvate decarboxylase] [Amino acid
transport and metabolism / Coenzyme metabolism].
Length = 550
Score = 36.1 bits (84), Expect = 0.13
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 10/76 (13%)
Query: 1082 DVAYIGPKVDLRYDYEHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADISDGAA-- 1139
+ GP+ + Y + I++ A + L+ AK+P+I+ G + A
Sbjct: 166 EAEEPGPEPAILPPYRPAPPPPEAIRKAA-------ELLAEAKRPVILAGGGVRRAGASE 218
Query: 1140 -VLALVQQLAAKVTCE 1154
+ L ++L A V
Sbjct: 219 ELRELAEKLGAPVVTT 234
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 36.2 bits (84), Expect = 0.13
Identities = 36/259 (13%), Positives = 90/259 (34%), Gaps = 45/259 (17%)
Query: 4 QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDE--------------IQTLLAKSASI 49
++ + L + A ++ EL E ++ + +K + E ++ + A +
Sbjct: 222 SEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKAN 281
Query: 50 KTELREF----------------YQTQLNNNQKAQ--DEIQTLLAK------SASIKTEL 85
+ +LRE + QL +++Q Q L + + K L
Sbjct: 282 RAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLAL 341
Query: 86 REFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQ- 144
+ ++ + E K T + E T +E +Q+K+ ++ Q
Sbjct: 342 PAEHVKEIAAELAEIDKPATTDSEIPHRLSGSEL-----TQLEVLIQQVKRELQDAKSQL 396
Query: 145 -KEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNH 203
KE +E ++ + + + + + + E + +AE+ + +L+
Sbjct: 397 LKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKE 456
Query: 204 LLKEESKKRELMAKKVESV 222
++ K + K+ +
Sbjct: 457 AIEALRKTLDEKTKQKINA 475
Score = 33.5 bits (77), Expect = 0.93
Identities = 20/95 (21%), Positives = 43/95 (45%), Gaps = 5/95 (5%)
Query: 418 KEKIESLERNLQT--ETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK 475
+E++ +++ + T ++ +L EEL + + E+ R E I + LE L+ +
Sbjct: 404 EEELAEVDKKISTIPSEEQIAQLLEELGEAQ---NELFRSEAEIEELLRQLETLKEAIEA 460
Query: 476 RSKNADSLVRLQKAQDEIQTLLAKSASIKTELREF 510
K D + + E++ + + K L+EF
Sbjct: 461 LRKTLDEKTKQKINAFELERAITIADKAKKTLKEF 495
>gnl|CDD|239153 cd02752, MopB_Formate-Dh-Na-like, Formate dehydrogenase N, alpha
subunit (Formate-Dh-Na) is a major component of nitrate
respiration in bacteria such as in the E. coli formate
dehydrogenase N (Fdh-N). Fdh-N is a membrane protein that
is a complex of three different subunits and is the major
electron donor to the nitrate respiratory chain. Also
included in this CD is the Desulfovibrio gigas tungsten
formate dehydrogenase, DgW-FDH. In contrast to Fdh-N,
which is a functional heterotrimer, DgW-FDH is a
heterodimer. The DgW-FDH complex is composed of a large
subunit carrying the W active site and one [4Fe-4S]
center, and a small subunit that harbors a series of
three [4Fe-4S] clusters as well as a putative vacant
binding site for a fourth cluster. The smaller subunit is
not included in this alignment. Members of the
MopB_Formate-Dh-Na-like CD belong to the
molybdopterin_binding (MopB) superfamily of proteins.
Length = 649
Score = 36.2 bits (84), Expect = 0.14
Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 2/59 (3%)
Query: 1223 HGDHGASIADAI--LPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
G SI + LP A EK+ + N+ Q V PPG A+ D I+ L++
Sbjct: 380 PGMDPKSIQTEVFLLPAACQYEKEGSITNSGRWLQWRYKVVEPPGEAKSDGDILVELAK 438
Score = 32.0 bits (73), Expect = 2.8
Identities = 41/178 (23%), Positives = 64/178 (35%), Gaps = 32/178 (17%)
Query: 912 DAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKG----RFAYDGLKRQRLLTPFVR-N 966
+VG +I + G + + +N L KG F + RL P R
Sbjct: 7 CSVGCGLIAYVQNGVWVHQEGDPDHPVNRGSLCPKGAALRDFVHSPK---RLKYPMYRAP 63
Query: 967 CDGQLVATEWEDALIAVAQKLQTSEVAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAF 1026
G+ W++AL +A+K++ A V A A +V D + LGS L E +
Sbjct: 64 GSGKWEEISWDEALDEIARKMKDIRDASFVEKNA-AGVVVNRPDSIAFLGSAKLSNEECY 122
Query: 1027 PLEGAGTDLRANYL-----------------------LNNKIAGAEEADLILLIGTNP 1061
+ L N L + N + AD+IL++G NP
Sbjct: 123 LIRKFARALGTNNLDHQARIUHSPTVAGLANTFGRGAMTNSWNDIKNADVILVMGGNP 180
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 35.7 bits (82), Expect = 0.19
Identities = 42/210 (20%), Positives = 88/210 (41%), Gaps = 20/210 (9%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
+ +IE LE L+ T+++ +LEE+ +E S+ L R+ + +R
Sbjct: 170 RSRIEQLETALRHSTEKTEELEEQ-------HKEAQSSSESMSAERNALLAQRAENQQRI 222
Query: 478 KNADSLVR-LQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELT 536
+ ++ L + + E +L + I+ EL ++ +K++LK+ Q E
Sbjct: 223 LELEQDIQTLTQKKQENDRVLEGTQDIEAELE-----RMKGELKQRLKKMTIQRRDEETE 277
Query: 537 LHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYE---AENKKLKAELTQR 593
+ E+ + Q + S + A L + + + + AE + + E Q
Sbjct: 278 RIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQM 337
Query: 594 DIKLKETNHLLKEE----SKKRELMAKKVE 619
++L + N LKE +++RE + + E
Sbjct: 338 SLQLSQLNLALKEGQSQWAQERETLRQSAE 367
>gnl|CDD|203110 pfam04879, Molybdop_Fe4S4, Molybdopterin oxidoreductase Fe4S4
domain. This domain is found in formate dehydrogenase H
for which the structure is known. The first domain
(residues 1 to 60, 448 to 476, and 499 to 540),
comprising two small antiparallel sheets and four
helices, coordinates the Fe4S4 cluster just below the
protein surface.
Length = 55
Score = 31.5 bits (72), Expect = 0.21
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYD 952
VG ++V + G++++V + +N L KGRF Y+
Sbjct: 12 GVGCGLLVHVKDGKIVKVEGDPDHPVNRGRLCVKGRFGYE 51
>gnl|CDD|239157 cd02756, MopB_Arsenite-Ox, Arsenite oxidase (Arsenite-Ox) oxidizes
arsenite to the less toxic arsenate; it transfers the
electrons obtained from the oxidation of arsenite towards
the soluble periplasmic electron carriers cytochrome c
and/or amicyanin. Arsenite oxidase is a heterodimeric
enzyme containing a large and a small subunit. The large
catalytic subunit harbors the molybdopterin cofactor and
the [3Fe-4S] cluster; and the small subunit belongs to
the structural class of the Rieske proteins. The small
subunit is not included in this alignment. Members of
MopB_Arsenite-Ox CD belong to the molybdopterin_binding
(MopB) superfamily of proteins.
Length = 676
Score = 35.5 bits (82), Expect = 0.22
Identities = 34/125 (27%), Positives = 44/125 (35%), Gaps = 26/125 (20%)
Query: 957 QRLLTPFVRNCDGQLVATEWEDALIAVAQKLQTSEVAGVVGSLA-DAEAMVALKD----- 1010
RL TP VR GQL T W+DA+ VA+ + G++ D + D
Sbjct: 116 TRLTTPLVRR-GGQLQPTTWDDAIDLVARV-----IKGILDKDGNDDAVFASRFDHGGGG 169
Query: 1011 --LLNKLGSEDLYTEY------------AFPLEGAGTDLRANYLLNNKIAGAEEADLILL 1056
N G + A+ E T LNN A AD I+L
Sbjct: 170 GGFENNWGVGKFFFMALQTPFVRIHNRPAYNSEVHATREMGVGELNNSYEDARLADTIVL 229
Query: 1057 IGTNP 1061
G NP
Sbjct: 230 WGNNP 234
>gnl|CDD|226369 COG3851, UhpB, Signal transduction histidine kinase,
glucose-6-phosphate specific [Signal transduction
mechanisms].
Length = 497
Score = 35.1 bits (81), Expect = 0.25
Identities = 43/196 (21%), Positives = 85/196 (43%), Gaps = 31/196 (15%)
Query: 404 LYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVS----- 458
L + + + L I+ +++ L + LQ E R+R L E+L T EE +RK+V+
Sbjct: 257 LLAQALTGLGLGIAIQRLRELNQRLQKELARNRALAEQLIST----EESIRKDVARELHD 312
Query: 459 --------IVTVTLFLERLRSSHSKRSKNADSLVRLQ-KAQDEIQTLLAK---------- 499
I T ++R +++ + A + +L + D ++ LL +
Sbjct: 313 EIGQNITAIRTQAGIVKRAAD-NAQVKQAASLIEQLSLRIYDSVRQLLGRLRPRQLDDLT 371
Query: 500 -SASIKTELREFYQTQLNNNVKEKLKEFQTQLD-SAELTLHREFQQKEKTIVENAAKQLK 557
+I++ LRE + + + + +T LD + +TL+R Q+ I ++A
Sbjct: 372 LEQAIRSLLREMELEERGIHCQLDWRINETALDETQRVTLYRLCQELLNNICKHADASAV 431
Query: 558 QISEKPGATKLMLDFK 573
I +LML+ +
Sbjct: 432 TIQLWQQDERLMLEIE 447
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 35.1 bits (81), Expect = 0.25
Identities = 50/239 (20%), Positives = 87/239 (36%), Gaps = 27/239 (11%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFY--- 57
+ + L + +L++ Q + Q+ D +Q L + + E
Sbjct: 157 NEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRL-DFLQFQLEELEEADLQPGEDEALE 215
Query: 58 --QTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQL---DSAE 112
Q +L+N +K ++ Q LA LR Q + + E L E Q L
Sbjct: 216 AEQQRLSNLEKLRELSQNALAA-------LRGDVDVQEGSLL-EGLGEAQLALASVIDGS 267
Query: 113 LTLHREFQQKEKTIVENAAKQLKQISEN-EFKQKEKTIVENAAKQLKQISEKPGATKLML 171
L E T VE A ++L+ + EF + +E Q+K++ K
Sbjct: 268 LRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERLAQIKRLKRK-------- 319
Query: 172 DFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 230
+ A E +K+K EL Q D + L +E K E + K ++ + + A
Sbjct: 320 -YGASVEEVLEYAEKIKEELDQLDDSDESLEALEEEVDKLEEELDKAAVALSLIRRKAA 377
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 35.1 bits (81), Expect = 0.26
Identities = 36/219 (16%), Positives = 71/219 (32%), Gaps = 25/219 (11%)
Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
LK +++I +LE+ ++ + + KLE++L+ + + L
Sbjct: 40 LKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQ----------LIETADDL 89
Query: 474 SKRSKN----ADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQ 529
K K L L+ + E + LA+ +L ++ N + Q
Sbjct: 90 KKLRKQIADLNARLNALEVQEREQRRRLAE------QLAALQRSGRNPPPALLVSPEDAQ 143
Query: 530 LDSAELTLHREFQQ---KEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKL 586
+ + ++ KQL + + A + L A+ KL
Sbjct: 144 RSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAEL--TTLLSEQRAQQAKL 201
Query: 587 KAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQ 625
L +R L + N L + KK E + +
Sbjct: 202 AQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEI 240
Score = 32.4 bits (74), Expect = 1.9
Identities = 34/223 (15%), Positives = 70/223 (31%), Gaps = 25/223 (11%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
K + ++++L + AS++ +L E L +K ++ L + E R Q
Sbjct: 62 AKLEKQLKSLETEIASLEAQLIET-ADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQ 120
Query: 61 LNNNQK-----------------AQDEIQTLLAKSA-SIKTELREFYQTQ-LNNNVKEKL 101
L Q+ + + + T V+ ++
Sbjct: 121 LAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEI 180
Query: 102 KEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQ--LKQ 159
Q +L + + + + + E K+ +E +K + E A + LK
Sbjct: 181 AAEQAELTTLLSEQRAQQAKLAQLLEER--KKTLAQLNSELSADQKKLEELRANESRLKN 238
Query: 160 -ISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKET 201
I+ A + A E A + +A+ T K
Sbjct: 239 EIASAEAAAAKAREAAAAAEAAAARARAAEAKRTGETYKPTAP 281
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 34.6 bits (80), Expect = 0.32
Identities = 31/125 (24%), Positives = 54/125 (43%), Gaps = 14/125 (11%)
Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSS 472
+K + ++ L+ LQ Q+ + EEL+ R+ E + VS + L LER R+
Sbjct: 152 QIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKE---KGLVSRLE-LLELERERAE 207
Query: 473 HSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDS 532
A + +A E++ L + ++ E ++ QT V E+L E Q +L
Sbjct: 208 -------AQGELGRLEA--ELEVLKRQIDELQLERQQIEQT-FREEVLEELTEAQARLAE 257
Query: 533 AELTL 537
L
Sbjct: 258 LRERL 262
Score = 31.5 bits (72), Expect = 3.5
Identities = 34/165 (20%), Positives = 67/165 (40%), Gaps = 22/165 (13%)
Query: 4 QDEIQTLLAKSASIKTELREFYQTQLNNNQK----AQDEIQ---TLLAKSASIKTELREF 56
+ +++ +LA+ ++ EL Q QL ++ +E++ L K + EL E
Sbjct: 143 RAQLELILAQIKQLEAELAG-LQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELLE- 200
Query: 57 YQTQLNNNQKAQDEIQTLLAKSASIKTELREF------YQTQLNNNVKEKLKEFQTQLDS 110
+AQ E+ L A+ +K ++ E + V E+L E Q +L
Sbjct: 201 ---LERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEVLEELTEAQARLAE 257
Query: 111 AELTLHR-EFQQKEKTI---VENAAKQLKQISENEFKQKEKTIVE 151
L++ + + I V+ + LK + Q +T++E
Sbjct: 258 LRERLNKARDRLQRLIIRSPVDGTVQSLKVHTVGGVVQPGETLME 302
>gnl|CDD|235931 PRK07092, PRK07092, benzoylformate decarboxylase; Reviewed.
Length = 530
Score = 34.9 bits (81), Expect = 0.33
Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 1/31 (3%)
Query: 1120 LSAAKKPLIVVGADISDGAAVLALVQQLAAK 1150
L AA++P +VVG + D A +LA +
Sbjct: 203 LDAARRPALVVGPAV-DRAGAWDDAVRLAER 232
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 34.6 bits (80), Expect = 0.37
Identities = 46/261 (17%), Positives = 92/261 (35%), Gaps = 66/261 (25%)
Query: 6 EIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELR--EFYQTQLN- 62
+I L K ++RE Q +IQTL K I+ +++ +
Sbjct: 161 DISVLSEMDKLNKDKIRELNQ-----------QIQTLDMKIDHIQQQIKTYNKNIEEQRK 209
Query: 63 ----NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKE---------KLKEFQTQLD 109
N + Q++ L+ ++ +IK E+ E +L N V + KL ++
Sbjct: 210 KNGENIARKQNKYDELVEEAKTIKAEIEEL-TDELLNLVMDIEDPSAALNKLNTAAAKIK 268
Query: 110 S------AELTLHRE----------FQQKEKTIVE------NAAKQLKQISE-------- 139
S + ++ + + I + L+++
Sbjct: 269 SKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI 328
Query: 140 -NEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKL 198
+EF ++ K ++E K IS + L KAK +A ++L+AE +L
Sbjct: 329 MDEFNEQSKKLLELKNK----ISTNKQSLIT-LVDKAK--KVKAAIEELQAEFVDNAEEL 381
Query: 199 KETNHLLKEESKKRELMAKKV 219
+ L + K + + K+
Sbjct: 382 AKLQDELDKIVKTKSELVKEK 402
Score = 33.1 bits (76), Expect = 1.3
Identities = 18/104 (17%), Positives = 40/104 (38%), Gaps = 9/104 (8%)
Query: 3 AQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREF---YQT 59
D I + K ++ L + T ++ ++ DE K +K ++ T
Sbjct: 297 GPDRITKIKDKLKELQHSLEKL-DTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLIT 355
Query: 60 QLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKE 103
++ +K + I+ L A+ EL +L + + + +K
Sbjct: 356 LVDKAKKVKAAIEELQAEFVDNAEEL-----AKLQDELDKIVKT 394
>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit
4. Members of this family function as part of the
Mediator (Med) complex, which links DNA-bound
transcriptional regulators and the general
transcription machinery, particularly the RNA
polymerase II enzyme. They play a role in basal
transcription by mediating activation or repression
according to the specific complement of transcriptional
regulators bound to the promoter.
Length = 176
Score = 33.5 bits (77), Expect = 0.38
Identities = 18/85 (21%), Positives = 39/85 (45%), Gaps = 9/85 (10%)
Query: 4 QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN 63
+ + L++ +K+ L +L +Q+ Q I L A+ S+ ++++ +L
Sbjct: 1 LELAEDLISADDELKSAL-----KELEEHQENQQRILELRAEVESLDEKIKDI-LKELKE 54
Query: 64 NQKAQDEIQTLLAKSASIKTELREF 88
A+ E++TL S+ K R+
Sbjct: 55 ---AEKELRTLPDLSSINKANKRKV 76
>gnl|CDD|216842 pfam02009, Rifin_STEVOR, Rifin/stevor family. Several multicopy
gene families have been described in Plasmodium
falciparum, including the stevor family of subtelomeric
open reading frames and the rif interspersed repetitive
elements. Both families contain three predicted
transmembrane segments. It has been proposed that stevor
and rif are members of a larger superfamily that code
for variant surface antigens.
Length = 290
Score = 33.9 bits (78), Expect = 0.47
Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 3/44 (6%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVT 461
KE ++ R QT QR ++ +E ++D RQ C+E KE+ +
Sbjct: 38 KEVMDKFNR--QTS-QRFKEYDERMKDKRQKCKEQCDKEIQKII 78
>gnl|CDD|227731 COG5444, COG5444, Uncharacterized conserved protein [Function
unknown].
Length = 565
Score = 34.4 bits (79), Expect = 0.49
Identities = 39/205 (19%), Positives = 70/205 (34%), Gaps = 27/205 (13%)
Query: 17 IKTELREFYQTQLNNNQKAQDEIQTLLAKSASI----KTELREFYQTQLNNNQKAQDEIQ 72
+K EL Y ++ + I+ +L+ + I K + +F + + A+D ++
Sbjct: 101 LKHELPNGYDRYEEIMEEQKKAIKDILSDVSHILDLGKLDTEDFKDEVDSARKHAKDTLE 160
Query: 73 TLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAK 132
L + L V+ K++E + + Q +E TI A K
Sbjct: 161 KLYK---------LDQEGMTLMAAVESKMQELKAIIR----------QLEEWTIKGGATK 201
Query: 133 QLKQISENEFKQKEKTIVENAA---KQLKQ-ISEKPGATKLMLDFKAKCEVYEAENKKLK 188
+ I E TI + AA Q + + K K D + + E+ NK
Sbjct: 202 KGVPIHYVAKAFAEVTIHKKAAEVALQSETYLDIKTELAKERPDMRDLDKPLESANKTGY 261
Query: 189 AELTQRDIKLKETNHLLKEESKKRE 213
L DI + + R
Sbjct: 262 EGLAGEDIVPFFLAEETGQLAYSRA 286
>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
(trigger factor) [Posttranslational modification,
protein turnover, chaperones].
Length = 441
Score = 34.2 bits (79), Expect = 0.50
Identities = 40/191 (20%), Positives = 80/191 (41%), Gaps = 17/191 (8%)
Query: 45 KSASIKTELREFYQTQL--NNNQKAQD--EIQTLLAKSASIKTELREFYQTQLNNNVKEK 100
K A+ K +++E + +L +++ A+ E TL ++ L + KE+
Sbjct: 229 KEATFKVKVKEVKKRELPELDDEFAKKLGEEDTLEELKEKLRKNLERELKEATLEKRKEQ 288
Query: 101 LKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQI 160
L + + + +L + + +++ A +QL+Q + + E ++ K+
Sbjct: 289 LLDALVEANDFDLP-ESLVEAEIDNLLKQALQQLQQQGIDSL-EASGESEEELREEFKEE 346
Query: 161 SEKPGATKLMLDFKAKCEVYEAENKKLKAELT-----------QRDIKLKETNHLLKEES 209
+EK L+L+ AK E E +++KAE+ + IKL N L +
Sbjct: 347 AEKRVKLGLLLEEIAKEEKLEVTEEEIKAEIEELARQYGGEQPEEVIKLYYNNQELLDAL 406
Query: 210 KKRELMAKKVE 220
K L K V+
Sbjct: 407 KADILEEKAVD 417
>gnl|CDD|215786 pfam00205, TPP_enzyme_M, Thiamine pyrophosphate enzyme, central
domain. The central domain of TPP enzymes contains a
2-fold Rossman fold.
Length = 136
Score = 32.5 bits (75), Expect = 0.50
Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 3/38 (7%)
Query: 1118 KKLSAAKKPLIVVG--ADISDG-AAVLALVQQLAAKVT 1152
+ L++AK+P+I+ G S + AL ++L V
Sbjct: 6 ELLASAKRPVILAGGGVRRSGASEELRALAEKLGIPVV 43
Score = 29.8 bits (68), Expect = 3.9
Identities = 28/101 (27%), Positives = 36/101 (35%), Gaps = 34/101 (33%)
Query: 982 AVAQKLQTSE----VAGVVGSLADAEAMVALKDLLNKLG-------------SED--LYT 1022
A+ L +++ +AG + A L+ L KLG ED LY
Sbjct: 3 KAAELLASAKRPVILAGGGVRRSGASE--ELRALAEKLGIPVVTTLMGKGAFPEDHPLYL 60
Query: 1023 EYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRF 1063
G AN L EEADL+L IG RF
Sbjct: 61 GML----GMHGTPAANEAL-------EEADLVLAIGA--RF 88
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 34.0 bits (79), Expect = 0.53
Identities = 53/271 (19%), Positives = 109/271 (40%), Gaps = 59/271 (21%)
Query: 4 QDEIQTLL---AKSASIKTELREFYQ----TQLNNNQK---AQDEIQTLLAKSASIKTEL 53
+E+Q LL K+ +L++ Y+ + L N A DE++ L ++ E
Sbjct: 125 LEELQELLESEEKNREEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLEN---LEEEF 181
Query: 54 REFYQTQLNNN---QKAQDEIQTLLAKSASIKTELREFYQ--TQLNNNVKEKLKEFQT-- 106
+F +L + +A++ + L + A+++ + E + +L + ++L+E +
Sbjct: 182 SQF--VELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTELPDQLQELKAGY 239
Query: 107 -QLDSA-----ELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQI 160
+L L + +E Q ++ I EN L + E + ++ A ++ ++I
Sbjct: 240 RELVEEGYHLDHLDIEKEIQDLKEQIDEN----LALLEELD--------LDEAEEKNEEI 287
Query: 161 SEKPGATKLMLD-----FKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEE----SKK 211
E+ + D KA+ V +K L KE N LKEE +
Sbjct: 288 QER---IDQLYDILEREVKARKYV-----EKNSDTLPDFLEHAKEQNKELKEEIDRVKQS 339
Query: 212 RELMAKKVESVMQSQLQQAIHMITSDVTHLH 242
L ++ES QL++ + + +
Sbjct: 340 YTLNESELES--VRQLEKQLESLEKQYDEIT 368
Score = 32.1 bits (74), Expect = 2.3
Identities = 25/135 (18%), Positives = 55/135 (40%), Gaps = 26/135 (19%)
Query: 516 NNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLD---- 571
+ ++++++++ + Q+D L E+ ++ A ++ ++I E+ + D
Sbjct: 251 HLDIEKEIQDLKEQIDENLALL-------EELDLDEAEEKNEEIQER---IDQLYDILER 300
Query: 572 -FKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEE----SKKRELMAKKVESVMQSQL 626
KA+ V +K L KE N LKEE + L ++ES QL
Sbjct: 301 EVKARKYV-----EKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELES--VRQL 353
Query: 627 QQAIHMITSDVTHLH 641
++ + + +
Sbjct: 354 EKQLESLEKQYDEIT 368
>gnl|CDD|239167 cd02766, MopB_3, The MopB_3 CD includes a group of related
uncharacterized bacterial and archaeal
molybdopterin-binding oxidoreductase-like domains with a
putative N-terminal iron-sulfur [4Fe-4S] cluster binding
site and molybdopterin cofactor binding site. These
members belong to the molybdopterin_binding (MopB)
superfamily of proteins.
Length = 501
Score = 34.1 bits (79), Expect = 0.54
Identities = 45/167 (26%), Positives = 66/167 (39%), Gaps = 35/167 (20%)
Query: 918 IIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ----RLLTPFVRN--CDGQL 971
++V+ G ++RV ++ KG Y ++R RLLTP R GQ
Sbjct: 14 LLVTVEDGRIVRVEGDPAHPYTRGFICAKGA-RY--VERVYSPDRLLTPLKRVGRKGGQW 70
Query: 972 VATEWEDALIAVAQKLQ------TSEV------AGVVGSLA-DAEAMVALKDLLNKLG-S 1017
W++AL +A KL+ E AG +G L A + LG S
Sbjct: 71 ERISWDEALDTIAAKLKEIKAEYGPESILPYSYAGTMGLLQRAARGR-----FFHALGAS 125
Query: 1018 EDLYTEYAFPLEGAGTD-LRANY--LLNNKIAGAEEADLILLIGTNP 1061
E T GAG + + ++ L N ADLI++ G NP
Sbjct: 126 ELRGT----ICSGAGIEAQKYDFGASLGNDPEDMVNADLIVIWGINP 168
Score = 33.4 bits (77), Expect = 0.94
Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 13/59 (22%)
Query: 1228 ASIADAILPGAAYTEK-------QSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
A AD +LP + E Y+ + A+ PPG AR + +I R L++
Sbjct: 371 ARYADIVLPATTFLEHEDVYASYWHYYL---QYNEP---AIPPPGEARSNTEIFRELAK 423
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 34.2 bits (78), Expect = 0.63
Identities = 109/636 (17%), Positives = 239/636 (37%), Gaps = 61/636 (9%)
Query: 6 EIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQ 65
E++ +K S ++ F+ +L + + A+ T +E + NQ
Sbjct: 18 ELERKQSKLGSSMNSIKTFWSPELKRERALRK-------DEAARITVWKEQLRVTQEENQ 70
Query: 66 KAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQ-TQLDSAELTLHREFQQKEK 124
AQ +Q L + S + R + + + N +L++ + ++ + + RE Q E
Sbjct: 71 HAQLTVQALQEELKSQRELNRLQQKLEKSGNETGELQQLEKSEGQFSRIQAERERQAMEL 130
Query: 125 TIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAEN 184
++ + ++L+ E Q+ + K+L ++ + G L KA+ E E
Sbjct: 131 FLLRDTLEKLQ--LRMERLQQTLDARDEEIKKLLEMLQSKG-----LSAKAEEEDSERLG 183
Query: 185 KKLKAELTQRDI----KLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH 240
+ +AE Q + + KE H++ E ++L ++ ++ ++ Q+ I + +
Sbjct: 184 RIREAEDQQSHLEVLLEQKEKEHMMLREEIHQKLQMERDDAKTEA-SQKLIDEKDTKIKE 242
Query: 241 LHTSKESLATRQPSEQSLID-STTDTNCPVTQQYVPSSNQQYAPSSNQQYSIPTSNINQQ 299
E QS D S D N + S + + + S +
Sbjct: 243 FEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTE 302
Query: 300 YSMFSADSTHMNIVERDIPNSADDMRKYIEMQYSMFSADSTHMNIVERDIPNSADDMRKY 359
T + + N DMR++++ + I++ ++ D +R
Sbjct: 303 ---LLGLQTEL----ETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEV----DALRYE 351
Query: 360 IE---MLLEKPPGNPVSDLEAEVKDSAGPMTSFLESSVTSDIFNEPALYTKTVRSVTLKI 416
+E L K + + + E AG +I + Y KT R L++
Sbjct: 352 LERKHNTLTKKTAS-LQAAQEEKATYAG------------EIEDMRDRYEKTERK--LRV 396
Query: 417 SKEKIESLERNLQTETQRSRKLEE---ELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
++KIE+L+ + + +R ++ +E LQ T + + E ++ +ERL+
Sbjct: 397 LQKKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQR 456
Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREF-YQTQLNNNVKEKLKEFQTQLDS 532
+ + + + E + L + +++ +L E Q +L KL Q + S
Sbjct: 457 DRDER--YEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQLKQRS 514
Query: 533 AELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ 592
H E ++ + E K+LK++ P + A A++ + E+
Sbjct: 515 DLERAHIELEKIREKH-EKLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVD- 572
Query: 593 RDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQ 628
+L + ++E E+ A ++ ++ +
Sbjct: 573 ---RLLDRLEKAEQERDDTEMEAGRLAKELEKAQRH 605
>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory
protein. This family represents an accessory protein
that works with the bacteriocin maturation and ABC
transport secretion protein described by TIGR01193
[Transport and binding proteins, Other].
Length = 457
Score = 33.9 bits (78), Expect = 0.67
Identities = 49/241 (20%), Positives = 86/241 (35%), Gaps = 47/241 (19%)
Query: 1 QKAQDEIQTLLAKSASIKTELREF-------YQTQLNN-----------NQKAQDEIQTL 42
+D+ ++L SI+ +F Y+ N Q+ D+ QT
Sbjct: 107 DNLKDQKKSLDTLKQSIENGRNQFPTDDSFGYRNLFNGYLAQVESLTSETQQQNDKSQTQ 166
Query: 43 LAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREF--YQTQLNNNVKEK 100
+ K +L Q++ + + Q L + S T++ F YQ+ N
Sbjct: 167 NEAAEKTKAQL----DQQISKTDQKLQDYQALK-NAISNGTKVANFNPYQSLYEN----- 216
Query: 101 LKEFQTQLDSAELTLHREFQQKEKTIVENAAKQ---LKQISENEFKQKEKTIVENAAKQL 157
+Q QL SA + Q + TI+ +Q L++ + QK A+
Sbjct: 217 ---YQAQLKSASDKDQKN--QVKSTILATIQQQIDQLQKSIASYQVQKAGLTKSTASNYA 271
Query: 158 KQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAK 217
+ K K K K E+ + K L+ E + LKE+S+K + A
Sbjct: 272 SSQNSKLAQLKEQQLAKVKQEITDLNQKLLELESKIK---------SLKEDSQKGVIKAP 322
Query: 218 K 218
+
Sbjct: 323 E 323
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 33.3 bits (76), Expect = 0.74
Identities = 58/225 (25%), Positives = 102/225 (45%), Gaps = 26/225 (11%)
Query: 8 QTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKA 67
Q+LL ++ + + E + QL KA+DEI L ++K EL +FY + + +
Sbjct: 87 QSLLKQNKVLMEK-NEKLEEQLG---KARDEILQL-RHELNLKDELLQFY-SDADEESED 140
Query: 68 QDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHRE---FQQKEK 124
+ T L S + F L +K L+E L S L E +++KE+
Sbjct: 141 ESSESTPLRPQESSSSSHGCFQLEALQEKLKL-LEEENEHLRSEASHLKTETVTYEEKEQ 199
Query: 125 TIVENAAKQLK----QISENEFKQKEKTIVENAAKQLKQISEKPGATKLM---LDFKAKC 177
+V + KQL+ QI+ + +KT E+ +Q ++I T L+ +D + KC
Sbjct: 200 QLVNDCVKQLREANDQIASLSEELAKKT--EDLERQQEEI------THLLSQIVDLQKKC 251
Query: 178 EVYEAENKKLKAEL-TQRDIKLKETNHLLKEESKKRELMAKKVES 221
+ Y EN++L+ L +D + + L + + K E M E+
Sbjct: 252 KSYALENEELQQHLAAAKDAQRQLQAELQELQDKYAECMEMLHEA 296
>gnl|CDD|219925 pfam08598, Sds3, Sds3-like. Repression of gene transcription is
mediated by histone deacetylases containing
repressor-co-repressor complexes, which are recruited to
promoters of target genes via interactions with
sequence-specific transcription factors. The
co-repressor complex contains a core of at least seven
proteins. This family represents the conserved region
found in Sds3, Dep1 and BRMS1-homologue p40 proteins.
Length = 184
Score = 32.7 bits (75), Expect = 0.79
Identities = 26/97 (26%), Positives = 43/97 (44%), Gaps = 10/97 (10%)
Query: 520 KEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKC--E 577
+E+L + QT+L+ H E+ + K + E +LK +L ++K +C
Sbjct: 23 RERLAQLQTELELLLQGTHPEYLEPLKDLEERRDDRLKVA-------ELRREYKLECIER 75
Query: 578 VYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELM 614
YEAE + K E + L+E L + E K L
Sbjct: 76 EYEAERQAAKQEFEKEKRLLRE-RLLEELEEKIYRLE 111
>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57. The
CLD or centrosome localisation domain of Cep57 is found
at the N-terminus, and lies approximately between
residues 58 and 239. This region lies within the first
alpha-helical coiled-coil segment of Cep57, and
localises to the centrosome internally to gamma-tubulin,
suggesting that it is either on both centrioles or on a
centromatrix component. This N-terminal region can also
multimerise with the N-terminus of other Cep57
molecules. The C-terminal part, Family Cep57_MT_bd,
pfam06657, is the microtubule-binding region of Cep57.
Length = 178
Score = 32.5 bits (74), Expect = 0.80
Identities = 44/167 (26%), Positives = 77/167 (46%), Gaps = 22/167 (13%)
Query: 67 AQDEIQTLLAKSASIKTELREFYQTQ--LNNNVKEKLKEFQTQLDSAELTLHREFQQKE- 123
A+D +QTL ++ K L E Q + N + ++ K+ +QL SAE +Q E
Sbjct: 23 AEDNLQTLSREATHYKKVLEEEEQERNLAANELTKQNKDLTSQLSSAESRCSLLEKQLEY 82
Query: 124 -KTIVENAAKQLKQISENEFK-QKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYE 181
+ +VENA K+ + E + Q+EK E+ ++ EK EV E
Sbjct: 83 MRRMVENAEKERTAVLEQQVSLQREK---EHDQMDVQAQLEK-------------LEVLE 126
Query: 182 AENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQ 228
E +L + + K+++ L+EE +R+L+ K + Q+ L+
Sbjct: 127 KEYLRLTRTQSLAETKIQQLEEKLQEEEHQRKLVQDKAAEL-QTGLE 172
Score = 32.1 bits (73), Expect = 0.98
Identities = 41/151 (27%), Positives = 72/151 (47%), Gaps = 13/151 (8%)
Query: 489 AQDEIQTLLAKSASIKTELREFYQTQ--LNNNVKEKLKEFQTQLDSAELTLHREFQQKE- 545
A+D +QTL ++ K L E Q + N + ++ K+ +QL SAE +Q E
Sbjct: 23 AEDNLQTLSREATHYKKVLEEEEQERNLAANELTKQNKDLTSQLSSAESRCSLLEKQLEY 82
Query: 546 -KTIVENAAKQLKQISEKPG-----ATKLMLDFKA---KCEVYEAENKKLKAELTQRDIK 596
+ +VENA K+ + E+ +D +A K EV E E +L + + K
Sbjct: 83 MRRMVENAEKERTAVLEQQVSLQREKEHDQMDVQAQLEKLEVLEKEYLRLTRTQSLAETK 142
Query: 597 LKETNHLLKEESKKRELMAKKVESVMQSQLQ 627
+++ L+EE +R+L+ K + Q+ L+
Sbjct: 143 IQQLEEKLQEEEHQRKLVQDKAAEL-QTGLE 172
>gnl|CDD|240667 cd11657, TIN2_N, N-terminal domain of TRF-interacting nuclear
factor 2; shelterin complex protein of telomeres. TIN2
is one of the six proteins of shelterin complex, which
acts to protect telomeres from DNA damage repair
machinery. TIN2 binds directly to TRF1 and TRF2 and
stabilizes TRF2 complex-telomere binding by tethering it
to the TRF1 complex. TIN2 binding to TRF2 is primarily
via the TRF binding motif (TBM) region and the
N-terminus, while the far C-terminal region has lower
affinity. The TIN2 TBM, but not the N-terminal region,
is involved in TIN2 binding to TRF1. Truncation of the
TIN2 N-terminus in mouse results in telomere elongation,
suggesting a negative regulatory function of this
region. Three shelterin components (TRF1, TRF2, POT1)
bind DNA and 3 components (TIN2, RAP1, TPP1) are
recruited by these DNA binding factors. TRF1 activity at
telomeres is regulated in part by selective
ubiquitination and degradation. Ubiquitination of TRF1
is mediated by Fbx4, which binds TRF1 in the TRFH
domain, via a small GTPase module. When bound to
telomeres, TIN2 acts to protect TRF1 from SCF-Fbx4
mediated ubiquitination. F-box proteins act in substrate
recognition as part of Skp1-Cul1-Rbx1-F- box (SCF)
protein complexes. Tankyrase-mediated ADP-ribosylation
releases TRF1 from telomeres, rendering them susceptible
to ubiquitination and degradation, promoting telomere
elongation. TIN2 also binds PIP1, which recruits POT1 to
telomeres.
Length = 188
Score = 32.6 bits (75), Expect = 0.80
Identities = 22/133 (16%), Positives = 49/133 (36%), Gaps = 16/133 (12%)
Query: 451 EVVRKEVSIVTVTL--FLERLRSSHSKRSKNADSLVRLQKAQDE----IQTLLA----KS 500
E+ R E + L L RL + + + D +++ Q +Q+LL +
Sbjct: 60 ELCRNEKAADLAALNPHLPRLLPPYPNKCQRED--ALMEEPQLNFLQLVQSLLKDPEERE 117
Query: 501 ASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTI--VENAAKQLKQ 558
+ Y + + +++ L EF ++L+ L + +Q + + ++
Sbjct: 118 HFFQEVFPVEYGEEFDQALEKLLWEFLSRLEKL-LPVPD-LKQTVSWLSTSPSVLEECLD 175
Query: 559 ISEKPGATKLMLD 571
P K +L
Sbjct: 176 SVSDPDDLKSLLQ 188
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 32.9 bits (75), Expect = 0.81
Identities = 29/133 (21%), Positives = 54/133 (40%), Gaps = 8/133 (6%)
Query: 39 IQTLLAKSASIKTELREFYQTQLNNNQKAQ---DEIQTLLAKSASIKTELREFYQTQLNN 95
IQ + + I ++ Q Q Q+A+ + Q+L + AS + +E Q +
Sbjct: 31 IQVMKKRQQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQH 90
Query: 96 NVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQ-----KEKTIV 150
+ E ++ T D L RE Q+ K + + +Q+ +I E+ IV
Sbjct: 91 LLNEAREDVATARDEWLEQLEREKQEFFKALQQQTGQQMVKIIRAALADLANATLEQQIV 150
Query: 151 ENAAKQLKQISEK 163
+L+ +SE
Sbjct: 151 GIFIARLEHLSEA 163
Score = 29.8 bits (67), Expect = 9.3
Identities = 16/74 (21%), Positives = 34/74 (45%)
Query: 486 LQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKE 545
++ + + Q+L + AS + +E Q + + E ++ T D L RE Q+
Sbjct: 59 AERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEAREDVATARDEWLEQLEREKQEFF 118
Query: 546 KTIVENAAKQLKQI 559
K + + +Q+ +I
Sbjct: 119 KALQQQTGQQMVKI 132
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 33.8 bits (77), Expect = 0.87
Identities = 41/214 (19%), Positives = 85/214 (39%), Gaps = 18/214 (8%)
Query: 428 LQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLR------SSHSKRSKNAD 481
Q LE+ +Q+ T E VV +E S+ V++ ++ S K + D
Sbjct: 523 CQDLLDGKADLEKFIQEFSLTLEWVVNQEKSLQDVSVEASEIKKNFLGLKSSEKEINSPD 582
Query: 482 SLVRLQK--AQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHR 539
+ +E++TL ++ ++ EL + K KL+E + +L+ L
Sbjct: 583 EVKGAVCISTLEELETLKSEKENLDGELSKCKDDL--EESKNKLQETEKKLEE----LKS 636
Query: 540 EFQQKEKTIVENAAKQLKQISEKPGATKL-MLDFKAKCEVYEAENKKLKAELTQRDIKLK 598
E +++ A QLK + E + +L D +A+ + + + L+ EL +
Sbjct: 637 ELDASQES-NSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLEDELE--KERQN 693
Query: 599 ETNHLLKEESKKRELMAKKVESVMQSQLQQAIHM 632
+ K + ++ + E MQ ++ +
Sbjct: 694 HQELIAKCRELEEKIERAEQEENMQKLDEEEQKI 727
>gnl|CDD|173195 PRK14733, coaE, dephospho-CoA kinase; Provisional.
Length = 204
Score = 32.7 bits (74), Expect = 0.90
Identities = 28/143 (19%), Positives = 66/143 (46%), Gaps = 8/143 (5%)
Query: 15 ASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELRE-FYQTQLNNNQKAQDEIQT 73
AS K+ + +LN N D I + K S+ ++ E F + N Q + ++
Sbjct: 16 ASGKSTATRILKEKLNLNVVCADTISREITKKPSVIKKIAEKFGDEIVMNKQINRAMLRA 75
Query: 74 LLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL-------HREFQQKEKTI 126
++ +S K L ++ +N +K+++KE T + ++ L H ++ +K I
Sbjct: 76 IITESKEAKKWLEDYLHPVINKEIKKQVKESDTVMTIVDIPLLGPYNFRHYDYLKKVIVI 135
Query: 127 VENAAKQLKQISENEFKQKEKTI 149
+ +++++ E + K +++ +
Sbjct: 136 KADLETRIRRLMERDGKNRQQAV 158
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 32.3 bits (74), Expect = 0.91
Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 9/79 (11%)
Query: 94 NNNVKEKLKEFQTQLDSAELTL------HREFQQKEKTIVENAAKQLKQISENEFKQKEK 147
+ + L E + + A+ L E +++ I+E A K+ +QI+E + E+
Sbjct: 39 QAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEE 98
Query: 148 T---IVENAAKQLKQISEK 163
I E A +++ E+
Sbjct: 99 ELERIKEAAEAEIEAEKER 117
Score = 29.2 bits (66), Expect = 8.8
Identities = 23/134 (17%), Positives = 52/134 (38%), Gaps = 16/134 (11%)
Query: 26 QTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTEL 85
L ++ ++E Q LLA+ E RE +A + I+ ++ I E+
Sbjct: 43 ADDLAEAERLKEEAQALLAEYEQELEEARE----------QASEIIEQAKKEAEQIAEEI 92
Query: 86 REFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQK 145
+ + +L + E + + + A L E + + A++L +E
Sbjct: 93 KAEAEEELERIKEAAEAEIEAEKERALEELRAEVAE----LAVAIAEKLLGKKVDE--AA 146
Query: 146 EKTIVENAAKQLKQ 159
+K +++ +L +
Sbjct: 147 QKDLIDAFIAELGE 160
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 31.5 bits (72), Expect = 1.0
Identities = 20/94 (21%), Positives = 46/94 (48%), Gaps = 11/94 (11%)
Query: 69 DEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVE 128
+ + L ++ +K E ++EKL+E++ ++ + TL Q+ + +
Sbjct: 32 KDYEALYKENEELKEE---------IERLEEKLEEYKELEETLQKTLVVA-QETAEEVKA 81
Query: 129 NAAKQLKQISENEFKQKEKTIVENAAKQLKQISE 162
NA K+ + I E + K + IV +A ++ K+++
Sbjct: 82 NAQKEAELII-KEAEAKAERIVNDANEEAKKLAT 114
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 33.0 bits (75), Expect = 1.1
Identities = 37/175 (21%), Positives = 62/175 (35%), Gaps = 17/175 (9%)
Query: 58 QTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHR 117
Q+Q ++ +K + + + + A + Q +L KE+LK + Q + E
Sbjct: 68 QSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQA 127
Query: 118 EFQQKEKTIVEN--AAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKA 175
+ +QK++ AA+Q K+ + K + AA + K+ KA
Sbjct: 128 QLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKK--------------KA 173
Query: 176 KCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 230
+ AE K KAE K E E K E AK +
Sbjct: 174 EEAAKAAEEAKAKAEAAAAKKK-AEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEE 227
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 32.2 bits (74), Expect = 1.1
Identities = 34/187 (18%), Positives = 77/187 (41%), Gaps = 17/187 (9%)
Query: 51 TELREFYQTQLNNN----QKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQT 106
E++ +Y +NN + ++EI + + + E Q N + E LK +
Sbjct: 12 NEIKNYYNDITHNNLELIKSLKEEIAEMKKNEEHNEKLMAEI--AQENKRLVEPLK--KA 67
Query: 107 QLDSAELTLHREFQQKEKTIVENAAKQLKQIS--------ENEFKQKEKTIVENAAKQLK 158
+ + EL + +K+K ++N +LK++ E+E ++ VE +L
Sbjct: 68 EEEVEELRKKLKDYEKDKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELY 127
Query: 159 QISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKK 218
E + K + E + + L EL +++ +L E + + +++K
Sbjct: 128 DKFEA-AIQDVQQKTGLKNLLLEQKLEALNEELEKKEAQLNEVLAAANLDPAALQAVSEK 186
Query: 219 VESVMQS 225
++ V++
Sbjct: 187 LDDVLEE 193
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 33.1 bits (76), Expect = 1.2
Identities = 18/140 (12%), Positives = 53/140 (37%), Gaps = 4/140 (2%)
Query: 97 VKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQ 156
+ ++ + + + K ++ Q +Q +++ + E ++
Sbjct: 31 LAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQ-EARREREELQREEERLVQK 89
Query: 157 LKQISEKPGATKLMLDFKAKCE-VYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELM 215
+Q+ + + + + E A +L+ Q D +L L E++ R+L+
Sbjct: 90 EEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQA--RKLL 147
Query: 216 AKKVESVMQSQLQQAIHMIT 235
K +++ ++ + Q + I
Sbjct: 148 LKLLDAELEEEKAQRVKKIE 167
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural
protein GP20 sequences of around 180 residues in length.
The function of this family is unknown.
Length = 156
Score = 31.6 bits (72), Expect = 1.2
Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 6/66 (9%)
Query: 155 KQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKREL 214
+Q+ +I + G + + K+K + E LK +L RD ++++ LK++ K E
Sbjct: 2 EQIDKIMAEYGKD--IQNPKSKLDKANEERDSLKKQLKNRDKQIED----LKKKVKDNEE 55
Query: 215 MAKKVE 220
+ KK+E
Sbjct: 56 LQKKIE 61
Score = 31.6 bits (72), Expect = 1.2
Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 6/66 (9%)
Query: 554 KQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKREL 613
+Q+ +I + G + + K+K + E LK +L RD ++++ LK++ K E
Sbjct: 2 EQIDKIMAEYGKD--IQNPKSKLDKANEERDSLKKQLKNRDKQIED----LKKKVKDNEE 55
Query: 614 MAKKVE 619
+ KK+E
Sbjct: 56 LQKKIE 61
Score = 30.8 bits (70), Expect = 2.5
Identities = 25/108 (23%), Positives = 46/108 (42%), Gaps = 22/108 (20%)
Query: 517 NNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKC 576
N K KL + + DS K ++N KQ++ + +K + + + K
Sbjct: 16 QNPKSKLDKANEERDSL------------KKQLKNRDKQIEDLKKKVKDNE---ELQKKI 60
Query: 577 EVYEAENKKLKAELTQRDIKLKET--NHLLKEESKKRELMAKKVESVM 622
E + +NK K E KL +T N+ ++ KK + A+ ++V
Sbjct: 61 EKLKQQNKTAKEEYEA---KLADTKLNNAIELALKKAK--ARNAKAVK 103
Score = 30.0 bits (68), Expect = 4.0
Identities = 24/107 (22%), Positives = 47/107 (43%), Gaps = 13/107 (12%)
Query: 119 FQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCE 178
+ K I +K K + E +K + +N KQ++ + +K + + + K E
Sbjct: 8 MAEYGKDIQNPKSKLDK--ANEERDSLKKQL-KNRDKQIEDLKKKVKDNE---ELQKKIE 61
Query: 179 VYEAENKKLKAELTQRDIKLKET--NHLLKEESKKRELMAKKVESVM 223
+ +NK K E KL +T N+ ++ KK + A+ ++V
Sbjct: 62 KLKQQNKTAKEEYEA---KLADTKLNNAIELALKKAK--ARNAKAVK 103
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 32.9 bits (75), Expect = 1.2
Identities = 32/193 (16%), Positives = 61/193 (31%), Gaps = 15/193 (7%)
Query: 31 NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQ 90
N K + Q K EF L NN + L A + Y+
Sbjct: 302 NLLKKLSKEQLKTLNLEKQKLLSIEFNTNFLLNNFVSLINR--LKASVNQVFEFEIYLYK 359
Query: 91 TQLNNN--VKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAK---QLKQISENEFKQK 145
N ++ K E + + + + K+ IVE + K + + +
Sbjct: 360 IINANPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTK-IFY 418
Query: 146 EKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAEN--KKLKAELTQRDIKLKET-- 201
K + K+ E+ + + + + E + + K + +DI+L +
Sbjct: 419 HKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNML 478
Query: 202 ---NHLLKEESKK 211
N EES +
Sbjct: 479 QKFNKFKNEESAE 491
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 31.5 bits (72), Expect = 1.2
Identities = 22/109 (20%), Positives = 45/109 (41%), Gaps = 21/109 (19%)
Query: 114 TLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDF 173
T++R+ K I++ K+ ++ E E K++++ + K L Q E+
Sbjct: 2 TVYRD---KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEER---------- 48
Query: 174 KAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKK----RELMAKK 218
E E +K K + R ++ + LKE+ + + + KK
Sbjct: 49 ----EKRLEELEKAKNKPLARYADDEDYDEELKEQERWDDPMAQFLRKK 93
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 32.6 bits (75), Expect = 1.3
Identities = 27/132 (20%), Positives = 45/132 (34%), Gaps = 15/132 (11%)
Query: 14 SASIKTELREFYQTQLNNNQKAQD-----EIQTLLAKSASIKTELREFYQTQLNNNQKAQ 68
I T L E Q + Q +I L A+ A+++ E+ E Q+AQ
Sbjct: 107 EKEIDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQ 166
Query: 69 DEI----QTLLA---KSASIKTELREFYQTQLNN---NVKEKLKEFQTQLDSAELTLHRE 118
E T +A K E + Q +L + + + + Q + E
Sbjct: 167 CEAEGTGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEAQKAALERNRQAA 226
Query: 119 FQQKEKTIVENA 130
+K+ I N
Sbjct: 227 VAEKQARIEAND 238
>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
Length = 156
Score = 31.5 bits (72), Expect = 1.6
Identities = 23/99 (23%), Positives = 44/99 (44%), Gaps = 16/99 (16%)
Query: 65 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEK 124
+K + +LAK+ + + Y+ +L+ KE QL+ + Q++ K
Sbjct: 59 RKNLTKASEILAKANELTKQ----YEQELSKARKE------AQLEITQS------QKEAK 102
Query: 125 TIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEK 163
IVEN KQ ++ ++ + K + K LK + E+
Sbjct: 103 EIVENELKQAQKYIDSLLNEATKQLEAQKEKALKSLEEQ 141
Score = 30.7 bits (70), Expect = 2.8
Identities = 19/109 (17%), Positives = 48/109 (44%), Gaps = 20/109 (18%)
Query: 33 QKAQDEIQTLLAKSASIKTELREFYQTQLNN-NQKAQDEIQTLLAKSASIKTELREFYQT 91
+K + +LAK+ + + Y+ +L+ ++AQ EI Q
Sbjct: 59 RKNLTKASEILAKANELTKQ----YEQELSKARKEAQLEITQ---------------SQK 99
Query: 92 QLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEN 140
+ V+ +LK+ Q +DS ++ + +++ +++ +Q+ +S+
Sbjct: 100 EAKEIVENELKQAQKYIDSLLNEATKQLEAQKEKALKSLEEQVDTLSDQ 148
>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
module that binds membranes and detects membrane
curvature. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions including organelle
biogenesis, membrane trafficking or remodeling, and cell
division and migration. Mutations in BAR containing
proteins have been linked to diseases and their
inactivation in cells leads to altered membrane
dynamics. A BAR domain with an additional N-terminal
amphipathic helix (an N-BAR) can drive membrane
curvature. These N-BAR domains are found in amphiphysins
and endophilins, among others. BAR domains are also
frequently found alongside domains that determine lipid
specificity, such as the Pleckstrin Homology (PH) and
Phox Homology (PX) domains which are present in beta
centaurins (ACAPs and ASAPs) and sorting nexins,
respectively. A FES-CIP4 Homology (FCH) domain together
with a coiled coil region is called the F-BAR domain and
is present in Pombe/Cdc15 homology (PCH) family
proteins, which include Fes/Fes tyrosine kinases, PACSIN
or syndapin, CIP4-like proteins, and srGAPs, among
others. The Inverse (I)-BAR or IRSp53/MIM homology
Domain (IMD) is found in multi-domain proteins, such as
IRSp53 and MIM, that act as scaffolding proteins and
transducers of a variety of signaling pathways that link
membrane dynamics and the underlying actin cytoskeleton.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The I-BAR domain
induces membrane protrusions in the opposite direction
compared to classical BAR and F-BAR domains, which
produce membrane invaginations. BAR domains that also
serve as protein interaction domains include those of
arfaptin and OPHN1-like proteins, among others, which
bind to Rac and Rho GAP domains, respectively.
Length = 194
Score = 31.6 bits (72), Expect = 1.6
Identities = 33/187 (17%), Positives = 81/187 (43%), Gaps = 19/187 (10%)
Query: 51 TELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDS 110
EL + + + + +K D ++ L A + + L+E + +L + L E +
Sbjct: 3 DELEKLLKKLIKDTKKLLDSLKELPAAAEKLSEALQELGK-ELPDLSNTDLGEALEKFGK 61
Query: 111 AELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI---------VENAAKQLKQIS 161
+ L Q E+ + + LK+ + + K+ +K + A ++LK++
Sbjct: 62 IQKELEEFRDQLEQKLENKVIEPLKEYLKKDLKEIKKRRKKLDKARLDYDAAREKLKKLR 121
Query: 162 EKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVES 221
+K + + + + + + + + ++L+ EL + N L E K++EL + S
Sbjct: 122 KKKKDSSKLAEAEEELQEAKEKYEELREELIED------LNKL---EEKRKELFLSLLLS 172
Query: 222 VMQSQLQ 228
+++Q +
Sbjct: 173 FIEAQSE 179
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have
LPXTG C-terminal anchoring motifs and a substantial
number have the KxYKxGKxW putative sorting signal at the
N-terminus. The tetracycline resistance plasmid pCF10 in
Enterococcus faecalis promotes conjugal plasmid transfer
in response to sex pheromones, but PgrA/Sec10 encoded by
that plasmid, a member of this family, specifically
inhibits the ability of cells to receive homologous
plasmids. The phenomenon is called surface exclusion.
Length = 356
Score = 32.4 bits (74), Expect = 1.7
Identities = 22/100 (22%), Positives = 34/100 (34%), Gaps = 5/100 (5%)
Query: 3 AQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLN 62
Q + T A A+ +T L T AQ A A+ + EL L
Sbjct: 259 LQAALATAQADLAAAQTAL----NTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQ 314
Query: 63 NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 102
Q Q LA + + + +E LN ++ +K
Sbjct: 315 TAQNNLATAQAALANAEARLAKAKE-ALANLNADLAKKQA 353
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 32.3 bits (74), Expect = 1.7
Identities = 37/185 (20%), Positives = 73/185 (39%), Gaps = 15/185 (8%)
Query: 418 KEKIESLERNLQTE---TQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHS 474
+ +IE LE+ QT + R+L +++++ R+ E+ + + +
Sbjct: 113 EREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKK 172
Query: 475 KRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAE 534
K + + + L E + K ELR+ + + E+ E ++D
Sbjct: 173 KAREIHEKIQELANEAQEYHEEMIKLFEEADELRK-----EADELHEEFVELSKKIDE-- 225
Query: 535 LTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLK--AELTQ 592
LH EF+ + + E K++K + K A K + E E +K K +LT
Sbjct: 226 --LHEEFRNLQNELREL-EKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEKLTT 282
Query: 593 RDIKL 597
++ L
Sbjct: 283 EELLL 287
>gnl|CDD|224000 COG1074, RecB, ATP-dependent exoDNAse (exonuclease V) beta subunit
(contains helicase and exonuclease domains) [DNA
replication, recombination, and repair].
Length = 1139
Score = 32.8 bits (75), Expect = 1.7
Identities = 15/69 (21%), Positives = 26/69 (37%), Gaps = 6/69 (8%)
Query: 507 LREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGAT 566
L Q L+E T+ S E L R ++ I N +Q +++
Sbjct: 691 LTNGLHAQEL------LQELLTRALSYEHGLLRWLERFIALIENNDEEQARRLESDRDLV 744
Query: 567 KLMLDFKAK 575
++M K+K
Sbjct: 745 RIMTIHKSK 753
>gnl|CDD|180530 PRK06328, PRK06328, type III secretion system protein; Validated.
Length = 223
Score = 31.7 bits (72), Expect = 1.8
Identities = 20/82 (24%), Positives = 36/82 (43%)
Query: 81 IKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEN 140
E + + N KE K + QL E + +Q ++ +V A +K+I
Sbjct: 49 THEECEKLREEAKNQGFKEGSKAWSKQLAFLEEETQKLREQVKEALVPLAIASVKKIIGK 108
Query: 141 EFKQKEKTIVENAAKQLKQISE 162
E + +TIV A LK++++
Sbjct: 109 ELELHPETIVSIIANSLKELTQ 130
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 31.0 bits (71), Expect = 1.8
Identities = 17/106 (16%), Positives = 44/106 (41%), Gaps = 12/106 (11%)
Query: 39 IQTLLAKSA---SIKTELREFYQTQLNNNQKAQDEIQTLLAK--------SASIKTELRE 87
+Q +L +S + + +L + ++ + +K + E+Q L K S + + + +
Sbjct: 6 VQKILQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEK 65
Query: 88 FYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQ 133
Q + + K ++ Q L + ++ K ++ AK+
Sbjct: 66 ELQ-KKVQEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEVAKK 110
Score = 29.1 bits (66), Expect = 7.6
Identities = 21/85 (24%), Positives = 45/85 (52%), Gaps = 8/85 (9%)
Query: 486 LQKAQDEIQTLLAK----SASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREF 541
L+K + E+Q L K +A++ RE + +L +K++EFQ + + L +
Sbjct: 34 LEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQ----KKVQEFQRKQQKLQQDLQKRQ 89
Query: 542 QQKEKTIVENAAKQLKQISEKPGAT 566
Q++ + I++ K +K++++K G
Sbjct: 90 QEELQKILDKINKAIKEVAKKKGYD 114
>gnl|CDD|239165 cd02764, MopB_PHLH, The MopB_PHLH CD includes a group of related
uncharacterized putative hydrogenase-like homologs (PHLH)
of molybdopterin binding (MopB) proteins. This CD is of
the PHLH region homologous to the catalytic
molybdopterin-binding subunit of MopB homologs.
Length = 524
Score = 32.5 bits (74), Expect = 2.0
Identities = 39/217 (17%), Positives = 67/217 (30%), Gaps = 26/217 (11%)
Query: 1048 AEEADLILLI--GTNPRFEAPLFNARIRKGYLTNELDVAYIGPKVDLRYDYEHLGESADL 1105
AD+ L I F L + I+KG ++ D + +L
Sbjct: 247 GANADVRLAIRPSQEKAFALGLAHKLIKKGAGSSLPD----------FFRALNLAFKPAK 296
Query: 1106 IKQLASGSHPF----SKKLSAAKKPLIVVGADISDGAAVLALVQQLAAKVTCESDVPCDW 1161
+ +L +K L+AA K L+V G+++S A V A +D
Sbjct: 297 VAELTVDLDKALAALAKALAAAGKSLVVAGSELSQTAGADTQVAVNALNSLLGNDG---- 352
Query: 1162 KVLNILQKAASQVAALDIGYKPGTSAIREKPPKVLFLLGAD------EGSISRDDVGKDC 1215
K ++ + K S I L + + +G + K
Sbjct: 353 KTVDHARPIKGGELGNQQDLKALASRINAGKVSALLVYDVNPVYDLPQGLGFAKALEKVP 412
Query: 1216 FIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEG 1252
+ G D A + D + P + E +G
Sbjct: 413 LSVSFGDRLDETAMLCDWVAPMSHGLESWGDAETPDG 449
>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO. This family
contains the bacterial type III secretion protein YscO,
which is approximately 150 residues long. YscO has been
shown to be required for high-level expression and
secretion of the anti-host proteins V antigen and Yops
in Yersinia pestis.
Length = 152
Score = 30.9 bits (70), Expect = 2.1
Identities = 32/153 (20%), Positives = 57/153 (37%), Gaps = 21/153 (13%)
Query: 3 AQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLN 62
A+ ++ + RE Q+ + Q Q E L A+ L+E
Sbjct: 16 AEKAVKRQEQALVQARARHREAQQSLQDYRQWRQQEENRLFAELQGATVLLKEL------ 69
Query: 63 NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQK 122
+K Q ++ L AS++ +L E K++L+ + +L A L + +
Sbjct: 70 --EKWQQQVGLLRENEASLEQQLAE---------AKQRLEAERQRLRQARQQLQEARKAQ 118
Query: 123 EKTI----VENAAKQLKQISENEFKQKEKTIVE 151
EK E A Q ++ E +Q+E V
Sbjct: 119 EKFAELARQEQAEAQAQRQYLEELEQEEFRTVS 151
>gnl|CDD|131471 TIGR02418, acolac_catab, acetolactate synthase, catabolic.
Acetolactate synthase (EC 2.2.1.6) combines two molecules
of pyruvate to yield 2-acetolactate with the release of
CO2. This reaction may be involved in either valine
biosynthesis (biosynthetic) or conversion of pyruvate to
acetoin and possibly to 2,3-butanediol (catabolic). The
biosynthetic type, described by TIGR00118, is also
capable of forming acetohydroxybutyrate from pyruvate and
2-oxobutyrate for isoleucine biosynthesis. The family
described here, part of the same larger family of
thiamine pyrophosphate-dependent enzymes (pfam00205,
pfam02776) is the catabolic form, generally found
associated with in species with acetolactate
decarboxylase and usually found in the same operon. The
model may not encompass all catabolic acetolactate
synthases, but rather one particular clade in the larger
TPP-dependent enzyme family [Energy metabolism,
Fermentation].
Length = 539
Score = 32.0 bits (73), Expect = 2.3
Identities = 45/212 (21%), Positives = 86/212 (40%), Gaps = 40/212 (18%)
Query: 977 EDALIAVAQKLQTSE----VAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAF--PLEG 1030
+DA+ VA+ +Q ++ + G+ S E A++ LL K T+ +G
Sbjct: 182 DDAIDEVAEAIQNAKLPVLLLGLRAS--SPETTEAVRRLLKK-------TQLPVVETFQG 232
Query: 1031 AGTDLRANY--------LLNNKIAGA--EEADLILLIGTNP-RFEAPLFNARIRKGYLTN 1079
AG R L N+ ++ADL++ IG +P +E +N+ +
Sbjct: 233 AGAVSRELEDHFFGRVGLFRNQPGDRLLKQADLVITIGYDPIEYEPRNWNSENDATIVHI 292
Query: 1080 ELDVAYIG----PKVDLRYDYEHLGESADLIKQ------LASGSHPFSKKLSAAKKPLIV 1129
+++ A I P ++L D + + DL+ + L + + L ++ L
Sbjct: 293 DVEPAQIDNNYQPDLELVGD---IASTLDLLAERIPGYELPPDALAILEDLKQQREALDR 349
Query: 1130 VGADI-SDGAAVLALVQQLAAKVTCESDVPCD 1160
V A + L +++ + A VT + V D
Sbjct: 350 VPATLKQAHLHPLEIIKAMQAIVTDDVTVTVD 381
>gnl|CDD|218791 pfam05879, RHD3, Root hair defective 3 GTP-binding protein (RHD3).
This family consists of several eukaryotic root hair
defective 3 like GTP-binding proteins. It has been
speculated that the RHD3 protein is a member of a novel
class of GTP-binding proteins that is widespread in
eukaryotes and required for regulated cell enlargement.
The family also contains the homologous yeast synthetic
enhancement of YOP1 (SEY1) protein which is involved in
membrane trafficking.
Length = 743
Score = 32.0 bits (73), Expect = 2.3
Identities = 38/216 (17%), Positives = 74/216 (34%), Gaps = 33/216 (15%)
Query: 416 ISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK 475
I+ E E ++ R+L+E + + K SI E+ S +
Sbjct: 259 IANEVFEEFITAYDPLEEKWRELKEAGE--GGLLGGLGAKLASI------REKCLSEYDT 310
Query: 476 RSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAEL 535
+ + V +K Q E++ + + L+ +Q QL K L+ F+ + SA
Sbjct: 311 EASRYNKGVYQEKRQ-ELEE------KLDSHLKPTFQAQLGALHKGLLESFKEAVSSA-- 361
Query: 536 TLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDI 595
A + + + G TK + F +CE E R
Sbjct: 362 --------------LKAGEGFAESVKD-GKTKCVEKFDEECESLAIEQATWSNSDKIRLK 406
Query: 596 KLKE-TNHLLKEESKKRELMAKKVESVMQSQLQQAI 630
K+ H + ++ + + + E + S L + +
Sbjct: 407 LEKDIDAHSARLRKEELKELTNRYEKKLVSALSEPV 442
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 32.1 bits (74), Expect = 2.6
Identities = 30/145 (20%), Positives = 64/145 (44%), Gaps = 17/145 (11%)
Query: 90 QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
+ + KE L E + ++ +E +++ + + K+L Q EN ++ E
Sbjct: 48 KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL-QKLEKRLLQKEENLDRKLEL-- 104
Query: 150 VENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEES 209
+E ++L++ ++ + + + K E E +L E Q +L+ + L EE+
Sbjct: 105 LEKREEELEKKEKE--LEQKQQELEKKEEELE----ELIEEQLQ---ELERISGLTAEEA 155
Query: 210 KKRELMAKKVESVMQSQLQQAIHMI 234
K+ L +KVE + +A +I
Sbjct: 156 KEILL--EKVEEEAR---HEAAVLI 175
Score = 31.7 bits (73), Expect = 3.1
Identities = 27/143 (18%), Positives = 63/143 (44%), Gaps = 13/143 (9%)
Query: 97 VKEKLKEFQTQLDSAELTLHREFQQKEKTI-VENAAKQLKQISENEFKQKEKTIVENAAK 155
+KE +E + L+ A+ E +KE + + +L+ E E +++ + K
Sbjct: 33 IKEAEEEAKRILEEAKKEA--EAIKKEALLEAKEEIHKLRNEFEKELRERRNEL----QK 86
Query: 156 QLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELM 215
K++ +K + LD K E+ E ++L+ + + + K +E +E + E
Sbjct: 87 LEKRLLQK----EENLDRKL--ELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQ 140
Query: 216 AKKVESVMQSQLQQAIHMITSDV 238
+++E + ++A ++ V
Sbjct: 141 LQELERISGLTAEEAKEILLEKV 163
Score = 30.9 bits (71), Expect = 5.6
Identities = 31/146 (21%), Positives = 65/146 (44%), Gaps = 29/146 (19%)
Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
L+ + +++ LE+ L Q+ L+ +L+ + EE+ +KE LE+ +
Sbjct: 77 LRERRNELQKLEKRLL---QKEENLDRKLELLEKREEELEKKEKE-------LEQKQQEL 126
Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSA 533
K+ + + L+ ++++Q L S E +E L V+E+ + ++A
Sbjct: 127 EKKEEELEELI-----EEQLQELERISGLTAEEAKE----ILLEKVEEEARH-----EAA 172
Query: 534 ELTLHREFQQKEKTIVENAAKQLKQI 559
L ++ E+ E A K+ K+I
Sbjct: 173 VL-----IKEIEEEAKEEADKKAKEI 193
Score = 30.1 bits (69), Expect = 9.7
Identities = 9/48 (18%), Positives = 24/48 (50%)
Query: 410 RSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEV 457
+ L E +E E L+ + + + ++EL+ + EE++ +++
Sbjct: 94 KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQL 141
>gnl|CDD|219397 pfam07393, Sec10, Exocyst complex component Sec10. This family
contains the Sec10 component (approximately 650 residues
long) of the eukaryotic exocyst complex, which
specifically affects the synthesis and delivery of
secretory and basolateral plasma membrane proteins.
Length = 685
Score = 31.9 bits (73), Expect = 2.6
Identities = 32/150 (21%), Positives = 53/150 (35%), Gaps = 30/150 (20%)
Query: 408 TVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVT------ 461
R ++S + + L + +EE LQ VV++E +I+
Sbjct: 159 IDRDNVDEVSGLDNDEIWEKLADPDEHPSIVEESLQALFSEVVVVVKEEAAIIKRVFPNP 218
Query: 462 ---VTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTE--LREFYQT--Q 514
+ F++R V Q Q ++ LL K+ SI LR + Q
Sbjct: 219 EVVMQKFIQR---------------VFAQVIQQRLEALLDKAKSISQLAYLRSLHSLYSQ 263
Query: 515 LNNNVKEKLKEF-QTQLDSAELTLHREFQQ 543
VK+ LKEF T+ L + +
Sbjct: 264 TLKLVKD-LKEFLSTENPDLSAFLDQLTED 292
>gnl|CDD|184804 PRK14720, PRK14720, transcript cleavage factor/unknown domain
fusion protein; Provisional.
Length = 906
Score = 32.0 bits (73), Expect = 2.7
Identities = 28/101 (27%), Positives = 40/101 (39%), Gaps = 15/101 (14%)
Query: 531 DSAELTLHREFQQKEK-TIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAE 589
DS L L F Q K IVE+ ++ E KL L A+ ENKKLK
Sbjct: 83 DSNLLNLIDSFSQNLKWAIVEHICDKILLYGEN----KLALRTLAEAYAKLNENKKLKGV 138
Query: 590 LTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAI 630
+ L+K + E++ K S + ++AI
Sbjct: 139 WER----------LVKADRDNPEIVKKLATSYEEEDKEKAI 169
Score = 31.3 bits (71), Expect = 5.2
Identities = 24/91 (26%), Positives = 36/91 (39%), Gaps = 15/91 (16%)
Query: 142 FKQKEK-TIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKE 200
F Q K IVE+ ++ E KL L A+ ENKKLK +
Sbjct: 93 FSQNLKWAIVEHICDKILLYGEN----KLALRTLAEAYAKLNENKKLKGVWER------- 141
Query: 201 TNHLLKEESKKRELMAKKVESVMQSQLQQAI 231
L+K + E++ K S + ++AI
Sbjct: 142 ---LVKADRDNPEIVKKLATSYEEEDKEKAI 169
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 31.9 bits (72), Expect = 2.7
Identities = 34/233 (14%), Positives = 77/233 (33%), Gaps = 37/233 (15%)
Query: 419 EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSK 478
E + L +E ++ L+E++Q+ +Q+ ++
Sbjct: 542 TSEEDVYHQLTSERKQRASLKEQMQEIQQS--------------------FSILTQCDNR 581
Query: 479 NADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQL--------NNNVKEKLKEFQTQL 530
+ + + LQ +Q L K + + L L +V+ L++ +L
Sbjct: 582 SKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQEL 641
Query: 531 DSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAEL 590
LH + V A ++ + ++ A++ + K + E + K L
Sbjct: 642 ALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQL--TYWKEML 699
Query: 591 TQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHVL 643
Q L+E ++E ++ + + S + S L + L
Sbjct: 700 AQCQTLLRELETHIEEYDREFNEI-ENASSSLGSDLAA------REDALNQSL 745
Score = 30.7 bits (69), Expect = 7.8
Identities = 34/222 (15%), Positives = 73/222 (32%), Gaps = 28/222 (12%)
Query: 419 EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSK 478
++ Q + + L E LQ T+Q+ +L + R + ++ K
Sbjct: 212 CMPDTYHERKQVLEKELKHLREALQQTQQSHA--------------YLTQKREAQEEQLK 257
Query: 479 NADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQ---TQLDSAEL 535
L +L+ +E++ A + + + L TQ++
Sbjct: 258 KQQLLKQLRARIEELRAQEAVLEETQERINRARK-------AAPLAAHIKAVTQIEQQAQ 310
Query: 536 TLHREFQQKEK---TIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ 592
+H E Q K + ++ A +KQ S +L+ ++ ++ +
Sbjct: 311 RIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREI 370
Query: 593 RDIKLKETNHLLKEESKKRELMAK-KVESVMQSQLQQAIHMI 633
+ T H+ + +K L K + LQ+ I
Sbjct: 371 SCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATI 412
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 32.1 bits (73), Expect = 2.8
Identities = 28/175 (16%), Positives = 69/175 (39%), Gaps = 24/175 (13%)
Query: 69 DEIQTLLAKSASIKTELREFY---------QTQLNNNVKEKLKEFQTQLDSAELTLHREF 119
+EI L+ K ++++ EL + Q +L ++++++E + + + +L L +E
Sbjct: 511 EEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEA 570
Query: 120 QQKEKTIVENAAKQLKQISENE------------FKQKEKTIVENAAKQLKQISEKPGAT 167
Q+ K + + ++++ E + + ++T + K ++K G
Sbjct: 571 QEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDK 630
Query: 168 KLMLDFKAKCEVYE--AENKKLKAELTQR-DIKLKETNHLLKEESKKRELMAKKV 219
+ F K ++ + NK R + E + K K+ + K
Sbjct: 631 VRIRYFGQKGKIVQILGGNKWNVTVGGMRMKVHGSELEKINKAPPPKKFKVPKTT 685
Score = 31.7 bits (72), Expect = 3.7
Identities = 24/141 (17%), Positives = 53/141 (37%), Gaps = 10/141 (7%)
Query: 487 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ---- 542
++ + E++ L + + K EL + Q L KE ++ +L ++ +E +
Sbjct: 546 KELEQEMEELKERERNKKLELEKEAQEALKALKKE-VESIIRELKEKKIHKAKEIKSIED 604
Query: 543 --QKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYE--AENKKLKAELTQR-DIKL 597
+ ++T + K ++K G + F K ++ + NK R +
Sbjct: 605 LVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQKGKIVQILGGNKWNVTVGGMRMKVHG 664
Query: 598 KETNHLLKEESKKRELMAKKV 618
E + K K+ + K
Sbjct: 665 SELEKINKAPPPKKFKVPKTT 685
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 31.0 bits (70), Expect = 2.9
Identities = 25/155 (16%), Positives = 60/155 (38%), Gaps = 11/155 (7%)
Query: 404 LYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVT 463
+ + + K+ + + R +EE + T++ ++V + + IV +
Sbjct: 58 IVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEID 117
Query: 464 LFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKL 523
E + + L+ L Q+ ++ L K K L E +++L
Sbjct: 118 ELQEFIDELLKSDKEILPLLLYLG--QERLEELKFKRKEKKERLEEL---------EKEL 166
Query: 524 KEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQ 558
+E + + D E L + ++KE ++ ++L +
Sbjct: 167 EELEDEKDLLEKLLEEKEKKKELEELKEELEELLE 201
Score = 29.8 bits (67), Expect = 7.3
Identities = 25/130 (19%), Positives = 52/130 (40%), Gaps = 10/130 (7%)
Query: 30 NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFY 89
NN K + + + K + + ++ + Q+ I LL KS L +
Sbjct: 82 NNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELL-KSDKEILPLLLYL 140
Query: 90 QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
+ +K K KE + +L+ E L +K+ L+++ E + K+KE
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDL---------LEKLLEEKEKKKELEE 191
Query: 150 VENAAKQLKQ 159
++ ++L +
Sbjct: 192 LKEELEELLE 201
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 31.8 bits (73), Expect = 3.0
Identities = 25/143 (17%), Positives = 60/143 (41%), Gaps = 25/143 (17%)
Query: 115 LHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFK 174
L + +K+ E AK++ + ++ E + +K + A +++ ++ +
Sbjct: 18 LRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAE----------- 66
Query: 175 AKCEVYEAENKKLKAELTQRDIKLKETNHLLKEES--KKRELMAKKVESVMQSQLQQAIH 232
E E K+ + EL + + +L + +EE+ +K E + KK E++ + ++ +
Sbjct: 67 -----LERELKERRNELQRLERRLLQ-----REETLDRKMESLDKKEENLEKK--EKELS 114
Query: 233 MITSDVTHLHTSKESLATRQPSE 255
++ E L Q E
Sbjct: 115 NKEKNLDEKEEELEELIAEQREE 137
>gnl|CDD|236272 PRK08475, PRK08475, F0F1 ATP synthase subunit B; Validated.
Length = 167
Score = 30.8 bits (70), Expect = 3.0
Identities = 26/112 (23%), Positives = 53/112 (47%), Gaps = 7/112 (6%)
Query: 49 IKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTE-LREFYQTQLNNNVKEKLKEFQTQ 107
L+ FY++++N K +EIQ L +S K + L++ + KEK +
Sbjct: 43 AAKPLKNFYKSRINKISKRLEEIQEKLKESKEKKEDALKKLEE------AKEKAELIVET 96
Query: 108 LDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQ 159
L ++ +++ K +EN K +++ E E ++ E+ +VE +L +
Sbjct: 97 AKKEAYILTQKIEKQTKDDIENLIKSFEELMEFEVRKMEREVVEEVLNELFE 148
>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
Length = 568
Score = 31.6 bits (72), Expect = 3.0
Identities = 16/88 (18%), Positives = 26/88 (29%)
Query: 416 ISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK 475
+K + + + S L ++T E ++ T R SHSK
Sbjct: 449 ELSKKKTKIHKLNKLRHDLSSLLSSIPEETSDRVESEKASKLRSSASTKLNLRSSRSHSK 508
Query: 476 RSKNADSLVRLQKAQDEIQTLLAKSASI 503
+ + K Q LA S
Sbjct: 509 FRDHLNGSNSSTKELSLNQVDLAGSERK 536
Score = 30.5 bits (69), Expect = 7.0
Identities = 26/195 (13%), Positives = 71/195 (36%), Gaps = 12/195 (6%)
Query: 76 AKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLK 135
+++ SIK +++ + + ++E + E+ + RE Q ++ + ++
Sbjct: 331 SRAKSIKNKIQVNSSSDSSREIEEIKFDLSEDRSEIEILVFREQSQLSQSSLSGIFAYMQ 390
Query: 136 QISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRD 195
+ + K + L S G + K + Y++ + L+ E+ +
Sbjct: 391 SLKKETETLKSRI-------DLIMKSIISGTFERKKLLKEEGWKYKSTLQFLRIEIDRLL 443
Query: 196 ----IKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATR 251
+L + + + +K R ++ + S+ + + S + ++K +L
Sbjct: 444 LLREEELSKKKTKIHKLNKLRHDLSSLLSSIPEETSDRVESEKASKLRSSASTKLNLR-S 502
Query: 252 QPSEQSLIDSTTDTN 266
S D +N
Sbjct: 503 SRSHSKFRDHLNGSN 517
>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
Validated.
Length = 535
Score = 31.5 bits (72), Expect = 3.0
Identities = 22/127 (17%), Positives = 46/127 (36%), Gaps = 17/127 (13%)
Query: 108 LDSAELTLHREFQQKEKTIVENAAK--------------QLKQISENEFKQKEKTIVENA 153
DS LT Q ++K ++ K ++K + + ++
Sbjct: 410 FDSNTLTWESNAQGEDKELLRKGFKIIKEIVDEVFGKGAKIKIQKALKSAENPLQSLKEF 469
Query: 154 AKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRE 213
++ +T ML+ +AK + E + +LK ++ + + L EE+ K
Sbjct: 470 KPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQLKEATELQEFMINNED--LIEEA-KEL 526
Query: 214 LMAKKVE 220
K +E
Sbjct: 527 FGIKSIE 533
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 31.7 bits (72), Expect = 3.1
Identities = 49/216 (22%), Positives = 96/216 (44%), Gaps = 43/216 (19%)
Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK-- 475
E+I L++ L+ +R+ +LE +L+ Q E+ ++ V + L+ SK
Sbjct: 480 NERITELQQLLEDANRRNNELETQLRLANQRILELQQQ------VEDLQKALQEQGSKSE 533
Query: 476 -----RSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQL 530
+SK + L +L +A +E+Q K E E + + N+ K+ E + L
Sbjct: 534 DSSLLKSKLEEHLEQLHEANEELQK--------KREQIEELEPDQDQNLSRKIAELEAAL 585
Query: 531 DSAELTLHREFQQKEKTIVENAAKQLKQISEK--PGATKLMLDFKAKCEVYEAENKKLKA 588
+ + R +++ K VE A + +K + K P + ++ L LK
Sbjct: 586 QKKDEDM-RAMEERYKKYVEKAREVIKTLDPKQNPASPEIQL---------------LKK 629
Query: 589 ELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQS 624
+LT+RD ++ L+ E +K + M ++ E ++ S
Sbjct: 630 QLTERDKRI----RHLESEYEKAKPMREQEEKLIVS 661
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 31.4 bits (71), Expect = 3.3
Identities = 15/76 (19%), Positives = 33/76 (43%)
Query: 143 KQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETN 202
K+ + +V K L+++ E + + K K E + E ++L EL + + + +E
Sbjct: 110 KELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQ 169
Query: 203 HLLKEESKKRELMAKK 218
LK + + +
Sbjct: 170 ERLKRLEVENSRLEEM 185
Score = 30.3 bits (68), Expect = 7.4
Identities = 14/73 (19%), Positives = 31/73 (42%)
Query: 545 EKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLL 604
+ +V K L+++ E + + K K E + E ++L EL + + + +E L
Sbjct: 113 KNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERL 172
Query: 605 KEESKKRELMAKK 617
K + + +
Sbjct: 173 KRLEVENSRLEEM 185
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 30.6 bits (70), Expect = 3.6
Identities = 25/117 (21%), Positives = 50/117 (42%), Gaps = 13/117 (11%)
Query: 478 KNADSLVRLQ-KAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELT 536
+N + L RL+ K + + L +K E + +L +K++LKE + +L
Sbjct: 61 RNKE-LKRLKLKYTNTVHIL----THLK-EKLHMLEEELER-LKKELKEKEERLAKLRKE 113
Query: 537 LHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDF---KAKCEVYEAENKKLKAEL 590
L++ + E+ + K+L+Q L+ D+ K E +KL+ +
Sbjct: 114 LYKV--KLERDKLRKQNKKLRQQGGLLHVPALLYDYDKTVEKVEEKRKSVEKLRRKH 168
>gnl|CDD|181337 PRK08266, PRK08266, hypothetical protein; Provisional.
Length = 542
Score = 31.5 bits (72), Expect = 3.6
Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 1118 KKLSAAKKPLIVVGADISD-GAAVLALVQQLAAKVT 1152
++AAK P+I VG + G + L + L A V
Sbjct: 200 ALIAAAKNPMIFVGGGAAGAGEEIRELAEMLQAPVV 235
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is
thought to translocate protons through membrane (inner
membrane in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1)
subunits. This domain should not be confused with the ab
CF(1) proteins (in the head of the ATP synthase) which
are found in pfam00006.
Length = 132
Score = 29.9 bits (68), Expect = 3.8
Identities = 16/68 (23%), Positives = 35/68 (51%), Gaps = 4/68 (5%)
Query: 95 NNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENA---AKQLKQISENEFKQKEKTIVE 151
+E+LK+ L AE + + + I+ NA A++LK+ E ++ + ++E
Sbjct: 40 KEAEERLKQAAALLAEAE-QQLAQARAEASEIINNAKKEAQKLKEEILAEAQKDAERLLE 98
Query: 152 NAAKQLKQ 159
+A +++Q
Sbjct: 99 SARAEIEQ 106
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 31.2 bits (72), Expect = 3.9
Identities = 13/72 (18%), Positives = 35/72 (48%), Gaps = 12/72 (16%)
Query: 466 LERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKE 525
LE L++ + SK + + ++ ++ + L+A+ +K E++ ++ +L E
Sbjct: 44 LEELQAERNALSK---EIGQAKRKGEDAEALIAEVKELKEEIKA---------LEAELDE 91
Query: 526 FQTQLDSAELTL 537
+ +L+ L +
Sbjct: 92 LEAELEELLLRI 103
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 31.3 bits (71), Expect = 4.0
Identities = 17/92 (18%), Positives = 37/92 (40%), Gaps = 6/92 (6%)
Query: 2 KAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQL 61
K++ + + LLA+ E R Q + + ++ + +I+ E E Y+ +
Sbjct: 327 KSKQKFEPLLAQYGESFQEWRSKLQAL---AKVESRSLLSVEERLPTIEKEDLELYEQRQ 383
Query: 62 NNNQKAQDEIQTLLAKSASIKTELREFYQTQL 93
Q E + L + ++ + E YQ +
Sbjct: 384 ---QLWFFENRKLWQRKKKLREQADEEYQQRH 412
Score = 30.9 bits (70), Expect = 5.3
Identities = 20/103 (19%), Positives = 42/103 (40%), Gaps = 16/103 (15%)
Query: 29 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREF 88
L +K++ + + LLA+ E R Q + + ++ + +I+ E E
Sbjct: 322 LEEVEKSKQKFEPLLAQYGESFQEWRSKLQAL---AKVESRSLLSVEERLPTIEKEDLEL 378
Query: 89 YQTQL------NNNVKEKLKEFQTQLDSAELTLHREFQQKEKT 125
Y+ + N + ++ K+ + Q D E+QQ+ T
Sbjct: 379 YEQRQQLWFFENRKLWQRKKKLREQAD-------EEYQQRHAT 414
>gnl|CDD|239166 cd02765, MopB_4, The MopB_4 CD includes a group of related
uncharacterized bacterial and archaeal
molybdopterin-binding oxidoreductase-like domains with a
putative N-terminal iron-sulfur [4Fe-4S] cluster binding
site and molybdopterin cofactor binding site. These
members belong to the molybdopterin_binding (MopB)
superfamily of proteins.
Length = 567
Score = 31.3 bits (71), Expect = 4.4
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 3/34 (8%)
Query: 958 RLLTPFVR---NCDGQLVATEWEDALIAVAQKLQ 988
RL P R +G+ W++AL +A KL
Sbjct: 55 RLKYPMKRVGERGEGKFERITWDEALDTIADKLT 88
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 31.1 bits (70), Expect = 4.7
Identities = 38/201 (18%), Positives = 71/201 (35%), Gaps = 41/201 (20%)
Query: 51 TELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDS 110
+ L E Y+ +L+ N + + ++ L + E E +E+ K + +LD+
Sbjct: 96 STLAENYERELDRNLELEVRLKALEELEKKAENEAAE---------AEEEAKLLKDKLDA 146
Query: 111 AELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLM 170
L L E + + K E+ ++ +SE + +NA +LK +
Sbjct: 147 ESLKLQNEKEDQLKEAKESISRIKNDLSE------MQCRAQNADTELKLL---------- 190
Query: 171 LDFKAKCEVYEAENKKLKAELT-QRDIKLKE-TNHLLKEESKKRELMAKKVESVMQSQLQ 228
+ EL Q + KE K +S E + SV L+
Sbjct: 191 --------------ESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLE 236
Query: 229 QAIHMITSDVTHLHTSKESLA 249
+ + D + + KE L
Sbjct: 237 EELKRYEQDAEVVKSMKEQLL 257
>gnl|CDD|218956 pfam06248, Zw10, Centromere/kinetochore Zw10. Zw10 and rough deal
proteins are both required for correct metaphase
check-pointing during mitosis. These proteins bind to
the centromere/kinetochore.
Length = 593
Score = 31.2 bits (71), Expect = 4.7
Identities = 72/377 (19%), Positives = 139/377 (36%), Gaps = 73/377 (19%)
Query: 39 IQTLLAKSASIKTELREF----YQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLN 94
I LL+K K E+R+ Y L N A+D LL + +S+ ++ + +++++
Sbjct: 16 ISKLLSKVEEYKEEVRDMINKKYSDFLPNMSSAED----LLEQVSSLSEDISDLLKSEVS 71
Query: 95 NNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAA 154
N + + L + EL + Q +EK + ++L +I E ++ + +
Sbjct: 72 NEILQDLNDAVE-----ELAELKR-QLEEKILGLKILQKLLKIHE-ALEETNEALSGKDY 124
Query: 155 KQLKQISEKPGATKLMLDFKAKCEVYEAEN--KKLKAELT-QRDI--------------- 196
+ + EK K +LD K E K L+ E T Q
Sbjct: 125 LKAADLLEK---AKSLLD-GLKSRKGEELEIYKSLRLEYTVQSQNLLYHLGEEWDRLVQW 180
Query: 197 KLKETNHLL-KEESKKRELMAKKVESVMQ-SQLQQAIHMITSDVTHLHTSKESL------ 248
KL + L + +KK L K ES + S + QA+ ++ L + L
Sbjct: 181 KLPSSKELGSAQNTKKTTLHLSKDESQGELSSVLQALAVLGELEYKLKKFCQFLLKNVFE 240
Query: 249 -ATRQPSEQSLIDSTTDTNCPVTQQYVPSSNQQYAPSSNQQYS--------IPTSNIN-- 297
+P ++ D +T Y + + P + +S + ++
Sbjct: 241 PLIVKPVLVVCVEDQKDGTVQITLSYESKTTKLLRPDPKKVFSKIRLVLKTLSKMLLSVS 300
Query: 298 -QQYSMFSADSTHM----------NIVERDIPNSADDMRKYIEM--QYSMFSADSTHMNI 344
S+F + + + + IP ++ ++++Y E+ F A MN
Sbjct: 301 VDDCSLFEILGDLIWERLSELLIKDCLVKSIPETSSELQEYSEVIEDTEEFEAALKEMNF 360
Query: 345 VERDIPNSADDMRKYIE 361
++ D A D+ KY
Sbjct: 361 LDGD----ATDLLKYAR 373
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 31.0 bits (71), Expect = 4.9
Identities = 31/148 (20%), Positives = 71/148 (47%), Gaps = 10/148 (6%)
Query: 419 EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRK-EVSIVTVTLFLERLRSSHSKRS 477
E++ + L+ + R ++EE L + RK V+I + +L++++ ++
Sbjct: 283 EELRAYLDELEFDPNRLEEVEERLFALKSLA----RKYGVTIEDLLEYLDKIKEELAQLD 338
Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
+ +SL L+K +++ L ++A + +R+ +L V +LK ++ A T+
Sbjct: 339 NSEESLEALEKEVKKLKAELLEAAEALSAIRKKAAKELEKEVTAELKAL--AMEKARFTV 396
Query: 538 HREFQQKEKTIVENAAKQLK-QISEKPG 564
E + E++ + A +++ IS PG
Sbjct: 397 --ELKPLEESPTADGADKVEFLISTNPG 422
>gnl|CDD|153322 cd07638, BAR_ACAP2, The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP
with Coiled-coil, ANK repeat and PH domain containing
protein 2. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions. ACAP2 (ArfGAP with
Coiled-coil, ANK repeat and PH domain containing protein
2), also called centaurin beta-2, is an Arf6-specific
GTPase activating protein (GAP) which mediates Arf6
signaling. Arf6 is involved in the regulation of
endocytosis, phagocytosis, cell adhesion and migration.
ACAP2 contains an N-terminal BAR domain, followed by a
Pleckstrin homology (PH) domain, an Arf GAP domain, and
C-terminal ankyrin (ANK) repeats. BAR domains form
dimers that bind to membranes, induce membrane bending
and curvature, and may also be involved in
protein-protein interactions.
Length = 200
Score = 30.4 bits (68), Expect = 5.0
Identities = 26/105 (24%), Positives = 48/105 (45%), Gaps = 4/105 (3%)
Query: 465 FLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 524
F+ +R SK+A L K D +Q ++ + + + + QL VKE L+
Sbjct: 42 FMNGIRDLAQYSSKDAVIETSLTKFSDTLQEMINYHTILFDQAQRSIKAQLQTFVKEDLR 101
Query: 525 EFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLM 569
+F+ A+ + ++KE +V+NA Q + E AT ++
Sbjct: 102 KFK----DAKKQFDKVSEEKENALVKNAQVQRNKQHEVEEATNIL 142
>gnl|CDD|132072 TIGR03027, pepcterm_export, putative polysaccharide export protein,
PEP-CTERM sytem-associated. This protein family belongs
to the larger set of polysaccharide biosynthesis/export
proteins described by pfam02563. Members of this family
are variable in either containing of lacking a
78-residue insert, but appear to fall within a single
clade, nevertheless, where the regions in which the gene
is found encode components of the PEP-CTERM/EpsH
proposed exosortase protein sorting system [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 165
Score = 30.0 bits (68), Expect = 5.0
Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 7/54 (12%)
Query: 415 KISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLER 468
KI+ + L + +T TQ +R +EE+L + VR V V VT F+
Sbjct: 30 KITTPLVGDLVASGKTPTQLARDIEEKLA-------KYVRNPVVTVIVTGFIGP 76
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 31.1 bits (70), Expect = 5.0
Identities = 56/216 (25%), Positives = 94/216 (43%), Gaps = 14/216 (6%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
++A+ E++ L K E + Q EI ++A AS+ EL E
Sbjct: 212 KEAEKELERLKQDLDPEKD--LEKLAEASAELESLQKEIS-IMASVASVLKELEEAKA-- 266
Query: 61 LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
N +KA +E ++L S+K EL E + + ++EK E + S E L+R
Sbjct: 267 --NLEKAAEEEKSLRNLVESLKQELEE--EKKELEELREKEGEAEEAASSLEAELNRTKS 322
Query: 121 QKEKTIVENAA---KQLKQIS-ENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAK 176
+KE E +L+Q S E E +KE ++LK+ +E+ A + + +
Sbjct: 323 EKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAELRLE 382
Query: 177 CEVYEAENKKLKAELTQRDIK-LKETNHLLKEESKK 211
+ EAE K L +IK L+E+ K +S +
Sbjct: 383 AALKEAEAAKAAEALALAEIKALQESEESAKADSPR 418
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 31.0 bits (70), Expect = 5.1
Identities = 29/116 (25%), Positives = 60/116 (51%), Gaps = 18/116 (15%)
Query: 121 QKEKTIVENAAKQ---LKQISE--NEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKA 175
++E + E+A+K+ L++I E E KQ + N + +++S L + K
Sbjct: 17 REESLLQESASKEAYYLQRILELQAELKQLRAE-LSNVQAENERLS------SLSQELKE 69
Query: 176 KCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRE---LMAKKVESVMQSQLQ 228
+ E+ E + +L+ E+ ++ K +E LL++ S+ E + K+V + QSQ++
Sbjct: 70 ENEMLELQRGRLRDEI--KEYKFREA-RLLQDYSELEEENISLQKQVSVLRQSQVE 122
>gnl|CDD|234956 PRK01490, tig, trigger factor; Provisional.
Length = 435
Score = 30.9 bits (71), Expect = 5.2
Identities = 31/179 (17%), Positives = 68/179 (37%), Gaps = 35/179 (19%)
Query: 75 LAKSASIK---TELREFYQTQLNNNVKEKLK-EFQTQL-----DSAELTL-----HREFQ 120
AK EL+ + L KE + + + + ++AE+ L +E
Sbjct: 252 FAKKLGEFETLEELKADIRKNLEREKKEAQRAKVKEAVLDALVENAEIDLPEALVEQEID 311
Query: 121 QKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 180
+ ++ A +Q E +F + T E ++ ++ +E+ L+LD E+
Sbjct: 312 R----LLRQALQQGLD-LEGQFLEDTGTTEEEPREEFREQAERRVKLGLLLD-----EIA 361
Query: 181 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVT 239
+AE ++ E E ++E + + + +E + Q + + +DV
Sbjct: 362 KAEEIEVSDE---------EVKAEIEEMASQYGQPPEVIE--FYLKNPQLLAALRADVL 409
>gnl|CDD|217899 pfam04108, APG17, Autophagy protein Apg17. Apg17 is required for
activating Apg1 protein kinases.
Length = 408
Score = 30.8 bits (70), Expect = 5.3
Identities = 27/145 (18%), Positives = 61/145 (42%), Gaps = 13/145 (8%)
Query: 16 SIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLL 75
+ EL NN ++ + + + ++K + + +E L +K ++ + L
Sbjct: 251 DVVKELHTVIDEIENNEKRVKKFLSSHMSKIEELHSATKE----LLEELEKYKERLPRYL 306
Query: 76 AKSASIKTELREFYQT--QLNNNVKEKLKEFQTQL----DSAELTLHRE--FQQKEKTIV 127
A A I+ +F + Q + E L EF L + R ++K + I+
Sbjct: 307 AIFADIRALWEDFKEPIQQYIQELSE-LCEFYDNFLNSYKGLLLEVERRRAVRKKMEKIL 365
Query: 128 ENAAKQLKQISENEFKQKEKTIVEN 152
+ +QL ++ E + K++++ + E
Sbjct: 366 KFCNEQLDKLQEEDQKRRQQFLAEY 390
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 30.5 bits (69), Expect = 5.6
Identities = 37/142 (26%), Positives = 50/142 (35%), Gaps = 12/142 (8%)
Query: 77 KSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQ 136
KSA E R+ + Q +++K Q +L +L R Q++K E AAKQ
Sbjct: 72 KSAKRAEEQRKKKEQQQAEELQQKQAAEQERL--KQLEKERLAAQEQKKQAEEAAKQA-- 127
Query: 137 ISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDI 196
K+K E AAK K A AK AE KK +
Sbjct: 128 ------ALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAK--KAAAEAKKKAEAEAAKKA 179
Query: 197 KLKETNHLLKEESKKRELMAKK 218
+ E + K AKK
Sbjct: 180 AAEAKKKAEAEAAAKAAAEAKK 201
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 30.4 bits (69), Expect = 5.9
Identities = 24/109 (22%), Positives = 47/109 (43%), Gaps = 10/109 (9%)
Query: 415 KISKEK-IESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
+ E+ + +L +Q +R LE+EL + EE+ + E I + LERL +
Sbjct: 84 AVKDERELRALNIEIQIAKERINSLEDELAELM---EEIEKLEKEIEDLKERLERLEKNL 140
Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELRE----FYQTQLNNN 518
++ + + + ++E Q L +K +K +L Y+ N
Sbjct: 141 AEAEARLEE--EVAEIREEGQELSSKREELKEKLDPELLSEYERIRKNK 187
Score = 30.0 bits (68), Expect = 6.9
Identities = 38/192 (19%), Positives = 84/192 (43%), Gaps = 20/192 (10%)
Query: 30 NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFY 89
NNN K+ IQ L + ++ ++E + L +KA+ E++ L +++ EL +
Sbjct: 3 NNNLKSLLAIQKLDLEKDRLEPRIKE-IRKAL---KKAKAELEALNKALEALEIELED-L 57
Query: 90 QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
+ Q++ E ++E + ++ AE L ++E A QI++ E +
Sbjct: 58 ENQVSQLESE-IQEIRERIKRAEEKLSAVKDEREL----RALNIEIQIAKERINSLEDEL 112
Query: 150 VENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEES 209
E ++++++ ++ + E E K L + + ++ E +E S
Sbjct: 113 AE-LMEEIEKLEKE---------IEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELS 162
Query: 210 KKRELMAKKVES 221
KRE + +K++
Sbjct: 163 SKREELKEKLDP 174
>gnl|CDD|237025 PRK11929, PRK11929, putative bifunctional
UDP-N-acetylmuramoylalanyl-D-glutamate--2,
6-diaminopimelate
ligase/UDP-N-acetylmuramoyl-tripeptide:D-alanyl-D-alanine
ligase; Provisional.
Length = 958
Score = 30.8 bits (70), Expect = 6.1
Identities = 14/41 (34%), Positives = 18/41 (43%)
Query: 1199 LGADEGSISRDDVGKDCFIIYQGHHGDHGASIADAILPGAA 1239
AD SR+ D F+ +G D A I A+ GAA
Sbjct: 28 ATADLRLDSREVQPGDLFVACRGAASDGRAFIDQALARGAA 68
>gnl|CDD|233326 TIGR01239, galT_2, galactose-1-phosphate uridylyltransferase,
family 2. This enzyme is involved in glucose and
galactose interconversion. This model describes one of
two extremely distantly related branches of the model
pfam01087 from PFAM [Energy metabolism, Sugars].
Length = 489
Score = 30.6 bits (69), Expect = 6.2
Identities = 15/78 (19%), Positives = 28/78 (35%), Gaps = 8/78 (10%)
Query: 85 LREFYQTQLNNNVKEKLKEFQTQLDSAEL--------TLHREFQQKEKTIVENAAKQLKQ 136
L + Q N E + L++ + ++R+F + + E A K Q
Sbjct: 54 LDQLLQWAAENGRMEADTTQRDILEAKLMDLITPRPSEVNRKFWETYASYPEQATKYFYQ 113
Query: 137 ISENEFKQKEKTIVENAA 154
S+ + K I +N
Sbjct: 114 YSKRVYYVKTDAIAKNIH 131
>gnl|CDD|238569 cd01149, HutB, Hemin binding protein HutB. These proteins have been
shown to function as initial receptors in ABC transport
of hemin and hemoproteins in many eubacterial species.
They belong to the TroA superfamily of periplasmic metal
binding proteins that share a distinct fold and ligand
binding mechanism. A typical TroA protein is comprised of
two globular subdomains connected by a single helix and
can bind the metal ion in the cleft between these
domains.
Length = 235
Score = 30.3 bits (69), Expect = 6.5
Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 16/91 (17%)
Query: 1118 KKLSAAKKPLIVVGADISDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVAAL 1177
+L AA P++ V + +L ++Q+A + L + +K + +
Sbjct: 75 DQLRAAGVPVVTVPST-PTLDGLLTKIRQVA-------------QALGVPEKGEALAQEV 120
Query: 1178 DIGYK--PGTSAIREKPPKVLFLLGADEGSI 1206
T A +KPP+VLFLL G+
Sbjct: 121 RQRLAALRKTVAAHKKPPRVLFLLSHGGGAA 151
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 30.7 bits (69), Expect = 6.8
Identities = 31/140 (22%), Positives = 57/140 (40%), Gaps = 10/140 (7%)
Query: 78 SASIKTELREFYQTQL-NNNVKEKL---KEFQTQLDSAELTLHREFQQKEKTIVENAAKQ 133
S I+ ++F QL N+ VK L K+ L+ E + +K + ++N
Sbjct: 83 SLKIEVYNKQFKHEQLWNSQVKGILTLGKKTNDDLEKIE-SKKESINKKNEKKIKNE-AS 140
Query: 134 LKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLML---DFKAKCEVYEAENKKLKAE 190
L+ ++ E K+++ I + K+ E+ FK K E EN K+ A
Sbjct: 141 LQVKTQREEKEEKDFIADCWRNLYKKNEERFKEYLENFKRKKFKRKILK-EFENAKINAS 199
Query: 191 LTQRDIKLKETNHLLKEESK 210
K+KE ++ ++
Sbjct: 200 EIVGLEKVKEKIEIVFRHNQ 219
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 30.1 bits (68), Expect = 6.8
Identities = 40/238 (16%), Positives = 92/238 (38%), Gaps = 29/238 (12%)
Query: 1 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQ-- 58
Q ++ L + +I+ E+ ++ N ++ Q +I L + K E+++ +
Sbjct: 34 QNQDSKLSELQKEKKNIQNEI----ESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEI 89
Query: 59 TQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLD------SAE 112
+L N + E+ A++ + + LN+ + + +++ A+
Sbjct: 90 AELKENIVERQELLKKRARAMQVNGTATSYIDVILNS---KSFSDLISRVTAISVIVDAD 146
Query: 113 LTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLD 172
+ + ++ +K++ ++ + E++ + E + S+K L+
Sbjct: 147 KKILEQQKEDKKSL-----EEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAA 201
Query: 173 FKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 230
AK EA KA L ++ K E+ K+E A K + Q+ LQ A
Sbjct: 202 LAAK----EASALGEKAALEEQ----KALAEAAAAEAAKQE-AAAKAAAQEQAALQAA 250
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 30.2 bits (68), Expect = 7.0
Identities = 29/155 (18%), Positives = 56/155 (36%), Gaps = 9/155 (5%)
Query: 394 VTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVV 453
+++F+ L ++T ++ + +E + L L +E R E+E +Q ++
Sbjct: 4 AMTNLFD--ILSSQT--NIDHPLCEECADILLERLDSEL---RDAEKERDTYKQYLSKLE 56
Query: 454 RKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQT 513
+ V I L + + D L L+K D++ L + K +L
Sbjct: 57 SQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQ 116
Query: 514 QLNNNVKEKLK--EFQTQLDSAELTLHREFQQKEK 546
L + + L S EL Q +K
Sbjct: 117 YLREYNLFDRNNLQLEDNLQSLELQYEYSLNQLDK 151
>gnl|CDD|236042 PRK07525, PRK07525, sulfoacetaldehyde acetyltransferase; Validated.
Length = 588
Score = 30.7 bits (70), Expect = 7.1
Identities = 25/90 (27%), Positives = 38/90 (42%), Gaps = 25/90 (27%)
Query: 1072 IRKGYLTNELDVAYIGPKVDLRYDY-----EHLGESADLIKQLASGSHPFSKKLSAAKKP 1126
I + Y +DV P +R + + L E+A+L LS AK P
Sbjct: 161 IPRDYFYGVIDVEIPQP---VRLERGAGGEQSLAEAAEL--------------LSEAKFP 203
Query: 1127 LIVVGADISDGAAV---LALVQQLAAKVTC 1153
+I+ GA + A+ AL ++L A V C
Sbjct: 204 VILSGAGVVLSDAIEECKALAERLDAPVAC 233
>gnl|CDD|162434 TIGR01580, narG, respiratory nitrate reductase, alpha subunit. The
Nitrate reductase enzyme complex allows bacteria to use
nitrate as an electron acceptor during anaerobic growth.
The enzyme complex consists of a tetramer that has an
alpha, beta and 2 gamma subunits. The alpha and beta
subunits have catalytic activity and the gamma subunits
attach the enzyme to the membrane and is a b-type
cytochrome that receives electrons from the quinone pool
and transfers them to the beta subunit. This model is
specific for the alpha subunit for nitrate reductase I
(narG) and nitrate reductase II (narZ) for gram positive
and gram negative bacteria.A few thermophiles and archaea
also match the model The seed members used to make the
model include Nitrate reductases from Pseudomonas
fluorescens (GP:11344601), E.coli and B.subtilis. All
seed members are experimentally characterized. Some
unpublished nitrate reductases, that are shorter
sequences, and probably fragments fall in between the
noise and trusted cutoffs. Pfam models pfam00384
(Molybdopterin oxidoreductase) and pfam01568(Molydopterin
dinucleotide binding domain) will also match the nitrate
reductase, alpha subunit [Energy metabolism, Anaerobic].
Length = 1235
Score = 30.7 bits (69), Expect = 7.1
Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 1231 ADAILPGAAYTEKQSTYVNTEGRAQQTLT-AVTPPGLAREDWKIIRALSEGVS 1282
+D +LP A + EK + L+ A+ P ++ DW+I +A+++ S
Sbjct: 782 SDIVLPTATWYEKDDMNTSDMHPFIHPLSAAIDPAWESKSDWEIYKAIAKAFS 834
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 30.4 bits (69), Expect = 7.7
Identities = 40/191 (20%), Positives = 76/191 (39%), Gaps = 32/191 (16%)
Query: 40 QTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKE 99
L A+ ++ +L + + QL +A+ E++ L+K+ + + R ++++ +E
Sbjct: 260 ANLQAQLRQLEHDL-DSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEE 318
Query: 100 ----------KLKEFQTQLDSA-------ELTLHREFQQKEKTIVENAAKQLKQISENEF 142
K+ E + ++A E T R + E +E SE E
Sbjct: 319 LEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAA-ASELEK 377
Query: 143 KQK--EKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKE 200
KQK +K + E K + +E A + + E +LK EL +LK+
Sbjct: 378 KQKNFDKILAEWKRKVDELQAELDTAQR-------EARNLSTELFRLKNELE----ELKD 426
Query: 201 TNHLLKEESKK 211
L+ E+K
Sbjct: 427 QVEALRRENKN 437
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 29.2 bits (66), Expect = 8.0
Identities = 21/96 (21%), Positives = 48/96 (50%), Gaps = 4/96 (4%)
Query: 54 REFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAEL 113
EF ++ +K + EI+ L +KT+L E + + ++ K ++ + +L + E
Sbjct: 49 LEFRESLEETLRKLEAEIERLQNTIERLKTQL-EDLERE-LALLQAKERQLEKKLKTLEQ 106
Query: 114 TLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
L E ++ ++ +N +Q K +E K++++ I
Sbjct: 107 KLKNEKEEVQRL--KNIIQQRKTQYNHELKKRDREI 140
>gnl|CDD|190336 pfam02520, DUF148, Domain of unknown function DUF148. This domain
has no known function nor do any of the proteins that
possess it. In one member of this family the aligned
region is repeated twice.
Length = 114
Score = 28.7 bits (65), Expect = 8.3
Identities = 20/104 (19%), Positives = 42/104 (40%), Gaps = 20/104 (19%)
Query: 65 QKAQDEIQTLLAKS----ASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
++A+ E +L A I +L + + V ++ +EF+ + + L E +
Sbjct: 1 EEARQEFFAILKNETLTIAEIDAQLEAWAEK---YGVADQYEEFKKNVKA----LKAEAE 53
Query: 121 QKEKTIVEN---AAKQLKQISENEFKQKEKTIVE--NAAKQLKQ 159
+ ++ N +L I +N K+ T + A LK+
Sbjct: 54 KNVTAVISNLSSVDAKLSAILDN----KDLTRRQKKEAIDALKK 93
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 29.1 bits (66), Expect = 8.4
Identities = 22/121 (18%), Positives = 49/121 (40%), Gaps = 19/121 (15%)
Query: 39 IQTLLAKSA---SIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN 95
+Q +L++S + + +L + ++ QK + E+Q K Q Q
Sbjct: 23 VQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQK-----------LQKQAAT 71
Query: 96 NVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAK 155
+E K Q +L + +E QQK++ + ++ +++ K ++ AK
Sbjct: 72 LSEEARKAKQQELQQKQ----QELQQKQQAAQQELQQKQQEL-LQPIYDKIDKAIKEVAK 126
Query: 156 Q 156
+
Sbjct: 127 E 127
>gnl|CDD|232855 TIGR00173, menD,
2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-
carboxylic-acid synthase. MenD was thought until
recently to act as SHCHC synthase, but has recently been
shown to act instead as SEPHCHC synthase. Conversion of
SEPHCHC into SHCHC and pyruvate may occur spontaneously
but is catalyzed efficiently, at least in some organisms,
by MenH (see TIGR03695). 2-oxoglutarate
decarboxylase/SHCHC synthase (menD) is a thiamine
pyrophosphate enzyme involved in menaquinone biosynthesis
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Menaquinone and ubiquinone].
Length = 430
Score = 30.3 bits (69), Expect = 8.6
Identities = 12/44 (27%), Positives = 19/44 (43%), Gaps = 2/44 (4%)
Query: 1113 SHPFSKKLSAAKKPLIVVG--ADISDGAAVLALVQQLAAKVTCE 1154
+L AK+ LI+ G A D A+ AL + L + +
Sbjct: 200 LDDLWDRLRQAKRGLIIAGPLAGAEDAEALAALAEALGWPLLAD 243
>gnl|CDD|234013 TIGR02785, addA_Gpos, helicase-exonuclease AddAB, AddA subunit,
Firmicutes type. AddAB, also called RexAB, substitutes
for RecBCD in several bacterial lineages. These DNA
recombination proteins act before synapse and are
particularly important for DNA repair of double-stranded
breaks by homologous recombination. The term AddAB is
used broadly, with AddA homologous between the
Firmicutes (as modeled here) and the
alphaproteobacteria, while the partner AddB proteins
show no strong homology across the two groups of species
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 1230
Score = 30.4 bits (69), Expect = 8.6
Identities = 29/164 (17%), Positives = 60/164 (36%), Gaps = 23/164 (14%)
Query: 412 VTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRS 471
L+ +EK++ E + +LE Q Q +E++++ ++ + + +
Sbjct: 217 NELEGLQEKLQRALELFMAEDGLAPRLEN-FQLDLQNIDELIQESLAQADWNELRKAVAA 275
Query: 472 SHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLD 531
K K A +E +L K++L++ +T
Sbjct: 276 FKFKNLKAAKGDEEDADLLEEAD-------------------KLREEAKKQLEKLKTDYF 316
Query: 532 SAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKL---MLDF 572
+ H Q+ K +V+ + +K E+ GA K +LDF
Sbjct: 317 TRSEEDHLRIMQEMKPVVKTLVQLVKDFIERFGAEKREKNILDF 360
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 30.0 bits (67), Expect = 8.9
Identities = 27/207 (13%), Positives = 76/207 (36%), Gaps = 29/207 (14%)
Query: 415 KISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHS 474
+ ++++ + +NL Q +L ++ QD + RL++
Sbjct: 119 EAVRQELAAARQNLAKAQQELARLTKQAQDLQT--------------------RLKTLAE 158
Query: 475 KRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAE 534
+R + LQ +Q ++Q A + +K+++ + ++++ + T+ ++A+
Sbjct: 159 QRRQLEAQAQSLQASQKQLQ---ASATQLKSQVLDLKLRS--AQIEQEAQNLATRANAAQ 213
Query: 535 LTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRD 594
E ++ + A ++Q + ++ A+ E ++L+ T +
Sbjct: 214 -ARTEELARRAAAAQQTAQA-IQQRDAQ--ISQKAQQIAARAEQIRERERQLQRLETAQA 269
Query: 595 IKLKETNHLLKEESKKRELMAKKVESV 621
+E L L + +
Sbjct: 270 RLEQEVAQLEAYYQAYVRLRQQAAATQ 296
>gnl|CDD|220772 pfam10475, DUF2450, Protein of unknown function N-terminal domain
(DUF2450). This protein is found in eukaryotes but its
function is not known. The C-terminal part of some
members is DUF2451.
Length = 291
Score = 30.0 bits (68), Expect = 9.4
Identities = 28/151 (18%), Positives = 63/151 (41%), Gaps = 22/151 (14%)
Query: 431 ETQRSRKLEEELQDTRQTCEEVVRKEVSIVT--VTLFLERLRSSHSKRS------KNADS 482
QR R++E++L++ C+ R+ + +T L + + KR +
Sbjct: 65 GMQRIREIEKDLKEANVICKNG-RRHLDSSKKNLTRTALGLLAKYKKRQLLIDLLPTLTT 123
Query: 483 LVRLQKAQDEIQTLLAKS------------ASIKTELREFYQTQ-LNNNVKEKLKEFQTQ 529
+ ++ + ++ LL + I ++ Q L++ ++E L +T+
Sbjct: 124 IKSIRDTEVRLRELLEEGNYPGAIQLLLECQQIAEGYSQYNCVQELSSKLQETLLLTETK 183
Query: 530 LDSAELTLHREFQQKEKTIVENAAKQLKQIS 560
LDS L + +F ++ V+ A K L ++
Sbjct: 184 LDSVLLEVCVDFDSEKYAKVQEAYKLLGKVL 214
>gnl|CDD|115367 pfam06705, SF-assemblin, SF-assemblin/beta giardin. This family
consists of several eukaryotic SF-assemblin and related
beta giardin proteins. During mitosis the
SF-assemblin-based cytoskeleton is reorganised; it
divides in prophase and is reduced to two dot-like
structures at each spindle pole in metaphase. During
anaphase, the two dots present at each pole are
connected again. In telophase there is an asymmetrical
outgrowth of new fibres. It has been suggested that
SF-assemblin is involved in re-establishing the
microtubular root system characteristic of interphase
cells after mitosis.
Length = 247
Score = 29.8 bits (67), Expect = 9.6
Identities = 28/150 (18%), Positives = 63/150 (42%), Gaps = 14/150 (9%)
Query: 407 KTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVV--RKEVSIVTVTL 464
+ + ++ E I LE++L+ E +R + ++++Q+ + + + R + +
Sbjct: 29 RIAEATRFQMITEAIARLEKSLEAEVKRRAESDKQIQEHFEGEIKNLQERIAEQVADMQA 88
Query: 465 FLERLRSSHSKRSKNADSLVRLQKAQ--DEIQTL---LAKS-----ASIKTE--LREFYQ 512
+ S S + + +R ++ Q +I+ L L K A+I E R+ Q
Sbjct: 89 AFKSAVDSLSDTLNDLHTNIRNEREQRRSDIEHLATSLVKKVNECVAAIDEERISRKQEQ 148
Query: 513 TQLNNNVKEKLKEFQTQLDSAELTLHREFQ 542
+Q+ V E ++D+ + T E
Sbjct: 149 SQILKKVAEDFARISARIDTEKNTREAEVS 178
>gnl|CDD|153338 cd07654, F-BAR_FCHSD, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of FCH and double SH3
domains proteins (FCHSD). F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. This subfamily is composed of FCH
and double SH3 domain (FCHSD) proteins, so named as they
contain an N-terminal F-BAR domain and two SH3 domains
at the C-terminus. Vertebrates harbor two subfamily
members, FCHSD1 and FCHSD2, which have been
characterized only in silico. Their biological function
is still unknown. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 264
Score = 29.9 bits (67), Expect = 9.8
Identities = 36/165 (21%), Positives = 65/165 (39%), Gaps = 20/165 (12%)
Query: 407 KTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEV---VRKEVSIVTVT 463
KT RS + K+ E L+R Q E Q++ + + + T E+V R++ + V
Sbjct: 102 KTGRSAKEQQLKKCTEQLQR-AQAEVQQTVRELSKSRKTYFEREQVAHLAREKAADVQAR 160
Query: 464 LFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKS--------ASIKTELREFYQTQL 515
L S+ S S V+L + E + + A+ +YQT L
Sbjct: 161 EARSDLSIFQSRTSLQKAS-VKLSARKAECSSKATAARNDYLLNLAATNAHQDRYYQTDL 219
Query: 516 NN-------NVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAA 553
+ + LK+F L EL + Q+ + ++E ++
Sbjct: 220 PAIIKALDGELYDHLKDFLISLSHTELETAQVIQETFQRLLETSS 264
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.128 0.353
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 64,492,197
Number of extensions: 6338665
Number of successful extensions: 7404
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6892
Number of HSP's successfully gapped: 686
Length of query: 1334
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1225
Effective length of database: 6,103,016
Effective search space: 7476194600
Effective search space used: 7476194600
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 65 (28.9 bits)