RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11935
         (1334 letters)



>gnl|CDD|236387 PRK09130, PRK09130, NADH dehydrogenase subunit G; Validated.
          Length = 687

 Score =  785 bits (2029), Expect = 0.0
 Identities = 306/640 (47%), Positives = 377/640 (58%), Gaps = 119/640 (18%)

Query: 790  PVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFG 849
            PVA+CAMPV +G  + TN+ M ++AREGVMEFLL+NHPLDCPICDQGGECDLQDQ+MA+G
Sbjct: 59   PVASCAMPVGEGMVIFTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAYG 118

Query: 850  ---------------------------------------------NDLGTTGRGSDMQVG 864
                                                          +LG  GRG DM++ 
Sbjct: 119  VDTSRYHENKRAVEDKYMGPLVKTVMTRCIHCTRCVRFATEVAGVPELGAIGRGEDMEIT 178

Query: 865  TYVEKLFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNIIVSTRT 924
            TY+E+   SELSGNVIDLCPVGALTSKPY+FTARPWE +KTES+DV+DAVGSNI V TR 
Sbjct: 179  TYLEQALTSELSGNVIDLCPVGALTSKPYAFTARPWELKKTESIDVMDAVGSNIRVDTRG 238

Query: 925  GEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQRLLTPFVRNCDGQLVATEWEDALIAVA 984
             EV+R+LPR+NE++NEEW++DK RF++DGLKRQRL  P+VR  +G+LV   W++A  A+A
Sbjct: 239  REVMRILPRVNEEVNEEWISDKTRFSWDGLKRQRLDRPYVRK-NGKLVPASWDEAFAAIA 297

Query: 985  QKLQ---TSEVAGVVGSLADAEAMVALKDLLNKLGSEDL---YTEYAFPLEGAGTDLRAN 1038
             K++     ++A + G LAD E+M ALKDL+ KLGS +L                 LRA+
Sbjct: 298  AKIKGTPGEKIAAIAGDLADVESMFALKDLMQKLGSSNLDCRQDGAKLD-----PSLRAS 352

Query: 1039 YLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIGPKVDLRYDYEH 1098
            YL N  IAG EEAD ILLIG NPRFEAP+ NARIRK +      +A IG + DL Y YE+
Sbjct: 353  YLFNTTIAGIEEADAILLIGANPRFEAPVLNARIRKRWRAGGFKIAVIGEQADLTYPYEY 412

Query: 1099 LGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADI---SDGAAVLALVQQLAAKVTCES 1155
            LG   D +  LASG H F+  L AAK+P+I+VG      +DGAAVLAL  +LA KV    
Sbjct: 413  LGAGPDTLADLASGKHEFADVLKAAKRPMIIVGQGALARADGAAVLALAAKLAEKVGAVR 472

Query: 1156 DVPCDWKVLNILQKAASQVAALDIGYKPGT------SAIREKPPKVLFLLGADEGSISRD 1209
            D    W   N+L  AAS+V  LD+G+ PG         +      VL+LLGADE  IS+ 
Sbjct: 473  D---GWNGFNVLHTAASRVGGLDLGFVPGEGGKDAAEMLESGALDVLYLLGADEIDISK- 528

Query: 1210 DVGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLARE 1269
              GK  F+IYQGHHGD GA  AD ILPGAAYTEK  TYVNTEGR Q    AV PPG A+E
Sbjct: 529  --GKSAFVIYQGHHGDRGAHRADVILPGAAYTEKSGTYVNTEGRVQLANRAVFPPGEAKE 586

Query: 1270 DWKIIRALSE--------------------------------------------GVSSNL 1285
            DW I+RALS+                                                 L
Sbjct: 587  DWAILRALSDVLGKTLPYDSLAQLRAKLAAAYPHFAAIDQITPSKDAKDLAALASKKGKL 646

Query: 1286 SSEPVDVKQKNLEDYYMTDPISRASGTMAKCIQAVHKQKQ 1325
            S  P     K   D+Y+T+PI+RAS TMA+C      +  
Sbjct: 647  SKAPFTSPVK---DFYLTNPIARASATMAECSALASGRAL 683



 Score =  162 bits (413), Expect = 3e-41
 Identities = 55/119 (46%), Positives = 69/119 (57%), Gaps = 37/119 (31%)

Query: 669 IDDKKVMVDPGTTVLQ-------------------------------------PVAACAM 691
           +D K++ V  G T+LQ                                     PVA+CAM
Sbjct: 6   VDGKEIEVPDGYTLLQACEAAGAEIPRFCYHERLSIAGNCRMCLVEVKGGPPKPVASCAM 65

Query: 692 PVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSRF 750
           PV +G  + TN+ M ++AREGVMEFLL+NHPLDCPICDQGGECDLQDQ+MA+G D SR+
Sbjct: 66  PVGEGMVIFTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAYGVDTSRY 124


>gnl|CDD|233662 TIGR01973, NuoG, NADH-quinone oxidoreductase, chain G.  This model
            represents the G subunit (one of 14: A->N) of the
            NADH-quinone oxidoreductase complex I which generally
            couples NADH and ubiquinone oxidation/reduction in
            bacteria and mammalian mitochondria while translocating
            protons, but may act on NADPH and/or plastoquinone in
            cyanobacteria and plant chloroplasts. This model excludes
            related subunits from formate dehydrogenase complexes
            [Energy metabolism, Electron transport].
          Length = 603

 Score =  622 bits (1607), Expect = 0.0
 Identities = 286/667 (42%), Positives = 361/667 (54%), Gaps = 120/667 (17%)

Query: 667  VFIDDKKVMVDPGTTVLQ-------------------------------------PVAAC 689
            +FID K++ V  GTTVLQ                                     PVA+C
Sbjct: 1    IFIDGKELEVPKGTTVLQACLSAGIEIPRFCYHEKLSIAGNCRMCLVEVEKFPDKPVASC 60

Query: 690  AMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSR 749
            A PV  G ++ TNSE  ++AREGVMEFLL+NHPLDCPICDQGGECDLQDQ++ +GSDRSR
Sbjct: 61   ATPVTDGMKISTNSEKVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAVMYGSDRSR 120

Query: 750  FTDIDFSGKSGTYTRVTRQRSWGLIPSRVFLQSINRHWIGPVAACAMPVMKGWRVKTNSE 809
            F                          R   +++   ++GP+            +KT  E
Sbjct: 121  F--------------------------REKKRTVENKYLGPL------------IKT--E 140

Query: 810  MTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGNDLGTTGRGSDMQVGTYVEK 869
            MTR                    C     C      +A   DLG  GRG+++++GTY  K
Sbjct: 141  MTR--------------------CIHCTRCVRFANEVAGVEDLGVIGRGNNVEIGTYEGK 180

Query: 870  LFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNIIVSTRTGEVLR 929
               SELSGN+ID+CPVGALTSKPY+F ARPWE + T S+ V D+VG NI V  R GE++R
Sbjct: 181  TLESELSGNLIDICPVGALTSKPYAFKARPWELKSTPSICVHDSVGCNIRVDERNGEIMR 240

Query: 930  VLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVRNCDGQLVATEWEDALIAVAQKLQ 988
            +LPR N++INEEWL DKGRF YDGL RQ RL  P +RN +G L+   W +AL   A+KL+
Sbjct: 241  ILPRENDEINEEWLCDKGRFGYDGLNRQDRLTKPLLRNQEGNLLEVSWAEALAIAAEKLK 300

Query: 989  TSE-VAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGTDLRANYLLNNKIAG 1047
             S  + G+ G  +  E + ALK L+ KLGSE+           +  DLRANYL N  +A 
Sbjct: 301  ASSRIGGIAGPRSSLEELFALKKLVRKLGSENFDLRIRNYEFES-ADLRANYLFNTTLAD 359

Query: 1048 AEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIGP-KVDLRYD-----YEHLGE 1101
             EEADL+LL+G + R EAPL N R+RK        VA IG  K +L Y        H G 
Sbjct: 360  IEEADLVLLVGADLRQEAPLLNLRLRKAVKKGGAKVALIGIEKWNLTYPANTNLVFHPGL 419

Query: 1102 SADLIKQLASGSHP-FSKKLSAAKKPLIVVGADI---SDGAAVLALVQQLAAKVTCESDV 1157
            S   +  +ASG+H   +  L AAKKPLI+VG       DGAA+++      AKV      
Sbjct: 420  SPKKLDDIASGAHSDIAAALKAAKKPLIIVGDSAYSHLDGAALISAAAN-IAKVIK--VR 476

Query: 1158 PCDWKVLNILQKAASQVAALDIGYKPGT--SAIREKPPKVLFLLGADEG-----SISRDD 1210
              +W  LNIL   A+ V  LD+G +     +A+       LFLLGAD       +     
Sbjct: 477  RKEWNGLNILSSGANSVGLLDLGGESTGLDAALNLGAADALFLLGADLERALDKTARDAL 536

Query: 1211 VGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLARED 1270
               D F+IYQGHHG   A  AD ILPGAA+TEK  TYVN EGRAQ+   AV PPG ARED
Sbjct: 537  SKADAFVIYQGHHGTETAEKADVILPGAAFTEKSGTYVNLEGRAQRFEQAVKPPGEARED 596

Query: 1271 WKIIRAL 1277
            W+I+RAL
Sbjct: 597  WRILRAL 603


>gnl|CDD|239174 cd02773, MopB_Res-Cmplx1_Nad11, MopB_Res_Cmplx1_Nad11: The second
            domain of the Nad11/75-kDa subunit of the NADH-quinone
            oxidoreductase/respiratory complex I/NADH
            dehydrogenase-1(NDH-1) of eukaryotes and the Nqo3/G
            subunit of alphaproteobacteria NDH-1. The NADH-quinone
            oxidoreductase is the first energy-transducting complex
            in the respiratory chains of many prokaryotes and
            eukaryotes. Mitochondrial complex I and its bacterial
            counterpart, NDH-1, function as a redox pump that uses
            the redox energy to translocate H+ ions across the
            membrane, resulting in a significant contribution to
            energy production. The nad11 gene codes for the largest
            (75 kDa) subunit of the mitochondrial NADH:ubiquinone
            oxidoreductase, it constitutes the electron input part of
            the enzyme, or the so-called NADH dehydrogenase fragment.
            In Paracoccus denitrificans, this subunit is encoded by
            the nqo3 gene, and is part of the 14 distinct subunits
            constituting the 'minimal' functional enzyme. The
            Nad11/Nqo3 subunit is made of two domains: the first
            contains three binding sites for FeS clusters (the fer2
            domain), the second domain (this CD), is of unknown
            function or, as postulated, has lost an ancestral formate
            dehydrogenase activity that became redundant during the
            evolution of the complex I enzyme. Although only
            vestigial sequence evidence remains of a molybdopterin
            binding site, this protein domain belongs to the
            molybdopterin_binding (MopB) superfamily of proteins.
          Length = 375

 Score =  611 bits (1577), Expect = 0.0
 Identities = 235/381 (61%), Positives = 278/381 (72%), Gaps = 15/381 (3%)

Query: 906  ESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQRLLTPFVR 965
            ES+DVLDAVGSNI V TR GEV+R+LPRLNEDINEEW++DK RFAYDGLKRQRL  P++R
Sbjct: 1    ESIDVLDAVGSNIRVDTRGGEVMRILPRLNEDINEEWISDKTRFAYDGLKRQRLDKPYIR 60

Query: 966  NCDGQLVATEWEDALIAVAQKLQT---SEVAGVVGSLADAEAMVALKDLLNKLGSEDLYT 1022
              +G+L    WE+AL A+A+ L+     E+A + G LAD E+MVALKDLLNKLGSE+L  
Sbjct: 61   K-NGKLKPATWEEALAAIAKALKGVKPDEIAAIAGDLADVESMVALKDLLNKLGSENLAC 119

Query: 1023 EYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELD 1082
            E   P      DLR+NYL N  IAG EEAD +LL+GTNPRFEAP+ NARIRK +L   L 
Sbjct: 120  EQDGP--DLPADLRSNYLFNTTIAGIEEADAVLLVGTNPRFEAPVLNARIRKAWLHGGLK 177

Query: 1083 VAYIGPKVDLRYDYEHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADI---SDGAA 1139
            V  IGP VDL YDY+HLG  A  ++ +ASG HPFSK L  AKKP+I+VG+      DGAA
Sbjct: 178  VGVIGPPVDLTYDYDHLGTDAKTLQDIASGKHPFSKALKDAKKPMIIVGSGALARKDGAA 237

Query: 1140 VLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVAALDIGYKPGTSAIREK-PPKVLFL 1198
            +LA V +LA K      V   W   N+L +AAS+V ALD+G+ PG  AIR+  PPKVL+L
Sbjct: 238  ILAAVAKLAKKN---GVVREGWNGFNVLHRAASRVGALDLGFVPGAGAIRKSGPPKVLYL 294

Query: 1199 LGADEGSISRDDVGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTL 1258
            LGADE  I    + KD F++YQGHHGD GA IAD ILPGAAYTEK  TYVNTEGR QQT 
Sbjct: 295  LGADEIDI--TPIPKDAFVVYQGHHGDRGAQIADVILPGAAYTEKSGTYVNTEGRVQQTR 352

Query: 1259 TAVTPPGLAREDWKIIRALSE 1279
             AV+PPG AREDWKI+RALSE
Sbjct: 353  KAVSPPGDAREDWKILRALSE 373


>gnl|CDD|223965 COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75
            kD subunit (chain G) [Energy production and conversion].
          Length = 693

 Score =  469 bits (1208), Expect = e-149
 Identities = 236/640 (36%), Positives = 314/640 (49%), Gaps = 115/640 (17%)

Query: 790  PVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFG 849
             VA+CA PV  G  + TNSE  ++AREGVMEFLL+NHPLDCP+CD+GGEC+LQD ++ +G
Sbjct: 58   LVASCATPVTDGMVISTNSEEVKKAREGVMEFLLINHPLDCPVCDKGGECELQDLAVKYG 117

Query: 850  ND---------------------------------------------LGTTGRGSDMQVG 864
                                                           LG   RG + ++G
Sbjct: 118  VSHSRYRETKRTHRDKDLGPLVKYDMNRCILCTRCVRFCKEIAGTHELGVIKRGENSEIG 177

Query: 865  TYVEKLFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNIIVSTRT 924
            TY+++   SEL GN +D+CPVGALTSKP++FTAR WE +KT S+ V  AVG NI V  R 
Sbjct: 178  TYLDQPLESELCGNCVDVCPVGALTSKPFAFTARKWELKKTPSICVHCAVGCNIRVDERY 237

Query: 925  GEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQRLLTPFVRNCDGQLVATEWEDALIAVA 984
            GEV R+LPR NE +NEEWL DKGRFAYDGL  QRL  P +R   G+LV   W +A  A+A
Sbjct: 238  GEVRRILPRYNEVVNEEWLCDKGRFAYDGLNLQRLDRPKIRV-GGRLVEASWLEANEAIA 296

Query: 985  QKL---QTSEVAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGTDL---RAN 1038
            Q L   +  +V  +    A  E + ALK+L  +LGS +         E A  D    RA 
Sbjct: 297  QALALIKPEKVGAIASPRASVEELFALKELAGELGSSN----IDHRQEDARLDPKVARAG 352

Query: 1039 YLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIGPKVDLRYDY-- 1096
            YL N  IA  E AD +L+IG N R EAP+   RIRK      L VA IG   +  Y    
Sbjct: 353  YLYNPTIAEIESADAVLVIGANLRQEAPVLALRIRKAVKGKGLPVAVIGGVAEWLYALLL 412

Query: 1097 -EHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADI---SDGAAVLALVQQLAAKVT 1152
               LG    L+ +LA G+   +  +  A++ LI  G      +  AA+LAL ++LA ++ 
Sbjct: 413  SILLGAGIALLDELALGAEAATAAVKKAERELIEKGKGALAGAKKAAILALAEKLADEL- 471

Query: 1153 CESDVPCDWKVLNILQKAASQVAALDIGYKPG-----TSAIREKPPK-VLFLLGADEGSI 1206
                    W    +L +AAS+V AL +G+ PG      + +        L LLG DE   
Sbjct: 472  --GAAEARWN-GVVLHEAASRVNALGLGFLPGLSGEDAALMLGPADANALLLLGIDE-EA 527

Query: 1207 SRDDVGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGL 1266
               D     F++Y  HHGD GA +AD +LP A++TEK  TYVN EGR Q+   A+ P G 
Sbjct: 528  DEADEHAK-FVVYSDHHGDAGAEVADVVLPAASFTEKSGTYVNLEGRVQRFNQALRPGGD 586

Query: 1267 AREDWKIIRALS--------------------------EGVSSNLSSEPVDVK------- 1293
             REDW+++ AL+                            + S    E            
Sbjct: 587  EREDWRVLHALASELGLKLDFDQLDAVRAALAAAHPGFALIDSAAKGEREPPAGEGILAN 646

Query: 1294 --------QKNLEDYYMTDPISRASGTMAKCIQAVHKQKQ 1325
                        +D+Y+  P +R S   AK +Q    Q  
Sbjct: 647  RANISVAEPLQPQDFYLAMPGARNSPQAAKKLQLEVGQAV 686



 Score =  121 bits (306), Expect = 4e-28
 Identities = 49/121 (40%), Positives = 64/121 (52%), Gaps = 36/121 (29%)

Query: 667 VFIDDKKVMVDPGTTVLQ------------------------------------PVAACA 690
           + ID K++ V  G TVLQ                                     VA+CA
Sbjct: 4   ITIDGKEIEVPEGETVLQAAREAGIDIPTFCYHPRLSIAGACRMCLVEVEGAPKLVASCA 63

Query: 691 MPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSRF 750
            PV  G  + TNSE  ++AREGVMEFLL+NHPLDCP+CD+GGEC+LQD ++ +G   SR+
Sbjct: 64  TPVTDGMVISTNSEEVKKAREGVMEFLLINHPLDCPVCDKGGECELQDLAVKYGVSHSRY 123

Query: 751 T 751
            
Sbjct: 124 R 124


>gnl|CDD|239169 cd02768, MopB_NADH-Q-OR-NuoG2, MopB_NADH-Q-OR-NuoG2: The
            NuoG/Nad11/75-kDa subunit (second domain) of the
            NADH-quinone oxidoreductase (NADH-Q-OR)/respiratory
            complex I/NADH dehydrogenase-1 (NDH-1). The NADH-Q-OR is
            the first energy-transducting complex in the respiratory
            chains of many prokaryotes and eukaryotes. Mitochondrial
            complex I and its bacterial counterpart, NDH-1, function
            as a redox pump that uses the redox energy to translocate
            H+ ions across the membrane, resulting in a significant
            contribution to energy production. The atomic structure
            of complex I is not known and the mechanisms of electron
            transfer and proton pumping are not established. The
            nad11 gene codes for the largest (75-kDa) subunit of the
            mitochondrial NADH:ubiquinone oxidoreductase, it
            constitutes the electron input part of the enzyme, or the
            so-called NADH dehydrogenase fragment. In Escherichia
            coli, this subunit is encoded by the nuoG gene, and is
            part of the 14 distinct subunits constituting the
            'minimal' functional enzyme. The nad11 gene is
            nuclear-encoded in animals, plants, and fungi, but is
            still encoded in the mitochondrial genome of some
            protists. The Nad11/NuoG subunit is made of two domains:
            the first contains three binding sites for FeS clusters
            (the fer2 domain), the second domain (this CD), is of
            unknown function or, as postulated, has lost an ancestral
            formate dehydrogenase activity that became redundant
            during the evolution of the complex I enzyme. Although
            only vestigial sequence evidence remains of a
            molybdopterin binding site, this protein domain family
            belongs to the molybdopterin_binding (MopB) superfamily
            of proteins. Bacterial type II NADH-quinone
            oxidoreductases and NQR-type sodium-motive NADH-quinone
            oxidoreductases are not homologs of this domain family.
          Length = 386

 Score =  435 bits (1120), Expect = e-141
 Identities = 186/394 (47%), Positives = 230/394 (58%), Gaps = 30/394 (7%)

Query: 906  ESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLK-RQRLLTPFV 964
            ES+DV DA+GSNI V  R GEV+R+LPR NE INEEW++DKGRF YDGL  RQRL  P +
Sbjct: 1    ESIDVHDALGSNIRVDVRGGEVMRILPRENEAINEEWISDKGRFGYDGLNSRQRLTQPLI 60

Query: 965  RNCDGQLVATEWEDALIAVAQKLQTS---EVAGVVGSLADAEAMVALKDLLNKLGSEDLY 1021
            +   G+LV   WE+AL  VA+ L+     ++ G+ G  AD E++  LK LLNKLGS ++ 
Sbjct: 61   KK-GGKLVPVSWEEALKTVAEGLKAVKGDKIGGIAGPRADLESLFLLKKLLNKLGSNNID 119

Query: 1022 TEYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNEL 1081
                     A   LR NYL N  IA  EEAD +LLIG+N R EAPL NAR+RK       
Sbjct: 120  HRLRQSDLPADNRLRGNYLFNTSIAEIEEADAVLLIGSNLRKEAPLLNARLRKAVKKKGA 179

Query: 1082 DVAYIGPK-----VDLRYDYEHLGESADLIKQLASGSH--PFSKKLSAAKKPLIVVGADI 1134
             +A IGPK      DL Y    LG S   +  +A G H  PF+K L  AKKPLI++G+  
Sbjct: 180  KIAVIGPKDTDLIADLTYPVSPLGASLATLLDIAEGKHLKPFAKSLKKAKKPLIILGSSA 239

Query: 1135 --SDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVAA--LDIGYKPGTSAIRE 1190
               DGAA+L  +  LAAK+   +     W  LN+L    +++    LD G         +
Sbjct: 240  LRKDGAAILKALANLAAKLGTGAG---LWNGLNVLNSVGARLGGAGLDAGLALLEPGKAK 296

Query: 1191 KPPKVLFLLGADEGSISRDDVG-----KDCFIIYQGHHGDHGASIADAILPGAAYTEKQS 1245
                 L LLG DE   S           D F++YQGHHGD GA  AD ILP AA+TEK  
Sbjct: 297  -----LLLLGEDELDRSNPPAAVALAAADAFVVYQGHHGDTGAQ-ADVILPAAAFTEKSG 350

Query: 1246 TYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
            TYVNTEGR Q+   AV+PPG AREDWKI+RALS 
Sbjct: 351  TYVNTEGRVQRFKKAVSPPGDAREDWKILRALSN 384


>gnl|CDD|236386 PRK09129, PRK09129, NADH dehydrogenase subunit G; Validated.
          Length = 776

 Score =  381 bits (982), Expect = e-116
 Identities = 204/712 (28%), Positives = 296/712 (41%), Gaps = 185/712 (25%)

Query: 666  EVFIDDKKVMVDPGTTVLQ------------------------------------PVAAC 689
            E+ ID KKV V  G+ V++                                    P+ AC
Sbjct: 3    EIEIDGKKVEVPEGSMVIEAADKAGIYIPRFCYHKKLSIAANCRMCLVEVEKAPKPLPAC 62

Query: 690  AMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSR 749
            A PV  G +V T SE   +A++ VMEFLL+NHPLDCPICDQGGEC LQD ++ +G   SR
Sbjct: 63   ATPVTDGMKVFTRSEKALKAQKSVMEFLLINHPLDCPICDQGGECQLQDLAVGYGRSTSR 122

Query: 750  FTDIDFSGKSGTYTRVTRQRSWGLIPSRVFLQSINRHWIGPVAACAMPVMKGWRVKTNSE 809
            +T+               +R        VF +      +GP+ +              +E
Sbjct: 123  YTE--------------EKRV-------VFDKD-----LGPLIS--------------TE 142

Query: 810  MTR--------RARE---GVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGNDLGTTGRG 858
            MTR        R  +   GVME                                G  GRG
Sbjct: 143  MTRCIHCTRCVRFGQEIAGVMEL-------------------------------GMMGRG 171

Query: 859  SDMQVGTYVEKLFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNI 918
               ++ TYV K   SELSGN+IDLCPVGALTSKP+ ++AR WE  + +SV   D++GSN+
Sbjct: 172  EHSEITTYVGKTVDSELSGNMIDLCPVGALTSKPFRYSARTWELSRRKSVSPHDSLGSNL 231

Query: 919  IVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVRNCDGQLVATEWE 977
            +V  +   V+RV+PR NE +NE W++D+ RF+Y+GL  + RL  P ++   GQ    +WE
Sbjct: 232  VVHVKNNRVMRVVPRENEAVNECWISDRDRFSYEGLNSEDRLTKPMIKQ-GGQWKEVDWE 290

Query: 978  DALIAVAQKLQ-------TSEVAGVVGSLADAEAMVALKDLLNKLGSEDL---YTEYAFP 1027
             AL  VA+ L+         ++  +    +  E +  L+ L   LGS ++     +  F 
Sbjct: 291  TALEYVAEGLKGIIEDHGADQIGALASPHSTLEELYLLQKLARGLGSGNIDHRLRQQDFR 350

Query: 1028 LEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIG 1087
             + A         L   IA     D +L++G+N R E PL  AR+R+    N   ++ I 
Sbjct: 351  DDAAAPGAPW---LGMPIAELSNLDAVLVVGSNLRKEHPLLAARLRQAAK-NGAKLSAIN 406

Query: 1088 PKV-DLRYDYEH---------LGESADLIKQLA---SGSHPFS--------------KKL 1120
            P   D  +                 A +   +A     + P +              + L
Sbjct: 407  PVDDDFLFPVAQRIIVAPSAWADALAGVAAAVAAAKGVALPEALAKVLAAAAARAIAQSL 466

Query: 1121 SAAKKPLIVVGADI---SDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVAAL 1177
            +  ++  I++G         A + AL Q +A               L  L +AA+ V A 
Sbjct: 467  ANGERAAILLGNLAVNHPQAATLRALAQWIAKLTGA---------TLGFLTEAANSVGAH 517

Query: 1178 DIGYKPGTS-----AIREKPPKVLFLLGADEGSISRDDV-------GKDCFIIYQGHHGD 1225
              G  PG       A+  +P K   LL  +      D           +  +        
Sbjct: 518  LAGALPGKGGLNAAAMLAQPRKAYLLLNVEPELDCADPAQARAALNQAEFVVALSAFASK 577

Query: 1226 HGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRAL 1277
                 AD +LP A +TE   T+VN EGR Q     V P G AR  WK++R L
Sbjct: 578  ATLDYADVLLPIAPFTETSGTFVNAEGRVQSFKGVVRPLGEARPAWKVLRVL 629


>gnl|CDD|215888 pfam00384, Molybdopterin, Molybdopterin oxidoreductase. 
          Length = 358

 Score =  344 bits (885), Expect = e-108
 Identities = 164/357 (45%), Positives = 203/357 (56%), Gaps = 35/357 (9%)

Query: 958  RLLTPFVRNCDGQLVATEWEDALIAVAQKLQ---------TSEVAGVVGSLADAEAMVAL 1008
            RL  P VR  +G+ V   W++AL  +A+KL+            + G  G L D E++ AL
Sbjct: 1    RLKYPMVRRGEGKFVRVSWDEALDLIAKKLKRIIKKYGPDAIAINGGSGGLTDVESLYAL 60

Query: 1009 KDLLNKLGSEDLYTE-----YAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRF 1063
            K LLN+LGS++  TE             G+DLR+NYL NN IA  E ADLILLIGTNPR 
Sbjct: 61   KKLLNRLGSKNGNTEDHNGDLCTAAAAFGSDLRSNYLFNNSIADIENADLILLIGTNPRE 120

Query: 1064 EAPLFNARIRKGYLTNELDVAYIGPKVDLRYDYEHLGESAD-LIKQLASGSHPFSKKLSA 1122
            EAP+ NARIRK  L  +  V  IGP++D  Y  EHLG      +    +G+H F K+L  
Sbjct: 121  EAPILNARIRKAALKGKAKVIVIGPRLDETYADEHLGIKPGTDLALALAGAHVFIKELKK 180

Query: 1123 AK----KPLIVVGADI---SDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVA 1175
             K    KP+I+VGA +   +DG A+   +  LA            W  LNILQ AAS V 
Sbjct: 181  DKDFAPKPIIIVGAGVLQRADGEAIFRAIANLADLTGNIGRPGGGWNGLNILQGAASPVG 240

Query: 1176 ALDIGYKPG------TSAIREKPPKVLFLLG-------ADEGSISRDDVGKDCFIIYQGH 1222
            ALD+G  PG       +AI++   KVL+LLG       ADE  + +     D F++Y GH
Sbjct: 241  ALDLGLVPGIKSVEMINAIKKGGIKVLYLLGNNPFVTHADENRVVKALEKLDLFVVYDGH 300

Query: 1223 HGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
            HGD  A  AD ILP AAYTEK  TYVNTEGR Q T  AV PPG AREDWKI+RALSE
Sbjct: 301  HGDKTAKYADVILPAAAYTEKNGTYVNTEGRVQSTKQAVPPPGEAREDWKILRALSE 357


>gnl|CDD|239175 cd02774, MopB_Res-Cmplx1_Nad11-M, MopB_Res_Cmplx1_Nad11_M:
            Mitochondrial-encoded NADH-quinone
            oxidoreductase/respiratory complex I, the second domain
            of the Nad11/75-kDa subunit of some protists.
            NADH-quinone oxidoreductase is the first
            energy-transducting complex in the respiratory chain and
            functions as a redox pump that uses the redox energy to
            translocate H+ ions across the membrane, resulting in a
            significant contribution to energy production. The nad11
            gene codes for the largest (75-kDa) subunit of the
            mitochondrial NADH-quinone oxidoreductase, it constitutes
            the electron input part of the enzyme, or the so-called
            NADH dehydrogenase fragment. The Nad11 subunit is made of
            two domains: the first contains three binding sites for
            FeS clusters (the fer2 domain), the second domain (this
            CD), is of unknown function or, as postulated, has lost
            an ancestral formate dehydrogenase activity that became
            redundant during the evolution of the complex I enzyme.
            Although only vestigial sequence evidence remains of a
            molybdopterin binding site, this protein domain belongs
            to the molybdopterin_binding (MopB) superfamily of
            proteins.
          Length = 366

 Score =  255 bits (654), Expect = 8e-76
 Identities = 120/375 (32%), Positives = 197/375 (52%), Gaps = 16/375 (4%)

Query: 906  ESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQRLLTPFVR 965
            ES+DVLD++GSNI V  +  E+LR+LP++N+++NEEW++DK RF+YD LK QR+ TP ++
Sbjct: 1    ESIDVLDSLGSNIRVDIKGNEILRILPKINDELNEEWISDKIRFSYDSLKYQRIKTPLLK 60

Query: 966  NCDGQLVATEWEDALIAVAQKLQT---SEVAGVVGSLADAEAMVALKDLLNKLGSEDLYT 1022
              +   +   W+ A   + + +     S++  ++GS  D E +   K LLNKLGS +  +
Sbjct: 61   LSNNSFLEIGWKTAFKFLNKFILLKKFSKLNFIIGSKIDLETLFYYKKLLNKLGSLNTNS 120

Query: 1023 EYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELD 1082
                           NYL NN +   +++DL LLIG+N R E+P+ N R+R  Y      
Sbjct: 121  NNFLENNNYFNLDLENYLFNNSLKNLDKSDLCLLIGSNLRVESPILNIRLRNRYNKGNKK 180

Query: 1083 VAYIGPKVDLRYDYEHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADISDGAAVLA 1142
            +  IG K D  Y  +H+G S + + ++  G H F K+L  +KKPLI++G+  S       
Sbjct: 181  IFVIGNKFDTTYPSKHIGLSLNTLLKILEGKHLFCKQLKKSKKPLIIIGSSFSLRKNYSF 240

Query: 1143 LVQQLAAKVTCESDVPCDWKVLNILQKAASQVAALDIGYKPGTSAIREKPPKVLFLLGAD 1202
            ++ +L       S+        N L   ++ +  L I  K  ++   +K    L+ +   
Sbjct: 241  IISKLK---NFSSNNE---NNFNFLNIISNSLYYLGI-KKFNSN--NKKNLSNLYYIKET 291

Query: 1203 EGSISRDDVGKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVT 1262
                      K+ F+IYQGHH  + A+ ++ ILP   + EK++ Y+N EG  Q+T   ++
Sbjct: 292  NFQK----FNKNNFVIYQGHHFLNLANNSNLILPSKTFFEKEALYLNLEGILQKTKKILS 347

Query: 1263 PPGLAREDWKIIRAL 1277
                 + D  II +L
Sbjct: 348  FKENIKSDDNIIFSL 362


>gnl|CDD|238218 cd00368, Molybdopterin-Binding, Molybdopterin-Binding (MopB) domain
            of the MopB superfamily of proteins, a  large, diverse,
            heterogeneous superfamily of enzymes that, in general,
            bind molybdopterin as a cofactor. The MopB domain is
            found in a wide variety of molybdenum- and
            tungsten-containing enzymes, including formate
            dehydrogenase-H (Fdh-H) and -N (Fdh-N), several forms of
            nitrate reductase (Nap, Nas, NarG), dimethylsulfoxide
            reductase (DMSOR), thiosulfate reductase,
            formylmethanofuran dehydrogenase, and arsenite oxidase.
            Molybdenum is present in most of these enzymes in the
            form of molybdopterin, a modified pterin ring with a
            dithiolene side chain, which is responsible for ligating
            the Mo. In many bacterial and archaeal species,
            molybdopterin is in the form of a dinucleotide, with two
            molybdopterin dinucleotide units per molybdenum. These
            proteins can function as monomers, heterodimers, or
            heterotrimers, depending on the protein and organism.
            Also included in the MopB superfamily is the
            eukaryotic/eubacterial protein domain family of the
            75-kDa subunit/Nad11/NuoG (second domain) of respiratory
            complex 1/NADH-quinone oxidoreductase which is postulated
            to have lost an ancestral formate dehydrogenase activity
            and only vestigial sequence evidence remains of a
            molybdopterin binding site.
          Length = 374

 Score =  219 bits (559), Expect = 6e-63
 Identities = 111/396 (28%), Positives = 159/396 (40%), Gaps = 46/396 (11%)

Query: 906  ESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLK-RQRLLTPFV 964
             SV     VG  I+V  + G+V+R+    N  +NE  L DKGR   DGL    RL  P +
Sbjct: 1    PSVCPFCGVGCGILVYVKDGKVVRIEGDPNHPVNEGRLCDKGRAGLDGLYSPDRLKYPLI 60

Query: 965  R-NCDGQLVATEWEDALIAVAQKLQ-------TSEVAGVVGSLADAEAMVALKDLLNKLG 1016
            R    G+ V   W++AL  +A+KL+          +A   G  A  E    L+ LL  LG
Sbjct: 61   RVGGRGKFVPISWDEALDEIAEKLKEIREKYGPDAIAFYGGGGASNEEAYLLQKLLRALG 120

Query: 1017 SEDLYTEYAFPLEGAGTDLRA--NYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRK 1074
            S ++ +        A   L+A       N +A  E ADLILL G+NP    P+  AR+R+
Sbjct: 121  SNNVDSHARLCHASAVAALKAFGGGAPTNTLADIENADLILLWGSNPAETHPVLAARLRR 180

Query: 1075 GYLTNELDVAYIGPKVDLRYDYEHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADI 1134
                    +  I P              AD    +  G+      L+ A+    + G   
Sbjct: 181  AKK-RGAKLIVIDP------RRTETAAKADEWLPIRPGTDAA---LALAEWAAEITGVP- 229

Query: 1135 SDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQK---------AASQVAAL--DIGYKP 1183
               A  +  + +  A                 L +         A + +AAL  +IG   
Sbjct: 230  ---AETIRALAREFAAA-----KRAVILWGMGLTQHTNGTQNVRAIANLAALTGNIGRPG 281

Query: 1184 GTSAIREKPPKVLFLLGADEGSISRDDVGKDCFIIYQGHHGDHGASIADAILPGAAYTEK 1243
            G       P     +   D   +       D  ++      +  A  AD +LP A Y EK
Sbjct: 282  GGLGPGGNP----LVSAPDANRVRAALKKLDFVVVIDIFMTETAA-YADVVLPAATYLEK 336

Query: 1244 QSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
            + TY NTEGR Q    AV PPG AR DW+I+R L++
Sbjct: 337  EGTYTNTEGRVQLFRQAVEPPGEARSDWEILRELAK 372


>gnl|CDD|236118 PRK07860, PRK07860, NADH dehydrogenase subunit G; Validated.
          Length = 797

 Score =  209 bits (534), Expect = 6e-56
 Identities = 121/369 (32%), Positives = 158/369 (42%), Gaps = 85/369 (23%)

Query: 790  PVAACAMPVMKGWRVKT--NSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMA 847
            P A+C   V  G  VKT   S +  +A+ GVME LL+NHPLDCP+CD+GGEC L  Q+ A
Sbjct: 61   PQASCTTTVTDGMVVKTQLTSPVADKAQHGVMELLLINHPLDCPVCDKGGECPL--QNQA 118

Query: 848  FGNDLGTT------------------------------------------------GRGS 859
              N    +                                                 RG+
Sbjct: 119  MSNGRAESRFTDVKRTFPKPINISTQVLLDRERCVLCARCTRFSDQIAGDPFIDLQERGA 178

Query: 860  DMQVGTYVEKLFLSELSGNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNII 919
              QVG Y  + F S  SGN + +CPVGALT   Y F ARP++   T SV    A G    
Sbjct: 179  LQQVGIYEGEPFQSYFSGNTVQICPVGALTGAAYRFRARPFDLVSTPSVCEHCASGCAQR 238

Query: 920  VSTRTGEVLRVLPRLNEDINEEWLADKGRFAYD-GLKRQRLLTPFVRNCDGQLVATEWED 978
               R G+VLR L   + ++NEEW  DKGR+A+    +  R+ TP VR+ DG+L    W +
Sbjct: 239  TDHRRGKVLRRLAGDDPEVNEEWNCDKGRWAFTYATQPDRITTPLVRDEDGELEPASWSE 298

Query: 979  ALIAVAQKLQTSEVAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGT---DL 1035
            AL   A+ L  +   G VG L      V           ED Y    F     GT   D 
Sbjct: 299  ALAVAARGL--AAARGRVGVLVGGRLTV-----------EDAYAYAKFARVALGTNDIDF 345

Query: 1036 RAN-------YLLNNKIAGA---------EEADLILLIGTNPRFEAPLFNARIRKGYLTN 1079
            RA          L  ++AG          E+A  +LL+G  P  E+P+   R+RK    +
Sbjct: 346  RARPHSAEEADFLAARVAGRGLGVTYADLEKAPAVLLVGFEPEEESPIVFLRLRKAARKH 405

Query: 1080 ELDVAYIGP 1088
             L V  I P
Sbjct: 406  GLKVYSIAP 414



 Score =  116 bits (292), Expect = 3e-26
 Identities = 39/71 (54%), Positives = 50/71 (70%), Gaps = 2/71 (2%)

Query: 685 PVAACAMPVMKGWRVKT--NSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMA 742
           P A+C   V  G  VKT   S +  +A+ GVME LL+NHPLDCP+CD+GGEC LQ+Q+M+
Sbjct: 61  PQASCTTTVTDGMVVKTQLTSPVADKAQHGVMELLLINHPLDCPVCDKGGECPLQNQAMS 120

Query: 743 FGSDRSRFTDI 753
            G   SRFTD+
Sbjct: 121 NGRAESRFTDV 131



 Score = 34.2 bits (79), Expect = 0.53
 Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 1231 ADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALS 1278
            AD +LP A   EK  T++N EGR +    A+   G A  D +++ AL+
Sbjct: 596  ADVVLPVAPVAEKAGTFLNWEGRLRPFEAALRTTG-ALSDLRVLDALA 642


>gnl|CDD|239173 cd02772, MopB_NDH-1_NuoG2, MopB_NDH-1_NuoG2: The second domain of the
            NuoG subunit of the NADH-quinone oxidoreductase/NADH
            dehydrogenase-1 (NDH-1), found in beta- and
            gammaproteobacteria. The NDH-1 is the first
            energy-transducting complex in the respiratory chain and
            functions as a redox pump that uses the redox energy to
            translocate H+ ions across the membrane, resulting in a
            significant contribution to energy production. In
            Escherichia coli NDH-1, the largest subunit is encoded by
            the nuoG gene, and is part of the 14 distinct subunits
            constituting the functional enzyme. The NuoG subunit is
            made of two domains: the first contains three binding
            sites for FeS clusters (the fer2 domain), the second
            domain (this CD), is of unknown function or, as
            postulated, has lost an ancestral formate dehydrogenase
            activity that became redundant during the evolution of
            the complex I enzyme. Although only vestigial sequence
            evidence remains of a molybdopterin binding site, this
            protein domain belongs to the molybdopterin_binding
            (MopB) superfamily of proteins.
          Length = 414

 Score =  184 bits (469), Expect = 1e-50
 Identities = 118/423 (27%), Positives = 181/423 (42%), Gaps = 66/423 (15%)

Query: 907  SVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVR 965
            SV   DA+GSN++V  +  +V+RV+PR NE INE WL+D+ RF+Y+GL  + RL  P ++
Sbjct: 2    SVSPHDALGSNLVVHVKNNKVMRVVPRENEAINECWLSDRDRFSYEGLNSEDRLTKPMIK 61

Query: 966  NCDGQLVATEWEDALIAVAQKLQT---SEVAGVVGSLADA----EAMVALKDLLNKLGSE 1018
              DGQ    +WE AL  VA+ L        A  +G+LA      E +  L+ L   LGS+
Sbjct: 62   K-DGQWQEVDWETALEYVAEGLSAIIKKHGADQIGALASPHSTLEELYLLQKLARGLGSD 120

Query: 1019 DLYT---EYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKG 1075
            ++     +  F  +           L   IA   E D +L+IG+N R E PL   R+R+ 
Sbjct: 121  NIDHRLRQSDFRDDAK---ASGAPWLGMPIAEISELDRVLVIGSNLRKEHPLLAQRLRQA 177

Query: 1076 YLTNELDVAYIGP-KVDLRYDYEH------------LGESADLIKQLASGSHPFSKK--- 1119
                   ++ I P   D  +                L + A  + +    + P       
Sbjct: 178  V-KKGAKLSAINPADDDFLFPLSGKAIVAPSALANALAQVAKALAEEKGLAVPDEDAKVE 236

Query: 1120 -----------LSAAKKPLIVVGADI---SDGAAVLALVQQLAAKVTCESDVPCDWKVLN 1165
                       L +A++  + +G         A + AL Q++ AK+T  +        L 
Sbjct: 237  ASEEARKIAASLVSAERAAVFLGNLAQNHPQAATLRALAQEI-AKLTGAT--------LG 287

Query: 1166 ILQKAASQVAALDIGYKPGTS----AIREKPPKVLFLLGA------DEGSISRDDVGKDC 1215
            +L + A+ V A   G  P       A+ E+P K   LL           + +   + +  
Sbjct: 288  VLGEGANSVGAYLAGALPHGGLNAAAMLEQPRKAYLLLNVEPELDCANPAQALAALNQAE 347

Query: 1216 FIIYQGHHGDHGA-SIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKII 1274
            F++             AD +LP A +TE   T+VN EGR Q     V P G AR  WK++
Sbjct: 348  FVVALSAFASAALLDYADVLLPIAPFTETSGTFVNLEGRVQSFKGVVKPLGEARPAWKVL 407

Query: 1275 RAL 1277
            R L
Sbjct: 408  RVL 410


>gnl|CDD|236170 PRK08166, PRK08166, NADH dehydrogenase subunit G; Validated.
          Length = 791

 Score =  180 bits (459), Expect = 1e-46
 Identities = 139/563 (24%), Positives = 210/563 (37%), Gaps = 97/563 (17%)

Query: 791  VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQD------- 843
            V +C  P   G  +  +    +  R  V+E+L+ NHP DCP+C++GG C LQD       
Sbjct: 64   VMSCMTPATDGTFISIDDPEAKAFRASVVEWLMTNHPHDCPVCEEGGNCHLQDMTVMTGH 123

Query: 844  --------------------------------------QSMAFGNDLGTTGRGSDMQVGT 865
                                                  +  A G DLG  G   ++  G 
Sbjct: 124  SFRRYRFTKRTHRNQDLGPFISHEMNRCIACYRCVRYYKDYAGGTDLGVYGAHDNVYFGR 183

Query: 866  YVEKLFLSELSGNVIDLCPVGALTSKPYSF-TARPWETRKTESVDVLDAVGSNIIVSTRT 924
              +    SE SGN++++CP G  T K +S    R W+ +   S+    +VG NI    R 
Sbjct: 184  PEDGTLESEFSGNLVEVCPTGVFTDKTHSERYNRKWDMQFAPSICQHCSVGCNISPGERY 243

Query: 925  GEVLRVLPRLNEDINEEWLADKGRFAYDGLKR-QRLLTPFVRNCDGQLVATEWEDALIAV 983
            GE+ R+  R N  +N  +L D+GRF Y  +    R   P  R  D   +    + AL   
Sbjct: 244  GELRRIENRYNGAVNGYFLCDRGRFGYGYVNLKDRPRQPLQRRGDD-FITLNADQALQGA 302

Query: 984  AQKL-QTSEVAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGTDLRANYLLN 1042
            A  L Q  +V G+    A  E+  AL++L   +G+E+ YT  A   E     L    L  
Sbjct: 303  ADILRQAKKVIGIGSPRASLESNFALREL---VGAENFYTGIAAG-EQERLQLALKVLRE 358

Query: 1043 NKI-----AGAEEADLILLIGTNPRFEAPLFNARIRKGYLTNELDVAYIGPKVDLRYD-- 1095
              I        E  D +L++G +    A      +R+       ++A    KV       
Sbjct: 359  GGIYTPSLREIESYDAVLVLGEDLTQTAARVALAVRQAVKGKAREMA-AAQKVADWQIAA 417

Query: 1096 YEHLGESADLIKQLASGSHP-FSKKLSAAKKPLIVVGADISDGAAVLALVQQLAAKVTCE 1154
              ++G+ A           P F   L+ AKKPLI+ G      A + A      A     
Sbjct: 418  VRNIGQRA---------KSPLFITNLAGAKKPLIISGTSAGSPAIIEAAANVAKALKGRG 468

Query: 1155 SDVPCDWKVLNILQKAASQVAALDIGYKPGT--SAIREKPPKVLFLLGAD---EGSISRD 1209
            +DV     +  +  +A S   AL  G         +       + +L  D       +R 
Sbjct: 469  ADV----GITLVAPEANSMGLALLGGGSLEEALEELESGRADAVIVLENDLYRHAPAARV 524

Query: 1210 DV---GKDCFIIYQGHHGDHGASI----ADAILPGAAYTEKQSTYVNTEGRAQQTLTAVT 1262
            D         I+      DH  +     A  +LP A++ E   T VN EGRAQ+      
Sbjct: 525  DAALAKAPLVIVL-----DHQRTATMEKAHLVLPAASFAESDGTLVNNEGRAQRFFQVYD 579

Query: 1263 PPG-----LAREDWKIIRALSEG 1280
            P       +  E W+ + +L   
Sbjct: 580  PAYYDSKTVMLESWRWLHSLHST 602



 Score = 77.3 bits (191), Expect = 3e-14
 Identities = 30/124 (24%), Positives = 48/124 (38%), Gaps = 41/124 (33%)

Query: 668 FIDDKKVMVDPGTTVLQ---------P----------VAAC---AM-------------- 691
            +D K+  V+    +L+         P          V AC   A+              
Sbjct: 5   HVDGKEYEVNGADNLLEACLSLGIDIPYFCWHPALGSVGACRQCAVKQYQNPEDTRGRLV 64

Query: 692 -----PVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSD 746
                P   G  +  +    +  R  V+E+L+ NHP DCP+C++GG C LQD ++  G  
Sbjct: 65  MSCMTPATDGTFISIDDPEAKAFRASVVEWLMTNHPHDCPVCEEGGNCHLQDMTVMTGHS 124

Query: 747 RSRF 750
             R+
Sbjct: 125 FRRY 128


>gnl|CDD|239172 cd02771, MopB_NDH-1_NuoG2-N7, MopB_NDH-1_NuoG2-N7: The second domain
            of the NuoG subunit (with a [4Fe-4S] cluster, N7) of the
            NADH-quinone oxidoreductase/NADH dehydrogenase-1 (NDH-1)
            found in various bacteria. The NDH-1 is the first
            energy-transducting complex in the respiratory chain and
            functions as a redox pump that uses the redox energy to
            translocate H+ ions across the membrane, resulting in a
            significant contribution to energy production. In
            Escherichia coli NDH-1, the largest subunit is encoded by
            the nuoG gene, and is part of the 14 distinct subunits
            constituting the functional enzyme. The NuoG subunit is
            made of two domains: the first contains three binding
            sites for FeS clusters (the fer2 domain), the second
            domain (this CD), is of unknown function or, as
            postulated, has lost an ancestral formate dehydrogenase
            activity that became redundant during the evolution of
            the complex I enzyme. Unique to this group, compared to
            the other prokaryotic and eukaryotic groups in this
            domain protein family (NADH-Q-OR-NuoG2), is an N-terminal
            [4Fe-4S] cluster (N7/N1c) present in the second domain.
            Although only vestigial sequence evidence remains of a
            molybdopterin binding site, this protein domain belongs
            to the molybdopterin_binding (MopB) superfamily of
            proteins.
          Length = 472

 Score =  131 bits (331), Expect = 5e-32
 Identities = 109/431 (25%), Positives = 164/431 (38%), Gaps = 73/431 (16%)

Query: 913  AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLK-RQRLLTPFVRNCDGQL 971
            +VG NI +  R GE+ RV  R N  +N  +L D+GRF Y  +  R RL  P +R   G L
Sbjct: 8    SVGCNISLGERYGELRRVENRYNGAVNHYFLCDRGRFGYGYVNSRDRLTQPLIRR-GGTL 66

Query: 972  VATEWEDALIAVAQKLQTSE--VAGVVGSLADAEAMVALKDLLNK-LGSEDLYTEYAFPL 1028
            V   W +AL   A +L+ ++  V G+    A  E+  AL+ L+   LG+ ++        
Sbjct: 67   VPVSWNEALDVAAARLKEAKDKVGGIGSPRASNESNYALQKLVGAVLGTNNVDHR---AR 123

Query: 1029 EGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRKG------------- 1075
                  LR   +    +   E AD +L++G +    AP     +R+              
Sbjct: 124  RLIAEILRNGPIYIPSLRDIESADAVLVLGEDLTQTAPRIALALRQAARRKAVELAALSG 183

Query: 1076 --------------------YLTNELDVAY--IGPK--VDLRYDYEHLGES-ADLIKQLA 1110
                                ++ N L      I  +           LG + A  +   A
Sbjct: 184  IPKWQDAAVRNIAQGAKSPLFIVNALATRLDDIAAESIRASPGGQARLGAALARAVDASA 243

Query: 1111 SGSHPFSKK---------LSAAKKPLIVVGADISDGAAVLALVQQLAAKVTCESDVPCDW 1161
            +G    + K         L+ AKKPLIV G        + A      AK           
Sbjct: 244  AGVSGLAPKEKAARIAARLTGAKKPLIVSGTLSGSLELIKAAAN--LAKALKRRGENAGL 301

Query: 1162 KVLNILQKAASQVAAL--DIGYKPGTS------AIREKPPKVLFLLGAD-EGSISRDDV- 1211
             +   +++  S    L      +PG        A+ +     L +LG D   S     V 
Sbjct: 302  TLA--VEEGNSPGLLLLGGHVTEPGLDLDGALAALEDGSADALIVLGNDLYRSAPERRVE 359

Query: 1212 ---GKDCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAV-TPPGLA 1267
                   F++   H     A  AD +LP A++ EK  T+VN EGRAQ+   A   P G A
Sbjct: 360  AALDAAEFVVVLDHFLTETAERADVVLPAASFAEKSGTFVNYEGRAQRFFKAYDDPAGDA 419

Query: 1268 REDWKIIRALS 1278
            R DW+ + AL+
Sbjct: 420  RSDWRWLHALA 430


>gnl|CDD|225918 COG3383, COG3383, Uncharacterized anaerobic dehydrogenase [General
            function prediction only].
          Length = 978

 Score =  116 bits (293), Expect = 3e-26
 Identities = 95/393 (24%), Positives = 144/393 (36%), Gaps = 118/393 (30%)

Query: 791  VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQG-GECDLQDQSMAFG 849
            V +C+ PV  G  V+TNSE  + AR   M+ +L NHPL C +CD   G+C+L +  MA G
Sbjct: 61   VRSCSTPVEDGMVVRTNSERVKEARREAMDRILSNHPLYCTVCDNNNGDCELHNMVMALG 120

Query: 850  ND-----------------------------------------------LGTTGRGSDMQ 862
                                                             L    RG D +
Sbjct: 121  LTEQRYPYEEKNPPYPKDESNPFYIYDPNQCILCGRCVEACQEVQVNEALTIDWRGEDPR 180

Query: 863  VGTYVEKLFLSELS----GNVIDLCPVGALTSKPYSFTAR-------------------- 898
            V  +   + ++E S    G  + +CPV AL  K     A                     
Sbjct: 181  V-IWDNDVPINESSCVSCGACVTVCPVNALMEKSMLGEAGYLTGINEDTLEPMIDLVKKV 239

Query: 899  -PW-----------------ETRKTESVDVLDAVGSNIIVSTRTGEVLRVLPRLNEDINE 940
             P                    +KT++V     VG +  V T+  E+L+V P      N 
Sbjct: 240  EPEYGPIFAISEIEAAMRETRIKKTKTVCTYCGVGCSFEVWTKGREILKVQPDPEGPANR 299

Query: 941  EWLADKGRFAYDGLK-RQRLLTPFVRNCDGQLVATEWEDALIAVAQKL-QTSEVAG--VV 996
                 KG+F +D +  R R+  P +R    +     WE+AL  VA +L +  E  G   +
Sbjct: 300  ISTCVKGKFGWDFVNSRDRITKPLIRE-GDRFREVSWEEALDLVASRLREIKEKYGGDSI 358

Query: 997  GSLADAEAMVALKDLLNKL-------GSEDLYTEYA--------FPLEGAGTDLRANYLL 1041
            G +A ++       L+ KL        + D  + Y         F   G+G D       
Sbjct: 359  GFIASSKCTNEENYLMQKLARQVFGTNNVDNCSRYCQSPATDGLFRTVGSGAD------- 411

Query: 1042 NNKIAGAEEADLILLIGTNPRFEAPLFNARIRK 1074
            +  I   E ADL+L+IG NP    P+  +R+++
Sbjct: 412  SGSIEDVEGADLVLIIGANPTEGHPVLASRLKR 444



 Score = 73.6 bits (181), Expect = 5e-13
 Identities = 37/121 (30%), Positives = 53/121 (43%), Gaps = 35/121 (28%)

Query: 667 VFIDDKKVMVDPGTTVLQ---------P-------------------------VAACAMP 692
           V ID + + V+ GTT+L+         P                         V +C+ P
Sbjct: 8   VTIDGRSIEVEEGTTILRAANRNGIEIPHICYHESLGPIGSCDTCLVEIDGKLVRSCSTP 67

Query: 693 VMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQG-GECDLQDQSMAFGSDRSRFT 751
           V  G  V+TNSE  + AR   M+ +L NHPL C +CD   G+C+L +  MA G    R+ 
Sbjct: 68  VEDGMVVRTNSERVKEARREAMDRILSNHPLYCTVCDNNNGDCELHNMVMALGLTEQRYP 127

Query: 752 D 752
            
Sbjct: 128 Y 128



 Score = 58.6 bits (142), Expect = 2e-08
 Identities = 21/52 (40%), Positives = 32/52 (61%)

Query: 1228 ASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
            A+ AD +LP +A  EK  T+ NTE R Q+    + P G +R DW+II+ ++ 
Sbjct: 690  ANYADVVLPASASLEKDGTFTNTERRIQRLYKVLEPLGDSRPDWEIIQEVAN 741


>gnl|CDD|204520 pfam10588, NADH-G_4Fe-4S_3, NADH-ubiquinone oxidoreductase-G
           iron-sulfur binding region. 
          Length = 41

 Score = 78.7 bits (195), Expect = 3e-18
 Identities = 23/41 (56%), Positives = 29/41 (70%)

Query: 710 REGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSRF 750
           R+ ++E LL NHPLDCP CD+ G C+LQD +   G D SRF
Sbjct: 1   RKTILELLLANHPLDCPTCDKNGNCELQDLAYELGVDESRF 41



 Score = 72.1 bits (178), Expect = 6e-16
 Identities = 20/37 (54%), Positives = 26/37 (70%)

Query: 815 REGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGND 851
           R+ ++E LL NHPLDCP CD+ G C+LQD +   G D
Sbjct: 1   RKTILELLLANHPLDCPTCDKNGNCELQDLAYELGVD 37


>gnl|CDD|214918 smart00929, NADH-G_4Fe-4S_3, NADH-ubiquinone oxidoreductase-G
           iron-sulfur binding region. 
          Length = 41

 Score = 74.5 bits (184), Expect = 1e-16
 Identities = 22/41 (53%), Positives = 30/41 (73%)

Query: 710 REGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGSDRSRF 750
           R+ ++E LL NHPLDCP+CD+ GEC+LQD +   G D  R+
Sbjct: 1   RKTILELLLANHPLDCPVCDKNGECELQDLAYELGVDEQRY 41



 Score = 68.4 bits (168), Expect = 1e-14
 Identities = 20/35 (57%), Positives = 27/35 (77%)

Query: 815 REGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFG 849
           R+ ++E LL NHPLDCP+CD+ GEC+LQD +   G
Sbjct: 1   RKTILELLLANHPLDCPVCDKNGECELQDLAYELG 35


>gnl|CDD|140326 PTZ00305, PTZ00305, NADH:ubiquinone oxidoreductase; Provisional.
          Length = 297

 Score = 77.8 bits (191), Expect = 3e-15
 Identities = 37/79 (46%), Positives = 51/79 (64%), Gaps = 3/79 (3%)

Query: 674 VMVDPGTTVLQPVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGE 733
           V VD GT  L  V +CA   + G  + T+S + R AREG +E +L+NHP DCPIC+Q   
Sbjct: 118 VQVD-GTQNL--VVSCATVALPGMSIITDSRLVRDAREGNVELILINHPNDCPICEQATN 174

Query: 734 CDLQDQSMAFGSDRSRFTD 752
           CDLQ+ SM +G+D  R+ +
Sbjct: 175 CDLQNVSMNYGTDIPRYKE 193



 Score = 75.5 bits (185), Expect = 2e-14
 Identities = 30/62 (48%), Positives = 42/62 (67%)

Query: 791 VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGN 850
           V +CA   + G  + T+S + R AREG +E +L+NHP DCPIC+Q   CDLQ+ SM +G 
Sbjct: 127 VVSCATVALPGMSIITDSRLVRDAREGNVELILINHPNDCPICEQATNCDLQNVSMNYGT 186

Query: 851 DL 852
           D+
Sbjct: 187 DI 188


>gnl|CDD|239154 cd02753, MopB_Formate-Dh-H, Formate dehydrogenase H (Formate-Dh-H)
            catalyzes the reversible oxidation of formate to CO2 with
            the release of a proton and two electrons. It is a
            component of the anaerobic formate hydrogen lyase
            complex. The E. coli formate dehydrogenase H (Fdh-H) is a
            monomer composed of a single polypeptide chain with a  Mo
            active site region and a [4Fe-4S] center. Members of the
            MopB_Formate-Dh-H CD belong to the molybdopterin_binding
            (MopB) superfamily of proteins.
          Length = 512

 Score = 64.5 bits (158), Expect = 2e-10
 Identities = 45/198 (22%), Positives = 73/198 (36%), Gaps = 61/198 (30%)

Query: 913  AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLK-RQRLLTPFVRNCDGQL 971
             VG  + +  +  +++ V P     +N   L  KGRF +D +  + RL  P +R  +G+ 
Sbjct: 8    GVGCGLELWVKDNKIVGVEPVKGHPVNRGKLCVKGRFGFDFVNSKDRLTKPLIRK-NGKF 66

Query: 972  VATEWEDALIAVAQKLQT-------SEVAGVVGS-------------------------- 998
            V   W++AL  VA +L+          +A    +                          
Sbjct: 67   VEASWDEALSLVASRLKEIKDKYGPDAIAFFGSAKCTNEENYLFQKLARAVGGTNNVDHC 126

Query: 999  --LADAEAMVALKDLLNKLGSEDLYTEYAFPLEGAGTDLRANYLLNNKIAGAEEADLILL 1056
              L  +  +  L + L                 GA T         N IA  EEAD+IL+
Sbjct: 127  ARLCHSPTVAGLAETLG---------------SGAMT---------NSIADIEEADVILV 162

Query: 1057 IGTNPRFEAPLFNARIRK 1074
            IG+N     P+   RI++
Sbjct: 163  IGSNTTEAHPVIARRIKR 180



 Score = 63.0 bits (154), Expect = 7e-10
 Identities = 26/52 (50%), Positives = 35/52 (67%)

Query: 1228 ASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
            A +AD +LP A++ EK  T+ NTE R Q+   AV PPG AR DW+II+ L+ 
Sbjct: 386  AELADVVLPAASFAEKDGTFTNTERRVQRVRKAVEPPGEARPDWEIIQELAN 437


>gnl|CDD|204199 pfam09326, DUF1982, Domain of unknown function (DUF1982).  Members of
            this family of functionally uncharacterized domains are
            found in the C-terminal region of various prokaryotic
            NADH dehydrogenases.
          Length = 48

 Score = 55.0 bits (133), Expect = 9e-10
 Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 5/47 (10%)

Query: 1270 DWKIIRALSEGVSSNLSSEPVDVKQKNLEDYYMTDPISRASGTMAKC 1316
             W  +   + G    L   P    +  ++D+Y+TDPISRAS TMA+C
Sbjct: 7    GWTALALSAAGGK--LGKAPF---RSPIKDFYLTDPISRASPTMAEC 48


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 59.3 bits (144), Expect = 1e-08
 Identities = 46/268 (17%), Positives = 119/268 (44%), Gaps = 16/268 (5%)

Query: 2   KAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQL 61
           K ++E+++L  +  S++  L E  + QL   ++  +E++  LA       +L+   +   
Sbjct: 685 KLEEELKSLKNELRSLEDLLEE-LRRQLEELERQLEELKRELAALEEELEQLQSRLEELE 743

Query: 62  NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQ 121
              ++ ++E++ L  +   ++ EL    +  L    KE+++E + +     L    E  +
Sbjct: 744 EELEELEEELEELQERLEELEEELES-LEEALAKL-KEEIEELEEKRQ--ALQEELEELE 799

Query: 122 KEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLD-FKAKCEVY 180
           +E    E     L++  E+  +++E        ++++++ E+    +  LD  + + E  
Sbjct: 800 EELEEAERRLDALERELESLEQRRE-----RLEQEIEELEEEIEELEEKLDELEEELEEL 854

Query: 181 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH 240
           E E ++LK EL + + + +E    LKE  +++E + +++      +L+  +  +  ++  
Sbjct: 855 EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELR-----ELESELAELKEEIEK 909

Query: 241 LHTSKESLATRQPSEQSLIDSTTDTNCP 268
           L    E L  +    +  +    +    
Sbjct: 910 LRERLEELEAKLERLEVELPELEEELEE 937



 Score = 58.2 bits (141), Expect = 3e-08
 Identities = 51/277 (18%), Positives = 125/277 (45%), Gaps = 17/277 (6%)

Query: 375 LEAEVKDSAGPMTSFLESSVTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTETQR 434
           L+ ++ + +G +T     +  S +  +  L         L+   EK+E   ++L+ E + 
Sbjct: 641 LDGDLVEPSGSITG-GSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRS 699

Query: 435 SRKLEEELQDTRQTCE---EVVRKEVSIVT--VTLFLERLRSSHSKRSKNADSLVRLQKA 489
              L EEL+   +  E   E +++E++ +   +     RL     +  +  + L  LQ+ 
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQER 759

Query: 490 QDEIQTLLAKSASIKTELREFYQTQLNN--NVKEKLKEFQTQLDSAELTLHREFQQKEKT 547
            +E++  L        +L+E  +        ++E+L+E + +L+ AE  L    ++ E  
Sbjct: 760 LEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESL 819

Query: 548 I--VENAAKQLKQISEKPGATKLMLD-FKAKCEVYEAENKKLKAELTQRDIKLKETNHLL 604
               E   ++++++ E+    +  LD  + + E  E E ++LK EL + + + +E    L
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDEL 879

Query: 605 KEESKKRELMAKKVESV------MQSQLQQAIHMITS 635
           KE  +++E + +++  +      ++ ++++    +  
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916



 Score = 48.9 bits (117), Expect = 2e-05
 Identities = 44/198 (22%), Positives = 91/198 (45%), Gaps = 26/198 (13%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
           KE+IE LE   Q   +   +LEEEL++  +  +               LER   S  +R 
Sbjct: 778 KEEIEELEEKRQALQEELEELEEELEEAERRLDA--------------LERELESLEQRR 823

Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
           +  +    +++ ++EI+ L  K   ++ EL E    +    +KE+L+E + + +  E  L
Sbjct: 824 ERLEQ--EIEELEEEIEELEEKLDELEEELEEL--EKELEELKEELEELEAEKEELEDEL 879

Query: 538 --HREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDI 595
               E +++ +  +     +L ++ E+      +   + + E  EA+ ++L+ EL + + 
Sbjct: 880 KELEEEKEELEEELRELESELAELKEE------IEKLRERLEELEAKLERLEVELPELEE 933

Query: 596 KLKETNHLLKEESKKREL 613
           +L+E      E   +RE+
Sbjct: 934 ELEEEYEDTLETELEREI 951



 Score = 47.0 bits (112), Expect = 8e-05
 Identities = 43/226 (19%), Positives = 104/226 (46%), Gaps = 26/226 (11%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
           +++IE L+  L+   +   +L+EEL + ++  EE+  +   +       ERL    ++  
Sbjct: 266 EKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR------ERLEELENELE 319

Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
           +  + L  L++  + ++  L +  ++  EL      QL   ++E  +E + +L +    L
Sbjct: 320 ELEERLEELKEKIEALKEELEERETLLEEL-----EQLLAELEEAKEELEEKLSALLEEL 374

Query: 538 HREFQQKEKTI-------------VENAAKQLKQISEKPGATKLML-DFKAKCEVYEAEN 583
              F+   + +             +E   ++++ + E+       L D K + +  EAE 
Sbjct: 375 EELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAEL 434

Query: 584 KKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 629
           ++L+ EL + + +L+E    L+E   + + + +++  + Q +LQ+ 
Sbjct: 435 EELQTELEELNEELEELEEQLEELRDRLKELERELAEL-QEELQRL 479



 Score = 45.5 bits (108), Expect = 2e-04
 Identities = 50/271 (18%), Positives = 121/271 (44%), Gaps = 21/271 (7%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           ++ + E++ L  + + ++ EL E  Q +L   ++A+ EI+ L ++   ++ EL E  +  
Sbjct: 235 KELRKELEELEEELSRLEEELEE-LQEEL---EEAEKEIEELKSELEELREELEELQEEL 290

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN------NVKEKLKEFQTQLDSAELT 114
           L    + ++EI+ L  + + ++    E  + +L         +KEK++  + +L     T
Sbjct: 291 L----ELKEEIEELEGEISLLRER-LEELENELEELEERLEELKEKIEALKEEL-EERET 344

Query: 115 LHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQI-SEKPGATKLMLDF 173
           L  E +Q         AK+  +   +   ++ + + E   ++L ++ +E       + + 
Sbjct: 345 LLEELEQLLA--ELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEEL 402

Query: 174 KAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQ--SQLQQAI 231
           K + E  E   ++L   L     +LKE    L+E   + E + +++E + +   +L+  +
Sbjct: 403 KREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRL 462

Query: 232 HMITSDVTHLHTSKESLATRQPSEQSLIDST 262
             +  ++  L    + L     S ++ +D  
Sbjct: 463 KELERELAELQEELQRLEKELSSLEARLDRL 493



 Score = 44.7 bits (106), Expect = 3e-04
 Identities = 44/215 (20%), Positives = 97/215 (45%), Gaps = 27/215 (12%)

Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
           L   KE+IE LE  +    +R  +LE EL++  +  EE+  K  +        E L    
Sbjct: 290 LLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEA------LKEELEERE 343

Query: 474 SKRSKNADSLVRLQKAQDEIQTLL--------AKSASIKTELREFYQTQLNNNVKEKLKE 525
           +   +    L  L++A++E++  L            +++ EL E         ++ +L+E
Sbjct: 344 TLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAEL--EAELAEIRNELEE 401

Query: 526 FQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKK 585
            + +++S E  L     ++    +E+  ++LK++  +      + + + + E    E ++
Sbjct: 402 LKREIESLEERL-----ERLSERLEDLKEELKELEAE------LEELQTELEELNEELEE 450

Query: 586 LKAELTQRDIKLKETNHLLKEESKKRELMAKKVES 620
           L+ +L +   +LKE    L E  ++ + + K++ S
Sbjct: 451 LEEQLEELRDRLKELERELAELQEELQRLEKELSS 485



 Score = 42.0 bits (99), Expect = 0.003
 Identities = 48/225 (21%), Positives = 92/225 (40%), Gaps = 15/225 (6%)

Query: 413 TLKISKEKIESLERNLQTETQR----SRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLER 468
            LK   E++      LQ E         +LE E+   R+  EE+  +   +       ER
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEEL------EER 324

Query: 469 LRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNN--NVKEKLKEF 526
           L     K     + L   +   +E++ LLA+    K EL E     L     + E L+E 
Sbjct: 325 LEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREE 384

Query: 527 QTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKL 586
             +L++    +  E ++ ++ I     +  +         + + + +A+ E  + E ++L
Sbjct: 385 LAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEEL 444

Query: 587 KAELTQRDIKLKETNHLLKE-ESKKRELMAK--KVESVMQSQLQQ 628
             EL + + +L+E    LKE E +  EL  +  ++E  + S   +
Sbjct: 445 NEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEAR 489



 Score = 37.0 bits (86), Expect = 0.080
 Identities = 37/201 (18%), Positives = 82/201 (40%), Gaps = 25/201 (12%)

Query: 434 RSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEI 493
           R  + E +L+ T +  E           +   LE L     K  + A+   R Q+ + E+
Sbjct: 173 RKEEAERKLERTEENLER----------LEDLLEELEKQLEKLERQAEKAERYQELKAEL 222

Query: 494 Q----TLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHR---EFQQKEK 546
           +     LL        +  E  + +L + ++E+L+E Q +L+ AE  +     E ++  +
Sbjct: 223 RELELALLLAKLKELRKELEELEEEL-SRLEEELEELQEELEEAEKEIEELKSELEELRE 281

Query: 547 TIVENAA------KQLKQISEKPGATKLMLDFKAK-CEVYEAENKKLKAELTQRDIKLKE 599
            + E         ++++++  +    +  L+      E  E   ++LK ++     +L+E
Sbjct: 282 ELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEE 341

Query: 600 TNHLLKEESKKRELMAKKVES 620
              LL+E  +    + +  E 
Sbjct: 342 RETLLEELEQLLAELEEAKEE 362



 Score = 31.6 bits (72), Expect = 3.6
 Identities = 34/178 (19%), Positives = 73/178 (41%), Gaps = 22/178 (12%)

Query: 1    QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQ----KAQDEIQTL---LAKSASIKTEL 53
            ++ ++EI+ L  K   ++ EL E  + +L   +    + + E + L   L +    K EL
Sbjct: 831  EELEEEIEELEEKLDELEEELEE-LEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889

Query: 54   REFYQTQLNN-------NQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQT 106
             E  +   +         +K ++ ++ L AK   ++ EL E  +            E + 
Sbjct: 890  EEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELER 949

Query: 107  QLDSAELTLHREFQQKEKTIVENAAKQLKQISE--NEFKQKEKTIVENAAKQLKQISE 162
            +++  E     E  +    +   A ++ +++ E   E K + + + E   K L+ I E
Sbjct: 950  EIERLE-----EEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEE 1002


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 58.9 bits (143), Expect = 2e-08
 Identities = 21/65 (32%), Positives = 35/65 (53%)

Query: 686 VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQDQSMAFGS 745
           V +C     +G  + TN+      R  +M+   VNHPL+C +CD+ GEC+LQ+ +   G 
Sbjct: 57  VYSCNTKAKEGMNILTNTPNLMDERNAIMQTYDVNHPLECGVCDKSGECELQNFTHEMGV 116

Query: 746 DRSRF 750
           +   +
Sbjct: 117 NHQPY 121



 Score = 58.6 bits (142), Expect = 2e-08
 Identities = 20/53 (37%), Positives = 31/53 (58%)

Query: 791 VAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLVNHPLDCPICDQGGECDLQD 843
           V +C     +G  + TN+      R  +M+   VNHPL+C +CD+ GEC+LQ+
Sbjct: 57  VYSCNTKAKEGMNILTNTPNLMDERNAIMQTYDVNHPLECGVCDKSGECELQN 109



 Score = 40.5 bits (95), Expect = 0.007
 Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 3/79 (3%)

Query: 877 GNVIDLCPVGALTSKPYSFTARPWETRKTESVDVLDAVGSNIIVSTRTGEVLRVLP---R 933
           G  I +CPVGAL+S  + +T+  WE +K  +     +    I    +   +L       R
Sbjct: 210 GECIAVCPVGALSSSDFQYTSNAWELKKIPATCPHCSDCCLIYYDVKHSSILNQESKIYR 269

Query: 934 LNEDINEEWLADKGRFAYD 952
           ++ D     L   GRFA+D
Sbjct: 270 VSNDFYFNPLCGAGRFAFD 288


>gnl|CDD|239155 cd02754, MopB_Nitrate-R-NapA-like, Nitrate reductases, NapA
            (Nitrate-R-NapA), NasA, and NarB catalyze the reduction
            of nitrate to nitrite. Monomeric Nas is located in the
            cytoplasm and participates in nitrogen assimilation.
            Dimeric Nap is located in the periplasm and is coupled to
            quinol oxidation via a membrane-anchored tetraheme
            cytochrome. Members of the MopB_Nitrate-R-NapA CD belong
            to the molybdopterin_binding (MopB) superfamily of
            proteins.
          Length = 565

 Score = 58.0 bits (141), Expect = 3e-08
 Identities = 22/52 (42%), Positives = 30/52 (57%)

Query: 1228 ASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
            A  AD +LP A++ EK+ T  N+E R      AV PPG AR DW I+  ++ 
Sbjct: 432  AEYADLVLPAASWGEKEGTMTNSERRVSLLRAAVEPPGEARPDWWILADVAR 483



 Score = 46.8 bits (112), Expect = 6e-05
 Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 1/79 (1%)

Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVRNCDGQL 971
            VG  + +  + G+V+ V       +N   L  KG   +  L    RL  P +R   G+L
Sbjct: 8   GVGCGVEIGVKDGKVVAVRGDPEHPVNRGRLCIKGLNLHKTLNGPERLTRPLLRRNGGEL 67

Query: 972 VATEWEDALIAVAQKLQTS 990
           V   W++AL  +A++ +  
Sbjct: 68  VPVSWDEALDLIAERFKAI 86


>gnl|CDD|181037 PRK07569, PRK07569, bidirectional hydrogenase complex protein HoxU;
           Validated.
          Length = 234

 Score = 55.4 bits (134), Expect = 3e-08
 Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 4/69 (5%)

Query: 685 PVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLV--NHPLDCPICDQGGECDLQDQSMA 742
            + AC  PV +G  V+TN+   +  R  ++E L    NH   C +C   G C+LQD ++ 
Sbjct: 60  LLPACVTPVAEGMVVQTNTPRLQEYRRMIVELLFAEGNHV--CAVCVANGNCELQDLAIE 117

Query: 743 FGSDRSRFT 751
            G D  RF 
Sbjct: 118 VGMDHVRFP 126



 Score = 49.3 bits (118), Expect = 4e-06
 Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 4/64 (6%)

Query: 790 PVAACAMPVMKGWRVKTNSEMTRRAREGVMEFLLV--NHPLDCPICDQGGECDLQDQSMA 847
            + AC  PV +G  V+TN+   +  R  ++E L    NH   C +C   G C+LQD ++ 
Sbjct: 60  LLPACVTPVAEGMVVQTNTPRLQEYRRMIVELLFAEGNHV--CAVCVANGNCELQDLAIE 117

Query: 848 FGND 851
            G D
Sbjct: 118 VGMD 121


>gnl|CDD|130652 TIGR01591, Fdh-alpha, formate dehydrogenase, alpha subunit,
            archaeal-type.  This model describes a subset of formate
            dehydrogenase alpha chains found mainly archaea but also
            in alpha and gamma proteobacteria and a small number of
            gram positive bacteria. The alpha chain contains domains
            for molybdopterin dinucleotide binding and molybdopterin
            oxidoreductase (pfam01568 and pfam00384, respectively).
            The holo-enzyme also contains beta and gamma subunits.
            The enzyme catalyzes the oxidation of formate (produced
            from pyruvate during anaerobic growth) to carbon dioxide
            with the concomitant release of two electrons and two
            protons. The enzyme's purpose is to allow growth on
            formate in some circumstances and, in the case of FdhH in
            gamma proteobacteria, to pass electrons to hydrogenase
            (by which process acid is neutralized). This model is
            well-defined, with only a single fragmentary sequence
            falling between trusted and noise. The alpha subunit of a
            version of nitrate reductase is closely related.
          Length = 671

 Score = 55.2 bits (133), Expect = 2e-07
 Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 16/116 (13%)

Query: 1178 DIGYKPGTS------AIREKPPKVLFLLGADEGSISRDDVGK--------DCFIIYQGHH 1223
             +  +PG        A  +   K L+++G D    S  +  K        +  ++ Q   
Sbjct: 363  KLPAEPGLRIPEMIDAAADGDVKALYIMGEDP-LQSDPNTSKVRKALEKLELLVV-QDIF 420

Query: 1224 GDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
                A  AD +LP AA+ EK+ T+ N E R Q+   AV P G ++ DW+II+ L+ 
Sbjct: 421  MTETAKYADVVLPAAAWLEKEGTFTNAERRIQRFFKAVEPKGESKPDWEIIQELAN 476



 Score = 48.6 bits (116), Expect = 2e-05
 Identities = 39/174 (22%), Positives = 76/174 (43%), Gaps = 13/174 (7%)

Query: 913  AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ-RLLTPFVRNCDGQL 971
             VG ++ +  + G+++RV P      N   L  KG FA++ +  + RL TP +R    + 
Sbjct: 7    GVGCSLNLVVKDGKIVRVEPYQGHKANRGHLCVKGYFAWEFINSKDRLTTPLIRE-GDKF 65

Query: 972  VATEWEDALIAVAQKLQT-------SEVAGVVGS-LADAEAMVALKDLLNKLGSEDLYTE 1023
                W++A+  +A+KL+          +  +  S   + E  +  K     +G+ ++   
Sbjct: 66   REVSWDEAISYIAEKLKEIKEKYGPDSIGFIGSSRGTNEENYLLQKLARAVIGTNNVDNC 125

Query: 1024 YAFPLEGAGTDLRANY---LLNNKIAGAEEADLILLIGTNPRFEAPLFNARIRK 1074
                   +   L+       ++N I+  E ADLI++IG NP    P+    ++ 
Sbjct: 126  ARVCHGPSVAGLKQTVGIGAMSNTISEIENADLIVIIGYNPAESHPVVAQYLKN 179


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 48.5 bits (116), Expect = 1e-05
 Identities = 37/166 (22%), Positives = 73/166 (43%), Gaps = 16/166 (9%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           QK +D++Q+  +K+   +++L    +    +N+  ++E +    +    + EL E +Q+ 
Sbjct: 56  QKEKDKLQSEHSKAILARSKLESLCRELQKHNKTLKEENKARSDEEEEKRKELTEKFQST 115

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
           L + Q   +E       + + K         + N  ++EKLKE   Q +  E    +  +
Sbjct: 116 LKDIQDQMEE-----HSNPNHK-------LCEENEELREKLKELIEQYELREQHFEKLLK 163

Query: 121 QKEKTIVENAAK----QLKQISENEFKQKEKTIVENAAKQLKQISE 162
            K+     N AK    Q +   E E  ++EK I+   A Q+  + E
Sbjct: 164 TKDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKE 209



 Score = 42.7 bits (101), Expect = 8e-04
 Identities = 43/210 (20%), Positives = 74/210 (35%), Gaps = 63/210 (30%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
           + K+ESL R LQ     ++ L+EE +  R   EE  RKE++                   
Sbjct: 73  RSKLESLCRELQKH---NKTLKEENK-ARSDEEEEKRKELTE------------------ 110

Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
                  + Q    +IQ  + + ++   +L E      N  ++EKLKE   Q +  E   
Sbjct: 111 -------KFQSTLKDIQDQMEEHSNPNHKLCE-----ENEELREKLKELIEQYELREQHF 158

Query: 538 HREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKL 597
            +  + K+     N AK                         E   ++ + E  +     
Sbjct: 159 EKLLKTKDLEQQLNDAK------------------------LEQAQEQAEQEQEKHK--- 191

Query: 598 KETNHLLKEESKKRELMAKKVESVMQSQLQ 627
           +E   LLK     +    K+ E  +++QL 
Sbjct: 192 REKEILLK--EAAQVSTLKETEKELRAQLN 219


>gnl|CDD|223321 COG0243, BisC, Anaerobic dehydrogenases, typically
            selenocysteine-containing [Energy production and
            conversion].
          Length = 765

 Score = 47.5 bits (113), Expect = 5e-05
 Identities = 31/119 (26%), Positives = 45/119 (37%), Gaps = 18/119 (15%)

Query: 1175 AALDIGYKPGTSAIREKPP--KVLFLLG-----ADEGSISRDDVG--KDCFIIYQGHH-G 1224
            AA   G      A+ E P   K LF+ G     +     +        D F++       
Sbjct: 413  AAAKPGLSTADRALLEGPYPIKALFVYGGNPVVSAPDDRNVKKALLRDDEFVVVIDIFMT 472

Query: 1225 DHGASIADAILPGAAYTEKQSTYVNTEGRA----QQTLTAVTPPGLAREDWKIIRALSE 1279
            +  A  AD +LP   + EK   Y N         ++    V PPG AR D+ II  L++
Sbjct: 473  ET-AKYADIVLPATTFLEKDDIYTNAGRSYVQLRRK---VVEPPGEARPDYWIIIELAK 527



 Score = 35.2 bits (81), Expect = 0.26
 Identities = 36/151 (23%), Positives = 49/151 (32%), Gaps = 59/151 (39%)

Query: 946  KGRFAYDGL-KRQRLLTPFVRN----CDGQLVATEWEDALIAVAQKLQTSEVAGVVGSLA 1000
            KG    + +    RL  P +R      +G+ V   W++AL  +A KL             
Sbjct: 84   KGARGRERVYSPDRLKYPLLRRVGKRGEGKFVRISWDEALDLIAAKL------------- 130

Query: 1001 DAEAMVALKDLLNKLGSEDLYTEYAFPLEG---AGTDLRANYLLNN-------------- 1043
                   L  ++   GS  L T      E    AG   RA    NN              
Sbjct: 131  -------LPRIIGFYGSGALTTGN----EAGYLAGKLARAFLGGNNIDHCGRYCHAAAAA 179

Query: 1044 ---KIAGA----------EEADLILLIGTNP 1061
                  G+          E ADLI+L G+NP
Sbjct: 180  GLPYTFGSGAATGSYPDIENADLIVLWGSNP 210


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 47.1 bits (112), Expect = 7e-05
 Identities = 78/571 (13%), Positives = 214/571 (37%), Gaps = 62/571 (10%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
            + + E + L  +  +++  L E     L   ++  +E++  L +  S++ E  +  + +
Sbjct: 221 IQEEQEEEELEQEIEALEERLAE-----LEEEKERLEELKARLLEIESLELEALKIREEE 275

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAE--LTLHRE 118
           L   ++  +E++  + +   ++ E+ E          +E+L+  +  L+  E  L   + 
Sbjct: 276 LRELERLLEELEEKIERLEELEREIEEL---------EEELEGLRALLEELEELLEKLKS 326

Query: 119 FQQKEKTIVE--NAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAK 176
            +++ + + E     +   +    E  +  K + E   +  +++ E     +  L+   +
Sbjct: 327 LEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQ 386

Query: 177 CEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITS 236
               E   ++LK EL +    L+E    L+E  K+ E            +L++ +  +  
Sbjct: 387 ---LEEAIQELKEELAELSAALEEIQEELEELEKELE------------ELERELEELEE 431

Query: 237 DVTHLHTSKESLATRQPSEQSLIDSTTDTNCPVTQQYVPSSNQQYAPSSNQQYSIPTSNI 296
           ++  L      L +++     L  +     CPV  Q +P  +++        Y +    +
Sbjct: 432 EIKKLEEQINQLESKELMIAELAGA--GEKCPVCGQELPEEHEKELLE---LYELELEEL 486

Query: 297 NQQYSMFSADSTHMNIVERDIPNSADDMRKYIEMQYSMFSADSTHMNIVERDIPNSADDM 356
            ++ S           +  +I     ++R+  E    +   +      +E  +    +++
Sbjct: 487 EEELSREK----EEAELREEIEELEKELRELEEELIELLELEEALKEELEEKL-EKLENL 541

Query: 357 RKYIEMLLEKPPGNPVSDLEAEVKDSAGPMTSFLESSVTSDIFNEPALYTKTVRSVTLKI 416
            + +E L EK     +  L+ E++     +    E      +        + +R    ++
Sbjct: 542 LEELEELKEKL---QLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKEL 598

Query: 417 SK--EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHS 474
            K  +++E     L+   Q S +L E   +  +  EE          +   LE+L     
Sbjct: 599 KKKLKELEERLSQLEELLQ-SLELSEAENELEEAEEE----------LESELEKLNLQAE 647

Query: 475 KRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLKEFQTQLD-- 531
                  +L  L++  +E++  + +         +  +       ++E+L++ + +L+  
Sbjct: 648 LEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEEL 707

Query: 532 SAELTLHREFQQKEKTIVENAAKQLKQISEK 562
             +L    +  ++ ++      +  K++ + 
Sbjct: 708 LKKLGEIEQLIEELESRKAELEELKKELEKL 738



 Score = 46.3 bits (110), Expect = 1e-04
 Identities = 43/217 (19%), Positives = 96/217 (44%), Gaps = 15/217 (6%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVT-----VTLFLERLRSS 472
            E++E L   L++  +R  KLEE+L+      EE+  ++  +       +    ERL   
Sbjct: 314 LEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEEL 373

Query: 473 HSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNN--NVKEKLKEFQTQL 530
             +  K  + L +L++A  E++  LA+ ++   E++E  +        ++ +L+E + ++
Sbjct: 374 EKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEI 433

Query: 531 DSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAEL 590
              E  +++          E    +L    EK       L  + + E+ E    +L+ EL
Sbjct: 434 KKLEEQINQLES------KELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELE-EL 486

Query: 591 TQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQ 627
            +  +  ++    L+EE ++ E   +++E  +   L+
Sbjct: 487 EEE-LSREKEEAELREEIEELEKELRELEEELIELLE 522



 Score = 44.4 bits (105), Expect = 5e-04
 Identities = 49/255 (19%), Positives = 97/255 (38%), Gaps = 41/255 (16%)

Query: 401 EPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIV 460
           E AL         ++  KE++  L   L+   +   +LE+EL++  +  EE+  +     
Sbjct: 378 EKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEI---- 433

Query: 461 TVTLFLERLRSSHSKRSKNA----------------------DSLVRLQKAQDEIQTLLA 498
                LE   +    +                            L+ L + + E      
Sbjct: 434 ---KKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEEL 490

Query: 499 KSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTI--VENAAKQL 556
                + ELRE  +      ++++L+E + +L           ++ E+ +  +EN  ++L
Sbjct: 491 SREKEEAELREEIE-----ELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEEL 545

Query: 557 KQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ---RDIKLKETNHLLKEESKKREL 613
           +++ EK    +L  +   + E    E K+L  EL     R  +L+E    LKE  KK + 
Sbjct: 546 EELKEKLQLQQL-KEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLK- 603

Query: 614 MAKKVESVMQSQLQQ 628
             ++  S ++  LQ 
Sbjct: 604 ELEERLSQLEELLQS 618



 Score = 39.4 bits (92), Expect = 0.014
 Identities = 38/212 (17%), Positives = 100/212 (47%), Gaps = 7/212 (3%)

Query: 407 KTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTR--QTCEEVVRKEVSIVTVTL 464
           +      L++ +   E LE  L+       +LEE  +  +  Q  EE+ + E  +  +  
Sbjct: 514 EEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKE 573

Query: 465 FLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 524
            LE LR   +++ +  +   RL++ + +++ L  + + ++  L+    ++  N ++E  +
Sbjct: 574 LLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEE 633

Query: 525 EFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENK 584
           E +++L+   L      ++  +  +E   ++++++  +       ++ + + E    E +
Sbjct: 634 ELESELEKLNLQAE--LEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELE 691

Query: 585 KLKAELTQRDIKLKETNHLLKEESKKRELMAK 616
           +L+ EL Q   +L+E   LLK+  +  +L+ +
Sbjct: 692 QLEEELEQLREELEE---LLKKLGEIEQLIEE 720



 Score = 39.0 bits (91), Expect = 0.018
 Identities = 43/211 (20%), Positives = 90/211 (42%), Gaps = 11/211 (5%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
           K KIE LE  L    +    L E L++     E    K++  +      E L        
Sbjct: 184 KAKIEELEGQLSELLEDIEDLLEALEE-----ELKELKKLEEIQEEQEEEELEQEIEALE 238

Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
           +    L   ++  +E++  L +  S++ E  +  + +L   ++  L+E + +++  E  L
Sbjct: 239 ERLAELEEEKERLEELKARLLEIESLELEALKIREEELRE-LERLLEELEEKIERLE-EL 296

Query: 538 HREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKL 597
            RE ++ E    E        + E     + +   + + E  E + +KL++EL +   + 
Sbjct: 297 EREIEELE----EELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEK 352

Query: 598 KETNHLLKEESKKRELMAKKVESVMQSQLQQ 628
            E   LL+E  K+ E   +++E  ++  L++
Sbjct: 353 NELAKLLEERLKELEERLEELEKELEKALER 383



 Score = 30.9 bits (70), Expect = 6.1
 Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 23/173 (13%)

Query: 84  ELREFYQTQLNNNVKEKL----KEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISE 139
           E   F    L +  KE+     + F  +       L +E  ++ K  +E    QL ++ E
Sbjct: 144 EFDAF----LKSKPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLE 199

Query: 140 NEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLK 199
           +   +     +E   K+LK++ E        +  + + E  E E + L+    +R  +L+
Sbjct: 200 D--IEDLLEALEEELKELKKLEE--------IQEEQEEEELEQEIEALE----ERLAELE 245

Query: 200 ETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATRQ 252
           E    L EE K R L  + +E       ++ +  +   +  L    E L   +
Sbjct: 246 EEKERL-EELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELE 297


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 47.0 bits (111), Expect = 7e-05
 Identities = 58/240 (24%), Positives = 108/240 (45%), Gaps = 18/240 (7%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           QKA +E   +  K+    TE +E    +LN  + A   + T    +     EL    Q +
Sbjct: 312 QKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQR 371

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
           L  N+   D+++ +  +     +EL E   T+  NN + +L+E +  L   E  L  E +
Sbjct: 372 LEKNE---DQLKIITMELQKKSSELEEM--TKFKNNKEVELEELKKILAEDE-KLLDEKK 425

Query: 121 QKEKTIVENAAKQLKQISENEFKQKE-----------KTIVENAAKQLKQISEKPGATKL 169
           Q EK   E   K+ + I   + ++KE           KT  E+  K+++ +  +    KL
Sbjct: 426 QFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKL 485

Query: 170 M-LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQ 228
             ++  A C+    ENK+L  E +   ++LK+    +    K+ E M K++E++ + ++ 
Sbjct: 486 KNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMN 545



 Score = 40.1 bits (93), Expect = 0.009
 Identities = 54/244 (22%), Positives = 104/244 (42%), Gaps = 29/244 (11%)

Query: 412 VTLKISKEKIESLERNLQTETQRSRKLEEELQ------DTRQTCEEVVRKEVSIVTVTLF 465
           ++L+ S    ++LE +LQ  T+   +L EE +      +  +     V  E    T +L 
Sbjct: 303 MSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSL- 361

Query: 466 LERLRSSHSKRSKNADSL----VRLQKAQDEIQTLLAKSASIKTELREF----------- 510
            E LR+   +  KN D L    + LQK   E++ +     + + EL E            
Sbjct: 362 EELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLL 421

Query: 511 YQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKE------KTIVENAAKQLKQISEKPG 564
            + +    + E+LK  + +L        +E    E      KT  E+  K+++ +  +  
Sbjct: 422 DEKKQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELE 481

Query: 565 ATKLM-LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQ 623
             KL  ++  A C+    ENK+L  E +   ++LK+    +    K+ E M K++E++ +
Sbjct: 482 KEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEE 541

Query: 624 SQLQ 627
            ++ 
Sbjct: 542 KEMN 545



 Score = 32.8 bits (74), Expect = 1.5
 Identities = 46/219 (21%), Positives = 99/219 (45%), Gaps = 19/219 (8%)

Query: 8   QTLLAKSASIKTELR--EFYQTQLNNN-QKAQDEIQTLLAKSASIKTELREFYQTQLNNN 64
           +  L +   +KTEL   +    +L  +  K   E + L  +++ +  EL++ +Q  + N 
Sbjct: 467 EHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKK-HQEDIINC 525

Query: 65  QKAQD----EIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
           +K ++    +I+ L  K  +++ EL    +  +    + K K  +++ ++  +      +
Sbjct: 526 KKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKK 585

Query: 121 QKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 180
           +K+  I+EN    LK+  EN+ K  E+   EN A + K  +E             +   Y
Sbjct: 586 EKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAE-----------NKQLNAY 634

Query: 181 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKV 219
           E +  KL+ EL     K +E     ++E + +++  +K+
Sbjct: 635 EIKVNKLELELASAKQKFEEIIDNYQKEIEDKKISEEKL 673



 Score = 31.2 bits (70), Expect = 4.2
 Identities = 45/188 (23%), Positives = 81/188 (43%), Gaps = 23/188 (12%)

Query: 415 KISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVV---RKEVSIVTVTLFLERLRS 471
           K  K+K  +  + L     +  KLE EL   +Q  EE++   +KE+    ++   E+L  
Sbjct: 618 KALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKEIEDKKIS--EEKLLE 675

Query: 472 SHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLD 531
              K    AD  V+LQK   EI        +    L E ++ Q +  ++E+         
Sbjct: 676 EVEKAKAIADEAVKLQK---EIDKRCQHKIAEMVALMEKHKHQYDKIIEER--------- 723

Query: 532 SAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELT 591
            +EL L++  +Q++ +       +L  I  +      +L  K + E+ + E +KLK E  
Sbjct: 724 DSELGLYKNKEQEQSSAKAALEIELSNIKAE------LLSLKKQLEIEKEEKEKLKMEAK 777

Query: 592 QRDIKLKE 599
           +    LK+
Sbjct: 778 ENTAILKD 785


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 45.8 bits (109), Expect = 2e-04
 Identities = 38/176 (21%), Positives = 74/176 (42%), Gaps = 17/176 (9%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
             +IE LER ++ E +R  KL EE  + ++   E +R E+  V         R       
Sbjct: 335 LAEIEELEREIEEERKRRDKLTEEYAELKEE-LEDLRAELEEVDKE--FAETRDELKDYR 391

Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLKEFQTQLDSAELT 536
           +      +L+K + EI  L  +   ++ EL+   +   + N     ++    +L+  +  
Sbjct: 392 E------KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445

Query: 537 LHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ 592
              E +++E  + E  A  L +  ++      + D K + +  E E  KL+ EL +
Sbjct: 446 KALEIKKQEWKL-EQLAADLSKYEQE------LYDLKEEYDRVEKELSKLQRELAE 494



 Score = 40.4 bits (95), Expect = 0.007
 Identities = 41/240 (17%), Positives = 90/240 (37%), Gaps = 35/240 (14%)

Query: 408 TVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCE--EVVRKEVS------- 458
                  +  + + E++ER L +  +   KL EE+ +  +  E  E + +E++       
Sbjct: 226 YELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLG 285

Query: 459 ----------IVTVTLFLERLRSSHS-KRSKNADSLVRLQKAQDEIQTLLAKSASIKTEL 507
                     I  +   +  L  S + K  +  D+  RL K + EI  LLA+   ++ E+
Sbjct: 286 EEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREI 345

Query: 508 REFY--QTQLNNNVKEKLKEFQT------QLDSAELTLHREFQQKEKTIV-------ENA 552
            E    + +L     E  +E +       ++D        E +   + +        E  
Sbjct: 346 EEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELK 405

Query: 553 AKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRE 612
            +  +   E    ++ + D  A     EA+  +L+ E   + +++K+    L++ +    
Sbjct: 406 RELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465



 Score = 39.7 bits (93), Expect = 0.014
 Identities = 35/201 (17%), Positives = 91/201 (45%), Gaps = 14/201 (6%)

Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
           L+  +E + SLE+ ++      ++LE  +++  +   + + + ++ +   L   R+    
Sbjct: 739 LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHK-LEEALNDLEARLSHSRIPEIQ 797

Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSA 533
           ++ SK  + + R++    EI+  L +       L + Y  +    ++E+  + + Q+ S 
Sbjct: 798 AELSKLEEEVSRIEARLREIEQKLNRL-----TLEKEYLEKEIQELQEQRIDLKEQIKSI 852

Query: 534 ELTLHREFQQKEK--TIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELT 591
           E  +     +KE+    +E     L+ +  + G      D K + +  EA+ ++L+ ++ 
Sbjct: 853 EKEIENLNGKKEELEEELEELEAALRDLESRLG------DLKKERDELEAQLRELERKIE 906

Query: 592 QRDIKLKETNHLLKEESKKRE 612
           + + ++++    L E   K E
Sbjct: 907 ELEAQIEKKRKRLSELKAKLE 927



 Score = 39.3 bits (92), Expect = 0.018
 Identities = 51/273 (18%), Positives = 113/273 (41%), Gaps = 33/273 (12%)

Query: 375 LEAEVKDSAGPMT--SFLESSVTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTET 432
           LE E+ + +G MT  S           +EPA          L+  +E++E L+R L +  
Sbjct: 644 LEGELFEKSGAMTGGSRAPRGGILFSRSEPA---------ELQRLRERLEGLKRELSSLQ 694

Query: 433 QRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK-RSKNADSLVRLQKAQD 491
              R++E  L +  Q   +  RK   I  +   +E+L     K + +  +    L   + 
Sbjct: 695 SELRRIENRLDELSQELSDASRK---IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQ 751

Query: 492 EIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAEL-TLHREFQQKEKTIVE 550
           EI+ + ++   ++  + E  +  L + ++E L + + +L  + +  +  E  + E+ +  
Sbjct: 752 EIENVKSELKELEARIEE-LEEDL-HKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSR 809

Query: 551 ------------NAAKQLKQISEKPGATKL--MLDFKAKCEVYEAENKKLKAELTQRDIK 596
                       N     K+  EK         +D K + +  E E + L  +  + + +
Sbjct: 810 IEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEE 869

Query: 597 LKETNHLLKEESKKRELMAKKVESVMQSQLQQA 629
           L+E    L++   +   + K+ +  +++QL++ 
Sbjct: 870 LEELEAALRDLESRLGDLKKERDE-LEAQLREL 901



 Score = 37.7 bits (88), Expect = 0.054
 Identities = 46/250 (18%), Positives = 97/250 (38%), Gaps = 38/250 (15%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           Q+ ++ ++ L  + +S+++ELR          +   DE+   L   AS K    E    Q
Sbjct: 677 QRLRERLEGLKRELSSLQSELRRI--------ENRLDELSQEL-SDASRKIGEIEKEIEQ 727

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLH---- 116
           L   ++   E    L +  S   +  E        NVK +LKE + +++  E  LH    
Sbjct: 728 LEQEEEKLKERLEELEEDLSSLEQEIE--------NVKSELKELEARIEELEEDLHKLEE 779

Query: 117 -------REFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKL 169
                  R    +   I    +K  +++S  E + +E     N     K+  EK      
Sbjct: 780 ALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQ 839

Query: 170 ML---------DFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVE 220
                        + + E    + ++L+ EL + +  L++    L +  K+R+ +  ++ 
Sbjct: 840 EQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL- 898

Query: 221 SVMQSQLQQA 230
             ++ ++++ 
Sbjct: 899 RELERKIEEL 908



 Score = 33.1 bits (76), Expect = 1.4
 Identities = 29/154 (18%), Positives = 65/154 (42%), Gaps = 8/154 (5%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFY--QTQLNNN-QKAQDEIQTLLAKSASIKTELREFY 57
            K + EI  LLA+   ++ E+ E    + +L     + ++E++ L A+   +  E  E  
Sbjct: 325 AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETR 384

Query: 58  QTQLNNNQK---AQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLKEFQTQLDSAEL 113
               +  +K    + EI  L  +   ++ EL+   +   + N     ++    +L+  + 
Sbjct: 385 DELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKE 444

Query: 114 TLHREFQQKEKTIVENAAKQLKQISENEFKQKEK 147
               E +++E  + E  A  L +  +  +  KE+
Sbjct: 445 DKALEIKKQEWKL-EQLAADLSKYEQELYDLKEE 477


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 45.8 bits (109), Expect = 2e-04
 Identities = 39/222 (17%), Positives = 95/222 (42%), Gaps = 26/222 (11%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVS----IVTVTLFLERLRSSH 473
           K ++  LE  L     R  +L EEL++ ++  +E   +       +  +   LE LR   
Sbjct: 219 KAELRELELALLV--LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEV 276

Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSA 533
           S+  +       +++ Q E+  L  + + ++ + +           +E+L   + QL+  
Sbjct: 277 SELEE------EIEELQKELYALANEISRLEQQKQIL---------RERLANLERQLEEL 321

Query: 534 ELTLHREFQQKE--KTIVENAAKQLKQISEKPGATKLMLD-FKAKCEVYEAENKKLKAEL 590
           E  L     + +     +    ++L+++ E+  + +  L+  +A+ E  E+  ++L+ +L
Sbjct: 322 EAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381

Query: 591 TQRDIKLKETNHLLKEESKKRELMAKKVES--VMQSQLQQAI 630
                K+ +    +   + + E +  ++E     + +LQQ I
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEI 423



 Score = 45.4 bits (108), Expect = 2e-04
 Identities = 53/229 (23%), Positives = 94/229 (41%), Gaps = 26/229 (11%)

Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSS 472
            ++  +EKI  LE+ L    +   +LEEEL+  R+  EE+ R+   I  +   L RL + 
Sbjct: 685 KIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQ---ISALRKDLARLEAE 741

Query: 473 HSKRSKNADSL-VRLQKAQDEIQTLLAKSASIKTELREF-----YQTQLNNNVKEKLKEF 526
             +  +    L   L + + EI+ L  +    + EL E              +KE+LK  
Sbjct: 742 VEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKAL 801

Query: 527 QTQLDS--AELT-LHREFQQKE-------------KTIVENAAKQLKQISEK-PGATKLM 569
           +  LD   AELT L+ E                  +  +E+  +Q++++SE        +
Sbjct: 802 REALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEI 861

Query: 570 LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKV 618
            + +   E  E+E + L  E    +  L      L+E S++   +  K 
Sbjct: 862 EELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKR 910



 Score = 38.1 bits (89), Expect = 0.035
 Identities = 38/242 (15%), Positives = 97/242 (40%), Gaps = 14/242 (5%)

Query: 22  REFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASI 81
            E  Q +L   ++  +E+   L +      ELR     +++  ++  +E+Q  L   A+ 
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELR----LEVSELEEEIEELQKELYALANE 296

Query: 82  KTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKE--KTIVENAAKQLKQISE 139
            + L +  Q       +E+L   + QL+  E  L     + +     +    ++L+++ E
Sbjct: 297 ISRLEQQKQIL-----RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351

Query: 140 NEFKQKEKTIVENAAKQLKQISEKPGA-TKLMLDFKAKCEVYEAENKKLKAELTQRDIKL 198
                + +  +E    +L+++  +     + +   ++K    E +   L  E+ + + +L
Sbjct: 352 ELESLEAE--LEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL 409

Query: 199 KETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATRQPSEQSL 258
           +      +   ++ E + KK+E     +LQ  +  +  ++  L    E L       +  
Sbjct: 410 ERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469

Query: 259 ID 260
           ++
Sbjct: 470 LE 471



 Score = 31.6 bits (72), Expect = 3.5
 Identities = 33/180 (18%), Positives = 64/180 (35%), Gaps = 30/180 (16%)

Query: 413  TLKISKEKIESLERNLQTETQRSRKLEEELQ------DTRQTCEEVVRKEVSIVTVTL-- 464
             ++   E IESL   ++   +   +LE EL+       + +    ++R E+  ++  L  
Sbjct: 846  QIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905

Query: 465  ----FLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAK-SASIKTELREFYQTQLN--- 516
                  E  R     R K A   +RL+  +  I  L  + S      L E    +     
Sbjct: 906  LESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIED 965

Query: 517  --NNVKEKLKEFQTQLDS------------AELTLHREFQQKEKTIVENAAKQLKQISEK 562
                 + +LK  + ++               EL    +F   +K  +  A + L++  E+
Sbjct: 966  DEEEARRRLKRLENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEE 1025


>gnl|CDD|132519 TIGR03479, DMSO_red_II_alp, DMSO reductase family type II enzyme,
            molybdopterin subunit.  This model represents the
            molybdopterin subunit, typically called the alpha
            subunit, of various proteins that also contain an
            iron-sulfur subunit and a heme b subunit. The group
            includes two distinct but very closely related
            periplasmic proteins of anaerobic respiration, selenate
            reductase and chlorate reductase. Other members of this
            family include dimethyl sulphide dehydrogenase,
            ethylbenzene dehydrogenase, and an archaeal respiratory
            nitrate reductase. This alpha subunit has a twin-arginine
            translocation (TAT) signal for Sec-independent
            translocation across the plasma membrane.
          Length = 912

 Score = 44.0 bits (104), Expect = 6e-04
 Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 13/64 (20%)

Query: 1225 DHGASIADAILPGAAYTEK-------QSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRAL 1277
            D  A  AD +LP A + EK          ++N   R       V P G ++ DW+I   L
Sbjct: 585  DSTAMYADIVLPAAWHYEKHDLRTTSGHRFINFFDRP------VKPMGESKTDWQIFALL 638

Query: 1278 SEGV 1281
            ++ +
Sbjct: 639  AKKI 642


>gnl|CDD|130767 TIGR01706, NAPA, periplasmic nitrate reductase, large subunit.  This
            model represents the large subunit of a family of nitrate
            reductases found in proteobacteria which are localized to
            the periplasm. This subunit binds molybdopterin and
            contains a twin-arginine motif at the N-terminus. The
            protein associates with NapB, a soluble heme-containing
            protein and NapC, a membrane-bound cytochrome c. The
            periplasmic nitrate reductases are not involved in the
            assimilation of nitrogen, and are not directly involved
            in the formation of electrochemical gradients (i.e.
            respiration) either. Rather, the purpose of this enzyme
            is either dissimilatory (i.e. to dispose of excess
            reductive equivalents) or indirectly respiratory by
            virtue of the consumption of electrons derived from NADH
            via the proton translocating NADH dehydrogenase. The
            enzymes from Alicagenes eutrophus and Paracoccus
            pantotrophus have been characterized. In E. coli (as well
            as other organisms) this gene is part of a large nitrate
            reduction operon (napFDAGHBC) [Energy metabolism,
            Aerobic, Energy metabolism, Electron transport, Central
            intermediary metabolism, Nitrogen metabolism].
          Length = 830

 Score = 43.7 bits (103), Expect = 8e-04
 Identities = 24/62 (38%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 1214 DCFIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLARED-WK 1272
            D FI+    +    A  AD ILP A + EK+  Y N E R Q     V  PG AR D W+
Sbjct: 504  DNFIVVSDAYPTVTALAADLILPSAMWVEKEGAYGNAERRTQVWHQQVLAPGEARSDLWQ 563

Query: 1273 II 1274
            ++
Sbjct: 564  LV 565


>gnl|CDD|197994 smart00926, Molybdop_Fe4S4, Molybdopterin oxidoreductase Fe4S4
           domain.  The molybdopterin oxidoreductase Fe4S4 domain
           is found in a number of reductase/dehydrogenase
           families, which include the periplasmic nitrate
           reductase precursor and the formate dehydrogenase alpha
           chain.
          Length = 55

 Score = 38.0 bits (89), Expect = 0.001
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYD 952
            VG  ++V  + G V+RV    +  +N   L  KGR   +
Sbjct: 12  GVGCGLLVEVKDGRVVRVRGDPDHPVNRGRLCPKGRAGLE 51


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 41.9 bits (98), Expect = 0.003
 Identities = 40/216 (18%), Positives = 76/216 (35%), Gaps = 17/216 (7%)

Query: 436 RKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQT 495
           RK +E L+   +  E +    + +        +      +  K  +     +K + E + 
Sbjct: 170 RKKKERLKKLIEETENLAELIIDL---EELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226

Query: 496 LLAKSASIKT-ELREFYQTQLNNNVKEKLKEFQTQLDSAEL-----------TLHREFQQ 543
           LL         E  +  Q  L +  +E     Q      E+              ++ Q+
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQE 286

Query: 544 KEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHL 603
           +E  ++    ++LK    K    K+  + K K    E E KKL+ EL +   +++E    
Sbjct: 287 EELKLLAKEEEELKSELLKLERRKVDDEEKLKES--EKELKKLEKELKKEKEEIEELEKE 344

Query: 604 LKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH 639
           LKE   KRE   ++ E + + Q +            
Sbjct: 345 LKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKK 380



 Score = 41.1 bits (96), Expect = 0.004
 Identities = 46/256 (17%), Positives = 95/256 (37%), Gaps = 18/256 (7%)

Query: 4   QDEIQTLLAKSASIKTELREF-----YQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQ 58
             +++ +       + E+ E       + +     K   E    LA+      EL+    
Sbjct: 142 GGKVEIIAMMKPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQEL 201

Query: 59  TQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQL--NNNVKEKLKEFQTQLDSAE---L 113
                 +KA +  Q         K EL E     L      +E++   Q  L   +    
Sbjct: 202 KLKEQAKKALEYYQLKE------KLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIE 255

Query: 114 TLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDF 173
           +  +E +++E+ + +   +  ++  E + +++E  ++    ++LK    K    K+  + 
Sbjct: 256 SSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEE 315

Query: 174 KAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHM 233
           K K    E E KKL+ EL +   +++E    LKE   KRE   ++ E + + Q +     
Sbjct: 316 KLKES--EKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLE 373

Query: 234 ITSDVTHLHTSKESLA 249
                     S+   +
Sbjct: 374 EELLAKKKLESERLSS 389



 Score = 39.2 bits (91), Expect = 0.018
 Identities = 39/215 (18%), Positives = 83/215 (38%), Gaps = 12/215 (5%)

Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
           L+  +E +  + +  + E +  +  EEEL+   +  EE+  + + +    +  E      
Sbjct: 261 LEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 320

Query: 474 SKRSKNADS-LVRLQKAQDEIQTLLAKSASIKTELRE------FYQTQLNNNVKEKLKEF 526
            K  K  +  L + ++  +E++  L K   IK E  E          +    ++E+L   
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKEL-KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAK 379

Query: 527 QTQLDSAELTLHREFQQK--EKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENK 584
           +        +  +  +++   K   E  AK L ++SE+     L+ + K +      E +
Sbjct: 380 KKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQ--EEDLLKEEKKEELKIVEELE 437

Query: 585 KLKAELTQRDIKLKETNHLLKEESKKRELMAKKVE 619
           +       +  + KE       +  K +L  KK E
Sbjct: 438 ESLETKQGKLTEEKEELEKQALKLLKDKLELKKSE 472



 Score = 37.6 bits (87), Expect = 0.048
 Identities = 37/268 (13%), Positives = 83/268 (30%), Gaps = 15/268 (5%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           +K++ +          +  +  +          +     + +  K   IK EL++    +
Sbjct: 661 EKSELKASLSELTKELLAEQELQEKAESELAKNEILRRQEEIKKKEQRIKEELKKLKLEK 720

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
                    E Q  +        E  +  + ++    +E+ K    + +  E        
Sbjct: 721 EELLADKVQEAQDKIN-------EELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELS-- 771

Query: 121 QKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 180
            KEK + E   K  K     E K+++    E   + L++  ++            + E  
Sbjct: 772 LKEKELAEEEEKTEKL-KVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEK 830

Query: 181 EAE-----NKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMIT 235
             E           E  + +   +E    L+EE  K EL+ + +    + + Q+    + 
Sbjct: 831 IKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELE 890

Query: 236 SDVTHLHTSKESLATRQPSEQSLIDSTT 263
           S        K+ L      +  L +   
Sbjct: 891 SKEEKEKEEKKELEEESQKDNLLEEKEN 918


>gnl|CDD|239151 cd02750, MopB_Nitrate-R-NarG-like, Respiratory nitrate reductase A
            (NarGHI), alpha chain (NarG) and related proteins. Under
            anaerobic conditions in the presence of nitrate, E. coli
            synthesizes the cytoplasmic membrane-bound quinol-nitrate
            oxidoreductase (NarGHI), which reduces nitrate to nitrite
            and forms part of a redox loop generating a proton-motive
            force. Found in prokaryotes and some archaea, NarGHI
            usually functions as a heterotrimer. The alpha chain
            contains the molybdenum cofactor-containing Mo-bisMGD
            catalytic subunit. Members of the
            MopB_Nitrate-R-NarG-like CD belong to the
            molybdopterin_binding (MopB) superfamily of proteins.
          Length = 461

 Score = 41.2 bits (97), Expect = 0.004
 Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 13/61 (21%)

Query: 1225 DHGASIADAILPGAAYTEKQS-------TYVNTEGRAQQTLTAVTPPGLAREDWKIIRAL 1277
            D  A  +D +LP A + EK          +++    A      V P   A+ DW+I +AL
Sbjct: 374  DSTALYSDIVLPAATWYEKHDLSTTDMHPFIHPFSPA------VDPLWEAKSDWEIFKAL 427

Query: 1278 S 1278
            +
Sbjct: 428  A 428


>gnl|CDD|237416 PRK13532, PRK13532, nitrate reductase catalytic subunit; Provisional.
          Length = 830

 Score = 41.0 bits (97), Expect = 0.004
 Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 1228 ASIADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLARED-WKII 1274
            A  AD ILP A + EK+  Y N E R Q     V  PG A+ D W+++
Sbjct: 518  ALAADLILPTAMWVEKEGAYGNAERRTQFWRQQVKAPGEAKSDLWQLV 565


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 40.8 bits (95), Expect = 0.007
 Identities = 104/646 (16%), Positives = 244/646 (37%), Gaps = 88/646 (13%)

Query: 7    IQTLLAKSASIKTELREFYQTQLNNNQKAQ--DEIQTLLAKSASIKTELREFYQTQLNNN 64
             +TL  +  S++ E  +  +     +Q+ +  +   T   +   +  +  +  +      
Sbjct: 496  TETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIK 555

Query: 65   QKAQDEIQTLLAKSASIKTELREFYQTQLN--NNVKEKLKEFQTQLDSAELTLHREFQQK 122
             +  DE+ +LL    + K +L ++  ++    N  +++L +   +L S E         +
Sbjct: 556  SRHSDELTSLLGYFPN-KKQLEDWLHSKSKEINQTRDRLAKLNKELASLE-QNKNHINNE 613

Query: 123  EKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEK----PGATKLM-------- 170
             ++  E  +    ++ +    Q E++ +E   +++++ S++     GAT +         
Sbjct: 614  LESKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLT 673

Query: 171  -----------LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKV 219
                         F+ + E+ E     L+++L     KLK T   LK++ K+R+ M    
Sbjct: 674  DENQSCCPVCQRVFQTEAELQEF-ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLA 732

Query: 220  E---SVMQ------SQLQQAIHMITSDVTHLHTSKES----LATRQPSEQSLIDSTTDTN 266
                S++        +L+  +  +  D+  L    E     L T  P E+S     TD  
Sbjct: 733  PGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVT 792

Query: 267  CPVTQQYVPSSNQQYAPSSNQQYSIPTSNINQQYSMFSADSTHMNIVERDIPNSADDMRK 326
              + +++           + Q   +  S++++     + +          + +  +  RK
Sbjct: 793  --IMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRK 850

Query: 327  YIEMQYSMFSADSTHMNIVERD---IPNSADDMRKYIEMLLEKPPGNPVSDLEAEVKDSA 383
             I+ Q        +  N ++ +   I  +    +++ E L+E      V  L  E+KD+ 
Sbjct: 851  LIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVEL--STEVQSLIREIKDAK 908

Query: 384  G---PMTSFLESSVTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEE 440
                P+ +FLE         E  + +K   +   K +++K+  ++  ++      + +E 
Sbjct: 909  EQDSPLETFLEKDQQE---KEELISSKETSN---KKAQDKVNDIKEKVKNIHGYMKDIEN 962

Query: 441  ELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKS 500
            ++QD +   + + +KE  + TV   LE       K +++      ++  + +I T   + 
Sbjct: 963  KIQDGKD--DYLKQKETELNTVNAQLEECEKHQEKINED------MRLMRQDIDTQKIQE 1014

Query: 501  ASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELT----LHREFQQKEKTIVENAAKQL 556
              ++  L    +      V+E+LK+   ++   ++      H++ ++    I  N    L
Sbjct: 1015 RWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLAL 1074

Query: 557  KQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNH 602
             +                    YE E K  K EL +   +  E  +
Sbjct: 1075 GRQKG-----------------YEKEIKHFKKELREPQFRDAEEKY 1103



 Score = 36.9 bits (85), Expect = 0.081
 Identities = 113/607 (18%), Positives = 214/607 (35%), Gaps = 89/607 (14%)

Query: 53  LREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEK--LKEFQTQLDS 110
           LR+  QTQ    Q+ Q E++  L +      E+R+   ++       +  +K ++ +LD 
Sbjct: 191 LRQVRQTQGQKVQEHQMELK-YLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELD- 248

Query: 111 AELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKP-GATKL 169
                    + + K I  N +K +K  +E +  +  K  +E    +L+   EK    T  
Sbjct: 249 -------PLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDE 301

Query: 170 MLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVE-SVMQSQLQ 228
            L+     ++Y    + ++ +  +     +E   L KE   +R L  +K E  V Q +LQ
Sbjct: 302 QLN-----DLYHNHQRTVREKERELVDCQRELEKLNKE---RRLLNQEKTELLVEQGRLQ 353

Query: 229 QAIHMITSDVTHLHTSKESLATRQPSEQSLIDSTTDTNCPVTQQYVPSSNQQYAPSSNQQ 288
                    +    +  +SLATR   +       ++         V    +  A ++ Q 
Sbjct: 354 LQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQL 413

Query: 289 YSIPTSNINQQYSMFSADSTHMNIVERDIPNSADDMRKYIE----MQYSMFSADSTHMNI 344
            +   S    +             + R I    + + K  E    +   +   + +   I
Sbjct: 414 CADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRI 473

Query: 345 VERDIPNSADDMRKYIEMLLEKPPGNPVSDLEAEVKDSAGPMTSFLESSVTSDIFNEPAL 404
           +E D      ++RK    L +    +    L+ EVK                D   E   
Sbjct: 474 LELD-----QELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLN 528

Query: 405 YTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTL 464
           +  T R+    ++K+K++  E+  + +++ S +L   L                 +   L
Sbjct: 529 HHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNK---------KQLEDWL 579

Query: 465 FLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 524
                   HSK  +   +  RL K   E+ +L                 Q  N++  +L+
Sbjct: 580 --------HSKSKEINQTRDRLAKLNKELASL----------------EQNKNHINNELE 615

Query: 525 EFQTQLDSAELTLH-----REFQQKEKTIVENAAKQLKQISEKPGATKLM---------- 569
             + QL S E  L      ++ +   + + E   K  KQ +   GAT +           
Sbjct: 616 SKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDE 675

Query: 570 ---------LDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVES 620
                      F+ + E+ E     L+++L     KLK T   LK++ K+R+ M      
Sbjct: 676 NQSCCPVCQRVFQTEAELQEF-ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAP- 733

Query: 621 VMQSQLQ 627
             QS + 
Sbjct: 734 GRQSIID 740


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 40.4 bits (95), Expect = 0.007
 Identities = 37/208 (17%), Positives = 73/208 (35%), Gaps = 20/208 (9%)

Query: 14  SASIKTELREFYQTQLNNNQKA-QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQ 72
           +   K EL   +  +     +  ++ I    +       EL     T+  N  + + +++
Sbjct: 265 TEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKEN-SEFKLDVE 323

Query: 73  TLLAKSASIKTELREFYQTQLNN------------NVKEKLKEFQTQLDSAE--LTLHRE 118
            L A   +++  L E    +L              ++ + ++     +D+    +  H E
Sbjct: 324 ELKALLEALEEIL-EKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNE 382

Query: 119 FQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCE 178
                K     A K+L      E K+      +      K I+      K     +A+ +
Sbjct: 383 KIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIK---QLEAEIK 439

Query: 179 VYEAENKKLKAELTQRDIKLKETNHLLK 206
             E E K+L+ +LT  +    E N LLK
Sbjct: 440 ALEKEIKELEKQLTNIEPTADEINKLLK 467



 Score = 37.3 bits (87), Expect = 0.063
 Identities = 40/235 (17%), Positives = 84/235 (35%), Gaps = 37/235 (15%)

Query: 415 KISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHS 474
            I++E+   LE +   E Q   +L E+L++     E  + K +  +   L  E+  S   
Sbjct: 263 TITEERKAELEAHFDEEYQ---ELIEQLEELIDKYESHIEKALEELESILDTEKENSEFK 319

Query: 475 KRSKNADSLVR-LQKAQDEIQTLLA---KSASIKTELREFYQTQLNNNVKEKLKEFQTQL 530
              +   +L+  L++  ++    L    K  S   EL          ++ + ++     +
Sbjct: 320 LDVEELKALLEALEEILEKNLQKLEEKLKDPSTSIEL---------ESITDLIESINDII 370

Query: 531 DSAE--LTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKK--- 585
           D+    +  H E     K     A K+L          +L  D  A  +  +   K    
Sbjct: 371 DAINELIREHNEKIDNLKKEKNKAKKKLWLH----LVAELKEDIDAYQKEKKGLEKAINS 426

Query: 586 LKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHL 640
           L+ E+ Q + ++K     +KE   +++L          + ++     I   +   
Sbjct: 427 LEKEIKQLEAEIKALEKEIKE--LEKQL----------TNIEPTADEINKLLKAY 469



 Score = 35.4 bits (82), Expect = 0.25
 Identities = 43/225 (19%), Positives = 95/225 (42%), Gaps = 21/225 (9%)

Query: 2   KAQDEIQTLLAKSASIKTELREFYQ---TQLNNNQKAQDEIQTLLAKSASIKT--ELREF 56
           K  D+    LAK     + L E  +    + N  +K   E+++++  ++S+ +  EL+  
Sbjct: 131 KFLDKAWKKLAKK--YDSNLSEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEELKAK 188

Query: 57  YQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAE---- 112
            +T  ++N+     +   +     I  E  E  +  +  +    + E    L +++    
Sbjct: 189 IKTLFSSNKPELALLTLSVIDFDEI--EQAEILEKSIIGSSDVPISELINNLGNSDWVKE 246

Query: 113 -LTLHRE-----FQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGA 166
            L  H E     F Q+  TI E    +L+   + E+++  + + E   K    I +    
Sbjct: 247 GLEYHEEGDTCPFCQQ--TITEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEE 304

Query: 167 TKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKK 211
            + +LD + +   ++ + ++LKA L   +  L++    L+E+ K 
Sbjct: 305 LESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEKLKD 349


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 40.3 bits (95), Expect = 0.007
 Identities = 32/155 (20%), Positives = 60/155 (38%), Gaps = 18/155 (11%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           Q AQ+++    ++  S++T+  E  Q  L  N +   +I+ LL         LR   +  
Sbjct: 138 QNAQNDLAEYNSQLVSLQTQP-ERAQAALYANSQRLQQIRNLLKGGKVGGKALRPSQRVL 196

Query: 61  LNNNQK---AQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHR 117
           L   Q    AQ+++Q    +     T+L++  Q Q      + L     +L+     L  
Sbjct: 197 LQAEQALLNAQNDLQRKSLEGN---TQLQDLLQKQR-----DYLTARIQRLEHQLQLLQE 248

Query: 118 EFQQKEKTIVENAAKQLK------QISENEFKQKE 146
               K  T+ E   ++ +      +I  N    +E
Sbjct: 249 AINSKRLTLSEKTVQEAQSQDEAARIQANPLVAQE 283



 Score = 36.0 bits (84), Expect = 0.19
 Identities = 46/216 (21%), Positives = 83/216 (38%), Gaps = 54/216 (25%)

Query: 417 SKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKR 476
            KE+ E L++ L     + R+ + EL+  +   +E  R+ +S    TL L +L S  ++ 
Sbjct: 78  QKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETLS----TLSLRQLESRLAQT 133

Query: 477 SKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELT 536
                 L +LQ AQ+++                 Y +QL +         QTQ + A   
Sbjct: 134 ------LDQLQNAQNDLAE---------------YNSQLVSL--------QTQPERA--- 161

Query: 537 LHREFQQKEKTIVENAAKQLKQI-----SEKPGATKLMLDFKAKCEVYEAENKKLKAELT 591
                 Q         +++L+QI       K G   L    +      +AE   L A+  
Sbjct: 162 ------QAALY---ANSQRLQQIRNLLKGGKVGGKALRPSQRVL---LQAEQALLNAQND 209

Query: 592 QRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQ 627
            +   L+    L     K+R+ +  ++   ++ QLQ
Sbjct: 210 LQRKSLEGNTQLQDLLQKQRDYLTARI-QRLEHQLQ 244


>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
           This family represents the C-terminal region of
           merozoite surface protein 1 (MSP1) which are found in a
           number of Plasmodium species. MSP-1 is a 200-kDa protein
           expressed on the surface of the P. vivax merozoite.
           MSP-1 of Plasmodium species is synthesised as a
           high-molecular-weight precursor and then processed into
           several fragments. At the time of red cell invasion by
           the merozoite, only the 19-kDa C-terminal fragment
           (MSP-119), which contains two epidermal growth
           factor-like domains, remains on the surface. Antibodies
           against MSP-119 inhibit merozoite entry into red cells,
           and immunisation with MSP-119 protects monkeys from
           challenging infections. Hence, MSP-119 is considered a
           promising vaccine candidate.
          Length = 574

 Score = 40.3 bits (94), Expect = 0.008
 Identities = 39/184 (21%), Positives = 70/184 (38%), Gaps = 44/184 (23%)

Query: 51  TELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLD- 109
            EL +FY++ L       D+    +     + T L E     L   +    +  Q   D 
Sbjct: 17  FELEKFYESYLQQIDGYNDDFIQFIKSKKELITALTETKVNALYLEIAHLKELLQHSYDR 76

Query: 110 --SAELTLHREFQQKEKTIVENAAKQLKQISE--------------------------NE 141
               +L L R + +KE+  +  +  Q+K+++                           N+
Sbjct: 77  YYKYKLKLERLYNKKEQ--IGQSKMQIKKLTLLKERLEKRKNSLNNPFYVLSNFSNFFNK 134

Query: 142 FKQKEKTIVENAAKQLKQISEKPGATKLMLD-FKAKCEVYEAENKKLKAELTQRDIKLKE 200
            ++ EK  VEN  K           T ++L  +KA+ + Y  E   LK  L++  I+ +E
Sbjct: 135 KREAEKQEVENTLKN----------TDILLKYYKARVKYYTGEAVPLKT-LSEVSIQ-RE 182

Query: 201 TNHL 204
            N+L
Sbjct: 183 DNYL 186


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 39.5 bits (92), Expect = 0.012
 Identities = 86/430 (20%), Positives = 156/430 (36%), Gaps = 61/430 (14%)

Query: 204 LLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATRQPSEQSLIDSTT 263
           L KEE+    +  K+        L     ++    +  + S    +  Q  E    +   
Sbjct: 201 LPKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGK 260

Query: 264 DTNCPVTQQYVPSSNQQYAPSSNQQYSIPTSNINQQY---SMFSADSTHMNIVERDIPNS 320
           D +     Q   S       S +    + T  + + +   S  S+ ST+ +       +S
Sbjct: 261 DHHHSHNHQ-HHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSS 319

Query: 321 ADDMRKYIEMQYSMFSADSTHMNIVERDIPNSADDMRKYIEMLLEKPPGNPVSDLEAEVK 380
           A              S  S      +    N ++   K         P + VSD E++ K
Sbjct: 320 AAAG-----------SIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQK 368

Query: 381 DSAGPMTSFLESSVTSDIF--NEPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKL 438
            ++   +   +S   +     N           ++   + E++E   + LQ E Q++R+ 
Sbjct: 369 RASKSSSGARDSKKDASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQARQN 428

Query: 439 EEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLA 498
           E EL           R ++S++T          S  +  K+   L +L+K  D +QT L 
Sbjct: 429 ESEL-----------RNQISLLT----------SLERSLKS--DLGQLKKENDMLQTKLN 465

Query: 499 KSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQ--- 555
              S K + ++  Q+     ++++LK       +AE  L  E  +K K   E  A +   
Sbjct: 466 SMVSAKQKDKQSMQS-----MEKRLKSEADSRVNAEKQLAEE--KKRKKEEEETAARAAA 518

Query: 556 LKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMA 615
               S +  A  L    K   +  E E KKL     + D+KLKE    + E  K+ + + 
Sbjct: 519 QAAASREECAESL----KQAKQDLEMEIKKL-----EHDLKLKEEECRMLE--KEAQELR 567

Query: 616 KKVESVMQSQ 625
           K  ES  +++
Sbjct: 568 KYQESEKETE 577



 Score = 33.7 bits (77), Expect = 0.83
 Identities = 55/222 (24%), Positives = 92/222 (41%), Gaps = 31/222 (13%)

Query: 30  NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTEL---- 85
            N       I+ L      ++ EL    Q    N  + +++I  L +   S+K++L    
Sbjct: 399 ENKISTPSAIERLEQDIKKLQAEL----QQARQNESELRNQISLLTSLERSLKSDLGQLK 454

Query: 86  --REFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFK 143
              +  QT+LN+ V  K K+ Q+ + S E  L  E     +   E   KQL +  E + K
Sbjct: 455 KENDMLQTKLNSMVSAKQKDKQS-MQSMEKRLKSE--ADSRVNAE---KQLAE--EKKRK 506

Query: 144 QKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNH 203
           ++E+     AA Q     E+   +      K   +  E E KKL     + D+KLKE   
Sbjct: 507 KEEEETAARAAAQAAASREECAES-----LKQAKQDLEMEIKKL-----EHDLKLKEEEC 556

Query: 204 LLKEESKKRELMAKKVESVMQSQ-LQQAIHMITSDVTHLHTS 244
            + E  K+ + + K  ES  +++ L  A+  +      L  S
Sbjct: 557 RMLE--KEAQELRKYQESEKETEVLMSALQAMQDKNLMLENS 596


>gnl|CDD|131740 TIGR02693, arsenite_ox_L, arsenite oxidase, large subunit.  This
            model represents the large subunit of an arsenite oxidase
            complex. The small subunit is a Rieske protein. Homologs
            to both large and small subunits that score in the gray
            zone between the set trusted and noise bit score cutoffs
            for the respective models are found in Aeropyrum pernix
            K1 and in Sulfolobus tokodaii str. 7. This enzyme acts in
            energy metabolim by arsenite oxidation, rather than
            detoxification by reduction of arsenate to arsenite prior
            to export [Energy metabolism, Electron transport].
          Length = 806

 Score = 39.5 bits (92), Expect = 0.012
 Identities = 33/132 (25%), Positives = 46/132 (34%), Gaps = 25/132 (18%)

Query: 949  FAYDGLKRQRLLTPFVRNCDGQLVATEWEDALIAVAQKLQTSEVAGVVGSLADAEAMVAL 1008
            ++ D   + RL  P +R  D Q  AT W+DAL  +A          +     + +  V  
Sbjct: 106  WSLDRGTQDRLTYPLLRVGD-QFQATSWDDALTLMALL-----TKKIRDRDGNDDIAVKC 159

Query: 1009 KD-------LLNKLGSEDLYTEY------------AFPLEGAGTDLRANYLLNNKIAGAE 1049
             D         N  G+  L+               A+  E  GT       LNN    A 
Sbjct: 160  FDHGGAGGGFENTWGAGKLFFAALSVKHIRIHNRPAYNSEVHGTREMGVGELNNTYEDAR 219

Query: 1050 EADLILLIGTNP 1061
             AD I+L G N 
Sbjct: 220  LADTIVLWGANS 231



 Score = 31.0 bits (70), Expect = 5.1
 Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 8/80 (10%)

Query: 1231 ADAILPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSEGVSSNLSSE-- 1288
            A  ILP A + E   T +N E R +     + PPG A+ D  I   ++  ++    +E  
Sbjct: 523  AHLILPAAGWGEMNLTSMNGERRMRLYEKFMDPPGEAKPDCLIAAWVANTIAELYRAEGK 582

Query: 1289 PVDVKQ------KNLEDYYM 1302
              + K+      K  ED ++
Sbjct: 583  FEEAKKFEGFDWKTEEDAFL 602


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 39.7 bits (92), Expect = 0.013
 Identities = 69/342 (20%), Positives = 144/342 (42%), Gaps = 30/342 (8%)

Query: 20   ELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSA 79
            E ++  + ++    K  +E + + A+ A  K E  +    +L   ++ + +++ L  K A
Sbjct: 1585 EAKKAEEARIEEVMKLYEEEKKMKAEEAK-KAEEAKIKAEELKKAEEEKKKVEQLKKKEA 1643

Query: 80   SIKTELREFYQTQLNNNVK---EKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQ 136
              K +  E  + +  N +K   E  K  + +  + E     E ++K    ++  A++ K+
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703

Query: 137  ISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDI 196
              E + K+ E+      A++LK+  E+          KA+    EAE  K KAE  ++D 
Sbjct: 1704 AEELKKKEAEEK---KKAEELKKAEEEN-------KIKAEEAKKEAEEDKKKAEEAKKDE 1753

Query: 197  KLK-ETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH----LHTSKESLATR 251
            + K +  HL KEE KK E + K+ E+V++ +L +       +V      +  +  ++   
Sbjct: 1754 EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEG 1813

Query: 252  QPSEQSLIDSTTDTNCPVTQQYVPSSNQQYAPSSN-QQYSIPTSNINQQYSMFSADSTHM 310
                  +I+ + +      ++   S N Q   +   +++    +N N +     AD    
Sbjct: 1814 GKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFN-- 1871

Query: 311  NIVERDIPNSADDMRKYIEMQYSMFSADSTHMNIVERDIPNS 352
                ++     DD  +  E        +    + +ER+IPN+
Sbjct: 1872 ----KEKDLKEDDEEEIEEADEI----EKIDKDDIEREIPNN 1905



 Score = 35.9 bits (82), Expect = 0.21
 Identities = 52/228 (22%), Positives = 102/228 (44%), Gaps = 20/228 (8%)

Query: 418  KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
            K+K E  ++  + +    RK EE  +      EEV++       +    E  + +   + 
Sbjct: 1563 KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA--EEAKKAEEAKI 1620

Query: 478  KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVK---------------EK 522
            K A+ L + ++ + +++ L  K A  K +  E  + +  N +K               E+
Sbjct: 1621 K-AEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679

Query: 523  LKEFQTQLDSAELTLHREFQQKEKT-IVENAAKQLKQISEKPGATKLMLDFKAKCEVYEA 581
             K+ +     A   L +E ++ +K   ++    + K+ +E+    +     KA+    EA
Sbjct: 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEA 1739

Query: 582  ENKKLKAELTQRDIKLK-ETNHLLKEESKKRELMAKKVESVMQSQLQQ 628
            E  K KAE  ++D + K +  HL KEE KK E + K+ E+V++ +L +
Sbjct: 1740 EEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDE 1787



 Score = 35.1 bits (80), Expect = 0.33
 Identities = 24/111 (21%), Positives = 48/111 (43%), Gaps = 1/111 (0%)

Query: 117  REFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLD-FKA 175
            ++ ++K+K      A++ K+  E + K +E    + A K+ ++  +K  A K   +  K 
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343

Query: 176  KCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQ 226
              E  +AE +    E    + K +      +E  KK +   KK E   ++ 
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394



 Score = 31.6 bits (71), Expect = 3.9
 Identities = 41/226 (18%), Positives = 91/226 (40%), Gaps = 5/226 (2%)

Query: 1    QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
             K   E     A++A+ + E  E         ++   +      K A  K +  E  +  
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKA 1400

Query: 61   LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
              + +KA +     L K+A+ K +  E  +        ++ K+   +   A+    +  +
Sbjct: 1401 EEDKKKADE-----LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455

Query: 121  QKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 180
             K+    +  A++ K+  E + K +E    + A K+ ++  +K    K   + K K +  
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515

Query: 181  EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQ 226
            +   +  KA+  ++  + K+ +   K E KK+    KK E + +++
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAE 1561



 Score = 30.9 bits (69), Expect = 6.7
 Identities = 42/192 (21%), Positives = 79/192 (41%), Gaps = 14/192 (7%)

Query: 36   QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN 95
            +D  +   A+      E+R+F + ++ +  + Q  I+   A+ A    EL++  + +  +
Sbjct: 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKAD---ELKKAEEKKKAD 1293

Query: 96   NVK--EKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKE--KTIVE 151
              K  E+ K+       AE     +  +K+    +  A   K+ +E   K  E  K   E
Sbjct: 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353

Query: 152  NAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLK---EE 208
             AA + +   EK  A +     K +    +A+  K KAE  ++  + K+     K   +E
Sbjct: 1354 AAADEAEAAEEKAEAAEK----KKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409

Query: 209  SKKRELMAKKVE 220
             KK     KK +
Sbjct: 1410 LKKAAAAKKKAD 1421


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 39.3 bits (92), Expect = 0.015
 Identities = 45/211 (21%), Positives = 93/211 (44%), Gaps = 30/211 (14%)

Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSS 472
            L+  KE+IE LE+ L++     RKLEE++++  +  EE+ ++   +      L+ L+  
Sbjct: 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELK-- 289

Query: 473 HSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDS 532
                + A+  ++L +  +E           + E R     +  N ++E++KE +     
Sbjct: 290 -----EKAEEYIKLSEFYEEY-----LDELREIEKRLSRLEEEINGIEERIKELEE--KE 337

Query: 533 AELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ 592
             L    E ++K K +        K++ E     +L  + KAK    + E ++LK  LT 
Sbjct: 338 ERL---EELKKKLKEL-------EKRLEELEERHELYEEAKAK----KEELERLKKRLTG 383

Query: 593 RDIKLKETNHLLKEESKKRELMAKKVESVMQ 623
                ++    L+E  K +E + +++  +  
Sbjct: 384 LTP--EKLEKELEELEKAKEEIEEEISKITA 412



 Score = 36.2 bits (84), Expect = 0.14
 Identities = 45/205 (21%), Positives = 94/205 (45%), Gaps = 19/205 (9%)

Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSS 472
           +LK   EK+E L++ L    ++  +LEEEL +  +  EE+  +  S+  +   L+ L   
Sbjct: 543 SLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFE--SVEELEERLKELEPF 600

Query: 473 HSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELRE-FYQTQLNNNVKEKLKEFQTQLD 531
           +++        + L+ A+ E++    +   ++ EL + F +        E+L++   +L+
Sbjct: 601 YNE-------YLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRK---ELE 650

Query: 532 SAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELT 591
             E    +++ ++E   +    + L+   E  G    + + + + E  +   +KLK EL 
Sbjct: 651 ELE----KKYSEEEYEELRE--EYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELE 704

Query: 592 QRDIKLKETNHLLKEESKKRELMAK 616
           +R+   KE   L K   +  EL  K
Sbjct: 705 EREKAKKELEKLEKALERVEELREK 729



 Score = 33.9 bits (78), Expect = 0.83
 Identities = 47/224 (20%), Positives = 99/224 (44%), Gaps = 22/224 (9%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           +K  +E + L  K   +K E++   + +L   ++ + ++  L  K   ++ EL E  +  
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLKEFQTQLDSAELTLHREF 119
                ++ +E++  L        EL  FY   L   + +++L+  + +L   E  L + F
Sbjct: 580 EELGFESVEELEERLK-------ELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAF 632

Query: 120 Q--QKEKTIVENAAKQLKQI----SENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDF 173
           +   + +  +E   K+L+++    SE E+++  +       + L+   E  G    + + 
Sbjct: 633 EELAETEKRLEELRKELEELEKKYSEEEYEELRE-------EYLELSRELAGLRAELEEL 685

Query: 174 KAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAK 217
           + + E  +   +KLK EL +R+   KE   L K   +  EL  K
Sbjct: 686 EKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREK 729



 Score = 31.6 bits (72), Expect = 4.0
 Identities = 44/225 (19%), Positives = 98/225 (43%), Gaps = 24/225 (10%)

Query: 407 KTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFL 466
           K +     K + E++E      +   ++  KL+ E+   +   +E+ + E     +    
Sbjct: 506 KELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEI---KSLKKELEKLEELKKKLAELE 562

Query: 467 ERLRSSHSKRSKNADSLVRLQ-KAQDEIQTLLAKSASIKTELREFYQTQLN-NNVKEKLK 524
           ++L     + ++    L  L  ++ +E++  L        EL  FY   L   + +++L+
Sbjct: 563 KKLDELEEELAELLKELEELGFESVEELEERLK-------ELEPFYNEYLELKDAEKELE 615

Query: 525 EFQTQLDSAELTLHREFQ--QKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY--- 579
             + +L   E  L + F+   + +  +E   K+L+++ +K        +++   E Y   
Sbjct: 616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEE----EYEELREEYLEL 671

Query: 580 EAENKKLKAELTQRDIKLKETNHL---LKEESKKRELMAKKVESV 621
             E   L+AEL + + + +E       LKEE ++RE   K++E +
Sbjct: 672 SRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKL 716



 Score = 30.4 bits (69), Expect = 9.3
 Identities = 37/165 (22%), Positives = 73/165 (44%), Gaps = 16/165 (9%)

Query: 20  ELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSA 79
           EL  FY  +    + A+ E++    +   ++ EL            KA +E+     +  
Sbjct: 596 ELEPFYN-EYLELKDAEKELEREEKELKKLEEEL-----------DKAFEELAETEKRLE 643

Query: 80  SIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEK---TIVENAAKQLKQ 136
            ++ EL E  + + +    E+L+E   +L      L  E ++ EK    I +   K  ++
Sbjct: 644 ELRKELEEL-EKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEE 702

Query: 137 ISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYE 181
           + E E  +KE   +E A ++++++ EK    K +L  +A  +V E
Sbjct: 703 LEEREKAKKELEKLEKALERVEELREKVKKYKALLKERALSKVGE 747


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 38.5 bits (90), Expect = 0.022
 Identities = 32/225 (14%), Positives = 79/225 (35%), Gaps = 33/225 (14%)

Query: 406 TKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLF 465
                SV   ++K      +   Q +  R   L EE+ +       V             
Sbjct: 58  RDPQISVVSSVTKLSKGPKDEPKQVKIPRQSGLSEEVAEAEAKATSV------------- 104

Query: 466 LERLRSSHSKRSKNADSLV-RLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 524
                ++ +   K+   LV  L++  +E+    A    ++  +  F     +        
Sbjct: 105 -----AAEATTPKSIQELVEALEELLEELLKETASDPVVQELVSIFNDLIDSIKEDNLKD 159

Query: 525 EFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENK 584
           + ++ + SA+  L  +  +K   +     ++L++  ++            + E+     +
Sbjct: 160 DLESLIASAKEELD-QLSKKLAELKAEEEEELERALKE-----------KREELLSKLEE 207

Query: 585 KLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 629
           +L A L  +  +      L  E  +++E + KK E  ++ +L++ 
Sbjct: 208 ELLARLESK--EAALEKQLRLEFEREKEELRKKYEEKLRQELERQ 250



 Score = 35.4 bits (82), Expect = 0.23
 Identities = 22/134 (16%), Positives = 62/134 (46%), Gaps = 4/134 (2%)

Query: 31  NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSAS-IKTELREFY 89
                +D++++L+A     K EL +  +       + ++E++  L +    + ++L E  
Sbjct: 153 KEDNLKDDLESLIAS---AKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEEL 209

Query: 90  QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
             +L +      K+ + + +  +  L +++++K +  +E  A+  +Q  +NE   +   +
Sbjct: 210 LARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIEL 269

Query: 150 VENAAKQLKQISEK 163
                K++K+  E+
Sbjct: 270 QREFNKEIKEKVEE 283



 Score = 30.8 bits (70), Expect = 5.5
 Identities = 29/163 (17%), Positives = 62/163 (38%), Gaps = 16/163 (9%)

Query: 405 YTKTVRSVTLKISKE---KIESLERNLQTETQRSR---------KLEEELQDTRQTCEEV 452
             + +  +  ++      K  +LE+ L+ E +R +         KL +EL+   +  E+ 
Sbjct: 198 REELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQK 257

Query: 453 VRKEVSIVTVTLFLERLRSSHSKRSKNADS-LVRLQKAQDEIQTLLAKSASIKTELREFY 511
           ++ E+++  + L  E  +    K  +  +  L +L +    ++ L     S      E +
Sbjct: 258 LKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENH 317

Query: 512 QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAK 554
           + Q      E LK   + L S      R   ++   + E A  
Sbjct: 318 KVQQLWLAVEALK---SALKSGSAGSPRPLVKELDALKELAKD 357


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 38.9 bits (91), Expect = 0.023
 Identities = 34/151 (22%), Positives = 60/151 (39%), Gaps = 24/151 (15%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
           +E+++  E  LQ+   + ++ EE+L       EE  R E               + ++ +
Sbjct: 606 RERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAE---------------AEARTA 650

Query: 478 -KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLN------NNVKEKLKEFQTQL 530
            K A   + LQ+ Q+E Q+L  K      E ++  +TQL         + E+ + F   L
Sbjct: 651 LKQAR--LDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEAL 708

Query: 531 DSAELTLHREFQQKEKTIVENAAKQLKQISE 561
                 L  E   K + +      QL Q+S 
Sbjct: 709 KDDFRELRTERLAKWQVVEGELDNQLAQLSA 739



 Score = 34.7 bits (80), Expect = 0.48
 Identities = 32/164 (19%), Positives = 66/164 (40%), Gaps = 18/164 (10%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           Q+A++ +Q+ +AK    + +L      Q N   + Q   +      A    +       +
Sbjct: 610 QQAEEALQSAVAKQKQAEEQL-----VQANAELEEQKRAEAE----ARTALKQARLDLQR 660

Query: 61  LNNNQKA-QDEIQTLLAKSA-SIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHRE 118
           L N Q++ +D+++  +A+     +T+LR+         + E+ + F   L      L  E
Sbjct: 661 LQNEQQSLKDKLELAIAERKQQAETQLRQLDAQL--KQLLEQQQAFLEALKDDFRELRTE 718

Query: 119 FQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISE 162
              K + +      QL Q+S       E    +  A +LK++ +
Sbjct: 719 RLAKWQVVEGELDNQLAQLS----AAIEAARTQAKA-RLKELKK 757



 Score = 30.8 bits (70), Expect = 5.8
 Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 13/128 (10%)

Query: 419 EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSK 478
                          R  +L++ L     T EE+ + E++   +    E    + +   +
Sbjct: 435 AGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQ 494

Query: 479 NADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLH 538
               L +L+K +DE    L +    +  L +         +++ L E + QL     +L 
Sbjct: 495 LQSELRQLRKRRDEALEALQR---AERRLLQ---------LRQALDELELQLSPQAGSLL 542

Query: 539 REFQQKEK 546
             F + E 
Sbjct: 543 -HFLRNEA 549



 Score = 30.4 bits (69), Expect = 7.8
 Identities = 33/154 (21%), Positives = 58/154 (37%), Gaps = 29/154 (18%)

Query: 484 VRLQKAQDEIQTLLAKSASIKTELREFYQTQLN---NNVKEKLKEFQTQLDSAELTLHRE 540
            RLQ+A++ +Q+ +AK    + +L      Q N      K    E +T L  A L L R 
Sbjct: 607 ERLQQAEEALQSAVAKQKQAEEQL-----VQANAELEEQKRAEAEARTALKQARLDLQRL 661

Query: 541 FQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKET 600
             +++         +L     K  A              E + ++L A+L Q   + +  
Sbjct: 662 QNEQQS---LKDKLELAIAERKQQA--------------ETQLRQLDAQLKQLLEQQQAF 704

Query: 601 NHLLKEESKKRELMAKK----VESVMQSQLQQAI 630
              LK++ ++           VE  + +QL Q  
Sbjct: 705 LEALKDDFRELRTERLAKWQVVEGELDNQLAQLS 738


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 37.9 bits (89), Expect = 0.035
 Identities = 50/242 (20%), Positives = 92/242 (38%), Gaps = 60/242 (24%)

Query: 31  NNQKAQDEIQTL--------LAKSASIKT--ELREFYQTQLNNNQKAQDEIQTLLAKSAS 80
            N++ ++ I TL         AK    K   +L +F +     N++   E+         
Sbjct: 279 ENEEIEERIDTLYDILEKEVKAKKFVEKNIDKLTDFLEHAREQNKQLLLELD-------- 330

Query: 81  IKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAA--------- 131
               L++ Y   LN +  E ++E + QL+  E     ++ Q  + I E            
Sbjct: 331 ---RLQQSYT--LNEDELETVRELEKQLEELE----AQYDQLVERIAEKKVAYSELQEEL 381

Query: 132 ----KQLKQISEN--EFKQKEKTI--VENAAKQ-LKQISEKPGATKLML----------D 172
               KQL++I +   E  +  + +   E  A++ L++  +K    K  L          D
Sbjct: 382 EEIEKQLEEIEKEQEELSESLQGLRKDELEAREKLQEYRQKLHEIKRYLEKSNLPGLPED 441

Query: 173 FKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQS-----QL 227
           +     V   E + L  EL +  I +   N  L+E +   E + +K E ++ +     QL
Sbjct: 442 YLEYFFVVSDEIEALADELNEVPINMDAVNRQLEEATDDVETLKEKTEELVDNATLAEQL 501

Query: 228 QQ 229
            Q
Sbjct: 502 IQ 503


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 37.7 bits (87), Expect = 0.040
 Identities = 42/172 (24%), Positives = 71/172 (41%), Gaps = 23/172 (13%)

Query: 101 LKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQI 160
           LKE ++Q D+      +E   K++   + A ++     +N  KQ+++       +Q  + 
Sbjct: 203 LKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEV---RQKQQEAKN 259

Query: 161 SEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIK------LKETNHL---LKEESKK 211
             KP  T        K +   AEN+K + E  Q +IK      LK  +H    LK+ESK 
Sbjct: 260 LPKPADTS-----SPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKA 314

Query: 212 RELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATRQPSEQSLIDSTT 263
            E  A+  E   Q + +     +  D+       E+  T     +  IDS+ 
Sbjct: 315 SEKEAEDKELEAQKKREP----VAEDLQKTKPQVEAQPTSL--NEDAIDSSN 360



 Score = 34.2 bits (78), Expect = 0.56
 Identities = 37/206 (17%), Positives = 77/206 (37%), Gaps = 26/206 (12%)

Query: 466 LERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKE 525
           +E LR  + K       +  L++ + +     A+    + + ++               +
Sbjct: 183 VEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQ-----------IDADK 231

Query: 526 FQTQLDSAELTLH------REFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 579
            Q + D A+          R+ QQ+ K + + A     +  ++    +     KA+ E+ 
Sbjct: 232 AQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIK 291

Query: 580 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH 639
           + + + LKA+  +     +E+    KE   K     KK E V    LQ+    + +  T 
Sbjct: 292 KNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV-AEDLQKTKPQVEAQPTS 350

Query: 640 LH------VLPC--VRVLSPVKTLST 657
           L+        P   ++V+ P+  LS 
Sbjct: 351 LNEDAIDSSNPVYGLKVVDPITNLSE 376


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 37.4 bits (87), Expect = 0.067
 Identities = 76/592 (12%), Positives = 176/592 (29%), Gaps = 75/592 (12%)

Query: 47   ASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQT 106
              I+  +R  Y  +    Q  +  I+ +       +      Y+ +    +K +      
Sbjct: 749  TRIQRAIRGRY-LRRRYLQALKR-IKKIQVIQHGFRLRRLVDYELKWRLFIKLQ------ 800

Query: 107  QLDSAELTLHREFQ-QKEKTIVENAA-----KQLKQISENEFKQKEKTIVENAAKQLKQI 160
                  L   R+        I++        K+L++  E EF  K + +++   +     
Sbjct: 801  --PLLSLLGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGR----- 853

Query: 161  SEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVE 220
            S K      +L  +          +  + +L +  I +K  + L K  + + E    +++
Sbjct: 854  SLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSL-KLVNLELESEIIELK 912

Query: 221  SVMQSQLQQAIHMITSDVTHLHTSKESLATRQPSEQSLIDSTTDTNCPVTQQYVPSSNQQ 280
              + S L + +   T  +  L   K+ L      E   I+                    
Sbjct: 913  KSLSSDLIENLEFKTELIARL---KKLLNNIDLEEGPSIEYVKL---------------- 953

Query: 281  YAPSSNQQYSIPTSNINQQYSMFSADSTHMNIVERDIPNSADDMRKY------IEMQYSM 334
                   +     S + +    +        I+ R+   +  +++ +      +  QY  
Sbjct: 954  ---PELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGA 1010

Query: 335  FSADSTHMNIVERDIPNSADDMRKYIEMLLEKPPGNPVSDLE----AEVKDSAGPMTSFL 390
                +  +  +  ++       +       E     P+  L+     E         +  
Sbjct: 1011 LQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENNQLQARYKALK 1070

Query: 391  ESSVTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCE 450
                 S + ++     ++  ++   I+ + +E   RNL       + +  ++       E
Sbjct: 1071 LRRENSLLDDKQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQE 1130

Query: 451  EVVRKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREF 510
                K +S +     +  L     K S      +     +  ++ L +         +  
Sbjct: 1131 IS--KFLSQL-----VNTLEPVFQKLSVL-QLELDGLFWEANLEALPSPPPFAALSEKRL 1182

Query: 511  YQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLML 570
            YQ   +    EK K   ++++  +  L   F  K  +      K  K ISE    T+   
Sbjct: 1183 YQ---SALYDEKSKLSSSEVNDLKNELIALF-SKIFSGWPRGDKLKKLISEGWVPTEYS- 1237

Query: 571  DFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVM 622
                     +  N   K   T   +  ++   LL          + K+E  +
Sbjct: 1238 ------TSLKGFNNLNKKFDTPASMSNEKLLSLLNSIDNLLS--SYKLEEEV 1281


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 35.8 bits (83), Expect = 0.075
 Identities = 22/87 (25%), Positives = 40/87 (45%), Gaps = 6/87 (6%)

Query: 26  QTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTEL 85
           Q +L   Q+   E+Q  LA+      EL++   T     ++ Q E+  +   SA+   EL
Sbjct: 72  QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI-EL 130

Query: 86  REFYQTQLNNNVKEKLKEFQTQLDSAE 112
            E      N  ++E+L E + + ++ E
Sbjct: 131 DE-----ENRELREELAELKQENEALE 152



 Score = 33.1 bits (76), Expect = 0.66
 Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 1/82 (1%)

Query: 5   DEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNN 64
            E+Q  LA+      EL++   T     ++ Q E+  +   SA+   EL E  +      
Sbjct: 83  AELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI-ELDEENRELREEL 141

Query: 65  QKAQDEIQTLLAKSASIKTELR 86
            + + E + L A++  ++   +
Sbjct: 142 AELKQENEALEAENERLQENEQ 163



 Score = 31.5 bits (72), Expect = 2.0
 Identities = 29/135 (21%), Positives = 54/135 (40%), Gaps = 34/135 (25%)

Query: 484 VRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQ 543
            RL + Q E+  L  + A ++ +L E    Q N  +K++L   + +L+  +  L R    
Sbjct: 66  ERLPELQQELAELQEELAELQEQLAE--LQQENQELKQELSTLEAELERLQKELAR---- 119

Query: 544 KEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAE---LTQRDIKLKET 600
                       +KQ+S                   + EN++L+ E   L Q +  L+  
Sbjct: 120 ------------IKQLSANAIEL-------------DEENRELREELAELKQENEALEAE 154

Query: 601 NHLLKEESKKRELMA 615
           N  L+E  ++R  + 
Sbjct: 155 NERLQENEQRRWFLY 169



 Score = 30.7 bits (70), Expect = 3.7
 Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 4/90 (4%)

Query: 1   QKAQDEIQTLLAKSASIKT---ELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFY 57
            + Q E+  L  + A ++    EL++  Q         + E++ L  + A IK +L    
Sbjct: 69  PELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIK-QLSANA 127

Query: 58  QTQLNNNQKAQDEIQTLLAKSASIKTELRE 87
                 N++ ++E+  L  ++ +++ E   
Sbjct: 128 IELDEENRELREELAELKQENEALEAENER 157


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 37.1 bits (87), Expect = 0.081
 Identities = 21/102 (20%), Positives = 44/102 (43%), Gaps = 10/102 (9%)

Query: 71  IQTLLAKSASIKTELREFYQ-----TQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKT 125
           I +L      ++ +  E         +L   ++EK ++ Q + D       +E QQ  K 
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581

Query: 126 IVENAAKQLKQISENEFK----QKEKTIVENAAKQLKQISEK 163
             + A + +K++ + +       K   ++E A K+L + +EK
Sbjct: 582 AKKEADEIIKELRQLQKGGYASVKAHELIE-ARKRLNKANEK 622



 Score = 35.2 bits (82), Expect = 0.26
 Identities = 42/203 (20%), Positives = 83/203 (40%), Gaps = 30/203 (14%)

Query: 421 IESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSKNA 480
           I SLE   +   Q++ + E  L++  +  EE+  K           E+L+    K  + A
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEK----------KEKLQEEEDKLLEEA 571

Query: 481 DSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHRE 540
           +     ++AQ  I+    ++  I  ELR+  +    +    +L E + +L+ A     ++
Sbjct: 572 E-----KEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKK 626

Query: 541 FQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRD--IKLK 598
            +++++          KQ   K G     L    K EV    + K   E   +   +K+K
Sbjct: 627 KKKQKE----------KQEELKVGDEVKYLSLGQKGEVLSIPDDK---EAIVQAGIMKMK 673

Query: 599 ETNHLLKEESKKRELMAKKVESV 621
                L++  K ++   KK ++V
Sbjct: 674 VPLSDLEKIQKPKKKKKKKPKTV 696



 Score = 30.6 bits (70), Expect = 8.2
 Identities = 22/126 (17%), Positives = 56/126 (44%), Gaps = 4/126 (3%)

Query: 4   QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN 63
           ++++  L+A    ++ EL +  +      ++A+   + L  K   ++ E  +  +     
Sbjct: 515 KEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA--- 571

Query: 64  NQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKE 123
            ++AQ  I+    ++  I  ELR+  +    +    +L E + +L+ A     ++ ++K+
Sbjct: 572 EKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKK-KKKQ 630

Query: 124 KTIVEN 129
           K   E 
Sbjct: 631 KEKQEE 636


>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
           The small mechanosensitive channel, MscS, is a part of
           the turgor-driven solute efflux system that protects
           bacteria from lysis in the event of osmotic shock. The
           MscS protein alone is sufficient to form a functional
           mechanosensitive channel gated directly by tension in
           the lipid bilayer. The MscS proteins are heptamers of
           three transmembrane subunits with seven converging M3
           domains, and this MscS_porin is towards the N-terminal
           of the molecules. The high concentration of negative
           charges at the extracellular entrance of the pore helps
           select the cations for efflux.
          Length = 239

 Score = 36.1 bits (84), Expect = 0.082
 Identities = 29/148 (19%), Positives = 60/148 (40%), Gaps = 22/148 (14%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           Q+ Q+++Q   ++   ++T      Q QL+  ++   EI+  L   +   T L +  +T 
Sbjct: 95  QELQEQLQQENSQLIELQTRPERA-QQQLSEARRRLQEIRNRLQALSPGGTPLAQAQRTL 153

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
           L              A+ A++K ++ E     L+NN +++L   Q  L    +       
Sbjct: 154 LQ-------------AELAALKAQIEELELELLSNNNRQELLRLQRDLLKKRIERLEAEL 200

Query: 121 Q--------KEKTIVENAAKQLKQISEN 140
           Q        K     E A ++ ++++E 
Sbjct: 201 QALQNAINRKRLAESEQAVEEAERLAEQ 228


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 36.3 bits (84), Expect = 0.12
 Identities = 29/164 (17%), Positives = 63/164 (38%), Gaps = 6/164 (3%)

Query: 3   AQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQ-DEIQTLLAKSASIKTELREFYQTQL 61
            + E     A+  S K +     + QL N +K Q +     + +      +     +   
Sbjct: 47  FEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALE 106

Query: 62  NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQ 121
             N +   E++ L  +   +  EL++  Q        EK +E     +  +    ++ ++
Sbjct: 107 LLNLEKDKELELLEKELDELSKELQKQLQNTAEI--IEKKRENNKNEERLKFENEKKLEE 164

Query: 122 KEKTIVENAAKQLKQISEN-EFKQKEKTI--VENAAKQLKQISE 162
             +   E   +QL + + + EFK+ E+         K+LK+ +E
Sbjct: 165 SLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLKRRAE 208



 Score = 36.3 bits (84), Expect = 0.12
 Identities = 23/148 (15%), Positives = 58/148 (39%), Gaps = 12/148 (8%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNN----NQKAQDEIQTLLAKSASIKTELREF 56
              + E + L  +         +    +L      N +   E++ L  +   +  EL++ 
Sbjct: 74  INQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQ 133

Query: 57  YQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLH 116
            Q      +K ++  +            L+   + +L  +++ + ++F+ QL  A L L 
Sbjct: 134 LQNTAEIIEKKRENNKNEE--------RLKFENEKKLEESLELEREKFEEQLHEANLDLE 185

Query: 117 REFQQKEKTIVENAAKQLKQISENEFKQ 144
            +  ++++       K+LK+ +E   +Q
Sbjct: 186 FKENEEQRESKWAILKKLKRRAELGSQQ 213



 Score = 34.4 bits (79), Expect = 0.38
 Identities = 31/172 (18%), Positives = 62/172 (36%), Gaps = 7/172 (4%)

Query: 63  NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQK 122
             Q+ Q  I   L +      E R  Y++     + +  ++   Q    +   + + +Q 
Sbjct: 32  IEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQF 91

Query: 123 EKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEA 182
           E  + +  AK       N  K KE   +E   K+L ++S++       L   A+    + 
Sbjct: 92  ELALQDEIAKLEALELLNLEKDKE---LELLEKELDELSKELQKQ---LQNTAEIIEKKR 145

Query: 183 ENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMI 234
           EN K   E  + + + K    L  E  K  E + +    +   + ++     
Sbjct: 146 ENNK-NEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRESK 196


>gnl|CDD|223107 COG0028, IlvB, Thiamine pyrophosphate-requiring enzymes [acetolactate
            synthase, pyruvate dehydrogenase (cytochrome), glyoxylate
            carboligase, phosphonopyruvate decarboxylase] [Amino acid
            transport and metabolism / Coenzyme metabolism].
          Length = 550

 Score = 36.1 bits (84), Expect = 0.13
 Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 10/76 (13%)

Query: 1082 DVAYIGPKVDLRYDYEHLGESADLIKQLASGSHPFSKKLSAAKKPLIVVGADISDGAA-- 1139
            +    GP+  +   Y       + I++ A       + L+ AK+P+I+ G  +    A  
Sbjct: 166  EAEEPGPEPAILPPYRPAPPPPEAIRKAA-------ELLAEAKRPVILAGGGVRRAGASE 218

Query: 1140 -VLALVQQLAAKVTCE 1154
             +  L ++L A V   
Sbjct: 219  ELRELAEKLGAPVVTT 234


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 36.2 bits (84), Expect = 0.13
 Identities = 36/259 (13%), Positives = 90/259 (34%), Gaps = 45/259 (17%)

Query: 4   QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDE--------------IQTLLAKSASI 49
            ++ + L  + A ++ EL E  ++  +  +K + E              ++ + A   + 
Sbjct: 222 SEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKAN 281

Query: 50  KTELREF----------------YQTQLNNNQKAQ--DEIQTLLAK------SASIKTEL 85
           + +LRE                  + QL   +++Q     Q  L +       +  K  L
Sbjct: 282 RAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLAL 341

Query: 86  REFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQ- 144
              +  ++   + E  K   T  +        E      T +E   +Q+K+  ++   Q 
Sbjct: 342 PAEHVKEIAAELAEIDKPATTDSEIPHRLSGSEL-----TQLEVLIQQVKRELQDAKSQL 396

Query: 145 -KEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNH 203
            KE   +E    ++ +      + + +     +    + E  + +AE+ +   +L+    
Sbjct: 397 LKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKE 456

Query: 204 LLKEESKKRELMAKKVESV 222
            ++   K  +   K+  + 
Sbjct: 457 AIEALRKTLDEKTKQKINA 475



 Score = 33.5 bits (77), Expect = 0.93
 Identities = 20/95 (21%), Positives = 43/95 (45%), Gaps = 5/95 (5%)

Query: 418 KEKIESLERNLQT--ETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK 475
           +E++  +++ + T    ++  +L EEL + +    E+ R E  I  +   LE L+ +   
Sbjct: 404 EEELAEVDKKISTIPSEEQIAQLLEELGEAQ---NELFRSEAEIEELLRQLETLKEAIEA 460

Query: 476 RSKNADSLVRLQKAQDEIQTLLAKSASIKTELREF 510
             K  D   + +    E++  +  +   K  L+EF
Sbjct: 461 LRKTLDEKTKQKINAFELERAITIADKAKKTLKEF 495


>gnl|CDD|239153 cd02752, MopB_Formate-Dh-Na-like, Formate dehydrogenase N, alpha
            subunit (Formate-Dh-Na) is a major component of nitrate
            respiration in bacteria such as in the E. coli formate
            dehydrogenase N (Fdh-N). Fdh-N is a membrane protein that
            is a complex of three different subunits and is the major
            electron donor to the nitrate respiratory chain. Also
            included in this CD is the Desulfovibrio gigas tungsten
            formate dehydrogenase, DgW-FDH. In contrast to Fdh-N,
            which is a  functional heterotrimer, DgW-FDH is a
            heterodimer. The DgW-FDH complex is composed of a large
            subunit carrying the W active site and one [4Fe-4S]
            center, and a small subunit that harbors a series of
            three [4Fe-4S] clusters as well as a putative vacant
            binding site for a fourth cluster. The smaller subunit is
            not included in this alignment. Members of the
            MopB_Formate-Dh-Na-like CD belong to the
            molybdopterin_binding (MopB) superfamily of proteins.
          Length = 649

 Score = 36.2 bits (84), Expect = 0.14
 Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 1223 HGDHGASIADAI--LPGAAYTEKQSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
             G    SI   +  LP A   EK+ +  N+    Q     V PPG A+ D  I+  L++
Sbjct: 380  PGMDPKSIQTEVFLLPAACQYEKEGSITNSGRWLQWRYKVVEPPGEAKSDGDILVELAK 438



 Score = 32.0 bits (73), Expect = 2.8
 Identities = 41/178 (23%), Positives = 64/178 (35%), Gaps = 32/178 (17%)

Query: 912  DAVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKG----RFAYDGLKRQRLLTPFVR-N 966
             +VG  +I   + G  +      +  +N   L  KG     F +      RL  P  R  
Sbjct: 7    CSVGCGLIAYVQNGVWVHQEGDPDHPVNRGSLCPKGAALRDFVHSPK---RLKYPMYRAP 63

Query: 967  CDGQLVATEWEDALIAVAQKLQTSEVAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAF 1026
              G+     W++AL  +A+K++    A  V   A A  +V   D +  LGS  L  E  +
Sbjct: 64   GSGKWEEISWDEALDEIARKMKDIRDASFVEKNA-AGVVVNRPDSIAFLGSAKLSNEECY 122

Query: 1027 PLEGAGTDLRANYL-----------------------LNNKIAGAEEADLILLIGTNP 1061
             +      L  N L                       + N     + AD+IL++G NP
Sbjct: 123  LIRKFARALGTNNLDHQARIUHSPTVAGLANTFGRGAMTNSWNDIKNADVILVMGGNP 180


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 35.7 bits (82), Expect = 0.19
 Identities = 42/210 (20%), Positives = 88/210 (41%), Gaps = 20/210 (9%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRS 477
           + +IE LE  L+  T+++ +LEE+        +E      S+      L   R+ + +R 
Sbjct: 170 RSRIEQLETALRHSTEKTEELEEQ-------HKEAQSSSESMSAERNALLAQRAENQQRI 222

Query: 478 KNADSLVR-LQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELT 536
              +  ++ L + + E   +L  +  I+ EL      ++   +K++LK+   Q    E  
Sbjct: 223 LELEQDIQTLTQKKQENDRVLEGTQDIEAELE-----RMKGELKQRLKKMTIQRRDEETE 277

Query: 537 LHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYE---AENKKLKAELTQR 593
                 + E+   +    Q +  S +  A  L  +  +   + +   AE  + + E  Q 
Sbjct: 278 RIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQM 337

Query: 594 DIKLKETNHLLKEE----SKKRELMAKKVE 619
            ++L + N  LKE     +++RE + +  E
Sbjct: 338 SLQLSQLNLALKEGQSQWAQERETLRQSAE 367


>gnl|CDD|203110 pfam04879, Molybdop_Fe4S4, Molybdopterin oxidoreductase Fe4S4
           domain.  This domain is found in formate dehydrogenase H
           for which the structure is known. The first domain
           (residues 1 to 60, 448 to 476, and 499 to 540),
           comprising two small antiparallel sheets and four
           helices, coordinates the Fe4S4 cluster just below the
           protein surface.
          Length = 55

 Score = 31.5 bits (72), Expect = 0.21
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 913 AVGSNIIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYD 952
            VG  ++V  + G++++V    +  +N   L  KGRF Y+
Sbjct: 12  GVGCGLLVHVKDGKIVKVEGDPDHPVNRGRLCVKGRFGYE 51


>gnl|CDD|239157 cd02756, MopB_Arsenite-Ox, Arsenite oxidase (Arsenite-Ox) oxidizes
            arsenite to the less toxic arsenate; it transfers the
            electrons obtained from the oxidation of arsenite towards
            the soluble periplasmic electron carriers cytochrome c
            and/or amicyanin.  Arsenite oxidase is a heterodimeric
            enzyme containing a large and a small subunit. The large
            catalytic subunit harbors the molybdopterin cofactor and
            the [3Fe-4S] cluster; and the small subunit belongs to
            the structural class of the Rieske proteins. The small
            subunit is not included in this alignment. Members of
            MopB_Arsenite-Ox CD belong to the molybdopterin_binding
            (MopB) superfamily of proteins.
          Length = 676

 Score = 35.5 bits (82), Expect = 0.22
 Identities = 34/125 (27%), Positives = 44/125 (35%), Gaps = 26/125 (20%)

Query: 957  QRLLTPFVRNCDGQLVATEWEDALIAVAQKLQTSEVAGVVGSLA-DAEAMVALKD----- 1010
             RL TP VR   GQL  T W+DA+  VA+      + G++     D     +  D     
Sbjct: 116  TRLTTPLVRR-GGQLQPTTWDDAIDLVARV-----IKGILDKDGNDDAVFASRFDHGGGG 169

Query: 1011 --LLNKLGSEDLYTEY------------AFPLEGAGTDLRANYLLNNKIAGAEEADLILL 1056
                N  G    +               A+  E   T       LNN    A  AD I+L
Sbjct: 170  GGFENNWGVGKFFFMALQTPFVRIHNRPAYNSEVHATREMGVGELNNSYEDARLADTIVL 229

Query: 1057 IGTNP 1061
             G NP
Sbjct: 230  WGNNP 234


>gnl|CDD|226369 COG3851, UhpB, Signal transduction histidine kinase,
           glucose-6-phosphate specific [Signal transduction
           mechanisms].
          Length = 497

 Score = 35.1 bits (81), Expect = 0.25
 Identities = 43/196 (21%), Positives = 85/196 (43%), Gaps = 31/196 (15%)

Query: 404 LYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVS----- 458
           L  + +  + L I+ +++  L + LQ E  R+R L E+L  T    EE +RK+V+     
Sbjct: 257 LLAQALTGLGLGIAIQRLRELNQRLQKELARNRALAEQLIST----EESIRKDVARELHD 312

Query: 459 --------IVTVTLFLERLRSSHSKRSKNADSLVRLQ-KAQDEIQTLLAK---------- 499
                   I T    ++R    +++  + A  + +L  +  D ++ LL +          
Sbjct: 313 EIGQNITAIRTQAGIVKRAAD-NAQVKQAASLIEQLSLRIYDSVRQLLGRLRPRQLDDLT 371

Query: 500 -SASIKTELREFYQTQLNNNVKEKLKEFQTQLD-SAELTLHREFQQKEKTIVENAAKQLK 557
              +I++ LRE    +   + +   +  +T LD +  +TL+R  Q+    I ++A     
Sbjct: 372 LEQAIRSLLREMELEERGIHCQLDWRINETALDETQRVTLYRLCQELLNNICKHADASAV 431

Query: 558 QISEKPGATKLMLDFK 573
            I       +LML+ +
Sbjct: 432 TIQLWQQDERLMLEIE 447


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 35.1 bits (81), Expect = 0.25
 Identities = 50/239 (20%), Positives = 87/239 (36%), Gaps = 27/239 (11%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFY--- 57
            +     + L       + +L++  Q +    Q+  D +Q  L +      +  E     
Sbjct: 157 NEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRL-DFLQFQLEELEEADLQPGEDEALE 215

Query: 58  --QTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQL---DSAE 112
             Q +L+N +K ++  Q  LA        LR     Q  + + E L E Q  L       
Sbjct: 216 AEQQRLSNLEKLRELSQNALAA-------LRGDVDVQEGSLL-EGLGEAQLALASVIDGS 267

Query: 113 LTLHREFQQKEKTIVENAAKQLKQISEN-EFKQKEKTIVENAAKQLKQISEKPGATKLML 171
           L    E      T VE A ++L+   +  EF  +    +E    Q+K++  K        
Sbjct: 268 LRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERLAQIKRLKRK-------- 319

Query: 172 DFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 230
            + A  E      +K+K EL Q D   +    L +E  K  E + K   ++   + + A
Sbjct: 320 -YGASVEEVLEYAEKIKEELDQLDDSDESLEALEEEVDKLEEELDKAAVALSLIRRKAA 377


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 35.1 bits (81), Expect = 0.26
 Identities = 36/219 (16%), Positives = 71/219 (32%), Gaps = 25/219 (11%)

Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
           LK  +++I +LE+ ++ +  +  KLE++L+        +  +          L       
Sbjct: 40  LKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQ----------LIETADDL 89

Query: 474 SKRSKN----ADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQ 529
            K  K        L  L+  + E +  LA+      +L    ++  N      +     Q
Sbjct: 90  KKLRKQIADLNARLNALEVQEREQRRRLAE------QLAALQRSGRNPPPALLVSPEDAQ 143

Query: 530 LDSAELTLHREFQQ---KEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKL 586
                   +        +    ++   KQL  +  +  A +  L          A+  KL
Sbjct: 144 RSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAEL--TTLLSEQRAQQAKL 201

Query: 587 KAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQ 625
              L +R   L + N  L  + KK E +      +    
Sbjct: 202 AQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEI 240



 Score = 32.4 bits (74), Expect = 1.9
 Identities = 34/223 (15%), Positives = 70/223 (31%), Gaps = 25/223 (11%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
            K + ++++L  + AS++ +L E     L   +K   ++   L      + E R     Q
Sbjct: 62  AKLEKQLKSLETEIASLEAQLIET-ADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQ 120

Query: 61  LNNNQK-----------------AQDEIQTLLAKSA-SIKTELREFYQTQ-LNNNVKEKL 101
           L   Q+                     +         +    +     T      V+ ++
Sbjct: 121 LAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEI 180

Query: 102 KEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQ--LKQ 159
              Q +L +       +  +  + + E   K+      +E    +K + E  A +  LK 
Sbjct: 181 AAEQAELTTLLSEQRAQQAKLAQLLEER--KKTLAQLNSELSADQKKLEELRANESRLKN 238

Query: 160 -ISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKET 201
            I+    A     +  A  E   A  +  +A+ T    K    
Sbjct: 239 EIASAEAAAAKAREAAAAAEAAAARARAAEAKRTGETYKPTAP 281


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 34.6 bits (80), Expect = 0.32
 Identities = 31/125 (24%), Positives = 54/125 (43%), Gaps = 14/125 (11%)

Query: 413 TLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSS 472
            +K  + ++  L+  LQ   Q+   + EEL+  R+  E   +  VS +   L LER R+ 
Sbjct: 152 QIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKE---KGLVSRLE-LLELERERAE 207

Query: 473 HSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDS 532
                  A   +   +A  E++ L  +   ++ E ++  QT     V E+L E Q +L  
Sbjct: 208 -------AQGELGRLEA--ELEVLKRQIDELQLERQQIEQT-FREEVLEELTEAQARLAE 257

Query: 533 AELTL 537
               L
Sbjct: 258 LRERL 262



 Score = 31.5 bits (72), Expect = 3.5
 Identities = 34/165 (20%), Positives = 67/165 (40%), Gaps = 22/165 (13%)

Query: 4   QDEIQTLLAKSASIKTELREFYQTQLNNNQK----AQDEIQ---TLLAKSASIKTELREF 56
           + +++ +LA+   ++ EL    Q QL   ++      +E++    L  K    + EL E 
Sbjct: 143 RAQLELILAQIKQLEAELAG-LQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELLE- 200

Query: 57  YQTQLNNNQKAQDEIQTLLAKSASIKTELREF------YQTQLNNNVKEKLKEFQTQLDS 110
                    +AQ E+  L A+   +K ++ E        +      V E+L E Q +L  
Sbjct: 201 ---LERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEVLEELTEAQARLAE 257

Query: 111 AELTLHR-EFQQKEKTI---VENAAKQLKQISENEFKQKEKTIVE 151
               L++   + +   I   V+   + LK  +     Q  +T++E
Sbjct: 258 LRERLNKARDRLQRLIIRSPVDGTVQSLKVHTVGGVVQPGETLME 302


>gnl|CDD|235931 PRK07092, PRK07092, benzoylformate decarboxylase; Reviewed.
          Length = 530

 Score = 34.9 bits (81), Expect = 0.33
 Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 1/31 (3%)

Query: 1120 LSAAKKPLIVVGADISDGAAVLALVQQLAAK 1150
            L AA++P +VVG  + D A       +LA +
Sbjct: 203  LDAARRPALVVGPAV-DRAGAWDDAVRLAER 232


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 34.6 bits (80), Expect = 0.37
 Identities = 46/261 (17%), Positives = 92/261 (35%), Gaps = 66/261 (25%)

Query: 6   EIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELR--EFYQTQLN- 62
           +I  L       K ++RE  Q           +IQTL  K   I+ +++       +   
Sbjct: 161 DISVLSEMDKLNKDKIRELNQ-----------QIQTLDMKIDHIQQQIKTYNKNIEEQRK 209

Query: 63  ----NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKE---------KLKEFQTQLD 109
               N  + Q++   L+ ++ +IK E+ E    +L N V +         KL     ++ 
Sbjct: 210 KNGENIARKQNKYDELVEEAKTIKAEIEEL-TDELLNLVMDIEDPSAALNKLNTAAAKIK 268

Query: 110 S------AELTLHRE----------FQQKEKTIVE------NAAKQLKQISE-------- 139
           S        + ++ +            +    I +           L+++          
Sbjct: 269 SKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI 328

Query: 140 -NEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKL 198
            +EF ++ K ++E   K    IS    +    L  KAK    +A  ++L+AE      +L
Sbjct: 329 MDEFNEQSKKLLELKNK----ISTNKQSLIT-LVDKAK--KVKAAIEELQAEFVDNAEEL 381

Query: 199 KETNHLLKEESKKRELMAKKV 219
            +    L +  K +  + K+ 
Sbjct: 382 AKLQDELDKIVKTKSELVKEK 402



 Score = 33.1 bits (76), Expect = 1.3
 Identities = 18/104 (17%), Positives = 40/104 (38%), Gaps = 9/104 (8%)

Query: 3   AQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREF---YQT 59
             D I  +  K   ++  L +   T ++  ++  DE      K   +K ++        T
Sbjct: 297 GPDRITKIKDKLKELQHSLEKL-DTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLIT 355

Query: 60  QLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKE 103
            ++  +K +  I+ L A+      EL      +L + + + +K 
Sbjct: 356 LVDKAKKVKAAIEELQAEFVDNAEEL-----AKLQDELDKIVKT 394


>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit
          4.  Members of this family function as part of the
          Mediator (Med) complex, which links DNA-bound
          transcriptional regulators and the general
          transcription machinery, particularly the RNA
          polymerase II enzyme. They play a role in basal
          transcription by mediating activation or repression
          according to the specific complement of transcriptional
          regulators bound to the promoter.
          Length = 176

 Score = 33.5 bits (77), Expect = 0.38
 Identities = 18/85 (21%), Positives = 39/85 (45%), Gaps = 9/85 (10%)

Query: 4  QDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN 63
           +  + L++    +K+ L      +L  +Q+ Q  I  L A+  S+  ++++    +L  
Sbjct: 1  LELAEDLISADDELKSAL-----KELEEHQENQQRILELRAEVESLDEKIKDI-LKELKE 54

Query: 64 NQKAQDEIQTLLAKSASIKTELREF 88
             A+ E++TL   S+  K   R+ 
Sbjct: 55 ---AEKELRTLPDLSSINKANKRKV 76


>gnl|CDD|216842 pfam02009, Rifin_STEVOR, Rifin/stevor family.  Several multicopy
           gene families have been described in Plasmodium
           falciparum, including the stevor family of subtelomeric
           open reading frames and the rif interspersed repetitive
           elements. Both families contain three predicted
           transmembrane segments. It has been proposed that stevor
           and rif are members of a larger superfamily that code
           for variant surface antigens.
          Length = 290

 Score = 33.9 bits (78), Expect = 0.47
 Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 3/44 (6%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVT 461
           KE ++   R  QT  QR ++ +E ++D RQ C+E   KE+  + 
Sbjct: 38  KEVMDKFNR--QTS-QRFKEYDERMKDKRQKCKEQCDKEIQKII 78


>gnl|CDD|227731 COG5444, COG5444, Uncharacterized conserved protein [Function
           unknown].
          Length = 565

 Score = 34.4 bits (79), Expect = 0.49
 Identities = 39/205 (19%), Positives = 70/205 (34%), Gaps = 27/205 (13%)

Query: 17  IKTELREFYQTQLNNNQKAQDEIQTLLAKSASI----KTELREFYQTQLNNNQKAQDEIQ 72
           +K EL   Y       ++ +  I+ +L+  + I    K +  +F     +  + A+D ++
Sbjct: 101 LKHELPNGYDRYEEIMEEQKKAIKDILSDVSHILDLGKLDTEDFKDEVDSARKHAKDTLE 160

Query: 73  TLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAK 132
            L            +     L   V+ K++E +  +           Q +E TI   A K
Sbjct: 161 KLYK---------LDQEGMTLMAAVESKMQELKAIIR----------QLEEWTIKGGATK 201

Query: 133 QLKQISENEFKQKEKTIVENAA---KQLKQ-ISEKPGATKLMLDFKAKCEVYEAENKKLK 188
           +   I        E TI + AA    Q +  +  K    K   D +   +  E+ NK   
Sbjct: 202 KGVPIHYVAKAFAEVTIHKKAAEVALQSETYLDIKTELAKERPDMRDLDKPLESANKTGY 261

Query: 189 AELTQRDIKLKETNHLLKEESKKRE 213
             L   DI          + +  R 
Sbjct: 262 EGLAGEDIVPFFLAEETGQLAYSRA 286


>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
           (trigger factor) [Posttranslational modification,
           protein turnover, chaperones].
          Length = 441

 Score = 34.2 bits (79), Expect = 0.50
 Identities = 40/191 (20%), Positives = 80/191 (41%), Gaps = 17/191 (8%)

Query: 45  KSASIKTELREFYQTQL--NNNQKAQD--EIQTLLAKSASIKTELREFYQTQLNNNVKEK 100
           K A+ K +++E  + +L   +++ A+   E  TL      ++  L    +       KE+
Sbjct: 229 KEATFKVKVKEVKKRELPELDDEFAKKLGEEDTLEELKEKLRKNLERELKEATLEKRKEQ 288

Query: 101 LKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQI 160
           L +   + +  +L      + +   +++ A +QL+Q   +   +      E   ++ K+ 
Sbjct: 289 LLDALVEANDFDLP-ESLVEAEIDNLLKQALQQLQQQGIDSL-EASGESEEELREEFKEE 346

Query: 161 SEKPGATKLMLDFKAKCEVYEAENKKLKAELT-----------QRDIKLKETNHLLKEES 209
           +EK     L+L+  AK E  E   +++KAE+            +  IKL   N  L +  
Sbjct: 347 AEKRVKLGLLLEEIAKEEKLEVTEEEIKAEIEELARQYGGEQPEEVIKLYYNNQELLDAL 406

Query: 210 KKRELMAKKVE 220
           K   L  K V+
Sbjct: 407 KADILEEKAVD 417


>gnl|CDD|215786 pfam00205, TPP_enzyme_M, Thiamine pyrophosphate enzyme, central
            domain.  The central domain of TPP enzymes contains a
            2-fold Rossman fold.
          Length = 136

 Score = 32.5 bits (75), Expect = 0.50
 Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 3/38 (7%)

Query: 1118 KKLSAAKKPLIVVG--ADISDG-AAVLALVQQLAAKVT 1152
            + L++AK+P+I+ G     S     + AL ++L   V 
Sbjct: 6    ELLASAKRPVILAGGGVRRSGASEELRALAEKLGIPVV 43



 Score = 29.8 bits (68), Expect = 3.9
 Identities = 28/101 (27%), Positives = 36/101 (35%), Gaps = 34/101 (33%)

Query: 982  AVAQKLQTSE----VAGVVGSLADAEAMVALKDLLNKLG-------------SED--LYT 1022
              A+ L +++    +AG     + A     L+ L  KLG              ED  LY 
Sbjct: 3    KAAELLASAKRPVILAGGGVRRSGASE--ELRALAEKLGIPVVTTLMGKGAFPEDHPLYL 60

Query: 1023 EYAFPLEGAGTDLRANYLLNNKIAGAEEADLILLIGTNPRF 1063
                   G      AN  L       EEADL+L IG   RF
Sbjct: 61   GML----GMHGTPAANEAL-------EEADLVLAIGA--RF 88


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 34.0 bits (79), Expect = 0.53
 Identities = 53/271 (19%), Positives = 109/271 (40%), Gaps = 59/271 (21%)

Query: 4   QDEIQTLL---AKSASIKTELREFYQ----TQLNNNQK---AQDEIQTLLAKSASIKTEL 53
            +E+Q LL    K+     +L++ Y+    + L N      A DE++  L     ++ E 
Sbjct: 125 LEELQELLESEEKNREEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLEN---LEEEF 181

Query: 54  REFYQTQLNNN---QKAQDEIQTLLAKSASIKTELREFYQ--TQLNNNVKEKLKEFQT-- 106
            +F   +L  +    +A++ +  L  + A+++  + E  +   +L   + ++L+E +   
Sbjct: 182 SQF--VELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTELPDQLQELKAGY 239

Query: 107 -QLDSA-----ELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQI 160
            +L         L + +E Q  ++ I EN    L  + E +        ++ A ++ ++I
Sbjct: 240 RELVEEGYHLDHLDIEKEIQDLKEQIDEN----LALLEELD--------LDEAEEKNEEI 287

Query: 161 SEKPGATKLMLD-----FKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEE----SKK 211
            E+      + D      KA+  V     +K    L       KE N  LKEE     + 
Sbjct: 288 QER---IDQLYDILEREVKARKYV-----EKNSDTLPDFLEHAKEQNKELKEEIDRVKQS 339

Query: 212 RELMAKKVESVMQSQLQQAIHMITSDVTHLH 242
             L   ++ES    QL++ +  +      + 
Sbjct: 340 YTLNESELES--VRQLEKQLESLEKQYDEIT 368



 Score = 32.1 bits (74), Expect = 2.3
 Identities = 25/135 (18%), Positives = 55/135 (40%), Gaps = 26/135 (19%)

Query: 516 NNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLD---- 571
           + ++++++++ + Q+D     L       E+  ++ A ++ ++I E+      + D    
Sbjct: 251 HLDIEKEIQDLKEQIDENLALL-------EELDLDEAEEKNEEIQER---IDQLYDILER 300

Query: 572 -FKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEE----SKKRELMAKKVESVMQSQL 626
             KA+  V     +K    L       KE N  LKEE     +   L   ++ES    QL
Sbjct: 301 EVKARKYV-----EKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELES--VRQL 353

Query: 627 QQAIHMITSDVTHLH 641
           ++ +  +      + 
Sbjct: 354 EKQLESLEKQYDEIT 368


>gnl|CDD|239167 cd02766, MopB_3, The MopB_3 CD includes a group of related
            uncharacterized bacterial and archaeal
            molybdopterin-binding oxidoreductase-like domains with a
            putative N-terminal iron-sulfur [4Fe-4S] cluster binding
            site and molybdopterin cofactor binding site. These
            members belong to the molybdopterin_binding (MopB)
            superfamily of proteins.
          Length = 501

 Score = 34.1 bits (79), Expect = 0.54
 Identities = 45/167 (26%), Positives = 66/167 (39%), Gaps = 35/167 (20%)

Query: 918  IIVSTRTGEVLRVLPRLNEDINEEWLADKGRFAYDGLKRQ----RLLTPFVRN--CDGQL 971
            ++V+   G ++RV           ++  KG   Y  ++R     RLLTP  R     GQ 
Sbjct: 14   LLVTVEDGRIVRVEGDPAHPYTRGFICAKGA-RY--VERVYSPDRLLTPLKRVGRKGGQW 70

Query: 972  VATEWEDALIAVAQKLQ------TSEV------AGVVGSLA-DAEAMVALKDLLNKLG-S 1017
                W++AL  +A KL+        E       AG +G L   A          + LG S
Sbjct: 71   ERISWDEALDTIAAKLKEIKAEYGPESILPYSYAGTMGLLQRAARGR-----FFHALGAS 125

Query: 1018 EDLYTEYAFPLEGAGTD-LRANY--LLNNKIAGAEEADLILLIGTNP 1061
            E   T       GAG +  + ++   L N       ADLI++ G NP
Sbjct: 126  ELRGT----ICSGAGIEAQKYDFGASLGNDPEDMVNADLIVIWGINP 168



 Score = 33.4 bits (77), Expect = 0.94
 Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 13/59 (22%)

Query: 1228 ASIADAILPGAAYTEK-------QSTYVNTEGRAQQTLTAVTPPGLAREDWKIIRALSE 1279
            A  AD +LP   + E           Y+      +    A+ PPG AR + +I R L++
Sbjct: 371  ARYADIVLPATTFLEHEDVYASYWHYYL---QYNEP---AIPPPGEARSNTEIFRELAK 423


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 34.2 bits (78), Expect = 0.63
 Identities = 109/636 (17%), Positives = 239/636 (37%), Gaps = 61/636 (9%)

Query: 6   EIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQ 65
           E++   +K  S    ++ F+  +L   +  +          A+  T  +E  +     NQ
Sbjct: 18  ELERKQSKLGSSMNSIKTFWSPELKRERALRK-------DEAARITVWKEQLRVTQEENQ 70

Query: 66  KAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQ-TQLDSAELTLHREFQQKEK 124
            AQ  +Q L  +  S +   R   + + + N   +L++ + ++   + +   RE Q  E 
Sbjct: 71  HAQLTVQALQEELKSQRELNRLQQKLEKSGNETGELQQLEKSEGQFSRIQAERERQAMEL 130

Query: 125 TIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAEN 184
            ++ +  ++L+     E  Q+     +   K+L ++ +  G     L  KA+ E  E   
Sbjct: 131 FLLRDTLEKLQ--LRMERLQQTLDARDEEIKKLLEMLQSKG-----LSAKAEEEDSERLG 183

Query: 185 KKLKAELTQRDI----KLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTH 240
           +  +AE  Q  +    + KE  H++  E   ++L  ++ ++  ++  Q+ I    + +  
Sbjct: 184 RIREAEDQQSHLEVLLEQKEKEHMMLREEIHQKLQMERDDAKTEA-SQKLIDEKDTKIKE 242

Query: 241 LHTSKESLATRQPSEQSLID-STTDTNCPVTQQYVPSSNQQYAPSSNQQYSIPTSNINQQ 299
                E         QS  D S  D N    +     S      +   + +   S    +
Sbjct: 243 FEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTE 302

Query: 300 YSMFSADSTHMNIVERDIPNSADDMRKYIEMQYSMFSADSTHMNIVERDIPNSADDMRKY 359
                   T +      + N   DMR++++      +       I++ ++    D +R  
Sbjct: 303 ---LLGLQTEL----ETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEV----DALRYE 351

Query: 360 IE---MLLEKPPGNPVSDLEAEVKDSAGPMTSFLESSVTSDIFNEPALYTKTVRSVTLKI 416
           +E     L K   + +   + E    AG            +I +    Y KT R   L++
Sbjct: 352 LERKHNTLTKKTAS-LQAAQEEKATYAG------------EIEDMRDRYEKTERK--LRV 396

Query: 417 SKEKIESLERNLQTETQRSRKLEE---ELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
            ++KIE+L+   + + +R ++ +E    LQ    T   + + E ++      +ERL+   
Sbjct: 397 LQKKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQR 456

Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREF-YQTQLNNNVKEKLKEFQTQLDS 532
            +  +        +  + E + L  +  +++ +L E   Q +L      KL   Q +  S
Sbjct: 457 DRDER--YEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQLKQRS 514

Query: 533 AELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ 592
                H E ++  +   E   K+LK++   P +        A      A++   + E+  
Sbjct: 515 DLERAHIELEKIREKH-EKLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVD- 572

Query: 593 RDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQ 628
              +L +     ++E    E+ A ++   ++   + 
Sbjct: 573 ---RLLDRLEKAEQERDDTEMEAGRLAKELEKAQRH 605


>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory
           protein.  This family represents an accessory protein
           that works with the bacteriocin maturation and ABC
           transport secretion protein described by TIGR01193
           [Transport and binding proteins, Other].
          Length = 457

 Score = 33.9 bits (78), Expect = 0.67
 Identities = 49/241 (20%), Positives = 86/241 (35%), Gaps = 47/241 (19%)

Query: 1   QKAQDEIQTLLAKSASIKTELREF-------YQTQLNN-----------NQKAQDEIQTL 42
              +D+ ++L     SI+    +F       Y+   N             Q+  D+ QT 
Sbjct: 107 DNLKDQKKSLDTLKQSIENGRNQFPTDDSFGYRNLFNGYLAQVESLTSETQQQNDKSQTQ 166

Query: 43  LAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREF--YQTQLNNNVKEK 100
              +   K +L      Q++   +   + Q L   + S  T++  F  YQ+   N     
Sbjct: 167 NEAAEKTKAQL----DQQISKTDQKLQDYQALK-NAISNGTKVANFNPYQSLYEN----- 216

Query: 101 LKEFQTQLDSAELTLHREFQQKEKTIVENAAKQ---LKQISENEFKQKEKTIVENAAKQL 157
              +Q QL SA     +   Q + TI+    +Q   L++   +   QK       A+   
Sbjct: 217 ---YQAQLKSASDKDQKN--QVKSTILATIQQQIDQLQKSIASYQVQKAGLTKSTASNYA 271

Query: 158 KQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAK 217
              + K    K     K K E+ +   K L+ E   +          LKE+S+K  + A 
Sbjct: 272 SSQNSKLAQLKEQQLAKVKQEITDLNQKLLELESKIK---------SLKEDSQKGVIKAP 322

Query: 218 K 218
           +
Sbjct: 323 E 323


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
           represents an N-terminal conserved region found in
           several huntingtin-associated protein 1 (HAP1)
           homologues. HAP1 binds to huntingtin in a polyglutamine
           repeat-length-dependent manner. However, its possible
           role in the pathogenesis of Huntington's disease is
           unclear. This family also includes a similar N-terminal
           conserved region from hypothetical protein products of
           ALS2CR3 genes found in the human juvenile amyotrophic
           lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 33.3 bits (76), Expect = 0.74
 Identities = 58/225 (25%), Positives = 102/225 (45%), Gaps = 26/225 (11%)

Query: 8   QTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKA 67
           Q+LL ++  +  +  E  + QL    KA+DEI  L     ++K EL +FY +  +   + 
Sbjct: 87  QSLLKQNKVLMEK-NEKLEEQLG---KARDEILQL-RHELNLKDELLQFY-SDADEESED 140

Query: 68  QDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHRE---FQQKEK 124
           +    T L    S  +    F    L   +K  L+E    L S    L  E   +++KE+
Sbjct: 141 ESSESTPLRPQESSSSSHGCFQLEALQEKLKL-LEEENEHLRSEASHLKTETVTYEEKEQ 199

Query: 125 TIVENAAKQLK----QISENEFKQKEKTIVENAAKQLKQISEKPGATKLM---LDFKAKC 177
            +V +  KQL+    QI+    +  +KT  E+  +Q ++I      T L+   +D + KC
Sbjct: 200 QLVNDCVKQLREANDQIASLSEELAKKT--EDLERQQEEI------THLLSQIVDLQKKC 251

Query: 178 EVYEAENKKLKAEL-TQRDIKLKETNHLLKEESKKRELMAKKVES 221
           + Y  EN++L+  L   +D + +    L + + K  E M    E+
Sbjct: 252 KSYALENEELQQHLAAAKDAQRQLQAELQELQDKYAECMEMLHEA 296


>gnl|CDD|219925 pfam08598, Sds3, Sds3-like.  Repression of gene transcription is
           mediated by histone deacetylases containing
           repressor-co-repressor complexes, which are recruited to
           promoters of target genes via interactions with
           sequence-specific transcription factors. The
           co-repressor complex contains a core of at least seven
           proteins. This family represents the conserved region
           found in Sds3, Dep1 and BRMS1-homologue p40 proteins.
          Length = 184

 Score = 32.7 bits (75), Expect = 0.79
 Identities = 26/97 (26%), Positives = 43/97 (44%), Gaps = 10/97 (10%)

Query: 520 KEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKC--E 577
           +E+L + QT+L+      H E+ +  K + E    +LK         +L  ++K +C   
Sbjct: 23  RERLAQLQTELELLLQGTHPEYLEPLKDLEERRDDRLKVA-------ELRREYKLECIER 75

Query: 578 VYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELM 614
            YEAE +  K E  +    L+E   L + E K   L 
Sbjct: 76  EYEAERQAAKQEFEKEKRLLRE-RLLEELEEKIYRLE 111


>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57.  The
           CLD or centrosome localisation domain of Cep57 is found
           at the N-terminus, and lies approximately between
           residues 58 and 239. This region lies within the first
           alpha-helical coiled-coil segment of Cep57, and
           localises to the centrosome internally to gamma-tubulin,
           suggesting that it is either on both centrioles or on a
           centromatrix component. This N-terminal region can also
           multimerise with the N-terminus of other Cep57
           molecules. The C-terminal part, Family Cep57_MT_bd,
           pfam06657, is the microtubule-binding region of Cep57.
          Length = 178

 Score = 32.5 bits (74), Expect = 0.80
 Identities = 44/167 (26%), Positives = 77/167 (46%), Gaps = 22/167 (13%)

Query: 67  AQDEIQTLLAKSASIKTELREFYQTQ--LNNNVKEKLKEFQTQLDSAELTLHREFQQKE- 123
           A+D +QTL  ++   K  L E  Q +    N + ++ K+  +QL SAE       +Q E 
Sbjct: 23  AEDNLQTLSREATHYKKVLEEEEQERNLAANELTKQNKDLTSQLSSAESRCSLLEKQLEY 82

Query: 124 -KTIVENAAKQLKQISENEFK-QKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYE 181
            + +VENA K+   + E +   Q+EK   E+    ++   EK              EV E
Sbjct: 83  MRRMVENAEKERTAVLEQQVSLQREK---EHDQMDVQAQLEK-------------LEVLE 126

Query: 182 AENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQ 228
            E  +L    +  + K+++    L+EE  +R+L+  K   + Q+ L+
Sbjct: 127 KEYLRLTRTQSLAETKIQQLEEKLQEEEHQRKLVQDKAAEL-QTGLE 172



 Score = 32.1 bits (73), Expect = 0.98
 Identities = 41/151 (27%), Positives = 72/151 (47%), Gaps = 13/151 (8%)

Query: 489 AQDEIQTLLAKSASIKTELREFYQTQ--LNNNVKEKLKEFQTQLDSAELTLHREFQQKE- 545
           A+D +QTL  ++   K  L E  Q +    N + ++ K+  +QL SAE       +Q E 
Sbjct: 23  AEDNLQTLSREATHYKKVLEEEEQERNLAANELTKQNKDLTSQLSSAESRCSLLEKQLEY 82

Query: 546 -KTIVENAAKQLKQISEKPG-----ATKLMLDFKA---KCEVYEAENKKLKAELTQRDIK 596
            + +VENA K+   + E+            +D +A   K EV E E  +L    +  + K
Sbjct: 83  MRRMVENAEKERTAVLEQQVSLQREKEHDQMDVQAQLEKLEVLEKEYLRLTRTQSLAETK 142

Query: 597 LKETNHLLKEESKKRELMAKKVESVMQSQLQ 627
           +++    L+EE  +R+L+  K   + Q+ L+
Sbjct: 143 IQQLEEKLQEEEHQRKLVQDKAAEL-QTGLE 172


>gnl|CDD|240667 cd11657, TIN2_N, N-terminal domain of TRF-interacting nuclear
           factor 2; shelterin complex protein of telomeres.  TIN2
           is one of the six proteins of shelterin complex, which
           acts to protect telomeres from DNA damage repair
           machinery. TIN2 binds directly to TRF1 and TRF2 and
           stabilizes TRF2 complex-telomere binding by tethering it
           to the TRF1 complex. TIN2 binding to TRF2 is primarily
           via the TRF binding motif (TBM) region and the
           N-terminus, while the far C-terminal region has lower
           affinity. The TIN2 TBM, but not the N-terminal region,
           is involved in TIN2 binding to TRF1. Truncation of the
           TIN2 N-terminus in mouse results in telomere elongation,
           suggesting a negative regulatory function of this
           region. Three shelterin components (TRF1, TRF2, POT1)
           bind DNA and 3 components (TIN2, RAP1, TPP1) are
           recruited by these DNA binding factors. TRF1 activity at
           telomeres is regulated in part by selective
           ubiquitination and degradation. Ubiquitination of TRF1
           is mediated by Fbx4, which binds TRF1 in the TRFH
           domain, via a small GTPase module. When bound to
           telomeres, TIN2 acts to protect TRF1 from SCF-Fbx4
           mediated ubiquitination. F-box proteins act in substrate
           recognition as part of Skp1-Cul1-Rbx1-F- box (SCF)
           protein complexes. Tankyrase-mediated ADP-ribosylation
           releases TRF1 from telomeres, rendering them susceptible
           to ubiquitination and degradation, promoting telomere
           elongation. TIN2 also binds PIP1, which recruits POT1 to
           telomeres.
          Length = 188

 Score = 32.6 bits (75), Expect = 0.80
 Identities = 22/133 (16%), Positives = 49/133 (36%), Gaps = 16/133 (12%)

Query: 451 EVVRKEVSIVTVTL--FLERLRSSHSKRSKNADSLVRLQKAQDE----IQTLLA----KS 500
           E+ R E +     L   L RL   +  + +  D    +++ Q      +Q+LL     + 
Sbjct: 60  ELCRNEKAADLAALNPHLPRLLPPYPNKCQRED--ALMEEPQLNFLQLVQSLLKDPEERE 117

Query: 501 ASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTI--VENAAKQLKQ 558
              +      Y  + +  +++ L EF ++L+   L +    +Q    +    +  ++   
Sbjct: 118 HFFQEVFPVEYGEEFDQALEKLLWEFLSRLEKL-LPVPD-LKQTVSWLSTSPSVLEECLD 175

Query: 559 ISEKPGATKLMLD 571
               P   K +L 
Sbjct: 176 SVSDPDDLKSLLQ 188


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 32.9 bits (75), Expect = 0.81
 Identities = 29/133 (21%), Positives = 54/133 (40%), Gaps = 8/133 (6%)

Query: 39  IQTLLAKSASIKTELREFYQTQLNNNQKAQ---DEIQTLLAKSASIKTELREFYQTQLNN 95
           IQ +  +   I    ++  Q Q    Q+A+    + Q+L  + AS   + +E    Q  +
Sbjct: 31  IQVMKKRQQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQH 90

Query: 96  NVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQ-----KEKTIV 150
            + E  ++  T  D     L RE Q+  K + +   +Q+ +I             E+ IV
Sbjct: 91  LLNEAREDVATARDEWLEQLEREKQEFFKALQQQTGQQMVKIIRAALADLANATLEQQIV 150

Query: 151 ENAAKQLKQISEK 163
                +L+ +SE 
Sbjct: 151 GIFIARLEHLSEA 163



 Score = 29.8 bits (67), Expect = 9.3
 Identities = 16/74 (21%), Positives = 34/74 (45%)

Query: 486 LQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKE 545
            ++ + + Q+L  + AS   + +E    Q  + + E  ++  T  D     L RE Q+  
Sbjct: 59  AERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEAREDVATARDEWLEQLEREKQEFF 118

Query: 546 KTIVENAAKQLKQI 559
           K + +   +Q+ +I
Sbjct: 119 KALQQQTGQQMVKI 132


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 33.8 bits (77), Expect = 0.87
 Identities = 41/214 (19%), Positives = 85/214 (39%), Gaps = 18/214 (8%)

Query: 428 LQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLR------SSHSKRSKNAD 481
            Q        LE+ +Q+   T E VV +E S+  V++    ++       S  K   + D
Sbjct: 523 CQDLLDGKADLEKFIQEFSLTLEWVVNQEKSLQDVSVEASEIKKNFLGLKSSEKEINSPD 582

Query: 482 SLVRLQK--AQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHR 539
            +         +E++TL ++  ++  EL +          K KL+E + +L+     L  
Sbjct: 583 EVKGAVCISTLEELETLKSEKENLDGELSKCKDDL--EESKNKLQETEKKLEE----LKS 636

Query: 540 EFQQKEKTIVENAAKQLKQISEKPGATKL-MLDFKAKCEVYEAENKKLKAELTQRDIKLK 598
           E    +++    A  QLK + E   + +L   D +A+ +  + +   L+ EL     +  
Sbjct: 637 ELDASQES-NSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLEDELE--KERQN 693

Query: 599 ETNHLLKEESKKRELMAKKVESVMQSQLQQAIHM 632
               + K    + ++   + E  MQ   ++   +
Sbjct: 694 HQELIAKCRELEEKIERAEQEENMQKLDEEEQKI 727


>gnl|CDD|173195 PRK14733, coaE, dephospho-CoA kinase; Provisional.
          Length = 204

 Score = 32.7 bits (74), Expect = 0.90
 Identities = 28/143 (19%), Positives = 66/143 (46%), Gaps = 8/143 (5%)

Query: 15  ASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELRE-FYQTQLNNNQKAQDEIQT 73
           AS K+      + +LN N    D I   + K  S+  ++ E F    + N Q  +  ++ 
Sbjct: 16  ASGKSTATRILKEKLNLNVVCADTISREITKKPSVIKKIAEKFGDEIVMNKQINRAMLRA 75

Query: 74  LLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL-------HREFQQKEKTI 126
           ++ +S   K  L ++    +N  +K+++KE  T +   ++ L       H ++ +K   I
Sbjct: 76  IITESKEAKKWLEDYLHPVINKEIKKQVKESDTVMTIVDIPLLGPYNFRHYDYLKKVIVI 135

Query: 127 VENAAKQLKQISENEFKQKEKTI 149
             +   +++++ E + K +++ +
Sbjct: 136 KADLETRIRRLMERDGKNRQQAV 158


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 32.3 bits (74), Expect = 0.91
 Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 9/79 (11%)

Query: 94  NNNVKEKLKEFQTQLDSAELTL------HREFQQKEKTIVENAAKQLKQISENEFKQKEK 147
              + + L E +   + A+  L        E +++   I+E A K+ +QI+E    + E+
Sbjct: 39  QAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEE 98

Query: 148 T---IVENAAKQLKQISEK 163
               I E A  +++   E+
Sbjct: 99  ELERIKEAAEAEIEAEKER 117



 Score = 29.2 bits (66), Expect = 8.8
 Identities = 23/134 (17%), Positives = 52/134 (38%), Gaps = 16/134 (11%)

Query: 26  QTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTEL 85
              L   ++ ++E Q LLA+      E RE          +A + I+    ++  I  E+
Sbjct: 43  ADDLAEAERLKEEAQALLAEYEQELEEARE----------QASEIIEQAKKEAEQIAEEI 92

Query: 86  REFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQK 145
           +   + +L    +    E + + + A   L  E  +    +    A++L     +E    
Sbjct: 93  KAEAEEELERIKEAAEAEIEAEKERALEELRAEVAE----LAVAIAEKLLGKKVDE--AA 146

Query: 146 EKTIVENAAKQLKQ 159
           +K +++    +L +
Sbjct: 147 QKDLIDAFIAELGE 160


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 31.5 bits (72), Expect = 1.0
 Identities = 20/94 (21%), Positives = 46/94 (48%), Gaps = 11/94 (11%)

Query: 69  DEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVE 128
            + + L  ++  +K E            ++EKL+E++   ++ + TL    Q+  + +  
Sbjct: 32  KDYEALYKENEELKEE---------IERLEEKLEEYKELEETLQKTLVVA-QETAEEVKA 81

Query: 129 NAAKQLKQISENEFKQKEKTIVENAAKQLKQISE 162
           NA K+ + I   E + K + IV +A ++ K+++ 
Sbjct: 82  NAQKEAELII-KEAEAKAERIVNDANEEAKKLAT 114


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 33.0 bits (75), Expect = 1.1
 Identities = 37/175 (21%), Positives = 62/175 (35%), Gaps = 17/175 (9%)

Query: 58  QTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHR 117
           Q+Q ++ +K + + +    + A      +   Q +L    KE+LK  + Q  + E     
Sbjct: 68  QSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQA 127

Query: 118 EFQQKEKTIVEN--AAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKA 175
           + +QK++       AA+Q K+    + K   +     AA + K+              KA
Sbjct: 128 QLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKK--------------KA 173

Query: 176 KCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 230
           +     AE  K KAE      K  E       E  K E  AK           + 
Sbjct: 174 EEAAKAAEEAKAKAEAAAAKKK-AEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEE 227


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 32.2 bits (74), Expect = 1.1
 Identities = 34/187 (18%), Positives = 77/187 (41%), Gaps = 17/187 (9%)

Query: 51  TELREFYQTQLNNN----QKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQT 106
            E++ +Y    +NN    +  ++EI  +       +  + E    Q N  + E LK  + 
Sbjct: 12  NEIKNYYNDITHNNLELIKSLKEEIAEMKKNEEHNEKLMAEI--AQENKRLVEPLK--KA 67

Query: 107 QLDSAELTLHREFQQKEKTIVENAAKQLKQIS--------ENEFKQKEKTIVENAAKQLK 158
           + +  EL    +  +K+K  ++N   +LK++         E+E  ++    VE    +L 
Sbjct: 68  EEEVEELRKKLKDYEKDKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELY 127

Query: 159 QISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKK 218
              E      +      K  + E + + L  EL +++ +L E       +    + +++K
Sbjct: 128 DKFEA-AIQDVQQKTGLKNLLLEQKLEALNEELEKKEAQLNEVLAAANLDPAALQAVSEK 186

Query: 219 VESVMQS 225
           ++ V++ 
Sbjct: 187 LDDVLEE 193


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 33.1 bits (76), Expect = 1.2
 Identities = 18/140 (12%), Positives = 53/140 (37%), Gaps = 4/140 (2%)

Query: 97  VKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQ 156
           + ++ +    +              + K ++     Q +Q      +++ +   E   ++
Sbjct: 31  LAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQ-EARREREELQREEERLVQK 89

Query: 157 LKQISEKPGATKLMLDFKAKCE-VYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELM 215
            +Q+  +      + +   + E    A   +L+    Q D +L     L  E++  R+L+
Sbjct: 90  EEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQA--RKLL 147

Query: 216 AKKVESVMQSQLQQAIHMIT 235
            K +++ ++ +  Q +  I 
Sbjct: 148 LKLLDAELEEEKAQRVKKIE 167


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
           family consists of several phage minor structural
           protein GP20 sequences of around 180 residues in length.
           The function of this family is unknown.
          Length = 156

 Score = 31.6 bits (72), Expect = 1.2
 Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 6/66 (9%)

Query: 155 KQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKREL 214
           +Q+ +I  + G    + + K+K +    E   LK +L  RD ++++    LK++ K  E 
Sbjct: 2   EQIDKIMAEYGKD--IQNPKSKLDKANEERDSLKKQLKNRDKQIED----LKKKVKDNEE 55

Query: 215 MAKKVE 220
           + KK+E
Sbjct: 56  LQKKIE 61



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 6/66 (9%)

Query: 554 KQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKREL 613
           +Q+ +I  + G    + + K+K +    E   LK +L  RD ++++    LK++ K  E 
Sbjct: 2   EQIDKIMAEYGKD--IQNPKSKLDKANEERDSLKKQLKNRDKQIED----LKKKVKDNEE 55

Query: 614 MAKKVE 619
           + KK+E
Sbjct: 56  LQKKIE 61



 Score = 30.8 bits (70), Expect = 2.5
 Identities = 25/108 (23%), Positives = 46/108 (42%), Gaps = 22/108 (20%)

Query: 517 NNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKC 576
            N K KL +   + DS             K  ++N  KQ++ + +K    +   + + K 
Sbjct: 16  QNPKSKLDKANEERDSL------------KKQLKNRDKQIEDLKKKVKDNE---ELQKKI 60

Query: 577 EVYEAENKKLKAELTQRDIKLKET--NHLLKEESKKRELMAKKVESVM 622
           E  + +NK  K E      KL +T  N+ ++   KK +  A+  ++V 
Sbjct: 61  EKLKQQNKTAKEEYEA---KLADTKLNNAIELALKKAK--ARNAKAVK 103



 Score = 30.0 bits (68), Expect = 4.0
 Identities = 24/107 (22%), Positives = 47/107 (43%), Gaps = 13/107 (12%)

Query: 119 FQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCE 178
             +  K I    +K  K  +  E    +K + +N  KQ++ + +K    +   + + K E
Sbjct: 8   MAEYGKDIQNPKSKLDK--ANEERDSLKKQL-KNRDKQIEDLKKKVKDNE---ELQKKIE 61

Query: 179 VYEAENKKLKAELTQRDIKLKET--NHLLKEESKKRELMAKKVESVM 223
             + +NK  K E      KL +T  N+ ++   KK +  A+  ++V 
Sbjct: 62  KLKQQNKTAKEEYEA---KLADTKLNNAIELALKKAK--ARNAKAVK 103


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 32.9 bits (75), Expect = 1.2
 Identities = 32/193 (16%), Positives = 61/193 (31%), Gaps = 15/193 (7%)

Query: 31  NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQ 90
           N  K   + Q         K    EF    L NN  +      L A    +       Y+
Sbjct: 302 NLLKKLSKEQLKTLNLEKQKLLSIEFNTNFLLNNFVSLINR--LKASVNQVFEFEIYLYK 359

Query: 91  TQLNNN--VKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAK---QLKQISENEFKQK 145
               N   ++ K  E   + +   +   +    K+  IVE + K       + + +    
Sbjct: 360 IINANPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTK-IFY 418

Query: 146 EKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAEN--KKLKAELTQRDIKLKET-- 201
            K  +    K+     E+    + +   +   +  E  +  +  K +   +DI+L +   
Sbjct: 419 HKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNML 478

Query: 202 ---NHLLKEESKK 211
              N    EES +
Sbjct: 479 QKFNKFKNEESAE 491


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 31.5 bits (72), Expect = 1.2
 Identities = 22/109 (20%), Positives = 45/109 (41%), Gaps = 21/109 (19%)

Query: 114 TLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDF 173
           T++R+   K   I++   K+ ++  E E K++++   +   K L Q  E+          
Sbjct: 2   TVYRD---KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEER---------- 48

Query: 174 KAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKK----RELMAKK 218
               E    E +K K +   R    ++ +  LKE+ +      + + KK
Sbjct: 49  ----EKRLEELEKAKNKPLARYADDEDYDEELKEQERWDDPMAQFLRKK 93


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 32.6 bits (75), Expect = 1.3
 Identities = 27/132 (20%), Positives = 45/132 (34%), Gaps = 15/132 (11%)

Query: 14  SASIKTELREFYQTQLNNNQKAQD-----EIQTLLAKSASIKTELREFYQTQLNNNQKAQ 68
              I T L E  Q +    Q         +I  L A+ A+++ E+ E         Q+AQ
Sbjct: 107 EKEIDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQ 166

Query: 69  DEI----QTLLA---KSASIKTELREFYQTQLNN---NVKEKLKEFQTQLDSAELTLHRE 118
            E      T +A        K E  +  Q +L      +   + + + Q  + E      
Sbjct: 167 CEAEGTGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEAQKAALERNRQAA 226

Query: 119 FQQKEKTIVENA 130
             +K+  I  N 
Sbjct: 227 VAEKQARIEAND 238


>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
          Length = 156

 Score = 31.5 bits (72), Expect = 1.6
 Identities = 23/99 (23%), Positives = 44/99 (44%), Gaps = 16/99 (16%)

Query: 65  QKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEK 124
           +K   +   +LAK+  +  +    Y+ +L+   KE       QL+  +       Q++ K
Sbjct: 59  RKNLTKASEILAKANELTKQ----YEQELSKARKE------AQLEITQS------QKEAK 102

Query: 125 TIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEK 163
            IVEN  KQ ++  ++   +  K +     K LK + E+
Sbjct: 103 EIVENELKQAQKYIDSLLNEATKQLEAQKEKALKSLEEQ 141



 Score = 30.7 bits (70), Expect = 2.8
 Identities = 19/109 (17%), Positives = 48/109 (44%), Gaps = 20/109 (18%)

Query: 33  QKAQDEIQTLLAKSASIKTELREFYQTQLNN-NQKAQDEIQTLLAKSASIKTELREFYQT 91
           +K   +   +LAK+  +  +    Y+ +L+   ++AQ EI                  Q 
Sbjct: 59  RKNLTKASEILAKANELTKQ----YEQELSKARKEAQLEITQ---------------SQK 99

Query: 92  QLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEN 140
           +    V+ +LK+ Q  +DS      ++ + +++  +++  +Q+  +S+ 
Sbjct: 100 EAKEIVENELKQAQKYIDSLLNEATKQLEAQKEKALKSLEEQVDTLSDQ 148


>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
           module that binds membranes and detects membrane
           curvature.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions including organelle
           biogenesis, membrane trafficking or remodeling, and cell
           division and migration. Mutations in BAR containing
           proteins have been linked to diseases and their
           inactivation in cells leads to altered membrane
           dynamics. A BAR domain with an additional N-terminal
           amphipathic helix (an N-BAR) can drive membrane
           curvature. These N-BAR domains are found in amphiphysins
           and endophilins, among others. BAR domains are also
           frequently found alongside domains that determine lipid
           specificity, such as the Pleckstrin Homology (PH) and
           Phox Homology (PX) domains which are present in beta
           centaurins (ACAPs and ASAPs) and sorting nexins,
           respectively. A FES-CIP4 Homology (FCH) domain together
           with a coiled coil region is called the F-BAR domain and
           is present in Pombe/Cdc15 homology (PCH) family
           proteins, which include Fes/Fes tyrosine kinases, PACSIN
           or syndapin, CIP4-like proteins, and srGAPs, among
           others. The Inverse (I)-BAR or IRSp53/MIM homology
           Domain (IMD) is found in multi-domain proteins, such as
           IRSp53 and MIM, that act as scaffolding proteins and
           transducers of a variety of signaling pathways that link
           membrane dynamics and the underlying actin cytoskeleton.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The I-BAR domain
           induces membrane protrusions in the opposite direction
           compared to classical BAR and F-BAR domains, which
           produce membrane invaginations. BAR domains that also
           serve as protein interaction domains include those of
           arfaptin and OPHN1-like proteins, among others, which
           bind to Rac and Rho GAP domains, respectively.
          Length = 194

 Score = 31.6 bits (72), Expect = 1.6
 Identities = 33/187 (17%), Positives = 81/187 (43%), Gaps = 19/187 (10%)

Query: 51  TELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDS 110
            EL +  +  + + +K  D ++ L A +  +   L+E  + +L +     L E   +   
Sbjct: 3   DELEKLLKKLIKDTKKLLDSLKELPAAAEKLSEALQELGK-ELPDLSNTDLGEALEKFGK 61

Query: 111 AELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI---------VENAAKQLKQIS 161
            +  L     Q E+ +     + LK+  + + K+ +K            + A ++LK++ 
Sbjct: 62  IQKELEEFRDQLEQKLENKVIEPLKEYLKKDLKEIKKRRKKLDKARLDYDAAREKLKKLR 121

Query: 162 EKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVES 221
           +K   +  + + + + +  + + ++L+ EL +        N L   E K++EL    + S
Sbjct: 122 KKKKDSSKLAEAEEELQEAKEKYEELREELIED------LNKL---EEKRKELFLSLLLS 172

Query: 222 VMQSQLQ 228
            +++Q +
Sbjct: 173 FIEAQSE 179


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
           This model describes a conserved domain found in surface
           proteins of a number of Firmutes. Many members have
           LPXTG C-terminal anchoring motifs and a substantial
           number have the KxYKxGKxW putative sorting signal at the
           N-terminus. The tetracycline resistance plasmid pCF10 in
           Enterococcus faecalis promotes conjugal plasmid transfer
           in response to sex pheromones, but PgrA/Sec10 encoded by
           that plasmid, a member of this family, specifically
           inhibits the ability of cells to receive homologous
           plasmids. The phenomenon is called surface exclusion.
          Length = 356

 Score = 32.4 bits (74), Expect = 1.7
 Identities = 22/100 (22%), Positives = 34/100 (34%), Gaps = 5/100 (5%)

Query: 3   AQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLN 62
            Q  + T  A  A+ +T L     T       AQ       A  A+ + EL       L 
Sbjct: 259 LQAALATAQADLAAAQTAL----NTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQ 314

Query: 63  NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 102
             Q      Q  LA + +   + +E     LN ++ +K  
Sbjct: 315 TAQNNLATAQAALANAEARLAKAKE-ALANLNADLAKKQA 353


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 32.3 bits (74), Expect = 1.7
 Identities = 37/185 (20%), Positives = 73/185 (39%), Gaps = 15/185 (8%)

Query: 418 KEKIESLERNLQTE---TQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHS 474
           + +IE LE+  QT     +  R+L +++++ R+  E+  +       +      +     
Sbjct: 113 EREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKK 172

Query: 475 KRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAE 534
           K  +  + +  L     E    + K      ELR+       + + E+  E   ++D   
Sbjct: 173 KAREIHEKIQELANEAQEYHEEMIKLFEEADELRK-----EADELHEEFVELSKKIDE-- 225

Query: 535 LTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLK--AELTQ 592
             LH EF+  +  + E   K++K +  K  A K     +   E  E   +K K   +LT 
Sbjct: 226 --LHEEFRNLQNELREL-EKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEKLTT 282

Query: 593 RDIKL 597
            ++ L
Sbjct: 283 EELLL 287


>gnl|CDD|224000 COG1074, RecB, ATP-dependent exoDNAse (exonuclease V) beta subunit
           (contains helicase and exonuclease domains) [DNA
           replication, recombination, and repair].
          Length = 1139

 Score = 32.8 bits (75), Expect = 1.7
 Identities = 15/69 (21%), Positives = 26/69 (37%), Gaps = 6/69 (8%)

Query: 507 LREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGAT 566
           L      Q        L+E  T+  S E  L R  ++    I  N  +Q +++       
Sbjct: 691 LTNGLHAQEL------LQELLTRALSYEHGLLRWLERFIALIENNDEEQARRLESDRDLV 744

Query: 567 KLMLDFKAK 575
           ++M   K+K
Sbjct: 745 RIMTIHKSK 753


>gnl|CDD|180530 PRK06328, PRK06328, type III secretion system protein; Validated.
          Length = 223

 Score = 31.7 bits (72), Expect = 1.8
 Identities = 20/82 (24%), Positives = 36/82 (43%)

Query: 81  IKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEN 140
              E  +  +   N   KE  K +  QL   E    +  +Q ++ +V  A   +K+I   
Sbjct: 49  THEECEKLREEAKNQGFKEGSKAWSKQLAFLEEETQKLREQVKEALVPLAIASVKKIIGK 108

Query: 141 EFKQKEKTIVENAAKQLKQISE 162
           E +   +TIV   A  LK++++
Sbjct: 109 ELELHPETIVSIIANSLKELTQ 130


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 31.0 bits (71), Expect = 1.8
 Identities = 17/106 (16%), Positives = 44/106 (41%), Gaps = 12/106 (11%)

Query: 39  IQTLLAKSA---SIKTELREFYQTQLNNNQKAQDEIQTLLAK--------SASIKTELRE 87
           +Q +L +S    + + +L + ++ +    +K + E+Q L  K        S + + +  +
Sbjct: 6   VQKILQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEK 65

Query: 88  FYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQ 133
             Q +     + K ++ Q  L   +    ++   K    ++  AK+
Sbjct: 66  ELQ-KKVQEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEVAKK 110



 Score = 29.1 bits (66), Expect = 7.6
 Identities = 21/85 (24%), Positives = 45/85 (52%), Gaps = 8/85 (9%)

Query: 486 LQKAQDEIQTLLAK----SASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREF 541
           L+K + E+Q L  K    +A++    RE  + +L     +K++EFQ +    +  L +  
Sbjct: 34  LEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQ----KKVQEFQRKQQKLQQDLQKRQ 89

Query: 542 QQKEKTIVENAAKQLKQISEKPGAT 566
           Q++ + I++   K +K++++K G  
Sbjct: 90  QEELQKILDKINKAIKEVAKKKGYD 114


>gnl|CDD|239165 cd02764, MopB_PHLH, The MopB_PHLH CD includes a group of related
            uncharacterized putative hydrogenase-like homologs (PHLH)
            of molybdopterin binding (MopB) proteins. This CD is of
            the PHLH region homologous to the catalytic
            molybdopterin-binding subunit of MopB homologs.
          Length = 524

 Score = 32.5 bits (74), Expect = 2.0
 Identities = 39/217 (17%), Positives = 67/217 (30%), Gaps = 26/217 (11%)

Query: 1048 AEEADLILLI--GTNPRFEAPLFNARIRKGYLTNELDVAYIGPKVDLRYDYEHLGESADL 1105
               AD+ L I       F   L +  I+KG  ++  D           +   +L      
Sbjct: 247  GANADVRLAIRPSQEKAFALGLAHKLIKKGAGSSLPD----------FFRALNLAFKPAK 296

Query: 1106 IKQLASGSHPF----SKKLSAAKKPLIVVGADISDGAAVLALVQQLAAKVTCESDVPCDW 1161
            + +L           +K L+AA K L+V G+++S  A     V   A      +D     
Sbjct: 297  VAELTVDLDKALAALAKALAAAGKSLVVAGSELSQTAGADTQVAVNALNSLLGNDG---- 352

Query: 1162 KVLNILQKAASQVAALDIGYKPGTSAIREKPPKVLFLLGAD------EGSISRDDVGKDC 1215
            K ++  +             K   S I       L +   +      +G      + K  
Sbjct: 353  KTVDHARPIKGGELGNQQDLKALASRINAGKVSALLVYDVNPVYDLPQGLGFAKALEKVP 412

Query: 1216 FIIYQGHHGDHGASIADAILPGAAYTEKQSTYVNTEG 1252
              +  G   D  A + D + P +   E        +G
Sbjct: 413  LSVSFGDRLDETAMLCDWVAPMSHGLESWGDAETPDG 449


>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO.  This family
           contains the bacterial type III secretion protein YscO,
           which is approximately 150 residues long. YscO has been
           shown to be required for high-level expression and
           secretion of the anti-host proteins V antigen and Yops
           in Yersinia pestis.
          Length = 152

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 32/153 (20%), Positives = 57/153 (37%), Gaps = 21/153 (13%)

Query: 3   AQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLN 62
           A+  ++         +   RE  Q+  +  Q  Q E   L A+       L+E       
Sbjct: 16  AEKAVKRQEQALVQARARHREAQQSLQDYRQWRQQEENRLFAELQGATVLLKEL------ 69

Query: 63  NNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQK 122
             +K Q ++  L    AS++ +L E          K++L+  + +L  A   L    + +
Sbjct: 70  --EKWQQQVGLLRENEASLEQQLAE---------AKQRLEAERQRLRQARQQLQEARKAQ 118

Query: 123 EKTI----VENAAKQLKQISENEFKQKEKTIVE 151
           EK       E A  Q ++    E +Q+E   V 
Sbjct: 119 EKFAELARQEQAEAQAQRQYLEELEQEEFRTVS 151


>gnl|CDD|131471 TIGR02418, acolac_catab, acetolactate synthase, catabolic.
            Acetolactate synthase (EC 2.2.1.6) combines two molecules
            of pyruvate to yield 2-acetolactate with the release of
            CO2. This reaction may be involved in either valine
            biosynthesis (biosynthetic) or conversion of pyruvate to
            acetoin and possibly to 2,3-butanediol (catabolic). The
            biosynthetic type, described by TIGR00118, is also
            capable of forming acetohydroxybutyrate from pyruvate and
            2-oxobutyrate for isoleucine biosynthesis. The family
            described here, part of the same larger family of
            thiamine pyrophosphate-dependent enzymes (pfam00205,
            pfam02776) is the catabolic form, generally found
            associated with in species with acetolactate
            decarboxylase and usually found in the same operon. The
            model may not encompass all catabolic acetolactate
            synthases, but rather one particular clade in the larger
            TPP-dependent enzyme family [Energy metabolism,
            Fermentation].
          Length = 539

 Score = 32.0 bits (73), Expect = 2.3
 Identities = 45/212 (21%), Positives = 86/212 (40%), Gaps = 40/212 (18%)

Query: 977  EDALIAVAQKLQTSE----VAGVVGSLADAEAMVALKDLLNKLGSEDLYTEYAF--PLEG 1030
            +DA+  VA+ +Q ++    + G+  S    E   A++ LL K       T+       +G
Sbjct: 182  DDAIDEVAEAIQNAKLPVLLLGLRAS--SPETTEAVRRLLKK-------TQLPVVETFQG 232

Query: 1031 AGTDLRANY--------LLNNKIAGA--EEADLILLIGTNP-RFEAPLFNARIRKGYLTN 1079
            AG   R           L  N+      ++ADL++ IG +P  +E   +N+      +  
Sbjct: 233  AGAVSRELEDHFFGRVGLFRNQPGDRLLKQADLVITIGYDPIEYEPRNWNSENDATIVHI 292

Query: 1080 ELDVAYIG----PKVDLRYDYEHLGESADLIKQ------LASGSHPFSKKLSAAKKPLIV 1129
            +++ A I     P ++L  D   +  + DL+ +      L   +    + L   ++ L  
Sbjct: 293  DVEPAQIDNNYQPDLELVGD---IASTLDLLAERIPGYELPPDALAILEDLKQQREALDR 349

Query: 1130 VGADI-SDGAAVLALVQQLAAKVTCESDVPCD 1160
            V A +       L +++ + A VT +  V  D
Sbjct: 350  VPATLKQAHLHPLEIIKAMQAIVTDDVTVTVD 381


>gnl|CDD|218791 pfam05879, RHD3, Root hair defective 3 GTP-binding protein (RHD3). 
           This family consists of several eukaryotic root hair
           defective 3 like GTP-binding proteins. It has been
           speculated that the RHD3 protein is a member of a novel
           class of GTP-binding proteins that is widespread in
           eukaryotes and required for regulated cell enlargement.
           The family also contains the homologous yeast synthetic
           enhancement of YOP1 (SEY1) protein which is involved in
           membrane trafficking.
          Length = 743

 Score = 32.0 bits (73), Expect = 2.3
 Identities = 38/216 (17%), Positives = 74/216 (34%), Gaps = 33/216 (15%)

Query: 416 ISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK 475
           I+ E  E          ++ R+L+E  +        +  K  SI       E+  S +  
Sbjct: 259 IANEVFEEFITAYDPLEEKWRELKEAGE--GGLLGGLGAKLASI------REKCLSEYDT 310

Query: 476 RSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAEL 535
            +   +  V  +K Q E++        + + L+  +Q QL    K  L+ F+  + SA  
Sbjct: 311 EASRYNKGVYQEKRQ-ELEE------KLDSHLKPTFQAQLGALHKGLLESFKEAVSSA-- 361

Query: 536 TLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDI 595
                           A +   +  +  G TK +  F  +CE    E          R  
Sbjct: 362 --------------LKAGEGFAESVKD-GKTKCVEKFDEECESLAIEQATWSNSDKIRLK 406

Query: 596 KLKE-TNHLLKEESKKRELMAKKVESVMQSQLQQAI 630
             K+   H  +   ++ + +  + E  + S L + +
Sbjct: 407 LEKDIDAHSARLRKEELKELTNRYEKKLVSALSEPV 442


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 32.1 bits (74), Expect = 2.6
 Identities = 30/145 (20%), Positives = 64/145 (44%), Gaps = 17/145 (11%)

Query: 90  QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
           + +     KE L E + ++        +E +++   + +   K+L Q  EN  ++ E   
Sbjct: 48  KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL-QKLEKRLLQKEENLDRKLEL-- 104

Query: 150 VENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEES 209
           +E   ++L++  ++    +   + + K E  E    +L  E  Q   +L+  + L  EE+
Sbjct: 105 LEKREEELEKKEKE--LEQKQQELEKKEEELE----ELIEEQLQ---ELERISGLTAEEA 155

Query: 210 KKRELMAKKVESVMQSQLQQAIHMI 234
           K+  L  +KVE   +    +A  +I
Sbjct: 156 KEILL--EKVEEEAR---HEAAVLI 175



 Score = 31.7 bits (73), Expect = 3.1
 Identities = 27/143 (18%), Positives = 63/143 (44%), Gaps = 13/143 (9%)

Query: 97  VKEKLKEFQTQLDSAELTLHREFQQKEKTI-VENAAKQLKQISENEFKQKEKTIVENAAK 155
           +KE  +E +  L+ A+     E  +KE  +  +    +L+   E E +++   +     K
Sbjct: 33  IKEAEEEAKRILEEAKKEA--EAIKKEALLEAKEEIHKLRNEFEKELRERRNEL----QK 86

Query: 156 QLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELM 215
             K++ +K    +  LD K   E+ E   ++L+ +  + + K +E     +E  +  E  
Sbjct: 87  LEKRLLQK----EENLDRKL--ELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQ 140

Query: 216 AKKVESVMQSQLQQAIHMITSDV 238
            +++E +     ++A  ++   V
Sbjct: 141 LQELERISGLTAEEAKEILLEKV 163



 Score = 30.9 bits (71), Expect = 5.6
 Identities = 31/146 (21%), Positives = 65/146 (44%), Gaps = 29/146 (19%)

Query: 414 LKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
           L+  + +++ LE+ L    Q+   L+ +L+   +  EE+ +KE         LE+ +   
Sbjct: 77  LRERRNELQKLEKRLL---QKEENLDRKLELLEKREEELEKKEKE-------LEQKQQEL 126

Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSA 533
            K+ +  + L+     ++++Q L   S     E +E     L   V+E+ +      ++A
Sbjct: 127 EKKEEELEELI-----EEQLQELERISGLTAEEAKE----ILLEKVEEEARH-----EAA 172

Query: 534 ELTLHREFQQKEKTIVENAAKQLKQI 559
            L      ++ E+   E A K+ K+I
Sbjct: 173 VL-----IKEIEEEAKEEADKKAKEI 193



 Score = 30.1 bits (69), Expect = 9.7
 Identities = 9/48 (18%), Positives = 24/48 (50%)

Query: 410 RSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEV 457
           +   L    E +E  E  L+ + +   + ++EL+   +  EE++ +++
Sbjct: 94  KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQL 141


>gnl|CDD|219397 pfam07393, Sec10, Exocyst complex component Sec10.  This family
           contains the Sec10 component (approximately 650 residues
           long) of the eukaryotic exocyst complex, which
           specifically affects the synthesis and delivery of
           secretory and basolateral plasma membrane proteins.
          Length = 685

 Score = 31.9 bits (73), Expect = 2.6
 Identities = 32/150 (21%), Positives = 53/150 (35%), Gaps = 30/150 (20%)

Query: 408 TVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVT------ 461
             R    ++S    + +   L    +    +EE LQ        VV++E +I+       
Sbjct: 159 IDRDNVDEVSGLDNDEIWEKLADPDEHPSIVEESLQALFSEVVVVVKEEAAIIKRVFPNP 218

Query: 462 ---VTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTE--LREFYQT--Q 514
              +  F++R               V  Q  Q  ++ LL K+ SI     LR  +    Q
Sbjct: 219 EVVMQKFIQR---------------VFAQVIQQRLEALLDKAKSISQLAYLRSLHSLYSQ 263

Query: 515 LNNNVKEKLKEF-QTQLDSAELTLHREFQQ 543
               VK+ LKEF  T+       L +  + 
Sbjct: 264 TLKLVKD-LKEFLSTENPDLSAFLDQLTED 292


>gnl|CDD|184804 PRK14720, PRK14720, transcript cleavage factor/unknown domain
           fusion protein; Provisional.
          Length = 906

 Score = 32.0 bits (73), Expect = 2.7
 Identities = 28/101 (27%), Positives = 40/101 (39%), Gaps = 15/101 (14%)

Query: 531 DSAELTLHREFQQKEK-TIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAE 589
           DS  L L   F Q  K  IVE+   ++    E     KL L   A+      ENKKLK  
Sbjct: 83  DSNLLNLIDSFSQNLKWAIVEHICDKILLYGEN----KLALRTLAEAYAKLNENKKLKGV 138

Query: 590 LTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAI 630
             +          L+K +    E++ K   S  +   ++AI
Sbjct: 139 WER----------LVKADRDNPEIVKKLATSYEEEDKEKAI 169



 Score = 31.3 bits (71), Expect = 5.2
 Identities = 24/91 (26%), Positives = 36/91 (39%), Gaps = 15/91 (16%)

Query: 142 FKQKEK-TIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKE 200
           F Q  K  IVE+   ++    E     KL L   A+      ENKKLK    +       
Sbjct: 93  FSQNLKWAIVEHICDKILLYGEN----KLALRTLAEAYAKLNENKKLKGVWER------- 141

Query: 201 TNHLLKEESKKRELMAKKVESVMQSQLQQAI 231
              L+K +    E++ K   S  +   ++AI
Sbjct: 142 ---LVKADRDNPEIVKKLATSYEEEDKEKAI 169


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 31.9 bits (72), Expect = 2.7
 Identities = 34/233 (14%), Positives = 77/233 (33%), Gaps = 37/233 (15%)

Query: 419 EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSK 478
              E +   L +E ++   L+E++Q+ +Q+                            ++
Sbjct: 542 TSEEDVYHQLTSERKQRASLKEQMQEIQQS--------------------FSILTQCDNR 581

Query: 479 NADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQL--------NNNVKEKLKEFQTQL 530
           + + +  LQ     +Q L  K +  +  L       L          +V+  L++   +L
Sbjct: 582 SKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQEL 641

Query: 531 DSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAEL 590
                 LH       +  V   A  ++ + ++  A++ +   K + E  +      K  L
Sbjct: 642 ALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQL--TYWKEML 699

Query: 591 TQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHVL 643
            Q    L+E    ++E  ++   + +   S + S L         +      L
Sbjct: 700 AQCQTLLRELETHIEEYDREFNEI-ENASSSLGSDLAA------REDALNQSL 745



 Score = 30.7 bits (69), Expect = 7.8
 Identities = 34/222 (15%), Positives = 73/222 (32%), Gaps = 28/222 (12%)

Query: 419 EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSKRSK 478
              ++     Q   +  + L E LQ T+Q+                +L + R +  ++ K
Sbjct: 212 CMPDTYHERKQVLEKELKHLREALQQTQQSHA--------------YLTQKREAQEEQLK 257

Query: 479 NADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQ---TQLDSAEL 535
               L +L+   +E++   A     +  +    +          L       TQ++    
Sbjct: 258 KQQLLKQLRARIEELRAQEAVLEETQERINRARK-------AAPLAAHIKAVTQIEQQAQ 310

Query: 536 TLHREFQQKEK---TIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQ 592
            +H E Q K +    ++   A  +KQ S      +L+    ++       ++   +    
Sbjct: 311 RIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREI 370

Query: 593 RDIKLKETNHLLKEESKKRELMAK-KVESVMQSQLQQAIHMI 633
              +   T H+   + +K  L  K +        LQ+    I
Sbjct: 371 SCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATI 412


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 32.1 bits (73), Expect = 2.8
 Identities = 28/175 (16%), Positives = 69/175 (39%), Gaps = 24/175 (13%)

Query: 69  DEIQTLLAKSASIKTELREFY---------QTQLNNNVKEKLKEFQTQLDSAELTLHREF 119
           +EI  L+ K ++++ EL +           Q +L   ++++++E + +  + +L L +E 
Sbjct: 511 EEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEA 570

Query: 120 QQKEKTIVENAAKQLKQISENE------------FKQKEKTIVENAAKQLKQISEKPGAT 167
           Q+  K + +     ++++ E +              + ++T  +   K     ++K G  
Sbjct: 571 QEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDK 630

Query: 168 KLMLDFKAKCEVYE--AENKKLKAELTQR-DIKLKETNHLLKEESKKRELMAKKV 219
             +  F  K ++ +    NK        R  +   E   + K    K+  + K  
Sbjct: 631 VRIRYFGQKGKIVQILGGNKWNVTVGGMRMKVHGSELEKINKAPPPKKFKVPKTT 685



 Score = 31.7 bits (72), Expect = 3.7
 Identities = 24/141 (17%), Positives = 53/141 (37%), Gaps = 10/141 (7%)

Query: 487 QKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ---- 542
           ++ + E++ L  +  + K EL +  Q  L    KE ++    +L   ++   +E +    
Sbjct: 546 KELEQEMEELKERERNKKLELEKEAQEALKALKKE-VESIIRELKEKKIHKAKEIKSIED 604

Query: 543 --QKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYE--AENKKLKAELTQR-DIKL 597
             + ++T  +   K     ++K G    +  F  K ++ +    NK        R  +  
Sbjct: 605 LVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQKGKIVQILGGNKWNVTVGGMRMKVHG 664

Query: 598 KETNHLLKEESKKRELMAKKV 618
            E   + K    K+  + K  
Sbjct: 665 SELEKINKAPPPKKFKVPKTT 685


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 31.0 bits (70), Expect = 2.9
 Identities = 25/155 (16%), Positives = 60/155 (38%), Gaps = 11/155 (7%)

Query: 404 LYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVT 463
           +    +     +  K+    +         + R +EE  + T++  ++V +  + IV + 
Sbjct: 58  IVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEID 117

Query: 464 LFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKL 523
              E +        +    L+ L   Q+ ++ L  K    K  L E          +++L
Sbjct: 118 ELQEFIDELLKSDKEILPLLLYLG--QERLEELKFKRKEKKERLEEL---------EKEL 166

Query: 524 KEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQ 558
           +E + + D  E  L  + ++KE   ++   ++L +
Sbjct: 167 EELEDEKDLLEKLLEEKEKKKELEELKEELEELLE 201



 Score = 29.8 bits (67), Expect = 7.3
 Identities = 25/130 (19%), Positives = 52/130 (40%), Gaps = 10/130 (7%)

Query: 30  NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFY 89
           NN  K +  +     +    K +  +    ++    + Q+ I  LL KS      L  + 
Sbjct: 82  NNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELL-KSDKEILPLLLYL 140

Query: 90  QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
             +    +K K KE + +L+  E  L     +K+          L+++ E + K+KE   
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDL---------LEKLLEEKEKKKELEE 191

Query: 150 VENAAKQLKQ 159
           ++   ++L +
Sbjct: 192 LKEELEELLE 201


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 31.8 bits (73), Expect = 3.0
 Identities = 25/143 (17%), Positives = 60/143 (41%), Gaps = 25/143 (17%)

Query: 115 LHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFK 174
           L +   +K+    E  AK++ + ++ E +  +K  +  A +++ ++  +           
Sbjct: 18  LRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAE----------- 66

Query: 175 AKCEVYEAENKKLKAELTQRDIKLKETNHLLKEES--KKRELMAKKVESVMQSQLQQAIH 232
                 E E K+ + EL + + +L +     +EE+  +K E + KK E++ +   ++ + 
Sbjct: 67  -----LERELKERRNELQRLERRLLQ-----REETLDRKMESLDKKEENLEKK--EKELS 114

Query: 233 MITSDVTHLHTSKESLATRQPSE 255
               ++       E L   Q  E
Sbjct: 115 NKEKNLDEKEEELEELIAEQREE 137


>gnl|CDD|236272 PRK08475, PRK08475, F0F1 ATP synthase subunit B; Validated.
          Length = 167

 Score = 30.8 bits (70), Expect = 3.0
 Identities = 26/112 (23%), Positives = 53/112 (47%), Gaps = 7/112 (6%)

Query: 49  IKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTE-LREFYQTQLNNNVKEKLKEFQTQ 107
               L+ FY++++N   K  +EIQ  L +S   K + L++  +       KEK +     
Sbjct: 43  AAKPLKNFYKSRINKISKRLEEIQEKLKESKEKKEDALKKLEE------AKEKAELIVET 96

Query: 108 LDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQ 159
                  L ++ +++ K  +EN  K  +++ E E ++ E+ +VE    +L +
Sbjct: 97  AKKEAYILTQKIEKQTKDDIENLIKSFEELMEFEVRKMEREVVEEVLNELFE 148


>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
          Length = 568

 Score = 31.6 bits (72), Expect = 3.0
 Identities = 16/88 (18%), Positives = 26/88 (29%)

Query: 416 ISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK 475
              +K   + +  +     S  L    ++T    E     ++     T    R   SHSK
Sbjct: 449 ELSKKKTKIHKLNKLRHDLSSLLSSIPEETSDRVESEKASKLRSSASTKLNLRSSRSHSK 508

Query: 476 RSKNADSLVRLQKAQDEIQTLLAKSASI 503
              + +      K     Q  LA S   
Sbjct: 509 FRDHLNGSNSSTKELSLNQVDLAGSERK 536



 Score = 30.5 bits (69), Expect = 7.0
 Identities = 26/195 (13%), Positives = 71/195 (36%), Gaps = 12/195 (6%)

Query: 76  AKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLK 135
           +++ SIK +++    +  +  ++E   +        E+ + RE  Q  ++ +      ++
Sbjct: 331 SRAKSIKNKIQVNSSSDSSREIEEIKFDLSEDRSEIEILVFREQSQLSQSSLSGIFAYMQ 390

Query: 136 QISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRD 195
            + +     K +         L   S   G  +     K +   Y++  + L+ E+ +  
Sbjct: 391 SLKKETETLKSRI-------DLIMKSIISGTFERKKLLKEEGWKYKSTLQFLRIEIDRLL 443

Query: 196 ----IKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVTHLHTSKESLATR 251
                +L +    + + +K R  ++  + S+ +    +      S +    ++K +L   
Sbjct: 444 LLREEELSKKKTKIHKLNKLRHDLSSLLSSIPEETSDRVESEKASKLRSSASTKLNLR-S 502

Query: 252 QPSEQSLIDSTTDTN 266
             S     D    +N
Sbjct: 503 SRSHSKFRDHLNGSN 517


>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 535

 Score = 31.5 bits (72), Expect = 3.0
 Identities = 22/127 (17%), Positives = 46/127 (36%), Gaps = 17/127 (13%)

Query: 108 LDSAELTLHREFQQKEKTIVENAAK--------------QLKQISENEFKQKEKTIVENA 153
            DS  LT     Q ++K ++    K              ++K     +  +     ++  
Sbjct: 410 FDSNTLTWESNAQGEDKELLRKGFKIIKEIVDEVFGKGAKIKIQKALKSAENPLQSLKEF 469

Query: 154 AKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRE 213
               ++      +T  ML+ +AK +  E +  +LK     ++  +   +  L EE+ K  
Sbjct: 470 KPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQLKEATELQEFMINNED--LIEEA-KEL 526

Query: 214 LMAKKVE 220
              K +E
Sbjct: 527 FGIKSIE 533


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 31.7 bits (72), Expect = 3.1
 Identities = 49/216 (22%), Positives = 96/216 (44%), Gaps = 43/216 (19%)

Query: 418 KEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHSK-- 475
            E+I  L++ L+   +R+ +LE +L+   Q   E+ ++      V    + L+   SK  
Sbjct: 480 NERITELQQLLEDANRRNNELETQLRLANQRILELQQQ------VEDLQKALQEQGSKSE 533

Query: 476 -----RSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQL 530
                +SK  + L +L +A +E+Q         K E  E  +   + N+  K+ E +  L
Sbjct: 534 DSSLLKSKLEEHLEQLHEANEELQK--------KREQIEELEPDQDQNLSRKIAELEAAL 585

Query: 531 DSAELTLHREFQQKEKTIVENAAKQLKQISEK--PGATKLMLDFKAKCEVYEAENKKLKA 588
              +  + R  +++ K  VE A + +K +  K  P + ++ L               LK 
Sbjct: 586 QKKDEDM-RAMEERYKKYVEKAREVIKTLDPKQNPASPEIQL---------------LKK 629

Query: 589 ELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQS 624
           +LT+RD ++      L+ E +K + M ++ E ++ S
Sbjct: 630 QLTERDKRI----RHLESEYEKAKPMREQEEKLIVS 661


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 31.4 bits (71), Expect = 3.3
 Identities = 15/76 (19%), Positives = 33/76 (43%)

Query: 143 KQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETN 202
           K+ +  +V    K L+++ E     +   + K K E  + E ++L  EL + + + +E  
Sbjct: 110 KELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQ 169

Query: 203 HLLKEESKKRELMAKK 218
             LK    +   + + 
Sbjct: 170 ERLKRLEVENSRLEEM 185



 Score = 30.3 bits (68), Expect = 7.4
 Identities = 14/73 (19%), Positives = 31/73 (42%)

Query: 545 EKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLL 604
           +  +V    K L+++ E     +   + K K E  + E ++L  EL + + + +E    L
Sbjct: 113 KNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERL 172

Query: 605 KEESKKRELMAKK 617
           K    +   + + 
Sbjct: 173 KRLEVENSRLEEM 185


>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201).  This is
           a family of coiled-coil proteins from eukaryotes. The
           function is not known.
          Length = 177

 Score = 30.6 bits (70), Expect = 3.6
 Identities = 25/117 (21%), Positives = 50/117 (42%), Gaps = 13/117 (11%)

Query: 478 KNADSLVRLQ-KAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELT 536
           +N + L RL+ K  + +  L      +K E     + +L   +K++LKE + +L      
Sbjct: 61  RNKE-LKRLKLKYTNTVHIL----THLK-EKLHMLEEELER-LKKELKEKEERLAKLRKE 113

Query: 537 LHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDF---KAKCEVYEAENKKLKAEL 590
           L++   + E+  +    K+L+Q         L+ D+     K E      +KL+ + 
Sbjct: 114 LYKV--KLERDKLRKQNKKLRQQGGLLHVPALLYDYDKTVEKVEEKRKSVEKLRRKH 168


>gnl|CDD|181337 PRK08266, PRK08266, hypothetical protein; Provisional.
          Length = 542

 Score = 31.5 bits (72), Expect = 3.6
 Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 1118 KKLSAAKKPLIVVGADISD-GAAVLALVQQLAAKVT 1152
              ++AAK P+I VG   +  G  +  L + L A V 
Sbjct: 200  ALIAAAKNPMIFVGGGAAGAGEEIRELAEMLQAPVV 235


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 29.9 bits (68), Expect = 3.8
 Identities = 16/68 (23%), Positives = 35/68 (51%), Gaps = 4/68 (5%)

Query: 95  NNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENA---AKQLKQISENEFKQKEKTIVE 151
              +E+LK+    L  AE     + + +   I+ NA   A++LK+    E ++  + ++E
Sbjct: 40  KEAEERLKQAAALLAEAE-QQLAQARAEASEIINNAKKEAQKLKEEILAEAQKDAERLLE 98

Query: 152 NAAKQLKQ 159
           +A  +++Q
Sbjct: 99  SARAEIEQ 106


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 31.2 bits (72), Expect = 3.9
 Identities = 13/72 (18%), Positives = 35/72 (48%), Gaps = 12/72 (16%)

Query: 466 LERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKE 525
           LE L++  +  SK    + + ++  ++ + L+A+   +K E++          ++ +L E
Sbjct: 44  LEELQAERNALSK---EIGQAKRKGEDAEALIAEVKELKEEIKA---------LEAELDE 91

Query: 526 FQTQLDSAELTL 537
            + +L+   L +
Sbjct: 92  LEAELEELLLRI 103


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 31.3 bits (71), Expect = 4.0
 Identities = 17/92 (18%), Positives = 37/92 (40%), Gaps = 6/92 (6%)

Query: 2   KAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQL 61
           K++ + + LLA+      E R   Q      +     + ++  +  +I+ E  E Y+ + 
Sbjct: 327 KSKQKFEPLLAQYGESFQEWRSKLQAL---AKVESRSLLSVEERLPTIEKEDLELYEQRQ 383

Query: 62  NNNQKAQDEIQTLLAKSASIKTELREFYQTQL 93
              Q    E + L  +   ++ +  E YQ + 
Sbjct: 384 ---QLWFFENRKLWQRKKKLREQADEEYQQRH 412



 Score = 30.9 bits (70), Expect = 5.3
 Identities = 20/103 (19%), Positives = 42/103 (40%), Gaps = 16/103 (15%)

Query: 29  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREF 88
           L   +K++ + + LLA+      E R   Q      +     + ++  +  +I+ E  E 
Sbjct: 322 LEEVEKSKQKFEPLLAQYGESFQEWRSKLQAL---AKVESRSLLSVEERLPTIEKEDLEL 378

Query: 89  YQTQL------NNNVKEKLKEFQTQLDSAELTLHREFQQKEKT 125
           Y+ +       N  + ++ K+ + Q D        E+QQ+  T
Sbjct: 379 YEQRQQLWFFENRKLWQRKKKLREQAD-------EEYQQRHAT 414


>gnl|CDD|239166 cd02765, MopB_4, The MopB_4 CD includes a group of related
           uncharacterized bacterial and archaeal
           molybdopterin-binding oxidoreductase-like domains with a
           putative N-terminal iron-sulfur [4Fe-4S] cluster binding
           site and molybdopterin cofactor binding site. These
           members belong to the molybdopterin_binding (MopB)
           superfamily of proteins.
          Length = 567

 Score = 31.3 bits (71), Expect = 4.4
 Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 3/34 (8%)

Query: 958 RLLTPFVR---NCDGQLVATEWEDALIAVAQKLQ 988
           RL  P  R     +G+     W++AL  +A KL 
Sbjct: 55  RLKYPMKRVGERGEGKFERITWDEALDTIADKLT 88


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 31.1 bits (70), Expect = 4.7
 Identities = 38/201 (18%), Positives = 71/201 (35%), Gaps = 41/201 (20%)

Query: 51  TELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDS 110
           + L E Y+ +L+ N + +  ++ L       + E  E          +E+ K  + +LD+
Sbjct: 96  STLAENYERELDRNLELEVRLKALEELEKKAENEAAE---------AEEEAKLLKDKLDA 146

Query: 111 AELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLM 170
             L L  E + + K   E+ ++    +SE       +   +NA  +LK +          
Sbjct: 147 ESLKLQNEKEDQLKEAKESISRIKNDLSE------MQCRAQNADTELKLL---------- 190

Query: 171 LDFKAKCEVYEAENKKLKAELT-QRDIKLKE-TNHLLKEESKKRELMAKKVESVMQSQLQ 228
                         +    EL  Q +   KE      K +S   E  +    SV    L+
Sbjct: 191 --------------ESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLE 236

Query: 229 QAIHMITSDVTHLHTSKESLA 249
           + +     D   + + KE L 
Sbjct: 237 EELKRYEQDAEVVKSMKEQLL 257


>gnl|CDD|218956 pfam06248, Zw10, Centromere/kinetochore Zw10.  Zw10 and rough deal
           proteins are both required for correct metaphase
           check-pointing during mitosis. These proteins bind to
           the centromere/kinetochore.
          Length = 593

 Score = 31.2 bits (71), Expect = 4.7
 Identities = 72/377 (19%), Positives = 139/377 (36%), Gaps = 73/377 (19%)

Query: 39  IQTLLAKSASIKTELREF----YQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLN 94
           I  LL+K    K E+R+     Y   L N   A+D    LL + +S+  ++ +  +++++
Sbjct: 16  ISKLLSKVEEYKEEVRDMINKKYSDFLPNMSSAED----LLEQVSSLSEDISDLLKSEVS 71

Query: 95  NNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAA 154
           N + + L +        EL   +  Q +EK +     ++L +I E   ++  + +     
Sbjct: 72  NEILQDLNDAVE-----ELAELKR-QLEEKILGLKILQKLLKIHE-ALEETNEALSGKDY 124

Query: 155 KQLKQISEKPGATKLMLDFKAKCEVYEAEN--KKLKAELT-QRDI--------------- 196
            +   + EK    K +LD   K    E     K L+ E T Q                  
Sbjct: 125 LKAADLLEK---AKSLLD-GLKSRKGEELEIYKSLRLEYTVQSQNLLYHLGEEWDRLVQW 180

Query: 197 KLKETNHLL-KEESKKRELMAKKVESVMQ-SQLQQAIHMITSDVTHLHTSKESL------ 248
           KL  +  L   + +KK  L   K ES  + S + QA+ ++      L    + L      
Sbjct: 181 KLPSSKELGSAQNTKKTTLHLSKDESQGELSSVLQALAVLGELEYKLKKFCQFLLKNVFE 240

Query: 249 -ATRQPSEQSLIDSTTDTNCPVTQQYVPSSNQQYAPSSNQQYS--------IPTSNIN-- 297
               +P     ++   D    +T  Y   + +   P   + +S        +    ++  
Sbjct: 241 PLIVKPVLVVCVEDQKDGTVQITLSYESKTTKLLRPDPKKVFSKIRLVLKTLSKMLLSVS 300

Query: 298 -QQYSMFSADSTHM----------NIVERDIPNSADDMRKYIEM--QYSMFSADSTHMNI 344
               S+F      +          + + + IP ++ ++++Y E+      F A    MN 
Sbjct: 301 VDDCSLFEILGDLIWERLSELLIKDCLVKSIPETSSELQEYSEVIEDTEEFEAALKEMNF 360

Query: 345 VERDIPNSADDMRKYIE 361
           ++ D    A D+ KY  
Sbjct: 361 LDGD----ATDLLKYAR 373


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 31.0 bits (71), Expect = 4.9
 Identities = 31/148 (20%), Positives = 71/148 (47%), Gaps = 10/148 (6%)

Query: 419 EKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRK-EVSIVTVTLFLERLRSSHSKRS 477
           E++ +    L+ +  R  ++EE L   +       RK  V+I  +  +L++++   ++  
Sbjct: 283 EELRAYLDELEFDPNRLEEVEERLFALKSLA----RKYGVTIEDLLEYLDKIKEELAQLD 338

Query: 478 KNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTL 537
            + +SL  L+K   +++  L ++A   + +R+    +L   V  +LK     ++ A  T+
Sbjct: 339 NSEESLEALEKEVKKLKAELLEAAEALSAIRKKAAKELEKEVTAELKAL--AMEKARFTV 396

Query: 538 HREFQQKEKTIVENAAKQLK-QISEKPG 564
             E +  E++   + A +++  IS  PG
Sbjct: 397 --ELKPLEESPTADGADKVEFLISTNPG 422


>gnl|CDD|153322 cd07638, BAR_ACAP2, The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP
           with Coiled-coil, ANK repeat and PH domain containing
           protein 2.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions. ACAP2 (ArfGAP with
           Coiled-coil, ANK repeat and PH domain containing protein
           2), also called centaurin beta-2, is an Arf6-specific
           GTPase activating protein (GAP) which mediates Arf6
           signaling. Arf6 is involved in the regulation of
           endocytosis, phagocytosis, cell adhesion and migration.
           ACAP2 contains an N-terminal BAR domain, followed by a
           Pleckstrin homology (PH) domain, an Arf GAP domain, and
           C-terminal ankyrin (ANK) repeats. BAR domains form
           dimers that bind to membranes, induce membrane bending
           and curvature, and may also be involved in
           protein-protein interactions.
          Length = 200

 Score = 30.4 bits (68), Expect = 5.0
 Identities = 26/105 (24%), Positives = 48/105 (45%), Gaps = 4/105 (3%)

Query: 465 FLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLK 524
           F+  +R      SK+A     L K  D +Q ++     +  + +   + QL   VKE L+
Sbjct: 42  FMNGIRDLAQYSSKDAVIETSLTKFSDTLQEMINYHTILFDQAQRSIKAQLQTFVKEDLR 101

Query: 525 EFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKLM 569
           +F+     A+    +  ++KE  +V+NA  Q  +  E   AT ++
Sbjct: 102 KFK----DAKKQFDKVSEEKENALVKNAQVQRNKQHEVEEATNIL 142


>gnl|CDD|132072 TIGR03027, pepcterm_export, putative polysaccharide export protein,
           PEP-CTERM sytem-associated.  This protein family belongs
           to the larger set of polysaccharide biosynthesis/export
           proteins described by pfam02563. Members of this family
           are variable in either containing of lacking a
           78-residue insert, but appear to fall within a single
           clade, nevertheless, where the regions in which the gene
           is found encode components of the PEP-CTERM/EpsH
           proposed exosortase protein sorting system [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 165

 Score = 30.0 bits (68), Expect = 5.0
 Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 7/54 (12%)

Query: 415 KISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLER 468
           KI+   +  L  + +T TQ +R +EE+L        + VR  V  V VT F+  
Sbjct: 30  KITTPLVGDLVASGKTPTQLARDIEEKLA-------KYVRNPVVTVIVTGFIGP 76


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 31.1 bits (70), Expect = 5.0
 Identities = 56/216 (25%), Positives = 94/216 (43%), Gaps = 14/216 (6%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQ 60
           ++A+ E++ L       K    E         +  Q EI  ++A  AS+  EL E     
Sbjct: 212 KEAEKELERLKQDLDPEKD--LEKLAEASAELESLQKEIS-IMASVASVLKELEEAKA-- 266

Query: 61  LNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
             N +KA +E ++L     S+K EL E  + +    ++EK  E +    S E  L+R   
Sbjct: 267 --NLEKAAEEEKSLRNLVESLKQELEE--EKKELEELREKEGEAEEAASSLEAELNRTKS 322

Query: 121 QKEKTIVENAA---KQLKQIS-ENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAK 176
           +KE    E       +L+Q S E E  +KE        ++LK+ +E+  A     + + +
Sbjct: 323 EKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAELRLE 382

Query: 177 CEVYEAENKKLKAELTQRDIK-LKETNHLLKEESKK 211
             + EAE  K    L   +IK L+E+    K +S +
Sbjct: 383 AALKEAEAAKAAEALALAEIKALQESEESAKADSPR 418


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 31.0 bits (70), Expect = 5.1
 Identities = 29/116 (25%), Positives = 60/116 (51%), Gaps = 18/116 (15%)

Query: 121 QKEKTIVENAAKQ---LKQISE--NEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKA 175
           ++E  + E+A+K+   L++I E   E KQ     + N   + +++S       L  + K 
Sbjct: 17  REESLLQESASKEAYYLQRILELQAELKQLRAE-LSNVQAENERLS------SLSQELKE 69

Query: 176 KCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRE---LMAKKVESVMQSQLQ 228
           + E+ E +  +L+ E+  ++ K +E   LL++ S+  E    + K+V  + QSQ++
Sbjct: 70  ENEMLELQRGRLRDEI--KEYKFREA-RLLQDYSELEEENISLQKQVSVLRQSQVE 122


>gnl|CDD|234956 PRK01490, tig, trigger factor; Provisional.
          Length = 435

 Score = 30.9 bits (71), Expect = 5.2
 Identities = 31/179 (17%), Positives = 68/179 (37%), Gaps = 35/179 (19%)

Query: 75  LAKSASIK---TELREFYQTQLNNNVKEKLK-EFQTQL-----DSAELTL-----HREFQ 120
            AK         EL+   +  L    KE  + + +  +     ++AE+ L      +E  
Sbjct: 252 FAKKLGEFETLEELKADIRKNLEREKKEAQRAKVKEAVLDALVENAEIDLPEALVEQEID 311

Query: 121 QKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVY 180
           +    ++  A +Q     E +F +   T  E   ++ ++ +E+     L+LD     E+ 
Sbjct: 312 R----LLRQALQQGLD-LEGQFLEDTGTTEEEPREEFREQAERRVKLGLLLD-----EIA 361

Query: 181 EAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQAIHMITSDVT 239
           +AE  ++  E         E    ++E + +     + +E     +  Q +  + +DV 
Sbjct: 362 KAEEIEVSDE---------EVKAEIEEMASQYGQPPEVIE--FYLKNPQLLAALRADVL 409


>gnl|CDD|217899 pfam04108, APG17, Autophagy protein Apg17.  Apg17 is required for
           activating Apg1 protein kinases.
          Length = 408

 Score = 30.8 bits (70), Expect = 5.3
 Identities = 27/145 (18%), Positives = 61/145 (42%), Gaps = 13/145 (8%)

Query: 16  SIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLL 75
            +  EL        NN ++ +  + + ++K   + +  +E     L   +K ++ +   L
Sbjct: 251 DVVKELHTVIDEIENNEKRVKKFLSSHMSKIEELHSATKE----LLEELEKYKERLPRYL 306

Query: 76  AKSASIKTELREFYQT--QLNNNVKEKLKEFQTQL----DSAELTLHRE--FQQKEKTIV 127
           A  A I+    +F +   Q    + E L EF            L + R    ++K + I+
Sbjct: 307 AIFADIRALWEDFKEPIQQYIQELSE-LCEFYDNFLNSYKGLLLEVERRRAVRKKMEKIL 365

Query: 128 ENAAKQLKQISENEFKQKEKTIVEN 152
           +   +QL ++ E + K++++ + E 
Sbjct: 366 KFCNEQLDKLQEEDQKRRQQFLAEY 390


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 30.5 bits (69), Expect = 5.6
 Identities = 37/142 (26%), Positives = 50/142 (35%), Gaps = 12/142 (8%)

Query: 77  KSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQ 136
           KSA    E R+  + Q    +++K    Q +L   +L   R   Q++K   E AAKQ   
Sbjct: 72  KSAKRAEEQRKKKEQQQAEELQQKQAAEQERL--KQLEKERLAAQEQKKQAEEAAKQA-- 127

Query: 137 ISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDI 196
                   K+K   E AAK       K  A        AK     AE KK       +  
Sbjct: 128 ------ALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAK--KAAAEAKKKAEAEAAKKA 179

Query: 197 KLKETNHLLKEESKKRELMAKK 218
             +       E + K    AKK
Sbjct: 180 AAEAKKKAEAEAAAKAAAEAKK 201


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 30.4 bits (69), Expect = 5.9
 Identities = 24/109 (22%), Positives = 47/109 (43%), Gaps = 10/109 (9%)

Query: 415 KISKEK-IESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSH 473
            +  E+ + +L   +Q   +R   LE+EL +     EE+ + E  I  +   LERL  + 
Sbjct: 84  AVKDERELRALNIEIQIAKERINSLEDELAELM---EEIEKLEKEIEDLKERLERLEKNL 140

Query: 474 SKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELRE----FYQTQLNNN 518
           ++     +    + + ++E Q L +K   +K +L       Y+    N 
Sbjct: 141 AEAEARLEE--EVAEIREEGQELSSKREELKEKLDPELLSEYERIRKNK 187



 Score = 30.0 bits (68), Expect = 6.9
 Identities = 38/192 (19%), Positives = 84/192 (43%), Gaps = 20/192 (10%)

Query: 30  NNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFY 89
           NNN K+   IQ L  +   ++  ++E  +  L   +KA+ E++ L     +++ EL +  
Sbjct: 3   NNNLKSLLAIQKLDLEKDRLEPRIKE-IRKAL---KKAKAELEALNKALEALEIELED-L 57

Query: 90  QTQLNNNVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
           + Q++    E ++E + ++  AE  L     ++E      A     QI++      E  +
Sbjct: 58  ENQVSQLESE-IQEIRERIKRAEEKLSAVKDEREL----RALNIEIQIAKERINSLEDEL 112

Query: 150 VENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEES 209
            E   ++++++ ++          +   E  E   K L     + + ++ E     +E S
Sbjct: 113 AE-LMEEIEKLEKE---------IEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELS 162

Query: 210 KKRELMAKKVES 221
            KRE + +K++ 
Sbjct: 163 SKREELKEKLDP 174


>gnl|CDD|237025 PRK11929, PRK11929, putative bifunctional
            UDP-N-acetylmuramoylalanyl-D-glutamate--2,
            6-diaminopimelate
            ligase/UDP-N-acetylmuramoyl-tripeptide:D-alanyl-D-alanine
            ligase; Provisional.
          Length = 958

 Score = 30.8 bits (70), Expect = 6.1
 Identities = 14/41 (34%), Positives = 18/41 (43%)

Query: 1199 LGADEGSISRDDVGKDCFIIYQGHHGDHGASIADAILPGAA 1239
              AD    SR+    D F+  +G   D  A I  A+  GAA
Sbjct: 28   ATADLRLDSREVQPGDLFVACRGAASDGRAFIDQALARGAA 68


>gnl|CDD|233326 TIGR01239, galT_2, galactose-1-phosphate uridylyltransferase,
           family 2.  This enzyme is involved in glucose and
           galactose interconversion. This model describes one of
           two extremely distantly related branches of the model
           pfam01087 from PFAM [Energy metabolism, Sugars].
          Length = 489

 Score = 30.6 bits (69), Expect = 6.2
 Identities = 15/78 (19%), Positives = 28/78 (35%), Gaps = 8/78 (10%)

Query: 85  LREFYQTQLNNNVKEKLKEFQTQLDSAEL--------TLHREFQQKEKTIVENAAKQLKQ 136
           L +  Q    N   E     +  L++  +         ++R+F +   +  E A K   Q
Sbjct: 54  LDQLLQWAAENGRMEADTTQRDILEAKLMDLITPRPSEVNRKFWETYASYPEQATKYFYQ 113

Query: 137 ISENEFKQKEKTIVENAA 154
            S+  +  K   I +N  
Sbjct: 114 YSKRVYYVKTDAIAKNIH 131


>gnl|CDD|238569 cd01149, HutB, Hemin binding protein HutB.  These proteins have been
            shown to function as initial receptors in ABC transport
            of hemin and hemoproteins in many eubacterial species.
            They belong to the TroA superfamily of periplasmic metal
            binding proteins that share a distinct fold and ligand
            binding mechanism. A typical TroA protein is comprised of
            two globular subdomains connected by a single helix and
            can bind the metal ion in the cleft between these
            domains.
          Length = 235

 Score = 30.3 bits (69), Expect = 6.5
 Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 16/91 (17%)

Query: 1118 KKLSAAKKPLIVVGADISDGAAVLALVQQLAAKVTCESDVPCDWKVLNILQKAASQVAAL 1177
             +L AA  P++ V +       +L  ++Q+A             + L + +K  +    +
Sbjct: 75   DQLRAAGVPVVTVPST-PTLDGLLTKIRQVA-------------QALGVPEKGEALAQEV 120

Query: 1178 DIGYK--PGTSAIREKPPKVLFLLGADEGSI 1206
                     T A  +KPP+VLFLL    G+ 
Sbjct: 121  RQRLAALRKTVAAHKKPPRVLFLLSHGGGAA 151


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 30.7 bits (69), Expect = 6.8
 Identities = 31/140 (22%), Positives = 57/140 (40%), Gaps = 10/140 (7%)

Query: 78  SASIKTELREFYQTQL-NNNVKEKL---KEFQTQLDSAELTLHREFQQKEKTIVENAAKQ 133
           S  I+   ++F   QL N+ VK  L   K+    L+  E +      +K +  ++N    
Sbjct: 83  SLKIEVYNKQFKHEQLWNSQVKGILTLGKKTNDDLEKIE-SKKESINKKNEKKIKNE-AS 140

Query: 134 LKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLML---DFKAKCEVYEAENKKLKAE 190
           L+  ++ E K+++  I +      K+  E+            FK K    E EN K+ A 
Sbjct: 141 LQVKTQREEKEEKDFIADCWRNLYKKNEERFKEYLENFKRKKFKRKILK-EFENAKINAS 199

Query: 191 LTQRDIKLKETNHLLKEESK 210
                 K+KE   ++   ++
Sbjct: 200 EIVGLEKVKEKIEIVFRHNQ 219


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 30.1 bits (68), Expect = 6.8
 Identities = 40/238 (16%), Positives = 92/238 (38%), Gaps = 29/238 (12%)

Query: 1   QKAQDEIQTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQ-- 58
           Q    ++  L  +  +I+ E+    ++  N  ++ Q +I  L  +    K E+++  +  
Sbjct: 34  QNQDSKLSELQKEKKNIQNEI----ESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEI 89

Query: 59  TQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLD------SAE 112
            +L  N   + E+    A++  +      +    LN+   +   +  +++        A+
Sbjct: 90  AELKENIVERQELLKKRARAMQVNGTATSYIDVILNS---KSFSDLISRVTAISVIVDAD 146

Query: 113 LTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAKQLKQISEKPGATKLMLD 172
             +  + ++ +K++     ++ +   E++ +       E   +     S+K     L+  
Sbjct: 147 KKILEQQKEDKKSL-----EEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAA 201

Query: 173 FKAKCEVYEAENKKLKAELTQRDIKLKETNHLLKEESKKRELMAKKVESVMQSQLQQA 230
             AK    EA     KA L ++    K        E+ K+E  A K  +  Q+ LQ A
Sbjct: 202 LAAK----EASALGEKAALEEQ----KALAEAAAAEAAKQE-AAAKAAAQEQAALQAA 250


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 30.2 bits (68), Expect = 7.0
 Identities = 29/155 (18%), Positives = 56/155 (36%), Gaps = 9/155 (5%)

Query: 394 VTSDIFNEPALYTKTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVV 453
             +++F+   L ++T  ++   + +E  + L   L +E    R  E+E    +Q   ++ 
Sbjct: 4   AMTNLFD--ILSSQT--NIDHPLCEECADILLERLDSEL---RDAEKERDTYKQYLSKLE 56

Query: 454 RKEVSIVTVTLFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQT 513
            + V I         L     +  +  D L  L+K  D++   L +    K +L      
Sbjct: 57  SQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQ 116

Query: 514 QLNNNVKEKLK--EFQTQLDSAELTLHREFQQKEK 546
            L           + +  L S EL       Q +K
Sbjct: 117 YLREYNLFDRNNLQLEDNLQSLELQYEYSLNQLDK 151


>gnl|CDD|236042 PRK07525, PRK07525, sulfoacetaldehyde acetyltransferase; Validated.
          Length = 588

 Score = 30.7 bits (70), Expect = 7.1
 Identities = 25/90 (27%), Positives = 38/90 (42%), Gaps = 25/90 (27%)

Query: 1072 IRKGYLTNELDVAYIGPKVDLRYDY-----EHLGESADLIKQLASGSHPFSKKLSAAKKP 1126
            I + Y    +DV    P   +R +      + L E+A+L              LS AK P
Sbjct: 161  IPRDYFYGVIDVEIPQP---VRLERGAGGEQSLAEAAEL--------------LSEAKFP 203

Query: 1127 LIVVGADISDGAAV---LALVQQLAAKVTC 1153
            +I+ GA +    A+    AL ++L A V C
Sbjct: 204  VILSGAGVVLSDAIEECKALAERLDAPVAC 233


>gnl|CDD|162434 TIGR01580, narG, respiratory nitrate reductase, alpha subunit.  The
            Nitrate reductase enzyme complex allows bacteria to use
            nitrate as an electron acceptor during anaerobic growth.
            The enzyme complex consists of a tetramer that has an
            alpha, beta and 2 gamma subunits. The alpha and beta
            subunits have catalytic activity and the gamma subunits
            attach the enzyme to the membrane and is a b-type
            cytochrome that receives electrons from the quinone pool
            and transfers them to the beta subunit. This model is
            specific for the alpha subunit for nitrate reductase I
            (narG) and nitrate reductase II (narZ) for gram positive
            and gram negative bacteria.A few thermophiles and archaea
            also match the model The seed members used to make the
            model include Nitrate reductases from Pseudomonas
            fluorescens (GP:11344601), E.coli and B.subtilis. All
            seed members are experimentally characterized. Some
            unpublished nitrate reductases, that are shorter
            sequences, and probably fragments fall in between the
            noise and trusted cutoffs. Pfam models pfam00384
            (Molybdopterin oxidoreductase) and pfam01568(Molydopterin
            dinucleotide binding domain) will also match the nitrate
            reductase, alpha subunit [Energy metabolism, Anaerobic].
          Length = 1235

 Score = 30.7 bits (69), Expect = 7.1
 Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 1231 ADAILPGAAYTEKQSTYVNTEGRAQQTLT-AVTPPGLAREDWKIIRALSEGVS 1282
            +D +LP A + EK     +        L+ A+ P   ++ DW+I +A+++  S
Sbjct: 782  SDIVLPTATWYEKDDMNTSDMHPFIHPLSAAIDPAWESKSDWEIYKAIAKAFS 834


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 30.4 bits (69), Expect = 7.7
 Identities = 40/191 (20%), Positives = 76/191 (39%), Gaps = 32/191 (16%)

Query: 40  QTLLAKSASIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKE 99
             L A+   ++ +L +  + QL    +A+ E++  L+K+ +   + R  ++++     +E
Sbjct: 260 ANLQAQLRQLEHDL-DSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEE 318

Query: 100 ----------KLKEFQTQLDSA-------ELTLHREFQQKEKTIVENAAKQLKQISENEF 142
                     K+ E +   ++A       E T  R   + E   +E         SE E 
Sbjct: 319 LEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAA-ASELEK 377

Query: 143 KQK--EKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRDIKLKE 200
           KQK  +K + E   K  +  +E   A +       +      E  +LK EL     +LK+
Sbjct: 378 KQKNFDKILAEWKRKVDELQAELDTAQR-------EARNLSTELFRLKNELE----ELKD 426

Query: 201 TNHLLKEESKK 211
               L+ E+K 
Sbjct: 427 QVEALRRENKN 437


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 29.2 bits (66), Expect = 8.0
 Identities = 21/96 (21%), Positives = 48/96 (50%), Gaps = 4/96 (4%)

Query: 54  REFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAEL 113
            EF ++     +K + EI+ L      +KT+L E  + +    ++ K ++ + +L + E 
Sbjct: 49  LEFRESLEETLRKLEAEIERLQNTIERLKTQL-EDLERE-LALLQAKERQLEKKLKTLEQ 106

Query: 114 TLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTI 149
            L  E ++ ++   +N  +Q K    +E K++++ I
Sbjct: 107 KLKNEKEEVQRL--KNIIQQRKTQYNHELKKRDREI 140


>gnl|CDD|190336 pfam02520, DUF148, Domain of unknown function DUF148.  This domain
           has no known function nor do any of the proteins that
           possess it. In one member of this family the aligned
           region is repeated twice.
          Length = 114

 Score = 28.7 bits (65), Expect = 8.3
 Identities = 20/104 (19%), Positives = 42/104 (40%), Gaps = 20/104 (19%)

Query: 65  QKAQDEIQTLLAKS----ASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAELTLHREFQ 120
           ++A+ E   +L       A I  +L  + +      V ++ +EF+  + +    L  E +
Sbjct: 1   EEARQEFFAILKNETLTIAEIDAQLEAWAEK---YGVADQYEEFKKNVKA----LKAEAE 53

Query: 121 QKEKTIVEN---AAKQLKQISENEFKQKEKTIVE--NAAKQLKQ 159
           +    ++ N      +L  I +N    K+ T  +   A   LK+
Sbjct: 54  KNVTAVISNLSSVDAKLSAILDN----KDLTRRQKKEAIDALKK 93


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 29.1 bits (66), Expect = 8.4
 Identities = 22/121 (18%), Positives = 49/121 (40%), Gaps = 19/121 (15%)

Query: 39  IQTLLAKSA---SIKTELREFYQTQLNNNQKAQDEIQTLLAKSASIKTELREFYQTQLNN 95
           +Q +L++S    + + +L + ++      QK + E+Q    K            Q Q   
Sbjct: 23  VQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQK-----------LQKQAAT 71

Query: 96  NVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAAKQLKQISENEFKQKEKTIVENAAK 155
             +E  K  Q +L   +    +E QQK++   +   ++ +++       K    ++  AK
Sbjct: 72  LSEEARKAKQQELQQKQ----QELQQKQQAAQQELQQKQQEL-LQPIYDKIDKAIKEVAK 126

Query: 156 Q 156
           +
Sbjct: 127 E 127


>gnl|CDD|232855 TIGR00173, menD,
            2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-
            carboxylic-acid synthase.  MenD was thought until
            recently to act as SHCHC synthase, but has recently been
            shown to act instead as SEPHCHC synthase. Conversion of
            SEPHCHC into SHCHC and pyruvate may occur spontaneously
            but is catalyzed efficiently, at least in some organisms,
            by MenH (see TIGR03695). 2-oxoglutarate
            decarboxylase/SHCHC synthase (menD) is a thiamine
            pyrophosphate enzyme involved in menaquinone biosynthesis
            [Biosynthesis of cofactors, prosthetic groups, and
            carriers, Menaquinone and ubiquinone].
          Length = 430

 Score = 30.3 bits (69), Expect = 8.6
 Identities = 12/44 (27%), Positives = 19/44 (43%), Gaps = 2/44 (4%)

Query: 1113 SHPFSKKLSAAKKPLIVVG--ADISDGAAVLALVQQLAAKVTCE 1154
                  +L  AK+ LI+ G  A   D  A+ AL + L   +  +
Sbjct: 200  LDDLWDRLRQAKRGLIIAGPLAGAEDAEALAALAEALGWPLLAD 243


>gnl|CDD|234013 TIGR02785, addA_Gpos, helicase-exonuclease AddAB, AddA subunit,
           Firmicutes type.  AddAB, also called RexAB, substitutes
           for RecBCD in several bacterial lineages. These DNA
           recombination proteins act before synapse and are
           particularly important for DNA repair of double-stranded
           breaks by homologous recombination. The term AddAB is
           used broadly, with AddA homologous between the
           Firmicutes (as modeled here) and the
           alphaproteobacteria, while the partner AddB proteins
           show no strong homology across the two groups of species
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 1230

 Score = 30.4 bits (69), Expect = 8.6
 Identities = 29/164 (17%), Positives = 60/164 (36%), Gaps = 23/164 (14%)

Query: 412 VTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRS 471
             L+  +EK++        E   + +LE   Q   Q  +E++++ ++        + + +
Sbjct: 217 NELEGLQEKLQRALELFMAEDGLAPRLEN-FQLDLQNIDELIQESLAQADWNELRKAVAA 275

Query: 472 SHSKRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLD 531
              K  K A          +E                     +L    K++L++ +T   
Sbjct: 276 FKFKNLKAAKGDEEDADLLEEAD-------------------KLREEAKKQLEKLKTDYF 316

Query: 532 SAELTLHREFQQKEKTIVENAAKQLKQISEKPGATKL---MLDF 572
           +     H    Q+ K +V+   + +K   E+ GA K    +LDF
Sbjct: 317 TRSEEDHLRIMQEMKPVVKTLVQLVKDFIERFGAEKREKNILDF 360


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 30.0 bits (67), Expect = 8.9
 Identities = 27/207 (13%), Positives = 76/207 (36%), Gaps = 29/207 (14%)

Query: 415 KISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVVRKEVSIVTVTLFLERLRSSHS 474
           +  ++++ +  +NL    Q   +L ++ QD +                     RL++   
Sbjct: 119 EAVRQELAAARQNLAKAQQELARLTKQAQDLQT--------------------RLKTLAE 158

Query: 475 KRSKNADSLVRLQKAQDEIQTLLAKSASIKTELREFYQTQLNNNVKEKLKEFQTQLDSAE 534
           +R +       LQ +Q ++Q   A +  +K+++ +         ++++ +   T+ ++A+
Sbjct: 159 QRRQLEAQAQSLQASQKQLQ---ASATQLKSQVLDLKLRS--AQIEQEAQNLATRANAAQ 213

Query: 535 LTLHREFQQKEKTIVENAAKQLKQISEKPGATKLMLDFKAKCEVYEAENKKLKAELTQRD 594
                E  ++     + A   ++Q   +   ++      A+ E      ++L+   T + 
Sbjct: 214 -ARTEELARRAAAAQQTAQA-IQQRDAQ--ISQKAQQIAARAEQIRERERQLQRLETAQA 269

Query: 595 IKLKETNHLLKEESKKRELMAKKVESV 621
              +E   L         L  +   + 
Sbjct: 270 RLEQEVAQLEAYYQAYVRLRQQAAATQ 296


>gnl|CDD|220772 pfam10475, DUF2450, Protein of unknown function N-terminal domain
           (DUF2450).  This protein is found in eukaryotes but its
           function is not known. The C-terminal part of some
           members is DUF2451.
          Length = 291

 Score = 30.0 bits (68), Expect = 9.4
 Identities = 28/151 (18%), Positives = 63/151 (41%), Gaps = 22/151 (14%)

Query: 431 ETQRSRKLEEELQDTRQTCEEVVRKEVSIVT--VTLFLERLRSSHSKRS------KNADS 482
             QR R++E++L++    C+   R+ +      +T     L + + KR           +
Sbjct: 65  GMQRIREIEKDLKEANVICKNG-RRHLDSSKKNLTRTALGLLAKYKKRQLLIDLLPTLTT 123

Query: 483 LVRLQKAQDEIQTLLAKS------------ASIKTELREFYQTQ-LNNNVKEKLKEFQTQ 529
           +  ++  +  ++ LL +               I     ++   Q L++ ++E L   +T+
Sbjct: 124 IKSIRDTEVRLRELLEEGNYPGAIQLLLECQQIAEGYSQYNCVQELSSKLQETLLLTETK 183

Query: 530 LDSAELTLHREFQQKEKTIVENAAKQLKQIS 560
           LDS  L +  +F  ++   V+ A K L ++ 
Sbjct: 184 LDSVLLEVCVDFDSEKYAKVQEAYKLLGKVL 214


>gnl|CDD|115367 pfam06705, SF-assemblin, SF-assemblin/beta giardin.  This family
           consists of several eukaryotic SF-assemblin and related
           beta giardin proteins. During mitosis the
           SF-assemblin-based cytoskeleton is reorganised; it
           divides in prophase and is reduced to two dot-like
           structures at each spindle pole in metaphase. During
           anaphase, the two dots present at each pole are
           connected again. In telophase there is an asymmetrical
           outgrowth of new fibres. It has been suggested that
           SF-assemblin is involved in re-establishing the
           microtubular root system characteristic of interphase
           cells after mitosis.
          Length = 247

 Score = 29.8 bits (67), Expect = 9.6
 Identities = 28/150 (18%), Positives = 63/150 (42%), Gaps = 14/150 (9%)

Query: 407 KTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEVV--RKEVSIVTVTL 464
           +   +   ++  E I  LE++L+ E +R  + ++++Q+  +   + +  R    +  +  
Sbjct: 29  RIAEATRFQMITEAIARLEKSLEAEVKRRAESDKQIQEHFEGEIKNLQERIAEQVADMQA 88

Query: 465 FLERLRSSHSKRSKNADSLVRLQKAQ--DEIQTL---LAKS-----ASIKTE--LREFYQ 512
             +    S S    +  + +R ++ Q   +I+ L   L K      A+I  E   R+  Q
Sbjct: 89  AFKSAVDSLSDTLNDLHTNIRNEREQRRSDIEHLATSLVKKVNECVAAIDEERISRKQEQ 148

Query: 513 TQLNNNVKEKLKEFQTQLDSAELTLHREFQ 542
           +Q+   V E       ++D+ + T   E  
Sbjct: 149 SQILKKVAEDFARISARIDTEKNTREAEVS 178


>gnl|CDD|153338 cd07654, F-BAR_FCHSD, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of FCH and double SH3
           domains proteins (FCHSD).  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. This subfamily is composed of FCH
           and double SH3 domain (FCHSD) proteins, so named as they
           contain an N-terminal F-BAR domain and two SH3 domains
           at the C-terminus. Vertebrates harbor two subfamily
           members, FCHSD1 and FCHSD2, which have been
           characterized only in silico. Their biological function
           is still unknown. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
          Length = 264

 Score = 29.9 bits (67), Expect = 9.8
 Identities = 36/165 (21%), Positives = 65/165 (39%), Gaps = 20/165 (12%)

Query: 407 KTVRSVTLKISKEKIESLERNLQTETQRSRKLEEELQDTRQTCEEV---VRKEVSIVTVT 463
           KT RS   +  K+  E L+R  Q E Q++ +   + + T    E+V    R++ + V   
Sbjct: 102 KTGRSAKEQQLKKCTEQLQR-AQAEVQQTVRELSKSRKTYFEREQVAHLAREKAADVQAR 160

Query: 464 LFLERLRSSHSKRSKNADSLVRLQKAQDEIQTLLAKS--------ASIKTELREFYQTQL 515
                L    S+ S    S V+L   + E  +    +        A+       +YQT L
Sbjct: 161 EARSDLSIFQSRTSLQKAS-VKLSARKAECSSKATAARNDYLLNLAATNAHQDRYYQTDL 219

Query: 516 NN-------NVKEKLKEFQTQLDSAELTLHREFQQKEKTIVENAA 553
                     + + LK+F   L   EL   +  Q+  + ++E ++
Sbjct: 220 PAIIKALDGELYDHLKDFLISLSHTELETAQVIQETFQRLLETSS 264


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.128    0.353 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 64,492,197
Number of extensions: 6338665
Number of successful extensions: 7404
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6892
Number of HSP's successfully gapped: 686
Length of query: 1334
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1225
Effective length of database: 6,103,016
Effective search space: 7476194600
Effective search space used: 7476194600
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 65 (28.9 bits)