RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17711
         (335 letters)



>gnl|CDD|215484 PLN02894, PLN02894, hydrolase, alpha/beta fold family protein.
          Length = 402

 Score =  131 bits (331), Expect = 4e-35
 Identities = 80/257 (31%), Positives = 118/257 (45%), Gaps = 38/257 (14%)

Query: 2   GRSSRPRFS-TDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
           G SSRP F+    E  E   + S EEWRK   L   ILLGHSFGGY+A  YA+++P+ V+
Sbjct: 143 GGSSRPDFTCKSTEETEAWFIDSFEEWRKAKNLSNFILLGHSFGGYVAAKYALKHPEHVQ 202

Query: 61  HLILADPWGFPQKSIDPQ------KASKIPLWARMIGNLY--KNFNPLWPVRFVGPLGQW 112
           HLIL  P GF  +S D        +A+    W   + N     NF P   +R +GP G  
Sbjct: 203 HLILVGPAGFSSESDDKSEWLTKFRAT----WKGAVLNHLWESNFTPQKIIRGLGPWG-- 256

Query: 113 VVEKMRPDLPKKFT-----------PVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEGL 161
                 P+L +++T            + +E+S  +T+Y++       SGE     +    
Sbjct: 257 ------PNLVRRYTTARFGAHSTGDILSEEESKLLTDYVYHTLAAKASGELCLKYIFSFG 310

Query: 162 GYAKRPMLHRVDQLAAHVPVTVIYGSRSWVDNSSGDKIKEARSQ-NSFVQVKSVTGAGHH 220
            +A++P+L    +    VP T IYG   W++        EAR +     ++  V   GH 
Sbjct: 311 AFARKPLLESASEWK--VPTTFIYGRHDWMNYEGA---VEARKRMKVPCEIIRVPQGGHF 365

Query: 221 VYADRADVFNKMVNDTC 237
           V+ D    F+  V   C
Sbjct: 366 VFLDNPSGFHSAVLYAC 382


>gnl|CDD|201306 pfam00561, Abhydrolase_1, alpha/beta hydrolase fold.  This
           catalytic domain is found in a very wide range of
           enzymes.
          Length = 226

 Score = 98.0 bits (244), Expect = 4e-24
 Identities = 56/241 (23%), Positives = 87/241 (36%), Gaps = 31/241 (12%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
           FGRSS P+     +     L   +E     L L ++ L+GHS GG +A AYA +YPDRVK
Sbjct: 11  FGRSSPPKD--FADYRFDDLAEDLEALLDALGLDKVNLVGHSMGGLIALAYAAKYPDRVK 68

Query: 61  HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPVRFVGPLGQWVVEKMRPD 120
            L+L                   PL  R  GNL       +  R         VE +   
Sbjct: 69  ALVLVGTV--------HPAGLSSPLTPR--GNLLGLLLDNFFNRLYDS-----VEALLGR 113

Query: 121 LPKKFTPVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEG-----LGYAKRPMLHRVDQL 175
             K+F  + +   S   +     ++       A   L        L + +   L  +D  
Sbjct: 114 AIKQFQALGRPFVSDFLKQFELSSLIRFGETLALDGLLGYALGYDLVWDRSAALKDID-- 171

Query: 176 AAHVPVTVIYGSRSW-VDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNKMVN 234
              VP  +I+G     V   + +K+         V +     AGH    ++ D   +++ 
Sbjct: 172 ---VPTLIIWGDDDPLVPPDASEKLAALFPNAQLVVID---DAGHLAQLEKPDEVAELIL 225

Query: 235 D 235
            
Sbjct: 226 K 226


>gnl|CDD|223669 COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta
           hydrolase superfamily) [General function prediction
           only].
          Length = 282

 Score = 83.5 bits (205), Expect = 2e-18
 Identities = 46/236 (19%), Positives = 80/236 (33%), Gaps = 20/236 (8%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
            GRS    +S             +      L L++++L+GHS GG +A A A+++PDRV+
Sbjct: 61  HGRSDPAGYSLS------AYADDLAALLDALGLEKVVLVGHSMGGAVALALALRHPDRVR 114

Query: 61  HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPVRFVGPLGQWVVEKMRPD 120
            L+L  P   P       +          + +L    +       +  LG      +   
Sbjct: 115 GLVLIGPAPPPGLLEAALRQPAGAAPLAALADLLLGLDAAAFAALLAALG------LLAA 168

Query: 121 LPKKFTPVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEGLGYAKRPMLHRVDQLAAHVP 180
           L       L E   A             +       L   L    R  L R+      VP
Sbjct: 169 LAAAARAGLAEALRAPLLGAAAAAFARAARADLAAALLALLDRDLRAALARIT-----VP 223

Query: 181 VTVIYGSRS-WVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNKMVND 235
             +I+G     V      ++  A   ++  ++  + GAGH  + +  + F   +  
Sbjct: 224 TLIIHGEDDPVVPAELARRLAAALPNDA--RLVVIPGAGHFPHLEAPEAFAAALLA 277


>gnl|CDD|225176 COG2267, PldB, Lysophospholipase [Lipid metabolism].
          Length = 298

 Score = 65.5 bits (160), Expect = 4e-12
 Identities = 51/249 (20%), Positives = 83/249 (33%), Gaps = 46/249 (18%)

Query: 1   FGRSSR-PRFSTDPETVERQLVTSIEEWRKELKLQ----EMILLGHSFGGYLAFAYAIQY 55
            GRS R  R   D        V  ++ + + +        + LLGHS GG +A  Y  +Y
Sbjct: 72  HGRSPRGQRGHVD---SFADYVDDLDAFVETIAEPDPGLPVFLLGHSMGGLIALLYLARY 128

Query: 56  PDRVKHLILADPWGFPQKSIDPQKASKI--PLWARMIGNLYKNFNPLWPVRFVGPL-GQW 112
           P R+  L+L+ P      +I     +++   L  R+   L  + N L  V          
Sbjct: 129 PPRIDGLVLSSPALGLGGAILRLILARLALKLLGRIRPKLPVDSNLLEGVLTDDLSRDPA 188

Query: 113 VVEKMRPDLPKKFTPVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEGLGYAKRPMLHRV 172
            V     D                        V      +            + P L   
Sbjct: 189 EVAAYEAD----------------PLIGVGGPVSRWVDLALLAG--------RVPALR-- 222

Query: 173 DQLAAHVPVTVIYGSR-SWVDNSSG--DKIKEARSQNSFVQVKSVTGAGHHVYAD----R 225
           D  A  +PV ++ G     VDN  G     + A S +   ++K + GA H +  +    R
Sbjct: 223 DAPAIALPVLLLQGGDDRVVDNVEGLARFFERAGSPD--KELKVIPGAYHELLNEPDRAR 280

Query: 226 ADVFNKMVN 234
            +V   ++ 
Sbjct: 281 EEVLKDILA 289


>gnl|CDD|221720 pfam12697, Abhydrolase_6, Alpha/beta hydrolase family.  This family
           contains alpha/beta hydrolase enzymes of diverse
           specificity.
          Length = 187

 Score = 58.6 bits (142), Expect = 3e-10
 Identities = 36/182 (19%), Positives = 60/182 (32%), Gaps = 33/182 (18%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
            G S  P     P ++E      +      L L  ++L+GHS GG +A A A + P+RV 
Sbjct: 35  HGDSDGPP--RTPYSLE-DDAADLAALLDALGLGPVVLVGHSLGGAVALAAAARRPERVA 91

Query: 61  HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPVRFVGPLGQWVVEKMRPD 120
            L+L  P                     ++         L            +   +R  
Sbjct: 92  GLVLISPPL--------------RDLEELLAADAAALLALLR-------AALLDADLREA 130

Query: 121 LPKKFTPVL----KEDSSAITEYIFQCNVQAPSGESAFHTLTEGLGYAKRPMLHRVDQLA 176
           L +   PVL    ++D     E   +  +      +    L  G G+   P L   +++A
Sbjct: 131 LARLTVPVLVIHGEDDPLVPPEAARR--LAEALPGAELVVL-PGAGHL--PHLEHPEEVA 185

Query: 177 AH 178
             
Sbjct: 186 EA 187



 Score = 33.9 bits (78), Expect = 0.064
 Identities = 13/57 (22%), Positives = 24/57 (42%), Gaps = 4/57 (7%)

Query: 176 AAHVPVTVIYGSRS-WVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNK 231
              VPV VI+G     V   +  ++ EA       ++  + GAGH  + +  +   +
Sbjct: 133 RLTVPVLVIHGEDDPLVPPEAARRLAEALPG---AELVVLPGAGHLPHLEHPEEVAE 186


>gnl|CDD|188121 TIGR01250, pro_imino_pep_2, proline-specific peptidase, Bacillus
           coagulans-type subfamily.  This model describes a
           subfamily of the alpha/beta fold family of hydrolases.
           Characterized members include prolinases (Pro-Xaa
           dipeptidase, EC 3.4.13.8), prolyl aminopeptidases (EC
           3.4.11.5), and a leucyl aminopeptidase.
          Length = 289

 Score = 58.2 bits (141), Expect = 1e-09
 Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 16/100 (16%)

Query: 2   GRSSRPRFSTDPE--TVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRV 59
           G S +P  S D E  T++   V  +EE R++L L +  LLGHS+GG LA  YA++Y   +
Sbjct: 65  GYSDQPDDS-DEELWTIDY-FVDELEEVREKLGLDKFYLLGHSWGGMLAQEYALKYGQHL 122

Query: 60  KHLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNP 99
           K LI++                  P + + +  L K   P
Sbjct: 123 KGLIISSM------------LDSAPEYVKELNRLRKELPP 150


>gnl|CDD|234315 TIGR03695, menH_SHCHC,
           2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate
           synthase.  This protein catalyzes the formation of
           SHCHC, or (1 R,6
           R)-2-succinyl-6-hydroxy-2,
           4-cyclohexadiene-1-carboxylate, by elmination of
           pyruvate from
           2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-
           carboxylate (SEPHCHC). Note that SHCHC synthase activity
           previously was attributed to MenD, which in fact is
           SEPHCHC synthase [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Menaquinone and ubiquinone].
          Length = 252

 Score = 46.1 bits (110), Expect = 8e-06
 Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 2   GRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKH 61
           G S  P    +    E      +     +L ++   L+G+S GG +A  YA+QYP+RV+ 
Sbjct: 40  GSSQSPS-DIERYDFEEIAQLLLATLLDQLGIEPFFLVGYSMGGRIALYYALQYPERVQG 98

Query: 62  LIL 64
           LIL
Sbjct: 99  LIL 101


>gnl|CDD|130316 TIGR01249, pro_imino_pep_1, proline iminopeptidase, Neisseria-type
           subfamily.  This model represents one of two related
           families of proline iminopeptidase in the alpha/beta
           fold hydrolase family. The fine specificities of the
           various members, including both the range of short
           peptides from which proline can be removed and whether
           other amino acids such as alanine can be also removed,
           may vary among members.
          Length = 306

 Score = 45.2 bits (107), Expect = 2e-05
 Identities = 18/45 (40%), Positives = 29/45 (64%)

Query: 20  LVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLIL 64
           LV  IE+ R++L ++  ++ G S+G  LA AYA  +P+ V  L+L
Sbjct: 81  LVADIEKLREKLGIKNWLVFGGSWGSTLALAYAQTHPEVVTGLVL 125


>gnl|CDD|214007 cd12808, Esterase_713_like-1, Uncharacterized enzymes similar to
           novel bacterial esterase that cleaves esters on
           halogenated cyclic compounds.  This family contains
           uncharacterized proteins similar to a novel bacterial
           esterase (Alcaligenes esterase 713) with the alpha/beta
           hydrolase fold but does not contain the GXSXXG
           pentapeptide around the active site serine residue as
           commonly seen in other enzymes of this class. Esterase
           713 shows negligible sequence homology to other esterase
           and lipase enzymes. It is active as a dimer and cleaves
           esters on halogenated cyclic compounds though its
           natural substrate is unknown.
          Length = 309

 Score = 44.2 bits (105), Expect = 5e-05
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 37  ILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQ 72
           I++ HS GG  AF  A   PD V+ ++  +P G P 
Sbjct: 191 IVVAHSQGGGFAFEAARARPDLVRAVVALEPSGAPD 226


>gnl|CDD|221718 pfam12695, Abhydrolase_5, Alpha/beta hydrolase family.  This family
           contains a diverse range of alpha/beta hydrolase
           enzymes.
          Length = 145

 Score = 40.1 bits (94), Expect = 3e-04
 Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 4/75 (5%)

Query: 13  PETVERQLVTSIEEWRKELK--LQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGF 70
           P           E    +     + ++L+GHS GG +A   A + P     ++LA     
Sbjct: 35  PGHGASLGAPDAEAVLADAPLDPERIVLVGHSLGGGVALLLAARDPRVKAAVVLAAGDPP 94

Query: 71  PQKSIDPQKASKIPL 85
              ++D      +P+
Sbjct: 95  D--ALDDLAKLTVPV 107



 Score = 29.3 bits (66), Expect = 1.5
 Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 4/62 (6%)

Query: 160 GLGYAKRPMLHRVDQLAA-HVPVTVIYGSR-SWVDNSSGDKIKEARSQNSFVQVKSVTGA 217
            +  A       +D LA   VPV +I+G+R   V     + +  A    + + V  + GA
Sbjct: 85  AVVLAAGDPPDALDDLAKLTVPVLIIHGTRDGVVPPEEAEALAAALPGPAELVV--IEGA 142

Query: 218 GH 219
           GH
Sbjct: 143 GH 144


>gnl|CDD|215315 PLN02578, PLN02578, hydrolase.
          Length = 354

 Score = 41.8 bits (98), Expect = 3e-04
 Identities = 52/231 (22%), Positives = 95/231 (41%), Gaps = 31/231 (13%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
           FG S +     D      Q    + ++ KE+  +  +L+G+S GG+ A + A+ YP+ V 
Sbjct: 123 FGWSDKALIEYDAMVWRDQ----VADFVKEVVKEPAVLVGNSLGGFTALSTAVGYPELVA 178

Query: 61  HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPL--WPVRFVGPLGQWVVEKMR 118
            + L +  G         K   I +   ++        PL  W  R V     W  ++  
Sbjct: 179 GVALLNSAGQFGSESRE-KEEAIVVEETVLTRFV--VKPLKEWFQRVVLGFLFWQAKQ-- 233

Query: 119 PDLPKKFTPVLKE---DSSAITEYIFQCNVQAPS-----GESAFHTLTEGLGYAKRPMLH 170
              P +   VLK    D S + +Y+ + ++  P+     GE  +  ++  L    R  L 
Sbjct: 234 ---PSRIESVLKSVYKDKSNVDDYLVE-SITEPAADPNAGEVYYRLMSRFLFNQSRYTLD 289

Query: 171 RV-DQLAAHVPVTVIYG-SRSWVDNSSGDKIKEARSQNSFVQVKSVTGAGH 219
            +  +L+   P+ +++G    WV  +  +KIK      + V ++    AGH
Sbjct: 290 SLLSKLSC--PLLLLWGDLDPWVGPAKAEKIKAFYPDTTLVNLQ----AGH 334


>gnl|CDD|132386 TIGR03343, biphenyl_bphD, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate
           hydrolase.  Members of this family are
           2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or
           HOPD hydrolase, the BphD protein of biphenyl
           degradation. BphD acts on the product of ring
           meta-cleavage by BphC. Many species carrying bphC and
           bphD are capable of degrading polychlorinated biphenyls
           as well as biphenyl itself.
          Length = 282

 Score = 40.3 bits (94), Expect = 7e-04
 Identities = 20/59 (33%), Positives = 31/59 (52%)

Query: 31  LKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDPQKASKIPLWARM 89
           L +++  L+G+S GG  A  +A++YPDR+  LIL  P G       P     I L  ++
Sbjct: 98  LDIEKAHLVGNSMGGATALNFALEYPDRIGKLILMGPGGLGPSLFAPMPMEGIKLLFKL 156


>gnl|CDD|215567 PLN03087, PLN03087, BODYGUARD 1 domain containing hydrolase;
           Provisional.
          Length = 481

 Score = 39.4 bits (92), Expect = 0.002
 Identities = 25/72 (34%), Positives = 36/72 (50%), Gaps = 4/72 (5%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKE-LKLQEMILLGHSFGGYLAFAYAIQYPDRV 59
           FGRS +P    D     R+ +  IE    E  K++   ++ HS G  LA A A+++P  V
Sbjct: 243 FGRSPKP---ADSLYTLREHLEMIERSVLERYKVKSFHIVAHSLGCILALALAVKHPGAV 299

Query: 60  KHLILADPWGFP 71
           K L L  P  +P
Sbjct: 300 KSLTLLAPPYYP 311


>gnl|CDD|184875 PRK14875, PRK14875, acetoin dehydrogenase E2 subunit
           dihydrolipoyllysine-residue acetyltransferase;
           Provisional.
          Length = 371

 Score = 39.2 bits (92), Expect = 0.002
 Identities = 17/52 (32%), Positives = 26/52 (50%)

Query: 19  QLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGF 70
           +L  ++  +   L ++   L+GHS GG +A   A + P RV  L L  P G 
Sbjct: 182 ELAAAVLAFLDALGIERAHLVGHSMGGAVALRLAARAPQRVASLTLIAPAGL 233



 Score = 29.5 bits (67), Expect = 2.5
 Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 12/67 (17%)

Query: 173 DQLAAH-VPVTVIYGSRSWVDNSSGDKIKEAR-SQN--SFVQVKSVTGAGHHVYADRADV 228
           D+LA+  +PV VI+G +        D+I  A  +Q     V V  + GAGH    + A  
Sbjct: 308 DRLASLAIPVLVIWGEQ--------DRIIPAAHAQGLPDGVAVHVLPGAGHMPQMEAAAD 359

Query: 229 FNKMVND 235
            N+++ +
Sbjct: 360 VNRLLAE 366


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 36.6 bits (85), Expect = 0.003
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
           A           EE +EEEE ++EEE++ EE+ 
Sbjct: 64  AAAAAAAAAAAAEEEEEEEEEEEEEEEESEEEA 96



 Score = 35.8 bits (83), Expect = 0.005
 Identities = 11/42 (26%), Positives = 19/42 (45%)

Query: 241 DEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           +E +       V             +EEEE ++EEE+++EE 
Sbjct: 51  EEAIKKAAAAPVAAAAAAAAAAAAAEEEEEEEEEEEEEEEES 92



 Score = 35.4 bits (82), Expect = 0.007
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
           A            E +EEEE ++EEE+++ E
Sbjct: 63  AAAAAAAAAAAAAEEEEEEEEEEEEEEEESE 93



 Score = 32.3 bits (74), Expect = 0.072
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
           A          EEE +EEEE ++EEE+ +EE   
Sbjct: 65  AAAAAAAAAAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 29.6 bits (67), Expect = 0.70
 Identities = 9/44 (20%), Positives = 21/44 (47%)

Query: 245 DIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
           + +   A  P              EE ++EEE+++EE+ + +++
Sbjct: 52  EAIKKAAAAPVAAAAAAAAAAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 29.2 bits (66), Expect = 0.87
 Identities = 11/52 (21%), Positives = 23/52 (44%)

Query: 239 LSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
           L    ++    KA   P              E ++EEE+++EE+ +  +++A
Sbjct: 45  LEGVNIEEAIKKAAAAPVAAAAAAAAAAAAAEEEEEEEEEEEEEEEESEEEA 96


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 36.5 bits (85), Expect = 0.003
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
                     E K+EEE ++EE+++ EE+ 
Sbjct: 68  AAAAAAAAAAEEKKEEEEEEEEKEESEEEA 97



 Score = 34.5 bits (80), Expect = 0.015
 Identities = 8/32 (25%), Positives = 16/32 (50%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           A              +++EE ++EEEK++ E+
Sbjct: 64  AAAAAAAAAAAAAAEEKKEEEEEEEEKEESEE 95



 Score = 33.8 bits (78), Expect = 0.030
 Identities = 8/32 (25%), Positives = 16/32 (50%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           A              +E++E ++EEE+K+E +
Sbjct: 63  AAAAAAAAAAAAAAAEEKKEEEEEEEEKEESE 94



 Score = 33.4 bits (77), Expect = 0.041
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
           A           EE KEEEE ++E+E+ +EE  
Sbjct: 66  AAAAAAAAAAAAEEKKEEEEEEEEKEESEEEAA 98



 Score = 33.0 bits (76), Expect = 0.047
 Identities = 8/32 (25%), Positives = 14/32 (43%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
                           EE++ ++EEE++KEE 
Sbjct: 62  VAAAAAAAAAAAAAAAEEKKEEEEEEEEKEES 93



 Score = 31.5 bits (72), Expect = 0.14
 Identities = 8/42 (19%), Positives = 19/42 (45%)

Query: 241 DEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           +E +       V               EE++++EEE++++E+
Sbjct: 51  EEAIKKAAAAPVAAAAAAAAAAAAAAAEEKKEEEEEEEEKEE 92



 Score = 29.2 bits (66), Expect = 1.1
 Identities = 8/46 (17%), Positives = 20/46 (43%)

Query: 245 DIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
           + +   A  P                 +K+EE+++EE+ +  +++A
Sbjct: 52  EAIKKAAAAPVAAAAAAAAAAAAAAAEEKKEEEEEEEEKEESEEEA 97


>gnl|CDD|179554 PRK03204, PRK03204, haloalkane dehalogenase; Provisional.
          Length = 286

 Score = 38.3 bits (89), Expect = 0.004
 Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 3/83 (3%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
           FG S RP  S     ++ +    I E+   L L   + +G  +GG ++ A A++  DRV+
Sbjct: 71  FGLSERP--SGFGYQID-EHARVIGEFVDHLGLDRYLSMGQDWGGPISMAVAVERADRVR 127

Query: 61  HLILADPWGFPQKSIDPQKASKI 83
            ++L + W +P  ++  +  S++
Sbjct: 128 GVVLGNTWFWPADTLAMKAFSRV 150


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 37.2 bits (87), Expect = 0.004
 Identities = 27/123 (21%), Positives = 43/123 (34%), Gaps = 14/123 (11%)

Query: 201 EARSQNSFVQVKSVTGAGHHVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQE 260
           EA  Q +F Q KS+      + AD  ++              +L     K  +  KE   
Sbjct: 36  EASDQWAFYQAKSIKQHLAELAADLLELEAAAPRA-------ELQ---AKIARYKKEKAR 85

Query: 261 PEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMTLLTMFLVLFG 320
              E KE E + KE E + +         A +V  + + I + S    +  LT    L+ 
Sbjct: 86  YRSEAKELEAKAKEAEAESDHALHQHHRFALAVALLQIAIALAS----IAALTKKKWLWY 141

Query: 321 LGY 323
              
Sbjct: 142 ASL 144


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 37.5 bits (88), Expect = 0.006
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
                + Q    E +EEEE ++EEE+  EE+ 
Sbjct: 290 EVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEA 321



 Score = 32.5 bits (75), Expect = 0.25
 Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 230 NKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEE 276
              + D   L +E  ++++ +A     E +E EEE +EEEE   EEE
Sbjct: 275 AAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEE-PSEEE 320



 Score = 32.5 bits (75), Expect = 0.26
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           +K     Q      +EEEE ++EEE+++  +
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318


>gnl|CDD|216224 pfam00975, Thioesterase, Thioesterase domain.  Peptide synthetases
           are involved in the non-ribosomal synthesis of peptide
           antibiotics. Next to the operons encoding these enzymes,
           in almost all cases, are genes that encode proteins that
           have similarity to the type II fatty acid thioesterases
           of vertebrates. There are also modules within the
           peptide synthetases that also share this similarity.
           With respect to antibiotic production, thioesterases are
           required for the addition of the last amino acid to the
           peptide antibiotic, thereby forming a cyclic antibiotic.
           Thioesterases (non-integrated) have molecular masses of
           25-29 kDa.
          Length = 224

 Score = 36.2 bits (84), Expect = 0.012
 Identities = 30/124 (24%), Positives = 44/124 (35%), Gaps = 27/124 (21%)

Query: 38  LLGHSFGGYLAFAYAIQYPDR---VKHLILADPW----GFPQKSIDPQKASKIPLWARMI 90
           L GHS GG LAF  A +   R      LIL+D +    G  + S        +    R+ 
Sbjct: 69  LFGHSMGGLLAFEVARRLERRGEEPAGLILSDAYAPHFGRREASHLLGDDELLAELRRIG 128

Query: 91  GNLYKNFNPLWPVRFVGPLGQWVVEKMRPDLPKKFTPVLKEDSSAITEYIFQCNVQAPSG 150
           G                   + ++E    +L     P L+ D  A+  Y     + APS 
Sbjct: 129 GTP-----------------EALLED--EELLSLLLPALRADYRALENYPIH-PIAAPSD 168

Query: 151 ESAF 154
            + F
Sbjct: 169 ATLF 172


>gnl|CDD|182637 PRK10673, PRK10673, acyl-CoA esterase; Provisional.
          Length = 255

 Score = 36.6 bits (85), Expect = 0.012
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 30  ELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILAD 66
            L++++   +GHS GG    A     PDR+  L+  D
Sbjct: 77  ALQIEKATFIGHSMGGKAVMALTALAPDRIDKLVAID 113


>gnl|CDD|224932 COG2021, MET2, Homoserine acetyltransferase [Amino acid transport
           and metabolism].
          Length = 368

 Score = 36.5 bits (85), Expect = 0.015
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 37  ILLGHSFGGYLAFAYAIQYPDRVKHLI 63
            ++G S GG  A  +AI+YPDRV+  I
Sbjct: 150 AVVGGSMGGMQALEWAIRYPDRVRRAI 176


>gnl|CDD|235135 PRK03592, PRK03592, haloalkane dehalogenase; Provisional.
          Length = 295

 Score = 36.1 bits (84), Expect = 0.020
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 24  IEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
           ++ W   L L +++L+GH +G  L F +A ++PDRV+
Sbjct: 83  LDAWFDALGLDDVVLVGHDWGSALGFDWAARHPDRVR 119


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 33.9 bits (78), Expect = 0.021
 Identities = 12/32 (37%), Positives = 15/32 (46%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           A        E   E  E EE +KEEE ++E D
Sbjct: 69  AAAAAAAGAEAAAEADEAEEEEKEEEAEEESD 100



 Score = 30.4 bits (69), Expect = 0.38
 Identities = 8/32 (25%), Positives = 13/32 (40%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           A            E   E +  +EEEK++E +
Sbjct: 65  AAAGAAAAAAAGAEAAAEADEAEEEEKEEEAE 96



 Score = 29.3 bits (66), Expect = 0.90
 Identities = 8/34 (23%), Positives = 16/34 (47%)

Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
            A     E     +E +EEE+ ++ EE+  ++  
Sbjct: 71  AAAAAGAEAAAEADEAEEEEKEEEAEEESDDDML 104



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 19/82 (23%), Positives = 30/82 (36%), Gaps = 16/82 (19%)

Query: 211 VKSVT-GAGHHVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPP---------KEPQE 260
           +KSV   AG  V   RA            L    +D V   A + P              
Sbjct: 22  LKSVLEAAGVEVEEARAKALVA------ALEGVDIDEVIKNAAEAPAAAAAAGAAAAAAA 75

Query: 261 PEEEVKEEEERKKEEEKKKEED 282
             E   E +E ++EE++++ E+
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEE 97


>gnl|CDD|233393 TIGR01392, homoserO_Ac_trn, homoserine O-acetyltransferase.  This
           family describes homoserine-O-acetyltransferase, an
           enzyme of methionine biosynthesis. This model has been
           rebuilt to identify sequences more broadly, including a
           number of sequences suggested to be homoserine
           O-acetyltransferase based on proximity to other Met
           biosynthesis genes [Amino acid biosynthesis, Aspartate
           family].
          Length = 351

 Score = 35.7 bits (83), Expect = 0.023
 Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 11/61 (18%)

Query: 7   PRFST-DPETVERQLVTS--IEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLI 63
           P  +  D    ++ L+    IE+           ++G S GG  A  +AI YP+RV+ ++
Sbjct: 105 PLITIRDDVKAQKLLLDHLGIEQIA--------AVVGGSMGGMQALEWAIDYPERVRAIV 156

Query: 64  L 64
           +
Sbjct: 157 V 157


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 34.0 bits (78), Expect = 0.024
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEED 282
                   E+  ++EEE+KKEEEK++EE+
Sbjct: 68  AAAAAAAEEKAEEKEEEKKKEEEKEEEEE 96



 Score = 34.0 bits (78), Expect = 0.026
 Identities = 14/41 (34%), Positives = 23/41 (56%)

Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           +K  +    A          EE+ +E+EE KK+EE+K+EE+
Sbjct: 55  KKAAVAAAAAAPAAAAAAAAEEKAEEKEEEKKKEEEKEEEE 95



 Score = 33.2 bits (76), Expect = 0.037
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
                   E + E KEEE++K+EE++++EE+ 
Sbjct: 67  AAAAAAAAEEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 31.3 bits (71), Expect = 0.21
 Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 239 LSDEKLDIVTTKAVKPPKE--PQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
           L +  +D    KA        P        EE+  +KEEEKKKEE+ + ++++A
Sbjct: 45  LEEVNIDEAIKKAAVAAAAAAPAAAAAAAAEEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 29.4 bits (66), Expect = 0.88
 Identities = 10/33 (30%), Positives = 21/33 (63%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
           A       +E  EE +EE+++++E+E+++EE  
Sbjct: 67  AAAAAAAAEEKAEEKEEEKKKEEEKEEEEEEAL 99


>gnl|CDD|214006 cd12807, Esterase_713, Novel bacterial esterase 713 that cleaves
           esters on halogenated cyclic compounds.  This family
           contains proteins similar to a novel bacterial esterase
           (esterase 713) with the alpha/beta hydrolase fold that
           cleaves esters on halogenated cyclic compounds. This
           Alcaligenes esterase, however, does not contain the
           GXSXXG pentapeptide around the active site serine
           residue as seen in other esterase families. This enzyme
           is active as a dimer though its natural substrate is
           unknown. It has two distinct disulfide bridges; one
           formed between adjacent cysteines appears to facilitate
           the correct formation of the oxyanion cleft in the
           catalytic site. Esterase 713 also resembles human
           pancreatic lipase in its location of the acidic residue
           of the catalytic triad. It is possibly exported from the
           cytosol to the periplasmic space. A large majority of
           sequences in this family have yet to be characterized.
          Length = 315

 Score = 35.4 bits (82), Expect = 0.031
 Identities = 23/77 (29%), Positives = 32/77 (41%), Gaps = 5/77 (6%)

Query: 32  KLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDPQKA--SKIPLWARM 89
           KL   +LLGHS  G      A+  P  VK ++  +P   P      Q    +KIP+   +
Sbjct: 187 KLGGAVLLGHSQSGPFPLEAALLRPAGVKGIVSVEPGCCP-APTADQIKTLAKIPILV-V 244

Query: 90  IG-NLYKNFNPLWPVRF 105
            G +L     P W  R 
Sbjct: 245 FGDHLDAASGPTWQQRL 261


>gnl|CDD|216102 pfam00756, Esterase, Putative esterase.  This family contains
           Esterase D. However it is not clear if all members of
           the family have the same function. This family is
           related to the pfam00135 family.
          Length = 245

 Score = 34.4 bits (79), Expect = 0.055
 Identities = 12/43 (27%), Positives = 15/43 (34%), Gaps = 8/43 (18%)

Query: 38  LLGHSFGGYLAFAYAIQYPDRVKHLIL--------ADPWGFPQ 72
           L G S GG  A   A++YPD    +             WG   
Sbjct: 113 LAGQSMGGLGALYLALKYPDLFGSVSSFSPILNPSNSMWGPED 155


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 34.8 bits (80), Expect = 0.064
 Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 1/50 (2%)

Query: 241 DEKLDIVTTKAVKPPKEP-QEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
           ++KL   T K  +   E     E E K +EE+ K+EE +K++  Q  +DK
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424



 Score = 29.8 bits (67), Expect = 2.0
 Identities = 14/61 (22%), Positives = 30/61 (49%)

Query: 231 KMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
           K  N       +KL  V  +     +   + +EE  ++EE +K+++++ +ED + +Q   
Sbjct: 372 KEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431

Query: 291 R 291
           R
Sbjct: 432 R 432



 Score = 29.8 bits (67), Expect = 2.4
 Identities = 21/101 (20%), Positives = 49/101 (48%), Gaps = 9/101 (8%)

Query: 190 WVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNKMVNDTCTLSDEKLDIVTT 249
           W+ +  G+  KEA +    +    +T     + A    ++ ++ ++T  LS +K   +  
Sbjct: 321 WIYSGRGE-FKEAINIARNLDDNDLT-----LLA-LTKLYEEVKSNT-DLSGDKRQELLK 372

Query: 250 KAVKPPKEPQEPEEEVKEE-EERKKEEEKKKEEDGQHQQDK 289
           +  K  ++  +   EVK+E +  ++ E K KEE  + ++++
Sbjct: 373 EYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 34.5 bits (80), Expect = 0.073
 Identities = 17/39 (43%), Positives = 27/39 (69%)

Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
            K  +  +E +E +EE KEEEE + EEEK++EE+ + +Q
Sbjct: 438 KKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQ 476



 Score = 32.6 bits (75), Expect = 0.28
 Identities = 15/45 (33%), Positives = 27/45 (60%)

Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQ 286
           +K      K  +  +E ++ EEE +EEEE  +EE++++EE  + Q
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQ 476



 Score = 32.6 bits (75), Expect = 0.32
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
             KA    K+ +E EEE +++EE K+EEE++ EE+
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 31.0 bits (71), Expect = 1.0
 Identities = 18/54 (33%), Positives = 32/54 (59%)

Query: 241 DEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
           +EK +          KE +E EE+ K+EEE+++EEE+ +EE  + ++ K +  T
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478



 Score = 29.1 bits (66), Expect = 3.6
 Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 2/56 (3%)

Query: 239 LSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
              +K      K V+  K  ++ EEE KE++++    +KK+EE+ + ++ K     
Sbjct: 403 TGSKKATKKIKKIVE--KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456



 Score = 29.1 bits (66), Expect = 3.7
 Identities = 13/45 (28%), Positives = 24/45 (53%)

Query: 239 LSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
           L++E+++ +T       K  +  E+  K+ EE KKE++KK     
Sbjct: 394 LTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGK 438


>gnl|CDD|234678 PRK00175, metX, homoserine O-acetyltransferase; Provisional.
          Length = 379

 Score = 34.4 bits (80), Expect = 0.076
 Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 19/72 (26%)

Query: 10  STDPETVE----RQLVTSIEEWRKELKLQEMIL-----------LGHSFGGYLAFAYAIQ 54
           S +P+T +       V +I +W   ++ Q  +L           +G S GG  A  +AI 
Sbjct: 111 SINPDTGKPYGSDFPVITIRDW---VRAQARLLDALGITRLAAVVGGSMGGMQALEWAID 167

Query: 55  YPDRVKH-LILA 65
           YPDRV+  L++A
Sbjct: 168 YPDRVRSALVIA 179


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 33.5 bits (77), Expect = 0.079
 Identities = 16/41 (39%), Positives = 25/41 (60%)

Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           +KL+       K  +E +E  E+ KEEEERK+ EE+ ++E 
Sbjct: 32  KKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQ 72



 Score = 28.9 bits (65), Expect = 2.8
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
           K  ++ E E KEEEE ++E EKKKEE+ + ++++  
Sbjct: 33  KLEEKREGERKEEEELEEEREKKKEEEERKEREEQA 68



 Score = 28.9 bits (65), Expect = 2.9
 Identities = 12/36 (33%), Positives = 24/36 (66%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            E +   E  +EEE  ++ E+KK+EE+ + ++++AR
Sbjct: 34  LEEKREGERKEEEELEEEREKKKEEEERKEREEQAR 69



 Score = 27.7 bits (62), Expect = 6.3
 Identities = 12/36 (33%), Positives = 26/36 (72%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
           +E +E E E +++ E K+E E+K+EE+ + +++K +
Sbjct: 21  REAEEEEREERKKLEEKREGERKEEEELEEEREKKK 56


>gnl|CDD|225856 COG3319, COG3319, Thioesterase domains of type I polyketide
           synthases or non-ribosomal peptide synthetases
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 257

 Score = 34.0 bits (78), Expect = 0.080
 Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 3/35 (8%)

Query: 37  ILLGHSFGGYLAFAYAIQYPDR---VKHLILADPW 68
           +LLG S GG +AF  A Q   +   V  L L D  
Sbjct: 68  VLLGWSLGGAVAFEVAAQLEAQGEEVAFLGLLDAV 102


>gnl|CDD|214005 cd12806, Esterase_713_like, Novel bacterial esterase that cleaves
           esters on halogenated cyclic compounds.  This family
           contains proteins similar to a novel bacterial esterase
           (Alcaligenes esterase 713) with the alpha/beta hydrolase
           fold but does not contain the GXSXXG pentapeptide around
           the active site serine residue as commonly seen in other
           enzymes of this class. Esterase 713 shows negligible
           sequence homology to other esterase and lipase enzymes.
           It is active as a dimer and cleaves esters on
           halogenated cyclic compounds though its natural
           substrate is unknown. This enzyme is possibly exported
           from the cytosol to the periplasmic space. A large
           majority of sequences in this family have yet to be
           characterized.
          Length = 261

 Score = 34.0 bits (78), Expect = 0.081
 Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 32  KLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDPQKASKIPLWARMIG 91
           KL   +LL HS  G   F  A   P  +K ++  +P G P K  D +  + IP+   ++ 
Sbjct: 138 KLDPTVLLTHSQSGIFGFQTAAMRPKGIKAIVAVEPGGCP-KPEDVKPLTSIPVL--VVY 194

Query: 92  NLYKNFNPLWPVR 104
             +    P W VR
Sbjct: 195 GDHIEEFPRWAVR 207


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 34.0 bits (77), Expect = 0.14
 Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 2/44 (4%)

Query: 252  VKPPKEPQEPEEEVKEEEERKKEEE--KKKEEDGQHQQDKARSV 293
            +K  +E ++ EE+ K+ EE KK EE  KK  E  + + ++A+  
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA 1704



 Score = 32.4 bits (73), Expect = 0.38
 Identities = 17/53 (32%), Positives = 29/53 (54%), Gaps = 3/53 (5%)

Query: 242  EKLDIVTTKAVKPPKEPQEPEEEVK---EEEERKKEEEKKKEEDGQHQQDKAR 291
            E+L     +  K  +E ++ EEE K    EE +K EE+KKK E+ +  ++  +
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688



 Score = 32.4 bits (73), Expect = 0.44
 Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 1/73 (1%)

Query: 220  HVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKK 279
            HV  D     +    D     D + D  T +A    +E ++ E   K EE RK EE KKK
Sbjct: 1065 HVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETG-KAEEARKAEEAKKK 1123

Query: 280  EEDGQHQQDKARS 292
             ED +  ++  ++
Sbjct: 1124 AEDARKAEEARKA 1136



 Score = 32.0 bits (72), Expect = 0.51
 Identities = 13/50 (26%), Positives = 23/50 (46%)

Query: 242  EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            +K         +   +  E  EE  E  E+KKEE KKK +  + + ++ +
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391



 Score = 31.6 bits (71), Expect = 0.78
 Identities = 12/43 (27%), Positives = 24/43 (55%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
            +A K   E ++  E  K+ +E KK EE KK ++ +  ++  ++
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKA 1536



 Score = 31.6 bits (71), Expect = 0.79
 Identities = 13/40 (32%), Positives = 25/40 (62%)

Query: 252  VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
             K   E ++ EE+ K +E +KK EE KK ++ + + ++A+
Sbjct: 1289 KKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328



 Score = 31.3 bits (70), Expect = 0.92
 Identities = 15/43 (34%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 250  KAVKPPKEPQEPEE-EVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            +A K  +E ++ +E + K EE +K +E KKK E+ + + D+A+
Sbjct: 1461 EAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAK 1503



 Score = 31.3 bits (70), Expect = 1.0
 Identities = 17/39 (43%), Positives = 21/39 (53%)

Query: 252  VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
             K   E ++ EE  K EE++K EE KK EED      KA
Sbjct: 1545 KKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKA 1583



 Score = 30.9 bits (69), Expect = 1.1
 Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 2/54 (3%)

Query: 240  SDEKLDIVTTKAVKPPKEPQEPEEEVK--EEEERKKEEEKKKEEDGQHQQDKAR 291
            ++E+  I   +  K  +E ++  EE K  EE+E+K  E  KKE +   + ++ +
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELK 1708



 Score = 30.5 bits (68), Expect = 1.4
 Identities = 14/40 (35%), Positives = 24/40 (60%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
            K +K  +  +  E ++K EE +K EEEKKK E  + ++ +
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644



 Score = 30.5 bits (68), Expect = 1.7
 Identities = 12/29 (41%), Positives = 21/29 (72%)

Query: 263  EEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            EE K+ +E KK EEKKK ++ + + ++A+
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAK 1315



 Score = 30.5 bits (68), Expect = 1.7
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 259  QEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            ++ EE  K +E +KK EE KK E+ + + ++A+
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470



 Score = 30.1 bits (67), Expect = 2.2
 Identities = 14/42 (33%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 251  AVKPPKEPQEPEEEVKEEEE-RKKEEEKKKEEDGQHQQDKAR 291
            A K   E ++  EE K+ +E +KK EE KK ++ + + ++A+
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457



 Score = 30.1 bits (67), Expect = 2.2
 Identities = 16/54 (29%), Positives = 29/54 (53%), Gaps = 4/54 (7%)

Query: 242  EKLDIVTTKAVKPPKEPQE----PEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            +K D    KA +  K+  E       + K +E +KK EEKKK ++ + + ++A+
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444



 Score = 29.7 bits (66), Expect = 2.5
 Identities = 13/41 (31%), Positives = 20/41 (48%)

Query: 242  EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
            E+  I     +   ++  + EE  K EE + K EE KK E+
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEE 1630



 Score = 29.7 bits (66), Expect = 2.7
 Identities = 14/42 (33%), Positives = 26/42 (61%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            +A K  +  ++ EE  K EE +KK EE KK ++ + + ++A+
Sbjct: 1442 EAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483



 Score = 29.7 bits (66), Expect = 3.0
 Identities = 15/38 (39%), Positives = 24/38 (63%)

Query: 252  VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
             K  +E ++ +E  K EE +K EE+KK EE  + ++DK
Sbjct: 1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576



 Score = 29.7 bits (66), Expect = 3.2
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 252  VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
            +K  +E ++ +E  K EE++K +E KKK E+ +
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAK 1315



 Score = 29.3 bits (65), Expect = 3.3
 Identities = 13/42 (30%), Positives = 26/42 (61%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            +A K  +  ++ EE  K +E +KK EE KK ++ + + ++A+
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496



 Score = 29.3 bits (65), Expect = 3.7
 Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 4/44 (9%)

Query: 253  KPPKEPQEPEEEVKEEEERKKEEE----KKKEEDGQHQQDKARS 292
            K  +  ++  EE K+ EE KK EE    K  EE  + ++DK ++
Sbjct: 1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677



 Score = 29.3 bits (65), Expect = 3.8
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            KA +  K     E +  EE+++K EE KK EED +   +  +
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695



 Score = 29.0 bits (64), Expect = 4.3
 Identities = 15/42 (35%), Positives = 26/42 (61%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            +A K  +E ++ EE  K+ EE KK +E KK+ +   + D+A+
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAK 1489



 Score = 29.0 bits (64), Expect = 4.7
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
            +A K   E ++ EE  K +E +K EE KK +E
Sbjct: 1507 EAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538



 Score = 29.0 bits (64), Expect = 4.9
 Identities = 16/55 (29%), Positives = 32/55 (58%), Gaps = 3/55 (5%)

Query: 242  EKLDIVTTKAVKPPKEPQEPEEEVK---EEEERKKEEEKKKEEDGQHQQDKARSV 293
            E+L     +  K  +E ++ EEE K   EE +++ EE+KKK E+ +  +++ + +
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759



 Score = 29.0 bits (64), Expect = 5.2
 Identities = 16/52 (30%), Positives = 27/52 (51%)

Query: 231  KMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
              + +   L +E+  +   +A K  +   + EE  K EEE+KK E+ KK+E 
Sbjct: 1592 ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEA 1643



 Score = 28.6 bits (63), Expect = 6.2
 Identities = 14/43 (32%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query: 250  KAVKPPKEPQEPEE-EVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            +A K  +E ++ +E + K EE +KK +E KK  + + + D+A+
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAK 1516



 Score = 28.6 bits (63), Expect = 6.5
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 262  EEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            E + K EE +K +E KKK E+ + + D A+
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAK 1335



 Score = 28.6 bits (63), Expect = 6.7
 Identities = 10/37 (27%), Positives = 20/37 (54%)

Query: 252  VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
             +  +E ++ E+  K E  +K EE KK  E+ +  ++
Sbjct: 1211 ERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE 1247



 Score = 28.2 bits (62), Expect = 8.0
 Identities = 14/37 (37%), Positives = 22/37 (59%)

Query: 253  KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
            K   E ++ EE+ K +E +K EE KK EE  + ++ K
Sbjct: 1534 KKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAK 1570



 Score = 28.2 bits (62), Expect = 8.1
 Identities = 13/52 (25%), Positives = 27/52 (51%)

Query: 242  EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
            E+        ++   +  E E+++K EE +K EE K K E+ +  +++ + V
Sbjct: 1584 EEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKV 1635



 Score = 28.2 bits (62), Expect = 8.4
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 259  QEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
            +  E++ K EE +K EEEKKK    + +++K
Sbjct: 1738 EAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768



 Score = 28.2 bits (62), Expect = 8.9
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
            +A K  ++ ++ EE  K E+ RK EE +K E+ 
Sbjct: 1119 EAKKKAEDARKAEEARKAEDARKAEEARKAEDA 1151



 Score = 28.2 bits (62), Expect = 8.9
 Identities = 15/45 (33%), Positives = 27/45 (60%), Gaps = 2/45 (4%)

Query: 250  KAVKPPKEPQEPE--EEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
            KA +  K+  E +  EE K+ +E KK EE KK ++ +  ++K ++
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA 1548



 Score = 28.2 bits (62), Expect = 9.5
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 253  KPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
            K   E ++ EE  K +E +K EE+KK +E
Sbjct: 1522 KKADEAKKAEEAKKADEAKKAEEKKKADE 1550



 Score = 28.2 bits (62), Expect = 9.6
 Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 2/37 (5%)

Query: 248  TTKAVKPPKEPQEPEEEVKEEEERKK--EEEKKKEED 282
              +A K     ++  EE K+ +E KK  EE+KKK ++
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409



 Score = 27.8 bits (61), Expect = 9.8
 Identities = 15/42 (35%), Positives = 26/42 (61%)

Query: 250  KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
            +A K  +E ++ +E  K+ EE KK EE KK+ +   + D+A+
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK 1476


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 33.8 bits (77), Expect = 0.16
 Identities = 14/29 (48%), Positives = 24/29 (82%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
           +E +E EEE +EEEE ++EEE+++EE+ +
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 33.4 bits (76), Expect = 0.21
 Identities = 13/31 (41%), Positives = 26/31 (83%)

Query: 257 EPQEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
           + +E EEE +EEEE ++EEE+++EE+ ++++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 30.7 bits (69), Expect = 1.2
 Identities = 13/29 (44%), Positives = 23/29 (79%)

Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
           E EEE +EEEE ++EEE+++EE+ +  ++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 30.7 bits (69), Expect = 1.2
 Identities = 12/30 (40%), Positives = 25/30 (83%)

Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
           + EEE +EEEE ++EEE+++EE+ + ++++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 30.0 bits (67), Expect = 2.1
 Identities = 13/32 (40%), Positives = 25/32 (78%)

Query: 262 EEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
           EEE +EEEE ++EEE+++EE+ + + ++  S+
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 32.8 bits (75), Expect = 0.20
 Identities = 9/49 (18%), Positives = 22/49 (44%)

Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
           E       +  KP K+P++P+ + K + + K + + + +      +  A
Sbjct: 78  EPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126



 Score = 31.7 bits (72), Expect = 0.39
 Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
           T     P   P +P+E+ K E++ KK + K K +
Sbjct: 71  TEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPK 104


>gnl|CDD|223477 COG0400, COG0400, Predicted esterase [General function prediction
           only].
          Length = 207

 Score = 32.3 bits (74), Expect = 0.21
 Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 2/43 (4%)

Query: 24  IEEW--RKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLIL 64
           +EE      +    +IL+G S G  +A +  +  P      IL
Sbjct: 87  LEELAEEYGIDSSRIILIGFSQGANIALSLGLTLPGLFAGAIL 129


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 32.4 bits (74), Expect = 0.22
 Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 262 EEEVKEEEERKKEEEKKKEED 282
           EE+ +EEEE+  EEEK +EE+
Sbjct: 160 EEKEREEEEKAAEEEKAREEE 180



 Score = 28.2 bits (63), Expect = 5.8
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 259 QEPEEEVKEEEERKKEEEKKKEED 282
           +E  EE + EEE K  EE+K  E+
Sbjct: 156 KERAEEKEREEEEKAAEEEKAREE 179



 Score = 27.8 bits (62), Expect = 7.3
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 266 KEEEERKKEEEKKKEEDGQHQQDKARS 292
           + EE+ ++EEEK  EE+   +++    
Sbjct: 158 RAEEKEREEEEKAAEEEKAREEEILTG 184


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 32.9 bits (75), Expect = 0.22
 Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 9/48 (18%)

Query: 253 KPPKEPQEPEEEVKEEEERK---------KEEEKKKEEDGQHQQDKAR 291
           K  ++P+E  ++ K +EE K         KE+EKK EE    +++K R
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166



 Score = 32.2 bits (73), Expect = 0.39
 Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
           K  +EP++ EEE K E  R K   KK  +     + K   
Sbjct: 152 KKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPP 191



 Score = 31.8 bits (72), Expect = 0.60
 Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 249 TKAVKPPKEPQEPEEEVKEE---EERKKEEEKKKEEDGQHQQDKARS 292
           TK  K PK     EEE ++E   EE+KK++EK KEE    +  +   
Sbjct: 92  TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138



 Score = 31.4 bits (71), Expect = 0.75
 Identities = 16/47 (34%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 248 TTKAVKPPKEPQE--PEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
             K  KP +E +E  P +E ++E+E+K EE + +EE+ + ++ +A+S
Sbjct: 127 EPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKS 173



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 13/46 (28%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
            K     +E +E +E+VKEE+++KKE+ K++ +D + +++      
Sbjct: 98  PKNESGKEEEKE-KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142



 Score = 29.9 bits (67), Expect = 2.0
 Identities = 9/40 (22%), Positives = 20/40 (50%)

Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
              A K     +   E  KEEE+ K++ +++K++  +  +
Sbjct: 86  KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 33.1 bits (76), Expect = 0.22
 Identities = 18/40 (45%), Positives = 28/40 (70%)

Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
           ++L +   +  +PP  P EPEEE +EEEE  +EEE+++EE
Sbjct: 286 DRLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEE 325



 Score = 29.2 bits (66), Expect = 3.3
 Identities = 15/40 (37%), Positives = 27/40 (67%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
           PP  P+  EEE +EEE  ++EEE+++EE+   +++   +V
Sbjct: 299 PPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVRATV 338


>gnl|CDD|211851 TIGR03611, RutD, pyrimidine utilization protein D.  This protein is
           observed in operons extremely similar to that
           characterized in E. coli K-12 responsible for the import
           and catabolism of pyrimidines, primarily uracil. This
           protein is a member of the hydrolase, alpha/beta fold
           family defined by pfam00067.
          Length = 248

 Score = 32.4 bits (74), Expect = 0.23
 Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 4/47 (8%)

Query: 31  LKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDP 77
           L ++    +GH+ GG +    A+ YP+R+  L+L + W       DP
Sbjct: 76  LGIERFHFVGHALGGLIGLQLALDYPERLTSLVLINGW----SRPDP 118


>gnl|CDD|181553 PRK08775, PRK08775, homoserine O-acetyltransferase; Provisional.
          Length = 343

 Score = 32.8 bits (75), Expect = 0.24
 Identities = 8/26 (30%), Positives = 17/26 (65%)

Query: 40  GHSFGGYLAFAYAIQYPDRVKHLILA 65
           G+S+G  +   +A ++P RV+ L++ 
Sbjct: 144 GYSYGALVGLQFASRHPARVRTLVVV 169


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 32.5 bits (74), Expect = 0.27
 Identities = 11/38 (28%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
           K  ++ +E +++   E+ER K+ EK++    Q Q+ +A
Sbjct: 84  KEQQQAEELQQKQAAEQERLKQLEKERLAA-QEQKKQA 120


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 32.7 bits (74), Expect = 0.27
 Identities = 13/52 (25%), Positives = 31/52 (59%)

Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
           EKL +   +  +  K P++ +    E++ +K ++++KKE++ +  +DK + V
Sbjct: 171 EKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222



 Score = 30.8 bits (69), Expect = 1.0
 Identities = 11/39 (28%), Positives = 27/39 (69%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
           K+ ++P+++ K+E+E++++++KKKE +G      A   +
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDS 235


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 30.3 bits (69), Expect = 0.28
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 261 PEEEVKEEEERKKEEEKKKEEDG 283
                   EE KKEEE+++EED 
Sbjct: 59  AAAAAAAAEEEKKEEEEEEEEDD 81



 Score = 29.5 bits (67), Expect = 0.54
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEE 276
           A           EE K+EEE ++EE+
Sbjct: 55  AAAAAAAAAAAAEEEKKEEEEEEEED 80



 Score = 28.7 bits (65), Expect = 0.97
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEE 276
           A          EEE KEEEE ++E++
Sbjct: 56  AAAAAAAAAAAEEEKKEEEEEEEEDD 81



 Score = 28.0 bits (63), Expect = 2.3
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 258 PQEPEEEVKEEEERKKEEEKKKEED 282
                     EEE+K+EEE+++E+D
Sbjct: 57  AAAAAAAAAAEEEKKEEEEEEEEDD 81


>gnl|CDD|221019 pfam11187, DUF2974, Protein of unknown function (DUF2974).  This
           bacterial family of proteins has no known function.
          Length = 224

 Score = 32.2 bits (74), Expect = 0.30
 Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 4/44 (9%)

Query: 37  ILLGHSFGGYLAFAYAIQ----YPDRVKHLILADPWGFPQKSID 76
            L GHS GG LA   A+       DR+K +   D  GFP++ I+
Sbjct: 87  YLGGHSKGGNLAVYAAMNAEPDLQDRIKKIYSFDGPGFPKEVIE 130


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.7 bits (74), Expect = 0.32
 Identities = 13/35 (37%), Positives = 26/35 (74%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
           K  +E E++ +EE ER+KE+EK++E + + + ++A
Sbjct: 587 KAKREAEQKAREEREREKEKEKEREREREREAERA 621



 Score = 28.1 bits (62), Expect = 8.7
 Identities = 12/39 (30%), Positives = 26/39 (66%)

Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
           K  K+ +E  E+ K E E+K  EE+++E++ + ++++ R
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKERERER 614


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 29.6 bits (67), Expect = 0.33
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEE 281
           KE +  EE+ +EEEE +K EE+K+ E
Sbjct: 27  KELKAEEEKREEEEEARKREERKERE 52



 Score = 28.8 bits (65), Expect = 0.76
 Identities = 10/32 (31%), Positives = 25/32 (78%)

Query: 262 EEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
           ++E+K EEE+++EEE+ ++ + + +++K +S 
Sbjct: 26  KKELKAEEEKREEEEEARKREERKEREKNKSF 57



 Score = 26.5 bits (59), Expect = 4.6
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 259 QEPEEEVKEEEERKKEEEKKKEE 281
           +E EE  K EE +++E+ K  EE
Sbjct: 37  EEEEEARKREERKEREKNKSFEE 59


>gnl|CDD|165193 PHA02857, PHA02857, monoglyceride lipase; Provisional.
          Length = 276

 Score = 32.2 bits (73), Expect = 0.35
 Identities = 40/235 (17%), Positives = 89/235 (37%), Gaps = 34/235 (14%)

Query: 2   GRSSRPRFS-TDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
           GRS+  +    D     R +V  +   +       + LLGHS G  ++   A + P+   
Sbjct: 64  GRSNGEKMMIDDFGVYVRDVVQHVVTIKSTYPGVPVFLLGHSMGATISILAAYKNPNLFT 123

Query: 61  HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPVRFVGPLGQWVVEKMRPD 120
            +IL  P       ++ +   ++ L A  +          +P + VG        K+ P+
Sbjct: 124 AMILMSPL------VNAEAVPRLNLLAAKL------MGIFYPNKIVG--------KLCPE 163

Query: 121 LPKKFTPVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEGLGYAKRPMLHRVDQLAAHVP 180
                   +  D   + +Y +   V     ++ F +         R ++ ++       P
Sbjct: 164 S-------VSRDMDEVYKYQYDPLVNHEKIKAGFASQVLKATNKVRKIIPKIK-----TP 211

Query: 181 VTVIYGSRSWVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNKMVND 235
           + ++ G+ + + + SG      +  N   ++K   GA HH++ +  +V   ++ +
Sbjct: 212 ILILQGTNNEISDVSGAYYF-MQHANCNREIKIYEGAKHHLHKETDEVKKSVMKE 265


>gnl|CDD|225256 COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic
           ion transport and metabolism].
          Length = 299

 Score = 31.6 bits (72), Expect = 0.44
 Identities = 10/37 (27%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 37  ILLGHSFGGYLAFAYAIQYPDRVKHLILADP--WGFP 71
           +L G S GG ++    +++P+R  H++      W  P
Sbjct: 180 VLAGDSLGGLVSLYAGLRHPERFGHVLSQSGSFWWTP 216


>gnl|CDD|130253 TIGR01185, devC, DevC protein.  This model describes a predicted
           membrane subunit, DevC, of an ABC transporter known so
           far from two species of cyanobacteria. Some experimental
           data from mutational analysis suggest that this protein
           along with DevA and DevB encoded in the same operon may
           be involved in the transport/export of glycolipids
           [Transport and binding proteins, Other].
          Length = 380

 Score = 31.8 bits (72), Expect = 0.44
 Identities = 15/44 (34%), Positives = 22/44 (50%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFG 44
           F RSSRP +    +  E+    S E  R+ + +  +  LG SFG
Sbjct: 140 FDRSSRPEYGPIAKNFEQGQSISAEVRRRRINVVGLFTLGASFG 183


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 32.0 bits (73), Expect = 0.44
 Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 7/61 (11%)

Query: 220 HVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKK 279
            +  +    F++  N   T S       T K  K  K  +  ++  K+++++K+++E K 
Sbjct: 33  VLSKEILSTFSEEENKVATTS-------TKKDKKEDKNNESKKKSEKKKKKKKEKKEPKS 85

Query: 280 E 280
           E
Sbjct: 86  E 86


>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
           co-repressor CIR.  This is a 45 residue conserved region
           at the N-terminal end of a family of proteins referred
           to as CIRs (CBF1-interacting co-repressors). CBF1
           (centromere-binding factor 1) acts as a transcription
           factor that causes repression by binding specifically to
           GTGGGAA motifs in responsive promoters, and it requires
           CIR as a co-repressor. CIR binds to histone deacetylase
           and to SAP30 and serves as a linker between CBF1 and the
           histone deacetylase complex.
          Length = 37

 Score = 28.4 bits (64), Expect = 0.46
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 262 EEEVKEEEERKKEEEKKKE 280
           E E K  EE+KK EE +KE
Sbjct: 15  EAEQKALEEQKKIEELRKE 33


>gnl|CDD|220735 pfam10408, Ufd2P_core, Ubiquitin elongating factor core.  This is
           the most conserved part of the core region of Ufd2P
           ubiquitin elongating factor or E4, running from helix
           alpha-11 to alpha-38. It consists of 31 helices of
           variable length connected by loops of variable size
           forming a compact unit; the helical packing pattern of
           the compact unit consists of five structural repeats
           that resemble tandem Armadillo (ARM) repeats. This
           domain is involved in ubiquitination as it binds Cdc48p
           and escorts ubiquitinated proteins from Cdc48p to the
           proteasome for degradation. The core is structurally
           similar to the nuclear transporter protein
           importin-alpha. The core is associated with the U-box at
           the C-terminus, pfam04564, which has ligase activity.
          Length = 625

 Score = 31.9 bits (73), Expect = 0.53
 Identities = 21/81 (25%), Positives = 42/81 (51%), Gaps = 7/81 (8%)

Query: 230 NKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
           N ++ND   L DE L     K +K   + Q+  ++  E E   +EE +++E + +  + +
Sbjct: 439 NLLLNDATYLLDESLS--YLKEIK---QLQKELQDRAEWESLSQEEREERESELRQLERQ 493

Query: 290 ARSVTFMTLGIVMVSMMILMT 310
           A+S  ++ L    V ++ L+T
Sbjct: 494 AKS--YLQLANETVKLLKLLT 512


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 30.1 bits (68), Expect = 0.54
 Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 7/62 (11%)

Query: 238 TLSDEKLDIVTTKAVKPPKEPQEPEEEVKEE---EERKKEEEKKKE----EDGQHQQDKA 290
            +++   +    +     +E Q  EEE  +E   EE + EEE+ KE    E  +  Q+K 
Sbjct: 33  EIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKALQEKR 92

Query: 291 RS 292
           ++
Sbjct: 93  KA 94



 Score = 26.6 bits (59), Expect = 7.2
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
               +    E Q  EEE  E EE++  EE+  +E 
Sbjct: 30  VKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEG 64


>gnl|CDD|225749 COG3208, GrsT, Predicted thioesterase involved in non-ribosomal
           peptide biosynthesis [Secondary metabolites
           biosynthesis, transport, and catabolism].
          Length = 244

 Score = 31.5 bits (72), Expect = 0.55
 Identities = 18/73 (24%), Positives = 24/73 (32%), Gaps = 14/73 (19%)

Query: 2   GRSSRPR--FSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRV 59
           GR  R      TD E++  +L   +     +       L GHS G  LAF  A       
Sbjct: 43  GRGDRFGEPLLTDIESLADELANELLPPLLD---APFALFGHSMGAMLAFEVA------- 92

Query: 60  KHLILADPWGFPQ 72
               L      P+
Sbjct: 93  --RRLERAGLPPR 103


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 31.6 bits (72), Expect = 0.56
 Identities = 10/61 (16%), Positives = 26/61 (42%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMTLLTMF 315
           K     E + +E++E    ++ ++ +D   +  KAR      L   +    + +  +++ 
Sbjct: 64  KIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDEVLEQLNQALNESSLALDFVSLL 123

Query: 316 L 316
           L
Sbjct: 124 L 124


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 31.6 bits (72), Expect = 0.63
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
           P    +  E+E +EEE+R  E E+++ ED + +  ++R 
Sbjct: 364 PIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSRE 402


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 30.8 bits (70), Expect = 0.67
 Identities = 34/153 (22%), Positives = 55/153 (35%), Gaps = 38/153 (24%)

Query: 186 GSRSWVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVY--ADRAD-----VFNKMV--NDT 236
           G+ + VD+++    K+      F+ V   T +G+  Y   DRA       F   V   D 
Sbjct: 59  GNATLVDDATDASGKQ------FITV--TTKSGNTFYIVIDRAAEGENVHFLNQVDEADL 110

Query: 237 CTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFM 296
             L +E+     T +  P        E    E E + E E++ E+               
Sbjct: 111 LALMEEEDAAAPTCSCCPVCSV-NMTECTGPEPEPEPEPEEEPEKKS------------- 156

Query: 297 TLGIVMVSMMILMTLLTMFLVLFGLGYYFSYLK 329
                   M  L+ +L + L+  G  YYF + K
Sbjct: 157 -------GMGPLLLVLAVALIGGGAYYYFKFYK 182


>gnl|CDD|223700 COG0627, COG0627, Predicted esterase [General function prediction
           only].
          Length = 316

 Score = 31.3 bits (71), Expect = 0.72
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 40  GHSFGGYLAFAYAIQYPDRVKHL 62
           GHS GGY A   A+++PDR K  
Sbjct: 158 GHSMGGYGALKLALKHPDRFKSA 180


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 29.2 bits (66), Expect = 0.76
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKE 280
           +PP EP E   E +E E+R+K+E+ +K+
Sbjct: 53  EPPPEPTETWLEKREREKREKKEKLEKK 80


>gnl|CDD|131294 TIGR02240, PHA_depoly_arom, poly(3-hydroxyalkanoate) depolymerase. 
           This family consists of the polyhydroxyalkanoic acid
           (PHA) depolymerase of Pseudomonas oleovorans,
           Pseudomonas putida BM01, and related species. This
           enzyme is part of polyester storage and mobilization
           system as in many bacteria. However, species containing
           this enzyme are unusual in their capacity to produce
           aromatic polyesters when grown on carbon sources such as
           benzoic acid or phenylacetic acid [Energy metabolism,
           Other].
          Length = 276

 Score = 30.7 bits (69), Expect = 0.81
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 39  LGHSFGGYLAFAYAIQYPDRVKHLILA 65
           +G S+GG LA  +A  YP+R K LILA
Sbjct: 96  IGVSWGGALAQQFAHDYPERCKKLILA 122


>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7.  TMA7 plays
           a role in protein translation. Deletions of the TMA7
           gene results in altered protein synthesis rates.
          Length = 63

 Score = 28.7 bits (64), Expect = 0.83
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEE 281
           P K+ +E +EE    ++++KEE K  +E
Sbjct: 14  PKKQKKELDEEDLAFKQKQKEEAKALKE 41


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 31.1 bits (71), Expect = 0.90
 Identities = 14/65 (21%), Positives = 37/65 (56%), Gaps = 10/65 (15%)

Query: 223 ADRADVF------NKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEE 276
           A R D F      +++  +     +++++ +  K  KPPK+ +E ++  K ++++K++++
Sbjct: 349 AARVDYFSGRYIGDELKEEL----NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKK 404

Query: 277 KKKEE 281
            KK +
Sbjct: 405 GKKRK 409


>gnl|CDD|190521 pfam03096, Ndr, Ndr family.  This family consists of proteins from
           different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3.
           Their similarity was previously noted. The precise
           molecular and cellular function of members of this
           family is still unknown. Yet, they are known to be
           involved in cellular differentiation events. The Ndr1
           group was the first to be discovered. Their expression
           is repressed by the proto-oncogenes N-myc and c-myc, and
           in line with this observation, Ndr1 protein expression
           is down-regulated in neoplastic cells, and is
           reactivated when differentiation is induced by chemicals
           such as retinoic acid. Ndr2 and Ndr3 expression is not
           under the control of N-myc or c-myc. Ndr1 expression is
           also activated by several chemicals: tunicamycin and
           homocysteine induce Ndr1 in human umbilical endothelial
           cells; nickel induces Ndr1 in several cell types.
           Members of this family are found in wide variety of
           multicellular eukaryotes, including an Ndr1 type protein
           in Helianthus annuus (sunflower), known as Sf21.
           Interestingly, the highest scoring matches in the noise
           are all alpha/beta hydrolases pfam00561, suggesting that
           this family may have an enzymatic function (Bateman A
           pers. obs.).
          Length = 284

 Score = 30.8 bits (70), Expect = 0.91
 Identities = 14/49 (28%), Positives = 27/49 (55%)

Query: 19  QLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADP 67
            L   I       +L+ +I +G   G Y+   +A+++P+RV+ L+L +P
Sbjct: 84  DLADMIPVVLDHFRLKSVIGMGVGAGAYILARFALKHPERVEGLVLINP 132


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 29.2 bits (66), Expect = 1.0
 Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 7/79 (8%)

Query: 211 VKSVTGA-GHHVYADRAD-VFNKM----VNDTCTLSDEKLDIVTTKAVKP-PKEPQEPEE 263
           VK + G+ G  V  ++ + V +++    V +      EKL  V   A    P        
Sbjct: 23  VKKILGSVGVEVDDEKLNKVISELEGKDVEELIAAGKEKLASVPAGAGGAAPAAAAAAAA 82

Query: 264 EVKEEEERKKEEEKKKEED 282
               ++E KKEE +++ +D
Sbjct: 83  AAAAKKEEKKEESEEESDD 101


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 30.6 bits (69), Expect = 1.1
 Identities = 11/40 (27%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 253 KPPKEPQEPEEEVKEEEERKKE-EEKKKEEDGQHQQDKAR 291
           K  ++ +E E++   E+ R+KE E++   E    Q ++A 
Sbjct: 72  KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAA 111



 Score = 30.2 bits (68), Expect = 1.7
 Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 4/46 (8%)

Query: 250 KAVKPPKEPQEPEEEVKEEEERKK----EEEKKKEEDGQHQQDKAR 291
                 K+ Q  E + K+  E K     E EKK +E+ + Q ++  
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEA 156



 Score = 29.4 bits (66), Expect = 2.8
 Identities = 7/32 (21%), Positives = 15/32 (46%)

Query: 259 QEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
              +   + E+  K+ EEK+K+ +    +  A
Sbjct: 99  AAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAA 130



 Score = 28.6 bits (64), Expect = 4.4
 Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 4/46 (8%)

Query: 250 KAVKPPKEPQEPEEEVKEEEE--RKKEEEKKKE--EDGQHQQDKAR 291
           K     K   E E E K +EE  ++ EEE K +   + + +  +A+
Sbjct: 127 KQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAK 172



 Score = 28.6 bits (64), Expect = 5.1
 Identities = 10/49 (20%), Positives = 27/49 (55%), Gaps = 5/49 (10%)

Query: 247 VTTKAVKPPKEPQEPEEEVKE-----EEERKKEEEKKKEEDGQHQQDKA 290
           +  +     K+ QE ++++++     E++R  E+ ++KE + +   +KA
Sbjct: 55  IQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKA 103



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 7/29 (24%), Positives = 16/29 (55%)

Query: 259 QEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
           +E E+    E+  K+ E+  K+ + + +Q
Sbjct: 92  KELEQRAAAEKAAKQAEQAAKQAEEKQKQ 120



 Score = 28.3 bits (63), Expect = 6.3
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
            K  ++ E+  K+ EE++K+ E+ K +     + KA
Sbjct: 101 EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136



 Score = 27.9 bits (62), Expect = 8.3
 Identities = 8/44 (18%), Positives = 23/44 (52%)

Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
             +A +  ++ +  +    E+  ++ E+  K+ E+ Q Q ++A+
Sbjct: 82  KQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK 125


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.0 bits (68), Expect = 1.3
 Identities = 10/44 (22%), Positives = 24/44 (54%)

Query: 241 DEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
           DE+ +    K     K  +  + +++E+E+ K+E+E+K   + +
Sbjct: 42  DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 30.7 bits (69), Expect = 1.3
 Identities = 13/29 (44%), Positives = 23/29 (79%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
           +E +  EEE +EEEE ++EE++ +EE+G+
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGE 468



 Score = 28.7 bits (64), Expect = 5.9
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
           +  +E +E EEE +EEE+  +EEE + EE+ +  + 
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEA 478



 Score = 28.0 bits (62), Expect = 10.0
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
           +T    P    QE EEE   EEE ++EEE+++EE    +++
Sbjct: 426 STSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEE 466


>gnl|CDD|164765 MTH00222, ATP9, ATP synthase F0 subunit 9; Provisional.
          Length = 77

 Score = 28.3 bits (63), Expect = 1.4
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 289 KARSVTFMTLGIVMVSMMILMTLLTMFLVLFGL 321
           K +  T+  LG  +   M L  L+  FL+LF L
Sbjct: 45  KQQLFTYAILGFAISEAMGLFCLMMAFLILFAL 77


>gnl|CDD|205219 pfam13038, DUF3899, Domain of unknown function (DUF3899).  Putative
           Tryptophanyl-tRNA synthetase.
          Length = 92

 Score = 28.4 bits (64), Expect = 1.4
 Identities = 11/51 (21%), Positives = 26/51 (50%)

Query: 266 KEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMTLLTMFL 316
            E++E+   EE +   D + ++ K    T+    ++   ++IL ++L  F+
Sbjct: 42  VEKKEKDSIEEDEFANDKKVKKKKYSVFTWTAPLLLSGLLLILSSILLSFI 92


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 30.3 bits (68), Expect = 1.5
 Identities = 8/31 (25%), Positives = 18/31 (58%)

Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
                    + E++++E+ ++ EE+K EE+G
Sbjct: 302 DKLSPEIPAKPEIEQDEDSEESEEEKNEEEG 332



 Score = 30.3 bits (68), Expect = 1.5
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
           +  K   E     E  ++E+  + EEEK +EE G  ++ K
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGK 339



 Score = 28.4 bits (63), Expect = 6.4
 Identities = 6/32 (18%), Positives = 12/32 (37%)

Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
               + +E E++   E   K E  Q +  +  
Sbjct: 292 ASGNDPEEREDKLSPEIPAKPEIEQDEDSEES 323


>gnl|CDD|177659 PLN00021, PLN00021, chlorophyllase.
          Length = 313

 Score = 30.0 bits (68), Expect = 1.5
 Identities = 14/35 (40%), Positives = 15/35 (42%), Gaps = 5/35 (14%)

Query: 38  LLGHSFGGYLAFAYAIQYPDRVKH-----LILADP 67
           L GHS GG  AFA A+             LI  DP
Sbjct: 130 LAGHSRGGKTAFALALGKAAVSLPLKFSALIGLDP 164


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 29.5 bits (67), Expect = 1.6
 Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 1/37 (2%)

Query: 245 DIVTTKAVKPPKEPQ-EPEEEVKEEEERKKEEEKKKE 280
           DI+  K + PPK P  + EEE   E   ++  E   E
Sbjct: 5   DILRKKGILPPKPPSPKEEEEEALELAIQEAHENALE 41


>gnl|CDD|215859 pfam00326, Peptidase_S9, Prolyl oligopeptidase family. 
          Length = 212

 Score = 29.9 bits (68), Expect = 1.6
 Identities = 13/52 (25%), Positives = 20/52 (38%), Gaps = 2/52 (3%)

Query: 18 RQLVTSIEEWRKEL--KLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADP 67
             + + E    +       + + G S+GGYL  A   Q PD  K  +   P
Sbjct: 46 DDFIAAAEYLIAQGYVDPDRLAIWGGSYGGYLTGAALNQRPDLFKAAVAVVP 97


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 30.3 bits (68), Expect = 1.6
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTF 295
            E ++     +E E +KK+EEK KE++ +  +   +    
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKA 45


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 29.2 bits (66), Expect = 1.7
 Identities = 9/20 (45%), Positives = 17/20 (85%)

Query: 262 EEEVKEEEERKKEEEKKKEE 281
           EE+ +E+E  K+E+E+K+E+
Sbjct: 15  EEKREEKEREKEEKERKEEK 34


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 30.1 bits (68), Expect = 1.7
 Identities = 18/43 (41%), Positives = 28/43 (65%), Gaps = 2/43 (4%)

Query: 254 PPKEPQEPEEEVKEEEERK--KEEEKKKEEDGQHQQDKARSVT 294
           P +E +E +EE KEEEE+   KEEE  +EE+ + ++ K + V 
Sbjct: 36  PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVK 78



 Score = 29.7 bits (67), Expect = 2.3
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 257 EPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
           E + P+EE +EE+E KKEEE+K  +  +   ++
Sbjct: 32  EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEE 64



 Score = 29.4 bits (66), Expect = 2.9
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKK 279
            +  K  +E    +EE  +EEE K+E++KK 
Sbjct: 44  KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKT 74



 Score = 29.4 bits (66), Expect = 3.4
 Identities = 13/24 (54%), Positives = 21/24 (87%)

Query: 259 QEPEEEVKEEEERKKEEEKKKEED 282
           +E E+EV +EEE +++EEKK+EE+
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEE 52



 Score = 27.8 bits (62), Expect = 9.7
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEE 281
              +E E + +EE+E KK++ KK +E
Sbjct: 54  TTDKEEEVDEEEEKEEKKKKTKKVKE 79


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 30.3 bits (69), Expect = 1.7
 Identities = 7/39 (17%), Positives = 21/39 (53%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
            P E ++  ++ ++ E++ ++EE +K    +  +  A+ 
Sbjct: 407 SPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKK 445


>gnl|CDD|214008 cd12809, Esterase_713_like-2, Uncharacterized enzymes similar to
           novel bacterial esterase that cleaves esters on
           halogenated cyclic compounds.  This family contains
           uncharacterized proteins similar to a novel bacterial
           esterase (Alcaligenes esterase 713) with the alpha/beta
           hydrolase fold but does not contain the GXSXXG
           pentapeptide around the active site serine residue as
           commonly seen in other enzymes of this class. Esterase
           713 shows negligible sequence homology to other esterase
           and lipase enzymes. It is active as a dimer and cleaves
           esters on halogenated cyclic compounds though its
           natural substrate is unknown.
          Length = 280

 Score = 29.9 bits (68), Expect = 1.8
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 37  ILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFP 71
           IL+ HS GG   +  A   PD VK ++  +P G P
Sbjct: 174 ILITHSQGGPFGWLAADARPDLVKAIVAIEPSGPP 208


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 29.0 bits (65), Expect = 1.8
 Identities = 14/42 (33%), Positives = 23/42 (54%), Gaps = 8/42 (19%)

Query: 254 PPKEP--------QEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
           PP+ P           +EE +EEEE +K+++  + +  QHQQ
Sbjct: 80  PPEGPAGTTSGTGHTRQEEEEEEEENEKQQQSDEAQVQQHQQ 121


>gnl|CDD|234235 TIGR03503, TIGR03503, TIGR03503 family protein.  This set of
           conserved hypothetical protein has a phylogenetic range
           that closely matches that of TIGR03501, a putative
           C-terminal protein targeting signal.
          Length = 374

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 14/66 (21%), Positives = 31/66 (46%), Gaps = 3/66 (4%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMT 310
            V+ P E +   EE   EE  K + E+++    + +   ++   F  + IV   +++L+ 
Sbjct: 306 NVEEPPEAEPSAEEAATEEALKAKAEQEQALLVEREAKDSK---FWIIIIVGNVVILLLG 362

Query: 311 LLTMFL 316
            +  F+
Sbjct: 363 GIGFFV 368


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 29.3 bits (66), Expect = 1.9
 Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
               E E EV+EEE  KKE++KKKE   + ++ K
Sbjct: 140 TAKVEKEAEVEEEE--KKEKKKKKEVKKEKKEKK 171


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
           MRVI1 proteins which are related to the
           lymphoid-restricted membrane protein (JAW1) and the IP3
           receptor associated cGMP kinase substrates A and B
           (IRAGA and IRAGB). The function of MRVI1 is unknown
           although mutations in the Mrvi1 gene induces myeloid
           leukaemia by altering the expression of a gene important
           for myeloid cell growth and/or differentiation so it has
           been speculated that Mrvi1 is a tumour suppressor gene.
           IRAG is very similar in sequence to MRVI1 and is an
           essential NO/cGKI-dependent regulator of IP3-induced
           calcium release. Activation of cGKI decreases
           IP3-stimulated elevations in intracellular calcium,
           induces smooth muscle relaxation and contributes to the
           antiproliferative and pro-apoptotic effects of NO/cGMP.
           Jaw1 is a member of a class of proteins with
           COOH-terminal hydrophobic membrane anchors and is
           structurally similar to proteins involved in vesicle
           targeting and fusion. This suggests that the function
           and/or the structure of the ER in lymphocytes may be
           modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 30.0 bits (67), Expect = 2.0
 Identities = 16/84 (19%), Positives = 31/84 (36%), Gaps = 5/84 (5%)

Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKK-----EEDGQHQQDKARSVTFMTLGIVMVS 304
           K  K  ++ +EPE E   E  RK    ++       +  Q     A  VTF+ +     +
Sbjct: 418 KKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQRSSKLASWVTFLQVSYRKAN 477

Query: 305 MMILMTLLTMFLVLFGLGYYFSYL 328
             + +    + L+   + +    L
Sbjct: 478 QALWLMAAAILLLAALMSFLTGQL 501


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 29.8 bits (67), Expect = 2.0
 Identities = 9/34 (26%), Positives = 19/34 (55%)

Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
             + +     +E EE+ KE+ +R K+  + ++ED
Sbjct: 275 KPSGERSDSEEETEEKEKEKRKRLKKMMEDEDED 308


>gnl|CDD|223187 COG0109, CyoE, Polyprenyltransferase (cytochrome oxidase assembly
           factor) [Posttranslational modification, protein
           turnover, chaperones].
          Length = 304

 Score = 29.9 bits (68), Expect = 2.0
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 297 TLGIVMVSMMILMT-LLTMFLVLFGL-GYYFSYLKTHFVK 334
            LG+   S++  +  LL   L LFG+  Y   Y  T ++K
Sbjct: 107 VLGVAGFSLLWFLVNLLAAVLGLFGIFFYVVVY--TLWLK 144


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 29.3 bits (66), Expect = 2.0
 Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 225 RADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
               F + V      ++E+ DI    ++   K  +   E+V EE+E+ +EEE+++EE+
Sbjct: 131 ALSKFKRKVG---LFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEE 185



 Score = 27.8 bits (62), Expect = 8.1
 Identities = 11/43 (25%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 243 KLDIVTTKAVKPPKEPQEPEEEVKE--EEERKKEEEKKKEEDG 283
           K+ + T +     ++    E+++KE   E+  +E+EK +EE+ 
Sbjct: 138 KVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEE 180


>gnl|CDD|223964 COG1033, COG1033, Predicted exporters of the RND superfamily
           [General function prediction only].
          Length = 727

 Score = 30.0 bits (68), Expect = 2.0
 Identities = 14/68 (20%), Positives = 27/68 (39%), Gaps = 3/68 (4%)

Query: 258 PQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMTLLTMFLV 317
             E E+  +E     ++E       G        SV ++ +  ++ S  ++ T+L + LV
Sbjct: 525 QGELEDVGREILRDIEKEN---IPTGVKVYVTGESVIYVEMNELLTSSQLISTVLGIILV 581

Query: 318 LFGLGYYF 325
              L   F
Sbjct: 582 FALLLIIF 589


>gnl|CDD|177159 MTH00097, ND6, NADH dehydrogenase subunit 6; Provisional.
          Length = 121

 Score = 28.7 bits (65), Expect = 2.0
 Identities = 7/40 (17%), Positives = 14/40 (35%), Gaps = 2/40 (5%)

Query: 296 MTLGIVMVSMMILMTLLTMFLVLFGLGYYFSYLKTHFVKL 335
            +  +    ++     L+MF   F   Y+F       + L
Sbjct: 43  SSFKLKKPYLVFFSLFLSMFF--FFFFYFFKVYFFSNLSL 80


>gnl|CDD|130799 TIGR01738, bioH, pimelyl-[acyl-carrier protein] methyl ester
           esterase.  This CoA-binding enzyme is required for the
           production of pimeloyl-coenzyme A, the substrate of the
           BioF protein early in the biosynthesis of biotin. Its
           exact function is unknown, but is proposed in ref 2.
           This enzyme belongs to the alpha/beta hydrolase fold
           family (Pfam model pfam00561). Members of this family
           are restricted to the Proteobacteria [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Biotin].
          Length = 245

 Score = 29.4 bits (66), Expect = 2.1
 Identities = 34/161 (21%), Positives = 51/161 (31%), Gaps = 39/161 (24%)

Query: 37  ILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDPQ--KASKIPLWARMIGNLY 94
           I LG S GG +A   A  +PDRV+ L+       P  S      +  K  +       L 
Sbjct: 68  IWLGWSLGGLVALHIAATHPDRVRALVTVA--SSPCFSAREDWPEGIKPDVLTGFQQQLS 125

Query: 95  KNFNPLWPVRFVG------PLGQWVVEKMRPDLPKKFTPVLKEDSSAITEYIFQCNVQAP 148
            ++      RF+       P  +     ++  L  + TP                NVQ  
Sbjct: 126 DDYQ-RTIERFLALQTLGTPTARQDARALKQTLLARPTP----------------NVQ-- 166

Query: 149 SGESAFHTLTEGLGYAKRPML-HRVDQLAAHVPVTVIYGSR 188
                   L  GL       L   +  ++  VP   +YG  
Sbjct: 167 -------VLQAGLEILATVDLRQPLQNIS--VPFLRLYGYL 198


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 29.2 bits (66), Expect = 2.1
 Identities = 12/51 (23%), Positives = 24/51 (47%), Gaps = 8/51 (15%)

Query: 249 TKAVKPPKEPQEPEEEVKEEE--------ERKKEEEKKKEEDGQHQQDKAR 291
            K  +  +E +   EE+ EEE        E  K + +K+ E+ + Q++  +
Sbjct: 98  KKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148


>gnl|CDD|217536 pfam03403, PAF-AH_p_II, Platelet-activating factor acetylhydrolase,
           isoform II.  Platelet-activating factor acetylhydrolase
           (PAF-AH) is a subfamily of phospholipases A2,
           responsible for inactivation of platelet-activating
           factor through cleavage of an acetyl group. Three known
           PAF-AHs are the brain heterotrimeric PAF-AH Ib, whose
           catalytic beta and gamma subunits are aligned in
           pfam02266, the extracellular, plasma PAF-AH (pPAF-AH),
           and the intracellular PAF-AH isoform II (PAF-AH II).
           This family aligns pPAF-AH and PAF-AH II, whose
           similarity was previously noted.
          Length = 372

 Score = 29.7 bits (67), Expect = 2.1
 Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 1/35 (2%)

Query: 37  ILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFP 71
            ++GHSFGG      ++    R +  I  D W FP
Sbjct: 224 AVIGHSFGGATVIQ-SLSEDTRFRCGIALDAWMFP 257


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 30.0 bits (67), Expect = 2.2
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
               PKE ++  E  K E E+ + E KK +E+    +D  
Sbjct: 265 DTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHK 304


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
           domain family is found in eukaryotes, and is typically
           between 99 and 114 amino acids in length. The family is
           found in association with pfam08697, pfam01585. There
           are two completely conserved residues (G and F) that may
           be functionally important. TIP is involved in enamel
           assembly by interacting with one of the major proteins
           responsible for biomineralisation of enamel - tuftelin.
          Length = 106

 Score = 28.1 bits (63), Expect = 2.3
 Identities = 8/35 (22%), Positives = 22/35 (62%)

Query: 247 VTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
           V+    +  KEP+E ++E + E++ + +E+  +++
Sbjct: 72  VSGGIKQAAKEPEEDDKEDESEDDDESDEDDDEDD 106


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 28.1 bits (63), Expect = 2.3
 Identities = 10/23 (43%), Positives = 15/23 (65%), Gaps = 2/23 (8%)

Query: 260 EPEEEVKEEEERKKEEEKKKEED 282
               E K+EE  KKEEE+++ +D
Sbjct: 76  AAAAEAKKEE--KKEEEEEESDD 96


>gnl|CDD|115027 pfam06342, DUF1057, Alpha/beta hydrolase of unknown function
           (DUF1057).  This family consists of several
           Caenorhabditis elegans specific proteins of unknown
           function. Members of this family have an alpha/beta
           hydrolase fold.
          Length = 297

 Score = 29.3 bits (66), Expect = 2.3
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 4/51 (7%)

Query: 29  KELKLQE-MILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGF-PQKSIDP 77
           +EL+L+  +I++GHS G   A   A   P     L++ +P GF   K I P
Sbjct: 98  EELELKGKLIIMGHSRGCENALQTATTRP--AHGLVMINPTGFRIHKGIRP 146


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 28.3 bits (63), Expect = 2.4
 Identities = 9/41 (21%), Positives = 21/41 (51%)

Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTF 295
             + Q  +E+  E+ +    ++KKK    + +++  R+VT 
Sbjct: 2   APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTV 42


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 29.6 bits (67), Expect = 2.4
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
              V P ++ +E E+E +EEEE ++E E+ + E+G
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373


>gnl|CDD|130895 TIGR01836, PHA_synth_III_C, poly(R)-hydroxyalkanoic acid synthase,
           class III, PhaC subunit.  This model represents the PhaC
           subunit of a heterodimeric form of polyhydroxyalkanoic
           acid (PHA) synthase. Excepting the PhaC of Bacillus
           megaterium (which needs PhaR), all members require PhaE
           (TIGR01834) for activity and are designated class III.
           This enzyme builds ester polymers for carbon and energy
           storage that accumulate in inclusions, and both this
           enzyme and the depolymerase associate with the
           inclusions. Class III enzymes polymerize
           short-chain-length hydroxyalkanoates [Fatty acid and
           phospholipid metabolism, Biosynthesis].
          Length = 350

 Score = 29.7 bits (67), Expect = 2.4
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 7/60 (11%)

Query: 32  KLQEMILLGHSFGGYLAFAYAIQYPDRVKHLI-LADPWGFPQKSIDPQKASKIPLWARMI 90
           KL ++ LLG   GG  +  YA  YPD++K+L+ +  P  F          + +  WAR +
Sbjct: 134 KLDQISLLGICQGGTFSLCYAALYPDKIKNLVTMVTPVDFETPG------NMLSNWARHV 187


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 28.6 bits (64), Expect = 2.6
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEE 281
           P +  +E +E  KE EE++  + KK+++
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKK 119



 Score = 28.2 bits (63), Expect = 4.6
 Identities = 11/30 (36%), Positives = 20/30 (66%), Gaps = 2/30 (6%)

Query: 254 PPKEPQEPEEEVKEE--EERKKEEEKKKEE 281
             KE +E  +E +E+   + KKE+++KKE+
Sbjct: 95  SEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 29.0 bits (65), Expect = 2.6
 Identities = 13/46 (28%), Positives = 21/46 (45%)

Query: 247 VTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
            TTK  K   + +  EEE K E +  K EE + + D   +   ++ 
Sbjct: 63  ATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQE 108


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 29.1 bits (65), Expect = 2.6
 Identities = 9/26 (34%), Positives = 20/26 (76%)

Query: 266 KEEEERKKEEEKKKEEDGQHQQDKAR 291
           K++E+R+ E+ +K +ED + +Q++ R
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEEKR 190


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 28.6 bits (64), Expect = 2.8
 Identities = 13/55 (23%), Positives = 26/55 (47%)

Query: 235 DTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
           D    ++E+    T +   P  + +  E++ +E  + K E+EK K E  + +  K
Sbjct: 96  DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150


>gnl|CDD|236060 PRK07581, PRK07581, hypothetical protein; Validated.
          Length = 339

 Score = 29.1 bits (66), Expect = 2.9
 Identities = 16/61 (26%), Positives = 32/61 (52%), Gaps = 6/61 (9%)

Query: 1   FGRSSRPRFST-DPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRV 59
           F  +  P  +  D    + +L+T  E++  E +L   +++G S G    + +A++YPD V
Sbjct: 95  FNAARFPHVTIYDNVRAQHRLLT--EKFGIE-RLA--LVVGWSMGAQQTYHWAVRYPDMV 149

Query: 60  K 60
           +
Sbjct: 150 E 150


>gnl|CDD|131868 TIGR02821, fghA_ester_D, S-formylglutathione hydrolase.  This model
           describes a protein family from bacteria, yeast, and
           human, with a conserved critical role in formaldehyde
           detoxification as S-formylglutathione hydrolase (EC
           3.1.2.12). Members in eukaryotes such as the human
           protein are better known as esterase D (EC 3.1.1.1), an
           enzyme with broad specificity, although
           S-formylglutathione hydrolase has now been demonstrated
           as well [Cellular processes, Detoxification].
          Length = 275

 Score = 29.0 bits (65), Expect = 2.9
 Identities = 17/45 (37%), Positives = 23/45 (51%), Gaps = 8/45 (17%)

Query: 38  LLGHSFGGYLAFAYAIQYPDRVKHL-----ILAD---PWGFPQKS 74
           + GHS GG+ A   A++ PDR K +     I+A    PWG    S
Sbjct: 142 ITGHSMGGHGALVIALKNPDRFKSVSAFAPIVAPSRCPWGQKAFS 186


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 28.9 bits (65), Expect = 2.9
 Identities = 9/34 (26%), Positives = 22/34 (64%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
           KE +E ++   ++++ KK+++K K++    + DK
Sbjct: 76  KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109


>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
           (NESP55).  This family consists of several mammalian
           neuroendocrine-specific golgi protein P55 (NESP55)
           sequences. NESP55 is a novel member of the chromogranin
           family and is a soluble, acidic, heat-stable secretory
           protein that is expressed exclusively in endocrine and
           nervous tissues, although less widely than
           chromogranins.
          Length = 261

 Score = 29.1 bits (64), Expect = 3.0
 Identities = 11/45 (24%), Positives = 27/45 (60%)

Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
           +T+  +  +E +EPE    +++ R  EEE++++E+ + Q  + + 
Sbjct: 192 STQEPESAREGEEPERGPLDKDPRDPEEEEEEKEEEKQQPHRCKP 236


>gnl|CDD|177149 MTH00080, COX2, cytochrome c oxidase subunit II; Provisional.
          Length = 231

 Score = 28.8 bits (65), Expect = 3.0
 Identities = 7/33 (21%), Positives = 15/33 (45%)

Query: 300 IVMVSMMILMTLLTMFLVLFGLGYYFSYLKTHF 332
            ++    +L  ++ +FL L    +YF   K  +
Sbjct: 28  SLLFGEFVLAFVVFLFLYLISNNFYFKSKKIEY 60


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 27.7 bits (62), Expect = 3.0
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 259 QEPEEEVKEEEERKKEEEKKK 279
            E  EE + E+E ++E+  +K
Sbjct: 84  LEDREEERLEKEEEREKRARK 104


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 28.7 bits (64), Expect = 3.2
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 259 QEPEEEVKEEEERKKEEEKKKEED 282
            E +E+ +EEEE  +EE ++ E+ 
Sbjct: 45  DEKQEDDEEEEEEDEEEIEEPEDI 68



 Score = 28.3 bits (63), Expect = 3.6
 Identities = 14/39 (35%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 257 EPQEPEEEVKEEEERKKEEEKKKEED--GQHQQDKARSV 293
           E +E EEE +E+    K+ EKK   D     Q D A+++
Sbjct: 76  EDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNL 114



 Score = 27.5 bits (61), Expect = 6.1
 Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
            E  E EEE+ E+EE  +EEE +++       +K
Sbjct: 65  PEDIEDEEEIVEDEE--EEEEDEEDNVDLKDIEK 96



 Score = 27.1 bits (60), Expect = 9.5
 Identities = 17/68 (25%), Positives = 36/68 (52%), Gaps = 3/68 (4%)

Query: 224 DRADVF--NKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEER-KKEEEKKKE 280
           D+ D+   N+ V D     DE+ +    + ++ P++ ++ EE V++EEE  + EE+    
Sbjct: 32  DKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDL 91

Query: 281 EDGQHQQD 288
           +D + +  
Sbjct: 92  KDIEKKNI 99


>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding.  This domain is
           found on CASC3 (cancer susceptibility candidate gene 3
           protein) which is also known as Barentsz (Btz). CASC3 is
           a component of the EJC (exon junction complex) which is
           a complex that is involved in post-transcriptional
           regulation of mRNA in metazoa. The complex is formed by
           the association of four proteins (eIF4AIII, Barentsz,
           Mago, and Y14), mRNA, and ATP. This domain wraps around
           eIF4AIII and stacks against the 5' nucleotide.
          Length = 116

 Score = 27.8 bits (62), Expect = 3.3
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 259 QEPEEEVKEEEERKKEEEKKKEED 282
            EP E  +++E +++E EK  +ED
Sbjct: 14  AEPTEPKEDKERKRREHEKYDDED 37


>gnl|CDD|131480 TIGR02427, protocat_pcaD, 3-oxoadipate enol-lactonase.  Members of
           this family are 3-oxoadipate enol-lactonase. Note that
           the substrate is known as 3-oxoadipate enol-lactone,
           2-oxo-2,3-dihydrofuran-5-acetate,
           4,5-Dihydro-5-oxofuran-2-acetate, and
           5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the
           catalyzes the fourth step in the protocatechuate
           degradation to beta-ketoadipate and then to succinyl-CoA
           and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate,
           and vanillate all can be converted in one step to
           protocatechuate. This enzyme also acts in catechol
           degradation. In genomes that catabolize both catechol
           and protocatechuate, two forms of this enzyme may be
           found. All members of the seed alignment for this model
           were chosen from within protocatechuate degradation
           operons of at least three genes of the pathway, from
           genomes with the complete pathway through
           beta-ketoadipate [Energy metabolism, Other].
          Length = 251

 Score = 28.9 bits (65), Expect = 3.3
 Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 4/65 (6%)

Query: 2   GRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKH 61
           G S  P     P ++E  L   +      L ++  +  G S GG +A   A + PDRV+ 
Sbjct: 51  GLSDAPE---GPYSIE-DLADDVLALLDHLGIERAVFCGLSLGGLIAQGLAARRPDRVRA 106

Query: 62  LILAD 66
           L+L++
Sbjct: 107 LVLSN 111


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 28.3 bits (63), Expect = 3.5
 Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 4/38 (10%)

Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEED----GQHQQDK 289
           +E +E +E ++E EE +K + +K EE+     Q  +DK
Sbjct: 34  EEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDK 71


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 29.4 bits (66), Expect = 3.6
 Identities = 17/55 (30%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 238 TLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
           TLSD   + V     K P E +E + E +++E +K   ++KK+  G  +  KA++
Sbjct: 170 TLSDSDQEAVLPLVKKSPSEAKEEKAEERKQESKKG-AKRKKDASGDDKSKKAKT 223


>gnl|CDD|179147 PRK00870, PRK00870, haloalkane dehalogenase; Provisional.
          Length = 302

 Score = 28.8 bits (65), Expect = 3.7
 Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 2/65 (3%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
           FGRS +P    D  T  R  V  +  W ++L L ++ L+   +GG +    A ++PDR  
Sbjct: 84  FGRSDKPTRRED-YTYARH-VEWMRSWFEQLDLTDVTLVCQDWGGLIGLRLAAEHPDRFA 141

Query: 61  HLILA 65
            L++A
Sbjct: 142 RLVVA 146


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 28.9 bits (64), Expect = 3.8
 Identities = 16/50 (32%), Positives = 31/50 (62%), Gaps = 5/50 (10%)

Query: 247 VTTKAVKPPKEP-----QEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
           V  KA KP +E      QE  +E K EE++ +EE++++E + + +Q++ +
Sbjct: 190 VQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQ 239



 Score = 28.5 bits (63), Expect = 5.2
 Identities = 13/31 (41%), Positives = 19/31 (61%), Gaps = 1/31 (3%)

Query: 258 PQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
             E ++  +E+EE + EEE K+EE GQ   D
Sbjct: 214 KTEEKQPQEEQEEEEVEEEAKQEE-GQGTDD 243


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 28.9 bits (65), Expect = 3.8
 Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 7/35 (20%)

Query: 254 PPKEPQEPEEEVKEEEE-------RKKEEEKKKEE 281
           P +E +EPEE  +E EE       RK+  E+ KE+
Sbjct: 150 PEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQ 184


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 29.2 bits (66), Expect = 3.8
 Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 3/49 (6%)

Query: 239 LSDEKLDIVTTKAVKPPKEPQE---PEEEVKEEEERKKEEEKKKEEDGQ 284
           L+  K+  +      P +EP+E    E E  + E+ K E++   E+ GQ
Sbjct: 817 LTTYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQ 865


>gnl|CDD|178633 PLN03084, PLN03084, alpha/beta hydrolase fold protein; Provisional.
          Length = 383

 Score = 28.7 bits (64), Expect = 4.1
 Identities = 20/67 (29%), Positives = 33/67 (49%)

Query: 1   FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
           FG S +P+          + V+S+E    ELK  ++ L+   +       YA  +PD++K
Sbjct: 164 FGFSDKPQPGYGFNYTLDEYVSSLESLIDELKSDKVSLVVQGYFSPPVVKYASAHPDKIK 223

Query: 61  HLILADP 67
            LIL +P
Sbjct: 224 KLILLNP 230


>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6.  This region covers
           TM regions 4-6 of the ryanodine receptor 1 family.
          Length = 217

 Score = 28.5 bits (64), Expect = 4.2
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 7/41 (17%)

Query: 254 PPKEPQEPE-------EEVKEEEERKKEEEKKKEEDGQHQQ 287
            P+EP  PE       E  +E EE K E EK   EDG+ + 
Sbjct: 74  EPEEPPTPEGAPAEKRELKEEGEEPKSEPEKADLEDGEKRT 114


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 28.6 bits (64), Expect = 4.2
 Identities = 5/36 (13%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
            ++P++ +E      E ++ ++K++ E  + +++  
Sbjct: 71  DQKPEDEKELSASSLEAEQAKKKEEAE-AKKKKEME 105


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 27.6 bits (62), Expect = 4.4
 Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 9/74 (12%)

Query: 223 ADRADVFNKMVN--DTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKE 280
             +       VN      +   ++D +    V+P     E +E    E +R ++EEKK +
Sbjct: 51  ERKRSENAGKVNIGGGLFVDQAEVDAIARSRVQPVL--DEIDERA--EAQRARDEEKKLD 106

Query: 281 ED---GQHQQDKAR 291
           E+    QH++ K R
Sbjct: 107 EEEAKRQHEEAKER 120


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 29.0 bits (65), Expect = 4.4
 Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 2/28 (7%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEE 281
           PP EP EPEEE   +E  + + +  +E 
Sbjct: 278 PPPEPPEPEEE--PDEPDQTDPDDGEET 303


>gnl|CDD|114761 pfam06057, VirJ, Bacterial virulence protein (VirJ).  This family
          consists of several bacterial VirJ virulence proteins.
          VirJ is thought to be involved in the type IV secretion
          system. It is thought that the substrate proteins
          localised to the periplasm may associate with the pilus
          in a manner that is mediated by VirJ, and suggest a
          two-step process for type IV secretion in
          Agrobacterium.
          Length = 192

 Score = 28.5 bits (64), Expect = 4.4
 Identities = 18/50 (36%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 12 DPETVERQLVTSIEEWRKELKLQEMILLGHSFGG-YLAFAYAIQYPDRVK 60
           PE V   L   I+ +RK  K++ ++L+G+SFG   L  AY  + P   K
Sbjct: 46 TPEEVADDLDRIIDTYRKRWKVKNVVLIGYSFGADVLPAAYN-RLPPATK 94


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 28.6 bits (64), Expect = 4.5
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 9/48 (18%)

Query: 249 TKAVKPPKEPQEPEEEVKEE-----EERKKEEEKKKEEDGQHQQDKAR 291
           ++A +   E  E   + +E      EER +  EKKKEE      +KA+
Sbjct: 101 SQADRVEGEEPESIRKWRERRDLRIEERDEASEKKKEE----LIEKAQ 144


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 28.8 bits (65), Expect = 4.5
 Identities = 20/81 (24%), Positives = 35/81 (43%), Gaps = 12/81 (14%)

Query: 222 YADRADVFNKMVN--DTCT---LSDEKL-------DIVTTKAVKPPKEPQEPEEEVKEEE 269
             D   + N ++   D      LS E L       +    K +K  +E ++ E + K+EE
Sbjct: 231 LEDLLPLLNLVLYLVDKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEE 290

Query: 270 ERKKEEEKKKEEDGQHQQDKA 290
           ++K+E E K  +    +Q K 
Sbjct: 291 KKKEEREAKLAKLSPEEQRKL 311


>gnl|CDD|183178 PRK11525, dinD, DNA-damage-inducible protein D; Provisional.
          Length = 279

 Score = 28.5 bits (64), Expect = 4.6
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
           P+E   PEE +K+ E R K  EKK  ++ 
Sbjct: 251 PEELPTPEESIKKLERRLKSTEKKLLKNP 279


>gnl|CDD|201483 pfam00873, ACR_tran, AcrB/AcrD/AcrF family.  Members of this family
            are integral membrane proteins. Some are involved in drug
            resistance. AcrB cooperates with a membrane fusion
            protein, AcrA, and an outer membrane channel TolC. The
            structure shows the AcrB forms a homotrimer.
          Length = 1021

 Score = 28.8 bits (65), Expect = 4.6
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 298  LGIVMVSMMILMTLLTMFLV 317
            LGIV+   +I  T+LT+ +V
Sbjct: 994  LGIVVFGGLITSTVLTLLVV 1013


>gnl|CDD|203773 pfam07819, PGAP1, PGAP1-like protein.  The sequences found in this
           family are similar to PGAP1. This is an endoplasmic
           reticulum membrane protein with a catalytic serine
           containing motif that is conserved in a number of
           lipases. PGAP1 functions as a GPI inositol-deacylase;
           this deacylation is important for the efficient
           transport of GPI-anchored proteins from the endoplasmic
           reticulum to the Golgi body.
          Length = 225

 Score = 28.5 bits (64), Expect = 4.7
 Identities = 18/69 (26%), Positives = 25/69 (36%), Gaps = 15/69 (21%)

Query: 37  ILLGHSFGGYLA---FAYAIQYPDRVKHLI-LADPWGFPQKSIDPQKASKIPLWARMIGN 92
           IL+GHS GG +A          PD V  ++ L+ P   P  + D             +  
Sbjct: 88  ILIGHSMGGLVARAALTLPNYIPDSVNTIVTLSSPHAGPPLTFDGD-----------LLR 136

Query: 93  LYKNFNPLW 101
            Y   N  W
Sbjct: 137 FYALLNEYW 145


>gnl|CDD|183863 PRK13107, PRK13107, preprotein translocase subunit SecA; Reviewed.
          Length = 908

 Score = 28.8 bits (64), Expect = 4.7
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
           + + +V+E E R++EE+ K + D QH   +A
Sbjct: 832 QAQSDVEEMEARRREEDAKIQRDYQHAAAEA 862


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 28.8 bits (64), Expect = 4.8
 Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 250 KAVKPPKEPQEPEEEVKEEEERKK--EEEKKKEEDGQHQQDKA 290
           +  K  +  +E + +   E+ER K  E+E+ K ++ Q Q ++A
Sbjct: 81  RKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEA 123


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 29.0 bits (65), Expect = 5.1
 Identities = 12/61 (19%), Positives = 33/61 (54%), Gaps = 2/61 (3%)

Query: 234  NDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG--QHQQDKAR 291
            N+   L +EKL ++    ++  ++ +E  E   +E+   +E+  +++++   + ++ KA 
Sbjct: 2556 NEIKELENEKLKLLEKAKIEESRKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAY 2615

Query: 292  S 292
            S
Sbjct: 2616 S 2616


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
           Ribosomal proteins P1 and P2 are the eukaryotic proteins
           that are functionally equivalent to bacterial L7/L12.
           L12p is the archaeal homolog. Unlike other ribosomal
           proteins, the archaeal L12p and eukaryotic P1 and P2 do
           not share sequence similarity with their bacterial
           counterparts. They are part of the ribosomal stalk
           (called the L7/L12 stalk in bacteria), along with 28S
           rRNA and the proteins L11 and P0 in eukaryotes (23S
           rRNA, L11, and L10e in archaea). In bacterial ribosomes,
           L7/L12 homodimers bind the extended C-terminal helix of
           L10 to anchor the L7/L12 molecules to the ribosome.
           Eukaryotic P1/P2 heterodimers and archaeal L12p
           homodimers are believed to bind the L10 equivalent
           proteins, eukaryotic P0 and archaeal L10e, in a similar
           fashion. P1 and P2 (L12p, L7/L12) are the only proteins
           in the ribosome to occur as multimers, always appearing
           as sets of dimers. Recent data indicate that most
           archaeal species contain six copies of L12p (three
           homodimers), while eukaryotes have two copies each of P1
           and P2 (two heterodimers). Bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. As in bacteria, the stalk is crucial for
           binding of initiation, elongation, and release factors
           in eukaryotes and archaea.
          Length = 105

 Score = 27.2 bits (60), Expect = 5.1
 Identities = 12/41 (29%), Positives = 20/41 (48%)

Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
           E +      A  P        E  ++ EE K+EEE++++ED
Sbjct: 59  ELMSSQAAAAAAPAATAAATAEPAEKAEEAKEEEEEEEDED 99


>gnl|CDD|240536 cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic
           compound extrusion (MATE)-like proteins similar to
           Vibrio cholerae NorM.  The integral membrane proteins
           from the MATE family are involved in exporting
           metabolites across the cell membrane and are responsible
           for multidrug resistance (MDR) in many bacteria and
           animals. This subfamily includes Vibrio cholerae NorM
           and functions most likely as a multidrug efflux pump,
           removing xenobiotics from the interior of the cell. The
           pump utilizes a cation gradient across the membrane to
           facilitate the export process. NorM appears to bind
           monovalent cations in an outward-facing conformation and
           may subsequently cycle through an inward-facing and
           outward-facing conformation to capture and release its
           substrate.
          Length = 435

 Score = 28.6 bits (65), Expect = 5.2
 Identities = 10/42 (23%), Positives = 19/42 (45%), Gaps = 3/42 (7%)

Query: 283 GQHQQDKARSVTFMTLGI-VMVS--MMILMTLLTMFLVLFGL 321
           G   + + R      L + +++S  +M+L+ L    L L G 
Sbjct: 71  GAGDRRRIRRAVRQGLWLALLLSLPLMLLLWLAPPILPLMGQ 112


>gnl|CDD|216672 pfam01738, DLH, Dienelactone hydrolase family. 
          Length = 215

 Score = 28.1 bits (63), Expect = 5.2
 Identities = 14/54 (25%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 180 PVTVIYGSR-SWVDNSSGDKIKEA-RSQNSFVQVKSVTGAGHHVYADRADVFNK 231
           P+   +G    +V   + D ++EA R+ N   ++    GAGH   AD  + ++ 
Sbjct: 145 PILAHFGEEDPFVPAEARDLLEEALRAANVDHELHVYPGAGHAFAADGRESYDA 198


>gnl|CDD|238382 cd00741, Lipase, Lipase.  Lipases are esterases that can
          hydrolyze long-chain acyl-triglycerides into di- and
          monoglycerides, glycerol, and free fatty acids at a
          water/lipid interface.  A typical feature of lipases is
          "interfacial activation", the process of becoming
          active at the lipid/water interface, although several
          examples of lipases have been identified that do not
          undergo interfacial activation . The active site of a
          lipase contains a catalytic triad consisting of Ser -
          His - Asp/Glu, but unlike most serine proteases, the
          active site is buried inside the structure.  A "lid" or
          "flap" covers the active site, making it inaccessible
          to solvent and substrates. The lid opens during the
          process of interfacial activation, allowing the lipid
          substrate access to the active site.
          Length = 153

 Score = 27.9 bits (62), Expect = 5.4
 Identities = 11/75 (14%), Positives = 25/75 (33%), Gaps = 9/75 (12%)

Query: 18 RQLVTSIEEWRKELKLQEMILLGHSFGG----YLAFAYAIQYPDRVKHLILADP-----W 68
            ++  ++    +    ++ + GHS GG            +   R+  +    P      
Sbjct: 12 NLVLPLLKSALAQYPDYKIHVTGHSLGGALAGLAGLDLRGRGLGRLVRVYTFGPPRVGNA 71

Query: 69 GFPQKSIDPQKASKI 83
           F +  +DP  A  +
Sbjct: 72 AFAEDRLDPSDALFV 86


>gnl|CDD|204918 pfam12436, USP7, Ubiquitin-specific protease 7.  This domain family
           is found in eukaryotes, and is approximately 40 amino
           acids in length. The family is found in association with
           pfam00443, pfam00917. USP7 regulates the turnover of
           p53.
          Length = 35

 Score = 25.5 bits (57), Expect = 5.4
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 268 EEERKKEEEKKKEEDGQHQ 286
           EEER++ E ++KE +  H 
Sbjct: 14  EEEREERERRRKEREEAHL 32


>gnl|CDD|240542 cd13137, MATE_NorM_like, Subfamily of the multidrug and toxic
           compound extrusion (MATE)-like proteins similar to
           Thermotoga marina NorM.  The integral membrane proteins
           from the MATE family are involved in exporting
           metabolites across the cell membrane and are responsible
           for multidrug resistance (MDR) in many bacteria and
           animals. A number of family members are involved in the
           synthesis of peptidoglycan components in bacteria.
          Length = 432

 Score = 28.6 bits (65), Expect = 5.6
 Identities = 10/38 (26%), Positives = 16/38 (42%), Gaps = 4/38 (10%)

Query: 283 GQHQQDKARSVTFMTLGIVMVSMMILMTLLTMFLVLFG 320
           G    D+AR      L +     ++ M LL + L +F 
Sbjct: 294 GAGDPDEARRYGKRALRLG----VLFMGLLGLLLFIFA 327


>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
           co-repressor CIR.  This is a 45 residue conserved region
           at the N-terminal end of a family of proteins referred
           to as CIRs (CBF1-interacting co-repressors). CBF1
           (centromere-binding factor 1) acts as a transcription
           factor that causes repression by binding specifically to
           GTGGGAA motifs in responsive promoters, and it requires
           CIR as a co-repressor. CIR binds to histone deacetylase
           and to SAP30 and serves as a linker between CBF1 and the
           histone deacetylase complex.
          Length = 37

 Score = 25.2 bits (56), Expect = 5.7
 Identities = 8/19 (42%), Positives = 14/19 (73%)

Query: 263 EEVKEEEERKKEEEKKKEE 281
           + V + E++ +EE+KK EE
Sbjct: 11  KRVWKAEQKAEEEKKKIEE 29



 Score = 25.2 bits (56), Expect = 6.4
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 262 EEEVKEEEERKKEEEKKKE 280
           + E K EEE+KK EE++KE
Sbjct: 15  KAEQKAEEEKKKIEERRKE 33


>gnl|CDD|214445 MTH00166, ND6, NADH dehydrogenase subunit 6; Provisional.
          Length = 160

 Score = 27.8 bits (63), Expect = 5.7
 Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 294 TFMTLGIVMVSMMILMTLLT-MFLVLFGLGYYFSYL 328
            F+   + M  ++I+ T+L  + L L    ++FSY+
Sbjct: 16  LFLNHPLSMGLILIIQTILISLLLGLMMKSFWFSYI 51


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 28.5 bits (63), Expect = 6.0
 Identities = 16/55 (29%), Positives = 25/55 (45%)

Query: 232 MVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQ 286
           +VN+   L  E          K      + EEE  E++ RK+ + KK +ED + Q
Sbjct: 99  VVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQ 153


>gnl|CDD|227257 COG4920, COG4920, Predicted membrane protein [Function unknown].
          Length = 249

 Score = 27.9 bits (62), Expect = 6.0
 Identities = 19/103 (18%), Positives = 38/103 (36%), Gaps = 11/103 (10%)

Query: 244 LDIVTTKAVKPP----KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLG 299
           L IV      P        QE        EE++  E  +K+E+  ++  +    +FM + 
Sbjct: 45  LSIVMGLRSNPYLKGVVSAQEISRARPLYEEKEANELLEKDEELLNEYKRFARASFMPML 104

Query: 300 IVMVSMMILMTLLTMFLVLFG-------LGYYFSYLKTHFVKL 335
            ++   ++L T L   L           +   F+   ++F  +
Sbjct: 105 TMLPYFILLATFLFPTLPPLAENGLGPVVLREFARFLSYFELV 147


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.2 bits (63), Expect = 6.1
 Identities = 13/47 (27%), Positives = 27/47 (57%), Gaps = 3/47 (6%)

Query: 248 TTKAVKPPKE-PQEPEEEVKEEEERKKEE--EKKKEEDGQHQQDKAR 291
            + + KP +   QE  ++  +E E KK +  ++K+EE+ + Q+ K +
Sbjct: 165 LSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQ 211



 Score = 27.8 bits (62), Expect = 8.6
 Identities = 10/33 (30%), Positives = 23/33 (69%), Gaps = 2/33 (6%)

Query: 259 QEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
           QE E +  +++++K+EEE++K+   + QQ++  
Sbjct: 185 QEWELKKLKQQQQKREEERRKQR--KKQQEEEE 215


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 27.3 bits (59), Expect = 6.3
 Identities = 12/42 (28%), Positives = 27/42 (64%)

Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
             K  +  K+P++PE E +++ E+ + E++KK E  +++ +K
Sbjct: 59  IKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEK 100


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 28.5 bits (63), Expect = 6.4
 Identities = 12/29 (41%), Positives = 19/29 (65%), Gaps = 4/29 (13%)

Query: 259 QEPEEEVKEEEERKK----EEEKKKEEDG 283
            E EE  K+ EER+K    EE+++K+E+ 
Sbjct: 214 LELEELKKKREERRKVLEEEEQRRKQEEA 242


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 28.4 bits (63), Expect = 6.5
 Identities = 17/81 (20%), Positives = 26/81 (32%), Gaps = 2/81 (2%)

Query: 204 SQNSFVQVKSVTG--AGHHVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEP 261
           +  S VQV    G           AD+  +   D    +     +    A   P+EP   
Sbjct: 345 AAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAAL 404

Query: 262 EEEVKEEEERKKEEEKKKEED 282
             E  +E E +  E+     D
Sbjct: 405 ASEAHDETEPEVPEKAAPIPD 425


>gnl|CDD|224978 COG2067, FadL, Long-chain fatty acid transport protein [Lipid
           metabolism].
          Length = 440

 Score = 28.1 bits (63), Expect = 6.6
 Identities = 20/63 (31%), Positives = 24/63 (38%), Gaps = 16/63 (25%)

Query: 147 APSGESAFHTLTEGLGYAKRPM--------LHRV-DQLAAHVPVTVIYGSRSWVDNSSGD 197
            P G  A   +T G G     +         H+V DQ A H       GS  W D SS D
Sbjct: 263 LPIGPLALGGVTPGSGKLTLSLPASAELSGQHKVADQWAIH-------GSVKWTDWSSFD 315

Query: 198 KIK 200
           K+ 
Sbjct: 316 KLD 318


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 28.0 bits (62), Expect = 6.6
 Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 2/31 (6%)

Query: 256 KEPQEPEEEVKEE--EERKKEEEKKKEEDGQ 284
           K  Q+ EEEVKE+     +K    K+EEDG 
Sbjct: 96  KTKQKVEEEVKEQLQSLLEKIVVSKQEEDGP 126


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 28.4 bits (64), Expect = 6.7
 Identities = 7/39 (17%), Positives = 19/39 (48%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
           PP++P+     +++E    K++ + +  +    Q  A +
Sbjct: 136 PPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEA 174


>gnl|CDD|237418 PRK13534, PRK13534, 7-cyano-7-deazaguanine tRNA-ribosyltransferase;
           Provisional.
          Length = 639

 Score = 28.4 bits (64), Expect = 6.8
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 6/39 (15%)

Query: 244 LDIVTTKAVKPPKEPQE-PEEEVKEEEERKKEEEKKKEE 281
           LDI T     PP   +E  EE+++E  ER KE  + KE+
Sbjct: 121 LDIPT-----PPDVSREKAEEDLEETLERAKEAIEIKEK 154


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 26.3 bits (58), Expect = 6.8
 Identities = 7/26 (26%), Positives = 18/26 (69%)

Query: 257 EPQEPEEEVKEEEERKKEEEKKKEED 282
           E +EPE++  +E++   +++  K++D
Sbjct: 46  EGEEPEDDDDDEDDDDDDDKDDKDDD 71


>gnl|CDD|224882 COG1971, COG1971, Predicted membrane protein [Function unknown].
          Length = 190

 Score = 27.6 bits (62), Expect = 7.0
 Identities = 13/71 (18%), Positives = 26/71 (36%), Gaps = 17/71 (23%)

Query: 272 KKEEEKKKEEDGQHQQD---------------KARSVTFMTLGIVMVSMMILMTLLTMFL 316
           K EE++  +   +H  +                A  V    LG+ ++   + + L+T+ L
Sbjct: 90  KNEEDEFVDPAEKHDLNFKELILLAIATSIDALAVGVGLAFLGVNILLAAVAIGLITLIL 149

Query: 317 VLFG--LGYYF 325
              G  +G   
Sbjct: 150 SALGAIIGRKL 160


>gnl|CDD|192955 pfam12214, TPX2_importin, Cell cycle regulated microtubule
           associated protein.  This domain is found in eukaryotes.
           This domain is typically between 127 to 182 amino acids
           in length. This domain is found associated with
           pfam06886. This domain is found in the protein TPX2
           (a.k.a p100) which is involved in cell cycling. It is
           only expressed between the start of the S phase and
           completion of cytokinesis. The microtubule-associated
           protein TPX2 has been reported to be crucial for mitotic
           spindle formation. This domain is close to the C
           terminal of TPX2. The protein importin alpha regulates
           the activity of TPX2 by binding to the nuclear
           localisation signal in this domain.
          Length = 176

 Score = 27.5 bits (61), Expect = 7.2
 Identities = 15/72 (20%), Positives = 26/72 (36%), Gaps = 8/72 (11%)

Query: 221 VYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKE 280
           +Y  +A   NK + +  +    K           PK P E +E   E  +R +E   K +
Sbjct: 38  IYKFKARPLNKKIFEAPSPPLPK--------KPTPKLPTEFQEFHLETSKRAQERSSKAK 89

Query: 281 EDGQHQQDKARS 292
              +     +R 
Sbjct: 90  TASEQFTFHSRP 101


>gnl|CDD|236833 PRK11059, PRK11059, regulatory protein CsrD; Provisional.
          Length = 640

 Score = 28.3 bits (64), Expect = 7.4
 Identities = 8/31 (25%), Positives = 12/31 (38%)

Query: 303 VSMMILMTLLTMFLVLFGLGYYFSYLKTHFV 333
            + + L+  L MF+ L G    F  L     
Sbjct: 12  SAFVTLLVALAMFVTLLGCTLSFYQLTQEKQ 42


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 26.8 bits (60), Expect = 7.4
 Identities = 8/32 (25%), Positives = 18/32 (56%)

Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKE 280
               +  ++ +   EE  E + ++KE++KKK+
Sbjct: 54  VTVTRGGRKGKTAREEAVEAKAKEKEKKKKKK 85



 Score = 26.8 bits (60), Expect = 8.3
 Identities = 11/35 (31%), Positives = 19/35 (54%)

Query: 246 IVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKE 280
            VT    K     +E  E   +E+E+KK+++K+ E
Sbjct: 55  TVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89


>gnl|CDD|198412 cd10430, BI-1, BAX inhibitor (BI)-1.  Mammalian members of the BAX
           inhibitor (BI)-1 like family of small transmembrane
           proteins have been shown to have an antiapoptotic effect
           either by stimulating the antiapoptotic function of
           Bcl-2, a well-characterized oncogene, or by inhibiting
           the proapoptotic effect of Bax, another member of the
           Bcl-2 family. Their broad tissue distribution and high
           degree of conservation suggests an important regulatory
           role. In plants, BI-1 like proteins play a role in
           pathogen resistance.
          Length = 213

 Score = 27.6 bits (62), Expect = 7.6
 Identities = 6/31 (19%), Positives = 18/31 (58%)

Query: 290 ARSVTFMTLGIVMVSMMILMTLLTMFLVLFG 320
           A+   ++ LG ++ S + ++ L+++  +  G
Sbjct: 118 AKRREYLYLGGLLSSALSILLLVSLANIFGG 148


>gnl|CDD|220692 pfam10324, 7TM_GPCR_Srw, Serpentine type 7TM GPCR chemoreceptor
           Srw.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srw is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'. The genes encoding Srw do not appear to be
           under as strong an adaptive evolutionary pressure as
           those of Srz.
          Length = 317

 Score = 27.9 bits (63), Expect = 7.7
 Identities = 17/75 (22%), Positives = 25/75 (33%), Gaps = 17/75 (22%)

Query: 276 EKKKEEDGQHQQDKARS---VTFMT---------LGIVMVSMMILMTLLTMFLVLFGLGY 323
           +K ++          R+   V FMT         LGI  +          + L+L  L  
Sbjct: 223 KKSRKNLSSSSNKSDRTTKLVIFMTISFFIAEFPLGISYIIQFFFYDDPGIRLILTYLSV 282

Query: 324 YFSYLKT-----HFV 333
            FS L T     H +
Sbjct: 283 IFSLLLTINAISHCL 297


>gnl|CDD|214444 MTH00165, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 573

 Score = 28.2 bits (64), Expect = 7.8
 Identities = 11/30 (36%), Positives = 18/30 (60%), Gaps = 2/30 (6%)

Query: 302 MVSMMILMTLLTMFLVLFG--LGYYFSYLK 329
           M+ +   + LLT+F++L G  LGY  S + 
Sbjct: 447 MICLPFYLKLLTLFVILLGGWLGYELSLMS 476


>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family.  This is a
           family of transcription elongation factors which
           includes those referred to as Bex proteins as well as
           those named TCEAL7. Bex1 was shown to be a novel link
           between neurotrophin signalling, the cell cycle, and
           neuronal differentiation, suggesting it might function
           by coordinating internal cellular states with the
           ability of cells to respond to external signals. TCEAL7
           has been shown negatively to regulate the NF-kappaB
           pathway, hence being important in ovarian cancer as it
           one of the genes frequently downregulated in this
           cancer. A closely related protein, TFIIS/TCEA, found in
           pfam07500 is involved in transcription elongation and
           transcript fidelity. TFIIS/TCEA promotes 3'
           endoribonuclease activity of RNA polymerase II (pol II)
           and allows pol II to bypass transcript pause or 'arrest'
           during elongation process. It is thus possible that BEX
           is also acting in this way.
          Length = 97

 Score = 26.5 bits (59), Expect = 7.8
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 262 EEEVKEEEERKKEEEKKKEEDGQHQQDK 289
           E E K E E K+EEEK+  E+G+ ++ +
Sbjct: 7   ENEGKPESEPKEEEEKRPLEEGEGKKPE 34


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This is
           the highly conserved family of the major tail subunit
           protein.
          Length = 121

 Score = 27.0 bits (59), Expect = 7.9
 Identities = 8/26 (30%), Positives = 20/26 (76%)

Query: 264 EVKEEEERKKEEEKKKEEDGQHQQDK 289
           E+KEE ++ + E +K++++ + ++DK
Sbjct: 47  EMKEERDKMETEREKRDKESKEERDK 72


>gnl|CDD|177157 MTH00094, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 403

 Score = 27.9 bits (63), Expect = 8.0
 Identities = 8/39 (20%), Positives = 16/39 (41%), Gaps = 3/39 (7%)

Query: 292 SVTFMT--LGIVMVSMMILMTLLTMFLVLFGLGYYFSYL 328
           S++F +  L I    M    +   +F     + +Y+S  
Sbjct: 327 SLSFFSEFLIISNSFMKNKFSFFFIFFYFL-VSFYYSLF 364


>gnl|CDD|226584 COG4099, COG4099, Predicted peptidase [General function prediction
           only].
          Length = 387

 Score = 28.0 bits (62), Expect = 8.0
 Identities = 21/104 (20%), Positives = 38/104 (36%), Gaps = 15/104 (14%)

Query: 7   PRFSTDPETVERQLVTSIEEWRKELKLQEMI------LLGHSFGGYLAFAYAIQYPDRVK 60
           P F+   E     L+  I+   + L     I      ++G S GG+  +A A ++PD   
Sbjct: 236 PIFADSEEKTLLYLIEKIDLILEVLASTYNIDRSRIYVIGLSRGGFGTWALAEKFPDFFA 295

Query: 61  HLIL-ADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPV 103
             +  A            +   K P+W       + + + + PV
Sbjct: 296 AAVPIAGGGDRVYLV---RTLKKAPIWV-----FHSSDDKVIPV 331


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 28.1 bits (63), Expect = 8.1
 Identities = 9/32 (28%), Positives = 21/32 (65%)

Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
           + EE  +E+EE++  +E+K+    + Q++K +
Sbjct: 549 DKEELQREKEEKEALKEQKRLRKLKKQEEKKK 580


>gnl|CDD|221747 pfam12740, Chlorophyllase2, Chlorophyllase enzyme.  This family
           consists of several chlorophyllase and chlorophyllase-2
           (EC:3.1.1.14) enzymes. Chlorophyllase (Chlase) is the
           first enzyme involved in chlorophyll (Chl) degradation
           and catalyzes the hydrolysis of an ester bond to yield
           chlorophyllide and phytol. The family includes both
           plant and Amphioxus members.
          Length = 258

 Score = 27.8 bits (62), Expect = 8.1
 Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 4/39 (10%)

Query: 33  LQEMILLGHSFGGYLAFAYAIQYPDRV----KHLILADP 67
             ++ L GHS GG +AFA A+   + +      LI  DP
Sbjct: 90  FSKLALAGHSRGGKVAFALALGNANVLHLKFSALIGVDP 128


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 28.2 bits (64), Expect = 8.2
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 71  PQKSIDPQKASKIPLWARMIGNLYKNFNPLWPV 103
           P  +       K  LWARM  +   +F  L+ V
Sbjct: 37  PTSASSVPAKEKGALWARMGVDPDHDFEALYEV 69


>gnl|CDD|227502 COG5175, MOT2, Transcriptional repressor [Transcription].
          Length = 480

 Score = 28.1 bits (62), Expect = 8.3
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
              P+E + E+  +EERK  E+++KE +GQ
Sbjct: 72  TLSPEELKMELARKEERKMREKERKEAEGQ 101


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 27.4 bits (60), Expect = 8.5
 Identities = 9/28 (32%), Positives = 20/28 (71%)

Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEE 281
           P ++P++  EE++++ E   E+E++ EE
Sbjct: 35  PEQKPEDIVEELEDQPEEPPEQEEENEE 62


>gnl|CDD|235293 PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional.
          Length = 296

 Score = 27.8 bits (63), Expect = 8.6
 Identities = 8/34 (23%), Positives = 17/34 (50%), Gaps = 5/34 (14%)

Query: 296 MTLGIVM--VSMMILMTL---LTMFLVLFGLGYY 324
           +  G+V+  +  ++L      L  +L L G+ +Y
Sbjct: 94  LIFGLVLGVLGFLLLGLFVNPLAAWLTLAGIFFY 127


>gnl|CDD|214867 smart00864, Tubulin, Tubulin/FtsZ family, GTPase domain.  This
          domain is found in all tubulin chains, as well as the
          bacterial FtsZ family of proteins. These proteins are
          involved in polymer formation. Tubulin is the major
          component of microtubules, while FtsZ is the
          polymer-forming protein of bacterial cell division, it
          is part of a ring in the middle of the dividing cell
          that is required for constriction of cell membrane and
          cell envelope to yield two daughter cells. FtsZ and
          tubulin are GTPases, this entry is the GTPase domain.
          FtsZ can polymerise into tubes, sheets, and rings in
          vitro and is ubiquitous in bacteria and archaea.
          Length = 192

 Score = 27.5 bits (62), Expect = 8.8
 Identities = 10/34 (29%), Positives = 17/34 (50%)

Query: 12 DPETVERQLVTSIEEWRKELKLQEMILLGHSFGG 45
          DPE        S++E R+EL+  + + +    GG
Sbjct: 61 DPEVGREAAEESLDEIREELEGADGVFITAGMGG 94


>gnl|CDD|236879 PRK11195, PRK11195, lysophospholipid transporter LplT; Provisional.
          Length = 393

 Score = 27.9 bits (63), Expect = 8.9
 Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 7/43 (16%)

Query: 290 ARSVTFMTLGIVMVSMMILMTLLT------MFLVLFG-LGYYF 325
              +  +  GI+M  +++LM L          L+L G LG +F
Sbjct: 268 ETVLRVLPAGILMGLVVLLMALQHSLLPAYPLLILIGALGGFF 310


>gnl|CDD|184853 PRK14851, PRK14851, hypothetical protein; Provisional.
          Length = 679

 Score = 27.9 bits (62), Expect = 9.1
 Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 10/52 (19%)

Query: 112 WVVEKM--------RPDLPKKFTPVLKED--SSAITEYIFQCNVQAPSGESA 153
           W V+ +        R   P+    V   D     + +YI Q  +QAPSG++ 
Sbjct: 293 WYVQNVLIRNTDQARVREPEPPVSVAHFDSLPQGVGDYILQAGIQAPSGDNV 344


>gnl|CDD|234859 PRK00893, PRK00893, aspartate carbamoyltransferase regulatory
           subunit; Reviewed.
          Length = 152

 Score = 27.1 bits (61), Expect = 9.3
 Identities = 9/18 (50%), Positives = 13/18 (72%)

Query: 113 VVEKMRPDLPKKFTPVLK 130
           VVEK + +LP++   VLK
Sbjct: 90  VVEKRKVELPEEIEGVLK 107


>gnl|CDD|219340 pfam07224, Chlorophyllase, Chlorophyllase.  This family consists of
           several plant specific Chlorophyllase proteins
           (EC:3.1.1.14). Chlorophyllase (Chlase) is the first
           enzyme involved in chlorophyll (Chl) degradation and
           catalyzes the hydrolysis of ester bond to yield
           chlorophyllide and phytol.
          Length = 307

 Score = 27.5 bits (61), Expect = 9.4
 Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 30  ELKLQEMILLGHSFGGYLAFAYAIQYPDRVK--HLILADP 67
           E  L ++ L GHS GG  AFA A+ Y   V    LI  DP
Sbjct: 117 EANLSKLALSGHSRGGKTAFALALGYSLDVTFSALIGVDP 156


>gnl|CDD|215869 pfam00344, SecY, SecY translocase. 
          Length = 340

 Score = 27.5 bits (62), Expect = 9.7
 Identities = 8/21 (38%), Positives = 12/21 (57%), Gaps = 1/21 (4%)

Query: 294 TFMTLGIV-MVSMMILMTLLT 313
           +   LGI   ++  I+M LLT
Sbjct: 1   SIFALGIGPYITASIIMQLLT 21


>gnl|CDD|225375 COG2819, COG2819, Predicted hydrolase of the alpha/beta superfamily
           [General function prediction only].
          Length = 264

 Score = 27.4 bits (61), Expect = 9.8
 Identities = 15/52 (28%), Positives = 20/52 (38%), Gaps = 1/52 (1%)

Query: 16  VERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADP 67
           +  QL   IE  R     +   ++GHS GG       + YPD      L  P
Sbjct: 120 LTEQLKPFIEA-RYRTNSERTAIIGHSLGGLFVLFALLTYPDCFGRYGLISP 170


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 27.7 bits (62), Expect = 9.8
 Identities = 9/49 (18%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 241 DEKL-DIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
           ++++  +       P K   E +++ +    R+K++EKK + + +  Q+
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 26.5 bits (58), Expect = 9.9
 Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 3/40 (7%)

Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
             +  E EEEV E++E   EEE + EE+   ++++    T
Sbjct: 47  GDDEMEEEEEVDEDDE---EEEGEGEEEEGEEEEETEGAT 83


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,280,156
Number of extensions: 1665371
Number of successful extensions: 6984
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6139
Number of HSP's successfully gapped: 470
Length of query: 335
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 238
Effective length of database: 6,635,264
Effective search space: 1579192832
Effective search space used: 1579192832
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.3 bits)