RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13998
         (319 letters)



>gnl|CDD|225926 COG3391, COG3391, Uncharacterized conserved protein [Function
           unknown].
          Length = 381

 Score = 64.9 bits (158), Expect = 8e-12
 Identities = 63/243 (25%), Positives = 100/243 (41%), Gaps = 40/243 (16%)

Query: 55  FTWPRGIAVGPDNSIV-VADSSNHRVQVCFPHFDLKTNCV--FLAFTW-PRGIAVGPDN- 109
             +P G+AV P  + V V    ++ V V     D  TN V   +     P G+AV PD  
Sbjct: 73  GVYPAGVAVNPAGNKVYVTTGDSNTVSV----IDTATNTVLGSIPVGLGPVGLAVDPDGK 128

Query: 110 SIVVADS--SNHRVQVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVS-NTNRVIVSDSNNH 166
            + VA++   N+ V V  +    V     +GN       P  +AV  + N+V V++S+++
Sbjct: 129 YVYVANAGNGNNTVSVIDAATNKVTATIPVGN------TPTGVAVDPDGNKVYVTNSDDN 182

Query: 167 RVQIFDVNGRVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVS-NTNRVIVSDSNNHRVQIF 225
            V + D +G             GS+G+  G    P  IAV  + NRV V++  +    + 
Sbjct: 183 TVSVIDTSGNSV--------VRGSVGSLVGVGTGPAGIAVDPDGNRVYVANDGSGSNNVL 234

Query: 226 DVNGRVITSFGSEGSEEGQLKFPRGVAVDDQG-----------YISVGDSGNNRIQIFTP 274
            ++        ++         PRGVAVD  G            +SV D   +R+    P
Sbjct: 235 KIDTATGNVTATDLPVGSG--APRGVAVDPAGKAAYVANSQGGTVSVIDGATDRVVKTGP 292

Query: 275 DGQ 277
            G 
Sbjct: 293 TGN 295



 Score = 46.4 bits (110), Expect = 1e-05
 Identities = 29/125 (23%), Positives = 54/125 (43%), Gaps = 17/125 (13%)

Query: 200 HPHYIAVSN-TNRVIVSDSNNHRVQIFDVNGRVITSFGSEGSEEGQLKFPRGVAVDDQG- 257
           +P  +AV+   N+V V+  +++ V + D     +      G        P G+AVD  G 
Sbjct: 75  YPAGVAVNPAGNKVYVTTGDSNTVSVIDTATNTVLGSIPVGL------GPVGLAVDPDGK 128

Query: 258 --YISVGDSGNNRIQIFTPDGQFLRAFGCWGSGDGEFKGLEGVAVMSNGN-ILVCDRENH 314
             Y++   +GNN + +       + A    G+         GVAV  +GN + V + +++
Sbjct: 129 YVYVANAGNGNNTVSVIDAATNKVTATIPVGNTPT------GVAVDPDGNKVYVTNSDDN 182

Query: 315 RIQVF 319
            + V 
Sbjct: 183 TVSVI 187



 Score = 39.1 bits (91), Expect = 0.002
 Identities = 37/142 (26%), Positives = 47/142 (33%), Gaps = 23/142 (16%)

Query: 58  PRGIAVGPD-NSIVVADSSNHRVQVCFPHFDLKTNCVFLA--------FTWPRGIAVGPD 108
           P G+AV PD N + V +S ++ V V     D   N V            T P GIAV PD
Sbjct: 162 PTGVAVDPDGNKVYVTNSDDNTVSV----IDTSGNSVVRGSVGSLVGVGTGPAGIAVDPD 217

Query: 109 NSIVVA---DSSNHRVQVFQSDGTFVGKFGSMGNKA---GQLEHPH----YIAVSNTNRV 158
            + V      S ++ V    +    V             G    P     Y+A S    V
Sbjct: 218 GNRVYVANDGSGSNNVLKIDTATGNVTATDLPVGSGAPRGVAVDPAGKAAYVANSQGGTV 277

Query: 159 IVSDSNNHRVQIFDVNGRVFQS 180
            V D    RV      G     
Sbjct: 278 SVIDGATDRVVKTGPTGNEALG 299


>gnl|CDD|215497 PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family
           protein.
          Length = 1057

 Score = 60.6 bits (147), Expect = 3e-10
 Identities = 51/211 (24%), Positives = 85/211 (40%), Gaps = 30/211 (14%)

Query: 96  AFTWPRGIAVGPD-NSIVVADSSNHRVQVFQ-----------SDGTFVGKFGSMGNKAG- 142
           +F  P GI++ PD   + +ADS +  ++               D TF       G+  G 
Sbjct: 738 SFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGV 797

Query: 143 ----QLEHPHYIAVSNTNRVIVSDSNNHRVQIFD-VNGRVFQSDGTFVGKFGSMGNKAGQ 197
                L+HP  +  +   ++ V+DS NH+++  D    RV    GT    F        Q
Sbjct: 798 GSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQ 857

Query: 198 LEHPHYIAVSNTNRVIVSDSNNHRVQIFDVN---GRVITSFGSEGSE-----EGQLKFPR 249
           L  P  +A+    R+ V+D+NN  ++  D+N      I +   +G +        LK  R
Sbjct: 858 LSEPAGLALGENGRLFVADTNNSLIRYLDLNKGEAAEILTLELKGVQPPRPKSKSLKRLR 917

Query: 250 GVAVDDQGYISV----GDSGNNRIQIFTPDG 276
             +  D   I V       G+ +++I  P G
Sbjct: 918 RRSSADTQVIKVDGVTSLEGDLQLKISLPPG 948



 Score = 48.3 bits (115), Expect = 3e-06
 Identities = 50/196 (25%), Positives = 88/196 (44%), Gaps = 25/196 (12%)

Query: 103 IAVGPDNSIVVADSSNHRVQVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVS-NTNRVIVS 161
           IA+   + I   + S+   +VF  DG      GS G  +     P  I++S +   + ++
Sbjct: 699 IAMAGQHQIWEYNISDGVTRVFSGDGYERNLNGSSG-TSTSFAQPSGISLSPDLKELYIA 757

Query: 162 DSNNHRVQIFDVN---------GRVFQSDGTFVGKFGS---MGNKAGQLEHPHYIAVSNT 209
           DS +  ++  D+          G    SD  F  KFG    +G++   L+HP  +  +  
Sbjct: 758 DSESSSIRALDLKTGGSRLLAGGDPTFSDNLF--KFGDHDGVGSEV-LLQHPLGVLCAKD 814

Query: 210 NRVIVSDSNNHRVQIFD-VNGRVIT--SFGSEGSEEG-----QLKFPRGVAVDDQGYISV 261
            ++ V+DS NH+++  D    RV T    G  G ++G     QL  P G+A+ + G + V
Sbjct: 815 GQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFV 874

Query: 262 GDSGNNRIQIFTPDGQ 277
            D+ N+ I+    +  
Sbjct: 875 ADTNNSLIRYLDLNKG 890



 Score = 42.5 bits (100), Expect = 3e-04
 Identities = 60/252 (23%), Positives = 104/252 (41%), Gaps = 42/252 (16%)

Query: 108 DNSIVVADSSNHRVQVFQSDGTFVGKFGSMGN---KAGQLE-----HPHYIAV-SNTNRV 158
           +N + ++DS+++R+ V   DG F+ + GS G    + G  E      P  +A  +  N +
Sbjct: 579 NNRLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLL 638

Query: 159 IVSDSNNHRVQIFD-VNGRVFQSDGTFVGKFGSMGNKAGQLE----------HPH----Y 203
            V+D+ NH ++  D VN  V    G         G K G  +           P     Y
Sbjct: 639 YVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVY 698

Query: 204 IAVSNTNRVIVSDSNNHRVQIFDVNGRVITSFGSEGSEEGQLKFPRGVAVD---DQGYIS 260
           IA++  +++   + ++   ++F  +G      GS G+     + P G+++     + YI+
Sbjct: 699 IAMAGQHQIWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQ-PSGISLSPDLKELYIA 757

Query: 261 VGDSGNNRIQIFTPDGQFLRAFG-------CWGSGDGEFKGLE-------GVAVMSNGNI 306
             +S + R       G  L A G        +  GD +  G E       GV    +G I
Sbjct: 758 DSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQI 817

Query: 307 LVCDRENHRIQV 318
            V D  NH+I+ 
Sbjct: 818 YVADSYNHKIKK 829



 Score = 37.9 bits (88), Expect = 0.006
 Identities = 29/96 (30%), Positives = 40/96 (41%), Gaps = 25/96 (26%)

Query: 43  FKIGSR---GSEPGCFTWPRGIAVGPDNSIVVADSSNHRVQVCFPHFDLKTNCVF-LA-- 96
           FK G     GSE      P G+    D  I VADS NH+++      D  T  V  LA  
Sbjct: 789 FKFGDHDGVGSE-VLLQHPLGVLCAKDGQIYVADSYNHKIKK----LDPATKRVTTLAGT 843

Query: 97  --------------FTWPRGIAVGPDNSIVVADSSN 118
                          + P G+A+G +  + VAD++N
Sbjct: 844 GKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNN 879


>gnl|CDD|110440 pfam01436, NHL, NHL repeat.  The NHL (NCL-1, HT2A and LIN-41)
           repeat is found in multiple tandem copies. It is about
           40 residues long and resembles the WD repeat pfam00400.
           The repeats have a catalytic activity in bovine PAM
           (peptidyl-glycine alpha-amidating monooxygenase),
           proteolysis has shown that the
           Peptidyl-alpha-hydroxyglycine alpha-amidating lyase
           (PAL) activity is localised to the repeats. Human E3
           ubiquitin-protein ligase TRIM32 interacts with the
           activation domain of Tat. This interaction is me diated
           by the NHL repeats.
          Length = 28

 Score = 45.5 bits (109), Expect = 4e-07
 Identities = 17/28 (60%), Positives = 19/28 (67%)

Query: 97  FTWPRGIAVGPDNSIVVADSSNHRVQVF 124
           F  P G+AV  D  + VADS NHRVQVF
Sbjct: 1   FNRPHGVAVDSDGDVYVADSENHRVQVF 28



 Score = 42.8 bits (102), Expect = 3e-06
 Identities = 16/27 (59%), Positives = 18/27 (66%)

Query: 55 FTWPRGIAVGPDNSIVVADSSNHRVQV 81
          F  P G+AV  D  + VADS NHRVQV
Sbjct: 1  FNRPHGVAVDSDGDVYVADSENHRVQV 27



 Score = 42.4 bits (101), Expect = 4e-06
 Identities = 16/28 (57%), Positives = 20/28 (71%)

Query: 292 FKGLEGVAVMSNGNILVCDRENHRIQVF 319
           F    GVAV S+G++ V D ENHR+QVF
Sbjct: 1   FNRPHGVAVDSDGDVYVADSENHRVQVF 28



 Score = 40.9 bits (97), Expect = 2e-05
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 144 LEHPHYIAVSNTNRVIVSDSNNHRVQIF 171
              PH +AV +   V V+DS NHRVQ+F
Sbjct: 1   FNRPHGVAVDSDGDVYVADSENHRVQVF 28



 Score = 40.9 bits (97), Expect = 2e-05
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 198 LEHPHYIAVSNTNRVIVSDSNNHRVQIF 225
              PH +AV +   V V+DS NHRVQ+F
Sbjct: 1   FNRPHGVAVDSDGDVYVADSENHRVQVF 28



 Score = 37.4 bits (88), Expect = 2e-04
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 245 LKFPRGVAVDDQGYISVGDSGNNRIQIF 272
              P GVAVD  G + V DS N+R+Q+F
Sbjct: 1   FNRPHGVAVDSDGDVYVADSENHRVQVF 28


>gnl|CDD|219847 pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region.  This
           family describes a region that is found in proteins
           expressed by a variety of eukaryotic and prokaryotic
           species. These proteins include various enzymes, such as
           senescence marker protein 30 (SMP-30), gluconolactonase
           and luciferin-regenerating enzyme (LRE). SMP-30 is known
           to hydrolyse diisopropyl phosphorofluoridate in the
           liver, and has been noted as having sequence similarity,
           in the region described in this family, with PON1 and
           LRE.
          Length = 245

 Score = 41.1 bits (97), Expect = 3e-04
 Identities = 24/86 (27%), Positives = 36/86 (41%), Gaps = 15/86 (17%)

Query: 204 IAVSNTNR-VIVSDSNNHRVQIFDV--------NGRVITSFGSEGSEEGQLKFPRGVAVD 254
           +A S   + +  +DS   R+  FD         N RV   F     E      P G+AVD
Sbjct: 138 LAWSPDGKTLYFADSPTRRIWAFDYDADGGLISNRRVFADFKDGDGE------PDGMAVD 191

Query: 255 DQGYISVGDSGNNRIQIFTPDGQFLR 280
            +G + V   G  ++  + PDG+ L 
Sbjct: 192 AEGNVWVARWGGGKVVRYDPDGKLLG 217



 Score = 34.9 bits (81), Expect = 0.032
 Identities = 22/86 (25%), Positives = 34/86 (39%), Gaps = 12/86 (13%)

Query: 58  PRGIAVGPDNSIV-VADSSNHRVQVCFPHFDLKTNC-----VFLAFT----WPRGIAVGP 107
             G+A  PD   +  ADS   R+      +D          VF  F      P G+AV  
Sbjct: 135 SNGLAWSPDGKTLYFADSPTRRIWAF--DYDADGGLISNRRVFADFKDGDGEPDGMAVDA 192

Query: 108 DNSIVVADSSNHRVQVFQSDGTFVGK 133
           + ++ VA     +V  +  DG  +G+
Sbjct: 193 EGNVWVARWGGGKVVRYDPDGKLLGE 218



 Score = 29.9 bits (68), Expect = 1.6
 Identities = 27/132 (20%), Positives = 39/132 (29%), Gaps = 19/132 (14%)

Query: 62  AVGPDNSIVVADSSNHRVQVCFP------HFDLKTNCVFLAFTWPRGIAVGPDNSIV-VA 114
            V PD                 P        D K   V    T   G+A  PD   +  A
Sbjct: 91  KVDPDGRFWFGTMGFDIAPGGEPGALYRLDPDGKVERVLDGITISNGLAWSPDGKTLYFA 150

Query: 115 DSSNHRVQVFQ-SDGTFVGKFGSMGNK------AGQLEHPHYIAVSNTNRVIVSDSNNHR 167
           DS   R+  F           G + N+            P  +AV     V V+     +
Sbjct: 151 DSPTRRIWAFDYDADG-----GLISNRRVFADFKDGDGEPDGMAVDAEGNVWVARWGGGK 205

Query: 168 VQIFDVNGRVFQ 179
           V  +D +G++  
Sbjct: 206 VVRYDPDGKLLG 217


>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score = 40.4 bits (95), Expect = 7e-04
 Identities = 50/273 (18%), Positives = 101/273 (36%), Gaps = 51/273 (18%)

Query: 59  RGIAVGPDNSIVVADSSNHRVQVCFPHFDLKTNCVFLAFT----WPRGIAVGPDNSIVVA 114
             +A  PD  ++   S +  ++V    +DL+T  +           R +A   D + + +
Sbjct: 13  TCVAFSPDGKLLATGSGDGTIKV----WDLETGELLRTLKGHTGPVRDVAASADGTYLAS 68

Query: 115 DSSNHRVQVFQ-SDGTFVGKFGSMGNKAGQLEHPHYI---AVSNTNRVIVSDSNNHRVQI 170
            SS+  ++++    G  V              H  Y+   A S   R++ S S +  +++
Sbjct: 69  GSSDKTIRLWDLETGECVRTLTG---------HTSYVSSVAFSPDGRILSSSSRDKTIKV 119

Query: 171 FDVNGRVFQSDGTFVGKFGSMGNKAGQLEHPHYI---AVSNTNRVIVSDSNNHRVQIFDV 227
           +DV      +  T  G             H  ++   A S     + S S +  ++++D+
Sbjct: 120 WDVETGKCLT--TLRG-------------HTDWVNSVAFSPDGTFVASSSQDGTIKLWDL 164

Query: 228 -NGRVITSFGSEGSEEGQLKFPRGVAVDDQGYISVGDSGNNRIQIFTPDGQFLRAFGCWG 286
             G+ + +        G       VA    G   +  S +  I+++           C G
Sbjct: 165 RTGKCVATL------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK-----CLG 213

Query: 287 SGDGEFKGLEGVAVMSNGNILVCDRENHRIQVF 319
           +  G   G+  VA   +G +L    E+  I+V+
Sbjct: 214 TLRGHENGVNSVAFSPDGYLLASGSEDGTIRVW 246



 Score = 38.9 bits (91), Expect = 0.002
 Identities = 23/144 (15%), Positives = 53/144 (36%), Gaps = 23/144 (15%)

Query: 146 HPHYI---AVSNTNRVIVSDSNNHRVQIFDVNGRVFQSDGTFVGKFGSMGNKAGQLEHPH 202
           H   +   A S   +++ + S +  ++++D+         T  G  G + +         
Sbjct: 8   HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLR--TLKGHTGPVRD--------- 56

Query: 203 YIAVSNTNRVIVSDSNNHRVQIFDVN-GRVITSFGSEGSEEGQLKFPRGVAVDDQGYISV 261
            +A S     + S S++  ++++D+  G  + +        G   +   VA    G I  
Sbjct: 57  -VAASADGTYLASGSSDKTIRLWDLETGECVRTL------TGHTSYVSSVAFSPDGRILS 109

Query: 262 GDSGNNRIQIF-TPDGQFLRAFGC 284
             S +  I+++    G+ L     
Sbjct: 110 SSSRDKTIKVWDVETGKCLTTLRG 133



 Score = 33.1 bits (76), Expect = 0.14
 Identities = 36/178 (20%), Positives = 74/178 (41%), Gaps = 33/178 (18%)

Query: 57  WPRGIAVGPDNSIVVADSSNHRVQVCFPHFDLKTN-CV--FLAFT-WPRGIAVGPDNSIV 112
           W   +A  PD + V + S +  +++    +DL+T  CV      T     +A  PD   +
Sbjct: 137 WVNSVAFSPDGTFVASSSQDGTIKL----WDLRTGKCVATLTGHTGEVNSVAFSPDGEKL 192

Query: 113 VADSSNHRVQVF-QSDGTFVGKFGSMGNKAGQLEHPHYIAVSNTNRVIVSDSNNHRVQIF 171
           ++ SS+  ++++  S G  +G      N        + +A S    ++ S S +  ++++
Sbjct: 193 LSSSSDGTIKLWDLSTGKCLGTLRGHENGV------NSVAFSPDGYLLASGSEDGTIRVW 246

Query: 172 DVNGRVFQSDGTFVGKFGSMGNKAGQLEHPHYI---AVSNTNRVIVSDSNNHRVQIFD 226
           D+  R  +   T  G             H + +   A S   + + S S +  ++I+D
Sbjct: 247 DL--RTGECVQTLSG-------------HTNSVTSLAWSPDGKRLASGSADGTIRIWD 289


>gnl|CDD|220674 pfam10282, Lactonase, Lactonase, 7-bladed beta-propeller.  This
           entry contains bacterial 6-phosphogluconolactonases
           (6PGL)YbhE-type (EC:3.1.1.31) which hydrolyse
           6-phosphogluconolactone to 6-phosphogluconate. The entry
           also contains the fungal muconate lactonising enzyme
           carboxy-cis,cis-muconate cyclase (EC:5.5.1.5) and
           muconate cycloisomerase (EC:5.5.1.1), which convert
           cis,cis-muconates to muconolactones and vice versa as
           part of the microbial beta-ketoadipate pathway.
           Structures of proteins in this family have revealed a
           7-bladed beta-propeller fold.
          Length = 344

 Score = 34.5 bits (80), Expect = 0.056
 Identities = 27/141 (19%), Positives = 54/141 (38%), Gaps = 22/141 (15%)

Query: 103 IAVGPDNSI---VVADSSNHRVQVFQSDG-----TFVGKFGSMGNKAGQLEHPHYIAVSN 154
           +AV PD      V     +  V  F+ D      T + +  + G        P +++V  
Sbjct: 42  LAVSPDGKTLYAVNETGEDGGVSAFRIDPDTGKLTLLNQVPTGGA------SPCHLSVDP 95

Query: 155 TNRVIVSDSNNH--RVQIFDVN--GRVFQSDGTFVGKFGSMGNKAGQ-LEHPHYIAVSNT 209
             R +   +N     V ++ ++  G +       V   GS  N   Q   H H + ++  
Sbjct: 96  DGRFLFV-ANYGGGSVSVYPLDADGSL-GEASQVVQHEGSGPNPERQEGPHAHSVVLTPD 153

Query: 210 NR-VIVSDSNNHRVQIFDVNG 229
            + ++V D    RV+++ ++ 
Sbjct: 154 GKFLVVPDLGTDRVRVYRLDD 174


>gnl|CDD|234377 TIGR03866, PQQ_ABC_repeats, PQQ-dependent catabolism-associated
           beta-propeller protein.  Members of this protein family
           consist of seven repeats each of the YVTN family
           beta-propeller repeat (see TIGR02276). Members occur
           invariably as part of a transport operon that is
           associated with PQQ-dependent catabolism of alcohols
           such as phenylethanol.
          Length = 300

 Score = 33.1 bits (76), Expect = 0.17
 Identities = 25/114 (21%), Positives = 43/114 (37%), Gaps = 21/114 (18%)

Query: 66  DNSIVVADSSNHRVQVCFPHFDLKTNCVFLAFTWPRGIAVGPDNSIV-VADSSNHRVQVF 124
           DN+I V D++   V   FP               PRGI    D  ++ V  S +  VQV 
Sbjct: 10  DNTISVIDTATLEVTRTFP-----------VGQRPRGITFSKDGKLLYVCASDSDTVQVI 58

Query: 125 QSD-GTFVGKFGSMGNKAGQLEHPHYIAVSNTNRVI-VSDSNNHRVQIFDVNGR 176
               G  +    S          P   A+    +++ +++ ++  V + D+  R
Sbjct: 59  DPATGEVLHTLPSG-------PDPELFALHPNGKILYIANEDDALVTVIDIETR 105



 Score = 28.9 bits (65), Expect = 3.7
 Identities = 15/49 (30%), Positives = 19/49 (38%), Gaps = 14/49 (28%)

Query: 43 FKIGSRGSEPGCFTWPRGIAVGPDNSIV-VADSSNHRVQVCFPHFDLKT 90
          F +G R         PRGI    D  ++ V  S +  VQV     D  T
Sbjct: 27 FPVGQR---------PRGITFSKDGKLLYVCASDSDTVQV----IDPAT 62


>gnl|CDD|236846 PRK11100, PRK11100, sensory histidine kinase CreC; Provisional.
          Length = 475

 Score = 32.1 bits (74), Expect = 0.35
 Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 213 IVSDSNNHRVQIFDVNGRVITSFGSEGSEEGQ 244
           I     ++RV + D  G V+  F SEG   GQ
Sbjct: 86  IRKTRVDYRVYVTDAQGIVL--FDSEGRAVGQ 115



 Score = 28.3 bits (64), Expect = 6.9
 Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 159 IVSDSNNHRVQIFDVNGRV-FQSDGTFVGK 187
           I     ++RV + D  G V F S+G  VG+
Sbjct: 86  IRKTRVDYRVYVTDAQGIVLFDSEGRAVGQ 115


>gnl|CDD|225921 COG3386, COG3386, Gluconolactonase [Carbohydrate transport and
           metabolism].
          Length = 307

 Score = 32.0 bits (73), Expect = 0.36
 Identities = 45/216 (20%), Positives = 72/216 (33%), Gaps = 54/216 (25%)

Query: 107 PDNSIVVADSSNHRVQVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVSNTNRVIVSDSNNH 166
               I   D    + +VF S G F    G++ +  G+L     IA  +  R++  D+   
Sbjct: 45  LGGRIHRLDPETGKKRVFPSPGGFSS--GALIDAGGRL-----IACEHGVRLLDPDTGGK 97

Query: 167 RVQIFDV---------NGRVFQSDGTFVGKFGSMGNK------------------AGQLE 199
              + +          N  V   DG     FG MG                     G + 
Sbjct: 98  ITLLAEPEDGLPLNRPNDGVVDPDGRIW--FGDMGYFDLGKSEERPTGSLYRVDPDGGVV 155

Query: 200 HP--HYIAVSNT-------NRVIVSDSNNHRVQIFDVNGRVITSFGSEGS-----EEGQL 245
                 + + N          + V+D+  +R+  +D++       G  G      E G  
Sbjct: 156 RLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPG-- 213

Query: 246 KFPRGVAVDDQGYI-SVGDSGNNRIQIFTPDGQFLR 280
             P G+AVD  G +      G  R+  F PDG+ L 
Sbjct: 214 -LPDGMAVDADGNLWVAAVWGGGRVVRFNPDGKLLG 248



 Score = 28.5 bits (64), Expect = 4.4
 Identities = 34/161 (21%), Positives = 52/161 (32%), Gaps = 36/161 (22%)

Query: 52  PGCFTWPRGIAVGPDNSIV-VADSSNHRV-QVCFPHFDLKTN--CVFLAFTWPRGIAVGP 107
               T P G+A  PD   + VAD+  +R+ +               F+ F    G+   P
Sbjct: 159 DDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGL---P 215

Query: 108 DNSIVVAD--------SSNHRVQVFQSDGTFVGK------------FGSMGNKAGQLEHP 147
           D   V AD            RV  F  DG  +G+            FG  G     L   
Sbjct: 216 DGMAVDADGNLWVAAVWGGGRVVRFNPDGKLLGEIKLPVKRPTNPAFG--GPDLNTL--- 270

Query: 148 HYIAVSNTNRVIVSDSNNHRVQIFDVNGRVFQSDGTFVGKF 188
            YI  + +    +  ++     +F +   V    G    +F
Sbjct: 271 -YITSARSGMSRMLTADPLGGGLFSLRLEV---KGLPEPRF 307


>gnl|CDD|197380 cd09971, SdiA-regulated, SdiA-regulated.  This model represents a
           bacterial family of proteins that may be regulated by
           SdiA, a member of the LuxR family of transcriptional
           regulators. The C-terminal domain included in the
           alignment forms a five-bladed beta-propeller structure.
           The X-ray structure of Escherichia coli yjiK (C-terminal
           domain) exhibits binding of calcium ions (Ca++) in what
           appears to be an evolutionarily conserved site. Sequence
           analysis suggests a distant relationship to proteins
           that are characterized as containing NHL-repeats. The
           latter also form beta-propeller structures, with several
           examples known to form six-bladed beta-propellers.
           Several of the six-bladed beta-propellers containing NHL
           repeats have been characterized functionally, including
           members with enzymatic functions that are dependent on
           metal ions. No functional characterization is available
           for this family of five-bladed propellers, though.
          Length = 242

 Score = 31.4 bits (72), Expect = 0.48
 Identities = 12/60 (20%), Positives = 24/60 (40%), Gaps = 6/60 (10%)

Query: 260 SVGDSGNNRIQIFTPDGQFLRAFGCWGSGDGEFKGLEGVAVMSNGNILVCDRENHRIQVF 319
           +V +     +++ + DG  LR        DG F   EG+  + +G   + D     + + 
Sbjct: 31  AVTNKPPALVEL-SKDGDVLRRI----PLDG-FSDPEGITYLGDGQFAIVDERLQALLIV 84


>gnl|CDD|222139 pfam13449, Phytase-like, Esterase-like activity of phytase.  This
           is a repeated domain that carries several highly
           conserved Glu and Asp residues indicating the likelihood
           that the domain incorporates the enzymic activity of the
           PLC-like phospho-diesterase part of the proteins.
          Length = 324

 Score = 31.1 bits (71), Expect = 0.69
 Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 14/84 (16%)

Query: 248 PRGVAVDDQGYISVGDSGN------NRIQIFTPDGQFLRAF--------GCWGSGDGEFK 293
           P G+A+D  G + +   G+        ++ F  DG+ LR          G    G     
Sbjct: 87  PEGIALDPGGTLWISSEGDRSRGIPPFLRRFDLDGRLLRELPLPAAFLPGPGKQGVRNNL 146

Query: 294 GLEGVAVMSNGNILVCDRENHRIQ 317
           G EG+ +  +G+ L    E    Q
Sbjct: 147 GFEGLTLSPDGDTLWVATEAALKQ 170


>gnl|CDD|223663 COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport
           and metabolism / Translation, ribosomal structure and
           biogenesis].
          Length = 152

 Score = 30.0 bits (68), Expect = 0.99
 Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 1/35 (2%)

Query: 91  NCVFLAFTWPRGIAVGPDNSIVVADSSNHRVQVFQ 125
             V+ A + P+  A+G    I+     NHRV+V+ 
Sbjct: 102 RVVYGA-SDPKTGAIGSLLDILKDPRLNHRVEVYG 135



 Score = 28.4 bits (64), Expect = 3.3
 Identities = 9/28 (32%), Positives = 14/28 (50%)

Query: 54  CFTWPRGIAVGPDNSIVVADSSNHRVQV 81
             + P+  A+G    I+     NHRV+V
Sbjct: 106 GASDPKTGAIGSLLDILKDPRLNHRVEV 133


>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966).  Family of
           plant proteins with unknown function.
          Length = 308

 Score = 29.7 bits (67), Expect = 2.1
 Identities = 12/23 (52%), Positives = 13/23 (56%), Gaps = 6/23 (26%)

Query: 142 GQLEHPHYIAVSNTNRVIVSDSN 164
           GQLEHPH+I       V VS  N
Sbjct: 11  GQLEHPHFI------EVHVSSRN 27



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 12/23 (52%), Positives = 13/23 (56%), Gaps = 6/23 (26%)

Query: 196 GQLEHPHYIAVSNTNRVIVSDSN 218
           GQLEHPH+I       V VS  N
Sbjct: 11  GQLEHPHFI------EVHVSSRN 27


>gnl|CDD|234491 TIGR04183, Por_Secre_tail, Por secretion system C-terminal sorting
           domain.  Species that include Porphyromonas gingivalis,
           Fibrobacter succinogenes, Flavobacterium johnsoniae,
           Cytophaga hutchinsonii, Gramella forsetii, Prevotella
           intermedia, and Salinibacter ruber average twenty or
           more copies of a C-terminal domain, modeled by This
           model, associated with sorting to the outer membrane and
           covalent modification.
          Length = 71

 Score = 27.0 bits (60), Expect = 3.0
 Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 1/47 (2%)

Query: 206 VSNTNRVIVSDSNNHRVQIFDVNGRVITSFGSEGSEEGQL-KFPRGV 251
                 +++S S   +V+I+D++G+V+       S    L     GV
Sbjct: 6   AGTLIIILLSSSEIAKVEIYDLSGKVVKKTTLNNSNSIDLSNLSSGV 52


>gnl|CDD|226708 COG4257, Vgb, Streptogramin lyase [Defense mechanisms].
          Length = 353

 Score = 29.1 bits (65), Expect = 3.2
 Identities = 20/86 (23%), Positives = 28/86 (32%), Gaps = 16/86 (18%)

Query: 45  IGSRGSEPGCFTW--------PRGIAVGPDNSIVVADSSNHRVQVCFPHFDLKTNCVFL- 95
            GS  +  G            P  +A  PD ++         +     H D  T  V   
Sbjct: 43  AGSSPAPDGSSAEFPVPNGSAPFDVAPAPDGAVWFTAQGTGAIG----HLDPATGEVETY 98

Query: 96  ---AFTWPRGIAVGPDNSIVVADSSN 118
              +   P GI VGPD S  + D+  
Sbjct: 99  PLGSGASPHGIVVGPDGSAWITDTGL 124


>gnl|CDD|215254 PLN02460, PLN02460, indole-3-glycerol-phosphate synthase.
          Length = 338

 Score = 29.0 bits (65), Expect = 3.8
 Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)

Query: 222 VQIFDVNGRVITSFGSEGSEEGQL-KFPRGVAVDDQGYISVGDSGNNRIQIFTPD 275
           V++  +N R + +F  + S   +L +  RG  + ++G I VG+SG     +FTPD
Sbjct: 252 VELIGINNRSLETFEVDISNTKKLLEGERGEQIREKGIIVVGESG-----LFTPD 301


>gnl|CDD|219247 pfam06977, SdiA-regulated, SdiA-regulated.  This family represents
           a conserved region approximately within a number of
           hypothetical bacterial proteins that may be regulated by
           SdiA, a member of the LuxR family of transcriptional
           regulators. Some family members contain the pfam01436
           repeat.
          Length = 249

 Score = 28.7 bits (65), Expect = 4.0
 Identities = 11/56 (19%), Positives = 21/56 (37%), Gaps = 7/56 (12%)

Query: 266 NNRIQI--FTPDGQFLRAFGCWGSGDGEFKGLEGVAVMSNGNILVCDRENHRIQVF 319
           N   ++   + +G  LR     G     F   EG+  + +G   + D  +  + V 
Sbjct: 42  NKPPELVELSLEGDVLRRIPLDG-----FSDPEGIEYIGDGRFAITDERDQALFVV 92


>gnl|CDD|223567 COG0493, GltD, NADPH-dependent glutamate synthase beta chain and
           related oxidoreductases [Amino acid transport and
           metabolism / General function prediction only].
          Length = 457

 Score = 28.4 bits (64), Expect = 5.6
 Identities = 30/130 (23%), Positives = 41/130 (31%), Gaps = 32/130 (24%)

Query: 170 IFDVNGRVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVSNTNRVIVSDSNNHRVQIFDVNG 229
           I +  GRV              G K G++E   Y+      R  V      +    D   
Sbjct: 333 IGNEGGRV-------------TGVKFGRVEPGEYVDGWGR-RGPVGVIGTEKTDAADT-- 376

Query: 230 RVITSFGSEGSEEGQLKFPRGVAVDDQGYISVGDSGNNRIQIFTP---------DGQFLR 280
            VI + G EG     L    G+ +D +G I V D    +  I  P          G  L 
Sbjct: 377 -VILAIGFEGDATDGLLLEFGLKLDKRGRIKV-DENLQQTSI--PGVFAGGDAVRGAALV 432

Query: 281 AFGCWGSGDG 290
               W   +G
Sbjct: 433 V---WAIAEG 439


>gnl|CDD|222750 pfam14433, SUKH-3, SUKH-3 immunity protein.  This family belongs to
           the SUKH superfamily and functions as immunity proteins
           in bacterial toxin systems.
          Length = 142

 Score = 27.3 bits (61), Expect = 6.7
 Identities = 8/33 (24%), Positives = 12/33 (36%)

Query: 158 VIVSDSNNHRVQIFDVNGRVFQSDGTFVGKFGS 190
            I    N H   + D +GRV+      +   G 
Sbjct: 94  PIGELDNGHSTLLVDESGRVYGLGDGGLWLLGE 126


>gnl|CDD|185138 PRK15216, PRK15216, putative fimbrial biosynthesis regulatory
           protein; Provisional.
          Length = 340

 Score = 27.7 bits (61), Expect = 9.3
 Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 15/75 (20%)

Query: 103 IAVGPDNSIVVADSSNHRVQV-FQSDGTFVGKFGSMGNKAGQLEHPHYIAVSNTNR---V 158
           I +GP++S+ +      +++V +QS         +M N+ G +E P+ I +S  N    V
Sbjct: 73  IGMGPNDSVKL------KLKVEWQSSSAI-----NMRNQNGVIEAPYKITISEINSLEAV 121

Query: 159 IVSDSNNHRVQIFDV 173
            VS    + V I ++
Sbjct: 122 TVSARGGYSVTIDNI 136


>gnl|CDD|215057 PLN00105, PLN00105, malate/L-lactate dehydrogenase; Provisional.
          Length = 330

 Score = 27.5 bits (61), Expect = 9.7
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 224 IFDVNGRVITSFGSEGSEEGQLKFPRGVAVDDQG 257
           + D+     + FG   ++    K PRGVA+D QG
Sbjct: 162 VLDMATSAYSFFGLLEAKTAGKKLPRGVAIDKQG 195


>gnl|CDD|234536 TIGR04312, choice_anch_B, choice-of-anchor B domain.  This domain,
           about 385 amino acids long, can have either of at least
           two types of C-terminal sorting signal. Members from
           Shewanella and allies have the rhombosortase target
           domain GlyGly-CTERM (TIGR03501), while members of the
           Bacteroidetes have the Por secretion system C-terminal
           domain (TIGR04183). Most other members lack any
           C-terminal extension, but in most of those, the normal
           signal sequence is replaced by a lipoprotein signal
           sequence. Member sequences show a region of local
           similarity to the LVIVD repeat sequence (pfam08309).
          Length = 364

 Score = 27.7 bits (62), Expect = 9.9
 Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 9/70 (12%)

Query: 150 IAVSNTNRVIVSDSNNHRVQIFDVNG-------RVFQSDGTFVGKFGSMGNKAGQLEHPH 202
           I V   +  IVS+++ H +Q+FD+         + F +D  +   FGS  N     E  +
Sbjct: 84  IKVYQNHAYIVSEASGHGMQVFDLTRLRNVNPPQTFTADAHYTD-FGSAHNIYINEESGY 142

Query: 203 -YIAVSNTNR 211
            YI  +NT  
Sbjct: 143 AYIVGTNTFG 152


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.138    0.423 

Gapped
Lambda     K      H
   0.267   0.0717    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,813,734
Number of extensions: 1490499
Number of successful extensions: 1324
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1294
Number of HSP's successfully gapped: 59
Length of query: 319
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 222
Effective length of database: 6,635,264
Effective search space: 1473028608
Effective search space used: 1473028608
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.5 bits)