RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7942
         (216 letters)



>gnl|CDD|233282 TIGR01130, ER_PDI_fam, protein disulfide isomerase, eukaryotic.
           This model represents eukaryotic protein disulfide
           isomerases retained in the endoplasmic reticulum (ER)
           and closely related forms. Some members have been
           assigned alternative or additional functions such as
           prolyl 4-hydroxylase and
           dolichyl-diphosphooligosaccharide-protein
           glycotransferase. Members of this family have at least
           two protein-disulfide domains, each similar to
           thioredoxin but with the redox-active disulfide in the
           motif PWCGHCK, and an ER retention signal at the extreme
           C-terminus (KDEL, HDEL, and similar motifs).
          Length = 462

 Score =  124 bits (313), Expect = 2e-33
 Identities = 65/195 (33%), Positives = 90/195 (46%), Gaps = 25/195 (12%)

Query: 11  GGRTAEDIVNWLLKKTGPPAKEFTSVDEIKAFIADSKVVVAGLFKDASSELAKTFNEIAS 70
           G R A+ IV ++ K++GP  KE  +V +++AF+AD  VVV G FKD  SEL  TF  +A 
Sbjct: 94  GPRDADGIVKYMKKQSGPAVKEIETVADLEAFLADDDVVVIGFFKDLDSELNDTFLSVAE 153

Query: 71  KVDDLVFDASSELAKTFNEIASKVDDLVFVTSTNADILAEYSVDDDTVAIFKKFDEGR-- 128
           K+ D+ F                     F  S++    A+     D+V +FK  DE    
Sbjct: 154 KLRDVYF--------------------FFAHSSDVAAFAKLGAFPDSVVLFKPKDEDEKF 193

Query: 129 --VNYEGPASDEAALRKFLSTQSLPLVVEFNHETAQKIFGGEIKSHLLVFFSKAAGHYES 186
             V+ E    D + L KF+  +SLPLV EF  ETA K F       L     ++   +E 
Sbjct: 194 SKVDGEMDT-DVSDLEKFIRAESLPLVGEFTQETAAKYFESGPLVVLYYNVDESLDPFEE 252

Query: 187 HFEPVQTVAKDFREK 201
                   AK FR K
Sbjct: 253 LRNRFLEAAKKFRGK 267


>gnl|CDD|222416 pfam13848, Thioredoxin_6, Thioredoxin-like domain. 
          Length = 183

 Score = 77.4 bits (191), Expect = 9e-18
 Identities = 40/126 (31%), Positives = 65/126 (51%), Gaps = 4/126 (3%)

Query: 78  DASSELAKTFNEIASKV-DDLVFVTSTNADILAEYSVDDDTVAIFKKFDEGRVNYEGPAS 136
           D    L + F + A ++  D+ F  + + ++  +Y + + T+ +F+KFDE +V Y G   
Sbjct: 3   DKDDPLYEIFEKAAKELRGDVRFGVTFDKEVAKKYGIKEPTILLFRKFDEKQVTYPGEDK 62

Query: 137 -DEAALRKFLSTQSLPLVVEFNHETAQKIFGGEIKSHLLVFFSKAAGHYESHFEPVQTVA 195
                L+KF+   SLPLV EF  E A+++F       LL+F  K +   E     V+ VA
Sbjct: 63  TTFDDLKKFIQKNSLPLVGEFTPENAEELFESGK--PLLLFIKKDSEETEKLKNRVEEVA 120

Query: 196 KDFREK 201
           K F+ K
Sbjct: 121 KKFKGK 126


>gnl|CDD|239279 cd02981, PDI_b_family, Protein Disulfide Isomerase (PDIb) family,
           redox inactive TRX-like domain b; composed of eukaryotic
           proteins involved in oxidative protein folding in the
           endoplasmic reticulum (ER) by acting as catalysts and
           folding assistants. Members of this family include PDI,
           calsequestrin and other PDI-related proteins like ERp72,
           ERp57, ERp44 and PDIR. PDI, ERp57 (or ERp60), ERp72 and
           PDIR are all oxidases, catalyzing the formation of
           disulfide bonds of newly synthesized polypeptides in the
           ER. They also exhibit reductase activity in acting as
           isomerases to correct any non-native disulfide bonds, as
           well as chaperone activity to prevent protein
           aggregation and facilitate the folding of newly
           synthesized proteins. These proteins contain multiple
           copies of a redox active TRX (a) domain containing a
           CXXC motif, and one or more redox inactive TRX-like (b)
           domains. The molecular structure of PDI is abb'a'. Also
           included in this family is the PDI-related protein
           ERp27, which contains only redox-inactive TRX-like (b
           and b') domains. The redox inactive b domains are
           implicated in substrate recognition.
          Length = 97

 Score = 63.9 bits (156), Expect = 1e-13
 Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 22/117 (18%)

Query: 30  AKEFTSVDEIKAFIADSKVVVAGLFKDASSELAKTFNEIASKVDDLVFDASSELAKTFNE 89
            KE TS +E++ F+    VVV G FKD  SE  KTF ++A  +                 
Sbjct: 1   VKELTSKEELEKFLDKDDVVVVGFFKDEESEEYKTFEKVAESLR---------------- 44

Query: 90  IASKVDDLVFVTSTNADILAEYSVDDDTVAIFKKFDEGRVNYEGPASDEAALRKFLS 146
                DD  F  +++ ++  +  V   +V +FK F+E  V Y+G    E +L +F+ 
Sbjct: 45  -----DDYGFGHTSDKEVAKKLKVKPGSVVLFKPFEEEPVEYDGE-FTEESLVEFIK 95


>gnl|CDD|240266 PTZ00102, PTZ00102, disulphide isomerase; Provisional.
          Length = 477

 Score = 63.6 bits (155), Expect = 5e-12
 Identities = 56/191 (29%), Positives = 78/191 (40%), Gaps = 34/191 (17%)

Query: 11  GGRTAEDIVNWLLKKTGPPAKEFTSVDEIKAFIADSKVVVAGLFKDASSELAKTFNEIAS 70
           GGRTA+ IV+W+ K TGP   E  S  EIK       V   G +    SEL K F E+A 
Sbjct: 123 GGRTADGIVSWIKKLTGPAVTEVESASEIKLIAKKIFVAFYGEYTSKDSELYKKFEEVAD 182

Query: 71  KVDDLVFDASSELAKTFNEIASKVDDLVFVTSTNADILAEYSVDDDTVAIFKKFDEGRVN 130
           K          E AK F +                          + + +  K +EG   
Sbjct: 183 K--------HREHAKFFVKKHEG---------------------KNKIYVLHKDEEGVEL 213

Query: 131 YEGPASDEAALRKFLSTQSLPLVVEFNHETAQKIFGGEIKSHLLVFFSKAAGHYESHFEP 190
           + G   +E  L +F+ST+S PL  E N E  ++          LV+F      Y+ +   
Sbjct: 214 FMGKTKEE--LEEFVSTESFPLFAEINAENYRRYISSGKD---LVWFCGTTEDYDKYKSV 268

Query: 191 VQTVAKDFREK 201
           V+ VA+  REK
Sbjct: 269 VRKVARKLREK 279


>gnl|CDD|239367 cd03069, PDI_b_ERp57, PDIb family, ERp57 subfamily, first redox
           inactive TRX-like domain b; ERp57 (or ERp60) exhibits
           both disulfide oxidase and reductase functions like PDI,
           by catalyzing the formation of disulfide bonds of newly
           synthesized polypeptides in the ER and acting as
           isomerases to correct any non-native disulfide bonds. It
           also displays chaperone activity to prevent protein
           aggregation and facilitate the folding of newly
           synthesized proteins. ERp57 contains two redox-active
           TRX (a) domains and two redox inactive TRX-like (b)
           domains.  It shares the same domain arrangement of
           abb'a' as PDI, but lacks the C-terminal acid-rich region
           (c domain) that is present in PDI. ERp57 interacts with
           the lectin chaperones, calnexin and calreticulin, and
           specifically promotes the oxidative folding of
           glycoproteins. Similar to PDI, the b domain of ERp57 is
           likely involved in binding to substrates.
          Length = 104

 Score = 38.1 bits (89), Expect = 4e-04
 Identities = 23/121 (19%), Positives = 47/121 (38%), Gaps = 27/121 (22%)

Query: 30  AKEFTSVDEIKAFIADSKVVVAGLFKDASSELAKTFNEIASKVDDLVFDASSELAKTFNE 89
           + E  +  E + F++D    V G F+D  S+L   F + A  + +               
Sbjct: 2   SVELRTEAEFEKFLSDDDASVVGFFEDEDSKLLSEFLKAADTLRESF------------- 48

Query: 90  IASKVDDLVFVTSTNADILAEYSVDDDTVAI-----FKKFDEGRVNYEGPASDEAALRKF 144
                    F  +++  +L +Y   +  V         KF++  V ++G   D + ++KF
Sbjct: 49  --------RFAHTSDKQLLEKYGYGEGVVLFRPPRLSNKFEDSSVKFDG-DLDSSKIKKF 99

Query: 145 L 145
           +
Sbjct: 100 I 100


>gnl|CDD|205018 pfam12683, DUF3798, Protein of unknown function (DUF3798).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 247 and 417 amino
           acids in length. Most of the proteins in this family
           have an N-terminal lipoprotein attachment site. These
           proteins have distant similarity to periplasmic ligand
           binding families such as pfam02608, which suggests that
           this family have a similar role.
          Length = 275

 Score = 33.4 bits (77), Expect = 0.058
 Identities = 18/93 (19%), Positives = 36/93 (38%), Gaps = 6/93 (6%)

Query: 32  EFTSVDEIKAFIADSKVVVAGLFKDASSELAKTFNEIASKVDD-----LVFDASSE-LAK 85
           E+ + + +     D  +    L  + S+E   T ++I S  DD     ++   +      
Sbjct: 18  EYRAAEALLEKYGDDMIKHVTLPDNFSTEQETTISKIVSLADDPKVKAIIVSQAVPGTLP 77

Query: 86  TFNEIASKVDDLVFVTSTNADILAEYSVDDDTV 118
            F +I  K  D++ + +   +   E S   D V
Sbjct: 78  AFQKIKEKRPDILLIAAPPHEDPEEMSKVADLV 110


>gnl|CDD|153352 cd07668, BAR_SNX9, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexin 9.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. SNX9, also known as SH3PX1, is a
           cytosolic protein that interacts with proteins
           associated with clathrin-coated pits such as
           Cdc-42-associated tyrosine kinase 2 (ACK2). It binds
           class I polyproline sequences found in dynamin 1/2 and
           the WASP/N-WASP actin regulators. SNX9 is localized to
           plasma membrane endocytic sites and acts primarily in
           clathrin-mediated endocytosis. Its array of interacting
           partners suggests that SNX9 functions at the interface
           between endocytosis and actin cytoskeletal organization.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions.
          Length = 210

 Score = 31.5 bits (71), Expect = 0.22
 Identities = 37/154 (24%), Positives = 63/154 (40%), Gaps = 14/154 (9%)

Query: 57  ASSELAKTFNEIASKVDDLVFDASSELAKTFNEIASKVD-----DLVFVTSTNADILAEY 111
           A   LA  F+    + +  + DA +E  KT+ EIAS V      DL F+  TN +     
Sbjct: 56  ALQSLATVFSTSGYQGETDLNDAITEAGKTYEEIASLVAEQPKKDLHFLMETNHEYKGFL 115

Query: 112 SVDDDTVAI-------FKKFDEGRVNYEGPASDEAALRKFLSTQSLPLVVEFNHETAQKI 164
               D +          K+ D+     +    D+  + K +ST S  L  E NH  + +I
Sbjct: 116 GCFPDIIGAHKGAIEKVKESDKLVATSKITLQDKQNMVKRVSTMSYALQAEMNHFHSNRI 175

Query: 165 FGGEIKSHLLVFFSKAAGHYESHFEPVQTVAKDF 198
           +  +  S + ++  +    YE+  E ++     F
Sbjct: 176 Y--DYNSVIRLYLEQQVQFYETIAEKLRQALSRF 207


>gnl|CDD|239259 cd02961, PDI_a_family, Protein Disulfide Isomerase (PDIa) family,
           redox active TRX domains; composed of eukaryotic
           proteins involved in oxidative protein folding in the
           endoplasmic reticulum (ER) by acting as catalysts and
           folding assistants. Members of this family include PDI
           and PDI-related proteins like ERp72, ERp57 (or ERp60),
           ERp44, P5, PDIR, ERp46 and the transmembrane PDIs. PDI,
           ERp57, ERp72, P5, PDIR and ERp46 are all oxidases,
           catalyzing the formation of disulfide bonds of newly
           synthesized polypeptides in the ER. They also exhibit
           reductase activity in acting as isomerases to correct
           any non-native disulfide bonds, as well as chaperone
           activity to prevent protein aggregation and facilitate
           the folding of newly synthesized proteins. These
           proteins usually contain multiple copies of a redox
           active TRX (a) domain containing a CXXC motif, and may
           also contain one or more redox inactive TRX-like (b)
           domains. Only one a domain is required for the oxidase
           function but multiple copies are necessary for the
           isomerase function. The different types of PDIs may show
           different substrate specificities and tissue-specific
           expression, or may be induced by stress. PDIs are in
           their reduced form at steady state and are oxidized to
           the active form by Ero1, which is localized in the ER
           through ERp44. Some members of this family also contain
           a DnaJ domain in addition to the redox active a domains;
           examples are ERdj5 and Pfj2. Also included in the family
           is the redox inactive N-terminal TRX-like domain of
           ERp29.
          Length = 101

 Score = 29.9 bits (68), Expect = 0.31
 Identities = 25/98 (25%), Positives = 42/98 (42%), Gaps = 15/98 (15%)

Query: 62  AKTFNEIASKVDD-LV-FDA-----SSELAKTFNEIASKV---DDLVFVT---STNADIL 108
              F+E+     D LV F A        LA  + ++A ++     +V      + N D+ 
Sbjct: 5   DDNFDELVKDSKDVLVEFYAPWCGHCKALAPEYEKLAKELKGDGKVVVAKVDCTANNDLC 64

Query: 109 AEYSVDD-DTVAIFKKFDEGRVNYEGPASDEAALRKFL 145
           +EY V    T+ +F    +  V YEGP     +L +F+
Sbjct: 65  SEYGVRGYPTIKLFPNGSKEPVKYEGP-RTLESLVEFI 101


>gnl|CDD|239280 cd02982, PDI_b'_family, Protein Disulfide Isomerase (PDIb') family,
           redox inactive TRX-like domain b'; composed of
           eukaryotic proteins involved in oxidative protein
           folding in the endoplasmic reticulum (ER) by acting as
           catalysts and folding assistants. Members of this family
           include PDI, calsequestrin and other PDI-related
           proteins like ERp72, ERp57 (or ERp60), ERp44, P5 and
           PDIR. PDI, ERp57, ERp72, P5 and PDIR are all oxidases,
           catalyzing the formation of disulfide bonds of newly
           synthesized polypeptides in the ER. They also exhibit
           reductase activity in acting as isomerases to correct
           any non-native disulfide bonds, as well as chaperone
           activity to prevent protein aggregation and facilitate
           the folding of newly synthesized proteins. These
           proteins contain multiple copies of a redox active TRX
           (a) domain containing a CXXC motif, and one or more
           redox inactive TRX-like (b) domains. The molecular
           structure of PDI is abb'a'. Also included in this family
           is the PDI-related protein ERp27, which contains only
           redox-inactive TRX-like (b and b') domains. The redox
           inactive domains are implicated in substrate recognition
           with the b' domain serving as the primary substrate
           binding site. Only the b' domain is necessary for the
           binding of small peptide substrates. In addition to the
           b' domain, other domains are required for the binding of
           larger polypeptide substrates. The b' domain is also
           implicated in chaperone activity.
          Length = 103

 Score = 29.9 bits (68), Expect = 0.35
 Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 2/44 (4%)

Query: 160 TAQKIFGGEI--KSHLLVFFSKAAGHYESHFEPVQTVAKDFREK 201
            A+  F  E   K  L++F++K     E   E  + VAK F+ K
Sbjct: 1   NAETFFNYEESGKPLLVLFYNKDDSESEELRERFKEVAKKFKGK 44


>gnl|CDD|218290 pfam04847, Calcipressin, Calcipressin.  Calcipressin is also known
           as calcineurin-binding protein, since it inhibits
           calcineurin-mediated transcriptional modulation by
           binding to calcineurin's catalytic domain.
          Length = 177

 Score = 30.5 bits (69), Expect = 0.37
 Identities = 27/84 (32%), Positives = 29/84 (34%), Gaps = 6/84 (7%)

Query: 97  LVFVTSTNADILAEYSV---DDDTVAIFKKFDEGRVNYEGPASDEAALRKFLSTQSLPLV 153
           LVFV   N D L E         T    K F   RVN+  P +  AA  K L   S  L 
Sbjct: 2   LVFVNPDNKDKLRELFRQYNPIVTFQPLKSFRRVRVNFSSPEAAAAARIK-LHEGSEFLG 60

Query: 154 VEFNHETAQKIFGGEI--KSHLLV 175
            E     AQ         K HL  
Sbjct: 61  KELKLYFAQPQHPERSAAKQHLAP 84


>gnl|CDD|181432 PRK08453, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 29.8 bits (67), Expect = 1.1
 Identities = 28/121 (23%), Positives = 53/121 (43%), Gaps = 15/121 (12%)

Query: 55  KDASSELAKTFNEIASKVDDLV-FDASSELAKTFNEIASKVDDLVFVTSTNADILAEYSV 113
            D+  E  K +NE+  K+D+   +DA +++A  FN     V D+  + S+  ++ +    
Sbjct: 484 IDSLKEFVKAYNELIPKLDEDTRYDADTKIAGIFN----GVGDIRSIRSSLNNVFSYSVH 539

Query: 114 DDDTVAIFKKF-----DEGRVNYEGPASDEAALRKFLSTQSLPLVVEFNHETAQKIFGGE 168
            D+ V    K+     ++G +       DEA L   L++        F    ++ + G E
Sbjct: 540 TDNGVESLMKYGLSLDEKGVMTL-----DEAKLSSALNSDPKATQDFFYGSDSKDMGGRE 594

Query: 169 I 169
           I
Sbjct: 595 I 595


>gnl|CDD|219887 pfam08532, Glyco_hydro_42M, Beta-galactosidase trimerisation
           domain.  This is non catalytic domain B of
           beta-galactosidase enzymes belong to the glycosyl
           hydrolase 42 family. This domain is related to glutamine
           amidotransferase enzymes, but the catalytic residues are
           replaced by non functional amino acids. This domain is
           involved in trimerisation.
          Length = 207

 Score = 29.1 bits (66), Expect = 1.2
 Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 3/50 (6%)

Query: 100 VTSTNADILAEYSVD--DDTVAI-FKKFDEGRVNYEGPASDEAALRKFLS 146
           +  T A++LA Y  D    T A+    + +G+  Y G   D   L   L 
Sbjct: 149 IHPTGAEVLATYQDDFYAGTPAVTRNNYGKGKAYYLGSRLDPDGLDALLE 198


>gnl|CDD|221435 pfam12132, DUF3587, Protein of unknown function (DUF3587).  This
           protein is found in viruses. Proteins in this family are
           typically between 209 and 248 amino acids in length.
          Length = 201

 Score = 28.8 bits (65), Expect = 1.5
 Identities = 17/39 (43%), Positives = 21/39 (53%), Gaps = 8/39 (20%)

Query: 13  RTAED---I-VNWLLKKTG---PPA-KEFTSVDEIKAFI 43
           RT ED   I V+ LL   G   PP   EFTS+ E+  F+
Sbjct: 84  RTEEDRILINVDSLLPIFGGIVPPGMDEFTSISELHNFV 122


>gnl|CDD|178283 PLN02679, PLN02679, hydrolase, alpha/beta fold family protein.
          Length = 360

 Score = 29.0 bits (65), Expect = 2.1
 Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 10/40 (25%)

Query: 112 SVDDDTVAIFKKFDEGRVNYEGPASDEAALRKFLSTQSLP 151
           +VDD+ V I +          GPA DE AL  F+S  + P
Sbjct: 250 AVDDELVEIIR----------GPADDEGALDAFVSIVTGP 279


>gnl|CDD|218429 pfam05097, DUF688, Protein of unknown function (DUF688).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana.
          Length = 446

 Score = 28.6 bits (63), Expect = 2.4
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 184 YESHFEPVQTVAKDFREKEEQDEDDD 209
           Y   F P   +  D  ++EE+DE+DD
Sbjct: 211 YRVSFAPNYYLNNDIDDEEEEDENDD 236


>gnl|CDD|225543 COG2996, COG2996, Predicted RNA-bindining protein (contains S1
          and HTH domains) [General function prediction only].
          Length = 287

 Score = 28.5 bits (64), Expect = 3.0
 Identities = 16/60 (26%), Positives = 22/60 (36%), Gaps = 10/60 (16%)

Query: 16 EDIVNWLLKKTGPPAKEFTSVDEIKAFIADSKVVVAGLFKDASSELAKTFNEIASKVDDL 75
          ED    LL K+ P   E    DE+  FI          + D+   L  T  E  + V + 
Sbjct: 27 EDGTTILLPKSEPEEDELEVGDEVTVFI----------YVDSEDRLIATTREPKATVGEY 76


>gnl|CDD|236084 PRK07739, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 507

 Score = 28.4 bits (64), Expect = 3.3
 Identities = 15/40 (37%), Positives = 24/40 (60%), Gaps = 3/40 (7%)

Query: 61  LAKTFNEIASKVDDLVFDASSEL---AKTFNEIASKVDDL 97
           LA+TFN ++  + D+  D  SE+    K  N +AS++ DL
Sbjct: 151 LAETFNYLSQSLTDIQNDLKSEIDVTVKEINSLASQISDL 190


>gnl|CDD|182079 PRK09793, PRK09793, methyl-accepting protein IV; Provisional.
          Length = 533

 Score = 28.1 bits (62), Expect = 4.0
 Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 13/125 (10%)

Query: 38  EIKAFIADSKVVV---AGLFKDASSELAKTFNEIAS---KVDDL---VFDASSELAKTFN 88
           EIK  I +S   V   + L  +A    A T  +I S   +V+D+   +  AS E  +   
Sbjct: 412 EIKGLIEESVNRVQQGSKLVNNA----AATMTDIVSSVTRVNDIMGEIASASEEQRRGIE 467

Query: 89  EIASKVDDLVFVTSTNADILAEYSVDDDTVAIFKKFDEGRVNYEGPASDEAALRKFLSTQ 148
           ++A  V  +  VT  NA ++ E +V  + +A        RV        E A  +    Q
Sbjct: 468 QVAQAVSQMDQVTQQNASLVEEAAVATEQLANQADHLSSRVAVFTLEEHEVARHESAQLQ 527

Query: 149 SLPLV 153
             P+V
Sbjct: 528 IAPVV 532


>gnl|CDD|233894 TIGR02492, flgK_ends, flagellar hook-associated protein FlgK.  The
           flagellar hook-associated protein FlgK of bacterial
           flagella has conserved N- and C-terminal domains. The
           central region is highly variable in length and
           sequence, and often contains substantial runs of
           low-complexity sequence. This model is built from an
           alignment of FlgK sequences with the central region
           excised. Note that several other proteins of the
           flagellar apparatus also are homologous in the N- and
           C-terminal regions to FlgK, but are excluded from this
           model [Cellular processes, Chemotaxis and motility].
          Length = 324

 Score = 28.1 bits (63), Expect = 4.2
 Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 2/44 (4%)

Query: 54  FKDASSELAKTFNEIASKVDDLVFDASSELAKTFNEIASKVDDL 97
           F +A  ELAK  +  A +    V +++  LA  FN+ +S++ DL
Sbjct: 112 FFNALQELAKNPDSEALR--QAVLESAQALANQFNQTSSELQDL 153


>gnl|CDD|185008 PRK15048, PRK15048, methyl-accepting chemotaxis protein II;
           Provisional.
          Length = 553

 Score = 27.7 bits (61), Expect = 5.1
 Identities = 21/78 (26%), Positives = 39/78 (50%), Gaps = 5/78 (6%)

Query: 38  EIKAFIADS--KVVVAGLFKDASSELAKTFNEIASKVDDL---VFDASSELAKTFNEIAS 92
           EIKA I DS  +V    +  +++ E         ++V D+   +  AS E ++  +++A 
Sbjct: 414 EIKALIEDSVSRVDTGSVLVESAGETMNNIVNAVTRVTDIMGEIASASDEQSRGIDQVAL 473

Query: 93  KVDDLVFVTSTNADILAE 110
            V ++  VT  NA ++ E
Sbjct: 474 AVSEMDRVTQQNASLVQE 491


>gnl|CDD|224348 COG1431, COG1431, Argonaute homolog, implicated in RNA metabolism
           [Translation, ribosomal structure and biogenesis].
          Length = 685

 Score = 27.5 bits (61), Expect = 5.5
 Identities = 25/119 (21%), Positives = 40/119 (33%), Gaps = 7/119 (5%)

Query: 11  GGRTAEDIVNWLLKKTGPPAKEFTSV--DEIKAFIADSKVVV-AGLFKDASSELAKTFNE 67
           G      I N  L    P  K   ++  +EI   + DS+++              K  N 
Sbjct: 315 GTEGKTTIDNVNLFCYLPYFKPDGTMLWNEISPIVTDSELLTRLKSTIKKVVYGFKNSNG 374

Query: 68  IASKVDDLVFDASSELAKTFNEIASKVDDLVFVTSTNADILAEYSVDDDTVAIFKKFDE 126
           I  KV+ L       +A    ++   +  ++           E   DD   AI K+ DE
Sbjct: 375 IDWKVEGLTL----HVAGKRPKMKDDLTKIIKEIDVEELKKQEMYKDDVKYAILKRLDE 429


>gnl|CDD|241084 cd12640, RRM3_Bruno_like, RNA recognition motif 3 in Drosophila
          melanogaster Bruno protein and similar proteins.  This
          subgroup corresponds to the RRM3 of Bruno protein, a
          Drosophila RNA recognition motif (RRM)-containing
          protein that plays a central role in regulation of
          Oskar (Osk) expression. It mediates repression by
          binding to regulatory Bruno response elements (BREs) in
          the Osk mRNA 3' UTR. The full-length Bruno protein
          contains three RRMs, two located in the N-terminal half
          of the protein and the third near the C-terminus,
          separated by a linker region. .
          Length = 79

 Score = 25.7 bits (56), Expect = 6.1
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 31 KEFTSVDEIKAFIADSKVVVAGLFKDASSELAKTF 65
          +EFT  D  + F+    V+ A +F D  + L+K F
Sbjct: 14 QEFTDTDLAQTFLPFGNVISAKVFIDKQTNLSKCF 48


>gnl|CDD|185226 PRK15326, PRK15326, type III secretion system needle complex
           protein PrgI; Provisional.
          Length = 80

 Score = 25.9 bits (56), Expect = 6.4
 Identities = 14/65 (21%), Positives = 36/65 (55%), Gaps = 5/65 (7%)

Query: 65  FNEIASKVDDLVFDASSELAKTFNEIASKVDDLVFVTSTNADILAEYSV----DDDTVAI 120
            +++++K D  V +  +++ +  +++A+K  D   + +  +  L+EY++      +TV +
Sbjct: 9   LDDVSAKFDTGVDNLQTQVTEALDKLAAKPSDPALLAAYQSK-LSEYNLYRNAQSNTVKV 67

Query: 121 FKKFD 125
           FK  D
Sbjct: 68  FKDID 72


>gnl|CDD|215167 PLN02296, PLN02296, carbonate dehydratase.
          Length = 269

 Score = 27.0 bits (60), Expect = 7.1
 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 5/45 (11%)

Query: 27  GPPAK---EFTSVDEIKAFIADSKVVVAGLFKDASSELAKTFNEI 68
           G PAK   + T  +E  AFI+ S    + L +  ++E AK+F+EI
Sbjct: 183 GNPAKFLRKLT--EEEIAFISQSATNYSNLAQVHAAENAKSFDEI 225


>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility
           and secretion / Signal transduction mechanisms].
          Length = 408

 Score = 27.3 bits (60), Expect = 7.2
 Identities = 16/95 (16%), Positives = 35/95 (36%), Gaps = 5/95 (5%)

Query: 16  EDIVNWLLKKTGPPAKEFTSVDEIKAFIADSKVVVAGLFKDASSELAKTFNEIASKVDDL 75
             ++  +  +      +     E  A      V +      +  E+A    E++  + + 
Sbjct: 295 GLLIEEIQNEA----ADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEEVSQLISE- 349

Query: 76  VFDASSELAKTFNEIASKVDDLVFVTSTNADILAE 110
           +  A+ E      EI + +++L  VT  NA  + E
Sbjct: 350 IAAATEEQTAVLEEINASIEELDDVTQENAAAVEE 384


>gnl|CDD|223840 COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell
           envelope biogenesis, outer membrane].
          Length = 475

 Score = 27.3 bits (61), Expect = 7.8
 Identities = 28/96 (29%), Positives = 38/96 (39%), Gaps = 21/96 (21%)

Query: 87  FNEIASKVDDLVFVTSTN------ADILAEYSVDDDTVAIFKKFDEGRVNYEGPASDEAA 140
              IA ++ D+V VTS N      A ILA+         I     E    YE     E A
Sbjct: 379 MGAIAEQLADIVIVTSDNPRSEDPAVILAD---------ILAGI-EAPEKYEIIEDREEA 428

Query: 141 LRKFLST-QSLPLVVEF--NHETAQKIFGGEIKSHL 173
           +RK L   +   +V+     HET Q +   E+K   
Sbjct: 429 IRKALDLAKEGDVVLIAGKGHETYQIM--LELKRPF 462


>gnl|CDD|185647 PTZ00468, PTZ00468, phosphofructokinase family protein;
           Provisional.
          Length = 1328

 Score = 27.2 bits (60), Expect = 9.3
 Identities = 24/96 (25%), Positives = 42/96 (43%), Gaps = 19/96 (19%)

Query: 93  KVDDLVFV----TSTNADILAEYSVDDDTVAIFKKFDEGRVNYEGPASDEAALRKFLSTQ 148
           K+  LV +    ++TNA +LAEY         FK+     V    P + +  L+  +   
Sbjct: 196 KLHGLVVIGGDDSNTNAAVLAEY---------FKRNSSSTVVVGCPKTIDGDLKNEVIET 246

Query: 149 SLPLVVEFNHETAQKIFGGEIKSHLLVFFSKAAGHY 184
           S      F ++TA K +  +I S +    ++  G+Y
Sbjct: 247 S------FGYDTAVKTYSEQIGSIMDAIKTEGYGYY 276


>gnl|CDD|132268 TIGR03224, benzo_boxA, benzoyl-CoA oxygenase/reductase, BoxA
           protein.  Members of this protein family are BoxA, the A
           component of the BoxAB benzoyl-CoA oxygenase/reductase.
           This oxygen-requiring enzyme acts in an aerobic pathway
           of benzoate catabolism via coenzyme A ligation. BoxA is
           a homodimeric iron-sulphur-flavoprotein and acts as an
           NADPH-dependent reductase for BoxB [Energy metabolism,
           Other].
          Length = 411

 Score = 26.8 bits (59), Expect = 9.5
 Identities = 19/70 (27%), Positives = 29/70 (41%), Gaps = 7/70 (10%)

Query: 147 TQSLPL--VVEFNHETAQKIFGGEIKSHLLVFFSKAAGHYESHFEPVQTVAKDFREKEEQ 204
           T S P+  + E          GG+    L++FF         +F P+Q + KDF +    
Sbjct: 277 TGSAPMRAMTERRRRRRDHGEGGK----LMLFFGARTKEELPYFGPLQKLPKDFIDINFA 332

Query: 205 -DEDDDQPKR 213
                +QPKR
Sbjct: 333 FSRTPEQPKR 342


>gnl|CDD|215206 PLN02362, PLN02362, hexokinase.
          Length = 509

 Score = 26.8 bits (59), Expect = 9.5
 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 6/48 (12%)

Query: 169 IKSHLLVFFSKAAGHYESHFEPVQTVAKDFREKEEQDEDDDQPKRDEL 216
           I  HL+   S      E  F+ + +  K F EKEE   +  Q +R EL
Sbjct: 131 IPQHLMNSTS------EVLFDFIASSLKQFVEKEENGSEFSQVRRREL 172


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.132    0.365 

Gapped
Lambda     K      H
   0.267   0.0742    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,126,589
Number of extensions: 1050136
Number of successful extensions: 1061
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1043
Number of HSP's successfully gapped: 57
Length of query: 216
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 123
Effective length of database: 6,812,680
Effective search space: 837959640
Effective search space used: 837959640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 57 (25.7 bits)