RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2047
         (233 letters)



>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 92.9 bits (231), Expect = 1e-21
 Identities = 66/207 (31%), Positives = 91/207 (43%), Gaps = 52/207 (25%)

Query: 22  EVEQVDQAKRIISPFMLRRLKKDVLTELPKKTALVIKVPMIPSQAEKYRGLMEDFKKTAN 81
           + E V Q  +++ PF+LRRLK DV   LP K   ++KV M   Q + Y+ L++      N
Sbjct: 367 QQEVVQQLHKVLRPFLLRRLKSDVEKGLPPKKETILKVGMSQMQKQYYKALLQKDLDVVN 426

Query: 82  PEGSNRSNEISHMSMFMMLRKMANHPLGLRYYFQENTLREIADCLVEDPTYKGTNPQYIL 141
             G  R      +++ M LRK  NHP    Y FQ                  G  P    
Sbjct: 427 -AGGERKR---LLNIAMQLRKCCNHP----YLFQ------------------GAEPG--- 457

Query: 142 EDISWLSDYDIHQLSLKHKTLDCAKYKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFS 201
                                    Y   + LV  SGK+  LD++LP LK+   RVLIFS
Sbjct: 458 -----------------------PPYTTGEHLVENSGKMVLLDKLLPKLKERDSRVLIFS 494

Query: 202 QFIFVLDILGHYMDIRGWRHLRLDGAT 228
           Q   +LDIL  Y+  RG+++ R+DG T
Sbjct: 495 QMTRLLDILEDYLMYRGYQYCRIDGNT 521


>gnl|CDD|223627 COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family
           [Transcription / DNA replication, recombination, and
           repair].
          Length = 866

 Score = 83.2 bits (205), Expect = 2e-18
 Identities = 69/244 (28%), Positives = 107/244 (43%), Gaps = 51/244 (20%)

Query: 5   SEDDARYEDKRKEQATFEVEQVDQAKRIISPFMLRRLKKD--VLTELPKKTALVIKVPMI 62
            +     ED    +A      ++  ++++SPF+LRR K+D  VL ELP K   V++  + 
Sbjct: 543 EKPIQAEEDIGPLEAREL--GIELLRKLLSPFILRRTKEDVEVLKELPPKIEKVLECELS 600

Query: 63  PSQAEKYRGL----------MEDFKKTANPEGSNRSNEISHMSMFMMLRKMANHPLGLRY 112
             Q E Y  L          +ED +K  + E     +E++ +++   LR++ NHP     
Sbjct: 601 EEQRELYEALLEGAEKNQQLLEDLEKADSDENRIGDSELNILALLTRLRQICNHP----- 655

Query: 113 YFQENTLREIADCLVEDPTYKGTNPQYILEDISWLSDYDIHQLSLKHKTLDCAKYKVPDD 172
                        LV++      +   +L       DY    L    K            
Sbjct: 656 ------------ALVDEGLEATFDRIVLLLREDKDFDYLKKPLIQLSK------------ 691

Query: 173 LVVESGKLKKLDEILPD-LKKNGH--RVLIFSQFIFVLDILGHYMDIRGWRHLRLDGATQ 229
                GKL+ LDE+L D L + GH  +VLIFSQF  VLD+L  Y+   G +++RLDG+T 
Sbjct: 692 -----GKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIKYVRLDGSTP 746

Query: 230 VSSR 233
              R
Sbjct: 747 AKRR 750


>gnl|CDD|215770 pfam00176, SNF2_N, SNF2 family N-terminal domain.  This domain is
           found in proteins involved in a variety of processes
           including transcription regulation (e.g., SNF2, STH1,
           brahma, MOT1), DNA repair (e.g. ERCC6, RAD16, RAD5), DNA
           recombination (e.g. RAD54), and chromatin unwinding
           (e.g. ISWI) as well as a variety of other proteins with
           little functional information (e.g. lodestar, ETL1).
          Length = 301

 Score = 65.5 bits (160), Expect = 1e-12
 Identities = 27/87 (31%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 24  EQVDQAKRIISPFMLRRLKKDVLTELPKKTALVIKVPMIPSQAEKYRGLMEDFKKTANP- 82
           E +++  +++ PF+LRR K DV   LP KT  V+   +   Q + Y+ L+   +   +  
Sbjct: 212 EGINRLHKLLKPFLLRRTKDDVEKSLPPKTEHVLYCNLSDEQRKLYKKLLTKRRLALSFA 271

Query: 83  -EGSNRSNEISH-MSMFMMLRKMANHP 107
            EG  ++  I+  +++ M LRK+ NHP
Sbjct: 272 VEGGEKNVGIASLLNLIMQLRKICNHP 298


>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
           with DEXDc-, DEAD-, and DEAH-box proteins, yeast
           initiation factor 4A, Ski2p, and Hepatitis C virus NS3
           helicases; this domain is found in a wide variety of
           helicases and helicase related proteins; may not be an
           autonomously folding unit, but an integral part of the
           helicase; 4 helicase superfamilies at present according
           to the organization of their signature motifs; all
           helicases share the ability to unwind nucleic acid
           duplexes with a distinct directional polarity; they
           utilize the free energy from nucleoside triphosphate
           hydrolysis to fuel their translocation along DNA,
           unwinding the duplex in the process.
          Length = 131

 Score = 36.8 bits (86), Expect = 0.002
 Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 2/59 (3%)

Query: 173 LVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWR--HLRLDGATQ 229
           L VE  KL+ L E+L +  K G +VLIF     +LD L   +   G +   L  DG+ +
Sbjct: 7   LPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQE 65


>gnl|CDD|180775 PRK06954, PRK06954, acetyl-CoA acetyltransferase; Provisional.
          Length = 397

 Score = 32.2 bits (73), Expect = 0.24
 Identities = 30/124 (24%), Positives = 46/124 (37%), Gaps = 24/124 (19%)

Query: 1   MDSDSED-DARYEDKRKEQATFEVEQVDQAKRI---------ISPFMLRRLKKDVLT--- 47
           M + +E+    Y   R+ Q  F +E + +AKR          I+P  +   K D +    
Sbjct: 163 MGTFAEECAGEYGFTREAQDAFAIESLARAKRANEDGSFAWEIAPVTVAGKKGDTVIDRD 222

Query: 48  ELPKKTALVIKVPMIPSQAEKYRGLMEDFKKTANPEGSNRSNEISHMSMFMMLRKMANHP 107
           E P K         IP+       L   F KT     +N S+     +  +M+R      
Sbjct: 223 EQPFK----ANPEKIPT-------LKPAFSKTGTVTAANSSSISDGAAALVMMRASTAKR 271

Query: 108 LGLR 111
           LGL 
Sbjct: 272 LGLA 275


>gnl|CDD|240273 PTZ00110, PTZ00110, helicase; Provisional.
          Length = 545

 Score = 29.7 bits (67), Expect = 1.3
 Identities = 16/55 (29%), Positives = 27/55 (49%)

Query: 172 DLVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWRHLRLDG 226
            +V E  K  KL  +L  + ++G ++LIF +     D L   + + GW  L + G
Sbjct: 355 FVVEEHEKRGKLKMLLQRIMRDGDKILIFVETKKGADFLTKELRLDGWPALCIHG 409


>gnl|CDD|223050 PHA03335, PHA03335, hypothetical protein; Provisional.
          Length = 385

 Score = 29.2 bits (65), Expect = 2.1
 Identities = 14/78 (17%), Positives = 30/78 (38%), Gaps = 4/78 (5%)

Query: 40  RLKKDVLTELPKKTALVIKVPMIPSQAEKYRGLMEDFKKTANPEGSNRSNEISHMSMFMM 99
           ++  D++     +  L +   + P      RG+M   ++     G NR  +I+ +   + 
Sbjct: 179 QVVVDIMRRRTVEDPLDLIPTIYPWTLPILRGIMSQLQRNIWAGGQNRKEKITELVEELA 238

Query: 100 LRKM----ANHPLGLRYY 113
             K       H + L Y+
Sbjct: 239 KVKRGAANLEHTMCLIYH 256


>gnl|CDD|223194 COG0116, COG0116, Predicted N6-adenine-specific DNA methylase [DNA
           replication, recombination, and repair].
          Length = 381

 Score = 28.8 bits (65), Expect = 2.5
 Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 7/54 (12%)

Query: 172 DLVVESGKLK-----KLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWR 220
           D +V+  + K      +D   PD++ N    L        +D  G  +  RG+R
Sbjct: 116 DAIVDRFRRKYGRRPSVDLDGPDVRINVE--LDKDTATLGIDTTGDSLHKRGYR 167


>gnl|CDD|212061 cd11492, SLC5sbd_NIS-SMVT, Na(+)/iodide (NIS) and
           Na(+)/multivitamin (SMVT) cotransporters, and related
           proteins; solute binding domain.  NIS (encoded by the
           SLC5A5 gene) transports I-, and other anions including
           ClO4-, SCN-, and Br-. SMVT (encoded by the SLC5A6 gene)
           transports biotin, pantothenic acid and lipoate. This
           subfamily also includes SMCT1 and -2. SMCT1(encoded by
           the SLC5A8 gene) is a high-affinity transporter of
           various monocarboxylates including lactate and pyruvate,
           short-chain fatty acids, ketone bodies, nicotinate and
           its structural analogs, pyroglutamate, benzoate and its
           derivatives, and iodide. SMCT2 (encoded by the SLC5A12
           gene) is a low-affinity transporter for short-chain
           fatty acids, lactate, pyruvate, and nicotinate. This
           subgroup belongs to the solute carrier 5 (SLC5)
           transporter family.
          Length = 522

 Score = 28.6 bits (65), Expect = 2.7
 Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 17/49 (34%)

Query: 165 AKYKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHY 213
           AKY   D L   +G + K D++LP                FV+DILGH 
Sbjct: 291 AKYHDCDPL--TAGLISKPDQLLP---------------YFVMDILGHL 322


>gnl|CDD|173933 cd08174, G1PDH-like, Glycerol-1-phosphate dehydrogenase-like.
           Glycerol-1-phosphate dehydrogenase-like. The proteins of
           this family have not been characterized. The protein
           sequences have high similarity with that of
           glycerol-1-phosphate dehydrogenase (G1PDH). G1PDH plays
           a role in the synthesis of phosphoglycerolipids in
           Gram-positive bacterial species. It catalyzes the
           reversibly reduction of dihydroxyacetone phosphate
           (DHAP) to glycerol-1-phosphate (G1P) in a NADH-dependent
           manner. Its activity requires Ni++ ion. This family is
           bacteria specific.
          Length = 331

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 170 PDDLVVESGKLKKLDEILPDLKKNGHRVLIFS 201
           P  L +  G + +L E+L D   N  RV + S
Sbjct: 1   PLVLDIGEGAIARLGELLSDRNPNFGRVAVVS 32


>gnl|CDD|185019 PRK15059, PRK15059, tartronate semialdehyde reductase; Provisional.
          Length = 292

 Score = 28.4 bits (63), Expect = 3.4
 Identities = 22/89 (24%), Positives = 36/89 (40%), Gaps = 12/89 (13%)

Query: 21  FEVEQVDQAKRIISP-FMLRRLKKDVLTELPKKTALVIKVPMIPSQAEKYRGLMEDFKKT 79
            EV      KR  +P F +   +KD+   L    AL + +P   +  E +          
Sbjct: 214 LEVHGERMIKRTFNPGFKIALHQKDLNLALQSAKALALNLPNTATCQELF---------- 263

Query: 80  ANPEGSNRSNEISHMSMFMMLRKMANHPL 108
            N   +N  +++ H ++   L  MANH L
Sbjct: 264 -NTCAANGGSQLDHSALVQALELMANHKL 291


>gnl|CDD|178359 PLN02759, PLN02759, Formate--tetrahydrofolate ligase.
          Length = 637

 Score = 28.2 bits (63), Expect = 3.9
 Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 4/40 (10%)

Query: 8   DAR--YEDKRKEQATFE--VEQVDQAKRIISPFMLRRLKK 43
           D R  +E  + ++A F        + KR  +  M RRLKK
Sbjct: 161 DTRVFHEATQSDKALFNRLCPANKEGKRSFAAVMFRRLKK 200


>gnl|CDD|152729 pfam12294, DUF3626, Protein of unknown function (DUF3626).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 294 and 374 amino
           acids in length.
          Length = 297

 Score = 27.8 bits (62), Expect = 4.7
 Identities = 17/54 (31%), Positives = 22/54 (40%), Gaps = 4/54 (7%)

Query: 119 LREIADCLVEDPTYKGTNPQYILEDISWLSDYDIHQLSLKHKTLDCAKYKVPDD 172
           L E  + LV DP++KGT     L ++     Y I         L      VPDD
Sbjct: 187 LAEDIEALVADPSFKGTPIGRSLNELC--ERYGIDLQWHCGFRLAVD--DVPDD 236


>gnl|CDD|213304 cd05938, hsFATP2a_ACSVL_like, Fatty acid transport proteins (FATP)
           including hsFATP2, hsFATP5, and hsFATP6, and similar
           proteins.  Fatty acid transport proteins (FATP) of this
           family transport long-chain or very-long-chain fatty
           acids across the plasma membrane. At least five copies
           of FATPs are identified in mammalian cells. This family
           includes hsFATP2, hsFATP5, and hsFATP6, and similar
           proteins. Each FATP has unique patterns of tissue
           distribution. These FATPs also have fatty acid CoA
           synthetase activity, thus playing dual roles as fatty
           acid transporters and its activation enzymes. The hsFATP
           proteins exist in two splice variants; the b variant,
           lacking exon 3, has no acyl-CoA synthetase activity.
           FATPs are key players in the trafficking of exogenous
           fatty acids into the cell and in intracellular fatty
           acid homeostasis.
          Length = 535

 Score = 27.7 bits (62), Expect = 5.2
 Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 5/58 (8%)

Query: 144 ISWLSDYDIHQLSLKHKTLDCAKYKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFS 201
            ++L + +I   SL H    C   +V   LV +   L+ ++EILP L+  G RV   S
Sbjct: 56  TAFL-NTNIRSGSLLH-CFRCCGARV---LVADPELLEAVEEILPALRAMGVRVFYLS 108


>gnl|CDD|191916 pfam07982, Herpes_UL74, Herpes UL74 glycoproteins.  Members of
          this family are viral glycoproteins that form part of
          an envelope complex.
          Length = 417

 Score = 27.7 bits (61), Expect = 5.4
 Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 2/59 (3%)

Query: 38 LRRLKKDVLTELPKKTALVIKVPMIPSQAEKYRGL--MEDFKKTANPEGSNRSNEISHM 94
           R+LK ++L +L  K+    K    PSQ + Y     M+ F       G  R++ I+H+
Sbjct: 11 ERKLKLEILKQLASKSGDYYKFFTFPSQQKLYNITVEMKQFPPNYILAGPIRNDSITHL 69


>gnl|CDD|219361 pfam07277, SapC, SapC.  This family contains a number of bacterial
           SapC proteins approximately 250 residues long. In
           Campylobacter fetus, SapC forms part of a
           paracrystalline surface layer (S-layer) that confers
           serum resistance.
          Length = 220

 Score = 27.2 bits (61), Expect = 6.3
 Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 1/43 (2%)

Query: 161 TLDCAKYKVPDDLVVESGKLKKL-DEILPDLKKNGHRVLIFSQ 202
           TLD  K  +     ++  KL  L DE L +L K+G+  LI++ 
Sbjct: 168 TLDGEKRNLNGFYTIDEEKLNALSDEALLELHKSGYLPLIYAH 210


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.395 

Gapped
Lambda     K      H
   0.267   0.0711    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,284,673
Number of extensions: 1192384
Number of successful extensions: 1214
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1208
Number of HSP's successfully gapped: 28
Length of query: 233
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 139
Effective length of database: 6,768,326
Effective search space: 940797314
Effective search space used: 940797314
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (25.8 bits)