RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11027
         (252 letters)



>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region.  This family
           consists of the N terminal region of various alpha, beta
           and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
           protein complexes. The adaptor protein (AP) complexes
           are involved in the formation of clathrin-coated pits
           and vesicles. The N-terminal region of the various
           adaptor proteins (APs) is constant by comparison to the
           C-terminal which is variable within members of the AP-2
           family; and it has been proposed that this constant
           region interacts with another uniform component of the
           coated vesicles.
          Length = 522

 Score =  141 bits (357), Expect = 7e-39
 Identities = 58/139 (41%), Positives = 80/139 (57%), Gaps = 4/139 (2%)

Query: 1   MLTTNMIRKDLNAQNQYDAGLALSALACFTSTDLARDLANDIMMLLTSTKPYLRKKAVLM 60
           +L TN I+KDL + N    GLAL  L+C    +LARDLA DI  LL    PY+RKKA L 
Sbjct: 77  ILVTNSIKKDLQSPNPLIRGLALRTLSCIRVPELARDLAPDIKKLLVDRDPYVRKKAALA 136

Query: 61  MYKVFLKFPDALRPA-FPRLKEKLEDPDSGVQSAAVNVVCELARKNPKNYLSLAPVFFKL 119
           + K++ K PD +R    P LKE L D D GV SAAV ++ E+ + +      L P+  + 
Sbjct: 137 ILKLYRKDPDLVRDFLVPELKELLSDKDPGVVSAAVALLYEIRKNDRLYLNKLLPLLVRR 196

Query: 120 MT---TSSNNWMLIKIIKL 135
           +    T  N W+ +KI++L
Sbjct: 197 LCNLLTVCNPWLQVKILRL 215



 Score = 42.2 bits (100), Expect = 2e-04
 Identities = 31/158 (19%), Positives = 56/158 (35%), Gaps = 34/158 (21%)

Query: 84  EDPDSGVQSAAVNVVCELARKNPKNYLSLAPVFFKLMTTSSNNWMLIKIIKLRLQGVFAV 143
           E  D   +   V  +  LA K P +      V  +L++ +  ++++ +I          V
Sbjct: 348 EIADPEFKIKLVKAIGRLAEKFPTDAEWCIDVLLELLSLA-GSYVVDEI----------V 396

Query: 144 NVVCELARKNPKNYLSLAPVFFKLMTTSSNNWMLIKIIKLVLYAVLYAAAWICGEFREHL 203
            V+ ++ RK P+    +     +L+    +                 AA WI GE+ E +
Sbjct: 397 EVIRDIIRKYPELREYILEHLCELLEDIESP------------EARAAALWILGEYGELI 444

Query: 204 DSPRATLEAMTRRGLSLLPGHIQAVYVQNMLKILARIL 241
                          S L   I  V+V   LK+   +L
Sbjct: 445 -----------PNSPSDLLRSILEVFVLESLKVRLALL 471


>gnl|CDD|227427 COG5096, COG5096, Vesicle coat complex, various subunits
           [Intracellular trafficking and secretion].
          Length = 757

 Score = 44.7 bits (106), Expect = 3e-05
 Identities = 32/112 (28%), Positives = 49/112 (43%), Gaps = 2/112 (1%)

Query: 2   LTTNMIRKDLNAQNQYDAGLALSALACFTSTDLARDLANDIMMLLTSTKPYLRKKAVLMM 61
           L  N I+KDL   N+   G AL  L+     +L  ++ + I  LLT    Y+RK A L +
Sbjct: 92  LAVNTIQKDLQDPNEEIRGFALRTLSLLRVKELLGNIIDPIKKLLTDPHAYVRKTAALAV 151

Query: 62  YKVFLKFPDALRPAFP--RLKEKLEDPDSGVQSAAVNVVCELARKNPKNYLS 111
            K++    D          LKE + D D  V + A+  + E+  +    Y  
Sbjct: 152 AKLYRLDKDLYHELGLIDILKELVADSDPIVIANALASLAEIDPELAHGYSL 203


>gnl|CDD|221731 pfam12717, Cnd1, non-SMC mitotic condensation complex subunit 1.
           The three non-SMC (structural maintenance of
           chromosomes) subunits of the mitotic condensation
           complex are Cnd1-3. The whole complex is essential for
           viability and the condensing of chromosomes in mitosis.
          Length = 171

 Score = 32.1 bits (74), Expect = 0.13
 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 7/67 (10%)

Query: 46  LTSTKPYLRKKAVLMMYKVFL----KFPDALRPAFPRLKEKLEDPDSGVQSAAVNVVCEL 101
           L    PY+RK A+L++  + L    K    L     +    L D D  +++ A +   EL
Sbjct: 34  LRDEDPYVRKTALLVLTHLILNDMVKVKGQLFLEMLKC---LVDEDPEIRALAKSFFSEL 90

Query: 102 ARKNPKN 108
            +KNP  
Sbjct: 91  LKKNPNL 97


>gnl|CDD|237417 PRK13533, PRK13533, 7-cyano-7-deazaguanine tRNA-ribosyltransferase;
           Provisional.
          Length = 487

 Score = 29.8 bits (68), Expect = 1.2
 Identities = 20/84 (23%), Positives = 33/84 (39%), Gaps = 2/84 (2%)

Query: 31  STDLARDLANDIMMLLT-STKPYLRKKAVLMMYKVFLK-FPDALRPAFPRLKEKLEDPDS 88
              L R       +L+T   KP    +  ++ +K     FP  L   +P  +E+ E+ D 
Sbjct: 380 HRRLERLTLGKKALLITAGDKPENDSEDDVLFFKPPFGPFPPELSETYPLSQEEPEEIDE 439

Query: 89  GVQSAAVNVVCELARKNPKNYLSL 112
              +AA   +  L   NP    +L
Sbjct: 440 EAIAAAAEGLARLLEANPYAKFTL 463


>gnl|CDD|234308 TIGR03682, arCOG04112, diphthamide biosynthesis enzyme Dph2.
          Members of this family are the archaeal protein Dph2,
          members of the universal archaeal protein family
          designated arCOG04112. The chemical function of this
          protein is analogous to the radical SAM family
          (pfam04055), although the sequence is not homologous.
          The chemistry involves [4Fe-4S]-aided formation of a
          3-amino-3-carboxypropyl radical rather than the
          canonical 5'-deoxyadenosyl radical of the radical SAM
          family.
          Length = 308

 Score = 28.8 bits (65), Expect = 2.5
 Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 63 KVFLKFPDALRPAFPRLKEKLEDPDSGV 90
          +V L+ P+ L+     + +KLE+    V
Sbjct: 4  RVLLQAPEGLKRRAFEIAQKLEEKGYEV 31


>gnl|CDD|237763 PRK14603, ruvA, Holliday junction DNA helicase RuvA; Provisional.
          Length = 197

 Score = 28.2 bits (63), Expect = 3.1
 Identities = 27/99 (27%), Positives = 40/99 (40%), Gaps = 12/99 (12%)

Query: 21  LALSALACFTSTDLARDLANDIMMLLTSTKPYLRKKAVLMMYKVFLKFPDALR-PAFPRL 79
           LAL+ L+      LAR L      LLTS     +K A  +  ++  K P+ L        
Sbjct: 84  LALALLSALPPALLARALLEGDARLLTSASGVGKKLAERIALELKGKVPEHLLAGPAGGG 143

Query: 80  KEKLEDPDS----------GVQSAAV-NVVCELARKNPK 107
             K+    +          G + A V +VV EL  +NP+
Sbjct: 144 GTKVSSTAAEDAVLALLALGFREAQVRSVVAELLAQNPE 182


>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat.  The HEAT repeat family is
          related to armadillo/beta-catenin-like repeats (see
          pfam00514). These EZ repeats are found in subunits of
          cyanobacterial phycocyanin lyase and other proteins and
          probably carry out a scaffolding role.
          Length = 55

 Score = 26.3 bits (58), Expect = 3.2
 Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 4/49 (8%)

Query: 51 PYLRKKAVLMMYKVFLKFPDALRP----AFPRLKEKLEDPDSGVQSAAV 95
            +R+ A L +  +    P+ LRP      P L   L+D D  V+ AA 
Sbjct: 1  WEVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAA 49


>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats.  This family includes multiple
          HEAT repeats.
          Length = 88

 Score = 26.5 bits (59), Expect = 5.6
 Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 6/55 (10%)

Query: 42 IMMLLTSTKPYLRKKAVLMMYKVFLKFPDALRPAFPRLKEKLEDPDSGVQSAAVN 96
          +  LL+   P +R  A   + +  L  P+AL    P L E L+DPD  V+ AA  
Sbjct: 5  LEALLSDPDPEVRAAAARALGE--LGDPEAL----PALLELLKDPDPEVRRAAAE 53


>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin, subunit
            D2 [Chromatin structure and dynamics / Cell division and
            chromosome partitioning].
          Length = 1128

 Score = 28.0 bits (62), Expect = 5.7
 Identities = 19/90 (21%), Positives = 32/90 (35%)

Query: 16   QYDAGLALSALACFTSTDLARDLANDIMMLLTSTKPYLRKKAVLMMYKVFLKFPDALRPA 75
            Q  A L+L  L C +    +  L   I  +     P +R  AV+ +    + F       
Sbjct: 913  QVAAYLSLYKLMCLSFEFCSEHLPLLITSMEKHPIPRIRANAVVGLGDFLVCFNTTADEH 972

Query: 76   FPRLKEKLEDPDSGVQSAAVNVVCELARKN 105
               L  +L D D+ V+   +  +  L    
Sbjct: 973  THYLYRRLGDEDADVRRTCLMTIHFLILAG 1002


>gnl|CDD|222230 pfam13569, DUF4132, Domain of unknown function (DUF4132).  This
           domain might be involved in the biosynthesis of the
           molybdopterin cofactor in E.coli.
          Length = 179

 Score = 27.2 bits (61), Expect = 6.3
 Identities = 7/35 (20%), Positives = 15/35 (42%)

Query: 198 EFREHLDSPRATLEAMTRRGLSLLPGHIQAVYVQN 232
           + +      RA LE    RG    P   + +++++
Sbjct: 28  DLKTQASRQRARLEEAMVRGRRWTPEEWKELFLEH 62


>gnl|CDD|215640 PLN03232, PLN03232, ABC transporter C family member; Provisional.
          Length = 1495

 Score = 27.6 bits (61), Expect = 7.9
 Identities = 15/57 (26%), Positives = 24/57 (42%)

Query: 117  FKLMTTSSNNWMLIKIIKLRLQGVFAVNVVCELARKNPKNYLSLAPVFFKLMTTSSN 173
            F L  TSSN W+ I++  L    ++       L   N +N    A     L++ + N
Sbjct: 1125 FTLANTSSNRWLTIRLETLGGVMIWLTATFAVLRNGNAENQAGFASTMGLLLSYTLN 1181


>gnl|CDD|220772 pfam10475, DUF2450, Protein of unknown function N-terminal domain
           (DUF2450).  This protein is found in eukaryotes but its
           function is not known. The C-terminal part of some
           members is DUF2451.
          Length = 291

 Score = 27.3 bits (61), Expect = 8.1
 Identities = 17/52 (32%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 189 LYAAAWICGEFREHLDSPRATLEAMTRRGLSLLPGHIQAVYVQNMLKILARI 240
           L  A  IC   R HLDS +   + +TR  L LL  + +   + ++L  L  I
Sbjct: 76  LKEANVICKNGRRHLDSSK---KNLTRTALGLLAKYKKRQLLIDLLPTLTTI 124


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.135    0.391 

Gapped
Lambda     K      H
   0.267   0.0715    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,749,499
Number of extensions: 1209927
Number of successful extensions: 1558
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1549
Number of HSP's successfully gapped: 30
Length of query: 252
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 157
Effective length of database: 6,723,972
Effective search space: 1055663604
Effective search space used: 1055663604
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 58 (26.1 bits)