RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17223
         (323 letters)



>gnl|CDD|215866 pfam00339, Arrestin_N, Arrestin (or S-antigen), N-terminal domain. 
           Ig-like beta-sandwich fold. Scop reports duplication
           with C-terminal domain.
          Length = 148

 Score = 74.3 bits (183), Expect = 3e-16
 Identities = 33/83 (39%), Positives = 41/83 (49%), Gaps = 17/83 (20%)

Query: 11  GQERLMKK--------LGPNAFPFFFELPPSCPASVTLQPAPGDTGKPCGVDYELKAFVG 62
            QERL KK         G +AFPF FELPP+CP S          G+P G+ YE+K  + 
Sbjct: 75  TQERLWKKKDGSNELPAGTHAFPFSFELPPNCPPSF--------EGQPGGIRYEVKVEL- 125

Query: 63  ETAEDKIHKRNSVRLAIRKIMYA 85
           +    K HKR  V   IRK+   
Sbjct: 126 DRPWKKDHKRKKVFTVIRKLDLN 148


>gnl|CDD|214976 smart01017, Arrestin_C, Arrestin (or S-antigen), C-terminal domain.
            Ig-like beta-sandwich fold. Scop reports duplication
           with N-terminal domain. Arrestins comprise a family of
           closely-related proteins that includes beta-arrestin-1
           and -2, which regulate the function of beta-adrenergic
           receptors by binding to their phosphorylated forms,
           impairing their capacity to activate G(S) proteins; Cone
           photoreceptors C-arrestin (arrestin-X). which could bind
           to phosphorylated red/green opsins; and Drosophila
           phosrestins I and II, which undergo light-induced
           phosphorylation, and probably play a role in
           photoreceptor transduction.
          Length = 142

 Score = 65.4 bits (160), Expect = 4e-13
 Identities = 21/42 (50%), Positives = 26/42 (61%)

Query: 104 SPNKLHLEASLDKELYYHGESIAVNVHVANNSNRTVKKIKVS 145
               L LE SL K+ Y  GE+I V + + N S +TVKKIKVS
Sbjct: 1   WSGPLSLEVSLPKKGYVPGETIPVTIKITNLSKKTVKKIKVS 42



 Score = 50.8 bits (122), Expect = 5e-08
 Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 3/62 (4%)

Query: 226 YYHGESIAVNVHVANNSNRTVKKIKVS---DICLFSTAQYKCTVAETESDCPIAPVSMFD 282
           Y  GE+I V + + N S +TVKKIKVS    +   S+          +S    A      
Sbjct: 16  YVPGETIPVTIKITNLSKKTVKKIKVSLVQTVTYVSSDGPVKRSLAEKSKEKKADRKTLV 75

Query: 283 TE 284
            E
Sbjct: 76  KE 77


>gnl|CDD|217215 pfam02752, Arrestin_C, Arrestin (or S-antigen), C-terminal domain. 
           Ig-like beta-sandwich fold. Scop reports duplication
           with N-terminal domain.
          Length = 136

 Score = 62.0 bits (151), Expect = 6e-12
 Identities = 16/42 (38%), Positives = 25/42 (59%)

Query: 104 SPNKLHLEASLDKELYYHGESIAVNVHVANNSNRTVKKIKVS 145
               + L  SL K+ Y  GE+I + V + N S++ +KKIK+S
Sbjct: 1   WSGPVSLSVSLPKKGYVPGETIPITVEIDNQSSKKIKKIKIS 42



 Score = 47.7 bits (114), Expect = 6e-07
 Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 9/56 (16%)

Query: 226 YYHGESIAVNVHVANNSNRTVKKIKVS-----DICLFS----TAQYKCTVAETESD 272
           Y  GE+I + V + N S++ +KKIK+S          +    T   +  VA+ ES 
Sbjct: 16  YVPGETIPITVEIDNQSSKKIKKIKISLVQQVTYKAKTPLRRTKNEERVVAKEESG 71


>gnl|CDD|184771 PRK14623, PRK14623, hypothetical protein; Provisional.
          Length = 106

 Score = 31.3 bits (71), Expect = 0.21
 Identities = 12/55 (21%), Positives = 27/55 (49%), Gaps = 14/55 (25%)

Query: 135 SNRTVKKIKVSDSGAEDDQDLKD--------------ELADSDIDGMEEDDLPNI 175
           +NR +K I + D   ED + L+D              E+ ++++  + ++ +P+I
Sbjct: 44  ANREIKSISIDDELLEDKEQLEDYLVLTLNKAIEKATEINEAELGAVAKEGMPDI 98


>gnl|CDD|177407 PHA02578, 53, baseplate wedge subunit; Provisional.
          Length = 181

 Score = 29.3 bits (66), Expect = 2.1
 Identities = 18/53 (33%), Positives = 22/53 (41%), Gaps = 9/53 (16%)

Query: 209 WLKKGSKEKSKKKYLFLYYHGESIAVNVHVA----NNSNRTVKKIKVSDICLF 257
           W  KG K        +L YHG   AV+ +      N   R +K I  SDI  F
Sbjct: 124 WYDKGDKA-----RRYLQYHGALAAVDTYEDAVLRNEEKRKIKIISPSDINSF 171


>gnl|CDD|233043 TIGR00599, rad18, DNA repair protein rad18.  All proteins in this
           family for which functions are known are involved in
           nucleotide excision repair.This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 397

 Score = 29.2 bits (65), Expect = 3.1
 Identities = 13/79 (16%), Positives = 23/79 (29%)

Query: 27  FFELPPSCPASVTLQPAPGDTGKPCGVDYELKAFVGETAEDKIHKRNSVRLAIRKIMYAP 86
           F  +P + P        P ++ K   ++      V    ED      S R    +     
Sbjct: 98  FLRIPKTTPVENPDLAGPENSSKIELIEESESDGVDAEDEDLQRSATSSRALAARSTADI 157

Query: 87  SKQGEQPSVEVSKEFMMSP 105
            +    PS     ++   P
Sbjct: 158 LQLSSDPSKRNDADYRSEP 176


>gnl|CDD|215788 pfam00207, A2M, Alpha-2-macroglobulin family.  This family includes
           the C-terminal region of the alpha-2-macroglobulin
           family.
          Length = 92

 Score = 27.5 bits (62), Expect = 3.3
 Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 8/52 (15%)

Query: 89  QGEQPSVEVSKEFMMSPNKLHLEASLDKELYYHGESIAVNVHVANNSNRTVK 140
             E P V V K F +  N   L  S+ +     GE + +   V N  ++ +K
Sbjct: 44  VAEPPEVTVFKPFFVDLN---LPYSVRR-----GEQVEIKATVFNYLDKDIK 87


>gnl|CDD|130079 TIGR01006, polys_exp_MPA1, polysaccharide export protein, MPA1
           family, Gram-positive type.  This family contains
           members from Low GC Gram-positive bacteria; they are
           proposed to have a function in the export of complex
           polysaccharides [Transport and binding proteins,
           Carbohydrates, organic alcohols, and acids].
          Length = 226

 Score = 28.6 bits (64), Expect = 3.8
 Identities = 22/97 (22%), Positives = 40/97 (41%), Gaps = 13/97 (13%)

Query: 99  KEFMMSPNKLHLEASLDKELYYHGESIAVNVHVANNSNRTVKKIKVSDSGAEDDQDLKDE 158
           KE ++SP+ L  + S D +L    + ++  V V N ++  +  I V D   +D   + + 
Sbjct: 81  KEIILSPDILD-KVSKDLKLDLSPKELSSMVTVTNPTDTRLISISVKDKTPQDASKIANS 139

Query: 159 LAD------------SDIDGMEEDDLPNIKAWGKNKR 183
           L +            SD+  +EE       +    KR
Sbjct: 140 LREVASKKIPKITNVSDVTTLEEAKPATTPSSPNPKR 176


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 27.0 bits (60), Expect = 5.3
 Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 4/64 (6%)

Query: 134 NSNRTVKKIKVSDSGAEDDQDLKDELADSDIDGMEEDDLPNIKA---WGKNKRMYYNTDY 190
            S R  +  +  D   E++ +  D+  + D D  ++ D   I A    G+ +  Y + D 
Sbjct: 1   TSPRLERSRRSRDDYDEEEDEDMDDFIEDD-DEEDDYDRDEIWAMFNKGRKRYAYRDDDD 59

Query: 191 VDDD 194
            DDD
Sbjct: 60  DDDD 63


>gnl|CDD|183758 PRK12803, PRK12803, flagellin; Provisional.
          Length = 335

 Score = 28.2 bits (62), Expect = 5.7
 Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 10/64 (15%)

Query: 83  MYAPSKQGEQPSVEVSKEFMMSPNKLHLEASLDKE-----LYYH-----GESIAVNVHVA 132
           M+  S +    +V  ++E  M P K++  ASL        L  H      E+IAVN++ A
Sbjct: 133 MHMLSNKSAAQNVRTAEELGMQPAKINTPASLSGSQASWTLRVHVGANQDEAIAVNIYAA 192

Query: 133 NNSN 136
           N +N
Sbjct: 193 NVAN 196


>gnl|CDD|234448 TIGR04047, MSMEG_0565_glyc, glycosyltransferase, MSMEG_0565
          family.  A conserved gene cluster found sporadically
          from Actinobacteria to Proteobacteria to Cyanobacteria
          features a radical SAM protein, an N-acetyltransferase,
          an oxidoreductase, and two additional proteins whose
          functional classes are unclear. The metabolic role of
          the cluster is probably biosynthetic. This
          glycosyltransferase, named from member MSMEG_0565 from
          Mycobacterium smegmatis, occurs in most but not all
          instances of the cluster [Unknown function, Enzymes of
          unknown specificity].
          Length = 373

 Score = 27.7 bits (62), Expect = 7.6
 Identities = 12/28 (42%), Positives = 13/28 (46%), Gaps = 1/28 (3%)

Query: 21 PNAFPFFFELPPSCPAS-VTLQPAPGDT 47
            A  F F   P C    V + PAPGDT
Sbjct: 37 LAADGFGFFRDPPCAVRLVPVAPAPGDT 64


>gnl|CDD|225248 COG2373, COG2373, Large extracellular alpha-helical protein
           [General function prediction only].
          Length = 1621

 Score = 28.2 bits (63), Expect = 8.4
 Identities = 9/40 (22%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 94  SVEVSKEFMMSPNKLHLEASLDKELYYHGESIAVNVHVAN 133
           S  V ++F+  P++  +  +LDK  +  G+ + + V +  
Sbjct: 487 SFRV-EDFI--PDRFKINLTLDKTEWVPGKDVKIKVDLRY 523


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.132    0.390 

Gapped
Lambda     K      H
   0.267   0.0713    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,222,585
Number of extensions: 1529507
Number of successful extensions: 1348
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1343
Number of HSP's successfully gapped: 22
Length of query: 323
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 226
Effective length of database: 6,635,264
Effective search space: 1499569664
Effective search space used: 1499569664
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.5 bits)