RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15691
         (620 letters)



>gnl|CDD|218295 pfam04858, TH1, TH1 protein.  TH1 is a highly conserved but
           uncharacterized metazoan protein. No homologue has been
           identified in Caenorhabditis elegans. TH1 binds
           specifically to A-Raf kinase.
          Length = 582

 Score =  884 bits (2287), Expect = 0.0
 Identities = 360/584 (61%), Positives = 427/584 (73%), Gaps = 11/584 (1%)

Query: 1   MNDDYDDEQTHRWDQDMVDV----DSDGGDDPMENPQEVLNECLEKFSTPDYIMEPGIFS 56
           M +DY  E T  WD+         D D   +  ENP  VL ECL +FST DYIMEP IF 
Sbjct: 1   MEEDY--EVTSGWDRSDGRGTAQGDDDARAEVDENPTAVLQECLRRFSTKDYIMEPSIFD 58

Query: 57  QLKRYFQAGGNPEQVIELLSQNYKAVAQMANLVAEWLILGGVNVTEVQAMVENHLKDMIL 116
            LKRYFQAGG PE VIELLS+NY+AVAQM NL+AEWLI+ GV+ T+VQAMVENHLK +I 
Sbjct: 59  DLKRYFQAGGAPETVIELLSENYRAVAQMTNLLAEWLIVTGVSPTKVQAMVENHLKTLIR 118

Query: 117 KTFDPKKADTIFTEEGETPAWLTEMIEHPTWRSLIYRLAEEYPDCLMLNFTIKLISDAGF 176
           K FDP KAD IFTEEGETPAWL  MI H TWR+L+Y LAEE P+CLMLNFTIKLISDAG 
Sbjct: 119 KRFDPNKADAIFTEEGETPAWLDGMIRHETWRALLYSLAEENPNCLMLNFTIKLISDAGM 178

Query: 177 QREITSISTAAQQIEVFSRVLKTSISNFLES-SDDWQSSVEECAKMVCHGQHTYVYSQVL 235
           Q EITS+STAAQQ EVFSRVL T I+  L S  DD + ++EE A MVCHG HTYV+ Q++
Sbjct: 179 QHEITSLSTAAQQFEVFSRVLATHIAGILVSGPDDVRRAIEELASMVCHGIHTYVFGQLM 238

Query: 236 LQVLSLESKGGSKVKRLAQEITKCAQQSGHDVTPIIMALNGAAFYPQANQALSSMISRNS 295
           L VL  E  GG+   RL+QEI + A +     T I +AL  +A  PQA  AL+ M+SR +
Sbjct: 239 LAVLGQEEDGGTAAARLSQEIEEAAAKLHGAATVITLALATSAVDPQACYALADMLSRGA 298

Query: 296 LNPADITVLFRLYSSPDPPPIDLIRIPQFLELLTDSLFKPGVKLNPEHKPKYIYLLAYAS 355
           LNPADITVL R+Y+S DPPP+DLIR P FL LL D LFK GVK+N  HK KY  LLAYA+
Sbjct: 299 LNPADITVLLRMYTSSDPPPVDLIRHPDFLYLLIDELFKWGVKINNRHKSKYFELLAYAA 358

Query: 356 SVYEVSTGKKTKQKSFNKDDLKTTIQAIEKVHSI-CNSSKSSNELIAEITTLFHCIRYPV 414
           SV E       K ++ NKD+LK+TI+AIEK H+I CN+ +  +ELIAE++TL++CIR+PV
Sbjct: 359 SVVETP---PNKDRNINKDELKSTIKAIEKAHAIICNAKRGKSELIAELSTLYNCIRFPV 415

Query: 415 VSVGVIRWVECTVTEPSYFKLCTEHTPTHLALLDEVVTCHTFLHHKVLQLFIELFESKQD 474
           V  GVI WV   VTEPSYF+L T   P HLA+LDEV   H  L  +VL+L + LFES  D
Sbjct: 416 VGAGVIAWVRSAVTEPSYFRLVTNSNPVHLAVLDEVARKHVALQGQVLELLVALFESHGD 475

Query: 475 ELEILVQLEMRKMLLDRMVNLLCRGCVVPVVKYVKQCWQRGDTDISLIRYFVTEVLEAIT 534
           ELEILVQLE++KMLLDRMV LL RG VVPV+ Y  QCW+R DTDISLIRYFVTEVLE   
Sbjct: 476 ELEILVQLELKKMLLDRMVELLARGHVVPVITYACQCWRRLDTDISLIRYFVTEVLEIAG 535

Query: 535 PPYTPEFVQLFLPIVENDEITGTMRGDGENDPVSEFIVQTLEHE 578
           PPY+ +FVQLFLP++EN+ I GTMRG+G++DPV EFIV      
Sbjct: 536 PPYSRDFVQLFLPLLENENIRGTMRGEGDSDPVDEFIVHCKRRY 579


>gnl|CDD|219491 pfam07632, DUF1593, Protein of unknown function (DUF1593).  A
           family of proteins in Rhodopirellula baltica that are
           predicted to be secreted. Also, a member has been
           identified in Caulobacter crescentus. These proteins mat
           be related to pfam01156.
          Length = 260

 Score = 31.9 bits (73), Expect = 0.89
 Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 18/92 (19%)

Query: 511 CWQRGDTDISLIRYFV---TEV---LEAITPPY-TPEFVQLFLPIVENDEITGTMRGDGE 563
            W +G      I   +    +V   L    P Y T E+      IV+       M   GE
Sbjct: 39  TWHKGGVRPEDIDRIIDAYEKVYPNLNKHAPGYPTAEY---LRSIVKQGNPDYGMPFVGE 95

Query: 564 NDPV--SEFIVQTLEHEWHQLEDDKKIY-QCW 592
                 SE I++ L       +D + ++   W
Sbjct: 96  GKDTEGSELIIEALLD-----DDPRPLWILVW 122


>gnl|CDD|224296 COG1378, COG1378, Predicted transcriptional regulators
           [Transcription].
          Length = 247

 Score = 30.9 bits (70), Expect = 1.6
 Identities = 18/124 (14%), Positives = 49/124 (39%), Gaps = 5/124 (4%)

Query: 460 KVLQLFIELFESKQDELEILVQLEMRKMLLDRMVNLLCRGCVVPVVKYVKQCWQRGDTDI 519
           ++++   E+    + E+ I++  E+ K L + ++  L RG  V ++ +     +      
Sbjct: 120 EIIEKIKEVINEAEKEIIIVLPYEIFKELKEPLIRALKRGVRVLILVFPI---RDVPLAF 176

Query: 520 SLIRYFVTEVLEAITPPYTPEFV--QLFLPIVENDEITGTMRGDGENDPVSEFIVQTLEH 577
            + ++F+  +++          V     + I+       T+ G   +     FI   + +
Sbjct: 177 IVNKFFLDALIDIRFGDPLAGEVIISDRIRILIVPSDGETVEGVIYSFDEIAFITHNIFY 236

Query: 578 EWHQ 581
           E  +
Sbjct: 237 EVWK 240


>gnl|CDD|177383 PHA02548, 24, capsid vertex protein; Provisional.
          Length = 412

 Score = 31.2 bits (71), Expect = 1.7
 Identities = 22/72 (30%), Positives = 34/72 (47%), Gaps = 10/72 (13%)

Query: 140 EMIEHPTWRSLIYRLAEEYPDCLMLNFTIKLISDAG----FQREITSISTAAQQI----- 190
           E++E     S    L E   + ++L   I+L+SDA     F+     I+  + QI     
Sbjct: 105 EVLEAGPLASSETDLEEAIQEAVVLG-KIRLVSDAASTEKFEDGDEEIAETSFQIDKWQV 163

Query: 191 EVFSRVLKTSIS 202
            V SR LKTS++
Sbjct: 164 PVRSRKLKTSLT 175


>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K.  CENP-K is one
           of seven new CENP-A-nucleosome distal (CAD) centromere
           components (the others being CENP-L, CENP-O, CENP-P,
           CENP-Q, CENP-R and CENP-S) that are identified as
           assembling on the CENP-A nucleosome associated complex,
           NAC. The CENP-A NAC is essential, as disruption of the
           complex causes errors of chromosome alignment and
           segregation that preclude cell survival despite
           continued centromere-derived mitotic checkpoint
           signalling. CENP-K is centromere-associated through its
           interaction with one or more components of the CENP-A
           NAC.
          Length = 273

 Score = 30.6 bits (69), Expect = 2.2
 Identities = 14/46 (30%), Positives = 21/46 (45%), Gaps = 4/46 (8%)

Query: 314 PPIDLIRIPQFLELLTDSLFK----PGVKLNPEHKPKYIYLLAYAS 355
           P + LI + + LE+L + L      P VK+     P Y+ LL    
Sbjct: 209 PSVQLITLHEILEILINRLMDTPHDPYVKVKDSFWPPYVELLLRYG 254


>gnl|CDD|176458 cd03468, PolY_like, DNA Polymerase Y-family.  Y-family DNA
           polymerases are a specialized subset of polymerases that
           facilitate translesion synthesis (TLS), a process that
           allows the bypass of a variety of DNA lesions.  Unlike
           replicative polymerases, TLS polymerases lack
           proofreading activity and have low fidelity and low
           processivity.  They use damaged DNA as templates and
           insert nucleotides opposite the lesions. The active
           sites of TLS polymerases are large and flexible to allow
           the accomodation of distorted bases.  Expression of
           Y-family polymerases is often induced by DNA damage and
           is believed to be highly regulated. TLS is likely
           induced by the monoubiquitination of the replication
           clamp PCNA, which provides a scaffold for TLS
           polymerases to bind in order to access the lesion.
           Because of their high error rates, TLS polymerases are
           potential targets for cancer treatment and prevention.
          Length = 335

 Score = 30.8 bits (70), Expect = 2.3
 Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 2/65 (3%)

Query: 481 QLEMRKMLLDRMVNLLCRGCVVPVVKYVKQCWQRGDTDISLIRYFVTEVLEAITPPYTPE 540
            L +R +   R+   L R      V  V     R   D   +   + E LE +  P    
Sbjct: 267 FLALRGLGARRLSLTLFR--EDGRVTRVLVGLARPSRDDLPLLRLLRERLERLALPRGIA 324

Query: 541 FVQLF 545
            V+L 
Sbjct: 325 PVRLL 329


>gnl|CDD|185231 PRK15331, PRK15331, chaperone protein SicA; Provisional.
          Length = 165

 Score = 29.6 bits (66), Expect = 2.5
 Identities = 16/45 (35%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 39  CLEKFSTPDYIMEPGIFSQLKRYFQAGGNPEQVIELLSQN-YKAV 82
           C+  F  PDY M      QLK+ FQ   +   V   L +N Y+ V
Sbjct: 64  CIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPV 108


>gnl|CDD|236007 PRK07379, PRK07379, coproporphyrinogen III oxidase; Provisional.
          Length = 400

 Score = 30.0 bits (68), Expect = 3.6
 Identities = 15/41 (36%), Positives = 20/41 (48%), Gaps = 5/41 (12%)

Query: 573 QTLEHEWHQ-LEDDKKIYQCWPVHFSSVDFVIEPKGTYGRR 612
           QTLE  W   LE    +    P H S  D V+EP   +G++
Sbjct: 179 QTLED-WQASLEAAIALN---PTHLSCYDLVLEPGTAFGKQ 215


>gnl|CDD|233925 TIGR02553, SipD_IpaD_SspD, type III effector protein
           IpaD/SipD/SspD.  These proteins are found within type
           III secretion operons and have been shown to be secreted
           by that system.
          Length = 313

 Score = 29.6 bits (66), Expect = 4.7
 Identities = 15/82 (18%), Positives = 38/82 (46%), Gaps = 9/82 (10%)

Query: 69  EQVIELLSQNYKAVAQMANLVAEWLILGG------VNVTEVQAMVE---NHLKDMILKTF 119
           E V+E  +  Y+A + + + + +W+  G       ++V +++A+++   +HL +     F
Sbjct: 130 ENVVEGYTDFYQAFSDILSKMQDWISPGKDGNNVKLDVGKLKALLQQLIDHLPNKKNVLF 189

Query: 120 DPKKADTIFTEEGETPAWLTEM 141
             +       +E +   W  E+
Sbjct: 190 PAQDGGMQLPKEADARRWRKEL 211


>gnl|CDD|218236 pfam04735, Baculo_helicase, Baculovirus DNA helicase. 
          Length = 1173

 Score = 29.9 bits (68), Expect = 5.0
 Identities = 22/76 (28%), Positives = 27/76 (35%), Gaps = 3/76 (3%)

Query: 446 LLDEVVTCHTFLHHKVLQLFIELFESKQDELEILVQLEMRKMLLDRMVNLLCRGCVVPVV 505
           L      C      K+  LF EL+     EL IL      K + D + NL C  C     
Sbjct: 593 LFGNCANCKIEEQDKLNDLFRELWNYDHSELIILGLYLNDKKMSDLIHNLKCSECKESA- 651

Query: 506 KYVKQCWQRGDTDISL 521
              K+C      DI L
Sbjct: 652 --SKKCDCLKKIDIDL 665


>gnl|CDD|219096 pfam06586, TraK, TraK protein.  This family consists of several
           TraK proteins from Escherichia coli, Salmonella typhi
           and Salmonella typhimurium. TraK is known to be
           essential for pilus assembly but its exact role in this
           process is unknown.
          Length = 231

 Score = 28.8 bits (65), Expect = 6.5
 Identities = 27/157 (17%), Positives = 53/157 (33%), Gaps = 28/157 (17%)

Query: 166 FTIKLISDAG--FQREITSISTAAQQIEVFSRVLKTSISN-FLESSDDWQSSVEECAKMV 222
           FT+ + ++ G  F   +T  +   + I + S            E+S  ++ ++ +  + +
Sbjct: 87  FTLFVTTEKGRTFSLTLTPKAGPGETIRLTSEPALKREEAAVWETSTPYEKALVKLTRAM 146

Query: 223 CHG------QHTYVYSQVLLQVLSLESK-------GGSKVKRLAQEITKCAQQSGHDVTP 269
             G          V  + L     L             K KR    +T        +V P
Sbjct: 147 LTGKLPDGYSVKPVTKRTLTLPKGLRLTPVAVYDGNELKGKRY--RLT--------NVGP 196

Query: 270 IIMALNGAAFYPQANQALSSMISRNSLNPADITVLFR 306
             + L  + FY       +  +SR  L P + T ++ 
Sbjct: 197 KPVELRESDFYSPG--TRAVALSRPVLAPGETTQVYV 231


>gnl|CDD|175984 cd04017, C2D_Ferlin, C2 domain fourth repeat in Ferlin.  Ferlins
           are involved in vesicle fusion events.  Ferlins and
           other proteins, such as Synaptotagmins, are implicated
           in facilitating the fusion process when cell membranes
           fuse together.  There are six known human Ferlins:
           Dysferlin (Fer1L1), Otoferlin (Fer1L2), Myoferlin
           (Fer1L3), Fer1L4, Fer1L5, and Fer1L6.  Defects in these
           genes can lead to a wide range of diseases including
           muscular dystrophy (dysferlin), deafness (otoferlin),
           and infertility (fer-1, fertilization factor-1).
           Structurally they have 6 tandem C2 domains, designated
           as (C2A-C2F) and a single C-terminal transmembrane
           domain, though there is a new study that disputes this
           and claims that there are actually 7 tandem C2 domains
           with another C2 domain inserted between C2D and C2E. 
           In a subset of them (Dysferlin, Myoferlin, and Fer1)
           there is an additional conserved domain called DysF. C2
           domains fold into an 8-standed beta-sandwich that can
           adopt 2 structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions. This cd contains the fourth C2
           repeat, C2D, and has a type-II topology.
          Length = 135

 Score = 27.9 bits (63), Expect = 7.6
 Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 19/80 (23%)

Query: 283 ANQALSSMISRNSLNPA-DITVLFR---LYSSP-----DPPPI-------DLIRIPQFLE 326
            NQ+  + + + +L+P  D T++F    LY SP     +PP +       D +   +FL 
Sbjct: 31  LNQSQETEVIKETLSPTWDQTLIFDEVELYGSPEEIAQNPPLVVVELFDQDSVGKDEFLG 90

Query: 327 LLTDSLFKPGVKLNPEHKPK 346
               S+ KP VKL+ E    
Sbjct: 91  ---RSVAKPLVKLDLEEDFP 107


>gnl|CDD|225749 COG3208, GrsT, Predicted thioesterase involved in non-ribosomal
           peptide biosynthesis [Secondary metabolites
           biosynthesis, transport, and catabolism].
          Length = 244

 Score = 28.8 bits (65), Expect = 7.7
 Identities = 20/58 (34%), Positives = 24/58 (41%), Gaps = 10/58 (17%)

Query: 499 GCVVPVVKYVKQCWQRGDTDI--SLIRYFVT--EVLEAITPPYTPEFVQLFLPIVEND 552
           GC  P     KQ     D D    L+    T  E+LE       PE + LFLPI+  D
Sbjct: 109 GCRAPHYDRGKQIHHLDDADFLADLVDLGGTPPELLE------DPELMALFLPILRAD 160


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 29.1 bits (66), Expect = 8.2
 Identities = 12/28 (42%), Positives = 20/28 (71%), Gaps = 1/28 (3%)

Query: 17  MVDVDSDGGDDPMENPQEVLNECLEKFS 44
           ++D+    G DP+EN + ++NE LEK+S
Sbjct: 244 LIDIAPIDGSDPVENARIIINE-LEKYS 270


>gnl|CDD|215170 PLN02301, PLN02301, pectinesterase/pectinesterase inhibitor.
          Length = 548

 Score = 29.1 bits (65), Expect = 8.2
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 10/56 (17%)

Query: 95  LGGVNVTEVQAMVENHLKDMILKT---------FDPKKADTIFTEEGETPAWLTEM 141
           L G+N    Q+M +  LKD+I +            P K D +    G+ P+WLT  
Sbjct: 168 LDGINGPSRQSM-KPGLKDLISRARTSLAILVSVSPAKEDLLMPLSGDFPSWLTSK 222


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.135    0.402 

Gapped
Lambda     K      H
   0.267   0.0609    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 31,587,499
Number of extensions: 3090353
Number of successful extensions: 2930
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2926
Number of HSP's successfully gapped: 27
Length of query: 620
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 517
Effective length of database: 6,369,140
Effective search space: 3292845380
Effective search space used: 3292845380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (27.9 bits)