RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5801
         (176 letters)



>gnl|CDD|218855 pfam06017, Myosin_TH1, Myosin tail. 
          Length = 199

 Score = 88.1 bits (219), Expect = 4e-22
 Identities = 39/116 (33%), Positives = 57/116 (49%), Gaps = 9/116 (7%)

Query: 64  HNNKGNDLVFTIISSEDRVMFSAFVHKTNKFNRCADRALLVTDFAVYKLD----SGAKFK 119
            N K    V +   +E +V+F+  V+K N+  + + R LL+TD A+Y +        K  
Sbjct: 33  ENPKLQKAVGSEEGNE-KVLFADKVNKVNRSGKPSKRDLLLTDKALYLVGREKVKDQKTY 91

Query: 120 AMKRGMSLQEMTGLSVSPGSDQLVVIHNNKGNDLVFTIISSEDRVGELVGALASRY 175
            +KR + L E+T +S+SP  D L VIH +   DLV       D   ELV  L   Y
Sbjct: 92  VLKRRIPLSEITSVSLSPLQDDLFVIHVSGKGDLVLES----DFKTELVTVLCKHY 143



 Score = 72.7 bits (179), Expect = 2e-16
 Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 2   FSAFVHKTNKFNRCADRALLVTDFAVYKLD----SGAKFKAMKRGMSLQEMTGLSVSPGS 57
           F+  V+K N+  + + R LL+TD A+Y +        K   +KR + L E+T +S+SP  
Sbjct: 52  FADKVNKVNRSGKPSKRDLLLTDKALYLVGREKVKDQKTYVLKRRIPLSEITSVSLSPLQ 111

Query: 58  DQLVVIHNNKGNDLVF 73
           D L VIH +   DLV 
Sbjct: 112 DDLFVIHVSGKGDLVL 127


>gnl|CDD|216448 pfam01347, Vitellogenin_N, Lipoprotein amino terminal region.  This
           family contains regions from: Vitellogenin, Microsomal
           triglyceride transfer protein and apolipoprotein B-100.
           These proteins are all involved in lipid transport. This
           family contains the LV1n chain from lipovitellin, that
           contains two structural domains.
          Length = 579

 Score = 31.9 bits (73), Expect = 0.14
 Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 16/90 (17%)

Query: 6   VHKTNKFNRCADRALLVTDFAVYKLDSGAKFKAMKRGMSLQEMTGLSVSPGSDQLVVIHN 65
           V KT   N C  R       A Y     A++           ++  +VS    + ++   
Sbjct: 159 VTKTKNLNNCQQRIAYHFGLAGY-----AEWCCGTNQKG-NFLSSSAVS----RYIL--- 205

Query: 66  NKGNDLVFTIISSE--DRVMFSAFVHKTNK 93
            KG D  F I S+E  ++V+ S F++ + K
Sbjct: 206 -KGTDNGFLIQSAETTEKVLVSPFLYNSQK 234



 Score = 28.0 bits (63), Expect = 3.0
 Identities = 18/74 (24%), Positives = 27/74 (36%), Gaps = 14/74 (18%)

Query: 88  VHKTNKFNRCADRALLVTDFAVYKLDSGAKFKAMKRGMSLQEMTGLSVSPGSDQLVVIHN 147
           V KT   N C  R       A Y     A++           ++  +VS    + ++   
Sbjct: 159 VTKTKNLNNCQQRIAYHFGLAGY-----AEWCCGTNQKG-NFLSSSAVS----RYIL--- 205

Query: 148 NKGNDLVFTIISSE 161
            KG D  F I S+E
Sbjct: 206 -KGTDNGFLIQSAE 218


>gnl|CDD|115459 pfam06802, DUF1231, Protein of unknown function (DUF1231).  This
           family consists of several Orthopoxvirus specific
           proteins predominantly of around 340 residues in length.
           This family contains both B17 and B15 proteins, the
           function of which are unknown.
          Length = 340

 Score = 28.8 bits (64), Expect = 1.3
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 8   KTNKFNRCADRALLVTDFAVYKL 30
           K  K NRCAD A+++  F  +KL
Sbjct: 132 KVKKLNRCADHAIVINPFVKFKL 154



 Score = 28.8 bits (64), Expect = 1.3
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 90  KTNKFNRCADRALLVTDFAVYKL 112
           K  K NRCAD A+++  F  +KL
Sbjct: 132 KVKKLNRCADHAIVINPFVKFKL 154


>gnl|CDD|177532 PHA03115, PHA03115, hypothetical protein; Provisional.
          Length = 340

 Score = 28.8 bits (64), Expect = 1.3
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 8   KTNKFNRCADRALLVTDFAVYKL 30
           K  K NRCAD A+++  F  +KL
Sbjct: 132 KVKKLNRCADHAIVINPFVKFKL 154



 Score = 28.8 bits (64), Expect = 1.3
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 90  KTNKFNRCADRALLVTDFAVYKL 112
           K  K NRCAD A+++  F  +KL
Sbjct: 132 KVKKLNRCADHAIVINPFVKFKL 154


>gnl|CDD|171500 PRK12439, PRK12439, NAD(P)H-dependent glycerol-3-phosphate
           dehydrogenase; Provisional.
          Length = 341

 Score = 27.4 bits (61), Expect = 3.7
 Identities = 22/64 (34%), Positives = 29/64 (45%), Gaps = 3/64 (4%)

Query: 110 YKLDSGAKFKAMKRGMSLQEMTGLSVSPGSDQLVVIHNNKGNDLVFTIISSEDR---VGE 166
           Y L  G   +AM    +L+EMT L V+ G +           DL+ T  S   R   VGE
Sbjct: 206 YSLGIGENTRAMVIARALREMTKLGVAMGGNPETFAGLAGMGDLIVTCTSQRSRNRHVGE 265

Query: 167 LVGA 170
            +GA
Sbjct: 266 QLGA 269


>gnl|CDD|227042 COG4698, COG4698, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 197

 Score = 27.4 bits (61), Expect = 3.7
 Identities = 6/34 (17%), Positives = 18/34 (52%)

Query: 135 VSPGSDQLVVIHNNKGNDLVFTIISSEDRVGELV 168
           +SP  +   +   ++ ++  F I ++  ++ EL+
Sbjct: 34  LSPREEPTHLEDASEKSEKSFQITTTRSQLNELI 67


>gnl|CDD|239038 cd02123, PA_C_RZF_like, PA_C-RZF_ like: Protease-associated (PA)
           domain C_RZF-like. This group includes various PA
           domain-containing proteins similar to C-RZF (chicken
           embryo RING zinc finger) protein. These proteins contain
           a C3H2C3 RING finger. C-RZF is expressed in embryo cells
           and is restricted mainly to brain and heart, it is
           localized to both the nucleus and endosomes. Additional
           C3H2C3 RING finger proteins belonging to this group,
           include Arabidopsis ReMembR-H2 protein and mouse
           sperizin. ReMembR-H2 is likely to be an integral
           membrane protein, and to traffic through the endosomal
           pathway. Sperizin is expressed in haploid germ cells and
           localized in the cytoplasm, it may participate in
           spermatogenesis. The significance of the PA domain to
           these proteins has not been ascertained. It may be a
           protein-protein interaction domain. At peptidase active
           sites, the PA domain may participate in substrate
           binding and/or promoting conformational changes, which
           influence the stability and accessibility of the site to
           substrate.
          Length = 153

 Score = 26.5 bits (59), Expect = 5.3
 Identities = 4/22 (18%), Positives = 14/22 (63%)

Query: 59  QLVVIHNNKGNDLVFTIISSED 80
           +  +++N++ NDL+    + ++
Sbjct: 93  KAAIVYNDESNDLISMSGNDQE 114



 Score = 26.5 bits (59), Expect = 5.3
 Identities = 4/22 (18%), Positives = 14/22 (63%)

Query: 141 QLVVIHNNKGNDLVFTIISSED 162
           +  +++N++ NDL+    + ++
Sbjct: 93  KAAIVYNDESNDLISMSGNDQE 114


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.134    0.375 

Gapped
Lambda     K      H
   0.267   0.0561    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,549,496
Number of extensions: 755900
Number of successful extensions: 530
Number of sequences better than 10.0: 1
Number of HSP's gapped: 528
Number of HSP's successfully gapped: 14
Length of query: 176
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 86
Effective length of database: 6,945,742
Effective search space: 597333812
Effective search space used: 597333812
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (25.7 bits)