RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11780
         (383 letters)



>gnl|CDD|206637 cd11304, Cadherin_repeat, Cadherin tandem repeat domain.  Cadherins
           are glycoproteins involved in Ca2+-mediated cell-cell
           adhesion. The cadherin repeat domains occur as tandem
           repeats in the extracellular regions, which are thought
           to mediate cell-cell contact when bound to calcium. They
           play numerous roles in cell fate, signalling,
           proliferation, differentiation, and migration; members
           include E-, N-, P-, T-, VE-, CNR-, proto-, and
           FAT-family cadherin, desmocollin, and desmoglein, a
           large variety of domain architectures with varying
           repeat copy numbers. Cadherin-repeat containing proteins
           exist as monomers, homodimers, or heterodimers.
          Length = 98

 Score = 91.2 bits (227), Expect = 8e-23
 Identities = 42/100 (42%), Positives = 54/100 (54%), Gaps = 8/100 (8%)

Query: 185 VTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGS-QGIFDIDPNTGNIFTLKELDR 243
              V E+ P GT VL V A D D G N ++ YSI SG+  G+F IDP+TG I T K LDR
Sbjct: 3   EVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDR 62

Query: 244 ESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDV 283
           E  +S    Y L +  T+      P  ++   VTI + DV
Sbjct: 63  EEQSS----YTLTVTATDGG---GPPLSSTATVTITVLDV 95



 Score = 57.7 bits (140), Expect = 1e-10
 Identities = 30/115 (26%), Positives = 41/115 (35%), Gaps = 26/115 (22%)

Query: 60  DTPAGTFIYRLRGADPDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDGDEKDLFS 119
           + P GT +  +   DPD                         +V Y +  V G+E  LFS
Sbjct: 9   NAPPGTVVLTVSATDPDSGENG--------------------EVTYSI--VSGNEDGLFS 46

Query: 120 ISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVED 174
           I    G   I     LD E +  Y L + A D        + TA + + V DV D
Sbjct: 47  IDPSTG--EITTAKPLDREEQSSYTLTVTATDGGGPPL--SSTATVTITVLDVND 97



 Score = 26.9 bits (60), Expect = 6.9
 Identities = 15/54 (27%), Positives = 22/54 (40%), Gaps = 7/54 (12%)

Query: 319 YVAEIHENSVINTPVTFLGGVVPQVFDYDQGKNG--TFQMFIEGDMGTFEITPQ 370
           Y   + EN+   T V  +        D D G+NG  T+ +    + G F I P 
Sbjct: 2   YEVSVPENAPPGTVVLTV-----SATDPDSGENGEVTYSIVSGNEDGLFSIDPS 50


>gnl|CDD|215665 pfam00028, Cadherin, Cadherin domain. 
          Length = 92

 Score = 75.4 bits (186), Expect = 4e-17
 Identities = 42/98 (42%), Positives = 52/98 (53%), Gaps = 8/98 (8%)

Query: 185 VTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSG-SQGIFDIDPNTGNIFTLKELDR 243
              V E+ PVGT VL V A D D G N +I YSI  G   G F IDP+TG++ T K LDR
Sbjct: 2   SASVPENAPVGTEVLTVTATDADLGPNGRIFYSILGGGPGGWFRIDPDTGDLSTTKPLDR 61

Query: 244 ESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIIT 281
           ES     G Y L +  T+      P  ++ T VTI + 
Sbjct: 62  ES----IGEYELTVLATDSG---GPPLSSTTTVTITVL 92



 Score = 40.0 bits (94), Expect = 2e-04
 Identities = 23/117 (19%), Positives = 36/117 (30%), Gaps = 26/117 (22%)

Query: 51  IVIRLKEGPDTPAGTFIYRLRGADPDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENV 110
               + E  + P GT +  +   D D                         ++ Y +  +
Sbjct: 1   YSASVPE--NAPVGTEVLTVTATDADLGPNG--------------------RIFYSI--L 36

Query: 111 DGDEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSNNDRRNTGTAAILV 167
            G     F I    G   +     LD E+   Y+L +LA D       +T T  I V
Sbjct: 37  GGGPGGWFRIDPDTG--DLSTTKPLDRESIGEYELTVLATDSGGPPLSSTTTVTITV 91


>gnl|CDD|214520 smart00112, CA, Cadherin repeats.  Cadherins are glycoproteins
           involved in Ca2+-mediated cell-cell adhesion. Cadherin
           domains occur as repeats in the extracellular regions
           which are thought to mediate cell-cell contact when
           bound to calcium.
          Length = 81

 Score = 65.1 bits (159), Expect = 2e-13
 Identities = 34/83 (40%), Positives = 43/83 (51%), Gaps = 8/83 (9%)

Query: 202 KAVDGDRGINNKILYSISSGS-QGIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVT 260
            A D D G N K+ YSI SG+  G+F IDP TG I T K LDRE        Y L ++ T
Sbjct: 1   SATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREE----QPEYTLTVEAT 56

Query: 261 EESKTVQPAPTAKTEVTIIITDV 283
           +      P  ++   VTI + DV
Sbjct: 57  DGG---GPPLSSTATVTITVLDV 76



 Score = 53.5 bits (129), Expect = 2e-09
 Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 6/76 (7%)

Query: 102 KVVYHLENVDGDEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSNNDRRNTG 161
           KV Y +  + G++  LFSI    G   I     LD E +  Y L + A D       +T 
Sbjct: 12  KVTYSI--LSGNDDGLFSIDPETG--EITTTKPLDREEQPEYTLTVEATDGGGPPLSSTA 67

Query: 162 TAAILVKVQDVEDQPP 177
           T  + + V DV D  P
Sbjct: 68  T--VTITVLDVNDNAP 81


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 32.9 bits (74), Expect = 0.33
 Identities = 30/118 (25%), Positives = 43/118 (36%), Gaps = 17/118 (14%)

Query: 209 GINNKILYSISSG---------SQGIFDIDPNTGNI-FTLKELDRESATSNNGAYILEIK 258
           G++N I+ SI  G         S G   I  N G   F   +   +  TS   A   E+K
Sbjct: 96  GVDNSIITSIKGGTKVTVETTESNGWHKITYNDGKTGFVNGKYLTDKVTSTPVAPTQEVK 155

Query: 259 VTEESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVNDETPTFRS 316
               ++   PA   KTE       V + ++   PAP         + D N  T   +S
Sbjct: 156 KETTTQQAAPAAETKTE-------VKQTTQATTPAPKVAETKETPVVDQNATTHAVKS 206


>gnl|CDD|218637 pfam05558, DREPP, DREPP plasma membrane polypeptide.  This family
           contains several plant plasma membrane proteins termed
           DREPPs as they are developmentally regulated plasma
           membrane polypeptides.
          Length = 201

 Score = 29.6 bits (66), Expect = 2.1
 Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 259 VTEESKTVQPAPTAKTEVTIIITD----VTEESKTVQPAPTAKTEVTIIITDVNDETP 312
           + EE K  + AP A+TE    + +    V EE K  +  P A    T       +ETP
Sbjct: 130 IVEEKKPEEEAPAAETEKKPAVKEKEIVVEEEKKEEEAVPVAAAPSTKAPETKVEETP 187


>gnl|CDD|202974 pfam04327, DUF464, Protein of unknown function (DUF464). 
          Length = 102

 Score = 28.4 bits (64), Expect = 2.1
 Identities = 13/63 (20%), Positives = 33/63 (52%), Gaps = 2/63 (3%)

Query: 119 SISTVNG-KGVIKLIGKLDYENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVEDQPP 177
           +I  VNG + +  +  K++ E+ +L  L++       N++       +L+ ++ + +Q P
Sbjct: 40  AIGAVNGLEKLAGIKPKVEVEDGYLE-LELPDDLLEKNEKAQLLLETLLLGLKTIAEQYP 98

Query: 178 EFV 180
           E++
Sbjct: 99  EYI 101


>gnl|CDD|224000 COG1074, RecB, ATP-dependent exoDNAse (exonuclease V) beta subunit
           (contains helicase and exonuclease domains) [DNA
           replication, recombination, and repair].
          Length = 1139

 Score = 29.7 bits (67), Expect = 3.1
 Identities = 16/47 (34%), Positives = 17/47 (36%), Gaps = 16/47 (34%)

Query: 45  IDE-------QTEIVIRLKEGPDTPAGTF---------IYRLRGADP 75
           IDE       Q  I+ RL  G      T          IYR RGAD 
Sbjct: 384 IDEFQDTDPQQWRILSRLFAGFKAGNRTLFLVGDPKQSIYRFRGADI 430


>gnl|CDD|234563 TIGR04364, methyltran_FxLD, methyltransferase, FxLD system.
           Members of this family resemble occur regularly in the
           vicinity of lantibiotic biosynthesis enzymes and their
           probable target, the FxLD family of putative ribosomal
           natural product precursor (TIGR04363). Members resemble
           protein-L-isoaspartate O-methyltransferase (TIGR00080)
           and a predicted methyltranserase, TIGR04188, of another
           putative peptide modification system.
          Length = 394

 Score = 29.3 bits (66), Expect = 3.3
 Identities = 11/28 (39%), Positives = 12/28 (42%)

Query: 63  AGTFIYRLRGADPDGDVLTFGVQGHGHD 90
            G+F Y  R     G    FGV  HG D
Sbjct: 311 GGSFAYLARRPAEGGGGFEFGVIAHGPD 338


>gnl|CDD|129488 TIGR00393, kpsF, KpsF/GutQ family protein.  This model describes a
           number of closely related proteins with the
           phosphosugar-binding domain SIS (Sugar ISomerase)
           followed by two copies of the CBS (named after
           Cystathionine Beta Synthase) domain. One is GutQ, a
           protein of the glucitol operon. Another is KpsF, a
           virulence factor involved in capsular polysialic acid
           biosynthesis in some pathogenic strains of E. coli
           [Energy metabolism, Sugars].
          Length = 268

 Score = 28.6 bits (64), Expect = 4.8
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 244 ESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDV 283
            S+ +    Y+L+IKV +E+  +  APT  T +T+ + D 
Sbjct: 85  NSSLARAADYVLDIKVEKEACPINLAPTTSTTLTLALGDA 124


>gnl|CDD|214776 smart00690, DM5, Domain of unknown function, currently peculiar to
           Drosophila. 
          Length = 102

 Score = 27.3 bits (61), Expect = 5.2
 Identities = 11/43 (25%), Positives = 16/43 (37%), Gaps = 5/43 (11%)

Query: 262 ESKTVQPAPTAKTEVTIII-----TDVTEESKTVQPAPTAKTE 299
           E+  +  A  A  E T I       D +E +  + PAP     
Sbjct: 48  ENAALALAKQANEEKTAIYVLSKKPDASELADQLAPAPQTSPS 90


>gnl|CDD|180266 PRK05803, PRK05803, sporulation sigma factor SigK; Reviewed.
          Length = 233

 Score = 28.0 bits (63), Expect = 7.3
 Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 4/48 (8%)

Query: 92  IRIE-NLNVMLKVVYHLENVDGDEKDLFSISTVNGKGVIKLIGKLDYE 138
           I IE NL ++  +V   EN   D  DL SI T+   G+IK I   D  
Sbjct: 53  ILIERNLRLVAHIVKKFENTGEDVDDLISIGTI---GLIKAIESFDAG 97


>gnl|CDD|132512 TIGR03472, HpnI, hopanoid biosynthesis associated glycosyl
           transferase protein HpnI.  This family of genes include
           a glycosyl transferase, group 2 domain (pfam00535) which
           are responsible, generally for the transfer of
           nucleotide-diphosphate sugars to substrates such as
           polysaccharides and lipids. The member of this clade
           from Acidithiobacillus ferrooxidans ATCC 23270
           (AFE_0974) is found in the same locus as squalene-hopene
           cyclase (SHC, TIGR01507) and other genes associated with
           the biosynthesis of hopanoid natural products.
           Similarly, in Ralstonia eutropha JMP134 (Reut_B4902)
           this gene is adjacent to HpnAB, IspH and HpnH
           (TIGR03470), although SHC itself is elsewhere in the
           genome. Notably, this gene (here named HpnI) and three
           others form a conserved set (HpnIJKL) which occur in a
           subset of all genomes containing the SHC enzyme. This
           relationship was discerned using the method of partial
           phylogenetic profiling. This group includes Zymomonas
           mobilis, the organism where the initial hopanoid
           biosynthesis locus was described consisting of the genes
           HpnA-E and SHC (HpnF). Continuing past SHC are found a
           phosphorylase enzyme (ZMO0873, i.e. HpnG, TIGR03468) and
           another radical SAM enzyme (ZMO0874), HpnH. Although
           discontinuous in Z. mobilis, we continue the gene symbol
           sequence with HpnIJKL. Hopanoids are known to feature
           polar glycosyl head groups in many organisms.
          Length = 373

 Score = 28.1 bits (63), Expect = 7.3
 Identities = 18/49 (36%), Positives = 24/49 (48%), Gaps = 2/49 (4%)

Query: 59  PDTPAGTFIYRLRGADPDGDV-LTFGVQGHGHDVIRIENLNVMLKVVYH 106
           PD PA   + RLR   PD D+ L    + HG +  ++ NL  ML    H
Sbjct: 80  PDDPALAVVRRLRADFPDADIDLVIDARRHGPN-RKVSNLINMLPHARH 127


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 28.5 bits (64), Expect = 7.8
 Identities = 17/74 (22%), Positives = 23/74 (31%), Gaps = 7/74 (9%)

Query: 261 EESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVNDETPTFRSPRYV 320
            E++ VQ  P            V    + V  AP  +    ++   V    P       V
Sbjct: 859 REAEEVQVQPVVAE------VPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVV 912

Query: 321 AEIHENSVINTPVT 334
              H   VI  PVT
Sbjct: 913 ETTHPE-VIAAPVT 925



 Score = 28.1 bits (63), Expect = 9.6
 Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 2/80 (2%)

Query: 256 EIKVTEESKTVQPAPTAKTEVTIIITD-VTEESKTVQPAPTAKTEVTIIITDVNDETPTF 314
           E+ V    + V  AP  +    ++    V  E +  +      T   +I   V ++    
Sbjct: 872 EVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVI 931

Query: 315 RSPRYVAEIHENSVINTPVT 334
            +   VA   E +    PV 
Sbjct: 932 -TESDVAVAQEVAEHAEPVV 950


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.136    0.375 

Gapped
Lambda     K      H
   0.267   0.0715    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,464,089
Number of extensions: 1931246
Number of successful extensions: 1582
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1570
Number of HSP's successfully gapped: 36
Length of query: 383
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 284
Effective length of database: 6,546,556
Effective search space: 1859221904
Effective search space used: 1859221904
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (26.9 bits)