RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14533
         (209 letters)



>gnl|CDD|238020 cd00063, FN3, Fibronectin type 3 domain; One of three types of
           internal repeats found in the plasma protein
           fibronectin. Its tenth fibronectin type III repeat
           contains an RGD cell recognition sequence in a flexible
           loop between 2 strands. Approximately 2% of all animal
           proteins contain the FN3 repeat; including extracellular
           and intracellular proteins, membrane spanning cytokine
           receptors, growth hormone receptors, tyrosine
           phosphatase receptors, and adhesion molecules. FN3-like
           domains are also found in bacterial glycosyl hydrolases.
          Length = 93

 Score = 70.2 bits (172), Expect = 5e-16
 Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 7/97 (7%)

Query: 58  PSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGA 117
           P+ L    ++S S+ ++W+PP      G + GY + YRE+G G       T     S   
Sbjct: 4   PTNLRVTDVTSTSVTLSWTPPED--DGGPITGYVVEYREKGSGDWKEVEVTPGSETS--- 58

Query: 118 GVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQT 154
              TLTGL+   +Y+  V+A N  G  P S  V+V T
Sbjct: 59  --YTLTGLKPGTEYEFRVRAVNGGGESPPSESVTVTT 93



 Score = 42.5 bits (100), Expect = 1e-05
 Identities = 14/38 (36%), Positives = 23/38 (60%), Gaps = 2/38 (5%)

Query: 162 PPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYY 199
           PP ++  + ++STS++++W PP      G I GY V Y
Sbjct: 3   PPTNLRVTDVTSTSVTLSWTPPE--DDGGPITGYVVEY 38



 Score = 40.6 bits (95), Expect = 5e-05
 Identities = 16/42 (38%), Positives = 19/42 (45%)

Query: 8  NSGTEQWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHT 49
            G+E    L  L P T Y  RV A N  G   PS+ + V T
Sbjct: 52 TPGSETSYTLTGLKPGTEYEFRVRAVNGGGESPPSESVTVTT 93


>gnl|CDD|200951 pfam00041, fn3, Fibronectin type III domain. 
          Length = 84

 Score = 59.4 bits (144), Expect = 4e-12
 Identities = 30/91 (32%), Positives = 42/91 (46%), Gaps = 8/91 (8%)

Query: 57  EPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDG 116
            P+ L    ++S S+ ++WSPPP    NG + GY + YR       N             
Sbjct: 2   APTNLTVTDVTSTSLTLSWSPPPG---NGPITGYEVEYRP-----VNGGEEWKEITVPGT 53

Query: 117 AGVATLTGLRKYRKYDIVVQAFNEKGPGPMS 147
               TLTGL+   +Y++ VQA N  G GP S
Sbjct: 54  TTSYTLTGLKPGTEYEVRVQAVNGAGEGPPS 84



 Score = 42.8 bits (101), Expect = 6e-06
 Identities = 17/43 (39%), Positives = 27/43 (62%), Gaps = 3/43 (6%)

Query: 162 PPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYENMRE 204
            P ++T + ++STSL+++W PPP    NG I GY+V Y  +  
Sbjct: 2   APTNLTVTDVTSTSLTLSWSPPPG---NGPITGYEVEYRPVNG 41



 Score = 35.9 bits (83), Expect = 0.002
 Identities = 19/41 (46%), Positives = 19/41 (46%), Gaps = 1/41 (2%)

Query: 3  EWKSQN-SGTEQWAVLQDLLPATLYRVRVLAENSLGAGRPS 42
          EWK     GT     L  L P T Y VRV A N  G G PS
Sbjct: 44 EWKEITVPGTTTSYTLTGLKPGTEYEVRVQAVNGAGEGPPS 84


>gnl|CDD|214495 smart00060, FN3, Fibronectin type 3 domain.  One of three types of
           internal repeat within the plasma protein, fibronectin.
           The tenth fibronectin type III repeat contains a RGD
           cell recognition sequence in a flexible loop between 2
           strands. Type III modules are present in both
           extracellular and intracellular proteins.
          Length = 83

 Score = 58.8 bits (142), Expect = 8e-12
 Identities = 29/87 (33%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 58  PSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGA 117
           PS L    ++S S+ ++W PPP     G ++GY + YRE+G       N T         
Sbjct: 4   PSNLRVTDVTSTSVTLSWEPPPDDGITGYIVGYRVEYREEG-SEWKEVNVTPSSTS---- 58

Query: 118 GVATLTGLRKYRKYDIVVQAFNEKGPG 144
              TLTGL+   +Y+  V+A N  G G
Sbjct: 59  --YTLTGLKPGTEYEFRVRAVNGAGEG 83



 Score = 41.1 bits (96), Expect = 3e-05
 Identities = 15/39 (38%), Positives = 27/39 (69%)

Query: 162 PPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYE 200
           PP ++  + ++STS++++W+PPP     G I+GY+V Y 
Sbjct: 3   PPSNLRVTDVTSTSVTLSWEPPPDDGITGYIVGYRVEYR 41



 Score = 33.7 bits (77), Expect = 0.012
 Identities = 15/40 (37%), Positives = 17/40 (42%), Gaps = 1/40 (2%)

Query: 1  DVEWKSQN-SGTEQWAVLQDLLPATLYRVRVLAENSLGAG 39
            EWK  N + +     L  L P T Y  RV A N  G G
Sbjct: 44 GSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83


>gnl|CDD|222106 pfam13403, Hint_2, Hint domain.  This domain is found in inteins.
          Length = 147

 Score = 28.6 bits (65), Expect = 1.3
 Identities = 22/70 (31%), Positives = 27/70 (38%), Gaps = 17/70 (24%)

Query: 22  PATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPP--- 78
           P  L  VR+    +LG G P   LLV           S  H V +SS    + +      
Sbjct: 53  PPRLRPVRI-RAGALGNGLPERDLLV-----------SPQHRVLVSSPIAELLFGEDEVL 100

Query: 79  -PA-HLTNGD 86
            PA HL NG 
Sbjct: 101 VPAKHLVNGP 110


>gnl|CDD|221976 pfam13199, Glyco_hydro_66, Glycosyl hydrolase family 66.  This
           family is a set of glycosyl hydrolase enzymes including
           cycloisomaltooligosaccharide glucanotransferase
           (EC:2.4.1.-) and dextranase (EC:3.2.1.11) activities.
          Length = 564

 Score = 28.4 bits (64), Expect = 2.9
 Identities = 12/53 (22%), Positives = 21/53 (39%)

Query: 132 DIVVQAFNEKGPGPMSSEVSVQTLEDVPAAPPLDITCSALSSTSLSVTWQPPP 184
            + + A  + G G  +  V    L +V     + +     +S SL+  W PP 
Sbjct: 12  TVNLTASLKNGTGGGTVTVRYYHLGEVVGETEVTLKLKPGASWSLTFEWTPPA 64


>gnl|CDD|79056 PRK12581, PRK12581, oxaloacetate decarboxylase; Provisional.
          Length = 468

 Score = 28.2 bits (62), Expect = 3.2
 Identities = 17/56 (30%), Positives = 25/56 (44%)

Query: 44  PLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGF 99
           PL+VHT A    ++ + L AV   +D I    SP     +       YL  +E G+
Sbjct: 208 PLIVHTHATSGISQMTYLAAVEAGADRIDTALSPFSEGTSQPATESMYLALKEAGY 263


>gnl|CDD|215292 PLN02533, PLN02533, probable purple acid phosphatase.
          Length = 427

 Score = 28.1 bits (62), Expect = 4.2
 Identities = 24/86 (27%), Positives = 37/86 (43%), Gaps = 8/86 (9%)

Query: 30  VLAENSLGAGRPS--DPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTW---SPPPAHLTN 84
           VLA N L   RP     L++H + E     P  +H   +  D +R++W      P  +  
Sbjct: 14  VLAGNVLSYDRPGTRKNLVIHPDNEDDPTHPDQVHISLVGPDKMRISWITQDSIPPSVVY 73

Query: 85  GDLLGYYLGYREQGFGRQNSYNFTTI 110
           G + G Y G      G  +SY++  I
Sbjct: 74  GTVSGKYEG---SANGTSSSYHYLLI 96


>gnl|CDD|114473 pfam05749, Rubella_E2, Rubella membrane glycoprotein E2.  Rubella
           virus (RV), the sole member of the genus Rubivirus
           within the family Togaviridae, is a small enveloped,
           positive strand RNA virus. The nucleocapsid consists of
           40S genomic RNA and a single species of capsid protein
           which is enveloped within a host-derived lipid bilayer
           containing two viral glycoproteins, E1 (58 kDa) and E2
           (42-46 kDa). In virus infected cells, RV matures by
           budding either at the plasma membrane, or at the
           internal membranes depending on the cell type and enters
           adjacent uninfected cells by a membrane fusion process
           in the endosome, directed by E1-E2 heterodimers. The
           heterodimer formation is crucial for E1 transport out of
           the endoplasmic reticulum to the Golgi and plasma
           membrane. In RV E1, a cysteine at position 82 is crucial
           for the E1-E2 heterodimer formation and cell surface
           expression of the two proteins.
          Length = 267

 Score = 27.4 bits (60), Expect = 4.9
 Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 53  PPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYL 92
           PP A P+ L   A S+ +     +P P H    D  G +L
Sbjct: 97  PPPATPTPLTTAANSTTAATPATAPAPCHAGLNDSCGGFL 136


>gnl|CDD|180337 PRK05985, PRK05985, cytosine deaminase; Provisional.
          Length = 391

 Score = 27.2 bits (61), Expect = 6.3
 Identities = 15/36 (41%), Positives = 18/36 (50%), Gaps = 7/36 (19%)

Query: 68  SDSIRVTWSPPPAHLTNGDLLG--YYLGYREQGFGR 101
           +D IR TW P      NGD+L     +GYR  GF  
Sbjct: 295 NDGIRDTWWP----YGNGDMLERAMLIGYR-SGFRT 325


>gnl|CDD|153301 cd07617, BAR_Endophilin_B2, The Bin/Amphiphysin/Rvs (BAR) domain of
           Endophilin-B2.  BAR domains are dimerization, lipid
           binding and curvature sensing modules found in many
           different proteins with diverse functions. Endophilins
           play roles in synaptic vesicle formation, virus budding,
           mitochondrial morphology maintenance, receptor-mediated
           endocytosis inhibition, and endosomal sorting.
           Endophilins contain an N-terminal N-BAR domain (BAR
           domain with an additional N-terminal amphipathic helix),
           followed by a variable region containing proline
           clusters, and a C-terminal SH3 domain. They are
           classified into two types, A and B. Vertebrates contain
           two endophilin-B isoforms. Endophilin-B proteins are
           cytoplasmic proteins expressed mainly in the heart,
           placenta, and skeletal muscle. Endophilin-B2, also
           called SH3GLB2 (SH3-domain GRB2-like endophilin B2), is
           a cytoplasmic protein that interacts with the apoptosis
           inducer Bax. It is overexpressed in prostate cancer
           metastasis and has been identified as a cancer antigen
           with potential utility in immunotherapy. Endophilin-B2
           forms homo- and heterodimers (with endophilin-B1)
           through its BAR domain, which can bind and bend
           membranes.
          Length = 220

 Score = 26.9 bits (59), Expect = 6.7
 Identities = 12/32 (37%), Positives = 16/32 (50%)

Query: 79  PAHLTNGDLLGYYLGYREQGFGRQNSYNFTTI 110
           P+ +TN +LLG Y+      FG    Y  T I
Sbjct: 65  PSRVTNAELLGQYMTEAANDFGPGTPYGKTLI 96


>gnl|CDD|218627 pfam05540, Serpulina_VSP, Serpulina hyodysenteriae variable surface
           protein.  This family consists of several variable
           surface proteins from Serpulina hyodysenteriae.
          Length = 377

 Score = 26.8 bits (59), Expect = 8.9
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query: 92  LGYREQGFGRQNSYNFTTIPNRSDGAGVAT 121
           +GY   GFG    YN+T     + G GV T
Sbjct: 91  IGYTSDGFGIGVGYNYTYTAAANKGLGVHT 120


>gnl|CDD|236209 PRK08265, PRK08265, short chain dehydrogenase; Provisional.
          Length = 261

 Score = 26.5 bits (59), Expect = 9.6
 Identities = 11/31 (35%), Positives = 15/31 (48%), Gaps = 7/31 (22%)

Query: 63  AVAISSDSIRV-------TWSPPPAHLTNGD 86
           A+ ++ D IRV       TWS     L+ GD
Sbjct: 165 AMDLAPDGIRVNSVSPGWTWSRVMDELSGGD 195


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.132    0.397 

Gapped
Lambda     K      H
   0.267   0.0737    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,821,165
Number of extensions: 991675
Number of successful extensions: 587
Number of sequences better than 10.0: 1
Number of HSP's gapped: 581
Number of HSP's successfully gapped: 24
Length of query: 209
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 116
Effective length of database: 6,812,680
Effective search space: 790270880
Effective search space used: 790270880
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 57 (25.7 bits)