RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10689
         (369 letters)



>gnl|CDD|130026 TIGR00951, 2A43, Lysosomal Cystine Transporter.  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 220

 Score =  153 bits (389), Expect = 5e-45
 Identities = 74/197 (37%), Positives = 112/197 (56%), Gaps = 8/197 (4%)

Query: 67  LSTIIGWIYFVSWGASAHCQVLVNWQRKSVIGLNFDYLGLNMIGHSLYGVYNFVLYFSYP 126
           LS I+GW Y  +W  S + Q++ NW+RKS  GL+FD++ LN++G + Y ++NF+  + + 
Sbjct: 3   LSQILGWGYVAAWSISFYPQIIKNWRRKSAEGLSFDFVMLNLVGFTAYVIFNFLQLYCWS 62

Query: 127 VQEEYYLRNPHSANPIQLNDVVFSAHNSLVTLYTVYQCYKYERGNQRLSKGAKLLYTVYA 186
           +  E+ L +P     +  NDV F+ H  L+    ++QC  YERG QR+S    L   V  
Sbjct: 63  ITNEFPLSSPG----VTQNDVFFTLHAILICFIVLHQCGDYERGWQRVSNPWILRILVAL 118

Query: 187 VWVFLSGFLALVHKIHWLDFINQCALIKLTITLIKYAPQAYMNFKRKSTVGWSIGNVLLD 246
           +  F +  +AL+  I  L F+   + IK+ +TL+KY PQA  N+  KST   SI  V LD
Sbjct: 119 LACFATLLVALLSPITPLAFVTMLSYIKVAVTLVKYFPQAATNYHNKSTGQLSIITVFLD 178

Query: 247 FAGGLFSLLQMLILAYN 263
           F G    LLQ +  + N
Sbjct: 179 FTG----LLQRIFQSVN 191



 Score = 61.4 bits (149), Expect = 5e-11
 Identities = 30/89 (33%), Positives = 40/89 (44%), Gaps = 13/89 (14%)

Query: 279 LSLIKTKFTLIFW---AYMNFKRKSTVGWSIGNVLLDFAGGLFSLLQMLILAYNFDDWNS 335
           LS IK   TL+ +   A  N+  KST   SI  V LDF G    LLQ +  + N      
Sbjct: 142 LSYIKVAVTLVKYFPQAATNYHNKSTGQLSIITVFLDFTG----LLQRIFQSVNET---- 193

Query: 336 LLTDTTKLGLAAFTLMFDVLFLVQHYILY 364
              D  K GL   + +F+ LF  Q +  +
Sbjct: 194 --GDPLKAGLFVVSSLFNGLFAAQVFFYW 220


>gnl|CDD|202926 pfam04193, PQ-loop, PQ loop repeat.  Members of this family are all
           membrane bound proteins possessing a pair of repeats
           each spanning two transmembrane helices connected by a
           loop. The PQ motif found on loop 2 is critical for the
           localisation of cystinosin to lysosomes. However, the PQ
           motif appears not to be a general lysosome-targeting
           motif. It is thought likely to possess a more general
           function. Most probably this involves a glutamine
           residue.
          Length = 61

 Score = 59.9 bits (146), Expect = 7e-12
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 210 CALIKLTITLIKYAPQAYMNFKRKSTVGWSIGNVLLDFAGGLFSLLQMLILAYNF 264
              I L + L+ Y PQ   N++RKST G SI  +LL   G +  LL  ++  +  
Sbjct: 5   LGYISLVLWLVSYIPQIIKNYRRKSTEGLSILFLLLWLLGDILYLLYNILNGFPP 59



 Score = 50.6 bits (122), Expect = 1e-08
 Identities = 22/61 (36%), Positives = 36/61 (59%)

Query: 67  LSTIIGWIYFVSWGASAHCQVLVNWQRKSVIGLNFDYLGLNMIGHSLYGVYNFVLYFSYP 126
           LS ++G+I  V W  S   Q++ N++RKS  GL+  +L L ++G  LY +YN +  F   
Sbjct: 1   LSNVLGYISLVLWLVSYIPQIIKNYRRKSTEGLSILFLLLWLLGDILYLLYNILNGFPPL 60

Query: 127 V 127
           +
Sbjct: 61  L 61



 Score = 46.0 bits (110), Expect = 5e-07
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query: 293 YMNFKRKSTVGWSIGNVLLDFAGGLFSLLQMLILAYNFDD 332
             N++RKST G SI  +LL   G +  LL  ++  +    
Sbjct: 22  IKNYRRKSTEGLSILFLLLWLLGDILYLLYNILNGFPPLL 61


>gnl|CDD|128923 smart00679, CTNS, Repeated motif present between transmembrane
           helices in cystinosin, yeast ERS1p, mannose-P-dolichol
           utilization defect 1, and other hypothetical proteins.
           Function unknown, but likely to be associated with the
           glycosylation machinery.
          Length = 32

 Score = 42.8 bits (102), Expect = 4e-06
 Identities = 14/32 (43%), Positives = 18/32 (56%)

Query: 220 IKYAPQAYMNFKRKSTVGWSIGNVLLDFAGGL 251
           +   PQ   N++RKST G SI  VLL   G +
Sbjct: 1   VSLLPQIIKNYRRKSTEGLSILFVLLWLLGDI 32



 Score = 35.1 bits (82), Expect = 0.002
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 292 AYMNFKRKSTVGWSIGNVLLDFAGGL 317
              N++RKST G SI  VLL   G +
Sbjct: 7   IIKNYRRKSTEGLSILFVLLWLLGDI 32



 Score = 30.5 bits (70), Expect = 0.085
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 86  QVLVNWQRKSVIGLNFDYLGLNMIG 110
           Q++ N++RKS  GL+  ++ L ++G
Sbjct: 6   QIIKNYRRKSTEGLSILFVLLWLLG 30


>gnl|CDD|237659 PRK14287, PRK14287, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 31.1 bits (70), Expect = 1.0
 Identities = 15/46 (32%), Positives = 24/46 (52%), Gaps = 8/46 (17%)

Query: 6   NGRNLSKSGQNTAKRSNHLNQTLNGYNPGNAIVLIQLQPYEQLEKD 51
           +G+ L  SGQ  A         +NG  PG+  V+  ++P+E  E+D
Sbjct: 223 HGQQLRVSGQGEAG--------VNGGPPGDLYVVFNVKPHEFFERD 260


>gnl|CDD|179143 PRK00855, PRK00855, argininosuccinate lyase; Provisional.
          Length = 459

 Score = 29.0 bits (66), Expect = 4.2
 Identities = 18/47 (38%), Positives = 23/47 (48%), Gaps = 12/47 (25%)

Query: 315 GGLFSLLQM---LILAYNFD---D----WNSLLTDTTKLGLAAFTLM 351
           G L  LL +   L LAYN D   D    ++++  DT KL L A   M
Sbjct: 303 GNLTGLLTVMKGLPLAYNRDLQEDKEPLFDAV--DTLKLSLEAMAGM 347


>gnl|CDD|218343 pfam04950, DUF663, Protein of unknown function (DUF663).  This
           family contains several uncharacterized eukaryotic
           proteins.
          Length = 293

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 3/34 (8%)

Query: 31  YNPGNAIVLIQLQPYEQLEKDVVVHVDVEKGHHW 64
           +N    I++  L P+E   K  VV+  ++K H W
Sbjct: 91  FNSKQLIIVFGLLPHE--HKMSVVNFKIKK-HRW 121


>gnl|CDD|200570 cd10946, CE4_Mll8295_like, Putative catalytic NodB homology domain
           of uncharacterized Mll8295 protein encoded from
           Rhizobium loti and its bacterial homologs.  This family
           is represented by a putative polysaccharide deacetylase
           Mll8295 encoded from Rhizobium loti. Although its
           biological function still remains unknown, Mll8295 shows
           high sequence homology to the catalytic domain of
           Streptococcus pneumoniae polysaccharide deacetylase PgdA
           (SpPgdA), which is an extracellular metal-dependent
           polysaccharide deacetylase with de-N-acetylase activity
           toward a hexamer of chitooligosaccharide
           N-acetylglucosamine, but not shorter
           chitooligosaccharides or a synthetic peptidoglycan
           tetrasaccharide. Both Mll8295 and SpPgdA belong to the
           carbohydrate esterase 4 (CE4) superfamily. This family
           also includes many uncharacterized bacterial
           polysaccharide deacetylases.
          Length = 217

 Score = 28.1 bits (63), Expect = 6.0
 Identities = 12/39 (30%), Positives = 18/39 (46%)

Query: 1   WKPPKNGRNLSKSGQNTAKRSNHLNQTLNGYNPGNAIVL 39
           W+P   G    +S     K+ +HL  T N +  G  I+L
Sbjct: 144 WQPEDWGGTPVQSVDEMVKKIDHLLNTNNTFTKGKVILL 182


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.325    0.140    0.447 

Gapped
Lambda     K      H
   0.267   0.0863    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,825,882
Number of extensions: 1819445
Number of successful extensions: 2010
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2004
Number of HSP's successfully gapped: 44
Length of query: 369
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 271
Effective length of database: 6,590,910
Effective search space: 1786136610
Effective search space used: 1786136610
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 60 (26.6 bits)