RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6799
         (176 letters)



>gnl|CDD|205965 pfam13792, Sulfate_tra_GLY, Sulfate transporter N-terminal domain
           with GLY motif.  This domain is found usually at the
           N-terminus of sulfate-transporter proteins. It carries a
           highly conserved GLY sequence motif, but the function of
           the domain is not known.
          Length = 83

 Score =  122 bits (309), Expect = 6e-37
 Identities = 40/79 (50%), Positives = 58/79 (73%)

Query: 62  VPILQWLPEYNSESAVSDLVAGVTVGLTVIPQAIAYSNVAGLPPQVGLYSSFMACFVYTI 121
           +PILQWLP+Y+      DL+AG+TV L +IPQ +AY+ +AGLPP  GLY+SF+   +Y +
Sbjct: 1   LPILQWLPKYSRSWLKGDLIAGLTVALVLIPQGMAYALLAGLPPVYGLYASFVPPLIYAL 60

Query: 122 FGSCKDSAIGPTAIMSILT 140
           FGS +  ++GPTA +S+L 
Sbjct: 61  FGSSRHLSVGPTAAISLLV 79


>gnl|CDD|162054 TIGR00815, sulP, high affinity sulphate transporter 1.  The SulP
           family is a large and ubiquitous family with over 30
           sequenced members derived from bacteria, fungi, plants
           and animals. Many organisms including Bacillus subtilis,
           Synechocystis sp, Saccharomyces cerevisiae, Arabidopsis
           thaliana and Caenorhabditis elegans possess multiple
           SulP family paralogues. Many of these proteins are
           functionally characterized, and all are sulfate uptake
           transporters. Some transport their substrate with high
           affinities, while others transport it with relatively
           low affinities. Most function by SO42- :H+symport, but
           SO42- :HCO3- antiport has been reported for the rat
           protein (spP45380). The bacterial proteins vary in size
           from 434 residues to 566 residues with one exception, a
           Mycobacterium tuberculosis protein with 784 residues.
           The eukaryotic proteins vary in size from 611 residues
           to 893 residues with one exception, a protein designated
           "early nodulin 70 protein" from Glycine max which is
           reported to be of 485 residues. Thus, the eukaryotic
           proteins are almost without exception larger than the
           prokaryotic proteins. These proteins exhibit 10-13
           putative transmembrane a-helical spanners (TMSs)
           depending on the protein. The phylogenetic tree for the
           SulP family reveals five principal branches. Three of
           these are bacterial specific as follows: one bears a
           single protein from M. tuberculosis; a second bears two
           proteins, one from M. tuberculosis, the other from
           Synechocystis sp, and the third bears all remaining
           prokaryotic proteins. The remaining two clusters bear
           only eukaryotic proteins with the animal proteins all
           localized to one branch and the plant and fungal
           proteins localized to the other. The generalized
           transport reactions catalyzed by SulP family proteins
           are: (1) SO42- (out) + nH+ (out) --> SO42- (in) + nH+
           (in). (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3-
           (out) [Transport and binding proteins, Anions].
          Length = 563

 Score =  120 bits (303), Expect = 3e-32
 Identities = 52/121 (42%), Positives = 79/121 (65%), Gaps = 9/121 (7%)

Query: 63  PILQWLPEYNSESAVSDLVAGVTVGLTVIPQAIAYSNVAGLPPQVGLYSSFMACFVYTIF 122
           P+L+WLP Y  +    DL+AG+TVG+ +IPQA+AY+ +AGL P  GLY+SF+  F+Y +F
Sbjct: 1   PVLRWLPHYRLKKFKGDLMAGLTVGILLIPQAMAYAILAGLSPIYGLYTSFVPPFIYALF 60

Query: 123 GSCKDSAIGPTAIMSILTRENLHGLG---------PQFAVFLTFVSGIVQLFMGLLQLAL 173
           G+ +D AIGP A+MS+L    +  +G          + A  LT ++GI Q+ +GLL+L  
Sbjct: 61  GTSRDIAIGPVAVMSLLLGSVIARVGLQYLFDCDAIRLAFTLTLLAGIFQVILGLLRLGF 120

Query: 174 I 174
           +
Sbjct: 121 L 121


>gnl|CDD|223732 COG0659, SUL1, Sulfate permease and related transporters (MFS
           superfamily) [Inorganic ion transport and metabolism].
          Length = 554

 Score =  108 bits (271), Expect = 8e-28
 Identities = 42/120 (35%), Positives = 68/120 (56%), Gaps = 3/120 (2%)

Query: 56  KMLHKKVPILQWLPEYNSESAVSDLVAGVTVGLTVIPQAIAYSNVAGLPPQVGLYSSFMA 115
           K L  ++P L+WLP Y       DL+AG+TV    +P A+A++  AG+PP+ GLY+S +A
Sbjct: 1   KRLRSEIPTLKWLPYYFRSWLRGDLLAGLTVAAVALPLAMAFAIAAGVPPEAGLYASIVA 60

Query: 116 CFVYTIFGSCKDSAIGPTAIMSILTRENLHGL---GPQFAVFLTFVSGIVQLFMGLLQLA 172
             +Y +FG  +    GPT   +++    +  L   G   A   T ++G+ Q+ +GLL+L 
Sbjct: 61  GIIYALFGGSRGLISGPTGAFAVVLAAVIASLVETGLALAFLATLLAGVFQILLGLLRLG 120


>gnl|CDD|183265 PRK11660, PRK11660, putative transporter; Provisional.
          Length = 568

 Score = 62.3 bits (152), Expect = 7e-12
 Identities = 38/105 (36%), Positives = 60/105 (57%), Gaps = 8/105 (7%)

Query: 71  YNSESAVSDLVAGVTVGLTVIPQAIAYSNVAGLPPQVGLYSSFMACFVYTIFGSCKDSAI 130
           Y +     DL+AG+TVG+  IP A+A +  +G+PPQ GLY++ +A  V  + G  + S  
Sbjct: 24  YTAARFTRDLIAGITVGIIAIPLAMALAIASGVPPQYGLYTAAVAGIVIALTGGSRFSVS 83

Query: 131 GPTAIMSILTRENLHGLGPQFAV----FLTFVSGIVQLFMGLLQL 171
           GPTA   ++    L+ +  QF +      T +SGI+ + MGL +L
Sbjct: 84  GPTAAFVVI----LYPVSQQFGLAGLLVATLMSGIILILMGLARL 124


>gnl|CDD|221583 pfam12449, DUF3684, Protein of unknown function (DUF3684).  This
           domain family is found in eukaryotes, and is typically
           between 1072 and 1090 amino acids in length.
          Length = 1084

 Score = 32.7 bits (75), Expect = 0.10
 Identities = 30/139 (21%), Positives = 45/139 (32%), Gaps = 23/139 (16%)

Query: 25  DLSSLDPPVGPRQRVHVSEML--KEKARKVFTKKMLHKKVPILQWLPEYNSESAVSDLVA 82
             S +D   G RQR   SE+   KE  R +         +PIL W  ++   S     + 
Sbjct: 656 AESGMDSKKGRRQRYKASELYEPKEALRAL--------GLPILDWPGKWRPSSDEGKFLF 707

Query: 83  GVTVGLTVIPQAIAYSNVAGLPPQVGLYSSFMACFV-------YTIFGSCKDSAIGPTAI 135
             ++GL   P A     +        L +  M+ F+       Y  F    D +    A 
Sbjct: 708 --SLGLRSFPSAPELIELMASSDDADLRTKAMSYFISNHHANGYAAF----DISAVTIAF 761

Query: 136 MSILTRENLHGLGPQFAVF 154
           +     E    L      F
Sbjct: 762 LPAEGNEKEGKLSTPSECF 780


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 31.9 bits (73), Expect = 0.18
 Identities = 17/70 (24%), Positives = 27/70 (38%), Gaps = 11/70 (15%)

Query: 107 VGLYSSFMACFVYTIFGSCKDSAIGPTAIMSILTRENLH-------GLGPQFAVFLTFVS 159
           + LY          I+G       GP  ++S L    L        GL     + L+ + 
Sbjct: 410 ILLYLGIST----IIWGFLYGEFFGPAVLLSTLPIGLLFVYHGLDEGLLFSNILILSLLI 465

Query: 160 GIVQLFMGLL 169
           G++ L +GLL
Sbjct: 466 GVLHLSLGLL 475


>gnl|CDD|235456 PRK05425, PRK05425, asparagine synthetase AsnA; Provisional.
          Length = 327

 Score = 29.0 bits (66), Expect = 1.2
 Identities = 13/33 (39%), Positives = 20/33 (60%), Gaps = 5/33 (15%)

Query: 43  EMLKEKARKVF-----TKKMLHKKVPILQWLPE 70
           + LKE   K++     T+K + KK P+L +LPE
Sbjct: 133 DYLKETVEKIYKAIKATEKAVSKKYPLLPFLPE 165


>gnl|CDD|235998 PRK07349, PRK07349, amidophosphoribosyltransferase; Provisional.
          Length = 500

 Score = 28.9 bits (65), Expect = 1.4
 Identities = 17/37 (45%), Positives = 22/37 (59%), Gaps = 3/37 (8%)

Query: 74  ESAV-SDLVAGVTVGLTVIPQAIAYSNVAGLPPQVGL 109
           ES V +DLV GV    + IP AI +S  +G+P   GL
Sbjct: 308 ESPVDADLVIGVPD--SGIPAAIGFSQASGIPYAEGL 342


>gnl|CDD|219662 pfam07954, DUF1689, Protein of unknown function (DUF1689).  Family
           of fungal proteins with unknown function. A member of
           this family has been found to localise in the
           mitochondria.
          Length = 152

 Score = 27.7 bits (62), Expect = 2.3
 Identities = 21/76 (27%), Positives = 28/76 (36%), Gaps = 20/76 (26%)

Query: 93  QAIAY-SNVAGLPPQVGLYSSFMACFVYTIFGSCKDSAIGPTAIMSILTRENLHGLGPQF 151
           Q+IA  +   G       Y  FMA F             GPTA     T        P+ 
Sbjct: 27  QSIARATLWGG-------YGGFMAGF------------GGPTAYYRYKTGAIKGVPVPRI 67

Query: 152 AVFLTFVSGIVQLFMG 167
             FL+F+ G+  L +G
Sbjct: 68  FFFLSFLLGVTALVVG 83


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 28.4 bits (64), Expect = 2.3
 Identities = 10/68 (14%), Positives = 23/68 (33%), Gaps = 5/68 (7%)

Query: 107 VGLYSSFMACFVYTIFGSCKDSAIGPTAIMSILTRENLHGL-----GPQFAVFLTFVSGI 161
           + +Y          + GS    ++        L     +            + ++ + G+
Sbjct: 395 ILIYLGISTIIWGLLTGSFFGFSLPIFLPGGYLELPEGYPSLSTENDVMTILIISLLIGV 454

Query: 162 VQLFMGLL 169
           + LF+GLL
Sbjct: 455 IHLFLGLL 462


>gnl|CDD|237150 PRK12608, PRK12608, transcription termination factor Rho;
           Provisional.
          Length = 380

 Score = 28.1 bits (63), Expect = 2.9
 Identities = 8/25 (32%), Positives = 17/25 (68%), Gaps = 2/25 (8%)

Query: 27  SSLDPPVGPRQRVHVSEMLKEKARK 51
           S+ D P  P + + V+E++ E+A++
Sbjct: 192 STFDRP--PDEHIRVAELVLERAKR 214


>gnl|CDD|220520 pfam10011, DUF2254, Predicted membrane protein (DUF2254).  Members
           of this family of bacterial proteins comprises various
           hypothetical and putative membrane proteins. Their exact
           function, has not, as yet, been defined.
          Length = 369

 Score = 27.5 bits (62), Expect = 4.3
 Identities = 17/101 (16%), Positives = 33/101 (32%), Gaps = 35/101 (34%)

Query: 81  VAGVTVGLTVIPQAIAYSNVAGLPPQVG--LYSSFM-------------ACFVYTIFGSC 125
           V   +  +TV+    A    +    Q    L   FM               FVY++    
Sbjct: 60  VTTFSFSITVV----ALQLASS---QFTPRLLRLFMEDRTTQNVLGTFIGTFVYSL---- 108

Query: 126 KDSAIGPTAIMSILTRENLHGLGPQFAVFLTFVSGIVQLFM 166
                    ++ I  R   +G  P+ +V +T +  ++ +  
Sbjct: 109 ---------LVLITIRGGAYGFVPRLSVLVTILLAVLSVAA 140


>gnl|CDD|238054 cd00106, KISc, Kinesin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In most
           kinesins, the motor domain is found at the N-terminus
           (N-type), in some its is found in the middle (M-type),
           or C-terminal (C-type). N-type and M-type kinesins are
           (+) end-directed motors, while C-type kinesins are (-)
           end-directed motors, i.e. they transport cargo towards
           the (-) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 328

 Score = 26.8 bits (60), Expect = 5.7
 Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 119 YTIFGSCKDSAIGPTAIMSILTR-ENLHGLGPQFAVFLTFV 158
           YT+FGS KD  I P A+  +    +        F+V ++++
Sbjct: 94  YTMFGSPKDPGIIPRALEDLFNLIDERKEKNKSFSVSVSYL 134


>gnl|CDD|219638 pfam07914, DUF1679, Protein of unknown function (DUF1679).  The
           region featured in this family is found in a number of
           C. elegans proteins, in one case as a repeat. In many of
           the family members, this region is associated with the
           CHK region described by SMART as being found in ZnF_C4
           and HLH domain-containing kinases. In fact, one member
           of this family is annotated as being a member of the
           nuclear hormone receptor family, and contains regions
           typical of such proteins (Interpro:IPR000536,
           Interpro:IPR008946, and Interpro:IPR001628).
          Length = 413

 Score = 26.9 bits (60), Expect = 6.2
 Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 1/28 (3%)

Query: 31  PPVGP-RQRVHVSEMLKEKARKVFTKKM 57
           P +GP      +SE  KE+ R+V  +KM
Sbjct: 371 PLIGPFLDSKSMSEEEKEEYREVVIEKM 398


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.138    0.408 

Gapped
Lambda     K      H
   0.267   0.0887    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,870,603
Number of extensions: 804857
Number of successful extensions: 824
Number of sequences better than 10.0: 1
Number of HSP's gapped: 820
Number of HSP's successfully gapped: 36
Length of query: 176
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 86
Effective length of database: 6,945,742
Effective search space: 597333812
Effective search space used: 597333812
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (25.1 bits)