RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1701
         (270 letters)



>gnl|CDD|216528 pfam01490, Aa_trans, Transmembrane amino acid transporter protein. 
           This transmembrane region is found in many amino acid
           transporters including UNC-47 and MTR. UNC-47 encodes a
           vesicular amino butyric acid (GABA) transporter, (VGAT).
           UNC-47 is predicted to have 10 transmembrane domains.
           MTR is a N system amino acid transporter system protein
           involved in methyltryptophan resistance. Other members
           of this family include proline transporters and amino
           acid permeases.
          Length = 406

 Score =  126 bits (319), Expect = 6e-34
 Identities = 65/214 (30%), Positives = 100/214 (46%), Gaps = 7/214 (3%)

Query: 51  FTQFPLFFSTVLFAMEGIGTVLPIENSMKKPGHFLGKFGVLNIAMAVVISFYGMVGYCGY 110
             +  L    ++FA EG   +LPI+N+MK P  F     VL  A+ +V   Y +VG  GY
Sbjct: 192 LARLFLAIGIIVFAFEGHAVLLPIQNTMKSPSKFKAMTKVLLTAIIIVTVLYILVGLVGY 251

Query: 111 LKYGEDTKGSITLNLPK-DPFSESIKILVALSILFTYGLQFGVPSEIV----WERIKHRV 165
           L +G + KG+I LNLPK D   +   +L+ L +L +Y LQ     +IV    + +     
Sbjct: 252 LAFGNNVKGNILLNLPKSDWLIDIANLLLVLHLLLSYPLQAFPIRQIVENLLFRKGASGK 311

Query: 166 PIPRHNMGYYIMRGLMIIGTVLVAAIIPNLGPIISLVGAICFSMLGLFCPAVIDYVTFYD 225
             P+  +   ++R  +++ T L+A  +P LG  +SLVGA   + L    P +        
Sbjct: 312 HNPKSKLLRVVIRSGLVVITYLIAISVPFLGDFLSLVGATSGAPLTFILPPLFHLKLKKT 371

Query: 226 PNKS--WYRPRTIKNFIIILISLGALISGTYSSI 257
             KS        I + I I+I L  +  G    I
Sbjct: 372 KKKSLEKLWKPDILDVICIVIGLLLMAYGVAGLI 405


>gnl|CDD|240313 PTZ00206, PTZ00206, amino acid transporter; Provisional.
          Length = 467

 Score = 41.4 bits (97), Expect = 3e-04
 Identities = 48/215 (22%), Positives = 92/215 (42%), Gaps = 32/215 (14%)

Query: 64  AMEGIGT--------VLPIENSMKKPGHFLGKFGVL-NIAMAVVISFYGMVGYCGYLKYG 114
           A+EG+G         +   E  M      +GKF +   IAM +  + Y +  + GY+ +G
Sbjct: 258 AIEGLGVFIFAYVFQITAYEVYMDMTNRSVGKFVLASTIAMGMCFTMYVLTAFFGYMDFG 317

Query: 115 EDTKGSITLNLPKDPFSES------IKILVALSI---LFTYGLQFGVPSEIVWERIKHRV 165
            +  GS+ L    DP +E       + +LV L +   L     +  +   I W+  K  V
Sbjct: 318 RNVTGSVLLMY--DPVNEPAIMVGFVGVLVKLFVSYALLGMACRNALYDVIGWDARK--V 373

Query: 166 PIPRHNMGYYIMRGLMIIGTVLVAAIIPNLGPIISLVGAICFSMLGLFCPAVIDYVTFYD 225
              +H +    +  +M+    L    IP +  ++   G+I   +LG   PA++     Y 
Sbjct: 374 AFWKHCIAVVTLSVVML----LCGLFIPKINTVLGFAGSISGGLLGFILPALL---FMYS 426

Query: 226 PNKSWYRP---RTIKNFIIILISLGALISGTYSSI 257
              +W +      I  +++++  + A++ GT ++I
Sbjct: 427 GGFTWQKVGPFYYISTYVVLITGVIAIVFGTGATI 461


>gnl|CDD|223884 COG0814, SdaC, Amino acid permeases [Amino acid transport and
           metabolism].
          Length = 415

 Score = 37.2 bits (87), Expect = 0.007
 Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 24/221 (10%)

Query: 56  LFFSTVLFAMEGIGTVLPIENSMKKPGHFLGKFGVLNIAMAVVISFYGMVGYCGYLKYGE 115
           L     +F+    G +  + N M+K      +   + I   + +  Y +VG+  +  +G 
Sbjct: 197 LAIPVFVFSFGFHGNIPSLVNYMRKNSKKAVRK-AILIGSLIALVLYILVGFFVFGCFGS 255

Query: 116 DTKGSITLNLPKDPFSESIKILVALSILFTYGLQ-----------FGVPSEIVWERIKHR 164
              G+I     ++    S    V  S + +  L             GV    ++E +   
Sbjct: 256 LVFGNILAAKEQNISLLSALAGVINSPILSIALNIFALFAIATSFLGVYLG-LFEGLADL 314

Query: 165 VPIPRHNMG--------YYIMRGLMIIGTVLVAAIIPNLGPIISLVGAICFSMLGLFCPA 216
                   G        +       ++     A  +   G +I+ +GA     L    P 
Sbjct: 315 FKKSNSKPGRKKTGLLTFLPPLIFALLYPWGFAIALGYAGGLIATIGAPIIPALLFIKPR 374

Query: 217 VIDYVTFYDPNKSWYRPRTIKNFIIILISLGALISGTYSSI 257
               + +  P    Y    +   +++L  +  ++S   ++ 
Sbjct: 375 ---KLIYKLPALKVYGGNFLLLLLVLLFGILVILSPFLATF 412


>gnl|CDD|215559 PLN03074, PLN03074, auxin influx permease; Provisional.
          Length = 473

 Score = 35.5 bits (82), Expect = 0.028
 Identities = 50/242 (20%), Positives = 87/242 (35%), Gaps = 59/242 (24%)

Query: 56  LFFS---TVLFAMEGIGTVLPIENSMKKPGHF-----LGKFGVLNI----AMAVVISFYG 103
           L+F+    +L+   G    + I ++M KP  F          VL +    A AV  +F  
Sbjct: 234 LYFTGATNILYTFGGHAVTVEIMHAMWKPQKFKYIYLAATLYVLTLTLPSAAAVYWAF-- 291

Query: 104 MVGYCGYLKYGEDTKGSITLN-------LPKDPFSESIKILVALSILFTYGLQFGVPSEI 156
                          G   L        LP+  + ++  IL+ +    T+G     P   
Sbjct: 292 ---------------GDELLTHSNAFSLLPRSGWRDAAVILMLIHQFITFGF-ACTPLYF 335

Query: 157 VWERIKHRVPIPRHNMGYYIMRGLMIIGTVLV----AAIIPNLGPIISLVGAICFSMLGL 212
           VWE+      I  H+     +R L  +  V+     A I P  GPI S VGA+  S    
Sbjct: 336 VWEKA-----IGVHDTKSICLRALARLPVVVPIWFLAIIFPFFGPINSAVGALLVSFTVY 390

Query: 213 FCPAVIDYVTFYDPN-------------KSWYRPRTIKNFIIILISLGALISGTYSSIEE 259
             P++   +T+   +              SW     +  F+++ + +     G ++S+  
Sbjct: 391 IIPSLAHMLTYRSASARQNAAEKPPFFLPSWTGMYVVNAFVVVWVLVVGFGFGGWASMTN 450

Query: 260 II 261
            +
Sbjct: 451 FV 452


>gnl|CDD|222497 pfam14015, DUF4231, Protein of unknown function (DUF4231).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria, archaea,
           eukaryotes and viruses. Proteins in this family are
           typically between 148 and 288 amino acids in length.
          Length = 112

 Score = 33.2 bits (76), Expect = 0.038
 Identities = 23/91 (25%), Positives = 38/91 (41%), Gaps = 6/91 (6%)

Query: 174 YYIMRGLMIIGTVLVAAIIPNLGPIISLVG-----AICFSMLGLFCPAVIDYVTFYDPNK 228
           YY++R L II  VL  A+IP L  +  L G      +  ++L      +   ++ +   +
Sbjct: 18  YYLLRILSIILLVL-GAVIPVLVGLAPLGGGAAYLKLVAAILSALVAILASLLSLFKYQE 76

Query: 229 SWYRPRTIKNFIIILISLGALISGTYSSIEE 259
           +W R R     +          +G YS  EE
Sbjct: 77  NWIRYRATAESLKAEKWRYLARAGPYSDEEE 107


>gnl|CDD|234609 PRK00068, PRK00068, hypothetical protein; Validated.
          Length = 970

 Score = 32.3 bits (74), Expect = 0.32
 Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 6/77 (7%)

Query: 185 TVLVAAIIPNLGPIISLVGAICFSMLGLFCPAVIDYVTFYDPNKSWYRPRT-----IKNF 239
           T LV  I+  + P+  LVG I F  L L   +   +V   D N    R R      ++ F
Sbjct: 55  TKLVTRIVLFI-PVGLLVGGIVFISLWLAYRSRPVFVPKADSNDPVARYRAVVEKRLRLF 113

Query: 240 IIILISLGALISGTYSS 256
           +I + S   L++G ++ 
Sbjct: 114 LIGIPSFIGLLAGIFAQ 130


>gnl|CDD|223481 COG0404, GcvT, Glycine cleavage system T protein
           (aminomethyltransferase) [Amino acid transport and
           metabolism].
          Length = 379

 Score = 30.7 bits (70), Expect = 0.84
 Identities = 23/94 (24%), Positives = 35/94 (37%), Gaps = 12/94 (12%)

Query: 151 GVPSEIVWERIKHRVPIPRHNMGYYIMRGLMIIGTVLVAAIIPNLGPIISL--------- 201
           GV  ++V  ++  + P+ R         G + +G V      P LG  I+L         
Sbjct: 287 GVRRKLVGLKLDDKGPVLRGGEPVLDADGEVEVGEVTSGTFSPTLGKSIALAYVDSDYAK 346

Query: 202 VGA-ICFSMLGLFCPAVIDYVTFYDPNKSWYRPR 234
            G  +   + G   PA +    FYDP     R R
Sbjct: 347 PGTELEVEIRGKRVPARVVKPPFYDPEG--ERLR 378


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 19/84 (22%), Positives = 29/84 (34%), Gaps = 3/84 (3%)

Query: 180 LMIIGTVLVAAIIPNLGPIISLVGAICFSMLGLFCPAVIDYVTFYDPNKSWYRPRTIKNF 239
           L+  G  L+      LG +IS +  I    LGL   A+              RP+ ++  
Sbjct: 18  LIASGIALLVLFGAGLGALISTLLGIALLALGLVLLALGLLCLLLKAPVQSVRPQKLEEL 77

Query: 240 IIILISLGALISGTYSSIEEIIAF 263
              L  L   +S      EE+   
Sbjct: 78  QGELSELKQQLS---ELQEELEDL 98


>gnl|CDD|217327 pfam03030, H_PPase, Inorganic H+ pyrophosphatase.  The H+
           pyrophosphatase is an transmembrane proton pump involved
           in establishing the H+ electrochemical potential
           difference between the vacuole lumen and the cell
           cytosol. Vacuolar-type H(+)-translocating inorganic
           pyrophosphatases have long been considered to be
           restricted to plants and to a few species of
           photo-trophic bacteria. However, in recent
           investigations, these pyrophosphatases have been found
           in organisms as disparate as thermophilic Archaea and
           parasitic protists.
          Length = 669

 Score = 29.7 bits (68), Expect = 1.6
 Identities = 13/55 (23%), Positives = 20/55 (36%), Gaps = 1/55 (1%)

Query: 171 NMGYYIMRGLMIIGTVLVAAI-IPNLGPIISLVGAICFSMLGLFCPAVIDYVTFY 224
           N G  +   L I+ T       +P  G            ++GL   A+I  +T Y
Sbjct: 284 NRGLIVSAILSIVATYFATYWLLPAAGKGFLWWNLFLAVLIGLVAGALIGLITEY 338


>gnl|CDD|233438 TIGR01494, ATPase_P-type, ATPase, P-type (transporting), HAD
           superfamily, subfamily IC.  The P-type ATPases are a
           large family of trans-membrane transporters acting on
           charged substances. The distinguishing feature of the
           family is the formation of a phosphorylated intermediate
           (aspartyl-phosphate) during the course of the reaction.
           Another common name for these enzymes is the E1-E2
           ATPases based on the two isolable conformations: E1
           (unphosphorylated) and E2 (phosphorylated). Generally,
           P-type ATPases consist of only a single subunit
           encompassing the ATPase and ion translocation pathway,
           however, in the case of the potassium (TIGR01497) and
           sodium/potassium (TIGR01106) varieties, these functions
           are split between two subunits. Additional small
           regulatory or stabilizing subunits may also exist in
           some forms. P-type ATPases are nearly ubiquitous in life
           and are found in numerous copies in higher organisms (at
           least 45 in Arabidopsis thaliana, for instance ).
           Phylogenetic analyses have revealed that the P-type
           ATPase subfamily is divided up into groups based on
           substrate specificities and this is represented in the
           various subfamily and equivalog models that have been
           made: IA (K+) TIGR01497, IB (heavy metals) TIGR01525,
           IIA1 (SERCA-type Ca++) TIGR01116, IIA2 (PMR1-type Ca++)
           TIGR01522, IIB (PMCA-type Ca++) TIGR01517, IIC (Na+/K+,
           H+/K+ antiporters) TIGR01106, IID (fungal-type Na+ and
           K+) TIGR01523, IIIA (H+) TIGR01647, IIIB (Mg++)
           TIGR01524, IV (phospholipid, flippase) TIGR01652 and V
           (unknown specificity) TIGR01657. The crystal structure
           of one calcium-pumping ATPase and an analysis of the
           fold of the catalytic domain of the P-type ATPases have
           been published. These reveal that the catalytic core of
           these enzymes is a haloacid dehalogenase(HAD)-type
           aspartate-nucleophile hydrolase. The location of the
           ATP-binding loop in between the first and second HAD
           conserved catalytic motifs defines these enzymes as
           members of subfamily I of the HAD superfamily (see also
           TIGR01493, TIGR01509, TIGR01549, TIGR01544 and
           TIGR01545). Based on these classifications, the P-type
           ATPase _superfamily_ corresponds to the IC subfamily of
           the HAD superfamily.
          Length = 543

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 9/39 (23%), Positives = 14/39 (35%)

Query: 180 LMIIGTVLVAAIIPNLGPIISLVGAICFSMLGLFCPAVI 218
              I + +  AI  NL  I   +  I   +L     A+ 
Sbjct: 504 FSNIKSNIFWAIAYNLILIPLALLLIVIILLLPMLAALA 542


>gnl|CDD|222939 PHA02878, PHA02878, ankyrin repeat protein; Provisional.
          Length = 477

 Score = 28.7 bits (64), Expect = 3.9
 Identities = 10/29 (34%), Positives = 16/29 (55%), Gaps = 1/29 (3%)

Query: 217 VIDYVTFYDPNKSWYR-PRTIKNFIIILI 244
            +D + F   N  W R P  IK++I+ L+
Sbjct: 434 SLDNIFFEKQNYMWNRLPLEIKHYIMELL 462


>gnl|CDD|200395 TIGR04144, archaeo_VPXXXP, archaeosortase B, VPXXXP-CTERM-specific.
            Members of this protein family are found so far in
           Methanohalophilus mahii DSM 5219 and Methanohalobium
           evestigatum Z-7303, along with five and nine proteins,
           respectively, with the VPXXXP-CTERM protein sorting
           signal (TIGR04143). In these species, this boutique
           system represents a second
           exosortase/archaeosortase-type system.
          Length = 156

 Score = 27.6 bits (61), Expect = 5.0
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 199 ISLVGAICFSMLGLFCPAVIDYVTFY 224
           I++V  +C + +GL+ PA+ DYV +Y
Sbjct: 107 INMVRLVCLAFVGLWFPAIFDYVHYY 132


>gnl|CDD|236535 PRK09477, napH, quinol dehydrogenase membrane component;
           Provisional.
          Length = 271

 Score = 27.9 bits (63), Expect = 5.7
 Identities = 12/42 (28%), Positives = 26/42 (61%), Gaps = 7/42 (16%)

Query: 227 NKSWYRPRTIKNFIIILISLGALISGTYSSIEEIIAFYAINP 268
           N+S   PR ++ ++++L+ +G+ ++GT       +A+  INP
Sbjct: 112 NQSATLPRNLRYWLLVLVLVGSALTGT-------LAWEWINP 146


>gnl|CDD|235359 PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha;
           Provisional.
          Length = 310

 Score = 27.4 bits (62), Expect = 8.7
 Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 3/28 (10%)

Query: 117 TKGSITLNLPKDPFSESIKILVALSILF 144
           T GSIT   P++  + + KILV    +F
Sbjct: 199 TDGSIT---PEEALALAAKILVEQLSVF 223


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.328    0.144    0.450 

Gapped
Lambda     K      H
   0.267   0.0760    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,364,947
Number of extensions: 1424866
Number of successful extensions: 2158
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2136
Number of HSP's successfully gapped: 120
Length of query: 270
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 175
Effective length of database: 6,723,972
Effective search space: 1176695100
Effective search space used: 1176695100
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 58 (26.1 bits)