RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8160
         (324 letters)



>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
           [Carbohydrate transport and metabolism].
          Length = 389

 Score = 43.5 bits (103), Expect = 8e-05
 Identities = 49/208 (23%), Positives = 89/208 (42%), Gaps = 28/208 (13%)

Query: 32  KAIKTFQQLLYVDPSYLRANEVHLRLGLMFKVNNEYDSALK-HLTLCLIDASPCTFTKLE 90
           KA+  F ++L  DP      E HL LG +F+   E D A++ H TL  +++   TF +  
Sbjct: 53  KAVDLFLEMLQEDPETF---EAHLTLGNLFRSRGEVDRAIRIHQTL--LESPDLTFEQRL 107

Query: 91  GLMFKVNNEYDSALKHLTLCLIDASPCTFTKL--------EVRFHIAHLHEVQRKYKTAK 142
             + ++  +Y +A       L+D +   F +L             + ++++  R+++ A 
Sbjct: 108 LALQQLGRDYMAA------GLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAI 161

Query: 143 DSYEQLLKEDDLPVHLK-ADICRQLGWMYHCIDTLGEKSHRETLAIHCLQKSIEADPKSG 201
           D  E+L+K       ++ A    +L         L         A   L+K+++AD K  
Sbjct: 162 DVAERLVKLGGQTYRVEIAQFYCELAQQA-----LASSDVDR--ARELLKKALQADKKCV 214

Query: 202 QSLYLLGRCFAAVGKVHDAFLAYRNSVE 229
           ++  +LGR   A G    A  A    +E
Sbjct: 215 RASIILGRVELAKGDYQKAVEALERVLE 242


>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
           modification, protein turnover, chaperones].
          Length = 287

 Score = 40.4 bits (95), Expect = 6e-04
 Identities = 16/53 (30%), Positives = 29/53 (54%)

Query: 190 LQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADTWCSIG 242
           L+  ++ +P   +   LLGR + A+G+  DA LAYRN++  +  N +    + 
Sbjct: 145 LETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLA 197


>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 37.0 bits (86), Expect = 0.002
 Identities = 15/58 (25%), Positives = 29/58 (50%)

Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADTWCSIGN 243
           A+   +K++E DP +  + Y L   +  +GK  +A   Y  ++E    NA  + ++G 
Sbjct: 19  ALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGL 76



 Score = 34.7 bits (80), Expect = 0.011
 Identities = 26/104 (25%), Positives = 47/104 (45%), Gaps = 11/104 (10%)

Query: 126 FHIAHLHEVQRKYKTAKDSYEQLLKEDDLPVHLKADICRQLGWMYHCIDTLGEKSHRETL 185
            ++ +L+     Y  A + YE+ L+ D       AD    L   Y+       K  +   
Sbjct: 4   LNLGNLYYKLGDYDEALEYYEKALELDPDN----ADAYYNLAAAYY-------KLGKYEE 52

Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVE 229
           A+   +K++E DP + ++ Y LG  +  +GK  +A  AY  ++E
Sbjct: 53  ALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96



 Score = 33.9 bits (78), Expect = 0.019
 Identities = 16/81 (19%), Positives = 34/81 (41%), Gaps = 4/81 (4%)

Query: 32  KAIKTFQQLLYVDPSYLRANEVHLRLGLMFKVNNEYDSALKHLTLCL-IDASPCTFTKLE 90
           +A++ +++ L +DP      + +  L   +    +Y+ AL+     L +D          
Sbjct: 18  EALEYYEKALELDPDN---ADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNL 74

Query: 91  GLMFKVNNEYDSALKHLTLCL 111
           GL +    +Y+ AL+     L
Sbjct: 75  GLAYYKLGKYEEALEAYEKAL 95



 Score = 32.4 bits (74), Expect = 0.073
 Identities = 20/102 (19%), Positives = 42/102 (41%), Gaps = 9/102 (8%)

Query: 52  EVHLRLGLMFKVNNEYDSALKHLTLCLIDASPC---TFTKLEGLMFKVNNEYDSALKHLT 108
           E  L LG ++    +YD AL++     ++  P     +  L    +    +Y+ AL+   
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYE-KALELDPDNADAYYNL-AAAYYKLGKYEEALEDYE 58

Query: 109 LCLIDASPCTFTKLEVRFHIAHLHEVQRKYKTAKDSYEQLLK 150
             L           +  +++   +    KY+ A ++YE+ L+
Sbjct: 59  KALELDP----DNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96



 Score = 29.3 bits (66), Expect = 0.80
 Identities = 26/112 (23%), Positives = 48/112 (42%), Gaps = 15/112 (13%)

Query: 88  KLEGLMFKVNNEYDSALKHLTLCLIDASPCTFTKLEVRFHIAHLHEVQRKYKTAKDSYEQ 147
              G ++    +YD AL++     ++  P      +  +++A  +    KY+ A + YE+
Sbjct: 4   LNLGNLYYKLGDYDEALEYYE-KALELDPDNA---DAYYNLAAAYYKLGKYEEALEDYEK 59

Query: 148 LLKEDDLPVHLKADICRQLGWMYHCIDTLGEKSHRETLAIHCLQKSIEADPK 199
            L+ D  P + KA     LG  Y        K  +   A+   +K++E DP 
Sbjct: 60  ALELD--PDNAKA--YYNLGLAY-------YKLGKYEEALEAYEKALELDPN 100


>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
          Length = 389

 Score = 35.9 bits (84), Expect = 0.019
 Identities = 46/211 (21%), Positives = 80/211 (37%), Gaps = 40/211 (18%)

Query: 32  KAIKTFQQLLYVDPSYLRANEVHLRLGLMFKVNNEYDSALK-HLTLCLIDASPC-TFTKL 89
           KAI  F ++L VDP  +   E+HL LG +F+   E D A++ H  L    + P  T  + 
Sbjct: 53  KAIDLFIEMLKVDPETV---ELHLALGNLFRRRGEVDRAIRIHQNL---LSRPDLTREQR 106

Query: 90  E-------------GLMFKVNNEYDSALKHLTLCLIDASPCTFTKLEVRFHIAHLHEVQR 136
                         GL        D A +     L+D              +  +++ ++
Sbjct: 107 LLALQELGQDYLKAGL-------LDRAEELFLQ-LVDEGD---FAEGALQQLLEIYQQEK 155

Query: 137 KYKTAKDSYEQLLKEDDLPVHLKADICRQLGWMYHCIDTLGEKSHRET-LAIHCLQKSIE 195
            ++ A D  E+L K     +        ++   ++C       +  +   A   L+K++ 
Sbjct: 156 DWQKAIDVAERLEKLGGDSLR------VEIA-HFYCELAQQALARGDLDAARALLKKALA 208

Query: 196 ADPKSGQSLYLLGRCFAAVGKVHDAFLAYRN 226
           ADP+  ++  LLG    A G    A  A   
Sbjct: 209 ADPQCVRASILLGDLALAQGDYAAAIEALER 239


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 32.2 bits (74), Expect = 0.022
 Identities = 8/32 (25%), Positives = 17/32 (53%)

Query: 189 CLQKSIEADPKSGQSLYLLGRCFAAVGKVHDA 220
             +K++E DP + ++ Y L      +G+  +A
Sbjct: 1   LYEKALELDPNNAEAYYNLALLLLNLGQYDEA 32


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 32.3 bits (74), Expect = 0.039
 Identities = 11/39 (28%), Positives = 21/39 (53%)

Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAY 224
           AI   +K++E DP + ++ Y L   +  +GK ++  L  
Sbjct: 22  AIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60



 Score = 26.5 bits (59), Expect = 4.6
 Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 3/62 (4%)

Query: 202 QSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADTWCSIGNKDFSNNSNREDYHQALNK 261
           ++L  LG     +G   +A  AY  ++E    NA+ + ++     +     +DY +AL  
Sbjct: 4   EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLAL---AYLKLGKDYEEALED 60

Query: 262 YR 263
             
Sbjct: 61  LE 62


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 30.7 bits (70), Expect = 0.12
 Identities = 10/50 (20%), Positives = 23/50 (46%)

Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNA 235
           A+  L+ ++   P + ++L LLG      G++ +A    R ++     + 
Sbjct: 16  ALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPDDP 65



 Score = 26.5 bits (59), Expect = 3.7
 Identities = 8/45 (17%), Positives = 12/45 (26%), Gaps = 3/45 (6%)

Query: 126 FHIAHLHEVQRKYKTAKDSYEQLLKEDDLPV---HLKADICRQLG 167
             +A        Y  A  + E  L    L      L  +   + G
Sbjct: 1   LALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQG 45


>gnl|CDD|118184 pfam09652, Cas_VVA1548, Putative CRISPR-associated protein
           (Cas_VVA1548).  This entry represents a conserved region
           of about 95 amino acids found exclusively in species
           with CRISPRs (Clustered Regularly Interspaced Short
           Palindromic Repeats). In all bacterial species that
           contain this entry, the genes encoding the proteins are
           in the midst of a cluster of cas (CRISPR-associated)
           genes.
          Length = 93

 Score = 29.3 bits (66), Expect = 0.70
 Identities = 15/54 (27%), Positives = 21/54 (38%), Gaps = 16/54 (29%)

Query: 122 LEVRFHIAHLHEVQRKYKTAKDSYEQLLKED----DLPVHLKADICRQLGWMYH 171
           L+V   + HL               Q+   D     LPVHL AD+C +    +H
Sbjct: 18  LDVDRFVDHLD------------PAQVSAGDKVIGTLPVHLAADVCEKGARYFH 59


>gnl|CDD|187874 cd09743, Csx16_III-U, CRISPR/Cas system-associated protein Csx16.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Small protein often seen in proximity to Csx1 (COG1517)
           family; also known as VVA1548 family.
          Length = 90

 Score = 29.3 bits (66), Expect = 0.71
 Identities = 16/54 (29%), Positives = 22/54 (40%), Gaps = 16/54 (29%)

Query: 122 LEVRFHIAHLHEVQRKYKTAKDSYEQLLKED----DLPVHLKADICRQLGWMYH 171
           L+V   + HL              EQ+   D     LPVHL AD+C +    +H
Sbjct: 18  LDVDRFVIHL------------DPEQISAGDKVIGTLPVHLAADVCEKGARYFH 59


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 31.0 bits (68), Expect = 0.79
 Identities = 28/123 (22%), Positives = 54/123 (43%), Gaps = 6/123 (4%)

Query: 32  KAIKTFQQLLYVDPSYLRANEVHLRLGLMFKVNNEYDSALKHLT--LCLIDASPCTFTKL 89
           +A++ +++ L +DP      E  L LG + +    Y+ AL+ L   L L           
Sbjct: 148 EALELYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLN 207

Query: 90  EGLMFKVNNEYDSALKHLTLCLIDASPCTFTKLEVRFHIAHLHEVQRKYKTAKDSYEQLL 149
            GL++    +Y+ AL++    L           E  +++A L     +Y+ A ++ E+ L
Sbjct: 208 LGLLYLKLGKYEEALEYYEKALELDPD----NAEALYNLALLLLELGRYEEALEALEKAL 263

Query: 150 KED 152
           + D
Sbjct: 264 ELD 266


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 30.8 bits (70), Expect = 1.1
 Identities = 24/116 (20%), Positives = 41/116 (35%), Gaps = 16/116 (13%)

Query: 124 VRFHIAHLHEVQRKYKTAKDSYEQLLK--EDDLPVHLKADICRQLGWMYHCIDTLGEKSH 181
           +R  +A L+  Q+ Y  A   Y+ ++K   D+  V         L W+Y       E   
Sbjct: 772 LRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVL------NNLAWLYL------ELKD 819

Query: 182 RETLAIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADT 237
              L     +++++  P     L  LG      G+   A    R +V  +   A  
Sbjct: 820 PRALEY--AERALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAI 873



 Score = 28.5 bits (64), Expect = 5.6
 Identities = 16/80 (20%), Positives = 30/80 (37%), Gaps = 4/80 (5%)

Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADTWCSIGNKD 245
           A+   +K +   P S  +L LL   +A +     A  + + ++E    N +    IG   
Sbjct: 620 AVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEA--QIGLAQ 677

Query: 246 FSNNSNREDYHQALNKYRDL 265
               + R +   A    + L
Sbjct: 678 LLLAAKRTE--SAKKIAKSL 695


>gnl|CDD|240089 cd04738, DHOD_2_like, Dihydroorotate dehydrogenase (DHOD) class 2.
           DHOD catalyzes the oxidation of (S)-dihydroorotate to
           orotate. This is the fourth step and the only redox
           reaction in the de novo biosynthesis of UMP, the
           precursor of all pyrimidine nucleotides. DHOD requires
           FMN as co-factor. DHOD divides into class 1 and class 2
           based on their amino acid sequences, their cellular
           location and their natural electron acceptor used to
           reoxidize the flavin group. Members of class 1 are
           cytosolic enzymes and multimers, while class 2 enzymes
           are membrane associated, monomeric and use respiratory
           quinones as their physiological electron acceptors.
          Length = 327

 Score = 29.8 bits (68), Expect = 1.8
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 240 SIG-NKDFSNNSNR-EDYHQALNKYRDLGDFLVIN 272
           +IG NKD +   +  EDY   + K     D+LV+N
Sbjct: 134 NIGKNKD-TPLEDAVEDYVIGVRKLGPYADYLVVN 167


>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
           Found in the DNRI/REDD/AFSR family of regulators. This
           region of AFSR along with the C terminal region is
           capable of independently directing actinorhodin
           production. This family contains TPR repeats.
          Length = 145

 Score = 28.8 bits (65), Expect = 2.2
 Identities = 13/44 (29%), Positives = 20/44 (45%)

Query: 182 RETLAIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYR 225
           R   A+  L++ +  DP   +   LL R     G+  +A  AYR
Sbjct: 76  RHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAYR 119


>gnl|CDD|215560 PLN03076, PLN03076, ARF guanine nucleotide exchange factor
           (ARF-GEF); Provisional.
          Length = 1780

 Score = 28.6 bits (64), Expect = 5.8
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 7/41 (17%)

Query: 186 AIHCLQKSI-------EADPKSGQSLYLLGRCFAAVGKVHD 219
           A+ C+QK I       EADP  G    LL +   +V K HD
Sbjct: 106 ALDCIQKLIAHGYLRGEADPSGGPEALLLAKLIESVCKCHD 146


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.402 

Gapped
Lambda     K      H
   0.267   0.0733    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,691,002
Number of extensions: 1441575
Number of successful extensions: 1333
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1325
Number of HSP's successfully gapped: 43
Length of query: 324
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 227
Effective length of database: 6,635,264
Effective search space: 1506204928
Effective search space used: 1506204928
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.5 bits)