RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9266
         (157 letters)



>gnl|CDD|238106 cd00182, TBOX, T-box DNA binding domain of the T-box family of
           transcriptional regulators. The T-box family is an
           ancient group that appears to play a critical role in
           development in all animal species. These genes were
           uncovered on the basis of similarity to the DNA binding
           domain of murine Brachyury (T) gene product, the
           defining feature of the family.  Common features shared
           by T-box family members are DNA-binding and
           transcriptional regulatory activity, a role in
           development and conserved expression patterns, most of
           the known genes in all species being expressed in
           mesoderm or mesoderm precursors.
          Length = 188

 Score =  121 bits (306), Expect = 1e-35
 Identities = 51/120 (42%), Positives = 72/120 (60%), Gaps = 11/120 (9%)

Query: 11  NVEVTLVNADLWQNFHAIGTEMIITKTGRTKISTRGRMFPSLQLQVSGLDQAAHYSLLVE 70
            ++V+L N +LW+ FH +GTEMIITK+GR       RMFP+L+++VSGLD  A YS+L++
Sbjct: 1   GIKVSLENRELWKKFHELGTEMIITKSGR-------RMFPTLKVKVSGLDPNALYSVLMD 53

Query: 71  FHLASPHKYKY-KTSWEPVGDAERPIAHPAARLYSHPAGHALGSFWTGQTLHLNKIKLAN 129
                 H+YK+    W   G AE    H   R+Y HP   A G+ W  Q +  +K+KL N
Sbjct: 54  LVPVDDHRYKFSGGKWVVAGKAE---PHLPPRVYVHPDSPATGAHWMKQPVSFDKLKLTN 110


>gnl|CDD|216184 pfam00907, T-box, T-box.  The T-box encodes a 180 amino acid domain
           that binds to DNA. Genes encoding T-box proteins are
           found in a wide range of animals, but not in other
           kingdoms such as plants. Family members are all thought
           to bind to the DNA consensus sequence TCACACCT. they are
           found exclusively in the nucleus, and perform
           DNA-binding and transcriptional activation/repression
           roles. They are generally required for development of
           the specific tissues they are expressed in, and
           mutations in T-box genes are implicated in human
           conditions such as DiGeorge syndrome and X-linked cleft
           palate, which feature malformations.
          Length = 182

 Score =  115 bits (291), Expect = 2e-33
 Identities = 52/118 (44%), Positives = 70/118 (59%), Gaps = 11/118 (9%)

Query: 13  EVTLVNADLWQNFHAIGTEMIITKTGRTKISTRGRMFPSLQLQVSGLDQAAHYSLLVEFH 72
            V+L + +LW  FH +GTEMIITK+GR       RMFP+L++ VSGLD  A YS+L++F 
Sbjct: 1   TVSLEDRELWDKFHELGTEMIITKSGR-------RMFPTLKVSVSGLDPNALYSVLLDFV 53

Query: 73  LASPHKYKY-KTSWEPVGDAERPIAHPAARLYSHPAGHALGSFWTGQTLHLNKIKLAN 129
               H+YK+    W   G AE    HP  R+Y HP   A G+ W  Q +  +K+KL N
Sbjct: 54  PVDDHRYKFHNGEWVVAGKAEP---HPPPRVYIHPDSPATGAHWMKQPVSFDKVKLTN 108


>gnl|CDD|214656 smart00425, TBOX, Domain first found in the mice T locus
           (Brachyury) protein. 
          Length = 190

 Score =  112 bits (282), Expect = 5e-32
 Identities = 48/119 (40%), Positives = 71/119 (59%), Gaps = 11/119 (9%)

Query: 12  VEVTLVNADLWQNFHAIGTEMIITKTGRTKISTRGRMFPSLQLQVSGLDQAAHYSLLVEF 71
           ++V+L + +LW+ FH +GTEMI+TK+GR       RMFP+L+ +VSGLD  A YS+L++ 
Sbjct: 1   IKVSLEDKELWRKFHELGTEMIVTKSGR-------RMFPTLKYKVSGLDPNALYSVLMDL 53

Query: 72  HLASPHKYKY-KTSWEPVGDAERPIAHPAARLYSHPAGHALGSFWTGQTLHLNKIKLAN 129
                 +YK+    W   G AE    H  +R+Y HP   A G+ W  Q +  +K+KL N
Sbjct: 54  VPVDDKRYKFNNGKWVVAGKAEP---HMPSRVYVHPDSPATGAHWMKQPVSFDKVKLTN 109


>gnl|CDD|215563 PLN03081, PLN03081, pentatricopeptide (PPR) repeat-containing
           protein; Provisional.
          Length = 697

 Score = 31.0 bits (70), Expect = 0.24
 Identities = 23/74 (31%), Positives = 31/74 (41%), Gaps = 9/74 (12%)

Query: 81  YKTSWEPVGDAERPIAHPAARLYSHPAGHALGSFWTGQTLHLNKIKLANAFVEGTRFLEV 140
           ++  WE   DAE        R     A   LGS   GQ LH   +K     V G  F+  
Sbjct: 212 FREMWEDGSDAEPRTFVVMLR-----ASAGLGSARAGQQLHCCVLKTG---VVGDTFVSC 263

Query: 141 RHLLLIYPQCGQIQ 154
             L+ +Y +CG I+
Sbjct: 264 A-LIDMYSKCGDIE 276


>gnl|CDD|239487 cd03393, PAP2_like_3, PAP2_like_3 proteins. PAP2 is a super-family
           of phosphatases and haloperoxidases. This subgroup,
           which is specific to bacteria and archaea, lacks
           functional characterization and may act as a
           membrane-associated lipid phosphatase.
          Length = 125

 Score = 28.5 bits (64), Expect = 0.72
 Identities = 16/49 (32%), Positives = 21/49 (42%), Gaps = 2/49 (4%)

Query: 78  KYKYKTSWEPVGDAERPIAHPAARLYSHPAGHALGS--FWTGQTLHLNK 124
           K  +K       D  + I   +A  Y  P+GHA  S  FW    LH+ K
Sbjct: 33  KEVFKIPRPFTYDGIQAIYEESAGGYGFPSGHAQTSATFWGSLMLHVRK 81


>gnl|CDD|238716 cd01436, Dipth_tox_like, Mono-ADP-ribosylating toxins catalyze the
           transfer of ADP_ribose from NAD+ to eukaryotic
           Elongation Factor 2, halting protein synthesis. A single
           molecule of delivered toxin is sufficient to kill a
           cell.  These toxins share mono-ADP-ribosylating activity
           with a variety of bacterial toxins, such as cholera
           toxin and pertussis toxin.   The structural core is
           homologous to the poly-ADP ribosylating enzymes such as
           the PARP enzymes and Tankyrase. Diphtheria toxin is
           encoded by a lysogenic bacteriophage. Both diphtheria
           toxin and Pseudomonas aeruginosa exotoxin A are
           multi-domain proteins. These domains provide a EF2
           ADP_ribosylating, receptor-binding, and intracellular
           trafficking/transmembrane functions .
          Length = 147

 Score = 28.0 bits (61), Expect = 1.5
 Identities = 2/45 (4%), Positives = 7/45 (15%)

Query: 81  YKTSWEPVGDAERPIAHPAARLYSHPAGHALGSFWTGQTLHLNKI 125
                +     ++ +            G        G       +
Sbjct: 75  LALKVDNAETIKKELGLSLTEPLMEQVGTEEFIKRFGDGASRVVL 119


>gnl|CDD|79035 PRK12559, PRK12559, transcriptional regulator Spx; Provisional.
          Length = 131

 Score = 27.0 bits (59), Expect = 3.5
 Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 34 ITKTGRTKI-STRGRMFPSLQLQVSGLDQAAHYSLLVEFHL 73
          +T+ G T+I STR + F  L + +  L     Y L++E  L
Sbjct: 48 LTEEGATEIISTRSKTFQDLNINIEELSLNEFYKLIIEHPL 88


>gnl|CDD|183547 PRK12475, PRK12475, thiamine/molybdopterin biosynthesis MoeB-like
           protein; Provisional.
          Length = 338

 Score = 27.0 bits (60), Expect = 4.7
 Identities = 17/58 (29%), Positives = 26/58 (44%), Gaps = 12/58 (20%)

Query: 1   MKIYADADFSNVEVTLVNADLWQNFHAIGTEMIITKTGRTKIST-----RGRMFPSLQ 53
           +KI  + DF  +  T ++ D+W N +     M I K  + K  T       R +PSL 
Sbjct: 203 LKILVE-DFEALRETFLSFDIWNNQN-----MSI-KVNKQKKDTCPSCGLTRTYPSLT 253


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 27.1 bits (60), Expect = 4.7
 Identities = 17/59 (28%), Positives = 18/59 (30%), Gaps = 9/59 (15%)

Query: 53  QLQVSGLDQAAHYSLLVEFHLASPHKYKYKTS-WEPVGDAERPIAHPAARLYSHPAGHA 110
           Q Q  G    A Y        ASP  Y  +        DA RP      R Y HP    
Sbjct: 168 QQQRLGFPPRAPY--------ASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDW 218


>gnl|CDD|237584 PRK14014, PRK14014, putative acyltransferase; Provisional.
          Length = 301

 Score = 26.4 bits (59), Expect = 6.3
 Identities = 8/12 (66%), Positives = 9/12 (75%)

Query: 131 FVEGTRFLEVRH 142
           FVEGTRF   +H
Sbjct: 180 FVEGTRFTPEKH 191


>gnl|CDD|117729 pfam09175, DUF1944, Domain of unknown function (DUF1944).
          Members of this family adopt a structure consisting of
          several large open beta-sheets. Their exact function
          has not, as yet, been determined.
          Length = 165

 Score = 26.1 bits (58), Expect = 7.1
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 52 LQLQVSGLDQAAHYSLLVEFHLASPHKYKYKTSW 85
          +QL V+ L +  ++ +  +  L S HK   K +W
Sbjct: 32 VQLIVANLAENDNWKICADAVLLSKHKVMAKIAW 65


>gnl|CDD|224193 COG1274, PckA, Phosphoenolpyruvate carboxykinase (GTP) [Energy
           production and conversion].
          Length = 608

 Score = 26.6 bits (59), Expect = 7.8
 Identities = 7/25 (28%), Positives = 11/25 (44%)

Query: 77  HKYKYKTSWEPVGDAERPIAHPAAR 101
               +K +    G+   P AHP +R
Sbjct: 367 ELIDWKGNDWRPGETGEPAAHPNSR 391


>gnl|CDD|144961 pfam01561, Hanta_G2, Hantavirus glycoprotein G2.  The medium (M)
           genome segment of hantaviruses (family Bunyaviridae)
           encodes the two virion glycoproteins. G1 and G2, as a
           precursor protein in the complementary sense RNA.
          Length = 486

 Score = 26.3 bits (58), Expect = 9.4
 Identities = 15/61 (24%), Positives = 22/61 (36%), Gaps = 3/61 (4%)

Query: 64  HYSLLVEFHLASPHKYKYKTSWE-PVGDAER-PIAHPAARLYSHPAGHALGSFWTGQTLH 121
              L ++F L S  KY Y+     P  + E+ P      +   H     LG  W     +
Sbjct: 21  KTDLELDFSLPSSSKYTYRRKLTNPANEQEKIPFHLQIEKQVIHAEIQHLGH-WMDARFN 79

Query: 122 L 122
           L
Sbjct: 80  L 80


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.135    0.413 

Gapped
Lambda     K      H
   0.267   0.0751    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,875,025
Number of extensions: 682759
Number of successful extensions: 447
Number of sequences better than 10.0: 1
Number of HSP's gapped: 441
Number of HSP's successfully gapped: 18
Length of query: 157
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 68
Effective length of database: 6,990,096
Effective search space: 475326528
Effective search space used: 475326528
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 55 (24.9 bits)