RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15918
         (376 letters)



>gnl|CDD|189470 pfam00250, Fork_head, Fork head domain. 
          Length = 96

 Score =  192 bits (491), Expect = 1e-61
 Identities = 67/96 (69%), Positives = 77/96 (80%)

Query: 117 KPPYSYNALIMMAIRQSPEKRLTLNGIYEYIMTNFPYYKTNKQGWQNSIRHNLSLNKCFV 176
           KPPYSY ALI MAI+QSPEK LTL+ IY++IM  FPYY+ NKQGWQNSIRHNLSLNKCF+
Sbjct: 1   KPPYSYIALITMAIQQSPEKMLTLSEIYQWIMDLFPYYRQNKQGWQNSIRHNLSLNKCFI 60

Query: 177 KVPRHYDDPGKGNYWMLDASSEDVFIGGTTGKLRRR 212
           KVPR  D PGKG+YW LD  SE++F  G   K R+R
Sbjct: 61  KVPRSPDKPGKGSYWTLDPESENMFENGKYLKRRKR 96


>gnl|CDD|238016 cd00059, FH, Forkhead (FH), also known as a "winged helix".  FH is
           named for the Drosophila fork head protein, a
           transcription factor which promotes terminal rather than
           segmental development. This family of transcription
           factor domains, which bind to B-DNA as monomers, are
           also found in the Hepatocyte nuclear factor (HNF)
           proteins, which provide tissue-specific gene regulation.
           The structure contains 2 flexible loops or "wings" in
           the C-terminal region, hence the term winged helix.
          Length = 78

 Score =  171 bits (437), Expect = 8e-54
 Identities = 57/78 (73%), Positives = 66/78 (84%)

Query: 117 KPPYSYNALIMMAIRQSPEKRLTLNGIYEYIMTNFPYYKTNKQGWQNSIRHNLSLNKCFV 176
           KPPYSY+ALI MAI+ SPEKRLTL+ IY++I  NFPY++    GWQNSIRHNLSLNKCFV
Sbjct: 1   KPPYSYSALIAMAIQSSPEKRLTLSEIYKWISDNFPYFRDAPAGWQNSIRHNLSLNKCFV 60

Query: 177 KVPRHYDDPGKGNYWMLD 194
           KVPR  D+PGKG+YW LD
Sbjct: 61  KVPREPDEPGKGSYWTLD 78


>gnl|CDD|214627 smart00339, FH, FORKHEAD.  FORKHEAD, also known as a "winged
           helix".
          Length = 89

 Score =  167 bits (426), Expect = 4e-52
 Identities = 60/89 (67%), Positives = 71/89 (79%)

Query: 117 KPPYSYNALIMMAIRQSPEKRLTLNGIYEYIMTNFPYYKTNKQGWQNSIRHNLSLNKCFV 176
           KPPYSY ALI MAI  SP+KRLTL+ IY++I  NFPYY+ N+ GWQNSIRHNLSLN CFV
Sbjct: 1   KPPYSYIALIAMAILSSPDKRLTLSEIYKWIEDNFPYYRENRAGWQNSIRHNLSLNDCFV 60

Query: 177 KVPRHYDDPGKGNYWMLDASSEDVFIGGT 205
           KVPR  D PGKG+YW LD ++E++F  G 
Sbjct: 61  KVPREGDRPGKGSYWTLDPAAENMFENGN 89


>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
           [Transcription].
          Length = 610

 Score = 89.1 bits (221), Expect = 3e-19
 Identities = 50/140 (35%), Positives = 64/140 (45%), Gaps = 4/140 (2%)

Query: 84  DCSNKKSSEDNNASSEESKANEEGKPKECKKDEKPPYSYNALIMMAIRQSPEKRLTLNGI 143
           D   K S            +       +  +  KP +SY   I  AI  SP  ++TL+ I
Sbjct: 304 DEGRKGSKSSPVPKDAAPPSTLSDLSADVNRTSKPAFSYANSITQAILSSPSGKMTLSEI 363

Query: 144 YEYIMTNFPYYKTNKQGWQNSIRHNLSLNKCFVKVPRHYDDPGKGNYWMLDASSEDVFIG 203
           Y +I +N PYY+     WQNSIRHNLSLNK F KVPR    PGKG +W +D S       
Sbjct: 364 YSWISSNLPYYRHKPTAWQNSIRHNLSLNKSFEKVPRSASQPGKGCFWKIDYSYIYEK-- 421

Query: 204 GTTGKLRRRTTAASRSRLAA 223
               K   R+   S S  + 
Sbjct: 422 --ESKRNPRSPKKSPSAHSV 439



 Score = 77.9 bits (192), Expect = 1e-15
 Identities = 44/158 (27%), Positives = 63/158 (39%), Gaps = 11/158 (6%)

Query: 55  NDLSDVDHEYMSDVDVTGDVDVTGDASPLDCSNKKSS-EDNNASSEESKANEEGKPKECK 113
                +  +    V    +V + G +  +             +    +  N     +E +
Sbjct: 13  EREKQMIPKKSITVSKLKEVVINGKSHNVSSVLSSPKAPTRVSLDYSNSLNRSVGSEEIQ 72

Query: 114 KDEK----------PPYSYNALIMMAIRQSPEKRLTLNGIYEYIMTNFPYYKTNKQGWQN 163
              K          PPYSY     +AI  SP+K LTL+ IY +I   F YY      WQN
Sbjct: 73  AYAKLAGSNWTYYVPPYSYATGRGLAILNSPDKPLTLSKIYTWIHNTFFYYAKVVSRWQN 132

Query: 164 SIRHNLSLNKCFVKVPRHYDDPGKGNYWMLDASSEDVF 201
           SIRHNLSLN  F+K+        KG++W +    E  F
Sbjct: 133 SIRHNLSLNDAFIKIEGRNGAKVKGHFWSIGPGHETQF 170



 Score = 32.9 bits (75), Expect = 0.27
 Identities = 12/54 (22%), Positives = 19/54 (35%)

Query: 121 SYNALIMMAIRQSPEKRLTLNGIYEYIMTNFPYYKTNKQGWQNSIRHNLSLNKC 174
             +ALI +    S    ++L        T+ P    +  G +N  R N S    
Sbjct: 226 KSSALIRIPADSSSNLDVSLGHHISQPSTHTPVLDNHSSGEENISRINNSSQID 279


>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
          Length = 440

 Score = 31.8 bits (73), Expect = 0.54
 Identities = 14/55 (25%), Positives = 21/55 (38%), Gaps = 2/55 (3%)

Query: 203 GGTTGKLRRRTTAASRSRLAAFKRSAVFGAAAAGIYPPGFIPTATPGSYPPPSWA 257
                 LRR  + A+ S  ++F+  A      AG         A   +  PP+WA
Sbjct: 350 SAAASPLRRAASRAAESMKSSFRAGARSTGGGAGGAAAA--AAAGAAAAGPPAWA 402


>gnl|CDD|214574 smart00233, PH, Pleckstrin homology domain.  Domain commonly found
           in eukaryotic signalling proteins. The domain family
           possesses multiple functions including the abilities to
           bind inositol phosphates, and various proteins. PH
           domains have been found to possess inserted domains
           (such as in PLC gamma, syntrophins) and to be inserted
           within other domains. Mutations in Brutons tyrosine
           kinase (Btk) within its PH domain cause X-linked
           agammaglobulinaemia (XLA) in patients. Point mutations
           cluster into the positively charged end of the molecule
           around the predicted binding site for
           phosphatidylinositol lipids.
          Length = 102

 Score = 28.3 bits (63), Expect = 2.0
 Identities = 10/50 (20%), Positives = 20/50 (40%)

Query: 153 YYKTNKQGWQNSIRHNLSLNKCFVKVPRHYDDPGKGNYWMLDASSEDVFI 202
           YYK+ K       + ++ L+ C V+     D   K + + +  S     +
Sbjct: 32  YYKSKKDKKSYKPKGSIDLSGCTVREAPDPDSSKKPHCFEIKTSDRKTLL 81


>gnl|CDD|215055 PLN00101, PLN00101, Photosystem I light-harvesting complex type 4
           protein; Provisional.
          Length = 250

 Score = 29.9 bits (67), Expect = 2.2
 Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 2/56 (3%)

Query: 196 SSEDVFIGGTTGKLRR--RTTAASRSRLAAFKRSAVFGAAAAGIYPPGFIPTATPG 249
           SS+  F+ G++ KL R         S   +FK  A  G    G+  PG++  + PG
Sbjct: 17  SSKSRFLSGSSSKLNRELSFKPMPSSATRSFKVEAKKGEWLPGLASPGYLNGSLPG 72


>gnl|CDD|234293 TIGR03648, Na_symport_lg, probable sodium:solute symporter, VC_2705
           subfamily.  This family belongs to a larger family of
           transporters of the sodium:solute symporter superfamily,
           TC 2.A.21. Members of this strictly bacterial protein
           subfamily are found almost invariably immediately
           downstream from a member of family TIGR03647.
           Occasionally, the two genes are fused.
          Length = 552

 Score = 29.9 bits (68), Expect = 2.2
 Identities = 11/32 (34%), Positives = 16/32 (50%), Gaps = 3/32 (9%)

Query: 215 AASRSRLAAFKRS---AVFGAAAAGIYPPGFI 243
           A+ + RL   + +   A   A   G+ PPGFI
Sbjct: 407 ASEKQRLLVARIALVVAAVIAGYFGLNPPGFI 438


>gnl|CDD|234300 TIGR03662, Chlor_Arch_YYY, Chlor_Arch_YYY domain.  Members of this
           highly hydrophobic probable integral membrane family
           belong to two classes. In one, a single copy of the
           region modeled by This model represents essentially the
           full length of a strongly hydrophobic protein of about
           700 to 900 residues (variable because of long inserts in
           some). The domain architecture of the other class
           consists of an additional N-terminal region, two copies
           of the region represented by this model, and three to
           four repeats of TPR, or tetratricopeptide repeat. The
           unusual species range includes several Archaea, several
           Chloroflexi, and Clostridium phytofermentans. An unusual
           motif YYYxG is present, and we suggest the name
           Chlor_Arch_YYY protein. The function is unknown.
          Length = 723

 Score = 29.7 bits (67), Expect = 2.9
 Identities = 20/107 (18%), Positives = 32/107 (29%), Gaps = 16/107 (14%)

Query: 190 YWMLDASSEDVFIGGTTGKLRRRTTAASRSRLAAFKRSAVFGAAAAGIYPPGFIPTATPG 249
            W+L A            + R+       +  A            AGIYP   +PT    
Sbjct: 531 GWLLRAYR--------GRRKRQFLRKGWAAICAIL-------LLLAGIYPVVAVPTRIRD 575

Query: 250 SYPPPSWALQQLYCNPLYRYNPYAYPGLQAAAIPKPTPLPAHGGAFS 296
               P+      Y    +  +  A   L+      P  L A G +++
Sbjct: 576 GEAGPTLDGMA-YLEESHPEDAAAIRWLRQNIDGTPVILEAAGDSYT 621


>gnl|CDD|221592 pfam12471, GTP_CH_N, GTP cyclohydrolase N terminal.  This domain
           family is found in bacteria and eukaryotes, and is
           approximately 190 amino acids in length. This family is
           the N terminal of GTP cyclohydrolase, the rate limiting
           enzyme in the synthesis of tetrahydrobiopterin.
          Length = 195

 Score = 28.3 bits (64), Expect = 4.7
 Identities = 11/32 (34%), Positives = 15/32 (46%), Gaps = 5/32 (15%)

Query: 290 AHGGAFSMERLLAD-----PATFRPNFVQTSP 316
           AH G++S+ R LA          RP+   T P
Sbjct: 51  AHSGSYSIYRALAVAAGALDPDHRPDLTNTEP 82


>gnl|CDD|223815 COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding
           protein) [Cell envelope biogenesis, outer membrane].
          Length = 661

 Score = 28.9 bits (65), Expect = 4.9
 Identities = 12/41 (29%), Positives = 15/41 (36%)

Query: 212 RTTAASRSRLAAFKRSAVFGAAAAGIYPPGFIPTATPGSYP 252
           R T A R   + FK    + A   G   P  I    P +Y 
Sbjct: 356 RATQAKRQPGSTFKPFLYYAALEEGKATPSTIVVDAPVTYG 396


>gnl|CDD|235221 PRK04123, PRK04123, ribulokinase; Provisional.
          Length = 548

 Score = 28.7 bits (65), Expect = 5.9
 Identities = 11/20 (55%), Positives = 13/20 (65%), Gaps = 2/20 (10%)

Query: 227 SAVFGAAAAGIYPPGFIPTA 246
           +A+F A AAG YP   IP A
Sbjct: 480 AAIFAAVAAGAYPD--IPEA 497


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 27.8 bits (62), Expect = 6.3
 Identities = 7/34 (20%), Positives = 11/34 (32%)

Query: 86  SNKKSSEDNNASSEESKANEEGKPKECKKDEKPP 119
               S ++   + EES    E +     K E   
Sbjct: 90  ELSSSDDEEEETEEESTDETEQEDPPETKTESKE 123


>gnl|CDD|224701 COG1787, COG1787, Predicted endonuclease distantly related to
           archaeal Holliday junction resolvase and Mrr-like
           restriction enzymes [Defense mechanisms].
          Length = 217

 Score = 27.9 bits (62), Expect = 6.7
 Identities = 12/45 (26%), Positives = 16/45 (35%)

Query: 216 ASRSRLAAFKRSAVFGAAAAGIYPPGFIPTATPGSYPPPSWALQQ 260
           A R   A  K++     A+   Y        T GSY   +  L Q
Sbjct: 152 AKRYISAVSKKAIQEIVASKAYYGCEKAKVVTNGSYTYAAQKLAQ 196


>gnl|CDD|179825 PRK04326, PRK04326, methionine synthase; Provisional.
          Length = 330

 Score = 28.0 bits (63), Expect = 7.6
 Identities = 9/20 (45%), Positives = 11/20 (55%)

Query: 240 PGFIPTATPGSYPPPSWALQ 259
             F+PT   GSYP P W  +
Sbjct: 6   LPFLPTTVVGSYPKPKWLRE 25


>gnl|CDD|198358 cd07781, FGGY_RBK, Ribulokinases; belongs to the FGGY family of
           carbohydrate kinases.  This subgroup is predominantly
           composed of bacterial ribulokinases (RBK) which catalyze
           the MgATP-dependent phosphorylation of L(or D)-ribulose
           to produce L(or D)-ribulose 5-phosphate and ADP. RBK
           also phosphorylates a variety of other sugar substrates
           including ribitol and arabitol. The reason why L-RBK can
           phosphorylate so many different substrates is not yet
           clear. The presence of Mg2+ is required for catalytic
           activity. This group belongs to the FGGY family of
           carbohydrate kinases, the monomers of which contain two
           large domains, which are separated by a deep cleft that
           forms the active site. This model includes both the
           N-terminal domain, which adopts a ribonuclease H-like
           fold, and the structurally related C-terminal domain.
          Length = 498

 Score = 28.4 bits (64), Expect = 7.8
 Identities = 8/13 (61%), Positives = 11/13 (84%)

Query: 227 SAVFGAAAAGIYP 239
           +A+F A AAG+YP
Sbjct: 477 AAIFAAVAAGVYP 489


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 28.3 bits (63), Expect = 8.2
 Identities = 14/62 (22%), Positives = 24/62 (38%), Gaps = 7/62 (11%)

Query: 53  SDNDLSDVDHEYMSDVDVTGDVDVTGDASPLDCSNKKSSEDNNASSEESKANEEGKPKEC 112
           S ++    +   ++  + T         S  D S+ + SE  + S  E  +    KPKE 
Sbjct: 214 SSDEEEGEEAPSINYNEDT-------SESESDESDSEISESRSVSDSEESSPPSKKPKEK 266

Query: 113 KK 114
           K 
Sbjct: 267 KT 268


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 28.3 bits (63), Expect = 9.1
 Identities = 21/103 (20%), Positives = 25/103 (24%), Gaps = 6/103 (5%)

Query: 214 TAASRSRLAAFKRSAVFGAAAAGIYPPGFIPTATPGSYPPPSWALQQLYCNPLYRYNPYA 273
            AA  +   A        AAA           A P    P   AL         R     
Sbjct: 389 AAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAL------AAARQASAR 442

Query: 274 YPGLQAAAIPKPTPLPAHGGAFSMERLLADPATFRPNFVQTSP 316
            PG   A  P P   PA     +        A       + +P
Sbjct: 443 GPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.130    0.393 

Gapped
Lambda     K      H
   0.267   0.0768    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,303,431
Number of extensions: 1809245
Number of successful extensions: 1350
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1343
Number of HSP's successfully gapped: 34
Length of query: 376
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 278
Effective length of database: 6,590,910
Effective search space: 1832272980
Effective search space used: 1832272980
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 60 (26.8 bits)