RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14051
         (256 letters)



>gnl|CDD|189427 pfam00157, Pou, Pou domain - N-terminal to homeobox domain. 
          Length = 75

 Score =  101 bits (253), Expect = 9e-28
 Identities = 38/109 (34%), Positives = 53/109 (48%), Gaps = 34/109 (31%)

Query: 2   VDGINLDEIKEFAKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQ 61
            D  +LDE+++FAK FK RR+ LG TQ  VG AL    GP +SQ+ IC            
Sbjct: 1   DDTTDLDELEQFAKEFKQRRIKLGYTQADVGLALGALYGPVFSQTTICR----------- 49

Query: 62  IASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAE 110
                                  FE L ++ K+  K+KP+LE+W++EAE
Sbjct: 50  -----------------------FEALQLSFKNMCKLKPLLEKWLEEAE 75


>gnl|CDD|197673 smart00352, POU, Found in Pit-Oct-Unc transcription factors. 
          Length = 75

 Score = 95.6 bits (238), Expect = 1e-25
 Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 34/108 (31%)

Query: 3   DGINLDEIKEFAKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQI 62
           D  +  E++ FAK FK RR+ LG TQ  VG AL    GPA+SQ+ IC             
Sbjct: 2   DDTDPRELEAFAKQFKQRRIKLGFTQADVGLALGALYGPAFSQTTICR------------ 49

Query: 63  ASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAE 110
                                 FE L ++ K+  K+KP+L++W++EAE
Sbjct: 50  ----------------------FEALQLSFKNMCKLKPLLQKWLEEAE 75


>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
           Reviewed.
          Length = 830

 Score = 33.6 bits (77), Expect = 0.094
 Identities = 13/27 (48%), Positives = 14/27 (51%)

Query: 55  QMLCAAQIASQQQQQQQQQHQQQQQQQ 81
           Q LC      QQ  Q QQQ QQQ  +Q
Sbjct: 788 QSLCQQSEMQQQPSQPQQQPQQQPAEQ 814


>gnl|CDD|222221 pfam13560, HTH_31, Helix-turn-helix domain.  This domain is a
          helix-turn-helix domain that probably binds to DNA.
          Length = 63

 Score = 30.2 bits (69), Expect = 0.13
 Identities = 8/28 (28%), Positives = 13/28 (46%)

Query: 12 EFAKAFKLRRLSLGLTQTQVGQALSVTE 39
          E     +  R   GLTQ ++ + L V+ 
Sbjct: 1  ELGARLRRLRERAGLTQAELAERLGVSR 28


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 33.0 bits (76), Expect = 0.15
 Identities = 17/65 (26%), Positives = 33/65 (50%), Gaps = 14/65 (21%)

Query: 64  SQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMK-----EAEE-----RQ 113
           ++Q Q+  QQ  Q   Q N+M    ++T +  Q++   +E  M+     EA++     +Q
Sbjct: 513 AEQAQRNPQQGDQPPDQGNSM----ELTQQDLQRMMDRIEELMESGRRAEAQQLLEQLQQ 568

Query: 114 VMSNM 118
           +M N+
Sbjct: 569 MMENL 573



 Score = 31.4 bits (72), Expect = 0.47
 Identities = 9/17 (52%), Positives = 9/17 (52%)

Query: 65  QQQQQQQQQHQQQQQQQ 81
           Q  Q Q Q  QQ QQ Q
Sbjct: 628 QGGQGQGQPGQQGQQGQ 644



 Score = 31.4 bits (72), Expect = 0.55
 Identities = 10/18 (55%), Positives = 10/18 (55%)

Query: 65  QQQQQQQQQHQQQQQQQN 82
            Q  QQ QQ Q QQQ Q 
Sbjct: 634 GQPGQQGQQGQGQQQGQQ 651



 Score = 31.0 bits (71), Expect = 0.63
 Identities = 9/19 (47%), Positives = 9/19 (47%)

Query: 64  SQQQQQQQQQHQQQQQQQN 82
            Q QQ Q  Q Q Q  QQ 
Sbjct: 622 QQGQQGQGGQGQGQPGQQG 640



 Score = 31.0 bits (71), Expect = 0.67
 Identities = 10/20 (50%), Positives = 10/20 (50%)

Query: 65  QQQQQQQQQHQQQQQQQNTM 84
           Q Q Q  QQ QQ Q QQ   
Sbjct: 631 QGQGQPGQQGQQGQGQQQGQ 650



 Score = 30.6 bits (70), Expect = 0.98
 Identities = 10/25 (40%), Positives = 10/25 (40%)

Query: 65  QQQQQQQQQHQQQQQQQNTMFEKLD 89
           Q QQ Q QQ  QQ Q        L 
Sbjct: 639 QGQQGQGQQQGQQGQGGQGGQGSLA 663



 Score = 29.9 bits (68), Expect = 1.4
 Identities = 10/17 (58%), Positives = 10/17 (58%)

Query: 65  QQQQQQQQQHQQQQQQQ 81
             QQ QQ Q QQQ QQ 
Sbjct: 636 PGQQGQQGQGQQQGQQG 652



 Score = 29.9 bits (68), Expect = 1.4
 Identities = 10/24 (41%), Positives = 11/24 (45%)

Query: 65  QQQQQQQQQHQQQQQQQNTMFEKL 88
           Q  QQ QQ   QQQ QQ    +  
Sbjct: 635 QPGQQGQQGQGQQQGQQGQGGQGG 658



 Score = 29.9 bits (68), Expect = 1.5
 Identities = 9/17 (52%), Positives = 9/17 (52%)

Query: 65  QQQQQQQQQHQQQQQQQ 81
           QQ Q  Q Q Q  QQ Q
Sbjct: 625 QQGQGGQGQGQPGQQGQ 641



 Score = 29.9 bits (68), Expect = 1.6
 Identities = 9/19 (47%), Positives = 9/19 (47%)

Query: 65  QQQQQQQQQHQQQQQQQNT 83
           Q Q  QQ Q  Q QQQ   
Sbjct: 633 QGQPGQQGQQGQGQQQGQQ 651



 Score = 29.5 bits (67), Expect = 1.9
 Identities = 7/17 (41%), Positives = 7/17 (41%)

Query: 65  QQQQQQQQQHQQQQQQQ 81
             Q Q Q   Q QQ Q 
Sbjct: 629 GGQGQGQPGQQGQQGQG 645



 Score = 29.5 bits (67), Expect = 2.2
 Identities = 10/22 (45%), Positives = 10/22 (45%)

Query: 60  AQIASQQQQQQQQQHQQQQQQQ 81
           AQ   QQ QQ Q    Q Q  Q
Sbjct: 617 AQRGEQQGQQGQGGQGQGQPGQ 638



 Score = 29.1 bits (66), Expect = 3.1
 Identities = 9/18 (50%), Positives = 9/18 (50%)

Query: 64  SQQQQQQQQQHQQQQQQQ 81
            QQ Q QQQ  Q Q  Q 
Sbjct: 640 GQQGQGQQQGQQGQGGQG 657



 Score = 28.7 bits (65), Expect = 3.3
 Identities = 8/17 (47%), Positives = 8/17 (47%)

Query: 65  QQQQQQQQQHQQQQQQQ 81
            Q Q Q  Q  QQ Q Q
Sbjct: 630 GQGQGQPGQQGQQGQGQ 646



 Score = 28.7 bits (65), Expect = 3.4
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 61  QIASQQQQQQQQQHQQQQQQQ 81
            +A QQ QQQ  Q QQQ +Q 
Sbjct: 738 AMAQQQGQQQGGQGQQQGRQG 758



 Score = 28.7 bits (65), Expect = 3.6
 Identities = 8/17 (47%), Positives = 8/17 (47%)

Query: 65  QQQQQQQQQHQQQQQQQ 81
           Q Q  Q Q    QQ QQ
Sbjct: 626 QGQGGQGQGQPGQQGQQ 642



 Score = 28.7 bits (65), Expect = 3.7
 Identities = 10/25 (40%), Positives = 11/25 (44%)

Query: 59  AAQIASQQQQQQQQQHQQQQQQQNT 83
           A  +     QQQ QQ   Q QQQ  
Sbjct: 732 ARALGEAMAQQQGQQQGGQGQQQGR 756



 Score = 28.7 bits (65), Expect = 3.8
 Identities = 8/17 (47%), Positives = 8/17 (47%)

Query: 65  QQQQQQQQQHQQQQQQQ 81
            Q  Q Q Q  QQ QQ 
Sbjct: 627 GQGGQGQGQPGQQGQQG 643



 Score = 27.9 bits (63), Expect = 5.5
 Identities = 8/18 (44%), Positives = 10/18 (55%)

Query: 65  QQQQQQQQQHQQQQQQQN 82
           Q+Q   Q+  QQ QQ Q 
Sbjct: 612 QEQFNAQRGEQQGQQGQG 629



 Score = 27.9 bits (63), Expect = 6.0
 Identities = 7/30 (23%), Positives = 12/30 (40%)

Query: 55  QMLCAAQIASQQQQQQQQQHQQQQQQQNTM 84
           QM+   Q+    Q  Q +  Q  +    T+
Sbjct: 568 QMMENLQVTQGGQGGQSEMQQAMEGLGETL 597



 Score = 27.9 bits (63), Expect = 6.6
 Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 5/36 (13%)

Query: 59  AAQIASQQQQ-----QQQQQHQQQQQQQNTMFEKLD 89
           A Q+  Q QQ     Q  Q  Q  Q +     E L 
Sbjct: 559 AQQLLEQLQQMMENLQVTQGGQGGQSEMQQAMEGLG 594


>gnl|CDD|237354 PRK13341, PRK13341, recombination factor protein RarA/unknown
           domain fusion protein; Reviewed.
          Length = 725

 Score = 31.6 bits (72), Expect = 0.45
 Identities = 19/89 (21%), Positives = 34/89 (38%), Gaps = 6/89 (6%)

Query: 21  RLSLGLTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQIASQQQQQQQQQHQQQQQQ 80
           +L L  +   VG A ++         A+ SAL    L A ++  + Q  Q    QQ  + 
Sbjct: 595 KLKLLYSIPAVGPAWALLSEDELLYKALYSAL----LEAEELWLKLQWDQSLL-QQSLEM 649

Query: 81  QNTMFEKLDITPKSAQKI-KPVLERWMKE 108
                       + A  I   +++RW+ +
Sbjct: 650 PGWSTGIEQWPEELALGIDSKLIKRWLAQ 678


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 31.1 bits (71), Expect = 0.52
 Identities = 10/49 (20%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 65  QQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEERQ 113
           ++ +Q+   +++QQ++     E +     +A K K    R +K  E+ +
Sbjct: 244 ERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSR-IKRLEKLE 291


>gnl|CDD|218813 pfam05929, Phage_GPO, Phage capsid scaffolding protein (GPO) serine
           peptidase.  This family consists of several
           bacteriophage capsid scaffolding proteins (GPO) and some
           related bacterial sequences. GPO is thought to function
           in both the assembly of proheads and the cleavage of
           GPN. The family is found to function as a serine
           peptidase, with a conserved Asp, His and Ser catalytic
           triad, as in subtilisin, and as represented in
           MEROPS:S73. The family includes GpO from Enterobacteria
           phage P2 which cleaves itself and then becomes the
           scaffold protein upon which the bacteriophage prohead is
           built - a mechanism quite common amongst phages.
          Length = 277

 Score = 30.6 bits (69), Expect = 0.59
 Identities = 9/38 (23%), Positives = 16/38 (42%)

Query: 64  SQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPV 101
           S  +Q    Q  ++ +  N +   L  T   AQ+ +P 
Sbjct: 227 SDLEQALNSQGDERDKAFNALKASLSHTDDYAQQARPP 264


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 30.8 bits (70), Expect = 0.59
 Identities = 11/28 (39%), Positives = 17/28 (60%), Gaps = 2/28 (7%)

Query: 59  AAQIASQQQQQQQQQH--QQQQQQQNTM 84
           AAQ A  +++Q QQ+    +QQ QQ  +
Sbjct: 183 AAQKAELEEKQSQQKTLLYEQQAQQQKL 210


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 30.6 bits (69), Expect = 0.75
 Identities = 14/31 (45%), Positives = 16/31 (51%)

Query: 48  ICSALATQMLCAAQIASQQQQQQQQQHQQQQ 78
           I +A A Q+     I  Q QQQ Q Q QQ Q
Sbjct: 292 IGNAFANQIQLNFVIPQQAQQQGQGQQQQAQ 322



 Score = 29.8 bits (67), Expect = 1.4
 Identities = 12/20 (60%), Positives = 12/20 (60%)

Query: 67  QQQQQQQHQQQQQQQNTMFE 86
           QQ QQQ   QQQQ Q T  E
Sbjct: 308 QQAQQQGQGQQQQAQATAQE 327



 Score = 28.3 bits (63), Expect = 5.0
 Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 61  QIASQQQQQQQQQHQQQQQQ 80
           Q A QQ Q QQQQ Q   Q+
Sbjct: 308 QQAQQQGQGQQQQAQATAQE 327


>gnl|CDD|225387 COG2831, FhaC, Hemolysin activation/secretion protein
          [Intracellular trafficking and secretion].
          Length = 554

 Score = 30.4 bits (69), Expect = 0.90
 Identities = 7/34 (20%), Positives = 16/34 (47%)

Query: 49 CSALATQMLCAAQIASQQQQQQQQQHQQQQQQQN 82
           +AL      AA     +   +QQQ ++ ++++ 
Sbjct: 13 SAALPPLSSAAALENPAEGNIRQQQDERARREEL 46


>gnl|CDD|220844 pfam10675, DUF2489, Protein of unknown function (DUF2489).  This
          is a bacterial family of uncharacterized proteins.
          Length = 131

 Score = 29.1 bits (66), Expect = 0.98
 Identities = 6/30 (20%), Positives = 18/30 (60%)

Query: 61 QIASQQQQQQQQQHQQQQQQQNTMFEKLDI 90
          ++  Q++ Q++ Q + +Q ++  + E + I
Sbjct: 14 KLKKQKELQEKAQAEAKQARRARILESIRI 43


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 30.4 bits (68), Expect = 1.1
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 55  QMLCAAQIASQQQQQQQQQHQQQQQQ 80
           Q+L     A QQQQ + QQ QQQ+QQ
Sbjct: 77  QVLLKQTKAHQQQQLENQQRQQQRQQ 102


>gnl|CDD|240373 PTZ00345, PTZ00345, glycerol-3-phosphate dehydrogenase;
           Provisional.
          Length = 365

 Score = 29.6 bits (67), Expect = 1.4
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 170 LPATNMPHFILASGQLVQGIQGAQLLI 196
           LP   +P  I+A   L + ++ A LLI
Sbjct: 72  LPGIKLPDNIVAVSDLKEAVEDADLLI 98


>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
          Length = 267

 Score = 29.4 bits (67), Expect = 1.6
 Identities = 15/49 (30%), Positives = 24/49 (48%), Gaps = 4/49 (8%)

Query: 55  QMLCAAQIASQQQQQQQQQHQQQQQQQNTM---FEKLDITPK-SAQKIK 99
           Q+L   Q   + QQQQ     QQ Q+  T+   ++ L ++     Q+IK
Sbjct: 170 QLLRMMQAGFRFQQQQGGGGYQQAQRGPTLEDAYKVLGVSESDDDQEIK 218


>gnl|CDD|238045 cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins.
          Prokaryotic DNA binding proteins belonging to the
          xenobiotic response element family of transcriptional
          regulators.
          Length = 58

 Score = 27.1 bits (61), Expect = 1.7
 Identities = 9/32 (28%), Positives = 15/32 (46%), Gaps = 1/32 (3%)

Query: 14 AKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQ 45
           +  K  R   GLTQ ++ + L V+     S+
Sbjct: 1  GERLKELRKEKGLTQEELAEKLGVSRS-TISR 31


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 28.9 bits (65), Expect = 2.2
 Identities = 18/101 (17%), Positives = 34/101 (33%)

Query: 33  QALSVTEGPAYSQSAICSALATQMLCAAQIASQQQQQQQQQHQQQQQQQNTMFEKLDITP 92
            A S  + PA  ++            + Q+ SQQQ  Q+    Q QQQ N          
Sbjct: 132 PANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQPNGETPPQQTDG 191

Query: 93  KSAQKIKPVLERWMKEAEERQVMSNMPGIISPPLISQNQNM 133
               + + ++     +    Q +    G  +  ++ Q +  
Sbjct: 192 AGDDESEALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQ 232


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 29.3 bits (65), Expect = 2.3
 Identities = 18/70 (25%), Positives = 25/70 (35%), Gaps = 7/70 (10%)

Query: 62  IASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEERQVMSNMPGI 121
           +A Q Q QQ QQ    Q Q     + +   P+  Q  +PV       A + Q       +
Sbjct: 772 VAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV-------APQPQYQQPQQPV 824

Query: 122 ISPPLISQNQ 131
              P   Q Q
Sbjct: 825 APQPQYQQPQ 834


>gnl|CDD|173487 PTZ00236, PTZ00236, mitochondrial import inner membrane translocase
           subunit tim17; Provisional.
          Length = 164

 Score = 28.5 bits (64), Expect = 2.4
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 61  QIASQQQQQQQQQHQQQQQQQNTMFEK 87
           Q+ + +QQ QQQQ  ++Q ++    E+
Sbjct: 137 QMRTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 27.7 bits (62), Expect = 3.9
 Identities = 9/45 (20%), Positives = 20/45 (44%)

Query: 44  SQSAICSALATQMLCAAQIASQQQQQQQQQHQQQQQQQNTMFEKL 88
            ++AI   +   ++    I   ++Q +  + Q QQQQ+     + 
Sbjct: 113 VRNAIFGGILLGIIELVSIGMNRRQMRTPRQQFQQQQEMEKQLEE 157



 Score = 27.3 bits (61), Expect = 6.5
 Identities = 7/25 (28%), Positives = 13/25 (52%)

Query: 60  AQIASQQQQQQQQQHQQQQQQQNTM 84
            +   QQ QQQQ+  +Q +++    
Sbjct: 138 MRTPRQQFQQQQEMEKQLEEEDKKE 162


>gnl|CDD|133900 PHA00455, PHA00455, hypothetical protein.
          Length = 85

 Score = 27.0 bits (60), Expect = 2.8
 Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 7/65 (10%)

Query: 140 LSLNSNAVSANFHPNVNPRTDLKN-------PSAGSTLPATNMPHFILASGQLVQGIQGA 192
           + L+ N  +  F    N R+ +K        P  G T+P     H ++  G+  +  Q +
Sbjct: 1   MRLHFNKSNGIFSVRRNDRSTVKASEKHAKLPLIGDTVPLAPRVHALITRGKFEKATQKS 60

Query: 193 QLLIP 197
           +  +P
Sbjct: 61  RPFVP 65


>gnl|CDD|222719 pfam14377, DUF4414, Domain of unknown function (DUF4414).  This
           family is frequently found on DNA binding proteins of
           the URE-B1 type and on ligases.
          Length = 105

 Score = 27.7 bits (62), Expect = 2.9
 Identities = 17/85 (20%), Positives = 29/85 (34%), Gaps = 9/85 (10%)

Query: 62  IASQQQQQQQQQHQQQQQQQNTMF-EKLDITPKSAQKIKPVLERWMKEAEERQVMSNMPG 120
           IA Q +   +Q      ++   +  E L   P   +      E   +E +ERQ  +    
Sbjct: 18  IAQQLRASVRQSTSASGEEPTEINPEFLAALPPEIR-----EEVLQQERQERQRRARE-- 70

Query: 121 IISPPLISQNQNMVNAPLNLSLNSN 145
            I+    +  Q M  A    +L   
Sbjct: 71  -IAAAGAATPQEMDPASFLATLPPE 94


>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
          Length = 110

 Score = 27.5 bits (62), Expect = 2.9
 Identities = 8/22 (36%), Positives = 14/22 (63%)

Query: 60 AQIASQQQQQQQQQHQQQQQQQ 81
           ++  Q Q+QQQ Q +Q++ Q 
Sbjct: 12 EELQKQAQEQQQAQEEQEEAQA 33



 Score = 26.4 bits (59), Expect = 7.0
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query: 65 QQQQQQQQQHQQQQQQQ 81
           Q+QQQ Q+ Q++ Q Q
Sbjct: 18 AQEQQQAQEEQEEAQAQ 34


>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
           ABF1 is a sequence-specific DNA binding protein involved
           in transcription activation, gene silencing and
           initiation of DNA replication. ABF1 is known to remodel
           chromatin, and it is proposed that it mediates its
           effects on transcription and gene expression by
           modifying local chromatin architecture. These functions
           require a conserved stretch of 20 amino acids in the
           C-terminal region of ABF1 (amino acids 639 to 662 S.
           cerevisiae). The N-terminal two thirds of the protein
           are necessary for DNA binding, and the N-terminus (amino
           acids 9 to 91 in S. cerevisiae) is thought to contain a
           novel zinc-finger motif which may stabilise the protein
           structure.
          Length = 508

 Score = 28.9 bits (64), Expect = 2.9
 Identities = 8/20 (40%), Positives = 9/20 (45%)

Query: 61  QIASQQQQQQQQQHQQQQQQ 80
           Q+    Q   Q  HQ QQQ 
Sbjct: 309 QLPEGHQHDHQHHHQHQQQD 328



 Score = 28.5 bits (63), Expect = 4.0
 Identities = 10/17 (58%), Positives = 11/17 (64%)

Query: 65  QQQQQQQQQHQQQQQQQ 81
            Q QQQ  QHQ+Q QQ 
Sbjct: 322 HQHQQQDLQHQEQHQQH 338


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 28.9 bits (65), Expect = 3.0
 Identities = 15/91 (16%), Positives = 31/91 (34%), Gaps = 5/91 (5%)

Query: 9   EIKEFAKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQIASQQQQ 68
           E  E     +LR   L              E    +  A+  A   Q     + A    +
Sbjct: 439 EFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQ-----EQAEANVE 493

Query: 69  QQQQQHQQQQQQQNTMFEKLDITPKSAQKIK 99
           Q Q + +Q +++++   E L    +   +++
Sbjct: 494 QLQSELRQLRKRRDEALEALQRAERRLLQLR 524


>gnl|CDD|206093 pfam13922, PHD_3, PHD domain of transcriptional enhancer, Asx.
          This is the DNA-binding domain on the additional sex
          combs-like 1 proteins. The Asx protein acts as an
          enhancer of trithorax and polycomb in displaying
          bidirectional homoeotic phenotypes in Drosophila,
          suggesting that it is required for maintenance of both
          activation and silencing of Hox genes. Asx is required
          for normal adult haematopoiesis and its function
          depends on its cellular context.
          Length = 64

 Score = 26.7 bits (59), Expect = 3.1
 Identities = 5/19 (26%), Positives = 5/19 (26%)

Query: 64 SQQQQQQQQQHQQQQQQQN 82
            Q      QH  QQ    
Sbjct: 2  QAQNGSGHSQHLPQQNFSG 20



 Score = 26.3 bits (58), Expect = 3.7
 Identities = 8/17 (47%), Positives = 8/17 (47%)

Query: 66 QQQQQQQQHQQQQQQQN 82
          QQ Q    H Q   QQN
Sbjct: 1  QQAQNGSGHSQHLPQQN 17



 Score = 25.9 bits (57), Expect = 5.4
 Identities = 4/19 (21%), Positives = 5/19 (26%)

Query: 64 SQQQQQQQQQHQQQQQQQN 82
           Q Q          QQ  +
Sbjct: 1  QQAQNGSGHSQHLPQQNFS 19


>gnl|CDD|235310 PRK04750, ubiB, putative ubiquinone biosynthesis protein UbiB;
           Reviewed.
          Length = 537

 Score = 28.7 bits (65), Expect = 3.2
 Identities = 9/10 (90%), Positives = 9/10 (90%)

Query: 99  KPVLERWMKE 108
           KP LERWMKE
Sbjct: 427 KPFLERWMKE 436


>gnl|CDD|197775 smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins. 
          Length = 56

 Score = 26.3 bits (59), Expect = 3.3
 Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 6/31 (19%)

Query: 18 KLRRLSLGLTQTQVGQALSVTEGPAYSQSAI 48
          K  R   GLTQ ++ + L V      S+S +
Sbjct: 3  KELREEKGLTQEELAEKLGV------SRSTL 27


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.2 bits (63), Expect = 3.6
 Identities = 16/67 (23%), Positives = 38/67 (56%), Gaps = 12/67 (17%)

Query: 65  QQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEER-------QVMSN 117
            +QQQQ+++ ++++Q++    ++ +   + A++     ++WMK  ++R       Q M +
Sbjct: 192 LKQQQQKREEERRKQRKKQ--QEEEERKQKAEEA---WQKWMKNVKQRPKPVPLNQGMDS 246

Query: 118 MPGIISP 124
           + G ISP
Sbjct: 247 LRGTISP 253


>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit
          SEC24/subunit SFB2/subunit SFB3 [Intracellular
          trafficking and secretion].
          Length = 861

 Score = 28.6 bits (64), Expect = 3.8
 Identities = 15/62 (24%), Positives = 22/62 (35%), Gaps = 1/62 (1%)

Query: 26 LTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQIASQQQQQ-QQQQHQQQQQQQNTM 84
            Q+QV    + ++  A    A  +  A QM          QQQ ++Q  Q      NT 
Sbjct: 11 QAQSQVHTGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQIDQAATAMHNTG 70

Query: 85 FE 86
            
Sbjct: 71 AN 72


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 28.4 bits (63), Expect = 4.1
 Identities = 13/52 (25%), Positives = 25/52 (48%)

Query: 67  QQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEERQVMSNM 118
              Q+  H++++++   + E+L  T K  +K+K   E   KE E     S +
Sbjct: 457 GHPQEHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEI 508


>gnl|CDD|131979 TIGR02933, nifM_nitrog, nitrogen fixation protein NifM.  Members of
           this protein family, found in a subset of
           nitrogen-fixing bacteria, are the nitrogen fixation
           protein NifM. NifM, homologous to peptidyl-prolyl
           cis-trans isomerases, appears to be an accessory protein
           for NifH, the Fe protein, also called component II or
           dinitrogenase reductase, of nitrogenase [Central
           intermediary metabolism, Nitrogen fixation].
          Length = 256

 Score = 27.9 bits (62), Expect = 4.5
 Identities = 10/57 (17%), Positives = 22/57 (38%), Gaps = 5/57 (8%)

Query: 59  AAQIASQQQQQQQQQHQQQQQQQNTMFEK-----LDITPKSAQKIKPVLERWMKEAE 110
             +++  Q QQ  Q  Q+Q+  +  +        + I P   ++    L + + E  
Sbjct: 16  PGELSPDQLQQFDQAWQRQRHIEQAVVRAADEIGVVIPPSLLEEAPQALAQALDEQA 72


>gnl|CDD|227911 COG5624, TAF61, Transcription initiation factor TFIID, subunit
           TAF12 (also component of histone acetyltransferase SAGA)
           [Transcription].
          Length = 505

 Score = 28.1 bits (62), Expect = 4.6
 Identities = 6/20 (30%), Positives = 8/20 (40%)

Query: 61  QIASQQQQQQQQQHQQQQQQ 80
            +    Q QQ Q+     QQ
Sbjct: 230 LVDRYPQFQQGQKQVLSPQQ 249



 Score = 28.1 bits (62), Expect = 4.9
 Identities = 5/23 (21%), Positives = 10/23 (43%)

Query: 60  AQIASQQQQQQQQQHQQQQQQQN 82
              + ++   +  Q QQ Q+Q  
Sbjct: 223 DGRSKRRLVDRYPQFQQGQKQVL 245


>gnl|CDD|181215 PRK08059, PRK08059, general stress protein 13; Validated.
          Length = 123

 Score = 27.3 bits (61), Expect = 4.7
 Identities = 7/49 (14%), Positives = 20/49 (40%), Gaps = 14/49 (28%)

Query: 65  QQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEERQ 113
           ++++ +       +Q  NT+ +K              LE W++++ +  
Sbjct: 86  KRKKGKILIPNPSEQGFNTLRDK--------------LEEWIEQSNKTD 120


>gnl|CDD|152475 pfam12040, DUF3526, Domain of unknown function (DUF3526).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria. This domain is typically
           between 149 to 170 amino acids in length. This domain
           has a single completely conserved residue P that may be
           functionally important.
          Length = 158

 Score = 27.3 bits (61), Expect = 4.9
 Identities = 9/41 (21%), Positives = 19/41 (46%)

Query: 93  KSAQKIKPVLERWMKEAEERQVMSNMPGIISPPLISQNQNM 133
              + + P+ E   +    +Q ++   G++SP L  Q  +M
Sbjct: 59  YGDKVLAPLAEALDQALLAQQRLARRFGLLSPTLALQRASM 99


>gnl|CDD|176958 CHL00015, ndhE, NADH dehydrogenase subunit 4L.
          Length = 101

 Score = 26.9 bits (60), Expect = 5.1
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 131 QNMVNAPLNLSLNSNAVSANF 151
           +NMV A + L L  NAV+ NF
Sbjct: 25  RNMVRALMCLELILNAVNINF 45


>gnl|CDD|220815 pfam10577, UPF0560, Uncharacterized protein family UPF0560.  This
           family of proteins has no known function.
          Length = 805

 Score = 28.3 bits (63), Expect = 5.2
 Identities = 13/47 (27%), Positives = 15/47 (31%), Gaps = 3/47 (6%)

Query: 38  TEGPAYSQSAICSA---LATQMLCAAQIASQQQQQQQQQHQQQQQQQ 81
            E P   Q  +  A        L A    S  Q Q Q+   QQ   Q
Sbjct: 523 GEHPLVGQQLMEPADQQSEIGTLQAELSHSHAQPQGQRLSAQQISGQ 569


>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1112

 Score = 28.1 bits (63), Expect = 5.2
 Identities = 4/31 (12%), Positives = 14/31 (45%)

Query: 60   AQIASQQQQQQQQQHQQQQQQQNTMFEKLDI 90
             +I  Q+  + ++   +++       +K +I
Sbjct: 1017 GEIPVQEAPEIREAAPERRLDPKYRTQKEEI 1047


>gnl|CDD|197277 cd09180, PLDc_N_DEXD_b, N-terminal putative catalytic domain of
           uncharacterized prokaryotic and archeal HKD family
           nucleases fused to a DEAD/DEAH box helicase domain.
           N-terminal putative catalytic domain of uncharacterized
           prokaryotic and archeal HKD family nucleases fused to a
           DEAD/DEAH box helicase domain. All members of this
           subfamily are uncharacterized. Other characterized
           members of the superfamily that have a related domain
           architecture ( containing a DEAD/DEAH box helicase
           domain), include the DNA/RNA helicase superfamily II
           (SF2) and Res-subunit of type III restriction
           endonucleases. In addition to the helicase-like region,
           members of this subfamily also contain one copy of the
           conserved HKD motif (H-x-K-x(4)-D, where x represents
           any amino acid residue) in the N-terminal putative
           catalytic domain. The HKD motif characterizes the
           phospholipase D (PLD, EC 3.1.4.4) superfamily.  A few
           family members contain additional domains, like a
           C-terminal peptidase S24-like domain.
          Length = 142

 Score = 27.3 bits (61), Expect = 5.6
 Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 3/65 (4%)

Query: 90  ITPKSAQKIKPVLER-WMKEAEERQVMSNMPGIISPPLISQNQNMVNAPLNLSLNSNAVS 148
           I+    Q + P LE    K    R + S   GI  P  + +  +  N  + + + +N  S
Sbjct: 24  ISESGIQLLIPELEELINKGVPIRIITSTYLGITEPKALRELADYDN--VEVKIYNNIKS 81

Query: 149 ANFHP 153
            +FH 
Sbjct: 82  ESFHA 86


>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
          Provisional.
          Length = 134

 Score = 27.1 bits (60), Expect = 5.7
 Identities = 7/24 (29%), Positives = 18/24 (75%)

Query: 65 QQQQQQQQQHQQQQQQQNTMFEKL 88
          ++++QQQ++  ++Q++ NT  E +
Sbjct: 42 EEEEQQQRELAERQEKINTWLELM 65


>gnl|CDD|148108 pfam06300, Tsp45I, Tsp45I type II restriction enzyme.  This family
           consists of several type II restriction enzymes.
          Length = 261

 Score = 27.6 bits (61), Expect = 5.9
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 77  QQQQQNTMFEKLDITPKSAQKIKPVLERWMKE 108
            +   N + EKL    ++ ++I P+ +RW KE
Sbjct: 98  YEMGLNKIIEKLSEPKETNRQIGPMFKRWYKE 129


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 27.9 bits (62), Expect = 6.0
 Identities = 10/54 (18%), Positives = 23/54 (42%), Gaps = 1/54 (1%)

Query: 59  AAQIASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEER 112
             +  +++ ++Q+   Q +Q++            K A++     E   K+AEE 
Sbjct: 72  KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA-KQAEQAAKQAEEKQKQAEEA 124


>gnl|CDD|182559 PRK10573, PRK10573, type IV pilin biogenesis protein; Provisional.
          Length = 399

 Score = 27.5 bits (62), Expect = 6.7
 Identities = 8/18 (44%), Positives = 11/18 (61%)

Query: 58  CAAQIASQQQQQQQQQHQ 75
           C  Q+A QQ+ QQQ   +
Sbjct: 142 CCFQLARQQEAQQQLTKK 159


>gnl|CDD|226660 COG4197, COG4197, Uncharacterized protein conserved in bacteria,
          prophage-related [Function unknown].
          Length = 96

 Score = 26.4 bits (58), Expect = 7.0
 Identities = 12/46 (26%), Positives = 17/46 (36%), Gaps = 2/46 (4%)

Query: 16 AFKLRRLSLGLTQTQVGQ-ALSVTEGPAYSQSAICSALATQMLCAA 60
             L RL LG++   V Q      +  A    AI  A + Q+    
Sbjct: 15 QKALARL-LGVSPPSVNQWIKGRRQVAAERALAIERATSGQVSREE 59


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 27.7 bits (61), Expect = 7.2
 Identities = 6/38 (15%), Positives = 16/38 (42%)

Query: 60  AQIASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQK 97
           AQ   ++  ++Q    + QQ+ +   +  D      ++
Sbjct: 215 AQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQ 252



 Score = 27.7 bits (61), Expect = 7.9
 Identities = 10/51 (19%), Positives = 27/51 (52%)

Query: 60  AQIASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAE 110
           +Q  +++ QQ +++  ++Q   +   +K D    +A K +  + +  +EA+
Sbjct: 208 SQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAK 258


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 27.2 bits (60), Expect = 7.3
 Identities = 19/62 (30%), Positives = 28/62 (45%), Gaps = 16/62 (25%)

Query: 64  SQQQQQQQQQHQQQ------------QQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEE 111
            + +Q QQ Q QQQ            QQ++    E LDI    AQ++K   E + K  E+
Sbjct: 29  PEDEQAQQNQPQQQTVDEIEELKQKLQQKEVEAQEYLDI----AQRLKAEFENYRKRTEK 84

Query: 112 RQ 113
            +
Sbjct: 85  EK 86


>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
           C-methyltransferase; Provisional.
          Length = 390

 Score = 27.4 bits (61), Expect = 8.0
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 42  AYSQSAICSALATQMLCAAQIASQQQQQQQQQHQQQQQQQNT 83
           A +Q+A   ALA Q+    +    Q+Q+ +   +QQ +  + 
Sbjct: 62  AQNQTATNDALANQLTALQKAQESQKQELEGILKQQAKALDQ 103



 Score = 27.4 bits (61), Expect = 9.0
 Identities = 7/54 (12%), Positives = 22/54 (40%), Gaps = 4/54 (7%)

Query: 63  ASQQQQQQ----QQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEER 112
            +     Q    Q+  + Q+Q+   + ++       A + +  L + + E +++
Sbjct: 68  TNDALANQLTALQKAQESQKQELEGILKQQAKALDQANRQQAALAKQLDELQQK 121


>gnl|CDD|188064 TIGR00559, pdxJ, pyridoxine 5'-phosphate synthase.  PdxJ is
           required in the biosynthesis of pyridoxine (vitamin B6),
           a precursor to the enzyme cofactor pyridoxal phosphate.
           ECOCYC describes the predicted reaction equation as
           1-amino-propan-2-one-3-phosphate +
           deoxyxylulose-5-phosphate = pyridoxine-5'-phosphate. The
           product of that reaction is oxidized by PdxH to
           pyridoxal 5'-phosphate [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Pyridoxine].
          Length = 236

 Score = 27.1 bits (60), Expect = 8.0
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 4   GINLDEIKEFAKAFKLRRLSLG 25
           G+N   +K FA   +L  L++G
Sbjct: 190 GLNYHNVKAFAAIPELHELNIG 211


>gnl|CDD|140213 PTZ00186, PTZ00186, heat shock 70 kDa precursor protein;
           Provisional.
          Length = 657

 Score = 27.7 bits (61), Expect = 8.1
 Identities = 16/34 (47%), Positives = 17/34 (50%)

Query: 49  CSALATQMLCAAQIASQQQQQQQQQHQQQQQQQN 82
           C     Q   AA   S     +QQQ QQQQQQQN
Sbjct: 619 CGRTEYQQAAAANSGSSSNSGEQQQQQQQQQQQN 652


>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
          Length = 639

 Score = 27.5 bits (61), Expect = 8.6
 Identities = 24/117 (20%), Positives = 39/117 (33%), Gaps = 6/117 (5%)

Query: 4   GINLDEIKEFAKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQS-----AICSALATQMLC 58
           GI   +          + L L        + LS T+   +SQ+      +          
Sbjct: 72  GIVYSDNSSKTIEPDSQDLLLDKRGEHKARVLSATDDDTHSQTDDIIKQVTQKAGQDDSD 131

Query: 59  AAQIASQQQQQQQQQHQQQQQQQNTMFEKLDI-TPKSAQKIKPVLERWMKEAEERQV 114
             +  SQ Q   Q +  +  QQ     EK+D   P   +  K   +  M +A  RQ+
Sbjct: 132 QQEKNSQSQSASQAESLEHVQQSAQTSEKVDEKEPLLTKTDKQTDQTVMPDARVRQL 188


>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
          Length = 985

 Score = 27.4 bits (61), Expect = 8.7
 Identities = 12/70 (17%), Positives = 30/70 (42%), Gaps = 1/70 (1%)

Query: 74  HQQQQQQQNTM-FEKLDITPKSAQKIKPVLERWMKEAEERQVMSNMPGIISPPLISQNQN 132
           +Q+Q +      F+  D+        +    ++   A++  ++ N+  I S   I+Q   
Sbjct: 10  YQEQARAAILAVFKGQDLAEPKNYNQRSANPQFEIGADKDLLLENINNIQSKNGITQGSV 69

Query: 133 MVNAPLNLSL 142
            ++  LN+ +
Sbjct: 70  RIDDKLNIDI 79


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.125    0.342 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,112,799
Number of extensions: 1071288
Number of successful extensions: 4406
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3212
Number of HSP's successfully gapped: 352
Length of query: 256
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 161
Effective length of database: 6,723,972
Effective search space: 1082559492
Effective search space used: 1082559492
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.0 bits)