RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12708
         (695 letters)



>gnl|CDD|201301 pfam00554, RHD, Rel homology domain (RHD).  Proteins containing the
           Rel homology domain (RHD) are eukaryotic transcription
           factors. The RHD is composed of two structural domains.
           This is the N-terminal domain that is similar to that
           found in P53. The C-terminal domain has an
           immunoglobulin-like fold (See pfam01833) that binds to
           DNA.
          Length = 169

 Score =  196 bits (500), Expect = 4e-59
 Identities = 85/178 (47%), Positives = 111/178 (62%), Gaps = 13/178 (7%)

Query: 41  IEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVSKD 99
           +EI+EQPK + +RFRY+CEGRSAGSI G +ST   K++P + I NY G A + VS V+KD
Sbjct: 1   LEIVEQPKQRGMRFRYKCEGRSAGSIPGESSTRSKKTFPTVQICNYDGPAVIRVSLVTKD 60

Query: 100 SPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDI-QAFRLYLQFCQR 158
            P+R HPH LV KD C +G+    L   DM  SF NLGIQC+KKKD+ +A +  ++    
Sbjct: 61  EPHRPHPHSLVGKD-CKDGVCEVELGPEDMVASFQNLGIQCVKKKDVEEALKERIELNID 119

Query: 159 PFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDN-VIKLEPVVSDIIYD 215
           PFN    +              ++DLN VRLCFQA+LP  + N    L PVVS+ IYD
Sbjct: 120 PFNVGFEAL---------RQIKDMDLNVVRLCFQAFLPDTRGNFTTPLPPVVSNPIYD 168


>gnl|CDD|143647 cd07887, RHD-n_Dorsal_Dif, N-terminal sub-domain of the Rel
           homology domain (RHD) of the arthropod protein Dorsal.
           Proteins containing the Rel homology domain (RHD) are
           metazoan transcription factors. The RHD is composed of
           two structural sub-domains; this model characterizes the
           N-terminal RHD sub-domain of the arthropod Dorsal and
           Dif (Dorsal-related immunity factor), and similar
           proteins. Dorsal and Dif are Rel-like transcription
           factors, which play roles in mediating innate immunity
           in Drosophila. They are activated via the Toll pathway.
           Cytoplasmic Dorsal/Dif are inactivated via forming a
           complex with Cactus, the Drosophila homologue of
           mammalian I-kappa B proteins. In response to signals,
           Cactus is degraded and Dorsal/Dif can be transported
           into the nucleus, where they act as transcription
           factors. Dorsal is also an essential gene in
           establishing the proper dorsal/ventral polarity in the
           developing embryo.
          Length = 173

 Score =  189 bits (481), Expect = 2e-56
 Identities = 88/185 (47%), Positives = 117/185 (63%), Gaps = 18/185 (9%)

Query: 39  PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
           PY+ I+EQP S+ +RFRYECEGRSAGSI GANST E K++P I + NY G A +VVSCV+
Sbjct: 1   PYVRIVEQPTSRALRFRYECEGRSAGSIPGANSTSEGKTFPTIQVVNYDGRAVVVVSCVT 60

Query: 98  KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDIQ-AFRLYLQFC 156
           KD P+R HPH LV K+ C  G+    +   +M   F  LGIQC+KKKD++ + +L  +  
Sbjct: 61  KDEPFRPHPHNLVGKEGCKKGVCTKKINPTEMRIVFQKLGIQCVKKKDVEESLKLREEIN 120

Query: 157 QRPFNSEKNSAWTLLGVMGFNNRD---NLDLNAVRLCFQAYLPKDKDNV-IKLEPVVSDI 212
             PF +            GF+++D   ++DLN VRLCFQ +L  +     + L PVVSD 
Sbjct: 121 VDPFRT------------GFDHKDQINSIDLNVVRLCFQVFLEDENGRFTVPLPPVVSDP 168

Query: 213 IYDAK 217
           IYD K
Sbjct: 169 IYDKK 173


>gnl|CDD|143640 cd07827, RHD-n, N-terminal sub-domain of the Rel homology domain
           (RHD).  Proteins containing the Rel homology domain
           (RHD) are metazoan transcription factors. The RHD is
           composed of two structural sub-domains; this model
           characterizes the N-terminal sub-domain, which may be
           distantly related to the DNA-binding domain found in
           P53. The C-terminal sub-domain has an
           immunoglobulin-like fold and serves as a dimerization
           module that also binds DNA (see cd00102). The RHD is
           found in NF-kappa B, nuclear factor of activated T-cells
           (NFAT), the tonicity-responsive enhancer binding protein
           (TonEBP), and the arthropod proteins Dorsal and Relish
           (Rel).
          Length = 174

 Score =  187 bits (476), Expect = 1e-55
 Identities = 87/184 (47%), Positives = 114/184 (61%), Gaps = 15/184 (8%)

Query: 39  PYIEILEQPKSKV-RFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
           PY+EI EQPK +  RFRYECEGRSAGSI G NST + K++P + +RNY G A +VVS V+
Sbjct: 1   PYLEITEQPKQRGHRFRYECEGRSAGSIPGENSTADRKTFPTVKLRNYNGPAKIVVSLVT 60

Query: 98  KDSPYRSHPHKLVSKDNCTNGIFCATLL--EGDMTYSFTNLGIQCMKKKDI-QAFRLYLQ 154
           KD P + HPH+LV K +C +G+ C   L  + +MT SF NLGIQC++KKD+ +A    +Q
Sbjct: 61  KDDPPKPHPHQLVGKTDCRDGV-CEVRLGPKNNMTASFNNLGIQCVRKKDVEEALGQRIQ 119

Query: 155 FCQRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDNVIK-LEPVVSDII 213
               PF   K             N  ++DLN VRLCFQA++          L PV+S+ I
Sbjct: 120 LGIDPFMVHKGPE---------GNASDIDLNRVRLCFQAFIEDSDGGFTLPLPPVLSNPI 170

Query: 214 YDAK 217
           YD K
Sbjct: 171 YDKK 174


>gnl|CDD|143645 cd07885, RHD-n_RelA, N-terminal sub-domain of the Rel homology
           domain (RHD) of RelA.  Proteins containing the Rel
           homology domain (RHD) are metazoan transcription
           factors. The RHD is composed of two structural
           sub-domains; this model characterizes the N-terminal RHD
           domain of the RelA family of transcription factors,
           categorized as a class II member of the NF-kappa B
           family. In class II NF-kappa Bs, the RHD domain
           co-occurs with a C-terminal transactivation domain
           (TAD). NF-kappa B proteins are part of a protein complex
           that acts as a transcription factor, which is
           responsible for regulating a host of cellular responses
           to a variety of stimuli. This complex tightly regulates
           the expression of a large number of genes, and is
           involved in processes such as adaptive and innate
           immunity, stress response, inflammation, cell adhesion,
           proliferation and apoptosis. The cytosolic NF-kappa B
           complex is activated via phosphorylation of the
           ankyrin-repeat containing inhibitory protein I-kappa B,
           which dissociates from the complex and exposes the
           nuclear localization signal of the heterodimer (NF-kappa
           B and Rel). RelA (also called p65) forms heterodimers
           with NF-kappa B1 (p50) and B2 (p52). RelA also forms
           homodimers.
          Length = 169

 Score =  154 bits (391), Expect = 8e-44
 Identities = 83/179 (46%), Positives = 108/179 (60%), Gaps = 14/179 (7%)

Query: 39  PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
           PY+EI+EQPK + +RFRY+CEGRSAGSI G  ST   K++P I I NY G   + +S V+
Sbjct: 1   PYVEIIEQPKQRGMRFRYKCEGRSAGSIPGERSTDTTKTHPTIKINNYTGPGRVRISLVT 60

Query: 98  KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDI-QAFRLYLQFC 156
           KD P++ HPH+LV KD C +G + A L      +SF NLGIQC+KK+D+ QA    +Q  
Sbjct: 61  KDPPHKPHPHELVGKD-CKDGYYEAELSPDRCIHSFQNLGIQCVKKRDLEQAVSQRIQTN 119

Query: 157 QRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDNVIKLEPVVSDIIYD 215
             PFN           V     R + DLNAVRLCFQ  +      ++ L PV+S  IYD
Sbjct: 120 NNPFN-----------VPIEEQRADYDLNAVRLCFQVTVRDPSGRLLPLPPVLSQPIYD 167


>gnl|CDD|143643 cd07883, RHD-n_NFkB, N-terminal sub-domain of the Rel homology
           domain (RHD) of nuclear factor of kappa light
           polypeptide gene enhancer in B-cells (NF-kappa B).
           Proteins containing the Rel homology domain (RHD) are
           metazoan transcription factors. The RHD is composed of
           two structural sub-domains; this model characterizes the
           N-terminal RHD sub-domain of the NF-kappa B1 and B2
           families of transcription factors, also referred to as
           class I members of the NF-kappa B family. In class I
           NF-kappa Bs, the RHD domain co-occurs with C-terminal
           ankyrin repeats. Family members include NF-kappa B1 and
           NF-kappa B2. NF-kappa B1 is commonly referred to as p105
           or p50 (proteolytically processed form), while NF-kappa
           B2 is called p100 or p52 (proteolytically processed
           form). NF-kappa B proteins are part of a protein complex
           that acts as a transcription factor, which is
           responsible for regulating a host of cellular responses
           to a variety of stimuli. This complex tightly regulates
           the expression of a large number of genes, and is
           involved in processes such as adaptive and innate
           immunity, stress response, inflammation, cell adhesion,
           proliferation and apoptosis. The cytosolic NF-kappa B
           complex is activated via phosphorylation of the
           ankyrin-repeat containing inhibitory protein I-kappa B,
           which dissociates from the complex and exposes the
           nuclear localization signal of the heterodimer (NF-kappa
           B and REL). p105 and p100 may also act as I-kappa Bs due
           to their C-terminal ankyrin repeats.
          Length = 197

 Score =  152 bits (386), Expect = 1e-42
 Identities = 78/204 (38%), Positives = 102/204 (50%), Gaps = 32/204 (15%)

Query: 39  PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
           PY+EILEQPK +  RFRY CEG S G + GA+S    KSYP + I NY G A +VV  V+
Sbjct: 1   PYLEILEQPKQRGFRFRYGCEGPSHGGLPGASSEKNKKSYPTVKICNYQGPARIVVQLVT 60

Query: 98  KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDI----------Q 147
              P R H H LV K +C +GI    +   DMT  F NLGI  + KK++          Q
Sbjct: 61  NSEPPRLHAHSLVGK-HCEDGICTVQVGPKDMTAQFPNLGILHVTKKNVVETLEARLLAQ 119

Query: 148 AFRLYLQFCQRPFNSEKNSAWTLLGVMGFNNRDN-------------LDLNAVRLCFQAY 194
             R Y        ++E        G     + +              +DL+ VRLCFQA+
Sbjct: 120 CTRGYNPGDLVHVDAEGG------GDRQLTDEEQAEIRQKAKQQAKSMDLSVVRLCFQAF 173

Query: 195 LPKDKDN-VIKLEPVVSDIIYDAK 217
           LP    +    L+PV+SD IYD+K
Sbjct: 174 LPDSNGSFTRPLKPVISDAIYDSK 197


>gnl|CDD|143649 cd07933, RHD-n_c-Rel, N-terminal sub-domain of the Rel homology
           domain (RHD) of c-Rel.  Proteins containing the Rel
           homology domain (RHD) are metazoan transcription
           factors. The RHD is composed of two structural
           sub-domains; this model characterizes the N-terminal RHD
           sub-domain of the c-Rel family of transcription factors,
           categorized as a class II member of the NF-kappa B
           family. In class II NF-kappa Bs, the RHD domain
           co-occurs with a C-terminal transactivation domain
           (TAD). NF-kappa B proteins are part of a protein complex
           that acts as a transcription factor, which is
           responsible for regulating a host of cellular responses
           to a variety of stimuli. This complex tightly regulates
           the expression of a large number of genes, and is
           involved in processes such as adaptive and innate
           immunity, stress response, inflammation, cell adhesion,
           proliferation and apoptosis. The cytosolic NF-kappa B
           complex is activated via phosphorylation of the
           ankyrin-repeat containing inhibitory protein I-kappa B,
           which dissociates from the complex and exposes the
           nuclear localization signal of the heterodimer (NF-kappa
           B and Rel). c-Rel plays an important role in B cell
           proliferation and survival.
          Length = 172

 Score =  145 bits (367), Expect = 2e-40
 Identities = 75/180 (41%), Positives = 106/180 (58%), Gaps = 13/180 (7%)

Query: 39  PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
           PY+EI EQP+ + +RFRY+CEGRSAGSI G  ST   ++YP I I NY G   + ++ V+
Sbjct: 1   PYVEIFEQPRQRGMRFRYKCEGRSAGSIPGERSTDNNRTYPSIQILNYTGKGKVRITLVT 60

Query: 98  KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDIQ-AFRLYLQFC 156
           K+ PY+ HPH LV KD C +G + A         +F NLGIQC+++++++ A  L +   
Sbjct: 61  KNEPYKPHPHDLVGKD-CRDGYYEAEFGPERRVLAFQNLGIQCVRRREVKEAIMLRISRG 119

Query: 157 QRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDN-VIKLEPVVSDIIYD 215
             PFN  +     +         +  DLN VRLCFQ +LP +  N    L P+VS+ IYD
Sbjct: 120 INPFNVPEEQLLQI---------EEYDLNVVRLCFQIFLPDEHGNYTTALPPIVSNPIYD 170


>gnl|CDD|143646 cd07886, RHD-n_RelB, N-terminal sub-domain of the Rel homology
           domain (RHD) of the reticuloendotheliosis viral oncogene
           homolog B (RelB) protein.  Proteins containing the Rel
           homology domain (RHD) are metazoan transcription
           factors. The RHD is composed of two structural
           sub-domains; this model characterizes the N-terminal RHD
           sub-domain of the RelB family of transcription factors,
           categorized as class II NF-kappa B family members. In
           class II NF-kappa Bs, the RHD domain co-occurs with a
           C-terminal transactivation domain (TAD). NF-kappa B
           proteins are part of a protein complex that acts as a
           transcription factor, which is responsible for
           regulating a host of cellular responses to a variety of
           stimuli. This complex tightly regulates the expression
           of a large number of genes, and is involved in processes
           such as adaptive and innate immunity, stress response,
           inflammation, cell adhesion, proliferation and
           apoptosis. The cytosolic NF-kappa B complex is activated
           via phosphorylation of the ankyrin-repeat containing
           inhibitory protein I-kappa B, which dissociates from the
           complex and exposes the nuclear localization signal of
           the heterodimer (NF-kappa B and Rel). RelB, is unable to
           homodimerize but is a potent transactivator in a
           heterodimer with NF-kappa B1 (p50) or B2 (p52). It is
           involved in the regulation of genes that play roles in
           inflammatory processes and the immune response.
          Length = 172

 Score =  144 bits (364), Expect = 6e-40
 Identities = 82/184 (44%), Positives = 106/184 (57%), Gaps = 17/184 (9%)

Query: 39  PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVG--DAHLVVSC 95
           P + I EQPK + +RFRYECEGRSAGSI+G +ST   K+ P I I+N +G  +  + V  
Sbjct: 1   PRLLITEQPKQRGMRFRYECEGRSAGSILGESSTEANKTQPAIEIQNCIGLKEVTVTVCL 60

Query: 96  VSKDSPYRSHPHKLVSKDNCTNGIFCATLL-EGDMTYSFTNLGIQCMKKKDIQ-AFRLYL 153
           V KD P+R HPH LV KD C NGI   TL       +SF+NLGIQC++K++I+ A    L
Sbjct: 61  VWKDPPHRVHPHGLVGKD-CPNGICQVTLNPHSSPRHSFSNLGIQCVRKREIEAAIETRL 119

Query: 154 QFCQRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDNVIKLEPVVSDII 213
           Q    PF +               N + +D+N VRLCFQA    D      L PV+S+ I
Sbjct: 120 QLNIDPFKAGS-----------LKNHEEVDMNVVRLCFQASYRDDDGRKDCLSPVLSEPI 168

Query: 214 YDAK 217
           YD K
Sbjct: 169 YDKK 172


>gnl|CDD|143651 cd07935, RHD-n_NFkB1, N-terminal sub-domain of the Rel homology
           domain (RHD) of nuclear factor of kappa B1 (NF-kappa
           B1).  Proteins containing the Rel homology domain (RHD)
           are metazoan transcription factors. The RHD is composed
           of two structural sub-domains; this model characterizes
           the N-terminal RHD sub-domain of the NF-kappa B1 family
           of transcription factors, a class I member of the
           NF-kappa B family. In class I NF-kappa Bs, the RHD
           domain co-occurs with C-terminal ankyrin repeats.
           NF-kappa B1 is commonly referred to as p105 or p50
           (proteolytically processed form). NF-kappa B proteins
           are part of a protein complex that acts as a
           transcription factor, which is responsible for
           regulating a host of cellular responses to a variety of
           stimuli. This complex tightly regulates the expression
           of a large number of genes, and is involved in processes
           such as adaptive and innate immunity, stress response,
           inflammation, cell adhesion, proliferation and
           apoptosis. The cytosolic NF-kappa B complex is activated
           via phosphorylation of the ankyrin-repeat containing
           inhibitory protein I-kappa B, which dissociates from the
           complex and exposes the nuclear localization signal of
           the heterodimer (NF-kappa B and REL). NF-kappa B1 is
           involved in the canonical NF-kappa B signaling pathway
           which is activated by many agonists and is essential in
           immune and inflammatory responses, as well as cell
           survival. p105 is involved in its own specific NF-kappa
           B signaling pathway which is also implicated in immune
           and inflammatory responses. p105 may also act as an
           I-kappa B due to its C-terminal ankyrin repeats. It is
           also involved in mitogen-activated protein kinase (MAPK)
           signaling as its degradation leads to the activation of
           TPL-2, a MAPK kinase kinase which activates ERK
           pathways.
          Length = 202

 Score =  117 bits (295), Expect = 4e-30
 Identities = 76/206 (36%), Positives = 97/206 (47%), Gaps = 31/206 (15%)

Query: 39  PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
           PY++ILEQPK +  RFRY CEG S G + GA+S    KSYP++ I NYVG A ++V  V+
Sbjct: 1   PYLQILEQPKQRGFRFRYVCEGPSHGGLPGASSEKNKKSYPQVKICNYVGPAKVIVQLVT 60

Query: 98  KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKD---------IQA 148
                  H H LV K +C +GI   T    DM   F NLGI  + KK           +A
Sbjct: 61  NGKNIHLHAHSLVGK-HCEDGICTVTAGPKDMVVGFANLGILHVTKKKVFETLEARMTEA 119

Query: 149 FRL-------------YLQ---FCQRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQ 192
            +              YLQ      R     +        V        +DL+ VRL F 
Sbjct: 120 CKKGYNPGLLVHPELAYLQAEGGGDRQLTEREKEIIRQAAVQ---QTKEMDLSVVRLMFT 176

Query: 193 AYLPKDKDNVIK-LEPVVSDIIYDAK 217
           A+LP       + LEPVVSD IYD+K
Sbjct: 177 AFLPDSTGGFTRRLEPVVSDAIYDSK 202


>gnl|CDD|143650 cd07934, RHD-n_NFkB2, N-terminal sub-domain of the Rel homology
           domain (RHD) of nuclear factor kappa B2 (NF-kappa B2).
           Proteins containing the Rel homology domain (RHD) are
           metazoan transcription factors. The RHD is composed of
           two structural sub-domains; this model characterizes the
           N-terminal RHD sub-domain of the NF-kappa B2 family of
           transcription factors, a class I member of the NF-kappa
           B family. In class I NF-kappa Bs, the RHD domain
           co-occurs with C-terminal ankyrin repeats. NF-kappa B2
           is commonly referred to as p100 or p52 (proteolytically
           processed form). NF-kappa B proteins are part of a
           protein complex that acts as a transcription factor,
           which is responsible for regulating a host of cellular
           responses to a variety of stimuli. This complex tightly
           regulates the expression of a large number of genes, and
           is involved in processes such as adaptive and innate
           immunity, stress response, inflammation, cell adhesion,
           proliferation and apoptosis. The cytosolic NF-kappa B
           complex is activated via phosphorylation of the
           ankyrin-repeat containing inhibitory protein I-kappa B,
           which dissociates from the complex and exposes the
           nuclear localization signal of the heterodimer (NF-kappa
           B and REL). NF-kappa B2 is involved in the alternative
           NF-kappa B signaling pathway which is activated by few
           agonists and plays an important role in secondary
           lymphoid organogenesis, maturation of B-cells, and
           adaptive humoral immunity. p100 may also act as an
           I-kappa B due to its C-terminal ankyrin repeats.
          Length = 185

 Score =  111 bits (278), Expect = 4e-28
 Identities = 69/185 (37%), Positives = 95/185 (51%), Gaps = 6/185 (3%)

Query: 39  PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
           PY+ I+EQPK +  RFRY CEG S G + GA+S    K+YP + I NYVG A + V  V+
Sbjct: 1   PYLVIIEQPKQRGFRFRYVCEGPSHGGLPGASSEKGRKTYPTVKICNYVGMARIEVDLVT 60

Query: 98  KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKK---DIQAFRLYLQ 154
              P R H H LV K    +G     +   DMT  F+NLGI  + KK   +I   +L  Q
Sbjct: 61  HTDPPRVHAHSLVGKHCNESGNCSVDVGPKDMTAQFSNLGILHVTKKNMMEILKEKLKRQ 120

Query: 155 FCQ-RPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDN-VIKLEPVVSDI 212
             +         +    L       +  +DL+ VRL F AYL     +  + L+PV+SD 
Sbjct: 121 KLRNTGPYKLTEAEERELEQEAKELKKVMDLSIVRLKFTAYLRDSNGSYTLALKPVISDP 180

Query: 213 IYDAK 217
           I+D+K
Sbjct: 181 IHDSK 185


>gnl|CDD|238582 cd01177, IPT_NFkappaB, IPT domain of the transcription factor
           NFkappaB and related transcription factors. NFkappaB is
           considered a central regulator of stress responses,
           activated by different stressful conditions, including
           physical stress, oxidative stress, and exposure to
           certain chemicals. NFkappaB blocking cell apoptosis in
           several cell types, gives it an important role in cell
           proliferation and differentiation.
          Length = 102

 Score = 98.9 bits (247), Expect = 8e-25
 Identities = 44/102 (43%), Positives = 57/102 (55%), Gaps = 4/102 (3%)

Query: 222 LTIHTLSHVSAPVVGDMKMIILCDKVNKDDIEVRFYEEQDGVVVWDER--VKTLEVHKQY 279
           L I  L   S  V G  ++ +LCDKV K+DI+VRF+EE +   VW+        +VH+QY
Sbjct: 1   LKICRLDKTSGSVKGGDEVYLLCDKVQKEDIQVRFFEEDEEETVWEAFGDFSQTDVHRQY 60

Query: 280 AIVLKTPAYKSFEINDPVHVKIQLVTKKD--ISEPYNFMLTP 319
           AIV +TP Y   +I +PV VKIQL    D   SE   F   P
Sbjct: 61  AIVFRTPPYHDPDITEPVKVKIQLKRPSDGERSESVPFTYVP 102


>gnl|CDD|143644 cd07884, RHD-n_Relish, N-terminal sub-domain of the Rel homology
           domain (RHD) of the arthropod protein Relish.  Proteins
           containing the Rel homology domain (RHD) are metazoan
           transcription factors. The RHD is composed of two
           structural sub-domains; this model characterizes the
           N-terminal RHD sub-domain of the arthropod Relish
           protein, in which the RHD domain co-occurs with
           C-terminal ankyrin repeats. Family members are sometimes
           referred to as p110 or p68 (proteolytically processed
           form). Relish is an NF-kappa B-like transcription
           factor, which plays a role in mediating innate immunity
           in Drosophila. It is activated via the Imd (immune
           deficiency) pathway, which triggers phosphorylation of
           Relish. IKK-dependent proteolytic cleavage of Relish
           (which involves Dredd) results in a smaller active form
           (without the C-terminal ankyrin repeats), which is
           transported into the nucleus and functions as a
           transactivator.
          Length = 159

 Score = 93.6 bits (233), Expect = 3e-22
 Identities = 54/186 (29%), Positives = 81/186 (43%), Gaps = 34/186 (18%)

Query: 39  PYIEILEQPKSKVRFRYECEGR-SAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
           P++ I+EQP  K RFRY+ E   + GS++G  ST   K++P + + NY G A +  S   
Sbjct: 1   PFLRIVEQPVDKFRFRYKSEMHGTHGSLLGERSTSSKKTFPTVKLCNYRGQAVIRCSLYQ 60

Query: 98  KDSPYRS-HPHKLVSKDNCTNGIFCATLL----EGDMTYSFTNLGIQCMKKKDIQAFRLY 152
            D   R  H HKLV K    + +     +    EGD    F N+GI    KK+I      
Sbjct: 61  ADDNRRKPHVHKLVGKQ-GDDDVCDPHDIEVSPEGDYVAMFQNMGIIHTAKKNI--PEEL 117

Query: 153 LQFCQRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDNVIKL-EPVVSD 211
            +                        + N++LN V L FQA+      ++  +  PV S+
Sbjct: 118 YK------------------------KKNMNLNQVVLRFQAFAVSANGHLRPICPPVYSN 153

Query: 212 IIYDAK 217
            I + K
Sbjct: 154 PINNLK 159


>gnl|CDD|238336 cd00602, IPT_TF, IPT domain of eukaryotic transcription factors
           NF-kappaB/Rel, nuclear factor of activated Tcells
           (NFAT), and  recombination signal J-kappa binding
           protein (RBP-Jkappa). The IPT domains in these proteins
           are involved in DNA binding. Most NF-kappaB/Rel proteins
           form homo- and heterodimers, while NFAT proteins are
           largely monomeric (with TonEBP being an exception).
           While the majority of sequence-specific DNA binding
           elements are found in the N-terminal domain, several are
           found in the IPT domain in loops adjacent to, and
           including, the linker region.
          Length = 101

 Score = 82.3 bits (204), Expect = 4e-19
 Identities = 45/102 (44%), Positives = 56/102 (54%), Gaps = 5/102 (4%)

Query: 222 LTIHTLSHVSAPVVGDMKMIILCDKVNKDDIEVRFYEEQDGVVVWDER--VKTLEVHKQY 279
           L I  +S +S  V G  ++ +LCDKVNK DI+V F E+  G  VW+     +  +V  Q 
Sbjct: 1   LPICRVSSLSGSVNGGDEVFLLCDKVNKPDIKVWFGEKGPGETVWEAEAMFRQEDVR-QV 59

Query: 280 AIVLKTPAYKSFEINDPVHVKIQLVTKKDI--SEPYNFMLTP 319
           AIV KTP Y +  I  PV V IQLV   D   SEP  F  TP
Sbjct: 60  AIVFKTPPYHNKWITRPVQVPIQLVRPDDRKRSEPLTFTYTP 101


>gnl|CDD|214657 smart00429, IPT, ig-like, plexins, transcription factors. 
          Length = 90

 Score = 40.9 bits (96), Expect = 1e-04
 Identities = 28/96 (29%), Positives = 31/96 (32%), Gaps = 10/96 (10%)

Query: 221 DLTIHTLSHVSAPVVGDMKMIILCDKVNKDDIEVRFYEEQDGVVVWDERVKTLEVHKQYA 280
           D  I  +S  S PV G  + I LC K  K    V                 T       A
Sbjct: 1   DPVITRISPTSGPVSGGTE-ITLCGKNLKSISVVFVEVGVGEAPC------TFSPSSSTA 53

Query: 281 IVLKTPAYKSFEINDPVHV-KIQLVTKKDISEPYNF 315
           IV KTP Y    I   V V  + L      S P  F
Sbjct: 54  IVCKTPPYH--NIPGSVPVRTVGLRNGGVPSSPQPF 87


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 37.0 bits (86), Expect = 0.005
 Identities = 19/46 (41%), Positives = 20/46 (43%), Gaps = 7/46 (15%)

Query: 516 EDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRP 561
           E   PP D   P P  A PP  P+A         PA K   PPPRP
Sbjct: 68  EPAAPPPDPEPPVPGPAGPPS-PLAPP------APARKPPLPPPRP 106



 Score = 32.3 bits (74), Expect = 0.20
 Identities = 26/88 (29%), Positives = 30/88 (34%), Gaps = 16/88 (18%)

Query: 514 CYEDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRP-PSHTKPNLATL 572
            + D  PP   P PP     PPV  V       V+PP S+   PPP P P    P     
Sbjct: 35  AHPDDPPPVGDPRPPVVDTPPPVSAVW------VLPPPSEPAAPPPDPEPPVPGP----- 83

Query: 573 QRKSTSDKFDSVSTHDDNIPPLPPKRSR 600
                              PPLPP R +
Sbjct: 84  ----AGPPSPLAPPAPARKPPLPPPRPQ 107


>gnl|CDD|238050 cd00102, IPT, Immunoglobulin-like fold, Plexins, Transcription
           factors (IPT). IPTs are also known as Transcription
           factor ImmunoGlobin (TIG) domains. They are present in
           intracellular transcription factors, cell surface
           receptors (such as plexins and scatter factor
           receptors), as well as, cyclodextrin glycosyltransferase
           and similar enzymes. Although they are involved in DNA
           binding in transcription factors, their function in
           other proteins is unknown. In these transcription
           factors, IPTs form homo- or heterodimers with the
           exception of the nuclear factor of activated Tcells
           (NFAT) transcription factors which are mainly monomers.
          Length = 89

 Score = 35.9 bits (83), Expect = 0.008
 Identities = 23/95 (24%), Positives = 31/95 (32%), Gaps = 11/95 (11%)

Query: 222 LTIHTLSHVSAPVVGDMKMIILCDKVNKD-DIEVRFYEEQDGVVVWDERVKTLEVHKQYA 280
             I ++S  S PV G  ++ I         ++ V F                L V    A
Sbjct: 1   PVITSISPSSGPVSGGTEVTITGSNFGSGSNLRVTFGGGVP--------CSVLSVS-STA 51

Query: 281 IVLKTPAYKSFEINDPVHVKIQLVTKKDISEPYNF 315
           IV  TP Y +     PV V +        S P  F
Sbjct: 52  IVCTTPPYAN-PGPGPVEVTVDRGNGGITSSPLTF 85


>gnl|CDD|238583 cd01178, IPT_NFAT, IPT domain of the NFAT family of transcription
           factors. NFAT transcription complexes are a target of
           calcineurin, a calcium dependent phosphatase, and
           activate genes mainly involved in cell-cell-interaction.
          Length = 101

 Score = 35.9 bits (83), Expect = 0.009
 Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 4/92 (4%)

Query: 231 SAPVVGDMKMIILCDKVNKDDIEVRFYEE-QDGVVVWDERVKT-LEVHKQYAIVLKTPAY 288
           S  V G  ++ +      KD  +V F E+ QDG   W+       E   Q  +V++ P Y
Sbjct: 11  SCSVNGGEELFLTGKNFLKDS-KVVFQEKGQDGEAQWEAEATIDKEKSHQNHLVVEVPPY 69

Query: 289 KSFEINDPVHVKIQLVT-KKDISEPYNFMLTP 319
            +  +  PV V+  +V  K+  S+P  F  TP
Sbjct: 70  HNKHVAAPVQVQFYVVNGKRKRSQPQTFTYTP 101


>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 694

 Score = 36.8 bits (85), Expect = 0.047
 Identities = 22/102 (21%), Positives = 32/102 (31%), Gaps = 16/102 (15%)

Query: 510 SGGECYEDVLPPQDIPTPP-PEVATPPVKPVALSNVKVVIPPASKVTTPP--PRPPSHTK 566
               C +  L P     P  PE ++ PV PVA +       P  +  T P      SH  
Sbjct: 329 PTAVCTDAGLLPDTPLLPDAPEGSSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHAD 388

Query: 567 PNLATLQRKSTSDKFDSVSTHDDNIPPLPPKRSRKSGLDKNL 608
           P               +  T    +    PK +    L++ L
Sbjct: 389 PA-------------VAQPTQAATLAGAAPKGATHLMLEETL 417


>gnl|CDD|143648 cd07927, RHD-n_NFAT_like, N-terminal sub-domain of the Rel homology
           domain (RHD) of nuclear factor of activated T-cells
           (NFAT) proteins and similar proteins.  Proteins
           containing the Rel homology domain (RHD) are metazoan
           transcription factors. The RHD is composed of two
           structural sub-domains; this model characterizes the
           N-terminal RHD sub-domain of the NFAT family of
           transcription factors. NFAT transcription complexes are
           a target of calcineurin, a calcium dependent
           phosphatase, and activate genes that are mainly involved
           in cell-cell interaction. Upon de-phosphorylation of the
           nuclear localization signal, NFAT enters the nucleus and
           acts as a transcription factor; its export from the
           nucleus is triggered by phosphorylation via export
           kinases. NFATs play important roles in mediating the
           immune response, and are found in T cells, B Cells, NK
           cells, mast cells, and monocytes. NFATs are also found
           in various non-hematopoietic cell types, where they play
           roles in development. This group also contains the
           N-terminal RHD sub-domain of the non-calcium regulated
           tonicity-responsive enhancer binding protein (TonEBP),
           also called NFAT5. Mammalian TonEBP regulates the
           expression of genes in response to tonicity. It plays a
           pivotal role in urinary concentrating mechanisms in
           kidney medulla, by triggering the accumulation of
           osmolytes that enable renal medullary cells to tolerate
           high levels of urea and salt.
          Length = 161

 Score = 34.9 bits (80), Expect = 0.051
 Identities = 26/123 (21%), Positives = 45/123 (36%), Gaps = 18/123 (14%)

Query: 39  PYIEILEQPKSKVRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVSK 98
             + I  QP+   R RYE EG S G++   ++      +P + +  Y+    L V   + 
Sbjct: 1   YELRIEVQPEPHHRARYETEG-SRGAVKAPST----GGFPTVKLHGYMEPVGLQVFIGTA 55

Query: 99  DSPYRSHPHKLVSKDNCTNGIFC-------ATLLE------GDMTYSFTNLGIQCMKKKD 145
               + H    V +        C         +LE       +MT +    GI  ++  D
Sbjct: 56  SGRLKPHAFYQVHRITGKTTTPCKEKIIGNTKVLEIPLEPKNNMTATIDCAGILKLRNAD 115

Query: 146 IQA 148
           I+ 
Sbjct: 116 IEL 118


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 36.5 bits (84), Expect = 0.063
 Identities = 14/51 (27%), Positives = 17/51 (33%)

Query: 520  PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLA 570
              +    PP +   PP             PP      PPP PP   +P LA
Sbjct: 2894 STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA 2944



 Score = 34.5 bits (79), Expect = 0.25
 Identities = 23/99 (23%), Positives = 33/99 (33%), Gaps = 5/99 (5%)

Query: 519  LPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPP-----SHTKPNLATLQ 573
            LPP     P      P   P +L     V P       PP R P     +  +P +  L 
Sbjct: 2828 LPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887

Query: 574  RKSTSDKFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPVP 612
            R + S   +S +   D     P  ++      +  PP P
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPP 2926



 Score = 33.4 bits (76), Expect = 0.60
 Identities = 18/78 (23%), Positives = 27/78 (34%), Gaps = 5/78 (6%)

Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPP--RPPSHTKPNLATLQRKSTS 578
           P  +PTP P        P   + +    P +     PPP  +PP+            +T 
Sbjct: 412 PASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATR 471

Query: 579 DKFDSVSTHDDNIPPLPP 596
              D++       PP PP
Sbjct: 472 KALDALRERR---PPEPP 486



 Score = 33.0 bits (75), Expect = 0.76
 Identities = 19/55 (34%), Positives = 21/55 (38%), Gaps = 2/55 (3%)

Query: 519  LPPQDIPTPPPEVATPP--VKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLAT 571
            LP    P  PP     P    P A S    + PP S   T PP PP    P+L  
Sbjct: 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852



 Score = 32.2 bits (73), Expect = 1.3
 Identities = 28/141 (19%), Positives = 36/141 (25%), Gaps = 24/141 (17%)

Query: 520  PPQDIPTPPPEVATPPVKPV---------ALSNVKVVIPPASKVTTPPPR--PPSHTKPN 568
            P       PP V   P  P            +      PPA+    PP R   P+    +
Sbjct: 2733 PALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792

Query: 569  LATLQRKSTSDKFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPVPKYKSFFHKLFFKSKSK 628
             +     S  D  D  +        LPP  S    L       P              + 
Sbjct: 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP-------------TAP 2839

Query: 629  SKPATPRERSLSIDSRKVGGG 649
              P  P   SL +      GG
Sbjct: 2840 PPPPGPPPPSLPLGGSVAPGG 2860



 Score = 31.8 bits (72), Expect = 1.7
 Identities = 18/92 (19%), Positives = 26/92 (28%), Gaps = 7/92 (7%)

Query: 520  PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSD 579
            P    P PP   A  P  P          P       PP RP     P   +  R++   
Sbjct: 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL 2670

Query: 580  KFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPV 611
               +           PP+R R+      +  +
Sbjct: 2671 GRAAQ-------ASSPPQRPRRRAARPTVGSL 2695



 Score = 29.9 bits (67), Expect = 5.9
 Identities = 19/81 (23%), Positives = 29/81 (35%), Gaps = 7/81 (8%)

Query: 520  PPQDIPTPPPEVATPPVKPVALSNVKVVIP---PASKVTTPPPRP----PSHTKPNLATL 572
             P   PT  P  A  P   V    +  ++P      +   P P P    P+ + P L   
Sbjct: 2940 QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999

Query: 573  QRKSTSDKFDSVSTHDDNIPP 593
                 S    S++ H++  PP
Sbjct: 3000 SLSRVSSWASSLALHEETDPP 3020



 Score = 29.5 bits (66), Expect = 7.4
 Identities = 14/77 (18%), Positives = 20/77 (25%), Gaps = 3/77 (3%)

Query: 520  PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSD 579
            P +   +P  + A P   PV       V         PP +P    +P      +     
Sbjct: 2867 PSR---SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923

Query: 580  KFDSVSTHDDNIPPLPP 596
                        PP P 
Sbjct: 2924 PPPPQPQPPPPPPPRPQ 2940



 Score = 29.5 bits (66), Expect = 7.9
 Identities = 10/72 (13%), Positives = 20/72 (27%), Gaps = 2/72 (2%)

Query: 525  PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSV 584
             + PP+          + ++  +  P     TP P P +            +      ++
Sbjct: 2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735

Query: 585  STHDDNIPPLPP 596
                   PP  P
Sbjct: 2736 PA--APAPPAVP 2745



 Score = 29.1 bits (65), Expect = 9.9
 Identities = 11/49 (22%), Positives = 13/49 (26%), Gaps = 1/49 (2%)

Query: 520  PPQDIPTPPPEVATPPVK-PVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
            PP   P P P         P   +  +   P       PP  P     P
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 35.1 bits (81), Expect = 0.093
 Identities = 17/86 (19%), Positives = 28/86 (32%), Gaps = 13/86 (15%)

Query: 517 DVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPP-------SHTKPNL 569
           D+L   +        + P   P     ++V  PP      PPP PP       S +  +L
Sbjct: 154 DLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPP------PPPPPPPPPSLQQSTSAIDL 207

Query: 570 ATLQRKSTSDKFDSVSTHDDNIPPLP 595
              ++   S    ++       P  P
Sbjct: 208 IKERKGQRSAAGKTLVLSKPKSPEFP 233


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 33.8 bits (78), Expect = 0.18
 Identities = 21/65 (32%), Positives = 28/65 (43%), Gaps = 15/65 (23%)

Query: 517 DVLPP--QDIPTPPPE----------VATPPVKPVALS--NVKVVIPPASKVTTPPPRPP 562
           D++P   Q +PT PPE           A P + P  ++  N  V   PA  V  P P+P 
Sbjct: 54  DMMPAATQALPTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPA-PVEPPKPKPV 112

Query: 563 SHTKP 567
              KP
Sbjct: 113 EKPKP 117



 Score = 33.4 bits (77), Expect = 0.29
 Identities = 12/51 (23%), Positives = 15/51 (29%), Gaps = 3/51 (5%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVT---TPPPRPPSHTKP 567
           P   +   P  V  P  KPV     K       +      P P+P    K 
Sbjct: 93  PNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 32.5 bits (74), Expect = 0.23
 Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 9/81 (11%)

Query: 519 LPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTS 578
           +P    P P P +  P  + +A+       P A      PP       P  +T+  +   
Sbjct: 24  MPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPV----YPPGSTVLVQGGF 79

Query: 579 D---KFDSVSTHDDNIPPLPP 596
           D   +F   +    +IPP PP
Sbjct: 80  DAGARFGPGTG--SSIPPPPP 98


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 33.6 bits (77), Expect = 0.35
 Identities = 16/83 (19%), Positives = 21/83 (25%), Gaps = 14/83 (16%)

Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIP--PASKVTTPPPRPPSHTKPNLATLQRKSTS 578
           P   P        PP +PV  +     +P  P +      P            +  K   
Sbjct: 385 PSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKY 444

Query: 579 DKFDSVSTHDDNIPPLPPKRSRK 601
                        PP PPK   K
Sbjct: 445 T------------PPAPPKEEEK 455



 Score = 29.0 bits (65), Expect = 9.0
 Identities = 13/77 (16%), Positives = 22/77 (28%), Gaps = 2/77 (2%)

Query: 526 TPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSVS 585
            P P          A S V+    P+++            +P   T            V+
Sbjct: 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPP--RPVA 418

Query: 586 THDDNIPPLPPKRSRKS 602
               + P   PK +R +
Sbjct: 419 PPVPHTPESAPKLTRAA 435


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 33.6 bits (77), Expect = 0.39
 Identities = 17/93 (18%), Positives = 27/93 (29%), Gaps = 6/93 (6%)

Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPR-PPSHTKPNLATLQRKSTSD 579
           P          A       A +   V + P + V   PP   P   +P     ++K    
Sbjct: 402 PSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVS 461

Query: 580 KFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPVP 612
           K  S+         L P + +      N+   P
Sbjct: 462 KVSSLG-----PSTLRPIQEKAEQATGNIKEAP 489


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 32.8 bits (75), Expect = 0.43
 Identities = 16/48 (33%), Positives = 19/48 (39%), Gaps = 2/48 (4%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
           PP++ P PP E  TPP         K       K   P P+P    KP
Sbjct: 62  PPEEQPKPPTEPETPPEPTPPKPKEKPKPEK--KPKKPKPKPKPKPKP 107


>gnl|CDD|239125 cd02660, Peptidase_C19D, A subfamily of Peptidase C19. Peptidase
           C19 contains ubiquitinyl hydrolases. They are
           intracellular peptidases that remove ubiquitin molecules
           from polyubiquinated peptides by cleavage of isopeptide
           bonds. They hydrolyze bonds involving the carboxyl group
           of the C-terminal Gly residue of ubiquitin. The purpose
           of the de-ubiquitination is thought to be editing of the
           ubiquitin conjugates, which could rescue them from
           degradation, as well as recycling of the ubiquitin. The
           ubiquitin/proteasome system is responsible for most
           protein turnover in the mammalian cell, and with over 50
           members, family C19 is one of the largest families of
           peptidases in the human genome.
          Length = 328

 Score = 33.1 bits (76), Expect = 0.43
 Identities = 10/34 (29%), Positives = 16/34 (47%)

Query: 380 RVDKSVNINMKPHVTFSQHDTDMETNQVPDVLFD 413
            V   + +NM P+ + S  DT    +  PD  +D
Sbjct: 241 YVQFPLELNMTPYTSSSIGDTQDSNSLDPDYTYD 274


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 32.9 bits (75), Expect = 0.62
 Identities = 37/195 (18%), Positives = 56/195 (28%), Gaps = 19/195 (9%)

Query: 433 KDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNESSVTIEEVHEFPKEHLGQRTEGYGS 492
            D +  + D  +I    NA        +L    NES   I    +F + ++ Q  +    
Sbjct: 178 ADDVVNRADPNIITRIKNAEPK-TIATELVQESNESEPVITNEQDFDRLNIDQLVDYINK 236

Query: 493 YSSLQLAMKNQYFID--RNSGGECYEDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPP 550
            +  Q        +D  RN     +     P   P P     TP   P    +    +  
Sbjct: 237 NNDGQFNFDAHDSVDDVRNFAKTVWRQKNTPAQTPMPERSWQTPAQTPARRISTP--MTE 294

Query: 551 ASKVTTPPPRPPSHTKPNLAT------------LQRKSTSDKFDSVST--HDDNIPPLPP 596
             K    P + P+    +               L+ +  SD F   S    D       P
Sbjct: 295 EIKSWQTPLQTPAMYSSDYQAPKPEPIYTWEELLRERFPSDLFAISSLPDSDSEASDSGP 354

Query: 597 KRSRKSGLDKNLPPV 611
            R RK      LP  
Sbjct: 355 TRKRKRRRVPPLPEY 369


>gnl|CDD|177697 PLN00067, PLN00067, PsbP domain-containing protein 6; Provisional.
          Length = 263

 Score = 32.2 bits (73), Expect = 0.70
 Identities = 25/103 (24%), Positives = 38/103 (36%), Gaps = 8/103 (7%)

Query: 532 ATPPVKPVALSNVKVVIPPASKVTTPPP--RPPSHTKPNLATLQRKSTSDKFDSVSTHDD 589
           AT  + P++LS        AS   + P         +  +   +R+              
Sbjct: 2   ATASLSPLSLSFSVSSSTSASSSASSPLAVASSVSPRAAVVIHRRELL---LGLALAPLI 58

Query: 590 NIPPLPPKRSRKSGLDKNLPPVPKYKSFFHKLFFKSKSKSKPA 632
            I P PP  +R+  +   LPP P   SF   + FK+  K  PA
Sbjct: 59  LIAPEPPAEAREVEVGSYLPPSPSDPSF---VLFKASPKDTPA 98


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 32.6 bits (75), Expect = 0.72
 Identities = 17/83 (20%), Positives = 25/83 (30%), Gaps = 4/83 (4%)

Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPN-LATLQRKSTSDKFDS 583
               P        P A +         +   TPPP PP+ T+ +  A  Q  + S     
Sbjct: 520 TAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPP 579

Query: 584 VSTHDDNIPPLPPKRSRKSGLDK 606
           +       P      S    +DK
Sbjct: 580 IPEEPTPSPTKD---SSPEEIDK 599



 Score = 30.7 bits (70), Expect = 3.1
 Identities = 13/53 (24%), Positives = 18/53 (33%), Gaps = 3/53 (5%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATL 572
                P       TP   P   S     IP  ++ T P P PP++      +L
Sbjct: 390 IQPSAPKTKQAATTPSPPPAKASPP---IPVPAEPTEPSPTPPANAANAPPSL 439


>gnl|CDD|173133 PRK14670, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 574

 Score = 32.2 bits (73), Expect = 0.93
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 4/78 (5%)

Query: 407 VPDVLFDTKEKIDERFEELLNEVSNLKDTL--EEKNDFLLIED--SNNAVITLDELDDLN 462
           VPD +   K+   +   +L+NE+ N K  +  +E  + + I +   +NA + L E ++  
Sbjct: 282 VPDKIHIFKKIDTKNITKLINELKNTKTEIIYKETKEIIKIMEMAISNAKLALREYENEK 341

Query: 463 NNVNESSVTIEEVHEFPK 480
           N   ES   I E+ + PK
Sbjct: 342 NKALESLKIILEMDKLPK 359


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.4 bits (73), Expect = 1.1
 Identities = 25/105 (23%), Positives = 32/105 (30%), Gaps = 19/105 (18%)

Query: 521 PQDIPTPP---PEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKST 577
           P  I  PP      + PP  P A   V     P +    P P+ PS       +L    +
Sbjct: 178 PPSIQVPPGAALAPSAPPPTPSA-QAVPPQGSPIAAQPAPQPQQPSPL-----SLISAPS 231

Query: 578 SDKFDSVSTHDDNIP----------PLPPKRSRKSGLDKNLPPVP 612
                  S H    P          P P  R  +S      PP+P
Sbjct: 232 LHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMP 276



 Score = 30.8 bits (69), Expect = 3.5
 Identities = 23/89 (25%), Positives = 34/89 (38%), Gaps = 5/89 (5%)

Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSV 584
           P  PP +  PP    AL+       P+++   PP   P   +P     Q    S    + 
Sbjct: 175 PQGPPSIQVPPG--AALAPSAPPPTPSAQ-AVPPQGSPIAAQPAPQPQQPSPLSLI-SAP 230

Query: 585 STHDDNIP-PLPPKRSRKSGLDKNLPPVP 612
           S H   +P P PP + + +      PP P
Sbjct: 231 SLHPQRLPSPHPPLQPQTASQQSPQPPAP 259



 Score = 30.4 bits (68), Expect = 4.3
 Identities = 23/102 (22%), Positives = 35/102 (34%), Gaps = 12/102 (11%)

Query: 519 LPPQDIPTPPPEVATPPVKPVALSN--VKVVIPPASKVTTPPPRPPSHTKPNLATLQRKS 576
           LPP+    P   + + P +     +      +P       PPP  P+ T          S
Sbjct: 431 LPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAI----GPPPSLPTSTPAAPPRASSGS 486

Query: 577 TSDKFDSVSTHDD--NIPPLPPKRSRKSGLDK----NLPPVP 612
                   S+       PPLPP + ++  LD+      PP P
Sbjct: 487 QPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPP 528


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 31.7 bits (72), Expect = 1.2
 Identities = 14/77 (18%), Positives = 22/77 (28%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSD 579
             ++ P P  E    P++ +       V          PP      KP  A ++    S 
Sbjct: 110 LGEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSM 169

Query: 580 KFDSVSTHDDNIPPLPP 596
                +        LPP
Sbjct: 170 AIAVPAIDTPVTLELPP 186



 Score = 29.4 bits (66), Expect = 6.3
 Identities = 14/64 (21%), Positives = 19/64 (29%), Gaps = 6/64 (9%)

Query: 520 PPQDIPT--PPPEVATPPVKPVA----LSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQ 573
           PP    T  P P     PV  +A      +  V +        PPP  P  +   +    
Sbjct: 148 PPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPPPPVVPQPSTMVVHRRS 207

Query: 574 RKST 577
           R   
Sbjct: 208 RIKR 211



 Score = 29.0 bits (65), Expect = 7.7
 Identities = 15/54 (27%), Positives = 18/54 (33%), Gaps = 2/54 (3%)

Query: 516 EDVLPPQDIPTPPPEVATPPVK-PVALSNVKVVIP-PASKVTTPPPRPPSHTKP 567
           E+VL P     P  E    P   PV      + I  PA         PP+   P
Sbjct: 138 EEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPP 191


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 32.2 bits (73), Expect = 1.2
 Identities = 16/73 (21%), Positives = 25/73 (34%), Gaps = 5/73 (6%)

Query: 519 LPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTS 578
            P  D+P P       P KP A       +P  +   TP  +PP       A L     +
Sbjct: 394 APEPDLPQPD--RHPGPAKPEAPGARPAELPSPASAPTPEQQPPV---ARSAPLPPSPQA 448

Query: 579 DKFDSVSTHDDNI 591
               +V++    +
Sbjct: 449 SAPRNVASGKPGV 461


>gnl|CDD|236327 PRK08661, PRK08661, prolyl-tRNA synthetase; Provisional.
          Length = 477

 Score = 31.6 bits (73), Expect = 1.3
 Identities = 27/86 (31%), Positives = 36/86 (41%), Gaps = 18/86 (20%)

Query: 401 DMETNQVPDVLF--DTKEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDEL 458
           D+E N V  VL   DT EK     +EL+ +V    + LEE  + L       A       
Sbjct: 355 DLENNTV--VLVRRDTLEKETVPLDELVEKV---PELLEEIQENLY----EKA------K 399

Query: 459 DDLNNNVNESSVTIEEVHEFPKEHLG 484
           + L  N  E   T+EE  E  +E  G
Sbjct: 400 EFLEENTVEVD-TLEEFKEAIEEKGG 424


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 31.0 bits (70), Expect = 1.4
 Identities = 33/173 (19%), Positives = 58/173 (33%), Gaps = 23/173 (13%)

Query: 409 DVLFDTKEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNES 468
           D L   K++I    E L +E  +LK+  +E  + LL E   +     +EL  ++ ++N+ 
Sbjct: 11  DELEKLKDEIKAEEEALESEEKHLKEYDKEM-EELLEEKMQHV----EELRQIHADINDM 65

Query: 469 SVTIEEVHEFPKEHLGQRTEGYGSYSSLQLAMKNQYFIDRNSGGECYEDVLP-------- 520
              I++     +  +G+    +G Y         +  I+     E   + LP        
Sbjct: 66  ETEIKQSKSELERRMGKIARMHGEY------NPLKESINEMRKLELGLEELPQLHEEERQ 119

Query: 521 PQDIPTPP-PEVATPPVKP---VALSNVKVVIPPASKVTTPPPRPPSHTKPNL 569
                T P P V  P          +N         +   P      H  P  
Sbjct: 120 KISPATSPVPPVPLPDPPAFPSTLPANAAAAAAAQQQRDVPQNLINRHRMPPS 172


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 31.6 bits (72), Expect = 1.5
 Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 11/70 (15%)

Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKV-----TTPPPRPPSHT------KPNLATLQ 573
              P    +   +P  LS+     PP +        TPPP   SH+      +P   + +
Sbjct: 297 GVEPEPTGSVSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPK 356

Query: 574 RKSTSDKFDS 583
           RK   DK   
Sbjct: 357 RKREGDKKQK 366


>gnl|CDD|237455 PRK13637, cbiO, cobalt transporter ATP-binding subunit;
           Provisional.
          Length = 287

 Score = 31.2 bits (71), Expect = 1.6
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 261 DGVVVWDERVKTLEVHKQYAIVLKTPAYKSFE 292
           DGV + D++VK  ++ K+  +V + P Y+ FE
Sbjct: 67  DGVDITDKKVKLSDIRKKVGLVFQYPEYQLFE 98


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 31.8 bits (72), Expect = 1.8
 Identities = 17/58 (29%), Positives = 21/58 (36%), Gaps = 9/58 (15%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVT--TPPPRPPSHTKPNLATLQRK 575
           P  D  + PP  A PP  P A       +PP         PPR P      +A L+  
Sbjct: 906 PGGDAASAPPPGAGPPAPPQA-------VPPPRTTQPPAAPPRGPDVPPAAVAELRET 956


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 31.4 bits (71), Expect = 1.8
 Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 4/67 (5%)

Query: 548 IPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSVSTHDDN--IPPLPPKRSRKSGLD 605
           IP A     PPP       P  ++L   ++S   + +     N  +P   P +       
Sbjct: 182 IPGAI--QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPAS 239

Query: 606 KNLPPVP 612
              PP+P
Sbjct: 240 IPAPPIP 246


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 30.6 bits (69), Expect = 2.2
 Identities = 12/43 (27%), Positives = 13/43 (30%)

Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPS 563
           P   P PP   A  P             PPA +   P   PP 
Sbjct: 41  PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPP 83



 Score = 30.2 bits (68), Expect = 3.6
 Identities = 22/88 (25%), Positives = 26/88 (29%), Gaps = 24/88 (27%)

Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSV 584
           P PPP   +      A        PP +    P P  P+   P                 
Sbjct: 41  PAPPPPPPSTAAAAPA-PAAPPPPPPPAAPPAPQPDDPNAAPPPPPA------------- 86

Query: 585 STHDDNIPPLPPKRSRKSGLDKNLPPVP 612
              D N PP PP       +D N PP P
Sbjct: 87  ---DPNAPPPPP-------VDPNAPPPP 104



 Score = 29.5 bits (66), Expect = 5.7
 Identities = 10/42 (23%), Positives = 10/42 (23%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRP 561
           PP    T     A     P           P      PPP P
Sbjct: 44  PPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85



 Score = 28.7 bits (64), Expect = 9.6
 Identities = 11/48 (22%), Positives = 12/48 (25%)

Query: 515 YEDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPP 562
             D  PP   P+       P   P          P        PP PP
Sbjct: 38  NADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85


>gnl|CDD|220880 pfam10812, DUF2561, Protein of unknown function (DUF2561).  This
           family of proteins with unknown function appears to be
           restricted to Mycobacterium spp.
          Length = 207

 Score = 30.2 bits (68), Expect = 2.6
 Identities = 9/41 (21%), Positives = 11/41 (26%)

Query: 527 PPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
           PPP       +P              K + P    PS T  
Sbjct: 95  PPPRSQGLRRRPGQPVRAGYAAARTEKESVPATHAPSRTDA 135


>gnl|CDD|216325 pfam01140, Gag_MA, Matrix protein (MA), p15.  The matrix protein,
           p15, is encoded by the gag gene. MA is involved in
           pathogenicity.
          Length = 129

 Score = 29.4 bits (66), Expect = 2.8
 Identities = 13/42 (30%), Positives = 17/42 (40%)

Query: 522 QDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPS 563
           +D+   PP    P V P  +        P S   + PPRP S
Sbjct: 86  EDLAFEPPPWVKPFVDPPKVLLPSSTPKPVSPSPSAPPRPSS 127


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 29.4 bits (66), Expect = 3.0
 Identities = 15/51 (29%), Positives = 17/51 (33%), Gaps = 3/51 (5%)

Query: 520 PPQDIPTPPPEVATP-PVKPVALSNVKVVIPPASKVTTPPPR--PPSHTKP 567
           P    P  PP   +P P  P         +PPA     P P   PP    P
Sbjct: 85  PAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPP 135


>gnl|CDD|234063 TIGR02928, TIGR02928, orc1/cdc6 family replication initiation
           protein.  Members of this protein family are found
           exclusively in the archaea. This set of DNA binding
           proteins shows homology to the origin recognition
           complex subunit 1/cell division control protein 6 family
           in eukaryotes. Several members may be found in genome
           and interact with each other [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 365

 Score = 30.3 bits (69), Expect = 3.2
 Identities = 14/61 (22%), Positives = 24/61 (39%), Gaps = 14/61 (22%)

Query: 411 LFDTKEKIDER---FEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNE 467
           L  + E++        E+      L   L E+ D L        +I LDE+D L  + ++
Sbjct: 99  LRGSGEEVPTTGLSTSEVFRR---LYKELNERGDSL--------IIVLDEIDYLVGDDDD 147

Query: 468 S 468
            
Sbjct: 148 L 148


>gnl|CDD|238439 cd00862, ProRS_anticodon_zinc, ProRS Prolyl-anticodon binding
           domain, long version found predominantly in eukaryotes
           and archaea. ProRS belongs to class II aminoacyl-tRNA
           synthetases (aaRS). This alignment contains the
           anticodon binding domain, which is responsible for
           specificity in tRNA-binding, so that the activated amino
           acid is transferred to a ribose 3' OH group of the
           appropriate tRNA only, and an additional C-terminal
           zinc-binding domain specific to this subfamily of aaRSs.
          Length = 202

 Score = 30.0 bits (68), Expect = 3.3
 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 3/44 (6%)

Query: 401 DMETNQVPDVLFDTKEKIDERFEELLNEVSNLKDTLEEKNDFLL 444
           D+E N V  V  DT EK      EL+ +V    + L+E  + L 
Sbjct: 78  DLEKNTVVIVRRDTGEKKTVPLAELVEKV---PELLDEIQEDLY 118


>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein.  This is is a family of
           Papillomavirus proteins, E4, coded for by ORF4. A splice
           variant, E1--E4, exists but neither the function of E4
           or E1--E4 is known.
          Length = 95

 Score = 28.4 bits (64), Expect = 3.3
 Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 8/50 (16%)

Query: 555 TTPPPRPPSHTKPNLATLQRKST----SDKFDSVSTHDDNIPPLPPKRSR 600
           TTPP  PP    P     +++      SD  DS ST   + PP P   + 
Sbjct: 20  TTPPRPPP---PPPPWAPKKRCRRRLESD-SDSSSTTSPSSPPTPQPTTE 65


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 30.5 bits (69), Expect = 3.5
 Identities = 8/48 (16%), Positives = 13/48 (27%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
           P        P     P +P A +     +  A+    P   P +    
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASA 413



 Score = 30.1 bits (68), Expect = 4.8
 Identities = 10/52 (19%), Positives = 13/52 (25%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLAT 571
            P   P      A  P    A +      PPA+    P   P +        
Sbjct: 388 APAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPA 439



 Score = 29.7 bits (67), Expect = 6.0
 Identities = 11/41 (26%), Positives = 15/41 (36%), Gaps = 1/41 (2%)

Query: 528 PPEVATPPVKPVALSNV-KVVIPPASKVTTPPPRPPSHTKP 567
            PE A P   PVA +          +   + P  PP+   P
Sbjct: 383 RPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPP 423


>gnl|CDD|216597 pfam01601, Corona_S2, Coronavirus S2 glycoprotein.  The coronavirus
           spike glycoprotein forms the characteristic 'corona'
           after which the group is named. The Spike glycoprotein
           is translated as a large polypeptide that is
           subsequently cleaved to S1 pfam01600 and S2.
          Length = 609

 Score = 30.3 bits (69), Expect = 3.5
 Identities = 17/87 (19%), Positives = 38/87 (43%), Gaps = 16/87 (18%)

Query: 393 VTFSQHDTDMETNQVPDVLFDTKEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAV 452
           V +      +    +PD   D  +++D+ F+ L + V +L            ++  N  +
Sbjct: 462 VNYVNITNTVLPPIIPD-YIDFNKELDKWFKNLNSTVPDLD-----------LDKFNATI 509

Query: 453 ITL----DELDDLNNNVNESSVTIEEV 475
           + L    D L ++  N+N+S + +E +
Sbjct: 510 LNLTGEIDRLQEVIKNLNDSLIDLEWL 536


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 30.4 bits (68), Expect = 4.2
 Identities = 12/62 (19%), Positives = 18/62 (29%), Gaps = 4/62 (6%)

Query: 506 IDRNSGGECYEDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIP----PASKVTTPPPRP 561
             R    +     +P +   T  P V      P  +S      P        V   PP+P
Sbjct: 381 RQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQP 440

Query: 562 PS 563
            +
Sbjct: 441 TN 442


>gnl|CDD|223030 PHA03271, PHA03271, envelope glycoprotein C; Provisional.
          Length = 490

 Score = 30.0 bits (67), Expect = 4.8
 Identities = 19/56 (33%), Positives = 26/56 (46%), Gaps = 3/56 (5%)

Query: 528 PPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHT---KPNLATLQRKSTSDK 580
            P++ATPPV     S++     P ++V   PP   S T   K  + T   KS S K
Sbjct: 31  NPDLATPPVLAFNPSSISADGGPLAEVPASPPTELSTTVATKTAVPTTGEKSESHK 86


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 29.8 bits (67), Expect = 4.9
 Identities = 16/78 (20%), Positives = 23/78 (29%), Gaps = 1/78 (1%)

Query: 495 SLQLAMKNQYFIDRNSGGECYEDVLPPQDIPTPPPEVATPPVKPVA-LSNVKVVIPPASK 553
           SL+LA+ +       +  E    V PP   P    +      K V  L +       A+ 
Sbjct: 330 SLELALLHALLALGGAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAG 389

Query: 554 VTTPPPRPPSHTKPNLAT 571
              P   P     P    
Sbjct: 390 APLPDFDPRPRGPPAPEP 407


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 29.9 bits (67), Expect = 5.1
 Identities = 17/53 (32%), Positives = 22/53 (41%), Gaps = 3/53 (5%)

Query: 518 VLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLA 570
           V+ P+ +PTPP    TPP              PA  +  PPP P  H    +A
Sbjct: 159 VVGPRPVPTPPG--GTPPPDDDEGDEAGAPATPAPPLH-PPPAPHPHPIAEVA 208


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 29.9 bits (66), Expect = 5.1
 Identities = 14/38 (36%), Positives = 15/38 (39%), Gaps = 1/38 (2%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTP 557
           PP     PP + A PP K  A    K   PPA     P
Sbjct: 263 PPAKAAAPPAKAAAPPAKAAA-PPAKAAAPPAKAAAAP 299



 Score = 29.1 bits (64), Expect = 8.9
 Identities = 17/47 (36%), Positives = 20/47 (42%), Gaps = 5/47 (10%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPR---PPS 563
            P     PP + A PP K  A    K   PPA K   PP +   PP+
Sbjct: 249 APAKAAAPPAKAAAPPAKAAA-PPAKAAAPPA-KAAAPPAKAAAPPA 293


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 29.9 bits (67), Expect = 5.4
 Identities = 12/52 (23%), Positives = 13/52 (25%)

Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLAT 571
           P     T PP    P       +      P A     PP        P  AT
Sbjct: 222 PTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANAT 273



 Score = 29.5 bits (66), Expect = 6.7
 Identities = 16/79 (20%), Positives = 24/79 (30%), Gaps = 17/79 (21%)

Query: 517 DVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKS 576
           DV  P   P P P     P         +    P++  + P    P+     +A  Q  +
Sbjct: 201 DVFVPA-TPRPTPRTTASP---------ETTPTPSTTTSPPSTTIPA-PSTTIAAPQAGT 249

Query: 577 TSDKFDSVSTHDDNIPPLP 595
           T +            PP P
Sbjct: 250 TPE------AEGTPAPPTP 262


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 29.4 bits (66), Expect = 5.7
 Identities = 13/47 (27%), Positives = 16/47 (34%)

Query: 527 PPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQ 573
           PPP  A PP  P A          +   T   P P    K +   L+
Sbjct: 155 PPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLK 201


>gnl|CDD|179379 PRK02155, ppnK, NAD(+)/NADH kinase family protein; Provisional.
          Length = 291

 Score = 29.5 bits (67), Expect = 6.4
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 535 PVKPVALSNVKVVIPPASKVT 555
           P+ P  LSN  +V+P  S+V 
Sbjct: 210 PIAPHTLSNRPIVLPDDSEVA 230


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 29.8 bits (66), Expect = 6.9
 Identities = 20/67 (29%), Positives = 28/67 (41%), Gaps = 15/67 (22%)

Query: 558 PPRPPSHTKPNLATLQRKSTSDKFDSVSTHD------------DNIPPLPPKRSRKSGLD 605
           P RPP         L+RKS+ D  D +   +            D++PP  P+  R + L 
Sbjct: 783 PDRPPDDDFETRKGLKRKSSEDHADPIPEGNATKKTCGLQGLPDSLPPAVPETDRDNPL- 841

Query: 606 KNLPPVP 612
             LPP P
Sbjct: 842 --LPPCP 846


>gnl|CDD|173403 PTZ00111, PTZ00111, DNA replication licensing factor MCM4;
           Provisional.
          Length = 915

 Score = 29.6 bits (66), Expect = 7.2
 Identities = 24/108 (22%), Positives = 42/108 (38%), Gaps = 18/108 (16%)

Query: 401 DMETNQVPDVLFDTKEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDD 460
             +  ++P +L +  E++ ERF   L   +   D LE K++    + S      L  L  
Sbjct: 71  QRDVARLPFLLDNRLEELSERFTNFLKNFTEFSDVLEFKDE----QQSKPEYTELYYLWK 126

Query: 461 LNNNVNESSVTIEEVHEFPKEHLGQRTEGYGSYSSLQLAMKNQYFIDR 508
           L N              F KE+L   + GY      ++ + + Y  D+
Sbjct: 127 LMN--------------FIKENLRDHSTGYSRILPFEVDLMHVYSFDK 160


>gnl|CDD|143642 cd07882, RHD-n_TonEBP, N-terminal sub-domain of the Rel homology
           domain (RHD) of tonicity-responsive enhancer binding
           protein (TonEBP).  Proteins containing the Rel homology
           domain (RHD) are metazoan transcription factors. The RHD
           is composed of two structural sub-domains; this model
           characterizes the N-terminal RHD sub-domain of the
           tonicity-responsive enhancer binding protein (TonEBP),
           also called NFAT5. Mammalian TonEBP regulates the
           expression of genes in response to tonicity. It plays a
           pivotal role in urinary concentrating mechanisms in
           kidney medulla, by triggering the accumulation of
           osmolytes that enable renal medullary cells to tolerate
           high levels of urea and salt.
          Length = 161

 Score = 28.6 bits (64), Expect = 7.2
 Identities = 29/121 (23%), Positives = 50/121 (41%), Gaps = 18/121 (14%)

Query: 41  IEILEQPKSKVRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVSKDS 100
           ++IL QP+++ R RY  EG S GS+   +     + +P + +  Y     L V   +   
Sbjct: 3   LKILVQPETQHRARYLTEG-SRGSVKDRSQ----QGFPTVKLEGYNKPVVLQVFVGTDSG 57

Query: 101 PYRSHPHKLVSKDNCTNGIFC-------ATLLE------GDMTYSFTNLGIQCMKKKDIQ 147
             + H      K    N   C        T++E       +MT S   +GI  ++  D++
Sbjct: 58  RVKPHGFYQACKVTGRNTTPCEEVDVEGTTVIEVPLDPTNNMTISVDCVGILKLRNADVE 117

Query: 148 A 148
           A
Sbjct: 118 A 118


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 28.9 bits (65), Expect = 7.2
 Identities = 17/66 (25%), Positives = 21/66 (31%), Gaps = 9/66 (13%)

Query: 520 PPQDIPTPPPEVATPPV-----KPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQR 574
               +  P PE    P      KPV     KV   P  +V    PRP S  +        
Sbjct: 86  KEAPVVIPKPEPKPKPKPKPKPKPVK----KVEEQPKREVKPVEPRPASPFENTAPARPT 141

Query: 575 KSTSDK 580
            ST+  
Sbjct: 142 SSTATA 147


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 29.6 bits (66), Expect = 7.3
 Identities = 22/98 (22%), Positives = 30/98 (30%), Gaps = 5/98 (5%)

Query: 520 PPQDIPTP--PPEVATPPVKPVALSNVKVVIPPASKVTTPPPR--PPSHTKPNLA-TLQR 574
           PP   P     P  AT   +P A +  +   P A+     PP   P     P  A    R
Sbjct: 706 PPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 765

Query: 575 KSTSDKFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPVP 612
              +             PP P +R R +   +  P   
Sbjct: 766 PPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 28.4 bits (63), Expect = 7.6
 Identities = 11/47 (23%), Positives = 16/47 (34%)

Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
           P  +    P VA  P K   + + +    P   V  P P+     K 
Sbjct: 79  PVPVAVVAPVVAPKPKKSQPVMSQEKTASPQKSVPAPSPKEKKKKKV 125


>gnl|CDD|236507 PRK09424, pntA, NAD(P) transhydrogenase subunit alpha; Provisional.
          Length = 509

 Score = 29.4 bits (67), Expect = 7.7
 Identities = 8/48 (16%), Positives = 14/48 (29%), Gaps = 11/48 (22%)

Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATL 572
           P PP +V+  P               A+       + P+      A +
Sbjct: 369 PPPPIQVSAAPAA-----------AAAAPAAKEEEKKPASPWRKYALM 405


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 29.3 bits (66), Expect = 7.7
 Identities = 21/92 (22%), Positives = 36/92 (39%), Gaps = 4/92 (4%)

Query: 524 IPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTS--DKF 581
            P      A+  +   ++  +  +  P+    T PP   +   P L  LQR + +  D  
Sbjct: 33  RPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAA-TTQTPQLNPLQRAAAAALDAV 91

Query: 582 DS-VSTHDDNIPPLPPKRSRKSGLDKNLPPVP 612
           +S + +H +   PLP        +  N  PVP
Sbjct: 92  ESALVSHLERQHPLPKTADPAVQIAGNFAPVP 123


>gnl|CDD|183955 PRK13292, PRK13292, trifunctional NADH dehydrogenase I subunit
           B/C/D; Provisional.
          Length = 788

 Score = 29.4 bits (66), Expect = 7.7
 Identities = 12/38 (31%), Positives = 16/38 (42%), Gaps = 5/38 (13%)

Query: 393 VTFSQHDTDMET-----NQVPDVLFDTKEKIDERFEEL 425
           V   +  TDM T       VP+VL   K++    F  L
Sbjct: 236 VRLDEGATDMLTYRCPPELVPEVLRHLKKRASSPFRRL 273


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 29.6 bits (67), Expect = 7.8
 Identities = 16/62 (25%), Positives = 24/62 (38%), Gaps = 8/62 (12%)

Query: 416 EKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNESSVTIEEV 475
           E+++E  EEL     +    L    DFL  E+        +E D+L     E   T+   
Sbjct: 387 EELEEEIEELRERFGDAPVDLGNAEDFL--EE------LREERDELREREAELEATLRTA 438

Query: 476 HE 477
            E
Sbjct: 439 RE 440


>gnl|CDD|241369 cd13215, PH-GRAM1_AGT26, Autophagy-related protein 26/Sterol
           3-beta-glucosyltransferase Pleckstrin homology (PH)
           domain, repeat 1.  ATG26 (also called
           UGT51/UDP-glycosyltransferase 51), a member of the
           glycosyltransferase 28 family, resulting in the
           biosynthesis of sterol glucoside. ATG26 in decane
           metabolism and autophagy. There are 32 known
           autophagy-related (ATG) proteins, 17 are components of
           the core autophagic machinery essential for all
           autophagy-related pathways and 15 are the additional
           components required only for certain pathways or
           species. The core autophagic machinery includes 1) the
           ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and
           ATG27), 2) the phosphatidylinositol 3-kinase complex
           (ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the
           ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7,
           ATG8, ATG10, ATG12, and ATG16). Less is known about how
           the core machinery is adapted or modulated with
           additional components to accommodate the nonselective
           sequestration of bulk cytosol (autophagosome formation)
           or selective sequestration of specific cargos (Cvt
           vesicle, pexophagosome, or bacteria-containing
           autophagosome formation). The pexophagosome-specific
           additions include the ATG30-ATG11-ATG17
           receptor-adaptors complex, the coiled-coil protein
           ATG25, and the sterol glucosyltransferase ATG26. ATG26
           is necessary for the degradation of medium peroxisomes.
           It contains 2 GRAM domains and a single PH domain. PH
           domains are only found in eukaryotes. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. PH domains also have
           diverse functions. They are often involved in targeting
           proteins to the plasma membrane, but few display strong
           specificity in lipid binding. Any specificity is usually
           determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 130

 Score = 27.9 bits (63), Expect = 8.0
 Identities = 10/16 (62%), Positives = 12/16 (75%)

Query: 189 LCFQAYLPKDKDNVIK 204
           LCF AYLPK +  V+K
Sbjct: 16  LCFFAYLPKKEGEVVK 31


>gnl|CDD|184920 PRK14956, PRK14956, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 484

 Score = 29.1 bits (65), Expect = 8.2
 Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 1/69 (1%)

Query: 542 SNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSVSTHDDNIPPLPPKRSRK 601
            N+   + P  K++TPPP     +K    T   +  + +F+S +  D N+   P     +
Sbjct: 393 KNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFES-NQQDSNLDNNPLPSKSE 451

Query: 602 SGLDKNLPP 610
           S  +     
Sbjct: 452 SQSEPPSSK 460


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 29.3 bits (66), Expect = 8.2
 Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 3/54 (5%)

Query: 415 KEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDE-LDDLNNNVNE 467
             +++E+ EEL  E+ +L+  LEE      +E+  + +  L+E L+ L + V +
Sbjct: 339 LAELEEKLEELKEELESLEAELEELEA--ELEELESRLEELEEQLETLRSKVAQ 390


>gnl|CDD|220554 pfam10076, DUF2313, Uncharacterized protein conserved in bacteria
           (DUF2313).  Members of this family of proteins comprise
           various hypothetical and putative bacteriophage tail
           proteins.
          Length = 161

 Score = 28.4 bits (64), Expect = 8.3
 Identities = 16/59 (27%), Positives = 27/59 (45%)

Query: 420 ERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNESSVTIEEVHEF 478
             +E +L    NL DTLEE+   +L + +    I++   + L   +    +TI E   F
Sbjct: 52  PDWERVLGLPDNLTDTLEERRARVLAKLNETGGISIAYFERLAAALGYGEITITENRPF 110


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 28.8 bits (65), Expect = 8.8
 Identities = 12/55 (21%), Positives = 16/55 (29%)

Query: 517 DVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLAT 571
             +      +  P VAT P   V      VV P  + V T     P+        
Sbjct: 180 APVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234


>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
           Fmp27_WPPW is a conserved domain of a family of proteins
           involved in RNA polymerase II transcription initiation.
           It contains characteristic HQR and WPPW sequence motifs.
           and is towards the C-terminal in members which contain
           Fmp27_SW pfam10305.
          Length = 470

 Score = 29.2 bits (66), Expect = 9.5
 Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 1/64 (1%)

Query: 383 KSVNINMKPHVTFSQHDTDMETNQVPDVLFDTKEKIDERFEELLNEVSNLKD-TLEEKND 441
           K    + K   + S  +   +     +    + E++ +RFEELL EV    D  ++E   
Sbjct: 312 KERLKSDKTSESESGSENISQEKDPSEKEEKSDEELLDRFEELLREVKKSDDFDVQESYL 371

Query: 442 FLLI 445
             LI
Sbjct: 372 VRLI 375


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.132    0.384 

Gapped
Lambda     K      H
   0.267   0.0705    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 34,681,660
Number of extensions: 3379736
Number of successful extensions: 3444
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3250
Number of HSP's successfully gapped: 155
Length of query: 695
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 591
Effective length of database: 6,324,786
Effective search space: 3737948526
Effective search space used: 3737948526
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 62 (27.7 bits)