RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3598
         (175 letters)



>gnl|CDD|176263 cd08049, TAF8, TATA Binding Protein (TBP) Associated Factor 8.
          The TATA Binding Protein (TBP) Associated Factor 8
          (TAF8) is one of several TAFs that bind TBP, and is
          involved in forming the Transcription Factor IID
          (TFIID) complex. TFIID is one of seven General
          Transcription Factors (GTF) (TFIIA, TFIIB, TFIID,
          TFIIE, TFIIF, and TFIID) that are involved in accurate
          initiation of transcription by RNA polymerase II in
          eukaryotes. TFIID plays an important role in the
          recognition of promoter DNA and the assembly of the
          preinitiation complex. The TFIID complex is composed of
          the TBP and at least 13 TAFs. TAFs from various species
          were originally named by their predicted molecular
          weight or their electrophoretic mobility in
          polyacrylamide gels. A new, unified nomenclature for
          the pol II TAFs has been suggested to show the
          relationship between TAF orthologs and paralogs.
          Several hypotheses are proposed for TAFs' functions,
          such as serving as activator-binding sites, involvement
          in the core-promoter recognition, or a role in the
          essential catalytic activity of the complex. The mouse
          ortholog of TAF8 is called taube nuss protein (TBN),
          and is required for early embryonic development. TBN
          mutant mice exhibit disturbances in the balance between
          cell death and cell survival in the early embryo. TAF8
          plays a role in the differentiation of preadipocyte
          fibroblasts to adipocytes; it is also required for the
          integration of TAF10 into the TAF complex. In yeast and
          human cells, TAFs have been found as components of
          other complexes besides TFIID.  TAF8 is also a
          component of a small TAF complex (SMAT), which contains
          TAF8, TAF10 and SUPT7L. Several TAFs interact via
          histone-fold motifs. The histone fold (HFD) is the
          interaction motif involved in heterodimerization of the
          core histones and their assembly into nucleosome
          octamer. TAF8 contains an H4 related histone fold
          motif, and interacts with several subunits of TFIID,
          including TBP and the histone-fold protein TAF10.
          Currently, five HF-containing TAF pairs have been
          described or suggested to exist in TFIID: TAF6-TAF9,
          TAF4-TAF12, TAF11-TAF13, TAF8-TAF10 and TAF3-TAF10.
          Length = 54

 Score = 36.1 bits (84), Expect = 7e-04
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 22 ETHKQPVTEYEAIREKAATQKRDLARALNKFIIKT 56
           T+ +  T+ + +RE+ A+Q+R   +AL +   KT
Sbjct: 20 PTYSERETDPKKLREELASQRRLAEKALLRLQQKT 54


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 37.7 bits (88), Expect = 0.001
 Identities = 16/39 (41%), Positives = 25/39 (64%)

Query: 105 KKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
           KKK+    ++K  K+  E+D++D +ED+  E  GEG EE
Sbjct: 183 KKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEE 221



 Score = 32.3 bits (74), Expect = 0.11
 Identities = 14/46 (30%), Positives = 24/46 (52%)

Query: 100 QASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
             S   +KK    K++ K+ K   +++D +EDE ++ EG G    E
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEE 221


>gnl|CDD|192573 pfam10406, TAF8_C, Transcription factor TFIID complex subunit 8
          C-term.  This is the C-terminal, Delta, part of the
          TAF8 protein. The N-terminal is generally the histone
          fold domain, Bromo_TP (pfam07524). TAF8 is one of the
          key subunits of the transcription factor for pol II,
          TFIID. TAF8 is one of the several general cofactors
          which are typically involved in gene activation to
          bring about the communication between gene-specific
          transcription factors and components of the general
          transcription machinery.
          Length = 51

 Score = 34.5 bits (80), Expect = 0.002
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 22 ETHKQPVTEYEAIREKAATQKRDLARALNKFI 53
            + +PVT+   IREK   + R   +AL K +
Sbjct: 20 PVYTEPVTDPRKIREKLVEESRLAEKALRKLL 51


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 34.6 bits (79), Expect = 0.013
 Identities = 14/40 (35%), Positives = 25/40 (62%)

Query: 104 QKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
            K ++     K  +K+  EE EE+  E+EA++EEG+G ++
Sbjct: 204 SKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 35.0 bits (80), Expect = 0.018
 Identities = 15/73 (20%), Positives = 33/73 (45%), Gaps = 6/73 (8%)

Query: 88   KDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
             +Q    +E ++ +     K      ++ K+ + E  ++ G +DE + +  E N +P   
Sbjct: 3924 NEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQP--- 3980

Query: 148  SNMVPDSEAIDNP 160
                P++E +D P
Sbjct: 3981 ---PPENEDLDLP 3990



 Score = 28.8 bits (64), Expect = 1.7
 Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 8/42 (19%)

Query: 118  KKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDN 159
            ++ V+E+EE  E+    +EE E  E PED        +AIDN
Sbjct: 4068 EENVQENEESTEDGVKSDEELEQGEVPED--------QAIDN 4101



 Score = 26.9 bits (59), Expect = 7.0
 Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 13/80 (16%)

Query: 93   EEDEDNIQASPQKKKK--PTPVKKKSKKKKVEED-----------EEDGEEDEAEEEEGE 139
            +  E+N Q  P+ +    P  +K   K+  V +D           +E+ EE +AE++E  
Sbjct: 3972 DIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPM 4031

Query: 140  GNEEPEDTSNMVPDSEAIDN 159
             +E+P + +N + +    D+
Sbjct: 4032 QDEDPLEENNTLDEDIQQDD 4051



 Score = 26.5 bits (58), Expect = 9.9
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 117  KKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPD 153
            K   V E+E    E+E ++ + E N+ PED SN + +
Sbjct: 3869 KDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNE 3905


>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
          Length = 496

 Score = 34.5 bits (79), Expect = 0.023
 Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 7/55 (12%)

Query: 105 KKKKPTP-------VKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVP 152
           +KKKPT         ++++KK   E D E  EED+   E+GE  ++  D     P
Sbjct: 71  RKKKPTSSPPKAKTTRRRTKKTDQELDPEGAEEDQEAAEDGEDYDDGIDFPYDDP 125


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 34.6 bits (79), Expect = 0.025
 Identities = 17/32 (53%), Positives = 21/32 (65%)

Query: 123 EDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDS 154
           E+EE+ EE+E EEEE E  EE E+  N  P S
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLS 895



 Score = 34.2 bits (78), Expect = 0.026
 Identities = 21/55 (38%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 116 SKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPY--LRPVKYP 168
           S++++ EE+EE+ EE+E EEEE E  E  E  S   P++      Y  L P+ +P
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIYLFLLPIVFP 917



 Score = 29.2 bits (65), Expect = 1.1
 Identities = 20/54 (37%), Positives = 26/54 (48%), Gaps = 3/54 (5%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
           +E E  I+A     K  T   +  + +   E E +G EDE E E GE  EE ED
Sbjct: 693 QEGEGEIEAKEADHKGET---EAEEVEHEGETEAEGTEDEGEIETGEEGEEVED 743



 Score = 28.0 bits (62), Expect = 3.3
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 124 DEEDGEEDEAEEEEGEGNEEPED 146
           D E+ EE+E EEEE E  EE E+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEE 884



 Score = 27.7 bits (61), Expect = 4.1
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 123 EDEEDGEEDEAEEEEGEGNEEPED 146
           + EE+ EE+E EEEE E  EE E+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEE 885


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 32.9 bits (75), Expect = 0.047
 Identities = 30/91 (32%), Positives = 38/91 (41%), Gaps = 14/91 (15%)

Query: 89  DQVFEEDEDNIQASPQKKKKPTPVKKK---SKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
           D +F ED +       KKKK TP KKK     KKK +E EE  E    E  + E NEE +
Sbjct: 36  DDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEND 95

Query: 146 D-----------TSNMVPDSEAIDNPYLRPV 165
                       T+  V   + ID   L+  
Sbjct: 96  KKVDYELPKVQNTAAEVNHEDVIDLSDLKLA 126


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 32.8 bits (75), Expect = 0.049
 Identities = 17/74 (22%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 87  PKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
            +D +F ED +       KKK+PT  +K +  KK ++ +++   +E ++ E   +++ E+
Sbjct: 34  VRDDIFPEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPE-SDDDKTEE 92

Query: 147 TSNMVPDSEAIDNP 160
             N   ++E   + 
Sbjct: 93  NENDPDNNEESGDS 106


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 32.4 bits (74), Expect = 0.061
 Identities = 9/44 (20%), Positives = 12/44 (27%)

Query: 100 QASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
           +A      + T  KKK    +    E      EA        E 
Sbjct: 112 EADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 32.6 bits (75), Expect = 0.083
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 99  IQASPQKKKKPTPVKKKSKK----KKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
           I    +KK++    +KK K     KK EE+EE+ E+ E E+EE E   E E  
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467



 Score = 29.5 bits (67), Expect = 1.0
 Identities = 18/57 (31%), Positives = 29/57 (50%)

Query: 99  IQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
           I+   +K +K    +KK KKKK    ++  EE+E E+E+ E  +E E+        E
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 28.3 bits (64), Expect = 2.0
 Identities = 18/83 (21%), Positives = 32/83 (38%), Gaps = 14/83 (16%)

Query: 78  EMLPYLCLIPKDQV-----------FEEDEDNIQASPQKKKKPTPVKKKSKKKKVEE--- 123
           E+LP+L +I K                E+E       +K  K      +  +KK EE   
Sbjct: 369 EVLPFLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKK 428

Query: 124 DEEDGEEDEAEEEEGEGNEEPED 146
           +++       ++EE E  E+ + 
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKK 451



 Score = 28.3 bits (64), Expect = 2.5
 Identities = 17/44 (38%), Positives = 29/44 (65%)

Query: 105 KKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTS 148
           KKKK    KKK ++++ E+++++ E++E EEE  E  EE E+  
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 26.8 bits (60), Expect = 7.2
 Identities = 18/45 (40%), Positives = 27/45 (60%)

Query: 103 PQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
            ++KKK     KK ++++ EE E+  EE E EEEE E  +E E+ 
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 26.8 bits (60), Expect = 7.4
 Identities = 17/57 (29%), Positives = 31/57 (54%)

Query: 90  QVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
           +   E  +  +   +K+KK      K K+++ EE++E  EE++ EEEE    E+ E+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469


>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK).  This
           family consists of several plant specific phytosulfokine
           precursor proteins. Phytosulfokines, are active as
           either a pentapeptide or a C-terminally truncated
           tetrapeptide. These compounds were first isolated
           because of their ability to stimulate cell division in
           somatic embryo cultures of Asparagus officinalis.
          Length = 74

 Score = 29.6 bits (67), Expect = 0.16
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
           PT    +S K +  E  +  E++   E EGEG EE
Sbjct: 19  PTDTGHESSKIEAAESSKVDEDEVVSECEGEGEEE 53


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 31.5 bits (71), Expect = 0.16
 Identities = 9/35 (25%), Positives = 18/35 (51%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEA 156
           +E     + DE E+ + + NE+ +DT+    D + 
Sbjct: 276 DEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDC 310


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 31.9 bits (73), Expect = 0.17
 Identities = 11/62 (17%), Positives = 29/62 (46%)

Query: 87   PKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
             + +   + +   +       + + + ++ +KKK +   ED ++ E ++ E E +E+ ED
Sbjct: 1327 SEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386

Query: 147  TS 148
              
Sbjct: 1387 DD 1388



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 11/104 (10%), Positives = 30/104 (28%), Gaps = 7/104 (6%)

Query: 36   EKAATQKRDLARALNKFIIKTEPIEHIFDEADDRFTLIVPKPEMLPYLCLIPKDQVFEED 95
             K    + +     +    K               +L   K +         K +     
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVK-------KRLEGSLAALKKKKKSEKKTARKKKSKTRV 1341

Query: 96   EDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGE 139
            +    +   +  +    KK     + ++D E  + ++ ++E+ E
Sbjct: 1342 KQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDE 1385


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.4 bits (71), Expect = 0.24
 Identities = 18/48 (37%), Positives = 30/48 (62%)

Query: 101 ASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTS 148
            SP    + +  ++  ++++ EE+EE+ EE E+EEEEGE  EE E+  
Sbjct: 430 ESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 30.9 bits (70), Expect = 0.32
 Identities = 12/48 (25%), Positives = 22/48 (45%), Gaps = 1/48 (2%)

Query: 127 DGEEDEAEEEEGEGNEEPEDTSNMVPDSEA-IDNPYLRPVKYPQRIKS 173
           DG ED++E+     +++ +  S      E    N   R ++ P R+ S
Sbjct: 31  DGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNMRRRLIRTPPRVDS 78


>gnl|CDD|217093 pfam02540, NAD_synthase, NAD synthase.  NAD synthase (EC:6.3.5.1)
           is involved in the de novo synthesis of NAD and is
           induced by stress factors such as heat shock and glucose
           limitation.
          Length = 242

 Score = 30.4 bits (69), Expect = 0.38
 Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 40  TQKRDLARALN--KFIIKTEPIEHIFDEADDRFTLIVPKPEMLPYLCLIPKDQVFEEDED 97
           TQ  +LA+ LN  + IIK  P   ++    D   L +P  E+   L  + + +  EE+  
Sbjct: 157 TQVYELAKRLNVPERIIKKPPSADLWPGQTDEDELGIPYDELDQILKGLEELKQSEEEII 216

Query: 98  NIQASPQKKKKPTPVKKKSKKKK 120
           ++    +  ++   + +KS+ K+
Sbjct: 217 DLGLPAEVVERVENLIQKSEHKR 239


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 30.4 bits (69), Expect = 0.41
 Identities = 15/24 (62%), Positives = 19/24 (79%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPE 145
           ++DEE+ +EDE EEEE E  EEPE
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPE 369



 Score = 30.4 bits (69), Expect = 0.47
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 119 KKVEEDEEDGEEDEAEEEEGEGNEEPE 145
           ++ +E+EE  +E+E EEEE     EPE
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPE 371



 Score = 28.1 bits (63), Expect = 2.8
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query: 112 VKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
           V  + K ++ E+++E+ EE+E E EE E  E P 
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375



 Score = 26.9 bits (60), Expect = 5.6
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVP 152
            E+EE+ EE    +EE EG EE E+    VP
Sbjct: 170 GEEEEEEEEVGEADEEDEGEEEEEEEPEEVP 200



 Score = 26.9 bits (60), Expect = 7.0
 Identities = 14/37 (37%), Positives = 20/37 (54%)

Query: 121 VEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAI 157
            E +  +GEE+E EEE GE +EE E       + E +
Sbjct: 163 AEVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEV 199



 Score = 26.6 bits (59), Expect = 8.5
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 119 KKVEEDEEDGEEDEAEEEEGEGNEEPE 145
             V  +++D EE++ +EEE E  EEPE
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPE 366


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 30.4 bits (68), Expect = 0.44
 Identities = 19/43 (44%), Positives = 26/43 (60%)

Query: 101 ASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
           A    KK     KK +K+ + +EDEE+ EE+E EE E E NE+
Sbjct: 115 ALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQ 157



 Score = 27.4 bits (60), Expect = 4.4
 Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 1/59 (1%)

Query: 101 ASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDN 159
           A+ Q  KK     KK+ K+   +++E+ EE+E EEEE E  E  + T+    +S  +D+
Sbjct: 114 AALQLPKKAGNAGKKATKEDESDEDEEEEEEE-EEEEAEVEENEQGTNGTSTNSTEVDH 171


>gnl|CDD|220440 pfam09848, DUF2075, Uncharacterized conserved protein (DUF2075). 
          This domain, found in various prokaryotic proteins
          (including putative ATP/GTP binding proteins), has no
          known function.
          Length = 348

 Score = 30.3 bits (69), Expect = 0.44
 Identities = 15/41 (36%), Positives = 18/41 (43%), Gaps = 5/41 (12%)

Query: 31 YEAIREKAATQKRDLARALNKFI---IKTEPIEH--IFDEA 66
          YEA+      +K+ L R    FI    K  P E   I DEA
Sbjct: 48 YEALAGDLKVRKKKLFRKPTSFINNLHKAPPHEDVVIVDEA 88


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 30.5 bits (69), Expect = 0.45
 Identities = 15/41 (36%), Positives = 21/41 (51%)

Query: 100 QASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEG 140
           Q   Q K+K T  K K  KK V++D+    E  +E +E E 
Sbjct: 12  QQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAEN 52



 Score = 28.1 bits (63), Expect = 2.9
 Identities = 14/53 (26%), Positives = 24/53 (45%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
            E+ +N     +++ K    KKK+KK K + D++D    E   E  E   +  
Sbjct: 3   NENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQN 55


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.0 bits (68), Expect = 0.51
 Identities = 16/56 (28%), Positives = 25/56 (44%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTS 148
           +E+ED  +   + K       KK+ K K+EE E+   E E +       + PED  
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDEL 93


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 30.3 bits (69), Expect = 0.51
 Identities = 13/36 (36%), Positives = 22/36 (61%)

Query: 120 KVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
            +EED++D EE + E+EE   +E+ ED  +   D +
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDD 157



 Score = 27.6 bits (62), Expect = 3.9
 Identities = 9/38 (23%), Positives = 19/38 (50%)

Query: 123 EDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNP 160
           E+++D +E+  EE+E     E ++  +   D + I   
Sbjct: 124 EEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161



 Score = 27.6 bits (62), Expect = 4.1
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 111 PVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
           P    + +   E+D++D E DE +EE  +  ++ +D
Sbjct: 114 PRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDD 149


>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
           Reviewed.
          Length = 830

 Score = 30.5 bits (69), Expect = 0.52
 Identities = 11/34 (32%), Positives = 22/34 (64%)

Query: 104 QKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEE 137
           +K K   PVK+K  +++  +D++  + D+ E+EE
Sbjct: 11  RKGKPSRPVKQKVSRRRYRDDDDYDDYDDYEDEE 44


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 30.3 bits (68), Expect = 0.53
 Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 4/71 (5%)

Query: 92  FEEDEDNIQASPQKKKKPT----PVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
             ED+D ++ S   K+K T      K+ + +K  +E EE+  E+E ++EE + +++ ED 
Sbjct: 9   ELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDE 68

Query: 148 SNMVPDSEAID 158
            +   D +  D
Sbjct: 69  DDDDDDDDEDD 79


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 30.0 bits (68), Expect = 0.53
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 112 VKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
             K  +++K   D++  E D  +E+E E +E P 
Sbjct: 180 FYKPKQQEKGAPDDDLDEYDYGDEDEEEDDEPPW 213



 Score = 26.5 bits (59), Expect = 7.2
 Identities = 17/56 (30%), Positives = 22/56 (39%), Gaps = 3/56 (5%)

Query: 95  DEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNM 150
           DE ++ A  +++    P         V   E  G E E E E     EEPE  S M
Sbjct: 106 DEADLLALMEEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEP---EEEPEKKSGM 158


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 30.1 bits (68), Expect = 0.60
 Identities = 21/84 (25%), Positives = 32/84 (38%), Gaps = 10/84 (11%)

Query: 85  LIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEP 144
           L P D     +    +A+P+ + + TP  +   +   E  E   EE E   EE       
Sbjct: 192 LGPLDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEE------- 244

Query: 145 EDTSNMVPDSEAIDNPYLRPVKYP 168
               N  P    +  PY+ P +YP
Sbjct: 245 ---ENNKPVKPPVPKPYISPDEYP 265


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 29.9 bits (67), Expect = 0.67
 Identities = 8/49 (16%), Positives = 17/49 (34%)

Query: 108 KPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEA 156
           +  P     +     + + +      EE     +E  ++T   VP+  A
Sbjct: 373 REQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAA 421


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 29.6 bits (67), Expect = 0.69
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 89  DQVFEEDEDNIQASPQKKKKPTPVKKKSKKKK 120
            + F  +E    A+ ++++     KK+SKKKK
Sbjct: 165 LEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 30.0 bits (67), Expect = 0.79
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 123 EDEEDGEEDEAEEEEGEGNEEPEDTSNM 150
           +DE++ E+D+ EE++ E  EE E+    
Sbjct: 156 DDEDEDEDDDDEEDDEEEEEEEEEIKGF 183



 Score = 27.3 bits (60), Expect = 5.1
 Identities = 3/21 (14%), Positives = 14/21 (66%)

Query: 127 DGEEDEAEEEEGEGNEEPEDT 147
           D ++++ +E++ +  ++ E+ 
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEE 174



 Score = 27.3 bits (60), Expect = 5.6
 Identities = 10/22 (45%), Positives = 17/22 (77%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEE 143
           +EDE++ ++DE ++EE E  EE
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEE 178


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 28.8 bits (65), Expect = 0.84
 Identities = 10/27 (37%), Positives = 19/27 (70%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTS 148
            +D+E+ EE+E +EE+ + +E  E+ S
Sbjct: 114 SDDDEEDEEEEDDEEDDDEDESEEEES 140



 Score = 26.1 bits (58), Expect = 8.0
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEA 156
           E++ +D EEDE EE++ E ++E E      P  + 
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 29.5 bits (67), Expect = 0.85
 Identities = 21/81 (25%), Positives = 29/81 (35%), Gaps = 21/81 (25%)

Query: 90  QVFEEDEDNIQA-----------------SPQKKKKPTPVKKKSKKKKVEED----EEDG 128
           Q  EEDE  IQA                 SP+ +      +KK+K++  EE         
Sbjct: 64  QEAEEDEQQIQALKRKYLSSPEKSPVADLSPRLEAISLSPRKKAKRRLFEEQDSGLGNSL 123

Query: 129 EEDEAEEEEGEGNEEPEDTSN 149
           EE+  +  E EG E       
Sbjct: 124 EEESTDVVEEEGVESSGGGEG 144


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 28.9 bits (65), Expect = 1.0
 Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 89  DQVFEEDEDNIQASPQKKKKPTPVKKKSKKKK 120
            + F  +E     +   KK  TP K ++KKKK
Sbjct: 165 KEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 28.0 bits (63), Expect = 1.3
 Identities = 9/50 (18%), Positives = 25/50 (50%)

Query: 102 SPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMV 151
              K + P    + + KK  +E++ED    + ++++ + ++ P+   + V
Sbjct: 39  EVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDV 88



 Score = 25.7 bits (57), Expect = 6.6
 Identities = 9/47 (19%), Positives = 24/47 (51%)

Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
           P   K ++     E+  +  E++E E++    +++ +D  + +PD +
Sbjct: 38  PEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLD 84


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 28.6 bits (64), Expect = 1.4
 Identities = 10/51 (19%), Positives = 29/51 (56%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
           E+ ++ +    +K K+        + +K EE+EE+ EE++ + ++ + +++
Sbjct: 147 EDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 28.9 bits (65), Expect = 1.5
 Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 9/66 (13%)

Query: 104 QKKKKPTPVKKKSKKKKVE-------EDEEDGEEDEAEEEEGEGNEE--PEDTSNMVPDS 154
             K+K T  K+  +   ++       ED     +DE EE+E E  EE   E  S    D 
Sbjct: 356 LNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDD 415

Query: 155 EAIDNP 160
           E  +  
Sbjct: 416 EEFEED 421



 Score = 26.6 bits (59), Expect = 8.5
 Identities = 13/41 (31%), Positives = 21/41 (51%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYL 162
           EE+EE+ +EDE   +E   +EE E+        ++  N  L
Sbjct: 396 EEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSSL 436


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 28.8 bits (64), Expect = 1.5
 Identities = 20/74 (27%), Positives = 32/74 (43%), Gaps = 5/74 (6%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVP 152
           ++DE+N+ +S    +     +K      V E     EE+E E EE E   E E    +  
Sbjct: 111 DDDEENLPSSIAPPEIDPEPRKPIVPDLVLE-----EEEEVEMEEDEEYYEKEPGKVVDE 165

Query: 153 DSEAIDNPYLRPVK 166
            SE  +   L+ +K
Sbjct: 166 KSEEEEEEELKTMK 179


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 28.5 bits (64), Expect = 1.6
 Identities = 10/60 (16%), Positives = 30/60 (50%)

Query: 88  KDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
            ++V EE+++   +  ++ K     + +  +++ EE++E+  ++  +E E +     E  
Sbjct: 62  IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121



 Score = 26.6 bits (59), Expect = 6.6
 Identities = 13/71 (18%), Positives = 31/71 (43%)

Query: 94  EDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPD 153
           +D++  +    K+++      + K+ K + ++ED E +E  EEE E + +  +       
Sbjct: 55  DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKT 114

Query: 154 SEAIDNPYLRP 164
              ++     P
Sbjct: 115 ESNVEKEITNP 125


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 28.6 bits (64), Expect = 1.7
 Identities = 10/54 (18%), Positives = 28/54 (51%), Gaps = 5/54 (9%)

Query: 107 KKPTPVKKKSKKKKVEE-----DEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
           K+P    K SK++ + +      +++G+  + +++E     + +D+ + V D +
Sbjct: 227 KQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVSDED 280


>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
          Length = 520

 Score = 28.6 bits (64), Expect = 1.8
 Identities = 13/69 (18%), Positives = 31/69 (44%), Gaps = 2/69 (2%)

Query: 88  KDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
           + +   E+ED  +A       P P   +       +D+++G++++ ++ E  G +  ED 
Sbjct: 394 RPRGPPENED--EADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFDLDEDD 451

Query: 148 SNMVPDSEA 156
              + D + 
Sbjct: 452 VYDLKDVDE 460


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 28.8 bits (65), Expect = 1.8
 Identities = 15/70 (21%), Positives = 29/70 (41%)

Query: 91  VFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNM 150
              +D D+   S   KKK     K +  K   + +   E D +++ E +   + +D  N 
Sbjct: 61  GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNY 120

Query: 151 VPDSEAIDNP 160
           V D + ++  
Sbjct: 121 VKDIDVLNQA 130



 Score = 26.9 bits (60), Expect = 7.4
 Identities = 18/166 (10%), Positives = 48/166 (28%)

Query: 2   ATSKDTNRSLLKSSVVALLAETHKQPVTEYEAIREKAATQKRDLARALNKFIIKTEPIEH 61
              +           +  +       V + +   E    +K+    A           + 
Sbjct: 37  EIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKL 96

Query: 62  IFDEADDRFTLIVPKPEMLPYLCLIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKV 121
             +    +        +    L  +    V  + +D+             +      +  
Sbjct: 97  KDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDD 156

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYLRPVKY 167
           +ED++D + D+ +EE+ E  E  + + +     +  D+  LR  + 
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARK 202


>gnl|CDD|216090 pfam00733, Asn_synthase, Asparagine synthase.  This family is
          always found associated with pfam00310. Members of this
          family catalyze the conversion of aspartate to
          asparagine.
          Length = 195

 Score = 28.4 bits (64), Expect = 1.9
 Identities = 11/54 (20%), Positives = 20/54 (37%), Gaps = 1/54 (1%)

Query: 14 SSVVALLAETHKQPVTEYEAIREKAATQKRDLARALNKFIIKTEPIEHIFDEAD 67
          S + AL A     P+  +    E +   +   A  +    + T+  E I  E +
Sbjct: 31 SLIAALAARQSDPPLKTFSVGFEGSDYDEAPYAELVADH-LGTDHHEIIVTEEE 83


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 28.7 bits (64), Expect = 2.1
 Identities = 15/34 (44%), Positives = 19/34 (55%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
            E  ++ EEDEAE  +GEG E   D +    DSE
Sbjct: 254 SEGSDESEEDEAEATDGEGEEGEMDAAEASEDSE 287


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 27.9 bits (63), Expect = 2.4
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query: 116 SKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
            ++   ++D++D +EDE +EEE +   +PE+ 
Sbjct: 49  DEEDLEDDDDDDEDEDEDDEEEADLGPDPEEA 80



 Score = 26.4 bits (59), Expect = 7.1
 Identities = 6/35 (17%), Positives = 24/35 (68%)

Query: 121 VEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
           +E + ++ + ++ ++++ + +E+ E+ +++ PD E
Sbjct: 44  IESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPE 78


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 28.4 bits (63), Expect = 2.5
 Identities = 11/53 (20%), Positives = 25/53 (47%)

Query: 94  EDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
           +   +     +++ K +P      + + +ED E+ EE++ EEE G   +  + 
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKL 341



 Score = 26.8 bits (59), Expect = 6.4
 Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 3/66 (4%)

Query: 94  EDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPD 153
            ++ + + S +KKKK    KKK    K  +   D + DE + ++G+     ED    + D
Sbjct: 233 GEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREED---YISD 289

Query: 154 SEAIDN 159
           S A  N
Sbjct: 290 SSASGN 295


>gnl|CDD|165416 PHA03145, PHA03145, helicase-primase primase subunit; Provisional.
          Length = 1058

 Score = 28.4 bits (63), Expect = 2.6
 Identities = 10/53 (18%), Positives = 18/53 (33%), Gaps = 4/53 (7%)

Query: 108 KPTPVKKKSKKKKVEEDEEDGE----EDEAEEEEGEGNEEPEDTSNMVPDSEA 156
           +P             E + DG          +EE + + + E  ++   D EA
Sbjct: 676 RPRAAAVDPGLGPDGEYDGDGPWFPAGGGPGDEEDDEDPDAEALAHGGVDEEA 728


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 27.9 bits (63), Expect = 2.6
 Identities = 12/46 (26%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 95  DEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEG 140
           D+D +    ++          +++++ EE+EE+ EE+ +EEE   G
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEE-EEEPSEEEAAAG 324


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 28.0 bits (62), Expect = 2.7
 Identities = 13/60 (21%), Positives = 26/60 (43%)

Query: 100 QASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDN 159
           +   Q++ KP   KK+ +K+K  E E +G+ +E +    +             D  ++ N
Sbjct: 58  KGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVAN 117


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 28.1 bits (62), Expect = 2.7
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 104 QKKKKPTPVKKKSKKKKVEEDEED-GEEDE-AEEEEGEGNEEPEDTSNMVPDSEAID 158
           +++K    V+K+ K KK++ED ED G E E ++ EE E     E+ S    D E  D
Sbjct: 129 EEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDD 185


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 28.0 bits (63), Expect = 2.7
 Identities = 13/69 (18%), Positives = 26/69 (37%)

Query: 87  PKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
           P     EE+E         ++K    K++  +KK E++E +    + + E      +  D
Sbjct: 391 PLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450

Query: 147 TSNMVPDSE 155
                 D +
Sbjct: 451 GETKKVDPD 459


>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
          Length = 177

 Score = 27.5 bits (61), Expect = 2.9
 Identities = 15/52 (28%), Positives = 26/52 (50%)

Query: 112 VKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYLR 163
           + ++ K ++VEE+ E  E +E  EE  E   E  +       ++  +N YLR
Sbjct: 1   MSEEIKNEEVEEEVEATETEETVEEVVEETPEKSELELANERADEFENKYLR 52


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 27.7 bits (61), Expect = 3.0
 Identities = 14/57 (24%), Positives = 32/57 (56%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSN 149
            E++ ++ A+  + K+ +   K+ ++++VE  E   EE +++  +  G E+ ED  N
Sbjct: 273 SEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDEN 329


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 27.5 bits (61), Expect = 3.4
 Identities = 23/76 (30%), Positives = 30/76 (39%), Gaps = 16/76 (21%)

Query: 67  DDRFTLIVPKPEMLPYLCLIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEE 126
           DD   L  P+    PY    P     E+DE  I+    +  +              E+ E
Sbjct: 42  DDAIELAEPETSDDPYGNPDPFG---EDDEGRIEVRISEDGEE-------------EEVE 85

Query: 127 DGEEDEAEEEEGEGNE 142
           +GEEDE EE E E  E
Sbjct: 86  EGEEDEEEEGEEESEE 101


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 27.8 bits (61), Expect = 3.8
 Identities = 17/45 (37%), Positives = 25/45 (55%)

Query: 93   EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEE 137
             E+E+ I+A+ + KK     KK  + KK EEDE+   E   +E E
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699



 Score = 27.0 bits (59), Expect = 7.8
 Identities = 14/51 (27%), Positives = 25/51 (49%)

Query: 93   EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
            E +E       +K ++   +K   + KK EED++  EE +  EE+ +   E
Sbjct: 1642 EAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692


>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
          Length = 445

 Score = 27.7 bits (62), Expect = 3.9
 Identities = 15/78 (19%), Positives = 27/78 (34%), Gaps = 18/78 (23%)

Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGN------------------EEPEDTSNM 150
             P KKK +++    +EE+ EE+      G                     +E +D+   
Sbjct: 24  ELPPKKKGRRRVSPVEEEEEEEEAEVVAVGFSYPPVRISRGKDGKRPVRPLKEEKDSEKK 83

Query: 151 VPDSEAIDNPYLRPVKYP 168
                A+ NP   P++  
Sbjct: 84  ASTEAAVRNPLSDPLESK 101


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 27.3 bits (61), Expect = 4.1
 Identities = 8/29 (27%), Positives = 17/29 (58%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNM 150
           EE+++  E ++ +EE+ +  EE +D    
Sbjct: 290 EEEDDYSESEDEDEEDEDEEEEEDDDEGD 318


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 27.5 bits (61), Expect = 4.1
 Identities = 16/41 (39%), Positives = 22/41 (53%)

Query: 105 KKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
           K+K    VK+  ++   EE EE+ E  + EE E EG E P 
Sbjct: 389 KEKGHPVVKRVVREVDEEEIEEEEEAMQPEEMEMEGFEVPA 429


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 26.2 bits (58), Expect = 4.6
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 114 KKSKKKKVEEDEEDGEEDEAEEEEGEG 140
             +++++ EE+EE+ EE+E+EEE   G
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEAMAG 99


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 26.2 bits (58), Expect = 4.7
 Identities = 19/55 (34%), Positives = 24/55 (43%)

Query: 106 KKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNP 160
           K K   VKKK KKKK +   ++    E EEEE    E          D+E I+  
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQE 67


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 27.3 bits (61), Expect = 4.7
 Identities = 14/43 (32%), Positives = 27/43 (62%)

Query: 116 SKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAID 158
            + ++ EE+EEDG +DE EE++ +  EE E+  ++  + E  +
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEE 359


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 27.3 bits (61), Expect = 4.9
 Identities = 10/41 (24%), Positives = 20/41 (48%)

Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSN 149
           P   KK  +  + EE EE+ + +     E +G ++ ++ S 
Sbjct: 220 PFFTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGST 260


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 27.5 bits (61), Expect = 5.0
 Identities = 18/71 (25%), Positives = 29/71 (40%), Gaps = 12/71 (16%)

Query: 98  NIQASPQKKKKPTPVKKKSKKKK--------VEEDEEDGEEDEAEEEEGEGNEEPEDTSN 149
           +   S +K +KP        +          ++ED +D EE +  E E    +EPE T N
Sbjct: 410 SASVSSKKNQKPQESSPSLDQTNAASRNSVMMDEDSDDDEEFQIPESE----QEPETTKN 465

Query: 150 MVPDSEAIDNP 160
              D+   + P
Sbjct: 466 ETKDTAMEEEP 476


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 27.1 bits (60), Expect = 5.4
 Identities = 18/45 (40%), Positives = 24/45 (53%), Gaps = 4/45 (8%)

Query: 92  FEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEE 136
           +E  E N QA PQ K   T  K+     +VE DEE+ + D+ E E
Sbjct: 72  YERREGN-QACPQCK---TRYKRIKGSPRVEGDEEEDDIDDLENE 112


>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
           (CLPTM1).  This family consists of several eukaryotic
           cleft lip and palate transmembrane protein 1 sequences.
           Cleft lip with or without cleft palate is a common birth
           defect that is genetically complex. The nonsyndromic
           forms have been studied genetically using linkage and
           candidate-gene association studies with only partial
           success in defining the loci responsible for orofacial
           clefting. CLPTM1 encodes a transmembrane protein and has
           strong homology to two Caenorhabditis elegans genes,
           suggesting that CLPTM1 may belong to a new gene family.
           This family also contains the human cisplatin resistance
           related protein CRR9p which is associated with
           CDDP-induced apoptosis.
          Length = 437

 Score = 26.9 bits (60), Expect = 6.2
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 112 VKKKSKKKKVEEDEEDGEEDEAEEE 136
            KKK KKK +   + + EE E E+ 
Sbjct: 149 PKKKVKKKNLLGGKSEKEEPEEEKT 173


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 26.8 bits (59), Expect = 6.7
 Identities = 12/45 (26%), Positives = 19/45 (42%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEE 137
           + D D+  +S  KK    P K++  KK   +      E+E    E
Sbjct: 133 DSDSDSSDSSKGKKLVNKPKKRQKYKKATIQSPTSLTEEENYNTE 177


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 25.3 bits (56), Expect = 6.9
 Identities = 11/24 (45%), Positives = 19/24 (79%)

Query: 123 EDEEDGEEDEAEEEEGEGNEEPED 146
           E+EE+GE+ E+E+EE E  ++ +D
Sbjct: 49  EEEEEGEDLESEDEEDEEEDDDDD 72


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 25.9 bits (56), Expect = 7.0
 Identities = 9/30 (30%), Positives = 20/30 (66%)

Query: 115 KSKKKKVEEDEEDGEEDEAEEEEGEGNEEP 144
            S   + E+D+++ EED+ E+++ + +E P
Sbjct: 13  DSDSDEEEDDDDEDEEDDDEDDDEDDDEVP 42


>gnl|CDD|178137 PLN02522, PLN02522, ATP citrate (pro-S)-lyase.
          Length = 608

 Score = 26.7 bits (59), Expect = 7.1
 Identities = 16/46 (34%), Positives = 20/46 (43%)

Query: 9   RSLLKSSVVALLAETHKQPVTEYEAIREKAATQKRDLARALNKFII 54
           RS   SS+ AL   T +      E + E    Q    ARA NK +I
Sbjct: 89  RSAAASSMEALKQPTIRVVAIIAEGVPESDTKQLIAYARANNKVVI 134


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 26.8 bits (59), Expect = 7.3
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query: 119 KKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAID 158
           + +E+ E + E D  + EE E  ++ E    +VP  +A+ 
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVR 705


>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
          Length = 131

 Score = 26.0 bits (57), Expect = 7.5
 Identities = 11/36 (30%), Positives = 24/36 (66%), Gaps = 3/36 (8%)

Query: 89  DQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEED 124
           +   +E+ED+++   +K++K    ++K +KK VEE+
Sbjct: 98  NNAIQEEEDDMEQQEEKEEKG---REKGRKKNVEEE 130


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 26.9 bits (60), Expect = 7.5
 Identities = 14/64 (21%), Positives = 22/64 (34%), Gaps = 13/64 (20%)

Query: 83  LCLIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNE 142
           L L P+           +  P     P             ED++D  EDE E+++ E   
Sbjct: 249 LVLAPRATRLPAPPQPPEEEPPPPPPP------------PEDDDDPPEDEEEQDDAED-R 295

Query: 143 EPED 146
             E+
Sbjct: 296 ALEE 299


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 26.7 bits (59), Expect = 7.6
 Identities = 15/41 (36%), Positives = 20/41 (48%), Gaps = 3/41 (7%)

Query: 121 VEEDEEDGEEDEA---EEEEGEGNEEPEDTSNMVPDSEAID 158
            E+D  D +E++    EE+E E  EE ED    V  S   D
Sbjct: 40  AEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKASPDAD 80


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 26.5 bits (59), Expect = 7.9
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 6/54 (11%)

Query: 113 KKKSKKKK-----VEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAI-DNP 160
           KKK KKKK      ++DEE+ E++  +E++        D +N+ P+ + I  NP
Sbjct: 1   KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNP 54


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 26.7 bits (59), Expect = 8.7
 Identities = 8/54 (14%), Positives = 27/54 (50%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
           ++    +     +       + K+K++K++++E + ++ E  +E+ E  ++ E 
Sbjct: 379 QDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 26.3 bits (58), Expect = 9.5
 Identities = 10/60 (16%), Positives = 30/60 (50%)

Query: 88  KDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
           + ++ +E    +Q   +K  +       S++ + +  EE  +++E E+++ E  +E ++ 
Sbjct: 367 RQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEK 426


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 26.2 bits (58), Expect = 8.7
 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 1/51 (1%)

Query: 96  EDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
           E++ + S +  K        S+   V+  E   EE+E   EE E  +EPED
Sbjct: 132 EESKKGSAETLKLEEKDSSSSEGSTVDL-EPPEEEEEEIAEEEEEVKEPED 181


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 25.7 bits (57), Expect = 8.7
 Identities = 11/32 (34%), Positives = 19/32 (59%)

Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEG 140
                  + ++K EE+EE+ E++E+EEE   G
Sbjct: 69  AAAAAAAAAEEKKEEEEEEEEKEESEEEAAAG 100


>gnl|CDD|215121 PLN02189, PLN02189, cellulose synthase.
          Length = 1040

 Score = 26.5 bits (58), Expect = 8.8
 Identities = 21/104 (20%), Positives = 42/104 (40%), Gaps = 2/104 (1%)

Query: 64  DEADDRFTLIVPKPEMLPYLCLIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEE 123
           D+ +++F  ++      P     P    +   E  + +S  K+  P PV +    K  E+
Sbjct: 140 DDENNQFPPVITGVRSRPVSGEFPIGSGYGHGEQMLSSSLHKRVHPYPVSEPGSAKWDEK 199

Query: 124 DEEDGEE--DEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYLRPV 165
            E   +E  D+ + ++G    +P+D    +   +    P  R V
Sbjct: 200 KEGGWKERMDDWKMQQGNLGPDPDDYDADMALIDEARQPLSRKV 243


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 26.6 bits (59), Expect = 9.0
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 106 KKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
           K+K  PV ++   ++ EE+EE+ EE+E    E      PE
Sbjct: 392 KEKEHPVVERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPE 431


>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3). 
           This family contains EBNA-3A, -3B, and -3C which are
           latent infection nuclear proteins important for
           Epstein-Barr virus (EBV)-induced B-cell immortalisation
           and the immune response to EBV infection.
          Length = 254

 Score = 26.2 bits (58), Expect = 9.5
 Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYLRPVKYPQR 170
           E D++   E  +E E+ E   + E+   +VP  E       RP  + +R
Sbjct: 209 EPDDDATVETSSESEDSESESDDEELPYIVPRMEP---VQGRPPVFLRR 254


>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein.  S-antigens are heat
           stable proteins that are found in the blood of
           individuals infected with malaria.
          Length = 94

 Score = 25.3 bits (55), Expect = 9.6
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 113 KKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSN 149
            K  K +K E+ +E+GE +  +EE+    E   D  N
Sbjct: 49  SKNGKGQKYEDLQEEGEGENDDEEDSNSEESNNDEEN 85


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 26.3 bits (58), Expect = 9.8
 Identities = 14/52 (26%), Positives = 29/52 (55%)

Query: 93  EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEP 144
           EE++ N QA+ ++ ++      +S     ++DE   E+++ +EE+ E  E P
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESP 52


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.306    0.127    0.342 

Gapped
Lambda     K      H
   0.267   0.0857    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,000,509
Number of extensions: 861483
Number of successful extensions: 4484
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3114
Number of HSP's successfully gapped: 673
Length of query: 175
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 85
Effective length of database: 6,945,742
Effective search space: 590388070
Effective search space used: 590388070
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 56 (25.1 bits)