RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3485
         (304 letters)



>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 37.3 bits (87), Expect = 0.002
 Identities = 22/83 (26%), Positives = 44/83 (53%), Gaps = 12/83 (14%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEIL--DN-EDDTKSQGSVET-----IDDDI---EMV 109
           +E+ + +E++  + + E +E EDGE L  DN E D ++   ++      +DD+    E+ 
Sbjct: 5   DEEEE-LEDDIDDLDEEAEEEEDGEDLFGDNMERDYRANPELDRYDEDGLDDESDYEELS 63

Query: 110 FAAEKKADPKRDMEKVRLRKQLM 132
            AA + A+ + +     LR++ M
Sbjct: 64  AAARRAAEARLNRRDRLLRRRRM 86


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 34.3 bits (79), Expect = 0.003
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 200 SMAIHLRTHTKEKPFACEMCNYRAAS 225
           ++  H+RTHT EKP+ C +C    +S
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 37.3 bits (87), Expect = 0.008
 Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 11/69 (15%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKS-QGSVETIDDDIEMVFAAEKKADPK 119
                  EE PP P P ED+ +  E  D E+   +   ++E      E+V  A + A P 
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPE--DEEEQDDAEDRALE------EIVLEAVRAALPP 311

Query: 120 RDMEKVRLR 128
            D+   RL 
Sbjct: 312 -DLLA-RLA 318


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 33.1 bits (76), Expect = 0.039
 Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 2/47 (4%)

Query: 61  EEDVDKVEEE--PPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
            ED  K +E+    +    +D+ +D +  D  D       ++  DDD
Sbjct: 50  AEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDD 96



 Score = 30.4 bits (69), Expect = 0.40
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 68  EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
           + E    + E++E ED  +LD++DD      +  +DDD
Sbjct: 49  DAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDD 86



 Score = 29.6 bits (67), Expect = 0.66
 Identities = 9/41 (21%), Positives = 23/41 (56%)

Query: 68  EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEM 108
            E+  + + +E++ +D  + D++DD       +  DDD+++
Sbjct: 50  AEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDL 90



 Score = 27.7 bits (62), Expect = 3.4
 Identities = 10/50 (20%), Positives = 19/50 (38%)

Query: 58  TGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIE 107
            G E   +  +   P  + E+   +D +  D +D        +  DDD+ 
Sbjct: 32  CGEEVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLP 81


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 34.1 bits (78), Expect = 0.096
 Identities = 22/103 (21%), Positives = 38/103 (36%), Gaps = 10/103 (9%)

Query: 38  KKNHQNRNQK--------KMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILD- 88
           K+  QN   K         +     +  +  +E++    +   E   EE E ED   ++ 
Sbjct: 25  KRIQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQ 84

Query: 89  -NEDDTKSQGSVETIDDDIEMVFAAEKKADPKRDMEKVRLRKQ 130
              D  KS   V+      +M    +   D K  +E+  LRK+
Sbjct: 85  NQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKR 127


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 33.8 bits (77), Expect = 0.12
 Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 6/64 (9%)

Query: 65  DKVEEEPPEPEPEEDE----VEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKR 120
           D V EE  E E EEDE     E G+++D + + + +  ++T+ D I+    ++ K    +
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFID--LESQTKKPSVK 194

Query: 121 DMEK 124
           D  K
Sbjct: 195 DNGK 198


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 33.7 bits (77), Expect = 0.13
 Identities = 20/85 (23%), Positives = 36/85 (42%), Gaps = 7/85 (8%)

Query: 49  MKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEM 108
           M S+ ++     EE+ ++ EEE  E +  E+E  + E  + E +  +         + EM
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADN-------GSEEEM 486

Query: 109 VFAAEKKADPKRDMEKVRLRKQLMA 133
             ++E   D +   E    R   MA
Sbjct: 487 EGSSEGDGDGEEPEEDAERRNSEMA 511


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 33.1 bits (76), Expect = 0.13
 Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 5/48 (10%)

Query: 49  MKSKTAKYWTGGEEDVDK-----VEEEPPEPEPEEDEVEDGEILDNED 91
            K  +A+     E+D        V+ EPPE E EE   E+ E+ + ED
Sbjct: 134 SKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEEEVKEPED 181


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 33.6 bits (77), Expect = 0.13
 Identities = 20/91 (21%), Positives = 37/91 (40%), Gaps = 1/91 (1%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKR 120
           EE+    +    E E E + V      D++DD       +  ++D E   +  K    KR
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 121 DMEKVR-LRKQLMAIEEKIKAARHPLFSPDD 150
           +  +++ L+KQ     +KI   ++     D 
Sbjct: 61  EKARLKELKKQKKQEIQKILEQQNAAIDADM 91


>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306).  This
           family of proteobacterial species proteins has no known
           function.
          Length = 115

 Score = 31.1 bits (71), Expect = 0.23
 Identities = 17/62 (27%), Positives = 23/62 (37%), Gaps = 6/62 (9%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETI--DDDIEMVFAAEKKADP 118
           EE+       P   E EE E+ED E+L+  D        +T+    D     A       
Sbjct: 26  EEEAAAAAPAPAPEEEEEAELEDEELLEELDLP----DPDTLTPGSDFSAFLAKGVPEAL 81

Query: 119 KR 120
           KR
Sbjct: 82  KR 83


>gnl|CDD|222450 pfam13909, zf-H2C2_5, C2H2-type zinc-finger domain. 
          Length = 24

 Score = 28.7 bits (65), Expect = 0.24
 Identities = 9/25 (36%), Positives = 13/25 (52%), Gaps = 1/25 (4%)

Query: 248 YHCAFCNFTSPTQWAINRHTKYTHM 272
           Y C+ C ++S T   + RH K  H 
Sbjct: 1   YKCSHCPYSS-TPKKLERHLKLHHG 24


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 32.7 bits (74), Expect = 0.27
 Identities = 24/71 (33%), Positives = 37/71 (52%), Gaps = 8/71 (11%)

Query: 59  GGEEDVDKVE-EEPPEPEPEEDEVE-----DGEILDNEDDTKSQG--SVETIDDDIEMVF 110
            GE + ++VE E   E E  EDE E     +GE +++E + +++G   VET  D  E   
Sbjct: 707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEH 766

Query: 111 AAEKKADPKRD 121
             E +A+ K D
Sbjct: 767 EGETEAEGKED 777



 Score = 28.8 bits (64), Expect = 4.0
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 59  GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNED 91
           GG+ + ++ EEE  E E EE+E E+ E  +NE+
Sbjct: 860 GGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 32.6 bits (75), Expect = 0.27
 Identities = 15/80 (18%), Positives = 37/80 (46%), Gaps = 1/80 (1%)

Query: 60  GEEDVD-KVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADP 118
            + D++ ++++EP   +   D  ED +  +  D+   + S    D+D +     +  A  
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161

Query: 119 KRDMEKVRLRKQLMAIEEKI 138
           +R +E+ R R++      ++
Sbjct: 162 ERSLERRRRRREWEEKRAEL 181


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 32.5 bits (74), Expect = 0.31
 Identities = 15/47 (31%), Positives = 20/47 (42%)

Query: 68  EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEK 114
           E +PP P P  +  E  E  D  D T      ET     E++F A +
Sbjct: 270 EPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEELMFDAVE 316


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 31.7 bits (72), Expect = 0.36
 Identities = 18/80 (22%), Positives = 31/80 (38%), Gaps = 4/80 (5%)

Query: 35  TKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVE-DGEILDNEDDT 93
           T  K     +++KK K K  +     E D DK EE   +P+  E+  +       N    
Sbjct: 58  TPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSD 117

Query: 94  KSQGSVETIDD-DIEMVFAA 112
                 + +DD D+++    
Sbjct: 118 IDN--EDDMDDSDLKLATEN 135


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 31.6 bits (71), Expect = 0.43
 Identities = 18/55 (32%), Positives = 23/55 (41%), Gaps = 5/55 (9%)

Query: 41  HQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKS 95
              +       K  K     E++ D+ EEE  E E EE EVE+ E   N   T S
Sbjct: 117 QLPKKAGNAGKKATK-----EDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNS 166



 Score = 27.4 bits (60), Expect = 8.8
 Identities = 11/44 (25%), Positives = 19/44 (43%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDD 104
           EE+ D    E    E   +E E+ E  D  +D +++    T+  
Sbjct: 52  EENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAENTTLST 95


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 32.0 bits (73), Expect = 0.44
 Identities = 15/72 (20%), Positives = 24/72 (33%), Gaps = 11/72 (15%)

Query: 59  GGEEDVDKVEEEPPEPEPEEDEVEDGE-----------ILDNEDDTKSQGSVETIDDDIE 107
              +D   +EE+P EP P+ ++ ++GE                DD K     E  D    
Sbjct: 317 PQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQEEADGQGS 376

Query: 108 MVFAAEKKADPK 119
               A      +
Sbjct: 377 STDPAGDIFRIR 388


>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
          Length = 130

 Score = 30.2 bits (68), Expect = 0.52
 Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 2/51 (3%)

Query: 68  EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADP 118
           E+EP EPE EE   E+    +N D+       E  D + +      +K  P
Sbjct: 10  EDEPSEPEIEE-GAEEDSESNNNDE-VKGSIGENEDKEEQEPVQRPRKTTP 58


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 30.4 bits (69), Expect = 0.58
 Identities = 13/47 (27%), Positives = 23/47 (48%)

Query: 9   NLQQIQKTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAK 55
           NL  ++K  PK + +   +I  +K   K +K  +NR +K    +  K
Sbjct: 80  NLAALKKFEPKYRLIRLGLIKKKKAGRKARKELKNRRKKVRGKEKTK 126


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 31.3 bits (71), Expect = 0.73
 Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 3/75 (4%)

Query: 31  RKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNE 90
           ++ WTK K+  Q+       + +A        + D  EEE  E E EEDE E      ++
Sbjct: 358 KQKWTKAKETEQDYILD---AFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSD 414

Query: 91  DDTKSQGSVETIDDD 105
           D+   +  VE+  +D
Sbjct: 415 DEEFEEDDVESKYED 429


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 30.5 bits (69), Expect = 0.74
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDD 92
            E  D  EE+  + E EE+E E+ E  D++DD
Sbjct: 163 LEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194



 Score = 30.1 bits (68), Expect = 1.1
 Identities = 15/45 (33%), Positives = 24/45 (53%)

Query: 48  KMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDD 92
           K+     K      EDVD+ +E+  E E EE+E ++    D++DD
Sbjct: 152 KLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196



 Score = 29.3 bits (66), Expect = 2.2
 Identities = 17/102 (16%), Positives = 43/102 (42%), Gaps = 4/102 (3%)

Query: 11  QQIQKTNPKLKKVNY-PIIVARKMWTKLKKNHQNR---NQKKMKSKTAKYWTGGEEDVDK 66
           Q+ +K  P +K+  +   +  +++++ +  N +        K K K   +    E+  +K
Sbjct: 93  QKKRKIGPSIKEHPFDLELFPKELYSVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEK 152

Query: 67  VEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEM 108
           +     + +  E E  D E   +E++ + +   +   DD + 
Sbjct: 153 LSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194


>gnl|CDD|219236 pfam06949, DUF1292, Protein of unknown function (DUF1292).  This
           family consists of several hypothetical bacterial
           proteins of around 90 residues in length. The function
           of this family is unknown.
          Length = 74

 Score = 28.7 bits (65), Expect = 0.76
 Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 4/42 (9%)

Query: 75  EPEEDEVEDGE--ILDNEDDTKSQGSVETIDDD--IEMVFAA 112
              E+E ED E  I   E+D   +  +  I+DD   +MV  A
Sbjct: 26  PVGEEEDEDIEVYIFKIEEDEDGEEELLPIEDDEEWDMVEEA 67


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 30.9 bits (70), Expect = 0.80
 Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 5/103 (4%)

Query: 49  MKSKTAKYWTGGEEDVDKVEEEPP---EPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
           +K K  K     E+D+ +  E+P    E E E+++ +D    D ++D    G+ E +   
Sbjct: 113 LKQKVKKNNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAAS 172

Query: 106 IEMVFAAEKKAD--PKRDMEKVRLRKQLMAIEEKIKAARHPLF 146
            E      K+ D   +      +   +   + +K +       
Sbjct: 173 SESGVDRVKEDDEEDEDADLSKKDVLEEPKMFKKPEEITWDDV 215


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 31.2 bits (71), Expect = 0.80
 Identities = 27/123 (21%), Positives = 46/123 (37%), Gaps = 12/123 (9%)

Query: 8   PNLQQIQKTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKV 67
           P L+         KK +     + K+     K   ++   K+K K  K  +   +D    
Sbjct: 495 PWLKTTSSVGKSAKKQDSKKKSSSKLDKAANK--ISKAAVKVKKKKKKEKSIDLDDDLID 552

Query: 68  EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETI-----DDDIEMVFAAEKKADPKRDM 122
           EE+  + + +++E ED E L      K     + I      DD+   F  EKK   + + 
Sbjct: 553 EEDSIKLDVDDEEDEDDEELPFLFKQK-----DLIKEAFAGDDVVAEFEKEKKEVIEEED 607

Query: 123 EKV 125
            K 
Sbjct: 608 PKE 610


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 31.1 bits (70), Expect = 0.83
 Identities = 23/102 (22%), Positives = 45/102 (44%), Gaps = 6/102 (5%)

Query: 48  KMKSKTAKYWTGGEEDVDKVE--EEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
           K K K A      E+D    E  ++  + +PEE++ ED E  D++D+ +   +       
Sbjct: 263 KDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKL 322

Query: 106 IEMVFAAEKKADPKR----DMEKVRLRKQLMAIEEKIKAARH 143
            E V        PK     +  + +L++Q+  +E++  A + 
Sbjct: 323 DEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKENLAPKS 364



 Score = 27.6 bits (61), Expect = 9.8
 Identities = 23/98 (23%), Positives = 33/98 (33%), Gaps = 14/98 (14%)

Query: 59  GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADP 118
              ED    EEE  E E  EDE+ D E   +  +            +       E   + 
Sbjct: 109 MDSEDSADDEEEEEEDESLEDEMIDDEDEADLFN------ESESSLEDLSDDETEDDEEK 162

Query: 119 KRDMEKVRLRKQLMAIEEKIKAARHPLFSP---DDQFF 153
           K + E+    K+        +A R   F     DD+FF
Sbjct: 163 KMEEEEAGEEKE-----SVEQATREKKFDKSGVDDKFF 195


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 30.5 bits (68), Expect = 0.86
 Identities = 15/64 (23%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILD---NEDDTKSQGSVETIDDDIEMVFAAEKKAD 117
            E+++   EEPPE E E +E +  E +D    E+ +  + +++ ++     +++   K D
Sbjct: 43  VEELEDQPEEPPEQEEENEEQKPKEEIDYPIQENKSFDEKNLDDLERSNSDIYSESHKYD 102

Query: 118 PKRD 121
              D
Sbjct: 103 NASD 106


>gnl|CDD|218970 pfam06278, DUF1032, Protein of unknown function (DUF1032).  This
           family consists of several conserved eukaryotic proteins
           of unknown function.
          Length = 565

 Score = 31.1 bits (70), Expect = 0.87
 Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 4/79 (5%)

Query: 34  WTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDT 93
           W  +K+  + + Q   +    ++    E+ +  +EE+  E   E+D   D +  D ED  
Sbjct: 342 WKHVKERLETQRQMLRRRGAERWLPDEEQKLWPLEEDRLEDSVEDDGDAD-DFSDPEDYL 400

Query: 94  KSQGSV---ETIDDDIEMV 109
           +    +   E    D+E  
Sbjct: 401 EPPEGLDPEEQAFQDLEAE 419


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 30.4 bits (69), Expect = 1.0
 Identities = 20/95 (21%), Positives = 39/95 (41%), Gaps = 7/95 (7%)

Query: 47  KKMKSKTAKYWTG-GEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
           KK K K +K   G  +E+ D+ E E  +  P+E    D   ++              +  
Sbjct: 2   KKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKI------GKNPS 55

Query: 106 IEMVFAAEKKADPKRDMEKVRLRKQLMAIEEKIKA 140
           ++  F  +K  + K    +  LR++ +  +E +K 
Sbjct: 56  VDTSFLPDKAREEKEAELREELREEFLKKQEAVKE 90


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.9 bits (69), Expect = 1.1
 Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 4/111 (3%)

Query: 30   ARKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDN 89
            A+K      K  + +  ++ K K  +      E+  K EE     E  + +  +      
Sbjct: 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671

Query: 90   EDDTKSQGSVETIDDDIEMVFAAEKKADPKRDMEKVRLRKQLMAIEEKIKA 140
            ED  K++ + +  +D+ +   A +K+A+  +  E+++ ++     EEK KA
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA----EEKKKA 1718


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 7/68 (10%)

Query: 46  QKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
           Q K ++K++K  T  + D    ++E    + ++DE      LD+ DD       +  D+D
Sbjct: 228 QPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDD-------DVSDED 280

Query: 106 IEMVFAAE 113
            E +F  +
Sbjct: 281 GEDLFDTD 288


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 30.7 bits (69), Expect = 1.1
 Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 16/80 (20%)

Query: 60   GEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPK 119
             +E+ ++ + E  EP  +ED +E+   LD +            DD  ++    EK     
Sbjct: 4016 ADENKEEADAEKDEPMQDEDPLEENNTLDED---------IQQDDFSDLAEDDEK----- 4061

Query: 120  RDMEKVRLRKQLMAIEEKIK 139
              M +    + +   EE  +
Sbjct: 4062 --MNEDGFEENVQENEESTE 4079


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 59  GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADP 118
           G E + D  EE+   PE +ED+    E   +  D +     ++ +++   V   E++ DP
Sbjct: 2   GSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDS-EEEQIFVTRQEEEVDP 60

Query: 119 KRDMEKVRLRKQLMA 133
           + + E  R  +++MA
Sbjct: 61  EAEAEFDREFEKMMA 75


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.0 bits (68), Expect = 1.2
 Identities = 21/101 (20%), Positives = 41/101 (40%), Gaps = 21/101 (20%)

Query: 62  EDVDKVEEEPPEPEPE-------EDEVEDGEILDN---EDDTKSQGSVETIDDDIEMVFA 111
            D D  E EPP P  +       +DE ED ++ D+   E+D + +               
Sbjct: 2   ADWDDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKA----------K 51

Query: 112 AEKKADPKRDME-KVRLRKQLMAIEEKIKAARHPLFSPDDQ 151
              KA  K+ ++ K+  +++    +E+         +P+D+
Sbjct: 52  VAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDE 92


>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein.  Members of this
           family are bacterial proteins with a conserved motif
           [KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
           followed by a long region of low complexity sequence in
           which roughly half the residues are Asp and Glu,
           including multiple runs of five or more acidic residues.
           The function of members of this family is unknown.
          Length = 129

 Score = 29.2 bits (65), Expect = 1.2
 Identities = 16/47 (34%), Positives = 23/47 (48%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIE 107
           EE  ++V+   P P   +D  ED +I D++DD       E  D D E
Sbjct: 75  EEADEEVDPGAPTPAGGDDLAEDEDIADDDDDVTFLEDEEDDDIDDE 121


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 29.8 bits (67), Expect = 1.3
 Identities = 20/96 (20%), Positives = 38/96 (39%), Gaps = 3/96 (3%)

Query: 46  QKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
           QK    K A      +E+ D+  +   E +  E E  + E    E + +     +  ++ 
Sbjct: 28  QKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEI 87

Query: 106 IEMVFAAEKKADPKRD-MEKVRLRKQL--MAIEEKI 138
            +     E+  D K+  +   + +K    M +EEKI
Sbjct: 88  AQEEEKEEEAEDVKQQEVFSFKRKKPFKEMNLEEKI 123


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 30.0 bits (68), Expect = 1.3
 Identities = 17/95 (17%), Positives = 41/95 (43%), Gaps = 3/95 (3%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAA---EKKAD 117
           EE+  K E E      ++D+ E+   ++ ++D+  +G    ++ D E+  +    E++ D
Sbjct: 80  EEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKD 139

Query: 118 PKRDMEKVRLRKQLMAIEEKIKAARHPLFSPDDQF 152
                 K    ++L   +E+  A      +  ++ 
Sbjct: 140 EAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKA 174



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 24/85 (28%), Positives = 39/85 (45%), Gaps = 1/85 (1%)

Query: 59  GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSV-ETIDDDIEMVFAAEKKAD 117
           G E D D  EEE  E E +ED  ++GE +D E D + + S  E  ++  E    A++ +D
Sbjct: 91  GLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSD 150

Query: 118 PKRDMEKVRLRKQLMAIEEKIKAAR 142
            +   E      +    E + + A 
Sbjct: 151 EELSEEDEEEAAEEEEAEAEKEKAS 175


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 30.0 bits (67), Expect = 1.3
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 4/64 (6%)

Query: 60  GEEDVDKVEEEPPEPEPEEDE-VEDGE-ILD--NEDDTKSQGSVETIDDDIEMVFAAEKK 115
           G +   +   E  E E EE E  +DGE ++D   E   K QG  E +++++  +F  E +
Sbjct: 197 GSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWE 256

Query: 116 ADPK 119
            +  
Sbjct: 257 EESP 260



 Score = 28.4 bits (63), Expect = 4.4
 Identities = 20/82 (24%), Positives = 35/82 (42%), Gaps = 6/82 (7%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILD------NEDDTKSQGSVETIDDDIEMVFAAEK 114
           + DVD  EEE  E     +E ED E+        N +  + +       DD E V   E 
Sbjct: 171 QPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEG 230

Query: 115 KADPKRDMEKVRLRKQLMAIEE 136
           +   K+  E+  + ++++ + E
Sbjct: 231 ERIDKKQGEEEEMEEEVINLFE 252


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 30.0 bits (67), Expect = 1.4
 Identities = 13/52 (25%), Positives = 23/52 (44%)

Query: 62  EDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAE 113
           E  +  E+EP + + +E  + DGE    +   K +   ET D  ++     E
Sbjct: 29  ESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRTHTE 80


>gnl|CDD|221757 pfam12761, End3, Actin cytoskeleton-regulatory complex protein
           END3.  Endocytosis is accomplished through the
           sequential recruitment at endocytic sites of proteins
           that drive cargo sorting, membrane invagination and
           vesicle release. End3p is part of the coat module
           protein complex Pan1, along with Pan1p, Sla1p, and
           Sla2p. The proteins in this complex are regulated by
           phosphorylation events. End3p also regulates the
           cortical actin cytoskeleton. The subunits of the Pan1
           complex are homologous to mammalian intersectin.
          Length = 193

 Score = 29.6 bits (67), Expect = 1.4
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 121 DMEKVRLRKQLMAIEEKIKAA 141
           D E+VRLR++L  +E K+  A
Sbjct: 95  DWEEVRLRRELAELESKLSKA 115


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 28.8 bits (64), Expect = 1.4
 Identities = 13/56 (23%), Positives = 27/56 (48%)

Query: 62  EDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKAD 117
           +D  + EEE  E + EE+   + E  + E++T+        +D+ +     ++K D
Sbjct: 48  DDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTD 103



 Score = 28.4 bits (63), Expect = 2.0
 Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 5/78 (6%)

Query: 46  QKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
           ++K   K A     G E+ +   +E  +   EE+EV++ +  +  +  + +G  E   + 
Sbjct: 24  EEKENGKNAP--ANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEG 81

Query: 106 IEMVFAAEK---KADPKR 120
                AAE     A+ K+
Sbjct: 82  ATGKRAAEDEEDDAETKK 99


>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
          Length = 211

 Score = 29.8 bits (67), Expect = 1.5
 Identities = 14/78 (17%), Positives = 26/78 (33%), Gaps = 1/78 (1%)

Query: 5   PKIPNLQQIQKTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDV 64
           P  P          K      P    + + TK  K      + +   K   +    EE V
Sbjct: 25  PPNPLSLFSSPNRKKSAPEQVPTGKNKLLLTKRSKLK-GIPKPRKLHKHGFWAEIFEEKV 83

Query: 65  DKVEEEPPEPEPEEDEVE 82
           ++ E   P  + ++D ++
Sbjct: 84  EREELGNPCKDLDKDIIK 101


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 19/76 (25%), Positives = 36/76 (47%), Gaps = 4/76 (5%)

Query: 55  KYWTGGEEDVDKVEEEPPEPEPE----EDEVEDGEILDNEDDTKSQGSVETIDDDIEMVF 110
            +  G  E+ ++ EE+  + E E    +D  E+ E +D  D+ + +   ++ D+D E   
Sbjct: 313 NFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372

Query: 111 AAEKKADPKRDMEKVR 126
             EK+   K+  E  R
Sbjct: 373 EEEKEKKKKKSAESTR 388


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 26.4 bits (58), Expect = 1.8
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 214 FACEMCNYRAASKIAIYAHMRTTH 237
           F C +C    +SK A+  H+R  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 24.5 bits (53), Expect = 8.6
 Identities = 6/24 (25%), Positives = 12/24 (50%)

Query: 248 YHCAFCNFTSPTQWAINRHTKYTH 271
           + C  C  +  ++ A+ RH +  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 29.8 bits (67), Expect = 2.0
 Identities = 10/15 (66%), Positives = 11/15 (73%), Gaps = 1/15 (6%)

Query: 259 TQWAINRHTKYTHMN 273
           T W IN+HT YTH N
Sbjct: 428 TDW-INKHTVYTHGN 441


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 29.8 bits (67), Expect = 2.3
 Identities = 20/73 (27%), Positives = 32/73 (43%), Gaps = 10/73 (13%)

Query: 45  NQKKMKSKTAKYWTGGEEDVDKVEEEPPEPE----------PEEDEVEDGEILDNEDDTK 94
              + KS+  + + G EED D +EE   + +            E + EDGE  + EDD +
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692

Query: 95  SQGSVETIDDDIE 107
           +   V  ID  + 
Sbjct: 693 ANEGVVPIDKAVR 705


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 29.6 bits (66), Expect = 2.7
 Identities = 14/69 (20%), Positives = 34/69 (49%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKR 120
           EED+   +    + + E+++ +D +  D+E++ + +  ++  DD+ E     E     K 
Sbjct: 142 EEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKS 201

Query: 121 DMEKVRLRK 129
           +++K    K
Sbjct: 202 EVDKTDCFK 210


>gnl|CDD|177300 PHA00732, PHA00732, hypothetical protein.
          Length = 79

 Score = 27.4 bits (61), Expect = 2.8
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query: 248 YHCAFCNFTSPTQWAINRHTKYTHMNTSGPTIQPECIKI 286
           + C  C FT+ T +A+ +H +  H  T  P       ++
Sbjct: 2   FKCPICGFTTVTLFALKQHARRNHTLTKCPVCNKSYRRL 40


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
          domain.  The centromere protein B (CENP-B) dimerisation
          domain is composed of two alpha-helices, which are
          folded into an antiparallel configuration. Dimerisation
          of CENP-B is mediated by this domain, in which monomers
          dimerise to form a symmetrical, antiparallel,
          four-helix bundle structure with a large hydrophobic
          patch in which 23 residues of one monomer form van der
          Waals contacts with the other monomer. This CENP-B
          dimer configuration may be suitable for capturing two
          distant CENP-B boxes during centromeric heterochromatin
          formation.
          Length = 101

 Score = 27.8 bits (61), Expect = 2.8
 Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 53 TAKYWTGGEE-DVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQG 97
          T  +  G E+ D D  EEE  + E EED+ ED +  D+E    S G
Sbjct: 2  TLHFLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSFG 47


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 29.2 bits (65), Expect = 2.9
 Identities = 7/40 (17%), Positives = 9/40 (22%)

Query: 59  GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGS 98
           G   +       PP P       E      +      Q S
Sbjct: 125 GSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPS 164


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 28.4 bits (62), Expect = 2.9
 Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 7/82 (8%)

Query: 47  KKMKSKTAKYWTGGEEDVDKVE-----EEPPEPEPEEDEVEDGEILDNEDDTKSQGSVET 101
           K  K K  K    G +D +  +     EE  EPEPEE+  E+ E    E+  +     ET
Sbjct: 38  KGGKGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPE--PEEEGEEEPEPEET 95

Query: 102 IDDDIEMVFAAEKKADPKRDME 123
            +++ E     E + +P+ + E
Sbjct: 96  GEEEPEPEPEPEPEPEPEPEPE 117


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 28.9 bits (65), Expect = 3.3
 Identities = 26/107 (24%), Positives = 42/107 (39%), Gaps = 11/107 (10%)

Query: 21  KKVNYPI--IVARKMWTKLKKNHQNRN---QKKMKSKTAKYWTGGEEDVDKVEEE----- 70
           ++VN  I  +V      +L    +N +    K    + A+      E+VD+ E+E     
Sbjct: 322 RRVNDVIRPLVREHNNDQLNVKLRNPSTKESKMRDKRRARLDPIDFEEVDEDEDEEEEQR 381

Query: 71  PPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKAD 117
             E E EE E  + E   + +D  S  S   +  D E     E  +D
Sbjct: 382 SDEHEEEEGEDSEEEGSQSRED-GSSESSSDVGSDSESKADKESASD 427


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 28.9 bits (64), Expect = 3.6
 Identities = 17/81 (20%), Positives = 40/81 (49%), Gaps = 4/81 (4%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGS----VETIDDDIEMVFAAEKKA 116
           E+ V++ E+   E    E++ +D E L  + + + +       ET + D ++  +  KK 
Sbjct: 474 EDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKR 533

Query: 117 DPKRDMEKVRLRKQLMAIEEK 137
               + E+ +L+  +M+ ++K
Sbjct: 534 KVDEEEEEKKLKMIMMSNKQK 554


>gnl|CDD|223976 COG1047, SlpA, FKBP-type peptidyl-prolyl cis-trans isomerases 2
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 174

 Score = 28.0 bits (63), Expect = 4.0
 Identities = 13/47 (27%), Positives = 20/47 (42%), Gaps = 1/47 (2%)

Query: 60  GEEDVDKVEEEPPEPEPEEDEVEDGEILD-NEDDTKSQGSVETIDDD 105
           GE D D V+  P +      E+E G  ++    D +  G V  +  D
Sbjct: 70  GEYDPDLVQRVPRDEFQGVGELEVGMEVEAEGGDGEIPGVVTEVSGD 116


>gnl|CDD|217678 pfam03699, UPF0182, Uncharacterized protein family (UPF0182).  This
           family contains uncharacterized integral membrane
           proteins.
          Length = 771

 Score = 29.1 bits (66), Expect = 4.0
 Identities = 11/30 (36%), Positives = 12/30 (40%), Gaps = 9/30 (30%)

Query: 261 WAINRHTKYTH--------MNTSGPTIQPE 282
           W INRH  YTH        +N       PE
Sbjct: 413 W-INRHLVYTHGYGFVMSPVNQVTSEGLPE 441


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 29.0 bits (66), Expect = 4.0
 Identities = 18/69 (26%), Positives = 29/69 (42%), Gaps = 12/69 (17%)

Query: 63  DVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKRDM 122
            VD   EE P     E E  D +  + E+           +D+ +   AA++   P++ +
Sbjct: 173 FVDPNAEEDPAHVGSELEELDDDEDEEEE-----------EDENDDSLAADESELPEKVL 221

Query: 123 EKV-RLRKQ 130
           EK   L KQ
Sbjct: 222 EKFKALAKQ 230


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 28.4 bits (64), Expect = 4.0
 Identities = 12/70 (17%), Positives = 29/70 (41%), Gaps = 5/70 (7%)

Query: 72  PEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKRDMEKVRLRK-- 129
           P   P   E E+  +     +       + + D+++       + +  R +E+ R ++  
Sbjct: 14  PPKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEE---LDEEEDDRFLEEYRRKRLA 70

Query: 130 QLMAIEEKIK 139
           ++ A+ EK K
Sbjct: 71  EMKALAEKSK 80


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 28.8 bits (65), Expect = 4.1
 Identities = 12/30 (40%), Positives = 14/30 (46%)

Query: 66  KVEEEPPEPEPEEDEVEDGEILDNEDDTKS 95
              EEP E   EE E  D E + +EDD   
Sbjct: 831 LPYEEPEEETAEEPEEVDPEDVKSEDDIDD 860



 Score = 28.8 bits (65), Expect = 4.5
 Identities = 14/49 (28%), Positives = 21/49 (42%), Gaps = 2/49 (4%)

Query: 62  EDVDKVEE-EP-PEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEM 108
             V  + E EP P  EPEE+  E+ E +D ED        +     + +
Sbjct: 820 YKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQLGL 868


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 28.7 bits (65), Expect = 4.1
 Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 1/61 (1%)

Query: 35  TKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTK 94
            K+KK  +   +KK + +  +          K EEE  E E +E+E E+ E    E+  +
Sbjct: 410 KKIKKIVEKA-EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468

Query: 95  S 95
            
Sbjct: 469 E 469


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 28.3 bits (64), Expect = 4.3
 Identities = 14/72 (19%), Positives = 27/72 (37%), Gaps = 12/72 (16%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKR 120
           + +           E E DE +  +  D+++           D+D +    A+   DP+ 
Sbjct: 31  DPNAAAAAATAAAIESELDEEDLEDDDDDDE-----------DEDEDDEEEADLGPDPEE 79

Query: 121 DMEKV-RLRKQL 131
             E+   LR+Q 
Sbjct: 80  ARERFAELREQY 91


>gnl|CDD|224881 COG1970, MscL, Large-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 130

 Score = 27.6 bits (62), Expect = 4.5
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILD 88
              ++K E E P P P  +EV   EI D
Sbjct: 98  RRKLEKEEPEAPAPAPPAEEVLLTEIRD 125


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 28.8 bits (65), Expect = 4.5
 Identities = 5/47 (10%), Positives = 20/47 (42%), Gaps = 1/47 (2%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETID-DDI 106
           +++ ++ +E     +  +D+    +  D+E   +++   +     D 
Sbjct: 166 DDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTATADP 212



 Score = 28.0 bits (63), Expect = 6.7
 Identities = 11/68 (16%), Positives = 27/68 (39%), Gaps = 7/68 (10%)

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKK----- 115
           ++D D  +++  + + + D+ ++ +    E +  S       D+D        +K     
Sbjct: 148 DDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLT 207

Query: 116 --ADPKRD 121
             ADP + 
Sbjct: 208 ATADPVKA 215


>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6.  This region covers
           TM regions 4-6 of the ryanodine receptor 1 family.
          Length = 217

 Score = 28.1 bits (63), Expect = 4.9
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 66  KVEEEPPEPEPEEDEVEDGE 85
           K E E P+ EPE+ ++EDGE
Sbjct: 92  KEEGEEPKSEPEKADLEDGE 111


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 27.8 bits (62), Expect = 5.2
 Identities = 14/47 (29%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 59  GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
           G E+++D  E++  E + E D ++D    D+ED+   +     I+DD
Sbjct: 113 GDEDELDYDEDKEEEEDDEVDSLDDEN--DDEDEDDDEIVEILIEDD 157


>gnl|CDD|234609 PRK00068, PRK00068, hypothetical protein; Validated.
          Length = 970

 Score = 28.4 bits (64), Expect = 5.3
 Identities = 9/13 (69%), Positives = 9/13 (69%), Gaps = 1/13 (7%)

Query: 261 WAINRHTKYTHMN 273
           W INRH  YTH N
Sbjct: 430 W-INRHLVYTHGN 441


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 26.6 bits (59), Expect = 5.4
 Identities = 20/63 (31%), Positives = 29/63 (46%), Gaps = 3/63 (4%)

Query: 36 KLK-KNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTK 94
          KLK K  +   +KK K K  K  +  +E+V   +EE  +   E D  E  E  DNE   +
Sbjct: 9  KLKLKGKKIDVKKKKKKKKKKNKS--KEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQ 66

Query: 95 SQG 97
           + 
Sbjct: 67 EED 69


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 27.3 bits (61), Expect = 5.7
 Identities = 20/58 (34%), Positives = 22/58 (37%), Gaps = 3/58 (5%)

Query: 31  RKMWTKLKKNH---QNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGE 85
           RK   K  K     Q + QKK K K AK     EE       E    E EE E +  E
Sbjct: 77  RKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQE 134


>gnl|CDD|163564 TIGR03852, sucrose_gtfA, sucrose phosphorylase.  In the forward
           direction, this enzyme uses phosphate to cleave sucrose
           into D-fructose + alpha-D-glucose 1-phosphate.
           Characterized representatives from Streptococcus mutans
           and Bifidobacterium adolescentis represent
           well-separated branches of a molecular phylogenetic
           tree. In S. mutans, the region including this gene has
           been associated with neighboring transporter genes and
           multiple sugar metabolism.
          Length = 470

 Score = 28.2 bits (63), Expect = 5.8
 Identities = 11/57 (19%), Positives = 21/57 (36%), Gaps = 3/57 (5%)

Query: 32  KMWTKLKKNHQNRNQKKMKSKTAKYWTGG---EEDVDKVEEEPPEPEPEEDEVEDGE 85
           + +    +   N   K +  +   +W  G   +EDVD + +       +E    DG 
Sbjct: 89  EYYQDFLEKKDNSKYKDLFIRYKDFWPNGRPTQEDVDLIYKRKDRAPYQEVTFADGS 145


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 28.1 bits (63), Expect = 6.4
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 59  GGEEDVDKVEEEPPEPEPEEDEVEDG 84
             EE+ +  EEE  E EPEE E E+G
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEG 373


>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722).  This
           eukaryotic family of proteins has no known function.
          Length = 373

 Score = 27.9 bits (62), Expect = 6.6
 Identities = 13/75 (17%), Positives = 28/75 (37%), Gaps = 10/75 (13%)

Query: 38  KKNHQNRNQKKMKSKTA----KYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDT 93
           K   Q +  ++ KS ++       +  E      +++PP+ + +E + E         D 
Sbjct: 278 KPESQQKKHRRTKSSSSFGVIDLNSISEASQVNEDDDPPDSDSKERKNE------ENSDP 331

Query: 94  KSQGSVETIDDDIEM 108
           +S  S +  D     
Sbjct: 332 ESTPSDDNDDKTCSE 346


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.
          This family includes the HABP4 family of
          hyaluronan-binding proteins, and the PAI-1 mRNA-binding
          protein, PAI-RBP1. HABP4 has been observed to bind
          hyaluronan (a glucosaminoglycan), but it is not known
          whether this is its primary role in vivo. It has also
          been observed to bind RNA, but with a lower affinity
          than that for hyaluronan. PAI-1 mRNA-binding protein
          specifically binds the mRNA of type-1 plasminogen
          activator inhibitor (PAI-1), and is thought to be
          involved in regulation of mRNA stability. However, in
          both cases, the sequence motifs predicted to be
          important for ligand binding are not conserved
          throughout the family, so it is not known whether
          members of this family share a common function.
          Length = 106

 Score = 26.6 bits (59), Expect = 7.3
 Identities = 11/28 (39%), Positives = 13/28 (46%)

Query: 58 TGGEEDVDKVEEEPPEPEPEEDEVEDGE 85
             EE     EE   E   EE+EVE+ E
Sbjct: 48 NEVEEKQAVEEEANKEGVVEEEEVEEEE 75


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.1 bits (63), Expect = 7.5
 Identities = 23/111 (20%), Positives = 45/111 (40%), Gaps = 13/111 (11%)

Query: 42  QNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVET 101
           + R   +  ++ A+      ED D +EE   E   E  E+E       E     +  +E 
Sbjct: 332 ECRVAAQAHNEEAE---SLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEE 388

Query: 102 IDDDIEMV-----FAAEK--KADPKRDM---EKVRLRKQLMAIEEKIKAAR 142
           ++++IE +      A      A+   +    E+  LR++   +E  ++ AR
Sbjct: 389 LEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTAR 439


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 26.8 bits (59), Expect = 7.7
 Identities = 14/63 (22%), Positives = 24/63 (38%), Gaps = 4/63 (6%)

Query: 57  WT--GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEK 114
           WT  G  E              EE E  D +  ++ D+     + +  D++ E   A ++
Sbjct: 52  WTEAGRAEKGPAAAATAAAEAAEEAEAADAD--EDADEAAEADAADEADEEEETDEAVDE 109

Query: 115 KAD 117
            AD
Sbjct: 110 TAD 112


>gnl|CDD|236876 PRK11191, PRK11191, RNase E inhibitor protein; Provisional.
          Length = 138

 Score = 26.8 bits (60), Expect = 7.7
 Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 75  EPEEDEVEDGEILDNEDDTKSQG-SVETIDDDIEMVFA-AEK 114
           + EE E+EDG+++   D       + E ID  +E + A AEK
Sbjct: 62  DAEELELEDGDVIFCCDAVSEVALNAELIDAQVEQLLALAEK 103


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 27.7 bits (62), Expect = 9.1
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 68  EEEPPEPEPEEDEVEDGEILDNE 90
           EEE   PE EE+E E+ E    E
Sbjct: 310 EEEEEVPEEEEEEEEEEERTFEE 332


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 25.9 bits (57), Expect = 9.3
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 60 GEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDD 92
          GEE  D  ++E  + + ++D+ +D +  D+EDD
Sbjct: 47 GEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79


>gnl|CDD|234586 PRK00037, hisS, histidyl-tRNA synthetase; Reviewed.
          Length = 412

 Score = 27.4 bits (62), Expect = 9.4
 Identities = 7/23 (30%), Positives = 10/23 (43%)

Query: 161 VLYADLLDHCKLCLRRKRKNPMR 183
            L   L +  +   RR   NP+R
Sbjct: 179 FLEKGLDELDEDSKRRLETNPLR 201


>gnl|CDD|235861 PRK06782, PRK06782, flagellar motor switch protein; Reviewed.
          Length = 528

 Score = 27.5 bits (61), Expect = 9.8
 Identities = 19/115 (16%), Positives = 41/115 (35%), Gaps = 18/115 (15%)

Query: 10  LQQIQKTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAKYWTG---------G 60
           L +  + +  L ++ + + V   + TK+         + +    AK              
Sbjct: 370 LGESIEVDE-LVQITFNLEVGDLLQTKM--------YQILPISEAKEMVRLLLYPMVKEE 420

Query: 61  EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKK 115
           EE   +  E     EP    +E  E+   E        ++ +  ++++VF + KK
Sbjct: 421 EEIATEEIEPEKVVEPVVQPIEFKEVKQMEPVYMDTSILQDVPMNVKVVFGSTKK 475


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 27.7 bits (62), Expect = 9.9
 Identities = 22/95 (23%), Positives = 38/95 (40%), Gaps = 19/95 (20%)

Query: 61  EEDVDKV-EEEPPEPEPEEDEVEDGEILDNEDDTKSQGS---------------VETIDD 104
            E + KV EEE P  +   +E E  EI   E   K                   +E +  
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKR 450

Query: 105 DIEMVFAAEKKADPKRDME-KVRLRKQLMAIEEKI 138
           +IE +    +    +R++  KVR  +++ A + +I
Sbjct: 451 EIEKL--ESELERFRREVRDKVRKDREIRARDRRI 483


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 27.5 bits (61), Expect = 9.9
 Identities = 20/111 (18%), Positives = 39/111 (35%), Gaps = 10/111 (9%)

Query: 15  KTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEP 74
                LK+            + +  +   +  K+ K K     +  +E+ +  EE     
Sbjct: 202 VKAASLKRNPPK-------KSNIMSSFFKKKTKEKKEKKEASESTVKEESE--EESGKRD 252

Query: 75  EPEEDEVEDGEILDNEDDTKS-QGSVETIDDDIEMVFAAEKKADPKRDMEK 124
              EDE  +   LD ++D    + S E  D + E     ++K    + M +
Sbjct: 253 VILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMME 303


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.134    0.423 

Gapped
Lambda     K      H
   0.267   0.0710    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,519,745
Number of extensions: 1453288
Number of successful extensions: 2891
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2540
Number of HSP's successfully gapped: 170
Length of query: 304
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 208
Effective length of database: 6,679,618
Effective search space: 1389360544
Effective search space used: 1389360544
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (26.5 bits)