RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14440
         (711 letters)



>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score =  239 bits (612), Expect = 1e-73
 Identities = 98/286 (34%), Positives = 143/286 (50%), Gaps = 11/286 (3%)

Query: 382 TLLNARQQVTCAEISEDVSILAVGFSESYIKLWSLVPQKLK--------AMKSSDQLQDI 433
           TL      VTC   S D  +LA G  +  IK+W L   +L          ++      D 
Sbjct: 4   TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADG 63

Query: 434 EMEC---DDILVRIMDERNAETTRTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLL 490
                   D  +R+ D    E  RTL GH   V  ++FSPD  +L S S+D TI++W + 
Sbjct: 64  TYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVE 123

Query: 491 LWRCLVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCV 550
             +CL   +GH   V  + FSP G + AS+  D   +LW   +   +    GH  +V+ V
Sbjct: 124 TGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSV 183

Query: 551 QFHPNSNYVATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNV 610
            F P+   + + SSD TI+LWD  TG  +  L GH+  V +++FS  G  LAS   DG +
Sbjct: 184 AFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTI 243

Query: 611 HMWDLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWD 656
            +WDL  G  +  L+GHT+ V +L +S DG  LAS   D ++++WD
Sbjct: 244 RVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289



 Score =  225 bits (576), Expect = 2e-68
 Identities = 85/207 (41%), Positives = 116/207 (56%)

Query: 453 TRTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLLLWRCLVVYKGHGHPVWDIKFSP 512
            RTL GH G V  ++FSPD  LL + S D TI++W L     L   KGH  PV D+  S 
Sbjct: 2   RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA 61

Query: 513 HGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYVATGSSDRTIRLWD 572
            G Y AS   D+  RLW  ++   +R   GH S V  V F P+   +++ S D+TI++WD
Sbjct: 62  DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD 121

Query: 573 CVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMWDLSNGHLLTLLTGHTDVVR 632
             TG  +  L GH   V +++FS  G F+AS+  DG + +WDL  G  +  LTGHT  V 
Sbjct: 122 VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVN 181

Query: 633 ALCFSRDGNILASSGQDCSVKLWDFAK 659
           ++ FS DG  L SS  D ++KLWD + 
Sbjct: 182 SVAFSPDGEKLLSSSSDGTIKLWDLST 208



 Score =  170 bits (433), Expect = 5e-48
 Identities = 69/168 (41%), Positives = 93/168 (55%)

Query: 494 CLVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFH 553
                KGH   V  + FSP G   A+   D   ++W  ++   LR   GH   V  V   
Sbjct: 1   LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60

Query: 554 PNSNYVATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMW 613
            +  Y+A+GSSD+TIRLWD  TG  VR LTGH + V +++FS  GR L+S+  D  + +W
Sbjct: 61  ADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW 120

Query: 614 DLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWDFAKLS 661
           D+  G  LT L GHTD V ++ FS DG  +ASS QD ++KLWD     
Sbjct: 121 DVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK 168



 Score =  144 bits (365), Expect = 1e-38
 Identities = 60/193 (31%), Positives = 83/193 (43%), Gaps = 31/193 (16%)

Query: 380 CYTLLNARQQVTCAEISEDVSILAVGFSESYIKLWSLVPQKLKAMKSSDQLQDIEMECDD 439
             TL      V     S D + +A    +  IKLW L                       
Sbjct: 128 LTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDL----------------------- 164

Query: 440 ILVRIMDERNAETTRTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLLLWRCLVVYK 499
                   R  +   TL GH G V  ++FSPD   LLS S D TI+LW L   +CL   +
Sbjct: 165 --------RTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLR 216

Query: 500 GHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYV 559
           GH + V  + FSP GY  AS   D   R+W   +   ++   GH + V  + + P+   +
Sbjct: 217 GHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRL 276

Query: 560 ATGSSDRTIRLWD 572
           A+GS+D TIR+WD
Sbjct: 277 ASGSADGTIRIWD 289


>gnl|CDD|176269 cd08044, TAF5_NTD2, TAF5_NTD2 is the second conserved N-terminal
           region of TATA Binding Protein (TBP) Associated Factor 5
           (TAF5), involved in forming Transcription Factor IID
           (TFIID).  The TATA Binding Protein (TBP) Associated
           Factor 5 (TAF5) is one of several TAFs that bind TBP and
           are involved in forming Transcription Factor IID (TFIID)
           complex. TAF5 contains three domains, two conserved
           sequence motifs at the N-terminal and one at the
           C-terminal region. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) involved in accurate initiation of
           transcription by RNA polymerase II in eukaryotes. TFIID
           plays an important role in the recognition of promoter
           DNA and assembly of the preinitiation complex. TFIID
           complex is composed of the TBP and at least 13 TAFs.  In
           yeast and human cells, TAFs have been found as
           components of other complexes besides TFIID. TAF5 may
           play a major role in forming TFIID and its related
           complexes. TAFs from various species were originally
           named by their predicted molecular weight or their
           electrophoretic mobility in polyacrylamide gels. A new,
           unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. TAF5 has a paralog gene (TAF5L) which has
           a redundant function. Several hypotheses are proposed
           for TAFs functions such as serving as activator-binding
           sites, core-promoter recognition or a role in essential
           catalytic activity. C-terminus of TAF5 contains six WD40
           repeats that likely form a closed beta propeller
           structure and may be involved in protein-protein
           interaction. The first part of the TAF5 N-terminal
           (TAF5_NTD1) homodimerizes in the absence of other TAFs.
           The second conserved N-terminal part of TAF5 (TAF5_NTD2)
           has an alpha-helical domain. One study has shown that
           TAF5_NTD2 homodimerizes only at high concentration of
           calcium but not any other metals. No dimerization was
           observed in other structural studies of TAF_NTD2.
           Several TAFs interact via histone-fold (HFD) motifs; HFD
           is the interaction motif involved in heterodimerization
           of the core histones and their assembly into nucleosome
           octamer. However, TAF5 does not have a HFD motif.
          Length = 133

 Score =  167 bits (425), Expect = 9e-49
 Identities = 61/133 (45%), Positives = 91/133 (68%)

Query: 142 PNLYEDAYQDLKKFIENSLDVYKYELGLVLYPVLVHMYIELVYNGHEHQATEMLHKHLMD 201
           PN YE AY  L+K+IE+SLD+YKYEL  +LYP+ VH Y++LV +GH  +A     +   D
Sbjct: 1   PNDYEQAYSKLRKWIESSLDIYKYELSQLLYPIFVHSYLDLVASGHLEEAKSFFERFSGD 60

Query: 202 HEPHYQEDLRKLSSVTKRDQIRTNEIAETFRSTEFIIRMSRDTLSLLKRHLQEKKHTVLL 261
            E  + ED++KLSS+T  + ++ NE+A+ FRS +++IRMSRD  SLL R L+    ++LL
Sbjct: 61  FEDSHSEDIKKLSSITTPEHLKENELAKLFRSNKYVIRMSRDAYSLLLRFLESWGGSLLL 120

Query: 262 NIVQEHLYFDMYE 274
            I+ EH+  D+ +
Sbjct: 121 KILNEHIDIDVRD 133


>gnl|CDD|203028 pfam04494, TFIID_90kDa, WD40 associated region in TFIID subunit.
           This region, possibly a domain is found in subunits of
           transcription factor TFIID. The function of this region
           is unknown.
          Length = 141

 Score =  158 bits (402), Expect = 1e-45
 Identities = 62/141 (43%), Positives = 88/141 (62%), Gaps = 3/141 (2%)

Query: 132 LLSAYESEGDPNLYEDAYQDLKKFIENSLDVYKYELGLVLYPVLVHMYIELVYNGHEHQA 191
           +LSA   E DP  YE AY  L+ +IE+SLD+YK EL  +LYP+ VH Y++LV  GH  +A
Sbjct: 1   VLSASPEESDPEQYERAYSLLRNWIEDSLDIYKPELSRLLYPLFVHSYLDLVAKGHPSEA 60

Query: 192 TEMLHKHLMDHEPHYQEDLRKLSSVTKRDQIRTNEIAETFRSTEFIIRMSRDTLSLLKRH 251
                K   D E  + ED+ KL S++  + ++ NE+A  FR  ++ IR+SRD+ SLL R 
Sbjct: 61  RSFFDKFHGDFEQLHGEDIEKLRSISLPEHLKENELARAFRDNKYRIRLSRDSFSLLLRF 120

Query: 252 LQEKKHTV---LLNIVQEHLY 269
           LQE ++T    LL I+ +HL 
Sbjct: 121 LQENENTGGSLLLRILNQHLQ 141


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score =  159 bits (402), Expect = 4e-42
 Identities = 111/331 (33%), Positives = 150/331 (45%), Gaps = 23/331 (6%)

Query: 374 SLPSIACYTLLNARQQVTCAEISEDVSILAVGFSESYIKLWSLVPQKLK----AMKSSDQ 429
           SLP ++   L      +T    S D  +L  G S+  IKLW L   +             
Sbjct: 52  SLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSS 111

Query: 430 LQDIE------------MECDDILVRIMD-ERNAETTRTLVGHEGPVRKLSFSPDRSLLL 476
           +  +                 D  V++ D     +  RTL GH   V  L+FSPD  LL 
Sbjct: 112 VSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLA 171

Query: 477 SCSQ-DNTIRLWSLLLWRCLVVYKGHGHPVWDIKFSPHG-YYFASAGHDRMARLWATDSY 534
           S S  D TI+LW L   + L    GH  PV  + FSP G    AS   D   RLW   + 
Sbjct: 172 SGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTG 231

Query: 535 HPLRLFVGHYSDVDCVQFHPNSNYVATGSSDRTIRLWDCVTG-SHVRLLTGHKAPVYALS 593
             LR  +  +SD     F P+ + +A+GSSD TIRLWD  +  S +R L+GH + V +++
Sbjct: 232 KLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVA 291

Query: 594 FSVCGRFLASAGGDGNVHMWDLSNGHLLTLLT--GHTDVVRALCFSRDGNILASSG-QDC 650
           FS  G+ LAS   DG V +WDL  G LL+ LT  GH   V +L FS DG++L S G  D 
Sbjct: 292 FSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDG 351

Query: 651 SVKLWDFAKLSDEVGTEEVNLSHSADIKNSS 681
           +++LWD          E  +   S       
Sbjct: 352 TIRLWDLRTGKPLKTLEGHSNVLSVSFSPDG 382



 Score =  156 bits (395), Expect = 3e-41
 Identities = 113/311 (36%), Positives = 144/311 (46%), Gaps = 23/311 (7%)

Query: 390 VTCAEISEDVSILAVG-FSESYIKLWSLVPQKLKA--MKSSDQLQDIEMECD-------- 438
           VT    S D  +LA G   +  IKLW L   K  +     +D +  +    D        
Sbjct: 158 VTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASG 217

Query: 439 --DILVRIMDERNAETTR-TLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLL-LWRC 494
             D  +R+ D    +  R TL GH   V   SFSPD SLL S S D TIRLW L      
Sbjct: 218 SSDGTIRLWDLSTGKLLRSTLSGHSDSV-VSSFSPDGSLLASGSSDGTIRLWDLRSSSSL 276

Query: 495 LVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSY--HPLRLFVGHYSDVDCVQF 552
           L    GH   V  + FSP G   AS   D   RLW  ++          GH   V  + F
Sbjct: 277 LRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSF 336

Query: 553 HPNSNYVATG-SSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVH 611
            P+ + + +G S D TIRLWD  TG  ++ L GH   V ++SFS  GR ++S   DG V 
Sbjct: 337 SPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHS-NVLSVSFSPDGRVVSSGSTDGTVR 395

Query: 612 MWDLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWDFAKLSDEV---GTEE 668
           +WDLS G LL  L GHT  V +L FS DG  LAS   D +++LWD       V      +
Sbjct: 396 LWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPDGK 455

Query: 669 VNLSHSADIKN 679
           V  S S+D+  
Sbjct: 456 VLASKSSDLSV 466



 Score =  140 bits (352), Expect = 2e-35
 Identities = 91/286 (31%), Positives = 133/286 (46%), Gaps = 13/286 (4%)

Query: 406 FSESYIKLWSLVPQKLKAMKSSDQLQDIEMECDDILVRIMDERNAETTRTLVGHEGPVRK 465
                +K   L P        S    +  +    +L   +      ++  L GHE  +  
Sbjct: 11  NKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITS 70

Query: 466 LSFSPDRSLLLSCSQDNTIRLWSLLL-WRCLVVYKG-HGHPVWDIKF-SPHGYYFASAG- 521
           ++FSPD  LLLS S D TI+LW L    + +   +G H   V  +   SP G     A  
Sbjct: 71  IAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASS 130

Query: 522 -HDRMARLWA-TDSYHPLRLFVGHYSDVDCVQFHPNSNYVATGSS-DRTIRLWDCVTGSH 578
             D   +LW  +     +R   GH   V  + F P+   +A+GSS D TI+LWD  TG  
Sbjct: 131 SLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKP 190

Query: 579 VRLLTGHKAPVYALSFSVCGRF-LASAGGDGNVHMWDLSNGHLL-TLLTGHTDVVRALCF 636
           +  L GH  PV +L+FS  G   +AS   DG + +WDLS G LL + L+GH+D V    F
Sbjct: 191 LSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV-VSSF 249

Query: 637 SRDGNILASSGQDCSVKLWDFAKLSDEVGTEEVNLSHSADIKNSSL 682
           S DG++LAS   D +++LWD    S  +        HS+ + + + 
Sbjct: 250 SPDGSLLASGSSDGTIRLWDLRSSSSLL---RTLSGHSSSVLSVAF 292



 Score = 97.1 bits (240), Expect = 3e-21
 Identities = 78/231 (33%), Positives = 98/231 (42%), Gaps = 43/231 (18%)

Query: 381 YTLLNARQQVTCAEISEDVSILAVGFSESYIKLWSLVPQKLKAMKSSDQLQDIEMECDDI 440
            TL      V     S D  +LA G S+  ++LW L                        
Sbjct: 278 RTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDL------------------------ 313

Query: 441 LVRIMDERNAETTRTLVGHEGPVRKLSFSPDRS-LLLSCSQDNTIRLWSLLLWRCLVVYK 499
                +     ++ TL GHEGPV  LSFSPD S L+   S D TIRLW L   + L   +
Sbjct: 314 -----ETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLE 368

Query: 500 GHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYV 559
           GH   V  + FSP G   +S   D   RLW   +   LR   GH S V  + F P+   +
Sbjct: 369 GH-SNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSL 427

Query: 560 ATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNV 610
           A+GSSD TIRLWD  T                +SFS  G+ LAS   D +V
Sbjct: 428 ASGSSDNTIRLWDLKTSLKS------------VSFSPDGKVLASKSSDLSV 466


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 55.4 bits (134), Expect = 3e-10
 Identities = 17/37 (45%), Positives = 22/37 (59%)

Query: 536 PLRLFVGHYSDVDCVQFHPNSNYVATGSSDRTIRLWD 572
            LR   GH   V  V F P+ N +A+GS D T+R+WD
Sbjct: 3   LLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 53.9 bits (130), Expect = 1e-09
 Identities = 20/38 (52%), Positives = 26/38 (68%)

Query: 451 ETTRTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWS 488
           +  RTL GH GPV  ++FSPD +LL S S D T+R+W 
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 53.9 bits (130), Expect = 1e-09
 Identities = 20/39 (51%), Positives = 26/39 (66%)

Query: 618 GHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWD 656
           G LL  L GHT  V ++ FS DGN+LAS   D +V++WD
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 51.6 bits (124), Expect = 6e-09
 Identities = 15/38 (39%), Positives = 18/38 (47%)

Query: 492 WRCLVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLW 529
            + L   KGH  PV  + FSP G   AS   D   R+W
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 50.0 bits (120), Expect = 2e-08
 Identities = 18/39 (46%), Positives = 23/39 (58%)

Query: 576 GSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMWD 614
           G  +R L GH  PV +++FS  G  LAS   DG V +WD
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 54.6 bits (132), Expect = 6e-10
 Identities = 18/40 (45%), Positives = 23/40 (57%)

Query: 533 SYHPLRLFVGHYSDVDCVQFHPNSNYVATGSSDRTIRLWD 572
           S   L+   GH   V  V F P+  Y+A+GS D TI+LWD
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 53.1 bits (128), Expect = 2e-09
 Identities = 20/40 (50%), Positives = 25/40 (62%)

Query: 617 NGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWD 656
           +G LL  L GHT  V ++ FS DG  LAS   D ++KLWD
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 53.1 bits (128), Expect = 2e-09
 Identities = 19/35 (54%), Positives = 23/35 (65%)

Query: 454 RTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWS 488
           +TL GH GPV  ++FSPD   L S S D TI+LW 
Sbjct: 6   KTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 51.5 bits (124), Expect = 6e-09
 Identities = 16/38 (42%), Positives = 18/38 (47%)

Query: 492 WRCLVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLW 529
              L   KGH  PV  + FSP G Y AS   D   +LW
Sbjct: 2   GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39



 Score = 50.8 bits (122), Expect = 1e-08
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query: 575 TGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMWD 614
           +G  ++ L GH  PV +++FS  G++LAS   DG + +WD
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 55.7 bits (134), Expect = 5e-08
 Identities = 51/229 (22%), Positives = 92/229 (40%), Gaps = 42/229 (18%)

Query: 456 LVGHEGPVRKLSFSP-DRSLLLSCSQDNTIRLWSLLLWRCLVVYKGHGHPVWDIKFSPHG 514
           L+G EGP+  ++F+P D   L + S+D TI  W +              P   +      
Sbjct: 71  LLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGI--------------PEEGLT----- 111

Query: 515 YYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYV-ATGSSDRTIRLWDC 573
                            +   P+    GH   V  V FHP++  V A+  +D  + +WD 
Sbjct: 112 ----------------QNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDV 155

Query: 574 VTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMWDLSNGHLLTLLTGH--TDVV 631
             G  V ++  H   + +L +++ G  L +   D  +++ D  +G +++ +  H      
Sbjct: 156 ERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQ 215

Query: 632 RALCFSRDGNILA---SSGQDCSVKLWDFAKLSDEVGTEEVNLSHSADI 677
           R L   R   I+    S  Q   + LWD  K++    T +++ S +  I
Sbjct: 216 RCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFI 264



 Score = 51.4 bits (123), Expect = 1e-06
 Identities = 35/128 (27%), Positives = 61/128 (47%), Gaps = 11/128 (8%)

Query: 539 LFVGHYSDVDCVQFHP-NSNYVATGSSDRTIRLWDCV-------TGSHVRLLTGHKAPVY 590
           + +G    +  V F+P +   + T S D TI  W              +  L GH   V 
Sbjct: 70  ILLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVG 129

Query: 591 ALSF--SVCGRFLASAGGDGNVHMWDLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQ 648
            +SF  S     LASAG D  V++WD+  G  + ++  H+D + +L ++ DG++L ++ +
Sbjct: 130 IVSFHPSAMN-VLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSK 188

Query: 649 DCSVKLWD 656
           D  + + D
Sbjct: 189 DKKLNIID 196



 Score = 32.9 bits (75), Expect = 0.56
 Identities = 31/112 (27%), Positives = 50/112 (44%), Gaps = 18/112 (16%)

Query: 581 LLTGHKAPVYALSFS-VCGRFLASAGGDGNVHMWDLSNGHL-------LTLLTGHTDVVR 632
           +L G + P+  ++F+    + L +A  DG +  W +    L       +  L GHT  V 
Sbjct: 70  ILLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVG 129

Query: 633 ALCFSRDG-NILASSGQDCSVKLWDFAK---------LSDEVGTEEVNLSHS 674
            + F     N+LAS+G D  V +WD  +          SD++ + E NL  S
Sbjct: 130 IVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGS 181


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 50.6 bits (121), Expect = 3e-07
 Identities = 29/82 (35%), Positives = 35/82 (42%), Gaps = 11/82 (13%)

Query: 1   MDQLPSLMDDPPQESFHSVPII--QQTLSPQQP--PVSPQHQNMMSPIQQH--SLSPHTQ 54
             Q P      P    H +P++  QQ + PQQP  PV  QH   M+P Q H  +L    Q
Sbjct: 43  SQQHPPTHTLQPH---HHIPVLPAQQPVVPQQPLMPVPGQHS--MTPTQHHQPNLPQPAQ 97

Query: 55  APSPQPPAQPQQVPSPYHTPPP 76
            P    P QP Q   P    PP
Sbjct: 98  QPFQPQPLQPPQPQQPMQPQPP 119



 Score = 37.5 bits (87), Expect = 0.007
 Identities = 17/62 (27%), Positives = 19/62 (30%), Gaps = 4/62 (6%)

Query: 7   LMDDPPQESF----HSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA 62
           LM  P Q S     H  P + Q       P   Q      P+Q            PQPP 
Sbjct: 73  LMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPL 132

Query: 63  QP 64
            P
Sbjct: 133 PP 134



 Score = 31.3 bits (71), Expect = 0.84
 Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 5/64 (7%)

Query: 19 VPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
          +P+ QQ   P    + P H   + P  Q  + P  Q P    P Q    P+ +H P   Q
Sbjct: 40 IPVSQQH--PPTHTLQPHHHIPVLP-AQQPVVP--QQPLMPVPGQHSMTPTQHHQPNLPQ 94

Query: 79 TSKQ 82
           ++Q
Sbjct: 95 PAQQ 98



 Score = 29.4 bits (66), Expect = 3.6
 Identities = 17/50 (34%), Positives = 20/50 (40%), Gaps = 1/50 (2%)

Query: 29 QQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
          Q  PVS QH      +Q H   P   A  P  P QP       H+  P+Q
Sbjct: 38 QIIPVSQQHPPTH-TLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQ 86


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 50.2 bits (120), Expect = 3e-06
 Identities = 43/223 (19%), Positives = 65/223 (29%), Gaps = 31/223 (13%)

Query: 1   MDQLPSLMDD--PPQE-SFHSVPIIQQTLSPQQPPVSP---QHQNMMSPIQQHSLSPHTQ 54
           + QL    DD    +    H  P+  Q  +  QP  +       +  S   Q S      
Sbjct: 359 LAQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSAT 418

Query: 55  APSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNT------LITMLKLLRQYNLRGTE 108
            P+  PP      P+     PPS   +    A  K         ++ L       ++   
Sbjct: 419 QPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKA 478

Query: 109 EQLKKEINVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLDVYKYELG 168
           EQ    I     G      EI         E D   Y   +   +   E +L   K  + 
Sbjct: 479 EQATGNIKEAPTGTQK---EIFT-------EEDLQYYWQEFAGTRPQEEKAL---KETMI 525

Query: 169 LVLYPVLVHMYIELVYNGHEHQATEMLHKHLMDHEPHYQEDLR 211
                +L     E+  + +E Q      K L +  P     LR
Sbjct: 526 NCRPKLLNGTTFEVAVD-NELQE-----KELTNLIPDLLGFLR 562


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 49.2 bits (117), Expect = 5e-06
 Identities = 23/87 (26%), Positives = 27/87 (31%), Gaps = 7/87 (8%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
             Q P     P Q     V  + Q  +PQQ P  P       P       P  Q  + QP
Sbjct: 89  QSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPP-------PAQQPQAQQP 141

Query: 61  PAQPQQVPSPYHTPPPSQTSKQSGKAP 87
              PQ      +  PP Q   Q    P
Sbjct: 142 QPPPQVPQQQQYQSPPQQPQYQQNPPP 168



 Score = 46.1 bits (109), Expect = 4e-05
 Identities = 24/78 (30%), Positives = 29/78 (37%), Gaps = 4/78 (5%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP-S 69
           PP +         Q   PQ PP  PQ Q   SP QQ       Q P PQ  + PQ     
Sbjct: 124 PPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQ---QNPPPQAQSAPQVSGLY 180

Query: 70  PYHTPPPSQTSKQSGKAP 87
           P  +P   Q+   +   P
Sbjct: 181 PEESPYQPQSYPPNEPLP 198



 Score = 41.5 bits (97), Expect = 0.001
 Identities = 21/73 (28%), Positives = 24/73 (32%), Gaps = 5/73 (6%)

Query: 17  HSVPIIQQTLSPQQPPVSPQHQNMMSPI--QQHSLSPHTQAPSPQPPAQPQQVPSPYHTP 74
             V          Q P   Q Q   +P   Q  S  P  Q  S      PQQ P   + P
Sbjct: 67  QQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEP---YYP 123

Query: 75  PPSQTSKQSGKAP 87
           PPSQ      + P
Sbjct: 124 PPSQPQPPPAQQP 136



 Score = 38.8 bits (90), Expect = 0.008
 Identities = 20/69 (28%), Positives = 27/69 (39%), Gaps = 3/69 (4%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSP-QPPAQPQQVPS 69
           P Q+ + S P  QQ    Q PP   Q    +S +         Q+  P +P      +  
Sbjct: 148 PQQQQYQSPP--QQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQP 205

Query: 70  PYHTPPPSQ 78
           PY   PPSQ
Sbjct: 206 PYSGAPPSQ 214



 Score = 36.8 bits (85), Expect = 0.032
 Identities = 23/83 (27%), Positives = 32/83 (38%), Gaps = 6/83 (7%)

Query: 11  PPQESFHSVPIIQQ--TLSPQ--QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP-- 64
           PP  +  S    QQ  + +P   Q P     Q + S  QQ +       P P  P  P  
Sbjct: 74  PPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPA 133

Query: 65  QQVPSPYHTPPPSQTSKQSGKAP 87
           QQ  +    PPP    +Q  ++P
Sbjct: 134 QQPQAQQPQPPPQVPQQQQYQSP 156



 Score = 35.7 bits (82), Expect = 0.075
 Identities = 15/76 (19%), Positives = 22/76 (28%), Gaps = 1/76 (1%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
                        Q ++   PP +P  Q+     QQ S +P +     Q P Q  Q    
Sbjct: 54  EQVAKHELADAPLQQVNAALPP-APAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQ 112

Query: 71  YHTPPPSQTSKQSGKA 86
             TP          + 
Sbjct: 113 QPTPQQEPYYPPPSQP 128



 Score = 34.1 bits (78), Expect = 0.26
 Identities = 8/79 (10%), Positives = 17/79 (21%), Gaps = 3/79 (3%)

Query: 10  DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
           D  ++      + +  L+             + P          Q    Q P   Q    
Sbjct: 45  DSTKQPPAPEQVAKHELADAPLQQVNA---ALPPAPAPQSPQPDQQQQSQAPPSHQYPSQ 101

Query: 70  PYHTPPPSQTSKQSGKAPD 88
                  S   + + +   
Sbjct: 102 LPPQQVQSVPQQPTPQQEP 120



 Score = 33.0 bits (75), Expect = 0.58
 Identities = 23/69 (33%), Positives = 29/69 (42%), Gaps = 9/69 (13%)

Query: 23  QQTLSPQQPPVSPQ---HQNMMSPIQQHSLS-PHTQAP-SPQPPAQPQQVPSPYHT---- 73
               S +QPP   Q   H+   +P+QQ + + P   AP SPQP  Q Q    P H     
Sbjct: 42  FHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQ 101

Query: 74  PPPSQTSKQ 82
            PP Q    
Sbjct: 102 LPPQQVQSV 110


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 48.6 bits (116), Expect = 9e-06
 Identities = 25/79 (31%), Positives = 27/79 (34%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
             Q P    +  Q         Q    P  PP  PQ    +   Q   LS     P PQP
Sbjct: 203 YPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQP 262

Query: 61  PAQPQQVPSPYHTPPPSQT 79
           P Q QQ P P   PPP   
Sbjct: 263 PQQQQQPPQPQAQPPPQNQ 281



 Score = 47.5 bits (113), Expect = 2e-05
 Identities = 25/88 (28%), Positives = 27/88 (30%), Gaps = 3/88 (3%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPS-PQ 59
             +        P  S            PQQPP   Q Q      Q     P  Q P   Q
Sbjct: 210 HPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQM--PPPPPQPPQQQQ 267

Query: 60  PPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
            P QPQ  P P + P P     Q   AP
Sbjct: 268 QPPQPQAQPPPQNQPTPHPGLPQGQNAP 295



 Score = 47.1 bits (112), Expect = 3e-05
 Identities = 14/74 (18%), Positives = 15/74 (20%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
                         +     PP  P  Q    P  Q    P  Q        Q Q  P P
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLP 297

Query: 71  YHTPPPSQTSKQSG 84
               P      Q  
Sbjct: 298 PPQQPQLLPLVQQP 311



 Score = 42.8 bits (101), Expect = 6e-04
 Identities = 16/72 (22%), Positives = 20/72 (27%), Gaps = 1/72 (1%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
            PQ          Q   P+QPP  PQ         Q        + +P  P  P Q+P  
Sbjct: 180 LPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQ 239

Query: 71  YHTPPPSQTSKQ 82
              P        
Sbjct: 240 -PPPLQQPQFPG 250



 Score = 42.4 bits (100), Expect = 8e-04
 Identities = 19/69 (27%), Positives = 23/69 (33%), Gaps = 4/69 (5%)

Query: 23  QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHT----PPPSQ 78
                PQ  P   Q Q  +    Q    P      PQ P   QQ   P  +    PPP Q
Sbjct: 202 GYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQ 261

Query: 79  TSKQSGKAP 87
             +Q  + P
Sbjct: 262 PPQQQQQPP 270



 Score = 42.1 bits (99), Expect = 0.001
 Identities = 22/71 (30%), Positives = 24/71 (33%), Gaps = 7/71 (9%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPH---TQAPSPQPPAQPQQV 67
            P  S    P   Q   PQQ    PQ Q    P  Q +  P     Q     PP QPQ +
Sbjct: 248 FPGLSQQMPPPPPQP--PQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLL 305

Query: 68  PSPYHTPPPSQ 78
           P      P  Q
Sbjct: 306 PL--VQQPQGQ 314



 Score = 41.7 bits (98), Expect = 0.001
 Identities = 20/78 (25%), Positives = 23/78 (29%), Gaps = 3/78 (3%)

Query: 3   QLPSLMDDPPQESFHSVPIIQQTLSPQ--QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
           Q P     PPQ      P  Q T  P   Q   +P        +      P  Q   PQ 
Sbjct: 261 QPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQF 320

Query: 61  PAQPQQVPSPYHTPPPSQ 78
             Q  Q+ S       SQ
Sbjct: 321 REQLVQL-SQQQREALSQ 337



 Score = 39.8 bits (93), Expect = 0.006
 Identities = 20/88 (22%), Positives = 23/88 (26%), Gaps = 5/88 (5%)

Query: 3   QLPSLMDDPPQESFHSVP---IIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQ 59
           Q        PQ     +P     +Q   PQQ P          P          Q   P 
Sbjct: 164 QQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFL-PA 222

Query: 60  PPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
           P   P Q P P    P      Q  + P
Sbjct: 223 PSQAPAQPPLP-PQLPQQPPPLQQPQFP 249



 Score = 38.6 bits (90), Expect = 0.013
 Identities = 15/72 (20%), Positives = 18/72 (25%), Gaps = 3/72 (4%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
           PP          Q      QPP  PQ+Q    P      +     P  QP   P      
Sbjct: 256 PPPPPQPPQQQQQPPQPQAQPP--PQNQPTPHPGLPQGQNA-PLPPPQQPQLLPLVQQPQ 312

Query: 71  YHTPPPSQTSKQ 82
                P    + 
Sbjct: 313 GQQRGPQFREQL 324



 Score = 35.5 bits (82), Expect = 0.10
 Identities = 22/83 (26%), Positives = 26/83 (31%), Gaps = 7/83 (8%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
           PSL    P+                QP      Q M+S  +  +     Q   PQ P  P
Sbjct: 123 PSLWGTAPKPEPQPPQA-----PESQPQPQTPAQKMLSLEEVEAQLQQRQQA-PQLPQPP 176

Query: 65  QQVPSPYHTPPPSQTSKQSGKAP 87
           QQV  P   PP      Q G   
Sbjct: 177 QQVL-PQGMPPRQAAFPQQGPPE 198



 Score = 34.7 bits (80), Expect = 0.18
 Identities = 18/84 (21%), Positives = 23/84 (27%), Gaps = 6/84 (7%)

Query: 6   SLMDDPPQESFHSVPIIQQTLSPQQPPVSPQ----HQNMMS--PIQQHSLSPHTQAPSPQ 59
            + D  P  S        +   PQ P   PQ     Q M+S   ++            PQ
Sbjct: 115 YIPDLQPDPSLWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQ 174

Query: 60  PPAQPQQVPSPYHTPPPSQTSKQS 83
           PP Q      P       Q     
Sbjct: 175 PPQQVLPQGMPPRQAAFPQQGPPE 198



 Score = 34.7 bits (80), Expect = 0.19
 Identities = 21/100 (21%), Positives = 28/100 (28%), Gaps = 16/100 (16%)

Query: 4   LPSLMDDPPQESFHSVP--------------IIQQTLSPQQPPVSPQHQNMMSPIQ--QH 47
                +  P ++  S P              +  Q    QQ P  PQ    + P      
Sbjct: 128 TAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPR 187

Query: 48  SLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
             +   Q P  QPP  PQ          P Q      +AP
Sbjct: 188 QAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAP 227


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 48.5 bits (115), Expect = 1e-05
 Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 4/102 (3%)

Query: 559 VATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSF-SVCGRFLASAGGDGNVHMWDLSN 617
           +A+GS D +++LW    G  +  +   KA +  + F S  GR LA    D  V+ +DL N
Sbjct: 591 LASGSDDGSVKLWSINQGVSIGTIKT-KANICCVQFPSESGRSLAFGSADHKVYYYDLRN 649

Query: 618 GHL-LTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWDFA 658
             L L  + GH+  V  + F  D + L SS  D ++KLWD +
Sbjct: 650 PKLPLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNTLKLWDLS 690



 Score = 38.5 bits (89), Expect = 0.011
 Identities = 44/188 (23%), Positives = 84/188 (44%), Gaps = 19/188 (10%)

Query: 439 DILVRIMDERNAETTRTLVGHEGPVRKLSFSP-DRSLLLSCSQDNTIRLWSLLLWRCLVV 497
           + +V++ D   ++    +  HE  V  + +S  D +LL S S D +++LWS+     +  
Sbjct: 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGT 613

Query: 498 YKGHGHPVWDIKF---SPHGYYFASAGHDRMARLWATDSYHP---LRLFVGHYSDVDCVQ 551
            K   + +  ++F   S     F SA H    +++  D  +P   L   +GH   V  V+
Sbjct: 614 IKTKAN-ICCVQFPSESGRSLAFGSADH----KVYYYDLRNPKLPLCTMIGHSKTVSYVR 668

Query: 552 FHPNSNYVATGSSDRTIRLWDC------VTGSHVRLLTGHKAPVYALSFSVCGRFLASAG 605
           F  +S  V++ S+D T++LWD       +  + +    GH      +  SV   ++A+  
Sbjct: 669 FVDSSTLVSS-STDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGS 727

Query: 606 GDGNVHMW 613
               V ++
Sbjct: 728 ETNEVFVY 735



 Score = 36.6 bits (84), Expect = 0.047
 Identities = 34/106 (32%), Positives = 49/106 (46%), Gaps = 13/106 (12%)

Query: 396 SEDVSILAVGFSESYIKLWSLVPQ------KLKA-----MKSSDQLQDIEMECDDILVRI 444
           S D ++LA G  +  +KLWS+         K KA        S+  + +     D  V  
Sbjct: 585 SADPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYY 644

Query: 445 MDERNAETTR-TLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSL 489
            D RN +    T++GH   V  + F  D S L+S S DNT++LW L
Sbjct: 645 YDLRNPKLPLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNTLKLWDL 689



 Score = 36.2 bits (83), Expect = 0.065
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 601 LASAGGDGNVHMWDLSNGHLLTLLTGHTDVVRALCFSR-DGNILASSGQDCSVKLW 655
           +AS+  +G V +WD++   L+T +  H   V ++ +S  D  +LAS   D SVKLW
Sbjct: 548 VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLW 603



 Score = 34.3 bits (78), Expect = 0.23
 Identities = 32/159 (20%), Positives = 65/159 (40%), Gaps = 9/159 (5%)

Query: 508 IKFSPHGYYFASAGHDRMARLWATDS--------YHPLRLFVGHYSDVDCVQFHPNSNYV 559
           I F   G +FA+AG ++  +++  +S        ++P+                   + V
Sbjct: 489 IGFDRDGEFFATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQV 548

Query: 560 ATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCG-RFLASAGGDGNVHMWDLSNG 618
           A+ + +  +++WD      V  +  H+  V+++ +S      LAS   DG+V +W ++ G
Sbjct: 549 ASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQG 608

Query: 619 HLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWDF 657
             +  +    ++      S  G  LA    D  V  +D 
Sbjct: 609 VSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDL 647


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 43.1 bits (102), Expect = 3e-04
 Identities = 17/87 (19%), Positives = 25/87 (28%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
            D+   +       SF        + SP  PP S           +   SP   +  P P
Sbjct: 157 EDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAP 216

Query: 61  PAQPQQVPSPYHTPPPSQTSKQSGKAP 87
            +     P P    P + +      AP
Sbjct: 217 SSFQSDTPPPSPESPTNPSPPPGPAAP 243



 Score = 35.0 bits (81), Expect = 0.098
 Identities = 20/99 (20%), Positives = 26/99 (26%), Gaps = 8/99 (8%)

Query: 5   PSLMDDPPQESFHSVPIIQQTL-------SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPS 57
            S     P  S   VP                 PP     Q+   P    S +  +  P 
Sbjct: 180 ASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPG 239

Query: 58  PQPPAQPQQVPSPYH-TPPPSQTSKQSGKAPDKNTLITM 95
           P  P  P     P   T  P+  S  +  AP     +  
Sbjct: 240 PAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDD 278



 Score = 32.7 bits (75), Expect = 0.52
 Identities = 22/82 (26%), Positives = 31/82 (37%), Gaps = 6/82 (7%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
           +   PS  +DP   S  S+P    +     PP SP+     SP       P   AP P P
Sbjct: 194 VPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPP------PGPAAPPPPP 247

Query: 61  PAQPQQVPSPYHTPPPSQTSKQ 82
             Q   + +   TPP +  +  
Sbjct: 248 VQQVPPLSTAKPTPPSASATPA 269



 Score = 30.0 bits (68), Expect = 4.0
 Identities = 10/78 (12%), Positives = 16/78 (20%), Gaps = 1/78 (1%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
            S +   P       P           P          P+QQ       + P+P   +  
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAK-PTPPSASAT 267

Query: 65  QQVPSPYHTPPPSQTSKQ 82
                       +    Q
Sbjct: 268 PAPIGGITLDDDAIAKAQ 285



 Score = 29.3 bits (66), Expect = 7.0
 Identities = 10/79 (12%), Positives = 13/79 (16%), Gaps = 5/79 (6%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
           PS    PP  S         +      P  P       P            P       P
Sbjct: 207 PSDSSLPPAPSSFQSDTPPPSPESPTNPSPPP-----GPAAPPPPPVQQVPPLSTAKPTP 261

Query: 65  QQVPSPYHTPPPSQTSKQS 83
               +             +
Sbjct: 262 PSASATPAPIGGITLDDDA 280


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 42.0 bits (99), Expect = 6e-04
 Identities = 15/72 (20%), Positives = 25/72 (34%), Gaps = 2/72 (2%)

Query: 10  DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
           +   +   S   IQQ    QQPP +      ++P +Q +     Q P+P        V  
Sbjct: 109 EQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168

Query: 70  PYHTPPPSQTSK 81
                 P   ++
Sbjct: 169 A--VEAPKVEAE 178



 Score = 40.0 bits (94), Expect = 0.002
 Identities = 22/93 (23%), Positives = 34/93 (36%), Gaps = 6/93 (6%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSL--SPHTQAPSP 58
           ++Q+ + M   P      VP  +QT    +  V  Q Q         +    P T     
Sbjct: 88  LEQMQADMRQQPT-QLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQT 146

Query: 59  QPPAQPQQVPSPYHTPP--PSQTSKQSGKAPDK 89
             P QPQ  P+P  T P  P   + ++ K   +
Sbjct: 147 TAPVQPQT-PAPVRTQPAAPVTQAVEAPKVEAE 178


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 42.2 bits (99), Expect = 8e-04
 Identities = 26/84 (30%), Positives = 39/84 (46%), Gaps = 10/84 (11%)

Query: 456 LVGHEGPVRKLSFSPDRS-LLLSCSQDNTIRLWSLL--------LWRCLVVYKGHGHPVW 506
           L GH   +  L F+P  S +L S S+D TIR+W +         +     + KGH   + 
Sbjct: 70  LKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKIS 129

Query: 507 DIKFSPHGYY-FASAGHDRMARLW 529
            I ++P  YY   S+G D    +W
Sbjct: 130 IIDWNPMNYYIMCSSGFDSFVNIW 153



 Score = 40.7 bits (95), Expect = 0.002
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 10/86 (11%)

Query: 542 GHYSDVDCVQFHP-NSNYVATGSSDRTIRLWDCV-TGSHVR-------LLTGHKAPVYAL 592
           GH S +  +QF+P  S  +A+GS D TIR+W+       V+       +L GHK  +  +
Sbjct: 72  GHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISII 131

Query: 593 SFSVCGRF-LASAGGDGNVHMWDLSN 617
            ++    + + S+G D  V++WD+ N
Sbjct: 132 DWNPMNYYIMCSSGFDSFVNIWDIEN 157



 Score = 37.6 bits (87), Expect = 0.021
 Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 10/99 (10%)

Query: 568 IRLWDCVTGSHVRLLTGHKAPVYALSFSVC-GRFLASAGGDGNVHMWDLSNGHL------ 620
           IRL + +    V  L GH + +  L F+ C    LAS   D  + +W++ +         
Sbjct: 56  IRLENQMRKPPVIKLKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIK 115

Query: 621 --LTLLTGHTDVVRALCFS-RDGNILASSGQDCSVKLWD 656
               +L GH   +  + ++  +  I+ SSG D  V +WD
Sbjct: 116 DPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWD 154


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 42.2 bits (99), Expect = 0.001
 Identities = 17/78 (21%), Positives = 23/78 (29%), Gaps = 6/78 (7%)

Query: 11   PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
                S  S  +          P +P               P    P P PP +PQ   +P
Sbjct: 2890 AVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP----PPQPQPPPPPPPRPQPPLAP 2945

Query: 71   YHTPPPSQTSKQSGKAPD 88
              T  P+   + SG  P 
Sbjct: 2946 --TTDPAGAGEPSGAVPQ 2961



 Score = 41.5 bits (97), Expect = 0.002
 Identities = 21/74 (28%), Positives = 24/74 (32%), Gaps = 1/74 (1%)

Query: 11   PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
               ESF   P   +     Q P  PQ Q    P  Q    P    P PQPP  P   P+ 
Sbjct: 2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP-PRPQPPLAPTTDPAG 2951

Query: 71   YHTPPPSQTSKQSG 84
               P  +      G
Sbjct: 2952 AGEPSGAVPQPWLG 2965



 Score = 39.9 bits (93), Expect = 0.006
 Identities = 22/63 (34%), Positives = 26/63 (41%), Gaps = 2/63 (3%)

Query: 30   QPPVSPQHQNMMSPIQQHSLSPHTQAPS-PQPPAQPQQVPSPYHTPPPSQTSKQSGKAPD 88
            +P VS   ++   P  Q    P  QAP  PQP  QP   P P   PPP     Q   AP 
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP-QPPPPPPPRPQPPLAPT 2946

Query: 89   KNT 91
             + 
Sbjct: 2947 TDP 2949



 Score = 33.8 bits (77), Expect = 0.44
 Identities = 13/61 (21%), Positives = 16/61 (26%), Gaps = 4/61 (6%)

Query: 27   SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
            +P +PPV      +  P    S       P         Q P P    P      Q    
Sbjct: 2877 APARPPVR----RLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932

Query: 87   P 87
            P
Sbjct: 2933 P 2933



 Score = 33.0 bits (75), Expect = 0.74
 Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 3/67 (4%)

Query: 11   PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHT-QAPSPQPPAQPQQVPS 69
            PP     SVP  +    P +P V+ + +   +P Q     P         P       P 
Sbjct: 2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSA--RPRAPVDDRGDPRGPAPPSPL 2617

Query: 70   PYHTPPP 76
            P  T  P
Sbjct: 2618 PPDTHAP 2624



 Score = 32.2 bits (73), Expect = 1.3
 Identities = 20/75 (26%), Positives = 25/75 (33%), Gaps = 8/75 (10%)

Query: 9    DDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP 68
            D PPQ +    P+  +       P SP       P   H+  P    PSP P A      
Sbjct: 2590 DAPPQSARPRAPVDDRGDPRGPAPPSP------LPPDTHAPDP--PPPSPSPAANEPDPH 2641

Query: 69   SPYHTPPPSQTSKQS 83
             P   PPP +     
Sbjct: 2642 PPPTVPPPERPRDDP 2656



 Score = 31.8 bits (72), Expect = 1.8
 Identities = 23/85 (27%), Positives = 31/85 (36%), Gaps = 17/85 (20%)

Query: 9    DDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLS----------PHTQAPSP 58
            D PP  S    P     + P +P   P H  M++ I    L           P    P+ 
Sbjct: 2507 DAPPAPS-RLAP----AILPDEPVGEPVHPRMLTWI--RGLEELASDDAGDPPPPLPPAA 2559

Query: 59   QPPAQPQQVPSPYHTPPPSQTSKQS 83
             P A  + VP P   P PS+ +  S
Sbjct: 2560 PPAAPDRSVPPPRPAPRPSEPAVTS 2584



 Score = 29.5 bits (66), Expect = 7.7
 Identities = 11/38 (28%), Positives = 12/38 (31%)

Query: 50  SPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
                AP P     P   P P   PPP  T   S +  
Sbjct: 404 QTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPG 441


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 40.8 bits (95), Expect = 0.003
 Identities = 29/104 (27%), Positives = 38/104 (36%), Gaps = 3/104 (2%)

Query: 3   QLPSLMDDPPQESFHSVPIIQQTLSPQQPPV-SPQHQNMMSPIQQHSLSPHTQAPSPQPP 61
           Q P       Q+    V    Q   PQQP    PQ+Q    P+         Q P   P 
Sbjct: 756 QQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV-APQ 814

Query: 62  AQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLR 105
            Q QQ   P    P  Q  +Q      ++TL+  L L+R  + R
Sbjct: 815 PQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPL-LMRNGDSR 857



 Score = 40.5 bits (94), Expect = 0.004
 Identities = 22/84 (26%), Positives = 30/84 (35%), Gaps = 5/84 (5%)

Query: 4   LPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQ 63
           + +L+DD P E     P+    + P Q P  P          Q  ++P  Q   PQ P  
Sbjct: 732 MKALLDDGPHE-----PLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVA 786

Query: 64  PQQVPSPYHTPPPSQTSKQSGKAP 87
           PQ        P   Q   Q  + P
Sbjct: 787 PQPQYQQPQQPVAPQPQYQQPQQP 810



 Score = 30.1 bits (67), Expect = 5.3
 Identities = 15/61 (24%), Positives = 22/61 (36%)

Query: 27  SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
           +P+  P   Q+        +    P         PA  Q    PY+ P P Q ++Q   A
Sbjct: 376 APEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYA 435

Query: 87  P 87
           P
Sbjct: 436 P 436



 Score = 29.7 bits (66), Expect = 8.0
 Identities = 17/81 (20%), Positives = 23/81 (28%), Gaps = 12/81 (14%)

Query: 11  PPQESFHSVPIIQQTLSPQ----QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQ 66
           P  E +       Q         Q PV PQ        +Q +  P+      QP  QP  
Sbjct: 375 PAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434

Query: 67  VPSP--------YHTPPPSQT 79
            P+P        +       T
Sbjct: 435 APAPEQPVAGNAWQAEEQQST 455


>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
           This is a family of eukaryotic translation initiation
           factors.
          Length = 194

 Score = 38.8 bits (91), Expect = 0.004
 Identities = 25/140 (17%), Positives = 47/140 (33%), Gaps = 16/140 (11%)

Query: 445 MDERNAETTRTLVGHEGPVRKLSFSPDRSLLLSC--SQDNTIRLWSLLLWRCLVVYKGHG 502
           + E         +  EGP+   ++SP+              I  + L   +  V++    
Sbjct: 44  IGETGGPDEVVELDKEGPIHDFAWSPNGKEFAVIYGYMPAKITFFDL---KGNVIHSLGE 100

Query: 503 HPVWDIKFSPHGYYFASAGHDRMA---RLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYV 559
            P   I +SP G     AG   +A     W   +   +       +  DC ++ P+  Y 
Sbjct: 101 QPRNTIFWSPFGRLVLLAGFGNLAGQIEFWDVKNKKKIATAE-ASNATDC-EWSPDGRYF 158

Query: 560 ATGSS------DRTIRLWDC 573
            T ++      D   ++W  
Sbjct: 159 LTATTSPRLRVDNGFKIWHY 178


>gnl|CDD|221042 pfam11244, Med25_NR-box, Mediator complex subunit 25 C-terminal
          NR box-containing.  The overall function of the
          full-length Med25 is efficiently to coordinate the
          transcriptional activation of RAR/RXR (retinoic acid
          receptor/retinoic X receptor) in higher eukaryotic
          cells. Human Med25 consists of several domains with
          different binding properties, the N-terminal, VWA,
          domain, an SD1 - synapsin 1 - domain from residues
          229-381, a PTOV(B) or ACID domain from 395-545, an SD2
          domain from residues 564-645 and this C-terminal NR
          box-containing domain (646-650) from C69-747. The NR
          box of MED25 is critical for its recruitment to the
          promoter, probably through an interaction with pre
          bound RAR.
          Length = 89

 Score = 36.6 bits (84), Expect = 0.005
 Identities = 19/62 (30%), Positives = 24/62 (38%), Gaps = 7/62 (11%)

Query: 24 QTLSPQQPPVS-PQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQ 82
                QP ++  Q Q M    Q H L P    P+  P  Q QQ+      P   Q S Q
Sbjct: 1  HQQQQGQPGLAAMQQQAMGQQQQGHQL-PM-PGPAQFPLQQLQQM----RGPGGGQMSMQ 54

Query: 83 SG 84
          +G
Sbjct: 55 AG 56



 Score = 31.6 bits (71), Expect = 0.28
 Identities = 16/57 (28%), Positives = 19/57 (33%), Gaps = 7/57 (12%)

Query: 20 PIIQQTLSPQQPPVSP------QHQNMMSPIQ-QHSLSPHTQAPSPQPPAQPQQVPS 69
           + QQ    Q P   P      Q Q M  P   Q S+    +AP      QP    S
Sbjct: 17 AMGQQQQGHQLPMPGPAQFPLQQLQQMRGPGGGQMSMQAGGRAPQQMHALQPLLGQS 73


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 37.5 bits (87), Expect = 0.006
 Identities = 23/68 (33%), Positives = 30/68 (44%), Gaps = 5/68 (7%)

Query: 23  QQTLSPQQPPVSPQH---QNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQT 79
           ++ L+P  PP  P+        SP       P T  P+  PPA    VP+P   PPPS  
Sbjct: 76  RKILAPLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAV-PPAPAAAVPAPAAAPPPSDP 134

Query: 80  SKQSGKAP 87
             Q  +AP
Sbjct: 135 P-QPPRAP 141


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
          includes the B. subtilis YppG protein, which is
          functionally uncharacterized. This family of proteins
          is found in bacteria. Proteins in this family are
          typically between 115 and 181 amino acids in length.
          There are two completely conserved residues (F and G)
          that may be functionally important.
          Length = 110

 Score = 36.2 bits (84), Expect = 0.008
 Identities = 17/65 (26%), Positives = 20/65 (30%), Gaps = 6/65 (9%)

Query: 28 PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSP-QPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
           QQP     +   M P       P  Q      P  QP    SP    PP      S  +
Sbjct: 16 QQQPYQQQPYHQQMPPPPYS--PPQQQQGHFMPPQPQPYPKQSPQQQQPP---QFSSFLS 70

Query: 87 PDKNT 91
            KN+
Sbjct: 71 QFKNS 75



 Score = 35.1 bits (81), Expect = 0.020
 Identities = 17/75 (22%), Positives = 18/75 (24%), Gaps = 12/75 (16%)

Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
            Q         Q    P  PP   Q   M               P P P   PQQ   P
Sbjct: 16 QQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQ------------PQPYPKQSPQQQQPP 63

Query: 71 YHTPPPSQTSKQSGK 85
            +   SQ     G 
Sbjct: 64 QFSSFLSQFKNSDGT 78


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 39.1 bits (92), Expect = 0.009
 Identities = 15/64 (23%), Positives = 21/64 (32%), Gaps = 2/64 (3%)

Query: 16  FHSVPIIQQTLSPQQP-PVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTP 74
           FH    + +   P Q    +   Q   +P    +       P P   A PQQ P+     
Sbjct: 359 FHPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASA-PQQAPAVPLPE 417

Query: 75  PPSQ 78
             SQ
Sbjct: 418 TTSQ 421



 Score = 32.5 bits (75), Expect = 0.95
 Identities = 21/110 (19%), Positives = 33/110 (30%), Gaps = 4/110 (3%)

Query: 9   DDPPQESFHSVPIIQQTLSPQQ----PPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
            + P +S       Q T +P      P          S  QQ    P  +  S    A+ 
Sbjct: 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQ 427

Query: 65  QQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKE 114
           Q   +   T         + +A   N+ +  L  +R       +   KKE
Sbjct: 428 QLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKE 477


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 37.2 bits (86), Expect = 0.010
 Identities = 26/87 (29%), Positives = 30/87 (34%), Gaps = 17/87 (19%)

Query: 3   QLP--SLMDDPPQESFHSVPIIQQTLSPQQP--PVSPQHQNM--------MSPIQQHSLS 50
           Q+P       P     H + I    L PQQP  PV   H  +        + P  QH L 
Sbjct: 56  QMPQQQQSAHPKLTPHHQLLI----LPPQQPMMPVPGHHPMVPMTGQQPHLQPPAQHPLQ 111

Query: 51  PHT-QAPSPQPPAQPQQVPSPYHTPPP 76
           P   Q P PQ P   Q    P     P
Sbjct: 112 PTYGQNPQPQQPTHTQPPVQPQQPADP 138



 Score = 36.4 bits (84), Expect = 0.018
 Identities = 23/76 (30%), Positives = 29/76 (38%), Gaps = 5/76 (6%)

Query: 4   LPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHT-QAPSPQPPA 62
           +P     P Q+        + T   Q   + PQ   M  P   H + P T Q P  QPPA
Sbjct: 51  IPLSPQMPQQQQSA---HPKLTPHHQLLILPPQQPMMPVPGH-HPMVPMTGQQPHLQPPA 106

Query: 63  QPQQVPSPYHTPPPSQ 78
           Q    P+    P P Q
Sbjct: 107 QHPLQPTYGQNPQPQQ 122


>gnl|CDD|150370 pfam09682, Holin_LLH, Phage holin protein (Holin_LLH).  This entry
           identifies a family of putative phage holin from a
           number of phage and prophage regions of Gram-positive
           bacteria. Like other holins, it is small (about 100
           amino acids) with stretches of hydrophobic sequence and
           is encoded adjacent to lytic enzymes.
          Length = 108

 Score = 35.8 bits (83), Expect = 0.012
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 2/40 (5%)

Query: 78  QTSKQSGKAPDKNTLIT--MLKLLRQYNLRGTEEQLKKEI 115
           Q +K+ GK  +K T     +   L + N+  T+ QLK  I
Sbjct: 58  QIAKEGGKGEEKLTQAKNKLKAELDKANIHPTDAQLKGFI 97


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 37.4 bits (87), Expect = 0.018
 Identities = 10/72 (13%), Positives = 14/72 (19%), Gaps = 4/72 (5%)

Query: 8   MDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMM----SPIQQHSLSPHTQAPSPQPPAQ 63
            + P   S  + P + Q   P   P   Q    +                  P       
Sbjct: 45  AEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALP 104

Query: 64  PQQVPSPYHTPP 75
                    T P
Sbjct: 105 AGPAGPTIQTEP 116



 Score = 34.7 bits (80), Expect = 0.15
 Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 2/38 (5%)

Query: 56 PSPQPPAQPQQVPSPYHTPPPSQTSKQS--GKAPDKNT 91
          PSPQ P    Q+P P   PPP+Q  +    G     NT
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNT 88



 Score = 30.1 bits (68), Expect = 4.4
 Identities = 29/155 (18%), Positives = 44/155 (28%), Gaps = 20/155 (12%)

Query: 19  VPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQV----PSPYHTP 74
           VP++        P   P  Q  +  +QQ   +P     S Q P+Q Q       S     
Sbjct: 124 VPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPA----SGQLPSQQQSAQKNDESQLQQQ 179

Query: 75  P-----PSQTSKQSGKAPDKNTLITMLK---LLRQYNLRGTEEQLKKEINVNDLGIDPND 126
           P     P QT        +    +         R     G       +            
Sbjct: 180 PNGETPPQQTDGAGDDESEALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQAKSSKRR 239

Query: 127 SEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLD 161
           +         + EGD +  +D  +D    IE+ LD
Sbjct: 240 TIAQIDGIDSDDEGDGSD-DDDDED---AIESDLD 270



 Score = 30.1 bits (68), Expect = 4.6
 Identities = 15/88 (17%), Positives = 21/88 (23%), Gaps = 8/88 (9%)

Query: 5   PSLMDDPPQESFHSVPIIQ---QTLSPQQPPVSPQ---HQNMMSPIQQHSLSPHTQAPSP 58
                           I     Q    Q P +  Q   +  +  P QQ +L    Q    
Sbjct: 96  NPPATFALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRAL-QQLQQRYG 154

Query: 59  QPPA-QPQQVPSPYHTPPPSQTSKQSGK 85
            P + Q             SQ  +Q   
Sbjct: 155 APASGQLPSQQQSAQKNDESQLQQQPNG 182



 Score = 30.1 bits (68), Expect = 4.6
 Identities = 10/61 (16%), Positives = 13/61 (21%), Gaps = 3/61 (4%)

Query: 28  PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP-SPYHTPPPSQTSKQSGKA 86
           P      P  Q                   P    Q    P       PP+  +  +G A
Sbjct: 51  PSPQAPPPVAQL--PQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPA 108

Query: 87  P 87
            
Sbjct: 109 G 109


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 36.7 bits (85), Expect = 0.019
 Identities = 13/71 (18%), Positives = 18/71 (25%), Gaps = 1/71 (1%)

Query: 8   MDDPPQES-FHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQ 66
              P +     +     QT    QPP +            H + P    P+P    Q   
Sbjct: 106 ACRPQRPVMTRATSTTVQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPP 165

Query: 67  VPSPYHTPPPS 77
                   PP 
Sbjct: 166 PGLLQPQGPPP 176


>gnl|CDD|221625 pfam12533, Neuro_bHLH, Neuronal helix-loop-helix transcription
           factor.  This domain family is found in eukaryotes, and
           is approximately 80 amino acids in length. The family is
           found C-terminal to pfam00010. There is a single
           completely conserved residue W that may be functionally
           important. Neuronal basic helix-loop-helix (bHLH)
           transcription factors such as neuroD and neurogenin have
           been shown to play important roles in neuronal
           development.
          Length = 120

 Score = 35.4 bits (82), Expect = 0.020
 Identities = 23/89 (25%), Positives = 28/89 (31%), Gaps = 12/89 (13%)

Query: 3   QLPS---LMDDPPQESFHSVPIIQQTLSPQQPP--VSPQHQNMMSPIQQHSLSPHTQAPS 57
           QL     L +     S        Q   P Q P   SP H +M S      L PH    +
Sbjct: 30  QLNPRTLLPEQQQDYSHSGSSPFSQHPYPYQSPGLPSPPHGSM-SSSHSLHLKPHGYCSA 88

Query: 58  PQP---PAQPQQVPSPY---HTPPPSQTS 80
            +P      P     PY    +PP S   
Sbjct: 89  YEPFYESHSPDCGSPPYDGPLSPPLSING 117


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 37.3 bits (87), Expect = 0.025
 Identities = 12/72 (16%), Positives = 19/72 (26%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
           PP+         +      QP   P +Q       Q  +SP          + PQ     
Sbjct: 108 PPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQA 167

Query: 71  YHTPPPSQTSKQ 82
           +    P    + 
Sbjct: 168 FQPAEPVAAPQP 179



 Score = 36.6 bits (85), Expect = 0.035
 Identities = 24/95 (25%), Positives = 32/95 (33%), Gaps = 14/95 (14%)

Query: 3   QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP-- 60
             P+     P +     P  +Q   P Q PVSPQ      P+           P+ Q   
Sbjct: 119 PRPAQPAPQPVQQPAYQPQPEQ---PLQQPVSPQVAPAPQPVHS------APQPAQQAFQ 169

Query: 61  PAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITM 95
           PA+P   P P     P+       K   K  +I M
Sbjct: 170 PAEPVAAPQPEPVAEPAPVMD---KPKRKEAVIVM 201



 Score = 36.6 bits (85), Expect = 0.042
 Identities = 22/81 (27%), Positives = 27/81 (33%), Gaps = 5/81 (6%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQH--QNMMSPIQQHSLSPHTQAPSPQPPAQP--QQ 66
           PP  S      +QQ    Q PP       Q    P+QQ +  P  + P  Q P  P    
Sbjct: 94  PPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQP-LQQPVSPQVAP 152

Query: 67  VPSPYHTPPPSQTSKQSGKAP 87
            P P H+ P           P
Sbjct: 153 APQPVHSAPQPAQQAFQPAEP 173



 Score = 35.4 bits (82), Expect = 0.087
 Identities = 14/82 (17%), Positives = 18/82 (21%), Gaps = 4/82 (4%)

Query: 5   PSLMDDPPQESFHSV-PIIQQTLSPQQPPVSPQHQNMMSPIQQHSL---SPHTQAPSPQP 60
               +       H   P        Q     P+ Q       + +     P  Q      
Sbjct: 78  AQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQ 137

Query: 61  PAQPQQVPSPYHTPPPSQTSKQ 82
           P QP Q P      P  Q    
Sbjct: 138 PEQPLQQPVSPQVAPAPQPVHS 159



 Score = 31.5 bits (72), Expect = 1.5
 Identities = 13/56 (23%), Positives = 17/56 (30%), Gaps = 2/56 (3%)

Query: 32  PVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
           P + Q      P  QH   P   +  P+ P Q    P     P  +    Q    P
Sbjct: 75  PANAQEHEAARPSPQHQYQPPYASAQPRQPVQQP--PEAQVPPQHAPRPAQPAPQP 128


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 36.7 bits (85), Expect = 0.026
 Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 21  IIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTS 80
           +I++  +  QPP       M  P       PH   P+  P AQP    +P  +PPP+  S
Sbjct: 134 VIRKKEALPQPPPPAPVVMMQPP------PPHAMPPASPPAAQPAP-SAPASSPPPTPAS 186

Query: 81  KQSGKAP 87
               KAP
Sbjct: 187 PPPAKAP 193



 Score = 35.2 bits (81), Expect = 0.086
 Identities = 19/57 (33%), Positives = 26/57 (45%), Gaps = 3/57 (5%)

Query: 28  PQQPPVSPQHQNMMSPIQQHSLSPHTQAP-SPQPPAQPQQVPSPYHTPPPSQTSKQS 83
           PQ PP +P    MM P   H++ P +     P P A     P    +PPP++  K S
Sbjct: 142 PQPPPPAPVV--MMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSS 196


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 37.0 bits (85), Expect = 0.030
 Identities = 17/74 (22%), Positives = 28/74 (37%)

Query: 28  PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
             +P   P  QN        + +   Q    Q P QP Q P+P +       ++Q  K  
Sbjct: 200 APKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRS 259

Query: 88  DKNTLITMLKLLRQ 101
            +      LK+L++
Sbjct: 260 REQGNQEFLKMLKE 273


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 37.0 bits (85), Expect = 0.031
 Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 21/111 (18%)

Query: 21  IIQQTLSPQQPPVSP--------QHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP---- 68
           +IQ+ L  +   V P        + ++  S +      P T  P P PPA+ +Q P    
Sbjct: 144 VIQEFLVKEGDTVEPGTKVAIISKSEDAASQVTPSQKIPETTDPKPSPPAEDKQKPKVES 203

Query: 69  --------SPYHTPPPSQTSKQSGKAP-DKNTLITMLKLLRQYNLRGTEEQ 110
                   +P   PPP Q++K+    P ++   + M +L ++   R  + Q
Sbjct: 204 APVAEKPKAPSSPPPPKQSAKEPQLPPKERERRVPMTRLRKRVATRLKDSQ 254


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.0 bits (85), Expect = 0.038
 Identities = 30/96 (31%), Positives = 35/96 (36%), Gaps = 14/96 (14%)

Query: 5   PSLMDDP---PQESFHSVPIIQQTLSPQ------QPPVSPQHQNMMSPIQQHSLSPHTQA 55
           PS+   P      S        Q + PQ      QP   PQ  + +S I   SL P  + 
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHP-QRL 237

Query: 56  PSPQPPAQP----QQVPSPYHTPPPSQTSKQSGKAP 87
           PSP PP QP    QQ P P         S   G  P
Sbjct: 238 PSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGP 273



 Score = 35.0 bits (80), Expect = 0.15
 Identities = 22/79 (27%), Positives = 28/79 (35%), Gaps = 7/79 (8%)

Query: 11  PPQESFHSVPIIQQTLSPQQ-----PPVSPQHQNMMSPIQQHSLSPHTQAP--SPQPPAQ 63
            PQ+      I   +L PQ+     PP+ PQ  +  SP      S H Q+    P PP  
Sbjct: 217 QPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMP 276

Query: 64  PQQVPSPYHTPPPSQTSKQ 82
                 P     PS    Q
Sbjct: 277 HALQQGPVFLQHPSSNPPQ 295



 Score = 32.4 bits (73), Expect = 1.0
 Identities = 23/78 (29%), Positives = 29/78 (37%), Gaps = 9/78 (11%)

Query: 17  HSVPIIQQTLSPQQPPVSP----QHQNMMSPIQQHSLSPHT---QAPSPQPPAQPQQVPS 69
            S P     L+  Q P  P       +  +P  Q +L P     + P P  P+ P   P 
Sbjct: 290 SSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPP 349

Query: 70  PYHTPPPSQTSKQSGKAP 87
           P  T P  Q   QS K P
Sbjct: 350 P--TTPIPQLPNQSHKHP 365



 Score = 32.0 bits (72), Expect = 1.4
 Identities = 15/62 (24%), Positives = 22/62 (35%), Gaps = 6/62 (9%)

Query: 24  QTLSPQ---QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTS 80
           Q+L P+    P          SP  QH  +         PP+ P   P+    PP + + 
Sbjct: 429 QSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAA---PPRASSG 485

Query: 81  KQ 82
            Q
Sbjct: 486 SQ 487



 Score = 31.6 bits (71), Expect = 1.7
 Identities = 31/103 (30%), Positives = 35/103 (33%), Gaps = 25/103 (24%)

Query: 11  PPQESFHSVPIIQQTLSPQ--QPPVSPQHQNMMSPIQQHSLSPHTQAPS---------PQ 59
           PP+E     P+      P    PP +P  Q    P Q H   PH Q PS         P 
Sbjct: 331 PPREQ----PLPPAPSMPHIKPPPTTPIPQL---PNQSHKHPPHLQGPSPFPQMPSNLPP 383

Query: 60  PPA------QPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITML 96
           PPA       P   P   H PPP Q   QS           +L
Sbjct: 384 PPALKPLSSLPTHHPPSAH-PPPLQLMPQSQPLQSVPAQPPVL 425



 Score = 30.4 bits (68), Expect = 3.8
 Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 14/103 (13%)

Query: 3   QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQ---NMMSPIQQ---HSLSPHTQAP 56
           +LPS    P Q    S    Q      + P S  H     M   +QQ       P +  P
Sbjct: 236 RLPS-PHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPP 294

Query: 57  SPQPPAQ----PQQVPS---PYHTPPPSQTSKQSGKAPDKNTL 92
            P   AQ    P  +PS   P+   PPSQ++ Q  + P +  L
Sbjct: 295 QPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPL 337



 Score = 30.0 bits (67), Expect = 5.1
 Identities = 19/63 (30%), Positives = 22/63 (34%), Gaps = 2/63 (3%)

Query: 27  SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS--PYHTPPPSQTSKQSG 84
            P QPPV  Q Q++      H  S     P   P AQ        P   PPPS  +    
Sbjct: 418 VPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPA 477

Query: 85  KAP 87
             P
Sbjct: 478 APP 480



 Score = 29.3 bits (65), Expect = 9.2
 Identities = 21/82 (25%), Positives = 29/82 (35%), Gaps = 7/82 (8%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
           PP+ S H       +     PP SP  Q+  +     ++ P    P+  P A P+   S 
Sbjct: 432 PPKASTHP-----HSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRA--SS 484

Query: 71  YHTPPPSQTSKQSGKAPDKNTL 92
              PP S      G A     L
Sbjct: 485 GSQPPGSALPSSGGCAGPGPPL 506


>gnl|CDD|214832 smart00817, Amelin, Ameloblastin precursor (Amelin).  This family
           consists of several mammalian Ameloblastin precursor
           (Amelin) proteins. Matrix proteins of tooth enamel
           consist mainly of amelogenin but also of non-amelogenin
           proteins, which, although their volumetric percentage is
           low, have an important role in enamel mineralisation.
           One of the non-amelogenin proteins is ameloblastin, also
           known as amelin and sheathlin. Ameloblastin (AMBN) is
           one of the enamel sheath proteins which is though to
           have a role in determining the prismatic structure of
           growing enamel crystals.
          Length = 411

 Score = 36.8 bits (85), Expect = 0.040
 Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 6/70 (8%)

Query: 13  QESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQA----PSPQPPAQPQQVP 68
           Q+  +S+P+    L P QP + PQ Q  + P  Q +  P  QA      PQPP    Q P
Sbjct: 87  QQYEYSLPVHPPPL-PSQPSLQPQ-QPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPP 144

Query: 69  SPYHTPPPSQ 78
                 P   
Sbjct: 145 LQQAELPMIP 154


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 36.5 bits (84), Expect = 0.054
 Identities = 22/108 (20%), Positives = 37/108 (34%), Gaps = 5/108 (4%)

Query: 11  PPQESFHSVP-IIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
            PQ+       +  Q   PQQ P   Q   + + +QQ           P  P  PQ+   
Sbjct: 262 QPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGP-PPQR--- 317

Query: 70  PYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKEINV 117
           P   P   Q  +Q   +  +  L  M     +   +  ++Q ++  N 
Sbjct: 318 PGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNH 365



 Score = 34.2 bits (78), Expect = 0.28
 Identities = 21/60 (35%), Positives = 24/60 (40%), Gaps = 5/60 (8%)

Query: 27  SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQ--PQQVPSP-YHTPPPSQTSKQS 83
           SP   P    +Q+M  P Q    SP    P  QPP       +PSP    P PS    QS
Sbjct: 413 SPSPVPQVQTNQSMPQPPQPSVPSP--GGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQS 470



 Score = 33.4 bits (76), Expect = 0.49
 Identities = 19/78 (24%), Positives = 25/78 (32%), Gaps = 2/78 (2%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA-QPQQVPS 69
           P   S  S     QT      P  P   +   P  Q      +    P PPA  P   P 
Sbjct: 408 PGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPP-QSVSGGMIPSPPALMPSPSPQ 466

Query: 70  PYHTPPPSQTSKQSGKAP 87
              +P   +T +Q   +P
Sbjct: 467 MSQSPASQRTIQQDMVSP 484



 Score = 33.1 bits (75), Expect = 0.63
 Identities = 19/84 (22%), Positives = 20/84 (23%), Gaps = 8/84 (9%)

Query: 12  PQESFHSVPIIQQTLSPQQPPVSPQHQ--------NMMSPIQQHSLSPHTQAPSPQPPAQ 63
           PQ         QQ +  QQ P   Q Q          M P Q              P  Q
Sbjct: 224 PQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQGGQMQVPMQQQPPQQQ 283

Query: 64  PQQVPSPYHTPPPSQTSKQSGKAP 87
           PQQ           Q        P
Sbjct: 284 PQQSQLGMLPNQMQQMPGGGQGGP 307



 Score = 31.5 bits (71), Expect = 1.9
 Identities = 22/80 (27%), Positives = 25/80 (31%), Gaps = 8/80 (10%)

Query: 19  VPIIQQTLSPQQ-PPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPS 77
              IQ         P+      MMS     S SP  Q  + Q   QP Q   P    P S
Sbjct: 388 YLNIQGNQGGLGANPMQQGQPGMMS-----SPSPVPQVQTNQSMPQPPQPSVPSPGGPGS 442

Query: 78  Q--TSKQSGKAPDKNTLITM 95
           Q   S   G  P    L+  
Sbjct: 443 QPPQSVSGGMIPSPPALMPS 462



 Score = 30.7 bits (69), Expect = 2.8
 Identities = 12/65 (18%), Positives = 17/65 (26%)

Query: 3   QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA 62
           Q        P         +QQ +         Q +      QQ +     Q     P A
Sbjct: 309 QPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAA 368

Query: 63  QPQQV 67
             QQ+
Sbjct: 369 HQQQM 373


>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1.  The
           overall function of the full-length Med25 is efficiently
           to coordinate the transcriptional activation of RAR/RXR
           (retinoic acid receptor/retinoic X receptor) in higher
           eukaryotic cells. Human Med25 consists of several
           domains with different binding properties, the
           N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
           from residues 229-381, a PTOV(B) or ACID domain from
           395-545, an SD2 domain from residues 564-645 and a
           C-terminal NR box-containing domain (646-650) from
           646-747. This The function of the SD domains is unclear.
          Length = 168

 Score = 34.8 bits (79), Expect = 0.063
 Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 9/65 (13%)

Query: 23  QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQ 82
           +Q L  +  P++P  Q            P +QAP+P  P  P   P P    P SQ S  
Sbjct: 72  KQGLKNRFSPITPLQQA------PIVGPPFSQAPAPVLPPGPPGAPKP---SPASQLSLV 122

Query: 83  SGKAP 87
           +  +P
Sbjct: 123 TTVSP 127



 Score = 28.6 bits (63), Expect = 8.4
 Identities = 20/68 (29%), Positives = 23/68 (33%), Gaps = 1/68 (1%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
           P Q++    P   Q  +P  PP  P      SP  Q SL       S   P   QQ   P
Sbjct: 84  PLQQAPIVGPPFSQAPAPVLPP-GPPGAPKPSPASQLSLVTTVSPGSGLAPVLTQQQVPP 142

Query: 71  YHTPPPSQ 78
                PS 
Sbjct: 143 QQPQQPSM 150


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 35.8 bits (83), Expect = 0.067
 Identities = 21/90 (23%), Positives = 33/90 (36%), Gaps = 17/90 (18%)

Query: 1   MDQLPSLMDDPPQES--------FHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPH 52
            D   +L  D P++S          SVP  Q   +PQQP  S   ++  +  +  S    
Sbjct: 176 DDYGANLAVDEPEQSTMSRPQEVKQSVPAQQAPPNPQQPMPSASSES--ATSKSAST--- 230

Query: 53  TQAPSPQPPAQPQQVPSPYHTPPPSQTSKQ 82
               S +   QPQ  P      P   ++ +
Sbjct: 231 ----SRESSPQPQSPPPRRVPAPTVDSTAK 256


>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein
           1).  Dystroglycan is one of the dystrophin-associated
           glycoproteins, which is encoded by a 5.5 kb transcript
           in human. The protein product is cleaved into two
           non-covalently associated subunits, [alpha] (N-terminal)
           and [beta] (C-terminal). In skeletal muscle the
           dystroglycan complex works as a transmembrane linkage
           between the extracellular matrix and the cytoskeleton.
           [alpha]-dystroglycan is extracellular and binds to
           merosin ([alpha]-2 laminin) in the basement membrane,
           while [beta]-dystroglycan is a transmembrane protein and
           binds to dystrophin, which is a large rod-like
           cytoskeletal protein, absent in Duchenne muscular
           dystrophy patients. Dystrophin binds to intracellular
           actin cables. In this way, the dystroglycan complex,
           which links the extracellular matrix to the
           intracellular actin cables, is thought to provide
           structural integrity in muscle tissues. The dystroglycan
           complex is also known to serve as an agrin receptor in
           muscle, where it may regulate agrin-induced
           acetylcholine receptor clustering at the neuromuscular
           junction. There is also evidence which suggests the
           function of dystroglycan as a part of the signal
           transduction pathway because it is shown that Grb2, a
           mediator of the Ras-related signal pathway, can interact
           with the cytoplasmic domain of dystroglycan. In general,
           aberrant expression of dystrophin-associated protein
           complex underlies the pathogenesis of Duchenne muscular
           dystrophy, Becker muscular dystrophy and severe
           childhood autosomal recessive muscular dystrophy.
           Interestingly, no genetic disease has been described for
           either [alpha]- or [beta]-dystroglycan. Dystroglycan is
           widely distributed in non-muscle tissues as well as in
           muscle tissues. During epithelial morphogenesis of
           kidney, the dystroglycan complex is shown to act as a
           receptor for the basement membrane. Dystroglycan
           expression in mouse brain and neural retina has also
           been reported. However, the physiological role of
           dystroglycan in non-muscle tissues has remained unclear.
          Length = 290

 Score = 35.4 bits (81), Expect = 0.084
 Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 9/85 (10%)

Query: 8   MDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNM--MSPIQQHSLSPHTQAPSPQPPAQPQ 65
           +DD       S+P+I +   P  PP    +QN+   +P+ Q  L  +T      P A P 
Sbjct: 199 LDDSKPPPSSSMPLILKEEKPPLPPPEYPNQNVPETTPLNQDLLGEYTPLRDEDPNAPPY 258

Query: 66  QVPSPYHTP-------PPSQTSKQS 83
           Q P P+ TP       P + T  +S
Sbjct: 259 QPPPPFTTPMEGKGSRPKNMTPYRS 283


>gnl|CDD|191194 pfam05111, Amelin, Ameloblastin precursor (Amelin).  This family
           consists of several mammalian Ameloblastin precursor
           (Amelin) proteins. Matrix proteins of tooth enamel
           consist mainly of amelogenin but also of non-amelogenin
           proteins, which, although their volumetric percentage is
           low, have an important role in enamel mineralisation.
           One of the non-amelogenin proteins is ameloblastin, also
           known as amelin and sheathlin. Ameloblastin (AMBN) is
           one of the enamel sheath proteins which is though to
           have a role in determining the prismatic structure of
           growing enamel crystals.
          Length = 410

 Score = 35.3 bits (81), Expect = 0.11
 Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 6/60 (10%)

Query: 13  QESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQA----PSPQPPAQPQQVP 68
           Q+  +SVP+    L P QP + P  Q  + P  Q + +   Q       PQPP  P Q+P
Sbjct: 102 QQYEYSVPVHPPPL-PSQPSLQP-QQPGLKPFLQPTAATAIQDTPQKAGPQPPMHPGQLP 159


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin
          carboxyl carrier protein subunit; Validated.
          Length = 153

 Score = 33.7 bits (77), Expect = 0.12
 Identities = 13/47 (27%), Positives = 21/47 (44%), Gaps = 1/47 (2%)

Query: 41 MSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
          +S +Q+   +P   AP+P  P+ P  V      P P+   +    AP
Sbjct: 45 LSAVQEQVPTP-APAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAP 90


>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 264

 Score = 34.8 bits (80), Expect = 0.13
 Identities = 16/91 (17%), Positives = 30/91 (32%), Gaps = 10/91 (10%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP---- 60
           P+ + D  + +     +++Q    Q+    P          +    P TQ+   Q     
Sbjct: 83  PAAVKDAERLTPEQRQLLEQMEVDQKA--QPTQLGEQPEQARIEEQPRTQSQKAQSQATT 140

Query: 61  ----PAQPQQVPSPYHTPPPSQTSKQSGKAP 87
               P +P+  P       P+   +   KAP
Sbjct: 141 VQTQPVKPKPRPEKPQPVAPAPAPEPVEKAP 171


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 35.3 bits (82), Expect = 0.13
 Identities = 11/72 (15%), Positives = 19/72 (26%)

Query: 20  PIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQT 79
            I +   +      +P       P      +P T+  +  P   P +   P   P     
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTE 423

Query: 80  SKQSGKAPDKNT 91
              +  A   N 
Sbjct: 424 PSPTPPANAANA 435



 Score = 33.8 bits (78), Expect = 0.35
 Identities = 14/91 (15%), Positives = 19/91 (20%), Gaps = 10/91 (10%)

Query: 4   LPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQ 63
           L  L+         S        +   P  +P                    PSP P   
Sbjct: 356 LLGLLPSAFI----SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKA 411

Query: 64  ----PQQVPSPYHTPPPSQTSKQSGKAPDKN 90
               P        +P P   +      P  N
Sbjct: 412 SPPIPVPAEPTEPSPTPPANAAN--APPSLN 440



 Score = 33.4 bits (77), Expect = 0.43
 Identities = 13/75 (17%), Positives = 20/75 (26%)

Query: 17  HSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPP 76
            S        +P  P  SP       P+ Q + +     P P PP   Q   +     P 
Sbjct: 513 QSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPA 572

Query: 77  SQTSKQSGKAPDKNT 91
             +           +
Sbjct: 573 DSSPPPPIPEEPTPS 587



 Score = 30.7 bits (70), Expect = 3.1
 Identities = 22/108 (20%), Positives = 28/108 (25%), Gaps = 16/108 (14%)

Query: 22  IQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSK 81
           I+  L  Q    +        P Q+    P    P PQP A       P   PPP+ T  
Sbjct: 506 IKLNLESQSGS-ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPP--PPPPTATQA 562

Query: 82  QSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKEINVNDLGIDPNDSEI 129
            S                 Q     +      E        D +  EI
Sbjct: 563 SSNAPA-------------QIPADSSPPPPIPEEPTPSPTKDSSPEEI 597


>gnl|CDD|218161 pfam04589, RFX1_trans_act, RFX1 transcription activation region. 
          The RFX family is a family of winged-helix DNA binding
          proteins. RFX1 is a regulatory factor essential for
          expression of MHC class II genes. This region is to
          found N terminal to the RFX DNA binding region
          (pfam02257) in some mammalian RFX proteins, and is
          thought to activate transcription when associated with
          DNA. Deletion analysis has identified the region
          233-351 in human RFX1 as being required for maximal
          activation.
          Length = 150

 Score = 33.4 bits (76), Expect = 0.14
 Identities = 17/53 (32%), Positives = 20/53 (37%), Gaps = 1/53 (1%)

Query: 24 QTLSPQQPPVSPQHQN-MMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPP 75
          QT  P Q PV    Q  ++    Q    P  Q P  +    PQQV    H  P
Sbjct: 17 QTSVPAQAPVPASQQRSVVQATSQTKGGPVQQLPVHRVQQVPQQVQQVQHVYP 69


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 34.6 bits (79), Expect = 0.15
 Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 6/68 (8%)

Query: 25  TLSPQQPPVSPQH-QNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
           + SPQ PP +    Q+     Q+H+    T  P+      P    +P  T    +T    
Sbjct: 244 SGSPQHPPSTTSQDQSTTGDGQEHTQRRKT-PPATSNRRSPHSTATPPPTTKRQET---- 298

Query: 84  GKAPDKNT 91
           G+   + T
Sbjct: 299 GRPTPRPT 306



 Score = 31.6 bits (71), Expect = 1.5
 Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 5/62 (8%)

Query: 29  QQPPVSPQHQNMMSPIQQHSLSPHTQ-----APSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
           Q+    P   N  SP    +  P T+      P+P+P A  Q   SP H+ PP   +  +
Sbjct: 269 QRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPT 328

Query: 84  GK 85
            +
Sbjct: 329 TQ 330


>gnl|CDD|221628 pfam12540, DUF3736, Protein of unknown function (DUF3736).  This
           domain family is found in eukaryotes, and is typically
           between 135 and 160 amino acids in length.
          Length = 147

 Score = 33.3 bits (76), Expect = 0.20
 Identities = 20/102 (19%), Positives = 36/102 (35%), Gaps = 11/102 (10%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
               P ++D   ++    +  + +     Q      +Q  M   ++     H ++PSPQ 
Sbjct: 51  PSTPPLVLDASTKK----LEFLWRVGLTCQQQACAYYQGKMGIRRRPL---HEESPSPQR 103

Query: 61  PAQPQQVPSPYHTPPPSQTSK----QSGKAPDKNTLITMLKL 98
                 VPSP +   P + S     Q  +   K   +  L L
Sbjct: 104 LQGAPPVPSPSNQAYPFELSPSSVMQDPRFQPKKQFLAQLNL 145


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 34.3 bits (78), Expect = 0.20
 Identities = 21/99 (21%), Positives = 28/99 (28%), Gaps = 13/99 (13%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQP-----PVSPQHQNMMSPIQQHSLSPH-TQ 54
           M   P     PP       P   +      P     P   Q+ NMM  + +    P    
Sbjct: 182 MGMRPGFNQMPPHMPGMP-PNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGG 240

Query: 55  APSPQPPAQPQQV------PSPYHTPPPSQTSKQSGKAP 87
              P  P  P         P+    PPP     ++G  P
Sbjct: 241 FNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRAGFPP 279


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 709

 Score = 34.3 bits (78), Expect = 0.29
 Identities = 22/60 (36%), Positives = 27/60 (45%), Gaps = 2/60 (3%)

Query: 32  PVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNT 91
           PVS Q  N + P +       T A + Q  A+  Q  S   TPP +Q SK   KA D  T
Sbjct: 424 PVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTASEAETPPENQVSKN--KAADNET 481


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is
          found to the N-terminus of the pfam00170 transcription
          factor domain. It is between 150 and 200 amino acids in
          length. The N-terminal half is rather rich in proline
          residues and has been termed the PRD (proline rich
          domain), whereas the C-terminal half is more polar and
          has been called the MFMR (multifunctional mosaic
          region). It has been suggested that this family is
          composed of three sub-families called A, B and C,
          classified according to motif composition. It has been
          suggested that some of these motifs may be involved in
          mediating protein-protein interactions. The MFMR region
          contains a nuclear localisation signal in bZIP opaque
          and GBF-2. The MFMR also contains a transregulatory
          activity in TAF-1. The MFMR in CPRF-2 contains
          cytoplasmic retention signals.
          Length = 189

 Score = 32.9 bits (75), Expect = 0.31
 Identities = 19/69 (27%), Positives = 22/69 (31%), Gaps = 7/69 (10%)

Query: 15 SFHSVPIIQQTLSPQQPPVSPQHQNMMS---PIQQHSLSPHTQAPSPQPP----AQPQQV 67
             S P          P V P    M +   P         + A SPQP        Q +
Sbjct: 9  PSKSSPKTSVQEDTPTPTVYPDWSAMQAYYGPRPPPPYFNSSVASSPQPHPYMWGPQQPM 68

Query: 68 PSPYHTPPP 76
            PY TPPP
Sbjct: 69 MPPYGTPPP 77


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 33.7 bits (77), Expect = 0.38
 Identities = 18/75 (24%), Positives = 26/75 (34%), Gaps = 2/75 (2%)

Query: 4   LPSLMDDPPQESFHSVPIIQQTLSPQ--QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPP 61
           +P  +  PP  S   +P    +L+P     P +   +   +P           AP    P
Sbjct: 182 IPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIP 241

Query: 62  AQPQQVPSPYHTPPP 76
           A P      Y  PPP
Sbjct: 242 APPIPPVIQYVAPPP 256



 Score = 33.4 bits (76), Expect = 0.51
 Identities = 12/51 (23%), Positives = 19/51 (37%)

Query: 18  SVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP 68
           S P       P  P +       + P Q      H +A + + PA P+ +P
Sbjct: 234 SAPPASIPAPPIPPVIQYVAPPPVPPPQPIIPIQHIRAVTGETPANPRDIP 284



 Score = 29.5 bits (66), Expect = 7.9
 Identities = 19/74 (25%), Positives = 22/74 (29%), Gaps = 3/74 (4%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
           PS +   P  S    P    T   + P VS    N   P    +      A  P PP  P
Sbjct: 191 PSSLPGLPPGSSSLAPSASSTPGNRLPRVS---FNPFLPGPSPAQPSAPPASIPAPPIPP 247

Query: 65  QQVPSPYHTPPPSQ 78
                     PP Q
Sbjct: 248 VIQYVAPPPVPPPQ 261


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 33.3 bits (76), Expect = 0.39
 Identities = 20/87 (22%), Positives = 31/87 (35%), Gaps = 5/87 (5%)

Query: 10  DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
           DPP     + P + Q   P++       Q   +P   HS +P     + Q PA P+Q   
Sbjct: 111 DPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPVHSAAPQPAVQTVQ-PAVPEQQVQ 169

Query: 70  PYHTPPPSQTSKQSGKAPDKNTLITML 96
               P          K P +   + +L
Sbjct: 170 ----PEEVVEPAPEVKRPPRKDTVIIL 192



 Score = 30.2 bits (68), Expect = 3.6
 Identities = 17/77 (22%), Positives = 19/77 (24%), Gaps = 5/77 (6%)

Query: 2   DQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPP 61
            Q P     P   S      + Q   P Q    P H     P       P        P 
Sbjct: 83  RQSPQHQYQPEYASAQIKIPVPQ---PPQISDPPAHPQPTQPALDQEQPPEEARQPVLP- 138

Query: 62  AQPQQVPSPYHTPPPSQ 78
            Q    P P H+  P  
Sbjct: 139 -QEAPAPQPVHSAAPQP 154


>gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 451

 Score = 33.6 bits (77), Expect = 0.41
 Identities = 19/76 (25%), Positives = 27/76 (35%), Gaps = 4/76 (5%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSP--QQPPVSPQHQNMMSPIQQHSLSPHTQAPSP 58
           + QL S +  P            Q  S   QQ   SPQ Q +    Q   L+  +     
Sbjct: 355 LSQLVSQIKSPAFTQLRLESCPPQIKSSNSQQTVSSPQPQPVAKLEQGSLLTAASPQTKK 414

Query: 59  QPPAQPQQVPSPYHTP 74
           +     Q+VPSP  + 
Sbjct: 415 ET--IHQEVPSPSLSK 428


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 33.4 bits (76), Expect = 0.42
 Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 6/89 (6%)

Query: 11  PPQESFHS--VPIIQQTLSPQQPPVSPQHQNMMSPIQQHS--LSPHTQAPSPQPPAQPQQ 66
           PP+ES  +     +       + P  P  +    P       LSP+T     +PP  P  
Sbjct: 329 PPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSP-- 386

Query: 67  VPSPYHTPPPSQTSKQSGKAPDKNTLITM 95
           +P+P  + P S  S  +   P +  ++  
Sbjct: 387 IPTPPSSSPASSKSVDAVAKPAEPDVVPS 415


>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
           activation/proliferation-associated protein-1 C term.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 343 and 708 amino
           acids in length. This family is the C terminal region of
           caprin-1. Caprin-1 is a protein involved in regulating
           cellular proliferation. In mutated phenotypes, the G1
           phase of the cell cycle is greatly lengthened, impairing
           normal proliferation. The C terminal region of caprin-1
           contains RGG motifs which are characteristic of RNA
           binding domains. It is possible that caprin-1 functions
           through an RNA binding mechanism.
          Length = 319

 Score = 33.0 bits (75), Expect = 0.44
 Identities = 24/77 (31%), Positives = 31/77 (40%), Gaps = 7/77 (9%)

Query: 3   QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQH-SLSP-HTQAPSPQP 60
           Q+P  M  P  E + S P + Q     +P   PQ  + + PIQ   SL+   T   S  P
Sbjct: 65  QVP--MVSPTSEGYTSSPPLYQPSHTAEP--RPQ-TDPIDPIQASMSLNSEQTPTSSSLP 119

Query: 61  PAQPQQVPSPYHTPPPS 77
            A   QV      P  S
Sbjct: 120 AASQPQVFQTGSKPLHS 136


>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
          Length = 732

 Score = 33.4 bits (76), Expect = 0.46
 Identities = 29/114 (25%), Positives = 49/114 (42%), Gaps = 17/114 (14%)

Query: 54  QAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLK- 112
           Q   PQ P QPQQ   P     P    K+S    +                 G E++LK 
Sbjct: 621 QPQQPQQPQQPQQPQQPQQPVSPVINDKKSDAGVNVPA-------------GGIEQELKM 667

Query: 113 KEINVNDLGIDPNDSEIGDL--LSAYESEGDPNLYEDAYQDLKKFIENSLDVYK 164
           K     +  + P  SE G++  ++AYE+    N + D  Q +++  E +++V++
Sbjct: 668 KPEEEMEQQLPPGISESGEVVDMAAYEAWQQEN-HPDIQQQMQRREEVNINVHR 720


>gnl|CDD|140276 PTZ00249, PTZ00249, variable surface protein Vir28; Provisional.
          Length = 516

 Score = 33.5 bits (76), Expect = 0.47
 Identities = 25/87 (28%), Positives = 38/87 (43%), Gaps = 4/87 (4%)

Query: 10  DPPQESFHSVPI-IQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQ-QV 67
           DPP+E   +V     + +S +  P  P+H +  SP         T+ P+P   + PQ   
Sbjct: 216 DPPREEQKAVTAHAHRRISGEARP--PKHISFSSPHAHGRPPVETRPPNPVSVSSPQAHG 273

Query: 68  PSPYHTPPPSQTSKQSGKAPDKNTLIT 94
             P  T  P   +  S KA D+N + T
Sbjct: 274 RHPGETHTPPLVTVPSSKAHDRNPVQT 300


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 32.7 bits (75), Expect = 0.58
 Identities = 17/66 (25%), Positives = 21/66 (31%), Gaps = 1/66 (1%)

Query: 26  LSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGK 85
           LS QQ  V+   QN          +P T APS      P    +    P    T K +  
Sbjct: 153 LSQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAK-VPATAETHPTPPQKPATKKPAVN 211

Query: 86  APDKNT 91
                T
Sbjct: 212 HHKTAT 217


>gnl|CDD|218735 pfam05760, IER, Immediate early response protein (IER).  This
           family consists of several eukaryotic immediate early
           response (IER) 2 and 5 proteins. The role of IER5 is
           unclear although it play an important role in mediating
           the cellular response to mitogenic signals. Again,
           little is known about the function of IER2 although it
           is thought to play a role in mediating the cellular
           responses to a variety of extracellular signals.
          Length = 272

 Score = 32.6 bits (74), Expect = 0.63
 Identities = 12/49 (24%), Positives = 18/49 (36%)

Query: 35  PQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
              Q M+ P +Q   +P    PS   P  P +  SP    P  + +   
Sbjct: 60  QAEQPMVPPPEQQPGAPRLHPPSGAEPPLPARAASPEEEQPAPEPAFYR 108


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 32.3 bits (74), Expect = 0.63
 Identities = 14/77 (18%), Positives = 23/77 (29%), Gaps = 13/77 (16%)

Query: 10  DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
           DP   +  + P+  +    + P   P                    P PQ   +    P 
Sbjct: 86  DPATVAPPNTPVEPEPAPVEPPKPKPV-------------EKPKPKPKPQQKVEAPPAPK 132

Query: 70  PYHTPPPSQTSKQSGKA 86
           P   P   + +  +GKA
Sbjct: 133 PEPKPVVEEKAAPTGKA 149



 Score = 31.5 bits (72), Expect = 1.2
 Identities = 18/75 (24%), Positives = 30/75 (40%), Gaps = 6/75 (8%)

Query: 18  SVPIIQQTLSPQQPP----VSPQHQNMMSP-IQQHSLSPHTQAPSPQPPAQPQQVPSPYH 72
            +P   Q L P QPP     + +  +  +P +   +++P      P+P       P P  
Sbjct: 55  MMPAATQAL-PTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVE 113

Query: 73  TPPPSQTSKQSGKAP 87
            P P    +Q  +AP
Sbjct: 114 KPKPKPKPQQKVEAP 128


>gnl|CDD|176758 cd08780, Death_TRADD, Death Domain of Tumor Necrosis Factor
           Receptor 1-Associated Death Domain protein.  Death
           domain (DD) of TRADD (TNF Receptor 1-Associated Death
           Domain or TNFRSF1A-associated via death domain) protein.
           TRADD is a central signaling adaptor for TNF-receptor 1
           (TNFR1), mediating activation of Nuclear Factor -kappaB
           (NF-kB) and c-Jun N-terminal kinase (JNK), as well as
           caspase-dependent apoptosis. It also carries important
           immunological roles including germinal center formation,
           DR3-mediated T-cell stimulation, and TNFalpha-mediated
           inflammatory responses. In general, DDs are
           protein-protein interaction domains found in a variety
           of domain architectures. Their common feature is that
           they form homodimers by self-association or heterodimers
           by associating with other members of the DD superfamily
           including CARD (Caspase activation and recruitment
           domain), DED (Death Effector Domain), and PYRIN. They
           serve as adaptors in signaling pathways and can recruit
           other proteins into signaling complexes.
          Length = 90

 Score = 30.6 bits (69), Expect = 0.66
 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 3/33 (9%)

Query: 125 NDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIE 157
            D  I +L   Y+ EG   LYE AYQ L++FI+
Sbjct: 31  RDPAIDNLAYEYDREG---LYEQAYQLLRRFIQ 60


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 32.8 bits (75), Expect = 0.70
 Identities = 19/87 (21%), Positives = 26/87 (29%), Gaps = 15/87 (17%)

Query: 15  SFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP-----------PAQ 63
             H    + Q    +QP +    Q    P +  SL  H    S              P+ 
Sbjct: 95  HVHETLKVSQA-PGEQPKLPRAPQLPSHPRRHRSLPHHLHHSSKHRNRLHARRSDVLPSL 153

Query: 64  PQQVPSPYHT-PPPSQTSKQSGKAPDK 89
                 P H  P PS    Q  ++P K
Sbjct: 154 SN--SRPVHELPKPSTPVLQPRRSPRK 178


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 32.7 bits (75), Expect = 0.76
 Identities = 11/69 (15%), Positives = 19/69 (27%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
            P     + P       P     S Q     +   + + +P    P+   PA     P+ 
Sbjct: 440 APPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAA 499

Query: 71  YHTPPPSQT 79
              P  +  
Sbjct: 500 PAAPAGADD 508



 Score = 32.3 bits (74), Expect = 1.2
 Identities = 17/65 (26%), Positives = 23/65 (35%), Gaps = 4/65 (6%)

Query: 28  PQQPPVSPQHQNMMSPIQQHS----LSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
              P  +PQ     +P          +P   APSP P A P   P+P     P  T+  +
Sbjct: 422 APAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPA 481

Query: 84  GKAPD 88
              P 
Sbjct: 482 PAPPA 486



 Score = 31.9 bits (73), Expect = 1.3
 Identities = 16/65 (24%), Positives = 21/65 (32%)

Query: 23  QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQ 82
            Q      P  +P      +P       P   APS QP   P   P P   P P+  +  
Sbjct: 429 PQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAP 488

Query: 83  SGKAP 87
           +  A 
Sbjct: 489 APAAA 493



 Score = 31.1 bits (71), Expect = 2.2
 Identities = 14/64 (21%), Positives = 22/64 (34%), Gaps = 3/64 (4%)

Query: 27  SPQQPPVSPQHQNMMSPIQQHSLSPH---TQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
           +P  P  +       SP    + S       A +P+P A P   P     P  +  +  +
Sbjct: 440 APPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAA 499

Query: 84  GKAP 87
             AP
Sbjct: 500 PAAP 503



 Score = 30.7 bits (70), Expect = 3.6
 Identities = 8/58 (13%), Positives = 13/58 (22%)

Query: 27  SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSG 84
           +    P +       +P    +  P         P  P         P P   +  S 
Sbjct: 407 AAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464



 Score = 29.6 bits (67), Expect = 7.9
 Identities = 7/46 (15%), Positives = 12/46 (26%), Gaps = 1/46 (2%)

Query: 42  SPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
           +     + +P                P+P   P P+        AP
Sbjct: 397 AAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAP-APAPAP 441


>gnl|CDD|180413 PRK06128, PRK06128, oxidoreductase; Provisional.
          Length = 300

 Score = 32.1 bits (73), Expect = 0.79
 Identities = 18/49 (36%), Positives = 22/49 (44%), Gaps = 5/49 (10%)

Query: 51 PHTQAPSPQPPAQPQQVPSPYH--TPPP---SQTSKQSGKAPDKNTLIT 94
          P TQ P P  P Q Q+ P   H   P P    Q+ K  G+   +  LIT
Sbjct: 13 PLTQYPQPPFPEQTQEAPGTIHEMQPKPDHGEQSYKGFGRLQGRKALIT 61


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
           Provisional.
          Length = 172

 Score = 31.4 bits (71), Expect = 0.83
 Identities = 12/41 (29%), Positives = 15/41 (36%)

Query: 36  QHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPP 76
             Q        +  +P  QA  PQ  AQ Q  P+P    P 
Sbjct: 118 DDQGGQGGGNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQ 158


>gnl|CDD|188060 TIGR00531, BCCP, acetyl-CoA carboxylase, biotin carboxyl carrier
          protein.  This model is designed to identify biotin
          carboxyl carrier protein as a peptide of acetyl-CoA
          carboxylase. Scoring below the trusted cutoff is a
          related protein encoded in a region associated with
          polyketide synthesis in the prokaryote
          Saccharopolyspora hirsuta, and a reported
          chloroplast-encoded biotin carboxyl carrier protein
          that may be highly derived from the last common
          ancestral sequence. Scoring below the noise cutoff are
          biotin carboxyl carrier domains of other enzymes such
          as pyruvate carboxylase.The gene name is accB or fabE
          [Fatty acid and phospholipid metabolism, Biosynthesis].
          Length = 156

 Score = 31.3 bits (71), Expect = 0.87
 Identities = 13/58 (22%), Positives = 19/58 (32%), Gaps = 10/58 (17%)

Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQ---VPSP-----YHTPPP 76
          S  Q   +P       P    + +P     +P P    ++   V SP     Y  P P
Sbjct: 42 SAVQQAAAPVP--AQVPAAPSAQAPAPAVCAPAPAKADKKGHFVRSPMVGTFYRAPSP 97


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 32.3 bits (73), Expect = 0.87
 Identities = 25/107 (23%), Positives = 42/107 (39%), Gaps = 11/107 (10%)

Query: 13  QESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLS--PHTQAPSPQPPAQPQQVPSP 70
           Q +  + P   +  +PQ  P S + +   +P  + + +  P T+  +PQ         S 
Sbjct: 26  QAANATTPSSTKVEAPQSTPPSTKVE---APQSKPNATTPPSTKVEAPQQTPNATTPSST 82

Query: 71  YHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKEINV 117
               P S T+KQ     +        K LR Y  + + E  K EI +
Sbjct: 83  KVETPQSPTTKQVPTEINPK-----FKDLRAYYTKPSLE-FKNEIGI 123


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
           factor N terminal domain.  The N terminus of the PEA3
           transcription factors is implicated in transactivation
           and in inhibition of DNA binding. Transactivation is
           potentiated by activation of the Ras/MAP kinase and
           protein kinase A signalling cascades. The N terminal
           region contains conserved MAP kinase phosphorylation
           sites.
          Length = 336

 Score = 32.2 bits (73), Expect = 0.92
 Identities = 14/79 (17%), Positives = 19/79 (24%)

Query: 6   SLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQ 65
           S  D  P   F          SP  P  + +      P+   S          Q P  P 
Sbjct: 120 SAYDRKPASGFKPPTPPSTPCSPVNPQETVRQLQPSGPLSNSSPPSPHTPLPNQSPLPPP 179

Query: 66  QVPSPYHTPPPSQTSKQSG 84
                   P   +  +Q  
Sbjct: 180 MSSPDSSYPSEHRFQRQLS 198


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 226

 Score = 31.8 bits (72), Expect = 0.94
 Identities = 21/94 (22%), Positives = 33/94 (35%), Gaps = 12/94 (12%)

Query: 10  DPPQ-----ESFHSVPIIQQTLS-PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQ 63
            PP+     +    +P + Q ++ P QPP        ++   Q +      +  PQP AQ
Sbjct: 43  LPPKPQGDRDEPRVLPAVVQVVALPTQPPEG------VAQEIQDAGDAAAASVDPQPVAQ 96

Query: 64  PQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLK 97
           P    +P   P  +QT K               K
Sbjct: 97  PPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAK 130



 Score = 29.5 bits (66), Expect = 5.7
 Identities = 17/70 (24%), Positives = 26/70 (37%), Gaps = 11/70 (15%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
           P  +  PP ES           +P   PV+ Q    + P +Q         P+P+P  +P
Sbjct: 91  PQPVAQPPVES-----------TPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKP 139

Query: 65  QQVPSPYHTP 74
              P+   T 
Sbjct: 140 VAEPAAAPTG 149


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 263 and 318 amino acids in length.
           There is a conserved SFT sequence motif. MSA-2 is a
           plasma membrane glycoprotein which can be found in
           Babesia bovis species.
          Length = 201

 Score = 31.7 bits (72), Expect = 0.95
 Identities = 13/57 (22%), Positives = 21/57 (36%), Gaps = 4/57 (7%)

Query: 24  QTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTS 80
           +  SP+    S Q  +   P    +    +  P+P  PA       P  TP  + +S
Sbjct: 140 EYYSPKHS--SSQGTSTTRPSDGSATPNTSAPPTPGNPAAQP--EKPAETPKGNGSS 192


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 30.4 bits (69), Expect = 1.0
 Identities = 12/49 (24%), Positives = 15/49 (30%)

Query: 28  PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPP 76
            Q     P             + P    P+    A PQ+ P PY  PP 
Sbjct: 63  GQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPGPPQ 111


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 32.1 bits (73), Expect = 1.0
 Identities = 24/110 (21%), Positives = 31/110 (28%), Gaps = 16/110 (14%)

Query: 1   MDQLPSLMDDPPQESFHSVPII----QQTLSPQQPPVSPQHQNMMSPIQ-QHSLSPHTQA 55
           M QLP      P       P       Q     QP   P+   M +P+     L P+  A
Sbjct: 382 MRQLPMGS---PMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLA 438

Query: 56  PSPQ---PPAQPQQVPS-----PYHTPPPSQTSKQSGKAPDKNTLITMLK 97
           P      P    Q         P   PP  Q+   S   P   +  +   
Sbjct: 439 PMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGG 488



 Score = 31.7 bits (72), Expect = 1.4
 Identities = 16/77 (20%), Positives = 26/77 (33%)

Query: 20  PIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQT 79
           P +    +P  P    +   +       + S + Q  + +PP QP   P  Y + P SQ 
Sbjct: 417 PRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQD 476

Query: 80  SKQSGKAPDKNTLITML 96
             Q      +      L
Sbjct: 477 LPQPQSTASQGGQNKKL 493


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 31.7 bits (72), Expect = 1.0
 Identities = 20/87 (22%), Positives = 33/87 (37%), Gaps = 10/87 (11%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQ----QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
           P+    P  E     P  Q     +      P  P+ +    P ++         P P+P
Sbjct: 52  PTEEPQPEPE----PPEEQPKPPTEPETPPEPTPPKPKEKPKPEKK--PKKPKPKPKPKP 105

Query: 61  PAQPQQVPSPYHTPPPSQTSKQSGKAP 87
             +P+  P P    PPS+T+ ++  AP
Sbjct: 106 KPKPKVKPQPKPKKPPSKTAAKAPAAP 132


>gnl|CDD|238879 cd01841, NnaC_like, NnaC (CMP-NeuNAc synthetase) _like subfamily of
           SGNH_hydrolases, a diverse family of lipases and
           esterases. The tertiary fold of the enzyme is
           substantially different from that of the alpha/beta
           hydrolase family and unique among all known hydrolases;
           its active site closely resembles two of the three
           components of typical Ser-His-Asp(Glu) triad from other
           serine hydrolases. E. coli NnaC appears to be involved
           in polysaccharide synthesis.
          Length = 174

 Score = 31.1 bits (71), Expect = 1.0
 Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 5/46 (10%)

Query: 115 INVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSL 160
           I++ND+ +D      G+L   Y ++G  +     YQ L + +E  L
Sbjct: 134 IDLNDVLVDEF----GNLKKEYTTDG-LHFNPKGYQKLLEILEEYL 174


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 30.8 bits (70), Expect = 1.1
 Identities = 19/81 (23%), Positives = 28/81 (34%), Gaps = 2/81 (2%)

Query: 18  SVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPS 77
           S+P ++       PP          P          +A SP PP  P + P+      P+
Sbjct: 46  SIPGLEDVTVQTTPPPPASAITNGGPPPP--PPARAEAASPPPPEAPAEPPAEPEPEAPA 103

Query: 78  QTSKQSGKAPDKNTLITMLKL 98
           + +    K P       MLKL
Sbjct: 104 ENTVTVAKDPRYAKYFKMLKL 124


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 32.3 bits (73), Expect = 1.2
 Identities = 21/157 (13%), Positives = 36/157 (22%), Gaps = 23/157 (14%)

Query: 12  PQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQ-------QHSLSPHTQAPSPQPPAQP 64
             ++       Q+   P   P S   ++ M+             L       SP     P
Sbjct: 371 APQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGP 430

Query: 65  QQVPSPYHTPP-PSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKEINVNDLGID 123
           + V      P  P                             G E + K+   + +    
Sbjct: 431 EPVGPVPPQPTNPYVMPISMANMVYPGHPQE----------HGHERKRKRGGELKE---- 476

Query: 124 PNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSL 160
               E   L+   + E +    E      K  I+   
Sbjct: 477 -ELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIA 512


>gnl|CDD|220023 pfam08801, Nucleoporin_N, Nup133 N terminal like.  Nup133 is a
           nucleoporin that is crucial for nuclear pore complex
           (NPC) biogenesis. The N terminal forms a seven-bladed
           beta propeller structure. This family now contains other
           sized nucleoporins, including Nup155, Nup8, Nuo132,
           Nup15 and Nup170.
          Length = 411

 Score = 32.1 bits (73), Expect = 1.2
 Identities = 22/116 (18%), Positives = 42/116 (36%), Gaps = 12/116 (10%)

Query: 598 GRFLASAGGDGNVHMWDLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCS--VKLW 655
           GR L +    G + +WDLS       L    D+   +      +I  S+    S  +K+ 
Sbjct: 203 GRLLYTLTSKGTIRVWDLSPSGE--KLVREVDIYEIILRDLQESIPKSASTLSSKSLKIL 260

Query: 656 DFAKLSDEVGTEEVNLSHSADIKNSSLAYLVRSYKTKASPLLHLIFSRRNILLRGP 711
           D + +       E +L     I ++     VR Y +    L+ +     ++ +   
Sbjct: 261 DISPIPS----GESSLLLLVAITSNG----VRLYFSLLLTLVSVRLPPSSVTISST 308


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 31.6 bits (72), Expect = 1.2
 Identities = 18/69 (26%), Positives = 26/69 (37%), Gaps = 14/69 (20%)

Query: 304 DDDEEEAGEGGDKPKKKKAKKDSM--------FSKKTKSDPNAPPVDRIPLPPLKDSEKI 355
            DD+EE  E   + +KK  K+ S           KK   +P+           L D  + 
Sbjct: 14  GDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDT------SFLPDKARE 67

Query: 356 DKLKALREA 364
           +K   LRE 
Sbjct: 68  EKEAELREE 76


>gnl|CDD|151572 pfam11128, Nucleocap_ssRNA, Plant viral coat protein nucleocapsid. 
           This family of nucleocapsid proteins is from ssRNA
           negative-strand viruses of plant origin.
          Length = 180

 Score = 30.9 bits (70), Expect = 1.2
 Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 20/87 (22%)

Query: 321 KAKKDSMFSKKTKSDPNAPPVDRIP---LPPLKDSEKIDKLKALR----------EATKR 367
           KA + ++F++K + D  + P    P      ++ +EK++K KA+           EA K+
Sbjct: 63  KAIRYAIFAQKFEKDKLSEPTGATPESIKEYMEKNEKLEKAKAIVDFLCSLGSNFEAQKK 122

Query: 368 V-TLGPESLPSIACYTLLNARQQVTCA 393
           +  L PE  PS   +TL     Q+TCA
Sbjct: 123 MHPLSPER-PSRKNFTL-----QLTCA 143


>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
           peptidase.  Corresponds to Merops family C37.
           Norwalk-like viruses (NLVs), including the Southampton
           virus, cause acute non-bacterial gastroenteritis in
           humans. The NLV genome encodes three open reading frames
           (ORFs). ORF1 encodes a polyprotein, which is processed
           by the viral protease into six proteins.
          Length = 535

 Score = 32.1 bits (73), Expect = 1.3
 Identities = 19/77 (24%), Positives = 29/77 (37%), Gaps = 18/77 (23%)

Query: 287 YYGMPREPDLQNTIAVPDDDEEEAGEGGDKPKKKKAKKDSMFSKKTKSDPNAPPVDRIPL 346
           YY      +      +   D    G+ G K KK + KK + FS++  SD           
Sbjct: 208 YYDGCESEEDDEGWYIEPQDATPEGKKG-KNKKGRGKKHNAFSRRGLSD----------- 255

Query: 347 PPLKDSEKIDKLKALRE 363
                 E+ D+ K +RE
Sbjct: 256 ------EEYDEYKKIRE 266


>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
           factor.  RNA polymerase I-specific
           transcription-initiation factor Rrn6 and Rrn7 represent
           components of a multisubunit transcription factor
           essential for the initiation of rDNA transcription by
           Pol I. These proteins are found in fungi.
          Length = 753

 Score = 32.0 bits (73), Expect = 1.3
 Identities = 14/81 (17%), Positives = 25/81 (30%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
           P + D    ES   +P I+ +    Q           +P  + + S             P
Sbjct: 654 PDVTDSSQLESQSQIPTIRSSQQVSQTRKGGSSVVPSAPAPRLAQSSQPPTSQSSSDLPP 713

Query: 65  QQVPSPYHTPPPSQTSKQSGK 85
               +   +  P Q+  +SG 
Sbjct: 714 SSSQAFSLSDLPMQSQSESGL 734


>gnl|CDD|219753 pfam08226, DUF1720, Domain of unknown function (DUF1720).  This
          domain is found in different combinations with cortical
          patch components EF hand, SH3 and ENTH and is therefore
          likely to be involved in cytoskeletal processes. This
          family contains many hypothetical proteins.
          Length = 73

 Score = 29.0 bits (65), Expect = 1.5
 Identities = 15/60 (25%), Positives = 18/60 (30%), Gaps = 3/60 (5%)

Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA-QPQQ--VPSPYHTPPPSQT 79
          QQT         PQ Q  + P     + P       Q    QPQQ           P+ T
Sbjct: 4  QQTGYQPPQQQQPQQQGPLQPQPTGFMQPQPTGFGQQQQGLQPQQTGFQPQAGQQMPTGT 63


>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
          Length = 499

 Score = 31.6 bits (71), Expect = 1.5
 Identities = 18/69 (26%), Positives = 27/69 (39%), Gaps = 3/69 (4%)

Query: 22  IQQTLSPQQPPVSPQHQNMMSP--IQQHSL-SPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
             +      PP++   Q  M P  +QQ  + +PH   P  QPP   Q    P      ++
Sbjct: 405 APKNDHHLLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQPPHVQQPRVLPSTDGASNE 464

Query: 79  TSKQSGKAP 87
             K S + P
Sbjct: 465 APKPSAQEP 473



 Score = 29.7 bits (66), Expect = 6.1
 Identities = 22/79 (27%), Positives = 31/79 (39%), Gaps = 8/79 (10%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQH--QNMMSP--IQQHSLSPHTQAPS---PQPPAQ 63
                  S  +  Q L  QQ P+   H     M P  +QQ  + P T   S   P+P AQ
Sbjct: 412 LLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQPPHVQQPRVLPSTDGASNEAPKPSAQ 471

Query: 64  -PQQVPSPYHTPPPSQTSK 81
            P  + + +   P S+  K
Sbjct: 472 EPVHIDASFAQDPVSKIQK 490


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 31.6 bits (71), Expect = 1.6
 Identities = 20/78 (25%), Positives = 31/78 (39%), Gaps = 10/78 (12%)

Query: 8   MDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQV 67
           +  P QE        Q++L+    P   Q Q+  +     S +P T+  +P     PQ +
Sbjct: 95  ISQPKQEE-------QKSLNISATPAPKQEQSQTTT---ESTTPKTKVTTPPSTNTPQPM 144

Query: 68  PSPYHTPPPSQTSKQSGK 85
            S     P S T KQ+  
Sbjct: 145 QSTKSDTPQSPTIKQAQT 162


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 31.7 bits (72), Expect = 1.6
 Identities = 11/60 (18%), Positives = 17/60 (28%)

Query: 19  VPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
            P   +  +    PV P       P    + +   + P  +    P   P P   P P  
Sbjct: 365 APQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHT 424


>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
           Provisional.
          Length = 319

 Score = 31.2 bits (70), Expect = 2.0
 Identities = 13/73 (17%), Positives = 23/73 (31%), Gaps = 6/73 (8%)

Query: 22  IQQTLSPQQPPVSPQ--HQNMMSPIQQHSLSPHTQAPSPQPPAQPQQ--VPSPYHT--PP 75
           + +     Q   + +   Q      Q   +    +   P    QP Q  + +P HT    
Sbjct: 162 LAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTTAQS 221

Query: 76  PSQTSKQSGKAPD 88
             Q +    +A D
Sbjct: 222 KPQQAAPVTRAAD 234



 Score = 30.0 bits (67), Expect = 4.1
 Identities = 17/84 (20%), Positives = 25/84 (29%), Gaps = 3/84 (3%)

Query: 1   MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSP-- 58
           ++Q+ + M   P +    VP  +QT   +Q  +  Q Q      QQ              
Sbjct: 122 LEQMQADMRQQPTQ-LVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQ 180

Query: 59  QPPAQPQQVPSPYHTPPPSQTSKQ 82
           Q     Q  P           S Q
Sbjct: 181 QQTRTSQAAPVQAQPRQSKPASTQ 204


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
          associated protein 2 has a highly conserved sequence
          throughout evolution including a conserved polyproline
          region and several SH2/SH3 binding sites. It occurs as
          a single copy gene with a four-exon organisation and is
          located on chromosome 12. It encodes a ubiquitously
          expressed protein and binds to DAZ and DAZL1 through
          DAZ repeats.
          Length = 136

 Score = 29.8 bits (67), Expect = 2.2
 Identities = 13/59 (22%), Positives = 17/59 (28%), Gaps = 6/59 (10%)

Query: 20 PIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
          P     L      V PQ Q   +       +P+       P AQ   V      PP + 
Sbjct: 5  PPAYSELYQPSYVVPPQAQMPQAS------APYPGPSMYLPMAQVMAVGPQSSHPPMAY 57


>gnl|CDD|177648 PHA03420, PHA03420, E4 protein; Provisional.
          Length = 137

 Score = 30.0 bits (67), Expect = 2.2
 Identities = 14/64 (21%), Positives = 16/64 (25%), Gaps = 5/64 (7%)

Query: 23  QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPS-----PQPPAQPQQVPSPYHTPPPS 77
            +T  P      P H         H L      P      P  P  PQ   +    P   
Sbjct: 40  HRTWDPADYLDRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPHHPLSPQCQQNVTGKPVLQ 99

Query: 78  QTSK 81
           Q  K
Sbjct: 100 QEPK 103


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 30.8 bits (69), Expect = 2.2
 Identities = 16/61 (26%), Positives = 20/61 (32%)

Query: 26 LSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGK 85
          L    PP +         IQ  +     Q  +PQ  A P Q P P  T   +    Q   
Sbjct: 15 LVANVPPAAAPTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHVVQQAP 74

Query: 86 A 86
          A
Sbjct: 75 A 75



 Score = 30.8 bits (69), Expect = 2.5
 Identities = 24/141 (17%), Positives = 38/141 (26%), Gaps = 9/141 (6%)

Query: 12  PQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPY 71
           P    + V   Q  + P QP    Q        Q    S  T     Q PAQP     P 
Sbjct: 25  PTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHVVQQAPAQPAPAAPPA 84

Query: 72  HTPPPSQTSKQSG---KAPDKNTLITMLKLLRQYNLRGTEEQLKKEINVNDLGIDPNDSE 128
                 +  +        P+   + T+   L        + QL   ++  +     +  +
Sbjct: 85  AGAALPEALEVPPPPAFTPNGEIVGTLAGNLEG------DPQLAPSVSYLEAFSGLDKLD 138

Query: 129 IGDLLSAYESEGDPNLYEDAY 149
                       DP   +  Y
Sbjct: 139 TVRAFGKAAENRDPRFIDTHY 159


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.1 bits (70), Expect = 2.2
 Identities = 15/73 (20%), Positives = 24/73 (32%), Gaps = 3/73 (4%)

Query: 20  PIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQT 79
           P +++       P   +H     P +     P    PSP     P+Q P P     P   
Sbjct: 388 PDVKKKAPEPDLPQPDRHPGPAKP-EAPGARPAE-LPSPASAPTPEQQP-PVARSAPLPP 444

Query: 80  SKQSGKAPDKNTL 92
           S Q+    +  + 
Sbjct: 445 SPQASAPRNVASG 457


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 30.8 bits (70), Expect = 2.2
 Identities = 24/74 (32%), Positives = 32/74 (43%), Gaps = 9/74 (12%)

Query: 303 PDDDEEEAGEGG----DKPKKKKAKKDSM----FSKKTKSDPNAPP-VDRIPLPPLKDSE 353
           P+ D+EE GE      ++ KKKK  K         KK K DP A          P K SE
Sbjct: 66  PESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSE 125

Query: 354 KIDKLKALREATKR 367
           +I     L ++ +R
Sbjct: 126 RISWAPTLLDSPRR 139


>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein;
           Provisional.
          Length = 177

 Score = 30.2 bits (68), Expect = 2.2
 Identities = 11/53 (20%), Positives = 16/53 (30%), Gaps = 1/53 (1%)

Query: 23  QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPP 75
           Q   +P    +  Q Q      QQ          +   P Q     +P + PP
Sbjct: 117 QGGGAPAGGNIGGQQQGGWGQPQQPQGGNQFSGGAQSRPQQSAP-AAPSNEPP 168


>gnl|CDD|214939 smart00960, Robl_LC7, Roadblock/LC7 domain.  This family includes
           proteins that are about 100 amino acids long and have
           been shown to be related. Members of this family of
           proteins are associated with both flagellar outer arm
           dynein and Drosophila and rat brain cytoplasmic dynein.
           It is proposed that roadblock/LC7 family members may
           modulate specific dynein functions. This family also
           includes Golgi-associated MP1 adapter protein and MglB
           from Myxococcus xanthus, a protein involved in gliding
           motility. However the family also includes members from
           non-motile bacteria such as Streptomyces coelicolor,
           suggesting that the protein may play a structural or
           regulatory role.
          Length = 88

 Score = 28.7 bits (65), Expect = 2.2
 Identities = 16/31 (51%), Positives = 17/31 (54%)

Query: 619 HLLTLLTGHTDVVRALCFSRDGNILASSGQD 649
            LL LL     V  AL  SRDG +LASSG  
Sbjct: 3   ELLDLLARVPGVRGALVVSRDGLLLASSGLS 33


>gnl|CDD|233777 TIGR02205, septum_zipA, cell division protein ZipA.  This model
           represents the full length of bacterial cell division
           protein ZipA. The N-terminal hydrophobic stretch is an
           uncleaved signal-anchor sequence. This is followed by an
           unconserved, variable length, low complexity region, and
           then a conserved C-terminal region of about 140 amino
           acids (see pfam04354) that interacts with the
           tubulin-like cell division protein FtsZ [Cellular
           processes, Cell division].
          Length = 284

 Score = 30.6 bits (69), Expect = 2.3
 Identities = 14/76 (18%), Positives = 19/76 (25%), Gaps = 4/76 (5%)

Query: 9   DDPPQESF----HSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
           DDP  E      +S     +      P    QH   +S +  +             P QP
Sbjct: 46  DDPTSEEMVQPDNSPNTRVERGEHPIPQPREQHLPSISELVAYQRDKSVDDEEASIPMQP 105

Query: 65  QQVPSPYHTPPPSQTS 80
            Q       P      
Sbjct: 106 TQQQYDMPQPNNVAQQ 121



 Score = 29.5 bits (66), Expect = 5.8
 Identities = 17/80 (21%), Positives = 26/80 (32%), Gaps = 5/80 (6%)

Query: 3   QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA 62
            +  L+     +S             QQ    PQ  N    + Q ++ P      P+   
Sbjct: 81  SISELVAYQRDKSVDDEEASIPMQPTQQQYDMPQPNN----VAQQTVEPRVAKSLPEASP 136

Query: 63  QPQQVPSPYH-TPPPSQTSK 81
           Q ++V      T PP Q  K
Sbjct: 137 QEEEVGKNLEVTAPPKQKDK 156


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl
          carrier protein subunit; Validated.
          Length = 130

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 12/44 (27%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
          +    P  P    +  P +  S     QAP P   A    +PSP
Sbjct: 25 AQAAAPAQPASTPVPVPTEA-SPQVEAQAPQPAAAAGADAMPSP 67


>gnl|CDD|233990 TIGR02740, TraF-like, TraF-like protein.  This protein is related
           to the F-type conjugation system pilus assembly proteins
           TraF (TIGR02739)and TrbB (TIGR02738) both of which
           exhibit a thioredoxin fold. The protein represented by
           this model has the same length and architecture as TraF,
           but lacks the CXXC-motif found in TrbB and believed to
           be responsible for the disulfide isomerase activity of
           that protein.
          Length = 271

 Score = 30.5 bits (69), Expect = 2.8
 Identities = 20/109 (18%), Positives = 39/109 (35%), Gaps = 5/109 (4%)

Query: 56  PSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLI--TMLKLLRQYNLRGTEEQLKK 113
             P P  + + +  P    PP+Q   +   AP     +   + +L  +     T E ++ 
Sbjct: 46  EDPPPKEEEEPL-EPAPPAPPAQQPAREELAPFSVAWLRENLPELRDKAIDNPTPENVRA 104

Query: 114 EINVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLDV 162
            +    + +D   S   D +S      DP L E   + +     ++ D 
Sbjct: 105 YLEAQRIMLD-KASRFAD-VSQRVIWTDPILDETLRRPVSTLALDAHDT 151


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 30.6 bits (69), Expect = 3.0
 Identities = 10/31 (32%), Positives = 12/31 (38%)

Query: 53  TQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
                  P A P   P+P  +PP S  S  S
Sbjct: 80  GTGKVAPPAATPTSAPTPTPSPPASPASGMS 110



 Score = 29.1 bits (65), Expect = 8.8
 Identities = 8/47 (17%), Positives = 19/47 (40%)

Query: 45  QQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNT 91
           +  + +     P+  P + P   PSP  +P    ++  +    +K+ 
Sbjct: 76  KPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSP 122


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 28.4 bits (64), Expect = 3.1
 Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 13/58 (22%)

Query: 316 KPKKKKAKKDSMFSKKTKSDPNAPPVDRIP-LPPLKDS------EKIDKLKALREATK 366
           +  +K+AKK+  +  K K DP       IP   P K+       EK  K +  +E  K
Sbjct: 15  RKLRKEAKKNPTWKSKKKKDPG------IPNSFPFKEEILEEIEEKKRKQEEEKERRK 66


>gnl|CDD|227400 COG5068, ARG80, Regulator of arginine metabolism and related MADS
           box-containing transcription factors [Transcription].
          Length = 412

 Score = 30.4 bits (68), Expect = 3.2
 Identities = 16/91 (17%), Positives = 25/91 (27%), Gaps = 14/91 (15%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQ----- 65
           P     H+       +  Q P    Q     S  + H   PH+   + +PPA+       
Sbjct: 185 PMGSFQHNGSPQTNFIPLQNPQT--QQYQQHSSRKDHPTVPHSNTNNGRPPAKFMIPELH 242

Query: 66  --QVPSPYHTPPPSQT-----SKQSGKAPDK 89
                    +   S +     S  S    D 
Sbjct: 243 SSHSTLDLPSDFISDSGFPNQSSTSIFPLDS 273


>gnl|CDD|216839 pfam02002, TFIIE_alpha, TFIIE alpha subunit.  The general
           transcription factor TFIIE has an essential role in
           eukaryotic transcription initiation together with RNA
           polymerase II and other general factors. Human TFIIE
           consists of two subunits TFIIE-alpha and TFIIE-beta, and
           joins the pre-initiation complex after RNA polymerase II
           and TFIIF. This family consists of the conserved amino
           terminal region of eukaryotic TFIIE-alpha and proteins
           from archaebacteria that are presumed to be TFIIE-alpha
           subunits also Archaeoglobus fulgidus tfe.
          Length = 105

 Score = 28.8 bits (65), Expect = 3.3
 Identities = 16/59 (27%), Positives = 24/59 (40%), Gaps = 15/59 (25%)

Query: 107 TEEQLKKEINVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLDVYKY 165
           TEE L +      LGID N  E+  LL          LY+      ++  ++    Y+Y
Sbjct: 29  TEEDLAEL-----LGIDLN--ELRKLLY--------RLYDARLVKYRRRRDDETGWYRY 72


>gnl|CDD|187832 cd09701, Cas10_III, CRISPR/Cas system-associated protein Cas10.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Multidomain protein with permuted HD nuclease domain,
           inactivated palm domain and Zn-ribbon; signature gene
           for type III.
          Length = 909

 Score = 30.5 bits (69), Expect = 3.7
 Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 11/58 (18%)

Query: 115 INVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLDVYKYELGLVLY 172
           I  N LG+     +I D+L+  E      L E+    +KK +E       Y LG  +Y
Sbjct: 231 IRFNGLGLLLKGLKIPDILARKE------LLEEILDRVKKLLE-----ETYPLGNEVY 277


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 30.4 bits (69), Expect = 3.7
 Identities = 13/79 (16%), Positives = 26/79 (32%), Gaps = 3/79 (3%)

Query: 8   MDDPPQESFHSVPIIQQTLSPQQPPVSPQHQN--MMSPIQQHSLSPHTQAPSPQPPAQPQ 65
           +   P      +P+   TL  + P  +P   +  + +P            P+P  P+   
Sbjct: 209 VQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAPS-SA 267

Query: 66  QVPSPYHTPPPSQTSKQSG 84
              +P    P +  +  S 
Sbjct: 268 PATAPAAAAPQAAATSSSA 286


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 30.6 bits (69), Expect = 4.2
 Identities = 10/40 (25%), Positives = 17/40 (42%)

Query: 53  TQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTL 92
            + P P PPA  Q+  +P    P +  +  +  AP +   
Sbjct: 75  PEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVE 114


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 30.6 bits (69), Expect = 4.2
 Identities = 8/47 (17%), Positives = 13/47 (27%), Gaps = 1/47 (2%)

Query: 51  PHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLK 97
           P    P+P     P +   P    PP           +    +  L+
Sbjct: 916 PGAGPPAPPQAVPPPRTTQP-PAAPPRGPDVPPAAVAELRETLADLR 961


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 30.4 bits (69), Expect = 4.3
 Identities = 15/72 (20%), Positives = 21/72 (29%), Gaps = 3/72 (4%)

Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
             +S  S      + S   PP S   +         S   HT +  P P       PS 
Sbjct: 8  SRSQSHASSS--SSSSSQSSPPSSTSPRPRRRKPSASS-LLHTPSILPLPKLSSPSPPSV 64

Query: 71 YHTPPPSQTSKQ 82
             P  +  + Q
Sbjct: 65 TLPPAATTQTPQ 76


>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region. 
          Length = 57

 Score = 27.4 bits (61), Expect = 4.3
 Identities = 14/38 (36%), Positives = 18/38 (47%), Gaps = 3/38 (7%)

Query: 43 PIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTS 80
          P++Q+S  P     SP PP +     SP   PPP   S
Sbjct: 9  PVKQYSPPPPYYYKSPPPPVKSPVYKSP---PPPVYKS 43


>gnl|CDD|151560 pfam11116, DUF2624, Protein of unknown function (DUF2624).  This
           family is conserved in the Bacillaceae family. Several
           members are named as YqfT. The function is not known.
          Length = 84

 Score = 28.1 bits (63), Expect = 4.5
 Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 2/64 (3%)

Query: 95  MLKLLRQYNLRGTEEQLKKEIN-VNDLGIDP-NDSEIGDLLSAYESEGDPNLYEDAYQDL 152
           +LK  +QY +  T  Q K+    +    I+  N+SE   LL        P   ++  +  
Sbjct: 20  LLKYAKQYGIPITRAQAKQIAKLLRGKNINIFNESERSKLLKELAKITSPQTAQEVNELF 79

Query: 153 KKFI 156
            +F+
Sbjct: 80  AQFV 83


>gnl|CDD|234764 PRK00442, tatA, twin arginine translocase protein A; Provisional.
          Length = 92

 Score = 28.3 bits (63), Expect = 4.6
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query: 56 PSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
          P+ QP    Q VP   +T P +Q     G+A
Sbjct: 52 PAEQPAPPAQPVPPVQNTAPLNQPHTIDGQA 82


>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
          Length = 332

 Score = 30.1 bits (68), Expect = 4.6
 Identities = 14/80 (17%), Positives = 28/80 (35%), Gaps = 7/80 (8%)

Query: 10  DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSP---IQQHSLSPHTQAPSPQ---PPAQ 63
             PQ    + P   Q    ++     + Q        + +  ++   Q P+       ++
Sbjct: 61  LNPQSYVETTPPPFQQPQTEESESENEVQIQQEVEQSVDEIKITLPNQEPAYYMQNHRSE 120

Query: 64  PQQVPSPYHTPPPSQTSKQS 83
           P Q P+      P+QT+  S
Sbjct: 121 PIQ-PTQPQYQSPTQTNVAS 139


>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432).  This
          presumed domain is functionally uncharacterized. This
          domain is found in eukaryotes. This domain is about 100
          amino acids in length. This domain is found associated
          with pfam00096. This domain has two conserved sequence
          motifs: YPSPV and PSP.
          Length = 100

 Score = 28.2 bits (62), Expect = 5.0
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 42 SPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPS 77
            I  +S SP +  PSP   + P  VP+ Y +P  S
Sbjct: 12 PNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSS 47


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 29.9 bits (67), Expect = 5.4
 Identities = 14/62 (22%), Positives = 24/62 (38%), Gaps = 5/62 (8%)

Query: 29  QQPPVSPQHQNMMSPIQQHSL-SPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
              P++ + ++  +P+Q  ++ S   QAP P+P               PS     S   P
Sbjct: 288 ISTPMTEEIKSWQTPLQTPAMYSSDYQAPKPEP---IYTWEELLRERFPSDLFAISSL-P 343

Query: 88  DK 89
           D 
Sbjct: 344 DS 345


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
           attachment glycoprotein from region G of metaviruses. It
           is high in serine and threonine suggesting it is highly
           glycosylated.
          Length = 183

 Score = 29.2 bits (65), Expect = 5.4
 Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 9/68 (13%)

Query: 31  PPVSPQHQNMMSPIQQHSLSPHTQAPSPQ-------PPAQPQQVPSPYHTPPPSQTSKQS 83
           PP     +   +P      +P++Q P+ Q       P A  +       T  P  T++Q 
Sbjct: 35  PPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTTNRQQ 94

Query: 84  GKAPDKNT 91
               D++T
Sbjct: 95  --TVDRHT 100


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 29.1 bits (65), Expect = 5.6
 Identities = 17/77 (22%), Positives = 24/77 (31%), Gaps = 2/77 (2%)

Query: 3   QLPSLMDDPPQ--ESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
               +M +P    + F  VP  Q   S Q   ++ Q+Q        H      Q   P  
Sbjct: 76  MPAYMMFNPSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGA 135

Query: 61  PAQPQQVPSPYHTPPPS 77
              P     P    PP+
Sbjct: 136 GQPPGMQHMPAPALPPN 152


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 29.8 bits (67), Expect = 5.7
 Identities = 17/84 (20%), Positives = 22/84 (26%), Gaps = 8/84 (9%)

Query: 12  PQESFH---SVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPS-----PQPPAQ 63
           PQ S       P    T     PP + +        Q  +      A             
Sbjct: 300 PQSSDSRPSCFPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRA 359

Query: 64  PQQVPSPYHTPPPSQTSKQSGKAP 87
           P+  PS     PPS +   S  A 
Sbjct: 360 PRGRPSGSGQSPPSNSPLLSSLAD 383


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 30.0 bits (67), Expect = 5.7
 Identities = 20/68 (29%), Positives = 29/68 (42%), Gaps = 12/68 (17%)

Query: 27  SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
           +PQ PP +P      +P Q+        AP+PQPP  PQ  P+     P +   +Q    
Sbjct: 775 TPQPPPQAPP-----APQQRPR-----GAPTPQPP--PQAGPTSMQLMPRAAPGQQGPTK 822

Query: 87  PDKNTLIT 94
                L+T
Sbjct: 823 QILRQLLT 830



 Score = 29.3 bits (65), Expect = 9.8
 Identities = 17/70 (24%), Positives = 23/70 (32%), Gaps = 9/70 (12%)

Query: 11  PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP--PAQPQQVP 68
           PP  +              +PP            +  + +P    P P P  P  PQQ P
Sbjct: 736 PPAAAPGRARPPAAAPGRARPP-----AAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRP 790

Query: 69  --SPYHTPPP 76
             +P   PPP
Sbjct: 791 RGAPTPQPPP 800


>gnl|CDD|221182 pfam11715, Nup160, Nucleoporin Nup120/160.  Nup120 is conserved
           from fungi to plants to humans, and is homologous with
           the Nup160 of vertebrates. The nuclear core complex, or
           NPC, mediates macromolecular transport across the
           nuclear envelope. Deletion of the NUP120 gene causes
           clustering of NPCs at one side of the nuclear envelope,
           moderate nucleolar fragmentation and slower cell growth.
           The vertebrate NPC is estimated to contain between 30
           and 60 different proteins. most of which are not known.
           Two important ones in creating the nucleoporin basket
           are Nup98 and Nup153, and Nup120, in conjunction with
           Nup 133, interacts with these two and itself plays a
           role in mRNA export. Nup160, Nup133, Nup96, and Nup107
           are all targets of phosphorylation. The phosphorylation
           sites are clustered mainly at the N-terminal regions of
           these proteins, which are predicted to be natively
           disordered. The entire Nup107-160 subcomplex is stable
           throughout the cell cycle, thus it seems unlikely that
           phosphorylation affects interactions within the
           Nup107-160 subcomplex, but rather that it regulates the
           association of the subcomplex with the NPC and other
           proteins.
          Length = 511

 Score = 29.8 bits (67), Expect = 6.0
 Identities = 9/32 (28%), Positives = 16/32 (50%), Gaps = 1/32 (3%)

Query: 546 DVDCVQFHPNSNYVATGSSDRTIRLWDCVTGS 577
            V   + +    ++ T S D T+R+W+  TG 
Sbjct: 223 IVSSSEVN-GQTFLFTLSLDGTLRVWNLDTGQ 253


>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
          Length = 808

 Score = 30.0 bits (67), Expect = 6.2
 Identities = 17/83 (20%), Positives = 28/83 (33%), Gaps = 9/83 (10%)

Query: 9   DDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHT---------QAPSPQ 59
              P+      P   +         SP+ Q+ ++ +Q   L P+T         Q     
Sbjct: 22  ASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTELQPSPLPPNTTLDAPVSDSQGDESS 81

Query: 60  PPAQPQQVPSPYHTPPPSQTSKQ 82
              QPQ   S    PPP +  ++
Sbjct: 82  SEQQPQNPNSTEPAPPPKKRRRR 104


>gnl|CDD|240424 PTZ00455, PTZ00455, 3-ketoacyl-CoA thiolase; Provisional.
          Length = 438

 Score = 29.5 bits (66), Expect = 6.5
 Identities = 16/41 (39%), Positives = 24/41 (58%), Gaps = 8/41 (19%)

Query: 107 TEEQLKKEINVNDLGIDPNDSEIGDLLSAYESEGDPNLYED 147
           +EE L+K      +G+ PNDS + ++ S   + G  NLYED
Sbjct: 269 SEEGLQK------MGLSPNDSRLVEIKSLACASG--NLYED 301


>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1).  The
           homeobox gene Prox1 is expressed in a subpopulation of
           endothelial cells that, after budding from veins, gives
           rise to the mammalian lymphatic system. Prox1 has been
           found to be an early specific marker for the developing
           liver and pancreas in the mammalian foregut endoderm.
           This family contains an atypical homeobox domain.
          Length = 908

 Score = 29.7 bits (66), Expect = 6.6
 Identities = 29/158 (18%), Positives = 51/158 (32%), Gaps = 32/158 (20%)

Query: 12  PQESFHSV---PIIQQTLSP---QQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQ 65
           P++  H V    I  +T+S     +  V P       P+   SLS       P P   P 
Sbjct: 581 PKKKRHKVTDTRITPRTVSRILALRDAVGPAAGTHHQPLHPSSLSASM-GFHPPPFRHPF 639

Query: 66  QVPS------------------PYHTP----PPSQTSKQSGKAPDKNTLITMLK---LLR 100
            +P                   P+ +P    P      +   +PD     T L    L  
Sbjct: 640 PLPLTVAIPNPSLHQSEVFMGYPFQSPHLGAPSGSPPGKDRDSPDLPRPTTSLHPKLLSA 699

Query: 101 QYNLRGTEEQLKKEINVNDLGIDPNDSEIGDLLSAYES 138
            ++   + + L   +  ++ G   + S+I    + Y+ 
Sbjct: 700 HHHPGSSPDGLSLSLIKSECGDLQDMSDISSADAPYDG 737


>gnl|CDD|220093 pfam09030, Creb_binding, Creb binding.  The Creb binding domain
          assumes a structure comprising of three alpha-helices
          which pack in a bundle, exposing a hydrophobic groove
          between alpha-1 and alpha-3 within which complimentary
          domains found in the protein 'activator for thyroid
          hormone and retinoid receptors' (ACTR) can dock.
          Docking of these domains is required for the
          recruitment of RNA polymerase II and the basal
          transcription machinery.
          Length = 104

 Score = 27.7 bits (61), Expect = 6.8
 Identities = 9/47 (19%), Positives = 10/47 (21%), Gaps = 1/47 (2%)

Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
           Q      P        M  P+ Q         P P    QP     
Sbjct: 4  GQWPQGPLPQQQQMQPGMPRPVMQMVAQHAVAGPRPGLV-QPGISRG 49


>gnl|CDD|130324 TIGR01257, rim_protein, retinal-specific rim ABC transporter.  This
            model describes the photoreceptor protein (rim protein)
            in eukaryotes. It is the member of ABC transporter
            superfamily. Rim protein is a membrane glycoprotein which
            is localized in the photoreceptor outer segment discs.
            Mutation/s in its genetic loci is implicated in the
            recessive Stargardt's disease [Transport and binding
            proteins, Other].
          Length = 2272

 Score = 29.6 bits (66), Expect = 7.7
 Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 23   QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSP--QPPAQPQQVPSPYHT 73
            ++  + + P   P  +   +P   H+ SP   A  P  QPP +P+    P +T
Sbjct: 1294 RENANLRHPCSGPTEKAGQTPQASHTCSPGQPAAHPEGQPPPEPEDPGVPLNT 1346


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 29.4 bits (66), Expect = 7.8
 Identities = 22/112 (19%), Positives = 28/112 (25%), Gaps = 9/112 (8%)

Query: 3   QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA 62
              S   D P  S         + + QQ   SPQ      P    S        S   P 
Sbjct: 137 PGHSSFSDSPSRSASPSRKFSPSSTIQQ---SPQLTPSNKPASPSSSYQSPSYSSSLGPV 193

Query: 63  QPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKE 114
                 S      P           DK  + T  K L  + L   +E+    
Sbjct: 194 NSSGNRSNL-RSSPWALRSSG----DKKDITTDEKYLETF-LAEVDEEQHMI 239


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 29.2 bits (66), Expect = 8.3
 Identities = 17/91 (18%), Positives = 29/91 (31%), Gaps = 10/91 (10%)

Query: 5   PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
                     +    P +    +   P V PQ   +++P Q +  +  T AP+       
Sbjct: 177 APAAPVDTTPTNSQTPAVA---TAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDG 233

Query: 65  QQVPSPYHTPPPSQTSKQSGKAPDKNTLITM 95
                    P P+  +  S  A D N L+  
Sbjct: 234 A-------APLPTDQAGVSTPAADPNALVMN 257


>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3.  The
           C.elegans Notch pathway, involved in the control of
           growth, differentiation and patterning in animal
           development, relies on either of the receptors GLP-1 or
           LIN-12. Both these receptors promote signalling by the
           recruitment of LAG-3 to target promoters, where it then
           acts as a transcriptional activator. LAG-3 works as a
           ternary complex together with the DNA binding protein,
           LAG-1.
          Length = 476

 Score = 29.2 bits (64), Expect = 8.5
 Identities = 23/78 (29%), Positives = 32/78 (41%), Gaps = 12/78 (15%)

Query: 23  QQTLSPQQP--PVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPY---HTPPPS 77
           QQ  S QQ    ++ QH + M   Q H      Q PS         VP+P    H P P+
Sbjct: 163 QQYHSQQQQYIMMAAQHHHFMGMQQIHH-----QMPSTSSADAIHSVPTPAGSIHQPSPA 217

Query: 78  QTSKQSGKAPDKNTLITM 95
           +   Q+G    +N  + M
Sbjct: 218 EM--QNGCGMSRNATMDM 233


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 29.5 bits (67), Expect = 8.6
 Identities = 10/74 (13%), Positives = 20/74 (27%), Gaps = 1/74 (1%)

Query: 27  SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
           +      S       +            A +P  P +P    +     P +  +  +  A
Sbjct: 49  AAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA-AAAPAAPPAAAAAAA 107

Query: 87  PDKNTLITMLKLLR 100
           P    +   +  LR
Sbjct: 108 PAAAAVEDEVTPLR 121


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 29.2 bits (65), Expect = 8.7
 Identities = 8/56 (14%), Positives = 13/56 (23%)

Query: 28  PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
           P +       +    P    +      AP P  PA      +      P  +    
Sbjct: 210 PAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSG 265


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 29.3 bits (65), Expect = 9.3
 Identities = 19/79 (24%), Positives = 32/79 (40%), Gaps = 16/79 (20%)

Query: 18 SVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSP-----HTQAPSPQPPAQPQQVPSP-- 70
           +P +  + + Q+P  +P       P Q  S +P      T A S +P     Q P+P  
Sbjct: 27 PIPELHTSAATQKPDPAPA------PHQAASRAPDPAVAPTSAASRKP--DLAQAPTPAA 78

Query: 71 YHTPPPS-QTSKQSGKAPD 88
               P+    + + +APD
Sbjct: 79 SEKFDPAPAPHQAASRAPD 97


>gnl|CDD|227546 COG5221, DOP1, Dopey and related predicted leucine zipper
            transcription factors [Transcription].
          Length = 1618

 Score = 29.1 bits (65), Expect = 9.8
 Identities = 14/55 (25%), Positives = 26/55 (47%)

Query: 454  RTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLLLWRCLVVYKGHGHPVWDI 508
              +      ++K  F   R L+L  S D+   LW +LL+   +V + +  P +D+
Sbjct: 1437 TLMNSPSSKLKKEVFLLSRMLILRISHDHLGNLWPILLYDLGIVVESYESPDFDV 1491


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.133    0.397 

Gapped
Lambda     K      H
   0.267   0.0792    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,978,354
Number of extensions: 3512614
Number of successful extensions: 6012
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5150
Number of HSP's successfully gapped: 331
Length of query: 711
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 607
Effective length of database: 6,324,786
Effective search space: 3839145102
Effective search space used: 3839145102
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (27.9 bits)