RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy18075
         (999 letters)



>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
           inhibitor (RI)-like subfamily. LRRs are 20-29 residue
           sequence motifs present in many proteins that
           participate in protein-protein interactions and have
           different functions and cellular locations. LRRs
           correspond to structural units consisting of a beta
           strand (LxxLxLxxN/CxL conserved pattern) and an alpha
           helix. This alignment contains 12 strands corresponding
           to 11 full repeats, consistent with the extent observed
           in the subfamily acting as Ran GTPase Activating
           Proteins (RanGAP1).
          Length = 319

 Score =  212 bits (543), Expect = 7e-62
 Identities = 131/331 (39%), Positives = 185/331 (55%), Gaps = 22/331 (6%)

Query: 386 KLDNKEDAKVIVDAINEVKVLVSLNLEGNTLGVNAAKAIADALSKHEHFKRALWKDMFTG 445
           +L   E A  ++  +  ++VL    LEGNTLG  AAKA+A AL      K        TG
Sbjct: 8   ELLKTERATELLPKLLCLQVL---RLEGNTLGEEAAKALASALRPQPSLKELCLSLNETG 64

Query: 446 RMKTEIPDALRYLGNGLQQAGARLVELDLSDNAFGPIGVEGLADLLRSSCCFALEELKLN 505
           R    IP  L+ L  GL   G  L ELDLSDNA GP G   L  LLRSS   +L+ELKLN
Sbjct: 65  R----IPRGLQSLLQGLT-KGCGLQELDLSDNALGPDGCGVLESLLRSS---SLQELKLN 116

Query: 506 NNGLGITGCKLLSKALHDCYESSKKEGSPLALKVFIAGRNRLENEGAKMLAAVFKKLKTL 565
           NNGLG  G +LL+K L D          P AL+  + GRNRLE    + LA   +  + L
Sbjct: 117 NNGLGDRGLRLLAKGLKDL---------PPALEKLVLGRNRLEGASCEALAKALRANRDL 167

Query: 566 ERVEMPQNGIYHVGITALSDAFEENKNLRHLNLNDNTITYKGAIPLGQALSKLPSLAILN 625
           + + +  NGI   GI AL++  + N NL  L+LN+N +T +GA  L + L+ L SL +LN
Sbjct: 168 KELNLANNGIGDAGIRALAEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLN 227

Query: 626 LGDCLLKSAGASSIAK-YLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSE 684
           LGD  L  AGA+++A   L+ N +L  ++L+CN+I+  G  DL + +  K  L ++++  
Sbjct: 228 LGDNNLTDAGAAALASALLSPNISLLTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRG 287

Query: 685 NQFGEEGVEEMEKLMKSFG-MAAALVLEDDE 714
           N+FGEEG + + + +   G    +L ++DD 
Sbjct: 288 NKFGEEGAQLLAESLLEPGNELESLWVKDDS 318


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score =  172 bits (437), Expect = 7e-47
 Identities = 119/367 (32%), Positives = 178/367 (48%), Gaps = 17/367 (4%)

Query: 377 KLSFAGQGLKLDNKEDAKVIVDAINEVKVLVSLNLEGNTLGVNAAKAIADALSKHEHFKR 436
           K S  G+ LKL+ KED K +V+ +  +  LV ++L GNT+G  A + + + ++   + + 
Sbjct: 3   KFSIGGKKLKLETKEDVKGVVEELEMMDELVEVDLSGNTIGTEAMEELCNVIANVRNLRV 62

Query: 437 ALWKDMFTGRMKTEIPDALRYLGNGLQQAGARLVELDLSDNAFGPIGVEGLADLLRSSCC 496
             + D FTGR K E+   L  L   L +   RL ++DLSDNAFG    E L DL+ SS  
Sbjct: 63  VNFSDAFTGRDKDELYSNLVMLLKALLKC-PRLQKVDLSDNAFGSEFPEELGDLISSST- 120

Query: 497 FALEELKLNNNGLGITGCKLLSKALHDCYESSKKEGSPLALKVFIAGRNRLENEGAKMLA 556
             L  LKLNNNGLG      + KAL     + K    P  L+V I GRNRLEN   ++ A
Sbjct: 121 -DLVHLKLNNNGLGPIAGGRIGKALFHLAYNKKAADKP-KLEVVICGRNRLENGSKELSA 178

Query: 557 AVFKKLKTLERVEMPQNGIYHVGITALS-DAFEENKNLRHLNLNDNTITYKGAIPLGQAL 615
           A+ +  + L+ V++ QNGI   G+T L+      + +L  L+L DNT T +G+  L  AL
Sbjct: 179 ALLESHENLKEVKIQQNGIRPEGVTMLAFLGLFYSHSLEVLDLQDNTFTLEGSRYLADAL 238

Query: 616 SKLPSLAILNLGDCLLKSAGASSIAKYLTD--NTTLEDVNLTCNEISVQGGLDL-VKAMK 672
            +   L  L L DCLL + G  S+ +   +     L  +    NE      LD+ +   +
Sbjct: 239 CEWNLLRELRLNDCLLSNEGVKSVLRRFNEKFVPNLMPLPGDYNERRGGIILDISLNEFE 298

Query: 673 --NKTKLKQINVSENQFGEEGV--EEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEE 728
                 L  +  + N+  E     +  E + +        V+E  EG+   EE  +E  E
Sbjct: 299 QDAVPLLVDLERNGNRIKELADFGDYYEDIFEVVE-----VVEKQEGDVVTEESTDEESE 353

Query: 729 ENDSDAE 735
           +     E
Sbjct: 354 DEVEIDE 360


>gnl|CDD|203777 pfam07834, RanGAP1_C, RanGAP1 C-terminal domain.  Ran-GTPase
           activating protein 1 (RanGAP1) is a GTPase activator for
           the nuclear Ras-related regulatory protein Ran,
           converting it to the putatively inactive GDP-bound
           state. Its C-terminal domain is required for RanGAP1
           localisation at the vertebrate nuclear pore complex, and
           is sumoylated by the small ubiquitin-related modifier
           protein (SUMO-1). This domain is composed almost
           entirely of helical substructures that are organised
           into an alpha-alpha superhelix fold, with the exception
           of the peptide containing the lysine residue required
           for SUMO-1 conjugation.
          Length = 169

 Score = 48.8 bits (116), Expect = 2e-06
 Identities = 38/138 (27%), Positives = 70/138 (50%), Gaps = 10/138 (7%)

Query: 781 VTDFLAAPSVSLFNSLGPNCAQLFLDEINSEPEDRYLESMLLAIMKVSSLVPSTTTLQSP 840
           ++ FL  PS      LGP  +QL   +++    ++ +E    A +KVSS+      ++  
Sbjct: 17  LSTFLVFPSPEKLIRLGPKRSQLIAQQVDVTDAEKVVE----AFLKVSSVYKDEPEVKQ- 71

Query: 841 SSSLFECTKVLYRELFNWASNSDKLSTVTNALLVHLGLIKCEDKSFKINYNLNGCMTAVR 900
             ++FE T  L R+ F+  ++  + ++   +LLV++GL+K EDK  KI+  L G +  + 
Sbjct: 72  --AVFETTDALMRKAFS--NSPFQSNSFITSLLVNMGLLKSEDKVKKIS-VLPGPLLTLN 126

Query: 901 LLSEAKLLPSLTESQLLA 918
            + + +  P      LLA
Sbjct: 127 HMVQQEYFPKDLAGVLLA 144



 Score = 38.8 bits (90), Expect = 0.005
 Identities = 29/99 (29%), Positives = 53/99 (53%), Gaps = 4/99 (4%)

Query: 174 ESMLLAIMKVSSLVPSTTTLPSSSLFECTKVLYRELFNWASNSDKLSTVTNALLVHLGLI 233
           E ++ A +KVSS+      +   ++FE T  L R+ F+  ++  + ++   +LLV++GL+
Sbjct: 50  EKVVEAFLKVSSVYKDEPEV-KQAVFETTDALMRKAFS--NSPFQSNSFITSLLVNMGLL 106

Query: 234 KCEDKSFKINYNLNGCMTAVRLLSEAKLLPSLTESQLLA 272
           K EDK  KI+  L G +  +  + + +  P      LLA
Sbjct: 107 KSEDKVKKIS-VLPGPLLTLNHMVQQEYFPKDLAGVLLA 144


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 40.1 bits (94), Expect = 0.003
 Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 3/84 (3%)

Query: 665 LDLVKAMKNKTKLKQINVSENQFGEEGVEEMEKL-MKSFGMAAALVLEDDEGECSDEEQD 723
           + + K    K  L +       F EE  +  EKL M    +      + DE +  DEE++
Sbjct: 120 MGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179

Query: 724 EESEEENDSDAEGDNSNLSHNDSN 747
           EE EEE++   + D+ +   +D N
Sbjct: 180 EEEEEEDEDFDDDDDDD--DDDYN 201



 Score = 34.0 bits (78), Expect = 0.25
 Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 2/38 (5%)

Query: 7   LEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSN 44
            + DE +  DEE++EE EEE++   + D+ +   +D N
Sbjct: 166 EDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD--DDDYN 201



 Score = 29.7 bits (67), Expect = 5.8
 Identities = 9/36 (25%), Positives = 23/36 (63%)

Query: 2   AAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSN 37
             A  +++++ +  +EE++EE E+E+  D + D+ +
Sbjct: 163 LEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198


>gnl|CDD|197686 smart00368, LRR_RI, Leucine rich repeat, ribonuclease inhibitor
           type. 
          Length = 28

 Score = 34.3 bits (80), Expect = 0.008
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 499 LEELKLNNNGLGITGCKLLSKALHD 523
           L EL L+NN LG  G + L++AL D
Sbjct: 4   LRELDLSNNKLGDEGARALAEALKD 28



 Score = 31.2 bits (72), Expect = 0.10
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 468 RLVELDLSDNAFGPIGVEGLADLLRS 493
            L ELDLS+N  G  G   LA+ L+ 
Sbjct: 3   SLRELDLSNNKLGDEGARALAEALKD 28



 Score = 30.5 bits (70), Expect = 0.20
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 590 NKNLRHLNLNDNTITYKGAIPLGQALS 616
           N +LR L+L++N +  +GA  L +AL 
Sbjct: 1   NPSLRELDLSNNKLGDEGARALAEALK 27


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 37.6 bits (87), Expect = 0.022
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 9   DDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNA 49
           DD+   S  + DEE + ++D + + D++N +H D +  + A
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314



 Score = 37.6 bits (87), Expect = 0.022
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 712 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNA 752
           DD+   S  + DEE + ++D + + D++N +H D +  + A
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314



 Score = 30.3 bits (68), Expect = 5.4
 Identities = 9/36 (25%), Positives = 21/36 (58%)

Query: 12  GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSH 47
           G  SD+++   S + ++ +   D+ N  ++D+N +H
Sbjct: 270 GYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNH 305



 Score = 30.3 bits (68), Expect = 5.4
 Identities = 9/36 (25%), Positives = 21/36 (58%)

Query: 715 GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSH 750
           G  SD+++   S + ++ +   D+ N  ++D+N +H
Sbjct: 270 GYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNH 305


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 36.7 bits (85), Expect = 0.057
 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 12  GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSH 68
           GE  DE+ D  S+E++D     D+S+ + +DSN S   +N+  +   +  N +  + 
Sbjct: 277 GEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSN-SDVDTNEEDDRGEKESNGAKSNE 332



 Score = 36.7 bits (85), Expect = 0.057
 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 715 GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSH 771
           GE  DE+ D  S+E++D     D+S+ + +DSN S   +N+  +   +  N +  + 
Sbjct: 277 GEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSN-SDVDTNEEDDRGEKESNGAKSNE 332



 Score = 34.0 bits (78), Expect = 0.32
 Identities = 15/58 (25%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNL-SHNDSNHSHNASNQSHNNSNQSHNTS 64
           ED +    +++ D +S + +DS +   NS++ ++ + +     SN +   SN+ H TS
Sbjct: 282 EDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGA--KSNELHLTS 337



 Score = 34.0 bits (78), Expect = 0.32
 Identities = 15/58 (25%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNL-SHNDSNHSHNASNQSHNNSNQSHNTS 767
           ED +    +++ D +S + +DS +   NS++ ++ + +     SN +   SN+ H TS
Sbjct: 282 EDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGA--KSNELHLTS 337


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The
          CAF-1 or chromatin assembly factor-1 consists of three
          subunits, and this is the first, or A. The A domain is
          uniquely required for the progression of S phase in
          mouse cells, independent of its ability to promote
          histone deposition but dependent on its ability to
          interact with HP1 - heterochromatin protein 1-rich
          heterochromatin domains next to centromeres that are
          crucial for chromosome segregation during mitosis. This
          HP1-CAF-1 interaction module functions as a built-in
          replication control for heterochromatin, which, like a
          control barrier, has an impact on S-phase progression
          in addition to DNA-based checkpoints.
          Length = 76

 Score = 33.3 bits (77), Expect = 0.065
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 8  EDDEGECSDEEQDEESEEENDSDAEG 33
          E++EGE  + E +E+ EE++D D +G
Sbjct: 50 EEEEGEDLESEDEEDEEEDDDDDMDG 75



 Score = 33.3 bits (77), Expect = 0.065
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEG 736
           E++EGE  + E +E+ EE++D D +G
Sbjct: 50  EEEEGEDLESEDEEDEEEDDDDDMDG 75


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 36.4 bits (85), Expect = 0.088
 Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 12/90 (13%)

Query: 6  VLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSN 65
           L + E ECSD E + E   +  SD     S+L  +D+      ++ +  +  ++     
Sbjct: 17 FLVEAEAECSDGEDELEDLFDTGSDI----SDLI-DDAEVVQGGNSLALFHQQEAEEDEQ 71

Query: 66 QSHSVSQLKQHSVTDFLAAPSVSLFNSLGP 95
          Q   +  LK+     +L++P  S    L P
Sbjct: 72 Q---IQALKRK----YLSSPEKSPVADLSP 94



 Score = 36.4 bits (85), Expect = 0.088
 Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 12/90 (13%)

Query: 709 VLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSN 768
            L + E ECSD E + E   +  SD     S+L  +D+      ++ +  +  ++     
Sbjct: 17  FLVEAEAECSDGEDELEDLFDTGSDI----SDLI-DDAEVVQGGNSLALFHQQEAEEDEQ 71

Query: 769 QSHSVSQLKQHSVTDFLAAPSVSLFNSLGP 798
           Q   +  LK+     +L++P  S    L P
Sbjct: 72  Q---IQALKRK----YLSSPEKSPVADLSP 94


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 35.7 bits (82), Expect = 0.13
 Identities = 21/89 (23%), Positives = 41/89 (46%), Gaps = 21/89 (23%)

Query: 671 MKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEEN 730
           MK +  ++++       G+E  EE                E+D+ E  D++ D+E E+++
Sbjct: 32  MKKENAIRKL-------GKEAEEEAM-------------EEEDDDEEDDDDDDDEDEDDD 71

Query: 731 DSDAEGDNSNLSHNDSNHSHNASNQSHNN 759
           D D + D+ +   +DS   H+ S+    N
Sbjct: 72  DDDDDEDDEDEDDDDST-LHDDSSADDGN 99



 Score = 33.4 bits (76), Expect = 0.58
 Identities = 15/50 (30%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 7  LEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNN 56
           E+D+ E  D++ D+E E+++D D + D+ +   +DS   H+ S+    N
Sbjct: 51 EEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST-LHDDSSADDGN 99



 Score = 30.3 bits (68), Expect = 5.4
 Identities = 9/48 (18%), Positives = 29/48 (60%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHN 55
           ++++ +  D+E +++ ++++D D E ++ + S    + S +  N++ N
Sbjct: 56  DEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDN 103


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 33.5 bits (77), Expect = 0.15
 Identities = 18/69 (26%), Positives = 37/69 (53%), Gaps = 3/69 (4%)

Query: 672 KNKTKLKQINVSENQF-GEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEEN 730
           + K K ++  + E +   EE  +E+EK +K    A  L+ E+++ E  ++ + E++E+  
Sbjct: 41  EAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDE--EDAETEDTEDVE 98

Query: 731 DSDAEGDNS 739
           D + EG   
Sbjct: 99  DDEWEGFPE 107



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 10/57 (17%), Positives = 20/57 (35%), Gaps = 9/57 (15%)

Query: 689 EEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDE---------ESEEENDSDAEG 736
           E+ ++E ++ +K          + +  +  D E DE            EE   D + 
Sbjct: 66  EKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDK 122


>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat. 
          Length = 60

 Score = 31.7 bits (73), Expect = 0.17
 Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 8/67 (11%)

Query: 565 LERVEMPQNGIYHVGITALSDAFEENKNLRHLNLNDNTITYKGAIPLGQALSKLPSLAIL 624
           L+ +++  N +  +       AF+   NL+ L+L+ N +T   +I    A S LPSL  L
Sbjct: 2   LKSLDLSNNRLTVIP----DGAFKGLPNLKVLDLSGNNLT---SISPE-AFSGLPSLRSL 53

Query: 625 NLGDCLL 631
           +L    L
Sbjct: 54  DLSGNNL 60



 Score = 28.3 bits (64), Expect = 2.3
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 8/68 (11%)

Query: 592 NLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNTTLED 651
           NL+ L+L++N +T         A   LP+L +L+L    L S    +     +   +L  
Sbjct: 1   NLKSLDLSNNRLTVIP----DGAFKGLPNLKVLDLSGNNLTSISPEA----FSGLPSLRS 52

Query: 652 VNLTCNEI 659
           ++L+ N +
Sbjct: 53  LDLSGNNL 60


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 35.1 bits (81), Expect = 0.23
 Identities = 20/75 (26%), Positives = 35/75 (46%), Gaps = 6/75 (8%)

Query: 693 EEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNA 752
            E + ++ +F    AL +ED   E  DEE+++E EEE + + EG +    H+D       
Sbjct: 367 TEQDYILDAFS---ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK--EHSDDE-EFEE 420

Query: 753 SNQSHNNSNQSHNTS 767
            +      +   N+S
Sbjct: 421 DDVESKYEDSDGNSS 435



 Score = 35.1 bits (81), Expect = 0.23
 Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 3/61 (4%)

Query: 4   ALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNT 63
           AL +ED   E  DEE+++E EEE + + EG +    H+D        +      +   N+
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK--EHSDDE-EFEEDDVESKYEDSDGNS 434

Query: 64  S 64
           S
Sbjct: 435 S 435


>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies).  Leucine rich
           repeats are short sequence motifs present in a number of
           proteins with diverse functions and cellular locations.
           These repeats are usually involved in protein-protein
           interactions. Each Leucine Rich Repeat is composed of a
           beta-alpha unit. These units form elongated non-globular
           structures. Leucine Rich Repeats are often flanked by
           cysteine rich domains.
          Length = 43

 Score = 30.5 bits (70), Expect = 0.30
 Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 6/43 (13%)

Query: 591 KNLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKS 633
            NL  L+L++N IT    +P    LS LP+L  L+L    +  
Sbjct: 1   TNLETLDLSNNQIT---DLP---PLSNLPNLETLDLSGNKITD 37


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 34.2 bits (79), Expect = 0.44
 Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 9/63 (14%)

Query: 684 ENQFGEEGV------EEMEKLMKSFGM---AAALVLEDDEGECSDEEQDEESEEENDSDA 734
           +N FG + V      +  E+L            L  +DD+ E SDEE +E S+ E+D D 
Sbjct: 90  DNVFGSDQVVIFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDD 149

Query: 735 EGD 737
           + D
Sbjct: 150 DDD 152



 Score = 30.3 bits (69), Expect = 5.2
 Identities = 14/31 (45%), Positives = 20/31 (64%)

Query: 4   ALVLEDDEGECSDEEQDEESEEENDSDAEGD 34
            L  +DD+ E SDEE +E S+ E+D D + D
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDD 152


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 31.9 bits (73), Expect = 0.78
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 1   MAAALVLEDDEGECSDEEQDEESEEENDSDAE 32
           +A+     DD+ E  +EE DEE ++E++S+ E
Sbjct: 107 VASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138



 Score = 31.9 bits (73), Expect = 0.78
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 704 MAAALVLEDDEGECSDEEQDEESEEENDSDAE 735
           +A+     DD+ E  +EE DEE ++E++S+ E
Sbjct: 107 VASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 32.7 bits (75), Expect = 0.95
 Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 5/59 (8%)

Query: 685 NQFGEE----GVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 739
             +GEE    G+  +E L K          E ++G  SD++ DEE E E + D + D+ 
Sbjct: 58  KGYGEEEVVDGIPGLELLEKWK-EEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDE 115



 Score = 30.0 bits (68), Expect = 6.2
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNS 36
           E ++G  SD++ DEE E E + D + D+ 
Sbjct: 87  EAEQGLESDDDDDEEEEWEVEEDEDSDDE 115


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 32.7 bits (74), Expect = 1.0
 Identities = 22/106 (20%), Positives = 35/106 (33%), Gaps = 7/106 (6%)

Query: 673 NKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDS 732
              + ++    E++   E VEE E + KS           D  EC  EE++ E EEE   
Sbjct: 109 ETVEEEEK--EESREEREEVEETEGVTKSE----QKNDWRDAEECQKEEKEPEPEEEEKP 162

Query: 733 DAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLKQ 778
                  N +     H    +  + +                +LKQ
Sbjct: 163 KRGSLEEN-NGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQ 207


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 32.3 bits (74), Expect = 1.2
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDN 35
           EDD  E  DE++++E EEE + D EGD 
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 32.3 bits (74), Expect = 1.2
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDN 738
           EDD  E  DE++++E EEE + D EGD 
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 29.6 bits (67), Expect = 7.8
 Identities = 11/69 (15%), Positives = 29/69 (42%), Gaps = 10/69 (14%)

Query: 689 EEGVEEMEKLMKSFG-----MAAALVLEDDEGECSD-----EEQDEESEEENDSDAEGDN 738
              +E +++++K  G          V+  DE +  +     EE++EE +     D + ++
Sbjct: 246 GAALEAIKEVIKKKGGDFKVKGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEED 305

Query: 739 SNLSHNDSN 747
            +    + +
Sbjct: 306 EDEEEEEDD 314


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 32.4 bits (73), Expect = 1.6
 Identities = 22/91 (24%), Positives = 41/91 (45%), Gaps = 1/91 (1%)

Query: 685 NQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHN 744
           N  GEE   E+     + G             CS + + ++  ++ +   EG+  NL   
Sbjct: 3   NHIGEEASAEVLSGRNNTGDDGVHGGSCHPSHCSPQPRGDKFNDKFEFAGEGEKPNLL-V 61

Query: 745 DSNHSHNASNQSHNNSNQSHNTSNQSHSVSQ 775
           D+N +H  +  S NNS ++ N +  ++S S+
Sbjct: 62  DANENHINTRSSDNNSCENANMNRDANSSSE 92


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 31.6 bits (71), Expect = 1.7
 Identities = 13/53 (24%), Positives = 26/53 (49%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQS 60
           ++DE E  +EE++E   EEN+    G ++N +  D  +  +  +       +S
Sbjct: 136 DEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEES 188



 Score = 31.6 bits (71), Expect = 1.7
 Identities = 13/53 (24%), Positives = 26/53 (49%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQS 763
           ++DE E  +EE++E   EEN+    G ++N +  D  +  +  +       +S
Sbjct: 136 DEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEES 188



 Score = 31.6 bits (71), Expect = 2.1
 Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 4/90 (4%)

Query: 11  EGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSV 70
           + + SDE+++EE EEE + +AE + +    N ++ +    +  + +S   +    +  SV
Sbjct: 131 KEDESDEDEEEEEEEEEE-EAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESV 189

Query: 71  SQLKQHSVTDFLAAPSV-SLFNSLGPNCAQ 99
           ++ +    T  +A P+  S      P    
Sbjct: 190 TEAEAEGTT--VAGPTTTSPNGGFQPTTPP 217



 Score = 31.6 bits (71), Expect = 2.1
 Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 4/90 (4%)

Query: 714 EGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSV 773
           + + SDE+++EE EEE + +AE + +    N ++ +    +  + +S   +    +  SV
Sbjct: 131 KEDESDEDEEEEEEEEEE-EAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESV 189

Query: 774 SQLKQHSVTDFLAAPSV-SLFNSLGPNCAQ 802
           ++ +    T  +A P+  S      P    
Sbjct: 190 TEAEAEGTT--VAGPTTTSPNGGFQPTTPP 217


>gnl|CDD|225191 COG2309, AmpS, Leucyl aminopeptidase (aminopeptidase T) [Amino acid
           transport and metabolism].
          Length = 385

 Score = 31.9 bits (73), Expect = 1.9
 Identities = 10/32 (31%), Positives = 15/32 (46%)

Query: 12  GECSDEEQDEESEEENDSDAEGDNSNLSHNDS 43
           G       +  +E +N+S   GDN +  H DS
Sbjct: 326 GSAYPFNVEGGAEMQNESAIHGDNVSDVHVDS 357



 Score = 31.9 bits (73), Expect = 1.9
 Identities = 10/32 (31%), Positives = 15/32 (46%)

Query: 715 GECSDEEQDEESEEENDSDAEGDNSNLSHNDS 746
           G       +  +E +N+S   GDN +  H DS
Sbjct: 326 GSAYPFNVEGGAEMQNESAIHGDNVSDVHVDS 357


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 31.6 bits (71), Expect = 1.9
 Identities = 16/53 (30%), Positives = 24/53 (45%), Gaps = 7/53 (13%)

Query: 688 GEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSN 740
            E+   + + L K  G        D   E S+ E++E SE E+D D + DN  
Sbjct: 221 SEKEKTKKKDLEKWLG-------SDQSMETSESEEEESSESESDEDEDEDNKG 266



 Score = 30.4 bits (68), Expect = 4.8
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSN 37
            D   E S+ E++E SE E+D D + DN  
Sbjct: 237 SDQSMETSESEEEESSESESDEDEDEDNKG 266


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.
          This family represents the C-terminus (approximately
          300 residues) of proteins that are involved as binding
          partners for Prp19 as part of the nuclear pore complex.
          The family in Drosophila is necessary for pre-mRNA
          splicing, and the human protein has been found in
          purifications of the spliceosome. In the past this
          family was thought, erroneously, to be associated with
          microfibrillin.
          Length = 277

 Score = 31.4 bits (71), Expect = 2.1
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 6  VLEDDEGECSDEEQDEESEEENDSDAEGDN 35
          VLE +E + S EE++EESEEE ++D+E D 
Sbjct: 4  VLELEEEDESGEEEEEESEEEEETDSEDDM 33



 Score = 31.4 bits (71), Expect = 2.1
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 709 VLEDDEGECSDEEQDEESEEENDSDAEGDN 738
           VLE +E + S EE++EESEEE ++D+E D 
Sbjct: 4   VLELEEEDESGEEEEEESEEEEETDSEDDM 33


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.9 bits (72), Expect = 2.1
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAE 32
           E++E E  +EE++EE EEE + + E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 31.9 bits (72), Expect = 2.1
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAE 735
           E++E E  +EE++EE EEE + + E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 30.3 bits (68), Expect = 6.4
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 39
           E++E E  +EE++EE EEE + + E +   LS
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLS 895



 Score = 30.3 bits (68), Expect = 6.4
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 742
           E++E E  +EE++EE EEE + + E +   LS
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLS 895


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 31.6 bits (71), Expect = 2.2
 Identities = 18/71 (25%), Positives = 37/71 (52%)

Query: 672 KNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEEND 731
           +N  +L+ +N+      E G ++ E+  +       +   +   E  D+++D E ++E +
Sbjct: 85  ENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKE 144

Query: 732 SDAEGDNSNLS 742
           SDAEGD + L+
Sbjct: 145 SDAEGDENELA 155


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 31.8 bits (73), Expect = 2.2
 Identities = 20/92 (21%), Positives = 41/92 (44%), Gaps = 9/92 (9%)

Query: 647 TTLEDV-NLTCNEISVQGGLD--LVKAMKNKTK--LKQINVSENQFGEEGVEEMEKLMKS 701
           ++LE++  +   E+    G D   V+ ++ + K  L+   +++ +   + +  +E L + 
Sbjct: 378 SSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRE 437

Query: 702 FGMAAA----LVLEDDEGECSDEEQDEESEEE 729
                A      LED   +  DE  D E +EE
Sbjct: 438 LAFKLAEKGIKTLEDLAEQAVDELIDIEGDEE 469


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
          N-terminal.  This is the very acidic N-terminal region
          of the early transcription elongation factor Spt5. The
          Spt5-Spt4 complex regulates early transcription
          elongation by RNA polymerase II and has an imputed role
          in pre-mRNA processing via its physical association
          with mRNA capping enzymes. The actual function of this
          N-terminal domain is not known although it is
          dispensable for binding to Spt4.
          Length = 92

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 9  DDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 42
          D E E  DEE++EE EE++  D   ++  +   +
Sbjct: 3  DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAE 36



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 712 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 745
           D E E  DEE++EE EE++  D   ++  +   +
Sbjct: 3   DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAE 36


>gnl|CDD|218773 pfam05841, Apc15p, Apc15p protein.  The anaphase-promoting complex
           (APC) is a conserved multi-subunit ubiquitin ligase
           required for the degradation of key cell cycle
           regulators Members of this family are components of the
           anaphase-promoting complex homologous to Apc15p.
          Length = 114

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 16/80 (20%), Positives = 29/80 (36%), Gaps = 9/80 (11%)

Query: 42  DSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLKQHSVTDFLAAPSVSLFNSLGPNCAQLF 101
           D   S N      ++SN  ++T  +  S   L  +  +      + S  ++   + A+L 
Sbjct: 10  DEQSSRNYRKGRQSHSN--NSTDRELLSAPPLAAYGASSEQPYYTRSSQSASRGSLARLT 67

Query: 102 LDEINSEPEDRYLESMLLAI 121
           L+E         LE     I
Sbjct: 68  LEE-------EALERRKQNI 80



 Score = 29.7 bits (67), Expect = 2.5
 Identities = 16/80 (20%), Positives = 29/80 (36%), Gaps = 9/80 (11%)

Query: 745 DSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLKQHSVTDFLAAPSVSLFNSLGPNCAQLF 804
           D   S N      ++SN  ++T  +  S   L  +  +      + S  ++   + A+L 
Sbjct: 10  DEQSSRNYRKGRQSHSN--NSTDRELLSAPPLAAYGASSEQPYYTRSSQSASRGSLARLT 67

Query: 805 LDEINSEPEDRYLESMLLAI 824
           L+E         LE     I
Sbjct: 68  LEE-------EALERRKQNI 80


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 31.3 bits (71), Expect = 3.0
 Identities = 11/72 (15%), Positives = 28/72 (38%), Gaps = 1/72 (1%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 67
            +D  + SD     +++ ++    + DN     + +N   + SN+  N+   +    + S
Sbjct: 146 TNDSNKNSDSSIKNDTDTQSSKQDKADNQKAP-SSNNTKPSTSNKQPNSPKPTQPNQSNS 204

Query: 68  HSVSQLKQHSVT 79
              S    +  +
Sbjct: 205 QPASDDTANQKS 216



 Score = 31.3 bits (71), Expect = 3.0
 Identities = 11/72 (15%), Positives = 28/72 (38%), Gaps = 1/72 (1%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 770
            +D  + SD     +++ ++    + DN     + +N   + SN+  N+   +    + S
Sbjct: 146 TNDSNKNSDSSIKNDTDTQSSKQDKADNQKAP-SSNNTKPSTSNKQPNSPKPTQPNQSNS 204

Query: 771 HSVSQLKQHSVT 782
              S    +  +
Sbjct: 205 QPASDDTANQKS 216



 Score = 30.5 bits (69), Expect = 5.0
 Identities = 20/146 (13%), Positives = 56/146 (38%), Gaps = 18/146 (12%)

Query: 698 LMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSH 757
              +   A A   +D + + + +    +S++ ND +   D S+   + ++ ++N SNQ +
Sbjct: 18  PTLTSPTAYA---DDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKAD-NNNTSNQDN 73

Query: 758 NNSNQSHNTSNQSHSVSQLKQHSVTDFLAAPSVSLFNSLGPNCAQLFLDEINSEPEDRYL 817
           N+   S    + +   + +      +    P  ++   L  N             ++  L
Sbjct: 74  NDKKFS-TIDSSTSDSNNIIDFIYKNL---PQTNINQLLTKNKYD----------DNYSL 119

Query: 818 ESMLLAIMKVSSLVPSTTTLQSPSSS 843
            +++  +  ++S +      ++   S
Sbjct: 120 TTLIQNLFNLNSDISDYEQPRNSEKS 145


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase
          19 kDa subunit.  This family contains several
          DNA-directed RNA polymerase 19 kDa polypeptides. The
          Poxvirus DNA-directed RNA polymerase (EC: 2.7.7.6)
          catalyzes DNA-template-directed extension of the 3'-end
          of an RNA strand by one nucleotide at a time.
          Length = 167

 Score = 30.1 bits (68), Expect = 3.1
 Identities = 17/68 (25%), Positives = 33/68 (48%), Gaps = 4/68 (5%)

Query: 8  EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 67
          E D+ +  + E++EE EE+ +S    D S+L  ++        + S N  + + N    S
Sbjct: 11 ESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNY----KIESASENIEDATSNPKQVS 66

Query: 68 HSVSQLKQ 75
            +S +K+
Sbjct: 67 EKISAIKR 74



 Score = 30.1 bits (68), Expect = 3.1
 Identities = 17/68 (25%), Positives = 33/68 (48%), Gaps = 4/68 (5%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 770
           E D+ +  + E++EE EE+ +S    D S+L  ++        + S N  + + N    S
Sbjct: 11  ESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNY----KIESASENIEDATSNPKQVS 66

Query: 771 HSVSQLKQ 778
             +S +K+
Sbjct: 67  EKISAIKR 74


>gnl|CDD|205694 pfam13516, LRR_6, Leucine Rich repeat. 
          Length = 24

 Score = 27.2 bits (61), Expect = 3.3
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 590 NKNLRHLNLNDNTITYKGAIPLGQ 613
             NL  L+L+ N I  +GA  L +
Sbjct: 1   LTNLTSLDLSGNQIGDEGASALAK 24


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.0 bits (70), Expect = 3.5
 Identities = 15/46 (32%), Positives = 25/46 (54%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 53
           E++E E  +EE+++ESEEE   D E +    + N S      S++ 
Sbjct: 447 EEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492



 Score = 31.0 bits (70), Expect = 3.5
 Identities = 15/46 (32%), Positives = 25/46 (54%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 756
           E++E E  +EE+++ESEEE   D E +    + N S      S++ 
Sbjct: 447 EEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492



 Score = 29.9 bits (67), Expect = 9.6
 Identities = 13/52 (25%), Positives = 22/52 (42%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQ 59
            +DE E  + E D  SEEE +  +EGD       +     N+     +  ++
Sbjct: 467 GEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSE 518



 Score = 29.9 bits (67), Expect = 9.6
 Identities = 13/52 (25%), Positives = 22/52 (42%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQ 762
            +DE E  + E D  SEEE +  +EGD       +     N+     +  ++
Sbjct: 467 GEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSE 518


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 31.0 bits (70), Expect = 3.5
 Identities = 18/89 (20%), Positives = 30/89 (33%), Gaps = 10/89 (11%)

Query: 700 KSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEG-----DNSNLSHNDSNHSHNASN 754
            SF      + +DD+ E     +D+E +  N +  E          L   D +    A+N
Sbjct: 77  ASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN 136

Query: 755 QSHNNSNQ-----SHNTSNQSHSVSQLKQ 778
              N+ +Q     S        SV+    
Sbjct: 137 DQTNDFDQDDSSNSQTDQGLKQSVNLSSA 165



 Score = 31.0 bits (70), Expect = 4.5
 Identities = 16/78 (20%), Positives = 27/78 (34%), Gaps = 10/78 (12%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEG-----DNSNLSHNDSNHSHNASNQSHNNSNQ--- 59
           +DD+ E     +D+E +  N +  E          L   D +    A+N   N+ +Q   
Sbjct: 88  DDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDS 147

Query: 60  --SHNTSNQSHSVSQLKQ 75
             S        SV+    
Sbjct: 148 SNSQTDQGLKQSVNLSSA 165


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 31.1 bits (70), Expect = 3.7
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 8    EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHS 46
            E+D    SD ++ E  E E+D +    +S +S N+++ S
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDS 3881



 Score = 31.1 bits (70), Expect = 3.7
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 711  EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHS 749
            E+D    SD ++ E  E E+D +    +S +S N+++ S
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDS 3881


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The
          domain is found in the primary vegetative sigma factor.
          The function of this domain is unclear and can be
          removed without loss of function.
          Length = 211

 Score = 30.2 bits (69), Expect = 3.9
 Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 1/36 (2%)

Query: 1  MAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 36
           AAA+  E DE +  D++ D+E E+E D + E D  
Sbjct: 40 TAAAIESELDEEDLEDDDDDDEDEDE-DDEEEADLG 74



 Score = 30.2 bits (69), Expect = 3.9
 Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 1/36 (2%)

Query: 704 MAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 739
            AAA+  E DE +  D++ D+E E+E D + E D  
Sbjct: 40  TAAAIESELDEEDLEDDDDDDEDEDE-DDEEEADLG 74


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 29.9 bits (67), Expect = 4.0
 Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 8/79 (10%)

Query: 671 MKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALV---LEDDEGECSDEEQDEESE 727
           M  K   K++ V          ++ E+   S           +ED E    +E+++ E  
Sbjct: 1   MPPKGNAKKLKVRPPP-----TKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEV 55

Query: 728 EENDSDAEGDNSNLSHNDS 746
           EE  + +    S+ S + S
Sbjct: 56  EEETAASSKAPSSSSKSSS 74


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 30.8 bits (69), Expect = 4.4
 Identities = 14/51 (27%), Positives = 27/51 (52%)

Query: 689 EEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 739
           +E  E  E++ +    +     E DE E SDE  ++ SE+E+++D+  +  
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEED 979


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 30.7 bits (70), Expect = 4.5
 Identities = 10/35 (28%), Positives = 20/35 (57%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 42
           +DD+ E  +E+ D   EEE++ D + D+ +    +
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372



 Score = 30.7 bits (70), Expect = 4.5
 Identities = 10/35 (28%), Positives = 20/35 (57%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 745
           +DD+ E  +E+ D   EEE++ D + D+ +    +
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 30.8 bits (69), Expect = 5.0
 Identities = 33/131 (25%), Positives = 55/131 (41%), Gaps = 20/131 (15%)

Query: 637  SSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSENQFGEEGVEEME 696
            + IA+   D T+LE+V      I++  G +L      K  L++I+  E +  E  ++ ME
Sbjct: 1197 NEIAEIEKDKTSLEEVK----GINLSYGKNL-----GKLFLEKID-EEKKKSEHMIKAME 1246

Query: 697  KLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 756
              ++            DE +    E + E   E D  AE +  N+SH+D    H  S + 
Sbjct: 1247 AYIEDL----------DEIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKDHHIISKKH 1296

Query: 757  HNNSNQSHNTS 767
              N +     S
Sbjct: 1297 DENISDIREKS 1307


>gnl|CDD|237078 PRK12366, PRK12366, replication factor A; Reviewed.
          Length = 637

 Score = 30.8 bits (70), Expect = 5.0
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 6/72 (8%)

Query: 676 KLKQINVSENQFGEEGVEEMEKLMKSFG-----MAAALVLEDDEGECSDEEQDEESEEEN 730
           KLKQ  + E    EE  +++ + ++  G      AA +++  + G   +  Q+EE E+  
Sbjct: 7   KLKQK-ILEKISEEELEKKINEKIEENGGFLGEEAALMMVAQELGIDIEYGQEEEEEDFK 65

Query: 731 DSDAEGDNSNLS 742
            SD E    N+ 
Sbjct: 66  ISDIEEGQINVE 77


>gnl|CDD|220419 pfam09814, DUF2351, Uncharacterized conserved protein (DUF2351).
           Members of this family of proteins have no known
           function.
          Length = 350

 Score = 30.2 bits (68), Expect = 5.1
 Identities = 17/62 (27%), Positives = 29/62 (46%), Gaps = 7/62 (11%)

Query: 800 CAQLFLDEINSEPEDRYL-------ESMLLAIMKVSSLVPSTTTLQSPSSSLFECTKVLY 852
            A L L+ +NS    R++       + +LL +     LV S+ + ++ S S     KVLY
Sbjct: 237 VAALLLELLNSHSTRRFVIQSSDGKQYILLWVFNSDLLVSSSGSPRAGSLSARNAMKVLY 296

Query: 853 RE 854
           + 
Sbjct: 297 QP 298


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 30.6 bits (69), Expect = 5.2
 Identities = 17/82 (20%), Positives = 38/82 (46%), Gaps = 15/82 (18%)

Query: 665 LDLVKAMKNKTKLKQINVSENQFGEEGVE-EMEKLMKSFGMAAALVLEDDEGECSDEEQD 723
           LD++KA +NK++ +Q+   E +  ++  E + +              ED+     D E +
Sbjct: 629 LDVLKADENKSRHQQLFEGEEEDEDDLEETDDD--------------EDECEAIEDSESE 674

Query: 724 EESEEENDSDAEGDNSNLSHND 745
            ES+ E+  + E ++   ++  
Sbjct: 675 SESDGEDGEEDEQEDDAEANEG 696


>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
           Provisional.
          Length = 1466

 Score = 30.4 bits (68), Expect = 6.0
 Identities = 31/156 (19%), Positives = 66/156 (42%), Gaps = 19/156 (12%)

Query: 718 SDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLK 777
           ++++   +    ND+D + D  + ++ DS   ++    + N+ +++ + SN+       +
Sbjct: 726 NNQKVSSKKSSNNDNDKDSDMKSSAYKDSERGYDPDEMNGNSKHENESASNKKSCKMSDE 785

Query: 778 QHSVTDFLAAPSVSLFNSL------GPNCAQLFLDEINSEPEDRYLESMLLAIMKVSSLV 831
             S  +  A   +    +L       PN  ++   EI S  +D  +  + L+I+    L 
Sbjct: 786 NASENN--AGGKLPFLRNLFKRKPKAPNNLRIVYREIFSYKKD--VTIIALSILVAGGLY 841

Query: 832 PSTTTLQSP-SSSLFECTKVLYRELFNWASNSDKLS 866
           P    L +   S+LF        +  N  +NS+K S
Sbjct: 842 PVFALLYAKYVSTLF--------DFANLEANSNKYS 869



 Score = 30.0 bits (67), Expect = 9.0
 Identities = 22/121 (18%), Positives = 52/121 (42%), Gaps = 10/121 (8%)

Query: 15  SDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLK 74
           ++++   +    ND+D + D  + ++ DS   ++    + N+ +++ + SN+       +
Sbjct: 726 NNQKVSSKKSSNNDNDKDSDMKSSAYKDSERGYDPDEMNGNSKHENESASNKKSCKMSDE 785

Query: 75  QHSVTDFLAAPSVSLFNSL------GPNCAQLFLDEINSEPEDRYLESMLLAIMKVSSLV 128
             S  +  A   +    +L       PN  ++   EI S  +D  +  + L+I+    L 
Sbjct: 786 NASENN--AGGKLPFLRNLFKRKPKAPNNLRIVYREIFSYKKD--VTIIALSILVAGGLY 841

Query: 129 P 129
           P
Sbjct: 842 P 842


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
          domain.  The centromere protein B (CENP-B) dimerisation
          domain is composed of two alpha-helices, which are
          folded into an antiparallel configuration. Dimerisation
          of CENP-B is mediated by this domain, in which monomers
          dimerise to form a symmetrical, antiparallel,
          four-helix bundle structure with a large hydrophobic
          patch in which 23 residues of one monomer form van der
          Waals contacts with the other monomer. This CENP-B
          dimer configuration may be suitable for capturing two
          distant CENP-B boxes during centromeric heterochromatin
          formation.
          Length = 101

 Score = 28.2 bits (62), Expect = 6.5
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 8  EDDEGECSDEEQDEESEEENDSDAEGD 34
          E+D    SDEE+D++ E+E D D + D
Sbjct: 9  EEDSDSDSDEEEDDDDEDEEDDDEDDD 35



 Score = 28.2 bits (62), Expect = 6.5
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGD 737
           E+D    SDEE+D++ E+E D D + D
Sbjct: 9   EEDSDSDSDEEEDDDDEDEEDDDEDDD 35


>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain.  Cdc37 is a protein
           required for the activity of numerous eukaryotic protein
           kinases. This domains corresponds to the C terminal
           domain whose function is unclear. It is found C terminal
           to the Hsp90 chaperone (Heat shocked protein 90) binding
           domain pfam08565 and the N terminal kinase binding
           domain of Cdc37 pfam03234.
          Length = 89

 Score = 28.1 bits (63), Expect = 7.2
 Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 6/56 (10%)

Query: 687 FGEEGVEEMEKLMKSF---GM---AAALVLEDDEGECSDEEQDEESEEENDSDAEG 736
             +  VEE E  M+     G+    A +  E +  E  +E  +EE+E+E   + + 
Sbjct: 34  LAKMPVEEAEYHMERCIDSGLWVPNAKIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
          function of this eukaryotic protein family is unknown.
          The yeast orthologues have been implicated in cell
          cycle progression and biogenesis of 60S ribosomal
          subunits. The Schistosoma mansoni Mak16 has been shown
          to target protein transport to the nucleolus.
          Length = 97

 Score = 27.9 bits (62), Expect = 7.8
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 8  EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 39
          E+DE E  +EE+++E E E  SD E     + 
Sbjct: 62 ENDEEEEEEEEEEDEGEIEYVSDDEELEEEIE 93



 Score = 27.9 bits (62), Expect = 7.8
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 742
           E+DE E  +EE+++E E E  SD E     + 
Sbjct: 62  ENDEEEEEEEEEEDEGEIEYVSDDEELEEEIE 93


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 29.4 bits (66), Expect = 8.3
 Identities = 9/46 (19%), Positives = 23/46 (50%)

Query: 8   EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 53
             D  +  DE+  +E E+++  + E   S+ S ++ + + +   +S
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263



 Score = 29.4 bits (66), Expect = 8.3
 Identities = 9/46 (19%), Positives = 23/46 (50%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 756
             D  +  DE+  +E E+++  + E   S+ S ++ + + +   +S
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 29.7 bits (67), Expect = 8.8
 Identities = 11/46 (23%), Positives = 21/46 (45%)

Query: 693 EEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDN 738
           E+M          +  +LE+ + +  +E  DE + E  +S+ E  N
Sbjct: 225 EKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPIN 270


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 29.6 bits (67), Expect = 9.0
 Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 8/53 (15%)

Query: 690 EGVEEM----EKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDN 738
           + +EE     E+L +   +    V E+DE E SD E  EE E++++ D +G+N
Sbjct: 302 KALEEQLRQGEELRRK--IEGKSVSEEDEDEDSDSE--EEDEDDDEDDDDGEN 350


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
          the N-terminal region of RXT2-like proteins. In S.
          cerevisiae, RXT2 has been demonstrated to be involved
          in conjugation with cellular fusion (mating) and
          invasive growth. A high throughput localisation study
          has localised RXT2 to the nucleus.
          Length = 141

 Score = 28.5 bits (64), Expect = 9.1
 Identities = 9/28 (32%), Positives = 21/28 (75%)

Query: 8  EDDEGECSDEEQDEESEEENDSDAEGDN 35
          ++D G+  D+++D+E +EE D++ + +N
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDEN 75



 Score = 28.5 bits (64), Expect = 9.1
 Identities = 9/28 (32%), Positives = 21/28 (75%)

Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDN 738
           ++D G+  D+++D+E +EE D++ + +N
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDDEN 75


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 29.6 bits (66), Expect = 9.3
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 9/71 (12%)

Query: 668 VKAMKNKTKLKQINVSENQF---GEEG---VEEMEKLMKSFGMAAALVLEDDEGECSDEE 721
           VK +  KT  + + V+       GEE     E  E+L   F +   ++ E  E     EE
Sbjct: 285 VKPVIGKTFQEMVKVAAKDLILGGEEKEPKQESQEQLFNPFAIDEEMLEETQE---QQEE 341

Query: 722 QDEESEEENDS 732
           ++ E EEEND+
Sbjct: 342 EENEEEEENDT 352


>gnl|CDD|139605 PRK13484, PRK13484, putative iron-regulated outer membrane
           virulence protein; Provisional.
          Length = 682

 Score = 29.6 bits (66), Expect = 9.4
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query: 338 PAVIELNHKGRMTDKPNDDVSDILKKLESISVESGQDSTKLSFAG 382
           PA + +    ++  KP  D+ D +K +E IS+  G +   +S  G
Sbjct: 48  PASVSVITSEQLQKKPVSDLVDAVKDVEGISITGGNEKPDISIRG 92


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.128    0.355 

Gapped
Lambda     K      H
   0.267   0.0685    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,249,974
Number of extensions: 4536590
Number of successful extensions: 6302
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5632
Number of HSP's successfully gapped: 195
Length of query: 999
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 893
Effective length of database: 6,236,078
Effective search space: 5568817654
Effective search space used: 5568817654
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 64 (28.5 bits)