RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3611
         (472 letters)



>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
           inhibitor (RI)-like subfamily. LRRs are 20-29 residue
           sequence motifs present in many proteins that
           participate in protein-protein interactions and have
           different functions and cellular locations. LRRs
           correspond to structural units consisting of a beta
           strand (LxxLxLxxN/CxL conserved pattern) and an alpha
           helix. This alignment contains 12 strands corresponding
           to 11 full repeats, consistent with the extent observed
           in the subfamily acting as Ran GTPase Activating
           Proteins (RanGAP1).
          Length = 319

 Score =  179 bits (457), Expect = 1e-52
 Identities = 103/256 (40%), Positives = 149/256 (58%), Gaps = 14/256 (5%)

Query: 219 MSLKTGARLVELDLSDNAFGPIGVEGLADLLRSSCCFALEELKLNNNGLGITGCKLLSKA 278
             L  G  L ELDLSDNA GP G   L  LLRSS   +L+ELKLNNNGLG  G +LL+K 
Sbjct: 75  QGLTKGCGLQELDLSDNALGPDGCGVLESLLRSS---SLQELKLNNNGLGDRGLRLLAKG 131

Query: 279 LHDCYESSKKEGSPLALKVFIAGRNRLENEGAKMLAAVFKKLKTLERVEMPQNGIYHVGI 338
           L D          P AL+  + GRNRLE    + LA   +  + L+ + +  NGI   GI
Sbjct: 132 LKDL---------PPALEKLVLGRNRLEGASCEALAKALRANRDLKELNLANNGIGDAGI 182

Query: 339 TALSDAFEENKNLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKSAGASSIA 398
            AL++  + N NL  L+LN+N +T +GA  L + L+ L SL +LNLGD  L  AGA+++A
Sbjct: 183 RALAEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALA 242

Query: 399 K-YLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSENQFGEEGVEEMEKLM 457
              L+ N +L  ++L+CN+I+  G  DL + +  K  L ++++  N+FGEEG + + + +
Sbjct: 243 SALLSPNISLLTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRGNKFGEEGAQLLAESL 302

Query: 458 KSFG-MAAALVLEDDE 472
              G    +L ++DD 
Sbjct: 303 LEPGNELESLWVKDDS 318



 Score =  109 bits (274), Expect = 1e-26
 Identities = 56/162 (34%), Positives = 95/162 (58%), Gaps = 2/162 (1%)

Query: 2   LAAVFKKLKTLEHVEMPQNGIYHVGITALSDAFEENKKLRHLNLNDNTITYKGAIPLGQA 61
           LA   +  + L+ + +  NGI   GI AL++  + N  L  L+LN+N +T +GA  L + 
Sbjct: 157 LAKALRANRDLKELNLANNGIGDAGIRALAEGLKANCNLEVLDLNNNGLTDEGASALAET 216

Query: 62  LSKLPSLAILNLGDCLLKSAGASSIAK-YLTDNTTLEDVNLTCNEISVQGGLDLVKAMKN 120
           L+ L SL +LNLGD  L  AGA+++A   L+ N +L  ++L+CN+I+  G  DL + +  
Sbjct: 217 LASLKSLEVLNLGDNNLTDAGAAALASALLSPNISLLTLSLSCNDITDDGAKDLAEVLAE 276

Query: 121 KTKLKQINVSENQFGEEGVEEMEKLMKSFG-MAAALVLEDDE 161
           K  L ++++  N+FGEEG + + + +   G    +L ++DD 
Sbjct: 277 KESLLELDLRGNKFGEEGAQLLAESLLEPGNELESLWVKDDS 318



 Score = 68.9 bits (169), Expect = 7e-13
 Identities = 78/335 (23%), Positives = 134/335 (40%), Gaps = 35/335 (10%)

Query: 59  GQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNTTLEDVNLTCNEI--SVQGGLDLVK 116
            + L KL  L +L L    L    A ++A  L    +L+++ L+ NE     +G   L++
Sbjct: 16  TELLPKLLCLQVLRLEGNTLGEEAAKALASALRPQPSLKELCLSLNETGRIPRGLQSLLQ 75

Query: 117 AMKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEE 176
            +     L+++++S+N  G +G   +E L++S  +    +  +  G              
Sbjct: 76  GLTKGCGLQELDLSDNALGPDGCGVLESLLRSSSLQELKLNNNGLG-------------- 121

Query: 177 NDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHMSLKTGARLVELDLSDNA 236
              D           D   +        N       +      +L+    L EL+L++N 
Sbjct: 122 ---DRGLRLLAKGLKDLPPALEKLVLGRN--RLEGASCEALAKALRANRDLKELNLANNG 176

Query: 237 FGPIGVEGLADLLRSSCCFALEELKLNNNGLGITGCKLLSKALHDCYESSKKEGSPLALK 296
            G  G+  LA+ L+++C   LE L LNNNGL   G   L++ L              +L+
Sbjct: 177 IGDAGIRALAEGLKANC--NLEVLDLNNNGLTDEGASALAETLAS----------LKSLE 224

Query: 297 VFIAGRNRLENEGAKMLAAVFKKL-KTLERVEMPQNGIYHVGITALSDAFEENKNLRHLN 355
           V   G N L + GA  LA+       +L  + +  N I   G   L++   E ++L  L+
Sbjct: 225 VLNLGDNNLTDAGAAALASALLSPNISLLTLSLSCNDITDDGAKDLAEVLAEKESLLELD 284

Query: 356 LNDNTITYKGAIPLGQAL-SKLPSLAILNLGDCLL 389
           L  N    +GA  L ++L      L  L + D   
Sbjct: 285 LRGNKFGEEGAQLLAESLLEPGNELESLWVKDDSF 319



 Score = 68.2 bits (167), Expect = 1e-12
 Identities = 58/244 (23%), Positives = 98/244 (40%), Gaps = 49/244 (20%)

Query: 257 LEELKLNNNGLGITGCKLLSKALHD-------CYESSKKEGSPLALKVFIAG-------- 301
           L+ L+L  N LG    K L+ AL         C   ++    P  L+  + G        
Sbjct: 25  LQVLRLEGNTLGEEAAKALASALRPQPSLKELCLSLNETGRIPRGLQSLLQGLTKGCGLQ 84

Query: 302 --------------------------------RNRLENEGAKMLAAVFKKLK-TLERVEM 328
                                            N L + G ++LA   K L   LE++ +
Sbjct: 85  ELDLSDNALGPDGCGVLESLLRSSSLQELKLNNNGLGDRGLRLLAKGLKDLPPALEKLVL 144

Query: 329 PQNGIYHVGITALSDAFEENKNLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCL 388
            +N +      AL+ A   N++L+ LNL +N I   G   L + L    +L +L+L +  
Sbjct: 145 GRNRLEGASCEALAKALRANRDLKELNLANNGIGDAGIRALAEGLKANCNLEVLDLNNNG 204

Query: 389 LKSAGASSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKAMKN-KTKLKQINVSENQFGE 447
           L   GAS++A+ L    +LE +NL  N ++  G   L  A+ +    L  +++S N   +
Sbjct: 205 LTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSPNISLLTLSLSCNDITD 264

Query: 448 EGVE 451
           +G +
Sbjct: 265 DGAK 268



 Score = 64.7 bits (158), Expect = 2e-11
 Identities = 35/123 (28%), Positives = 61/123 (49%), Gaps = 1/123 (0%)

Query: 19  QNGIYHVGITALSDAFEENKKLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLL 78
           +N +      AL+ A   N+ L+ LNL +N I   G   L + L    +L +L+L +  L
Sbjct: 146 RNRLEGASCEALAKALRANRDLKELNLANNGIGDAGIRALAEGLKANCNLEVLDLNNNGL 205

Query: 79  KSAGASSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKAMKN-KTKLKQINVSENQFGEE 137
              GAS++A+ L    +LE +NL  N ++  G   L  A+ +    L  +++S N   ++
Sbjct: 206 TDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSPNISLLTLSLSCNDITDD 265

Query: 138 GVE 140
           G +
Sbjct: 266 GAK 268



 Score = 63.1 bits (154), Expect = 6e-11
 Identities = 60/259 (23%), Positives = 101/259 (38%), Gaps = 28/259 (10%)

Query: 26  GITALSDAFEENKKLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKSAGASS 85
           G+ +L     +   L+ L+L+DN +   G   L ++L +  SL  L L +  L   G   
Sbjct: 69  GLQSLLQGLTKGCGLQELDLSDNALGPDGCGVL-ESLLRSSSLQELKLNNNGLGDRGLRL 127

Query: 86  IAKYLTDNT-TLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSENQFGEEGVEEMEK 144
           +AK L D    LE + L  N +       L KA++    LK++N++ N  G+ G+  + +
Sbjct: 128 LAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANRDLKELNLANNGIGDAGIRALAE 187

Query: 145 LMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEG---DNSNLSHNDSNHSHNASN 201
            +K+      L L ++               E  S          +L   +         
Sbjct: 188 GLKANCNLEVLDLNNNGLTD-----------EGASALAETLASLKSLEVLNLGD------ 230

Query: 202 QSHNNSNQSHNTSNQSHMSLKTGARLVELDLSDNAFGPIGVEGLADLLRSSCCFALEELK 261
               N+      +  +   L     L+ L LS N     G + LA++L       L EL 
Sbjct: 231 ----NNLTDAGAAALASALLSPNISLLTLSLSCNDITDDGAKDLAEVLAEKES--LLELD 284

Query: 262 LNNNGLGITGCKLLSKALH 280
           L  N  G  G +LL+++L 
Sbjct: 285 LRGNKFGEEGAQLLAESLL 303



 Score = 48.9 bits (117), Expect = 2e-06
 Identities = 32/139 (23%), Positives = 64/139 (46%), Gaps = 1/139 (0%)

Query: 11  TLEHVEMPQNGIYHVGITALSDAFEENK-KLRHLNLNDNTITYKGAIPLGQALSKLPSLA 69
           +L+ +++  NG+   G+  L+   ++    L  L L  N +       L +AL     L 
Sbjct: 109 SLQELKLNNNGLGDRGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANRDLK 168

Query: 70  ILNLGDCLLKSAGASSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINV 129
            LNL +  +  AG  ++A+ L  N  LE ++L  N ++ +G   L + + +   L+ +N+
Sbjct: 169 ELNLANNGIGDAGIRALAEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNL 228

Query: 130 SENQFGEEGVEEMEKLMKS 148
            +N   + G   +   + S
Sbjct: 229 GDNNLTDAGAAALASALLS 247



 Score = 41.6 bits (98), Expect = 6e-04
 Identities = 41/186 (22%), Positives = 71/186 (38%), Gaps = 32/186 (17%)

Query: 316 VFKKLKTLERVEMPQNGIYHVGITALSDAFEENKNLRHLN-------------------- 355
           +  KL  L+ + +  N +      AL+ A     +L+ L                     
Sbjct: 18  LLPKLLCLQVLRLEGNTLGEEAAKALASALRPQPSLKELCLSLNETGRIPRGLQSLLQGL 77

Query: 356 ----------LNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNT 405
                     L+DN +   G   L ++L +  SL  L L +  L   G   +AK L D  
Sbjct: 78  TKGCGLQELDLSDNALGPDGCGVL-ESLLRSSSLQELKLNNNGLGDRGLRLLAKGLKDLP 136

Query: 406 -TLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAA 464
             LE + L  N +       L KA++    LK++N++ N  G+ G+  + + +K+     
Sbjct: 137 PALEKLVLGRNRLEGASCEALAKALRANRDLKELNLANNGIGDAGIRALAEGLKANCNLE 196

Query: 465 ALVLED 470
            L L +
Sbjct: 197 VLDLNN 202



 Score = 32.7 bits (75), Expect = 0.35
 Identities = 24/93 (25%), Positives = 47/93 (50%), Gaps = 2/93 (2%)

Query: 370 GQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNTTLEDVNLTCNEI--SVQGGLDLVK 427
            + L KL  L +L L    L    A ++A  L    +L+++ L+ NE     +G   L++
Sbjct: 16  TELLPKLLCLQVLRLEGNTLGEEAAKALASALRPQPSLKELCLSLNETGRIPRGLQSLLQ 75

Query: 428 AMKNKTKLKQINVSENQFGEEGVEEMEKLMKSF 460
            +     L+++++S+N  G +G   +E L++S 
Sbjct: 76  GLTKGCGLQELDLSDNALGPDGCGVLESLLRSS 108


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score =  106 bits (266), Expect = 3e-25
 Identities = 72/179 (40%), Positives = 98/179 (54%), Gaps = 4/179 (2%)

Query: 226 RLVELDLSDNAFGPIGVEGLADLLRSSCCFALEELKLNNNGLGITGCKLLSKALHDCYES 285
           RL ++DLSDNAFG    E L DL+ SS    L  LKLNNNGLG      + KAL     +
Sbjct: 93  RLQKVDLSDNAFGSEFPEELGDLISSST--DLVHLKLNNNGLGPIAGGRIGKALFHLAYN 150

Query: 286 SKKEGSPLALKVFIAGRNRLENEGAKMLAAVFKKLKTLERVEMPQNGIYHVGITALS-DA 344
            K    P  L+V I GRNRLEN   ++ AA+ +  + L+ V++ QNGI   G+T L+   
Sbjct: 151 KKAADKP-KLEVVICGRNRLENGSKELSAALLESHENLKEVKIQQNGIRPEGVTMLAFLG 209

Query: 345 FEENKNLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKSAGASSIAKYLTD 403
              + +L  L+L DNT T +G+  L  AL +   L  L L DCLL + G  S+ +   +
Sbjct: 210 LFYSHSLEVLDLQDNTFTLEGSRYLADALCEWNLLRELRLNDCLLSNEGVKSVLRRFNE 268



 Score = 60.7 bits (147), Expect = 5e-10
 Identities = 49/189 (25%), Positives = 80/189 (42%), Gaps = 13/189 (6%)

Query: 2   LAAVFKKLKTLEHVEMPQNGIYHVGITALS-DAFEENKKLRHLNLNDNTITYKGAIPLGQ 60
            AA+ +  + L+ V++ QNGI   G+T L+      +  L  L+L DNT T +G+  L  
Sbjct: 177 SAALLESHENLKEVKIQQNGIRPEGVTMLAFLGLFYSHSLEVLDLQDNTFTLEGSRYLAD 236

Query: 61  ALSKLPSLAILNLGDCLLKSAGASSIAKYLTD--NTTLEDVNLTCNEISVQGGLDL-VKA 117
           AL +   L  L L DCLL + G  S+ +   +     L  +    NE      LD+ +  
Sbjct: 237 ALCEWNLLRELRLNDCLLSNEGVKSVLRRFNEKFVPNLMPLPGDYNERRGGIILDISLNE 296

Query: 118 MK--NKTKLKQINVSENQFGEEGV--EEMEKLMKSFGMAAALVLEDDEGECSDEEQDEES 173
            +      L  +  + N+  E     +  E + +        V+E  EG+   EE  +E 
Sbjct: 297 FEQDAVPLLVDLERNGNRIKELADFGDYYEDIFEVVE-----VVEKQEGDVVTEESTDEE 351

Query: 174 EEENDSDAE 182
            E+     E
Sbjct: 352 SEDEVEIDE 360



 Score = 43.4 bits (102), Expect = 2e-04
 Identities = 36/167 (21%), Positives = 64/167 (38%), Gaps = 15/167 (8%)

Query: 308 EGAKMLAAVFKKLKTLERVEMPQNGIYHVGITALSDAFEENKNLRHLNLNDNTIT----- 362
           E  K +    + +  L  V++  N I    +  L +     +NLR +N +D         
Sbjct: 17  EDVKGVVEELEMMDELVEVDLSGNTIGTEAMEELCNVIANVRNLRVVNFSDAFTGRDKDE 76

Query: 363 -YKGAIPLGQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNTTLEDVNLTCNEISVQG 421
            Y   + L +AL K P L  ++L D    S     +   ++ +T L  + L  N +    
Sbjct: 77  LYSNLVMLLKALLKCPRLQKVDLSDNAFGSEFPEELGDLISSSTDLVHLKLNNNGLGPIA 136

Query: 422 GLDLVKAM---------KNKTKLKQINVSENQFGEEGVEEMEKLMKS 459
           G  + KA+          +K KL+ +    N+      E    L++S
Sbjct: 137 GGRIGKALFHLAYNKKAADKPKLEVVICGRNRLENGSKELSAALLES 183



 Score = 40.3 bits (94), Expect = 0.002
 Identities = 33/160 (20%), Positives = 60/160 (37%), Gaps = 15/160 (9%)

Query: 4   AVFKKLKTLEHVEMPQNGIYHVGITALSDAFEENKKLRHLNLNDNTIT------YKGAIP 57
              + +  L  V++  N I    +  L +     + LR +N +D          Y   + 
Sbjct: 24  EELEMMDELVEVDLSGNTIGTEAMEELCNVIANVRNLRVVNFSDAFTGRDKDELYSNLVM 83

Query: 58  LGQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKA 117
           L +AL K P L  ++L D    S     +   ++ +T L  + L  N +    G  + KA
Sbjct: 84  LLKALLKCPRLQKVDLSDNAFGSEFPEELGDLISSSTDLVHLKLNNNGLGPIAGGRIGKA 143

Query: 118 M---------KNKTKLKQINVSENQFGEEGVEEMEKLMKS 148
           +          +K KL+ +    N+      E    L++S
Sbjct: 144 LFHLAYNKKAADKPKLEVVICGRNRLENGSKELSAALLES 183



 Score = 36.8 bits (85), Expect = 0.021
 Identities = 53/255 (20%), Positives = 95/255 (37%), Gaps = 46/255 (18%)

Query: 227 LVELDLSDNAFGPIGVEGLADLLRSSCCFALEELKLNNNGLGITGCKLLSKALHDCYESS 286
           LVE+DLS N  G   +E L +++ +     +  L++ N                D +   
Sbjct: 32  LVEVDLSGNTIGTEAMEELCNVIAN-----VRNLRVVN--------------FSDAFTGR 72

Query: 287 KKEGSPLALKVFIAGRNRLENEGAKMLAAVFKKLKTLERVEMPQNGIYHVGITALSDAFE 346
            K+     L                ML     K   L++V++  N         L D   
Sbjct: 73  DKDELYSNL---------------VMLLKALLKCPRLQKVDLSDNAFGSEFPEELGDLIS 117

Query: 347 ENKNLRHLNLNDNTITYKGAIPLGQALSKL---------PSLAILNLGDCLLKSAGASSI 397
            + +L HL LN+N +       +G+AL  L         P L ++  G   L++      
Sbjct: 118 SSTDLVHLKLNNNGLGPIAGGRIGKALFHLAYNKKAADKPKLEVVICGRNRLENGSKELS 177

Query: 398 AKYLTDNTTLEDVNLTCNEISVQGG--LDLVKAMKNKTKLKQINVSENQFGEEGVEEMEK 455
           A  L  +  L++V +  N I  +G   L  +    + + L+ +++ +N F  EG   +  
Sbjct: 178 AALLESHENLKEVKIQQNGIRPEGVTMLAFLGLFYSHS-LEVLDLQDNTFTLEGSRYLAD 236

Query: 456 LMKSFGMAAALVLED 470
            +  + +   L L D
Sbjct: 237 ALCEWNLLRELRLND 251



 Score = 30.3 bits (68), Expect = 2.4
 Identities = 38/170 (22%), Positives = 67/170 (39%), Gaps = 12/170 (7%)

Query: 1   MLAAVFKKLKTLEHVEMPQNGIYHVGITALSDAFEENKKLRHLNLNDNTITYKGAIPLGQ 60
           ML     K   L+ V++  N         L D    +  L HL LN+N +       +G+
Sbjct: 83  MLLKALLKCPRLQKVDLSDNAFGSEFPEELGDLISSSTDLVHLKLNNNGLGPIAGGRIGK 142

Query: 61  ALSKL---------PSLAILNLGDCLLKSAGASSIAKYLTDNTTLEDVNLTCNEISVQGG 111
           AL  L         P L ++  G   L++      A  L  +  L++V +  N I  +G 
Sbjct: 143 ALFHLAYNKKAADKPKLEVVICGRNRLENGSKELSAALLESHENLKEVKIQQNGIRPEGV 202

Query: 112 --LDLVKAMKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLED 159
             L  +    + + L+ +++ +N F  EG   +   +  + +   L L D
Sbjct: 203 TMLAFLGLFYSHS-LEVLDLQDNTFTLEGSRYLADALCEWNLLRELRLND 251


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 38.2 bits (89), Expect = 0.005
 Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 3/84 (3%)

Query: 112 LDLVKAMKNKTKLKQINVSENQFGEEGVEEMEKL-MKSFGMAAALVLEDDEGECSDEEQD 170
           + + K    K  L +       F EE  +  EKL M    +      + DE +  DEE++
Sbjct: 120 MGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179

Query: 171 EESEEENDSDAEGDNSNLSHNDSN 194
           EE EEE++   + D+ +   +D N
Sbjct: 180 EEEEEEDEDFDDDDDDD--DDDYN 201


>gnl|CDD|197686 smart00368, LRR_RI, Leucine rich repeat, ribonuclease inhibitor
           type. 
          Length = 28

 Score = 33.5 bits (78), Expect = 0.008
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 257 LEELKLNNNGLGITGCKLLSKALHD 281
           L EL L+NN LG  G + L++AL D
Sbjct: 4   LRELDLSNNKLGDEGARALAEALKD 28



 Score = 30.8 bits (71), Expect = 0.087
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 226 RLVELDLSDNAFGPIGVEGLADLLRS 251
            L ELDLS+N  G  G   LA+ L+ 
Sbjct: 3   SLRELDLSNNKLGDEGARALAEALKD 28



 Score = 30.5 bits (70), Expect = 0.11
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 37 NKKLRHLNLNDNTITYKGAIPLGQALS 63
          N  LR L+L++N +  +GA  L +AL 
Sbjct: 1  NPSLRELDLSNNKLGDEGARALAEALK 27



 Score = 29.7 bits (68), Expect = 0.17
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 348 NKNLRHLNLNDNTITYKGAIPLGQALS 374
           N +LR L+L++N +  +GA  L +AL 
Sbjct: 1   NPSLRELDLSNNKLGDEGARALAEALK 27


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 37.2 bits (86), Expect = 0.012
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 159 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNA 199
           DD+   S  + DEE + ++D + + D++N +H D +  + A
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314



 Score = 29.9 bits (67), Expect = 2.8
 Identities = 9/36 (25%), Positives = 21/36 (58%)

Query: 162 GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSH 197
           G  SD+++   S + ++ +   D+ N  ++D+N +H
Sbjct: 270 GYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNH 305


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 36.7 bits (85), Expect = 0.018
 Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 1/62 (1%)

Query: 162 GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHMSL 221
           GE  DE+ D  S+E++D     D+S+ + +DSN S   +N+  +   +  N +  + + L
Sbjct: 277 GEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSN-SDVDTNEEDDRGEKESNGAKSNELHL 335

Query: 222 KT 223
            +
Sbjct: 336 TS 337



 Score = 33.2 bits (76), Expect = 0.23
 Identities = 15/58 (25%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNL-SHNDSNHSHNASNQSHNNSNQSHNTS 214
           ED +    +++ D +S + +DS +   NS++ ++ + +     SN +   SN+ H TS
Sbjct: 282 EDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGA--KSNELHLTS 337


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 35.8 bits (82), Expect = 0.040
 Identities = 25/115 (21%), Positives = 41/115 (35%), Gaps = 9/115 (7%)

Query: 120 NKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDS 179
              + ++    E++   E VEE E + KS           D  EC  EE++ E EEE   
Sbjct: 109 ETVEEEEK--EESREEREEVEETEGVTKSE----QKNDWRDAEECQKEEKEPEPEEEEKP 162

Query: 180 ---DAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHMSLKTGARLVELD 231
                E +N     +   H+ N  ++      Q         +  K     +EL+
Sbjct: 163 KRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELE 217


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 35.7 bits (82), Expect = 0.046
 Identities = 21/89 (23%), Positives = 41/89 (46%), Gaps = 21/89 (23%)

Query: 118 MKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEEN 177
           MK +  ++++       G+E  EE                E+D+ E  D++ D+E E+++
Sbjct: 32  MKKENAIRKL-------GKEAEEEAM-------------EEEDDDEEDDDDDDDEDEDDD 71

Query: 178 DSDAEGDNSNLSHNDSNHSHNASNQSHNN 206
           D D + D+ +   +DS   H+ S+    N
Sbjct: 72  DDDDDEDDEDEDDDDST-LHDDSSADDGN 99


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 32.6 bits (75), Expect = 0.052
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEG 183
           E++EGE  + E +E+ EE++D D +G
Sbjct: 50  EEEEGEDLESEDEEDEEEDDDDDMDG 75


>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat. 
          Length = 60

 Score = 31.7 bits (73), Expect = 0.073
 Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 8/67 (11%)

Query: 323 LERVEMPQNGIYHVGITALSDAFEENKNLRHLNLNDNTITYKGAIPLGQALSKLPSLAIL 382
           L+ +++  N +  +       AF+   NL+ L+L+ N +T   +I    A S LPSL  L
Sbjct: 2   LKSLDLSNNRLTVIP----DGAFKGLPNLKVLDLSGNNLT---SISPE-AFSGLPSLRSL 53

Query: 383 NLGDCLL 389
           +L    L
Sbjct: 54  DLSGNNL 60



 Score = 31.0 bits (71), Expect = 0.15
 Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 8/67 (11%)

Query: 12 LEHVEMPQNGIYHVGITALSDAFEENKKLRHLNLNDNTITYKGAIPLGQALSKLPSLAIL 71
          L+ +++  N +  +       AF+    L+ L+L+ N +T   +I    A S LPSL  L
Sbjct: 2  LKSLDLSNNRLTVIP----DGAFKGLPNLKVLDLSGNNLT---SISPE-AFSGLPSLRSL 53

Query: 72 NLGDCLL 78
          +L    L
Sbjct: 54 DLSGNNL 60



 Score = 28.7 bits (65), Expect = 1.0
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 8/68 (11%)

Query: 350 NLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNTTLED 409
           NL+ L+L++N +T         A   LP+L +L+L    L S    +     +   +L  
Sbjct: 1   NLKSLDLSNNRLTVIP----DGAFKGLPNLKVLDLSGNNLTSISPEA----FSGLPSLRS 52

Query: 410 VNLTCNEI 417
           ++L+ N +
Sbjct: 53  LDLSGNNL 60



 Score = 27.5 bits (62), Expect = 2.8
 Identities = 16/68 (23%), Positives = 31/68 (45%), Gaps = 8/68 (11%)

Query: 39  KLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNTTLED 98
            L+ L+L++N +T         A   LP+L +L+L    L S    +     +   +L  
Sbjct: 1   NLKSLDLSNNRLTVIP----DGAFKGLPNLKVLDLSGNNLTSISPEA----FSGLPSLRS 52

Query: 99  VNLTCNEI 106
           ++L+ N +
Sbjct: 53  LDLSGNNL 60


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 34.3 bits (79), Expect = 0.13
 Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 3/61 (4%)

Query: 154 ALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNT 213
           AL +ED   E  DEE+++E EEE + + EG +    H+D        +      +   N+
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK--EHSDDE-EFEEDDVESKYEDSDGNS 434

Query: 214 S 214
           S
Sbjct: 435 S 435


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 34.5 bits (80), Expect = 0.13
 Identities = 11/44 (25%), Positives = 20/44 (45%), Gaps = 2/44 (4%)

Query: 156 VLEDDEGECSDEEQDEESEEENDSDAEG--DNSNLSHNDSNHSH 197
            L + E ECSD E + E   +  SD     D++ +    ++ + 
Sbjct: 17  FLVEAEAECSDGEDELEDLFDTGSDISDLIDDAEVVQGGNSLAL 60


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 34.6 bits (80), Expect = 0.14
 Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 9/63 (14%)

Query: 131 ENQFGEEGV------EEMEKLMKSFGM---AAALVLEDDEGECSDEEQDEESEEENDSDA 181
           +N FG + V      +  E+L            L  +DD+ E SDEE +E S+ E+D D 
Sbjct: 90  DNVFGSDQVVIFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDD 149

Query: 182 EGD 184
           + D
Sbjct: 150 DDD 152


>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies).  Leucine rich
           repeats are short sequence motifs present in a number of
           proteins with diverse functions and cellular locations.
           These repeats are usually involved in protein-protein
           interactions. Each Leucine Rich Repeat is composed of a
           beta-alpha unit. These units form elongated non-globular
           structures. Leucine Rich Repeats are often flanked by
           cysteine rich domains.
          Length = 43

 Score = 30.2 bits (69), Expect = 0.17
 Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 6/43 (13%)

Query: 349 KNLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKS 391
            NL  L+L++N IT    +P    LS LP+L  L+L    +  
Sbjct: 1   TNLETLDLSNNQIT---DLP---PLSNLPNLETLDLSGNKITD 37



 Score = 29.0 bits (66), Expect = 0.48
 Identities = 14/43 (32%), Positives = 21/43 (48%), Gaps = 6/43 (13%)

Query: 38 KKLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKS 80
            L  L+L++N IT    +P    LS LP+L  L+L    +  
Sbjct: 1  TNLETLDLSNNQIT---DLP---PLSNLPNLETLDLSGNKITD 37


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 31.5 bits (72), Expect = 0.38
 Identities = 15/51 (29%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 136 EEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 186
           EE  +E+EK +K    A  L+ E+++ E  ++ + E++E+  D + EG   
Sbjct: 59  EERKQELEKQLKERKEALKLLEEENDDE--EDAETEDTEDVEDDEWEGFPE 107



 Score = 28.1 bits (63), Expect = 5.1
 Identities = 10/57 (17%), Positives = 20/57 (35%), Gaps = 9/57 (15%)

Query: 136 EEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDE---------ESEEENDSDAEG 183
           E+ ++E ++ +K          + +  +  D E DE            EE   D + 
Sbjct: 66  EKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDK 122


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 31.5 bits (72), Expect = 0.47
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 151 MAAALVLEDDEGECSDEEQDEESEEENDSDAE 182
           +A+     DD+ E  +EE DEE ++E++S+ E
Sbjct: 107 VASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138


>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3). 
           This family contains EBNA-3A, -3B, and -3C which are
           latent infection nuclear proteins important for
           Epstein-Barr virus (EBV)-induced B-cell immortalisation
           and the immune response to EBV infection.
          Length = 254

 Score = 31.6 bits (72), Expect = 0.67
 Identities = 12/49 (24%), Positives = 17/49 (34%), Gaps = 4/49 (8%)

Query: 150 GMAAALVLEDD----EGECSDEEQDEESEEENDSDAEGDNSNLSHNDSN 194
                  L        GE  D+   E S E  DS++E D+  L +    
Sbjct: 192 AFLRGQALGLSALRTPGEPDDDATVETSSESEDSESESDDEELPYIVPR 240


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 31.6 bits (72), Expect = 0.80
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDN 185
           EDD  E  DE++++E EEE + D EGD 
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 28.9 bits (65), Expect = 6.3
 Identities = 11/69 (15%), Positives = 29/69 (42%), Gaps = 10/69 (14%)

Query: 136 EEGVEEMEKLMKSFG-----MAAALVLEDDEGECSD-----EEQDEESEEENDSDAEGDN 185
              +E +++++K  G          V+  DE +  +     EE++EE +     D + ++
Sbjct: 246 GAALEAIKEVIKKKGGDFKVKGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEED 305

Query: 186 SNLSHNDSN 194
            +    + +
Sbjct: 306 EDEEEEEDD 314


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.9 bits (72), Expect = 0.83
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAE 182
           E++E E  +EE++EE EEE + + E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 30.3 bits (68), Expect = 2.6
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 189
           E++E E  +EE++EE EEE + + E +   LS
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLS 895


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 32.0 bits (72), Expect = 0.92
 Identities = 22/91 (24%), Positives = 37/91 (40%), Gaps = 4/91 (4%)

Query: 159 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNA----SNQSHNNSNQSHNTS 214
           + E     +    +SEE++D   E DN + S  D   S       S  S + ++ +   +
Sbjct: 102 NTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNT 161

Query: 215 NQSHMSLKTGARLVELDLSDNAFGPIGVEGL 245
            QS   L       + D + +  G +GV GL
Sbjct: 162 KQSQTGLGASGSHAQQDPAVSQSGVVGVPGL 192


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 30.7 bits (69), Expect = 0.96
 Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 8/79 (10%)

Query: 118 MKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALV---LEDDEGECSDEEQDEESE 174
           M  K   K++ V          ++ E+   S           +ED E    +E+++ E  
Sbjct: 1   MPPKGNAKKLKVRPPP-----TKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEV 55

Query: 175 EENDSDAEGDNSNLSHNDS 193
           EE  + +    S+ S + S
Sbjct: 56  EEETAASSKAPSSSSKSSS 74


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.4 bits (71), Expect = 1.3
 Identities = 15/46 (32%), Positives = 25/46 (54%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 203
           E++E E  +EE+++ESEEE   D E +    + N S      S++ 
Sbjct: 447 EEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492



 Score = 29.9 bits (67), Expect = 3.7
 Identities = 13/52 (25%), Positives = 22/52 (42%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQ 209
            +DE E  + E D  SEEE +  +EGD       +     N+     +  ++
Sbjct: 467 GEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSE 518


>gnl|CDD|225191 COG2309, AmpS, Leucyl aminopeptidase (aminopeptidase T) [Amino acid
           transport and metabolism].
          Length = 385

 Score = 31.1 bits (71), Expect = 1.4
 Identities = 10/32 (31%), Positives = 15/32 (46%)

Query: 162 GECSDEEQDEESEEENDSDAEGDNSNLSHNDS 193
           G       +  +E +N+S   GDN +  H DS
Sbjct: 326 GSAYPFNVEGGAEMQNESAIHGDNVSDVHVDS 357


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 30.8 bits (69), Expect = 1.4
 Identities = 18/71 (25%), Positives = 37/71 (52%)

Query: 119 KNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEEND 178
           +N  +L+ +N+      E G ++ E+  +       +   +   E  D+++D E ++E +
Sbjct: 85  ENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKE 144

Query: 179 SDAEGDNSNLS 189
           SDAEGD + L+
Sbjct: 145 SDAEGDENELA 155


>gnl|CDD|205694 pfam13516, LRR_6, Leucine Rich repeat. 
          Length = 24

 Score = 27.2 bits (61), Expect = 1.6
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 348 NKNLRHLNLNDNTITYKGAIPLGQ 371
             NL  L+L+ N I  +GA  L +
Sbjct: 1   LTNLTSLDLSGNQIGDEGASALAK 24



 Score = 26.0 bits (58), Expect = 4.0
 Identities = 8/24 (33%), Positives = 12/24 (50%)

Query: 37 NKKLRHLNLNDNTITYKGAIPLGQ 60
             L  L+L+ N I  +GA  L +
Sbjct: 1  LTNLTSLDLSGNQIGDEGASALAK 24


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 30.8 bits (69), Expect = 1.6
 Identities = 16/53 (30%), Positives = 24/53 (45%), Gaps = 7/53 (13%)

Query: 135 GEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSN 187
            E+   + + L K  G        D   E S+ E++E SE E+D D + DN  
Sbjct: 221 SEKEKTKKKDLEKWLG-------SDQSMETSESEEEESSESESDEDEDEDNKG 266


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 31.1 bits (70), Expect = 1.7
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 158  EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHS 196
            E+D    SD ++ E  E E+D +    +S +S N+++ S
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDS 3881


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 5/59 (8%)

Query: 132 NQFGEE----GVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 186
             +GEE    G+  +E L K          E ++G  SD++ DEE E E + D + D+ 
Sbjct: 58  KGYGEEEVVDGIPGLELLEKWK-EEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDE 115


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 30.7 bits (69), Expect = 2.0
 Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 1/55 (1%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHN 212
           ++D+ +   E  DE ++ E+DSD+    S  S  D + S + S    ++S   H+
Sbjct: 453 DEDDMDEQ-ESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHD 506


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 30.3 bits (68), Expect = 2.1
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 156 VLEDDEGECSDEEQDEESEEENDSDAEGDN 185
           VLE +E + S EE++EESEEE ++D+E D 
Sbjct: 4   VLELEEEDESGEEEEEESEEEEETDSEDDM 33


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 30.6 bits (69), Expect = 2.1
 Identities = 17/82 (20%), Positives = 38/82 (46%), Gaps = 15/82 (18%)

Query: 112 LDLVKAMKNKTKLKQINVSENQFGEEGVE-EMEKLMKSFGMAAALVLEDDEGECSDEEQD 170
           LD++KA +NK++ +Q+   E +  ++  E + +              ED+     D E +
Sbjct: 629 LDVLKADENKSRHQQLFEGEEEDEDDLEETDDD--------------EDECEAIEDSESE 674

Query: 171 EESEEENDSDAEGDNSNLSHND 192
            ES+ E+  + E ++   ++  
Sbjct: 675 SESDGEDGEEDEQEDDAEANEG 696


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 30.6 bits (69), Expect = 2.2
 Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 5/81 (6%)

Query: 147 KSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEG-----DNSNLSHNDSNHSHNASN 201
            SF      + +DD+ E     +D+E +  N +  E          L   D +    A+N
Sbjct: 77  ASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN 136

Query: 202 QSHNNSNQSHNTSNQSHMSLK 222
              N+ +Q  ++++Q+   LK
Sbjct: 137 DQTNDFDQDDSSNSQTDQGLK 157


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 30.4 bits (68), Expect = 2.3
 Identities = 14/51 (27%), Positives = 27/51 (52%)

Query: 136 EEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 186
           +E  E  E++ +    +     E DE E SDE  ++ SE+E+++D+  +  
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEED 979



 Score = 28.4 bits (63), Expect = 9.6
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 15/87 (17%)

Query: 122 TKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDA 181
           T +K I      F E+G           G +  +V  DDE    DE ++E SE E  SD 
Sbjct: 902 TIMKSIMKDPISFFEDG-----------GWSFLMVGSDDES---DESEEEVSEYEASSDD 947

Query: 182 EGDNSNLSHNDSNHSHNAS-NQSHNNS 207
           E D ++        S + S ++S N+S
Sbjct: 948 ESDETDEDEESDESSEDLSEDESENDS 974


>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
          Length = 130

 Score = 29.1 bits (65), Expect = 2.4
 Identities = 21/107 (19%), Positives = 47/107 (43%), Gaps = 9/107 (8%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 217
           + ++ +   E++  E E E  ++ + +++N      +   N   +      +   T+   
Sbjct: 1   DYNDMDMGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQRPRKTT--P 58

Query: 218 HMSLKTGARLV---ELDLSDNAFGPIGVEGLADLLRSSCCFALEELK 261
           +M+    AR++    L +S NA   + +EG  D L      A++EL+
Sbjct: 59  YMTKYERARILGTRALQISMNAPVMVELEGETDPLE----IAMKELR 101


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 30.0 bits (67), Expect = 2.6
 Identities = 13/53 (24%), Positives = 26/53 (49%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQS 210
           ++DE E  +EE++E   EEN+    G ++N +  D  +  +  +       +S
Sbjct: 136 DEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEES 188


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 30.2 bits (69), Expect = 2.6
 Identities = 20/92 (21%), Positives = 41/92 (44%), Gaps = 9/92 (9%)

Query: 94  TTLEDV-NLTCNEISVQGGLD--LVKAMKNKTK--LKQINVSENQFGEEGVEEMEKLMKS 148
           ++LE++  +   E+    G D   V+ ++ + K  L+   +++ +   + +  +E L + 
Sbjct: 378 SSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRE 437

Query: 149 FGMAAA----LVLEDDEGECSDEEQDEESEEE 176
                A      LED   +  DE  D E +EE
Sbjct: 438 LAFKLAEKGIKTLEDLAEQAVDELIDIEGDEE 469


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 30.4 bits (69), Expect = 2.9
 Identities = 10/35 (28%), Positives = 20/35 (57%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 192
           +DD+ E  +E+ D   EEE++ D + D+ +    +
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372


>gnl|CDD|213965 TIGR04352, HprK_rel_A, HprK-related kinase A.  A number of protein
           families resemble HPr kinase (see TIGR00679) but do not
           belong to that system. They include this family, which
           appears instead to be the marker for a different type of
           gene neighborhood, in which one of the conserved
           neighboring proteins resembles (but is distinct from)
           PqqD.
          Length = 280

 Score = 29.6 bits (67), Expect = 3.1
 Identities = 8/34 (23%), Positives = 17/34 (50%), Gaps = 5/34 (14%)

Query: 230 LDLSDNAF-----GPIGVEGLADLLRSSCCFALE 258
           + L++N+F     G +G + +A L+    C+   
Sbjct: 233 MRLAENSFNYEVLGEVGFDAVARLVDRCDCYEFS 266


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 29.5 bits (67), Expect = 3.4
 Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 1/36 (2%)

Query: 151 MAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 186
            AAA+  E DE +  D++ D+E E+E D + E D  
Sbjct: 40  TAAAIESELDEEDLEDDDDDDEDEDE-DDEEEADLG 74


>gnl|CDD|173970 cd08205, RuBisCO_IV_RLP, Ribulose bisphosphate carboxylase like
           proteins, Rubisco-Form IV.  Ribulose bisphosphate
           carboxylase (Rubisco) plays an important role in the
           Calvin reductive pentose phosphate pathway. It catalyzes
           the primary CO2 fixation step. Rubisco is activated by
           carbamylation of an active site lysine, stabilized by a
           divalent cation, which then catalyzes the proton
           abstraction from the substrate ribulose 1,5 bisphosphate
           (RuBP) and leads to the formation of two molecules of
           3-phosphoglycerate. Members of the Rubisco family can be
           divided into 4 subgroups, Form I-IV, which differ in
           their taxonomic distribution and subunit composition.
           Form I-III have Rubisco activity, while Form IV, also
           called Rubisco-like proteins (RLP), are missing critical
           active site residues and therefore do not catalyze CO2
           fixation. They are believed to utilize a related
           enzymatic mechanism, but have divergent functions, like
           for example 2,3-diketo-5-methylthiopentyl-1-phosphate
           enolase or 5-methylthio-d-ribulose 1-phosphate
           isomerase.
          Length = 367

 Score = 29.8 bits (68), Expect = 3.5
 Identities = 18/35 (51%), Positives = 23/35 (65%), Gaps = 5/35 (14%)

Query: 220 SLKTGARLVELDLSDN---AF-GP-IGVEGLADLL 249
           SL  G +LV+L+L D+   AF GP  G+EGL  LL
Sbjct: 91  SLLPGIKLVDLELPDSLLAAFPGPRFGIEGLRRLL 125


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 30.0 bits (67), Expect = 3.8
 Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 1/89 (1%)

Query: 132 NQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHN 191
           N  GEE   E+     + G             CS + + ++  ++ +   EG+  NL   
Sbjct: 3   NHIGEEASAEVLSGRNNTGDDGVHGGSCHPSHCSPQPRGDKFNDKFEFAGEGEKPNLL-V 61

Query: 192 DSNHSHNASNQSHNNSNQSHNTSNQSHMS 220
           D+N +H  +  S NNS ++ N +  ++ S
Sbjct: 62  DANENHINTRSSDNNSCENANMNRDANSS 90


>gnl|CDD|197543 smart00148, PLCXc, Phospholipase C, catalytic domain (part); domain
           X.  Phosphoinositide-specific phospholipases C. These
           enzymes contain 2 regions (X and Y) which together form
           a TIM barrel-like structure containing the active site
           residues. Phospholipase C enzymes (PI-PLC) act as signal
           transducers that generate two second messengers,
           inositol-1,4,5-trisphosphate and diacylglycerol. The
           bacterial enzyme appears to be a homologue of the
           mammalian PLCs.
          Length = 143

 Score = 28.4 bits (64), Expect = 4.0
 Identities = 15/52 (28%), Positives = 17/52 (32%), Gaps = 12/52 (23%)

Query: 203 SHNNSNQSHNTSNQSHM------------SLKTGARLVELDLSDNAFGPIGV 242
           SH     SHNT                  +L  G R VELD  D   G   +
Sbjct: 6   SHYFIPSSHNTYLTGKQLWGESSVEGYIQALDAGCRCVELDCWDGPDGEPVI 57


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 27.8 bits (62), Expect = 4.2
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 159 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 192
           D E E  DEE++EE EE++  D   ++  +   +
Sbjct: 3   DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAE 36


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 29.3 bits (65), Expect = 4.8
 Identities = 22/62 (35%), Positives = 26/62 (41%), Gaps = 11/62 (17%)

Query: 131 ENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEE---------NDSDA 181
           EN   E   +    L    G  A    ED++ EC D E +EE EEE         N  DA
Sbjct: 64  ENSETERPRDLTAALFAEAGEEA--EEEDNDRECPDTEAEEEDEEEEIEAPDPEVNPLDA 121

Query: 182 EG 183
           EG
Sbjct: 122 EG 123


>gnl|CDD|148632 pfam07136, DUF1385, Protein of unknown function (DUF1385).  This
           family contains a number of hypothetical bacterial
           proteins of unknown function approximately 300 residues
           in length. Some family members are predicted to be
           metal-dependent.
          Length = 235

 Score = 28.7 bits (65), Expect = 5.1
 Identities = 8/33 (24%), Positives = 14/33 (42%)

Query: 125 KQINVSENQFGEEGVEEMEKLMKSFGMAAALVL 157
           K +N S     EE  E++ K   +  +  +L  
Sbjct: 19  KALNFSAEFADEEEEEKLSKWEMALTVILSLAF 51



 Score = 28.7 bits (65), Expect = 5.1
 Identities = 8/33 (24%), Positives = 14/33 (42%)

Query: 436 KQINVSENQFGEEGVEEMEKLMKSFGMAAALVL 468
           K +N S     EE  E++ K   +  +  +L  
Sbjct: 19  KALNFSAEFADEEEEEKLSKWEMALTVILSLAF 51


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 29.3 bits (66), Expect = 5.3
 Identities = 15/84 (17%), Positives = 30/84 (35%), Gaps = 3/84 (3%)

Query: 126 QINVSENQFGEEGVEEMEKLMKSF---GMAAALVLEDDEGECSDEEQDEESEEENDSDAE 182
           Q+NV       +  +  +K                ED+E E   +E +EE  E+++ +  
Sbjct: 339 QLNVKLRNPSTKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGS 398

Query: 183 GDNSNLSHNDSNHSHNASNQSHNN 206
               + S   S+   + S    + 
Sbjct: 399 QSREDGSSESSSDVGSDSESKADK 422


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 27.4 bits (60), Expect = 5.3
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGD 184
           E+D    SDEE+D++ E+E D D + D
Sbjct: 9   EEDSDSDSDEEEDDDDEDEEDDDEDDD 35


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 28.1 bits (63), Expect = 5.8
 Identities = 9/28 (32%), Positives = 21/28 (75%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDN 185
           ++D G+  D+++D+E +EE D++ + +N
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDDEN 75


>gnl|CDD|237078 PRK12366, PRK12366, replication factor A; Reviewed.
          Length = 637

 Score = 29.2 bits (66), Expect = 5.8
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 6/72 (8%)

Query: 123 KLKQINVSENQFGEEGVEEMEKLMKSFG-----MAAALVLEDDEGECSDEEQDEESEEEN 177
           KLKQ  + E    EE  +++ + ++  G      AA +++  + G   +  Q+EE E+  
Sbjct: 7   KLKQK-ILEKISEEELEKKINEKIEENGGFLGEEAALMMVAQELGIDIEYGQEEEEEDFK 65

Query: 178 DSDAEGDNSNLS 189
            SD E    N+ 
Sbjct: 66  ISDIEEGQINVE 77


>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
           kinase; Provisional.
          Length = 968

 Score = 29.0 bits (65), Expect = 5.9
 Identities = 37/124 (29%), Positives = 59/124 (47%), Gaps = 26/124 (20%)

Query: 343 DAFEENKNLRHLNLNDNTIT---YKGAIPLGQALS--------KLP-------SLAILNL 384
           D F  + +LR+LNL++N  T    +G+IP  + L         ++P       SL +L+L
Sbjct: 112 DIFTTSSSLRYLNLSNNNFTGSIPRGSIPNLETLDLSNNMLSGEIPNDIGSFSSLKVLDL 171

Query: 385 GDCLLKSAGASSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSENQ 444
           G  +L       I   LT+ T+LE + L  N++  Q   +L + MK+   LK I +  N 
Sbjct: 172 GGNVL----VGKIPNSLTNLTSLEFLTLASNQLVGQIPRELGQ-MKS---LKWIYLGYNN 223

Query: 445 FGEE 448
              E
Sbjct: 224 LSGE 227



 Score = 29.0 bits (65), Expect = 7.4
 Identities = 37/124 (29%), Positives = 58/124 (46%), Gaps = 26/124 (20%)

Query: 32  DAFEENKKLRHLNLNDNTIT---YKGAIPLGQALS--------KLP-------SLAILNL 73
           D F  +  LR+LNL++N  T    +G+IP  + L         ++P       SL +L+L
Sbjct: 112 DIFTTSSSLRYLNLSNNNFTGSIPRGSIPNLETLDLSNNMLSGEIPNDIGSFSSLKVLDL 171

Query: 74  GDCLLKSAGASSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSENQ 133
           G  +L       I   LT+ T+LE + L  N++  Q   +L + MK+   LK I +  N 
Sbjct: 172 GGNVL----VGKIPNSLTNLTSLEFLTLASNQLVGQIPRELGQ-MKS---LKWIYLGYNN 223

Query: 134 FGEE 137
              E
Sbjct: 224 LSGE 227


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 28.6 bits (64), Expect = 5.9
 Identities = 9/46 (19%), Positives = 23/46 (50%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 203
             D  +  DE+  +E E+++  + E   S+ S ++ + + +   +S
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263



 Score = 28.2 bits (63), Expect = 8.9
 Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 158 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 203
           EDD  E  D++Q EE EE   SD+  ++S+ S  +       + ++
Sbjct: 227 EDDPKEDEDDDQGEE-EESGSSDSLSEDSDASSEEMESGEMEAAEA 271


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 28.8 bits (64), Expect = 6.5
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 9/71 (12%)

Query: 115 VKAMKNKTKLKQINVSENQF---GEEG---VEEMEKLMKSFGMAAALVLEDDEGECSDEE 168
           VK +  KT  + + V+       GEE     E  E+L   F +   ++ E  E     EE
Sbjct: 285 VKPVIGKTFQEMVKVAAKDLILGGEEKEPKQESQEQLFNPFAIDEEMLEETQE---QQEE 341

Query: 169 QDEESEEENDS 179
           ++ E EEEND+
Sbjct: 342 EENEEEEENDT 352


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 28.6 bits (64), Expect = 7.8
 Identities = 9/70 (12%), Positives = 34/70 (48%), Gaps = 3/70 (4%)

Query: 145 LMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSH 204
              +   A A   +D + + + +    +S++ ND +   D S+   + +++++ ++  ++
Sbjct: 18  PTLTSPTAYA---DDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNN 74

Query: 205 NNSNQSHNTS 214
           +    + ++S
Sbjct: 75  DKKFSTIDSS 84


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 28.5 bits (64), Expect = 7.8
 Identities = 11/46 (23%), Positives = 21/46 (45%)

Query: 140 EEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDN 185
           E+M          +  +LE+ + +  +E  DE + E  +S+ E  N
Sbjct: 225 EKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPIN 270


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 28.2 bits (63), Expect = 8.8
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 157 LEDDEGECSDEEQDEESEEENDSDAEGDNSN 187
            ED E +   E++DEESEEEN+ + E  +  
Sbjct: 75  SEDKEDKGDAEKEDEESEEENEEEDEESSDE 105


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 28.5 bits (63), Expect = 9.4
 Identities = 33/131 (25%), Positives = 55/131 (41%), Gaps = 20/131 (15%)

Query: 84   SSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSENQFGEEGVEEME 143
            + IA+   D T+LE+V      I++  G +L      K  L++I+  E +  E  ++ ME
Sbjct: 1197 NEIAEIEKDKTSLEEVK----GINLSYGKNL-----GKLFLEKID-EEKKKSEHMIKAME 1246

Query: 144  KLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 203
              ++            DE +    E + E   E D  AE +  N+SH+D    H  S + 
Sbjct: 1247 AYIEDL----------DEIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKDHHIISKKH 1296

Query: 204  HNNSNQSHNTS 214
              N +     S
Sbjct: 1297 DENISDIREKS 1307


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.128    0.349 

Gapped
Lambda     K      H
   0.267   0.0685    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,790,280
Number of extensions: 2173002
Number of successful extensions: 3405
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2934
Number of HSP's successfully gapped: 177
Length of query: 472
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 372
Effective length of database: 6,502,202
Effective search space: 2418819144
Effective search space used: 2418819144
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 61 (27.4 bits)