RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6522
         (1407 letters)



>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score =  176 bits (449), Expect = 3e-49
 Identities = 95/320 (29%), Positives = 148/320 (46%), Gaps = 37/320 (11%)

Query: 202 GHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDGKLLATLRGCSGEISDIAIDNRNILL 261
           GH   V CV F   G+++ TG+ D  IK+W    G+LL TL+G +G + D+A       L
Sbjct: 7   GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYL 66

Query: 262 AAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWKY 321
           A+G+ DKTIR+W+L+T   +  L GH   ++ V F P        L+S+S D  I  W  
Sbjct: 67  ASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG----RILSSSSRDKTIKVW-- 120

Query: 322 KLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPGGLFLATGSGDHHVRVYKMDGVDSP 381
             D++  K +          +R     +   +FSP G F+A+ S D  ++++ +      
Sbjct: 121 --DVETGKCLT--------TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRT-GKC 169

Query: 382 LGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQWRFIRLDMNTCLPHTKE 441
           +  L    H+ +V+S+ +S    + +S S DGT  +W     +       + T   H   
Sbjct: 170 VATL--TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKC------LGTLRGHENG 221

Query: 442 TSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESH 501
                     V  VA+      + +  + +  I++WD   G+ VQ L GHTN V  L   
Sbjct: 222 ----------VNSVAFSPDGYLLASG-SEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS 270

Query: 502 PFDSRVLLSAGHDGLIIIWD 521
           P D + L S   DG I IWD
Sbjct: 271 P-DGKRLASGSADGTIRIWD 289



 Score =  142 bits (360), Expect = 2e-37
 Identities = 83/326 (25%), Positives = 135/326 (41%), Gaps = 58/326 (17%)

Query: 238 LLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFC 297
           L  TL+G +G ++ +A      LLA G+ D TI++W+L+T   +  L GH G +  V   
Sbjct: 1   LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60

Query: 298 PLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPG 357
                   YLAS S+D  I  W  +    +     H             +++   +FSP 
Sbjct: 61  A----DGTYLASGSSDKTIRLWDLETGECVRTLTGHT------------SYVSSVAFSPD 104

Query: 358 GLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALI 417
           G  L++ S D  ++V+    V++   +  +  H+D V+S+ +S       S S+DGT  +
Sbjct: 105 GRILSSSSRDKTIKVWD---VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161

Query: 418 WYYKCSQWRFIRLDMNTCLP------HTKETSDETNKKIKVTMVAWDASDRWVITAINFN 471
           W            D+ T         HT E          V  VA+      ++++ +  
Sbjct: 162 W------------DLRTGKCVATLTGHTGE----------VNSVAFSPDGEKLLSSSSDG 199

Query: 472 FQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQRGYK 531
             IK+WD   G  +  L+GH N V  +   P D  +L S   DG I +WD+ + +     
Sbjct: 200 -TIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVWDLRTGECVQ-- 255

Query: 532 WFHNIVEGQG-EGALFDGKWSPDGTT 556
                    G   ++    WSPDG  
Sbjct: 256 ------TLSGHTNSVTSLAWSPDGKR 275



 Score =  133 bits (338), Expect = 2e-34
 Identities = 71/293 (24%), Positives = 118/293 (40%), Gaps = 44/293 (15%)

Query: 190 KYSNLRQHYVNLGHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDGKLLATLRGCSGEI 249
                       GH   V  V     G  + +G+ D  I+LW    G+ + TL G +  +
Sbjct: 37  DLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYV 96

Query: 250 SDIAIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFCPLEVNGFNYLAS 309
           S +A      +L++ + DKTI++W+++T   ++ L GH   +  V F P       ++AS
Sbjct: 97  SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDG----TFVAS 152

Query: 310 TSTDGCIGFWKYKLDIDISKTV----FHEPVMFNERIRPGNAHILCSSFSPGGLFLATGS 365
           +S DG I  W    D+   K V     H               +   +FSP G  L + S
Sbjct: 153 SSQDGTIKLW----DLRTGKCVATLTGHTGE------------VNSVAFSPDGEKLLSSS 196

Query: 366 GDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQW 425
            D  ++++ +      LG L    H + V+S+ +S       SGS DGT  +W       
Sbjct: 197 SDGTIKLWDLST-GKCLGTLRG--HENGVNSVAFSPDGYLLASGSEDGTIRVW------- 246

Query: 426 RFIRLDMNTCLPHTKETSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWD 478
               L    C+      ++       VT +AW    + + +    +  I+IWD
Sbjct: 247 ---DLRTGECVQTLSGHTNS------VTSLAWSPDGKRLASGS-ADGTIRIWD 289


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score =  124 bits (312), Expect = 8e-30
 Identities = 98/433 (22%), Positives = 171/433 (39%), Gaps = 61/433 (14%)

Query: 143 LSRSLMISSHKTSLLVPNSVKVHNTVNVLRSREYSGPLSRFLM---ISSHKYSNLRQHYV 199
           L  +   SS   S L+  S    +  ++      S      L+     S          +
Sbjct: 1   LVDNSSTSSENKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSLL 60

Query: 200 NLGHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDG-KLLATLRGCSG----EISDIAI 254
             GH  ++  + F   GE++++G+ D  IKLW   +G KL+ +L G       +++  + 
Sbjct: 61  LRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSP 120

Query: 255 DNRNILLAAGTVDKTIRIWNLQTLA-PISVLVGHQGIITGVNFCPLEVNGFNYLASTSTD 313
           D  +ILLA+ ++D T+++W+L T    I  L GH   +T + F P   +G    + +S D
Sbjct: 121 DGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSP---DGKLLASGSSLD 177

Query: 314 GCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCS--------------------- 352
           G I  W  +    +S    H   + +    P    ++ S                     
Sbjct: 178 GTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRST 237

Query: 353 ----------SFSPGGLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHS 402
                     SFSP G  LA+GS D  +R++ +    S L  L    HS  V S+ +S  
Sbjct: 238 LSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSG--HSSSVLSVAFSPD 295

Query: 403 HLRFVSGSRDGTALIWYYKCSQWRFIRLDMNTCLPHTKETSDETNKKIKVTMVAWDASDR 462
                SGS DGT  +W              +        +      +  V+ +++     
Sbjct: 296 GKLLASGSSDGTVRLW--------------DLETGKLLSSLTLKGHEGPVSSLSFSPDGS 341

Query: 463 WVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDI 522
            +++  + +  I++WD   G  ++ L+GH+N + V  S   D RV+ S   DG + +WD+
Sbjct: 342 LLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSV--SFSPDGRVVSSGSTDGTVRLWDL 399

Query: 523 LSSKQRGYKWFHN 535
            +         H 
Sbjct: 400 STGSLLRNLDGHT 412



 Score = 89.4 bits (220), Expect = 3e-18
 Identities = 61/289 (21%), Positives = 110/289 (38%), Gaps = 31/289 (10%)

Query: 295 NFCPLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSF 354
           +     +N  + L+  S++  I       D  +S          +  +R     I   +F
Sbjct: 19  SELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLS-----SLLLRGHEDSITSIAF 73

Query: 355 SPGGLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSH---SHLRFVSGSR 411
           SP G  L +GS D  ++++ +D  +  +  LE   H   V  +  S    + +   S S 
Sbjct: 74  SPDGELLLSGSSDGTIKLWDLDNGEKLIKSLE-GLHDSSVSKLALSSPDGNSILLASSSL 132

Query: 412 DGTALIWYYKCSQWRFIRLDMNTCLPHTKETSDETNKKIKVTMVAWDASDRWVITAINFN 471
           DGT  +W             + T   H++           VT +A+    + + +  + +
Sbjct: 133 DGTVKLW-----DLSTPGKLIRTLEGHSES----------VTSLAFSPDGKLLASGSSLD 177

Query: 472 FQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQRGYK 531
             IK+WD   G  +  L GHT+ V  L   P    ++ S   DG I +WD+ +      K
Sbjct: 178 GTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTG-----K 232

Query: 532 WFHNIVEGQGEGALFDGKWSPDGTTCALTDSYGHLLVYGLGGFKRPQNI 580
              + + G  +  +    +SPDG+  A   S G + ++ L         
Sbjct: 233 LLRSTLSGHSDSVV--SSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRT 279



 Score = 88.6 bits (218), Expect = 5e-18
 Identities = 81/374 (21%), Positives = 137/374 (36%), Gaps = 47/374 (12%)

Query: 132 NVLRSREYSGPLSRSLMISSHKTSLLVPNSVKVHNTVNVLRSREYSGPLSRFLMISSHKY 191
             ++  + S P      +  H  S+    S+       +L S      L   + +   + 
Sbjct: 134 GTVKLWDLSTPGKLIRTLEGHSESV---TSLAFSPDGKLLAS---GSSLDGTIKLWDLRT 187

Query: 192 SNLRQHYVNLGHLSAVFCVLFDKLGE-VIVTGADDLLIKLWRARDGKLLATLRGCSGEIS 250
                     GH   V  + F   G  +I +G+ D  I+LW    GKLL +      +  
Sbjct: 188 GKPLS--TLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV 245

Query: 251 DIAIDNRNILLAAGTVDKTIRIWNLQTLA-PISVLVGHQGIITGVNFCPLEVNGFNYLAS 309
             +      LLA+G+ D TIR+W+L++ +  +  L GH   +  V F P        LAS
Sbjct: 246 VSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDG----KLLAS 301

Query: 310 TSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPGGLFLATG-SGDH 368
            S+DG +  W    D++  K +    +  +E        +   SFSP G  L +G S D 
Sbjct: 302 GSSDGTVRLW----DLETGKLLSSLTLKGHEGP------VSSLSFSPDGSLLVSGGSDDG 351

Query: 369 HVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQWRFI 428
            +R++ +        +    E    V S+ +S       SGS DGT  +W          
Sbjct: 352 TIRLWDLRTGKPLKTL----EGHSNVLSVSFSPDGRVVSSGSTDGTVRLW---------- 397

Query: 429 RLDMNTCLPHTKETSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVL 488
                  L       +      +VT + +    + + +  + N  I++WD     L  V 
Sbjct: 398 ------DLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNT-IRLWDLKT-SLKSVS 449

Query: 489 KGHTNEVFVLESHP 502
                +V   +S  
Sbjct: 450 FSPDGKVLASKSSD 463


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 44.6 bits (106), Expect = 3e-06
 Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 481 NGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWD 521
           +G+L++ LKGHT  V  +   P D + L S   DG I +WD
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40



 Score = 41.5 bits (98), Expect = 5e-05
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 235 DGKLLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWN 274
            G+LL TL+G +G ++ +A       LA+G+ D TI++W+
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 40.0 bits (94), Expect = 1e-04
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 384 ILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIW 418
           +  ++ H+  V S+ +S       SGS DGT  +W
Sbjct: 5   LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39



 Score = 38.1 bits (89), Expect = 9e-04
 Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 4/44 (9%)

Query: 277 TLAPISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWK 320
           +   +  L GH G +T V F P   +G  YLAS S DG I  W 
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSP---DG-KYLASGSDDGTIKLWD 40



 Score = 35.8 bits (83), Expect = 0.005
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 202 GHLSAVFCVLFDKLGEVIVTGADDLLIKLW 231
           GH   V  V F   G+ + +G+DD  IKLW
Sbjct: 10  GHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39



 Score = 35.4 bits (82), Expect = 0.006
 Identities = 9/29 (31%), Positives = 17/29 (58%)

Query: 346 NAHILCSSFSPGGLFLATGSGDHHVRVYK 374
              +   +FSP G +LA+GS D  ++++ 
Sbjct: 12  TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 43.9 bits (104), Expect = 7e-06
 Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 482 GDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWD 521
           G L++ LKGHT  V  +   P D  +L S   DG + +WD
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSP-DGNLLASGSDDGTVRVWD 39



 Score = 41.2 bits (97), Expect = 5e-05
 Identities = 17/39 (43%), Positives = 27/39 (69%)

Query: 236 GKLLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWN 274
           GKLL TL+G +G ++ +A      LLA+G+ D T+R+W+
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 40.8 bits (96), Expect = 9e-05
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 384 ILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIW 418
           +  ++ H+  V S+ +S       SGS DGT  +W
Sbjct: 4   LRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 35.8 bits (83), Expect = 0.004
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 202 GHLSAVFCVLFDKLGEVIVTGADDLLIKLW 231
           GH   V  V F   G ++ +G+DD  +++W
Sbjct: 9   GHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 35.8 bits (83), Expect = 0.005
 Identities = 16/41 (39%), Positives = 19/41 (46%), Gaps = 4/41 (9%)

Query: 280 PISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWK 320
            +  L GH G +T V F P      N LAS S DG +  W 
Sbjct: 3   LLRTLKGHTGPVTSVAFSPDG----NLLASGSDDGTVRVWD 39



 Score = 34.3 bits (79), Expect = 0.017
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 346 NAHILCSSFSPGGLFLATGSGDHHVRVYK 374
              +   +FSP G  LA+GS D  VRV+ 
Sbjct: 11  TGPVTSVAFSPDGNLLASGSDDGTVRVWD 39


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This protein,
            which interacts with both microtubules and TRAF3 (tumour
            necrosis factor receptor-associated factor 3), is
            conserved from worms to humans. The N-terminal region is
            the microtubule binding domain and is well-conserved; the
            C-terminal 100 residues, also well-conserved, constitute
            the coiled-coil region which binds to TRAF3. The central
            region of the protein is rich in lysine and glutamic acid
            and carries KKE motifs which may also be necessary for
            tubulin-binding, but this region is the least
            well-conserved.
          Length = 506

 Score = 46.8 bits (111), Expect = 7e-05
 Identities = 29/183 (15%), Positives = 65/183 (35%), Gaps = 15/183 (8%)

Query: 1215 QAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEG---EEEEESNDQSEEEEVE 1271
            +  + G     ++T+  +     SG+   ++  + ++E +    + +EE  D+  +EE +
Sbjct: 79   RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138

Query: 1272 E--------NSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKR 1323
            E          +E++ EE   R    +R  V    R   P      ++ +   +  + ++
Sbjct: 139  EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQ 198

Query: 1324 TNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETESEEEES 1383
                           + D  +  ++EE  G  R   T    +  +  SSE    S     
Sbjct: 199  AAREAVKG----KPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLK 254

Query: 1384 QAA 1386
            +  
Sbjct: 255  KPD 257



 Score = 36.8 bits (85), Expect = 0.087
 Identities = 28/173 (16%), Positives = 61/173 (35%), Gaps = 3/173 (1%)

Query: 1221 QTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
            + +    ++G  +  K +   +     E E E E  +EE    + ++E+ +E  ++ + +
Sbjct: 78   KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEE---KKKKKEKPKEEPKDRKPK 134

Query: 1281 EAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDS 1340
            E    +RP +     K K+   P       +   VR   R K+   ++  +       + 
Sbjct: 135  EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEE 194

Query: 1341 DSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRR 1393
                 ++E  +           + +          T S  EE ++ Q S   R
Sbjct: 195  KQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISR 247


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
            Provisional.
          Length = 672

 Score = 44.1 bits (105), Expect = 5e-04
 Identities = 36/242 (14%), Positives = 68/242 (28%), Gaps = 13/242 (5%)

Query: 1144 SGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYAM 1203
             G ++ ++   +      R         R+          Q     +      +    A 
Sbjct: 60   GGGAAAAAATPAAPAAAARRAARAAAAARQA--------EQPAAEAAAAKAEAAPAARAA 111

Query: 1204 RARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESND 1263
             A  ++   + +A +  +         + +  K    G         D  E  EEEE ++
Sbjct: 112  AAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDE 171

Query: 1264 QSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKR 1323
            +    + E+   E E  E   R    R  +     R                    R  R
Sbjct: 172  RRRRGDREDRQAEAERGERGRREERGRDGD--DRDRRDRREQGDRREERGRRDGGDRRGR 229

Query: 1324 TNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARR---TRTKGQSYAVHSSETETESEE 1380
               R         + +   + +  + E  G RR RR      +G+      +E E E  E
Sbjct: 230  RRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELRE 289

Query: 1381 EE 1382
            ++
Sbjct: 290  DD 291



 Score = 38.3 bits (90), Expect = 0.033
 Identities = 21/141 (14%), Positives = 37/141 (26%), Gaps = 4/141 (2%)

Query: 1133 EENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDV 1192
                    ++E     +  E     +   R  +A R    R    G     + R +  + 
Sbjct: 152  PATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQ 211

Query: 1193 SDYDSGDRYAMRA-RQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESED 1251
             D            R+ ++ +  +   RG      R        +  G  RGR+     D
Sbjct: 212  GDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRR---FRD 268

Query: 1252 EVEGEEEEESNDQSEEEEVEE 1272
                           E E+ E
Sbjct: 269  RDRRGRRGGDGGNEREPELRE 289



 Score = 34.9 bits (81), Expect = 0.32
 Identities = 23/197 (11%), Positives = 44/197 (22%), Gaps = 15/197 (7%)

Query: 1202 AMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEES 1261
            A++  +     +  A          R                R+A +   E    + E +
Sbjct: 54   AIKEARGGGAAAAAATPAAPAAAARRAARA--------AAAARQAEQPAAEAAAAKAEAA 105

Query: 1262 NDQSEEEEVE-ENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGR 1320
                       E +   E  +A  RR     A      R      +   +  R+      
Sbjct: 106  PAARAAAAAAAEAASAPEAAQARERRERGEAA-RRGAARKAGEGGEQPATEARADAAERT 164

Query: 1321 AKRTNGRQSSSLLMTCSGDSD--SEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETES 1378
             +     +                E   +EE         R   + Q       E     
Sbjct: 165  EEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQG---DRREERGRR 221

Query: 1379 EEEESQAAQVSFSRRGR 1395
            +  + +  +    RR  
Sbjct: 222  DGGDRRGRRRRRDRRDA 238



 Score = 33.7 bits (78), Expect = 0.86
 Identities = 15/149 (10%), Positives = 39/149 (26%), Gaps = 11/149 (7%)

Query: 1143 ESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYA 1202
               +++ ++E  S  E      +  R    R      +     +      +D        
Sbjct: 108  ARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEE 167

Query: 1203 MRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESN 1262
             R  + ++          +       RG++         RGR   + +     E+ +   
Sbjct: 168  ERDERRRRGDREDRQAEAE----RGERGRR-------EERGRDGDDRDRRDRREQGDRRE 216

Query: 1263 DQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
            ++   +  +        +    R    R 
Sbjct: 217  ERGRRDGGDRRGRRRRRDRRDARGDDNRE 245


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 44.3 bits (105), Expect = 5e-04
 Identities = 53/237 (22%), Positives = 90/237 (37%), Gaps = 20/237 (8%)

Query: 1141 SSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDR 1200
            S    +  Q       +    R    R+  PR+ NR   +  R  R+N +     ++ + 
Sbjct: 581  SGGEETKPQEQPAPKAEAKPERQQDRRK--PRQNNRRDRNERRDTRDNRTRREGRENREE 638

Query: 1201 YAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEE 1260
               R R+  Q Q+ +     Q  VT + R +    +   R R R+     DE    ++E 
Sbjct: 639  -NRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRR---RNDEKRQAQQEA 694

Query: 1261 SNDQSEEEEVEENSEEEEEEEAVPRRRP------VRRANVAKVKRAVPPAPDTSTSRLRS 1314
                 EE+ V+E  +EE  ++  PRR+       VR       +   P   +T  +    
Sbjct: 695  KALNVEEQSVQETEQEERVQQVQPRRKQRQLNQKVRIEQSVAEEAVAPVVEETVAA--EP 752

Query: 1315 VRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQE--EEQYGSRRARRT----RTKGQ 1365
            V Q   A RT   +    ++  +     E+N+ E  +     RR+RR+    R  GQ
Sbjct: 753  VVQEVPAPRTELVKVPLPVVAQTAPEQDEENNAENRDNNGMPRRSRRSPRHLRVSGQ 809



 Score = 39.6 bits (93), Expect = 0.013
 Identities = 33/154 (21%), Positives = 57/154 (37%), Gaps = 13/154 (8%)

Query: 1258 EEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQ 1317
             EE+  Q +     E   E +++   PR+   R  N  +  R      +   +R  + R 
Sbjct: 583  GEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRN 642

Query: 1318 NGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSY-------AVH 1370
              +A++       S     +  + ++D  Q++     R+ RR   K Q+           
Sbjct: 643  RRQAQQQTAETRESQQAEVTEKARTQD-EQQQAPRRERQRRRNDEKRQAQQEAKALNVEE 701

Query: 1371 SSETETESEEEESQAAQVSFSRRGRMRKPTAKLR 1404
             S  ETE EE   Q       RR + R+   K+R
Sbjct: 702  QSVQETEQEERVQQVQP----RRKQ-RQLNQKVR 730



 Score = 30.8 bits (70), Expect = 7.5
 Identities = 12/58 (20%), Positives = 17/58 (29%), Gaps = 1/58 (1%)

Query: 1249 SEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPD 1306
            +E EV  +       +   E     + E E   A          N A       PAP+
Sbjct: 971  AEPEVVAQPAAPVVAEVAAEVETVTAVEPEVAPAQV-PEATVEHNHATAPMTRAPAPE 1027


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
            binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 44.2 bits (104), Expect = 6e-04
 Identities = 25/55 (45%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVA 1295
            G G    +SE+E   EEEEE  ++ EEEE EE  EEEE EE +    P  R   A
Sbjct: 855  GGGSDGGDSEEE---EEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQA 906



 Score = 43.4 bits (102), Expect = 0.001
 Identities = 20/40 (50%), Positives = 24/40 (60%)

Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            +G       D  + EEEEE  ++ EEEE EE  EEEEEEE
Sbjct: 850  KGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 41.9 bits (98), Expect = 0.003
 Identities = 18/47 (38%), Positives = 24/47 (51%)

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
              K  +     +G + EE  ++ EEEE EE  EEEEEEE      P+
Sbjct: 848  DEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894



 Score = 40.7 bits (95), Expect = 0.006
 Identities = 18/37 (48%), Positives = 22/37 (59%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            +A + E  V+G    +  D  EEEE EE  EEEEEEE
Sbjct: 844  EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEE 880



 Score = 32.7 bits (74), Expect = 1.8
 Identities = 17/68 (25%), Positives = 27/68 (39%)

Query: 1215 QAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENS 1274
            +A   G+       R  ++  +      G    E E E +GE E E    +E +  +E  
Sbjct: 637  EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696

Query: 1275 EEEEEEEA 1282
             E E +EA
Sbjct: 697  GEIEAKEA 704



 Score = 32.3 bits (73), Expect = 2.7
 Identities = 28/141 (19%), Positives = 47/141 (33%), Gaps = 4/141 (2%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDT 1307
            E   E EGE  EES  ++E+E   E   E E E  +P     R+              D 
Sbjct: 650  ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAE---RKGEQEGEGEIEAKEADH 706

Query: 1308 STSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARR-TRTKGQS 1366
                     ++       G +    + T     + ED  + E +          R + + 
Sbjct: 707  KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEH 766

Query: 1367 YAVHSSETETESEEEESQAAQ 1387
                 +E + + +E E QA +
Sbjct: 767  EGETEAEGKEDEDEGEIQAGE 787



 Score = 30.7 bits (69), Expect = 6.9
 Identities = 16/87 (18%), Positives = 30/87 (34%), Gaps = 1/87 (1%)

Query: 1194 DYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEV 1253
            D   GD         +  +  +     +      + G ++  +     +G    E E   
Sbjct: 629  DLSKGDVAEAEHTGERTGEEGERPTEAEGENGEES-GGEAEQEGETETKGENESEGEIPA 687

Query: 1254 EGEEEEESNDQSEEEEVEENSEEEEEE 1280
            E + E+E   + E +E +   E E EE
Sbjct: 688  ERKGEQEGEGEIEAKEADHKGETEAEE 714


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This family
            includes the radial spoke head proteins RSP4 and RSP6
            from Chlamydomonas reinhardtii, and several eukaryotic
            homologues, including mammalian RSHL1, the protein
            product of a familial ciliary dyskinesia candidate gene.
          Length = 481

 Score = 42.0 bits (99), Expect = 0.002
 Identities = 29/101 (28%), Positives = 43/101 (42%), Gaps = 16/101 (15%)

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRA 1300
            GR       + + E E+E+E     EEEE EE  EE E EE  P   P+  +  A +   
Sbjct: 337  GRCTWVNPEQKDEEEEQEDE-----EEEEEEEEPEEPEPEEGPPLLTPI--SEDAPLPND 389

Query: 1301 VPP---------APDTSTSRLRSVRQNGRAKRTNGRQSSSL 1332
             P          +P  + + LRS R  G     NG++  ++
Sbjct: 390  DPAWTFRLSSSLSPKYAVAVLRSNRWPGAYTYANGKKFENI 430


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
            Fas-binding protein) is thought to play a role in
            apoptosis, but precise role played by Daxx remains to be
            determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 41.4 bits (97), Expect = 0.003
 Identities = 36/124 (29%), Positives = 53/124 (42%), Gaps = 18/124 (14%)

Query: 1180 SLGRQYRENPS-DVSDYDSG-DRYAMRA-------RQSKQVQSVQAGR-RGQTIVTSRTR 1229
            SL R+ R N    VS  +    +YAM+        R+ +Q +  Q    R      + + 
Sbjct: 368  SLARKLRSNREVAVSRLEEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASST 427

Query: 1230 GKQSYNKFSGRGRGRKAPESEDEVEGE--------EEEESNDQSEEEEVEENSEEEEEEE 1281
              +S +  S      ++ E E+E E E        EEEE  D+ EEEEVE ++  EEE E
Sbjct: 428  SGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487

Query: 1282 AVPR 1285
                
Sbjct: 488  GSSE 491



 Score = 32.2 bits (73), Expect = 2.6
 Identities = 41/263 (15%), Positives = 80/263 (30%), Gaps = 49/263 (18%)

Query: 1129 SSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYREN 1188
            +S +   S S +S ES       EE+ E+E+                        +  E 
Sbjct: 424  ASSTSGESPSMASQESEEEESVEEEEEEEEE-----------------------EEEEEQ 460

Query: 1189 PSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRT----RGKQSYNKFSGRGRGR 1244
             S+  + +  +        +   + ++    G            R        S    G+
Sbjct: 461  ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQ 520

Query: 1245 ------KAPESEDEVEGE----EEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANV 1294
                    PES  E   +    + E   ++S+EE + E S      E      PV     
Sbjct: 521  QPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVETK-- 578

Query: 1295 AKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGS 1354
                R +PP P +      +  +N  A  T+  ++ +     S  +  ++      +   
Sbjct: 579  ISSSRKLPPPPVS------TSLENDSATVTSTTRNGN----VSPHTPQDEQPPSGRKRKR 628

Query: 1355 RRARRTRTKGQSYAVHSSETETE 1377
            +    +   G  Y  H + +E +
Sbjct: 629  KEEVESEPLGNQYLRHHNGSEKD 651



 Score = 30.7 bits (69), Expect = 6.7
 Identities = 34/169 (20%), Positives = 50/169 (29%), Gaps = 15/169 (8%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEEN------SEEEEEEEAVPRRRPVRRANVAKVK 1298
            +  E EDE E EE E  N   EE E            EE+ E        + R +  +  
Sbjct: 463  EEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQP 522

Query: 1299 RA---VPPAPDTSTSRLRSV--RQNGRAKRTNGRQSSSLL---MTCSGDSDSEDNSQEEE 1350
            R     P +P     +  S+     G           S L       G +   +      
Sbjct: 523  RGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVETKISSS 582

Query: 1351 QYGSRRARRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRRGRMRKP 1399
            +        T  +  S A  +S T   +    +   +   S R R RK 
Sbjct: 583  RKLPPPPVSTSLENDS-ATVTSTTRNGNVSPHTPQDEQPPSGRKRKRKE 630


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family are
            designated YL1. These proteins have been shown to be
            DNA-binding and may be a transcription factor.
          Length = 238

 Score = 40.1 bits (94), Expect = 0.005
 Identities = 31/155 (20%), Positives = 53/155 (34%), Gaps = 16/155 (10%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVP--PAP 1305
            E E+E E EE +   D SE++E E + EEE E+E     R  ++  V       P     
Sbjct: 44   EIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKK 103

Query: 1306 DTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQ 1365
                +  +S +      +    + S                        R++ R+ T   
Sbjct: 104  KKDPTAAKSPKAAAPRPKKKSERISWA--------------PTLLDSPRRKSSRSSTVQN 149

Query: 1366 SYAVHSSETETESEEEESQAAQVSFSRRGRMRKPT 1400
              A H    E E   ++ QA       + + ++ T
Sbjct: 150  KEATHERLKEREIRRKKIQAKARKRKEKKKEKELT 184


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus spores
            are protected by a protein shell consisting of over 50
            different polypeptides, known as the coat. This family of
            proteins has an important morphogenetic role in coat
            assembly, it is involved in the assembly of at least 5
            different coat proteins including CotB, CotG, CotS, CotSA
            and CotW. It is likely to act at a late stage of coat
            assembly.
          Length = 185

 Score = 38.3 bits (89), Expect = 0.010
 Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVA 1295
               PE ED  E +E+EE   + E+EE  E+  +++E  +  R++P +  N+ 
Sbjct: 70   AGEPEREDIAEQQEKEEIAQEEEKEEEAED-VKQQEVFSFKRKKPFKEMNLE 120


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
            family consists of several Plasmodium falciparum SPAM
            (secreted polymorphic antigen associated with merozoites)
            proteins. Variation among SPAM alleles is the result of
            deletions and amino acid substitutions in non-repetitive
            sequences within and flanking the alanine heptad-repeat
            domain. Heptad repeats in which the a and d position
            contain hydrophobic residues generate amphipathic
            alpha-helices which give rise to helical bundles or
            coiled-coil structures in proteins. SPAM is an example of
            a P. falciparum antigen in which a repetitive sequence
            has features characteristic of a well-defined structural
            element.
          Length = 164

 Score = 37.5 bits (87), Expect = 0.018
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E E+E + EE EE  D  +EEE+ E+ EEEEE+E
Sbjct: 52   EEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDE 85



 Score = 36.0 bits (83), Expect = 0.046
 Identities = 16/35 (45%), Positives = 23/35 (65%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
             + +DE E EE+EE  ++ E+ E EE   E+EEEE
Sbjct: 47   KQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEE 81



 Score = 35.2 bits (81), Expect = 0.084
 Identities = 16/37 (43%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 1248 ESEDEV-EGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E+ED   E +E++E  ++ +EEE+EE  + E+EEE V
Sbjct: 39   ENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIV 75



 Score = 32.5 bits (74), Expect = 0.71
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E E+E+E  E+ E  ++  E+E EE  +EE+  +
Sbjct: 57   EDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVD 90


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
            unknown].
          Length = 769

 Score = 38.9 bits (90), Expect = 0.022
 Identities = 48/226 (21%), Positives = 70/226 (30%), Gaps = 25/226 (11%)

Query: 1138 SDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDS 1197
            S+    +G   Q+  E   D         + +     NR   +L     E     SDYD 
Sbjct: 294  SNKIIVNGQYEQTIREIFADRATKLELDLQTVFESNMNR--DTLDEYAPEGEDLRSDYDE 351

Query: 1198 GDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSY------------NKFSGRGRGRK 1245
               Y               GR   +   +  +G   Y             + +       
Sbjct: 352  DFEYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMS 411

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVP-PA 1304
            A + +D  E + EE + D+    +  E  EE   EE   + R  R     K  R  P  A
Sbjct: 412  AIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDME--KEDREFPDEA 469

Query: 1305 PDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEE 1350
                +          R K   G ++   L TCS  SD +D S  EE
Sbjct: 470  ELQPSESAIE-----RYKEYRGLRN---LYTCSWRSDEKDPSFPEE 507


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 38.6 bits (90), Expect = 0.023
 Identities = 16/44 (36%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVR 1290
            P+ E+E E EE++E  +    ++ EE  EEEE+EE   + + V+
Sbjct: 36   PDEEEEEEKEEKKEEEE-KTTDKEEEVDEEEEKEEKKKKTKKVK 78



 Score = 33.2 bits (76), Expect = 1.2
 Identities = 17/51 (33%), Positives = 28/51 (54%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVK 1298
            E E E E  +EEE  ++ E++E EE + ++EEE      +  ++    KVK
Sbjct: 28   EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVK 78


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 38.5 bits (89), Expect = 0.025
 Identities = 19/78 (24%), Positives = 38/78 (48%)

Query: 445 ETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFD 504
           E   + K++ + W++  +  + + NF   +++WD     LV  +K H   V+ ++    D
Sbjct: 528 ELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSAD 587

Query: 505 SRVLLSAGHDGLIIIWDI 522
             +L S   DG + +W I
Sbjct: 588 PTLLASGSDDGSVKLWSI 605



 Score = 36.2 bits (83), Expect = 0.16
 Identities = 59/253 (23%), Positives = 105/253 (41%), Gaps = 38/253 (15%)

Query: 180 LSRFLMISSHKY-SNLRQHYVNLGHLSAVFCVL-FDKLGEVIVTGADDLLIKLWRA---- 233
           L ++L  S  +  ++L+Q   +L + S + C + FD+ GE   T   +  IK++      
Sbjct: 459 LCKYLSFSKLRVKADLKQG--DLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESII 516

Query: 234 RDGK-------LLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVG 286
           +DG+        LA+    SG   +  I ++   +A+   +  +++W++     ++ +  
Sbjct: 517 KDGRDIHYPVVELASRSKLSGICWNSYIKSQ---VASSNFEGVVQVWDVARSQLVTEMKE 573

Query: 287 HQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGN 346
           H+  +  +++   +      LAS S DG +  W     + I                   
Sbjct: 574 HEKRVWSIDYSSADPT---LLASGSDDGSVKLWSINQGVSIGTI-------------KTK 617

Query: 347 AHILCSSF-SPGGLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLR 405
           A+I C  F S  G  LA GS DH V  Y +     PL  +    HS  V  +++  S   
Sbjct: 618 ANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTM--IGHSKTVSYVRFVDSS-T 674

Query: 406 FVSGSRDGTALIW 418
            VS S D T  +W
Sbjct: 675 LVSSSTDNTLKLW 687


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 38.7 bits (90), Expect = 0.026
 Identities = 32/175 (18%), Positives = 55/175 (31%), Gaps = 38/175 (21%)

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVP 1302
             R A   +         +S+D  ++  VE +  E        +++  R+   A  K A P
Sbjct: 1295 RRAAARKKPLASVSVISDSDDDDDDFAVEVSLAER------LKKKGGRKPAAANKKAAKP 1348

Query: 1303 PA-------------PDTSTSRLRSVRQNG----------RAKRTNGRQSSSLLMTCSGD 1339
            PA                 T  L+     G          RA   N +  S L    +  
Sbjct: 1349 PAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNK 1408

Query: 1340 ----SDSEDNSQEEEQYG-----SRRARRTRTKGQSYAVHSSETETESEEEESQA 1385
                S++   S   E+         R +R   K  +Y +  SE+E+  + +    
Sbjct: 1409 ETESSENVSGSSSSEKDEIDVSAKPRPQRANRKQTTYVLSDSESESADDSDFDDD 1463


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
            (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
            phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 38.7 bits (90), Expect = 0.026
 Identities = 16/34 (47%), Positives = 20/34 (58%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
             +ED   G EE E +D+SEE+E E    E EE E
Sbjct: 243  NNEDSEAGREESEGSDESEEDEAEATDGEGEEGE 276



 Score = 32.1 bits (73), Expect = 2.3
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E  DE +G++++  N++  E   EE+   +E EE  
Sbjct: 229  EDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDE 264



 Score = 31.7 bits (72), Expect = 3.2
 Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVE--ENSEEEEEEE 1281
            GR+  E  DE E +E E ++ + EE E++  E SE+ E +E
Sbjct: 250  GREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDE 290


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.  Prothymosin
            alpha and parathymosin are two ubiquitous small acidic
            nuclear proteins that are thought to be involved in cell
            cycle progression, proliferation, and cell
            differentiation.
          Length = 106

 Score = 35.7 bits (82), Expect = 0.027
 Identities = 19/36 (52%), Positives = 24/36 (66%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            A E +DE+E EEE + +D+ EE E EE   EEEEE 
Sbjct: 44   AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEET 79



 Score = 32.2 bits (73), Expect = 0.40
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            G+   A  +E+E  G +E +   + EEE  E++ EEE E E 
Sbjct: 29   GKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEE 70



 Score = 32.2 bits (73), Expect = 0.49
 Identities = 13/40 (32%), Positives = 25/40 (62%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRR 1287
            + E+E EGEEEE   ++  E    + + E+EE++A  +++
Sbjct: 61   DEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQ 100



 Score = 31.5 bits (71), Expect = 0.84
 Identities = 14/34 (41%), Positives = 17/34 (50%)

Query: 1249 SEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
               E + E EEE     ++EE E   EEEE EE 
Sbjct: 43   GAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEE 76


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
            domain is functionally uncharacterized. This domain is
            found in eukaryotes. This presumed domain is typically
            between 156 to 174 amino acids in length. This domain is
            found associated with pfam07780, pfam01728.
          Length = 154

 Score = 36.5 bits (85), Expect = 0.027
 Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 8/61 (13%)

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRA 1300
            G  +K  E E+E E E EE   ++  +E +E+   + + E+        RR N  K K  
Sbjct: 95   GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREK--------RRENERKQKEI 146

Query: 1301 V 1301
            +
Sbjct: 147  L 147



 Score = 29.2 bits (66), Expect = 9.6
 Identities = 13/46 (28%), Positives = 24/46 (52%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVR 1290
            +  E E EVE  +EEE  D+  E+E+ +   E+  E    ++  ++
Sbjct: 103  EEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
            family is conserved in dsDNA adenoviruses of vertebrates.
            The function is not known.
          Length = 172

 Score = 36.8 bits (85), Expect = 0.033
 Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 7/79 (8%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPD 1306
               EDE E E+ E+S D       EE+ E EE EE          ++     +     P 
Sbjct: 30   EVEEDEEEMEDWEDSLD-------EEDEEAEEVEEETAASSKAPSSSSKSSSQETISIPP 82

Query: 1307 TSTSRLRSVRQNGRAKRTN 1325
            T  +R  S R +   +  N
Sbjct: 83   TPPARRPSRRWDQTGRFPN 101


>gnl|CDD|99958 cd05529, Bromo_WDR9_I_like, Bromodomain; WDR9 repeat I_like
            subfamily. WDR9 is a human gene located in the Down
            Syndrome critical region-2 of chromosome 21. It encodes
            for a nuclear protein containing WD40 repeats and two
            bromodomains, which may function as a transcriptional
            regulator involved in chromatin remodeling and play a
            role in embryonic development. Bromodomains are 110 amino
            acid long domains, that are found in many chromatin
            associated proteins. Bromodomains can interact
            specifically with acetylated lysine.
          Length = 128

 Score = 35.8 bits (83), Expect = 0.034
 Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 11/54 (20%)

Query: 1086 LYHPRPDEWPQGD--------RDATCEKIITGLEEVMG---LSVAEPFLVPVDI 1128
            LY+P   EW   D        RD   E++I+GL++++    L +AE F  PVD+
Sbjct: 1    LYNPLSSEWELFDPGWEQPHIRDEERERLISGLDKLLLSLQLEIAEYFEYPVDL 54


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
            is found in the primary vegetative sigma factor. The
            function of this domain is unclear and can be removed
            without loss of function.
          Length = 211

 Score = 37.2 bits (87), Expect = 0.038
 Identities = 10/37 (27%), Positives = 21/37 (56%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            A  +      E E +  D  ++++ +E+ +E++EEEA
Sbjct: 35   AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71



 Score = 34.5 bits (80), Expect = 0.25
 Identities = 8/39 (20%), Positives = 18/39 (46%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVP 1284
            A  +       E E   +  E+++ ++  E+E++EE   
Sbjct: 34   AAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEAD 72



 Score = 32.2 bits (74), Expect = 1.3
 Identities = 8/36 (22%), Positives = 17/36 (47%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            P +          ES    E+ E +++ +E+E+E+ 
Sbjct: 32   PNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDD 67



 Score = 31.0 bits (71), Expect = 3.7
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVP 1284
            A  +  E E +EE+  +D  ++E+ +E+ EEE +    P
Sbjct: 39   ATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDP 77



 Score = 30.6 bits (70), Expect = 4.2
 Identities = 10/39 (25%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 1246 APESEDEVEGEE--EEESNDQSEEEEVEENSEEEEEEEA 1282
            A  +       E  EE+  D  +++E E+  +EEE +  
Sbjct: 36   AAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLG 74


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain fusion
            protein; Provisional.
          Length = 131

 Score = 35.7 bits (82), Expect = 0.046
 Identities = 12/46 (26%), Positives = 22/46 (47%)

Query: 1237 FSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            ++  GR  K P +      E  EE+     +E+ +E +E +  +EA
Sbjct: 52   WTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEA 97



 Score = 33.0 bits (75), Expect = 0.35
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
            A E+++E E +E  +      + E EE  EEE+EE
Sbjct: 94   ADEADEEEETDEAVDETADEADAEAEEADEEEDEE 128



 Score = 30.7 bits (69), Expect = 2.1
 Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 5/42 (11%)

Query: 1246 APESEDEVEGEEEEESN-----DQSEEEEVEENSEEEEEEEA 1282
            A E++   E +EEEE++        E +   E ++EEE+EEA
Sbjct: 88   AAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEA 129


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 36.7 bits (85), Expect = 0.049
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
               E+++E EE E+     + EE  E   EE ++E
Sbjct: 28   KGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62



 Score = 34.7 bits (80), Expect = 0.18
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            E +   E E +EE   + E+ E EE  +EE  E++
Sbjct: 14   EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDS 48



 Score = 32.0 bits (73), Expect = 1.6
 Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
              E++   EE+ E  +  +EE +E++ E  E + 
Sbjct: 23   NKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKI 56



 Score = 31.6 bits (72), Expect = 2.3
 Identities = 11/36 (30%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 1248 ESEDEVEGEEEE-ESNDQSEEEEVEENSEEEEEEEA 1282
            +   E E +EE+    +  E EE+E+    E+ EE+
Sbjct: 16   DCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEES 51



 Score = 31.3 bits (71), Expect = 2.9
 Identities = 11/34 (32%), Positives = 16/34 (47%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E +   E + E E  ++ E  E  E S E + EE
Sbjct: 25   EEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58



 Score = 30.5 bits (69), Expect = 5.0
 Identities = 7/35 (20%), Positives = 16/35 (45%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            +       EE+    ++++EE+  +  + E EE  
Sbjct: 6    KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIE 40



 Score = 30.1 bits (68), Expect = 6.7
 Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 7/41 (17%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEV-------EENSEEEEEEE 1281
            E + E E  E+EE  + SEE          +EN++ +EE +
Sbjct: 31   EEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENK 71



 Score = 29.7 bits (67), Expect = 8.2
 Identities = 13/34 (38%), Positives = 25/34 (73%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E+++E +G+EE+   ++ E+EE+ E+SEE  E +
Sbjct: 22   ENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVK 55


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
            related to Mpp10 (M phase phosphoprotein 10). The U3
            small nucleolar ribonucleoprotein (snoRNP) is required
            for three cleavage events that generate the mature 18S
            rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
            depletion of Mpp10, a U3 snoRNP-specific protein, halts
            18S rRNA production and impairs cleavage at the three U3
            snoRNP-dependent sites.
          Length = 613

 Score = 37.7 bits (87), Expect = 0.053
 Identities = 42/288 (14%), Positives = 95/288 (32%), Gaps = 26/288 (9%)

Query: 1125 PVDISSLSEEN---SHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRM---PPRRTNRVG 1178
               I  L++     SH D S  S   S+ S +D E+E+     +   +            
Sbjct: 85   KEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNES 144

Query: 1179 TSLGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFS 1238
             S      ++ ++  +    +       +    Q+ +  +  ++ V  +       N+F 
Sbjct: 145  ESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFL 204

Query: 1239 GRGRGRK--APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAK 1296
                  +  A   ED+ E   +++S D  ++E+     +EE++EE               
Sbjct: 205  EATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEE--------GNIEYED 256

Query: 1297 VKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRR 1356
                              +  +   K    +++ S       + D +++ Q+EE+     
Sbjct: 257  FFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAA 316

Query: 1357 ARRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRR-GRMRKPTAKL 1403
              + +           E   E  + ES     SF +R  ++++   +L
Sbjct: 317  MDKVKL---------DEPVLEGVDLESPKELSSFEKRQAKLKQQIEQL 355


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
            known as chromatin decondensation proteins. They bind to
            core histones and transfer DNA to them in a reaction that
            requires ATP. This is thought to play a role in the
            assembly of regular nucleosomal arrays.
          Length = 146

 Score = 35.8 bits (83), Expect = 0.054
 Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
            EDE + +EE+E     EE++ E++ E+E EEE  P ++  
Sbjct: 111  EDESDDDEEDEE----EEDDEEDDDEDESEEEESPVKKVK 146



 Score = 32.3 bits (74), Expect = 0.80
 Identities = 14/38 (36%), Positives = 26/38 (68%)

Query: 1255 GEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRA 1292
              EE+ES+D  E+EE E++ E+++E+E+     PV++ 
Sbjct: 108  ASEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
            proteins whose interaction is required for the maturation
            of the 18S rRNA and for 40S ribosome production.
          Length = 809

 Score = 37.3 bits (87), Expect = 0.061
 Identities = 13/34 (38%), Positives = 25/34 (73%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            + +D  E EE+ + +D+ E+EE E++ +E++EEE
Sbjct: 338  DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371



 Score = 36.5 bits (85), Expect = 0.11
 Identities = 13/35 (37%), Positives = 25/35 (71%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            + +D++E EEE+      EE+E +E+S++E++EE 
Sbjct: 337  DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371



 Score = 35.0 bits (81), Expect = 0.36
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
             E ED V+ E+EE+ +D  EEEE + +  +EEE+E 
Sbjct: 324  EEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEE 359



 Score = 34.6 bits (80), Expect = 0.48
 Identities = 13/38 (34%), Positives = 26/38 (68%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            +  E + ++  EEE+E ++ S++E+ EE  EEE+E++ 
Sbjct: 343  EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKK 380



 Score = 34.2 bits (79), Expect = 0.66
 Identities = 16/45 (35%), Positives = 26/45 (57%)

Query: 1239 GRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            G G+G +  E E++   +E+EE +D   EEE E+    +EEE+  
Sbjct: 315  GLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEE 359



 Score = 33.0 bits (76), Expect = 1.4
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E +D+ + EEEEE  D S+EEE EE+ + ++E++  
Sbjct: 335  EEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEE 370



 Score = 31.1 bits (71), Expect = 5.8
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            + ED+ +  EEEE +    +EE +E  E+ ++E+ 
Sbjct: 334  DEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDD 368


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 37.7 bits (88), Expect = 0.062
 Identities = 33/218 (15%), Positives = 75/218 (34%), Gaps = 2/218 (0%)

Query: 1062 PKEGKEATEPGGSLPVEPEEIQELLYHPRPDEWPQGDRDATCEKIITGLEEVMGLSVAEP 1121
            PK  K+  +   S   + ++   +    R D   +   D   +   +           E 
Sbjct: 1172 PKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQ 1231

Query: 1122 FLVPVDISSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRR-TNRVGTS 1180
               P   S    ++  ++SS  S  + + S +D   E   ++   R    +         
Sbjct: 1232 KTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKR 1291

Query: 1181 LGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGR 1240
               +        S      +  +    +   +  ++ ++      S+TR KQ+    S R
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSR 1351

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEE 1278
               R   +  D    ++++   D SE+E+ +E+ E+++
Sbjct: 1352 LLRRPRKKKSDSSSEDDDDSEVDDSEDED-DEDDEDDD 1388


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 37.2 bits (86), Expect = 0.070
 Identities = 15/35 (42%), Positives = 23/35 (65%)

Query: 488 LKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDI 522
           L+GHT +V ++  HP    VL SAG D ++ +WD+
Sbjct: 121 LQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDV 155


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
            Rpc31.  RNA polymerase III contains seventeen subunits in
            yeasts and in human cells. Twelve of these are akin to
            RNA polymerase I or II and the other five are RNA pol
            III-specific, and form the functionally distinct groups
            (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34
            and Rpc82 form a cluster of enzyme-specific subunits that
            contribute to transcription initiation in S.cerevisiae
            and H.sapiens. There is evidence that these subunits are
            anchored at or near the N-terminal Zn-fold of Rpc1,
            itself prolonged by a highly conserved but RNA polymerase
            III-specific domain.
          Length = 221

 Score = 36.3 bits (84), Expect = 0.078
 Identities = 12/38 (31%), Positives = 26/38 (68%)

Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
             K  E+ED  E +E++E  ++ EEEE E+  +++++++
Sbjct: 160  LKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 33.2 bits (76), Expect = 0.72
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            K  E E E   EE+E+  ++ EEEE E+   ++++++
Sbjct: 159  KLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD 195



 Score = 31.3 bits (71), Expect = 3.2
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            +    E E E  +EE+  D+ EEEE EE  E+ ++++
Sbjct: 157  EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193



 Score = 30.1 bits (68), Expect = 6.6
 Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 9/62 (14%)

Query: 1228 TRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEE--------SNDQSEEEEVEENSEEEEE 1279
               K + +KF  +  G    E ED  E     E         +   E+E+ EE  EEEEE
Sbjct: 126  AGKKLALSKFKRKV-GLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEE 184

Query: 1280 EE 1281
            E+
Sbjct: 185  ED 186


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
            [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 34.3 bits (79), Expect = 0.082
 Identities = 13/36 (36%), Positives = 20/36 (55%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            A  +     G E     D++EEEE EE +EEE +++
Sbjct: 67   AGAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
            function of this eukaryotic protein family is unknown.
            The yeast orthologues have been implicated in cell cycle
            progression and biogenesis of 60S ribosomal subunits. The
            Schistosoma mansoni Mak16 has been shown to target
            protein transport to the nucleolus.
          Length = 97

 Score = 34.1 bits (78), Expect = 0.086
 Identities = 17/36 (47%), Positives = 20/36 (55%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            A ESE+  E EEEEE  +   E E   + EE EEE 
Sbjct: 57   AEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEI 92



 Score = 32.9 bits (75), Expect = 0.20
 Identities = 14/39 (35%), Positives = 20/39 (51%)

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
              +  E  DE E EEEEE ++   E   ++   EEE E+
Sbjct: 56   EAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIED 94



 Score = 31.8 bits (72), Expect = 0.60
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
              E+E E EEEE+  +     + EE  EE E+ E
Sbjct: 63   NDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96



 Score = 29.1 bits (65), Expect = 5.0
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 1257 EEEESNDQSEEEEVEENSEEEEEEEA 1282
            EE E ND+ EEEE EE  E E E  +
Sbjct: 58   EESEENDEEEEEEEEEEDEGEIEYVS 83



 Score = 28.7 bits (64), Expect = 6.1
 Identities = 15/37 (40%), Positives = 23/37 (62%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            KA E+E+  E +EEEE  ++ E+E   E   ++EE E
Sbjct: 53   KALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELE 89



 Score = 28.3 bits (63), Expect = 8.5
 Identities = 15/34 (44%), Positives = 22/34 (64%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E E   EEEEE  ++ +E E+E  S++EE EE +
Sbjct: 59   ESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEI 92


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
            NOA36 proteins which contain 29 highly conserved cysteine
            residues. The function of this protein is unknown.
          Length = 314

 Score = 36.5 bits (84), Expect = 0.090
 Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 4/86 (4%)

Query: 1201 YAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDE----VEGE 1256
             +M  R  K  +  +  R        R +   S NK+          ES+D+        
Sbjct: 225  LSMSTRSYKFGRQTRGFRSDDDQGYGRYKQNLSSNKYGDFKGVNYGYESDDDEGSSSNDY 284

Query: 1257 EEEESNDQSEEEEVEENSEEEEEEEA 1282
            +EEE  D  + E+ ++ +    +E+ 
Sbjct: 285  DEEEDGDDDDNEDNDDTNTNHGDEDC 310


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts harbouring
            premature signals for translation termination are
            recognised and rapidly degraded by eukaryotic cells
            through a pathway known as nonsense-mediated mRNA decay.
            In Saccharomyces cerevisiae, three trans-acting factors
            (Upf1 to Upf3) are required for nonsense-mediated mRNA
            decay.
          Length = 171

 Score = 35.4 bits (82), Expect = 0.090
 Identities = 15/50 (30%), Positives = 27/50 (54%), Gaps = 6/50 (12%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEE------EEEEAVPRRRPVRR 1291
            + E++ E  EE+E ++ S+EEEV+   +E+      EEE+    R+    
Sbjct: 9    DGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEV 58


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 36.6 bits (84), Expect = 0.098
 Identities = 31/153 (20%), Positives = 55/153 (35%), Gaps = 8/153 (5%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRR---RPVRRANVAKVKRAVPPA 1304
            ES +E E  EE E   +SE++    ++EE ++EE  P        +R ++ +        
Sbjct: 118  ESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTH 177

Query: 1305 PDTSTSRLRSVRQNGRAKRTNGR-QSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTK 1363
                T    S      A+   G+              + E+  ++ E+         + +
Sbjct: 178  KLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRR 237

Query: 1364 GQSYAVHSSETETE----SEEEESQAAQVSFSR 1392
             Q  A   S  E E     EE E + A+ +  R
Sbjct: 238  KQEEADRKSREEEEKRRLKEEIERRRAEAAEKR 270


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
            regulatory subunit R2) is the regulatory subunit of the
            histone H2A phosphatase complex. It has been shown to
            confer resistance to the anticancer drug cisplatin in
            yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 36.0 bits (83), Expect = 0.10
 Identities = 15/36 (41%), Positives = 22/36 (61%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E +D+ +G+  EE   + +EEE E   EEEEE+E  
Sbjct: 250  EEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
            mRNA processing and transport [Signal transduction
            mechanisms / RNA processing and modification].
          Length = 388

 Score = 36.4 bits (84), Expect = 0.11
 Identities = 15/64 (23%), Positives = 20/64 (31%)

Query: 1218 RRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEE 1277
            R G  I      G    + F       K        E  +EE  ++   +E V E   E 
Sbjct: 310  RNGNRIKELADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEM 369

Query: 1278 EEEE 1281
            E  E
Sbjct: 370  ELLE 373


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone sialoprotein
            (BSP) is a major structural protein of the bone matrix
            that is specifically expressed by fully-differentiated
            osteoblasts. The expression of bone sialoprotein (BSP) is
            normally restricted to mineralised connective tissues of
            bones and teeth where it has been associated with mineral
            crystal formation. However, it has been found that
            ectopic expression of BSP occurs in various lesions,
            including oral and extraoral carcinomas, in which it has
            been associated with the formation of microcrystalline
            deposits and the metastasis of cancer cells to bone.
          Length = 291

 Score = 35.8 bits (82), Expect = 0.12
 Identities = 36/149 (24%), Positives = 56/149 (37%), Gaps = 15/149 (10%)

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVP 1302
            G  + E   + +  EEE   + S EEE  E+S+  E+EEA      +    +     A P
Sbjct: 47   GSDSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAENTTLSTVTLGYGGDATP 106

Query: 1303 PAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRT 1362
               +   + L+  ++ G A +   ++  S       D D E+  +EEE+          T
Sbjct: 107  GTGNIGLAALQLPKKAGNAGKKATKEDES-------DEDEEEEEEEEEEEAEVEENEQGT 159

Query: 1363 KGQSYAV--------HSSETETESEEEES 1383
             G S            S     E  EEES
Sbjct: 160  NGTSTNSTEVDHGNGSSGGDNGEEGEEES 188



 Score = 34.3 bits (78), Expect = 0.42
 Identities = 35/138 (25%), Positives = 51/138 (36%), Gaps = 28/138 (20%)

Query: 1136 SHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDY 1195
            S SDSS E+G    S EE  E+                     TS   +  E+     D 
Sbjct: 46   SGSDSSEENGDGDSSEEEGEEE---------------------TSNEEENNEDSDGNEDE 84

Query: 1196 DSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEG 1255
            ++         ++  + +V  G  G     +   G  +       G   K    EDE + 
Sbjct: 85   EA-------EAENTTLSTVTLGYGGDATPGTGNIGLAALQLPKKAGNAGKKATKEDESDE 137

Query: 1256 EEEEESNDQSEEEEVEEN 1273
            +EEEE  ++ EE EVEEN
Sbjct: 138  DEEEEEEEEEEEAEVEEN 155



 Score = 30.8 bits (69), Expect = 5.4
 Identities = 53/255 (20%), Positives = 80/255 (31%), Gaps = 59/255 (23%)

Query: 1129 SSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYREN 1188
            S  SEEN   DSS E G    S+EE   +ED   +               T+L       
Sbjct: 48   SDSSEENGDGDSSEEEGEEETSNEE-ENNEDSDGNED------EEAEAENTTL------- 93

Query: 1189 PSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPE 1248
             S V+    GD  A     +  + ++Q  ++        T+  +S              E
Sbjct: 94   -STVTLGYGGD--ATPGTGNIGLAALQLPKKAGNAGKKATKEDES---------DEDEEE 141

Query: 1249 SEDEVEGEEEEESNDQS---------------------EEEEVEENSEEEEEEEAVPRRR 1287
             E+E E E E E N+Q                        EE EE S  E E E      
Sbjct: 142  EEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVTEAEAEGTTVAG 201

Query: 1288 PVRRANVAKVKRAVPPAPDTST-----SRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDS 1342
            P   +     +   PP     T      ++ +    G  ++T   +         G  + 
Sbjct: 202  PTTTSPNGGFQPTTPPQEVYGTTDPPFGKVTTPEYQGEYEQTGANEY-------DGGYEI 254

Query: 1343 EDNSQEEEQYGSRRA 1357
             ++   E +  S RA
Sbjct: 255  YESENGEPRGDSYRA 269


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 36.2 bits (84), Expect = 0.13
 Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 16/88 (18%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPP-- 1303
             PE E        E+ +D  E+EE ++++E+   EE V          +  V+ A+PP  
Sbjct: 264  PPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEIV----------LEAVRAALPPDL 313

Query: 1304 ----APDTSTSRLRSVRQNGRAKRTNGR 1327
                A   + +R RS    G A++ N R
Sbjct: 314  LARLAAGGARARARSAGGAGAAQKGNRR 341


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
            required for initiation of DNA replication in S.
            cerevisiae, forming a complex with MCM5/CDC46. Homologues
            of CDC45 have been identified in human, mouse and smut
            fungus among others.
          Length = 583

 Score = 36.1 bits (84), Expect = 0.13
 Identities = 11/44 (25%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
            + +D+ E +EE+E + +SE++E +++ +++++      R   RR
Sbjct: 126  DDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIAT-RERSLERR 168



 Score = 33.4 bits (77), Expect = 1.0
 Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 11/92 (11%)

Query: 1104 EKIITGLEEVMGLSVAEPFLVPVDISSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRH 1163
              I   L++      A   L   D     +++  SD   E  S S+  E+D +D+D    
Sbjct: 104  GDIEEELQDEPRYDDAYRDLEEDD-----DDDEESDEEDEESSKSEDDEDDDDDDDDDDI 158

Query: 1164 GKARRMPPRRTNRVGTSLGRQYRENPSDVSDY 1195
                R   RR  R      R++ E  +++  Y
Sbjct: 159  ATRERSLERRRRR------REWEEKRAELEFY 184


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 35.0 bits (81), Expect = 0.14
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E  ++VE E EE   +++ E+EVEE + EEE E  +
Sbjct: 3    EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAEL 38



 Score = 33.4 bits (77), Expect = 0.43
 Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 2/44 (4%)

Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPR 1285
            +  +  E  +E E EE  E  D+ EEE VEE SE E  +E   +
Sbjct: 4    KNEQVEEEVEETEVEEAVE--DEVEEETVEEESEAELLDEEQAK 45



 Score = 32.3 bits (74), Expect = 1.1
 Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 3/40 (7%)

Query: 1248 ESEDEVEGEEEEESNDQSE---EEEVEENSEEEEEEEAVP 1284
              E   + EEE E  +  E   +E  EE  EEE E E + 
Sbjct: 1    MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLD 40


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
            includes the B. subtilis YqfQ protein, also known as
            VrrA, which is functionally uncharacterized. This family
            of proteins is found in bacteria. Proteins in this family
            are typically between 146 and 237 amino acids in length.
            There are two conserved sequence motifs: QYGP and PKLY.
          Length = 155

 Score = 34.3 bits (79), Expect = 0.15
 Identities = 16/60 (26%), Positives = 28/60 (46%), Gaps = 1/60 (1%)

Query: 1254 EGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDT-STSRL 1312
              +EEEE+ ++S +E  +E+  E + E    ++R V +    K K    P     S  +L
Sbjct: 94   SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKL 153


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 36.3 bits (84), Expect = 0.16
 Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 7/60 (11%)

Query: 1238 SGRGRGRKAPESE------DEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
            +G G        E       E + E+    +D  ++ +      EEEE+    RRR +R 
Sbjct: 14   NGSGGLVGTDSGESRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGN-MRRRLIRT 72


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
            family represents the C-terminus (approximately 300
            residues) of proteins that are involved as binding
            partners for Prp19 as part of the nuclear pore complex.
            The family in Drosophila is necessary for pre-mRNA
            splicing, and the human protein has been found in
            purifications of the spliceosome. In the past this family
            was thought, erroneously, to be associated with
            microfibrillin.
          Length = 277

 Score = 35.3 bits (81), Expect = 0.16
 Identities = 18/40 (45%), Positives = 25/40 (62%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
            E EV   EEE+ + + EEEE EE  E + E++  PR +PV
Sbjct: 1    ETEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPV 40


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
            [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 35.4 bits (81), Expect = 0.17
 Identities = 16/39 (41%), Positives = 19/39 (48%)

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
              +  E  D  E EE+EE    S  E   E +EEEEEE 
Sbjct: 178  EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEA 216



 Score = 31.1 bits (70), Expect = 3.5
 Identities = 18/53 (33%), Positives = 22/53 (41%), Gaps = 2/53 (3%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEE--NSEEEEEEEAVPRRRPVRRANVAKVKRA 1300
            ED V+ E E     Q EEEE+EE   +  E E E       V R N     + 
Sbjct: 223  EDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKK 275



 Score = 30.3 bits (68), Expect = 6.0
 Identities = 15/43 (34%), Positives = 21/43 (48%)

Query: 1239 GRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            G G      E E+E EG ++ E     E E +++   EEEE E
Sbjct: 202  GEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEME 244



 Score = 30.3 bits (68), Expect = 6.7
 Identities = 15/44 (34%), Positives = 19/44 (43%), Gaps = 2/44 (4%)

Query: 1241 GRGRKAPESEDEVEG--EEEEESNDQSEEEEVEENSEEEEEEEA 1282
                +  E  +E +G  EEE+E        E      EEEEEEA
Sbjct: 173  DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEA 216


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
            N-terminal.  This is the very acidic N-terminal region of
            the early transcription elongation factor Spt5. The
            Spt5-Spt4 complex regulates early transcription
            elongation by RNA polymerase II and has an imputed role
            in pre-mRNA processing via its physical association with
            mRNA capping enzymes. The actual function of this
            N-terminal domain is not known although it is dispensable
            for binding to Spt4.
          Length = 92

 Score = 32.8 bits (75), Expect = 0.18
 Identities = 11/42 (26%), Positives = 22/42 (52%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
            + E E ++EEE  ++ E++  + + E+E  +EA        R
Sbjct: 3    DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHR 44



 Score = 29.7 bits (67), Expect = 2.9
 Identities = 18/44 (40%), Positives = 24/44 (54%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
            E +DE E EEEEE + +   +E E   E E E++   RR   RR
Sbjct: 7    EVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRR 50


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
            protein is covalently attached to the terminii of
            replicating DNA in vivo.
          Length = 548

 Score = 35.8 bits (83), Expect = 0.19
 Identities = 22/42 (52%), Positives = 23/42 (54%)

Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            R R R+ P S  E E EEEEE     EEEE EE  E   EEE
Sbjct: 292  RRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 34.2 bits (79), Expect = 0.49
 Identities = 25/61 (40%), Positives = 28/61 (45%), Gaps = 7/61 (11%)

Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKR 1299
            R R R+ P        EEEEE  +  EEEE EE  EE   EE V       RA VA+  R
Sbjct: 291  RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEV-------RATVAEAIR 343

Query: 1300 A 1300
             
Sbjct: 344  L 344


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
            [Transcription].
          Length = 392

 Score = 35.4 bits (81), Expect = 0.19
 Identities = 13/47 (27%), Positives = 21/47 (44%), Gaps = 2/47 (4%)

Query: 1249 SEDEVEGEEEEESNDQS--EEEEVEENSEEEEEEEAVPRRRPVRRAN 1293
            SE + E ++EE  N ++  EE + +   E  EE+E         R  
Sbjct: 290  SEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHT 336



 Score = 30.1 bits (67), Expect = 8.8
 Identities = 11/35 (31%), Positives = 13/35 (37%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
                 E   EE +        EE EE+ E EE E 
Sbjct: 300  EVENAEAHKEEVQSDRPDEIGEEKEEDDENEENER 334


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 33.0 bits (76), Expect = 0.22
 Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 5/37 (13%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            A  +       EE++     EEEE EE  EE EEE A
Sbjct: 67   AAAAAAAAAAAEEKK-----EEEEEEEEKEESEEEAA 98



 Score = 30.7 bits (70), Expect = 1.2
 Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 5/35 (14%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
            A  +    E ++EEE     EEEE +E SEEE   
Sbjct: 70   AAAAAAAAEEKKEEE-----EEEEEKEESEEEAAA 99


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
            subunit.  The alpha-subunit of the TRAP complex (TRAP
            alpha) is a single-spanning membrane protein of the
            endoplasmic reticulum (ER) which is found in proximity of
            nascent polypeptide chains translocating across the
            membrane.
          Length = 281

 Score = 35.1 bits (81), Expect = 0.23
 Identities = 14/43 (32%), Positives = 24/43 (55%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
              + E E +  +E+E ++   EE+  E +EEEE+EE   +  P
Sbjct: 35   TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKASP 77


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation domain. 
            The centromere protein B (CENP-B) dimerisation domain is
            composed of two alpha-helices, which are folded into an
            antiparallel configuration. Dimerisation of CENP-B is
            mediated by this domain, in which monomers dimerise to
            form a symmetrical, antiparallel, four-helix bundle
            structure with a large hydrophobic patch in which 23
            residues of one monomer form van der Waals contacts with
            the other monomer. This CENP-B dimer configuration may be
            suitable for capturing two distant CENP-B boxes during
            centromeric heterochromatin formation.
          Length = 101

 Score = 32.8 bits (74), Expect = 0.23
 Identities = 16/58 (27%), Positives = 31/58 (53%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAP 1305
            E E++ + + +EE +D  E+EE ++  ++E+++E          A  A VKR +   P
Sbjct: 7    EGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSFGEAMAYFAMVKRYLTSFP 64



 Score = 31.3 bits (70), Expect = 0.85
 Identities = 10/29 (34%), Positives = 23/29 (79%)

Query: 1253 VEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            +EGEE+ +S+   EE++ +E+ E+++E++
Sbjct: 6    LEGEEDSDSDSDEEEDDDDEDEEDDDEDD 34


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 35.3 bits (81), Expect = 0.23
 Identities = 25/123 (20%), Positives = 37/123 (30%), Gaps = 10/123 (8%)

Query: 1162 RHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQ 1221
            RHG  R    RR      +        P+  +   SG +    A Q  +    Q     +
Sbjct: 25   RHGGTREQAGRRRGTAARAA------KPAPPAPTTSGPQVRAVAEQGHR----QTESDTE 74

Query: 1222 TIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            T   SR   K+   +    G G ++  S         EE       E    +S E     
Sbjct: 75   TAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASH 134

Query: 1282 AVP 1284
            + P
Sbjct: 135  SPP 137



 Score = 31.1 bits (70), Expect = 4.6
 Identities = 9/40 (22%), Positives = 12/40 (30%)

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
            G    AP         ++  S  Q   E+  E  E    E
Sbjct: 145  GPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSE 184



 Score = 30.7 bits (69), Expect = 5.4
 Identities = 17/91 (18%), Positives = 30/91 (32%), Gaps = 2/91 (2%)

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRA 1300
               +  P+   E +    +++   + ++ VE   E  E E   P     R  +   V   
Sbjct: 202  PPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRSHSYTVVGWK 261

Query: 1301 VPPAPD-TSTSRLRSVRQNG-RAKRTNGRQS 1329
                P       LR       R++   G+QS
Sbjct: 262  PSTRPGGVPKLCLRCTSHPSHRSRLPEGQQS 292



 Score = 30.3 bits (68), Expect = 7.5
 Identities = 6/42 (14%), Positives = 13/42 (30%)

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
                + P S  +   E+  E  +    E   ++    + E  
Sbjct: 157  PSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of the
            RNA polymerase II associated Paf1 complex. The Paf1
            complex functions during the elongation phase of
            transcription in conjunction with Spt4-Spt5 and
            Spt16-Pob3i.
          Length = 431

 Score = 35.1 bits (81), Expect = 0.24
 Identities = 21/69 (30%), Positives = 30/69 (43%), Gaps = 1/69 (1%)

Query: 1214 VQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEEN 1273
            V+     Q  V  R    +       R R R  P   +EV+ +E+EE   +S+E E EE 
Sbjct: 332  VREHNNDQLNVKLRNPSTKESKMRDKR-RARLDPIDFEEVDEDEDEEEEQRSDEHEEEEG 390

Query: 1274 SEEEEEEEA 1282
             + EEE   
Sbjct: 391  EDSEEEGSQ 399



 Score = 33.6 bits (77), Expect = 0.88
 Identities = 11/35 (31%), Positives = 15/35 (42%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
              EDE E EE+     + EE E  E    +  E+ 
Sbjct: 370  VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404



 Score = 32.8 bits (75), Expect = 1.4
 Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 1/36 (2%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
             E  DE E EEEE+ +D+ EEEE  E+SEEE  +  
Sbjct: 367  FEEVDEDEDEEEEQRSDEHEEEE-GEDSEEEGSQSR 401



 Score = 32.4 bits (74), Expect = 2.0
 Identities = 13/75 (17%), Positives = 23/75 (30%), Gaps = 17/75 (22%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPD 1306
             E +++ + EEE+ S++  EEE  +   E  +  E                      +  
Sbjct: 368  EEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGS-----------------SESSS 410

Query: 1307 TSTSRLRSVRQNGRA 1321
               S   S      A
Sbjct: 411  DVGSDSESKADKESA 425


>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle. 
          Length = 519

 Score = 35.3 bits (80), Expect = 0.24
 Identities = 55/243 (22%), Positives = 82/243 (33%), Gaps = 36/243 (14%)

Query: 1170 PPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTR 1229
            PP +T    T    +  E  S+ S  DS     M A ++   Q+  +G+  Q    S   
Sbjct: 4    PPGKTGPAATQAKAEKPEEDSESSSEDSDSEEEMPAAKNP-PQAKPSGKSPQVKAAS-AP 61

Query: 1230 GKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
             K+S      +G     P        +  EE    SEEE    +SE E    A     P 
Sbjct: 62   AKES----PQKGAPPVTPGKAGPAAAQAGEEEAKSSEEES---DSEGETPTAATLTTSPA 114

Query: 1290 RRANVAKVKR-----AVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSED 1344
            +   + K  +      V P P    + L   ++ G A    G+Q  S   +   +SDS+ 
Sbjct: 115  QAKPLGKNSQVRPASTVTPGPSGKGANLPCPQKAGSAAVQVGKQEDSE-SSSEEESDSDG 173

Query: 1345 NSQEEEQYGSRRARRTR-----TKGQSYAV----------------HSSETETESEEEES 1383
                 +   S +  + R      KG                      SSE  ++SEEE  
Sbjct: 174  PGAPAQAKSSGKLLQARPASGPAKGPPQKAGPVATQVKAERGKEDSESSEESSDSEEEAP 233

Query: 1384 QAA 1386
             A 
Sbjct: 234  AAM 236


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
            found in a wide range of eukaryotes. It is a nuclear
            protein and is suggested to be DNA binding. In plants,
            this family is essential for correct circadian clock
            functioning by acting as a light-quality regulator
            coordinating the activities of blue and red light
            signalling pathways during plant growth - inhibiting
            growth in red light but promoting growth in blue light.
          Length = 233

 Score = 34.6 bits (80), Expect = 0.28
 Identities = 13/47 (27%), Positives = 22/47 (46%)

Query: 1266 EEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRL 1312
            +++E E+  E E+E++          ANV   K+ +   P   TS L
Sbjct: 15   DDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFL 61


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 34.8 bits (80), Expect = 0.31
 Identities = 19/39 (48%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            G   P  ED+ EG  E   ++  EEEEVEE  E+EEEE 
Sbjct: 58   GNPDPFGEDD-EGRIEVRISEDGEEEEVEEGEEDEEEEG 95



 Score = 32.1 bits (73), Expect = 2.1
 Identities = 14/34 (41%), Positives = 20/34 (58%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E  +  + EEE  ++ EE+E EE  EE EE E +
Sbjct: 72   EVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPL 105



 Score = 31.7 bits (72), Expect = 2.4
 Identities = 16/37 (43%), Positives = 18/37 (48%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVP 1284
            E   EV   E+ E  +  E EE EE   EEE EE  P
Sbjct: 68   EGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEP 104


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score = 35.0 bits (81), Expect = 0.32
 Identities = 37/216 (17%), Positives = 71/216 (32%), Gaps = 27/216 (12%)

Query: 1205 ARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKF---SGRGRGRKAPESEDEVEGEEEEES 1261
              +  + +  Q  +  + +    +  ++   +      +   R A  S++  E E E E 
Sbjct: 706  ELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEE 765

Query: 1262 NDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRA 1321
             ++  EE  EE +E E E E +        A + ++K  +          L  +R     
Sbjct: 766  LEERLEEAEEELAEAEAEIEEL-------EAQIEQLKEEL----KALREALDELR--AEL 812

Query: 1322 KRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETESEEE 1381
               N   ++      S +       +  E     +        +S A    E E   EE 
Sbjct: 813  TLLNEEAANLRERLESLERRIAATERRLEDL-EEQIEELSEDIESLAAEIEELEELIEEL 871

Query: 1382 ESQAAQVSFSR----------RGRMRKPTAKLRGLE 1407
            ES+   +   R          R  + + + +LR LE
Sbjct: 872  ESELEALLNERASLEEALALLRSELEELSEELRELE 907


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part of
            a large ribonucleoprotein complex containing the U3
            snoRNA. Depletion of the Utp proteins impedes production
            of the 18S rRNA, indicating that they are part of the
            active pre-rRNA processing complex. This large RNP
            complex has been termed the small subunit (SSU)
            processome.
          Length = 728

 Score = 35.0 bits (81), Expect = 0.33
 Identities = 21/138 (15%), Positives = 49/138 (35%), Gaps = 3/138 (2%)

Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSR 1311
            E +   EE+ ++ S+ EE     E+++E++       + R  + K+K       ++    
Sbjct: 319  EGKSVSEEDEDEDSDSEEE---DEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLS 375

Query: 1312 LRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHS 1371
            ++ +++    K+         L       +  D  + EE       RR            
Sbjct: 376  MKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAE 435

Query: 1372 SETETESEEEESQAAQVS 1389
            S+   +  + E +  + S
Sbjct: 436  SKKLKKENKNEFKEKKES 453



 Score = 33.5 bits (77), Expect = 1.1
 Identities = 28/155 (18%), Positives = 48/155 (30%), Gaps = 26/155 (16%)

Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPP 1303
            ++A   + E    E EE   + E EE  +  E EE  +    RR        K       
Sbjct: 380  QRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRR--------KFGPENGE 431

Query: 1304 APDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTK 1363
                S    +  +   + K                  + E+   EEE    + A +   +
Sbjct: 432  KEAESKKLKKENKNEFKEK--------------KESDEEEELEDEEEAKVEKVANKLLKR 477

Query: 1364 GQSYAVHSSETETESEEEESQAAQVSFSRRGRMRK 1398
             +     + + E E E +E      + S  G+  K
Sbjct: 478  SEK----AQKEEEEEELDEENPWLKTTSSVGKSAK 508


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
            family includes the HABP4 family of hyaluronan-binding
            proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
            HABP4 has been observed to bind hyaluronan (a
            glucosaminoglycan), but it is not known whether this is
            its primary role in vivo. It has also been observed to
            bind RNA, but with a lower affinity than that for
            hyaluronan. PAI-1 mRNA-binding protein specifically binds
            the mRNA of type-1 plasminogen activator inhibitor
            (PAI-1), and is thought to be involved in regulation of
            mRNA stability. However, in both cases, the sequence
            motifs predicted to be important for ligand binding are
            not conserved throughout the family, so it is not known
            whether members of this family share a common function.
          Length = 106

 Score = 32.4 bits (74), Expect = 0.34
 Identities = 21/62 (33%), Positives = 24/62 (38%), Gaps = 10/62 (16%)

Query: 1230 GKQSYNKFSGRGRG----------RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEE 1279
            G+   +K  G GRG              E  +E E  E EE     EE   E   EEEE 
Sbjct: 12   GRGGEDKREGAGRGNWGSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEV 71

Query: 1280 EE 1281
            EE
Sbjct: 72   EE 73


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
            fungal and plant proteins and contains many hypothetical
            proteins. VID27 is a cytoplasmic protein that plays a
            potential role in vacuolar protein degradation.
          Length = 794

 Score = 35.1 bits (81), Expect = 0.35
 Identities = 16/41 (39%), Positives = 21/41 (51%)

Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
                R   E EDE E EEE+E    S+E   +E  EE++ E
Sbjct: 384  ANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVE 424



 Score = 34.3 bits (79), Expect = 0.52
 Identities = 12/48 (25%), Positives = 22/48 (45%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKV 1297
              E + EEEE+  ++ EE+E E  S+E  ++E         +   +  
Sbjct: 385  NTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432



 Score = 33.6 bits (77), Expect = 0.96
 Identities = 17/45 (37%), Positives = 25/45 (55%)

Query: 1237 FSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            FS         E +DE E +EEEE  +  +E   +E+S++EE EE
Sbjct: 376  FSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEE 420



 Score = 32.4 bits (74), Expect = 2.0
 Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 6/49 (12%)

Query: 1245 KAPESE-----DEVEGEEEEESNDQSE-EEEVEENSEEEEEEEAVPRRR 1287
            KA E+E     D     E E++N + + EEE +E  EEEE+E+  P + 
Sbjct: 363  KAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKE 411


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
            subunit 1; Provisional.
          Length = 319

 Score = 34.6 bits (80), Expect = 0.35
 Identities = 13/34 (38%), Positives = 24/34 (70%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            ++E+E E ++  ES D+ EE+E EE  E+++E +
Sbjct: 285  KAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318



 Score = 33.1 bits (76), Expect = 0.92
 Identities = 10/34 (29%), Positives = 21/34 (61%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E  +E E E++   ++  +EE+ +E  EE+++E 
Sbjct: 284  EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEG 317



 Score = 32.7 bits (75), Expect = 1.3
 Identities = 12/37 (32%), Positives = 24/37 (64%), Gaps = 2/37 (5%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            KA E E+E +  E E+ ++  E+E+ EE  +++E ++
Sbjct: 285  KAEEEEEEDDYSESEDEDE--EDEDEEEEEDDDEGDK 319



 Score = 31.9 bits (73), Expect = 2.0
 Identities = 10/34 (29%), Positives = 22/34 (64%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            ++E   E  E++ ++ EE++  E+ +E+EE+E  
Sbjct: 275  DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDE 308



 Score = 30.8 bits (70), Expect = 4.5
 Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 5/44 (11%)

Query: 1243 GRKAPESEDEVEGEEEEESND-----QSEEEEVEENSEEEEEEE 1281
            G    + E+ +E  EEEE  D     + E+EE E+  EEE+++E
Sbjct: 273  GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter subunit
            G; Reviewed.
          Length = 197

 Score = 33.9 bits (77), Expect = 0.36
 Identities = 29/102 (28%), Positives = 46/102 (45%), Gaps = 17/102 (16%)

Query: 1195 YDSGDRYAMRARQSKQVQSVQAGR---------RGQTIVTSRTRGKQSYNKFSGRGRGRK 1245
            YD+G   A+R R   Q++SV+            R + I  +R   ++   +     R  K
Sbjct: 93   YDTGVPLAIRIRD--QLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEK 150

Query: 1246 APESEDEVEGEEE------EESNDQSEEEEVEENSEEEEEEE 1281
              E ED+ E E E      EE +D SE E +E++  E E ++
Sbjct: 151  IDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDD 192


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This family
            consists of several uncharacterized plant proteins of
            unknown function.
          Length = 565

 Score = 34.6 bits (79), Expect = 0.41
 Identities = 19/34 (55%), Positives = 25/34 (73%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E E+EVE EE+EE  ++   + V+E SEEEEEEE
Sbjct: 141  EEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 34.5 bits (79), Expect = 0.45
 Identities = 13/41 (31%), Positives = 27/41 (65%)

Query: 487 VLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQ 527
           +LKGH  ++ +++ +P +  ++ S+G D  + IWDI + K+
Sbjct: 120 ILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160



 Score = 32.6 bits (74), Expect = 1.6
 Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 19/83 (22%)

Query: 260 LLAAGTVDKTIRIWNL--------QTLAPISVLVGHQGIITGVNFCPLEVNGFNY--LAS 309
           +LA+G+ D TIR+W +        +   P  +L GH+  I+ +++ P+     NY  + S
Sbjct: 89  ILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPM-----NYYIMCS 143

Query: 310 TSTDGCIGFWKYKLDIDISKTVF 332
           +  D  +  W    DI+  K  F
Sbjct: 144 SGFDSFVNIW----DIENEKRAF 162



 Score = 30.3 bits (68), Expect = 9.6
 Identities = 20/68 (29%), Positives = 34/68 (50%), Gaps = 6/68 (8%)

Query: 455 VAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHD 514
           V W+     +I AI    Q++        +++ LKGHT+ +  L+ +P  S +L S   D
Sbjct: 43  VPWEVEGGGLIGAIRLENQMR-----KPPVIK-LKGHTSSILDLQFNPCFSEILASGSED 96

Query: 515 GLIIIWDI 522
             I +W+I
Sbjct: 97  LTIRVWEI 104


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
            [Transcription / DNA replication, recombination, and
            repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 34.6 bits (79), Expect = 0.46
 Identities = 16/44 (36%), Positives = 24/44 (54%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
            ES++  E   E+ES + S +EE  E+ +E E + A   R   RR
Sbjct: 957  ESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDSRPGKRR 1000



 Score = 33.1 bits (75), Expect = 1.4
 Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 1237 FSGRGRGRKAPESEDEV---EGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            F   G   ++ ESE+EV   E   ++ES++  E+EE +E+SE+  E+E+
Sbjct: 922  FLMVGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDES 970



 Score = 31.9 bits (72), Expect = 2.9
 Identities = 14/36 (38%), Positives = 23/36 (63%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
              ES++  E EE +ES++   E+E E +S +EE+ E
Sbjct: 946  DDESDETDEDEESDESSEDLSEDESENDSSDEEDGE 981


>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
            (NESP55).  This family consists of several mammalian
            neuroendocrine-specific golgi protein P55 (NESP55)
            sequences. NESP55 is a novel member of the chromogranin
            family and is a soluble, acidic, heat-stable secretory
            protein that is expressed exclusively in endocrine and
            nervous tissues, although less widely than chromogranins.
          Length = 261

 Score = 34.1 bits (77), Expect = 0.47
 Identities = 24/118 (20%), Positives = 44/118 (37%), Gaps = 4/118 (3%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE----AVPRRRPVRRANVAKVKRAVPP 1303
            E E ++E E E E+  ++E +       E E E+     VP+     ++   ++      
Sbjct: 121  EPESDIESETEFETEPETEPDTAPTTEPETEPEDEPGPVVPKGATFHQSLTERLHALKLQ 180

Query: 1304 APDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTR 1361
            + D S  R     Q   + R         L     D + E+  +EEE+    R +  +
Sbjct: 181  SADASPRRAPPSTQEPESAREGEEPERGPLDKDPRDPEEEEEEKEEEKQQPHRCKPKK 238



 Score = 32.1 bits (72), Expect = 1.7
 Identities = 27/81 (33%), Positives = 35/81 (43%), Gaps = 8/81 (9%)

Query: 1219 RGQTIVTSRTRGKQSYNKFSGRGRGRKAP----ESEDEVEGEEEEESNDQSEEEEVEENS 1274
            +G T   S T    +    S     R+AP    E E   EGEE E      +  + EE  
Sbjct: 162  KGATFHQSLTERLHALKLQSADASPRRAPPSTQEPESAREGEEPERGPLDKDPRDPEEEE 221

Query: 1275 EEEEEEEAVPRR----RPVRR 1291
            EE+EEE+  P R    +P RR
Sbjct: 222  EEKEEEKQQPHRCKPKKPARR 242


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
            Provisional.
          Length = 2849

 Score = 34.6 bits (79), Expect = 0.48
 Identities = 18/71 (25%), Positives = 33/71 (46%), Gaps = 4/71 (5%)

Query: 1215 QAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEEN- 1273
            +A      ++  + R  +   +     R     + +DE E E++++  D  EEEE EE  
Sbjct: 121  KAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEI 180

Query: 1274 ---SEEEEEEE 1281
                +E+EE+E
Sbjct: 181  KGFDDEDEEDE 191



 Score = 31.9 bits (72), Expect = 3.4
 Identities = 19/64 (29%), Positives = 32/64 (50%), Gaps = 1/64 (1%)

Query: 1222 TIVTSRTRGKQ-SYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
             I+  R R +  +    S R       + EDE E +++EE +++ EEEE E    ++E+E
Sbjct: 129  MIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDE 188

Query: 1281 EAVP 1284
            E   
Sbjct: 189  EDEG 192



 Score = 31.6 bits (71), Expect = 4.7
 Identities = 12/49 (24%), Positives = 23/49 (46%)

Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
            R R R   E +         + +D+ E+E+ ++  ++EEEEE     + 
Sbjct: 134  RRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKG 182


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
            subunit 8 N-terminus.  The largest of the mammalian
            translation initiation factors, eIF3, consists of at
            least eight subunits ranging in mass from 35 to 170 kDa.
            eIF3 binds to the 40 S ribosome in an early step of
            translation initiation and promotes the binding of
            methionyl-tRNAi and mRNA.
          Length = 593

 Score = 34.4 bits (79), Expect = 0.50
 Identities = 25/112 (22%), Positives = 47/112 (41%), Gaps = 19/112 (16%)

Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENS----------------EEEEEEEAVPR 1285
            R R+ PESEDE E E+E++ +D S++E+ +E+                   +E++E    
Sbjct: 130  RYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDED 189

Query: 1286 RRPVRRANVAKVKRAVPPA---PDTSTSRLRSVRQNGRAKRTNGRQSSSLLM 1334
                ++  + + K    P     D    +L+ +      K T+  +   LL 
Sbjct: 190  ADLSKKDVLEEPKMFKKPEEITWDDVFKKLKEIMSARGKKTTDRSEQIDLLE 241


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
            Reviewed.
          Length = 129

 Score = 32.7 bits (75), Expect = 0.50
 Identities = 10/40 (25%), Positives = 24/40 (60%)

Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            + +K     D+   +++   +D  +EE+++E  +E+EE+E
Sbjct: 89   KKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 31.5 bits (72), Expect = 1.1
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            K  + +D++  ++    +D  EE+  EE+ E+EE+EE
Sbjct: 93   KFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
            includes archaeal L12p, the protein that is functionally
            equivalent to L7/L12 in bacteria and the P1 and P2
            proteins in eukaryotes. L12p is homologous to P1 and P2
            but is not homologous to bacterial L7/L12. It is located
            in the L12 stalk, with proteins L10, L11, and 23S rRNA.
            L12p is the only protein in the ribosome to occur as
            multimers, always appearing as sets of dimers. Recent
            data indicate that most archaeal species contain six
            copies of L12p (three homodimers), while eukaryotes have
            four copies (two heterodimers), and bacteria may have
            four or six copies (two or three homodimers), depending
            on the species. The organization of proteins within the
            stalk has been characterized primarily in bacteria, where
            L7/L12 forms either two or three homodimers and each
            homodimer binds to the extended C-terminal helix of L10.
            L7/L12 is attached to the ribosome through L10 and is the
            only ribosomal protein that does not directly interact
            with rRNA. Archaeal L12p is believed to function in a
            similar fashion. However, hybrid ribosomes containing the
            large subunit from E. coli with an archaeal stalk are
            able to bind archaeal and eukaryotic elongation factors
            but not bacterial elongation factors. In several
            mesophilic and thermophilic archaeal species, the binding
            of 23S rRNA to protein L11 and to the L10/L12p pentameric
            complex was found to be temperature-dependent and
            cooperative.
          Length = 106

 Score = 32.1 bits (73), Expect = 0.54
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 1256 EEEEESNDQSEEEEVEENSEEEEEEEA 1282
               EE  ++ EEE+ +E  +EEEEEEA
Sbjct: 72   AAAEEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 31.3 bits (71), Expect = 1.0
 Identities = 12/32 (37%), Positives = 20/32 (62%), Gaps = 5/32 (15%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEE 1277
            A  +E++ E +EEE+     ++EE +E  EEE
Sbjct: 71   AAAAEEKAEEKEEEK-----KKEEEKEEEEEE 97



 Score = 30.9 bits (70), Expect = 1.2
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 1256 EEEEESNDQSEEEEVEENSEEEEEEEA 1282
              EE++ ++ EE++ EE  EEEEEE  
Sbjct: 73   AAEEKAEEKEEEKKKEEEKEEEEEEAL 99


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
            consists of several bacterial cobalamin biosynthesis
            (CobT) proteins. CobT is involved in the transformation
            of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 33.6 bits (77), Expect = 0.56
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E ED+ +GEEEE  +  S  E+ + +SEE E  E
Sbjct: 232  EDEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265



 Score = 33.6 bits (77), Expect = 0.57
 Identities = 13/36 (36%), Positives = 22/36 (61%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            A E  DE E  + E++ D+ + +E E++ + EEEE 
Sbjct: 209  AEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEES 244



 Score = 33.2 bits (76), Expect = 0.89
 Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E +D+ E EE   S+  SE+ +      E  E EA 
Sbjct: 234  EDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAA 269



 Score = 32.5 bits (74), Expect = 1.4
 Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 5/42 (11%)

Query: 1246 APESEDEVEGEEE-----EESNDQSEEEEVEENSEEEEEEEA 1282
             PES D  + E+E     +E +DQ EEEE   +    E+ +A
Sbjct: 215  EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDA 256


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
            describes Pseudomonas denitrificans CobT gene product,
            which is a cobalt chelatase subunit that functions in
            cobalamin biosynthesis. Cobalamin (vitamin B12) can be
            synthesized via several pathways, including an aerobic
            pathway (found in Pseudomonas denitrificans) and an
            anaerobic pathway (found in P. shermanii and Salmonella
            typhimurium). These pathways differ in the point of
            cobalt insertion during corrin ring formation. There are
            apparently a number of variations on these two pathways,
            where the major differences seem to be concerned with the
            process of ring contraction. Confusion regarding the
            functions of enzymes found in the aerobic vs. anaerobic
            pathways has arisen because nonhomologous genes in these
            different pathways were given the same gene symbols.
            Thus, cobT in the aerobic pathway (P. denitrificans) is
            not a homolog of cobT in the anaerobic pathway (S.
            typhimurium). It should be noted that E. coli synthesizes
            cobalamin only when it is supplied with the precursor
            cobinamide, which is a complex intermediate.
            Additionally, all E. coli cobalamin synthesis genes
            (cobU, cobS and cobT) were named after their Salmonella
            typhimurium homologs which function in the anaerobic
            cobalamin synthesis pathway. This model describes the
            aerobic cobalamin pathway Pseudomonas denitrificans CobT
            gene product, which is a cobalt chelatase subunit, with a
            MW ~70 kDa. The aerobic pathway cobalt chelatase is a
            heterotrimeric, ATP-dependent enzyme that catalyzes
            cobalt insertion during cobalamin biosynthesis. The other
            two subunits are the P. denitrificans CobS (TIGR01650)
            and CobN (pfam02514 CobN/Magnesium Chelatase) proteins.
            To avoid potential confusion with the nonhomologous
            Salmonella typhimurium/E.coli cobT gene product, the P.
            denitrificans gene symbol is not used in the name of this
            model [Biosynthesis of cofactors, prosthetic groups, and
            carriers, Heme, porphyrin, and cobalamin].
          Length = 600

 Score = 34.1 bits (78), Expect = 0.57
 Identities = 13/34 (38%), Positives = 17/34 (50%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E+E E +GE E E  + S  +E E    E E  E
Sbjct: 222  ENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255



 Score = 32.6 bits (74), Expect = 1.5
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            E E++ EGE E +     +E E  +   E  EEE V
Sbjct: 224  EQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMV 259


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
            or chromatin assembly factor-1 consists of three
            subunits, and this is the first, or A. The A domain is
            uniquely required for the progression of S phase in mouse
            cells, independent of its ability to promote histone
            deposition but dependent on its ability to interact with
            HP1 - heterochromatin protein 1-rich heterochromatin
            domains next to centromeres that are crucial for
            chromosome segregation during mitosis. This HP1-CAF-1
            interaction module functions as a built-in replication
            control for heterochromatin, which, like a control
            barrier, has an impact on S-phase progression in addition
            to DNA-based checkpoints.
          Length = 76

 Score = 31.0 bits (71), Expect = 0.66
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 1249 SEDEVEGEEEEESNDQSEEEEVEENSEEEEE 1279
            S+ E E EEE E  +  +EE+ EE+ +++ +
Sbjct: 44   SDAEWEEEEEGEDLESEDEEDEEEDDDDDMD 74



 Score = 30.3 bits (69), Expect = 1.0
 Identities = 11/30 (36%), Positives = 19/30 (63%), Gaps = 1/30 (3%)

Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            + E EEEEE  D   E+E E+  E+++++ 
Sbjct: 45   DAEWEEEEEGEDLESEDE-EDEEEDDDDDM 73


>gnl|CDD|107161 PHA02264, PHA02264, hypothetical protein.
          Length = 152

 Score = 32.4 bits (73), Expect = 0.72
 Identities = 8/68 (11%), Positives = 24/68 (35%), Gaps = 9/68 (13%)

Query: 927 YELSRLRSSPWGNLILQEPELVKVVGIKYEIAPPRMCCLKLAVMKPNGKLTNEFFNIKYH 986
           Y  +    +    ++++ P+++ V G              L +   + ++   ++N    
Sbjct: 84  YNNTDYNGNIERRIVVKHPKVITVEGKP--------SYQFLTISLEDARVITVWYN-SVD 134

Query: 987 DMPDVIDF 994
           D    +D 
Sbjct: 135 DTHRTLDL 142


>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
          Length = 130

 Score = 32.1 bits (73), Expect = 0.73
 Identities = 16/41 (39%), Positives = 23/41 (56%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRR 1287
             E E E   EE+ ESN+  E +     +E++EE+E V R R
Sbjct: 14   SEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQRPR 54



 Score = 31.8 bits (72), Expect = 1.1
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 1251 DEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            ++++   E+E ++   EE  EE+SE    +E
Sbjct: 3    NDMDMGYEDEPSEPEIEEGAEEDSESNNNDE 33


>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK).  This family
            consists of several plant specific phytosulfokine
            precursor proteins. Phytosulfokines, are active as either
            a pentapeptide or a C-terminally truncated tetrapeptide.
            These compounds were first isolated because of their
            ability to stimulate cell division in somatic embryo
            cultures of Asparagus officinalis.
          Length = 74

 Score = 30.8 bits (70), Expect = 0.75
 Identities = 13/44 (29%), Positives = 19/44 (43%)

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRR 1286
             R  P        + E   + + +E+EV    E E EEE + RR
Sbjct: 15   ARPLPTDTGHESSKIEAAESSKVDEDEVVSECEGEGEEECLMRR 58


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 34.0 bits (77), Expect = 0.77
 Identities = 24/59 (40%), Positives = 30/59 (50%), Gaps = 1/59 (1%)

Query: 1129 SSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTN-RVGTSLGRQYR 1186
            SS S   S S SSS+S SS +  +E +E ED  R G  +R   +R   R     GR YR
Sbjct: 1256 SSSSSSCSSSSSSSDSSSSEEDGDEKNEKEDRERAGGGKRRGRQRLPIRDRVYRGRDYR 1314


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 32.6 bits (74), Expect = 0.86
 Identities = 14/42 (33%), Positives = 27/42 (64%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
            E E E+  EEE++ N+ S  E++E   E+E+ ++  P+++ V
Sbjct: 3    EVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTV 44


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 34.0 bits (77), Expect = 0.86
 Identities = 28/191 (14%), Positives = 77/191 (40%), Gaps = 5/191 (2%)

Query: 1200 RYAMRARQSKQVQSVQAGRRGQTIVTSRTRGK--QSYNKFSGRGRGRKAPESEDEVEGEE 1257
            + A  A+++++ +     ++ + +  +  + K  ++      +    +  E   + E   
Sbjct: 1534 KKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEAR 1593

Query: 1258 EEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQ 1317
             EE     EEE+  +  E ++ EEA  +   +++A   + K+ V           +   +
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAE--EEKKKVEQLKKKEAEEKKKAEE 1651

Query: 1318 NGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETE 1377
              +A+  N  +++        D    + +++ E+   ++A     K    A  + E + +
Sbjct: 1652 LKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE-DEKKAAEALKKEAEEAKKAEELKKK 1710

Query: 1378 SEEEESQAAQV 1388
              EE+ +A ++
Sbjct: 1711 EAEEKKKAEEL 1721



 Score = 33.2 bits (75), Expect = 1.5
 Identities = 24/165 (14%), Positives = 67/165 (40%), Gaps = 6/165 (3%)

Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV-RRANVAKVKRAVP 1302
            +KA E +   E ++ EE     E+++ EE  + EE++    R+    ++A  A+++  + 
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599

Query: 1303 PAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRT 1362
               +    +    +     K    +  +  L     +    +  +++E    ++A   + 
Sbjct: 1600 LYEEEKKMKAEEAK-----KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKK 1654

Query: 1363 KGQSYAVHSSETETESEEEESQAAQVSFSRRGRMRKPTAKLRGLE 1407
              +   + ++E   ++EE++ +A +   +     +   A  +  E
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699



 Score = 31.3 bits (70), Expect = 5.6
 Identities = 29/211 (13%), Positives = 92/211 (43%), Gaps = 9/211 (4%)

Query: 1202 AMRARQSKQVQSV----QAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEE 1257
            A +A   K+ +      +  ++ +    +    K    + +   R + A ++E+  + +E
Sbjct: 1223 AKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADE 1282

Query: 1258 EEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVK-RAVPPAPDTSTSRLRSVR 1316
             +++ ++ + +E ++  E+++ +EA  +    ++A+ AK K        D +  +    +
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK 1342

Query: 1317 QNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETET 1376
            +   A +     ++         +++ +  +EE +   ++A   + K +     + E + 
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK---KKADAAKKKAEEKK-KADEAKK 1398

Query: 1377 ESEEEESQAAQVSFSRRGRMRKPTAKLRGLE 1407
            ++EE++ +A ++  +   + +   AK +  E
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429



 Score = 30.9 bits (69), Expect = 7.0
 Identities = 31/158 (19%), Positives = 63/158 (39%), Gaps = 6/158 (3%)

Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEE--NSEEEEEEEAVPRRRPVRRANVAK---VK 1298
             KA E++    G+ EE    +  +++ E+   +EE  + E   +    R+A  AK   + 
Sbjct: 1098 GKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIA 1157

Query: 1299 RAVPPAPDTSTSR-LRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRA 1357
            R    A     +R     ++   A++    + +  L        +E   + EE+  +  A
Sbjct: 1158 RKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEA 1217

Query: 1358 RRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRRGR 1395
            R+     ++ AV  +E   +  EE  +A +   +   R
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIR 1255


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
            splicing factor.  These splicing factors consist of an
            N-terminal arginine-rich low complexity domain followed
            by three tandem RNA recognition motifs (pfam00076). The
            well-characterized members of this family are auxilliary
            components of the U2 small nuclear ribonuclearprotein
            splicing factor (U2AF). These proteins are closely
            related to the CC1-like subfamily of splicing factors
            (TIGR01622). Members of this subfamily are found in
            plants, metazoa and fungi.
          Length = 509

 Score = 33.3 bits (76), Expect = 0.92
 Identities = 15/128 (11%), Positives = 30/128 (23%), Gaps = 9/128 (7%)

Query: 1183 RQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGR 1242
             + RE           DR + R R+  + +S    R  ++   S     +  ++     R
Sbjct: 6    DREREKSRG----RDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSR 61

Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRA-NVAKVKRAV 1301
                                 +     V    +              R+      +    
Sbjct: 62   SP----RSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLWDIK 117

Query: 1302 PPAPDTST 1309
            PP  +  T
Sbjct: 118  PPGYELVT 125


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
            nuclear protein involved in bud-site selection. It plays
            a role in positioning the proximal bud pole signal. More
            recently it has been shown to be involved in ribosome
            biogenesis.
          Length = 424

 Score = 33.3 bits (76), Expect = 0.93
 Identities = 15/59 (25%), Positives = 26/59 (44%)

Query: 1223 IVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            I T   +GK        + +  K    +D+ E  E E+ +   E  E + + EEEE+ +
Sbjct: 142  IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSD 200



 Score = 31.7 bits (72), Expect = 3.1
 Identities = 15/45 (33%), Positives = 30/45 (66%)

Query: 1238 SGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            + + + ++A ES D+ + EE E  ++   EE  E++S++EEEE++
Sbjct: 155  TKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDS 199



 Score = 31.3 bits (71), Expect = 3.8
 Identities = 10/38 (26%), Positives = 18/38 (47%)

Query: 1251 DEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
                 +  E  +D+S+ E  E  S  + EE + P ++P
Sbjct: 226  INYNEDTSESESDESDSEISESRSVSDSEESSPPSKKP 263


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
            family of proteins are part of the yeast nuclear pore
            complex-associated pre-60S ribosomal subunit. The family
            functions as a highly conserved exonuclease that is
            required for the 5'-end maturation of 5.8S and 25S rRNAs,
            demonstrating that 5'-end processing also has a redundant
            pathway. Nop25 binds late pre-60S ribosomes, accompanying
            them from the nucleolus to the nuclear periphery; and
            there is evidence for both physical and functional links
            between late 60S subunit processing and export.
          Length = 134

 Score = 31.9 bits (73), Expect = 0.97
 Identities = 9/34 (26%), Positives = 15/34 (44%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
              E+E + EE+ E+ D  + E+ E     E    
Sbjct: 78   LLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVT 111


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
            represents the L12P protein of the large (50S) subunit of
            the archaeal ribosome.
          Length = 105

 Score = 31.2 bits (71), Expect = 0.98
 Identities = 19/37 (51%), Positives = 20/37 (54%), Gaps = 10/37 (27%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            A  +  E E EEEEE     EEEE     EEE EEEA
Sbjct: 70   AAAAAAEEEEEEEEE-----EEEE-----EEESEEEA 96



 Score = 30.8 bits (70), Expect = 1.3
 Identities = 17/32 (53%), Positives = 18/32 (56%), Gaps = 4/32 (12%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEE 1277
            A  +    E EEEEE     EEEE EE SEEE
Sbjct: 68   AAAAAAAAEEEEEEE----EEEEEEEEESEEE 95



 Score = 29.6 bits (67), Expect = 3.0
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVE 1271
            A   E+E E EEEEE  ++SEEE + 
Sbjct: 73   AAAEEEEEEEEEEEEEEEESEEEAMA 98


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 33.5 bits (76), Expect = 1.0
 Identities = 32/150 (21%), Positives = 58/150 (38%), Gaps = 24/150 (16%)

Query: 699 SRANGESSAATEGTS-----PSGTAPTAQNKNFVKRLPLSQ-------HKIVVERINAFA 746
           SR   ES+ A +         S   PT   K   K+  +          ++VV    A  
Sbjct: 48  SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107

Query: 747 EIESEEFIKESKKLFDSSTTTQPLVVKKEKKKKVVRQARTRAEYSEHIQDQYEDPDNLSL 806
           +   +   +  +K   +S+      V++EK +K VR+ R   +  E ++DQ  + +   +
Sbjct: 108 KESKKTPRRTRRKAAAASSD-----VEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDV 162

Query: 807 SGSE-------ESASDSDQSQVDVSDLCES 829
             SE       ES  + D  + D  D+  +
Sbjct: 163 EESEFVTSLENESEEELDLEKDDGEDISHT 192



 Score = 33.1 bits (75), Expect = 1.1
 Identities = 26/139 (18%), Positives = 52/139 (37%), Gaps = 7/139 (5%)

Query: 1144 SGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGR-QYRENPSDVSDYDSGDRYA 1202
            +GS  +     + DE+   +G   +  P R+ +  T     +  E   + S+        
Sbjct: 46   AGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSEL------V 99

Query: 1203 MRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESN 1262
            +    +   +S +  RR +    + +   +         + RK  + +++VE +  E   
Sbjct: 100  VNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEV 159

Query: 1263 DQSEEEEVEENSEEEEEEE 1281
               EE E   + E E EEE
Sbjct: 160  SDVEESEFVTSLENESEEE 178



 Score = 30.8 bits (69), Expect = 6.1
 Identities = 32/148 (21%), Positives = 50/148 (33%), Gaps = 26/148 (17%)

Query: 1227 RTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEE----------ESNDQSEEEEVEENSEE 1276
            R R +  +   +G    RK  ES   V+ E +E           S  ++ ++ V E SE 
Sbjct: 35   RWRLQNKFLLAAGS---RKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEP 91

Query: 1277 EEE-------------EEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKR 1323
             EE             +E+    R  RR   A          +    + R V++      
Sbjct: 92   LEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVE 151

Query: 1324 TNGRQSSSLLMTCSGDSDSEDNSQEEEQ 1351
              G +S    +  S    S +N  EEE 
Sbjct: 152  DQGSESEVSDVEESEFVTSLENESEEEL 179


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 33.8 bits (77), Expect = 1.0
 Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 1245 KAPESEDEVEGE-EEEESNDQSEEEEVEENSEEEEEEEAVP 1284
            KA E +D  E E EEE S+D   ++E++ +  +E   +  P
Sbjct: 3943 KALEDKDRQEKEDEEEMSDDVGIDDEIQPD-IQENNSQPPP 3982


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of AF4
            (Proto-oncogene AF4) and FMR2 (Fragile X E mental
            retardation syndrome) nuclear proteins. These proteins
            have been linked to human diseases such as acute
            lymphoblastic leukaemia and mental retardation. The
            family also contains a Drosophila AF4 protein homologue
            Lilliputian which contains an AT-hook domain. Lilliputian
            represents a novel pair-rule gene that acts in
            cytoskeleton regulation, segmentation and morphogenesis
            in Drosophila.
          Length = 1154

 Score = 33.4 bits (76), Expect = 1.1
 Identities = 17/32 (53%), Positives = 18/32 (56%), Gaps = 3/32 (9%)

Query: 1129 SSLSE--ENSHSDSSSESGSSSQSSEEDSEDE 1158
            SS  E   +S   SSS SGS S SS  DSE E
Sbjct: 388  SSNPEPAASSSGSSSSSSGSES-SSGSDSESE 418



 Score = 31.8 bits (72), Expect = 3.7
 Identities = 43/209 (20%), Positives = 72/209 (34%), Gaps = 34/209 (16%)

Query: 1129 SSLSEENSHSDSSSESGSSSQSSEEDSED---------------------EDWTRHGKAR 1167
            S  S  +S S+SSS S S S+SS  DSE+                     ++W       
Sbjct: 398  SGSSSSSSGSESSSGSDSESESSSSDSEENEPPRTASPEPEPPSTNKWQLDNWLNKVNPH 457

Query: 1168 RMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTI--VT 1225
            ++ P  +      + +       +     SG +Y   +++     S +  RR +T     
Sbjct: 458  KVSPAESVSSNPPIKQPME---KEGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGP 514

Query: 1226 SRTRGKQ-----SYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
               RGKQ     S      R  G+K P+  ++    +E        E          +  
Sbjct: 515  ESGRGKQKSPAQSEAPPQRRTVGKKQPKKPEKASAGDERTGLRPESEPGTLPYGSSVQTP 574

Query: 1281 EAVPRRRPV--RRANVAK-VKRAVPPAPD 1306
               P+      R+ +  K  K +VPPA +
Sbjct: 575  PDRPKAATKGSRKPSPRKEPKSSVPPAAE 603


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1 protein
            homologues. SDA1 is a Saccharomyces cerevisiae protein
            which is involved in the control of the actin
            cytoskeleton. The protein is essential for cell viability
            and is localised in the nucleus.
          Length = 317

 Score = 33.1 bits (76), Expect = 1.1
 Identities = 12/43 (27%), Positives = 23/43 (53%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVR 1290
              EDE   +E E  + +S++E    +SE+EEE++   ++    
Sbjct: 106  VEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKED 148



 Score = 31.2 bits (71), Expect = 3.8
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 1254 EGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            +G E ++ +D+ EE EVEE+ + ++E E
Sbjct: 90   QGLESDDDDDEEEEWEVEEDEDSDDEGE 117



 Score = 31.2 bits (71), Expect = 4.4
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            E E+E +   ++   D  EE   E+  E  EEEEA
Sbjct: 133  EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEA 167



 Score = 30.4 bits (69), Expect = 6.0
 Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRR 1286
            ES D  E EEE++   +  +E+ +E   EE+EEEA    
Sbjct: 128  ESSDS-EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEE 165



 Score = 30.0 bits (68), Expect = 9.4
 Identities = 10/34 (29%), Positives = 16/34 (47%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
              + E E   E + +D  EEE   E  E+ ++E 
Sbjct: 83   RKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This family
            is a region of the Myb-Related Cdc5p/Cef1 proteins, in
            fungi, and is part of the pre-mRNA splicing factor
            complex.
          Length = 363

 Score = 33.1 bits (76), Expect = 1.2
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 13/77 (16%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEE--------EEEEEAVPRRRPVRRANVAKV 1297
             P++E E+E  EEEE   +  EEE+EE++ +        EE +E    R   RR+ V  +
Sbjct: 140  KPKNEFELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELR---RRSQV--I 194

Query: 1298 KRAVPPAPDTSTSRLRS 1314
            +R +P         LR 
Sbjct: 195  QRNLPRPSVLDLIVLRP 211


>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675).  The
           members of this family are sequences derived from
           hypothetical plant proteins of unknown function. One
           member of this family is annotated as a putative
           RNA-binding protein, but no evidence was found to
           support this.
          Length = 283

 Score = 32.6 bits (74), Expect = 1.3
 Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 9/87 (10%)

Query: 762 DSSTTTQPLVVKKEKKKKVVRQARTRAEY-SEHIQDQYEDPDNLSLSGSEESAS------ 814
              TT+ P+  ++E +K+   Q+  R E   +  + +Y    N     S  +A+      
Sbjct: 58  LMRTTSLPVETEEEWRKRKEMQSLRRLEAKRKRSEKEYNGVSNGDDMDSINAANGGGSGR 117

Query: 815 --DSDQSQVDVSDLCESSSSSGGTSSS 839
             +S +     S +  SSSS G +SS 
Sbjct: 118 DKESSKGVETGSPIATSSSSGGPSSSG 144


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8 is
            a nuclear protein, present at the sites of
            transcriptional repressor complexes, which interacts with
            LIN-35 Rb.Lin35 Rb is a product of the class B synMuv
            gene lin-35 which silences genes required for vulval
            specification through chromatin modification and
            remodelling. The biological role of the interaction has
            not yet been determined however predictions have been
            made. The interaction shows that class A synMuv genes
            control vulval induction through the transcriptional
            regulation of gene expression. LIN-8 normally functions
            as part of a protein complex however when the complex is
            absent, other family members can partially replace LIN-8
            activity.
          Length = 316

 Score = 32.7 bits (75), Expect = 1.3
 Identities = 27/115 (23%), Positives = 47/115 (40%), Gaps = 10/115 (8%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEEN------SEEEEEEEAVPRRRP--VRRANVAKVKRAV 1301
            ED     ++ E ND+ +E EVEE       +   +E E     RP  V+++  A+     
Sbjct: 151  EDRTRESQDMEENDEVDEVEVEEVPDDYGANLAVDEPEQSTMSRPQEVKQSVPAQQAPPN 210

Query: 1302 PPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEE-EQYGSR 1355
            P  P  S S   +  ++    R +  Q  S        + + D++ +E EQ   +
Sbjct: 211  PQQPMPSASSESATSKSASTSRESSPQPQS-PPPRRVPAPTVDSTAKEAEQITYQ 264


>gnl|CDD|224273 COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family)
            [Replication,    recombination, and repair].
          Length = 248

 Score = 32.3 bits (74), Expect = 1.4
 Identities = 23/77 (29%), Positives = 31/77 (40%), Gaps = 5/77 (6%)

Query: 1247 PESEDEVEGEEEEESND--QSEEEEVEENSEEEEEEEAVPR-RRPV--RRANVAKVKRAV 1301
            P+ E+E E EE EE  D   +  EE E   E  E    +   RR V  +     K+K+  
Sbjct: 76   PKEEEEAEDEELEEPRDELVARLEEYERYKEAAELLAELEEERRDVFSKIKPEIKIKKER 135

Query: 1302 PPAPDTSTSRLRSVRQN 1318
             P  + S   L    Q 
Sbjct: 136  RPVEEISLIDLFRAYQK 152


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 33.3 bits (76), Expect = 1.4
 Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 3/48 (6%)

Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKR 1299
            E + E++EE +++ + EEV+E  EEEE+           + N+   K+
Sbjct: 281  EKKEEKDEEKSEEVKTEEVDEEFEEEEKGF---YYELYEKVNIEANKK 325


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 32.9 bits (76), Expect = 1.4
 Identities = 20/153 (13%), Positives = 38/153 (24%), Gaps = 10/153 (6%)

Query: 1255 GEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRS 1314
            G E +  +   EEEE  +   + E EE          A  A+ +     A          
Sbjct: 28   GIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAE 87

Query: 1315 VRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSET 1374
                  A                 ++     +        + A++ +           + 
Sbjct: 88   AAAAAEAAARPAED----------EAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKA 137

Query: 1375 ETESEEEESQAAQVSFSRRGRMRKPTAKLRGLE 1407
                +  +    +     R R RK   K +  E
Sbjct: 138  ARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTE 170


>gnl|CDD|147051 pfam04698, MOBP_C-Myrip, Myelin-associated oligodendrocytic basic
            protein (MOBP).  MOBP is abundantly expressed in central
            nervous system myelin, and shares several characteristics
            with myelin basic protein (MBP), in terms of regional
            distribution and function. This family is the middle and
            C-terminal regions of MOBP which has been shown to be
            essential for normal arrangement of the radial component
            in central nervous system myelin. Most member-proteins
            carry a FVHE-PHD type zinc-finger at their N-terminus.
          Length = 710

 Score = 32.9 bits (74), Expect = 1.6
 Identities = 54/260 (20%), Positives = 103/260 (39%), Gaps = 25/260 (9%)

Query: 1140 SSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGD 1199
            +SS + +  +   +  ED DW+    ++  PP +      +L +  +  P+     D G 
Sbjct: 234  ASSVASAYDEIGSDSEEDFDWSE-ALSKLCPPSQ------ALLKNPQPQPTQAQSSDQGP 286

Query: 1200 RYAM-RARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEE 1258
              A   +  S   +++ +     +  +SR  G Q+   +  R   +     +  + GE +
Sbjct: 287  IAASPSSALSPNPEAMCSDSETSSAGSSREAGCQAKLSWLQRKAPKNPAAEKMPLHGELD 346

Query: 1259 EESNDQSEEEEVEENSEEEEEEEAVPRR-RPVRRANVAKVKRAVPPAPDTSTSRLRSVRQ 1317
               N Q+   E  ++S+ EE      RR R  RRA +          P+     L S   
Sbjct: 347  VNFNPQAAGGETSDSSDPEEALHTADRRARRWRRARL---------GPEEPNKELPSPGA 397

Query: 1318 NGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETE 1377
            + RA  T   Q S  L     ++D  + +Q+ +        + R +    A+  SE ET 
Sbjct: 398  HLRALDTA--QVSDDL----SETDISNEAQDPQSLTDSTEEKLRNRLYELAMKMSEKETS 451

Query: 1378 S-EEEESQAAQVSFSRRGRM 1396
            S E++ES+      +++  +
Sbjct: 452  SGEDQESEPKAEPENQKESL 471



 Score = 30.2 bits (67), Expect = 9.3
 Identities = 31/142 (21%), Positives = 53/142 (37%), Gaps = 13/142 (9%)

Query: 1140 SSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRE-NPSDVSDYDSG 1198
            +  E+  SS   E     +   R  +  R+ P   N+   S G   R  + + VSD  S 
Sbjct: 354  AGGETSDSSDPEEALHTADRRARRWRRARLGPEEPNKELPSPGAHLRALDTAQVSDDLSE 413

Query: 1199 DRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEE 1258
               +  A+  + +      +        R R  +   K S      K   S ++ E E +
Sbjct: 414  TDISNEAQDPQSLTDSTEEKL-------RNRLYELAMKMS-----EKETSSGEDQESEPK 461

Query: 1259 EESNDQSEEEEVEENSEEEEEE 1280
             E  +Q E    E+N++  +EE
Sbjct: 462  AEPENQKESLSSEDNNQGVQEE 483


>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein.  S-antigens are heat stable
            proteins that are found in the blood of individuals
            infected with malaria.
          Length = 94

 Score = 30.3 bits (68), Expect = 1.6
 Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEV-EENSEEEEEEEA 1282
             G+G+K  + ++E EGE ++E +  SEE    EEN   E + E+
Sbjct: 51   NGKGQKYEDLQEEGEGENDDEEDSNSEESNNDEENGLIEGQGES 94


>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
          Length = 177

 Score = 31.3 bits (71), Expect = 2.0
 Identities = 16/37 (43%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            K  E E+EVE  E EE+ ++   EE  E SE E   E
Sbjct: 6    KNEEVEEEVEATETEETVEEV-VEETPEKSELELANE 41


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein (DUF2052). 
            This entry is of sequences of two conserved domains
            separated by a region of low complexity, spanning some
            200 residues. The function is unknown.
          Length = 178

 Score = 31.2 bits (71), Expect = 2.1
 Identities = 14/43 (32%), Positives = 23/43 (53%)

Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVP 1284
              ++  ES +E E + EEE+ ++ EE +     EEEEE+    
Sbjct: 75   EQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSY 117


>gnl|CDD|176394 cd01799, Hoil1_N, Ubiquitin-like domain of HOIL1.  HOIL1_N   HOIL-1
           (heme-oxidized IRP2 ubiquitin ligase-1) is an E3
           ubiquitin-protein ligase that recognizes heme-oxidized
           IRP2 (iron regulatory protein2) and is thought to affect
           the turnover of oxidatively damaged proteins.  Hoil-1
           has an amino-terminal ubiquitin-like domain as well as
           an RBR signature consisting of two RING finger domains
           separated by an IBR/DRIL domain.
          Length = 75

 Score = 29.4 bits (66), Expect = 2.2
 Identities = 15/31 (48%), Positives = 17/31 (54%), Gaps = 2/31 (6%)

Query: 628 YPPEVQRLVPGRENADDTELLPD--ITTNDD 656
           +PP VQR V G+  A D E L    I TN D
Sbjct: 38  FPPAVQRWVIGQRLARDQETLYSHGIRTNGD 68


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
            family consists of several hypothetical bacterial
            proteins of around 200 residues in length. The function
            of this family is unknown.
          Length = 214

 Score = 31.6 bits (72), Expect = 2.2
 Identities = 17/52 (32%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKV 1297
            A  SED+ +  + E+ +++SEEE  EE+ E  +E E     +    +NV K 
Sbjct: 72   AANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTE--SNVEKE 121


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes the
            fifth essential DNA polymerase in yeast EC:2.7.7.7. Pol5p
            is localised exclusively to the nucleolus and binds near
            or at the enhancer region of rRNA-encoding DNA repeating
            units.
          Length = 784

 Score = 32.5 bits (74), Expect = 2.2
 Identities = 24/129 (18%), Positives = 40/129 (31%), Gaps = 26/129 (20%)

Query: 1223 IVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            ++ +     +    F G        E  D+ E E E   + +SE E   E+ EE+E+E+ 
Sbjct: 631  VLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDD 690

Query: 1283 VPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDS 1342
                        A+    V P        L  V     A                G    
Sbjct: 691  ------------AEANEGVVPIDKAVRRALPKVLNLPDAL--------------DGGDSE 724

Query: 1343 EDNSQEEEQ 1351
            ++   ++EQ
Sbjct: 725  DEEGMDDEQ 733



 Score = 31.0 bits (70), Expect = 5.9
 Identities = 28/129 (21%), Positives = 44/129 (34%), Gaps = 12/129 (9%)

Query: 1077 VEPEEIQELLYHPRPDEWPQGDRDATCEKIITGLEEVMGLSVAEPFLVPVDISSLSEENS 1136
            V  E +Q +L   + DE          +++  G EE       E  L   D     E  +
Sbjct: 620  VTNEGLQLILDVLKADENKSR-----HQQLFEGEEE------DEDDLEETD-DDEDECEA 667

Query: 1137 HSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYD 1196
              DS SES S  +  EED +++D   +     +       +   L      +  D  D +
Sbjct: 668  IEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEE 727

Query: 1197 SGDRYAMRA 1205
              D   M  
Sbjct: 728  GMDDEQMMR 736


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
            presumed domain is functionally uncharacterized. This
            domain family is found in bacteria and eukaryotes, and is
            typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 32.1 bits (73), Expect = 2.2
 Identities = 43/225 (19%), Positives = 68/225 (30%), Gaps = 62/225 (27%)

Query: 1136 SHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTS-------------LG 1182
            SH  S S S S+S+  +  S+         ++ M PRR +    +               
Sbjct: 112  SHPRSRSASVSNSKDGDRPSD----LPPSPSKTMDPRRWSPTKATWLESALNKPESPKHK 167

Query: 1183 RQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRG-----QTIVTSRTRGKQSYNKF 1237
             Q  + P    D         R RQS+   SV  GR         +   RT    S++K 
Sbjct: 168  PQPPQQPEWKKDLS-------RLRQSR--ASVDLGRTNSFKEVTPVGLMRTPPPGSHSKS 218

Query: 1238 SGRGRGRKAPES---------EDEVEGEEEEE----------------------SNDQSE 1266
              +      P S         + + E    +                        +   E
Sbjct: 219  PSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAPPIDTTEE 278

Query: 1267 EEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSR 1311
            E +  E S +E EE +  +R P   +   K +   P A    + R
Sbjct: 279  ELKSPEASPKESEEASARKRSPSLLSPSPKAESPKPLASPGKSPR 323


>gnl|CDD|218218 pfam04702, Vicilin_N, Vicilin N terminal region.  This region is
            found in plant seed storage proteins, N-terminal to the
            Cupin domain (pfam00190). In Macadamia integrifolia, this
            region is processed into peptides of approximately 50
            amino acids containing a C-X-X-X-C-(10-12)X-C-X-X-X-C
            motif. These peptides exhibit antimicrobial activity in
            vitro.
          Length = 147

 Score = 30.8 bits (69), Expect = 2.3
 Identities = 24/135 (17%), Positives = 51/135 (37%), Gaps = 11/135 (8%)

Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTST 1309
            E E  G+ E++  ++  +E+     +EE++++      P RR    + +          T
Sbjct: 21   EQETRGQREQQQCERRCKEQY----KEEQQQQRQREEDPQRRYEQCQQRCQQHEPRHRPT 76

Query: 1310 SRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRAR---RTRTKGQS 1366
             + R  R+  + ++   RQ       C      ++     +Q   R  R   +   +G+ 
Sbjct: 77   CQQRCERRYEKEQQQQQRQ----YQECQQRCQQQEQRPRRQQQCQRECREQYQEHGRGEG 132

Query: 1367 YAVHSSETETESEEE 1381
              ++    E  SEE 
Sbjct: 133  ENINQEGEEERSEEG 147


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
            This family contains a number of defence peptides
            secreted from the skin of amphibians, including the
            opiate-like dermorphins and deltorphins, and the
            antimicrobial dermoseptins and temporins. The alignment
            for this family includes the signal peptide.
          Length = 46

 Score = 28.5 bits (64), Expect = 2.6
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 1258 EEESNDQSEEEEVEENSEEEEEEE 1281
            EEE  +  EE E EE  EE+ E +
Sbjct: 22   EEEKREDEEENEDEEEGEEQSEVK 45



 Score = 27.7 bits (62), Expect = 4.5
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 1250 EDEVEGEEEEESNDQSEEE 1268
            EDE E E+EEE  +QSE +
Sbjct: 27   EDEEENEDEEEGEEQSEVK 45



 Score = 27.3 bits (61), Expect = 5.4
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 1257 EEEESNDQSEEEEVEENSEEEEEE 1280
            EEE+  D+ E E+ EE  E+ E +
Sbjct: 22   EEEKREDEEENEDEEEGEEQSEVK 45


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is found
            in bacteria. Proteins in this family are typically
            between 234 to 465 amino acids in length. There is a
            conserved GEV sequence motif.Most members are annotated
            as being SprA-related.
          Length = 261

 Score = 31.7 bits (72), Expect = 2.6
 Identities = 14/74 (18%), Positives = 28/74 (37%)

Query: 1207 QSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSE 1266
                 ++  AG               +    + +   ++A ES DE    EEE   DQ+E
Sbjct: 34   APSSSEASNAGSGSSEQKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAE 93

Query: 1267 EEEVEENSEEEEEE 1280
              +     ++E+++
Sbjct: 94   AGKQLSQKQQEQQQ 107


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 31.8 bits (73), Expect = 2.7
 Identities = 17/38 (44%), Positives = 17/38 (44%), Gaps = 5/38 (13%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            K   S        EEE     EEEE EE  EE  EEEA
Sbjct: 289  KEVLSAQAQAAAAEEE-----EEEEEEEEEEEPSEEEA 321



 Score = 30.6 bits (70), Expect = 6.5
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
               ++E++     ++   + EEE EE  EEEEEE +
Sbjct: 282  DALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317



 Score = 30.2 bits (69), Expect = 7.8
 Identities = 17/37 (45%), Positives = 18/37 (48%), Gaps = 10/37 (27%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            A  +  E E EEEEE     EEEE     E  EEE A
Sbjct: 296  AQAAAAEEEEEEEEE-----EEEE-----EPSEEEAA 322


>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein. 
          Length = 346

 Score = 31.6 bits (72), Expect = 2.9
 Identities = 22/89 (24%), Positives = 37/89 (41%), Gaps = 5/89 (5%)

Query: 698 RSRANGESSAATEGTSPSGTAPTAQNKNFVKRLPLSQHKIVVERINAFAEIESEEFIKES 757
           RSR++  SS+ +    PS  +    ++N   R   S   +V   + A A++   +    S
Sbjct: 155 RSRSSSRSSSRSNSRGPSRGSSRNNSRN---RNSSSPDDLVAAVLAALAKLGFGKQKSSS 211

Query: 758 KKLFDSSTTTQPLVVKKEKKKKVVRQART 786
           KK    +  +     KK+  K   R  RT
Sbjct: 212 KKPSRVTKKSAAEAAKKQLNKP--RWKRT 238


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 31.9 bits (73), Expect = 2.9
 Identities = 9/50 (18%), Positives = 23/50 (46%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVA 1295
              + +D+   +E+EE  +  E E++ ++ +   +E+     R  R+    
Sbjct: 157  DEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206


>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
            subunit E2; Reviewed.
          Length = 411

 Score = 31.7 bits (73), Expect = 2.9
 Identities = 10/55 (18%), Positives = 16/55 (29%), Gaps = 1/55 (1%)

Query: 1251 DEVEGEEEEESNDQSEEEEVEENSEEEEEE-EAVPRRRPVRRANVAKVKRAVPPA 1304
            +E    E   + + + E    E +        A P       A  A   +A P  
Sbjct: 77   EEEGEAEAAAAAEAAPEAPAPEPAPAAAAAAAAAPAAAAAPAAPAAAAAKASPAV 131


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 31.9 bits (72), Expect = 3.0
 Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 10/81 (12%)

Query: 772 VKKEKKKKVVRQA-RTRAEYSEHIQDQYE---DPDNLSLSGSEESASDSDQSQVDVSDLC 827
           VKK   KKV +   R + EYS+    ++E   D D+        S  ++        D  
Sbjct: 22  VKKHAAKKVNKALDRAQDEYSQRSYSRFEDEDDDDDFPAPADGYSRGEAAD------DEE 75

Query: 828 ESSSSSGGTSSSDYSDWIHEG 848
           E  +SS  T   D  D I+EG
Sbjct: 76  EGEASSDATEGHDEDDEIYEG 96


>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306).  This
            family of proteobacterial species proteins has no known
            function.
          Length = 115

 Score = 30.0 bits (68), Expect = 3.3
 Identities = 11/49 (22%), Positives = 14/49 (28%)

Query: 1237 FSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPR 1285
            F  R   RK     +E     E    + +         EEEE E     
Sbjct: 2    FLSRWSRRKLAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEE 50


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
            procyclic acidic repetitive protein (PARP) like
            sequences. The procyclic acidic repetitive protein (parp)
            genes of Trypanosoma brucei encode a small family of
            abundant surface proteins whose expression is restricted
            to the procyclic form of the parasite. They are found at
            two unlinked loci, parpA and parpB; transcription of both
            loci is developmentally regulated.
          Length = 145

 Score = 30.3 bits (67), Expect = 3.4
 Identities = 22/57 (38%), Positives = 26/57 (45%), Gaps = 1/57 (1%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPP 1303
             E E E EGEEE E  +++ EEE E   E E E E  P   P      A +K    P
Sbjct: 78   EEPEPEEEGEEEPEP-EETGEEEPEPEPEPEPEPEPEPEPEPEPEPGAATLKSVALP 133



 Score = 30.3 bits (67), Expect = 3.8
 Identities = 27/70 (38%), Positives = 34/70 (48%), Gaps = 8/70 (11%)

Query: 1247 PESEDEVEGEEEEESNDQSEEE-EVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAP 1305
             E E E EGEEE E  ++ EEE E EE  EEE E E  P   P       + +    P P
Sbjct: 68   EEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEP-------EPEPEPEPEP 120

Query: 1306 DTSTSRLRSV 1315
            +   + L+SV
Sbjct: 121  EPGAATLKSV 130


>gnl|CDD|182033 PRK09697, PRK09697, protein secretion protein GspB; Provisional.
          Length = 139

 Score = 30.4 bits (68), Expect = 3.4
 Identities = 23/83 (27%), Positives = 33/83 (39%), Gaps = 4/83 (4%)

Query: 1205 ARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQ 1264
            ARQ   +  V+A     T+       KQS   +  + +     E   E E E      + 
Sbjct: 51   ARQLWVLWIVKAEV---TVEAETPAFKQSTQHYFFKKQPLPVVE-SVEEEDEPGVAVENA 106

Query: 1265 SEEEEVEENSEEEEEEEAVPRRR 1287
                E EEN+ EE +E+A  R R
Sbjct: 107  PSSSEDEENTVEESDEKAGLRER 129


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 31.3 bits (72), Expect = 3.5
 Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 5/40 (12%)

Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
            GR  +  E+ +E E E EEE     EEEE EE +EE E E
Sbjct: 224  GRQGRLAEAAEEEEEEAEEE-----EEEEEEEEAEEAEAE 258


>gnl|CDD|220023 pfam08801, Nucleoporin_N, Nup133 N terminal like.  Nup133 is a
           nucleoporin that is crucial for nuclear pore complex
           (NPC) biogenesis. The N terminal forms a seven-bladed
           beta propeller structure. This family now contains other
           sized nucleoporins, including Nup155, Nup8, Nuo132,
           Nup15 and Nup170.
          Length = 411

 Score = 31.7 bits (72), Expect = 3.5
 Identities = 17/81 (20%), Positives = 28/81 (34%), Gaps = 11/81 (13%)

Query: 242 LRGCSGEISDIAIDN-RNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNF--CP 298
           +     EI  + +D+ R  LL   T   TIR+W+L            + ++  V+     
Sbjct: 186 VPSSDREIVSLRVDSSRGRLLYTLTSKGTIRVWDLSP--------SGEKLVREVDIYEII 237

Query: 299 LEVNGFNYLASTSTDGCIGFW 319
           L     +   S ST       
Sbjct: 238 LRDLQESIPKSASTLSSKSLK 258


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
            timeless (tim) gene is essential for circadian function
            in Drosophila. Putative homologues of Drosophila tim have
            been identified in both mice and humans (mTim and hTIM,
            respectively). Mammalian TIM is not the true orthologue
            of Drosophila TIM, but is the likely orthologue of a fly
            gene, timeout (also called tim-2). mTim has been shown to
            be essential for embryonic development, but does not have
            substantiated circadian function. Some family members
            contain a SANT domain in this region.
          Length = 507

 Score = 31.6 bits (71), Expect = 3.6
 Identities = 28/135 (20%), Positives = 47/135 (34%), Gaps = 7/135 (5%)

Query: 1251 DEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTS 1310
               +  EE+E +       + E +EE  E E   R   + +A   +  R+         +
Sbjct: 308  RAADDREEDECDQAVPLVPLTEENEEAMENEQFQR---LLKALGLRPPRSGQEGFWRIPA 364

Query: 1311 RLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVH 1370
            +L S +   RA   +G +              + +  E E    R+  R R  G +    
Sbjct: 365  KLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLA---- 420

Query: 1371 SSETETESEEEESQA 1385
            S E E   EEE+   
Sbjct: 421  SPEEEALGEEEQKAP 435


>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
            This family contains EBNA-3A, -3B, and -3C which are
            latent infection nuclear proteins important for
            Epstein-Barr virus (EBV)-induced B-cell immortalisation
            and the immune response to EBV infection.
          Length = 254

 Score = 31.2 bits (71), Expect = 3.6
 Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 1239 GRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
            G+  G  A  +  E + +   E    SE E+ E  S++EE    VPR  PV+ 
Sbjct: 196  GQALGLSALRTPGEPDDDATVE--TSSESEDSESESDDEELPYIVPRMEPVQG 246


>gnl|CDD|226293 COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis, outer
            membrane].
          Length = 284

 Score = 31.3 bits (71), Expect = 3.9
 Identities = 13/40 (32%), Positives = 15/40 (37%), Gaps = 1/40 (2%)

Query: 1057 WDLEPPKEGKEATEPGGSLPVEPEEIQELLYHPRPDEWPQ 1096
               EPP  G    EP    P  P   Q +L  P P  W +
Sbjct: 246  SWFEPPPPGTTKPEPKAP-PPLPPSCQAVLDEPAPSSWLE 284


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
            Members of this family are bacterial proteins with a
            conserved motif [KR]FYDLN, sometimes flanked by a pair of
            CXXC motifs, followed by a long region of low complexity
            sequence in which roughly half the residues are Asp and
            Glu, including multiple runs of five or more acidic
            residues. The function of members of this family is
            unknown.
          Length = 104

 Score = 29.6 bits (67), Expect = 3.9
 Identities = 6/49 (12%), Positives = 22/49 (44%)

Query: 1233 SYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
                   + R   A   +   + E+EE+ +D   +++ +++ +++  + 
Sbjct: 35   EVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDL 83


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
            [Transcription].
          Length = 175

 Score = 30.5 bits (69), Expect = 4.1
 Identities = 12/34 (35%), Positives = 25/34 (73%), Gaps = 1/34 (2%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E++DE E +++E      E++EV+E+ +++E+EE
Sbjct: 138  ENDDEDE-DDDEIVEILIEDDEVDEDEDDDEDEE 170



 Score = 29.3 bits (66), Expect = 8.8
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            E ED+ E  E    +D+ +E+E ++  EE+EE++
Sbjct: 142  EDEDDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
            subunit (TFIIF-alpha).  Transcription initiation factor
            IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
            II-associating protein 74 (RAP74) is the large subunit of
            transcription factor IIF (TFIIF), which is essential for
            accurate initiation and stimulates elongation by RNA
            polymerase II.
          Length = 528

 Score = 31.5 bits (71), Expect = 4.2
 Identities = 29/144 (20%), Positives = 46/144 (31%), Gaps = 23/144 (15%)

Query: 1138 SDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDS 1197
             D   +       S++  ED D     K ++    +  +      +  R    D  +YDS
Sbjct: 216  KDLEGDDEDDGDESDKGGEDGD-EEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDS 274

Query: 1198 GDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEE 1257
             D                 GR    I  S           SG     +  +   E+  + 
Sbjct: 275  DD-------------GDDEGREEDYISDSSA---------SGNDPEEREDKLSPEIPAKP 312

Query: 1258 EEESNDQSEEEEVEENSEEEEEEE 1281
            E E ++ SEE E E+N EE    +
Sbjct: 313  EIEQDEDSEESEEEKNEEEGGLSK 336


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3 subunit. 
            This is a family of proteins which are subunits of the
            eukaryotic translation initiation factor 3 (eIF3). In
            yeast it is called Hcr1. The Saccharomyces cerevisiae
            protein eIF3j (HCR1) has been shown to be required for
            processing of 20S pre-rRNA and binds to 18S rRNA and eIF3
            subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.8 bits (70), Expect = 4.2
 Identities = 11/33 (33%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 1254 EGEEEEESNDQS-EEEEVEENSEEEEEEEAVPR 1285
            + E+E++    S +EEE EE  EE+ +  A  +
Sbjct: 25   DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAK 57


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 31.1 bits (70), Expect = 4.4
 Identities = 25/110 (22%), Positives = 39/110 (35%), Gaps = 18/110 (16%)

Query: 1053 NLSPWDLEPPKEGKEATEPGGSLPVEPEEIQELLYHPRPDEWPQG--DRDATCEKIITGL 1110
              SP  +  P+EG+ A     +     ++  E   H   D+ P      D          
Sbjct: 402  RGSP--MAAPEEGRAAAAAAAA-NAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMD 458

Query: 1111 EEVMGLSVAEPFLVPVDISSLSEENSHSDSSSESGSSSQSSEEDSEDEDW 1160
            E+  G           D  +  E++S  DSS    ++S   E DS D+ W
Sbjct: 459  EQESG-----------DERADGEDDS--DSSYSYSTTSSEDESDSADDSW 495



 Score = 30.0 bits (67), Expect = 9.8
 Identities = 12/44 (27%), Positives = 20/44 (45%)

Query: 1338 GDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETESEEE 1381
             DSD +D  ++E                SY+  SSE E++S ++
Sbjct: 450  ADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADD 493


>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
            selection and in elongation by RNA polymerase II
            [Transcription].
          Length = 521

 Score = 31.2 bits (70), Expect = 4.4
 Identities = 37/173 (21%), Positives = 61/173 (35%), Gaps = 20/173 (11%)

Query: 1129 SSLSEENSHSDSSSES-GSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQ--Y 1185
            SS  + N + D + ES G  ++ SEE+     +   GK +    R      T + R+   
Sbjct: 37   SSDEDNNENVDYAEESGGEGNEKSEEEKFKNPYRLEGKFKDEADRAKIMAMTEIERESIL 96

Query: 1186 RENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQ--TIVTSRTRGKQSYNKFSGRGRG 1243
             E   ++S        A+R  Q  +       RR      +TS    K+       + R 
Sbjct: 97   FEREEEISKLMERRELAIRMEQQHRSSGCTDTRRSTRYEPLTSAAEEKKKKLLELKKTRE 156

Query: 1244 RKA-----------PESE----DEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            R+               +    +E E   +EE      EE VE+ S  ++  E
Sbjct: 157  REERLYSERHIELQRFKDYKELEESEQGLQEEYTPSYAEEAVEDISRTDDFAE 209


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
            chain; Provisional.
          Length = 1033

 Score = 31.3 bits (71), Expect = 4.8
 Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 3/61 (4%)

Query: 1241 GRGRKAPESEDEVEG---EEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKV 1297
                   E E+E+E        +S+D     E E+  EE++EE   P +  + +   A++
Sbjct: 6    NTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARL 65

Query: 1298 K 1298
            K
Sbjct: 66   K 66


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 30.5 bits (69), Expect = 4.8
 Identities = 12/41 (29%), Positives = 14/41 (34%)

Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            R     E +         E    +EEEE EE   EE   E 
Sbjct: 143  RAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEE 183


>gnl|CDD|180374 PRK06060, PRK06060, acyl-CoA synthetase; Validated.
          Length = 705

 Score = 31.2 bits (70), Expect = 5.0
 Identities = 15/57 (26%), Positives = 25/57 (43%), Gaps = 10/57 (17%)

Query: 1069 TEPGGSLPVEPEEIQELLYHPRPDEWPQGDRDATCEKIITGLEEVMGLSVAEPFLVP 1125
            TE  G + V+P E++ L+           + +A  E  +  + E  G S  + FLV 
Sbjct: 390  TEVIGGVNVDPREVERLII----------EDEAVAEAAVVAVRESTGASTLQAFLVA 436


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 30.8 bits (69), Expect = 5.2
 Identities = 24/104 (23%), Positives = 44/104 (42%), Gaps = 20/104 (19%)

Query: 1187 ENPSDVSDY-------DSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSG 1239
            ++ S+  DY         G+  ++   +S     V+       +     RG Q       
Sbjct: 54   QDLSEACDYIAIEEYSACGNVASIANDKSDNENDVELEGLNIIVKNEEERGTQ------- 106

Query: 1240 RGRGRKAPESEDEVEG-EEEEESNDQSEEEEVEENSEEEEEEEA 1282
                 K  E +++VE  EE EE     EE + +E+SE+++E+E+
Sbjct: 107  -----KEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKES 145


>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
            TFIIE alpha.  TFIIE is compiled of two subunits, alpha
            and beta. This family of proteins are the C terminal
            domain of the alpha subunit of the protein which is the
            largest subunit and contains several functional domains
            which are important for basal transcription and cell
            growth. The C terminal end of the protein binds directly
            to the amino-terminal PH domain of p62/Tfb1 (of IIH)
            which is involved in the recruitment of the general
            transcription factor IIH to the transcription
            preinitiation complex. P53 competes for the same binding
            site as TFIIE alpha which shows their structural
            similarity. Like p53, TFIIE alpha 336-439 can activate
            transcription in vivo.
          Length = 86

 Score = 28.6 bits (64), Expect = 5.4
 Identities = 12/36 (33%), Positives = 18/36 (50%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
            ES+D+    E        EEE+ E++  EE E+E  
Sbjct: 7    ESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVEDEPT 42


>gnl|CDD|224751 COG1838, FumA, Tartrate dehydratase beta subunit/Fumarate hydratase
           class I, C-terminal domain [Energy production and
           conversion].
          Length = 184

 Score = 30.0 bits (68), Expect = 5.9
 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 8/62 (12%)

Query: 619 FLVDLNGTPYPPEVQRLVPGRENADDTELL--PDITTNDDGHVEIVEGSERNGERPVEVE 676
             VDLN TP   E+ +L  G     D   L    +T  D  H  ++E  +R  E PV+++
Sbjct: 1   MKVDLN-TPLKEEIAKLKVG-----DVVYLSGKIVTGRDAAHKRLLEMLDRGEELPVDLK 54

Query: 677 GQ 678
           G 
Sbjct: 55  GH 56


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 30.9 bits (69), Expect = 6.0
 Identities = 17/33 (51%), Positives = 21/33 (63%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
            E  DE   EE EE+ +++ EE VEEN EE  EE
Sbjct: 1062 EEYDEENVEEIEENIEENIEENVEENVEENVEE 1094


>gnl|CDD|205277 pfam13096, ShortName, Family description. 
          Length = 177

 Score = 29.9 bits (67), Expect = 6.1
 Identities = 17/84 (20%), Positives = 31/84 (36%), Gaps = 14/84 (16%)

Query: 738 VVERINAFAEIESEEFIKESKKLFDSSTTTQPLVVKKEKKKKVVRQARTRAEYSE----- 792
           + E+ N    +     I E  +  D S      V + E+ + ++   R+   +SE     
Sbjct: 23  IQEKENLSRRVTDLNIIMEGVEFSDLSKF----VSRAEEDRDLLLFFRSLRSFSEWCEHR 78

Query: 793 -----HIQDQYEDPDNLSLSGSEE 811
                H +++Y D   L    S E
Sbjct: 79  RETFEHFKEKYPDVVRLPEGVSGE 102


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 30.8 bits (69), Expect = 6.2
 Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 5/81 (6%)

Query: 1255 GEEEEESNDQSEEEEVEENSEEEEEEEAV--PRRRPVRRANVAKVKRAVPPAPDTSTSRL 1312
            GEE EE ++  E  + E   E+EEEE     P   P+    ++ + R    A   +  R 
Sbjct: 83   GEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPEVNPLDAEGLSGLAREACDALKKALLRH 142

Query: 1313 RSVRQNGRAKRT---NGRQSS 1330
            R + Q  R  R    NG Q S
Sbjct: 143  RFLWQRRRQARCEQHNGPQQS 163


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
            family of proteins are involved in 60S ribosomal
            biogenesis. They are specifically involved in the
            processing beyond the 27S stage of 25S rRNA maturation.
            This family contains sequences that bear similarity to
            the glioma tumour suppressor candidate region gene 2
            protein (p60). This protein has been found to interact
            with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.4 bits (69), Expect = 6.3
 Identities = 21/61 (34%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVR-----RANVAKVKR 1299
                  DE+     EES+D  EEE  +E++ E  E E  P  +PVR     +A   K KR
Sbjct: 229  PEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKR 288

Query: 1300 A 1300
             
Sbjct: 289  R 289


>gnl|CDD|193000 pfam12357, PLD_C, Phospholipase D C terminal.  This domain family
           is found in eukaryotes, and is approximately 70 amino
           acids in length. The family is found in association with
           pfam00168, pfam00614. There is a conserved FPD sequence
           motif. This family is the C terminal of phospholipase D.
           PLD is a major plant lipid-degrading enzyme which is
           involved in signal transduction.
          Length = 74

 Score = 28.2 bits (64), Expect = 6.3
 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 7/42 (16%)

Query: 605 DEVTNLPPHLMPPPFLVDLNG--TPYP-----PEVQRLVPGR 639
           +EVT+LP HL+  P  VD +G  T  P     P+    V G 
Sbjct: 31  EEVTDLPGHLLRYPVGVDRDGKVTELPGCEFFPDTGAKVLGA 72


>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727).  This
            domain family is found in bacteria and eukaryotes, and is
            approximately 100 amino acids in length.
          Length = 100

 Score = 28.7 bits (65), Expect = 6.4
 Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEE--NSEEEEEE 1280
            RG      EDE+   EEE+ +D+ EEEE E   +   E++E
Sbjct: 15   RGELPEPEEDEILELEEEDDDDEDEEEEYELLASFYHEDQE 55


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit; Provisional.
          Length = 482

 Score = 30.7 bits (70), Expect = 6.4
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            + +KA   + + E EEEE+   + E+EE EE +EEE+EEE
Sbjct: 430  KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469


>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
          Length = 903

 Score = 31.0 bits (70), Expect = 6.5
 Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 17/91 (18%)

Query: 710 EGTSPSGTAPTAQNKNFVKRLPLSQHKIVVERINAFAEIESEEFIKESKKLFDSS----- 764
           EG S +G A  A+N+ F   LPL    + +E+            IK +KK+F++S     
Sbjct: 582 EGESAAGNAKQARNREFQAVLPLKGKILNIEK------------IKNNKKVFENSEIKLL 629

Query: 765 TTTQPLVVKKEKKKKVVRQARTRAEYSEHIQ 795
            T+  L V     ++      T+A   E +Q
Sbjct: 630 ITSIGLSVNPVTWRQYDLSHGTKASKDESVQ 660


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.9 bits (71), Expect = 6.6
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
             V+   EE+      E E  ++ E+EEEEE
Sbjct: 173  FVDPNAEEDPAHVGSELEELDDDEDEEEEE 202



 Score = 30.1 bits (69), Expect = 8.8
 Identities = 13/46 (28%), Positives = 21/46 (45%)

Query: 1237 FSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
             +G  R R+  +   +   EE+        EE  ++  EEEEE+E 
Sbjct: 160  ENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDEN 205



 Score = 30.1 bits (69), Expect = 9.8
 Identities = 9/36 (25%), Positives = 16/36 (44%)

Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
            P +E++      E      +E+E EE  E ++   A
Sbjct: 176  PNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAA 211


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.  Genome
            duplication is precisely regulated by cyclin-dependent
            kinases CDKs, which bring about the onset of S phase by
            activating replication origins and then prevent
            relicensing of origins until mitosis is completed. The
            optimum sequence motif for CDK phosphorylation is
            S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
            potential phosphorylation sites. Drc1 is required for DNA
            synthesis and S-M replication checkpoint control. Drc1
            associates with Cdc2 and is phosphorylated at the onset
            of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
            replication by phosphorylating Drc1 and regulating its
            association with Cut5. Sld2 and Sld3 represent the
            minimal set of S-CDK substrates required for DNA
            replication.
          Length = 397

 Score = 30.5 bits (69), Expect = 6.8
 Identities = 29/158 (18%), Positives = 47/158 (29%), Gaps = 36/158 (22%)

Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRR----RPVRRANVAKVKRAVPP 1303
            +  D +   E EE+     EEEV ++    +E   V ++    R  RR  +    R V  
Sbjct: 248  DELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKM----RPVRA 303

Query: 1304 APDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTK 1363
             P    S                            +SD  +   + ++  S         
Sbjct: 304  KPSDEPS--------------------------LPESDIHEEIPKLDE-KSLSEFLGYMG 336

Query: 1364 GQSYAVHSSETETESEEEESQAAQVSFSRRGRMRKPTA 1401
            G        + E   EE E +  +V    R R   P +
Sbjct: 337  GIDEDDEDEDDEESKEEVEKK-QKVKKKPRKRKVNPVS 373


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
            the C subunit of DNA polymerase delta. It carries the
            essential residues for binding to the Pol1 subunit of
            polymerase alpha, from residues 293-332, which are
            characterized by the motif D--G--VT, referred to as the
            DPIM motif. The first 160 residues of the protein form
            the minimal domain for binding to the B subunit, Cdc1, of
            polymerase delta, the final 10 C-terminal residues,
            362-372, being the DNA sliding clamp, PCNA, binding
            motif.
          Length = 427

 Score = 30.6 bits (69), Expect = 6.9
 Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 3/51 (5%)

Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENS---EEEEEEEAVPRRRPVRRAN 1293
            A ES  + E EEE    D   E+E  E +   E+E+E+E  P         
Sbjct: 235  ASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEE 285


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
            Transcription initiation factor IIA (TFIIA) is a
            heterotrimer, the three subunits being known as alpha,
            beta, and gamma, in order of molecular weight. The N and
            C-terminal domains of the gamma subunit are represented
            in pfam02268 and pfam02751, respectively. This family
            represents the precursor that yields both the alpha and
            beta subunits. The TFIIA heterotrimer is an essential
            general transcription initiation factor for the
            expression of genes transcribed by RNA polymerase II.
            Together with TFIID, TFIIA binds to the promoter region;
            this is the first step in the formation of a
            pre-initiation complex (PIC). Binding of the rest of the
            transcription machinery follows this step. After
            initiation, the PIC does not completely dissociate from
            the promoter. Some components, including TFIIA, remain
            attached and re-initiate a subsequent round of
            transcription.
          Length = 332

 Score = 30.5 bits (69), Expect = 7.0
 Identities = 13/82 (15%), Positives = 32/82 (39%), Gaps = 13/82 (15%)

Query: 1202 AMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEES 1261
            AM+  +  + Q+  + RR    +       +                S+D+ + +  E  
Sbjct: 222  AMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGD-------------GSDDDDDEDAIESD 268

Query: 1262 NDQSEEEEVEENSEEEEEEEAV 1283
             D S+++  +E+ E+  + + V
Sbjct: 269  LDDSDDDVSDEDGEDLFDTDNV 290


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 29.9 bits (67), Expect = 7.5
 Identities = 11/47 (23%), Positives = 22/47 (46%), Gaps = 1/47 (2%)

Query: 1239 GRGRGRKAPESEDEVEGEEE-EESNDQSEEEEVEENSEEEEEEEAVP 1284
            G+ + +K  E+E   +G E+ ++  +   E E  E  +   E+   P
Sbjct: 92   GKKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPEDPPPEDTPPP 138


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
            Superfamily; Provisional.
          Length = 844

 Score = 30.6 bits (69), Expect = 7.8
 Identities = 22/147 (14%), Positives = 46/147 (31%), Gaps = 19/147 (12%)

Query: 1132 SEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSD 1191
             +E+   +++S   S+   S  +  D D     +A                 Q  +   D
Sbjct: 100  DDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN--------------DQTNDFDQD 145

Query: 1192 VSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESED 1251
             S     D+     +QS  + S +     +   T  T     YN  +           + 
Sbjct: 146  DSSNSQTDQ---GLKQSVNLSSAEKLIEEKKGQTENTFKF--YNFGNDGEEAAAKDGGKS 200

Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEE 1278
            +         +D   ++   E++EE++
Sbjct: 201  KSSDPGPLNDSDGQGDDGDPESAEEDK 227


>gnl|CDD|219675 pfam07972, Flavodoxin_NdrI, NrdI Flavodoxin like. 
          Length = 120

 Score = 28.6 bits (65), Expect = 8.1
 Identities = 9/35 (25%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 720 TAQNKNFVKRLPLSQHKIVVERINAFAEIESEEFI 754
           T   + FV++L L   +I +  I+   E++ E ++
Sbjct: 7   TGNTRRFVEKLGLPAQRIPIREIDEELEVD-EPYV 40


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
            region.  This family includes the N-terminal regions of
            the junctin, junctate and aspartyl beta-hydroxylase
            proteins. Junctate is an integral ER/SR membrane calcium
            binding protein, which comes from an alternatively
            spliced form of the same gene that generates aspartyl
            beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
            catalyzes the post-translational hydroxylation of
            aspartic acid or asparagine residues contained within
            epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 29.9 bits (67), Expect = 8.4
 Identities = 25/129 (19%), Positives = 41/129 (31%), Gaps = 13/129 (10%)

Query: 1229 RGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
              K   +K    G G++     DE +    E+S+D+ E  E  +  EE E+   V     
Sbjct: 113  LEKIVVSKQEEDGPGKE--PQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETAS 170

Query: 1289 VRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQE 1348
             +     K K +     D+     ++ R                      D D EDN  E
Sbjct: 171  EQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEE-----------DYDEEDNPVE 219

Query: 1349 EEQYGSRRA 1357
            + +      
Sbjct: 220  DSKAIKEEL 228



 Score = 29.9 bits (67), Expect = 8.7
 Identities = 29/172 (16%), Positives = 55/172 (31%), Gaps = 13/172 (7%)

Query: 1242 RGRKAPESEDEVEGEEEEE----SNDQSEEEEVEENSEEEEEEEAVPRRRPVRRA---NV 1294
                 P  E E   EEE +       Q  EEEV+E  +   E+  V ++          +
Sbjct: 73   SEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQL 132

Query: 1295 AKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGS 1354
             + K  +    D     L + + +      +     +       D   + + QE E    
Sbjct: 133  DEDKFLLAEDSDDRQETLEAGKVH-EETEDSYHVEETASEQYKQDMKEKASEQENEDSKE 191

Query: 1355 RRARRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRRGRMRKPTAKLRGL 1406
               +  RTK       + +   E  +EE    + S + +  + K   + +  
Sbjct: 192  PVEKAERTKA-----ETDDVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQQE 238


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
            represents a conserved region approximately 60 residues
            long within the eukaryotic targeting protein for Xklp2
            (TPX2). Xklp2 is a kinesin-like protein localised on
            centrosomes throughout the cell cycle and on spindle pole
            microtubules during metaphase. In Xenopus, it has been
            shown that Xklp2 protein is required for centrosome
            separation and maintenance of spindle bi-polarity. TPX2
            is a microtubule-associated protein that mediates the
            binding of the C-terminal domain of Xklp2 to
            microtubules. It is phosphorylated during mitosis in a
            microtubule-dependent way.
          Length = 57

 Score = 27.3 bits (61), Expect = 8.8
 Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 7/40 (17%)

Query: 1257 EEEESNDQSEEEEVEENSEEEEEEEA-------VPRRRPV 1289
            EE+E   ++E+EE E   +EEEEE         VP+  P+
Sbjct: 18   EEKEKALEAEKEEAEARQKEEEEEAIKQLRKELVPKANPM 57


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is a
            family of uncharacterized proteins.
          Length = 449

 Score = 30.0 bits (67), Expect = 9.4
 Identities = 13/42 (30%), Positives = 29/42 (69%)

Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
            R  G++A E   E E ++EE+ +D  +E+E +++ +++E++E
Sbjct: 39   RKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDE 80


>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449).  This
            presumed domain is functionally uncharacterized. This
            domain is found in eukaryotes. This domain is typically
            between 181 to 207 amino acids in length. This domain has
            two conserved sequence motifs: PIP and CEICG. The domain
            carries a zinc-finger domain of the C2H2-type.
          Length = 187

 Score = 29.5 bits (67), Expect = 9.9
 Identities = 14/45 (31%), Positives = 24/45 (53%)

Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
            RK   +E+E +   +E S D SE+   +++ ++EEE    P   P
Sbjct: 27   RKQALTEEERQASADESSEDASEDGSDDDSDDDEEEPIYNPLNLP 71


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.132    0.390 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 71,547,693
Number of extensions: 7130460
Number of successful extensions: 10702
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8614
Number of HSP's successfully gapped: 489
Length of query: 1407
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1298
Effective length of database: 6,103,016
Effective search space: 7921714768
Effective search space used: 7921714768
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 65 (28.9 bits)