RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15301
         (947 letters)



>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 51.3 bits (123), Expect = 2e-06
 Identities = 21/41 (51%), Positives = 25/41 (60%)

Query: 204 RWKLHGYTRNPDPVLNPKKKIFETVAPDLKTNDSNQATYKG 244
           R    G+   PD VLNPKKK++ET+ PDL TN    A YK 
Sbjct: 740 RVTFPGFEGEPDDVLNPKKKVWETLQPDLHTNSELVACYKD 780


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 45.5 bits (108), Expect = 1e-04
 Identities = 25/171 (14%), Positives = 64/171 (37%), Gaps = 6/171 (3%)

Query: 711 DNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDE 770
               +++  S+    + +SK  SSK  ++++ +++  + N D    T     S  +S + 
Sbjct: 34  STAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFST--IDSSTSDSNNI 91

Query: 771 GNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSDLEGTARKSS 830
            + I     +T    +   +  +     ++ +Q+    +SD+            + + ++
Sbjct: 92  IDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDY----EQPRNSEKSTN 147

Query: 831 DLEGTARKSNDLEQSAQKSSDLEGTAQKSSDLQGTAQKSSDLDGTARKSSD 881
           D    +  S   +   Q S   +   QK+     T   +S+    + K + 
Sbjct: 148 DSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQ 198



 Score = 39.0 bits (91), Expect = 0.012
 Identities = 30/236 (12%), Positives = 78/236 (33%), Gaps = 13/236 (5%)

Query: 587 TSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIE 646
           T+      S K  D ET +  S++  ++ ++       +ND+   T   +          
Sbjct: 35  TAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSN----- 89

Query: 647 DRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKS 706
                ++  +       N       ++  D   +        N   ++   +  ++   S
Sbjct: 90  ----NIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFN---LNSDISDYEQPRNS 142

Query: 707 TNILDNKGSSNKSSEAKRPNS-SSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDK 765
               ++   ++ SS     ++ SSK + +   +  + +++   ++   P     T  +  
Sbjct: 143 EKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQS 202

Query: 766 ESQDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSD 821
            SQ   +   ++   +KD      S ++    + S+     QK    +    K+  
Sbjct: 203 NSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTET 258



 Score = 34.8 bits (80), Expect = 0.23
 Identities = 44/251 (17%), Positives = 92/251 (36%), Gaps = 30/251 (11%)

Query: 522 VTPSKTGGESTSNYQVKQTSND---EKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNN 578
           V P+ T   + ++   K ++         +     TSKD  +++  K    + +    +N
Sbjct: 16  VLPTLTSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKAD--NNNTSNQDN 73

Query: 579 KECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKR 638
            +       S    S SN   D   K +P   I           +  +D  + T      
Sbjct: 74  NDKKFSTIDS--STSDSNNIIDFIYKNLPQTNINQLLTK-----NKYDDNYSLTTLIQNL 126

Query: 639 KRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTN 698
             +  DI D         +    S    N+   +    +K   + Q+ S+  +  ++   
Sbjct: 127 FNLNSDISD--------YEQPRNSEKSTNDSNKNSDSSIKNDTDTQS-SKQDKADNQ--- 174

Query: 699 VLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETS 758
              +   S N   +  +SNK   + +P   ++ NS   ++++    S  K N+ S ++++
Sbjct: 175 ---KAPSSNNTKPS--TSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQ-SMSDSA 228

Query: 759 ITSESDKESQD 769
           + S  D+ S+D
Sbjct: 229 LDSILDQYSED 239



 Score = 32.1 bits (73), Expect = 1.7
 Identities = 44/291 (15%), Positives = 97/291 (33%), Gaps = 18/291 (6%)

Query: 510 KERKTNSTSSFEVTPSKTGGESTS-NYQVKQTSNDEKYYEKKRVSTSKDEVNENGRKKRR 568
             +K+N   + + T SK   ++ + N   +  ++ +        S S + ++   +   +
Sbjct: 42  DSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQ 101

Query: 569 ISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDE 628
            + +  +T NK       T+ ++   +     S+ ++  ++E     N   +   +S   
Sbjct: 102 TNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSE--KSTNDSNKNSDSSIKN 159

Query: 629 PTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESE 688
            T T+   + K   +             +    S      +Q++      + A   T ++
Sbjct: 160 DTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSN-----SQPASDDTANQ 214

Query: 689 NREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVK 748
                D   ++ D      +ILD   S +     K   S SK + ++ +   N       
Sbjct: 215 KSSSKDN-QSMSDSA--LDSILDQY-SEDAKKTQKDYASQSKKDKTETSNTKNPQLPTQD 270

Query: 749 SNKD------SPTETSITSESDKESQDEGNPIVHKPEETKDKNVRKSSDIE 793
             K       S       S +   S  E  P +   +++   NV  S D  
Sbjct: 271 ELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDSGSFNVVDSKDTR 321


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 42.6 bits (100), Expect = 0.001
 Identities = 60/297 (20%), Positives = 101/297 (34%), Gaps = 30/297 (10%)

Query: 564 RKKRRISTSNEITNN-------KECSERLPTSNLEVSQSNKDGDSETKRVPSNEIL--NE 614
           RK+R +   NE           KE +  L  ++ E S+++  G + TK +   +    N 
Sbjct: 181 RKQREVQKENEFYMQLLQQALPKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNH 240

Query: 615 ENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVL-----GLDDADESGNELNED 669
             SK     T   E +   KD+      +     I           L   +   N  N+ 
Sbjct: 241 SLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKS 300

Query: 670 QTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSS 729
           + S +            S          +       + N  ++   S+ S+    P+SS 
Sbjct: 301 RPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSV 360

Query: 730 KPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKNVRKS 789
             N SK    S  SS    S KD+   ++               + +   E K       
Sbjct: 361 SDNESKQKRASKSSSGARDSKKDASGMSA------------NGTVENCIPENKISTPSAI 408

Query: 790 SDIEGTGQK-SSDLQDTAQKSSDLEGTARKSSDLEGTARKSSDLEGTARKSNDLEQS 845
             +E   +K  ++LQ   Q  S+L       + LE + +  SDL G  +K ND+ Q+
Sbjct: 409 ERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLK--SDL-GQLKKENDMLQT 462


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 37.4 bits (86), Expect = 0.046
 Identities = 46/304 (15%), Positives = 103/304 (33%), Gaps = 34/304 (11%)

Query: 493  EESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTSNDEKYYEKKRV 552
            EE +++  A E    E  ++K       E    K              +  ++  E K+ 
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK-----------AAEALKKEAEEAKKA 1704

Query: 553  STSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEIL 612
               K +  E  +K   +  + E    K              ++ K+ + + K+    +  
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIK------------AEEAKKEAEEDKKKAEEAKKD 1752

Query: 613  NEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLD-------DADESGNE 665
             EE  K   L    ++     +  K   I E++++  E+  + +D       D   +  E
Sbjct: 1753 EEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIE 1812

Query: 666  LNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRP 725
              ++      D KE+     +S  +E+ D     L+  D       NK + N     K  
Sbjct: 1813 GGKEGNLVINDSKEM----EDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEA 1868

Query: 726  NSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKN 785
            + + + +  + +EE  E +  ++       E  I + +     ++        +E   ++
Sbjct: 1869 DFNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLDKDEYIKRD 1928

Query: 786  VRKS 789
              ++
Sbjct: 1929 AEET 1932


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 37.3 bits (86), Expect = 0.050
 Identities = 62/356 (17%), Positives = 122/356 (34%), Gaps = 36/356 (10%)

Query: 465 SETPKKVNL-----DKVTNSTHS---KENGRVNNFREESDRVNTAREQCSEEGKERKTNS 516
            E  KK NL     +++ N+TH      N R     + +D V T  +   +E     ++S
Sbjct: 67  KEEKKKKNLNLPDYNQIQNNTHDFYIDLNERSKTPIKNNDNVTTPIKANKKEKHNLDSSS 126

Query: 517 TSSFEVTPSKTGGES--TSNYQVKQTSNDEKYYEKKRVSTSKDEVN----------ENGR 564
           +SS   + +     S  TS Y     S   K+  K         VN          E   
Sbjct: 127 SSSISSSLTNISFFSSPTSIYSCLSNSLSSKHSPKVIKENQSTHVNISSDNSPRNKEISN 186

Query: 565 KKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLST 624
           K+ +  T+   T   +   R P  N    ++N + D   ++    +   +++    + + 
Sbjct: 187 KQLKKQTNVTHTTCYDKMRRSP-RNTSTIKNNTN-DKNKEKNKEKDKNIKKDRDGDKQTK 244

Query: 625 SNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQ 684
            N E ++ +  +   RIL       ++    +     S   L         D    + HQ
Sbjct: 245 RNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQ 304

Query: 685 TESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESS 744
                +       N++ R + S+N  ++  SS+    + R ++ +  +  K    +    
Sbjct: 305 ----KKAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTTT---- 356

Query: 745 SLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSS 800
                 K +    +      K +Q   +P+ H     K  ++   S+ +G G    
Sbjct: 357 ------KHTNNTKNNKYNKTKTTQKFNHPLRHHATINKRSSMLPMSEQKGRGASEK 406


>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
           Provisional.
          Length = 1466

 Score = 36.9 bits (85), Expect = 0.065
 Identities = 36/156 (23%), Positives = 55/156 (35%), Gaps = 17/156 (10%)

Query: 613 NEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTS 672
           N E      +    ++PT+  K+N  K                 DD + + N  N    +
Sbjct: 654 NRERGSTVDVDIIGEDPTKDNKENNNKN--------------NKDDNNNNNNNNNNKINN 699

Query: 673 EAMDLKEVAEHQTESENREIV--DKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSK 730
               + E   H    +N+  +      N      KS+N  ++K S  KSS  K       
Sbjct: 700 AGSYIIEQGTHDALMKNKNGIYYTMINNQKVSSKKSSNNDNDKDSDMKSSAYKDSERGYD 759

Query: 731 PNSSKYN-EESNESSSLVKSNKDSPTETSITSESDK 765
           P+    N +  NES+S  KS K S    S  +   K
Sbjct: 760 PDEMNGNSKHENESASNKKSCKMSDENASENNAGGK 795


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 36.2 bits (84), Expect = 0.094
 Identities = 25/236 (10%), Positives = 63/236 (26%), Gaps = 24/236 (10%)

Query: 436  SLSETPKKSVILYKCPGEDFLVQPAVISLSETPK--KVNLDKVTNSTHSKENGRVNNFRE 493
               E  KK     K      +     +   E  K      +K +NS+ S +        +
Sbjct: 1175 KKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTK 1234

Query: 494  ESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTSNDEKYYEKKRVS 553
                     +       +   ++        SK G          +  N  K     + S
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEG----------KPKNAPKRVSAVQYS 1284

Query: 554  TSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILN 613
                    +G        S+      +       + L+  + ++   +  K+  +     
Sbjct: 1285 PPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQA 1344

Query: 614  EENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNED 669
              +   R L               RK+  +   +  ++  +   + ++  ++ ++D
Sbjct: 1345 SASQSSRLLR------------RPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 35.4 bits (82), Expect = 0.14
 Identities = 32/181 (17%), Positives = 60/181 (33%), Gaps = 8/181 (4%)

Query: 490 NFREESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTSNDEKYYEK 549
              EE +   +        G+ +        E    K   E+ + ++ K+ S++E+  E 
Sbjct: 406 ESDEEENEEPSK----KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELED 461

Query: 550 KRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSN 609
           +  +  +   N+  ++  +     E     E       +   V +S K  DS+ K     
Sbjct: 462 EEEAKVEKVANKLLKRSEKAQKEEEEEELDE-ENPWLKTTSSVGKSAKKQDSKKKSSSKL 520

Query: 610 EILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNED 669
           +    + SK            + EK       L D ED I+  V   +D D+        
Sbjct: 521 DKAANKISKAAVKVKKKK---KKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFK 577

Query: 670 Q 670
           Q
Sbjct: 578 Q 578


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 34.7 bits (80), Expect = 0.25
 Identities = 16/85 (18%), Positives = 34/85 (40%), Gaps = 2/85 (2%)

Query: 569 ISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDE 628
           +S     T ++E ++   TS  +  + +K+ +S+ K     +   E+   K    T    
Sbjct: 34  LSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93

Query: 629 --PTRTEKDNKRKRILEDIEDRIEE 651
             P +++K  K+    +  ED    
Sbjct: 94  KTPKKSKKTKKKPPKPKPNEDVDNA 118


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 34.7 bits (80), Expect = 0.31
 Identities = 40/249 (16%), Positives = 85/249 (34%), Gaps = 19/249 (7%)

Query: 491 FREESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTS---NDEKYY 547
             E    +   + +  E  +E +       E+       E   +   ++     N+ +  
Sbjct: 262 LEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEEL 321

Query: 548 EKKRVSTSKDEVNENGRKKRRISTSNEITNN-KECSERLPTSNLEVSQSNKDGDSETKRV 606
           E++     +         + R +   E+     E  E       ++S   ++ +   + +
Sbjct: 322 EERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEAL 381

Query: 607 PSN---------EILNEENSKKRRLSTSNDEPTRT-EKDNKRKRILEDIEDRIEEVVLGL 656
                       EI NE    KR + +  +   R  E+    K  L+++E  +EE+   L
Sbjct: 382 REELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTEL 441

Query: 657 DDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSS 716
           ++ +E   EL E        LKE      E E  E+ ++   +   +      LD   + 
Sbjct: 442 EELNEELEELEEQLEELRDRLKE-----LERELAELQEELQRLEKELSSLEARLDRLEAE 496

Query: 717 NKSSEAKRP 725
            ++S+  R 
Sbjct: 497 QRASQGVRA 505



 Score = 30.5 bits (69), Expect = 5.2
 Identities = 36/273 (13%), Positives = 89/273 (32%), Gaps = 19/273 (6%)

Query: 525 SKTGGESTSNYQVKQTSNDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSER 584
           S TGG       + Q    ++  E+  ++  + ++ +   + + +        +     R
Sbjct: 651 SITGGSRNKRSSLAQKRELKELEEE--LAELEAQLEKLEEELKSLKNELRSLEDLLEELR 708

Query: 585 LPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILED 644
                LE        +         ++ +     +  L    +E             LE+
Sbjct: 709 RQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEE-------------LEE 755

Query: 645 IEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESEN--REIVDKCTNVLDR 702
           +++R+EE+   L+  +E+  +L E+        + + E   E E    E   +   +   
Sbjct: 756 LQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815

Query: 703 VDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESS--SLVKSNKDSPTETSIT 760
           ++      +      +  E +      K +  +   E  E     L +  ++   E    
Sbjct: 816 LESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEEL 875

Query: 761 SESDKESQDEGNPIVHKPEETKDKNVRKSSDIE 793
            +  KE ++E   +  +  E + +      +IE
Sbjct: 876 EDELKELEEEKEELEEELRELESELAELKEEIE 908


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 34.0 bits (78), Expect = 0.35
 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 7/81 (8%)

Query: 710 LDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNE-------SSSLVKSNKDSPTETSITSE 762
           LD    S K+   K+P  +S+P + +  ++S++       SS    +      E++ +S 
Sbjct: 38  LDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSH 97

Query: 763 SDKESQDEGNPIVHKPEETKD 783
              ++    +    +PEET D
Sbjct: 98  PKSKAPSTESEEEEEPEETPD 118



 Score = 30.1 bits (68), Expect = 5.6
 Identities = 17/70 (24%), Positives = 30/70 (42%), Gaps = 6/70 (8%)

Query: 705 KSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESD 764
                  N   S+K +E+   +S  KP   + + ES  SS        +P+     SE +
Sbjct: 57  SRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSH---PKSKAPSTE---SEEE 110

Query: 765 KESQDEGNPI 774
           +E ++  +PI
Sbjct: 111 EEPEETPDPI 120


>gnl|CDD|109608 pfam00558, Vpu, Vpu protein.  The Vpu protein contains an
           N-terminal transmembrane spanning region and a
           C-terminal cytoplasmic region. The HIV-1 Vpu protein
           stimulates virus production by enhancing the release of
           viral particles from infected cells. The VPU protein
           binds specifically to CD4.
          Length = 81

 Score = 31.2 bits (71), Expect = 0.47
 Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 6/47 (12%)

Query: 631 RTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDL 677
           R  +  K++R +  +  RI E       A++SGNE N D+  E  DL
Sbjct: 28  REYRKIKKQREILRLIKRIRER------AEDSGNESNGDEEEELADL 68


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.5 bits (77), Expect = 0.73
 Identities = 29/133 (21%), Positives = 51/133 (38%), Gaps = 19/133 (14%)

Query: 580 ECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKRK 639
           E  +RL    +     N++ +S  +     E   EE  ++     S  E  R   +++R+
Sbjct: 325 ELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRRE 384

Query: 640 RILEDIEDRIEEVVLGLDDAD--------------ESGNELNEDQTSEAMDLKEVAEHQT 685
            I E++E+ IEE+     DA               E  +EL E +      L+   E   
Sbjct: 385 EI-EELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARE--R 441

Query: 686 ESENREIVD--KC 696
             E   +++  KC
Sbjct: 442 VEEAEALLEAGKC 454


>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB.  Isd proteins are
           iron-regulated surface proteins found in Bacillus,
           Staphylococcus and Listeria species and are responsible
           for heme scavenging from hemoproteins. The IsdB protein
           is only observed in Staphylococcus and consists of an
           N-terminal hydrophobic signal sequence, a pair of tandem
           NEAT (NEAr Transporter, pfam05031) domains which confers
           the ability to bind heme and a C-terminal sortase
           processing signal which targets the protein to the cell
           wall. IsdB is believed to make a direct contact with
           methemoglobin facilitating transfer of heme to IsdB. The
           heme is then transferred to other cell wall-bound NEAT
           domain proteins such as IsdA and IsdC.
          Length = 644

 Score = 33.4 bits (75), Expect = 0.77
 Identities = 41/175 (23%), Positives = 71/175 (40%), Gaps = 10/175 (5%)

Query: 692 IVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNK 751
           IVDK        DK+      +   N + +   P + SKP +    +ES +  S    NK
Sbjct: 447 IVDKEAFTKANADKTNK---KEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNK 503

Query: 752 DSPT---ETSITSESDKESQDEGNPI---VHKPEETKDKNVRKSSDIEGTGQKSSDLQDT 805
            SP+   E   +SES K+      P    V     T  K V  + ++      SS+    
Sbjct: 504 QSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKD 563

Query: 806 AQKSSDLEGTARKSSDLEGTARKSSDLEGTARKSNDLEQSAQKSSDLEGTAQKSS 860
             ++S     A+ S+ L+    K+++ +G  +  N+      K+  L  T ++S+
Sbjct: 564 VVQTSAGSSEAKDSAPLQKANIKNTN-DGHTQSQNNKNTQENKAKSLPQTGEESN 617


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 33.3 bits (76), Expect = 0.79
 Identities = 32/210 (15%), Positives = 73/210 (34%), Gaps = 15/210 (7%)

Query: 590 LEVSQSNKDGDSETKRVPSNEILNEE---NSKKRRLSTSNDEPTRTEKDNKRKRILEDIE 646
           L+ +  + +G S      S  + +E    N     +  +  +       N+RK + +D +
Sbjct: 34  LQSTVEDIEGPSAVDERTSGVLRDEGKHANILYNSILCNQKK--HASFLNQRKSLDDDDD 91

Query: 647 DRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQ----TESENREIVDKCTNVLDR 702
           D  +   L  DD D++GN  + + +++   L E+ +      T++ N +  D   +    
Sbjct: 92  DEFDF--LYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSN 149

Query: 703 VDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSE 762
                 +  +   S+     +     ++     YN  ++   +  K    S +       
Sbjct: 150 SQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLN 209

Query: 763 SDKESQDEGNPIVHKPEETKDKNVRKSSDI 792
                 D+G+P        +DK    +   
Sbjct: 210 DSDGQGDDGDP----ESAEEDKAASNTRAA 235


>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
           transduction mechanisms].
          Length = 612

 Score = 33.0 bits (75), Expect = 0.81
 Identities = 34/234 (14%), Positives = 63/234 (26%), Gaps = 22/234 (9%)

Query: 531 STSNYQVKQTS--NDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLPTS 588
            +S   + Q S  N  +Y    +VS      ++               ++    E     
Sbjct: 32  DSSLCGLNQASDANGNEYSPNNKVSKKDTFSDQL--------------HDALSKEFTLER 77

Query: 589 NLEVSQSNKDGDSETKRVPSN---EILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDI 645
             +  Q NK      +   S    EI           + +          N +       
Sbjct: 78  ERDRLQLNKRKYQAIRLQTSTPIVEIFKNNKDAVDPPNHTRSSGNNLSNANVKTLSAPVG 137

Query: 646 EDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDK 705
           E         LD   ++  E +  Q  E           ++   +   +   +  D    
Sbjct: 138 EHSRSNNPPNLDQNLDTEPESSISQWGELQLNPSGKTLSSQPSRKPTSENPKSESDNSKL 197

Query: 706 STNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSI 759
            T+   N    +KS   KR  S+     + YN +     S    +  S ++  I
Sbjct: 198 PTS--VNSPLPDKSL-LKRTLSNFWAERNSYNWKPLVYPSCPSEHIFSDSDVII 248


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 32.9 bits (75), Expect = 1.0
 Identities = 35/166 (21%), Positives = 66/166 (39%), Gaps = 16/166 (9%)

Query: 568 RISTSNEIT---NNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLST 624
            IS S+E T    N    + L  S+ ++   +   DS    +  N+ L+E+  KK +   
Sbjct: 6   NISISSENTIDLKNPLFIKNLKHSSYKIRLESDIKDS-LLNLDINKKLHEKLDKKNKKFN 64

Query: 625 SNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQ 684
             D+   ++K   +++     +  I++      D+  +  +        A+ L    + +
Sbjct: 65  KTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGP---LAISLMRKPKPK 121

Query: 685 TESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSK 730
           TE   ++I          V+KSTN    K  S+K    K  N+  K
Sbjct: 122 TEKLKKKIT---------VNKSTNKKKKKVLSSKDELIKYDNNKPK 158


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 32.7 bits (74), Expect = 1.3
 Identities = 39/252 (15%), Positives = 88/252 (34%), Gaps = 11/252 (4%)

Query: 555  SKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNE 614
            +  E +E  + +  +  +N +  + +  +    +  +  + N+DG  E   V  NE   E
Sbjct: 4023 ADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDD-EKMNEDGFEE--NVQENEESTE 4079

Query: 615  ENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEA 674
            +  K        + P     DN  K    D +          ++ D+     NE+   E 
Sbjct: 4080 DGVKSDEELEQGEVPEDQAIDNHPK---MDAKSTFASAEADEENTDKGIVGENEE-LGEE 4135

Query: 675  MDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKS-SEAKRPNSSSKPNS 733
              ++       E E  ++ +  +   + + ++     + G   +   +A R +       
Sbjct: 4136 DGVRGNGTADGEFE--QVQEDTSTPKEAMSEADRQYQSLGDHLREWQQANRIHEWEDLTE 4193

Query: 734  SKYNEESNESSSLVKSNKDSPTETSITSESDK-ESQDEGNPIVHKPEETKDKNVRKSSDI 792
            S+     +     VK +++   +    +E D+ +S D        P+     N+ +    
Sbjct: 4194 SQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQNPDSMNSTNIAEDEAD 4253

Query: 793  EGTGQKSSDLQD 804
            E   ++  D QD
Sbjct: 4254 EVGDKQLQDGQD 4265



 Score = 32.3 bits (73), Expect = 1.7
 Identities = 46/232 (19%), Positives = 84/232 (36%), Gaps = 22/232 (9%)

Query: 646  EDRIEEVVLGLDDADES-GNELNEDQTSEAMD--LKEVAEHQTESENREIVDKCTNVLDR 702
            E   EE      D DES   EL  D      D  + E     +E EN+++ ++   V D 
Sbjct: 3839 ELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE---VNDI 3895

Query: 703  VDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSE 762
             +  +N L+ K     + E            S  N ES+  S    +      +     +
Sbjct: 3896 PEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKED 3955

Query: 763  SDKESQDEGNPIVHKPEETKD--KNVRKSSDI-----------EGTGQKSSDLQDTAQKS 809
             ++ S D G     +P+  ++  +   ++ D+           EG   K SDL+D   ++
Sbjct: 3956 EEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEA 4015

Query: 810  SDLEGTARKSSDLEGTARKSSDLEGTARKSNDLEQSAQKSSDLEGTAQKSSD 861
            +D     ++ +D E       +       + D +      SDL    +K ++
Sbjct: 4016 ADEN---KEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNE 4064



 Score = 32.3 bits (73), Expect = 1.9
 Identities = 50/298 (16%), Positives = 103/298 (34%), Gaps = 39/298 (13%)

Query: 493  EESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTSNDEKYYEKKRV 552
            EES       ++  E+G+  +  +  +     +K+   S    +  + + D+    +   
Sbjct: 4075 EESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASA---EADEENTDKGIVGENEE 4131

Query: 553  STSKDEVNENGRKKRRISTSNE-ITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEI 611
               +D V  NG          E  +  KE             Q    GD   +   +N I
Sbjct: 4132 LGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSE------ADRQYQSLGDHLREWQQANRI 4185

Query: 612  LNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNE-LNEDQ 670
               E           +  ++   D++   + ED E+ ++ +     D  +S +   + +Q
Sbjct: 4186 HEWE--------DLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQ 4237

Query: 671  TSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSK 730
              ++M+   +A    E E  E+ DK      ++    +I D K    ++ E   P     
Sbjct: 4238 NPDSMNSTNIA----EDEADEVGDK------QLQDGQDISDIK----QTGEDTLPTEFGS 4283

Query: 731  PNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEE-TKDKNVR 787
             N      E     S  +  +D   + ++   +     DE   + +K E+ TK  ++ 
Sbjct: 4284 IN----QSEKVFELSEDEDIEDELPDYNV-KITPAMPIDEARDLWNKHEDSTKQLSIE 4336


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 32.0 bits (73), Expect = 1.4
 Identities = 37/176 (21%), Positives = 64/176 (36%), Gaps = 18/176 (10%)

Query: 597 KDGDSETKRVPSNEILNEENSKKRRLST-SNDEPTRTEKDNKRKRILEDIEDRIEEVVLG 655
            D   E + +   E L E  S++  +   S  E     +  + +   E+ E  I+ +  G
Sbjct: 142 ADSKREMEVLERLEELKELQSRRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFG 201

Query: 656 LDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGS 715
                         +T E     +  + + + E+ +      +      K T+IL  K +
Sbjct: 202 P-------------ETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKPTSILK-KSA 247

Query: 716 SNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEG 771
           + +S       S +K NS    +  +  SSLV   K +P  TS  S S  E   E 
Sbjct: 248 AKRSEAPSS--SKAKKNSRGIPKPRDALSSLVVRKKAAPESTS-QSPSSAEPTSES 300


>gnl|CDD|236195 PRK08238, PRK08238, hypothetical protein; Validated.
          Length = 479

 Score = 32.2 bits (74), Expect = 1.4
 Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 8/45 (17%)

Query: 145 RKSSRPISTGPSSLSLGFGAAAF---AGFVLA-----GFSGLVLV 181
           RK  RP ++G   +  G  AA     AG  LA      F  ++L 
Sbjct: 259 RKRRRPFASGALPIPFGLAAAPLLLLAGLALALALGPAFLLVLLA 303


>gnl|CDD|178752 PLN03213, PLN03213, repressor of silencing 3; Provisional.
          Length = 759

 Score = 32.1 bits (72), Expect = 1.5
 Identities = 65/357 (18%), Positives = 130/357 (36%), Gaps = 54/357 (15%)

Query: 469 KKVNLDKVTNSTHSKENGRVNNFREESDRVNT------AREQCSEEGKERKTNSTSSFEV 522
           +K  L++ T S      GR  N    S +  T       R++ S+  +E+K  S    EV
Sbjct: 245 RKSILNEKTPSEEGYSEGRKGNHIHPSKKRQTISLEESGRQESSQAIREKKKPS----EV 300

Query: 523 TPSKTGGESTSNYQVKQTSNDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECS 582
            P K+  E +    ++Q               S D ++ + +   +   +N  +N+   S
Sbjct: 301 VPDKSLDEPSRTKDLEQ---------------SIDNISWSQKSSWKSLMANGNSNDFSVS 345

Query: 583 ERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLST-------SNDEPTRTEKD 635
             LP      +      +++   +PS E L E+  +KR  ST         D+  R + D
Sbjct: 346 SFLPGVGSSKAVQPAPRNTDLAELPSRENLKEKTKRKRVTSTIMAEDLPVPDDIKRDDSD 405

Query: 636 NKRKRILEDIEDRIEEVVLGLDDADESGNE-------------------LNEDQTSEAMD 676
                I  D    +E        AD++ N+                   + +D   ++M 
Sbjct: 406 TMADDIERDDSAAVEYYTACESMADDAANDSVAERDDSDAVECDTAIDSMADDTAIDSMA 465

Query: 677 LKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKY 736
               ++   ES++ + V+  T +    D + +    +     + E      S   +++  
Sbjct: 466 DDAASDAVAESDDGDAVENDTAIDSMADDTASNSMAESDDGDNVEDDTAIDSMADDTAND 525

Query: 737 NEESNESSSLVKSNKDSPTET---SITSESDKESQDEGNPIVHKPEETKDKNVRKSS 790
           +  S++S SL  +  D+  +       + ++ +S D  + +       +D N  K S
Sbjct: 526 DVGSDDSESLADTVIDTSVDAVPLDFVANTEGDSGDGKSNVEKHENGAEDLNAEKES 582


>gnl|CDD|215386 PLN02727, PLN02727, NAD kinase.
          Length = 986

 Score = 32.2 bits (73), Expect = 1.8
 Identities = 22/103 (21%), Positives = 38/103 (36%), Gaps = 2/103 (1%)

Query: 708 NILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKES 767
           + +D+  S ++  E  R N      +S    +S+E  +   SN  S    S  S S   S
Sbjct: 510 SKIDDTDSISRLVETGRSNGLVSEKNSSPKYQSSEFDNGKSSNGSS--FASDGSLSVASS 567

Query: 768 QDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSS 810
              GNP  +    +   +  + S    + ++S      A   S
Sbjct: 568 ITNGNPSNNGASSSTVSDNLERSVASVSVRESQRSNGKASLGS 610


>gnl|CDD|227452 COG5123, TOA2, Transcription initiation factor IIA, gamma subunit
           [Transcription].
          Length = 113

 Score = 29.9 bits (67), Expect = 1.9
 Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 4/74 (5%)

Query: 222 KKIFET----VAPDLKTNDSNQATYKGKVWTVENVEGFVTSQRVNPLNNFSENLECGRDK 277
             + ET    V   L  + ++  T+KGK+ T    +G  T    N + + +       + 
Sbjct: 36  MHVLETFDKSVPEVLNKSVTSSLTFKGKLDTYNFCDGVWTFITKNFVMSLNNEYFNAGEV 95

Query: 278 SQVSQKIDIVADKS 291
           + V+ K  IVA   
Sbjct: 96  NIVACKARIVACAG 109


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 32.0 bits (72), Expect = 2.1
 Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 4/87 (4%)

Query: 703 VDKSTNILDNKGSSNKSSEAKRPNSSSKPNS---SKYNEESNESSSLVKSNKDSPTETSI 759
             K      N+G    SS   +  +S+K  S   SK+ E ++   S   S K S +   I
Sbjct: 129 AGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGKASASVAGI 188

Query: 760 TSESDKESQDEGNPIVHKPEETKDKNV 786
              +D+E+       +   EE  + NV
Sbjct: 189 VG-ADEEAPPAPKNTLTPLEELYETNV 214


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 31.4 bits (71), Expect = 2.9
 Identities = 31/149 (20%), Positives = 55/149 (36%), Gaps = 6/149 (4%)

Query: 715 SSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPI 774
           S N    A   N     +SS     S+ S S   S  DS +E+S +   + E     +P 
Sbjct: 379 SRNTPPSAPSSNPEPAASSS--GSSSSSSGSESSSGSDSESESSSSDSEENEPPRTASPE 436

Query: 775 VHKPEETK---DKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSDLEGTARKSSD 831
              P   K   D  + K +  + +  +S       ++  + EG  + S        K   
Sbjct: 437 PEPPSTNKWQLDNWLNKVNPHKVSPAESVSSNPPIKQPMEKEGKVKSSGSQYHPESKEPP 496

Query: 832 LEGTARKSNDLEQSAQKSSDLEGTAQKSS 860
            + ++++     ++AQK  +     QKS 
Sbjct: 497 PKSSSKEKRRP-RTAQKGPESGRGKQKSP 524


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 31.6 bits (72), Expect = 3.0
 Identities = 33/255 (12%), Positives = 84/255 (32%), Gaps = 22/255 (8%)

Query: 540  TSNDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDG 599
                 K Y +  ++       +   K+ +     E       +E L        ++ K  
Sbjct: 805  LLGSRKEY-RSYLACIIK--LQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRF 861

Query: 600  DSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDA 659
                K     +        +R+L          + D K    L+ +   +E  ++ L   
Sbjct: 862  SLLKKETIYLQSAQRVELAERQLQ-------ELKIDVKSISSLKLVNLELESEIIEL-KK 913

Query: 660  DESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKS 719
              S + +   +       + +A  +    N ++ +  +    ++ +   + + +    ++
Sbjct: 914  SLSSDLIENLE----FKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKET 969

Query: 720  SEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETS----ITSESDKESQDEGNPIV 775
            SE +  +   K  S+    E N+++S +K+ K    E S       ES K+ ++    + 
Sbjct: 970  SE-EYEDLLKK--STILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVA 1026

Query: 776  HKPEETKDKNVRKSS 790
                 +K  +   + 
Sbjct: 1027 ELQSASKIISSESTE 1041


>gnl|CDD|153001 pfam12567, CD45, Leukocyte receptor CD45.  This family of proteins
           is found in eukaryotes. Proteins in this family are
           typically between 77 and 1130 amino acids in length. The
           family is found in association with pfam00041. CD45
           plays a critical role in T-cell receptor (TCR)-mediated
           signaling. CD45 interacts with SKAP55 which is a
           transcriptional activator of IL-2.
          Length = 61

 Score = 28.3 bits (63), Expect = 3.1
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 543 DEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSER 584
           D  Y +  ++ T+K  VNEN +      T+NE+ N  EC +R
Sbjct: 3   DYLYDKSTKLFTAKLNVNENVKCGNNDCTNNELHNLPECEQR 44


>gnl|CDD|211905 TIGR04170, RNR_1b_NrdE, ribonucleoside-diphosphate reductase, class
           1b, alpha subunit.  Members of this family are NrdE, the
           alpha subunit of class 1b ribonucleotide reductase. This
           form uses a dimanganese moiety associated with a
           tyrosine radical to reduce the cellular requirement for
           iron.
          Length = 698

 Score = 30.9 bits (70), Expect = 3.5
 Identities = 14/30 (46%), Positives = 16/30 (53%)

Query: 632 TEKDNKRKRILEDIEDRIEEVVLGLDDADE 661
             K N  KR LE  EDR+  V L L + DE
Sbjct: 99  ALKTNDGKRYLERYEDRVCMVALTLANGDE 128


>gnl|CDD|185118 PRK15196, PRK15196, secreted effector protein PipB2; Provisional.
          Length = 350

 Score = 30.7 bits (69), Expect = 3.8
 Identities = 32/155 (20%), Positives = 65/155 (41%), Gaps = 13/155 (8%)

Query: 788 KSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSDLEGTARKSSDLEGTARKSNDLEQSAQ 847
           K + + G    + +L D     ++LEG     +D EG   K ++L GT+   ++ + +  
Sbjct: 155 KGAVLTGANLTAENLCDADLSGANLEGAVLFMADCEGANFKGANLSGTSLGDSNFKNACL 214

Query: 848 KSSDLEGTAQKSSDLQGTAQKSSDLDGTAR-----KSSDLDRYNIGIEDFLFGSMSEEQT 902
           + S + G     ++L G   + + L G +        +++D  N+     +   MS    
Sbjct: 215 EDSIMCGATLDHANLTGANLQHASLLGCSMIECNCSGANMDHTNLSGATLIRADMSGATL 274

Query: 903 RMLKVYSKVLTLADLVRVLQVPINPFNQQKVSFLS 937
           +   + + ++  A L R         N +K SF+S
Sbjct: 275 QGATIMAAIMEGAVLTRA--------NLRKASFIS 301


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 30.5 bits (68), Expect = 5.1
 Identities = 39/184 (21%), Positives = 79/184 (42%), Gaps = 12/184 (6%)

Query: 608  SNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELN 667
            S  I +E N   + L   N+      K++    I ED E+ +EE      DA+E+  E  
Sbjct: 914  SGNIAHEINLINKELKNQNENVPEHLKEHAEANIEEDAEENVEE------DAEENVEENV 967

Query: 668  EDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKST--NILDNKGSSNKSSEAKRP 725
            E+   E  +++E  E   E    E V++  NV + V+++   N+ +N   + + +  +  
Sbjct: 968  EENVEE--NVEENVEENVEENVEENVEE--NVEENVEENIEENVEENVEENIEENVEEYD 1023

Query: 726  NSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKN 785
              + +       E   E+   ++ N +   E +I    ++  ++    I    EE  ++N
Sbjct: 1024 EENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEEN 1083

Query: 786  VRKS 789
            V ++
Sbjct: 1084 VEEN 1087


>gnl|CDD|236179 PRK08188, PRK08188, ribonucleotide-diphosphate reductase subunit
           alpha; Validated.
          Length = 714

 Score = 30.4 bits (69), Expect = 5.5
 Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 3/38 (7%)

Query: 639 KRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMD 676
           KR LE  EDR+  V L L   DE    L      E + 
Sbjct: 120 KRYLERFEDRVCMVALTLAQGDE---TLARQLVDEIIS 154


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 30.0 bits (67), Expect = 5.9
 Identities = 37/153 (24%), Positives = 63/153 (41%), Gaps = 20/153 (13%)

Query: 544 EKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVS----QSNKDG 599
            + + ++  S+   E     R+KR+ S S  +     C E L + + EV     + + D 
Sbjct: 104 PQMFGEEAPSSQDGEQPGPSRRKRQPSMSETMPLYTLCKEDLESMDKEVDDILGEGSDDS 163

Query: 600 DSETKRVPSNEILNEENSKKRRL----------------STSNDEPTRTEKDNKRKRILE 643
           DSE K+ P NE   E+  + R+                 S  ++   R  + +KRK   E
Sbjct: 164 DSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKRKLNEE 223

Query: 644 DIEDRIEEVVLGLDDADESGNELNEDQTSEAMD 676
           D ED   E      + DE G+    D+ + A++
Sbjct: 224 DEEDAESESSFESSNEDEEGSSSEADEMAAALE 256


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.4 bits (68), Expect = 6.2
 Identities = 35/162 (21%), Positives = 72/162 (44%), Gaps = 6/162 (3%)

Query: 712 NKGS-SNKSSEAKRPNSSSKPNSSKYNEE---SNESSSLVKSNKDSPTETSITSESDKES 767
           ++GS S   S  K+  +S    +S  NE+   S  +S    S   + ++   T + +K+ 
Sbjct: 10  SRGSMSTLRSGRKKQTASPDGRASPTNEDQRSSGRNSPSAASTSSNDSKAESTKKPNKKI 69

Query: 768 QDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSDLEGTAR 827
           ++E    +   +  ++K    + + E    K S  Q+ ++ +S  EG      + E +  
Sbjct: 70  KEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDS 129

Query: 828 KSSDLEGTARKSNDLEQSAQKSS-DLEGTAQKSSDLQGTAQK 868
           +S + EG++    D++Q  + SS  +       SD   +AQ+
Sbjct: 130 RSVNEEGSS-DPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQ 170


>gnl|CDD|185214 PRK15314, PRK15314, outer membrane protein RatB; Provisional.
          Length = 2435

 Score = 30.2 bits (67), Expect = 6.8
 Identities = 21/86 (24%), Positives = 38/86 (44%), Gaps = 3/86 (3%)

Query: 531 STSNYQVKQTSNDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLP---T 587
           +T  + V  TS DE+Y+   + ST+ D ++ + R +      +   +   C ++ P   T
Sbjct: 408 TTLGWPVGLTSGDEEYWSSSQGSTATDHISIDMRSRALTQMPDATQSLVSCVDKAPPAVT 467

Query: 588 SNLEVSQSNKDGDSETKRVPSNEILN 613
             L +S  N D      +V   E +N
Sbjct: 468 PKLVISADNFDSTVNAAKVKVGEEIN 493


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 30.1 bits (67), Expect = 7.0
 Identities = 45/261 (17%), Positives = 94/261 (36%), Gaps = 30/261 (11%)

Query: 558 EVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRV----------- 606
           E++  GRK RR  T   I       + L   + +VS S+++ D +               
Sbjct: 401 EIDNVGRKTRRQPTGKAIAEETSREDELSFDDSDVSTSDENEDVDFTGKKGAINNEDESD 460

Query: 607 ------PSNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDI-------EDRIEEVV 653
                  S+   +E     R       +   ++   + + I +         E+ IEE  
Sbjct: 461 NEEVAFDSDSQFDESEGNLRWKEGLASKLAYSQSGKRGRNIQKIFYDESLSPEECIEE-Y 519

Query: 654 LGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNK 713
            G        + + +D+  +  D+ +VA     S + ++++     L +   S   L ++
Sbjct: 520 KGESAKSSESDLVVQDEPEDFFDVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKSR 579

Query: 714 GSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNP 773
              + + ++         +  K N E  E       N+   +  S  +  ++ES DE + 
Sbjct: 580 FQKDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVD- 638

Query: 774 IVHKPEETKDKNVRKSSDIEG 794
                E  +++N RK  ++ G
Sbjct: 639 ----YETEREENARKKEELRG 655


>gnl|CDD|227554 COG5229, LOC7, Chromosome condensation complex Condensin, subunit H
           [Chromatin structure and dynamics / Cell division and
           chromosome partitioning].
          Length = 662

 Score = 30.2 bits (67), Expect = 7.2
 Identities = 52/284 (18%), Positives = 95/284 (33%), Gaps = 7/284 (2%)

Query: 560 NENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKK 619
           +E G K   ++T N     +   +  P  + E     K    E + +  + ++++     
Sbjct: 147 DEGGAKSLLLNTLNIDNTKRLYFDSSPIKDTENVGQGKLQRKEEELIERDSLVDDLMIDS 206

Query: 620 RRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQ-TSEAMDLK 678
           + L  S+++ T     N        +ED I    LG+D        +   + T    D K
Sbjct: 207 QSLGISSNDST----VNDSVISAPSMEDEILG--LGMDFIKVLRMSVCRIEGTVIVDDFK 260

Query: 679 EVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNE 738
              +          + +   + D V++     DN+ S +   +          ++ + NE
Sbjct: 261 MSMDFPAGLLISPSISEDKEIRDIVEEPAVDNDNEVSDSDGFDMGEHAGLFSGDNFELNE 320

Query: 739 ESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKNVRKSSDIEGTGQK 798
               SS+   S   S  E  +    D           H   + K K V K SD+    + 
Sbjct: 321 ILPPSSNQNPSTTGSIFEKDVYKYFDFSYFKNWAGPEHWKVQAKKKRVNKESDLLEETRT 380

Query: 799 SSDLQDTAQKSSDLEGTARKSSDLEGTARKSSDLEGTARKSNDL 842
           +     T  KS D  G  ++         ++ D EG     N L
Sbjct: 381 TITTDKTDDKSMDTSGKQKQKEASFIDFTETVDYEGMLEPGNTL 424


>gnl|CDD|178320 PLN02718, PLN02718, Probable galacturonosyltransferase.
          Length = 603

 Score = 29.9 bits (67), Expect = 7.2
 Identities = 35/154 (22%), Positives = 58/154 (37%), Gaps = 20/154 (12%)

Query: 726 NSSSKPNSSKYNEESNESSS-------LVKSNKDSPTETSITSESDKESQDEGNPIVHK- 777
           NS    NSS  N +S ES           +       +T+++S  + +       + HK 
Sbjct: 83  NSVVSYNSSDKNNDSLESEVDGGNNHKPKEEQAQVSQKTTVSSSEEVQISARDIQLNHKT 142

Query: 778 ----PEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSS------DLEGTAR 827
               P    +KN R       T +K  +++D   ++      A   S      +L    +
Sbjct: 143 QFNPPTVKHEKNTRVQPRRA-TDEKVKEIRDKIIQAKAYLNLAPPGSNSQLVKELRLRTK 201

Query: 828 KSSDLEGTARKSNDLEQSA-QKSSDLEGTAQKSS 860
           +     G A K  DL +SA Q+   +E T  K+S
Sbjct: 202 ELERAVGDATKDKDLSKSALQRMKSMEVTLYKAS 235


>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
          Length = 188

 Score = 29.2 bits (66), Expect = 7.9
 Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 16/82 (19%)

Query: 591 EVSQSNKDGDSETKRVPSN------EILNE---------ENSKKRRLSTSNDEPTRTEKD 635
             S+   + D E + + +       EIL +         E  +++ LS++  E  R E+ 
Sbjct: 17  RASEIRAEADEEAEEIIAEAEADAEEILEDREAEAEREIEQLREQELSSAKLEAKR-ERL 75

Query: 636 NKRKRILEDIEDRIEEVVLGLD 657
           N RK +LED+ +++E+ +  LD
Sbjct: 76  NARKEVLEDVRNQVEDEIASLD 97


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 29.9 bits (67), Expect = 8.1
 Identities = 46/274 (16%), Positives = 89/274 (32%), Gaps = 19/274 (6%)

Query: 580 ECSERLPTSNLEVSQSNKDGDSETKRVPSNE-ILNEENSKKRRLSTSNDEPTRTEKDNKR 638
                  +++     S K  DS        E   N E+    RL           K    
Sbjct: 392 AMLPPESSADGSKVSSPKCSDSSVALTGPVEHEDNLESKLSDRLPEVLQSVL---KLVME 448

Query: 639 KRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTN 698
           K I+ +I + +E++   L   D      N D  +E     EVA   +  E    V + + 
Sbjct: 449 KHIISEISEILEDIEAEL---DSLERSSNGDDENE-----EVAMVGSLVEESSAVIERSQ 500

Query: 699 VLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETS 758
            L      + I +            +     K +  K+ +E + +   V +      E S
Sbjct: 501 EL--EGAVSRISEFTSVLEHEVTVCQDLLDGKADLEKFIQEFSLTLEWVVNQ-----EKS 553

Query: 759 ITSESDKESQDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARK 818
           +   S + S+ + N +  K  E +  +  +                 ++K +     ++ 
Sbjct: 554 LQDVSVEASEIKKNFLGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKENLDGELSKC 613

Query: 819 SSDLEGTARKSSDLEGTARKSNDLEQSAQKSSDL 852
             DLE +  K  + E    +      ++Q+S+ L
Sbjct: 614 KDDLEESKNKLQETEKKLEELKSELDASQESNSL 647


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 28.4 bits (64), Expect = 8.3
 Identities = 13/45 (28%), Positives = 25/45 (55%)

Query: 629 PTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSE 673
           P   + D K+K+ ++  +D I++ +L  DD DE   +  +D+  E
Sbjct: 82  PLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEE 126


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.306    0.124    0.337 

Gapped
Lambda     K      H
   0.267   0.0742    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 44,294,553
Number of extensions: 4177453
Number of successful extensions: 3525
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3377
Number of HSP's successfully gapped: 173
Length of query: 947
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 841
Effective length of database: 6,236,078
Effective search space: 5244541598
Effective search space used: 5244541598
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 64 (28.4 bits)
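
Note on the statistics above: the effective lengths and the effective search space reported in the footer appear to follow from the raw lengths and the length adjustment by simple arithmetic. The sketch below recomputes them from the numbers printed in this report. The variable names are illustrative and not part of any BLAST API, and the formulas are an assumption that happens to reproduce the reported values, not a statement of how RPS-BLAST computes them internally.

```python
# Values copied from the statistics footer of this report.
query_length = 947            # "Length of query"
db_letters = 10_937_602       # "Length of database"
db_sequences = 44_354         # "Number of Sequences"
length_adjustment = 106       # "Length adjustment"

# Assumed relationship: the adjustment is subtracted once from the query
# and once per database sequence.
effective_query = query_length - length_adjustment
effective_db = db_letters - db_sequences * length_adjustment

# Assumed relationship: search space is the product of the effective lengths.
search_space = effective_query * effective_db

print(effective_query)  # compare with "Effective length of query"
print(effective_db)     # compare with "Effective length of database"
print(search_space)     # compare with "Effective search space"
```

The three printed values can be compared directly with the "Effective length of query", "Effective length of database" and "Effective search space" lines in the footer.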