RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8748
         (2004 letters)



>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins
            that cover a wide variety of functions including
            adaptor/regulatory modules in signal transduction,
            pre-mRNA processing and cytoskeleton assembly; typically
            contains a GH dipeptide 11-24 residues from its
            N-terminus and the WD dipeptide at its C-terminus and is
            40 residues long, hence the name WD40; between GH and WD
            lies a conserved core; serves as a stable propeller-like
            platform to which proteins can bind either stably or
            reversibly; forms a propeller-like structure with several
            blades where each blade is composed of a four-stranded
            anti-parallel b-sheet; instances with few detectable
            copies are hypothesized to form larger structures by
            dimerization; each WD40 sequence repeat forms the first
            three strands of one blade and the last strand in the
            next blade; the last C-terminal WD40 repeat completes the
            blade structure of the first WD40 repeat to create the
            closed ring propeller-structure; residues on the top and
            bottom surface of the propeller are proposed to
            coordinate interactions with other proteins and/or small
            ligands; 7 copies of the repeat are present in this
            alignment.
          Length = 289

 Score =  272 bits (697), Expect = 2e-82
 Identities = 100/298 (33%), Positives = 151/298 (50%), Gaps = 10/298 (3%)

Query: 1684 PKKSLKGHINKVNSVHYSGDSRHCVSGSLDGKLIIWDTWTGNKTQVIPLRSAWVMSVAFA 1743
             +++LKGH   V  V +S D +   +GS DG + +WD  TG   + +   +  V  VA +
Sbjct: 1    LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60

Query: 1744 QSGNFVACGGMDNMCTVYDLNNRDSTGAAKVIRELLGYEGFLSSCRFIDDTTLLTGSG-D 1802
              G ++A G  D    ++DL         + +R L G+  ++SS  F  D  +L+ S  D
Sbjct: 61   ADGTYLASGSSDKTIRLWDLETG------ECVRTLTGHTSYVSSVAFSPDGRILSSSSRD 114

Query: 1803 MKIVTWNLETGQRIHEVAQAHIGDVVSISLSPDGQYFVTGSVDKSCKLWDFREKKCVQTF 1862
              I  W++ETG+ +    + H   V S++ SPDG +  + S D + KLWD R  KCV T 
Sbjct: 115  KTIKVWDVETGKCLT-TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATL 173

Query: 1863 FGHEADVNSVCFHPSGLAFATGSEDKTARLFDIRADQQIATYLPPNKNSGFTSCAVSSSG 1922
             GH  +VNSV F P G    + S D T +L+D+   + + T       +G  S A S  G
Sbjct: 174  TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTL--RGHENGVNSVAFSPDG 231

Query: 1923 RYIFCGSDDNNVHVWDLLKTTHNGVLSGHENRVTSISLSPNGMALVTSSWDQFVKVWQ 1980
              +  GS+D  + VWDL        LSGH N VTS++ SP+G  L + S D  +++W 
Sbjct: 232  YLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
            only].
          Length = 466

 Score =  151 bits (382), Expect = 1e-38
 Identities = 97/313 (30%), Positives = 153/313 (48%), Gaps = 15/313 (4%)

Query: 1684 PKKSLKGHINKVNSVHYSGDSRHCVSGS-LDGKLIIWDTWTGNKTQVIPLRSAWVMSVAF 1742
              ++L+GH   V S+ +S D +   SGS LDG + +WD  TG     +   +  V S+AF
Sbjct: 147  LIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAF 206

Query: 1743 AQSGNF-VACGGMDNMCTVYDLNNRDSTGAAKVIRELL-GYEGFLSSCRFIDDTTLLTGS 1800
            +  G   +A G  D    ++DL+        K++R  L G+   + S    D + L +GS
Sbjct: 207  SPDGGLLIASGSSDGTIRLWDLST------GKLLRSTLSGHSDSVVSSFSPDGSLLASGS 260

Query: 1801 GDMKIVTWNLETGQRIHEVAQAHIGDVVSISLSPDGQYFVTGSVDKSCKLWDFREKKCVQ 1860
             D  I  W+L +   +      H   V+S++ SPDG+   +GS D + +LWD    K + 
Sbjct: 261  SDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLS 320

Query: 1861 --TFFGHEADVNSVCFHPSGLAFATG-SEDKTARLFDIRADQQIATYLPPNKNSGFTSCA 1917
              T  GHE  V+S+ F P G    +G S+D T RL+D+R  + +        +S   S +
Sbjct: 321  SLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPL---KTLEGHSNVLSVS 377

Query: 1918 VSSSGRYIFCGSDDNNVHVWDLLKTTHNGVLSGHENRVTSISLSPNGMALVTSSWDQFVK 1977
             S  GR +  GS D  V +WDL   +    L GH +RVTS+  SP+G +L + S D  ++
Sbjct: 378  FSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIR 437

Query: 1978 VWQESNTIVWKVL 1990
            +W    ++     
Sbjct: 438  LWDLKTSLKSVSF 450



 Score =  142 bits (358), Expect = 2e-35
 Identities = 82/335 (24%), Positives = 147/335 (43%), Gaps = 9/335 (2%)

Query: 1666 KLEDVTASMPDAPKVKLMPKKSLKGHINKVNSVHYSGDSRHCVSGSLDGKLIIWDTWTGN 1725
             +++ + S  +  K+    +     +   + S+  S      ++   D  + + D     
Sbjct: 1    LVDNSSTSSENKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDL---- 56

Query: 1726 KTQVIPLRSAWVMSVAFAQSGNFVACGGMDNMCTVYDLNNRDSTGAAKVIRELLGYEGFL 1785
             + ++      + S+AF+  G  +  G  D    ++DL+N +    +             
Sbjct: 57   SSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLA 116

Query: 1786 SSCRFIDDTTLLTGSGDMKIVTWNLETGQRIHEVAQAHIGDVVSISLSPDGQYFVTGS-V 1844
             S    +   L + S D  +  W+L T  ++    + H   V S++ SPDG+   +GS +
Sbjct: 117  LSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSL 176

Query: 1845 DKSCKLWDFREKKCVQTFFGHEADVNSVCFHPSG-LAFATGSEDKTARLFDIRADQQIAT 1903
            D + KLWD R  K + T  GH   V+S+ F P G L  A+GS D T RL+D+   + + +
Sbjct: 177  DGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRS 236

Query: 1904 YLPPNKNSGFTSCAVSSSGRYIFCGSDDNNVHVWDLL-KTTHNGVLSGHENRVTSISLSP 1962
             L  + +S  +S   S  G  +  GS D  + +WDL   ++    LSGH + V S++ SP
Sbjct: 237  TLSGHSDSVVSS--FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSP 294

Query: 1963 NGMALVTSSWDQFVKVWQESNTIVWKVLKVYNLEK 1997
            +G  L + S D  V++W      +   L +   E 
Sbjct: 295  DGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEG 329



 Score =  136 bits (343), Expect = 1e-33
 Identities = 93/302 (30%), Positives = 140/302 (46%), Gaps = 14/302 (4%)

Query: 1689 KGHINKVNSVHYSGDSRHCVSGSLDGKLIIWD-TWTGNKTQVIPLRSAWVMSVAFAQSGN 1747
               ++K+      G+S    S SLDG + +WD +  G   + +   S  V S+AF+  G 
Sbjct: 109  DSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGK 168

Query: 1748 -FVACGGMDNMCTVYDLNNRDSTGAAKVIRELLGYEGFLSSCRFIDD--TTLLTGSGDMK 1804
               +   +D    ++DL         K +  L G+   +SS  F  D    + +GS D  
Sbjct: 169  LLASGSSLDGTIKLWDLRTG------KPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGT 222

Query: 1805 IVTWNLETGQRIHEVAQAHIGDVVSISLSPDGQYFVTGSVDKSCKLWDFR-EKKCVQTFF 1863
            I  W+L TG+ +      H   VVS S SPDG    +GS D + +LWD R     ++T  
Sbjct: 223  IRLWDLSTGKLLRSTLSGHSDSVVS-SFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLS 281

Query: 1864 GHEADVNSVCFHPSGLAFATGSEDKTARLFDIRADQQIATYLPPNKNSGFTSCAVSSSGR 1923
            GH + V SV F P G   A+GS D T RL+D+   + +++          +S + S  G 
Sbjct: 282  GHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGS 341

Query: 1924 YIFCG-SDDNNVHVWDLLKTTHNGVLSGHENRVTSISLSPNGMALVTSSWDQFVKVWQES 1982
             +  G SDD  + +WDL        L GH N V S+S SP+G  + + S D  V++W  S
Sbjct: 342  LLVSGGSDDGTIRLWDLRTGKPLKTLEGHSN-VLSVSFSPDGRVVSSGSTDGTVRLWDLS 400

Query: 1983 NT 1984
              
Sbjct: 401  TG 402



 Score =  136 bits (343), Expect = 1e-33
 Identities = 104/363 (28%), Positives = 165/363 (45%), Gaps = 22/363 (6%)

Query: 1645 KKELTQLIDKCKEEQAKQADTKLEDVTASMPDAPKVKLMPKKSLKGHINKVNSVHYSGDS 1704
            K EL   ++         +++ +  +         +  +    L+GH + + S+ +S D 
Sbjct: 18   KSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITSIAFSPDG 77

Query: 1705 RHCVSGSLDGKLIIWDTWTGNKTQVIPLRSAWVMSVA----FAQSGN--FVACGGMDNMC 1758
               +SGS DG + +WD     +  +  L      SV+     +  GN   +A   +D   
Sbjct: 78   ELLLSGSSDGTIKLWD-LDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTV 136

Query: 1759 TVYDLNNRDSTGAAKVIRELLGYEGFLSSCRFIDDTTLL--TGSGDMKIVTWNLETGQRI 1816
             ++DL+        K+IR L G+   ++S  F  D  LL    S D  I  W+L TG+ +
Sbjct: 137  KLWDLSTP-----GKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPL 191

Query: 1817 HEVAQAHIGDVVSISLSPDGQ-YFVTGSVDKSCKLWDFREKKCVQ-TFFGHEADVNSVCF 1874
                  H   V S++ SPDG     +GS D + +LWD    K ++ T  GH   V S  F
Sbjct: 192  S-TLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVS-SF 249

Query: 1875 HPSGLAFATGSEDKTARLFDIRADQQIATYLPPNKNSGFTSCAVSSSGRYIFCGSDDNNV 1934
             P G   A+GS D T RL+D+R+   +   L  + +S   S A S  G+ +  GS D  V
Sbjct: 250  SPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGH-SSSVLSVAFSPDGKLLASGSSDGTV 308

Query: 1935 HVWDL--LKTTHNGVLSGHENRVTSISLSPNGMALVTS-SWDQFVKVWQESNTIVWKVLK 1991
             +WDL   K   +  L GHE  V+S+S SP+G  LV+  S D  +++W        K L+
Sbjct: 309  RLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLE 368

Query: 1992 VYN 1994
             ++
Sbjct: 369  GHS 371



 Score =  120 bits (301), Expect = 3e-28
 Identities = 89/271 (32%), Positives = 136/271 (50%), Gaps = 14/271 (5%)

Query: 1684 PKKSLKGHINKVNSVHYSGDSR-HCVSGSLDGKLIIWDTWTGN-KTQVIPLRSAWVMSVA 1741
            P  +L GH + V+S+ +S D      SGS DG + +WD  TG      +   S  V+S +
Sbjct: 190  PLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVS-S 248

Query: 1742 FAQSGNFVACGGMDNMCTVYDLNNRDSTGAAKVIRELLGYEGFLSSCRFI-DDTTLLTGS 1800
            F+  G+ +A G  D    ++DL +     ++ ++R L G+   + S  F  D   L +GS
Sbjct: 249  FSPDGSLLASGSSDGTIRLWDLRS-----SSSLLRTLSGHSSSVLSVAFSPDGKLLASGS 303

Query: 1801 GDMKIVTWNLETGQRIHEVA-QAHIGDVVSISLSPDGQYFVTG-SVDKSCKLWDFREKKC 1858
             D  +  W+LETG+ +  +  + H G V S+S SPDG   V+G S D + +LWD R  K 
Sbjct: 304  SDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKP 363

Query: 1859 VQTFFGHEADVNSVCFHPSGLAFATGSEDKTARLFDIRADQQIATYLPPNKNSGFTSCAV 1918
            ++T  GH  +V SV F P G   ++GS D T RL+D+     +         S  TS   
Sbjct: 364  LKTLEGHS-NVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNL--DGHTSRVTSLDF 420

Query: 1919 SSSGRYIFCGSDDNNVHVWDLLKTTHNGVLS 1949
            S  G+ +  GS DN + +WDL  +  +   S
Sbjct: 421  SPDGKSLASGSSDNTIRLWDLKTSLKSVSFS 451



 Score = 99.0 bits (245), Expect = 3e-21
 Identities = 75/234 (32%), Positives = 117/234 (50%), Gaps = 10/234 (4%)

Query: 1685 KKSLKGHINKVNSVHYSGDSRHCVSGSLDGKLIIWDT-WTGNKTQVIPLRSAWVMSVAFA 1743
            + +L GH + V S  +S D     SGS DG + +WD   + +  + +   S+ V+SVAF+
Sbjct: 235  RSTLSGHSDSVVSS-FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFS 293

Query: 1744 QSGNFVACGGMDNMCTVYDLNNRDSTGAAKVIRELLGYEGFLSSCRFIDDTTLL--TGSG 1801
              G  +A G  D    ++DL              L G+EG +SS  F  D +LL   GS 
Sbjct: 294  PDGKLLASGSSDGTVRLWDLETGKL----LSSLTLKGHEGPVSSLSFSPDGSLLVSGGSD 349

Query: 1802 DMKIVTWNLETGQRIHEVAQAHIGDVVSISLSPDGQYFVTGSVDKSCKLWDFREKKCVQT 1861
            D  I  W+L TG+ +  +      +V+S+S SPDG+   +GS D + +LWD      ++ 
Sbjct: 350  DGTIRLWDLRTGKPLKTLE--GHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRN 407

Query: 1862 FFGHEADVNSVCFHPSGLAFATGSEDKTARLFDIRADQQIATYLPPNKNSGFTS 1915
              GH + V S+ F P G + A+GS D T RL+D++   +  ++ P  K     S
Sbjct: 408  LDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPDGKVLASKS 461



 Score = 51.2 bits (121), Expect = 4e-06
 Identities = 55/182 (30%), Positives = 84/182 (46%), Gaps = 22/182 (12%)

Query: 1668 EDVTASMPDAPKVKLMPKKSLKGHINKVNSVHYSGDSRHCVSG-SLDGKLIIWDTWTGNK 1726
             D T  + D    KL+   +LKGH   V+S+ +S D    VSG S DG + +WD     K
Sbjct: 304  SDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDL-RTGK 362

Query: 1727 TQVIPLRSAWVMSVAFAQSGNFVACGGMDNMCTVYDLNNRDSTGAAKVIRELLGYEGFLS 1786
                    + V+SV+F+  G  V+ G  D    ++DL+         ++R L G+   ++
Sbjct: 363  PLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLST------GSLLRNLDGHTSRVT 416

Query: 1787 SCRF-IDDTTLLTGSGDMKIVTWNLETGQRIHEVAQAHIGDVVSISLSPDGQYFVTGSVD 1845
            S  F  D  +L +GS D  I  W+L+T              + S+S SPDG+   + S D
Sbjct: 417  SLDFSPDGKSLASGSSDNTIRLWDLKT-------------SLKSVSFSPDGKVLASKSSD 463

Query: 1846 KS 1847
             S
Sbjct: 464  LS 465


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 53.1 bits (128), Expect = 5e-09
 Identities = 17/39 (43%), Positives = 23/39 (58%)

Query: 1856 KKCVQTFFGHEADVNSVCFHPSGLAFATGSEDKTARLFD 1894
             K ++T  GH   V SV F P G   A+GS+D T R++D
Sbjct: 1    GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 46.2 bits (110), Expect = 1e-06
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 1821 QAHIGDVVSISLSPDGQYFVTGSVDKSCKLWD 1852
            + H G V S++ SPDG    +GS D + ++WD
Sbjct: 8    KGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 46.2 bits (110), Expect = 1e-06
 Identities = 16/37 (43%), Positives = 21/37 (56%)

Query: 1684 PKKSLKGHINKVNSVHYSGDSRHCVSGSLDGKLIIWD 1720
              ++LKGH   V SV +S D     SGS DG + +WD
Sbjct: 3    LLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 45.8 bits (109), Expect = 2e-06
 Identities = 15/34 (44%), Positives = 20/34 (58%)

Query: 1946 GVLSGHENRVTSISLSPNGMALVTSSWDQFVKVW 1979
              L GH   VTS++ SP+G  L + S D  V+VW
Sbjct: 5    RTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 38.5 bits (90), Expect = 7e-04
 Identities = 13/29 (44%), Positives = 14/29 (48%)

Query: 1910 NSGFTSCAVSSSGRYIFCGSDDNNVHVWD 1938
                TS A S  G  +  GSDD  V VWD
Sbjct: 11   TGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 36.9 bits (86), Expect = 0.003
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 1724 GNKTQVIPLRSAWVMSVAFAQSGNFVACGGMDNMCTVYD 1762
            G   + +   +  V SVAF+  GN +A G  D    V+D
Sbjct: 1    GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are permuted
            with respect to the structural repeats (blades) of the
            beta propeller domain.
          Length = 40

 Score = 52.3 bits (126), Expect = 1e-08
 Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 1856 KKCVQTFFGHEADVNSVCFHPSGLAFATGSEDKTARLFD 1894
             + ++T  GH   V SV F P G   A+GS+D T +L+D
Sbjct: 2    GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 46.9 bits (112), Expect = 9e-07
 Identities = 17/37 (45%), Positives = 23/37 (62%)

Query: 1684 PKKSLKGHINKVNSVHYSGDSRHCVSGSLDGKLIIWD 1720
              K+LKGH   V SV +S D ++  SGS DG + +WD
Sbjct: 4    LLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 45.8 bits (109), Expect = 2e-06
 Identities = 17/41 (41%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 1812 TGQRIHEVAQAHIGDVVSISLSPDGQYFVTGSVDKSCKLWD 1852
            +G+ +  + + H G V S++ SPDG+Y  +GS D + KLWD
Sbjct: 1    SGELLKTL-KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 44.6 bits (106), Expect = 5e-06
 Identities = 14/33 (42%), Positives = 20/33 (60%)

Query: 1947 VLSGHENRVTSISLSPNGMALVTSSWDQFVKVW 1979
             L GH   VTS++ SP+G  L + S D  +K+W
Sbjct: 7    TLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39



 Score = 38.1 bits (89), Expect = 0.001
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 1910 NSGFTSCAVSSSGRYIFCGSDDNNVHVWD 1938
                TS A S  G+Y+  GSDD  + +WD
Sbjct: 12   TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 35.8 bits (83), Expect = 0.006
 Identities = 11/40 (27%), Positives = 20/40 (50%)

Query: 1723 TGNKTQVIPLRSAWVMSVAFAQSGNFVACGGMDNMCTVYD 1762
            +G   + +   +  V SVAF+  G ++A G  D    ++D
Sbjct: 1    SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40


>gnl|CDD|227546 COG5221, DOP1, Dopey and related predicted leucine zipper
            transcription factors [Transcription].
          Length = 1618

 Score = 48.8 bits (116), Expect = 4e-05
 Identities = 28/151 (18%), Positives = 71/151 (47%), Gaps = 5/151 (3%)

Query: 1273 VTYGSQEKERVNTLLTTLMYNVI-PYLKNHSQRNAVSFHACSTLLASLSSYQYTRKAWKK 1331
            +      +++    + +++ ++I P +K  +    +  +  + LLA LS      K+W+K
Sbjct: 1285 IVLERLTEDKFQRTVLSIVNSLITPIIKLVNSSLGLKNYGLN-LLAVLSERGLEVKSWRK 1343

Query: 1332 DAMDLLLDPALFQMELRCLRAWKSIVDNLMSQEPATFRDLMTRVSLSQSNSLSLFSNKEQ 1391
            + +++  D  LF      L    S++  ++ ++P+   DL+ R     S  ++ F +++ 
Sbjct: 1344 EFVEIFNDLDLFIYGSDQLHKKSSLMRKVVVEDPSILNDLIVR---LDSGLITFFVSQDS 1400

Query: 1392 EYEQRAQLLKKLAFVILCSETDQFHKHIPEI 1422
            +   +   LK+++++I  S  D F     ++
Sbjct: 1401 DANNKTLNLKRISYLIFSSPYDYFLGFSLKL 1431



 Score = 31.4 bits (71), Expect = 6.0
 Identities = 18/76 (23%), Positives = 38/76 (50%)

Query: 1569 LLASLSSYQYTRKAWKKDAMDLLLDPALFQMELRCLRAWKSIVDNLMSQEPATFRDLMMK 1628
            LLA LS      K+W+K+ +++  D  LF      L    S++  ++ ++P+   DL+++
Sbjct: 1327 LLAVLSERGLEVKSWRKEFVEIFNDLDLFIYGSDQLHKKSSLMRKVVVEDPSILNDLIVR 1386

Query: 1629 PTSSINIYLKLQTAEL 1644
              S +  +   Q ++ 
Sbjct: 1387 LDSGLITFFVSQDSDA 1402


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 46.2 bits (109), Expect = 2e-04
 Identities = 49/165 (29%), Positives = 79/165 (47%), Gaps = 17/165 (10%)

Query: 1688 LKGHINKVNSVHYS-GDSRHCVSGSLDGKLIIWDTWTGNKTQVIPLRSAWVMSVAF-AQS 1745
            +K H  +V S+ YS  D     SGS DG + +W    G     I  + A +  V F ++S
Sbjct: 571  MKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKTK-ANICCVQFPSES 629

Query: 1746 GNFVACGGMDNMCTVYDLNNRDSTGAAKVIRELLGYEGFLSSCRFIDDTTLLTGSGDMKI 1805
            G  +A G  D+    YDL N         +  ++G+   +S  RF+D +TL++ S D  +
Sbjct: 630  GRSLAFGSADHKVYYYDLRNPKLP-----LCTMIGHSKTVSYVRFVDSSTLVSSSTDNTL 684

Query: 1806 VTWNLE-TGQRIHE------VAQAHIGDVVSISLSPDGQYFVTGS 1843
              W+L  +   I+E      +   ++ + V +S+S DG Y  TGS
Sbjct: 685  KLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVS-DG-YIATGS 727



 Score = 42.8 bits (100), Expect = 0.002
 Identities = 60/262 (22%), Positives = 108/262 (41%), Gaps = 19/262 (7%)

Query: 1734 SAWVMSVAFAQSGNFVACGGMDNMCTVYDLNN--RDSTGAAKVIRELLGYEGFLSSC--R 1789
            S  V ++ F + G F A  G++    +++  +  +D       + EL         C   
Sbjct: 483  SNLVCAIGFDRDGEFFATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNS 542

Query: 1790 FIDDTTLLTGSGDMKIVTWNLETGQRIHEVAQAHIGDVVSISL-SPDGQYFVTGSVDKSC 1848
            +I  + + + + +  +  W++   Q + E+ + H   V SI   S D     +GS D S 
Sbjct: 543  YI-KSQVASSNFEGVVQVWDVARSQLVTEMKE-HEKRVWSIDYSSADPTLLASGSDDGSV 600

Query: 1849 KLWDFREKKCVQTFFGHEADVNSVCF-HPSGLAFATGSEDKTARLFDIRADQ-QIATYLP 1906
            KLW   +   + T    +A++  V F   SG + A GS D     +D+R  +  + T + 
Sbjct: 601  KLWSINQGVSIGT-IKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIG 659

Query: 1907 PNKNSGFTSCAVSSSGRYIFCGSDDNNVHVWDL------LKTTHNGVLSGHENRVTSISL 1960
             +K   +     SS+   +   S DN + +WDL      +  T      GH N    + L
Sbjct: 660  HSKTVSYVRFVDSST---LVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGL 716

Query: 1961 SPNGMALVTSSWDQFVKVWQES 1982
            S +   + T S    V V+ ++
Sbjct: 717  SVSDGYIATGSETNEVFVYHKA 738



 Score = 37.4 bits (86), Expect = 0.085
 Identities = 39/149 (26%), Positives = 65/149 (43%), Gaps = 9/149 (6%)

Query: 1841 TGSVDKSCKLWDFREKKCVQTFFGHEADVNSVCFHPSG-LAFATGSEDKTARLFDIRADQ 1899
            + + +   ++WD    + V     HE  V S+ +  +     A+GS+D + +L+ I    
Sbjct: 550  SSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGV 609

Query: 1900 QIATYLPPNKNSGFTSCAV--SSSGRYIFCGSDDNNVHVWDLLKTTHN-GVLSGHENRVT 1956
             I T     K      C    S SGR +  GS D+ V+ +DL         + GH   V+
Sbjct: 610  SIGTI----KTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVS 665

Query: 1957 SISLSPNGMALVTSSWDQFVKVWQESNTI 1985
             +    +   LV+SS D  +K+W  S +I
Sbjct: 666  YVRFV-DSSTLVSSSTDNTLKLWDLSMSI 693


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 39.9 bits (93), Expect = 0.015
 Identities = 32/128 (25%), Positives = 47/128 (36%), Gaps = 9/128 (7%)

Query: 1778 LLGYEGFLSSCRF--IDDTTLLTGSGDMKIVTWNLETGQRIHEVA------QAHIGDVVS 1829
            LLG EG +    F   D   L T S D  I+ W +        ++      Q H   V  
Sbjct: 71   LLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGI 130

Query: 1830 ISLSPDGQ-YFVTGSVDKSCKLWDFREKKCVQTFFGHEADVNSVCFHPSGLAFATGSEDK 1888
            +S  P       +   D    +WD    K V+    H   + S+ ++  G    T S+DK
Sbjct: 131  VSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDK 190

Query: 1889 TARLFDIR 1896
               + D R
Sbjct: 191  KLNIIDPR 198



 Score = 34.1 bits (78), Expect = 0.89
 Identities = 31/123 (25%), Positives = 51/123 (41%), Gaps = 9/123 (7%)

Query: 1864 GHEADVNSVCFHP-SGLAFATGSEDKTARLFDIRADQQIATYLPP-------NKNSGFTS 1915
            G E  +  V F+P       T SED T   + I  +        P        K  G  S
Sbjct: 73   GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVS 132

Query: 1916 CAVSSSGRYIFCGSDDNNVHVWDLLKTTHNGVLSGHENRVTSISLSPNGMALVTSSWDQF 1975
               S+       G+D   V+VWD+ +     V+  H +++TS+  + +G  L T+S D+ 
Sbjct: 133  FHPSAMNVLASAGAD-MVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKK 191

Query: 1976 VKV 1978
            + +
Sbjct: 192  LNI 194



 Score = 33.3 bits (76), Expect = 1.7
 Identities = 38/164 (23%), Positives = 61/164 (37%), Gaps = 27/164 (16%)

Query: 1796 LLTGSGDMKIVTWNLETGQRIHEVAQAHIGDVVSISLSPDGQYFVTGSVDKSCKLWDFRE 1855
            L +   DM +  W++E G+ + EV + H   + S+  + DG    T S DK   + D R+
Sbjct: 141  LASAGADMVVNVWDVERGKAV-EVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRD 199

Query: 1856 KKCVQTFFGHEADVNSVCF--HPSGLAFATG---SEDKTARLFDIRA----------DQQ 1900
               V +   H +  +  C       L    G   S+ +   L+D R           DQ 
Sbjct: 200  GTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQS 259

Query: 1901 IATYLP-----------PNKNSGFTSCAVSSSGRYIFCGSDDNN 1933
             A ++P            +K  G   C    + R  FC S  + 
Sbjct: 260  SALFIPFFDEDTNLLYIGSKGEGNIRCFELMNERLTFCSSYSSV 303


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 39.1 bits (91), Expect = 0.031
 Identities = 25/150 (16%), Positives = 51/150 (34%), Gaps = 12/150 (8%)

Query: 461 KEATPPPCMKDSLDSASVETAETISLSNTSQGHQSEDSPLPSNKNLD-SPLPSNKNLDSP 519
             ++ P   +    +  +  + + + S +S      D   P++ +L  +P  +    DS 
Sbjct: 75  SFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSG 134

Query: 520 LPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLP---SNKNLDSPLPSNKNLDSPLPSNKNP 576
                      P  K  +  LPS+    +  P   ++ N       N      +PS  + 
Sbjct: 135 PGDG------SPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQ 188

Query: 577 NSPLPNNKNVSLRATSSAGN--SVESTHAH 604
           +  LP+       ++S   N  S  S   +
Sbjct: 189 SMQLPSPHFRQKFSSSDTSNGFSYPSIRKN 218



 Score = 34.1 bits (78), Expect = 1.1
 Identities = 27/147 (18%), Positives = 51/147 (34%), Gaps = 10/147 (6%)

Query: 466 PPCMKDSLDSASVETAETISLSNTSQGHQSEDSPLPSNKNLDSPLPSNKNLDSPLPSNKN 525
           P  +++  D+  + +    S    + GHQ   S   S K       +     SP  ++  
Sbjct: 43  PISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSST 102

Query: 526 LDSPLPSNKNLDSPLPSNKNLD-SPLPSNKNLDS----PLPSNKNLDSPLPSNKNPNSPL 580
             S   ++ +  SP  ++ +L  +P  +    DS      P  K  +  LPS+    +  
Sbjct: 103 -SSLNSNDGDQFSP--ASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHGTHP 159

Query: 581 P--NNKNVSLRATSSAGNSVESTHAHV 605
           P     N    A +    S +   +  
Sbjct: 160 PIVFTDNNGSHAGAPNARSRKEIPSLG 186


>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
          Length = 808

 Score = 38.4 bits (89), Expect = 0.043
 Identities = 28/98 (28%), Positives = 44/98 (44%), Gaps = 13/98 (13%)

Query: 460 EKEATPPPCMKDSL-----DSASVETAETISLSNTSQ-----GHQSEDSPLPSNKNLDSP 509
           E E   P    D++     +++S ET  ++S S + Q     G  S ++  P  +   SP
Sbjct: 2   ETETKEPEDPADNVNDVVSEASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTELQPSP 61

Query: 510 LPSNKNLDSPLPSNKNLDS---PLPSNKNLDSPLPSNK 544
           LP N  LD+P+  ++  +S     P N N   P P  K
Sbjct: 62  LPPNTTLDAPVSDSQGDESSSEQQPQNPNSTEPAPPPK 99



 Score = 34.2 bits (78), Expect = 0.82
 Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 12/105 (11%)

Query: 477 SVETAETISLSNTSQGHQSEDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNL 536
             ET E    ++      SE S   ++ +L SP  S +N+++    N + ++  P  +  
Sbjct: 2   ETETKEPEDPADNVNDVVSEASSPETDLSL-SPSQSEQNIEND-GQN-SPETQSPLTELQ 58

Query: 537 DSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNPNSPLP 581
            SPLP N  LD+P+  ++  +S         S     +NPNS  P
Sbjct: 59  PSPLPPNTTLDAPVSDSQGDES---------SSEQQPQNPNSTEP 94


>gnl|CDD|221734 pfam12722, Hid1, High-temperature-induced dauer-formation protein.
            Hid1 (high-temperature-induced dauer-formation protein 1)
            represents proteins of approximately 800 residues long
            and is conserved from fungi to humans. It contains up to
            seven potential transmembrane domains separated by
            regions of low complexity. Functionally it might be
            involved in vesicle secretion or be an inter-cellular
            signalling protein or be a novel insulin receptor.
          Length = 813

 Score = 37.3 bits (87), Expect = 0.091
 Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 6/50 (12%)

Query: 1273 VTYGSQEKERVNTLLTTLMYNVIPYLKNHSQRNAVSFHACSTLLASLSSY 1322
            +T G +  +++   L  ++YN+ PYL        +S++A S LL   S +
Sbjct: 477  ITTGKKRLQQLFPCLLEILYNLSPYLGG------LSYNASSKLLHLFSKF 520


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 36.8 bits (85), Expect = 0.15
 Identities = 23/127 (18%), Positives = 33/127 (25%), Gaps = 8/127 (6%)

Query: 469 MKDSLDSASVETAETISLSNTSQGHQSEDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDS 528
           M+    S       T S     +   S    L    NL     S +       S  N   
Sbjct: 457 MEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSSLSQEAS-LISKSTGNTHK 515

Query: 529 PLPSNK-NLDSPLPSNKNLDSPLPSNKNLDSPLPS-NKNLDSPLPSNKNPNSPLPNNKNV 586
                +      LP+         S  NL     + N + D   P + N  SP  +   V
Sbjct: 516 HSTPRRLTTLPKLPAASR-----SSKGNLIRSGANGNASSDLSSPGSINSKSPEHSVPLV 570

Query: 587 SLRATSS 593
            +     
Sbjct: 571 RVFDIHL 577


>gnl|CDD|220253 pfam09469, Cobl, Cordon-bleu domain.  The Cordon-bleu protein domain
            is highly conserved among vertebrates. The sequence
            contains three repeated lysine, arginine, and
            proline-rich regions, the KKRAP motif. The exact function
            of the protein is unknown but it is thought to be
            involved in mid-brain neural tube closure. It is
            expressed specifically in the node.
          Length = 349

 Score = 36.1 bits (83), Expect = 0.17
 Identities = 29/129 (22%), Positives = 38/129 (29%), Gaps = 8/129 (6%)

Query: 1460 SNSSSDKDSD-KSFFFFFFFFGSSLLSLLDAPESQSSPTSPVPNYRASSTPKTSG--SKS 1516
            SN S   + + K F  FF            AP S            + S P  SG   KS
Sbjct: 216  SNLSDMTEKEKKGFLGFFQRSKKKRDQCATAPASPLVNKRTFTLGNSISLPYISGVGPKS 275

Query: 1517 VLKKRWRKPRTPNNYNVQGDASNPADT-ADIPTNAPE---PCKEQRNAVSFHACSTLLAS 1572
              KKR R P  P   +       P        +       P +  R        S+L   
Sbjct: 276  EPKKR-RAPPPPMPTSQSVPQDLPPSCIVKSMSVDETDKTPEEVGRVRAGSLQLSSLSGG 334

Query: 1573 LSSYQYTRK 1581
             S  Q  ++
Sbjct: 335  QSDLQKKKR 343


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 35.7 bits (82), Expect = 0.26
 Identities = 23/147 (15%), Positives = 46/147 (31%), Gaps = 8/147 (5%)

Query: 435 DKHGGGEDDKTGDGSGE-ESGPGDGKEKEATPPPCMKDSLDSASVETAETISLSNTSQGH 493
           +     E++K  +  G  + G    K K         DS      + ++     + S   
Sbjct: 318 EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVT 377

Query: 494 QSEDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSN 553
             +       + +DS      N  SP  S     SP   +K             + +P+ 
Sbjct: 378 AKKQKEPKKEEPVDS------NPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASVPAK 431

Query: 554 K-NLDSPLPSNKNLDSPLPSNKNPNSP 579
           K   ++   S+    +P   + + +S 
Sbjct: 432 KLKTENAPKSSSGKSTPQTFSGSKSSS 458


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
            identified as the binding site of the SRP72 protein to
            SRP RNA.
          Length = 57

 Score = 31.7 bits (72), Expect = 0.34
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query: 1504 RASSTPKTSGSKSVLKKRWRKPRTPNNYN 1532
             A   P T   K    K+ RKP+ P N+N
Sbjct: 8    IAKKKPGTEQKKKKSAKKKRKPKLPKNFN 36


>gnl|CDD|149939 pfam09055, Sod_Ni, Nickel-containing superoxide dismutase.  Nickel
           containing superoxide dismutase (NiSOD) is a
           metalloenzyme containing a hexameric assembly of
           right-handed 4-helix bundles of up-down-up-down topology
           with an N-terminal His-Cys-X-X-Pro-Cys-Gly-X-Tyr motif
           that chelates the active site Ni ions. NiSOD catalyzes
           the disproportionation of superoxide to peroxide and
           molecular oxygen through alternate oxidation and
           reduction of Ni, protecting cells from the toxic
           products of aerobic metabolism.
          Length = 128

 Score = 33.5 bits (77), Expect = 0.39
 Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 602 HAHVLLYCGVYDSARTQYTLQTIRNMI 628
            AH  L CG+YD A+ +    T+  M 
Sbjct: 6   SAHCDLPCGIYDPAQARIAALTVLKMT 32


>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
            This is a family of eukaryotic translation initiation
            factors.
          Length = 194

 Score = 34.2 bits (79), Expect = 0.44
 Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 11/66 (16%)

Query: 1796 LLTGSGDM--KIVTWNLETGQRIHEVAQAHIGDVVSISLSPDGQYFVTGS------VDKS 1847
            LL G G++  +I  W++   +   ++A A   +      SPDG+YF+T +      VD  
Sbjct: 116  LLAGFGNLAGQIEFWDV---KNKKKIATAEASNATDCEWSPDGRYFLTATTSPRLRVDNG 172

Query: 1848 CKLWDF 1853
             K+W +
Sbjct: 173  FKIWHY 178


>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9.  The KAR9 protein in
           Saccharomyces cerevisiae is a cytoskeletal protein
           required for karyogamy, correct positioning of the
           mitotic spindle and for orientation of cytoplasmic
           microtubules. KAR9 localises at the shmoo tip in mating
           cells and at the tip of the growing bud in anaphase.
          Length = 626

 Score = 34.5 bits (79), Expect = 0.62
 Identities = 27/196 (13%), Positives = 52/196 (26%), Gaps = 10/196 (5%)

Query: 449 SGEESGPGDGKEKEATPPPCMKDSLDSASVETAETISLSNTSQGHQSEDSPLPSNKNLDS 508
           +  +SGPG          P  K     A +     I  +  S   +   +      +  S
Sbjct: 408 TPPDSGPGSNVSSRRVGTPGSKSDRVGAVLLRRMNIKPTLASIPDEKPSNISVFEDSETS 467

Query: 509 PLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNL--DSPL---PSN 563
           P  S    D P          LP+N   +    +  +   PL   +++          S 
Sbjct: 468 PNSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLTL-SSIPPLSPRQSIITLPTPSRPASR 526

Query: 564 KNLDSPLPSNKNPNSPLPNNKNVSLRATSSAG----NSVESTHAHVLLYCGVYDSARTQY 619
            +  S    + + +   P      +    +AG     SV       +    +  +     
Sbjct: 527 ISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAGLSFNRSVSDIEGERIGRYNLLPTRIPAL 586

Query: 620 TLQTIRNMILPNSRAF 635
             +         S + 
Sbjct: 587 PFKAESTTSSRRSSSL 602


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 34.2 bits (78), Expect = 0.92
 Identities = 17/65 (26%), Positives = 32/65 (49%), Gaps = 9/65 (13%)

Query: 1928 GSDDNNVHVWDLLKTTHN--------GVLSGHENRVTSISLSP-NGMALVTSSWDQFVKV 1978
            GS+D  + VW++     +         +L GH+ +++ I  +P N   + +S +D FV +
Sbjct: 93   GSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNI 152

Query: 1979 WQESN 1983
            W   N
Sbjct: 153  WDIEN 157


>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain.  This region of the APC
           family of proteins is known as the basic domain. It
           contains a high proportion of positively charged amino
           acids and interacts with microtubules.
          Length = 359

 Score = 33.6 bits (76), Expect = 0.97
 Identities = 35/150 (23%), Positives = 49/150 (32%), Gaps = 18/150 (12%)

Query: 464 TPPPCMKDSLDSASVET-AETISLSNTSQGHQSEDSPLPSNKNLDSPLP-------SNKN 515
             PP      D     +  +  S S       SE + L       +P         S+ +
Sbjct: 26  KGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATPPARLAKTPSSSSS 85

Query: 516 LDS----PLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPL---PSNKNLDSPLPSNKNLDS 568
             S    PLP      +     +N   P P N     P    P+   L S    +K   S
Sbjct: 86  QTSTPSQPLPRPLPRPTQSAG-RNSILPGPGNSLSQVPRTSSPARALLASSGSQHKTQKS 144

Query: 569 P--LPSNKNPNSPLPNNKNVSLRATSSAGN 596
           P  +P  +NP  P P +KN S R     G+
Sbjct: 145 PVRIPFMQNPAKPPPLSKNASSRPRPEPGS 174


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 33.3 bits (76), Expect = 1.2
 Identities = 29/150 (19%), Positives = 45/150 (30%), Gaps = 7/150 (4%)

Query: 443 DKTGDGSGEESGPGDGKEKEATPP-PCMKDSLDSASVETAETISLSNTSQGHQSEDSPLP 501
           + T  G      PG   +  +    P +  S DS   +  +    +++     +E S  P
Sbjct: 200 EVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMD---TEKSSAP 256

Query: 502 SNKNLDSPLPSNKNLDSP-LPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPL 560
             +    P    K                 P      S    + +L SP P       PL
Sbjct: 257 KPRETLDPKSPEKAPPIDTTEEELKSPEASPKESEEASARKRSPSLLSPSPK-AESPKPL 315

Query: 561 PSNKNLDSPLPSNK-NPNSPLPNNKNVSLR 589
            S         S +  P SP  N+   +LR
Sbjct: 316 ASPGKSPRDPLSPRPKPQSPPVNDFRANLR 345


>gnl|CDD|206712 cd04141, Rit_Rin_Ric, Ras-like protein in all tissues (Rit), Ras-like
            protein in neurons (Rin) and Ras-related protein which
            interacts with calmodulin (Ric).  Rit (Ras-like protein
            in all tissues), Rin (Ras-like protein in neurons) and
            Ric (Ras-related protein which interacts with calmodulin)
            form a subfamily with several unique structural and
            functional characteristics. These proteins all lack a the
            C-terminal CaaX lipid-binding motif typical of Ras family
            proteins, and Rin and Ric contain calmodulin-binding
            domains. Rin, which is expressed only in neurons, induces
            neurite outgrowth in rat pheochromocytoma cells through
            its association with calmodulin and its activation of
            endogenous Rac/cdc42. Rit, which is ubiquitously
            expressed in mammals, inhibits growth-factor
            withdrawl-mediated apoptosis and induces neurite
            extension in pheochromocytoma cells. Rit and Rin are both
            able to form a ternary complex with PAR6, a cell
            polarity-regulating protein, and Rac/cdc42. This ternary
            complex is proposed to have physiological function in
            processes such as tumorigenesis. Activated Ric is likely
            to signal in parallel with the Ras pathway or stimulate
            the Ras pathway at some upstream point, and binding of
            calmodulin to Ric may negatively regulate Ric activity.
          Length = 172

 Score = 31.4 bits (71), Expect = 2.9
 Identities = 16/42 (38%), Positives = 24/42 (57%)

Query: 1355 SIVDNLMSQEPATFRDLMTRVSLSQSNSLSLFSNKEQEYEQR 1396
            S+ D    QE + F++L+TRV L++   L L  NK    +QR
Sbjct: 82   SVTDRHSFQEASEFKELITRVRLTEDIPLVLVGNKVDLEQQR 123


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 32.0 bits (74), Expect = 3.7
 Identities = 10/39 (25%), Positives = 20/39 (51%), Gaps = 5/39 (12%)

Query: 1637 LKLQTAELKKELTQLIDKCKEEQAKQADTKLEDVTASMP 1675
            L  +  ELK+E+  L     E +  + + +LE++   +P
Sbjct: 71   LIAEVKELKEEIKAL-----EAELDELEAELEELLLRIP 104


>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1.  This protein
           family shows weak but suggestive similarity to
           translation initiation factor SUI1 and its prokaryotic
           homologs.
          Length = 173

 Score = 31.0 bits (70), Expect = 3.9
 Identities = 22/77 (28%), Positives = 28/77 (36%), Gaps = 11/77 (14%)

Query: 391 AENCIASGDLEVSTP------ADEYFKPLGTKSSSQTVIERVLDEMCEYVDKHGGGEDDK 444
            E C  SGDL+           D Y K  GT S +       + E  E       GE D 
Sbjct: 14  PEYCEFSGDLKRCKVWLSENAPDLYAKLYGTDSPTAEAEAVTIGEAQE-----EKGEKDL 68

Query: 445 TGDGSGEESGPGDGKEK 461
            G    +E+  G  K+K
Sbjct: 69  LGIQKAQEAREGGKKKK 85


>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein.  This family consists of
           Ebola and Marburg virus nucleoproteins. These proteins
           are responsible for encapsidation of genomic RNA. It has
           been found that nucleoprotein DNA vaccines can offer
           protection from the virus.
          Length = 717

 Score = 31.9 bits (72), Expect = 5.1
 Identities = 36/163 (22%), Positives = 60/163 (36%), Gaps = 17/163 (10%)

Query: 458 GKEKEATPPPCMKDSLDSASVETAETISLSNTSQGHQSEDSPLPSNKNLDS---PLPSNK 514
            ++    PPP      + AS        L+N +Q  + ED               L   +
Sbjct: 494 ERQGIPEPPPGSHQPGNRASQ------DLNNNNQ--KQEDESTNPIGKTSLRYQELTPVQ 545

Query: 515 NLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPSN- 573
             D P     + DS LP  ++ D P   N+       +     +P+  ++     +P   
Sbjct: 546 EEDEPEDQTDDDDSSLPPLESDDDPGSDNE--QGVDLTEVAPPAPVYRDEKEQDEIPHPA 603

Query: 574 KNPNSPLPNNKNVS---LRATSSAGNSVESTHAHVLLYCGVYD 613
           +NP  P  +  NV    LR+ S     +E T+ H+L   G +D
Sbjct: 604 QNPQDPTGSIGNVDSDILRSNSKPSAPLEETYMHLLRTQGPFD 646


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 31.0 bits (70), Expect = 7.4
 Identities = 26/143 (18%), Positives = 44/143 (30%), Gaps = 13/143 (9%)

Query: 466 PPCMKDSLDSASVETAETISLSNTSQGHQSEDSPLPSNKNLDSPLPSNKNLDSPLPSNKN 525
           P     +L     E +E  S   TS         L   +N       +K   S       
Sbjct: 202 PKEENHTLSVTDKEKSEASSKGLTSTK------ELVPVQNSGGNHSLSK---SSNSQTPE 252

Query: 526 LDSPLPSNKNLDSPLPSNKNLDSPLPSNKNLDSPLPS----NKNLDSPLPSNKNPNSPLP 581
           L+       +  S    + ++      +K+ DS L +      + +   PS+ + N    
Sbjct: 253 LEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKE 312

Query: 582 NNKNVSLRATSSAGNSVESTHAH 604
              N S  A  S G+    +  H
Sbjct: 313 TTSNSSSAAAGSIGSKSSKSAKH 335


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 30.7 bits (69), Expect = 7.9
 Identities = 13/67 (19%), Positives = 23/67 (34%), Gaps = 2/67 (2%)

Query: 424 ERVLDEMCEYVDKHGGGEDDKTGDGSGEESGPGDGKEKEATPPPCMKDSLDSASVETAET 483
             + +E  E  +    GED    D  GE      G+E+E          ++      +E 
Sbjct: 206 RELNEEEEEEAEGSDDGEDV--VDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEE 263

Query: 484 ISLSNTS 490
           +  +N  
Sbjct: 264 VPRNNEE 270


>gnl|CDD|113995 pfam05244, Brucella_OMP2, Brucella outer membrane protein 2.  This
            family consists of several outer membrane proteins (2a
            and 2b) from brucella bacteria. Brucellae are
            Gram-negative, facultative intracellular bacteria that
            can infect many species of animals and man.
          Length = 240

 Score = 30.6 bits (68), Expect = 8.2
 Identities = 15/63 (23%), Positives = 30/63 (47%), Gaps = 3/63 (4%)

Query: 1764 NNRDSTGAAKVIRELLGYEGFLSSCRFIDDTTLLTGSG-DMKIVTWNLETGQRIHEVAQA 1822
            ++R++  A + + +L G  G       +DD   +TG G +   +  N ET +   ++ + 
Sbjct: 116  HSREAVSAGEGVGDLAGAVGASRDHVIVDDIAEVTGEGMEFGFIDTNAETTEL--DICEL 173

Query: 1823 HIG 1825
            H G
Sbjct: 174  HDG 176


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.129    0.376 

Gapped
Lambda     K      H
   0.267   0.0728    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 97,333,696
Number of extensions: 9349805
Number of successful extensions: 6322
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6218
Number of HSP's successfully gapped: 69
Length of query: 2004
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1893
Effective length of database: 6,014,308
Effective search space: 11385085044
Effective search space used: 11385085044
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 67 (29.6 bits)