RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10114
         (819 letters)



>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score =  151 bits (383), Expect = 8e-41
 Identities = 85/288 (29%), Positives = 138/288 (47%), Gaps = 22/288 (7%)

Query: 519 VDWSSQN--VLSVGLGSCVYLWSACTSQV-TRLCDLSADGNSVTSVAWNERGNLVAVGTH 575
           V +S     + +      + +W   T ++   L   +     V + A    G  +A G+ 
Sbjct: 15  VAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD---GTYLASGSS 71

Query: 576 HGYVQVWDVSVAKQVHKLVGHTARVGALAW--NGDMLSSGSRDRMILQRDVRTPNSQSER 633
              +++WD+   + V  L GHT+ V ++A+  +G +LSS SRD+ I   DV T   +   
Sbjct: 72  DKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG--KCLT 129

Query: 634 RLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWSPHH 693
            L GH   V  + +SPD  ++AS   D  + +W+L +   + T T H   V ++A+SP  
Sbjct: 130 TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDG 189

Query: 694 HGLLASGGGTADRCIRFWNTLTGQPMQCVDT----GSQVCNLAWSKHSSELVSTHGYSQN 749
             LL+S     D  I+ W+  TG+   C+ T     + V ++A+S     L S  G    
Sbjct: 190 EKLLSSSS---DGTIKLWDLSTGK---CLGTLRGHENGVNSVAFSPDGYLLAS--GSEDG 241

Query: 750 QILVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWN 797
            I VW   T   V  L+GH+  V  LA SPDG+ + +G+ D T+R W+
Sbjct: 242 TIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289



 Score =  146 bits (371), Expect = 4e-39
 Identities = 80/254 (31%), Positives = 127/254 (50%), Gaps = 12/254 (4%)

Query: 556 NSVTSVAWNERGNLVAVGTHHGYVQVWDVSVAKQVHKLVGHTARVGALAW--NGDMLSSG 613
             VT VA++  G L+A G+  G ++VWD+   + +  L GHT  V  +A   +G  L+SG
Sbjct: 10  GGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASG 69

Query: 614 SRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWNLHSMSP 673
           S D+ I   D+ T   +  R L GH   V  + +SPD + L+S   D  + VW++ +   
Sbjct: 70  SSDKTIRLWDLETG--ECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKC 127

Query: 674 LQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWNTLTGQPMQCVDTG--SQVCNL 731
           L T   H   V ++A+SP     +AS     D  I+ W+  TG+ +    TG   +V ++
Sbjct: 128 LTTLRGHTDWVNSVAFSPDGT-FVASSS--QDGTIKLWDLRTGKCVA-TLTGHTGEVNSV 183

Query: 732 AWSKHSSELVSTHGYSQNQILVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDE 791
           A+S    +L+S    S   I +W   T   +  L GH   V  +A SPDG  + +G+ D 
Sbjct: 184 AFSPDGEKLLS--SSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDG 241

Query: 792 TLRFWNVFSKVRSQ 805
           T+R W++ +    Q
Sbjct: 242 TIRVWDLRTGECVQ 255



 Score =  134 bits (339), Expect = 7e-35
 Identities = 66/212 (31%), Positives = 110/212 (51%), Gaps = 10/212 (4%)

Query: 590 VHKLVGHTARVGALAWN--GDMLSSGSRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKW 647
              L GHT  V  +A++  G +L++GS D  I   D+ T   +  R L GH   V  +  
Sbjct: 2   RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETG--ELLRTLKGHTGPVRDVAA 59

Query: 648 SPDNQYLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRC 707
           S D  YLASG +D  + +W+L +   ++T T H + V ++A+SP    +L+S     D+ 
Sbjct: 60  SADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGR-ILSSSS--RDKT 116

Query: 708 IRFWNTLTGQPMQCVDT-GSQVCNLAWSKHSSELVSTHGYSQNQILVWKYPTLTQVAKLT 766
           I+ W+  TG+ +  +      V ++A+S      V++       I +W   T   VA LT
Sbjct: 117 IKVWDVETGKCLTTLRGHTDWVNSVAFS-PDGTFVAS-SSQDGTIKLWDLRTGKCVATLT 174

Query: 767 GHSYRVLYLAMSPDGEAIVTGAGDETLRFWNV 798
           GH+  V  +A SPDGE +++ + D T++ W++
Sbjct: 175 GHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206



 Score =  111 bits (280), Expect = 4e-27
 Identities = 53/172 (30%), Positives = 80/172 (46%), Gaps = 12/172 (6%)

Query: 631 SERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWS 690
             R L GH   V  + +SPD + LA+G  D  + VW+L +   L+T   H   V+ +A S
Sbjct: 1   LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60

Query: 691 PHHHGLLASGGGTADRCIRFWNTLTGQPMQCVDT----GSQVCNLAWSKHSSELVSTHGY 746
                L +      D+ IR W+  TG   +CV T     S V ++A+S     L S    
Sbjct: 61  ADGTYLASGSS---DKTIRLWDLETG---ECVRTLTGHTSYVSSVAFSPDGRILSS--SS 112

Query: 747 SQNQILVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWNV 798
               I VW   T   +  L GH+  V  +A SPDG  + + + D T++ W++
Sbjct: 113 RDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDL 164


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score =  144 bits (362), Expect = 1e-36
 Identities = 87/313 (27%), Positives = 156/313 (49%), Gaps = 15/313 (4%)

Query: 515 YLNLVDWSSQN---VLSVGLGSCVYLWSACTSQVTRLCDLSADGNSVTSVAWNERGNLVA 571
            +  + +S           L   + LW   T +      L+   + V+S+A++  G L+ 
Sbjct: 157 SVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLST--LAGHTDPVSSLAFSPDGGLLI 214

Query: 572 VGTHH-GYVQVWDVSVAKQVH-KLVGHTAR-VGALAWNGDMLSSGSRDRMILQRDVRTPN 628
                 G +++WD+S  K +   L GH+   V + + +G +L+SGS D  I   D+R+ +
Sbjct: 215 ASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRS-S 273

Query: 629 SQSERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWNLHSMSPLQ--TYTEHLAAVKA 686
           S   R L GH   V  + +SPD + LASG +D  + +W+L +   L   T   H   V +
Sbjct: 274 SSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSS 333

Query: 687 IAWSPHHHGLLASGGGTADRCIRFWNTLTGQPMQCVDTGSQVCNLAWSKHSSELVSTHGY 746
           +++SP    L++  GG+ D  IR W+  TG+P++ ++  S V ++++S      V + G 
Sbjct: 334 LSFSPDGSLLVS--GGSDDGTIRLWDLRTGKPLKTLEGHSNVLSVSFSPD--GRVVSSGS 389

Query: 747 SQNQILVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWNVFSKVRSQR 806
           +   + +W   T + +  L GH+ RV  L  SPDG+++ +G+ D T+R W++ + ++S  
Sbjct: 390 TDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVS 449

Query: 807 ESKSVLNLFSSIR 819
            S     L S   
Sbjct: 450 FSPDGKVLASKSS 462



 Score = 95.9 bits (237), Expect = 1e-20
 Identities = 85/337 (25%), Positives = 144/337 (42%), Gaps = 23/337 (6%)

Query: 477 SPVGPKSQKLLRSPRKATRKISRIPFKVLDAPELQDDFYLNLVDWSSQNVLSVGLGSCVY 536
           S  G     LL     +   +S +  +       +D           + +LS      + 
Sbjct: 36  SESGILLLALLSDSLVSLPDLSSLLLR-----GHEDSITSIAFSPDGELLLSGSSDGTIK 90

Query: 537 LWSACTSQVTRLCDLSADGNSVTSVAW---NERGNLVAVGTHHGYVQVWDVS-VAKQVHK 592
           LW     +           +SV+ +A    +    L+A  +  G V++WD+S   K +  
Sbjct: 91  LWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRT 150

Query: 593 LVGHTARVGALAW--NGDMLSSGSRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKWSPD 650
           L GH+  V +LA+  +G +L+SGS     + +       +    L GH   V  L +SPD
Sbjct: 151 LEGHSESVTSLAFSPDGKLLASGSSLDGTI-KLWDLRTGKPLSTLAGHTDPVSSLAFSPD 209

Query: 651 NQ-YLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIR 709
               +ASG +D  + +W+L +   L++     +     ++SP    LLASG   +D  IR
Sbjct: 210 GGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGS-LLASGS--SDGTIR 266

Query: 710 FWNTLTGQPMQCVDTG--SQVCNLAWSKHSSELVSTHGYSQNQILVWKYPTLTQVAKLT- 766
            W+  +   +    +G  S V ++A+S     L S  G S   + +W   T   ++ LT 
Sbjct: 267 LWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLAS--GSSDGTVRLWDLETGKLLSSLTL 324

Query: 767 -GHSYRVLYLAMSPDGEAIVTGAG-DETLRFWNVFSK 801
            GH   V  L+ SPDG  +V+G   D T+R W++ + 
Sbjct: 325 KGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTG 361



 Score = 78.2 bits (191), Expect = 4e-15
 Identities = 54/226 (23%), Positives = 90/226 (39%), Gaps = 11/226 (4%)

Query: 601 GALAWNGDMLSSGSRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKWSPDNQYLASGGND 660
            +L   G   S      ++    V  P+  S   L GH   +  + +SPD + L SG +D
Sbjct: 28  LSLLSLGSSESGILLLALLSDSLVSLPDLSS-LLLRGHEDSITSIAFSPDGELLLSGSSD 86

Query: 661 NRLYVWNLHSMSPL--QTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWN--TLTG 716
             + +W+L +   L       H ++V  +A S      +     + D  ++ W+  T   
Sbjct: 87  GTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGK 146

Query: 717 QPMQCVDTGSQVCNLAWSKHSSELVSTHGYSQNQILVWKYPTLTQVAKLTGHSYRVLYLA 776
                      V +LA+S     L S        I +W   T   ++ L GH+  V  LA
Sbjct: 147 LIRTLEGHSESVTSLAFSPDGKLLASGSSLD-GTIKLWDLRTGKPLSTLAGHTDPVSSLA 205

Query: 777 MSPDGEAIVTGAG-DETLRFWNVFSKVRSQR----ESKSVLNLFSS 817
            SPDG  ++     D T+R W++ +    +      S SV++ FS 
Sbjct: 206 FSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSP 251


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 47.0 bits (112), Expect = 3e-07
 Identities = 15/38 (39%), Positives = 21/38 (55%)

Query: 630 QSERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWN 667
           +  R L GH   V  + +SPD   LASG +D  + VW+
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 40.0 bits (94), Expect = 1e-04
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 760 TQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWN 797
             +  L GH+  V  +A SPDG  + +G+ D T+R W+
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 39.3 bits (92), Expect = 1e-04
 Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 671 MSPLQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWN 712
              L+T   H   V ++A+SP  + LLASG    D  +R W+
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGN-LLASGS--DDGTVRVWD 39



 Score = 34.3 bits (79), Expect = 0.010
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 551 LSADGNSVTSVAWNERGNLVAVGTHHGYVQVWD 583
           L      VTSVA++  GNL+A G+  G V+VWD
Sbjct: 7   LKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 33.1 bits (76), Expect = 0.025
 Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 2/34 (5%)

Query: 588 KQVHKLVGHTARVGALAW--NGDMLSSGSRDRMI 619
           K +  L GHT  V ++A+  +G++L+SGS D  +
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTV 35


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 46.9 bits (112), Expect = 4e-07
 Identities = 14/40 (35%), Positives = 24/40 (60%)

Query: 628 NSQSERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWN 667
           + +  + L GH   V  + +SPD +YLASG +D  + +W+
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 40.4 bits (95), Expect = 6e-05
 Identities = 13/40 (32%), Positives = 24/40 (60%)

Query: 758 TLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWN 797
           +   +  L GH+  V  +A SPDG+ + +G+ D T++ W+
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 40.0 bits (94), Expect = 1e-04
 Identities = 15/43 (34%), Positives = 21/43 (48%), Gaps = 3/43 (6%)

Query: 670 SMSPLQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWN 712
           S   L+T   H   V ++A+SP     LASG    D  I+ W+
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGK-YLASGS--DDGTIKLWD 40



 Score = 33.4 bits (77), Expect = 0.018
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 551 LSADGNSVTSVAWNERGNLVAVGTHHGYVQVWD 583
           L      VTSVA++  G  +A G+  G +++WD
Sbjct: 8   LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 32.3 bits (74), Expect = 0.054
 Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 2/34 (5%)

Query: 588 KQVHKLVGHTARVGALAW--NGDMLSSGSRDRMI 619
           + +  L GHT  V ++A+  +G  L+SGS D  I
Sbjct: 3   ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTI 36


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 42.4 bits (99), Expect = 0.001
 Identities = 44/196 (22%), Positives = 85/196 (43%), Gaps = 25/196 (12%)

Query: 622 RDVRTPNSQSERRLVGHRQEVCGLKWSPDNQYL----ASGGNDNRLYVWNLHSMSPLQTY 677
           RD+  P  +     +  R ++ G+ W   N Y+    AS   +  + VW++     +   
Sbjct: 520 RDIHYPVVE-----LASRSKLSGICW---NSYIKSQVASSNFEGVVQVWDVARSQLVTEM 571

Query: 678 TEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWNTLTGQPMQCVDTGSQVCNLAWSKHS 737
            EH   V +I +S     LLAS  G+ D  ++ W+   G  +  + T + +C + +   S
Sbjct: 572 KEHEKRVWSIDYSSADPTLLAS--GSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSES 629

Query: 738 SELVSTHGYSQNQILVW-----KYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDET 792
              ++  G + +++  +     K P  T +    GHS  V Y+    D   +V+ + D T
Sbjct: 630 GRSLA-FGSADHKVYYYDLRNPKLPLCTMI----GHSKTVSYVRFV-DSSTLVSSSTDNT 683

Query: 793 LRFWNVFSKVRSQRES 808
           L+ W++   +    E+
Sbjct: 684 LKLWDLSMSISGINET 699



 Score = 38.9 bits (90), Expect = 0.011
 Identities = 34/141 (24%), Positives = 68/141 (48%), Gaps = 9/141 (6%)

Query: 553 ADGNSVTSVAWNER-GNLVAVGTHHGYVQVWDVSVAKQVHKLVGHTARVGALAWNG---D 608
           A  + ++ + WN    + VA     G VQVWDV+ ++ V ++  H  RV ++ ++     
Sbjct: 530 ASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPT 589

Query: 609 MLSSGSRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKWSPDN-QYLASGGNDNRLYVWN 667
           +L+SGS D  +    +    S      +  +  +C +++  ++ + LA G  D+++Y ++
Sbjct: 590 LLASGSDDGSVKLWSINQGVSIGT---IKTKANICCVQFPSESGRSLAFGSADHKVYYYD 646

Query: 668 LHSMS-PLQTYTEHLAAVKAI 687
           L +   PL T   H   V  +
Sbjct: 647 LRNPKLPLCTMIGHSKTVSYV 667


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 38.8 bits (90), Expect = 0.012
 Identities = 37/138 (26%), Positives = 68/138 (49%), Gaps = 20/138 (14%)

Query: 543 SQVTRLCDLSADGNSVTSVAWN-ERGNLVAVGTHHGYVQVWDVSVAKQVHKLVGHTARVG 601
           S+V   C ++     V +V W  E G L+      G +++ +      V KL GHT+ + 
Sbjct: 26  SRVIDSCGIACSSGFV-AVPWEVEGGGLI------GAIRLENQMRKPPVIKLKGHTSSIL 78

Query: 602 ALAWN---GDMLSSGSRDRMILQRDVRTPNSQSERR-------LVGHRQEVCGLKWSPDN 651
            L +N    ++L+SGS D  I   ++   N +S +        L GH++++  + W+P N
Sbjct: 79  DLQFNPCFSEILASGSEDLTIRVWEIPH-NDESVKEIKDPQCILKGHKKKISIIDWNPMN 137

Query: 652 QY-LASGGNDNRLYVWNL 668
            Y + S G D+ + +W++
Sbjct: 138 YYIMCSSGFDSFVNIWDI 155


>gnl|CDD|227668 COG5377, COG5377, Phage-related protein, predicted endonuclease
           [DNA replication, recombination, and repair].
          Length = 319

 Score = 35.3 bits (81), Expect = 0.11
 Identities = 17/83 (20%), Positives = 29/83 (34%), Gaps = 6/83 (7%)

Query: 270 PPKLLHTSSASHTPKRHHAATEENSNTDLDSRKLSKVQRNLMTHDFSSHRNKHLDQNMNM 329
           PP L  +S+A       +A TE     DL       +         S  R+    ++   
Sbjct: 198 PPALDGSSAAEKYLALRYAKTEAVKTVDLPHFYAEALAVLH-----SIKRDISELEDFAK 252

Query: 330 DLENE-NNEMEMENLATEDENTA 351
           ++EN   NE++        +  A
Sbjct: 253 EIENRLKNELKEAEYGFTADYDA 275


>gnl|CDD|227829 COG5542, COG5542, Predicted integral membrane protein [Function
           unknown].
          Length = 420

 Score = 33.5 bits (77), Expect = 0.45
 Identities = 23/108 (21%), Positives = 36/108 (33%), Gaps = 13/108 (12%)

Query: 8   PLPLVPILAFSLYWTNILLPIIVHIVLHVTSR-VFIYWKLPSNRTYSAYWVYLPNWKVGL 66
            LP   +   SL     +L      +     R    + +   N      W+    W VGL
Sbjct: 231 LLPSGSLTYLSLLMPAWILGRNAFEIFLGYWRQSNGFKEQGKNAPNIYSWIPNNYWNVGL 290

Query: 67  PTLCINTLAFSLYWTNILLPIIVHILLHVTSRVFIYWKLPSNRTYSAY 114
           P           YW +  +P  + ILL     V  + KL   + Y ++
Sbjct: 291 P----------KYWISNNIPNFLFILL--IIIVVGFSKLYFTKQYLSH 326


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 33.3 bits (76), Expect = 0.57
 Identities = 17/49 (34%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 579 VQVWDVSVAKQVHKLVGHTARVGALAWN--GDMLSSGSRDRMILQRDVR 625
           V VWDV   K V  +  H+ ++ +L WN  G +L + S+D+ +   D R
Sbjct: 150 VNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPR 198


>gnl|CDD|227468 COG5139, COG5139, Uncharacterized conserved protein [Function
           unknown].
          Length = 397

 Score = 32.7 bits (74), Expect = 0.80
 Identities = 20/94 (21%), Positives = 30/94 (31%), Gaps = 22/94 (23%)

Query: 284 KRHHAATEENSNTDLDSRKLSKVQRNLMTHDFSSHRNKHLDQNMNMDLENENNEMEMENL 343
           K   AA +ENSN +   RK   +  +    D S  R +  DQ++             E +
Sbjct: 62  KNEEAAIDENSNVEAAERKRKHISTDF--SDMSLLRKRKNDQSLQ---PTR------EPM 110

Query: 344 ATEDENTAPPLDLYDIMSPAYRSTSANLEVKADR 377
            + D                  + S  L    DR
Sbjct: 111 DSRDSGQDFTE-----------AQSGELGDTGDR 133


>gnl|CDD|132654 TIGR03615, RutF, pyrimidine utilization flavin reductase protein F.
            This protein is observed in operons extremely similar
           to that characterized in E. coli K-12 responsible for
           the import and catabolism of pyrimidines, primarily
           uracil. This protein is a member of the flavin reductase
           family defined by pfam01613. Presumably, this protein
           recycles the flavin of the RutA luciferase-like
           oxidoreductase.
          Length = 156

 Score = 30.9 bits (70), Expect = 1.5
 Identities = 13/35 (37%), Positives = 16/35 (45%), Gaps = 5/35 (14%)

Query: 108 NRTYSAYWVYLPNWKVGLPTLCINTLLFHHRHHST 142
           NR+ SAY  +  N      TLC+NTL       S 
Sbjct: 51  NRSASAYPAFKQN-----GTLCVNTLAAGQEDLSN 80


>gnl|CDD|217344 pfam03060, NMO, Nitronate monooxygenase.  Nitronate monooxygenase
           (NMO), formerly referred to as 2-nitropropane
           dioxygenase (NPD) (EC:1.13.11.32), is an FMN-dependent
           enzyme that uses molecular oxygen to oxidize (anionic)
           alkyl nitronates and, in the case of the enzyme from
           Neurospora crassa, (neutral) nitroalkanes to the
           corresponding carbonyl compounds and nitrite. Previously
           classified as 2-nitropropane dioxygenase, but it is now
           recognized that this was the result of the slow
           ionization of nitroalkanes to their nitronate (anionic)
           forms. The enzymes from the fungus Neurospora crassa and
           the yeast Williopsis saturnus var. mrakii (formerly
           classified as Hansenula mrakii) contain non-covalently
           bound FMN as the cofactor. Active towards linear alkyl
           nitronates of lengths between 2 and 6 carbon atoms and,
           with lower activity, towards propyl-2-nitronate. The
           enzyme from N. crassa can also utilize neutral
           nitroalkanes, but with lower activity. One atom of
           oxygen is incorporated into the carbonyl group of the
           aldehyde product. The reaction appears to involve the
           formation of an enzyme-bound nitronate radical and an
           a-peroxynitroethane species, which then decomposes,
           either in the active site of the enzyme or after
           release, to acetaldehyde and nitrite.
          Length = 329

 Score = 31.6 bits (72), Expect = 1.8
 Identities = 17/99 (17%), Positives = 26/99 (26%), Gaps = 15/99 (15%)

Query: 695 GLLASGGGTADRC---IRFWNTLTGQPMQCVDTGSQVCNLAWSKHSSELVSTHGYSQNQI 751
           G++ SG  T DR    IR    LT +P           NL   + +          ++  
Sbjct: 38  GVIGSGYLTPDRLEKEIRKVRELTDKPFG--------VNLFLPQPAEGDDFALDVKEHGN 89

Query: 752 LVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGD 790
              K      V                   + +  G G 
Sbjct: 90  EALKLAIEEGV----PDYGDDDDSLKDAKPKVVSFGFGL 124


>gnl|CDD|216161 pfam00865, Osteopontin, Osteopontin. 
          Length = 293

 Score = 31.2 bits (70), Expect = 1.9
 Identities = 14/47 (29%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 275 HTSSASHTPKRHHAATEENSNTDLDSRKLSKVQRNLMTHDFSSHRNK 321
              S  +  K +  + E +S+  +DS++ SKV +   +H+F SH +K
Sbjct: 220 REQSKEYKRKANDESNE-HSDV-IDSQESSKVSQEHQSHEFHSHEDK 264


>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
            This is the conserved N-terminal 350 residues of a
           family of proteins of unknown function possibly
           containing a coiled-coil domain.
          Length = 308

 Score = 31.1 bits (70), Expect = 2.2
 Identities = 31/125 (24%), Positives = 49/125 (39%), Gaps = 8/125 (6%)

Query: 222 LWPRT-PLEF-KLGLRENGSTPLN-PPS----TIMGDDECEMSNYYLKRLVTTVTP-PKL 273
           LW R   LE  K  L+E    P++ PPS        +D       ++  L + V    + 
Sbjct: 183 LWKRMDKLEAEKRMLQEKLDQPVSEPPSPRDIFTEAEDTVGGIASHITSLRSEVRRLRQN 242

Query: 274 LHTSSASHTPKRHHAATEENSNTDLDSRKLSKVQRNLMTHDFSSHRNKHLDQNMNMDLEN 333
           L  S A +T K    A EE    + + R   K+ R +   +    +    + ++ MD E 
Sbjct: 243 LAKSEADYTEKMEQYAKEERQTREENIRLQRKLLREVERREALCRQLSESESSLEMDDER 302

Query: 334 ENNEM 338
             NE 
Sbjct: 303 YYNES 307


>gnl|CDD|184663 PRK14411, PRK14411, membrane protein; Provisional.
          Length = 204

 Score = 30.5 bits (69), Expect = 2.8
 Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 20/88 (22%)

Query: 53  SAYWVYLPNWKVGLPTLCINTLAFSLYWTNILLPIIVHILLHVTSRVFIYWKLPSNRTYS 112
           +A      +W+ GL TL I  L  +    N+ +P +V +LL     ++ +W         
Sbjct: 109 TAQVAVFYDWRAGLITLLI-ALVLTAIMQNLTIPPLVFMLLF---SIYTFWN-------- 156

Query: 113 AYWVYLPNWKVGLPTLCINTLL-FHHRH 139
                  N + GL  L I  ++ F  R 
Sbjct: 157 -------NQEAGLVFLIITLIMIFKFRK 177


>gnl|CDD|216201 pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal
           region.  This family is an alignment of the region to
           the N-terminal side of the active site. The Prosite
           motif does not correspond to this Pfam entry.
          Length = 348

 Score = 30.7 bits (70), Expect = 2.9
 Identities = 20/63 (31%), Positives = 22/63 (34%), Gaps = 18/63 (28%)

Query: 646 KWSPDNQYLASGGNDNRLYVWNLHSMSPLQ-TYT----------------EHLAAVKAIA 688
           KWSPD   LA    DN LYV  L +    Q T                  E   +  A  
Sbjct: 49  KWSPDGHKLAY-VRDNNLYVRELATGKETQITSDGSNGIFNGLADWVYEEEVFGSNSAYW 107

Query: 689 WSP 691
           WSP
Sbjct: 108 WSP 110


>gnl|CDD|214343 CHL00035, psbC, photosystem II 44 kDa protein.
          Length = 473

 Score = 30.5 bits (69), Expect = 4.6
 Identities = 13/25 (52%), Positives = 16/25 (64%), Gaps = 3/25 (12%)

Query: 774 YLAMSPDGEAIVTGAGDETLRFWNV 798
           YL  SP GE I    G ET+RFW++
Sbjct: 340 YLMRSPTGEIIF---GGETMRFWDL 361


>gnl|CDD|171470 PRK12400, PRK12400, D-amino acid aminotransferase; Reviewed.
          Length = 290

 Score = 29.6 bits (66), Expect = 7.3
 Identities = 11/33 (33%), Positives = 23/33 (69%), Gaps = 2/33 (6%)

Query: 175 YCFALDLPIYLAAWLSCQT--SLWVEYGFKSIS 205
           + F+ D+P  + A+++ +   +LW+EYG ++IS
Sbjct: 107 HTFSYDVPPTIYAYITKKERPALWIEYGVRAIS 139


>gnl|CDD|217394 pfam03155, Alg6_Alg8, ALG6, ALG8 glycosyltransferase family.
           N-linked (asparagine-linked) glycosylation of proteins
           is mediated by a highly conserved pathway in eukaryotes,
           in which a lipid (dolichol phosphate)-linked
           oligosaccharide is assembled at the endoplasmic
           reticulum membrane prior to the transfer of the
           oligosaccharide moiety to the target asparagine
           residues. This oligosaccharide is composed of
           Glc(3)Man(9)GlcNAc(2). The addition of the three glucose
           residues is the final series of steps in the synthesis
           of the oligosaccharide precursor. Alg6 transfers the
           first glucose residue, and Alg8 transfers the second
           one. In the human alg6 gene, a C->T transition, which
           causes Ala333 to be replaced with Val, has been
           identified as the cause of a congenital disorder of
           glycosylation, designated as type Ic OMIM:603147.
          Length = 463

 Score = 29.3 bits (66), Expect = 8.6
 Identities = 28/109 (25%), Positives = 43/109 (39%), Gaps = 24/109 (22%)

Query: 6   IIPLPLVPILAFSLYWTNILLP-IIVHIVLHVTSRVFIYWKLPSNR-TYSAYW-----VY 58
           I+ L LV +  F++    I LP + +  +  V SR+F     P  R  +         +Y
Sbjct: 217 ILKLALVVVGTFAI----IFLPFLYLKQLPQVLSRLF-----PFARGLFEDKVANFWCLY 267

Query: 59  LPNWK----VGLPTLCINTLAFSLYWTNI-LLPIIVHILLHVTSRVFIY 102
               K    V LP +       S   T I  LP +V + L  T + F+ 
Sbjct: 268 NFVDKIKEVVPLPQI---QPYISFILTLIGSLPALVKLFLRPTKKGFLL 313


>gnl|CDD|224423 COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
           [Amino acid transport and metabolism].
          Length = 620

 Score = 29.4 bits (66), Expect = 9.2
 Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 1/53 (1%)

Query: 645 LKWSPDNQYLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWSPHHHGLL 697
           L+WSPD   LA    D    V  L+ +      T+    V    WSP    + 
Sbjct: 65  LRWSPDGSVLAFVSTDGG-RVAQLYLVDVGGLITKTAFGVSDARWSPDGDRIA 116


>gnl|CDD|193528 cd05651, M20_ArgE_DapE_like5, M20 Peptidases with similarity to
           acetylornithine deacetylases and
           succinyl-diaminopimelate desuccinylases.  Peptidase M20
           family, Uncharacterized protein subfamily with
           similarity to acetylornithine
           deacetylase/succinyl-diaminopimelate desuccinylase
           (ArgE/DapE) subfamily. ArgE/DapE enzymes catalyze
           analogous reactions and share a common activator, the
           metal ion (usually Co2+ or Zn2+). ArgE catalyzes a broad
           range of substrates, including N-acetylornithine,
           alpha-N-acetylmethionine and alpha-N-formylmethionine,
           while DapE catalyzes the hydrolysis of
           N-succinyl-L,L-diaminopimelate (L,L-SDAP) to
           L,L-diaminopimelate and succinate. Proteins in this
           subfamily are bacterial, and have been inferred by
           homology as being related to both ArgE and DapE.
          Length = 342

 Score = 29.2 bits (66), Expect = 9.4
 Identities = 10/43 (23%), Positives = 17/43 (39%), Gaps = 6/43 (13%)

Query: 100 FIYWKLPSNRTYSAYWVYLPNWKVGLPTLCINTLLFHHRHHST 142
                +P  R  +  W    ++  G PT+ +N+      HH T
Sbjct: 31  LEQKGIPPKRKGNNIWAKNGHFDEGKPTILLNS------HHDT 67


>gnl|CDD|213589 TIGR01153, psbC, photosystem II 44 kDa subunit reaction center
           protein (also called P6 protein, CP43), bacterial and
           chloroplast.  This model describes the Photosystem II,
           44kDa subunit (also called P6 protein, CP43) in
           bacterial and its equivalents in chloroplast of algae
           and higher plants. Photosystem II is in many ways
           functionally equivalent to bacterial reaction center. At
           the core of Photosystem II are several light harvesting
           cofactors including plastoquinones, pheophytins,
           phyloquinones etc. These cofactors are intimately
           associated with the polypeptides, which principally
           including subunits 44 kDa protein,DI, DII, Cyt.b, Cyt.f,
           iron-sulphur protein and others. Functinally 44 kDa
           subunit is imlicated in chlorophyll binding. Together
           they participate in the electron transfer reactions that
           lead to the net production of the reducting equivalents
           in the form of NADPH, which are used for reduction of
           CO2 to carbohydrates(C6H1206). Phosystem II operates
           during oxygenic photosynthesis and principal electron
           donor is H2O. Although no high resolution X-ray
           structural data is presently available, recently a 3D
           structure of the supercomplex has been described by
           cryo-electron microscopy. Besides a huge body of
           literature exits that describes function using a variety
           of biochemical and biophysical techniques [Energy
           metabolism, Electron transport, Energy metabolism,
           Photosynthesis].
          Length = 432

 Score = 29.4 bits (66), Expect = 9.5
 Identities = 13/24 (54%), Positives = 15/24 (62%), Gaps = 3/24 (12%)

Query: 774 YLAMSPDGEAIVTGAGDETLRFWN 797
           YL  SP GE I    G ET+RFW+
Sbjct: 308 YLMRSPTGEIIF---GGETMRFWD 328


>gnl|CDD|153070 cd03600, CLECT_thrombomodulin_like, C-type lectin-like domain
           (CTLD) of the type found in human thrombomodulin(TM),
           Endosialin, C14orf27, and C1qR.
           CLECT_thrombomodulin_like: C-type lectin-like domain
           (CTLD) of the type found in human thrombomodulin(TM),
           Endosialin, C14orf27, and C1qR.  CTLD refers to a domain
           homologous to the carbohydrate-recognition domains
           (CRDs) of the C-type lectins.  In these
           thrombomodulin-like proteins the residues involved in
           coordinating Ca2+ in the classical MBP-A CTLD are not
           conserved.  TM exerts anti-fibrinolytic and
           anti-inflammatory activity.  TM also regulates blood
           coagulation in the anticoagulant protein C pathway.  In
           this pathway, the procoagulant properties of thrombin
           (T) are lost when it binds TM.  TM also plays a key role
           in tumor biology.  It is expressed on endothelial cells
           and on several type of tumor cell including squamous
           cell carcinoma.  Loss of TM expression correlates with
           advanced stage and poor prognosis.  Loss of function of
           TM function may be associated with arterial or venous
           thrombosis and with late fetal loss.  Soluble molecules
           of TM retaining the CTLD are detected in human plasma
           and urine where higher levels indicate injury and/or
           enhanced turnover of the endothelium.  C1qR is expressed
           on endothelial cells and stem cells.  It is also
           expressed on monocots and neutrophils, where it is
           subject to ectodomain shedding.  Soluble forms of C1qR
           retaining the CTLD is detected in human plasma.  C1qR
           modulates the phagocytosis of apoptotic cells in vivo.
           C1qR-deficient mice are defective in clearance of
           apoptotic cells in vivo.  The cytoplasmic tail of C1qR,
           C-terminal to the CTLD of CD93, contains a PDZ binding
           domain which interacts with the PDZ domain-containing
           adaptor protein, GIPC.  The juxtamembrane region of this
           tail interacts with the ezrin/radixin/moesin family.
           Endosialin functions in the growth and progression of
           abdominal tumors and is expressed in the stroma of
           several tumors.
          Length = 141

 Score = 28.2 bits (63), Expect = 9.5
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 1/46 (2%)

Query: 680 HLAAVKAIAWSPHHHGLLASGGGTADRC-IRFWNTLTGQPMQCVDT 724
           +LA V++   +     LLA+G G   R  +R W  L  +P QC D 
Sbjct: 29  NLATVRSGEEADVVSLLLAAGPGRHGRGSLRLWIGLQREPRQCSDP 74


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.134    0.427 

Gapped
Lambda     K      H
   0.267   0.0728    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,026,796
Number of extensions: 3942631
Number of successful extensions: 2986
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2918
Number of HSP's successfully gapped: 60
Length of query: 819
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 714
Effective length of database: 6,280,432
Effective search space: 4484228448
Effective search space used: 4484228448
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.0 bits)