RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5806
         (1332 letters)



>gnl|CDD|238060 cd00112, LDLa, Low Density Lipoprotein Receptor Class A domain, a
          cysteine-rich repeat that plays a central role in
          mammalian cholesterol metabolism; the receptor protein
          binds LDL and transports it into cells by endocytosis;
          7 successive cysteine-rich repeats of about 40 amino
          acids are present in the N-terminal of this multidomain
          membrane protein; other homologous domains occur in
          related receptors, including the very low-density
          lipoprotein receptor and the LDL receptor-related
          protein/alpha 2-macroglobulin receptor, and in proteins
          which are functionally unrelated, such as the C9
          component of complement; the binding of calcium is
          required for in vitro formation of the native disulfide
          isomer and is necessary in establishing and maintaining
          the modular structure.
          Length = 35

 Score = 63.8 bits (156), Expect = 5e-13
 Identities = 19/35 (54%), Positives = 22/35 (62%)

Query: 59 CSPDEHRCTTGHCILKTWLCDGIPDCSLGEDERNC 93
          C P+E RC  G CI  +W+CDG  DC  G DE NC
Sbjct: 1  CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35



 Score = 44.1 bits (105), Expect = 5e-06
 Identities = 15/30 (50%), Positives = 17/30 (56%), Gaps = 1/30 (3%)

Query: 21 KCNRTGACIPAQWQCDNEFDCEMGEDEMKC 50
          +C   G CIP+ W CD E DC  G DE  C
Sbjct: 7  RC-ANGRCIPSSWVCDGEDDCGDGSDEENC 35



 Score = 31.4 bits (72), Expect = 0.15
 Identities = 9/26 (34%), Positives = 13/26 (50%), Gaps = 3/26 (11%)

Query: 135 CFSKKYLCDGKKHCPRGEEGPDERHC 160
           C    ++CDG+  C  G    DE +C
Sbjct: 13  CIPSSWVCDGEDDCGDGS---DEENC 35


>gnl|CDD|200964 pfam00057, Ldl_recept_a, Low-density lipoprotein receptor domain
          class A. 
          Length = 37

 Score = 60.0 bits (146), Expect = 1e-11
 Identities = 20/37 (54%), Positives = 25/37 (67%)

Query: 57 KKCSPDEHRCTTGHCILKTWLCDGIPDCSLGEDERNC 93
            C PDE +C +G CI  +W+CDG PDC  G DE+NC
Sbjct: 1  STCGPDEFQCGSGECIPMSWVCDGDPDCEDGSDEKNC 37



 Score = 41.5 bits (98), Expect = 4e-05
 Identities = 15/30 (50%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 21 KCNRTGACIPAQWQCDNEFDCEMGEDEMKC 50
          +C  +G CIP  W CD + DCE G DE  C
Sbjct: 9  QC-GSGECIPMSWVCDGDPDCEDGSDEKNC 37


>gnl|CDD|197566 smart00192, LDLa, Low-density lipoprotein receptor domain class
          A.  Cysteine-rich repeat in the low-density lipoprotein
          (LDL) receptor that plays a central role in mammalian
          cholesterol metabolism. The N-terminal type A repeats
          in LDL receptor bind the lipoproteins. Other homologous
          domains occur in related receptors, including the very
          low-density lipoprotein receptor and the LDL
          receptor-related protein/alpha 2-macroglobulin
          receptor, and in proteins which are functionally
          unrelated, such as the C9 component of complement.
          Mutations in the LDL receptor gene cause familial
          hypercholesterolemia.
          Length = 33

 Score = 56.9 bits (138), Expect = 1e-10
 Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 58 KCSPDEHRCTTGHCILKTWLCDGIPDCSLGEDE 90
           C P E +C  G CI  +W+CDG+ DC  G DE
Sbjct: 1  TCPPGEFQCDNGRCIPSSWVCDGVDDCGDGSDE 33



 Score = 40.7 bits (96), Expect = 8e-05
 Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 17 PQYKKCNRTGACIPAQWQCDNEFDCEMGEDE 47
          P   +C   G CIP+ W CD   DC  G DE
Sbjct: 4  PGEFQC-DNGRCIPSSWVCDGVDDCGDGSDE 33


>gnl|CDD|214531 smart00135, LY, Low-density lipoprotein-receptor YWTD domain.  Type
           "B" repeats in low-density lipoprotein (LDL) receptor
           that plays a central role in mammalian cholesterol
           metabolism. Also present in a variety of molecules
           similar to gp300/megalin.
          Length = 43

 Score = 56.1 bits (136), Expect = 4e-10
 Identities = 18/43 (41%), Positives = 28/43 (65%)

Query: 284 KVILTENIFWPNGLTIDFEEDRLYWTEAKHHFIESSDLDGSRR 326
           + +L+  +  PNGL +D+ E RLYWT+     IE ++LDG+ R
Sbjct: 1   RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43



 Score = 50.3 bits (121), Expect = 4e-08
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 774 ERLVVNDLNRTESIAVDAIGRKIYWTDMNAQTIMVSDIDGKN 815
             L+ + L     +AVD I  ++YWTD     I V+++DG N
Sbjct: 1   RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42



 Score = 49.9 bits (120), Expect = 6e-08
 Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 240 KILIANNIDKPRAVIVHPRDAFVFWTDWGKTPKIERVEMDGSNR 283
           + L+++ +  P  + V   +  ++WTDWG    IE   +DG+NR
Sbjct: 1   RTLLSSGLGHPNGLAVDWIEGRLYWTDWG-LDVIEVANLDGTNR 43



 Score = 49.1 bits (118), Expect = 8e-08
 Identities = 17/39 (43%), Positives = 25/39 (64%)

Query: 504 VIRHNTKKPAGVAIDWRTNKLYWTESEYNWIEVSNLDGT 542
           ++      P G+A+DW   +LYWT+   + IEV+NLDGT
Sbjct: 3   LLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGT 41



 Score = 49.1 bits (118), Expect = 9e-08
 Identities = 21/41 (51%), Positives = 24/41 (58%)

Query: 589 HILASDSLKNPTGLAIDLENSRLYWANVGLKTIEFTNLDGT 629
             L S  L +P GLA+D    RLYW + GL  IE  NLDGT
Sbjct: 1   RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGT 41



 Score = 48.0 bits (115), Expect = 3e-07
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 202 TGMQSVGSLSVDWVHDFVFWTDTAARRVEFCDLDGKNR 239
           +G+     L+VDW+   ++WTD     +E  +LDG NR
Sbjct: 6   SGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43



 Score = 47.6 bits (114), Expect = 3e-07
 Identities = 15/42 (35%), Positives = 22/42 (52%)

Query: 1173 VLVEEDLAFPNELAIDFKQRRLFWADSTNKRIEYCDFFGRSR 1214
             L+   L  PN LA+D+ + RL+W D     IE  +  G +R
Sbjct: 2    TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43



 Score = 44.1 bits (105), Expect = 5e-06
 Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 550 QNLSEPRDIAVNPIGGIMYWCSWGDLHKIEMAGMDGSHR 588
             L  P  +AV+ I G +YW  WG L  IE+A +DG++R
Sbjct: 6   SGLGHPNGLAVDWIEGRLYWTDWG-LDVIEVANLDGTNR 43



 Score = 40.7 bits (96), Expect = 9e-05
 Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 3/43 (6%)

Query: 327 QQVITKGLPHPFGITL--YEDFVFWTDWHTKSISSANKYTGRN 367
           + +++ GL HP G+ +   E  ++WTDW    I  AN   G N
Sbjct: 1   RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVAN-LDGTN 42



 Score = 39.9 bits (94), Expect = 2e-04
 Identities = 12/42 (28%), Positives = 15/42 (35%)

Query: 732 VLPLRQLKMVASVDVDTKNEYIYWSDISEKTIERVRFDMTGR 773
            L    L     + VD     +YW+D     IE    D T R
Sbjct: 2   TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43



 Score = 38.7 bits (91), Expect = 4e-04
 Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 5/42 (11%)

Query: 823 NLYRPRSIVVHYGLGLMVWADWSRTRLTNNRIEMAHMDGTNR 864
            L  P  + V +  G + W DW       + IE+A++DGTNR
Sbjct: 7   GLGHPNGLAVDWIEGRLYWTDWGL-----DVIEVANLDGTNR 43



 Score = 36.4 bits (85), Expect = 0.003
 Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 864 RAVFETEVIWPSCLAIDYSDNPKLYWVDTSKHTIE 898
           R +  + +  P+ LA+D+ +  +LYW D     IE
Sbjct: 1   RTLLSSGLGHPNGLAVDWIEG-RLYWTDWGLDVIE 34



 Score = 33.7 bits (78), Expect = 0.024
 Identities = 9/30 (30%), Positives = 15/30 (50%)

Query: 1093 NFVIDWVANNIYYIDSQMHTINVARSDGQH 1122
               +DW+   +Y+ D  +  I VA  DG +
Sbjct: 13   GLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42



 Score = 33.7 bits (78), Expect = 0.028
 Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 6/49 (12%)

Query: 1123 KKILVNDLMEPLAIAVYPRRGLLFYSHWGLYDNSPTTKIEKVYLDGSYR 1171
            + +L + L  P  +AV    G L+++ WGL        IE   LDG+ R
Sbjct: 1    RTLLSSGLGHPNGLAVDWIEGRLYWTDWGL------DVIEVANLDGTNR 43



 Score = 33.3 bits (77), Expect = 0.034
 Identities = 10/41 (24%), Positives = 16/41 (39%)

Query: 460 VLPVMNVKSVVAIAWESANDSLYWADVENGEINRAHINGTF 500
            L    +     +A +     LYW D     I  A+++GT 
Sbjct: 2   TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42



 Score = 30.6 bits (70), Expect = 0.37
 Identities = 11/29 (37%), Positives = 14/29 (48%), Gaps = 2/29 (6%)

Query: 915 SHPYTLTV--LDYYVYWTDVQHSKIYRAN 941
            HP  L V  ++  +YWTD     I  AN
Sbjct: 9   GHPNGLAVDWIEGRLYWTDWGLDVIEVAN 37


>gnl|CDD|215683 pfam00058, Ldl_recept_b, Low-density lipoprotein receptor repeat
           class B.  This domain is also known as the YWTD motif
           after the most conserved region of the repeat. The YWTD
           repeat is found in multiple tandem repeats and has been
           predicted to form a beta-propeller structure.
          Length = 42

 Score = 54.1 bits (131), Expect = 2e-09
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 261 FVFWTDWGKTPKIERVEMDGSNRKVILTENIFWPNGLTID 300
            ++WTD      I   +++GS+R+ + +E++ WPNG+ +D
Sbjct: 2   RLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVD 41



 Score = 46.8 bits (112), Expect = 6e-07
 Identities = 14/39 (35%), Positives = 22/39 (56%)

Query: 567 MYWCSWGDLHKIEMAGMDGSHRHILASDSLKNPTGLAID 605
           +YW        I +A ++GS R  L S+ L+ P G+A+D
Sbjct: 3   LYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVD 41



 Score = 42.9 bits (102), Expect = 1e-05
 Identities = 14/42 (33%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 794 RKIYWTDMNAQ-TIMVSDIDGKNAKVLFWLNLYRPRSIVVHY 834
            ++YWTD + + +I V+D++G + + LF  +L  P  I V  
Sbjct: 1   GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42



 Score = 42.5 bits (101), Expect = 2e-05
 Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 304 DRLYWTEAK-HHFIESSDLDGSRRQQVITKGLPHPFGITLYE 344
            RLYWT++     I  +DL+GS R+ + ++ L  P GI +  
Sbjct: 1   GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42



 Score = 41.4 bits (98), Expect = 5e-05
 Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 217 DFVFWTDTAARRVEFC-DLDGKNRKILIANNIDKPRAVIVHP 257
             ++WTD++ R      DL+G +R+ L + ++  P  + V P
Sbjct: 1   GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42



 Score = 36.8 bits (86), Expect = 0.002
 Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 2/42 (4%)

Query: 1101 NNIYYID-SQMHTINVARSDGQHKKILV-NDLMEPLAIAVYP 1140
              +Y+ D S   +I+VA  +G  ++ L   DL  P  IAV P
Sbjct: 1    GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42



 Score = 35.6 bits (83), Expect = 0.005
 Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 1/40 (2%)

Query: 481 LYWADVENGE-INRAHINGTFQYNVIRHNTKKPAGVAIDW 519
           LYW D      I+ A +NG+ +  +   + + P G+A+D 
Sbjct: 3   LYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42



 Score = 35.6 bits (83), Expect = 0.006
 Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 1/41 (2%)

Query: 751 EYIYWSDIS-EKTIERVRFDMTGRERLVVNDLNRTESIAVD 790
             +YW+D S   +I     + + R  L   DL     IAVD
Sbjct: 1   GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVD 41



 Score = 34.5 bits (80), Expect = 0.016
 Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 522 NKLYWTESEYNW-IEVSNLDGTYPTLLFWQNLSEPRDIAVNP 562
            +LYWT+S     I V++L+G+    LF ++L  P  IAV+P
Sbjct: 1   GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42



 Score = 31.0 bits (71), Expect = 0.27
 Identities = 6/27 (22%), Positives = 13/27 (48%)

Query: 345 DFVFWTDWHTKSISSANKYTGRNIKTV 371
             ++WTD   ++  S     G + +T+
Sbjct: 1   GRLYWTDSSLRASISVADLNGSDRRTL 27



 Score = 29.1 bits (66), Expect = 1.1
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 5/46 (10%)

Query: 1143 GLLFYSHWGLYDNSPTTKIEKVYLDGSYRTVLVEEDLAFPNELAID 1188
            G L+++   L        I    L+GS R  L  EDL +PN +A+D
Sbjct: 1    GRLYWTDSSL-----RASISVADLNGSDRRTLFSEDLQWPNGIAVD 41


>gnl|CDD|215497 PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family
            protein.
          Length = 1057

 Score = 41.8 bits (98), Expect = 0.003
 Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 6/69 (8%)

Query: 1144 LLFYSHWGLYDNSPT-TKIEKVYLDGSYRTVLVEEDLAFPNELAIDFKQRRLFWADSTNK 1202
            L +Y    L D++P    +EK   D   R  L+   L FP +LAID    RLF +DS + 
Sbjct: 536  LQYYGEKKLLDSTPLPLSLEK---DNDPR--LLTSPLKFPGKLAIDLLNNRLFISDSNHN 590

Query: 1203 RIEYCDFFG 1211
            RI   D  G
Sbjct: 591  RIVVTDLDG 599



 Score = 37.1 bits (86), Expect = 0.086
 Identities = 21/64 (32%), Positives = 35/64 (54%), Gaps = 2/64 (3%)

Query: 270 TPKIERVEMDGSNRKVILTENIFWPNGLTIDFEEDRLYWTEAKHHFIESSDLDGSRRQQV 329
           TP    +E D   R  +LT  + +P  L ID   +RL+ +++ H+ I  +DLDG+   Q+
Sbjct: 548 TPLPLSLEKDNDPR--LLTSPLKFPGKLAIDLLNNRLFISDSNHNRIVVTDLDGNFIVQI 605

Query: 330 ITKG 333
            + G
Sbjct: 606 GSTG 609



 Score = 32.5 bits (74), Expect = 1.8
 Identities = 17/38 (44%), Positives = 23/38 (60%)

Query: 591 LASDSLKNPTGLAIDLENSRLYWANVGLKTIEFTNLDG 628
           L +  LK P  LAIDL N+RL+ ++     I  T+LDG
Sbjct: 562 LLTSPLKFPGKLAIDLLNNRLFISDSNHNRIVVTDLDG 599



 Score = 32.5 bits (74), Expect = 1.9
 Identities = 35/120 (29%), Positives = 51/120 (42%), Gaps = 36/120 (30%)

Query: 510 KKPAGVAIDWRTNKLYWTESEYNWIEVSNLDGTYPTLLFWQNLSEPRDIAVNPIGGIMYW 569
           K P  +AID   N+L+ ++S +N I V++LDG +                      I+  
Sbjct: 568 KFPGKLAIDLLNNRLFISDSNHNRIVVTDLDGNF----------------------IVQI 605

Query: 570 CSWGDLHKIEMAGMDGSHRHILASDSLKN-PTGLAIDLENSRLYWA---NVGLKTIEFTN 625
            S G     E    DGS       D+  N P GLA + + + LY A   N  L+ I+F N
Sbjct: 606 GSTG-----EEGLRDGSF-----EDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVN 655


>gnl|CDD|225921 COG3386, COG3386, Gluconolactonase [Carbohydrate transport and
           metabolism].
          Length = 307

 Score = 39.3 bits (92), Expect = 0.012
 Identities = 28/124 (22%), Positives = 39/124 (31%), Gaps = 19/124 (15%)

Query: 767 RFDMTGR-ERLVVNDLNRTESIAVDAIGRKIYWTDMNAQTIMVSD-------IDGKNAKV 818
           R D  G   RL+ +DL     +A    G+ +Y  D  A  I   D       I G+   V
Sbjct: 147 RVDPDGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFV 206

Query: 819 LFWLNLYRPRSIVV---HYGLGLMVWADWSRTRLTNNRIEMAHMDGTNRAVFETEVIWPS 875
            F      P  + V          VW           R+   + DG      +  V  P+
Sbjct: 207 DFDEEPGLPDGMAVDADGNLWVAAVWGGG--------RVVRFNPDGKLLGEIKLPVKRPT 258

Query: 876 CLAI 879
             A 
Sbjct: 259 NPAF 262



 Score = 37.4 bits (87), Expect = 0.038
 Identities = 31/148 (20%), Positives = 49/148 (33%), Gaps = 24/148 (16%)

Query: 222 TDTAARRVEFCDLDGKNRKILIANNIDKPRAVIVHPRDAFVFWT--------DWGK-TPK 272
            DT  +     + +     +   N+        V P     F             + T  
Sbjct: 92  PDTGGKITLLAEPE-DGLPLNRPNDGV------VDPDGRIWFGDMGYFDLGKSEERPTGS 144

Query: 273 IERVEMDGSNRKVILTENIFWPNGLTIDFEEDRLYWTEAKHHFIESSDLD------GSRR 326
           + RV+ DG   + +L +++  PNGL    +   LY  +   + I   DLD      G RR
Sbjct: 145 LYRVDPDGGVVR-LLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRR 203

Query: 327 QQVITKGLP-HPFGITLYEDFVFWTDWH 353
             V     P  P G+ +  D   W    
Sbjct: 204 GFVDFDEEPGLPDGMAVDADGNLWVAAV 231



 Score = 33.9 bits (78), Expect = 0.54
 Identities = 19/75 (25%), Positives = 29/75 (38%), Gaps = 3/75 (4%)

Query: 1138 VYPRRGLLFYSHWGLYDNSPTTK-IEKVY-LDGSYRTV-LVEEDLAFPNELAIDFKQRRL 1194
            V P   + F             +    +Y +D     V L+++DL  PN LA     + L
Sbjct: 118  VDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVVRLLDDDLTIPNGLAFSPDGKTL 177

Query: 1195 FWADSTNKRIEYCDF 1209
            + AD+   RI   D 
Sbjct: 178  YVADTPANRIHRYDL 192


>gnl|CDD|225926 COG3391, COG3391, Uncharacterized conserved protein [Function
           unknown].
          Length = 381

 Score = 38.0 bits (88), Expect = 0.032
 Identities = 33/134 (24%), Positives = 55/134 (41%), Gaps = 4/134 (2%)

Query: 497 NGTFQYNVIRHNTKKPAGVAIDWRTNKLYWTESEYNWIEVSNLDGTYPTLLFWQNLSEPR 556
            GT +        + P GVA++    ++Y   S  N + V +      T         P 
Sbjct: 18  TGTNKVTAAISLGRGPGGVAVNPDGTQVYVANSGSNDVSVIDATSNTVTQSLSVGGVYPA 77

Query: 557 DIAVNPIGGIMYWCSWGDLHKIEMAGMDGSHRHILASDSL-KNPTGLAIDLENSRLYWAN 615
            +AVNP G  +Y  + GD + +    +D +   +L S  +   P GLA+D +   +Y AN
Sbjct: 78  GVAVNPAGNKVYVTT-GDSNTVS--VIDTATNTVLGSIPVGLGPVGLAVDPDGKYVYVAN 134

Query: 616 VGLKTIEFTNLDGT 629
            G      + +D  
Sbjct: 135 AGNGNNTVSVIDAA 148



 Score = 35.3 bits (81), Expect = 0.20
 Identities = 37/129 (28%), Positives = 52/129 (40%), Gaps = 24/129 (18%)

Query: 455 PTFETVLPVMNV-KSVVAIAWESANDSLYWADVENGEINRAHINGTFQYNVIRHNTKK-- 511
               TVL  + V    V +A +     +Y A+  NG       N T   +VI   T K  
Sbjct: 103 TATNTVLGSIPVGLGPVGLAVDPDGKYVYVANAGNG-------NNT--VSVIDAATNKVT 153

Query: 512 --------PAGVAIDWRTNKLYWTESEYNWIEVSNLDGTYPTLLFWQNL----SEPRDIA 559
                   P GVA+D   NK+Y T S+ N + V +  G         +L    + P  IA
Sbjct: 154 ATIPVGNTPTGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVVRGSVGSLVGVGTGPAGIA 213

Query: 560 VNPIGGIMY 568
           V+P G  +Y
Sbjct: 214 VDPDGNRVY 222



 Score = 32.2 bits (73), Expect = 2.1
 Identities = 36/185 (19%), Positives = 61/185 (32%), Gaps = 24/185 (12%)

Query: 740 MVASVDVDTKNEYIYWSDISEKTIERVRFDMTGRERLVVNDLNRT-ESIAVDAIGRKIYW 798
               + VD   +Y+Y ++           D    +      +  T   +AVD  G K+Y 
Sbjct: 117 GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTATIPVGNTPTGVAVDPDGNKVYV 176

Query: 799 TDMNAQTIMVSDIDGKNAKVLFWLNLY----RPRSIVVHYGLGLMVWADWSRTRLTNNRI 854
           T+ +  T+ V D  G +       +L      P  I V      +  A+         +I
Sbjct: 177 TNSDDNTVSVIDTSGNSVVRGSVGSLVGVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKI 236

Query: 855 EMAHMDGTNRAVFETEVI----WPSCLAID------YSDN---PKLYWVDTSKHTIEYKT 901
                D     V  T++      P  +A+D      Y  N     +  +D +   +  KT
Sbjct: 237 -----DTATGNVTATDLPVGSGAPRGVAVDPAGKAAYVANSQGGTVSVIDGATDRV-VKT 290

Query: 902 LATGR 906
             TG 
Sbjct: 291 GPTGN 295



 Score = 30.3 bits (68), Expect = 7.2
 Identities = 18/78 (23%), Positives = 27/78 (34%), Gaps = 1/78 (1%)

Query: 742 ASVDVDTKNEYIYWSDISEKTIERVRFDMTGRERLVVNDLNRTESIAVDAIGRKIYWTDM 801
           A V V+     +Y +     T+  V    T      +        +AVD  G+ +Y  + 
Sbjct: 77  AGVAVNPAGNKVYVTTGDSNTVS-VIDTATNTVLGSIPVGLGPVGLAVDPDGKYVYVANA 135

Query: 802 NAQTIMVSDIDGKNAKVL 819
                 VS ID    KV 
Sbjct: 136 GNGNNTVSVIDAATNKVT 153


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 33.3 bits (77), Expect = 0.036
 Identities = 12/28 (42%), Positives = 14/28 (50%), Gaps = 2/28 (7%)

Query: 972 CGSNNGGCSH--LCLRNPTNFTCACPTG 997
           C  NNGGC     C     +FTC C +G
Sbjct: 1   CAENNGGCHPNATCTNTGGSFTCTCKSG 28


>gnl|CDD|219847 pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region.  This family
            describes a region that is found in proteins expressed by
            a variety of eukaryotic and prokaryotic species. These
            proteins include various enzymes, such as senescence
            marker protein 30 (SMP-30), gluconolactonase and
            luciferin-regenerating enzyme (LRE). SMP-30 is known to
            hydrolyse diisopropyl phosphorofluoridate in the liver,
            and has been noted as having sequence similarity, in the
            region described in this family, with PON1 and LRE.
          Length = 245

 Score = 36.1 bits (84), Expect = 0.087
 Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 1153 YDNSPTTKIEKVY-LDGSYRTVLVEEDLAFPNELAIDFKQRRLFWADSTNKRIEYCDF 1209
            +D +P  +   +Y LD   +   V + +   N LA     + L++ADS  +RI   D+
Sbjct: 105  FDIAPGGEPGALYRLDPDGKVERVLDGITISNGLAWSPDGKTLYFADSPTRRIWAFDY 162



 Score = 33.0 bits (76), Expect = 0.88
 Identities = 33/160 (20%), Positives = 51/160 (31%), Gaps = 21/160 (13%)

Query: 474 WESANDSLYWADVENGEINRAHINGTFQYNVIRHNTKKPAGVAIDWRTNKLYWTESEYNW 533
           W+    +LYW D+  G I+R              +   P G        +L         
Sbjct: 7   WDEEEGALYWVDILGGRIHRLDPATG---KETVWDLPGPVGAIALRDDGRLIVALK--RG 61

Query: 534 IEVSNLDGTYPTLLFWQNLSEPR----DIAVNPIGGIMYWCSWGDLHKIEMAG------- 582
           + + +LD    T L      EP     D  V+P G    W   G +      G       
Sbjct: 62  LALLDLDTGELTTLADLEPDEPLNRFNDGKVDPDGRF--WF--GTMGFDIAPGGEPGALY 117

Query: 583 -MDGSHRHILASDSLKNPTGLAIDLENSRLYWANVGLKTI 621
            +D   +     D +    GLA   +   LY+A+   + I
Sbjct: 118 RLDPDGKVERVLDGITISNGLAWSPDGKTLYFADSPTRRI 157



 Score = 33.0 bits (76), Expect = 0.92
 Identities = 11/52 (21%), Positives = 22/52 (42%)

Query: 767 RFDMTGRERLVVNDLNRTESIAVDAIGRKIYWTDMNAQTIMVSDIDGKNAKV 818
           R D  G+   V++ +  +  +A    G+ +Y+ D   + I   D D     +
Sbjct: 118 RLDPDGKVERVLDGITISNGLAWSPDGKTLYFADSPTRRIWAFDYDADGGLI 169


>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or complement
           Clr-like EGF, domains have six conserved cysteine
           residues disulfide-bonded into the characteristic
           pattern 'ababcc'. They are found in blood coagulation
           proteins such as fibrillin, Clr and Cls, thrombomodulin,
           and the LDL receptor. The core fold of the EGF domain
           consists of two small beta-hairpins packed against each
           other. Two major structural variants have been
           identified based on the structural context of the
           C-terminal cysteine residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
           C-terminal thiol resides on the C-terminal beta-sheet,
           resulting in long loop-lengths between the cysteine
           residues of disulfide 'c', typically C[10+]XC. These
           longer loop-lengths may have arisen by selective
           cysteine loss from a four-disulfide EGF template such as
           laminin or integrin. Tandem cEGF domains have five
           linking residues between terminal cysteines of adjacent
           domains. cEGF domains may or may not bind calcium in the
           linker region. cEGF domains with the consensus motif
           CXN4X[F,Y]XCXC are hydroxylated exclusively on the
           asparagine residue.
          Length = 24

 Score = 31.7 bits (73), Expect = 0.096
 Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 412 FTCQCPLGLTLSPDNKSC 429
           +TC CP G  LS D ++C
Sbjct: 2   YTCSCPPGYQLSGDGRTC 19



 Score = 29.7 bits (68), Expect = 0.53
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 990  FTCACPTGILLSADRRSC 1007
            +TC+CP G  LS D R+C
Sbjct: 2    YTCSCPPGYQLSGDGRTC 19


>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
           extracellular matrix molecules mediate cell-matrix and
           matrix-matrix interactions thereby providing tissue
           integrity. Some members of the matrilin family are
           expressed specifically in developing cartilage
           rudiments. The matrilin family consists of at least four
           members. All the members of the matrilin family contain
           VWA domains, EGF-like domains and a heptad repeat
           coiled-coiled domain at the carboxy terminus which is
           responsible for the oligomerization of the matrilins.
           The VWA domains have been shown to be essential for
           matrilin network formation by interacting with matrix
           ligands.
          Length = 224

 Score = 35.1 bits (81), Expect = 0.17
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 396 CAPKVCSHICLPNKHRFTCQCPLGLTLSPDNKS 428
               VC  +C+     + C C  G  L  DNK+
Sbjct: 192 TLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224



 Score = 33.5 bits (77), Expect = 0.56
 Identities = 11/38 (28%), Positives = 20/38 (52%)

Query: 969  PNACGSNNGGCSHLCLRNPTNFTCACPTGILLSADRRS 1006
            P+ C + +  C  +C+  P ++ CAC  G  L  D ++
Sbjct: 187  PDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 29.3 bits (66), Expect = 0.74
 Identities = 12/28 (42%), Positives = 15/28 (53%), Gaps = 3/28 (10%)

Query: 972 CGSNNGGCSH--LCLRNPTNFTCACPTG 997
           C  NN  CS+   C+  P  +TC CP G
Sbjct: 1   CSPNNP-CSNGGTCVDTPGGYTCECPEG 27



 Score = 28.2 bits (63), Expect = 1.8
 Identities = 11/29 (37%), Positives = 16/29 (55%), Gaps = 3/29 (10%)

Query: 396 CAPKV-CSH--ICLPNKHRFTCQCPLGLT 421
           C+P   CS+   C+     +TC+CP G T
Sbjct: 1   CSPNNPCSNGGTCVDTPGGYTCECPEGYT 29


>gnl|CDD|233685 TIGR02013, rpoB, DNA-directed RNA polymerase, beta subunit.  This
           model describes orthologs of the beta subunit of
           Bacterial RNA polymerase. The core enzyme consists of
           two alpha chains, one beta chain, and one beta' subunit
           [Transcription, DNA-dependent RNA polymerase].
          Length = 1065

 Score = 33.1 bits (76), Expect = 1.3
 Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 7/80 (8%)

Query: 724 DVEYRYPIVLPLRQLKMVASVDVDTKNEYIYWSDISEKTIERVRFDMTGRERLVVNDLNR 783
            + Y  P+ + LR +        + K + +Y  DI   T +R  F + G ER+VV+ L+R
Sbjct: 86  GLTYSVPLKVKLRLINKEEDGTKEIKEQEVYMGDIPLMT-DRGTFIINGAERVVVSQLHR 144

Query: 784 T------ESIAVDAIGRKIY 797
           +              G+ +Y
Sbjct: 145 SPGVFFSSEKDTHKNGKVLY 164


>gnl|CDD|227604 COG5279, CYK3, Uncharacterized protein involved in cytokinesis,
            contains TGc (transglutaminase/protease-like) domain
            [Cell division and chromosome partitioning].
          Length = 521

 Score = 33.0 bits (75), Expect = 1.3
 Identities = 21/99 (21%), Positives = 30/99 (30%), Gaps = 9/99 (9%)

Query: 1029 LDTADLLPVTLPFPEYMSSIFFDYHYSKNLIYFA---DMRSGNLRTFDMSDSTRIKP--- 1082
                D +  +L      SS+      + N  +          NLR F  +     KP   
Sbjct: 94   RSAMDNVLQSLNAKGKKSSLRLTNSTTDNTKFDFIPIPEAFDNLRRFQNAPGPSPKPLAT 153

Query: 1083 --IPLMNDTIRDNFVI-DWVANNIYYIDSQMHTINVARS 1118
               P          VI DW+  N  YID    + N+   
Sbjct: 154  IIRPSTAQLYPKPPVIYDWIVVNTSYIDKMFESRNIQDP 192


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 28.6 bits (64), Expect = 1.6
 Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 396 CAPK-VCSH-ICLPNKHRFTCQCPLGLTLSPDNKSCS 430
           CA    CS+  C+     +TC CP G T    +K C 
Sbjct: 2   CASGGPCSNGTCINTPGSYTCSCPPGYTG---DKRCE 35



 Score = 28.6 bits (64), Expect = 1.7
 Identities = 10/25 (40%), Positives = 17/25 (68%), Gaps = 1/25 (4%)

Query: 974 SNNGGCSH-LCLRNPTNFTCACPTG 997
           ++ G CS+  C+  P ++TC+CP G
Sbjct: 3   ASGGPCSNGTCINTPGSYTCSCPPG 27


>gnl|CDD|132976 cd06645, STKc_MAP4K3, Catalytic domain of the Protein
           Serine/Threonine Kinase, Mitogen-activated protein
           kinase kinase kinase kinase 3.  Serine/threonine kinases
           (STKs), mitogen-activated protein kinase (MAPK) kinase
           kinase kinase 3 (MAPKKKK3 or MAP4K3) subfamily,
           catalytic (c) domain. STKs catalyze the transfer of the
           gamma-phosphoryl group from ATP to serine/threonine
           residues on protein substrates. The MAP4K3 subfamily is
           part of a larger superfamily that includes the catalytic
           domains of other protein STKs, protein tyrosine kinases,
           RIO kinases, aminoglycoside phosphotransferase, choline
           kinase, and phosphoinositide 3-kinase. Members of this
           subfamily contain an N-terminal catalytic domain and a
           C-terminal citron homology (CNH) regulatory domain,
           similar to MAP4K4/6. MAP4Ks are involved in some MAPK
           signaling pathways that are important in mediating
           cellular responses to extracellular signals by
           activating a MAPK kinase kinase (MAPKKK or MAP3K or
           MKKK). Each MAPK cascade is activated either by a small
           GTP-binding protein or by an adaptor protein, which
           transmits the signal either directly to a MAP3K to start
           the triple kinase core cascade or indirectly through a
           mediator kinase, a MAP4K. MAP4K3 plays a role in the
           nutrient-responsive pathway of mTOR (mammalian target of
           rapamycin) signaling. MAP4K3 is required in the
           activation of S6 kinase by amino acids and for the
           phosphorylation of the mTOR-regulated inhibitor of
           eukaryotic initiation factor 4E. mTOR regulates ribosome
           biogenesis and protein translation, and is frequently
           deregulated in cancer.
          Length = 267

 Score = 31.2 bits (70), Expect = 3.2
 Identities = 14/53 (26%), Positives = 29/53 (54%), Gaps = 5/53 (9%)

Query: 286 ILTENIFWPNGLTIDFEEDRLYWTEAKHHFIESSDLDGSRRQQVITKGLPHPF 338
           ++T++ F P  L     +D++ W+ + HHF++ +     +++    K L HPF
Sbjct: 220 LMTKSNFQPPKL-----KDKMKWSNSFHHFVKMALTKNPKKRPTAEKLLQHPF 267


>gnl|CDD|215623 PLN03190, PLN03190, aminophospholipid translocase; Provisional.
          Length = 1178

 Score = 31.4 bits (71), Expect = 4.3
 Identities = 19/73 (26%), Positives = 30/73 (41%), Gaps = 19/73 (26%)

Query: 472 IAWESANDSLY---WADVENGEINRAHINGTFQYNVIRHNTKKPAGVAIDWRTNKLYWTE 528
           +AW   +D      W D+  GEI +   N T   +++  +T  P GVA            
Sbjct: 172 LAWVLVDDQFQEKKWKDIRVGEIIKIQANDTLPCDMVLLSTSDPTGVA------------ 219

Query: 529 SEYNWIEVSNLDG 541
               +++  NLDG
Sbjct: 220 ----YVQTINLDG 228


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 27.1 bits (60), Expect = 5.3
 Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 6/37 (16%)

Query: 396 CA-PKVCSH--ICLPNKHRFTCQCPLGLTLSPDNKSC 429
           CA    CS+   C+     + C CP G T    ++SC
Sbjct: 2   CAASNPCSNGGTCVNTPGSYRCVCPPGYTG---DRSC 35



 Score = 27.1 bits (60), Expect = 6.2
 Identities = 8/26 (30%), Positives = 14/26 (53%), Gaps = 2/26 (7%)

Query: 974 SNNGGCSH--LCLRNPTNFTCACPTG 997
           + +  CS+   C+  P ++ C CP G
Sbjct: 3   AASNPCSNGGTCVNTPGSYRCVCPPG 28


>gnl|CDD|143417 cd07099, ALDH_DDALDH, Methylomonas sp.
           4,4'-diapolycopene-dialdehyde dehydrogenase-like.  The
           4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH)
           involved in C30 carotenoid synthesis in Methylomonas sp.
           strain 16a and other similar sequences are present in
           this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde
           into 4,4'-diapolycopene-diacid.
          Length = 453

 Score = 30.3 bits (69), Expect = 7.1
 Identities = 18/43 (41%), Positives = 21/43 (48%), Gaps = 10/43 (23%)

Query: 456 TFETVLPVMNVKSV-VAIAWESANDSLY------WA-DVENGE 490
           TF  VLPVM V     AIA   ANDS Y      ++ D+   E
Sbjct: 358 TFGPVLPVMPVADEDEAIAL--ANDSRYGLSASVFSRDLARAE 398


>gnl|CDD|119132 pfam10612, Spore-coat_CotZ, Spore coat protein Z.  This family has
            members annotated as Spore coat protein Z, otherwise
            known as CotZ, It is a cysteine-rich spore coat family,
            and along with CotY is necessary for assembly of intact
            exosporium.
          Length = 154

 Score = 29.3 bits (66), Expect = 8.2
 Identities = 16/59 (27%), Positives = 22/59 (37%), Gaps = 11/59 (18%)

Query: 972  CGSNNGGCSHLCLRNPTNF--------TCACPTGI---LLSADRRSCFSRTREFLLYTS 1019
            CG +    S  C+ +   F           CPT     LL     +  + TR F+LYT 
Sbjct: 3    CGKHEDHGSSNCVCDVVRFINDLQDAAEEDCPTSCFSPLLGPSNSASLADTRPFVLYTK 61


>gnl|CDD|181983 PRK09603, PRK09603, bifunctional DNA-directed RNA polymerase
           subunit beta/beta'; Reviewed.
          Length = 2890

 Score = 30.7 bits (69), Expect = 8.7
 Identities = 16/41 (39%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 747 DTKNEYIYWSDISEKTIERVRFDMTGRERLVVNDLNRTESI 787
           D K + I+  +I   T ER  F + G ER+VVN L+R+  +
Sbjct: 119 DIKEQSIFIREIPLMT-ERTSFIINGVERVVVNQLHRSPGV 158


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.137    0.442 

Gapped
Lambda     K      H
   0.267   0.0701    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 67,811,812
Number of extensions: 6626487
Number of successful extensions: 4417
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4399
Number of HSP's successfully gapped: 68
Length of query: 1332
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1223
Effective length of database: 6,103,016
Effective search space: 7463988568
Effective search space used: 7463988568
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 65 (28.9 bits)