RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14257
         (841 letters)



>gnl|CDD|214531 smart00135, LY, Low-density lipoprotein-receptor YWTD domain.  Type
           "B" repeats in low-density lipoprotein (LDL) receptor
           that plays a central role in mammalian cholesterol
           metabolism. Also present in a variety of molecules
           similar to gp300/megalin.
          Length = 43

 Score = 53.0 bits (128), Expect = 3e-09
 Identities = 16/45 (35%), Positives = 25/45 (55%), Gaps = 2/45 (4%)

Query: 436 RTLIAGDMESPRAIALDPRFGLLFWTDWDSKAPRIERCSMSGEHR 480
           RTL++  +  P  +A+D   G L+WTDW      IE  ++ G +R
Sbjct: 1   RTLLSSGLGHPNGLAVDWIEGRLYWTDW--GLDVIEVANLDGTNR 43



 Score = 47.2 bits (113), Expect = 3e-07
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 343 LVTKNIQWPSSLVIDYNQDILYWCDSHLNTVESVRVDGSER 383
           L++  +  P+ L +D+ +  LYW D  L+ +E   +DG+ R
Sbjct: 3   LLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43



 Score = 46.1 bits (110), Expect = 7e-07
 Identities = 16/41 (39%), Positives = 21/41 (51%)

Query: 488 LLTSRGAWPNGLTLDYQLTRIYWIDAKSDSIHTVTYDGKDH 528
           LL+S    PNGL +D+   R+YW D   D I     DG + 
Sbjct: 3   LLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43



 Score = 42.6 bits (101), Expect = 1e-05
 Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 1/44 (2%)

Query: 529 HVVLSNHDSLTHPFAIALFENHVYWTDWRTNSVIRANKWNGSDV 572
             +LS+     +  A+   E  +YWTDW  + VI     +G++ 
Sbjct: 1   RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLD-VIEVANLDGTNR 43



 Score = 38.4 bits (90), Expect = 4e-04
 Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 9/51 (17%)

Query: 290 VVVQTGLSTAEGLAVDWIGKNLYMFWAFWSLSPSNSSDGGIEKAGMHGENR 340
            ++ +GL    GLAVDWI   LY  W       ++     IE A + G NR
Sbjct: 2   TLLSSGLGHPNGLAVDWIEGRLY--W-------TDWGLDVIEVANLDGTNR 43



 Score = 36.8 bits (86), Expect = 0.001
 Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 2/40 (5%)

Query: 732 VVDAGISHPSGLAIDWIAQNMFIAVSSPTQSKIVVCNLEG 771
           ++ +G+ HP+GLA+DWI   ++   +      I V NL+G
Sbjct: 3   LLSSGLGHPNGLAVDWIEGRLYW--TDWGLDVIEVANLDG 40



 Score = 36.4 bits (85), Expect = 0.002
 Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 2/38 (5%)

Query: 788 LSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKR 825
           L   + +AV  ++G+++W++    V  IE+A +DGT R
Sbjct: 8   LGHPNGLAVDWIEGRLYWTDWGLDV--IEVANLDGTNR 43



 Score = 36.0 bits (84), Expect = 0.003
 Identities = 14/26 (53%), Positives = 15/26 (57%)

Query: 410 WIGKNLYWVESNLDQIEVAKLSGQFR 435
           WI   LYW +  LD IEVA L G  R
Sbjct: 18  WIEGRLYWTDWGLDVIEVANLDGTNR 43



 Score = 31.0 bits (71), Expect = 0.18
 Identities = 10/41 (24%), Positives = 20/41 (48%)

Query: 662 TISIPQVFSPNNLNFDAANKKIYWTDQEYNEVKRSSIVGGQ 702
           T+    +  PN L  D    ++YWTD   + ++ +++ G  
Sbjct: 2   TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42



 Score = 26.4 bits (59), Expect = 6.4
 Identities = 8/27 (29%), Positives = 13/27 (48%), Gaps = 4/27 (14%)

Query: 250 NTIALDFYYTPTANMVFWTDVIDDKIY 276
           N +A+D+        ++WTD   D I 
Sbjct: 12  NGLAVDWIE----GRLYWTDWGLDVIE 34


>gnl|CDD|238060 cd00112, LDLa, Low Density Lipoprotein Receptor Class A domain, a
          cysteine-rich repeat that plays a central role in
          mammalian cholesterol metabolism; the receptor protein
          binds LDL and transports it into cells by endocytosis;
          7 successive cysteine-rich repeats of about 40 amino
          acids are present in the N-terminal of this multidomain
          membrane protein; other homologous domains occur in
          related receptors, including the very low-density
          lipoprotein receptor and the LDL receptor-related
          protein/alpha 2-macroglobulin receptor, and in proteins
          which are functionally unrelated, such as the C9
          component of complement; the binding of calcium is
          required for in vitro formation of the native disulfide
          isomer and is necessary in establishing and maintaining
          the modular structure.
          Length = 35

 Score = 50.3 bits (121), Expect = 2e-08
 Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 3/38 (7%)

Query: 32 CDESHFTCNETLRCIPLAWACDGQPDCGLNDTSDEDHC 69
          C  + F C    RCIP +W CDG+ DCG  D SDE++C
Sbjct: 1  CPPNEFRCA-NGRCIPSSWVCDGEDDCG--DGSDEENC 35



 Score = 46.8 bits (112), Expect = 3e-07
 Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 95  CEPHEFDCQDVLRTCIPKALVCDGEPACFNGEDEMFC 131
           C P+EF C +    CIP + VCDGE  C +G DE  C
Sbjct: 1   CPPNEFRCAN--GRCIPSSWVCDGEDDCGDGSDEENC 35



 Score = 41.0 bits (97), Expect = 4e-05
 Identities = 15/26 (57%), Positives = 19/26 (73%), Gaps = 2/26 (7%)

Query: 2  TCIIAYWKCDGDEDCKDGSDELAEHC 27
           CI + W CDG++DC DGSDE  E+C
Sbjct: 12 RCIPSSWVCDGEDDCGDGSDE--ENC 35


>gnl|CDD|200964 pfam00057, Ldl_recept_a, Low-density lipoprotein receptor domain
          class A. 
          Length = 37

 Score = 48.8 bits (117), Expect = 7e-08
 Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 30 RKCDESHFTCNETLRCIPLAWACDGQPDCGLNDTSDEDHC 69
            C    F C     CIP++W CDG PDC   D SDE +C
Sbjct: 1  STCGPDEFQCGSG-ECIPMSWVCDGDPDCE--DGSDEKNC 37



 Score = 46.9 bits (112), Expect = 3e-07
 Identities = 18/39 (46%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 93  PACEPHEFDCQDVLRTCIPKALVCDGEPACFNGEDEMFC 131
             C P EF C      CIP + VCDG+P C +G DE  C
Sbjct: 1   STCGPDEFQCGS--GECIPMSWVCDGDPDCEDGSDEKNC 37



 Score = 39.6 bits (93), Expect = 1e-04
 Identities = 15/25 (60%), Positives = 18/25 (72%), Gaps = 2/25 (8%)

Query: 3  CIIAYWKCDGDEDCKDGSDELAEHC 27
          CI   W CDGD DC+DGSDE  ++C
Sbjct: 15 CIPMSWVCDGDPDCEDGSDE--KNC 37


>gnl|CDD|215683 pfam00058, Ldl_recept_b, Low-density lipoprotein receptor repeat
           class B.  This domain is also known as the YWTD motif
           after the most conserved region of the repeat. The YWTD
           repeat is found in multiple tandem repeats and has been
           predicted to form a beta-propeller structure.
          Length = 42

 Score = 45.2 bits (108), Expect = 1e-06
 Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 414 NLYWVESNLDQ-IEVAKLSGQFRRTLIAGDMESPRAIALDP 453
            LYW +S+L   I VA L+G  RRTL + D++ P  IA+DP
Sbjct: 2   RLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42



 Score = 42.5 bits (101), Expect = 1e-05
 Identities = 13/47 (27%), Positives = 25/47 (53%), Gaps = 6/47 (12%)

Query: 312 YMFWAFWSLSPSNSSDGGIEKAGMHGENRSPLVTKNIQWPSSLVIDY 358
            ++W   SL  S      I  A ++G +R  L ++++QWP+ + +D 
Sbjct: 2   RLYWTDSSLRAS------ISVADLNGSDRRTLFSEDLQWPNGIAVDP 42



 Score = 41.0 bits (97), Expect = 4e-05
 Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 6/48 (12%)

Query: 456 GLLFWTDWDSKAPRIERCSMSGEHRSVIVRVDLLTSRGAWPNGLTLDY 503
           G L+WTD   +A  I    ++G  R  +   DL      WPNG+ +D 
Sbjct: 1   GRLYWTDSSLRA-SISVADLNGSDRRTLFSEDL-----QWPNGIAVDP 42



 Score = 33.7 bits (78), Expect = 0.016
 Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 3/43 (6%)

Query: 507 RIYWIDAKSD-SIHTVTYDGKDHHVVLSNHDSLTHPFAIALFE 548
           R+YW D+    SI     +G D   + S  + L  P  IA+  
Sbjct: 2   RLYWTDSSLRASISVADLNGSDRRTLFS--EDLQWPNGIAVDP 42



 Score = 31.4 bits (72), Expect = 0.11
 Identities = 13/36 (36%), Positives = 15/36 (41%), Gaps = 1/36 (2%)

Query: 549 NHVYWTDWRTNSVIRANKWNGSDV-TLIQRTLTQPF 583
             +YWTD    + I     NGSD  TL    L  P 
Sbjct: 1   GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPN 36



 Score = 29.4 bits (67), Expect = 0.50
 Identities = 8/31 (25%), Positives = 20/31 (64%), Gaps = 1/31 (3%)

Query: 801 GKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ 831
           G+++W++      +I +A ++G+ R T+ S+
Sbjct: 1   GRLYWTDS-SLRASISVADLNGSDRRTLFSE 30



 Score = 28.7 bits (65), Expect = 1.2
 Identities = 13/45 (28%), Positives = 18/45 (40%), Gaps = 4/45 (8%)

Query: 263 NMVFWTDVI-DDKIYNATLIGNSLSNIDVVVQTGLSTAEGLAVDW 306
             ++WTD      I  A L G   S+   +    L    G+AVD 
Sbjct: 1   GRLYWTDSSLRASISVADLNG---SDRRTLFSEDLQWPNGIAVDP 42


>gnl|CDD|197566 smart00192, LDLa, Low-density lipoprotein receptor domain class
          A.  Cysteine-rich repeat in the low-density lipoprotein
          (LDL) receptor that plays a central role in mammalian
          cholesterol metabolism. The N-terminal type A repeats
          in LDL receptor bind the lipoproteins. Other homologous
          domains occur in related receptors, including the very
          low-density lipoprotein receptor and the LDL
          receptor-related protein/alpha 2-macroglobulin
          receptor, and in proteins which are functionally
          unrelated, such as the C9 component of complement.
          Mutations in the LDL receptor gene cause familial
          hypercholesterolemia.
          Length = 33

 Score = 42.2 bits (100), Expect = 1e-05
 Identities = 18/36 (50%), Positives = 20/36 (55%), Gaps = 3/36 (8%)

Query: 31 KCDESHFTCNETLRCIPLAWACDGQPDCGLNDTSDE 66
           C    F C+   RCIP +W CDG  DCG  D SDE
Sbjct: 1  TCPPGEFQCDNG-RCIPSSWVCDGVDDCG--DGSDE 33



 Score = 42.2 bits (100), Expect = 1e-05
 Identities = 16/34 (47%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 95  CEPHEFDCQDVLRTCIPKALVCDGEPACFNGEDE 128
           C P EF C +    CIP + VCDG   C +G DE
Sbjct: 2   CPPGEFQCDN--GRCIPSSWVCDGVDDCGDGSDE 33



 Score = 39.2 bits (92), Expect = 1e-04
 Identities = 13/21 (61%), Positives = 15/21 (71%)

Query: 2  TCIIAYWKCDGDEDCKDGSDE 22
           CI + W CDG +DC DGSDE
Sbjct: 13 RCIPSSWVCDGVDDCGDGSDE 33


>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or complement
           Clr-like EGF, domains have six conserved cysteine
           residues disulfide-bonded into the characteristic
           pattern 'ababcc'. They are found in blood coagulation
           proteins such as fibrillin, Clr and Cls, thrombomodulin,
           and the LDL receptor. The core fold of the EGF domain
           consists of two small beta-hairpins packed against each
           other. Two major structural variants have been
           identified based on the structural context of the
           C-terminal cysteine residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
           C-terminal thiol resides on the C-terminal beta-sheet,
           resulting in long loop-lengths between the cysteine
           residues of disulfide 'c', typically C[10+]XC. These
           longer loop-lengths may have arisen by selective
           cysteine loss from a four-disulfide EGF template such as
           laminin or integrin. Tandem cEGF domains have five
           linking residues between terminal cysteines of adjacent
           domains. cEGF domains may or may not bind calcium in the
           linker region. cEGF domains with the consensus motif
           CXN4X[F,Y]XCXC are hydroxylated exclusively on the
           asparagine residue.
          Length = 24

 Score = 35.1 bits (82), Expect = 0.004
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 193 SYKCTCHAGYELEPDMFTCK 212
           SY C+C  GY+L  D  TC+
Sbjct: 1   SYTCSCPPGYQLSGDGRTCE 20



 Score = 27.0 bits (61), Expect = 2.6
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query: 614 YKCQCPHVMKLMSDNKTCQ 632
           Y C CP   +L  D +TC+
Sbjct: 2   YTCSCPPGYQLSGDGRTCE 20


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 34.4 bits (79), Expect = 0.008
 Identities = 12/37 (32%), Positives = 16/37 (43%), Gaps = 6/37 (16%)

Query: 169 CSYLHPCEAWGTCSQDCEPSVSNNSYKCTCHAGYELE 205
           C+  +PC   GTC           SY+C C  GY  +
Sbjct: 2   CAASNPCSNGGTCVNTP------GSYRCVCPPGYTGD 32



 Score = 27.1 bits (60), Expect = 3.3
 Identities = 11/25 (44%), Positives = 14/25 (56%), Gaps = 3/25 (12%)

Query: 136 CLRSD-CHHD--CYNTPEGFMCVCP 157
           C  S+ C +   C NTP  + CVCP
Sbjct: 2   CAASNPCSNGGTCVNTPGSYRCVCP 26


>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
           extracellular matrix molecules mediate cell-matrix and
           matrix-matrix interactions thereby providing tissue
           integrity. Some members of the matrilin family are
           expressed specifically in developing cartilage
           rudiments. The matrilin family consists of at least four
           members. All the members of the matrilin family contain
           VWA domains, EGF-like domains and a heptad repeat
           coiled-coiled domain at the carboxy terminus which is
           responsible for the oligomerization of the matrilins.
           The VWA domains have been shown to be essential for
           matrilin network formation by interacting with matrix
           ligands.
          Length = 224

 Score = 35.4 bits (82), Expect = 0.082
 Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 1/41 (2%)

Query: 590 VKKSPCEMNNGNCSHLCLIGMNGTYKCQCPHVMKLMSDNKT 630
           V    C   +  C  +C+    G+Y C C     L+ DNKT
Sbjct: 185 VVPDLCATLSHVCQQVCISTP-GSYLCACTEGYALLEDNKT 224



 Score = 29.3 bits (66), Expect = 6.5
 Identities = 13/40 (32%), Positives = 14/40 (35%), Gaps = 3/40 (7%)

Query: 169 CSYLHPC-EAWGTCSQDCEPSVSNNSYKCTCHAGYELEPD 207
           C     C      C Q C  +    SY C C  GY L  D
Sbjct: 184 CVVPDLCATLSHVCQQVCISTPG--SYLCACTEGYALLED 221


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 30.7 bits (70), Expect = 0.19
 Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 6/36 (16%)

Query: 169 CSYLHPCEAWGTCSQDCEPSVSNNSYKCTCHAGYEL 204
           C+  +PC+  GTC        +  SY+C+C  GY  
Sbjct: 5   CASGNPCQNGGTCVN------TVGSYRCSCPPGYTG 34



 Score = 26.8 bits (60), Expect = 4.4
 Identities = 6/16 (37%), Positives = 7/16 (43%)

Query: 142 HHDCYNTPEGFMCVCP 157
              C NT   + C CP
Sbjct: 14  GGTCVNTVGSYRCSCP 29


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 30.7 bits (70), Expect = 0.20
 Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 6/35 (17%)

Query: 169 CSYLHPCEAWGTCSQDCEPSVSNNSYKCTCHAGYE 203
           C+  +PC+  GTC        +  SY+C C  GY 
Sbjct: 5   CASGNPCQNGGTCVN------TVGSYRCECPPGYT 33



 Score = 26.4 bits (59), Expect = 5.8
 Identities = 10/29 (34%), Positives = 13/29 (44%), Gaps = 5/29 (17%)

Query: 132 DEKQCLRSD-CHHD--CYNTPEGFMCVCP 157
           DE  C   + C +   C NT   + C CP
Sbjct: 3   DE--CASGNPCQNGGTCVNTVGSYRCECP 29


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 29.8 bits (67), Expect = 0.39
 Identities = 11/35 (31%), Positives = 15/35 (42%), Gaps = 3/35 (8%)

Query: 174 PCEAWGTCSQD-CEPSVSNNSYKCTCHAGYELEPD 207
            C + G CS   C  +    SY C+C  GY  +  
Sbjct: 1   ECASGGPCSNGTCINTP--GSYTCSCPPGYTGDKR 33



 Score = 29.0 bits (65), Expect = 0.69
 Identities = 8/29 (27%), Positives = 10/29 (34%), Gaps = 1/29 (3%)

Query: 138 RSDCHHD-CYNTPEGFMCVCPRDMYLQPD 165
              C +  C NTP  + C CP        
Sbjct: 5   GGPCSNGTCINTPGSYTCSCPPGYTGDKR 33


>gnl|CDD|163448 TIGR03736, PRTRC_ThiF, PRTRC system ThiF family protein.  A novel
           genetic system characterized by six major proteins,
           included a ParB homolog and a ThiF homolog, is
           designated PRTRC, or ParB-Related,ThiF-Related Cassette.
           This family is the PRTRC system ThiF family protein.
          Length = 244

 Score = 32.3 bits (74), Expect = 0.75
 Identities = 22/72 (30%), Positives = 28/72 (38%), Gaps = 1/72 (1%)

Query: 441 GDMESPRAIALDPRFGLLFWTDWDSKAPRIERCSMSGEHRSVIVRVDLLTSRGAWPNGLT 500
            D+   +AI L  R      TDW +   R+ER S       VI  VD   +R A      
Sbjct: 69  ADVGQNKAIVLVNRLNQAMGTDWTAHPERVERSSTLHRPDIVIGCVDNRAARLAILRAFE 128

Query: 501 LDYQLTRIYWID 512
             Y     YW+D
Sbjct: 129 GGYSGYA-YWLD 139


>gnl|CDD|225926 COG3391, COG3391, Uncharacterized conserved protein [Function
           unknown].
          Length = 381

 Score = 31.4 bits (71), Expect = 2.3
 Identities = 21/108 (19%), Positives = 35/108 (32%), Gaps = 27/108 (25%)

Query: 659 TIPTISIPQVFSPNNLNFDAANKKIYWTDQEYNEVKRSSIVGGQIQLVVDAVFEFYLTPL 718
           T+          P  +  + A  K+Y T  + N V            V+D      L   
Sbjct: 64  TVTQSLSVGGVYPAGVAVNPAGNKVYVTTGDSNTV-----------SVIDTATNTVLG-- 110

Query: 719 SKRSSIVGGQIQLVVDAGISHPSGLAIDWIAQNMFIAVSSPTQSKIVV 766
              S  VG             P GLA+D   + +++A +    + + V
Sbjct: 111 ---SIPVGLG-----------PVGLAVDPDGKYVYVANAGNGNNTVSV 144


>gnl|CDD|225785 COG3246, COG3246, Uncharacterized conserved protein [Function
           unknown].
          Length = 298

 Score = 30.8 bits (70), Expect = 2.5
 Identities = 21/67 (31%), Positives = 28/67 (41%), Gaps = 12/67 (17%)

Query: 391 VLGSDSRVGIPNCRPEDNLWIGKNLYWVESNLDQIEVAKLSGQFRRTLIAGDMESPRAIA 450
           +LG + RVG+     EDNLW+ K     +SN D +E A      RR + A   E      
Sbjct: 241 LLGGNVRVGL-----EDNLWLPKGSL-AKSNADLVERA------RRIVEALGAEIATPAE 288

Query: 451 LDPRFGL 457
                 L
Sbjct: 289 AREILAL 295


>gnl|CDD|179177 PRK00945, PRK00945, acetyl-CoA decarbonylase/synthase complex
           subunit epsilon; Provisional.
          Length = 171

 Score = 30.4 bits (69), Expect = 2.6
 Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 5/27 (18%)

Query: 245 ISSLKN-----TIALDFYYTPTANMVF 266
           +S+LK+     TI +D YY P A+M F
Sbjct: 126 LSALKHFSPLKTITIDRYYHPNADMSF 152


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 27.4 bits (61), Expect = 2.9
 Identities = 12/34 (35%), Positives = 14/34 (41%), Gaps = 6/34 (17%)

Query: 169 CSYLHPCEAWGTCSQDCEPSVSNNSYKCTCHAGY 202
           CS  +PC   GTC        +   Y C C  GY
Sbjct: 1   CSPNNPCSNGGTCVD------TPGGYTCECPEGY 28


>gnl|CDD|185659 PTZ00481, PTZ00481, Membrane attack complex/ Perforin (MACPF)
           Superfamily; Provisional.
          Length = 524

 Score = 30.5 bits (69), Expect = 4.1
 Identities = 10/21 (47%), Positives = 12/21 (57%), Gaps = 2/21 (9%)

Query: 139 SDCHHDCYNTPEG--FMCVCP 157
           +DC    YN+P G  F C CP
Sbjct: 502 TDCIPSVYNSPNGQVFQCSCP 522


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 26.9 bits (60), Expect = 4.1
 Identities = 13/41 (31%), Positives = 15/41 (36%), Gaps = 10/41 (24%)

Query: 173 HPCEAWGTCSQDCEPSVSNN--SYKCTCHAGYELEPDMFTC 211
           H C A   C         N   S++C C  GYE   D   C
Sbjct: 10  HNCPANTVCV--------NTIGSFECVCPDGYENNEDGTNC 42


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 26.7 bits (60), Expect = 4.3
 Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 3/38 (7%)

Query: 175 CEAW-GTCSQDCEPSVSNNSYKCTCHAGYELEPDMFTC 211
           C    G C  +   + +  S+ CTC +GY    D  TC
Sbjct: 1   CAENNGGCHPNATCTNTGGSFTCTCKSGYTG--DGVTC 36


>gnl|CDD|235564 PRK05691, PRK05691, peptide synthase; Validated.
          Length = 4334

 Score = 30.9 bits (70), Expect = 4.3
 Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 19/81 (23%)

Query: 394  SDSRVGIP---NCRPEDNLWIGK--NLYWVESNLD-QIEVAKLSGQFRRTLIAGD----- 442
             D R+G P     RPE    IG   N   +   LD Q+ V++L  Q R+T+I G      
Sbjct: 1994 RDLRIGAPVANRIRPESEGLIGAFLNTQVLRCQLDGQMSVSELLEQVRQTVIEGQSHQDL 2053

Query: 443  --------MESPRAIALDPRF 455
                    ++ PR+ A +P F
Sbjct: 2054 PFDHLVEALQPPRSAAYNPLF 2074


>gnl|CDD|163712 cd08156, catalase_clade_3, Clade 3 of the heme-binding enzyme
           catalase.  Catalase is a ubiquitous enzyme found in both
           prokaryotes and eukaryotes, which is involved in the
           protection of cells from the toxic effects of peroxides.
           It catalyzes the conversion of hydrogen peroxide to
           water and molecular oxygen. Catalases also utilize
           hydrogen peroxide to oxidize various substrates such as
           alcohol or phenols. Clade 3 catalases are the most
           abundant subfamily and are found in all three kingdoms
           of life; they have a relatively small subunit size of 43
           to 75 kDa, and bind a protoheme IX (heme b) group buried
           deep inside the structure. Clade 3 catalases also bind
           NADPH as a second redox-active cofactor. They form
           tetramers, and in eukaryotic cells, catalases are
           located in peroxisomes.
          Length = 429

 Score = 29.4 bits (67), Expect = 8.6
 Identities = 15/34 (44%), Positives = 17/34 (50%), Gaps = 9/34 (26%)

Query: 313 MFWAFWSLSPSNS-------SDGGIEKA--GMHG 337
           MFW FWSLSP +        SD GI      M+G
Sbjct: 114 MFWDFWSLSPESLHQVTILFSDRGIPDGYRHMNG 147


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.134    0.435 

Gapped
Lambda     K      H
   0.267   0.0761    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,089,929
Number of extensions: 3882787
Number of successful extensions: 2548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2535
Number of HSP's successfully gapped: 58
Length of query: 841
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 736
Effective length of database: 6,280,432
Effective search space: 4622397952
Effective search space used: 4622397952
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.0 bits)