RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14257
(841 letters)
>gnl|CDD|214531 smart00135, LY, Low-density lipoprotein-receptor YWTD domain. Type
"B" repeats in low-density lipoprotein (LDL) receptor
that plays a central role in mammalian cholesterol
metabolism. Also present in a variety of molecules
similar to gp300/megalin.
Length = 43
Score = 53.0 bits (128), Expect = 3e-09
Identities = 16/45 (35%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 436 RTLIAGDMESPRAIALDPRFGLLFWTDWDSKAPRIERCSMSGEHR 480
RTL++ + P +A+D G L+WTDW IE ++ G +R
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEGRLYWTDW--GLDVIEVANLDGTNR 43
Score = 47.2 bits (113), Expect = 3e-07
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 343 LVTKNIQWPSSLVIDYNQDILYWCDSHLNTVESVRVDGSER 383
L++ + P+ L +D+ + LYW D L+ +E +DG+ R
Sbjct: 3 LLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
Score = 46.1 bits (110), Expect = 7e-07
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 488 LLTSRGAWPNGLTLDYQLTRIYWIDAKSDSIHTVTYDGKDH 528
LL+S PNGL +D+ R+YW D D I DG +
Sbjct: 3 LLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
Score = 42.6 bits (101), Expect = 1e-05
Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 1/44 (2%)
Query: 529 HVVLSNHDSLTHPFAIALFENHVYWTDWRTNSVIRANKWNGSDV 572
+LS+ + A+ E +YWTDW + VI +G++
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLD-VIEVANLDGTNR 43
Score = 38.4 bits (90), Expect = 4e-04
Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 9/51 (17%)
Query: 290 VVVQTGLSTAEGLAVDWIGKNLYMFWAFWSLSPSNSSDGGIEKAGMHGENR 340
++ +GL GLAVDWI LY W ++ IE A + G NR
Sbjct: 2 TLLSSGLGHPNGLAVDWIEGRLY--W-------TDWGLDVIEVANLDGTNR 43
Score = 36.8 bits (86), Expect = 0.001
Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 2/40 (5%)
Query: 732 VVDAGISHPSGLAIDWIAQNMFIAVSSPTQSKIVVCNLEG 771
++ +G+ HP+GLA+DWI ++ + I V NL+G
Sbjct: 3 LLSSGLGHPNGLAVDWIEGRLYW--TDWGLDVIEVANLDG 40
Score = 36.4 bits (85), Expect = 0.002
Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 2/38 (5%)
Query: 788 LSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKR 825
L + +AV ++G+++W++ V IE+A +DGT R
Sbjct: 8 LGHPNGLAVDWIEGRLYWTDWGLDV--IEVANLDGTNR 43
Score = 36.0 bits (84), Expect = 0.003
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 410 WIGKNLYWVESNLDQIEVAKLSGQFR 435
WI LYW + LD IEVA L G R
Sbjct: 18 WIEGRLYWTDWGLDVIEVANLDGTNR 43
Score = 31.0 bits (71), Expect = 0.18
Identities = 10/41 (24%), Positives = 20/41 (48%)
Query: 662 TISIPQVFSPNNLNFDAANKKIYWTDQEYNEVKRSSIVGGQ 702
T+ + PN L D ++YWTD + ++ +++ G
Sbjct: 2 TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42
Score = 26.4 bits (59), Expect = 6.4
Identities = 8/27 (29%), Positives = 13/27 (48%), Gaps = 4/27 (14%)
Query: 250 NTIALDFYYTPTANMVFWTDVIDDKIY 276
N +A+D+ ++WTD D I
Sbjct: 12 NGLAVDWIE----GRLYWTDWGLDVIE 34
>gnl|CDD|238060 cd00112, LDLa, Low Density Lipoprotein Receptor Class A domain, a
cysteine-rich repeat that plays a central role in
mammalian cholesterol metabolism; the receptor protein
binds LDL and transports it into cells by endocytosis;
7 successive cysteine-rich repeats of about 40 amino
acids are present in the N-terminal of this multidomain
membrane protein; other homologous domains occur in
related receptors, including the very low-density
lipoprotein receptor and the LDL receptor-related
protein/alpha 2-macroglobulin receptor, and in proteins
which are functionally unrelated, such as the C9
component of complement; the binding of calcium is
required for in vitro formation of the native disulfide
isomer and is necessary in establishing and maintaining
the modular structure.
Length = 35
Score = 50.3 bits (121), Expect = 2e-08
Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 3/38 (7%)
Query: 32 CDESHFTCNETLRCIPLAWACDGQPDCGLNDTSDEDHC 69
C + F C RCIP +W CDG+ DCG D SDE++C
Sbjct: 1 CPPNEFRCA-NGRCIPSSWVCDGEDDCG--DGSDEENC 35
Score = 46.8 bits (112), Expect = 3e-07
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 95 CEPHEFDCQDVLRTCIPKALVCDGEPACFNGEDEMFC 131
C P+EF C + CIP + VCDGE C +G DE C
Sbjct: 1 CPPNEFRCAN--GRCIPSSWVCDGEDDCGDGSDEENC 35
Score = 41.0 bits (97), Expect = 4e-05
Identities = 15/26 (57%), Positives = 19/26 (73%), Gaps = 2/26 (7%)
Query: 2 TCIIAYWKCDGDEDCKDGSDELAEHC 27
CI + W CDG++DC DGSDE E+C
Sbjct: 12 RCIPSSWVCDGEDDCGDGSDE--ENC 35
>gnl|CDD|200964 pfam00057, Ldl_recept_a, Low-density lipoprotein receptor domain
class A.
Length = 37
Score = 48.8 bits (117), Expect = 7e-08
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 30 RKCDESHFTCNETLRCIPLAWACDGQPDCGLNDTSDEDHC 69
C F C CIP++W CDG PDC D SDE +C
Sbjct: 1 STCGPDEFQCGSG-ECIPMSWVCDGDPDCE--DGSDEKNC 37
Score = 46.9 bits (112), Expect = 3e-07
Identities = 18/39 (46%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 93 PACEPHEFDCQDVLRTCIPKALVCDGEPACFNGEDEMFC 131
C P EF C CIP + VCDG+P C +G DE C
Sbjct: 1 STCGPDEFQCGS--GECIPMSWVCDGDPDCEDGSDEKNC 37
Score = 39.6 bits (93), Expect = 1e-04
Identities = 15/25 (60%), Positives = 18/25 (72%), Gaps = 2/25 (8%)
Query: 3 CIIAYWKCDGDEDCKDGSDELAEHC 27
CI W CDGD DC+DGSDE ++C
Sbjct: 15 CIPMSWVCDGDPDCEDGSDE--KNC 37
>gnl|CDD|215683 pfam00058, Ldl_recept_b, Low-density lipoprotein receptor repeat
class B. This domain is also known as the YWTD motif
after the most conserved region of the repeat. The YWTD
repeat is found in multiple tandem repeats and has been
predicted to form a beta-propeller structure.
Length = 42
Score = 45.2 bits (108), Expect = 1e-06
Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
Query: 414 NLYWVESNLDQ-IEVAKLSGQFRRTLIAGDMESPRAIALDP 453
LYW +S+L I VA L+G RRTL + D++ P IA+DP
Sbjct: 2 RLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42
Score = 42.5 bits (101), Expect = 1e-05
Identities = 13/47 (27%), Positives = 25/47 (53%), Gaps = 6/47 (12%)
Query: 312 YMFWAFWSLSPSNSSDGGIEKAGMHGENRSPLVTKNIQWPSSLVIDY 358
++W SL S I A ++G +R L ++++QWP+ + +D
Sbjct: 2 RLYWTDSSLRAS------ISVADLNGSDRRTLFSEDLQWPNGIAVDP 42
Score = 41.0 bits (97), Expect = 4e-05
Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 6/48 (12%)
Query: 456 GLLFWTDWDSKAPRIERCSMSGEHRSVIVRVDLLTSRGAWPNGLTLDY 503
G L+WTD +A I ++G R + DL WPNG+ +D
Sbjct: 1 GRLYWTDSSLRA-SISVADLNGSDRRTLFSEDL-----QWPNGIAVDP 42
Score = 33.7 bits (78), Expect = 0.016
Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 3/43 (6%)
Query: 507 RIYWIDAKSD-SIHTVTYDGKDHHVVLSNHDSLTHPFAIALFE 548
R+YW D+ SI +G D + S + L P IA+
Sbjct: 2 RLYWTDSSLRASISVADLNGSDRRTLFS--EDLQWPNGIAVDP 42
Score = 31.4 bits (72), Expect = 0.11
Identities = 13/36 (36%), Positives = 15/36 (41%), Gaps = 1/36 (2%)
Query: 549 NHVYWTDWRTNSVIRANKWNGSDV-TLIQRTLTQPF 583
+YWTD + I NGSD TL L P
Sbjct: 1 GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPN 36
Score = 29.4 bits (67), Expect = 0.50
Identities = 8/31 (25%), Positives = 20/31 (64%), Gaps = 1/31 (3%)
Query: 801 GKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ 831
G+++W++ +I +A ++G+ R T+ S+
Sbjct: 1 GRLYWTDS-SLRASISVADLNGSDRRTLFSE 30
Score = 28.7 bits (65), Expect = 1.2
Identities = 13/45 (28%), Positives = 18/45 (40%), Gaps = 4/45 (8%)
Query: 263 NMVFWTDVI-DDKIYNATLIGNSLSNIDVVVQTGLSTAEGLAVDW 306
++WTD I A L G S+ + L G+AVD
Sbjct: 1 GRLYWTDSSLRASISVADLNG---SDRRTLFSEDLQWPNGIAVDP 42
>gnl|CDD|197566 smart00192, LDLa, Low-density lipoprotein receptor domain class
A. Cysteine-rich repeat in the low-density lipoprotein
(LDL) receptor that plays a central role in mammalian
cholesterol metabolism. The N-terminal type A repeats
in LDL receptor bind the lipoproteins. Other homologous
domains occur in related receptors, including the very
low-density lipoprotein receptor and the LDL
receptor-related protein/alpha 2-macroglobulin
receptor, and in proteins which are functionally
unrelated, such as the C9 component of complement.
Mutations in the LDL receptor gene cause familial
hypercholesterolemia.
Length = 33
Score = 42.2 bits (100), Expect = 1e-05
Identities = 18/36 (50%), Positives = 20/36 (55%), Gaps = 3/36 (8%)
Query: 31 KCDESHFTCNETLRCIPLAWACDGQPDCGLNDTSDE 66
C F C+ RCIP +W CDG DCG D SDE
Sbjct: 1 TCPPGEFQCDNG-RCIPSSWVCDGVDDCG--DGSDE 33
Score = 42.2 bits (100), Expect = 1e-05
Identities = 16/34 (47%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 95 CEPHEFDCQDVLRTCIPKALVCDGEPACFNGEDE 128
C P EF C + CIP + VCDG C +G DE
Sbjct: 2 CPPGEFQCDN--GRCIPSSWVCDGVDDCGDGSDE 33
Score = 39.2 bits (92), Expect = 1e-04
Identities = 13/21 (61%), Positives = 15/21 (71%)
Query: 2 TCIIAYWKCDGDEDCKDGSDE 22
CI + W CDG +DC DGSDE
Sbjct: 13 RCIPSSWVCDGVDDCGDGSDE 33
>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like. cEGF, or complement
Clr-like EGF, domains have six conserved cysteine
residues disulfide-bonded into the characteristic
pattern 'ababcc'. They are found in blood coagulation
proteins such as fibrillin, Clr and Cls, thrombomodulin,
and the LDL receptor. The core fold of the EGF domain
consists of two small beta-hairpins packed against each
other. Two major structural variants have been
identified based on the structural context of the
C-terminal cysteine residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
C-terminal thiol resides on the C-terminal beta-sheet,
resulting in long loop-lengths between the cysteine
residues of disulfide 'c', typically C[10+]XC. These
longer loop-lengths may have arisen by selective
cysteine loss from a four-disulfide EGF template such as
laminin or integrin. Tandem cEGF domains have five
linking residues between terminal cysteines of adjacent
domains. cEGF domains may or may not bind calcium in the
linker region. cEGF domains with the consensus motif
CXN4X[F,Y]XCXC are hydroxylated exclusively on the
asparagine residue.
Length = 24
Score = 35.1 bits (82), Expect = 0.004
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 193 SYKCTCHAGYELEPDMFTCK 212
SY C+C GY+L D TC+
Sbjct: 1 SYTCSCPPGYQLSGDGRTCE 20
Score = 27.0 bits (61), Expect = 2.6
Identities = 8/19 (42%), Positives = 11/19 (57%)
Query: 614 YKCQCPHVMKLMSDNKTCQ 632
Y C CP +L D +TC+
Sbjct: 2 YTCSCPPGYQLSGDGRTCE 20
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 34.4 bits (79), Expect = 0.008
Identities = 12/37 (32%), Positives = 16/37 (43%), Gaps = 6/37 (16%)
Query: 169 CSYLHPCEAWGTCSQDCEPSVSNNSYKCTCHAGYELE 205
C+ +PC GTC SY+C C GY +
Sbjct: 2 CAASNPCSNGGTCVNTP------GSYRCVCPPGYTGD 32
Score = 27.1 bits (60), Expect = 3.3
Identities = 11/25 (44%), Positives = 14/25 (56%), Gaps = 3/25 (12%)
Query: 136 CLRSD-CHHD--CYNTPEGFMCVCP 157
C S+ C + C NTP + CVCP
Sbjct: 2 CAASNPCSNGGTCVNTPGSYRCVCP 26
>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
extracellular matrix molecules mediate cell-matrix and
matrix-matrix interactions thereby providing tissue
integrity. Some members of the matrilin family are
expressed specifically in developing cartilage
rudiments. The matrilin family consists of at least four
members. All the members of the matrilin family contain
VWA domains, EGF-like domains and a heptad repeat
coiled-coiled domain at the carboxy terminus which is
responsible for the oligomerization of the matrilins.
The VWA domains have been shown to be essential for
matrilin network formation by interacting with matrix
ligands.
Length = 224
Score = 35.4 bits (82), Expect = 0.082
Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 1/41 (2%)
Query: 590 VKKSPCEMNNGNCSHLCLIGMNGTYKCQCPHVMKLMSDNKT 630
V C + C +C+ G+Y C C L+ DNKT
Sbjct: 185 VVPDLCATLSHVCQQVCISTP-GSYLCACTEGYALLEDNKT 224
Score = 29.3 bits (66), Expect = 6.5
Identities = 13/40 (32%), Positives = 14/40 (35%), Gaps = 3/40 (7%)
Query: 169 CSYLHPC-EAWGTCSQDCEPSVSNNSYKCTCHAGYELEPD 207
C C C Q C + SY C C GY L D
Sbjct: 184 CVVPDLCATLSHVCQQVCISTPG--SYLCACTEGYALLED 221
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 30.7 bits (70), Expect = 0.19
Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 6/36 (16%)
Query: 169 CSYLHPCEAWGTCSQDCEPSVSNNSYKCTCHAGYEL 204
C+ +PC+ GTC + SY+C+C GY
Sbjct: 5 CASGNPCQNGGTCVN------TVGSYRCSCPPGYTG 34
Score = 26.8 bits (60), Expect = 4.4
Identities = 6/16 (37%), Positives = 7/16 (43%)
Query: 142 HHDCYNTPEGFMCVCP 157
C NT + C CP
Sbjct: 14 GGTCVNTVGSYRCSCP 29
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 30.7 bits (70), Expect = 0.20
Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 6/35 (17%)
Query: 169 CSYLHPCEAWGTCSQDCEPSVSNNSYKCTCHAGYE 203
C+ +PC+ GTC + SY+C C GY
Sbjct: 5 CASGNPCQNGGTCVN------TVGSYRCECPPGYT 33
Score = 26.4 bits (59), Expect = 5.8
Identities = 10/29 (34%), Positives = 13/29 (44%), Gaps = 5/29 (17%)
Query: 132 DEKQCLRSD-CHHD--CYNTPEGFMCVCP 157
DE C + C + C NT + C CP
Sbjct: 3 DE--CASGNPCQNGGTCVNTVGSYRCECP 29
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 29.8 bits (67), Expect = 0.39
Identities = 11/35 (31%), Positives = 15/35 (42%), Gaps = 3/35 (8%)
Query: 174 PCEAWGTCSQD-CEPSVSNNSYKCTCHAGYELEPD 207
C + G CS C + SY C+C GY +
Sbjct: 1 ECASGGPCSNGTCINTP--GSYTCSCPPGYTGDKR 33
Score = 29.0 bits (65), Expect = 0.69
Identities = 8/29 (27%), Positives = 10/29 (34%), Gaps = 1/29 (3%)
Query: 138 RSDCHHD-CYNTPEGFMCVCPRDMYLQPD 165
C + C NTP + C CP
Sbjct: 5 GGPCSNGTCINTPGSYTCSCPPGYTGDKR 33
>gnl|CDD|163448 TIGR03736, PRTRC_ThiF, PRTRC system ThiF family protein. A novel
genetic system characterized by six major proteins,
included a ParB homolog and a ThiF homolog, is
designated PRTRC, or ParB-Related,ThiF-Related Cassette.
This family is the PRTRC system ThiF family protein.
Length = 244
Score = 32.3 bits (74), Expect = 0.75
Identities = 22/72 (30%), Positives = 28/72 (38%), Gaps = 1/72 (1%)
Query: 441 GDMESPRAIALDPRFGLLFWTDWDSKAPRIERCSMSGEHRSVIVRVDLLTSRGAWPNGLT 500
D+ +AI L R TDW + R+ER S VI VD +R A
Sbjct: 69 ADVGQNKAIVLVNRLNQAMGTDWTAHPERVERSSTLHRPDIVIGCVDNRAARLAILRAFE 128
Query: 501 LDYQLTRIYWID 512
Y YW+D
Sbjct: 129 GGYSGYA-YWLD 139
>gnl|CDD|225926 COG3391, COG3391, Uncharacterized conserved protein [Function
unknown].
Length = 381
Score = 31.4 bits (71), Expect = 2.3
Identities = 21/108 (19%), Positives = 35/108 (32%), Gaps = 27/108 (25%)
Query: 659 TIPTISIPQVFSPNNLNFDAANKKIYWTDQEYNEVKRSSIVGGQIQLVVDAVFEFYLTPL 718
T+ P + + A K+Y T + N V V+D L
Sbjct: 64 TVTQSLSVGGVYPAGVAVNPAGNKVYVTTGDSNTV-----------SVIDTATNTVLG-- 110
Query: 719 SKRSSIVGGQIQLVVDAGISHPSGLAIDWIAQNMFIAVSSPTQSKIVV 766
S VG P GLA+D + +++A + + + V
Sbjct: 111 ---SIPVGLG-----------PVGLAVDPDGKYVYVANAGNGNNTVSV 144
>gnl|CDD|225785 COG3246, COG3246, Uncharacterized conserved protein [Function
unknown].
Length = 298
Score = 30.8 bits (70), Expect = 2.5
Identities = 21/67 (31%), Positives = 28/67 (41%), Gaps = 12/67 (17%)
Query: 391 VLGSDSRVGIPNCRPEDNLWIGKNLYWVESNLDQIEVAKLSGQFRRTLIAGDMESPRAIA 450
+LG + RVG+ EDNLW+ K +SN D +E A RR + A E
Sbjct: 241 LLGGNVRVGL-----EDNLWLPKGSL-AKSNADLVERA------RRIVEALGAEIATPAE 288
Query: 451 LDPRFGL 457
L
Sbjct: 289 AREILAL 295
>gnl|CDD|179177 PRK00945, PRK00945, acetyl-CoA decarbonylase/synthase complex
subunit epsilon; Provisional.
Length = 171
Score = 30.4 bits (69), Expect = 2.6
Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 5/27 (18%)
Query: 245 ISSLKN-----TIALDFYYTPTANMVF 266
+S+LK+ TI +D YY P A+M F
Sbjct: 126 LSALKHFSPLKTITIDRYYHPNADMSF 152
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar, but
has 8 instead of 6 conserved cysteines. Includes some
cytokine receptors. The EGF domain misses the N-terminus
regions of the Ca2+ binding EGF domains (this is the
main reason of discrepancy between swiss-prot domain
start/end and Pfam). The family is hard to model due to
many similar but different sub-types of EGF domains.
Pfam certainly misses a number of EGF domains.
Length = 32
Score = 27.4 bits (61), Expect = 2.9
Identities = 12/34 (35%), Positives = 14/34 (41%), Gaps = 6/34 (17%)
Query: 169 CSYLHPCEAWGTCSQDCEPSVSNNSYKCTCHAGY 202
CS +PC GTC + Y C C GY
Sbjct: 1 CSPNNPCSNGGTCVD------TPGGYTCECPEGY 28
>gnl|CDD|185659 PTZ00481, PTZ00481, Membrane attack complex/ Perforin (MACPF)
Superfamily; Provisional.
Length = 524
Score = 30.5 bits (69), Expect = 4.1
Identities = 10/21 (47%), Positives = 12/21 (57%), Gaps = 2/21 (9%)
Query: 139 SDCHHDCYNTPEG--FMCVCP 157
+DC YN+P G F C CP
Sbjct: 502 TDCIPSVYNSPNGQVFQCSCP 522
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 26.9 bits (60), Expect = 4.1
Identities = 13/41 (31%), Positives = 15/41 (36%), Gaps = 10/41 (24%)
Query: 173 HPCEAWGTCSQDCEPSVSNN--SYKCTCHAGYELEPDMFTC 211
H C A C N S++C C GYE D C
Sbjct: 10 HNCPANTVCV--------NTIGSFECVCPDGYENNEDGTNC 42
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 26.7 bits (60), Expect = 4.3
Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 3/38 (7%)
Query: 175 CEAW-GTCSQDCEPSVSNNSYKCTCHAGYELEPDMFTC 211
C G C + + + S+ CTC +GY D TC
Sbjct: 1 CAENNGGCHPNATCTNTGGSFTCTCKSGYTG--DGVTC 36
>gnl|CDD|235564 PRK05691, PRK05691, peptide synthase; Validated.
Length = 4334
Score = 30.9 bits (70), Expect = 4.3
Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 19/81 (23%)
Query: 394 SDSRVGIP---NCRPEDNLWIGK--NLYWVESNLD-QIEVAKLSGQFRRTLIAGD----- 442
D R+G P RPE IG N + LD Q+ V++L Q R+T+I G
Sbjct: 1994 RDLRIGAPVANRIRPESEGLIGAFLNTQVLRCQLDGQMSVSELLEQVRQTVIEGQSHQDL 2053
Query: 443 --------MESPRAIALDPRF 455
++ PR+ A +P F
Sbjct: 2054 PFDHLVEALQPPRSAAYNPLF 2074
>gnl|CDD|163712 cd08156, catalase_clade_3, Clade 3 of the heme-binding enzyme
catalase. Catalase is a ubiquitous enzyme found in both
prokaryotes and eukaryotes, which is involved in the
protection of cells from the toxic effects of peroxides.
It catalyzes the conversion of hydrogen peroxide to
water and molecular oxygen. Catalases also utilize
hydrogen peroxide to oxidize various substrates such as
alcohol or phenols. Clade 3 catalases are the most
abundant subfamily and are found in all three kingdoms
of life; they have a relatively small subunit size of 43
to 75 kDa, and bind a protoheme IX (heme b) group buried
deep inside the structure. Clade 3 catalases also bind
NADPH as a second redox-active cofactor. They form
tetramers, and in eukaryotic cells, catalases are
located in peroxisomes.
Length = 429
Score = 29.4 bits (67), Expect = 8.6
Identities = 15/34 (44%), Positives = 17/34 (50%), Gaps = 9/34 (26%)
Query: 313 MFWAFWSLSPSNS-------SDGGIEKA--GMHG 337
MFW FWSLSP + SD GI M+G
Sbjct: 114 MFWDFWSLSPESLHQVTILFSDRGIPDGYRHMNG 147
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.134 0.435
Gapped
Lambda K H
0.267 0.0761 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,089,929
Number of extensions: 3882787
Number of successful extensions: 2548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2535
Number of HSP's successfully gapped: 58
Length of query: 841
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 736
Effective length of database: 6,280,432
Effective search space: 4622397952
Effective search space used: 4622397952
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.0 bits)