RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5806
(1332 letters)
>gnl|CDD|238060 cd00112, LDLa, Low Density Lipoprotein Receptor Class A domain, a
cysteine-rich repeat that plays a central role in
mammalian cholesterol metabolism; the receptor protein
binds LDL and transports it into cells by endocytosis;
7 successive cysteine-rich repeats of about 40 amino
acids are present in the N-terminal of this multidomain
membrane protein; other homologous domains occur in
related receptors, including the very low-density
lipoprotein receptor and the LDL receptor-related
protein/alpha 2-macroglobulin receptor, and in proteins
which are functionally unrelated, such as the C9
component of complement; the binding of calcium is
required for in vitro formation of the native disulfide
isomer and is necessary in establishing and maintaining
the modular structure.
Length = 35
Score = 63.8 bits (156), Expect = 5e-13
Identities = 19/35 (54%), Positives = 22/35 (62%)
Query: 59 CSPDEHRCTTGHCILKTWLCDGIPDCSLGEDERNC 93
C P+E RC G CI +W+CDG DC G DE NC
Sbjct: 1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35
Score = 44.1 bits (105), Expect = 5e-06
Identities = 15/30 (50%), Positives = 17/30 (56%), Gaps = 1/30 (3%)
Query: 21 KCNRTGACIPAQWQCDNEFDCEMGEDEMKC 50
+C G CIP+ W CD E DC G DE C
Sbjct: 7 RC-ANGRCIPSSWVCDGEDDCGDGSDEENC 35
Score = 31.4 bits (72), Expect = 0.15
Identities = 9/26 (34%), Positives = 13/26 (50%), Gaps = 3/26 (11%)
Query: 135 CFSKKYLCDGKKHCPRGEEGPDERHC 160
C ++CDG+ C G DE +C
Sbjct: 13 CIPSSWVCDGEDDCGDGS---DEENC 35
>gnl|CDD|200964 pfam00057, Ldl_recept_a, Low-density lipoprotein receptor domain
class A.
Length = 37
Score = 60.0 bits (146), Expect = 1e-11
Identities = 20/37 (54%), Positives = 25/37 (67%)
Query: 57 KKCSPDEHRCTTGHCILKTWLCDGIPDCSLGEDERNC 93
C PDE +C +G CI +W+CDG PDC G DE+NC
Sbjct: 1 STCGPDEFQCGSGECIPMSWVCDGDPDCEDGSDEKNC 37
Score = 41.5 bits (98), Expect = 4e-05
Identities = 15/30 (50%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 21 KCNRTGACIPAQWQCDNEFDCEMGEDEMKC 50
+C +G CIP W CD + DCE G DE C
Sbjct: 9 QC-GSGECIPMSWVCDGDPDCEDGSDEKNC 37
>gnl|CDD|197566 smart00192, LDLa, Low-density lipoprotein receptor domain class
A. Cysteine-rich repeat in the low-density lipoprotein
(LDL) receptor that plays a central role in mammalian
cholesterol metabolism. The N-terminal type A repeats
in LDL receptor bind the lipoproteins. Other homologous
domains occur in related receptors, including the very
low-density lipoprotein receptor and the LDL
receptor-related protein/alpha 2-macroglobulin
receptor, and in proteins which are functionally
unrelated, such as the C9 component of complement.
Mutations in the LDL receptor gene cause familial
hypercholesterolemia.
Length = 33
Score = 56.9 bits (138), Expect = 1e-10
Identities = 16/33 (48%), Positives = 20/33 (60%)
Query: 58 KCSPDEHRCTTGHCILKTWLCDGIPDCSLGEDE 90
C P E +C G CI +W+CDG+ DC G DE
Sbjct: 1 TCPPGEFQCDNGRCIPSSWVCDGVDDCGDGSDE 33
Score = 40.7 bits (96), Expect = 8e-05
Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 17 PQYKKCNRTGACIPAQWQCDNEFDCEMGEDE 47
P +C G CIP+ W CD DC G DE
Sbjct: 4 PGEFQC-DNGRCIPSSWVCDGVDDCGDGSDE 33
>gnl|CDD|214531 smart00135, LY, Low-density lipoprotein-receptor YWTD domain. Type
"B" repeats in low-density lipoprotein (LDL) receptor
that plays a central role in mammalian cholesterol
metabolism. Also present in a variety of molecules
similar to gp300/megalin.
Length = 43
Score = 56.1 bits (136), Expect = 4e-10
Identities = 18/43 (41%), Positives = 28/43 (65%)
Query: 284 KVILTENIFWPNGLTIDFEEDRLYWTEAKHHFIESSDLDGSRR 326
+ +L+ + PNGL +D+ E RLYWT+ IE ++LDG+ R
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
Score = 50.3 bits (121), Expect = 4e-08
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 774 ERLVVNDLNRTESIAVDAIGRKIYWTDMNAQTIMVSDIDGKN 815
L+ + L +AVD I ++YWTD I V+++DG N
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42
Score = 49.9 bits (120), Expect = 6e-08
Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 240 KILIANNIDKPRAVIVHPRDAFVFWTDWGKTPKIERVEMDGSNR 283
+ L+++ + P + V + ++WTDWG IE +DG+NR
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWG-LDVIEVANLDGTNR 43
Score = 49.1 bits (118), Expect = 8e-08
Identities = 17/39 (43%), Positives = 25/39 (64%)
Query: 504 VIRHNTKKPAGVAIDWRTNKLYWTESEYNWIEVSNLDGT 542
++ P G+A+DW +LYWT+ + IEV+NLDGT
Sbjct: 3 LLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGT 41
Score = 49.1 bits (118), Expect = 9e-08
Identities = 21/41 (51%), Positives = 24/41 (58%)
Query: 589 HILASDSLKNPTGLAIDLENSRLYWANVGLKTIEFTNLDGT 629
L S L +P GLA+D RLYW + GL IE NLDGT
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGT 41
Score = 48.0 bits (115), Expect = 3e-07
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 202 TGMQSVGSLSVDWVHDFVFWTDTAARRVEFCDLDGKNR 239
+G+ L+VDW+ ++WTD +E +LDG NR
Sbjct: 6 SGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
Score = 47.6 bits (114), Expect = 3e-07
Identities = 15/42 (35%), Positives = 22/42 (52%)
Query: 1173 VLVEEDLAFPNELAIDFKQRRLFWADSTNKRIEYCDFFGRSR 1214
L+ L PN LA+D+ + RL+W D IE + G +R
Sbjct: 2 TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
Score = 44.1 bits (105), Expect = 5e-06
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 550 QNLSEPRDIAVNPIGGIMYWCSWGDLHKIEMAGMDGSHR 588
L P +AV+ I G +YW WG L IE+A +DG++R
Sbjct: 6 SGLGHPNGLAVDWIEGRLYWTDWG-LDVIEVANLDGTNR 43
Score = 40.7 bits (96), Expect = 9e-05
Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 3/43 (6%)
Query: 327 QQVITKGLPHPFGITL--YEDFVFWTDWHTKSISSANKYTGRN 367
+ +++ GL HP G+ + E ++WTDW I AN G N
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVAN-LDGTN 42
Score = 39.9 bits (94), Expect = 2e-04
Identities = 12/42 (28%), Positives = 15/42 (35%)
Query: 732 VLPLRQLKMVASVDVDTKNEYIYWSDISEKTIERVRFDMTGR 773
L L + VD +YW+D IE D T R
Sbjct: 2 TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
Score = 38.7 bits (91), Expect = 4e-04
Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 5/42 (11%)
Query: 823 NLYRPRSIVVHYGLGLMVWADWSRTRLTNNRIEMAHMDGTNR 864
L P + V + G + W DW + IE+A++DGTNR
Sbjct: 7 GLGHPNGLAVDWIEGRLYWTDWGL-----DVIEVANLDGTNR 43
Score = 36.4 bits (85), Expect = 0.003
Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 864 RAVFETEVIWPSCLAIDYSDNPKLYWVDTSKHTIE 898
R + + + P+ LA+D+ + +LYW D IE
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEG-RLYWTDWGLDVIE 34
Score = 33.7 bits (78), Expect = 0.024
Identities = 9/30 (30%), Positives = 15/30 (50%)
Query: 1093 NFVIDWVANNIYYIDSQMHTINVARSDGQH 1122
+DW+ +Y+ D + I VA DG +
Sbjct: 13 GLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42
Score = 33.7 bits (78), Expect = 0.028
Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 6/49 (12%)
Query: 1123 KKILVNDLMEPLAIAVYPRRGLLFYSHWGLYDNSPTTKIEKVYLDGSYR 1171
+ +L + L P +AV G L+++ WGL IE LDG+ R
Sbjct: 1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGL------DVIEVANLDGTNR 43
Score = 33.3 bits (77), Expect = 0.034
Identities = 10/41 (24%), Positives = 16/41 (39%)
Query: 460 VLPVMNVKSVVAIAWESANDSLYWADVENGEINRAHINGTF 500
L + +A + LYW D I A+++GT
Sbjct: 2 TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42
Score = 30.6 bits (70), Expect = 0.37
Identities = 11/29 (37%), Positives = 14/29 (48%), Gaps = 2/29 (6%)
Query: 915 SHPYTLTV--LDYYVYWTDVQHSKIYRAN 941
HP L V ++ +YWTD I AN
Sbjct: 9 GHPNGLAVDWIEGRLYWTDWGLDVIEVAN 37
>gnl|CDD|215683 pfam00058, Ldl_recept_b, Low-density lipoprotein receptor repeat
class B. This domain is also known as the YWTD motif
after the most conserved region of the repeat. The YWTD
repeat is found in multiple tandem repeats and has been
predicted to form a beta-propeller structure.
Length = 42
Score = 54.1 bits (131), Expect = 2e-09
Identities = 13/40 (32%), Positives = 26/40 (65%)
Query: 261 FVFWTDWGKTPKIERVEMDGSNRKVILTENIFWPNGLTID 300
++WTD I +++GS+R+ + +E++ WPNG+ +D
Sbjct: 2 RLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVD 41
Score = 46.8 bits (112), Expect = 6e-07
Identities = 14/39 (35%), Positives = 22/39 (56%)
Query: 567 MYWCSWGDLHKIEMAGMDGSHRHILASDSLKNPTGLAID 605
+YW I +A ++GS R L S+ L+ P G+A+D
Sbjct: 3 LYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVD 41
Score = 42.9 bits (102), Expect = 1e-05
Identities = 14/42 (33%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 794 RKIYWTDMNAQ-TIMVSDIDGKNAKVLFWLNLYRPRSIVVHY 834
++YWTD + + +I V+D++G + + LF +L P I V
Sbjct: 1 GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42
Score = 42.5 bits (101), Expect = 2e-05
Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 304 DRLYWTEAK-HHFIESSDLDGSRRQQVITKGLPHPFGITLYE 344
RLYWT++ I +DL+GS R+ + ++ L P GI +
Sbjct: 1 GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42
Score = 41.4 bits (98), Expect = 5e-05
Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 217 DFVFWTDTAARRVEFC-DLDGKNRKILIANNIDKPRAVIVHP 257
++WTD++ R DL+G +R+ L + ++ P + V P
Sbjct: 1 GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42
Score = 36.8 bits (86), Expect = 0.002
Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 2/42 (4%)
Query: 1101 NNIYYID-SQMHTINVARSDGQHKKILV-NDLMEPLAIAVYP 1140
+Y+ D S +I+VA +G ++ L DL P IAV P
Sbjct: 1 GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42
Score = 35.6 bits (83), Expect = 0.005
Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 481 LYWADVENGE-INRAHINGTFQYNVIRHNTKKPAGVAIDW 519
LYW D I+ A +NG+ + + + + P G+A+D
Sbjct: 3 LYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42
Score = 35.6 bits (83), Expect = 0.006
Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 1/41 (2%)
Query: 751 EYIYWSDIS-EKTIERVRFDMTGRERLVVNDLNRTESIAVD 790
+YW+D S +I + + R L DL IAVD
Sbjct: 1 GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVD 41
Score = 34.5 bits (80), Expect = 0.016
Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 522 NKLYWTESEYNW-IEVSNLDGTYPTLLFWQNLSEPRDIAVNP 562
+LYWT+S I V++L+G+ LF ++L P IAV+P
Sbjct: 1 GRLYWTDSSLRASISVADLNGSDRRTLFSEDLQWPNGIAVDP 42
Score = 31.0 bits (71), Expect = 0.27
Identities = 6/27 (22%), Positives = 13/27 (48%)
Query: 345 DFVFWTDWHTKSISSANKYTGRNIKTV 371
++WTD ++ S G + +T+
Sbjct: 1 GRLYWTDSSLRASISVADLNGSDRRTL 27
Score = 29.1 bits (66), Expect = 1.1
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 5/46 (10%)
Query: 1143 GLLFYSHWGLYDNSPTTKIEKVYLDGSYRTVLVEEDLAFPNELAID 1188
G L+++ L I L+GS R L EDL +PN +A+D
Sbjct: 1 GRLYWTDSSL-----RASISVADLNGSDRRTLFSEDLQWPNGIAVD 41
>gnl|CDD|215497 PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family
protein.
Length = 1057
Score = 41.8 bits (98), Expect = 0.003
Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 6/69 (8%)
Query: 1144 LLFYSHWGLYDNSPT-TKIEKVYLDGSYRTVLVEEDLAFPNELAIDFKQRRLFWADSTNK 1202
L +Y L D++P +EK D R L+ L FP +LAID RLF +DS +
Sbjct: 536 LQYYGEKKLLDSTPLPLSLEK---DNDPR--LLTSPLKFPGKLAIDLLNNRLFISDSNHN 590
Query: 1203 RIEYCDFFG 1211
RI D G
Sbjct: 591 RIVVTDLDG 599
Score = 37.1 bits (86), Expect = 0.086
Identities = 21/64 (32%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 270 TPKIERVEMDGSNRKVILTENIFWPNGLTIDFEEDRLYWTEAKHHFIESSDLDGSRRQQV 329
TP +E D R +LT + +P L ID +RL+ +++ H+ I +DLDG+ Q+
Sbjct: 548 TPLPLSLEKDNDPR--LLTSPLKFPGKLAIDLLNNRLFISDSNHNRIVVTDLDGNFIVQI 605
Query: 330 ITKG 333
+ G
Sbjct: 606 GSTG 609
Score = 32.5 bits (74), Expect = 1.8
Identities = 17/38 (44%), Positives = 23/38 (60%)
Query: 591 LASDSLKNPTGLAIDLENSRLYWANVGLKTIEFTNLDG 628
L + LK P LAIDL N+RL+ ++ I T+LDG
Sbjct: 562 LLTSPLKFPGKLAIDLLNNRLFISDSNHNRIVVTDLDG 599
Score = 32.5 bits (74), Expect = 1.9
Identities = 35/120 (29%), Positives = 51/120 (42%), Gaps = 36/120 (30%)
Query: 510 KKPAGVAIDWRTNKLYWTESEYNWIEVSNLDGTYPTLLFWQNLSEPRDIAVNPIGGIMYW 569
K P +AID N+L+ ++S +N I V++LDG + I+
Sbjct: 568 KFPGKLAIDLLNNRLFISDSNHNRIVVTDLDGNF----------------------IVQI 605
Query: 570 CSWGDLHKIEMAGMDGSHRHILASDSLKN-PTGLAIDLENSRLYWA---NVGLKTIEFTN 625
S G E DGS D+ N P GLA + + + LY A N L+ I+F N
Sbjct: 606 GSTG-----EEGLRDGSF-----EDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVN 655
>gnl|CDD|225921 COG3386, COG3386, Gluconolactonase [Carbohydrate transport and
metabolism].
Length = 307
Score = 39.3 bits (92), Expect = 0.012
Identities = 28/124 (22%), Positives = 39/124 (31%), Gaps = 19/124 (15%)
Query: 767 RFDMTGR-ERLVVNDLNRTESIAVDAIGRKIYWTDMNAQTIMVSD-------IDGKNAKV 818
R D G RL+ +DL +A G+ +Y D A I D I G+ V
Sbjct: 147 RVDPDGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFV 206
Query: 819 LFWLNLYRPRSIVV---HYGLGLMVWADWSRTRLTNNRIEMAHMDGTNRAVFETEVIWPS 875
F P + V VW R+ + DG + V P+
Sbjct: 207 DFDEEPGLPDGMAVDADGNLWVAAVWGGG--------RVVRFNPDGKLLGEIKLPVKRPT 258
Query: 876 CLAI 879
A
Sbjct: 259 NPAF 262
Score = 37.4 bits (87), Expect = 0.038
Identities = 31/148 (20%), Positives = 49/148 (33%), Gaps = 24/148 (16%)
Query: 222 TDTAARRVEFCDLDGKNRKILIANNIDKPRAVIVHPRDAFVFWT--------DWGK-TPK 272
DT + + + + N+ V P F + T
Sbjct: 92 PDTGGKITLLAEPE-DGLPLNRPNDGV------VDPDGRIWFGDMGYFDLGKSEERPTGS 144
Query: 273 IERVEMDGSNRKVILTENIFWPNGLTIDFEEDRLYWTEAKHHFIESSDLD------GSRR 326
+ RV+ DG + +L +++ PNGL + LY + + I DLD G RR
Sbjct: 145 LYRVDPDGGVVR-LLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRR 203
Query: 327 QQVITKGLP-HPFGITLYEDFVFWTDWH 353
V P P G+ + D W
Sbjct: 204 GFVDFDEEPGLPDGMAVDADGNLWVAAV 231
Score = 33.9 bits (78), Expect = 0.54
Identities = 19/75 (25%), Positives = 29/75 (38%), Gaps = 3/75 (4%)
Query: 1138 VYPRRGLLFYSHWGLYDNSPTTK-IEKVY-LDGSYRTV-LVEEDLAFPNELAIDFKQRRL 1194
V P + F + +Y +D V L+++DL PN LA + L
Sbjct: 118 VDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVVRLLDDDLTIPNGLAFSPDGKTL 177
Query: 1195 FWADSTNKRIEYCDF 1209
+ AD+ RI D
Sbjct: 178 YVADTPANRIHRYDL 192
>gnl|CDD|225926 COG3391, COG3391, Uncharacterized conserved protein [Function
unknown].
Length = 381
Score = 38.0 bits (88), Expect = 0.032
Identities = 33/134 (24%), Positives = 55/134 (41%), Gaps = 4/134 (2%)
Query: 497 NGTFQYNVIRHNTKKPAGVAIDWRTNKLYWTESEYNWIEVSNLDGTYPTLLFWQNLSEPR 556
GT + + P GVA++ ++Y S N + V + T P
Sbjct: 18 TGTNKVTAAISLGRGPGGVAVNPDGTQVYVANSGSNDVSVIDATSNTVTQSLSVGGVYPA 77
Query: 557 DIAVNPIGGIMYWCSWGDLHKIEMAGMDGSHRHILASDSL-KNPTGLAIDLENSRLYWAN 615
+AVNP G +Y + GD + + +D + +L S + P GLA+D + +Y AN
Sbjct: 78 GVAVNPAGNKVYVTT-GDSNTVS--VIDTATNTVLGSIPVGLGPVGLAVDPDGKYVYVAN 134
Query: 616 VGLKTIEFTNLDGT 629
G + +D
Sbjct: 135 AGNGNNTVSVIDAA 148
Score = 35.3 bits (81), Expect = 0.20
Identities = 37/129 (28%), Positives = 52/129 (40%), Gaps = 24/129 (18%)
Query: 455 PTFETVLPVMNV-KSVVAIAWESANDSLYWADVENGEINRAHINGTFQYNVIRHNTKK-- 511
TVL + V V +A + +Y A+ NG N T +VI T K
Sbjct: 103 TATNTVLGSIPVGLGPVGLAVDPDGKYVYVANAGNG-------NNT--VSVIDAATNKVT 153
Query: 512 --------PAGVAIDWRTNKLYWTESEYNWIEVSNLDGTYPTLLFWQNL----SEPRDIA 559
P GVA+D NK+Y T S+ N + V + G +L + P IA
Sbjct: 154 ATIPVGNTPTGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVVRGSVGSLVGVGTGPAGIA 213
Query: 560 VNPIGGIMY 568
V+P G +Y
Sbjct: 214 VDPDGNRVY 222
Score = 32.2 bits (73), Expect = 2.1
Identities = 36/185 (19%), Positives = 61/185 (32%), Gaps = 24/185 (12%)
Query: 740 MVASVDVDTKNEYIYWSDISEKTIERVRFDMTGRERLVVNDLNRT-ESIAVDAIGRKIYW 798
+ VD +Y+Y ++ D + + T +AVD G K+Y
Sbjct: 117 GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTATIPVGNTPTGVAVDPDGNKVYV 176
Query: 799 TDMNAQTIMVSDIDGKNAKVLFWLNLY----RPRSIVVHYGLGLMVWADWSRTRLTNNRI 854
T+ + T+ V D G + +L P I V + A+ +I
Sbjct: 177 TNSDDNTVSVIDTSGNSVVRGSVGSLVGVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKI 236
Query: 855 EMAHMDGTNRAVFETEVI----WPSCLAID------YSDN---PKLYWVDTSKHTIEYKT 901
D V T++ P +A+D Y N + +D + + KT
Sbjct: 237 -----DTATGNVTATDLPVGSGAPRGVAVDPAGKAAYVANSQGGTVSVIDGATDRV-VKT 290
Query: 902 LATGR 906
TG
Sbjct: 291 GPTGN 295
Score = 30.3 bits (68), Expect = 7.2
Identities = 18/78 (23%), Positives = 27/78 (34%), Gaps = 1/78 (1%)
Query: 742 ASVDVDTKNEYIYWSDISEKTIERVRFDMTGRERLVVNDLNRTESIAVDAIGRKIYWTDM 801
A V V+ +Y + T+ V T + +AVD G+ +Y +
Sbjct: 77 AGVAVNPAGNKVYVTTGDSNTVS-VIDTATNTVLGSIPVGLGPVGLAVDPDGKYVYVANA 135
Query: 802 NAQTIMVSDIDGKNAKVL 819
VS ID KV
Sbjct: 136 GNGNNTVSVIDAATNKVT 153
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 33.3 bits (77), Expect = 0.036
Identities = 12/28 (42%), Positives = 14/28 (50%), Gaps = 2/28 (7%)
Query: 972 CGSNNGGCSH--LCLRNPTNFTCACPTG 997
C NNGGC C +FTC C +G
Sbjct: 1 CAENNGGCHPNATCTNTGGSFTCTCKSG 28
>gnl|CDD|219847 pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region. This family
describes a region that is found in proteins expressed by
a variety of eukaryotic and prokaryotic species. These
proteins include various enzymes, such as senescence
marker protein 30 (SMP-30), gluconolactonase and
luciferin-regenerating enzyme (LRE). SMP-30 is known to
hydrolyse diisopropyl phosphorofluoridate in the liver,
and has been noted as having sequence similarity, in the
region described in this family, with PON1 and LRE.
Length = 245
Score = 36.1 bits (84), Expect = 0.087
Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 1153 YDNSPTTKIEKVY-LDGSYRTVLVEEDLAFPNELAIDFKQRRLFWADSTNKRIEYCDF 1209
+D +P + +Y LD + V + + N LA + L++ADS +RI D+
Sbjct: 105 FDIAPGGEPGALYRLDPDGKVERVLDGITISNGLAWSPDGKTLYFADSPTRRIWAFDY 162
Score = 33.0 bits (76), Expect = 0.88
Identities = 33/160 (20%), Positives = 51/160 (31%), Gaps = 21/160 (13%)
Query: 474 WESANDSLYWADVENGEINRAHINGTFQYNVIRHNTKKPAGVAIDWRTNKLYWTESEYNW 533
W+ +LYW D+ G I+R + P G +L
Sbjct: 7 WDEEEGALYWVDILGGRIHRLDPATG---KETVWDLPGPVGAIALRDDGRLIVALK--RG 61
Query: 534 IEVSNLDGTYPTLLFWQNLSEPR----DIAVNPIGGIMYWCSWGDLHKIEMAG------- 582
+ + +LD T L EP D V+P G W G + G
Sbjct: 62 LALLDLDTGELTTLADLEPDEPLNRFNDGKVDPDGRF--WF--GTMGFDIAPGGEPGALY 117
Query: 583 -MDGSHRHILASDSLKNPTGLAIDLENSRLYWANVGLKTI 621
+D + D + GLA + LY+A+ + I
Sbjct: 118 RLDPDGKVERVLDGITISNGLAWSPDGKTLYFADSPTRRI 157
Score = 33.0 bits (76), Expect = 0.92
Identities = 11/52 (21%), Positives = 22/52 (42%)
Query: 767 RFDMTGRERLVVNDLNRTESIAVDAIGRKIYWTDMNAQTIMVSDIDGKNAKV 818
R D G+ V++ + + +A G+ +Y+ D + I D D +
Sbjct: 118 RLDPDGKVERVLDGITISNGLAWSPDGKTLYFADSPTRRIWAFDYDADGGLI 169
>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like. cEGF, or complement
Clr-like EGF, domains have six conserved cysteine
residues disulfide-bonded into the characteristic
pattern 'ababcc'. They are found in blood coagulation
proteins such as fibrillin, Clr and Cls, thrombomodulin,
and the LDL receptor. The core fold of the EGF domain
consists of two small beta-hairpins packed against each
other. Two major structural variants have been
identified based on the structural context of the
C-terminal cysteine residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
C-terminal thiol resides on the C-terminal beta-sheet,
resulting in long loop-lengths between the cysteine
residues of disulfide 'c', typically C[10+]XC. These
longer loop-lengths may have arisen by selective
cysteine loss from a four-disulfide EGF template such as
laminin or integrin. Tandem cEGF domains have five
linking residues between terminal cysteines of adjacent
domains. cEGF domains may or may not bind calcium in the
linker region. cEGF domains with the consensus motif
CXN4X[F,Y]XCXC are hydroxylated exclusively on the
asparagine residue.
Length = 24
Score = 31.7 bits (73), Expect = 0.096
Identities = 9/18 (50%), Positives = 12/18 (66%)
Query: 412 FTCQCPLGLTLSPDNKSC 429
+TC CP G LS D ++C
Sbjct: 2 YTCSCPPGYQLSGDGRTC 19
Score = 29.7 bits (68), Expect = 0.53
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 990 FTCACPTGILLSADRRSC 1007
+TC+CP G LS D R+C
Sbjct: 2 YTCSCPPGYQLSGDGRTC 19
>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
extracellular matrix molecules mediate cell-matrix and
matrix-matrix interactions thereby providing tissue
integrity. Some members of the matrilin family are
expressed specifically in developing cartilage
rudiments. The matrilin family consists of at least four
members. All the members of the matrilin family contain
VWA domains, EGF-like domains and a heptad repeat
coiled-coiled domain at the carboxy terminus which is
responsible for the oligomerization of the matrilins.
The VWA domains have been shown to be essential for
matrilin network formation by interacting with matrix
ligands.
Length = 224
Score = 35.1 bits (81), Expect = 0.17
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 396 CAPKVCSHICLPNKHRFTCQCPLGLTLSPDNKS 428
VC +C+ + C C G L DNK+
Sbjct: 192 TLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224
Score = 33.5 bits (77), Expect = 0.56
Identities = 11/38 (28%), Positives = 20/38 (52%)
Query: 969 PNACGSNNGGCSHLCLRNPTNFTCACPTGILLSADRRS 1006
P+ C + + C +C+ P ++ CAC G L D ++
Sbjct: 187 PDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar, but
has 8 instead of 6 conserved cysteines. Includes some
cytokine receptors. The EGF domain misses the N-terminus
regions of the Ca2+ binding EGF domains (this is the
main reason of discrepancy between swiss-prot domain
start/end and Pfam). The family is hard to model due to
many similar but different sub-types of EGF domains.
Pfam certainly misses a number of EGF domains.
Length = 32
Score = 29.3 bits (66), Expect = 0.74
Identities = 12/28 (42%), Positives = 15/28 (53%), Gaps = 3/28 (10%)
Query: 972 CGSNNGGCSH--LCLRNPTNFTCACPTG 997
C NN CS+ C+ P +TC CP G
Sbjct: 1 CSPNNP-CSNGGTCVDTPGGYTCECPEG 27
Score = 28.2 bits (63), Expect = 1.8
Identities = 11/29 (37%), Positives = 16/29 (55%), Gaps = 3/29 (10%)
Query: 396 CAPKV-CSH--ICLPNKHRFTCQCPLGLT 421
C+P CS+ C+ +TC+CP G T
Sbjct: 1 CSPNNPCSNGGTCVDTPGGYTCECPEGYT 29
>gnl|CDD|233685 TIGR02013, rpoB, DNA-directed RNA polymerase, beta subunit. This
model describes orthologs of the beta subunit of
Bacterial RNA polymerase. The core enzyme consists of
two alpha chains, one beta chain, and one beta' subunit
[Transcription, DNA-dependent RNA polymerase].
Length = 1065
Score = 33.1 bits (76), Expect = 1.3
Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 7/80 (8%)
Query: 724 DVEYRYPIVLPLRQLKMVASVDVDTKNEYIYWSDISEKTIERVRFDMTGRERLVVNDLNR 783
+ Y P+ + LR + + K + +Y DI T +R F + G ER+VV+ L+R
Sbjct: 86 GLTYSVPLKVKLRLINKEEDGTKEIKEQEVYMGDIPLMT-DRGTFIINGAERVVVSQLHR 144
Query: 784 T------ESIAVDAIGRKIY 797
+ G+ +Y
Sbjct: 145 SPGVFFSSEKDTHKNGKVLY 164
>gnl|CDD|227604 COG5279, CYK3, Uncharacterized protein involved in cytokinesis,
contains TGc (transglutaminase/protease-like) domain
[Cell division and chromosome partitioning].
Length = 521
Score = 33.0 bits (75), Expect = 1.3
Identities = 21/99 (21%), Positives = 30/99 (30%), Gaps = 9/99 (9%)
Query: 1029 LDTADLLPVTLPFPEYMSSIFFDYHYSKNLIYFA---DMRSGNLRTFDMSDSTRIKP--- 1082
D + +L SS+ + N + NLR F + KP
Sbjct: 94 RSAMDNVLQSLNAKGKKSSLRLTNSTTDNTKFDFIPIPEAFDNLRRFQNAPGPSPKPLAT 153
Query: 1083 --IPLMNDTIRDNFVI-DWVANNIYYIDSQMHTINVARS 1118
P VI DW+ N YID + N+
Sbjct: 154 IIRPSTAQLYPKPPVIYDWIVVNTSYIDKMFESRNIQDP 192
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 28.6 bits (64), Expect = 1.6
Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 5/37 (13%)
Query: 396 CAPK-VCSH-ICLPNKHRFTCQCPLGLTLSPDNKSCS 430
CA CS+ C+ +TC CP G T +K C
Sbjct: 2 CASGGPCSNGTCINTPGSYTCSCPPGYTG---DKRCE 35
Score = 28.6 bits (64), Expect = 1.7
Identities = 10/25 (40%), Positives = 17/25 (68%), Gaps = 1/25 (4%)
Query: 974 SNNGGCSH-LCLRNPTNFTCACPTG 997
++ G CS+ C+ P ++TC+CP G
Sbjct: 3 ASGGPCSNGTCINTPGSYTCSCPPG 27
>gnl|CDD|132976 cd06645, STKc_MAP4K3, Catalytic domain of the Protein
Serine/Threonine Kinase, Mitogen-activated protein
kinase kinase kinase kinase 3. Serine/threonine kinases
(STKs), mitogen-activated protein kinase (MAPK) kinase
kinase kinase 3 (MAPKKKK3 or MAP4K3) subfamily,
catalytic (c) domain. STKs catalyze the transfer of the
gamma-phosphoryl group from ATP to serine/threonine
residues on protein substrates. The MAP4K3 subfamily is
part of a larger superfamily that includes the catalytic
domains of other protein STKs, protein tyrosine kinases,
RIO kinases, aminoglycoside phosphotransferase, choline
kinase, and phosphoinositide 3-kinase. Members of this
subfamily contain an N-terminal catalytic domain and a
C-terminal citron homology (CNH) regulatory domain,
similar to MAP4K4/6. MAP4Ks are involved in some MAPK
signaling pathways that are important in mediating
cellular responses to extracellular signals by
activating a MAPK kinase kinase (MAPKKK or MAP3K or
MKKK). Each MAPK cascade is activated either by a small
GTP-binding protein or by an adaptor protein, which
transmits the signal either directly to a MAP3K to start
the triple kinase core cascade or indirectly through a
mediator kinase, a MAP4K. MAP4K3 plays a role in the
nutrient-responsive pathway of mTOR (mammalian target of
rapamycin) signaling. MAP4K3 is required in the
activation of S6 kinase by amino acids and for the
phosphorylation of the mTOR-regulated inhibitor of
eukaryotic initiation factor 4E. mTOR regulates ribosome
biogenesis and protein translation, and is frequently
deregulated in cancer.
Length = 267
Score = 31.2 bits (70), Expect = 3.2
Identities = 14/53 (26%), Positives = 29/53 (54%), Gaps = 5/53 (9%)
Query: 286 ILTENIFWPNGLTIDFEEDRLYWTEAKHHFIESSDLDGSRRQQVITKGLPHPF 338
++T++ F P L +D++ W+ + HHF++ + +++ K L HPF
Sbjct: 220 LMTKSNFQPPKL-----KDKMKWSNSFHHFVKMALTKNPKKRPTAEKLLQHPF 267
>gnl|CDD|215623 PLN03190, PLN03190, aminophospholipid translocase; Provisional.
Length = 1178
Score = 31.4 bits (71), Expect = 4.3
Identities = 19/73 (26%), Positives = 30/73 (41%), Gaps = 19/73 (26%)
Query: 472 IAWESANDSLY---WADVENGEINRAHINGTFQYNVIRHNTKKPAGVAIDWRTNKLYWTE 528
+AW +D W D+ GEI + N T +++ +T P GVA
Sbjct: 172 LAWVLVDDQFQEKKWKDIRVGEIIKIQANDTLPCDMVLLSTSDPTGVA------------ 219
Query: 529 SEYNWIEVSNLDG 541
+++ NLDG
Sbjct: 220 ----YVQTINLDG 228
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 27.1 bits (60), Expect = 5.3
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 6/37 (16%)
Query: 396 CA-PKVCSH--ICLPNKHRFTCQCPLGLTLSPDNKSC 429
CA CS+ C+ + C CP G T ++SC
Sbjct: 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTG---DRSC 35
Score = 27.1 bits (60), Expect = 6.2
Identities = 8/26 (30%), Positives = 14/26 (53%), Gaps = 2/26 (7%)
Query: 974 SNNGGCSH--LCLRNPTNFTCACPTG 997
+ + CS+ C+ P ++ C CP G
Sbjct: 3 AASNPCSNGGTCVNTPGSYRCVCPPG 28
>gnl|CDD|143417 cd07099, ALDH_DDALDH, Methylomonas sp.
4,4'-diapolycopene-dialdehyde dehydrogenase-like. The
4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH)
involved in C30 carotenoid synthesis in Methylomonas sp.
strain 16a and other similar sequences are present in
this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde
into 4,4'-diapolycopene-diacid.
Length = 453
Score = 30.3 bits (69), Expect = 7.1
Identities = 18/43 (41%), Positives = 21/43 (48%), Gaps = 10/43 (23%)
Query: 456 TFETVLPVMNVKSV-VAIAWESANDSLY------WA-DVENGE 490
TF VLPVM V AIA ANDS Y ++ D+ E
Sbjct: 358 TFGPVLPVMPVADEDEAIAL--ANDSRYGLSASVFSRDLARAE 398
>gnl|CDD|119132 pfam10612, Spore-coat_CotZ, Spore coat protein Z. This family has
members annotated as Spore coat protein Z, otherwise
known as CotZ, It is a cysteine-rich spore coat family,
and along with CotY is necessary for assembly of intact
exosporium.
Length = 154
Score = 29.3 bits (66), Expect = 8.2
Identities = 16/59 (27%), Positives = 22/59 (37%), Gaps = 11/59 (18%)
Query: 972 CGSNNGGCSHLCLRNPTNF--------TCACPTGI---LLSADRRSCFSRTREFLLYTS 1019
CG + S C+ + F CPT LL + + TR F+LYT
Sbjct: 3 CGKHEDHGSSNCVCDVVRFINDLQDAAEEDCPTSCFSPLLGPSNSASLADTRPFVLYTK 61
>gnl|CDD|181983 PRK09603, PRK09603, bifunctional DNA-directed RNA polymerase
subunit beta/beta'; Reviewed.
Length = 2890
Score = 30.7 bits (69), Expect = 8.7
Identities = 16/41 (39%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 747 DTKNEYIYWSDISEKTIERVRFDMTGRERLVVNDLNRTESI 787
D K + I+ +I T ER F + G ER+VVN L+R+ +
Sbjct: 119 DIKEQSIFIREIPLMT-ERTSFIINGVERVVVNQLHRSPGV 158
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.137 0.442
Gapped
Lambda K H
0.267 0.0701 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 67,811,812
Number of extensions: 6626487
Number of successful extensions: 4417
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4399
Number of HSP's successfully gapped: 68
Length of query: 1332
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1223
Effective length of database: 6,103,016
Effective search space: 7463988568
Effective search space used: 7463988568
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 65 (28.9 bits)