RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy519
(351 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 156 bits (397), Expect = 2e-45
Identities = 69/214 (32%), Positives = 103/214 (48%), Gaps = 5/214 (2%)
Query: 36 NLQSRYLPYYTSGHDMTVWSVEACSKG-LALTGSRDTTARLWSLEQMYPLRIFVGHKQDV 94
+L++ L GH V V A + G +GS D T RLW LE +R GH V
Sbjct: 37 DLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYV 96
Query: 95 TCARFHPKITYIGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGTG 154
+ F P + + S DK +++W + G + H V S+AFSP G ++AS
Sbjct: 97 SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVAS---S 153
Query: 155 SYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLVLTHELTTGRVIS 214
S D ++LW + G V + H V S+AFSP G+ L S DG + +L+TG+ +
Sbjct: 154 SQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLG 213
Query: 215 RFHLQEHEPVTSLVWSVTGETLVTGSLTGTVQFY 248
E V S+ +S G L +GS GT++ +
Sbjct: 214 TLRGHE-NGVNSVAFSPDGYLLASGSEDGTIRVW 246
Score = 147 bits (373), Expect = 5e-42
Identities = 72/203 (35%), Positives = 103/203 (50%), Gaps = 7/203 (3%)
Query: 48 GHDMTVWSVEACSKG-LALTGSRDTTARLWSLEQMYPLRIFVGHKQDVTCARFHPKITYI 106
GH V V G L TGS D T ++W LE LR GH V TY+
Sbjct: 7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYL 66
Query: 107 GTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAVRLWSIT 166
+GS DK +RLW + G VR + H S V S+AFSP G+ L+S S DK +++W +
Sbjct: 67 ASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSS---SSRDKTIKVWDVE 123
Query: 167 DGASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLVLTHELTTGRVISRFHLQEHE-PVT 225
G + H V S+AFSP G ++AS +DG + +L TG+ ++ H V
Sbjct: 124 TGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLT--GHTGEVN 181
Query: 226 SLVWSVTGETLVTGSLTGTVQFY 248
S+ +S GE L++ S GT++ +
Sbjct: 182 SVAFSPDGEKLLSSSSDGTIKLW 204
Score = 147 bits (373), Expect = 6e-42
Identities = 73/216 (33%), Positives = 107/216 (49%), Gaps = 9/216 (4%)
Query: 36 NLQSRYLPYYTSGHDMTVWSVEACS--KGLALTGSRDTTARLWSLEQMYPLRIFVGHKQD 93
+L++ +GH V SV A S + + SRD T ++W +E L GH
Sbjct: 79 DLETGECVRTLTGHTSYVSSV-AFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDW 137
Query: 94 VTCARFHPKITYIGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGT 153
V F P T++ + S D ++LW + G V + H V S+AFSP G+ L S +
Sbjct: 138 VNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSS 197
Query: 154 GSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLVLTHELTTGRVI 213
D ++LW ++ G + H + V S+AFSP G LASG EDG + +L TG +
Sbjct: 198 ---DGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECV 254
Query: 214 SRFHLQEHE-PVTSLVWSVTGETLVTGSLTGTVQFY 248
L H VTSL WS G+ L +GS GT++ +
Sbjct: 255 QT--LSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
Score = 127 bits (321), Expect = 2e-34
Identities = 57/167 (34%), Positives = 84/167 (50%), Gaps = 6/167 (3%)
Query: 83 PLRIFVGHKQDVTCARFHPKITYIGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFS 142
R GH VTC F P + TGS D +++W + G +R H PV +A S
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60
Query: 143 PCGKYLASGGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLV 202
G YLAS GS DK +RLW + G VR + H S V S+AFSP G+ L+S D +
Sbjct: 61 ADGTYLAS---GSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTI 117
Query: 203 LTHELTTGRVISRFHLQEHE-PVTSLVWSVTGETLVTGSLTGTVQFY 248
++ TG+ ++ + H V S+ +S G + + S GT++ +
Sbjct: 118 KVWDVETGKCLTTL--RGHTDWVNSVAFSPDGTFVASSSQDGTIKLW 162
Score = 90.5 bits (225), Expect = 8e-21
Identities = 41/129 (31%), Positives = 59/129 (45%), Gaps = 6/129 (4%)
Query: 125 SVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSI 184
R H V +AFSP GK LA TGS D +++W + G +R H PV +
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLA---TGSGDGTIKVWDLETGELLRTLKGHTGPVRDV 57
Query: 185 AFSPCGKYLASGGEDGLVLTHELTTGRVISRFHLQEHE-PVTSLVWSVTGETLVTGSLTG 243
A S G YLASG D + +L TG + H V+S+ +S G L + S
Sbjct: 58 AASADGTYLASGSSDKTIRLWDLETGECVRTLT--GHTSYVSSVAFSPDGRILSSSSRDK 115
Query: 244 TVQFYGWPS 252
T++ + +
Sbjct: 116 TIKVWDVET 124
Score = 87.4 bits (217), Expect = 1e-19
Identities = 43/119 (36%), Positives = 58/119 (48%), Gaps = 4/119 (3%)
Query: 47 SGHDMTVWSVEACSKGLAL-TGSRDTTARLWSLEQMYPLRIFVGHKQDVTCARFHPKITY 105
+GH V SV G L + S D T +LW L L GH+ V F P
Sbjct: 174 TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYL 233
Query: 106 IGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAVRLWS 164
+ +GS D +R+W + G V+ S H + V S+A+SP GK LAS GS D +R+W
Sbjct: 234 LASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLAS---GSADGTIRIWD 289
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 118 bits (295), Expect = 1e-29
Identities = 85/217 (39%), Positives = 115/217 (52%), Gaps = 11/217 (5%)
Query: 47 SGHDMTVWSVEACSKG--LALTGSRDTTARLWSLEQMYPLRIFVGHKQDVTCARFHPKIT 104
GH +V S+ G LA S D T +LW L PL GH V+ F P
Sbjct: 152 EGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGG 211
Query: 105 -YIGTGSYDKAVRLWSITDGASVR-VFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAVRL 162
I +GS D +RLW ++ G +R S H V+S +FSP G LASG S D +RL
Sbjct: 212 LLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVS-SFSPDGSLLASG---SSDGTIRL 267
Query: 163 WSI-TDGASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLVLTHELTTGRVISRFHLQEH 221
W + + + +R S H S VLS+AFSP GK LASG DG V +L TG+++S L+ H
Sbjct: 268 WDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGH 327
Query: 222 E-PVTSLVWSVTGETLVTG-SLTGTVQFYGWPSASKI 256
E PV+SL +S G LV+G S GT++ + + +
Sbjct: 328 EGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPL 364
Score = 110 bits (274), Expect = 7e-27
Identities = 68/221 (30%), Positives = 103/221 (46%), Gaps = 10/221 (4%)
Query: 35 KNLQSRYLPYYTSGHDMTVWSVEACSKG----LALTGSRDTTARLWSLEQMY-PLRIFVG 89
+ + + HD +V + S L + S D T +LW L +R G
Sbjct: 94 LDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEG 153
Query: 90 HKQDVTCARFHPKITYIGTGS-YDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYL 148
H + VT F P + +GS D ++LW + G + + H PV S+AFSP G L
Sbjct: 154 HSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLL 213
Query: 149 ASGGTGSYDKAVRLWSITDGASVR-VFSDHVSPVLSIAFSPCGKYLASGGEDGLVLTHEL 207
+ +GS D +RLW ++ G +R S H V+S +FSP G LASG DG + +L
Sbjct: 214 IA--SGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVS-SFSPDGSLLASGSSDGTIRLWDL 270
Query: 208 TTGRVISRFHLQEHEPVTSLVWSVTGETLVTGSLTGTVQFY 248
+ + R V S+ +S G+ L +GS GTV+ +
Sbjct: 271 RSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLW 311
Score = 107 bits (268), Expect = 4e-26
Identities = 74/205 (36%), Positives = 104/205 (50%), Gaps = 7/205 (3%)
Query: 47 SGHDMTVWSVEACSKGLALTGSRDTTARLWSLEQMY-PLRIFVGHKQDVTCARFHPKITY 105
SGH +V S + L +GS D T RLW L LR GH V F P
Sbjct: 239 SGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKL 298
Query: 106 IGTGSYDKAVRLWSITDGASVRVFSD--HVSPVLSIAFSPCGKYLASGGTGSYDKAVRLW 163
+ +GS D VRLW + G + + H PV S++FSP G L SGG S D +RLW
Sbjct: 299 LASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGG--SDDGTIRLW 356
Query: 164 SITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLVLTHELTTGRVISRFHLQEHEP 223
+ G ++ H S VLS++FSP G+ ++SG DG V +L+TG ++
Sbjct: 357 DLRTGKPLKTLEGH-SNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLD-GHTSR 414
Query: 224 VTSLVWSVTGETLVTGSLTGTVQFY 248
VTSL +S G++L +GS T++ +
Sbjct: 415 VTSLDFSPDGKSLASGSSDNTIRLW 439
Score = 100 bits (248), Expect = 2e-23
Identities = 68/209 (32%), Positives = 103/209 (49%), Gaps = 11/209 (5%)
Query: 48 GHDMTVWSVE-ACSKGLALTGSRDTTARLWSLEQMYP-LRIFVG-HKQDVTCARFHP--- 101
GH+ ++ S+ + L L+GS D T +LW L+ ++ G H V+
Sbjct: 63 GHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDG 122
Query: 102 KITYIGTGSYDKAVRLWSI-TDGASVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAV 160
+ + S D V+LW + T G +R H V S+AFSP GK LASG S D +
Sbjct: 123 NSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGS--SLDGTI 180
Query: 161 RLWSITDGASVRVFSDHVSPVLSIAFSPCGK-YLASGGEDGLVLTHELTTGRVISRFHLQ 219
+LW + G + + H PV S+AFSP G +ASG DG + +L+TG+++ R L
Sbjct: 181 KLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLL-RSTLS 239
Query: 220 EHEPVTSLVWSVTGETLVTGSLTGTVQFY 248
H +S G L +GS GT++ +
Sbjct: 240 GHSDSVVSSFSPDGSLLASGSSDGTIRLW 268
Score = 72.8 bits (177), Expect = 4e-14
Identities = 59/157 (37%), Positives = 75/157 (47%), Gaps = 18/157 (11%)
Query: 48 GHDMTVWSVEACSKG--LALTGSRDTTARLWSLEQMYPLRIFVGHKQDVTCARFHPKITY 105
GH+ V S+ G L GS D T RLW L PL+ GH V F P
Sbjct: 326 GHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSN-VLSVSFSPDGRV 384
Query: 106 IGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAVRLWSI 165
+ +GS D VRLW ++ G+ +R H S V S+ FSP GK LAS GS D +RLW +
Sbjct: 385 VSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLAS---GSSDNTIRLWDL 441
Query: 166 TDGASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLV 202
+ + S++FSP GK LAS D V
Sbjct: 442 K------------TSLKSVSFSPDGKVLASKSSDLSV 466
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 45.0 bits (107), Expect = 6e-07
Identities = 20/42 (47%), Positives = 23/42 (54%), Gaps = 3/42 (7%)
Query: 123 GASVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAVRLWS 164
G +R H PV S+AFSP G LAS GS D VR+W
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLAS---GSDDGTVRVWD 39
Score = 44.3 bits (105), Expect = 1e-06
Identities = 14/37 (37%), Positives = 17/37 (45%)
Query: 83 PLRIFVGHKQDVTCARFHPKITYIGTGSYDKAVRLWS 119
LR GH VT F P + +GS D VR+W
Sbjct: 3 LLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 43.9 bits (104), Expect = 2e-06
Identities = 18/35 (51%), Positives = 21/35 (60%)
Query: 168 GASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLV 202
G +R H PV S+AFSP G LASG +DG V
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTV 35
Score = 29.6 bits (67), Expect = 0.19
Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 216 FHLQEHE-PVTSLVWSVTGETLVTGSLTGTVQFY 248
L+ H PVTS+ +S G L +GS GTV+ +
Sbjct: 5 RTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 28.9 bits (65), Expect = 0.35
Identities = 13/32 (40%), Positives = 15/32 (46%), Gaps = 1/32 (3%)
Query: 47 SGHDMTVWSVEACSKG-LALTGSRDTTARLWS 77
GH V SV G L +GS D T R+W
Sbjct: 8 KGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 45.0 bits (107), Expect = 7e-07
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 122 DGASVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAVRLWS 164
G ++ H PV S+AFSP GKYLAS GS D ++LW
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLAS---GSDDGTIKLWD 40
Score = 43.8 bits (104), Expect = 2e-06
Identities = 13/37 (35%), Positives = 18/37 (48%)
Query: 83 PLRIFVGHKQDVTCARFHPKITYIGTGSYDKAVRLWS 119
L+ GH VT F P Y+ +GS D ++LW
Sbjct: 4 LLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 43.8 bits (104), Expect = 2e-06
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 167 DGASVRVFSDHVSPVLSIAFSPCGKYLASGGEDGLV 202
G ++ H PV S+AFSP GKYLASG +DG +
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTI 36
Score = 30.0 bits (68), Expect = 0.15
Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 218 LQEHE-PVTSLVWSVTGETLVTGSLTGTVQFY 248
L+ H PVTS+ +S G+ L +GS GT++ +
Sbjct: 8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 27.3 bits (61), Expect = 1.2
Identities = 12/32 (37%), Positives = 14/32 (43%), Gaps = 1/32 (3%)
Query: 47 SGHDMTVWSVEACSKG-LALTGSRDTTARLWS 77
GH V SV G +GS D T +LW
Sbjct: 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
>gnl|CDD|234019 TIGR02800, propeller_TolB, tol-pal system beta propeller repeat
protein TolB. Members of this protein family are the
TolB periplasmic protein of Gram-negative bacteria. TolB
is part of the Tol-Pal (peptidoglycan-associated
lipoprotein) multiprotein complex, comprising five
envelope proteins, TolQ, TolR, TolA, TolB and Pal, which
form two complexes. The TolQ, TolR and TolA
inner-membrane proteins interact via their transmembrane
domains. The {beta}-propeller domain of the periplasmic
protein TolB is responsible for its interaction with
Pal. TolB also interacts with the outer-membrane
peptidoglycan-associated proteins Lpp and OmpA. TolA
undergoes a conformational change in response to changes
in the proton-motive force, and interacts with Pal in an
energy-dependent manner. The C-terminal periplasmic
domain of TolA also interacts with the N-terminal domain
of TolB. The Tol-PAL system is required for bacterial
outer membrane integrity. E. coli TolB is involved in
the tonB-independent uptake of group A colicins
(colicins A, E1, E2, E3 and K), and is necessary for the
colicins to reach their respective targets after initial
binding to the bacteria. It is also involved in uptake
of filamentous DNA. Study of its structure suggest that
the TolB protein might be involved in the recycling of
peptidoglycan or in its covalent linking with
lipoproteins. The Tol-Pal system is also implicated in
pathogenesis of E. coli, Haemophilus ducreyi ,
Salmonella enterica and Vibrio cholerae, but the
mechanism(s) is unclear [Transport and binding proteins,
Other, Cellular processes, Pathogenesis].
Length = 417
Score = 39.9 bits (94), Expect = 0.001
Identities = 40/163 (24%), Positives = 60/163 (36%), Gaps = 24/163 (14%)
Query: 99 FHPKITYIGTGSYDKAVRLW-SITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYD 157
F +I Y+ + L + DGA+ + + P+LS A+SP G+ LA S
Sbjct: 154 FSTRIAYVSKSGKGRRYELQVADYDGANPQTITRSREPILSPAWSPDGQKLAYVSFESGK 213
Query: 158 KAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLA-----SGGEDGLVL------THE 206
+ + + G + + AFSP G LA G D V+
Sbjct: 214 PEIYVQDLATGQ-REKVASFPGMNGAPAFSPDGSKLAVSLSKDGNPDIYVMDLDGKQLTR 272
Query: 207 LTTGRVISRFHLQEHEPVTSLVWSVTGETLV-TGSLTGTVQFY 248
LT G I T WS G+++ T G+ Q Y
Sbjct: 273 LTNGPGID----------TEPSWSPDGKSIAFTSDRGGSPQIY 305
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 39.9 bits (93), Expect = 0.002
Identities = 38/140 (27%), Positives = 64/140 (45%), Gaps = 16/140 (11%)
Query: 114 AVRLWSITDGASVRVFSDHVSPVLSIAFSPC-GKYLASGGTGSYDKAVRLWSIT-DGASV 171
A+RL + V H S +L + F+PC + LAS GS D +R+W I + SV
Sbjct: 55 AIRLENQMRKPPVIKLKGHTSSILDLQFNPCFSEILAS---GSEDLTIRVWEIPHNDESV 111
Query: 172 RVFSD-------HVSPVLSIAFSPCGKY-LASGGEDGLVLTHELTTGRVISRFHLQEHEP 223
+ D H + I ++P Y + S G D V ++ + F + +
Sbjct: 112 KEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEK--RAFQINMPKK 169
Query: 224 VTSLVWSVTGETLVTGSLTG 243
++SL W++ G L++G+ G
Sbjct: 170 LSSLKWNIKG-NLLSGTCVG 188
Score = 35.3 bits (81), Expect = 0.047
Identities = 20/71 (28%), Positives = 32/71 (45%), Gaps = 9/71 (12%)
Query: 59 CSKGLALTGSRDTTARLWSL--------EQMYPLRIFVGHKQDVTCARFHPKITYI-GTG 109
C + +GS D T R+W + E P I GHK+ ++ ++P YI +
Sbjct: 85 CFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSS 144
Query: 110 SYDKAVRLWSI 120
+D V +W I
Sbjct: 145 GFDSFVNIWDI 155
>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
repeat [General function prediction only].
Length = 561
Score = 34.5 bits (79), Expect = 0.089
Identities = 20/62 (32%), Positives = 25/62 (40%), Gaps = 8/62 (12%)
Query: 135 PVLSIAFSPCGKYLASGGTGSYDKA-VRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYL 193
PV ++ SP G YL S A V W A + F V + FSP KYL
Sbjct: 34 PVAYVSESPLGTYLF-----SEHAAGVECWGGPSKAKLVRF--RHPDVKYLDFSPNEKYL 86
Query: 194 AS 195
+
Sbjct: 87 VT 88
>gnl|CDD|235083 PRK02889, tolB, translocation protein TolB; Provisional.
Length = 427
Score = 32.3 bits (74), Expect = 0.41
Identities = 32/95 (33%), Positives = 42/95 (44%), Gaps = 11/95 (11%)
Query: 161 RLW-SITDGASVRVFSDHVSPVLSIAFSPCGKYLA-SGGEDG--LVLTHELTTG--RVIS 214
+L S DG + + P++S A+SP G LA E +V H+L TG RV++
Sbjct: 177 QLQISDADGQNAQSALSSPEPIISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVVA 236
Query: 215 RFHLQEHEPVTSLVWSVTGETL-VTGSLTGTVQFY 248
F P WS G TL V S G Q Y
Sbjct: 237 NFKGSNSAP----AWSPDGRTLAVALSRDGNSQIY 267
>gnl|CDD|238812 cd01580, AcnA_IRP_Swivel, Aconitase A swivel domain. This is the
major form of the TCA cycle enzyme aconitate hydratase,
also known as aconitase and citrate hydro-lyase. It
includes bacterial and archaeal aconitase A, and the
eukaryotic cytosolic form of aconitase. This group also
includes sequences that have been shown to act as an
iron-responsive element (IRE) binding protein in animals
and may have the same role in other eukaryotes. This is
the aconitase-like swivel domain, which is believed to
undergo swivelling conformational change in the enzyme
mechanism.
Length = 171
Score = 29.9 bits (68), Expect = 1.1
Identities = 16/27 (59%), Positives = 18/27 (66%), Gaps = 1/27 (3%)
Query: 127 RVFSDHVSPVLSIAF-SPCGKYLASGG 152
V +DH+SP SIA SP GKYLA G
Sbjct: 5 SVTTDHISPAGSIAKDSPAGKYLAERG 31
Score = 29.9 bits (68), Expect = 1.1
Identities = 16/27 (59%), Positives = 18/27 (66%), Gaps = 1/27 (3%)
Query: 172 RVFSDHVSPVLSIAF-SPCGKYLASGG 197
V +DH+SP SIA SP GKYLA G
Sbjct: 5 SVTTDHISPAGSIAKDSPAGKYLAERG 31
>gnl|CDD|235351 PRK05137, tolB, translocation protein TolB; Provisional.
Length = 435
Score = 30.7 bits (70), Expect = 1.4
Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 10/99 (10%)
Query: 99 FHPKITYIG-TGSYDKAVRLWSI--TDGASVRVFSDHVSPVLSIAFSPCGK---YLASGG 152
F +I Y+ +G +K ++ +I DGA+VR +D S VL+ FSP + Y++
Sbjct: 164 FDTRIVYVAESGPKNKRIKRLAIMDQDGANVRYLTDGSSLVLTPRFSPNRQEITYMSYAN 223
Query: 153 TGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGK 191
V L + G + + + FSP G+
Sbjct: 224 GRPR---VYLLDLETGQR-ELVGNFPGMTFAPRFSPDGR 258
>gnl|CDD|237137 PRK12561, PRK12561, NAD(P)H-quinone oxidoreductase subunit 4;
Provisional.
Length = 504
Score = 30.4 bits (69), Expect = 1.4
Identities = 11/29 (37%), Positives = 18/29 (62%), Gaps = 2/29 (6%)
Query: 256 IGIGVYPRCAAQPVGFEAATLAANSRVRT 284
IGIG YP+ A Q ++ T+A +++R
Sbjct: 448 IGIGFYPKLATQL--YDVKTVALAAQLRQ 474
>gnl|CDD|237246 PRK12881, acnA, aconitate hydratase; Provisional.
Length = 889
Score = 30.3 bits (69), Expect = 2.0
Identities = 13/28 (46%), Positives = 17/28 (60%), Gaps = 1/28 (3%)
Query: 173 VFSDHVSPVLSI-AFSPCGKYLASGGED 199
+ +DH+SP +I A SP GKYL G
Sbjct: 669 ITTDHISPAGAIKADSPAGKYLKENGVP 696
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 30.1 bits (67), Expect = 2.1
Identities = 24/98 (24%), Positives = 40/98 (40%), Gaps = 2/98 (2%)
Query: 152 GTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCG-KYLASGGEDGLVLTHELTTG 210
+ +++ V++W + V +H V SI +S LASG +DG V + G
Sbjct: 549 ASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQG 608
Query: 211 RVISRFHLQEHEPVTSLVWSVTGETLVTGSLTGTVQFY 248
I + + S +G +L GS V +Y
Sbjct: 609 VSIGTIKTKANICCVQFP-SESGRSLAFGSADHKVYYY 645
Score = 29.7 bits (66), Expect = 3.1
Identities = 25/82 (30%), Positives = 32/82 (39%), Gaps = 7/82 (8%)
Query: 36 NLQSRYLPYYTS-GHDMTVWSVEACSKGLALTGSRDTTARLWSLEQMY------PLRIFV 88
+L++ LP T GH TV V ++ S D T +LW L PL F+
Sbjct: 646 DLRNPKLPLCTMIGHSKTVSYVRFVDSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFM 705
Query: 89 GHKQDVTCARFHPKITYIGTGS 110
GH YI TGS
Sbjct: 706 GHTNVKNFVGLSVSDGYIATGS 727
>gnl|CDD|224897 COG1986, COG1986, Inosine/xanthosine triphosphatase [Nucleotide
transport and metabolism].
Length = 175
Score = 28.8 bits (65), Expect = 2.7
Identities = 20/91 (21%), Positives = 36/91 (39%), Gaps = 9/91 (9%)
Query: 256 IGIGVYPRCAAQPVGFEAATLAANSRVRTLREMHEI---VGPVVVVESFIVEIRPDKWYE 312
+G+ V QP G E A +R + + VG +E+ I+ + K+ +
Sbjct: 31 VGVAVDSGVPPQPFGDEETVQGARNRAKNALRAVDADYGVG----IEAGIIRV-GTKYVD 85
Query: 313 ANLIVLDDDLKECNLETYSQMIPVPSGIVEE 343
+ V+ D E S +P ++EE
Sbjct: 86 FHWAVVVDREGRRT-EGRSAGFELPPRVLEE 115
>gnl|CDD|225256 COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic
ion transport and metabolism].
Length = 299
Score = 29.3 bits (66), Expect = 3.2
Identities = 9/30 (30%), Positives = 11/30 (36%), Gaps = 1/30 (3%)
Query: 28 LANLESQKNLQSRYLPYYTSGHDMTVWSVE 57
LA +K + Y Y GHD W
Sbjct: 259 LAAQLEKKGIPYYYREY-PGGHDWAWWRPA 287
>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
This is a family of eukaryotic translation initiation
factors.
Length = 194
Score = 28.4 bits (64), Expect = 4.2
Identities = 14/80 (17%), Positives = 30/80 (37%), Gaps = 10/80 (12%)
Query: 124 ASVRVFSDHVSPVLS--------IAFSPCGKYLASGGTGSYDKAVRLWSITDGASVRVFS 175
A + F + + S I +SP G+ + G G+ + W + + +
Sbjct: 83 AKITFFDLKGNVIHSLGEQPRNTIFWSPFGRLVLLAGFGNLAGQIEFWDVKNKKKIATA- 141
Query: 176 DHVSPVLSIAFSPCGKYLAS 195
+ + +SP G+Y +
Sbjct: 142 EASNATD-CEWSPDGRYFLT 160
>gnl|CDD|237117 PRK12493, PRK12493, magnesium chelatase subunit H; Provisional.
Length = 1310
Score = 29.1 bits (66), Expect = 4.6
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 8/50 (16%)
Query: 266 AQPVGFEAATLAANSRVRTLREMHEIVGPVVVVESFIVEIRPDKWYEANL 315
+P + A T AN++VR+L EM V +E+ + P KWYE L
Sbjct: 1168 KKPSVYIADTTTANAQVRSLEEM-------VRLEARTKLLNP-KWYEGML 1209
>gnl|CDD|110891 pfam01931, NTPase_I-T, Protein of unknown function DUF84. The
function of this prokaryotic protein family is unknown.
Length = 168
Score = 28.0 bits (63), Expect = 4.9
Identities = 20/88 (22%), Positives = 38/88 (43%), Gaps = 3/88 (3%)
Query: 256 IGIGVYPRCAAQPVGFEAATLAANSRVRTLREMHEIVGPVVVVESFIVEIRPDKWYEANL 315
+G+ V QP+G E A +R + RE V +E+ I I + W+
Sbjct: 27 VGVEVSSGVPDQPMGAEETRRGARNRAKNAREKVPEADYGVGIEAGIERI-GEGWFTFAW 85
Query: 316 IVLDDDLKECNLETYSQMIPVPSGIVEE 343
+V+++ + E S +P ++E+
Sbjct: 86 MVVENGERRG--EGRSASFELPPKVLEK 111
>gnl|CDD|173674 cd05583, STKc_MSK_N, N-terminal catalytic domain of the Protein
Serine/Threonine Kinase, Mitogen and stress-activated
kinase. Serine/Threonine Kinases (STKs), Mitogen and
stress-activated kinase (MSK) subfamily, N-terminal
catalytic (c) domain. STKs catalyze the transfer of the
gamma-phosphoryl group from ATP to serine/threonine
residues on protein substrates. The MSK subfamily is
part of a larger superfamily that includes the catalytic
domains of other protein STKs, protein tyrosine kinases,
RIO kinases, aminoglycoside phosphotransferase, choline
kinase, and phosphoinositide 3-kinase. MSKs contain an
N-terminal kinase domain (NTD) from the AGC family and a
C-terminal kinase domain (CTD) from the CAMK family,
similar to 90 kDa ribosomal protein S6 kinases (RSKs).
MSKs are activated by two major signaling cascades, the
Ras-MAPK and p38 stress kinase pathways, in response to
various stimuli such as growth factors, hormones,
neurotransmitters, cellular stress, and pro-inflammatory
cytokines. This triggers phosphorylation in the
activation loop (A-loop) of the CTD of MSK. The active
CTD phosphorylates the hydrophobic motif (HM) in the
C-terminal extension of NTD, which facilitates the
phosphorylation of the A-loop and activates the NTD,
which in turn phosphorylates downstream targets. MSKs
are predominantly nuclear proteins. They are widely
expressed in many tissues including heart, brain, lung,
liver, kidney, and pancreas. There are two isoforms of
MSK, called MSK1 and MSK2.
Length = 288
Score = 28.2 bits (63), Expect = 6.4
Identities = 15/32 (46%), Positives = 21/32 (65%), Gaps = 6/32 (18%)
Query: 140 AFSPCG--KYLA----SGGTGSYDKAVRLWSI 165
A+S CG +Y+A GG+G +DKAV WS+
Sbjct: 163 AYSFCGTIEYMAPEVIRGGSGGHDKAVDWWSL 194
>gnl|CDD|237770 PRK14618, PRK14618, NAD(P)H-dependent glycerol-3-phosphate
dehydrogenase; Provisional.
Length = 328
Score = 27.9 bits (62), Expect = 7.3
Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 5/75 (6%)
Query: 59 CSKGLALTGSRDTT-ARLWSLEQMYPLRIFVG--HKQDVTCARFHPKITYIGTGSYDKAV 115
C+KGLA G R + AR+ + + G H +++ ARF P T + + A
Sbjct: 103 CAKGLAPDGGRLSELARVLEFLTQARVAVLSGPNHAEEI--ARFLPAATVVASPEPGLAR 160
Query: 116 RLWSITDGASVRVFS 130
R+ + G S RV++
Sbjct: 161 RVQAAFSGPSFRVYT 175
>gnl|CDD|146279 pfam03557, Bunya_G1, Bunyavirus glycoprotein G1. Bunyavirus has
three genomic segments: small (S), middle-sized (M),
and large (L). The S segment encodes the nucleocapsid
and a non-structural protein. The M segment codes for
two glycoproteins, G1 and G2, and another
non-structural protein (NSm). The L segment codes for
an RNA polymerase. This family contains the G1
glycoprotein which is the viral attachment protein.
Length = 871
Score = 28.1 bits (63), Expect = 8.1
Identities = 16/62 (25%), Positives = 23/62 (37%), Gaps = 22/62 (35%)
Query: 21 PETVEDVLANLESQKN----------LQSRYLPYYT-----SGHDMTVW-------SVEA 58
P+T+ L +ESQK ++Y YYT SG+ T W +
Sbjct: 26 PDTIAGALRYIESQKTYHEMLTLEYAFLTKYCDYYTQFEDNSGYSQTKWRLYLRSHDFDI 85
Query: 59 CS 60
C
Sbjct: 86 CI 87
>gnl|CDD|179615 PRK03634, PRK03634, rhamnulose-1-phosphate aldolase; Provisional.
Length = 274
Score = 27.6 bits (62), Expect = 8.6
Identities = 18/96 (18%), Positives = 30/96 (31%), Gaps = 25/96 (26%)
Query: 131 DHVSPVLSIAFSP---CGKYLASGGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFS 187
+ ++ G Y G+G + + V+L + +R+ SD
Sbjct: 50 HQQPRYIPLSQPMPELAGTYFLVTGSGKFFRNVQLDPAANLGVIRIDSD----------- 98
Query: 188 PCGKY--LASGGEDGLVLTHELTTGRVISRFHLQEH 221
G + G +G T EL HL H
Sbjct: 99 --GAGYHILWGLTNGGKPTSELPA-------HLMSH 125
>gnl|CDD|132370 TIGR03327, AMP_phos, AMP phosphorylase. This enzyme family is
found, so far, strictly in the Archaea, and only in
those with a type III Rubisco enzyme. Most of the
members previously were annotated as thymidine
phosphorylase, or DeoA. The AMP metabolized by this
enzyme may be produced by ADP-dependent sugar kinases.
Length = 500
Score = 28.1 bits (63), Expect = 8.9
Identities = 12/57 (21%), Positives = 21/57 (36%), Gaps = 2/57 (3%)
Query: 118 WSITDGASVRVFSDHVSPVLSIAFSPCGKYLASGGTGSYDKAVRLWSITDGASVRVF 174
+ G VR+ S S V + + + G G + + I++G V V
Sbjct: 26 LGVHPGDRVRIESGGKSVVGIVDST--DTLVEKGEIGLSHEVLEELGISEGEVVEVT 80
>gnl|CDD|236445 PRK09277, PRK09277, aconitate hydratase; Validated.
Length = 888
Score = 28.2 bits (64), Expect = 9.0
Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 1/23 (4%)
Query: 131 DHVSPVLSI-AFSPCGKYLASGG 152
DH+SP +I A SP GKYL G
Sbjct: 671 DHISPAGAIKADSPAGKYLLEHG 693
Score = 28.2 bits (64), Expect = 9.0
Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 1/23 (4%)
Query: 176 DHVSPVLSI-AFSPCGKYLASGG 197
DH+SP +I A SP GKYL G
Sbjct: 671 DHISPAGAIKADSPAGKYLLEHG 693
>gnl|CDD|223893 COG0823, TolB, Periplasmic component of the Tol biopolymer
transport system [Intracellular trafficking and
secretion].
Length = 425
Score = 27.8 bits (62), Expect = 9.4
Identities = 25/112 (22%), Positives = 40/112 (35%), Gaps = 16/112 (14%)
Query: 86 IFVGHKQDVTCARFHPKITYIGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCG 145
F V + P + G YD G + + +D S +L+ A+SP G
Sbjct: 155 AFDTRIAYVAESDGGPLPYELALGDYD----------GYNQQKLTDSGSLILTPAWSPDG 204
Query: 146 K---YLASGGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAFSPCGKYLA 194
K Y++ G + + G + + + AFSP G LA
Sbjct: 205 KKLAYVSFELGGCPR--IYYLDLNTGKRPVI-LNFNGNNGAPAFSPDGSKLA 253
>gnl|CDD|238160 cd00257, Fascin, Fascin-like domain; members include
actin-bundling/crosslinking proteins facsin,
histoactophilin and singed; identified in sea urchin,
Drosophila, Xenopus, rodents, and humans; The
fascin-like domain adopts a beta-trefoil topology and
contains an internal threefold repeat; the fascin
subgroup contains four copies of the domain;
Structurally similar to fibroblast growth factor (FGF).
Length = 119
Score = 26.9 bits (60), Expect = 9.6
Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 7/66 (10%)
Query: 142 SPCGKYLASGGTGSYDKAVRLWSITDGASVRVFSDHVSPVLSIAF-SPCGKYLASGGEDG 200
S GKYL++ G V+L A R + A + G+YL G
Sbjct: 50 SHDGKYLSADSDGG----VQLEG-HPNADCRFTLEF-HGDGKWALRAENGRYLGGDGSGT 103
Query: 201 LVLTHE 206
L + E
Sbjct: 104 LKASSE 109
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.411
Gapped
Lambda K H
0.267 0.0728 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,914,338
Number of extensions: 1722171
Number of successful extensions: 1515
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1465
Number of HSP's successfully gapped: 57
Length of query: 351
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 253
Effective length of database: 6,590,910
Effective search space: 1667500230
Effective search space used: 1667500230
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.5 bits)