RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6127
(515 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 138 bits (350), Expect = 3e-37
Identities = 89/298 (29%), Positives = 138/298 (46%), Gaps = 16/298 (5%)
Query: 215 KVFRENTDKINSLDYSSDGSLLISCSDDDQIIIYDCEQGTSKNTINSKKYGVDLIKFTHS 274
+ + +T + + +S DG LL + S D I ++D E G T+ V + +
Sbjct: 3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD 62
Query: 275 KTTAIHASTKVDATLRYLSLHDNKYVRYFPGHVRKVSSLNLSPVDDSFISGSYDKTVRVW 334
T AS D T+R L + VR GH VSS+ SP S S DKT++VW
Sbjct: 63 GTYL--ASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW 120
Query: 335 DLRQPNNVGMIQLSG-NKPI--AAYDPEGLAFAIGVNSDSIKLYDVRSYDKGPFASFKCP 391
D+ + L G + A+ P+G A +IKL+D+R+ A+
Sbjct: 121 DVETGKCL--TTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRT--GKCVATLT-- 174
Query: 392 GERGCEWTGLKFSPDGKLILVSTNGSIIRLFDAFNGNCVQSLSGVLNNNASPLEATFTPD 451
G G + + FSPDG+ +L S++ I+L+D G C+ +L G + F+PD
Sbjct: 175 GHTGEVNS-VAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRG---HENGVNSVAFSPD 230
Query: 452 SQFVASGSTDGQVHIWNAERGYKVCVLDGDHPSAVQSLQFNPKYHMLASACSNMAFWI 509
+ASGS DG + +W+ G V L G H ++V SL ++P LAS ++ I
Sbjct: 231 GYLLASGSEDGTIRVWDLRTGECVQTLSG-HTNSVTSLAWSPDGKRLASGSADGTIRI 287
Score = 122 bits (307), Expect = 3e-31
Identities = 76/254 (29%), Positives = 119/254 (46%), Gaps = 21/254 (8%)
Query: 221 TDKINSLDYSSDGSLLISCSDDDQIIIYDCEQGTSKNTINSKKYGVDLIKFTHSKTTAIH 280
T + + S+DG+ L S S D I ++D E G T+ V + F+ I
Sbjct: 51 TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGR--IL 108
Query: 281 ASTKVDATLRYLSLHDNKYVRYFPGHVRKVSSLNLSPVDDSFISGSYDKTVRVWDLRQPN 340
+S+ D T++ + K + GH V+S+ SP S S D T+++WDLR
Sbjct: 109 SSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLR--T 166
Query: 341 NVGMIQLSG-NKPI--AAYDPEGLAFAIGVNSDSIKLYDVRSYDKGPFASFKCPGE-RGC 396
+ L+G + A+ P+G + +IKL+D+ + KC G RG
Sbjct: 167 GKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS--------TGKCLGTLRGH 218
Query: 397 E-W-TGLKFSPDGKLILVSTNGSIIRLFDAFNGNCVQSLSGVLNNNASPLEATFTPDSQF 454
E + FSPDG L+ + IR++D G CVQ+LSG + S ++PD +
Sbjct: 219 ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSLAWSPDGKR 275
Query: 455 VASGSTDGQVHIWN 468
+ASGS DG + IW+
Sbjct: 276 LASGSADGTIRIWD 289
Score = 106 bits (266), Expect = 1e-25
Identities = 61/212 (28%), Positives = 97/212 (45%), Gaps = 20/212 (9%)
Query: 300 VRYFPGHVRKVSSLNLSPVDDSFISGSYDKTVRVWDLRQPNNVGMIQLSGNK-PIA--AY 356
R GH V+ + SP +GS D T++VWDL + L G+ P+ A
Sbjct: 2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLE--TGELLRTLKGHTGPVRDVAA 59
Query: 357 DPEGLAFAIGVNSDSIKLYDVRSYDKGPFASFKCPGE-RG--CEWTGLKFSPDGKLILVS 413
+G A G + +I+L+D+ + +C G + + FSPDG+++ S
Sbjct: 60 SADGTYLASGSSDKTIRLWDLE--------TGECVRTLTGHTSYVSSVAFSPDGRILSSS 111
Query: 414 TNGSIIRLFDAFNGNCVQSLSGVLNNNASPLEATFTPDSQFVASGSTDGQVHIWNAERGY 473
+ I+++D G C+ +L G + F+PD FVAS S DG + +W+ G
Sbjct: 112 SRDKTIKVWDVETGKCLTTLRG---HTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK 168
Query: 474 KVCVLDGDHPSAVQSLQFNPKYHMLASACSNM 505
V L G H V S+ F+P L S+ S+
Sbjct: 169 CVATLTG-HTGEVNSVAFSPDGEKLLSSSSDG 199
Score = 71.2 bits (175), Expect = 1e-13
Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 2/128 (1%)
Query: 208 VRSYRVAKVFRENTDKINSLDYSSDGSLLISCSDDDQIIIYDCEQGTSKNTINSKKYGVD 267
+R+ + +T ++NS+ +S DG L+S S D I ++D G T+ + GV+
Sbjct: 164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVN 223
Query: 268 LIKFTHSKTTAIHASTKVDATLRYLSLHDNKYVRYFPGHVRKVSSLNLSPVDDSFISGSY 327
+ F+ AS D T+R L + V+ GH V+SL SP SGS
Sbjct: 224 SVAFSPDGYLL--ASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSA 281
Query: 328 DKTVRVWD 335
D T+R+WD
Sbjct: 282 DGTIRIWD 289
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 100 bits (248), Expect = 1e-22
Identities = 98/320 (30%), Positives = 152/320 (47%), Gaps = 15/320 (4%)
Query: 190 SLAHSRSSRTITKCSENLVRSYRVAKVFRENTDKINSLDYSSDGSLLISCSDDD-QIIIY 248
S+ + SS T +L ++ + +++ + SL +S DG LL S S D I ++
Sbjct: 124 SILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLW 183
Query: 249 DCEQGTSKNTINSKKYGVDLIKFTHSKTTAIHASTKVDATLRYLSLHDNKYVR-YFPGHV 307
D G +T+ V + F+ I AS D T+R L K +R GH
Sbjct: 184 DLRTGKPLSTLAGHTDPVSSLAFSPDGGLLI-ASGSSDGTIRLWDLSTGKLLRSTLSGHS 242
Query: 308 RKVSSLNLSPVDDSFISGSYDKTVRVWDLRQPNNVGMIQLSGNKPI--AAYDPEGLAFAI 365
V S SP SGS D T+R+WDLR +++ + + A+ P+G A
Sbjct: 243 DSVVSS-FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLAS 301
Query: 366 GVNSDSIKLYDVRSYDKGPFASFKCPGERGCEWTGLKFSPDGKLILVS-TNGSIIRLFDA 424
G + +++L+D+ + + K G G + L FSPDG L++ ++ IRL+D
Sbjct: 302 GSSDGTVRLWDLETGKLLSSLTLK--GHEGPVSS-LSFSPDGSLLVSGGSDDGTIRLWDL 358
Query: 425 FNGNCVQSLSGVLNNNASPLEATFTPDSQFVASGSTDGQVHIWNAERGYKVCVLDGDHPS 484
G +++L G N +F+PD + V+SGSTDG V +W+ G + LDG H S
Sbjct: 359 RTGKPLKTLEGHSNVL----SVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDG-HTS 413
Query: 485 AVQSLQFNPKYHMLASACSN 504
V SL F+P LAS S+
Sbjct: 414 RVTSLDFSPDGKSLASGSSD 433
Score = 84.4 bits (207), Expect = 2e-17
Identities = 79/339 (23%), Positives = 132/339 (38%), Gaps = 56/339 (16%)
Query: 172 HFVSLFVTFVSRVLTMAASLAHSRSSRTITKCSENLVRSYRVAKVFRENTDKINSLDYSS 231
L + S L S + S + + R + D I S+ +S
Sbjct: 17 KKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSS-LLLRGHEDSITSIAFSP 75
Query: 232 DGSLLISCSDDDQIIIYDCEQGTSKNTINSKKYG---VDLIKFTHSKTTAIHASTKVDAT 288
DG LL+S S D I ++D + G + L + + + AS+ +D T
Sbjct: 76 DGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGT 135
Query: 289 LRYLSL-HDNKYVRYFPGHVRKVSSLNLSPVDDSFISGS-YDKTVRVWDLRQPNNVGMIQ 346
++ L K +R GH V+SL SP SGS D T+++WDLR + +
Sbjct: 136 VKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLA 195
Query: 347 LSGNKPIAAYDPEGLAFAIGVNSDSIKLYDVRSYDKGPFASFKCPGERGCEWTGLKFSPD 406
+ + L FSPD
Sbjct: 196 GHTDPV----------------------------------------------SSLAFSPD 209
Query: 407 GKLILVS-TNGSIIRLFDAFNGNCVQSLSGVLNNNASPLEATFTPDSQFVASGSTDGQVH 465
G L++ S ++ IRL+D G ++S L+ ++ + ++F+PD +ASGS+DG +
Sbjct: 210 GGLLIASGSSDGTIRLWDLSTGKLLRS---TLSGHSDSVVSSFSPDGSLLASGSSDGTIR 266
Query: 466 IWNAERGYKVCVLDGDHPSAVQSLQFNPKYHMLASACSN 504
+W+ + H S+V S+ F+P +LAS S+
Sbjct: 267 LWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSD 305
Score = 81.7 bits (200), Expect = 2e-16
Identities = 73/279 (26%), Positives = 123/279 (44%), Gaps = 15/279 (5%)
Query: 208 VRSYRVAKVFRENTDKINSLDYSSDGSLLI-SCSDDDQIIIYDCEQGTSKNTINSKKYGV 266
+R+ + +TD ++SL +S DG LLI S S D I ++D G + S
Sbjct: 185 LRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDS 244
Query: 267 DLIKFTHSKTTAIHASTKVDATLRYLSLH-DNKYVRYFPGHVRKVSSLNLSPVDDSFISG 325
+ F+ + S+ D T+R L + +R GH V S+ SP SG
Sbjct: 245 VVSSFSPDGSLLASGSS--DGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASG 302
Query: 326 SYDKTVRVWDLRQPNNVGMIQLSGNKPI---AAYDPEGLAFAIGVNSDSIKLYDVRSYDK 382
S D TVR+WDL + + L G++ ++ P+G G + D +R +D
Sbjct: 303 SSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGT----IRLWDL 358
Query: 383 GPFASFKCPGERGCEWTGLKFSPDGKLILVSTNGSIIRLFDAFNGNCVQSLSGVLNNNAS 442
K + + FSPDG+++ + +RL+D G+ +++L G + S
Sbjct: 359 RTGKPLKTLEGHSNVLS-VSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTS 417
Query: 443 PLEATFTPDSQFVASGSTDGQVHIWNAERGYKVCVLDGD 481
F+PD + +ASGS+D + +W+ + K D
Sbjct: 418 L---DFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPD 453
Score = 80.9 bits (198), Expect = 3e-16
Identities = 71/292 (24%), Positives = 124/292 (42%), Gaps = 11/292 (3%)
Query: 226 SLDYSSDGSLLISCSDDDQII-IYDCE-QGTSKNTINSKKYGVDLIKFTHSKTTAIHAST 283
+L S+L++ S D + ++D G T+ V + F+ S+
Sbjct: 116 ALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSS 175
Query: 284 KVDATLRYLSLHDNKYVRYFPGHVRKVSSLNLSPVDDSFI-SGSYDKTVRVWDLRQPNNV 342
D T++ L K + GH VSSL SP I SGS D T+R+WDL +
Sbjct: 176 L-DGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLL 234
Query: 343 GM-IQLSGNKPIAAYDPEGLAFAIGVNSDSIKLYDVRSYDKGPFASFKCPGERGCEWTGL 401
+ + ++++ P+G A G + +I+L+D+RS +
Sbjct: 235 RSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLS----V 290
Query: 402 KFSPDGKLILVSTNGSIIRLFDAFNGNCVQSLSGVLNNNASPLEATFTPDSQFVASGSTD 461
FSPDGKL+ ++ +RL+D G + SL+ + + S V+ GS D
Sbjct: 291 AFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDD 350
Query: 462 GQVHIWNAERGYKVCVLDGDHPSAVQSLQFNPKYHMLASACSNMAFWIPTLS 513
G + +W+ G + + S V S+ F+P +++S ++ + LS
Sbjct: 351 GTIRLWDLRTGK--PLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLS 400
Score = 57.8 bits (138), Expect = 6e-09
Identities = 43/200 (21%), Positives = 83/200 (41%), Gaps = 7/200 (3%)
Query: 190 SLAHSRSSRTITKCSENLVRSYRVA------KVFRENTDKINSLDYSSDGSLLISCSDDD 243
S S + S+ +R + + + ++ + S+ +S DG LL S S D
Sbjct: 247 SSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDG 306
Query: 244 QIIIYDCEQGTSKNTINSKKYGVDLIKFTHSKTTAIHASTKVDATLRYLSLHDNKYVRYF 303
+ ++D E G +++ K + + + S ++ S D L
Sbjct: 307 TVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKT 366
Query: 304 PGHVRKVSSLNLSPVDDSFISGSYDKTVRVWDLRQPNNVG-MIQLSGNKPIAAYDPEGLA 362
V S++ SP SGS D TVR+WDL + + + + + P+G +
Sbjct: 367 LEGHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKS 426
Query: 363 FAIGVNSDSIKLYDVRSYDK 382
A G + ++I+L+D+++ K
Sbjct: 427 LASGSSDNTIRLWDLKTSLK 446
Score = 31.6 bits (70), Expect = 1.2
Identities = 17/80 (21%), Positives = 36/80 (45%), Gaps = 6/80 (7%)
Query: 190 SLAHSRSSRTITKCSENL------VRSYRVAKVFRENTDKINSLDYSSDGSLLISCSDDD 243
S++ S R ++ S + + + + + +T ++ SLD+S DG L S S D+
Sbjct: 375 SVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDN 434
Query: 244 QIIIYDCEQGTSKNTINSKK 263
I ++D + + +
Sbjct: 435 TIRLWDLKTSLKSVSFSPDG 454
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 42.7 bits (101), Expect = 6e-06
Identities = 18/38 (47%), Positives = 22/38 (57%)
Query: 298 KYVRYFPGHVRKVSSLNLSPVDDSFISGSYDKTVRVWD 335
K +R GH V+S+ SP + SGS D TVRVWD
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 40.0 bits (94), Expect = 6e-05
Identities = 12/35 (34%), Positives = 22/35 (62%)
Query: 215 KVFRENTDKINSLDYSSDGSLLISCSDDDQIIIYD 249
+ + +T + S+ +S DG+LL S SDD + ++D
Sbjct: 5 RTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 34.6 bits (80), Expect = 0.004
Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 427 GNCVQSLSGVLNNNASPLEATFTPDSQFVASGSTDGQVHIWN 468
G +++L G + F+PD +ASGS DG V +W+
Sbjct: 1 GKLLRTLKG---HTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 26.5 bits (59), Expect = 3.6
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 399 TGLKFSPDGKLILVSTNGSIIRLFD 423
T + FSPDG L+ ++ +R++D
Sbjct: 15 TSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 26.2 bits (58), Expect = 4.8
Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 1/33 (3%)
Query: 472 GYKVCVLDGDHPSAVQSLQFNPKYHMLASACSN 504
G + L G H V S+ F+P ++LAS +
Sbjct: 1 GKLLRTLKG-HTGPVTSVAFSPDGNLLASGSDD 32
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 42.3 bits (100), Expect = 1e-05
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 296 DNKYVRYFPGHVRKVSSLNLSPVDDSFISGSYDKTVRVWD 335
+ ++ GH V+S+ SP SGS D T+++WD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 40.8 bits (96), Expect = 3e-05
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 210 SYRVAKVFRENTDKINSLDYSSDGSLLISCSDDDQIIIYD 249
S + K + +T + S+ +S DG L S SDD I ++D
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 36.5 bits (85), Expect = 0.001
Identities = 13/43 (30%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 426 NGNCVQSLSGVLNNNASPLEATFTPDSQFVASGSTDGQVHIWN 468
+G +++L G + F+PD +++ASGS DG + +W+
Sbjct: 1 SGELLKTLKG---HTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 26.9 bits (60), Expect = 2.5
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 399 TGLKFSPDGKLILVSTNGSIIRLFD 423
T + FSPDGK + ++ I+L+D
Sbjct: 16 TSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 26.1 bits (58), Expect = 5.4
Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 1/33 (3%)
Query: 472 GYKVCVLDGDHPSAVQSLQFNPKYHMLASACSN 504
G + L G H V S+ F+P LAS +
Sbjct: 2 GELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD 33
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 44.7 bits (105), Expect = 9e-05
Identities = 40/122 (32%), Positives = 64/122 (52%), Gaps = 7/122 (5%)
Query: 218 RENTDKINSLDYSS-DGSLLISCSDDDQIIIYDCEQGTSKNTINSKKYGVDLIKFTHSKT 276
+E+ ++ S+DYSS D +LL S SDD + ++ QG S TI +K + ++F S++
Sbjct: 572 KEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKTKA-NICCVQFP-SES 629
Query: 277 TAIHASTKVDATLRYLSLHDNKY-VRYFPGHVRKVSSLNLSPVDDS-FISGSYDKTVRVW 334
A D + Y L + K + GH + VS + VD S +S S D T+++W
Sbjct: 630 GRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRF--VDSSTLVSSSTDNTLKLW 687
Query: 335 DL 336
DL
Sbjct: 688 DL 689
Score = 42.0 bits (98), Expect = 7e-04
Identities = 62/288 (21%), Positives = 114/288 (39%), Gaps = 47/288 (16%)
Query: 226 SLDYSSDGSLLISCSDDDQIIIYDCEQGTSKN----------TINSKKYGVDLIKFTHSK 275
++ + DG + + +I I++CE SK G+ + S+
Sbjct: 488 AIGFDRDGEFFATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQ 547
Query: 276 TTAIHASTKVDATLRYLSLHDNKYVRYFPGHVRKVSSLNLSPVDDSFI-SGSYDKTVRVW 334
AS+ + ++ + ++ V H ++V S++ S D + + SGS D +V++W
Sbjct: 548 V----ASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLW 603
Query: 335 DLRQPNNVGMIQLSGNKPIAAYDPE-GLAFAIGVNSDSIKLYDVRSYDKGPFASFKCPGE 393
+ Q ++G I+ N + E G + A G + YD+R+ P
Sbjct: 604 SINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRN-----------PKL 652
Query: 394 RGCEWTG-------LKFSPDGKLILVSTNGSIIRLFDAFNGNCVQSLSGVLNNNASPLEA 446
C G ++F L+ ST+ + ++L+D S+SG+ N +PL +
Sbjct: 653 PLCTMIGHSKTVSYVRFVDSSTLVSSSTDNT-LKLWD-----LSMSISGI---NETPLHS 703
Query: 447 --TFTPDSQFVASGSTDGQVHIWNAERGYKVCVLDGDHPSAVQSLQFN 492
T FV +DG +I +V V P V S +F
Sbjct: 704 FMGHTNVKNFVGLSVSDG--YIATGSETNEVFVYHKAFPMPVLSYKFK 749
>gnl|CDD|225044 COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate
transport and metabolism].
Length = 399
Score = 33.5 bits (77), Expect = 0.25
Identities = 37/188 (19%), Positives = 56/188 (29%), Gaps = 42/188 (22%)
Query: 304 PGHVRKVSSLNLSPVDDSFISGSYDKTVRVWDLRQPNNVGMIQLSGNKPIAAYDPEGLAF 363
G +R +S S TV + LR + I LS + G++
Sbjct: 87 TGRLRLISD------GGSASPPV--STVPIVLLRGQGGLLDIALSPDFAQGRLVYFGISE 138
Query: 364 AIG----VNSDSI--------KLYDVRSYDKGPFASFKCPGERGCEWTG-LKFSPDGKLI 410
G N +I KL + + F P + G L F PDGKL
Sbjct: 139 PGGGLYVANRVAIGRLPGGDTKLSEPK--VI--FRGI--PKGGH-HFGGRLVFGPDGKLY 191
Query: 411 LVSTNGSIIRLFDAFNGNCVQSLSGVLNNNASPLEATF------TPDSQFVASGSTDGQV 464
V+T + Q + A P+S+ + G + Q
Sbjct: 192 -VTTG-------SNGDPALAQDNVSLAGKVLRIDRAGIIPADNPFPNSEIWSYGHRNPQG 243
Query: 465 HIWNAERG 472
W+ G
Sbjct: 244 LAWHPVTG 251
>gnl|CDD|222638 pfam14269, Arylsulfotran_2, Arylsulfotransferase (ASST).
Length = 301
Score = 33.1 bits (76), Expect = 0.32
Identities = 16/78 (20%), Positives = 27/78 (34%), Gaps = 7/78 (8%)
Query: 224 INSLDYSSDGSLLISCSDDDQIIIYDCEQGTSKNTINSKKYG----VDLIKFTHSKTTAI 279
+NS++ DG LIS + + G +N G F++
Sbjct: 147 LNSVEKFGDGDYLISLRYYCSLFLIRPSNGKVMWQLNGPTGGDFELGPNSTFSYQHDARF 206
Query: 280 HASTKVDATLRYLSLHDN 297
T+ D +SL +N
Sbjct: 207 VNQTEDDQV---ISLFNN 221
>gnl|CDD|234327 TIGR03718, R_switched_Alx, integral membrane protein, TerC family.
Rfam model RF00080 describes a structured RNA element
called the yybP-ykoY leader, or SraF, which may precede
one or several genes in a genome. Members of this highly
hydrophobic protein family often are preceded by a
yybP-ykoY leader, which may serve as a riboswitch. From
the larger group of TerC homologs (pfam03741), this
subfamily contains TerC itself from Alcaligenes sp.
plasmid IncHI2 pMER610 and from Proteus mirabilis. It
also contains the alkaline-inducible E. coli protein
Alx, which unlike the two TerC examples is preceded by a
yybP-ykoY leader.
Length = 302
Score = 32.1 bits (74), Expect = 0.65
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 161 LLHSLSSLESPHFVSLFVTFVSRVLTMAASLAHSR 195
LLH+ P VSL V + +++ ASL +R
Sbjct: 267 LLHATDVYHIPIGVSLGVIVGTLAVSIVASLWKTR 301
>gnl|CDD|227282 COG4946, COG4946, Uncharacterized protein related to the
periplasmic component of the Tol biopolymer transport
system [Function unknown].
Length = 668
Score = 32.1 bits (73), Expect = 0.78
Identities = 48/267 (17%), Positives = 85/267 (31%), Gaps = 65/267 (24%)
Query: 214 AKVFRENTDKINSLDYSSDGSLLISCSDDDQIIIYDCEQGTSKNTINSKKYGVDL----- 268
K ++ I ++ S DG ++ +D ++ + D + G + I+ +YG+
Sbjct: 394 VKRIEKDLGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVR-LIDKSEYGLITDFDWH 452
Query: 269 ------------------IKFTHSKTTAIHASTK---------VDATLRYLSLHDNKYVR 301
IK I+ T D RYL + +
Sbjct: 453 PNSRWIAYAFPEGYYTQSIKLYDMDGGKIYDVTTPTAYDFSPAFDPDGRYLYFLSARSLD 512
Query: 302 YFPGHVRKVSSLNLSPVDDSF----ISGSYDKTVRVWDLRQPNNVGMIQLSGNKPIAA-- 355
P + R + + + V + G Y + D G + L G +
Sbjct: 513 --PSNDRVIFNFSFQRVSKPYLVVLGRGYYSPFNQPPDEANSEA-GEVDLEGIEDRVEPF 569
Query: 356 -YDPEGLAFAIGVNSDSIKLYDVRSYD-KGPFASFKCPG-ERGC---------------- 396
+ G+ + I L+ SY G F+ + E+G
Sbjct: 570 PVEEGDYRSIAGLKNGKILLF---SYPIHGEFSQYYWGQPEKGRLEKYDLETKKVEEYKD 626
Query: 397 EWTGLKFSPDGKLILVSTNGSIIRLFD 423
+ + S DG ILV +G +RLFD
Sbjct: 627 NVSSFRLSSDGSKILVKLDGK-LRLFD 652
Score = 32.1 bits (73), Expect = 0.92
Identities = 30/124 (24%), Positives = 54/124 (43%), Gaps = 22/124 (17%)
Query: 357 DPEGLAFAIGVNS-DSIKLYDVRSYDKGPFASFKCPGERGCEWTGLKFSPDGKLILVSTN 415
DPEG IG N D + +YD G E +K SPDGK ++V+ +
Sbjct: 370 DPEG--DVIGTNDGDKLGIYDKD----GGEVKRIEKDLGNIE--AVKVSPDGKKVVVAND 421
Query: 416 GSIIRLFDAFNGNCV---QSLSGVLNNNASPLEATFTPDSQFVASGSTDG----QVHIWN 468
+ + D NGN +S G++ + + P+S+++A +G + +++
Sbjct: 422 RFELWVIDIDNGNVRLIDKSEYGLI------TDFDWHPNSRWIAYAFPEGYYTQSIKLYD 475
Query: 469 AERG 472
+ G
Sbjct: 476 MDGG 479
>gnl|CDD|215497 PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family
protein.
Length = 1057
Score = 31.7 bits (72), Expect = 1.0
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 397 EWTGLKFSPDGKLILVSTNGSIIRLFD 423
E GL +G+L + TN S+IR D
Sbjct: 860 EPAGLALGENGRLFVADTNNSLIRYLD 886
>gnl|CDD|218748 pfam05787, DUF839, Bacterial protein of unknown function (DUF839).
This family consists of several bacterial proteins of
unknown function that contain a predicted beta-propeller
repeats.
Length = 515
Score = 31.0 bits (70), Expect = 1.7
Identities = 21/89 (23%), Positives = 31/89 (34%), Gaps = 15/89 (16%)
Query: 335 DLRQPNNVG-------MIQLSGNKPIAAYDPEGLAFAIGVNSDSIKLYDVRSYDKGPFAS 387
D P+N+ I GN PEG + + N + +
Sbjct: 427 DFESPDNITFDPAGRLWICTDGNGSTLGVTPEGNVYNLARN---DGNNGMLTEGPIRGEV 483
Query: 388 ---FKCPGERGCEWTGLKFSPDGKLILVS 413
P G E+TG FSPDG+ + V+
Sbjct: 484 KLFLTGP--EGSEFTGPTFSPDGRTLFVN 510
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 30.2 bits (68), Expect = 2.4
Identities = 15/74 (20%), Positives = 32/74 (43%), Gaps = 2/74 (2%)
Query: 80 DESGVLNLSCFPNISSLEIKRIDTEEIVLNPSCLIKLKEIKCMLAQKAVPKILKIVAKSE 139
SGV FP S+ +++ ++ EE PS L+E + +K + + ++
Sbjct: 71 LASGVPPEELFP--SAEKLEELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAK 128
Query: 140 NMCNLTKLDLPYNK 153
NM + ++ +
Sbjct: 129 NMAKMPQMIADWRA 142
>gnl|CDD|239616 cd03545, Rieske_RO_Alpha_OHBDO_like, Rieske non-heme iron oxygenase
(RO) family, Ortho-halobenzoate-1,2-dioxygenase
(OHBDO)-like subfamily, N-terminal Rieske domain of the
oxygenase alpha subunit; composed of the oxygenase alpha
subunits of OHBDO, salicylate 5-hydroxylase (S5H),
terephthalate 1,2-dioxygenase system (TERDOS) and
similar proteins. ROs comprise a large class of aromatic
ring-hydroxylating dioxygenases that enable
microorganisms to tolerate and utilize aromatic
compounds for growth. The oxygenase alpha subunit
contains an N-terminal Rieske domain with an [2Fe-2S]
cluster and a C-terminal catalytic domain with a
mononuclear Fe(II) binding site. The Rieske [2Fe-2S]
cluster accepts electrons from a reductase or ferredoxin
component and transfers them to the mononuclear iron for
catalysis. OHBDO converts 2-chlorobenzoate (2-CBA) to
catechol as well as 2,4-dCBA and 2,5-dCBA to
4-chlorocatechol, as part of the chlorobenzoate
degradation pathway. Although ortho-substituted
chlorobenzoates appear to be particularly recalcitrant
to biodegradation, several strains utilize 2-CBA and the
dCBA derivatives as a sole carbon and energy source. S5H
converts salicylate (2-hydroxybenzoate), a metabolic
intermediate of phenanthrene, to gentisate
(2,5-dihydroxybenzoate) as part of an alternate pathway
for naphthalene catabolism. S5H is a multicomponent
enzyme made up of NagGH (the oxygenase components),
NagAa (the ferredoxin reductase component), and NagAb
(the ferredoxin component). The oxygenase component is
made up of alpha (NagG) and beta (NagH) subunits. TERDOS
is present in gram-positive bacteria and proteobacteria
where it converts terephthalate (1,4-dicarboxybenzene)
to protocatechuate as part of the terephthalate
degradation pathway. The oxygenase component of TERDOS,
called TerZ, is a hetero-hexamer with 3 alpha
(TerZalpha) and 3 beta (TerZbeta) subunits.
Length = 150
Score = 29.3 bits (66), Expect = 2.5
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 440 NASPLEATFTPDSQFVASGSTDGQVHIWN---AERGYKVC 476
NA ++TF D+ V + + DG +H W A RG VC
Sbjct: 37 NAGDFKSTFVGDTPVVVTRAEDGSLHAWVNRCAHRGALVC 76
>gnl|CDD|199833 cd00216, PQQ_DH_like, PQQ-dependent dehydrogenases and related
proteins. This family is composed of dehydrogenases
with pyrroloquinoline quinone (PQQ) as a cofactor, such
as ethanol, methanol, and membrane-bound glucose
dehydrogenases. The alignment model contains an 8-bladed
beta-propeller, and the family also includes distantly
related proteins which are not enzymatically active and
do not bind PQQ.
Length = 434
Score = 30.3 bits (68), Expect = 3.0
Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 5/76 (6%)
Query: 400 GLKFSPDGKLILVSTNGSIIRLFDAFNGNCVQSLSGVLNNNASPLEATFTPDSQFVASGS 459
+ PD +L V NG I+ L D G V S + PL +T V G+
Sbjct: 307 PDSYDPDRELFYVPANGRIMAL-DPVTGVVVWEKSELHPLLGGPL-ST---AGNLVFVGT 361
Query: 460 TDGQVHIWNAERGYKV 475
+DG + +NA+ G K+
Sbjct: 362 SDGYLKAYNADTGEKL 377
>gnl|CDD|225752 COG3211, PhoX, Predicted phosphatase [General function prediction
only].
Length = 616
Score = 30.2 bits (68), Expect = 3.4
Identities = 13/26 (50%), Positives = 16/26 (61%), Gaps = 2/26 (7%)
Query: 388 FKCPGERGCEWTGLKFSPDGKLILVS 413
P GCE+TG FSPDGK + V+
Sbjct: 548 LTGPI--GCEFTGPCFSPDGKTLFVN 571
>gnl|CDD|216440 pfam01336, tRNA_anti, OB-fold nucleic acid binding domain. This
family contains OB-fold domains that bind to nucleic
acids. The family includes the anti-codon binding domain
of lysyl, aspartyl, and asparaginyl -tRNA synthetases
(See pfam00152). Aminoacyl-tRNA synthetases catalyze the
addition of an amino acid to the appropriate tRNA
molecule EC:6.1.1.-. This family also includes part of
RecG helicase involved in DNA repair. Replication factor
A is a heterotrimeric complex, that contains a subunit
in this family. This domain is also found at the
C-terminus of bacterial DNA polymerase III alpha chain.
Length = 75
Score = 27.6 bits (62), Expect = 4.0
Identities = 7/31 (22%), Positives = 15/31 (48%)
Query: 78 LADESGVLNLSCFPNISSLEIKRIDTEEIVL 108
L D +G + + F + K++ ++VL
Sbjct: 22 LRDGTGSIQVVLFKEEAEKLAKKLKEGDVVL 52
>gnl|CDD|233789 TIGR02225, recomb_XerD, tyrosine recombinase XerD. The phage
integrase family describes a number of recombinases with
tyrosine active sites that transiently bind covalently
to DNA. Many are associated with mobile DNA elements,
including phage, transposons, and phase variation loci.
This model represents XerD, one of two closely related
chromosomal proteins along with XerC (TIGR02224). XerC
and XerD are site-specific recombinases which help
resolve chromosome dimers to monomers for cell division
after DNA replication. In species with a large
chromosome and with homologs of XerD on other replicons,
the chomosomal copy was preferred for building this
model. This model does not detect all XerD, as some
apparent XerD examples score below the trusted and noise
cutoff scores. XerC and XerD interact with cell division
protein FtsK [DNA metabolism, DNA replication,
recombination, and repair].
Length = 291
Score = 29.5 bits (67), Expect = 4.0
Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 5/52 (9%)
Query: 251 EQGTSKNTINSKKYGVDLIKFTHSKTTAIHASTKVDAT--LRYL-SLHDNKY 299
E+G S+NT+ + Y DL KF +VD + +L L +
Sbjct: 10 ERGLSQNTLEA--YRRDLEKFLEFLEERGIDLEEVDRGDIVDFLAELKEAGL 59
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 29.1 bits (65), Expect = 5.7
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 216 VFRENTDKINSLDYSSDGSLLISCSDDDQIIIYDCEQGT 254
V + ++D+I SL+++ DGSLL + S D ++ I D GT
Sbjct: 163 VIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGT 201
>gnl|CDD|234377 TIGR03866, PQQ_ABC_repeats, PQQ-dependent catabolism-associated
beta-propeller protein. Members of this protein family
consist of seven repeats each of the YVTN family
beta-propeller repeat (see TIGR02276). Members occur
invariably as part of a transport operon that is
associated with PQQ-dependent catabolism of alcohols
such as phenylethanol.
Length = 300
Score = 28.9 bits (65), Expect = 7.3
Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 400 GLKFSPDGKLILVSTNGS-IIRLFDAFNGNCVQSL-SG 435
G+ FS DGKL+ V + S +++ D G + +L SG
Sbjct: 35 GITFSKDGKLLYVCASDSDTVQVIDPATGEVLHTLPSG 72
>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
This is a family of eukaryotic translation initiation
factors.
Length = 194
Score = 28.4 bits (64), Expect = 7.9
Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 7/44 (15%)
Query: 371 SIKLYDVRSYDKGPFASFKCPGERGCEWTGLKFSPDGKLILVST 414
I+ +DV++ K A+ + CEW SPDG+ L +T
Sbjct: 126 QIEFWDVKNKKK--IATAEASNATDCEW-----SPDGRYFLTAT 162
>gnl|CDD|180796 PRK07023, PRK07023, short chain dehydrogenase; Provisional.
Length = 243
Score = 28.4 bits (64), Expect = 8.0
Identities = 11/29 (37%), Positives = 17/29 (58%), Gaps = 2/29 (6%)
Query: 340 NNVGMIQLSGNKPIAAYDPEGLAFAIGVN 368
NN G ++ G P+A D +A A+G+N
Sbjct: 83 NNAGTVEPIG--PLATLDAAAIARAVGLN 109
>gnl|CDD|236331 PRK08667, PRK08667, hydrogenase membrane subunit; Validated.
Length = 644
Score = 28.9 bits (65), Expect = 8.2
Identities = 8/26 (30%), Positives = 15/26 (57%)
Query: 155 GSWDFELLHSLSSLESPHFVSLFVTF 180
GS++ L + + + S F+ LF+ F
Sbjct: 190 GSFELSPLGASAGILSLVFLLLFLGF 215
>gnl|CDD|79055 PRK12580, PRK12580, outer membrane protease; Reviewed.
Length = 312
Score = 28.6 bits (63), Expect = 8.5
Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 4/51 (7%)
Query: 433 LSGVLNNNASPLEATFTPDSQFVA--SGSTDGQVH--IWNAERGYKVCVLD 479
LSG N +S L +PDS VA +G G+ H +++AE G K+ LD
Sbjct: 14 LSGSANAASSQLIPNISPDSFTVAASTGMLSGKSHEMLYDAETGRKISQLD 64
>gnl|CDD|235424 PRK05347, PRK05347, glutaminyl-tRNA synthetase; Provisional.
Length = 554
Score = 28.5 bits (65), Expect = 8.9
Identities = 13/35 (37%), Positives = 15/35 (42%), Gaps = 12/35 (34%)
Query: 50 KSNSTNDFSVLE--------ETAKRLMC----LKL 72
K +S D S+LE E A R M LKL
Sbjct: 320 KQDSVIDMSMLESCIREDLNENAPRAMAVLDPLKL 354
>gnl|CDD|216201 pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal
region. This family is an alignment of the region to
the N-terminal side of the active site. The Prosite
motif does not correspond to this Pfam entry.
Length = 348
Score = 28.4 bits (64), Expect = 9.8
Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 5/58 (8%)
Query: 404 SPDGKLILVSTNGSIIRLFDAFNGNCVQSLSGV----LNNNASPLE-ATFTPDSQFVA 456
SPDGK +L++TN I + L L ++ A ++PD +A
Sbjct: 1 SPDGKYLLLATNYEKIWRHSYTASYYIYDLETGTVEPLPPGEGKIQYAKWSPDGHKLA 58
>gnl|CDD|223893 COG0823, TolB, Periplasmic component of the Tol biopolymer
transport system [Intracellular trafficking and
secretion].
Length = 425
Score = 28.6 bits (64), Expect = 9.8
Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 10/75 (13%)
Query: 402 KFSPDGKLILVSTNGSI---IRLFDAFNGNCVQSLSGVLNNNASPLEATFTPDSQ---FV 455
FSPDG + S++ I L D N + +G N SP +++PD F
Sbjct: 244 AFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRLTNGF-GINTSP---SWSPDGSKIVFT 299
Query: 456 ASGSTDGQVHIWNAE 470
+ Q+++++ E
Sbjct: 300 SDRGGRPQIYLYDLE 314
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.132 0.386
Gapped
Lambda K H
0.267 0.0728 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,778,750
Number of extensions: 2331545
Number of successful extensions: 1934
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1867
Number of HSP's successfully gapped: 60
Length of query: 515
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 414
Effective length of database: 6,457,848
Effective search space: 2673549072
Effective search space used: 2673549072
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.3 bits)