RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2110
(622 letters)
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 102 bits (253), Expect = 6e-23
Identities = 90/392 (22%), Positives = 155/392 (39%), Gaps = 32/392 (8%)
Query: 146 TCAFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKR 205
+ AF+ G LL G +DG I LWD +H L S +
Sbjct: 70 SIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSS------PDGN 123
Query: 206 GTLLAVGCNDGRIVLWDFLTRG-IAKIISAHMHPVCSISWSRNGYKIISAST-DNNVCIW 263
LLA DG + LWD T G + + + H V S+++S +G + S S+ D + +W
Sbjct: 124 SILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLW 183
Query: 264 DVLTGECEQKYRFPS-PILKVQFHPRNDHMFLVCPMRHAAVLVDTDGDHQIVPIDEDSDL 322
D+ TG+ + P+ + F P + L D +
Sbjct: 184 DLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSD 243
Query: 323 SIVASFDRRGQHVVTGNSKGKVLVLSCPAL-ELKASFKASTTAIKSIEFARRGNCLLVNT 381
S+V+SF G + +G+S G + + + L + ++++ S+ F+ G L +
Sbjct: 244 SVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGS 303
Query: 382 GDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLVNKTMWKKCCFSGDGEYVCAGSARQHALY 441
D +R+++ L GK ++ + V+ FS DG + +G + +
Sbjct: 304 SDGTVRLWD----LETGKLLSSLTLKGHEGPVS-----SLSFSPDGSLLVSGGSDDGTIR 354
Query: 442 IWEKSVGNLVKILHGTKGELLLDVVWHPVRPIIASISS-GVVSLWAQNQVENWSAFAPDF 500
+W+ G +K L G +L V + P +++S S+ G V LW
Sbjct: 355 LWDLRTGKPLKTLEG--HSNVLSVSFSPDGRVVSSGSTDGTVRLW----------DLSTG 402
Query: 501 KELDENVEYDERESEFDISDEDKSVAQGSDDQ 532
L + R + D S + KS+A GS D
Sbjct: 403 SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDN 434
Score = 78.6 bits (192), Expect = 2e-15
Identities = 101/474 (21%), Positives = 169/474 (35%), Gaps = 99/474 (20%)
Query: 40 GNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLVNKTMWKKCCFSGDG-EYVCA 98
G LL + D I++++ D + I+ L+ L + ++ K S DG + A
Sbjct: 77 GELLLSGSSDGTIKLWD--------LDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLA 128
Query: 99 GSARQHALYIWEKS-VGNLVKILHGTKGELLLDVVEFDGALDCISLAVTCAFNKRGTLLA 157
S+ + +W+ S G L++ L G + AF+ G LLA
Sbjct: 129 SSSLDGTVKLWDLSTPGKLIRTLEGHSESVTS-----------------LAFSPDGKLLA 171
Query: 158 VGCN-DGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKRGTLLAVGC-ND 215
G + DG I LWD T ++ H + AF+ G LL +D
Sbjct: 172 SGSSLDGTIKLWDLRTGKPLSTLAGH-----------TDPVSSLAFSPDGGLLIASGSSD 220
Query: 216 GRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIWDVLTGECEQKYR 275
G I LWD T + + + S+S +G + S S+D + +WD+ + +
Sbjct: 221 GTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTL 280
Query: 276 FP--SPILKVQFHPRNDHMFLVCPMRHAAVLVDTDGDHQIVPIDEDSDLSIVASFDRRGQ 333
S +L V F P D L S D
Sbjct: 281 SGHSSSVLSVAFSP-------------------------------DGKLLASGSSDGT-- 307
Query: 334 HVVTGNSKGKVLVLSCPALELKASFKASTTAIKSIEFARRGNCLLVNTG-DRLIRVYNHQ 392
V + + L+ S + K + S+ F+ G+ L+ D IR+++ +
Sbjct: 308 -VRLWDLETGKLLSSL-------TLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLR 359
Query: 393 EVLNAGKDGDPEPIQKLQDLVNKTMWKKCCFSGDGEYVCAGSARQHALYIWEKSVGNLVK 452
+ + FS DG V +GS + +W+ S G+L++
Sbjct: 360 TGKPLKTLEGHSNVLSVS------------FSPDGRVVSSGST-DGTVRLWDLSTGSLLR 406
Query: 453 ILHGTKGELLLDVVWHPVRPIIASISS-GVVSLWAQNQVENWSAFAPDFKELDE 505
L G + + + P +AS SS + LW +F+PD K L
Sbjct: 407 NLDGHTSRVTS-LDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPDGKVLAS 459
Score = 62.8 bits (151), Expect = 2e-10
Identities = 55/254 (21%), Positives = 103/254 (40%), Gaps = 43/254 (16%)
Query: 40 GNCLLVNTGDRLIRVYNHQEVLNAGK--DGDPEPIQKLQDLVNKTMWKKCCFSGDGEYVC 97
G+ L + D IR+++ + + + G + FS DG+ +
Sbjct: 253 GSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSV------------LSVAFSPDGKLLA 300
Query: 98 AGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVEFDGALDCISLAVTCAFNKRGTLLA 157
+GS+ + +W+ G L+ L + +G + +S F+ G+LL
Sbjct: 301 SGSS-DGTVRLWDLETGKLLS---------SLTLKGHEGPVSSLS------FSPDGSLLV 344
Query: 158 VGC-NDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKRGTLLAVGCNDG 216
G +DG I LWD T K + H S ++ +F+ G +++ G DG
Sbjct: 345 SGGSDDGTIRLWDLRTGKPLKTLEGH------------SNVLSVSFSPDGRVVSSGSTDG 392
Query: 217 RIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIWDVLTGECEQKYRF 276
+ LWD T + + + H V S+ +S +G + S S+DN + +WD+ T +
Sbjct: 393 TVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSP 452
Query: 277 PSPILKVQFHPRND 290
+L + +
Sbjct: 453 DGKVLASKSSDLSV 466
Score = 56.3 bits (134), Expect = 3e-08
Identities = 66/350 (18%), Positives = 133/350 (38%), Gaps = 39/350 (11%)
Query: 231 IISAHMHPVCSISWSRNGYKIISASTDNNVCIWDVLTGECE-QKYRF--PSPILKVQFHP 287
++ H + SI++S +G ++S S+D + +WD+ GE + S + K+
Sbjct: 60 LLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSS 119
Query: 288 RNDHMFLVCPMRH---AAVLVDTDGDHQIVPIDEDSDLSIVASFDRRGQHVVTGNS-KGK 343
+ + L+ + + I ++ S+ +F G+ + +G+S G
Sbjct: 120 PDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGT 179
Query: 344 VLVLSCPALELKASFKASTTAIKSIEFARRGNCLLVNTG-DRLIRVYNHQEVLNAGKDGD 402
+ + + ++ T + S+ F+ G L+ + D IR+++ L+ GK
Sbjct: 180 IKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWD----LSTGK--- 232
Query: 403 PEPIQKLQDLVNKTMWKKCCFSGDGEYVCAGSARQHALYIWE-KSVGNLVKILHGTKGEL 461
L + FS DG + A + + +W+ +S +L++ L G
Sbjct: 233 ----LLRSTLSGHSDSVVSSFSPDGSLL-ASGSSDGTIRLWDLRSSSSLLRTLSGHSSS- 286
Query: 462 LLDVVWHPVRPIIASISS-GVVSLW--------------AQNQVENWSAFAPDFKELDEN 506
+L V + P ++AS SS G V LW + +F+PD L
Sbjct: 287 VLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSG 346
Query: 507 VEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEV 556
D +D+ + + L + + V + S+D V
Sbjct: 347 GSDDGTIRLWDLR--TGKPLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTV 394
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 89.3 bits (222), Expect = 1e-19
Identities = 78/315 (24%), Positives = 130/315 (41%), Gaps = 52/315 (16%)
Query: 145 VTC-AFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFN 203
VTC AF+ G LLA G DG I +WD T + + + H P A +
Sbjct: 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-----------VRDVAAS 60
Query: 204 KRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIW 263
GT LA G +D I LWD T + ++ H V S+++S +G + S+S D + +W
Sbjct: 61 ADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW 120
Query: 264 DVLTGECEQKYRF-PSPILKVQFHPRNDHMFLVCPMRHAAVLVDTDGDHQIVPIDE---- 318
DV TG+C R + V F P D F+ DG ++ +
Sbjct: 121 DVETGKCLTTLRGHTDWVNSVAFSP--DGTFVAS--------SSQDGTIKLWDLRTGKCV 170
Query: 319 ------DSDLSIVASFDRRGQHVVTGNSKGKVLVLSCPALELKASFKASTTAIKSIEFAR 372
+++ VA F G+ +++ +S G + + + + + + S+ F+
Sbjct: 171 ATLTGHTGEVNSVA-FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP 229
Query: 373 RGNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQ---DLVNKTMWKKCCFSGDGEY 429
G L + D IRV++ + E +Q L + V +S DG+
Sbjct: 230 DGYLLASGSEDGTIRVWDLR---------TGECVQTLSGHTNSVTS-----LAWSPDGKR 275
Query: 430 VCAGSARQHALYIWE 444
+ +GSA + IW+
Sbjct: 276 LASGSA-DGTIRIWD 289
Score = 85.1 bits (211), Expect = 3e-18
Identities = 52/203 (25%), Positives = 81/203 (39%), Gaps = 32/203 (15%)
Query: 87 CCFSGDGEYVCAGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVEFDGALDCISLAVT 146
FS DG + + S R + +W+ G + L G V
Sbjct: 99 VAFSPDGRILSSSS-RDKTIKVWDVETGKCLTTLRGHTD------------------WVN 139
Query: 147 C-AFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKR 205
AF+ GT +A DG I LWD T ++ H ++ + AF+
Sbjct: 140 SVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGH--------TGEVN---SVAFSPD 188
Query: 206 GTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIWDV 265
G L +DG I LWD T + H + V S+++S +GY + S S D + +WD+
Sbjct: 189 GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDL 248
Query: 266 LTGECEQKYR-FPSPILKVQFHP 287
TGEC Q + + + + P
Sbjct: 249 RTGECVQTLSGHTNSVTSLAWSP 271
Score = 64.7 bits (158), Expect = 2e-11
Identities = 51/302 (16%), Positives = 100/302 (33%), Gaps = 60/302 (19%)
Query: 230 KIISAHMHPVCSISWSRNGYKIISASTDNNVCIWDVLTGECEQKYRFPS-PILKVQFHPR 288
+ + H V +++S +G + + S D + +WD+ TGE + + + P+ V
Sbjct: 3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA- 61
Query: 289 NDHMFLVCPMRHAAVLVDTDGDHQIVP----------IDEDSDLSIVAS----------- 327
+ L D + + + D I++S
Sbjct: 62 DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD 121
Query: 328 ---------------------FDRRGQHVVTGNSKGKVLVLSCPALELKASFKASTTAIK 366
F G V + + G + + + A+ T +
Sbjct: 122 VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVN 181
Query: 367 SIEFARRGNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLVNKTMWKKCC-FSG 425
S+ F+ G LL ++ D I++++ + L L FS
Sbjct: 182 SVAFSPDGEKLLSSSSDGTIKLWDLST------------GKCLGTLRGHENGVNSVAFSP 229
Query: 426 DGEYVCAGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVWHPVRPIIASISS-GVVSL 484
DG + +GS + +W+ G V+ L G + + W P +AS S+ G + +
Sbjct: 230 DGYLLASGSEDGT-IRVWDLRTGECVQTLSGHTNS-VTSLAWSPDGKRLASGSADGTIRI 287
Query: 485 WA 486
W
Sbjct: 288 WD 289
Score = 52.0 bits (125), Expect = 4e-07
Identities = 38/137 (27%), Positives = 55/137 (40%), Gaps = 31/137 (22%)
Query: 87 CCFSGDGEYVCAGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVEFDGALDCISLAVT 146
FS DGE + + S+ +W+ S G + L G + + +
Sbjct: 183 VAFSPDGEKLLSSSSDGTI-KLWDLSTGKCLGTLRGHENGV-----------------NS 224
Query: 147 CAFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTC-AFNKR 205
AF+ G LLA G DG I +WD T + +S H + VT A++
Sbjct: 225 VAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------------VTSLAWSPD 272
Query: 206 GTLLAVGCNDGRIVLWD 222
G LA G DG I +WD
Sbjct: 273 GKRLASGSADGTIRIWD 289
Score = 38.9 bits (91), Expect = 0.005
Identities = 32/134 (23%), Positives = 56/134 (41%), Gaps = 16/134 (11%)
Query: 354 LKASFKASTTAIKSIEFARRGNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLV 413
L+ + K T + + F+ G L +GD I+V+ D E + L+ L
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVW------------DLETGELLRTLK 48
Query: 414 NKTMWKKCC-FSGDGEYVCAGSARQHALYIWEKSVGNLVKILHGTKGELLLDVVWHPVRP 472
T + S DG Y+ +GS+ + +W+ G V+ L G + V + P
Sbjct: 49 GHTGPVRDVAASADGTYLASGSS-DKTIRLWDLETGECVRTLTGHTSY-VSSVAFSPDGR 106
Query: 473 IIASISS-GVVSLW 485
I++S S + +W
Sbjct: 107 ILSSSSRDKTIKVW 120
Score = 38.9 bits (91), Expect = 0.005
Identities = 30/132 (22%), Positives = 47/132 (35%), Gaps = 33/132 (25%)
Query: 40 GNCLLVNTGDRLIRVYNHQEVLNAGKDGDPEPIQKLQDLVNKTMWKKCC-FSGDGEYVCA 98
G LL ++ D I++++ + L L FS DG + +
Sbjct: 189 GEKLLSSSSDGTIKLWDLST------------GKCLGTLRGHENGVNSVAFSPDGYLLAS 236
Query: 99 GSARQHALYIWEKSVGNLVKILHGTKGELLLDVVEFDGALDCISLAVTC-AFNKRGTLLA 157
GS + +W+ G V+ L G +VT A++ G LA
Sbjct: 237 GSEDGT-IRVWDLRTGECVQTLSGHTN------------------SVTSLAWSPDGKRLA 277
Query: 158 VGCNDGRIVLWD 169
G DG I +WD
Sbjct: 278 SGSADGTIRIWD 289
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 45.0 bits (107), Expect = 1e-06
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 230 KIISAHMHPVCSISWSRNGYKIISASTDNNVCIWD 264
+ + H PV S+++S +G + S S D V +WD
Sbjct: 5 RTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 32.3 bits (74), Expect = 0.034
Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 144 AVTC-AFNKRGTLLAVGCNDGRIVLWD 169
VT AF+ G LLA G +DG + +WD
Sbjct: 13 PVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 32.3 bits (74), Expect = 0.034
Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 197 AVTC-AFNKRGTLLAVGCNDGRIVLWD 222
VT AF+ G LLA G +DG + +WD
Sbjct: 13 PVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 26.2 bits (58), Expect = 5.6
Identities = 11/38 (28%), Positives = 20/38 (52%)
Query: 353 ELKASFKASTTAIKSIEFARRGNCLLVNTGDRLIRVYN 390
+L + K T + S+ F+ GN L + D +RV++
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 43.8 bits (104), Expect = 3e-06
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 230 KIISAHMHPVCSISWSRNGYKIISASTDNNVCIWD 264
K + H PV S+++S +G + S S D + +WD
Sbjct: 6 KTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 32.7 bits (75), Expect = 0.030
Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 144 AVTC-AFNKRGTLLAVGCNDGRIVLWD 169
VT AF+ G LA G +DG I LWD
Sbjct: 14 PVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 32.7 bits (75), Expect = 0.030
Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 197 AVTC-AFNKRGTLLAVGCNDGRIVLWD 222
VT AF+ G LA G +DG I LWD
Sbjct: 14 PVTSVAFSPDGKYLASGSDDGTIKLWD 40
>gnl|CDD|222072 pfam13360, PQQ_2, PQQ-like domain. This domain contains several
repeats of the PQQ repeat.
Length = 234
Score = 35.8 bits (83), Expect = 0.040
Identities = 26/147 (17%), Positives = 43/147 (29%), Gaps = 14/147 (9%)
Query: 204 KRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNVCIW 263
G + A+ G+ VLW + V +G ++ A+ +
Sbjct: 1 ADGVVAALDAATGK-VLW---RVDLGGTALGGGVAV-------DGGRLYVATGKGELVAL 49
Query: 264 DVLTGECEQKYRFPSPILKVQFHPRNDHMFLVCPMRHAAVLVDTDGD--HQIVPIDEDSD 321
D TG+ + IL + +V L G
Sbjct: 50 DAATGKLLWRKDLSGEILGAPTV-AGGVVVVVTADGSLYALDAETGKLLWSYQRSLPPLT 108
Query: 322 LSIVASFDRRGQHVVTGNSKGKVLVLS 348
L +S G V+ G S GK++ L
Sbjct: 109 LRGSSSPAIVGDTVIVGFSSGKLVALD 135
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 35.2 bits (81), Expect = 0.095
Identities = 28/109 (25%), Positives = 40/109 (36%), Gaps = 14/109 (12%)
Query: 498 PDFKELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVD 557
+ DE E ES+ + S ED D+++E+ + + S SSDEE
Sbjct: 165 ESSDKDDEEESESEDESKSEESAED----DSDDEEEEDSDSEDYSQYDGMLVDSSDEEEG 220
Query: 558 DSKVLEFLPIAPDVEDPE--------DGPGSTSAPVTSSPPSKTTKYKS 598
+ + D + E S S SSPPSK K K
Sbjct: 221 EEA--PSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKK 267
Score = 34.4 bits (79), Expect = 0.18
Identities = 18/96 (18%), Positives = 30/96 (31%), Gaps = 11/96 (11%)
Query: 502 ELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSKV 561
E D + E +E D S D + S D++E E + + SDE +
Sbjct: 188 EDDSDDEEEEDSDSEDYSQYDGMLV-DSSDEEEGEEAPSINYNEDTSESESDESDSEIS- 245
Query: 562 LEFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYK 597
+ + +S P KT+
Sbjct: 246 ---ESRSVSDSE------ESSPPSKKPKEKKTSSTF 272
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 33.5 bits (76), Expect = 0.37
Identities = 22/68 (32%), Positives = 27/68 (39%), Gaps = 11/68 (16%)
Query: 491 ENWSAFA---PDFKELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVA 547
E WSA PD VE D +SE D ED SDD+ + ID +
Sbjct: 689 EIWSALVKSRPD-------VEDDSDDSELD-FAEDDFSDSTSDDEPKLDAIDDEDAKSEG 740
Query: 548 AFCSSDEE 555
+ S EE
Sbjct: 741 SQESDQEE 748
>gnl|CDD|204739 pfam11768, DUF3312, Protein of unknown function (DUF3312). This is
a eukaryotic family of uncharacterized proteins. This
family shows similarity to WD40 repeat proteins.
Length = 544
Score = 33.2 bits (76), Expect = 0.46
Identities = 18/81 (22%), Positives = 32/81 (39%), Gaps = 13/81 (16%)
Query: 142 SLAVTCAFNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCA 201
S +CA + L +GC D +VL+D RG+ + A + P A
Sbjct: 259 SQVCSCARSPDEEKLLLGCIDSSLVLYD-EHRGVTLLAQAAVIPHL------------IA 305
Query: 202 FNKRGTLLAVGCNDGRIVLWD 222
++ ++ V G + +D
Sbjct: 306 WHPDSAIVLVANEQGELQCFD 326
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 32.7 bits (74), Expect = 0.62
Identities = 24/80 (30%), Positives = 31/80 (38%), Gaps = 16/80 (20%)
Query: 501 KELDENVEYDERESEFDIS---DEDKSVAQGSDDQDEELEIDVTSVEPV----------- 546
KE +E +E + E D DED+ DD+D+E E D S
Sbjct: 43 KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETD 102
Query: 547 --AAFCSSDEEVDDSKVLEF 564
A F SD+E DD F
Sbjct: 103 NEAGFADSDDESDDGSEYVF 122
>gnl|CDD|234748 PRK00402, PRK00402, 3-isopropylmalate dehydratase large subunit;
Reviewed.
Length = 418
Score = 32.5 bits (75), Expect = 0.71
Identities = 16/58 (27%), Positives = 26/58 (44%), Gaps = 15/58 (25%)
Query: 496 FAPDFKELDENVEY--DERESEFDI--SDEDKSVAQGSDDQDEELEIDVTSVEPVAAF 549
FAPD ++ +EY + ++ SDED + EID++ +EP A
Sbjct: 226 FAPD----EKTLEYLKERAGRDYKPWKSDEDAEYEE-------VYEIDLSKLEPQVAA 272
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 32.7 bits (75), Expect = 0.76
Identities = 14/85 (16%), Positives = 27/85 (31%), Gaps = 3/85 (3%)
Query: 518 ISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSKVLEFLPIAPDVEDPED- 576
I++ D +VAQ + E + + + E V + P AP V +
Sbjct: 931 ITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAE 990
Query: 577 --GPGSTSAPVTSSPPSKTTKYKSF 599
+ V + + T +
Sbjct: 991 VETVTAVEPEVAPAQVPEATVEHNH 1015
>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
kinases [Cell motility and secretion / Signal
transduction mechanisms].
Length = 716
Score = 32.3 bits (74), Expect = 0.81
Identities = 14/106 (13%), Positives = 39/106 (36%), Gaps = 1/106 (0%)
Query: 498 PDFKELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVD 557
PD ++L+ ++E E +++ + + E E+++ +VE A E +
Sbjct: 223 PDLEDLEAEAAFEESEVVLATEQDEELIRDVLELVVEAEELEIAAVELDAELLELAES-E 281
Query: 558 DSKVLEFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYKSFDIQL 603
+ A + + + + + ++P +K +
Sbjct: 282 QAADDVLAAQAEPLAEKSAAEAAKLSALEAAPAAKAAAAAAGASSS 327
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 32.3 bits (74), Expect = 0.82
Identities = 21/96 (21%), Positives = 33/96 (34%), Gaps = 22/96 (22%)
Query: 508 EYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSKVLEFLPI 567
+ +E E D DE+ DD D E E + + DEE +DS
Sbjct: 322 DEEEEEDGVDDEDEE------DDDDDLEEEEEDVDLS----DEEEDEEDEDS-------- 363
Query: 568 APDVED--PEDGPGSTSAPVTSSPPSKTTKYKSFDI 601
D ED E+ S+ +++ +F
Sbjct: 364 --DDEDDEEEEEEEKEKKKKKSAESTRSELPFTFPC 397
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 31.2 bits (71), Expect = 1.4
Identities = 12/47 (25%), Positives = 27/47 (57%), Gaps = 10/47 (21%)
Query: 500 FKELDENVEY--------DERESEFDISDEDKSVAQGSDDQDEELEI 538
F+E +++ E+ +E +S+FD S++D+ D+++ E E+
Sbjct: 34 FEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPE--SDDEEEGEKEL 78
>gnl|CDD|233530 TIGR01686, FkbH, FkbH-like domain. This model describes a domain
of a family of proteins of unknown overall function. One
of these, however, is a modular polyketide synthase 4800
amino acids in length from Streptomyces avermilitis in
which this domain is the C-terminal segment. By
contrast, the FkbH protein from Streptomyces
hygroscopicus aparently contains only this domain. The
remaining members of the family all contain an
additional N-terminal domain of between 200 and 275
amino acids which show less than 20% identity to one
another. It seems likely then that these proteins are
involved in disparate functions, probably the
biosynthesis of different natural products. For
instance, the FkbH gene is found in a gene cluster
believed to be responsible for the biosynthesis of
unususal "PKS extender units" in the ascomycin pathway.
This domain is composed of two parts, the first of which
is a member of subfamily IIIC (TIGR01681) of the
haloacid dehalogenase (HAD) superfamily of
aspartate-nucleophile hydrolases. All of the
characterized enzymes within subfamily III are
phosphatases, and all of the active site residues
characteristic of HAD-superfamily phosphatases are
present in this domain. The C-terminal portion of this
domain is unique to this family (by BLAST).
Length = 320
Score = 31.4 bits (71), Expect = 1.5
Identities = 12/59 (20%), Positives = 20/59 (33%)
Query: 202 FNKRGTLLAVGCNDGRIVLWDFLTRGIAKIISAHMHPVCSISWSRNGYKIISASTDNNV 260
K+G LLA+ + R I+ A SI+W + + N+
Sbjct: 43 LKKQGFLLALASKNDEDDAKKVFERRKDFILQAEDFDARSINWGPKSESLRKIAKKLNL 101
>gnl|CDD|213306 cd05940, FATP_FACS, Fatty acid transport proteins (FATP) play dual
roles as fatty acid transporters and its activation
enzymes. Fatty acid transport protein (FATP) transports
long-chain or very-long-chain fatty acids across the
plasma membrane. FATPs also have fatty acid CoA
synthetase activity, thus playing dual roles as fatty
acid transporters and its activation enzymes. At least
five copies of FATPs are identified in mammalian cells.
This family also includes prokaryotic FATPs. FATPs are
the key players in the trafficking of exogenous fatty
acids into the cell and in intracellular fatty acid
homeostasis.
Length = 444
Score = 31.1 bits (71), Expect = 1.7
Identities = 17/64 (26%), Positives = 27/64 (42%), Gaps = 9/64 (14%)
Query: 161 NDGRIVLWD-FLTR-GIAKIISAHMHPEFDGALDCISLAVTCAFNKRGTLLAVGCNDGRI 218
N R +WD F R G+ +I+ EF G+ + FNK G + + +
Sbjct: 204 NGLRPDIWDEFKERFGVPRIV------EFYGSTEGNV-GFINLFNKPGAVGRLPPAAIAV 256
Query: 219 VLWD 222
V +D
Sbjct: 257 VKYD 260
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 30.8 bits (69), Expect = 2.2
Identities = 13/39 (33%), Positives = 24/39 (61%)
Query: 501 KELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEID 539
K++DE+VE ESE +E + V ++ +EEL+++
Sbjct: 144 KKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLE 182
>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
Length = 601
Score = 30.7 bits (70), Expect = 2.5
Identities = 14/75 (18%), Positives = 24/75 (32%), Gaps = 16/75 (21%)
Query: 563 EFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYK----------------SFDIQLEGA 606
LP+ VE + G T P T P T+ + + +
Sbjct: 186 SLLPVPIRVEGEKGGVNETPPPWTRDYPDPETRREALLAYGEELLGFTPLDVIPLDVPSG 245
Query: 607 ACQGIGIILPTGVNL 621
+G+ +LP V+
Sbjct: 246 GLEGVAYVLPYAVSP 260
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 30.4 bits (68), Expect = 4.1
Identities = 19/106 (17%), Positives = 33/106 (31%), Gaps = 9/106 (8%)
Query: 504 DENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPV-----AAFCSSDEEVDD 558
D + DE E D+ +D+ D E E + + +V
Sbjct: 3949 DRQEKEDEEEMSDDVGIDDEI----QPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSK 4004
Query: 559 SKVLEFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYKSFDIQLE 604
LE + + E+ E+ P+ P + DIQ +
Sbjct: 4005 DSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQD 4050
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 29.9 bits (67), Expect = 5.3
Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 5/104 (4%)
Query: 162 DGRIVLWDFLTRGIAKIISAHMHPEFDGALDCISLAVTCAFNKRGTLLAVGCNDGRIVLW 221
DG I+ W G+ + IS + G + + V+ + L + G D + +W
Sbjct: 97 DGTIMGWGIPEEGLTQNISDPI-VHLQGHTKKVGI-VSFHPSAMNVLASAGA-DMVVNVW 153
Query: 222 DFLTRGIAK-IISAHMHPVCSISWSRNGYKIISASTDNNVCIWD 264
D + RG A +I H + S+ W+ +G + + S D + I D
Sbjct: 154 D-VERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIID 196
>gnl|CDD|130410 TIGR01343, hacA_fam, homoaconitate hydratase family protein. This
model represents a subfamily of proteins consisting of
aconitase, homoaconitase, 3-isopropylmalate dehydratase,
and uncharacterized proteins. The majority of the
members of this family have been designated as
3-isopropylmalate dehydratase large subunit (LeuC) in
microbial genome annotation, but the only characterized
member is Thermus thermophilus homoaconitase, an enzyme
of a non-aspartate pathway of Lys biosynthesis.
Length = 412
Score = 29.7 bits (67), Expect = 5.4
Identities = 16/65 (24%), Positives = 32/65 (49%), Gaps = 4/65 (6%)
Query: 507 VEYDERESEF--DISDEDKSVAQGSDDQD--EELEIDVTSVEPVAAFCSSDEEVDDSKVL 562
+E DE+ + + E V + +D + +E+EID + +EPV A + + V +
Sbjct: 223 IEPDEKTRAYLKERYKEPFRVYKSDEDAEYAKEVEIDASQLEPVVAAPHNVDNVHPVSEV 282
Query: 563 EFLPI 567
E + +
Sbjct: 283 EGIEV 287
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 29.2 bits (66), Expect = 5.6
Identities = 23/98 (23%), Positives = 39/98 (39%), Gaps = 8/98 (8%)
Query: 501 KELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSK 560
+ L+ + + DE E DED SDD+ E IDV S + + + S DEE D
Sbjct: 90 QGLESDDDDDEEEEWEVEEDED------SDDEGE--WIDVESDKEIESSDSEDEEEKDEA 141
Query: 561 VLEFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTTKYKS 598
+ + + ED + + K ++ +
Sbjct: 142 AKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELAT 179
>gnl|CDD|179566 PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated.
Length = 380
Score = 29.4 bits (66), Expect = 5.8
Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 6/36 (16%)
Query: 477 ISSGVVSLWAQNQVENWSAFAPDFKELDENVEYDER 512
+G+ L A ENW A+ F E D + Y ER
Sbjct: 12 FVAGIRELMAS---ENWEAY---FAECDRDSVYPER 41
>gnl|CDD|237212 PRK12808, PRK12808, flagellin; Provisional.
Length = 476
Score = 29.5 bits (66), Expect = 6.2
Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 6/62 (9%)
Query: 488 NQVENWSAFAPDFKELDENVEYDERESEFD---ISDEDKSVA---QGSDDQDEELEIDVT 541
N +N +A +F EL + + Y ++F+ + + + S+ S D D+++ ID+
Sbjct: 102 NTDKNQAALQKEFAELQKQITYIADNTQFNDKNLLNSNSSINIQTLDSSDGDKQIGIDLK 161
Query: 542 SV 543
S
Sbjct: 162 SA 163
>gnl|CDD|220430 pfam09829, DUF2057, Uncharacterized protein conserved in bacteria
(DUF2057). This domain, found in various prokaryotic
proteins, has no known function.
Length = 189
Score = 28.4 bits (64), Expect = 7.6
Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 9/61 (14%)
Query: 298 MRHAAVLVDTDGDHQIV-----PIDEDSDLSIVASFDRRGQHVVTGNSKGKVLVLSCPAL 352
+ L DG++QIV D D +V S +VT ++ + L LS P +
Sbjct: 24 FSGTSSLELPDGENQIVVRYEKLFDSGGDRELVKS----DPIIVTFDASDQDLTLSLPKI 79
Query: 353 E 353
Sbjct: 80 R 80
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 28.5 bits (64), Expect = 7.9
Identities = 14/49 (28%), Positives = 25/49 (51%)
Query: 498 PDFKELDENVEYDERESEFDISDEDKSVAQGSDDQDEELEIDVTSVEPV 546
D ++ DE E + E + + SDE++ + + + E EI S +PV
Sbjct: 82 GDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPSWKPV 130
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 29.0 bits (65), Expect = 8.1
Identities = 30/102 (29%), Positives = 40/102 (39%), Gaps = 27/102 (26%)
Query: 504 DENVEYDERESEFDISDEDKS-VAQGSDDQDEELEIDVTSVEPVAAFCSSDEEVDDSKVL 562
D N + + E + SDE + A+ +D + EE E+D AA S D E D+S
Sbjct: 242 DNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMD-------AAEASEDSESDESD-- 292
Query: 563 EFLPIAPDVEDPEDGPGSTSAPVTSSPPSKTT----KYKSFD 600
ED E PG + P T P YK F
Sbjct: 293 ---------EDTET-PGEDARPAT---PFTELMEEVDYKVFT 321
>gnl|CDD|225115 COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal
transduction mechanisms].
Length = 890
Score = 29.2 bits (66), Expect = 9.2
Identities = 11/41 (26%), Positives = 18/41 (43%)
Query: 488 NQVENWSAFAPDFKELDENVEYDERESEFDISDEDKSVAQG 528
N +EN +AP E+ N + F + DE + +G
Sbjct: 782 NLLENALKYAPPGSEIRINAGVERENVVFSVIDEGPGIPEG 822
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.417
Gapped
Lambda K H
0.267 0.0728 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 31,402,273
Number of extensions: 3064770
Number of successful extensions: 2850
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2778
Number of HSP's successfully gapped: 53
Length of query: 622
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 519
Effective length of database: 6,369,140
Effective search space: 3305583660
Effective search space used: 3305583660
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.7 bits)