RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9770
(811 letters)
>gnl|CDD|190637 pfam03451, HELP, HELP motif. The founding member of the EMAP
protein family is the 75 kDa Echinoderm
Microtubule-Associated Protein, so-named for its
abundance in sea urchin, sand dollar and starfish eggs.
The Hydrophobic EMAP-Like Protein (HELP) motif was
identified initially in the human EMAP-Like Protein 2
(EML2) and subsequently in the entire EMAP Protein
family. The HELP motif is approximately 60-70 amino
acids in length and is conserved amongst metazoans.
Although the HELP motif is hydrophobic, there is no
evidence that EMAP-Like Proteins are
membrane-associated. All members of the EMAP-Like
Protein family, identified to-date, are constructed with
an amino terminal HELP motif followed by a WD domain. In
C. elegans, EMAP-Like Protein-1 (ELP-1) is required for
touch sensation indicating that ELP-1 may play a role in
mechanosensation. The localization of ELP-1 to
microtubules and adhesion sites implies that ELP-1 may
transmit forces between the body surface and the touch
receptor neurons.
Length = 77
Score = 122 bits (309), Expect = 2e-33
Identities = 52/78 (66%), Positives = 63/78 (80%), Gaps = 1/78 (1%)
Query: 211 EESSLRMYLRGRPINLYAPSDLAESYDITKVAAPPQNKLKLDWVYGYRGKDCRSNLYLLP 270
E ++M++RGRP+ +Y PSD E+YD+ PP KLKL+WVYGYRG+DCR+NLYLLP
Sbjct: 1 ESGYVKMFIRGRPVTMYIPSD-VENYDLEADKDPPTKKLKLEWVYGYRGRDCRANLYLLP 59
Query: 271 TGEIVYFVAAVAVLYNVE 288
TGEIVYFVAAV VLYNVE
Sbjct: 60 TGEIVYFVAAVVVLYNVE 77
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 97.8 bits (244), Expect = 3e-22
Identities = 74/296 (25%), Positives = 130/296 (43%), Gaps = 19/296 (6%)
Query: 445 IRDKPKYVTCLAFTQGGEVL-SGDSNGNIIVWTRGSNTISKFIRNVHEGSVFSICVLKEG 503
++ VTC+AF+ G++L +G +G I VW + + + ++ H G V + +G
Sbjct: 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADG 63
Query: 504 SIISAGGKDGRLLQFDSNLQPTGYEAQIAEHLGGIRTVSEGRGSQLIVGTTR-NCILVGS 562
+ +++G D + +D + H + +V+ +++ ++R I V
Sbjct: 64 TYLASGSSDKTIRLWDLETGECVRT--LTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD 121
Query: 563 LNLG-FSPVVLGHTDELWALAAHPSLSQFLTAG-FDRIVQLWDSMSHSVVWS-KDIAEQA 619
+ G + GHTD + ++A P F+ + D ++LWD + V + +
Sbjct: 122 VETGKCLTTLRGHTDWVNSVAFSPD-GTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEV 180
Query: 620 QSATFSPDGSIIIVGCIAGR---WLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLALG 676
S FSPDG ++ G W + + H + + V FSPDG LLA G
Sbjct: 181 NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN---GVNSVAFSPDGYLLASG 237
Query: 677 SRDNYIYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWN 732
S D I V+ D R V +GH++ +T L WS D + S S D + W+
Sbjct: 238 SEDGTIRVW----DLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
Score = 95.5 bits (238), Expect = 1e-21
Identities = 64/318 (20%), Positives = 119/318 (37%), Gaps = 38/318 (11%)
Query: 293 RHYLGHTDDIKCLSIHPNKLLIATGQVAGHDAREGKPHIRIWNSVSLFTVSIIGIGEFER 352
R GHT + C++ P+ L+ATG D I++W+ + + G
Sbjct: 3 RTLKGHTGGVTCVAFSPDGKLLATG---SGDGT-----IKVWD-LETGELLRTLKGH-TG 52
Query: 353 SISCLSFSKADGGTFLCAVDEGNDYNISVWDWQKSDRGHKITETKCSVDTVVAAEFHPLD 412
+ ++ S ADG +D I +WD + + +T V +V F P
Sbjct: 53 PVRDVAAS-ADGTYLASG---SSDKTIRLWDLETGECVRTLTGHTSYVSSVA---FSPDG 105
Query: 413 RNNIVTCGKSH--INFWTIDAGGTLYKKQGIFEMIRDKPKYVTCLAFTQGGEVL-SGDSN 469
R I++ I W ++ G L +R +V +AF+ G + S +
Sbjct: 106 R--ILSSSSRDKTIKVWDVETGKCLTT-------LRGHTDWVNSVAFSPDGTFVASSSQD 156
Query: 470 GNIIVWTRGSNTISKFIRN--VHEGSVFSICVLKEGSIISAGGKDGRLLQFDSNLQPTGY 527
G I +W K + H G V S+ +G + + DG + +D +
Sbjct: 157 GTIKLW---DLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL- 212
Query: 528 EAQIAEHLGGIRTVSEGRGSQLIVGTTRNCILVG--SLNLGFSPVVLGHTDELWALAAHP 585
+ H G+ +V+ L+ + + + + GHT+ + +LA P
Sbjct: 213 -GTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
Query: 586 SLSQFLTAGFDRIVQLWD 603
+ + D +++WD
Sbjct: 272 DGKRLASGSADGTIRIWD 289
Score = 87.0 bits (216), Expect = 9e-19
Identities = 70/292 (23%), Positives = 113/292 (38%), Gaps = 46/292 (15%)
Query: 529 AQIAEHLGGIRTVSEGRGSQLIV-GTTRNCILVGSLNLGFSPVVL-GHTDELWALAAHPS 586
+ H GG+ V+ +L+ G+ I V L G L GHT + +AA
Sbjct: 3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD 62
Query: 587 LSQFLTAGFDRIVQLWDSMSHSVVW-----SKDIAEQAQSATFSPDGSIIIVGCIAGRWL 641
+ + D+ ++LWD + V + ++ S FSPDG I+
Sbjct: 63 GTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVS----SVAFSPDGRILSSSSRDKTIK 118
Query: 642 VMDSETRE---AYSIHTDGNEPIQVVKFSPDGKLLALGSRDNYIYVYEVSGDSRKYSRVG 698
V D ET + HTD + V FSPDG +A S+D I ++ D R V
Sbjct: 119 VWDVETGKCLTTLRGHTD---WVNSVAFSPDGTFVASSSQDGTIKLW----DLRTGKCVA 171
Query: 699 RCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWNAGTCRQIPQSSQLRDVVWESNSCTLT 758
TGH+ + + +S D + S+S D + W+ T + + TL
Sbjct: 172 TLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL---------------GTLR 216
Query: 759 FTTIGIWPEGADGTDVNSCERSHDARVIATGDDFGKLKLYSYPAVQPKLCLS 810
VNS S D ++A+G + G ++++ + LS
Sbjct: 217 ----------GHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS 258
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 100 bits (248), Expect = 4e-22
Identities = 101/464 (21%), Positives = 181/464 (39%), Gaps = 44/464 (9%)
Query: 270 PTGEIVYFVAAVAVLYNVEEQTQRHYLGHTDDIKCLSIHPNKLLIATGQVAGHDAREGKP 329
I+ L ++ + + GH D I ++ P+ L+ +G G
Sbjct: 36 SESGILLLALLSDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGT------- 88
Query: 330 HIRIWNSVSLFTVSIIGIGEFERSISCLSFSKADGGTFLCAVDEGNDYNISVWDWQKSDR 389
I++W+ + + G + S+S L+ S DG + L A D + +WD S
Sbjct: 89 -IKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLAS-SSLDGTVKLWDL--STP 144
Query: 390 GHKITETKCSVDTVVAAEFHPLDRNNIVTCGKSH-INFWTIDAGGTLYKKQGIFEMIRDK 448
G I + ++V + F P + I W + G L +
Sbjct: 145 GKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLST-------LAGH 197
Query: 449 PKYVTCLAFTQGGE--VLSGDSNGNIIVWTRGSNTISKFIRNVHEGSVFSICVLKEGSII 506
V+ LAF+ G + SG S+G I +W + + + + H SV S +GS++
Sbjct: 198 TDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVS-SFSPDGSLL 256
Query: 507 SAGGKDGRLLQFDSNLQPTGYEAQIAEHLGGIRTVS-EGRGSQLIVGTTRNCILVGSLNL 565
++G DG + +D + ++ H + +V+ G L G++ + + L
Sbjct: 257 ASGSSDGTIRLWDLRSSSSLL-RTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLET 315
Query: 566 GFSPVVL---GHTDELWALAAHPSLSQFLTAG-FDRIVQLWDSMSHSVVWSKDIAEQAQS 621
G L GH + +L+ P S ++ G D ++LWD + + + + S
Sbjct: 316 GKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLS 375
Query: 622 ATFSPDGSIIIVGCIAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLALGSRDNY 681
+FSPDG ++ G G + D T + + FSPDGK LA GS DN
Sbjct: 376 VSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNT 435
Query: 682 IYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSEDSMYIRSNSGD 725
I ++++ + + + +S D + S S D
Sbjct: 436 IRLWDL----------------KTSLKSVSFSPDGKVLASKSSD 463
Score = 98.6 bits (244), Expect = 1e-21
Identities = 93/424 (21%), Positives = 167/424 (39%), Gaps = 28/424 (6%)
Query: 337 VSLFTVSIIGIGEFERSISCLSFSKADGGTFLCAVDEGNDYNISVWDWQKSDRGHKITET 396
VSL +S + + E SI+ ++FS G L + +D I +WD +
Sbjct: 51 VSLPDLSSLLLRGHEDSITSIAFSP--DGELLLS--GSSDGTIKLWDLDNG-EKLIKSLE 105
Query: 397 KCSVDTVVAAEFHPLDRNNIVTCGKSHINFWTIDAGGTLYKKQGIFEMIRDKPKYVTCLA 456
+V D N+I+ S + T K E + VT LA
Sbjct: 106 GLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLE---GHSESVTSLA 162
Query: 457 FTQGGEVL--SGDSNGNIIVWTRGSNTISKFIRNVHEGSVFSICVLKEGSIISAGGKDGR 514
F+ G++L +G I +W + + H V S+ +G ++ A G
Sbjct: 163 FSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAG-HTDPVSSLAFSPDGGLLIASGSSDG 221
Query: 515 LLQFDSNLQPTGYEAQIAEHLGGIRTVSEGRGSQLIVGTTRNCILVGSLNLGFSPVVL-- 572
++ + ++ H + + GS L G++ I + L S +
Sbjct: 222 TIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLS 281
Query: 573 GHTDELWALAAHPSLSQFLTAGFDRIVQLWDSMSHSVVWSKDI---AEQAQSATFSPDGS 629
GH+ + ++A P + D V+LWD + ++ S + S +FSPDGS
Sbjct: 282 GHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGS 341
Query: 630 IIIVGC----IAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLALGSRDNYIYVY 685
+++ G W + + + H++ V FSPDG++++ GS D + ++
Sbjct: 342 LLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLS----VSFSPDGRVVSSGSTDGTVRLW 397
Query: 686 EVSGDSRKYSRVGRCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWNAGTCRQIPQSSQL 745
++S S + GH+S +T LD+S D + S S D + W+ T + S
Sbjct: 398 DLSTGSLLRNL----DGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPD 453
Query: 746 RDVV 749
V+
Sbjct: 454 GKVL 457
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 42.8 bits (100), Expect = 7e-04
Identities = 43/160 (26%), Positives = 74/160 (46%), Gaps = 32/160 (20%)
Query: 588 SQFLTAGFDRIVQLWDSMSHSV----------VWSKDI--AEQAQSATFSPDGSIIIVGC 635
SQ ++ F+ +VQ+WD + VWS D A+ A+ S DGS+ +
Sbjct: 546 SQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSI 605
Query: 636 IAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPD-GKLLALGSRDNYIYVYEVSGDSRKY 694
G + +I T N I V+F + G+ LA GS D+ +Y Y++ +
Sbjct: 606 NQGVSI---------GTIKTKAN--ICCVQFPSESGRSLAFGSADHKVYYYDL-----RN 649
Query: 695 SRVGRCT--GHSSFITHLDWSEDSMYIRSNSGDYELLYWN 732
++ CT GHS ++++ + + S + S+S D L W+
Sbjct: 650 PKLPLCTMIGHSKTVSYVRFVDSSTLV-SSSTDNTLKLWD 688
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 36.2 bits (84), Expect = 0.002
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 573 GHTDELWALAAHPSLSQFLTAGFDRIVQLWD 603
GHT + ++A P + + D V++WD
Sbjct: 9 GHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 32.3 bits (74), Expect = 0.047
Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 8/47 (17%)
Query: 289 EQTQRHYLGHTDDIKCLSIHPNKLLIATGQVAGHDAREGKPHIRIWN 335
+ R GHT + ++ P+ L+A+G D +R+W+
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASG---SDDG-----TVRVWD 39
Score = 31.6 bits (72), Expect = 0.11
Identities = 16/35 (45%), Positives = 19/35 (54%), Gaps = 3/35 (8%)
Query: 651 YSIHTDGNEPIQVVKFSPDGKLLALGSRDNYIYVY 685
HT P+ V FSPDG LLA GS D + V+
Sbjct: 7 LKGHTG---PVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 27.7 bits (62), Expect = 2.2
Identities = 9/38 (23%), Positives = 17/38 (44%)
Query: 695 SRVGRCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWN 732
+ GH+ +T + +S D + S S D + W+
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 26.5 bits (59), Expect = 6.1
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 490 HEGSVFSICVLKEGSIISAGGKDGRLLQFD 519
H G V S+ +G+++++G DG + +D
Sbjct: 10 HTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 35.8 bits (83), Expect = 0.003
Identities = 9/31 (29%), Positives = 15/31 (48%)
Query: 573 GHTDELWALAAHPSLSQFLTAGFDRIVQLWD 603
GHT + ++A P + D ++LWD
Sbjct: 10 GHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 32.3 bits (74), Expect = 0.046
Identities = 16/36 (44%), Positives = 20/36 (55%), Gaps = 3/36 (8%)
Query: 651 YSIHTDGNEPIQVVKFSPDGKLLALGSRDNYIYVYE 686
HT P+ V FSPDGK LA GS D I +++
Sbjct: 8 LKGHTG---PVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 31.1 bits (71), Expect = 0.13
Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 8/39 (20%)
Query: 297 GHTDDIKCLSIHPNKLLIATGQVAGHDAREGKPHIRIWN 335
GHT + ++ P+ +A+G D I++W+
Sbjct: 10 GHTGPVTSVAFSPDGKYLASG---SDDG-----TIKLWD 40
Score = 30.7 bits (70), Expect = 0.21
Identities = 10/36 (27%), Positives = 18/36 (50%)
Query: 697 VGRCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWN 732
+ GH+ +T + +S D Y+ S S D + W+
Sbjct: 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 27.3 bits (61), Expect = 2.9
Identities = 9/30 (30%), Positives = 16/30 (53%)
Query: 490 HEGSVFSICVLKEGSIISAGGKDGRLLQFD 519
H G V S+ +G +++G DG + +D
Sbjct: 11 HTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 26.5 bits (59), Expect = 5.4
Identities = 11/27 (40%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 450 KYVTCLAFTQGGEVL-SGDSNGNIIVW 475
VT +AF+ G+ L SG +G I +W
Sbjct: 13 GPVTSVAFSPDGKYLASGSDDGTIKLW 39
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 37.6 bits (87), Expect = 0.030
Identities = 25/138 (18%), Positives = 45/138 (32%), Gaps = 16/138 (11%)
Query: 74 SPISSSPLLKNGTLKGKSFSSSKETLRYRPLNDQINRDPVRRVVSY---TPSSLP-QRRG 129
PI+ SP + T S + + P +D ++ +P P Q+R
Sbjct: 90 GPITHSPSATSSTSSLNSNDGDQ----FSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRK 145
Query: 130 VHYQSTGSLHS-------DSPSSSSV-SPVPSSPRPLPPKTSWTSNLHTSKRWSSTGDFH 181
+ S H + S +P S + +P S + L +
Sbjct: 146 NPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSD 205
Query: 182 HHTMSSHHSVHQNHHNSS 199
S+ S+ +N +SS
Sbjct: 206 TSNGFSYPSIRKNSRHSS 223
Score = 31.4 bits (71), Expect = 2.4
Identities = 25/141 (17%), Positives = 38/141 (26%), Gaps = 17/141 (12%)
Query: 63 SSKNIANNS---FGSPISSSPLLKNGTLKGKSFSSSKETLRYRPLNDQINRDPVRRVVSY 119
SS +S GSP+ KN L S + + + N P R
Sbjct: 125 SSTQSRKDSGPGDGSPVQKR---KNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKE 181
Query: 120 TPSSLPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWTSNLHTSKRWSSTGD 179
PS + S+ SP + + ++ H+S S
Sbjct: 182 IPSL----------GSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPH 231
Query: 180 FHHHTMSSHHS-VHQNHHNSS 199
+ HS SS
Sbjct: 232 SSTAVLLKRHSGSSGASLISS 252
>gnl|CDD|216201 pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal
region. This family is an alignment of the region to
the N-terminal side of the active site. The Prosite
motif does not correspond to this Pfam entry.
Length = 348
Score = 35.7 bits (83), Expect = 0.082
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 11/77 (14%)
Query: 625 SPDGSIIIVGC---------IAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLAL 675
SPDG +++ + + D ET + G IQ K+SPDG LA
Sbjct: 1 SPDGKYLLLATNYEKIWRHSYTASYYIYDLETGTVEPL-PPGEGKIQYAKWSPDGHKLAY 59
Query: 676 GSRDNYIYVYEVSGDSR 692
RDN +YV E++
Sbjct: 60 V-RDNNLYVRELATGKE 75
>gnl|CDD|227282 COG4946, COG4946, Uncharacterized protein related to the
periplasmic component of the Tol biopolymer transport
system [Function unknown].
Length = 668
Score = 35.6 bits (82), Expect = 0.11
Identities = 27/101 (26%), Positives = 43/101 (42%), Gaps = 14/101 (13%)
Query: 624 FSPDGSIIIVGCIAGRWL-VMDSETREAYSIHTD-GNEPIQVVKFSPDGKLLALGSRDNY 681
D ++G G L + D + E I D GN I+ VK SPDGK + + +
Sbjct: 367 IQVDPEGDVIGTNDGDKLGIYDKDGGEVKRIEKDLGN--IEAVKVSPDGKKVVVANDRFE 424
Query: 682 IYVYEVSGDSRKY---SRVGRCTGHSSFITHLDWSEDSMYI 719
++V ++ + + S G IT DW +S +I
Sbjct: 425 LWVIDIDNGNVRLIDKSEYG-------LITDFDWHPNSRWI 458
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small,
circular, anucleate mini-cells. Inactivation of divIVA
produces a mini-cell phenotype, whereas overproduction
of DivIVA results in a filamentation phenotype. These
proteins appear to contain coiled-coils.
Length = 131
Score = 33.4 bits (77), Expect = 0.12
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 6/57 (10%)
Query: 19 LETENESLRERVSDLEKKVLEHGDEIVCLRSTL------ADVLRRLAQLESSKNIAN 69
L ENE L+E + LE+K+ E+ + L+ TL A+ ++ AQ E+ I
Sbjct: 37 LYKENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKE 93
>gnl|CDD|215562 PLN03078, PLN03078, Putative tRNA pseudouridine synthase;
Provisional.
Length = 513
Score = 34.9 bits (80), Expect = 0.16
Identities = 24/67 (35%), Positives = 31/67 (46%), Gaps = 5/67 (7%)
Query: 110 RDPVRRVVSYTPSSLPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWTSNLH 169
R P ++ +SL +R T S S SPSS+S+SP PSS PP T +
Sbjct: 6 RAPWPLWRFWSSTSLLRRVNPRILRTSSSASSSPSSTSLSP-PSSD---PPLTDNSDQGG 61
Query: 170 TSKRWSS 176
K W S
Sbjct: 62 GMK-WES 67
>gnl|CDD|222371 pfam13775, DUF4171, Domain of unknown function (DUF4171). This
short family is frequently found at the N-terminus of
Homeobox proteins.
Length = 128
Score = 32.8 bits (74), Expect = 0.26
Identities = 17/68 (25%), Positives = 26/68 (38%)
Query: 165 TSNLHTSKRWSSTGDFHHHTMSSHHSVHQNHHNSSGGGAKDAQYNEEESSLRMYLRGRPI 224
S K+ S H SS + HH G G+ D ++ + S +Y+ P+
Sbjct: 17 GSEPPLHKKPSKKLGEPSHASSSSSESRKEHHRKKGSGSSDKHSHKSKKSKPLYVNTEPL 76
Query: 225 NLYAPSDL 232
L P L
Sbjct: 77 TLREPDGL 84
>gnl|CDD|237510 PRK13795, PRK13795, hypothetical protein; Provisional.
Length = 636
Score = 34.2 bits (79), Expect = 0.28
Identities = 10/64 (15%), Positives = 28/64 (43%), Gaps = 12/64 (18%)
Query: 625 SPDGSIIIVGCIAGRWLVMDSETREA----YSIH----TDGNEPIQ----VVKFSPDGKL 672
+G+ ++ +W+++D E ++ + + I+ VV + DG++
Sbjct: 113 RLEGAKRLLKKRLKKWVIVDKGALEPIKNGKNVLAPGVVEADLDIKKGDEVVVVTEDGEV 172
Query: 673 LALG 676
+ +G
Sbjct: 173 VGVG 176
>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
This is a family of eukaryotic translation initiation
factors.
Length = 194
Score = 33.0 bits (76), Expect = 0.37
Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 9/70 (12%)
Query: 653 IHTDGNEPIQVVKFSPDGKLLALGSRDN---YIYVYEVSGDSRKYSRVGRCTGHSSFITH 709
IH+ G +P + +SP G+L+ L N I ++V ++ T +S T
Sbjct: 95 IHSLGEQPRNTIFWSPFGRLVLLAGFGNLAGQIEFWDVK----NKKKIA--TAEASNATD 148
Query: 710 LDWSEDSMYI 719
+WS D Y
Sbjct: 149 CEWSPDGRYF 158
>gnl|CDD|219401 pfam07404, TEBP_beta, Telomere-binding protein beta subunit (TEBP
beta). This family consists of several telomere-binding
protein beta subunits which appear to be specific to the
family Oxytrichidae. Telomeres are specialised
protein-DNA complexes that compose the ends of
eukaryotic chromosomes. Telomeres protect chromosome
termini from degradation and recombination and act
together with telomerase to ensure complete genome
replication. TEBP beta forms a complex with TEBP alpha
and this complex is able to recognise and bind ssDNA to
form a sequence-specific, telomeric nucleoprotein
complex that caps the very 3' ends of chromosomes.
Length = 375
Score = 33.5 bits (76), Expect = 0.43
Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 1/73 (1%)
Query: 110 RDPVRRVVSYTPSSLPQRRGVHYQS-TGSLHSDSPSSSSVSPVPSSPRPLPPKTSWTSNL 168
R V ++V YTP+ R+ +S G + ++ S VP++P P K++ T++
Sbjct: 275 RKSVDKIVKYTPNKPSSRKETPQKSQAGKSSAKKTTTGSKKAVPANPSPSGKKSTKTTDQ 334
Query: 169 HTSKRWSSTGDFH 181
T ++ D+H
Sbjct: 335 MTMAQFKKYLDWH 347
>gnl|CDD|192630 pfam10568, Tom37, Outer mitochondrial membrane transport complex
protein. The TOM37 protein is one of the outer membrane
proteins that make up the TOM complex for guiding
cytosolic mitochondrial beta-barrel proteins from the
cytosol across the outer mitochondrial membrane into the
intramembrane space. In conjunction with TOM70 it guides
peptides without an MTS into TOM40, the protein that
forms the passage through the outer membrane. It has
homology with Metaxin-1, also part of the outer
mitochondrial membrane beta-barrel protein transport
complex.
Length = 71
Score = 30.4 bits (69), Expect = 0.50
Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 16/66 (24%)
Query: 610 VWSKDIAEQAQSATFSPDGSIIIVGCIAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPD 669
VW D F S I C+A W + EA+ + N + SP
Sbjct: 3 VWGPD---------FGL--SSISPECLALIWYLKLCLPPEAFEVVFSNNTNL-----SPT 46
Query: 670 GKLLAL 675
G+L AL
Sbjct: 47 GELPAL 52
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 33.4 bits (76), Expect = 0.60
Identities = 20/78 (25%), Positives = 29/78 (37%), Gaps = 5/78 (6%)
Query: 142 SPSSSSVSP-VPSSPRPLPPKTSWTSNLHTSKRWSSTG--DFHHHTMSSHHSVHQNHHNS 198
S +S VP S +SN T + S D HH HHS+ N+H+S
Sbjct: 221 SKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHS 280
Query: 199 SG--GGAKDAQYNEEESS 214
+ + E S+
Sbjct: 281 KHADSKLQTIEVIENHSN 298
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 33.2 bits (76), Expect = 0.62
Identities = 22/82 (26%), Positives = 28/82 (34%), Gaps = 10/82 (12%)
Query: 103 PLNDQINRDPVRRVVSYTPSSLP-------QRRGVHY--QSTGSLHSDSPSSSSVSPVPS 153
P+N P S +PS + Q T S SPSSS SP S
Sbjct: 128 PMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQSPQLTPSNKPASPSSSYQSPSYS 187
Query: 154 SPRPLPPKTSWTSNLHTSKRWS 175
S P +S + S W+
Sbjct: 188 SSLG-PVNSSGNRSNLRSSPWA 208
>gnl|CDD|234377 TIGR03866, PQQ_ABC_repeats, PQQ-dependent catabolism-associated
beta-propeller protein. Members of this protein family
consist of seven repeats each of the YVTN family
beta-propeller repeat (see TIGR02276). Members occur
invariably as part of a transport operon that is
associated with PQQ-dependent catabolism of alcohols
such as phenylethanol.
Length = 300
Score = 32.7 bits (75), Expect = 0.63
Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 8/89 (8%)
Query: 596 DRIVQLWDSMSHSVVWSKDIAEQAQSATFSPDGSIIIVGCI--AGRWLVMDSETREAYSI 653
D + + D+ + V + + ++ + TFS DG ++ V C + V+D T E +
Sbjct: 10 DNTISVIDTATLEVTRTFPVGQRPRGITFSKDGKLLYV-CASDSDTVQVIDPATGEV--L 66
Query: 654 HT--DGNEPIQVVKFSPDGKLLALGSRDN 680
HT G +P P+GK+L + + D+
Sbjct: 67 HTLPSGPDPELFA-LHPNGKILYIANEDD 94
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 32.9 bits (75), Expect = 0.66
Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 5/69 (7%)
Query: 573 GHTDELWALAAHPSLSQFL-TAGFDRIVQLWDSMSHSVVWS-KDIAEQAQSATFSPDGSI 630
GHT ++ ++ HPS L +AG D +V +WD V K ++Q S ++ DGS+
Sbjct: 123 GHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSL 182
Query: 631 IIVGCIAGR 639
+ C +
Sbjct: 183 L---CTTSK 188
>gnl|CDD|166839 PRK00178, tolB, translocation protein TolB; Provisional.
Length = 430
Score = 32.8 bits (75), Expect = 0.67
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 10/64 (15%)
Query: 666 FSPDGKLLALG-SRDNY--IYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSED--SMYIR 720
+SPDG LA S+D IYV +++ SR+ SRV T H + T W +D ++Y
Sbjct: 250 WSPDGSKLAFVLSKDGNPEIYVMDLA--SRQLSRV---TNHPAIDTEPFWGKDGRTLYFT 304
Query: 721 SNSG 724
S+ G
Sbjct: 305 SDRG 308
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper.
Length = 65
Score = 29.8 bits (68), Expect = 0.70
Identities = 7/19 (36%), Positives = 13/19 (68%)
Query: 19 LETENESLRERVSDLEKKV 37
LE ENE L++ + L +++
Sbjct: 38 LEAENERLKKEIERLRREL 56
>gnl|CDD|237616 PRK14127, PRK14127, cell division protein GpsB; Provisional.
Length = 109
Score = 30.8 bits (70), Expect = 0.83
Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 6/52 (11%)
Query: 17 DMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLA------DVLRRLAQLE 62
+ L+ EN L+ +V +L K+V + + D+L+RL+ LE
Sbjct: 47 EELQQENARLKAQVDELTKQVSVGASSSSVATTQPSSSATNYDILKRLSNLE 98
>gnl|CDD|234019 TIGR02800, propeller_TolB, tol-pal system beta propeller repeat
protein TolB. Members of this protein family are the
TolB periplasmic protein of Gram-negative bacteria. TolB
is part of the Tol-Pal (peptidoglycan-associated
lipoprotein) multiprotein complex, comprising five
envelope proteins, TolQ, TolR, TolA, TolB and Pal, which
form two complexes. The TolQ, TolR and TolA
inner-membrane proteins interact via their transmembrane
domains. The {beta}-propeller domain of the periplasmic
protein TolB is responsible for its interaction with
Pal. TolB also interacts with the outer-membrane
peptidoglycan-associated proteins Lpp and OmpA. TolA
undergoes a conformational change in response to changes
in the proton-motive force, and interacts with Pal in an
energy-dependent manner. The C-terminal periplasmic
domain of TolA also interacts with the N-terminal domain
of TolB. The Tol-PAL system is required for bacterial
outer membrane integrity. E. coli TolB is involved in
the tonB-independent uptake of group A colicins
(colicins A, E1, E2, E3 and K), and is necessary for the
colicins to reach their respective targets after initial
binding to the bacteria. It is also involved in uptake
of filamentous DNA. Study of its structure suggest that
the TolB protein might be involved in the recycling of
peptidoglycan or in its covalent linking with
lipoproteins. The Tol-Pal system is also implicated in
pathogenesis of E. coli, Haemophilus ducreyi ,
Salmonella enterica and Vibrio cholerae, but the
mechanism(s) is unclear [Transport and binding proteins,
Other, Cellular processes, Pathogenesis].
Length = 417
Score = 32.6 bits (75), Expect = 0.94
Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 7/79 (8%)
Query: 621 SATFSPDG-SIIIVGCIAGR--WLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLALGS 677
++SPDG SI G +MD++ EA + GN SPDG L+A
Sbjct: 282 EPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEARRLTFRGNYNASPSW-SPDGDLIAFVH 340
Query: 678 RDN---YIYVYEVSGDSRK 693
R+ I V ++ G +
Sbjct: 341 REGGGFNIAVMDLDGGGER 359
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 31.6 bits (72), Expect = 1.1
Identities = 15/44 (34%), Positives = 16/44 (36%)
Query: 117 VSYTPSSLPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPP 160
S T L V ST S P+ V VP P P PP
Sbjct: 149 NSTTSDLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPPPPPPP 192
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the
mating type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi
have evolved a unique mating system, termed tetrapolar
or bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is
thus unique to the mushroom fungi. This domain is
C-terminal to the homeodomain transcription factor
region.
Length = 418
Score = 32.1 bits (73), Expect = 1.3
Identities = 26/116 (22%), Positives = 41/116 (35%), Gaps = 21/116 (18%)
Query: 73 GSPISSSPLLKNGTLKGKSFSSS-------KETLRYRPLNDQI--NRDPVRRVVSYTPSS 123
S + SP+L + + SSS E RP +D I + P + + PS
Sbjct: 90 PSVLDLSPVLASPQTGKRRRSSSPSDDEDEAERPSKRPRSDSISSSSSPAKPPEACLPSP 149
Query: 124 LPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWTSNLHTSKRWSSTGD 179
S + S +S +P+P+ P + T+ KR S G
Sbjct: 150 AA-----------STQDEL-SEASAAPLPTPSLSPPHTPTDTAPSGKRKRRLSDGF 193
>gnl|CDD|235312 PRK04792, tolB, translocation protein TolB; Provisional.
Length = 448
Score = 31.9 bits (73), Expect = 1.3
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 4/52 (7%)
Query: 665 KFSPDGKLLALG-SRDNYIYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSED 715
+FSPDGK LAL S+D +Y V ++ +R+ T H + T W D
Sbjct: 268 RFSPDGKKLALVLSKDGQPEIYVVDIATKALTRI---TRHRAIDTEPSWHPD 316
>gnl|CDD|213367 cd12833, ZntB-like_1, Salmonella typhimurium Zn2+ transporter
ZntB-like subgroup. A bacterial subgroup belonging to
the Escherichia coli CorA-Salmonella typhimurium
ZntB_like family (EcCorA_ZntB-like) of the MIT
superfamily of essential membrane proteins involved in
transporting divalent cations (uptake or efflux) across
membranes. This subgroup includes the Zn2+ transporter
Salmonella typhimurium ZntB which mediates the efflux of
Zn2+ (and Cd2+). Structures of the intracellular domain
of Vibrio parahaemolyticus and Salmonella typhimurium
ZntB form funnel-shaped homopentamers, the tip of the
funnel is formed from two C-terminal transmembrane (TM)
helices from each monomer, and the large opening of the
funnel from the N-terminal cytoplasmic domains. The GMN
signature motif of the MIT superfamily occurs just after
TM1, mutation within this motif is known to abolish Mg2+
transport through Salmonella typhimurium CorA, and
Mrs2p. Natural variants such as GVN and GIN, which occur
in proteins belonging to this subfamily, may be
associated with the transport of different divalent
cations, such as zinc and cadmium. The functional
diversity of MIT transporters may also be due to minor
structural differences regulating gating, substrate
selection, and transport.
Length = 290
Score = 31.7 bits (73), Expect = 1.4
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 3/48 (6%)
Query: 14 LHDDMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQL 61
L D M E + L +R+ +LE++VLE DE LR LA++ R+ L
Sbjct: 116 LTDRM-EDVIDELEDRLDELEERVLEEEDEE--LRGELAELRRQAIAL 160
>gnl|CDD|179892 PRK04922, tolB, translocation protein TolB; Provisional.
Length = 433
Score = 31.9 bits (73), Expect = 1.4
Identities = 24/62 (38%), Positives = 32/62 (51%), Gaps = 6/62 (9%)
Query: 666 FSPDGKLLALG-SRDNYIYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSED--SMYIRSN 722
FSPDG+ LAL SRD +Y + SR ++ R T H T W+ D S+Y S+
Sbjct: 255 FSPDGRRLALTLSRDGNPEIYVMDLGSR---QLTRLTNHFGIDTEPTWAPDGKSIYFTSD 311
Query: 723 SG 724
G
Sbjct: 312 RG 313
>gnl|CDD|205130 pfam12894, Apc4_WD40, Anaphase-promoting complex subunit 4 WD40
domain. Apc4 contains an N-terminal propeller-shaped
WD40 domain.The N-terminus of Afi1 serves to stabilise
the union between Apc4 and Apc5, both of which lie
towards the bottom-front of the APC,.
Length = 47
Score = 28.4 bits (64), Expect = 1.4
Identities = 5/36 (13%), Positives = 18/36 (50%)
Query: 654 HTDGNEPIQVVKFSPDGKLLALGSRDNYIYVYEVSG 689
+ ++++ + P L+AL + D + ++ ++
Sbjct: 7 EKNLPARVKILSWCPTMDLIALATEDGELLLHRLNW 42
>gnl|CDD|129323 TIGR00219, mreC, rod shape-determining protein MreC. MreC (murein
formation C) is involved in the rod shape determination
in E. coli, and more generally in cell shape
determination of bacteria whether or not they are
rod-shaped. Cells defective in MreC are round. Species
with MreC include many of the Proteobacteria,
Gram-positives, and spirochetes [Cell envelope,
Biosynthesis and degradation of murein sacculus and
peptidoglycan].
Length = 283
Score = 31.4 bits (71), Expect = 1.5
Identities = 29/118 (24%), Positives = 38/118 (32%), Gaps = 22/118 (18%)
Query: 17 DMLETENESLRE----RVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLESSKNIANNSF 72
+ LE EN LR+ + LE E V LR L L S++ I N
Sbjct: 69 NNLEYENYKLRQELLKKNQQLEILTQNLKQENVRLRELLNSPLSSDEYKISAEVIYLNY- 127
Query: 73 GSPISSSPLLKNGT----------------LKGKSFSSSKETLRYRPLNDQINRDPVR 114
S+ ++ G L GK S T R L D N P +
Sbjct: 128 -DNYSTQVVINKGFNDGVYKDMPVIADGKGLVGKVVSVGSNTSRVLLLTDYTNFVPAQ 184
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 31.5 bits (71), Expect = 1.6
Identities = 40/187 (21%), Positives = 59/187 (31%), Gaps = 41/187 (21%)
Query: 48 RSTLADVLRRLAQLES--SKNIANNSFGSPISSSPLLKNGTLKGKSFSSSKETLRYRPLN 105
RS+ L S S+ N+ SP SSSP ++ SSS
Sbjct: 18 RSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSP--------ARNTSSSSS-------- 61
Query: 106 DQINRDPVRRVVSYTPSSLPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWT 165
+R S + L R SPS S S S L ++
Sbjct: 62 ---FGLSKQRPSSLSRGRLSSR------------FVSPSRGSPSAAASLNGSLATASTSG 106
Query: 166 SNLHTSKRWSSTGDFHHHTMSSHHSVHQNHHNSSGGGAKDAQYNEEESSLRM----YLRG 221
S+ + R +++ D S S + G K E+ LR+ L+
Sbjct: 107 SSSPSRSRRTTSSDLSSGNGPSVLSFMAD----VKRGKKGPSKIEDAHQLRLLYNRLLQW 162
Query: 222 RPINLYA 228
R +N A
Sbjct: 163 RFVNARA 169
>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
Length = 175
Score = 30.4 bits (69), Expect = 2.1
Identities = 13/50 (26%), Positives = 29/50 (58%), Gaps = 5/50 (10%)
Query: 19 LETENESLRERVSDLEKKVLEHGDEIVCL-----RSTLADVLRRLAQLES 63
LE + ++ R+ +LE+++ + D++V R + ++L RL +LE+
Sbjct: 90 LEAQLNTITRRLDELERQLQQKADDVVSYQLLQHRREMEEMLERLQKLEA 139
>gnl|CDD|115305 pfam06635, NolV, Nodulation protein NolV. This family consists of
several nodulation protein NolV sequences from different
Rhizobium species. The function of this family is
unclear.
Length = 207
Score = 30.8 bits (69), Expect = 2.4
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 304 CLSIHPNKLLIATGQVAGHDAREGKPHIRI 333
CL + P K + + A D +G+P +RI
Sbjct: 134 CLHVSPLKADLLAREFADWDGLDGRPKLRI 163
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 31.2 bits (71), Expect = 2.9
Identities = 15/53 (28%), Positives = 21/53 (39%), Gaps = 7/53 (13%)
Query: 20 ETENESLRERVSDLEKKVLEHGDEIVCLRSTLA-------DVLRRLAQLESSK 65
E E E L E V DL +++ E +E L + V R +LE
Sbjct: 271 EREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRD 323
Score = 29.6 bits (67), Expect = 9.0
Identities = 12/46 (26%), Positives = 23/46 (50%)
Query: 17 DMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLE 62
D LE E LRE ++LE ++ E + + R + ++ + +L
Sbjct: 352 DDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELR 397
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 31.0 bits (70), Expect = 3.4
Identities = 25/69 (36%), Positives = 34/69 (49%), Gaps = 19/69 (27%)
Query: 11 AKLLHD-DMLETENESLRERVSDL------------EKKVLEHGDEIVCLRSTLADV--L 55
A+LL D LE EN SL+++VS L E + LE +E L S L + L
Sbjct: 93 ARLLQDYSELEEENISLQKQVSVLRQSQVEFEGLKHEIRRLE--EETELLNSQLEEAARL 150
Query: 56 RRLA--QLE 62
+ +A QLE
Sbjct: 151 KEIAEKQLE 159
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 30.8 bits (70), Expect = 3.7
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 18 MLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLE 62
LE E E L+ER+ +LE+ + EI ++S L ++ R+ +LE
Sbjct: 727 QLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELE 771
>gnl|CDD|223893 COG0823, TolB, Periplasmic component of the Tol biopolymer
transport system [Intracellular trafficking and
secretion].
Length = 425
Score = 30.5 bits (69), Expect = 3.8
Identities = 23/98 (23%), Positives = 35/98 (35%), Gaps = 16/98 (16%)
Query: 665 KFSPDGKLLALGSRDNY---IYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSEDSMYI-- 719
FSPDG LA S + IY+ ++ G + R T T WS D I
Sbjct: 244 AFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLP-----RLTNGFGINTSPSWSPDGSKIVF 298
Query: 720 RSNSGDYELLYWNAGTCRQIPQSSQLRDVVWESNSCTL 757
S+ G +Y + SQ+ + + +
Sbjct: 299 TSDRGGRPQIY------LYDLEGSQVTRLTFSGGGNSN 330
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.5 bits (69), Expect = 4.4
Identities = 16/44 (36%), Positives = 27/44 (61%)
Query: 19 LETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLE 62
LE + +SL ++ LE +++E D++ LR +AD+ RL LE
Sbjct: 64 LEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALE 107
>gnl|CDD|225333 COG2706, COG2706, 3-carboxymuconate cyclase [Carbohydrate transport
and metabolism].
Length = 346
Score = 30.0 bits (68), Expect = 4.5
Identities = 13/81 (16%), Positives = 31/81 (38%), Gaps = 4/81 (4%)
Query: 643 MDSETREAYSIHT-DGNEPIQVVKFSPDGKLLALG---SRDNYIYVYEVSGDSRKYSRVG 698
+D++T E + + +PD + L + + + Y + D + + +
Sbjct: 23 LDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLN 82
Query: 699 RCTGHSSFITHLDWSEDSMYI 719
R T S ++ ED ++
Sbjct: 83 RQTLPGSPPCYVSVDEDGRFV 103
>gnl|CDD|114269 pfam05538, Campylo_MOMP, Campylobacter major outer membrane
protein. This family consists of Campylobacter major
outer membrane proteins. The major outer membrane
protein (MOMP), a putative porin and a multifunction
surface protein of Campylobacter jejuni, may play an
important role in the adaptation of the organism to
various host environments.
Length = 437
Score = 30.0 bits (67), Expect = 4.9
Identities = 23/95 (24%), Positives = 36/95 (37%), Gaps = 6/95 (6%)
Query: 399 SVDTVVAAEFHPLDRNNIVTCGKSHINFWTIDAGGTLY--KKQGIFEMIRDKPKYVTCLA 456
S+D + + + N G +N W GG Y K + I D+ + LA
Sbjct: 271 SLDNELDDKLDAANGNFFALKGSIEVNGWDASLGGLYYGDKDKASLTTIEDQGNLGSLLA 330
Query: 457 ----FTQGGEVLSGDSNGNIIVWTRGSNTISKFIR 487
F G L+GD NI + T ++ +R
Sbjct: 331 GEEIFYTNGSNLNGDIGRNIFGYVTAGYTFNETVR 365
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 30.1 bits (68), Expect = 5.1
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 7/56 (12%)
Query: 20 ETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRL-------AQLESSKNIA 68
E E E+L+E++ +LE + E EI L S + V L +LE I
Sbjct: 269 EAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIK 324
>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein. This is is a family of
Papillomavirus proteins, E4, coded for by ORF4. A splice
variant, E1--E4, exists but neither the function of E4
or E1--E4 is known.
Length = 95
Score = 28.0 bits (63), Expect = 5.2
Identities = 13/41 (31%), Positives = 17/41 (41%), Gaps = 5/41 (12%)
Query: 125 PQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWT 165
P++R L SDS SSS+ SP + WT
Sbjct: 34 PKKRCRR-----RLESDSDSSSTTSPSSPPTPQPTTECPWT 69
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 30.3 bits (68), Expect = 5.3
Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 9/113 (7%)
Query: 660 PIQVVKFSP-DGKLLALGSRDNYIYVYEV--SGDSRKYSRVGRC--TGHSSFITHLDWSE 714
I ++F+P ++LA GS D I V+E+ + +S K + +C GH I+ +DW+
Sbjct: 76 SILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNP 135
Query: 715 DSMYIRSNSG-DYELLYW---NAGTCRQIPQSSQLRDVVWESNSCTLTFTTIG 763
+ YI +SG D + W N QI +L + W L+ T +G
Sbjct: 136 MNYYIMCSSGFDSFVNIWDIENEKRAFQINMPKKLSSLKWNIKGNLLSGTCVG 188
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 30.1 bits (67), Expect = 5.7
Identities = 39/154 (25%), Positives = 57/154 (37%), Gaps = 32/154 (20%)
Query: 4 GDKDSVYAKLLHDDMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLES 63
G DSV + HD M ++SLR V D +R D +R A
Sbjct: 591 GVHDSVRGGV-HDSMRSGVHDSLRGGVHD-------------SIRGGAHDGVRSGAHDGV 636
Query: 64 SKNIANNSFGSPISSSPLLKNGTLKGKSFSSSKETLRYRPLNDQINRDPVRRVVSYTPSS 123
K+ G + +P GT G +FS K++ + V SY P +
Sbjct: 637 RKSSGVKVEGMGNARAPPPNQGTHDG-NFSRGKDSSDH--------------VSSYKPYA 681
Query: 124 LPQRRGVHYQST---GSLHSDSPSSSSVSPVPSS 154
R+ +YQ T +H +S + VPSS
Sbjct: 682 YDSRKAKNYQDTFDRNDMHGYMKNSPISTIVPSS 715
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 30.1 bits (68), Expect = 5.8
Identities = 16/50 (32%), Positives = 25/50 (50%)
Query: 17 DMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLESSKN 66
+ LE E E L+E + +LE + E DE+ L ++ L +LES
Sbjct: 852 EELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELA 901
>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein.
Length = 326
Score = 29.9 bits (67), Expect = 5.8
Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 3/77 (3%)
Query: 2 NLGDKDSVYAKLLHDDMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQL 61
N GD +Y +L + LE+ SL E VS + ++ + ++ + ++L D L L
Sbjct: 26 NPGDLTPIYERLTN---LESATASLSESVSTISSRLSDLSADLQDMTTSLDDAKATLNGL 82
Query: 62 ESSKNIANNSFGSPISS 78
++ S S S
Sbjct: 83 ITTVQALQTSVDSLSSE 99
>gnl|CDD|152790 pfam12355, Dscam_C, Down syndrome cell adhesion molecule C
terminal. This domain family is found in eukaryotes,
and is approximately 120 amino acids in length. The
family is found in association with pfam00047,
pfam07679, pfam00041. The Down syndrome cell adhesion
molecule (Dscam) belongs to a family of cell membrane
molecules involved in the differentiation of the nervous
system. This is the C terminal cytoplasmic tail region
of Dscam.
Length = 125
Score = 28.6 bits (63), Expect = 6.4
Identities = 13/30 (43%), Positives = 15/30 (50%)
Query: 131 HYQSTGSLHSDSPSSSSVSPVPSSPRPLPP 160
HY S GS+ S S+ P SP P PP
Sbjct: 40 HYGSRGSMGRRSIGSARNIPGNGSPEPPPP 69
>gnl|CDD|180423 PRK06143, PRK06143, enoyl-CoA hydratase; Provisional.
Length = 256
Score = 29.6 bits (67), Expect = 6.5
Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 9/60 (15%)
Query: 561 GSLNLGFSPVVLGHTDELWALAAHPSLSQFLTAGFDRIVQLWDSMSHSVVWSKDIAEQAQ 620
GSLN+ +PV+L T L LAA P + R++ L + + + DI E A
Sbjct: 26 GSLNILGTPVILALTQALRWLAADPDV---------RVLVLRGAGEKAFIGGADIKEMAT 76
>gnl|CDD|218912 pfam06148, COG2, COG (conserved oligomeric Golgi) complex
component, COG2. The COG complex comprises eight
proteins COG1-8. The COG complex plays critical roles in
Golgi structure and function. The proposed function of
the complex is to mediate the initial physical contact
between transport vesicles and their membrane targets. A
comparable role in tethering vesicles has been suggested
for at least six additional large multisubunit
complexes, including the exocyst, a complex that
mediates trafficking to the plasma membrane. COG2
structure reveals a six-helix bundle with few conserved
surface features but a general resemblance to recently
determined crystal structures of four different exocyst
subunits. These bundles inCOG2 may act as platforms for
interaction with other trafficing proteins including
SNAREs (soluble N-ethylmaleimide factor attachment
protein receptors) and Rabs.
Length = 133
Score = 28.4 bits (64), Expect = 6.7
Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 14/76 (18%)
Query: 10 YAKLLHDDMLETENE-------------SLRERVSDLEKKVLEHGDEIVCLRSTLADVLR 56
Y KLL +++E NE L E++ + +L E+ +RS L L
Sbjct: 38 YLKLLKSELIELINEDYADFVSLSTNLVGLDEKLERIRVPLLALRKEVESVRSELEARLE 97
Query: 57 RL-AQLESSKNIANNS 71
+ A+LE K++
Sbjct: 98 EVEAKLEEKKDLREKK 113
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 29.7 bits (67), Expect = 7.8
Identities = 20/105 (19%), Positives = 30/105 (28%), Gaps = 12/105 (11%)
Query: 16 DDMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLESSKNIANNS---- 71
D LE E VS + LE + L +L +S+ I +
Sbjct: 13 ADALEEELLISEGGVSSALEVNLELELIEGEKKLELVVCDTKLGNADSAFEIPSEVGEDL 72
Query: 72 -------FGSPISSSPLLKNGTLKGKSFSSSKETLRYRP-LNDQI 108
G SS L G S+ +R D++
Sbjct: 73 RENAEEALGELSSSDEDDYYRMLLGLGHELSRIKVREEVDKEDKL 117
>gnl|CDD|222217 pfam13552, DUF4127, Protein of unknown function (DUF4127). This
family of uncharacterized bacterial proteins are about
500 amino acids in length.
Length = 495
Score = 29.6 bits (67), Expect = 8.5
Identities = 12/28 (42%), Positives = 15/28 (53%), Gaps = 1/28 (3%)
Query: 222 RPINLYAPSDLAESYDITKVAAPPQNKL 249
RP N P+DLA+ I V PP+ L
Sbjct: 10 RPCNYDFPADLAKIAGIE-VLTPPKELL 36
>gnl|CDD|224614 COG1700, COG1700, Uncharacterized conserved protein [Function
unknown].
Length = 503
Score = 29.1 bits (65), Expect = 9.9
Identities = 13/40 (32%), Positives = 17/40 (42%)
Query: 6 KDSVYAKLLHDDMLETENESLRERVSDLEKKVLEHGDEIV 45
+D Y + ML E E S + KV HGDE+
Sbjct: 13 EDCPYYDNESEIMLIEETELFETEGSLKDVKVEPHGDELF 52
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.133 0.403
Gapped
Lambda K H
0.267 0.0799 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 40,958,247
Number of extensions: 3960391
Number of successful extensions: 4007
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3912
Number of HSP's successfully gapped: 96
Length of query: 811
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 706
Effective length of database: 6,280,432
Effective search space: 4433984992
Effective search space used: 4433984992
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (27.9 bits)