RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2823
(122 letters)
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 38.1 bits (89), Expect = 5e-04
Identities = 24/96 (25%), Positives = 46/96 (47%), Gaps = 3/96 (3%)
Query: 30 KEELERSQATLATL---STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKT 86
K EL + L L EL+ LQE+ + +L AEL+ ++ E +R E
Sbjct: 219 KAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSE 278
Query: 87 MQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
++E++E+ Q E+Y L ++ + + + ++E
Sbjct: 279 LEEEIEELQKELYALANEISRLEQQKQILRERLANL 314
Score = 34.6 bits (80), Expect = 0.007
Identities = 19/111 (17%), Positives = 51/111 (45%), Gaps = 2/111 (1%)
Query: 11 TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
L L ++ +E++E + +L+ AE++ L+E ++ ++L E
Sbjct: 825 RLESLERRIAATERRLEDLEEQIEELSEDIESLA--AEIEELEELIEELESELEALLNER 882
Query: 71 RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
++ +RSE + + E++ + + + L+ +LE + ++ ++ E
Sbjct: 883 ASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEG 933
Score = 34.3 bits (79), Expect = 0.009
Identities = 25/122 (20%), Positives = 54/122 (44%), Gaps = 12/122 (9%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATL-----STSAELDRLQEKYDKTCA----- 61
L L E +A E+++ EL+ + L L E++ LQ++
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300
Query: 62 --DLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEY 119
+ + L ++ E + ++ + ++ K+++ E+ L+ KLE + E+ES++ E
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAEL 360
Query: 120 ER 121
E
Sbjct: 361 EE 362
Score = 31.2 bits (71), Expect = 0.13
Identities = 26/110 (23%), Positives = 57/110 (51%), Gaps = 2/110 (1%)
Query: 11 TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
L L ++ ++ ++D+ EEL + L L EL+ L+ + ++ A+L ++ L
Sbjct: 317 QLEELEAQLEELESKLDELAEELAELEEKLEELK--EELESLEAELEELEAELEELESRL 374
Query: 71 RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
++ E +RS+ ++ ++ E+ RL+A+LE + E +++E E
Sbjct: 375 EELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIE 424
Score = 28.9 bits (65), Expect = 0.75
Identities = 14/87 (16%), Positives = 43/87 (49%)
Query: 35 RSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS 94
++ + L E++ L+EK ++ + + L ++ + E + E + +++++E+
Sbjct: 665 SAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEEL 724
Query: 95 QGEVYRLKAKLENTQGEMESMKEEYER 121
++ L+ L + E+E ++E +
Sbjct: 725 SRQISALRKDLARLEAEVEQLEERIAQ 751
Score = 27.7 bits (62), Expect = 1.5
Identities = 21/117 (17%), Positives = 50/117 (42%), Gaps = 12/117 (10%)
Query: 11 TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
L L + ++ +++ E+E + + L +EL+ L + L ++EL
Sbjct: 839 RLEDLEEQIEELSEDIESLAAEIEELEELIEELE--SELEALLNERASLEEALALLRSEL 896
Query: 71 RVVQADNERVRSEEKTM-------QEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
+ + + S+ + +EK+ + + + L+ +++N Q E + EEY
Sbjct: 897 EELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQ---ERLSEEYS 950
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 36.9 bits (85), Expect = 0.001
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 2/105 (1%)
Query: 10 GTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAE 69
L E + E+++ + +L S ++ T S EL R + A + + +
Sbjct: 3 AQLENAQRENELLKKELERKQSKLGSSMNSIKTF-WSPELKRERALRKDEAARITVWKEQ 61
Query: 70 LRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMES 114
LRV Q +N+ + + +QE++ KSQ E+ RL+ KLE + E
Sbjct: 62 LRVTQEENQHAQLTVQALQEEL-KSQRELNRLQQKLEKSGNETGE 105
Score = 27.3 bits (60), Expect = 2.3
Identities = 27/103 (26%), Positives = 50/103 (48%), Gaps = 2/103 (1%)
Query: 20 DKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNER 79
D+A E+ + K EL Q L TL+ +++ DK DL RA+ E ++Q + +
Sbjct: 290 DRAAQELSRKKTELLGLQTELETLANQD--SDMRQHLDKLKEDLTRAEQEKAILQTEVDA 347
Query: 80 VRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
+R E + + K + + + GE+E M++ YE++
Sbjct: 348 LRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKT 390
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 35.0 bits (81), Expect = 0.006
Identities = 25/104 (24%), Positives = 50/104 (48%), Gaps = 9/104 (8%)
Query: 18 ERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADN 77
E +++K K E+ + ELDRLQE+ + +L A + ++A
Sbjct: 386 ELKDYREKLEKLKREINELKR---------ELDRLQEELQRLSEELADLNAAIAGIEAKI 436
Query: 78 ERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
+ E++ +++K + ++ +L A L + E+ +KEEY+R
Sbjct: 437 NELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDR 480
Score = 32.3 bits (74), Expect = 0.040
Identities = 24/114 (21%), Positives = 62/114 (54%), Gaps = 6/114 (5%)
Query: 8 PPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQ 67
P G + SE + + + + E L+R ++L +EL R++ + D+ +L A
Sbjct: 662 PRGGILFSRSEPAE-LQRLRERLEGLKRELSSLQ-----SELRRIENRLDELSQELSDAS 715
Query: 68 AELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
++ ++ + E++ EE+ ++E++E+ + ++ L+ ++EN + E++ ++ E
Sbjct: 716 RKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEE 769
Score = 31.2 bits (71), Expect = 0.11
Identities = 32/112 (28%), Positives = 57/112 (50%), Gaps = 9/112 (8%)
Query: 10 GTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAE 69
L++L +E DK + E+++ + E+E + D+L E+Y + +L +AE
Sbjct: 322 ERLAKLEAEIDKLLAEIEELEREIEEERK---------RRDKLTEEYAELKEELEDLRAE 372
Query: 70 LRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
L V + R E K +EK+EK + E+ LK +L+ Q E++ + EE
Sbjct: 373 LEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELAD 424
Score = 30.8 bits (70), Expect = 0.15
Identities = 23/110 (20%), Positives = 46/110 (41%), Gaps = 2/110 (1%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
L + +K E+++ K EL+R Q L LS EL L A + + E
Sbjct: 387 LKDYREKLEKLKREINELKRELDRLQEELQRLS--EELADLNAAIAGIEAKINELEEEKE 444
Query: 72 VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
+ ++ + + + + K + E+Y LK + + + E+ ++ E
Sbjct: 445 DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAE 494
Score = 27.7 bits (62), Expect = 1.6
Identities = 21/96 (21%), Positives = 53/96 (55%), Gaps = 2/96 (2%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
LS L SE + +D+ +EL + + + E+++L+++ +K L + +L
Sbjct: 690 LSSLQSELRRIENRLDELSQELSDASRKIGEIE--KEIEQLEQEEEKLKERLEELEEDLS 747
Query: 72 VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
++ + E V+SE K ++ ++E+ + ++++L+ L +
Sbjct: 748 SLEQEIENVKSELKELEARIEELEEDLHKLEEALND 783
Score = 27.3 bits (61), Expect = 2.4
Identities = 12/98 (12%), Positives = 42/98 (42%)
Query: 24 LEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSE 83
+E ++ LE+ + L+E+ ++ + ++ + E + +
Sbjct: 817 IEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAA 876
Query: 84 EKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
+ ++ ++ + E L+A+L + ++E ++ + E+
Sbjct: 877 LRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914
Score = 25.4 bits (56), Expect = 9.4
Identities = 22/118 (18%), Positives = 55/118 (46%), Gaps = 9/118 (7%)
Query: 11 TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
+ L +E + + + ELE ++ LA L AE+D+L + ++ ++ +
Sbjct: 295 KIGELEAEIASLERSIAEKERELEDAEERLAKLE--AEIDKLLAEIEELEREIEEERKRR 352
Query: 71 RVVQADNERVRSEEKTMQEKVEK-------SQGEVYRLKAKLENTQGEMESMKEEYER 121
+ + ++ E + ++ ++E+ ++ E+ + KLE + E+ +K E +R
Sbjct: 353 DKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDR 410
>gnl|CDD|153339 cd07655, F-BAR_PACSIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization. Protein
kinase C and Casein kinase Substrate in Neurons (PACSIN)
proteins, also called Synaptic dynamin-associated
proteins (Syndapins), act as regulators of cytoskeletal
and membrane dynamics. They bind both dynamin and
Wiskott-Aldrich syndrome protein (WASP), and may provide
direct links between the actin cytoskeletal machinery
through WASP and dynamin-dependent endocytosis.
Vetebrates harbor three isoforms with distinct
expression patterns and specific functions. PACSINs
contain an N-terminal F-BAR domain and a C-terminal SH3
domain. F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 258
Score = 33.1 bits (76), Expect = 0.018
Identities = 19/84 (22%), Positives = 40/84 (47%), Gaps = 8/84 (9%)
Query: 47 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 106
++++ ++ Y C + AQ + ++D + K +Q+KVEK + EV + K K E
Sbjct: 133 KKVEKAKKAYHAACKAEKSAQKQENNAKSDTSLSPDQVKKLQDKVEKCKQEVSKTKDKYE 192
Query: 107 NTQGE--------MESMKEEYERS 122
+ ME M++ +++
Sbjct: 193 KALEDLNKYNPRYMEDMEQVFDKC 216
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 33.0 bits (75), Expect = 0.027
Identities = 21/71 (29%), Positives = 40/71 (56%)
Query: 48 ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
+L+ LQ++ ++ +L +AE VQ +R+ E ++E ++K GEVY LK + +
Sbjct: 143 KLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDE 202
Query: 108 TQGEMESMKEE 118
+ +E +EE
Sbjct: 203 LEPGVELPEEE 213
>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
This model represents the MFP (membrane fusion protein)
component of the RND family of transporters. RND refers
to Resistance, Nodulation, and cell Division. It is, in
part, a subfamily of pfam00529 (Pfam release 7.5) but
hits substantial numbers of proteins missed by that
model. The related HlyD secretion protein, for which
pfam00529 is named, is outside the scope of this model.
Attributed functions imply outward transport. These
functions include nodulation, acriflavin resistance,
heavy metal efflux, and multidrug resistance proteins.
Most members of this family are found in Gram-negative
bacteria. The proposed function of MFP proteins is to
bring the inner and outer membranes together and enable
transport to the outside of the outer membrane. Note,
however, that a few members of this family are found in
Gram-positive bacteria, where there is no outer membrane
[Transport and binding proteins, Unknown substrate].
Length = 322
Score = 32.7 bits (75), Expect = 0.027
Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 1/79 (1%)
Query: 18 ERDKAVLEMDKSKEELERSQATLATLSTSAE-LDRLQEKYDKTCADLRRAQAELRVVQAD 76
+ D L + + +L ++A L S E +RL ++ + ADL A+A + QAD
Sbjct: 58 DDDDYQLALQAALAQLAAAEAQLELAQRSFERAERLVKRNAVSQADLDDAKAAVEAAQAD 117
Query: 77 NERVRSEEKTMQEKVEKSQ 95
E ++ + Q + ++
Sbjct: 118 LEAAKASLASAQLNLRYTE 136
Score = 29.6 bits (67), Expect = 0.31
Identities = 18/67 (26%), Positives = 30/67 (44%), Gaps = 4/67 (5%)
Query: 50 DRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQ 109
D Q A L A+A+L + Q ER + ++ SQ ++ KA +E Q
Sbjct: 60 DDYQLALQAALAQLAAAEAQLELAQRSFERAE----RLVKRNAVSQADLDDAKAAVEAAQ 115
Query: 110 GEMESMK 116
++E+ K
Sbjct: 116 ADLEAAK 122
>gnl|CDD|212132 cd11667, GH38N_Man2A2, N-terminal catalytic domain of Golgi
alpha-mannosidase IIx, and similar proteins; glycoside
hydrolase family 38 (GH38). This subfamily is
represented by human alpha-mannosidase 2x (MX, also
known as mannosyl-oligosaccharide 1,3- 1,6-alpha
mannosidase, EC 3.2.1.114, Man2A2). MX is enzymatically
and functionally very similar to GMII (found in another
subfamily), and as an isoenzyme of GMII. It is thought
to also function in the N-glycosylation pathway. MX
specifically hydrolyzes the same oligosaccharide
substrate as does MII. It specifically removes two
mannosyl residues from GlcNAc(Man)5(GlcNAc)2 to yield
GlcNAc(Man)3(GlcNAc)2(GlcNAc, N-acetylglucosmine).
Length = 344
Score = 31.1 bits (70), Expect = 0.095
Identities = 15/49 (30%), Positives = 24/49 (48%)
Query: 6 VPPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQE 54
VPP ++ER + +L+ + K +L RS+ L L D+ QE
Sbjct: 222 VPPRAITEANVAERAQLLLDQYRKKSKLYRSKVLLVPLGDDFRYDKPQE 270
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 30.6 bits (69), Expect = 0.17
Identities = 14/51 (27%), Positives = 23/51 (45%), Gaps = 5/51 (9%)
Query: 7 PPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYD 57
PP G AV E+ ++ +L +Q + + A LDRL++ D
Sbjct: 939 PPRGP-----DVPPAAVAELRETLADLRSAQRSGDFTAYGAALDRLEKAID 984
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 30.4 bits (69), Expect = 0.21
Identities = 13/46 (28%), Positives = 26/46 (56%)
Query: 77 NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
E++R +EK ++E + E L +KLE Q ++E +++ E+
Sbjct: 323 TEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKL 368
>gnl|CDD|223024 PHA03252, PHA03252, DNA packaging tegument protein UL25;
Provisional.
Length = 589
Score = 30.5 bits (69), Expect = 0.21
Identities = 21/60 (35%), Positives = 29/60 (48%), Gaps = 5/60 (8%)
Query: 5 LVPPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLR 64
L P L RL RD + L + + +E+L R + L ELD LQ++ C DLR
Sbjct: 24 LRCPEEDLRRL---RDDSALRLRRYREDLLRDRLLRRRLGE--ELDDLQKRLQTECEDLR 78
>gnl|CDD|222629 pfam14257, DUF4349, Domain of unknown function (DUF4349). This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 282 and
353 amino acids in length. There is a single completely
conserved residue D that may be functionally important.
Length = 262
Score = 29.9 bits (68), Expect = 0.23
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 4/60 (6%)
Query: 37 QATLATLSTSAELDRLQEKYDK--TCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS 94
+A L L A +RL ++ T DL + EL VQA+ E + ++ + + ++V S
Sbjct: 138 EARLKALR--ASEERLLALLERAGTVEDLLEVERELSRVQAEIESLEAQLRYLDDRVAYS 195
>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
mechanisms].
Length = 352
Score = 30.0 bits (68), Expect = 0.28
Identities = 8/44 (18%), Positives = 20/44 (45%)
Query: 61 ADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 104
A L +A+A L +A +R++ + Q + +++ +
Sbjct: 91 AALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQN 134
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 29.5 bits (67), Expect = 0.36
Identities = 22/86 (25%), Positives = 37/86 (43%), Gaps = 13/86 (15%)
Query: 34 ERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEK 93
++++ LA L+ LD LQEK +K DL + + +L EK K +
Sbjct: 229 DKAEFMLAYLNFKEYLDELQEKLEKLEKDLAKLEEKL-------------EKNPNSKKKN 275
Query: 94 SQGEVYRLKAKLENTQGEMESMKEEY 119
E+ + A LE E + + +Y
Sbjct: 276 KLAELEQQLASLEKRIDEAKELIAKY 301
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 29.2 bits (66), Expect = 0.37
Identities = 27/111 (24%), Positives = 47/111 (42%), Gaps = 15/111 (13%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATLST-----SAELDRLQEKYDKTCADLRRA 66
LSR LS+ A + + ++EL Q LA L E L+++ A+L R
Sbjct: 54 LSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERL 113
Query: 67 QAELRVVQ----------ADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
Q EL ++ +N +R E ++++ E + E RL+ +
Sbjct: 114 QKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164
Score = 28.8 bits (65), Expect = 0.49
Identities = 14/59 (23%), Positives = 31/59 (52%)
Query: 63 LRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
L Q EL +Q + ++ + +Q++ ++ + E+ L+A+LE Q E+ +K+
Sbjct: 68 LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSAN 126
Score = 28.8 bits (65), Expect = 0.58
Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 1/77 (1%)
Query: 10 GTLSRLLSERDKAVLEMDKSKEELERSQATLATL-STSAELDRLQEKYDKTCADLRRAQA 68
L+ L E + E+ + ELER Q LA + SA L E+ + +L +
Sbjct: 87 EQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQ 146
Query: 69 ELRVVQADNERVRSEEK 85
E ++A+NER++ E+
Sbjct: 147 ENEALEAENERLQENEQ 163
Score = 25.7 bits (57), Expect = 6.4
Identities = 18/86 (20%), Positives = 39/86 (45%), Gaps = 10/86 (11%)
Query: 46 SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQG--------- 96
EL LQE+ + L Q E + ++ + + +E + +Q+++ + +
Sbjct: 72 QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELD 131
Query: 97 -EVYRLKAKLENTQGEMESMKEEYER 121
E L+ +L + E E+++ E ER
Sbjct: 132 EENRELREELAELKQENEALEAENER 157
>gnl|CDD|224455 COG1538, TolC, Outer membrane protein [Cell envelope biogenesis,
outer membrane / Intracellular trafficking and
secretion].
Length = 457
Score = 29.4 bits (66), Expect = 0.49
Identities = 16/113 (14%), Positives = 49/113 (43%), Gaps = 15/113 (13%)
Query: 13 SRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRV 72
R+ + A ++ +LE ++ +LA +A Y DL AQ +L +
Sbjct: 125 GRVRANVRAAEAAAKAARAQLEAARLSLAAEVATA--------Y----FDLLAAQEQLAL 172
Query: 73 VQADNERVRSEEKTMQEKVEK---SQGEVYRLKAKLENTQGEMESMKEEYERS 122
+ + + +++ + ++ +V + +A+L + + ++ + + + ++
Sbjct: 173 AEETLAAAEEQLELAEKRYDAGLATRLDVLQAEAQLASARAQLAAAQAQLAQA 225
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.0 bits (66), Expect = 0.52
Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 9/67 (13%)
Query: 64 RRAQAELRVVQADNERVRSEEKTMQ---------EKVEKSQGEVYRLKAKLENTQGEMES 114
R + E + + NE + E++ +Q E +EK + E+ + + +LE Q E+E
Sbjct: 69 LRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEK 128
Query: 115 MKEEYER 121
+EE E
Sbjct: 129 KEEELEE 135
>gnl|CDD|221415 pfam12083, DUF3560, Domain of unknown function (DUF3560). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria. This domain is about 120
amino acids in length. This domain has a conserved GHHSE
sequence motif.
Length = 127
Score = 28.4 bits (64), Expect = 0.53
Identities = 14/52 (26%), Positives = 24/52 (46%)
Query: 20 DKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
KAV DK++ +R+ A + D + + K ADLR +Q ++
Sbjct: 67 GKAVEAQDKAEYWEQRAAAAARGAIARDDPDAVLRRLKKLLADLRASQRRMK 118
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 29.0 bits (65), Expect = 0.57
Identities = 16/85 (18%), Positives = 32/85 (37%), Gaps = 3/85 (3%)
Query: 37 QATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQG 96
A + + + + +D R A + +A+ E +T++E V
Sbjct: 865 VAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESV---GA 921
Query: 97 EVYRLKAKLENTQGEMESMKEEYER 121
V ++A+L T+ + S E R
Sbjct: 922 MVDEIRARLAETRAALASGGRELPR 946
>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
Length = 508
Score = 29.1 bits (65), Expect = 0.59
Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 10/104 (9%)
Query: 11 TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
TL+R + +A E + R A L A L +L+ + A+L+RA L
Sbjct: 194 TLAREAEDAARAADEAKTAAAAAAREAAPLK-----ASLRKLERAKARADAELKRADKAL 248
Query: 71 RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMES 114
+ D + R+EE+ Q+K + E +L+ + + E+
Sbjct: 249 AAAKTDEAKARAEER--QQKAAQQAAEA---ATQLDTAKADAEA 287
>gnl|CDD|153363 cd07679, F-BAR_PACSIN2, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons 2 (PACSIN2). F-BAR
domains are dimerization modules that bind and bend
membranes and are found in proteins involved in membrane
dynamics and actin reorganization. Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins,
also called Synaptic dynamin-associated proteins
(Syndapins), act as regulators of cytoskeletal and
membrane dynamics. Vetebrates harbor three isoforms with
distinct expression patterns and specific functions.
PACSIN 2 or Syndapin II is expressed ubiquitously and is
involved in the regulation of tubulin polymerization. It
associates with Golgi membranes and forms a complex with
dynamin II which is crucial in promoting vesicle
formation from the trans-Golgi network. PACSIN 2
contains an N-terminal F-BAR domain and a C-terminal SH3
domain. F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 258
Score = 28.9 bits (64), Expect = 0.64
Identities = 18/72 (25%), Positives = 36/72 (50%)
Query: 48 ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
E++ ++ Y C + + A + +AD + K +Q+KVEK + +V + K K E
Sbjct: 134 EVEAAKKAYHTACKEEKLATSREANSKADPALNPEQLKKLQDKVEKCKQDVLKTKEKYEK 193
Query: 108 TQGEMESMKEEY 119
+ E++ +Y
Sbjct: 194 SLKELDQTTPQY 205
>gnl|CDD|216941 pfam02233, PNTB, NAD(P) transhydrogenase beta subunit. This family
corresponds to the beta subunit of NADP transhydrogenase
in prokaryotes, and either the protein N- or C terminal
in eukaryotes. The domain is often found in conjunction
with pfam01262. Pyridine nucleotide transhydrogenase
catalyzes the reduction of NAD+ to NADPH. A complete
loss of activity occurs upon mutation of Gly314 in E.
coli.
Length = 464
Score = 28.9 bits (65), Expect = 0.69
Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 8/50 (16%)
Query: 9 PGTLSRLLSERD---KAVLEMDKSKEELERSQATLA-----TLSTSAELD 50
PG ++ LL+E + VLEMD+ ++ + L T++ +AE D
Sbjct: 354 PGHMNVLLAEANVPYDIVLEMDEINDDFADTDVVLVIGANDTVNPAAEED 403
>gnl|CDD|240301 PTZ00166, PTZ00166, DNA polymerase delta catalytic subunit;
Provisional.
Length = 1054
Score = 28.8 bits (65), Expect = 0.72
Identities = 9/28 (32%), Positives = 14/28 (50%)
Query: 9 PGTLSRLLSERDKAVLEMDKSKEELERS 36
P + L++ R KA EM K+ L +
Sbjct: 614 PLIVEELIAARKKAKKEMKDEKDPLLKK 641
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.7 bits (65), Expect = 0.78
Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 12/54 (22%)
Query: 25 EMDKSKEELER--------SQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
++ +++ ELE ++ TL+TLS L +L+ + +T L+ AQ +L
Sbjct: 95 KLRQAQAELEALKDDNDEETRETLSTLS----LRQLESRLAQTLDQLQNAQNDL 144
Score = 26.8 bits (60), Expect = 3.1
Identities = 23/104 (22%), Positives = 52/104 (50%), Gaps = 13/104 (12%)
Query: 15 LLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQ 74
LL DK V +++LE++ A L + DR +E+ ++ L +A A+LR Q
Sbjct: 53 LLEAEDKLV------QQDLEQTLALLDKI------DRQKEETEQLKQQLAQAPAKLRQAQ 100
Query: 75 ADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 118
A+ E ++ + + + + +L+++L T ++++ + +
Sbjct: 101 AELEALKDDNDEETRETLSTL-SLRQLESRLAQTLDQLQNAQND 143
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 28.5 bits (64), Expect = 0.80
Identities = 26/105 (24%), Positives = 54/105 (51%)
Query: 18 ERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADN 77
E + LE +K + E + + EL+ L+E+ ++ L + EL ++
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRR 823
Query: 78 ERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
ER+ E + ++E++E+ + ++ L+ +LE + E+E +KEE E
Sbjct: 824 ERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEEL 868
Score = 28.5 bits (64), Expect = 0.99
Identities = 20/76 (26%), Positives = 43/76 (56%)
Query: 46 SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 105
EL L+ + +K +L+ + ELR ++ E +R + + ++ ++E+ + E+ L+ +L
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEEL 732
Query: 106 ENTQGEMESMKEEYER 121
E Q +E ++EE E
Sbjct: 733 EQLQSRLEELEEELEE 748
Score = 27.0 bits (60), Expect = 3.0
Identities = 26/113 (23%), Positives = 59/113 (52%), Gaps = 5/113 (4%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
L L E + E+++ +EELE ++ + L +EL+ L+E+ ++ +L + E+
Sbjct: 241 LEELEEELSRLEEELEELQEELEEAEKEIEELK--SELEELREELEELQEELLELKEEIE 298
Query: 72 VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESM---KEEYER 121
++ + +R + ++ ++E+ + + LK K+E + E+E EE E+
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQ 351
Score = 26.6 bits (59), Expect = 3.9
Identities = 20/97 (20%), Positives = 47/97 (48%), Gaps = 2/97 (2%)
Query: 25 EMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEE 84
+ +EEL +A LA + EL+ L+ + + L R L ++ + + + +E
Sbjct: 377 LFEALREELAELEAELAEIR--NELEELKREIESLEERLERLSERLEDLKEELKELEAEL 434
Query: 85 KTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
+ +Q ++E+ E+ L+ +LE + ++ ++ E
Sbjct: 435 EELQTELEELNEELEELEEQLEELRDRLKELERELAE 471
Score = 26.6 bits (59), Expect = 4.4
Identities = 18/97 (18%), Positives = 48/97 (49%)
Query: 24 LEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSE 83
LE + + + EL L+E+ ++ + L + EL ++ + E ++
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQER 759
Query: 84 EKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
+ ++E++E + + +LK ++E + + ++++EE E
Sbjct: 760 LEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 28.4 bits (64), Expect = 0.81
Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 47 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 106
EL L+E+ + +L + + L+ + EK +E++EK +L +LE
Sbjct: 334 EELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEK--REQLEKLLETKEKLSEELE 391
Query: 107 NTQGEMESMKEEYER 121
+ E++ +KEE E
Sbjct: 392 ELEEELKELKEELES 406
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 27.9 bits (63), Expect = 0.84
Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 2/56 (3%)
Query: 32 ELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL--RVVQADNERVRSEEK 85
EL L + AEL +Y K A R EL + +A+ E + +
Sbjct: 55 ELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 28.5 bits (63), Expect = 0.85
Identities = 18/99 (18%), Positives = 41/99 (41%)
Query: 23 VLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRS 82
+ +K E ER A E + ++++ +L +AQ EL + + +++
Sbjct: 92 TAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQT 151
Query: 83 EEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
KT+ E+ + + + L+A + Q +K +
Sbjct: 152 RLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLD 190
Score = 27.7 bits (61), Expect = 1.8
Identities = 17/78 (21%), Positives = 36/78 (46%)
Query: 44 STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKA 103
S +LD ++ + +L AQ E R + + E RSE + +++ E + E+ +
Sbjct: 71 SGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQ 130
Query: 104 KLENTQGEMESMKEEYER 121
L Q E+ + ++ +
Sbjct: 131 NLAKAQQELARLTKQAQD 148
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 28.4 bits (64), Expect = 0.91
Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 14/76 (18%)
Query: 46 SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 105
+AEL RL+++ +K ++ R + +L NE ++ E VEK K KL
Sbjct: 813 AAELARLEKELEKLEKEIDRIEKKL-----SNEGFVAKAPE--EVVEKE-------KEKL 858
Query: 106 ENTQGEMESMKEEYER 121
Q ++ ++E
Sbjct: 859 AEYQVKLAKLEERLAV 874
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 28.1 bits (63), Expect = 1.2
Identities = 17/113 (15%), Positives = 48/113 (42%), Gaps = 2/113 (1%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATL-STSAELDRLQ-EKYDKTCADLRRAQAE 69
L L + + E++ + + L +L +L+ E D +L RA+ +
Sbjct: 153 LEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEK 212
Query: 70 LRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
L+ + + + + ++E++++ + ++ L K E+ +++ E+
Sbjct: 213 LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQC 265
>gnl|CDD|222630 pfam14258, DUF4350, Domain of unknown function (DUF4350). This
domain family is found in bacteria, archaea and
eukaryotes, and is approximately 70 amino acids in
length.
Length = 69
Score = 26.8 bits (60), Expect = 1.2
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 5/48 (10%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATL-----ATLSTSAELDRLQE 54
L+ LL ER V + + L+ S TL + + E++ L E
Sbjct: 10 LAELLEERGVEVERVTSPYDALDGSDGTLLVIGPSLNLSPEEIEALLE 57
>gnl|CDD|233594 TIGR01844, type_I_sec_TolC, type I secretion outer membrane
protein, TolC family. Members of This model are outer
membrane proteins from the TolC subfamily within the RND
(Resistance-Nodulation-cell Division) efflux systems.
These proteins, unlike the NodT subfamily, appear not to
be lipoproteins. All are believed to participate in type
I protein secretion, an ABC transporter system for
protein secretion without cleavage of a signal sequence,
although they may, like TolC, participate also in the
efflux of smaller molecules as well. This family
includes the well-documented examples TolC (E. coli),
PrtF (Erwinia), and AprF (Pseudomonas aeruginosa)
[Protein fate, Protein and peptide secretion and
trafficking, Transport and binding proteins, Porins].
Length = 415
Score = 28.1 bits (63), Expect = 1.3
Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 5/79 (6%)
Query: 44 STSAELDRLQEKYD--KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRL 101
S S+ L Q +D T +R A+A A E +R+ + + + ++ EV R
Sbjct: 69 SGSSTLTLSQPLFDGGSTWNAVRAAEAA---ALAARETLRATAQDLILRTAEAYMEVLRA 125
Query: 102 KAKLENTQGEMESMKEEYE 120
+ L + + ++KE+ +
Sbjct: 126 QEILALAEANLAALKEQLD 144
>gnl|CDD|215558 PLN03073, PLN03073, ABC transporter F family; Provisional.
Length = 718
Score = 27.9 bits (62), Expect = 1.4
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 15 LLSERDKAVLEMDKSKEELERSQ 37
LLSERD A +E K KEE +R
Sbjct: 121 LLSERDLAKIERRKRKEERQREV 143
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 27.7 bits (62), Expect = 1.6
Identities = 10/58 (17%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 61 ADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 118
A+++ + L + ++NE +++E + +Q++ E+S +++ +++ E+ E+
Sbjct: 66 AEVKELRKRLAKLISENEALKAENERLQKR-EQSID--QQIQQAVQSETQELTKEIEQ 120
>gnl|CDD|215529 PLN02978, PLN02978, pyridoxal kinase.
Length = 308
Score = 27.4 bits (61), Expect = 1.7
Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 26 MDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNE 78
K + L+++ A LA S A L R Y + AD + + ELR+VQ+ ++
Sbjct: 243 SHKYPDNLDKA-AELAVSSLQAVLRRTLADYKRAGADPKSSSLELRLVQSQDD 294
>gnl|CDD|179310 PRK01622, PRK01622, OxaA-like protein precursor; Validated.
Length = 256
Score = 27.4 bits (61), Expect = 1.7
Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 7/40 (17%)
Query: 84 EKTMQEKVEKSQGEVYRLKAKLENT-------QGEMESMK 116
++ MQEK+ + E+ +++AKL+ T + + E M+
Sbjct: 86 QRGMQEKMAVMKPELDKIQAKLKVTKDLEKQKEYQKEMME 125
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 27.7 bits (62), Expect = 1.7
Identities = 14/96 (14%), Positives = 40/96 (41%), Gaps = 3/96 (3%)
Query: 25 EMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEE 84
E+++ +E +E + LA + + + + L + Q E++ ++ + S
Sbjct: 86 EIEQKRERIEELKRALAQRRSDLSS-ASYQLEKRRASQLEKLQDEIKRTRSKLNALHSLL 144
Query: 85 KTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
+ + + +++ L+ + +G +S E Y
Sbjct: 145 AEKRSFLCRELAKLFPLRRVIRGRKG--DSSSEPYT 178
>gnl|CDD|216534 pfam01499, Herpes_UL25, Herpesvirus UL25 family. The herpesvirus
UL25 gene product is a virion component involved in
virus penetration and capsid assembly. The product of
the UL25 gene is required for packaging but not
cleavage of replicated viral DNA. This family includes
a number of herpesvirus proteins: EHV-1 36, EBV BVRF1,
HCMV UL77, ILTV ORF2, and VZV gene 34.
Length = 538
Score = 27.7 bits (62), Expect = 1.8
Identities = 12/78 (15%), Positives = 23/78 (29%), Gaps = 10/78 (12%)
Query: 5 LVPPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLR 64
L +L R L + L +++ AELD L++ + A++
Sbjct: 19 LRCDEPSL-FETRPRFALRLRTRAEQLRLAKAKKRAL----KAELDNLEQLHQTRSAEID 73
Query: 65 RA-----QAELRVVQADN 77
V +
Sbjct: 74 ADLRPIENQLEEVADPLD 91
>gnl|CDD|223914 COG0845, AcrA, Membrane-fusion protein [Cell envelope biogenesis,
outer membrane].
Length = 372
Score = 27.4 bits (60), Expect = 2.1
Identities = 17/69 (24%), Positives = 28/69 (40%), Gaps = 4/69 (5%)
Query: 11 TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
L+R + A L + + +L +A ++ AELD Q A + AQA L
Sbjct: 113 QLARAQALLAPAELGDLQREAKLAAEKAAVSQ----AELDAAQALLRAAEALVEAAQAAL 168
Query: 71 RVVQADNER 79
+ + E
Sbjct: 169 ASAKLNLEY 177
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 27.4 bits (62), Expect = 2.2
Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 2/83 (2%)
Query: 27 DKSKEELERSQATLATLS--TSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEE 84
+ + LE ++A + L+ S E+ E+ + A AEL + V +E
Sbjct: 754 AEDRARLEANEAYIKRLARLESLEILADDEEAPEGAASAVVGGAELFLPLEGLIDVEAEL 813
Query: 85 KTMQEKVEKSQGEVYRLKAKLEN 107
+++++ K + E+ R++ KL N
Sbjct: 814 ARLEKELAKLEKEIERVEKKLSN 836
>gnl|CDD|220640 pfam10234, Cluap1, Clusterin-associated protein-1. This protein is
conserved from worms to humans. The protein of 413 amino
acids contains a central coiled-coil domain, possibly
the region that binds to clusterin. Cluap1 expression is
highest in the nucleus and gradually increases during
late S to G2/M phases of the cell cycle and returns to
the basal level in the G0/G1 phases. In addition, it is
upregulated in colon cancer tissues compared to
corresponding non-cancerous mucosa. It thus plays a
crucial role in the life of the cell.
Length = 269
Score = 26.9 bits (60), Expect = 2.4
Identities = 16/56 (28%), Positives = 34/56 (60%)
Query: 66 AQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
A L+ +QA+ + ++S+E ++ K+E+ + E+ R + +L+ Q + EEYE+
Sbjct: 176 LAARLQQLQAELDNLKSDEANLEAKIERKKQELERTQKRLQALQSVRPAFMEEYEK 231
>gnl|CDD|223841 COG0770, MurF, UDP-N-acetylmuramyl pentapeptide synthase [Cell
envelope biogenesis, outer membrane].
Length = 451
Score = 26.9 bits (60), Expect = 2.7
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 9 PGTLSRLLSERDKAVLEMDKSK 30
P TL RL ++ + AVLEM +
Sbjct: 145 PLTLLRLPADTEYAVLEMGMNH 166
>gnl|CDD|216101 pfam00755, Carn_acyltransf, Choline/Carnitine o-acyltransferase.
Length = 590
Score = 26.9 bits (60), Expect = 3.1
Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 6/51 (11%)
Query: 6 VPP-PGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEK 55
VP TL R L + K +L S+EE ER++A + S E ++LQE
Sbjct: 7 VPALEDTLDRYL-DSVKPLL----SEEEFERTEALVKDFSAGGEGEKLQEY 52
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have
LPXTG C-terminal anchoring motifs and a substantial
number have the KxYKxGKxW putative sorting signal at the
N-terminus. The tetracycline resistance plasmid pCF10 in
Enterococcus faecalis promotes conjugal plasmid transfer
in response to sex pheromones, but PgrA/Sec10 encoded by
that plasmid, a member of this family, specifically
inhibits the ability of cells to receive homologous
plasmids. The phenomenon is called surface exclusion.
Length = 356
Score = 26.6 bits (59), Expect = 3.3
Identities = 15/85 (17%), Positives = 30/85 (35%), Gaps = 11/85 (12%)
Query: 27 DKSKEELERSQATLATLSTS-----AELDRLQEKYDKTCADLRRAQAELRVVQADNERVR 81
+ ++ L +Q A + EL Q + +T AQ L QA
Sbjct: 278 NTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQT------AQNNLATAQAALANAE 331
Query: 82 SEEKTMQEKVEKSQGEVYRLKAKLE 106
+ +E + ++ + +A L+
Sbjct: 332 ARLAKAKEALANLNADLAKKQAALD 356
>gnl|CDD|233229 TIGR00999, 8a0102, Membrane Fusion Protein cluster 2 (function with
RND porters). [Transport and binding proteins, Other].
Length = 265
Score = 26.6 bits (59), Expect = 3.5
Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 6/66 (9%)
Query: 54 EKYDKTCADLRRAQAELRVVQADNERVRSE-EKTMQEKVEKSQGEVYRLKAKLENTQGEM 112
+ K A+L+ AQ + + R E EK + E+ + E + LE Q E+
Sbjct: 16 PELAKMAAELKVAQK-----RVELARKTYEREKKLFEQGVIPRQEFESAEYALEEAQAEV 70
Query: 113 ESMKEE 118
++ K E
Sbjct: 71 QAAKSE 76
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 26.5 bits (59), Expect = 3.5
Identities = 24/82 (29%), Positives = 44/82 (53%), Gaps = 2/82 (2%)
Query: 27 DKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKT 86
D+ E E + TL L AE++RLQ ++ L + EL ++QA ++ + KT
Sbjct: 46 DRDLEFRESLEETLRKLE--AEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKT 103
Query: 87 MQEKVEKSQGEVYRLKAKLENT 108
+++K++ + EV RLK ++
Sbjct: 104 LEQKLKNEKEEVQRLKNIIQQR 125
>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug
resistance protein A). [Transport and binding proteins,
Other].
Length = 334
Score = 26.7 bits (59), Expect = 3.6
Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 2/63 (3%)
Query: 61 ADLRRAQAELRVVQADNERVRSEEKTMQ--EKVEKSQGEVYRLKAKLENTQGEMESMKEE 118
L AEL + +A+ +T Q V++ Q +V LK KLE + ++ + +
Sbjct: 71 VRLDPTNAELALAKAEANLAALVRQTKQLEITVQQLQAKVESLKIKLEQAREKLLQAELD 130
Query: 119 YER 121
R
Sbjct: 131 LRR 133
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 27.0 bits (60), Expect = 3.6
Identities = 14/84 (16%), Positives = 35/84 (41%), Gaps = 8/84 (9%)
Query: 46 SAELDRLQEKYDKTCADLRRAQAELRVVQAD--------NERVRSEEKTMQEKVEKSQGE 97
+L L+ + +L + L Q + +++R+ E ++E ++ E
Sbjct: 255 FEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQE 314
Query: 98 VYRLKAKLENTQGEMESMKEEYER 121
+ AKL + E+E ++++
Sbjct: 315 LSAANAKLAADRSELELLEDQKGA 338
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 26.6 bits (59), Expect = 3.7
Identities = 20/105 (19%), Positives = 55/105 (52%), Gaps = 5/105 (4%)
Query: 18 ERDKAVLEM--DKSKEELERSQATLATLSTSAE--LDRLQEKYDKTCADLRRAQAELRVV 73
KA L+ +S+E+L+ +LA LS+ + + + E+ ++ A+L + + +++
Sbjct: 76 NELKARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQNLK-QLL 134
Query: 74 QADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 118
+ E + + +++++ +S E L +++ GE++ + +E
Sbjct: 135 KPLREVLEKFREQLEQRIHESAEERSTLLEEIDRLLGEIQQLAQE 179
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 26.5 bits (59), Expect = 3.8
Identities = 12/70 (17%), Positives = 30/70 (42%)
Query: 44 STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKA 103
+ R Q KYD+ + + +AE+ + + + + + + K ++K+
Sbjct: 210 KNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKS 269
Query: 104 KLENTQGEME 113
K+E Q ++
Sbjct: 270 KIEQFQKVIK 279
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 26.6 bits (59), Expect = 4.1
Identities = 30/116 (25%), Positives = 53/116 (45%), Gaps = 8/116 (6%)
Query: 5 LVPPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLR 64
L L R E K E+DK+ EEL ++ L L EL+ L++KY + +
Sbjct: 607 LKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEEL--RKELEELEKKYSEE--EYE 662
Query: 65 RAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
+ E + + +R+E + ++++ E E+ + KL+ E E K+E E
Sbjct: 663 ELREEYLELSRELAGLRAELEELEKRRE----EIKKTLEKLKEELEEREKAKKELE 714
>gnl|CDD|131452 TIGR02399, salt_tol_Pase, glucosylglycerol 3-phosphatase. Proteins
in this family are glucosylglycerol-phosphate
phosphatase, with the gene symbol stpA (Salt Tolerance
Protein A). A motif characteristic of acid phosphatases
is found, but otherwise this family shows little
sequence similarity to other phosphatases. This enzyme
acts on the glucosylglycerol phosphate, product of
glucosylglycerol phosphate synthase and immediate
precursor of the osmoprotectant glucosylglycerol.
Length = 389
Score = 26.3 bits (58), Expect = 4.1
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 91 VEKSQGEVYRLKAKLENTQG 110
V+ S GEVYR K+E +G
Sbjct: 337 VDSSSGEVYRPSLKIEGLKG 356
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 26.2 bits (58), Expect = 4.1
Identities = 9/53 (16%), Positives = 29/53 (54%)
Query: 69 ELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
++ + + +R+ K +++ ++K++ E+ L LE + E+E ++ + +
Sbjct: 11 AIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQ 63
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 26.4 bits (58), Expect = 4.5
Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 2/74 (2%)
Query: 20 DKAVLEMDKSKEELERSQATLAT--LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADN 77
A ++D +++L +QA L + T ELD + + K A L A A Q +
Sbjct: 215 KTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKALAALEAANAADEDPQDRD 274
Query: 78 ERVRSEEKTMQEKV 91
V + + M
Sbjct: 275 AAVEAAARLMGAAR 288
>gnl|CDD|235595 PRK05762, PRK05762, DNA polymerase II; Reviewed.
Length = 786
Score = 26.4 bits (59), Expect = 4.5
Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 5/30 (16%)
Query: 9 PGTLSRLLSERDKAVLEMDKSKEELERSQA 38
P + RL RD+A EM+K SQA
Sbjct: 468 PEIVERLWEGRDEAKREMNK-----PLSQA 492
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 26.2 bits (58), Expect = 4.6
Identities = 18/73 (24%), Positives = 39/73 (53%)
Query: 45 TSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 104
T+ L +K + L Q E + +Q + E + ++ + +Q K+++ Q E+ + KA+
Sbjct: 22 TTVFAALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAE 81
Query: 105 LENTQGEMESMKE 117
++ Q E+ +KE
Sbjct: 82 IKKLQKEIAELKE 94
Score = 25.4 bits (56), Expect = 8.3
Identities = 12/86 (13%), Positives = 39/86 (45%)
Query: 35 RSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS 94
T+ S ++ K + + + Q E+ + E ++S+ +Q+++++S
Sbjct: 19 AFLTTVFAALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQS 78
Query: 95 QGEVYRLKAKLENTQGEMESMKEEYE 120
+ E+ +L+ ++ + + +E +
Sbjct: 79 KAEIKKLQKEIAELKENIVERQELLK 104
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 26.4 bits (59), Expect = 4.7
Identities = 17/88 (19%), Positives = 30/88 (34%), Gaps = 2/88 (2%)
Query: 33 LERSQATLATLSTSAELDRLQE--KYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEK 90
+ R++A A + + K A + RA+A+ QA + + + K
Sbjct: 554 IARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPK 613
Query: 91 VEKSQGEVYRLKAKLENTQGEMESMKEE 118
+ R KAK Q E +
Sbjct: 614 KAAVAAAIARAKAKKAEQQANAEPEEPV 641
>gnl|CDD|153329 cd07645, I-BAR_IMD_BAIAP2L1, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Brain-specific
Angiogenesis Inhibitor 1-Associated Protein 2-Like 1.
The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs
(I-BAR) domain, is a dimerization and lipid-binding
module that bends membranes and induces membrane
protrusions. BAIAP2L1 (Brain-specific Angiogenesis
Inhibitor 1-Associated Protein 2-Like 1) is also known
as IRTKS (Insulin Receptor Tyrosine Kinase Substrate).
It is widely expressed, serves as a substrate for the
insulin receptor, and binds the small GTPase Rac. It
plays a role in regulating the actin cytoskeleton and
colocalizes with F-actin, cortactin, VASP, and vinculin.
BAIAP2L1 expression leads to the formation of short
actin bundles, distinct from filopodia-like protrusions
induced by the expression of the related protein IRSp53.
It contains an N-terminal IMD, an SH3 domain, and a WASP
homology 2 (WH2) actin-binding motif at the C-terminus.
The IMD domain of BAIAP2L1 binds and bundles actin
filaments, and binds the small GTPase Rac.
Length = 226
Score = 26.0 bits (57), Expect = 4.8
Identities = 14/57 (24%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 64 RRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
R+ +++ + A +R ++E K + +EKSQ ++ +++ K +QG + K E++
Sbjct: 101 RKTDLDVKYMTATLKRYQTEHKNKLDSLEKSQADLKKIRRK---SQGRRNASKYEHK 154
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 26.1 bits (58), Expect = 5.3
Identities = 22/84 (26%), Positives = 39/84 (46%), Gaps = 11/84 (13%)
Query: 47 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRS--------EEKTMQEKVEKSQGEV 98
AE+ LQ + D+ A++ A E Q + E K +EK++ +Q +
Sbjct: 142 AEIAALQAEIDEAQAEVNAAYQE---AQCEAEGTGGTGVAGKGPVYKEKREKLDAAQARL 198
Query: 99 YRLKAKLENTQGEMESMKEEYERS 122
LKA+L+ ++E+ K ER+
Sbjct: 199 ETLKARLDAAIAQLEAQKAALERN 222
>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
DevB family. Members of this protein family are found
mostly in the Cyanobacteria, but also in the
Planctomycetes. DevB from Anabaena sp. strain PCC 7120
is partially characterized as a membrane fusion protein
of the DevBCA ABC exporter, probably a glycolipid
exporter, required for heterocyst formation. Most
Cyanobacteria have one member only, but Nostoc sp. PCC
7120 has seven members.
Length = 327
Score = 25.9 bits (57), Expect = 5.3
Identities = 18/71 (25%), Positives = 34/71 (47%)
Query: 47 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 106
AELD + + D+ A L + +A + + +R + + V Q + RL+A+LE
Sbjct: 55 AELDVARTQLDEAKARLAQVRAGAKKGEIAAQRAARAAAKLFKDVAAQQATLNRLEAELE 114
Query: 107 NTQGEMESMKE 117
Q E++ +
Sbjct: 115 TAQREVDRYRS 125
>gnl|CDD|153283 cd07599, BAR_Rvs167p, The Bin/Amphiphysin/Rvs (BAR) domain of
Saccharomyces cerevisiae Reduced viability upon
starvation protein 167 and similar proteins. BAR
domains are dimerization, lipid binding and curvature
sensing modules found in many different proteins with
diverse functions. This subfamily is composed of fungal
proteins with similarity to Saccharomyces cerevisiae
Reduced viability upon starvation protein 167 (Rvs167p)
and Schizosaccharomyces pombe Hob1 (homolog of Bin1). S.
cerevisiae Rvs167p plays a role in regulation of the
actin cytoskeleton, endocytosis, and sporulation. It
forms a heterodimer with another BAR domain protein
Rvs161p. Rvs161p and Rvs167p share common functions but
are not interchangeable. Their BAR domains cannot be
replaced with each other and the overexpression of one
cannot suppress the mutant phenotypes of the other.
Rvs167p also interacts with the GTPase activating
protein (GAP) Gyp5p, which is involved in ER to Golgi
vesicle trafficking. BAR domains form dimers that bind
to membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 216
Score = 25.7 bits (57), Expect = 6.0
Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 2/60 (3%)
Query: 14 RLLSERDKAVLEMDKSKEELERSQATLATLST--SAELDRLQEKYDKTCADLRRAQAELR 71
+ + +RD L+ DK + +L + LS +L +L+ K ++ + L+
Sbjct: 114 KTIKKRDHKKLDYDKLQNKLNKLLQKKKELSLKDEKQLAKLERKLEEAKEEYEALNELLK 173
>gnl|CDD|236222 PRK08292, PRK08292, AMP nucleosidase; Provisional.
Length = 489
Score = 26.0 bits (58), Expect = 6.2
Identities = 15/42 (35%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 43 LSTSAELDRLQEKYDKTCADLRRA-QAELRVVQADNERVRSE 83
L +A +DRL+ Y+++ A LR A A +R + +ER R+
Sbjct: 10 LDPAAAVDRLEALYERSTAALRDAFAAYIRGGELPDERARAY 51
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 26.1 bits (58), Expect = 7.1
Identities = 22/108 (20%), Positives = 40/108 (37%), Gaps = 5/108 (4%)
Query: 8 PPGTLSRLLSERDKAVLEMDKSKEELERS---QATLATLSTSAELDRLQEKYDKTCADLR 64
P L L LE+D + E LA L AEL L+ + A +
Sbjct: 417 TPAALEDLRRRIAALELELDALEREAALGADHDERLAELR--AELAALEAELAALEARWQ 474
Query: 65 RAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM 112
+ + + + A + ++ + + ++ L+A L + QGE
Sbjct: 475 QEKELVEAILALRAELEADADAPADDDAALRAQLAELEAALASAQGEE 522
>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203).
This domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 120
Score = 25.3 bits (56), Expect = 7.2
Identities = 18/58 (31%), Positives = 30/58 (51%)
Query: 14 RLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
LL E + + E+ ++K ELE +A L+ S E L+E+ + +R A AE+
Sbjct: 10 ELLPELRRLLEEVREAKAELEELEAALSEGERSLEARGLKEELRELEEQIRAAIAEIE 67
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 25.7 bits (56), Expect = 7.4
Identities = 14/74 (18%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 48 ELDRLQEKYD---KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 104
+ D L EK K ++ + + R +++D+ + + M++K ++ G++ +LK++
Sbjct: 279 QNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSE 338
Query: 105 LENTQGEMESMKEE 118
+E + E+++++
Sbjct: 339 IELKEEEIKALQSN 352
>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968). This
family of proteins has no known function.
Length = 192
Score = 25.5 bits (56), Expect = 7.5
Identities = 10/68 (14%), Positives = 37/68 (54%)
Query: 51 RLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQG 110
+L+ + + + ++A +QAD + +++E+ + + ++++ E L+A+ + Q
Sbjct: 109 QLEAQKAQLERQIAASEARAERLQADLQLAQAQEQQVAARQQQARQEAVALEAQRQAAQA 168
Query: 111 EMESMKEE 118
++ ++ +
Sbjct: 169 QLRKLQRQ 176
>gnl|CDD|153364 cd07680, F-BAR_PACSIN1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons 1 (PACSIN1). F-BAR
domains are dimerization modules that bind and bend
membranes and are found in proteins involved in membrane
dynamics and actin reorganization. Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins,
also called Synaptic dynamin-associated proteins
(Syndapins), act as regulators of cytoskeletal and
membrane dynamics. Vetebrates harbor three isoforms with
distinct expression patterns and specific functions.
PACSIN 1 or Syndapin I is expressed specifically in the
brain and is localized in neurites and synaptic boutons.
It binds the brain-specific proteins dynamin I,
synaptojanin, synapsin I, and neural Wiskott-Aldrich
syndrome protein (nWASP), and functions as a link
between the cytoskeletal machinery and synaptic vesicle
endocytosis. PACSIN 1 interacts with huntingtin and may
be implicated in the neuropathology of Huntington's
disease. It contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 258
Score = 25.4 bits (55), Expect = 8.0
Identities = 15/75 (20%), Positives = 35/75 (46%)
Query: 48 ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
EL+ ++ Y C + + A +A+ ++K +Q+KV+K + +V + + K E
Sbjct: 134 ELEAAKKAYHLACKEEKLAMTREANSKAEQSVTPEQQKKLQDKVDKCKQDVQKTQEKYEK 193
Query: 108 TQGEMESMKEEYERS 122
++ +Y +
Sbjct: 194 VLDDVGKTTPQYMEN 208
>gnl|CDD|221380 pfam12017, Tnp_P_element, Transposase protein. Protein in this
family are transposases found in insects. This region is
about 230 amino acids in length and is found associated
with pfam05485.
Length = 236
Score = 25.6 bits (56), Expect = 8.0
Identities = 10/40 (25%), Positives = 23/40 (57%)
Query: 77 NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMK 116
N E K++++K+ + E++ L+ +LE +Q +S+
Sbjct: 13 NHTTLMENKSLRQKIRALEKEMHSLRQQLEESQQLEKSLG 52
>gnl|CDD|131264 TIGR02209, ftsL_broad, cell division protein FtsL. This model
represents FtsL, both forms similar to that in E. coli
and similar to that in B. subtilis. FtsL is one of the
later proteins active in cell division septum
formation. FtsL is small, low in complexity, and highly
divergent. The scope of this model is broader than that
of the pfam04999.3 for FtsL, as this one includes FtsL
from Bacillus subtilis and related species [Cellular
processes, Cell division].
Length = 85
Score = 24.5 bits (54), Expect = 8.0
Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 4/63 (6%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
+L +E K LE+DK ++E Q +A LS ++++ +K ++ A ++
Sbjct: 26 TRQLNNELQKLQLEIDKLQKEWRDLQLEVAELSRHERIEKIAKKQLG----MKLPDANIK 81
Query: 72 VVQ 74
VV
Sbjct: 82 VVV 84
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 25.7 bits (57), Expect = 8.4
Identities = 21/96 (21%), Positives = 41/96 (42%), Gaps = 12/96 (12%)
Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
LSR +S +D A L+R + +A L+ L+R Q D + +A L
Sbjct: 44 LSREISGKDSA----------LDRLNSQIAELADLLSLER-QGNQDLQ-DSVANLRASLS 91
Query: 72 VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
+A+ R+++ + ++G L +L++
Sbjct: 92 AAEAERSRLQALLAELAGAGAAAEGRAGELAQELDS 127
>gnl|CDD|169982 PRK09578, PRK09578, periplasmic multidrug efflux lipoprotein
precursor; Reviewed.
Length = 385
Score = 25.5 bits (56), Expect = 8.8
Identities = 15/35 (42%), Positives = 19/35 (54%)
Query: 19 RDKAVLEMDKSKEELERSQATLATLSTSAELDRLQ 53
RD+AV E D ++ + QA A S AEL R Q
Sbjct: 132 RDRAVSERDYTEAVADERQAKAAVASAKAELARAQ 166
>gnl|CDD|182843 PRK10925, PRK10925, superoxide dismutase; Provisional.
Length = 206
Score = 25.3 bits (55), Expect = 9.5
Identities = 21/103 (20%), Positives = 42/103 (40%), Gaps = 2/103 (1%)
Query: 20 DKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNER 79
DK +E+ +K A L + E L ++ L + A+ + V +N
Sbjct: 20 DKQTMEIHHTKHHQTYVNNANAALESLPEFANLP--VEELITKLDQLPADKKTVLRNNAG 77
Query: 80 VRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
+ + ++K LKA +E G +++ K E+E++
Sbjct: 78 GHANHSLFWKGLKKGTTLQGDLKAAIERDFGSVDNFKAEFEKA 120
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.306 0.122 0.310
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,731,544
Number of extensions: 481800
Number of successful extensions: 1560
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1375
Number of HSP's successfully gapped: 540
Length of query: 122
Length of database: 10,937,602
Length adjustment: 84
Effective length of query: 38
Effective length of database: 7,211,866
Effective search space: 274050908
Effective search space used: 274050908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 53 (24.3 bits)