RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5592
(257 letters)
>gnl|CDD|201420 pfam00735, Septin, Septin. Members of this family include CDC3,
CDC10, CDC11 and CDC12/Septin. Members of this family
bind GTP. As regards the septins, these are polypeptides
of 30-65kDa with three characteristic GTPase motifs
(G-1, G-3 and G-4) that are similar to those of the Ras
family. The G-4 motif is strictly conserved with a
unique septin consensus of AKAD. Most septins are
thought to have at least one coiled-coil region, which
in some cases is necessary for intermolecular
interactions that allow septins to polymerise to form
rod-shaped complexes. In turn, these are arranged into
tandem arrays to form filaments. They are
multifunctional proteins, with roles in cytokinesis,
sporulation, germ cell development, exocytosis and
apoptosis.
Length = 280
Score = 227 bits (581), Expect = 2e-74
Identities = 88/141 (62%), Positives = 111/141 (78%)
Query: 1 MQRLHDKVNIIPVIAKADTMTPEECALFKKQILSEIAQHKIEIYQFPPGGSSEDDTSKFN 60
M++LH+KVNIIPVIAKADT+TP+E FKK+I +EI + I+IY FP S ED+ +
Sbjct: 135 MKKLHEKVNIIPVIAKADTLTPDELQRFKKRIRAEIERQNIKIYHFPDEESDEDEEKEQT 194
Query: 61 KNLRDRVPFAVVGSNTVVEIDGKKVRGRKYPWGIAEVENLEHCDFIALRNMLIRTNLQDL 120
+ L+ +PFA+VGSNT +E DGKKVRGRKYPWG+ EVEN HCDF+ LRNMLIRT+LQDL
Sbjct: 195 EQLKSSIPFAIVGSNTEIENDGKKVRGRKYPWGVVEVENPSHCDFLKLRNMLIRTHLQDL 254
Query: 121 KDVTSNVHYENFRCRKLAGLG 141
K+VT +HYEN+R KL+ LG
Sbjct: 255 KEVTHELHYENYRSEKLSALG 275
>gnl|CDD|206649 cd01850, CDC_Septin, CDC/Septin GTPase family. Septins are a
conserved family of GTP-binding proteins associated with
diverse processes in dividing and non-dividing cells.
They were first discovered in the budding yeast S.
cerevisiae as a set of genes (CDC3, CDC10, CDC11 and
CDC12) required for normal bud morphology. Septins are
also present in metazoan cells, where they are required
for cytokinesis in some systems, and implicated in a
variety of other processes involving organization of the
cell cortex and exocytosis. In humans, 12 septin genes
generate dozens of polypeptides, many of which comprise
heterooligomeric complexes. Since septin mutants are
commonly defective in cytokinesis and formation of the
neck formation of the neck filaments/septin rings,
septins have been considered to be the primary
constituents of the neck filaments. Septins belong to
the GTPase superfamily for their conserved GTPase motifs
and enzymatic activities.
Length = 275
Score = 223 bits (571), Expect = 7e-73
Identities = 85/141 (60%), Positives = 110/141 (78%), Gaps = 1/141 (0%)
Query: 1 MQRLHDKVNIIPVIAKADTMTPEECALFKKQILSEIAQHKIEIYQFPPGGSSEDDTSKFN 60
M++L KVNIIPVIAKADT+TPEE FKK+I+ +I ++ I+IY+FP +++ + N
Sbjct: 136 MKKLSKKVNIIPVIAKADTLTPEELTEFKKRIMEDIEENNIKIYKFP-EDEEDEEEIEEN 194
Query: 61 KNLRDRVPFAVVGSNTVVEIDGKKVRGRKYPWGIAEVENLEHCDFIALRNMLIRTNLQDL 120
K L+ +PFA+VGSN VE++GKKVRGRKYPWG+ EVEN EHCDF+ LRN+LIRT+LQDL
Sbjct: 195 KKLKSLIPFAIVGSNEEVEVNGKKVRGRKYPWGVVEVENEEHCDFVKLRNLLIRTHLQDL 254
Query: 121 KDVTSNVHYENFRCRKLAGLG 141
K+ T NVHYEN+R KL L
Sbjct: 255 KETTHNVHYENYRSEKLEALK 275
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 206 bits (526), Expect = 6e-65
Identities = 92/227 (40%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 1 MQRLHDKVNIIPVIAKADTMTPEECALFKKQILSEIAQHKIEIYQFPPGGSSEDDTSKFN 60
M+RL +VN+IPVIAKADT+T +E A FK++I ++ Q+ I ++ ED++ + N
Sbjct: 156 MKRLSKRVNLIPVIAKADTLTDDELAEFKERIREDLEQYNIPVFDPYDPEDDEDESLEEN 215
Query: 61 KNLRDRVPFAVVGSNTVVEIDGKKVRGRKYPWGIAEVENLEHCDFIALRNMLIRTNLQDL 120
++LR +PFA++GSNT +E G++VRGRKYPWG+ E+++ EH DF LRN+LIRT+LQ+L
Sbjct: 216 QDLRSLIPFAIIGSNTEIENGGEQVRGRKYPWGVVEIDDEEHSDFKKLRNLLIRTHLQEL 275
Query: 121 KDVTSNVHYENFRCRKLAGLGTDGKPRLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFE 180
K+ T N+ YEN+R KL+GL G+P K HEA++ + E ++++ F
Sbjct: 276 KETTENLLYENYRTEKLSGLKNSGEP------------SLKEIHEARLNEEERELKKKFT 323
Query: 181 MKVKEKKQKLKDSE-------IDMKKSLEAQIKELEEKRRGLELEIS 220
K++EK+++L++ E ++ LE K+LE+ + LE S
Sbjct: 324 EKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKS 370
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 36.2 bits (84), Expect = 0.014
Identities = 30/99 (30%), Positives = 61/99 (61%), Gaps = 3/99 (3%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
+ ++EEE E E K+ ++E ++E++ E +++E K++L++ E K+ LE ++KELEE++
Sbjct: 830 IEELEEEIEELEEKLDELEEELEEL-EKELEELKEELEELE-AEKEELEDELKELEEEKE 887
Query: 214 GLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVK 252
LE E+ + E + E + ER L+ K ++++
Sbjct: 888 ELEEELRELESEL-AELKEEIEKLRERLEELEAKLERLE 925
Score = 28.9 bits (65), Expect = 3.2
Identities = 20/97 (20%), Positives = 43/97 (44%)
Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLEL 217
++ E + + E + + E ++ ++ + LE +I+ELEEK LE
Sbjct: 790 ALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEE 849
Query: 218 EISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
E+ + E+ +EL E++ D ++ ++K
Sbjct: 850 ELEELEKELEELKEELEELEAEKEELEDELKELEEEK 886
Score = 28.1 bits (63), Expect = 6.0
Identities = 19/66 (28%), Positives = 39/66 (59%), Gaps = 3/66 (4%)
Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLEL 217
+E K E E K+++ E ++E++ E ++E +++L+ E +K+ Q EL+ + R LEL
Sbjct: 171 KERKEEAERKLERTEENLERL-EDLLEELEKQLEKLERQAEKAERYQ--ELKAELRELEL 227
Query: 218 EISQWE 223
+ +
Sbjct: 228 ALLLAK 233
Score = 27.8 bits (62), Expect = 6.7
Identities = 24/92 (26%), Positives = 50/92 (54%), Gaps = 7/92 (7%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEM--KVKEKKQKLKDSEIDMKKSLEAQ---IKEL 208
L ++EEE E E ++++++ +E++ E ++E KLK+ ++++ +A ++EL
Sbjct: 739 LEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEEL 798
Query: 209 EEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
EE+ E + E+ + E RR LE+
Sbjct: 799 EEELEEAERRLDALERE--LESLEQRRERLEQ 828
Score = 27.8 bits (62), Expect = 7.0
Identities = 21/109 (19%), Positives = 58/109 (53%), Gaps = 9/109 (8%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQV------FEMKVKEKKQKLKDSEIDMKKSLEAQIKE 207
L+++EEE E + ++++ E ++E++ +++E +++L + + + + LE +I
Sbjct: 248 LSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEE-IEELEGEISL 306
Query: 208 LEEKRRGLELEISQWEQSNGVSMDEL--RRRSLERDSSLDGKEKKVKKK 254
L E+ LE E+ + E+ +++ + LE +L + +++ +
Sbjct: 307 LRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAE 355
Score = 27.8 bits (62), Expect = 8.0
Identities = 20/104 (19%), Positives = 48/104 (46%), Gaps = 5/104 (4%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDM-----KKSLEAQIKEL 208
++ + E E E +++++E +E++ E K++ + + LE +EL
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363
Query: 209 EEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVK 252
EEK L E+ + ++ + EL E + L+ +++++
Sbjct: 364 EEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE 407
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 34.6 bits (80), Expect = 0.028
Identities = 22/106 (20%), Positives = 51/106 (48%), Gaps = 4/106 (3%)
Query: 151 KNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEE 210
+N ++Q+E E +E ++K+ E + V + + + SLE ++ EL E
Sbjct: 58 ENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELME 117
Query: 211 KRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKKGL 256
+ LE EI ++ ++ L + E ++ L+ + +++++G
Sbjct: 118 EIEKLEKEIEDLKE----RLERLEKNLAEAEARLEEEVAEIREEGQ 159
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 32.6 bits (75), Expect = 0.13
Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 5/48 (10%)
Query: 194 EIDMKKS-LEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
+ + KS LE +IK+LEE++ LE +++ E ++ + +R E
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEA----KLEAIEKREEEE 160
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 32.8 bits (74), Expect = 0.19
Identities = 25/97 (25%), Positives = 45/97 (46%), Gaps = 4/97 (4%)
Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLEL 217
EE K+ EA + E ++ + K + + K E KK +A K+ EEK++ E
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKADEA 1396
Query: 218 EISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
+ E DEL++ + + + + K+K +KK
Sbjct: 1397 KKKAEEDKK--KADELKKAAAAKKKADEAKKKAEEKK 1431
Score = 30.9 bits (69), Expect = 0.90
Identities = 23/104 (22%), Positives = 50/104 (48%), Gaps = 3/104 (2%)
Query: 151 KNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEE 210
A+ E E EA+ + + + + + + K+K K + KK+ EA+ K E+
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403
Query: 211 KRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
K++ EL+ + + DE ++++ E+ + + K+K + K
Sbjct: 1404 KKKADELKKAAAAKKKA---DEAKKKAEEKKKADEAKKKAEEAK 1444
Score = 30.1 bits (67), Expect = 1.3
Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 7/98 (7%)
Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQK-----LKDSEIDMKKSLEAQIKELEEKR 212
EEK++ E K E + + E K ++ K K +E D KK+ EA KE EE +
Sbjct: 1643 AEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702
Query: 213 RGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKK 250
+ EL+ + E+ +EL++ E + +K+
Sbjct: 1703 KAEELKKKEAEEKK--KAEELKKAEEENKIKAEEAKKE 1738
Score = 29.3 bits (65), Expect = 2.3
Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 4/97 (4%)
Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLEL 217
EE K++ +A KK E + ++ E E + ++E +K+ A+ K+ E K++
Sbjct: 1325 EEAKKKADAAKKKAE-EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383
Query: 218 EISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
+ E+ DE ++++ E D +K K
Sbjct: 1384 KKKAEEKKKA---DEAKKKAEEDKKKADELKKAAAAK 1417
Score = 27.8 bits (61), Expect = 6.8
Identities = 25/100 (25%), Positives = 48/100 (48%), Gaps = 8/100 (8%)
Query: 159 EEKREHEAKMKKMEVDMEQVFEMKVKEKKQK----LKDSEIDMKKSLEAQIKELEEKRRG 214
EEK++ + KK E D ++ E+K +K K + KK+ EA+ K+ EE ++
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK-KKAEEAKKA 1446
Query: 215 LELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
E + E +E ++++ E + + K+K + K
Sbjct: 1447 DEAKKKAEEAKKA---EEAKKKAEEAKKADEAKKKAEEAK 1483
Score = 27.8 bits (61), Expect = 7.0
Identities = 23/108 (21%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 147 RLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIK 206
+L + + EE K+ EAK+K E+ + + K+K ++LK E + KK E ++K
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAE----EEKKKVEQLKKKEAEEKKKAE-ELK 1653
Query: 207 ELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
+ EE+ + E ++ + + +E ++ + + + +K+ ++
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701
Score = 27.4 bits (60), Expect = 9.6
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 6/99 (6%)
Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQ--KLKDSEIDMKKSLEAQIKELEEKRRGL 215
+K+ EAK K E + K +E K+ + K + KK+ EA+ K+ EE ++
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAK-KKAEEAKKAD 1473
Query: 216 ELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
E + E DE ++++ E D +K + K
Sbjct: 1474 EAKKKAEEAKKA---DEAKKKAEEAKKKADEAKKAAEAK 1509
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 32.4 bits (75), Expect = 0.21
Identities = 19/104 (18%), Positives = 53/104 (50%), Gaps = 13/104 (12%)
Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSE---IDMKKSLEAQIKELEEK 211
+ E K+E + K+ + FE +++E++ +L+ E + +++L+ +++ LE++
Sbjct: 49 KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108
Query: 212 RRGLELEISQWEQSNGVSMDELRRRSLE-RDSSLDGKEKKVKKK 254
LE + + EQ +++ LE ++ L+ ++ ++
Sbjct: 109 EEELEKKEKELEQ---------KQQELEKKEEELEELIEEQLQE 143
Score = 28.2 bits (64), Expect = 5.1
Identities = 17/57 (29%), Positives = 36/57 (63%), Gaps = 1/57 (1%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEE 210
L Q EE ++K E ++E+ E ++++K+Q+L+ E ++++ +E Q++ELE
Sbjct: 91 LLQKEENLDRKLELLEKREEELEKK-EKELEQKQQELEKKEEELEELIEEQLQELER 146
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 32.2 bits (73), Expect = 0.29
Identities = 23/82 (28%), Positives = 40/82 (48%), Gaps = 1/82 (1%)
Query: 140 LGTDGKPRLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKK 199
L ++ R+ + LA+ ++ K+E E + E + KQ +D E+++KK
Sbjct: 485 LKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKK 544
Query: 200 SLEAQIKELEEKRRGLELEISQ 221
LE +K EE+ R LE E +
Sbjct: 545 -LEHDLKLKEEECRMLEKEAQE 565
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 31.8 bits (73), Expect = 0.33
Identities = 17/79 (21%), Positives = 47/79 (59%), Gaps = 3/79 (3%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEM---KVKEKKQKLKDSEIDMKKSLEAQIKELEE 210
L ++EEE +E E +++K++ ++++ + ++ +K++ + ++ K+ L +++ELEE
Sbjct: 336 LKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEE 395
Query: 211 KRRGLELEISQWEQSNGVS 229
+ + L+ E+ +S
Sbjct: 396 ELKELKEELESLYSEGKIS 414
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 31.6 bits (72), Expect = 0.48
Identities = 28/99 (28%), Positives = 50/99 (50%), Gaps = 12/99 (12%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
A++ EE R+ + + +E +Q+ E +VKE + +L ++E K + I++LE + R
Sbjct: 695 AARLAEELRQEQEHSQHLERLRKQL-ESQVKELQVRLDEAEAAALKGGKKMIQKLEARVR 753
Query: 214 GLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVK 252
LE E +D +RR E +L E++VK
Sbjct: 754 ELEAE-----------LDGEQRRHAETQKNLRKMERRVK 781
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 31.5 bits (71), Expect = 0.49
Identities = 22/97 (22%), Positives = 46/97 (47%), Gaps = 7/97 (7%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKS------LEAQIKE 207
+ + ++K+E++ ++ + D+E E E KQ+LK I + L+ + ++
Sbjct: 229 IQTLTQKKQEND-RVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETERIDLQLENEQ 287
Query: 208 LEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSL 244
L E R L+ + +Q G+ +EL RD ++
Sbjct: 288 LHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTI 324
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 31.2 bits (71), Expect = 0.58
Identities = 17/84 (20%), Positives = 39/84 (46%), Gaps = 9/84 (10%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
+ ++EEEK + ++KK E +EQ+ K +++ L+ + +E++
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYD---------LKEEYDRVEKELS 486
Query: 214 GLELEISQWEQSNGVSMDELRRRS 237
L+ E+++ E S + +R
Sbjct: 487 KLQRELAEAEAQARASEERVRGGR 510
Score = 30.0 bits (68), Expect = 1.3
Identities = 23/117 (19%), Positives = 47/117 (40%), Gaps = 19/117 (16%)
Query: 154 LAQMEEEKREHEAKMKKMEV--DMEQVFEMKVKEKKQKLKDSE------IDMKKSLEAQI 205
++ + R+ K++E E+ + +++E ++ L E K LEA+I
Sbjct: 708 SQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARI 767
Query: 206 KELEEKRRGLELEI---------SQWEQSNGV--SMDELRRRSLERDSSLDGKEKKV 251
+ELEE LE + S+ + ++E R R ++ K ++
Sbjct: 768 EELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRL 824
Score = 27.7 bits (62), Expect = 7.2
Identities = 25/125 (20%), Positives = 57/125 (45%), Gaps = 29/125 (23%)
Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDM----------------- 197
A+ + +K + +++++E ++E++ ++ + EK+Q+L+ +
Sbjct: 166 AEFDRKKEKALEELEEVEENIERL-DLIIDEKRQQLERLRREREKAERYQALLKEKREYE 224
Query: 198 -------KKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKK 250
K++LE Q + +E + LE E+ + + EL +R E + L+ KK
Sbjct: 225 GYELLKEKEALERQKEAIERQLASLEEELEKLT----EEISELEKRLEEIEQLLEELNKK 280
Query: 251 VKKKG 255
+K G
Sbjct: 281 IKDLG 285
Score = 27.7 bits (62), Expect = 7.8
Identities = 21/93 (22%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
Query: 149 LNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKEL 208
+ L EE + EA++ K+ ++E+ E +++E++++ + L + EL
Sbjct: 312 EKERELEDAEERLAKLEAEIDKLLAEIEE-LEREIEEERKR--------RDKLTEEYAEL 362
Query: 209 EEKRRGLELEISQWEQSNGVSMDEL--RRRSLE 239
+E+ L E+ + ++ + DEL R LE
Sbjct: 363 KEELEDLRAELEEVDKEFAETRDELKDYREKLE 395
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 29.9 bits (68), Expect = 0.66
Identities = 16/71 (22%), Positives = 45/71 (63%), Gaps = 2/71 (2%)
Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMK-KSLEAQIKELEEKRR 213
Q+E+E ++ +A+++K E ++++ + K++++ L + K + L+ + +EL++K++
Sbjct: 38 KQLEKEFKKLQAELQKKEKELQKEEQ-KLQKQAATLSEEARKAKQQELQQKQQELQQKQQ 96
Query: 214 GLELEISQWEQ 224
+ E+ Q +Q
Sbjct: 97 AAQQELQQKQQ 107
Score = 26.4 bits (59), Expect = 9.2
Identities = 16/84 (19%), Positives = 44/84 (52%), Gaps = 12/84 (14%)
Query: 173 VDMEQVFE--MKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSM 230
VD+++V K +++L+ ++ L+ + KEL+++ + L Q ++
Sbjct: 21 VDVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKL--------QKQAATL 72
Query: 231 DELRRRSLERDSSLDGKEKKVKKK 254
E R++ +++ L K++++++K
Sbjct: 73 SEEARKAKQQE--LQQKQQELQQK 94
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 30.7 bits (69), Expect = 0.73
Identities = 19/108 (17%), Positives = 49/108 (45%), Gaps = 11/108 (10%)
Query: 152 NPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQ------I 205
N + M+++ +E K++K++ ++E +E+ + L+ + ++ K L Q
Sbjct: 316 NYVNAMKQKSQEWPGKLEKLKSEIE-----LKEEEIKALQSNIDELHKQLRKQGISTEQF 370
Query: 206 KELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKK 253
+ + ++R L E+ + + ++ R LE EK +++
Sbjct: 371 ELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQ 418
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 30.8 bits (70), Expect = 0.74
Identities = 21/69 (30%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVK-EKKQKLKDSEIDMKKSLEAQIKELEEKRRGL 215
+ E E +++K+E +++++ E+K + E+ +K +S K+ LE +I+ELEE+ L
Sbjct: 212 ISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEEL 271
Query: 216 ELEISQWEQ 224
+ EI + E+
Sbjct: 272 KKEIEELEE 280
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 30.4 bits (70), Expect = 1.00
Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 9/54 (16%)
Query: 166 AKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEI 219
+ K+ D E + +VKE K+++K LEA++ ELE + L L I
Sbjct: 59 GQAKRKGEDAEALIA-EVKELKEEIKA--------LEAELDELEAELEELLLRI 103
>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869). This
bacterial family of proteins has no known function.
Length = 131
Score = 29.1 bits (66), Expect = 1.0
Identities = 15/70 (21%), Positives = 36/70 (51%), Gaps = 4/70 (5%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
L Q+E + ++ +++K + + +V++++ K + K L QI++++E
Sbjct: 36 LQQLEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAKRLEQ----KNQLLFQIEQVQELPL 91
Query: 214 GLELEISQWE 223
G E++ Q E
Sbjct: 92 GSEVKQGQVE 101
>gnl|CDD|193505 cd03884, M20_bAS, M20 Peptidase beta-alanine synthase, an
amidohydrolase. Peptidase M20 family, beta-alanine
synthase (bAS; N-carbamoyl-beta-alanine amidohydrolase
and beta-ureidopropionase; EC 3.5.1.6) subfamily. bAS is
an amidohydrolase and is the final enzyme in the
pyrimidine catabolic pathway, which is involved in the
regulation of the cellular pyrimidine pool. The bAS
catalyzes the irreversible hydrolysis of the
N-carbamylated beta-amino acids to beta-alanine or
aminoisobutyrate under the release of carbon dioxide and
ammonia. Also included in this subfamily is allantoate
amidohydrolase (allantoate deiminase), which catalyzes
the conversion of allantoate to (S)-ureidoglycolate, one
of the crucial alternate steps in purine metabolism. It
is possible that these two enzymes arose from the same
ancestral peptidase that evolved into two structurally
related enzymes with distinct catalytic properties and
biochemical roles within the cell. Yeast requires
beta-alanine as a precursor of pantothenate and coenzyme
A biosynthesis, but generates it mostly via degradation
of spermine. Disorders in pyrimidine degradation and
beta-alanine metabolism caused by beta-ureidopropionase
deficiency (UPB1 gene) in humans are normally associated
with neurological disorders.
Length = 399
Score = 30.1 bits (69), Expect = 1.2
Identities = 13/45 (28%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 196 DMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
+ ++ A +E+ RRG+E+EI + S V D +LE
Sbjct: 291 ALVAAIRAAAEEIAA-RRGVEVEIERLSDSPPVPFDPELVAALEA 334
>gnl|CDD|221482 pfam12240, Angiomotin_C, Angiomotin C terminal. This domain family
is found in eukaryotes, and is typically between 197 and
211 amino acids in length. This family is the C terminal
region of angiomotin. Angiomotin regulates the action of
angiogenesis inhibitor angiostatin. The C terminal
region of angiomotin appears to be involved in directing
the protein chemotactically.
Length = 204
Score = 29.5 bits (66), Expect = 1.5
Identities = 22/103 (21%), Positives = 50/103 (48%), Gaps = 14/103 (13%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSE-----IDMKKSLEAQIKEL 208
LAQ++ + E +++ +EQ E++ +Q+ + + +L+ Q++E
Sbjct: 12 LAQLQAACEKREQLERRLRTRLEQ--ELRSLRAQQRQANRQGASTSSYNADALQEQLREK 69
Query: 209 EEKRRGLELEISQWEQSN-------GVSMDELRRRSLERDSSL 244
EE+ LE ++++WEQ +MD + +RD+++
Sbjct: 70 EERILALEADMTKWEQKYLEESTMRQFAMDAAATAAAQRDTTI 112
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 29.7 bits (68), Expect = 1.5
Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 3/51 (5%)
Query: 185 EKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRR 235
E+ + L + E ++ LE +I ELEE+ + + ++ + E+ + EL R
Sbjct: 167 EEFEALPEEE---REELEEKIDELEEELQEILRQLRELEREAREKLRELDR 214
>gnl|CDD|219925 pfam08598, Sds3, Sds3-like. Repression of gene transcription is
mediated by histone deacetylases containing
repressor-co-repressor complexes, which are recruited to
promoters of target genes via interactions with
sequence-specific transcription factors. The
co-repressor complex contains a core of at least seven
proteins. This family represents the conserved region
found in Sds3, Dep1 and BRMS1-homologue p40 proteins.
Length = 184
Score = 29.2 bits (66), Expect = 1.6
Identities = 27/99 (27%), Positives = 46/99 (46%), Gaps = 13/99 (13%)
Query: 156 QMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGL 215
++E +RE+EA+ + + Q FE + + +++L + LE +I LEE+RR L
Sbjct: 69 KLECIEREYEAERQAAK----QEFEKEKRLLRERLLEE-------LEEKIYRLEEERRSL 117
Query: 216 ELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
++ S + R R L R + K KKK
Sbjct: 118 DITDSDYGLLGSHP--NRRTRKLRRRVNSPELVAKRKKK 154
>gnl|CDD|133381 cd01106, HTH_TipAL-Mta, Helix-Turn-Helix DNA binding domain of the
transcription regulators TipAL, Mta, and SkgA.
Helix-turn-helix (HTH) TipAL, Mta, and SkgA
transcription regulators, and related proteins,
N-terminal domain. TipAL regulates resistance to and
activation by numerous cyclic thiopeptide antibiotics,
such as thiostrepton. Mta is a global transcriptional
regulator; the N-terminal DNA-binding domain of Mta
interacts directly with the promoters of mta, bmr, blt,
and ydfK, and induces transcription of these
multidrug-efflux transport genes. SkgA has been shown to
control stationary-phase expression of
catalase-peroxidase in Caulobacter crescentus. These
proteins are comprised of distinct domains that harbor
an N-terminal active (DNA-binding) site and a
regulatory (effector-binding) site. The conserved
N-terminal domain of these transcription regulators
contains winged HTH motifs that mediate DNA binding.
These proteins share the N-terminal DNA binding domain
with other transcription regulators of the MerR
superfamily that promote transcription by reconfiguring
the spacer between the -35 and -10 promoter elements.
Unique to this family, is a TipAL-like, lineage specific
Bacilli subgroup, which has five conserved cysteines in
the C-terminus of the protein.
Length = 103
Score = 28.2 bits (64), Expect = 1.7
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 179 FEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
+KE K+ LKD D+ ++L Q + LEEK+ L+
Sbjct: 56 LGFSLKEIKELLKDPSEDLLEALREQKELLEEKKERLD 93
>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
(DUF2317). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 541
Score = 29.5 bits (67), Expect = 1.8
Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 9/59 (15%)
Query: 162 REHEAKMKKMEVDMEQVFEMKVKEKKQKL----KDSEID-----MKKSLEAQIKELEEK 211
+ E K+ K+ + +E VF ++ K+K+ +SEID K LEAQ K L E+
Sbjct: 386 PKVEKKLDKLGLSIEDVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKRLREE 444
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 28.7 bits (65), Expect = 1.9
Identities = 12/80 (15%), Positives = 45/80 (56%), Gaps = 3/80 (3%)
Query: 156 QMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGL 215
+++ + + ++++++ + ++ K KK + + E L ++K+LE++ + L
Sbjct: 66 KLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERT---ELLEELKQLEKELKKL 122
Query: 216 ELEISQWEQSNGVSMDELRR 235
+ E+ ++E+++ +++L+
Sbjct: 123 KAELEKYEKNDPERIEKLKE 142
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.7 bits (67), Expect = 1.9
Identities = 18/69 (26%), Positives = 34/69 (49%)
Query: 186 KKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLD 245
+ + ++ KSLE ++++LEEK LE E+ + + L R E + L+
Sbjct: 312 ALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLE 371
Query: 246 GKEKKVKKK 254
EK+++K
Sbjct: 372 ELEKELEKA 380
>gnl|CDD|240411 PTZ00419, PTZ00419, valyl-tRNA synthetase-like protein;
Provisional.
Length = 995
Score = 29.6 bits (67), Expect = 2.1
Identities = 18/64 (28%), Positives = 34/64 (53%), Gaps = 4/64 (6%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQV-FEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKR 212
LA++E++ + + ++ + +E KV E +KL D +ID L +IK+LE+
Sbjct: 931 LAKLEKKLAKLQKSLESYLKKISIPNYEDKVPEDVRKLNDEKID---ELNEEIKQLEQAI 987
Query: 213 RGLE 216
L+
Sbjct: 988 EELK 991
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.5 bits (67), Expect = 2.1
Identities = 21/57 (36%), Positives = 36/57 (63%), Gaps = 1/57 (1%)
Query: 168 MKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEISQWEQ 224
+ K+ E+ ++ VK ++ +KD E +++K +E +IKELEE+ LE EI + EQ
Sbjct: 66 LPKLNPLREEKKKVSVKSLEELIKDVEEELEK-IEKEIKELEEEISELENEIKELEQ 121
Score = 29.1 bits (66), Expect = 2.5
Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 5/65 (7%)
Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
+ + E K K +E++ + V+E+ +K+ + EI K LE +I ELE + + LE
Sbjct: 66 LPKLNPLREEKKKVSVKSLEELIK-DVEEELEKI-EKEI---KELEEEISELENEIKELE 120
Query: 217 LEISQ 221
EI +
Sbjct: 121 QEIER 125
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 29.1 bits (65), Expect = 2.1
Identities = 21/106 (19%), Positives = 41/106 (38%), Gaps = 8/106 (7%)
Query: 150 NKNPLAQMEEEKREHEAKMKKMEVDMEQV--FEMKVKEKKQKLKDSEIDMKKSLEAQIKE 207
+ L+ E + E E ++ E+V E + + KL + K+ L+A+ E
Sbjct: 193 DGISLSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAE 252
Query: 208 LEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKK 253
L + LE + E + R ++E + L G + +
Sbjct: 253 LSKALAALEAANAADED------PQDRDAAVEAAARLMGAARALVP 292
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 28.7 bits (65), Expect = 2.3
Identities = 27/96 (28%), Positives = 50/96 (52%), Gaps = 10/96 (10%)
Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRG 214
A+ E E +E ++++ E + Q E + K + L+ E +SLE + KEL +++
Sbjct: 67 AEAERELKERRNELQRQEKRLLQKEET-LDRKDESLEKKE----ESLEEKEKELAARQQQ 121
Query: 215 LELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKK 250
LE + + E+ + E +++ LER S L +E K
Sbjct: 122 LEEKEEELEE-----LIEEQQQELERISGLTAEEAK 152
>gnl|CDD|217877 pfam04079, DUF387, Putative transcriptional regulators (Ypuh-like).
This family of conserved bacterial proteins are thought
to possibly be helix-turn-helix type transcriptional
regulators.
Length = 159
Score = 28.3 bits (64), Expect = 2.3
Identities = 11/34 (32%), Positives = 22/34 (64%)
Query: 185 EKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELE 218
E+ K+ S +++++L+ +E E++ RGLEL
Sbjct: 17 EELAKILGSREEVREALDELKEEYEDRDRGLELV 50
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 29.2 bits (66), Expect = 2.5
Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 196 DMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELR-RRSLE 239
+ + AQIK+LE + GL+ ++ Q V +EL RR L+
Sbjct: 144 AQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLK 188
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 28.3 bits (64), Expect = 2.7
Identities = 10/48 (20%), Positives = 19/48 (39%)
Query: 171 MEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELE 218
+ D+ ++ + + Q K ++ KELE K + E E
Sbjct: 56 LAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAE 103
>gnl|CDD|235580 PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase
subunit alpha; Validated.
Length = 319
Score = 28.6 bits (65), Expect = 3.0
Identities = 16/55 (29%), Positives = 30/55 (54%), Gaps = 7/55 (12%)
Query: 168 MKKMEVDMEQ-VFEM--KVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEI 219
M +D E+ + E+ K++E + +DS++D L +I+ LE+K L +I
Sbjct: 1 MMLNYLDFEKPIAELEAKIEELRAVAEDSDVD----LSEEIERLEKKLEELTKKI 51
>gnl|CDD|221059 pfam11285, DUF3086, Protein of unknown function (DUF3086). This
family of proteins with unknown function appears to be
restricted to Cyanobacteria.
Length = 283
Score = 28.5 bits (64), Expect = 3.1
Identities = 12/39 (30%), Positives = 23/39 (58%), Gaps = 3/39 (7%)
Query: 198 KKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRR 236
+++LE +I++LE ++ +E E+ G S D + RR
Sbjct: 13 RQALEIEIEKLERRKEQIEQELRT--SFAGQS-DAIARR 48
Score = 27.3 bits (61), Expect = 8.4
Identities = 11/36 (30%), Positives = 22/36 (61%), Gaps = 9/36 (25%)
Query: 206 KELEEKRRGLELEISQWEQSNGVSMDELRRRSLERD 241
K+LE++R+ LE+EI + E+ R+ +E++
Sbjct: 7 KDLEQRRQALEIEIEKLER---------RKEQIEQE 33
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 28.8 bits (65), Expect = 3.1
Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 8/72 (11%)
Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKK----QKLKDS----EIDMKKSLEAQIK 206
A ++ + E E + +++ E+ +++ + QKLK+ I++++ +IK
Sbjct: 219 ALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIK 278
Query: 207 ELEEKRRGLELE 218
E E+ R L
Sbjct: 279 EKVEEERNGRLA 290
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 28.9 bits (65), Expect = 3.2
Identities = 25/103 (24%), Positives = 51/103 (49%), Gaps = 11/103 (10%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQ------VFEMKVKEKKQKLKDSEIDMKKSLEAQIKE 207
L + EEE E A+++++E +E+ E +++E +++L ++ + LE Q +
Sbjct: 248 LKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISR-LEQQKQI 306
Query: 208 LEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKK 250
L E+ LE ++ + E ++EL + E L E+K
Sbjct: 307 LRERLANLERQLEELE----AQLEELESKLDELAEELAELEEK 345
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 27.2 bits (61), Expect = 3.9
Identities = 20/79 (25%), Positives = 48/79 (60%), Gaps = 6/79 (7%)
Query: 154 LAQMEEEKREHEAKMKKME-VD-----MEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKE 207
+ Q+E + +E E ++++E +D + + ++ VK+ K+++K+ + K++LE +IK
Sbjct: 21 IKQLETQLKEIELVLEELELLDEDTKVYKLIGDVLVKQDKEEVKEELEERKETLEKEIKT 80
Query: 208 LEEKRRGLELEISQWEQSN 226
LE++ LE E+ + ++
Sbjct: 81 LEKQLEKLEKELEELKEEL 99
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 28.5 bits (64), Expect = 3.9
Identities = 12/62 (19%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
++E + K++ + + ++ E+++ D+ + L+A+I LEE LE
Sbjct: 1 LKELREALAELAKELR-KLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELE 59
Query: 217 LE 218
+
Sbjct: 60 AK 61
>gnl|CDD|147845 pfam05914, RIB43A, RIB43A. This family consists of several
RIB43A-like eukaryotic proteins. Ciliary and flagellar
microtubules contain a specialised set of
protofilaments, termed ribbons, that are composed of
tubulin and several associated proteins. RIB43A was
first characterized in the unicellular biflagellate,
Chlamydomonas reinhardtii although highly related
sequences are present in several higher eukaryotes
including humans. The function of this protein is
unknown although the structure of RIB43A and its
association with the specialised protofilament ribbons
and with basal bodies is relevant to the proposed role
of ribbons in forming and stabilising doublet and
triplet microtubules and in organising their
three-dimensional structure. Human RIB43A homologues
could represent a structural requirement in centriole
replication in dividing cells.
Length = 379
Score = 28.5 bits (64), Expect = 4.1
Identities = 21/86 (24%), Positives = 40/86 (46%), Gaps = 9/86 (10%)
Query: 149 LNKNPLAQMEEEKREHEAKMKKMEVDME----QVFEMKVKEKKQKLKDSEID-----MKK 199
+ + ++E + R AK++ + VD+E QV E K++E +K +D +
Sbjct: 16 IERRRNRELERQSRIFNAKVRTIGVDLEALDLQVQEKKLREAAEKERDEAYAAQMLLNDR 75
Query: 200 SLEAQIKELEEKRRGLELEISQWEQS 225
+ KE E+ RR L ++ + Q
Sbjct: 76 VAQLLQKEQEQDRRRLNKDLQDFRQQ 101
>gnl|CDD|227094 COG4752, COG4752, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 190
Score = 27.9 bits (62), Expect = 4.3
Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 2/49 (4%)
Query: 195 IDMKKSLEAQIKELE--EKRRGLELEISQWEQSNGVSMDELRRRSLERD 241
+ + +LE I+++E E RR L + S N +S LR ERD
Sbjct: 85 VRLAYTLEEVIEDIEKEEGRRPLIVGTSARTYPNTISYSWLRNEIQERD 133
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 28.2 bits (63), Expect = 4.7
Identities = 20/65 (30%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
E K E ++K+ +E+ E K + ++ LK+ E +KK LE +++EL+E+ R +
Sbjct: 506 YGEFKEEINVLIEKLS-ALEKELEQKNEHLEKLLKEQEK-LKKELEQEMEELKERERNKK 563
Query: 217 LEISQ 221
LE+ +
Sbjct: 564 LELEK 568
>gnl|CDD|218651 pfam05600, DUF773, Protein of unknown function (DUF773). This
family contains several eukaryotic sequences which are
thought to be CDK5 activator-binding proteins, however,
the function of this family is unknown.
Length = 506
Score = 28.1 bits (63), Expect = 4.9
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 18/81 (22%)
Query: 144 GKPRLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEA 203
PR +++ L + E+K + E K+K+ M EK+Q+ ++ + SLE
Sbjct: 428 ASPRYVDR--LVETLEQKLKQEDKLKEKRELM--------TEKRQEAREE----QSSLEP 473
Query: 204 QIKELEEKRRGL----ELEIS 220
++ L EK R L E +IS
Sbjct: 474 KLDLLIEKTRELQKLIEADIS 494
>gnl|CDD|224073 COG1151, COG1151, 6Fe-6S prismane cluster-containing protein
[Energy production and conversion].
Length = 576
Score = 28.1 bits (63), Expect = 5.0
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 194 EIDMKKSLEAQIKELEEKRRGLEL 217
E D KK+ + + +EEKR+ L L
Sbjct: 553 EEDPKKAADLLEEHIEEKRKKLGL 576
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.1 bits (63), Expect = 5.3
Identities = 16/81 (19%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 147 RLLNKNPLAQMEE-EKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQI 205
R ++K ME+ +++ K + + + + + K+KLK + L+A++
Sbjct: 269 RTVSKTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEV 328
Query: 206 KELEEKRRGLELEISQWEQSN 226
KE +++++ E + Q E+
Sbjct: 329 KEKKKEKKKEEKKKKQIERLE 349
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 28.2 bits (63), Expect = 5.3
Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 9/66 (13%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSL---EAQIKELEE 210
LAQ++ + + + + ++ + + KEK Q+L D K++ EA+IK+LE
Sbjct: 442 LAQLQVAIQNVTQEQTQRNAALNEMRQ-RYKEKTQQLADV-----KTICEQEARIKDLEA 495
Query: 211 KRRGLE 216
+R L+
Sbjct: 496 QRAQLQ 501
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 26.8 bits (60), Expect = 5.4
Identities = 11/45 (24%), Positives = 24/45 (53%), Gaps = 5/45 (11%)
Query: 161 KREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQI 205
KRE + K KK E + K++K+++ + +K+++A +
Sbjct: 38 KREKKKKKKKSEGEF-----FAEKKEKKEVSEQRKADQKAVDAAL 77
>gnl|CDD|153348 cd07664, BAR_SNX2, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexin 2. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. SNX2 is a component of the retromer
complex, a membrane coat multimeric complex required for
endosomal retrieval of lysosomal hydrolase receptors to
the Golgi. The retromer consists of a cargo-recognition
subcomplex and a subcomplex formed by a dimer of sorting
nexins (SNX1 and/or SNX2), which ensures effcient cargo
sorting by facilitating proper membrane localization of
the cargo-recognition subcomplex. BAR domains form
dimers that bind to membranes, induce membrane bending
and curvature, and may also be involved in
protein-protein interactions.
Length = 234
Score = 27.7 bits (61), Expect = 5.5
Identities = 22/73 (30%), Positives = 44/73 (60%), Gaps = 5/73 (6%)
Query: 143 DGKPRLLNK-NPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKE----KKQKLKDSEIDM 197
+ K + NK + L Q ++E +E EAK+++ E D EQ+ + KE +K+++KD + +
Sbjct: 149 EAKLQYANKPDKLQQAKDEIKEWEAKVQQGERDFEQISKTIRKEVGRFEKERVKDFKTVI 208
Query: 198 KKSLEAQIKELEE 210
K LE+ ++ ++
Sbjct: 209 IKYLESLVQTQQQ 221
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 27.4 bits (61), Expect = 5.7
Identities = 22/103 (21%), Positives = 52/103 (50%), Gaps = 7/103 (6%)
Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEID------MKKSLEAQIKELEEK 211
+ E E + ++ +++ ++E E ++ E + +L+D + D + KSLE +++ LEE
Sbjct: 78 QGELSELKQQLSELQEELED-LEERIAELESELEDLKEDLQLLRELLKSLEERLESLEES 136
Query: 212 RRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
+ L E+ + Q ++ELR +L ++ +++
Sbjct: 137 IKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQEL 179
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 27.8 bits (62), Expect = 5.8
Identities = 17/75 (22%), Positives = 35/75 (46%), Gaps = 7/75 (9%)
Query: 144 GKPRLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEA 203
G R + L + +EE+ E E + + MK E + E+++ + LE
Sbjct: 103 GATRNYEADKLDEEQEERVEKEREEELAGD------AMKKLENRTADSKREMEVLERLE- 155
Query: 204 QIKELEEKRRGLELE 218
++KEL+ +R +++
Sbjct: 156 ELKELQSRRADVDVN 170
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 27.8 bits (62), Expect = 5.9
Identities = 20/105 (19%), Positives = 48/105 (45%), Gaps = 7/105 (6%)
Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVK-EKKQKLKDSEIDMKKSLEAQIKELEE--- 210
++ E + +K+K++ ++ V + + E++ K+K +++ E + +L+
Sbjct: 287 EEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQALVV 346
Query: 211 --KRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKK 253
R LEL QWE +DE R + + D ++++ +
Sbjct: 347 ASSERLLELA-QQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDE 390
>gnl|CDD|184042 PRK13415, PRK13415, flagella biosynthesis protein FliZ;
Provisional.
Length = 219
Score = 27.4 bits (61), Expect = 6.1
Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 2/59 (3%)
Query: 156 QMEEEKREHEAKMKKME--VDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKR 212
++EE +HE +++ Q + K K ++ + + L+ ++KEL+EKR
Sbjct: 145 EIEEILAQHEERLESKAEWSRWGQKLRDQWKGKSKQKQTTLPSFSALLKEELKELKEKR 203
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 27.6 bits (62), Expect = 6.3
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 156 QMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
+ EEEK A+ ++ E E+ E K +E++ KL K S E Q K LEEK R
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLA------KLSPEEQRK-LEEKER 316
>gnl|CDD|220582 pfam10117, McrBC, McrBC 5-methylcytosine restriction system
component. Members of this family of bacterial proteins
modify the specificity of mcrB restriction by expanding
the range of modified sequences restricted.
Length = 317
Score = 27.6 bits (62), Expect = 6.6
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 13/64 (20%)
Query: 118 QDLKDVTSNVHYENFR-CRKLAGLGTDGKPRLLNKNPLAQMEEEKREHEAKMKKMEVDME 176
+D + N + LA L L N++PLA + + K + ME
Sbjct: 171 KDFQKWRLNRLNARYELLLPLARLI------LSNQSPLAS------QGDQKAYSLLFPME 218
Query: 177 QVFE 180
++FE
Sbjct: 219 KLFE 222
>gnl|CDD|236456 PRK09290, PRK09290, allantoate amidohydrolase; Reviewed.
Length = 413
Score = 27.8 bits (63), Expect = 6.9
Identities = 12/45 (26%), Positives = 19/45 (42%), Gaps = 1/45 (2%)
Query: 196 DMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
+ L A + + RRG+E+EI + V D +LE
Sbjct: 299 ALVAELRAAAEAIAA-RRGVEVEIELISRRPPVPFDPGLVAALEE 342
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 27.7 bits (62), Expect = 7.3
Identities = 24/104 (23%), Positives = 51/104 (49%), Gaps = 13/104 (12%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
+ + E E EA+++ +E ++E +V+E+ ++ +D E +I+ LEE+R
Sbjct: 470 IEEDRERVEELEAELEDLEEEVE-----EVEERLERAEDL-----VEAEDRIERLEERRE 519
Query: 214 GLELEISQWE---QSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
LE I++ + +ELR R+ E ++ + K + +
Sbjct: 520 DLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEA 563
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 26.9 bits (60), Expect = 7.6
Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 4/74 (5%)
Query: 153 PLAQME-EEKREHEAKMKKMEVDMEQVFEMKVKE--KKQKLKDSEIDMKKSLEAQIKELE 209
PL + K E EA + E++ +MK KQK+K + KK+ + + +E
Sbjct: 65 PLRDLSLHPKDESEAAPDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIE 124
Query: 210 EKRRGL-ELEISQW 222
+ R+ L E E +
Sbjct: 125 KLRKKLAEQEAALL 138
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 27.0 bits (60), Expect = 7.9
Identities = 21/101 (20%), Positives = 43/101 (42%), Gaps = 12/101 (11%)
Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
A +E E ++ + +++KM+ +++ K+ KL+D M+ + E E K+
Sbjct: 46 SADLESEFKKRQKELQKMQKELKA--------KEAKLQDD-GKMEALSDRAKAEAEIKKE 96
Query: 214 GLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
L +Q +L RR E + L K ++ +
Sbjct: 97 KLVNA-FNKKQQE--YEKDLNRREAEEEQKLLEKIQRAIES 134
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 27.1 bits (60), Expect = 7.9
Identities = 17/63 (26%), Positives = 28/63 (44%)
Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
E E E + + E E+ FE K+KE ++ ++ SE KK E + G+E
Sbjct: 93 EEALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHEFGPETNGVE 152
Query: 217 LEI 219
+
Sbjct: 153 EDT 155
>gnl|CDD|235171 PRK03903, PRK03903, transaldolase; Provisional.
Length = 274
Score = 27.3 bits (61), Expect = 8.0
Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 2/51 (3%)
Query: 151 KNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSL 201
K PL EE ++K +D+E ++ +K+ + K + D+ KSL
Sbjct: 226 KKPL--KIEEIEAFFKELKSHNIDLENTYQKLLKDGLEAFKQAFEDILKSL 274
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 27.7 bits (62), Expect = 8.5
Identities = 14/58 (24%), Positives = 29/58 (50%)
Query: 199 KSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKKGL 256
+ L+ ++K+LEEK LE EI + E+ + ++ ++ ++ + KGL
Sbjct: 98 EELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGL 155
>gnl|CDD|218031 pfam04335, VirB8, VirB8 protein. VirB8 is a bacterial virulence
protein with cytoplasmic, transmembrane, and periplasmic
regions. It is thought that it is a primary constituent
of a DNA transporter. The periplasmic region interacts
with VirB9, VirB10, and itself. This family also
includes the conjugal transfer protein family TrbF, a
family of proteins known to be involved in conjugal
transfer. The TrbF protein is thought to compose part of
the pilus required for transfer. This domain has a
similar fold to the NTF2 protein.
Length = 208
Score = 26.9 bits (60), Expect = 8.5
Identities = 11/31 (35%), Positives = 19/31 (61%)
Query: 51 SSEDDTSKFNKNLRDRVPFAVVGSNTVVEID 81
S+ +++N R PFA +G NTV+E++
Sbjct: 109 STGAVAAEYNAYFRPNSPFARLGKNTVIEVE 139
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 27.0 bits (60), Expect = 8.5
Identities = 25/87 (28%), Positives = 44/87 (50%), Gaps = 4/87 (4%)
Query: 156 QMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMK-KSLEAQIKELEEKRRG 214
+ E E++E E +M++ME DM + + ++ + E++ K K E + + LE+K
Sbjct: 2 EAEREQQELEERMEQMEEDMR---RAQKELEEYEETALELEEKLKQEEEEAQLLEKKADE 58
Query: 215 LELEISQWEQSNGVSMDELRRRSLERD 241
LE E + E+ S +E R E D
Sbjct: 59 LEEENRRLEEEAAASEEERERLEAEVD 85
>gnl|CDD|226799 COG4353, COG4353, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 26.7 bits (59), Expect = 9.2
Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 4/45 (8%)
Query: 202 EAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDG 246
E I++LE R +E I + E +L R+S +R +
Sbjct: 151 EEGIRQLETIREEIENFIIRKED----EGIDLYRKSRDRTVYWEI 191
>gnl|CDD|237250 PRK12893, PRK12893, allantoate amidohydrolase; Reviewed.
Length = 412
Score = 27.2 bits (61), Expect = 9.3
Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 1/44 (2%)
Query: 197 MKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
M+ +L A ++ RG+++ + V D +E
Sbjct: 299 MEAALRAACAKIAA-ARGVQVTVETVWDFPPVPFDPALVALVEA 341
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 27.3 bits (61), Expect = 9.4
Identities = 15/66 (22%), Positives = 35/66 (53%), Gaps = 11/66 (16%)
Query: 155 AQMEEEKREHEAKMKKMEVDMEQ-----------VFEMKVKEKKQKLKDSEIDMKKSLEA 203
ME ++R ++ +K++ ME + E K++E+++ LK+ +SL+
Sbjct: 232 QMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQK 291
Query: 204 QIKELE 209
+I++L+
Sbjct: 292 EIQDLK 297
Score = 27.3 bits (61), Expect = 9.7
Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 8/72 (11%)
Query: 158 EEEKREHEAKMKKMEVDME----QVFEMKVKEKKQKLKDSEIDMKKSLEAQ----IKELE 209
E+++E E M+ E + Q+ E E+++ L + E ++ L+ Q + +
Sbjct: 224 REKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFK 283
Query: 210 EKRRGLELEISQ 221
+ L+ EI
Sbjct: 284 TEAESLQKEIQD 295
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 27.3 bits (60), Expect = 9.8
Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 149 LNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKK-QKLKDSEIDMKKSLE 202
L K L +++EEK++ E +K ME +E + E+K K + + E+D+K +E
Sbjct: 1223 LGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEME 1277
>gnl|CDD|152255 pfam11819, DUF3338, Domain of unknown function (DUF3338). This
family of proteins are functionally uncharacterized.
This family is found in eukaryotes. This presumed domain
is about 130 amino acids in length.
Length = 138
Score = 26.3 bits (58), Expect = 10.0
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQ 188
EE+K+E +KK + +E+ E K++E K+
Sbjct: 26 SEEQKKEKLRALKKKQQALEERLEKKLEELKK 57
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.132 0.366
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,200,282
Number of extensions: 1288458
Number of successful extensions: 2473
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2202
Number of HSP's successfully gapped: 444
Length of query: 257
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 162
Effective length of database: 6,723,972
Effective search space: 1089283464
Effective search space used: 1089283464
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.2 bits)