RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9160
(297 letters)
>gnl|CDD|152645 pfam12210, Hrs_helical, Hepatocyte growth factor-regulated tyrosine
kinase substrate. This domain family is found in
eukaryotes, and is approximately 100 amino acids in
length. The family is found in association with
pfam00790, pfam01363, pfam02809. This domain is the
helical region of Hrs which forms the core complex of
ESCRT with STAM.
Length = 96
Score = 128 bits (322), Expect = 2e-37
Identities = 65/95 (68%), Positives = 78/95 (82%)
Query: 143 DDFIRALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQKDDKRV 202
++F+ L++ VEIFVNRMKSNS RGRSIANDS+VQ+LF I MH +LL + Q D+KR
Sbjct: 1 EEFLNNLRTSVEIFVNRMKSNSMRGRSIANDSAVQSLFQTINAMHPQLLSLMNQLDEKRG 60
Query: 203 YYEGLQDKLTQVKDARAALNALREEHNEKLRREAQ 237
YYEGLQDKL Q+KDARAAL+ALREEH EKLRR A+
Sbjct: 61 YYEGLQDKLAQIKDARAALDALREEHREKLRRAAE 95
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 40.9 bits (97), Expect = 5e-04
Identities = 18/89 (20%), Positives = 45/89 (50%), Gaps = 9/89 (10%)
Query: 208 QDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQ-----MAHKLDLMRKKKQEYL 262
++ L + K+ E E R ++ + +K++ + KL+L+ K+++E
Sbjct: 56 KEALLEAKEE--IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELE 113
Query: 263 QYQRNLA--LQKIQEQEYEMQRRQEQQKQ 289
+ ++ L Q+++++E E++ E+Q Q
Sbjct: 114 KKEKELEQKQQELEKKEEELEELIEEQLQ 142
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 40.0 bits (94), Expect = 0.001
Identities = 30/156 (19%), Positives = 69/156 (44%), Gaps = 12/156 (7%)
Query: 141 ELDDFIRALKSQVEIFVNRMKS--NSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQKD 198
+L++ I +S +E + ++S ++ + S V+ L + + L K +Q+ +
Sbjct: 286 QLEELIDKYESHIEKALEELESILDTEKENSEFKLD-VEELKALLEALEEILEKNLQKLE 344
Query: 199 DKR------VYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLD 252
+K + E + D + + D A+N L EHNEK+ + + + + + L
Sbjct: 345 EKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDN---LKKEKNKAKKKLWLH 401
Query: 253 LMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQK 288
L+ + K++ YQ+ + E + +Q + +
Sbjct: 402 LVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 35.7 bits (83), Expect = 0.029
Identities = 19/88 (21%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 208 QDKLTQVKD-ARAALNALREEHNEKLRREAQIAER---QKQIQMAHKLDLMRKKKQEYLQ 263
++ L + K+ L E E+ R E Q ER Q++ + K++ + KK++ +
Sbjct: 50 KEALLEAKEEVHKLRAELERELKER-RNELQRLERRLLQREETLDRKMESLDKKEENLEK 108
Query: 264 YQRNLALQK--IQEQEYEMQRRQEQQKQ 289
++ L+ ++ + E+E E++ +Q++
Sbjct: 109 KEKELSNKEKNLDEKEEELEELIAEQRE 136
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 35.5 bits (82), Expect = 0.029
Identities = 20/95 (21%), Positives = 39/95 (41%), Gaps = 4/95 (4%)
Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALR---EEHNEKLRREAQIAERQKQIQMAHKL 251
Q + L+D+L ++K+ L LR EE E R ++ ++ K+++
Sbjct: 552 LQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELE-ERLS 610
Query: 252 DLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQ 286
L + L N + +E E E+++ Q
Sbjct: 611 QLEELLQSLELSEAENELEEAEEELESELEKLNLQ 645
Score = 32.4 bits (74), Expect = 0.33
Identities = 16/115 (13%), Positives = 47/115 (40%), Gaps = 6/115 (5%)
Query: 187 HSRLLKYIQQKDDKRVYYEGLQDKLT----QVKDARAALNALREEHNEKLRREAQI--AE 240
L + +++ ++K E L+ ++ +++ RA L L E + E ++ E
Sbjct: 276 LRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLE 335
Query: 241 RQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNN 295
+ + + +L +K + + L + + +E E + + ++ ++
Sbjct: 336 EKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEA 390
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 35.0 bits (81), Expect = 0.045
Identities = 27/148 (18%), Positives = 62/148 (41%), Gaps = 16/148 (10%)
Query: 147 RALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQKDDKRVYYEG 206
++ V IF + + S +S + + + + +L + +++++
Sbjct: 136 PVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEE------ 189
Query: 207 LQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQE----YL 262
L+ L + ++ L+ L EE +L + E+Q +++ + + +RKK +E L
Sbjct: 190 LERALKEKREE--LLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQEL 247
Query: 263 QYQRNLALQK----IQEQEYEMQRRQEQ 286
+ Q QK + Q E+QR +
Sbjct: 248 ERQAEAHEQKLKNELALQAIELQREFNK 275
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 33.7 bits (78), Expect = 0.064
Identities = 24/88 (27%), Positives = 47/88 (53%), Gaps = 11/88 (12%)
Query: 210 KLTQVKDARAALNALREEHNEKL---RREAQIAER---QKQIQMAHKLDLMRKKKQEYLQ 263
L + K+ ++ LR E +L R E Q E+ QK+ + K + + KK++ +
Sbjct: 54 ALLEAKEE---IHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEE 110
Query: 264 YQRNLA--LQKIQEQEYEMQRRQEQQKQ 289
++ LA Q+++E+E E++ E+Q+Q
Sbjct: 111 KEKELAARQQQLEEKEEELEELIEEQQQ 138
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 34.3 bits (79), Expect = 0.087
Identities = 21/85 (24%), Positives = 47/85 (55%), Gaps = 1/85 (1%)
Query: 205 EGLQDKLTQVKDARAALNALREEHNEKLRREAQ-IAERQKQIQMAHKLDLMRKKKQEYLQ 263
E L+++L +++A A L EE EK + + + E +++++ A + +++ E L+
Sbjct: 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLE 820
Query: 264 YQRNLALQKIQEQEYEMQRRQEQQK 288
+R Q+I+E E E++ +E+
Sbjct: 821 QRRERLEQEIEELEEEIEELEEKLD 845
Score = 33.1 bits (76), Expect = 0.21
Identities = 28/162 (17%), Positives = 75/162 (46%), Gaps = 21/162 (12%)
Query: 136 EASDTELDDFIRALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQ 195
+ EL++ +AL+ ++E + + R A + ++ ++ RL + I+
Sbjct: 778 KEEIEELEEKRQALQEELEEL--EEELEEAERRLDALERELE----SLEQRRERLEQEIE 831
Query: 196 QKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKL-RREAQIAERQKQIQMAHKLDLM 254
+ +++ E L++KL ++++ L EE E+L EA+ E + D +
Sbjct: 832 ELEEEI---EELEEKLDELEEELEELEKELEELKEELEELEAEKEELE---------DEL 879
Query: 255 RKKKQEYLQYQRNL--ALQKIQEQEYEMQRRQEQQKQYYMQN 294
++ ++E + + L ++ E + E+++ +E+ ++ +
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKL 921
Score = 31.2 bits (71), Expect = 0.78
Identities = 18/90 (20%), Positives = 38/90 (42%), Gaps = 9/90 (10%)
Query: 204 YEGLQDKLTQVKDARAALNA----LREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQ 259
E LQ++L ++K+ L LRE E ++ ER ++++ + K+
Sbjct: 283 LEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIE-----ALKE 337
Query: 260 EYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
E + + L + E E + + ++K
Sbjct: 338 ELEERETLLEELEQLLAELEEAKEELEEKL 367
Score = 30.8 bits (70), Expect = 1.1
Identities = 16/106 (15%), Positives = 49/106 (46%), Gaps = 2/106 (1%)
Query: 187 HSRLLKYIQQKDDKR-VYYEGLQDKLTQVKDARAALNALREEHNEKLRR-EAQIAERQKQ 244
+ L + ++ ++K E L++ +++ A L A E +L + +I +++
Sbjct: 353 LAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEER 412
Query: 245 IQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQY 290
++ + K++ + L+ + +++E E++ +EQ ++
Sbjct: 413 LERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEEL 458
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 32.2 bits (74), Expect = 0.13
Identities = 18/81 (22%), Positives = 42/81 (51%), Gaps = 3/81 (3%)
Query: 209 DKLTQVKDARAALNALREEHNEKL-RREAQIAERQKQIQMAHKL--DLMRKKKQEYLQYQ 265
+ K A+ L ++ +L ++E ++ + ++++Q + RK KQ+ LQ +
Sbjct: 28 SESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQK 87
Query: 266 RNLALQKIQEQEYEMQRRQEQ 286
+ QK Q + E+Q++Q++
Sbjct: 88 QQELQQKQQAAQQELQQKQQE 108
Score = 29.1 bits (66), Expect = 1.6
Identities = 11/72 (15%), Positives = 31/72 (43%), Gaps = 3/72 (4%)
Query: 208 QDKLTQVKDARAALNALREEHNEKLR--REAQIAERQKQIQ-MAHKLDLMRKKKQEYLQY 264
+ +++ L ++ ++ E +Q+++Q +L ++ Q+ LQ
Sbjct: 45 KKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQ 104
Query: 265 QRNLALQKIQEQ 276
++ LQ I ++
Sbjct: 105 KQQELLQPIYDK 116
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 32.6 bits (75), Expect = 0.18
Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 3/62 (4%)
Query: 217 ARAALNALREEHNEKLRR-EAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQE 275
A A LRE+ E+ + EAQ ER Q + ++ M ++++ L Q + K+QE
Sbjct: 216 AEAEQELLREKQKEEEQMMEAQ--ERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQE 273
Query: 276 QE 277
QE
Sbjct: 274 QE 275
Score = 28.0 bits (63), Expect = 6.4
Identities = 14/65 (21%), Positives = 32/65 (49%), Gaps = 2/65 (3%)
Query: 225 REEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLA-LQKIQEQEYEMQRR 283
+E+ E R +A+ AE ++++ + + + + + YQ ++ L + E E E
Sbjct: 202 KEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKL-L 260
Query: 284 QEQQK 288
EQ++
Sbjct: 261 AEQER 265
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 32.7 bits (75), Expect = 0.24
Identities = 21/86 (24%), Positives = 41/86 (47%), Gaps = 5/86 (5%)
Query: 207 LQDKLTQVKDARAALNALREEHNEKL---RREAQIAERQKQIQMAHKLDLMRKKKQEYLQ 263
L++KL +++ + L EE ++L E E+QKQI +L +++ E L+
Sbjct: 265 LEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANL--ERQLEELE 322
Query: 264 YQRNLALQKIQEQEYEMQRRQEQQKQ 289
Q K+ E E+ +E+ ++
Sbjct: 323 AQLEELESKLDELAEELAELEEKLEE 348
Score = 32.7 bits (75), Expect = 0.25
Identities = 16/109 (14%), Positives = 49/109 (44%), Gaps = 10/109 (9%)
Query: 192 KYIQQKDDKR-----VYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQ 246
+Y + K + R + L++ ++++ + L EE E A++ E +++++
Sbjct: 214 RYKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEEL---TAELQELEEKLE 270
Query: 247 MAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNN 295
+L + + ++E + Q+ L + E Q++ +++ ++
Sbjct: 271 -ELRL-EVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQ 317
Score = 31.6 bits (72), Expect = 0.58
Identities = 17/93 (18%), Positives = 42/93 (45%), Gaps = 4/93 (4%)
Query: 202 VYYEGLQDKLTQVKDARAALNALREEHNE----KLRREAQIAERQKQIQMAHKLDLMRKK 257
+ L++ +++ R L L + + R EA++ + +++I K +
Sbjct: 702 ELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEA 761
Query: 258 KQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQY 290
+ E L+ + A +++ E E E++ + Q +Q
Sbjct: 762 EIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794
Score = 29.6 bits (67), Expect = 2.7
Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 16/83 (19%)
Query: 208 QDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRN 267
++ L +++D L E + L R+A+ AER K++ K E + +
Sbjct: 185 RENLDRLEDILNEL----ERQLKSLERQAEKAERYKEL------------KAELRELELA 228
Query: 268 LALQKIQEQEYEMQRRQEQQKQY 290
L + +++E E++ QE+ K+
Sbjct: 229 LLVLRLEELREELEELQEELKEA 251
Score = 28.9 bits (65), Expect = 4.5
Identities = 13/78 (16%), Positives = 34/78 (43%), Gaps = 3/78 (3%)
Query: 213 QVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQK 272
++ + AL LR+E E Q+ R++ +++ ++ +RK L ++
Sbjct: 692 KIAELEKALAELRKELEELEEELEQL--RKELEELSRQISALRKDLARLEAEVEQLE-ER 748
Query: 273 IQEQEYEMQRRQEQQKQY 290
I + E+ + + ++
Sbjct: 749 IAQLSKELTELEAEIEEL 766
Score = 28.1 bits (63), Expect = 6.9
Identities = 13/94 (13%), Positives = 45/94 (47%), Gaps = 7/94 (7%)
Query: 195 QQKDDKRVYYEGLQDKLT----QVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHK 250
++ ++ + E L+ +L ++++ + L L E+ + AQ+ + + ++
Sbjct: 344 EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN--NE 401
Query: 251 LDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQ 284
++ + + + L+ +R Q+I+E +++ +
Sbjct: 402 IERLEARLER-LEDRRERLQQEIEELLKKLEEAE 434
Score = 27.7 bits (62), Expect = 8.6
Identities = 21/105 (20%), Positives = 45/105 (42%), Gaps = 2/105 (1%)
Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAH--KLD 252
+ ++ E L+ K+ Q++ A+LN E +L R ER +Q KL+
Sbjct: 372 SRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE 431
Query: 253 LMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNNVA 297
K+ + + L+++QE+ ++ E+ ++ + A
Sbjct: 432 EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 31.5 bits (72), Expect = 0.33
Identities = 9/49 (18%), Positives = 23/49 (46%), Gaps = 4/49 (8%)
Query: 247 MAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNN 295
+ K++ +K+++E + R L + + ++RQE M++
Sbjct: 38 IDRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQE----ELMEDQ 82
>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
dihydrolipoamide succinyltransferase (E2 component).
This model describes the TCA cycle 2-oxoglutarate system
E2 component, dihydrolipoamide succinyltransferase. It
is closely related to the pyruvate dehydrogenase E2
component, dihydrolipoamide acetyltransferase. The seed
for this model includes mitochondrial and Gram-negative
bacterial forms. Mycobacterial candidates are highly
derived, differ in having and extra copy of the
lipoyl-binding domain at the N-terminus. They score
below the trusted cutoff, but above the noise cutoff and
above all examples of dihydrolipoamide acetyltransferase
[Energy metabolism, TCA cycle].
Length = 403
Score = 32.0 bits (73), Expect = 0.38
Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 13/69 (18%)
Query: 215 KDARAALNALREEHNEKL-RREAQIAERQKQIQ--MA-----HKLDL-----MRKKKQEY 261
A+A N R E K+ R +IAER K+ Q A +++D+ +RK+ +E
Sbjct: 162 AAAKAPANFTRPEERVKMTRLRQRIAERLKEAQNSTAMLTTFNEVDMSAVMELRKRYKEE 221
Query: 262 LQYQRNLAL 270
+ + + L
Sbjct: 222 FEKKHGVKL 230
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 32.0 bits (73), Expect = 0.45
Identities = 17/95 (17%), Positives = 42/95 (44%), Gaps = 12/95 (12%)
Query: 196 QKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMR 255
Q E L +++++++ + L EE N+K++ + E Q +++
Sbjct: 245 QLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGE--EEQLRVK--------- 293
Query: 256 KKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQY 290
+K L+ + + I E+E E++ +E+ +
Sbjct: 294 -EKIGELEAEIASLERSIAEKERELEDAEERLAKL 327
Score = 30.4 bits (69), Expect = 1.3
Identities = 29/170 (17%), Positives = 64/170 (37%), Gaps = 18/170 (10%)
Query: 140 TELDDFIRALKSQVEIFVNRMKS-----NSSRGRSIANDSSVQTLFVNITTMHSRLLKYI 194
EL+ I + + + R D + +L K
Sbjct: 339 EELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLK 398
Query: 195 QQKDDKRVYYEGLQDKLTQ----VKDARAALNALREEHNE-KLRREAQIAE----RQKQI 245
++ ++ + + LQ++L + + D AA+ + + NE + +E + E K
Sbjct: 399 REINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE 458
Query: 246 QMAHKLDLMRKK----KQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYY 291
Q+A L ++ K+EY + ++ L+ + + E E Q R +++
Sbjct: 459 QLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRG 508
Score = 29.3 bits (66), Expect = 3.2
Identities = 19/104 (18%), Positives = 52/104 (50%), Gaps = 6/104 (5%)
Query: 188 SRLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRR-EAQIAERQKQI- 245
+RL + + + + LQ++ +K+ ++ E N K E ++ E + +
Sbjct: 822 NRLTLEKEYLEKEI---QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALR 878
Query: 246 QMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
+ +L ++K++ E L+ Q +KI+E E ++++++++ +
Sbjct: 879 DLESRLGDLKKERDE-LEAQLRELERKIEELEAQIEKKRKRLSE 921
>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
Length = 110
Score = 30.2 bits (69), Expect = 0.45
Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 251 LDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
L+ +R++K E LQ ++ Q+ QE++ E Q + E QKQ
Sbjct: 3 LEEIRRRKLEELQ-KQAQEQQQAQEEQEEAQAQAEAQKQ 40
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 31.4 bits (71), Expect = 0.53
Identities = 25/141 (17%), Positives = 60/141 (42%), Gaps = 6/141 (4%)
Query: 158 NRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQKDDKRVYYEGLQDKLTQVKDA 217
N++ +S RS+ +D + L + ++L + K + + ++ +L D+
Sbjct: 434 NQISLLTSLERSLKSD--LGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEADS 491
Query: 218 RAALNALREEHNEKLRREAQIAER---QKQIQMAHKLDLMRKKKQEYLQYQRNLALQ-KI 273
R E ++ + E + A R Q + +++ KQ+ + L K+
Sbjct: 492 RVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKL 551
Query: 274 QEQEYEMQRRQEQQKQYYMQN 294
+E+E M ++ Q+ + Y ++
Sbjct: 552 KEEECRMLEKEAQELRKYQES 572
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 31.1 bits (71), Expect = 0.70
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 11/81 (13%)
Query: 209 DKLTQV---KDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQ 265
DKL + + ++ REE EK+ + A+ ERQ++ Q + KKK+E
Sbjct: 246 DKLANLKLSPEVLRKVDKTREEEEEKILKAAE-EERQEEAQEKKE----EKKKEEREAKL 300
Query: 266 RNL---ALQKIQEQEYEMQRR 283
L +K++E+E + Q R
Sbjct: 301 AKLSPEEQRKLEEKERKKQAR 321
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 30.9 bits (70), Expect = 0.82
Identities = 12/58 (20%), Positives = 28/58 (48%), Gaps = 4/58 (6%)
Query: 236 AQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEY----EMQRRQEQQKQ 289
A+ AE Q++ + + + +++K+ + + L +++ QE E +Q KQ
Sbjct: 74 AKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQ 131
>gnl|CDD|216028 pfam00621, RhoGEF, RhoGEF domain. Guanine nucleotide exchange
factor for Rho/Rac/Cdc42-like GTPases Also called
Dbl-homologous (DH) domain. It appears that pfam00169
domains invariably occur C-terminal to RhoGEF/DH
domains.
Length = 179
Score = 30.3 bits (69), Expect = 0.85
Identities = 24/132 (18%), Positives = 53/132 (40%), Gaps = 27/132 (20%)
Query: 139 DTELDDFIRALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTM---HSRLLKYIQ 195
TE ++R LK VE+F+ ++ I ++ ++T+F NI + H L+ ++
Sbjct: 7 QTERS-YVRDLKILVEVFLKPLRE-----SPILSEEEIKTIFSNIEEILELHQEFLEELE 60
Query: 196 QKDDKRVYYEGLQDKLTQVKD--------------ARAALNALREEHNEKLRREAQIAER 241
++ ++ + + D + A L LR++ N + + +
Sbjct: 61 ERLEEWPDIQRIGDIFLKFAPFFKVYSTYCSNYPRALELLKKLRKK-NPRFA---KFLKE 116
Query: 242 QKQIQMAHKLDL 253
+ + LDL
Sbjct: 117 CEASPLCRGLDL 128
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 30.7 bits (70), Expect = 0.94
Identities = 14/66 (21%), Positives = 31/66 (46%), Gaps = 17/66 (25%)
Query: 241 RQKQIQMAHKLDLMRKKKQEY-LQYQRNLA----------------LQKIQEQEYEMQRR 283
RQ+ ++ KL+ MR++ E +Q + N+A + ++ + E++ R
Sbjct: 84 RQRLSRLVAKLEEMRERDLELNVQLKDNIAQLNQEIAEREKAEEARQEAFEQLKNEIKER 143
Query: 284 QEQQKQ 289
+E Q +
Sbjct: 144 EETQIE 149
>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein.
Length = 304
Score = 30.5 bits (69), Expect = 0.98
Identities = 12/56 (21%), Positives = 23/56 (41%), Gaps = 1/56 (1%)
Query: 235 EAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQ-EQEYEMQRRQEQQKQ 289
+A + + ++ A R + E + Q +A IQ EQE + R ++
Sbjct: 39 QAALDRAEAELAAAQADAQARLARAERDRLQALVAQAAIQSEQEADAARAALREAL 94
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 30.6 bits (70), Expect = 1.2
Identities = 15/91 (16%), Positives = 38/91 (41%), Gaps = 11/91 (12%)
Query: 205 EGLQDKLTQVKDARAALNALREEHNEKLRRE-AQIAERQKQIQMAHKLDLMRK------- 256
L++ +++ AL +E EKL+ E + E+ ++ + + ++
Sbjct: 523 ASLEELERELEQKAEEAEALLKEA-EKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581
Query: 257 KKQEYLQYQRNLALQKIQEQEYEMQRRQEQQ 287
K+E + + L+++Q+ Y + E
Sbjct: 582 AKKEADEIIKE--LRQLQKGGYASVKAHELI 610
Score = 29.8 bits (68), Expect = 2.2
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 11/65 (16%)
Query: 227 EHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQ 286
E+L RE + QK + L K K+E + +K + QE E + +E
Sbjct: 523 ASLEELERELE----QKAEEAEALLKEAEKLKEELEE-------KKEKLQEEEDKLLEEA 571
Query: 287 QKQYY 291
+K+
Sbjct: 572 EKEAQ 576
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 30.7 bits (70), Expect = 1.2
Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 4/66 (6%)
Query: 213 QVKDARAALNALREEHNEKLR--REAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLAL 270
+++ + L A E +E + RE ++A RQ+ Q+ ++ + + +L Q AL
Sbjct: 555 ELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQRLAARAPAWLAAQD--AL 612
Query: 271 QKIQEQ 276
+++EQ
Sbjct: 613 ARLREQ 618
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 30.4 bits (70), Expect = 1.2
Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 5/59 (8%)
Query: 203 YYEGLQDKLTQ-VKDAR-AALNALREEHNEKLRREAQIAERQKQIQMA-HKL--DLMRK 256
L++ K R AAL+A++EE E L E + E +K+I+ A KL ++R+
Sbjct: 249 AEAKLKEAYQITDKQEREAALDAIKEEVLEALAAEEEEEEDEKEIKEAFKKLEKKIVRR 307
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 30.5 bits (69), Expect = 1.2
Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 13/80 (16%)
Query: 209 DKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNL 268
+ + ++K A A + L+ +E RREA ER ++I +A K ++ +Y
Sbjct: 356 ESIVRLKQAEADMFQLKA--DEA-RREA---ERLQRIALA-------KTEKSEEEYASKY 402
Query: 269 ALQKIQEQEYEMQRRQEQQK 288
++ E E E Q E+ K
Sbjct: 403 LKLRLSEAEEERQYLFEELK 422
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 30.3 bits (68), Expect = 1.5
Identities = 13/61 (21%), Positives = 25/61 (40%), Gaps = 4/61 (6%)
Query: 193 YIQQKDDKRVYYEGLQDK----LTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMA 248
Y + ++ + Y E + K + A +NA EK++ + +I R Q +A
Sbjct: 164 YKKNEERFKEYLENFKRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRHNQTPLA 223
Query: 249 H 249
Sbjct: 224 L 224
>gnl|CDD|221626 pfam12537, DUF3735, Protein of unknown function (DUF3735). This
domain family is found in eukaryotes, and is
approximately 70 amino acids in length. There is a
conserved LSG sequence motif. There is a single
completely conserved residue G that may be functionally
important.
Length = 71
Score = 27.9 bits (63), Expect = 1.5
Identities = 10/29 (34%), Positives = 17/29 (58%), Gaps = 2/29 (6%)
Query: 233 RREAQIAERQKQIQMAHKLDLMRKKKQEY 261
E+ I +K++ H LDL+ +KK+E
Sbjct: 44 VSESDINNLEKRLW--HTLDLLAEKKKEL 70
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 29.9 bits (68), Expect = 1.5
Identities = 24/116 (20%), Positives = 54/116 (46%), Gaps = 19/116 (16%)
Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALR----EEHNEKLRREAQIAERQKQIQMAHK 250
++K++K L+ + + +D R L+ LR +E E+ R+ + E +K+ + +
Sbjct: 175 ERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQE 234
Query: 251 LDLMRKK----KQEYLQ---------YQRNLALQKIQEQEYEM--QRRQEQQKQYY 291
L R++ K+E LQ +R L Q E+ + ++R+ ++ ++
Sbjct: 235 LQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHR 290
Score = 28.7 bits (65), Expect = 3.9
Identities = 17/77 (22%), Positives = 40/77 (51%), Gaps = 5/77 (6%)
Query: 222 NALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQ-----KIQEQ 276
++ E E+ RR ++ E ++ +A + + RK+K+E + + L Q K +++
Sbjct: 30 KRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQE 89
Query: 277 EYEMQRRQEQQKQYYMQ 293
EYE + ++ +Q ++
Sbjct: 90 EYEERLQEREQMDEIIE 106
Score = 28.3 bits (64), Expect = 4.5
Identities = 17/82 (20%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 213 QVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMR----KKKQEYLQYQRNL 268
+ + A + E+ E E + AE+++ ++ H+ +L + K+++ + + L
Sbjct: 253 ERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEEL 312
Query: 269 A-LQKIQEQEYEMQRRQEQQKQ 289
++++E+E E Q R E+++Q
Sbjct: 313 EEGERLREEEAERQARIEEERQ 334
Score = 28.0 bits (63), Expect = 5.7
Identities = 14/76 (18%), Positives = 40/76 (52%)
Query: 213 QVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQK 272
++++ A + E +KLR E ++ + + + R+++ + L+YQR A ++
Sbjct: 107 RIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAERE 166
Query: 273 IQEQEYEMQRRQEQQK 288
+ + +R++E+++
Sbjct: 167 EEREAERRERKEEKER 182
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.0 bits (68), Expect = 1.6
Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 2/84 (2%)
Query: 205 EGLQDKLTQVKDARAALNALREEH-NEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQ 263
+ T +K AR L L+ E + K + E IAER++Q + + L + KQ Q
Sbjct: 642 RAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLR-QLDAQLKQLLEQ 700
Query: 264 YQRNLALQKIQEQEYEMQRRQEQQ 287
Q L K +E +R + Q
Sbjct: 701 QQAFLEALKDDFRELRTERLAKWQ 724
>gnl|CDD|216828 pfam01984, dsDNA_bind, Double-stranded DNA-binding domain. This
domain is believed to bind double-stranded DNA of 20
bases length.
Length = 105
Score = 28.3 bits (64), Expect = 1.7
Identities = 10/36 (27%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 254 MRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
+R+ + + Q+ + Q+Q+ E Q+++E QKQ
Sbjct: 1 IRRARLA--ELQQQQGGLEEQQQQEEQQQQEEAQKQ 34
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 28.7 bits (65), Expect = 1.8
Identities = 15/68 (22%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 212 TQVKDARAALNALREEHNEK--LRREAQIAERQKQIQ-MAHKLDLMRKKKQEYLQYQRNL 268
+++ L L+E+ + EA +++K++Q + ++K Q+ LQ ++
Sbjct: 32 AELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQE 91
Query: 269 ALQKIQEQ 276
LQKI ++
Sbjct: 92 ELQKILDK 99
Score = 28.7 bits (65), Expect = 2.0
Identities = 19/83 (22%), Positives = 43/83 (51%), Gaps = 15/83 (18%)
Query: 213 QVKDARAALNALREEHNEKLRREAQIAERQKQIQ-MAHKLD----LM----RKKKQEYLQ 263
K A+ L ++ R+A++ + +K++Q + KL + R+KK++ LQ
Sbjct: 15 AGKAAQKQLEKEFKK------RQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQ 68
Query: 264 YQRNLALQKIQEQEYEMQRRQEQ 286
+ +K Q+ + ++Q+RQ++
Sbjct: 69 KKVQEFQRKQQKLQQDLQKRQQE 91
>gnl|CDD|216872 pfam02096, 60KD_IMP, 60Kd inner membrane protein.
Length = 193
Score = 29.0 bits (66), Expect = 1.9
Identities = 11/42 (26%), Positives = 24/42 (57%), Gaps = 7/42 (16%)
Query: 257 KKQEYLQYQRNLALQKIQEQEY--EMQRRQEQQKQYYMQNNV 296
K + LQ + +++IQE +Y + Q+ Q++ + Y ++ V
Sbjct: 30 AKMQELQPE----IKEIQE-KYKDDPQKLQQEMMKLYKEHGV 66
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 29.8 bits (67), Expect = 2.0
Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 8/89 (8%)
Query: 209 DKLTQVKDARAALNALREEHNEKLRREAQIAE-----RQKQIQMAHKLDLMR---KKKQE 260
+ + + R L AL + + +R AQ+ Q+Q Q L+ MR K+K +
Sbjct: 416 AQHAEQRPLRQRLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQ 475
Query: 261 YLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
L + + Q+ + ++ E QR Q Q Q
Sbjct: 476 QLADVKTICEQEARIKDLEAQRAQLQAGQ 504
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 29.4 bits (66), Expect = 2.1
Identities = 19/82 (23%), Positives = 37/82 (45%)
Query: 208 QDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRN 267
Q KL Q + A + ++L + A + KQ + A K ++K+ E + ++
Sbjct: 70 QKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQA 129
Query: 268 LALQKIQEQEYEMQRRQEQQKQ 289
+ E E E + ++E +KQ
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQ 151
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 29.5 bits (66), Expect = 2.2
Identities = 20/96 (20%), Positives = 43/96 (44%), Gaps = 3/96 (3%)
Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRRE-AQIAERQKQIQMAHKLDL 253
Q+ + + + K QV + A +E ++L +E + E+QKQ + A K
Sbjct: 69 SQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQ 128
Query: 254 MRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
+ +K+QE + R A ++ ++ E + + +
Sbjct: 129 LEQKQQE--EQARKAAAEQKKKAEAAKAKAAAEAAK 162
Score = 28.0 bits (62), Expect = 6.1
Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 3/87 (3%)
Query: 204 YEGLQDKLTQVKDARAALNALREEHNEKLR-REAQIAERQKQIQMAHKLDLMRKKKQEYL 262
Y +Q + + K E+ E+L+ ++A ER KQ++ ++K+ E
Sbjct: 64 YGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEA 123
Query: 263 QYQRNLALQKIQEQEYEMQRRQEQQKQ 289
+ Q L ++ +EQ + EQ+K+
Sbjct: 124 EKQAQLEQKQQEEQ--ARKAAAEQKKK 148
>gnl|CDD|217089 pfam02535, Zip, ZIP Zinc transporter. The ZIP family consists of
zinc transport proteins and many putative metal
transporters. The main contribution to this family is
from the Arabidopsis thaliana ZIP protein family these
proteins are responsible for zinc uptake in the plant.
Also found within this family are C. elegans proteins of
unknown function which are annotated as being similar to
human growth arrest inducible gene product, although
this protein in not found within this family.
Length = 314
Score = 29.2 bits (66), Expect = 2.3
Identities = 12/61 (19%), Positives = 16/61 (26%), Gaps = 3/61 (4%)
Query: 34 HGPHHGQSGYHHQEEDTGNSNPELAKYLDRSYWEQKKVPPNHGSHHGQSGYHHQEEDTGN 93
G HG H E + E S + HH G+ E D +
Sbjct: 104 RGHSHGHDHGHDGEHSHTVPDEESGA--VESNVVHHEHGEGP-HHHHHEGHEKGESDKES 160
Query: 94 S 94
Sbjct: 161 G 161
>gnl|CDD|176480 cd08643, DNA_pol_A_pol_I_B, Polymerase I functions primarily to
fill DNA gaps that arise during DNA repair,
recombination and replication. Family A polymerase
functions primarily to fill DNA gaps that arise during
DNA repair, recombination and replication. DNA-dependent
DNA polymerases can be classified in six main groups
based upon phylogenetic relationships with E. coli
polymerase I (classA), E. coli polymerase II (class B),
E.coli polymerase III (class C), euryarchaaeota
polymerase II (class D), human polymerase beta (class
x), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma
pigmentosum variant (class Y). Family A polymerase are
found primarily in organisms related to prokaryotes and
include prokaryotic DNA polymerase I ,mitochondrial
polymerase delta, and several bacteriphage polymerases
including those from odd-numbered phage (T3, T5, and
T7). Prokaryotic Pol Is have two functional domains
located on the same polypeptide; a 5'-3' polymerase and
5'-3' exonuclease. Pol I uses its 5' nuclease activity
to remove the ribonucleotide portion of newly
synthesized Okazaki fragments and DNA polymerase
activity to fill in the resulting gap. A combination of
phylogenomic and signature sequence-based (or phonetic)
approaches is used to understand the evolutionary
relationships among bacteria. DNA polymerase I is one of
the conserved proteins that is used to search for
protein signatures. The structure of these polymerases
resembles in overall morphology a cupped human right
hand, with fingers (which bind an incoming nucleotide
and interact with the single-stranded template), palm
(which harbors the catalytic amino acid residues and
also binds an incoming dNTP) and thumb (which binds
double-stranded DNA) subdomains.
Length = 429
Score = 29.3 bits (66), Expect = 2.5
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 189 RLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNAL 224
+L+K +++ KR + GL + +V+ A AALN L
Sbjct: 306 KLIKKVKEAAKKRGHLVGLDGRRIRVRSAHAALNTL 341
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 29.4 bits (66), Expect = 2.5
Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 9/55 (16%)
Query: 235 EAQIAERQKQI-QMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQK 288
+ A+ QKQ ++ +LD++ K++ + + +I++ + EQ K
Sbjct: 75 QVTAAQMQKQYEEIRRELDVLNKQRGDDQR--------RIEKLGQDNAALAEQVK 121
>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869). This
bacterial family of proteins has no known function.
Length = 131
Score = 28.3 bits (64), Expect = 2.6
Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Query: 226 EEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQ-EYEMQRRQ 284
E E+L+ E Q Q + ++ L+ +K + ++ Q +++IQ+Q E E +R
Sbjct: 15 ERWKEELQAELQEQLTQLEQELQQ-LEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAKRL 73
Query: 285 EQQKQYYMQ 293
EQ+ Q Q
Sbjct: 74 EQKNQLLFQ 82
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 29.4 bits (66), Expect = 2.7
Identities = 23/106 (21%), Positives = 48/106 (45%), Gaps = 10/106 (9%)
Query: 192 KYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKL 251
Y + K++ V E L K+ + EH EKL +E + +++ + +M +L
Sbjct: 505 FYGEFKEEINVLIEKLSA---LEKELE-----QKNEHLEKLLKEQEKLKKELEQEME-EL 555
Query: 252 DLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNNVA 297
+ K+ L+ + AL K ++E E R+ ++K+ + +
Sbjct: 556 KERERNKKLELEKEAQEAL-KALKKEVESIIRELKEKKIHKAKEIK 600
>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 220 and
257 amino acids in length.
Length = 221
Score = 28.8 bits (65), Expect = 2.7
Identities = 17/73 (23%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 204 YEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQ 263
YE + +++ ++ + +E+ E++ R QI E +K+I K ++K+KQ +
Sbjct: 150 YESIVNQIIKLNASSNTGVYEKEDLKERVERLEQIEELEKEIAKLKK--KLKKEKQFNRK 207
Query: 264 YQRNLALQKIQEQ 276
+ NL ++K++++
Sbjct: 208 VELNLEIKKLKKE 220
>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203).
This domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 120
Score = 28.0 bits (63), Expect = 2.8
Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 2/64 (3%)
Query: 214 VKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNL--ALQ 271
+++A L LR E +A++ E + + + R K+E + + + A+
Sbjct: 5 LEEANELLPELRRLLEEVREAKAELEELEAALSEGERSLEARGLKEELRELEEQIRAAIA 64
Query: 272 KIQE 275
+I+E
Sbjct: 65 EIEE 68
>gnl|CDD|220777 pfam10486, PI3K_1B_p101, Phosphoinositide 3-kinase gamma adapter
protein p101 subunit. Class I PI3Ks are dual-specific
lipid and protein kinases involved in numerous
intracellular signaling pathways. Class IB PI3K,
p110gamma, is mainly activated by seven-transmembrane
G-protein-coupled receptors (GPCRs), through its
regulatory subunit p101 and G-protein beta-gamma
subunits.
Length = 856
Score = 29.2 bits (65), Expect = 2.9
Identities = 17/67 (25%), Positives = 28/67 (41%)
Query: 137 ASDTELDDFIRALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQ 196
A ELD + L V V R S S + + + S ++ V I SRL
Sbjct: 736 AKYEELDSSMECLNLNVTEVVKRQNSKSKKSFNQISTSQIKVDKVQIIGRDSRLFALCLD 795
Query: 197 KDDKRVY 203
+D++++
Sbjct: 796 QDEQKIL 802
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 28.9 bits (65), Expect = 3.2
Identities = 28/170 (16%), Positives = 67/170 (39%), Gaps = 25/170 (14%)
Query: 140 TELDDFIRALKSQVEIFVNRMK-SNSSRGRSIAND--SSVQTLFVNITTMHS----RLLK 192
EL ++ ++QVE F + ++++RG+ ++ S++ T + + RL
Sbjct: 205 EELRARLQEAEAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQLQSARARLAQAEARLAS 264
Query: 193 YIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAER------QKQIQ 246
+Q R L +V ++ + LR+++ + ++ A ++ Q
Sbjct: 265 LLQLLPLGRE-----AAALREVLESPT-IQDLRQQYAQVRQQIADLSTELGAKHPQLVAL 318
Query: 247 MAHKLDLMRKKKQEYLQY------QRNLALQKIQEQEYEMQRRQEQQKQY 290
A +L ++ E Q + L Q+ E E+ + + + +
Sbjct: 319 EAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKL 368
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 29.0 bits (65), Expect = 3.3
Identities = 20/116 (17%), Positives = 45/116 (38%), Gaps = 12/116 (10%)
Query: 187 HSRLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAER----- 241
+ R+ + Q +D L+ +L L E+ + L+ + +E
Sbjct: 480 NERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSEDSSLLK 539
Query: 242 ---QKQIQMAHKLDLMRKKKQEYL----QYQRNLALQKIQEQEYEMQRRQEQQKQY 290
++ ++ H+ + +KK+E + Q +KI E E +Q++ E +
Sbjct: 540 SKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDMRAM 595
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 28.0 bits (63), Expect = 3.5
Identities = 18/87 (20%), Positives = 49/87 (56%), Gaps = 11/87 (12%)
Query: 209 DKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQY---- 264
+K ++++ + AL+A REE RRE + +R++ ++ K + +++ ++ ++
Sbjct: 4 EKRREMEEVQLALDAKREEFE---RREELLKQREE--ELEKKEEELQESLIKFDKFLKEN 58
Query: 265 --QRNLALQKIQEQEYEMQRRQEQQKQ 289
+R A +K +E++ + ++E+ K+
Sbjct: 59 EAKRRRAEKKAEEEKKLRKEKEEEIKE 85
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 28.8 bits (65), Expect = 3.5
Identities = 18/67 (26%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 207 LQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQR 266
++ ++ V ++ A+L LREE NE+L++E E + Q + + R ++ L +
Sbjct: 57 PREFVSGVFESLASLFDLREE-NEELKKELLELESRLQELEQLEAENARLREL--LNLKE 113
Query: 267 NLALQKI 273
+L Q I
Sbjct: 114 SLDYQFI 120
>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF. Sequences
in this family of proteins are members of the chain
length determinant family (pfam02706) which includes the
wzc protein from E.coli. This family of proteins are
homologous to the EpsF protein of the methanolan
biosynthesis operon of Methylobacillus species strain
12S. The distribution of this protein appears to be
restricted to a subset of exopolysaccharide operons
containing a syntenic grouping of genes including a
variant of the EpsH exosortase protein. Exosortase has
been proposed to be involved in the targetting and
processing of proteins containing the PEP-CTERM domain
to the exopolysaccharide layer.
Length = 444
Score = 28.6 bits (64), Expect = 3.8
Identities = 35/179 (19%), Positives = 71/179 (39%), Gaps = 22/179 (12%)
Query: 130 GVTSPDEASDTE---LDDFIRALK-SQVEIFVNRMKSNSSRGRS----IANDSSVQTLFV 181
G+ S DE D E L++ L +Q ++ K S G+ + + +Q L
Sbjct: 202 GIVSSDERLDVERARLNELSAQLVAAQAQVMDASSKEGGSSGKDALPEVIANPIIQNLKT 261
Query: 182 NITTMHSRLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAER 241
+I S+L + Q+ Y+ Q A +N+L+ + N ++++
Sbjct: 262 DIARAESKLAELSQRLGPNHPQYKRAQ----------AEINSLKSQLNAEIKKVTSSVGT 311
Query: 242 QKQIQMAHKLDL---MRKKKQEYLQYQRNLALQKIQEQEYEM-QRRQEQQKQYYMQNNV 296
+I + +L + +K + L+ R + +++ E QR + Q Y Q +
Sbjct: 312 NSRILKQREAELREALENQKAKVLELNRQRDEMSVLQRDVENAQRAYDAAMQRYTQTRI 370
>gnl|CDD|153312 cd07628, BAR_Atg24p, The Bin/Amphiphysin/Rvs (BAR) domain of yeast
Sorting Nexin Atg24p. BAR domains are dimerization,
lipid binding and curvature sensing modules found in
many different proteins with diverse functions. Sorting
nexins (SNXs) are Phox homology (PX) domain containing
proteins that are involved in regulating membrane
traffic and protein sorting in the endosomal system.
SNXs differ from each other in their lipid-binding
specificity, subcellular localization and specific
function in the endocytic pathway. A subset of SNXs also
contain BAR domains. The PX-BAR structural unit
determines the specific membrane targeting of SNXs.
Atg24p is involved in membrane fusion events at the
vacuolar surface during pexophagy. BAR domains form
dimers that bind to membranes, induce membrane bending
and curvature, and may also be involved in
protein-protein interactions.
Length = 185
Score = 28.4 bits (64), Expect = 3.8
Identities = 20/74 (27%), Positives = 37/74 (50%), Gaps = 14/74 (18%)
Query: 183 ITTMHSRLLKYIQQ-------KDDKRVYYEGLQDKLTQ--VKDARAALNALREEHNEKLR 233
+T++ LL YI +D K++ YE L D L V++A+ +A N+++
Sbjct: 89 LTSLKD-LLHYILSLKNLIKLRDQKQLDYEELSDYLLTDEVENAKETSDAF----NKEVL 143
Query: 234 REAQIAERQKQIQM 247
+E ER K+ ++
Sbjct: 144 KEYPNFERIKKQEI 157
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 28.9 bits (65), Expect = 3.9
Identities = 21/101 (20%), Positives = 45/101 (44%), Gaps = 2/101 (1%)
Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAE-RQKQIQMAHKLDL 253
++ ++ R L KL ++K+ +L +R E +I R+K+ +A D
Sbjct: 565 EEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDE 624
Query: 254 MRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQN 294
R++ E + +R L + E E R +++ + Y++
Sbjct: 625 RRERLAEKRERKRELE-AEFDEARIEEAREDKERAEEYLEQ 664
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.7 bits (65), Expect = 4.0
Identities = 21/79 (26%), Positives = 35/79 (44%), Gaps = 12/79 (15%)
Query: 212 TQVKDARAALNALREEHNEKLRRE-----AQIAERQKQIQMAHKLDLMRKKKQEYLQYQR 266
+K + ALR L+ E AQ ++K ++ +L Q+ LQ QR
Sbjct: 177 NLLKGGKVGGKALRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQL-------QDLLQKQR 229
Query: 267 NLALQKIQEQEYEMQRRQE 285
+ +IQ E+++Q QE
Sbjct: 230 DYLTARIQRLEHQLQLLQE 248
>gnl|CDD|238945 cd01987, USP_OKCHK, USP domain is located between the N-terminal
sensor domain and C-terminal catalytic domain of this
Osmosensitive K+ channel histidine kinase family. The
family of KdpD sensor kinase proteins regulates the
kdpFABC operon responsible for potassium transport. The
USP domain is homologous to the universal stress protein
Usp Usp is a small cytoplasmic bacterial protein whose
expression is enhanced when the cell is exposed to
stress agents. Usp enhances the rate of cell survival
during prolonged exposure to such conditions, and may
provide a general "stress endurance" activity.
Length = 124
Score = 27.5 bits (62), Expect = 4.2
Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 8/49 (16%)
Query: 228 HNEKL-RREAQIAERQK------QIQMAHKLDLMRKKKQEYLQYQRNLA 269
+ E+L RR A++A+R K ++ L+ + + ++ L LA
Sbjct: 11 NAERLIRRAARLADRLKAPWYVVYVETPR-LNRLSEAERRRLAEALRLA 58
>gnl|CDD|237666 PRK14296, PRK14296, chaperone protein DnaJ; Provisional.
Length = 372
Score = 28.8 bits (64), Expect = 4.3
Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 1/69 (1%)
Query: 200 KRVYYEGLQ-DKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKK 258
K+ YYE L K ++ R A L ++++ L + ++ +I A + L + K+
Sbjct: 3 KKDYYEVLGVSKTASEQEIRQAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKR 62
Query: 259 QEYLQYQRN 267
++Y Q+
Sbjct: 63 KQYDQFGHA 71
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 28.2 bits (63), Expect = 4.5
Identities = 19/88 (21%), Positives = 44/88 (50%), Gaps = 1/88 (1%)
Query: 207 LQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMA-HKLDLMRKKKQEYLQYQ 265
L+ K+ + + +A R + RR+ + Q++ Q A ++ +L RK+ Q+ L+ Q
Sbjct: 68 LEAKIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQ 127
Query: 266 RNLALQKIQEQEYEMQRRQEQQKQYYMQ 293
R + ++ QE + R++ ++ +
Sbjct: 128 RRQNEELLKMQEESVLRQEAMRRATEEE 155
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 28.1 bits (63), Expect = 4.7
Identities = 13/50 (26%), Positives = 27/50 (54%), Gaps = 8/50 (16%)
Query: 240 ERQKQIQMAHKLDLMR--KKKQEYLQYQRNL-ALQKIQEQEYEMQRRQEQ 286
E+ K++Q +M+ +K+ Q ++ L+K+QE++ EM Q +
Sbjct: 72 EKMKELQ-----KMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRE 116
>gnl|CDD|185267 PRK15369, PRK15369, two component system sensor kinase SsrB;
Provisional.
Length = 211
Score = 27.7 bits (62), Expect = 6.0
Identities = 22/74 (29%), Positives = 29/74 (39%), Gaps = 13/74 (17%)
Query: 209 DKLTQVKDARAALNALREEHNEKL----RREAQIAER----QKQIQMAHKLDLMRK---- 256
L D L RE KL IAE+ K ++ H+L++MRK
Sbjct: 137 LALLNADDTNPPLLTPRERQILKLITEGYTNRDIAEQLSISIKTVE-THRLNMMRKLDVH 195
Query: 257 KKQEYLQYQRNLAL 270
K E L + R L L
Sbjct: 196 KVAELLNWARRLGL 209
>gnl|CDD|222504 pfam14027, DUF4243, Protein of unknown function (DUF4243). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria and
eukaryotes. Proteins in this family are typically
between 348 and 477 amino acids in length.
Length = 322
Score = 28.1 bits (63), Expect = 6.2
Identities = 11/45 (24%), Positives = 15/45 (33%), Gaps = 5/45 (11%)
Query: 33 NHGPHHGQSGYHHQEEDTGNSNPELAKYLDRSYWEQKKVPPNHGS 77
NH PH + Y G S +L + DR + P
Sbjct: 4 NHLPHALLALYAL-----GASPEQLQRAYDRYSSYLEPAVPPDED 43
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 27.7 bits (62), Expect = 6.5
Identities = 11/38 (28%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 251 LDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQK 288
L +R++++++ Q +RN L + Q E +RR+E+++
Sbjct: 154 LAELRQQQRQFEQ-RRNAELAETQRLEEAERRRREEKE 190
>gnl|CDD|169691 PRK09173, PRK09173, F0F1 ATP synthase subunit B; Validated.
Length = 159
Score = 27.4 bits (61), Expect = 6.5
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 22/82 (26%)
Query: 216 DARAALNALREEHNE--KLRREAQ--IAERQKQIQMAHK----------------LDLMR 255
DARA ++ E E +LR EAQ +AE Q++ + A K +
Sbjct: 32 DARAD--RIKNELAEARRLREEAQQLLAEYQRKRKEAEKEAADIVAAAEREAEALTAEAK 89
Query: 256 KKKQEYLQYQRNLALQKIQEQE 277
+K +EY+ + LA QKI + E
Sbjct: 90 RKTEEYVARRNKLAEQKIAQAE 111
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 28.1 bits (63), Expect = 6.7
Identities = 15/83 (18%), Positives = 33/83 (39%), Gaps = 13/83 (15%)
Query: 189 RLLKYIQQKDDKRVYYEGLQDKL-----------TQVKDARAALNALREEHNEKLRREAQ 237
+L+ +Q +DK YE L++ + Q K L ++ K A+
Sbjct: 817 YILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAE 876
Query: 238 IAERQKQIQMAHKLDLMRKKKQE 260
+ ++ +A K + ++ +E
Sbjct: 877 --KLEELYILAKKEEEFKQFAEE 897
>gnl|CDD|219139 pfam06691, DUF1189, Protein of unknown function (DUF1189). This
family consists of several hypothetical bacterial
proteins of around 260 residues in length. The function
of this family is unknown.
Length = 250
Score = 27.7 bits (62), Expect = 6.8
Identities = 7/37 (18%), Positives = 14/37 (37%)
Query: 121 TDHHMNHTSGVTSPDEASDTELDDFIRALKSQVEIFV 157
+ + S + +L +FI L+S I +
Sbjct: 124 ANGQVQEYSYSLLGLSLTKKDLQNFINQLQSLFGIII 160
>gnl|CDD|182745 PRK10803, PRK10803, tol-pal system protein YbgF; Provisional.
Length = 263
Score = 27.8 bits (62), Expect = 6.9
Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 16/63 (25%)
Query: 183 ITTMHSRLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQ 242
I+ HS+LL +QQ Q+ D ++ +++LR + E + Q+ ERQ
Sbjct: 48 ISNAHSQLLTQLQQ----------------QLSDNQSDIDSLRGQIQENQYQLNQVVERQ 91
Query: 243 KQI 245
KQI
Sbjct: 92 KQI 94
>gnl|CDD|200475 cd11336, AmyAc_MTSase, Alpha amylase catalytic domain found in
maltooligosyl trehalose synthase (MTSase).
Maltooligosyl trehalose synthase (MTSase) domain. MTSase
and maltooligosyl trehalose trehalohydrolase (MTHase)
work together to produce trehalose. MTSase is
responsible for converting the alpha-1,4-glucosidic
linkage to an alpha,alpha-1,1-glucosidic linkage at the
reducing end of the maltooligosaccharide through an
intramolecular transglucosylation reaction, while MTHase
hydrolyzes the penultimate alpha-1,4 linkage of the
reducing end, resulting in the release of trehalose. The
Alpha-amylase family comprises the largest family of
glycoside hydrolases (GH), with the majority of enzymes
acting on starch, glycogen, and related oligo- and
polysaccharides. These proteins catalyze the
transformation of alpha-1,4 and alpha-1,6 glucosidic
linkages with retention of the anomeric center. The
protein is described as having 3 domains: A, B, C. A is
a (beta/alpha) 8-barrel; B is a loop between the beta 3
strand and alpha 3 helix of A; C is the C-terminal
extension characterized by a Greek key. The majority of
the enzymes have an active site cleft found between
domains A and B where a triad of catalytic residues
(Asp, Glu and Asp) performs catalysis. Other members of
this family have lost the catalytic activity as in the
case of the human 4F2hc, or only have 2 residues that
serve as the catalytic nucleophile and the acid/base,
such as Thermus A4 beta-galactosidase with 2 Glu
residues (GH42) and human alpha-galactosidase with 2 Asp
residues (GH31). The family members are quite extensive
and include: alpha amylase, maltosyltransferase,
cyclodextrin glycotransferase, maltogenic amylase,
neopullulanase, isoamylase, 1,4-alpha-D-glucan
maltotetrahydrolase, 4-alpha-glucotransferase,
oligo-1,6-glucosidase, amylosucrase, sucrose
phosphorylase, and amylomaltase.
Length = 660
Score = 27.8 bits (63), Expect = 7.1
Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 10/53 (18%)
Query: 216 DARAALNALRE---EHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQ 265
D RA L L E E E +RR ++ + D EYL YQ
Sbjct: 453 DVRARLAVLSELPEEWAEAVRRWRRLNAPLRTGPAPDPND-------EYLLYQ 498
>gnl|CDD|184604 PRK14283, PRK14283, chaperone protein DnaJ; Provisional.
Length = 378
Score = 27.9 bits (62), Expect = 7.1
Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 1/67 (1%)
Query: 199 DKRVYYEGLQ-DKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKK 257
+KR YYE L D+ K+ + A L +++ + E E+ K+I A+ + +K
Sbjct: 3 EKRDYYEVLGVDRNADKKEIKKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEK 62
Query: 258 KQEYLQY 264
+Q Y Q+
Sbjct: 63 RQRYDQF 69
>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Proteins. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Vetebrates contain two
Proline-Serine-Threonine Phosphatase-Interacting
Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
mainly expressed in hematopoietic cells and are involved
in the regulation of cell adhesion and motility.
Mutations in PSTPIPs have been shown to cause
autoinflammatory disorders. PSTPIP1 contains an
N-terminal F-BAR domain, PEST motifs, and a C-terminal
SH3 domain, while PSTPIP2 contains only the N-terminal
F-BAR domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 239
Score = 27.4 bits (61), Expect = 7.3
Identities = 10/48 (20%), Positives = 24/48 (50%)
Query: 229 NEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQ 276
E+ + + E+QK+ + + + R +K + Y++ + +K EQ
Sbjct: 84 REEAEKLEEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQ 131
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 27.8 bits (62), Expect = 7.9
Identities = 11/84 (13%), Positives = 39/84 (46%), Gaps = 6/84 (7%)
Query: 215 KDARAALNALREEHNEKL------RREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNL 268
K + +E E+L + E + A+ ++++++ +L+ +++ + N
Sbjct: 112 KLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADVDVNS 171
Query: 269 ALQKIQEQEYEMQRRQEQQKQYYM 292
L+ + +E + + +E++ + +
Sbjct: 172 MLEALFRREKKEEEEEEEEDEALI 195
>gnl|CDD|198287 cd03177, GST_C_Delta_Epsilon, C-terminal, alpha helical domain of
Class Delta and Epsilon Glutathione S-transferases.
Glutathione S-transferase (GST) C-terminal domain
family, Class Delta and Epsilon subfamily; GSTs are
cytosolic dimeric proteins involved in cellular
detoxification by catalyzing the conjugation of
glutathione (GSH) with a wide range of endogenous and
xenobiotic alkylating agents, including carcinogens,
therapeutic drugs, environmental toxins and products of
oxidative stress. GSTs also show GSH peroxidase activity
and are involved in the synthesis of prostaglandins and
leukotrienes. The GST fold contains an N-terminal
thioredoxin-fold domain and a C-terminal alpha helical
domain, with an active site located in a cleft between
the two domains. GSH binds to the N-terminal domain
while the hydrophobic substrate occupies a pocket in the
C-terminal domain. The class Delta and Epsilon subfamily
is made up primarily of insect GSTs, which play major
roles in insecticide resistance by facilitating
reductive dehydrochlorination of insecticides or
conjugating them with GSH to produce water-soluble
metabolites that are easily excreted. They are also
implicated in protection against cellular damage by
oxidative stress.
Length = 117
Score = 26.7 bits (60), Expect = 8.1
Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 12/64 (18%)
Query: 168 RSIANDSSVQTLFVNITTMHSRLLKYIQQKDDKRVYYEGL---QDKLTQVKDARAALNAL 224
R+I N Q LF + T++ RL Y + + G ++KL ++++A L
Sbjct: 3 RAIVN----QRLFFDSGTLYQRLRDYYY-----PILFGGAEPPEEKLDKLEEALEFLETF 53
Query: 225 REEH 228
E
Sbjct: 54 LEGS 57
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 27.7 bits (61), Expect = 9.9
Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 2/55 (3%)
Query: 236 AQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLA-LQKIQEQEYEMQRRQEQQKQ 289
A + E KQ Q+ + L+ + K+E L+Y LA L Q +E+ + ++
Sbjct: 526 ALLKEEAKQRQLQEQKALL-EHKKETLEYTSQLAELLDQQADRFELSAQAAGSQK 579
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 26.9 bits (60), Expect = 10.0
Identities = 15/102 (14%), Positives = 44/102 (43%), Gaps = 20/102 (19%)
Query: 189 RLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKL---RREAQ------IA 239
+LK + ++ K + D L + + + AL E+ ++L R +A
Sbjct: 30 PILKALDERQAK------IADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKK 83
Query: 240 ERQKQIQMAHK-----LDLMRKKKQEYLQYQRNLALQKIQEQ 276
E ++ + L+ +++ + ++ ++ AL++++ +
Sbjct: 84 EAEQIAEEIKAEAEEELERIKEAAEAEIEAEKERALEELRAE 125
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.126 0.360
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,758,560
Number of extensions: 1376156
Number of successful extensions: 2176
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1938
Number of HSP's successfully gapped: 314
Length of query: 297
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 201
Effective length of database: 6,679,618
Effective search space: 1342603218
Effective search space used: 1342603218
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 59 (26.4 bits)