RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy318
(438 letters)
>gnl|CDD|224402 COG1485, COG1485, Predicted ATPase [General function prediction
only].
Length = 367
Score = 337 bits (867), Expect = e-113
Identities = 131/387 (33%), Positives = 200/387 (51%), Gaps = 57/387 (14%)
Query: 11 PSKQYYEKLSNNEIQEDKHQEQIVKQLDNVYVSIKNYAPQSKSMFSFFQDKVKQPKGLYI 70
P ++Y + + Q D Q LD +Y + K++ F +GLY+
Sbjct: 11 PVERYAQLVPAGTFQPDPAQPAAAAALDRLYDELVAPRSARKALGWLFGRDHGPVRGLYL 70
Query: 71 YGAVGGGKTMLMDIFYESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVP 130
+G VG GKTMLMD+FYES ++K+R+HF++FM VH ++H ++ G
Sbjct: 71 WGGVGRGKTMLMDLFYESLPGERKRRLHFHRFMARVHQRLHTLQ-------------GQT 117
Query: 131 NRLDVMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDIISKTWLICFDEFQ 190
+ P+PP+A+++ ++T ++CFDEF+
Sbjct: 118 D-------------------------------------PLPPIADELAAETRVLCFDEFE 140
Query: 191 VTDIADAMILKRLFTELFQLGVVVVATSNRAPDDLYKNGLQRSNFLPFIDVLKTYCDVAS 250
VTDIADAMIL RL LF GVV+VATSN APD+LYK+GLQR FLP ID++K++ +V +
Sbjct: 141 VTDIADAMILGRLLEALFARGVVLVATSNTAPDNLYKDGLQRERFLPAIDLIKSHFEVVN 200
Query: 251 LNSNIDYRSLKANAEESSTKTYFVK-NFANEKKLHGIFKLLCSQENDIVRPRVITIMGRN 309
++ +DYR K Y + E L ++ L + + I GR
Sbjct: 201 VDGPVDYRLRKLEQAP----VYLTPLDAEAEAALDKLWAALSDGAPE--AAANLEIKGRE 254
Query: 310 VTFNKTCGQILEATFDELCSRDLGSSDYLHICQIFHTVIIRNVPQLNIKLRSQSRRFITL 369
+ G +L F +LC +SDYL + + FHTV + +VPQ++ R ++RRFI L
Sbjct: 255 IRVPAVAGGLLWFDFAQLCEAPRSASDYLALAERFHTVFLTDVPQMDPLDRDEARRFIAL 314
Query: 370 IDALYDNNIRLVISSDVPLNKLFSNEA 396
+D LYD ++LV S++VPLN+L+
Sbjct: 315 VDELYDRGVKLVASAEVPLNELYQGGR 341
>gnl|CDD|217821 pfam03969, AFG1_ATPase, AFG1-like ATPase. This family of proteins
contains a P-loop motif and are predicted to be ATPases.
Length = 361
Score = 274 bits (703), Expect = 4e-89
Identities = 118/389 (30%), Positives = 182/389 (46%), Gaps = 61/389 (15%)
Query: 11 PSKQYYEKLSNNEIQEDKHQEQIVKQLDNVY--VSIKNYAPQSKSMFSFFQDKVKQ--PK 66
P ++Y +L I D Q V LD +Y + ++ QS + + K +
Sbjct: 3 PPQRYTAQLQRGAIFPDVAQANAVPALDRLYQRLQAADFVRQSGAGGKLWGRKRSHQPVR 62
Query: 67 GLYIYGAVGGGKTMLMDIFYESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYD 126
GLY++G VG GKT LMD F+ES ++K+R HF++FM VH
Sbjct: 63 GLYLWGGVGRGKTHLMDSFFESLPGERKRRTHFHRFMFRVHD------------------ 104
Query: 127 PGVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDIISKTWLICF 186
L ++ + DP+P A+ ++ L+CF
Sbjct: 105 -----ELTTLQGGD---------------------------DPLPIAADRFANEARLLCF 132
Query: 187 DEFQVTDIADAMILKRLFTELFQLGVVVVATSNRAPDDLYKNGLQRSNFLPFIDVLKTYC 246
DEF+V DI DAMIL RLF LF GV +VATSN AP+ LY+NGL R FLP ID+L+++
Sbjct: 133 DEFEVDDIGDAMILGRLFEALFARGVSLVATSNTAPEQLYRNGLNRQRFLPAIDLLESHF 192
Query: 247 DVASLNSNIDYRSLKANAEESSTKTYFVKNFANEKKLHGIFKLLCSQENDIVRPRV-ITI 305
+V ++ +DYR + + A L ++ L E P + +
Sbjct: 193 EVVRVDGPVDYRLRTLEQAPLW---LYPLDGAAWAALDRLWDALTLGEPQ---PLSTLEV 246
Query: 306 MGRNVTFNKTCGQILEATFDELCSRDLGSSDYLHICQIFHTVIIRNVPQLNIKLRSQSRR 365
R G ++ +F +LC SDYL + + F TV + +VP + + ++RR
Sbjct: 247 AARLARARAAGGDLVWFSFADLCLAPRHPSDYLALAERFSTVFLTDVPPMTRCSQDEARR 306
Query: 366 FITLIDALYDNNIRLVISSDVPLNKLFSN 394
FI L+D LYD +++LV S++ L+ L+
Sbjct: 307 FIALVDELYDRHVKLVASAEAALDDLYRG 335
Score = 28.2 bits (63), Expect = 9.7
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 203 LFTELFQLGVVVVATSNRAPDDLYKNGLQRSNFL 236
L EL+ V +VA++ A DDLY+ G F
Sbjct: 310 LVDELYDRHVKLVASAEAALDDLYRGGRLAFEFQ 343
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 57.0 bits (137), Expect = 1e-08
Identities = 21/30 (70%), Positives = 26/30 (86%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
K EE+ERE+E+E+ERERERERE ER AK+
Sbjct: 595 KAREEREREKEKEKEREREREREAERAAKA 624
Score = 53.5 bits (128), Expect = 1e-07
Identities = 21/30 (70%), Positives = 27/30 (90%), Gaps = 1/30 (3%)
Query: 137 KEEEEKER-EREREREREREREREREREAK 165
+E E+K R ERERE+E+E+EREREREREA+
Sbjct: 590 REAEQKAREEREREKEKEKEREREREREAE 619
Score = 51.6 bits (123), Expect = 6e-07
Identities = 19/30 (63%), Positives = 25/30 (83%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
+EE E+E+E+E+ERERERERE ER +A S
Sbjct: 597 REEREREKEKEKEREREREREAERAAKASS 626
Score = 47.0 bits (111), Expect = 2e-05
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDP 169
+ E EKE+E+ERERERERE ER + + S++
Sbjct: 599 EREREKEKEKEREREREREAERAAKASSSSHES 631
Score = 38.5 bits (89), Expect = 0.006
Identities = 13/26 (50%), Positives = 21/26 (80%)
Query: 138 EEEEKERERERERERERERERERERE 163
E+ ++E E++ ERERE+E+E+ERE
Sbjct: 586 EKAKREAEQKAREEREREKEKEKERE 611
Score = 35.0 bits (80), Expect = 0.082
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 127 PGVPNRLDVMKEEEEKERERERERE-RERE-RERER---EREAKSYDPIPPVAN 175
PG+ N ++E E +ERE RERE RERE RER + E + D + P AN
Sbjct: 706 PGLYNADPTIRERELREREM-REREIRERELRERMKPGFEVKPPELDTLHPSAN 758
Score = 33.9 bits (77), Expect = 0.21
Identities = 14/39 (35%), Positives = 25/39 (64%)
Query: 125 YDPGVPNRLDVMKEEEEKERERERERERERERERERERE 163
+ P ++L +EE ++ +RE E++ ERERE+E+E
Sbjct: 569 FTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKE 607
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 41.1 bits (97), Expect = 5e-05
Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 112 EVKKILARDKAKSYDPGVPNRL----DVMKEEEEKEREREREREREREREREREREAK 165
E KK K DPG+PN ++++E EEK+R++E E+ER +E + EA+
Sbjct: 20 EAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAERAEAR 77
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 36.9 bits (86), Expect = 0.004
Identities = 12/35 (34%), Positives = 21/35 (60%)
Query: 131 NRLDVMKEEEEKEREREREREREREREREREREAK 165
R + + +E+++ E E +R+ E +ERERE K
Sbjct: 91 ERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKK 125
Score = 36.5 bits (85), Expect = 0.005
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 137 KEEEEKEREREREREREREREREREREA 164
++EE+K E E +R+ E +ERERE++
Sbjct: 99 RDEEKKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 35.8 bits (83), Expect = 0.008
Identities = 13/64 (20%), Positives = 23/64 (35%), Gaps = 10/64 (15%)
Query: 112 EVKKILARDKAKSYDPGV--------PNRLDVMKEEEEKER--ERERERERERERERERE 161
E+ + + + + D+ +EE R E E E+E ERE
Sbjct: 40 EIWAMFNKGRKRYAYRDDDDDDDDMEAGGADIQEEERRSARLARLEDREEERLEKEEERE 99
Query: 162 REAK 165
+ A+
Sbjct: 100 KRAR 103
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 34.2 bits (79), Expect = 0.010
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 136 MKEEEEKERERERERERERERERERE 161
+K EEEK E E R+RE +ERE+
Sbjct: 29 LKAEEEKREEEEEARKREERKEREKN 54
Score = 31.5 bits (72), Expect = 0.12
Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 132 RLDVMKEE-EEKEREREREREREREREREREREAKSY 167
+L K+E + +E +RE E E + ER+ + KS+
Sbjct: 21 KLKAKKKELKAEEEKREEEEEARKREERKEREKNKSF 57
Score = 26.9 bits (60), Expect = 4.5
Identities = 12/33 (36%), Positives = 18/33 (54%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDP 169
+EEEE+ R+RE +ERE+ + E D
Sbjct: 36 REEEEEARKREERKEREKNKSFEELLNESELDW 68
>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the AAA
superfamily.
Length = 154
Score = 35.3 bits (81), Expect = 0.020
Identities = 23/154 (14%), Positives = 49/154 (31%), Gaps = 24/154 (15%)
Query: 64 QPKGLYIYGAVGGGKTMLMDIFYESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAK 123
P + + G G GKT L+ + +L K + ++ ++
Sbjct: 23 GPPSVLLTGPSGTGKTSLLR--------------ELLEGLLVAAGKCDQAERNPPYAFSQ 68
Query: 124 SYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDIISKT-- 181
+ + L E E RE + + + ++++
Sbjct: 69 ALRELLRQLLR--------ELAAELLLLREALLAALGAELIEGLQDLVELLERLLARARP 120
Query: 182 WLICFDEFQVTDIADAMILKRLFTELFQLGVVVV 215
++ D+ Q D +L L L +L ++VV
Sbjct: 121 LVLVLDDLQWADEESLDLLAALLRRLERLPLLVV 154
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 36.3 bits (84), Expect = 0.029
Identities = 9/47 (19%), Positives = 27/47 (57%)
Query: 118 ARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREA 164
A+ + + + +++++ E+ER ++ E+ER +E++++A
Sbjct: 74 AKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA 120
Score = 30.9 bits (70), Expect = 1.4
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 138 EEEEKEREREREREREREREREREREAK 165
EE ++++ E+ER ++ E+ER +E K
Sbjct: 90 EELQQKQAAEQERLKQLEKERLAAQEQK 117
Score = 30.5 bits (69), Expect = 1.5
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 138 EEEEKEREREREREREREREREREREAK 165
E+E+ ++ E+ER +E++++ E AK
Sbjct: 98 AEQERLKQLEKERLAAQEQKKQAEEAAK 125
Score = 30.5 bits (69), Expect = 1.8
Identities = 9/27 (33%), Positives = 19/27 (70%)
Query: 137 KEEEEKERERERERERERERERERERE 163
K+ E+ER ++ E+ER +E++++ E
Sbjct: 95 KQAAEQERLKQLEKERLAAQEQKKQAE 121
>gnl|CDD|218693 pfam05687, DUF822, Plant protein of unknown function (DUF822).
This family consists of the N terminal regions of
several plant proteins of unknown function.
Length = 151
Score = 34.8 bits (80), Expect = 0.032
Identities = 8/21 (38%), Positives = 11/21 (52%)
Query: 144 REREREREREREREREREREA 164
+ +ERE + RER R A
Sbjct: 4 GRKPTWKERENNKRRERRRRA 24
Score = 32.1 bits (73), Expect = 0.28
Identities = 7/18 (38%), Positives = 10/18 (55%)
Query: 143 ERERERERERERERERER 160
+ +ERE + RER R
Sbjct: 5 RKPTWKERENNKRRERRR 22
Score = 32.1 bits (73), Expect = 0.29
Identities = 12/27 (44%), Positives = 14/27 (51%), Gaps = 1/27 (3%)
Query: 141 EKEREREREREREREREREREREAKSY 167
K +ERE + RER R R AK Y
Sbjct: 5 RKPTWKERENNKRRERRR-RAIAAKIY 30
Score = 28.2 bits (63), Expect = 5.3
Identities = 8/18 (44%), Positives = 11/18 (61%)
Query: 137 KEEEEKERERERERERER 154
++ KERE + RER R
Sbjct: 5 RKPTWKERENNKRRERRR 22
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 35.1 bits (81), Expect = 0.037
Identities = 13/34 (38%), Positives = 24/34 (70%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDPI 170
+EE E+ERE+++E E +ERE + +E + Y+ +
Sbjct: 45 EEELEEEREKKKEEEERKEREEQARKEQEEYEKL 78
Score = 31.6 bits (72), Expect = 0.58
Identities = 14/35 (40%), Positives = 24/35 (68%), Gaps = 1/35 (2%)
Query: 132 RLDVMKEEE-EKEREREREREREREREREREREAK 165
+L+ +E E ++E E E ERE+++E E +ERE +
Sbjct: 33 KLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67
Score = 30.4 bits (69), Expect = 1.4
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 136 MKEEEEKEREREREREREREREREREREAKS 166
+ EE K+ E +RE ER+ E E E ERE K
Sbjct: 26 EEREERKKLEEKREGERKEEEELEEEREKKK 56
Score = 30.1 bits (68), Expect = 1.6
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
K EE++E ER+ E E E ERE+++E E +
Sbjct: 33 KLEEKREGERKEEEELEEEREKKKEEEERK 62
Score = 28.9 bits (65), Expect = 4.1
Identities = 12/29 (41%), Positives = 21/29 (72%)
Query: 136 MKEEEEKEREREREREREREREREREREA 164
+++ E++RE ER+ E E E ERE+++E
Sbjct: 30 ERKKLEEKREGERKEEEELEEEREKKKEE 58
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 35.7 bits (83), Expect = 0.037
Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 7/80 (8%)
Query: 90 ETKQKQRVH---FNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERER 146
E Q++R + ML+ A+ E+++ A + + +R ++ ++ EEKE R
Sbjct: 248 ERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKR---LEHRRELEQQIEEKEERR 304
Query: 147 ERERERER-EREREREREAK 165
ERE E E ER RE EA+
Sbjct: 305 AAEREEELEEGERLREEEAE 324
Score = 33.3 bits (77), Expect = 0.21
Identities = 16/31 (51%), Positives = 22/31 (70%), Gaps = 1/31 (3%)
Query: 133 LDVMKEEEEKERERERER-ERERERERERER 162
L+ +E+ E+E ERE ER ER+ E+ERE R
Sbjct: 156 LEYQREKAEREEEREAERRERKEEKEREVAR 186
Score = 31.8 bits (73), Expect = 0.64
Identities = 14/37 (37%), Positives = 17/37 (45%), Gaps = 2/37 (5%)
Query: 131 NRLDVMKEEEEKERERERERERERERER--EREREAK 165
E EE+ E ER RE E ER+ E ER+
Sbjct: 300 KEERRAAEREEELEEGERLREEEAERQARIEEERQRL 336
Score = 31.8 bits (73), Expect = 0.65
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 10/39 (25%)
Query: 136 MKEEEEK---------EREREREREREREREREREREAK 165
+EEE K ERE ERE ER RER+ E+ERE
Sbjct: 148 EREEELKILEYQREKAEREEEREAER-RERKEEKEREVA 185
Score = 31.8 bits (73), Expect = 0.72
Identities = 11/35 (31%), Positives = 17/35 (48%)
Query: 131 NRLDVMKEEEEKEREREREREREREREREREREAK 165
+ + +EE + E ER + E ERER+ K
Sbjct: 32 IKAEEKEEERRIDEMMEEERLKALAEEEERERKRK 66
Score = 30.6 bits (70), Expect = 1.4
Identities = 12/30 (40%), Positives = 19/30 (63%)
Query: 136 MKEEEEKEREREREREREREREREREREAK 165
+EEE + RER+ E+ERE R R ++ +
Sbjct: 164 EREEEREAERRERKEEKEREVARLRAQQEE 193
Score = 30.6 bits (70), Expect = 1.7
Identities = 9/34 (26%), Positives = 14/34 (41%), Gaps = 5/34 (14%)
Query: 136 MKEEEEKERERERERER-----EREREREREREA 164
E E++ E+ERE R E + E +
Sbjct: 170 EAERRERKEEKEREVARLRAQQEEAEDEREELDE 203
Score = 29.9 bits (68), Expect = 2.6
Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 17/50 (34%)
Query: 132 RLDVMKEEEEKERER----------------ERERERERERE-REREREA 164
RL + EEEE+ER+R ERE+ R+ E E R +ERE
Sbjct: 51 RLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQ 100
Score = 29.1 bits (66), Expect = 4.3
Identities = 7/29 (24%), Positives = 14/29 (48%)
Query: 132 RLDVMKEEEEKERERERERERERERERER 160
+ ++ + EE+ E+E + ER E
Sbjct: 232 KQELQRAREEQIEEKEERLQEERAEEEAE 260
Score = 28.7 bits (65), Expect = 6.0
Identities = 8/24 (33%), Positives = 18/24 (75%)
Query: 138 EEEEKERERERERERERERERERE 161
+EE + +ER++E+E +R R+++
Sbjct: 210 QEEYERKERQKEKEEAEKRRRQKQ 233
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 35.7 bits (82), Expect = 0.048
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYD 168
+EEEE+E E E E E E E E E E S +
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897
Score = 35.4 bits (81), Expect = 0.061
Identities = 16/30 (53%), Positives = 18/30 (60%)
Query: 134 DVMKEEEEKERERERERERERERERERERE 163
D +EEEE+E E E E E E E E E E E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 34.6 bits (79), Expect = 0.11
Identities = 14/33 (42%), Positives = 20/33 (60%)
Query: 138 EEEEKEREREREREREREREREREREAKSYDPI 170
+ EE+E E E E E E E E E E E ++ +P+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 35.6 bits (82), Expect = 0.052
Identities = 10/36 (27%), Positives = 14/36 (38%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPP 172
R ++ RER R R R R+R + D
Sbjct: 8 GRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYY 43
Score = 31.4 bits (71), Expect = 0.96
Identities = 9/28 (32%), Positives = 12/28 (42%)
Query: 139 EEEKEREREREREREREREREREREAKS 166
E + R R ++ RER R R R
Sbjct: 6 ERGRLRNDTRRSDKGRERSRRRSRSRDR 33
Score = 30.2 bits (68), Expect = 2.3
Identities = 11/31 (35%), Positives = 13/31 (41%)
Query: 139 EEEKEREREREREREREREREREREAKSYDP 169
+K RER R R R R+R R R
Sbjct: 16 RSDKGRERSRRRSRSRDRSRRRRDRDYYRGR 46
Score = 29.9 bits (67), Expect = 3.1
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 142 KEREREREREREREREREREREAK 165
+ R+RER R R R ++ RE
Sbjct: 1 RYRDRERGRLRNDTRRSDKGRERS 24
Score = 28.7 bits (64), Expect = 6.9
Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
Query: 132 RLDVMKEEEEKERERERERERERERERERERE 163
R D + ++ +ER R R R R+R R R R+R+
Sbjct: 11 RNDTRRSDKGRERSRRRSRSRDRSR-RRRDRD 41
Score = 28.7 bits (64), Expect = 7.2
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 146 REREREREREREREREREAKS 166
R R+RER R R R +
Sbjct: 1 RYRDRERGRLRNDTRRSDKGR 21
Score = 28.3 bits (63), Expect = 9.6
Identities = 12/49 (24%), Positives = 18/49 (36%), Gaps = 1/49 (2%)
Query: 119 RDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKSY 167
R+ + D G + + R R+R+ R R R R R Y
Sbjct: 11 RNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRG-RSRSRSPNRY 58
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 34.1 bits (78), Expect = 0.061
Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 3/56 (5%)
Query: 117 LARDKAKSYDPGVPNRLDVMKEEEEKERERERERER---EREREREREREAKSYDP 169
+ R +++S ++ E+ RER R RER R R R R +S P
Sbjct: 1 MGRSRSRSPRRSRRRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSP 56
Score = 29.5 bits (66), Expect = 2.4
Identities = 12/42 (28%), Positives = 19/42 (45%)
Query: 139 EEEKEREREREREREREREREREREAKSYDPIPPVANDIISK 180
R R R R R R+R+RER+ + +P +I +
Sbjct: 53 SRSPRRHRSRSRSPSRRRDRKRERDKDAREPKKRERQKLIKE 94
>gnl|CDD|236485 PRK09368, PRK09368, gas vesicle synthesis-like protein; Reviewed.
Length = 140
Score = 33.5 bits (77), Expect = 0.067
Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 110 IHEVKKILARDKAKSYDPGVPNRL-DVMKE--EEEKERERERERERERERERERERE 163
+ EV + AR K+K G + + + ++E ER RER R R+ R R RE
Sbjct: 84 VGEVTESGARGKSKGALTGAAETASEALGQGRGSDEEEERRRERPRPRKAPRRRRRE 140
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 35.3 bits (81), Expect = 0.069
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 138 EEEEKERERERERERERERERERERE 163
EE ++ERE+ R R+R+R ER R R
Sbjct: 3 EEPDREREKSRGRDRDRSSERPRRRS 28
Score = 34.5 bits (79), Expect = 0.10
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 130 PNRLDVMKEEEEKEREREREREREREREREREREAKSYD 168
P+R +++R ER R R R+R R R+R +S +
Sbjct: 5 PDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRE 43
Score = 33.7 bits (77), Expect = 0.18
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 139 EEEKERERERERERERERERERERE 163
+EE +RERE+ R R+R+R ER R
Sbjct: 2 DEEPDREREKSRGRDRDRSSERPRR 26
Score = 32.9 bits (75), Expect = 0.35
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 139 EEEKEREREREREREREREREREREAKSY 167
+ ER R R R+R R R+R R +SY
Sbjct: 18 DRSSERPRRRSRDRSRFRDRHRRSRERSY 46
Score = 31.0 bits (70), Expect = 1.2
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 141 EKEREREREREREREREREREREAK 165
++E +RERE+ R R+R+R ER +
Sbjct: 2 DEEPDREREKSRGRDRDRSSERPRR 26
Score = 29.9 bits (67), Expect = 3.2
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 142 KERERERERERERERERERERE 163
++ E +RERE+ R R+R+R E
Sbjct: 1 RDEEPDREREKSRGRDRDRSSE 22
Score = 29.1 bits (65), Expect = 5.1
Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 10/39 (25%)
Query: 140 EEKERERERERERERERER----------EREREAKSYD 168
E+ R R R+R R R+R R R R+ + YD
Sbjct: 21 SERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYD 59
>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved region
at the N-terminal end of a family of proteins referred
to as CIRs (CBF1-interacting co-repressors). CBF1
(centromere-binding factor 1) acts as a transcription
factor that causes repression by binding specifically to
GTGGGAA motifs in responsive promoters, and it requires
CIR as a co-repressor. CIR binds to histone deacetylase
and to SAP30 and serves as a linker between CBF1 and the
histone deacetylase complex.
Length = 37
Score = 31.0 bits (71), Expect = 0.074
Identities = 8/29 (27%), Positives = 16/29 (55%)
Query: 131 NRLDVMKEEEEKERERERERERERERERE 159
N+ V + E++ E+++ E +E E E
Sbjct: 9 NQEKVWEAEQKALEEQKKIEELRKEIEEE 37
Score = 26.8 bits (60), Expect = 2.1
Identities = 7/23 (30%), Positives = 13/23 (56%)
Query: 139 EEEKERERERERERERERERERE 161
E E++ E+++ E +E E E
Sbjct: 15 EAEQKALEEQKKIEELRKEIEEE 37
>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved region
at the N-terminal end of a family of proteins referred
to as CIRs (CBF1-interacting co-repressors). CBF1
(centromere-binding factor 1) acts as a transcription
factor that causes repression by binding specifically to
GTGGGAA motifs in responsive promoters, and it requires
CIR as a co-repressor. CIR binds to histone deacetylase
and to SAP30 and serves as a linker between CBF1 and the
histone deacetylase complex.
Length = 37
Score = 31.0 bits (71), Expect = 0.076
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 131 NRLDVMKEEEEKERERERERERERERERE 159
N+ V K E++ E E+++ ER +E E+E
Sbjct: 9 NQKRVWKAEQKAEEEKKKIEERRKEIEKE 37
Score = 27.1 bits (61), Expect = 2.0
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 138 EEEEKERERERERERERERERERE 161
+ E++ E E+++ ER +E E+E
Sbjct: 14 WKAEQKAEEEKKKIEERRKEIEKE 37
Score = 26.3 bits (59), Expect = 3.1
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 140 EEKERERERERERERERERERERE 163
+ E++ E E+++ ER +E E+E
Sbjct: 14 WKAEQKAEEEKKKIEERRKEIEKE 37
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 34.9 bits (79), Expect = 0.081
Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 7/82 (8%)
Query: 86 YESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERE 145
Y K K V NK L H IH+VK +K K +K+E+ + +
Sbjct: 120 YRELYVKAKGNVFRNKRNLMEH--IHKVKN----EKKKERQLAEQLAAKRLKDEQHRHKA 173
Query: 146 RERE-REREREREREREREAKS 166
R++E R+RE++RER R +A +
Sbjct: 174 RKQELRKREKDRERARREDAAA 195
>gnl|CDD|205235 pfam13054, DUF3915, Protein of unknown function (DUF3915). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 120 amino acids in length.
Length = 116
Score = 32.6 bits (74), Expect = 0.13
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 144 REREREREREREREREREREAKSYDP 169
R+ + ERERERERE + P
Sbjct: 12 RDCHHHEREDFEREREREREKEPQRP 37
Score = 30.2 bits (68), Expect = 0.71
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 142 KEREREREREREREREREREREAK 165
++ + ERERERERE+E +
Sbjct: 12 RDCHHHEREDFEREREREREKEPQ 35
Score = 29.5 bits (66), Expect = 1.4
Identities = 13/19 (68%), Positives = 16/19 (84%)
Query: 138 EEEEKERERERERERERER 156
E E+ ERERERERE+E +R
Sbjct: 18 EREDFEREREREREKEPQR 36
Score = 29.1 bits (65), Expect = 2.0
Identities = 12/19 (63%), Positives = 16/19 (84%)
Query: 140 EEKERERERERERERERER 158
E ++ ERERERERE+E +R
Sbjct: 18 EREDFEREREREREKEPQR 36
Score = 28.3 bits (63), Expect = 3.6
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 141 EKERERERERERERERERERERE 163
+ + ERERERERE+E +
Sbjct: 13 DCHHHEREDFEREREREREKEPQ 35
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 34.1 bits (78), Expect = 0.14
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 138 EEEEKEREREREREREREREREREREAKSYDP 169
+E+EK+ E R+RE E++RER R + P
Sbjct: 148 KEKEKKVEEPRDREEEKKRERVRAKSRPKKPP 179
Score = 33.3 bits (76), Expect = 0.29
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
++E++ E R+RE E++RER R + R K
Sbjct: 149 EKEKKVEEPRDREEEKKRERVRAKSRPKKP 178
Score = 32.9 bits (75), Expect = 0.34
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
+E+EKE+E++ E R+RE E++RER
Sbjct: 143 PKEKEKEKEKKVEEPRDREEEKKRERVRAK 172
Score = 32.6 bits (74), Expect = 0.45
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
++E+EKE++ E R+RE E++RER R
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSR 174
Score = 32.6 bits (74), Expect = 0.46
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDP 169
+++ E+ R+RE E++RER R + R ++ P
Sbjct: 151 EKKVEEPRDREEEKKRERVRAKSRPKKPPKKKP 183
Score = 32.2 bits (73), Expect = 0.62
Identities = 12/30 (40%), Positives = 21/30 (70%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
+ +EKE+E+E++ E R+RE E++RE
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVR 170
Score = 31.0 bits (70), Expect = 1.2
Identities = 15/60 (25%), Positives = 30/60 (50%)
Query: 107 HAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKS 166
+ +VK+ + K K + + +E+ +E+E+E+E++ E R+RE E K
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166
Score = 30.2 bits (68), Expect = 2.3
Identities = 9/36 (25%), Positives = 26/36 (72%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPP 172
KE+E+ + E+++++E+ +E ++R+ + ++ + PP
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143
Score = 30.2 bits (68), Expect = 2.5
Identities = 10/40 (25%), Positives = 25/40 (62%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPPVAND 176
KE+ ++E+++++E+ +E ++R+ + EAK P +
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKE 149
Score = 29.1 bits (65), Expect = 5.0
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 137 KEEEEKEREREREREREREREREREREAK 165
EEEK+RER R + R ++ +++ K
Sbjct: 159 DREEEKKRERVRAKSRPKKPPKKKPPNKK 187
Score = 28.3 bits (63), Expect = 8.3
Identities = 9/48 (18%), Positives = 25/48 (52%)
Query: 118 ARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAK 165
A++ + +KEE++K++E+ +E ++R+ + E + +
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 33.2 bits (77), Expect = 0.18
Identities = 12/35 (34%), Positives = 16/35 (45%), Gaps = 3/35 (8%)
Query: 136 MKEEEEKERERERERERERERERERE---REAKSY 167
K+ E+ER RE E+ E RE EA+
Sbjct: 84 SKKGTEEERRERREENLEKALELLREGNRSEAREC 118
>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family. This is a family of
sequences found in both bacteria and bacteriophages.
This region is approximately 130 residues long and in
some cases is found as part of the PVL (Panton-Valentine
leukocidin) group of genes, which encode a member of the
leukocidin group of bacterial toxins that kill
leukocytes by creation of pores in the cell membrane.
PVL appears to be a virulence factor associated with a
number of human diseases.
Length = 118
Score = 32.1 bits (73), Expect = 0.19
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 138 EEEEKERERERERERERERERER 160
E+E KERE ER+R +E E R++
Sbjct: 64 EQERKERELERKRRKEAELRRKK 86
Score = 30.2 bits (68), Expect = 0.88
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 136 MKEEEEKERERERERERERERERERER 162
+ E++ +ERE ER+R +E E R++
Sbjct: 60 SENIEQERKERELERKRRKEAELRRKK 86
Score = 29.8 bits (67), Expect = 1.1
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 134 DVMKEEEEKERERERERERERERERERER 162
++ K E ++ +ERE ER+R +E E R
Sbjct: 56 EIKKSENIEQERKERELERKRRKEAELRR 84
Score = 29.0 bits (65), Expect = 2.3
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 132 RLDVMKEEEEKEREREREREREREREREREREAKSYDP 169
RL +E ++ E + +ERE ER+R +E E + P
Sbjct: 50 RLKEYREIKKSENIEQERKERELERKRRKEAELRRKKP 87
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 33.7 bits (77), Expect = 0.21
Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 6/41 (14%)
Query: 134 DVMKEEEEKERER------EREREREREREREREREAKSYD 168
D + EE+ +E E+ ERER+ E +R RE E+ A D
Sbjct: 248 DFLLEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEAD 288
Score = 32.5 bits (74), Expect = 0.55
Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 11/71 (15%)
Query: 100 NKFMLDVHAKI--HEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERERERER- 156
F +++ + H + L +K + + +L KEE E+ER+ E +R RE E+
Sbjct: 231 KDFCVEIEGGMDEHSFEDFLLEEKRRELE-----KLA--KEEAERERQAEEQRRREEEKA 283
Query: 157 ERERER-EAKS 166
E +R +AK+
Sbjct: 284 AMEADRAQAKA 294
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 30.8 bits (70), Expect = 0.37
Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 1/35 (2%)
Query: 132 RLDVMKEEEEKERERERERERER-EREREREREAK 165
R +K+E+E ER+R + +ER + E+ER K
Sbjct: 46 REKELKDEKEAERQRRIQAIKERRAAKEEKERYEK 80
Score = 29.7 bits (67), Expect = 0.95
Identities = 9/29 (31%), Positives = 19/29 (65%)
Query: 137 KEEEEKEREREREREREREREREREREAK 165
+ E+ E++ + RE+E + E+E ER+ +
Sbjct: 33 RMEKRLEQQAIKAREKELKDEKEAERQRR 61
Score = 27.4 bits (61), Expect = 5.8
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 139 EEEKEREREREREREREREREREREAK 165
EKE + E+E ER+R + +ER A
Sbjct: 45 AREKELKDEKEAERQRRIQAIKERRAA 71
>gnl|CDD|178307 PLN02705, PLN02705, beta-amylase.
Length = 681
Score = 32.6 bits (74), Expect = 0.41
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 13/48 (27%)
Query: 145 EREREREREREREREREREA---------KSYD--PIPPVA--NDIIS 179
+RERE+E+ER + RER R A + Y P+P A ND+++
Sbjct: 78 KREREKEKERTKLRERHRRAITSRMLAGLRQYGNFPLPARADMNDVLA 125
Score = 31.1 bits (70), Expect = 1.5
Identities = 11/18 (61%), Positives = 15/18 (83%)
Query: 143 ERERERERERERERERER 160
+RERE+E+ER + RER R
Sbjct: 78 KREREKEKERTKLRERHR 95
Score = 29.9 bits (67), Expect = 3.3
Identities = 10/18 (55%), Positives = 15/18 (83%)
Query: 141 EKERERERERERERERER 158
++ERE+E+ER + RER R
Sbjct: 78 KREREKEKERTKLRERHR 95
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 31.0 bits (70), Expect = 0.41
Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 10/64 (15%)
Query: 112 EVKKILARDKAKSYDPGV----------PNRLDVMKEEEEKERERERERERERERERERE 161
E+ I + + +SY +++ KEE R E ERE RE E E
Sbjct: 48 EIWAIFGKGRKRSYYDRYDEDDALDNMEATFMEIQKEERRSARMARLEDERELAREEEEE 107
Query: 162 REAK 165
+ K
Sbjct: 108 KRKK 111
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 31.0 bits (70), Expect = 0.47
Identities = 12/38 (31%), Positives = 18/38 (47%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPPVA 174
+++ E+ERER R E + Y P PPV+
Sbjct: 29 RKKAERERERYRHDHDAYSDSYEEPYDPTPYPPSPPVS 66
Score = 29.5 bits (66), Expect = 1.8
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 1/37 (2%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPPV 173
K + +ER++ ERERER R +P P
Sbjct: 22 KYSQRRERKKA-ERERERYRHDHDAYSDSYEEPYDPT 57
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 30.8 bits (70), Expect = 0.60
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 105 DVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERERE 161
+ A E K+ ++ ++ P L ++ E ++RERE ER+ ++ERE+E
Sbjct: 26 SIKAAKEEKKQAKLSERKENRKPKYIGSL--LEAAERRKREREIAEERKLQKEREKE 80
Score = 30.5 bits (69), Expect = 0.64
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 142 KEREREREREREREREREREREAKSYD 168
+ R+RERE ER+ ++ERE+E +
Sbjct: 59 ERRKREREIAEERKLQKEREKEGDEFA 85
Score = 30.1 bits (68), Expect = 0.91
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 141 EKEREREREREREREREREREREA 164
E R+RERE ER+ ++ERE
Sbjct: 56 EAAERRKREREIAEERKLQKEREK 79
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 32.3 bits (74), Expect = 0.62
Identities = 11/43 (25%), Positives = 19/43 (44%), Gaps = 1/43 (2%)
Query: 134 DVMKEEEEKEREREREREREREREREREREAKSYDPIPPVAND 176
K + E+++ RER+R R E+ R+A+ V
Sbjct: 661 VTEKARTQDEQQQAPRRERQRRRNDEK-RQAQQEAKALNVEEQ 702
Score = 30.0 bits (68), Expect = 3.3
Identities = 12/63 (19%), Positives = 26/63 (41%), Gaps = 5/63 (7%)
Query: 119 RDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERERER-----EAKSYDPIPPV 173
R + ++ + E+ + E+++ RER+R R E+ EAK+ +
Sbjct: 644 RQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQS 703
Query: 174 AND 176
+
Sbjct: 704 VQE 706
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 28.5 bits (64), Expect = 0.73
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 138 EEEEKERERERERERERERERERER 162
EEE++E E E E E E E + E +R
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
Score = 27.7 bits (62), Expect = 1.6
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 137 KEEEEKERERERERERERERER 158
K E+E+E E E E E + E +R
Sbjct: 25 KREDEEENEDEEEGEEQSEVKR 46
Score = 26.5 bits (59), Expect = 3.3
Identities = 14/27 (51%), Positives = 15/27 (55%), Gaps = 3/27 (11%)
Query: 139 EEEKEREREREREREREREREREREAK 165
EEEK RE E E E E E E + E K
Sbjct: 22 EEEK---REDEEENEDEEEGEEQSEVK 45
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 29.6 bits (67), Expect = 0.75
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 138 EEEEKEREREREREREREREREREREAK 165
E+ +E ERER+ ER+ +R+R E +
Sbjct: 53 EKAREETERERKEREERKEKRKRAIEER 80
Score = 29.2 bits (66), Expect = 0.97
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 133 LDVMKEEEEKEREREREREREREREREREREA 164
L+ +EE E+ER+ ER+ +R+R E R+
Sbjct: 52 LEKAREETERERKEREERKEKRKRAIEERRKK 83
Score = 29.2 bits (66), Expect = 1.1
Identities = 12/31 (38%), Positives = 19/31 (61%), Gaps = 1/31 (3%)
Query: 136 MKEEEEKERERER-EREREREREREREREAK 165
++ EE E+ RE ERER+ ER+ +R+
Sbjct: 46 KEQMEELEKAREETERERKEREERKEKRKRA 76
Score = 27.2 bits (61), Expect = 5.5
Identities = 14/33 (42%), Positives = 21/33 (63%), Gaps = 4/33 (12%)
Query: 137 KEEEEKERE-RERERER---EREREREREREAK 165
K+EEE++ + E E+ R ERER+ ER+ K
Sbjct: 40 KDEEERKEQMEELEKAREETERERKEREERKEK 72
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.3 bits (71), Expect = 0.85
Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 1/75 (1%)
Query: 103 MLDVHAKIHEVKKI-LARDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERERE 161
++D A + +I + A + +L+ EE EK+R E+ R++E E+ E
Sbjct: 42 LVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAE 101
Query: 162 REAKSYDPIPPVAND 176
+ AK + A +
Sbjct: 102 KAAKQAEQAAKQAEE 116
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 31.6 bits (71), Expect = 0.92
Identities = 15/53 (28%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 127 PGVPNRLDVMK-EEEEKERER-EREREREREREREREREAKSYDPIPPVANDI 177
PG+P +K EEE + R+R E+++ +++RE+E+ + + DI
Sbjct: 73 PGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDI 125
Score = 29.6 bits (66), Expect = 3.9
Identities = 12/68 (17%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 104 LDVHAKIHEVKKILARDKAKSYDPGVPNRLDV-MKEEEEKERERERERERERERERERER 162
+D+ + + +K+ + + P + DV E++ K+ +++ ++E+E+ER++++++
Sbjct: 161 IDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKK 220
Query: 163 EAKSYDPI 170
E + + +
Sbjct: 221 EVEGFKSL 228
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 30.9 bits (70), Expect = 1.0
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 137 KEEEEKEREREREREREREREREREREAK 165
K ++ E ++++ERE+ERE R+R AK
Sbjct: 103 KLQKLLEEKQKQEREKEREEAELRQRLAK 131
Score = 30.9 bits (70), Expect = 1.2
Identities = 7/27 (25%), Positives = 21/27 (77%)
Query: 137 KEEEEKERERERERERERERERERERE 163
++++++RE ER ++R++++E E ++
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEERKQ 218
Score = 29.7 bits (67), Expect = 3.0
Identities = 8/27 (29%), Positives = 22/27 (81%)
Query: 136 MKEEEEKERERERERERERERERERER 162
+K+ ++++++RE ER ++R++++E E
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEE 215
Score = 29.3 bits (66), Expect = 3.8
Identities = 6/30 (20%), Positives = 23/30 (76%)
Query: 138 EEEEKEREREREREREREREREREREAKSY 167
+++++E ER ++R++++E E +++ +++
Sbjct: 195 QQQKREEERRKQRKKQQEEEERKQKAEEAW 224
Score = 29.3 bits (66), Expect = 4.0
Identities = 8/30 (26%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 137 KEEEEKER-EREREREREREREREREREAK 165
K+ + +++ ++ E ++++ERE+ERE
Sbjct: 96 KQAQRQKKLQKLLEEKQKQEREKEREEAEL 125
Score = 28.9 bits (65), Expect = 4.2
Identities = 8/25 (32%), Positives = 21/25 (84%)
Query: 139 EEEKERERERERERERERERERERE 163
E +K ++++++RE ER ++R++++E
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQE 212
Score = 28.5 bits (64), Expect = 7.2
Identities = 7/28 (25%), Positives = 22/28 (78%)
Query: 138 EEEEKEREREREREREREREREREREAK 165
+E E ++ ++++++RE ER ++R+++ +
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQE 212
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 31.2 bits (71), Expect = 1.0
Identities = 13/44 (29%), Positives = 16/44 (36%), Gaps = 2/44 (4%)
Query: 138 EEEEKERERE--REREREREREREREREAKSYDPIPPVANDIIS 179
K RE E R R E +R ++Y P P I S
Sbjct: 245 AAAAKAREAAAAAEAAAARARAAEAKRTGETYKPTAPEKMLISS 288
>gnl|CDD|234953 PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed.
Length = 476
Score = 31.2 bits (72), Expect = 1.1
Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 10/73 (13%)
Query: 104 LDVHAKIHEVKKILARDKA-KSYDPGVPNRLDVMKEEEEKERERER--ERERERERERER 160
LD++ + +++L KA Y P L+ M+EE+ E R R R+ +E
Sbjct: 88 LDIYKEY--AEQLLEEGKAYYCYCT--PEELEAMREEQRAAGEPPRYDGRCRDLTKEEVA 143
Query: 161 EREAKSYDPIPPV 173
R A PPV
Sbjct: 144 ARLAAG---EPPV 153
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 31.2 bits (71), Expect = 1.1
Identities = 10/44 (22%), Positives = 20/44 (45%), Gaps = 3/44 (6%)
Query: 138 EEEEKEREREREREREREREREREREAKSYD---PIPPVANDII 178
EE+ +R+ + E + + E R ++ P P V + I+
Sbjct: 165 EEDAADRDARKRAAEEAKEQEELRRRSQVIQRNLPRPSVLDLIV 208
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 30.8 bits (70), Expect = 1.1
Identities = 15/37 (40%), Positives = 24/37 (64%), Gaps = 1/37 (2%)
Query: 133 LDVMKEEEEKEREREREREREREREREREREAKSYDP 169
L+ +K+E +E+ERE E E+ E E+ RE E + +P
Sbjct: 151 LEKIKKERAEEKERE-EEEKAAEEEKAREEEILTGNP 186
Score = 29.3 bits (66), Expect = 3.2
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 136 MKEEEEKEREREREREREREREREREREAK 165
++E E+ ++ER E+ERE E + E +A+
Sbjct: 148 LRELEKIKKERAEEKEREEEEKAAEEEKAR 177
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 30.0 bits (68), Expect = 1.1
Identities = 13/36 (36%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 131 NRLDVMKEEEEKERERER-EREREREREREREREAK 165
RL++M E+ +KE E E +++RE ++ ++ E+ AK
Sbjct: 50 ERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAK 85
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 31.1 bits (71), Expect = 1.2
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 62 VKQPKGLYIYGAVGGGKTML 81
+ PKG+ +YG G GKT+L
Sbjct: 182 IDPPKGVLLYGPPGTGKTLL 201
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 30.8 bits (70), Expect = 1.2
Identities = 12/30 (40%), Positives = 15/30 (50%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
+EEEEK E ERER + R + K
Sbjct: 124 REEEEKRDEEERERLLRAAKSRSEQSRLKQ 153
>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
hydrophilic C-term. This domain is a hydrophilic region
found at the C-terminus of plant and metazoan
pre-mRNA-splicing factor 38 proteins. The function is
not known.
Length = 97
Score = 29.4 bits (66), Expect = 1.2
Identities = 13/58 (22%), Positives = 21/58 (36%), Gaps = 2/58 (3%)
Query: 112 EVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDP 169
E + + R + D G R + R + + R R R+R+R R D
Sbjct: 20 EDDEEIRRKAERDVDRG--RRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDR 75
Score = 28.2 bits (63), Expect = 2.8
Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 136 MKEEEEKERERERERERERERERE-REREAKSYDP 169
++ K R R R+R+R R R+R+ R+R+
Sbjct: 50 RRKRSRKRRRRRRDRDRARYRDRDDRDRDRYDRSR 84
Score = 27.8 bits (62), Expect = 3.9
Identities = 10/28 (35%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Query: 137 KEEEEKERER-ERERERERERERERERE 163
++ ++++R+R +R R R R R R+R R
Sbjct: 70 RDRDDRDRDRYDRSRSRSRSRSRDRRRR 97
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 31.2 bits (71), Expect = 1.3
Identities = 15/34 (44%), Positives = 22/34 (64%), Gaps = 7/34 (20%)
Query: 143 ERE-RER-ERE-REREREREREREAK----SYDP 169
ERE + R ERE RE R R+++++AK +Y P
Sbjct: 253 EREAKARAEREAREAARVRDKQKQAKRKGSTYKP 286
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 29.9 bits (67), Expect = 1.4
Identities = 13/56 (23%), Positives = 27/56 (48%), Gaps = 4/56 (7%)
Query: 112 EVKKILARDKAKSYDPGV----PNRLDVMKEEEEKERERERERERERERERERERE 163
+VK +L + + + + ++E EE+ + + R+ E ERE R+ R+
Sbjct: 15 DVKGMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRD 70
>gnl|CDD|213395 cd12152, F1-ATPase_delta, mitochondrial ATP synthase delta subunit.
The F-ATPase is found in bacterial plasma membranes,
mitochondrial inner membranes and in chloroplast
thylakoid membranes. It has also been found in the
archaea Methanosarcina barkeri. It uses a proton
gradient to drive ATP synthesis and hydrolyzes ATP to
build the proton gradient. The extrinisic membrane
domain, F1, is composed of alpha, beta, gamma, delta,
and epsilon subunits with a stoichiometry of 3:3:1:1:1.
Alpha and beta subunit form the globular catalytic
moiety, a hexameric ring of alternating subunits. Gamma,
delta and epsilon subunits form a stalk, connecting F1
to F0, the integral membrane proton translocating
domain. In bacteria, which is lacking a eukaryotic
epsilon subunit homolog, this subunit is called the
epsilon subunit.
Length = 123
Score = 29.4 bits (67), Expect = 1.7
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 133 LDVMKEEEEKEREREREREREREREREREREA 164
+DV + EE ER ER + + ERE+ R A
Sbjct: 86 IDVERAEEALERAEERLAQAKDEREKARAEAA 117
>gnl|CDD|206316 pfam14147, Spore_YhaL, Sporulation protein YhaL. This family of
proteins is involved in sporulation. In B. subtilis its
expression is regulated by the early
mother-cell-specific transcription factor sigma-E.
Length = 52
Score = 27.7 bits (62), Expect = 1.7
Identities = 17/29 (58%), Positives = 19/29 (65%), Gaps = 2/29 (6%)
Query: 136 MKEEEEKERER-ERERERERER-ERERER 162
KEE EKE+E E+E E ER E ERER
Sbjct: 23 AKEEREKEQEFIEKEGEVYMERLEEERER 51
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 29.2 bits (66), Expect = 2.0
Identities = 14/59 (23%), Positives = 26/59 (44%)
Query: 107 HAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAK 165
+AK+ + K K PG R+ + E + E+E+ + RE++ +R K
Sbjct: 68 NAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQK 126
>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
Length = 188
Score = 29.6 bits (67), Expect = 2.0
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 138 EEEEKEREREREREREREREREREREAKS 166
E + +E +RE E ERE E+ RE+E S
Sbjct: 37 EADAEEILEDREAEAEREIEQLREQELSS 65
>gnl|CDD|237897 PRK15067, PRK15067, ethanolamine ammonia lyase large subunit;
Provisional.
Length = 461
Score = 30.2 bits (69), Expect = 2.1
Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
Query: 384 SDVPLNKLFSNEAVIDTHSDE-HRMLMDDLNIKA 416
+D+PL K F N VI DE R+++DD + A
Sbjct: 52 ADLPL-KDFLNNPVIPYEEDEVTRLIIDDHDAAA 84
>gnl|CDD|180935 PRK07333, PRK07333, 2-octaprenyl-6-methoxyphenyl hydroxylase;
Provisional.
Length = 403
Score = 30.3 bits (69), Expect = 2.1
Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 11/44 (25%)
Query: 354 QLNIKLRSQS-RRFITLIDALYDNNIRLVISSDVPLNKLFSNEA 396
L++ R Q RRF T+ R+ +++DV LN+LFSN++
Sbjct: 326 SLDVLERYQRWRRFDTV---------RMGVTTDV-LNRLFSNDS 359
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 30.3 bits (69), Expect = 2.2
Identities = 11/47 (23%), Positives = 19/47 (40%)
Query: 118 ARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREA 164
K+ + G + + E+ E ER+ R R +R+ EA
Sbjct: 139 RGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEA 185
Score = 29.1 bits (66), Expect = 6.1
Identities = 9/33 (27%), Positives = 15/33 (45%)
Query: 131 NRLDVMKEEEEKERERERERERERERERERERE 163
+R D ++ E R R R+R+R R +
Sbjct: 246 DRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGD 278
Score = 29.1 bits (66), Expect = 6.3
Identities = 8/31 (25%), Positives = 12/31 (38%)
Query: 138 EEEEKEREREREREREREREREREREAKSYD 168
E + +R+ E ER RE + D
Sbjct: 171 ERRRRGDREDRQAEAERGERGRREERGRDGD 201
Score = 28.3 bits (64), Expect = 9.7
Identities = 12/47 (25%), Positives = 16/47 (34%)
Query: 118 ARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREA 164
A KA E +E ER+ R R +R+ E E
Sbjct: 141 AARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAER 187
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 29.1 bits (65), Expect = 2.3
Identities = 20/81 (24%), Positives = 38/81 (46%), Gaps = 9/81 (11%)
Query: 100 NKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERE 159
K LD +I + +K R+K + KE E+ E +++++ERER R++
Sbjct: 70 KKKKLDEKKEIAKQRKREQREKELAK---------RQKELEKIELSKKKQKERERRRKKL 120
Query: 160 REREAKSYDPIPPVANDIISK 180
++ + P ND++ K
Sbjct: 121 TKKTKSGQPLMGPRINDLLDK 141
>gnl|CDD|224107 COG1186, PrfB, Protein chain release factor B [Translation,
ribosomal structure and biogenesis].
Length = 239
Score = 29.9 bits (68), Expect = 2.5
Identities = 7/33 (21%), Positives = 17/33 (51%)
Query: 135 VMKEEEEKEREREREREREREREREREREAKSY 167
+ +++R +E+ RER ++ + +SY
Sbjct: 167 KLYILAQEKRSQEKNRERALKKLIGWGNQIRSY 199
>gnl|CDD|150737 pfam10097, DUF2335, Predicted membrane protein (DUF2335). Members
of this family of hypothetical bacterial proteins have
no known function.
Length = 50
Score = 26.9 bits (60), Expect = 2.7
Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 124 SYDPGVPNRLDVMKEEEEKERERERERE-RERERERERER 162
S PG R+ M E+E++ R ER+ + + R +R +
Sbjct: 11 SIVPGAAERIIAMAEKEQEHRHELEERQLKAQARRAKRGQ 50
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 30.0 bits (68), Expect = 2.9
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 132 RLDVMKEEEEKEREREREREREREREREREREAKS 166
L+ ++ E E RE E ERERE E R+ +
Sbjct: 252 ELETLEAEIEDLRETIAETEREREELAEEVRDLRE 286
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 29.3 bits (65), Expect = 2.9
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 136 MKEEEEKEREREREREREREREREREREAKSYD 168
++E E ER E+ ERE + E+ERERE ++ +
Sbjct: 138 LEERMEWERREEKIDEREDQEEQEREREEQTIE 170
>gnl|CDD|232941 TIGR00362, DnaA, chromosomal replication initiator protein DnaA.
DnaA is involved in DNA biosynthesis; initiation of
chromosome replication and can also be transcription
regulator. The C-terminal of the family hits the pfam
bacterial DnaA (bac_dnaA) domain family. For a review,
see Kaguni (2006) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 405
Score = 29.8 bits (68), Expect = 3.0
Identities = 10/15 (66%), Positives = 12/15 (80%)
Query: 68 LYIYGAVGGGKTMLM 82
L+IYG VG GKT L+
Sbjct: 139 LFIYGGVGLGKTHLL 153
>gnl|CDD|220331 pfam09660, DUF2397, Protein of unknown function (DUF2397).
Proteins in this entry are encoded within a conserved
gene four-gene neighborhood found sporadically in a
phylogenetically broad range of bacteria including:
Nocardia farcinica, Symbiobacterium thermophilum, and
Streptomyces avermitilis (Actinobacteria), Geobacillus
kaustophilus (Firmicutes), Azoarcus sp. EbN1 and
Ralstonia solanacearum (Betaproteobacteria).
Length = 478
Score = 30.0 bits (68), Expect = 3.0
Identities = 6/28 (21%), Positives = 11/28 (39%)
Query: 140 EEKEREREREREREREREREREREAKSY 167
++ E+ E+ RE + ER
Sbjct: 379 RDRSAEKRELAEQAREEQAERAAALAEL 406
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 29.1 bits (66), Expect = 3.1
Identities = 17/57 (29%), Positives = 28/57 (49%), Gaps = 4/57 (7%)
Query: 109 KIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAK 165
K + K+ + K D G D +K+ E+ + + ER+RE R+R+ E E K
Sbjct: 19 KAKKEKRKQRKQARKGADDGD----DELKQAAEEAKAEKAERDRELNRQRQAEAEQK 71
>gnl|CDD|236353 PRK08939, PRK08939, primosomal protein DnaI; Reviewed.
Length = 306
Score = 29.5 bits (67), Expect = 3.1
Identities = 10/17 (58%), Positives = 13/17 (76%)
Query: 66 KGLYIYGAVGGGKTMLM 82
KGLY+YG G GK+ L+
Sbjct: 157 KGLYLYGDFGVGKSYLL 173
>gnl|CDD|221041 pfam11241, DUF3043, Protein of unknown function (DUF3043). Some
members in this family of proteins with unknown function
are annotated as membrane proteins. This cannot be
confirmed.
Length = 168
Score = 29.1 bits (66), Expect = 3.2
Identities = 9/25 (36%), Positives = 14/25 (56%), Gaps = 1/25 (4%)
Query: 140 EEKEREREREREREREREREREREA 164
E+++ ++ R RER R R R A
Sbjct: 23 EDRKAAKKAARAARRER-RARARAA 46
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 29.8 bits (67), Expect = 3.2
Identities = 20/73 (27%), Positives = 28/73 (38%), Gaps = 6/73 (8%)
Query: 107 HAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERER----ERERERERER 162
HA+ E+KK R K+ + + E+E RE +ER R R R
Sbjct: 326 HAENAEIKK--TRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAV 383
Query: 163 EAKSYDPIPPVAN 175
+AK I N
Sbjct: 384 KAKKKGLIDASPN 396
>gnl|CDD|119354 cd02875, GH18_chitobiase, Chitobiase (also known as
di-N-acetylchitobiase) is a lysosomal glycosidase that
hydrolyzes the reducing-end N-acetylglucosamine from the
chitobiose core of oligosaccharides during the ordered
degradation of asparagine-linked glycoproteins in
eukaryotes. Chitobiase can only do so if the asparagine
that joins the oligosaccharide to protein is previously
removed by a glycosylasparaginase. Chitobiase is
therefore the final step in the lysosomal degradation of
the protein/carbohydrate linkage component of
asparagine-linked glycoproteins. The catalytic domain of
chitobiase is an eight-stranded alpha/beta barrel fold
similar to that of other family 18 glycosyl hydrolases
such as hevamine and chitotriosidase.
Length = 358
Score = 29.7 bits (67), Expect = 3.2
Identities = 14/44 (31%), Positives = 20/44 (45%), Gaps = 6/44 (13%)
Query: 376 NNIRLVISSDVPLNKLFSNEAVIDTHSDEHRML-----MDDLNI 414
+RLV+ DVPL ++ SN + L MD +NI
Sbjct: 76 KGVRLVLKGDVPLEQI-SNPTYRTQWIQQKVELAKSQFMDGINI 118
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 29.2 bits (66), Expect = 3.4
Identities = 8/36 (22%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPP 172
K E+ K R ++++R+ +++E ++ EA+
Sbjct: 14 KREQRKAR-KKQKRKEAKKKEDAQKSEAEEVKNEEN 48
Score = 28.8 bits (65), Expect = 3.8
Identities = 7/36 (19%), Positives = 22/36 (61%)
Query: 131 NRLDVMKEEEEKEREREREREREREREREREREAKS 166
+R ++++ K +R+ ++++R+ +++E KS
Sbjct: 3 SREALLEQRRRKREQRKARKKQKRKEAKKKEDAQKS 38
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 29.5 bits (67), Expect = 3.4
Identities = 9/43 (20%), Positives = 23/43 (53%), Gaps = 13/43 (30%)
Query: 137 KEEEEKERERERERERERER-------------EREREREAKS 166
++EE +E++ E+++E + E+ER+++A+
Sbjct: 280 RQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 29.1 bits (66), Expect = 3.9
Identities = 9/30 (30%), Positives = 19/30 (63%)
Query: 136 MKEEEEKEREREREREREREREREREREAK 165
+K EE+ +E +E++ E+++E + AK
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
>gnl|CDD|192464 pfam10159, MMtag, Kinase phosphorylation protein. This is a
glycine-rich domain that is the most highly conserved
region of a family of proteins that in vertebrates are
associated with tumours in multiple myelomas. The region
may contain phosphorylation sites for several protein
kinases, as well as N-myristoylation sites and nuclear
localisation signals, so it might act as a signal
molecule in the nucleus.
Length = 78
Score = 27.3 bits (61), Expect = 3.5
Identities = 7/28 (25%), Positives = 12/28 (42%)
Query: 137 KEEEEKEREREREREREREREREREREA 164
+ E+ + + E R +E E E A
Sbjct: 49 DDTAEEAEKLKEEIRRIKEAEEEAMAAA 76
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 29.4 bits (66), Expect = 3.5
Identities = 10/33 (30%), Positives = 20/33 (60%)
Query: 132 RLDVMKEEEEKEREREREREREREREREREREA 164
L+ KEEEE+ R ++ E E++ + + ++A
Sbjct: 150 ALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQA 182
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 27.0 bits (60), Expect = 3.8
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 140 EEKEREREREREREREREREREREA 164
EEKE+ E E+E R++E E EA
Sbjct: 18 EEKEKALEAEKEEAEARQKEEEEEA 42
Score = 26.6 bits (59), Expect = 5.2
Identities = 12/28 (42%), Positives = 16/28 (57%), Gaps = 1/28 (3%)
Query: 138 EEEEKEREREREREREREREREREREAK 165
EE+EK E E+E E E ++ E E K
Sbjct: 18 EEKEKALEAEKE-EAEARQKEEEEEAIK 44
>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper.
Length = 54
Score = 26.8 bits (60), Expect = 3.9
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 5/34 (14%)
Query: 138 EEEEKERERERE-----REREREREREREREAKS 166
+E R R E RE++++RE E E K
Sbjct: 3 DEYRDRRRRNNEAARRSREKKKQREEELEERVKE 36
>gnl|CDD|234667 PRK00149, dnaA, chromosomal replication initiation protein;
Reviewed.
Length = 401
Score = 29.3 bits (67), Expect = 4.0
Identities = 10/15 (66%), Positives = 12/15 (80%)
Query: 68 LYIYGAVGGGKTMLM 82
L+IYG VG GKT L+
Sbjct: 102 LFIYGGVGLGKTHLL 116
>gnl|CDD|197400 cd10164, ClassIIa_HDAC5_Gln-rich-N, Glutamine-rich N-terminal
helical domain of HDAC5, a Class IIa histone
deacetylase. This family consists of the glutamine-rich
domain of histone deacetylase 5 (HDAC5). It belongs to a
superfamily that consists of the glutamine-rich
N-terminal helical extension to certain Class IIa
histone deacetylases (HDACs), including HDAC4, HDAC5 and
HDCA9; it is missing from HDAC7. This domain confers
responsiveness to calcium signals and mediates
interactions with transcription factors and cofactors,
and it is able to repress transcription independently of
the HDAC C-terminal, zinc-dependent catalytic domain. It
has many intra- and inter-helical interactions which are
possibly involved in reversible assembly and disassembly
of proteins. HDACs regulate diverse cellular processes
through enzymatic deacetylation of histone as well as
non-histone proteins, in particular deacetylating
N(6)-acetyl-lysine residues.
Length = 97
Score = 27.9 bits (61), Expect = 4.0
Identities = 9/29 (31%), Positives = 23/29 (79%)
Query: 134 DVMKEEEEKERERERERERERERERERER 162
+++ + ++E E++R+RE++R+ E E++R
Sbjct: 64 EILAAKRQQELEQQRKREQQRQEELEKQR 92
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 27.6 bits (62), Expect = 4.3
Identities = 12/31 (38%), Positives = 21/31 (67%), Gaps = 3/31 (9%)
Query: 139 EEEKEREREREREREREREREREREAKSYDP 169
E++ERE+ RE++ + E++ E E K +DP
Sbjct: 63 LEKREREK---REKKEKLEKKLEEELKEWDP 90
>gnl|CDD|184260 PRK13704, PRK13704, plasmid SOS inhibition protein A; Provisional.
Length = 240
Score = 28.9 bits (65), Expect = 4.3
Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 16/57 (28%)
Query: 117 LARDKAKSYDPGVPNRLDVMKEEEEKERERERERER------EREREREREREAKSY 167
L+ D A P V R RER R R R++ RE E++ ++Y
Sbjct: 98 LSGDVAARLFPSVRFR----------VRERRRHRSAIAFQRYSRQQAREAEQKWRAY 144
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 29.3 bits (66), Expect = 4.6
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 138 EEEEKEREREREREREREREREREREAKSYD 168
+EEE+ RE+E E ER E+ K +
Sbjct: 398 KEEERPREKEGTEEEERREITVYEKRIKKLE 428
Score = 28.5 bits (64), Expect = 8.6
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 134 DVMKEEEEKERERERERERERERERER 160
+ + + +E+ER RE+E E ER
Sbjct: 392 EALSKVKEEERPREKEGTEEEERREIT 418
>gnl|CDD|215579 PLN03106, TCP2, Protein TCP2; Provisional.
Length = 447
Score = 29.3 bits (65), Expect = 4.6
Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 138 EEEEKEREREREREREREREREREREAKSYDPIPPVAND 176
E +K RER RER +E+E+E A ++ P++ +
Sbjct: 205 ELRDKARERARERT-AKEKEKEDHNHAAHHNNNNPISQN 242
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 28.0 bits (63), Expect = 4.7
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 141 EKEREREREREREREREREREREA 164
E++RE + + E+ER+ E+E+E
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEW 38
Score = 27.7 bits (62), Expect = 7.6
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 139 EEEKERERERERERERERERERE 161
EE++E + + E+ER+ E+E+E
Sbjct: 15 EEKREEKEREKEEKERKEEKEKE 37
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 28.7 bits (64), Expect = 4.7
Identities = 9/25 (36%), Positives = 17/25 (68%), Gaps = 1/25 (4%)
Query: 139 EEEKERERERER-ERERERERERER 162
++E ERE+ RE+ E++ ++ ER
Sbjct: 5 DKEAEREKLREKYEQDEQKREATER 29
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 28.9 bits (65), Expect = 4.9
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 137 KEEEEKEREREREREREREREREREREAKS 166
K++EE+ER ER RE E + + K+
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKA 150
Score = 28.9 bits (65), Expect = 5.4
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 137 KEEEEKEREREREREREREREREREREAK 165
KEEEE+ ER RE E + +++AK
Sbjct: 123 KEEEERRVERRRELGLEDPEQLRLKQKAK 151
>gnl|CDD|147845 pfam05914, RIB43A, RIB43A. This family consists of several
RIB43A-like eukaryotic proteins. Ciliary and flagellar
microtubules contain a specialised set of
protofilaments, termed ribbons, that are composed of
tubulin and several associated proteins. RIB43A was
first characterized in the unicellular biflagellate,
Chlamydomonas reinhardtii although highly related
sequences are present in several higher eukaryotes
including humans. The function of this protein is
unknown although the structure of RIB43A and its
association with the specialised protofilament ribbons
and with basal bodies is relevant to the proposed role
of ribbons in forming and stabilising doublet and
triplet microtubules and in organising their
three-dimensional structure. Human RIB43A homologues
could represent a structural requirement in centriole
replication in dividing cells.
Length = 379
Score = 28.9 bits (65), Expect = 5.1
Identities = 13/35 (37%), Positives = 24/35 (68%)
Query: 134 DVMKEEEEKEREREREREREREREREREREAKSYD 168
+ K +E++ +E+ER RE E+ RE+E +R+A +
Sbjct: 284 AIRKGQEQQLQEKERRREEEQLREKEWDRQAINQA 318
Score = 28.9 bits (65), Expect = 5.2
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 140 EEKEREREREREREREREREREREAK 165
+EKER RE E+ RE+E +R+ +A+
Sbjct: 294 QEKERRREEEQLREKEWDRQAINQAR 319
>gnl|CDD|183452 PRK12337, PRK12337, 2-phosphoglycerate kinase; Provisional.
Length = 475
Score = 29.0 bits (65), Expect = 5.4
Identities = 12/44 (27%), Positives = 21/44 (47%)
Query: 135 VMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDII 178
V +E R R R+RE R RER + ++ I + + ++
Sbjct: 389 VTLPDEALHRRRFELRDRETGASRPRERYLRHFEEIRLIQDHLL 432
>gnl|CDD|233480 TIGR01579, MiaB-like-C, MiaB-like tRNA modifying enzyme. This
clade of sequences is closely related to MiaB, a
modifier of isopentenylated adenosine-37 of certain
eukaryotic and bacterial tRNAs (see TIGR01574). Sequence
alignments suggest that this equivalog performs the same
chemical transformation as MiaB, perhaps on a different
(or differently modified) tRNA base substrate. This
clade is a member of a subfamily (TIGR00089) and spans
low GC Gram positive bacteria, alpha and epsilon
proteobacteria, Campylobacter, Porphyromonas, Aquifex,
Thermotoga, Chlamydia, Treponema and Fusobacterium
[Protein synthesis, tRNA and rRNA base modification].
Length = 414
Score = 28.9 bits (65), Expect = 5.4
Identities = 12/58 (20%), Positives = 21/58 (36%), Gaps = 9/58 (15%)
Query: 121 KAKSYDPGVPN-----RLDVMKEEEEKERERERERERERER----EREREREAKSYDP 169
A + VP R+ +KE EK + ++ +E E+E+ Y
Sbjct: 334 PASTMKDKVPETIKKERVKRLKELAEKNYQEFLKKNIGKELEVLVEKEKAGVLTGYSE 391
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 28.5 bits (64), Expect = 5.6
Identities = 9/31 (29%), Positives = 20/31 (64%)
Query: 136 MKEEEEKEREREREREREREREREREREAKS 166
+K E+ +E++R+ E + ER + R+ + K+
Sbjct: 34 LKREKAQEKKRKAEAQAERRELKARKEKLKT 64
Score = 28.1 bits (63), Expect = 6.3
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 136 MKEEEEKEREREREREREREREREREREAK 165
+ + EK +E++R+ E + ER + R+ K
Sbjct: 32 LALKREKAQEKKRKAEAQAERRELKARKEK 61
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 28.6 bits (64), Expect = 5.6
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 128 GVPNRLDVMKEEEEKERERERERERERERERER 160
L+ M EE ++++ + R + RE E +
Sbjct: 96 EWYPSLNQMLEENREQQKEKEARRQAREAEIAK 128
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 29.4 bits (66), Expect = 5.6
Identities = 10/30 (33%), Positives = 13/30 (43%)
Query: 128 GVPNRLDVMKEEEEKERERERERERERERE 157
G P + E+E + RE RERE
Sbjct: 1079 GRPVTPAGLLARLEQEIAQRRELLTARERE 1108
Score = 29.0 bits (65), Expect = 6.0
Identities = 13/41 (31%), Positives = 16/41 (39%), Gaps = 5/41 (12%)
Query: 133 LDVMKEEEEKERERERE-----REREREREREREREAKSYD 168
L ++E E RE ERE ERE + R R
Sbjct: 285 LGRARDELETAREEERELDARTEALEREADALRTRLEALQG 325
>gnl|CDD|240312 PTZ00202, PTZ00202, tuzin; Provisional.
Length = 550
Score = 29.0 bits (65), Expect = 5.9
Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 2/32 (6%)
Query: 132 RLDVMKEEEEKERERERE-RERERERERERER 162
R+ V+ E+ E +R+ R+ RE RER +ER+
Sbjct: 166 RMSVL-EKSEWQRDHHRQMRELLRERVKERDF 196
>gnl|CDD|223666 COG0593, DnaA, ATPase involved in DNA replication initiation [DNA
replication, recombination, and repair].
Length = 408
Score = 28.8 bits (65), Expect = 6.1
Identities = 32/158 (20%), Positives = 46/158 (29%), Gaps = 51/158 (32%)
Query: 68 LYIYGAVGGGKTMLMDIFYESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDP 127
L+IYG VG GKT L+ A +E LA
Sbjct: 116 LFIYGGVGLGKTHLL------------------------QAIGNE---ALANGPNARVVY 148
Query: 128 GVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDIISKTWLICFD 187
E + R+ E E+ +E S D + +I +
Sbjct: 149 ------------LTSEDFTNDFVKALRDNEMEKFKEKYSLDLL------LIDDIQFLAGK 190
Query: 188 EFQVTDIADAMILKRLFTELFQLGVVVVATSNRAPDDL 225
E F L + G +V TS+R P +L
Sbjct: 191 E------RTQEEFFHTFNALLENGKQIVLTSDRPPKEL 222
>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
cellular Activities) superfamily represents an ancient
group of ATPases belonging to the ASCE (for additional
strand, catalytic E) division of the P-loop NTPase
fold. The ASCE division also includes ABC, RecA-like,
VirD4-like, PilT-like, and SF1/2 helicases. Members of
the AAA+ ATPases function as molecular chaperons,
ATPase subunits of proteases, helicases, or
nucleic-acid stimulated ATPases. The AAA+ proteins
contain several distinct features in addition to the
conserved alpha-beta-alpha core domain structure and
the Walker A and B motifs of the P-loop NTPases.
Length = 151
Score = 27.9 bits (62), Expect = 6.2
Identities = 10/20 (50%), Positives = 11/20 (55%)
Query: 63 KQPKGLYIYGAVGGGKTMLM 82
PK L +YG G GKT L
Sbjct: 17 PPPKNLLLYGPPGTGKTTLA 36
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.8 bits (65), Expect = 6.3
Identities = 13/52 (25%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 123 KSYDPGVPNRLDVMKEEEEKERERERERERE--REREREREREAKSYDPIPP 172
K D R KE ++++ + +++E +++E E+ +AK IPP
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK----IPP 593
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.8 bits (64), Expect = 6.5
Identities = 13/34 (38%), Positives = 14/34 (41%)
Query: 134 DVMKEEEEKEREREREREREREREREREREAKSY 167
D E E E E + E ERE E E KS
Sbjct: 103 DSGAENETVEEEEKEESREEREEVEETEGVTKSE 136
>gnl|CDD|226494 COG4006, COG4006, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 278
Score = 28.7 bits (64), Expect = 6.7
Identities = 24/86 (27%), Positives = 35/86 (40%), Gaps = 7/86 (8%)
Query: 285 GIFKLLCSQENDIVRPRVITIMGRNVTFNKTCGQILEATFDELCSRDLGSSDYLHICQIF 344
G+ L+ DI G V N T G E+ F L G+S +I + F
Sbjct: 132 GLSDLVREVYCDIKVRIG---RGNVVYINATGGFKPESGFLTLAGSLAGASAVYYIHEAF 188
Query: 345 HTV----IIRNVPQLNIKLRSQSRRF 366
+ V IR PQ+++ L +RR
Sbjct: 189 NDVVFIPPIRLSPQVDLHLIEDARRL 214
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 28.9 bits (65), Expect = 6.8
Identities = 12/49 (24%), Positives = 19/49 (38%), Gaps = 4/49 (8%)
Query: 120 DKAKSYDPGVPN----RLDVMKEEEEKEREREREREREREREREREREA 164
K K + N + M+ E +++E E RE E E E+
Sbjct: 359 GKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEES 407
>gnl|CDD|224648 COG1734, DksA, DnaK suppressor protein [Signal transduction
mechanisms].
Length = 120
Score = 27.4 bits (61), Expect = 6.9
Identities = 24/92 (26%), Positives = 43/92 (46%), Gaps = 14/92 (15%)
Query: 93 QKQRVHFNKFMLDVHAKI-HEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERE---RER 148
++Q HF +L+ + E+++ + ++ P D +EEE+E E R+R
Sbjct: 3 KEQLEHFKNKLLEWKKDLLEELEQTEEHLQDENEASPDPA--DRATQEEERELELRLRDR 60
Query: 149 ERERERERERERER-EAKSY-------DPIPP 172
ER+ R+ E +R E +Y +PIP
Sbjct: 61 ERKLLRKIESALDRIEEGTYGICEECGEPIPE 92
>gnl|CDD|233050 TIGR00609, recB, exodeoxyribonuclease V, beta subunit. The RecBCD
holoenzyme is a multifunctional nuclease with potent
ATP-dependent exodeoxyribonuclease activity. Ejection of
RecD, as occurs at chi recombinational hotspots,
cripples exonuclease activity in favor of recombinagenic
helicase activity. All proteins in this family for which
functions are known are DNA-DNA helicases that are used
as part of an exonuclease-helicase complex (made up of
RecBCD homologs) that function to generate substrates
for the initiation of recombination and recombinational
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1087
Score = 28.9 bits (65), Expect = 7.0
Identities = 29/101 (28%), Positives = 44/101 (43%), Gaps = 4/101 (3%)
Query: 107 HAKIHEVKKILA-RDKAKSYDPGVPNRLDVMKE-EEEKEREREREREREREREREREREA 164
HA E+ K+ A R+ V L ++KE +EE ++E++R RE + R A
Sbjct: 223 HAVFEEIDKLNAERNNLFCLKDRV--FLTLLKEVQEELKKEKKRRREIGFDDLLSRLETA 280
Query: 165 KSYDPIPPVANDIISKTWLICFDEFQVTDIADAMILKRLFT 205
+A I + + DEFQ TD I +LF
Sbjct: 281 LKSAEGEKLAQAIREQYPIALIDEFQDTDPQQYRIFSKLFI 321
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 28.9 bits (64), Expect = 7.1
Identities = 24/59 (40%), Positives = 35/59 (59%), Gaps = 5/59 (8%)
Query: 109 KIHEVKKI--LARDKAKSYDPGVPNRLDVMKEEEEK-ERER-ERER-ERERERERERER 162
KI E K+I L R++ + + R++ + E E+ ERER ER+R ER+R ERER
Sbjct: 453 KILEKKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERER 511
>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
Length = 432
Score = 28.5 bits (64), Expect = 7.3
Identities = 20/47 (42%), Positives = 29/47 (61%), Gaps = 9/47 (19%)
Query: 138 EEEEKERER-EREREREREREREREREAK------SYDPIPPVANDI 177
E EE+ R R +RE+E + RER+R+R A+ +YD P+A DI
Sbjct: 123 ESEEEWRARLKREQEEQYLRERQRQRMARLQANAAAYD--SPLAVDI 167
>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
protein; Provisional.
Length = 438
Score = 28.6 bits (64), Expect = 7.5
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 62 VKQPKGLYIYGAVGGGKTML 81
+K PKG+ +YG G GKT+L
Sbjct: 214 IKPPKGVILYGPPGTGKTLL 233
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 28.8 bits (65), Expect = 7.8
Identities = 7/35 (20%), Positives = 16/35 (45%), Gaps = 5/35 (14%)
Query: 137 KEEEEKERERERERER-----EREREREREREAKS 166
+ E+++++ E + R R + REA+
Sbjct: 439 RAIEQEKKKAEEAKARFEARQARLEREKAAREARH 473
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 28.5 bits (64), Expect = 8.0
Identities = 16/76 (21%), Positives = 38/76 (50%), Gaps = 5/76 (6%)
Query: 87 ESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERER 146
+ + K ++ + ++F + ++ E++K L+R + + + +KE EEKE
Sbjct: 286 KELKEKAEEYIKLSEFYEEYLDELREIEKRLSR-----LEEEINGIEERIKELEEKEERL 340
Query: 147 ERERERERERERERER 162
E +++ +E E+ E
Sbjct: 341 EELKKKLKELEKRLEE 356
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 28.5 bits (64), Expect = 8.3
Identities = 28/113 (24%), Positives = 42/113 (37%), Gaps = 23/113 (20%)
Query: 110 IHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDP 169
+H V+ IL + + +P + +EEE+++ E E E E E E E
Sbjct: 327 VHHVQHILPQGRCTWVNPEQKD-----EEEEQEDEEEEEEEEEPEEPEPEEGPP-----L 376
Query: 170 IPPVANDIISKTWLICFDEFQVTDIADAMILKRLFTELFQLGVVVVATSNRAP 222
+ P++ D D A RL + L V V SNR P
Sbjct: 377 LTPISED----------APLPNDDPAWTF---RLSSSLSPKYAVAVLRSNRWP 416
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 26.6 bits (59), Expect = 8.5
Identities = 12/47 (25%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 120 DKAKSYDPGVPNRLDVMKEEEEKERERERE-REREREREREREREAK 165
++ + + + + D + +E E E +R R +R RE+E E +A+
Sbjct: 13 EEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAE 59
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 28.1 bits (63), Expect = 8.8
Identities = 13/59 (22%), Positives = 28/59 (47%)
Query: 106 VHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREA 164
+ A V + SY+P + ++++EE EKE + E++R+ E ++ +
Sbjct: 169 LAANGKPVPAVEVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKM 227
>gnl|CDD|181091 PRK07720, fliJ, flagellar biosynthesis chaperone; Validated.
Length = 146
Score = 27.3 bits (61), Expect = 9.1
Identities = 18/96 (18%), Positives = 40/96 (41%), Gaps = 17/96 (17%)
Query: 87 ESCETKQKQRV----------HFNKFMLDVHAKIHEVKK--ILARDKAKSYDPGVPNRLD 134
E E +++++ H+ +F+ ++ I + + AR++ +
Sbjct: 47 EDLEQAKEEKLQSGLSIQEIRHYQQFVTNLERTIDHYQLLVMQAREQMNR-----KQQDL 101
Query: 135 VMKEEEEKEREREREREREREREREREREAKSYDPI 170
K E K+ E+ +E+++E E+ E K D I
Sbjct: 102 TEKNIEVKKYEKMKEKKQEMFALEEKAAEMKEMDEI 137
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 28.3 bits (63), Expect = 9.1
Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 2/50 (4%)
Query: 118 ARDKAKSYDPGVPNRLDVMKEEEE--KEREREREREREREREREREREAK 165
RD + P LD ++E+E ER E E+E+E+ + K
Sbjct: 250 KRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLK 299
>gnl|CDD|233714 TIGR02082, metH, 5-methyltetrahydrofolate--homocysteine
methyltransferase. This family represents
5-methyltetrahydrofolate--homocysteine methyltransferase
(EC 2.1.1.13), one of at least three different enzymes
able to convert homocysteine to methionine by
transferring a methyl group on to the sulfur atom. It is
also called the vitamin B12(or cobalamine)-dependent
methionine synthase. Other methionine synthases include
5-methyltetrahydropteroyltriglutamate--homocysteine
S-methyltransferase (MetE, EC 2.1.1.14, the
cobalamin-independent methionine synthase) and
betaine-homocysteine methyltransferase [Amino acid
biosynthesis, Aspartate family].
Length = 1178
Score = 28.6 bits (64), Expect = 9.1
Identities = 18/114 (15%), Positives = 38/114 (33%), Gaps = 23/114 (20%)
Query: 89 CETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERER 148
K V + +LD + + +++ + + +KEE + RE+
Sbjct: 832 IAPIYKGPVVY---VLDASRAVTVMDTLMSAKRKATE-------NGRIKEEYDTAREKHG 881
Query: 149 ERERERE------REREREREAKSYDPIPPVANDIISKTWLICFDEFQVTDIAD 196
E+ +R + S D PP ++ + +DIA+
Sbjct: 882 EQRSKRIAASEQAARKNVFAPDWSDDIEPPAPPFWGTQ-------IVEASDIAE 928
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.4 bits (63), Expect = 9.2
Identities = 15/64 (23%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 103 MLDVHAKIHEVKKILA-RDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERERE 161
M+D A + + +I + + AK + + + + EE + ++ E+ER ++ E+ER +
Sbjct: 54 MVDPGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKA 113
Query: 162 REAK 165
+E +
Sbjct: 114 QEQQ 117
>gnl|CDD|227184 COG4847, COG4847, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 103
Score = 26.8 bits (59), Expect = 9.2
Identities = 14/65 (21%), Positives = 22/65 (33%)
Query: 94 KQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERERE 153
K VH+ K K +A A + L + E E +E R+
Sbjct: 27 KGSVHYECLAESKRKKPGGDKDEVALLLANLLLLYLIVLLKELAVIAESEEAKESIRQVR 86
Query: 154 RERER 158
+E E+
Sbjct: 87 QEVEK 91
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 27.8 bits (61), Expect = 9.7
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 137 KEEEEKEREREREREREREREREREREAK 165
+ EE+ R E R +E ER RE+E +AK
Sbjct: 63 RAAEERLRREEEARRQEEERAREKEEKAK 91
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 27.1 bits (60), Expect = 9.7
Identities = 13/25 (52%), Positives = 18/25 (72%), Gaps = 1/25 (4%)
Query: 141 EKEREREREREREREREREREREAK 165
K +E+E E+ER ERE RE++AK
Sbjct: 39 AKAQEKEHEKER-AEREEAREKKAK 62
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.134 0.381
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,178,323
Number of extensions: 2205611
Number of successful extensions: 6938
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5734
Number of HSP's successfully gapped: 435
Length of query: 438
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 338
Effective length of database: 6,502,202
Effective search space: 2197744276
Effective search space used: 2197744276
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.7 bits)