RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15864
(553 letters)
>gnl|CDD|233165 TIGR00879, SP, MFS transporter, sugar porter (SP) family. This
model represent the sugar porter subfamily of the major
facilitator superfamily (pfam00083) [Transport and
binding proteins, Carbohydrates, organic alcohols, and
acids].
Length = 481
Score = 180 bits (458), Expect = 1e-50
Identities = 94/341 (27%), Positives = 160/341 (46%), Gaps = 15/341 (4%)
Query: 101 VLVLTYVAEITQPHLRGMLSATASMTTIFG-------TVSQLFLGSFLHWRSAAILNLLF 153
LV Y++EI LRG L++ + FG ++ L + L WR L L+
Sbjct: 146 ALVPMYLSEIAPKALRGALTSLYQLAITFGILVAYGFGSGKVSLNNTLGWRIPLGLQLIP 205
Query: 154 PILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDK-VQTELSQITKAIEESEL 212
L L+F+PESP WL+ +GR++EA SL LRG DK + EL I E
Sbjct: 206 AGLLFLGLFFLPESPRWLVGKGRVEEARKSLARLRGTSGEDKELLDELELIDIK-RSIEK 264
Query: 213 KRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESI-HAPLD 271
+ + R + L + QF G+ + Y+ IFE+ +
Sbjct: 265 RSVQPSWGSL----FSSTRRIRRRLFLGVVLQWFQQFTGINAIMYYSPTIFENAGVSTDH 320
Query: 272 PYFATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVY-AQFHLSYGWD 330
+ ++++G + + L+ G+RPL LI G A C V+ + A F
Sbjct: 321 AFLVSIIVGAVNFAFTFVAIFLVDRFGRRPLLLIGAAGMAICLFVLGILGASFVTGSSKS 380
Query: 331 SPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYY 390
S V VF++L + +PW+++ E+FP ++R + +++++ F V L+
Sbjct: 381 SGNVAIVFILLFIAFFAMGWGPVPWVIVSEIFPLSLRPKGISIAVAANWLANFIVGFLFP 440
Query: 391 PMLDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDI 431
ML++ + G F+ ++V+G +++YF +PET+GRTL +I
Sbjct: 441 TMLESIGVGGVFIFFGGLNVLGLIFVYFFLPETKGRTLEEI 481
Score = 31.9 bits (73), Expect = 0.85
Identities = 12/22 (54%), Positives = 19/22 (86%)
Query: 483 VVGTLYMYFVMPETEGRTLRDI 504
V+G +++YF +PET+GRTL +I
Sbjct: 460 VLGLIFVYFFLPETKGRTLEEI 481
>gnl|CDD|215702 pfam00083, Sugar_tr, Sugar (and other) transporter.
Length = 449
Score = 164 bits (416), Expect = 8e-45
Identities = 91/343 (26%), Positives = 159/343 (46%), Gaps = 21/343 (6%)
Query: 101 VLVLTYVAEITQPHLRGMLSATASMTTIFGT-VSQLF------LGSFLHWRSAAILNLLF 153
VLV Y++EI LRG L + + FG V+ + + WR L +
Sbjct: 120 VLVPMYISEIAPKKLRGALGSLYQLGITFGILVAAIIGLGLNKYSNSDGWRIPLGLQFVP 179
Query: 154 PILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELK 213
IL L L F+PESP WL+ +G+++EA A L LRG D+ + EE +
Sbjct: 180 AILLLIGLLFLPESPRWLVLKGKLEEARAVLAKLRGVSDVDQ---------EIQEEKDSL 230
Query: 214 RLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPY 273
+ ++ ++ +T + L Q G+ + Y+ IFE++ D
Sbjct: 231 ERSVEAEKASWLELFRGKTVRQRLLMGVMLQIFQQLTGINAIFYYSPTIFETLG-LSDSL 289
Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAV-YAQFHLSYGWDSP 332
T+++GV + + L+ G+RPL L+ G A CF+V+ V S G
Sbjct: 290 LVTIIVGVVNFVFTFIAIFLVDRFGRRPLLLLGAAGMAICFLVLGVALLGVAKSKGAG-- 347
Query: 333 LVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPM 392
+V VF++L + +PW+++ E+FP +R A + +++++ F + L+ +
Sbjct: 348 IVAIVFILLFIAFFALGWGPVPWVIVSELFPLGVRPKAMAIATAANWLANFLIGFLFPII 407
Query: 393 LDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDIEEHF 435
+A + V+ L+++F +PET+GRTL +I+E F
Sbjct: 408 TGAIG-GYVFLVFAGLLVLFILFVFFFVPETKGRTLEEIDELF 449
Score = 31.9 bits (73), Expect = 0.94
Identities = 13/26 (50%), Positives = 21/26 (80%)
Query: 483 VVGTLYMYFVMPETEGRTLRDIEEHF 508
V+ L+++F +PET+GRTL +I+E F
Sbjct: 424 VLFILFVFFFVPETKGRTLEEIDELF 449
Score = 29.6 bits (67), Expect = 4.9
Identities = 31/209 (14%), Positives = 59/209 (28%), Gaps = 54/209 (25%)
Query: 293 LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWD------SPLVPTVFLVLAAFLT 346
+ G + + +G + F V + D S L+ V V+ A L
Sbjct: 34 ALTSIGACAASTVLSGLIVSIFSVGCLIGSLFAGKLGDRFGRKKSLLIGNVLFVIGALLQ 93
Query: 347 HICIRLLPWMLI---------------------GEVFPNNIRATASGASGSS-------S 378
+MLI E+ P +R +
Sbjct: 94 GFAKGKSFYMLIVGRVIVGLGVGGISVLVPMYISEIAPKKLRGALGSLYQLGITFGILVA 153
Query: 379 YIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDIEEHFADK 438
I +NK D + + L +++ + + F+ PE+
Sbjct: 154 AIIGLGLNKYSNS--DGWRI--PLGLQFVPAILLLIGLLFL-PESP-------------- 194
Query: 439 GKTFVTNIRRAEKKRHAARVEGQEEVKGE 467
+ V + E + A++ G +V E
Sbjct: 195 -RWLVLKGKLEEARAVLAKLRGVSDVDQE 222
>gnl|CDD|182225 PRK10077, xylE, D-xylose transporter XylE; Provisional.
Length = 479
Score = 127 bits (321), Expect = 1e-31
Identities = 95/366 (25%), Positives = 167/366 (45%), Gaps = 63/366 (17%)
Query: 106 YVAEITQPHLRGMLSATASMTTIFGTVSQLFLGSFLH------------WR----SAAIL 149
Y+AEI H+RG L + IFG + F+ F+ WR S AI
Sbjct: 144 YIAEIAPAHIRGKLVSFNQFAIIFGQLVVYFVNYFIARSGDASWLNTDGWRYMFASEAIP 203
Query: 150 NLLFPILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEE 209
LLF +L LYF+PE+P +L+S+G+ ++A L + G L +I +++
Sbjct: 204 ALLFLML----LYFVPETPRYLMSRGKQEQAEGILRKIMG---NTLATQALQEIKHSLDH 256
Query: 210 SELKRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAP 269
++ M ++ ++ S+F QF G+ + YA IF+++ A
Sbjct: 257 GRKT----------GGKLLMFGVGVIVIGVMLSVF--QQFVGINVVLYYAPEIFKTLGAS 304
Query: 270 LD-PYFATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSA--ACFIVVAVYAQFHLS 326
D T+++GV L +L ++ + G++PL +I G A + A Y Q
Sbjct: 305 TDIALLQTIIVGVINLTFTVLAIMTVDKFGRKPLQIIGALGMAIGMFSLGTAFYTQA--- 361
Query: 327 YGWDSPLVPTVFLVL--AAFLTH---ICIRLLPWMLIGEVFPNNIRATASGASGSSSYIF 381
+V + ++ AAF +C W+L+ E+FPN IR A + ++ +I
Sbjct: 362 ----PGIVALLSMLFYVAAFAMSWGPVC-----WVLLSEIFPNAIRGKALAIAVAAQWIA 412
Query: 382 AFAVNKLYYPMLD-------TFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDIEEH 434
+ V+ + PM+D FH + + Y + V+ L+M+ +PET+G+TL ++E
Sbjct: 413 NYFVSWTF-PMMDKNSWLVAHFHNGFSYWIYGCMGVLAALFMWKFVPETKGKTLEEMEAL 471
Query: 435 FADKGK 440
+ + K
Sbjct: 472 WEPETK 477
Score = 29.3 bits (66), Expect = 5.6
Identities = 11/31 (35%), Positives = 21/31 (67%)
Query: 483 VVGTLYMYFVMPETEGRTLRDIEEHFADKGK 513
V+ L+M+ +PET+G+TL ++E + + K
Sbjct: 447 VLAALFMWKFVPETKGKTLEEMEALWEPETK 477
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 89.7 bits (223), Expect = 2e-19
Identities = 42/191 (21%), Positives = 70/191 (36%), Gaps = 13/191 (6%)
Query: 233 FLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIH--APLDPYFATLLLGVAELGGALLC 290
FLL ++ +L F G L TY + + + + L G+ + GALL
Sbjct: 172 FLLRLLLLLALAFFLLSFGYYGLLTYLPLYLQEVLGLSAAEAGLLLSLFGLGGILGALLG 231
Query: 291 VVLIHYTGKRP-LALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHIC 349
+L G+R L LI +A +++A+ L V + L
Sbjct: 232 GLLSDRLGRRRLLLLIGLLLAALGLLLLALAPSLALLL------VALLLLGFGLGFAFPA 285
Query: 350 IRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAIS 409
+ L E+ P R TASG + + L +LDT G AA++
Sbjct: 286 L----LTLASELAPPEARGTASGLFNTFGSLGGALGPLLAGLLLDTGGYGGVFLILAALA 341
Query: 410 VVGTLYMYFVM 420
++ L + +
Sbjct: 342 LLAALLLLLLP 352
Score = 55.8 bits (135), Expect = 3e-08
Identities = 35/189 (18%), Positives = 64/189 (33%), Gaps = 21/189 (11%)
Query: 239 IVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVV---LIH 295
++ L F L + E + L A L++ LG AL ++ L
Sbjct: 2 LLLFLGFFLSGLDRGLLSPALPLLAEDLG--LSASQAGLIVSAFSLGYALGSLLAGYLSD 59
Query: 296 YTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHI---CIRL 352
G+R + L+ A +++A F S L++ FL + +
Sbjct: 60 RFGRRRVLLLGLLLFALGSLLLA----FASSLWL---------LLVGRFLLGLGGGALYP 106
Query: 353 LPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVG 412
LI E FP R A G + + A L + ++ A + ++
Sbjct: 107 AAAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLL 166
Query: 413 TLYMYFVMP 421
L + F++
Sbjct: 167 ALLLLFLLR 175
Score = 45.0 bits (107), Expect = 6e-05
Identities = 45/281 (16%), Positives = 73/281 (25%), Gaps = 55/281 (19%)
Query: 96 MAAPLVLVLTYVAEITQPHLRGMLSATASMTTIFGTVSQLFLGSFLH----WRSAAILNL 151
A +AE P RG S G + LG L WR ++
Sbjct: 101 GGALYPAAAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLILA 160
Query: 152 LFPILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESE 211
+ +L L F+ L+ A A G+ + T L + +
Sbjct: 161 ILGLLLALLLLFLL--RLLLLL------ALAFFLLSFGYYG---LLTYLPLYLQEVLGLS 209
Query: 212 LKRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGM---------------TTLQ 256
G L + + L + GG+ L
Sbjct: 210 AAEAG---------------LLLSLFGLGGILGAL--LGGLLSDRLGRRRLLLLIGLLLA 252
Query: 257 TYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLI------HYTGKRPLALISTGGS 310
+ + + A LLLG + L G L +T GS
Sbjct: 253 ALGLLLLALAPSLALLLVALLLLGFGLGFAFPALLTLASELAPPEARGTA-SGLFNTFGS 311
Query: 311 AACFIVVAVYAQFHLSYGWDSP-LVPTVFLVLAAFLTHICI 350
+ + + G+ L+ +LAA L +
Sbjct: 312 LGGALGPLLAGLLLDTGGYGGVFLILAALALLAALLLLLLP 352
Score = 35.0 bits (81), Expect = 0.083
Identities = 38/197 (19%), Positives = 62/197 (31%), Gaps = 31/197 (15%)
Query: 230 RRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALL 289
RR LL ++ +L + ++L VG F LLG+
Sbjct: 64 RRVLLLG-LLLFALGSLL-LAFASSLWLLLVGRF--------------LLGLGGGALYPA 107
Query: 290 CVVLI-HYTGK----RPLALISTGGSAACFIVVAVYAQFHLSYGWDSP-LVPTVFLVLAA 343
LI + R L L S G + + S GW L+ + +L A
Sbjct: 108 AAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLLA 167
Query: 344 FLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLY 403
L +RLL +L F + G G +Y+ + L + L
Sbjct: 168 LLLLFLLRLL-LLLALAFF-----LLSFGYYGLLTYLPLYLQEVLGLSAAEAGLLLS--- 218
Query: 404 FYAAISVVGTLYMYFVM 420
+ ++G L +
Sbjct: 219 LFGLGGILGALLGGLLS 235
>gnl|CDD|233176 TIGR00898, 2A0119, cation transport protein. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 505
Score = 76.6 bits (189), Expect = 8e-15
Identities = 80/332 (24%), Positives = 123/332 (37%), Gaps = 64/332 (19%)
Query: 124 SMTTIFGTVSQLF--LGSFL---------HWRSAAILNLL--FPILALCALY-FIPESPH 169
I GT+ Q+F LG L WR L L P L F+PESP
Sbjct: 212 KQRAIVGTLIQVFFSLGLVLLPLVAYFIPDWR---WLQLAVSLPTFLFFLLSWFVPESPR 268
Query: 170 WLISQGRMQEASASLCWLRGWVTPDKVQ-TELSQITKAIEESELKRLG--KDGQRRPNYR 226
WLISQGR++EA L R K E+ ++ + S K+ D R PN R
Sbjct: 269 WLISQGRIEEALKIL--QRIAKINGKKLPAEVLSLSLEKDLSSSKKQYSFLDLFRTPNLR 326
Query: 227 MYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGG 286
+ T L T+ F +G + L + +I Y + G+ EL
Sbjct: 327 ---KTTLCLMMLWFTTAF--SYYGLVLDLG----NLGGNI------YLDLFISGLVELPA 371
Query: 287 ALLCVVLIHYTGKR-PLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLA-AF 344
L+ ++LI G+R +A A +++ V + + + T VL
Sbjct: 372 KLITLLLIDRLGRRYTMAASLLLAGVALLLLLFV----PVDLYF----LRTALAVLGKFG 423
Query: 345 LT--HICIRLLPWMLIGEVFPNNIRATA----SGASGSSSYIFAFAVNKLYYPMLDTFHL 398
+T + ++ E++P +R S + S I F V + L
Sbjct: 424 ITSAFQMV----YLYTAELYPTVVRNLGVGVCSTMARVGSIISPFLVYLGEKWLFLPLVL 479
Query: 399 WGTLYFYAAISVVGTLYMYFVMPETEGRTLRD 430
+G L A I + +PET+G L +
Sbjct: 480 FGGLALLAGILTLF-------LPETKGVPLPE 504
>gnl|CDD|219516 pfam07690, MFS_1, Major Facilitator Superfamily.
Length = 346
Score = 66.3 bits (162), Expect = 9e-12
Identities = 57/295 (19%), Positives = 90/295 (30%), Gaps = 63/295 (21%)
Query: 96 MAAPLVLVLTYVAEITQPHLRGM----LSATASMTTIFGTVSQLFLGSFLHWRSAAILNL 151
A +A+ P RG LSA + G + L S WR+A ++
Sbjct: 99 GGALFPAAAALIADWFPPEERGRALGLLSAGFGLGAALGPLLGGLLASLFGWRAAFLILA 158
Query: 152 LFPILALC---ALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIE 208
+ +LA L P A W
Sbjct: 159 ILALLAAVLAALLLPRPPPESKRPKPAEEAPAPLVPAWK--------------------- 197
Query: 209 ESELKRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHA 268
+ R +L + LF F +T L Y E +
Sbjct: 198 -------------------LLLRDPVLWLLLALLLFGFAFFALLTYLPLYQ----EVLG- 233
Query: 269 PLDPYFATLLLGVAELGGALLCVV---LIHYTGKRPLALISTGGSAACFIVVAVYAQFHL 325
L A LLLG+A L GA+ ++ L G+R L++ + +A+ +
Sbjct: 234 -LSALLAGLLLGLAGLLGAIGRLLLGRLSDRLGRRRRLLLALLLLILAALGLALLSLTES 292
Query: 326 SYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYI 380
S LV + L A L + L+ ++ P R TASG ++ +
Sbjct: 293 SLWL---LVALLLLGFGAGLVFPAL----NALVSDLAPKEERGTASGLYNTAGSL 340
Score = 38.6 bits (90), Expect = 0.007
Identities = 33/212 (15%), Positives = 60/212 (28%), Gaps = 29/212 (13%)
Query: 240 VTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELG---GALLCVVLIHY 296
+ F+ G + + E + + P LLL LG L L
Sbjct: 1 LFLAAFLAGLGRSLLGPALPLYLAEDLG--ISPTEIGLLLTAFSLGYALAQPLAGRLSDR 58
Query: 297 TGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLL--P 354
G+R + LI A +++ + L++ L + L
Sbjct: 59 FGRRRVLLIGLLLFALGLLLLLFASSL-------------WLLLVLRVLQGLGGGALFPA 105
Query: 355 WM-LIGEVFPNNIRATASG---ASGSSSYIFAFAVNKLYYPMLDTFHLW-GTLYFYAAIS 409
LI + FP R A G A + +L + W A ++
Sbjct: 106 AAALIADWFPPEERGRALGLLSAGFGLGAALGPLLG----GLLASLFGWRAAFLILAILA 161
Query: 410 VVGTLYMYFVMPETEGRTLRDIEEHFADKGKT 441
++ + ++P + R A
Sbjct: 162 LLAAVLAALLLPRPPPESKRPKPAEEAPAPLV 193
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 58.8 bits (143), Expect = 3e-09
Identities = 18/49 (36%), Positives = 28/49 (57%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
KK K KK K+A+KKEK+KEK K +K+ + K K++ + +
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 56.9 bits (138), Expect = 1e-08
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +K+ K KK K +K++K+KEK K KK+ + K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 54.2 bits (131), Expect = 1e-07
Identities = 13/53 (24%), Positives = 26/53 (49%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
P+ K KK K+ +K +K+++K + KK+ + K K++K +
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 53.8 bits (130), Expect = 1e-07
Identities = 17/41 (41%), Positives = 24/41 (58%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K +KK K KK K +KKEK+K+K K KK+ ++ N
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 49.2 bits (118), Expect = 4e-06
Identities = 19/43 (44%), Positives = 25/43 (58%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
K K EKK K KK K+ EKK+K+K+K K KK + + N
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 41.1 bits (97), Expect = 0.001
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E + K K +K+ K KK K+ +KK+K+K+K
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEK 409
Score = 30.7 bits (70), Expect = 2.1
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + E + K K ++K K KK K+ +KK+
Sbjct: 369 KARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKE 404
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 55.5 bits (134), Expect = 7e-09
Identities = 23/41 (56%), Positives = 32/41 (78%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ K+K+KKK+ KK KK++K K++K E K KKKKKKKKK
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 55.1 bits (133), Expect = 1e-08
Identities = 23/51 (45%), Positives = 35/51 (68%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
T + ++ +KKEKKKK++ K+K++KK+KK+K + K KKKKKKK
Sbjct: 140 TAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190
Score = 54.7 bits (132), Expect = 1e-08
Identities = 24/41 (58%), Positives = 32/41 (78%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ KK++KKKKE K +KKEKK +K+K + K KKKKKKKK
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 54.7 bits (132), Expect = 1e-08
Identities = 24/39 (61%), Positives = 30/39 (76%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K+ KKK++ KKEKK KK +K+K + K KKKKKKKKKK
Sbjct: 155 KEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 52.8 bits (127), Expect = 5e-08
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 5/65 (7%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEK-----KKKKKKK 56
S ++ T + + ++++K+KK+KK KKEKK++K KKEK KKKKK
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
Query: 57 KKKKK 61
KKKKK
Sbjct: 189 KKKKK 193
Score = 50.5 bits (121), Expect = 3e-07
Identities = 19/44 (43%), Positives = 31/44 (70%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+ + K +KE + +++EKK++KKKKE KK+KK+KK KK+
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179
Score = 47.4 bits (113), Expect = 4e-06
Identities = 18/43 (41%), Positives = 29/43 (67%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K+ K +++ ++E+KKEKKKK++ KK+KK+KK KK
Sbjct: 137 KETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179
Score = 42.0 bits (99), Expect = 2e-04
Identities = 16/50 (32%), Positives = 29/50 (58%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +KE K +K + E++++K+KK+KK+ KK+KK+KK
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKD 172
Score = 34.3 bits (79), Expect = 0.10
Identities = 9/42 (21%), Positives = 23/42 (54%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ + + + + K +++ + E+++KK+KKKKK+
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVK 164
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 52.4 bits (126), Expect = 3e-08
Identities = 27/75 (36%), Positives = 44/75 (58%), Gaps = 4/75 (5%)
Query: 19 DWKKAKKKE--KKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
WKK + E ++K+E+K +K E+K KK K++KKK+KKKKKKK K N + +
Sbjct: 58 KWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKS 117
Query: 75 NNNNNNNNNNNNEEK 89
+ +++ E+K
Sbjct: 118 SEESSDEEEEGEEDK 132
Score = 49.7 bits (119), Expect = 3e-07
Identities = 22/70 (31%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
++ K+K+++K KK K++KKK+KKKK KKK KK KK++K + ++ +++ +
Sbjct: 73 EEKKRKDEEKTAKKRAKRQKKKQKKKK-KKKAKKGNKKEEKEGSKSSEESSDEEEEGEED 131
Query: 81 NNNNNNEEKG 90
E
Sbjct: 132 KQEEPVEIME 141
Score = 48.1 bits (115), Expect = 1e-06
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 24 KKKEKKKK-EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
K++EKK+K E+K KK K++KKK++KKKKKK KK KK + ++ +++
Sbjct: 71 KREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEE 130
Query: 83 NNNNEE 88
+ E
Sbjct: 131 DKQEEP 136
Score = 46.6 bits (111), Expect = 3e-06
Identities = 23/69 (33%), Positives = 41/69 (59%), Gaps = 3/69 (4%)
Query: 25 KKEKKKKEKKAKKKEK-KKEKKKKEKKKKK--KKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
KKE + +E + K++EK +K+++K KK+ K KKK+KKKK N + ++
Sbjct: 60 KKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119
Query: 82 NNNNNEEKG 90
+++ EE+G
Sbjct: 120 ESSDEEEEG 128
Score = 46.6 bits (111), Expect = 3e-06
Identities = 19/66 (28%), Positives = 35/66 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+K ++K KK+ K+ KKK+KKK+KKK +K KK++K+ K + +++ +
Sbjct: 77 RKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEP 136
Query: 81 NNNNNN 86
Sbjct: 137 VEIMEK 142
>gnl|CDD|233175 TIGR00895, 2A0115, benzoate transport. [Transport and binding
proteins, Carbohydrates, organic alcohols, and acids].
Length = 398
Score = 54.7 bits (132), Expect = 7e-08
Identities = 57/281 (20%), Positives = 97/281 (34%), Gaps = 44/281 (15%)
Query: 105 TYVAEITQPHLRGMLSATASMTTIF--GTVSQLFLGSFL----HWRSAAILNLLFP-ILA 157
V+E RG +A M + G FL +L WRS + + P +L
Sbjct: 128 ALVSEYAPKRFRG--TAVGLMFCGYPIGAAVGGFLAGWLIPVFGWRSLFYVGGIAPLLLL 185
Query: 158 LCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELKRLGK 217
L + F+PES +L+S+ P+ V+ ++ I ++ L +
Sbjct: 186 LLLMRFLPESIDFLVSKR-----------------PETVRRIVNAIAPQMQAEAQSALPE 228
Query: 218 DGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGI---FESIHAPLDPYF 274
+ R + F A +T L ++ F M + Y + +
Sbjct: 229 QKATQGTKRSVFKALFQGKTARITVLLWLLYF--MLLVGVYFLTNWLPKLMVELGFSLSL 286
Query: 275 ATLLLGVAELGGALLCVV---LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDS 331
A + GG + ++ L G R AL+ G+ V V + S
Sbjct: 287 AATGGALFNFGGVIGSIIFGWLADRLGPRVTALLLLLGAV---FAVLVGSTLF------S 337
Query: 332 PLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASG 372
P + + +A F + L L+ +P IRAT G
Sbjct: 338 PTLLLLLGAIAGFFVNGGQSGLY-ALMALFYPTAIRATGVG 377
>gnl|CDD|225371 COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport
and metabolism].
Length = 394
Score = 52.6 bits (127), Expect = 3e-07
Identities = 58/345 (16%), Positives = 102/345 (29%), Gaps = 88/345 (25%)
Query: 101 VLVLTYVAEITQPHLRGMLSATASMTTIFGTVSQL-------FLGSFLHWRSAAILNLLF 153
+ A + P RG A A + T T++ + FLG WR+ F
Sbjct: 120 SIAAALAARLVPPGKRG--RALALVFTGL-TLATVLGVPLGTFLGQLFGWRAT------F 170
Query: 154 PILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELK 213
+A+ AL +L L + P S+I+ ++
Sbjct: 171 LAIAVLAL--------------------LALLLLWKLLPP-------SEISGSLPGPLRT 203
Query: 214 RLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPY 273
L +RR +L + T LF G F T ++ + +
Sbjct: 204 LLRL-----------LRRPGVLLGLLATFLFMTGHFALYTYIRPFLESVAG-----FSVS 247
Query: 274 FATLLL---GVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWD 330
+L+L G+A G LL L G R + + A + +
Sbjct: 248 AVSLVLLAFGIAGFIGNLLGGRLADR-GPRRALIAALLLLALALLALTFTGA-------- 298
Query: 331 SPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASG-------ASGSSSYIFAF 383
SP + L L F ++ L + + P+ S + +
Sbjct: 299 SPALALALLFLWGFAFSPALQGL-QTRLARLAPDAADLAGSLNVAAFNLGIALGAALGG- 356
Query: 384 AVNKLYYPMLDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTL 428
+LD T + A ++ L + + + RT
Sbjct: 357 -------LVLDALGYAATGWV-GAALLLLALLLALLSARKDRRTA 393
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 50.8 bits (122), Expect = 4e-07
Identities = 22/42 (52%), Positives = 32/42 (76%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
WK+ K+K +KKK ++ KK+E+ +K+K +KK KKKKK KKK
Sbjct: 162 WKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKG 203
Score = 41.9 bits (99), Expect = 3e-04
Identities = 17/39 (43%), Positives = 32/39 (82%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK K+K+++K K K++ KE+K+K +KKK +++KK+++N
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREEN 183
Score = 41.1 bits (97), Expect = 5e-04
Identities = 22/53 (41%), Positives = 34/53 (64%), Gaps = 11/53 (20%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEK---------KKKEK--KKKKKKKKKKKKN 62
KKA K+++K+K+K K+ +++KEK KK+E+ KK+K KK KKK
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKK 197
Score = 41.1 bits (97), Expect = 5e-04
Identities = 17/55 (30%), Positives = 34/55 (61%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
+++ +K++++KA+KK+K+KE KKKE +K + ++ K + N + N N
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
Score = 39.6 bits (93), Expect = 0.002
Identities = 16/46 (34%), Positives = 29/46 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+KA+KK+K+K+ KK + +K + ++ K ++ K KKK +N N
Sbjct: 18 RKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
Score = 39.6 bits (93), Expect = 0.002
Identities = 14/55 (25%), Positives = 31/55 (56%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
++++ K+ ++K +KK+K+K+ KKK+ +K + ++ N N + N N
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
Score = 39.2 bits (92), Expect = 0.002
Identities = 18/49 (36%), Positives = 29/49 (59%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
++ +K KK+K K+AKKKE ++ + +E K ++ K KKK N N
Sbjct: 15 REQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
Score = 38.8 bits (91), Expect = 0.003
Identities = 23/51 (45%), Positives = 30/51 (58%), Gaps = 7/51 (13%)
Query: 18 LDWKKAKKKEKKKKEKKAKKK-EKKKEK------KKKEKKKKKKKKKKKKK 61
LD KA + E+K+K KA K E K K KK K+K+K+KKK KK+
Sbjct: 111 LDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKE 161
Score = 38.4 bits (90), Expect = 0.004
Identities = 13/55 (23%), Positives = 32/55 (58%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+++ K+++ K KK+++K+ KKK++ +K + ++ K ++N + N N
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
Score = 38.1 bits (89), Expect = 0.005
Identities = 15/55 (27%), Positives = 29/55 (52%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
E++ +K+E++K +KK+++K+ KKK+ +K N N + N N
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
Score = 37.7 bits (88), Expect = 0.007
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 16/60 (26%)
Query: 20 WKKAKKK---------EK------KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
W KA K EK K+KEK+ KKK KK+ K++KEK +KKK +++KK+ N
Sbjct: 125 WTKALAKAEGVKVKDDEKLLKKALKRKEKQ-KKKSKKEWKERKEKVEKKKAERQKKREEN 183
Score = 35.4 bits (82), Expect = 0.047
Identities = 12/55 (21%), Positives = 31/55 (56%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
E+++++++ +K KK+++K+ +KK+ +K + ++ N N + N N
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
Score = 34.6 bits (80), Expect = 0.073
Identities = 14/55 (25%), Positives = 28/55 (50%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
++++ K++++K KK+K K+ KKK+ +K + N N + N N
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
Score = 34.2 bits (79), Expect = 0.11
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
++ KK+E+ K++K KK KKK+K KK+ +
Sbjct: 175 ERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 33.4 bits (77), Expect = 0.18
Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK--EKKKKKKKKKKKKK 61
F + KK+KKKK+ K+ KK E +KK E+ + K + ++K
Sbjct: 71 FADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEK 122
Score = 33.4 bits (77), Expect = 0.20
Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 9/50 (18%)
Query: 21 KKAKKKEKKKK---------EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KKK+KKK E + KK E+ E K E ++K+K K K
Sbjct: 82 KLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAK 131
Score = 32.3 bits (74), Expect = 0.42
Identities = 11/34 (32%), Positives = 23/34 (67%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+E +++ +K+E++K KK+K+K+ KKK+
Sbjct: 1 PSSREALLEQRRRKREQRKARKKQKRKEAKKKED 34
Score = 31.9 bits (73), Expect = 0.51
Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 25/68 (36%)
Query: 19 DWKKA-KKKEKKKKEKKAKKKEKKKEKKKKEK------------------------KKKK 53
D+K+ KK E +KK+ + ++K E ++KEK K+K+
Sbjct: 93 DYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKE 152
Query: 54 KKKKKKKK 61
K+KKK KK
Sbjct: 153 KQKKKSKK 160
Score = 29.2 bits (66), Expect = 4.2
Identities = 19/84 (22%), Positives = 31/84 (36%), Gaps = 42/84 (50%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE-------------------------------- 48
++ +K+E++K KK K+KE KK++ ++
Sbjct: 10 QRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKV 69
Query: 49 ----------KKKKKKKKKKKKKN 62
K KKKKKKKK +
Sbjct: 70 EFADGEQAKKDLKLKKKKKKKKTD 93
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 52.5 bits (126), Expect = 4e-07
Identities = 20/59 (33%), Positives = 30/59 (50%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
+K+ EKK E K E K KK K+ + NNNN +NN N + N ++ N ++
Sbjct: 27 TSSNSKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKP 85
Score = 42.9 bits (101), Expect = 3e-04
Identities = 14/61 (22%), Positives = 28/61 (45%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
K+ ++K K + K K+ K+ + K+NN N + N ++ N + + N+
Sbjct: 32 KQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNK 91
Query: 88 E 88
E
Sbjct: 92 E 92
Score = 39.4 bits (92), Expect = 0.004
Identities = 19/69 (27%), Positives = 28/69 (40%), Gaps = 17/69 (24%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+ KK E K E K +K K+ + NNNN +NN N + N
Sbjct: 33 QPEKKPEIKPNENTPKIPKKPDNKEP-----------------SENNNNKSNNENKDEEN 75
Query: 81 NNNNNNEEK 89
++ N E+K
Sbjct: 76 PSSTNPEKK 84
Score = 39.4 bits (92), Expect = 0.005
Identities = 17/72 (23%), Positives = 35/72 (48%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
EP+ + K+ + K ++ + EKK + K +++ +K K + KK + + NN
Sbjct: 56 KEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEIEKPKDEPKKPDKKPQADQPNN 115
Query: 73 NNNNNNNNNNNN 84
+ + NNN +
Sbjct: 116 VHADQPNNNKVD 127
Score = 33.7 bits (77), Expect = 0.26
Identities = 13/73 (17%), Positives = 28/73 (38%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
+P + + + + K + +KK + K K++ +K K + +
Sbjct: 52 KPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEIEKPKDEPKKPDKKPQAD 111
Query: 74 NNNNNNNNNNNNN 86
NN + + NNN
Sbjct: 112 QPNNVHADQPNNN 124
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 48.4 bits (116), Expect = 6e-07
Identities = 15/41 (36%), Positives = 28/41 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K+ KK++ ++E+ KE+ +KEK+ +K ++KK K+
Sbjct: 83 KKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKR 123
Score = 45.3 bits (108), Expect = 7e-06
Identities = 10/40 (25%), Positives = 26/40 (65%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K++ + ++ K+ +KEK+ ++ ++KK K+++K+K
Sbjct: 91 KRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 43.4 bits (103), Expect = 3e-05
Identities = 13/41 (31%), Positives = 28/41 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK + + ++E+ ++ EK+K +K +KK K+++K+K+K
Sbjct: 90 KKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 42.6 bits (101), Expect = 5e-05
Identities = 17/56 (30%), Positives = 30/56 (53%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KH E KK K+K KK + A + +++ K++ EK+K+ +K ++KK
Sbjct: 65 KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKK 120
Score = 40.7 bits (96), Expect = 3e-04
Identities = 11/42 (26%), Positives = 27/42 (64%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K++KKK+++ KK+ +++ K++ +K+K+ +KN
Sbjct: 75 LLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKN 116
Score = 38.8 bits (91), Expect = 0.001
Identities = 12/40 (30%), Positives = 27/40 (67%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KKK++K+ KK + + + ++ KE+ +K+K+ +K ++
Sbjct: 79 KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNRE 118
Score = 38.8 bits (91), Expect = 0.001
Identities = 14/48 (29%), Positives = 28/48 (58%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E L K+ KKK K+ +K+ ++E+ K+ +K+K+ +K ++K
Sbjct: 72 EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREK 119
Score = 37.3 bits (87), Expect = 0.005
Identities = 11/45 (24%), Positives = 29/45 (64%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+K +EK+KK+K+ + +K++ + +++ K++ +K+K+ N
Sbjct: 72 EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKN 116
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 49.3 bits (118), Expect = 7e-07
Identities = 19/42 (45%), Positives = 29/42 (69%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+ ++K+K K +KK KK+K K+K KK+ KK K +KK +K
Sbjct: 76 KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117
Score = 48.5 bits (116), Expect = 1e-06
Identities = 18/42 (42%), Positives = 28/42 (66%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K K K+KK K+KK K K+KK +KK + +KK +K+ + K
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Score = 47.8 bits (114), Expect = 2e-06
Identities = 18/47 (38%), Positives = 29/47 (61%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
WK KKK KKKK+K KK+ KK+ K ++K +K+ + K + + +
Sbjct: 84 WKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130
Score = 47.4 bits (113), Expect = 3e-06
Identities = 19/40 (47%), Positives = 27/40 (67%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+K K K KKKK KK K K+K K+ KK+ K +KK +K+ +
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119
Score = 46.6 bits (111), Expect = 7e-06
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KK+K KK+K K +K +K K +KK +K+ + K +
Sbjct: 83 KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 46.6 bits (111), Expect = 7e-06
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+ K +KKK +KK K + KK+ KK +K +KK +K+ + K
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Score = 45.5 bits (108), Expect = 2e-05
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 3/54 (5%)
Query: 13 TEPTPLDWKK---AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
E T KK A++ EK KKE + K+K K K+KK K+KK K K KK KK++
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108
Score = 44.7 bits (106), Expect = 3e-05
Identities = 18/41 (43%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKK-KKKKKKKKKKKN 62
K+K K KK+K KKK+K K+KK +K K +KK +K+ ++
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120
Score = 43.9 bits (104), Expect = 5e-05
Identities = 20/48 (41%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 17 PLDWKKAK-KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
L + K KKE ++K+K KK+K K+KK K+K KK KK K +
Sbjct: 66 ELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113
Score = 43.5 bits (103), Expect = 8e-05
Identities = 18/40 (45%), Positives = 26/40 (65%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + +EK+K + K KK +KKK+K K +K KK K +KK
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
Score = 41.6 bits (98), Expect = 3e-04
Identities = 15/42 (35%), Positives = 22/42 (52%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K + KK +KK K EKK EK+ ++K + K +
Sbjct: 92 KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETL 133
Score = 41.2 bits (97), Expect = 4e-04
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK K K+K KK+ K K +KK++K+ E K + K +
Sbjct: 92 KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETL 133
Score = 39.7 bits (93), Expect = 0.001
Identities = 21/49 (42%), Positives = 33/49 (67%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
TP+ + + +KKKKE + ++ KKE ++K+K K KKKK KKKK+ +
Sbjct: 50 TPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD 98
Score = 39.7 bits (93), Expect = 0.001
Identities = 17/38 (44%), Positives = 25/38 (65%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
KK+KK +E +K KK+ E+K+K K KKKK K + +
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD 98
Score = 36.6 bits (85), Expect = 0.013
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K K+ KK+ K++KK++K+ + K E K +
Sbjct: 96 DKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135
Score = 34.7 bits (80), Expect = 0.066
Identities = 13/43 (30%), Positives = 21/43 (48%), Gaps = 1/43 (2%)
Query: 20 WKKAKKKEKKKKEKKAK-KKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K K+KK +K K +K+ +KE + K + K +
Sbjct: 93 KKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135
Score = 32.7 bits (75), Expect = 0.28
Identities = 12/40 (30%), Positives = 18/40 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KK KK+ K ++K K+ E K E K + + K
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140
Score = 32.0 bits (73), Expect = 0.44
Identities = 19/39 (48%), Positives = 24/39 (61%), Gaps = 7/39 (17%)
Query: 35 AKKKEK----KKEKKKKE---KKKKKKKKKKKKKNNNNN 66
AKKK+K + EK KKE K+K K KKKK KK + +
Sbjct: 60 AKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD 98
Score = 32.0 bits (73), Expect = 0.45
Identities = 9/40 (22%), Positives = 19/40 (47%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
D K KK +KK + + K +++ ++K + K +
Sbjct: 96 DKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135
Score = 32.0 bits (73), Expect = 0.49
Identities = 9/44 (20%), Positives = 20/44 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K K ++K +KE + K ++ K + + K +K + +
Sbjct: 106 KDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRKYALHKD 149
Score = 30.4 bits (69), Expect = 1.8
Identities = 9/43 (20%), Positives = 21/43 (48%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
D K KK +K +K+ + + ++K ++ K + + K +
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPR 142
Score = 28.9 bits (65), Expect = 4.8
Identities = 12/40 (30%), Positives = 18/40 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K +KK EK+A+ K + K E + K +K
Sbjct: 104 DKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143
Score = 28.5 bits (64), Expect = 6.6
Identities = 14/58 (24%), Positives = 32/58 (55%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
E KK+KK+ ++ EK KK+ ++K+K K + + + + ++ ++ +E+K
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 48.1 bits (115), Expect = 8e-07
Identities = 20/38 (52%), Positives = 28/38 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+K K+ K+K K+ K+K+KKK+KKKK+ KK KKKK
Sbjct: 98 QKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 45.1 bits (107), Expect = 1e-05
Identities = 21/40 (52%), Positives = 28/40 (70%), Gaps = 2/40 (5%)
Query: 25 KKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+E+K K+ K K K+KK++KKKK+KKKKK KK KK
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 43.1 bits (102), Expect = 5e-05
Identities = 19/40 (47%), Positives = 27/40 (67%), Gaps = 2/40 (5%)
Query: 26 KEKKK--KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
E++K K+ +K ++KK+KKKK+KKKKKK KK K
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 33.9 bits (78), Expect = 0.063
Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+E+K K+ K+ K K+KK+KKKKKKKKKK +
Sbjct: 95 VEEQKIVKQVLKE--KAKQKKQKKKKKKKKKKKTS 127
>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
Length = 479
Score = 50.9 bits (122), Expect = 1e-06
Identities = 21/64 (32%), Positives = 34/64 (53%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
KK+KKKK+K +K+ + KK ++KK+ N NN N + +++ N+N
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSN 68
Query: 85 NNEE 88
NEE
Sbjct: 69 KNEE 72
Score = 48.2 bits (115), Expect = 7e-06
Identities = 18/64 (28%), Positives = 30/64 (46%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
KKK+KKKK+ + + KK ++KK+ NN N + +++ N+N
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSN 68
Query: 84 NNNE 87
N E
Sbjct: 69 KNEE 72
Score = 46.7 bits (111), Expect = 2e-05
Identities = 18/65 (27%), Positives = 33/65 (50%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
K KKK+KKK K++ + +K ++KK+ ++ NN N + +++ N+N
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSN 68
Query: 82 NNNNN 86
N N
Sbjct: 69 KNEEN 73
Score = 46.7 bits (111), Expect = 2e-05
Identities = 19/64 (29%), Positives = 31/64 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK KKK+K KE + KK ++KK+ ++ NN N + +++ N+N
Sbjct: 10 KKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSNK 69
Query: 81 NNNN 84
N N
Sbjct: 70 NEEN 73
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 48.7 bits (116), Expect = 1e-06
Identities = 23/46 (50%), Positives = 31/46 (67%), Gaps = 1/46 (2%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
E TP D + K EKK K+KK + +K+++KKKKEKKKKKK+
Sbjct: 125 PLPEETPSDSEGLKGHEKKHKKKK-HEDDKERKKKKKEKKKKKKRH 169
Score = 41.7 bits (98), Expect = 3e-04
Identities = 22/58 (37%), Positives = 28/58 (48%), Gaps = 12/58 (20%)
Query: 21 KKAKKKEKKKKEKKA------------KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
KK K K KK + + K EKK +KKK E K++KKKKK+KK
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKR 168
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 50.0 bits (120), Expect = 2e-06
Identities = 22/36 (61%), Positives = 29/36 (80%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KEK K K K++EKK +K+KK+KK+KKK KK+KKK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 48.4 bits (116), Expect = 6e-06
Identities = 20/38 (52%), Positives = 29/38 (76%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K K K+K+ +KK +K++KKKK KKK KK+KKK +K
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 48.0 bits (115), Expect = 7e-06
Identities = 20/38 (52%), Positives = 29/38 (76%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+K K KKK+E+K +K KKK+K+KK+ KK+KKK +K
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 47.7 bits (114), Expect = 1e-05
Identities = 18/39 (46%), Positives = 29/39 (74%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K+ K KKK ++ K +++KK+KK+K+K KK+KKK +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 46.9 bits (112), Expect = 2e-05
Identities = 20/43 (46%), Positives = 31/43 (72%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
++ K+ ++ KEK K +KK+E+KK +K+KKKKK+KKK K
Sbjct: 366 EELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408
Score = 46.1 bits (110), Expect = 3e-05
Identities = 19/39 (48%), Positives = 30/39 (76%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K + KK+++ KK +K+K+KKK++KK KK+KKK +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 45.7 bits (109), Expect = 4e-05
Identities = 19/38 (50%), Positives = 29/38 (76%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+K K +KK ++K+ +K KKKK++KKK KK+KKK +
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 44.6 bits (106), Expect = 9e-05
Identities = 18/35 (51%), Positives = 27/35 (77%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+K K KKK ++K+ +K++KKKK+KKK KK+K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKK 410
Score = 40.7 bits (96), Expect = 0.002
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEK 49
P P K+ +KK +K+K+KK +KK+ KK KKK K
Sbjct: 380 PKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 40.0 bits (94), Expect = 0.003
Identities = 18/37 (48%), Positives = 28/37 (75%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+E K+ ++ K+K K KKK+E+KK +K+KKKKK+
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKR 401
Score = 34.9 bits (81), Expect = 0.10
Identities = 12/36 (33%), Positives = 25/36 (69%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
+ +P ++ K +++KKK+K+ KK +K+K+K +K
Sbjct: 379 YPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 30.3 bits (69), Expect = 3.0
Identities = 12/35 (34%), Positives = 24/35 (68%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ K+E + +E K++ K KKK+++KK +K+K
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRK 396
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 49.9 bits (120), Expect = 2e-06
Identities = 19/41 (46%), Positives = 31/41 (75%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA KK KK EK KK+E++K++KKK+ KKK++++++
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446
Score = 47.6 bits (114), Expect = 1e-05
Identities = 18/49 (36%), Positives = 35/49 (71%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ T K +K EKK++E+K +KK+K KKKE++++++K+KK+++
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 46.8 bits (112), Expect = 2e-05
Identities = 19/57 (33%), Positives = 36/57 (63%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
T+ +I T K K EK +K+++ +KKEKKK+ +KK+++++++K+KK
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Score = 46.5 bits (111), Expect = 2e-05
Identities = 17/42 (40%), Positives = 34/42 (80%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K KK+E++KKEKK K KK+++++E++K+KK+++K+++
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Score = 45.7 bits (109), Expect = 4e-05
Identities = 15/41 (36%), Positives = 35/41 (85%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+KA+KK +++K++K KK K+K+++E+++K+KK+++K++
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 45.7 bits (109), Expect = 5e-05
Identities = 16/43 (37%), Positives = 32/43 (74%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K ++K+++EKK KKK+ KKK+E+++++K+KK+++K
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 44.1 bits (105), Expect = 1e-04
Identities = 16/39 (41%), Positives = 27/39 (69%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK KK K+ K ++K++E+KK++KKK KKK+++
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444
Score = 44.1 bits (105), Expect = 2e-04
Identities = 15/48 (31%), Positives = 36/48 (75%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E + KK KKK+E++ ++KEKK+E+K++E+++ +++K+++++
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 42.6 bits (101), Expect = 5e-04
Identities = 16/42 (38%), Positives = 37/42 (88%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K +++EKK+K+KKA +KK+E++++EK+KK+++K+++++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Score = 41.4 bits (98), Expect = 0.001
Identities = 15/41 (36%), Positives = 35/41 (85%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ +KKEKKKK KKKE+++E++K++K+++K++++++ +
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Score = 40.7 bits (96), Expect = 0.002
Identities = 15/42 (35%), Positives = 34/42 (80%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK KKK+ +KK +++E++KEKK++EK++++++ +++K+
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 39.9 bits (94), Expect = 0.003
Identities = 14/41 (34%), Positives = 36/41 (87%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ ++K++KKK+ A KK++++E+++KEKK+++K++++++
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 39.9 bits (94), Expect = 0.003
Identities = 13/42 (30%), Positives = 34/42 (80%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK K KKKE++ +++++KKE++K+E++++ +++K++++
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 39.1 bits (92), Expect = 0.005
Identities = 11/43 (25%), Positives = 34/43 (79%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+K KK KK+++ +++EK+K++++KE+++++ +++K+++
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 38.0 bits (89), Expect = 0.011
Identities = 12/37 (32%), Positives = 34/37 (91%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KK +++E++K++K+ +K+E+++E +++++++++KKKK
Sbjct: 439 KKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 46.8 bits (112), Expect = 4e-06
Identities = 17/34 (50%), Positives = 22/34 (64%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KKE K EK KK EK K K +K+K KK++ K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 44.8 bits (107), Expect = 2e-05
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKE KA +K +KK +K K K +KKK KK++ K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 41.7 bits (99), Expect = 2e-04
Identities = 16/47 (34%), Positives = 22/47 (46%), Gaps = 2/47 (4%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
K + KA +K +KK EK K EKKK KK++ K +
Sbjct: 54 LKESLEAALLDK--KELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 40.6 bits (96), Expect = 4e-04
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
K + K +K EKKA+K + K EKKK +K++ K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 39.4 bits (93), Expect = 0.001
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
KK K +K ++K K K K ++KK K+++ K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 38.3 bits (90), Expect = 0.003
Identities = 18/36 (50%), Positives = 24/36 (66%), Gaps = 1/36 (2%)
Query: 18 LDWKKAKKKEKK-KKEKKAKKKEKKKEKKKKEKKKK 52
LD K+ K EK KK +KAK K +KK+ KK+E K +
Sbjct: 63 LDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 36.4 bits (85), Expect = 0.011
Identities = 19/53 (35%), Positives = 21/53 (39%), Gaps = 10/53 (18%)
Query: 20 WKKAKKKE----------KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK KK E K KE KKE K EK +KK +K K K
Sbjct: 34 KKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAE 86
Score = 35.2 bits (82), Expect = 0.032
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKN 62
K++ K +K EKK +K K K +KK
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKK 89
Score = 27.9 bits (63), Expect = 10.0
Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 10/55 (18%)
Query: 23 AKKKEKKKKEKKAKK-----KEKKKEKK-----KKEKKKKKKKKKKKKKNNNNNN 67
A+KK+ KK E + K+ K+ + KKE K +K +KK +K
Sbjct: 32 AQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAE 86
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 49.2 bits (118), Expect = 4e-06
Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK +KK ++K EKKA+K+E +K KK+ + KK K
Sbjct: 412 KKLRKK-QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452
Score = 48.0 bits (115), Expect = 1e-05
Identities = 17/43 (39%), Positives = 29/43 (67%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
A++K+ +KK++KA+KK +K+E +K KKK + KK K +
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450
Score = 47.2 bits (113), Expect = 2e-05
Identities = 15/42 (35%), Positives = 25/42 (59%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK +K EKK ++++A+K KK+ + KK K + KK +
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVD 457
Score = 45.3 bits (108), Expect = 7e-05
Identities = 16/43 (37%), Positives = 24/43 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+KK +KK+ K KK EK++ +K KKK + KK K +
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450
Score = 44.5 bits (106), Expect = 1e-04
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K ++K +KK EK+ +K K+K + KK K + KK
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455
Score = 42.2 bits (100), Expect = 6e-04
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
+ KK +KK++K EKK ++++ +K KKK +
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 41.8 bits (99), Expect = 7e-04
Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 3/42 (7%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
E+KK KK +K EKK EK E+ +K KKK +
Sbjct: 408 PAERKKLRKKQRKAEKKAEK---EEAEKAAAKKKAEAAAKKA 446
Score = 33.8 bits (78), Expect = 0.26
Identities = 12/54 (22%), Positives = 25/54 (46%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
A+ +E++ + E+KK +KK+ K +KK +K++ +K
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 33.0 bits (76), Expect = 0.40
Identities = 11/56 (19%), Positives = 27/56 (48%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
PL + +++ + A++K+ +K+++K EKK +K++ +K
Sbjct: 391 PLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 30.7 bits (70), Expect = 2.6
Identities = 9/54 (16%), Positives = 23/54 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
++E+ + + + KK KK+++ +KK +K++ +K
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 28.7 bits (65), Expect = 8.2
Identities = 10/38 (26%), Positives = 22/38 (57%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + +E++ + ++KK +KK++K +KK +K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK 427
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 47.6 bits (114), Expect = 4e-06
Identities = 10/46 (21%), Positives = 25/46 (54%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
KAK + ++ +++ K++ +++K +++K K K +KK
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 43.8 bits (104), Expect = 7e-05
Identities = 13/41 (31%), Positives = 28/41 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++A+++ KK++ K+K ++E+K K K +KK+K + +K
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153
Score = 41.1 bits (97), Expect = 6e-04
Identities = 10/49 (20%), Positives = 25/49 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
++AK K + ++ ++ KK + +K+K ++++K K K +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 41.1 bits (97), Expect = 6e-04
Identities = 13/47 (27%), Positives = 31/47 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
++AKK+E +++KA ++E+K + K KK+K + +K + ++ ++
Sbjct: 117 QQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSD 163
Score = 40.3 bits (95), Expect = 0.001
Identities = 10/54 (18%), Positives = 28/54 (51%), Gaps = 1/54 (1%)
Query: 23 AKKKEKKKKEK-KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
A+K+ ++ K K +A++ E++ +K++ +K+K ++++K
Sbjct: 99 ARKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 38.4 bits (90), Expect = 0.005
Identities = 8/41 (19%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ ++ +K++ + +K +++ K K + +KK+K + +K
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154
Score = 37.6 bits (88), Expect = 0.008
Identities = 10/44 (22%), Positives = 20/44 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+KA ++E+K K K +KK K + +K + + +
Sbjct: 129 EKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172
Score = 36.4 bits (85), Expect = 0.018
Identities = 10/49 (20%), Positives = 22/49 (44%), Gaps = 3/49 (6%)
Query: 31 KEKKAKKKEKKKE---KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
+E KAK + ++ E KK++ +K+K ++++K
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 33.4 bits (77), Expect = 0.22
Identities = 8/41 (19%), Positives = 22/41 (53%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+K++ ++E+K K K +K++K + +K + + +
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
Score = 29.9 bits (68), Expect = 2.7
Identities = 9/46 (19%), Positives = 18/46 (39%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+K K K +KK K +K + + + + + K K +
Sbjct: 136 RKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVKVKAGKSA 181
>gnl|CDD|152107 pfam11671, Apis_Csd, Complementary sex determiner protein. This
family of proteins represents the complementary sex
determiner in the honeybee. In the honeybee, the
mechanism of sex determination depends on the csd gene
which produces an SR-type protein. Males are homozygous
while females are homozygous for the csd gene.
Heterozygosity generates an active protein which
initiates female development.
Length = 146
Score = 46.3 bits (109), Expect = 5e-06
Identities = 16/68 (23%), Positives = 44/68 (64%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
+ +++++E+K + + +E ++ +++ + + ++++ ++ K ++ +N +NN N NNN
Sbjct: 6 YSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYNNNY 65
Query: 80 NNNNNNNE 87
NN NN N+
Sbjct: 66 NNYNNYNK 73
Score = 45.9 bits (108), Expect = 7e-06
Identities = 16/67 (23%), Positives = 41/67 (61%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
++ +E+++K K + ++ + +E+ + + ++++ +++ ++ +N +NN N NNN N
Sbjct: 7 SRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYNNNYN 66
Query: 83 NNNNEEK 89
N NN K
Sbjct: 67 NYNNYNK 73
Score = 44.7 bits (105), Expect = 2e-05
Identities = 20/66 (30%), Positives = 36/66 (54%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+K+ K E +E + +E+ +++ ++E+ ++ K +NN N NNN NN NN N
Sbjct: 14 QKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYNNNYNNYNNYNK 73
Query: 81 NNNNNN 86
+ N N
Sbjct: 74 HYYNIN 79
Score = 38.2 bits (88), Expect = 0.003
Identities = 14/76 (18%), Positives = 40/76 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+K + +++++K K + +E ++ +++ + + ++++ + ++ +N +NN N N
Sbjct: 3 RKRYSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYN 62
Query: 81 NNNNNNEEKGTFLYKM 96
NN NN Y +
Sbjct: 63 NNYNNYNNYNKHYYNI 78
Score = 32.4 bits (73), Expect = 0.26
Identities = 12/85 (14%), Positives = 39/85 (45%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
+K+ +++++E+K K + ++ ++ +++ ++ + + ++ +N +NN N
Sbjct: 3 RKRYSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNYN 62
Query: 89 KGTFLYKMAAPLVLVLTYVAEITQP 113
Y + Y+ +I P
Sbjct: 63 NNYNNYNNYNKHYYNINYIEQIPVP 87
Score = 32.0 bits (72), Expect = 0.38
Identities = 10/63 (15%), Positives = 35/63 (55%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
++K+ + +E++ K + + ++ + +++ + + ++ + + ++ +N +NN N
Sbjct: 2 SRKRYSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHKIISSLSNLSNNYNY 61
Query: 83 NNN 85
NNN
Sbjct: 62 NNN 64
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 45.7 bits (109), Expect = 6e-06
Identities = 16/35 (45%), Positives = 29/35 (82%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E+ K+EKK + KEKK+E+K+++++ K++KKK K +
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 41.5 bits (98), Expect = 2e-04
Identities = 14/35 (40%), Positives = 28/35 (80%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
+ K+EKKK+ K+ K++EK+++++ K++KKK K +
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 38.8 bits (91), Expect = 0.001
Identities = 14/34 (41%), Positives = 26/34 (76%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
+ K++KKK K KK+E+K+ K++ +++KKK K +
Sbjct: 97 RNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 38.4 bits (90), Expect = 0.002
Identities = 17/35 (48%), Positives = 27/35 (77%), Gaps = 3/35 (8%)
Query: 30 KKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
K E + K+EKKK EKK++EK++K++ K++KKK
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKK 126
Score = 35.3 bits (82), Expect = 0.018
Identities = 14/32 (43%), Positives = 24/32 (75%), Gaps = 1/32 (3%)
Query: 21 KKAKKK-EKKKKEKKAKKKEKKKEKKKKEKKK 51
K+ KKK K+KKE++ ++K + K++KKK K +
Sbjct: 99 KQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 29.9 bits (68), Expect = 1.3
Identities = 19/65 (29%), Positives = 30/65 (46%), Gaps = 18/65 (27%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK--------------EKKKKKKKKK 57
FTE K + KK+KK K+ +++ K+ KK K++KKK+
Sbjct: 51 FTEAV----KVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRS 106
Query: 58 KKKKN 62
K+KK
Sbjct: 107 KEKKE 111
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 47.5 bits (113), Expect = 6e-06
Identities = 22/57 (38%), Positives = 26/57 (45%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
EPTP K+ K EKK K+ K K K K K K K + + K KK K N
Sbjct: 77 PEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPN 133
Score = 39.8 bits (93), Expect = 0.002
Identities = 16/67 (23%), Positives = 24/67 (35%), Gaps = 2/67 (2%)
Query: 13 TEPTPLDWKKAKK--KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
+P + + K+K K KK +K K K K + K K K K + K +
Sbjct: 67 PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126
Query: 71 NNNNNNN 77
N
Sbjct: 127 KAPAAPN 133
Score = 38.6 bits (90), Expect = 0.004
Identities = 16/49 (32%), Positives = 22/49 (44%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+P P + + K K+ K EKK +K K + K K K K K K
Sbjct: 65 EQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKP 113
Score = 38.2 bits (89), Expect = 0.005
Identities = 13/48 (27%), Positives = 22/48 (45%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
EP P ++ + + + + K K+K + +KK KK K K K
Sbjct: 55 EPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPK 102
Score = 35.9 bits (83), Expect = 0.032
Identities = 14/48 (29%), Positives = 21/48 (43%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
P + K + + E K ++K + +KK KK K K K K K
Sbjct: 60 PEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKP 107
Score = 32.8 bits (75), Expect = 0.32
Identities = 16/61 (26%), Positives = 20/61 (32%), Gaps = 4/61 (6%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+ E + K K K K K K K K K K + + KK K K
Sbjct: 77 PEPTPPKPKEKPKPEKKPKKPKPKP----KPKPKPKPKVKPQPKPKKPPSKTAAKAPAAP 132
Query: 65 N 65
N
Sbjct: 133 N 133
Score = 32.8 bits (75), Expect = 0.38
Identities = 13/55 (23%), Positives = 21/55 (38%), Gaps = 2/55 (3%)
Query: 7 HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
PT + + E++ K + E E + K+K K +KK KK
Sbjct: 44 LAAKVLEAPTEEPQPEPEPPEEQPKPPT--EPETPPEPTPPKPKEKPKPEKKPKK 96
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation
factor 3 (eIF3). In yeast it is called Hcr1. The
Saccharomyces cerevisiae protein eIF3j (HCR1) has been
shown to be required for processing of 20S pre-rRNA and
binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 46.6 bits (111), Expect = 1e-05
Identities = 16/44 (36%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 19 DWKKAKKKEKKK-KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
W + + +EK++ K K A K + KK K K ++K+K K++K++K
Sbjct: 36 SWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79
Score = 45.8 bits (109), Expect = 2e-05
Identities = 14/45 (31%), Positives = 27/45 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+ K++EK K KAK K+ K K ++++K K++K++K +
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Score = 38.1 bits (89), Expect = 0.007
Identities = 14/37 (37%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++++K+E+KAK K K KK K K ++K+K K++
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKAL-KAKIEEKEKAKREK 76
Score = 35.8 bits (83), Expect = 0.037
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
EK K +K + KKKK K K K N N++
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 35.4 bits (82), Expect = 0.044
Identities = 12/32 (37%), Positives = 16/32 (50%)
Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
+K K +K K KKKK K K N N++
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 35.0 bits (81), Expect = 0.060
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ +EK++EK K K K KK K K
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIE 67
Score = 34.6 bits (80), Expect = 0.084
Identities = 12/33 (36%), Positives = 14/33 (42%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
EK K EK AK +KKK K K +
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224
Score = 34.6 bits (80), Expect = 0.098
Identities = 13/47 (27%), Positives = 29/47 (61%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
K ++KEK K+EK+ K + +E +++ +K + +K ++ ++ NN
Sbjct: 64 AKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQEESDLNN 110
Score = 34.2 bits (79), Expect = 0.13
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
E+ +K+E+K + K K KK K K ++K
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEEKE 70
Score = 33.9 bits (78), Expect = 0.16
Identities = 10/44 (22%), Positives = 16/44 (36%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+K K +K K KK+K K K K +++
Sbjct: 191 INEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGYDDDF 234
Score = 33.5 bits (77), Expect = 0.21
Identities = 13/32 (40%), Positives = 15/32 (46%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
EK K EK K KKKK K N N++
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 30.0 bits (68), Expect = 3.1
Identities = 11/50 (22%), Positives = 22/50 (44%), Gaps = 3/50 (6%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKK---KKEKKKKKKKKKKK 59
K K K ++K++ K +K+EK + E + +K + +K
Sbjct: 52 VAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101
Score = 29.6 bits (67), Expect = 3.6
Identities = 9/40 (22%), Positives = 25/40 (62%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KAK +EK+K +++ ++K ++ ++ + + +K + +K
Sbjct: 62 LKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 45.1 bits (107), Expect = 1e-05
Identities = 20/53 (37%), Positives = 30/53 (56%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
P + +K +K+ K+ E+K K KK++K+KKEKK +K KKK N
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTT 143
Score = 44.8 bits (106), Expect = 2e-05
Identities = 21/56 (37%), Positives = 34/56 (60%)
Query: 9 IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
I F E + + K+ K+ ++K+ KAKK++K+K++KK +K KKK K KN
Sbjct: 88 INLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTT 143
Score = 40.9 bits (96), Expect = 4e-04
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 4/47 (8%)
Query: 21 KKAKKKEKKKKEKKA----KKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KAKK++K+KKEKK KK+ K K KK KK KK++
Sbjct: 112 IKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEGK 158
Score = 39.0 bits (91), Expect = 0.002
Identities = 17/41 (41%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K K KK+K++K +KK +K KKK K K KK K
Sbjct: 108 EKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATK 148
Score = 38.2 bits (89), Expect = 0.003
Identities = 17/49 (34%), Positives = 26/49 (53%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E + + +K+ K K++K +KKEKK +K +KK K K KK
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKK 145
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 45.6 bits (108), Expect = 1e-05
Identities = 20/77 (25%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
P D + KK KKK+ +KK+ K+ KKK+K+K++ + ++ +++ N N+ +
Sbjct: 40 PEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVD 99
Query: 77 NN-----NNNNNNNNEE 88
N N+E+
Sbjct: 100 YELPKVQNTAAEVNHED 116
Score = 42.9 bits (101), Expect = 1e-04
Identities = 19/61 (31%), Positives = 30/61 (49%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
+ KK KK+K +KKK KK KKKKK+K++ +++ N N+ +
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELP 103
Query: 89 K 89
K
Sbjct: 104 K 104
Score = 42.9 bits (101), Expect = 1e-04
Identities = 21/78 (26%), Positives = 32/78 (41%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
F E + KK KK+K KK K+ KK+KK+KE+ + ++ N N+
Sbjct: 38 IFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKK 97
Query: 71 NNNNNNNNNNNNNNNNEE 88
+ N N E
Sbjct: 98 VDYELPKVQNTAAEVNHE 115
Score = 40.2 bits (94), Expect = 0.001
Identities = 17/57 (29%), Positives = 28/57 (49%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
K +KKK+ KKK KK KKKKK+K++ +++ N N+ + +
Sbjct: 49 KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKV 105
>gnl|CDD|233167 TIGR00881, 2A0104, phosphoglycerate transporter family protein.
[Transport and binding proteins, Carbohydrates, organic
alcohols, and acids].
Length = 379
Score = 46.6 bits (111), Expect = 2e-05
Identities = 32/158 (20%), Positives = 56/158 (35%), Gaps = 15/158 (9%)
Query: 249 FGGMTTLQTYAVGIFESIH-APLDPYFATLLLGVAELGGALLCVVLIH--YTGKR-PLAL 304
G + Y + + +A L + L G LL L + G+R PLA+
Sbjct: 233 TGILDWSPLYL---TQEKGFSKEKASWAFTLYELGGLVGTLLAGWLSDKLFNGRRGPLAV 289
Query: 305 ISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPN 364
I+V++ + +PL+ + L FL + +L ++ E+ P
Sbjct: 290 F-----FMALIIVSLLVYWLNPAA--NPLMDLICLFALGFLVYGPQ-MLIGVIASELAPK 341
Query: 365 NIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTL 402
TA+G G +Y+ + D F G
Sbjct: 342 KAAGTAAGFVGFFAYLGGILAGLPLGYLADGFGWAGAF 379
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 46.6 bits (111), Expect = 2e-05
Identities = 19/49 (38%), Positives = 27/49 (55%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
+ + K KK KK++K E KKK +KKKKKKK+KK+ +
Sbjct: 44 EEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92
Score = 46.2 bits (110), Expect = 3e-05
Identities = 23/54 (42%), Positives = 31/54 (57%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
++KKK +KKK+KK +KKE K E + K K KK KK KK N + +N
Sbjct: 64 NESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDN 117
Score = 44.3 bits (105), Expect = 1e-04
Identities = 29/85 (34%), Positives = 38/85 (44%), Gaps = 2/85 (2%)
Query: 11 TFTEPTPLDWKKAKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
T T D K+ K E KKK +KK KKKEKK+ K + E K K KK KK
Sbjct: 49 VATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPK 108
Query: 69 NNNNNNNNNNNNNNNNNNEEKGTFL 93
N + +N N E+ ++
Sbjct: 109 PKPNEDVDNAFNKIAELAEKSNVYI 133
Score = 43.2 bits (102), Expect = 3e-04
Identities = 18/66 (27%), Positives = 29/66 (43%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
S + T + + KK+KK K + K+K +K+KKKKK+KK+ K +
Sbjct: 27 IYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSE 86
Query: 62 NNNNNN 67
Sbjct: 87 GETKLG 92
Score = 42.0 bits (99), Expect = 6e-04
Identities = 17/49 (34%), Positives = 27/49 (55%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
+E+ K + KK+KK++K + KKK +KKKKKKK+ +
Sbjct: 44 EEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92
Score = 39.3 bits (92), Expect = 0.004
Identities = 18/52 (34%), Positives = 23/52 (44%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
E++ K K+ KKE K E KKK +KKKKKKK +
Sbjct: 41 TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92
Score = 36.2 bits (84), Expect = 0.036
Identities = 12/54 (22%), Positives = 22/54 (40%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
+++ + +K K++ K + KKK +KKKKKK +
Sbjct: 39 LSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92
Score = 35.1 bits (81), Expect = 0.099
Identities = 14/53 (26%), Positives = 22/53 (41%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+++ K A KK +K+ K + KKK +KKKKK +
Sbjct: 40 STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92
Score = 32.8 bits (75), Expect = 0.54
Identities = 12/76 (15%), Positives = 19/76 (25%), Gaps = 7/76 (9%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN-------N 73
K + KE ++E K KK KK + NN + +
Sbjct: 19 SKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKK 78
Query: 74 NNNNNNNNNNNNNEEK 89
+ K
Sbjct: 79 EKKEPKSEGETKLGFK 94
Score = 30.1 bits (68), Expect = 3.1
Identities = 8/77 (10%), Positives = 20/77 (25%), Gaps = 7/77 (9%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN------ 75
+ K + K+ +++ K KK+ + NN + +
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76
Query: 76 -NNNNNNNNNNNEEKGT 91
+ + G
Sbjct: 77 KKEKKEPKSEGETKLGF 93
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 43.0 bits (102), Expect = 4e-05
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K K +KK+E+K + EK + +EK + + +K +K+
Sbjct: 17 IAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKE 57
Score = 42.6 bits (101), Expect = 6e-05
Identities = 13/39 (33%), Positives = 25/39 (64%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
AK+K K +K+++ KK E +K K ++K + + +K +K
Sbjct: 18 AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEK 56
Score = 41.5 bits (98), Expect = 1e-04
Identities = 11/49 (22%), Positives = 27/49 (55%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
W+K ++++K + EK AK ++K + + EK +K+ ++ + +
Sbjct: 24 WEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKA 72
Score = 39.9 bits (94), Expect = 5e-04
Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 3/46 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKK---EKKKKKKKKKKKKKNN 63
KA KEK K EKK ++K+ + EK K E+K + + +K +K+
Sbjct: 14 DKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELE 59
Score = 36.1 bits (84), Expect = 0.009
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K AK ++K E + +K EK+ E+ + E +++ K + KK
Sbjct: 37 KLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKM 77
Score = 35.3 bits (82), Expect = 0.020
Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 3/39 (7%)
Query: 24 KKKEKKKKEK---KAKKKEKKKEKKKKEKKKKKKKKKKK 59
++EK +K KA KEK K +KK+E+KK + +K K
Sbjct: 2 PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAK 40
Score = 34.5 bits (80), Expect = 0.034
Identities = 11/43 (25%), Positives = 25/43 (58%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+A+K K E+KA+ + +K EK+ +E + + +++ K +
Sbjct: 31 KKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAE 73
Score = 34.5 bits (80), Expect = 0.035
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KE K K K K +KK+++KK + +K K ++K
Sbjct: 6 KTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEK 46
Score = 33.0 bits (76), Expect = 0.11
Identities = 6/43 (13%), Positives = 21/43 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+ K + +EK + EK +++ ++ + + +++ K +
Sbjct: 34 EAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKK 76
Score = 31.5 bits (72), Expect = 0.40
Identities = 9/41 (21%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ +K+ + AK+K K ++K++++K + +K K +
Sbjct: 4 EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAE 44
Score = 31.5 bits (72), Expect = 0.42
Identities = 14/37 (37%), Positives = 22/37 (59%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ +E+K K K KEK K EKK+++KK + +K
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK 37
Score = 31.1 bits (71), Expect = 0.54
Identities = 9/37 (24%), Positives = 22/37 (59%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ E++K + + ++K K +KK+++KK + +K
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK 37
Score = 30.3 bits (69), Expect = 1.0
Identities = 10/41 (24%), Positives = 22/41 (53%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
A++K + + EK K+ E+ + + + + K + KK +K
Sbjct: 42 SAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLSEKG 82
Score = 29.9 bits (68), Expect = 1.4
Identities = 9/41 (21%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K E + ++ + + +E + E ++E K + KK +K
Sbjct: 42 SAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLSEKG 82
Score = 29.1 bits (66), Expect = 2.6
Identities = 7/43 (16%), Positives = 23/43 (53%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +K ++ K +K + +K++++K+ + +K K ++
Sbjct: 3 EEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEE 45
Score = 28.4 bits (64), Expect = 4.8
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ ++EK KE K K++ K +KK+++KK +
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEA 35
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 45.9 bits (109), Expect = 5e-05
Identities = 22/99 (22%), Positives = 35/99 (35%), Gaps = 8/99 (8%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK------KKKNNNNNNNNNNNNNNNN 77
E + E K +K++ K++ KK+ KK KK + + N N NN
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 78 NNNNNNNNNEEKGTFLYKMAAPLVLVLTYVAEITQPHLR 116
Y +A L +V + A H+R
Sbjct: 61 KKKKKKKKKNLGEA--YDLAYDLPVVWSSAAFQDNSHIR 97
Score = 40.9 bits (96), Expect = 0.001
Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 9/81 (11%)
Query: 22 KAKKKEKKKKEKKAKKKEKK---------KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
+ K+K KKK KK+KK E ++ E K+ KKKKKKKK N
Sbjct: 16 QNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAY 75
Query: 73 NNNNNNNNNNNNNNEEKGTFL 93
+ + ++ + + +
Sbjct: 76 DLAYDLPVVWSSAAFQDNSHI 96
Score = 32.4 bits (74), Expect = 0.69
Identities = 18/67 (26%), Positives = 29/67 (43%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
E ++ E K K+KKK+KKKK+KK + ++ +N++ N
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWP 103
Query: 85 NNEEKGT 91
E K T
Sbjct: 104 EQEWKQT 110
Score = 32.0 bits (73), Expect = 0.83
Identities = 12/51 (23%), Positives = 23/51 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
++ + K+ K KKK+KKK+KKK + ++ +N+
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNS 94
Score = 31.6 bits (72), Expect = 1.2
Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 18/90 (20%)
Query: 21 KKAKKKEKKKKEKK------------------AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+ K+K KK KK A+ K+ K+KKKK+KKKKKK +
Sbjct: 18 KQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDL 77
Query: 63 NNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
+ ++ +N++ N + +
Sbjct: 78 AYDLPVVWSSAAFQDNSHIRKLGNWPEQEW 107
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 45.6 bits (108), Expect = 5e-05
Identities = 20/61 (32%), Positives = 30/61 (49%), Gaps = 7/61 (11%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEK-------KKKEKKKKKKKKKKKKKNNNNN 66
E K + KEK+ ++K K+KEKK E+ KK+E+ + K + KK K N
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPN 185
Query: 67 N 67
Sbjct: 186 K 186
Score = 45.3 bits (107), Expect = 7e-05
Identities = 18/41 (43%), Positives = 32/41 (78%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ K+E+K+KE+ ++K+KKKEK K+E K +K K++ K+K
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
Score = 44.5 bits (105), Expect = 1e-04
Identities = 14/54 (25%), Positives = 32/54 (59%)
Query: 8 QIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ P K K+ K + K+ +K++++ +++KK+KK+K K++ K +K
Sbjct: 79 RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 44.5 bits (105), Expect = 1e-04
Identities = 15/41 (36%), Positives = 32/41 (78%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ ++K+KKK++ K + K++K +++ KEK+ K+K+K+K+K
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEK 152
Score = 44.1 bits (104), Expect = 1e-04
Identities = 17/40 (42%), Positives = 30/40 (75%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KK KK++ K++ K K KE+ KEK+ ++K+K+K+KK ++
Sbjct: 117 KKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156
Score = 44.1 bits (104), Expect = 2e-04
Identities = 18/41 (43%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +KK+KK+K K+ K K KE+ K+++ K+K+K+K+KK
Sbjct: 113 VKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 43.7 bits (103), Expect = 2e-04
Identities = 18/41 (43%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K E K+E+K K++ K+++KKKKEK K++ K +K K+
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKE 135
Score = 43.7 bits (103), Expect = 2e-04
Identities = 17/42 (40%), Positives = 32/42 (76%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K KKKEK K+E K +K +++ ++K+ K+K+K+K+KK ++
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Score = 43.3 bits (102), Expect = 2e-04
Identities = 14/41 (34%), Positives = 31/41 (75%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ +K++ K+++KK K+K K++ K +K K++ K+K+ K+K
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEK 146
Score = 42.6 bits (100), Expect = 4e-04
Identities = 16/43 (37%), Positives = 31/43 (72%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
++ KKK++K KE+ +K K++ K+K+ K+K+K+K+KK +
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Score = 42.2 bits (99), Expect = 6e-04
Identities = 18/41 (43%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ +K++++ KE+K KKKEK KE+ K K K++ K+K+ K
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPK 144
Score = 42.2 bits (99), Expect = 6e-04
Identities = 18/50 (36%), Positives = 31/50 (62%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ +K + KE+KKK+K+ K+E K K K+E K+K+ K+K+K+
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEK 150
Score = 41.8 bits (98), Expect = 7e-04
Identities = 15/42 (35%), Positives = 31/42 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK K+K K++ + + K+E K+++ KEK+K+K+KK ++ ++
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159
Score = 41.8 bits (98), Expect = 7e-04
Identities = 15/40 (37%), Positives = 30/40 (75%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K KE++K++++ K+++KKK++K KE+ K +K K++ K
Sbjct: 99 KNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Score = 41.8 bits (98), Expect = 8e-04
Identities = 18/60 (30%), Positives = 31/60 (51%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
D K + P + +K KK E+ + ++ KK+E+ + K + +K KKK KKK+
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE 189
Score = 41.4 bits (97), Expect = 0.001
Identities = 18/59 (30%), Positives = 35/59 (59%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
D + K ++ K AK+ + + +++ K+KE+ KE+KKK+K+K K++ K +K
Sbjct: 74 DEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 41.0 bits (96), Expect = 0.001
Identities = 14/52 (26%), Positives = 25/52 (48%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
+ + K + KK K K KKKE ++EK+++ ++ K K + N
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREK 219
Score = 40.6 bits (95), Expect = 0.002
Identities = 15/76 (19%), Positives = 35/76 (46%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
E + +AK + KK +KK K+K+ +++K+++ ++ K K + + N
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKE 220
Query: 73 NNNNNNNNNNNNNNEE 88
++ + + EE
Sbjct: 221 EDDGKDRETTTSPMEE 236
Score = 39.9 bits (93), Expect = 0.003
Identities = 18/46 (39%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-----KEKKKKKKKKKKKKK 61
+K K+KEKK +E + +++EKK+E+ + K+ KKK KKK+
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEP 190
Score = 39.9 bits (93), Expect = 0.003
Identities = 15/41 (36%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + ++++KEK+ K+EKKK+K+K +++ K +K K++ K
Sbjct: 98 PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Score = 39.5 bits (92), Expect = 0.004
Identities = 13/41 (31%), Positives = 25/41 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ +K K K K ++ K + KE++K+K++ K++KK
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKK 118
Score = 38.7 bits (90), Expect = 0.008
Identities = 14/45 (31%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 18 LDWKKAKKKEKKKKEK-KAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
L +A K+ +K K A K + KE K + K+++K+K++ K+
Sbjct: 71 LSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKE 115
Score = 38.7 bits (90), Expect = 0.008
Identities = 15/45 (33%), Positives = 29/45 (64%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K+ K K K+E K K+ K+KEK+K++K ++ + ++++KK
Sbjct: 125 KEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169
Score = 37.9 bits (88), Expect = 0.013
Identities = 13/49 (26%), Positives = 31/49 (63%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E +K K++ K +K K+ K+++ ++K+KEK+KK ++ + +++
Sbjct: 114 KEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162
Score = 37.2 bits (86), Expect = 0.019
Identities = 14/70 (20%), Positives = 30/70 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K KK KKK KK+ ++EK+++ ++ K K ++ N ++ +
Sbjct: 172 KSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTT 231
Query: 81 NNNNNNEEKG 90
+ +E +
Sbjct: 232 SPMEEDESRQ 241
Score = 37.2 bits (86), Expect = 0.023
Identities = 14/68 (20%), Positives = 33/68 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K++ K+ KK+ KKKE +E+K+++ ++ K K ++ + N ++ +
Sbjct: 171 AKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETT 230
Query: 81 NNNNNNEE 88
+ +E
Sbjct: 231 TSPMEEDE 238
Score = 36.0 bits (83), Expect = 0.051
Identities = 14/70 (20%), Positives = 32/70 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
++ + +E++KK ++ + K + K+ KK+ KKK+ +++K + N
Sbjct: 155 EEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVN 214
Query: 81 NNNNNNEEKG 90
E+ G
Sbjct: 215 EEREKEEDDG 224
Score = 35.6 bits (82), Expect = 0.063
Identities = 12/71 (16%), Positives = 30/71 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K+ + + K + +K KKK K+K+ E++K+++ ++ K + N ++
Sbjct: 165 KRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDG 224
Query: 81 NNNNNNEEKGT 91
+
Sbjct: 225 KDRETTTSPME 235
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 44.8 bits (106), Expect = 6e-05
Identities = 18/42 (42%), Positives = 27/42 (64%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KKAK++ KK+ E++AK K + KKK + KKK + + K K
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183
Score = 43.7 bits (103), Expect = 2e-04
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + EKK KE+ K+ E++ + K + KKK + KKK
Sbjct: 135 KAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKA 175
Score = 43.7 bits (103), Expect = 2e-04
Identities = 15/43 (34%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKK--EKKKEKKKKEKKKKKKKKKKKKK 61
K AK+ E+ K+ + K+K E+ K K+ E K K + + +KK
Sbjct: 102 KAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKA 144
Score = 43.3 bits (102), Expect = 2e-04
Identities = 15/40 (37%), Positives = 24/40 (60%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + K K E +A+KK K++ KK+ E++ K K + KK
Sbjct: 127 KQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKK 166
Score = 42.9 bits (101), Expect = 3e-04
Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Query: 21 KKAKKK---EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++AK K E KKK +AKKK + + K K E K K K ++ K K
Sbjct: 154 EEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKA 198
Score = 41.0 bits (96), Expect = 0.001
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+A+++ K K +AKKK + +KK + + K K + K K K
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190
Score = 40.6 bits (95), Expect = 0.001
Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 22 KAKKK---EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+AKK+ E K K KK+ + KKK E + K K + K K
Sbjct: 147 EAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 40.6 bits (95), Expect = 0.001
Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
+AK K + + EKKAK++ KK+ E K K + KKK + KK
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKK 173
Score = 40.6 bits (95), Expect = 0.001
Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K K+ E KAK E + EKK KE+ KK+ +++ K K
Sbjct: 122 EEAKAKQAAEAKAKA-EAEAEKKAKEEAKKQAEEEAKAK 159
Score = 40.2 bits (94), Expect = 0.002
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KK + KKK E +AK K + K K K E+ K K + K K
Sbjct: 165 KKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204
Score = 39.0 bits (91), Expect = 0.005
Identities = 12/41 (29%), Positives = 16/41 (39%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA+ K K K E+ K E K K E K + +
Sbjct: 181 AKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAA 221
Score = 38.7 bits (90), Expect = 0.006
Identities = 11/40 (27%), Positives = 25/40 (62%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+A ++ K+ ++A K+ ++K+K+ +E K K+ + K K
Sbjct: 97 RAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136
Score = 38.7 bits (90), Expect = 0.007
Identities = 14/41 (34%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA+ + K K E KAK K ++ + K + K K + K
Sbjct: 173 KKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKA 213
Score = 37.9 bits (88), Expect = 0.010
Identities = 8/39 (20%), Positives = 24/39 (61%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+K + E+ ++++ ++ +K K+ E+ K+ ++K+K
Sbjct: 81 EKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK 119
Score = 37.5 bits (87), Expect = 0.013
Identities = 13/41 (31%), Positives = 22/41 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A + +KK E K K + + K K + + K K ++ K K +
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAE 199
Score = 37.1 bits (86), Expect = 0.018
Identities = 11/42 (26%), Positives = 18/42 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KAK +E K K + AK K + K E + + ++
Sbjct: 187 AKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERK 228
Score = 37.1 bits (86), Expect = 0.019
Identities = 14/40 (35%), Positives = 26/40 (65%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +EK+K+ ++AK K+ + K K E + +KK K++ KK
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150
Score = 36.0 bits (83), Expect = 0.048
Identities = 9/48 (18%), Positives = 28/48 (58%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A KKE+++++K ++ E+ ++++ E+ ++K+ +++
Sbjct: 53 NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAA 100
Score = 34.4 bits (79), Expect = 0.12
Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKK-EKKKKKKKKKKKKK 61
K K + + K + +AK K K +E K K E K K + K
Sbjct: 171 AKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAK 212
Score = 32.5 bits (74), Expect = 0.53
Identities = 8/50 (16%), Positives = 27/50 (54%), Gaps = 3/50 (6%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
KK +++ ++ ++++A ++ ++K ++ EK K+ ++ K+
Sbjct: 65 KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQA 114
Score = 29.4 bits (66), Expect = 5.2
Identities = 9/44 (20%), Positives = 18/44 (40%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+AK K + K K A + K E + + ++K + +
Sbjct: 193 EAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGD 236
>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A. This protein is encoded
by an open reading frame in plasmid borne DNA repeats of
Borrelia species. This protein is known as ORF-A. The
function of this putative protein is unknown.
Length = 285
Score = 44.3 bits (105), Expect = 7e-05
Identities = 18/61 (29%), Positives = 30/61 (49%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
++K+ + K K+KK ++ KK+ KK +N N + N NN NN N +E
Sbjct: 86 NYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKE 145
Query: 89 K 89
+
Sbjct: 146 E 146
Score = 43.9 bits (104), Expect = 1e-04
Identities = 24/80 (30%), Positives = 32/80 (40%), Gaps = 5/80 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK-----EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
K E K KK+ + ++KK E+ KK+ KKK+N N + N N
Sbjct: 74 GKNNGTEIYYKLNYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECIN 133
Query: 76 NNNNNNNNNNNEEKGTFLYK 95
N NN N EE K
Sbjct: 134 NKNNVINKKIKEESKKKNSK 153
Score = 40.4 bits (95), Expect = 0.002
Identities = 18/69 (26%), Positives = 27/69 (39%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK+ + K K+K+ ++ KK+ KKK K N NN NN N
Sbjct: 86 NYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKE 145
Query: 81 NNNNNNEEK 89
+ N +
Sbjct: 146 ESKKKNSKN 154
Score = 34.7 bits (80), Expect = 0.11
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 3/62 (4%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
K+KK ++ KK+ KK K K +K + NN NN N + N+ N +
Sbjct: 101 KEKKIERFKKRVNAYFKK---KSNKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEK 157
Query: 88 EK 89
K
Sbjct: 158 LK 159
Score = 32.7 bits (75), Expect = 0.37
Identities = 19/59 (32%), Positives = 25/59 (42%), Gaps = 4/59 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKE----KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
+++KKK K EK KK KK KK+ K K K K ++N N N
Sbjct: 145 EESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQK 203
Score = 30.4 bits (69), Expect = 2.0
Identities = 20/53 (37%), Positives = 24/53 (45%), Gaps = 5/53 (9%)
Query: 24 KKKEKKKKE-----KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
K EK K + +K K K+ K K +K KKKKK KK N N N
Sbjct: 233 KIYEKYKNKPHFIIEKNKYKDLNKIIGKFKKSFKKKKKNSKKNYENIKINIFN 285
Score = 30.4 bits (69), Expect = 2.2
Identities = 25/62 (40%), Positives = 28/62 (45%), Gaps = 10/62 (16%)
Query: 21 KKAKKKEKKKKEKKAKKK-----EKKKEK-----KKKEKKKKKKKKKKKKKNNNNNNNNN 70
KK +K +K +K K K EK K K K KK KKKKK KKN N N
Sbjct: 224 KKQIEKFFQKIYEKYKNKPHFIIEKNKYKDLNKIIGKFKKSFKKKKKNSKKNYENIKINI 283
Query: 71 NN 72
N
Sbjct: 284 FN 285
Score = 30.0 bits (68), Expect = 2.6
Identities = 19/77 (24%), Positives = 28/77 (36%), Gaps = 8/77 (10%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE--------KKKKKKKKKKKKKNNNNNNNNNNN 72
+K ++ KK+ KKK K +K KK K++ KKKN+ N
Sbjct: 102 EKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEKLKIK 161
Query: 73 NNNNNNNNNNNNNNEEK 89
N N + K
Sbjct: 162 KYFKKCNFKTNLKKDIK 178
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 39.9 bits (94), Expect = 2e-04
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K+ EK + K+ K++ KK +KKK+K
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKRK 32
Score = 38.7 bits (91), Expect = 4e-04
Identities = 14/37 (37%), Positives = 22/37 (59%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KK K+ EK + ++ K+ KK +KKK K K+ +K
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEK 40
Score = 37.2 bits (87), Expect = 0.001
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK+++ K + K+ +K+ +K +KKK+K K
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFK 36
Score = 37.2 bits (87), Expect = 0.001
Identities = 14/37 (37%), Positives = 24/37 (64%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K+ ++ +AK+ +K+ KK KKK+K K+ +K
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEK 40
Score = 35.7 bits (83), Expect = 0.004
Identities = 14/36 (38%), Positives = 22/36 (61%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K++K+ EK K+ KK+ KK +KKK+K K+
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37
Score = 33.7 bits (78), Expect = 0.023
Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 7/50 (14%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK-------EKKKKEKKKKKKKKKKKKKNN 63
K+AKK+ KK +K K+K K E + E+ + + K++ KKK N
Sbjct: 15 KRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAKKKGN 64
Score = 27.2 bits (61), Expect = 5.0
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKK 61
K++K+ +K + K+ KK+ KK +
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAAR 27
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 44.2 bits (105), Expect = 2e-04
Identities = 17/41 (41%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ ++EK++KE ++K +K KK++EKKKK+ +K +K K
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 41.2 bits (97), Expect = 0.002
Identities = 18/44 (40%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 20 WKKAKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
WK K+E +++ EK+A K++K+ K KK+++KKKK+ +K +K
Sbjct: 545 WKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 39.6 bits (93), Expect = 0.004
Identities = 14/41 (34%), Positives = 29/41 (70%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
+ ++ K++++ KE+K +K KK+E+KKK++ +K +K K
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 37.3 bits (87), Expect = 0.022
Identities = 17/51 (33%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 10 GTFT----EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
G + L +K +K+ K++ K+ +K +K++EKKKKE +K +K K
Sbjct: 541 GPSVWKLDDKEELQREKEEKEALKEQ-KRLRKLKKQEEKKKKELEKLEKAK 590
Score = 30.4 bits (69), Expect = 3.0
Identities = 19/80 (23%), Positives = 30/80 (37%), Gaps = 35/80 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK----------------------------------- 45
K+ K+ K KK+++ KKKE +K +K
Sbjct: 564 KEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDADGEEIS 623
Query: 46 KKEKKKKKKKKKKKKKNNNN 65
KKE+KK K+ K+ K +
Sbjct: 624 KKERKKLSKEYDKQAKLHEE 643
Score = 29.6 bits (67), Expect = 5.6
Identities = 10/34 (29%), Positives = 24/34 (70%), Gaps = 1/34 (2%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K +K+ ++EK+ EK+ +++K+ +K KK+++
Sbjct: 546 KLDDKEELQREKE-EKEALKEQKRLRKLKKQEEK 578
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 44.0 bits (105), Expect = 2e-04
Identities = 9/42 (21%), Positives = 15/42 (35%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K + K+ KK K +K K++ +K K
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 41.3 bits (98), Expect = 0.001
Identities = 8/41 (19%), Positives = 16/41 (39%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+K + + A KK K +K +++ +K K
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 41.3 bits (98), Expect = 0.001
Identities = 9/38 (23%), Positives = 16/38 (42%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K + + AK KK K +K K++ +K
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 40.9 bits (97), Expect = 0.002
Identities = 9/44 (20%), Positives = 16/44 (36%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
L K K + K +K K ++ K++ +K K
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 39.8 bits (94), Expect = 0.004
Identities = 8/45 (17%), Positives = 18/45 (40%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
L+ K ++ + K K+ K +K K++ +K+
Sbjct: 811 LELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 34.0 bits (79), Expect = 0.23
Identities = 8/41 (19%), Positives = 16/41 (39%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
P L + + +K A K +K +++ +K K
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 43.8 bits (104), Expect = 2e-04
Identities = 19/81 (23%), Positives = 30/81 (37%)
Query: 1 MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ + I D KKK + AK K KKK K + KK +K
Sbjct: 49 PEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEK 108
Query: 61 KNNNNNNNNNNNNNNNNNNNN 81
KN + +++ N + + N
Sbjct: 109 KNALDKDDDLNYVKDIDVLNQ 129
Score = 38.4 bits (90), Expect = 0.008
Identities = 15/69 (21%), Positives = 25/69 (36%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
E +KK K K K KKK K + KK N + +++ N +
Sbjct: 65 DTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDI 124
Query: 81 NNNNNNEEK 89
+ N ++
Sbjct: 125 DVLNQADDD 133
Score = 36.5 bits (85), Expect = 0.034
Identities = 15/55 (27%), Positives = 24/55 (43%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
T+ ++AKKK KK K K KE+ K+ + KKK ++ +
Sbjct: 4 ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF 58
Score = 36.5 bits (85), Expect = 0.038
Identities = 14/49 (28%), Positives = 24/49 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
A+++ KKK +K A K + K K+E K+ + KKK + +
Sbjct: 10 LAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF 58
Score = 36.1 bits (84), Expect = 0.042
Identities = 17/94 (18%), Positives = 26/94 (27%)
Query: 1 MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
T QI K + KK K K K KKK K +
Sbjct: 44 SKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSS 103
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLY 94
K N + +++ N + + N +
Sbjct: 104 KKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDD 137
Score = 35.7 bits (83), Expect = 0.056
Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 22 KAKKKEKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
KA+ +++ +KK KK K K K K++ K+ + KKK +
Sbjct: 7 KAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF 58
Score = 35.4 bits (82), Expect = 0.081
Identities = 16/50 (32%), Positives = 23/50 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
+ A ++E KKK KK K K K KE+ K+ + KKK +
Sbjct: 9 ELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF 58
Score = 34.2 bits (79), Expect = 0.21
Identities = 14/41 (34%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
A K + E++AKKK KK K K K K++ K+
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEAL 42
Score = 31.9 bits (73), Expect = 1.1
Identities = 12/38 (31%), Positives = 17/38 (44%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K E A+++ KKK KK K K K K++
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIK 39
Score = 30.7 bits (70), Expect = 2.3
Identities = 9/60 (15%), Positives = 16/60 (26%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
TK +I E ++ + + + K+K K K K
Sbjct: 34 TKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAK 93
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 44.4 bits (104), Expect = 2e-04
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ K E K E +A E + ++K E +KKK++ KKK
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 44.0 bits (103), Expect = 2e-04
Identities = 18/45 (40%), Positives = 28/45 (62%), Gaps = 4/45 (8%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK----KKKKKKKKK 61
KA+ + + + A++K + EKKK+E KKK KKK ++KKK
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 43.2 bits (101), Expect = 3e-04
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK E+ KK +A K E + + E ++K + +KKK
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 42.8 bits (100), Expect = 5e-04
Identities = 11/40 (27%), Positives = 22/40 (55%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K +E KK + AK + + + + ++K + +KKK+
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374
Score = 42.4 bits (99), Expect = 6e-04
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 2/41 (4%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKE--KKKKEKKKKKKKKKKKK 60
+A K E + +A+ E+K E +KKKE+ KKK KKK
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 41.7 bits (97), Expect = 0.001
Identities = 12/44 (27%), Positives = 22/44 (50%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKE---KKKKEKKKKKKKKKKKKK 61
KKA++ +KK K K +E KK K + + + + ++K
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365
Score = 41.7 bits (97), Expect = 0.001
Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK---KKK 61
KKA++ +K + KA+ + E + E+K + +KKK KKK
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 40.5 bits (94), Expect = 0.003
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K +E KKK AKKK ++ +K + K + + + +
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360
Score = 40.1 bits (93), Expect = 0.004
Identities = 10/42 (23%), Positives = 19/42 (45%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ KK KKK E+ K E K + + + + ++K +
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 40.1 bits (93), Expect = 0.004
Identities = 9/40 (22%), Positives = 20/40 (50%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K KKK ++AKK + + + + + + ++K +
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 39.7 bits (92), Expect = 0.004
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+AKKK ++ K+K K+K +E KK + K + + +
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Score = 39.4 bits (91), Expect = 0.007
Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 4/43 (9%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKE----KKKKEKKKKKKKKKKK 59
+A+ E+K + + KK+E KK+ KKK E+KKK + KKK
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Score = 39.0 bits (90), Expect = 0.007
Identities = 18/41 (43%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+A +K+K++ +KKA +KK E+KKK + KKK ++ KKK
Sbjct: 1366 AEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406
Score = 39.0 bits (90), Expect = 0.009
Identities = 21/43 (48%), Positives = 28/43 (65%), Gaps = 3/43 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKK 60
+KA+ EKKK+E K K KK EKKK ++ KKK ++ KKK
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406
Score = 38.2 bits (88), Expect = 0.014
Identities = 16/36 (44%), Positives = 23/36 (63%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
KK E+ KK +AKKK ++ +KK KKK ++ KK
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 37.0 bits (85), Expect = 0.029
Identities = 17/35 (48%), Positives = 23/35 (65%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KK E+ KK +AKKK ++ +KK E KK + KKK
Sbjct: 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Score = 37.0 bits (85), Expect = 0.031
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 4/42 (9%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK----KKK 61
KK E+KKK +AKKK ++ +KK E KK KKK KKK
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426
Score = 37.0 bits (85), Expect = 0.033
Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKKE---KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K K++ KKK + KKA++K+K E KKK ++ KKK + KK
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413
Score = 36.7 bits (84), Expect = 0.035
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKKKK 60
KKA++ +KK E K KKK E KKK ++KKK + KKK
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439
Score = 36.7 bits (84), Expect = 0.037
Identities = 19/40 (47%), Positives = 29/40 (72%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKA++K+K + KKA++K+K E KKK ++ KK + KKK
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
Score = 36.7 bits (84), Expect = 0.039
Identities = 23/51 (45%), Positives = 28/51 (54%), Gaps = 7/51 (13%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKE---KKKKEKKKK----KKKKKKKKKNNN 64
KKA +KK E K K +EKKK KKK E+ KK KKK ++ KK
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEE 1461
Score = 36.7 bits (84), Expect = 0.039
Identities = 21/44 (47%), Positives = 29/44 (65%), Gaps = 4/44 (9%)
Query: 22 KAKKKEKKKK----EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+AKK E+KKK +KKA++ +K E KKK ++ KKK KKK
Sbjct: 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337
Score = 36.7 bits (84), Expect = 0.041
Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
Query: 22 KAKKKEKKKKE---KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K +E KK E KKA++ +K E KKK ++ KK + KKK
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
Score = 36.3 bits (83), Expect = 0.049
Identities = 20/42 (47%), Positives = 30/42 (71%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKKKKE-KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA++ +KK +E KKA + +KK E+ KK + KKK ++ KKK
Sbjct: 1457 KKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498
Score = 36.3 bits (83), Expect = 0.053
Identities = 18/41 (43%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA + +K +++KKA + +KK E+ KK + KKK ++ KKK
Sbjct: 1290 KKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKK 1330
Score = 35.9 bits (82), Expect = 0.064
Identities = 20/42 (47%), Positives = 27/42 (64%), Gaps = 3/42 (7%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKK 59
KKA + +KK +E K K E KK KKK ++ KKK ++KKK
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 35.9 bits (82), Expect = 0.069
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
++AKK E+ KK+ + KK ++ EKK E KK+ ++ KK
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Score = 35.9 bits (82), Expect = 0.071
Identities = 17/45 (37%), Positives = 24/45 (53%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
E D K K +E KK ++ KK E+ K+K KKK ++ KK
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 35.5 bits (81), Expect = 0.081
Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKKKKK 61
KKA + +KK +E K + KKK E KKK KKK ++ KK
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 35.5 bits (81), Expect = 0.096
Identities = 19/41 (46%), Positives = 29/41 (70%), Gaps = 1/41 (2%)
Query: 21 KKAKKKEKKKKE-KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKA + +KK +E KKA++ +KK E+ KK + KKK ++ KK
Sbjct: 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
Score = 35.5 bits (81), Expect = 0.096
Identities = 16/40 (40%), Positives = 28/40 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
++AKK E+ +K+ K++ +E KK E+ KKK+ ++KKK
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Score = 35.1 bits (80), Expect = 0.12
Identities = 18/42 (42%), Positives = 27/42 (64%), Gaps = 3/42 (7%)
Query: 22 KAKKKEKKKKE---KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K +E KK + KKA++ +K +E KKK ++ KK + KKK
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478
Score = 35.1 bits (80), Expect = 0.13
Identities = 20/44 (45%), Positives = 29/44 (65%), Gaps = 4/44 (9%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK----KKKKKKKKK 61
KA++ K + KKA++K+K E KK E+KKK KKK ++ KK
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK 1316
Score = 34.7 bits (79), Expect = 0.16
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K E KKK ++AKKK + +K + KKK + KK ++
Sbjct: 1481 EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521
Score = 34.7 bits (79), Expect = 0.16
Identities = 19/42 (45%), Positives = 28/42 (66%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKKKKE-KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA + +KK +E KKA + +KK E+ KK+ + KK + KKK
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Score = 34.7 bits (79), Expect = 0.17
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 3/43 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKK 60
+K K E KK E+K K E KK E KK ++ KKK ++ KKK
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKK 1330
Score = 34.7 bits (79), Expect = 0.17
Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K +E KKK ++AKK E+ ++KK + KK+ ++ KK
Sbjct: 1667 AKKAEEDKKKAEEAKKAEE--DEKKAAEALKKEAEEAKK 1703
Score = 34.7 bits (79), Expect = 0.17
Identities = 18/45 (40%), Positives = 29/45 (64%), Gaps = 4/45 (8%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEK----KKKEKKKKKKKKKKKKK 61
KKA+++ K K ++AKK E+ K+K KK E+ +KK + KK+
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE 1697
Score = 34.3 bits (78), Expect = 0.18
Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 8/48 (16%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKE----KKKKEKKKK----KKKKKKKKK 61
K K +EKKK ++ KK E+ K+ KKK E+ KK KKK ++ KK
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
Score = 34.3 bits (78), Expect = 0.18
Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKKNNNN 65
KKA++ EKK E K+ E+ K E KKKE ++KKK ++ KK N
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN 1728
Score = 34.3 bits (78), Expect = 0.19
Identities = 16/40 (40%), Positives = 24/40 (60%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K +E KKK +AKK + K+K + KK ++ KK + K
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAK 1528
Score = 34.3 bits (78), Expect = 0.20
Identities = 16/38 (42%), Positives = 24/38 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
K A+ K+K + KKA++ +K E KK E+ KK + KK
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
Score = 34.3 bits (78), Expect = 0.20
Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 4/44 (9%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK----KKKKKKKKKK 61
K K +E KK E+ KK + +K+ +E KK KKK+ ++KKK
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Score = 34.3 bits (78), Expect = 0.21
Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 4/44 (9%)
Query: 21 KKAKKKEKKKKEKKAKK----KEKKKEKKKKEKKKKKKKKKKKK 60
+ K E KKK ++AKK K+K +E KKK + KK + KKK
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Score = 34.3 bits (78), Expect = 0.21
Identities = 15/41 (36%), Positives = 28/41 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ KK E++ K K A++ +K +E KKK ++ KK ++ +KK
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690
Score = 34.3 bits (78), Expect = 0.22
Identities = 13/40 (32%), Positives = 26/40 (65%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K E+ KK ++ KKK ++ +K ++++KK + KK+ +
Sbjct: 1660 KIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699
Score = 34.3 bits (78), Expect = 0.22
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KKA++ +K + KKA++ +K E KK E+KKK + KK
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553
Score = 34.3 bits (78), Expect = 0.23
Identities = 15/39 (38%), Positives = 23/39 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
KKA + +K + KK + KK E+ KK + KK ++ KK
Sbjct: 1497 KKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535
Score = 34.0 bits (77), Expect = 0.24
Identities = 17/41 (41%), Positives = 25/41 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA + +KK +E K K E KK + K+K + KK ++ KK
Sbjct: 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523
Score = 34.0 bits (77), Expect = 0.24
Identities = 18/38 (47%), Positives = 28/38 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KKA++ +K + KKA++K+K E KK E+ KK ++KKK
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565
Score = 34.0 bits (77), Expect = 0.24
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 2/40 (5%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKK 58
KKA++ +KK E K + KKK E KK E+ KK + KK
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529
Score = 34.0 bits (77), Expect = 0.26
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 9/52 (17%)
Query: 22 KAKKKEKKKKEKKAKKKEKKK---------EKKKKEKKKKKKKKKKKKKNNN 64
+AKK E+ KK +AKK E+KK E KK E+KKK ++ KK +++ N
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKN 1577
Score = 34.0 bits (77), Expect = 0.27
Identities = 16/39 (41%), Positives = 29/39 (74%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+KA + +K +++KKA + +K +EKKK ++ KKK ++ KK
Sbjct: 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK 1316
Score = 34.0 bits (77), Expect = 0.27
Identities = 20/43 (46%), Positives = 22/43 (51%), Gaps = 3/43 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKK 60
KK KKK E+K K E KK E KKK + KK KKK
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 33.6 bits (76), Expect = 0.35
Identities = 17/38 (44%), Positives = 25/38 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KK + +EKKK E+ K +E+ K K +E KK ++ KKK
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK 1676
Score = 33.2 bits (75), Expect = 0.43
Identities = 16/43 (37%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKE--KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +++KKK E KKA++ EKK + K++ ++ KK ++ KK
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709
Score = 33.2 bits (75), Expect = 0.43
Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKE--KKKEKKKKEKKKKKKKKKKKKK 61
KK + KKK E+ KK + KK KK+ + KKK ++KKK
Sbjct: 1390 KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 32.8 bits (74), Expect = 0.54
Identities = 12/41 (29%), Positives = 27/41 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A++ +K +++KK ++ KK E+ +K+ + KK+ ++ K
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702
Score = 32.8 bits (74), Expect = 0.58
Identities = 16/38 (42%), Positives = 26/38 (68%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K K+ E+KKK ++ KK E++ + K E+ KK ++ KKK
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK 1676
Score = 32.4 bits (73), Expect = 0.84
Identities = 16/38 (42%), Positives = 25/38 (65%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +EKKK ++ K +EKKK + K+K ++ KK + KK
Sbjct: 1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK 1322
Score = 32.4 bits (73), Expect = 0.85
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ KK E+ KK ++ K + +E KK E+ KKK ++ KK +
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684
Score = 32.4 bits (73), Expect = 0.88
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 2/42 (4%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKK 58
+ KK + KK E K K E KK E KK ++ KK ++ KK
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535
Score = 32.0 bits (72), Expect = 1.1
Identities = 16/38 (42%), Positives = 27/38 (71%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K K+ E+KKK ++ KK E++ + K +E KK+ ++ KKK
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745
Score = 32.0 bits (72), Expect = 1.2
Identities = 16/38 (42%), Positives = 26/38 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KK + +EKKK E+ K +E+ K K ++ KK+ ++ KKK
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745
Score = 32.0 bits (72), Expect = 1.2
Identities = 19/43 (44%), Positives = 31/43 (72%), Gaps = 4/43 (9%)
Query: 21 KKAKKKEKKKKEKKAK----KKEKKKEKKKKEKKKKKKKKKKK 59
KKA++ +K ++E K K KKE +++KKK E+ KK +++KKK
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758
Score = 31.6 bits (71), Expect = 1.4
Identities = 18/41 (43%), Positives = 30/41 (73%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K K +E KK E++ KK E+ K+K+ +EKKK ++ KK +++N
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEN 1659
Score = 31.6 bits (71), Expect = 1.4
Identities = 19/72 (26%), Positives = 34/72 (47%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
L ++ KK E+ +KEK+A +E+ E+ +K + + KK K N N N
Sbjct: 1762 LKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVI 1821
Query: 78 NNNNNNNNNEEK 89
N++ ++ K
Sbjct: 1822 NDSKEMEDSAIK 1833
Score = 30.9 bits (69), Expect = 2.1
Identities = 19/46 (41%), Positives = 29/46 (63%), Gaps = 4/46 (8%)
Query: 19 DWKKAKKKEKKKKEKKAK----KKEKKKEKKKKEKKKKKKKKKKKK 60
D KKA + KK+ E+ K KK++ +EKKK E+ KK +++ K K
Sbjct: 1686 DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK 1731
Score = 30.9 bits (69), Expect = 2.2
Identities = 16/39 (41%), Positives = 28/39 (71%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K E+ KK ++ KKK ++ +KK+ E+KKK ++ KK ++ N
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEN 1659
Score = 30.9 bits (69), Expect = 2.7
Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 4/44 (9%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKE----KKKKEKKKKKKKKKKKK 60
KKA++ +KK+ E+K K +E KK K K E+ KK+ ++ KKK
Sbjct: 1702 KKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745
Score = 30.9 bits (69), Expect = 2.7
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K +E KK+ ++ KKK ++ +K ++EKKK KK+++K
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
Score = 30.5 bits (68), Expect = 2.8
Identities = 13/37 (35%), Positives = 25/37 (67%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KK+ ++KK+ + KK +++ K K ++ KK ++ KKK
Sbjct: 1640 KKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK 1676
Score = 30.5 bits (68), Expect = 3.0
Identities = 20/44 (45%), Positives = 32/44 (72%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKKEK---KAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K +E KK E+ KA++ +K +E+KKK ++ KKK+ ++KKK
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648
Score = 30.5 bits (68), Expect = 3.1
Identities = 16/41 (39%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ KKK ++ K+K+A++K+K +E KK E++ K K ++ KK
Sbjct: 1629 EEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669
Score = 30.5 bits (68), Expect = 3.3
Identities = 16/40 (40%), Positives = 29/40 (72%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + +E KKK ++AKK E++K+K KK+++KK ++ +K
Sbjct: 1736 KKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRK 1775
Score = 30.5 bits (68), Expect = 3.3
Identities = 16/41 (39%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ KK E++KK+ + KK++ +EKKK E+ KK +++ K K
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663
Score = 30.5 bits (68), Expect = 3.4
Identities = 24/123 (19%), Positives = 46/123 (37%), Gaps = 7/123 (5%)
Query: 429 RDIEEHFADKGKTFVTNIRRAEKKRHAARVEGQEEVKGEDNFGMEGEDGKYISRVVGTLY 488
R E + + R+AE + A V+ EE K + + E+ + +
Sbjct: 1200 RKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEE 1259
Query: 489 MYFVMPETEGRTLRDIEEHFADKGKTFVTNIRRAEKKRHAAKVEGQEEVKGEDNFGMEGE 548
++ E AD+ +++AE+K+ A + + EE K D + E
Sbjct: 1260 ARMAHFARRQAAIKAEEARKADE-------LKKAEEKKKADEAKKAEEKKKADEAKKKAE 1312
Query: 549 DGK 551
+ K
Sbjct: 1313 EAK 1315
Score = 30.5 bits (68), Expect = 3.5
Identities = 17/38 (44%), Positives = 26/38 (68%), Gaps = 2/38 (5%)
Query: 24 KKKEKKKKEKKAKKKE--KKKEKKKKEKKKKKKKKKKK 59
KK ++ KK ++AKK + KK E+ KK + KK ++KKK
Sbjct: 1510 KKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547
Score = 30.1 bits (67), Expect = 3.7
Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 8/79 (10%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEK--------KKKKKKKKKKKKNNNNNNNNNNNN 73
K ++EKKK K++EKK E+ +KEK ++ +K++ + + +N N
Sbjct: 1750 KKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFAN 1809
Query: 74 NNNNNNNNNNNNNEEKGTF 92
N N+ K
Sbjct: 1810 IIEGGKEGNLVINDSKEME 1828
Score = 30.1 bits (67), Expect = 4.1
Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 8 QIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK----KKK 61
+I F E + + + K ++ +KA + +K +EKKK ++ KK ++KKK KKK
Sbjct: 1253 EIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310
Score = 30.1 bits (67), Expect = 4.4
Identities = 15/39 (38%), Positives = 29/39 (74%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KA++ +K+ +E K K +E KK++++K+K KK+++KK
Sbjct: 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKK 1769
Score = 29.3 bits (65), Expect = 6.2
Identities = 16/41 (39%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K K E+ KK ++AK K ++ +K ++EKKK ++ KKK+ +
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644
Score = 29.3 bits (65), Expect = 7.2
Identities = 17/43 (39%), Positives = 30/43 (69%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKKKKK 61
K+A++ +K ++ KK + +EKKK E KK E++ K K ++ KK+
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738
Score = 29.3 bits (65), Expect = 7.5
Identities = 20/44 (45%), Positives = 30/44 (68%), Gaps = 4/44 (9%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEK----KKKEKKKKKKKKKKKKK 61
K ++EKK K ++AKK E+ K K KK E++KKK ++ KKK+
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKE 1642
Score = 29.0 bits (64), Expect = 8.5
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KKA + +K ++ KKA++K+K +E KK E+ K +K
Sbjct: 1546 KKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582
Score = 29.0 bits (64), Expect = 9.2
Identities = 16/42 (38%), Positives = 28/42 (66%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K K +E KK E++ K K ++ KK +E KKK ++ KK +++
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686
Score = 29.0 bits (64), Expect = 9.6
Identities = 16/48 (33%), Positives = 32/48 (66%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+ E + ++AKK E+ K + + KK ++++KK ++ KKK+ ++KKK
Sbjct: 1601 YEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 40.0 bits (94), Expect = 2e-04
Identities = 5/33 (15%), Positives = 24/33 (72%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLY 94
+ +N++++++N++N +++++++ + GT +
Sbjct: 29 SGSNSSSSSSNSSNGGSSSSSSSGDSSSGTRIT 61
Score = 39.2 bits (92), Expect = 4e-04
Identities = 3/30 (10%), Positives = 22/30 (73%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
++ +N++++++N++N +++++++ +
Sbjct: 28 SSGSNSSSSSSNSSNGGSSSSSSSGDSSSG 57
Score = 37.7 bits (88), Expect = 0.001
Identities = 3/25 (12%), Positives = 22/25 (88%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
+++ +N++++++N++N +++++++
Sbjct: 27 SSSGSNSSSSSSNSSNGGSSSSSSS 51
Score = 36.9 bits (86), Expect = 0.003
Identities = 3/25 (12%), Positives = 21/25 (84%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
+++ +N++++++N++N ++++++
Sbjct: 26 VSSSGSNSSSSSSNSSNGGSSSSSS 50
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 42.4 bits (100), Expect = 3e-04
Identities = 13/65 (20%), Positives = 27/65 (41%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
+ P+ K + K K K + K K +K +E+ K+E K + + +N ++
Sbjct: 86 KEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARPTSSTA 145
Query: 74 NNNNN 78
+
Sbjct: 146 TAAAS 150
Score = 42.0 bits (99), Expect = 3e-04
Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 3/72 (4%)
Query: 14 EPTPLDWKKAK---KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
EP P K+A K + K + K K K K +K +++ K++ K + + + N
Sbjct: 79 EPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPA 138
Query: 71 NNNNNNNNNNNN 82
++ +
Sbjct: 139 RPTSSTATAAAS 150
Score = 40.4 bits (95), Expect = 0.001
Identities = 16/74 (21%), Positives = 29/74 (39%), Gaps = 1/74 (1%)
Query: 14 EPTPL-DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
EP P+ + K K E K K K K K K K+ +++ K++ K + + N
Sbjct: 77 EPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTA 136
Query: 73 NNNNNNNNNNNNNN 86
++ +
Sbjct: 137 PARPTSSTATAAAS 150
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 41.4 bits (97), Expect = 3e-04
Identities = 16/66 (24%), Positives = 33/66 (50%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
+E + E + + E ++E+++ E+ K +KK N+ N+ ++N N + N N
Sbjct: 63 EEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKN 122
Query: 86 NEEKGT 91
+ K T
Sbjct: 123 EKSKKT 128
Score = 39.5 bits (92), Expect = 0.001
Identities = 12/65 (18%), Positives = 34/65 (52%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
++ E + E++ + E+++E+ +++ K +KK + N+ ++N N + N N
Sbjct: 63 EEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKN 122
Query: 84 NNNEE 88
+++
Sbjct: 123 EKSKK 127
Score = 36.8 bits (85), Expect = 0.009
Identities = 14/73 (19%), Positives = 36/73 (49%), Gaps = 9/73 (12%)
Query: 25 KKEKKKKEKKAKKKEKKK------EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
++E+++ E++ ++ E + E +++E++ ++ K N N+ N+ ++N
Sbjct: 52 EEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNA 111
Query: 79 NN---NNNNNNEE 88
N N NE+
Sbjct: 112 QNLISKNYKKNEK 124
>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
Provisional.
Length = 522
Score = 43.2 bits (102), Expect = 3e-04
Identities = 16/47 (34%), Positives = 23/47 (48%)
Query: 53 KKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAP 99
++ +NN ++N NNNNNNNN N+NN N K+
Sbjct: 1 NVNLREHLSVHNNADDNYNNNNNNNNQINSNNPNNNGNNQASKLPRG 47
Score = 42.0 bits (99), Expect = 7e-04
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
++ +NN ++N NNNNNNNN N+NN NN
Sbjct: 1 NVNLREHLSVHNNADDNYNNNNNNNNQINSNNPNNN 36
Score = 41.6 bits (98), Expect = 0.001
Identities = 17/38 (44%), Positives = 22/38 (57%)
Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
++ NN ++N NNNNNNNN N+NN NNN
Sbjct: 1 NVNLREHLSVHNNADDNYNNNNNNNNQINSNNPNNNGN 38
Score = 40.9 bits (96), Expect = 0.002
Identities = 19/34 (55%), Positives = 21/34 (61%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
+N NNNNNNNN N+NN NNN NN K K
Sbjct: 16 DNYNNNNNNNNQINSNNPNNNGNNQASKLPRGKK 49
Score = 34.3 bits (79), Expect = 0.15
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
NNNNN N+NN NNN NN + +K
Sbjct: 22 NNNNNQINSNNPNNNGNNQASKLPRGKKKQ 51
Score = 32.8 bits (75), Expect = 0.48
Identities = 13/29 (44%), Positives = 17/29 (58%)
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
NNNN N+NN NNN NN + ++K
Sbjct: 22 NNNNNQINSNNPNNNGNNQASKLPRGKKK 50
Score = 29.3 bits (66), Expect = 6.3
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
NNN N+NN NNN NN + +++
Sbjct: 22 NNNNNQINSNNPNNNGNNQASKLPRGKKKQE 52
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 42.9 bits (102), Expect = 3e-04
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
KKE K K KKKEK++ K K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 42.5 bits (101), Expect = 4e-04
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KKE KA K +KK++K++ K K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 39.4 bits (93), Expect = 0.004
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ K++ K K + K K +KKKEK++ K K K K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 37.1 bits (87), Expect = 0.016
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KK+ K K +KKKEK++ + K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 37.1 bits (87), Expect = 0.016
Identities = 12/28 (42%), Positives = 15/28 (53%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKK 46
KA K +KKKEK+ K K K K +
Sbjct: 64 KELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 37.1 bits (87), Expect = 0.019
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K K +KKK++++ K K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 36.3 bits (85), Expect = 0.036
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K+ KE+ KKE K K +KKK+K++ K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAK 87
Score = 36.0 bits (84), Expect = 0.047
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
K + K K +KK +K+E K K K + +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 34.8 bits (81), Expect = 0.092
Identities = 13/38 (34%), Positives = 22/38 (57%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KE K++ K A +K+ + K +KKK+K++ K K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAK 87
>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain. This domain is
found in Plasmodium Duffy binding proteins. Plasmodium
vivax and Plasmodium knowlesi merozoites invade human
erythrocytes that express Duffy blood group surface
determinants. The Duffy receptor family is localised in
micronemes, an organelle found in all organisms of the
phylum Apicomplexa.
Length = 276
Score = 42.3 bits (100), Expect = 4e-04
Identities = 25/70 (35%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK------------EKKKKEKKKKKKKKKKKKKNNNNNNN 68
+K KKK K+KK K K+ K + KKKE K+KKK KK KK NN N
Sbjct: 206 EKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKKRNNKKNY 265
Query: 69 NNNNNNNNNN 78
++ +
Sbjct: 266 SDIKDKYAKE 275
Score = 41.6 bits (98), Expect = 7e-04
Identities = 22/77 (28%), Positives = 30/77 (38%), Gaps = 18/77 (23%)
Query: 26 KEKKKKEKKAKKK----EKKKEKKKKEKK------------KKKKK--KKKKKKNNNNNN 67
+E+KKK +K KKK + K KK K KKK K+KKK
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258
Query: 68 NNNNNNNNNNNNNNNNN 84
NN N ++ +
Sbjct: 259 RNNKKNYSDIKDKYAKE 275
Score = 37.3 bits (87), Expect = 0.013
Identities = 19/77 (24%), Positives = 31/77 (40%), Gaps = 12/77 (15%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKK------------KKKKKKKKKNNNNNNNNNN 71
++++KK ++ K K KEKK K +K K KKK+ K
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258
Query: 72 NNNNNNNNNNNNNNNEE 88
NN N ++ + +E
Sbjct: 259 RNNKKNYSDIKDKYAKE 275
Score = 31.2 bits (71), Expect = 1.1
Identities = 16/74 (21%), Positives = 26/74 (35%), Gaps = 12/74 (16%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK--KNN----------NNNNNNNNNNNN 75
K KK ++ KK KK K+K K KK ++ + N NN
Sbjct: 106 KDGNTKKLEENLKKIFKKIKDKLKPNYKKLREDWWEANREDIWKAMLCGLKNNAKYFKKY 165
Query: 76 NNNNNNNNNNNEEK 89
+ N+ + +
Sbjct: 166 SYKCRCNSCTSLPE 179
Score = 28.5 bits (64), Expect = 9.7
Identities = 17/58 (29%), Positives = 21/58 (36%), Gaps = 13/58 (22%)
Query: 50 KKKKKKKKKKKKNNNNNNNNNNN-------------NNNNNNNNNNNNNNEEKGTFLY 94
KKKKKK KK N N N N N N NN + ++ F+
Sbjct: 2 KKKKKKNGKKWNCNKNKKKKNKGVCVPPRRQQLCLYNLENLNENNIKDEHDLLEAFIK 59
Score = 28.5 bits (64), Expect = 9.9
Identities = 16/71 (22%), Positives = 23/71 (32%), Gaps = 13/71 (18%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK-------------KKKKKNNNNNNNNN 70
K KK E+ KK KK + K K KK ++ KNN
Sbjct: 106 KDGNTKKLEENLKKIFKKIKDKLKPNYKKLREDWWEANREDIWKAMLCGLKNNAKYFKKY 165
Query: 71 NNNNNNNNNNN 81
+ N+ +
Sbjct: 166 SYKCRCNSCTS 176
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 41.1 bits (97), Expect = 4e-04
Identities = 21/90 (23%), Positives = 40/90 (44%), Gaps = 3/90 (3%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
KKK+ K K K KKK +++ + + +++K+KKK K + + +N N +
Sbjct: 1 KKKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHKGLKSGSRHNEGNTQSKGKGQAQ 60
Query: 84 NNNEEKGTFLYKMAAPLVLVLTYVAEITQP 113
+ G+ K PL + + +P
Sbjct: 61 KKDPRIGS---KKPIPLGVEEKVKPKKKKP 87
Score = 39.9 bits (94), Expect = 0.001
Identities = 16/59 (27%), Positives = 29/59 (49%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
KK K K+ K KK+ ++E + +++K+KKK K K+ + +N N +
Sbjct: 1 KKKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHKGLKSGSRHNEGNTQSKGKGQA 59
Score = 30.7 bits (70), Expect = 1.3
Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 25/65 (38%)
Query: 22 KAKKKEKKKKEK------------KAKKKEKKKEKK------KK-------EKKKKKKKK 56
+A+++++KKK K K + + +KK KK EK K KKKK
Sbjct: 27 EARERKRKKKHKGLKSGSRHNEGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKK 86
Query: 57 KKKKK 61
K KK
Sbjct: 87 PKSKK 91
Score = 29.1 bits (66), Expect = 3.6
Identities = 21/62 (33%), Positives = 26/62 (41%), Gaps = 25/62 (40%)
Query: 22 KAKKKEK--------------KKKEKKAKKKE-----KKK------EKKKKEKKKKKKKK 56
K KKK K K + +A+KK+ KK EK K +KKK K KK
Sbjct: 32 KRKKKHKGLKSGSRHNEGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKKPKSKK 91
Query: 57 KK 58
K
Sbjct: 92 PK 93
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 41.9 bits (99), Expect = 5e-04
Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 8/60 (13%)
Query: 12 FTEPTPLDWKKAKKKEKKK--------KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
E PLD + K+KK+ E +A+KK K KE+ KK+K K+ K K KK
Sbjct: 217 IPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVV 276
Score = 39.2 bits (92), Expect = 0.004
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+ ++K+K +++ KKK+ K+ K K KK K KK
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 38.9 bits (91), Expect = 0.004
Identities = 16/46 (34%), Positives = 26/46 (56%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
T D + A+ + +KK++ K + K+KK ++ K K KK K KK
Sbjct: 237 TEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 38.5 bits (90), Expect = 0.006
Identities = 14/41 (34%), Positives = 21/41 (51%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+A + +KK K ++ +KKK K+ K K KK K K
Sbjct: 242 EAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 34.2 bits (79), Expect = 0.12
Identities = 14/44 (31%), Positives = 25/44 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K K K+ K+KK + +E + + + +KK+K K++ KKK
Sbjct: 221 KPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 39.2 bits (92), Expect = 5e-04
Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 11/52 (21%)
Query: 25 KKEKKKKE-----------KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
++E K KE ++ ++KE K++ + KKKKK+K KK +
Sbjct: 11 EEEVKLKERIWDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62
Score = 36.9 bits (86), Expect = 0.003
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+E+++KE K K E KKK+K+K KKK+ +
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 36.1 bits (84), Expect = 0.005
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
W + K +++E+K K++ + +KKKK+K KKK+
Sbjct: 21 WDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRD 61
Score = 33.8 bits (78), Expect = 0.042
Identities = 10/35 (28%), Positives = 22/35 (62%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+++++K+ K+ + KKKK++K KKK+ +
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 33.0 bits (76), Expect = 0.080
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+++E+K+ ++KA + +KKKK K KKK+ +
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 31.5 bits (72), Expect = 0.24
Identities = 11/35 (31%), Positives = 22/35 (62%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
++ ++KE K+K + KKK+K+K +KK+ +
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 27.2 bits (61), Expect = 6.5
Identities = 9/52 (17%), Positives = 22/52 (42%), Gaps = 10/52 (19%)
Query: 41 KKEKKKKEK----------KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
++E K KE+ +++++K+ K+K + NN+ +
Sbjct: 11 EEEVKLKERIWDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 39.3 bits (92), Expect = 5e-04
Identities = 17/53 (32%), Positives = 22/53 (41%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
K K KE K K+ A + + EK K KK KK KK N ++
Sbjct: 25 GKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNTCKDS 77
Score = 38.2 bits (89), Expect = 0.001
Identities = 15/46 (32%), Positives = 20/46 (43%), Gaps = 1/46 (2%)
Query: 19 DWKKAKKKEK-KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
W K + K K KE KAKK + + +K K KK K +
Sbjct: 17 KWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62
Score = 32.0 bits (73), Expect = 0.18
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K K + + KEK K +K+ + K K K+ K KK
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAP 40
Score = 30.9 bits (70), Expect = 0.41
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K K + + K+K K +KK+ + K K K+ K KK
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKN 37
Score = 30.5 bits (69), Expect = 0.65
Identities = 14/39 (35%), Positives = 18/39 (46%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK + K + K+ K KKE K K K+ K KK
Sbjct: 3 KKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPV 41
Score = 29.7 bits (67), Expect = 0.93
Identities = 15/36 (41%), Positives = 20/36 (55%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK K K K++ K KK++ K K K+ K KKN
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKN 37
Score = 28.9 bits (65), Expect = 1.9
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K K + ++K K +KK++ K + K+ K KK
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAP 40
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 42.5 bits (100), Expect = 5e-04
Identities = 13/41 (31%), Positives = 18/41 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA + KKK +AK K + K +K K +K
Sbjct: 209 KKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEK 249
Score = 42.1 bits (99), Expect = 5e-04
Identities = 17/43 (39%), Positives = 21/43 (48%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK--KEKKKKKKKKKKKKK 61
KA + KKK E +AKKK + KKK E K K + K
Sbjct: 193 AKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAK 235
Score = 41.7 bits (98), Expect = 7e-04
Identities = 14/41 (34%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA+ + KKK +AKKK + K K + K +K
Sbjct: 201 KKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKA 241
Score = 39.4 bits (92), Expect = 0.004
Identities = 12/39 (30%), Positives = 19/39 (48%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
AKKK + + +KKA + KKK + + K + K
Sbjct: 199 AKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAA 237
Score = 38.2 bits (89), Expect = 0.010
Identities = 18/41 (43%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA + KKK E +A K + KKK E + KKK + KK
Sbjct: 177 KKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKK 217
Score = 37.5 bits (87), Expect = 0.015
Identities = 11/39 (28%), Positives = 15/39 (38%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
KKA + K K A + + EK K +K K
Sbjct: 217 KKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
Score = 35.2 bits (81), Expect = 0.075
Identities = 13/43 (30%), Positives = 17/43 (39%), Gaps = 3/43 (6%)
Query: 21 KKAKKK---EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+AKKK E K KA + K +K K +K K
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
Score = 35.2 bits (81), Expect = 0.094
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 2/42 (4%)
Query: 21 KKAKKKEKKKKEKKAKKK--EKKKEKKKKEKKKKKKKKKKKK 60
KKA+ + KK +AKKK + K E KKK + + KKK
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKK 210
Score = 34.4 bits (79), Expect = 0.16
Identities = 17/41 (41%), Positives = 22/41 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA + KKK E +A KK + KKK E + K + KK
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKK 201
Score = 32.9 bits (75), Expect = 0.40
Identities = 13/41 (31%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + E K+ AKK + +KK + + KK + KKK
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKK 186
Score = 32.5 bits (74), Expect = 0.52
Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 5/46 (10%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKK-----KKKKKKKKKKK 61
K+A K+K+ +E AK K K + E K KK + KKK
Sbjct: 125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKK 170
Score = 32.5 bits (74), Expect = 0.55
Identities = 11/40 (27%), Positives = 22/40 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
++ +E+KK+ ++A K+ K+K+ +E K K K
Sbjct: 109 ERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAK 148
Score = 31.7 bits (72), Expect = 1.0
Identities = 14/41 (34%), Positives = 19/41 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA K K E +AK+ +K E KKK + + KK
Sbjct: 139 AKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKA 179
Score = 30.5 bits (69), Expect = 2.5
Identities = 9/38 (23%), Positives = 27/38 (71%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KK +++ ++ ++K+A ++E+ K+ +K+ +++KK+
Sbjct: 83 KKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA 120
Score = 30.2 bits (68), Expect = 3.2
Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKE---KKKKEKKKKKKKKKKKKK 61
++A K + KA+ + K+ KK + KKK + + KK
Sbjct: 135 EEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKK 178
Score = 29.8 bits (67), Expect = 3.8
Identities = 12/41 (29%), Positives = 18/41 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KK +AKKK + + KK + KKK + +
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAA 193
Score = 29.8 bits (67), Expect = 4.2
Identities = 10/41 (24%), Positives = 27/41 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+A ++E+ K+ +K + ++++K+ +E K+ K+K+ +
Sbjct: 95 KQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAE 135
Score = 29.4 bits (66), Expect = 4.9
Identities = 11/41 (26%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ KKKE+++ E+ +K+ ++E+ K+ +K++ +++KK+
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119
Score = 29.4 bits (66), Expect = 5.7
Identities = 8/37 (21%), Positives = 22/37 (59%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ +K++ ++K++ +E K+ K+K+ ++ K
Sbjct: 104 KQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAK 140
Score = 29.0 bits (65), Expect = 7.2
Identities = 11/39 (28%), Positives = 30/39 (76%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K AK+ E+++K+K+ ++ E+ ++K+ E+++ K+ +K++
Sbjct: 72 KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKER 110
Score = 28.6 bits (64), Expect = 9.6
Identities = 11/38 (28%), Positives = 17/38 (44%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
A +K E K K + + +K E KKK + + K
Sbjct: 157 AAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 42.7 bits (101), Expect = 5e-04
Identities = 17/63 (26%), Positives = 32/63 (50%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K+ KK E+K ++ +A+ ++K++E +K + K K KK K ++N + N N
Sbjct: 102 KELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNF 161
Query: 81 NNN 83
Sbjct: 162 KEK 164
Score = 42.3 bits (100), Expect = 6e-04
Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 6/65 (9%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK------KKKKKKNNNNNNNNNNNNNNNN 77
KKE KK E+K ++ E + EKK++E +K K K KK KK ++N + N
Sbjct: 100 LKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKK 159
Query: 78 NNNNN 82
N
Sbjct: 160 NFKEK 164
Score = 40.8 bits (96), Expect = 0.002
Identities = 16/70 (22%), Positives = 34/70 (48%), Gaps = 1/70 (1%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN-NNNNNNNNNNNNNNNNN 79
+ ++ +K+ K+ + K ++ + E +KKE++ +K K K K ++N +
Sbjct: 95 AQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKG 154
Query: 80 NNNNNNNEEK 89
N N +EK
Sbjct: 155 LNYKKNFKEK 164
Score = 37.7 bits (88), Expect = 0.017
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
A +KEKK EK EK+ ++ + E K +K+ K+ +K
Sbjct: 411 DAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQ 451
Score = 32.3 bits (74), Expect = 0.82
Identities = 15/69 (21%), Positives = 26/69 (37%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+ K++E +K + K K KK KK + + K K N + N +
Sbjct: 118 IEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEKLLKELKSVILNAS 177
Query: 81 NNNNNNEEK 89
+ + E K
Sbjct: 178 SLLSLEELK 186
Score = 31.5 bits (72), Expect = 1.2
Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 4/71 (5%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN--NNNNNNN 78
+ +KKE++ +KAK K K KK KK + K N N +
Sbjct: 116 AEIEKKEEEL--EKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEKLLKELKSVI 173
Query: 79 NNNNNNNNEEK 89
N ++ + E+
Sbjct: 174 LNASSLLSLEE 184
Score = 29.2 bits (66), Expect = 7.7
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
K EK+ K+ +A+ K +KE K+ EK+
Sbjct: 422 KAINSLEKEIKQLEAEIKALEKEIKELEKQ 451
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 38.9 bits (91), Expect = 5e-04
Identities = 18/64 (28%), Positives = 24/64 (37%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPLV 101
K K KK KKKKKKKKKK + ++ ++ EE
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70
Query: 102 LVLT 105
+ LT
Sbjct: 71 MNLT 74
Score = 38.2 bits (89), Expect = 0.001
Identities = 15/47 (31%), Positives = 25/47 (53%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
L K K KKKK+KK KK + K+E ++++++K + K
Sbjct: 9 KLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55
Score = 37.0 bits (86), Expect = 0.002
Identities = 14/45 (31%), Positives = 27/45 (60%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
K K KK KKK+KKK+K K +++ +K++++K + ++
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55
Score = 36.6 bits (85), Expect = 0.003
Identities = 14/45 (31%), Positives = 26/45 (57%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K K K+ K+KK KKK+K K K++ +K++++K + +
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55
Score = 36.2 bits (84), Expect = 0.004
Identities = 18/61 (29%), Positives = 30/61 (49%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
K K K KKK+KKKK+K K K++ +K+ ++ ++ + +N EE G
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70
Query: 91 T 91
Sbjct: 71 M 71
Score = 35.5 bits (82), Expect = 0.008
Identities = 18/59 (30%), Positives = 29/59 (49%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
K K K KK + KKK+KKKKKK K K++ ++ ++ + +NE+
Sbjct: 8 GKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQ 66
Score = 35.1 bits (81), Expect = 0.010
Identities = 14/49 (28%), Positives = 23/49 (46%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K K KK K+KKKK+KKK K K++ + ++ ++
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55
Score = 34.3 bits (79), Expect = 0.019
Identities = 17/59 (28%), Positives = 22/59 (37%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
K K K KK KKKKKKKKKK K ++ ++ +E
Sbjct: 3 SNVVGGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDN 61
Score = 34.3 bits (79), Expect = 0.019
Identities = 14/50 (28%), Positives = 29/50 (58%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
K K K +K KK+KKK+KKK + K++ +K++++ ++ ++
Sbjct: 6 VGGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEG 55
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 41.9 bits (99), Expect = 6e-04
Identities = 11/38 (28%), Positives = 27/38 (71%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
++ K+++KK++ + K +E++K E+K++KK+ +K
Sbjct: 285 QEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 40.7 bits (96), Expect = 0.001
Identities = 13/44 (29%), Positives = 33/44 (75%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKK---EKKKKKKKKKKKKK 61
+ +++E+ +++K+ KKKE+++ K K E+++K ++K++KK+
Sbjct: 276 AEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQ 319
Score = 38.8 bits (91), Expect = 0.006
Identities = 13/41 (31%), Positives = 31/41 (75%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +++E++K K A+++ +++ ++KKE+KKK++++ K K
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
Score = 37.6 bits (88), Expect = 0.011
Identities = 13/46 (28%), Positives = 30/46 (65%), Gaps = 7/46 (15%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKK-------KKKKKKKKKKN 62
K E++++E+ +KKE+KK+++++ K +K ++K++KK
Sbjct: 274 KAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQ 319
Score = 37.6 bits (88), Expect = 0.014
Identities = 12/41 (29%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +K K +E++ +K K E++++E+ ++KK++KKK++
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEE 295
Score = 32.2 bits (74), Expect = 0.64
Identities = 13/49 (26%), Positives = 26/49 (53%), Gaps = 9/49 (18%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKK---------KEKKKKKKKKKKKKKNNN 64
E +K K +++E++K K +EKK++KKK++++ K
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 40.9 bits (96), Expect = 6e-04
Identities = 14/68 (20%), Positives = 36/68 (52%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
S + D + A+ +E K++EK+A E K++K EK+ ++ +++ +++
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98
Query: 62 NNNNNNNN 69
+ +++ N
Sbjct: 99 DEESSDEN 106
Score = 40.9 bits (96), Expect = 7e-04
Identities = 12/69 (17%), Positives = 33/69 (47%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
S + E D ++ + E+ K+E+K + KE K +K+ ++ +++ ++
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 62 NNNNNNNNN 70
+ +++ N
Sbjct: 98 EDEESSDEN 106
Score = 40.1 bits (94), Expect = 0.001
Identities = 10/54 (18%), Positives = 29/54 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
K ++ + +E K ++KE + K++K +K+ ++ ++ N + +++ N
Sbjct: 53 KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDEN 106
Score = 38.6 bits (90), Expect = 0.004
Identities = 13/82 (15%), Positives = 33/82 (40%), Gaps = 2/82 (2%)
Query: 9 IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
+ ++ KK ++ A+ +E K+E+K+ + K+ K +K + +
Sbjct: 36 FPSSPSDQAAADEQEAKK--SDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEE 93
Query: 69 NNNNNNNNNNNNNNNNNNEEKG 90
N + +++ N E+
Sbjct: 94 ENEEEDEESSDENEKETEEKTE 115
Score = 28.2 bits (63), Expect = 9.2
Identities = 8/42 (19%), Positives = 24/42 (57%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ + K +KE + ++E ++E ++ + +K+ ++K + N
Sbjct: 76 EDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESN 117
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 40.0 bits (94), Expect = 6e-04
Identities = 13/53 (24%), Positives = 34/53 (64%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
D ++ +++++++KE+K +K+EK+KE K +K++++K+ ++ N
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65
Score = 34.6 bits (80), Expect = 0.040
Identities = 12/56 (21%), Positives = 35/56 (62%), Gaps = 7/56 (12%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE-------KKKKKKKKKKKKKNNNNNNNN 69
+ +EK++++++ K+++++KE+K+KE K++++K+ ++ +K N
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65
Score = 32.7 bits (75), Expect = 0.20
Identities = 11/59 (18%), Positives = 34/59 (57%), Gaps = 8/59 (13%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK--------KKKKKKKKKKNNNNNNNNNNN 72
K+ + +++++ K++EK+++++K+EK+K K++++K+ ++ N
Sbjct: 7 KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65
Score = 29.2 bits (66), Expect = 3.2
Identities = 7/35 (20%), Positives = 26/35 (74%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
++K + + ++++++KE++K++K++K++K+
Sbjct: 5 RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWG 39
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 42.1 bits (99), Expect = 7e-04
Identities = 16/45 (35%), Positives = 35/45 (77%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
P + ++ +K+EKK++E+K KE++ ++++++++KKKK KK K+
Sbjct: 36 PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
Score = 41.3 bits (97), Expect = 0.001
Identities = 16/44 (36%), Positives = 32/44 (72%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ ++ K+++K+++EK K+E+ E+++KE+KKKK KK K+
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT 82
Score = 38.6 bits (90), Expect = 0.007
Identities = 15/41 (36%), Positives = 31/41 (75%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
++++++KKE++ K +K++E ++E+K++KKKK KK K
Sbjct: 40 EEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
Score = 35.1 bits (81), Expect = 0.088
Identities = 18/48 (37%), Positives = 32/48 (66%), Gaps = 8/48 (16%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKK--------EKKKKKKKKKKKKK 61
K +KE +E++ +K+EKK+E++K E+++K++KKKK KK
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
>gnl|CDD|216682 pfam01757, Acyl_transf_3, Acyltransferase family. This family
includes a range of acyltransferase enzymes. This domain
is found in many as yet uncharacterized C. elegans
proteins and it is approximately 300 amino acids long.
Length = 326
Score = 41.8 bits (98), Expect = 7e-04
Identities = 28/202 (13%), Positives = 66/202 (32%), Gaps = 8/202 (3%)
Query: 221 RRPNYRMYMRR---TFLLPYAIVTSLFFI--GQFGGMTTLQTYAVGIFESIHAPLDPYFA 275
RR + ++++ L+PY + L+F+ GG++ + + + P
Sbjct: 65 RRRSLFKFLKKRLLRLLIPYLFWSLLYFLLGLLVGGLSVYGLLLLLLLLGLLFGGGPNGH 124
Query: 276 TLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVP 335
L A LL +L+ K L+ A +++++ L G ++
Sbjct: 125 LWFLP-ALFVFYLLLPLLLRLLRKLKKLLLL--LLLALLLLLSLLYILILLVGLPPTVLN 181
Query: 336 TVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDT 395
+ +L FL + + + I + A + + F +Y
Sbjct: 182 LLIGLLPFFLLGALLARYRKRIRSKRLLLLIVILLALALLALILLLLFLFGLVYLAPELY 241
Query: 396 FHLWGTLYFYAAISVVGTLYMY 417
+ L + ++ +
Sbjct: 242 GYFSLLLLLLGVLLLLLLALLL 263
Score = 36.7 bits (85), Expect = 0.025
Identities = 31/194 (15%), Positives = 54/194 (27%), Gaps = 20/194 (10%)
Query: 225 YRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAEL 284
R+ + LL ++ L + + L + + L P+F
Sbjct: 142 LRLLRKLKKLLLLLLLALLLLLSLLYILILLVGLPPTVLNLLIGLL-PFFLL-------- 192
Query: 285 GGALLCVVLIHYTGKRPLALISTGGSAA---CFIVVAVYAQFHLSYGWDSPLVPTVFLVL 341
GALL KR L LI + A +++ + L+L
Sbjct: 193 -GALLARYRKRIRSKRLLLLIVILLALALLALILLLLFLFGLVYLAPELYGYFSLLLLLL 251
Query: 342 AAFLTHICIRLLPWMLIGEVFPNNIRATASGASGS-SSYIFAFAVNKLYYPMLDTFHLWG 400
L + LL + R S Y+ + L +L G
Sbjct: 252 GVLLLLLLALLLAN------LRSLKRLLKYLGKYSLGIYLIHPPILLLLTKLLLLLPPLG 305
Query: 401 TLYFYAAISVVGTL 414
+ + V+ L
Sbjct: 306 PILLFLLALVLTLL 319
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 40.0 bits (94), Expect = 7e-04
Identities = 16/51 (31%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKKKKN 62
P KKAK+K ++ K K K++ EK+KEK+++ K + +K++ +
Sbjct: 58 ESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAEL 108
Score = 38.9 bits (91), Expect = 0.001
Identities = 14/44 (31%), Positives = 29/44 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+K K++ ++ + KAKK+E +KEK+K+E+ K + +K++
Sbjct: 65 RKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAEL 108
Score = 32.0 bits (73), Expect = 0.37
Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 4/46 (8%)
Query: 20 WKKAKKKEKKK-KEKKAKKKEKKKEKKKKEKKKKK---KKKKKKKK 61
K ++ EK+K KE++ K + EK++ E +KKK K K++KK
Sbjct: 78 KAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEKK 123
Score = 30.8 bits (70), Expect = 0.86
Identities = 14/38 (36%), Positives = 26/38 (68%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
D KAKK+E +K+++K ++ K + +KE+ + +KKK
Sbjct: 75 DKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKK 112
>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region. This
family contains sequences that are similar to the
N-terminal region of Red protein. This and related
proteins contain a RED repeat which consists of a
number of RE and RD sequence elements. The region in
question has several conserved NLS sequences and a
putative trimeric coiled-coil region, suggesting that
these proteins are expressed in the nucleus. The
function of Red protein is unknown, but efficient
sequestration to nuclear bodies suggests that its
expression may be tightly regulated of that the protein
self-aggregates extremely efficiently.
Length = 238
Score = 41.0 bits (96), Expect = 7e-04
Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 9/71 (12%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN--------- 78
KKKK +K+E+ EK+ K + + ++++K N + + ++
Sbjct: 2 KKKKYAYLRKQEENAEKEINPKYRDRARERRKGINKDYDPSSLAAYRAVAPDAKSDIDAA 61
Query: 79 NNNNNNNNEEK 89
N NN +E K
Sbjct: 62 ENRNNTIDESK 72
Score = 33.3 bits (76), Expect = 0.22
Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 1/69 (1%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
KK K +K+E+ A+K+ K + + +E++K K + ++ +
Sbjct: 2 KKKKYAYLRKQEENAEKEINPKYRDRARERRKGINKDYDPSSLAAYRAVAPDAKSDIDAA 61
Query: 80 NNNNNNNEE 88
N NN +E
Sbjct: 62 ENRNNTIDE 70
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 42.0 bits (99), Expect = 8e-04
Identities = 14/71 (19%), Positives = 30/71 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
++ KK +++ + EK+ E KK +K+ K + K+KK+ + + N
Sbjct: 413 EEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVAN 472
Query: 81 NNNNNNEEKGT 91
+E+
Sbjct: 473 KLLKRSEKAQK 483
Score = 38.9 bits (91), Expect = 0.006
Identities = 15/56 (26%), Positives = 26/56 (46%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K+ KKK K K K K + KKKKKK+K +++ + ++ + +
Sbjct: 507 AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVD 562
Score = 38.1 bits (89), Expect = 0.011
Identities = 19/75 (25%), Positives = 40/75 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K++ K+K + K +K K K K KKKKKK+K + +++ + ++ + +
Sbjct: 503 VGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVD 562
Query: 81 NNNNNNEEKGTFLYK 95
+ + ++E+ FL+K
Sbjct: 563 DEEDEDDEELPFLFK 577
Score = 36.6 bits (85), Expect = 0.033
Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 6/51 (11%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK------KKKKKNN 63
W K++KKK K+ ++ K E KKEK+K KK K K+ KK
Sbjct: 620 SWAGDGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKNVIINEKRNKKAA 670
Score = 36.2 bits (84), Expect = 0.046
Identities = 16/85 (18%), Positives = 29/85 (34%), Gaps = 9/85 (10%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKE---------KKKKEKKKKKKKKKKKKKNNNNN 66
+ K K K K + +KE+++E K K KK+ KKK+++
Sbjct: 461 DEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKL 520
Query: 67 NNNNNNNNNNNNNNNNNNNNEEKGT 91
+ N + E+
Sbjct: 521 DKAANKISKAAVKVKKKKKKEKSID 545
Score = 36.2 bits (84), Expect = 0.055
Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 6/66 (9%)
Query: 32 EKKAKKKEKKKEKKKK------EKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
E++ ++ KK ++K EK+ + KK KK+ KN ++ +
Sbjct: 409 EEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVE 468
Query: 86 NEEKGT 91
Sbjct: 469 KVANKL 474
Score = 35.0 bits (81), Expect = 0.11
Identities = 14/75 (18%), Positives = 29/75 (38%), Gaps = 15/75 (20%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKE---------------KKKKKKKKKKKKK 61
+ + K +K K K+ EK ++++++E K KK+ KKK
Sbjct: 457 EELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKS 516
Query: 62 NNNNNNNNNNNNNNN 76
++ + N +
Sbjct: 517 SSKLDKAANKISKAA 531
Score = 33.9 bits (78), Expect = 0.23
Identities = 14/72 (19%), Positives = 31/72 (43%), Gaps = 7/72 (9%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKK-------KKKKKKKKKKNNNNNNNNNNNNNNNN 77
++E + +E+ +++ ++ KK ++K K+ + KK KK N N ++
Sbjct: 398 RRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEE 457
Query: 78 NNNNNNNNNEEK 89
+ EK
Sbjct: 458 ELEDEEEAKVEK 469
Score = 33.5 bits (77), Expect = 0.29
Identities = 16/64 (25%), Positives = 32/64 (50%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
T K+ KK+ KK+ +K + + K K KKKKKK+K + +++ + ++
Sbjct: 499 TTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIK 558
Query: 76 NNNN 79
+ +
Sbjct: 559 LDVD 562
Score = 32.0 bits (73), Expect = 0.97
Identities = 11/33 (33%), Positives = 22/33 (66%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
KK++KK+++K++ K + KK+K+K+ N
Sbjct: 626 IKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658
Score = 31.6 bits (72), Expect = 1.3
Identities = 15/48 (31%), Positives = 29/48 (60%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
P + +K + +K KKE K + KEKK+ +++E + +++ K +K N
Sbjct: 426 GPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANK 473
Score = 30.4 bits (69), Expect = 3.0
Identities = 15/84 (17%), Positives = 31/84 (36%), Gaps = 15/84 (17%)
Query: 23 AKKKEKKKKEKKAKKKEK-------KKEKKKKEKKKKKKKK--------KKKKKNNNNNN 67
A+KKE+ E + ++E ++E ++ KK ++K + + K N
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKEN 443
Query: 68 NNNNNNNNNNNNNNNNNNNEEKGT 91
N ++ + EE
Sbjct: 444 KNEFKEKKESDEEEELEDEEEAKV 467
Score = 30.4 bits (69), Expect = 3.4
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 34 KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K +KK++K++++ K + KK+K+K K N
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation
of 60S ribosomal subunits.
Length = 132
Score = 39.3 bits (92), Expect = 8e-04
Identities = 13/39 (33%), Positives = 27/39 (69%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
++KK K KK+ K++K EKK +++ + +K++ +K + K
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVK 64
Score = 35.0 bits (81), Expect = 0.027
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KK K +KK K +K +KK K++ + +K + +K + K
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVK 64
Score = 34.3 bits (79), Expect = 0.050
Identities = 11/31 (35%), Positives = 21/31 (67%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ KK K +KK + +K EKK K++ + +K++
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQ 56
Score = 33.9 bits (78), Expect = 0.056
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ K+ KK+ K EK +K+ K++ + +K++
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALE 59
Score = 32.7 bits (75), Expect = 0.17
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK K K+K +K +KK K++ +K++ +K + K
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKP 65
Score = 31.2 bits (71), Expect = 0.53
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ KK KAKKK K ++ +KK K++ + +K++ +
Sbjct: 25 SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALE 59
Score = 30.0 bits (68), Expect = 1.3
Identities = 11/41 (26%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+K ++ K+ K KKK + +K +KK K++ +
Sbjct: 11 ASKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLR 51
Score = 27.7 bits (62), Expect = 7.6
Identities = 13/36 (36%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
L K K K +K EKKAK++ + ++++ EK + K
Sbjct: 30 LLKAKKKLKSEK-LEKKAKRQLRAEKRQALEKGRVK 64
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 38.9 bits (91), Expect = 9e-04
Identities = 14/54 (25%), Positives = 32/54 (59%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
L + K+KE++K + KKK KK KK KK+++ ++++++++++
Sbjct: 61 LQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSS 114
Score = 35.9 bits (83), Expect = 0.013
Identities = 9/51 (17%), Positives = 35/51 (68%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
++++K+++K ++ +KK++K + + KK KK+ +++++++++++
Sbjct: 64 PQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSS 114
Score = 34.7 bits (80), Expect = 0.033
Identities = 12/53 (22%), Positives = 33/53 (62%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
+ + K+K+++K+ + KKK KK + + KK KK +++++++++++
Sbjct: 62 QQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSS 114
Score = 31.6 bits (72), Expect = 0.39
Identities = 10/56 (17%), Positives = 34/56 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
++ ++ ++K+KE++ + KK++KK + KK K +++++++++++
Sbjct: 59 QQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSS 114
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 42.0 bits (99), Expect = 0.001
Identities = 21/67 (31%), Positives = 34/67 (50%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
++ K K K K K K K KKKEKKKK+ K KK N+ +++ ++ +N
Sbjct: 1156 QRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN 1215
Query: 81 NNNNNNE 87
+N++
Sbjct: 1216 KKSNSSG 1222
Score = 41.6 bits (98), Expect = 0.001
Identities = 15/59 (25%), Positives = 31/59 (52%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
++ + ++++ + K K K K K K +KK+KKKKK K+ + N+ +++
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205
Score = 39.3 bits (92), Expect = 0.006
Identities = 17/60 (28%), Positives = 34/60 (56%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
++++ KE++ K K K K K ++ K KKK+KKKKK + + + + N+ +++
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205
Score = 38.5 bits (90), Expect = 0.011
Identities = 19/74 (25%), Positives = 38/74 (51%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
+ ++ E+K+ K+ + K K K K K +K K KKK+KKKK ++ + + + N+ +
Sbjct: 1143 EQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVD 1202
Query: 82 NNNNNEEKGTFLYK 95
++ + K
Sbjct: 1203 SDEKRKLDDKPDNK 1216
Score = 38.1 bits (89), Expect = 0.014
Identities = 18/91 (19%), Positives = 37/91 (40%)
Query: 1 MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
++ + T K +K + KKKEKK KK K KK K+ +K
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEK 1206
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
+ ++ +N +N++ ++ ++ +
Sbjct: 1207 RKLDDKPDNKKSNSSGSDQEDDEEQKTKPKK 1237
Score = 35.8 bits (83), Expect = 0.080
Identities = 16/42 (38%), Positives = 25/42 (59%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KKK KK+ E +KKK+ +KK +KKK K + K+ + +
Sbjct: 1306 TKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347
Score = 35.4 bits (82), Expect = 0.091
Identities = 15/67 (22%), Positives = 38/67 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+ +++E+ ++++ AK++ K + K K K +K K KKK+K ++ + + + N+
Sbjct: 1139 EALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNS 1198
Query: 81 NNNNNNE 87
+++E
Sbjct: 1199 KRVDSDE 1205
Score = 35.4 bits (82), Expect = 0.10
Identities = 18/62 (29%), Positives = 29/62 (46%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
S TK ++ E + KK KK EKK KK K K+ + + ++ +KKK
Sbjct: 1303 SSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSD 1362
Query: 62 NN 63
++
Sbjct: 1363 SS 1364
Score = 35.0 bits (81), Expect = 0.12
Identities = 16/72 (22%), Positives = 40/72 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
++A +++++ +EK+ K+++ K K K + K +K K KKK+ ++ + + + N
Sbjct: 1138 EEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197
Query: 81 NNNNNNEEKGTF 92
+ +++EK
Sbjct: 1198 SKRVDSDEKRKL 1209
Score = 33.9 bits (78), Expect = 0.29
Identities = 18/59 (30%), Positives = 27/59 (45%)
Query: 4 STKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
S + + P K K K++ + A KK+KK EKK KKK K + K+ +
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347
Score = 31.6 bits (72), Expect = 1.3
Identities = 17/72 (23%), Positives = 36/72 (50%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
LD + +E+++ E+K KE++ + K K K K +K K KKK ++ + + +
Sbjct: 1134 LDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKAS 1193
Query: 78 NNNNNNNNNEEK 89
N+ + ++
Sbjct: 1194 VVGNSKRVDSDE 1205
Score = 31.2 bits (71), Expect = 1.6
Identities = 15/69 (21%), Positives = 29/69 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK KKK+K +K K K+ +K+K K K +N++ ++ ++
Sbjct: 1176 KKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKP 1235
Query: 81 NNNNNNEEK 89
++ K
Sbjct: 1236 KKSSVKRLK 1244
Score = 30.8 bits (70), Expect = 2.3
Identities = 11/50 (22%), Positives = 17/50 (34%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
K+ + +K K+K KK + KKKKK+
Sbjct: 1286 PPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKK 1335
Score = 30.8 bits (70), Expect = 2.8
Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK---KKKKKKKKKKNNNN 65
K KKK +K+ + +KKKK +KK KKK K + K+ + +
Sbjct: 1301 KPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347
Score = 29.6 bits (67), Expect = 6.3
Identities = 14/57 (24%), Positives = 22/57 (38%)
Query: 3 SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+ G+ K + + KK KK EKK +KKK K + K+ +
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQ 1348
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 41.6 bits (98), Expect = 0.001
Identities = 11/54 (20%), Positives = 26/54 (48%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
K +++ K + K ++++ ++K ++NN + N + +N N EE
Sbjct: 585 ETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREE 638
Score = 40.0 bits (94), Expect = 0.004
Identities = 14/108 (12%), Positives = 29/108 (26%), Gaps = 27/108 (25%)
Query: 9 IGTFTEP-TPLDWKKAKKKEKKKKEKKAKKK------------------------EKKKE 43
+ TF P P A+ E+ K
Sbjct: 529 LATFAMPDVPPAPTPAEPAAPVVAAAPKAAAATPPAQPGLLSRFFGALKALFSGGEETKP 588
Query: 44 KKKKEKKKKKKKKKKKKKNNNNNNNN--NNNNNNNNNNNNNNNNNEEK 89
+++ K + K ++++ + NN N + +N E +
Sbjct: 589 QEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENR 636
Score = 38.9 bits (91), Expect = 0.008
Identities = 17/94 (18%), Positives = 29/94 (30%), Gaps = 18/94 (19%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEK-----------KK------EKKKKEKKKKKKKK 56
PTP A K A + K E K +E+ K +
Sbjct: 540 APTP-AEPAAPVVAAAPKAAAATPPAQPGLLSRFFGALKALFSGGEETKPQEQPAPKAEA 598
Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
K +++ + NN + N + +N +G
Sbjct: 599 KPERQQDRRKPRQNNRRDRNERRDTRDNRTRREG 632
Score = 38.1 bits (89), Expect = 0.014
Identities = 9/63 (14%), Positives = 29/63 (46%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
E+ K +++ K + K +++++++K ++ ++ + + +N N N N
Sbjct: 584 EETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNR 643
Query: 87 EEK 89
+
Sbjct: 644 RQA 646
Score = 37.7 bits (88), Expect = 0.016
Identities = 9/56 (16%), Positives = 26/56 (46%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
++ K +E+ K + + ++++ ++K ++ N + N + +N N E
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREE 638
Score = 36.6 bits (85), Expect = 0.041
Identities = 9/65 (13%), Positives = 31/65 (47%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
+ K +++ KA+ K ++++ ++K ++ ++ + +++ +N N N N
Sbjct: 585 ETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRR 644
Query: 84 NNNEE 88
++
Sbjct: 645 QAQQQ 649
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 39.7 bits (93), Expect = 0.001
Identities = 16/41 (39%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ E K + K+ KK+E K K +KEK K + KK K K
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Score = 37.8 bits (88), Expect = 0.004
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
++ ++E + K + KEKKK+E K K +K+K K
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140
Score = 37.0 bits (86), Expect = 0.008
Identities = 15/51 (29%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
T E T + ++ + K + +KK+++ K K EK+K K + KK K
Sbjct: 101 TEEESTD---ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Score = 36.3 bits (84), Expect = 0.014
Identities = 18/44 (40%), Positives = 23/44 (52%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
TE K + KEKKK+E K EK+K K + +K K K K
Sbjct: 109 TEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 35.5 bits (82), Expect = 0.027
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ + + + KEKK ++ K K +K+K K + KK K K K
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 33.2 bits (76), Expect = 0.15
Identities = 9/38 (23%), Positives = 21/38 (55%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
E++ ++ ++ + + + KEKKK++ K K +K
Sbjct: 101 TEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEK 138
Score = 30.5 bits (69), Expect = 1.3
Identities = 9/44 (20%), Positives = 23/44 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+ +E++ +E+ + E++ + K + K+KKK++ K
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTE 135
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 40.9 bits (96), Expect = 0.001
Identities = 11/43 (25%), Positives = 32/43 (74%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+W AK+ +++KK +K ++++K+E++K+ ++ + +++ K+K
Sbjct: 91 NWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133
Score = 38.2 bits (89), Expect = 0.008
Identities = 11/40 (27%), Positives = 32/40 (80%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
W+ K K++++K ++ ++K++KK+++++E+K+K ++ +K
Sbjct: 187 WELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226
Score = 36.6 bits (85), Expect = 0.021
Identities = 12/47 (25%), Positives = 27/47 (57%), Gaps = 5/47 (10%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEK-----KKKEKKKKKKKKKKKKKN 62
+K K++ +K++E+ ++ KEK ++K ++ K++ K KK
Sbjct: 110 EKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKE 156
Score = 35.9 bits (83), Expect = 0.039
Identities = 11/41 (26%), Positives = 36/41 (87%), Gaps = 1/41 (2%)
Query: 22 KAKKK-EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+AKK+ ++ + +K ++++K++E+++K++KK+++++++K+K
Sbjct: 179 EAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219
Score = 35.5 bits (82), Expect = 0.048
Identities = 9/37 (24%), Positives = 30/37 (81%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KK K+++ K++E++++++KK+++++++K+K ++
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226
Score = 34.3 bits (79), Expect = 0.13
Identities = 8/38 (21%), Positives = 30/38 (78%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+K K++++ +++E++K++KK+++++++K+K ++
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226
Score = 32.8 bits (75), Expect = 0.34
Identities = 10/38 (26%), Positives = 31/38 (81%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K++++K+ +++ K+++K+++E+++K+K ++ +K
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226
Score = 32.8 bits (75), Expect = 0.40
Identities = 10/41 (24%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K E+ +++AKK+ ++ E KK +++++K++++++K+
Sbjct: 166 SGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQ 206
Score = 32.4 bits (74), Expect = 0.46
Identities = 11/49 (22%), Positives = 33/49 (67%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+P ++ KK ++ E K K++++K ++++ K++KK+++++++K
Sbjct: 169 AKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217
Score = 31.6 bits (72), Expect = 0.99
Identities = 10/38 (26%), Positives = 29/38 (76%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K+E+++++++ K++E+++ K+K E+ +K K K++
Sbjct: 197 QKREEERRKQRKKQQEEEERKQKAEEAWQKWMKNVKQR 234
>gnl|CDD|233172 TIGR00891, 2A0112, putative sialic acid transporter. [Transport
and binding proteins, Carbohydrates, organic alcohols,
and acids].
Length = 405
Score = 41.4 bits (97), Expect = 0.001
Identities = 49/283 (17%), Positives = 94/283 (33%), Gaps = 49/283 (17%)
Query: 105 TYVAEITQPHLRGMLS----ATASMTTIFGTVSQLFLGSF--LHWRSAAILNLLFPILAL 158
YV E HLR S + ++ + + WR+ +++L I AL
Sbjct: 123 AYVIESWPKHLRNKASGLLISGYAVGAVVAAQVYSLVVPVWGDGWRALFFISILPIIFAL 182
Query: 159 CALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELKRLGKD 218
IPE+ W + L G ++ + + A+ +S
Sbjct: 183 WLRKNIPEAEDWKEKHAGKALVRTMVDILYG--GEHRIANIVMTLAAAMVQSA------- 233
Query: 219 GQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLL 278
G+R P + + ++ + L TY L P+ +
Sbjct: 234 GKRWPTFVYLV---------VLVLFANLYSHPIQDLLPTYL-----KADLGLSPHTVANI 279
Query: 279 LGVAELGGALLCVV---LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVP 335
+ + +G + V L + G+R A + + + I+ + G + +
Sbjct: 280 VVFSNIGAIVGGCVFGFLGDWLGRR-KAYVCSLLAGQLLIIPV------FAIGANVAV-- 330
Query: 336 TVFLVLAAFLTHICIR----LLPWMLIGEVFPNNIRATASGAS 374
L L F + ++ +LP +GE FP + RA G +
Sbjct: 331 ---LGLGLFFQQMLVQGIWGILP-KHLGEYFPTDQRAAGLGFT 369
Score = 29.8 bits (67), Expect = 4.5
Identities = 37/182 (20%), Positives = 61/182 (33%), Gaps = 32/182 (17%)
Query: 273 YFATLLLGVAELGGALLCVVL----IHYTGKRPLALISTGGSAACFIVVAVYAQFHL--- 325
F+ LG V L + ++ SAA F L
Sbjct: 13 AFSAAWLGWLLDAFDFFLVALVLAEVAGEFGLTTVDAASLISAALISRWFGALMFGLWGD 72
Query: 326 SYGWDSPLVPTVFL-VLAAFLTHIC--------IRLLPWMLIG-----------EVFPNN 365
YG P+V ++ L RL+ + +G E +P +
Sbjct: 73 RYGRRLPMVTSIVLFSAGTLACGFAPGYITMFIARLVIGIGMGGEYGSSAAYVIESWPKH 132
Query: 366 IRATASG--ASG-SSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTLYMYFVMPE 422
+R ASG SG + + A V L P+ W L+F + + ++ L++ +PE
Sbjct: 133 LRNKASGLLISGYAVGAVVAAQVYSLVVPVWGDG--WRALFFISILPIIFALWLRKNIPE 190
Query: 423 TE 424
E
Sbjct: 191 AE 192
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 39.6 bits (93), Expect = 0.001
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 24 KKKEKKKKEKKAKKKE---KKKEKKKKEKKKKKKKKKKKK 60
KK E KK K K E K+ K EKK+KK K+KK
Sbjct: 129 KKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEKK 168
Score = 36.5 bits (85), Expect = 0.012
Identities = 14/43 (32%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK--KKKKKKK 61
+ K + K +E++ K+ KK+EK+ ++ KK KK + KK
Sbjct: 93 SRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKK 135
Score = 36.5 bits (85), Expect = 0.015
Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 12/56 (21%)
Query: 21 KKAKKKEK------------KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K+ KK+EK KK E K +KK ++ KK K+ K +KK+KKN
Sbjct: 109 KEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAG 164
Score = 35.7 bits (83), Expect = 0.023
Identities = 12/49 (24%), Positives = 27/49 (55%), Gaps = 1/49 (2%)
Query: 21 KKAKKKEK-KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
K +++KE+ K+ + K + K + K +E++ K+ KK++K+
Sbjct: 76 KDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124
Score = 35.3 bits (82), Expect = 0.032
Identities = 13/53 (24%), Positives = 26/53 (49%), Gaps = 2/53 (3%)
Query: 20 WKKAKKKEKKKKEKKA--KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
KK K E+K++ K+ K + K K K+++++ K+ KK++
Sbjct: 72 LKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124
Score = 34.9 bits (81), Expect = 0.048
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK----KEKKKKKKKKKKKKKN 62
K ++ + KE K ++KE KE KK K+ + KK KKK
Sbjct: 99 KNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDE 144
Score = 34.6 bits (80), Expect = 0.055
Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 13/53 (24%)
Query: 22 KAKKKEKKKKEKKAKKKE-------------KKKEKKKKEKKKKKKKKKKKKK 61
K +++ K+ KK+E KK E KK KKK + KK K+
Sbjct: 98 LKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQ 150
Score = 33.4 bits (77), Expect = 0.16
Identities = 12/56 (21%), Positives = 25/56 (44%), Gaps = 1/56 (1%)
Query: 21 KKAKKKEKKKKEK-KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
+KA KK K +EK + K+ + + + K K K ++++ K++
Sbjct: 69 EKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124
Score = 30.3 bits (69), Expect = 1.4
Identities = 14/44 (31%), Positives = 26/44 (59%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
D+++ + +E +K KK K E+K+E K+ + K + K K K+
Sbjct: 59 DYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKD 102
Score = 28.0 bits (63), Expect = 9.8
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
L K + K+ K+ +K +KK KK K+K
Sbjct: 138 LKKKFDELKKSKQLDKALEKKRKKNAGKEK 167
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 41.2 bits (96), Expect = 0.001
Identities = 19/60 (31%), Positives = 26/60 (43%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
+ +A K K K KK K E+++EKK K K+KK KK +N N
Sbjct: 517 ETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKK 576
Score = 40.1 bits (93), Expect = 0.003
Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 5/80 (6%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK-----KKKKK 56
D + Q E + + + + +K + K KK++ +E+++K+ K K+KK
Sbjct: 497 DEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKL 556
Query: 57 KKKKKNNNNNNNNNNNNNNN 76
KK K +N N
Sbjct: 557 YKKMKYSNAKKEEQAENLKK 576
Score = 37.0 bits (85), Expect = 0.023
Identities = 16/69 (23%), Positives = 27/69 (39%), Gaps = 5/69 (7%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K K + KK++ +++EKK + K+KK KK K +N N
Sbjct: 523 KDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMK-----YSNAKKEEQAENLKKK 577
Query: 81 NNNNNNEEK 89
++K
Sbjct: 578 KKQIAKQKK 586
Score = 31.6 bits (71), Expect = 1.2
Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 5/66 (7%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN-----NNNNNNNNNNNNN 83
K E K+ K K KK K +++++KK K +N +N N
Sbjct: 514 KYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAEN 573
Query: 84 NNNEEK 89
++K
Sbjct: 574 LKKKKK 579
Score = 30.4 bits (68), Expect = 2.6
Identities = 19/40 (47%), Positives = 25/40 (62%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KK KK K AKK+E+ + KKK+K+ K+KK KK
Sbjct: 552 KQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591
Score = 30.4 bits (68), Expect = 2.7
Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 21/68 (30%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKK-----KEKK----------KKE------KKKKKKKKK 57
D K+K K++K E++ +KK K K+KK KKE KKKKK+ K
Sbjct: 524 DVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAK 583
Query: 58 KKKKNNNN 65
+KK ++
Sbjct: 584 QKKLDSKK 591
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast)
and GNL3L (human) are putative GTPases which are
required for growth and play a role in processing of
nucleolar pre-rRNA. This family contains a potential
nuclear localisation signal.
Length = 80
Score = 37.6 bits (88), Expect = 0.001
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
+ + K +K+ ++ +KE KK K KKKK
Sbjct: 1 RLRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKK 33
Score = 33.0 bits (76), Expect = 0.053
Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 3/34 (8%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K EKK + + K +KE KK K KKKK
Sbjct: 3 RYKIEKKVAEHHR---KLRKEAKKNPTWKSKKKK 33
Score = 32.6 bits (75), Expect = 0.061
Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 19/62 (30%)
Query: 21 KKAKKKEKKKKEKKAKKK-------------------EKKKEKKKKEKKKKKKKKKKKKK 61
+K +K+ KK K+KKK E+KK K+++EK+++K+ +K ++
Sbjct: 15 RKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAERA 74
Query: 62 NN 63
Sbjct: 75 EA 76
Score = 31.5 bits (72), Expect = 0.16
Identities = 15/53 (28%), Positives = 30/53 (56%), Gaps = 14/53 (26%)
Query: 20 WKKAKKKE---------KKK-----KEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
WK KKK+ K++ +EKK K++E+K+ +K+ K ++ + +K+
Sbjct: 27 WKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAERAEARKR 79
Score = 31.5 bits (72), Expect = 0.17
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 20/60 (33%)
Query: 22 KAKKKEKKKKEKK-AKKKEKKKEKKKK-------------------EKKKKKKKKKKKKK 61
+ K E +K +K AKK K KKKK EKK+K++++K+++K
Sbjct: 7 EKKVAEHHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRK 66
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
This domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 37.9 bits (89), Expect = 0.001
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 5/46 (10%)
Query: 20 WKKAKKK-----EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+KKA +KKKEKK K E K+ KK +++K + +K
Sbjct: 48 YKKAHAAIRADPSRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93
Score = 29.8 bits (68), Expect = 0.97
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK A + + ++KK+K++ K + K+ KK
Sbjct: 49 KKAHAAIRADPSRKKKEKKEVKAESKRYNAKK 80
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 41.3 bits (97), Expect = 0.001
Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 5/48 (10%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
T TP +KK K KK+ + K + K KKK K K KK KK+
Sbjct: 742 TPKTPY-----EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 39.8 bits (93), Expect = 0.004
Identities = 15/37 (40%), Positives = 16/37 (43%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+KK K KK K K K KKK K K KK
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 39.0 bits (91), Expect = 0.006
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K+ K K+K A K K + KK+ K K KK KK+
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 38.3 bits (89), Expect = 0.013
Identities = 17/37 (45%), Positives = 18/37 (48%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
EKK K KK K K K +KK K K KK KK
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 36.3 bits (84), Expect = 0.040
Identities = 16/35 (45%), Positives = 17/35 (48%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K EKK K K+K K K K KKK K K K
Sbjct: 743 PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSK 777
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 38.0 bits (89), Expect = 0.001
Identities = 13/35 (37%), Positives = 25/35 (71%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KE+K++EK +++E K+ K K ++ ++K +K KK
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 34.5 bits (80), Expect = 0.024
Identities = 8/25 (32%), Positives = 19/25 (76%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ K++EK ++E++ K+ K K+++
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREE 25
Score = 33.8 bits (78), Expect = 0.037
Identities = 11/33 (33%), Positives = 23/33 (69%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
+K ++K ++++E K K K++E ++K +K KK
Sbjct: 3 RKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 32.6 bits (75), Expect = 0.091
Identities = 8/32 (25%), Positives = 23/32 (71%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ K+ +K ++++E K+ + K+++ ++K +K
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32
Score = 30.7 bits (70), Expect = 0.54
Identities = 12/35 (34%), Positives = 25/35 (71%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
K+ K++EK ++E++ K+ + K ++ +EK +K KK
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 29.5 bits (67), Expect = 1.1
Identities = 6/24 (25%), Positives = 18/24 (75%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKN 62
+++KE++K +++++ K+ K K+
Sbjct: 1 KERKEEEKAQREEELKRLKNLKRE 24
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 41.1 bits (96), Expect = 0.001
Identities = 12/69 (17%), Positives = 29/69 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
A+ ++ K + K K++ KK E + K+K + +++N + N +
Sbjct: 185 NLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQ 244
Query: 81 NNNNNNEEK 89
+ +E+
Sbjct: 245 ELLRDEQEE 253
Score = 36.1 bits (83), Expect = 0.058
Identities = 13/69 (18%), Positives = 27/69 (39%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
++ K +E K KE+ K E + K+K E +++ K N + +
Sbjct: 194 EELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEE 253
Query: 81 NNNNNNEEK 89
++ E +
Sbjct: 254 IESSKQELE 262
Score = 36.1 bits (83), Expect = 0.059
Identities = 8/64 (12%), Positives = 22/64 (34%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
L + K ++ K K+ KK + + K++ + +++ N +
Sbjct: 189 LIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLR 248
Query: 78 NNNN 81
+
Sbjct: 249 DEQE 252
Score = 35.7 bits (82), Expect = 0.071
Identities = 9/71 (12%), Positives = 28/71 (39%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+ + K ++ K +++AKK + + K+K + +++ N + +
Sbjct: 190 IIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRD 249
Query: 78 NNNNNNNNNEE 88
++ +E
Sbjct: 250 EQEEIESSKQE 260
Score = 35.3 bits (81), Expect = 0.11
Identities = 15/69 (21%), Positives = 31/69 (44%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
++ +K E KEK+ ++E+K EK K E++K++K K ++++
Sbjct: 763 EEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQ 822
Query: 81 NNNNNNEEK 89
E+
Sbjct: 823 LLIEQEEKI 831
Score = 34.9 bits (80), Expect = 0.13
Identities = 11/51 (21%), Positives = 29/51 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
K K+ +K+K+E + +KE K+ + K+E +++++++ +K +
Sbjct: 326 KLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEEL 376
Score = 33.8 bits (77), Expect = 0.27
Identities = 11/66 (16%), Positives = 25/66 (37%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
+E K +E K K++ KK + + K+K + +++ + N + +
Sbjct: 192 DLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQ 251
Query: 84 NNNEEK 89
E
Sbjct: 252 EEIESS 257
Score = 33.8 bits (77), Expect = 0.28
Identities = 10/52 (19%), Positives = 26/52 (50%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
+++ K + KK + +E+KKE ++ ++ ++ K+ + + N N
Sbjct: 988 EERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKVFF 1039
Score = 33.4 bits (76), Expect = 0.38
Identities = 13/55 (23%), Positives = 30/55 (54%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
K+ KK EK+ K++K + +E +KE K+ E K++ +++++++
Sbjct: 322 KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEEL 376
Score = 33.4 bits (76), Expect = 0.39
Identities = 15/82 (18%), Positives = 30/82 (36%), Gaps = 13/82 (15%)
Query: 21 KKAKKKEKKKKEKKAK--------KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN----- 67
K+ KK ++ E A+ K ++ K K++ +K + + K+K + N
Sbjct: 173 KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDY 232
Query: 68 NNNNNNNNNNNNNNNNNNNEEK 89
N + + EE
Sbjct: 233 LKLNEERIDLLQELLRDEQEEI 254
Score = 33.0 bits (75), Expect = 0.44
Identities = 13/59 (22%), Positives = 27/59 (45%), Gaps = 3/59 (5%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKK---KKKKKKKNNNNNNNNNNNNNNNNNNN 80
E ++KE++ K E KKE+ ++EKK+ + ++ ++ + N N
Sbjct: 981 IAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKVFF 1039
Score = 32.6 bits (74), Expect = 0.60
Identities = 14/41 (34%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +KE KK EK+ KK++++ E+ +KE K+ + K++ +++
Sbjct: 317 LKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEE 357
Score = 32.6 bits (74), Expect = 0.70
Identities = 12/42 (28%), Positives = 30/42 (71%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ +K + ++K K+++K+ KK EK+ K++K++ ++ +K+ K
Sbjct: 306 LERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKE 347
Score = 32.2 bits (73), Expect = 0.77
Identities = 13/42 (30%), Positives = 28/42 (66%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K +++ K+ EK+ KK EK+ +K+K+E ++ +K+ K+ +
Sbjct: 310 KVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIK 351
Score = 32.2 bits (73), Expect = 0.96
Identities = 10/59 (16%), Positives = 27/59 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
++++++ K KKE+ +E+KK+ ++ ++ ++ K + N N
Sbjct: 981 IAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKVFF 1039
Score = 32.2 bits (73), Expect = 0.97
Identities = 8/68 (11%), Positives = 24/68 (35%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+ K++ KK + + KEK + +++ K +++ + + ++
Sbjct: 200 ELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQ 259
Query: 81 NNNNNNEE 88
E
Sbjct: 260 ELEKEEEI 267
Score = 30.7 bits (69), Expect = 2.6
Identities = 9/70 (12%), Positives = 28/70 (40%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+ +++++ K++ K+++ +E+KK+ ++ ++ + N N
Sbjct: 976 VNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLN 1035
Query: 81 NNNNNNEEKG 90
E G
Sbjct: 1036 KVFFYLELGG 1045
Score = 28.8 bits (64), Expect = 8.8
Identities = 15/38 (39%), Positives = 25/38 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+ K E++K + + K KE +KE KK EK+ KK+K++
Sbjct: 301 SELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEI 338
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 40.0 bits (94), Expect = 0.002
Identities = 23/49 (46%), Positives = 32/49 (65%), Gaps = 5/49 (10%)
Query: 18 LDW--KKAKKKEKKK---KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
L W KK ++ +KK+ KE +A + +KK ++EKKKKKKK KKKK
Sbjct: 146 LAWLLKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKK 194
Score = 35.7 bits (83), Expect = 0.035
Identities = 19/45 (42%), Positives = 27/45 (60%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
LD K+A+ KKK A++++KKK+KK +KKK KK K
Sbjct: 161 LDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 40.9 bits (96), Expect = 0.002
Identities = 13/40 (32%), Positives = 30/40 (75%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+++ E K KE+K K++E +K++K++ + K+K++K ++K
Sbjct: 394 ASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
Score = 38.2 bits (89), Expect = 0.009
Identities = 15/36 (41%), Positives = 29/36 (80%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
+AK KE+K K+++ +KK+K++ + KEK++K ++KK
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 38.2 bits (89), Expect = 0.010
Identities = 11/41 (26%), Positives = 26/41 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K ++ + K ++K K++E +KK+K++ + K+K++K
Sbjct: 389 KDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429
Score = 36.7 bits (85), Expect = 0.026
Identities = 13/39 (33%), Positives = 31/39 (79%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
++A+ K K++K K+ + ++K+KE+ ++K+K++K ++KK
Sbjct: 396 EEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 36.3 bits (84), Expect = 0.036
Identities = 13/41 (31%), Positives = 27/41 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +E + K K+ K K+++ EKK+KE+ + K+K++K +
Sbjct: 391 ETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Score = 34.0 bits (78), Expect = 0.20
Identities = 11/44 (25%), Positives = 27/44 (61%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + K++ ++A+ K K+++ K++E +KK+K++ + K
Sbjct: 381 YTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424
Score = 33.6 bits (77), Expect = 0.28
Identities = 13/41 (31%), Positives = 27/41 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K + +E +AK KE+K ++++ EKK+K++ + K+K
Sbjct: 386 GEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEK 426
Score = 30.9 bits (70), Expect = 2.0
Identities = 12/39 (30%), Positives = 26/39 (66%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK + K E A ++ + K K++K K+++ +KK+K++ +
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQAD 421
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 37.3 bits (87), Expect = 0.002
Identities = 14/34 (41%), Positives = 17/34 (50%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K E KK+KKK+KKK E K KK +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAED 50
Score = 33.5 bits (77), Expect = 0.035
Identities = 13/40 (32%), Positives = 17/40 (42%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K E +KK KKK+KK E K KK + +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56
Score = 32.3 bits (74), Expect = 0.091
Identities = 12/34 (35%), Positives = 16/34 (47%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K KKK+KKKK+K + K KK +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAED 50
Score = 31.6 bits (72), Expect = 0.16
Identities = 10/28 (35%), Positives = 14/28 (50%)
Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
KK+KKKK+KK + K K +
Sbjct: 23 VKKKKKKKKKKAEDTAATAKAKKATAED 50
Score = 31.6 bits (72), Expect = 0.16
Identities = 12/37 (32%), Positives = 16/37 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
K KKKK+KK KK E K +K + +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53
Score = 28.9 bits (65), Expect = 1.5
Identities = 12/36 (33%), Positives = 18/36 (50%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
T D K KKK+KKKK + K K+ ++ +
Sbjct: 18 TEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 37.6 bits (87), Expect = 0.002
Identities = 14/54 (25%), Positives = 24/54 (44%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
K + E K+ K+KK+ ++K+ N N N N N ++ EE+
Sbjct: 3 TKVDAAAELSAKDLKEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEE 56
Score = 36.9 bits (85), Expect = 0.004
Identities = 16/62 (25%), Positives = 23/62 (37%), Gaps = 5/62 (8%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
K A + K K+KKE ++K+ K N N N N N ++ E
Sbjct: 1 SDTKVDAAAELSAKDLKEKKEVVEEKENGK-----NAPANGNENEENGAQEGDDEMEEEE 55
Query: 88 EK 89
E
Sbjct: 56 EV 57
>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
Provisional.
Length = 1466
Score = 40.8 bits (95), Expect = 0.002
Identities = 22/55 (40%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE-----EKGT 91
+E+ + K NNN NN ++NNNNNNNNNN NN E+GT
Sbjct: 655 RERGSTVDVDIIGEDPTKDNKENNNKNNKDDNNNNNNNNNNKINNAGSYIIEQGT 709
Score = 40.4 bits (94), Expect = 0.003
Identities = 18/33 (54%), Positives = 24/33 (72%)
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
K+N NNN NN ++NNNNNNNNNN G+++
Sbjct: 672 KDNKENNNKNNKDDNNNNNNNNNNKINNAGSYI 704
Score = 36.2 bits (83), Expect = 0.053
Identities = 18/45 (40%), Positives = 24/45 (53%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
+E+ + K K+ N NN ++NNNNNNNNNN NN
Sbjct: 655 RERGSTVDVDIIGEDPTKDNKENNNKNNKDDNNNNNNNNNNKINN 699
Score = 30.8 bits (69), Expect = 2.5
Identities = 16/45 (35%), Positives = 22/45 (48%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
+E+ + + + K+ K K NNNNNNNNNN NN
Sbjct: 655 RERGSTVDVDIIGEDPTKDNKENNNKNNKDDNNNNNNNNNNKINN 699
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 38.8 bits (91), Expect = 0.002
Identities = 11/42 (26%), Positives = 29/42 (69%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK K++E++++ + + E+++ + EK+ K K++K+++N
Sbjct: 98 KKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRREN 139
Score = 38.0 bits (89), Expect = 0.003
Identities = 10/39 (25%), Positives = 27/39 (69%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
++ + +E ++E+ + EK+ K K+EK+++ ++K+K+
Sbjct: 107 EEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
Score = 36.5 bits (85), Expect = 0.010
Identities = 11/48 (22%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKE----KKKEKKKKEKKKKKKKKKKKKK 61
LD K+ +++E+++ E + +E + EK+ + K++K+++ ++K+
Sbjct: 96 LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143
Score = 33.8 bits (78), Expect = 0.081
Identities = 10/41 (24%), Positives = 24/41 (58%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ +E++ E K+ K K +K++E ++K+K+ K++
Sbjct: 112 EELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 33.0 bits (76), Expect = 0.16
Identities = 10/44 (22%), Positives = 27/44 (61%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K+KE++++E+ ++ ++E+ + +K+ K K++K+ N
Sbjct: 96 LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRREN 139
Score = 31.5 bits (72), Expect = 0.53
Identities = 11/49 (22%), Positives = 28/49 (57%), Gaps = 5/49 (10%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E LD ++E+ + + + + K+EK+++ ++K+K+ K++ K
Sbjct: 109 VEVEELD-----EEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 31.5 bits (72), Expect = 0.58
Identities = 11/44 (25%), Positives = 28/44 (63%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
KK++++++E++ + +E +E++ E +K+ K K++K N
Sbjct: 96 LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRREN 139
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 39.7 bits (93), Expect = 0.002
Identities = 15/41 (36%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK + K K KE KKK+K K K + KKK ++
Sbjct: 86 KKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSER 126
Score = 38.5 bits (90), Expect = 0.005
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K ++E++ K+KK K + KE KK+KKK K K
Sbjct: 75 EKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAA 116
Score = 38.1 bits (89), Expect = 0.006
Identities = 14/41 (34%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K+ + K ++ KKK+KK K K + KKK +
Sbjct: 85 KKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSE 125
Score = 37.0 bits (86), Expect = 0.013
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+KE +++E+ KKK K + K+ KKKKKK K
Sbjct: 73 EGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAK 111
Score = 37.0 bits (86), Expect = 0.014
Identities = 14/43 (32%), Positives = 20/43 (46%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+ K K K+ KK KKK+ K K + KKK ++
Sbjct: 87 KKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISW 129
Score = 36.6 bits (85), Expect = 0.021
Identities = 16/45 (35%), Positives = 25/45 (55%), Gaps = 4/45 (8%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE----KKKKKKKKKKKKK 61
++ ++ +KKK+ K KE K+KKKK+ K K + KKK
Sbjct: 79 QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKK 123
Score = 35.8 bits (83), Expect = 0.040
Identities = 14/37 (37%), Positives = 24/37 (64%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
E+ +KE + +++ KKK++ K + K+ KKKKKK
Sbjct: 72 EEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPT 108
Score = 33.5 bits (77), Expect = 0.18
Identities = 11/39 (28%), Positives = 17/39 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K K+ KKKK+K + K + KKK ++
Sbjct: 93 KAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131
Score = 33.1 bits (76), Expect = 0.30
Identities = 16/57 (28%), Positives = 26/57 (45%)
Query: 6 KHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K PT LD + K + K E+ KE++ + KK + K +K+K+K
Sbjct: 122 KKSERISWAPTLLDSPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKK 178
Score = 31.6 bits (72), Expect = 0.91
Identities = 12/34 (35%), Positives = 23/34 (67%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K+ + KE++ +KK + K +K+K+KKK+K
Sbjct: 148 QNKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181
Score = 31.2 bits (71), Expect = 1.1
Identities = 10/42 (23%), Positives = 16/42 (38%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ KKK KKK AK + + KK+ ++
Sbjct: 97 EPTKKK-KKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSP 137
Score = 30.8 bits (70), Expect = 1.6
Identities = 11/42 (26%), Positives = 18/42 (42%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K K+KKKK+ A K K + K+K ++ +
Sbjct: 96 KEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSP 137
Score = 30.4 bits (69), Expect = 1.9
Identities = 10/35 (28%), Positives = 22/35 (62%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
K+ ++ K+ + + KK + K +++K+KKK+K
Sbjct: 147 VQNKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181
Score = 30.4 bits (69), Expect = 2.1
Identities = 19/68 (27%), Positives = 30/68 (44%), Gaps = 27/68 (39%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK---------------------------KKEKKKKK 53
K+ + + KK + K K+KEKKKEK+ ++EKKK K
Sbjct: 158 KEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYEEQEEEKKKAK 217
Query: 54 KKKKKKKK 61
+ KK++
Sbjct: 218 IQALKKRR 225
Score = 28.5 bits (64), Expect = 9.2
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
K E+ K+ + +KK + K +K+KEKKK+K
Sbjct: 149 NKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 38.3 bits (90), Expect = 0.002
Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 4 STKHQIGTFTEPTPLD--WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
STK I P D +K+ KKK+KKKKE + +++KKK +E+K+ +K
Sbjct: 63 STKVDISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 35.2 bits (82), Expect = 0.029
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
K KKK+KKKK++ +EKKK+ E++K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 35.2 bits (82), Expect = 0.031
Identities = 14/35 (40%), Positives = 25/35 (71%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+KK+KK KKKE + ++KK+K +++K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 33.7 bits (78), Expect = 0.100
Identities = 12/35 (34%), Positives = 25/35 (71%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K++KKKK+KK + + +++KKK +++K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 32.9 bits (76), Expect = 0.17
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K KKKK+KK K+ E +EKKKK +++K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 31.4 bits (72), Expect = 0.62
Identities = 13/35 (37%), Positives = 25/35 (71%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
K+K+KKKK+KK + ++K+KK +++K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 40.4 bits (94), Expect = 0.002
Identities = 11/58 (18%), Positives = 21/58 (36%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
K++K+ + K A K E ++ ++KKK N+ N +
Sbjct: 5 KSRKRSGNNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVTNSKGKTNKVVDAKLEKE 62
Score = 33.4 bits (76), Expect = 0.31
Identities = 9/55 (16%), Positives = 16/55 (29%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
K++ + K K E + ++KKK K N +
Sbjct: 8 KRSGNNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVTNSKGKTNKVVDAKLEKE 62
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 40.2 bits (93), Expect = 0.002
Identities = 23/85 (27%), Positives = 33/85 (38%), Gaps = 7/85 (8%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN---- 69
E T + +KKE +++ A K + K K N N NN
Sbjct: 273 ENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTST 332
Query: 70 ---NNNNNNNNNNNNNNNNNEEKGT 91
N N N N+N N N+N N +G+
Sbjct: 333 PSKNTNTNTNSNTNTNSNTNANQGS 357
Score = 34.4 bits (78), Expect = 0.14
Identities = 20/89 (22%), Positives = 31/89 (34%), Gaps = 4/89 (4%)
Query: 3 SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++ K + T + P KA + K + K KN
Sbjct: 281 TTEKKETTTQQQTAP----KAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKN 336
Query: 63 NNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
N N N+N N N+N N N ++NN +
Sbjct: 337 TNTNTNSNTNTNSNTNANQGSSNNNSNSS 365
Score = 30.5 bits (68), Expect = 2.3
Identities = 17/66 (25%), Positives = 26/66 (39%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
KE +K++ +++ K + K + N N N N N N N NN N +
Sbjct: 272 KENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTS 331
Query: 86 NEEKGT 91
K T
Sbjct: 332 TPSKNT 337
Score = 30.2 bits (67), Expect = 3.2
Identities = 19/69 (27%), Positives = 27/69 (39%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
P K+A K+ EKK+ +++ K + K + N N N N N
Sbjct: 261 PAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTN 320
Query: 77 NNNNNNNNN 85
N N NN N
Sbjct: 321 TNTNTNNTN 329
>gnl|CDD|225620 COG3078, COG3078, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 169
Score = 38.3 bits (89), Expect = 0.003
Identities = 22/93 (23%), Positives = 47/93 (50%), Gaps = 9/93 (9%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
+ K+ ++ KA KK ++K +++ + + + +K++KK+K + + ++ N N+ N N
Sbjct: 2 KRSKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQN 61
Query: 83 NNNNEEKGTFLYKMAAPLVLVLTYVAE-ITQPH 114
+ G+ K PL V E +T+ H
Sbjct: 62 QKKDPRIGS---KKPIPLG-----VTEKVTKQH 86
Score = 32.9 bits (75), Expect = 0.23
Identities = 15/60 (25%), Positives = 32/60 (53%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
K KK ++ + K KK +K ++ + + +K++KK+K + + ++ N N+ N N
Sbjct: 2 KRSKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQN 61
Score = 32.5 bits (74), Expect = 0.33
Identities = 14/58 (24%), Positives = 32/58 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
K ++ + K +KKA++K +++ + +K++KK+K + ++ N N+ N N
Sbjct: 4 SKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQN 61
Score = 31.7 bits (72), Expect = 0.59
Identities = 15/60 (25%), Positives = 33/60 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K++KK + + + K + K +E+ E + +K++KK+K + + ++ N N+ N N
Sbjct: 2 KRSKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQN 61
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 39.5 bits (92), Expect = 0.003
Identities = 21/43 (48%), Positives = 21/43 (48%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKAKK KK KK KA KK KK K K K K K K
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGK 292
Score = 38.3 bits (89), Expect = 0.007
Identities = 20/47 (42%), Positives = 21/47 (44%), Gaps = 4/47 (8%)
Query: 21 KKAKK--KEKKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKKKNN 63
K A K K+ KK KKA KK K KK KK K K K K
Sbjct: 243 KTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKAT 289
Score = 37.9 bits (88), Expect = 0.009
Identities = 19/47 (40%), Positives = 20/47 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
KA KK KK K A K K K K K K KKK KK +
Sbjct: 264 AKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAK 310
Score = 37.9 bits (88), Expect = 0.010
Identities = 20/50 (40%), Positives = 22/50 (44%), Gaps = 2/50 (4%)
Query: 21 KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
KKA K KK KK KA K K K + K K KKK KK +
Sbjct: 261 KKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAK 310
Score = 37.6 bits (87), Expect = 0.012
Identities = 18/43 (41%), Positives = 19/43 (44%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKKK 61
K+ K KA KK KK KK KK K KK KK K
Sbjct: 234 TAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAK 276
Score = 37.2 bits (86), Expect = 0.016
Identities = 17/43 (39%), Positives = 20/43 (46%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKK--EKKKEKKKKEKKKKKKKKKKKKK 61
KKA K K +K +KKK + KK K K KK KK
Sbjct: 213 KKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKT 255
Score = 37.2 bits (86), Expect = 0.018
Identities = 18/50 (36%), Positives = 21/50 (42%), Gaps = 2/50 (4%)
Query: 13 TEPTPLDWKKAKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ K +KKK K KKA K K KK K+ KK KK K
Sbjct: 217 AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKA 266
Score = 36.8 bits (85), Expect = 0.025
Identities = 19/41 (46%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA KK K +K AKK K K K K K K K KK
Sbjct: 257 KKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKK 297
Score = 36.8 bits (85), Expect = 0.025
Identities = 19/43 (44%), Positives = 20/43 (46%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K AKK KK K KKA KK K K + K K K K K
Sbjct: 254 KTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAK 296
Score = 36.4 bits (84), Expect = 0.029
Identities = 15/41 (36%), Positives = 16/41 (39%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA K K K + K KK KKK K KK
Sbjct: 200 KKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKK 240
Score = 36.4 bits (84), Expect = 0.035
Identities = 16/40 (40%), Positives = 16/40 (40%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
A K KK KAK KK KKK K KK K
Sbjct: 205 TPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKT 244
Score = 36.4 bits (84), Expect = 0.035
Identities = 16/41 (39%), Positives = 17/41 (41%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK KK KK K KK KK K + K K K K
Sbjct: 253 KKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKA 293
Score = 36.0 bits (83), Expect = 0.040
Identities = 17/40 (42%), Positives = 18/40 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K AK KK K K KK KK +K KK K KK
Sbjct: 230 KAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKK 269
Score = 35.6 bits (82), Expect = 0.049
Identities = 14/47 (29%), Positives = 16/47 (34%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ KK K K K K KK +K KK KK K
Sbjct: 220 KSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKA 266
Score = 35.6 bits (82), Expect = 0.052
Identities = 16/41 (39%), Positives = 18/41 (43%), Gaps = 2/41 (4%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKE--KKKKKKKKKKKKK 61
KK K A K KK +K K+ KK K KK KK
Sbjct: 233 KTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKK 273
Score = 35.6 bits (82), Expect = 0.052
Identities = 17/43 (39%), Positives = 19/43 (44%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KAK KK +KKA K +K K K KK KK K
Sbjct: 217 AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKA 259
Score = 35.2 bits (81), Expect = 0.067
Identities = 16/46 (34%), Positives = 17/46 (36%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
TP AKK K K K +KK K KK K K K
Sbjct: 205 TPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAK 250
Score = 35.2 bits (81), Expect = 0.075
Identities = 18/43 (41%), Positives = 19/43 (44%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKK--KEKKKKEKKKKKKKKKKKKK 61
K K K KKAKK KK K+ K KK KK K K
Sbjct: 238 AKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAK 280
Score = 34.9 bits (80), Expect = 0.087
Identities = 16/50 (32%), Positives = 18/50 (36%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
K K KK KKA K K K + K K K KKK +
Sbjct: 261 KKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAK 310
Score = 34.5 bits (79), Expect = 0.12
Identities = 15/41 (36%), Positives = 18/41 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA K K + AK + K + KKK KK K K
Sbjct: 272 KKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKAT 312
Score = 32.9 bits (75), Expect = 0.38
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KAKKK KK +K K K K+ K KK KK KK+
Sbjct: 293 AKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKR 332
Score = 32.6 bits (74), Expect = 0.49
Identities = 14/41 (34%), Positives = 14/41 (34%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K KK K KK KKK K KK K
Sbjct: 204 KTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKT 244
Score = 32.6 bits (74), Expect = 0.55
Identities = 18/41 (43%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KAK K+K K+ A K K K K K KK KK KK
Sbjct: 291 GKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331
Score = 32.6 bits (74), Expect = 0.57
Identities = 14/38 (36%), Positives = 15/38 (39%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
AKK K K KK K K +K KKK K
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAV 236
Score = 30.6 bits (69), Expect = 2.0
Identities = 14/40 (35%), Positives = 16/40 (40%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KA K K K K K K K+K K+ K K K
Sbjct: 275 AKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAK 314
Score = 29.5 bits (66), Expect = 4.5
Identities = 17/42 (40%), Positives = 17/42 (40%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKK-KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A K K K K K K KKK KK K K K K
Sbjct: 276 KAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPK 317
Score = 29.1 bits (65), Expect = 5.6
Identities = 14/41 (34%), Positives = 18/41 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA K K + KAK K+K +K K K K K+
Sbjct: 279 AKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRG 319
Score = 29.1 bits (65), Expect = 5.6
Identities = 12/40 (30%), Positives = 14/40 (35%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
A K+ K +K K K KK KKK K
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTA 235
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 39.7 bits (93), Expect = 0.003
Identities = 13/40 (32%), Positives = 22/40 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K K K E+ K++ + +KKKEKK K +++ +
Sbjct: 356 KPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 37.8 bits (88), Expect = 0.014
Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 10/61 (16%)
Query: 13 TEPTPLDWK----------KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
EP + + K K + K K + K+++ + +++KK+KK K +++
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
Query: 63 N 63
N
Sbjct: 395 N 395
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 37.6 bits (88), Expect = 0.003
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
W + +K+K ++ K ++ +K KEKKKKKKK+ +
Sbjct: 52 WTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELE 90
Score = 37.2 bits (87), Expect = 0.004
Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 13/53 (24%)
Query: 22 KAKKKEKKKKEKKAKK-------KEKKKE------KKKKEKKKKKKKKKKKKK 61
+ K KEKKKK+KK + +EKKKE KK +E KK+ ++ K +K
Sbjct: 75 RRKLKEKKKKKKKELENFYRFQIREKKKEELAELRKKFEEDKKRIEQLKAARK 127
Score = 31.0 bits (71), Expect = 0.58
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +K K + + ++ +K K+KKKKKKK+
Sbjct: 57 RGGRKRKAGASRNKAAEERRKLKEKKKKKKKE 88
Score = 27.6 bits (62), Expect = 9.0
Identities = 9/30 (30%), Positives = 19/30 (63%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +K++ + K ++++K K+KKKKK
Sbjct: 56 TRGGRKRKAGASRNKAAEERRKLKEKKKKK 85
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 39.7 bits (93), Expect = 0.004
Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 15/85 (17%)
Query: 21 KKAKKKEKKKKEKKAKK---------------KEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+ K++E KKK K + K KK KKKKKKKKK++K + +
Sbjct: 236 AEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDED 295
Query: 66 NNNNNNNNNNNNNNNNNNNNNEEKG 90
++++ + + EE
Sbjct: 296 ELEPEAEGLGSSDSGSRKDVEEENA 320
Score = 37.0 bits (86), Expect = 0.026
Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 17/68 (25%)
Query: 21 KKAKKK-EKKKKEKKAK----------------KKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KA++K E+ K K KK KK++KKK+ ++KK +K+K
Sbjct: 63 AKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKER 122
Query: 64 NNNNNNNN 71
+ +
Sbjct: 123 AAEYTSED 130
Score = 32.0 bits (73), Expect = 0.85
Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 9/51 (17%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEK---------KKKKKKKKKKKKNNN 64
+K++++ +KK K + E + K KK KKKK
Sbjct: 235 EAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKK 285
Score = 31.7 bits (72), Expect = 1.2
Identities = 19/107 (17%), Positives = 36/107 (33%), Gaps = 38/107 (35%)
Query: 2 DSSTKHQIGTFTEPTPLDWKK------AKKKEKKKKEKKAKKKEKKKEKKK--------- 46
S+ + + E WKK AK+K ++ +EK AK +EK++ K
Sbjct: 28 GSTKESRDAAAYEN----WKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGE 83
Query: 47 -------------------KEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
K+K+ ++KK + + +
Sbjct: 84 DDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSED 130
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 37.7 bits (88), Expect = 0.004
Identities = 12/38 (31%), Positives = 24/38 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+ ++EK + K+A K +++E +K +K+ KK KK +
Sbjct: 88 VELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125
Score = 36.2 bits (84), Expect = 0.012
Identities = 13/43 (30%), Positives = 22/43 (51%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
E ++EK +KE K +++E +K +K+ KK KK
Sbjct: 86 HTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQ 128
Score = 30.0 bits (68), Expect = 1.3
Identities = 8/51 (15%), Positives = 23/51 (45%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
P++ + + E++ + K+ K ++++ +K +K+ KK
Sbjct: 71 DIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKS 121
Score = 28.5 bits (64), Expect = 4.5
Identities = 11/45 (24%), Positives = 20/45 (44%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEK 49
T + TE + +K K +++E + +K KK KK +
Sbjct: 83 TVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPV 127
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 39.9 bits (93), Expect = 0.004
Identities = 13/53 (24%), Positives = 24/53 (45%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
K A + ++K KEK KE K E E + ++ K++++ + N
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILN 842
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 39.7 bits (92), Expect = 0.004
Identities = 20/41 (48%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +K + EKK+KK KKKEKK+KEK++ K KKK+ +
Sbjct: 185 KSPEKGDVPAVEKKSKKP-KKKEKKEKEKERDKDKKKEVEG 224
Score = 39.3 bits (91), Expect = 0.006
Identities = 19/39 (48%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
P KK+KK KKKEKK K+KE+ K+KKK+ + K
Sbjct: 190 GDVPAVEKKSKKP--KKKEKKEKEKERDKDKKKEVEGFK 226
Score = 38.9 bits (90), Expect = 0.006
Identities = 16/31 (51%), Positives = 25/31 (80%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+KK KK KKKEKK+++K+++K KKK+ + K
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 38.1 bits (88), Expect = 0.011
Identities = 18/56 (32%), Positives = 32/56 (57%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
++S + + + D K + E+ KK K KKK+++KEK++K+KKKK +
Sbjct: 247 EASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 38.1 bits (88), Expect = 0.012
Identities = 16/40 (40%), Positives = 26/40 (65%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
P D +KK KK K+K+ K+KEK+++K KK++ + K
Sbjct: 187 PEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 37.4 bits (86), Expect = 0.018
Identities = 17/39 (43%), Positives = 25/39 (64%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K EK KK +K K+K+KKEK+K++ K KKK+
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222
Score = 37.0 bits (85), Expect = 0.028
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
T + P K + ++ KK+ K +KKK++K+KE+KKKKKK +
Sbjct: 253 TVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 36.2 bits (83), Expect = 0.047
Identities = 9/40 (22%), Positives = 28/40 (70%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
P+ + K +E+++ ++ +K +++K+K++KEK+ +++
Sbjct: 76 PMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 35.8 bits (82), Expect = 0.058
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K +K +KK KK +KK+K++K+K++ K KKK+
Sbjct: 182 ETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE 221
Score = 34.7 bits (79), Expect = 0.15
Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKK-KEKKKKEKKKKKKKKKKKKK 61
+E P+ + + K ++ EKK K+ KKKEKK+K+K++ K KK
Sbjct: 170 SEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKK 219
Score = 34.3 bits (78), Expect = 0.16
Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 6/59 (10%)
Query: 3 SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
S GT + P + K A+ +E KK K KK+K++KEK++KKKKKK
Sbjct: 249 SLANTVSGTAPDSEPDEPKDAEAEETKKSPKH------KKKKQRKEKEEKKKKKKHHHH 301
Score = 33.5 bits (76), Expect = 0.33
Identities = 10/26 (38%), Positives = 22/26 (84%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +E+++ +++ EK K++KKK++K+K
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEK 108
Score = 33.1 bits (75), Expect = 0.45
Identities = 10/34 (29%), Positives = 27/34 (79%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K ++ ++ ++ EK K+EKKK++K+K+ ++++++
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116
Score = 32.3 bits (73), Expect = 0.78
Identities = 7/34 (20%), Positives = 27/34 (79%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K E++ + +++ ++ K+++KK++K+K+ +++ ++
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116
Score = 32.0 bits (72), Expect = 0.96
Identities = 7/33 (21%), Positives = 26/33 (78%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K ++E++ +++ +K +++KK+++K+K+ +++
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 31.6 bits (71), Expect = 1.3
Identities = 14/42 (33%), Positives = 26/42 (61%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K + E K +K +K+ KK +KK+KK+K+K++ K+
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKD 217
Score = 31.2 bits (70), Expect = 1.8
Identities = 8/34 (23%), Positives = 26/34 (76%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K +E++ ++ +K+K++K+K++K+K+ +++ +
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116
Score = 30.8 bits (69), Expect = 2.1
Identities = 10/34 (29%), Positives = 26/34 (76%), Gaps = 1/34 (2%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
K E++++ ++ +K+K++ KKK+EK+K+ +++
Sbjct: 83 KLEEERRHRQRLEKDKRE-KKKREKEKRGRRRHH 115
Score = 30.4 bits (68), Expect = 2.8
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE 48
+ ++ EK K+EKK ++KEK+ ++
Sbjct: 89 RHRQRLEKDKREKKKREKEKRGRRRHHS 116
Score = 29.3 bits (65), Expect = 6.7
Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK--KKKKKKN 62
+ + K A+ +E KK K K+KK++K+K KKKKKK+
Sbjct: 256 GTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKH 298
>gnl|CDD|129965 TIGR00887, 2A0109, phosphate:H+ symporter. This model represents
the phosphate uptake symporter subfamily of the major
facilitator superfamily (pfam00083) [Transport and
binding proteins, Anions].
Length = 502
Score = 39.3 bits (92), Expect = 0.004
Identities = 41/159 (25%), Positives = 63/159 (39%), Gaps = 48/159 (30%)
Query: 286 GALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFL 345
L V+ Y +ST G A + V AQF ++G ++ T F+V
Sbjct: 376 TVLFFVLGFAYNH------LSTHGFLAIY----VLAQFFANFGPNA----TTFIV----- 416
Query: 346 THICIRLLPWMLIGEVFPNNIRATASGASGSSSYI------FAF-------AVNKLYYPM 392
P GEVFP R+TA G S +S F F K Y
Sbjct: 417 --------P----GEVFPTRYRSTAHGISAASGKAGAIIGQFGFLYLAQHGDPTKGYPTG 464
Query: 393 LDTFHLWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDI 431
+ H+ L +A +G L+ ++PET+G++L ++
Sbjct: 465 IWMGHV---LEIFALFMFLGILFT-LLIPETKGKSLEEL 499
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 39.6 bits (92), Expect = 0.004
Identities = 19/63 (30%), Positives = 29/63 (46%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
DSS E AK + ++ ++ + ++EK +E+ KK KK KK K KK
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348
Query: 62 NNN 64
N
Sbjct: 349 NGL 351
Score = 37.6 bits (87), Expect = 0.017
Identities = 16/49 (32%), Positives = 24/49 (48%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
L + K E ++ E + +E+K E++ KK KK KK K K N
Sbjct: 303 KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351
Score = 36.5 bits (84), Expect = 0.040
Identities = 14/77 (18%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKK-KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
P + ++ + E+ ++EK ++ KK KK K+ K KK K ++ +++++++
Sbjct: 308 IPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDI 367
Query: 73 NNNNNNNNNNNNNNEEK 89
+ ++ + +E
Sbjct: 368 DGEDSVSLVTAKKQKEP 384
Score = 35.3 bits (81), Expect = 0.075
Identities = 15/75 (20%), Positives = 40/75 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K + + K + ++ + E+ +E+K +E+ KK KK KK N + +++++ +++
Sbjct: 303 KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDS 362
Query: 81 NNNNNNEEKGTFLYK 95
++++ + E L
Sbjct: 363 DDSDIDGEDSVSLVT 377
Score = 32.6 bits (74), Expect = 0.58
Identities = 14/50 (28%), Positives = 22/50 (44%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
D + K + E+ +E ++EK ++E KK KK KK N
Sbjct: 302 DKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351
Score = 31.5 bits (71), Expect = 1.4
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKE 48
D + K K+KKKK K KKK +K K+
Sbjct: 235 DGDEEKSKKKKKKLAKNKKKLDDDKKGKRG 264
Score = 31.1 bits (70), Expect = 1.6
Identities = 17/77 (22%), Positives = 29/77 (37%), Gaps = 24/77 (31%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK------------------------KKKK 56
+ +K E++ K KK KK + KK K KK+K
Sbjct: 323 SEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQK 382
Query: 57 KKKKKNNNNNNNNNNNN 73
+ KK+ ++N ++ N
Sbjct: 383 EPKKEEPVDSNPSSPGN 399
Score = 30.3 bits (68), Expect = 3.1
Identities = 14/42 (33%), Positives = 22/42 (52%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
D + +K ++ +K +KKK+K K KKK KK K+
Sbjct: 222 DEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263
Score = 28.8 bits (64), Expect = 9.1
Identities = 20/75 (26%), Positives = 27/75 (36%), Gaps = 12/75 (16%)
Query: 1 MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKE-----------KKKEKKKKEK 49
M + K+ F + + K + K+ K K E K E +EK
Sbjct: 182 MMKAAKNGPAAFGDEDEET-EGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEK 240
Query: 50 KKKKKKKKKKKKNNN 64
KKKKKK K K
Sbjct: 241 SKKKKKKLAKNKKKL 255
Score = 28.8 bits (64), Expect = 9.8
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKK 61
+EK K+KKKK K KKK KK
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDKKG 261
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 38.0 bits (89), Expect = 0.004
Identities = 18/59 (30%), Positives = 35/59 (59%), Gaps = 17/59 (28%)
Query: 19 DWKKAKK--KEKKKKEKKAKK-------------KEKKKEKKKKEKK--KKKKKKKKKK 60
D KKAKK KEK+K+ K+A+K +E K EK +++++ ++++ + ++K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQK 71
Score = 33.7 bits (78), Expect = 0.11
Identities = 11/53 (20%), Positives = 31/53 (58%), Gaps = 12/53 (22%)
Query: 21 KKAKKKEKKKKEKKAKK------------KEKKKEKKKKEKKKKKKKKKKKKK 61
KKAKK+++K++++ K +E K EK +++++ ++++ + ++
Sbjct: 18 KKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQ 70
Score = 33.7 bits (78), Expect = 0.12
Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 14/48 (29%)
Query: 29 KKKEKKAKKKEKKKEKKKK--------------EKKKKKKKKKKKKKN 62
KKK KKAKK+++K+ K+ + E+ K +K ++ ++ N
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELN 61
Score = 29.5 bits (67), Expect = 3.3
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKK 58
KK+ KK KK+K K++K+ +K
Sbjct: 12 VDKKKAKKAKKEKRKQRKQARKGA 35
Score = 28.7 bits (65), Expect = 6.2
Identities = 8/21 (38%), Positives = 16/21 (76%)
Query: 43 EKKKKEKKKKKKKKKKKKKNN 63
+KK K+ KK+K+K++K+ +
Sbjct: 14 KKKAKKAKKEKRKQRKQARKG 34
>gnl|CDD|218858 pfam06024, DUF912, Nucleopolyhedrovirus protein of unknown function
(DUF912). This family consists of several
Nucleopolyhedrovirus proteins of unknown function.
Length = 101
Score = 36.5 bits (85), Expect = 0.004
Identities = 11/50 (22%), Positives = 24/50 (48%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPLVLVLTYVAEIT 111
N++N++NN N NN + N+ + + K A ++++ +A
Sbjct: 24 NDDNDDNNAPINANNTQSVYPKNDITNEDSNASKANAGNIILIGLLAFFC 73
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 38.2 bits (89), Expect = 0.004
Identities = 17/43 (39%), Positives = 21/43 (48%), Gaps = 4/43 (9%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK----EKKKKEKKKKKKKKKKK 59
K+ K KK K KAKK KK EKK E + + +KK
Sbjct: 110 KEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKA 152
Score = 34.4 bits (79), Expect = 0.075
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
++ KK K + KK KKA + +KK + + + +KK
Sbjct: 114 VNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKA 152
Score = 30.1 bits (68), Expect = 1.9
Identities = 20/55 (36%), Positives = 27/55 (49%), Gaps = 6/55 (10%)
Query: 10 GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
G FTE +A+ K + KE+K K KK+K K KK KK + +K N
Sbjct: 92 GAFTE------AQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVN 140
>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein. This is a novel
repeat in Pneumocystis carinii Major surface
glycoprotein (MSG) some members of the alignment have
up to nine repeats of this family, the repeats
containing several conserved cysteines. The MSG of P.
carinii is an important protein in host-pathogen
interactions. Surface glycoprotein A from Pneumocystis
carinii is a main target for the host immune system,
this protein is implicated in the attachment of
Pneumocystis carinii to the host alveolar epithelial
cells, alveolar macrophages, host surfactant and
possibly accounts in part for the hypoxia seen in
Pneumocystis carinii pneumonia (PCP).
Length = 81
Score = 35.9 bits (83), Expect = 0.004
Identities = 18/65 (27%), Positives = 27/65 (41%)
Query: 7 HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+ +K K+K K K+ ++ KE K + KKKKK KK K+K
Sbjct: 17 EFLEEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKKKKKCKKALKEKCTKLK 76
Query: 67 NNNNN 71
+N
Sbjct: 77 KESNE 81
Score = 26.6 bits (59), Expect = 8.5
Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 9/61 (14%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK---------KKKNNNNN 66
L+ K + +E + ++A K+K +K KEK K KK K K +
Sbjct: 3 FKLEEKCKELEEYCEFLEEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKKKK 62
Query: 67 N 67
Sbjct: 63 K 63
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 39.7 bits (93), Expect = 0.005
Identities = 14/41 (34%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ KK + K +K A+K +K K +K K K+ KKK
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKK 810
Score = 39.3 bits (92), Expect = 0.006
Identities = 13/42 (30%), Positives = 19/42 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K K + KK+ + KK K +K K K+ KKK
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
Score = 36.3 bits (84), Expect = 0.049
Identities = 14/50 (28%), Positives = 20/50 (40%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
EP + K K K K+ K K+ + +K K K+ KKK
Sbjct: 764 EPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
Score = 35.5 bits (82), Expect = 0.089
Identities = 16/47 (34%), Positives = 21/47 (44%), Gaps = 7/47 (14%)
Query: 21 KKAKKKEKKKKEKKAKKKEK-------KKEKKKKEKKKKKKKKKKKK 60
KK +K K ++ KK + EKK K K KK +K KK
Sbjct: 743 KKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK 789
Score = 34.7 bits (80), Expect = 0.16
Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 9/50 (18%)
Query: 22 KAKKKEKKKKEKKAKKKEKK---------KEKKKKEKKKKKKKKKKKKKN 62
KAKK + KK K ++ KK + +KK + K KK +K KK
Sbjct: 741 KAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKP 790
Score = 33.2 bits (76), Expect = 0.37
Identities = 15/48 (31%), Positives = 18/48 (37%), Gaps = 2/48 (4%)
Query: 10 GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
F K AKK +K KK K +K K K+ KKK
Sbjct: 767 SAFRGEKKAKPKAAKKDARKA--KKPSAKTQKIAAATKAKRAAKKKVA 812
Score = 30.9 bits (70), Expect = 2.0
Identities = 16/55 (29%), Positives = 22/55 (40%), Gaps = 9/55 (16%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK---------KKEKKKKKKKKKKKKKNNNNN 66
K A++K KK K K ++ KK + EKK K K KK +
Sbjct: 736 KTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKP 790
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The
function is unclear.
Length = 151
Score = 37.7 bits (87), Expect = 0.005
Identities = 16/39 (41%), Positives = 25/39 (64%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
K++++ K + +K +KKKKEK +KK K KKK+ N
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50
Score = 37.3 bits (86), Expect = 0.005
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+++A K++K+ K +K KK+K+K +KK K KKK++K N
Sbjct: 6 YEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50
Score = 29.2 bits (65), Expect = 3.3
Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 3/39 (7%)
Query: 33 KKAKKKEKKKEKKKKEKKKKK---KKKKKKKKNNNNNNN 68
K+ K+ K + +K +KKK+K KK K KKK N
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 36.4 bits (84), Expect = 0.005
Identities = 16/36 (44%), Positives = 20/36 (55%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K + K KK EK K K+KKK KKKKK++
Sbjct: 1 MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEA 36
Score = 34.1 bits (78), Expect = 0.041
Identities = 14/37 (37%), Positives = 21/37 (56%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
A K + K++K K K + KKK KKKKK++ +
Sbjct: 1 MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37
Score = 33.0 bits (75), Expect = 0.081
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
K + K+K+ + K K+KKK KKKKK++ +
Sbjct: 1 MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37
Score = 29.9 bits (67), Expect = 1.0
Identities = 15/36 (41%), Positives = 22/36 (61%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K + KEKKA+K + K KK+ KKKKK++ +
Sbjct: 2 APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37
Score = 29.9 bits (67), Expect = 1.2
Identities = 13/41 (31%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K K K+KK K+KKKE+ ++ ++ K +K
Sbjct: 12 KAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRK 52
Score = 28.7 bits (64), Expect = 2.7
Identities = 14/41 (34%), Positives = 20/41 (48%), Gaps = 2/41 (4%)
Query: 22 KAKKKEKKK--KEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K KK EK K K KK KKK+K++ + ++ K
Sbjct: 9 KEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDK 49
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
7 homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 36.9 bits (86), Expect = 0.005
Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 13/59 (22%)
Query: 19 DWKKAKKKEKKKKE-------------KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
D ++ ++KE+K+KE K + K +KE + +K+KKKKKKK+ N
Sbjct: 31 DKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELKN 89
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 36.3 bits (84), Expect = 0.006
Identities = 11/43 (25%), Positives = 25/43 (58%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ KKK++ K + K +K ++K KK+ + + ++ K++
Sbjct: 21 IKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63
Score = 35.1 bits (81), Expect = 0.012
Identities = 16/46 (34%), Positives = 24/46 (52%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
D KK KKKK++ AK K KK ++K KKK + + + +
Sbjct: 16 DIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61
Score = 34.8 bits (80), Expect = 0.018
Identities = 17/51 (33%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 21 KKAKKKEKKK-----KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
KK +K +K+ K +KK+KK+ K K KK +KK KK +
Sbjct: 4 KKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFP 54
Score = 31.7 bits (72), Expect = 0.22
Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 7/48 (14%)
Query: 26 KEKKKKEKKAKKK-------EKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
KE+KK EK K+ KK++KKKK K + K KK +
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDK 48
Score = 30.9 bits (70), Expect = 0.40
Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 2/45 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
KK++KKKK K + K KK +KK KKK + + +
Sbjct: 19 SNIKKRKKKKKRTA--KTARPKATKKGQKKDKKKDEFPEFPEESK 61
Score = 30.1 bits (68), Expect = 0.80
Identities = 13/43 (30%), Positives = 22/43 (51%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+K+KK+KK K + + KK +KK KKK + + +
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 38.9 bits (91), Expect = 0.006
Identities = 12/30 (40%), Positives = 22/30 (73%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
K K K ++ KEK+ K+ E++ +++K+ KKK
Sbjct: 276 KRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
Score = 38.5 bits (90), Expect = 0.007
Identities = 13/35 (37%), Positives = 24/35 (68%), Gaps = 1/35 (2%)
Query: 22 KAKKKEKKKKEKKAK-KKEKKKEKKKKEKKKKKKK 55
K + ++K K ++ K K+ K+ E++ KE+K+ KKK
Sbjct: 271 KPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
Score = 38.1 bits (89), Expect = 0.009
Identities = 14/38 (36%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 21 KKAKKKEKKKKEKKAKKKEKK-KEKKKKEKKKKKKKKK 57
K K+K K + KEK+ KE +++ K++K+ KKK
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
Score = 37.8 bits (88), Expect = 0.011
Identities = 12/43 (27%), Positives = 25/43 (58%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+ K + K+ K ++ KEK++KE +++ K++K+ KK
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308
Score = 37.0 bits (86), Expect = 0.021
Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 9/56 (16%)
Query: 21 KKAKKKEKKKKEKKAKK---------KEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
K+ K+ E++ KE+K K KE KE +KEK + +KK+++K++
Sbjct: 287 KRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKL 342
Score = 33.5 bits (77), Expect = 0.28
Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 10/49 (20%)
Query: 20 WKKAKKKE----------KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+K KK+ K+ +K K + +KKE++K+ +KKK K++K
Sbjct: 298 EEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRK 346
Score = 33.1 bits (76), Expect = 0.37
Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 6/44 (13%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKK------KKKKKKKKKNNN 64
E + K + K K K ++ KEK++K K++K+ KKK
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308
Score = 33.1 bits (76), Expect = 0.37
Identities = 16/58 (27%), Positives = 32/58 (55%), Gaps = 16/58 (27%)
Query: 21 KKAKKKEKKKKEKKAKKKEKK----------------KEKKKKEKKKKKKKKKKKKKN 62
K+ ++KE +++ K+ K+ +KK KEK + KK+++K++ +KKK
Sbjct: 285 KEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKL 342
Score = 32.8 bits (75), Expect = 0.44
Identities = 8/43 (18%), Positives = 22/43 (51%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ E + E + K + ++K K ++ K+K++K+ ++
Sbjct: 252 ESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELER 294
Score = 29.7 bits (67), Expect = 4.7
Identities = 17/51 (33%), Positives = 22/51 (43%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
F T D KKK +KK +K K E + + K KK+ KKK
Sbjct: 43 FVVDTTGDEATIKKKRRKKLKKPLKSDEILENRSKVPAPSVKKRHVKKKSK 93
Score = 28.9 bits (65), Expect = 8.5
Identities = 18/50 (36%), Positives = 22/50 (44%), Gaps = 6/50 (12%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKE------KKKKEKKKKKKKKKKKKK 61
+D + KKK+ KK KK K E K KK+ KKK KK
Sbjct: 45 VDTTGDEATIKKKRRKKLKKPLKSDEILENRSKVPAPSVKKRHVKKKSKK 94
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 39.3 bits (91), Expect = 0.006
Identities = 14/70 (20%), Positives = 36/70 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K++KK ++ + K A +E+K ++K +K++K KK ++ + + + ++ +
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167
Query: 81 NNNNNNEEKG 90
+ NE +
Sbjct: 168 VTSLENESEE 177
Score = 32.7 bits (74), Expect = 0.61
Identities = 12/46 (26%), Positives = 25/46 (54%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
E LD + K + +++ A + ++EK +K+ +K++K KK
Sbjct: 102 EDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMD 147
Score = 29.2 bits (65), Expect = 7.2
Identities = 12/43 (27%), Positives = 22/43 (51%), Gaps = 11/43 (25%)
Query: 32 EKKAKKKEKKKEKK-----------KKEKKKKKKKKKKKKKNN 63
+K++KK ++ +K K EKK +K++K KK +
Sbjct: 107 DKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDED 149
Score = 28.9 bits (64), Expect = 9.5
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
A KE KK ++ ++K +E+K +KK +K++K
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRK 142
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 39.1 bits (91), Expect = 0.006
Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 4/76 (5%)
Query: 10 GTFTEPTPLDWKKAKKKE---KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
G+ TE + K ++E KKKKE+KAK+KE KK K +++ K K + ++ N
Sbjct: 1 GSRTESEA-EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVP 59
Query: 67 NNNNNNNNNNNNNNNN 82
+ + + + N
Sbjct: 60 KKSEKKSRKRDVEDEN 75
Score = 36.8 bits (85), Expect = 0.038
Identities = 18/69 (26%), Positives = 37/69 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+A+KK ++E + KKK+++K K+K+ KK K +K+ K K ++ N +
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 81 NNNNNNEEK 89
+ + E++
Sbjct: 66 SRKRDVEDE 74
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 39.5 bits (92), Expect = 0.006
Identities = 21/84 (25%), Positives = 36/84 (42%), Gaps = 2/84 (2%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK--KKNNNNNNNNNNNNNN 75
LD + AK +E ++K ++A + + KK ++ KK KK KK + + ++
Sbjct: 1168 LDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSS 1227
Query: 76 NNNNNNNNNNNEEKGTFLYKMAAP 99
N + KG K AP
Sbjct: 1228 AMETENVAEVVKPKGRAGAKKKAP 1251
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 39.3 bits (92), Expect = 0.006
Identities = 15/93 (16%), Positives = 36/93 (38%), Gaps = 2/93 (2%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
+ ++ K + ++E+ + E+ + E++K K++ K + +
Sbjct: 807 LIDQIIESEEERLKALRIQREEMLMRPEE--LELINEEQKNLKQEIKLELSEIQEAEEEI 864
Query: 72 NNNNNNNNNNNNNNNEEKGTFLYKMAAPLVLVL 104
N N N N N+ K +A +++ L
Sbjct: 865 QNINENKNEFVEFKNDPKKLNKLIIAKDVLIKL 897
Score = 37.0 bits (86), Expect = 0.027
Identities = 9/85 (10%), Positives = 30/85 (35%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+ + P + + ++++ K + + ++ + +E + +++K K+
Sbjct: 792 SDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIK 851
Query: 65 NNNNNNNNNNNNNNNNNNNNNNEEK 89
+ N N N N +
Sbjct: 852 LELSEIQEAEEEIQNINENKNEFVE 876
Score = 35.4 bits (82), Expect = 0.088
Identities = 12/92 (13%), Positives = 35/92 (38%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+ +++ K + ++ + + +E + +++K K++ K + + N N N N
Sbjct: 813 ESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKN 872
Query: 81 NNNNNNEEKGTFLYKMAAPLVLVLTYVAEITQ 112
+ + A VL+ ++
Sbjct: 873 EFVEFKNDPKKLNKLIIAKDVLIKLVISSDEI 904
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length. There are two completely conserved L residues
that may be functionally important.
Length = 70
Score = 35.4 bits (82), Expect = 0.006
Identities = 15/35 (42%), Positives = 28/35 (80%), Gaps = 2/35 (5%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
KAKKKE K +E K++E+++ +K++E+K+++K K
Sbjct: 23 KAKKKELKAEE--EKREEEEEARKREERKEREKNK 55
Score = 32.7 bits (75), Expect = 0.059
Identities = 13/37 (35%), Positives = 27/37 (72%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
AK K KKK+ K ++K +++E+ +K +++K+++K K
Sbjct: 19 LAKLKAKKKELKAEEEKREEEEEARKREERKEREKNK 55
Score = 32.3 bits (74), Expect = 0.066
Identities = 13/33 (39%), Positives = 27/33 (81%), Gaps = 1/33 (3%)
Query: 34 KAKKKE-KKKEKKKKEKKKKKKKKKKKKKNNNN 65
KAKKKE K +E+K++E+++ +K++++K++ N
Sbjct: 23 KAKKKELKAEEEKREEEEEARKREERKEREKNK 55
Score = 30.4 bits (69), Expect = 0.33
Identities = 12/41 (29%), Positives = 26/41 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K+ + AK K KKKE K +E+K++++++ +K++
Sbjct: 6 EPVTLKDMLNADLLAKLKAKKKELKAEEEKREEEEEARKRE 46
Score = 27.3 bits (61), Expect = 4.5
Identities = 7/24 (29%), Positives = 19/24 (79%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEK 44
++ K++E+++ K+ ++KE++K K
Sbjct: 32 EEEKREEEEEARKREERKEREKNK 55
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 38.1 bits (89), Expect = 0.006
Identities = 13/62 (20%), Positives = 24/62 (38%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
PT + + K + ++++ KK KKK+KKK KK + ++
Sbjct: 164 PTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGE 223
Query: 75 NN 76
Sbjct: 224 AK 225
Score = 37.3 bits (87), Expect = 0.011
Identities = 11/64 (17%), Positives = 22/64 (34%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
+++ KKKK+KK KK +K + + + + +N
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDN 237
Query: 84 NNNE 87
+E
Sbjct: 238 EKSE 241
Score = 33.5 bits (77), Expect = 0.25
Identities = 11/54 (20%), Positives = 21/54 (38%), Gaps = 2/54 (3%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKK--EKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+ + +E++AKKK+KKK+KK K + ++
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Score = 33.1 bits (76), Expect = 0.28
Identities = 12/55 (21%), Positives = 18/55 (32%), Gaps = 3/55 (5%)
Query: 21 KKAKKKEKK---KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
KK KKK KK + + + KK KK + +N +
Sbjct: 186 KKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDNEKS 240
Score = 32.7 bits (75), Expect = 0.37
Identities = 11/57 (19%), Positives = 20/57 (35%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
K + ++E+ KKK+KKK+K KK + ++
Sbjct: 169 VELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 38.8 bits (91), Expect = 0.007
Identities = 18/48 (37%), Positives = 22/48 (45%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
FT K K + KKK+KKA K + +E K KKK KK
Sbjct: 346 FTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
Score = 36.9 bits (86), Expect = 0.026
Identities = 19/47 (40%), Positives = 24/47 (51%), Gaps = 7/47 (14%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKE-------KKKKEKKKKKKKKKKKKK 61
A KK KK KAK K+KKK+ K + ++ K KKK KK
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
Score = 36.5 bits (85), Expect = 0.034
Identities = 16/50 (32%), Positives = 23/50 (46%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+F T KK+ K K KK+KK + + ++ K KKK KK
Sbjct: 344 SFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
Score = 31.1 bits (71), Expect = 1.6
Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 6/58 (10%)
Query: 21 KKAKKKEKKKKEK------KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
++ KK + KK + A KK KK K + KKKKKK + + +
Sbjct: 328 ERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKS 385
Score = 30.4 bits (69), Expect = 2.6
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 6/58 (10%)
Query: 25 KKEKKKKEKKAKK------KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
++ KK K KK + KK KK K K KKKKKK + + + +
Sbjct: 328 ERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKS 385
Score = 29.2 bits (66), Expect = 6.5
Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 8/58 (13%)
Query: 30 KKEKKAKKKEK--------KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
++ KKAK K+ KK +K K K KKKKK + + + +
Sbjct: 328 ERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKS 385
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 39.0 bits (91), Expect = 0.007
Identities = 16/75 (21%), Positives = 29/75 (38%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+ + K KK +E + K++K + E + K K + +N N N++ N
Sbjct: 362 DENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKT 421
Query: 81 NNNNNNEEKGTFLYK 95
N N L +
Sbjct: 422 ENILTNSPLKDELLE 436
Score = 36.3 bits (84), Expect = 0.046
Identities = 19/79 (24%), Positives = 34/79 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK ++ + K KK E + + + K K ++ + K N N N++ N N N+
Sbjct: 369 KKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKTENILTNS 428
Query: 81 NNNNNNEEKGTFLYKMAAP 99
+ EK T + + P
Sbjct: 429 PLKDELLEKTTEIINIENP 447
Score = 33.6 bits (77), Expect = 0.30
Identities = 11/37 (29%), Positives = 14/37 (37%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
+AK K K K EK KK +K K+
Sbjct: 674 INEAKIYWKAINNTKKSVVIKPLEKPKKYEKTSVSKE 710
Score = 31.7 bits (72), Expect = 1.0
Identities = 11/62 (17%), Positives = 23/62 (37%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
++ KEKK + + + K + ++ + K K N++ N N N
Sbjct: 368 FKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKTENILTN 427
Query: 77 NN 78
+
Sbjct: 428 SP 429
Score = 29.8 bits (67), Expect = 4.3
Identities = 16/81 (19%), Positives = 24/81 (29%), Gaps = 8/81 (9%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
+ K+KK + E K + K E+ + K K K + N N N+
Sbjct: 369 KKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKTENILTNS 428
Query: 75 NN--------NNNNNNNNNNE 87
N N E
Sbjct: 429 PLKDELLEKTTEIINIENPQE 449
>gnl|CDD|205936 pfam13763, DUF4167, Domain of unknown function (DUF4167).
Length = 80
Score = 35.4 bits (82), Expect = 0.007
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
NNNN NNN + N N ++N
Sbjct: 1 RGRNNNNRNNNRSGGNPRNRVFDSN 25
Score = 35.0 bits (81), Expect = 0.011
Identities = 10/26 (38%), Positives = 12/26 (46%)
Query: 65 NNNNNNNNNNNNNNNNNNNNNNEEKG 90
NNNN NNN + N N + G
Sbjct: 1 RGRNNNNRNNNRSGGNPRNRVFDSNG 26
Score = 33.8 bits (78), Expect = 0.027
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
+ NNN NNN + N N ++N + + +GT
Sbjct: 3 RNNNNRNNNRSGGNPRNRVFDSNGPDVKVRGT 34
Score = 30.7 bits (70), Expect = 0.29
Identities = 8/23 (34%), Positives = 12/23 (52%)
Query: 59 KKKNNNNNNNNNNNNNNNNNNNN 81
+ NN NNN + N N ++N
Sbjct: 3 RNNNNRNNNRSGGNPRNRVFDSN 25
Score = 28.8 bits (65), Expect = 1.8
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 58 KKKKNNNNNNNNNNNNNNNNNNN 80
+ N NNN + N N ++N
Sbjct: 3 RNNNNRNNNRSGGNPRNRVFDSN 25
Score = 28.0 bits (63), Expect = 3.2
Identities = 9/22 (40%), Positives = 10/22 (45%)
Query: 71 NNNNNNNNNNNNNNNNEEKGTF 92
NNNN NNN + N F
Sbjct: 1 RGRNNNNRNNNRSGGNPRNRVF 22
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 37.9 bits (88), Expect = 0.007
Identities = 19/49 (38%), Positives = 26/49 (53%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
E + K EK K KA+ E+ K+ +EKK++KKK KKKK
Sbjct: 29 NPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
Score = 32.8 bits (75), Expect = 0.28
Identities = 14/47 (29%), Positives = 25/47 (53%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
P + K ++K ++ K K + E+ KK ++KK++KKK K
Sbjct: 28 GNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74
Score = 32.8 bits (75), Expect = 0.31
Identities = 17/57 (29%), Positives = 23/57 (40%), Gaps = 14/57 (24%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK--------------KKKKKKKKKKK 61
+K K K + K ++K EK K K KK++KKK KKK
Sbjct: 20 KYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKK 76
Score = 31.3 bits (71), Expect = 1.0
Identities = 10/38 (26%), Positives = 16/38 (42%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+A + EK K K E + ++K +K K K
Sbjct: 14 RALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIK 51
Score = 30.5 bits (69), Expect = 1.6
Identities = 15/62 (24%), Positives = 22/62 (35%), Gaps = 18/62 (29%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK------------------KKKKKK 59
L+ +K K + K + K + + +K K K KKK KK
Sbjct: 16 LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKK 75
Query: 60 KK 61
KK
Sbjct: 76 KK 77
Score = 29.0 bits (65), Expect = 5.2
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKK 59
K K KK +K KKKKKKK+
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217
Score = 28.2 bits (63), Expect = 9.3
Identities = 13/21 (61%), Positives = 13/21 (61%)
Query: 42 KEKKKKEKKKKKKKKKKKKKN 62
K K KK KK KKKKKKK
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKR 216
Score = 28.2 bits (63), Expect = 9.6
Identities = 11/48 (22%), Positives = 21/48 (43%), Gaps = 2/48 (4%)
Query: 19 DWKKAKK--KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
D+ +AK+ + + ++ KA K + E K + K +K K
Sbjct: 5 DFFRAKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKA 52
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 38.2 bits (89), Expect = 0.008
Identities = 13/41 (31%), Positives = 18/41 (43%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KAK+ ++ +K AK +K K K KK KK
Sbjct: 172 KAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKM 212
Score = 35.9 bits (83), Expect = 0.043
Identities = 15/82 (18%), Positives = 34/82 (41%), Gaps = 3/82 (3%)
Query: 21 KKAKKKEKKKKEKKAKK---KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K + E+ KK KK + K KE K K++KK ++ ++ ++ N++ +
Sbjct: 109 KAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTP 168
Query: 78 NNNNNNNNNEEKGTFLYKMAAP 99
+ + K+++
Sbjct: 169 HLLKAKEGPSRRAKKAAKLSST 190
Score = 35.1 bits (81), Expect = 0.072
Identities = 18/76 (23%), Positives = 38/76 (50%), Gaps = 5/76 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKK-----KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
KK K K + + KK KK E+K KE K+K++KK +++ + ++ ++ N
Sbjct: 102 KKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPN 161
Query: 76 NNNNNNNNNNNEEKGT 91
++ + + ++G
Sbjct: 162 SSRPSTPHLLKAKEGP 177
Score = 33.6 bits (77), Expect = 0.19
Identities = 15/76 (19%), Positives = 27/76 (35%), Gaps = 5/76 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK-----KKKKKKKKNNNNNNNNNNNNNN 75
KKA K K + K KK KK +K + + + + N+ N +
Sbjct: 182 KKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRKWDLDGDEDDDAVLDYSAPDANDENADA 241
Query: 76 NNNNNNNNNNNEEKGT 91
+ + + +GT
Sbjct: 242 PEDVEEVDQESWGRGT 257
Score = 33.6 bits (77), Expect = 0.22
Identities = 11/34 (32%), Positives = 14/34 (41%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KK K + EK + K KK KK +K
Sbjct: 181 AKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
Score = 33.6 bits (77), Expect = 0.22
Identities = 12/37 (32%), Positives = 15/37 (40%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ KK K + EK K K KK KK +K
Sbjct: 178 SRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
Score = 31.3 bits (71), Expect = 1.2
Identities = 8/37 (21%), Positives = 16/37 (43%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
++ KK + + ++ K + KK KK +K
Sbjct: 178 SRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
Score = 30.5 bits (69), Expect = 2.2
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+++ +K K +K K K KK KK +K
Sbjct: 178 SRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 38.0 bits (88), Expect = 0.008
Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 1/68 (1%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEK-KKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
T + KKA K +K K K KK K EK + KK+ KK KK+ +
Sbjct: 35 TAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFE 94
Query: 70 NNNNNNNN 77
+N N
Sbjct: 95 ASNKLFKN 102
Score = 37.2 bits (86), Expect = 0.017
Identities = 19/51 (37%), Positives = 21/51 (41%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
T E P K KE K +K AK K KK + KK K K KK
Sbjct: 12 TTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKK 62
Score = 35.7 bits (82), Expect = 0.047
Identities = 20/61 (32%), Positives = 25/61 (40%), Gaps = 2/61 (3%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
K AKK K KK K K KK + +K E KK+ KK K + +N
Sbjct: 42 KAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFK 101
Query: 79 N 79
N
Sbjct: 102 N 102
Score = 34.2 bits (78), Expect = 0.15
Identities = 16/51 (31%), Positives = 18/51 (35%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
T ++ K KK K K KK K K KK K KK K
Sbjct: 4 TKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTK 54
Score = 33.4 bits (76), Expect = 0.26
Identities = 15/40 (37%), Positives = 18/40 (45%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KK K K AKK K K+ K + KK K +K
Sbjct: 31 KTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKT 70
Score = 33.4 bits (76), Expect = 0.27
Identities = 14/44 (31%), Positives = 20/44 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K AK K +K AK K+ K K +K K +K + K +
Sbjct: 34 KTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKES 77
Score = 33.4 bits (76), Expect = 0.27
Identities = 19/51 (37%), Positives = 21/51 (41%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
T + T + K A KK KE KK K K KK K KK K K
Sbjct: 7 TTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVK 57
Score = 31.8 bits (72), Expect = 0.70
Identities = 16/48 (33%), Positives = 19/48 (39%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
T KK K K KKA K +K K K KK K +K +
Sbjct: 25 TSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTES 72
Score = 30.7 bits (69), Expect = 1.7
Identities = 19/53 (35%), Positives = 21/53 (39%), Gaps = 4/53 (7%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAK-KKEKKKEKK---KKEKKKKKKKKKKKKK 61
T T K K KK K K K K KK K EK + KK+ KK
Sbjct: 29 TAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKK 81
Score = 30.7 bits (69), Expect = 2.1
Identities = 15/37 (40%), Positives = 17/37 (45%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E KK KK +EKK KK K+ K KK K
Sbjct: 1 MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAK 37
Score = 29.1 bits (65), Expect = 5.9
Identities = 17/71 (23%), Positives = 22/71 (30%), Gaps = 3/71 (4%)
Query: 4 STKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+ K + K K KK K +K E KKE KK KK+
Sbjct: 35 TAKTTSTKAAKKAAKVKKTKSVK---TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAE 91
Query: 64 NNNNNNNNNNN 74
+N N
Sbjct: 92 VFEASNKLFKN 102
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 38.5 bits (90), Expect = 0.008
Identities = 16/55 (29%), Positives = 32/55 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
+K K K ++ EK + ++KK++KKKE+KKKK+ ++ +++ + N
Sbjct: 311 RKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEEN 365
Score = 37.3 bits (87), Expect = 0.016
Identities = 16/44 (36%), Positives = 28/44 (63%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ + K+K + K ++ +K + + KEKKK+KKK++KKKK
Sbjct: 300 ILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343
Score = 37.3 bits (87), Expect = 0.018
Identities = 17/44 (38%), Positives = 29/44 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K+K K K E+ +K + + ++KKKEKKK++KKKK+ ++
Sbjct: 307 SDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350
Score = 36.9 bits (86), Expect = 0.022
Identities = 16/44 (36%), Positives = 28/44 (63%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
L ++ ++K K K + EK + K++KK+KKK++KKKK+
Sbjct: 301 LLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQ 344
Score = 35.4 bits (82), Expect = 0.063
Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 5/50 (10%)
Query: 21 KKAKKKEKK-----KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K K+ KK + K+K K K ++ EK + K+KKK+K
Sbjct: 290 KYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEE 339
Score = 31.9 bits (73), Expect = 0.79
Identities = 17/39 (43%), Positives = 27/39 (69%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K KKKEKKK+EKK K+ E+ +E+ +K + + K++ K
Sbjct: 328 VKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366
Score = 30.8 bits (70), Expect = 2.1
Identities = 10/43 (23%), Positives = 19/43 (44%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K ++K K K + + K+ + K+K K K ++ N
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEK 323
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 38.5 bits (90), Expect = 0.009
Identities = 17/38 (44%), Positives = 19/38 (50%)
Query: 59 KKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKM 96
K K N N NN +N NNN N NN E+KG L
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNE 249
Score = 38.2 bits (89), Expect = 0.012
Identities = 13/36 (36%), Positives = 16/36 (44%)
Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
K K+ N N NN +N NNN N NN +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLS 247
Score = 37.0 bits (86), Expect = 0.026
Identities = 12/37 (32%), Positives = 15/37 (40%)
Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
K K+ NN +N NNN N NN N+
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSN 248
Score = 36.6 bits (85), Expect = 0.038
Identities = 16/40 (40%), Positives = 23/40 (57%)
Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
K K+ K++KNN +N NNN N NN N+ +NE +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELE 251
Score = 35.5 bits (82), Expect = 0.081
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 49 KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
K K+ K++K N +N NNN N NN N+ +N E
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELE 251
Score = 34.7 bits (80), Expect = 0.15
Identities = 13/46 (28%), Positives = 18/46 (39%)
Query: 45 KKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
K KE K++K NNN N NN N+ +N +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGP 257
Score = 33.9 bits (78), Expect = 0.28
Identities = 12/44 (27%), Positives = 19/44 (43%)
Query: 44 KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
K K+ K++K NNN N NN N+ +N + +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISL 255
Score = 33.2 bits (76), Expect = 0.40
Identities = 14/49 (28%), Positives = 21/49 (42%)
Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLY 94
K ++ K++K N NNN N NN N+ +N + G Y
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEY 260
Score = 32.4 bits (74), Expect = 0.79
Identities = 10/48 (20%), Positives = 18/48 (37%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
K K+ K++K N N NN N+ +N + + +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLE 259
Score = 30.8 bits (70), Expect = 2.1
Identities = 10/28 (35%), Positives = 12/28 (42%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
+ N N NN +N NNN N K
Sbjct: 211 IKSKEININKEEKNNGSNVNNNGNKNNK 238
Score = 30.5 bits (69), Expect = 2.6
Identities = 9/43 (20%), Positives = 16/43 (37%)
Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
K KE +++K N NN N+ +N + +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDIS 254
Score = 30.5 bits (69), Expect = 3.3
Identities = 9/48 (18%), Positives = 17/48 (35%)
Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
K ++ K++K N NN N+ +N + + E
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLE 259
Score = 30.1 bits (68), Expect = 4.1
Identities = 10/43 (23%), Positives = 16/43 (37%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K K+ K+EK KNN N+ +N + +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDIS 254
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 38.0 bits (88), Expect = 0.010
Identities = 14/42 (33%), Positives = 20/42 (47%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+AK K +KK E A++K ++KK K K K K
Sbjct: 209 AEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAK 250
Score = 37.6 bits (87), Expect = 0.014
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KA ++ K K E A KK+ + E K +K K + + K K
Sbjct: 177 AKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAK 216
Score = 37.2 bits (86), Expect = 0.017
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA+ + K K EKKA+ ++K +K+K K K K
Sbjct: 208 KAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAA 247
Score = 37.2 bits (86), Expect = 0.018
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKK---EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA+ KKK E KA ++ K E + K K +KK + ++K
Sbjct: 185 AKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEK 228
Score = 36.9 bits (85), Expect = 0.026
Identities = 14/42 (33%), Positives = 21/42 (50%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KA+KK + E+KA ++KK K K K K ++K
Sbjct: 216 KAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAA 257
Score = 36.1 bits (83), Expect = 0.043
Identities = 15/39 (38%), Positives = 20/39 (51%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
EK K E +AK K +KK + E+K +KKK K
Sbjct: 201 KAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAK 239
Score = 35.3 bits (81), Expect = 0.071
Identities = 13/42 (30%), Positives = 19/42 (45%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ KK ++ K E+ K E KKK E + K +K K
Sbjct: 168 EAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKA 209
Score = 35.3 bits (81), Expect = 0.074
Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 4/45 (8%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK----EKKKKEKKKKKKKKKKKKK 61
K A +K K + E KAK ++K + EK EKKK K K K
Sbjct: 201 KAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKA 245
Score = 33.8 bits (77), Expect = 0.24
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 3/44 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
KK + E K +KAK + + K EKK + ++K +KKK
Sbjct: 193 KKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKA 236
Score = 33.0 bits (75), Expect = 0.36
Identities = 13/40 (32%), Positives = 20/40 (50%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K +E K ++AK K + KKK + + K +K K
Sbjct: 170 KKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKA 209
Score = 32.2 bits (73), Expect = 0.62
Identities = 13/71 (18%), Positives = 26/71 (36%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+A +EK EKK + K +K K ++K ++ ++ N
Sbjct: 221 AEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFGGLSSGKNAPKTGGG 280
Query: 81 NNNNNNEEKGT 91
N +++ G
Sbjct: 281 AKNASSQGSGA 291
Score = 32.2 bits (73), Expect = 0.63
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKA+ + K + AK K + KKK E+ K ++ K K
Sbjct: 147 KKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAK 186
Score = 31.8 bits (72), Expect = 0.87
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
L+ K+ +++ +K ++ KK E K K E K K + KKK
Sbjct: 129 LEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKK 172
Score = 31.5 bits (71), Expect = 1.1
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K E +KKA+ + K +K K + + K K +KK +
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222
Score = 31.5 bits (71), Expect = 1.3
Identities = 13/35 (37%), Positives = 19/35 (54%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E+KKK + AK K + K K + KKK ++ K
Sbjct: 144 EQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAK 178
Score = 31.5 bits (71), Expect = 1.4
Identities = 15/41 (36%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A + K K +AKKK ++ K +E K K + KKK
Sbjct: 155 KAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKK 195
Score = 31.1 bits (70), Expect = 1.7
Identities = 13/44 (29%), Positives = 21/44 (47%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ KK + K K +A K + E KKK ++ K ++ K K
Sbjct: 143 AEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAK 186
Score = 30.7 bits (69), Expect = 1.8
Identities = 12/41 (29%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A +++KK + KAK + + K + KKK ++ K
Sbjct: 140 KAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAA 180
Score = 30.3 bits (68), Expect = 2.5
Identities = 12/41 (29%), Positives = 19/41 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K + KK+ + K ++ K K E KKK + + K
Sbjct: 161 AKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAK 201
Score = 30.3 bits (68), Expect = 2.8
Identities = 13/71 (18%), Positives = 26/71 (36%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KKA+ ++K + KK K + K K ++K ++ ++ N
Sbjct: 219 KKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFGGLSSGKNAPKTG 278
Query: 81 NNNNNNEEKGT 91
N +G+
Sbjct: 279 GGAKNASSQGS 289
Score = 29.9 bits (67), Expect = 3.9
Identities = 12/43 (27%), Positives = 33/43 (76%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
L+ ++ K +E++K+ ++A+K+ + ++K+++E+ +K ++KKK
Sbjct: 106 LEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKK 148
Score = 29.1 bits (65), Expect = 6.2
Identities = 12/41 (29%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K E KKK ++A K ++ + K + KKK + + K
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKA 202
>gnl|CDD|224748 COG1835, COG1835, Predicted acyltransferases [Lipid metabolism].
Length = 386
Score = 38.2 bits (89), Expect = 0.010
Identities = 35/240 (14%), Positives = 61/240 (25%), Gaps = 41/240 (17%)
Query: 226 RMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELG 285
R LL + + FF+ + Y+ T L
Sbjct: 173 RRVRFALALLLLLLALASFFL-------------LLALGPFATGAALYYGTATRLWEFLL 219
Query: 286 GALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFL 345
GAL + L + +++AV S + + V + AA L
Sbjct: 220 GALAALWRPKRRPWPALLAAALALLGLALLLLAVAL-IPGSSPFFPGDLLLVPSLGAALL 278
Query: 346 THICIRLLPWMLIGEVFPNNIRATASGASGSSSY--------IFAFAVNKLYYPMLDTFH 397
L+ + + G SY + F + L
Sbjct: 279 VLAVAALVGG---SRLLSSRP----LVFLGDISYSLYLWHWPVLVF-LLYLVGLSGLPIS 330
Query: 398 LWGTLYFYAAISVVGTLYMYFVMPETEGRTLRDIEEHFADKGKTFVTNIRRAEKKRHAAR 457
+ A+S++ Y +E+ G+ V + RA AR
Sbjct: 331 ALLLILAALALSILLAALSYRF-----------VEKPLRRLGRRRVWSRIRAVLAAGRAR 379
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 37.7 bits (88), Expect = 0.010
Identities = 16/39 (41%), Positives = 25/39 (64%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK+KKKK+K KK K K+ K++K + K KK +++
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRR 292
Score = 37.0 bits (86), Expect = 0.018
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 6/48 (12%)
Query: 20 WKKAKKK-----EKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKKKK 61
+KKA K KK+KK KKK KK K KK K++K + K KK
Sbjct: 241 YKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKK 288
Score = 34.3 bits (79), Expect = 0.15
Identities = 17/40 (42%), Positives = 26/40 (65%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KKK+KKKK+ KK + KK K+ K + K KK ++++
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRE 293
Score = 34.3 bits (79), Expect = 0.16
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K KKK+KKK K K +K K++K + K KK +++++
Sbjct: 255 VKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294
Score = 33.9 bits (78), Expect = 0.17
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KKK+KKKK K K KK K+++ + K KK +++++
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294
Score = 33.9 bits (78), Expect = 0.20
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK KKK KK K K K+++ + K KK +++++ +KK
Sbjct: 259 KKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKI 299
Score = 32.7 bits (75), Expect = 0.37
Identities = 16/47 (34%), Positives = 29/47 (61%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
P+ + KK KKK+ K+ K KK K+ K + + KK +++++ +KK
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKK 298
Score = 32.3 bits (74), Expect = 0.54
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
K KK K KK ++K + K KK + +++ +KK K
Sbjct: 264 KVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300
>gnl|CDD|233166 TIGR00880, 2_A_01_02, Multidrug resistance protein.
Length = 141
Score = 36.5 bits (85), Expect = 0.010
Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 5/76 (6%)
Query: 97 AAPLVLVLTYVAEITQPHLR----GMLSATASMTTIFGTVSQLFLGSFLHWRSAAILNLL 152
A LV +A+I P R G++SA ++ + G L FL WR+ + +
Sbjct: 66 AFALVAGAALIADIYPPEERGVALGLMSAGIALGPLLGPPLGGVLAQFLGWRAPFLFLAI 125
Query: 153 FPILALCALYFI-PES 167
+ A L F+ PE+
Sbjct: 126 LALAAFILLAFLLPET 141
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 37.9 bits (88), Expect = 0.011
Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 2/70 (2%)
Query: 22 KAKKKEKKK--KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
KA KK K K ++K+ + KK K + + K ++ N+NN N + +
Sbjct: 6 KATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYS 65
Query: 80 NNNNNNNEEK 89
+ +++++EE+
Sbjct: 66 DVDSSSSEEE 75
Score = 37.9 bits (88), Expect = 0.014
Identities = 15/74 (20%), Positives = 34/74 (45%), Gaps = 6/74 (8%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKK------KKKKKKKKKKKNNNNNNNNNNNNNN 75
KA K KK + K ++++ + KK + + K + N+NN N +
Sbjct: 3 KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSK 62
Query: 76 NNNNNNNNNNNEEK 89
++ +++++ EE+
Sbjct: 63 LYSDVDSSSSEEEE 76
Score = 34.9 bits (80), Expect = 0.12
Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 2/64 (3%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
K KA KK K K ++K+ + KK N N++ N + N+NN N +EK
Sbjct: 3 KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKN--KEKL 60
Query: 91 TFLY 94
+ LY
Sbjct: 61 SKLY 64
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 38.3 bits (89), Expect = 0.011
Identities = 14/55 (25%), Positives = 23/55 (41%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
+K + K + + N N ++NNNNNN NN++ N + K
Sbjct: 19 IEKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNK 73
Score = 37.9 bits (88), Expect = 0.019
Identities = 16/56 (28%), Positives = 26/56 (46%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
+K + K E + +K ++K +NNNNNN NN++ N +N N
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNY 75
Score = 37.5 bits (87), Expect = 0.024
Identities = 17/62 (27%), Positives = 27/62 (43%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAA 98
EK + + + + N N N ++NNNNNN NN++ N +K Y +
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNYMKRS 79
Query: 99 PL 100
L
Sbjct: 80 LL 81
Score = 35.6 bits (82), Expect = 0.077
Identities = 15/56 (26%), Positives = 27/56 (48%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
+K + K + + +K ++K++NNNNNN NN++ N +N N
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNY 75
Score = 35.6 bits (82), Expect = 0.077
Identities = 16/56 (28%), Positives = 25/56 (44%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
EK + K + + +K ++K NNNNNN NN++ N +N N
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNY 75
Score = 34.4 bits (79), Expect = 0.19
Identities = 12/53 (22%), Positives = 23/53 (43%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
++ + K + + N N ++NNNNNN NN++ N ++
Sbjct: 20 EKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDN 72
Score = 31.7 bits (72), Expect = 1.3
Identities = 13/62 (20%), Positives = 23/62 (37%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
+K + K E + +K ++K NNN NN++ N +N N
Sbjct: 19 IEKSHQNNIYKVNGNNENFQFSHLNFYNEKNEEKSSNNNNNNYNNSSINRKKDNKNYMKR 78
Query: 83 NN 84
+
Sbjct: 79 SL 80
Score = 31.3 bits (71), Expect = 1.8
Identities = 11/76 (14%), Positives = 36/76 (47%), Gaps = 4/76 (5%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN----NNNNNNNN 78
K++E+ + E + ++++ ++++++ +K++++K N + N N+N
Sbjct: 2578 RKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNREVSGTDDTNKNHNTG 2637
Query: 79 NNNNNNNNEEKGTFLY 94
++ +N Y
Sbjct: 2638 HDESNYGRYSNKRNDY 2653
Score = 30.6 bits (69), Expect = 3.1
Identities = 18/68 (26%), Positives = 29/68 (42%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
K K K KE+ +EK + KK + K K + K K++ + N + N + N
Sbjct: 1067 IKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEHYN 1126
Query: 82 NNNNNEEK 89
+ EK
Sbjct: 1127 KKKKSLEK 1134
Score = 30.6 bits (69), Expect = 3.3
Identities = 24/108 (22%), Positives = 38/108 (35%), Gaps = 14/108 (12%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK-------KKKK 57
+ FT D K KE + + EK K + ++
Sbjct: 2442 RISNVDAFTNKILQDIDNEKNKENNNMNAEKIDDLIENVTSHNEKIKSELLIINDALRRV 2501
Query: 58 KKKKNNNNN-------NNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAA 98
K+KK+ N NNNNNNN+ N +N+ E + + K+
Sbjct: 2502 KEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELESHVSKLNE 2549
Score = 30.2 bits (68), Expect = 4.4
Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 2/79 (2%)
Query: 6 KHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K ++ +F +D KK K + K++ K ++K + KK E K K + K K +
Sbjct: 1055 KTKLSSFH--FNIDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVV 1112
Query: 66 NNNNNNNNNNNNNNNNNNN 84
N + N + N +
Sbjct: 1113 NADKEKNKQTEHYNKKKKS 1131
Score = 29.4 bits (66), Expect = 6.8
Identities = 19/75 (25%), Positives = 29/75 (38%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
D K K + + K KEK K++ K K K+NN NNNN + +N
Sbjct: 2343 DENYGKDKNIELNNENNSYIIKTKEKINNLKEEFSKLLKNIKRNNTLCNNNNIKDFISNI 2402
Query: 79 NNNNNNNNEEKGTFL 93
+ + + L
Sbjct: 2403 GKSVETIKQRFSSNL 2417
Score = 29.0 bits (65), Expect = 8.3
Identities = 13/71 (18%), Positives = 21/71 (29%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
D + K+ K + + + + + K KN NN NN+
Sbjct: 2308 DTNNDELKKVKLYIENITHLLNRINTLINDLDNYQDENYGKDKNIELNNENNSYIIKTKE 2367
Query: 79 NNNNNNNNEEK 89
NN K
Sbjct: 2368 KINNLKEEFSK 2378
Score = 29.0 bits (65), Expect = 9.3
Identities = 17/63 (26%), Positives = 29/63 (46%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
KK K K K++ K E+K + KK + K K N ++ + N + N + N+
Sbjct: 1068 KKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEHYNK 1127
Query: 88 EKG 90
+K
Sbjct: 1128 KKK 1130
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 36.4 bits (84), Expect = 0.011
Identities = 18/40 (45%), Positives = 30/40 (75%), Gaps = 2/40 (5%)
Query: 21 KKAKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKK 58
K+ +K EKKKK EKK K++K+E+++KE K++K+ +K
Sbjct: 63 KEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEK 102
Score = 36.0 bits (83), Expect = 0.014
Identities = 19/52 (36%), Positives = 36/52 (69%), Gaps = 2/52 (3%)
Query: 12 FTEPTPLDWKKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
F + LD KK K++K ++EK+ K++K+ EK + KKK+K++++++KK
Sbjct: 68 FEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKK 119
Score = 33.3 bits (76), Expect = 0.15
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KE +K EKK K EKK+ K++++++++K+ K++K
Sbjct: 61 SSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQK 98
Score = 32.2 bits (73), Expect = 0.33
Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 21 KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
KK K EKK K++K +++EK+ K++KE +K + KKK+K+
Sbjct: 70 KKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKK 119
Score = 30.6 bits (69), Expect = 1.0
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++KE K++K+ +K E K+K+K+ ++++KK KK K
Sbjct: 89 REKELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126
Score = 29.5 bits (66), Expect = 2.3
Identities = 14/42 (33%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKE-KKKKEKKKKKKKKKKKKKN 62
K +++ K K+ +K EKKK+ +KKE K++K+++++K+
Sbjct: 52 KESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKEL 93
Score = 29.5 bits (66), Expect = 2.6
Identities = 13/40 (32%), Positives = 30/40 (75%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ A+K+ K KE + +K+KK ++KK+ K++K+++++K+
Sbjct: 53 ESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKE 92
Score = 27.9 bits (62), Expect = 8.2
Identities = 12/35 (34%), Positives = 27/35 (77%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
+ AK++++ +K + +KKK+K++E+++K+ KK K
Sbjct: 92 ELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126
>gnl|CDD|222703 pfam14351, DUF4401, Domain of unknown function (DUF4401). This
family of proteins is found in bacteria. Proteins in
this family are typically between 357 and 735 amino
acids in length. The family is found in association with
pfam09925. There is a single completely conserved
residue K that may be functionally important.
Length = 321
Score = 37.6 bits (88), Expect = 0.012
Identities = 23/108 (21%), Positives = 34/108 (31%), Gaps = 18/108 (16%)
Query: 244 FFIGQFG------GMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYT 297
F+ Q G L A G+F+ LD LL +A + L L+
Sbjct: 57 LFLRQLALALSLAGQLLL---AFGLFDL----LDESVLAALLLLALILALLY--FLMPD- 106
Query: 298 GKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFL 345
R L +S + + PL P + + L FL
Sbjct: 107 --RLLRFLSAAAGVLALLGLLAELLLSGLAYLLLPLAPLLLVALFVFL 152
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 38.2 bits (88), Expect = 0.012
Identities = 12/46 (26%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 22 KAKKKEKKKKEKKA-KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K + K++++KE+ ++ K++E KKEK+++++ +K N
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEM 1068
Score = 36.3 bits (83), Expect = 0.055
Identities = 11/42 (26%), Positives = 27/42 (64%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K ++KE+ + ++AK++E K++K++E++ +K K+
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEM 1068
Score = 35.5 bits (81), Expect = 0.082
Identities = 11/41 (26%), Positives = 25/41 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ + ++ KE++ KKEK++E++ ++ K+ KK+
Sbjct: 1032 KERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKR 1072
Score = 32.8 bits (74), Expect = 0.62
Identities = 13/39 (33%), Positives = 27/39 (69%), Gaps = 2/39 (5%)
Query: 21 KKAKKKE--KKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
++AK++E KK+KE++ + ++ + K+ KK+ KKK+
Sbjct: 1039 QRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
Score = 31.6 bits (71), Expect = 1.3
Identities = 12/42 (28%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKKKK-EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ ++ K+++ KK K++E++ K + K+ KK+ KKK
Sbjct: 1035 MESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKK 1076
Score = 29.3 bits (65), Expect = 6.5
Identities = 10/43 (23%), Positives = 25/43 (58%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K + K + K+ K +++KE+ + ++ K+++ KK+K
Sbjct: 1009 PECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEK 1051
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 37.2 bits (87), Expect = 0.013
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 13/56 (23%)
Query: 19 DWKKAKKKEKKKKEKKAKK-------------KEKKKEKKKKEKKKKKKKKKKKKK 61
D++K KKE K +K KEKKK K+EKK K++K K ++
Sbjct: 67 DFRKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEE 122
Score = 34.9 bits (81), Expect = 0.058
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK 46
+K KKK K+EKKA K+EK K ++
Sbjct: 98 EKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 34.9 bits (81), Expect = 0.065
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKK 50
+KEKKK K +KK K+EK K E+
Sbjct: 97 AEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 33.3 bits (77), Expect = 0.18
Identities = 12/28 (42%), Positives = 20/28 (71%), Gaps = 1/28 (3%)
Query: 27 EKKKKEKKAKKK-EKKKEKKKKEKKKKK 53
E +K++KKA K EKK K++K+K ++
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 36.9 bits (86), Expect = 0.013
Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 13/56 (23%)
Query: 19 DWKKAKKKEKKKKEKKAKK-------------KEKKKEKKKKEKKKKKKKKKKKKK 61
D++K KE+K KK K KEKKK K+EKK K++K+K ++
Sbjct: 66 DFRKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEE 121
Score = 36.5 bits (85), Expect = 0.022
Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 1 MDSSTKHQIGTFTEP--TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
+ KH I ++ TP+ ++KEKKK K +KK K+EK+K E+
Sbjct: 71 LTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|222060 pfam13347, MFS_2, MFS/sugar transport protein. This family is part
of the major facilitator superfamily of membrane
transport proteins.
Length = 425
Score = 38.0 bits (89), Expect = 0.013
Identities = 46/263 (17%), Positives = 86/263 (32%), Gaps = 51/263 (19%)
Query: 105 TYVAEITQ-PHLRGMLSATASM-TTIFGTVSQLFLGSFLHWRSAAILNLLFPILAL--CA 160
+ + +T+ P R L++ + TI G + + + + + AL
Sbjct: 123 SLIPVLTRDPKERTSLASYRRIGATIGGLLVAVIAPPLVALLGGGNGAKGYFLAALIIAI 182
Query: 161 LYFIPESPHWLISQGRMQEASASLCWL--RGWVTPDKVQTELSQITKAIEESELKRLGKD 218
L + +C+ + V+ Q E + K L K+
Sbjct: 183 LAVV----------------GFLICFAGTKERVSVPPEQKEKVSLRD-----MFKALFKN 221
Query: 219 GQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLL 278
+ LL ++ +L + G + TY +G +A L +
Sbjct: 222 RP--------LLILLLL--YLLNALAMAVRNGLLLYYFTYVLG-----NAGLFSVLLLIG 266
Query: 279 LGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVF 338
A LG A L L GK+ L+ +A +++ S + V
Sbjct: 267 TIAAILG-APLWPWLAKRFGKKRTFLLGMLLAAIGLVLLFFL-------PPGSLWLFLVL 318
Query: 339 LVLAAFLTHICIRLLPWMLIGEV 361
+VLA + LLPW ++ +V
Sbjct: 319 VVLAGIGLGLAT-LLPWAMLADV 340
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 33.7 bits (77), Expect = 0.014
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
++ KKK + ++ K + + + KKK KKK K+
Sbjct: 1 NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKED 37
Score = 33.7 bits (77), Expect = 0.016
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++ KKK ++ ++ K + + KKK KKK K+ K+K
Sbjct: 2 RERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDA 43
Score = 32.9 bits (75), Expect = 0.025
Identities = 14/39 (35%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+E+KKK ++A E+ K+ + + KKK KKK K+++
Sbjct: 1 NRERKKKWREANS-ERNKDNDLRSRVKKKAKKKFGKEDS 38
Score = 30.2 bits (68), Expect = 0.26
Identities = 12/38 (31%), Positives = 20/38 (52%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
+E+K K +E E+ K + + KKK KKK ++
Sbjct: 1 NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDS 38
Score = 29.5 bits (66), Expect = 0.40
Identities = 11/36 (30%), Positives = 21/36 (58%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
W++A + K + +++ K+K K+K KE K+K
Sbjct: 8 WREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDA 43
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 35.3 bits (82), Expect = 0.016
Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
Query: 4 STKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
STK I P ++ + K++K+KK KK E + +KKEKK+ +++K +K
Sbjct: 18 STKVDISGVKVPEKIN----DEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 37.5 bits (87), Expect = 0.016
Identities = 15/60 (25%), Positives = 31/60 (51%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
P KK + K + ++ +K+ K++++KK+ +++KKKKK+ K N+
Sbjct: 261 RRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPA 320
Score = 34.4 bits (79), Expect = 0.13
Identities = 16/49 (32%), Positives = 28/49 (57%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
T +P + ++ KK+EKK + ++E K +KK+E+ + KKK K
Sbjct: 4 TVPQPFQMTKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKAN 52
Score = 33.6 bits (77), Expect = 0.26
Identities = 16/50 (32%), Positives = 29/50 (58%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
T P P K +KK+++KK + K E++ K +K++++ + KKK K
Sbjct: 3 ITVPQPFQMTKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKAN 52
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 37.9 bits (88), Expect = 0.016
Identities = 21/64 (32%), Positives = 34/64 (53%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
LD K ++ KK KK K + K+ KK + K+KKK KKK +++ +N ++ NN+
Sbjct: 46 LDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQ 105
Query: 78 NNNN 81
Sbjct: 106 FAGP 109
Score = 37.9 bits (88), Expect = 0.017
Identities = 22/86 (25%), Positives = 29/86 (33%), Gaps = 25/86 (29%)
Query: 18 LDWKKAKKKEKKKKEKKAKK-------------------------KEKKKEKKKKEKKKK 52
D KK K K+KKK +KK K K K +K KKK
Sbjct: 70 KDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKI 129
Query: 53 KKKKKKKKKNNNNNNNNNNNNNNNNN 78
K KK ++ + +NN
Sbjct: 130 TVNKSTNKKKKKVLSSKDELIKYDNN 155
Score = 34.8 bits (80), Expect = 0.12
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
KK +K +KK KK K + K +K K K+KKK KKK + +++ +N ++ NN+
Sbjct: 50 KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGP 109
Score = 34.0 bits (78), Expect = 0.26
Identities = 9/39 (23%), Positives = 14/39 (35%)
Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
EK+ K K K K+ N + + N +N
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNK 540
Score = 33.7 bits (77), Expect = 0.28
Identities = 22/90 (24%), Positives = 29/90 (32%), Gaps = 21/90 (23%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-----------KEKKKKK----------KKKKKK 59
KK K + K KK K K+KKK KKK K K +K K
Sbjct: 61 KKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKP 120
Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
K N + N ++ +E
Sbjct: 121 KTEKLKKKITVNKSTNKKKKKVLSSKDELI 150
Score = 33.3 bits (76), Expect = 0.35
Identities = 10/39 (25%), Positives = 14/39 (35%)
Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
EK+ K K K K+ KK + N +N
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNK 540
Score = 32.9 bits (75), Expect = 0.52
Identities = 9/39 (23%), Positives = 14/39 (35%)
Query: 44 KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
+KE K K K K+ K + + N +N
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNK 540
Score = 32.5 bits (74), Expect = 0.60
Identities = 9/40 (22%), Positives = 16/40 (40%)
Query: 49 KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
+K+ K K K K NN + + N +N ++
Sbjct: 503 SEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNKKQ 542
Score = 32.1 bits (73), Expect = 0.81
Identities = 9/41 (21%), Positives = 16/41 (39%)
Query: 47 KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
+K+ K K K K+NN + + N +N +
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNKKQ 542
Score = 31.3 bits (71), Expect = 1.4
Identities = 9/47 (19%), Positives = 18/47 (38%)
Query: 49 KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
+K+ K K K NN + + N +N+++ + K
Sbjct: 502 NSEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNKKQINLIIK 548
Score = 30.6 bits (69), Expect = 2.6
Identities = 11/38 (28%), Positives = 15/38 (39%)
Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+KE K K K K+ KK K + N +N
Sbjct: 503 SEKEAKLKIIKNKENNKKDTTKRITLSTTKTINKKDNK 540
Score = 29.8 bits (67), Expect = 4.2
Identities = 20/69 (28%), Positives = 32/69 (46%), Gaps = 1/69 (1%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKK-KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
KK +K KK KK K K KK K K+KKK + +++ +N ++ NN+
Sbjct: 50 KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGP 109
Query: 88 EKGTFLYKM 96
+ + K
Sbjct: 110 LAISLMRKP 118
Score = 29.8 bits (67), Expect = 5.2
Identities = 19/61 (31%), Positives = 26/61 (42%), Gaps = 2/61 (3%)
Query: 3 SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K+ F P + + K K K +K KK K K KKK+K K + K N
Sbjct: 97 FDSKNNSKQFAGPLAISLMR-KPKPKTEKLKK-KITVNKSTNKKKKKVLSSKDELIKYDN 154
Query: 63 N 63
N
Sbjct: 155 N 155
>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
chromatin remodeling [Chromatin structure and dynamics].
Length = 237
Score = 37.0 bits (86), Expect = 0.016
Identities = 12/65 (18%), Positives = 23/65 (35%)
Query: 3 SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
S + + +K K KE+ + K+ + + K+ KKKK
Sbjct: 56 DSLAEEPRVLRKEKYNITRKTTGKNDLPKEEDSSLPSSKETENGDTEGKETDKKKKSSTI 115
Query: 63 NNNNN 67
+ N+
Sbjct: 116 SKNSP 120
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein
7 (Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal
RNA recognition motif (RRM), also termed RBD (RNA
binding domain) or RNP (ribonucleoprotein domain), and
a C-terminal RRP7 domain.
Length = 118
Score = 35.3 bits (82), Expect = 0.017
Identities = 11/33 (33%), Positives = 22/33 (66%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ +K K ++ + K ++K+KKKKKKK+ ++
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90
Score = 33.8 bits (78), Expect = 0.051
Identities = 11/32 (34%), Positives = 22/32 (68%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+ ++ K ++E + K K+++KKKKKKK+ +
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89
Score = 33.8 bits (78), Expect = 0.060
Identities = 15/59 (25%), Positives = 28/59 (47%), Gaps = 13/59 (22%)
Query: 19 DWKKAKKKEKKKKE-------------KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
++ +K+E K+ K K ++E + + K+K+KKKKKKK+ +
Sbjct: 32 KREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90
Score = 30.7 bits (70), Expect = 0.76
Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 9/53 (16%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKE-----KKKEKKKKE----KKKKKKKKKK 58
+ +AK KEK+KK+KK K+ E + +EKKK+E +KK ++ KK+
Sbjct: 65 TAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEELAELRKKFEEDKKR 117
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 36.5 bits (85), Expect = 0.019
Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 10/53 (18%)
Query: 19 DWKKAKKKEKKKKEKKAKK----------KEKKKEKKKKEKKKKKKKKKKKKK 61
D+KK KE+K K K K +K+EKK K++KK K +K+K
Sbjct: 66 DFKKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEK 118
Score = 33.9 bits (78), Expect = 0.14
Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 1 MDSSTKHQIGTFTEP--TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
M K I F++ T + +KE+KK K +KK K EK+K E++
Sbjct: 71 MTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 37.1 bits (86), Expect = 0.020
Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 6/52 (11%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN------NNNNNNNNN 74
+ ++ +E K+E +KK+K KKK +K+K +NN N
Sbjct: 339 DEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLKLRNKRKKGFK 390
Score = 35.2 bits (81), Expect = 0.075
Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 6/62 (9%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN------NNNNNNN 75
++ + ++ ++ K++ EKK+K KKK +K+K N +NN N
Sbjct: 329 SEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLKLRNKRKKG 388
Query: 76 NN 77
Sbjct: 389 FK 390
Score = 34.0 bits (78), Expect = 0.18
Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 6/68 (8%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN------N 71
LD K + +++ E+ K+E +KK+K KKK +K N +NN
Sbjct: 323 LDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLKLR 382
Query: 72 NNNNNNNN 79
N
Sbjct: 383 NKRKKGFK 390
Score = 29.4 bits (66), Expect = 5.9
Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 11/55 (20%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKK-----------KEKKKKKKKKKKK 59
D + +E K++ +K +K +KK K+K K + K+KK K +
Sbjct: 338 IDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLKLRNKRKKGFKGR 392
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 37.7 bits (87), Expect = 0.020
Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 9/70 (12%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN---------NNNNNNN 76
K + K K KEK K KK K+ K+ +K N++ + N +
Sbjct: 215 KNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEK 274
Query: 77 NNNNNNNNNN 86
N + ++
Sbjct: 275 NVVSGIRSSV 284
Score = 36.9 bits (85), Expect = 0.031
Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 8/76 (10%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN--------NNNNNNNNNNNNN 79
K KEK +K + K+ + +K+ K+ +K K +N++ + N + N +
Sbjct: 220 DKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSG 279
Query: 80 NNNNNNNEEKGTFLYK 95
++ + K L K
Sbjct: 280 IRSSVLLKRKSQCLRK 295
Score = 35.0 bits (80), Expect = 0.14
Identities = 14/59 (23%), Positives = 29/59 (49%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
K+K + K +KK K K + ++N+NN++ +++N+ +N +N N
Sbjct: 287 KRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRN 345
Score = 34.6 bits (79), Expect = 0.15
Identities = 13/79 (16%), Positives = 32/79 (40%), Gaps = 13/79 (16%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN-------- 72
+++ + K K ++K ++K K KK + K+ K+N+ + N++
Sbjct: 205 RRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRS 264
Query: 73 ---NNNNNNNN--NNNNNN 86
N + N + ++
Sbjct: 265 YTKENKKDEKNVVSGIRSS 283
Score = 33.8 bits (77), Expect = 0.27
Identities = 17/112 (15%), Positives = 36/112 (32%), Gaps = 41/112 (36%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK-------KKKKKKKKKNNNNNNNNNNNN 73
K K+K K+K + K ++ K+ K+ +K K + + K N + N +
Sbjct: 220 DKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSG 279
Query: 74 NNNNN----------------------------------NNNNNNNNEEKGT 91
++ NN ++N+N + +
Sbjct: 280 IRSSVLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTS 331
Score = 33.4 bits (76), Expect = 0.35
Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 15/85 (17%)
Query: 26 KEKKKKEKKAKKKEKKKE--KKKKE-------------KKKKKKKKKKKKKNNNNNNNNN 70
KE KK EK + K+K + KK K K NN ++N+N
Sbjct: 267 KENKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSN 326
Query: 71 NNNNNNNNNNNNNNNNEEKGTFLYK 95
N++ +++N+ +N + + YK
Sbjct: 327 NDDTSSSNHLGSNRISNRNPSSPYK 351
Score = 32.7 bits (74), Expect = 0.56
Identities = 18/119 (15%), Positives = 43/119 (36%), Gaps = 48/119 (40%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-------------KEKKKKKKKKKKKKKNNNNNN 67
+K K+K+K K+ + K+ K+ +K + K+ KK +K + ++
Sbjct: 224 EKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSS 283
Query: 68 -----------------------------------NNNNNNNNNNNNNNNNNNNEEKGT 91
NN ++N+NN++ +++N+ + +
Sbjct: 284 VLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRIS 342
Score = 31.9 bits (72), Expect = 1.2
Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 8/80 (10%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN-- 77
+ K ++ + K +K KEK K++ K KK + K+ N+ + N++ +
Sbjct: 201 YDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVR 260
Query: 78 ------NNNNNNNNNEEKGT 91
N + N G
Sbjct: 261 ILRSYTKENKKDEKNVVSGI 280
>gnl|CDD|131944 TIGR02898, spore_YhcN_YlaJ, sporulation lipoprotein, YhcN/YlaJ
family. YhcN and YlaJ are predicted lipoproteins that
have been detected as spore proteins but not vegetative
proteins in Bacillus subtilis. Both appear to be
expressed under control of the RNA polymerase sigma-G
factor. The YlaJ-like members of this family have a
low-complexity, strongly acidic 40-residue C-terminal
domain that is not included in the seed alignment for
this model. A portion of the low-complexity region
between the lipoprotein signal sequence and the main
conserved region of the protein family was also excised
from the seed alignment [Cellular processes,
Sporulation and germination].
Length = 158
Score = 35.8 bits (83), Expect = 0.021
Identities = 12/35 (34%), Positives = 17/35 (48%)
Query: 55 KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
+KK NNN N + NNN N N+ N+ +
Sbjct: 18 AQKKNPPNNNVGETNVMSKNNNGMNTTNDTNDGDL 52
Score = 35.0 bits (81), Expect = 0.032
Identities = 15/52 (28%), Positives = 21/52 (40%)
Query: 59 KKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPLVLVLTYVAEI 110
+K N NNN N + NNN N N+ LY +A + V +
Sbjct: 18 AQKKNPPNNNVGETNVMSKNNNGMNTTNDTNDGDLYDVADEIASEAAKVKGV 69
Score = 27.7 bits (62), Expect = 9.7
Identities = 9/33 (27%), Positives = 13/33 (39%)
Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
+KK N + NNN N N+ N+
Sbjct: 18 AQKKNPPNNNVGETNVMSKNNNGMNTTNDTNDG 50
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 36.3 bits (84), Expect = 0.022
Identities = 22/77 (28%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
P+P KK + +K K K+K KEK +E+KK + K ++N N+ +NN + +
Sbjct: 46 PSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGD 105
Query: 75 ---NNNNNNNNNNNNEE 88
+ + N+ ++ +NE+
Sbjct: 106 SQESASANSLSDIDNED 122
Score = 35.1 bits (81), Expect = 0.049
Identities = 17/55 (30%), Positives = 28/55 (50%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
+ K+K+ +K KK KKK K+K ++++ N N+ +NNEE G
Sbjct: 50 QPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESG 104
Score = 34.4 bits (79), Expect = 0.090
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
P D + K+ KKK +K +K KK+ K+K +++KK +++++ N N+ +N
Sbjct: 40 PEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDN 99
Query: 77 N 77
N
Sbjct: 100 N 100
Score = 32.4 bits (74), Expect = 0.33
Identities = 14/63 (22%), Positives = 32/63 (50%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
K+ KK + +K K+ KKK K+K +++ ++++ N N+ +NN + + +
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQ 107
Query: 89 KGT 91
+
Sbjct: 108 ESA 110
Score = 30.9 bits (70), Expect = 1.3
Identities = 13/61 (21%), Positives = 34/61 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK+KKK+K+K ++ KK E +K ++ + ++ + + N+ ++ +N ++ +
Sbjct: 66 KKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDNEDDMD 125
Query: 81 N 81
+
Sbjct: 126 D 126
Score = 30.5 bits (69), Expect = 1.6
Identities = 13/68 (19%), Positives = 38/68 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K A K+ KKK+K+ +E+KK + +K ++ + + + ++ + + N+ ++ +N
Sbjct: 61 KPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDN 120
Query: 81 NNNNNNEE 88
++ ++ +
Sbjct: 121 EDDMDDSD 128
Score = 29.7 bits (67), Expect = 3.0
Identities = 15/68 (22%), Positives = 35/68 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+K K+ K K KEK E++KK + K ++ + +NN + ++ + + N+
Sbjct: 56 PTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSL 115
Query: 81 NNNNNNEE 88
++ +N ++
Sbjct: 116 SDIDNEDD 123
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 36.2 bits (84), Expect = 0.023
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
WK AK E+ E+ K+E+ + +KK+ KKKK
Sbjct: 159 WKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 33.5 bits (77), Expect = 0.17
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
A+ K+ K E+ A ++ + K+E+ + +KK+ KKK
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 35.0 bits (81), Expect = 0.025
Identities = 13/70 (18%), Positives = 43/70 (61%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+K ++E K+KE++ + +E+K+ +++++++ +K+ K++K+ N++ + +
Sbjct: 35 RKKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDT 94
Query: 81 NNNNNNEEKG 90
+ ++E +G
Sbjct: 95 EDVEDDEWEG 104
Score = 34.2 bits (79), Expect = 0.056
Identities = 14/77 (18%), Positives = 43/77 (55%), Gaps = 8/77 (10%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK--------KKNNNNNNNNNNNNNN 75
K+K++++K+ + + KEK++E++ +E+K+ ++++K++ K+ N++ +
Sbjct: 29 KRKQQRRKKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEED 88
Query: 76 NNNNNNNNNNNEEKGTF 92
+ + ++E F
Sbjct: 89 AETEDTEDVEDDEWEGF 105
>gnl|CDD|235760 PRK06265, PRK06265, cobalt transport protein CbiM; Validated.
Length = 199
Score = 36.0 bits (84), Expect = 0.025
Identities = 30/135 (22%), Positives = 45/135 (33%), Gaps = 32/135 (23%)
Query: 227 MYMRRTFLLPYAIVTSLFFIG---QFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAE 283
+ F AI+ +L FGG+T L G+ TL + +
Sbjct: 72 VLGWAAFP---AILVALLLQALLFGFGGLTVL-----GV------------NTLNMALPA 111
Query: 284 LGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAA 343
+ L L+ T G+ A F+ + A L+ P VP L+
Sbjct: 112 VLAGYLFRKLLRRTPPPRGLAAFLAGALAVFLAALLCA-LELALSGGFPFVPAAKLL--- 167
Query: 344 FLTHICIRLLPWMLI 358
L H LP M+I
Sbjct: 168 GLAH-----LPLMVI 177
>gnl|CDD|218552 pfam05316, VAR1, Mitochondrial ribosomal protein (VAR1). This
family consists of the yeast mitochondrial ribosomal
proteins VAR1. Mitochondria possess their own ribosomes
responsible for the synthesis of a small number of
proteins encoded by the mitochondrial genome. In yeast
the two ribosomal RNAs and a single ribosomal protein,
VAR1, are products of mitochondrial genes, and the
remaining approximately 80 ribosomal proteins are
encoded in the nucleus. VAR1 along with 15S rRNA are
necessary for the formation of mature 37S subunits.
Length = 350
Score = 36.8 bits (85), Expect = 0.025
Identities = 16/35 (45%), Positives = 18/35 (51%)
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
K NN NN+NNN N NN NN N N + K
Sbjct: 240 KYNNMINNSNNNLNINNIYNNLNINKISMDLLMNK 274
Score = 36.5 bits (84), Expect = 0.032
Identities = 16/31 (51%), Positives = 17/31 (54%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
NN NNNNN NN NN+NNN N N
Sbjct: 231 NNINNNNNIKYNNMINNSNNNLNINNIYNNL 261
Score = 34.9 bits (80), Expect = 0.095
Identities = 15/27 (55%), Positives = 16/27 (59%)
Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNN 86
K NN NN+NNN N NN NN N N
Sbjct: 240 KYNNMINNSNNNLNINNIYNNLNINKI 266
Score = 33.4 bits (76), Expect = 0.26
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 59 KKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
K N NN+NNN N NN NN N N + +
Sbjct: 240 KYNNMINNSNNNLNINNIYNNLNINKISMD 269
Score = 33.0 bits (75), Expect = 0.37
Identities = 16/28 (57%), Positives = 17/28 (60%), Gaps = 2/28 (7%)
Query: 61 KNNNNNNNNNNNNNNNNN--NNNNNNNN 86
N NN NNNNN NN NN+NNN N
Sbjct: 226 SMNYINNINNNNNIKYNNMINNSNNNLN 253
Score = 31.8 bits (72), Expect = 1.0
Identities = 15/27 (55%), Positives = 16/27 (59%)
Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNN 86
N NN NNNNN NN NN+NNN
Sbjct: 226 SMNYINNINNNNNIKYNNMINNSNNNL 252
Score = 31.5 bits (71), Expect = 1.1
Identities = 19/37 (51%), Positives = 19/37 (51%), Gaps = 13/37 (35%)
Query: 63 NNNNNNNNNNNNNN-------------NNNNNNNNNN 86
NNN NN NNNNNN N NNNNN NN
Sbjct: 123 YNNNINNINNNNNNYYMNMINKLMNILNINNNNNMNN 159
Score = 31.1 bits (70), Expect = 1.7
Identities = 17/40 (42%), Positives = 20/40 (50%)
Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKM 96
K K K+ N N N NNN NN NNNNN + K+
Sbjct: 106 KPKFKHTINKVNINFYYYNNNINNINNNNNNYYMNMINKL 145
Score = 30.7 bits (69), Expect = 1.9
Identities = 15/33 (45%), Positives = 17/33 (51%)
Query: 55 KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
K KNN N N N NN+N NNNN N+
Sbjct: 305 SNKLYLKNNINKNYKLNYIPNNHNIYNNNNVNK 337
Score = 29.9 bits (67), Expect = 3.5
Identities = 19/38 (50%), Positives = 20/38 (52%), Gaps = 13/38 (34%)
Query: 62 NNNNNNNNNNNN-------------NNNNNNNNNNNNN 86
NN NN NNNNNN N NNNNN NN +N
Sbjct: 125 NNINNINNNNNNYYMNMINKLMNILNINNNNNMNNISN 162
Score = 29.9 bits (67), Expect = 3.6
Identities = 17/31 (54%), Positives = 17/31 (54%)
Query: 56 KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
K K K N N N NNN NN NNNNNN
Sbjct: 106 KPKFKHTINKVNINFYYYNNNINNINNNNNN 136
Score = 29.9 bits (67), Expect = 3.7
Identities = 18/38 (47%), Positives = 18/38 (47%), Gaps = 13/38 (34%)
Query: 62 NNNNNNNNNNNNNNNN-------------NNNNNNNNN 86
NNN NN NNNNNN N NNNNN N
Sbjct: 121 YYYNNNINNINNNNNNYYMNMINKLMNILNINNNNNMN 158
Score = 29.1 bits (65), Expect = 5.7
Identities = 17/43 (39%), Positives = 18/43 (41%)
Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
K K K NNN NN + N NNN N N KG L
Sbjct: 1 MKLLLKNYLLKMNNNRMNNMDINIKNNNMNKYLNEYNNKGNKL 43
Score = 29.1 bits (65), Expect = 6.0
Identities = 20/71 (28%), Positives = 26/71 (36%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
K + K K + K K + NN NN NN N N NN N N
Sbjct: 11 KMNNNRMNNMDINIKNNNMNKYLNEYNNKGNKLQHINNMNNWNNQLYNYNKNNVINTWIN 70
Query: 86 NEEKGTFLYKM 96
++ LYK+
Sbjct: 71 DKLVNKLLYKL 81
Score = 28.8 bits (64), Expect = 8.1
Identities = 17/31 (54%), Positives = 17/31 (54%)
Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
K K K N N N NNN NN NNNNNN
Sbjct: 106 KPKFKHTINKVNINFYYYNNNINNINNNNNN 136
Score = 28.8 bits (64), Expect = 8.6
Identities = 21/52 (40%), Positives = 21/52 (40%), Gaps = 13/52 (25%)
Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN-------------NNNNNEEK 89
K K K K N N NNN NN NNNNNN N NNN
Sbjct: 106 KPKFKHTINKVNINFYYYNNNINNINNNNNNYYMNMINKLMNILNINNNNNM 157
Score = 28.8 bits (64), Expect = 9.1
Identities = 14/32 (43%), Positives = 15/32 (46%)
Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
K K N N N N NN+N NNNN N
Sbjct: 305 SNKLYLKNNINKNYKLNYIPNNHNIYNNNNVN 336
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 36.9 bits (85), Expect = 0.025
Identities = 13/50 (26%), Positives = 21/50 (42%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
KK K + K E + K +E++KKK+K++ KK
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKV 591
Score = 36.9 bits (85), Expect = 0.031
Identities = 14/48 (29%), Positives = 26/48 (54%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
KA +K K + ++ K +E++K+K+K+ KK K KK++
Sbjct: 544 MKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKV 591
Score = 36.1 bits (83), Expect = 0.047
Identities = 16/39 (41%), Positives = 21/39 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
++ KKK K++ KKA K KK+ K KEK KK
Sbjct: 562 ERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSS 600
Score = 32.7 bits (74), Expect = 0.54
Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
TE D K ++++KK+K + AKK K +K++K K+K KK +
Sbjct: 552 TELDRTD-KNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSA 601
Score = 31.9 bits (72), Expect = 1.1
Identities = 14/43 (32%), Positives = 27/43 (62%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K +++ KKK++++A KK K KK+++ K+K KK ++
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSA 601
Score = 31.5 bits (71), Expect = 1.5
Identities = 11/50 (22%), Positives = 21/50 (42%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
KK + +K K + K +++K+K+K++ KK K
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKV 591
Score = 29.2 bits (65), Expect = 6.1
Identities = 11/50 (22%), Positives = 19/50 (38%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
KK K EK K + + K ++++KKK+K+
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKV 591
>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII. Eukaryotic DNA topoisomerase
II, GyrB, ParE.
Length = 594
Score = 37.2 bits (87), Expect = 0.026
Identities = 10/44 (22%), Positives = 17/44 (38%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
L + + E K +K K + KK ++ +KKK
Sbjct: 323 LSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSSIS 366
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 34.9 bits (80), Expect = 0.026
Identities = 15/23 (65%), Positives = 16/23 (69%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKK 61
EK K+KKKK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 34.5 bits (79), Expect = 0.033
Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 7 HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
H + EP+ D + EK KK+KK K K+ KK KK +K
Sbjct: 78 HLTTSSGEPSHPDDHPPEPTEKPKKKKK-KSKKTKKPKKSSKKD 120
Score = 32.2 bits (73), Expect = 0.23
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKK 59
K KKK+KK K+ KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 30.6 bits (69), Expect = 0.77
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNN 63
+ EK KK+KKK KK KK KK +
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSK 118
Score = 29.5 bits (66), Expect = 2.1
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 45 KKKEKKKKKKKKKKKKKNNN 64
K K+KKKK KK KK KK++
Sbjct: 99 KPKKKKKKSKKTKKPKKSSK 118
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 33.8 bits (78), Expect = 0.026
Identities = 13/35 (37%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 24 KKKEKKKKEKKAKKKEKKKE--KKKKEKKKKKKKK 56
++ ++KK+KK K K+K KE +KKE+ +++ K+
Sbjct: 36 SQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70
Score = 31.9 bits (73), Expect = 0.13
Identities = 12/37 (32%), Positives = 25/37 (67%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K + ++ ++ KKK+KK +KK KE +KK++ +++
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67
Score = 29.6 bits (67), Expect = 0.85
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 34 KAKKKEKKKEKKKKEKKKKKKKKKKKK 60
A K + ++ +++KKKKKK KKK K
Sbjct: 28 DASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 29.2 bits (66), Expect = 1.2
Identities = 11/24 (45%), Positives = 17/24 (70%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKN 62
K + ++ ++KKKKKKK KKK+
Sbjct: 30 SKVRRISQRNRRKKKKKKKLKKKS 53
Score = 29.2 bits (66), Expect = 1.4
Identities = 11/36 (30%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 27 EKKKKEKKAKKKEKKKEKKK--KEKKKKKKKKKKKK 60
++ +K KKK+K K+K K +KK++ +++ K+
Sbjct: 35 ISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70
Score = 27.6 bits (62), Expect = 4.0
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 4/34 (11%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKK----KKK 60
K ++ ++ +KKK+KKK KKK K +KK
Sbjct: 28 DASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
Score = 27.6 bits (62), Expect = 5.0
Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 4/49 (8%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK----KKK 59
P L + + E+ K + ++ + KKKK+KK KKK K +KK
Sbjct: 13 PNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
Score = 27.3 bits (61), Expect = 6.5
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K ++ ++ ++K+KKKK+ KKK K+ +KK
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
Score = 26.9 bits (60), Expect = 7.6
Identities = 12/34 (35%), Positives = 24/34 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
++ ++K+KKKK+ K K KE KK++ +++ K+
Sbjct: 37 QRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 36.9 bits (86), Expect = 0.027
Identities = 9/39 (23%), Positives = 16/39 (41%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K++K+K+ K + K K EK K +
Sbjct: 766 EKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLT 804
Score = 36.5 bits (85), Expect = 0.039
Identities = 8/45 (17%), Positives = 14/45 (31%), Gaps = 2/45 (4%)
Query: 20 WKKAKKKEKKKKEKK--AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+ + K + + K AK EK K+ +
Sbjct: 769 KKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAA 813
Score = 33.1 bits (76), Expect = 0.48
Identities = 9/44 (20%), Positives = 19/44 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+K KK+++ K E +A K K +K + + + +
Sbjct: 766 EKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPD 809
Score = 32.7 bits (75), Expect = 0.66
Identities = 5/41 (12%), Positives = 7/41 (17%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K K EK + K
Sbjct: 779 EALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGVS 819
Score = 31.1 bits (71), Expect = 1.6
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
EK+ K+KE K E + + K +K K
Sbjct: 765 TEKQKKEKESKSELEALKGVGAKTAEKLKDA 795
Score = 30.4 bits (69), Expect = 3.3
Identities = 4/36 (11%), Positives = 9/36 (25%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
K A+K + E + + +
Sbjct: 787 KTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822
Score = 30.4 bits (69), Expect = 3.6
Identities = 10/54 (18%), Positives = 18/54 (33%), Gaps = 12/54 (22%)
Query: 19 DWKKAKKKEKKKKE------------KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ K E +K KK+++ K + + K K +K K
Sbjct: 740 GLPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLK 793
Score = 30.0 bits (68), Expect = 4.6
Identities = 7/43 (16%), Positives = 10/43 (23%), Gaps = 1/43 (2%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
L AK E K K+ + K +
Sbjct: 781 LKGVGAKTAE-KLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 35.8 bits (83), Expect = 0.029
Identities = 17/49 (34%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Query: 20 WKKA---KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
WK A K E+K+KE+ A +KK + +K K+ +KK K K
Sbjct: 146 WKYADVVAKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVA 194
Score = 28.5 bits (64), Expect = 7.8
Identities = 13/37 (35%), Positives = 18/37 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
+ A +KK K +KA K+ +KK K K KK
Sbjct: 162 RAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 36.2 bits (84), Expect = 0.030
Identities = 17/37 (45%), Positives = 17/37 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KK KK KAKK KK KK K KK KK
Sbjct: 260 GPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296
Score = 35.1 bits (81), Expect = 0.078
Identities = 15/37 (40%), Positives = 16/37 (43%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK +K K KK KK KK K KK KK
Sbjct: 260 GPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296
Score = 34.3 bits (79), Expect = 0.12
Identities = 16/51 (31%), Positives = 22/51 (43%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ T L+ ++ + +K AKK K KK KK KK K KK
Sbjct: 242 SITLEEALELLAERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKK 292
Score = 34.3 bits (79), Expect = 0.13
Identities = 15/40 (37%), Positives = 18/40 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K +K K+ A K +K KK KK K KK KK
Sbjct: 257 AKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296
Score = 33.9 bits (78), Expect = 0.20
Identities = 17/41 (41%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++AK KK KKA + KK KK KK K KK K
Sbjct: 255 RRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAK 295
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 33.3 bits (77), Expect = 0.030
Identities = 15/51 (29%), Positives = 29/51 (56%), Gaps = 8/51 (15%)
Query: 22 KAKKKEKKKKEK-----KAKKK---EKKKEKKKKEKKKKKKKKKKKKKNNN 64
KA ++ K+K EK + +++ EK EK+K++K +K+++KK
Sbjct: 14 KALRRFKRKVEKAGILRELRRREFYEKPSEKRKRKKAAARKRRRKKLAREE 64
>gnl|CDD|177978 PLN02344, PLN02344, chorismate mutase.
Length = 284
Score = 36.1 bits (84), Expect = 0.032
Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 446 IRRAEKKRHAARVEGQEEVKGEDNFGMEGEDGKYISRVVGTLYMYFVMPET 496
+R EKK AR GQE V K +V LY ++MP T
Sbjct: 225 KKRVEKK---ARTFGQE-VTLSGEEDGADPKYKVDPSLVARLYGEWIMPLT 271
>gnl|CDD|214408 MTH00036, ATP8, ATP synthase F0 subunit 8; Validated.
Length = 54
Score = 32.7 bits (75), Expect = 0.034
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNN 85
NN + N +N+++N +++NNNN
Sbjct: 28 LLNNWPSTNASNSSSNTSSSNNNN 51
Score = 32.7 bits (75), Expect = 0.034
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 63 NNNNNNNNNNNNNNNNNNNNNNNN 86
NN + N +N+++N +++NNNN
Sbjct: 28 LLNNWPSTNASNSSSNTSSSNNNN 51
Score = 32.7 bits (75), Expect = 0.037
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 61 KNNNNNNNNNNNNNNNNNNNNNN 83
NN + N +N+++N +++NNNN
Sbjct: 29 LNNWPSTNASNSSSNTSSSNNNN 51
Score = 30.8 bits (70), Expect = 0.16
Identities = 8/23 (34%), Positives = 16/23 (69%)
Query: 60 KKNNNNNNNNNNNNNNNNNNNNN 82
N + N +N+++N +++NNNN
Sbjct: 29 LNNWPSTNASNSSSNTSSSNNNN 51
Score = 30.8 bits (70), Expect = 0.19
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 64 NNNNNNNNNNNNNNNNNNNNNNNE 87
NN + N +N+++N +++NNN
Sbjct: 28 LLNNWPSTNASNSSSNTSSSNNNN 51
Score = 29.7 bits (67), Expect = 0.43
Identities = 7/23 (30%), Positives = 15/23 (65%)
Query: 58 KKKKNNNNNNNNNNNNNNNNNNN 80
+ N +N+++N +++NNNN
Sbjct: 29 LNNWPSTNASNSSSNTSSSNNNN 51
>gnl|CDD|223475 COG0398, COG0398, Uncharacterized conserved protein [Function
unknown].
Length = 223
Score = 35.8 bits (83), Expect = 0.036
Identities = 33/201 (16%), Positives = 48/201 (23%), Gaps = 41/201 (20%)
Query: 233 FLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVV 292
F+L Y + T + L +F P+ L + G+ L +
Sbjct: 54 FILLYLVATLPII-----PGSILTLAGGLLF-------GPFLGFLYSLIGATAGSTLAFL 101
Query: 293 LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWD-------SPLVPTVFLVLAAFL 345
L Y G R L GG V G+ P+ P + AA L
Sbjct: 102 LARYLG-RDWVLKFVGGKEK----VQRIDAGLERNGFWAILLLRLIPIFPFDLVNYAAGL 156
Query: 346 THICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFY 405
T I + + G + L L L
Sbjct: 157 TG--ISFRDFAIA-------------TLLGKLPGTIVYTY--LGSAFLGIGSPLLLLVLA 199
Query: 406 AAISVVGTLYMYFVMPETEGR 426
I V + R
Sbjct: 200 LLILGVFAIVAKRKELRRHKR 220
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 36.2 bits (83), Expect = 0.037
Identities = 15/40 (37%), Positives = 20/40 (50%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K + K K E +A+ KKK K+ KK K +KK
Sbjct: 263 KGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKK 302
Score = 35.4 bits (81), Expect = 0.063
Identities = 16/43 (37%), Positives = 21/43 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KA+ K K + E +A +KK KE KK K +KK N
Sbjct: 264 GKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKKAIKN 306
Score = 31.2 bits (70), Expect = 1.3
Identities = 12/41 (29%), Positives = 18/41 (43%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
A K K + + KA+ + + KKK K+ KK K
Sbjct: 260 AALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKME 300
>gnl|CDD|222436 pfam13886, DUF4203, Domain of unknown function (DUF4203). This is
the N-terminal region of 7tm proteins. The function is
not known.
Length = 210
Score = 35.7 bits (83), Expect = 0.038
Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 8/101 (7%)
Query: 232 TFLLPYAIVT-SLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLC 290
+ IV+ LF +G G A + L Y + +L V L GA+L
Sbjct: 70 LLWWVFLIVSVGLFCVGLLLGFLL----ASALLFLPLGNLYIYHSDWVLLVTLLCGAILF 125
Query: 291 VVLIHYTGKRPLALISTG--GSAACFIVVAVYAQFHLSYGW 329
+L ++ L ++ST G+AA + V +A L Y W
Sbjct: 126 ALLTLKW-QKYLLILSTSVVGAAAVVLGVDYFAGASLKYFW 165
Score = 29.5 bits (67), Expect = 3.5
Identities = 27/164 (16%), Positives = 45/164 (27%), Gaps = 34/164 (20%)
Query: 244 FFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRPLA 303
F G + I + L +GGA+L ++ +
Sbjct: 31 FLFGSLIVF--------LLITKITLLVTNARLGASLVAGVVGGAILGLLWWVFL--IVSV 80
Query: 304 LISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLA-------AFLTHICIRLLPWM 356
+ G F++ A F V LV A LT + L
Sbjct: 81 GLFCVGLLLGFLL-ASALLFLPLGNLYIYHSDWVLLVTLLCGAILFALLTLKWQKYL--- 136
Query: 357 LIGEVFPNNIRATASGASGSSSYIFA---FAVNKLYYPMLDTFH 397
LI ++ G+++ + FA L Y L+
Sbjct: 137 LI----------LSTSVVGAAAVVLGVDYFAGASLKYFWLNILK 170
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 36.4 bits (84), Expect = 0.041
Identities = 16/69 (23%), Positives = 41/69 (59%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K+ K +K K K+K K E+ K + KK++ ++K + +++++++++++N +
Sbjct: 734 KQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESR 793
Query: 81 NNNNNNEEK 89
++ N EE+
Sbjct: 794 KSSRNKEEE 802
Score = 34.5 bits (79), Expect = 0.18
Identities = 17/72 (23%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 22 KAKKKEKKKKEKKAKKKEKKK-----EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
KK+ K +K+ K K+K E K E KK++ ++K + +++++++++++N
Sbjct: 731 APKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNK 790
Query: 77 NNNNNNNNNNEE 88
+ ++ N EE
Sbjct: 791 ESRKSSRNKEEE 802
Score = 29.9 bits (67), Expect = 4.9
Identities = 13/78 (16%), Positives = 38/78 (48%), Gaps = 13/78 (16%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK-------------KKKKKNNNNNNNNNN 71
K +K KK+ K +K K K+K K + ++K+++ + ++++
Sbjct: 722 KPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSS 781
Query: 72 NNNNNNNNNNNNNNNEEK 89
+++++++N + ++ K
Sbjct: 782 SHHHSSSNKESRKSSRNK 799
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 36.6 bits (85), Expect = 0.041
Identities = 16/50 (32%), Positives = 25/50 (50%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
EP ++ K++K E +EK E+ E KK+K KK+ K+N
Sbjct: 1529 EPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578
Score = 35.8 bits (83), Expect = 0.072
Identities = 16/40 (40%), Positives = 24/40 (60%)
Query: 9 IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKE 48
G E D + K K+KKKKEKK +++ K++EK + E
Sbjct: 729 NGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 33.9 bits (78), Expect = 0.28
Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 4/43 (9%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+W K E K + EK K+KKKKEKKK+++ K+++K
Sbjct: 727 NWNG-KDAEFKISDSVE---EKTKKKKKKEKKKEEEYKREEKA 765
Score = 32.8 bits (75), Expect = 0.61
Identities = 11/40 (27%), Positives = 20/40 (50%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KEKK + ++K ++ + KK+K KK+ +N
Sbjct: 1539 DKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578
Score = 32.4 bits (74), Expect = 0.79
Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 11/56 (19%)
Query: 21 KKAKKKEKKKKEKKAKKKEKK--------KEKKKKEK---KKKKKKKKKKKKNNNN 65
+ ++KE + + KEKK ++K E+ + KK+K KK+ +N
Sbjct: 1523 RNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578
Score = 30.1 bits (68), Expect = 3.7
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K E K +EK KKKKKK+KKK+
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEY 759
Score = 29.7 bits (67), Expect = 5.4
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K E K ++K KKKK+K+KKK+++ K+++
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764
Score = 29.7 bits (67), Expect = 6.0
Identities = 9/34 (26%), Positives = 16/34 (47%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
+EK +E A+ KK++ KK+ K +
Sbjct: 1548 SVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581
Score = 29.3 bits (66), Expect = 7.3
Identities = 13/26 (50%), Positives = 22/26 (84%), Gaps = 1/26 (3%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKK 51
+EK KK+KK K+K+K++E K++EK +
Sbjct: 742 EEKTKKKKK-KEKKKEEEYKREEKAR 766
>gnl|CDD|191179 pfam05053, Menin, Menin. MEN1, the gene responsible for multiple
endocrine neoplasia type 1, is a tumour suppressor gene
that encodes a protein called Menin which may be an
atypical GTPase stimulated by nm23.
Length = 618
Score = 36.5 bits (84), Expect = 0.041
Identities = 21/61 (34%), Positives = 35/61 (57%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
K EK+ KE K E+ +E +++ +++ K ++ + N NNNN+N+NNNNNN
Sbjct: 458 KLPEKEAKESKEAAGEEAREGRRRGPRRESKSQEPSGGESPNPELPANNNNSNSNNNNNN 517
Query: 84 N 84
Sbjct: 518 G 518
Score = 31.5 bits (71), Expect = 1.4
Identities = 17/62 (27%), Positives = 35/62 (56%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
K EK+AK+ ++ ++ +E +++ +++ K + + + N NNNN+N+NNNN
Sbjct: 458 KLPEKEAKESKEAAGEEAREGRRRGPRRESKSQEPSGGESPNPELPANNNNSNSNNNNNN 517
Query: 89 KG 90
Sbjct: 518 GA 519
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of
70kDa MW N terminal. This domain is found in
eukaryotes. This domain is about 90 amino acids in
length. This domain is found associated with pfam00076.
This domain is part of U1 snRNP, which is the pre-mRNA
binding protein of the penta-snRNP spliceosome complex.
It extends over a distance of 180 A from its RNA
binding domain, wraps around the core domain of U1
snRNP consisting of the seven Sm proteins and finally
contacts U1-C, which is crucial for 5'-splice-site
recognition.
Length = 94
Score = 33.8 bits (78), Expect = 0.041
Identities = 12/53 (22%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 19 DWKKAKKKEKKKKEK-KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
+ K K + + +K+E++K+EKK+K +KK +++ + N +
Sbjct: 42 QYLSEFKDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKEWDPNEDP 94
>gnl|CDD|219312 pfam07150, DUF1390, Protein of unknown function (DUF1390). This
family consists of several Paramecium bursaria
chlorella virus 1 (PBCV-1) proteins of around 250
residues in length. The function of this family is
unknown.
Length = 229
Score = 35.4 bits (82), Expect = 0.044
Identities = 17/58 (29%), Positives = 22/58 (37%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
KK + K + KE +E + +K N NNN NN N N N N
Sbjct: 26 KKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTKNITIN 83
Score = 35.1 bits (81), Expect = 0.065
Identities = 16/56 (28%), Positives = 24/56 (42%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
K KK + K + K+ ++ + +K +N N NNN NN N N N
Sbjct: 24 KHKKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTKN 79
Score = 33.1 bits (76), Expect = 0.27
Identities = 14/53 (26%), Positives = 21/53 (39%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
KK + K E K+ ++ + + +N N NNN NN N K
Sbjct: 26 KKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTK 78
Score = 33.1 bits (76), Expect = 0.28
Identities = 17/62 (27%), Positives = 23/62 (37%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
K KK K E K+ +++ + +K N NNN NN N N N N
Sbjct: 24 KHKKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTKNITIN 83
Query: 86 NE 87
Sbjct: 84 LT 85
Score = 30.4 bits (69), Expect = 1.8
Identities = 18/60 (30%), Positives = 22/60 (36%), Gaps = 1/60 (1%)
Query: 25 KKEKKKKEKKAKKKEK-KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
KK E K++ KE +E + EK NNN NN N N N N
Sbjct: 26 KKTSCGHEMKSESKEFVLEEDYRDEKLATTVSNINTGVVNNNVNNINKTINTKNITINLT 85
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 36.2 bits (84), Expect = 0.045
Identities = 12/88 (13%), Positives = 24/88 (27%), Gaps = 10/88 (11%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN---------- 70
K+ +K K + K K +EK K +K KK N+
Sbjct: 445 KEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEILE 504
Query: 71 NNNNNNNNNNNNNNNNEEKGTFLYKMAA 98
++ + + + +
Sbjct: 505 KVDSKSEKLEKISAKIDNIKELFDESKM 532
Score = 30.4 bits (69), Expect = 2.5
Identities = 15/82 (18%), Positives = 27/82 (32%), Gaps = 7/82 (8%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKE-------KKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
K + K+ +K K+K K ++K K +K KKNN+ +
Sbjct: 437 KGTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDA 496
Query: 74 NNNNNNNNNNNNNEEKGTFLYK 95
++ EK +
Sbjct: 497 LKLQEILEKVDSKSEKLEKISA 518
>gnl|CDD|222131 pfam13440, Polysacc_synt_3, Polysaccharide biosynthesis protein.
Length = 251
Score = 35.8 bits (83), Expect = 0.045
Identities = 31/209 (14%), Positives = 64/209 (30%), Gaps = 29/209 (13%)
Query: 242 SLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRP 301
+ +F G+ +++ + SI +LLL ++ G V G
Sbjct: 7 LAILLARFLGLEDFGAFSLAL--SI--------ISLLLILSSFGLRPSLVRD--SGGDPR 54
Query: 302 -------LALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLP 354
+L++ + ++ L+ + L+LA + LL
Sbjct: 55 ELRQAVRFSLLAALLLSLAVALLVSLIALFFGDEETGLLLLLLGLILA-------LLLLL 107
Query: 355 WMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTL 414
+L +F R A+ S + + L + L L +AA ++V L
Sbjct: 108 DILRSILFARG-RFRAAAVSDLLRLVVVLLL--LVLFLYLGATLASLLLAWAASALVAAL 164
Query: 415 YMYFVMPETEGRTLRDIEEHFADKGKTFV 443
+++ R + G
Sbjct: 165 IAFYLSRRRPARAPKFALLRHLKYGLKLG 193
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 36.4 bits (85), Expect = 0.046
Identities = 14/47 (29%), Positives = 24/47 (51%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
K++ EK +KE++ EK KE + KK + +KK + N +
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 35.6 bits (83), Expect = 0.072
Identities = 12/46 (26%), Positives = 26/46 (56%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
K++ +K + +K ++ EK ++ + KK + +KKK+ KN +
Sbjct: 169 KEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 34.4 bits (80), Expect = 0.18
Identities = 13/44 (29%), Positives = 22/44 (50%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+ K + +K++E + KE + KK E +KKK+ K K
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 34.0 bits (79), Expect = 0.24
Identities = 11/47 (23%), Positives = 24/47 (51%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
++ EK + +K+ + ++ KE + KK + +KKK+ K +
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 32.5 bits (75), Expect = 0.74
Identities = 13/55 (23%), Positives = 30/55 (54%)
Query: 8 QIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+I + + + +K + +++++ EK AK+ + +K + EKKK+ K K+
Sbjct: 160 KIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 31.7 bits (73), Expect = 1.1
Identities = 11/46 (23%), Positives = 24/46 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K+ +K + +KE++ +K K+ + K+ + +KKK+ K
Sbjct: 169 KEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 31.7 bits (73), Expect = 1.3
Identities = 10/59 (16%), Positives = 27/59 (45%)
Query: 9 IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
G +D K + ++ + ++++ +K K+ + +K + +KKK+ K +
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 31.3 bits (72), Expect = 1.5
Identities = 11/62 (17%), Positives = 26/62 (41%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
E +++ +K E +KE++ ++ K+ + KK + + N + +
Sbjct: 164 EIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDK 223
Query: 87 EE 88
EE
Sbjct: 224 EE 225
Score = 31.3 bits (72), Expect = 1.7
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K++E +K K+A + KK E +KK++ K K+
Sbjct: 179 KEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215
Score = 30.6 bits (70), Expect = 2.8
Identities = 12/38 (31%), Positives = 19/38 (50%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+K ++ EK KE K+ + EKKK+ K K+
Sbjct: 178 QKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215
>gnl|CDD|224874 COG1963, COG1963, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 153
Score = 34.7 bits (80), Expect = 0.047
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 7/75 (9%)
Query: 277 LLLGVAELGGALLCVVLIHY--TGKRPL-ALISTGG--SAACFIVVAVYAQFHLSYGWDS 331
LL + + A + VLI T K + L STGG S+ +V A+ L+ G DS
Sbjct: 13 LLSALVAILLAQVIKVLIELIRTRKLNVTLLFSTGGMPSSHSALVTALATSIALTEGLDS 72
Query: 332 PL--VPTVFLVLAAF 344
PL + VF ++ +
Sbjct: 73 PLFAIAAVFAIIVMY 87
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 32.4 bits (74), Expect = 0.049
Identities = 17/36 (47%), Positives = 22/36 (61%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+ KK KK KKK +++KKK KKK+K K N N
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRKPKLPKNFN 36
Score = 32.0 bits (73), Expect = 0.060
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
E KK +K AKKK ++KKKK KKK+K K
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRKPK 30
Score = 27.8 bits (62), Expect = 2.4
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
+AKK +K K+K +++KKK KKK K K
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRKPK 30
Score = 27.0 bits (60), Expect = 4.0
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKK 47
KK KK KKK + KKK+ K+K+K
Sbjct: 3 KKPKKIAKKKPGTEQKKKKSAKKKRKP 29
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap
is involved in recombination. Rap (recombination adept
with plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 35.0 bits (81), Expect = 0.050
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
++K ++KK K + E+++ K +KEK K + K
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARKEKLKTRSDWLK 70
Score = 33.5 bits (77), Expect = 0.14
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +K +EKK K + + + ++ K +K+K K + K
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARKEKLKTRSDWLK 70
Score = 31.9 bits (73), Expect = 0.47
Identities = 9/40 (22%), Positives = 24/40 (60%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+ A +++K ++K +K E + E+++ + +K+K K +
Sbjct: 27 SPECALALKREKAQEKKRKAEAQAERRELKARKEKLKTRS 66
Score = 30.4 bits (69), Expect = 1.8
Identities = 10/36 (27%), Positives = 22/36 (61%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
E K+ K +EKK++ + + ++++ K +K+K K
Sbjct: 29 ECALALKREKAQEKKRKAEAQAERRELKARKEKLKT 64
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 35.0 bits (81), Expect = 0.053
Identities = 8/41 (19%), Positives = 28/41 (68%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ +++E++K++++ +++ + K + E+ + K+K K+ +K
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161
Score = 34.2 bits (79), Expect = 0.093
Identities = 8/42 (19%), Positives = 24/42 (57%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
E+ ++E++ K+ E+++E+ + K + ++ + K+K
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161
Score = 32.7 bits (75), Expect = 0.37
Identities = 7/41 (17%), Positives = 25/41 (60%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+ E++++EK+ +++ ++ + K + ++ + K+K K+
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161
Score = 31.5 bits (72), Expect = 0.70
Identities = 9/48 (18%), Positives = 28/48 (58%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+ +K ++E+++ + AK + ++ K+K K+ +K++ ++ + N
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHRAANA 173
Score = 30.0 bits (68), Expect = 2.4
Identities = 7/41 (17%), Positives = 27/41 (65%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +++E+K+ E++ ++ + + + ++ + K+K K+ +K+
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161
Score = 30.0 bits (68), Expect = 2.4
Identities = 9/49 (18%), Positives = 26/49 (53%), Gaps = 7/49 (14%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK-------KKKKKKNNNNN 66
+K+ + E+ +++E+K++++++E+ + K K+K K
Sbjct: 113 RKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161
>gnl|CDD|220296 pfam09580, Spore_YhcN_YlaJ, Sporulation lipoprotein YhcN/YlaJ
(Spore_YhcN_YlaJ). This entry contains YhcN and YlaJ,
which are predicted lipoproteins that have been
detected as spore proteins but not vegetative proteins
in Bacillus subtilis. Both appear to be expressed under
control of the RNA polymerase sigma-G factor. The
YlaJ-like members of this family have a low-complexity,
strongly acidic, 40-residue C-terminal domain.
Length = 169
Score = 34.6 bits (80), Expect = 0.057
Identities = 11/38 (28%), Positives = 14/38 (36%)
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAA 98
+NNNN NN N NN E + + A
Sbjct: 39 RNNNNGYGRLTKNNVTGENGYNNIERGEVADKIARRVA 76
Score = 33.9 bits (78), Expect = 0.10
Identities = 13/29 (44%), Positives = 14/29 (48%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
NN NNNN NN N NN E+G
Sbjct: 37 NNRNNNNGYGRLTKNNVTGENGYNNIERG 65
Score = 31.9 bits (73), Expect = 0.49
Identities = 12/36 (33%), Positives = 14/36 (38%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMA 97
NN NN NNNN NN N ++A
Sbjct: 33 VNNRNNRNNNNGYGRLTKNNVTGENGYNNIERGEVA 68
Score = 31.9 bits (73), Expect = 0.49
Identities = 11/28 (39%), Positives = 11/28 (39%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
N NN NN NNNN NN
Sbjct: 32 NVNNRNNRNNNNGYGRLTKNNVTGENGY 59
Score = 31.5 bits (72), Expect = 0.56
Identities = 13/36 (36%), Positives = 16/36 (44%)
Query: 64 NNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAP 99
+N+ N NN NN NN NNNN K +
Sbjct: 25 DNDGNTNNVNNRNNRNNNNGYGRLTKNNVTGENGYN 60
Score = 31.5 bits (72), Expect = 0.58
Identities = 13/27 (48%), Positives = 13/27 (48%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEE 88
NN NN NN NNNN NN E
Sbjct: 31 NNVNNRNNRNNNNGYGRLTKNNVTGEN 57
Score = 31.5 bits (72), Expect = 0.64
Identities = 14/32 (43%), Positives = 16/32 (50%)
Query: 56 KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
N N +N+ N NN NN NN NNNN
Sbjct: 13 NAADDDNRQNVTDNDGNTNNVNNRNNRNNNNG 44
Score = 31.2 bits (71), Expect = 0.77
Identities = 14/31 (45%), Positives = 14/31 (45%)
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
NN NN NN NNNN NN E G
Sbjct: 29 NTNNVNNRNNRNNNNGYGRLTKNNVTGENGY 59
Score = 30.8 bits (70), Expect = 0.94
Identities = 9/28 (32%), Positives = 10/28 (35%)
Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
+ NNN NN N NN E
Sbjct: 39 RNNNNGYGRLTKNNVTGENGYNNIERGE 66
Score = 30.8 bits (70), Expect = 0.97
Identities = 13/25 (52%), Positives = 13/25 (52%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
N NN NN NN NNNN NN
Sbjct: 28 GNTNNVNNRNNRNNNNGYGRLTKNN 52
Score = 30.8 bits (70), Expect = 1.1
Identities = 12/33 (36%), Positives = 17/33 (51%)
Query: 59 KKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
+++N N +N+ N NN NN NN N G
Sbjct: 13 NAADDDNRQNVTDNDGNTNNVNNRNNRNNNNGY 45
Score = 30.4 bits (69), Expect = 1.5
Identities = 12/27 (44%), Positives = 13/27 (48%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEE 88
N+ N NN NN NN NNNN
Sbjct: 26 NDGNTNNVNNRNNRNNNNGYGRLTKNN 52
Score = 30.0 bits (68), Expect = 2.0
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLY 94
N +++N N +N+ N NN NN N Y
Sbjct: 13 NAADDDNRQNVTDNDGNTNNVNNRNNRNNNNGY 45
Score = 30.0 bits (68), Expect = 2.3
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNN 84
+N+ N NN NN NN NNNN
Sbjct: 25 DNDGNTNNVNNRNNRNNNNGYGR 47
Score = 30.0 bits (68), Expect = 2.3
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 63 NNNNNNNNNNNNNNNNNNNNNNN 85
+N+ N NN NN NN NNNN
Sbjct: 25 DNDGNTNNVNNRNNRNNNNGYGR 47
Score = 28.8 bits (65), Expect = 5.4
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
NN +++N N +N+ N NN NN +
Sbjct: 12 NNAADDDNRQNVTDNDGNTNNVNNRNNRNN 41
Score = 28.5 bits (64), Expect = 7.0
Identities = 7/26 (26%), Positives = 8/26 (30%)
Query: 59 KKKNNNNNNNNNNNNNNNNNNNNNNN 84
+ NN NN N NN
Sbjct: 39 RNNNNGYGRLTKNNVTGENGYNNIER 64
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 35.8 bits (83), Expect = 0.058
Identities = 14/40 (35%), Positives = 25/40 (62%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
AK +K + KKA ++ +++ ++ K KKKK+KKK+
Sbjct: 398 SAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEW 437
Score = 35.4 bits (82), Expect = 0.085
Identities = 16/41 (39%), Positives = 25/41 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A +K + KK + ++E +E K KKKK+KKK+ +K
Sbjct: 400 KTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 32.7 bits (75), Expect = 0.50
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +K + K E+ +E+ +E K KKKK+KKK
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKK 435
>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17.
Length = 93
Score = 33.2 bits (76), Expect = 0.060
Identities = 19/54 (35%), Positives = 26/54 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
K A K + K +K A KK+K K KK K KK+ K+++ K N N
Sbjct: 28 KPAPPKPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQEEAKEENPAENGETKTE 81
Score = 32.1 bits (73), Expect = 0.16
Identities = 17/49 (34%), Positives = 21/49 (42%), Gaps = 1/49 (2%)
Query: 30 KKEKKAKKKEKKKEKKK-KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K E K KK KK+K K KK K KK+ K++ N N
Sbjct: 33 KPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQEEAKEENPAENGETKTE 81
Score = 27.9 bits (62), Expect = 4.9
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 3/37 (8%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
EP P KKA K+KK K K + KKE K++E K
Sbjct: 35 EPKP---KKAAAKKKKPAVKGKKGAKGKKETKQEEAK 68
Score = 27.5 bits (61), Expect = 6.1
Identities = 15/49 (30%), Positives = 17/49 (34%)
Query: 34 KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
K + K KK KKK+ K KK K KK N N
Sbjct: 33 KPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQEEAKEENPAENGETKTE 81
Score = 27.1 bits (60), Expect = 9.3
Identities = 17/53 (32%), Positives = 19/53 (35%), Gaps = 13/53 (24%)
Query: 28 KKKKEKKAKKKEKKKEKK-------------KKEKKKKKKKKKKKKKNNNNNN 67
K+K E K K K+E K K E K KK KKKK
Sbjct: 2 KRKGEAKDAAKVTKQEPKRRSARLSAKPAPPKPEPKPKKAAAKKKKPAVKGKK 54
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 34.2 bits (79), Expect = 0.064
Identities = 17/39 (43%), Positives = 28/39 (71%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
AKK +K++ EK + +K+ KK K+++ KKK++K K K K
Sbjct: 97 AKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNK 135
Score = 33.4 bits (77), Expect = 0.094
Identities = 17/40 (42%), Positives = 27/40 (67%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KK KK++ +K + K K+ K+E+ K+K++K K K KKK
Sbjct: 98 KKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
Score = 32.6 bits (75), Expect = 0.21
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K K++++K +K KK K+++ KK+++K K K KKK
Sbjct: 98 KKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
Score = 32.2 bits (74), Expect = 0.27
Identities = 15/38 (39%), Positives = 26/38 (68%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
L ++ +K +KK KK K+++ K+K++K+K K KKK
Sbjct: 100 LKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
Score = 28.4 bits (64), Expect = 4.7
Identities = 17/38 (44%), Positives = 24/38 (63%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+K KKE+ K +KK KK KE++ KKK++K K N
Sbjct: 97 AKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKN 134
Score = 27.6 bits (62), Expect = 8.4
Identities = 13/38 (34%), Positives = 24/38 (63%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ KK +K+ +K ++K KK K+++ KKK++K
Sbjct: 93 KEFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKD 130
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 35.6 bits (82), Expect = 0.065
Identities = 12/41 (29%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ KK K KK+ K KK++ KE K+ +++ + + + K
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183
Score = 33.3 bits (76), Expect = 0.32
Identities = 17/55 (30%), Positives = 22/55 (40%), Gaps = 11/55 (20%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK-----------KKKKKKKKKN 62
D K + E + K K+ +KKK KK KKK K+KK N
Sbjct: 359 DAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKAN 413
Score = 32.9 bits (75), Expect = 0.43
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K +K K KKK +K KKK+ K+ K++
Sbjct: 144 TKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173
Score = 32.5 bits (74), Expect = 0.55
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+ KK K K+K KK KKK+ K+ K
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173
Score = 32.5 bits (74), Expect = 0.57
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
E KAKK + KK+ KK +KK+ K+ K +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173
Score = 32.5 bits (74), Expect = 0.61
Identities = 11/31 (35%), Positives = 19/31 (61%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
+ + KK K KK+ KK KKK+ K+ + ++
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173
Score = 31.3 bits (71), Expect = 1.4
Identities = 15/49 (30%), Positives = 25/49 (51%), Gaps = 8/49 (16%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEK--------KKKEKKKKKKKKKKKKK 61
+ +K ++ K+ +K K K+ +K KKK K+KK K + KK
Sbjct: 372 RTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAKFQGKK 420
Score = 30.6 bits (69), Expect = 2.6
Identities = 14/43 (32%), Positives = 20/43 (46%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
E K + K KK + KKK KK KKK+ K+ ++
Sbjct: 131 SIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173
Score = 30.2 bits (68), Expect = 3.4
Identities = 12/31 (38%), Positives = 21/31 (67%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
E K +K K +KK KK KKK+ K++++ ++
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173
Score = 29.8 bits (67), Expect = 4.3
Identities = 10/45 (22%), Positives = 27/45 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K KK+ KK +KK K+ K+ +++ + + + K ++ +++++
Sbjct: 148 KGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSD 192
Score = 29.4 bits (66), Expect = 5.9
Identities = 9/43 (20%), Positives = 23/43 (53%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K++EK++KE++ ++ E + + K+E K + + +
Sbjct: 332 KEREKEQKEREGRQSEWEARQAKREGGDAKAGRAAEPTGSRTQ 374
Score = 29.0 bits (65), Expect = 6.9
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ K KK K K+K K+ KKKE K+ K +++
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESE 176
Score = 29.0 bits (65), Expect = 7.0
Identities = 9/41 (21%), Positives = 24/41 (58%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ K K+ K K+K K K+K+ ++ + +++ + + + K+
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183
Score = 28.6 bits (64), Expect = 8.5
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
KKAK E++K ++ K EKK + K + +++ + K
Sbjct: 14 KKAKGFERQKLIRRIKNAEKKSKNDKLARIEEEIEALKT 52
Score = 28.6 bits (64), Expect = 8.7
Identities = 13/43 (30%), Positives = 24/43 (55%)
Query: 10 GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
G T+ KK KK KKK+ K++ K+ ++E + +++ K
Sbjct: 141 GIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 34.7 bits (80), Expect = 0.070
Identities = 10/40 (25%), Positives = 33/40 (82%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+K +++++ + K+ ++ E+++EKKK+E+++K+++++ +K
Sbjct: 31 RKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARK 70
Score = 32.4 bits (74), Expect = 0.34
Identities = 12/41 (29%), Positives = 32/41 (78%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KK K +EK+A++++++ E++++E++KK ++K++ ++
Sbjct: 2 KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGER 42
Score = 32.4 bits (74), Expect = 0.37
Identities = 11/42 (26%), Positives = 35/42 (83%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K +K+E ++KE++ ++E++K+K+++E+K+++++ +K+++
Sbjct: 33 KLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74
Score = 31.2 bits (71), Expect = 0.96
Identities = 13/50 (26%), Positives = 35/50 (70%), Gaps = 8/50 (16%)
Query: 21 KKAKKKEKKKKEKKA--------KKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K ++K+ ++++++A KK E+K+E ++KE+++ +++++KKK+
Sbjct: 9 AKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEE 58
Score = 30.4 bits (69), Expect = 1.4
Identities = 12/41 (29%), Positives = 33/41 (80%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +K+E++ +E++ KKKE+++ K+++E+ +K++++ +K K
Sbjct: 39 EGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLK 79
Score = 28.9 bits (65), Expect = 5.1
Identities = 8/47 (17%), Positives = 31/47 (65%), Gaps = 4/47 (8%)
Query: 21 KKAKKKEKKKKEKKAKKKE----KKKEKKKKEKKKKKKKKKKKKKNN 63
KK K E+K+ ++ ++ E ++++K +++++ ++K++++ ++
Sbjct: 6 KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEER 52
Score = 28.5 bits (64), Expect = 6.7
Identities = 8/44 (18%), Positives = 36/44 (81%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
L+ K+ +++++++ ++ ++K+K++E++K+ +++ +K++++ +K
Sbjct: 34 LEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEK 77
Score = 28.1 bits (63), Expect = 8.5
Identities = 9/41 (21%), Positives = 32/41 (78%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ +++E +++ +K K++E++KE++++ +K++++ +K K
Sbjct: 41 ERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSS 81
>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI). YihI activates the
GTPase activity of Der, a 50S ribosomal subunit
stability factor. The stimulation is specific to Der as
YihI does not stimulate the GTPase activity of Era or
ObgE. The interaction of YihI with Der requires only the
C-terminal 78 amino acids of YihI. A yihI deletion
mutant is viable and shows a shorter lag period, but the
same post-lag growth rate as a wild-type strain. yihI is
expressed during the lag period. Overexpression of yihI
inhibits cell growth and biogenesis of the 50S ribosomal
subunit. YihI is an unusual, highly hydrophilic protein
with an uneven distribution of charged residues,
resulting in an N-terminal region with high pI and a
C-terminal region with low pI.
Length = 169
Score = 34.2 bits (79), Expect = 0.073
Identities = 20/89 (22%), Positives = 40/89 (44%), Gaps = 4/89 (4%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
+K K K A K KKK + + +++ +++K+KKK+K + + +N + +
Sbjct: 1 RKSGKNGPKLAPKG-KKKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAAQK 59
Query: 85 NNEEKGTFLYKMAAPLVLVLTYVAEITQP 113
+ G+ K PL++ E P
Sbjct: 60 KDPRIGS---KKPVPLIVEFKVKPEKQAP 85
Score = 32.7 bits (75), Expect = 0.22
Identities = 14/57 (24%), Positives = 28/57 (49%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+K+ K K K KK + +++ +E+K+KKK+K K + +N + +
Sbjct: 1 RKSGKNGPKLAPKGKKKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAA 57
Score = 31.9 bits (73), Expect = 0.50
Identities = 12/56 (21%), Positives = 27/56 (48%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
K + K KK+ + E ++ +++K+KKK+K K+ + +N + +
Sbjct: 2 KSGKNGPKLAPKGKKKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAA 57
>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator. NST1
is a family of proteins that seem to be involved,
directly or indirectly, in the salt sensitivity of some
cellular functions in yeast. These proteins also
interact with the splicing factor Msl1p.
Length = 189
Score = 34.5 bits (79), Expect = 0.073
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNN 76
K KKKKKK+ K + ++N ++ + NN
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 34.1 bits (78), Expect = 0.086
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 48 EKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
K KKKKKK+ K + ++N ++ + NN
Sbjct: 31 SKSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 33.4 bits (76), Expect = 0.17
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K KKKKKK+ K + ++N ++ + NN
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 31.4 bits (71), Expect = 0.77
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 47 KEKKKKKKKKKKKKKNNNNNNNNNNNN 73
K KKKKKK+ K ++N ++ + NN
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 31.4 bits (71), Expect = 0.81
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 53 KKKKKKKKKNNNNNNNNNNNNNNNNNN 79
K KKKKKK++ + ++N ++ + NN
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 31.1 bits (70), Expect = 0.91
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNN 78
K KKKKKK+ + ++N ++ + NN
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 30.7 bits (69), Expect = 1.5
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K KKKKKK + + ++N ++ + NN
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 30.3 bits (68), Expect = 1.6
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNN 72
K +KKKKK+ K +N ++ + NN
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 30.3 bits (68), Expect = 2.0
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 45 KKKEKKKKKKKKKKKKKNNNNNNNNNN 71
K K+KKKK+ K N ++ + NN
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNV 58
Score = 29.1 bits (65), Expect = 4.2
Identities = 9/28 (32%), Positives = 14/28 (50%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
K KK+KKK+ K ++ + NN
Sbjct: 31 SKSKKKKKKRSKATSPSHNASDQSTNNV 58
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 35.6 bits (82), Expect = 0.073
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
P KKK K+ K+K++ E KE+ +++ K+
Sbjct: 211 PPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDV 253
Score = 33.3 bits (76), Expect = 0.35
Identities = 18/55 (32%), Positives = 23/55 (41%), Gaps = 13/55 (23%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK-------------KKKKKKKKKKN 62
K AKK + KE +K E K K K+ KKK K+KK+K
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKK 233
Score = 33.3 bits (76), Expect = 0.37
Identities = 17/63 (26%), Positives = 23/63 (36%), Gaps = 1/63 (1%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKE-KKKKEKKKKKKKKKKKK 60
S+ K Q + T K K K KK KKK K+KK+KK+ +
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASES 238
Query: 61 KNN 63
Sbjct: 239 TVK 241
Score = 32.5 bits (74), Expect = 0.68
Identities = 13/48 (27%), Positives = 22/48 (45%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ L KK K K KEKK++K+ E K++ +++ K
Sbjct: 203 KAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGK 250
Score = 32.1 bits (73), Expect = 0.81
Identities = 11/53 (20%), Positives = 21/53 (39%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+ + + K KK ++ + +K E K K K+ KK N ++
Sbjct: 168 PKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSS 220
Score = 31.7 bits (72), Expect = 1.2
Identities = 11/41 (26%), Positives = 23/41 (56%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
P + + +++ K + E++ E+K+KEK+K+ KK
Sbjct: 260 AEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300
Score = 31.3 bits (71), Expect = 1.2
Identities = 11/64 (17%), Positives = 21/64 (32%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
+K + + K KK + KE +K + K K + N +N
Sbjct: 157 TANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSN 216
Query: 75 NNNN 78
++
Sbjct: 217 IMSS 220
Score = 31.3 bits (71), Expect = 1.3
Identities = 13/54 (24%), Positives = 24/54 (44%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
P + + K K KK + +K+ +K + K K K+ KK+N ++
Sbjct: 167 PPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSS 220
Score = 30.6 bits (69), Expect = 2.2
Identities = 10/36 (27%), Positives = 22/36 (61%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+++ + E K + E++ +EK+K+K+K+ KK
Sbjct: 265 LDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300
Score = 30.2 bits (68), Expect = 3.4
Identities = 11/50 (22%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKE--KKKKKKKKKKKKK 61
L+ + A+ + E + + K + +E ++K+K+K+K+ KK
Sbjct: 251 RDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300
Score = 29.8 bits (67), Expect = 4.5
Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 7/48 (14%)
Query: 21 KKAKKKEKKKKEKK------AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KA ++ +K KKK K+K K+KKE + K++ ++++
Sbjct: 202 VKAASLKRNPPKKSNIMSSFFKKKTKEK-KEKKEASESTVKEESEEES 248
Score = 29.4 bits (66), Expect = 6.2
Identities = 10/31 (32%), Positives = 13/31 (41%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K K K KK K KKKK+++
Sbjct: 389 PPTKPKPKVSTPAVPAAAKKPKAPKKKKQSS 419
Score = 29.0 bits (65), Expect = 6.5
Identities = 9/41 (21%), Positives = 23/41 (56%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KK ++KK++K+A + K+E +++ K+ + + +
Sbjct: 224 KKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTG 264
>gnl|CDD|233174 TIGR00893, 2A0114, D-galactonate transporter. [Transport and
binding proteins, Carbohydrates, organic alcohols, and
acids].
Length = 399
Score = 35.4 bits (82), Expect = 0.074
Identities = 58/339 (17%), Positives = 95/339 (28%), Gaps = 70/339 (20%)
Query: 106 YVAEITQPHLRG-----MLSATASMTTIFGTVSQLFLGSFLHWRSAAIL-NLLFPILALC 159
VA R SA I G + L F W+ A I+ +L I +
Sbjct: 106 IVASWFPASERATAVSIFNSAQGLGGIIGGPLVGWILIHF-SWQWAFIIEGVLGIIWGVL 164
Query: 160 ALYFI---PESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELKRLG 216
L FI P+ WL + + L +G + QI + LK
Sbjct: 165 WLKFIPDPPQKAKWLTEEEKYIVVGGLLAEQQGKG---PSTPKKYQIKEL-----LK--- 213
Query: 217 KDGQRRPNYRMYMRRTFLLPYAIVTSLFFIG--------QFGGMTTLQTYAVGIFESIHA 268
+ R++ L + + L F Q G++ L+
Sbjct: 214 -------DRRVWG--LALGQFLVNIGLGFFLTWFPTYLVQERGLSILEAG---------- 254
Query: 269 PLDPYFATLLLGVAELGGALLCVVLIHYTGKRPLALIS------TGGSAACFIVVAVYAQ 322
F L G+ G +L L +R +L+ G ++ A
Sbjct: 255 -----FMASLPGIVGFIGMILGGRLSDLLLRRGKSLVFARKTAIIAGLVLSLLMFA---- 305
Query: 323 FHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFA 382
+ + P + L F + W LI + P NI G S +
Sbjct: 306 TNYV---NIPYAALALVALGFFGLGAGA--IGWALISDNAPGNIAGLTGGLINSLGNLGG 360
Query: 383 FAVNKLYYPMLDTFHLWGTLYFY--AAISVVGTLYMYFV 419
+ + T + A + Y+ V
Sbjct: 361 IVGPIVIGAIAATTGSFAGALMVVAALALIGALSYLLLV 399
>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962). This
eukaryotic family of proteins has no known function.
Length = 155
Score = 34.2 bits (79), Expect = 0.075
Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 3/46 (6%)
Query: 21 KKAKKKEKKKK---EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K K KKKK K K K+ + + +K +KKK ++ +K++
Sbjct: 1 KVTKNIAKKKKVLHPKSRKAKQLNRATLRDDKLEKKKAERAEKRSL 46
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 35.0 bits (81), Expect = 0.075
Identities = 15/45 (33%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+K K+ EK++ EK+ + EKK E+ + KK +K+ +K++ K +
Sbjct: 152 RKLKEAEKQE-EKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEA 195
Score = 30.8 bits (70), Expect = 1.4
Identities = 13/50 (26%), Positives = 25/50 (50%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
E K E+K K+ EK++EK+++ +KK ++ + KK +
Sbjct: 146 ESKSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEA 195
Score = 30.8 bits (70), Expect = 1.5
Identities = 12/48 (25%), Positives = 27/48 (56%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
K ++K K+A+K+E+K+E+ ++K ++ + KK +K +
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEA 195
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 35.4 bits (82), Expect = 0.078
Identities = 11/38 (28%), Positives = 18/38 (47%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
K A KK K K+ + +K K++ K K + +K
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47
Score = 34.6 bits (80), Expect = 0.16
Identities = 8/37 (21%), Positives = 17/37 (45%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KK+ K + +E+K K++ + K + +
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSR 46
Score = 31.5 bits (72), Expect = 1.2
Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K KK KAK+ E+ EK KE+ K K +
Sbjct: 10 KNALKKRLKAKQAEE--EKAAKEEAKAAAAAAAAKGRS 45
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 34.6 bits (80), Expect = 0.079
Identities = 21/55 (38%), Positives = 27/55 (49%), Gaps = 14/55 (25%)
Query: 21 KKAKKKEKKKKEK--KAKKKEKKKEKKKK------------EKKKKKKKKKKKKK 61
KKAKKK+KK+K+ AKK K K + K KK + K+K KK
Sbjct: 158 KKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDPRMKKDLRALKRKAKK 212
Score = 34.2 bits (79), Expect = 0.094
Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 17/58 (29%)
Query: 21 KKAKKK-------------EKKKKEKKAKKKEKKKEKKKKEK----KKKKKKKKKKKK 61
+KAKKK EK K+ KK KK KKK+KK+K+ KK K K + K
Sbjct: 129 EKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGNKGKAGRPK 186
Score = 31.9 bits (73), Expect = 0.62
Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 16/55 (29%)
Query: 23 AKKKEKKKKEK---KAKKK-------------EKKKEKKKKEKKKKKKKKKKKKK 61
AK ++K + K KAKKK EK K+ KK KK KKKKKK+KK
Sbjct: 116 AKARKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKT 170
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 33.4 bits (77), Expect = 0.081
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 15/57 (26%)
Query: 21 KKAKKKEKKKKEKKAKKKEKK-------KEKKKKE----KKK----KKKKKKKKKKN 62
K A++++K+K++KK KKKE + +EKKK+E KK K++ +K K+K
Sbjct: 72 KAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKKFEEDKERVEKMKEKR 128
Score = 31.1 bits (71), Expect = 0.61
Identities = 17/44 (38%), Positives = 28/44 (63%), Gaps = 6/44 (13%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKK------KKEKKKKKKKKKKKK 60
A K +++K++K KKK+KKKE + +EKKK+++ KK
Sbjct: 70 AGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKK 113
Score = 27.6 bits (62), Expect = 8.5
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K ++ K +E+KK+K+KKKKKKK
Sbjct: 61 RKGPAAGEEAGKAAEEEKKEKEKKKKKKK 89
>gnl|CDD|187760 cd09320, TDT_like_2, The Tellurite-resistance/Dicarboxylate
Transporter (TDT) family. The
Tellurite-resistance/Dicarboxylate Transporter (TDT)
family includes members from all three kingdoms, but
only three members of the family have been functionally
characterized: the TehA protein of E. coli functioning
as a tellurite-resistance uptake permease, the Mae1
protein of S. pombe functioning in the uptake of malate
and other dicarboxylates, and the sulfite efflux pump
(SSU1) of Saccharomyces cerevisiae. In plants, the
plasma membrane protein SLAC1 (Slow Anion
Channel-Associated 1), which is preferentially expressed
in guard cells, encodes a distant homolog of fungal and
bacterial dicarboxylate/malic acid transport proteins.
SLAC1 is essential in mediating stomatal responses to
physiological and stress stimuli. Members of the TDT
family exhibit 10 putative transmembrane alpha-helical
spanners (TMSs).
Length = 327
Score = 35.0 bits (81), Expect = 0.081
Identities = 25/120 (20%), Positives = 34/120 (28%), Gaps = 25/120 (20%)
Query: 238 AIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGV-----AELGGALLCVV 292
A+ T L+ IG G+ T + P L V A G L+
Sbjct: 100 AVDTVLWVIGTVLGLVTAVVIPRRLIR--RHRGTPTPVWGLPVVPPMVSATTGAQLV--- 154
Query: 293 LIHY--TGKRPLALISTGGSAACF---------IVVAVYAQFHLSYGWDSPLVPTVFLVL 341
H R L+ ACF I VY + PT ++ L
Sbjct: 155 -PHLPAGQPRLTLLL---ACYACFGLSLLAAAIIFALVYRRLWRGGPLPLAAAPTAWIPL 210
>gnl|CDD|151365 pfam10918, DUF2718, Protein of unknown function (DUF2718). This
viral family of proteins has no known function.
Length = 140
Score = 33.8 bits (77), Expect = 0.082
Identities = 18/98 (18%), Positives = 28/98 (28%), Gaps = 15/98 (15%)
Query: 12 FTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN-------- 63
F + P KK + + EK K K++ +K ++ NN
Sbjct: 40 FHDVLPDPHKK--DDDASVPIMPPEPSPIPNEKPLKLKEQAEKPPISERNNNGVFDFMKI 97
Query: 64 -----NNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKM 96
N N N + + FL KM
Sbjct: 98 PNPFKRYYEYCYPYPNKNAKNQPKRVEEKSEKGFLEKM 135
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 34.9 bits (81), Expect = 0.085
Identities = 8/41 (19%), Positives = 32/41 (78%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K+ +K+ ++++E K + +++K ++++E++ +++++K++K
Sbjct: 140 RKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEK 180
Score = 31.8 bits (73), Expect = 0.82
Identities = 7/39 (17%), Positives = 29/39 (74%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K +K+E+++ ++ +++ K +++E+++K+K+++++
Sbjct: 33 KAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERRE 71
Score = 31.8 bits (73), Expect = 0.93
Identities = 6/42 (14%), Positives = 31/42 (73%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ +++++ + ++K +K++K +E+ + +++ ++K+++K+
Sbjct: 105 IERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKE 146
Score = 31.0 bits (71), Expect = 1.7
Identities = 11/42 (26%), Positives = 31/42 (73%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKKKKEKK-AKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK K E+K++E++ + E+++ K E++++++K+K++++
Sbjct: 29 KKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERR 70
Score = 30.6 bits (70), Expect = 1.8
Identities = 8/43 (18%), Positives = 32/43 (74%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+KA+++E+++ E++ +K+EK++E + ++++ + ++++ +
Sbjct: 161 EKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDE 203
Score = 29.5 bits (67), Expect = 4.4
Identities = 10/46 (21%), Positives = 33/46 (71%), Gaps = 6/46 (13%)
Query: 22 KAKKKEKKKKEKKAKKKEKKK------EKKKKEKKKKKKKKKKKKK 61
++ E+K++EK+ +++E+ K EK ++E++++ +++++K++
Sbjct: 134 NEERIERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEE 179
Score = 29.1 bits (66), Expect = 6.6
Identities = 7/39 (17%), Positives = 31/39 (79%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++K ++++E++A+++E+K+EK+++ + + ++++ + +
Sbjct: 159 QREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDE 197
Score = 29.1 bits (66), Expect = 6.7
Identities = 10/46 (21%), Positives = 31/46 (67%), Gaps = 6/46 (13%)
Query: 23 AKKKEKKKKEKKAKKKEKK------KEKKKKEKKKKKKKKKKKKKN 62
A+ +EKK+ + + K++E++ +E+ K +++++++K+K++
Sbjct: 24 AQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEER 69
Score = 28.3 bits (64), Expect = 9.7
Identities = 12/42 (28%), Positives = 32/42 (76%), Gaps = 5/42 (11%)
Query: 22 KAKKKEKKKKEKKAKK-----KEKKKEKKKKEKKKKKKKKKK 58
+A+ +EK++K+KK ++ E++ E+K++EK+++++++ K
Sbjct: 113 EAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELK 154
>gnl|CDD|168775 PRK07034, PRK07034, hypothetical protein; Provisional.
Length = 536
Score = 35.1 bits (80), Expect = 0.086
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 56 KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
+++ N++ N NNNN N +NNN N +NNN
Sbjct: 84 QQQPNANDSYPNGNNNNPNGDNNNPNGSNNN 114
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 33.3 bits (76), Expect = 0.088
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 9/46 (19%)
Query: 26 KEKKKKEKKAKKKE---------KKKEKKKKEKKKKKKKKKKKKKN 62
+ + KK + + +++ KK K+K++KKKKKKK KN
Sbjct: 77 NKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKKGAKN 122
Score = 32.1 bits (73), Expect = 0.21
Identities = 18/48 (37%), Positives = 24/48 (50%), Gaps = 7/48 (14%)
Query: 21 KKAKKKEKKKKEKK-------AKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA + KK + +++ KK KKK+EKKKKKKK K
Sbjct: 77 NKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124
Score = 29.8 bits (67), Expect = 1.4
Identities = 11/44 (25%), Positives = 23/44 (52%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K ++ K+KK+K+ + E+ EK + + K + KK + +
Sbjct: 50 KPAEEVKEKKKKEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQ 93
Score = 27.5 bits (61), Expect = 8.2
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
W ++ +K KK+++ KKK+KK K
Sbjct: 97 WMNQRQAKKLKKKREKKKKKKKGAKNLA 124
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7
plays a role in protein translation. Deletions of the
TMA7 gene results in altered protein synthesis rates.
Length = 63
Score = 31.7 bits (72), Expect = 0.094
Identities = 15/40 (37%), Positives = 23/40 (57%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
PL K +KKE +++ K+K+K++ K KE K K K
Sbjct: 10 PLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGK 49
Score = 31.4 bits (71), Expect = 0.13
Identities = 17/42 (40%), Positives = 22/42 (52%), Gaps = 9/42 (21%)
Query: 29 KKKEKKAKKKEKK---------KEKKKKEKKKKKKKKKKKKK 61
KKK KA KK+KK K+K+K+E K K+ K K
Sbjct: 7 KKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKG 48
Score = 30.2 bits (68), Expect = 0.34
Identities = 14/35 (40%), Positives = 18/35 (51%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K+K+K+E KA K+ K K K KK KK
Sbjct: 29 KQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGKK 63
Score = 28.3 bits (63), Expect = 1.8
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKK K KK++K+ +++ K+K+K++ K K
Sbjct: 7 KKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALK 40
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 35.4 bits (81), Expect = 0.099
Identities = 16/52 (30%), Positives = 35/52 (67%), Gaps = 5/52 (9%)
Query: 16 TPLDWKKAKKK-----EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
TPL K KK EK K+E + K +E+++ +K+KEK+++++++++ ++
Sbjct: 570 TPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERA 621
>gnl|CDD|113815 pfam05058, ActA, ActA Protein. The ActA family is found in
Listeria and is associated with motility. ActA protein
acts as a scaffold to assemble and activate host cell
actin cytoskeletal factors at the bacterial surface,
resulting in directional actin polymerisation and
propulsion of the bacterium through the cytoplasm of the
host cell.
Length = 601
Score = 35.2 bits (80), Expect = 0.10
Identities = 23/78 (29%), Positives = 31/78 (39%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
TE P + + E ++ KE +K K K+ K K K N NNNN
Sbjct: 42 TEEQPSEINTGPRYETAREVSSRDIKELEKSNKVKDANKADLIAMLKAKAEKGPNINNNN 101
Query: 73 NNNNNNNNNNNNNNEEKG 90
NN + + N N E G
Sbjct: 102 NNYSEQSENAAINEEASG 119
Score = 31.0 bits (69), Expect = 2.2
Identities = 17/73 (23%), Positives = 32/73 (43%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
+W++ K +E+ + + E +E ++ K+ +K K K N +
Sbjct: 36 EWEEEKTEEQPSEINTGPRYETAREVSSRDIKELEKSNKVKDANKADLIAMLKAKAEKGP 95
Query: 79 NNNNNNNNEEKGT 91
N NNNNNN + +
Sbjct: 96 NINNNNNNYSEQS 108
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 34.0 bits (78), Expect = 0.12
Identities = 18/68 (26%), Positives = 33/68 (48%), Gaps = 27/68 (39%)
Query: 19 DWKKAK-KKEKKKKEKKAKK--------------------------KEKKKEKKKKEKKK 51
DW+ K K+E+K + K +K ++K+KE+KKK K+
Sbjct: 139 DWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEA 198
Query: 52 KKKKKKKK 59
K+++K++K
Sbjct: 199 KRREKEEK 206
Score = 31.3 bits (71), Expect = 0.87
Identities = 10/30 (33%), Positives = 19/30 (63%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
++K+K+EKK K+ K++EK++K
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEKRMAALVAA 214
Score = 30.2 bits (68), Expect = 2.0
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
K+KE+KKK K+AK++EK++++
Sbjct: 187 KEKEEKKKVKEAKRREKEEKRMAALVAA 214
Score = 28.6 bits (64), Expect = 6.7
Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 26/68 (38%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEK--------------------------KKKEKKKKK 53
+ +K+K+E+KA+ +++KE+ K+KE+KKK
Sbjct: 136 MIADWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKV 195
Query: 54 KKKKKKKK 61
K+ K+++K
Sbjct: 196 KEAKRREK 203
Score = 28.2 bits (63), Expect = 8.3
Identities = 9/54 (16%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK------EKKKKKKKKKKKKKN 62
P+ + ++++K+KE + + +E + K + +K+K+++K
Sbjct: 99 PSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKAR 152
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 34.3 bits (79), Expect = 0.12
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KK+K ++ + K+ K K+K KKK+K K+ +
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAES 41
Score = 33.1 bits (76), Expect = 0.28
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KK K ++E + ++ K KKK KKK+K K+
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38
Score = 32.3 bits (74), Expect = 0.45
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K++K +++ A + K+ + KKK KKK+K K+
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38
Score = 32.3 bits (74), Expect = 0.54
Identities = 10/36 (27%), Positives = 20/36 (55%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KK++ +++ K+ K +KK +KK+K K+
Sbjct: 3 LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38
Score = 31.9 bits (73), Expect = 0.56
Identities = 12/35 (34%), Positives = 21/35 (60%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E K++KA+++ + K+ + KKK KKK+K
Sbjct: 1 ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIY 35
Score = 31.2 bits (71), Expect = 1.1
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
KK K +++ + + K KKK KKK K K+
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38
Score = 30.8 bits (70), Expect = 1.6
Identities = 11/33 (33%), Positives = 17/33 (51%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
E KK++ ++E + K+ K KKK KK
Sbjct: 1 ELLLKKRKARQELAVQVAKQAKAKKKANKKKRK 33
Score = 30.0 bits (68), Expect = 2.6
Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKK--KEKKKKEKKKKKKKKKKKKKN 62
KAKKK KKK K K+ + KE +K E++ + K+ KK
Sbjct: 21 AKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPG 64
Score = 28.9 bits (65), Expect = 5.5
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA KK++K K+A+ K+ K ++E + K+ KK K
Sbjct: 25 KKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGK 65
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 33.5 bits (77), Expect = 0.13
Identities = 11/49 (22%), Positives = 20/49 (40%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ P ++ + +K K K K E+KK + +K +KK
Sbjct: 81 PDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKK 129
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 32.4 bits (74), Expect = 0.13
Identities = 12/43 (27%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKK 60
W+K +K +++ KA++KE K EK+ ++ + + K+++ K
Sbjct: 30 WEKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAK 72
Score = 30.8 bits (70), Expect = 0.54
Identities = 16/49 (32%), Positives = 32/49 (65%), Gaps = 5/49 (10%)
Query: 19 DWKKAKKKEKKK--KEKKAKKKEKKKEKKKKEKKKKKK---KKKKKKKN 62
D K+A+++ + + KE++A K+EK++ +K K KK K+++K+N
Sbjct: 52 DEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKVERLKRREKRN 100
Score = 29.7 bits (67), Expect = 1.1
Identities = 15/37 (40%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKK--EKKKKEKKKKKKK 55
++A K+EK++ EK A K KK K++EK+ K K
Sbjct: 68 RRAAKEEKERYEKMAAKMHAKKVERLKRREKRNKLLK 104
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 34.5 bits (79), Expect = 0.13
Identities = 13/75 (17%), Positives = 22/75 (29%)
Query: 1 MDSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ + + + + KE KK KK + K K K KK
Sbjct: 108 IHHLQMLHHPQLGDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKP 167
Query: 61 KNNNNNNNNNNNNNN 75
K + +N +
Sbjct: 168 KKKGSVSNRSVKMPG 182
Score = 31.4 bits (71), Expect = 1.1
Identities = 12/42 (28%), Positives = 17/42 (40%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+ KK +K + K K + KK KKK + K
Sbjct: 145 KEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186
Score = 31.0 bits (70), Expect = 1.3
Identities = 11/44 (25%), Positives = 16/44 (36%), Gaps = 2/44 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK--KKN 62
K + K K K KK +KK + K + K K+
Sbjct: 150 PKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKS 193
Score = 29.1 bits (65), Expect = 5.3
Identities = 10/39 (25%), Positives = 16/39 (41%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K+ +K KK + K K K KK + +N +
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPG 182
Score = 28.7 bits (64), Expect = 9.1
Identities = 15/57 (26%), Positives = 19/57 (33%)
Query: 7 HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
H + T K KK KK + K K K K+ KKK + K
Sbjct: 130 HHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor
involved in ribosome biogenesis.
Length = 112
Score = 32.5 bits (74), Expect = 0.14
Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 9/50 (18%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEK---------KKKEKKKKKKKKKKKKK 61
K+ KKK K+ K+ K+++K E+ KK+ K + K KK+K
Sbjct: 50 KQRKKKPKRLTRKQRKRQKKGLERAEIVTDTLSKKRAKSIGRAKYVKKRK 99
Score = 31.7 bits (72), Expect = 0.30
Identities = 16/52 (30%), Positives = 28/52 (53%), Gaps = 8/52 (15%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEK--------KKKKKKKKKKKKNNNN 65
+KKK++KKK K+ +K++K++KK E+ KK+ K + K
Sbjct: 46 VSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTDTLSKKRAKSIGRAKYVKK 97
Score = 29.0 bits (65), Expect = 2.1
Identities = 12/52 (23%), Positives = 24/52 (46%), Gaps = 2/52 (3%)
Query: 21 KKAKKKEK--KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
K+ +K K +KK+ KKK ++ +K++K++KK +
Sbjct: 28 KRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTD 79
Score = 28.2 bits (63), Expect = 4.9
Identities = 14/40 (35%), Positives = 25/40 (62%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+AK+ ++K+ A K + KKK+ KKK K+ +K++K
Sbjct: 26 RAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRK 65
Score = 27.8 bits (62), Expect = 5.6
Identities = 14/66 (21%), Positives = 30/66 (45%), Gaps = 12/66 (18%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKE------------KKKEKKKKEKKKKKKKKKKKKKNNNN 65
+D K+ + ++ +K+ KKK++KKK K+ +K++K++KK
Sbjct: 14 IDLDKSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKKGLER 73
Query: 66 NNNNNN 71
+
Sbjct: 74 AEIVTD 79
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion
and trafficking].
Length = 150
Score = 33.1 bits (76), Expect = 0.15
Identities = 11/59 (18%), Positives = 30/59 (50%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
E ++ +E+ + +K+ E++K+ ++ K+K ++++ K + + N NN
Sbjct: 32 EKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90
Score = 32.7 bits (75), Expect = 0.19
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 6/57 (10%)
Query: 24 KKKEKKKKEKKA-----KKKEKKKEKKK-KEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
K E + K+ +E+KK +K K+K ++++ KN + N NN
Sbjct: 34 KDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90
Score = 30.0 bits (68), Expect = 1.8
Identities = 12/53 (22%), Positives = 26/53 (49%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
+ ++E + K+ E++KK +K K+K ++++ N + N NN
Sbjct: 38 INTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90
Score = 28.5 bits (64), Expect = 5.3
Identities = 11/43 (25%), Positives = 21/43 (48%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
TP + KK ++ K+K ++ + K + + E K KK+
Sbjct: 55 TPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQ 97
Score = 28.5 bits (64), Expect = 5.9
Identities = 12/57 (21%), Positives = 25/57 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K + +E+ KE +++K+ +K K+K +++ N + N NN
Sbjct: 34 KDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90
Score = 28.1 bits (63), Expect = 7.0
Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 6/61 (9%)
Query: 14 EPTPLDWKKAKKKEKKK-KEKKAKKKEKKKEKKK-----KEKKKKKKKKKKKKKNNNNNN 67
E T LD + +E+KK +K K+K +++E K + K KK+ ++
Sbjct: 45 EETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQLFSSEYE 104
Query: 68 N 68
Sbjct: 105 Q 105
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 34.5 bits (78), Expect = 0.15
Identities = 17/69 (24%), Positives = 32/69 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K+ +KE + K E++K+K ++EK+K +++K+K N N N
Sbjct: 162 KQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENE 221
Query: 81 NNNNNNEEK 89
+ E+K
Sbjct: 222 KQDLIKEQK 230
Score = 31.0 bits (69), Expect = 1.5
Identities = 18/94 (19%), Positives = 45/94 (47%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
D + K I E + +K K +++ + ++ K +++++K +++K+K +++K+K
Sbjct: 142 DRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTS 201
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
N N N N + +E+ F+ +
Sbjct: 202 NIANKNAIELEQEKQKTENEKQDLIKEQKDFIKE 235
Score = 29.9 bits (66), Expect = 4.3
Identities = 13/61 (21%), Positives = 28/61 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K ++K+K ++EK+ K + E++K+K + +K+ + N N+
Sbjct: 185 KTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQENH 244
Query: 81 N 81
N
Sbjct: 245 N 245
Score = 29.1 bits (64), Expect = 6.3
Identities = 13/64 (20%), Positives = 31/64 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+K K +++K+K ++ K+K K + +++K+K + +K + + N
Sbjct: 182 EKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQ 241
Query: 81 NNNN 84
N+N
Sbjct: 242 ENHN 245
Score = 29.1 bits (64), Expect = 6.4
Identities = 15/69 (21%), Positives = 33/69 (47%), Gaps = 2/69 (2%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN--NNNN 78
K ++K+K ++EK+ ++EK+K K + +++K+K N + +
Sbjct: 178 KAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAE 237
Query: 79 NNNNNNNNE 87
N N+N+
Sbjct: 238 QNCQENHNQ 246
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 33.9 bits (78), Expect = 0.15
Identities = 13/35 (37%), Positives = 19/35 (54%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
++E KK K KKK K + K K KK++K+
Sbjct: 205 ELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239
Score = 33.9 bits (78), Expect = 0.16
Identities = 18/41 (43%), Positives = 28/41 (68%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
KE+K++EKK KK E++ E +++ KK K KKKK K+ +
Sbjct: 188 KERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGK 228
Score = 32.7 bits (75), Expect = 0.39
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 7/54 (12%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK-------KKEKKKKKKKKKKKKKNNNNNN 67
K AKKK K KE K +K+ +KK KK ++E KK K KKKK + +
Sbjct: 175 KSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGK 228
Score = 32.3 bits (74), Expect = 0.47
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E +++ K K +KKK K K+ K K KK++K+
Sbjct: 205 ELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239
Score = 31.6 bits (72), Expect = 0.76
Identities = 14/41 (34%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK +++ + ++E K K KKK+ K + K K KK++K
Sbjct: 198 KKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERK 238
Score = 31.2 bits (71), Expect = 1.0
Identities = 16/41 (39%), Positives = 23/41 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK KK E++ + ++ K+ K +KKK K K K K KK
Sbjct: 195 KKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235
Score = 30.4 bits (69), Expect = 2.1
Identities = 17/48 (35%), Positives = 24/48 (50%), Gaps = 6/48 (12%)
Query: 21 KKAKKKEKK------KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+ K++EKK + E + + +K K KKKK K K K K K
Sbjct: 188 KERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235
Score = 30.0 bits (68), Expect = 2.6
Identities = 17/41 (41%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K EK+ KK KKK K KE K++K+++KK KK
Sbjct: 159 ISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKK 199
Score = 29.3 bits (66), Expect = 4.6
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+ + + + K+ K KKK+ K+K K K KK++K+
Sbjct: 203 RLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239
Score = 28.9 bits (65), Expect = 7.0
Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 17/68 (25%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKK----KEKKKKEKK-------------KKK 53
P +K ++K+K+K AKKK K KE+K++EKK K
Sbjct: 153 RENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMK 212
Query: 54 KKKKKKKK 61
K K KKKK
Sbjct: 213 KGKGKKKK 220
Score = 28.5 bits (64), Expect = 8.8
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ + +K +K+KKK KKK K K+ K++K+++
Sbjct: 157 PRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREK 195
>gnl|CDD|219511 pfam07681, DoxX, DoxX. These proteins appear to have some sequence
similarity with pfam04173 but their function is unknown.
Length = 84
Score = 31.8 bits (73), Expect = 0.17
Identities = 28/95 (29%), Positives = 40/95 (42%), Gaps = 15/95 (15%)
Query: 233 FLLPYAIVTSLFFIGQFGGMTTLQ---TYAVGIFESIHAPLDPYFATLLLGVAELGGALL 289
LL ++ L FI G+ L G F S+ P P L + EL G LL
Sbjct: 1 GLLLLRLLLGLLFI--AHGLQKLFGGFAGTAGYFASLGLPP-PALLAYLAALLELVGGLL 57
Query: 290 CVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFH 324
++L +T R AL A F++VA++ H
Sbjct: 58 -LLLGLFT--RLAALG-----LAVFMLVAIFL-VH 83
>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
I [DNA replication, recombination, and repair].
Length = 140
Score = 32.9 bits (75), Expect = 0.18
Identities = 17/47 (36%), Positives = 23/47 (48%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+K K KEKK + ++K +KKEKKK + K K N N
Sbjct: 94 TEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSNKPCCLKVENLKN 140
Score = 29.1 bits (65), Expect = 3.6
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
+ +K K KEKK K K +KK+KKK +N
Sbjct: 92 RFTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSN 128
Score = 27.9 bits (62), Expect = 8.7
Identities = 13/46 (28%), Positives = 21/46 (45%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
+ K ++KK K +K +K++KKK + K K N N
Sbjct: 95 EKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSNKPCCLKVENLKN 140
>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
Length = 862
Score = 34.3 bits (79), Expect = 0.18
Identities = 13/41 (31%), Positives = 16/41 (39%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ K KK KA K +K KK KK KK
Sbjct: 816 REPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKT 856
Score = 34.3 bits (79), Expect = 0.20
Identities = 15/45 (33%), Positives = 17/45 (37%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
EP KK K KA+K KK KK KK +K
Sbjct: 817 EPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARKT 861
Score = 32.3 bits (74), Expect = 0.82
Identities = 14/40 (35%), Positives = 16/40 (40%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
A KK K A K +K KK KK KK +K
Sbjct: 821 AAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARK 860
Score = 31.9 bits (73), Expect = 1.0
Identities = 11/36 (30%), Positives = 13/36 (36%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
++ K KK K K EK KK KK
Sbjct: 815 PREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKK 850
Score = 31.6 bits (72), Expect = 1.2
Identities = 17/48 (35%), Positives = 18/48 (37%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
EP AKK K A K EK KK KK KK +K
Sbjct: 814 EPREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARKT 861
>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family. Members of this family are
short proteins that are rich in aspartate, glutamate,
lysine and arginine. Although the function of these
proteins is unknown, they are found to be ubiquitously
expressed.
Length = 38
Score = 30.3 bits (69), Expect = 0.18
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
+ +E+ AKK KKK+K K + K K
Sbjct: 9 QARERNAKKAAKKKKKGAKSQLKAAAKAL 37
Score = 27.6 bits (62), Expect = 1.6
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++E+ ++ KK KKKK+ K + K K
Sbjct: 6 QREQARERNAKKAAKKKKKGAKSQLKAAAKALE 38
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 32.0 bits (73), Expect = 0.18
Identities = 16/35 (45%), Positives = 19/35 (54%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KK++KKA K K +KKK K K K K NN
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNN 38
Score = 31.6 bits (72), Expect = 0.30
Identities = 15/35 (42%), Positives = 17/35 (48%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KK++K K K KKKK K K K K NN
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNN 38
Score = 30.4 bits (69), Expect = 0.61
Identities = 14/35 (40%), Positives = 18/35 (51%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK+++K AK K K+KK K K K K N
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNN 38
Score = 30.0 bits (68), Expect = 1.0
Identities = 15/34 (44%), Positives = 17/34 (50%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
KK++KK K A K KKKK K K K K
Sbjct: 2 PPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35
Score = 29.3 bits (66), Expect = 1.6
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKK 59
KK++KK K K KKKK
Sbjct: 2 PPKKQQKKAAKAAAASAGGKAKKKK 26
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 33.2 bits (76), Expect = 0.18
Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 1/58 (1%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
D ++ + P + E K +++ E KE K K K KK K+K
Sbjct: 102 DLLQDYKKSLEEDTIPNHLNEEVSNETKLT-QESSSDESPKEVKLATKNKTKKHDKEK 158
Score = 30.9 bits (70), Expect = 0.93
Identities = 12/67 (17%), Positives = 29/67 (43%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
KKK K ++ K + + K ++ + KK ++ N+ N +N ++
Sbjct: 76 SLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESS 135
Query: 82 NNNNNEE 88
++ + +E
Sbjct: 136 SDESPKE 142
Score = 30.5 bits (69), Expect = 1.1
Identities = 6/63 (9%), Positives = 21/63 (33%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
E + K + ++ KK ++ ++ + ++++ + N
Sbjct: 94 ENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKK 153
Query: 87 EEK 89
+K
Sbjct: 154 HDK 156
Score = 29.7 bits (67), Expect = 2.4
Identities = 12/64 (18%), Positives = 24/64 (37%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
E KK+ K ++ + K + E K + + KK+ + N+ N +N
Sbjct: 73 AVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQ 132
Query: 85 NNEE 88
+
Sbjct: 133 ESSS 136
Score = 28.9 bits (65), Expect = 4.2
Identities = 14/69 (20%), Positives = 27/69 (39%), Gaps = 2/69 (2%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK K E+ K + E K + + KK ++ N+ N +N ++
Sbjct: 78 KKKLKFEELDDLKITAENEIKS--IQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESS 135
Query: 81 NNNNNNEEK 89
++ + E K
Sbjct: 136 SDESPKEVK 144
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 33.7 bits (78), Expect = 0.19
Identities = 13/42 (30%), Positives = 27/42 (64%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K K ++ + + + ++ K+KK+KKKKKKKK++ ++
Sbjct: 63 QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELRE 104
Score = 28.4 bits (64), Expect = 9.1
Identities = 11/33 (33%), Positives = 22/33 (66%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
+ ++ + +E +KK+KKK+KKKKE+ ++
Sbjct: 73 GELYERADRAEELLKEKKKKKKKKKKKEELREW 105
>gnl|CDD|223605 COG0531, PotE, Amino acid transporters [Amino acid transport and
metabolism].
Length = 466
Score = 34.0 bits (78), Expect = 0.19
Identities = 21/144 (14%), Positives = 47/144 (32%), Gaps = 18/144 (12%)
Query: 227 MYMRRTFLLPYAIVTS-LFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELG 285
Y +R + L+ + + + A + LL + +
Sbjct: 80 AYAKRALGPRLGFLAGWLYLLAYVIALAAIAIGAASYLSYLFPGPGLLSIGPLLIIL-IA 138
Query: 286 GALLCVV-LIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYG---------WDSPLVP 335
AL+ ++ L++ G + A I++ + I++ ++ L +P
Sbjct: 139 LALIALLTLLNLRGIKASAKINSIITILKIIILLIFIILGLFAFGFSNGNLFAPFNPGGG 198
Query: 336 TVFLVLAAFLTHICIRLLPWMLIG 359
+ +LAA L L + G
Sbjct: 199 SFGGILAAIL------LAFFAFTG 216
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 33.6 bits (77), Expect = 0.21
Identities = 15/47 (31%), Positives = 26/47 (55%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
E ++ KKK+E +AKKK++ +E K +KK + K+K+
Sbjct: 76 DEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122
Score = 30.9 bits (70), Expect = 1.4
Identities = 11/41 (26%), Positives = 21/41 (51%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+ ++ KK ++ E KK+K+ +E K +KK + K
Sbjct: 82 ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122
Score = 28.6 bits (64), Expect = 8.6
Identities = 11/44 (25%), Positives = 23/44 (52%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
+ +P A E ++ +KK + + KKK++ ++ K +KK
Sbjct: 70 SDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
>gnl|CDD|234537 TIGR04313, aro_clust_Mycop, aromatic cluster surface protein.
Members of this family are absolutely restricted to the
Mollicutes (Mycoplasma and Ureaplasma). All have a
signal peptide, usually of the lipoprotein type,
suggesting surface expression. Most members have lengths
of about 280 residues but some members have a nearly
full-length duplication. The mostly nearly invariant
residue, a Trp,is part of a strongly conserved 9-residue
motif, [ND]-W-[LY]-[WF]-X-[LF]-X-N-[LI], where X usually
is hydrophobic. Because the hydrophobic six-residue core
of this motif almost always contains three to four
aromatic residues, we name this family aromatic cluster
surface protein. Multiple paralogs may occur in a given
Mycoplasma, usually clustered on the genome.
Length = 297
Score = 33.5 bits (77), Expect = 0.21
Identities = 15/84 (17%), Positives = 31/84 (36%), Gaps = 8/84 (9%)
Query: 3 SSTKHQIGTFTEPTPLDWKKAKKKE--------KKKKEKKAKKKEKKKEKKKKEKKKKKK 54
K++ + WK + K ++ K K K +K + K
Sbjct: 34 KLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNINSELYLKL 93
Query: 55 KKKKKKKNNNNNNNNNNNNNNNNN 78
K + NN NN+++N+++ +
Sbjct: 94 LKLSLEYLNNINNSSSNDSDFFSY 117
Score = 32.0 bits (73), Expect = 0.78
Identities = 19/92 (20%), Positives = 36/92 (39%), Gaps = 17/92 (18%)
Query: 21 KKAKKKEKKKKEKKAK----------------KKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KK +K KK K + EK K K ++ + K K +
Sbjct: 40 FDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNINSELYLKLLKLSLE 99
Query: 65 -NNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
NN NN+++N+++ + +++K L+
Sbjct: 100 YLNNINNSSSNDSDFFSYLYEKSKKKLNELFS 131
Score = 29.7 bits (67), Expect = 3.5
Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 17/92 (18%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKK--------------KKKKKKKKKKKNNNNN 66
+ KK K K K K+ KK K K +K K K +KN N+
Sbjct: 29 ENDLKKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNINSE 88
Query: 67 NNNNNNNNNN---NNNNNNNNNNEEKGTFLYK 95
+ NN NN+++N+ + ++LY+
Sbjct: 89 LYLKLLKLSLEYLNNINNSSSNDSDFFSYLYE 120
Score = 28.9 bits (65), Expect = 6.2
Identities = 19/70 (27%), Positives = 26/70 (37%), Gaps = 7/70 (10%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN-------NNNNNNNNNNNNNNNNN 85
+ +K + K KK KK KK K NN NN N+ + N + N N
Sbjct: 27 QSENDLKKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNIN 86
Query: 86 NEEKGTFLYK 95
+E L
Sbjct: 87 SELYLKLLKL 96
Score = 28.5 bits (64), Expect = 8.5
Identities = 14/51 (27%), Positives = 21/51 (41%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
K+ E+ K K KKK K+N +N + N + +NN N
Sbjct: 157 KQNEQFNIDAKHNSLFLGKFLKKKPKSNKFIDNYIDKLKNGEESAESNNYN 207
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 33.5 bits (77), Expect = 0.21
Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 13/57 (22%)
Query: 23 AKKKEKKKKEKKAKKKEKKK----------EKKKKEKKK---KKKKKKKKKKNNNNN 66
A+ KK E+ K++E KK +++ KEKK K KK KKK+K +
Sbjct: 126 AEAAAKKASEEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDE 182
Score = 33.1 bits (76), Expect = 0.32
Identities = 19/43 (44%), Positives = 27/43 (62%), Gaps = 4/43 (9%)
Query: 22 KAKK-KEKKKKEKK---AKKKEKKKEKKKKEKKKKKKKKKKKK 60
+A+K +E KK K+ K +E+ KEKK +K KK KKK+K
Sbjct: 136 EARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKG 178
Score = 30.8 bits (70), Expect = 1.8
Identities = 15/66 (22%), Positives = 21/66 (31%), Gaps = 20/66 (30%)
Query: 19 DWKKAKK------------KEKKKKEKKAKKKEKKKEKKKKEKKKKKKK--------KKK 58
KK +K + K KK K + K K+K K KK+
Sbjct: 171 KLKKKRKGGGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDAKFGFGGKKR 230
Query: 59 KKKNNN 64
K+N
Sbjct: 231 GSKSNT 236
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 32.2 bits (73), Expect = 0.22
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
KK KK ++ EKK +KK+KK N
Sbjct: 50 LNKKPKKISAEEAEKKLLQKKEKKALTN 77
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 33.3 bits (76), Expect = 0.22
Identities = 18/66 (27%), Positives = 32/66 (48%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
++ E+KK + KK+ + +E K+KKKKK KKKK+ + N + ++ +
Sbjct: 64 RQAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVH 123
Query: 84 NNNEEK 89
E
Sbjct: 124 EQKSET 129
Score = 31.0 bits (70), Expect = 1.3
Identities = 15/36 (41%), Positives = 21/36 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
KK + KK+ E ++ K+KKKK+ KKKK K
Sbjct: 69 KKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family
consists of several ATP synthase E chain sequences
which are components of the CF(0) subunit.
Length = 83
Score = 31.3 bits (71), Expect = 0.22
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KKKE+K +E +A++K +K K KKK+ K++ K
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAK 67
Score = 30.1 bits (68), Expect = 0.64
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K+K++K ++ + +EK EK K KKK+ K++ K
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAK 67
>gnl|CDD|217935 pfam04158, Sof1, Sof1-like domain. Sof1 is essential for cell
growth and is a component of the nucleolar rRNA
processing machinery.
Length = 88
Score = 31.5 bits (72), Expect = 0.23
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK +K K+E K KK K++ ++K K K ++KK
Sbjct: 48 KKAQKIKREMKEAKKRKEENRRKHSKPGSVPPKPERKK 85
Score = 29.9 bits (68), Expect = 0.67
Identities = 13/38 (34%), Positives = 22/38 (57%), Gaps = 1/38 (2%)
Query: 24 KKKEKKKKEKKAKK-KEKKKEKKKKEKKKKKKKKKKKK 60
K ++ K++ K+AKK KE+ + K K K ++KK
Sbjct: 49 KAQKIKREMKEAKKRKEENRRKHSKPGSVPPKPERKKH 86
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 34.0 bits (77), Expect = 0.24
Identities = 13/41 (31%), Positives = 26/41 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K +++++ +KE++ KK + + K+KK+ KK KK K
Sbjct: 439 RKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGK 479
Score = 32.4 bits (73), Expect = 0.70
Identities = 12/43 (27%), Positives = 20/43 (46%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
K K +++E +KE++ K+ K K KK + N N
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKG 478
Score = 32.1 bits (72), Expect = 0.90
Identities = 10/43 (23%), Positives = 19/43 (44%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K K ++ ++ +K+ + K+ K K KK+ K N
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKG 478
Score = 29.8 bits (66), Expect = 4.5
Identities = 16/66 (24%), Positives = 31/66 (46%), Gaps = 6/66 (9%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK------KKKKKKNNN 64
TFT P + + + +K +K +++++ +KE++ KK K K KK++ N
Sbjct: 413 TFTSHLPASNESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESIN 472
Query: 65 NNNNNN 70
N
Sbjct: 473 KKNKKG 478
Score = 28.6 bits (63), Expect = 9.3
Identities = 13/52 (25%), Positives = 27/52 (51%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+PT ++KK +K EK+ + K ++ + + + K KK+K +K +
Sbjct: 557 DPTHPEFKKTGGMKKIMDEKRKRLKNNIEQTQDGKPELKIKKRKAEKGDQRQ 608
>gnl|CDD|223553 COG0477, ProP, Permeases of the major facilitator superfamily
[Carbohydrate transport and metabolism / Amino acid
transport and metabolism / Inorganic ion transport and
metabolism / General function prediction only].
Length = 338
Score = 33.5 bits (75), Expect = 0.24
Identities = 32/236 (13%), Positives = 59/236 (25%), Gaps = 9/236 (3%)
Query: 96 MAAPLVLVLTYVAEITQPHLR--GMLSATASMTTIFGTVSQLFLGSFLHWRSAAILNLLF 153
L + ++E + G L L F
Sbjct: 108 GGGLLPVASALLSEWFPEATERGLAVGLVTLGAGALGLALGPLLAGLLLGALLWGWRAAF 167
Query: 154 PILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTPDKVQTELSQITKAIEESELK 213
+ AL L + L+ + A+ L +L + + L
Sbjct: 168 LLAALLGLLLL-----ILVLLLDLLLAAPRL--PLAVGLALIELGKLLLLLLILLLGGLV 220
Query: 214 RLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPY 273
L L A++ + + A G ++
Sbjct: 221 LLLLGLYLSAAGFGLRALLLGLGLALLALFVGALILLVLLPILLGAAGGLSALVGAAGGL 280
Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGW 329
A +LLG+A L G L+ L+ R L L + + A + + +
Sbjct: 281 LALILLGLAGLLGTLVGGSLLVDRLGRSLGLRAILLAGALLLAILLLVLLLALVLT 336
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 31.8 bits (73), Expect = 0.24
Identities = 11/39 (28%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 24 KKKEKKKKEKKAKKKEKKKE-KKKKEKKKKKKKKKKKKK 61
+++ K +EK+ + E+++E K+ + K K +KK++K
Sbjct: 66 AERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRK 104
Score = 27.2 bits (61), Expect = 9.3
Identities = 12/44 (27%), Positives = 28/44 (63%), Gaps = 4/44 (9%)
Query: 21 KKAKKKEKKKK----EKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
++AK +EK+++ E++ K+ + K + K EKK++K + + +
Sbjct: 68 RQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAE 111
>gnl|CDD|220909 pfam10935, DUF2637, Protein of unknown function (DUF2637). This
family of proteins has no known function.
Length = 139
Score = 32.4 bits (74), Expect = 0.25
Identities = 16/124 (12%), Positives = 27/124 (21%), Gaps = 28/124 (22%)
Query: 303 ALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVF 362
A ++ A F + A G + L P V L T + L
Sbjct: 8 AAVAGAAFALSFTALRDLAIAAGGPGSLAWLFPLVIDGLILAAT-AALLALAL------- 59
Query: 363 PNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTLYMYFVMPE 422
A+ + +AA+S+ +
Sbjct: 60 ------AGRSAALYRLLAWLLVA------------ASTVAPVFAAMSLAANV--LHAYLL 99
Query: 423 TEGR 426
R
Sbjct: 100 AAAR 103
>gnl|CDD|216669 pfam01732, DUF31, Putative peptidase (DUF31). This domain has no
known function. It is found in various hypothetical
proteins and putative lipoproteins from mycoplasmas. It
appears to be related to the superfamily of trypsin
peptidases and so may have a peptidase function.
Length = 307
Score = 33.6 bits (77), Expect = 0.25
Identities = 17/93 (18%), Positives = 24/93 (25%), Gaps = 10/93 (10%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK----------KKKKKKKKKK 60
T + KK K K K K + + +
Sbjct: 188 NITNDAINAYAALDNSLKKFKFLKLKSLLKDTKYSDLDTPLLYALGYPLSENDYFSNSLW 247
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
N +N N N N+ N N +N G L
Sbjct: 248 INYDNKLNKNLNSPNTNKFKKDNIEGGGLGYLL 280
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 33.6 bits (78), Expect = 0.26
Identities = 18/105 (17%), Positives = 39/105 (37%), Gaps = 6/105 (5%)
Query: 21 KKAKKKEKKKKEKKAK------KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
KK E K++ K + +E++ E +K EK+ +K++ +K
Sbjct: 55 KKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEK 114
Query: 75 NNNNNNNNNNNNEEKGTFLYKMAAPLVLVLTYVAEITQPHLRGML 119
E+K L ++ + L ++ +T + +L
Sbjct: 115 KEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEIL 159
Score = 33.6 bits (78), Expect = 0.26
Identities = 15/45 (33%), Positives = 29/45 (64%), Gaps = 2/45 (4%)
Query: 21 KKAKK-KEKKKKEKKAKKKEKKKEKKKKEKKKKKK-KKKKKKKNN 63
++AK+ E+ KKE +A KKE E K++ K + + +K+ +++ N
Sbjct: 38 EEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82
Score = 32.1 bits (74), Expect = 0.96
Identities = 15/71 (21%), Positives = 29/71 (40%), Gaps = 3/71 (4%)
Query: 22 KAKKKEKK---KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
+A KKE K+E + E +KE +++ + +K +K+ +K N +
Sbjct: 52 EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE 111
Query: 79 NNNNNNNNEEK 89
E+K
Sbjct: 112 LEKKEKELEQK 122
Score = 29.7 bits (68), Expect = 4.1
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK E K K+ E++ ++ +E KK+ + KK+
Sbjct: 26 KKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEAL 59
Score = 29.0 bits (66), Expect = 7.0
Identities = 12/47 (25%), Positives = 27/47 (57%), Gaps = 4/47 (8%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK----KKKKKKKKN 62
KK + + K+ E++AK+ ++ +K+ + KK+ K++ K +N
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRN 71
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 33.4 bits (77), Expect = 0.27
Identities = 7/48 (14%), Positives = 25/48 (52%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
+KKK++KK+++K+ K+ + +++++ + +++
Sbjct: 170 GLVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGE 217
Score = 31.1 bits (71), Expect = 1.6
Identities = 8/47 (17%), Positives = 26/47 (55%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
+KKK++KK+++K++ +++++ + +++ EE+
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEER 222
Score = 30.7 bits (70), Expect = 1.8
Identities = 6/42 (14%), Positives = 25/42 (59%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
+ +KK+++KK+++K+ K+ +++++ + +++
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGE 217
Score = 28.4 bits (64), Expect = 9.5
Identities = 6/41 (14%), Positives = 23/41 (56%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
++KK++KK+++K+ K +++++ + +++
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGE 217
>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103). This
family consists of several repeats of around 30
residues in length which are found specifically in
mature-parasite-infected erythrocyte surface antigen
proteins from Plasmodium falciparum. This family often
found in conjunction with pfam00226.
Length = 215
Score = 32.9 bits (74), Expect = 0.27
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
D + K+K KK+K ++ +KE K+K +K+ K+K KKK K N+ N+
Sbjct: 7 DKEDVKEKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGND 61
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 32.9 bits (75), Expect = 0.28
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKK 45
+P +K KK + KK + K KK E KK+ +
Sbjct: 83 KPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 32.9 bits (75), Expect = 0.30
Identities = 15/39 (38%), Positives = 23/39 (58%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
++K A +K+KK + KK + K KK + KKK + NN
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119
Score = 31.3 bits (71), Expect = 0.76
Identities = 14/39 (35%), Positives = 22/39 (56%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
T + W++ +K+KK K K+ K + KK E KKK +
Sbjct: 76 TGISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 31.0 bits (70), Expect = 1.1
Identities = 15/30 (50%), Positives = 18/30 (60%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
EKKK+ K K+ K K KK + KKK NN
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119
Score = 30.6 bits (69), Expect = 1.7
Identities = 16/39 (41%), Positives = 21/39 (53%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
++K ++KKK K KK K K KK + KKK NN
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119
Score = 29.4 bits (66), Expect = 4.0
Identities = 14/30 (46%), Positives = 16/30 (53%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
EKKKK K KK K K KK ++ NN
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119
Score = 29.0 bits (65), Expect = 4.3
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
++K +KK+K K KK K K KK+ + NN
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119
Score = 29.0 bits (65), Expect = 4.8
Identities = 15/39 (38%), Positives = 20/39 (51%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
+ K A +KK+K K +K K K KK + KK NN
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119
Score = 28.6 bits (64), Expect = 7.0
Identities = 12/30 (40%), Positives = 14/30 (46%)
Query: 48 EKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
EKKKK K KK K K + + NN
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119
>gnl|CDD|182964 PRK11102, PRK11102, bicyclomycin/multidrug efflux system;
Provisional.
Length = 377
Score = 33.3 bits (77), Expect = 0.28
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 7/44 (15%)
Query: 377 SSYIFAFAVNKLYY-PMLDTFH-----LWGTLYFYAAISVVGTL 414
S+YI FA+ +L+Y PM D+F L GTL F A +V L
Sbjct: 33 SAYILGFAIGQLFYGPMADSFGRKPVILGGTLVF-ALAAVACAL 75
>gnl|CDD|238227 cd00386, Heme_Cu_Oxidase_III_like, Heme-copper oxidase subunit III.
Heme-copper oxidases are transmembrane protein
complexes in the respiratory chains of prokaryotes and
mitochondria which couple the reduction of molecular
oxygen to water to, proton pumping across the membrane.
The heme-copper oxidase superfamily is diverse in terms
of electron donors, subunit composition, and heme types.
This superfamily includes cytochrome c and ubiquinol
oxidases. Bacterial oxidases typically contain 3 or 4
subunits in contrast to the 13 subunit bovine cytochrome
c oxidase (CcO). Subunits I, II, and III of mammalian
CcO are encoded within the mitochondrial genome and the
remaining 10 subunits are encoded within the nuclear
genome. Subunits I, II and III of ubiquinol oxidase are
homologous to the corresponding subunits in CcO. This
group additionally contains proteins which are fusions
between subunits I and III, such as Sulfolobus
acidocaldarius SoxM, a subunit of the SoxM terminal
oxidase complex. It also includes NorE which has been
speculated to be a subunit of nitric oxide reductase.
Some archaebacterial cytochrome oxidases lack subunit
III. Although not required for catalytic activity,
subunit III is believed to play a role in assembly of
the multimer complex. Rhodobacter CcO subunit III
stabilizes the integrity of the binuclear center in
subunit I. It has been proposed that archaea acquired
heme-copper oxidases through gene transfer from
gram-positive bacteria.
Length = 183
Score = 32.6 bits (75), Expect = 0.28
Identities = 25/123 (20%), Positives = 43/123 (34%), Gaps = 10/123 (8%)
Query: 243 LFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCV---VLIHYTGK 299
+ F F + F + PLD + L G+ + L G
Sbjct: 22 MLFGSFFWAYFHSRLSPPVEFGAGLDPLDLPLLNTNTLL--LSGSSVTWAHASLAARRGN 79
Query: 300 RPLA---LISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAF-LTHICIRLLPW 355
R A L+ T F+ + Y HL + + + F +L F H+ I L+ +
Sbjct: 80 RKKARLWLLLTILLGLAFLGLQAYEYSHLIFTISDSVFGSTFFLLTGFHGLHVIIGLI-F 138
Query: 356 MLI 358
+L+
Sbjct: 139 LLV 141
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 32.8 bits (75), Expect = 0.29
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K+E+ + ++ K K +K+KKK+ K+KKK
Sbjct: 177 KEERAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212
Score = 30.5 bits (69), Expect = 1.7
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
K+ + + ++AK K KK+K+K+ K+KKK
Sbjct: 177 KEERAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212
>gnl|CDD|218754 pfam05795, Plasmodium_Vir, Plasmodium vivax Vir protein. This
family consists of several Vir proteins specific to
Plasmodium vivax. The vir genes are present at about
600-1,000 copies per haploid genome and encode proteins
that are immunovariant in natural infections, indicating
that they may have a functional role in establishing
chronic infection through antigenic variation.
Length = 348
Score = 33.3 bits (76), Expect = 0.30
Identities = 11/37 (29%), Positives = 16/37 (43%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
+KK +KKK NNNN + N + N+
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSSR 340
Score = 32.9 bits (75), Expect = 0.37
Identities = 11/37 (29%), Positives = 17/37 (45%)
Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
+KK +KKK NNNN + N + N++
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSSR 340
Score = 31.8 bits (72), Expect = 1.0
Identities = 16/82 (19%), Positives = 22/82 (26%), Gaps = 20/82 (24%)
Query: 28 KKKKEKKAKKKEKKK----------EKKKKEKKKKKKKKKK----------KKKNNNNNN 67
K K K K + E KK K K K K N N+
Sbjct: 48 KYISNIKNKNDRKDRCSYLNYWLYDEIKKILKSKSKNINIIPFFNKLFDIWNKINKKNSK 107
Query: 68 NNNNNNNNNNNNNNNNNNNEEK 89
N + N + + + K
Sbjct: 108 KKNKCKPDFNYIDILDEWKKRK 129
Score = 31.4 bits (71), Expect = 1.2
Identities = 10/36 (27%), Positives = 16/36 (44%)
Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
+KK +KKK NNN + N + N++
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSS 339
Score = 29.9 bits (67), Expect = 3.5
Identities = 9/36 (25%), Positives = 15/36 (41%)
Query: 49 KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
+KK +KKK NN + N + N++
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSS 339
Score = 29.9 bits (67), Expect = 4.0
Identities = 9/36 (25%), Positives = 13/36 (36%)
Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
+KK +KK NNNN + N +
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSS 339
Score = 29.5 bits (66), Expect = 4.6
Identities = 9/38 (23%), Positives = 14/38 (36%)
Query: 49 KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
KK +KKK N + N + N++ N
Sbjct: 305 KKLGRKKKIMNNNNEEEYELELYAYESENPSLNSSRYN 342
Score = 29.5 bits (66), Expect = 5.1
Identities = 8/38 (21%), Positives = 13/38 (34%)
Query: 55 KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
+KK +K NNNN + N + +
Sbjct: 304 RKKLGRKKKIMNNNNEEEYELELYAYESENPSLNSSRY 341
Score = 29.1 bits (65), Expect = 5.9
Identities = 22/73 (30%), Positives = 32/73 (43%), Gaps = 7/73 (9%)
Query: 19 DWKKAK-------KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
+WKK K +K KK+ K+K K K K + K KK K+K ++N+NN
Sbjct: 124 EWKKRKDLYDYFENYDKIKKKIKSKNKNCNKYCKYLKYIKSLYKKYKEKCCSSNDNNYCP 183
Query: 72 NNNNNNNNNNNNN 84
+ N N
Sbjct: 184 EYFKCDKKYNPKN 196
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 33.7 bits (77), Expect = 0.31
Identities = 22/76 (28%), Positives = 32/76 (42%), Gaps = 4/76 (5%)
Query: 20 WKKAKKKEKKKKEKKAK--KKEKKKEKK--KKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
K+ +KK K K+ K K K K KK + K KK K NN N +
Sbjct: 459 SKELRKKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKANLQDVGEC 518
Query: 76 NNNNNNNNNNNEEKGT 91
++ NN N+++ T
Sbjct: 519 SSPPNNKEKNDKQTST 534
Score = 32.2 bits (73), Expect = 0.76
Identities = 17/53 (32%), Positives = 21/53 (39%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
L K K K+ K K K + KK+ + K KK K K NN N
Sbjct: 459 SKELRKKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKAN 511
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 33.4 bits (76), Expect = 0.32
Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 5/81 (6%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN-----N 71
P D + E KK E K + + K + ++ + K K+ N +N+ + +
Sbjct: 413 PEDDRDNNFNEPKKPENKGDGQNEPVIPKPLDNERDQSNKNKQVNPGNRHNSEDRYTRPH 472
Query: 72 NNNNNNNNNNNNNNNEEKGTF 92
NN N N NN N++ K
Sbjct: 473 GRNNENRNYNNKNSDIPKHPE 493
Score = 31.9 bits (72), Expect = 1.1
Identities = 10/54 (18%), Positives = 24/54 (44%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
++ + K K+ + + + + + NN N N NN N++ + + E+
Sbjct: 446 ERDQSNKNKQVNPGNRHNSEDRYTRPHGRNNENRNYNNKNSDIPKHPERSEHEQ 499
Score = 30.7 bits (69), Expect = 2.0
Identities = 15/76 (19%), Positives = 28/76 (36%), Gaps = 11/76 (14%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
P PLD E+ K K+ + + + + + N N N NN N++
Sbjct: 440 PKPLD-----------NERDQSNKNKQVNPGNRHNSEDRYTRPHGRNNENRNYNNKNSDI 488
Query: 75 NNNNNNNNNNNNEEKG 90
+ + + E+K
Sbjct: 489 PKHPERSEHEQPEDKK 504
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 30.8 bits (70), Expect = 0.33
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ ++ KK+KKK K + KK +K+ K KK
Sbjct: 21 PESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 27.8 bits (62), Expect = 4.0
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKK 61
K++KK+ K + KK +K+ K KK
Sbjct: 29 KKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 27.4 bits (61), Expect = 5.3
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
P P +++ KK+KKK + + KK +K+ K K
Sbjct: 19 PPPESYEEELKKQKKKNKLRFLKKLEKRPKDVK 51
Score = 26.6 bits (59), Expect = 9.8
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
T P ++ KK++KK K + KK EK+ + KK
Sbjct: 18 TPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells.
Vacuoles/lysosomes play an important role in the
degradation of both lipids and cellular proteins. In
order to perform this degradative function,
vacuoles/lysosomes contain numerous hydrolases which
have been transported in the form of inactive
precursors via the biosynthetic pathway and are
proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or
Golgi complex, or remain in the limiting membrane of
the MVB and are thereby transported to the limiting
membrane of the vacuole/lysosome as a consequence of
fusion. Therefore, the MVB sorting pathway plays a
critical role in the decision between recycling and
degradation of membrane proteins. A few archaeal
sequences are also present within this family.
Length = 169
Score = 32.2 bits (74), Expect = 0.34
Identities = 13/36 (36%), Positives = 20/36 (55%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+K ++ +KK + EKK +K + E KK KK K
Sbjct: 7 RKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDA 42
Score = 28.4 bits (64), Expect = 6.1
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 29 KKKEKKAKKKEKKKEKK-KKEKKKKKKKKKKKKK 61
+K ++ KK++ EKK KK + + KK KK K
Sbjct: 7 RKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNK 40
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 32.4 bits (74), Expect = 0.34
Identities = 15/65 (23%), Positives = 29/65 (44%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K+ +K +K+ K K+AK ++ K + ++ K + + KK+K N N + N
Sbjct: 58 KELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAFNKKQQEYEKDLNRRE 117
Query: 81 NNNNN 85
Sbjct: 118 AEEEQ 122
Score = 28.1 bits (63), Expect = 8.1
Identities = 8/45 (17%), Positives = 23/45 (51%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KK + + +K+ K+ +K +++ K ++ K + K + ++
Sbjct: 41 QAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSD 85
>gnl|CDD|233421 TIGR01453, grpIintron_endo, group I intron endonuclease. This
model represents one subfamily of endonucleases
containing the endo/excinuclease amino terminal domain,
pfam01541 at its amino end. A distinct subfamily
includes excinuclease abc subunit c (uvrC). Members of
pfam01541 are often termed GIY-YIG endonucleases after
conserved motifs near the amino end. This subfamily in
This model is found in open reading frames of group I
introns in both phage and mitochondria. The closely
related endonucleases of phage T4: segA, segB, segC,
segD and segE, score below the trusted cutoff for the
family.
Length = 214
Score = 32.7 bits (75), Expect = 0.34
Identities = 16/49 (32%), Positives = 21/49 (42%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
E K K K K E KK ++ KKK +K+K N + NN
Sbjct: 127 ETKAKISKNKLGENNPFFGKKHSEETKKKISEKEKGAKKVNVYDKNNGL 175
Score = 29.3 bits (66), Expect = 3.7
Identities = 14/49 (28%), Positives = 22/49 (44%), Gaps = 1/49 (2%)
Query: 22 KAKKKE-KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
+ K K K K + KK ++ K+K +K+K KK + NN
Sbjct: 127 ETKAKISKNKLGENNPFFGKKHSEETKKKISEKEKGAKKVNVYDKNNGL 175
>gnl|CDD|217318 pfam03011, PFEMP, PFEMP DBL domain. PfEMP1 (Plasmodium falciparum
erythrocyte membrane protein) has been identified as the
rosetting ligand of the malaria parasite P. falciparum.
Rosetting is the adhesion of infected erythrocytes with
uninfected erythrocytes in the vasculature of the
infected organ, and is associated with severe malaria.
PfEMP1 interacts with Complement Receptor One on
uninfected erythrocytes to form rosettes.
Length = 150
Score = 31.9 bits (73), Expect = 0.35
Identities = 24/117 (20%), Positives = 37/117 (31%), Gaps = 39/117 (33%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKK---------EKKKKEKKKKKKKKKKKKKNNNNNNNN 69
W+K K + K K KK EKKKKE +K K+ K+ K ++ +
Sbjct: 15 KWRKKLKHCINNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKEHFNKQYKIDDGRDEY 74
Query: 70 NNNNN------------------------------NNNNNNNNNNNNEEKGTFLYKM 96
N + N NN+ N+ N E + +
Sbjct: 75 NVLESFLEDLLFQIDIKKAYGDAKELKKIKELLKCNGCNNSANSAKNGENKDAIDCL 131
Score = 29.6 bits (67), Expect = 2.1
Identities = 20/81 (24%), Positives = 32/81 (39%), Gaps = 3/81 (3%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK--KKKKKKKKNNNNNNNNNNNNNNN 76
D K +KK K K K K KKK + +K +KKKK+ + + N ++
Sbjct: 12 DSIKWRKKLKHCI-NNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKEHFNKQYKIDDG 70
Query: 77 NNNNNNNNNNEEKGTFLYKMA 97
+ N + E F +
Sbjct: 71 RDEYNVLESFLEDLLFQIDIK 91
Score = 28.1 bits (63), Expect = 8.7
Identities = 14/52 (26%), Positives = 19/52 (36%), Gaps = 7/52 (13%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKK-------KKEKKKKKKKKKKKKKNN 63
+ KK K+ K + K E K +KK KK KKK +
Sbjct: 99 ELKKIKELLKCNGCNNSANSAKNGENKDAIDCLLDHLQKKAKKCKKKHSEEE 150
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 31.5 bits (72), Expect = 0.36
Identities = 15/34 (44%), Positives = 19/34 (55%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKKK + +KE K +KK K+K K KKK
Sbjct: 99 IKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 29.6 bits (67), Expect = 1.8
Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 2/41 (4%)
Query: 23 AKKKEKKKKEKKAK--KKEKKKEKKKKEKKKKKKKKKKKKK 61
KK E K + + KK+K K +KE K ++KK + K+K
Sbjct: 84 LKKFEPKYRLIRLGLIKKKKAGRKARKELKNRRKKVRGKEK 124
Score = 28.1 bits (63), Expect = 5.7
Identities = 14/33 (42%), Positives = 18/33 (54%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
KKK +K K+ K + KK K+K K KKK
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 28.1 bits (63), Expect = 6.5
Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 1/33 (3%)
Query: 21 KKAKKKEKKKKEKKAK-KKEKKKEKKKKEKKKK 52
KK K K +KE K + KK + KEK K KKK
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 27.7 bits (62), Expect = 9.6
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
KKK+ +K +K K +KK + K++ K KKK
Sbjct: 99 IKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132
>gnl|CDD|221826 pfam12878, SICA_beta, SICA extracellular beta domain. The SICA
(schizont-infected cell agglutination) proteins of P.
knowlesi, one of the variant antigen gene families, are
associated with parasitic virulence. These proteins are
comprised of multiple domains, with the extracellular
domains occurring at different frequencies. There can
be between 1 and 10 copies of this cysteine-rich
domain.
Length = 169
Score = 32.1 bits (74), Expect = 0.38
Identities = 14/54 (25%), Positives = 21/54 (38%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
A KK K K + + ++ K E KK K+ K N N+ + N
Sbjct: 9 AAKKWFKNKNGQGGGTDDFWDEVKGELKKLWKELSTAMKQNGVAGYCNDFGDGN 62
Score = 31.4 bits (72), Expect = 0.64
Identities = 14/59 (23%), Positives = 21/59 (35%), Gaps = 4/59 (6%)
Query: 35 AKKKEKKKEKKKKEKKK--KKKKKKKKK--KNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
AKK K K + + K + KK K + N N+ + N + EK
Sbjct: 10 AKKWFKNKNGQGGGTDDFWDEVKGELKKLWKELSTAMKQNGVAGYCNDFGDGNATSAEK 68
Score = 31.4 bits (72), Expect = 0.77
Identities = 14/41 (34%), Positives = 20/41 (48%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
KK K+K K K + KK K + NN ++ N NN+
Sbjct: 112 KKMKEKAKGKCSIDEGIKKAFDKWNDIMNNTSSCNGGGNNS 152
Score = 30.6 bits (70), Expect = 1.1
Identities = 15/45 (33%), Positives = 20/45 (44%), Gaps = 7/45 (15%)
Query: 42 KEKKKKEKKKKKKK-------KKKKKKNNNNNNNNNNNNNNNNNN 79
KK K+K K K KK K N+ NN ++ N NN+
Sbjct: 108 NAYAKKMKEKAKGKCSIDEGIKKAFDKWNDIMNNTSSCNGGGNNS 152
Score = 30.6 bits (70), Expect = 1.1
Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 1/42 (2%)
Query: 35 AKK-KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
AKK KEK K K ++ KK K NN ++ N NN+
Sbjct: 111 AKKMKEKAKGKCSIDEGIKKAFDKWNDIMNNTSSCNGGGNNS 152
Score = 28.3 bits (64), Expect = 7.1
Identities = 11/53 (20%), Positives = 17/53 (32%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK K + + E K + KK K+ N N+ + N
Sbjct: 10 AKKWFKNKNGQGGGTDDFWDEVKGELKKLWKELSTAMKQNGVAGYCNDFGDGN 62
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 33.3 bits (77), Expect = 0.39
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K K + + K+ K ++K+KKKKK+K+K++
Sbjct: 598 KGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636
Score = 33.3 bits (77), Expect = 0.41
Identities = 11/40 (27%), Positives = 27/40 (67%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++E ++K ++A+ K+ EK K+E ++KK+K ++++
Sbjct: 527 ELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDK 566
Score = 32.5 bits (75), Expect = 0.68
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K E +K K +KKEKKKKK+K+K+++
Sbjct: 598 KGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636
Score = 31.3 bits (72), Expect = 1.4
Identities = 14/41 (34%), Positives = 26/41 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K + +A+K+ K +KK++KKKK+K+K+++ K
Sbjct: 598 KGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
Score = 31.0 bits (71), Expect = 1.9
Identities = 11/41 (26%), Positives = 28/41 (68%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+KA++ E KE + K+E +++K+K ++++ K ++ +K+
Sbjct: 534 QKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKE 574
Score = 30.2 bits (69), Expect = 3.5
Identities = 12/44 (27%), Positives = 26/44 (59%), Gaps = 3/44 (6%)
Query: 19 DWKKAKKKEK---KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
++A+ K K KE+ +KKEK +E++ K ++ +K+ ++
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQA 578
Score = 29.8 bits (68), Expect = 4.9
Identities = 9/35 (25%), Positives = 21/35 (60%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
E++ ++K ++ E KE +K K++ ++KK+
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKKEKLQE 562
Score = 29.0 bits (66), Expect = 8.2
Identities = 12/41 (29%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+A+K +++ +EKK K +E++ + ++ +K+ ++ K+ KK
Sbjct: 544 KEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKK 584
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is
a small protein module with a triple-stranded
beta-sheet fold. This is a family of WW domain binding
proteins.
Length = 78
Score = 30.3 bits (69), Expect = 0.41
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
A +KE+KKKE K K E++ ++ K KK
Sbjct: 9 AYRKEQKKKELKKNKAERQARREAKLAKK 37
Score = 29.5 bits (67), Expect = 0.75
Identities = 12/29 (41%), Positives = 21/29 (72%), Gaps = 1/29 (3%)
Query: 35 AKKKEKKKE-KKKKEKKKKKKKKKKKKKN 62
+K++KKKE KK K +++ +++ K KKN
Sbjct: 10 YRKEQKKKELKKNKAERQARREAKLAKKN 38
Score = 29.5 bits (67), Expect = 0.90
Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 2/28 (7%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K+ KKKE K K K E++ +++ K KK
Sbjct: 12 KEQKKKELK--KNKAERQARREAKLAKK 37
Score = 28.4 bits (64), Expect = 1.9
Identities = 13/53 (24%), Positives = 27/53 (50%), Gaps = 2/53 (3%)
Query: 21 KKAKKKE--KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
K+ KKKE K K E++A+++ K +K +++ K K ++++
Sbjct: 12 KEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDKLKAQEESGKLRPVEKQ 64
Score = 26.5 bits (59), Expect = 8.5
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
+K+++KK KK K + + ++E K KK
Sbjct: 10 YRKEQKKKELKKNKAERQARREAKLAKK 37
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 32.0 bits (73), Expect = 0.42
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
+KK+K + K +K+ K+K +KKK N N
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 30.1 bits (68), Expect = 2.1
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
+K+KK + +K+ K+K +KK N N
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 29.7 bits (67), Expect = 2.5
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
KKKK + + K +K K+K++KKK
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINI 30
Score = 29.7 bits (67), Expect = 2.6
Identities = 7/32 (21%), Positives = 13/32 (40%)
Query: 45 KKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
+KK+K + K +K+ + N N
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 29.3 bits (66), Expect = 3.8
Identities = 7/33 (21%), Positives = 13/33 (39%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
+KKK+ + K +K+ + N N
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.9 bits (65), Expect = 4.3
Identities = 9/33 (27%), Positives = 15/33 (45%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
+KK+K + K +K+ K+K N N
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.9 bits (65), Expect = 5.2
Identities = 9/32 (28%), Positives = 16/32 (50%)
Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
+K+KK + K +K+ K+K+ N N
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.6 bits (64), Expect = 5.5
Identities = 7/32 (21%), Positives = 12/32 (37%)
Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+K+KK + K +K + N N
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.6 bits (64), Expect = 6.9
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KK K + + +K K+K +KKK N
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.2 bits (63), Expect = 7.5
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++KK + K +K+ K+K +KKK N
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.2 bits (63), Expect = 7.9
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+K KK K +++ K+K +KKK N
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.2 bits (63), Expect = 8.8
Identities = 8/32 (25%), Positives = 14/32 (43%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
++KKK K +K+ K+ + N N
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 33.1 bits (75), Expect = 0.43
Identities = 11/69 (15%), Positives = 35/69 (50%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
+K +++K A +E+ +++++ KKK+ +K K + N++ ++ + + +
Sbjct: 408 LRKNARQRKAGLASPEEEALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDED 467
Query: 83 NNNNEEKGT 91
+ ++ G
Sbjct: 468 RDLADDGGL 476
>gnl|CDD|216716 pfam01810, LysE, LysE type translocator. This family consists of
various hypothetical proteins and an l-lysine exporter
LysE from Corynebacterium glutamicum which is proposed
to be the first of a novel family of translocators. LysE
exports l-lysine from the cell into the surrounding
medium and is predicted to span the membrane six times.
The physiological function of the exporter is to excrete
excess l-Lysine as a result of natural flux imbalances
or peptide hydrolysis; and also after artificial
deregulation of l-Lysine biosynthesis as used by the
biotechnology. industry for the production of l-lysine.
Length = 191
Score = 32.3 bits (74), Expect = 0.44
Identities = 16/120 (13%), Positives = 28/120 (23%), Gaps = 9/120 (7%)
Query: 201 SQITKAIEESELKRLGKDGQRRPNYRMYMRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAV 260
+ A + + ++ + + P I+ L F
Sbjct: 76 LTLRAAWSPKNESLSSSEASLKKSFLRGLLVSLSNPKVILFWLSVGSAFLDKQQYGDAGR 135
Query: 261 GIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVY 320
F + + +LL A L L I + A I VY
Sbjct: 136 IAFAAGLVVASLIWFSLL--------AFLASRLARRL-STKKVRIINVIAGALLIGFGVY 186
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 31.9 bits (73), Expect = 0.44
Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 5/44 (11%)
Query: 17 PLDWKKA-----KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
P + K A K + KK+++KA+K K+ + +K ++ K K
Sbjct: 24 PKNLKAAHDRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSK 67
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 31.0 bits (71), Expect = 0.45
Identities = 13/40 (32%), Positives = 22/40 (55%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+A++ E K+ KAKKK++ E E K+ K + K +
Sbjct: 49 QAERNELSKEIGKAKKKKEDAEALIAEVKELKDELKALEA 88
Score = 28.3 bits (64), Expect = 3.8
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KAKKK++ + A+ KE K E K E + ++ + +
Sbjct: 60 GKAKKKKEDAEALIAEVKELKDELKALEAELRELEAE 96
Score = 27.9 bits (63), Expect = 5.4
Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 20 WKKAKKK-EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++ + + E+ + E+ KE K KKKKE + + K+ K+
Sbjct: 38 RRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKD 81
Score = 27.2 bits (61), Expect = 8.5
Identities = 9/40 (22%), Positives = 23/40 (57%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K+ K +KKK++ +A E K+ K + + + + ++ + +
Sbjct: 57 KEIGKAKKKKEDAEALIAEVKELKDELKALEAELRELEAE 96
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 32.1 bits (73), Expect = 0.46
Identities = 13/41 (31%), Positives = 23/41 (56%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
KE+ EK ++E E KKE K+ K++ ++K+N +
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVED 43
Score = 30.2 bits (68), Expect = 2.1
Identities = 9/38 (23%), Positives = 23/38 (60%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +++ +A KKE K+ K+++++K+ + +K+
Sbjct: 10 KDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKE 47
Score = 29.8 bits (67), Expect = 2.9
Identities = 13/50 (26%), Positives = 21/50 (42%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
++ E KKE K KE+ +EK+ + +K+ K KN
Sbjct: 6 FPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKN 55
Score = 29.8 bits (67), Expect = 3.0
Identities = 10/41 (24%), Positives = 21/41 (51%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
KE+ +K+ +E+ E KK+ K+ K+++ N +
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVED 43
Score = 28.3 bits (63), Expect = 7.8
Identities = 9/41 (21%), Positives = 20/41 (48%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
K+E ++K ++E + KK+ K+ K++ N +
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVED 43
>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1. This family consists
of several bacterial histone H1-like Hc1 proteins. In
Chlamydia, Hc1 is expressed in the late stages of the
life cycle, concomitant with the reorganisation of
chlamydial reticulate bodies into elementary bodies.
This suggests that Hc1 protein plays a role in the
condensation of chromatin during intracellular
differentiation.
Length = 123
Score = 31.3 bits (70), Expect = 0.47
Identities = 18/40 (45%), Positives = 18/40 (45%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA KK K KKA KK K K K KK KK K
Sbjct: 74 KAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTAAKKVK 113
Score = 29.7 bits (66), Expect = 1.4
Identities = 13/40 (32%), Positives = 16/40 (40%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKA K K K+ +K K+ K K K KK
Sbjct: 67 KKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKK 106
Score = 29.7 bits (66), Expect = 1.5
Identities = 15/40 (37%), Positives = 18/40 (45%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +KE K EK K+K +K K K KK K K
Sbjct: 46 KLYRKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAK 85
Score = 29.3 bits (65), Expect = 1.9
Identities = 13/40 (32%), Positives = 16/40 (40%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA+K KK+ K + K KK K KK K
Sbjct: 54 KAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAK 93
Score = 28.2 bits (62), Expect = 5.6
Identities = 17/41 (41%), Positives = 19/41 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K AK KK AKK + K K +K KK KK KK
Sbjct: 78 KAAKAPAKKAAAAPAKKAKAVKAKPATKKTAAKKVKKMSKK 118
Score = 27.4 bits (60), Expect = 8.9
Identities = 13/41 (31%), Positives = 17/41 (41%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K++ K EK KK K+ K K +K K KK
Sbjct: 50 KESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAA 90
>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986). This
family of proteins has no known function.
Length = 44
Score = 29.2 bits (66), Expect = 0.47
Identities = 12/26 (46%), Positives = 14/26 (53%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEK 49
KKK +K +KKAKK K K K
Sbjct: 4 KKKINQKLKKKAKKANAKLHPSNKPK 29
Score = 26.5 bits (59), Expect = 4.9
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKK 57
++KK +K KKK KK K K K
Sbjct: 3 RKKKINQKLKKKAKKANAKLHPSNKPK 29
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 33.0 bits (75), Expect = 0.48
Identities = 13/43 (30%), Positives = 20/43 (46%)
Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
K+ KKKK KK K +K+ K+K + N N + +
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 31.9 bits (72), Expect = 1.0
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
K+ KKK+ KK + +K+ K+K KK K+ N N + +
Sbjct: 68 GKRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 31.9 bits (72), Expect = 1.0
Identities = 11/43 (25%), Positives = 19/43 (44%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
++ K+KK KK K +K+ K N + N N + +
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 31.9 bits (72), Expect = 1.0
Identities = 11/43 (25%), Positives = 21/43 (48%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
++ +KKK +K K +K+ K+K K + N N + +
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 31.1 bits (70), Expect = 1.6
Identities = 13/43 (30%), Positives = 21/43 (48%)
Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
K+ KKK+ KK K +K+ K+KN + N N + +
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 31.1 bits (70), Expect = 1.9
Identities = 12/43 (27%), Positives = 19/43 (44%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
K KK+K KK K +K+ K+ N + N N + +
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 30.7 bits (69), Expect = 2.3
Identities = 11/43 (25%), Positives = 18/43 (41%)
Query: 44 KKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
K+ +KKK KK K +K+ N + N N + +
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 30.7 bits (69), Expect = 2.5
Identities = 11/47 (23%), Positives = 15/47 (31%)
Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKM 96
K+ KKKK KK N + N N ++ L
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPSLLSY 115
Score = 30.3 bits (68), Expect = 3.1
Identities = 13/43 (30%), Positives = 21/43 (48%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K K+KK K+ K +K+ K+K KK + N N + +
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 29.9 bits (67), Expect = 4.0
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
K+ KK+K KK K +K+ K+K KK + N N + +
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPS 111
Score = 29.6 bits (66), Expect = 5.6
Identities = 13/48 (27%), Positives = 19/48 (39%)
Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
K+ KKKK KK K +K N + N N + ++L
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTAFNEENFETDRAPSLLSYL 116
>gnl|CDD|220897 pfam10883, DUF2681, Protein of unknown function (DUF2681). This
family of proteins is found in bacteria. Proteins in
this family are typically between 81 and 117 amino
acids in length.
Length = 87
Score = 30.4 bits (69), Expect = 0.49
Identities = 10/45 (22%), Positives = 22/45 (48%), Gaps = 2/45 (4%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKK--KKKKNNNNNNNNNN 71
KK +++ +K + + E+ EK + + K K ++ N N +
Sbjct: 26 KKAQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRLS 70
Score = 29.3 bits (66), Expect = 1.3
Identities = 11/49 (22%), Positives = 26/49 (53%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
+ K KK +++ ++ +A+ ++ EK E + K K ++K + N +
Sbjct: 22 YWKLKKAQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRLS 70
Score = 28.9 bits (65), Expect = 1.6
Identities = 10/47 (21%), Positives = 22/47 (46%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K KKA+++ +K + + ++ +K + + KN N N +
Sbjct: 24 KLKKAQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRLS 70
>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031). This
protein is expressed in Plasmodium; its function is
unknown. It may be the product of gene family pyst-b.
Length = 228
Score = 32.3 bits (74), Expect = 0.49
Identities = 23/96 (23%), Positives = 35/96 (36%), Gaps = 16/96 (16%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEK-KKEKKKKE------KKKKKKKKKKKKKNN----- 63
T D KK KK + K+ E+ KKE K + + K+ KK +N+
Sbjct: 99 TLPDLNNVDKKTKKLINELRKELEEVKKELDNKRNGELAIQPIQDKRIIKKDENSSVSEH 158
Query: 64 -NNN---NNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
+ N N N ++NN + K K
Sbjct: 159 EDFKQLENEENFLETEYNEITSSNNYKKLKINRKLK 194
Score = 29.6 bits (67), Expect = 3.2
Identities = 18/84 (21%), Positives = 31/84 (36%), Gaps = 14/84 (16%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK---------KKKKNNNN-----NNNN 69
K+ E+ KKE K+ + + ++K+ KK + K+ +N N N
Sbjct: 119 KELEEVKKELDNKRNGELAIQPIQDKRIIKKDENSSVSEHEDFKQLENEENFLETEYNEI 178
Query: 70 NNNNNNNNNNNNNNNNNEEKGTFL 93
++NN N EK
Sbjct: 179 TSSNNYKKLKINRKLKKAEKKLIK 202
>gnl|CDD|218514 pfam05231, MASE1, MASE1. Predicted integral membrane sensory
domain found in histidine kinases, diguanylate cyclases
and other bacterial signaling proteins. This entry also
includes members of the 8 transmembrane UhpB type
(8TMR-UT) domain family.
Length = 298
Score = 32.4 bits (74), Expect = 0.49
Identities = 23/124 (18%), Positives = 38/124 (30%), Gaps = 11/124 (8%)
Query: 303 ALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVF 362
L+ A V L + + L LAA L R P +L+G+V
Sbjct: 1 LLLLLLLLYALAAAVLWSLALALVSSGIAAIWLPTGLRLAALLLF-GRRYWPGILLGDVL 59
Query: 363 ----------PNNIRATASGASGSSSYIFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVG 412
+ A + + A A+ + P L L F A +++
Sbjct: 60 ASLFLGLLSGLGLLLALLILLVNALEALLAVALLRRLLPGRHRLQLLFWLRFLLAAAIIA 119
Query: 413 TLYM 416
L +
Sbjct: 120 ALLL 123
>gnl|CDD|222446 pfam13903, Claudin_2, PMP-22/EMP/MP20/Claudin tight junction.
Members of this family are claudins, that form tight
junctions between cells.
Length = 170
Score = 31.9 bits (73), Expect = 0.50
Identities = 25/121 (20%), Positives = 34/121 (28%), Gaps = 28/121 (23%)
Query: 253 TTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRPLALISTGG--- 309
V F P L L + + L CV+ + K+ AL+ G
Sbjct: 47 CFTSEVDVNNFVLTSLPDLTIVVLLFLTLGLVLSLLACVLGLCSALKKSGALLLLTGVLL 106
Query: 310 ---SAACFIVVAVYAQF--------------------HLSYGWDSPLVP--TVFLVLAAF 344
I V +Y Q S GW L TV ++LA
Sbjct: 107 LLAGILGLIAVVLYVQKVQLEIKNVLLTWMSQWQSNGLASLGWSFYLAVVGTVLVLLAGL 166
Query: 345 L 345
L
Sbjct: 167 L 167
>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14
and HMG 17.
Length = 88
Score = 30.3 bits (68), Expect = 0.50
Identities = 15/44 (34%), Positives = 19/44 (43%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
K E K KK KK+ + K KK K K+ + NN N
Sbjct: 31 KPEPKPKKAAAKKKSENVAAKGKKGAKGKQTEEAGKEENNPAEN 74
Score = 29.5 bits (66), Expect = 1.00
Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 5/53 (9%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
P P K + K KK KK E K KK K K+ ++ K++NN N
Sbjct: 27 PAP-----PKPEPKPKKAAAKKKSENVAAKGKKGAKGKQTEEAGKEENNPAEN 74
>gnl|CDD|221630 pfam12542, CWC25, Pre-mRNA splicing factor. This domain family
is found in eukaryotes, and is approximately 100 amino
acids in length. The family is found in association
with pfam10197. There is a single completely conserved
residue Y that may be functionally important. Cwc25 has
been identified to associate with pre-mRNA splicing
factor Cef1/Ntc85, a component of the Prp19-associated
complex (NTC) involved in spliceosome activation. Cwc25
is neither tightly associated with NTC nor required for
spliceosome activation, but is required for the first
catalytic reaction.
Length = 95
Score = 30.5 bits (69), Expect = 0.50
Identities = 15/74 (20%), Positives = 31/74 (41%), Gaps = 14/74 (18%)
Query: 28 KKKKEK---------KAKKKEKKKE-----KKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
KKK E+ +++E KK+ +K ++K ++ K+ ++ +
Sbjct: 1 KKKLERLEWMYEGPPAESGFAEEREEYLLGKKRVDKLLEEKDAEEAKQKSSGEALIEGAS 60
Query: 74 NNNNNNNNNNNNNE 87
N N+ N+ N E
Sbjct: 61 NANSANDTWNKLRE 74
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 32.3 bits (74), Expect = 0.51
Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 9/49 (18%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK---------KKKKKN 62
AKKK++ K+E+ A KE +++++K +K KK K K +KKN
Sbjct: 233 AKKKKQTKEERIATAKEGREDREKFGSRKGKKDKEGKSTTNKEKARKKN 281
Score = 30.8 bits (70), Expect = 1.8
Identities = 9/43 (20%), Positives = 26/43 (60%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
D + ++K++ K+ K E+ E+ ++E ++++ + +K+K
Sbjct: 131 DSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
Score = 30.4 bits (69), Expect = 2.5
Identities = 12/42 (28%), Positives = 21/42 (50%), Gaps = 4/42 (9%)
Query: 22 KAKKKE----KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
KA+KK KK + K+K ++K+K + ++KK
Sbjct: 276 KARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKGG 317
Score = 29.6 bits (67), Expect = 4.4
Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 6/48 (12%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK------KKKKKKKKN 62
K+ ++ +K +K KK ++ K KEK +KK KK+ + K
Sbjct: 248 KEGREDREKFGSRKGKKDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQ 295
Score = 29.2 bits (66), Expect = 5.7
Identities = 13/59 (22%), Positives = 26/59 (44%), Gaps = 17/59 (28%)
Query: 20 WKKAKK---------KEKKKKE--------KKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K KK KEK +K+ K+ + K+K+ + K++ + ++KK
Sbjct: 259 SRKGKKDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKGG 317
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 33.0 bits (75), Expect = 0.52
Identities = 25/84 (29%), Positives = 39/84 (46%), Gaps = 6/84 (7%)
Query: 140 FLHWRSAAILNLLFPILALCALYFIPESPHWLISQGRMQEASASL-----CWLRGWVTPD 194
F WR I+ + A+ AL F+PESP + + G+ EA L +R P+
Sbjct: 330 FHSWRVFVIVCAFPCVFAIGALTFMPESPRFFLENGKHDEAWMILKLIHDTNMRAKGHPE 389
Query: 195 KVQTELSQITKAIEESELKRLGKD 218
KV + ++ I +E EL + D
Sbjct: 390 KVFS-VNHIKTIHQEDELIEIESD 412
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex
which functions in messenger ribonucleoprotein
metabolism and plays a role in preventing the
transcription-associated genetic instability. Tho2,
along with four other subunits forms THO.
Length = 296
Score = 32.3 bits (74), Expect = 0.52
Identities = 14/52 (26%), Positives = 24/52 (46%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
KE ++ EK+ K+ + K+KK+KK+ K KK + +N
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKT 81
Score = 28.4 bits (64), Expect = 8.9
Identities = 14/49 (28%), Positives = 26/49 (53%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
KK K+K++ K K ++E KK + EK KK+ ++K + + +
Sbjct: 52 KKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKSSWFPSKDPDE 100
>gnl|CDD|221947 pfam13160, DUF3995, Protein of unknown function (DUF3995). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 138 and
149 amino acids in length. There are two completely
conserved residues (W and P) that may be functionally
important.
Length = 125
Score = 31.1 bits (71), Expect = 0.53
Identities = 13/48 (27%), Positives = 17/48 (35%)
Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYA 321
FATLL+ L A L + G P + GS V +
Sbjct: 39 FATLLVAALLLLAAALVLAQAGGVGPFPPPGLVRAGSWVLAAVFLLRG 86
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 31.5 bits (72), Expect = 0.55
Identities = 14/38 (36%), Positives = 29/38 (76%), Gaps = 3/38 (7%)
Query: 20 WKKAKKKE---KKKKEKKAKKKEKKKEKKKKEKKKKKK 54
WKK +K+E +K+ EK+A ++ KK+E+ ++ K++++K
Sbjct: 78 WKKNEKEERELRKRAEKEALEQAKKEEELREAKRQQRK 115
>gnl|CDD|234397 TIGR03920, T7SS_EccD, type VII secretion integral membrane protein
EccD. Members of this family are EccD, a component of
actinobacterial type VII secretion systems (T7SS) with
ten to eleven predicted transmembrane helix regions
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 431
Score = 32.7 bits (75), Expect = 0.56
Identities = 23/116 (19%), Positives = 41/116 (35%), Gaps = 19/116 (16%)
Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPL 333
A ++G+ L A++ L R +A +S G+A + VA + G L
Sbjct: 146 IAAGVIGLLVLAAAVVARTL----RDRAVAALSLLGAALVLVAVAGALAVPGALGVPHVL 201
Query: 334 VP-TVFLVLAAFL--------------THICIRLLPWMLIGEVFPNNIRATASGAS 374
+ TV LA L + + L +G ++ + A+G
Sbjct: 202 LAGTVLAALAVLLLRLTGRGLAVFTAVVTVAVILALAAAVGMLWQTPVPGIAAGLI 257
>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
Length = 290
Score = 32.4 bits (74), Expect = 0.57
Identities = 12/48 (25%), Positives = 19/48 (39%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+E P K K K+ K +K +K + K+ +KKK
Sbjct: 241 SEAKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288
Score = 32.0 bits (73), Expect = 0.81
Identities = 12/43 (27%), Positives = 17/43 (39%), Gaps = 3/43 (6%)
Query: 22 KAKKKEKKKK---EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K K K K ++ K EK EK + K+ +K
Sbjct: 244 KPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRK 286
Score = 29.3 bits (66), Expect = 4.4
Identities = 11/40 (27%), Positives = 17/40 (42%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
AK K K K + K+ K +K EK + K+ +
Sbjct: 243 AKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTS 282
Score = 28.5 bits (64), Expect = 9.9
Identities = 12/51 (23%), Positives = 21/51 (41%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
+SS++ + +P K+ K +K EK + K +KKK
Sbjct: 238 NSSSEAKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288
>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the AAA
superfamily.
Length = 320
Score = 32.4 bits (74), Expect = 0.58
Identities = 20/82 (24%), Positives = 29/82 (35%), Gaps = 9/82 (10%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN-- 78
K K EK + K AKKK+ K + K K ++N +N N +
Sbjct: 126 KILDKIEKFDEIKGAKKKDFFK--SINIYEIDNFIDKTLSKILDDNIVFQDNLEKNIDDK 183
Query: 79 -----NNNNNNNNEEKGTFLYK 95
N +N EK + K
Sbjct: 184 LSNIFNEKLSNLKNEKISINEK 205
Score = 30.5 bits (69), Expect = 2.2
Identities = 16/74 (21%), Positives = 26/74 (35%), Gaps = 2/74 (2%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN--NN 81
K+K+K K + KE K+ KK K + +N + + N NN
Sbjct: 46 KEKKKIYISISDNKADIDKENLNKKSKKIKISIIIELSSNEIKSFSIKFYLNYYLEIINN 105
Query: 82 NNNNNEEKGTFLYK 95
+ + E F
Sbjct: 106 DVEDEIELLLFSKN 119
Score = 29.7 bits (67), Expect = 3.8
Identities = 15/66 (22%), Positives = 26/66 (39%), Gaps = 1/66 (1%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK-KKKKNNNNNNNNNNNNNNNNNNNNNN 83
K + +K K EK E K +KK K + N + + ++N +N
Sbjct: 118 KNLEYILKKILDKIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLE 177
Query: 84 NNNEEK 89
N ++K
Sbjct: 178 KNIDDK 183
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
family. This model family represents the major capsid
protein component of the heads (capsids) of
bacteriophage HK97, phi-105, P27, and related phage.
This model represents one of several analogous families
lacking detectable sequence similarity. The gene
encoding this component is typically located in an
operon encoding the small and large terminase subunits,
the portal protein and the prohead or maturation
protease [Mobile and extrachromosomal element
functions, Prophage functions].
Length = 384
Score = 32.3 bits (74), Expect = 0.59
Identities = 10/87 (11%), Positives = 25/87 (28%), Gaps = 11/87 (12%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN-----------NNNNNNN 71
+ +E + K +K + EK + +++K + K++ + +
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 72 NNNNNNNNNNNNNNNEEKGTFLYKMAA 98
+ E K A
Sbjct: 62 PAASGEGGGGEEEEEEAKAEAAEFRAY 88
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 31.2 bits (71), Expect = 0.61
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
K KK KKK KK + KK K KK+K +
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSV 33
Score = 30.8 bits (70), Expect = 0.91
Identities = 15/31 (48%), Positives = 18/31 (58%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K KKAKKK KK K K+ K KK+K +
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31
Score = 30.4 bits (69), Expect = 1.0
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
K KK +KKAKK K +K K KK+K +
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31
Score = 29.3 bits (66), Expect = 2.5
Identities = 13/33 (39%), Positives = 19/33 (57%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+ AKK +KK +K K KK K KK+K + +
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSV 33
Score = 28.9 bits (65), Expect = 4.2
Identities = 18/48 (37%), Positives = 23/48 (47%), Gaps = 7/48 (14%)
Query: 21 KKAKKKEKKKKEKKAK--KKEKKKEKKKKE-----KKKKKKKKKKKKK 61
K KK KKK +K AK KK K +K+K + K + KK K
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRPKTLRLPKKPK 48
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 32.0 bits (73), Expect = 0.62
Identities = 10/27 (37%), Positives = 20/27 (74%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKK 53
EK KKE+ +K+ +++EK +E+K ++
Sbjct: 152 EKIKKERAEEKEREEEEKAAEEEKARE 178
Score = 30.1 bits (68), Expect = 2.5
Identities = 8/26 (30%), Positives = 21/26 (80%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKN 62
KKE+ +EK+++E++K +++K +++
Sbjct: 155 KKERAEEKEREEEEKAAEEEKAREEE 180
Score = 29.3 bits (66), Expect = 4.4
Identities = 7/29 (24%), Positives = 21/29 (72%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
K K+++ E+K++E+++K +++K ++
Sbjct: 152 EKIKKERAEEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 32.3 bits (73), Expect = 0.63
Identities = 11/40 (27%), Positives = 22/40 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K A K+ ++ A+ ++++ EK + E KK ++ K K
Sbjct: 262 KPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAK 301
Score = 31.1 bits (70), Expect = 1.6
Identities = 11/41 (26%), Positives = 22/41 (53%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K++E K K A K++K+ E +K++ +K + + N
Sbjct: 253 KQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKN 293
Score = 30.7 bits (69), Expect = 2.1
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
++AK K K+ ++ E +K+E +K + + KK
Sbjct: 255 QEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKN 293
Score = 30.4 bits (68), Expect = 2.6
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K E+ K K K + K+E K EK+ + K+ + +KK
Sbjct: 292 KNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK 329
Score = 30.0 bits (67), Expect = 3.5
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+KA+ + KK E+ K K+ K K+E K +K+ + K
Sbjct: 284 EKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDK 322
Score = 30.0 bits (67), Expect = 4.1
Identities = 14/41 (34%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 21 KKAKKKEKKKKEKKAK--KKEKKKEKKKKEKKKKKKKKKKK 59
KK ++ K K+ KA K+E K +K+ E K+ + +KK++
Sbjct: 291 KKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331
Score = 29.2 bits (65), Expect = 7.0
Identities = 7/43 (16%), Positives = 23/43 (53%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
D ++ + ++K+++ K K K+ ++ + +K++ +K
Sbjct: 243 ADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEK 285
Score = 29.2 bits (65), Expect = 7.3
Identities = 9/38 (23%), Positives = 22/38 (57%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ + K ++KA + +K++ E ++K+++ K K
Sbjct: 225 KQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262
Score = 28.8 bits (64), Expect = 9.6
Identities = 8/37 (21%), Positives = 20/37 (54%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
AK+ ++ K+E K+ + K ++K + + K++
Sbjct: 212 AKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRD 248
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 30.7 bits (68), Expect = 0.64
Identities = 14/52 (26%), Positives = 30/52 (57%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
K + +++KK EK + EKK EK + E +KK +K + + + +++ ++
Sbjct: 60 KKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDKK 111
Score = 30.0 bits (66), Expect = 1.2
Identities = 13/52 (25%), Positives = 31/52 (59%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K++ + EKK +K E + EKK ++ + + +KK +K +N + +++ ++
Sbjct: 60 KKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDKK 111
Score = 28.4 bits (62), Expect = 4.6
Identities = 13/55 (23%), Positives = 29/55 (52%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
E KKE + +KK +K E + ++K +K + + +KK N + +++ ++
Sbjct: 57 EHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDKK 111
Score = 27.3 bits (59), Expect = 9.2
Identities = 16/62 (25%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEK--KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
D K + KK+ + KK EK +++KK EK + + +KK +K N + +++ ++
Sbjct: 50 DEMKTTHEHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASD 109
Query: 77 NN 78
Sbjct: 110 KK 111
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 32.5 bits (74), Expect = 0.65
Identities = 31/141 (21%), Positives = 57/141 (40%), Gaps = 20/141 (14%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK+ + K++K ++ K++ KK K+KK+ K KK K + + + + ++
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTD----RDVSTSTAASQKKSS 239
Query: 81 NNNNNNEEKGTFLYKMAAPLVLVLTYVAEITQPHLRGMLSATASMTTIFGTVSQLFLGSF 140
+ + E + L+ + L ++ LR ML A T+ GS
Sbjct: 240 DLESKLEAQSKELWSLKDDL------KKHVSTAELREMLEANGQDTS----------GSE 283
Query: 141 LHWRSAAILNLLFPILALCAL 161
L R ++F L C L
Sbjct: 284 LDLRDRCADGMMFGALGPCPL 304
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 32.5 bits (75), Expect = 0.65
Identities = 16/51 (31%), Positives = 25/51 (49%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
E++ K+ KA+ E+K+K+K+KKK KKKK K +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQD 432
Score = 29.5 bits (67), Expect = 6.0
Identities = 14/51 (27%), Positives = 25/51 (49%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
++E K+ + + ++++KKKEKKK KKKK K + +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQD 432
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 32.3 bits (72), Expect = 0.66
Identities = 16/42 (38%), Positives = 28/42 (66%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
WK +++ +K++EKK +K + +KKKEK K +K +K +K
Sbjct: 122 WKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEK 163
Score = 32.3 bits (72), Expect = 0.75
Identities = 17/36 (47%), Positives = 25/36 (69%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K++EKK+ + K + EKKKEK K +K +K +K KK
Sbjct: 131 KEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKK 166
Score = 32.0 bits (71), Expect = 0.88
Identities = 14/41 (34%), Positives = 31/41 (75%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA +KE ++ E +AKK+ ++E+ K+ K ++++ +K+++K
Sbjct: 95 KKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEK 135
Score = 31.2 bits (69), Expect = 1.7
Identities = 16/41 (39%), Positives = 29/41 (70%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK +E++ K+ KA+++ +KE++KKE KK + +KKK+
Sbjct: 110 KKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKE 150
Score = 29.3 bits (64), Expect = 6.3
Identities = 14/41 (34%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+A ++E+ K+ K +++ +K+++KK+ KK + +KKK+K
Sbjct: 111 KRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEK 151
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 30.4 bits (69), Expect = 0.69
Identities = 8/34 (23%), Positives = 28/34 (82%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+E+++ + A+ +++++E+ +KE++++K+ +K+K
Sbjct: 73 EEERRSARLARLEDREEERLEKEEEREKRARKRK 106
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 32.4 bits (73), Expect = 0.70
Identities = 4/37 (10%), Positives = 13/37 (35%)
Query: 55 KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
K K+ +N ++ N ++ + + +
Sbjct: 19 TPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRS 55
Score = 30.0 bits (67), Expect = 3.2
Identities = 4/34 (11%), Positives = 11/34 (32%)
Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
K K +N ++ N ++ + +
Sbjct: 19 TPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRH 52
Score = 29.7 bits (66), Expect = 4.5
Identities = 9/87 (10%), Positives = 27/87 (31%), Gaps = 3/87 (3%)
Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPL---VLVLTYVAEITQP 113
+ ++ N ++ + + + ++ E+ Y L L+
Sbjct: 29 NAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88
Query: 114 HLRGMLSATASMTTIFGTVSQLFLGSF 140
+ + S + ++ + S L S
Sbjct: 89 NPSDLNSKSLPLSNSKASSSSLSSSSS 115
Score = 28.9 bits (64), Expect = 8.0
Identities = 5/41 (12%), Positives = 15/41 (36%)
Query: 49 KKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
K K + ++ N ++ + + + ++ EK
Sbjct: 20 PKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEK 60
>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
transduction mechanisms].
Length = 208
Score = 31.7 bits (72), Expect = 0.74
Identities = 7/41 (17%), Positives = 19/41 (46%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+ + ++ E++AKK+E+ +K ++ + K
Sbjct: 109 RVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPR 149
Score = 29.4 bits (66), Expect = 3.4
Identities = 8/43 (18%), Positives = 22/43 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
++A+++ KK++E A ++ E+ + K K +++
Sbjct: 113 QRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPREEVHTP 155
Score = 29.1 bits (65), Expect = 4.6
Identities = 6/39 (15%), Positives = 19/39 (48%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
A++ E++ K+++ +K ++ + K K ++
Sbjct: 112 AQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPRE 150
Score = 28.7 bits (64), Expect = 7.3
Identities = 7/43 (16%), Positives = 20/43 (46%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KA+ + ++ +++ K++E +K ++ + K K
Sbjct: 107 KARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPR 149
>gnl|CDD|197700 smart00398, HMG, high mobility group.
Length = 70
Score = 29.6 bits (67), Expect = 0.74
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
WK ++EK E+KAKK +++ E++ E K
Sbjct: 39 RWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK 70
Score = 28.8 bits (65), Expect = 1.3
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ E KK + K ++EK E+K KK K++ ++
Sbjct: 22 AENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEE 64
Score = 26.9 bits (60), Expect = 6.0
Identities = 10/38 (26%), Positives = 23/38 (60%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+KK ++ K ++K +EK KK+K++ +++ + K
Sbjct: 33 SKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK 70
>gnl|CDD|234298 TIGR03659, IsdE, heme ABC transporter, heme-binding protein isdE.
This family of ABC substrate-binding proteins is
observed primarily in close proximity with proteins
localized to the cell wall and bearing the NEAT (NEAr
Transporter, pfam05031) heme-binding domain. IsdE has
been shown to bind heme and is involved in the process
of scavenging heme for the purpose of obtaining iron.
Length = 289
Score = 31.8 bits (73), Expect = 0.74
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
EK +E++A+K K+ +K+ E KKK K KKK K
Sbjct: 136 EKYGREEQAEKLVKEINEKEAEVKKKVKGKKKPK 169
>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family. This is a family
of sequences found in both bacteria and bacteriophages.
This region is approximately 130 residues long and in
some cases is found as part of the PVL
(Panton-Valentine leukocidin) group of genes, which
encode a member of the leukocidin group of bacterial
toxins that kill leukocytes by creation of pores in the
cell membrane. PVL appears to be a virulence factor
associated with a number of human diseases.
Length = 118
Score = 30.6 bits (69), Expect = 0.75
Identities = 10/34 (29%), Positives = 25/34 (73%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
++++ KK E ++E+K ++ E+K+ K+ + ++KK
Sbjct: 53 EYREIKKSENIEQERKERELERKRRKEAELRRKK 86
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 31.5 bits (72), Expect = 0.75
Identities = 14/44 (31%), Positives = 22/44 (50%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
EP P++ K K EK K + K ++K + K E K ++K
Sbjct: 100 EPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143
>gnl|CDD|235132 PRK03577, PRK03577, acid shock protein precursor; Provisional.
Length = 102
Score = 30.4 bits (68), Expect = 0.75
Identities = 17/37 (45%), Positives = 20/37 (54%), Gaps = 2/37 (5%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
AKK K KKE+KA E+K + KK KK K K
Sbjct: 61 AKKHHKNKKEQKA--PEQKAQAAKKHAKKHSHKTAAK 95
Score = 30.0 bits (67), Expect = 0.85
Identities = 16/41 (39%), Positives = 25/41 (60%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK KK ++K + AKK K K+++K ++K + KK KK
Sbjct: 47 KKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKK 87
Score = 28.1 bits (62), Expect = 4.4
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+KA+ +K K KK +K ++K + K+ KK K K
Sbjct: 56 QKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAK 95
Score = 27.7 bits (61), Expect = 5.2
Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKKKK 61
T P A K +K+ KK E+K + KK K KK++K ++K
Sbjct: 26 TTAAPAATTTTAAPAKTTHHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQK 77
>gnl|CDD|188873 cd09213, Luminal_IRE1_like, The Luminal domain, a dimerization
domain, of Inositol-requiring protein 1-like proteins.
The Luminal domain is a dimerization domain present in
Inositol-requiring protein 1 (IRE1), eukaryotic
translation Initiation Factor 2-Alpha Kinase 3
(EIF2AK3), and similar proteins. IRE1 and EIF2AK3 are
serine/threonine protein kinases (STKs) and are type I
transmembrane proteins that are localized in the
endoplasmic reticulum (ER). They are kinase receptors
that are activated through the release of BiP, a
chaperone bound to their luminal domains under
unstressed conditions. This results in dimerization
through their luminal domains, allowing
trans-autophosphorylation of their kinase domains and
activation. They play roles in the signaling of the
unfolded protein response (UPR), which is activated when
protein misfolding is detected in the ER in order to
decrease the synthesis of new proteins and increase the
capacity of the ER to cope with the stress. IRE1, also
called Endoplasmic reticulum (ER)-to-nucleus signaling
protein (or ERN), contains an endoribonuclease domain in
its cytoplasmic side and acts as an ER stress sensor. It
is the oldest and most conserved component of the UPR in
eukaryotes. Its activation results in the cleavage of
its mRNA substrate, HAC1 in yeast and Xbp1 in metazoans,
promoting a splicing event that enables translation into
a transcription factor which activates the UPR. EIF2AK3,
also called PKR-like Endoplasmic Reticulum Kinase
(PERK), phosphorylates the alpha subunit of eIF-2,
resulting in the downregulation of protein synthesis. It
functions as the central regulator of translational
control during the UPR pathway. In addition to the eIF-2
alpha subunit, EIF2AK3 also phosphorylates Nrf2, a
leucine zipper transcription factor which regulates
cellular redox status and promotes cell survival during
the UPR.
Length = 312
Score = 32.1 bits (73), Expect = 0.77
Identities = 4/32 (12%), Positives = 10/32 (31%)
Query: 55 KKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
K + + +N N + + + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312
Score = 31.7 bits (72), Expect = 0.87
Identities = 3/32 (9%), Positives = 10/32 (31%)
Query: 56 KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
K ++ + +N N + + +
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312
Score = 31.7 bits (72), Expect = 0.88
Identities = 5/32 (15%), Positives = 11/32 (34%)
Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNN 85
K + K + +N N + + + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312
Score = 30.5 bits (69), Expect = 2.2
Identities = 5/32 (15%), Positives = 12/32 (37%)
Query: 53 KKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
K + K++ +N N + + + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312
Score = 29.0 bits (65), Expect = 5.7
Identities = 5/32 (15%), Positives = 11/32 (34%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
K + K+ +N N + + + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312
Score = 29.0 bits (65), Expect = 6.4
Identities = 5/32 (15%), Positives = 11/32 (34%)
Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
K + K+ +N N + + + N
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312
Score = 29.0 bits (65), Expect = 7.0
Identities = 3/32 (9%), Positives = 10/32 (31%)
Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
+ + + + +N N + + G
Sbjct: 281 SPSKSEDKESAISNILEGENTSPSALEGVSGN 312
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 32.0 bits (73), Expect = 0.77
Identities = 12/45 (26%), Positives = 19/45 (42%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
A KK +K +K K +K K ++ +KK KK +
Sbjct: 331 AAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375
Score = 31.6 bits (72), Expect = 0.99
Identities = 13/43 (30%), Positives = 20/43 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KK +K +EK K +K + ++ KK KK + K K
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375
Score = 30.8 bits (70), Expect = 1.7
Identities = 10/43 (23%), Positives = 20/43 (46%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
K+ +K A++K K +K + ++ +KK KK +
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375
Score = 30.8 bits (70), Expect = 2.0
Identities = 13/37 (35%), Positives = 17/37 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
K K +K K ++A+KK KK K K K K
Sbjct: 345 KATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381
Score = 30.4 bits (69), Expect = 2.8
Identities = 12/44 (27%), Positives = 19/44 (43%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
P +K ++K K +K K + +KK KK + K K
Sbjct: 330 PAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAK 373
Score = 30.0 bits (68), Expect = 3.3
Identities = 10/43 (23%), Positives = 18/43 (41%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
KK ++ +EK K ++ K ++ +KK KK
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375
Score = 28.9 bits (65), Expect = 8.1
Identities = 12/43 (27%), Positives = 18/43 (41%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
KK K A ++K K +K K ++ +KK KK
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375
Score = 28.9 bits (65), Expect = 8.4
Identities = 12/42 (28%), Positives = 18/42 (42%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
A++K K +K K + +K+ KK K K K K
Sbjct: 340 VLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 32.5 bits (74), Expect = 0.79
Identities = 25/87 (28%), Positives = 36/87 (41%), Gaps = 17/87 (19%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKK--------------EKKAKKKEKKKEKKKK 47
SS K + P + K KK+K K E+ A KK KK +KK +
Sbjct: 398 GSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGR 457
Query: 48 EKKKKKKKKKK---KKKNNNNNNNNNN 71
+K K K KK++ + +NNN
Sbjct: 458 DKSSKVPSDSKAGGKKESVKSQEDNNN 484
>gnl|CDD|213640 TIGR01601, PYST-C1, Plasmodium yoelii subtelomeric domain
PYST-C1. This model represents the N-terminal domain
of a paralogous family of Plasmodium yoelii genes
preferentially located in the subtelomeric regions of
the chromosomes. There are no obvious homologs to these
genes in any other organism. The C-terminal portions of
the genes which contain this domain are divergent and
some contain other yoelii-specific paralogous domains
such as PYST-C2 (TIGR01604).
Length = 82
Score = 29.6 bits (66), Expect = 0.81
Identities = 8/34 (23%), Positives = 16/34 (47%)
Query: 58 KKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
K+ +N N+ + NNNN ++ ++K
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDN 73
Score = 28.8 bits (64), Expect = 1.6
Identities = 10/42 (23%), Positives = 23/42 (54%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
KE + +K + K + + NNNN ++ ++ +N ++ +N
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQKDDKDN 81
Score = 28.4 bits (63), Expect = 2.4
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 47 KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
KE + +K + K+ NNNN ++ ++ +N ++ + K
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQKDDKDNK 82
Score = 28.0 bits (62), Expect = 3.3
Identities = 9/43 (20%), Positives = 23/43 (53%)
Query: 45 KKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
K+ + +K + K + NNNN ++ ++ +N ++ +N+
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQKDDKDNK 82
Score = 27.7 bits (61), Expect = 3.8
Identities = 7/36 (19%), Positives = 17/36 (47%)
Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
K+ ++N N+ + NNNN ++ +++
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQ 75
Score = 27.3 bits (60), Expect = 6.0
Identities = 9/42 (21%), Positives = 22/42 (52%)
Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
K+ + EK + K + + NNN ++ ++ +N ++ +N
Sbjct: 40 KEINRSNEKNDIETKSEIQLNNNNPKDDIDDKDNPQKDDKDN 81
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 31.1 bits (71), Expect = 0.86
Identities = 12/55 (21%), Positives = 27/55 (49%), Gaps = 11/55 (20%)
Query: 19 DWK-----KAKKKEKKK------KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+W K K+ +K K K++ +K++++K++ K +K++ KN
Sbjct: 108 EWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKKRREKKERVAKNEKRELKN 162
Score = 30.7 bits (70), Expect = 1.0
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
K + K++ AKK+ +KKE+ K +K++ K KK
Sbjct: 130 VKPNDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164
Score = 29.6 bits (67), Expect = 2.9
Identities = 17/50 (34%), Positives = 22/50 (44%), Gaps = 7/50 (14%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKK----EKKKKKKKK---KKKKKNNNNN 66
K K K EK AKKK KK KK K E + + K+ K+ +
Sbjct: 76 KPKPPTKWEKFAKKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKD 125
Score = 28.4 bits (64), Expect = 6.6
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
KE +K+ +KKE+ + +K+E K KK
Sbjct: 133 NDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164
Score = 28.0 bits (63), Expect = 8.0
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 35/80 (43%)
Query: 20 WKK-AKKK--EKKKKEK----------------KAKKKEKKKE----------------K 44
W+K AKKK +K+KK K K K+ +K+
Sbjct: 83 WEKFAKKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFA 142
Query: 45 KKKEKKKKKKKKKKKKKNNN 64
KK+ +KK++ K +K++ N
Sbjct: 143 KKRREKKERVAKNEKRELKN 162
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 31.4 bits (72), Expect = 0.88
Identities = 12/58 (20%), Positives = 29/58 (50%), Gaps = 2/58 (3%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK--KKNNNNNNNNNNNNNNN 76
+KA + E+ K E + + K+ ++EK++ EK+ + + K + +K + +
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHAD 170
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 32.1 bits (73), Expect = 0.88
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K A++ + + K K++K KK+K+ K++ K
Sbjct: 338 KTARRMALLAEAARVKAKKEKLAKKRKQVSKEEAAAIKA 376
Score = 30.5 bits (69), Expect = 2.1
Identities = 9/33 (27%), Positives = 16/33 (48%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
E + + K +K KK+++ KE+ K K
Sbjct: 348 EAARVKAKKEKLAKKRKQVSKEEAAAIKAAGKA 380
>gnl|CDD|225100 COG2189, COG2189, Adenine specific DNA methylase Mod [DNA
replication, recombination, and repair].
Length = 590
Score = 32.1 bits (73), Expect = 0.88
Identities = 14/67 (20%), Positives = 21/67 (31%)
Query: 9 IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
IG + W KK K + + + + KK+K KK N +
Sbjct: 300 IGKPFNGSGRIWANFKKYLIKGLKADQNYNDNGYPRYYPFNLRYKKEKLDKKSKNRVIDP 359
Query: 69 NNNNNNN 75
N N
Sbjct: 360 INGNGTT 366
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 32.1 bits (73), Expect = 0.90
Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 4/38 (10%)
Query: 29 KKKEKKAKKKEKKKE----KKKKEKKKKKKKKKKKKKN 62
K EK+ KKK K+ + K+ K++K+KK KK+
Sbjct: 124 TKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161
Score = 31.3 bits (71), Expect = 1.5
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KE KKK K + + +++K+ K+KK KK+
Sbjct: 128 KEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163
Score = 30.6 bits (69), Expect = 2.4
Identities = 11/39 (28%), Positives = 19/39 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K K+ +KK ++ K+ K++K KK KK+
Sbjct: 125 KFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163
Score = 29.8 bits (67), Expect = 4.0
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ K EK+ K+K K+ + K+++++K+KK KK+
Sbjct: 121 DRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161
Score = 29.8 bits (67), Expect = 4.5
Identities = 11/37 (29%), Positives = 22/37 (59%), Gaps = 5/37 (13%)
Query: 32 EKKAKKKEKKKEKKKKEK-----KKKKKKKKKKKKNN 63
+ K EK+ +KK ++ + K++K++K+KKN
Sbjct: 120 LDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNK 156
>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 103
Score = 30.0 bits (68), Expect = 0.91
Identities = 10/44 (22%), Positives = 22/44 (50%)
Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
KK++K K + K +K++ K K N++ + + + +N
Sbjct: 40 KKQQKAKAREADKARKQQLKAKQRQAANDDEEDTIESVTQSQSN 83
Score = 29.6 bits (67), Expect = 1.4
Identities = 10/24 (41%), Positives = 18/24 (75%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKK 53
KK++KAK +E K +K++ K K++
Sbjct: 40 KKQQKAKAREADKARKQQLKAKQR 63
Score = 27.3 bits (61), Expect = 7.0
Identities = 10/46 (21%), Positives = 23/46 (50%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
E KK++K K ++ K +K++ K N++ + + + +N
Sbjct: 38 EYKKQQKAKAREADKARKQQLKAKQRQAANDDEEDTIESVTQSQSN 83
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 31.3 bits (72), Expect = 0.92
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E +K K+ E+ +K ++ +K KKKKK++K
Sbjct: 175 NEIIEKGLNVKQTEELIKKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|149453 pfam08397, IMD, IRSp53/MIM homology domain. The N-terminal
predicted helical stretch of the insulin receptor
tyrosine kinase substrate p53 (IRSp53) is an
evolutionary conserved F-actin bundling domain involved
in filopodium formation. The domain has been named IMD
after the IRSp53 and missing in metastasis (MIM)
proteins in which it occurs. Filopodium-inducing IMD
activity is regulated by Cdc42 and Rac1 and is
SH3-independent.
Length = 218
Score = 31.3 bits (71), Expect = 0.92
Identities = 14/42 (33%), Positives = 22/42 (52%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KK K + + KKK + EKK+ + KK ++K +K K
Sbjct: 95 LDKKYANALDKDYQTEYKKKRDELEKKQSDLKKLQRKSQKGK 136
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 31.2 bits (71), Expect = 0.93
Identities = 15/41 (36%), Positives = 21/41 (51%), Gaps = 3/41 (7%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K AK+KE+ E++ KE KK +K + KKKK
Sbjct: 159 RKDAKQKEEFAAEER---KEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 29.9 bits (68), Expect = 0.94
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
++ K+ EK+ K EK+ EK +KE ++ K++ K
Sbjct: 68 EERKETLEKEIKTLEKQLEKLEKELEELKEELYKLFGQQ 106
Score = 27.6 bits (62), Expect = 6.1
Identities = 10/38 (26%), Positives = 23/38 (60%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+ +K+ +K+ K +K+ +K EK+ +E K++ K +
Sbjct: 68 EERKETLEKEIKTLEKQLEKLEKELEELKEELYKLFGQ 105
>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
homology Domain (IMD), a dimerization module that binds
and bends membranes. Inverse (I)-BAR (or IMD) is a
member of the Bin/Amphiphysin/Rvs (BAR) domain family.
It is a dimerization and lipid-binding module that bends
membranes and induces membrane protrusions in the
opposite direction compared to classical BAR and F-BAR
domains, which produce membrane invaginations. IMD
domains are found in Insulin Receptor tyrosine kinase
Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
Brain-specific Angiogenesis Inhibitor 1-Associated
Protein 2-like (BAIAP2L) proteins. These are
multi-domain proteins that act as scaffolding proteins
and transducers of a variety of signaling pathways that
link membrane dynamics and the underlying actin
cytoskeleton. Most members contain an N-terminal IMD, an
SH3 domain, and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus, exccept for MIM which does not
carry an SH3 domain. Some members contain additional
domains and motifs. The IMD domain binds and bundles
actin filaments, binds membranes and produces membrane
protrusions, and interacts with the small GTPase Rac.
Length = 223
Score = 31.2 bits (71), Expect = 0.96
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 18 LDWKKAKKKEKK-KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
LD K K EK KKE K K+++ K + + +K +KK +K K
Sbjct: 105 LDQKVINKFEKDYKKEYKQKREDLDKARSELKKLQKKSQKSGTGKY 150
>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
permease components [Defense mechanisms].
Length = 567
Score = 32.0 bits (73), Expect = 0.96
Identities = 36/174 (20%), Positives = 54/174 (31%), Gaps = 54/174 (31%)
Query: 125 MTTIFGTVSQLFLGSFLHWRSAAILNLLFPILALCALYFIPESPHWLISQGRMQEASASL 184
T+I + L L L WR A IL L+ P+LAL + ++ S
Sbjct: 134 FTSILLLIGSLVLLFSLSWRLALILLLILPLLALVLSLL----------ARKSRKLSR-- 181
Query: 185 CWLRGWVTPDKVQTELSQITKAIEES---------------ELKRLGKDGQ--RRPNYRM 227
+V+ L ++ + ES ELKR + + RR N R
Sbjct: 182 ----------RVREALGELNARLLESLSGIRVIKAFGAEDRELKRFEEANEELRRANLRA 231
Query: 228 YMRRTFLLP-------------YAIVTSLFFIGQ--FGGMTTLQTYAVGIFESI 266
L P A+ L G G + Y + + I
Sbjct: 232 SRLEALLAPLMLLLSSLGTVLVLALGGFLVLSGSLTVGALAAFILYLLRLLTPI 285
>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
Length = 746
Score = 32.1 bits (73), Expect = 1.00
Identities = 7/44 (15%), Positives = 25/44 (56%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+++ ++K ++ +KA K K+ + +++ ++K+K+ +
Sbjct: 694 YQENQQKLAEEAREKALKDANKRAPIVAATQAREEVREKRKQTS 737
>gnl|CDD|152754 pfam12319, TryThrA_C, Tryptophan-Threonine-rich plasmodium antigen
C terminal. This protein is found in eukaryotes.
Proteins in this family are typically between 254 to 536
amino acids in length. This family is the C terminal of
a surface antigen of malarial Plasmodium species. It is
currently being targeted for use as part of a subunit
vaccine against Plasmodium falciparum, the main species
involved in causing human malaria.
Length = 217
Score = 31.1 bits (71), Expect = 1.00
Identities = 18/51 (35%), Positives = 21/51 (41%), Gaps = 2/51 (3%)
Query: 20 WKKAKKKEKKKKEKKAKKKEK--KKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
WK K E + K ++ E K EKKK K KKKK K N
Sbjct: 111 WKNKKILEWLMSDWKREEDEYWEKWEKKKWSKPLNLKKKKNWLKWKERINR 161
>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal. This domain
corresponds to the alpha helical C terminal domain of
phospholipase C beta.
Length = 181
Score = 30.9 bits (70), Expect = 1.0
Identities = 18/47 (38%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 21 KKAKK-KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
KK K+ EK+KKE K KK ++K+ ++ +E K K+K ++++K N
Sbjct: 55 KKLKEISEKEKKELK-KKLDRKRLERIQEAKTKEKHAQEREKTEINR 100
>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
[Translation, ribosomal structure and biogenesis].
Length = 98
Score = 29.6 bits (67), Expect = 1.1
Identities = 9/36 (25%), Positives = 16/36 (44%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K + K +KK++ +KK++ KK K
Sbjct: 63 VKLESLVGDTLYIPPDKLIRKKRKLPRKKRRPKKPK 98
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 30.2 bits (68), Expect = 1.1
Identities = 11/34 (32%), Positives = 27/34 (79%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
KE+++ + A+ +++++ +++E++K+KKKKK K
Sbjct: 83 KEERRSARMARLEDERELAREEEEEKRKKKKKNK 116
Score = 28.7 bits (64), Expect = 2.9
Identities = 13/45 (28%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 19 DWKKAKKKEKKKKEKKAKK--KEKKKEKKKKEKKKKKKKKKKKKK 61
D +A E +K+E+++ + + + + + +E++++K+KKKKK K
Sbjct: 72 DNMEATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116
>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region. Remorins are
plant-specific plasma membrane-associated proteins. In
tobacco remorin co-purifies with lipid rafts. Most
remorins have a variable, proline-rich C-half and a
more conserved N-half that is predicted to form coiled
coils. Consistent with this, circular dichroism studies
have demonstrated that much of the protein is
alpha-helical. Remorins exist in plasma membrane
preparations as oligomeric structures and form
filaments in vitro. The proteins can bind polyanions
including the extracellular matrix component
oligogalacturonic acid (OGA). In vitro, remorin in
plasma membrane preparations is phosphorylated
(principally on threonine residues) in the presence of
OGA and thus co-purifies with a protein kinases(s). The
biological functions of remorins are unknown but roles
as components of the membrane/cytoskeleton are
possible.
Length = 112
Score = 30.0 bits (68), Expect = 1.1
Identities = 16/35 (45%), Positives = 20/35 (57%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K+K K E + KK E+K EKKK E +K K K
Sbjct: 37 NKKKAKAEAELKKIERKLEKKKAEAAEKLKNKLAA 71
Score = 30.0 bits (68), Expect = 1.2
Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 16/58 (27%)
Query: 20 WKKAKKKEKKKKEKKAKKKE--------KKKEK-----KKKEKKKKKKKKKKKKKNNN 64
W++A EK K K +++E KKK K KK E+K +KKK + +K N
Sbjct: 13 WEEA---EKAKINNKYQREEAKIQAWENKKKAKAEAELKKIERKLEKKKAEAAEKLKN 67
Score = 30.0 bits (68), Expect = 1.2
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KA+ + KK + K KKK + EK K + KK ++K+
Sbjct: 42 KAEAELKKIERKLEKKKAEAAEKLKNKLAAAHKKAEEKRA 81
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 31.9 bits (73), Expect = 1.1
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
KK + K E+K KK+EKK++ K++EK+
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
Score = 30.0 bits (68), Expect = 3.6
Identities = 12/81 (14%), Positives = 36/81 (44%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
K KE + E++ ++ + + E + K K++++ +++ + + + + + +
Sbjct: 225 KLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLE 284
Query: 84 NNNEEKGTFLYKMAAPLVLVL 104
E+K L+ P +L
Sbjct: 285 ILKEKKDEELFWFEKPWPTLL 305
Score = 29.6 bits (67), Expect = 5.9
Identities = 10/36 (27%), Positives = 25/36 (69%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK 55
++ +K+K+ K +++ KKE+KK++ K+++K+
Sbjct: 643 FRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 31.5 bits (72), Expect = 1.1
Identities = 16/35 (45%), Positives = 23/35 (65%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
E++ KE +A+K+ + EKK EKK KK KK K +
Sbjct: 75 EEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDE 109
Score = 29.9 bits (68), Expect = 3.9
Identities = 12/44 (27%), Positives = 25/44 (56%), Gaps = 3/44 (6%)
Query: 19 DWKKAK---KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+++ K + EKK EKKAKK KK + ++ ++ + + ++
Sbjct: 80 EYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Score = 28.8 bits (65), Expect = 7.4
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +E+ ++E K + EK+ + +K+ +KK KK KK
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKG 106
>gnl|CDD|240528 cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug
and toxic compound extrusion (MATE)-like proteins.
Escherichia coli MurJ (MviN) has been identified as
essential for murein biosynthesis. It has been suggested
that MurJ functions as the peptidoglycan lipid II
flippase which is involved in translocation of
lipid-anchored peptidoglycan precursors across the
cytoplasmic membrane, though results obtained in
Bacillus subtilis seem to indicate that its MurJ
homologs are not essential for growth. Some MviN family
members (e.g. in Mycobacterium tuberculosis) possess an
extended C-terminal region that contains an
intracellular pseudo-kinase domain and an extracellular
domain resembling carbohydrate-binding proteins.
Proteins from the MATE family are involved in exporting
metabolites across the cell membrane and are often
responsible for multidrug resistance (MDR).
Length = 420
Score = 31.7 bits (73), Expect = 1.1
Identities = 26/109 (23%), Positives = 42/109 (38%), Gaps = 23/109 (21%)
Query: 233 FLLPYAIVTSLFFIGQFG----GMTT--LQTYAVGI------------FESIHAPLDP-Y 273
+L IV LF G F MT L YA+G+ F ++ P
Sbjct: 314 IVLAEPIVRLLFERGAFTAEDTLMTASALAAYALGLPAFALVKLLSRAFYALKDTKTPVK 373
Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQ 322
A + + V LL ++LI G LAL ++ + +++ V +
Sbjct: 374 IAVIAVAV----NILLNLLLIKPLGHVGLALATSLSAWVNALLLLVLLR 418
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 32.0 bits (73), Expect = 1.1
Identities = 11/49 (22%), Positives = 21/49 (42%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
++E+ K KE+ K+ + K K+ K+ K N++ N
Sbjct: 524 REEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNIL 572
Score = 31.2 bits (71), Expect = 1.7
Identities = 10/50 (20%), Positives = 23/50 (46%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
A++++ K+ K+ KK + K ++ K+ K+ N++ N
Sbjct: 523 AREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNIL 572
Score = 28.9 bits (65), Expect = 8.7
Identities = 10/49 (20%), Positives = 19/49 (38%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
++++ K KE+ KK + K K+ K N++ N
Sbjct: 524 REEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNIL 572
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to
196 amino acids in length.
Length = 149
Score = 30.4 bits (69), Expect = 1.2
Identities = 11/33 (33%), Positives = 17/33 (51%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K+ ++K K K K K+K+KK K K+
Sbjct: 13 KRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45
Score = 29.2 bits (66), Expect = 3.0
Identities = 12/42 (28%), Positives = 19/42 (45%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
A +++ E K +K+ K +K K K K+KK K
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKR 42
Score = 29.2 bits (66), Expect = 3.3
Identities = 13/41 (31%), Positives = 19/41 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K E K+ KK K +K K + K+KK K K+
Sbjct: 5 RKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45
Score = 28.1 bits (63), Expect = 6.5
Identities = 13/42 (30%), Positives = 21/42 (50%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ +K + K+++KK K +K K K K+KK K K
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKR 42
Score = 27.7 bits (62), Expect = 9.4
Identities = 9/41 (21%), Positives = 18/41 (43%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K+ + + ++K + +K K K K+KK K+
Sbjct: 2 ASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKR 42
>gnl|CDD|222225 pfam13564, DoxX_2, DoxX-like family. This family of
uncharacterized proteins are related to DoxX pfam07681.
Length = 102
Score = 29.8 bits (68), Expect = 1.2
Identities = 32/109 (29%), Positives = 42/109 (38%), Gaps = 11/109 (10%)
Query: 238 AIVTSLFFIGQFGGMTTLQT-YAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHY 296
+ +LFF+ G M + V F + P YF LL G+ EL GAL +L+
Sbjct: 3 TGLLALFFLF-SGVMKLFRPEEVVAAFAWLGYP--LYFVRLL-GILELLGALG--LLLPL 56
Query: 297 TGKRPLALISTGGSAACFIVVAVYAQFHLSYG-WDSPLVPTVFLVLAAF 344
G L ++ G A I HL G P V L L A
Sbjct: 57 VGAPRLKELAAAGLALLMIGAIA---AHLRVGDPFHVAFPAVLLALLAV 102
>gnl|CDD|238686 cd01390, HMGB-UBF_HMG-box, HMGB-UBF_HMG-box, class II and III
members of the HMG-box superfamily of DNA-binding
proteins. These proteins bind the minor groove of DNA
in a non-sequence specific fashion and contain two or
more tandem HMG boxes. Class II members include
non-histone chromosomal proteins, HMG1 and HMG2, which
bind to bent or distorted DNA such as four-way DNA
junctions, synthetic DNA cruciforms, kinked
cisplatin-modified DNA, DNA bulges, cross-overs in
supercoiled DNA, and can cause looping of linear DNA.
Class III members include nucleolar and mitochondrial
transcription factors, UBF and mtTF1, which bind
four-way DNA junctions.
Length = 66
Score = 28.7 bits (65), Expect = 1.2
Identities = 13/28 (46%), Positives = 22/28 (78%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
WK+ ++EKKK E+KA+K +++ EK+ K
Sbjct: 39 WKELSEEEKKKYEEKAEKDKERYEKEMK 66
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 30.7 bits (70), Expect = 1.2
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
KKK+ KK+E+ K K+K + K + +KK K
Sbjct: 122 GKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
Score = 30.7 bits (70), Expect = 1.3
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KK KKE++ +K KK+ + K + +KKN+
Sbjct: 124 KKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154
Score = 30.3 bits (69), Expect = 1.9
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
KKK KK+E+ KK KKK + K + N+
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154
Score = 30.0 bits (68), Expect = 2.4
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKKK KK ++ +K K+K + K + +KK K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
Score = 29.6 bits (67), Expect = 3.3
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
KKK+ KKE++ K+ KKK + K + N+
Sbjct: 122 GKKKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154
Score = 29.2 bits (66), Expect = 4.7
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+KK KK+E+ K+ KKK + K + +KK +
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154
Score = 29.2 bits (66), Expect = 4.9
Identities = 15/37 (40%), Positives = 23/37 (62%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
+D K KK +K+++ KKAKKK + K + +KK K
Sbjct: 119 IDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
Score = 28.8 bits (65), Expect = 5.7
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
KK+K KKE++ KK +KK E + + +KK K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
Score = 28.8 bits (65), Expect = 6.1
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
K+KK K+++ KK KKK + K + +KK K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
Score = 28.0 bits (63), Expect = 9.9
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 29 KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKK+ K +++ KK +KK + K + +KK K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 29.2 bits (66), Expect = 1.2
Identities = 10/36 (27%), Positives = 30/36 (83%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+K +++ E++ K++E++KEK+K+ ++++KK ++++
Sbjct: 53 EKAREETERERKEREERKEKRKRAIEERRKKIEERR 88
Score = 26.9 bits (60), Expect = 7.7
Identities = 8/32 (25%), Positives = 25/32 (78%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
++ +++ K+++E+K K+K +E++KK ++++
Sbjct: 57 EETERERKEREERKEKRKRAIEERRKKIEERR 88
>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
This is a region of 120 amino acids that is conserved
in a family of proteins found from plants to fungi. The
function is not known.
Length = 114
Score = 29.9 bits (68), Expect = 1.3
Identities = 13/34 (38%), Positives = 21/34 (61%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
E+KK ++ K+ EKK E+ EK+ K K++ K
Sbjct: 60 ERKKATRRLKQLEKKLEETSDEKEIKDLKEQLSK 93
Score = 28.7 bits (65), Expect = 3.1
Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 6/47 (12%)
Query: 21 KKAKKKEKKKK------EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KK KK KK KKA ++ K+ EKK +E +K+ K K++
Sbjct: 44 KKRKKMAKKYHMVRFFERKKATRRLKQLEKKLEETSDEKEIKDLKEQ 90
>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain. The
sequences featured in this family are found repeated in
a number of plant calmodulin-binding proteins, and are
thought to constitute the calmodulin-binding domains.
Binding of the proteins to calmodulin depends on the
presence of calcium ions. These proteins are thought to
be involved in various processes, such as plant defence
responses and stolonisation or tuberization.
Length = 115
Score = 29.7 bits (67), Expect = 1.3
Identities = 12/55 (21%), Positives = 20/55 (36%)
Query: 17 PLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
KK K +K+K K +K K+ K +K +K + N +
Sbjct: 7 ERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEKVRKFNPRGPNFLPLEPDP 61
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 31.4 bits (71), Expect = 1.3
Identities = 8/63 (12%), Positives = 29/63 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK ++++ E++ ++ +++ + + ++ K KK + + N + + N +
Sbjct: 347 KKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEES 406
Query: 81 NNN 83
+
Sbjct: 407 KGS 409
Score = 30.2 bits (68), Expect = 2.7
Identities = 9/47 (19%), Positives = 24/47 (51%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
W+ + + + KK + EK + K +K++ +K++ +++ N
Sbjct: 320 WRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREAR 366
Score = 30.2 bits (68), Expect = 3.5
Identities = 14/73 (19%), Positives = 35/73 (47%), Gaps = 3/73 (4%)
Query: 22 KAKKKEKKKKEKKAKK---KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
KA+KKE +K + A++ +E ++E+ + + ++ K K + + N + + N
Sbjct: 344 KARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSEN 403
Query: 79 NNNNNNNNEEKGT 91
+ + + + T
Sbjct: 404 EESKGSPPQVEAT 416
>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
Length = 476
Score = 31.5 bits (72), Expect = 1.3
Identities = 12/36 (33%), Positives = 19/36 (52%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
K+++K+E KK K K+K K + + NN N
Sbjct: 202 KEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNK 237
Score = 31.2 bits (71), Expect = 1.4
Identities = 15/41 (36%), Positives = 21/41 (51%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
K K KE +K+E KK K K+K K+ + + NN N
Sbjct: 197 KYLKIKEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNK 237
Score = 31.2 bits (71), Expect = 1.6
Identities = 14/41 (34%), Positives = 20/41 (48%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNN 73
K K KE +KE+ K+ K K+K K + + NN N
Sbjct: 197 KYLKIKEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNK 237
Score = 31.2 bits (71), Expect = 1.8
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
L K+ +K+E KK K K+K K + KK+++ K K
Sbjct: 199 LKIKEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNKPK 239
>gnl|CDD|220569 pfam10101, DUF2339, Predicted membrane protein (DUF2339). This
domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 738
Score = 31.6 bits (72), Expect = 1.3
Identities = 24/129 (18%), Positives = 35/129 (27%), Gaps = 13/129 (10%)
Query: 229 MRRTFLLPYAIVTSLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGAL 288
+ R F L ++ L + + G+ A L A L A
Sbjct: 319 LLRAFGLLLQLLALLSLLALLLLGYLNSAFLTGLL-------LWALAAAALAAAWLLLAE 371
Query: 289 LCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHI 348
G L + + A + LS W + + LVLAA
Sbjct: 372 RLSRRRLRQGLAALFAL---AALAALALALTME---LSDAWLTLALAAQVLVLAALARRY 425
Query: 349 CIRLLPWML 357
L W L
Sbjct: 426 VAPRLRWAL 434
>gnl|CDD|224208 COG1289, COG1289, Predicted membrane protein [Function unknown].
Length = 674
Score = 31.6 bits (72), Expect = 1.3
Identities = 26/151 (17%), Positives = 43/151 (28%), Gaps = 28/151 (18%)
Query: 268 APLDPYFATLLLGVAELGGALLCVVL--IHYTG----KRPLALISTGGSAACFIVVAVYA 321
A LLG+ + A+ V + +G K LI G+ F V +
Sbjct: 20 AACLALALAFLLGLPQPSWAVSTVAIVSAPDSGAVLSKGLKRLI---GTLIGFAVALLLV 76
Query: 322 QFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIF 381
W L+ T++L L + R AS A + Y
Sbjct: 77 ALLAQEPWLFLLLLTLWLGLCTAIGS---LY--------------RTIASYAFVLAGYTA 119
Query: 382 AFAVNKLYYPMLDTFHLWGTLYFYAAISVVG 412
P + L+ + ++G
Sbjct: 120 LIIGPAPAIPEPEL--LFDGAVWRVVEILLG 148
>gnl|CDD|237034 PRK12278, PRK12278, 50S ribosomal protein L21/unknown domain fusion
protein; Provisional.
Length = 221
Score = 31.0 bits (70), Expect = 1.3
Identities = 8/40 (20%), Positives = 9/40 (22%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ K K E K KKE K
Sbjct: 107 DASGVKAATGAGKVEVAAEAAPAKAKKEAAPKAAPAPAAA 146
>gnl|CDD|239175 cd02774, MopB_Res-Cmplx1_Nad11-M, MopB_Res_Cmplx1_Nad11_M:
Mitochondrial-encoded NADH-quinone
oxidoreductase/respiratory complex I, the second domain
of the Nad11/75-kDa subunit of some protists.
NADH-quinone oxidoreductase is the first
energy-transducting complex in the respiratory chain and
functions as a redox pump that uses the redox energy to
translocate H+ ions across the membrane, resulting in a
significant contribution to energy production. The nad11
gene codes for the largest (75-kDa) subunit of the
mitochondrial NADH-quinone oxidoreductase, it
constitutes the electron input part of the enzyme, or
the so-called NADH dehydrogenase fragment. The Nad11
subunit is made of two domains: the first contains three
binding sites for FeS clusters (the fer2 domain), the
second domain (this CD), is of unknown function or, as
postulated, has lost an ancestral formate dehydrogenase
activity that became redundant during the evolution of
the complex I enzyme. Although only vestigial sequence
evidence remains of a molybdopterin binding site, this
protein domain belongs to the molybdopterin_binding
(MopB) superfamily of proteins.
Length = 366
Score = 31.2 bits (71), Expect = 1.4
Identities = 14/35 (40%), Positives = 16/35 (45%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
KK K N N+NN NNN N + N NN
Sbjct: 107 KKLLNKLGSLNTNSNNFLENNNYFNLDLENYLFNN 141
Score = 30.8 bits (70), Expect = 1.9
Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 2/46 (4%)
Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNNNNN--NNNNNNNEEKGTFL 93
KK K N+NN NNN N + N NN+ N +K
Sbjct: 107 KKLLNKLGSLNTNSNNFLENNNYFNLDLENYLFNNSLKNLDKSDLC 152
>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1. This protein
family shows weak but suggestive similarity to
translation initiation factor SUI1 and its prokaryotic
homologs.
Length = 173
Score = 30.6 bits (69), Expect = 1.4
Identities = 10/36 (27%), Positives = 20/36 (55%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ +E+K +K +K ++ ++ KKKKK +K
Sbjct: 55 TIGEAQEEKGEKDLLGIQKAQEAREGGKKKKKLPQK 90
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 31.5 bits (72), Expect = 1.4
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 6/46 (13%)
Query: 23 AKKKEKKKKEKKAKKKEKKKE------KKKKEKKKKKKKKKKKKKN 62
K K + KKE+K +E +K+ +K KEKK+K + KK K
Sbjct: 752 NKLKAQLKKERKGAMRELRKDNRFIAREKLKEKKEKDAEYHKKMKR 797
Score = 28.8 bits (65), Expect = 8.4
Identities = 11/61 (18%), Positives = 27/61 (44%), Gaps = 9/61 (14%)
Query: 10 GTFTEPTPLDWKKAKKKEKKKK------EKKAKKKEKKK---EKKKKEKKKKKKKKKKKK 60
P P + K E ++ +++A+ ++ K E K+E ++ KK + ++
Sbjct: 220 TVKPPPKPPMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERLKKLEAERL 279
Query: 61 K 61
+
Sbjct: 280 R 280
>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
PREFRAN. This family is the long protein of a novel
CRISPR subtype, PREFRAN, which is most common in
Prevotella and Francisella, although widely distributed.
The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
helicase Cas3 and endonuclease Cas3-HD.
Length = 1287
Score = 31.7 bits (72), Expect = 1.4
Identities = 14/65 (21%), Positives = 29/65 (44%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
K +K KK+KK KE+ +K+ K + ++ KN + + N + ++
Sbjct: 82 KLKKFKKDKKDLAKEQDTLRKQIVKSLQDNLPEEIFKNIFSKKLLKGVLEDFIKNTHKDS 141
Query: 85 NNEEK 89
+E+
Sbjct: 142 EDEQN 146
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding
of TATA binding protein. TMFs are evolutionarily
conserved golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 28.8 bits (65), Expect = 1.4
Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 21 KKAKKKEKKKKE--KKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+K KKE K KK + K K+ EK+ E KKK +K +K+ +N
Sbjct: 22 EKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELEN 65
>gnl|CDD|180892 PRK07220, PRK07220, DNA topoisomerase I; Validated.
Length = 740
Score = 31.3 bits (71), Expect = 1.4
Identities = 14/36 (38%), Positives = 24/36 (66%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+KK+KE+KA++ +K+K K K+ K KK +K +
Sbjct: 669 QKKQKEEKAQQPKKEKPKSIKDIKGIGKKTAEKLEE 704
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 31.5 bits (71), Expect = 1.5
Identities = 17/40 (42%), Positives = 21/40 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKA K+ K+K AKK KK +K KK KK +K
Sbjct: 1008 KKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKA 1047
Score = 31.1 bits (70), Expect = 1.8
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKA K+K K+ AKK +K KK KK +K +K
Sbjct: 1013 KKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052
Score = 31.1 bits (70), Expect = 2.2
Identities = 17/41 (41%), Positives = 21/41 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKA K+ K+ AKK KK+ KK KK +K KK
Sbjct: 998 KKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKK 1038
Score = 30.7 bits (69), Expect = 2.6
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
KKA K+ K+ AKKK KK KK +K KK KK
Sbjct: 1003 KKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKK 1042
Score = 30.0 bits (67), Expect = 4.7
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 2/41 (4%)
Query: 23 AKKKEKKKK--EKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
AKK KKK +K KK +K KK KK +K +K
Sbjct: 1012 AKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052
Score = 30.0 bits (67), Expect = 4.8
Identities = 18/48 (37%), Positives = 20/48 (41%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ KKA K+K AKK KK KK KK KKK K
Sbjct: 976 KTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAK 1023
>gnl|CDD|129141 TIGR00030, S21p, ribosomal protein S21. This model describes
bacterial ribosomal protein S21 and most mitochondrial
and chloroplast equivalents [Protein synthesis,
Ribosomal proteins: synthesis and modification].
Length = 58
Score = 28.4 bits (64), Expect = 1.5
Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 3/45 (6%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEK---KKKEKKKKKKKKKKKKKNN 63
A ++ K+K EK+ +E KK + K E++++K+K K+
Sbjct: 13 SALRRFKRKLEKEGILRELKKRRYYEKPSERRRRKEKAAAKRIRK 57
Score = 28.0 bits (63), Expect = 1.9
Identities = 13/43 (30%), Positives = 28/43 (65%), Gaps = 4/43 (9%)
Query: 21 KKAKKK-EKKKKEKKAKKK---EKKKEKKKKEKKKKKKKKKKK 59
++ K+K EK+ ++ KK+ EK E++++++K K+ +KK
Sbjct: 16 RRFKRKLEKEGILRELKKRRYYEKPSERRRRKEKAAAKRIRKK 58
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 30.8 bits (70), Expect = 1.5
Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKK---AKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
T + ++ ++ K+ K+ AKK ++ EK K+ K + + KKK ++ +
Sbjct: 125 TSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHE 179
Score = 29.6 bits (67), Expect = 3.9
Identities = 13/43 (30%), Positives = 27/43 (62%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+ ++ EKK K +AK+K K+ +K++E K++ ++ +K K
Sbjct: 235 NELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRG 277
>gnl|CDD|224099 COG1178, ThiP, ABC-type Fe3+ transport system, permease component
[Inorganic ion transport and metabolism].
Length = 540
Score = 31.1 bits (71), Expect = 1.6
Identities = 30/171 (17%), Positives = 56/171 (32%), Gaps = 47/171 (27%)
Query: 234 LLPYAIVTSLFFIGQFGGMTTL----QTYAVGIFESIHAPLDPYFATLLLGVAELGGALL 289
+ A++ + + FG L T I+++I + D A LL + L A L
Sbjct: 197 IAAGALLVFMLCLADFGTPLILGGGYTTLETAIYQAIMSSFDLARAALLALLLLLITAAL 256
Query: 290 CVVLIHYTGKRPLALISTGGS-------------------AACFIVVA-----VYAQFHL 325
++ + + +S GS F+++ V F L
Sbjct: 257 LLLERRLRRAKQKSTLSGKGSRPKQRLGRLRRLLGAALIVLLLFLILLPLLAVVLPSFVL 316
Query: 326 SYGWD-----------SPLVPTVFL-VLAAFLTHIC-------IRLLPWML 357
++G L+ ++ L + AA L + +R L L
Sbjct: 317 NWGRPLSLVLASPQFWQALLNSLLLALAAALLAVVLALLLAYAVRRLRSRL 367
>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG. Members of
this family are the protein YabG, demonstrated for
Bacillus subtilis to be an endopeptidase able to
release N-terminal peptides from a number of
sporulation proteins, including CotT, CotF, and SpoIVA.
It appears to be expressed under control of sigma-K
[Cellular processes, Sporulation and germination].
Length = 283
Score = 30.8 bits (70), Expect = 1.6
Identities = 12/52 (23%), Positives = 27/52 (51%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
D + + E+KK++K+ K++ + + ++ K K+KK+ + NN
Sbjct: 48 DLELVDESERKKRKKRFKERMETSYRLFRQDYKLMKEKKEYRATGGYKNNQL 99
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 31.2 bits (71), Expect = 1.6
Identities = 13/46 (28%), Positives = 27/46 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K+ ++KE++ +E K K KE +K ++ E++ + ++ K KK
Sbjct: 331 KELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELER 376
Score = 29.6 bits (67), Expect = 4.7
Identities = 12/42 (28%), Positives = 24/42 (57%), Gaps = 3/42 (7%)
Query: 23 AKKKEKKKKEKKAKKKEKKK---EKKKKEKKKKKKKKKKKKK 61
+KE + E +K E+K E++ +E KK+ ++ ++K K
Sbjct: 242 ELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283
Score = 29.3 bits (66), Expect = 6.8
Identities = 14/41 (34%), Positives = 26/41 (63%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K E+K +E + + +E KKE ++ E+K K+ K+ K+K
Sbjct: 252 GSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKA 292
Score = 29.3 bits (66), Expect = 6.9
Identities = 12/36 (33%), Positives = 24/36 (66%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
+ K+ EK+ KE + K+++ +KE ++ EK KK+ +
Sbjct: 459 AELKRIEKELKEIEEKERKLRKELRELEKVLKKESE 494
Score = 28.9 bits (65), Expect = 9.1
Identities = 15/46 (32%), Positives = 28/46 (60%), Gaps = 4/46 (8%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK----KKKKKKKKN 62
++ KK +K KE+ ++++ KKE +K EK ++ ++K KK K
Sbjct: 690 EEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKA 735
Score = 28.9 bits (65), Expect = 9.9
Identities = 9/42 (21%), Positives = 25/42 (59%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ ++KE KK E++ K ++ + +K ++ +K+ ++ +K
Sbjct: 614 LEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655
>gnl|CDD|217091 pfam02537, CRCB, CrcB-like protein. CRCB is a putative integral
membrane protein possibly involved in chromosome
condensation. Over expression in E. coli also leads to
camphor resistance.
Length = 117
Score = 29.5 bits (67), Expect = 1.7
Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 2/54 (3%)
Query: 243 LFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHY 296
L G GG TT T+A+ + + Y LL + L LL V L
Sbjct: 66 LLGTGFCGGFTTFSTFALETVQLLRQ--GKYLRALLYLLLSLLLGLLAVALGFA 117
>gnl|CDD|177120 MTH00045, ND6, NADH dehydrogenase subunit 6; Validated.
Length = 162
Score = 30.3 bits (69), Expect = 1.7
Identities = 7/39 (17%), Positives = 18/39 (46%)
Query: 69 NNNNNNNNNNNNNNNNNNEEKGTFLYKMAAPLVLVLTYV 107
+N +NNN + + ++ + LY +L+ ++
Sbjct: 101 DNFLELSNNNWSLVSLSDLVGSSELYNSGGLYLLLGGFI 139
>gnl|CDD|226646 COG4178, COG4178, ABC-type uncharacterized transport system,
permease and ATPase components [General function
prediction only].
Length = 604
Score = 31.1 bits (71), Expect = 1.7
Identities = 22/80 (27%), Positives = 32/80 (40%), Gaps = 21/80 (26%)
Query: 208 EESELKRLGKD-GQRRPNYRMYMRR--------------TFLLPYAIVTSLFFIGQ--FG 250
E+ E +RL N+R +R + +LP I +F GQ FG
Sbjct: 264 EKVERRRLDDRFDAVLGNWRRLVRAQIRLTWFQLGYGWLSVVLPILIAAPRYFSGQITFG 323
Query: 251 GMTTLQTYAVGIFESIHAPL 270
G+ AVG F +H+ L
Sbjct: 324 GLMQ----AVGAFGQVHSSL 339
>gnl|CDD|216213 pfam00956, NAP, Nucleosome assembly protein (NAP). NAP proteins
are involved in moving histones into the nucleus,
nucleosome assembly and chromatin fluidity. They affect
the transcription of many genes.
Length = 236
Score = 30.4 bits (69), Expect = 1.7
Identities = 11/39 (28%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 13 TEPTPLDWKKAKK--KEKKKKEKKAKKKEKKKEKKKKEK 49
+E TP++WK+ K + KK+++ KK + + K
Sbjct: 144 SEGTPIEWKEGKNLTVKTVKKKQRNKKTGQTRTITKTVP 182
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 30.6 bits (69), Expect = 1.8
Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 3/48 (6%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK---KKKKNNN 64
WK+ +EK+ K+A ++ +++K+E KK K +N
Sbjct: 109 WKELTDEEKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEP 156
Score = 29.4 bits (66), Expect = 3.4
Identities = 20/62 (32%), Positives = 24/62 (38%), Gaps = 8/62 (12%)
Query: 9 IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKK----KEKKKKKKKKKKKKKNNN 64
IG F E P K + K + KA +E K K E KKKK K KK
Sbjct: 148 IGPFIENEP----KIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEE 203
Query: 65 NN 66
+
Sbjct: 204 YD 205
Score = 28.7 bits (64), Expect = 6.0
Identities = 10/46 (21%), Positives = 19/46 (41%)
Query: 3 SSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKE 48
S + T+ W + + +KKK K KK +++ + E
Sbjct: 165 PSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPE 210
>gnl|CDD|147641 pfam05582, Peptidase_U57, YabG peptidase U57. YabG is a protease
involved in the proteolysis and maturation of SpoIVA and
YrbA proteins, conserved with the cortex and/or coat
assembly by Bacillus subtilis.
Length = 287
Score = 30.6 bits (70), Expect = 1.8
Identities = 10/52 (19%), Positives = 25/52 (48%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
D +K E + +K+ ++K + K ++ +K ++K++ + N N
Sbjct: 49 DLEKVDASEVTRYKKRFEEKINESLKLIRQDRKLMREKQEYRATRGYNENEL 100
Score = 29.5 bits (67), Expect = 4.2
Identities = 10/49 (20%), Positives = 20/49 (40%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
K + K + +EK E K ++ +K ++K++ N N
Sbjct: 52 KVDASEVTRYKKRFEEKINESLKLIRQDRKLMREKQEYRATRGYNENEL 100
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 30.8 bits (70), Expect = 1.8
Identities = 10/41 (24%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ A+ E ++ E+ +++ ++KE++KK+ K++K+++K+ +
Sbjct: 168 RNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAE 208
Score = 28.9 bits (65), Expect = 7.1
Identities = 10/30 (33%), Positives = 25/30 (83%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKK 54
+ E++++E+K ++K++ KE+K++EK+ +K
Sbjct: 180 EAERRRREEKERRKKQDKERKQREKETAEK 209
>gnl|CDD|233309 TIGR01200, GLPGLI, GLPGLI family protein. This protein family was
first noted as a paralogous set in Porphyromonas
gingivalis, but it is more widely distributed among the
Bacteroidetes. The protein family is now renamed GLPGLI
after its best-conserved motif.
Length = 227
Score = 30.4 bits (69), Expect = 1.8
Identities = 10/35 (28%), Positives = 14/35 (40%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K + + KK KK +K KK K+K
Sbjct: 193 IKIELNPTPIIIIPKPKKGKKISREKFKKIKEKYY 227
Score = 29.6 bits (67), Expect = 3.8
Identities = 14/37 (37%), Positives = 17/37 (45%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KA K E K KK +K +EK KK K+K
Sbjct: 191 KAIKIELNPTPIIIIPKPKKGKKISREKFKKIKEKYY 227
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 30.2 bits (68), Expect = 1.8
Identities = 12/39 (30%), Positives = 25/39 (64%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KKA +KE++K+ K K++E+K + ++ +K+K+
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAG 71
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 30.6 bits (69), Expect = 1.8
Identities = 11/40 (27%), Positives = 29/40 (72%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KK + + +++ K+KEK++ + ++ K+ ++KK+++K+ N
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRN 192
Score = 30.2 bits (68), Expect = 2.3
Identities = 10/49 (20%), Positives = 28/49 (57%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
+ K+KEK++ E K KE +++K++++++ + K+ ++ +
Sbjct: 162 RSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSG 210
Score = 29.8 bits (67), Expect = 2.8
Identities = 9/50 (18%), Positives = 31/50 (62%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
K++ K+++K++ + ++K ++ + KK++EK++ + K+ ++ +
Sbjct: 161 KRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSG 210
>gnl|CDD|216016 pfam00602, Flu_PB1, Influenza RNA-dependent RNA polymerase subunit
PB1. Two GTP binding sites exist in this protein.
Length = 740
Score = 31.0 bits (70), Expect = 1.9
Identities = 11/41 (26%), Positives = 21/41 (51%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
LD + K + K +++ KE K K+K+ K+K + +
Sbjct: 175 LDKPEVKFTKHKNNKRRYDDKETGKLLKRKQPSKEKDRITR 215
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 30.2 bits (68), Expect = 1.9
Identities = 10/38 (26%), Positives = 24/38 (63%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+ + EK EK ++KEKKKE ++ +++ ++ ++ +
Sbjct: 167 EELEDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204
>gnl|CDD|145700 pfam02681, DUF212, Divergent PAP2 family. This family is related
to the pfam01569 family (personal obs: C Yeats).
Length = 141
Score = 29.8 bits (68), Expect = 1.9
Identities = 18/61 (29%), Positives = 23/61 (37%), Gaps = 15/61 (24%)
Query: 292 VLIHY--TGK-RPLALISTGG-----SAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAA 343
V+I + L TGG SA V A+ L G+DS L F + A
Sbjct: 22 VIIELFKERRWDFRVLFETGGMPSSHSAL---VTALATGVGLQEGFDSSL----FAIAAV 74
Query: 344 F 344
F
Sbjct: 75 F 75
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 30.7 bits (70), Expect = 1.9
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKK 50
K++E+K+KEKKAKK+ K++ + + K
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNK 417
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the
parasitophorous vacuole membrane. Members have an
initial hydrophobic, Phe/Tyr-rich, stretch long enough
to span the membrane, a highly charged region rich in
Lys, a second putative transmembrane region and a
second highly charged, low complexity sequence region.
Some members have up to 100 residues of additional
C-terminal sequence. These genes have been shown to be
found in the sub-telomeric regions of both Plasmodium
falciparum and P. yoelii chromosomes.
Length = 82
Score = 28.4 bits (64), Expect = 2.0
Identities = 16/25 (64%), Positives = 17/25 (68%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKK 51
KKKK KK KK + EKKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 28.0 bits (63), Expect = 2.9
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKK 60
KK+K K+ KK + +KKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 27.2 bits (61), Expect = 5.9
Identities = 14/25 (56%), Positives = 15/25 (60%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKK 56
KK K K+ KK EKKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 27.2 bits (61), Expect = 6.5
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKKKK 59
AKKK+ KK KK + +KKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This
family includes outer membrane proteins such as OmpH
among others. Skp (OmpH) has been characterized as a
molecular chaperone that interacts with unfolded
proteins as they emerge in the periplasm from the Sec
translocation machinery.
Length = 140
Score = 29.5 bits (67), Expect = 2.0
Identities = 13/47 (27%), Positives = 27/47 (57%), Gaps = 5/47 (10%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-----KEKKKKKKKKKKKKKN 62
K ++ E +K EK+ +K ++K +K ++KK+K+ +KK +
Sbjct: 27 FKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQE 73
Score = 29.5 bits (67), Expect = 2.4
Identities = 14/56 (25%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKE--KKKKKKKKKKKKKNNNNNNNNNNNNN 74
K+ +K ++K ++ A E +EKK+KE KK ++ ++K++K +
Sbjct: 39 KELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQ 94
Score = 29.1 bits (66), Expect = 3.0
Identities = 14/40 (35%), Positives = 25/40 (62%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K +K+ EK+ KK++ + EK +KE +K K+K +K +
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLS 56
Score = 27.5 bits (62), Expect = 9.5
Identities = 11/38 (28%), Positives = 24/38 (63%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
K A+K+ +K+ +K+ + EK +++ +K K+K +K
Sbjct: 15 AGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDA 52
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 30.9 bits (70), Expect = 2.0
Identities = 11/66 (16%), Positives = 23/66 (34%), Gaps = 3/66 (4%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN---NNNNNNNNNN 79
K + K+ K +K K K+ K N +N +++ N +
Sbjct: 157 IKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKS 216
Query: 80 NNNNNN 85
++ +N
Sbjct: 217 SSKDNQ 222
Score = 29.8 bits (67), Expect = 4.2
Identities = 9/65 (13%), Positives = 24/65 (36%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
K + + K K + +K K K+ N +N+ +++ N ++
Sbjct: 158 KNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSS 217
Query: 85 NNEEK 89
+ + +
Sbjct: 218 SKDNQ 222
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 30.1 bits (68), Expect = 2.0
Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 5/61 (8%)
Query: 6 KHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK-----KKKKKK 60
K Q+ + +++++++E K + KEKK EKK KE ++ + KK+ +
Sbjct: 79 KLQVEKRALEVWGSEEALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGR 138
Query: 61 K 61
Sbjct: 139 V 139
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 30.9 bits (69), Expect = 2.0
Identities = 22/67 (32%), Positives = 30/67 (44%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K K KK +K AKK E++ + K+E + K K NN N+ N N
Sbjct: 757 KHVAKSHLKKPKKAAKKLEQRSKANKEELANENNKLMNILKEYFGNNEQINSITYNFENI 816
Query: 81 NNNNNNE 87
N N +NE
Sbjct: 817 NLNEDNE 823
Score = 30.5 bits (68), Expect = 3.2
Identities = 24/88 (27%), Positives = 43/88 (48%), Gaps = 6/88 (6%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKK-----KKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
KK KKA KK +++ K KE+ + K K+ NN N+ N N N N +N N
Sbjct: 765 KKPKKAAKKLEQRSKANKEELANENNKLMNILKEYFGNNEQINSITYNFENINLNEDNEN 824
Query: 85 NNEEKGTFLYKMAAPLV-LVLTYVAEIT 111
+++ +K + +++Y+ +I+
Sbjct: 825 GSKKILDLNHKDQKEIFEEIISYIVDIS 852
Score = 29.4 bits (65), Expect = 6.9
Identities = 14/32 (43%), Positives = 20/32 (62%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
E K E +A K + + KK+ EK KKKK K++
Sbjct: 310 ELKSLEHRAAKAAEAEMKKRAEKPKKKKSKRR 341
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 30.9 bits (70), Expect = 2.0
Identities = 9/42 (21%), Positives = 22/42 (52%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
++ E+K ++++KKKE+ K K++K++ +
Sbjct: 337 PNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKR 378
Score = 29.7 bits (67), Expect = 4.2
Identities = 13/49 (26%), Positives = 28/49 (57%)
Query: 11 TFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
F+ P + +K + ++K KKKE+ K K++K++ + +K+ K+
Sbjct: 333 RFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381
Score = 29.7 bits (67), Expect = 4.5
Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 24 KKKEKKKKEKKAKKKEKKKEK-KKKEKKKKKKKKKKKKK 61
E+K ++KEKKKE+ K KE+K++ + +K+ K
Sbjct: 342 NDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFK 380
>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2. This family
contains the bacterial histone H1-like nucleoprotein
HC2 (approximately 200 residues long), which seems to
be found mostly in Chlamydia. HC2 functions in DNA
condensation, although it has been suggested that it
also has other roles.
Length = 187
Score = 30.1 bits (67), Expect = 2.0
Identities = 18/41 (43%), Positives = 19/41 (46%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A +K KK AKK KK KK KK KK KK
Sbjct: 42 KPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKK 82
Score = 29.0 bits (64), Expect = 4.4
Identities = 17/41 (41%), Positives = 18/41 (43%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K KK K+ AKK KK KK KK KK KK
Sbjct: 47 KTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAKK 87
Score = 29.0 bits (64), Expect = 4.9
Identities = 16/41 (39%), Positives = 17/41 (41%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K A KK +K K KK KK KK KK KK
Sbjct: 37 KVAAKKPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKK 77
>gnl|CDD|225168 COG2259, COG2259, Predicted membrane protein [Function unknown].
Length = 142
Score = 29.6 bits (67), Expect = 2.1
Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 10/88 (11%)
Query: 242 SLFFIGQFGGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGGALLCVVLIHYTGKRP 301
+F + G + + FES+ P P +L + EL G LL +L+ R
Sbjct: 25 LIFIVSGLGKLFGGLAGTIQYFESLGLPP-PTLLAILAAILELVGGLL--LLLGL-FTRL 80
Query: 302 LALISTGGSAACFIVVAVYAQFHLSYGW 329
AL+ A F++VA++A H +G+
Sbjct: 81 AALV-----LAVFMLVAIFA-VHAFWGF 102
>gnl|CDD|240414 PTZ00425, PTZ00425, asparagine-tRNA ligase; Provisional.
Length = 586
Score = 30.8 bits (69), Expect = 2.2
Identities = 14/42 (33%), Positives = 24/42 (57%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
KK KK +K+ + N N NNNN N++++N ++ +L
Sbjct: 273 NKKNKKGEKREDILNTCNANNNNGNSSSSNAVSSPAYPDQYL 314
Score = 30.4 bits (68), Expect = 3.2
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
KK KK +K+++ N N NNNN N++++N
Sbjct: 273 NKKNKKGEKREDILNTCNANNNNGNSSSSN 302
Score = 30.0 bits (67), Expect = 3.6
Identities = 15/46 (32%), Positives = 26/46 (56%)
Query: 50 KKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYK 95
KK KK +K++ N N NNNN N++++N ++ ++ YK
Sbjct: 273 NKKNKKGEKREDILNTCNANNNNGNSSSSNAVSSPAYPDQYLIDYK 318
Score = 29.6 bits (66), Expect = 5.3
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 49 KKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
KK KK +K++ N N NNNN N++++N
Sbjct: 273 NKKNKKGEKREDILNTCNANNNNGNSSSSN 302
>gnl|CDD|205184 pfam13003, MRL1, Ribosomal protein L1. This family includes
putative ribosomal L1 and L10 proteins and fragments.
Length = 105
Score = 29.2 bits (64), Expect = 2.2
Identities = 20/88 (22%), Positives = 32/88 (36%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
KKK K + K K K +KKK + +K K + N NN + N
Sbjct: 12 YKKKVKTNDKTKKGTKTVKNDKKKVDNIEKIKSYSYMENEKRLYPRNIYEIENNVYLDLN 71
Query: 83 NNNNEEKGTFLYKMAAPLVLVLTYVAEI 110
+ K + A+ ++ + EI
Sbjct: 72 LDMALGKKKKVEPFASKILGPKPLLPEI 99
>gnl|CDD|227497 COG5169, HSF1, Heat shock transcription factor [Transcription].
Length = 282
Score = 30.5 bits (69), Expect = 2.2
Identities = 14/76 (18%), Positives = 27/76 (35%), Gaps = 7/76 (9%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
+ K K ++ E + + + KK K+KK +N ++NN + + N
Sbjct: 72 GFHKVSHKSGQRSYYNENVWEFGNKNFQLGMIELLKKIKRKKAPSNRVDSNNESKDAMMN 131
Query: 79 ---NN----NNNNNNE 87
N + N
Sbjct: 132 IEVENIILPQSELYNS 147
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 30.8 bits (69), Expect = 2.3
Identities = 9/44 (20%), Positives = 22/44 (50%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K+ KKK ++ + + K+K E++ ++ +KK+ +
Sbjct: 645 KRIKKKAYRRIRRDKRLKKKMPEEENTQENHLGSEKKRHGGVPD 688
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 30.6 bits (69), Expect = 2.3
Identities = 11/42 (26%), Positives = 31/42 (73%), Gaps = 1/42 (2%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
+D++ + +EK ++E++ ++K++ K K ++E+K K++ K++
Sbjct: 1 MDFELRRAREKLEREQR-ERKQRAKLKLERERKAKEEAAKQR 41
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is
functionally equivalent to L7/L12 in bacteria and the
P1 and P2 proteins in eukaryotes. L12p is homologous to
P1 and P2 but is not homologous to bacterial L7/L12. It
is located in the L12 stalk, with proteins L10, L11,
and 23S rRNA. L12p is the only protein in the ribosome
to occur as multimers, always appearing as sets of
dimers. Recent data indicate that most archaeal species
contain six copies of L12p (three homodimers), while
eukaryotes have four copies (two heterodimers), and
bacteria may have four or six copies (two or three
homodimers), depending on the species. The organization
of proteins within the stalk has been characterized
primarily in bacteria, where L7/L12 forms either two or
three homodimers and each homodimer binds to the
extended C-terminal helix of L10. L7/L12 is attached to
the ribosome through L10 and is the only ribosomal
protein that does not directly interact with rRNA.
Archaeal L12p is believed to function in a similar
fashion. However, hybrid ribosomes containing the large
subunit from E. coli with an archaeal stalk are able to
bind archaeal and eukaryotic elongation factors but not
bacterial elongation factors. In several mesophilic and
thermophilic archaeal species, the binding of 23S rRNA
to protein L11 and to the L10/L12p pentameric complex
was found to be temperature-dependent and cooperative.
Length = 106
Score = 29.0 bits (65), Expect = 2.3
Identities = 10/26 (38%), Positives = 22/26 (84%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKE 48
A +EK +++++ KKKE++KE++++E
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 28.6 bits (64), Expect = 2.6
Identities = 9/21 (42%), Positives = 17/21 (80%)
Query: 32 EKKAKKKEKKKEKKKKEKKKK 52
+ K +EK++EKKK+E+K++
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEE 93
Score = 27.8 bits (62), Expect = 6.3
Identities = 10/23 (43%), Positives = 19/23 (82%)
Query: 35 AKKKEKKKEKKKKEKKKKKKKKK 57
A E+K E+K++EKKK+++K++
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEE 93
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 28.1 bits (63), Expect = 2.4
Identities = 14/45 (31%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Query: 21 KKAKKKEKKKKEKKAK---KKEKKKEKKKKEKKKKKKKKKKKKKN 62
K+ +KE +E K + +K +K K+KK +K+K K+ +K+
Sbjct: 20 KRKVEKEGILREMKEREFYEKPSEKRKRKKAAARKRKFKRLRKEQ 64
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 30.6 bits (70), Expect = 2.4
Identities = 5/78 (6%), Positives = 38/78 (48%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
T A++ E+++++++ ++ +++ + + E+ ++ +++++ + ++ + +
Sbjct: 151 QPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRRE 210
Query: 73 NNNNNNNNNNNNNNEEKG 90
+ + + +G
Sbjct: 211 QGDRREERGRRDGGDRRG 228
Score = 29.5 bits (67), Expect = 5.8
Identities = 1/30 (3%), Positives = 8/30 (26%)
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
+ + + + + + +G
Sbjct: 211 QGDRREERGRRDGGDRRGRRRRRDRRDARG 240
Score = 29.5 bits (67), Expect = 6.0
Identities = 0/35 (0%), Positives = 9/35 (25%)
Query: 56 KKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
+ + + + + + + + +
Sbjct: 196 RGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRR 230
Score = 29.5 bits (67), Expect = 6.1
Identities = 3/70 (4%), Positives = 33/70 (47%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNN 80
++ +++E+ ++ ++ +++++ E ++ E+ +++++ + + + +
Sbjct: 162 ERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRR 221
Query: 81 NNNNNNEEKG 90
+ + +
Sbjct: 222 DGGDRRGRRR 231
Score = 29.1 bits (66), Expect = 6.5
Identities = 0/28 (0%), Positives = 6/28 (21%)
Query: 61 KNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
+ + + + + +
Sbjct: 214 RREERGRRDGGDRRGRRRRRDRRDARGD 241
Score = 29.1 bits (66), Expect = 8.3
Identities = 1/46 (2%), Positives = 18/46 (39%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNE 87
+E++ ++ + ++ ++++ + + + + +
Sbjct: 193 REERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDA 238
Score = 28.7 bits (65), Expect = 9.9
Identities = 2/44 (4%), Positives = 14/44 (31%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
E+ ++E++ + + ++ + + +
Sbjct: 189 ERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRR 232
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 30.2 bits (68), Expect = 2.5
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K+ K EK+ + +K++ ++ K + KKKKK KK+K
Sbjct: 38 KTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKFKKEK 77
Score = 30.2 bits (68), Expect = 2.9
Identities = 16/42 (38%), Positives = 27/42 (64%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
D K K+ K ++++ + K+++ E+ K E KKKKK KK+K
Sbjct: 36 DGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKFKKEK 77
>gnl|CDD|185443 PTZ00097, PTZ00097, 60S ribosomal protein L19; Provisional.
Length = 175
Score = 29.7 bits (67), Expect = 2.5
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
K K KEKK +++ ++ K K + K K KK
Sbjct: 144 KNEKVKEKKIQDQLEARRAKAKALRNKRKAKK 175
Score = 29.7 bits (67), Expect = 2.7
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
K EK K++K + E ++ K K + K+K KK
Sbjct: 144 KNEKVKEKKIQDQLEARRAKAKALRNKRKAKK 175
Score = 28.5 bits (64), Expect = 6.4
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K K KEKK + + + ++ K K + K+K
Sbjct: 142 KTKNEKVKEKKIQDQLEARRAKAKALRNKRK 172
Score = 28.1 bits (63), Expect = 8.4
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K K +K K+ K +++ + ++ K K + K+K KK
Sbjct: 142 KTKNEKVKEKKIQDQLEARRAKAKALRNKRKAKK 175
Score = 28.1 bits (63), Expect = 9.1
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K EK +KK + + + ++ K K + K+K KK
Sbjct: 142 KTKNEKVKEKKIQDQLEARRAKAKALRNKRKAKK 175
>gnl|CDD|216256 pfam01032, FecCD, FecCD transport family. This is a sub-family of
bacterial binding protein-dependent transport systems
family. This Pfam entry contains the inner components of
this multicomponent transport system.
Length = 311
Score = 30.2 bits (69), Expect = 2.5
Identities = 33/155 (21%), Positives = 51/155 (32%), Gaps = 29/155 (18%)
Query: 278 LLGVAELGGALLCVVLIHYTGKRPLALISTGG--SAACFIVVAVYAQFHLSYGWDSPLVP 335
L V+ GALL V + PLA S G S A V F + PL
Sbjct: 55 ALAVS---GALLQTVT-----RNPLADPSILGVSSGAALGAVLALLLFGGLSLFGLPLAA 106
Query: 336 TVFLVLAAFLTHICIRLL----PWMLI------GEVFPNNIRATASGASGSSSYIFAFAV 385
+ +LAA L R P LI +F ++ S A+
Sbjct: 107 FIGALLAALLVLFLARRRGGLSPLRLILAGIAVSALF-------SALTSLLLLLANPEAL 159
Query: 386 NKLYYPMLDTFHL--WGTLYFYAAISVVGTLYMYF 418
+ + +L + W + + ++G L +
Sbjct: 160 QSILFWLLGSLSGASWSDVLILLPLLLIGLLLLLL 194
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 30.4 bits (69), Expect = 2.5
Identities = 12/57 (21%), Positives = 23/57 (40%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNN 75
DW + K K + K+ +K+++ KK+ K+K+ K N +
Sbjct: 341 DWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQA 397
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid
layers. The innermost capsid (core) is made of VP2. The
genomic RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 30.7 bits (69), Expect = 2.5
Identities = 14/76 (18%), Positives = 30/76 (39%), Gaps = 3/76 (3%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN--- 77
+ + + K EK+ +K + E K+K KK++ +N ++ ++ N
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64
Query: 78 NNNNNNNNNEEKGTFL 93
+ + EE L
Sbjct: 65 ADEVKKSTKEESKQLL 80
Score = 29.1 bits (65), Expect = 7.5
Identities = 12/66 (18%), Positives = 29/66 (43%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
+ + + + +EK EK+ ++ + + K+K KK +N ++ ++ N
Sbjct: 4 RNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLK 63
Query: 84 NNNEEK 89
+E K
Sbjct: 64 IADEVK 69
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is
known to localise at the desmosomes and is implicated
in anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines.
A common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 29.4 bits (66), Expect = 2.6
Identities = 9/43 (20%), Positives = 31/43 (72%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ + KEK++ E + K +E++K+++++ +K+K++ +++++
Sbjct: 26 EESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRR 68
Score = 28.6 bits (64), Expect = 4.8
Identities = 9/39 (23%), Positives = 27/39 (69%), Gaps = 2/39 (5%)
Query: 22 KAKKKEKKKKEKKAKKK--EKKKEKKKKEKKKKKKKKKK 58
K ++E + K+ ++ E+K E+++K+++++ +K+K+
Sbjct: 22 KFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKR 60
>gnl|CDD|234874 PRK00955, PRK00955, hypothetical protein; Provisional.
Length = 620
Score = 30.7 bits (70), Expect = 2.7
Identities = 14/37 (37%), Positives = 17/37 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
K KK K+K KK KK+KKK+KK
Sbjct: 584 PKCLIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
Score = 29.9 bits (68), Expect = 4.2
Identities = 12/39 (30%), Positives = 16/39 (41%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K+ K K +K KK+KKK+KK
Sbjct: 582 NGPKCLIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 28.9 bits (65), Expect = 2.7
Identities = 13/50 (26%), Positives = 28/50 (56%), Gaps = 7/50 (14%)
Query: 21 KKAKKKEKKKKEKKAKKKEK-------KKEKKKKEKKKKKKKKKKKKKNN 63
K +KK+ K E+K +K K E++K+E++ +++K +K++
Sbjct: 30 AKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREK 79
Score = 28.1 bits (63), Expect = 5.1
Identities = 6/38 (15%), Positives = 23/38 (60%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K +++KK+ K ++++ ++ K + +++K+++
Sbjct: 28 KAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKRER 65
>gnl|CDD|200352 TIGR04101, CCGSCS, CCGSCS motif protein. This protein family,
with average protein length about 58 residues, occurs
in several marine bacteria, such as Shewanella benthica
KT99, Marinobacter sp. ELB17, and Photobacterium
profundum 3TCK. The striking feature is a C-terminal
motif CCGSCS, which (perhaps coincidentally) resembles
conserved core motif [LC]CGSC shared by two
methanobactin precursors (see TIGR04071). There is no
detectable conserved gene region for these proteins.
Length = 59
Score = 27.6 bits (61), Expect = 2.7
Identities = 9/36 (25%), Positives = 22/36 (61%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
KK+ K+ K + + ++ ++ +E+ + ++KKK K
Sbjct: 12 KKSAAKQVKLELEAIQQAQQSEEQVAADIEEKKKPK 47
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 30.0 bits (67), Expect = 2.7
Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 13 TEPTPLDWKKAKKKEK--KKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
TE K+AK+K+K K+ + + K E K K K K K K+K
Sbjct: 173 TEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKADKADKTKPKPKEK 223
Score = 28.5 bits (63), Expect = 9.8
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+A+K+ +KK +KA+ E +K + EK K K+ K+K
Sbjct: 148 EARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQK 186
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 30.6 bits (69), Expect = 2.7
Identities = 13/53 (24%), Positives = 27/53 (50%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNN 70
+ + K + +KK + K+++ KK+E +K+ KK +K N+ +N
Sbjct: 406 VHFDTLYKTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLDN 458
Score = 29.0 bits (65), Expect = 7.1
Identities = 15/68 (22%), Positives = 25/68 (36%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNN 81
K E KK E K ++K + KK ++ KN+ + + +
Sbjct: 364 NPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKI 423
Query: 82 NNNNNEEK 89
N NN E+
Sbjct: 424 NQNNKEQD 431
>gnl|CDD|224194 COG1275, TehA, Tellurite resistance protein and related permeases
[Inorganic ion transport and metabolism].
Length = 329
Score = 30.4 bits (69), Expect = 2.8
Identities = 25/164 (15%), Positives = 40/164 (24%), Gaps = 34/164 (20%)
Query: 274 FATLLLGVAELGGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSY-----G 328
+ G L +VL L + FI +A +
Sbjct: 169 YVFFGAGFF-SYLVLFPIVLYRLFTAGLLPPAARPTI---FIQLAPIGLVGVGLLLIVNS 224
Query: 329 WDSPLVPTVFLVLAAFLTHICIRLLPWMLIGE---------VFPNNIRATASGASGSSSY 379
S + + LL +L+ FP I AT++ G S
Sbjct: 225 GPSLTFVLILWGFGLLFLFFALLLLLRVLLRLPFSPSWWAFTFPLVILATSALELGKSI- 283
Query: 380 IFAFAVNKLYYPMLDTFHLWGTLYFYAAISVVGTL---YMYFVM 420
+ FH G + I + L Y +
Sbjct: 284 ------------GIGVFHYLGLILGTFLIFIWIVLLVAYTIRAI 315
>gnl|CDD|227400 COG5068, ARG80, Regulator of arginine metabolism and related MADS
box-containing transcription factors [Transcription].
Length = 412
Score = 30.4 bits (68), Expect = 2.8
Identities = 8/27 (29%), Positives = 12/27 (44%)
Query: 59 KKKNNNNNNNNNNNNNNNNNNNNNNNN 85
KN N + N + NN + N + N
Sbjct: 308 PPKNLPNGSPNQSPLNNLSRGNPASPN 334
Score = 30.0 bits (67), Expect = 3.6
Identities = 7/27 (25%), Positives = 11/27 (40%)
Query: 60 KKNNNNNNNNNNNNNNNNNNNNNNNNN 86
N N + N + NN + N + N
Sbjct: 308 PPKNLPNGSPNQSPLNNLSRGNPASPN 334
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 29.1 bits (66), Expect = 2.8
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
KK + K++ K EKKA++ + E E KK+
Sbjct: 79 KKLQAKQQAKAEKKAEENKAAGEAFLAENAKKE 111
Score = 28.7 bits (65), Expect = 3.3
Identities = 11/34 (32%), Positives = 18/34 (52%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+KK + K+ K EKK E+ K + + KK+
Sbjct: 78 QKKLQAKQQAKAEKKAEENKAAGEAFLAENAKKE 111
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 30.2 bits (68), Expect = 2.8
Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 2/41 (4%)
Query: 10 GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
G F KKA+ K+K ++K + KK KK + +
Sbjct: 343 GAFDVDA--YPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381
Score = 29.8 bits (67), Expect = 3.7
Identities = 11/34 (32%), Positives = 15/34 (44%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KK E K+K +K K KK KK + +
Sbjct: 348 DAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381
Score = 28.7 bits (64), Expect = 9.3
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KK E +++K K K KK KKN
Sbjct: 352 KKAEAARKRKGDRKGVSHKAKKGGKKNQ 379
Score = 28.7 bits (64), Expect = 9.4
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
KK + K+K ++K K KK K N
Sbjct: 347 VDAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQ 379
>gnl|CDD|223912 COG0842, COG0842, ABC-type multidrug transport system, permease
component [Defense mechanisms].
Length = 286
Score = 30.3 bits (68), Expect = 2.8
Identities = 36/179 (20%), Positives = 61/179 (34%), Gaps = 28/179 (15%)
Query: 232 TFLLPYAIVTSLFFIGQFGGMTTLQTYAV-GIFESIH-APLDPYFATLL-----LGVAEL 284
F++P I+ S+ F G F + L G E + +P+ F L L VA L
Sbjct: 88 AFIVPGVILMSVLFSGIFSFSSALFREREFGTLERLLVSPVSRLFILLGKIVPYLVVASL 147
Query: 285 GGALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYA------QFHLSYGWDSPLVPTVF 338
L+ +V+ G L + +++A A F S + V +
Sbjct: 148 IAGLVLLVIAFLLGVPFLGSLLLLLLLLLLLLLATVALGLLLSTFAKSQLQCASAVGNLL 207
Query: 339 LVLAAFLTHIC-----IRLLPWML--IGEVFPNN-----IRATASGASGSSSYIFAFAV 385
++ FL + LLP L I + P +R G + + +
Sbjct: 208 ILPLGFL---SGVFFPLELLPAWLQGISYINPLTYAIDALRYVYLGGWRNDGIWISLLI 263
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 29.7 bits (66), Expect = 2.9
Identities = 12/47 (25%), Positives = 34/47 (72%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
E + + +++E++ +EK+ K K K +E++K+E++++++ +K+K+
Sbjct: 67 ERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKE 113
>gnl|CDD|214945 smart00970, s48_45, Sexual stage antigen s48/45 domain. This
family contains sexual stage s48/45 antigens from
Plasmodium (approximately 450 residues long). These are
surface proteins expressed by Plasmodium male and
female gametes that have been shown to play a conserved
and important role in fertilisation.
Length = 116
Score = 29.0 bits (65), Expect = 2.9
Identities = 6/36 (16%), Positives = 14/36 (38%)
Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
+ NN + + +N++N N + + F
Sbjct: 9 NEPSTNNLSYSFLSNDDNVENIECTIKSKPNDIIGF 44
>gnl|CDD|215579 PLN03106, TCP2, Protein TCP2; Provisional.
Length = 447
Score = 30.4 bits (68), Expect = 2.9
Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 5/60 (8%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN-----NNNNNNNNNNNNN 84
+ E + K +E+ +E+ KEK+K+ NNN + N++ + NNN
Sbjct: 203 RSELRDKARERARERTAKEKEKEDHNHAAHHNNNNPISQNSSFTELLTGGIDPVNNNRQW 262
>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
replication, recombination, and repair / Transcription /
Translation, ribosomal structure and biogenesis].
Length = 513
Score = 30.5 bits (69), Expect = 2.9
Identities = 15/128 (11%), Positives = 33/128 (25%), Gaps = 3/128 (2%)
Query: 418 FVMPETEGRTLRDIEEHFADKGKTFVTNIRRAEKKRHAARVEGQEEVKGEDNFGMEGEDG 477
FV E E + L+ IE+ K + V + + + D +
Sbjct: 376 FVTEEEEVKKLKRIEKRLERKLPSAVLLPLDEPEDAKLLKTTRPGLEEESDISDEIKKLK 435
Query: 478 KYISRVVGTLYMYFVMPETEGRTLRDIEEHFADKGKTFVTNIRRAEKKRHAAKVEGQEEV 537
++ L + F + + ++I + A +E +
Sbjct: 436 SSKKALLRGLGVRFTLSKLLANLGKEI---PGAGDAVTIDPELERRSPNSADDIEYILKG 492
Query: 538 KGEDNFGM 545
Sbjct: 493 LSYRAEER 500
>gnl|CDD|192333 pfam09690, PYST-C1, Plasmodium yoelii subtelomeric region
(PYST-C1). This group of sequences are defined by the
N-terminal domain of a paralogous family of Plasmodium
yoelii genes preferentially located in the subtelomeric
regions of the chromosomes. There are no obvious
homologues to these genes in any other organism. The
C-terminal portions of the genes that contain this
domain are divergent and some contain other
yoelii-specific paralogous domains such as PYST-C2
(IPR006491).
Length = 57
Score = 27.4 bits (61), Expect = 3.0
Identities = 11/44 (25%), Positives = 24/44 (54%)
Query: 46 KKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
KK + +K + K+ NNNNN + ++ +++ + +N + K
Sbjct: 14 KKINRSNEKNGIESKRETQLNNNNNKDPKDDKDDDGDKDNKKNK 57
Score = 27.0 bits (60), Expect = 3.6
Identities = 11/43 (25%), Positives = 24/43 (55%)
Query: 41 KKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNN 83
KK + EK + K++ + NNNN + ++ +++ + +N N
Sbjct: 14 KKINRSNEKNGIESKRETQLNNNNNKDPKDDKDDDGDKDNKKN 56
Score = 27.0 bits (60), Expect = 4.3
Identities = 10/43 (23%), Positives = 25/43 (58%)
Query: 42 KEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNN 84
K+ + +K + K++ + NNNNN + ++ +++ + +N N
Sbjct: 14 KKINRSNEKNGIESKRETQLNNNNNKDPKDDKDDDGDKDNKKN 56
>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
(DUF1713). This domain is found at the C terminal end
of mitochondrial proteins of unknown function.
Length = 34
Score = 26.5 bits (59), Expect = 3.0
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K ++K KK K KK +K+ + ++K
Sbjct: 6 LRKRRRKMKKHKYKKLRKRTRALRRK 31
>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897). This
is a bacterial family of uncharacterized proteins.
Length = 55
Score = 27.2 bits (61), Expect = 3.0
Identities = 8/28 (28%), Positives = 14/28 (50%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKK 50
AK K + ++K KK ++ E+ K
Sbjct: 27 AKMKFPQFDKQKKKKAKRNAEQDAKNSA 54
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 29.7 bits (67), Expect = 3.1
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
KK+++KK++ K +KK+ EK E+ K K
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPK 195
Score = 29.3 bits (66), Expect = 4.0
Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 7/46 (15%)
Query: 20 WKKAKKKEK-------KKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
W+ A+K K E KK+++KKE+ K +KK+ +K
Sbjct: 140 WEAAEKGGDVHLQANPTKLELLRKKEKEKKEQLKIQKKQSLLEKYG 185
Score = 29.3 bits (66), Expect = 5.0
Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 27 EKKKKEKKAKKKEKKKEKKKK--EKKKKKKKKKKKKKNN 63
E +K++K KK++ K +KK+ EK ++ K K
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKEL 197
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 29.5 bits (67), Expect = 3.1
Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 10/68 (14%)
Query: 6 KHQIGTFTEPTPLDWKKAKKKEKKK---------KEKKAKKKEKKKEKKKKE-KKKKKKK 55
+ + + +D K K ++++K ++ K KK EK+++E + +K+
Sbjct: 26 RGLLSELLQWLLIDRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKEM 85
Query: 56 KKKKKKNN 63
K N
Sbjct: 86 MMDMMKPN 93
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 28.8 bits (65), Expect = 3.1
Identities = 11/38 (28%), Positives = 28/38 (73%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ E+ +++ + +E+KK +K E++KKK++K++ ++
Sbjct: 73 KQVEQAREQWQEANQERKKLEKLLEREKKKEQKEENRR 110
Score = 27.6 bits (62), Expect = 9.0
Identities = 10/43 (23%), Positives = 30/43 (69%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
L+ + + ++ +++ ++A ++ KK EK + +KKK++K++ ++
Sbjct: 68 LEQAEKQVEQAREQWQEANQERKKLEKLLEREKKKEQKEENRR 110
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 29.6 bits (67), Expect = 3.1
Identities = 13/31 (41%), Positives = 22/31 (70%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
+E+K++E+ +E + KKKE +KKKK+K
Sbjct: 149 EEQKEEEEAKLAEEALEALKKKEAEKKKKEK 179
Score = 28.8 bits (65), Expect = 5.4
Identities = 12/44 (27%), Positives = 30/44 (68%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ + ++++ ++ +E+K +++ K E+ + KKK+ +KKKK+K
Sbjct: 137 EDESSEEEIEEFEEQKEEEEAKLAEEALEALKKKEAEKKKKEKE 180
>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit. This
family represents the eukaryotic vacuolar (H+)-ATPase
(V-ATPase) G subunit. V-ATPases generate an acidic
environment in several intracellular compartments.
Correspondingly, they are found as membrane-attached
proteins in several organelles. They are also found in
the plasma membranes of some specialised cells.
V-ATPases consist of peripheral (V1) and membrane
integral (V0) heteromultimeric complexes. The G subunit
is part of the V1 subunit, but is also thought to be
strongly attached to the V0 complex. It may be involved
in the coupling of ATP degradation to H+ translocation.
Length = 105
Score = 28.3 bits (64), Expect = 3.2
Identities = 9/37 (24%), Positives = 23/37 (62%), Gaps = 3/37 (8%)
Query: 25 KKEKKKKEKKAK---KKEKKKEKKKKEKKKKKKKKKK 58
+K + K+ K+AK +KE ++ + ++E + K+ + +
Sbjct: 24 RKRRAKRLKQAKEEAEKEIEEYRAQREAEFKEFEAEH 60
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 30.3 bits (69), Expect = 3.2
Identities = 12/47 (25%), Positives = 25/47 (53%), Gaps = 8/47 (17%)
Query: 22 KAKKKEKKKKE--------KKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+A ++EKKK E ++A+ + +K ++ + KK + + K K
Sbjct: 439 RAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDK 485
Score = 29.1 bits (66), Expect = 6.8
Identities = 9/49 (18%), Positives = 27/49 (55%), Gaps = 7/49 (14%)
Query: 22 KAKKKEKKKKEKKAKK-------KEKKKEKKKKEKKKKKKKKKKKKKNN 63
KA+ + ++++KKA++ ++ + E++K ++ + KK + +
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAK 483
>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
(FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
module that binds and bends membranes. F-BAR domains
are dimerization modules that bind and bend membranes
and are found in proteins involved in membrane dynamics
and actin reorganization. F-BAR domain containing
proteins, also known as Pombe Cdc15 homology (PCH)
family proteins, include Fes and Fer tyrosine kinases,
PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
srGAPs. Many members also contain an SH3 domain and play
roles in endocytosis. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
These tubules have diameters larger than those observed
with N-BARs. The F-BAR domains of some members such as
NOSTRIN and Rgd1 are important for the subcellular
localization of the protein.
Length = 191
Score = 29.6 bits (67), Expect = 3.4
Identities = 9/42 (21%), Positives = 22/42 (52%), Gaps = 2/42 (4%)
Query: 25 KKEKKKKEKKAKK--KEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K++K++ KK ++ KK+ ++ K KK ++ ++
Sbjct: 92 KEDKEQARKKELAEGEKLKKKLQELWAKLAKKADEEYREQVE 133
Score = 28.1 bits (63), Expect = 8.9
Identities = 10/45 (22%), Positives = 26/45 (57%)
Query: 18 LDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ K+ +K++ + +K KKK ++ K +K ++ +++ +K N
Sbjct: 92 KEDKEQARKKELAEGEKLKKKLQELWAKLAKKADEEYREQVEKLN 136
>gnl|CDD|222932 PHA02809, PHA02809, hypothetical protein; Provisional.
Length = 111
Score = 28.5 bits (63), Expect = 3.4
Identities = 10/61 (16%), Positives = 19/61 (31%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
K K K + K K + + +N ++ ++ NN+EK
Sbjct: 42 KPNKEKNNFLDNGYDESFIKPNKVSYDETNIDKGQKIEQTDNEKQLLSDFSDILNNDEKN 101
Query: 91 T 91
Sbjct: 102 H 102
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 29.9 bits (67), Expect = 3.5
Identities = 16/60 (26%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-KEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
++A K++ ++E K K +E+K+E K E++ KK+ + KK+ N ++ + ++ N
Sbjct: 55 REAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEE 114
>gnl|CDD|223436 COG0359, RplI, Ribosomal protein L9 [Translation, ribosomal
structure and biogenesis].
Length = 148
Score = 29.1 bits (66), Expect = 3.5
Identities = 12/40 (30%), Positives = 18/40 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
A K K E + K EKK ++ E + K+K + K
Sbjct: 37 VPATKGNLKLLEARRAKLEKKAAEELAEAEALKEKLEGKT 76
>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive.
Pullulan is an unusual, industrially important
polysaccharide in which short alpha-1,4 chains
(maltotriose) are connected in alpha-1,6 linkages.
Enzymes that cleave alpha-1,6 linkages in pullulan and
release maltotriose are called pullulanases although
pullulan itself may not be the natural substrate. In
contrast, a glycogen debranching enzyme such GlgX,
homologous to this family, can release glucose at
alpha,1-6 linkages from glycogen first subjected to limit
degradation by phosphorylase. Characterized members of
this family include a surface-located pullulanase from
Streptococcus pneumoniae (PMID:11083842) and an
extracellular bifunctional amylase/pullulanase with
C-terminal pullulanase activity (PMID:8798645).
Length = 1111
Score = 30.2 bits (68), Expect = 3.5
Identities = 13/64 (20%), Positives = 23/64 (35%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
+ EK E + + K + K K K + +N + +++ N N
Sbjct: 1006 GIEAPEKTPPPPEHEPQAPKPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGETPKGN 1065
Query: 87 EEKG 90
EEK
Sbjct: 1066 EEKK 1069
>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
component [Carbohydrate transport and metabolism].
Length = 472
Score = 30.0 bits (68), Expect = 3.6
Identities = 10/42 (23%), Positives = 17/42 (40%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
K K + E + K ++ K E K + KN+ +N
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSN 377
Score = 29.6 bits (67), Expect = 5.2
Identities = 7/42 (16%), Positives = 17/42 (40%)
Query: 30 KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNN 71
K K + + + K ++ K + K + + N+ +N
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSN 377
Score = 29.2 bits (66), Expect = 6.4
Identities = 8/42 (19%), Positives = 18/42 (42%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
K K + ++ + K K+ K + K + + N+ +N
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSN 377
>gnl|CDD|227379 COG5046, MAF1, Protein involved in Mod5 protein sorting
[Posttranslational modification, protein turnover,
chaperones].
Length = 282
Score = 29.6 bits (66), Expect = 3.6
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTF 92
++K++ + N+NNN NN+N N++NNNN + N TF
Sbjct: 15 EQKRRISFSEYNSNNNTNNSNGNSSNNNNYSGPNGSSPATF 55
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 29.7 bits (67), Expect = 3.6
Identities = 10/46 (21%), Positives = 22/46 (47%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNN 76
+ +K K +K K+ KK+K+K+ N ++N +++
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPPFNMVSDNTDDSQ 264
Score = 29.3 bits (66), Expect = 5.3
Identities = 8/28 (28%), Positives = 14/28 (50%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
+K+K K K+ KK+K+K+
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
Score = 28.5 bits (64), Expect = 7.9
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
K+K KK KK KK+K+K++ N ++N +++
Sbjct: 226 KRKAPKKVAKKVAAAKKRKQKRSPPPPFNMVSDNTDDSQ 264
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 29.8 bits (67), Expect = 3.6
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 34 KAKKKEKKKEKKKKEKKKKKKKKK 57
+ K E K K K+ KK+KK+++K
Sbjct: 22 RPVKDEAKPRKIKRVKKRKKREEK 45
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 28.7 bits (65), Expect = 3.7
Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 6/41 (14%)
Query: 27 EKKKKEKKAKKKEKKKEKKKK---EKKKKKKK---KKKKKK 61
E+ E+K KK+K+ E KK + + K KK KK +K
Sbjct: 64 ERYDIEEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKS 104
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This
is a family of proteins conserved in fungi. The
function is not known.
Length = 436
Score = 29.8 bits (67), Expect = 3.7
Identities = 13/55 (23%), Positives = 20/55 (36%), Gaps = 9/55 (16%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKK---------KKEKKKKKKKKKKKKKNNNNNNN 68
K KK K+ A K + ++ + EK K KK K+ + N
Sbjct: 11 EKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNP 65
>gnl|CDD|222959 PHA02999, PHA02999, Hypothetical protein; Provisional.
Length = 382
Score = 29.7 bits (67), Expect = 3.7
Identities = 20/69 (28%), Positives = 27/69 (39%), Gaps = 4/69 (5%)
Query: 5 TKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKK----KEKKKEKKKKEKKKKKKKKKKKK 60
TK QI + +D KK K K+K KK ++ E KK K KK +K
Sbjct: 88 TKKQISSEENTMKMDKKKYKLKKKIKKVREINDTIRFAHYDYETIKKGYIKFKKMIRKDI 147
Query: 61 KNNNNNNNN 69
+ N
Sbjct: 148 ATSIPNICQ 156
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 29.6 bits (67), Expect = 3.8
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K +K+++ KK K E++ K K K+ KKK
Sbjct: 268 GKIDKEEEAKKQAKAERRAANAPKPGAKPKRSKKKAP 304
>gnl|CDD|150240 pfam09495, DUF2462, Protein of unknown function (DUF2462). This
protein is highly conserved, but its function is
unknown. It can be isolated from HeLa cell nucleoli and
is found to be homologous with Leydig cell tumour
protein whose function is unknown.
Length = 82
Score = 27.8 bits (62), Expect = 3.8
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K + KK++K KK + K K+ + + +K KK
Sbjct: 11 KGPAAVKAGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKV 50
Score = 26.6 bits (59), Expect = 9.4
Identities = 16/47 (34%), Positives = 21/47 (44%), Gaps = 5/47 (10%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK-----KEKKKKKKKKKKKKKN 62
K K K KA KK++K KK K KK + + +K KK
Sbjct: 4 GALKVKAKGPAAVKAGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKV 50
>gnl|CDD|225180 COG2271, UhpC, Sugar phosphate permease [Carbohydrate transport and
metabolism].
Length = 448
Score = 29.9 bits (68), Expect = 3.8
Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 5/48 (10%)
Query: 394 DTFHLWGTLYFYAAISVVGTLYMYFVM---PETEGRTLRDIEEHFADK 438
YF I+++ L + F++ P++EG L IEE+ D
Sbjct: 180 FHGGWRAAFYFPGIIAIIVALILLFLLRDRPQSEG--LPPIEEYRGDP 225
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 27.9 bits (63), Expect = 3.9
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 37 KKEKKKEKKKKEKKKKKKKKKKKKK 61
K+E++K+KKK KK + ++ KKK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
Score = 27.6 bits (62), Expect = 5.3
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKK 47
K+E++K +KK KK + +E KKK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKK 91
Score = 27.6 bits (62), Expect = 5.7
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK 45
K+ ++K+KKK KKA+ +E KK+
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain. This alpha
helical domain is found in a set of bacterial plasmid
replication proteins. The domain is found to the
C-terminus of the primase/polymerase domain. Mutants of
this domain are defective in template binding,
dinucleotide formation and conformation change prior to
DNA extension.
Length = 135
Score = 28.6 bits (64), Expect = 3.9
Identities = 17/52 (32%), Positives = 28/52 (53%)
Query: 14 EPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
E + D++K K++ K K K E +E+ K+ KK ++ K+KKK N
Sbjct: 7 EDSEEDFEKLKEEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEKKKAILN 58
>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
protein; Provisional.
Length = 230
Score = 29.6 bits (66), Expect = 4.0
Identities = 12/74 (16%), Positives = 25/74 (33%), Gaps = 2/74 (2%)
Query: 7 HQIGTFTEPTPLDWK--KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
Q F + L K + K KK E +EK + ++K + +K +
Sbjct: 101 LQPEVFVQYVELAKKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPE 160
Query: 65 NNNNNNNNNNNNNN 78
+ +++
Sbjct: 161 LSEEKSDSELETQP 174
Score = 28.4 bits (63), Expect = 8.2
Identities = 14/82 (17%), Positives = 23/82 (28%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFLYKMA 97
K+ + + K KK + N N + N + +EEK +
Sbjct: 114 KKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPELSEEKSDSELETQ 173
Query: 98 APLVLVLTYVAEITQPHLRGML 119
+ I L ML
Sbjct: 174 PQKTQLKEKKPSIEHIDLSKML 195
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A
peptide from the central, syntaxin binding domain of
synaphin competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal
exocytosis by promoting interaction between the
complementary syntaxin and synaptobrevin transmembrane
regions that reside in opposing membranes prior to
fusion.
Length = 139
Score = 28.7 bits (64), Expect = 4.0
Identities = 8/36 (22%), Positives = 24/36 (66%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KE + E A++++++ ++ +E ++++K K +K +
Sbjct: 23 KEDEGDESDAEEEDEEIQEALREAEEERKAKHRKME 58
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 27.6 bits (62), Expect = 4.0
Identities = 7/36 (19%), Positives = 21/36 (58%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
+KK + + K++KK +K ++ K +++ +++
Sbjct: 23 RKKDRRNPRVKKRKKYEKAVKRLKSQRRVVREEDGG 58
Score = 26.8 bits (60), Expect = 6.4
Identities = 12/43 (27%), Positives = 19/43 (44%), Gaps = 2/43 (4%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKK--KKEKKKKKKKKKKKKK 61
KK KK EK K K++++ ++E E K + K
Sbjct: 33 KKRKKYEKAVKRLKSQRRVVREEDGGYGGETTGIKTNVVRSVK 75
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 29.6 bits (67), Expect = 4.0
Identities = 12/41 (29%), Positives = 18/41 (43%)
Query: 13 TEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKK 53
P L E+ +K+E K++KKK+ KK K
Sbjct: 334 PPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374
Score = 29.6 bits (67), Expect = 4.3
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNN 66
K+K+E KK+KKKK KK N
Sbjct: 355 PKRKREGDKKQKKKKSKKLKLTFN 378
>gnl|CDD|238037 cd00084, HMG-box, High Mobility Group (HMG)-box is found in a
variety of eukaryotic chromosomal proteins and
transcription factors. HMGs bind to the minor groove of
DNA and have been classified by DNA binding
preferences. Two phylogenically distinct groups of
Class I proteins bind DNA in a sequence specific
fashion and contain a single HMG box. One group
(SOX-TCF) includes transcription factors, TCF-1, -3,
-4; and also SRY and LEF-1, which bind four-way DNA
junctions and duplex DNA targets. The second group
(MATA) includes fungal mating type gene products MC,
MATA1 and Ste11. Class II and III proteins (HMGB-UBF)
bind DNA in a non-sequence specific fashion and contain
two or more tandem HMG boxes. Class II members include
non-histone chromosomal proteins, HMG1 and HMG2, which
bind to bent or distorted DNA such as four-way DNA
junctions, synthetic DNA cruciforms, kinked
cisplatin-modified DNA, DNA bulges, cross-overs in
supercoiled DNA, and can cause looping of linear DNA.
Class III members include nucleolar and mitochondrial
transcription factors, UBF and mtTF1, which bind
four-way DNA junctions.
Length = 66
Score = 27.2 bits (61), Expect = 4.0
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKK 47
WK ++EKKK E+KA+K +++ EK+
Sbjct: 39 WKSLSEEEKKKYEEKAEKDKERYEKEMP 66
>gnl|CDD|233674 TIGR01992, PTS-IIBC-Tre, PTS system, trehalose-specific IIBC
component. This model represents the fused enzyme II B
and C components of the trehalose-specific PTS sugar
transporter system. Trehalose is converted to
trehalose-6-phosphate in the process of translocation
into the cell. These transporters lack their own IIA
domains and instead use the glucose IIA protein (IIAglc
or Crr). The exceptions to this rule are Staphylococci
and Streptococci which contain their own A domain as a
C-terminal fusion. This family is closely related to the
sucrose transporting PTS IIBC enzymes and the B and C
domains of each are described by subfamily-domain level
TIGRFAMs models (TIGR00826 and TIGR00852, respectively).
In E. coli, B. subtilis and P. fluorescens the presence
of this gene is associated with the presence of
trehalase which degrades T6P to glucose and glucose-6-P.
Trehalose may also be transported (in Salmonella) via
the mannose PTS or galactose permease systems, or (in
Sinorhizobium, Thermococcus and Sulfolobus, for
instance) by ABC transporters.
Length = 462
Score = 29.7 bits (67), Expect = 4.0
Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 5/62 (8%)
Query: 334 VPTVFLVLAAFLTHICIRLLPWMLIGEVFPNNIRATASGASGSSSYIFAFAVNKLYYPML 393
VP V L++ FL H I + + IG + + A + A+ IF LY P++
Sbjct: 263 VPPVSLLVTGFLAHAIIGPIGRL-IGNGITSGVTALFTSAAWLGGAIFGL----LYAPLV 317
Query: 394 DT 395
T
Sbjct: 318 IT 319
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 29.6 bits (67), Expect = 4.2
Identities = 10/39 (25%), Positives = 25/39 (64%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
K A++ KK++E++ ++K+ KE + E+ ++K++
Sbjct: 332 KAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQKERNP 370
Score = 29.2 bits (66), Expect = 5.4
Identities = 10/46 (21%), Positives = 25/46 (54%), Gaps = 5/46 (10%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
+ KA ++ KK+E+ E+++ KK ++ + ++ ++K N
Sbjct: 330 FSKAAEENIKKEEE-----EEERRKKLVKETTEYEQSSSRQKERNP 370
>gnl|CDD|236610 PRK09669, PRK09669, putative symporter YagG; Provisional.
Length = 444
Score = 29.7 bits (67), Expect = 4.2
Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 112 QPHLRGMLSATASMTTIFGTV-SQLFLGSFLHWRSAAILNLLFPILALCALYFIPESPHW 170
+P L + T + +FG + S+ LG F R+ + F IL+ ++FIP S W
Sbjct: 262 RPDLATLFLVTGMIAGLFGALLSERLLGKFDRVRAFKWTIVAFVILSA-LIFFIPPSNVW 320
Query: 171 LI 172
LI
Sbjct: 321 LI 322
>gnl|CDD|99903 cd06080, MUM1_like, Mutated melanoma-associated antigen 1 (MUM-1)
is a melanoma-associated antigen (MAA). MUM-1 belongs
to the mutated or aberrantly expressed type of MAAs,
along with antigens such as CDK4, beta-catenin,
gp100-in4, p15, and N-acetylglucosaminyltransferase V.
It is highly expressed in several types of human
cancers. The PWWP domain, named for a conserved
Pro-Trp-Trp-Pro motif, is a small domain consisting of
100-150 amino acids. The PWWP domain is found in
numerous proteins that are involved in cell division,
growth and differentiation. Most PWWP-domain proteins
seem to be nuclear, often DNA-binding, proteins that
function as transcription factors regulating a variety
of developmental processes.
Length = 80
Score = 27.4 bits (61), Expect = 4.2
Identities = 8/30 (26%), Positives = 15/30 (50%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
+KK + K+ K + +K+K K K+
Sbjct: 43 EKKGIRVVKRWLKHFDCTEKQKLTNKAKES 72
>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
Length = 128
Score = 28.6 bits (65), Expect = 4.2
Identities = 10/14 (71%), Positives = 10/14 (71%)
Query: 49 KKKKKKKKKKKKKN 62
KKKK KKK KKN
Sbjct: 1 AKKKKTVKKKVKKN 14
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 29.3 bits (66), Expect = 4.4
Identities = 5/36 (13%), Positives = 20/36 (55%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
+K K++E+++ ++++E + ++ + K+
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQ 154
Score = 28.9 bits (65), Expect = 6.1
Identities = 9/35 (25%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 26 KEKKKKEKKAKKK-EKKKEKKKKEKKKKKKKKKKK 59
K+K+++E++ +++ E E ++ + K+K K+++K
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQK 155
>gnl|CDD|226703 COG4252, COG4252, Predicted transmembrane sensor domain [Signal
transduction mechanisms].
Length = 400
Score = 29.7 bits (67), Expect = 4.4
Identities = 18/89 (20%), Positives = 29/89 (32%), Gaps = 20/89 (22%)
Query: 266 IHAPLD---------PYFATLLLGVAELGGALLCVVLIHYTGKRPLA--LISTGGSAACF 314
+ A LD L + L G LL L + LA L G +
Sbjct: 307 LSALLDGRPLLPVWPDGAELLWIFAWSLLGGLLAWRLRS-PLRLLLAVGLALAGLLLISY 365
Query: 315 IVVAVYAQFHLSYGWDSPLVPTVFLVLAA 343
++ GW PL+P + ++ +
Sbjct: 366 LL--------FLAGWWIPLIPPLLALVGS 386
>gnl|CDD|226625 COG4145, PanF, Na+/panthothenate symporter [Coenzyme metabolism].
Length = 473
Score = 29.7 bits (67), Expect = 4.4
Identities = 21/100 (21%), Positives = 39/100 (39%), Gaps = 10/100 (10%)
Query: 252 MTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGG---ALLCVVLIHYTGKRPLALISTG 308
++ + T G I A P L A GG L +++ ++ A G
Sbjct: 366 ISMIATLIFGALLLIAALNPPEMIIWLNLFA-FGGLEAVFLWPLVLGLYWEKANA---AG 421
Query: 309 GSAACFIVVAVY---AQFHLSYGWDSPLVPTVFLVLAAFL 345
A+ + + +Y A + P+VP++ + + AFL
Sbjct: 422 ALASMIVGLVLYVLLAFLGIKILGFHPIVPSLLVSIIAFL 461
>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675). The
members of this family are sequences derived from
hypothetical plant proteins of unknown function. One
member of this family is annotated as a putative
RNA-binding protein, but no evidence was found to
support this.
Length = 283
Score = 29.5 bits (66), Expect = 4.5
Identities = 10/44 (22%), Positives = 26/44 (59%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNN 79
++E +K K+ + ++ + K+K+ +K N +N ++ ++ N N
Sbjct: 68 TEEEWRKRKEMQSLRRLEAKRKRSEKEYNGVSNGDDMDSINAAN 111
>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Large proteins, some contain Zn-finger domain; signature
gene for I-A subtype; also known as TM1802 family.
Length = 579
Score = 29.6 bits (67), Expect = 4.5
Identities = 19/69 (27%), Positives = 25/69 (36%), Gaps = 7/69 (10%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEK-------KKKEKKKKKK 54
D ++I + T KK KK +K KKE K K K EK K
Sbjct: 79 DYLPTNKITYLLKTTEPLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDI 138
Query: 55 KKKKKKKNN 63
+ K +N
Sbjct: 139 QDLKPGENV 147
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 28.3 bits (64), Expect = 4.5
Identities = 12/34 (35%), Positives = 26/34 (76%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKK 59
KE + K ++A+KK ++++K +KEK+++ K+ K +
Sbjct: 56 KENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAE 89
Score = 27.6 bits (62), Expect = 8.2
Identities = 15/49 (30%), Positives = 28/49 (57%), Gaps = 5/49 (10%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKK-----KKKKKKKKKKNNNN 65
+AK++ +KK ++ KK K+KE++ KE K K + +K ++K
Sbjct: 59 EAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEY 107
>gnl|CDD|70841 pfam07390, P30, Mycoplasma P30 protein. This family consists of
several P30 proteins which seem to be specific to
Mycoplasma agalactiae. P30 is a 30-kDa immunodominant
antigen and is known to be a transmembrane protein.
Length = 266
Score = 29.4 bits (65), Expect = 4.5
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 40 KKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN--NNNNNNNNNNNNNNEEKGTF 92
K E KKEK K K + K +N+ N+N+N N +N+ N++N+ + F
Sbjct: 25 KCSEDDKKEKVTKPKNEPTKPVDNSKTNDNSNEMVGETNLSNSINSSNSSTQNHF 79
Score = 28.6 bits (63), Expect = 7.3
Identities = 16/53 (30%), Positives = 27/53 (50%)
Query: 38 KEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKG 90
++ KKEK K K + K K N+N+N N +N+ N++N++ G
Sbjct: 28 EDDKKEKVTKPKNEPTKPVDNSKTNDNSNEMVGETNLSNSINSSNSSTQNHFG 80
>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
protein.
Length = 314
Score = 29.5 bits (66), Expect = 4.6
Identities = 16/77 (20%), Positives = 25/77 (32%), Gaps = 3/77 (3%)
Query: 8 QIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
Q+ +T + K K EK K ++ KKKK KK N +++
Sbjct: 26 QLDHYTSVSSS---KNAKSEKNLKRHSIIIPALSWKRLVASGSKKKKTSKKSTANLASSS 82
Query: 68 NNNNNNNNNNNNNNNNN 84
N N +
Sbjct: 83 YQKIVIQLNRENLKKSV 99
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 29.2 bits (65), Expect = 4.7
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
K A K E ++ +++ KK+K KKN + N
Sbjct: 26 KNALKNEDCATPQENDEQNPGKKRKNNKKNRSRKKCN 62
>gnl|CDD|223265 COG0187, GyrB, Type IIA topoisomerase (DNA gyrase/topo II,
topoisomerase IV), B subunit [DNA replication,
recombination, and repair].
Length = 635
Score = 29.8 bits (68), Expect = 4.8
Identities = 9/41 (21%), Positives = 19/41 (46%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ + +E KK +KA K +E +K ++ ++K
Sbjct: 361 FLEENPQEAKKIVEKAIAAAKAREAARKARELTRRKSALDI 401
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 29.0 bits (65), Expect = 4.9
Identities = 8/47 (17%), Positives = 17/47 (36%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
EK+ K+ K + ++ K+N N + + E+
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDS 48
>gnl|CDD|165332 PHA03035, PHA03035, hypothetical protein; Provisional.
Length = 158
Score = 28.6 bits (63), Expect = 4.9
Identities = 15/38 (39%), Positives = 18/38 (47%)
Query: 51 KKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEE 88
K KK KK KKNN NN N + N N N+ +
Sbjct: 104 KSKKNGKKNKKNNKLNNANPSGNGKRKKNTKKAANSTD 141
Score = 27.9 bits (61), Expect = 9.4
Identities = 14/40 (35%), Positives = 16/40 (40%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEKGT 91
K KK KK K NN NN N + N N N +
Sbjct: 104 KSKKNGKKNKKNNKLNNANPSGNGKRKKNTKKAANSTDDS 143
Score = 27.9 bits (61), Expect = 10.0
Identities = 13/45 (28%), Positives = 22/45 (48%)
Query: 34 KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNN 78
K+KK KK +K K K+KKN N+ +++++ N
Sbjct: 104 KSKKNGKKNKKNNKLNNANPSGNGKRKKNTKKAANSTDDSDHKQN 148
>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
module that binds membranes and detects membrane
curvature. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions including organelle
biogenesis, membrane trafficking or remodeling, and cell
division and migration. Mutations in BAR containing
proteins have been linked to diseases and their
inactivation in cells leads to altered membrane
dynamics. A BAR domain with an additional N-terminal
amphipathic helix (an N-BAR) can drive membrane
curvature. These N-BAR domains are found in amphiphysins
and endophilins, among others. BAR domains are also
frequently found alongside domains that determine lipid
specificity, such as the Pleckstrin Homology (PH) and
Phox Homology (PX) domains which are present in beta
centaurins (ACAPs and ASAPs) and sorting nexins,
respectively. A FES-CIP4 Homology (FCH) domain together
with a coiled coil region is called the F-BAR domain and
is present in Pombe/Cdc15 homology (PCH) family
proteins, which include Fes/Fes tyrosine kinases, PACSIN
or syndapin, CIP4-like proteins, and srGAPs, among
others. The Inverse (I)-BAR or IRSp53/MIM homology
Domain (IMD) is found in multi-domain proteins, such as
IRSp53 and MIM, that act as scaffolding proteins and
transducers of a variety of signaling pathways that link
membrane dynamics and the underlying actin cytoskeleton.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The I-BAR domain
induces membrane protrusions in the opposite direction
compared to classical BAR and F-BAR domains, which
produce membrane invaginations. BAR domains that also
serve as protein interaction domains include those of
arfaptin and OPHN1-like proteins, among others, which
bind to Rac and Rho GAP domains, respectively.
Length = 194
Score = 28.9 bits (65), Expect = 5.1
Identities = 12/43 (27%), Positives = 22/43 (51%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
KK +KK K + +EK K+ +KK+K K + +++
Sbjct: 97 KKRRKKLDKARLDYDAAREKLKKLRKKKKDSSKLAEAEEELQE 139
Score = 28.2 bits (63), Expect = 7.9
Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 2/50 (4%)
Query: 16 TPLD-WKKAKKKEKKKKEKKAKKKEKKKEK-KKKEKKKKKKKKKKKKKNN 63
PL + K KE KK+ KK K + ++K KK +KKKK K
Sbjct: 83 EPLKEYLKKDLKEIKKRRKKLDKARLDYDAAREKLKKLRKKKKDSSKLAE 132
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 29.1 bits (65), Expect = 5.2
Identities = 16/56 (28%), Positives = 27/56 (48%), Gaps = 6/56 (10%)
Query: 11 TFTEPTPLDWKKAKKKEKKK------KEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
F D K ++ + K ++KEKKK K+EK+ KKK++ K++
Sbjct: 64 VFVNSFEKDHKFIRRCKLSDADFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 29.8 bits (67), Expect = 5.2
Identities = 13/40 (32%), Positives = 29/40 (72%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
A +KE ++K + +K K++EK KKE +++ ++ K++++N
Sbjct: 522 ALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERN 561
>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
All proteins in this family for whcih functions are
known are G-T or G-U mismatch glycosylases that function
in base excision repair. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University). Used 2pf model [DNA metabolism,
DNA replication, recombination, and repair].
Length = 328
Score = 29.3 bits (65), Expect = 5.2
Identities = 14/55 (25%), Positives = 26/55 (47%)
Query: 6 KHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K + + P++ KK +K K K+K+K++K K K K+K + +
Sbjct: 61 KRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQEKITDKFKVKRKVDRFNGVSE 115
>gnl|CDD|218167 pfam04599, Pox_G5, Poxvirus G5 protein. This protein has been
predicted to be related to the FEN-1 endonuclease.
Length = 426
Score = 29.6 bits (67), Expect = 5.3
Identities = 14/48 (29%), Positives = 20/48 (41%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNN 86
E KKE KE +K K K K+ N+ + + NN +N
Sbjct: 351 EPKKEISVKEIRKLCKIIDKNKRYKKVTINDIKSVISIFGYEINNTSN 398
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 29.4 bits (67), Expect = 5.3
Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 5/40 (12%)
Query: 9 IGTFTE-----PTPLDWKKAKKKEKKKKEKKAKKKEKKKE 43
+GT++ P LD +K + K+KKK++K K + +K E
Sbjct: 716 VGTYSSIFIAAPLWLDLEKRRLKKKKKRKKVKKWEVEKIE 755
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 28.3 bits (64), Expect = 5.4
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+K E+++ EKK K+E ++ K E+ K K K
Sbjct: 78 AERKAEREEDEKKRLKRENERRKALGEEPLKSLDDLPKDK 117
>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
Length = 143
Score = 28.4 bits (64), Expect = 5.4
Identities = 6/25 (24%), Positives = 8/25 (32%)
Query: 65 NNNNNNNNNNNNNNNNNNNNNNEEK 89
N N +N N EE+
Sbjct: 3 FLLKKMYNKKKNRKSNIKENKWEER 27
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 29.7 bits (67), Expect = 5.4
Identities = 22/71 (30%), Positives = 28/71 (39%), Gaps = 17/71 (23%)
Query: 7 HQIGTFTEPTPL-----DWKKAKKKEK---------KKKEKKAKKKEKKKEKKKKEKKKK 52
H+IGT P PL D + +EK + E KK K+ KKK
Sbjct: 560 HKIGT---PEPLFKELKDEEVEAYREKFAGSQADRAARAEAAEAKKLAKQLKKKALSDGG 616
Query: 53 KKKKKKKKKNN 63
KKK+ KK
Sbjct: 617 KKKQGKKAGGG 627
Score = 29.0 bits (65), Expect = 8.9
Identities = 15/39 (38%), Positives = 19/39 (48%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+A+ E KK K+ KKK KKK+ KK K K
Sbjct: 594 RAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKA 632
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 28.6 bits (64), Expect = 5.4
Identities = 16/77 (20%), Positives = 32/77 (41%), Gaps = 18/77 (23%)
Query: 5 TKHQIGTF--TEPTPLDWKKAKKKEKKK----------------KEKKAKKKEKKKEKKK 46
T QI F +E L ++ K+E + E A +KE +++
Sbjct: 50 TDEQIEIFRHSEIHALLRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAA 109
Query: 47 KEKKKKKKKKKKKKKNN 63
+++K + K++K K+
Sbjct: 110 QKRKSCRDKERKSAKDP 126
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 29.5 bits (67), Expect = 5.5
Identities = 11/42 (26%), Positives = 23/42 (54%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+KA++ ++ + ++KE KE++ + K K KK K+
Sbjct: 241 QKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKS 282
>gnl|CDD|232939 TIGR00360, ComEC_N-term, ComEC/Rec2-related protein. The related
model ComEC_Rec2 (TIGR00361) describes a set of proteins
of ~ 700-800 residues, one each from a number of
different species, of which most can become competent
for natural transformation with exogenous DNA. The
best-studied examples are ComEC from Bacillus subtilis
and Rec-2 from Haemophilus influenzae, where the protein
appears to form part of the DNA import structure. This
model represents a region found in full-length
ComEC/Rec2 and shorter homologs of unknown function from
large number of additional bacterial species, most of
which are not known to become competent for
transformation (an exception is Helicobacter pylori)
[Unknown function, General].
Length = 171
Score = 28.5 bits (64), Expect = 5.5
Identities = 12/71 (16%), Positives = 22/71 (30%), Gaps = 7/71 (9%)
Query: 134 QLFLGSFLHWRSAAILNLLFPILALCALYFIPESPHWLISQGRMQEASASLCWLRGWVTP 193
F S L L ++ L +P L+S G A+ + +
Sbjct: 65 LAFKLSLRKLNLIGALLLSAIVILLM-------NPVALLSFGFQLSFLATFGLVVMFPNF 117
Query: 194 DKVQTELSQIT 204
++ LS +
Sbjct: 118 QQLLRPLSSLI 128
>gnl|CDD|235847 PRK06655, flgD, flagellar basal body rod modification protein;
Reviewed.
Length = 225
Score = 29.1 bits (66), Expect = 5.5
Identities = 3/32 (9%), Positives = 16/32 (50%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
N +++ + + ++++++ + + FL
Sbjct: 5 NTVGSSSTSGLSTTSSSSSSKKGAADLQDDFL 36
>gnl|CDD|214998 smart01054, CaM_binding, Plant calmodulin-binding domain. The
sequences featured in this family are found repeated in
a number of plant calmodulin-binding proteins, and are
thought to constitute the calmodulin-binding domains..
Binding of the proteins to calmodulin depends on the
presence of calcium ions.. These proteins are thought
to be involved in various processes, such as plant
defence responses.and stolonisation or tuberization.
Length = 115
Score = 28.1 bits (63), Expect = 5.5
Identities = 9/46 (19%), Positives = 20/46 (43%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K K + ++ +K K +K K+ K + K++ ++ N
Sbjct: 9 RGKIEDPKPEGERPRKWKFLKKVILLKRFVKSLEDKRRLNPREPNV 54
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 28.9 bits (65), Expect = 5.6
Identities = 11/51 (21%), Positives = 23/51 (45%)
Query: 32 EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNN 82
+KK +++K+ +K +EK+ + K + N N+ N+ N
Sbjct: 146 DKKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKN 196
Score = 28.5 bits (64), Expect = 8.7
Identities = 8/47 (17%), Positives = 19/47 (40%)
Query: 43 EKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
E++K++KK ++K+ + N N+ N+ +
Sbjct: 151 EQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNA 197
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function
prediction only].
Length = 299
Score = 29.3 bits (65), Expect = 5.8
Identities = 9/40 (22%), Positives = 20/40 (50%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K AKK+++ K+ +++ ++K K K + K +
Sbjct: 5 KMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAI 44
Score = 28.9 bits (64), Expect = 6.2
Identities = 10/36 (27%), Positives = 20/36 (55%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K K + K+ E K++ EKK+K + ++K++
Sbjct: 35 KNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEEKRR 70
>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
Length = 362
Score = 29.4 bits (66), Expect = 5.9
Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 11/88 (12%)
Query: 21 KKAKKKEKK--KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN---------NN 69
+KAK+ + +KA K +K+ +K + + K+ + + N N +
Sbjct: 252 RKAKRDPNGTFQVARKAAMKRRKRNRKLRARNAKELAAMRMEANQIRRNEPRARMLMGDK 311
Query: 70 NNNNNNNNNNNNNNNNNEEKGTFLYKMA 97
+ +N +NNN + EE+ ++MA
Sbjct: 312 HWHNAWLHNNNLDFAKLEERAALAWQMA 339
>gnl|CDD|150787 pfam10154, DUF2362, Uncharacterized conserved protein (DUF2362).
This is a family of proteins conserved from nematodes to
humans. The function is not known.
Length = 501
Score = 29.4 bits (66), Expect = 5.9
Identities = 10/39 (25%), Positives = 19/39 (48%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+K E ++ E K+ ++ K + + NN +N N N
Sbjct: 132 KKLNELQQSEMNKQMQELGKSLTDQDVNNLAAQHNENQN 170
Score = 29.4 bits (66), Expect = 6.4
Identities = 7/27 (25%), Positives = 14/27 (51%)
Query: 54 KKKKKKKKNNNNNNNNNNNNNNNNNNN 80
K+ ++ K+ + + NN +N N N
Sbjct: 144 KQMQELGKSLTDQDVNNLAAQHNENQN 170
Score = 29.4 bits (66), Expect = 6.5
Identities = 7/27 (25%), Positives = 13/27 (48%)
Query: 52 KKKKKKKKKKNNNNNNNNNNNNNNNNN 78
K+ ++ K + + NN +N N N
Sbjct: 144 KQMQELGKSLTDQDVNNLAAQHNENQN 170
Score = 29.0 bits (65), Expect = 6.6
Identities = 7/27 (25%), Positives = 13/27 (48%)
Query: 53 KKKKKKKKKNNNNNNNNNNNNNNNNNN 79
K+ ++ K + + NN +N N N
Sbjct: 144 KQMQELGKSLTDQDVNNLAAQHNENQN 170
>gnl|CDD|238558 cd01138, FeuA, Periplasmic binding protein FeuA. These proteins
have predicted to function as initial receptors in ABC
transport of metal ions in some eubacterial species.
They belong to the TroA superfamily of periplasmic metal
binding proteins that share a distinct fold and ligand
binding mechanism. A typical TroA protein is comprised
of two globular subdomains connected by a single helix
and can bind their ligands in the cleft between these
domains.
Length = 248
Score = 28.8 bits (65), Expect = 6.0
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 31 KEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KE +A+K ++K KE K+K KKK K
Sbjct: 112 KEDEAEKWLADYKQKAKEAKEKIKKKLGNDK 142
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 28.7 bits (64), Expect = 6.0
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKK 46
+K KKK + K+EKKA K+EK KE +
Sbjct: 94 EKEKKKSRTKEEKKALKEEKDKEAEP 119
>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Missing In
Metastasis. The IMD domain, also called
Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
dimerization and lipid-binding module that bends
membranes and induces membrane protrusions. Members of
this subfamily include missing in metastasis (MIM) or
metastasis suppressor 1 (MTSS1), metastasis suppressor
1-like (MTSSL) or ABBA (Actin-Bundling protein with
BAIAP2 homology), and similar proteins. They contain an
N-terminal IMD and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus. MIM was originally identified
as a missing transcript from metastatic bladder and
prostate cancer cells. It is a scaffold protein that
functions in a signaling pathway between the PDGF
receptor, Src kinases, and actin assembly. It may also
function as a cofactor of the Sonic hedgehog (Shh)
transcriptional pathway and may participate in tumor
development and progression via this pathway. ABBA
regulates actin and plasma membrane dynamics to promote
the extension of radial glia, which is important in
neuronal migration, axon guidance and neurogenesis. The
IMD domain of MIM binds and bundles actin filaments,
binds membranes, and interacts with the small GTPase
Rac.
Length = 231
Score = 29.0 bits (65), Expect = 6.1
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
+WKK + K K+ KK ++ +KK + + +KK +K K
Sbjct: 108 EWKKVANQLDKDHAKEYKKARQEIKKKSSDTIRLQKKARKGK 149
>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
protein.
Length = 342
Score = 29.0 bits (65), Expect = 6.1
Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 12/56 (21%)
Query: 19 DWKKA------KKKEKKKKEKKAKKKEKK---KEKKKKEKKK---KKKKKKKKKKN 62
DW+K KK K K A +EK +++++E + K+ K+N
Sbjct: 267 DWQKKVPPPVPKKPAKGPIVKPAISREKSLDSSDRQRQEARSRLAAAKRAASFKQN 322
>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
TRaffic INducer (NOSTRIN). F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Nitric Oxide Synthase TRaffic
INducer (NOSTRIN) is expressed in endothelial and
epithelial cells and is involved in the regulation,
trafficking and targeting of endothelial NOS (eNOS).
NOSTRIN facilitates the endocytosis of eNOS by
coordinating the functions of dynamin and the
Wiskott-Aldrich syndrome protein (WASP). Increased
expression of NOSTRIN may be correlated to preeclampsia.
NOSTRIN contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
The F-BAR domain of NOSTRIN is necessary and sufficient
for its membrane association and is responsible for its
subcellular localization.
Length = 239
Score = 28.9 bits (65), Expect = 6.3
Identities = 13/49 (26%), Positives = 24/49 (48%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
DW+ + K KKK A++ EK +++ + K+ K+K K +
Sbjct: 120 DWRSEQIKVKKKLHGLARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEV 168
>gnl|CDD|129660 TIGR00569, ccl1, cyclin ccl1. All proteins in this family for
which functions are known are cyclins that are
components of TFIIH, a complex that is involved in
nucleotide excision repair and transcription initiation.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, StanfordUniversity) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 305
Score = 29.0 bits (65), Expect = 6.4
Identities = 11/62 (17%), Positives = 25/62 (40%), Gaps = 4/62 (6%)
Query: 13 TEPTPLDWKKAKKKEKKKKE----KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
P +D + + KK E +K +KK ++ ++ K +K+K ++
Sbjct: 240 ELPQLIDIMRELRILVKKYEEPRSEKVAALKKKLDECHSIEENLSSVKIRKRKGYEDDTY 299
Query: 69 NN 70
+
Sbjct: 300 IS 301
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 29.2 bits (66), Expect = 6.6
Identities = 13/42 (30%), Positives = 19/42 (45%)
Query: 20 WKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+A + K K+KK E K+KK K + KK+K
Sbjct: 571 KVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEAKKRK 612
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in
surface proteins of a number of Firmutes. Many members
have LPXTG C-terminal anchoring motifs and a
substantial number have the KxYKxGKxW putative sorting
signal at the N-terminus. The tetracycline resistance
plasmid pCF10 in Enterococcus faecalis promotes
conjugal plasmid transfer in response to sex
pheromones, but PgrA/Sec10 encoded by that plasmid, a
member of this family, specifically inhibits the
ability of cells to receive homologous plasmids. The
phenomenon is called surface exclusion.
Length = 356
Score = 28.9 bits (65), Expect = 6.6
Identities = 12/79 (15%), Positives = 20/79 (25%), Gaps = 11/79 (13%)
Query: 19 DWKKAKKKEKKKKEKKA-----------KKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNN 67
+ A+ K K K + K + K +
Sbjct: 8 ELNNAQAKVDKANALAGVNTITLPAGYNIDALKNIYESGDFDNSDAAKAEAFKAAAEAKS 67
Query: 68 NNNNNNNNNNNNNNNNNNN 86
NN +N + N + NN
Sbjct: 68 LNNYKSNAADKNRTVDINN 86
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 28.3 bits (64), Expect = 6.6
Identities = 7/48 (14%), Positives = 21/48 (43%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
L+ + + + K + K+K + + + K+ + K K+ + +
Sbjct: 61 LELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHAL 108
>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
(DMAP1). DNA methylation can contribute to
transcriptional silencing through several
transcriptionally repressive complexes, which include
methyl-CpG binding domain proteins (MBDs) and histone
deacetylases (HDACs). The chief enzyme that maintains
mammalian DNA methylation, DNMT1, can also establish a
repressive transcription complex. The non-catalytic
amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
DNMT1 associated protein), and can mediate
transcriptional repression. DMAP1 has intrinsic
transcription repressive activity, and binds to the
transcriptional co-repressor TSG101. DMAP1 is targeted
to replication foci through interaction with the far N
terminus of DNMT1 throughout S phase, whereas HDAC2
joins DNMT1 and DMAP1 only during late S phase,
providing a platform for how histones may become
deacetylated in heterochromatin following replication.
Length = 175
Score = 28.3 bits (63), Expect = 6.7
Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 15/53 (28%)
Query: 24 KKKEKKKKEKKAKKKEKKK---------------EKKKKEKKKKKKKKKKKKK 61
+K E +KKE++ K ++ +K E +K+EKK KKK +K
Sbjct: 10 RKIEARKKEREKKAQDLQKLITAADQQAAGFSTAEARKREKKLPKKKIPQKPG 62
>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1. RNA
polymerases catalyze the DNA dependent polymerisation of
RNA. Prokaryotes contain a single RNA polymerase
compared to three in eukaryotes (not including
mitochondrial. and chloroplast polymerases). This
domain, domain 1, represents the clamp domain, which a
mobile domain involved in positioning the DNA,
maintenance of the transcription bubble and positioning
of the nascent RNA strand.
Length = 330
Score = 28.8 bits (65), Expect = 7.0
Identities = 13/54 (24%), Positives = 16/54 (29%), Gaps = 2/54 (3%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKK--KKKKKKKKKKKNNNNNNNNNNNNNNNN 77
+ K K K K KK+ KK KKK K +N
Sbjct: 107 ESVKYFFLKVVIDPKGKNSKKRLKKINNLCKKKSICSKCGEDNGGLKAFEGCGK 160
>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 27.8 bits (62), Expect = 7.0
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 33 KKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNNNN 74
+ ++ +KE++K E ++K KK ++ N N N N
Sbjct: 2 RAQIEERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVNRNR 43
>gnl|CDD|223732 COG0659, SUL1, Sulfate permease and related transporters (MFS
superfamily) [Inorganic ion transport and metabolism].
Length = 554
Score = 29.2 bits (66), Expect = 7.1
Identities = 30/169 (17%), Positives = 53/169 (31%), Gaps = 51/169 (30%)
Query: 214 RLGKDGQRRPNYRMYMRRTFLLPYAIVT-------SLFFIGQFGGMTTLQTYAVGIFESI 266
RLG+ +R +P ++ L + Q + L + G + +
Sbjct: 118 RLGR----------LIR---FIPRPVLIGFTAGIAILIILTQLPVLLGLASKVSGFWAKV 164
Query: 267 HA----PLDPYFATLLLGVAELGGALLCVVLIHYTGKRP-------LALISTGGSAACFI 315
A L ATLLLG LL + ++ + + +AL+ I
Sbjct: 165 SALFTVLLTINLATLLLG-------LLTLAILLFLPRLTPRIPSPLIALV-----LGTLI 212
Query: 316 VVAVYAQFHLSYGWDSPLVPTVFLVLAAFLTHICIRLLPWMLIGEVFPN 364
V DS + L + L H + + L+ + P
Sbjct: 213 VWIF--------PLDSLRYGEIPGSLPSGLPHFRLPNVSLSLLLALLPY 253
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 28.1 bits (63), Expect = 7.1
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 26 KEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
KE K EKK K+E +K + +E++KK+ K +K
Sbjct: 69 KESLKIEKKELKQEVEKLPRFQEQEKKRMKAEK 101
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 29.1 bits (66), Expect = 7.2
Identities = 11/49 (22%), Positives = 24/49 (48%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNN 63
PT L ++ +K +K EK + +KK+K++ ++ K+ +
Sbjct: 131 PTRLLVEQVVEKLEKFGEKVGCGVKILYYHSSLKKKEKEEFLERLKEGD 179
>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
Length = 160
Score = 28.0 bits (63), Expect = 7.2
Identities = 7/26 (26%), Positives = 12/26 (46%)
Query: 36 KKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+ + K E KKK + K++
Sbjct: 106 LKRVAANDAIKAEAKKKGELPSTKRQ 131
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 29.0 bits (66), Expect = 7.2
Identities = 7/33 (21%), Positives = 19/33 (57%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
+ + +++ K++ + K+ E+K+KK +K
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved
region at the N-terminal end of a family of proteins
referred to as CIRs (CBF1-interacting co-repressors).
CBF1 (centromere-binding factor 1) acts as a
transcription factor that causes repression by binding
specifically to GTGGGAA motifs in responsive promoters,
and it requires CIR as a co-repressor. CIR binds to
histone deacetylase and to SAP30 and serves as a linker
between CBF1 and the histone deacetylase complex.
Length = 37
Score = 25.7 bits (57), Expect = 7.2
Identities = 8/30 (26%), Positives = 20/30 (66%)
Query: 28 KKKKEKKAKKKEKKKEKKKKEKKKKKKKKK 57
K +EK + ++K E++KK ++ +K+ ++
Sbjct: 7 MKNQEKVWEAEQKALEEQKKIEELRKEIEE 36
>gnl|CDD|224756 COG1843, FlgD, Flagellar hook capping protein [Cell motility and
secretion].
Length = 222
Score = 28.5 bits (64), Expect = 7.3
Identities = 6/32 (18%), Positives = 15/32 (46%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNNEEKGTFL 93
N + + + N+ ++ ++N + K FL
Sbjct: 4 NMTSPTSASTITTNSKTDSTSSNASLGKDDFL 35
>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752). This
domain family is found in eukaryotes, and is typically
between 140 and 163 amino acids in length.
Length = 148
Score = 28.1 bits (63), Expect = 7.3
Identities = 14/58 (24%), Positives = 30/58 (51%)
Query: 7 HQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+ G + KE +K K+ +K++ K ++ +KK+KKKKK+++ ++
Sbjct: 53 VEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKEEENDDP 110
Score = 27.8 bits (62), Expect = 8.1
Identities = 18/51 (35%), Positives = 30/51 (58%)
Query: 16 TPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
KA K++K+ K + EKK+ K E+ +KK+KKKKK++ N++
Sbjct: 60 PAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKEEENDDP 110
>gnl|CDD|235848 PRK06664, fliD, flagellar hook-associated protein FliD; Validated.
Length = 661
Score = 29.0 bits (65), Expect = 7.3
Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 8/60 (13%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKK--------KKKKKKKNNNNNNNNNNNN 73
K K +++ + K +E +K+ + KE+K K K KK K++ N N N + N
Sbjct: 601 KVKGLDERIADNNKKIEEYEKKLESKERKLKGKYLTMDQTVKKMKEQSNYLKNFNKSGRN 660
>gnl|CDD|219630 pfam07901, DUF1672, Protein of unknown function (DUF1672). This
family is composed of hypothetical bacterial proteins of
unknown function.
Length = 277
Score = 28.6 bits (64), Expect = 7.4
Identities = 22/87 (25%), Positives = 34/87 (39%), Gaps = 13/87 (14%)
Query: 19 DWKKAKKKEKKKKEKKAK----------KKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNN 68
++KK K+ KK+ + KA K K+ + ++ KKKK N
Sbjct: 188 EFKKGMKQAKKQLDYKANTEVVTTLFSTKDNFTKDNTVDDVIDLSEELKKKKNIPENTTI 247
Query: 69 N---NNNNNNNNNNNNNNNNNEEKGTF 92
+ N N N ++NN E G F
Sbjct: 248 TLQLSKNKINTKRPNYSDNNPIEYGVF 274
>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated
2). NSA2 appears to be a protein required for the
maturation of 27S pre-rRNA in yeast; it has been
characterized in mammalian cells as a nucleolar protein
that might play a role in the regulation of the cell
cycle and in cell proliferation.
Length = 257
Score = 28.7 bits (65), Expect = 7.4
Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 15/62 (24%)
Query: 21 KKAKK--KEKKKKEKKAKK----------KEKKKEK---KKKEKKKKKKKKKKKKKNNNN 65
+K KK +E K+ KKA+K K++ KEK KK K +++ K+K +
Sbjct: 20 RKRKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKIQMKKTIKMHEERNVKQKVDDKVP 79
Query: 66 NN 67
Sbjct: 80 EG 81
>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
Length = 147
Score = 28.1 bits (64), Expect = 7.5
Identities = 4/31 (12%), Positives = 14/31 (45%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
K K+ + ++ + K ++ + + +K
Sbjct: 41 KGNLKQLEARRAELEAKAAEELAEAEALAEK 71
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 29.2 bits (66), Expect = 7.5
Identities = 12/57 (21%), Positives = 21/57 (36%), Gaps = 2/57 (3%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKK--KKKKKKKKKKKNNNNNNNNNNNNNN 75
K+ K E K + +E K+ EK+ + KKK + + +N
Sbjct: 718 KEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAKAEEIGG 774
>gnl|CDD|215856 pfam00318, Ribosomal_S2, Ribosomal protein S2.
Length = 205
Score = 28.3 bits (64), Expect = 7.7
Identities = 14/47 (29%), Positives = 20/47 (42%)
Query: 10 GTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
GT T + + +E KKE K+E +K +K K KK
Sbjct: 91 GTLTNWKTIKKSIKRLEELSKKEALKLKRELEKLEKYLGGIKNMKKL 137
>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
[Posttranslational modification, protein turnover,
chaperones].
Length = 205
Score = 28.5 bits (64), Expect = 7.7
Identities = 8/38 (21%), Positives = 17/38 (44%)
Query: 24 KKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
++ E++ ++ K+ K E K +K K+K
Sbjct: 58 QRAEQRLQQAAKAKRAKDAAANAAEGKAFLEKNAKEKG 95
>gnl|CDD|193106 pfam12628, Inhibitor_I71, Falstatin, cysteine peptidase inhibitor.
This family of peptidase inhibitors is expressed from
plasmodial protozoal species. Falstatin is found to be a
potent reversible inhibitor of the P. falciparum
cysteine proteases falcipain-2 and falcipain-3, as well
as other parasite- and non-parasite-derived cysteine
proteases, but is only a relatively weak inhibitor of
the P. falciparum cysteine proteases falcipain-1 and
dipeptidyl aminopeptidase 1. Thus, P. falciparum
requires expression of falstatin to limit proteolysis by
certain host or parasite cysteine proteases during
erythrocyte invasion.
Length = 173
Score = 28.3 bits (63), Expect = 7.7
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 57 KKKKKNNNNNNNNNNNNNNNNNNNNNNNNNEEK 89
K K N N+++ N NNN +++ N ++N K
Sbjct: 91 KPIKINENDSSKNVINNNQSSSQNETDSNKPPK 123
>gnl|CDD|227520 COG5193, LHP1, La protein, small RNA-binding pol III transcript
stabilizing protein and related La-motif-containing
proteins involved in translation [Posttranslational
modification, protein turnover, chaperones /
Translation, ribosomal structure and biogenesis].
Length = 438
Score = 28.9 bits (64), Expect = 7.7
Identities = 15/78 (19%), Positives = 23/78 (29%), Gaps = 3/78 (3%)
Query: 16 TPLDWKKAKKKEKKKKEK---KAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNNNNN 72
P K K+ E +K + K K K K KK+ K N
Sbjct: 64 NPSKLTSNTKWTLKQVEFYFSGSKDTDSNFPKDKFLKTTAPKNKKRDKWVPIKTIATFNR 123
Query: 73 NNNNNNNNNNNNNNEEKG 90
N+ + + + K
Sbjct: 124 MKNSGSPVSAVSGALRKS 141
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 29.1 bits (65), Expect = 7.7
Identities = 9/32 (28%), Positives = 19/32 (59%)
Query: 25 KKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKK 56
++E++ +E+ + K +K + K E K+K K
Sbjct: 440 QEERRLREEAIRAKAEKTARMKAEMKEKTLKM 471
>gnl|CDD|223624 COG0550, TopA, Topoisomerase IA [DNA replication, recombination,
and repair].
Length = 570
Score = 29.1 bits (66), Expect = 7.8
Identities = 12/61 (19%), Positives = 27/61 (44%), Gaps = 4/61 (6%)
Query: 2 DSSTKHQIGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
+ + G F+ + + K++ + + K + +E + K K K +KK KK+
Sbjct: 199 KAIFEKGGGEFS----ARLTEIEGKKEGRLKDKDEAEEIVNKLKGKPAKVVSVEKKPKKR 254
Query: 62 N 62
+
Sbjct: 255 S 255
>gnl|CDD|234582 PRK00028, infC, translation initiation factor IF-3; Reviewed.
Length = 177
Score = 28.2 bits (64), Expect = 7.8
Identities = 14/24 (58%), Positives = 18/24 (75%), Gaps = 3/24 (12%)
Query: 25 KKEKKKKEKKAKKKEKK---KEKK 45
K E+KKK+K+AKKK+K KE K
Sbjct: 72 KYEQKKKQKEAKKKQKVIEVKEIK 95
>gnl|CDD|181784 PRK09334, PRK09334, 30S ribosomal protein S25e; Provisional.
Length = 86
Score = 26.9 bits (60), Expect = 8.0
Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 5/38 (13%)
Query: 21 KKAKKKEKKKKEKKAKKK-----EKKKEKKKKEKKKKK 53
KK KKKE K EK+ K E+ ++ KE KK+K
Sbjct: 3 KKKKKKEGSKTEKEIKSTIITLDEELLKRVAKEVKKEK 40
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 28.9 bits (65), Expect = 8.1
Identities = 13/41 (31%), Positives = 20/41 (48%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K K ++ E A+ KE KKE +KE + ++ K K
Sbjct: 64 AKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAALDAK 104
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 28.8 bits (65), Expect = 8.1
Identities = 13/49 (26%), Positives = 28/49 (57%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNNNNN 69
+K ++KEK+ +E + E++KE K ++ +KK + +K+ +N
Sbjct: 324 EKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKSNK 372
>gnl|CDD|235614 PRK05807, PRK05807, hypothetical protein; Provisional.
Length = 136
Score = 27.8 bits (62), Expect = 8.2
Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)
Query: 15 PTPLDWKKAKKKEKKKK-EKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
P +DW+K K K E + K K E++ ++ KK + K + K +N
Sbjct: 84 PAEIDWQKEKNKNNNGNFEDRLSKFLKDSEERFQDLKKHQDSKGRGYKKSNG 135
>gnl|CDD|132949 cd06618, PKc_MKK7, Catalytic domain of the dual-specificity Protein
Kinase, MAP kinase kinase 7. Protein kinases (PKs), MAP
kinase kinase 7 (MKK7) subfamily, catalytic (c) domain.
PKs catalyze the transfer of the gamma-phosphoryl group
from ATP to serine/threonine or tyrosine residues on
protein substrates. The MKK7 subfamily is part of a
larger superfamily that includes the catalytic domains
of other protein serine/threonine kinases, protein
tyrosine kinases, RIO kinases, aminoglycoside
phosphotransferase, choline kinase, and phosphoinositide
3-kinase. The mitogen-activated protein (MAP) kinase
signaling pathways are important mediators of cellular
responses to extracellular signals. The pathways involve
a triple kinase core cascade comprising the MAP kinase
(MAPK), which is phosphorylated and activated by a MAPK
kinase (MAPKK or MKK), which itself is phosphorylated
and activated by a MAPK kinase kinase (MAPKKK or MKKK).
MKK7 is a dual-specificity PK that phosphorylates and
activates its downstream target, c-Jun N-terminal kinase
(JNK), on specific threonine and tyrosine residues.
Although MKK7 is capable of dual phosphorylation, it
prefers to phosphorylate the threonine residue of JNK.
Thus, optimal activation of JNK requires both MKK4 (not
included in this subfamily) and MKK7. MKK7 is primarily
activated by cytokines. MKK7 is essential for liver
formation during embryogenesis. It plays roles in G2/M
cell cycle arrest and cell growth. In addition, it is
involved in the control of programmed cell death, which
is crucial in oncogenesis, cancer chemoresistance, and
antagonism to TNFalpha-induced killing, through its
inhibition by Gadd45beta and the subsequent suppression
of the JNK cascade.
Length = 296
Score = 28.5 bits (64), Expect = 8.4
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 215 LGKDGQRRPNYRMYMRRTFLLPY 237
L KD ++RP YR ++ F+ Y
Sbjct: 257 LTKDHRKRPKYRELLQHPFIRRY 279
>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
Length = 459
Score = 28.8 bits (64), Expect = 8.4
Identities = 9/14 (64%), Positives = 13/14 (92%)
Query: 61 KNNNNNNNNNNNNN 74
K NN+NNNN++N+N
Sbjct: 195 KINNSNNNNHSNSN 208
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 28.9 bits (65), Expect = 8.4
Identities = 9/39 (23%), Positives = 18/39 (46%)
Query: 23 AKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
A + K E+K+ ++ +E K+ K+ + K K
Sbjct: 903 ALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941
>gnl|CDD|218071 pfam04409, DUF530, Protein of unknown function (DUF530). Family of
hypothetical archaeal proteins.
Length = 512
Score = 28.7 bits (64), Expect = 8.4
Identities = 19/58 (32%), Positives = 23/58 (39%), Gaps = 4/58 (6%)
Query: 7 HQIGTFTEPTPLDWKKAKKKEKKKKEKK----AKKKEKKKEKKKKEKKKKKKKKKKKK 60
+ I + E + +E KK K KK EKKK EK K K K KK
Sbjct: 448 YGISDWDEVEVAEILGIDIEELLKKMAKIEPSEILLFKKVEKKKFEKFGKPKSDKAKK 505
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 28.5 bits (64), Expect = 8.5
Identities = 12/43 (27%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
D +K K+ +K K +KE ++ ++ + KK KK ++K+ +
Sbjct: 70 DQEKMKEL---QKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109
>gnl|CDD|222863 PHA02540, 61, DNA primase; Provisional.
Length = 337
Score = 28.8 bits (65), Expect = 8.5
Identities = 8/34 (23%), Positives = 18/34 (52%)
Query: 19 DWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKK 52
++ + KE+ + + K K + KK+K+ +K
Sbjct: 81 EYIMERFKERGTGKGRPVPKPKFEFKKEKKVIEK 114
>gnl|CDD|221195 pfam11737, DUF3300, Protein of unknown function (DUF3300). This
hypothetical bacterial gene product has a long
hydrophobic segment and is thus likely to be a membrane
protein.
Length = 230
Score = 28.4 bits (64), Expect = 8.6
Identities = 7/25 (28%), Positives = 13/25 (52%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
+N+NN N NN+ ++ +N
Sbjct: 206 HNHNNRINRNNDRVVSSGGQRWQHN 230
Score = 28.4 bits (64), Expect = 9.7
Identities = 7/25 (28%), Positives = 13/25 (52%)
Query: 62 NNNNNNNNNNNNNNNNNNNNNNNNN 86
N+N+NN N NN+ ++ +
Sbjct: 205 NHNHNNRINRNNDRVVSSGGQRWQH 229
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 28.2 bits (63), Expect = 8.7
Identities = 14/40 (35%), Positives = 15/40 (37%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKK 60
K + KK KK K K KK KKKK K
Sbjct: 169 PKPTETITAKKAKKTAAAPKPTADKTATPAKKTTKKKKTK 208
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 28.3 bits (63), Expect = 8.8
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 39 EKKKEKKKKEKKKKKKKKKKKKKNNN 64
EK K KK+ K KK KKK+ K+ N
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKREAN 232
>gnl|CDD|130217 TIGR01147, V_ATP_synt_G, vacuolar ATP synthase, subunit G. This
model describes the vacuolar ATP synthase G subunit in
eukaryotes and includes members from diverse groups
e.g., fungi, plants, parasites etc. V-ATPases are
multi-subunit enzymes composed of two functional
domains: A transmembrane Vo domain and a peripheral
catalytic domain V1. The G subunit is one of the
subunits of the catalytic domain. V-ATPases are
responsible for the acidification of endosomes and
lysosomes, which are part of the central vacuolar
system [Energy metabolism, ATP-proton motive force
interconversion].
Length = 113
Score = 27.5 bits (61), Expect = 8.8
Identities = 13/41 (31%), Positives = 30/41 (73%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+A +K + +++K K+ ++ KE+ +KE +K K++++K+ K
Sbjct: 16 KRAAEKVSEARKRKTKRLKQAKEEAQKEVEKYKQQREKEFK 56
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 28.1 bits (63), Expect = 8.9
Identities = 16/42 (38%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 21 KKAKKKEKK-KKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
K+EK KKEK++ +K +++ K K+ +K + KK KKK
Sbjct: 76 ANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKK 117
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 27.8 bits (62), Expect = 9.0
Identities = 10/41 (24%), Positives = 23/41 (56%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKN 62
+ + + ++ KK K +K++KE +KK + +K++K
Sbjct: 100 EEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKK 140
Score = 27.8 bits (62), Expect = 9.3
Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKK-----KKKKKK 61
KK ++ E ++ +K ++K KKE+K+ +KK K++KKK
Sbjct: 96 KKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 28.8 bits (65), Expect = 9.0
Identities = 9/39 (23%), Positives = 20/39 (51%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNN 65
E+K ++++ + +K+ + E +K K KK K +
Sbjct: 462 EEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQD 500
>gnl|CDD|224097 COG1176, PotB, ABC-type spermidine/putrescine transport system,
permease component I [Amino acid transport and
metabolism].
Length = 287
Score = 28.3 bits (64), Expect = 9.0
Identities = 32/132 (24%), Positives = 50/132 (37%), Gaps = 35/132 (26%)
Query: 233 FLLPYAIV--TSLFFIGQF----GGMTTLQTYAVGIFESIHAPLDPYFATLLLGVAELGG 286
FLLP IV S G+ TL YA + +S++ L +L +
Sbjct: 24 FLLPLLIVLVPSFLLPLLSFFDYSGVFTLGNYARLLTDSLY--LTVLLRSLWIAAL---V 78
Query: 287 ALLCVVLIHYTGKRPLALISTGGSAACFIVVAVYAQFHLSYGWDSPLVPTVFLVLAAFLT 346
L+C+ L+ Y P+A F+ S L+ + LV+ F T
Sbjct: 79 TLICL-LLGY----PVAY---------FLA--------RSPPRWRNLL--LLLVILPFWT 114
Query: 347 HICIRLLPWMLI 358
+ IR W++I
Sbjct: 115 SLLIRTYAWIVI 126
>gnl|CDD|217483 pfam03305, Lipoprotein_X, Mycoplasma MG185/MG260 protein. Most of
the aligned regions in this family are found towards the
middle of the member proteins.
Length = 237
Score = 28.5 bits (64), Expect = 9.0
Identities = 13/56 (23%), Positives = 20/56 (35%), Gaps = 5/56 (8%)
Query: 9 IGTFTEPTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
+G FT K + + K K+A K E + K + K K K +
Sbjct: 180 LGKFTLD-----KNSLTSPEVKAFKEADKSEDPQTDSKTIELYKDKIPTDKNAGVD 230
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.8 bits (64), Expect = 9.1
Identities = 10/35 (28%), Positives = 28/35 (80%)
Query: 27 EKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKK 61
KKK+E++ K E++++++K+E+ +K +++++K+
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSREEEEKR 253
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 28.7 bits (64), Expect = 9.1
Identities = 10/44 (22%), Positives = 21/44 (47%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNN 64
K++ K+ +KK K + K ++++KE+K + N
Sbjct: 124 KESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKN 167
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 28.8 bits (65), Expect = 9.1
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 22 KAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKK 58
+ KKK KK EK+ + +KK+KE + +K++
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
>gnl|CDD|173592 PTZ00401, PTZ00401, aspartyl-tRNA synthetase; Provisional.
Length = 550
Score = 28.8 bits (64), Expect = 9.2
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKKEKKK 51
A EKK+ +K+A+K + E+K + +K
Sbjct: 8 AGAPAVEKKQSDKEARKAARLAEEKARAAEK 38
>gnl|CDD|216790 pfam01925, TauE, Sulfite exporter TauE/SafE. This is a family of
integral membrane proteins where the alignment appears
to contain two duplicated modules of three transmembrane
helices. The proteins are involved in the transport of
anions across the cytoplasmic membrane during taurine
metabolism as an exporter of sulfoacetate. This family
used to be known as DUF81.
Length = 236
Score = 28.3 bits (64), Expect = 9.3
Identities = 8/50 (16%), Positives = 17/50 (34%), Gaps = 1/50 (2%)
Query: 100 LVLVLTYVAEITQPHLRGMLSATASMTTIFGTVSQLFLGSFLHWRSAAIL 149
LV L Y+ + A ++++ + G + W +L
Sbjct: 150 LVPALLYLLGLPLKKAVATSLAVFLVSSLAALLGYALAG-LVDWPLLLLL 198
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 28.2 bits (63), Expect = 9.5
Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 11/59 (18%)
Query: 14 EPTPL-DWKKAKKKE-KKKKEKKAKKKEKKKEKKKKE---------KKKKKKKKKKKKK 61
EP + W++ + +++ E KKKE+ EK +KE +KK+K K + +K+
Sbjct: 110 EPESIRKWRERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKE 168
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It
is a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 28.5 bits (64), Expect = 10.0
Identities = 13/52 (25%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 15 PTPLDWKKAKKKEKKKKEKKAKKKEKKKEKKKKEKKKKKKKKKKKKKNNNNN 66
TP + K + K+ E + K +K+ KK +K+ ++ +K+K K + ++
Sbjct: 18 DTPEE--KLETLIKRLAELLKEMKRLEKDLKKLQKRGEQLQKEKDKLQSEHS 67
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a
molecular chaperone that interacts with unfolded
proteins as they emerge in the periplasm from the Sec
translocation machinery.
Length = 157
Score = 27.6 bits (62), Expect = 10.0
Identities = 14/47 (29%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 21 KKAKKKEKKKKEKKAKKKEKKKEKKKK---EKKKKKKKKKKKKKNNN 64
K + E +KKEK+ +K+E+K +K+ E+ +K K+++ ++K
Sbjct: 44 FKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQE 90
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.398
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,686,579
Number of extensions: 2943830
Number of successful extensions: 30390
Number of sequences better than 10.0: 1
Number of HSP's gapped: 21246
Number of HSP's successfully gapped: 2657
Length of query: 553
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 451
Effective length of database: 6,413,494
Effective search space: 2892485794
Effective search space used: 2892485794
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.8 bits)