RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10021
(716 letters)
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 259 bits (664), Expect = 8e-80
Identities = 130/368 (35%), Positives = 181/368 (49%), Gaps = 63/368 (17%)
Query: 342 SLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKL 401
SL + D E E S + + + +E E
Sbjct: 2 SLVDFEPFPSDADALEEELLISEGGVSSALEVNLELELIEGE-----------------K 44
Query: 402 KFSPDKVDTMIVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNVAFVKTIKTI 461
K DT + A S + + + E EL +D + + +
Sbjct: 45 KLELVVCDTKLGNADSAFEIPSEVGEDLRENAEE----ALGELSS--SDEDDYYRMLL-- 96
Query: 462 GLAHSLSRYKLKFSPDKVDTMIVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTD 521
GL H LSR K++ DK D +I+QA+ LDD+DKE+N MR REWYGWHFPEL +V D
Sbjct: 97 GLGHELSRIKVREEVDKEDKLIIQAIEALDDIDKEINLLAMRLREWYGWHFPELSSLVPD 156
Query: 522 NLK----------RNDNSTRDKTSASDLSEILPEDVEEKVKEAAEISMGTEISDDDIENI 571
N + R + + K S DL LP D+ K+ EAA+ SMG ++S++DI+NI
Sbjct: 157 NEQYAKLVSALGNRENIN---KESLKDLGFALP-DIAIKIAEAAKDSMGADLSEEDIDNI 212
Query: 572 LLLCDQVLEISSYRGQLYDYLKSRMMAVAPNLTILMGELVGARLVAQAGSLLNLAKHPAS 631
L + +LE+ R QL +Y++S+M +APNLT L+G ++GARL++ AG L LAK PAS
Sbjct: 213 RELAEIILELYELREQLEEYIESKMSEIAPNLTALVGPVLGARLISHAGGLTRLAKMPAS 272
Query: 632 TVQILGAEKALFRSLLNLAKHPASTVQILGAEKALFRALKTKRDTPKYGLIYHSQLIGQS 691
T+Q+LGAE KALFRALKT TPKYG+IY S LI +S
Sbjct: 273 TIQVLGAE------------------------KALFRALKTGAKTPKYGVIYQSPLIQKS 308
Query: 692 STKNKGKM 699
+GK+
Sbjct: 309 PPWQRGKI 316
Score = 58.6 bits (142), Expect = 6e-09
Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 6/93 (6%)
Query: 211 LIGQSSTKNKGKMARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLRR 270
LI +S +GK+AR LAAK A+A R+DA + G R +LE ++ L+E +
Sbjct: 304 LIQKSPPWQRGKIARALAAKLAIAARIDAFSGEPD---GISLREELEKRIEKLKEKPPKP 360
Query: 271 LSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEE 303
TKAK + +K ++K K + +E
Sbjct: 361 ---PTKAKPERDKKERPGRYRRKKKEKKAKSER 390
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 183 bits (466), Expect = 3e-51
Identities = 98/257 (38%), Positives = 140/257 (54%), Gaps = 49/257 (19%)
Query: 466 SLSRYKLKFSPDKVDTMIVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTD---- 521
L+R KL+ + K D + +QA+ +DD+DK +N + R REWY HFPEL ++V
Sbjct: 124 ELTRRKLRSAAQKRDKLAIQAIRAIDDIDKTINLFSERLREWYSLHFPELDELVKKHEEY 183
Query: 522 -----NLKRNDNSTRDKTSASDLSEI-LPEDVEEKVKEAAEISMGTEISDDDIENILLLC 575
L +N TR+K L ++ LPE+ K+ EAA+ SMG ++S+ D+E I L
Sbjct: 184 VKLVSELGDRENYTREK-----LKKLGLPENKARKIAEAAKKSMGADLSEFDLEAIKKLA 238
Query: 576 DQVLEISSYRGQLYDYLKSRMMAVAPNLTILMGELVGARLVAQAGSLLNLAKHPASTVQI 635
+++L++ R +L DYL++ M VAPNLT L+G +GARL++ AG L LAK PAST+Q+
Sbjct: 239 NEILDLYKLREELEDYLETVMKEVAPNLTALVGPSLGARLISLAGGLEELAKMPASTIQV 298
Query: 636 LGAEKALFRSLLNLAKHPASTVQILGAEKALFRALKTKRDTPKYGLIYHSQLIGQSSTKN 695
LGAE KALFR L+T PK+G+I+ I
Sbjct: 299 LGAE------------------------KALFRHLRTGAKPPKHGVIFQYPAI------- 327
Query: 696 KGKMGSYGWQPGSASRA 712
GS WQ G +RA
Sbjct: 328 ---HGSPWWQRGKIARA 341
Score = 79.6 bits (197), Expect = 1e-15
Identities = 44/151 (29%), Positives = 76/151 (50%), Gaps = 9/151 (5%)
Query: 328 VKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEE-----EEVSKKKKKKKKKDVED 382
+K DE + S+K ++ + EP + GE E+ + + D E+
Sbjct: 56 LKELGPDEVVVENEEESRKLQELGYRVTVEPPNKIGEFLRENLPELGVEYGFFE--DEEE 113
Query: 383 ETDPLNHVGLAHSLSRYKLKFSPDKVDTMIVQAVSLLDDLDKELNNYMMRCREWYGWHFP 442
+ L+ + L+R KL+ + K D + +QA+ +DD+DK +N + R REWY HFP
Sbjct: 114 FYEKLHEWSVE--LTRRKLRSAAQKRDKLAIQAIRAIDDIDKTINLFSERLREWYSLHFP 171
Query: 443 ELGKIVTDNVAFVKTIKTIGLAHSLSRYKLK 473
EL ++V + +VK + +G + +R KLK
Sbjct: 172 ELDELVKKHEEYVKLVSELGDRENYTREKLK 202
Score = 34.2 bits (79), Expect = 0.27
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 220 KGKMARMLAAKAALATRVDAL-GEDSSIELGTDHRAKL-EIKLRLLEEGNLRRLSGTTKA 277
+GK+AR LAAK A+A RVD G EL + ++ EIK + ++
Sbjct: 335 RGKIARALAAKLAIAARVDYFSGRYIGDELKEELNKRIEEIK-EKYPKPPKKKREEKKPQ 393
Query: 278 KAKLEKYHGKRLEKKKFK 295
K K +K K+ +K+K K
Sbjct: 394 KRKKKKKRKKKGKKRKKK 411
>gnl|CDD|216709 pfam01798, Nop, Putative snoRNA binding domain. This family
consists of various Pre RNA processing
ribonucleoproteins. The function of the aligned region
is unknown however it may be a common RNA or snoRNA or
Nop1p binding domain. Nop5p (Nop58p) from yeast is the
protein component of a ribonucleoprotein protein
required for pre-18s rRNA processing and is suggested to
function with Nop1p in a snoRNA complex. Nop56p and
Nop5p interact with Nop1p and are required for ribosome
biogenesis. Prp31p is required for pre-mRNA splicing in
S. cerevisiae.
Length = 149
Score = 161 bits (409), Expect = 2e-46
Identities = 68/133 (51%), Positives = 87/133 (65%), Gaps = 24/133 (18%)
Query: 570 NILLLCDQVLEISSYRGQLYDYLKSRMMAVAPNLTILMGELVGARLVAQAGSLLNLAKHP 629
NIL C+++L ++ YR QL +Y++S+M +APNLT L+GELVGARL++ AGSL NLAK P
Sbjct: 1 NILEFCERILSLAEYRKQLEEYIESKMSKIAPNLTALVGELVGARLISHAGSLTNLAKLP 60
Query: 630 ASTVQILGAEKALFRSLLNLAKHPASTVQILGAEKALFRALKTKRDTPKYGLIYHSQLIG 689
AST+Q+LGAE KALFRALKTK TPKYGLIYHS LI
Sbjct: 61 ASTIQVLGAE------------------------KALFRALKTKAKTPKYGLIYHSPLIQ 96
Query: 690 QSSTKNKGKMGSY 702
++ K +GK+
Sbjct: 97 KAPPKVRGKIARM 109
Score = 68.8 bits (169), Expect = 8e-14
Identities = 27/57 (47%), Positives = 36/57 (63%)
Query: 210 QLIGQSSTKNKGKMARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEG 266
LI ++ K +GK+ARMLAAK ALA R+DA E+ G + R +LE +L LEE
Sbjct: 93 PLIQKAPPKVRGKIARMLAAKLALAARIDAFSEEPDGSFGIELREELEKRLEKLEEK 149
>gnl|CDD|203845 pfam08060, NOSIC, NOSIC (NUC001) domain. This is the central
domain in Nop56/SIK1-like proteins.
Length = 52
Score = 99.5 bits (249), Expect = 1e-25
Identities = 30/51 (58%), Positives = 39/51 (76%)
Query: 412 IVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNVAFVKTIKTIG 462
I+QA+ LLDD+DKE+N + MR REWY WHFPEL K+V D + + K +K IG
Sbjct: 1 IIQAIELLDDIDKEINLFHMRLREWYSWHFPELEKLVPDPIDYAKVVKLIG 51
Score = 88.3 bits (220), Expect = 9e-22
Identities = 26/41 (63%), Positives = 33/41 (80%)
Query: 483 IVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNL 523
I+QA+ LLDD+DKE+N + MR REWY WHFPEL K+V D +
Sbjct: 1 IIQAIELLDDIDKEINLFHMRLREWYSWHFPELEKLVPDPI 41
>gnl|CDD|197999 smart00931, NOSIC, NOSIC (NUC001) domain. This is the central
domain in Nop56/SIK1-like proteins.
Length = 52
Score = 96.1 bits (240), Expect = 2e-24
Identities = 29/51 (56%), Positives = 40/51 (78%)
Query: 412 IVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNVAFVKTIKTIG 462
I+QA++LLDD+DKE+N + MR REWY WHFPEL K+V D + + K +K +G
Sbjct: 1 IIQAINLLDDIDKEINLFHMRVREWYSWHFPELEKLVPDPLDYAKVVKLLG 51
Score = 85.3 bits (212), Expect = 1e-20
Identities = 27/42 (64%), Positives = 34/42 (80%)
Query: 483 IVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNLK 524
I+QA++LLDD+DKE+N + MR REWY WHFPEL K+V D L
Sbjct: 1 IIQAINLLDDIDKEINLFHMRVREWYSWHFPELEKLVPDPLD 42
>gnl|CDD|219731 pfam08156, NOP5NT, NOP5NT (NUC127) domain. This N terminal
domain is found in RNA-binding proteins of the NOP5
family.
Length = 67
Score = 86.9 bits (216), Expect = 5e-21
Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 1/67 (1%)
Query: 1 MLVLFETPAGYAFFKLLDEKKLQ-EADNLYESFQTPQGANKVLKLKHFEKFIDTTEALAA 59
MLVLFET AGYA FK+ DEK L + + +SFQ + +K++KLK F+ F +AL
Sbjct: 1 MLVLFETAAGYALFKVKDEKDLIGDDKEVQKSFQDLEKFSKMVKLKAFQPFKSAADALEN 60
Query: 60 TTAAVEG 66
A EG
Sbjct: 61 ANAISEG 67
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 42.3 bits (99), Expect = 8e-04
Identities = 26/105 (24%), Positives = 40/105 (38%), Gaps = 10/105 (9%)
Query: 291 KKKFKTEFDAAEETPATPDTSSSGKKKKKNKNL--DTSIVKAEPEDEPAA----ADVSLS 344
+K A +E P S K + K T + +EP +E + D +L
Sbjct: 48 SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107
Query: 345 KKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKK----KKDVEDETD 385
K+ KK + + + EE +KK +K KK ED D
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVED 152
Score = 33.9 bits (77), Expect = 0.34
Identities = 31/145 (21%), Positives = 56/145 (38%), Gaps = 13/145 (8%)
Query: 245 SIELGTDHRAKLEIKLRLLEEGN--LRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAE 302
L R K+E L ++E +S ++ KR KK + E
Sbjct: 41 KFLLAAGSRKKIESAL-AVDEEPDENGAVSKKKPTRSV------KRATKKTVVEISEPLE 93
Query: 303 ETPA--TPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVE 360
E + ++ K+ KK A D + K +K++K + +E VE
Sbjct: 94 EGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEK--TEKKVRKRRKVKKMDEDVE 151
Query: 361 EGGEEEEVSKKKKKKKKKDVEDETD 385
+ G E EVS ++ + +E+E++
Sbjct: 152 DQGSESEVSDVEESEFVTSLENESE 176
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 39.8 bits (93), Expect = 0.004
Identities = 25/60 (41%), Positives = 36/60 (60%), Gaps = 4/60 (6%)
Query: 329 KAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLN 388
K P++E KK++++KT D+EE V+E E+EE KKKK KK K+ E + LN
Sbjct: 33 KEVPDEEEEEEKEE--KKEEEEKTTDKEEEVDEEEEKEE--KKKKTKKVKETTTEWELLN 88
Score = 30.1 bits (68), Expect = 5.3
Identities = 11/35 (31%), Positives = 20/35 (57%)
Query: 351 KTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETD 385
K ++E P EE EE+E K++++K E+ +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDE 63
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 39.5 bits (92), Expect = 0.006
Identities = 21/106 (19%), Positives = 38/106 (35%), Gaps = 3/106 (2%)
Query: 273 GTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEP 332
+K K + K + E + +E KK+K + K E
Sbjct: 81 EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKK---KKEKPKEEPKDRKPKEEA 137
Query: 333 EDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
+++ + K+KK ++ +D EE + + KK KKK
Sbjct: 138 KEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKP 183
Score = 34.1 bits (78), Expect = 0.24
Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 7/105 (6%)
Query: 288 RLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKK 347
K K E + ++KKK K E + + + K+
Sbjct: 88 PAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKK-----EKPKEEPKDRKPKEEAKEKRP 142
Query: 348 KKKKTQDEEEPVEEGGEEEEVSK--KKKKKKKKDVEDETDPLNHV 390
K+K +++E+ VEE + EE K + + K + + P N
Sbjct: 143 PKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKK 187
Score = 34.1 bits (78), Expect = 0.27
Identities = 20/95 (21%), Positives = 38/95 (40%), Gaps = 3/95 (3%)
Query: 295 KTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQD 354
K D A + + K K K K E +++ + KK+K K+
Sbjct: 70 KLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129
Query: 355 E---EEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
+ +E +E +E K+K+KK ++ + E +
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEK 164
Score = 32.9 bits (75), Expect = 0.65
Identities = 25/128 (19%), Positives = 45/128 (35%), Gaps = 13/128 (10%)
Query: 272 SGTTKAKAKLEKYHGKRLEKKKFKTEFDA-----AEETPATPDTSSSGKKKKKNKNLDTS 326
SG + K K + K+ +K+K K E + P K+KK + D
Sbjct: 102 SGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDRE 161
Query: 327 IVKAEPEDEPAAADVSLSKKKKKKKTQDEEEP--------VEEGGEEEEVSKKKKKKKKK 378
K + KKK K ++ E G+ EE ++++K++
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE 221
Query: 379 DVEDETDP 386
D + +
Sbjct: 222 DDGKDRET 229
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 38.6 bits (89), Expect = 0.013
Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 2/104 (1%)
Query: 276 KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDE 335
+A + E + K E ++ A ++ KK ++ K D + KAE ED+
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE-EDK 1404
Query: 336 PAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
A ++ KKK + ++ EE + +E KK ++ KK D
Sbjct: 1405 KKADELK-KAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447
Score = 36.3 bits (83), Expect = 0.071
Identities = 25/114 (21%), Positives = 47/114 (41%), Gaps = 4/114 (3%)
Query: 268 LRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSI 327
++ + K KA K K+ E+ K E AE A D + + ++K +
Sbjct: 1320 AKKKAEEAKKKADAAK---KKAEEAKKAAEAAKAEAE-AAADEAEAAEEKAEAAEKKKEE 1375
Query: 328 VKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
K + + A+ + KKK +++++ +E + KK + KKK E
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429
Score = 35.5 bits (81), Expect = 0.12
Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 4/107 (3%)
Query: 278 KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
K K ++ K E KK AEE D + KKK + KAE E
Sbjct: 1301 KKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA---KKKAEEAKKAAEAAKAEAEAAAD 1357
Query: 338 AADVSLSKKKK-KKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
A+ + K + +KK ++ ++ + ++ E KK + KKK ED+
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDK 1404
Score = 35.1 bits (80), Expect = 0.14
Identities = 24/109 (22%), Positives = 42/109 (38%), Gaps = 5/109 (4%)
Query: 278 KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEP----- 332
K ++ K E KK E ++ + + + + KAE
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Query: 333 EDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
E+ AD + K ++KKK + ++ EE ++ + KK KKK E
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE 1422
Score = 35.1 bits (80), Expect = 0.15
Identities = 32/118 (27%), Positives = 51/118 (43%), Gaps = 7/118 (5%)
Query: 264 EEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNL 323
EE + + KA+ + + E + E +AAEE + KKK
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA---- 1380
Query: 324 DTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
D + KAE E AD + K ++ KK DE + ++ + +KKK ++KKK E
Sbjct: 1381 DAAKKKAE---EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435
Score = 34.7 bits (79), Expect = 0.19
Identities = 33/127 (25%), Positives = 62/127 (48%), Gaps = 1/127 (0%)
Query: 253 RAKLEIKLRLLEEGNLRRLSGTTKAK-AKLEKYHGKRLEKKKFKTEFDAAEETPATPDTS 311
A++E ++L EE + KA+ AK++ K+ E++K K E +E
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650
Query: 312 SSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKK 371
K +++NK K ED+ A + +++ +KK + ++ EE + EE+ KK
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710
Query: 372 KKKKKKK 378
+ ++KKK
Sbjct: 1711 EAEEKKK 1717
Score = 34.7 bits (79), Expect = 0.24
Identities = 39/167 (23%), Positives = 72/167 (43%), Gaps = 11/167 (6%)
Query: 226 MLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLR--RLSGTTKAKAKLEK 283
M KA A + + + ++L + + + + EE ++ L + K K+E+
Sbjct: 1578 MALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQ 1637
Query: 284 YHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSL 343
K E+KK E AEE ++ KK + K EDE AA+
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKI--KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Query: 344 SKKKKKKKTQ-------DEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
+ ++ KK + +E++ EE + EE +K K ++ KK+ E++
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742
Score = 34.3 bits (78), Expect = 0.29
Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 5/102 (4%)
Query: 278 KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
K K ++ K E KK E A D + KK ++ K D + KAE E
Sbjct: 1390 KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAK--KKAEEKKKADEAKKKAE---EAK 1444
Query: 338 AADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
AD + K ++ KK ++ ++ EE + +E KK ++ KK D
Sbjct: 1445 KADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKAD 1486
Score = 34.0 bits (77), Expect = 0.36
Identities = 19/94 (20%), Positives = 39/94 (41%), Gaps = 6/94 (6%)
Query: 288 RLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKK 347
+ E+ + E AEE + + +KKK ++ K + E+ A + ++
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADE------AKKKAEEAKKADEAKKKAEE 1326
Query: 348 KKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
KKK ++ EE + E +K + + + E
Sbjct: 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360
Score = 33.6 bits (76), Expect = 0.48
Identities = 29/117 (24%), Positives = 52/117 (44%), Gaps = 10/117 (8%)
Query: 276 KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAE---- 331
KA+ + K+ E+KK E AEE + + + KK ++ + ++ KAE
Sbjct: 1529 KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588
Query: 332 ------PEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVED 382
E + K ++ KK ++ + EE + EE KK ++ KKK+ E+
Sbjct: 1589 AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645
Score = 32.4 bits (73), Expect = 1.0
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 1/107 (0%)
Query: 276 KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDE 335
+AK K E+ K E KK A+E + + KK +A+ + E
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454
Query: 336 PAAADVSLSKKKKKKKTQDE-EEPVEEGGEEEEVSKKKKKKKKKDVE 381
A KK ++ K DE ++ EE + +E KK ++ KKK E
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADE 1501
Score = 32.0 bits (72), Expect = 1.5
Identities = 29/142 (20%), Positives = 60/142 (42%), Gaps = 1/142 (0%)
Query: 242 EDSSIELGTDHRAKLEIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAA 301
ED ++ L AK + R+ E L KA+ + +K +++ ++ K +
Sbjct: 1574 EDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE-EAKKAEEAKIKAEELKKAEEEK 1632
Query: 302 EETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEE 361
++ + KKK + K + +E A+ K ++ KK +++E+ E
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
Query: 362 GGEEEEVSKKKKKKKKKDVEDE 383
++E KK ++ KK +E
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEE 1714
Score = 31.3 bits (70), Expect = 2.2
Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 12/105 (11%)
Query: 275 TKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPED 334
KA+AK + L+ +FDA E+ A T + K ++ K +T KAE
Sbjct: 1058 GKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETG--KAEEA- 1114
Query: 335 EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
K ++ KK ++ EE + E+ K ++ +K +D
Sbjct: 1115 ---------RKAEEAKKKAEDARKAEEARKAEDARKAEEARKAED 1150
Score = 31.3 bits (70), Expect = 2.8
Identities = 24/106 (22%), Positives = 41/106 (38%)
Query: 278 KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
K K ++ KKK AEE + ++ KK +A+ +E
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAK 1463
Query: 338 AADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
K + KK +E + +E ++ E +KKK + KK E +
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAK 1509
Score = 30.9 bits (69), Expect = 3.0
Identities = 28/123 (22%), Positives = 47/123 (38%), Gaps = 11/123 (8%)
Query: 270 RLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVK 329
R KA+ + K+ E+KK E AEE + ++ KK K
Sbjct: 1267 RRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEA-----K 1321
Query: 330 AEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKK-----KKKKDVEDET 384
+ E+ AD + KK ++ K E E +E ++K KKK++ + +
Sbjct: 1322 KKAEEAKKKAD-AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA 1380
Query: 385 DPL 387
D
Sbjct: 1381 DAA 1383
Score = 30.9 bits (69), Expect = 3.3
Identities = 26/105 (24%), Positives = 42/105 (40%)
Query: 277 AKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEP 336
AK K ++ K EKKK AEE + ++ KK + +A+ DE
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475
Query: 337 AAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
K + KK +E + + ++ +KKK + KK E
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520
Score = 30.5 bits (68), Expect = 3.8
Identities = 27/104 (25%), Positives = 47/104 (45%), Gaps = 4/104 (3%)
Query: 276 KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDE 335
A+ K E K+ E KK + DAA++ + KKK + +K +
Sbjct: 1361 AAEEKAEAAEKKKEEAKK---KADAAKKKAEEKKKADEAKKKAEEDKKKADELK-KAAAA 1416
Query: 336 PAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
AD + K ++KKK + ++ EE + +E KK ++ KK +
Sbjct: 1417 KKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460
Score = 30.5 bits (68), Expect = 3.8
Identities = 29/131 (22%), Positives = 53/131 (40%), Gaps = 6/131 (4%)
Query: 257 EIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKK 316
E K + E+ + K ++ K E KK + AEE + ++
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
Query: 317 KKK----NKNLDTSIVKAEP--EDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSK 370
KK K + + KA+ + A +KK ++ K DE + EE + +E K
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
Query: 371 KKKKKKKKDVE 381
++KKK +++
Sbjct: 1542 AEEKKKADELK 1552
Score = 29.7 bits (66), Expect = 7.5
Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 9/102 (8%)
Query: 278 KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
K ++ K E KK E A E D + ++ KK + + ++
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE--- 1538
Query: 338 AADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
+KK ++KK DE + EE + EE K ++ KK ++
Sbjct: 1539 ------AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 38.5 bits (90), Expect = 0.014
Identities = 27/125 (21%), Positives = 50/125 (40%), Gaps = 5/125 (4%)
Query: 261 RLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFK-TEFDAAEETPATPDTSSSGKKKKK 319
L E S + + +K G+R KF + E+ + + K+KK
Sbjct: 396 ELRRELEGEEESDEEENEEPSKKNVGRR----KFGPENGEKEAESKKLKKENKNEFKEKK 451
Query: 320 NKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
+ + + E A+ L + +K +K ++EEE EE + S K KK+D
Sbjct: 452 ESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQD 511
Query: 380 VEDET 384
+ ++
Sbjct: 512 SKKKS 516
Score = 33.9 bits (78), Expect = 0.32
Identities = 34/180 (18%), Positives = 65/180 (36%), Gaps = 15/180 (8%)
Query: 222 KMARMLAAKAAL--ATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLRRLSGTTKAKA 279
M M A+A + +E + + + G RR G +
Sbjct: 375 SMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVG--RRKFGPENGEK 432
Query: 280 KLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAA 339
+ E K+ K +FK + ++ EE + + +K + + E E+E
Sbjct: 433 EAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492
Query: 340 D-----------VSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLN 388
+ S K+ KKK+ + + + V KKKKKK+K ++ + D ++
Sbjct: 493 ENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLID 552
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 38.1 bits (89), Expect = 0.018
Identities = 23/186 (12%), Positives = 51/186 (27%), Gaps = 13/186 (6%)
Query: 214 QSSTKNKGKMARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLRRLSG 273
++ K K A+ + + E + + + + + +
Sbjct: 15 EAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDI 74
Query: 274 TTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPE 333
K K + K D + + ++ K N D ++ +
Sbjct: 75 PKKKTKTAAK--AAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADD 132
Query: 334 DEPAAADVSLSKK-----------KKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVED 382
D+ D L + D ++ EE E +E+ K ED
Sbjct: 133 DDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDED 192
Query: 383 ETDPLN 388
+++ L
Sbjct: 193 DSEALR 198
Score = 36.5 bits (85), Expect = 0.050
Identities = 22/136 (16%), Positives = 47/136 (34%)
Query: 274 TTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPE 333
A + K K+L K F EE ++ ++ + S + + +
Sbjct: 8 AELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTD 67
Query: 334 DEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLA 393
D + K K + ++ +E + SKK +KK D +D+ + + + +
Sbjct: 68 DATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVL 127
Query: 394 HSLSRYKLKFSPDKVD 409
+ D +D
Sbjct: 128 NQADDDDDDDDDDDLD 143
Score = 36.1 bits (84), Expect = 0.068
Identities = 25/145 (17%), Positives = 49/145 (33%), Gaps = 4/145 (2%)
Query: 244 SSIELGTDHRAKLEIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEE 303
+ EL + AK ++K +L + + + K LE + + +
Sbjct: 6 TKAELAAEEEAKKKLK-KLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVK 64
Query: 304 TPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGG 363
S KKK K A+ + D S KK +KK +++
Sbjct: 65 DTDDATESDIPKKKTKTAAKAA---AAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYV 121
Query: 364 EEEEVSKKKKKKKKKDVEDETDPLN 388
++ +V + D +D+ D +
Sbjct: 122 KDIDVLNQADDDDDDDDDDDLDDDD 146
Score = 29.6 bits (67), Expect = 6.5
Identities = 17/137 (12%), Positives = 34/137 (24%), Gaps = 3/137 (2%)
Query: 290 EKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK 349
AAEE + K K K I +A + +
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLS 60
Query: 350 KKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLKFSPDKVD 409
+D ++ E +++ K K + + + L D
Sbjct: 61 GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNY 120
Query: 410 TMIVQAVSLLDDLDKEL 426
+ +L+ D +
Sbjct: 121 VKDID---VLNQADDDD 134
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 36.1 bits (82), Expect = 0.018
Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 2/91 (2%)
Query: 298 FDAAEETPATPDTSSSGKKK--KKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDE 355
F AA + PA + GK K K K +P+DEP + +++ +++ + E
Sbjct: 24 FAAAADEPADKGITKGGKGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPE 83
Query: 356 EEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
EE EE EE ++ + + + + E E +P
Sbjct: 84 EEGEEEPEPEETGEEEPEPEPEPEPEPEPEP 114
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 37.1 bits (86), Expect = 0.027
Identities = 31/114 (27%), Positives = 47/114 (41%), Gaps = 18/114 (15%)
Query: 290 EKKKFKTEFDAAEETPATPD----TSSSGKKKKKNKNLDTSIVKAEPEDEPAAA------ 339
EK + KT AA P SS KKK K K ++ ++E
Sbjct: 194 EKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDV 253
Query: 340 --DVSLSKKKKKKKTQDEEEPVEEG------GEEEEVSKKKKKKKKKDVEDETD 385
+ ++ + +DE+EP G E EE K+K+K+ KK +EDE +
Sbjct: 254 ILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDE 307
Score = 30.6 bits (69), Expect = 3.5
Identities = 15/81 (18%), Positives = 21/81 (25%)
Query: 301 AEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVE 360
T T+ K P + K K KKTQD +
Sbjct: 133 PNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETT 192
Query: 361 EGGEEEEVSKKKKKKKKKDVE 381
E + S K K+ +
Sbjct: 193 TEKTEGKTSVKAASLKRNPPK 213
Score = 29.8 bits (67), Expect = 5.4
Identities = 16/85 (18%), Positives = 30/85 (35%), Gaps = 5/85 (5%)
Query: 300 AAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADV-SLSKKKKKKKTQDEEEP 358
+ T DTS +K + + + S KKK K+K + +E
Sbjct: 177 KVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEAS 236
Query: 359 VEEGGEEEEVSKKKKKKKKKDVEDE 383
EE E ++ K+ + ++
Sbjct: 237 ESTVKEESE----EESGKRDVILED 257
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 36.6 bits (85), Expect = 0.037
Identities = 28/115 (24%), Positives = 46/115 (40%), Gaps = 4/115 (3%)
Query: 290 EKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK 349
K + + + + S+ +++ K TS K + ED+ + KKKKK
Sbjct: 19 SKLQPISYIYSNVLVLSKEILSTFSEEENKVAT--TSTKKDKKEDKNNESKKKSEKKKKK 76
Query: 350 KKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLKFS 404
KK +++EP EG + KK KK K + P V A + + S
Sbjct: 77 KK--EKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKS 129
Score = 32.4 bits (74), Expect = 0.82
Identities = 15/76 (19%), Positives = 33/76 (43%), Gaps = 8/76 (10%)
Query: 310 TSSSGKKKKK--------NKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEE 361
+ ++ K K + L I+ E+E A S K KK+ K + ++ E+
Sbjct: 13 SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEK 72
Query: 362 GGEEEEVSKKKKKKKK 377
++++ K+ K + +
Sbjct: 73 KKKKKKEKKEPKSEGE 88
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 36.9 bits (86), Expect = 0.049
Identities = 19/146 (13%), Positives = 45/146 (30%), Gaps = 10/146 (6%)
Query: 241 GEDSSIELGTDHRAKLEIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDA 300
+ SS++ + +E + LS K K ++ + D
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDG 1294
Query: 301 AEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVE 360
+ P + + K KK+ + ++ K + ++ A + K+
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRL-- 1352
Query: 361 EGGEEEEVSKKKKKKKKKDVEDETDP 386
++ +KKK ++ D
Sbjct: 1353 --------LRRPRKKKSDSSSEDDDD 1370
Score = 31.6 bits (72), Expect = 1.9
Identities = 16/107 (14%), Positives = 33/107 (30%), Gaps = 7/107 (6%)
Query: 269 RRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIV 328
+ + E G + K E + A KKK + K
Sbjct: 1282 QYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAAL----KKKKKSEKKTARKKKS 1337
Query: 329 KAEPED---EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKK 372
K + ++ + +KKK + ++++ E E+E +
Sbjct: 1338 KTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDD 1384
Score = 30.0 bits (68), Expect = 5.2
Identities = 22/170 (12%), Positives = 46/170 (27%), Gaps = 13/170 (7%)
Query: 214 QSSTKNKGKMARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLRRLSG 273
++S K + + S + D E + ++ +++RL
Sbjct: 1191 KASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDD-----EEQKTKPKKSSVKRLKS 1245
Query: 274 TTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPE 333
+K + + + K + S +K D
Sbjct: 1246 KKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYS---PPPPSKRPDGESNGGSKP 1302
Query: 334 DEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
P K KK+ + ++ E++ KKK K + K
Sbjct: 1303 SSPTKK-----KVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 36.8 bits (85), Expect = 0.055
Identities = 35/139 (25%), Positives = 51/139 (36%), Gaps = 13/139 (9%)
Query: 268 LRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSI 327
L+ L K KL+K K E ++ A E+ A S KK K
Sbjct: 1155 LKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTT--- 1211
Query: 328 VKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPL 387
K E E S +T++ E V+ G +KKK K+ E+E + L
Sbjct: 1212 -KKASESETTEETYGSS----AMETENVAEVVKPKGRAG--AKKKAPAAAKEKEEEDEIL 1264
Query: 388 NHVGLAHSLSRYKLKFSPD 406
+ L L+ Y L +P
Sbjct: 1265 D---LKDRLAAYNLDSAPA 1280
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 35.1 bits (81), Expect = 0.066
Identities = 24/99 (24%), Positives = 45/99 (45%), Gaps = 6/99 (6%)
Query: 304 TPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGG 363
TP + KKKKK + VK E E++ KKKK K +D+++
Sbjct: 50 TPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKD------ 103
Query: 364 EEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLK 402
++++ +KK +K+ + + E ++ +LS K +
Sbjct: 104 DKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPR 142
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 35.7 bits (83), Expect = 0.076
Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 3/79 (3%)
Query: 314 GKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKK 373
G KK K + KAE + E KK K ++EEE E+ +EEE ++++
Sbjct: 404 GSKKATKKIKKI-VEKAEKKREEEK--KEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460
Query: 374 KKKKKDVEDETDPLNHVGL 392
+ +++ E+E L
Sbjct: 461 EAEEEKEEEEEKKKKQATL 479
Score = 35.3 bits (82), Expect = 0.11
Identities = 19/82 (23%), Positives = 40/82 (48%)
Query: 297 EFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEE 356
E E T ++ K KK + + + + E + A +++++++ + +E
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Query: 357 EPVEEGGEEEEVSKKKKKKKKK 378
E EE EE E K+++++KKK
Sbjct: 453 EEKEEEEEEAEEEKEEEEEKKK 474
Score = 34.5 bits (80), Expect = 0.19
Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 3/81 (3%)
Query: 263 LEEGNLRRLSGTTKAKAKLEKYHG---KRLEKKKFKTEFDAAEETPATPDTSSSGKKKKK 319
L E + L+G+ KA K++K K+ E++K + + A + +KK++
Sbjct: 394 LTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Query: 320 NKNLDTSIVKAEPEDEPAAAD 340
K + + E E+E
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKK 474
Score = 33.7 bits (78), Expect = 0.32
Identities = 19/72 (26%), Positives = 40/72 (55%)
Query: 312 SSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKK 371
S K KK K + K E++ + + KKK+++ ++E+E EE EEEE +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 372 KKKKKKKDVEDE 383
++K+++++ + +
Sbjct: 464 EEKEEEEEKKKK 475
Score = 32.6 bits (75), Expect = 0.76
Identities = 19/70 (27%), Positives = 40/70 (57%)
Query: 310 TSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVS 369
T S KK K ++ + K E E + KKK++++ +++E+ EE EEEE +
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 370 KKKKKKKKKD 379
+++K+++++
Sbjct: 463 EEEKEEEEEK 472
Score = 31.8 bits (73), Expect = 1.4
Identities = 16/44 (36%), Positives = 21/44 (47%)
Query: 343 LSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
KK KK + E E+ EEE+ KKKK K E+E +
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 34.3 bits (79), Expect = 0.11
Identities = 25/86 (29%), Positives = 40/86 (46%), Gaps = 6/86 (6%)
Query: 294 FKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQ 353
F T + A + P+ + S +K+ ++ E E + + K+KK+KK +
Sbjct: 114 FPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173
Query: 354 DEEEPVEEGGEEEEVSKKKKKKKKKD 379
E+ +G SKKKKKKKKK
Sbjct: 174 KEKMVEPKG------SKKKKKKKKKK 193
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 33.3 bits (77), Expect = 0.20
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 340 DVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
D +KKKKKK + E E EE ++ VS+++K+ +K
Sbjct: 78 DAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 34.3 bits (79), Expect = 0.23
Identities = 19/64 (29%), Positives = 33/64 (51%)
Query: 316 KKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKK 375
KK K ++ +E ++ A + + + +K + + EE EE EE + KKKK+
Sbjct: 373 KKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKR 432
Query: 376 KKKD 379
KKK+
Sbjct: 433 KKKE 436
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 34.1 bits (79), Expect = 0.24
Identities = 21/99 (21%), Positives = 41/99 (41%), Gaps = 17/99 (17%)
Query: 329 KAEPEDEPAAADVSLSKKKK----KKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDET 384
E+E ++S +++KK ++K + + E E + + KK K + ET
Sbjct: 394 AEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453
Query: 385 DPLNHVGLAHSLSRYK-------------LKFSPDKVDT 410
++ L L+R + L+ SPD ++T
Sbjct: 454 KKVDPDPLGEKLARTEDPLEEAMKFLKPLLQLSPDNIET 492
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 33.6 bits (77), Expect = 0.25
Identities = 19/63 (30%), Positives = 29/63 (46%), Gaps = 4/63 (6%)
Query: 315 KKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKK 374
+K K NK E ++ + D ++ K K K Q E + +E++ KKK K
Sbjct: 19 EKYKANK----DKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74
Query: 375 KKK 377
KKK
Sbjct: 75 KKK 77
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 34.4 bits (79), Expect = 0.25
Identities = 25/80 (31%), Positives = 36/80 (45%), Gaps = 7/80 (8%)
Query: 302 EETPATPDTSSSGKKKKKNKNLDTSIVKAE--PEDEPAAADVSLSKKKKKKKTQDEEEPV 359
E+ S G KKKK K+ T AE P+DE D KK +KK +D+ V
Sbjct: 407 NESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDE---EDAPKKGKKNQKKGRDKSSKV 463
Query: 360 EEGGEEEEVSKKKKKKKKKD 379
+ + KK+ K ++D
Sbjct: 464 P--SDSKAGGKKESVKSQED 481
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 32.9 bits (75), Expect = 0.30
Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 16/74 (21%)
Query: 317 KKKNKNLDTSIVKAEPEDEPAAADVSLS----KKKKKKKTQDEEEPVEEGGEEEEVSKKK 372
KKK+K+ +P E +D KK KKKK +D++E +KK
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKE------------RKK 157
Query: 373 KKKKKKDVEDETDP 386
KKK+KK + P
Sbjct: 158 KKKEKKKKKKRHSP 171
Score = 29.8 bits (67), Expect = 3.6
Identities = 14/47 (29%), Positives = 19/47 (40%)
Query: 335 EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
P KK + + EE P + G + K KKKK + D E
Sbjct: 108 PPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKE 154
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 32.8 bits (75), Expect = 0.47
Identities = 13/72 (18%), Positives = 29/72 (40%), Gaps = 5/72 (6%)
Query: 303 ETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKK----TQDEEEP 358
E + D ++ ++ K + + + + ED+ A +++ ++ + DE E
Sbjct: 50 EAKKSDDQETAEIEEVKEEEKEAA-NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108
Query: 359 VEEGGEEEEVSK 370
E E V K
Sbjct: 109 ETEEKTESNVEK 120
Score = 30.9 bits (70), Expect = 1.9
Identities = 15/54 (27%), Positives = 21/54 (38%)
Query: 334 DEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPL 387
AA KK + E E V+E +E S+ K+ K + EDE
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEE 93
Score = 28.9 bits (65), Expect = 8.4
Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 15/86 (17%)
Query: 315 KKKKKNKNLDTSI---------------VKAEPEDEPAAADVSLSKKKKKKKTQDEEEPV 359
K++K N+ L+ +I + P D+ AA + K ++ + EE
Sbjct: 8 KRRKINRVLNIAIGIVVVLIIIVAYQLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKE 67
Query: 360 EEGGEEEEVSKKKKKKKKKDVEDETD 385
EE K+ K +K+ E+ +
Sbjct: 68 EEKEAANSEDKEDKGDAEKEDEESEE 93
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 32.5 bits (74), Expect = 0.59
Identities = 20/98 (20%), Positives = 37/98 (37%), Gaps = 15/98 (15%)
Query: 304 TPATPDTSSSGKKKKKNKNL----------DTSIVKAEPEDEPAAADVS-----LSKKKK 348
P P+ K N + ++ E E + A S K
Sbjct: 184 GPGEPEPEDDPKDSLGNGSSTNGLPDSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVK 243
Query: 349 KKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
KK+ DEE+ ++G EE K+ +++++ E+E +
Sbjct: 244 NKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEE 281
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 33.0 bits (75), Expect = 0.66
Identities = 19/63 (30%), Positives = 28/63 (44%)
Query: 324 DTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
D+S +PE+ + K + ++ +D EE EE EEE KK KK KK +
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348
Query: 384 TDP 386
Sbjct: 349 NGL 351
Score = 30.7 bits (69), Expect = 3.1
Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 4/57 (7%)
Query: 329 KAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETD 385
K PE ++ + + +EE+ EEGG ++ K KK K KK+ D+ D
Sbjct: 303 KLSPEIPAKPEI----EQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 32.7 bits (75), Expect = 0.70
Identities = 19/63 (30%), Positives = 26/63 (41%), Gaps = 3/63 (4%)
Query: 315 KKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKK 374
K KK LD+ + + + S +K KKKKK + +E S KKK
Sbjct: 334 KTKKTQTRLDSFFTATKKPIKKSN---SKAKLKKKKKKAGASAVPKSETSQEAKSSGKKK 390
Query: 375 KKK 377
KK
Sbjct: 391 VKK 393
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 32.7 bits (74), Expect = 0.79
Identities = 30/122 (24%), Positives = 52/122 (42%), Gaps = 14/122 (11%)
Query: 264 EEGNLRRLSGTTKAKAKLEKYH-------GKRLEKKKFKTEFDAAEETPATPDTSSSGKK 316
E+G++ + +K K EK K+ E + FK+ A +++PA+ S +
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASA-ASVAEAD 246
Query: 317 KKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKK 376
+ N + DEP A+ +KK K K + + + E+EE KKKK
Sbjct: 247 EASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRK------EKEEKKKKKKHHH 300
Query: 377 KK 378
+
Sbjct: 301 HR 302
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 32.6 bits (74), Expect = 0.92
Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 1/79 (1%)
Query: 92 DTKRAAYPSLGTT-VLLYPYSWSGSSSNAAAKHFHNSFVINIKKPKKSSSSSKHHHHATS 150
+ A+ L +T L+ + G+ S + + + + +K K S H HH+
Sbjct: 215 EKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIG 274
Query: 151 KKNASVTHATPSVETGHTK 169
N HA ++T
Sbjct: 275 INNHHSKHADSKLQTIEVI 293
>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
Length = 233
Score = 31.7 bits (72), Expect = 0.92
Identities = 14/49 (28%), Positives = 20/49 (40%)
Query: 317 KKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEE 365
K NK L+ V+ E+ P + SKKK + + E E E
Sbjct: 183 KSYNKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVEIEYEDEIEYKSL 231
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 32.0 bits (73), Expect = 0.94
Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 331 EPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKK 374
+PEDE + SL ++ KKK EE ++ E EE+ +KK
Sbjct: 73 KPEDEKELSASSLEAEQAKKK---EEAEAKKKKEMEELKAVQKK 113
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 32.2 bits (73), Expect = 1.1
Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 13/150 (8%)
Query: 231 AALATRVDALGEDSSIEL--GTDHRAKLEIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKR 288
A +R ++ +L T++ A+L I L L+ + KL++ K
Sbjct: 162 EAAGSREKRKKKERLKKLIEETENLAELIIDLEELK-----------LQELKLKEQAKKA 210
Query: 289 LEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKK 348
LE + K + + EE D +++ + E E + +
Sbjct: 211 LEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270
Query: 349 KKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
K EEE ++ EEE K++++ K
Sbjct: 271 VLKENKEEEKEKKLQEEELKLLAKEEEELK 300
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 32.3 bits (73), Expect = 1.3
Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 1/71 (1%)
Query: 317 KKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKK-KTQDEEEPVEEGGEEEEVSKKKKKK 375
K + +LD I+K A D+S +DE+E ++ ++EE +++++
Sbjct: 121 KAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEI 180
Query: 376 KKKDVEDETDP 386
K D EDE D
Sbjct: 181 KGFDDEDEEDE 191
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 32.0 bits (73), Expect = 1.3
Identities = 29/125 (23%), Positives = 39/125 (31%), Gaps = 11/125 (8%)
Query: 267 NLRRLSGTTKAKAKLEKYHGKRLEKKKFKT-EFDAAEETPATPDTSSSG-----KKKKKN 320
NL L + EK + +K K D D KK KK
Sbjct: 222 NLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKK 281
Query: 321 KNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDV 380
K K EDE L + VEE E S KK+K++++D
Sbjct: 282 KKKKKKRRKDLDEDELEPEAEGLGSSDSGS-----RKDVEEENARLEDSPKKRKEEQEDD 336
Query: 381 EDETD 385
+ D
Sbjct: 337 DFVED 341
Score = 30.1 bits (68), Expect = 4.3
Identities = 25/104 (24%), Positives = 37/104 (35%), Gaps = 13/104 (12%)
Query: 287 KRLEKKKFKTEFDAAEETPATP-------DTSSSGKKKKKNKNLDTSIVK-----AEPED 334
K LE KK K ++D ++ D GKKKK + AE +
Sbjct: 181 KNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKR 240
Query: 335 EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
+ + ++ + E + E V KK KKKKK
Sbjct: 241 QEVKKKLKINNVSLDDDS-TETPASDYYDVSEMVKFKKPKKKKK 283
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 32.0 bits (73), Expect = 1.5
Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 11/80 (13%)
Query: 311 SSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK----KKTQDEEEPVEEGGEEE 366
S+ KK + +N + + E EPA S K+KK ++E+ +EE E
Sbjct: 1511 PSNEKKSIEIENRN------QEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564
Query: 367 EVSKKKKKKK-KKDVEDETD 385
++ K+K KK+ K + E E D
Sbjct: 1565 DIKKRKNKKQYKSNTEAELD 1584
>gnl|CDD|149180 pfam07960, CBP4, CBP4. The CBP4 in S. cerevisiae is essential for
the expression and activity of ubiquinol-cytochrome c
reductase. This family appears to be fungal specific.
Length = 128
Score = 30.1 bits (68), Expect = 1.7
Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 1/65 (1%)
Query: 315 KKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEE-EEVSKKKK 373
K+ D I K P + P ++ + +K K + ++E +E EE EE + +
Sbjct: 64 TILKETSKSDKPIWKTGPIESPWEKKRNVREFQKTKAEEAQKEELERIREELEEARAQSE 123
Query: 374 KKKKK 378
+ +K+
Sbjct: 124 EMRKE 128
>gnl|CDD|166134 PLN02493, PLN02493, probable peroxisomal (S)-2-hydroxy-acid
oxidase.
Length = 367
Score = 31.2 bits (70), Expect = 1.8
Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 2/75 (2%)
Query: 608 GELVGARLVAQAGSLLNLAKHPASTVQILGAEKALFRSL-LNLAKHPASTVQILG-AEKA 665
GE AR + AG+++ L+ S+V+ + + R L + K+ Q++ AE+A
Sbjct: 88 GEYATARAASAAGTIMTLSSWATSSVEEVASTGPGIRFFQLYVYKNRNVVEQLVRRAERA 147
Query: 666 LFRALKTKRDTPKYG 680
F+A+ DTP+ G
Sbjct: 148 GFKAIALTVDTPRLG 162
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 31.4 bits (71), Expect = 1.8
Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 7/67 (10%)
Query: 113 SGSSSNAAAKHFHNSFVINIKKPKK-----SSSSSKHHHHATSKKNASVTHATPSVETGH 167
SG S ++ K S + K +K SS+S +H + N + + P + G
Sbjct: 836 SGPFSASSTKSSSKS--SSTSKHRKTEGKGSSTSKEHKGSSGDTPNKASSFPVPPLSNGS 893
Query: 168 TKPLRAK 174
+KP R K
Sbjct: 894 SKPRRPK 900
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 31.7 bits (71), Expect = 1.9
Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 3/66 (4%)
Query: 297 EFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEE 356
E AA+E+PA P + + L S EP D P D K K+K ++D
Sbjct: 750 EPSAAQESPANP--WPRAPPCDEQEPLSVSPYGPEP-DRPPDDDFETRKGLKRKSSEDHA 806
Query: 357 EPVEEG 362
+P+ EG
Sbjct: 807 DPIPEG 812
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 31.6 bits (72), Expect = 1.9
Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 9/166 (5%)
Query: 223 MARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLE---EGNLRRLSGTTKAKA 279
+ + L ++ LGE+ + + +LE ++ LE R L A+
Sbjct: 267 LEEIEQLLEELNKKIKDLGEEEQLRVKEK-IGELEAEIASLERSIAEKERELE---DAEE 322
Query: 280 KLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAA 339
+L K + + E + E + + + + L+ + E D+ A
Sbjct: 323 RLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAE 382
Query: 340 --DVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
D ++K +K + E ++ + + ++ ++ D+
Sbjct: 383 TRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAA 428
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 29.3 bits (66), Expect = 2.1
Identities = 12/39 (30%), Positives = 17/39 (43%)
Query: 345 KKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
K K+ K ++ + G E K K KK KKD +
Sbjct: 28 KPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKG 66
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 31.4 bits (71), Expect = 2.3
Identities = 32/117 (27%), Positives = 45/117 (38%), Gaps = 24/117 (20%)
Query: 300 AAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPV 359
AE+ T + KKK E A + L K K +K +
Sbjct: 7 EAEKKILTEEELERKKKK-----------------EEKAKEKELKKLKAAQKEAKAKLQA 49
Query: 360 EEGGEEEEVSKKKKKK-KKKDVEDET-----DPLNHVGLAHSLSRYKLK-FSPDKVD 409
++ + V KK +KK +K+DVEDE DP G LS K +SP V+
Sbjct: 50 QQASDGTNVPKKSEKKSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYSPSAVE 106
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 30.8 bits (69), Expect = 2.3
Identities = 24/117 (20%), Positives = 35/117 (29%), Gaps = 2/117 (1%)
Query: 274 TTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPE 333
T + + LE+ + K A E + +KK K K E
Sbjct: 115 TAEQRQLLEQMQAD--MRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVE 172
Query: 334 DEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHV 390
E A+ V +K+K+K + E E K K K K E
Sbjct: 173 TEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKADKADKTKPKPKEKAERAAA 229
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 31.2 bits (70), Expect = 2.3
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 10/74 (13%)
Query: 321 KNLDTSIVKAEPEDE---PAAADVSLSKKKKKKKTQDEEEP-------VEEGGEEEEVSK 370
+NL +SI E + E P D+ L ++++ + +DEE V+E EEEE +
Sbjct: 115 ENLPSSIAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174
Query: 371 KKKKKKKKDVEDET 384
K K D+E +T
Sbjct: 175 LKTMKDFIDLESQT 188
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 29.6 bits (67), Expect = 2.7
Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 9/58 (15%)
Query: 329 KAEPEDEPAAAD----VSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVED 382
+ E +++ A D V+++KK ++ KT +E + K+KKKKKKK++++
Sbjct: 37 EKEEKEKEAEPDEDGWVTVTKKGRRPKTARKE-----SVAAKAAEKEKKKKKKKELKN 89
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 30.7 bits (69), Expect = 2.7
Identities = 19/93 (20%), Positives = 39/93 (41%), Gaps = 6/93 (6%)
Query: 290 EKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK 349
++ + + + A+ P DTSS + K+ +N I KA+ E + + K
Sbjct: 246 QRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEA------LK 299
Query: 350 KKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVED 382
K + +E E+ ++ K+ + +K E
Sbjct: 300 AKDHKAFDLKQESKASEKEAEDKELEAQKKREP 332
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 29.8 bits (67), Expect = 2.9
Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 3/60 (5%)
Query: 328 VKAEP-EDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
VK E ED+ + + ++ + +DEEE VE+ EEEE ++ KD+E +
Sbjct: 43 VKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEE--DEEDNVDLKDIEKKNIN 100
>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
protein; Provisional.
Length = 196
Score = 29.9 bits (67), Expect = 3.0
Identities = 14/52 (26%), Positives = 26/52 (50%)
Query: 327 IVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
+VKA+ E + AA +L+K K+ E E G+ EV+ ++ + +
Sbjct: 142 VVKAKEEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEGE 193
>gnl|CDD|223928 COG0859, RfaF, ADP-heptose:LPS heptosyltransferase [Cell envelope
biogenesis, outer membrane].
Length = 334
Score = 30.4 bits (69), Expect = 3.6
Identities = 10/59 (16%), Positives = 20/59 (33%), Gaps = 2/59 (3%)
Query: 149 TSKKNASVTHATPSVETGHTKPLRAKRKHQLKLKRFKQTLCRRDF--CSNTQILMKIIL 205
K A + P ++ + K + +TL + + + Q L+K L
Sbjct: 38 VPKGFAPILKLNPEIDKVIIIDKKKKGLGLKERLALLRTLRKERYDAVIDLQGLLKSAL 96
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 29.2 bits (66), Expect = 3.9
Identities = 15/64 (23%), Positives = 26/64 (40%), Gaps = 13/64 (20%)
Query: 314 GKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKK 373
G KK+ + + V+ E DE D L K+ K K E+ + ++
Sbjct: 95 GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLK-------------REKRRENER 141
Query: 374 KKKK 377
K+K+
Sbjct: 142 KQKE 145
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.0 bits (68), Expect = 4.2
Identities = 22/90 (24%), Positives = 29/90 (32%), Gaps = 20/90 (22%)
Query: 290 EKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK 349
K+ E P D KKKK + ED AA + +K
Sbjct: 207 LYKELLKSLLIPEFKPL--DKYLKESKKKKRET---------EEDVEAA-----ESRAEK 250
Query: 350 KKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
K+ EE ++ E SK K KK
Sbjct: 251 KRKSKEEIKKKKPKE----SKGVKALKKVV 276
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 28.4 bits (64), Expect = 4.4
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 345 KKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
++KKKKK + E E E E++EVS+++K +K
Sbjct: 39 REKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
>gnl|CDD|166620 PLN02979, PLN02979, glycolate oxidase.
Length = 366
Score = 30.1 bits (67), Expect = 4.4
Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 2/75 (2%)
Query: 608 GELVGARLVAQAGSLLNLAKHPASTVQILGAEKALFRSL-LNLAKHPASTVQILG-AEKA 665
GE AR + AG+++ L+ S+V+ + + R L + K+ Q++ AE+A
Sbjct: 87 GEYATARAASAAGTIMTLSSWATSSVEEVASTGPGIRFFQLYVYKNRNVVEQLVRRAERA 146
Query: 666 LFRALKTKRDTPKYG 680
F+A+ DTP+ G
Sbjct: 147 GFKAIALTVDTPRLG 161
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 30.1 bits (68), Expect = 4.6
Identities = 18/108 (16%), Positives = 27/108 (25%), Gaps = 19/108 (17%)
Query: 298 FDAAEETPATPDTSSSGKKKKKNKNLDTSI-------------VKAEPEDEPAAADVSLS 344
D+ + A G K + D I + E + +
Sbjct: 5 LDSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRN 64
Query: 345 KKKKKKKTQDEEEPVEEGG------EEEEVSKKKKKKKKKDVEDETDP 386
KK K T+ EE E S K K +E +
Sbjct: 65 PKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEE 112
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 29.6 bits (67), Expect = 4.9
Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 8/82 (9%)
Query: 278 KAKLEKYHGKRLEKKKFKTE-FDAAEETPATPDTSSSGKKKKKNKNLDTSIVKA------ 330
KA+LE+ K+ +KKK K + A + A SS+ + + K
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRS 232
Query: 331 -EPEDEPAAADVSLSKKKKKKK 351
P++E + SL KK+K
Sbjct: 233 IAPDNEKSEVYKSLFTSHKKEK 254
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 29.9 bits (68), Expect = 5.5
Identities = 25/134 (18%), Positives = 48/134 (35%), Gaps = 9/134 (6%)
Query: 278 KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
+A+LE+ R + K E AA++ A + K KK I D A
Sbjct: 459 QARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSA 518
Query: 338 AADVSLSKKKKKKKTQDEEEPVEEGGEEEEV---------SKKKKKKKKKDVEDETDPLN 388
++K + + Q E++ ++ +KK ++ +E
Sbjct: 519 VIAAREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPK 578
Query: 389 HVGLAHSLSRYKLK 402
+A +++R K K
Sbjct: 579 KAAVAAAIARAKAK 592
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 30.1 bits (68), Expect = 5.5
Identities = 20/88 (22%), Positives = 30/88 (34%), Gaps = 11/88 (12%)
Query: 295 KTEFDAAEE-TPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQ 353
KT + A++ KK N D++ + AA + KK KTQ
Sbjct: 736 KTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQ 795
Query: 354 DEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
+ +K K+ KKK E
Sbjct: 796 ----------KIAAATKAKRAAKKKVAE 813
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.8 bits (67), Expect = 5.7
Identities = 12/41 (29%), Positives = 22/41 (53%)
Query: 343 LSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
K + K D E E +EE++ +++ +KK+K+ DE
Sbjct: 382 TKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADE 422
>gnl|CDD|178608 PLN03046, PLN03046, D-glycerate 3-kinase; Provisional.
Length = 460
Score = 29.9 bits (67), Expect = 5.7
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 3/64 (4%)
Query: 107 LYPYSWSGSSSNAAAKHFHNSFVINIKKPKKSSSSSKHHHHATSKKNASVTHATPSVETG 166
YP +S AA S N+K+ S SS K + + ++ + S +T
Sbjct: 26 DYPIP---CASAAAFNFNPISNPFNLKRRPYSPSSHKFNDRVAASCSSYPSSKLSSRKTH 82
Query: 167 HTKP 170
+ P
Sbjct: 83 LSCP 86
>gnl|CDD|178937 PRK00227, glnD, PII uridylyl-transferase; Provisional.
Length = 693
Score = 29.7 bits (67), Expect = 5.7
Identities = 13/31 (41%), Positives = 17/31 (54%)
Query: 609 ELVGARLVAQAGSLLNLAKHPASTVQILGAE 639
E VGA +VA+A + L + VQ L AE
Sbjct: 423 EQVGAEMVARAARRMGLNLRDRAVVQTLVAE 453
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 29.4 bits (66), Expect = 6.1
Identities = 9/63 (14%), Positives = 18/63 (28%), Gaps = 6/63 (9%)
Query: 324 DTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
D ++ P A L +E +P E EE+ + + + +
Sbjct: 30 DFVGIELVPLAVFLLAAKVLEAP------TEEPQPEPEPPEEQPKPPTEPETPPEPTPPK 83
Query: 384 TDP 386
Sbjct: 84 PKE 86
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 29.2 bits (66), Expect = 6.7
Identities = 22/103 (21%), Positives = 40/103 (38%), Gaps = 10/103 (9%)
Query: 282 EKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPED------- 334
EK + K+ E +E A + + +K+K ++ T I+ P D
Sbjct: 137 EKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILT--PADFAKIQEL 194
Query: 335 -EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKK 376
D +L K K++ E +E + +++ KKKK
Sbjct: 195 RLEKGVDKALGGKLKRRDKDAPERHSDELVDADDIEGPAKKKK 237
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 29.0 bits (65), Expect = 7.0
Identities = 20/93 (21%), Positives = 42/93 (45%)
Query: 291 KKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKK 350
+K F + +E K+++ K K + EP D++ ++K++
Sbjct: 28 QKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEI 87
Query: 351 KTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
++E+E E +++EV K+KK K++ E
Sbjct: 88 AQEEEKEEEAEDVKQQEVFSFKRKKPFKEMNLE 120
>gnl|CDD|235205 PRK04028, PRK04028, glutamyl-tRNA(Gln) amidotransferase subunit E;
Validated.
Length = 630
Score = 29.8 bits (68), Expect = 7.0
Identities = 21/82 (25%), Positives = 42/82 (51%), Gaps = 10/82 (12%)
Query: 532 DKTSASDLSEILPEDVEEKVKEAAEISMGTEISDDDIENILLLCDQVLEISSYRGQLYDY 591
K + ++ + L E+ + +EAAE +S++++E I D+++E + D+
Sbjct: 537 AKEAIEEILKELAENPGKSAEEAAEELGLKGLSEEEVEKI---IDEIVE------ENIDF 587
Query: 592 LKSR-MMAVAPNLTILMGELVG 612
+K + M A P + +M EL G
Sbjct: 588 VKEKGMGAFGPLMGEVMAELRG 609
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 28.8 bits (65), Expect = 7.2
Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 4/80 (5%)
Query: 276 KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTS-SSGKKKKKNKNLDTSIV---KAE 331
+ K + K K+ K+ K E E + S KK +N + +IV
Sbjct: 12 RRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEF 71
Query: 332 PEDEPAAADVSLSKKKKKKK 351
+ E A D+ L KKKKKKK
Sbjct: 72 ADGEQAKKDLKLKKKKKKKK 91
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 29.6 bits (66), Expect = 7.2
Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 4/57 (7%)
Query: 327 IVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
+V+ E E D +KK + + + E V+EG E KKKK KK K+V E
Sbjct: 206 MVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEG----KKKKTKKVKEVTKE 258
>gnl|CDD|240059 cd04708, BAH_plantDCM_II, BAH, or Bromo Adjacent Homology domain,
second copy present in DNA
(Cytosine-5)-methyltransferases (DCM) from plants. DNA
methylation, or the covalent addition of a methyl group
to cytosine within the context of the CpG dinucleotide,
has profound effects on the genome. These effects
include transcriptional repression via inhibition of
transcription factor binding, the recruitment of
methyl-binding proteins and their associated chromatin
remodeling factors, X chromosome inactivation,
imprinting, and the suppression of parasitic DNA
sequences. DNA methylation is also essential for proper
embryonic development and is an important player in both
DNA repair and genome stability. BAH domains are found
in a variety of proteins playing roles in
transcriptional silencing and the remodeling of
chromatin. It is assumed that in most or all of these
instances the BAH domain mediates protein-protein
interactions.
Length = 202
Score = 29.0 bits (65), Expect = 7.3
Identities = 9/37 (24%), Positives = 14/37 (37%)
Query: 331 EPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEE 367
+ E A S +K+K K D E E + +
Sbjct: 150 KEEAYSTGASDSALRKRKGKGKGDSESDSEAPVKAPK 186
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 29.2 bits (65), Expect = 7.7
Identities = 25/111 (22%), Positives = 38/111 (34%), Gaps = 8/111 (7%)
Query: 276 KAKAKLE-KYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPED 334
KAKA K L K+K + + EE A ++ K K L + E
Sbjct: 94 KAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEV 153
Query: 335 EPAAADVSLSKKKKKKKTQDE-------EEPVEEGGEEEEVSKKKKKKKKK 378
+ K K K + ++ E GE E +++K K K
Sbjct: 154 TEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAK 204
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 28.9 bits (65), Expect = 7.9
Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 9/92 (9%)
Query: 299 DAAEETPATPDTSSSGKKKKKNKNLDTSIV---KAEPEDEPAAADVSLSKKKKKKKTQDE 355
D E P P + K K +++ D + E ++E ++ K K KK
Sbjct: 5 DDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKA--- 61
Query: 356 EEPVEEGGEEEEVSKKKKKKKKKDVEDETDPL 387
++ EE+E +K++K++K +E P
Sbjct: 62 ---LKAKIEEKEKAKREKEEKGLRELEEDTPE 90
>gnl|CDD|214617 smart00323, RasGAP, GTPase-activator protein for Ras-like GTPases.
All alpha-helical domain that accelerates the GTPase
activity of Ras, thereby "switching" it into an "off"
position. Improved domain limits from structure.
Length = 344
Score = 29.2 bits (66), Expect = 7.9
Identities = 12/51 (23%), Positives = 27/51 (52%), Gaps = 7/51 (13%)
Query: 539 LSEILPEDVEEKVKEAAEIS---MGTEISDDDIENILLLCDQVLE--ISSY 584
L +L + VE K ++ E+ + E + ++EN+L +++ + I+S
Sbjct: 119 LKPVLKKIVESK--KSCEVDPAKLEGEDLETNLENLLQYVERLFDAIINSS 167
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 29.3 bits (66), Expect = 7.9
Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 6/71 (8%)
Query: 331 EPEDEPAAADVSLSKKKKKKKTQDEE------EPVEEGGEEEEVSKKKKKKKKKDVEDET 384
+ + KK KK K ++ E + E E E KKKKKKK + +
Sbjct: 10 VKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKK 69
Query: 385 DPLNHVGLAHS 395
+ LA+
Sbjct: 70 NLGEAYDLAYD 80
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 28.9 bits (65), Expect = 8.0
Identities = 29/118 (24%), Positives = 42/118 (35%), Gaps = 8/118 (6%)
Query: 268 LRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTS- 326
LR L +K +AK K E KK K D+ E + K++KK + TS
Sbjct: 94 LRELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKP-KEPGLKRKQRKKAQESATSP 152
Query: 327 ---IVKAEPEDEPAAADVSLSK---KKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
P+ + +K ++ KK G E+ SK KK K
Sbjct: 153 ESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGK 210
>gnl|CDD|239124 cd02659, peptidase_C19C, A subfamily of Peptidase C19. Peptidase
C19 contains ubiquitinyl hydrolases. They are
intracellular peptidases that remove ubiquitin molecules
from polyubiquinated peptides by cleavage of isopeptide
bonds. They hydrolyze bonds involving the carboxyl group
of the C-terminal Gly residue of ubiquitin. The purpose
of the de-ubiquitination is thought to be editing of the
ubiquitin conjugates, which could rescue them from
degradation, as well as recycling of the ubiquitin. The
ubiquitin/proteasome system is responsible for most
protein turnover in the mammalian cell, and with over 50
members, family C19 is one of the largest families of
peptidases in the human genome.
Length = 334
Score = 29.1 bits (66), Expect = 8.3
Identities = 12/65 (18%), Positives = 21/65 (32%), Gaps = 13/65 (20%)
Query: 655 STVQIL----GAEKALFRALKTKRDTPKYGLIYHSQLI---------GQSSTKNKGKMGS 701
S +Q L A++ T+ D + Q + +T+ K S
Sbjct: 16 SLLQQLYMTPEFRNAVYSIPPTEDDDDNKSVPLALQRLFLFLQLSESPVKTTELTDKTRS 75
Query: 702 YGWQP 706
+GW
Sbjct: 76 FGWDS 80
>gnl|CDD|227753 COG5466, COG5466, Predicted small metal-binding protein [Function
unknown].
Length = 59
Score = 26.7 bits (59), Expect = 8.4
Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 4/38 (10%)
Query: 534 TSASDLSEILPEDVEEKVKEAAEISMGTEISDDDIENI 571
A +E++ VE + E TEI +D I I
Sbjct: 19 ARADSEAEVMRRIVEHAKEAHGE----TEIREDMINKI 52
>gnl|CDD|187818 cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily
protein Cas7. CRISPR (Clustered Regularly Interspaced
Short Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Cas7 is a RAMP superfamily protein; Subunit of the
Cascade complex; also known as Cst2/DevR family.
Length = 302
Score = 28.9 bits (65), Expect = 9.3
Identities = 11/60 (18%), Positives = 20/60 (33%)
Query: 349 KKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLKFSPDKV 408
K EE E K++K+K+ + + ++ +R SP V
Sbjct: 155 GKFEVGIEEKKCNPRGRLEYRLPKEEKEKRRANNLLKAIRNLSGGAKQARRLEDLSPKFV 214
>gnl|CDD|199835 cd10277, PQQ_ADH_I, Ethanol dehydrogenase, a bacterial quinoprotein
(PQQ-dependent type I alcohol dehydrogenase). This
bacterial family of homodimeric ethanol dehydrogenases
utilize pyrroloquinoline quinone (PQQ) as a cofactor. It
represents proteins whose expression may be induced by
ethanol, and which are similar to quinoprotein methanol
dehydrogenases, but have higher specificities for
ethanol and other primary and secondary alcohols.
Dehydrogenases with PQQ cofactors, such as ethanol,
methanol, and membrane-bound glucose dehydrogenases,
form an 8-bladed beta-propeller.
Length = 529
Score = 29.2 bits (66), Expect = 9.4
Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 418 LLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNVA 453
LL+D KE N++ R + G + L +I TDNV
Sbjct: 1 LLND-AKETGNWLTYGRGYNGQRYSPLKQINTDNVK 35
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 29.0 bits (65), Expect = 9.5
Identities = 17/117 (14%), Positives = 38/117 (32%), Gaps = 25/117 (21%)
Query: 287 KRLEKK-KFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSK 345
KR ++ K + + P+ P+ ++ I K + +
Sbjct: 290 KRTTRRVKMRPVRAKPSDEPSLPE-----------SDIHEEIPKLDEKSLSEFLGYMGGI 338
Query: 346 KKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLK 402
+ + DEE E +++ K +K+K +++ R KL+
Sbjct: 339 DEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPV-------------SNNFRRLKLR 382
>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
This family represents homologues of COX16 which has
been shown to be involved in assembly of cytochrome
oxidase. Protein in this family are typically between
106 and 134 amino acids in length.
Length = 79
Score = 26.7 bits (60), Expect = 9.5
Identities = 14/43 (32%), Positives = 22/43 (51%), Gaps = 8/43 (18%)
Query: 341 VSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
L +++ +K Q EE EE K KK++K D+E+E
Sbjct: 22 TQLRYERRDRKVQQLEE--------EEALKLLKKRRKVDLEEE 56
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 27.0 bits (60), Expect = 9.6
Identities = 14/46 (30%), Positives = 24/46 (52%)
Query: 341 VSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
+ + KKKKKKK +++ + +EEE + K+ EDE +
Sbjct: 17 IDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNE 62
>gnl|CDD|218242 pfam04751, DUF615, Protein of unknown function (DUF615). This
family of bacterial proteins has no known function.
Length = 153
Score = 28.3 bits (64), Expect = 9.8
Identities = 16/60 (26%), Positives = 27/60 (45%), Gaps = 9/60 (15%)
Query: 614 RLVAQAGSLLN--LAKHPASTVQILGAEKALFRSLLNLAKHPASTVQILGAEKALFRALK 671
RL+A S L L ++P + Q L R L+ A+ + + A + LF+ L+
Sbjct: 100 RLIADGDSALEEFLEEYPDADRQQL-------RQLIRNARKEKAANKPPKAYRELFQYLR 152
>gnl|CDD|204610 pfam11207, DUF2989, Protein of unknown function (DUF2989). Some
members in this bacterial family of proteins are
annotated as lipoproteins however this cannot be
confirmed.
Length = 203
Score = 28.4 bits (64), Expect = 10.0
Identities = 30/99 (30%), Positives = 42/99 (42%), Gaps = 11/99 (11%)
Query: 27 NLYESFQTPQGANKVLKLKHFEKFIDTTEALAATTAAVEGKLCKKLK-KVLKSLVSSDVQ 85
+ YE + P NK L EK+ E LAA +E K+ K +++L+ S
Sbjct: 41 HRYELKKNPTDKNKYKLLIALEKYSKCIE-LAAG---IEHIKLKERKTDRVRALLHSIQN 96
Query: 86 ENLLVADTKRAAYPSLGTTVLLYPYSWSGSSSNAAAKHF 124
L +TK + P LLY Y WS AA + F
Sbjct: 97 LERLQEETKDSNDPY-----LLY-YHWSRFGDQAALRQF 129
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.128 0.357
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,327,010
Number of extensions: 3484050
Number of successful extensions: 6242
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5145
Number of HSP's successfully gapped: 351
Length of query: 716
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 612
Effective length of database: 6,324,786
Effective search space: 3870769032
Effective search space used: 3870769032
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 63 (28.2 bits)