RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7773
(1281 letters)
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 492 bits (1267), Expect = e-154
Identities = 254/596 (42%), Positives = 363/596 (60%), Gaps = 36/596 (6%)
Query: 488 SEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEATYYSIAHTYHEKVTEQASIMVNGKLK 547
D+ KK + K TE++E ++ +E + + + I GK++
Sbjct: 114 KGDQSASAKKAKGRGRHASKLTEEEEDEEYLKEEEDGLGGSGGTRLLVQPSCI--KGKMR 171
Query: 548 EYQIKGLEWMVSLYNNNLNGILADEMGLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTL 607
+YQ+ GL W++ LY N +NGILADEMGLGKT+QTI+LL YL E + GP++V+ P STL
Sbjct: 172 DYQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLGYLHEYRGITGPHMVVAPKSTL 231
Query: 608 SNWVLEFEKWAPSANVVAYKGSPLARRALQGR-MKNSKFNVLLTTYEFIIKDKSVLSKIQ 666
NW+ E ++ P V + G+P R + + KF+V +T++E IK+K+ L +
Sbjct: 232 GNWMNEIRRFCPVLRAVKFHGNPEERAHQREELLVAGKFDVCVTSFEMAIKEKTALKRFS 291
Query: 667 WKCMIIDEGHRMKNHHCKLTQTLNTFYNSSHRVLLTGTPLQNKLPELWALLNFLLPSIFK 726
W+ +IIDE HR+KN + L++T+ F ++++R+L+TGTPLQN L ELWALLNFLLP IF
Sbjct: 292 WRYIIIDEAHRIKNENSLLSKTMRLF-STNYRLLITGTPLQNNLHELWALLNFLLPEIFS 350
Query: 727 SCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPEKVEY 786
S TF++WF +GE + ++++LHKVLRPFLLRRLK +VE LP K E
Sbjct: 351 SAETFDEWF----QISGE------NDQQEVVQQLHKVLRPFLLRRLKSDVEKGLPPKKET 400
Query: 787 IIKCDMSGLQKLLYQHLHNRGILLTDGSEKGKTGEGGARA-LKNTIVQLRKLCNHPFMFQ 845
I+K MS +QK Y+ L + + + + GG R L N +QLRK CNHP++FQ
Sbjct: 401 ILKVGMSQMQKQYYKALLQKDLDVVNA--------GGERKRLLNIAMQLRKCCNHPYLFQ 452
Query: 846 NIEEKYCQHIGQNNTVVSGPDIYRTSGKFELLDRILPKFKETKHRVLMFCQMTQLMTILE 905
E +G + SGK LLD++LPK KE RVL+F QMT+L+ ILE
Sbjct: 453 GAEP--------GPPYTTGEHLVENSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILE 504
Query: 906 DYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVII 965
DY ++R + Y R+DG T EDR + FN+P S+ F+FLLSTRAGGLG+NL AD VI+
Sbjct: 505 DYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKFVFLLSTRAGGLGINLATADIVIL 564
Query: 966 FDSDWNPHQDLQAQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAG- 1024
+DSDWNP DLQAQDRAHRIGQK EV+V R T ++EE+++ A KL +D VIQ G
Sbjct: 565 YDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTIEEKVIERAYKKLALDALVIQQGR 624
Query: 1025 MFDQKSTGSERHEFLQTILHQDDEE-EEENEVPDDETVNQLIARTEDELT-LFQKM 1078
+ +QK+ + E LQ + + + ++ DE ++++IA+ E+ L KM
Sbjct: 625 LAEQKTVNKD--ELLQMVRYGAEMVFSSKDSTITDEDIDRIIAKGEEATAELDAKM 678
Score = 315 bits (808), Expect = 3e-90
Identities = 173/382 (45%), Positives = 228/382 (59%), Gaps = 34/382 (8%)
Query: 1 TPLQNKLPELWALLNFLLPSIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKV 60
TPLQN L ELWALLNFLLP IF S TF++WF +GE + ++++LHKV
Sbjct: 328 TPLQNNLHELWALLNFLLPEIFSSAETFDEWF----QISGE------NDQQEVVQQLHKV 377
Query: 61 LRPFLLRRLKKEVESQLPEKVEYIIKCDMSGLQKLLYQHLHNRGILLTDGSEKGKTGEGG 120
LRPFLLRRLK +VE LP K E I+K MS +QK Y+ L + + + + GG
Sbjct: 378 LRPFLLRRLKSDVEKGLPPKKETILKVGMSQMQKQYYKALLQKDLDVVNA--------GG 429
Query: 121 ARA-LKNTIVQLRKLCNHPFMFQNIEEKYCQHIGQNNTVVSGPDIYRTSGKFELLDRILP 179
R L N +QLRK CNHP++FQ E +G + SGK LLD++LP
Sbjct: 430 ERKRLLNIAMQLRKCCNHPYLFQGAEP--------GPPYTTGEHLVENSGKMVLLDKLLP 481
Query: 180 KFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYF 239
K KE RVL+F QMT+L+ ILEDY ++R + Y R+DG T EDR + FN+P S+ F
Sbjct: 482 KLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKF 541
Query: 240 IFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQDLQAQDRAHRIGQKNEVRVLRLITVNSV 299
+FLLSTRAGGLG+NL AD VI++DSDWNP DLQAQDRAHRIGQK EV+V R T ++
Sbjct: 542 VFLLSTRAGGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTI 601
Query: 300 EERILAAARYKLNMDEKEYHRNNTARTQKLNK----AIINYHINAEKEQKKEQERIEKER 355
EE+++ A KL +D + A + +NK ++ Y AE + I E
Sbjct: 602 EEKVIERAYKKLALDALVIQQGRLAEQKTVNKDELLQMVRY--GAEMVFSSKDSTITDED 659
Query: 356 MRRLMAEDEEGYRKLIDQKKDK 377
+ R++A+ EE +L D K K
Sbjct: 660 IDRIIAKGEEATAEL-DAKMKK 680
>gnl|CDD|215770 pfam00176, SNF2_N, SNF2 family N-terminal domain. This domain is
found in proteins involved in a variety of processes
including transcription regulation (e.g., SNF2, STH1,
brahma, MOT1), DNA repair (e.g. ERCC6, RAD16, RAD5), DNA
recombination (e.g. RAD54), and chromatin unwinding
(e.g. ISWI) as well as a variety of other proteins with
little functional information (e.g. lodestar, ETL1).
Length = 301
Score = 377 bits (971), Expect = e-121
Identities = 149/302 (49%), Positives = 200/302 (66%), Gaps = 7/302 (2%)
Query: 549 YQIKGLEWMVSLYNNNLNGILADEMGLGKTVQTIALL-TYLMEKKKNLGPYLVIVPLSTL 607
YQ++G+ W++SL +N L GILADEMGLGKT+QTIALL TYL E K GP LV+ PLSTL
Sbjct: 1 YQLEGVNWLISLESNGLGGILADEMGLGKTLQTIALLATYLKEGKDRRGPTLVVCPLSTL 60
Query: 608 SNWVLEFEKWAPSANVVAYKGSPLARRALQGRM--KNSKFNVLLTTYEFIIKDK---SVL 662
NW+ EFEKWAP+ VV Y G R L+ M + ++V++TTYE + KDK S+L
Sbjct: 61 HNWLNEFEKWAPALRVVVYHGDGRERSKLRQSMAKRLDTYDVVITTYEVLRKDKKLLSLL 120
Query: 663 SKIQWKCMIIDEGHRMKNHHCKLTQTLNTFYNSSHRVLLTGTPLQNKLPELWALLNFLLP 722
+K++W +++DE HR+KN KL + L + +R+LLTGTP+QN L ELWALLNFL P
Sbjct: 121 NKVEWDRVVLDEAHRLKNSKSKLYKALKKL-KTRNRLLLTGTPIQNNLEELWALLNFLRP 179
Query: 723 SIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPE 782
F S FE+WFN P A T + N E+ I RLHK+L+PFLLRR K +VE LP
Sbjct: 180 GPFGSFKVFEEWFNIPIANTADNKNKNLEKGKEGINRLHKLLKPFLLRRTKDDVEKSLPP 239
Query: 783 KVEYIIKCDMSGLQKLLYQHLHNRGILLTDGSEKGKTGEGGARALKNTIVQLRKLCNHPF 842
K E+++ C++S Q+ LY+ L + L + +G G +L N I+QLRK+CNHP+
Sbjct: 240 KTEHVLYCNLSDEQRKLYKKLLTKRRLALSFAVEGGEKNVGIASLLNLIMQLRKICNHPY 299
Query: 843 MF 844
+F
Sbjct: 300 LF 301
Score = 178 bits (454), Expect = 7e-50
Identities = 67/141 (47%), Positives = 89/141 (63%)
Query: 1 TPLQNKLPELWALLNFLLPSIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKV 60
TP+QN L ELWALLNFL P F S FE+WFN P A T + N E+ I RLHK+
Sbjct: 161 TPIQNNLEELWALLNFLRPGPFGSFKVFEEWFNIPIANTADNKNKNLEKGKEGINRLHKL 220
Query: 61 LRPFLLRRLKKEVESQLPEKVEYIIKCDMSGLQKLLYQHLHNRGILLTDGSEKGKTGEGG 120
L+PFLLRR K +VE LP K E+++ C++S Q+ LY+ L + L + +G G
Sbjct: 221 LKPFLLRRTKDDVEKSLPPKTEHVLYCNLSDEQRKLYKKLLTKRRLALSFAVEGGEKNVG 280
Query: 121 ARALKNTIVQLRKLCNHPFMF 141
+L N I+QLRK+CNHP++F
Sbjct: 281 IASLLNLIMQLRKICNHPYLF 301
>gnl|CDD|223627 COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family
[Transcription / DNA replication, recombination, and
repair].
Length = 866
Score = 326 bits (836), Expect = 2e-95
Identities = 244/776 (31%), Positives = 370/776 (47%), Gaps = 61/776 (7%)
Query: 296 VNSVEERILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKER 355
+N + R+L A L D K + L I I K + Q +E
Sbjct: 97 LNELALRVLIADEVGL-GDLKTIEAGAILKELLLRGEIKRVLILVPKTLR-AQWVVELLE 154
Query: 356 MR--RLMAEDEEGYRKLIDQ----KKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMR 409
RL D+EG R L+ Q +L D + ++ + E + +
Sbjct: 155 KFNIRLAVLDKEGLRYLLKQYDAYNPFSTEDLVLISLDLAKRSDSKRREALLEAEWGERD 214
Query: 410 EEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEW 469
+ ++ + L+ E L+ L + D K+ A P
Sbjct: 215 LLVIDEAHNLGSSEGTRKLAPLETLEYELLKQLAEKIPSKLLDLKVLLLSATPE------ 268
Query: 470 LASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEATYYSIAHT 529
++ ED + ++ ++T +D +E
Sbjct: 269 -------QLKEEDLFARLRLLDPLRLADLSQILEKFVRETLKLSARDLKDELKELLAELR 321
Query: 530 YHEKVTEQASIMVNGK-LKEYQIKGLEWMVS-LYNNNLNGILADEMGLGKTVQTIALLTY 587
E + + L+ YQ++G+ W+ L +N L GILAD+MGLGKTVQTIALL
Sbjct: 322 LSEDLLNAPEPVDLSAELRPYQLEGVNWLSELLRSNLLGGILADDMGLGKTVQTIALLLS 381
Query: 588 LMEKKK-NLGPYLVIVPLSTLSNWVLEFEKWAPSAN-VVAYKGSP----LARRALQGRMK 641
L+E K LGP L++VP S LSNW EFEK+AP V+ Y G R AL+ +K
Sbjct: 382 LLESIKVYLGPALIVVPASLLSNWKREFEKFAPDLRLVLVYHGEKSELDKKREALRDLLK 441
Query: 642 NSK---FNVLLTTYEFIIK---DKSVLSKIQWKCMIIDEGHRMKNHHCKLTQTLNTFYNS 695
F+V++TTYE + + D L KI+W +++DE HR+KN + L +
Sbjct: 442 LHLVIIFDVVITTYELLRRFLVDHGGLKKIEWDRVVLDEAHRIKNDQSSEGKALQFL-KA 500
Query: 696 SHRVLLTGTPLQNKLPELWALLN-FLLPSIFKSC-STFEQWFNAPFATTGEKVELNEEET 753
+R+ LTGTPL+N+L ELW+LL FL P + + + F + F P E+ E
Sbjct: 501 LNRLDLTGTPLENRLGELWSLLQEFLNPGLLGTSFAIFTRLFEKPIQA--EEDIGPLEAR 558
Query: 754 ILIIRRLHKVLRPFLLRRLKKEVE--SQLPEKVEYIIKCDMSGLQKLLYQHLHNRGI--- 808
L I L K+L PF+LRR K++VE +LP K+E +++C++S Q+ LY+ L
Sbjct: 559 ELGIELLRKLLSPFILRRTKEDVEVLKELPPKIEKVLECELSEEQRELYEALLEGAEKNQ 618
Query: 809 -----LLTDGSEKGKTGEGGARALKNTIVQLRKLCNHPFMFQN-IEEKYCQHI-----GQ 857
L S++ + G+ L + +LR++CNHP + +E + + + +
Sbjct: 619 QLLEDLEKADSDENRIGDSELNIL-ALLTRLRQICNHPALVDEGLEATFDRIVLLLREDK 677
Query: 858 NNTVVSGPDIYRTSGKFELLDRIL-PKFKETKH--RVLMFCQMTQLMTILEDYFIWRDFS 914
+ + P I + GK + LD +L K E H +VL+F Q T ++ +LEDY
Sbjct: 678 DFDYLKKPLIQLSKGKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIK 737
Query: 915 YLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQ 974
Y+RLDG+T ++ R EL+ +FN E + +FLLS +AGGLGLNL ADTVI+FD WNP
Sbjct: 738 YVRLDGSTPAKRRQELIDRFNADE-EEKVFLLSLKAGGLGLNLTGADTVILFDPWWNPAV 796
Query: 975 DLQAQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAGMFDQKS 1030
+LQA DRAHRIGQK V+V RLIT ++EE+IL K + + +I A + S
Sbjct: 797 ELQAIDRAHRIGQKRPVKVYRLITRGTIEEKILELQEKKQELLDSLIDAEGEKELS 852
Score = 199 bits (508), Expect = 1e-52
Identities = 128/338 (37%), Positives = 190/338 (56%), Gaps = 25/338 (7%)
Query: 1 TPLQNKLPELWALLN-FLLPSIFKSC-STFEQWFNAPFATTGEKVELNEEETILIIRRLH 58
TPL+N+L ELW+LL FL P + + + F + F P E+ E L I L
Sbjct: 509 TPLENRLGELWSLLQEFLNPGLLGTSFAIFTRLFEKPIQA--EEDIGPLEARELGIELLR 566
Query: 59 KVLRPFLLRRLKKEVE--SQLPEKVEYIIKCDMSGLQKLLYQHLHNRGI--------LLT 108
K+L PF+LRR K++VE +LP K+E +++C++S Q+ LY+ L L
Sbjct: 567 KLLSPFILRRTKEDVEVLKELPPKIEKVLECELSEEQRELYEALLEGAEKNQQLLEDLEK 626
Query: 109 DGSEKGKTGEGGARALKNTIVQLRKLCNHPFMFQN-IEEKYCQHI-----GQNNTVVSGP 162
S++ + G+ L + +LR++CNHP + +E + + + ++ + P
Sbjct: 627 ADSDENRIGDSELNIL-ALLTRLRQICNHPALVDEGLEATFDRIVLLLREDKDFDYLKKP 685
Query: 163 DIYRTSGKFELLDRIL-PKFKETKH--RVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTT 219
I + GK + LD +L K E H +VL+F Q T ++ +LEDY Y+RLDG+T
Sbjct: 686 LIQLSKGKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIKYVRLDGST 745
Query: 220 KSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQDLQAQDRA 279
++ R EL+ +FN E + +FLLS +AGGLGLNL ADTVI+FD WNP +LQA DRA
Sbjct: 746 PAKRRQELIDRFNADE-EEKVFLLSLKAGGLGLNLTGADTVILFDPWWNPAVELQAIDRA 804
Query: 280 HRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKE 317
HRIGQK V+V RLIT ++EE+IL K + +
Sbjct: 805 HRIGQKRPVKVYRLITRGTIEEKILELQEKKQELLDSL 842
>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily.
Length = 201
Score = 117 bits (294), Expect = 1e-29
Identities = 50/199 (25%), Positives = 83/199 (41%), Gaps = 14/199 (7%)
Query: 545 KLKEYQIKGLEWMVSLYNNNLNGILADEMGLGKT-VQTIALLTYLMEKKKNLGPYLVIVP 603
L+ YQ + +E + + + ILA G GKT + L L K G LV+VP
Sbjct: 8 PLRPYQKEAIEAL---LSGLRDVILAAPTGSGKTLAALLPALEALKRGKG--GRVLVLVP 62
Query: 604 LSTL-SNWVLEFEKWAPSANV-VAYKGSPLARRALQGRMKNSKFNVLLTTYEFIIKD--K 659
L W E +K PS + V ++R ++++ K ++L+TT ++
Sbjct: 63 TRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILVTTPGRLLDLLEN 122
Query: 660 SVLSKIQWKCMIIDEGHRMKN--HHCKLTQTLNTFYNSSHRVLLTGTPLQNKLPELWALL 717
LS +I+DE HR+ + +L + L + +LL+ TP + L L
Sbjct: 123 DKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLPKNVQLLLLSATPPEEIENLLELFL 182
Query: 718 N--FLLPSIFKSCSTFEQW 734
N + F EQ+
Sbjct: 183 NDPVFIDVGFTPLEPIEQF 201
>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
with DEXDc-, DEAD-, and DEAH-box proteins, yeast
initiation factor 4A, Ski2p, and Hepatitis C virus NS3
helicases; this domain is found in a wide variety of
helicases and helicase related proteins; may not be an
autonomously folding unit, but an integral part of the
helicase; 4 helicase superfamilies at present according
to the organization of their signature motifs; all
helicases share the ability to unwind nucleic acid
duplexes with a distinct directional polarity; they
utilize the free energy from nucleoside triphosphate
hydrolysis to fuel their translocation along DNA,
unwinding the duplex in the process.
Length = 131
Score = 112 bits (282), Expect = 6e-29
Identities = 36/128 (28%), Positives = 57/128 (44%), Gaps = 3/128 (2%)
Query: 164 IYRTSGKFELLDRILPKFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSED 223
+ K E L +L + + +VL+FC +++ L + L G E+
Sbjct: 7 LPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQEE 66
Query: 224 RGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQDLQAQDRAHRIG 283
R E+L+ F E E + L++T G++L VI +D W+P LQ RA R G
Sbjct: 67 REEVLKDFREGE---IVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRAG 123
Query: 284 QKNEVRVL 291
QK +L
Sbjct: 124 QKGTAILL 131
Score = 112 bits (282), Expect = 6e-29
Identities = 36/128 (28%), Positives = 57/128 (44%), Gaps = 3/128 (2%)
Query: 867 IYRTSGKFELLDRILPKFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSED 926
+ K E L +L + + +VL+FC +++ L + L G E+
Sbjct: 7 LPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQEE 66
Query: 927 RGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQDLQAQDRAHRIG 986
R E+L+ F E E + L++T G++L VI +D W+P LQ RA R G
Sbjct: 67 REEVLKDFREGE---IVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRAG 123
Query: 987 QKNEVRVL 994
QK +L
Sbjct: 124 QKGTAILL 131
>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
of proteins involved in ATP-dependent RNA or DNA
unwinding. This domain contains the ATP-binding region.
Length = 144
Score = 104 bits (262), Expect = 4e-26
Identities = 36/145 (24%), Positives = 65/145 (44%), Gaps = 7/145 (4%)
Query: 566 NGILADEMGLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSNWVLEFEKWAPSANV-V 624
+ +LA G GKT+ + + L++ K G LV+ P L+N V E K + V
Sbjct: 2 DVLLAAPTGSGKTLAALLPILELLDSLKG-GQVLVLAPTRELANQVAERLKELFGEGIKV 60
Query: 625 AYKGSPLARRALQGRMKNSKFNVLLTTYEFIIKD--KSVLSKIQWKCMIIDEGHRMKNHH 682
Y + + Q ++ + K ++++ T ++ + + LS + +I+DE HR+ N
Sbjct: 61 GYLIGGTSIKQ-QEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLILDEAHRLLNQG 119
Query: 683 CKLTQ--TLNTFYNSSHRVLLTGTP 705
L L +LL+ TP
Sbjct: 120 FGLLGLKILLKLPKDRQVLLLSATP 144
>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain. The
Prosite family is restricted to DEAD/H helicases,
whereas this domain family is found in a wide variety of
helicases and helicase related proteins. It may be that
this is not an autonomously folding unit, but an
integral part of the helicase.
Length = 78
Score = 96.8 bits (242), Expect = 4e-24
Identities = 26/81 (32%), Positives = 36/81 (44%), Gaps = 3/81 (3%)
Query: 203 DYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVII 262
RL G E+R E+L F +SK L++T G G++L + VI
Sbjct: 1 KLLRKPGIKVARLHGGLSQEEREEILEDFRNGKSK---VLVATDVAGRGIDLPDVNLVIN 57
Query: 263 FDSDWNPHQDLQAQDRAHRIG 283
+D WNP +Q RA R G
Sbjct: 58 YDLPWNPASYIQRIGRAGRAG 78
Score = 96.8 bits (242), Expect = 4e-24
Identities = 26/81 (32%), Positives = 36/81 (44%), Gaps = 3/81 (3%)
Query: 906 DYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVII 965
RL G E+R E+L F +SK L++T G G++L + VI
Sbjct: 1 KLLRKPGIKVARLHGGLSQEEREEILEDFRNGKSK---VLVATDVAGRGIDLPDVNLVIN 57
Query: 966 FDSDWNPHQDLQAQDRAHRIG 986
+D WNP +Q RA R G
Sbjct: 58 YDLPWNPASYIQRIGRAGRAG 78
>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain.
Length = 82
Score = 95.7 bits (239), Expect = 1e-23
Identities = 29/84 (34%), Positives = 39/84 (46%), Gaps = 3/84 (3%)
Query: 200 ILEDYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADT 259
L + RL G E+R E+L KFN + K L++T GL+L D
Sbjct: 2 ELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIK---VLVATDVAERGLDLPGVDL 58
Query: 260 VIIFDSDWNPHQDLQAQDRAHRIG 283
VII+D W+P +Q RA R G
Sbjct: 59 VIIYDLPWSPASYIQRIGRAGRAG 82
Score = 95.7 bits (239), Expect = 1e-23
Identities = 29/84 (34%), Positives = 39/84 (46%), Gaps = 3/84 (3%)
Query: 903 ILEDYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADT 962
L + RL G E+R E+L KFN + K L++T GL+L D
Sbjct: 2 ELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIK---VLVATDVAERGLDLPGVDL 58
Query: 963 VIIFDSDWNPHQDLQAQDRAHRIG 986
VII+D W+P +Q RA R G
Sbjct: 59 VIIYDLPWSPASYIQRIGRAGRAG 82
>gnl|CDD|214727 smart00573, HSA, domain in helicases and associated with SANT
domains.
Length = 73
Score = 53.9 bits (130), Expect = 4e-09
Identities = 21/48 (43%), Positives = 34/48 (70%)
Query: 314 DEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMA 361
D KE H+ A +K+ KA+++YH N EKE+++ +E+ EK R+R+L A
Sbjct: 26 DFKEEHKWKIAAAKKMAKAVMDYHQNKEKEEERREEKNEKRRLRKLAA 73
>gnl|CDD|203672 pfam07533, BRK, BRK domain. The function of this domain is
unknown. It is often found associated with helicases and
transcription factors.
Length = 45
Score = 48.4 bits (116), Expect = 2e-07
Identities = 15/43 (34%), Positives = 24/43 (55%)
Query: 437 SQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVV 479
S + R+ V + +L G+DAP K ++ WL +PG+EV
Sbjct: 1 SLDGEERVPVINRKTGKRLTGDDAPKLKDLERWLQENPGYEVD 43
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 54.7 bits (131), Expect = 2e-07
Identities = 44/136 (32%), Positives = 55/136 (40%), Gaps = 16/136 (11%)
Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAP-----------EDEEEEKP 1197
K IE A D E +E K S K+ K V E + ED +K
Sbjct: 50 KKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKE 109
Query: 1198 VKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEE 1257
KK RR + + + D E+E +KK RK K D+ ED + VS VEE
Sbjct: 110 SKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD-----VEE 164
Query: 1258 EEEVEEAEGEEEEEDD 1273
E V E E EEE D
Sbjct: 165 SEFVTSLENESEEELD 180
Score = 35.0 bits (80), Expect = 0.30
Identities = 20/78 (25%), Positives = 32/78 (41%), Gaps = 11/78 (14%)
Query: 1203 RRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE 1262
RKK++ + ++E DE+ K+K T++ K K ++ E EE
Sbjct: 48 SRKKIESALAVDEEPDENGAVSKKKPTRSV-----------KRATKKTVVEISEPLEEGS 96
Query: 1263 EAEGEEEEEDDKASKPRK 1280
E E+ DK SK
Sbjct: 97 ELVVNEDAALDKESKKTP 114
>gnl|CDD|219455 pfam07529, HSA, HSA. This domain is predicted to bind DNA and is
often found associated with helicases.
Length = 73
Score = 48.1 bits (115), Expect = 5e-07
Identities = 17/48 (35%), Positives = 30/48 (62%)
Query: 314 DEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMA 361
D +E + A+ +KL +A+ YH EKE+++ +ER K+R++ L A
Sbjct: 26 DFREERKWKIAKAKKLARAVAQYHKYIEKEEQRRKEREAKQRLKALKA 73
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 54.0 bits (129), Expect = 5e-07
Identities = 51/205 (24%), Positives = 97/205 (47%), Gaps = 11/205 (5%)
Query: 1081 ERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREK----EEALMGRGSRQRKQV 1136
E RK E + +K+ + K D E E EEER E+ EEA M +R++ +
Sbjct: 1216 EARKAEDAKKAEAVKKAEE--AKKDAE-EAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
Query: 1137 DYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK 1196
++ E KA E AD+ ++ EE+ K K+ + + K ++ A E +++
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKAD 1332
Query: 1197 PVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE 1256
KK+ K + E E E+ + + + + K+++ +K + +KK E
Sbjct: 1333 AAKKKAEEAKK--AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK--AE 1388
Query: 1257 EEEEVEEAEGEEEEEDDKASKPRKS 1281
E+++ +EA+ + EE+ KA + +K+
Sbjct: 1389 EKKKADEAKKKAEEDKKKADELKKA 1413
Score = 52.8 bits (126), Expect = 1e-06
Identities = 54/230 (23%), Positives = 105/230 (45%), Gaps = 17/230 (7%)
Query: 1066 ARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEV----ENSMYEEERE--- 1118
A+ DE +K + ++ E K + E K+ D K + E EE ++
Sbjct: 1469 AKKADEAK--KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
Query: 1119 -REKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGS-G 1176
++ EEA +++ ++ D L E LK E+ + ++ EE K + ++
Sbjct: 1527 AKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEA 1586
Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRR------KKVQISDEDEDEEDESPKKKKRKSTK 1230
KK +++ +EE EEE+K + ++ K ++ +E+++ KKK K
Sbjct: 1587 KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEK 1646
Query: 1231 TPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
A++ K ++E K ++ KK E++++ EEA+ EE+E A +K
Sbjct: 1647 KKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
Score = 52.5 bits (125), Expect = 2e-06
Identities = 51/216 (23%), Positives = 99/216 (45%), Gaps = 18/216 (8%)
Query: 1081 ERRKGETKPRLVEIKELPDWMVKDDDEV----ENSMYEEEREREKEEALMGRGSRQRKQV 1136
E +K E K + E+K+ + +K +E E EE++ +A + + + +
Sbjct: 1538 EAKKAEEKKKADELKKAEE--LKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
Query: 1137 DYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDE---- 1192
+ +++ +KA E A++ + + EE+K +++ ++ KK EE +E
Sbjct: 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655
Query: 1193 EEEKPVKKRR--------RRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
EEE +K ++K + +EDE+ + KK A++ K++ EEK
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK 1715
Query: 1245 HVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
+ KK E + + EEA+ E EE+ KA + +K
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
Score = 49.8 bits (118), Expect = 1e-05
Identities = 49/207 (23%), Positives = 91/207 (43%), Gaps = 18/207 (8%)
Query: 1066 ARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEAL 1125
A+ +E AE K + R+ E+ +L + K E + E + K E L
Sbjct: 1569 AKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE---EAKKAEEAKIKAEEL 1625
Query: 1126 MGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVE 1185
+ ++K+V+ +E KA E EEE ++K + + + + KK+ E
Sbjct: 1626 K-KAEEEKKKVEQLKKKEAEEKKKAEELKK----AEEENKIKAAEEAKKAEEDKKKA--E 1678
Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
E +E+E ++ + E E+ + KKK K A++ K ++E K
Sbjct: 1679 EAKKAEEDE--------KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730
Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEED 1272
+ KK+ E++++ EEA+ +EEE+
Sbjct: 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
Score = 47.4 bits (112), Expect = 5e-05
Identities = 46/208 (22%), Positives = 107/208 (51%), Gaps = 4/208 (1%)
Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEERERE-KEEALMGRGSRQRK 1134
+K DA ++K E K + E K+ + K DE++ + +++ E K++A + + + K
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437
Query: 1135 QVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE 1194
+ D+ KA E A++ +++ EE K + + ++ KK+ +E + EE
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA--DEAKKKAEEA 1495
Query: 1195 EKPVKKRRRRKKVQISDEDEDEEDESPKKKK-RKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
+K + ++ + + ++ + +E+ K + +K+ + D+ K+ +E+K +K +
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAE 1555
Query: 1254 VVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+++ EE ++AE ++ E+DK RK+
Sbjct: 1556 ELKKAEEKKKAEEAKKAEEDKNMALRKA 1583
Score = 47.4 bits (112), Expect = 6e-05
Identities = 50/218 (22%), Positives = 94/218 (43%), Gaps = 26/218 (11%)
Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
+K D ++K E + E K+ + K D + + E ++ EA +
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKK---KAEEAKKAAEAAKAEAEAAADE 1358
Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
+ A E A + ++EE + K K+ + +K K ++ A ED+++
Sbjct: 1359 AE-----------AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407
Query: 1196 KPVKKRRRRKK----VQISDEDEDEEDESPKK----KKRKSTKTPAKDKHKEDDEEKHVV 1247
+KK KK + E++ + DE+ KK KK K A++ K ++ +K
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
Query: 1248 SKRK----KKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+K KK EE ++ +EA+ + EE KA + +K+
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505
Score = 46.3 bits (109), Expect = 1e-04
Identities = 51/241 (21%), Positives = 99/241 (41%), Gaps = 14/241 (5%)
Query: 1048 EEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDE 1107
EE++N + E + + ++ K E + E K + + K ++E
Sbjct: 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEE 1631
Query: 1108 ---VENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEE 1164
VE +E E++K E L +K + +E KA ED + ++ E
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEEL-------KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684
Query: 1165 EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK 1224
E + + + + EE ++ EE+K ++ ++ ++ +E +++ KK
Sbjct: 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK 1744
Query: 1225 KRKSTKTPAKDK----HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
K + K ++K H + +EEK RK+K EEE++E + + E DK K
Sbjct: 1745 KAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIF 1804
Query: 1281 S 1281
Sbjct: 1805 D 1805
Score = 45.1 bits (106), Expect = 3e-04
Identities = 52/239 (21%), Positives = 98/239 (41%), Gaps = 14/239 (5%)
Query: 1048 EEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDE 1107
++ EE + ++E N+ I + E+ K E + E+K+ + K DE
Sbjct: 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEE--KKKADE 1294
Query: 1108 VENSMYEEEREREKEEALMGRGSRQ-RKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEV 1166
+ + +++ + K++A + + + +K+ + D KA E A + + E E
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEA 1354
Query: 1167 KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKR 1226
+ K E A + +EE K ++K + DE ++ KKK
Sbjct: 1355 AADEAEAAEEKA-------EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407
Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRK----KKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
K A K K D+ +K K+K KK EE ++ +EA+ + EE +K+
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKA 1466
Score = 42.1 bits (98), Expect = 0.002
Identities = 50/213 (23%), Positives = 95/213 (44%), Gaps = 23/213 (10%)
Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
+K DA ++K E + E + E E + E E EK EA + +K+
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAK---------AEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGK--KSYVEETAPEDEE 1193
D ++ K D+ EE++++ + K + KK K EE DE
Sbjct: 1380 ADAAKKKAEE---KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEA 1436
Query: 1194 EEKPVKKRRR---RKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR 1250
++K + ++ +KK + + + E+ + ++ + KK K A++ K D+ +K +
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496
Query: 1251 KK-----KVVEEEEEVEEA-EGEEEEEDDKASK 1277
KK K E +++ +EA + EE ++ D+A K
Sbjct: 1497 KKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529
Score = 41.3 bits (96), Expect = 0.004
Identities = 44/213 (20%), Positives = 88/213 (41%), Gaps = 12/213 (5%)
Query: 1081 ERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTD 1140
E RK E + E ++ D + + E R+ E+A +R+ ++V +
Sbjct: 1132 EARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAE 1191
Query: 1141 SLTDQEWLKAIED----------DGADDYEEEE--EEVKPVTKKRGSGKKGKKSYVEETA 1188
L E + E + A E+ + E VK + + ++ KK+ E
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN 1251
Query: 1189 PEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVS 1248
E + E+ R++ I E+ + DE K +++K K + K+ +E +
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKA 1311
Query: 1249 KRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+ KK E +++ EEA+ + + KA + +K+
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 40.1 bits (93), Expect = 0.010
Identities = 58/256 (22%), Positives = 102/256 (39%), Gaps = 42/256 (16%)
Query: 1049 EEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEV 1108
EE + + + V QL + +E K AE K + ++ E +D +
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEE-----KKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677
Query: 1109 ENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKP 1168
E + EE E++ EAL + + + K+ + +E KA E EEE ++K
Sbjct: 1678 EEAKKAEEDEKKAAEAL-KKEAEEAKKAEELKKKEAEEKKKAEELKK----AEEENKIKA 1732
Query: 1169 VTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRR----------RRKKVQISDEDEDEED 1218
K+ + + KK+ E A +DEEE+K + + R++K + +E+ DEED
Sbjct: 1733 EEAKKEAEEDKKKA---EEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEED 1789
Query: 1219 ESPKKKKRKSTK----------------TPAKDKHKE--DDEEKHVVSKRKKKVVEEEEE 1260
E + + K K + KE D K V K +EE +
Sbjct: 1790 EKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADS-KNMQLEEADA 1848
Query: 1261 VEEAEGEEEEEDDKAS 1276
E+ + + E+ +
Sbjct: 1849 FEKHKFNKNNENGEDG 1864
Score = 39.4 bits (91), Expect = 0.015
Identities = 51/224 (22%), Positives = 97/224 (43%), Gaps = 19/224 (8%)
Query: 1066 ARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEAL 1125
A+ DE +K E +K E + E + D K +E + + +E +++ EEA
Sbjct: 1443 AKKADEA---KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA---DEAKKKAEEAK 1496
Query: 1126 MGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVE 1185
++ + E KA E AD+ ++ EE K K+ KK +
Sbjct: 1497 KKADEAKKAAEAKKKA---DEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553
Query: 1186 ETAPEDEEEEKPVKKRRR--------RKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKH 1237
+ EE+K ++ ++ +K + + + E+ E K + K A++
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAK 1613
Query: 1238 KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
K EE + ++ KK EE+++VE+ + +E EE KA + +K+
Sbjct: 1614 KA--EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655
Score = 38.6 bits (89), Expect = 0.029
Identities = 40/169 (23%), Positives = 75/169 (44%), Gaps = 20/169 (11%)
Query: 340 AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
AE+ +K E+ +I+ E +++ E ++ + + ++K+ A L + +E +
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668
Query: 400 HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGED 459
EE +KK E KK ++ K K + +E+ + E+L+ K E+
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAA----EALKKEAEEAKKAEELK----------KKEAEE 1714
Query: 460 APPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKK 508
A+++K+ E +E+ E EE K+K EE K EKK
Sbjct: 1715 KKKAEELKK------AEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
Score = 38.6 bits (89), Expect = 0.031
Identities = 41/177 (23%), Positives = 75/177 (42%), Gaps = 6/177 (3%)
Query: 343 EQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKE 402
E KK +E + E ++ + K + +K + E + L + K+ K
Sbjct: 1550 ELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609
Query: 403 EQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPP 462
E+ KK E K K ++ K + +++L E+ + + +K+ +NK++ +
Sbjct: 1610 EEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE-ELKKAEEENKIKAAEEAK 1668
Query: 463 AKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGE 519
+ + A E + EDE E +K++ EE KA KK E +E K A E
Sbjct: 1669 KAEEDKKKA-----EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEE 1720
Score = 38.2 bits (88), Expect = 0.042
Identities = 42/206 (20%), Positives = 92/206 (44%), Gaps = 8/206 (3%)
Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
+K D ++K E + E K+ ++ + E + + E ++ +EA + + + K+
Sbjct: 1431 KKADEAKKKAEEAKKADEAKK----KAEEAKKAEEAKKKAEEAKKADEA--KKKAEEAKK 1484
Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
D ++ KA E A + +++ +E K + + + KK+ + A E ++ E
Sbjct: 1485 ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK-KADEAKKAEEAKKADEAKKAE 1543
Query: 1196 KPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV 1255
+ K +K ++ +E ++ E KK + K + + EE + K
Sbjct: 1544 EKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603
Query: 1256 EEEEEVEEAEGEEEEEDDKASKPRKS 1281
E++ + EEA+ + EE KA + +K+
Sbjct: 1604 EKKMKAEEAK-KAEEAKIKAEELKKA 1628
Score = 36.3 bits (83), Expect = 0.13
Identities = 38/221 (17%), Positives = 90/221 (40%), Gaps = 8/221 (3%)
Query: 1069 EDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGR 1128
+ + + A+ E + E K+ ++ + E + + E R+ EEA
Sbjct: 1078 DFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAE 1137
Query: 1129 GSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG--------KKGK 1180
+R+ ++ + E + ED + + E+ K R + +K +
Sbjct: 1138 DARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAE 1197
Query: 1181 KSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKED 1240
+ E A + EEE K + R+ + + EE + ++ +K+ + ++ ++
Sbjct: 1198 DARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKF 1257
Query: 1241 DEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+E + R++ ++ EE + E ++ EE KA + +K+
Sbjct: 1258 EEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKA 1298
Score = 35.5 bits (81), Expect = 0.27
Identities = 44/230 (19%), Positives = 96/230 (41%), Gaps = 9/230 (3%)
Query: 275 AQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKEYHRNNTARTQKLNKAII 334
A +A + E R+ ++ + E+++ A + E+ + + + K +
Sbjct: 1579 ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE---EAKKAEEAKIKAEELKKAEEEKKKV 1635
Query: 335 NYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLT 394
E E+KK+ E ++K + EE + D+KK + + L
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Query: 395 QMVKEHKE-EQQKKMREEKKKKKKSIKQ-----KILDGNMDKLDLDESSQLEDLRITVKE 448
+ +E K+ E+ KK E+KKK + +K+ KI K ++ + E+ + +E
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
Query: 449 TNSDNKLRGEDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKE 498
L+ E+ A++I++ + E+ ED + D+++K+ +
Sbjct: 1756 KKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFD 1805
Score = 34.0 bits (77), Expect = 0.69
Identities = 42/196 (21%), Positives = 80/196 (40%), Gaps = 14/196 (7%)
Query: 1094 IKELPDWMVKDDDEVENSMYEEEREREKEEALMGR---GSRQRKQVDYTDSLTDQEWLKA 1150
I+EL ++ DD E + +E+ + E + G + + Y D D
Sbjct: 1029 IEELTEYGNNDDVLKEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDA----- 1083
Query: 1151 IEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS 1210
ED+ AD+ EE KK +GK + EE + E+ K + R+ +
Sbjct: 1084 KEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAE 1143
Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDD-----EEKHVVSKRKKKVVEEEEEVEEAE 1265
+ + E+ + + ++ A++ K +D + RK + + + E+ +AE
Sbjct: 1144 EARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAE 1203
Query: 1266 -GEEEEEDDKASKPRK 1280
+ EE+ KA + RK
Sbjct: 1204 AARKAEEERKAEEARK 1219
Score = 34.0 bits (77), Expect = 0.82
Identities = 39/185 (21%), Positives = 70/185 (37%), Gaps = 17/185 (9%)
Query: 338 INAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKK---DKRLAFLLSQTDEYISNLT 394
I AE+ +K ++ + +E+ + A+ E +K + KK + + A + E
Sbjct: 1272 IKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1331
Query: 395 QMVKEHKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNK 454
K+ EE +K E K + + +K + E + E K+ K
Sbjct: 1332 DAAKKKAEEAKKA--AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA-----KKKADAAK 1384
Query: 455 LRGEDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEY 514
+ E+ A + K+ + DE + K+K +E K EKK D+
Sbjct: 1385 KKAEEKKKADEAKKKAEED-------KKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437
Query: 515 KDAGE 519
K A E
Sbjct: 1438 KKAEE 1442
Score = 33.2 bits (75), Expect = 1.2
Identities = 50/229 (21%), Positives = 92/229 (40%), Gaps = 11/229 (4%)
Query: 302 RILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMA 361
+ AA + + +E + A+ + K +AE+ +K E+ER +E + A
Sbjct: 1201 KAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEA 1260
Query: 362 EDEEGYRKLIDQK-KDKRLAFLLSQTDEY--ISNLTQMVKEHKEEQQKKMREEKKKK--- 415
R+ K ++ R A L + +E + ++ K ++ KK EE KK
Sbjct: 1261 RMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEA 1320
Query: 416 KKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPG 475
KK ++ + K +E+ + + E +D E+ A + K+ A
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA 1380
Query: 476 WEVVMEDSEDEGSED-----EEMKEKKEEPSKAVGEKKTEDDEYKDAGE 519
+ E + +++ EE K+K +E KA KK D+ K A E
Sbjct: 1381 DAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429
Score = 31.3 bits (70), Expect = 4.2
Identities = 32/172 (18%), Positives = 74/172 (43%), Gaps = 8/172 (4%)
Query: 340 AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
A++ +KK +E + + ++ AE+ + + + ++ + A + E + K+
Sbjct: 1433 ADEAKKKAEEAKKADEAKK-KAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA-KK 1490
Query: 400 HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGED 459
EE +KK E KK + K + DE+ + E+ + + ++ K + ++
Sbjct: 1491 KAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE 1550
Query: 460 APPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTED 511
A+++K+ E + + ED+ M +K E +K E + E+
Sbjct: 1551 LKKAEELKK------AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEE 1596
Score = 30.9 bits (69), Expect = 6.7
Identities = 42/191 (21%), Positives = 80/191 (41%), Gaps = 14/191 (7%)
Query: 340 AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
AE+ +KK +E + + ++ E ++ ++ K+ A + E + E
Sbjct: 1459 AEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEA----KKKADE 1514
Query: 400 HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGED 459
K+ ++ K +E KK +++ K +K DE + E+L+ ++ ++ + E+
Sbjct: 1515 AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574
Query: 460 --------APPAKQIKE--WLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKT 509
A AK+ +E +E + +E + EE K K EE KA EKK
Sbjct: 1575 DKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
Query: 510 EDDEYKDAGEE 520
+ K EE
Sbjct: 1635 VEQLKKKEAEE 1645
Score = 30.1 bits (67), Expect = 9.5
Identities = 57/299 (19%), Positives = 118/299 (39%), Gaps = 21/299 (7%)
Query: 977 QAQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERH 1036
+A+ +A + + E + E A K +E I+A +K+ ++
Sbjct: 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK 1676
Query: 1037 EFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKE 1096
++ ++ EE+E E + + + L +K E++K E + E +
Sbjct: 1677 A-------EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729
Query: 1097 LPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGA 1156
+ K + E + EE ++ E+E+ ++ + + +E K E
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAKKDEEEK-------KKIAHLKKEEEKKAEEIRKEKEAVIE 1782
Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKR--RRRKKVQISDEDE 1214
++ +EE+E+ + K+ + + E E K+ K+V S +
Sbjct: 1783 EELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQ 1842
Query: 1215 DEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
EE ++ +K K +D +KE D +K K ++EEE+EEA+ E+ + D
Sbjct: 1843 LEEADAFEKHKFNKNNENGEDGNKEAD-----FNKEKDLKEDDEEEIEEADEIEKIDKD 1896
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 53.5 bits (128), Expect = 7e-07
Identities = 52/241 (21%), Positives = 102/241 (42%), Gaps = 20/241 (8%)
Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAER--RKGETKPRLVEIK-ELPDWM 1101
+ ++E E ++E+ ++ A + E +++A+ KGET+ VE + E
Sbjct: 666 EAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEG 725
Query: 1102 VKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEE 1161
+D+ E+E EE E E E G+ + + + + ED+ + +
Sbjct: 726 TEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQA 785
Query: 1162 EEE-EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK-----PVKKRRRRKKVQISDEDED 1215
E+ E+K G + ++ E + + E VK +++ ++ E
Sbjct: 786 GEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEA 845
Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
++DE S ++++ +E++EE+ EEEEE EE E EEEE ++
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEE-----------EEEEEEEEEEEEEEENEEPL 894
Query: 1276 S 1276
S
Sbjct: 895 S 895
Score = 42.3 bits (99), Expect = 0.002
Identities = 39/165 (23%), Positives = 63/165 (38%), Gaps = 5/165 (3%)
Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
E ER E E + +Q T++ + E I + + E E E +
Sbjct: 648 EGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHK 707
Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE-----EDESPKKKKRKS 1228
G + + + ET E E+E ++ ++V+ E E E E E +K+
Sbjct: 708 GETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHE 767
Query: 1229 TKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
+T A+ K ED+ E + K E E E EGE E +
Sbjct: 768 GETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEK 812
Score = 37.3 bits (86), Expect = 0.073
Identities = 28/114 (24%), Positives = 44/114 (38%), Gaps = 12/114 (10%)
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
E EE + G+ G++S E E E E K + + E E E +
Sbjct: 643 ERTGEEGER--PTEAEGENGEESG-GEAEQEGETETKGENESEGEIPAERKGEQEGEGEI 699
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
K+ A K + + EE + + + E+E E+E E EE ED+
Sbjct: 700 EAKE---------ADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDE 744
Score = 36.9 bits (85), Expect = 0.077
Identities = 33/197 (16%), Positives = 67/197 (34%), Gaps = 14/197 (7%)
Query: 1080 AERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYT 1139
ER E + E + + E E E + + E + R+ +Q
Sbjct: 642 GERTGEEGERPTEAEGENGE---ESGGEAEQEGETETKGENESEGEI-PAERKGEQEGEG 697
Query: 1140 DSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDE---EEEK 1196
+ + K + ++E E E T+ G + G++ E E E + E
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEG--TEDEGEIETGEEGEEVEDEGEGEAEGKHEV 755
Query: 1197 PVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE 1256
+ R+ + + E E +EDE + + K D+ + V +
Sbjct: 756 ETEGDRKETEHEGETEAEGKEDED-----EGEIQAGEDGEMKGDEGAEGKVEHEGETEAG 810
Query: 1257 EEEEVEEAEGEEEEEDD 1273
E++E E + ++ +
Sbjct: 811 EKDEHEGQSETQADDTE 827
Score = 35.0 bits (80), Expect = 0.37
Identities = 35/137 (25%), Positives = 56/137 (40%), Gaps = 12/137 (8%)
Query: 1146 EWLKAIEDDGADDYEEEEEEVKPVTKKRGSG--KKGKKSYVEETAPEDEEEEKPVKKRRR 1203
+W K IE + +E+ +PV K G KG + E T EE + +
Sbjct: 603 KWNKQIEL-----WVKEQLSRRPVAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEG 657
Query: 1204 RKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE- 1262
+ E E E + K + + PA+ K +++ E + + K E EEVE
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEH 717
Query: 1263 ----EAEGEEEEEDDKA 1275
EAEG E+E + +
Sbjct: 718 EGETEAEGTEDEGEIET 734
>gnl|CDD|221124 pfam11496, HDA2-3, Class II histone deacetylase complex subunits 2
and 3. This family of class II histone deacetylase
complex subunits HDA2 and HDA3 is found in fungi, The
member from S. pombe is referred to as Ccq1. These
proteins associate with HDA1 to generate the activity of
the HDA1 histone deacetylase complex. HDA1 interacts with
itself and with the HDA2-HDA3 subcomplex to form a
probable tetramer and these interactions are necessary
for catalytic activity. The HDA1 histone deacetylase
complex is responsible for the deacetylation of lysine
residues on the N-terminal part of the core histones
(H2A, H2B, H3 and H4). Histone deacetylation gives a tag
for epigenetic repression and plays an important role in
transcriptional regulation, cell cycle progression and
developmental events. HDA2 and HDA3 have a conserved
coiled-coil domain towards their C-terminus.
Length = 279
Score = 51.6 bits (124), Expect = 1e-06
Identities = 37/178 (20%), Positives = 73/178 (41%), Gaps = 17/178 (9%)
Query: 870 TSGKFELLDRILP--KFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSEDR 927
TSGKF +L+ ++ E VL+ + + + ++E + + +Y RL G + E+
Sbjct: 93 TSGKFLVLNDLINLLIRSERDLHVLIISRSVKTLDLVEALLLGKGLNYKRLSGESLYEEN 152
Query: 928 GELLRKFNEPESKYFIFLLSTRA---GGLGLNLQAA-DTVIIFDSDWNPH----QDLQAQ 979
+ + +I L ++ L D +I FD + + L+ Q
Sbjct: 153 HK--VSDKKGSLSLWIHLTTSDGLTNTDSSLLSNYKFDLIISFDPSLDTSLPSIESLRTQ 210
Query: 980 DRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERHE 1037
+R N ++RL+ VNS+E L + N + ++QA + + G +
Sbjct: 211 NRRG-----NLTPIIRLVVVNSIEHVELCFPKKYPNRLDYLVQASVVLRDIVGDLPPD 263
Score = 48.5 bits (116), Expect = 9e-06
Identities = 33/148 (22%), Positives = 62/148 (41%), Gaps = 17/148 (11%)
Query: 167 TSGKFELLDRILP--KFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSEDR 224
TSGKF +L+ ++ E VL+ + + + ++E + + +Y RL G + E+
Sbjct: 93 TSGKFLVLNDLINLLIRSERDLHVLIISRSVKTLDLVEALLLGKGLNYKRLSGESLYEEN 152
Query: 225 GELLRKFNEPESKYFIFLLSTRA---GGLGLNLQAA-DTVIIFDSDWNPH----QDLQAQ 276
+ + +I L ++ L D +I FD + + L+ Q
Sbjct: 153 HK--VSDKKGSLSLWIHLTTSDGLTNTDSSLLSNYKFDLIISFDPSLDTSLPSIESLRTQ 210
Query: 277 DRAHRIGQKNEVRVLRLITVNSVEERIL 304
+R N ++RL+ VNS+E L
Sbjct: 211 NRRG-----NLTPIIRLVVVNSIEHVEL 233
>gnl|CDD|197800 smart00592, BRK, domain in transcription and CHROMO domain
helicases.
Length = 45
Score = 43.5 bits (103), Expect = 1e-05
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 439 LEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVV 479
+ R+ V + KL G+DAP AK ++ WL +P +EV
Sbjct: 1 DGEERVPVINRETGKKLTGDDAPKAKDLERWLEENPEYEVA 41
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3 subunit.
This is a family of proteins which are subunits of the
eukaryotic translation initiation factor 3 (eIF3). In
yeast it is called Hcr1. The Saccharomyces cerevisiae
protein eIF3j (HCR1) has been shown to be required for
processing of 20S pre-rRNA and binds to 18S rRNA and eIF3
subunits Rpg1p and Prt1p.
Length = 242
Score = 47.3 bits (113), Expect = 2e-05
Identities = 33/123 (26%), Positives = 54/123 (43%), Gaps = 26/123 (21%)
Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
D+++EE E K K +DE+E+ VK D ++E
Sbjct: 2 ADWDDEEFEPPAPPAKAVVKDKW----------DDEDEDDDVK-----------DSWDEE 40
Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKAS 1276
EDE +++K K AK K K+ + K + +++K E+EE+ E+ ED+ A
Sbjct: 41 EDEEKEEEK---AKVAAKAKAKKALKAK--IEEKEKAKREKEEKGLRELEEDTPEDELAE 95
Query: 1277 KPR 1279
K R
Sbjct: 96 KLR 98
Score = 38.5 bits (90), Expect = 0.015
Identities = 19/72 (26%), Positives = 37/72 (51%)
Query: 1152 EDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISD 1211
+DD D ++EEE+E K K + + K K ++ E E+ ++ +++ R+ + +
Sbjct: 30 DDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89
Query: 1212 EDEDEEDESPKK 1223
EDE E +K
Sbjct: 90 EDELAEKLRLRK 101
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit of
transcription factor IIF (TFIIF), which is essential for
accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 48.8 bits (116), Expect = 2e-05
Identities = 42/181 (23%), Positives = 71/181 (39%), Gaps = 22/181 (12%)
Query: 1100 WMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDY 1159
M+K + +E+ E E E+ GRG K + D D E + D +D
Sbjct: 181 MMMKAAKNGPAAFGDEDEETEGEKGGGGRG----KDLKIKDLEGDDE----DDGDESDKG 232
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
E+ +E K KK+ K KK ++ +++ + D D++ +E
Sbjct: 233 GEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDA--------DEYDSDDGDDEGREE 284
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPR 1279
S P + + K E K +E++E+ EE+E E+ EE+ SK
Sbjct: 285 DYISDSSASGNDPEEREDKLSPEIP------AKPEIEQDEDSEESEEEKNEEEGGLSKKG 338
Query: 1280 K 1280
K
Sbjct: 339 K 339
Score = 45.0 bits (106), Expect = 2e-04
Identities = 36/170 (21%), Positives = 67/170 (39%), Gaps = 10/170 (5%)
Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
+ + G + + ++IK+L D DE + E+ + EK + + ++ +K
Sbjct: 197 DEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKG--GEDGDEEKSKKKKKKLAKNKK- 253
Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEE--EEEVKPVTKKRGSGKKGKKSYVEETAPEDEE 1193
L D + K DD AD+Y+ + ++E + S G E E
Sbjct: 254 -----KLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEI 308
Query: 1194 EEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEE 1243
KP ++ + +++E+E S K KK K K K+D +
Sbjct: 309 PAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358
Score = 34.2 bits (78), Expect = 0.57
Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 18/104 (17%)
Query: 1187 TAPEDEEEEKPVKK---------RRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKH 1237
TA E EEE K KK + K + DEDEE E K + KD
Sbjct: 160 TAEEAEEEMKRRKKTANGFQLMMMKAAKNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLE 219
Query: 1238 KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+D+++ ++ + E + +++ K +K +K
Sbjct: 220 GDDEDDGDE---------SDKGGEDGDEEKSKKKKKKLAKNKKK 254
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This protein,
which interacts with both microtubules and TRAF3 (tumour
necrosis factor receptor-associated factor 3), is
conserved from worms to humans. The N-terminal region is
the microtubule binding domain and is well-conserved; the
C-terminal 100 residues, also well-conserved, constitute
the coiled-coil region which binds to TRAF3. The central
region of the protein is rich in lysine and glutamic acid
and carries KKE motifs which may also be necessary for
tubulin-binding, but this region is the least
well-conserved.
Length = 506
Score = 48.0 bits (114), Expect = 3e-05
Identities = 38/155 (24%), Positives = 73/155 (47%), Gaps = 7/155 (4%)
Query: 1134 KQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEE 1193
K+V+ S K ++ + +EEE+E + V +++ K+ K ++ P++E
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137
Query: 1194 EEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKST-------KTPAKDKHKEDDEEKHV 1246
+EK K + ++K + +E D E+E +++ R + K P K + +EEK
Sbjct: 138 KEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQR 197
Query: 1247 VSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+ R+ + EE E E+EE+D K + S
Sbjct: 198 QAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTS 232
Score = 47.6 bits (113), Expect = 4e-05
Identities = 18/144 (12%), Positives = 58/144 (40%)
Query: 1135 QVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE 1194
Q + + +A++ + + KP + + K ++ E+ E +++
Sbjct: 61 QALAKCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKK 120
Query: 1195 EKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKV 1254
++ K+ + +K + +++ E K+K++K + +++ K+ + + +K
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180
Query: 1255 VEEEEEVEEAEGEEEEEDDKASKP 1278
+ + +E EE++
Sbjct: 181 KKPPNKKKEPPEEEKQRQAAREAV 204
Score = 38.3 bits (89), Expect = 0.027
Identities = 19/106 (17%), Positives = 44/106 (41%), Gaps = 7/106 (6%)
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKP------VKKRRRRKKVQISDED 1213
+EE +E +P +K +K + + + E + K++ K + E+
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193
Query: 1214 EDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEE 1259
E + + + K K + P ++ +E +E+ + +EE+E
Sbjct: 194 EKQRQAAREAVKGK-PEEPDVNEEREKEEDDGKDRETTTSPMEEDE 238
Score = 34.9 bits (80), Expect = 0.34
Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 4/168 (2%)
Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
EE+++++++ + + +++ + +E K E + + EEE+ + + +
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAK--EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAK 172
Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
KK K E EEEK + R K + + D +EE E K++ +
Sbjct: 173 SRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEERE--KEEDDGKDRETT 230
Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+ED+ + R+ ++ + + E + +
Sbjct: 231 TSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRP 278
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit; Provisional.
Length = 482
Score = 45.7 bits (109), Expect = 1e-04
Identities = 24/90 (26%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
Query: 1156 ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDED 1215
A E EEE++ +T GS K KK E + EE+ +K+++ + +E+E+
Sbjct: 389 AAFLELTEEEIEFLT---GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445
Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
EE E +++K + + ++K +E++++K
Sbjct: 446 EEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 39.1 bits (92), Expect = 0.013
Identities = 24/80 (30%), Positives = 44/80 (55%), Gaps = 1/80 (1%)
Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSK 1249
+EE E ++ KK++ E +++ E KK+K+K K + +E++EEK +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKE-EEEEEEKEKKEE 453
Query: 1250 RKKKVVEEEEEVEEAEGEEE 1269
K++ EE EE +E E E++
Sbjct: 454 EKEEEEEEAEEEKEEEEEKK 473
Score = 38.7 bits (91), Expect = 0.018
Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 9/90 (10%)
Query: 1191 DEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR 1250
EEE + + ++ K I E E + ++KK K K A K +E++EE+
Sbjct: 395 TEEEIEFLTGSKKATKK-IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE------ 447
Query: 1251 KKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
K +EEE+ EE E EEE++++ K +K
Sbjct: 448 --KEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 36.8 bits (86), Expect = 0.080
Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 1/71 (1%)
Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
E+E E KK +K K + K+ +EEK K+ ++EEE EE + ++EE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKI-VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Query: 1271 EDDKASKPRKS 1281
E ++ + +
Sbjct: 454 EKEEEEEEAEE 464
Score = 36.1 bits (84), Expect = 0.12
Identities = 28/138 (20%), Positives = 64/138 (46%), Gaps = 28/138 (20%)
Query: 288 VRVLRLITVNSVEERILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKE 347
+ L +I ++ E AA +L +E E+ + T+K+ K I + E+K+E
Sbjct: 371 LPFLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKK------IVEKAEKKRE 424
Query: 348 QERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKK 407
+E+ EK+ ++ A ++ + +++K + +E +EE+++
Sbjct: 425 EEKKEKK--KKAFAGKKKEEEEEEEKEKKE--------------------EEKEEEEEEA 462
Query: 408 MREEKKKKKKSIKQKILD 425
E++++++K KQ L
Sbjct: 463 EEEKEEEEEKKKKQATLF 480
Score = 33.7 bits (78), Expect = 0.76
Identities = 26/76 (34%), Positives = 36/76 (47%), Gaps = 7/76 (9%)
Query: 1213 DEDEE-----DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE--EAE 1265
E+E + KK +K + K + +E E+K KKK EEEEE E E E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 1266 GEEEEEDDKASKPRKS 1281
EEEEE+ + K +
Sbjct: 455 KEEEEEEAEEEKEEEE 470
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 44.9 bits (107), Expect = 3e-04
Identities = 36/152 (23%), Positives = 64/152 (42%), Gaps = 34/152 (22%)
Query: 1049 EEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEI-KELPDWMVKDDDE 1107
E E+ ++ T ++LI ++EL +K + L E+ K+ + ++ +
Sbjct: 201 GFERLELEEEGTPSELIREIKEELEEIEK-----ERESLLEELKELAKKYLEELLALYEY 255
Query: 1108 VENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVK 1167
+E E E+ EAL S+ K D T ++ W+ D ++ +E +
Sbjct: 256 LEI-------ELERAEAL----SKFLKT-DKTFAIEG--WVPE------DRVKKLKELID 295
Query: 1168 PVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK 1199
T G +YVE P++EEEE P K
Sbjct: 296 KAT--------GGSAYVEFVEPDEEEEEVPTK 319
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 44.3 bits (104), Expect = 4e-04
Identities = 38/248 (15%), Positives = 82/248 (33%), Gaps = 20/248 (8%)
Query: 1043 LHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWM- 1101
L Q EE +V VN + ++E +A + + + + +
Sbjct: 20 LRQKQEEGSLGQVTTQVEVNSQNSVPDEESKTSTDDEAALLERLARREERRDERFSEALE 79
Query: 1102 --VKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDY 1159
+ + E R +E++ G+ + + +E + E +G
Sbjct: 80 RQKEFKPTSTDQSLSEPSRRMQEDS----GAENETVEEEEKEESREEREEVEETEGVTKS 135
Query: 1160 EEEEEEVKPVTKKRGSGKK----GKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDED 1215
E++ + ++ + +K + E K + E
Sbjct: 136 EQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQ 195
Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEE----EEE 1271
E +K ++ + A E +E K + ++KV+EEEE+ + E + EEE
Sbjct: 196 VEAGKEFEKLKQKQQEAAL----ELEELK-KKREERRKVLEEEEQRRKQEEADRKSREEE 250
Query: 1272 DDKASKPR 1279
+ + K
Sbjct: 251 EKRRLKEE 258
Score = 33.9 bits (77), Expect = 0.68
Identities = 52/227 (22%), Positives = 82/227 (36%), Gaps = 28/227 (12%)
Query: 1037 EFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKE 1096
EF T Q E D N+ + E E + ++ + E +G TK
Sbjct: 83 EFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDW- 141
Query: 1097 LPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGA 1156
D E E+E E E+EE + T L E +
Sbjct: 142 --------RDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEG 193
Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
E +E K K+ +E A E EE +K KR R+KV +E +
Sbjct: 194 AQVEAGKEFEKLKQKQ------------QEAALELEELKK---KREERRKVLEEEEQRRK 238
Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE 1263
++E+ RKS + K + KE+ E + + K++ V E+ E+
Sbjct: 239 QEEA----DRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGLSED 281
Score = 33.1 bits (75), Expect = 1.1
Identities = 42/193 (21%), Positives = 75/193 (38%), Gaps = 39/193 (20%)
Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEE 1163
DDE E + E R RE+EE L RQ+++ +T Q + + +
Sbjct: 1 VDDEEEAA--RERRRREREEQL-----RQKQEEGSLGQVTTQVEVNS-------QNSVPD 46
Query: 1164 EEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK---KRRRRKKVQISD--------- 1211
EE K T + +E A +E ++ +R++ K +D
Sbjct: 47 EESKTSTDDEAAL-------LERLARREERRDERFSEALERQKEFKPTSTDQSLSEPSRR 99
Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEE---VEEAEGEE 1268
ED E+ ++ + ++ + +E+ EE V+K ++K + E EE E E
Sbjct: 100 MQEDSGAENETVEEEEKEESR---EEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEP 156
Query: 1269 EEEDDKASKPRKS 1281
EEE+ +
Sbjct: 157 EEEEKPKRGSLEE 169
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1, of
polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 44.1 bits (104), Expect = 4e-04
Identities = 42/151 (27%), Positives = 67/151 (44%), Gaps = 30/151 (19%)
Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
+ED+ A+ +E+E + K G E + +EE E+ K++R+R K
Sbjct: 253 VILEDESAEPTGLDEDEDEDEPKPSG-----------ERSDSEEETEEKEKEKRKRLKKM 301
Query: 1209 ISDEDEDEED----ESPK-KKKRKSTKTPAKDKHKEDDEEKHV----VSKRK-----KKV 1254
+ DEDEDEE ESP +++ + + P K +E+ EE V +R KK
Sbjct: 302 MEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKK 361
Query: 1255 VEEEEE-----VEEAEGEEEEEDDKASKPRK 1280
++EE + E E ED+ P K
Sbjct: 362 TFKDEEGYLVTKKVYEWESFSEDEAEPPPTK 392
Score = 40.2 bits (94), Expect = 0.006
Identities = 40/190 (21%), Positives = 66/190 (34%), Gaps = 25/190 (13%)
Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
+K + + T E +E V +DE +E E E E G S ++
Sbjct: 228 EKKEKKEASESTVKEESE-EESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEE 286
Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
+ + ++ LK + +D +D +EE E V + ++ + + + E EE
Sbjct: 287 TEEKEK-EKRKRLKKMMED--EDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVT 343
Query: 1196 KPVKKRRRR------KKVQISDED---------------EDEEDESPKKKKRKSTKTPAK 1234
RRR KK DE+ EDE + P K K K +
Sbjct: 344 VSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKPKPKVSTPAVP 403
Query: 1235 DKHKEDDEEK 1244
K+ K
Sbjct: 404 AAAKKPKAPK 413
Score = 39.8 bits (93), Expect = 0.007
Identities = 31/112 (27%), Positives = 52/112 (46%), Gaps = 9/112 (8%)
Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTK 1230
KK+ KK KK E T E+ EEE + + +++EDE K + +
Sbjct: 223 KKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSD 282
Query: 1231 TPAKDKHKEDDEEKHVVSKRKKKVVE-EEEEVEEAEGEE---EEEDDKASKP 1278
+ + + KE ++ K R KK++E E+E+ E E EEE+ + +P
Sbjct: 283 SEEETEEKEKEKRK-----RLKKMMEDEDEDEEMEIVPESPVEEEESEEPEP 329
Score = 37.9 bits (88), Expect = 0.033
Identities = 18/116 (15%), Positives = 43/116 (37%), Gaps = 10/116 (8%)
Query: 1168 PVTKKRGSGKKGKKSYVEETAPED--EEEEKPVKKRRRRKKVQISDEDEDEEDESP---K 1222
K+ K S + ++ E++E + + + D EDES
Sbjct: 205 ASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTG 264
Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
+ + P + D EE+ +++K + +++ +E+E+++ P
Sbjct: 265 LDEDEDEDEPKPSGERSDSEEETEEKEKEK-----RKRLKKMMEDEDEDEEMEIVP 315
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 44.6 bits (105), Expect = 5e-04
Identities = 42/248 (16%), Positives = 95/248 (38%), Gaps = 2/248 (0%)
Query: 1033 SERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLV 1092
E + + E + + E + + R ++EL + + K +
Sbjct: 674 KELLAEQELQEKAESELAKNEILRRQEEIKKKEQRIKEELK--KLKLEKEELLADKVQEA 731
Query: 1093 EIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIE 1152
+ K + + + E EE+ +KEE + K+ + + E LK E
Sbjct: 732 QDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEE 791
Query: 1153 DDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE 1212
+ +EEE + + + ++ + E +EE+ + K+ Q ++
Sbjct: 792 EKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEK 851
Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
+EE E +++ K +E+ EE+ + + + K +E+EE +E E E ++++
Sbjct: 852 LAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDN 911
Query: 1273 DKASKPRK 1280
K +
Sbjct: 912 LLEEKENE 919
Score = 38.8 bits (90), Expect = 0.026
Identities = 45/207 (21%), Positives = 82/207 (39%), Gaps = 6/207 (2%)
Query: 1065 IARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEA 1124
+ E L ++ + L + K E + E E E
Sbjct: 170 RKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLY 229
Query: 1125 LMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYV 1184
L + + + L D++ + E+EEE + V K+ +K KK
Sbjct: 230 LDYLKLNEERIDLLQELLRDEQEEIESS---KQELEKEEEILAQVLKENKEEEKEKKLQE 286
Query: 1185 EETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
EE +EEE+ + + ++ ++ DE++ +E E KK K K K+K + ++ EK
Sbjct: 287 EELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELK---KEKEEIEELEK 343
Query: 1245 HVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
+ K+ EEEEE + + +E+ E
Sbjct: 344 ELKELEIKREAEEEEEEQLEKLQEKLE 370
Score = 34.2 bits (78), Expect = 0.60
Identities = 38/222 (17%), Positives = 82/222 (36%), Gaps = 14/222 (6%)
Query: 312 NMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLM----AEDEEGY 367
K+ R + N A + + K Q+ + + K+ + E EE
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226
Query: 368 RKLIDQKK--------DKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKKKKKSI 419
+D K + L + E + +E + K+ +EE+K+KK +
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKK--L 284
Query: 420 KQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVV 479
+++ L + + +S L+ R V + + E K++K+ E
Sbjct: 285 QEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKE 344
Query: 480 MEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEA 521
+++ E + +EE +E+ E+ + + + + E K E
Sbjct: 345 LKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESER 386
Score = 30.7 bits (69), Expect = 6.9
Identities = 30/216 (13%), Positives = 81/216 (37%), Gaps = 9/216 (4%)
Query: 315 EKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQK 374
+++ + E + +E E+E+ ++L E+ + K ++
Sbjct: 240 IDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEEL 299
Query: 375 KDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKKK---KKSIKQKILDGNMDKL 431
K + L L + + ++ ++ +K++++EK++ +K +K+ + ++
Sbjct: 300 KSELL--KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEE 357
Query: 432 DLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVVMEDSED----EG 487
+ ++ +L++ ++E K + + + E E E
Sbjct: 358 EEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQ 417
Query: 488 SEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEATY 523
ED +EKKEE +++ + + EE
Sbjct: 418 EEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEE 453
Score = 30.3 bits (68), Expect = 9.8
Identities = 33/251 (13%), Positives = 88/251 (35%), Gaps = 23/251 (9%)
Query: 299 VEERILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQK-----KEQERIEK 353
E + R + ++ + + +K K EKE K +E E E+
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEE 359
Query: 354 ERMRRLMAEDEEGYRKLIDQKKDKRLAF---------------LLSQTDEYISNLTQMVK 398
E++ +L + E+ +L+ +KK + + + + L++ +
Sbjct: 360 EQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEE 419
Query: 399 EHKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGE 458
+ +E++K+ + ++ ++S++ K +K +L++ + + + D +
Sbjct: 420 DLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETK 479
Query: 459 DAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAG 518
+Q++ L E ++S+ + + + D G
Sbjct: 480 LVKLLEQLELLLLRQKLEEASQKESK---AREGLAVLLALIKDGVGLRIISAAGRGGDLG 536
Query: 519 EEATYYSIAHT 529
Y +A +
Sbjct: 537 VAVANYKVAIS 547
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 43.4 bits (102), Expect = 7e-04
Identities = 50/243 (20%), Positives = 94/243 (38%), Gaps = 20/243 (8%)
Query: 1057 DDETV-NQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMY-- 1113
D E + QL + + L + E + + + +L D +D + M
Sbjct: 55 DAEQIWQQLKLQNKPILDDLNQKYVEFLINKE--HIRVLAKLQDSESHEDGSDGSDMDSE 112
Query: 1114 ------EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVK 1167
EEE E E E M + + ++S + EDD EEEE +
Sbjct: 113 DSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAGEE 172
Query: 1168 PVTK-KRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKR 1226
+ + KK KS V++ + +E + ++ ++ + DED+ E+ +
Sbjct: 173 KESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDG 232
Query: 1227 KSTKTPAKDKHKEDDEEKHV--------VSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
K + + +EDDEE ++ K KKK ++ E+E+ E ++E +A
Sbjct: 233 KDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSK 292
Query: 1279 RKS 1281
+
Sbjct: 293 PEE 295
Score = 38.4 bits (89), Expect = 0.027
Identities = 49/246 (19%), Positives = 93/246 (37%), Gaps = 14/246 (5%)
Query: 1044 HQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVK 1103
H+D + + + D + EDE + +D E +++L D +
Sbjct: 100 HEDGSDGSDMDSEDSADDEE--EEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETE 157
Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTD----QEWLKAIEDD----G 1155
DD+E M EEE EKE ++ + D E+L+A E +
Sbjct: 158 DDEE--KKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAAL 215
Query: 1156 ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDED 1215
D+ + E+ + G E+ + E E + + KK D+ E
Sbjct: 216 GDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAEL 275
Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
E+DE K+ +K + +++ +EDDE++ + +++ E + + + E D
Sbjct: 276 EDDEPDKEAVKKEADSKPEEEDEEDDEQED--DQDEEEPPEAAMDKVKLDEPVLEGVDLE 333
Query: 1276 SKPRKS 1281
S S
Sbjct: 334 SPKELS 339
Score = 37.7 bits (87), Expect = 0.042
Identities = 41/230 (17%), Positives = 87/230 (37%), Gaps = 17/230 (7%)
Query: 1046 DDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDD 1105
++ E ++ DDET + + E+E +K E+ E K K D
Sbjct: 142 NESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFD----KSGVDDKFFKL 197
Query: 1106 DEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEE 1165
DE+ + E E E + D+ D D ++D +EE++E
Sbjct: 198 DEMNEFLEATEAEEEA---------ALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDE 248
Query: 1166 VKPVTKKRGSGK----KGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESP 1221
+ + K K + + +DE +++ VKK K + +ED+++ED+
Sbjct: 249 EGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQD 308
Query: 1222 KKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
+++ ++ K + K + + ++++ + E+E
Sbjct: 309 EEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKE 358
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 43.3 bits (102), Expect = 0.001
Identities = 76/359 (21%), Positives = 128/359 (35%), Gaps = 72/359 (20%)
Query: 989 NEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERHEF-LQTILHQDD 1047
N+VR + + V+ I+ R K ++ E++ Q G F ++ E +
Sbjct: 1039 NKVRFILAV----VDGEIIVNNRKKADLVEELRQKG-FTPFPKKAKSVEAAVAGATDDAA 1093
Query: 1048 EEEEENEVPDDETV-----NQLIARTEDELTL--FQKMDAERRKGE--------TKPRLV 1092
EEEEE +V V + L++ LTL Q++ A+R K T P+ +
Sbjct: 1094 EEEEEIDVSSSSGVRGSDYDYLLSMAIGTLTLEKVQELCADRDKLNIEVEDLKKTTPKSL 1153
Query: 1093 EIKELPDW-MVKDDDEVENSMYEEEREREKEEALMGRG------SRQRKQ----VDYTDS 1141
+K+L D + E++ EE RE+ + A G SRQ + T
Sbjct: 1154 WLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKK 1213
Query: 1142 LTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKK-------------------- 1181
++ E + A + E E VKP + G+ KK
Sbjct: 1214 ASESETTEETYGSSAMETENVAEVVKP-KGRAGAKKKAPAAAKEKEEEDEILDLKDRLAA 1272
Query: 1182 ----------SYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK-----KR 1226
+ +EET +K+ ISD D+D++D + + K+
Sbjct: 1273 YNLDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKK 1332
Query: 1227 KSTKTPAKDKHKE----DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
K + PA K +K + + E ++ AE + K K R S
Sbjct: 1333 KGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRAS 1391
Score = 40.2 bits (94), Expect = 0.009
Identities = 32/133 (24%), Positives = 49/133 (36%), Gaps = 10/133 (7%)
Query: 1150 AIEDDGADDYEEEEEEV-KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
I D DD + E KK+G K + P ++ P + +K +
Sbjct: 1309 VISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLT 1368
Query: 1209 ISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE--EEVEEAEG 1266
+ + SP+KK RK +P K V R E E E V +
Sbjct: 1369 EMLKPAEAIGISPEKKVRKMRASPFNKK-------SGSVLGRAATNKETESSENVSGSSS 1421
Query: 1267 EEEEEDDKASKPR 1279
E++E D ++KPR
Sbjct: 1422 SEKDEIDVSAKPR 1434
Score = 33.7 bits (77), Expect = 0.75
Identities = 37/165 (22%), Positives = 65/165 (39%), Gaps = 22/165 (13%)
Query: 381 FLLSQTDEYISNLTQ-MVKEHKEEQQKKMREEKKKKKKSIKQ------KILDGNMDKLDL 433
+LLS I LT V+E ++ K E + KK + K L+ +DKLD
Sbjct: 1114 YLLSMA---IGTLTLEKVQELCADRDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDK 1170
Query: 434 DESSQLEDLRITVK-----ETNSDNKLRGE---DAPPAKQIKEWLASHPGWEVVMEDSED 485
+++ E + E+ + K+ + P K K+ S E + +
Sbjct: 1171 EDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAME 1230
Query: 486 EGSEDEEMKEKKEEPSK----AVGEKKTEDDEYKDAGEEATYYSI 526
+ E +K K +K A ++K E+DE D + Y++
Sbjct: 1231 TENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNL 1275
Score = 30.2 bits (68), Expect = 8.5
Identities = 18/90 (20%), Positives = 33/90 (36%), Gaps = 15/90 (16%)
Query: 1146 EWLKAIEDDG-ADDYEEEEEEVKPVTKKRGS-----GKKGKKSYVEETAPEDEEEE---- 1195
E LK E G + + + + P KK GS + E + E+
Sbjct: 1369 EMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEID 1428
Query: 1196 -----KPVKKRRRRKKVQISDEDEDEEDES 1220
+P + R++ +SD + + D+S
Sbjct: 1429 VSAKPRPQRANRKQTTYVLSDSESESADDS 1458
>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase. Members of this family
include the DEAD and DEAH box helicases. Helicases are
involved in unwinding nucleic acids. The DEAD box
helicases are involved in various aspects of RNA
metabolism, including nuclear transcription, pre mRNA
splicing, ribosome biogenesis, nucleocytoplasmic
transport, translation, RNA decay and organellar gene
expression.
Length = 169
Score = 40.7 bits (96), Expect = 0.001
Identities = 31/143 (21%), Positives = 56/143 (39%), Gaps = 15/143 (10%)
Query: 574 GLGKTVQTIALLTYLMEKKKNLGPY--LVIVPLSTLSNWVLE-FEKWAPSANV---VAYK 627
G GKT LL L G LV+ P L+ + E +K + +
Sbjct: 24 GSGKT--LAFLLPILQALLPKKGGPQALVLAPTRELAEQIYEELKKLFKILGLRVALLTG 81
Query: 628 GSPLARRALQGRMKNSKFNVLLTTYEFI---IKDKSVLSKIQWKCMIIDEGHRM--KNHH 682
G+ L +A + +K K ++L+ T + ++ + K +++DE HR+
Sbjct: 82 GTSLKEQARK--LKKGKADILVGTPGRLLDLLRRGKLKLLKNLKLLVLDEAHRLLDMGFG 139
Query: 683 CKLTQTLNTFYNSSHRVLLTGTP 705
L + L+ +LL+ T
Sbjct: 140 DDLEEILSRLPPDRQILLLSATL 162
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 42.0 bits (99), Expect = 0.002
Identities = 33/178 (18%), Positives = 68/178 (38%), Gaps = 24/178 (13%)
Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
+D E+ YE+E + EK+ + R ++ + ++ E + + ++ DD EEE
Sbjct: 193 EDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEE 252
Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPK 1222
++ S +G +S E + K +R + K+ +
Sbjct: 253 SDD--------ESAWEGFESEYEPINKPVRPKRKTKAQRNKEKR---------------R 289
Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
K+ + K + K K + + K+ +E+ + E +E + K K RK
Sbjct: 290 KELEREAKEEKQLKKKLAQLAR-LKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRK 346
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits in
yeasts and in human cells. Twelve of these are akin to
RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34
and Rpc82 form a cluster of enzyme-specific subunits that
contribute to transcription initiation in S.cerevisiae
and H.sapiens. There is evidence that these subunits are
anchored at or near the N-terminal Zn-fold of Rpc1,
itself prolonged by a highly conserved but RNA polymerase
III-specific domain.
Length = 221
Score = 40.9 bits (96), Expect = 0.002
Identities = 23/85 (27%), Positives = 44/85 (51%), Gaps = 12/85 (14%)
Query: 1191 DEEEEKPVKKRRRRKKVQISDEDEDEEDESPK--KKKRKSTKTPAKDKHKEDDEEKHVVS 1248
+++ K + + ++KV + E+E++ DE +KK K + D+ E DEE+
Sbjct: 123 NKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE--- 179
Query: 1249 KRKKKVVEEEEEVEEAEGEEEEEDD 1273
EEEEE +E +++++DD
Sbjct: 180 -------EEEEEEDEDFDDDDDDDD 197
Score = 30.5 bits (69), Expect = 4.7
Identities = 16/60 (26%), Positives = 32/60 (53%)
Query: 1215 DEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDK 1274
+++ + K + +ED +EK + ++K K +E E+ EE E +EEEE+++
Sbjct: 123 NKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part of
a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 42.4 bits (100), Expect = 0.002
Identities = 48/251 (19%), Positives = 99/251 (39%), Gaps = 23/251 (9%)
Query: 1046 DDEEEEENEVPDDETVNQLIAR-------------TEDELTLFQKMDA--ERRKGETKPR 1090
+EE+E+++ DD+ N + R L + M R+K E
Sbjct: 334 SEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAE 393
Query: 1091 LVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDS-LTDQEWLK 1149
+ E++ + + D+E ++ R K G + +++ E+ +
Sbjct: 394 IEELRRELEGEEESDEEENEEPSKKNVGRRK----FGPENGEKEAESKKLKKENKNEFKE 449
Query: 1150 AIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQI 1209
E D ++ E+EEE K K+ +K+ EE E +EE +K K
Sbjct: 450 KKESDEEEE-LEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKS-- 506
Query: 1210 SDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
+ + + ++ S K K + + A K K+ +++ + + EE+ + + EE+
Sbjct: 507 AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEED 566
Query: 1270 EEDDKASKPRK 1280
E+D++ K
Sbjct: 567 EDDEELPFLFK 577
Score = 41.2 bits (97), Expect = 0.004
Identities = 36/191 (18%), Positives = 73/191 (38%), Gaps = 30/191 (15%)
Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDG------- 1155
+DDDE ++ R+K L ++ + + L ++++ E
Sbjct: 339 EDDDEDDDDGENPWMLRKKLGKL-----KEGEDDEENSGLLSMKFMQRAEARKKEENDAE 393
Query: 1156 ----ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAP-EDEEEEKPVKKRRRRKKVQIS 1210
+ E EEE + ++ G++ + E E E ++ + + K+ + S
Sbjct: 394 IEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKES 453
Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
DE+E+ EDE K ++ + K + S++ +K EEEE EE +
Sbjct: 454 DEEEELEDEEEAKVEKVANKLLKR-------------SEKAQKEEEEEELDEENPWLKTT 500
Query: 1271 EDDKASKPRKS 1281
S ++
Sbjct: 501 SSVGKSAKKQD 511
Score = 38.9 bits (91), Expect = 0.019
Identities = 43/197 (21%), Positives = 83/197 (42%), Gaps = 25/197 (12%)
Query: 1101 MVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYE 1160
K D+E ++ E+ R + E L RK + S D++ E++ DD E
Sbjct: 293 RAKYDEEARKALEEQLR---QGEELR------RKIEGKSVSEEDEDEDSDSEEEDEDDDE 343
Query: 1161 EEEEEVKPVTKKRGSGKKGKKSYVEETAP-------------EDEEEEKPVKKRRRRKKV 1207
++++ P ++ GK + EE + + EE + +++ RR +
Sbjct: 344 DDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEG 403
Query: 1208 QISDEDEDEEDESPKKKKRKS---TKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEA 1264
+ ++E+ E+ S K R+ + + K+ +E K KK+ EEEE +E
Sbjct: 404 EEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEE 463
Query: 1265 EGEEEEEDDKASKPRKS 1281
E + E+ +K K +
Sbjct: 464 EAKVEKVANKLLKRSEK 480
Score = 37.7 bits (88), Expect = 0.044
Identities = 60/257 (23%), Positives = 93/257 (36%), Gaps = 59/257 (22%)
Query: 1047 DEEEEENEVPDDETVNQLIARTEDELTLFQKMDAE-----------RRK---------GE 1086
E + E D E I EL ++ D E RRK E
Sbjct: 381 RAEARKKEENDAE-----IEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAE 435
Query: 1087 TKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQE 1146
+K E K + D+E E EE + + L+ R + +K+ + + +
Sbjct: 436 SKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENP 495
Query: 1147 WLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKK 1206
WLK G K +KK+ S K K ++ + VK ++++KK
Sbjct: 496 WLKTTSSVGKS-------AKKQDSKKKSSSKLDKA--------ANKISKAAVKVKKKKKK 540
Query: 1207 VQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE---------- 1256
+ D D+D DE S K D+ EDDEE + K+K + E
Sbjct: 541 EKSIDLDDDLIDEE------DSIKLDVDDEEDEDDEELPFLFKQKDLIKEAFAGDDVVAE 594
Query: 1257 ---EEEEVEEAEGEEEE 1270
E++EV E E +E
Sbjct: 595 FEKEKKEVIEEEDPKEI 611
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 41.1 bits (96), Expect = 0.003
Identities = 34/179 (18%), Positives = 50/179 (27%), Gaps = 15/179 (8%)
Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYE-EEEEEVKPVTKK 1172
E R E+ G+G + S T ++ A E P +
Sbjct: 75 TAEESRHGEKEERGQGGPSGSGSESVGSPTPSP--SGSAEELASGLSPENTSGSSPESPA 132
Query: 1173 RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK-------- 1224
S S+ P E P ++ +Q S ED EE E P +
Sbjct: 133 SHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192
Query: 1225 KRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEE----EVEEAEGEEEEEDDKASKPR 1279
+ T T + DE S ++ E E+ E E E R
Sbjct: 193 PQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHR 251
Score = 35.7 bits (82), Expect = 0.17
Identities = 24/162 (14%), Positives = 42/162 (25%), Gaps = 10/162 (6%)
Query: 1127 GRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEV-------KPVTKKRGSGKKG 1179
+ + + T ++A+ + G E + E K + G G
Sbjct: 37 RGTAARAAKPAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSG 96
Query: 1180 KKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKE 1239
+S T E+ S E P PA +
Sbjct: 97 SESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHN 156
Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+ S + E+ EE E E + + P +S
Sbjct: 157 PSPNQQPSSFLQP---SHEDSPEEPEPPTSEPEPDSPGPPQS 195
Score = 34.2 bits (78), Expect = 0.41
Identities = 11/63 (17%), Positives = 22/63 (34%), Gaps = 4/63 (6%)
Query: 1215 DEEDESPKKKKRKS--TKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAE--GEEEE 1270
E +++ + PA + V+++ + E + E E GE+EE
Sbjct: 27 GGTREQAGRRRGTAARAAKPAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEE 86
Query: 1271 EDD 1273
Sbjct: 87 RGQ 89
Score = 32.6 bits (74), Expect = 1.4
Identities = 12/83 (14%), Positives = 24/83 (28%), Gaps = 3/83 (3%)
Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
K+ R + + + +R+ T A + + V E+
Sbjct: 9 KRSRHTLRSSCRGHCKRHGGTREQAGRRRGTAARAAKPAPPAPTTS---GPQVRAVAEQG 65
Query: 1259 EEVEEAEGEEEEEDDKASKPRKS 1281
E++ E EE K +
Sbjct: 66 HRQTESDTETAEESRHGEKEERG 88
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 41.4 bits (97), Expect = 0.004
Identities = 37/179 (20%), Positives = 68/179 (37%), Gaps = 16/179 (8%)
Query: 1101 MVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYE 1160
M +DD E E +ERER+ + S+ + S+ QE + + ++ E
Sbjct: 393 MKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEE 452
Query: 1161 EEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDES 1220
EEEEE + + G+ ++ E EEE + + D + E
Sbjct: 453 EEEEEEEQ-ESEEEEGEDEEEEEEVEADNGSEEEMEG-------------SSEGDGDGEE 498
Query: 1221 PKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEE--EEEVEEAEGEEEEEDDKASK 1277
P++ + A + ++ S + + EE + E +AE EE D++
Sbjct: 499 PEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLA 557
Score = 32.2 bits (73), Expect = 2.4
Identities = 19/70 (27%), Positives = 35/70 (50%), Gaps = 2/70 (2%)
Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
+D+ EE+E K+++R+ T ++ S +E EE E E EEEEE
Sbjct: 395 QDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMAS--QESEEEESVEEEEEEE 452
Query: 1272 DDKASKPRKS 1281
+++ + ++S
Sbjct: 453 EEEEEEEQES 462
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 38.9 bits (91), Expect = 0.005
Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 5/95 (5%)
Query: 1189 PEDEEEEKPVKKRRRRKKVQISDEDED-EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVV 1247
P + E KK+++ +I ++ EE + K KK+KS K KDK K+DD++
Sbjct: 51 PIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD-- 108
Query: 1248 SKRKKKVVEEEEEVEEAEGEEEEEDDKAS--KPRK 1280
KK E E+++E+ E S KPRK
Sbjct: 109 KSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143
Score = 33.1 bits (76), Expect = 0.57
Identities = 14/79 (17%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
+E EE++ V K+ +K K + +D++++K K+ + + + +++ +++ E
Sbjct: 65 KELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK-DEKEAEDKLE 123
Query: 1220 SPKKKKRKSTKTPAKDKHK 1238
K ++ T ++ K +
Sbjct: 124 DLTKSYSETLSTLSELKPR 142
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 40.6 bits (95), Expect = 0.006
Identities = 28/123 (22%), Positives = 46/123 (37%), Gaps = 16/123 (13%)
Query: 1155 GADDYEEEEEEVKPV---TKKRGSGKKGK----KSYVEETAPEDEEEEKPVKKRRRRKKV 1207
G+ + E P + K KKGK K E+ P+DEE+ K+ ++K
Sbjct: 398 GSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGR 457
Query: 1208 QISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
S + + KK K +ED+ KK++E ++EE E
Sbjct: 458 DKSSKVPSDSKAGGKK---------ESVKSQEDNNNIPPEEWVMKKILEWVPDLEEDGTE 508
Query: 1268 EEE 1270
+
Sbjct: 509 DPG 511
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 40.0 bits (94), Expect = 0.009
Identities = 18/130 (13%), Positives = 43/130 (33%), Gaps = 2/130 (1%)
Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
+ D + E ++ K + K+ ++ ++ + K +K+ K
Sbjct: 57 IFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116
Query: 1209 ISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEE 1268
+ +D + + D DD++ + ++E EE +E
Sbjct: 117 DLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEE--KKE 174
Query: 1269 EEEDDKASKP 1278
+E +K S
Sbjct: 175 AKELEKLSDD 184
Score = 38.1 bits (89), Expect = 0.036
Identities = 27/153 (17%), Positives = 54/153 (35%), Gaps = 6/153 (3%)
Query: 1130 SRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAP 1189
Q QV S ++ A E D + + K K + KK K
Sbjct: 49 PEQIDQVLIFLSGMVKDTDDATESDIPKK--KTKTAAKAAAAKAPAKKKLKDELDSSKKA 106
Query: 1190 EDEEEEKPVKKRRRRKKVQISDE----DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
E + K + + ++ D+D++D+ D +DD++
Sbjct: 107 EKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVD 166
Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
+ KK+ E E+ ++ + +E+D +A +
Sbjct: 167 DEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQ 199
Score = 33.4 bits (77), Expect = 0.77
Identities = 19/139 (13%), Positives = 46/139 (33%), Gaps = 7/139 (5%)
Query: 1143 TDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRR 1202
+ + D ++ E P K + + K + +DE + +
Sbjct: 50 EQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE-- 107
Query: 1203 RRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE 1262
KK + +D+ + + D DD++ ++E++ +
Sbjct: 108 --KKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDD---DDDEDDDEDDDD 162
Query: 1263 EAEGEEEEEDDKASKPRKS 1281
+ +E+EE +A + K
Sbjct: 163 DDVDDEDEEKKEAKELEKL 181
Score = 32.7 bits (75), Expect = 1.3
Identities = 26/187 (13%), Positives = 69/187 (36%), Gaps = 16/187 (8%)
Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKD 1104
++ E ++ + V ++ + + D ++K +T + K K
Sbjct: 39 KEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAK---KK 95
Query: 1105 DDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEE 1164
+ +S + E++ ++ + ++ D D + ++DD DD +++E+
Sbjct: 96 LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155
Query: 1165 EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK 1224
+ + ++ EDEE+++ + + +++D E +K
Sbjct: 156 DDE-------------DDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARK 202
Query: 1225 KRKSTKT 1231
K T T
Sbjct: 203 DAKLTAT 209
Score = 30.3 bits (69), Expect = 8.1
Identities = 37/198 (18%), Positives = 72/198 (36%), Gaps = 24/198 (12%)
Query: 341 EKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEH 400
E+ +KK ++ K + + + ++E ++ ++ KK Q D+ + L+ MVK+
Sbjct: 14 EEAKKKLKKLAAKSKSKGFITKEEI--KEALESKKKTP-----EQIDQVLIFLSGMVKDT 66
Query: 401 KEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDA 460
+ + KKK K + K K DE + D+ +D
Sbjct: 67 DDATESD--IPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDI 124
Query: 461 PPAKQIKEWLASHPGWEVVMEDSEDEG---------------SEDEEMKEKKEEPSKAVG 505
Q + ++ +D +D+ EDEE KE KE +
Sbjct: 125 DVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDD 184
Query: 506 EKKTEDDEYKDAGEEATY 523
+ D++ +A +A
Sbjct: 185 DDFVWDEDDSEALRQARK 202
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with the
process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli synthesizes
cobalamin only when it is supplied with the precursor
cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with a
MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The other
two subunits are the P. denitrificans CobS (TIGR01650)
and CobN (pfam02514 CobN/Magnesium Chelatase) proteins.
To avoid potential confusion with the nonhomologous
Salmonella typhimurium/E.coli cobT gene product, the P.
denitrificans gene symbol is not used in the name of this
model [Biosynthesis of cofactors, prosthetic groups, and
carriers, Heme, porphyrin, and cobalamin].
Length = 600
Score = 39.9 bits (93), Expect = 0.009
Identities = 27/128 (21%), Positives = 53/128 (41%), Gaps = 4/128 (3%)
Query: 1140 DSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK 1199
+ L E + + DD + EE+ ++ +P ++ +G+ E +AP++ E
Sbjct: 192 EMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRES 251
Query: 1200 KRRRRKKVQISDEDEDEE--DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEE 1257
+ + VQ +D +E D+S + P E D + V + + V+
Sbjct: 252 ESGEEEMVQSDQDDLPDESDDDSETPGEGARPARPFTSTGGEPDYK--VFTTAFDETVDA 309
Query: 1258 EEEVEEAE 1265
EE +E E
Sbjct: 310 EELCDEEE 317
>gnl|CDD|234468 TIGR04095, dnd_restrict_1, DNA phosphorothioation system
restriction enzyme. The DNA phosphorothioate
modification system dnd (DNA instability during
electrophoresis) recently has been shown to provide a
modification essential to a restriction system. This
protein family was detected by Partial Phylogenetic
Profiling as linked to dnd, and its members usually are
clustered with the dndABCDE genes.
Length = 451
Score = 39.2 bits (92), Expect = 0.011
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 20/144 (13%)
Query: 545 KLKEYQIKGL-EWMVSLYNNNLNGILADEMGLGKTVQTIALLTYLMEKKKNLGPYLVIVP 603
+L++YQ + + W + NN GIL G GKT+ +A + L EK L LV+ P
Sbjct: 8 ELRDYQKEAIRAW----FKNNGRGILKMATGTGKTLTALAAASKLYEKIGLL-VLLVVCP 62
Query: 604 LSTL-SNWVLEFEKWA--PSANVVAYKGSP-----LARRALQGRMKNSKFNVLLTTYEFI 655
L W E EK+ P ++ Y+ L+ N KF ++TT
Sbjct: 63 YQHLVDQWAREAEKFGLNP---ILCYESVSNWQSELSTGLYNLNSGNQKFLAIITTNATF 119
Query: 656 IKDK--SVLSKIQWKCMII-DEGH 676
I S L + K ++I DE H
Sbjct: 120 IGKNFQSQLRRFPGKTLLIGDEAH 143
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina
virus topoisomerase, Variola virus topoisomerase, Shope
fibroma virus topoisomeras.
Length = 391
Score = 38.9 bits (91), Expect = 0.014
Identities = 19/82 (23%), Positives = 40/82 (48%)
Query: 1192 EEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRK 1251
+E+ K +K + +R K I + + + K K + + KE +EK K+K
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342
Query: 1252 KKVVEEEEEVEEAEGEEEEEDD 1273
K++ EE +E+ E + ++++
Sbjct: 343 KQIERLEERIEKLEVQATDKEE 364
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 39.0 bits (91), Expect = 0.016
Identities = 17/55 (30%), Positives = 32/55 (58%)
Query: 1207 VQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEV 1261
++ DE+E+EE E K+++ K+T + +E+ EEK +K+ K+ E E +
Sbjct: 33 KEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87
Score = 38.6 bits (90), Expect = 0.021
Identities = 17/52 (32%), Positives = 28/52 (53%)
Query: 1180 KKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT 1231
K+ E E+EEE++ K+ + + + DE+EE E KKK +K +T
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
Score = 30.9 bits (70), Expect = 4.8
Identities = 17/53 (32%), Positives = 29/53 (54%), Gaps = 3/53 (5%)
Query: 1228 STKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
K K+ E++EE+ + KK+ E+ + EE EEEE+++K K +K
Sbjct: 27 VEKEVEKEVPDEEEEEE---KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 38.9 bits (91), Expect = 0.020
Identities = 34/204 (16%), Positives = 76/204 (37%), Gaps = 8/204 (3%)
Query: 1077 KMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSM-YEEEREREKEEALMGRGSRQRKQ 1135
+++ + V+ E K D++ NS ++E + E++ ++ K
Sbjct: 1186 ADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKS 1245
Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEE--TAPEDEE 1193
S + ++ + DD + + + + K V+ + S K E +
Sbjct: 1246 KKNNSSKSSEDNDEFSSDDLSKE-GKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSS 1304
Query: 1194 EEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
K K+R + + + E ++ +KKK K+ A + RKKK
Sbjct: 1305 PTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRR----PRKKK 1360
Query: 1254 VVEEEEEVEEAEGEEEEEDDKASK 1277
E+ +++E ++ E++D
Sbjct: 1361 SDSSSEDDDDSEVDDSEDEDDEDD 1384
Score = 34.3 bits (79), Expect = 0.58
Identities = 34/191 (17%), Positives = 66/191 (34%), Gaps = 16/191 (8%)
Query: 365 EGYRKLIDQ--KKDKRLAFLLSQT------------DEYISNLTQMVKEHKEEQQKKMRE 410
E KL + KK+K L L + T +E + ++ ++ ++Q+ +
Sbjct: 1102 EKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSK 1161
Query: 411 EKKKKKKSIKQKILDGNMDKL-DLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEW 469
K K K K K+ K + S+ + K +SD K + +D P K+
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSS 1221
Query: 470 LASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDA-GEEATYYSIAH 528
+ E + + + K+ S ++ + DD K+ + A A
Sbjct: 1222 GSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAV 1281
Query: 529 TYHEKVTEQAS 539
Y +
Sbjct: 1282 QYSPPPPSKRP 1292
Score = 30.4 bits (69), Expect = 8.1
Identities = 21/101 (20%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
E A E++EE + + + ++ + + + KKK K K + DK K+ +
Sbjct: 1138 EEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197
Query: 1246 -----VVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
KRK + ++ + ++E+++++ +KP+KS
Sbjct: 1198 SKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKS 1238
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 38.3 bits (89), Expect = 0.031
Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 6/94 (6%)
Query: 1161 EEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDES 1220
E E E K +T++ KK K EE A E+E K +K ++ K ++ + +
Sbjct: 5 ESEAEKKILTEEELERKKKK----EEKA--KEKELKKLKAAQKEAKAKLQAQQASDGTNV 58
Query: 1221 PKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKV 1254
PKK ++KS K +D++ ED + +KK++
Sbjct: 59 PKKSEKKSRKRDVEDENPEDFIDPDTPFGQKKRL 92
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family are
designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 37.4 bits (87), Expect = 0.034
Identities = 37/179 (20%), Positives = 66/179 (36%), Gaps = 15/179 (8%)
Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
+D+ ++EEE + EE + + + D D E E++G + + E
Sbjct: 24 EDEFFWTYLLFEEEED--DEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQRE 81
Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPK 1222
E K K + K+ K ++ + + R +KK + D +
Sbjct: 82 ERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKA--AAPRPKKKSERISWAPTLLDSPRR 139
Query: 1223 KKKRKST-----KTPAKDKHKEDDEEKHVVSKRKKKVVEE------EEEVEEAEGEEEE 1270
K R ST T + K +E +K RK+K ++ EE + EA+ E
Sbjct: 140 KSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERI 198
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 38.4 bits (89), Expect = 0.034
Identities = 44/222 (19%), Positives = 89/222 (40%), Gaps = 18/222 (8%)
Query: 1059 ETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDD--DEVENSMYEEE 1116
E + + +D L + D + + E++ + + KD E ENS EEE
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEE 3884
Query: 1117 REREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG 1176
+ EE D ++SL ++ W + E+D + ++ E+ +
Sbjct: 3885 NQDLDEEV-------NDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVS 3937
Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
K+ + A ED++ ++ + V I DE + + E+ + ++ +
Sbjct: 3938 KED-----DNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPED 3992
Query: 1237 HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
K D++E V K E+ ++E A+ +EE D + +P
Sbjct: 3993 LKLDEKEGDV----SKDSDLEDMDMEAADENKEEADAEKDEP 4030
Score = 37.3 bits (86), Expect = 0.074
Identities = 47/241 (19%), Positives = 101/241 (41%), Gaps = 36/241 (14%)
Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKD 1104
++DEEE ++V D+ + + P E +LP+ + D
Sbjct: 3953 KEDEEEMSDDVGIDDEIQ----------------PDIQENNSQPPPENEDLDLPEDLKLD 3996
Query: 1105 DDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEE 1164
+ E + S ++ + E+ M +++ D Q+ +E++ D + +++
Sbjct: 3997 EKEGDVS-----KDSDLEDMDMEAADENKEEADAEKDEPMQDE-DPLEENNTLDEDIQQD 4050
Query: 1165 EVKPVTKKRGSGKKGKKSYVEETAPEDEEE-EKPVKKRRRRKKVQISDEDEDEEDESPKK 1223
+ + + K + EE E+EE E VK ++ + ED+ ++ K
Sbjct: 4051 DFSDLAEDDE---KMNEDGFEENVQENEESTEDGVKSDEELEQ---GEVPEDQAIDNHPK 4104
Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE---EAEGEEEEEDDKASKPRK 1280
KST A + E++ +K +V + ++ + EE+ V A+GE E+ + S P++
Sbjct: 4105 MDAKSTFASA--EADEENTDKGIVGENEE--LGEEDGVRGNGTADGEFEQVQEDTSTPKE 4160
Query: 1281 S 1281
+
Sbjct: 4161 A 4161
Score = 35.7 bits (82), Expect = 0.20
Identities = 35/167 (20%), Positives = 70/167 (41%), Gaps = 16/167 (9%)
Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
E+ E E G GS + D T++L + + L+ + ++ + + +E + +
Sbjct: 3805 EKSNSGELESG-TGLGSGVGAE-DITNTLNEDDDLEELANEEDTANQSDLDESEARELES 3862
Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEED-------ESPKKKKR 1226
K S V E D EEE + +++D ED + + P ++
Sbjct: 3863 DMNGVTKDSVVSENENSDSEEEN------QDLDEEVNDIPEDLSNSLNEKLWDEPNEEDL 3916
Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
T+ + ++ ++E V + K +E+++ +E E EEE DD
Sbjct: 3917 LETEQKSNEQSAANNESDLVSKEDDNKALEDKDR-QEKEDEEEMSDD 3962
Score = 34.2 bits (78), Expect = 0.67
Identities = 37/226 (16%), Positives = 94/226 (41%), Gaps = 18/226 (7%)
Query: 1047 DEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDD 1106
+ + +E E + + T+D ++ + + + E + E+ ++P+ +
Sbjct: 3847 ANQSDLDESEARELESDMNGVTKD--SVVSENENSDSEEENQDLDEEVNDIPE---DLSN 3901
Query: 1107 EVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEV 1166
+ +++E E + E + + Q + +D ++ ++ KA+ED +E+EEE+
Sbjct: 3902 SLNEKLWDEPNEEDLLETEQ-KSNEQSAANNESDLVSKEDDNKALED-KDRQEKEDEEEM 3959
Query: 1167 KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKR 1226
+ + + PE+E+ + P + + +K +D D ED
Sbjct: 3960 SDDVGIDDEIQPDIQENNSQPPPENEDLDLP-EDLKLDEKEGDVSKDSDLED-------- 4010
Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
A D++KE+ + + + + +EE ++E +++ D
Sbjct: 4011 --MDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSD 4054
Score = 30.7 bits (69), Expect = 6.8
Identities = 45/248 (18%), Positives = 91/248 (36%), Gaps = 36/248 (14%)
Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAE--------RRKGETKPRLVEIKE 1096
Q++EE E+ V DE + Q + + KMDA+ + K + E +E
Sbjct: 4072 QENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGENEE 4131
Query: 1097 LPDW-MVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDG 1155
L + V+ + + + + + + M RQ + + D L +EW +A
Sbjct: 4132 LGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLG--DHL--REWQQANRIHE 4187
Query: 1156 ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDED 1215
+D E + + + +V+ EDEEE+ + +++ D DE
Sbjct: 4188 WEDLTESQSQAF---------DDSEFMHVK----EDEEEDLQALGNAEKDQIKSIDRDES 4234
Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK----------VVEEEEEVEEAE 1265
+ A + + ++ +S K+ + + E+V E
Sbjct: 4235 ANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFELS 4294
Query: 1266 GEEEEEDD 1273
+E+ ED+
Sbjct: 4295 EDEDIEDE 4302
>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal. This
domain family is found in eukaryotes, and is typically
between 99 and 114 amino acids in length. The family is
found in association with pfam08697, pfam01585. There are
two completely conserved residues (G and F) that may be
functionally important. TIP is involved in enamel
assembly by interacting with one of the major proteins
responsible for biomineralisation of enamel - tuftelin.
Length = 106
Score = 35.0 bits (81), Expect = 0.035
Identities = 26/102 (25%), Positives = 49/102 (48%), Gaps = 10/102 (9%)
Query: 1185 EETAPEDEEEEKPVKKRRRR-KKVQI----SDEDEDEEDESPKKKKRKSTKTPAKDKHKE 1239
E T + + E P ++RRR+ K+ I +D+D+D+E E + + R+ +K KD K
Sbjct: 10 EITDYDLDNEFNPNRRRRRQTKEQAIYGIFADDDDDDEGERRRGRGRRRSKKK-KDYSK- 67
Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
VS K+ +E EE ++ + E++++
Sbjct: 68 ---PVAFVSGGIKQAAKEPEEDDKEDESEDDDESDEDDDEDD 106
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 37.5 bits (87), Expect = 0.038
Identities = 31/138 (22%), Positives = 55/138 (39%), Gaps = 13/138 (9%)
Query: 1140 DSLTDQEWLKAIEDDGADDYEEEEEEV----KPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
SL+ EWL + E + V +P + G S +++
Sbjct: 156 VSLSKIEWLNEDFPSELSPFIERIDSVNGPGEPEPEDDPKDSLGNGSSTNGLPDSSQDKN 215
Query: 1196 KPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV 1255
K +++ K+ + +D+ + K +KS D+ + EEK ++
Sbjct: 216 KSLEEYYE-KESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEK--------ELK 266
Query: 1256 EEEEEVEEAEGEEEEEDD 1273
E+EEE E E EEEE++D
Sbjct: 267 EDEEEEETEEEEEEEDED 284
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function of
NS2 is not known, but it may be involved in the transport
and condensation of viral mRNAs.
Length = 363
Score = 37.6 bits (87), Expect = 0.038
Identities = 33/137 (24%), Positives = 55/137 (40%), Gaps = 7/137 (5%)
Query: 1093 EIKELPDWMVKDDD--EVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKA 1150
E+ ELP D D E+ + EERE+EKEE M + + + DS D+ L
Sbjct: 140 EMPELPGVTRSDYDVRELRQKI-REEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDE 198
Query: 1151 IEDD---GADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKV 1207
D +++EE V ++ + +EE + +E++ + R
Sbjct: 199 EAPDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSD- 257
Query: 1208 QISDEDEDEEDESPKKK 1224
+ ED ED K +
Sbjct: 258 DEDEHGEDSEDGETKPE 274
Score = 34.5 bits (79), Expect = 0.37
Identities = 30/118 (25%), Positives = 46/118 (38%), Gaps = 1/118 (0%)
Query: 1074 LFQKMDAERRKG-ETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQ 1132
L QK+ ER K E +P K D DE EN + EE + E + + +
Sbjct: 157 LRQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEER 216
Query: 1133 RKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPE 1190
R V+ + + +E+ D +E+E E + G+ S ET PE
Sbjct: 217 RAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPE 274
Score = 32.5 bits (74), Expect = 1.3
Identities = 32/175 (18%), Positives = 74/175 (42%), Gaps = 22/175 (12%)
Query: 403 EQQKKMREEKKKKKKSIKQKILDG-----NMDKLDLDESSQLEDLRITVKETNSDNKLRG 457
E ++K+REE++K+K+ K D DE+ E+ ET+ ++
Sbjct: 156 ELRQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEE 215
Query: 458 EDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDA 517
A +++ E E++ + E+ ++++++E + ED+ +D+
Sbjct: 216 RRAAVERRLAE------LVEMINWNLEER---RRDLRKEQELEENVERDSDDEDEHGEDS 266
Query: 518 GEEATYYS--IAHTYHEKVTEQASIMVNGKLKEYQIKGLEWMVSLYNNNLNGILA 570
+ T I Y E+++E K+K+ ++ L M+ + +G++
Sbjct: 267 EDGETKPESYITSEYIERISEIR------KMKDERLSSLASMMPQASGEFDGMIF 315
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 38.1 bits (89), Expect = 0.042
Identities = 31/184 (16%), Positives = 67/184 (36%), Gaps = 47/184 (25%)
Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEE 1163
+ + E + ++ +++ R RQR++ D+ + +
Sbjct: 655 ESQQAEVTEKARTQDEQQQAP---RRERQRRR--------------------NDEKRQAQ 691
Query: 1164 EEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK--PVKKRRRRK----KVQISDEDEDEE 1217
+E K + E++ E E+EE+ V+ RR+++ KV+I EE
Sbjct: 692 QEAKALNV------------EEQSVQETEQEERVQQVQPRRKQRQLNQKVRIEQS-VAEE 738
Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
+P ++ + + ++ E V + E++E AE +
Sbjct: 739 AVAPVVEETVAAEPVVQEVPAPRTELVKVPLPVVAQTAPEQDEENNAENRDNN-----GM 793
Query: 1278 PRKS 1281
PR+S
Sbjct: 794 PRRS 797
Score = 36.2 bits (84), Expect = 0.15
Identities = 16/79 (20%), Positives = 39/79 (49%), Gaps = 1/79 (1%)
Query: 1192 EEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK-KRKSTKTPAKDKHKEDDEEKHVVSKR 1250
E E+ + RR+ ++ + + + + K + + + + P +++ + ++EK +
Sbjct: 634 ENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQE 693
Query: 1251 KKKVVEEEEEVEEAEGEEE 1269
K + EE+ V+E E EE
Sbjct: 694 AKALNVEEQSVQETEQEER 712
Score = 33.1 bits (76), Expect = 1.2
Identities = 40/154 (25%), Positives = 59/154 (38%), Gaps = 20/154 (12%)
Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETK-PRLVEIKELPDWMVK 1103
Q E E + E ART+DE + + +RR+ + K E K L
Sbjct: 649 QTAETRESQQAEVTEK-----ARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALN----- 698
Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQ-VDYTDSLTDQEWLKAIEDDGADDYEEE 1162
E S+ E E+E ++ R RQ Q V S+ ++ +E+ A
Sbjct: 699 ---VEEQSVQETEQEERVQQVQPRRKQRQLNQKVRIEQSVAEEAVAPVVEETVAA----- 750
Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK 1196
E V+ V R K V +TAPE +EE
Sbjct: 751 EPVVQEVPAPRTELVKVPLPVVAQTAPEQDEENN 784
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the maturation
of the 18S rRNA and for 40S ribosome production.
Length = 809
Score = 37.7 bits (88), Expect = 0.042
Identities = 38/165 (23%), Positives = 65/165 (39%), Gaps = 49/165 (29%)
Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
E ER ++ E R R+ + ++E K DD D++E ++++
Sbjct: 267 EAERLKKLEAE------RLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNF------ 314
Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
G G+ + EE +DE+EE D+D +EE+E
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEE-------------DDDDDLEEEEED------------- 348
Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
D E+++E +E+ + E+ E EEEEE +K K
Sbjct: 349 VDLSDEEEDE-----------EDEDSDDEDDEEEEEEEKEKKKKK 382
Score = 31.9 bits (73), Expect = 2.6
Identities = 37/184 (20%), Positives = 64/184 (34%), Gaps = 53/184 (28%)
Query: 1141 SLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK- 1199
SL++ + + DD D ++ + G + +E E++ E K K
Sbjct: 122 SLSEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGGED-----DEEDEEEQPERKKSKK 176
Query: 1200 ----------KRRRRKKVQISDEDEDEEDE------------SPKKKKRKSTKTPAKDKH 1237
K + ++ + +EDED +E K K TP ++K
Sbjct: 177 EVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTP-EEKD 235
Query: 1238 KEDD--------EEKHVVSKRKKKVVEEEEEVEEAE--------------GEEEEEDDKA 1275
E D + + + R K EEE EEAE GEEE+++++
Sbjct: 236 DEYDQRVRELTFDRRAQPTDRTK--TEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEE 293
Query: 1276 SKPR 1279
Sbjct: 294 DSKE 297
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is a
family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 37.5 bits (87), Expect = 0.043
Identities = 25/115 (21%), Positives = 42/115 (36%), Gaps = 4/115 (3%)
Query: 1167 KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKR 1226
K TKK G K S ++ + +E ++ + K E +P+ K+
Sbjct: 12 KAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKK----PKEASRPGTPRNPKK 67
Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
S T + E+ K S + + + E EEEEE ++ P S
Sbjct: 68 SSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPDPIAS 122
Score = 33.6 bits (77), Expect = 0.74
Identities = 21/84 (25%), Positives = 36/84 (42%), Gaps = 15/84 (17%)
Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKED-----DE-EKHVVSKRKKKVVE---------E 1257
D ++++P KK +K + + ED DE E+ +K KK E
Sbjct: 6 DSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNP 65
Query: 1258 EEEVEEAEGEEEEEDDKASKPRKS 1281
++ + E ++K +KPRKS
Sbjct: 66 KKSSKPTESSAASSEEKPAKPRKS 89
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 36.3 bits (84), Expect = 0.047
Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 1204 RKKVQISDEDEDEEDES--PKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEV 1261
R V+ + E D+ ++ P+ + S KT K K ++ +K+KKK EE E+
Sbjct: 21 RAHVKSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80
Query: 1262 EEAEGEEEEEDDKASKP 1278
E + EE+++ K
Sbjct: 81 AAEELSDSEENEENDKK 97
Score = 32.1 bits (73), Expect = 1.1
Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 9/93 (9%)
Query: 1152 EDDGADDYEEEEEEV---KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
D DD E+ E+ K KK+ + +K K ++T + +E+E+ + +
Sbjct: 31 VDKVNDDIFPEDVEIPSKKTSKKKKTTPRK--KKTTKKTKKKKKEKEEVPEL----AAEE 84
Query: 1209 ISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDD 1241
+SD +E+EE++ + + A + + ED
Sbjct: 85 LSDSEENEENDKKVDYELPKVQNTAAEVNHEDV 117
Score = 32.1 bits (73), Expect = 1.3
Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 3/59 (5%)
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
+ KKKK K K K+ +EK V + EE + EE E +++ D + K
Sbjct: 50 TSKKKKTTPRKKKTTKKTKKKKKEKEEV---PELAAEELSDSEENEENDKKVDYELPKV 105
>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
Length = 232
Score = 36.7 bits (85), Expect = 0.049
Identities = 27/97 (27%), Positives = 44/97 (45%), Gaps = 15/97 (15%)
Query: 1133 RKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDE 1192
R + D +E +ED+G ++ EEE+ +V+ + RG E+T +
Sbjct: 51 RIKADVGGLEVSEETEAEVEDEGGEEEEEEDADVETELRPRG--------LTEKTPELSD 102
Query: 1193 EEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKST 1229
EE + + +RRR K Q + +D KKKR T
Sbjct: 103 EEARLLTQRRREGKPQFNRQDY-------HKKKRTPT 132
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 37.4 bits (87), Expect = 0.052
Identities = 19/109 (17%), Positives = 46/109 (42%), Gaps = 17/109 (15%)
Query: 361 AEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKKKKKSIK 420
+ ++ ++ +KK+K+ D+ + L +++ E++E + K+ ++KKKKKK K
Sbjct: 11 KQQKQQNKQKGTKKKNKKSK---KDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKK 67
Query: 421 QKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEW 469
+K + E + + +D +++ W
Sbjct: 68 KK--------------NLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNW 102
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 37.4 bits (87), Expect = 0.054
Identities = 30/125 (24%), Positives = 49/125 (39%), Gaps = 13/125 (10%)
Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
D Y+ E KK+ KK ++ ++E E E E RK V +E+
Sbjct: 265 DYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDV-----EEEN 319
Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHV------VSKRKKKVVEEE--EEVEEAEGEE 1268
KKRK + +DD + + K++KK+ E+ ++ E E+
Sbjct: 320 ARLEDSPKKRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSED 379
Query: 1269 EEEDD 1273
EEE
Sbjct: 380 EEEAA 384
Score = 36.3 bits (84), Expect = 0.11
Identities = 36/175 (20%), Positives = 63/175 (36%), Gaps = 18/175 (10%)
Query: 1106 DEVEN-SMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEE 1164
DE+EN + E+E++++ E +++K D + K + Y+EE E
Sbjct: 166 DELENVELVEKEKDKKNLE------LKKKKPDYDPD-----DDDKFNKRSILSKYDEEIE 214
Query: 1165 EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQI-----SDEDEDEEDE 1219
K S +E + +E +K +K SD + E
Sbjct: 215 G-KKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMV 273
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDK 1274
KK K+K K + K ++DE + ++VEE E+ K
Sbjct: 274 KFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKK 328
Score = 31.7 bits (72), Expect = 2.8
Identities = 48/249 (19%), Positives = 94/249 (37%), Gaps = 24/249 (9%)
Query: 1044 HQDDEEEEENEVPDDETVNQLIA--RTEDELTLFQKMDAERRKGETKPRLVEIKELPDWM 1101
DDE E++ + + ++ E D KP+ + K+
Sbjct: 231 STDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRK 290
Query: 1102 VKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEE 1161
D+DE+E E L S RK V+ ++ + K E+ DD+ E
Sbjct: 291 DLDEDELEPE----------AEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFVE 340
Query: 1162 EEEEVKPV--TKKRGSGKKGKKSYVEETAPE--DEEEEKPVKKRRRRKKVQISDE----- 1212
++++++ ++R + KK KK E+ A + +E E + + DE
Sbjct: 341 DDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSEDEEEAAENNDNGIVIDETSEFV 400
Query: 1213 ---DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
++ +E P+ K + ++ ED+E++ + V+ +EE EE + E
Sbjct: 401 RSLQKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAI 460
Query: 1270 EEDDKASKP 1278
+P
Sbjct: 461 PSTILEEEP 469
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 37.1 bits (86), Expect = 0.056
Identities = 34/168 (20%), Positives = 51/168 (30%), Gaps = 24/168 (14%)
Query: 1085 GETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTD 1144
G + K + E+S ++E E E E+ S + D + +D
Sbjct: 141 GIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSD 200
Query: 1145 QEWLKAIEDDGADDYEEEEEEVKPVTKK-----RGSGKKGKKSYVEETAPEDEEEEKPVK 1199
E + D +EEE E P + E + D EE P
Sbjct: 201 SEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPS 260
Query: 1200 KRRRRKKVQ-------------------ISDEDEDEEDESPKKKKRKS 1228
K+ + KK DED D + K KRK+
Sbjct: 261 KKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQVVKKPVKRKN 308
Score = 33.3 bits (76), Expect = 0.96
Identities = 15/51 (29%), Positives = 23/51 (45%)
Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
+ K K K K K + E K K ++ E E+E + E E++ DD
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDD 193
Score = 32.9 bits (75), Expect = 1.2
Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 4/97 (4%)
Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
ET + + +K KK ++++ + SD+D++EE ES + K + + D +E+D +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202
Query: 1246 VVSKRKKKVV----EEEEEVEEAEGEEEEEDDKASKP 1278
S+ +V EEE E + E+ + S
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDE 239
Score = 32.9 bits (75), Expect = 1.2
Identities = 22/111 (19%), Positives = 40/111 (36%), Gaps = 3/111 (2%)
Query: 403 EQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPP 462
++ K ++ KK KKK K+ + ++ + ++ S+ E+ + + ED
Sbjct: 147 KKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSED--- 203
Query: 463 AKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDE 513
Q L E E +ED E E S+ + D E
Sbjct: 204 YSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSE 254
Score = 31.3 bits (71), Expect = 3.4
Identities = 16/65 (24%), Positives = 30/65 (46%)
Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
E + + KKK K +K + + D+E+ S+ + K E E+ + E EE+ +
Sbjct: 141 GIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSD 200
Query: 1272 DDKAS 1276
+ S
Sbjct: 201 SEDYS 205
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 36.3 bits (84), Expect = 0.061
Identities = 20/94 (21%), Positives = 38/94 (40%), Gaps = 9/94 (9%)
Query: 1185 EETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
+ A DE+E K + E+ EE++ + K K A +++DEE
Sbjct: 41 SDQAAADEQEAKKSDD-----QETAEIEEVKEEEKEAANSEDKEDKGDA----EKEDEES 91
Query: 1245 HVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
++ + + +E E E E E + + + P
Sbjct: 92 EEENEEEDEESSDENEKETEEKTESNVEKEITNP 125
Score = 31.2 bits (71), Expect = 2.6
Identities = 17/92 (18%), Positives = 40/92 (43%), Gaps = 6/92 (6%)
Query: 1151 IEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS 1210
+D +E+E K + + + K +E A +++E+K ++ +
Sbjct: 36 FPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEES---E 92
Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDDE 1242
+E+E+E++ES + +T K + + E
Sbjct: 93 EENEEEDEESSDE---NEKETEEKTESNVEKE 121
Score = 30.1 bits (68), Expect = 5.7
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 1/68 (1%)
Query: 1210 SDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
S D+ DE KK ++ KE+++E ++ K E+E+ E E EE
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEV-KEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 1270 EEDDKASK 1277
E+++ + +
Sbjct: 98 EDEESSDE 105
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 37.4 bits (86), Expect = 0.062
Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1178 KGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKH 1237
K + + + + +E K + K+ ++K I +ED D+ DE P+ + S P
Sbjct: 468 KFDEYWTRISKIQFTQEIKKLIKKSKKKLAPIEEEDSDKHDEPPEGPE-ASGLPPKAPGD 526
Query: 1238 KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKAS--KPRK 1280
KE +E +H SK + E + E EGE ++ A KP K
Sbjct: 527 KEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSK 571
Score = 30.0 bits (67), Expect = 9.8
Identities = 24/89 (26%), Positives = 38/89 (42%), Gaps = 13/89 (14%)
Query: 1167 KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE--DEDEEDESPKKK 1224
K V KR KS+ + T E+ EE +K+ + D+ + D+ED P ++
Sbjct: 844 KIVKLKR------SKSFDDLTTVEEAEEMGA-----EARKIVVDDDGTEADDEDTHPPEE 892
Query: 1225 KRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
K KS + K +K K+ KK
Sbjct: 893 KHKSEVRRRRPPKKPSKPKKPSKPKKPKK 921
>gnl|CDD|218292 pfam04851, ResIII, Type III restriction enzyme, res subunit.
Length = 100
Score = 34.1 bits (79), Expect = 0.074
Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 6/42 (14%)
Query: 666 QWKCMIIDEGHRM--KNHHCKLTQTLNTFYNSSHRVLLTGTP 705
Q +IIDE H K + K+ + + + + LT TP
Sbjct: 61 QALVIIIDEAHHSSAKTKYRKILE----KFKPAFLLGLTATP 98
>gnl|CDD|218481 pfam05177, RCSD, RCSD region. Proteins contain this region include
C.elegans UNC-89. This region is found repeated in UNC-89
and shows conservation in prolines, lysines and glutamic
acids. Proteins with RCSD are involved in muscle M-line
assembly, but the function of this region RCSD is not
clear.
Length = 101
Score = 34.1 bits (77), Expect = 0.084
Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 2/101 (1%)
Query: 1165 EVKPVTKKRGSGKKGKKSYVE--ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPK 1222
VK KK S + S E E EE + P KK + +KV + ++++SP+
Sbjct: 1 GVKSPGKKEKSPLRRSPSRTEKTEEDRATEEAKSPEKKEKSPEKVDGKPKSPTKKEKSPE 60
Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE 1263
K + K+P K + + E+ S KK+ V+++E+ E
Sbjct: 61 KSATEEVKSPTKKEKSPEKVEEKPASPTKKERVQKKEKSPE 101
>gnl|CDD|206017 pfam13846, DUF4196, Domain of unknown function (DUF4196). This is a
short region of ccdc82_homologues that is conserved from
Schizo. pombe up to humans. The function is not known.
Length = 112
Score = 34.5 bits (78), Expect = 0.086
Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 1/82 (1%)
Query: 1189 PEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDE-EKHVV 1247
P+ E+ + + QI +ED +EE+ K+KR S+ D+ + D + V
Sbjct: 25 PDSEKHPGQEEGDLNKHTGQIIEEDSEEENIKIGKRKRLSSVMSDSDESDDSDILSRKVA 84
Query: 1248 SKRKKKVVEEEEEVEEAEGEEE 1269
KR ++VVE+ E E E+
Sbjct: 85 IKRPRRVVEDSSEEMEQSSPEK 106
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 36.2 bits (84), Expect = 0.10
Identities = 24/117 (20%), Positives = 41/117 (35%), Gaps = 4/117 (3%)
Query: 1168 PVTKKRGSGKKGKKSYVE----ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKK 1223
K + S V E EEE V +K + +E ++ KK
Sbjct: 15 TTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKK 74
Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
KK+K K P + + + SK+ KK + + E+ + + + A K
Sbjct: 75 KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNV 131
Score = 31.2 bits (71), Expect = 3.4
Identities = 19/95 (20%), Positives = 38/95 (40%), Gaps = 5/95 (5%)
Query: 1187 TAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHV 1246
+ + + +P+ V + +E K + K +DK+ E +K
Sbjct: 13 SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNES--KKKS 70
Query: 1247 VSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
K+KKK ++E ++EGE + K +K+
Sbjct: 71 EKKKKKKKEKKE---PKSEGETKLGFKTPKKSKKT 102
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes the
fifth essential DNA polymerase in yeast EC:2.7.7.7. Pol5p
is localised exclusively to the nucleolus and binds near
or at the enhancer region of rRNA-encoding DNA repeating
units.
Length = 784
Score = 36.4 bits (84), Expect = 0.11
Identities = 29/112 (25%), Positives = 45/112 (40%), Gaps = 13/112 (11%)
Query: 1014 LNMDEKVIQAGMFDQKSTGSERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELT 1073
L DE + Q G E E + DD+E+E + D E+ ++ +E
Sbjct: 632 LKADENKSR---HQQLFEGEE--EDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDE 686
Query: 1074 LFQKMDAERRKGETKPR------LVEIKELPDWMVKDDDEVENSMYEEERER 1119
Q+ DAE +G L ++ LPD + D E E M +E+ R
Sbjct: 687 --QEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEEGMDDEQMMR 736
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 34.5 bits (80), Expect = 0.12
Identities = 19/70 (27%), Positives = 42/70 (60%), Gaps = 4/70 (5%)
Query: 320 RNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRL 379
++ R ++L + ++ + EKE+++ ++R EKE + + E+E R+ Q+ ++L
Sbjct: 61 KDTQLRAKRLMREMLLFWKKNEKEERELRKRAEKEALEQAKKEEEL--REAKRQQ--RKL 116
Query: 380 AFLLSQTDEY 389
FL++QT+ Y
Sbjct: 117 NFLITQTELY 126
>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
and repair].
Length = 1187
Score = 36.2 bits (84), Expect = 0.14
Identities = 30/107 (28%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 574 GLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSNWVLE-FEKWAPSAN------VVAY 626
G+GKT + + YL +K K + +IVP +TL V E +K+A A V
Sbjct: 107 GVGKTTFGLLMSLYLAKKGKRV---YIIVPTTTLVRQVYERLKKFAEDAGSLDVLVVYHS 163
Query: 627 KGSPLARRALQGRMKNSKFNVLLTTYEFIIKDKSVLSKIQWKCMIID 673
+ R+++ F++L+TT +F+ K LSK+++ + +D
Sbjct: 164 ALPTKEKEEALERIESGDFDILITTSQFLSKRFEELSKLKFDFIFVD 210
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 35.4 bits (81), Expect = 0.14
Identities = 25/118 (21%), Positives = 47/118 (39%), Gaps = 15/118 (12%)
Query: 1158 DYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEE 1217
D +E+ +E P + ++ E E+EE+E+ ++++E+E+E
Sbjct: 159 DNDEDNDEAPPAQP--DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEA 216
Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
+ S D D E + K++ + E EEEV E EE+ +
Sbjct: 217 EGS-------------DDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPS 261
Score = 34.2 bits (78), Expect = 0.41
Identities = 26/120 (21%), Positives = 55/120 (45%), Gaps = 3/120 (2%)
Query: 1157 DDYEEEEEEVK--PVTKKRGSGKKGKKSY-VEETAPEDEEEEKPVKKRRRRKKVQISDED 1213
D EE E+V V ++ + K + + ED +E P + ++ + +E
Sbjct: 126 SDVEEPSEKVDEEDVEREILAEKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEES 185
Query: 1214 EDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
+ E+E ++ S ++ ++E++EE + V E E +++ +GEEEE ++
Sbjct: 186 DGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEE 245
Score = 31.1 bits (70), Expect = 3.0
Identities = 27/122 (22%), Positives = 47/122 (38%), Gaps = 21/122 (17%)
Query: 404 QQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPA 463
+K EE +K +D+ D++ E R+T DN ++APPA
Sbjct: 123 LEKSDVEEPSEK------------VDEEDVEREILAEKPRVTRFNIVWDNDEDNDEAPPA 170
Query: 464 K-----QIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAG 518
+ + +E L G E+ EDE + E E ++ E+ D+ +D
Sbjct: 171 QPDVDNEEEERLEESDG----REEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVV 226
Query: 519 EE 520
+
Sbjct: 227 DY 228
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner membrane
complex of itself with TolQ and TolR to the outer
membrane complex of TolB and OprL (also called Pal). Most
of the length of the protein consists of low-complexity
sequence that may differ in both length and composition
from one species to another, complicating efforts to
discriminate TolA (the most divergent gene in the tol-pal
system) from paralogs such as TonB. Selection of members
of the seed alignment and criteria for setting scoring
cutoffs are based largely conserved operon struction.
//The Tol-Pal complex is required for maintaining outer
membrane integrity. Also involved in transport (uptake)
of colicins and filamentous DNA, and implicated in
pathogenesis. Transport is energized by the proton motive
force. TolA is an inner membrane protein that interacts
with periplasmic TolB and with outer membrane porins
ompC, phoE and lamB [Transport and binding proteins,
Other, Cellular processes, Pathogenesis].
Length = 346
Score = 35.6 bits (82), Expect = 0.15
Identities = 25/120 (20%), Positives = 54/120 (45%), Gaps = 8/120 (6%)
Query: 1158 DYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEE 1217
D ++ + +++ K K +E + E++ + +K+R ++ + + ++
Sbjct: 44 DPGAVAQQANRIQQQK---KPAAKK-EQERQKKLEQQAEEAEKQRAAEQARQKELEQRAA 99
Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEV---EEAEGEEEEEDDK 1274
E K+ ++ K A++K K+ +E K + K E E E EEA+ + EEE
Sbjct: 100 AEKAAKQAEQAAK-QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158
Score = 35.2 bits (81), Expect = 0.22
Identities = 32/165 (19%), Positives = 64/165 (38%), Gaps = 13/165 (7%)
Query: 1116 EREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGS 1175
+ E++ EEA R + Q +Q + ++ K E E++++ + K+
Sbjct: 72 KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE 131
Query: 1176 GKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKD 1235
K E E + +E+ K+ K + + E + + E+ KK + A+
Sbjct: 132 AKA-----KAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE-------AEA 179
Query: 1236 KHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
K K + + K + K K E + AE + E + A+
Sbjct: 180 KAKAEAKAKAKAEEAKAK-AEAAKAKAAAEAAAKAEAEAAAAAAA 223
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 35.7 bits (82), Expect = 0.16
Identities = 41/224 (18%), Positives = 90/224 (40%), Gaps = 29/224 (12%)
Query: 1057 DDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEE 1116
D V Q R + + + +K + +R+K + + E+K E+E
Sbjct: 56 DPGAVVQQYGRIQSQQSSAKKGEQQRKK-KEEQVAEELKP-------------KQAAEQE 101
Query: 1117 REREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG 1176
R ++ E+ + +Q++ + E +E ++EE+ K +++
Sbjct: 102 RLKQLEKERLKAQEQQKQA-------EEAEKQAQLEQK-----QQEEQARKAAAEQKKKA 149
Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
+ K E A E K++ + + ++E + + + + KKK ++ A +K
Sbjct: 150 EAAKAKAAAEAAKLKAAAEA---KKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEK 206
Query: 1237 HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
K + E K K+ + EE+ E+ + + + DKA+ K
Sbjct: 207 AKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAK 250
Score = 30.7 bits (69), Expect = 6.0
Identities = 32/181 (17%), Positives = 66/181 (36%), Gaps = 8/181 (4%)
Query: 341 EKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEH 400
E+ K ++ E+ER+++L E + + ++ ++ A L + E + ++
Sbjct: 88 VAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQE------EQARKA 141
Query: 401 KEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDA 460
EQ+KK K K + K + + E+ + + + K E
Sbjct: 142 AAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAK 201
Query: 461 PPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEE 520
A++ K + E E + +E + E K+K +KA + A
Sbjct: 202 AAAEKAKAEAEAKAKAEKKAEAAAEEKAAAE--KKKAAAKAKADKAAAAAKAAERKAAAA 259
Query: 521 A 521
A
Sbjct: 260 A 260
>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II
[Transcription / DNA replication, recombination, and
repair].
Length = 442
Score = 35.5 bits (82), Expect = 0.17
Identities = 51/278 (18%), Positives = 101/278 (36%), Gaps = 27/278 (9%)
Query: 539 SIMVNGKLKEYQIKGLEWMVSLYNNNLNGILADEMGLGKTVQTIALLTYLMEKKKNLGPY 598
+ +L+ YQ + L+ +V G++ G GKTV + L K++
Sbjct: 30 IVAFEFELRPYQEEALDALVKNRRTERRGVIVLPTGAGKTVVAAEAIAEL---KRST--- 83
Query: 599 LVIVPLSTL-SNWVLEFEKWA-PSANVVAYKGSPLARRALQGRMKNSKFNVLLTTYEFII 656
LV+VP L W +K+ + + Y G G + V + T + +
Sbjct: 84 LVLVPTKELLDQWAEALKKFLLLNDEIGIYGG---------GEKELEPAKVTVATVQTLA 134
Query: 657 KDK--SVLSKIQWKCMIIDEGHRMKNHHCKLTQTLNTFYNSSHRVLLTGTP---LQNKLP 711
+ + ++ +I DE H + + L + R+ LT TP ++
Sbjct: 135 RRQLLDEFLGNEFGLIIFDEVHHLPAP--SYRRILELLSAAYPRLGLTATPEREDGGRIG 192
Query: 712 ELWALLNFLLPSIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKVLRPFLLRR 771
+L+ L+ ++ + ++ + AP+ KV L E+E LLR
Sbjct: 193 DLFDLIGPIVYEVSLK-ELIDEGYLAPYKYVEIKVTLTEDE--EREYAKESARFRELLRA 249
Query: 772 LKKEVESQLPEKVEYIIKCDMSGLQKLLYQHLHNRGIL 809
++ + ++ ++ LL +H L
Sbjct: 250 RGTLRAENEARRIAIASERKIAAVRGLLLKHARGDKTL 287
>gnl|CDD|217972 pfam04224, DUF417, Protein of unknown function, DUF417. This
family of uncharacterized proteins appears to be
restricted to proteobacteria.
Length = 153
Score = 34.0 bits (78), Expect = 0.17
Identities = 24/87 (27%), Positives = 33/87 (37%), Gaps = 11/87 (12%)
Query: 545 KLKEYQIKGLE-------WMVSLYNNNLNGILADEMGLGKTVQTIALLTYL---MEKKKN 594
K Y+ + ++ M LY NG LG T + +L + K
Sbjct: 33 KFFPYEAEAIKPFVSNSPLMSFLYEFLKNGTYGVSYLLGITETIVGILLLIGLRNPKAGL 92
Query: 595 LGPYLVIV-PLSTLSNWVLEFEKWAPS 620
LG LV V + TLS + E W PS
Sbjct: 93 LGGLLVFVTFIVTLSFLITTPEAWVPS 119
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 35.7 bits (82), Expect = 0.18
Identities = 33/157 (21%), Positives = 69/157 (43%), Gaps = 11/157 (7%)
Query: 1117 REREKEE----ALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKK 1172
+ERE +E A + +KQ+D + ++ + D D+ ++++E K + K
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263
Query: 1173 -RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT 1231
S K K E E E+ + +K K + + + +D + K++ + S K
Sbjct: 264 ADTSSPKEDKQVAENQKREIEKAQIEIK-----KNDEEALKAKDHKAFDLKQESKASEKE 318
Query: 1232 PAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEE 1268
A+DK E +++ V++ +K + E + E+
Sbjct: 319 -AEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNED 354
Score = 30.7 bits (69), Expect = 5.8
Identities = 30/155 (19%), Positives = 67/155 (43%), Gaps = 13/155 (8%)
Query: 1139 TDSLTDQEWLKAI-EDDGAD--------DYEEEEEEV---KPVTKKRGSGKKGKKSYVEE 1186
TDS++D++ ++A+ ED+ D +E E + + K KK + +
Sbjct: 174 TDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQ 233
Query: 1187 TAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDE-EKH 1245
+ ++ ++ R+K Q + D S K+ ++ + ++ K E +K+
Sbjct: 234 QKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKN 293
Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
K K + + +E++ E+E +DK + +K
Sbjct: 294 DEEALKAKDHKAFDLKQESKASEKEAEDKELEAQK 328
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the protein
product of the dentatorubral-pallidoluysian atrophy
(DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the expansion
of a CAG repeat in the DRPLA gene on chromosome 12p. This
results in an extended polyglutamine region in
atrophin-1, that is thought to confer toxicity to the
protein, possibly through altering its interactions with
other proteins. The expansion of a CAG repeat is also the
underlying defect in six other neurodegenerative
disorders, including Huntington's disease. One
interaction of expanded polyglutamine repeats that is
thought to be pathogenic is that with the short glutamine
repeat in the transcriptional coactivator CREB binding
protein, CBP. This interaction draws CBP away from its
usual nuclear location to the expanded polyglutamine
repeat protein aggregates that are characteristic of the
polyglutamine neurodegenerative disorders. This
interferes with CBP-mediated transcription and causes
cytotoxicity.
Length = 979
Score = 35.8 bits (82), Expect = 0.19
Identities = 28/121 (23%), Positives = 45/121 (37%), Gaps = 27/121 (22%)
Query: 1171 KKRGSGKKGKKSYVEET-----------APEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
KK+ + G+ S E A + K ++ KK++ EE
Sbjct: 22 KKQTASPDGRASPTNEDQRSSGRNSPSAASTSSNDSKAESTKKPNKKIK-------EEAT 74
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV-----EEEEEVEEAEGEEEEEDDK 1274
SP K ++ + PA D +E + V +K+ K E E EGE E D +
Sbjct: 75 SPLKSTKRQREKPASDT----EEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSR 130
Query: 1275 A 1275
+
Sbjct: 131 S 131
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 35.7 bits (83), Expect = 0.20
Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 10/107 (9%)
Query: 574 GLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSNWVLE-FEKWAPSANVVAYKGSPLA 632
G+GKT + + YL +K K +I P L V+E EK+ +
Sbjct: 105 GVGKTTFGLVMSLYLAKKGKKS---YIIFPTRLLVEQVVEKLEKFGEKVGCGVKILYYHS 161
Query: 633 ------RRALQGRMKNSKFNVLLTTYEFIIKDKSVLSKIQWKCMIID 673
+ R+K F++L+TT +F+ K+ L K ++ + +D
Sbjct: 162 SLKKKEKEEFLERLKEGDFDILVTTSQFLSKNFDELPKKKFDFVFVD 208
>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
Length = 445
Score = 35.1 bits (81), Expect = 0.25
Identities = 24/98 (24%), Positives = 43/98 (43%), Gaps = 10/98 (10%)
Query: 1181 KSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKED 1240
Y + +P E E P KK+ RR ++S +E+EE+E + + P + +D
Sbjct: 10 SPYSSDDSPSLEPPELPPKKKGRR---RVSPVEEEEEEEEAEVVAVGFSYPPVRISRGKD 66
Query: 1241 DEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
KR + ++EE++ E+ E + S P
Sbjct: 67 -------GKRPVRPLKEEKDSEKKASTEAAVRNPLSDP 97
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal DNA
during the process of transcription.
Length = 193
Score = 33.9 bits (78), Expect = 0.29
Identities = 20/79 (25%), Positives = 37/79 (46%)
Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
D E ++K + K K++ VEE ++++++K VKK ++ KK +
Sbjct: 114 FPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173
Query: 1209 ISDEDEDEEDESPKKKKRK 1227
E + + KKKK+K
Sbjct: 174 KEKMVEPKGSKKKKKKKKK 192
Score = 33.2 bits (76), Expect = 0.53
Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 2/69 (2%)
Query: 1185 EETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
+ E E +K K + ++ +E+E+ E KKK+ K K KDK ++ E K
Sbjct: 124 PPSELGSESET--SEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPK 181
Query: 1245 HVVSKRKKK 1253
K+KKK
Sbjct: 182 GSKKKKKKK 190
Score = 29.7 bits (67), Expect = 6.8
Identities = 28/100 (28%), Positives = 39/100 (39%), Gaps = 24/100 (24%)
Query: 1154 DGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDED 1213
GA D E ++ S K+ +E E+EE KK +++KK
Sbjct: 118 YGAPDGPPSE----LGSESETSEKETTAKVEKEAEVEEEE-----KKEKKKKK------- 161
Query: 1214 EDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
E K+KK K K K+K E K K+KKK
Sbjct: 162 -----EVKKEKKEKKDK---KEKMVEPKGSKKKKKKKKKK 193
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region [General
function prediction only].
Length = 303
Score = 34.6 bits (79), Expect = 0.29
Identities = 23/119 (19%), Positives = 45/119 (37%), Gaps = 14/119 (11%)
Query: 1152 EDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISD 1211
E + + EEE + +K KK +E+ D+ E + + + SD
Sbjct: 199 EKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSESE-SD 257
Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
EDEDE+++ +K++ ++ H + E+E E E+ +
Sbjct: 258 EDEDEDNKGKIRKRKTDDAKKSRKPHIHIE-------------YEQERENEKIPAVQHS 303
Score = 32.7 bits (74), Expect = 1.1
Identities = 23/106 (21%), Positives = 48/106 (45%), Gaps = 2/106 (1%)
Query: 1142 LTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEE-TAPEDEEEEKPVKK 1200
T++E + ++ ++ EE + E++ VT +K KK +E+ + E ++
Sbjct: 190 ATEREKRQDEKERYVEEEEESDTELEAVTDD-SEKEKTKKKDLEKWLGSDQSMETSESEE 248
Query: 1201 RRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHV 1246
+ DEDED + + K+K + K+ H E ++E+
Sbjct: 249 EESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQEREN 294
Score = 32.7 bits (74), Expect = 1.3
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 1/101 (0%)
Query: 1181 KSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKED 1240
KS + P + E K R+K Q E EE+E + T K+K K+
Sbjct: 170 KSGIYGDTPLNVREHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKK 229
Query: 1241 DEEKHVVSKRKKKVVE-EEEEVEEAEGEEEEEDDKASKPRK 1280
D EK + S + + E EEEE E+E +E+E++D K RK
Sbjct: 230 DLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRK 270
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 34.7 bits (80), Expect = 0.30
Identities = 39/189 (20%), Positives = 64/189 (33%), Gaps = 15/189 (7%)
Query: 1058 DETVNQLIARTEDELTLFQKMDA----------ERRKGETKPRLVEIKELPDWMVKDDDE 1107
D+ + R E E DA +R+ E RL E+KEL + D
Sbjct: 111 DKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQS---RRADV 167
Query: 1108 VENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVK 1167
NSM E REK+E + + +E + +D+ D ++EE+
Sbjct: 168 DVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDE--DSEDDEEDNDN 225
Query: 1168 PVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRK 1227
+ K GS K + + + + E K ++ + D KK +
Sbjct: 226 TPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPE 285
Query: 1228 STKTPAKDK 1236
ST
Sbjct: 286 STSQSPSSA 294
Score = 34.7 bits (80), Expect = 0.30
Identities = 32/151 (21%), Positives = 57/151 (37%), Gaps = 15/151 (9%)
Query: 1103 KDDDEVENSMYEEEREREKEEA---LMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDY 1159
K D+E E + +E E +A L R + +++++ + L + + L++ D +
Sbjct: 112 KLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADVDVNS 171
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
E KK EE EDE K + ++ + +DED ED+
Sbjct: 172 MLE--------ALFRREKK----EEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDD 219
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR 1250
S K+ + K K +KR
Sbjct: 220 EEDNDNTPSPKSGSSSPAKPTSILKKSAAKR 250
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 34.7 bits (80), Expect = 0.33
Identities = 17/63 (26%), Positives = 24/63 (38%), Gaps = 10/63 (15%)
Query: 1176 GKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKD 1235
G+KGK S + RRR + D+ +D EDE P +K K +
Sbjct: 10 GRKGKPSRPVKQK----------VSRRRYRDDDDYDDYDDYEDEEPMPRKGKGKGRKPRG 59
Query: 1236 KHK 1238
K
Sbjct: 60 KRG 62
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK motif.
Length = 189
Score = 33.9 bits (78), Expect = 0.33
Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 4/84 (4%)
Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSK 1249
E EEEE+ +K+ K+ E+E+ E+E KKK+ + K + KE +E K
Sbjct: 22 EAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE----YEK 77
Query: 1250 RKKKVVEEEEEVEEAEGEEEEEDD 1273
K V EEE ++ +EE +
Sbjct: 78 LKSSFVVEEEGTDKLSADEESNEL 101
Score = 30.1 bits (68), Expect = 5.7
Identities = 22/93 (23%), Positives = 48/93 (51%), Gaps = 12/93 (12%)
Query: 1193 EEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKK 1252
EEK ++++R + + +E + E++ ++K + ++K KE++E K +R++
Sbjct: 11 LEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERK----EREE 66
Query: 1253 KVVEEEEEVE--------EAEGEEEEEDDKASK 1277
+ +E+EE E E EG ++ D+ S
Sbjct: 67 QARKEQEEYEKLKSSFVVEEEGTDKLSADEESN 99
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 35.0 bits (81), Expect = 0.34
Identities = 19/139 (13%), Positives = 50/139 (35%), Gaps = 5/139 (3%)
Query: 1144 DQEWLKAIEDDG-----ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPV 1198
+ I DG +D +E + +++ + PE+ E
Sbjct: 783 KILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEE 842
Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
+K +++ E ++ E+E + K+ K+ K+ ++ K V+ +
Sbjct: 843 QKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSD 902
Query: 1259 EEVEEAEGEEEEEDDKASK 1277
E ++ + ++++ D
Sbjct: 903 EIKQDEKTTKKKKKDLEKT 921
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 34.0 bits (78), Expect = 0.38
Identities = 21/90 (23%), Positives = 50/90 (55%), Gaps = 17/90 (18%)
Query: 328 KLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTD 387
K+ + I ++ A+K +++++ R KER RL+AE E + +D + +
Sbjct: 132 KMPQMIADWR--AQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPR---------- 179
Query: 388 EYISNLTQMVKEHKEEQQKKMREEKKKKKK 417
+M+++ ++E++KK++E K+++K+
Sbjct: 180 -----FQEMLQQKEKEEKKKVKEAKRREKE 204
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus spores
are protected by a protein shell consisting of over 50
different polypeptides, known as the coat. This family of
proteins has an important morphogenetic role in coat
assembly, it is involved in the assembly of at least 5
different coat proteins including CotB, CotG, CotS, CotSA
and CotW. It is likely to act at a late stage of coat
assembly.
Length = 185
Score = 33.3 bits (76), Expect = 0.43
Identities = 19/82 (23%), Positives = 35/82 (42%), Gaps = 1/82 (1%)
Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
K + ++ + SDE ++E K + ++ K + E ++ K + EE+
Sbjct: 35 KADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEK-EAGEPEREDIAEQQEKEEIAQEEEK 93
Query: 1259 EEVEEAEGEEEEEDDKASKPRK 1280
EE E ++E K KP K
Sbjct: 94 EEEAEDVKQQEVFSFKRKKPFK 115
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 34.7 bits (80), Expect = 0.46
Identities = 39/196 (19%), Positives = 90/196 (45%), Gaps = 25/196 (12%)
Query: 342 KEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFL----------LSQTDEYIS 391
+E +K+ E++E++ AE E Y++L + ++ LA L L + +E +S
Sbjct: 196 EELEKQLEKLERQ------AEKAERYQELKAELRELELALLLAKLKELRKELEELEEELS 249
Query: 392 NLTQMVKEHKEEQQKKMRE--EKKKKKKSIKQKILDGNMDKLDLDESSQLEDL-----RI 444
L + ++E +EE ++ +E E K + + +++++ + + + L+ ++E+L +
Sbjct: 250 RLEEELEELQEELEEAEKEIEELKSELEELREELEE--LQEELLELKEEIEELEGEISLL 307
Query: 445 TVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAV 504
+ +N+L + + ++ A E E+ E++E KEE + +
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367
Query: 505 GEKKTEDDEYKDAGEE 520
E +E +A E
Sbjct: 368 SALLEELEELFEALRE 383
Score = 33.9 bits (78), Expect = 0.72
Identities = 42/226 (18%), Positives = 91/226 (40%), Gaps = 10/226 (4%)
Query: 301 ERILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLM 360
E L A+ K E E +R ++ + + AEKE ++ + +E+ R
Sbjct: 226 ELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285
Query: 361 AEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNL-----TQMVKEHKEEQQKKMREEKKKK 415
++E K ++ + ++ L + +E + L + K E K+ EE++
Sbjct: 286 LQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345
Query: 416 KKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPG 475
+ ++Q + + +++ + +L L ++E LR E A ++ E
Sbjct: 346 LEELEQLLAE--LEEAKEELEEKLSALLEELEELFEA--LREELAELEAELAEIRNELEE 401
Query: 476 WEVVMEDSEDEGSE-DEEMKEKKEEPSKAVGEKKTEDDEYKDAGEE 520
+ +E E+ E +++ KEE + E + E ++ EE
Sbjct: 402 LKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEE 447
Score = 31.2 bits (71), Expect = 4.3
Identities = 53/281 (18%), Positives = 107/281 (38%), Gaps = 27/281 (9%)
Query: 997 ITVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERHEFLQTILHQDDEEEEENEVP 1056
IT S +R A + +L E+ + + E + +D EE
Sbjct: 652 ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711
Query: 1057 DDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEE 1116
++ + E E + + RL E++E + + ++ +E++ E
Sbjct: 712 EELERQLEELKRELAA-------LEEELEQLQSRLEELEEELEELEEELEELQ------E 758
Query: 1117 REREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG 1176
R E EE L S + + + + E + + ++ EEE EE +
Sbjct: 759 RLEELEEEL---ESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815
Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
+ + E E EE ++++ ++ +EE E +K+ + K++
Sbjct: 816 LESLEQRRERLEQEIEE------LEEEIEELEEKLDELEEELEELEKELEE-----LKEE 864
Query: 1237 HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
+E + EK + K++ EE+EE+EE E E E + +
Sbjct: 865 LEELEAEKEELEDELKELEEEKEELEEELRELESELAELKE 905
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 34.2 bits (78), Expect = 0.48
Identities = 36/178 (20%), Positives = 68/178 (38%), Gaps = 14/178 (7%)
Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEE 1163
DE + E R+ E E G R+ Y + ED + + EE
Sbjct: 81 LSDEWMQIITEALRQAENEP-----GGHPRENQPYNLQVEKNFPTDHSED--YETQQWEE 133
Query: 1164 EEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKK 1223
E++K + R ++ + + T E P + + + E + + K
Sbjct: 134 EKLKHMKFPRRYEDNSEEKHSKRTNEIVEVFYNPQSQATLKSVFE-----EVGKLDGQSK 188
Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV--EEEEEVEEAEGEEEEEDDKASKPR 1279
+KR+ + K ++D+ + + VV E+ +EE E ++ +E+ K SK R
Sbjct: 189 QKRERSDEREKSYQDDEDDTYRQNNIPYEDVVGGEDWNPIEEGEEDQTQEEVKRSKER 246
Score = 30.7 bits (69), Expect = 6.2
Identities = 23/129 (17%), Positives = 48/129 (37%), Gaps = 5/129 (3%)
Query: 1153 DDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE 1212
D+ Y+++E++ G + + E+++ ++ VK+ + R S
Sbjct: 195 DEREKSYQDDEDDTYRQNNIPYEDVVGGEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLP 254
Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKED-----DEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
DE + + + K + V+ R + + E E +E
Sbjct: 255 DESKRSGQLGLEDEAEEEKEEKGQESRGLSAVQTYLLRWVNARGRGRSQNRAERERSEES 314
Query: 1268 EEEEDDKAS 1276
EE+E D+AS
Sbjct: 315 EEKELDRAS 323
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 32.4 bits (74), Expect = 0.49
Identities = 11/30 (36%), Positives = 24/30 (80%)
Query: 393 LTQMVKEHKEEQQKKMREEKKKKKKSIKQK 422
+ Q++KE +++++K +++KKKKKK+ K+
Sbjct: 101 VKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130
>gnl|CDD|224125 COG1204, COG1204, Superfamily II helicase [General function
prediction only].
Length = 766
Score = 34.3 bits (79), Expect = 0.50
Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 19/111 (17%)
Query: 574 GLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSN-WVLEFEKWAP-SANVVAYKG-SP 630
G GKT+ IALL L + G + IVPL L+ EF + V G
Sbjct: 57 GSGKTL--IALLAILSTLLEGGGKVVYIVPLKALAEEKYEEFSRLEELGIRVGISTGDYD 114
Query: 631 LARRALQGRMKNSKFNVLLTTYEFIIKDKSVLSK----IQW-KCMIIDEGH 676
L L ++++V++TT E K S+ K I+ ++IDE H
Sbjct: 115 LDDERL------ARYDVIVTTPE---KLDSLTRKRPSWIEEVDLVVIDEIH 156
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone sialoprotein
(BSP) is a major structural protein of the bone matrix
that is specifically expressed by fully-differentiated
osteoblasts. The expression of bone sialoprotein (BSP) is
normally restricted to mineralised connective tissues of
bones and teeth where it has been associated with mineral
crystal formation. However, it has been found that
ectopic expression of BSP occurs in various lesions,
including oral and extraoral carcinomas, in which it has
been associated with the formation of microcrystalline
deposits and the metastasis of cancer cells to bone.
Length = 291
Score = 33.5 bits (76), Expect = 0.55
Identities = 30/118 (25%), Positives = 46/118 (38%), Gaps = 19/118 (16%)
Query: 1172 KRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT 1231
KR S + G S E + EEE + + + SD +EDEE E+ T
Sbjct: 40 KRFSVQSGSDSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAENTTLSTVTLG 99
Query: 1232 PAKD----------------KHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
D K + +K + ++ + E+EEE EE E EE E ++
Sbjct: 100 YGGDATPGTGNIGLAALQLPKKAGNAGKK---ATKEDESDEDEEEEEEEEEEEAEVEE 154
Score = 32.0 bits (72), Expect = 2.1
Identities = 24/97 (24%), Positives = 41/97 (42%), Gaps = 23/97 (23%)
Query: 1173 RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTP 1232
+ +G GKK+ E+ + EDEEEE+ ++ + + +E ++ ST+
Sbjct: 120 KKAGNAGKKATKEDESDEDEEEEEEEEE---------EEAEVEENEQGTNGTSTNSTEVD 170
Query: 1233 AKDKHK--EDDEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
+ ++ EE EEE V EAE E
Sbjct: 171 HGNGSSGGDNGEE------------GEEESVTEAEAE 195
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 34.1 bits (78), Expect = 0.58
Identities = 23/146 (15%), Positives = 52/146 (35%), Gaps = 21/146 (14%)
Query: 1132 QRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPED 1191
QRK +D D +D+ YE++E++ T S + + +
Sbjct: 82 QRKSLDDDD-----------DDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDA 130
Query: 1192 EEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKK-RKSTKTPAKDKHKEDDEEKHVVSKR 1250
+ + Q +D D+D+ S + ++S + +K E+ + + + +
Sbjct: 131 DTQAN---------NDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFK 181
Query: 1251 KKKVVEEEEEVEEAEGEEEEEDDKAS 1276
+ EE +G + + D
Sbjct: 182 FYNFGNDGEEAAAKDGGKSKSSDPGP 207
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 34.0 bits (78), Expect = 0.61
Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 13/85 (15%)
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
+ ++E V P+ KK S K + EE ++ KK +RKK D D++ +
Sbjct: 173 DSDQEAVLPLVKKSPSEAK---------EEKAEERKQESKKGAKRKK----DASGDDKSK 219
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEK 1244
K + ST T A K D E K
Sbjct: 220 KAKTDRDVSTSTAASQKKSSDLESK 244
Score = 31.7 bits (72), Expect = 2.9
Identities = 25/119 (21%), Positives = 50/119 (42%), Gaps = 16/119 (13%)
Query: 1140 DSLT--DQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKP 1197
D+L+ DQE + + + +EE+ E + K+G+ +K +D +
Sbjct: 169 DTLSDSDQEAVLPLVKKSPSEAKEEKAEERKQESKKGAKRK-----------KDASGDDK 217
Query: 1198 VKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE 1256
KK + + V S ++ + K +K K DD +KHV + ++++E
Sbjct: 218 SKKAKTDRDVSTSTAASQKKSSDLESKLEAQSKELWSLK---DDLKKHVSTAELREMLE 273
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 33.2 bits (76), Expect = 0.61
Identities = 19/87 (21%), Positives = 33/87 (37%), Gaps = 12/87 (13%)
Query: 1202 RRRKKVQISDEDEDEED--------ESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
R + D+ +D SPK+ K+K TP K +K ++K
Sbjct: 21 RAHVRSTTESVDKVRDDIFPEDIEIPSPKQPKKKRPTTPRKPATT----KKSKKKDKEKL 76
Query: 1254 VVEEEEEVEEAEGEEEEEDDKASKPRK 1280
EE++ + + EE E+D +
Sbjct: 77 TEEEKKPESDDDKTEENENDPDNNEES 103
Score = 30.1 bits (68), Expect = 5.8
Identities = 17/94 (18%), Positives = 32/94 (34%), Gaps = 9/94 (9%)
Query: 1186 ETAPEDEEE------EKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKE 1239
E+ + ++ E P K+ ++K+ + KKK K T + K +
Sbjct: 29 ESVDKVRDDIFPEDIEIPSPKQPKKKRPTTPRK---PATTKKSKKKDKEKLTEEEKKPES 85
Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
DD++ E + E A + D
Sbjct: 86 DDDKTEENENDPDNNEESGDSQESASANSLSDID 119
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 33.3 bits (76), Expect = 0.69
Identities = 25/131 (19%), Positives = 60/131 (45%), Gaps = 5/131 (3%)
Query: 1152 EDDGADDYEEEEE----EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKV 1207
+ + A + EEEEE E + ++ +G++ + + E+ E+P K+
Sbjct: 143 DAEPAREDEEEEESFSAEFEHPAQEETAGEEERTDEPKVEHEAHEQHEQPADDDPDEWKI 202
Query: 1208 QISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
S+ + E + ++ + + + + +DEEK + ++ + +E+ +A+
Sbjct: 203 SASEPFQLESEVEASPEEENYEEYEDETELEVEDEEK-ALDEQTEDPQQEDALAGDAKKA 261
Query: 1268 EEEEDDKASKP 1278
EEE++K +P
Sbjct: 262 LEEEEEKGERP 272
Score = 32.2 bits (73), Expect = 1.8
Identities = 33/132 (25%), Positives = 55/132 (41%), Gaps = 15/132 (11%)
Query: 1158 DYEEEEEEVKPVTKKRG-SGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
DYE E +T SG + + + E A EDEEEE+ + E+
Sbjct: 116 DYELLENGCLQLTADISISGIQQENNLDAEPAREDEEEEESFSAEFE----HPAQEETAG 171
Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVS-------KRKKKVVEEEEEVEEAEGE-- 1267
E+E + K + ++ +DD ++ +S + + + EEE EE E E
Sbjct: 172 EEERTDEPKVEHEAHEQHEQPADDDPDEWKISASEPFQLESEVEASPEEENYEEYEDETE 231
Query: 1268 -EEEEDDKASKP 1278
E E+++KA
Sbjct: 232 LEVEDEEKALDE 243
>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive type)
[DNA replication, recombination, and repair].
Length = 1444
Score = 33.8 bits (78), Expect = 0.71
Identities = 33/128 (25%), Positives = 54/128 (42%), Gaps = 16/128 (12%)
Query: 1039 LQTILHQDDEEEEENEVPDDETVNQLI-ARTEDELTLFQKMDAERRKGE-TKPRLVEIKE 1096
L +D EE+ +E +N+ + ++ L +K+ AE K E KP K
Sbjct: 161 LLIEFEVNDISEEQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKG 220
Query: 1097 LPDWMVKDDDEVE--NSMYEEER---------EREKEEALMGRGSRQRKQVDYTDSLTDQ 1145
+K +E++ + EEE + E +E GR K DYT SL +
Sbjct: 221 RK---IKSTEEIKPLIKINEEETRVKVEGYIFKIEIKELKSGRTLLNIKVTDYTSSLILK 277
Query: 1146 EWLKAIED 1153
++L+ ED
Sbjct: 278 KFLRDEED 285
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin binding
protein sp32 sequences. sp32 is a sperm specific protein
which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved in
packaging the acrosin zymogen into the acrosomal matrix.
Length = 243
Score = 33.1 bits (75), Expect = 0.73
Identities = 17/56 (30%), Positives = 26/56 (46%)
Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
S + K K + K +E +E K+ ++ EEEE EEA+ EE + D
Sbjct: 187 GGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTD 242
Score = 30.4 bits (68), Expect = 4.7
Identities = 21/119 (17%), Positives = 43/119 (36%), Gaps = 18/119 (15%)
Query: 1161 EEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEE----EEKPVKKRRRRKKVQISDEDEDE 1216
E EV+P T + + + P E E+ ++
Sbjct: 139 EPSAEVQPTTMTLPIAEHPTITENQSFQPWPERLHNNVEELLQSSLS------LGGSVQV 192
Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
+ PK+++ S ++ ++H +++ + +EEEEVEE +EE +
Sbjct: 193 KAPKPKQEQLLS--------KLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 33.8 bits (78), Expect = 0.78
Identities = 14/88 (15%), Positives = 32/88 (36%), Gaps = 4/88 (4%)
Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT--PAKDKHKEDDEEKHVV 1247
E E+ + + R ++ + +D+D + + K K P K + V+
Sbjct: 463 EREKAAR--EARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVI 520
Query: 1248 SKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
+ R+ + + E + + KA
Sbjct: 521 AAREARKAQARARQAEKQAAAAADPKKA 548
Score = 32.2 bits (74), Expect = 1.9
Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 20/181 (11%)
Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
E+ RE + + +L + KA + V R
Sbjct: 464 REKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAR 523
Query: 1174 GSGKKGKKSYVEETAPEDEEEEK------PVKKRRRRKKVQISDEDEDEEDESPKK---- 1223
+ K ++ E + K + + + +K Q + E EE+ PKK
Sbjct: 524 EARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVA 583
Query: 1224 ------KKRKSTKTPAKD----KHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
K +K+ + A + E D +K V+ + ++ E + EE D
Sbjct: 584 AAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDP 643
Query: 1274 K 1274
+
Sbjct: 644 R 644
Score = 32.2 bits (74), Expect = 1.9
Identities = 19/109 (17%), Positives = 40/109 (36%), Gaps = 15/109 (13%)
Query: 1184 VEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKK----------KKRKSTKTPA 1233
++ A D ++ R+ + E + PKK K +K+ + A
Sbjct: 508 IKAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAA 567
Query: 1234 KDKHKEDDEEKHV-----VSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
+ +E+ + K +++ K K ++ E E + E D K +
Sbjct: 568 NAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAA 616
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This family
includes archaebacterial L12, eukaryotic P0, P1 and P2.
Length = 88
Score = 31.1 bits (71), Expect = 0.78
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 1256 EEEEEVEEAEGEEEEEDD 1273
EEE +E E EEEE+DD
Sbjct: 65 AAEEEKKEEEEEEEEDDD 82
Score = 28.4 bits (64), Expect = 5.3
Identities = 10/18 (55%), Positives = 12/18 (66%)
Query: 1256 EEEEEVEEAEGEEEEEDD 1273
EE ++ E EEEEEDD
Sbjct: 64 AAAEEEKKEEEEEEEEDD 81
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and 387
amino acids in length.
Length = 213
Score = 32.7 bits (75), Expect = 0.78
Identities = 19/66 (28%), Positives = 24/66 (36%), Gaps = 14/66 (21%)
Query: 1135 QVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE 1194
QVD D L +E++ A V V +G E PE E E
Sbjct: 104 QVDEADLLA------LMEEEDAAAPTCSCCPVCSVNMTECTGP--------EPEPEPEPE 149
Query: 1195 EKPVKK 1200
E+P KK
Sbjct: 150 EEPEKK 155
Score = 30.8 bits (70), Expect = 3.5
Identities = 18/83 (21%), Positives = 25/83 (30%), Gaps = 34/83 (40%)
Query: 1160 EEEEEEVKPVTKKRGSG---------------------KKGKKSYVEETAPEDEEEEKPV 1198
E E E + KK G G K K+ + +E +
Sbjct: 143 EPEPEPEEEPEKKSGMGPLLLVLAVALIGGGAYYYFKFYKPKQQEKGAPDDDLDEYDYG- 201
Query: 1199 KKRRRRKKVQISDEDEDEEDESP 1221
DEDE+E+DE P
Sbjct: 202 ------------DEDEEEDDEPP 212
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 32.4 bits (74), Expect = 0.82
Identities = 17/59 (28%), Positives = 22/59 (37%), Gaps = 4/59 (6%)
Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE----EEEEVEEAEGEEEEEDDKASKP 1278
KK K +K K + EK V R + V E E V E EE++ P
Sbjct: 117 KKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAP 175
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is a
family of uncharacterized proteins.
Length = 449
Score = 33.4 bits (76), Expect = 0.83
Identities = 57/272 (20%), Positives = 90/272 (33%), Gaps = 50/272 (18%)
Query: 1054 EVPDDETVNQLIARTEDELTLFQKMDAE---RRKGETKPRLVEIKELPDWMVKDDDEVEN 1110
E+ DD+ V + E +LTL M E R+ G+ +E D DDD+ E+
Sbjct: 9 ELEDDDWVRGSLDYKE-KLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDED 67
Query: 1111 SMYEEEREREKEEALMGRGSRQRKQVDYTDSL-TDQEWLKAIEDDGADDYEEEEEEV--- 1166
+++ + E +E S D TD E A DD +DD E
Sbjct: 68 EDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSEYVFWAPGT 127
Query: 1167 --------KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS-------- 1210
K T +R S ++ S + + + +R K +I
Sbjct: 128 TTAATSPRKLETMRRKSRRRTSDSSADSLNERKQRRKWKRPRRSPIKPPKIRPGTPELPD 187
Query: 1211 ---------DEDEDEEDE--SPKKKKRKSTK----------TPAKDKHKEDDEEKHVVSK 1249
DED E S + +R S + P D E+DE
Sbjct: 188 STDFVCGTLDEDRPLEAAYKSCMEARRLSKQVVIPQDIDPSFPTSDPEDEEDEL-----D 242
Query: 1250 RKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
++V+E ++ E + PR S
Sbjct: 243 DVEEVIESDDHFFLDLDGERGRRRRKRSPRTS 274
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 33.1 bits (76), Expect = 0.84
Identities = 19/87 (21%), Positives = 32/87 (36%), Gaps = 11/87 (12%)
Query: 1206 KVQISDEDEDEE-------DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR----KKKV 1254
V ++ +E+ E +E KKKK+K K K+ E V S
Sbjct: 159 VVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGA 218
Query: 1255 VEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+ E + + D++ S+ KS
Sbjct: 219 GQVGEAKKLKKKRSIAPDNEKSEVYKS 245
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 32.8 bits (75), Expect = 0.86
Identities = 20/90 (22%), Positives = 27/90 (30%), Gaps = 15/90 (16%)
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE-----KPVKKRRRRKKVQISDEDE 1214
E EE +P + K ET PE + KP KK ++ K
Sbjct: 50 EAPTEEPQPEPEPPEEQPKPPT--EPETPPEPTPPKPKEKPKPEKKPKKPKPKPK----- 102
Query: 1215 DEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
+ K K K P K K +
Sbjct: 103 ---PKPKPKPKVKPQPKPKKPPSKTAAKAP 129
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein. This
family represents Cwf15/Cwc15 (from Schizosaccharomyces
pombe and Saccharomyces cerevisiae respectively) and
their homologues. The function of these proteins is
unknown, but they form part of the spliceosome and are
thus thought to be involved in mRNA splicing.
Length = 241
Score = 32.8 bits (75), Expect = 0.87
Identities = 28/116 (24%), Positives = 47/116 (40%), Gaps = 20/116 (17%)
Query: 1170 TKKRGSGKKGKKSYVE-----------ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEED 1218
++ KK K +E ED++EE +K++R + + SD D+ +
Sbjct: 66 AERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDSS 125
Query: 1219 ESPKKKKRKSTKTPAKDKHKEDDEEKHV---VSKRKKKVVEEEEEVEEAEGEEEEE 1271
S D +DE + + K KK+ EE+E EE + EEE+
Sbjct: 126 SDSDSSDDDS------DDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEK 175
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 33.0 bits (76), Expect = 0.91
Identities = 14/90 (15%), Positives = 34/90 (37%), Gaps = 13/90 (14%)
Query: 341 EKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEH 400
KE+ E ER + E E ++ + + E++ L + ++
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQE-----RSYQEHVKQLIEKMEAE 255
Query: 401 KEEQQ--------KKMREEKKKKKKSIKQK 422
+E+ K++E+++ K+ K +
Sbjct: 256 REKLLAEQERMLEHKLQEQEELLKEGFKTE 285
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 33.6 bits (76), Expect = 0.96
Identities = 45/241 (18%), Positives = 95/241 (39%), Gaps = 12/241 (4%)
Query: 1034 ERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVE 1093
E +F ++E+E +NE ++ +Q +E L + + ++ ++ R
Sbjct: 442 EDVDFTGKKGAINNEDESDNEEVAFDSDSQF-DESEGNLRWKEGLASKLAYSQSGKR--- 497
Query: 1094 IKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIED 1153
++ E+ EE E K E+ S Q + D + A E
Sbjct: 498 -----GRNIQKIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSK--VANES 550
Query: 1154 DGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEE-TAPEDEEEEKPVKKRRRRKKVQISDE 1212
++ + E E + + KK S + K + ++ T E EE+ ++ + + DE
Sbjct: 551 ISSNHEKLMESEFEELKKKWSSLAQLKSRFQKDATLDSIEGEEELIQDDEKGNFEDLEDE 610
Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
+ ++E + + T + + D E + + RKK+ + E+EE E+++
Sbjct: 611 ENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDV 670
Query: 1273 D 1273
D
Sbjct: 671 D 671
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 32.9 bits (76), Expect = 0.97
Identities = 18/41 (43%), Positives = 23/41 (56%)
Query: 1233 AKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
DK D+E K V+S + + EEEE EE E EEEE +
Sbjct: 278 LADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318
>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen. The
hepatitis delta virus (HDV) encodes a single protein, the
hepatitis delta antigen (HDAg). The central region of
this protein has been shown to bind RNA. Several
interactions are also mediated by a coiled-coil region at
the N terminus of the protein.
Length = 194
Score = 32.5 bits (74), Expect = 1.0
Identities = 25/98 (25%), Positives = 41/98 (41%), Gaps = 20/98 (20%)
Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSK 1249
+D E P K+ R D+ E D P K+ T D+ + D + +
Sbjct: 62 KDGEGAPPAKRPR---------TDQMEVDSGPGKRPHAGGFT---DQERRDHRRRKALEN 109
Query: 1250 RKKKVV--------EEEEEVEEAEGEEEEEDDKASKPR 1279
+KK++ EEEEE+ E+EE + + + PR
Sbjct: 110 KKKQLSSGGKHLSREEEEELRRLTEEDEERERRVAGPR 147
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 33.1 bits (75), Expect = 1.0
Identities = 44/215 (20%), Positives = 74/215 (34%), Gaps = 27/215 (12%)
Query: 1082 RRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEE-REREKEEALMGRGSRQRKQVDYTD 1140
R G RL I + D++ + + N YE+ RE + A Q +
Sbjct: 274 RGIGFNANRLKHIPDFGDFLS--NKIIVNGQYEQTIREIFADRATKLELDLQTVFESNMN 331
Query: 1141 SLTDQEWLKAIEDDGADDYEEEEEEV--------------KPVTKKRGSGKKGKKSYVEE 1186
T E+ ED +D E+ E + + T K+ + KG Y +
Sbjct: 332 RDTLDEYAPEGEDLRSDYDEDFEYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFY-QA 390
Query: 1187 TAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHV 1246
EDEEEE I D+D E D + ++ DD E
Sbjct: 391 KWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNE---------EVAGDEESAIDDNEGFE 441
Query: 1247 VSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+++ + E + + + E D+ +P +S
Sbjct: 442 ELSPEEEERQLREFRDMEKEDREFPDEAELQPSES 476
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 32.6 bits (74), Expect = 1.0
Identities = 20/115 (17%), Positives = 47/115 (40%), Gaps = 7/115 (6%)
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKS---YVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
+E + + S + + V E + E+ ++ ++ + + E E+E
Sbjct: 128 KEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENE 187
Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
+ + P +K ++ ++ DEE + V K +EE+ + EE++E
Sbjct: 188 DSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSK----AIKEELAKEPVEEQQE 238
Score = 32.6 bits (74), Expect = 1.2
Identities = 22/111 (19%), Positives = 49/111 (44%), Gaps = 8/111 (7%)
Query: 1172 KRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE--DEDEEDESPKKKKRKST 1229
K S + E +EE + V+K +++ + ++ ++ E+ K+++
Sbjct: 68 KEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPG 127
Query: 1230 KTPAKDKHK------EDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDK 1274
K P D+ K DD ++ + + + + E+ VEE E+ ++D K
Sbjct: 128 KEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMK 178
Score = 31.8 bits (72), Expect = 1.7
Identities = 15/85 (17%), Positives = 37/85 (43%), Gaps = 1/85 (1%)
Query: 1194 EEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
K + ++ +EE + +K ++ + K++ + EK VVSK+++
Sbjct: 66 GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQS-LLEKIVVSKQEED 124
Query: 1254 VVEEEEEVEEAEGEEEEEDDKASKP 1278
+E +++E + E+ D +
Sbjct: 125 GPGKEPQLDEDKFLLAEDSDDRQET 149
Score = 31.8 bits (72), Expect = 1.8
Identities = 22/129 (17%), Positives = 52/129 (40%), Gaps = 2/129 (1%)
Query: 1150 AIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQI 1209
A+ EE +E+++ + +K K+ + +E ++++ R++ ++
Sbjct: 93 AVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEA 152
Query: 1210 SDEDEDEEDESPKKK-KRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEE 1268
E+ ED ++ + K K+K E + E +K + E + E +
Sbjct: 153 GKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPV-EKAERTKAETDDVTEEDY 211
Query: 1269 EEEDDKASK 1277
+EED+
Sbjct: 212 DEEDNPVED 220
Score = 31.8 bits (72), Expect = 2.1
Identities = 20/122 (16%), Positives = 41/122 (33%), Gaps = 5/122 (4%)
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE---EKPVKKRRRRKKVQISDEDEDE 1216
E+ V ++ G GK+ + + ED ++ K + E+
Sbjct: 111 SLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETAS 170
Query: 1217 EDESPKKKKRKSTKTPAKDKH--KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDK 1274
E K++ S + K ++ + K ++ +EE+ E +EE K
Sbjct: 171 EQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAK 230
Query: 1275 AS 1276
Sbjct: 231 EP 232
Score = 31.4 bits (71), Expect = 2.4
Identities = 33/141 (23%), Positives = 64/141 (45%), Gaps = 9/141 (6%)
Query: 401 KEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDA 460
+ ++K+ EE K++ +S+ +KI+ ++ + QL++ + + E + D R E
Sbjct: 94 VRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDD---RQETL 150
Query: 461 PPAKQIKEWLASHPGWEVVME----DSEDEGSEDEEMKEKKEEPSKAVGEKKTEDD-EYK 515
K +E S+ E E D +++ SE +E ++ KE KA K DD +
Sbjct: 151 EAGKVHEETEDSYHVEETASEQYKQDMKEKASE-QENEDSKEPVEKAERTKAETDDVTEE 209
Query: 516 DAGEEATYYSIAHTYHEKVTE 536
D EE + E++ +
Sbjct: 210 DYDEEDNPVEDSKAIKEELAK 230
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this family
was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 32.6 bits (74), Expect = 1.1
Identities = 32/124 (25%), Positives = 60/124 (48%), Gaps = 3/124 (2%)
Query: 1152 EDDGADDYEEEEEEVKPV-TKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS 1210
E++ D ++ E +KPV T+K+ ++ EEE K + R+R+ ++I
Sbjct: 23 EEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKRKAEERKRETLKIV 82
Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRK--KKVVEEEEEVEEAEGEE 1268
+E+ +E E K+ D E++EE++ K + K++ + EE EE E E+
Sbjct: 83 EEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYEAWKLRELKRIKRDREEREEMEREK 142
Query: 1269 EEED 1272
E +
Sbjct: 143 AEIE 146
Score = 31.4 bits (71), Expect = 2.8
Identities = 46/189 (24%), Positives = 77/189 (40%), Gaps = 21/189 (11%)
Query: 1093 EIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIE 1152
E+ EL + ++E E S EEE + E + + RK+ T ++E K
Sbjct: 3 EVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKA 62
Query: 1153 DDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRR-KKVQISD 1211
EEE + K +KR + K VEE ++ E +K V D
Sbjct: 63 -------LEEEAKRKAEERKRETLK-----IVEEEVKKELELKKRNTLLEANIDDVDTDD 110
Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
E+E+EE E+ K ++ K + K D EE+ + + K ++ + EE E +
Sbjct: 111 ENEEEEYEAWKLRELK--------RIKRDREEREEMEREKAEIEKMRNMTEEERRAELRK 162
Query: 1272 DDKASKPRK 1280
+ K S +
Sbjct: 163 NPKVSTNKA 171
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction that
requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 31.5 bits (72), Expect = 1.1
Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 38/79 (48%)
Query: 1148 LKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKV 1207
L A E+D +DD EE+EEE +DEE++
Sbjct: 106 LVASEEDESDDDEEDEEEE-----------------------DDEEDD------------ 130
Query: 1208 QISDEDEDEEDESPKKKKR 1226
DEDE EE+ESP KK +
Sbjct: 131 ---DEDESEEEESPVKKVK 146
Score = 29.2 bits (66), Expect = 7.8
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 1256 EEEEEVEEAEGEEEEEDDKASKPRK 1280
E+EEE ++ E ++E+E ++ P K
Sbjct: 119 EDEEEEDDEEDDDEDESEEEESPVK 143
>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
This group consists of Eukaryotic TruB proteins similar
to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
makes psi55 in the T loop of both cytoplasmic and
mitochondrial tRNAs. Psi55 is almost universally
conserved. Pseudouridine synthases catalyze the
isomerization of specific uridines in an RNA molecule to
pseudouridines (5-ribosyluracil, psi).
Length = 312
Score = 32.8 bits (75), Expect = 1.1
Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 7/107 (6%)
Query: 1133 RKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETA---- 1188
R+QV ++ L ++W++ EE ++ + V K G K E TA
Sbjct: 170 RRQVVVSELLV-KDWIEPGPLFTRTVEEEGKQYERSVVKMLGKELKTFAEVTELTATAEG 228
Query: 1189 PEDEEEEKPVK--KRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
EE E + KR+ + + +++ E S + + + +PA
Sbjct: 229 DPVEEVEATHEESKRKSEVEEEANEKSLGPEARSLESDAGRGSFSPA 275
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 33.4 bits (76), Expect = 1.1
Identities = 23/96 (23%), Positives = 34/96 (35%), Gaps = 9/96 (9%)
Query: 1159 YEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEED 1218
EEEE P + SGK +K + +K R K+ D+E
Sbjct: 759 DLEEEEFEYPRMYEEASGKSVRKVNTAVLSTT-------IKAAARAKQKPKEKGPNDKEI 811
Query: 1219 --ESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKK 1252
ESP + T + + K D + K+KK
Sbjct: 812 KIESPSVETEGERCTIKQREEKGIDAPAILNVKKKK 847
>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 424
Score = 32.9 bits (76), Expect = 1.1
Identities = 26/105 (24%), Positives = 43/105 (40%), Gaps = 2/105 (1%)
Query: 1170 TKK-RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKS 1228
KK R G GKK VE TA DE+E R +K + + + K+ R +
Sbjct: 31 NKKVRKGGFLGKK-LVEVTAAVDEDETPKKNPVLREEKRKPAKSILSLQALLEKRPSRTN 89
Query: 1229 TKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
++ E K + K + EE + E++ ++E D+
Sbjct: 90 SQDALLQAENALPEWKKELEKPSEPKEEEPKAAAESKVVQKELDE 134
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim have
been identified in both mice and humans (mTim and hTIM,
respectively). Mammalian TIM is not the true orthologue
of Drosophila TIM, but is the likely orthologue of a fly
gene, timeout (also called tim-2). mTim has been shown to
be essential for embryonic development, but does not have
substantiated circadian function. Some family members
contain a SANT domain in this region.
Length = 507
Score = 33.1 bits (75), Expect = 1.1
Identities = 42/238 (17%), Positives = 86/238 (36%), Gaps = 42/238 (17%)
Query: 1080 AERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEERE--------REKEEALMGRGSR 1131
++R+ + KP+ +E + ++D E E + E + E + E + +
Sbjct: 226 GKKRRKKLKPKQPNGEESGEDDFQEDPEEEEQLPESKPEETEKRVSAFQVEGSTLISAEN 285
Query: 1132 QRKQVDYTDSLTDQEWLKAIEDDGADDYEEEE--------------------EEVKPVTK 1171
R+Q+ + WL++ ADD EE+E E+ + + K
Sbjct: 286 LRQQLKQEKTSWPLLWLQSCLIRAADDREEDECDQAVPLVPLTEENEEAMENEQFQRLLK 345
Query: 1172 K--------------RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEE 1217
R K A EEE+P + + + +DE E E
Sbjct: 346 ALGLRPPRSGQEGFWRIPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHET 405
Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
K +++ + ++ +EE+ K+K+ + +++ +EE DD +
Sbjct: 406 LALRKNARQRKAGLASPEEEALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTS 463
Score = 31.2 bits (70), Expect = 4.3
Identities = 49/328 (14%), Positives = 100/328 (30%), Gaps = 34/328 (10%)
Query: 970 WNPHQDLQAQD--RAHRIGQKNEVRVLRLITVNSVEERI--------LAAARYKLNMDEK 1019
W +DL+ Q R +++ IT ++ L + + +L +K
Sbjct: 169 WTGDEDLELQRLYEEFRDSDDVLGHIMKNITGKRSRAQVVDKLLQLGLVSEKRELRGGKK 228
Query: 1020 VIQAGMFDQKSTGSERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMD 1079
+ Q + + Q +EEE+ E +ET ++ A + TL +
Sbjct: 229 RRKKLKPKQPNGEESGEDDFQEDP---EEEEQLPESKPEETEKRVSAFQVEGSTLISAEN 285
Query: 1080 AERRKGETKPRLVEIKELPDWMVKDDDEVENSMYE-----EEREREKEEALMGRGSRQRK 1134
++ + K + + DD E+ + E +E + R K
Sbjct: 286 LRQQLKQEKTSWPLLWLQSCLIRAADDREEDECDQAVPLVPLTEENEEAMENEQFQRLLK 345
Query: 1135 QVDYTDSLTDQE--WLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDE 1192
+ + QE W + + K E A E E
Sbjct: 346 ALGLRPPRSGQEGFWRIPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDG---EQADESE 402
Query: 1193 EEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKK 1252
E ++K R++K ++ ++ + K K+ + +
Sbjct: 403 HETLALRKNARQRKAGLA-----------SPEEEALGEEEQKAPPKKKQLNQKNKQQTGS 451
Query: 1253 KVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
+EE + + E+ + D PR
Sbjct: 452 GTNSDEERDDTSLDEDRDLADDGGLPRI 479
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This family
consists of several bovine specific leukaemia virus
receptors which are thought to function as transmembrane
proteins, although their exact function is unknown.
Length = 561
Score = 33.1 bits (75), Expect = 1.2
Identities = 48/210 (22%), Positives = 85/210 (40%), Gaps = 30/210 (14%)
Query: 1097 LPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTD------------ 1144
+ D VK ++E + E+ +REK++ + R+R T+S D
Sbjct: 77 MSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTE 136
Query: 1145 ---QEWLKAIEDDG--ADDYEE---------EEEEVKPVTKKRGSGKKGKKSYVEETAPE 1190
+ L + EDD D Y + E PV K R + + KS + P
Sbjct: 137 EMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNA--ETSKSPEKGDVPA 194
Query: 1191 DEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR 1250
E++ K KK+ +++K + D+D+ +E E K +PA + +E + +
Sbjct: 195 VEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTV 254
Query: 1251 KKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
+ E +E + E EE K+ K +K
Sbjct: 255 SGT--APDSEPDEPKDAEAEETKKSPKHKK 282
Score = 32.7 bits (74), Expect = 1.5
Identities = 22/108 (20%), Positives = 39/108 (36%)
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
+E++E+ K K + +G KS + + +S D E +
Sbjct: 205 KEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPD 264
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
PK + + TK K K K+ +EK K+KK + +
Sbjct: 265 EPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQ 312
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular function
of Rrp7A remains unclear currently. Rrp7A harbors an
N-terminal RNA recognition motif (RRM), also termed RBD
(RNA binding domain) or RNP (ribonucleoprotein domain),
and a C-terminal RRP7 domain.
Length = 129
Score = 31.1 bits (71), Expect = 1.3
Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 14/91 (15%)
Query: 1137 DYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK 1196
+Y S+ D L++ D+ ++Y++EEEE +K E+ A DE+
Sbjct: 8 EYNASIPDPAELQSEIDEYMEEYDKEEEE--------------EKEEKEKEAEPDEDGWV 53
Query: 1197 PVKKRRRRKKVQISDEDEDEEDESPKKKKRK 1227
V K+ RR K + + E KKKK+K
Sbjct: 54 TVTKKGRRPKTARKESVAAKAAEKEKKKKKK 84
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 32.8 bits (74), Expect = 1.3
Identities = 33/181 (18%), Positives = 69/181 (38%), Gaps = 6/181 (3%)
Query: 1102 VKDDDEVENSMYEEEREREKEEALMG--RGSRQRKQVDYTDSLTDQEWLKAIEDDGADDY 1159
V ++ N ++ERE L + R R+ + + +E K ++D D
Sbjct: 158 VYHENSFLNKHLKKEREYYYLHGLSPPLKYVRARRFRKKSSKIEIEEVEKKVDDLLEKDM 217
Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
+ E V +K ++ + S E E E ++ ++ + E+ E++
Sbjct: 218 KAESVSVVLKDEKE-LARQERVSSWENFKEEPGEPLSRPALKKEKQGAEEEGEEGMSEED 276
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPR 1279
K ++ ++ EE K+ E + + + GEE+EEDD+ +
Sbjct: 277 LDVGAAEIENKEVSEGDKEQQQEEVENAEAHKE---EVQSDRPDEIGEEKEEDDENEENE 333
Query: 1280 K 1280
+
Sbjct: 334 R 334
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 32.6 bits (74), Expect = 1.3
Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 2/46 (4%)
Query: 475 GWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEE 520
+VV E+S DE SEDE E E + V E + + + D E
Sbjct: 339 EGDVVTEESTDEESEDEV--EIDESVIEEVAEMELLEVQVDDLAER 382
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are RNase
Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in turnover
of yitJ riboswitch [Transcription, Degradation of RNA].
Length = 514
Score = 32.6 bits (75), Expect = 1.4
Identities = 21/76 (27%), Positives = 43/76 (56%), Gaps = 5/76 (6%)
Query: 1201 RRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEE 1260
+ RR ++Q + + +E+ +K K K+++ E E++ +S ++K + E+EEE
Sbjct: 72 KERRNELQRLERRLLQREETLDRKMESLDK---KEENLEKKEKE--LSNKEKNLDEKEEE 126
Query: 1261 VEEAEGEEEEEDDKAS 1276
+EE E+ EE ++ S
Sbjct: 127 LEELIAEQREELERIS 142
>gnl|CDD|180481 PRK06231, PRK06231, F0F1 ATP synthase subunit B; Validated.
Length = 205
Score = 32.1 bits (73), Expect = 1.4
Identities = 19/67 (28%), Positives = 35/67 (52%), Gaps = 6/67 (8%)
Query: 327 QKLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQT 386
++ N I EKE+++ +E+++KE + M EE +K +D++ D +L
Sbjct: 140 RQANLIIFQARQEIEKERRELKEQLQKESVELAMLAAEELIKKKVDREDDDKL------V 193
Query: 387 DEYISNL 393
DE+I L
Sbjct: 194 DEFIREL 200
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain fusion
protein; Provisional.
Length = 131
Score = 31.1 bits (70), Expect = 1.5
Identities = 24/100 (24%), Positives = 38/100 (38%), Gaps = 23/100 (23%)
Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
+G+ K TA + EE + +D DED ++ +
Sbjct: 54 EAGRAEKGPAAAATAAAEAAEEA-----------EAADADEDADEAA------------E 90
Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
D E DEE+ + E + E EEA+ EE+EE +
Sbjct: 91 ADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which are
members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 31.7 bits (72), Expect = 1.6
Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 4/65 (6%)
Query: 1172 KRGSGKKGKKSYVEETAPE----DEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRK 1227
K+ K KK ++ PE D E K +K+ ++KK + E + ++ E KKKKR
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Query: 1228 STKTP 1232
S + P
Sbjct: 170 SPEHP 174
Score = 29.4 bits (66), Expect = 8.7
Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 11/65 (16%)
Query: 1178 KGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKH 1237
+ + EET + E + KK +++K ++ ++KK+K K K +H
Sbjct: 121 RTQDPLPEETPSDSEGLKGHEKKHKKKKH-----------EDDKERKKKKKEKKKKKKRH 169
Query: 1238 KEDDE 1242
+
Sbjct: 170 SPEHP 174
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 32.7 bits (74), Expect = 1.6
Identities = 25/121 (20%), Positives = 50/121 (41%), Gaps = 12/121 (9%)
Query: 1161 EEEEEVKP--------VTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE 1212
++ E++KP TKK GK K + + +DE +E + + + + D+
Sbjct: 646 KKVEDIKPEDKFFYQYFTKKTADGKGKKSNKASFDS-DDEMDENEIWSALVKSRPDVEDD 704
Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
+D E + + ST + D+ K D + + ++EE ++E + E
Sbjct: 705 SDDSELDFAEDDFSDST---SDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQ 761
Query: 1273 D 1273
D
Sbjct: 762 D 762
Score = 30.4 bits (68), Expect = 9.1
Identities = 34/139 (24%), Positives = 60/139 (43%), Gaps = 24/139 (17%)
Query: 1050 EEENEVPDDETVNQLIAR-------TEDELTLFQKMDAERRKGETKPRLVEIKELPDWMV 1102
+ ++E+ ++E + L+ ++D F + D + +P+L I
Sbjct: 680 DSDDEMDENEIWSALVKSRPDVEDDSDDSELDFAEDDFSDSTSDDEPKLDAI-------- 731
Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
DDE S E +E ++EE L + D +DS + E+D + + E+E
Sbjct: 732 --DDEDAKS--EGSQESDQEEGLDEIFYSFDGEQDNSDSFAESS-----EEDESSEEEKE 782
Query: 1163 EEEVKPVTKKRGSGKKGKK 1181
EEE K V+ KR K+ K
Sbjct: 783 EEENKEVSAKRAKKKQRKN 801
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 32.7 bits (74), Expect = 1.8
Identities = 19/129 (14%), Positives = 40/129 (31%), Gaps = 13/129 (10%)
Query: 1132 QRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSY------VE 1185
+ + ++L ++ + + + KR + + +
Sbjct: 324 ETDRAREMETLAHD----NVKQIVGAQHGILSAKREGAVDKRRTISTARAGLDALVKGLG 379
Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
APE EE + R + E E ++ +T A D+ K D+E
Sbjct: 380 GAAPESAEELLELNNAARLTVDEYPAAREALESA---GQRNVEDRTRAVDEFKAADQELS 436
Query: 1246 VVSKRKKKV 1254
+SK +
Sbjct: 437 SLSKGSSNI 445
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter subunit
G; Reviewed.
Length = 197
Score = 31.6 bits (71), Expect = 2.0
Identities = 24/98 (24%), Positives = 47/98 (47%), Gaps = 24/98 (24%)
Query: 1180 KKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKE 1239
++ +E+ E EE E+ ++ RR +K+ DE ED+E++ +++++ + +
Sbjct: 124 RQEQIEKARQEREELEERMEWERREEKI---DEREDQEEQEREREEQTI-------EEQS 173
Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
DD E ++ E+ E E E +DDK K
Sbjct: 174 DDSEHEII--------------EQDESETESDDDKTEK 197
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 31.9 bits (72), Expect = 2.3
Identities = 40/210 (19%), Positives = 74/210 (35%), Gaps = 48/210 (22%)
Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
KD ++++ +E ++ + G+ K ++E KA+ + EE
Sbjct: 5 KDLEDLKKEAARRAKEEARKRLVAKHGAEISKL-----EEENREKEKALPKNDDMTIEEA 59
Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRR------------RRKKVQIS 1210
+ K + + +K E E EEEK K + ++K + +
Sbjct: 60 KRRAAAAAKAKAAALAKQK---REGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGT 116
Query: 1211 DEDEDEEDESPKKK--------------KRKSTKTPAKDKHKEDDEEK-----------H 1245
+E +EE + K K +++ ++ +E D+EK
Sbjct: 117 EEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAK 176
Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
+ K+K E E EE EEE KA
Sbjct: 177 AAALAKQKAAEAGEGTEEV---TEEEKAKA 203
>gnl|CDD|147445 pfam05250, UPF0193, Uncharacterized protein family (UPF0193). This
family of proteins is functionally uncharacterized.
Length = 212
Score = 31.5 bits (71), Expect = 2.3
Identities = 32/143 (22%), Positives = 58/143 (40%), Gaps = 22/143 (15%)
Query: 1132 QRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPED 1191
QR+Q++ L + E L ++ +E+E + + +R + KK +EE+ D
Sbjct: 44 QRRQINDH--LRNGEPLPVPKEP---KFEQEYDNTLAMADRRKNIKKRSLRTIEESGAFD 98
Query: 1192 EEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRK 1251
E+ +P + E +K+KRK T A + K V +R
Sbjct: 99 VEKYRPADNK-----------------EPGEKEKRKLQNTMAGTHIPDPGIRKRVPRRRS 141
Query: 1252 KKVVEEEEEVEEAEGEEEEEDDK 1274
++ ++ EE E EE + K
Sbjct: 142 EEEMDVTEEDRFDELLEEINERK 164
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen Zg14,
is the Rrp7p homolog mainly found in Metazoans. The
cellular function of Rrp7A remains unclear currently.
Both Rrp7p and Rrp7A harbor an N-terminal RNA recognition
motif (RRM), also termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7 domain.
Length = 118
Score = 30.3 bits (69), Expect = 2.3
Identities = 22/95 (23%), Positives = 38/95 (40%), Gaps = 13/95 (13%)
Query: 1137 DYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK 1196
+Y S D L++ D+ +++++ EEE K K+ DE+
Sbjct: 8 EYKRSRPDPAELQSEVDEYMEEFDKREEEEKEEAKEAR-------------NEPDEDGFV 54
Query: 1197 PVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT 1231
V + R+ K + E + E KKKK+K
Sbjct: 55 TVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1 protein
homologues. SDA1 is a Saccharomyces cerevisiae protein
which is involved in the control of the actin
cytoskeleton. The protein is essential for cell viability
and is localised in the nucleus.
Length = 317
Score = 31.5 bits (72), Expect = 2.4
Identities = 36/147 (24%), Positives = 53/147 (36%), Gaps = 47/147 (31%)
Query: 1135 QVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE 1194
+ + +E + +E D DD EEE E VEE D+E
Sbjct: 76 EKWKEEERKKKEAEQGLESDDDDDEEEEWE-------------------VEEDEDSDDEG 116
Query: 1195 EKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKV 1254
E I E + E + S ED+EEK +K+ K+
Sbjct: 117 E------------WIDVESDKEIESS----------------DSEDEEEKDEAAKKAKED 148
Query: 1255 VEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+EE E+ E EEE+ +A K + S
Sbjct: 149 SDEELSEEDEEEAAEEEEAEAEKEKAS 175
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends to
by Lys/Arg-rich, and is followed by a membrane-spanning
region. This is followed by an acidic low-complexity
region of variable length and a well-conserved C-terminal
domain of two tandem regions matched by pfam05036
(Sporulation related repeat), found in several cell
division and sporulation proteins. The role of FtsN as a
suppressor for other cell division mutations is poorly
understood; it may involve cell wall hydrolysis [Cellular
processes, Cell division].
Length = 298
Score = 31.6 bits (71), Expect = 2.4
Identities = 34/152 (22%), Positives = 53/152 (34%), Gaps = 13/152 (8%)
Query: 1093 EIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIE 1152
I+EL V +D E S E + A R ++ Q D +
Sbjct: 85 YIEELEAREVLINDPEEPSNGGGVEESAQLTAEQ-RQLLEQMQADMRAAEK--------- 134
Query: 1153 DDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE 1212
E+ V +K+ + KK +K+ E E EK K + K+ Q +
Sbjct: 135 ---VLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALP 191
Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
+ E +S K + K DK K +EK
Sbjct: 192 KQTAETQSNSKPIETAPKADKADKTKPKPKEK 223
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein. Genome
duplication is precisely regulated by cyclin-dependent
kinases CDKs, which bring about the onset of S phase by
activating replication origins and then prevent
relicensing of origins until mitosis is completed. The
optimum sequence motif for CDK phosphorylation is
S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
potential phosphorylation sites. Drc1 is required for DNA
synthesis and S-M replication checkpoint control. Drc1
associates with Cdc2 and is phosphorylated at the onset
of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
replication by phosphorylating Drc1 and regulating its
association with Cut5. Sld2 and Sld3 represent the
minimal set of S-CDK substrates required for DNA
replication.
Length = 397
Score = 31.7 bits (72), Expect = 2.5
Identities = 28/127 (22%), Positives = 49/127 (38%), Gaps = 17/127 (13%)
Query: 1168 PVTKKRGSGK------KGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESP 1221
P+ +R GK + ++S +E E E + ++V S DE
Sbjct: 225 PLRPQRPVGKGLSELVQEEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVF 284
Query: 1222 KKKKRKST-----------KTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
KKK +K T K + E D + + +K + E + + ++E+
Sbjct: 285 KKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDED 344
Query: 1271 EDDKASK 1277
EDD+ SK
Sbjct: 345 EDDEESK 351
>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752). This
domain family is found in eukaryotes, and is typically
between 140 and 163 amino acids in length.
Length = 148
Score = 30.5 bits (69), Expect = 2.6
Identities = 22/90 (24%), Positives = 40/90 (44%), Gaps = 18/90 (20%)
Query: 1195 EKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEE------KHVVS 1248
E P +K +R +DE + S A K + D+E ++
Sbjct: 37 ETPEEKAKR------------LQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEK 84
Query: 1249 KRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
KR K +VE+ ++ ++ + +EEE DD + +P
Sbjct: 85 KRGKSLVEQHQKKQKKKKKEEENDDPSRRP 114
>gnl|CDD|178320 PLN02718, PLN02718, Probable galacturonosyltransferase.
Length = 603
Score = 31.8 bits (72), Expect = 2.7
Identities = 33/172 (19%), Positives = 63/172 (36%), Gaps = 5/172 (2%)
Query: 1094 IKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIED 1153
I+EL + DD +++ +EE E K L+ V +S E
Sbjct: 43 IEELSTIRYRTDDLRLSAIEQEEGEGLKGPKLVIYQDGDFNSVVSYNSSDKNNDSLESEV 102
Query: 1154 DGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK-KRRRRKKVQISDE 1212
DG ++++ +EE+ + K S + + + + + P K + +VQ
Sbjct: 103 DGGNNHKPKEEQAQVSQKTTVSSSEEVQISARDIQLNHKTQFNPPTVKHEKNTRVQP--- 159
Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEA 1264
DE K+ + K + A +V + + + E E V +A
Sbjct: 160 -RRATDEKVKEIRDKIIQAKAYLNLAPPGSNSQLVKELRLRTKELERAVGDA 210
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during metabolism,
and (2) salt tolerance (in plants), due to Na+ uptake
into vacuoles. This model is specific for the fungal
members of this family [Transport and binding proteins,
Cations and iron carrying compounds].
Length = 810
Score = 31.8 bits (72), Expect = 2.8
Identities = 53/282 (18%), Positives = 88/282 (31%), Gaps = 54/282 (19%)
Query: 1043 LHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMV 1102
L + E E++ E+ D + T + G R ++E +
Sbjct: 516 LRNNKEMEDDIEMNDLGRERLQKEKEAHAATFDLSTTTTTQLGTENGRGGGLEERSKTNI 575
Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
K+ E N++Y ++ E + D +S D + + + E
Sbjct: 576 KERSENVNTIYGLDKLARDTE-NRDVTYVPTSRYDGIESEIDDVYTYENDSESIASSERR 634
Query: 1163 ------EEEVKPVTK---------KRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRR--- 1204
EEE + + G+ + + + D E R +R
Sbjct: 635 RIKKLREEEQQAYIAYTEDNQVIIENRQGEILEYVDIHDRGARDAEVGVHNGGRLKRALS 694
Query: 1205 ----KKVQISDED----------------EDEEDE-------SPKKKKRKSTKTPAKDKH 1237
K QI++E EDE E SP KRK D
Sbjct: 695 PPLEKLHQITNEAKKSKYYAYKVGNDLIIEDESGEVFRRYRISPHGGKRKIK--KRNDSV 752
Query: 1238 KEDDEEKHV-----VSKRKKK-VVEEEEEVEEAEGEEEEEDD 1273
DEEK + V +R ++ E+E+ + E E E DD
Sbjct: 753 VSVDEEKAIEGPSRVPERGNHDLLHSEDEMADDEAESENMDD 794
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 31.3 bits (71), Expect = 2.8
Identities = 13/58 (22%), Positives = 28/58 (48%)
Query: 368 RKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKKKKKSIKQKILD 425
+LI + +K+ L Q + + +E++E+ +K + + ++ KK I KI
Sbjct: 6 TELISKLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKSGIFSKIFG 63
>gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is approximately 90 amino acids in length. The domain is
found in several amino-acyl tRNA synthetase enzymes as
well as in isolation in single domain proteins.
Length = 90
Score = 29.5 bits (67), Expect = 3.0
Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 5/36 (13%)
Query: 204 YFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYF 239
Y W F Y L SEDR ELL K E + +
Sbjct: 59 YSGW--FVYRYL---LFSEDRQELLAKIQELKKEIL 89
Score = 29.5 bits (67), Expect = 3.0
Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 5/36 (13%)
Query: 907 YFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYF 942
Y W F Y L SEDR ELL K E + +
Sbjct: 59 YSGW--FVYRYL---LFSEDRQELLAKIQELKKEIL 89
>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
Length = 653
Score = 31.6 bits (72), Expect = 3.0
Identities = 23/95 (24%), Positives = 51/95 (53%), Gaps = 8/95 (8%)
Query: 352 EKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREE 411
E ERMR+ + E Y + D+++ K+L L +Q D + + +KE+ E +++++
Sbjct: 506 EIERMRQ----EAEKYAE-EDRRR-KQLIELKNQADSLLYSYESTLKENGELISEELKQR 559
Query: 412 KKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITV 446
++K + ++ + D N+ +L QLE+ + +
Sbjct: 560 AEQKVEQLEAALTDPNISLEEL--KQQLEEFQQAL 592
>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
N-terminal domain. Autophagocytosis is a
starvation-induced process responsible for transport of
cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
ubiquitin like modifier that is topologically similar to
the canonical E2 enzyme. It catalyzes the conjugation of
Atg8 and phosphatidylethanolamine.
Length = 146
Score = 30.4 bits (69), Expect = 3.0
Identities = 21/71 (29%), Positives = 33/71 (46%), Gaps = 9/71 (12%)
Query: 1152 EDDGADD-YEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS 1210
DGA++ E+E+E+ VT K+ EE PE +++ V S
Sbjct: 84 YGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDD--------DVVDSS 135
Query: 1211 DEDEDEEDESP 1221
D DED++D+ P
Sbjct: 136 DADEDDDDDIP 146
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 31.7 bits (72), Expect = 3.0
Identities = 39/201 (19%), Positives = 74/201 (36%), Gaps = 22/201 (10%)
Query: 1078 MDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVD 1137
+ E+ G+ P K L W DE+E+ E+ E +R
Sbjct: 159 LVREKLTGDAPPDSAG-KVLDLW----RDEIEDKAGEDLDGLAAEIDDQQAFAR-----V 208
Query: 1138 YTDSLTDQEWLKAIEDDGA--DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
D L + + DDG D EE+ ++ +P + + + +E+ ++ E
Sbjct: 209 VRDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEAT 268
Query: 1196 KPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV 1255
+ + S++ E +E + + T+TP +D + ++ + KV
Sbjct: 269 DGEGEEGEMDAAEASEDSESDESD-------EDTETPGEDA-RPATPFTELMEEVDYKVF 320
Query: 1256 EEE--EEVEEAEGEEEEEDDK 1274
E E V E +E E D+
Sbjct: 321 TREFDEIVLAEELCDEAELDR 341
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this family
are typically between 146 and 237 amino acids in length.
There are two conserved sequence motifs: QYGP and PKLY.
Length = 155
Score = 30.5 bits (69), Expect = 3.1
Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 10/70 (14%)
Query: 1210 SDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
S +DE+EE E + + P ++ +++ V + +K + E +
Sbjct: 93 SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP------ 146
Query: 1270 EEDDKASKPR 1279
K SKP+
Sbjct: 147 ----KPSKPK 152
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family consists
of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations in
the sequences of these proteins affect flower and leaf
development.
Length = 382
Score = 31.5 bits (72), Expect = 3.2
Identities = 32/107 (29%), Positives = 54/107 (50%), Gaps = 15/107 (14%)
Query: 1116 EREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGS 1175
ER R +EE + GR R+ + D+ L A+ +G EE V+ + GS
Sbjct: 114 ERRRLEEEDVEGR----RRHLLSGDTTNA---LDALSQEGL-----SEEPVQQEKEAAGS 161
Query: 1176 GKKGKKSYVEETAPE---DEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
G G ++ P D E++K KK+RR++ ++ ++D+D+EDE
Sbjct: 162 GGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDE 208
Score = 30.4 bits (69), Expect = 7.1
Identities = 9/41 (21%), Positives = 22/41 (53%)
Query: 1233 AKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
H D E+K K+++K +E E ++ + +E+++ +
Sbjct: 173 GVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGE 213
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 31.8 bits (72), Expect = 3.3
Identities = 18/101 (17%), Positives = 43/101 (42%), Gaps = 9/101 (8%)
Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTK 1230
+R + +DE+++ +K R K ++ D+ E+ ++ + +
Sbjct: 5 NRREANINNNDR---MQEKDDEKQD---QKNRMELKEKVLDKKEEVVTDNVDSPVK---E 55
Query: 1231 TPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
+++ K DE K + K+++E + EE + E + E
Sbjct: 56 QSSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYE 96
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46. Homologues
of CDC45 have been identified in human, mouse and smut
fungus among others.
Length = 583
Score = 31.5 bits (72), Expect = 3.3
Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 4/79 (5%)
Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVS- 1248
E+E +++P R + D+DE+ ++E + K + + D +DD S
Sbjct: 107 EEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDED-DDDDDDDDDIATRERSL 165
Query: 1249 --KRKKKVVEEEEEVEEAE 1265
+R+++ EE+ E
Sbjct: 166 ERRRRRREWEEKRAELEFY 184
>gnl|CDD|218889 pfam06088, TLP-20, Nucleopolyhedrovirus telokin-like protein-20
(TLP20). This family consists of several
Nucleopolyhedrovirus telokin-like protein-20 (TLP20)
sequences. The function of this family is unknown but
TLP20 is known to shares some antigenic similarities to
the smooth muscle protein telokin although the amino acid
sequence shows no homologies to telokin.
Length = 162
Score = 30.4 bits (69), Expect = 3.4
Identities = 14/61 (22%), Positives = 20/61 (32%), Gaps = 13/61 (21%)
Query: 1221 PKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
KK S P E+DE+ EE+ + A E +D P+K
Sbjct: 107 WNSKKFVSAPVPHHSSEDENDED-------------EEDNADRAGIESGIDDSAPPSPKK 153
Query: 1281 S 1281
Sbjct: 154 Q 154
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 31.1 bits (70), Expect = 3.5
Identities = 20/96 (20%), Positives = 33/96 (34%), Gaps = 12/96 (12%)
Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
T + EE+KP K+ K E+ K KK T + K ++
Sbjct: 7 TTKKKTTEEKKPAAKKATTSK------------ETAKTKKTAKTTSTKAAKKAAKVKKTK 54
Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
V KKV + E+ E + E + + +
Sbjct: 55 SVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSA 90
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 31.7 bits (72), Expect = 3.6
Identities = 39/240 (16%), Positives = 73/240 (30%), Gaps = 22/240 (9%)
Query: 287 EVRVLRLIT--VNSVEERILAA-ARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKE 343
VR + + R+L +L +D R Q +
Sbjct: 518 AVREEKAAVTDIPEELARLLITDELPELAVDLLVQSRIRQHWQQLRKA----------LD 567
Query: 344 QKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEE 403
Q + + R A E ++ ++ D+ L + E ++ +K+ K+
Sbjct: 568 QLEAAYEALEGRFAAAEAAMAE-WQSEWEEALDE-LGLSRELSPEQQLDILSTMKDLKKL 625
Query: 404 QQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPA 463
QKK + + +Q + ++ L +Q DL
Sbjct: 626 MQKKAELTHQVARLREEQAAFEERVEGLLAVLEAQFIDLSTLF-CVQRLRVAA------E 678
Query: 464 KQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEATY 523
Q +E + G ++ DE + E+ K+ G TED + A EE
Sbjct: 679 LQKEEARLALEGNIERTKELNDELRAELELHRKEILDLFDCGTADTEDAFREAAREEQQL 738
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 31.2 bits (71), Expect = 3.6
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
DEE K + EEE++ E+E E+EE++D+ +
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEE 312
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one of
the members is annotated as an EF-hand family protein.
Length = 349
Score = 31.3 bits (71), Expect = 3.7
Identities = 23/98 (23%), Positives = 52/98 (53%)
Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
+G++ K + P ++ E VK R +K + + E +++++ +KKK+K
Sbjct: 251 SAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKV 310
Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
K + + +D + + +RK+++ + +E +E E E E+E
Sbjct: 311 KQRFEANDPAQKLQEERKEQLAKLRKEEKEREKEYEQE 348
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This family
includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 31.2 bits (69), Expect = 3.8
Identities = 24/97 (24%), Positives = 54/97 (55%), Gaps = 1/97 (1%)
Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRK-STKTPAKDKHKEDDEEK 1244
E + ++ EK K +++ + QI+ + E E+ + KK K + + A+ K + EE+
Sbjct: 59 ELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEE 118
Query: 1245 HVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
H K +++ +++E+E +EA+ ++ + + K K K+
Sbjct: 119 HKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKA 155
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 31.3 bits (72), Expect = 3.8
Identities = 23/138 (16%), Positives = 58/138 (42%), Gaps = 17/138 (12%)
Query: 339 NAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVK 398
E + +E E+ ++ ++ E +E ++L +K + + E L + +
Sbjct: 562 EEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANE 621
Query: 399 EHKEEQQKKMREEKKKKK------KSIKQK--ILDGNMDKLDLDESS-QLEDLRITVKET 449
+ +++++K+ ++++ K S+ QK +L D E+ Q +++ V +
Sbjct: 622 KKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVL----SIPDDKEAIVQAGIMKMKVPLS 677
Query: 450 NSDNKLRGEDAPPAKQIK 467
+ L P K+ K
Sbjct: 678 D----LEKIQKPKKKKKK 691
>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
Length = 259
Score = 30.9 bits (69), Expect = 3.9
Identities = 18/68 (26%), Positives = 30/68 (44%), Gaps = 3/68 (4%)
Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
D + EE+ V+ ++R KK + D+E P +K RR++ D
Sbjct: 176 DSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDE---PPRKSRRKRHSHRISSSTDS 232
Query: 1217 EDESPKKK 1224
+DE P+ K
Sbjct: 233 DDEEPRHK 240
>gnl|CDD|197366 cd09916, CpxP_like, CpxP component of the bacterial
Cpx-two-component system and related proteins. This
family summarizes bacterial proteins related to CpxP, a
periplasmic protein that forms part of a two-component
system which acts as a global modulator of cell-envelope
stress in gram-negative bacteria. CpxP aids in combating
extracytoplasmic protein-mediated toxicity, and may also
be involved in the response to alkaline pH. Functioning
as a dimer, it inhibits activation of the kinase CpxA,
but also plays a vital role in the quality control
system of P pili. It has been suggested that CpxP
directly interacts with CpxA via its concave polar
surface. Another member of this family, Spy, is also a
periplasmic protein that may be involved in the response
to stress. The homology between CpxP and Spy suggests
similar functions. A characteristic 5-residue sequence
motif LTXXQ is found repeated twice in many members of
this family.
Length = 96
Score = 29.1 bits (66), Expect = 4.0
Identities = 17/80 (21%), Positives = 34/80 (42%), Gaps = 7/80 (8%)
Query: 340 AEKEQKKEQERIEKERMRRLMAE---DEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQM 396
A+ + +EQ R +E +R L+ DE R L + + + + + Q+
Sbjct: 21 AQMKALREQMRAAREELRALLTADTFDEAAVRALAAEMAELQQELAVERAK----ARNQI 76
Query: 397 VKEHKEEQQKKMREEKKKKK 416
+ EQ+ K+ E K++
Sbjct: 77 YQVLTPEQRAKLNELFAKRR 96
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 31.2 bits (71), Expect = 4.0
Identities = 25/137 (18%), Positives = 46/137 (33%), Gaps = 17/137 (12%)
Query: 1151 IEDDGADDYEEEEEEVKPVTKKRGSG------KKGKKSYVEETAPEDEEEEKPVKKRRRR 1204
I D YE+E E K KKR +K K + ++ + +E+P +
Sbjct: 54 IIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEPA--- 110
Query: 1205 KKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEA 1264
+ S + + R+ K E+ E ++V E E +
Sbjct: 111 -ETAGSLRPIRSSAAAYRDINRELGSFYEYPKTDEEKEL-------LREVEELESRLATE 162
Query: 1265 EGEEEEEDDKASKPRKS 1281
E +++ + KS
Sbjct: 163 PSPAPELEEQLALMEKS 179
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 30.0 bits (68), Expect = 4.0
Identities = 22/74 (29%), Positives = 35/74 (47%)
Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
K+ ++E+EDE+DES + KR + AK+K + D + K+K EE
Sbjct: 34 KRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEER 93
Query: 1259 EEVEEAEGEEEEED 1272
AE E+E +
Sbjct: 94 FMKALAEAEKERAE 107
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 31.4 bits (71), Expect = 4.0
Identities = 26/95 (27%), Positives = 43/95 (45%)
Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
K+ + Y EE E + E + +K+ R + + D +E+ S K+K +S + DK
Sbjct: 78 KEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTDK 137
Query: 1237 HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
K DE + V K +++ E E V E E
Sbjct: 138 SKHIDEREEQVEKLEEQKKAELERVAALSQAEARE 172
>gnl|CDD|222875 PHA02558, uvsW, UvsW helicase; Provisional.
Length = 501
Score = 31.1 bits (71), Expect = 4.3
Identities = 28/133 (21%), Positives = 55/133 (41%), Gaps = 17/133 (12%)
Query: 576 GKTVQTIALLTYLMEKKKNLGPYLVIVPLSTL-SNWVLEFE--KWAPSANVVAYKGSPLA 632
GK++ L Y +E + G L+IVP ++L + + +F + P +
Sbjct: 141 GKSLIQYLLSRYYLENYE--GKVLIIVPTTSLVTQMIDDFVDYRLFPREAMHKIYS---- 194
Query: 633 RRALQGRMKNSKFNVLLTTYEFIIKDKSVLSKIQWKCMIIDEGHRMKNHHCKLTQTLNTF 692
G K++ ++++T++ +K Q+ +I+DE H LT +
Sbjct: 195 -----GTAKDTDAPIVVSTWQSAVKQPKEWFD-QFGMVIVDECHLFTGK--SLTSIITKL 246
Query: 693 YNSSHRVLLTGTP 705
N + LTG+
Sbjct: 247 DNCKFKFGLTGSL 259
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 31.2 bits (70), Expect = 4.4
Identities = 21/78 (26%), Positives = 39/78 (50%)
Query: 1185 EETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
EET EE+ + R + V ++D+D+E+ +K+ + + E D++
Sbjct: 466 EETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDV 525
Query: 1245 HVVSKRKKKVVEEEEEVE 1262
+ +K+KV EEEEE +
Sbjct: 526 NKSKNKKRKVDEEEEEKK 543
>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
Length = 568
Score = 31.2 bits (71), Expect = 4.7
Identities = 25/146 (17%), Positives = 52/146 (35%), Gaps = 11/146 (7%)
Query: 321 NNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKK--DKR 378
+++ +++ + E +EQ ++ + + + A + ++ K D
Sbjct: 347 DSSREIEEIKFDLSEDRSEIEILVFREQSQLSQSSLSGIFAYMQSLKKETETLKSRIDLI 406
Query: 379 LAFLLSQTDEYISNLTQMVKEHKEEQQKK---------MREEKKKKKKSIKQKILDGNMD 429
+ ++S T E L + ++K Q +REE+ KKK+ K+ D
Sbjct: 407 MKSIISGTFERKKLLKEEGWKYKSTLQFLRIEIDRLLLLREEELSKKKTKIHKLNKLRHD 466
Query: 430 KLDLDESSQLEDLRITVKETNSDNKL 455
L S E E S +
Sbjct: 467 LSSLLSSIPEETSDRVESEKASKLRS 492
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 31.0 bits (70), Expect = 4.8
Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 4/59 (6%)
Query: 1215 DEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVS--KRKKKVVEEEEEVEEAEGEEEEE 1271
D + + KK TK ++ H E+ E K K+K+ + +V+ + EEEE+
Sbjct: 21 DRRKQLARSKKVYGTKN--RNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEED 77
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 31.1 bits (71), Expect = 4.9
Identities = 23/96 (23%), Positives = 34/96 (35%), Gaps = 35/96 (36%)
Query: 1164 EEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKK 1223
+E++P TK K GK S K V +R KK +K
Sbjct: 374 DELRPKTKAPSEKKTGKPS-------------KKVLAKRAEKKE-------------KEK 407
Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEE 1259
+K K K +H+ + + KR+K EE
Sbjct: 408 EKPK-----VKKRHR----DTKNIGKRRKPSGTSEE 434
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein TolA;
Provisional.
Length = 387
Score = 30.9 bits (70), Expect = 4.9
Identities = 15/105 (14%), Positives = 44/105 (41%), Gaps = 5/105 (4%)
Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTK 1230
+++ + + ++ +E +E ++K ++ R K+++ E+ + ++ ++
Sbjct: 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ--- 126
Query: 1231 TPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
A K K+ +E + K E E + A ++ + K
Sbjct: 127 --AALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169
Score = 30.5 bits (69), Expect = 5.8
Identities = 36/173 (20%), Positives = 63/173 (36%), Gaps = 18/173 (10%)
Query: 1116 EREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE---EEEVKPVTKK 1172
E +R+K+E +Q++ + QE LK +E + E++ EE K K
Sbjct: 78 EEQRKKKEQQQAEELQQKQAAE-------QERLKQLEKERLAAQEQKKQAEEAAKQAALK 130
Query: 1173 RGSGKKGKKSYVEETAPEDEEEEKPV----KKRRRRKKVQISDEDEDEEDESPKKKKRKS 1228
+ ++ + E E K KK K + E + KKK
Sbjct: 131 QKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAE 190
Query: 1229 TKTP----AKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
AK K + + ++K +KK E + +A E + +KA+
Sbjct: 191 AAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAA 243
Score = 30.2 bits (68), Expect = 8.7
Identities = 35/185 (18%), Positives = 70/185 (37%), Gaps = 27/185 (14%)
Query: 340 AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
AE+++KK++++ +E ++ AE E + Q + +RLA Q K+
Sbjct: 77 AEEQRKKKEQQQAEELQQKQAAEQER-----LKQLEKERLA-------------AQEQKK 118
Query: 400 HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLD-ESSQLEDLRITVKETNSDNKLRGE 458
EE K+ ++K+ +++ + K + E+ + K+ ++ K + E
Sbjct: 119 QAEEAAKQAALKQKQAEEAAAKAAAAA---KAKAEAEAKRAAAA---AKKAAAEAKKKAE 172
Query: 459 DAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEK--KEEPSKAVGEKKTEDDEYKD 516
K E +E + + E K+K E KA E K +
Sbjct: 173 AEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAA 232
Query: 517 AGEEA 521
+ A
Sbjct: 233 EAKAA 237
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like protein.
This domain family is found in eukaryotes, and is
typically between 212 and 238 amino acids in length. The
family is found in association with pfam01805. There are
two completely conserved residues (W and H) that may be
functionally important. PRP21 is required for assembly of
the prespliceosome and it interacts with U2 snRNP and/or
pre-mRNA in the prespliceosome. This family also contains
proteins similar to PRP21, such as the mammalian SF3a.
SF3a also interacts with U2 snRNP from the
prespliceosome, converting it to its active form.
Length = 230
Score = 30.5 bits (69), Expect = 4.9
Identities = 16/76 (21%), Positives = 30/76 (39%), Gaps = 6/76 (7%)
Query: 1184 VEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEE 1243
EE E+++EE K K+++I E + K RK E ++
Sbjct: 114 EEEDIDEEKKEESAKKSTPPIKEMKIPAAGE------SRLKIRKDYVPSGNKPRAEKPKK 167
Query: 1244 KHVVSKRKKKVVEEEE 1259
K + +++ E+E
Sbjct: 168 KMIKCPITGELIPEDE 183
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 30.9 bits (70), Expect = 4.9
Identities = 20/102 (19%), Positives = 41/102 (40%)
Query: 1176 GKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKD 1235
K+G K +E + EEE++ + + ++ ++ + K + K K
Sbjct: 364 LKEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKV 423
Query: 1236 KHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
+ E+D + KK E E + E E+E+E + +
Sbjct: 424 EKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVE 465
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is a
family of eukaryotic proteins of unknown function. Some
of the proteins in this family are putative nucleic acid
binding proteins.
Length = 158
Score = 29.8 bits (67), Expect = 5.0
Identities = 16/88 (18%), Positives = 36/88 (40%), Gaps = 7/88 (7%)
Query: 1191 DEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK------KRKSTKTPAKDKHKEDDEEK 1244
++RR R + + D SP + +R +++ + + ++ E+
Sbjct: 17 RSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRER 76
Query: 1245 HVVSKRKKKVVEEEEEVEEA-EGEEEEE 1271
++ KK ++ EE EG+ +EE
Sbjct: 77 DKDAREPKKRERQKLIKEEDLEGKSDEE 104
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 30.6 bits (69), Expect = 5.3
Identities = 32/155 (20%), Positives = 52/155 (33%), Gaps = 19/155 (12%)
Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTD--QEWLKAIEDDGADDYEEEEEEV----- 1166
EE R + E L R + R Q T+ L E K + DG +E E+
Sbjct: 136 EEVRADDTPEVLAKRLASYRAQ---TEPLVHYYSEKRKLLTVDGMMTIDEVTREIGRVLA 192
Query: 1167 -------KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
K K + KK+ + + + +K K K + + + +
Sbjct: 193 AVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKK--AAKTAAKAAK 250
Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKV 1254
KK +K+ K AK K + +K K
Sbjct: 251 KAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGA 285
>gnl|CDD|222428 pfam13872, AAA_34, P-loop containing NTP hydrolase pore-1.
Length = 303
Score = 30.6 bits (70), Expect = 5.6
Identities = 26/93 (27%), Positives = 35/93 (37%), Gaps = 28/93 (30%)
Query: 637 QGRMKNSKFNVLLTTYEFII-------KDKSVLSKI-QW------KCMIIDEGHRMKNHH 682
QG+ K +L TTY + K +S L +I W ++ DE H+ KN
Sbjct: 129 QGKPIRLKEGILFTTYATLRSESQAGGKKRSRLDQIVDWLGEDFDGVIVFDECHKAKNAA 188
Query: 683 ---CKLTQTLNTFYNSSHRVLLTGTPLQNKLPE 712
K +Q L G LQN LP
Sbjct: 189 GGEKKPSQQ-----------GLAGLRLQNALPN 210
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function of
this family is unknown.
Length = 417
Score = 30.6 bits (68), Expect = 5.8
Identities = 28/131 (21%), Positives = 59/131 (45%), Gaps = 4/131 (3%)
Query: 1148 LKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKV 1207
L A D D E + K R K +E ++E++K K+
Sbjct: 116 LAACFADDQDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANS 175
Query: 1208 QISDEDEDEEDESPKKK----KRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE 1263
QI E E ++ E K+K K+K++ K+ + + E++ ++++ + E+++ ++E
Sbjct: 176 QIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKE 235
Query: 1264 AEGEEEEEDDK 1274
AE +E ++
Sbjct: 236 AEQNCQENHNQ 246
>gnl|CDD|224142 COG1221, PspF, Transcriptional regulators containing an AAA-type
ATPase domain and a DNA-binding domain [Transcription /
Signal transduction mechanisms].
Length = 403
Score = 30.4 bits (69), Expect = 5.9
Identities = 32/148 (21%), Positives = 55/148 (37%), Gaps = 17/148 (11%)
Query: 201 LEDYFIWRDFSYLRLDGTTKSEDRGELLRKF-------NEPESKYFIFLLSTRAGGLGLN 253
L ++F+ S R G S D E LR N E K + +A G G +
Sbjct: 259 LAEHFLK---SEARRLGLPLSVDSPEALRALLAYDWPGNIRELKNLVERAVAQASGEGQD 315
Query: 254 LQAADTVIIFDSDWNPHQDLQAQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNM 313
L+ I D P + + ++ LR ++ VEER+L A +
Sbjct: 316 LEDLIISIRLDEL--PGLNDVPAGISLQLNLPELPLSLRETSLEQVEERLLQKALEQNKG 373
Query: 314 DEKEYHRNNTARTQKLNKAIINYHINAE 341
++K+ AR +++ + Y +
Sbjct: 374 NKKK-----AARLLGISRKTLRYRLKKY 396
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 30.1 bits (68), Expect = 6.1
Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 1/62 (1%)
Query: 1219 ESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
ES K K K P ++ K K + E V++ +E++E+ K K
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIK-AQMAERVKKLHSQEKKEEKKKPKK 75
Query: 1279 RK 1280
+K
Sbjct: 76 KK 77
>gnl|CDD|218595 pfam05460, ORC6, Origin recognition complex subunit 6 (ORC6). This
family consists of several eukaryotic origin recognition
complex subunit 6 (ORC6) proteins. Despite differences in
their structure and sequences among eukaryotic
replicators, ORC is a conserved feature of replication
initiation in all eukaryotes. ORC-related genes have been
identified in organisms ranging from S. pombe to plants
to humans. All DNA replication initiation is driven by a
single conserved eukaryotic initiator complex termed he
origin recognition complex (ORC). The ORC is a six
protein complex. The function of ORC is reviewed in.
Length = 297
Score = 30.3 bits (68), Expect = 6.2
Identities = 27/151 (17%), Positives = 45/151 (29%), Gaps = 13/151 (8%)
Query: 1028 QKSTGSERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGET 1087
G E F + D+ + + E NQ E ++ + +A + K
Sbjct: 138 HVLLGVESIVFNDGRKKKKDKISIADII---ELCNQFGLPVEVTVSALEGFEAYKHKFLK 194
Query: 1088 KPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYT-----DSL 1142
K L + + + L QR+ VD + L
Sbjct: 195 KSAWQLACGLV--LAAYFRINHRLLRQTPGIDSL---LPQLEGSQREDVDLWIPIVEEWL 249
Query: 1143 TDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
D++W + IE EEE+V K
Sbjct: 250 EDEDWFREIEIKYDYFDGAEEEQVIRKRVKL 280
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 30.0 bits (68), Expect = 6.4
Identities = 25/106 (23%), Positives = 44/106 (41%), Gaps = 20/106 (18%)
Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTK 1230
KK+ KK K S+ ++ EDE+E + KK + ++ + K+K K
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEP--------DEANVNPNKKKIGK 52
Query: 1231 TP------AKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
P DK +E+ E ++ + EE ++ +EEE
Sbjct: 53 NPSVDTSFLPDKAREEKEA------ELREELREEFLKKQEAVKEEE 92
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part of
a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 30.0 bits (68), Expect = 6.6
Identities = 26/137 (18%), Positives = 54/137 (39%), Gaps = 6/137 (4%)
Query: 1118 EREKEE-ALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG 1176
E+ KEE L GS K + + D +Q+ E + E +P +
Sbjct: 106 EKLKEELHLTDSGSAGPKHIVFVDDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKT 165
Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
+ +++A + + K +K+R+ R+K + + E +++ K K
Sbjct: 166 SLVDEKQKKKSAKKKRKLYKELKERKEREK-----KLKKVEQRLELQRELMKKGKGKKKK 220
Query: 1237 HKEDDEEKHVVSKRKKK 1253
+D + K V +K++
Sbjct: 221 IVKDKDGKVVYKWKKER 237
>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family. All
proteins in this family for which functions are known
are 3'-5' DNA-DNA helicases. These proteins are used for
recombination, recombinational repair, and possibly
maintenance of chromosome stability. This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 470
Score = 30.5 bits (69), Expect = 6.7
Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 5/83 (6%)
Query: 599 LVIVPL-STLSNWVLEFEKWAPSANVVAYKGSPLARRALQGRMKNSKFNVLLTTYEFIIK 657
LVI PL S + + VL+ + A + S ++ + +K+ K +L T E
Sbjct: 55 LVISPLISLMEDQVLQLKASGIPATFLNSSQSKEQQKNVLTDLKDGKIKLLYVTPEKCSA 114
Query: 658 DKSVLSKIQ--WK-CMI-IDEGH 676
+L ++ +I +DE H
Sbjct: 115 SNRLLQTLEERKGITLIAVDEAH 137
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 30.5 bits (69), Expect = 7.0
Identities = 22/92 (23%), Positives = 33/92 (35%), Gaps = 12/92 (13%)
Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDED----EDEEDESPKKKKR 1226
R GK + E A + + +K K+R+ KKV + E + KK
Sbjct: 730 APRNVGKTAR-----EKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDA 784
Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
+ K P+ K K +KR K E
Sbjct: 785 RKAKKPSAKTQKIAAATK---AKRAAKKKVAE 813
>gnl|CDD|240653 cd12176, PGDH_3, Phosphoglycerate dehydrogenases, NAD-binding and
catalytic domains. Phosphoglycerate dehydrogenases
(PGDHs) catalyze the initial step in the biosynthesis of
L-serine from D-3-phosphoglycerate. PGDHs come in 3
distinct structural forms, with this first group being
related to 2-hydroxy acid dehydrogenases, sharing
structural similarity to formate and glycerate
dehydrogenases. PGDH in E. coli and Mycobacterium
tuberculosis form tetramers, with subunits containing a
Rossmann-fold NAD binding domain. Formate/glycerate and
related dehydrogenases of the D-specific 2-hydroxyacid
dehydrogenase superfamily include groups such as formate
dehydrogenase, glycerate dehydrogenase, L-alanine
dehydrogenase, and S-adenosylhomocysteine hydrolase.
Despite often low sequence identity, these proteins
typically have a characteristic arrangement of 2 similar
subdomains of the alpha/beta Rossmann fold NAD+ binding
form. The NAD+ binding domain is inserted within the
linear sequence of the mostly N-terminal catalytic
domain, which has a similar domain structure to the
internal NAD binding domain. Structurally, these domains
are connected by extended alpha helices and create a
cleft in which NAD is bound, primarily to the C-terminal
portion of the 2nd (internal) domain. Some related
proteins have similar structural subdomain but with a
tandem arrangement of the catalytic and NAD-binding
subdomains in the linear sequence.
Length = 304
Score = 30.2 bits (69), Expect = 7.1
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 32 FNAPFATTGEKVELNEEETILIIRRL 57
FNAPF+ T EL E I++ RRL
Sbjct: 91 FNAPFSNTRSVAELVIGEIIMLARRL 116
Score = 30.2 bits (69), Expect = 7.1
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 735 FNAPFATTGEKVELNEEETILIIRRL 760
FNAPF+ T EL E I++ RRL
Sbjct: 91 FNAPFSNTRSVAELVIGEIIMLARRL 116
>gnl|CDD|217920 pfam04137, ERO1, Endoplasmic Reticulum Oxidoreductin 1 (ERO1).
Members of this family are required for the formation of
disulphide bonds in the ER.
Length = 348
Score = 30.4 bits (69), Expect = 7.1
Identities = 13/66 (19%), Positives = 29/66 (43%)
Query: 1048 EEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDE 1107
+E+E+P+ ++L E F ++D + + +L D+ V DD++
Sbjct: 45 CTCDESEIPEVWKPSELEGSACKEGKDFGELDCSLSDCRASCKWKDTDDLDDYCVVDDED 104
Query: 1108 VENSMY 1113
+ +Y
Sbjct: 105 SDKGVY 110
>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C terminus.
The C terminus of the plasma membrane Nha1 antiporter
plays an important role in the immediate cell response to
hypo-osmotic shock which prevents an execessive loss of
ions and water. This domain is found with pfam00999.
Length = 430
Score = 30.5 bits (69), Expect = 7.1
Identities = 24/128 (18%), Positives = 49/128 (38%), Gaps = 24/128 (18%)
Query: 1092 VEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAI 1151
V+ + D + +S EE+ E + +L+G + +R++ S + ++
Sbjct: 326 VKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPSSSSPPSRPRSR 385
Query: 1152 EDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISD 1211
D +D E ++ + D+EEE P ++RRR +
Sbjct: 386 RDSEDEDTERDDSD------------------------SDDEEETPAERRRRLAALGELP 421
Query: 1212 EDEDEEDE 1219
D++DE
Sbjct: 422 SARDDDDE 429
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of AF4
(Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain. Lilliputian
represents a novel pair-rule gene that acts in
cytoskeleton regulation, segmentation and morphogenesis
in Drosophila.
Length = 1154
Score = 30.7 bits (69), Expect = 7.2
Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 10/68 (14%)
Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE--AEGEEEEEDD 1273
E +S K++++KT + E K KRK K EE +++E EE+
Sbjct: 724 AEKDSLSAPKKQTSKTAS-----EKSSSK---GKRKHKNDEEADKIESKKQRLEEKSSSC 775
Query: 1274 KASKPRKS 1281
S
Sbjct: 776 SPSSSSSH 783
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This family
includes gbp a protein from Soybean that binds to GAGA
element dinucleotide repeat DNA. It seems likely that the
this domain mediates DNA binding. This putative domain
contains several conserved cysteines and a histidine
suggesting this may be a zinc-binding DNA interaction
domain.
Length = 301
Score = 30.2 bits (68), Expect = 7.2
Identities = 21/106 (19%), Positives = 29/106 (27%), Gaps = 31/106 (29%)
Query: 1152 EDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISD 1211
D+ E EV P+ TAP + +E K KK + K
Sbjct: 120 GDNPYGTREMHHLEVPPI----------------STAPPEAKEVKKPKKGQSPKV----- 158
Query: 1212 EDEDEEDESPKKKK-RKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE 1256
PK K +K K + SK K +
Sbjct: 159 ---------PKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQD 195
>gnl|CDD|165245 PHA02934, PHA02934, Hypothetical protein; Provisional.
Length = 253
Score = 30.0 bits (67), Expect = 7.3
Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 7/105 (6%)
Query: 507 KKTEDDEYKDAGEEA-TYYSIAHTYHEKVTEQASIMVNGKLKEYQIKGLE-WMVSLYNNN 564
K+ DD Y+ + +Y I + Y +KV + +N Y + E + V NN
Sbjct: 104 KQRLDDLYRQLPTKTRSYIDIINMYCDKVNNDYNRDMN---IMYDMASTESFTVYDINNE 160
Query: 565 LNGILADEMGLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSN 609
+N IL D GLG + TI+ +T L ++ + P I + LS+
Sbjct: 161 VNTILMDNKGLGVRLATISFITEL--GRRCMNPVETIKMFTLLSH 203
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside from
the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins, so
do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 30.4 bits (69), Expect = 7.4
Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 5/81 (6%)
Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
RRRRK ++ + ++++ ++K + P K EDD E+ R +
Sbjct: 306 PHRRRRKP---FEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQS--GPRGHPTPGND 360
Query: 1259 EEVEEAEGEEEEEDDKASKPR 1279
+E E EE + ++ P
Sbjct: 361 DEKEPDPQEEADGQGSSTDPA 381
>gnl|CDD|220844 pfam10675, DUF2489, Protein of unknown function (DUF2489). This is
a bacterial family of uncharacterized proteins.
Length = 131
Score = 29.1 bits (66), Expect = 7.4
Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 8/62 (12%)
Query: 389 YISNLTQMVKEHKEEQQKKMREEKKKKKKSIKQ--KILDGNM--DKLDLDESSQLEDLRI 444
Y L +K+ KE Q+K E K+ ++ I + +I+ M ++ +L E + +RI
Sbjct: 7 YAGYLLLKLKKQKELQEKAQAEAKQARRARILESIRIIARAMLQEQCELSEGA----IRI 62
Query: 445 TV 446
V
Sbjct: 63 KV 64
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic phosphoprotein
1 (DMP1) gene has been mapped to human chromosome 4q21.
DMP1 is a bone and teeth specific protein initially
identified from mineralised dentin. DMP1 is primarily
localised in the nuclear compartment of undifferentiated
osteoblasts. In the nucleus, DMP1 acts as a
transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates mineralised
matrix formation extracellularly, at later stages of
osteoblast maturation. The DMP1 gene has been found to be
ectopically expressed in lung cancer although the reason
for this is unknown.
Length = 514
Score = 30.4 bits (68), Expect = 7.5
Identities = 30/168 (17%), Positives = 63/168 (37%), Gaps = 10/168 (5%)
Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
E R +E++ + ++V S + QE + ++ E +EEV V++ R
Sbjct: 306 EHSRSESQEDSEENQSQEDSQEVQDPSSESSQE----ADLPSQENSSESQEEV--VSESR 359
Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
G S+ E+ + EE + + S E++ D + S ++P
Sbjct: 360 GDNPDNTTSHSEDQEDSESSEEDSLDTPSSSE----SQSTEEQADSESNESLSSSEESPE 415
Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
+ + ++ + S +E E + EE+D + +S
Sbjct: 416 STEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSRS 463
>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
Length = 663
Score = 30.6 bits (69), Expect = 7.7
Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 20/151 (13%)
Query: 1129 GSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETA 1188
G +Q + + L+D+E K +++ A++Y+E++E+ K + +
Sbjct: 528 GKKQEITIQSSGGLSDEEIEKMVKE--AEEYKEQDEKKKELVDAKN-------------- 571
Query: 1189 PEDEEEEKPVKKRRRRKKVQISDEDEDE-EDESPKKKKRKSTKTPA--KDKHKEDDEEKH 1245
E E V+K+ K +ISD D+DE + + K + S++ KDK K+ E
Sbjct: 572 -EAETLIYSVEKQLSDLKDKISDADKDELKQKITKLRSTLSSEDVDSIKDKTKQLQEASW 630
Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKAS 1276
+S++ K + + E EE ++K
Sbjct: 631 KISQQAYKQGNSDNQQSEQSTNSEESEEKND 661
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 30.2 bits (68), Expect = 7.7
Identities = 30/161 (18%), Positives = 64/161 (39%), Gaps = 5/161 (3%)
Query: 340 AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
+ E EKE + A+ +K + Q +++ + Q + + + Q
Sbjct: 36 DQSRILNTLEEFEKEANEKR-AQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELA 94
Query: 400 HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLED-LRITVKETNSDNKLRGE 458
++E K E +K + ++L+ +D+L + QL++ I K+ ++
Sbjct: 95 LQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERL 154
Query: 459 DAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEE 499
K+++E L E+ E + D E KE +E+
Sbjct: 155 KFENEKKLEESLELE---REKFEEQLHEANLDLEFKENEEQ 192
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 30.1 bits (68), Expect = 7.8
Identities = 23/160 (14%), Positives = 49/160 (30%), Gaps = 20/160 (12%)
Query: 293 LITVNSVEERILAAARYKLNMDEKEYHRNNTARTQKL---NKAIINYHINAEKEQKKEQE 349
LI + V + +L E+ Q L + +
Sbjct: 9 LILLLLVNLAFIRQLLLRLGRLEQMLGELAAVLEQLLLLLAFRAEAEQLRTFARSLQALN 68
Query: 350 RIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMR 409
+ + L A ++ + +Q + I +L Q+ E +E ++
Sbjct: 69 LELIQELNELKARLQQQLLQSREQLQLL------------IESLAQLSSE-FQELANEIF 115
Query: 410 EEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKET 449
EE ++ + Q+ L +L LE R +++
Sbjct: 116 EELNRRLAELNQQNL----KQLLKPLREVLEKFREQLEQR 151
>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
Length = 1713
Score = 30.6 bits (69), Expect = 8.1
Identities = 21/90 (23%), Positives = 35/90 (38%), Gaps = 16/90 (17%)
Query: 1194 EEKPVKKRRRRKKVQISDEDEDEED--ESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRK 1251
+E+ +R K +QI D+ +D+E K K+ K+ K K + K
Sbjct: 4 KERKSGSKRGLKVIQIGDDSDDDEPIGSLFKLKRPKN---SKKVKVGLESTGK------- 53
Query: 1252 KKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
EE+ E E +D AS ++
Sbjct: 54 ----REEKLSALDEDSEGMDDTLASFRKRL 79
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 30.2 bits (68), Expect = 8.2
Identities = 23/131 (17%), Positives = 49/131 (37%), Gaps = 14/131 (10%)
Query: 1159 YEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE------ 1212
E E + + + KK E + + E++ + KK+ I +E
Sbjct: 350 VFEFEIYLYKIINANPDKLENKKI---EISIKPEKKTINNSQIITTKKINIVEESNKNSV 406
Query: 1213 ---DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
+ KKK K +E E++ V K+ +K+ + +E ++ E ++
Sbjct: 407 HFDTLYKTKIFYHKKKINQNNKEQDIKKEELLEKEFV--KKSEKIPKNDELLDNLELAKQ 464
Query: 1270 EEDDKASKPRK 1280
+ +K + K
Sbjct: 465 KFFNKDIELSK 475
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in association
with pfam07719, pfam00515. There is a single completely
conserved residue L that may be functionally important.
NARP1 is the mammalian homologue of a yeast N-terminal
acetyltransferase that regulates entry into the G(0)
phase of the cell cycle.
Length = 516
Score = 30.3 bits (69), Expect = 8.4
Identities = 26/120 (21%), Positives = 48/120 (40%), Gaps = 20/120 (16%)
Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
+ E++G + E K K+R + KK +K E +++ + K+ +
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKE--EAEKAAAKKKAEAAAKKAKGP--- 449
Query: 1209 ISDEDEDEEDESPKKKKRKSTKTPAKDKHK---------EDDEEKHV----VSKRKKKVV 1255
D + + D P +K T+ P ++ K D+ E H+ V RKKK +
Sbjct: 450 --DGETKKVDPDPLGEKLARTEDPLEEAMKFLKPLLQLSPDNIETHLLAFEVYIRKKKYL 507
>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
Length = 1634
Score = 30.3 bits (68), Expect = 8.6
Identities = 20/106 (18%), Positives = 41/106 (38%), Gaps = 23/106 (21%)
Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEE--------------DESPKKK----KRKSTKT 1231
+EEE+ + + K +++ DE ++ K K R
Sbjct: 1476 NPDEEEEEAFREEKEKVFTYLNKELDEAGLKRVLRRCVIETYQKTDKSKVLLRLRVGNVG 1535
Query: 1232 PAKDKH-----KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
K K E D+ H++ + +V ++E++ E+ + E+E D
Sbjct: 1536 KGKKKETLATADEIDQAAHMLMDQVGQVPDDEDDDEDGDVLEKEVD 1581
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 30.4 bits (69), Expect = 8.8
Identities = 36/181 (19%), Positives = 81/181 (44%), Gaps = 14/181 (7%)
Query: 342 KEQKKEQERIEKE--RMRRLMAEDEEGYRKLIDQKKD-KRLAFLLSQTDEYISNLTQMVK 398
KE++KE E + +E + + E E KL + K+ + L + + ++ + +L +
Sbjct: 196 KEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255
Query: 399 --EHKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLR 456
E K + ++ EE KK+ + +++K+ + K +E +L + + + + R
Sbjct: 256 KLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKR 315
Query: 457 GEDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKD 516
G E +++ E++ EE+K+K +E K + E + + Y++
Sbjct: 316 LSR---------LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEE 366
Query: 517 A 517
A
Sbjct: 367 A 367
>gnl|CDD|176532 cd08590, PI-PLCc_Rv2075c_like, Catalytic domain of uncharacterized
Mycobacterium tuberculosis Rv2075c-like proteins. This
subfamily corresponds to the catalytic domain present in
uncharacterized Mycobacterium tuberculosis Rv2075c and
its homologs. Members in this family are more closely
related to the Streptomyces antibioticus
phosphatidylinositol-specific phospholipase
C1(SaPLC1)-like proteins rather than the typical
bacterial phosphatidylinositol-specific phospholipase C
(PI-PLC, EC 4.6.1.13), which participate in
Ca2+-independent PI metabolism, hydrolyzing the membrane
lipid phosphatidylinositol (PI) to produce
phosphorylated myo-inositol and diacylglycerol (DAG). In
contrast, SaPLC1-like proteins have two Ca2+-chelating
amino acid substitutions which convert them to
metal-dependent bacterial PI-PLC. Rv2075c and its
homologs have the same amino acid substitutions as well,
which might suggest they have metal-dependent PI-PLC
activity.
Length = 267
Score = 29.7 bits (67), Expect = 8.8
Identities = 13/31 (41%), Positives = 19/31 (61%), Gaps = 4/31 (12%)
Query: 464 KQIKEWLASHPGWEVV---MEDSEDEGSEDE 491
+I +WL ++P EVV +ED D G +DE
Sbjct: 95 NEIADWLNANPD-EVVILYLEDHGDGGKDDE 124
>gnl|CDD|178144 PLN02529, PLN02529, lysine-specific histone demethylase 1.
Length = 738
Score = 30.2 bits (68), Expect = 9.0
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 9/57 (15%)
Query: 1193 EEEKPVKKRRRRKKVQISDEDEDEEDESPKK------KKRKSTKTPAKDKHKEDDEE 1243
++ KR RKK + + DE+ DE +K KK+ TK +D KE + E
Sbjct: 5 GQDGTAPKRSLRKKAGLKNYDENLMDELIEKHLGGSFKKKNRTK---QDLEKETETE 58
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 29.9 bits (68), Expect = 9.2
Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 6/73 (8%)
Query: 402 EEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSD-NKLRGEDA 460
EE+ K++ EE +K KK +K K+ +L ++ QLE L T ++ + + +L E
Sbjct: 340 EEELKELEEELEKIKKLLK-KLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEEL- 397
Query: 461 PPAKQIKEWLASH 473
K++KE L S
Sbjct: 398 ---KELKEELESL 407
>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2. This family of
mediator complex subunit 2 proteins is conserved in
fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
with and phosphorylates Med2. Post-translational
modifications of Mediator subunits are important for
regulation of gene expression.
Length = 99
Score = 28.2 bits (63), Expect = 9.5
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 379 LAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKK 414
L+ T Y+ L + K+ +EE++KK EE+KK
Sbjct: 64 TESKLNDTKWYVDTLVENKKKQEEEERKKKEEEEKK 99
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 30.1 bits (67), Expect = 9.6
Identities = 29/161 (18%), Positives = 65/161 (40%), Gaps = 10/161 (6%)
Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
+ + S Y + ++++ + Q +++ D + D +DD E
Sbjct: 353 DGIRDKDFSAYTASKLSDEDD---DSVMESKMQKLFSEKEIDFGLNSELVDM-SDDGENG 408
Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPK 1222
E E + S + +E T + + + + +R+ +K + + +D+ D K
Sbjct: 409 EMEDTFTSHLPASNESESDDKLETTIEKLDRKLR---ERQENRKERQLKKTKDDSDVDLK 465
Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE 1263
KK K K KH E+ SK + ++++ ++E +E
Sbjct: 466 DKKESINKKNKKGKHA---IERTAASKEELELIKADDEDDE 503
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 30.1 bits (68), Expect = 9.7
Identities = 17/53 (32%), Positives = 27/53 (50%)
Query: 1225 KRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
K+K TK ++ D + + ++EEE +E E EEE+ED+ SK
Sbjct: 358 KQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK 410
>gnl|CDD|178164 PLN02549, PLN02549, asparagine synthase (glutamine-hydrolyzing).
Length = 578
Score = 30.1 bits (68), Expect = 9.7
Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
Query: 324 ARTQKLNKAIINYHINAEKEQK---KEQERIEKERMRRLMAEDEEGY 367
AR L+K I+ ++ + E K + RIEK +R+ ++E+ Y
Sbjct: 394 ARVPFLDKEFIDVAMSIDPEWKMIRPGEGRIEKWVLRKAFDDEEDPY 440
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.132 0.374
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 66,791,967
Number of extensions: 6921755
Number of successful extensions: 16399
Number of sequences better than 10.0: 1
Number of HSP's gapped: 12495
Number of HSP's successfully gapped: 1122
Length of query: 1281
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1173
Effective length of database: 6,147,370
Effective search space: 7210865010
Effective search space used: 7210865010
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 65 (28.7 bits)