RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16118
(1070 letters)
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 251 bits (644), Expect = 5e-69
Identities = 224/1074 (20%), Positives = 482/1074 (44%), Gaps = 154/1074 (14%)
Query: 36 GSGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAK 95
G K+ E + ++ + EE + + + ++ + + ++AE+YQ ++ E+
Sbjct: 166 GVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELREL 225
Query: 96 EVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAK 155
E+ L KL +++ELE+EL + + E+E+++ E+AE + E K E L EL +
Sbjct: 226 ELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285
Query: 156 VDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQ 215
+ +E+ E+ EI + + +ER+ ++ +L ++ L E+++ EA +++ + ET
Sbjct: 286 LQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345
Query: 216 LADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLV 275
L ++ + AE E E +++KL++ + L
Sbjct: 346 LEELEQLLAELE------------------------------EAKEELEEKLSALLEELE 375
Query: 276 EVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKR 335
E L +LA++ AE +
Sbjct: 376 E----------LFEALREELAELEAELAEIRNE--------------------------- 398
Query: 336 AGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQN 395
L+ + RE + +++L+ + + E+K+ E+EE Q +++L + + +
Sbjct: 399 -------LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEEL 451
Query: 396 EASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVE 455
E L++ + +EL ++ + +Q L+KEL + L + ++ + ++E
Sbjct: 452 EEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVR---AVLE 508
Query: 456 NFKKAYSGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQYLKDHQLD 515
+ GVY + + V ++Y A+ LG ++A+VV++E+ A+ I++LK+++
Sbjct: 509 ALESGLPGVYGPVAELIK-VKEKYETALEAALGNRLQAVVVENEEVAKKAIEFLKENKAG 567
Query: 516 PETFLPIDYLQTKPLKERLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNNALVCETPE 575
TFLP+D ++ L++ P + L D++ + P+ + V F + LV + E
Sbjct: 568 RATFLPLDRIKPLR---SLKSDAAPGFLGLASDLIDFDPK-YEPAVRFVLGDTLVVDDLE 623
Query: 576 DAMKVAYDIEPQHRYDAVALDGTFYQKSGIMSGG-----SLDLARKAKRWDDKEMGNLKA 630
A ++A + + +Y V LDG + SG ++GG S ++ + ++E+ L+A
Sbjct: 624 QARRLAR--KLRIKYRIVTLDGDLVEPSGSITGGSRNKRSSLAQKRELKELEEELAELEA 681
Query: 631 QKEKLSEELREAMKKSRK-ESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDA 689
Q EKL EEL+ + R E L ++ ++ LE L+ K ++A LE E++
Sbjct: 682 QLEKLEEELKSLKNELRSLEDLLEELRRQLEELER-------QLEELKRELAALEEELEQ 734
Query: 690 LNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAE 749
L +R + E +++ +E + + + +EE+ S+E+ + + + + RQ + E
Sbjct: 735 LQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEE 794
Query: 750 LRSQQE---------RQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAE 800
L +E + + ++ R E+ + + EEE+ + ++L E+
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854
Query: 801 ADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSER 860
+LE ++ + + +++E+ + E + ++++ + L+ ++E +
Sbjct: 855 EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERL 914
Query: 861 HDILMNCKMNDIVLPMLRVQKYD-------RKLAKSIQEMTSRLQTIQAPNLRAMEKLEH 913
++ + ++ LP L + + +L + I+ + ++ + NLRA+E+ E
Sbjct: 915 EELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEE 974
Query: 914 AKENLMKTNEEFENARKRAKKAKANFDRIKKERYDKFTRCFEHVS-------NEIDGAGS 966
+E + + E+ + +K + + KE+ ++F F+ ++ E+ G G+
Sbjct: 975 VEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFKETFDKINENFSEIFKELFGGGT 1034
Query: 967 ESVLPRPFLGPENPEEPLTYRV----------------------STTIVS-----HRYHP 999
L P++PLT + S T ++ +Y P
Sbjct: 1035 AE------LELTEPDDPLTAGIEISARPPGKKLQSLSLLSGGEKSLTALALLFAIQKYRP 1088
Query: 1000 APFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
APF+VLDE+DAALD+ N+ +VA I ++++ Q IVI+ ++ AD LVG+
Sbjct: 1089 APFYVLDEVDAALDDANVERVARLIKEMSKET-QFIVITHRKGTMEAADRLVGV 1141
Score = 40.5 bits (95), Expect = 0.005
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 8 IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
IE+ FKS+ I FTA++GPNGSG
Sbjct: 6 IELKGFKSFADPTEINFSPGFTAIVGPNGSG 36
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and C-terminal
domains of this protein are well conserved, but the
central hinge region is skewed in composition and highly
divergent [Cellular processes, Cell division, DNA
metabolism, Chromosome-associated proteins].
Length = 1164
Score = 181 bits (461), Expect = 2e-46
Identities = 205/1062 (19%), Positives = 433/1062 (40%), Gaps = 145/1062 (13%)
Query: 83 EKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREK 142
K ++ EE+ +EVE + +L + I E +L++ + E EK ER + +L+EK
Sbjct: 170 RKKEKALEEL--EEVEENIERL---DLIIDEKRQQLERLRREREKAERYQA----LLKEK 220
Query: 143 KK-EQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQA 201
++ E L +E ++++ ++ ++ L K E +S ++K+L ++ L E+ +
Sbjct: 221 REYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKK 280
Query: 202 NEAHN--------KDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ-DVEINKKRPS 252
+ + I +LE ++A + + AE ER+ D A+ + EI+K
Sbjct: 281 IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKEREL---EDAEERLAKLEAEIDKLLAE 337
Query: 253 LIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPG 312
+ + + + +K+ + E+++ E ++ +++ + A+ R +Y
Sbjct: 338 IEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE----- 392
Query: 313 RDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIK 372
K E KR +++ + RE Q++L + I
Sbjct: 393 ----------------KLEKLKR------EINELKRELDRLQEELQRLSEELADLNAAIA 430
Query: 373 KKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIE 432
++ E ++ + I++ E L+ + ++ K ++KEL ++
Sbjct: 431 GIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQR 490
Query: 433 ELGDAKTDKHEDTRRKKKQELVEN-FKKAYSGVYDRMINMCHPVHKRYNVAITKVLGKYM 491
EL +A+ R + VE K + GV+ + V +RY AI G +
Sbjct: 491 ELAEAEAQARASEERVRGGRAVEEVLKASIQGVHG-TVAQLGSVGERYATAIEVAAGNRL 549
Query: 492 EAIVVDSEKTARLCIQYLKDHQLDPETFLPIDYLQTKPLKERLRNIRDPKNVKLLYDVLK 551
+VV+ + A+ I+ LK + TFLP++ ++ + L + + + D+++
Sbjct: 550 NNVVVEDDAVAKEAIELLKRRKAGRATFLPLNKMRDERRD--LSILSEDGVIGFAVDLVE 607
Query: 552 YQPEDIKRVVLFATNNALVCETPEDAMKVAYDIEPQHRYDAVALDGTFYQKSGIMSGGS- 610
+ P+ + + + LV E E A ++ +Y V L+G ++KSG M+GGS
Sbjct: 608 FDPK-YEPAFKYVFGDTLVVEDIEAARRLM------GKYRMVTLEGELFEKSGAMTGGSR 660
Query: 611 -----------------------------LDLARKAKRWDDKEMGNLKAQKEKLSEELRE 641
L + R + + L + S ++ E
Sbjct: 661 APRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGE 720
Query: 642 AMKK-SRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPK 700
K+ + E E ++ ++ LE L+ Q+++N KS++ +LEA I+ L E
Sbjct: 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEA 780
Query: 701 IKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQERQKIC 760
+ +EA + I + E++ +E+ V R + +R+ E+ R E++ +
Sbjct: 781 LNDLEARL--SHSRIPEIQAELSKLEEEVSRI------EARLREIEQKLNRLTLEKEYLE 832
Query: 761 QDKDTKKN-VARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVD 819
++ + + + E+E+ G +E+L E+ L ++ + K+ D
Sbjct: 833 KEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERD 892
Query: 820 AMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMNDIVLPMLRV 879
++ ++ + R++ + I+ +K L++KLE + E +I ++ +
Sbjct: 893 ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELS 952
Query: 880 QKYDRKLAKSIQEMTSRLQTIQAPNLRAMEKLEHAKE---NLMKTNEEFENARKRAKKAK 936
+ +Q + ++ ++ N+ A+++ E + L + + E RK +
Sbjct: 953 L---EDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERI 1009
Query: 937 ANFDRIKK----ERYDKFTRCFEHVSNEIDGAGSESVLPRPFLGPENPEEPLTYRV---- 988
+++ K+ E ++ F + E+ G E +L ENP++P +
Sbjct: 1010 EEYEKKKREVFMEAFEAINENFNEIFAELSGGTGELIL-------ENPDDPFAGGLELSA 1062
Query: 989 ------------------STTIVS-----HRYHPAPFFVLDEIDAALDNTNIGKVASYIV 1025
S T +S RY P+PF+ DE+D LD N+ +VA I
Sbjct: 1063 KPKGKPVQRLEAMSGGEKSLTALSFIFAIQRYKPSPFYAFDEVDMFLDGVNVERVAKLIR 1122
Query: 1026 TKTQDSLQTIVISLKEEFFSHADSLVGICPGSVTISSICFGH 1067
K ++ Q IV+SL+ +AD +G+ S +
Sbjct: 1123 EKAGEA-QFIVVSLRSPMIEYADRAIGVTMRRNGESQVFGLK 1163
Score = 52.0 bits (125), Expect = 2e-06
Identities = 51/240 (21%), Positives = 115/240 (47%), Gaps = 8/240 (3%)
Query: 38 GALKDDYERLKAEMIQAE-EETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKE 96
L++ L+A + + E K + V+ + E ++ + +E + KE
Sbjct: 775 HKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKE 834
Query: 97 VEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKV 156
++ + + IK +E E++ G+ E++E E+ E LR+ + G L +E ++
Sbjct: 835 IQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDEL 894
Query: 157 DQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQL 216
+ ++RE++ +I + + K ++R+S ++ KL + ++ L E+ + + +++I + E L
Sbjct: 895 EAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI-EDPKGEDEEIPEEELSL 953
Query: 217 ADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 276
DV+ E + +N+ + Q+ E KR +K K + KL +K+++E
Sbjct: 954 EDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEK------RAKLEEERKAILE 1007
Score = 49.7 bits (119), Expect = 8e-06
Identities = 38/196 (19%), Positives = 93/196 (47%), Gaps = 11/196 (5%)
Query: 40 LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
L+ + ++L AE+ + E E ++ + E E K E+ + +R E+ + E
Sbjct: 327 LEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELK------EELEDLRAELEEVDKEF 380
Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
+ K+ ++L+K K E+ +++R ++ + L+ +E LN +A ++ +
Sbjct: 381 AETRDEL-----KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAK 435
Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 219
I E++ E K + K + ++ + L+ ++ L ++++ + K+++ L+ +LA+
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEA 495
Query: 220 RKRKAEYERQSIPGRD 235
+ E + GR
Sbjct: 496 EAQARASEERVRGGRA 511
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 170 bits (433), Expect = 5e-43
Identities = 193/1039 (18%), Positives = 446/1039 (42%), Gaps = 142/1039 (13%)
Query: 103 KLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQG------------ALN 150
KL ++ LED L++ + +++ +ER+ EKAE +E K E L
Sbjct: 180 KLERTRENLDRLEDILNELERQLKSLERQAEKAE-RYKELKAELRELELALLVLRLEELR 238
Query: 151 RELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIA 210
EL ++ +E++E + E+ + L + +E++ ++ +++ ++ + E+++ A +I+
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298
Query: 211 DLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASA 270
LE Q +R+R A ERQ + ++ + L + E ++ +++KL
Sbjct: 299 RLEQQKQILRERLANLERQ---------LEELEAQLEELESKLDELAEELAELEEKLEEL 349
Query: 271 KKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKA 330
K+ L + E ++ +LE++L ++ + + E ++ L+
Sbjct: 350 KEELESLEAELEELEAELEELESRLEELEE---QLETLR-------------SKVAQLEL 393
Query: 331 EATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR--IDKL 388
+ +I + + R + +++L E+ + ++ E + K + E + +++L
Sbjct: 394 QIASLNNEIERLEARLER-LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEEL 452
Query: 389 EDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRK 448
++ + + E +L++ ++ EE + +++ + +LQ L+ +E L + E +
Sbjct: 453 QEELERLEEALEELREELEEAEQALDAAERELAQLQARLD-SLERLQENLEGFSEGVKAL 511
Query: 449 KKQELVENFKKAYSGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQY 508
K + SG+ + + V + Y AI LG ++A+VV++ A+ I +
Sbjct: 512 LKN------QSGLSGILGVLSELIS-VDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAF 564
Query: 509 LKDHQLDPETFLPIDYLQ-TKPLKERLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNN 567
LK ++L TFLP+D ++ T+ +++ + + L +++ + +
Sbjct: 565 LKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGG 624
Query: 568 ALVCETPEDAMKVAYDIEPQHRYDAVALDGTFYQKSGIMSGGSLD-----LARKAKRWDD 622
LV + ++A+++A + + Y V LDG + G+++GGS L R+
Sbjct: 625 VLVVDDLDNALELA--KKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERR------ 676
Query: 623 KEMGNLKAQKEKLSEELREAMKK-SRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIA 681
+E+ L+ + E+L E++ E K + EL ++ ++ L L + + + +A
Sbjct: 677 REIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLA 736
Query: 682 KLEAEIDALNARADATEPKIKAIEASMTARGD-------TISRKKEEMNSVEDIVFRDFC 734
+LEAE++ L R ++ +EA + + ++ + E+ +E + +
Sbjct: 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL-- 794
Query: 735 KSIGVSTIRQYEEAELRSQQERQKICQD----KDTKKNVARWERAVSDDEEELARAQGAE 790
K + +E ++ + ++ +A ER + D EE++
Sbjct: 795 KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854
Query: 791 EKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLE 850
E LA E+ + +E + + +++E + R E+ ++++++ + L
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914
Query: 851 SKLEMKKSERHDILMNCK--------------------MNDIVLPMLRVQKYDRKLAKSI 890
+LE + + + + + + + +++ + + + +
Sbjct: 915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRL 974
Query: 891 QEMTSRLQTIQAPNLRAMEKLEHAKE---NLMKTNEEFENARKRAKKAKANFDRIKKERY 947
+ + ++++ + NL A+E+ E KE L E+ A++ ++A DR +ER
Sbjct: 975 KRLENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREARER- 1033
Query: 948 DKFTRCFEHVSNEIDGAGSESVLPRPFLGPE------NPEEPLTYRV------------- 988
F F+ V+ V P+ F G E +PE+ L +
Sbjct: 1034 --FKDTFDQVNENFQR-----VFPKLFGGGEAELRLTDPEDLLEAGIEIFAQPPGKKNQN 1086
Query: 989 ---------STTIVS-----HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQT 1034
+ T ++ + PAPF +LDE+DA LD+ N+ + A+ + ++++ Q
Sbjct: 1087 LSLLSGGEKALTALALLFAIFKVKPAPFCILDEVDAPLDDANVERFANLLKEFSKNT-QF 1145
Query: 1035 IVISLKEEFFSHADSLVGI 1053
IVI+ + AD L G+
Sbjct: 1146 IVITHNKGTMEVADQLYGV 1164
Score = 55.1 bits (133), Expect = 2e-07
Identities = 71/425 (16%), Positives = 178/425 (41%), Gaps = 65/425 (15%)
Query: 39 ALKDDYERLKAEMIQAEEETN--MSYLKKKGVVAERKEAKIEKDEAEKYQRIREEI--VA 94
L++ L+ + + +E L++ E +I + R+ E+ +
Sbjct: 688 ELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS-ALRKDLARLEAEVEQLE 746
Query: 95 KEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELA 154
+ + +L E +I+ELE+ L++ + E+ + E E+ E + + K+E AL L
Sbjct: 747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD 806
Query: 155 KVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET 214
++ E+ ++ E R ER+ +++++A+ ++ L ++ + E ++DI L
Sbjct: 807 ELRAELTLLNEEAANLR-------ERLESLERRIAATERRLEDLEEQIEELSEDIESLAA 859
Query: 215 QLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSL 274
++ ++ + E E + L + +++ LA + L
Sbjct: 860 EIEELEELIEELESE-----------------------LEALLNERASLEEALALLRSEL 896
Query: 275 VEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATK 334
E+ + ++L +L ++R++ A+ E + LE ++
Sbjct: 897 EELSEELRELESKRSELRRELEELREKLAQLELR--------LEGLEV------------ 936
Query: 335 RAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ 394
+I + ++ E L+ + + +++ ++ R ++ + +I +L
Sbjct: 937 ---RIDNLQERLSEEY---SLTLEEAEALENKIEDDEEEARRRLKRLENKIKELG---PV 987
Query: 395 NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELV 454
N A++++ ++LKE + + + K + E ++ LE+ IEE+ ++ +DT + +
Sbjct: 988 NLAAIEEYEELKERYD-FLTAQKEDLTEAKETLEEAIEEIDREARERFKDTFDQVNENFQ 1046
Query: 455 ENFKK 459
F K
Sbjct: 1047 RVFPK 1051
Score = 47.0 bits (112), Expect = 6e-05
Identities = 46/246 (18%), Positives = 111/246 (45%), Gaps = 32/246 (13%)
Query: 220 RKRKAEYERQSIPGRDINLESAQDV--EINKKRPSLIKSKERVSHIQKKLASAKKSL-VE 276
++R+ E ER+ + NL+ +D+ E+ ++ SL Q + A K L E
Sbjct: 171 KERRKETERK-LERTRENLDRLEDILNELERQLKSL--------ERQAEKAERYKELKAE 221
Query: 277 VRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRA 336
+R+ +L + + + + E E ++ E E T
Sbjct: 222 LREL---------ELALLVLRLEELREELEELQ----------EELKEAEEELEELTAEL 262
Query: 337 GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNE 396
++ ++L+ + E +++++ ++ NEI + + + ++R+ LE + + E
Sbjct: 263 QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELE 322
Query: 397 ASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE-DTRRKKKQELVE 455
A L++ + +EL ++ + +++EL++ELE + EL + + + E ++R ++ +E +E
Sbjct: 323 AQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLE 382
Query: 456 NFKKAY 461
+
Sbjct: 383 TLRSKV 388
Score = 35.8 bits (83), Expect = 0.17
Identities = 33/199 (16%), Positives = 83/199 (41%), Gaps = 22/199 (11%)
Query: 50 EMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNE- 108
++++ L++ E E+K+++ E + E++ + E + + E
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEEL-AELEEKLEELKEELESLEAELEEL 363
Query: 109 -TDIKELED-------ELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEI 160
+++ELE +L+ + +V ++E + N + + L ++ QEI
Sbjct: 364 EAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEI 423
Query: 161 R------------EMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 208
E+ E+ + L + +E + +++ L ++ L E QA +A ++
Sbjct: 424 EELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERE 483
Query: 209 IADLETQLADVRKRKAEYE 227
+A L+ +L + + + E
Sbjct: 484 LAQLQARLDSLERLQENLE 502
Score = 30.4 bits (69), Expect = 7.0
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 5 LQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
L+ +E+ FKS+ +I K T ++GPNG G
Sbjct: 2 LKKLELAGFKSFADPTTINFDKGITGIVGPNGCG 35
>gnl|CDD|213242 cd03275, ABC_SMC1_euk, ATP-binding cassette domain of eukaryotic SMC1
proteins. The structural maintenance of chromosomes
(SMC) proteins are large (approximately 110 to 170 kDa),
and each is arranged into five recognizable domains.
Amino-acid sequence homology of SMC proteins between
species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code), which
by mutational studies has been shown to be essential in
several proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B' motif,
and a motif with homology to the signature sequence of
the ATP-binding cassette (ABC) family of ATPases. The
sequence homology within the carboxy-terminal domain is
relatively high within the SMC1-SMC4 group, whereas SMC5
and SMC6 show some divergence in both of these sequences.
In eukaryotic cells, the proteins are found as
heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
and SMC5 with SMC6 (formerly known as Rad18).
Length = 247
Score = 126 bits (319), Expect = 2e-32
Identities = 43/61 (70%), Positives = 49/61 (80%)
Query: 995 HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGIC 1054
H Y PAPFFVLDE+DAALDNTN+GKVASYI + + Q IVISLKEEFFS AD+LVG+
Sbjct: 173 HSYQPAPFFVLDEVDAALDNTNVGKVASYIREQAGPNFQFIVISLKEEFFSKADALVGVY 232
Query: 1055 P 1055
Sbjct: 233 R 233
Score = 66.4 bits (163), Expect = 6e-12
Identities = 21/34 (61%), Positives = 28/34 (82%)
Query: 5 LQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
L+ +E++NFKSYKG+ IGP +FT +IGPNGSG
Sbjct: 1 LKRLELENFKSYKGRHVIGPFDRFTCIIGPNGSG 34
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 130 bits (327), Expect = 2e-30
Identities = 154/994 (15%), Positives = 360/994 (36%), Gaps = 70/994 (7%)
Query: 121 KKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKER 180
K+ + E++++ E+ EN+ + +EL +Q + ++ K++ L +
Sbjct: 169 KRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLL 228
Query: 181 VSHIQKKLASAKKSLVEVRQANEAHN-KDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
K L E+ + + +LE + + + E + + + E
Sbjct: 229 YLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEE 288
Query: 240 SAQ-DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 298
E + + L+K + R ++KL ++K L ++ + + ++I +LE +L ++
Sbjct: 289 LKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKEL 348
Query: 299 RKRKAEYERQ-----SIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGD 353
++ E + + + LE + + + A ++L+ N E+K
Sbjct: 349 EIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEA 408
Query: 354 QDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDV 413
+ L+ +++ + E K++ +EE ++ ++ + + + + L+ + ++
Sbjct: 409 KLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLEL 468
Query: 414 GSSKNRVQELQKELEQVIEELGDAKTDKHEDT-RRKKKQELVENFKKAYSGVYDRMINMC 472
S++ ++E + EL + E + + K +E + I
Sbjct: 469 KKSEDLLKETKLVKLLEQLELLLLRQKLEEASQKESKAREGLAVLLALIKDGVGLRIISA 528
Query: 473 HPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQYLKDHQLDPETFLPIDYLQTKPLKE 532
VA+ A++V+ A + K + E LP
Sbjct: 529 AGRGGDLGVAVANYKVAISVAVIVEVSAVADEVDERQKLVRALTELPLPARTFVLLDPLL 588
Query: 533 RLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNNALVCETPEDAMKVAYDIEPQHRYDA 592
+L L Q A V E ++ +E A
Sbjct: 589 KLPLKSIAVLEIDPALNL-AQLNKATLEADEDDKRAKVVEGILKDTELTKLLE-----SA 642
Query: 593 VALDGTFYQKSGIMSGGSLDLARKAKRWDDKEMGNLKAQKEKLSEELREAMKKSRKESEL 652
A + + + G + KA + + + + ++ +E + R++ E+
Sbjct: 643 KAKESGLRKGVSLEEGLAEKSELKASLSELTKELLAEQELQEKAESELAKNEILRRQEEI 702
Query: 653 NTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARG 712
+ IK +L +++L K Q A+ + + E + + ++ +
Sbjct: 703 KKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEE 762
Query: 713 DTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARW 772
+ + + + E + K+ + + EE ++E + + ++ + +
Sbjct: 763 EEEEKSELSLKEKELAEEEE--KTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEE 820
Query: 773 ERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREV 832
E+ + + EE++ + E L + + +KL RL ++ + + +E+ E+
Sbjct: 821 EQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEEL 880
Query: 833 GSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMNDIVLPMLR----VQKYDRKLAK 888
+ K E K E+++ + D L+ K N+I + + KY+ + +
Sbjct: 881 EEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEE 940
Query: 889 S--------IQEMTSRLQTIQ-----------APNLRAMEKLEHAKENLMKTNEEFENAR 929
+E ++ + + N+ M E ++ +E + R
Sbjct: 941 LLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKER 1000
Query: 930 KRAKKAKANFDRIKKERYDKFTRCFEHVSNEID---GAGSESVLPRPFLGPENPEEPLTY 986
+K + + I++ VS + L E+ ++P +
Sbjct: 1001 LEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKVFFYLELGGSAELRLEDSDDPFSG 1060
Query: 987 RVSTTIVS---------------------------HRYHPAPFFVLDEIDAALDNTNIGK 1019
+ + +Y PAPF++LDEIDAALD+ N+ +
Sbjct: 1061 GIEISARPPGKGVKNLDNLSGGEKTLVALALIFAIQKYRPAPFYLLDEIDAALDDQNVSR 1120
Query: 1020 VASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
VA+Y+ ++++ Q IVISL+EE AD LVG+
Sbjct: 1121 VANYLKELSKNA-QFIVISLREEMLEKADRLVGV 1153
Score = 35.3 bits (81), Expect = 0.24
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 8 IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
IE++ FKSY + FTA++GPNGSG
Sbjct: 5 IEIEGFKSYAKTVILPFSPGFTAIVGPNGSG 35
>gnl|CDD|214944 smart00968, SMC_hinge, SMC proteins Flexible Hinge Domain. This
entry represents the hinge region of the SMC (Structural
Maintenance of Chromosomes) family of proteins. The
hinge region is responsible for formation of the DNA
interacting dimer. It is also possible that the precise
structure of it is an essential determinant of the
specificity of the DNA-protein interaction.
Length = 120
Score = 98.1 bits (245), Expect = 4e-24
Identities = 37/122 (30%), Positives = 68/122 (55%), Gaps = 5/122 (4%)
Query: 462 SGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQYLKDHQLDPETFLP 521
GV R+ ++ V +Y A+ LG ++A+VVD+E+TA+ I++LK ++L TFLP
Sbjct: 1 PGVLGRVADLIS-VDPKYETALEAALGGRLQAVVVDTEETAKKAIEFLKKNRLGRATFLP 59
Query: 522 IDYLQTKPL---KERLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNNALVCETPEDAM 578
+D ++ + K R + +P V D+++Y PE ++ + + N LV + E A
Sbjct: 60 LDKIKPRSPAGSKLREALLPEPGFVGPAIDLVEYDPE-LRPALEYLLGNTLVVDDLETAR 118
Query: 579 KV 580
++
Sbjct: 119 RL 120
>gnl|CDD|219044 pfam06470, SMC_hinge, SMC proteins Flexible Hinge Domain. This
family represents the hinge region of the SMC
(Structural Maintenance of Chromosomes) family of
proteins. The hinge region is responsible for formation
of the DNA interacting dimer. It is also possible that
the precise structure of it is an essential determinant
of the specificity of the DNA-protein interaction.
Length = 117
Score = 96.6 bits (241), Expect = 1e-23
Identities = 39/119 (32%), Positives = 70/119 (58%), Gaps = 3/119 (2%)
Query: 462 SGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQYLKDHQLDPETFLP 521
GVY R+ ++ V + Y A+ LG ++AIVVD E+TA+ I++LK + L TFLP
Sbjct: 2 KGVYGRLADLIE-VDEGYEKAVEAALGGRLQAIVVDDEETAKAAIEFLKKNNLGRATFLP 60
Query: 522 IDYLQTKPLKERLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNNALVCETPEDAMKV 580
+D ++ +PL + R I P L D+++Y E +++ + + N LV + ++A+++
Sbjct: 61 LDRIKPRPLDDSPREI--PGGATPLIDLVEYPDEKLRKALRYLLGNTLVVDDLDEALEL 117
>gnl|CDD|213241 cd03274, ABC_SMC4_euk, ATP-binding cassette domain of eukaryotic SMC4
proteins. The structural maintenance of chromosomes
(SMC) proteins are large (approximately 110 to 170 kDa),
and each is arranged into five recognizable domains.
Amino-acid sequence homology of SMC proteins between
species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code), which
by mutational studies has been shown to be essential in
several proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B' motif,
and a motif with homology to the signature sequence of
the ATP-binding cassette (ABC) family of ATPases. The
sequence homology within the carboxy-terminal domain is
relatively high within the SMC1-SMC4 group, whereas SMC5
and SMC6 show some divergence in both of these sequences.
In eukaryotic cells, the proteins are found as
heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
and SMC5 with SMC6 (formerly known as Rad18).
Length = 212
Score = 73.1 bits (180), Expect = 2e-14
Identities = 32/59 (54%), Positives = 41/59 (69%), Gaps = 1/59 (1%)
Query: 995 HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
H Y P P +V+DEIDAALD N+ VA+YI +T+++ Q IVISL+ F AD LVGI
Sbjct: 145 HHYKPTPLYVMDEIDAALDFRNVSIVANYIKERTKNA-QFIVISLRNNMFELADRLVGI 202
Score = 55.0 bits (133), Expect = 3e-08
Identities = 19/36 (52%), Positives = 27/36 (75%), Gaps = 1/36 (2%)
Query: 4 ILQYIEVDNFKSYKGKFSIGPL-KKFTAVIGPNGSG 38
I+ + ++NFKSY G+ IGP K F+A++GPNGSG
Sbjct: 2 IITKLVLENFKSYAGEQVIGPFHKSFSAIVGPNGSG 37
>gnl|CDD|213206 cd03239, ABC_SMC_head, The SMC head domain belongs to the ATP-binding
cassette superfamily. The structural maintenance of
chromosomes (SMC) proteins are essential for successful
chromosome transmission during replication and
segregation of the genome in all organisms. SMCs are
generally present as single proteins in bacteria, and as
at least six distinct proteins in eukaryotes. The
proteins range in size from approximately 110 to 170 kDa,
and each has five distinct domains: amino- and
carboxy-terminal globular domains, which contain
sequences characteristic of ATPases, two coiled-coil
regions separating the terminal domains , and a central
flexible hinge. SMC proteins function together with other
proteins in a range of chromosomal transactions,
including chromosome condensation, sister-chromatid
cohesion, recombination, DNA repair, and epigenetic
silencing of gene expression.
Length = 178
Score = 66.9 bits (164), Expect = 1e-12
Identities = 29/59 (49%), Positives = 40/59 (67%)
Query: 995 HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
P+PF+VLDEIDAALD TN +V+ I + + Q IVI+LK+E F +AD L+G+
Sbjct: 112 QEIKPSPFYVLDEIDAALDPTNRRRVSDMIKEMAKHTSQFIVITLKKEMFENADKLIGV 170
Score = 40.8 bits (96), Expect = 0.001
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 8 IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
I + NFKSY+ + +G F A++GPNGSG
Sbjct: 4 ITLKNFKSYRDETVVGGSNSFNAIVGPNGSG 34
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 71.3 bits (174), Expect = 3e-12
Identities = 115/483 (23%), Positives = 205/483 (42%), Gaps = 31/483 (6%)
Query: 41 KDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQ 100
K D + KAE + E + + AE E K E E +K + ++ AK+ +
Sbjct: 1330 KADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
Query: 101 LFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEI 160
K + K+ +E KK E++K K+KA+ +KK E+ E K +E
Sbjct: 1390 KKK----ADEAKKKAEEDKKKADELKKAAAAKKKADEA--KKKAEEKKKADEAKKKAEEA 1443
Query: 161 REMDVEINKKRPSLIKSKE--RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 218
++ D E KK K++E + + KK AKK E ++A+EA K A+ + AD
Sbjct: 1444 KKAD-EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK--AEEAKKKAD 1500
Query: 219 VRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
K+ AE ++++ + E A+ + KK K+ E +KK A K E++
Sbjct: 1501 EAKKAAEAKKKADEAK--KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK 1558
Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGR------------DINLESAQMTEYT 326
+A E + A + ++ RKAE +++ R + E A+ E
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
Query: 327 NLKAEATKRA---GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQK 383
+KAE K+A K ++QL E+K ++L + E KK E ++ +
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678
Query: 384 RIDKLEDHIRQNEASLK---DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTD 440
K E+ ++ +LK + K EEL K + +EL+K E+ + +AK +
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738
Query: 441 KHEDTRRKKKQELVENFKKAYSGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEK 500
ED ++ ++ + E KK + + + K I + L + E ++ +K
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798
Query: 501 TAR 503
+
Sbjct: 1799 KIK 1801
Score = 63.6 bits (154), Expect = 5e-10
Identities = 94/447 (21%), Positives = 193/447 (43%), Gaps = 27/447 (6%)
Query: 40 LKDDYERLKAEMIQ--AEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEV 97
K + + KA+ + AEE+ KKK ++K +++K A K + +E K
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA--DEAKKKAE 1428
Query: 98 EHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILR----EKKKEQGALNREL 153
E + + + + DE KK E +K E K+KAE + +KK E+ E
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA 1488
Query: 154 AKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANE---AHNKDIA 210
K +E ++ E K + K+ E + K A K E ++A+E A K A
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA 1548
Query: 211 DLETQLADVRK----RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKK 266
D + +++K +KAE +++ +++ L A++ + ++ K + K
Sbjct: 1549 DELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608
Query: 267 LASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYT 326
AKK+ +A E + + + ++ ++K++AE ++++ + E+
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAE--EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
Query: 327 NLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRID 386
KAE K+ + ++ + ++ K E ++ + + + +++ + EE +K +
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEE 1726
Query: 387 KLEDHIRQNEASLKDNKKLKEELNSDVGSSK----------NRVQELQKELEQVIEELGD 436
+ + + + +++KK EE D K + +E++KE E VIEE D
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786
Query: 437 AKTDKHEDTRRKKKQELVENFKKAYSG 463
+ +K KK +++ +NF G
Sbjct: 1787 EEDEKRRMEVDKKIKDIFDNFANIIEG 1813
Score = 61.3 bits (148), Expect = 3e-09
Identities = 95/422 (22%), Positives = 180/422 (42%), Gaps = 26/422 (6%)
Query: 39 ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIRE--EIVAKE 96
A+K + R E+ +AEE+ KK + EAK + +EA+K ++ E K+
Sbjct: 1271 AIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKK 1330
Query: 97 VEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNR--ELA 154
+ K + + + E + E E E + E AE E KK+ A + E
Sbjct: 1331 ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390
Query: 155 KVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET 214
K E ++ E KK L K+ +KK AKK E ++A+EA K E
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAA----KKKADEAKKKAEEKKKADEAKKK---AEEA 1443
Query: 215 QLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSL 274
+ AD K+KAE +++ + E+ + E KK + ++ +KK AKK
Sbjct: 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK----AEEAKKADEAKKKAEEAKKKA 1499
Query: 275 VEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATK 334
E ++A EA K + +KA+ +++ + + A+ E KA+ K
Sbjct: 1500 DEAKKAAEAKKK----ADEAKKAEEAKKADEAKKAEEAK--KADEAKKAEEKK-KADELK 1552
Query: 335 RAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHE---MEEAQKRIDKLEDH 391
+A ++ + + E+ ++ N ++ + + ++ R E +++ K E+
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612
Query: 392 IRQNEASLK-DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKK 450
+ EA +K + K EE V K + E +K+ E++ + + K E+ ++ ++
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672
Query: 451 QE 452
+
Sbjct: 1673 DK 1674
Score = 55.9 bits (134), Expect = 1e-07
Identities = 92/408 (22%), Positives = 176/408 (43%), Gaps = 32/408 (7%)
Query: 48 KAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHN 107
+ E+ + E+ + +K A++ EA + +E K + +R+ A++ E +
Sbjct: 1153 RVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAA--RKAEE 1210
Query: 108 ETDIKELEDELDKKKGE-VEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
E +E D KK E V+K E K+ AE E KK + N E + +E R
Sbjct: 1211 ERKAEEARKAEDAKKAEAVKKAEEAKKDAE----EAKKAEEERNNEEIRKFEEARMAHFA 1266
Query: 167 INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEY 226
+R + IK++E + A K E ++A+EA + E + AD K+KAE
Sbjct: 1267 ---RRQAAIKAEE------ARKADELKKAEEKKKADEAKKAE----EKKKADEAKKKAEE 1313
Query: 227 ERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNK 286
+++ + E+ + + KK+ K + + + +A + EA K
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAA--KAEAEAAADEAEAAEEKAEAAEK 1371
Query: 287 DIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTI 346
+ + AD K+KAE ++++ + E + + A A K+A + ++ +
Sbjct: 1372 KKEE-AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430
Query: 347 NREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK-DNKKL 405
+K D+ K E ++ E KKK E ++A++ K E+ + +EA K + K
Sbjct: 1431 ---KKADEAKKKAEEAKKAD---EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
Query: 406 KEELNSDVGSSKNRVQELQK--ELEQVIEELGDAKTDKHEDTRRKKKQ 451
+E +K + E +K E ++ +E A+ K D +K ++
Sbjct: 1485 ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE 1532
Score = 54.4 bits (130), Expect = 4e-07
Identities = 113/444 (25%), Positives = 187/444 (42%), Gaps = 51/444 (11%)
Query: 39 ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKI--EKDEAEKYQRIREEIVAKE 96
A KD E KAE + EE + A R+ A E +A++ ++ E+ A E
Sbjct: 1235 AKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADE 1294
Query: 97 VEHQLFKLYHNETDIKELE----DELDKKKGEVEK-IERRKEKAENILREKKKEQGALNR 151
+ K +E K E DE KK E +K + K+KAE E KK
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAE----EAKKAA----- 1345
Query: 152 ELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD 211
E AK + E + E +++ + K+ +KK +AKK E ++A+EA K
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKK--EEAKKKADAAKKKAEEKKKADEAKKK---- 1399
Query: 212 LETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAK 271
A+ K+KA+ +++ + E+ + E KK K E KK AK
Sbjct: 1400 -----AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEE----AKKADEAK 1450
Query: 272 KSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAE 331
K E ++A EA K E + AD K+KAE +++ + E+ + + AE
Sbjct: 1451 KKAEEAKKAEEAKKK---AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE 1507
Query: 332 ATKRA--------GKILQQLDTINREQKGDQDKLDNELRQ--QVQTQNEIKKKRHEMEEA 381
A K+A K + +K D+ K E ++ +++ E+KK + +
Sbjct: 1508 AKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAE 1567
Query: 382 QKRI---DKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVI--EELGD 436
+ + DK + EA + +++E + K + +E +K E I EEL
Sbjct: 1568 EAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK 1627
Query: 437 AKTDKHEDTRRKKKQELVENFKKA 460
A+ +K + + KKK+ E KKA
Sbjct: 1628 AEEEKKKVEQLKKKEA--EEKKKA 1649
Score = 51.3 bits (122), Expect = 3e-06
Identities = 92/409 (22%), Positives = 166/409 (40%), Gaps = 35/409 (8%)
Query: 45 ERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKL 104
E K E +AEE KKK A + E + ++A K + R+ AK VE + +
Sbjct: 1102 EAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVE--IARK 1159
Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
+ +E D KK E + KAE E +K + A E A+ +E R+ +
Sbjct: 1160 AEDARKAEEARKAEDAKKAEAARKAEEVRKAE----ELRKAEDARKAEAARKAEEERKAE 1215
Query: 165 VEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKA 224
+ ++ ++ +K AKK+ E R E + A + KA
Sbjct: 1216 EARKAEDAKKAEAVKKAEEAKKDAEEAKKA-EEERNNEEIRKFEEARMAHFARRQAAIKA 1274
Query: 225 EYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAH 284
E R++ + + E KK K++E+ KK AKK E ++A+EA
Sbjct: 1275 EEARKA--------DELKKAEEKKKADEAKKAEEK-----KKADEAKKKAEEAKKADEAK 1321
Query: 285 NKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLD 344
K A+ + AD K+KAE E+ + E +AEA + ++
Sbjct: 1322 KK--AEEAKKKADAAKKKAE-------------EAKKAAEAAKAEAEAAADEAEAAEEKA 1366
Query: 345 TINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKK 404
++K + K + +++ + + + + + + EE +K+ D+L+ + + + KK
Sbjct: 1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426
Query: 405 LKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQEL 453
+E+ +D K + E ++ EE A+ K + KK E
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475
Score = 39.4 bits (91), Expect = 0.012
Identities = 86/433 (19%), Positives = 181/433 (41%), Gaps = 30/433 (6%)
Query: 40 LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
K E KA+ + EE + KK ++ E K + +EA+K + + + K E
Sbjct: 1527 AKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEA 1586
Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
+ + E +K E+E K E +K E K KAE + + + E K ++
Sbjct: 1587 KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAE---------EEKKKVEQ 1637
Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 219
+++ + E KK L K++E + K AKK+ + ++A EA + + E + A+
Sbjct: 1638 LKKKEAEEKKKAEELKKAEE--ENKIKAAEEAKKAEEDKKKAEEAKKAE--EDEKKAAEA 1693
Query: 220 RKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ 279
K++AE +++ + E + E KK K K +K+ KK E ++
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK--AEEAKKEAEEDKKKAEEAKK 1751
Query: 280 ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKI 339
E K IA L+ + + + + E++++ +++ E + + K +
Sbjct: 1752 DEEEKKK-IAHLKKE-EEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFAN 1809
Query: 340 LQQLDTINREQKGDQDKL-DNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEAS 398
+ + D ++ D+ +++ ++N ++ E+ + + EA
Sbjct: 1810 IIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEAD 1869
Query: 399 LKDNKKLKEELNSDVGSSKNRVQELQKELEQVIE---------ELGDAKTDKHEDTRR-- 447
K LKE+ ++ + + + ++E+ I ++ D K DK E +R
Sbjct: 1870 FNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLDKDEYIKRDA 1929
Query: 448 -KKKQELVENFKK 459
+ ++E+++ KK
Sbjct: 1930 EETREEIIKISKK 1942
Score = 38.6 bits (89), Expect = 0.024
Identities = 88/392 (22%), Positives = 170/392 (43%), Gaps = 56/392 (14%)
Query: 39 ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
A K + + E +AEE KK A++ E K + DE +K + +++ K+ E
Sbjct: 1515 AKKAEEAKKADEAKKAEE-------AKKADEAKKAEEKKKADELKKAEELKKAEEKKKAE 1567
Query: 99 HQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQ 158
+E ++ +K E +K E + + L E++K+ A E AK +
Sbjct: 1568 EA--------KKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA---EEAKKAE 1616
Query: 159 EIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 218
E + E+ K + K++V ++KK A KK E+++A E NK A E + A+
Sbjct: 1617 EAKIKAEELKKAE----EEKKKVEQLKKKEAEEKKKAEELKKAEE-ENKIKAAEEAKKAE 1671
Query: 219 VRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
K+KAE +++ + + E KK + ++ ++KK A KK E++
Sbjct: 1672 EDKKKAEEAKKA------EEDEKKAAEALKKEA---EEAKKAEELKKKEAEEKKKAEELK 1722
Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGK 338
+A E + + + + A+ K+KAE + + + + +LK E K+A +
Sbjct: 1723 KAEEENKIKAEEAKKE-AEEDKKKAE---------EAKKDEEEKKKIAHLKKEEEKKAEE 1772
Query: 339 ILQQLDTINREQKGDQDKLDNELRQQV-QTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
I ++ + + E+ D+ D + R +V + +I + E K +
Sbjct: 1773 IRKEKEAVIEEE---LDEEDEKRRMEVDKKIKDIFDNFANIIEGGK----------EGNL 1819
Query: 398 SLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
+ D+K++++ +V SKN E E+
Sbjct: 1820 VINDSKEMEDSAIKEVADSKNMQLEEADAFEK 1851
Score = 37.8 bits (87), Expect = 0.038
Identities = 84/371 (22%), Positives = 147/371 (39%), Gaps = 33/371 (8%)
Query: 89 REEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGA 148
+E+ A E + F + + E +K E +K KAE E +K + A
Sbjct: 1084 KEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAE----EARKAEDA 1139
Query: 149 LNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 208
E A+ ++ + VEI +K K++E K A A + EVR+A E +
Sbjct: 1140 RKAEEARKAEDAKR--VEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAE 1197
Query: 209 IADLETQLADVRK----RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQ 264
D RK RKAE ER++ R E A+ E KK K E +
Sbjct: 1198 ---------DARKAEAARKAEEERKAEEAR--KAEDAKKAEAVKKAEEAKKDAE-----E 1241
Query: 265 KKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTE 324
K A +++ E+R+ EA A + + RKA+ +++ + + A+ E
Sbjct: 1242 AKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK--KADEAKKAE 1299
Query: 325 YTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR 384
KA+ K+ + ++ D ++ + + K D ++ + + + + E E A
Sbjct: 1300 EKK-KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Query: 385 IDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQV---IEELGDAKTDK 441
+ E+ E ++ KK + K + E +K+ E+ +EL A K
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKK-AEEKKKADEAKKKAEEDKKKADELKKAAAAK 1417
Query: 442 HEDTRRKKKQE 452
+ KKK E
Sbjct: 1418 KKADEAKKKAE 1428
Score = 34.0 bits (77), Expect = 0.52
Identities = 43/222 (19%), Positives = 91/222 (40%), Gaps = 12/222 (5%)
Query: 629 KAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEID 688
A+ +K ++E ++A + +K E + K E + + + K+ K E+
Sbjct: 1505 AAEAKKKADEAKKAEEA-KKADEAKKAEEAKKADEAK-----KAEEKKKADELKKAEELK 1558
Query: 689 ALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEA 748
+ A E K + +M R ++K EE E + + K + ++ EEA
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
Query: 749 ELRSQQ------ERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEAD 802
++++++ E++K+ Q K + + + EEE E K A E + +A+
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678
Query: 803 KLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQK 844
+ + +A+ EE KA A++ + A++
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEE 1720
>gnl|CDD|213245 cd03278, ABC_SMC_barmotin, ATP-binding cassette domain of barmotin, a
member of the SMC protein family. Barmotin is a tight
junction-associated protein expressed in rat epithelial
cells which is thought to have an important regulatory
role in tight junction barrier function. Barmotin belongs
to the SMC protein family. SMC proteins are large
(approximately 110 to 170 kDa), and each is arranged into
five recognizable domains. Amino-acid sequence homology
of SMC proteins between species is largely confined to
the amino- and carboxy-terminal globular domains. The
amino-terminal domain contains a 'Walker A'
nucleotide-binding domain (GxxGxGKS/T, in the
single-letter amino-acid code), which by mutational
studies has been shown to be essential in several
proteins. The carboxy-terminal domain contains a sequence
(the DA-box) that resembles a 'Walker B' motif, and a
motif with homology to the signature sequence of the
ATP-binding cassette (ABC) family of ATPases. The
sequence homology within the carboxy-terminal domain is
relatively high within the SMC1-SMC4 group, whereas SMC5
and SMC6 show some divergence in both of these sequences.
In eukaryotic cells, the proteins are found as
heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
and SMC5 with SMC6 (formerly known as Rad18).
Length = 197
Score = 65.2 bits (160), Expect = 8e-12
Identities = 23/59 (38%), Positives = 37/59 (62%), Gaps = 1/59 (1%)
Query: 996 RYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGIC 1054
R P+PF VLDE+DAALD+ N+ + A + ++++ Q IVI+ ++ AD L G+
Sbjct: 132 RVRPSPFCVLDEVDAALDDANVERFARLLKEFSKET-QFIVITHRKGTMEAADRLYGVT 189
Score = 43.6 bits (104), Expect = 1e-04
Identities = 14/31 (45%), Positives = 20/31 (64%)
Query: 8 IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
+E+ FKS+ K +I TA++GPNGSG
Sbjct: 4 LELKGFKSFADKTTIPFPPGLTAIVGPNGSG 34
>gnl|CDD|213240 cd03273, ABC_SMC2_euk, ATP-binding cassette domain of eukaryotic SMC2
proteins. The structural maintenance of chromosomes
(SMC) proteins are large (approximately 110 to 170 kDa),
and each is arranged into five recognizable domains.
Amino-acid sequence homology of SMC proteins between
species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code), which
by mutational studies has been shown to be essential in
several proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B' motif,
and a motif with homology to the signature sequence of
the ATP-binding cassette (ABC) family of ATPases. The
sequence homology within the carboxy-terminal domain is
relatively high within the SMC1-SMC4 group, whereas SMC5
and SMC6 show some divergence in both of these sequences.
In eukaryotic cells, the proteins are found as
heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
and SMC5 with SMC6 (formerly known as Rad18).
Length = 251
Score = 60.0 bits (146), Expect = 1e-09
Identities = 28/58 (48%), Positives = 40/58 (68%), Gaps = 7/58 (12%)
Query: 996 RYHPAPFFVLDEIDAALD--NT-NIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSL 1050
+ PAP ++LDE+DAALD +T NIG++ I T + S Q IV+SLKE F++A+ L
Sbjct: 185 LFKPAPMYILDEVDAALDLSHTQNIGRM---IKTHFKGS-QFIVVSLKEGMFNNANVL 238
Score = 34.6 bits (80), Expect = 0.18
Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
Query: 8 IEVDNFKSYKGKFSIGPL-KKFTAVIGPNGSG 38
I +D FKSY + I +F A+ G NGSG
Sbjct: 6 IILDGFKSYATRTVISGFDPQFNAITGLNGSG 37
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 61.2 bits (149), Expect = 2e-09
Identities = 92/431 (21%), Positives = 207/431 (48%), Gaps = 54/431 (12%)
Query: 40 LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
L++ ERL+ E+ + +E + + +EAK +K+E E+ ++ + +++E
Sbjct: 333 LEEKEERLE-ELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEK 391
Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
+L +L + +I+E ++ + GE++K + +KA L++ K + REL + ++
Sbjct: 392 ELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRK 451
Query: 160 --IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 217
+ E E+ + L + +E+ ++K+L +K L + + + K++A+ QL
Sbjct: 452 ELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK--LKELAE---QLK 506
Query: 218 DVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEV 277
++ ++ +Y NLE E+ KK K KE++ ++ ++ S KK L ++
Sbjct: 507 ELEEKLKKY----------NLE-----ELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKL 551
Query: 278 RQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAG 337
E K +A+LE +L ++ + AE + E L E+ +
Sbjct: 552 ----EELKKKLAELEKKLDELEEELAELLK----------------ELEELGFESVEELE 591
Query: 338 KILQQLDTINR---EQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ 394
+ L++L+ E K + +L+ E ++ + + E+ K E+ E +KR+++L + +
Sbjct: 592 ERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEE 651
Query: 395 -----NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKK 449
+E ++ ++ EL+ ++ + ++EL+K E++ + L K + E R+K
Sbjct: 652 LEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEE---REK 708
Query: 450 KQELVENFKKA 460
++ +E +KA
Sbjct: 709 AKKELEKLEKA 719
Score = 54.3 bits (131), Expect = 3e-07
Identities = 58/347 (16%), Positives = 164/347 (47%), Gaps = 8/347 (2%)
Query: 112 KELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKR 171
E+ E+ ++ +EK +R E E +++EK+KE + RE+ ++ E+ E+ E+ K
Sbjct: 168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLE 227
Query: 172 PSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSI 231
K + + +++++ +K L + + + I +LE ++ +++K E E +
Sbjct: 228 ----KEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283
Query: 232 PGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADL 291
+++ ++ + +++++ + + +L + E + E + + +L
Sbjct: 284 ELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEEL 343
Query: 292 ETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQK 351
+ +L ++ KR E E + + + ++ T K+ ++L+ + + ++
Sbjct: 344 KKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP--EKLEKELEELEKAKE 401
Query: 352 GDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
++++ + + + EIK+ + +EE +K K + + E + + K+L EE +
Sbjct: 402 EIEEEISKITARIGELKKEIKELKKAIEELKKA--KGKCPVCGRELTEEHRKELLEEYTA 459
Query: 412 DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFK 458
++ + ++E++++ ++ +EL + + +++ K +EL E K
Sbjct: 460 ELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLK 506
Score = 54.3 bits (131), Expect = 3e-07
Identities = 84/409 (20%), Positives = 184/409 (44%), Gaps = 42/409 (10%)
Query: 72 RKEAKIEKDEAEKYQRIREEI--VAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIE 129
KE K + EK+ + E I + KE E +L ++ +I EL ++ ++EK
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230
Query: 130 RRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLA 189
+ E+ + + E +KE +L K++++IRE++ I + + + + +E+V + K+L
Sbjct: 231 KELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKEL-KELK 289
Query: 190 SAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKK 249
+ +++ + E + ++ ++E +L+ + + E + +++ + + E+ KK
Sbjct: 290 EKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERI---KELEEKEERLEELKKK 346
Query: 250 RPSLIKSKERVSHIQKKLASAKKSLVEVRQ-ANEAHNKDIADLETQLADVRKRKAEYERQ 308
L K E + + AK E+ + LE +L ++ K K E E +
Sbjct: 347 LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEE 406
Query: 309 --SIPGRDINLES-----------------------AQMTEY--TNLKAEAT---KRAGK 338
I R L+ ++TE L E T KR K
Sbjct: 407 ISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEK 466
Query: 339 ILQQLDTINREQKGDQDKLDNELRQQ--VQTQNEIKKKRHEMEEAQKRID--KLEDHIRQ 394
L++++ R+ + + +L+ L+++ + E+ ++ E+EE K+ + +LE +
Sbjct: 467 ELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEE 526
Query: 395 NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
E + KLK E+ S + +++EL+K+L ++ ++L + + + E
Sbjct: 527 YEKLKEKLIKLKGEIKS-LKKELEKLEELKKKLAELEKKLDELEEELAE 574
Score = 50.8 bits (122), Expect = 4e-06
Identities = 81/435 (18%), Positives = 204/435 (46%), Gaps = 33/435 (7%)
Query: 40 LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
L+ +L+ ++ + EE + + + + KE K K++AE+Y ++ E +E
Sbjct: 250 LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFY--EEYLD 307
Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
+L ++ + ++E + ++++ E+E+ E R E+ + L+E +K L E ++ +E
Sbjct: 308 ELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL-EERHELYEE 366
Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 219
+ E+ + + L + ++K+L +K+ E+ + I +L+ ++ ++
Sbjct: 367 AKAKKEELERLKKRL--TGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKEL 424
Query: 220 RKRKAEYERQS----IPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLV 275
+K E ++ + GR++ E +++ + + L + ++ + I++K +K L
Sbjct: 425 KKAIEELKKAKGKCPVCGRELTEEHRKEL-LEEYTAELKRIEKELKEIEEKERKLRKELR 483
Query: 276 EVRQANEAHNKDIADLET--QLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEAT 333
E+ + + ++ I E QL ++ ++ +Y + + + EY LK +
Sbjct: 484 ELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEEL--------EKKAEEYEKLKEKLI 535
Query: 334 KRAGKILQQLDTINREQ--KGDQDKLDNELRQQVQTQNEIKKKRHEM-----EEAQKRID 386
K G+I + + + K +L+ +L + + E+ K+ E+ EE ++R+
Sbjct: 536 KLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLK 595
Query: 387 KLED------HIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTD 440
+LE ++ E L+ +K ++L ++ + + E +K LE++ +EL + +
Sbjct: 596 ELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655
Query: 441 KHEDTRRKKKQELVE 455
E+ + ++E +E
Sbjct: 656 YSEEEYEELREEYLE 670
Score = 49.7 bits (119), Expect = 8e-06
Identities = 88/426 (20%), Positives = 194/426 (45%), Gaps = 64/426 (15%)
Query: 41 KDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEK--YQRIREEIVAKEVE 98
++ E L E + EE + + + ++EK E E + ++EEI E E
Sbjct: 188 TENIEELIKEKEKELEEV-LREINEISSELPELREELEKLEKEVKELEELKEEIEELEKE 246
Query: 99 HQLFK--LYHNETDIKELED----------ELDKKKGEVEKIERRKEKAENILREKKKEQ 146
+ + E I+ELE+ EL++K E+++++ + E+ + ++
Sbjct: 247 LESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYL 306
Query: 147 GALNR---ELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANE 203
L L+++++EI ++ I + L + +ER+ ++KKL +K L E+ + +E
Sbjct: 307 DELREIEKRLSRLEEEINGIEERIKE----LEEKEERLEELKKKLKELEKRLEELEERHE 362
Query: 204 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHI 263
+ + + +L ++KR + + LE A++ EI ++ + R+ +
Sbjct: 363 LYE-EAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKE-EIEEEISKI---TARIGEL 417
Query: 264 QKKLASAKKSLVEVRQA-----------NEAHNKDI-ADLETQLADVRKRKAEYERQSIP 311
+K++ KK++ E+++A E H K++ + +L + K E E +
Sbjct: 418 KKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKE-- 475
Query: 312 GRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQN-- 369
L+ E + + ++ + I ++ +Q K EL ++++ N
Sbjct: 476 --------------RKLRKELRELEKVLKKESELIKLKELAEQLK---ELEEKLKKYNLE 518
Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
E++KK E E+ ++++ KL+ I+ SLK + EEL + + ++ EL++EL +
Sbjct: 519 ELEKKAEEYEKLKEKLIKLKGEIK----SLKKELEKLEELKKKLAELEKKLDELEEELAE 574
Query: 430 VIEELG 435
+++EL
Sbjct: 575 LLKELE 580
Score = 48.5 bits (116), Expect = 2e-05
Identities = 73/322 (22%), Positives = 148/322 (45%), Gaps = 23/322 (7%)
Query: 121 KKGEVEKIERRKEKAENILREKKKEQGALNRE-LAKVDQEIREMDVEINKKRPSLIKSKE 179
++GE++ I E E ++R+ L + + + E+ EI ++ L K +
Sbjct: 133 RQGEIDAILESDESREKVVRQ------ILGLDDYENAYKNLGEVIKEIKRRIERLEKFIK 186
Query: 180 RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
R +I++ + +K L EV + + ++ +L +L + K E E
Sbjct: 187 RTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELK------EEI 240
Query: 240 SAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVR 299
+ E+ S K +E++ +++++ KK + E+ E K++ +L+ + +
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEEL----EEKVKELKELKEKAEEYI 296
Query: 300 KRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDN 359
K YE R+I +++ E N E K + ++L+ + ++ K + +L
Sbjct: 297 KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRL-E 355
Query: 360 ELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNR 419
EL ++ + E K K+ E+E +KR+ L + E L++ +K KEE+ ++ R
Sbjct: 356 ELEERHELYEEAKAKKEELERLKKRLTGLT--PEKLEKELEELEKAKEEIEEEISKITAR 413
Query: 420 VQELQ---KELEQVIEELGDAK 438
+ EL+ KEL++ IEEL AK
Sbjct: 414 IGELKKEIKELKKAIEELKKAK 435
Score = 47.4 bits (113), Expect = 4e-05
Identities = 82/405 (20%), Positives = 184/405 (45%), Gaps = 52/405 (12%)
Query: 45 ERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKL 104
E+ K E+ + + + K + E K+A E +A+ + + +E +L +
Sbjct: 397 EKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEE 456
Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKK--EQGALNRELAKVDQEIRE 162
Y E +K +E EL + + + K+ + + E +L+++ + + L +L ++++++++
Sbjct: 457 YTAE--LKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKK 514
Query: 163 MDVE-INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
++E + KK K KE++ ++ ++ S KK L ++ E K +A+LE +L ++ +
Sbjct: 515 YNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKL----EELKKKLAELEKKLDELEE 570
Query: 222 RKAEYERQSIPGRDINLESAQDVE--------INKKRPSLIKSKERVSHIQKKLASAKKS 273
AE ++ ++ ES +++E + L +++ + +K+L ++
Sbjct: 571 ELAELLKEL---EELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEE 627
Query: 274 LVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEAT 333
L + + K + +L +L ++ K+ +E E + + + L E L+AE
Sbjct: 628 LDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSR----ELAGLRAE-- 681
Query: 334 KRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIR 393
L++L+ E K +KL EL E E+A+K ++KLE +
Sbjct: 682 ------LEELEKRREEIKKTLEKLKEELE--------------EREKAKKELEKLEKALE 721
Query: 394 QNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAK 438
+ E + KK K L + + ++ + ++ EEL + K
Sbjct: 722 RVEELREKVKKYKALL------KERALSKVGEIASEIFEELTEGK 760
Score = 44.7 bits (106), Expect = 3e-04
Identities = 43/222 (19%), Positives = 100/222 (45%), Gaps = 14/222 (6%)
Query: 245 EINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
EI ++ L K +R +I++ + +K L EV + + ++ +L +L + K E
Sbjct: 173 EIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKE 232
Query: 305 YERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQ 364
E ++ E K+ +++ + + + +++ EL ++
Sbjct: 233 LEELK----------EEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIE-ELEEK 281
Query: 365 VQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNE---ASLKDNKKLKEELNSDVGSSKNRVQ 421
V+ E+K+K E + + ++ D +R+ E + L++ EE ++ + R++
Sbjct: 282 VKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE 341
Query: 422 ELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSG 463
EL+K+L+++ + L + + K K+E +E KK +G
Sbjct: 342 ELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTG 383
Score = 33.9 bits (78), Expect = 0.64
Identities = 39/180 (21%), Positives = 88/180 (48%), Gaps = 23/180 (12%)
Query: 628 LKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEI 687
+K ++++L E LRE + S + EL + ++ LE + ++L+ K +I +LE E+
Sbjct: 195 IKEKEKELEEVLREINEISSELPEL---REELEKLEKEV----KELEELKEEIEELEKEL 247
Query: 688 DALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEE 747
++L E KI+ +E + I K+E+ +E+ V + +++ E
Sbjct: 248 ESLEGSKRKLEEKIRELE-------ERIEELKKEIEELEEKVKE-------LKELKEKAE 293
Query: 748 AELRSQQERQKICQDK-DTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLEN 806
++ + ++ + + +K ++R E ++ EE + + EE+L E++ + +LE
Sbjct: 294 EYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL-EELKKKLKELEK 352
Score = 32.3 bits (74), Expect = 1.6
Identities = 71/362 (19%), Positives = 154/362 (42%), Gaps = 28/362 (7%)
Query: 623 KEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAK 682
KE+ + + E+L ++L+E K+ + E + + K + L ++ L K
Sbjct: 331 KELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTG--LTPEK 388
Query: 683 LEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVED------IVFRDFCKS 736
LE E++ L + E +I I A + I K+ + ++ + R+ +
Sbjct: 389 LEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEE 448
Query: 737 IGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGE 796
+ +Y R ++E ++I + + + R V E EL + ++LA +
Sbjct: 449 HRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK----LKELAEQ 504
Query: 797 MRAEADKLENMRATRLTKK-QAVDAMDEEIGKARREVGSIAKDI---QAAQKSCVNLESK 852
++ +KL+ L KK + + + E++ K + E+ S+ K++ + +K LE K
Sbjct: 505 LKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKK 564
Query: 853 L---EMKKSERHDILMNCKMNDIVLPMLRVQK----YDRKLAKSIQEMTSRLQTIQAPNL 905
L E + +E L + R+++ Y+ L +++ L+ +
Sbjct: 565 LDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYL--ELKDAEKELEREEKELK 622
Query: 906 RAMEKLEHAKENLMKTNEEFENARKRAKKAKANFDRIKKERYDKFTRCFEHVSNEIDGAG 965
+ E+L+ A E L +T + E RK ++ + + +E Y++ + +S E+ G
Sbjct: 623 KLEEELDKAFEELAETEKRLEELRKELEELEKKYS---EEEYEELREEYLELSRELAGLR 679
Query: 966 SE 967
+E
Sbjct: 680 AE 681
Score = 32.0 bits (73), Expect = 1.9
Identities = 37/141 (26%), Positives = 70/141 (49%), Gaps = 16/141 (11%)
Query: 39 ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
+ + ++L+ E+ +A EE + + + + E +E + + E E+Y+ +REE + E
Sbjct: 616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE-EEYEELREEYLELSRE 674
Query: 99 H------------QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQ 146
+ ++ +KE +E +K K E+EK+E+ E+ E LREK K+
Sbjct: 675 LAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEE-LREKVKKY 733
Query: 147 GALNRELAKVDQEIREMDVEI 167
AL +E A ++ E+ EI
Sbjct: 734 KALLKERAL--SKVGEIASEI 752
Score = 30.8 bits (70), Expect = 4.4
Identities = 45/234 (19%), Positives = 96/234 (41%), Gaps = 18/234 (7%)
Query: 623 KEMGNLKAQKEKLSEELREAMKKSRKESELNTVQST---IKGLEIRLN--------YSRQ 671
KE+ ++ ++ KL +ELRE K +KESEL ++ +K LE +L +
Sbjct: 466 KELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAE 525
Query: 672 DLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFR 731
+ + K ++ KL+ EI +L + E K + + E + +E++ F
Sbjct: 526 EYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFE 585
Query: 732 DFCKSIGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEE 791
+ +++ E + + + + +K + + E + EELA + E
Sbjct: 586 SVEEL--EERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLE 643
Query: 792 KLAGEMRA-----EADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQ 840
+L E+ ++ E +R L + + + E+ + + I K ++
Sbjct: 644 ELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLE 697
Score = 30.0 bits (68), Expect = 8.0
Identities = 26/137 (18%), Positives = 62/137 (45%), Gaps = 17/137 (12%)
Query: 343 LDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASL--- 399
L + +E K ++L+ +++ + IK+K E+EE + I+++ + + L
Sbjct: 167 LGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKL 226
Query: 400 ----KDNKKLKEELNS----------DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDT 445
K+ ++LKEE+ + +++EL++ +E++ +E+ + + E
Sbjct: 227 EKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK 286
Query: 446 RRKKKQELVENFKKAYS 462
K+K E + Y
Sbjct: 287 ELKEKAEEYIKLSEFYE 303
>gnl|CDD|213194 cd03227, ABC_Class2, ATP-binding cassette domain of non-transporter
proteins. ABC-type Class 2 contains systems involved in
cellular processes other than transport. These families
are characterized by the fact that the ABC subunit is
made up of duplicated, fused ABC modules (ABC2). No known
transmembrane proteins or domains are associated with
these proteins.
Length = 162
Score = 54.7 bits (132), Expect = 2e-08
Identities = 20/59 (33%), Positives = 27/59 (45%)
Query: 995 HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
P P ++LDEID LD + +A I+ Q IVI+ E AD L+ I
Sbjct: 95 ASLKPRPLYILDEIDRGLDPRDGQALAEAILEHLVKGAQVIVITHLPELAELADKLIHI 153
Score = 31.6 bits (72), Expect = 1.1
Identities = 11/32 (34%), Positives = 14/32 (43%), Gaps = 1/32 (3%)
Query: 8 IEVDNFKSYKGKFSI-GPLKKFTAVIGPNGSG 38
I + F SY + T + GPNGSG
Sbjct: 2 IVLGRFPSYFVPNDVTFGEGSLTIITGPNGSG 33
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 55.8 bits (135), Expect = 1e-07
Identities = 75/413 (18%), Positives = 175/413 (42%), Gaps = 48/413 (11%)
Query: 77 IEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAE 136
+ D+ +++ +I KE + +L E+++ EL++E+++ + + E+ +++A+
Sbjct: 181 VLSDQRGSLDQLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEAD 240
Query: 137 NIL---REKKKEQGALNRELAKVDQEIRE-------MDVEINKKRPSLIKSKERVSHI-- 184
+L E+++E L E+ + + I E + E+ R L + +E +
Sbjct: 241 EVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLA 300
Query: 185 -------------------QKKLASAKKSLVEVRQANEAHNK-------DIADLETQLAD 218
+ + + L E R A +AHN+ D DLE + +
Sbjct: 301 EAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEE 360
Query: 219 VRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
+R+ AE E + R+ + +++E ++ + + +ER L +A+ L E+R
Sbjct: 361 LREEAAELESELEEAREAVEDRREEIEELEEE--IEELRERFGDAPVDLGNAEDFLEELR 418
Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGR----DINLESAQMTEYTNLKAEATK 334
+ + + A+LE L R+R E E G+ +E + E E +
Sbjct: 419 EERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETI---EEDRE 475
Query: 335 RAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ 394
R ++ +L+ + E+ + ++ V+ ++ I++ E+ ++ I + + I +
Sbjct: 476 RVEELEAELEDL-EEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEE 534
Query: 395 NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRR 447
++ ++ EL ++ + E ++E E+ EE+ + + E R
Sbjct: 535 KRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKER 587
Score = 54.7 bits (132), Expect = 2e-07
Identities = 78/380 (20%), Positives = 178/380 (46%), Gaps = 37/380 (9%)
Query: 72 RKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERR 131
R++A ++ AE+ + E+ E+E + +I+ELE+E+++ +
Sbjct: 348 REDADDLEERAEELREEAAEL-ESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVD 406
Query: 132 KEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASA 191
AE+ L E ++E+ L A+++ +R + + L K + +
Sbjct: 407 LGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPH 466
Query: 192 KKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRP 251
+++ E R+ E ++ DLE ++ +V +R LE A+D+ + R
Sbjct: 467 VETIEEDRERVEELEAELEDLEEEVEEVEER---------------LERAEDLVEAEDR- 510
Query: 252 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIP 311
+ + +ER +++ +A ++++ E R+ E + A+LE + + R+ AE E ++
Sbjct: 511 -IERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEE 569
Query: 312 GRDINLESAQMTEYTNLKAEATKRAGKILQQLDTIN---------REQKGDQDKLDNELR 362
R+ E A++ E + +I L I RE++ +L++E R
Sbjct: 570 ARE---EVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERR 626
Query: 363 QQVQTQNEIKKKRHE------MEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSS 416
+++ + E K++ +EEA++ ++ E+++ Q E L + ++ +++L +++G+
Sbjct: 627 ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAV 686
Query: 417 KNRVQELQKELEQVIEELGD 436
+N ++EL +EL + E L +
Sbjct: 687 ENELEEL-EELRERREALEN 705
Score = 52.3 bits (126), Expect = 1e-06
Identities = 85/448 (18%), Positives = 194/448 (43%), Gaps = 48/448 (10%)
Query: 42 DDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQL 101
+++E + E+ E E ER+E +E + EE+ +E + L
Sbjct: 244 EEHEERREELETLEAEIEDLRETIAETEREREELA---EEVRDLRERLEEL-EEERDDLL 299
Query: 102 FKLYHNETDIKELE---DELDKKKGEVE---------------KIERRKEKAENI---LR 140
+ ++ D + +E +EL+ + E+ + E +E A+++
Sbjct: 300 AEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAE 359
Query: 141 EKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ 200
E ++E L EL + + + + EI + + + +ER L +A+ L E+R+
Sbjct: 360 ELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELRE 419
Query: 201 ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERV 260
+ + A+LE L R+R E E G+ E Q VE + ++ + +ERV
Sbjct: 420 ERDELREREAELEATLRTARERVEEAEALLEAGK--CPECGQPVEGSPHVETIEEDRERV 477
Query: 261 SHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYE------RQSIPGRD 314
++ +L ++ + EV + E +D+ + E ++ + +R+ + E R++I +
Sbjct: 478 EELEAELEDLEEEVEEVEERLER-AEDLVEAEDRIERLEERREDLEELIAERRETIEEKR 536
Query: 315 INLES---------AQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQV 365
E A+ E AEA + A + +++ +N + ++++++ L +
Sbjct: 537 ERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIES-LERIR 595
Query: 366 QTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQK 425
I E+E +++ + L + + L + ++ K EL ++ + R++E ++
Sbjct: 596 TLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELEAEF--DEARIEEARE 653
Query: 426 ELEQVIEELGDAKTDKHEDTRRKKKQEL 453
+ E+ E L + ++ D R+++ +L
Sbjct: 654 DKERAEEYL--EQVEEKLDELREERDDL 679
Score = 35.8 bits (83), Expect = 0.16
Identities = 51/227 (22%), Positives = 90/227 (39%), Gaps = 49/227 (21%)
Query: 637 EELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADA 696
EE RE + E+E+ ++ TI E +++++ + ++ +LE E D L A A
Sbjct: 247 EERREEL--ETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGL 304
Query: 697 TEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQER 756
+ +A+E AR + + + EE R ++ R
Sbjct: 305 DDADAEAVE----ARREELEDRD--------------------------EELRDRLEECR 334
Query: 757 QKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQ 816
A R +DD EE RA+ E+ A E+ +E LE R +++
Sbjct: 335 VAA----QAHNEEAESLREDADDLEE--RAEELREE-AAELESE---LEEAREAVEDRRE 384
Query: 817 AVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDI 863
++ ++EEI + R G D+ N E LE + ER ++
Sbjct: 385 EIEELEEEIEELRERFGDAPVDLG-------NAEDFLEELREERDEL 424
Score = 32.3 bits (74), Expect = 1.8
Identities = 47/232 (20%), Positives = 93/232 (40%), Gaps = 26/232 (11%)
Query: 637 EELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADA 696
EE RE + E+EL ++ ++ +E RL + + E I+ L R +
Sbjct: 471 EEDRE--RVEELEAELEDLEEEVEEVEERLE--------RAEDLVEAEDRIERLEERRED 520
Query: 697 TEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQER 756
E I ++ + + +E +E + EAE +++ R
Sbjct: 521 LEELIAERRETIEEKRERAEELRERAAELE---------AEAEEKREAAAEAEEEAEEAR 571
Query: 757 QKICQ-DKDTKKNVARWER--AVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRL- 812
+++ + + + R E + +A A+ E+L E R +L + R RL
Sbjct: 572 EEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLR-EKREALAELNDERRERLA 630
Query: 813 TKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDIL 864
K++ ++ E +AR E +D + A++ +E KL+ + ER D+
Sbjct: 631 EKRERKRELEAEFDEARIE--EAREDKERAEEYLEQVEEKLDELREERDDLQ 680
Score = 31.9 bits (73), Expect = 2.2
Identities = 20/112 (17%), Positives = 47/112 (41%), Gaps = 8/112 (7%)
Query: 624 EMGNLKAQKEKLSEELREAMKKSRKES--------ELNTVQSTIKGLEIRLNYSRQDLQN 675
E+ +L+ + E+L EE + + ++ + ++ + L RL R Q
Sbjct: 280 EVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQA 339
Query: 676 TKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVED 727
+ L + D L RA+ + +E+ + + + ++EE+ +E+
Sbjct: 340 HNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEE 391
Score = 30.8 bits (70), Expect = 4.6
Identities = 42/237 (17%), Positives = 98/237 (41%), Gaps = 29/237 (12%)
Query: 622 DKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIA 681
+ E+ +L+ + E++ E L A E + ++ + LE + R+ ++ + +
Sbjct: 481 EAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAE 540
Query: 682 KLEAEIDALNARADATEPKI-KAIEASMTARG--DTISRK----KEEMNSVEDIVFRDFC 734
+L L A A+ +A E + AR ++ K KE + S+E I
Sbjct: 541 ELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTL--- 597
Query: 735 KSIGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLA 794
++ I E+ E+ +E+++ + + ++ E LA + + +L
Sbjct: 598 ----LAAIADAED-EIERLREKREALAELNDER------------RERLAEKRERKRELE 640
Query: 795 GEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLES 851
E + ++E R + ++ ++ ++E++ + R E + +I A + LE
Sbjct: 641 AEF--DEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 54.0 bits (130), Expect = 4e-07
Identities = 84/435 (19%), Positives = 196/435 (45%), Gaps = 41/435 (9%)
Query: 40 LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
L++ E+LK+ + E+ K + + +E +E EK+E K R + + + +E
Sbjct: 317 LEELLEKLKSLEERLEKLEE----KLEKLESELEELAEEKNELAKLLEERLKELEERLEE 372
Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
+L +K+LE+ + + K E+ ++ E+ + L E +KE L REL ++++E
Sbjct: 373 LEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEE 432
Query: 160 IREMDVEINKKRPSLIKSKERVSHI--------------QKKLASAKKSLVEVRQANEAH 205
I++++ +IN+ + E +K+L + +E + +
Sbjct: 433 IKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSR 492
Query: 206 NKDIADLETQLADVRKRKAEYERQSIPG-----RDINLESAQDVEINKKRPSLIKSKERV 260
K+ A+L ++ ++ K E E + I + ++ L + KE++
Sbjct: 493 EKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKL 552
Query: 261 S--HIQKKLASAKKSLVEVRQANEAHNK------DIADLETQLADVRKRKAEYERQSIPG 312
++++L + L E+++ E ++ +L +L +++K+ E E + +
Sbjct: 553 QLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEER-LSQ 611
Query: 313 RDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDN------ELRQQVQ 366
+ L+S +++E N EA + L++L+ ++ Q L+ EL +++
Sbjct: 612 LEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIR 671
Query: 367 TQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQE---L 423
+ + + ++EE + +++LE+ + Q L++ K E+ + ++R E L
Sbjct: 672 RELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEEL 731
Query: 424 QKELEQVIEELGDAK 438
+KELE++ + L +
Sbjct: 732 KKELEKLEKALELLE 746
Score = 50.5 bits (121), Expect = 5e-06
Identities = 73/438 (16%), Positives = 198/438 (45%), Gaps = 26/438 (5%)
Query: 39 ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
+++ L+ + + EE+ ++ + +E + + E+ + + E++ K +E
Sbjct: 271 IREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKL--KSLE 328
Query: 99 HQLFKLYHNETDIKELEDELDKKKGEVEK-IERRKEKAENILREKKKEQGALNRELAKVD 157
+L KL ++ +EL ++K E+ K +E R ++ E L E +KE L +++
Sbjct: 329 ERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLE 388
Query: 158 QEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 217
+ I+E+ E+ + +L + +E + ++K+L ++ L E+ + + + I LE++
Sbjct: 389 EAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKEL 448
Query: 218 DVRKRKAEYERQSIPGRDINLESAQDV------------EINKKRPSLIKSKERVSHIQK 265
+ + E+ + G+++ E +++ E + + +E + ++K
Sbjct: 449 MIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEK 508
Query: 266 KLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEY 325
+L ++ L+E+ + EA +LE +L + E E + L+ + E
Sbjct: 509 ELRELEEELIELLELEEA---LKEELEEKLEKLENLLEELEELK---EKLQLQQLK--EE 560
Query: 326 TNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRI 385
+ + ++L++L + R +K + ++L L++ + E++++ ++EE + +
Sbjct: 561 LRQLEDRLQELKELLEEL-RLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSL 619
Query: 386 DKLEDH--IRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
+ E + + E L+ + + ++EL++++E++ E+
Sbjct: 620 ELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIEN 679
Query: 444 DTRRKKKQELVENFKKAY 461
+ + ++K E +E ++
Sbjct: 680 EEQLEEKLEELEQLEEEL 697
Score = 49.0 bits (117), Expect = 1e-05
Identities = 55/290 (18%), Positives = 136/290 (46%), Gaps = 7/290 (2%)
Query: 178 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 237
KE + + K+ + L E+ + E + + + +L + + + E E + + ++I
Sbjct: 177 KEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELE-QEIE 235
Query: 238 LESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 297
+ E+ +++ L + K R+ I+ A K E + E +++ + +L +
Sbjct: 236 ALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEE 295
Query: 298 VRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKL 357
+ + E E + G LE + LK+ +R K+ ++L+ + E + + +
Sbjct: 296 LEREIEELEEELE-GLRALLEELEE-LLEKLKSL-EERLEKLEEKLEKLESELE-ELAEE 351
Query: 358 DNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSK 417
NEL + ++ +K+ +EE +K ++K + ++Q E ++++ K+ EL++ + +
Sbjct: 352 KNELAKLLEE--RLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQ 409
Query: 418 NRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSGVYDR 467
++EL+KELE++ EL + + + + + + E E +G ++
Sbjct: 410 EELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEK 459
Score = 40.1 bits (94), Expect = 0.006
Identities = 69/416 (16%), Positives = 168/416 (40%), Gaps = 50/416 (12%)
Query: 39 ALKDDYERLKAEMIQAEE-----ETNMSYLKKKGVVAERKEAKIE------KDEAEKYQR 87
L++ E L+ E+ +A E E + LK++ +I+ + E E+ +R
Sbjct: 365 ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELER 424
Query: 88 IREEI--VAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKE 145
EE+ K++E Q+ +L E I EL +K V E +E + +L + E
Sbjct: 425 ELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCP--VCGQELPEEHEKELLELYELE 482
Query: 146 QGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAH 205
L EL++ ++E E+ EI + L + +E + + + + K+ L + E
Sbjct: 483 LEELEEELSR-EKEEAELREEIEELEKELRELEEELIELLELEEALKEEL---EEKLEKL 538
Query: 206 NKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDV--EINKKRPSLIKSKERVSHI 263
+ +LE ++ ++ + E + + R L+ + + ++ L + +ER+ +
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKEL 598
Query: 264 QKKLASAKKSLVEVRQANEAHNKDIADLETQ--------------------------LAD 297
+KKL ++ L ++ + ++ A+ E + L +
Sbjct: 599 KKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEE 658
Query: 298 VRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKG---DQ 354
+ ++ E E + Q+ E + + ++ ++L+ + ++
Sbjct: 659 LEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLI 718
Query: 355 DKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELN 410
++L++ + + + E++K +E ++ +KL + + +++ E N
Sbjct: 719 EELESRKAELEELKKELEKLEKALELLEELREKLGKAGLRADILRNLLAQIEAEAN 774
Score = 36.3 bits (84), Expect = 0.12
Identities = 35/206 (16%), Positives = 84/206 (40%), Gaps = 10/206 (4%)
Query: 41 KDDYERLKAEMIQAEEE--TNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
+ + E+ +AEEE + + L + + E +A +E+ E + + E +
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI 677
Query: 99 HQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENI---LREKKKEQGALNRELAK 155
+L ++++LE+EL++ + E+E++ ++ + E + L +K E L +EL K
Sbjct: 678 ENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEK 737
Query: 156 VDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQ 215
+++ + ++ K + ++ + I + L + ++ + + N+ T
Sbjct: 738 LEKALELLEELREKLGKAGLR-----ADILRNLLAQIEAEANEILSKLSLNRYDLRRLTI 792
Query: 216 LADVRKRKAEYERQSIPGRDINLESA 241
D R I S
Sbjct: 793 RKDGNGGLVVVVYDGGEVRPIKTLSG 818
Score = 30.1 bits (68), Expect = 7.9
Identities = 36/242 (14%), Positives = 94/242 (38%), Gaps = 10/242 (4%)
Query: 627 NLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAE 686
KA+ E+L +L E ++ E + + ++ Q+ + + +I LE
Sbjct: 182 EAKAKIEELEGQLSELLEDIEDLLEALEEE-LKELKKLEEIQEEQEEEELEQEIEALEER 240
Query: 687 IDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYE 746
+ L + E + + + + ++EE+ +E + I + E
Sbjct: 241 LAELEEEKERLEELKARLLEIESLELEALKIREEELRELERL------LEELEEKIERLE 294
Query: 747 EAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLEN 806
E E ++ +++ + + + + EE L + + EKL E+ E + +N
Sbjct: 295 ELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESEL-EELAEEKN 353
Query: 807 MRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMN 866
A L ++ ++E + + +E+ + ++ +++ L+ +L + +I
Sbjct: 354 ELAKLLEERLK--ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEE 411
Query: 867 CK 868
+
Sbjct: 412 LE 413
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 52.0 bits (125), Expect = 1e-06
Identities = 66/312 (21%), Positives = 127/312 (40%), Gaps = 51/312 (16%)
Query: 13 FKSYKGKFSIGPLKKFTAVIGPNGSGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAER 72
+ +YK K L+K + G + D ER+KA +I+ +++
Sbjct: 354 YLAYKPK-----LEKVERKLPELG---IWKDVERIKALVIRGY--------PLAEALSKV 397
Query: 73 KEAKIEKDEAEKYQRIREEIVA--KEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIER 130
KE + +++ + R EI K ++ + E + EL+ EL++ K E+EK+E
Sbjct: 398 KEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLES 457
Query: 131 RKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLAS 190
E+ +R+K ++ D+EIR D I + L + K+RV +++KLA
Sbjct: 458 ELERFRREVRDKVRK-----------DREIRARDRRIERLEKELEEKKKRVEELERKLAE 506
Query: 191 AKK--------SLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ 242
+K V+ + + I + E + +++ P A+
Sbjct: 507 LRKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEYG-IKEGDVILVED--PSG----GGAR 559
Query: 243 DVEIN-KKRPSLIKSKERVSHI------QKKLASAKKSLVEVRQANEAHNKDIADLETQL 295
E K+P I E +SH + ++ + V++ + +E D +L +
Sbjct: 560 TAEELIDKKPRAIIRGEEMSHAAAEEFFKNEIPVLPEGDVQIIRLDEFAVVDSEELRRAI 619
Query: 296 ADVRKRKAEYER 307
+ +KR E ER
Sbjct: 620 EEWKKRFEERER 631
Score = 32.0 bits (73), Expect = 1.9
Identities = 15/87 (17%), Positives = 40/87 (45%), Gaps = 1/87 (1%)
Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
++K++ E+ ++ I E +K ++ E L + K ++EL++E+E+
Sbjct: 396 KVKEEERPREKEGT-EEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEK 454
Query: 430 VIEELGDAKTDKHEDTRRKKKQELVEN 456
+ EL + + + R+ ++ +
Sbjct: 455 LESELERFRREVRDKVRKDREIRARDR 481
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 51.1 bits (122), Expect = 3e-06
Identities = 67/421 (15%), Positives = 171/421 (40%), Gaps = 39/421 (9%)
Query: 70 AERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETD----IKELEDELDKKKGEV 125
+K + ++KD + ++ ++ +Q+ +L E D +K +E K +
Sbjct: 327 IIKKLSVLQKDYNDYIKKKSRY---DDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYS 383
Query: 126 EKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI---NKKRPSLIKSKERVS 182
+ IER IL+ ++ + A+ +EL +++ +++++ ++ N++ +L ++ + +S
Sbjct: 384 KNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELS 443
Query: 183 HIQKKLASAKKSLV--------EVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGR 234
+ L V + +N+ + LE ++ R+ E E + I +
Sbjct: 444 RNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKI-----REIEIEVKDIDEK 498
Query: 235 DINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET- 293
++L+ ++ +++ I ++ + L K + E++ ++ + + ++
Sbjct: 499 IVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSL 558
Query: 294 QLADVRKRKAEYER--QSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQK 351
+L D+ ++ + I DI ++ E + R +I
Sbjct: 559 KLEDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYID 618
Query: 352 GDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
+++NE NEI++ + +E+ + +ID + I + ++ + D K++ +N
Sbjct: 619 KSIREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRIND 678
Query: 412 -------------DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFK 458
D +++ R++ + L I EL D D +E KK +
Sbjct: 679 IEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGDL 738
Query: 459 K 459
K
Sbjct: 739 K 739
Score = 34.1 bits (78), Expect = 0.45
Identities = 22/87 (25%), Positives = 39/87 (44%), Gaps = 4/87 (4%)
Query: 623 KEMGNLKAQKEKLSEEL----REAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKS 678
E+ K EKL ++ ++ + +L + S I +E L SR+ L + K+
Sbjct: 636 NEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKA 695
Query: 679 QIAKLEAEIDALNARADATEPKIKAIE 705
A+LE+ I+ L R + +I I
Sbjct: 696 NRARLESTIEILRTRINELSDRINDIN 722
Score = 33.3 bits (76), Expect = 0.87
Identities = 16/94 (17%), Positives = 40/94 (42%)
Query: 630 AQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDA 689
AQ++K+ +E+ E R +L V ++ ++Y + L+++ ++ ++ +I
Sbjct: 149 AQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIAD 208
Query: 690 LNARADATEPKIKAIEASMTARGDTISRKKEEMN 723
T +I+ + D + K +N
Sbjct: 209 DEKSHSITLKEIERLSIEYNNAMDDYNNLKSALN 242
Score = 31.4 bits (71), Expect = 2.9
Identities = 52/334 (15%), Positives = 120/334 (35%), Gaps = 43/334 (12%)
Query: 70 AERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIE 129
A+RK+ E E +R + K+V + L ++I LE++L E+E I+
Sbjct: 149 AQRKKILDEILEINSLERNYD--KLKDV---IDMLRAEISNIDYLEEKLKSSNLELENIK 203
Query: 130 RRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLA 189
++ E K EI + +E N K ++ +
Sbjct: 204 KQIADDEKSHSITLK--------------EIERLSIEYNNAMDDYNNLKSALNELSSLED 249
Query: 190 SAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKK 249
+ + +I E+ L+ ++ Y + + R + + + +
Sbjct: 250 MKNR-----------YESEIKTAESDLSMELEKNNYY--KELEERHMKIINDPVYKNRNY 296
Query: 250 RPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQS 309
K K + + ++ L++ + A K ++ L+ D K+K+ Y+
Sbjct: 297 INDYFKYKNDIENKKQILSNIDAEI----NKYHAIIKKLSVLQKDYNDYIKKKSRYD--- 349
Query: 310 IPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQN 369
D+N + ++ Y K + ++++ ++ + + L+ Q +
Sbjct: 350 ----DLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPD 405
Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNK 403
IKK+ +E+ + I + Q +L++N
Sbjct: 406 AIKKELNEINVKLQDISSKVSSLNQRIRALRENL 439
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 47.3 bits (112), Expect = 4e-05
Identities = 64/372 (17%), Positives = 156/372 (41%), Gaps = 15/372 (4%)
Query: 109 TDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREM--DVE 166
+K E EL KK+ +++ ++I+ K+KE L +L KV+++I+ + D+E
Sbjct: 709 DKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIE 768
Query: 167 INKKRPSLIKSKERVSHIQKKLASAKKSL------VEVRQANEAHNKDIADLETQLADVR 220
+ I +E + + + + VE + A +A +DL+ + V
Sbjct: 769 EQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVN 828
Query: 221 KRKAEY-ERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ 279
+ K E I L + ++ L + + ++ + + + +
Sbjct: 829 QEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEE 888
Query: 280 ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKI 339
+ ++ L ++ D +++ + E + E E +N KA+ + I
Sbjct: 889 QLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQ--DKVNDI 946
Query: 340 LQQLDTINREQKGDQDKL-DNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEAS 398
+++ I+ K ++K+ D + Q + E+ ++EE +K +K+ + +R
Sbjct: 947 KEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQD 1006
Query: 399 LKDNKKLKEELNSDVGSSK--NRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVEN 456
+ K + L ++ K N ++E+++EL+Q ++E+G + + + +K ++ + +
Sbjct: 1007 IDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENI-DL 1065
Query: 457 FKKAYSGVYDRM 468
K+ + R
Sbjct: 1066 IKRNHVLALGRQ 1077
Score = 42.0 bits (98), Expect = 0.002
Identities = 73/394 (18%), Positives = 155/394 (39%), Gaps = 39/394 (9%)
Query: 83 EKYQRIREEIVAKEVEHQLFKLYHNETDIK--ELEDELDKKKGEVE---KIERRKEKAEN 137
E +++R+ K EHQ+ Y + K E+ D++ K+ ++E +I + E +
Sbjct: 189 ETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELD 248
Query: 138 ILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 197
L+ + KE ++ K+D EI+ + ++ + K + +K+ +
Sbjct: 249 PLKNRLKEIEHNLSKIMKLDNEIKALK----SRKKQMEKDNSELELKMEKVFQGTDE--Q 302
Query: 198 VRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSK 257
+ H + + + E +L D ++ E E+ + R +N E + + + + R L +
Sbjct: 303 LNDLYHNHQRTVREKERELVDCQR---ELEKLNKERRLLNQEKTE-LLVEQGRLQL---Q 355
Query: 258 ERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINL 317
+ + + R + + E Q+ + E RQ +
Sbjct: 356 ADRHQEHIRARDSLIQSLATRLELDGFERG-PFSERQIKNFHTLVIE--RQEDEAKTAAQ 412
Query: 318 ESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHE 377
A + LK E Q D I E+KG ++ + + Q E+K E
Sbjct: 413 LCADLQSKERLKQE----------QADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKE 462
Query: 378 MEEAQ---KRIDKLEDHIRQNEA--SLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIE 432
+++ + RI +L+ +R+ E S + L E L +V S +N +L ++L ++ +
Sbjct: 463 LQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQ 522
Query: 433 ELGDAKTDKHEDTRRKKKQELVENFKKAYSGVYD 466
E+ H T R + + L ++ +
Sbjct: 523 EMEQL---NHHTTTRTQMEMLTKDKMDKDEQIRK 553
Score = 35.8 bits (82), Expect = 0.15
Identities = 50/303 (16%), Positives = 113/303 (37%), Gaps = 21/303 (6%)
Query: 637 EELREAMKKSRKESELNTV-----QSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALN 691
+ +KK K + QS I E + R LQ I +L+ +I+
Sbjct: 712 KSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQE 771
Query: 692 ARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELR 751
P+ ++ + +T + R + E+ VE + + K G R ++
Sbjct: 772 TLLGTIMPEEESAKVCLTDVT-IMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQE 830
Query: 752 SQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATR 811
Q+++ ++ V E + +++ ++++ ++L++ +
Sbjct: 831 KQEKQHEL--------------DTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQI 876
Query: 812 LTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMND 871
T Q +E++ + EV S+ ++I+ A++ LE+ LE K + + L++ K
Sbjct: 877 GTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLE-KDQQEKEELISSKETS 935
Query: 872 IVLPMLRVQKYDRKLAKSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEFENARKR 931
+V K+ M IQ +++ E + EE E +++
Sbjct: 936 NKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEK 995
Query: 932 AKK 934
+
Sbjct: 996 INE 998
Score = 33.1 bits (75), Expect = 0.94
Identities = 74/450 (16%), Positives = 172/450 (38%), Gaps = 62/450 (13%)
Query: 49 AEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNE 108
E+ +AE+ + LKK+ + ++A +++ + Q + + Q+ L ++
Sbjct: 485 RELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDK 544
Query: 109 TDIKEL--------EDELDKKKG------EVEKIERRKEKAENILREKKKEQGALNRELA 154
D E DEL G ++E K K N R++ + LN+ELA
Sbjct: 545 MDKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAK---LNKELA 601
Query: 155 KVDQEIREMDVEINKKRPSLIKSKERV-------------SHIQKKLASAKKSLVEVRQA 201
++Q ++ E+ K L ++++ +++++ + K + A
Sbjct: 602 SLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLAGA 661
Query: 202 NEAHNKDIADL-----------------ETQLADVRKRKAEYERQSIPGRDINLESAQDV 244
+++ I L E +L + + + + P + + ES
Sbjct: 662 TAVYSQFITQLTDENQSCCPVCQRVFQTEAELQEFIS-DLQSKLRLAPDKLKSTESELKK 720
Query: 245 EINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
+ ++ L + R S I K +K + E+R + N+DI L+ + +
Sbjct: 721 KEKRRDEMLGLAPGRQSIIDLK----EKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGT 776
Query: 305 YERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQ 364
+ + + M + + ++ + +L D D+ ++ Q+
Sbjct: 777 IMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGS------DLDRTVQQVNQE 830
Query: 365 VQ-TQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDV---GSSKNRV 420
Q Q+E+ ++E +K I ++ I+ ++ + K K ++ +++ + ++
Sbjct: 831 KQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQL 890
Query: 421 QELQKELEQVIEELGDAKTDKHEDTRRKKK 450
EL E++ +I E+ DAK +K
Sbjct: 891 VELSTEVQSLIREIKDAKEQDSPLETFLEK 920
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 46.6 bits (111), Expect = 5e-05
Identities = 48/293 (16%), Positives = 99/293 (33%), Gaps = 74/293 (25%)
Query: 172 PSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSI 231
S +++ IQK++A+ +K + E + K + LET++A +
Sbjct: 31 FSAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEA---------- 80
Query: 232 PGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADL 291
LI++ + + ++K++A N + L
Sbjct: 81 --------------------QLIETADDLKKLRKQIADL--------------NARLNAL 106
Query: 292 ETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTN-----LKAEATKRAGKILQQLDTI 346
E Q + R+R AE L + Q + + E +R+ ++ +
Sbjct: 107 EVQEREQRRRLAEQ-----------LAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGAL 155
Query: 347 NREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK----DN 402
N + + T ++ R E+ Q + L R +A L +
Sbjct: 156 NPARAE-------RIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEER 208
Query: 403 KKLKEELNSDVGSSKNRVQELQK---ELEQVIEELGDAKTDKHEDTRRKKKQE 452
KK +LNS++ + + +++EL+ L+ I A E +
Sbjct: 209 KKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAA 261
Score = 45.5 bits (108), Expect = 1e-04
Identities = 36/220 (16%), Positives = 85/220 (38%), Gaps = 20/220 (9%)
Query: 107 NETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
+ +I LE ++ +++ + K+E++ + E + + + +L K+ ++I +++
Sbjct: 43 IQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNAR 102
Query: 167 INKKRPSLIKSKERVSHI------------------QKKLASAKKSLVEVRQANEAHNKD 208
+N + + R++ + + + + N A +
Sbjct: 103 LNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAER 162
Query: 209 IADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLA 268
I L+ L + +AE + L S Q + K L + K+ ++ + +L+
Sbjct: 163 IDALKATLKQLAAVRAEIAAEQAELT--TLLSEQRAQQAKLAQLLEERKKTLAQLNSELS 220
Query: 269 SAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 308
+ +K L E+R +IA E A R+ A E
Sbjct: 221 ADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAA 260
Score = 40.9 bits (96), Expect = 0.003
Identities = 48/288 (16%), Positives = 99/288 (34%), Gaps = 64/288 (22%)
Query: 110 DIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINK 169
+ +L + + E+ +E++ +RE++ ++ L ++L ++ EI
Sbjct: 32 SAAADDKQLKQIQKEIAALEKK-------IREQQDQRAKLEKQLKSLETEIAS------- 77
Query: 170 KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
++ +L L ++R+ N + LE Q + R+R AE
Sbjct: 78 --------------LEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQ--- 120
Query: 230 SIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIA 289
L + Q N P VS + + + + N A + I
Sbjct: 121 --------LAALQRSGRN---PPP---ALLVS--PEDAQRSVRLAIYYGALNPARAERID 164
Query: 290 DLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINRE 349
L+ L + +AE + E T L +E + K+ Q L+ E
Sbjct: 165 ALKATLKQLAAVRAEIAAEQ-------------AELTTLLSEQRAQQAKLAQLLE----E 207
Query: 350 QKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
+K +L++EL + E++ ++ + R+ A
Sbjct: 208 RKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAA 255
Score = 39.7 bits (93), Expect = 0.008
Identities = 48/301 (15%), Positives = 92/301 (30%), Gaps = 63/301 (20%)
Query: 649 ESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASM 708
+ +L +Q I LE ++ + + Q+ LE EI +L A+ T +K + +
Sbjct: 37 DKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQI 96
Query: 709 TARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQER--QKICQDKDTK 766
++ E + R Q+ R +++ + +
Sbjct: 97 ADLNARLNA----------------------------LEVQEREQRRRLAEQLAALQRSG 128
Query: 767 KNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIG 826
+N +D + R L D L+ + + A E+
Sbjct: 129 RNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELT 188
Query: 827 KARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMNDIVLPMLRVQKYDRKL 886
E Q AQ++ KL ER +K +L
Sbjct: 189 TLLSE--------QRAQQA------KLAQLLEER-------------------KKTLAQL 215
Query: 887 AKSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEFENARKRAKKAKANFDRIKKER 946
+ +L+ ++A R ++ A+ K E A A +A+A + E
Sbjct: 216 NSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEAKRTGET 275
Query: 947 Y 947
Y
Sbjct: 276 Y 276
Score = 35.9 bits (83), Expect = 0.12
Identities = 29/244 (11%), Positives = 87/244 (35%), Gaps = 25/244 (10%)
Query: 609 GSLDLARKAKRWDDKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNY 668
++ A + DDK++ ++ + L +++RE + K ++ +K LE
Sbjct: 24 AAVLAAAFSAAADDKQLKQIQKEIAALEKKIREQQDQRAK------LEKQLKSLE----- 72
Query: 669 SRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDI 728
++ + ++Q+ + ++ L + ++ A+E + ++
Sbjct: 73 --TEIASLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAE-------QLAA 123
Query: 729 VFRDFCKSIGVSTIRQYE-EAELRSQQERQKICQ-DKDTKKNVARWERAVSDDEEELARA 786
+ R + + + +R + + + + ++ E+A
Sbjct: 124 LQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAE 183
Query: 787 QGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSC 846
Q L E RA+ KL +K+ + ++ E+ ++++ + + +
Sbjct: 184 QAELTTLLSEQRAQQAKLA---QLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEI 240
Query: 847 VNLE 850
+ E
Sbjct: 241 ASAE 244
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 46.7 bits (111), Expect = 8e-05
Identities = 75/367 (20%), Positives = 142/367 (38%), Gaps = 63/367 (17%)
Query: 42 DDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQL 101
++ L + +AEEE ++ L+K E ++ + EK E+ +R+ + +
Sbjct: 174 RTWKDLVKALDEAEEE--LANLRK-----ELRQLEKEKQRLERLRRLLPLLAER------ 220
Query: 102 FKLYHNETDIKELEDELDKKKGEV----EKIERRKEKAENILREKKKEQGALNRELAKVD 157
K LE +L GEV R E+A LR ++ L L +
Sbjct: 221 ----------KALEQQLAAL-GEVIDLPPDAVERYEEARAELRAARRNLELLTERLEALQ 269
Query: 158 QEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 217
E+ E+ ++ +E ++ A+A ++L + R +D+ D E ++A
Sbjct: 270 AELDEISLD-----------EELLAQ-----AAAIEALHQQRGEYRNAEQDLPDREGEIA 313
Query: 218 DVRKRKAEYERQSIPGRDI-NLESAQDVEINKKRPS-LIKSKERVSHIQKKLASAKKSLV 275
+ R+ A Q P D +ES + K+ + L K KE + + L SA+ +L
Sbjct: 314 NAREAAAALLAQIGPDADEEAVESLRPSLAAKETVTELEKRKEA---LDQALKSARDALE 370
Query: 276 EVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKR 335
E + + +A L T R A + R ++++A A A +
Sbjct: 371 ERERELKQVRAQLAALPTVTVPEPLRAALADAL----RAGDIDAALAALEQE-VAVAKRE 425
Query: 336 AGKILQQLDTINREQKGDQDKLDNELRQQVQT--------QNEIKKKRHEMEEAQKRIDK 387
+ L +L + + L +QVQ ++KR + E ++ ++
Sbjct: 426 LAQALSRLGLWRGDLEELV-ALPVPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEEDLET 484
Query: 388 LEDHIRQ 394
LE +R
Sbjct: 485 LELQLRH 491
Score = 35.1 bits (81), Expect = 0.24
Identities = 56/338 (16%), Positives = 118/338 (34%), Gaps = 26/338 (7%)
Query: 72 RKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERR 131
R+ + + A Y+ E A ++ +L + +++ L + + + +E++E+
Sbjct: 519 RRGYALLEPTASAYEEAVES--ADQLADRLLREAQLVGELQSLRQQEEAARRRLEQLEKE 576
Query: 132 KEKAENILREKKKEQGALNRELA--KVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLA 189
E E L ++ A E+ + E R + + + + L
Sbjct: 577 LEVLELALAALREAWQAQWAAAGLPLTPAEMEDWLAERATAREQVRAYFKARAELDALLD 636
Query: 190 SAKKSLVEVRQA--NEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEIN 247
+ +R A A +L L R+ E E+Q+ + E +D E
Sbjct: 637 RRARLRAALRAALKAVAIVLPGEELAELLELARQLLEEAEKQA-ARKASLDERLRDAERA 695
Query: 248 KKRPSLIKSKERVSHIQKKLASA----KKSLVEVRQANEAHNKDIADLETQLADVRKRKA 303
L +++ER Q L + +L+E A D L +++++
Sbjct: 696 -----LEEAEERHDEAQSALEAWQEQWYDALLEAGLGGRASPAGALDALELLQNIKEKLQ 750
Query: 304 EYERQSIPGRDINLESAQMT-EYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELR 362
D+ A M + + E A + ++ ++ K L+
Sbjct: 751 AA-------DDLRQRIAAMERDLARFEEEVEALAEAVAPEMLGTPADETARALKQR--LK 801
Query: 363 QQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK 400
+ T +K E+EEA+K + + + + EA L
Sbjct: 802 RARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLT 839
Score = 34.7 bits (80), Expect = 0.28
Identities = 75/390 (19%), Positives = 147/390 (37%), Gaps = 41/390 (10%)
Query: 86 QRIREEIVAK--EVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKK 143
R EEI+ E+ LF + + DEL+K+ E+ K R +K
Sbjct: 101 VRGGEEILEAQGELGQYLFSAAAGVGSLGSVRDELEKEADELWK-----------PRGRK 149
Query: 144 KEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANE 203
E +EL +++ EIRE+ ++ + + + +++LA+ +K + E
Sbjct: 150 PEINVALKELKELEAEIREVQLKTR----TWKDLVKALDEAEEELANLRKE----LRQLE 201
Query: 204 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINL-ESAQDV------EINKKRPSLIKS 256
+ + L L + +RKA ++ + G I+L A + E+ R +L
Sbjct: 202 KEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELRAARRNLELL 261
Query: 257 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 316
ER+ +Q +L + + + A I L Q R AE + G N
Sbjct: 262 TERLEALQAELDE-----ISLDEELLAQAAAIEALHQQR--GEYRNAEQDLPDREGEIAN 314
Query: 317 LESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQ-VQTQNEIKKKR 375
A + +A + A + +E + +K L Q ++ ++++
Sbjct: 315 AREAAAALLAQIGPDADEEA-VESLRPSLAAKETVTELEKRKEALDQALKSARDALEERE 373
Query: 376 HEMEEAQKRIDKLEDHI--RQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
E+++ + ++ L A+L D + + + + + V ++EL Q +
Sbjct: 374 RELKQVRAQLAALPTVTVPEPLRAALADALRAGDIDAALA-ALEQEVAVAKRELAQALSR 432
Query: 434 LGDAKTDKHED-TRRKKKQELVENFKKAYS 462
LG + D E E V+ F K
Sbjct: 433 LGLWRGDLEELVALPVPSAEQVQAFLKEAE 462
Score = 32.0 bits (73), Expect = 2.2
Identities = 34/280 (12%), Positives = 87/280 (31%), Gaps = 26/280 (9%)
Query: 35 NGSGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVA 94
+ L + AE + + + + + + +A +
Sbjct: 673 EEAEKQAARKASLDERLRDAERALEEAEERHDEAQSALEAWQEQWYDALLEAGLGGRASP 732
Query: 95 KEVEHQLFKLYHNETDIKELEDELDKKKGEVE-KIERRKEKAENILRE-----KKKEQGA 148
+L N + + D+L ++ +E + R +E+ E +
Sbjct: 733 AGA-LDALELLQNIKEKLQAADDLRQRIAAMERDLARFEEEVEALAEAVAPEMLGTPADE 791
Query: 149 LNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 208
R L + + R+ K + ++++ VS L A+ L + +A +
Sbjct: 792 TARALKQRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIE 851
Query: 209 IADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLA 268
+ +D + + I +L+++ +S
Sbjct: 852 ELLAAVERSDTYRELRK-------------------RIAALERTLVRAGGGLSLEALVAE 892
Query: 269 SAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 308
+A E+ E +DI +LE +L ++ + +++
Sbjct: 893 AAALDPDELPARLEELARDIEELEEELNELAQEVGAAKQE 932
Score = 30.9 bits (70), Expect = 4.3
Identities = 73/470 (15%), Positives = 161/470 (34%), Gaps = 53/470 (11%)
Query: 42 DDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIRE-EIVAKEVEHQ 100
+ YE +AE+ A + + + + AE E ++++ + I E +
Sbjct: 242 ERYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNA 301
Query: 101 LFKLYHNETDIKELEDELD------KKKGEVEKIERRK--EKAENILREKKKEQGALNRE 152
L E +I + + E +E + A+ + E +K + AL++
Sbjct: 302 EQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAKETVTELEKRKEALDQA 361
Query: 153 LAKVDQEIREMDVEINKKRPSL--IKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIA 210
L + E + E+ + R L + + ++ LA A + ++ A A +++A
Sbjct: 362 LKSARDALEERERELKQVRAQLAALPTVTVPEPLRAALADALR-AGDIDAALAALEQEVA 420
Query: 211 DLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASA 270
+ +LA R + + + SA+ V+ K I +R + +L
Sbjct: 421 VAKRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAEEIMQAKR--QKRDRLLEL 478
Query: 271 ----KKSLVEVRQANEAH----NKDIADLET---QLADVRKRKAEYERQSIPGRDINLES 319
+ +++R + A +++A L +R + + +ES
Sbjct: 479 EEDLETLELQLRHLDAAGAVPTEEEVAAARARRDALWQDIRRGYALLEPTASAYEEAVES 538
Query: 320 A-QMTEYTNLKAEATKRAGKILQQLDTINR--EQ-KGDQDKLDNELR------QQVQTQN 369
A Q+ + +A+ + QQ + R EQ + + + L+ L Q
Sbjct: 539 ADQLADRLLREAQLVGELQSLRQQEEAARRRLEQLEKELEVLELALAALREAWQAQWAAA 598
Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEE------------LNSDVGSSK 417
+ EME+ + +R + + L + +
Sbjct: 599 GLPLTPAEMEDWLAERATAREQVRAYFKARAELDALLDRRARLRAALRAALKAVAIVLPG 658
Query: 418 NRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSGVYDR 467
+ EL + Q++EE R+ E + + ++A +R
Sbjct: 659 EELAELLELARQLLEEAEKQA------ARKASLDERLRDAERALEEAEER 702
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 46.2 bits (110), Expect = 9e-05
Identities = 67/376 (17%), Positives = 134/376 (35%), Gaps = 76/376 (20%)
Query: 106 HNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDV 165
+ ++K+LE+++++ + E+EK E EKA+N +K A + K D + E
Sbjct: 99 ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK-----AWKKLAKKYDSNLSEALK 153
Query: 166 EINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 225
+N K+ K + + + AS+ SL E++ + L E
Sbjct: 154 GLNYKKNFKEKLLKELKSVILN-ASSLLSLEELKAKIKTLFSSNKPELALLTLSVIDFDE 212
Query: 226 YERQSIPGRDINLESAQDVEINKKRPSLIK------------------------------ 255
E+ I + I S +
Sbjct: 213 IEQAEILEKSIIGSSDVPISELINNLGNSDWVKEGLEYHEEGDTCPFCQQTITEERKAEL 272
Query: 256 ----------SKERVSHIQKKLASAKKSLVE-------VRQANEAHNKDIADLETQLADV 298
E++ + K S + +E + N D+ +L+ L +
Sbjct: 273 EAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEAL 332
Query: 299 RK-----RKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGD 353
+ + E+ P I LES T I +D IN +
Sbjct: 333 EEILEKNLQKLEEKLKDPSTSIELES------------ITDLIESINDIIDAINELIREH 380
Query: 354 QDKLDNELRQQVQTQNEIKKK--RHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
+K+DN ++ +N+ KKK H + E ++ ID + + E ++ +K ++L +
Sbjct: 381 NEKIDNLKKE----KNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEA 436
Query: 412 DVGSSKNRVQELQKEL 427
++ + + ++EL+K+L
Sbjct: 437 EIKALEKEIKELEKQL 452
Score = 45.8 bits (109), Expect = 1e-04
Identities = 33/199 (16%), Positives = 82/199 (41%), Gaps = 28/199 (14%)
Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
+ I++LE+ +DK + +EK E+ E+IL +K+ ++ ++ + ++ +
Sbjct: 281 QELIEQLEELIDKYESHIEKA---LEELESILDTEKENS-EFKLDVEELKALLEALEEIL 336
Query: 168 NKKRPSLIKSKERVSH------IQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
K L + + S I + S + + + HN+ I +L+ + +K
Sbjct: 337 EKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKK 396
Query: 222 RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQAN 281
+ + + ++++ Q ++K + S +K + ++
Sbjct: 397 KLWLHLVAEL---KEDIDAYQKE---------------KKGLEKAINSLEKEIKQLEAEI 438
Query: 282 EAHNKDIADLETQLADVRK 300
+A K+I +LE QL ++
Sbjct: 439 KALEKEIKELEKQLTNIEP 457
Score = 37.3 bits (87), Expect = 0.049
Identities = 35/186 (18%), Positives = 74/186 (39%), Gaps = 20/186 (10%)
Query: 39 ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDE-AEKYQRIREEIVAKEV 97
L D YE + ++ E + + + +E K + E ++ +++ K
Sbjct: 289 ELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEKLK 348
Query: 98 E-HQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNR-ELAK 155
+ +L I+ + D +D E I EK +N+ +EK K + L +A+
Sbjct: 349 DPSTSIELESITDLIESINDIIDA---INELIREHNEKIDNLKKEKNKAKKKLWLHLVAE 405
Query: 156 VDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQ 215
+ ++I E ++K + S +K + ++ +A K+I +LE Q
Sbjct: 406 LKEDIDAYQKEKKG--------------LEKAINSLEKEIKQLEAEIKALEKEIKELEKQ 451
Query: 216 LADVRK 221
L ++
Sbjct: 452 LTNIEP 457
Score = 34.6 bits (80), Expect = 0.31
Identities = 18/96 (18%), Positives = 41/96 (42%), Gaps = 12/96 (12%)
Query: 75 AKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEK 134
KI+ + EK + ++++ L + + DI + E + + +E+ ++
Sbjct: 382 EKIDNLKKEK-NKAKKKLW-------LHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQ 433
Query: 135 AENILREKKKEQGALNRELAKVDQEIREMDVEINKK 170
E ++ +KE L ++L ++ EINK
Sbjct: 434 LEAEIKALEKEIKELEKQLTNIEPTAD----EINKL 465
Score = 31.1 bits (71), Expect = 3.5
Identities = 23/90 (25%), Positives = 43/90 (47%), Gaps = 6/90 (6%)
Query: 106 HNETDIKELEDELDKKKGEVEKIERRKEKAE--NILREKKKEQGALN---RELAKVDQEI 160
HNE I L+ E +K K ++ + K + +EKK + A+N +E+ +++ EI
Sbjct: 380 HNEK-IDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEI 438
Query: 161 REMDVEINKKRPSLIKSKERVSHIQKKLAS 190
+ ++ EI + L + I K L +
Sbjct: 439 KALEKEIKELEKQLTNIEPTADEINKLLKA 468
>gnl|CDD|227318 COG4985, COG4985, ABC-type phosphate transport system, auxiliary
component [Inorganic ion transport and metabolism].
Length = 289
Score = 43.8 bits (103), Expect = 3e-04
Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 7/110 (6%)
Query: 337 GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR-IDKLEDHIRQN 395
GK+L+ Q+ D D L+ EL++++ ++ + +M E Q R I+ + +R
Sbjct: 145 GKLLRFDSN---GQELDGDPLERELQKRLLEVETLRDQVDKMVEQQVRVINSQLERLRLE 201
Query: 396 EASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDT 445
+ L+ N +L +E + K+ ELQK L Q+ EL + +
Sbjct: 202 KRRLQLNGQLDDEFQQHYVAEKS---ELQKRLAQLQTELDALRAELERQF 248
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 44.4 bits (106), Expect = 3e-04
Identities = 28/119 (23%), Positives = 58/119 (48%), Gaps = 4/119 (3%)
Query: 48 KAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHN 107
+A+ I E + +KK+ ++ ++E ++E EK R R E++ +L
Sbjct: 39 EAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERR----NELQKLEKRLLQK 94
Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
E ++ + L+K++ E+EK E+ E+ + L +K++E L E + + I + E
Sbjct: 95 EENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAE 153
Score = 40.5 bits (96), Expect = 0.005
Identities = 30/126 (23%), Positives = 63/126 (50%), Gaps = 18/126 (14%)
Query: 64 KKKGVVAERKEAKIEKDEAEKYQRIREEIV--AKEVEHQLFKLYHNETDIKELEDELDKK 121
+ K AE + +I ++ ++ + I++E + AKE H+L E E EL ++
Sbjct: 30 EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKL---------RNEFEKELRER 80
Query: 122 KGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERV 181
+ E++K+E+R + E L K L K ++E+ + + E+ +K+ L K +E +
Sbjct: 81 RNELQKLEKRLLQKEENLDRKL-------ELLEKREEELEKKEKELEQKQQELEKKEEEL 133
Query: 182 SHIQKK 187
+ ++
Sbjct: 134 EELIEE 139
Score = 39.0 bits (92), Expect = 0.014
Identities = 26/107 (24%), Positives = 52/107 (48%), Gaps = 13/107 (12%)
Query: 365 VQTQNEIKKKRHEME----EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRV 420
++ + EI K R+E E E + + KLE + Q E +L +L E+ ++ + +
Sbjct: 60 LEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKEL 119
Query: 421 QELQKELEQVIEELGDAKTDKHED--------TRRKKKQELVENFKK 459
++ Q+ELE+ EEL + ++ T + K+ L+E ++
Sbjct: 120 EQKQQELEKKEEEL-EELIEEQLQELERISGLTAEEAKEILLEKVEE 165
Score = 37.5 bits (88), Expect = 0.037
Identities = 23/108 (21%), Positives = 64/108 (59%), Gaps = 6/108 (5%)
Query: 330 AEATKRAGKILQQ----LDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRI 385
EA + A +IL++ + I +E + + ++LR + + E++++R+E+++ +KR+
Sbjct: 34 KEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFE--KELRERRNELQKLEKRL 91
Query: 386 DKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
+ E+++ + L+ ++ E+ ++ + +++ ++ELE++IEE
Sbjct: 92 LQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139
Score = 32.8 bits (76), Expect = 1.1
Identities = 25/126 (19%), Positives = 55/126 (43%), Gaps = 6/126 (4%)
Query: 112 KELEDELDKKKGEV-EKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKK 170
KE E + E E+I + + + E LRE++ E L + L + ++ + + K+
Sbjct: 49 KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108
Query: 171 RPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET-----QLADVRKRKAE 225
L K ++ + Q++L ++ L E+ + + I+ L L + + +A
Sbjct: 109 EEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEAR 168
Query: 226 YERQSI 231
+E +
Sbjct: 169 HEAAVL 174
Score = 29.7 bits (68), Expect = 9.1
Identities = 48/235 (20%), Positives = 92/235 (39%), Gaps = 76/235 (32%)
Query: 120 KKKGEVEKIERRKEKAENILREKKKEQGALNREL------------AKVDQEIREMDVEI 167
KK E KI+ +E+A+ IL E KKE A+ +E + ++E+RE E+
Sbjct: 26 KKIAE-AKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84
Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYE 227
K L++ +E + + L ++ L + + +LE + ++ K++ E E
Sbjct: 85 QKLEKRLLQKEENLDRKLELLEKREEELEKKEK----------ELEQKQQELEKKEEELE 134
Query: 228 RQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 287
+ ++ ER+S + AK+ L+E
Sbjct: 135 --------------------ELIEEQLQELERISGLTA--EEAKEILLE----------- 161
Query: 288 IADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQ 342
++ + +A +E + ++I E+ K EA K+A +IL Q
Sbjct: 162 ------KV----EEEARHEAAVLI-KEIEEEA---------KEEADKKAKEILAQ 196
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 44.2 bits (105), Expect = 3e-04
Identities = 28/136 (20%), Positives = 59/136 (43%), Gaps = 19/136 (13%)
Query: 327 NLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR-- 384
NLK + ++LD ++++ + + + EL + +K+KR E+ +
Sbjct: 156 NLKDDLESLIASAKEELDQLSKKLAELKAEEEEEL------ERALKEKREELLSKLEEEL 209
Query: 385 IDKLEDHIRQNEASLK-DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
+ +LE E L+ + ++ KEEL ++ +++L Q +E +A K +
Sbjct: 210 LARLESKEAALEKQLRLEFEREKEEL----------RKKYEEKLRQELERQAEAHEQKLK 259
Query: 444 DTRRKKKQELVENFKK 459
+ + EL F K
Sbjct: 260 NELALQAIELQREFNK 275
Score = 37.7 bits (88), Expect = 0.031
Identities = 66/292 (22%), Positives = 111/292 (38%), Gaps = 43/292 (14%)
Query: 170 KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKR---KAEY 226
K R S + +++A A+ V K I +L L ++ + +
Sbjct: 76 KDEPKQVKIPRQSGLSEEVAEAEAKATSVAAEA-TTPKSIQELVEALEELLEELLKETAS 134
Query: 227 ER--QSIPGRDINL-ESAQDVEINKKRPSLIKS-KERVSHIQKKLASAKKSLVE--VRQA 280
+ Q + +L +S ++ + SLI S KE + + KKLA K E R
Sbjct: 135 DPVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERAL 194
Query: 281 NEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKIL 340
E + ++ LE +L + E + LE E+ K E K+ + L
Sbjct: 195 KEKREELLSKLEEEL----LARLESKE-------AALEKQLRLEFEREKEELRKKYEEKL 243
Query: 341 QQLDTINREQKGDQDKLDNELRQQVQTQN-----EIKKKRHE--------MEEAQKRIDK 387
+Q + R+ + + KL NEL Q EIK+K E + E R+
Sbjct: 244 RQ--ELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKG 301
Query: 388 LEDHIRQNEASLKDNKKLK------EELNSDV-GSSKNRVQELQKELEQVIE 432
LE + + +N K++ E L S + S + L KEL+ + E
Sbjct: 302 LEKALDSRSEAEDENHKVQQLWLAVEALKSALKSGSAGSPRPLVKELDALKE 353
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 43.7 bits (104), Expect = 5e-04
Identities = 49/260 (18%), Positives = 91/260 (35%), Gaps = 49/260 (18%)
Query: 210 ADLETQLADVRKRKAEYERQSIPGRDIN-----LESAQDV--EINKKRPSLIKSKERVSH 262
AD++ QL + K+K + +D+ L+ E + + L ++ ++
Sbjct: 39 ADVQAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQ 98
Query: 263 IQKKLASAKKSLVEVRQANEAHNKDIADLE-------TQLADVRKRKAEYERQSIPGRDI 315
Q +L + K E + + + LE QL + + AEY Q + +
Sbjct: 99 AQAELEALKDDNDEETRETLS-TLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQ 157
Query: 316 NLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKR 375
E AQ Y N + R +I L K L R +Q +
Sbjct: 158 -PERAQAALYANSQ-----RLQQIRNLLKGGKVGGK----ALRPSQRVLLQAE------- 200
Query: 376 HEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELG 435
L SL+ N +L++ L R+Q L+ +L Q+++E
Sbjct: 201 ---------QALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQL-QLLQEAI 250
Query: 436 DAKTDKHEDTRRKKKQELVE 455
++K R ++ V+
Sbjct: 251 NSK-------RLTLSEKTVQ 263
Score = 38.7 bits (91), Expect = 0.020
Identities = 59/329 (17%), Positives = 119/329 (36%), Gaps = 93/329 (28%)
Query: 94 AKEVEHQLFKLY---HNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALN 150
+V+ QL L E + K ++ +L++ ++KI+R+KE+ E L
Sbjct: 38 EADVQAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETE-----------QLK 86
Query: 151 RELAKVDQEIREMDVEINK-KRPSLIKSKER-----VSHIQKKLASAKKSLVEVRQANEA 204
++LA+ ++R+ E+ K + +++E + ++ +LA L +
Sbjct: 87 QQLAQAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAE 146
Query: 205 HNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQ 264
+N + L+TQ R + A Y N + Q +I + + Q
Sbjct: 147 YNSQLVSLQTQPE--RAQAALYA---------NSQRLQ--QIRNLLKGGKVGGKALRPSQ 193
Query: 265 KKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRD--INLESAQM 322
+ L A+++L L Q + +R+S+ G +L Q
Sbjct: 194 RVLLQAEQAL----------------LNAQN--------DLQRKSLEGNTQLQDLLQKQR 229
Query: 323 TEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQT-QNEIKKKRHEMEEA 381
+Y + I R L Q+Q Q I KR + +
Sbjct: 230 -DYLTAR----------------IQR------------LEHQLQLLQEAINSKR--LTLS 258
Query: 382 QKRIDKLEDHIRQNEASLKDNKKLKEELN 410
+K + + + + A ++ N + +EL
Sbjct: 259 EKTVQEAQS--QDEAARIQANPLVAQELE 285
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 43.1 bits (102), Expect = 8e-04
Identities = 44/237 (18%), Positives = 91/237 (38%), Gaps = 47/237 (19%)
Query: 252 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE-YERQSI 310
L K K R + Q + K ++ Q + +NK+I + + + RK Y+
Sbjct: 170 KLNKDKIRELNQQIQTLDMKIDHIQ--QQIKTYNKNIEEQRKKNGENIARKQNKYDELVE 227
Query: 311 PGRDINLESAQMTE-----------YTNLKAEATKRAGKILQQLDTINRE----QKGD-- 353
+ I E ++T+ + + A KI +++ + +KG
Sbjct: 228 EAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVC 287
Query: 354 -----------------QDKLDN---ELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIR 393
+DKL L + +E+++ E E K++ +L++ I
Sbjct: 288 PTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKIS 347
Query: 394 QNEASL-------KDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
N+ SL K K EEL ++ + + +LQ EL+++++ + +K+
Sbjct: 348 TNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKYH 404
Score = 38.5 bits (90), Expect = 0.023
Identities = 33/215 (15%), Positives = 79/215 (36%), Gaps = 31/215 (14%)
Query: 112 KELEDELDKKKGEV-----EKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
+ +E KK GE K + E+A+ I K E L EL + +I +
Sbjct: 201 NKNIEEQRKKNGENIARKQNKYDELVEEAKTI----KAEIEELTDELLNLVMDIEDPSAA 256
Query: 167 INKKRPSLIKSKERVSHIQK----------------KLASAKKSLVEVRQANEAHNKDIA 210
+NK + K K ++ QK +++ + +++ + +
Sbjct: 257 LNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLE 316
Query: 211 DLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASA 270
L+T + ++ + E+ QS ++ +I+ + SLI ++ ++ +
Sbjct: 317 KLDTAIDELEEIMDEFNEQSKKLLELK------NKISTNKQSLITLVDKAKKVKAAIEEL 370
Query: 271 KKSLVEVRQANEAHNKDIADLETQLADVRKRKAEY 305
+ V+ + ++ + +++ K K
Sbjct: 371 QAEFVDNAEELAKLQDELDKIVKTKSELVKEKYHR 405
Score = 35.8 bits (83), Expect = 0.12
Identities = 18/105 (17%), Positives = 40/105 (38%), Gaps = 6/105 (5%)
Query: 622 DKEMGNLKAQKEKLSEELREAMKK----SRKESELNTVQSTIKGLEIRLNYSRQDLQNTK 677
+ +K + ++L L + E N + L+ +++ ++Q L
Sbjct: 298 PDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLV 357
Query: 678 SQIAKLEAEIDALNA-RADATEPKIKAIEASMTARGDTISRKKEE 721
+ K++A I+ L A D E + ++ + T S +E
Sbjct: 358 DKAKKVKAAIEELQAEFVDNAEE-LAKLQDELDKIVKTKSELVKE 401
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 43.1 bits (102), Expect = 9e-04
Identities = 41/263 (15%), Positives = 97/263 (36%), Gaps = 28/263 (10%)
Query: 178 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 237
+ER+ ++ L SA V ++ E + +KR R ++ ++
Sbjct: 606 RERLQQAEEALQSA----VAKQKQAEEQLVQAN----AELEEQKRAEAEARTALKQARLD 657
Query: 238 LESAQDVEINKKRPSLIKSKERVSHIQK---KLASAKKSLVEVRQANEAHNKDIADLETQ 294
L+ Q+ + + K + ER + +L + K L+E +QA KD
Sbjct: 658 LQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKD------D 711
Query: 295 LADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQ 354
++R + + D L L A + +L + ++ +
Sbjct: 712 FRELRTERLAKWQVVEGELDNQL--------AQLSAAIEAARTQAKARLKELKKQYDREL 763
Query: 355 DKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKL---EDHIRQNEASLKDNKKLKEELNS 411
LD + + + +I++ +E R ++ +++ ++ + L
Sbjct: 764 ASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAI 823
Query: 412 DVGSSKNRVQELQKELEQVIEEL 434
+ ++ +ELQ+EL ++I++
Sbjct: 824 QLRELESSAEELQQELTRLIKDT 846
Score = 42.4 bits (100), Expect = 0.001
Identities = 72/426 (16%), Positives = 152/426 (35%), Gaps = 57/426 (13%)
Query: 70 AERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIE 129
AE +E + E + Q E KE +L + ++ +L + E+E +E
Sbjct: 280 AEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQ------ELSAANAKLAADRSELELLE 333
Query: 130 RRKEKAENILREK-KKEQGAL---NRELAKVDQEIREMDVEINK------KRPSLIKS-- 177
+K E+ E+ + + L EL +V+ + + + + IK
Sbjct: 334 DQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQL 393
Query: 178 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVR---KRKAEYERQSIPGR 234
+ + ++LA+ ++ + A +D+ LE+QL K + E + R
Sbjct: 394 ERDLEKNNERLAAIREEKDRQKAA---IEEDLQALESQLRQQLEAGKLEFNEEEYELELR 450
Query: 235 DINLESAQ----------------DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
L+ D + K + +++ V +Q +L +K
Sbjct: 451 LGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRK---RRD 507
Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGK 338
+A EA + E +L +R+ E E Q P L + N + GK
Sbjct: 508 EALEALQRA----ERRLLQLRQALDELELQLSPQAGSLLHFLR-----NEAPGWEESIGK 558
Query: 339 IL---QQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEA-QKRIDKLEDHIRQ 394
++ T Q + D + Q E ++R+ + E+ ++
Sbjct: 559 VISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANETELRERLQQAEEALQS 618
Query: 395 NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELV 454
A K ++ + N+++ K E + L+Q +L + ++ + + K + +
Sbjct: 619 AVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQ-NEQQSLKDKLELAIA 677
Query: 455 ENFKKA 460
E ++A
Sbjct: 678 ERKQQA 683
Score = 42.0 bits (99), Expect = 0.002
Identities = 77/431 (17%), Positives = 158/431 (36%), Gaps = 59/431 (13%)
Query: 62 YLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKK 121
Y + K + E+ E +EK+ E+ IREE ++ E D++ LE +L ++
Sbjct: 382 YERLKQKIKEQLERDLEKNN-ERLAAIREEKDRQKAAI--------EEDLQALESQLRQQ 432
Query: 122 KGEVEKIERRKEKAENILREKKKEQGALNRELAKVD--QEIREMDVEINKKRPSLIKSKE 179
E+ + + + L+ A + +++ D + K + +++
Sbjct: 433 LEA--GKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEA 490
Query: 180 RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
V +Q +L +K +A EA + E +L +R+ E E Q P L
Sbjct: 491 NVEQLQSELRQLRK---RRDEALEALQRA----ERRLLQLRQALDELELQLSPQAGSLLH 543
Query: 240 --SAQDVEINKKRPSLIKSK--ERVS-HIQKKLASAKKSL------VEVRQANEAHNKD- 287
+ + +I + ER Q S +L ++ + +
Sbjct: 544 FLRNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANET 603
Query: 288 -----IADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQ 342
+ E L ++ + E Q + N E + A K+A LQ+
Sbjct: 604 ELRERLQQAEEALQSAVAKQKQAEEQLV---QANAELEEQKRAEAEARTALKQARLDLQR 660
Query: 343 L--------DTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQK------RIDKL 388
L D + + + + +LRQ ++ +++ EA K R ++L
Sbjct: 661 LQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERL 720
Query: 389 EDHIRQNEASLKDNK--KLKEELNSDVGSSKNRVQELQKELEQVIEELG-DAKTDKHEDT 445
Q DN+ +L + + +K R++EL+K+ ++ + L D T K
Sbjct: 721 AK--WQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKR 778
Query: 446 RRKKKQELVEN 456
+ ++ + +E
Sbjct: 779 QIEELETTIER 789
Score = 36.6 bits (85), Expect = 0.078
Identities = 45/246 (18%), Positives = 97/246 (39%), Gaps = 38/246 (15%)
Query: 626 GNLKAQKEKLSEELRE-AMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLE 684
G L A +E+L+EE E K+R +L T++ +K LN Q+L +++A
Sbjct: 270 GELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELN---QELSAANAKLAADR 326
Query: 685 AEIDALNARADATE----PKIKA-------IEASMTARGDTISRKKEEMNSVEDIVFRDF 733
+E++ L + A E +++A I + + + + V+ R +
Sbjct: 327 SELELLEDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQ----RKY 382
Query: 734 CKSIGV--------STIRQYEEAELRSQQERQKICQDKDTKKNVARWE-------RAVSD 778
+ A +R +++RQK ++D + ++ ++
Sbjct: 383 ERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNE 442
Query: 779 DEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKD 838
+E EL G ++ A ++LE + +A++ EE +A V + +
Sbjct: 443 EEYELELRLGRLKQRLDSATATPEELEQLEI----NDEALEKAQEEQEQAEANVEQLQSE 498
Query: 839 IQAAQK 844
++ +K
Sbjct: 499 LRQLRK 504
Score = 36.2 bits (84), Expect = 0.13
Identities = 50/316 (15%), Positives = 125/316 (39%), Gaps = 33/316 (10%)
Query: 95 KEVEHQLFKLYHNETDIKELEDELDKKKGEVEK-IERRKEKAENILREKKKEQGALNREL 153
+ L D++ L++E K ++E I RK++AE LR+ + L +
Sbjct: 642 RAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQ 701
Query: 154 AKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLE 213
+ +++ E+ +R L K + + +LA ++ R +A + +L+
Sbjct: 702 QAFLEALKDDFRELRTER--LAKWQVVEGELDNQLAQLSAAIEAARTQAKAR---LKELK 756
Query: 214 TQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKS 273
Q + + D++ + E+ ++ L + ER++ + ++ +
Sbjct: 757 KQY-----------DRELASLDVDPNT--VKELKRQIEELETTIERIAVRRPEVREYRAF 803
Query: 274 LVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEAT 333
+ E ++ ++ +L QL ++ E +++ + + + + +
Sbjct: 804 MQETWLHRDSLREERPNLAIQLRELESSAEELQQE---------LTRLIKDTKLRRKKLE 854
Query: 334 KRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIR 393
+ + +QLD ++ +G +D++ +Q+ E +R+D+LE+ R
Sbjct: 855 QERKALEKQLDQLDELLRGLRDEM-----RQLAELKEPANANQAEGSISERLDQLEEFKR 909
Query: 394 QNEASLKDNKKLKEEL 409
+ + D KK E
Sbjct: 910 KRKRLSGDLKKFLERF 925
Score = 32.0 bits (73), Expect = 2.1
Identities = 49/296 (16%), Positives = 103/296 (34%), Gaps = 52/296 (17%)
Query: 662 LEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARG--DTISRKK 719
L RL + + LQ+ ++ + E ++ NA + K EA + + R +
Sbjct: 605 LRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQ--KRAEAEARTALKQARLDLQRLQ 662
Query: 720 EEMNSVED---IVFRDFCKSIGVSTIRQYEEAELRSQQERQKICQDKDTKKN-----VAR 771
E S++D + + + + + + +Q++ + KD + +A+
Sbjct: 663 NEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAK 722
Query: 772 WERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARRE 831
W+ + + +LA+ A E + +A +L+ L +D
Sbjct: 723 WQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASL----DVDPNT------ 772
Query: 832 VGSIAKDIQAAQKSCVNLESKLEMKKSERHDIL-----MN---CKMNDIVL--PMLRVQK 881
V + + I+ LE+ +E R ++ M + + P L +Q
Sbjct: 773 VKELKRQIE-------ELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQL 825
Query: 882 YDRKLAKSIQEMTSRLQTIQAP----------NLRAMEKLEH-AKENLMKTNEEFE 926
R+L S +E+ L + +A+EK E L +E
Sbjct: 826 --RELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDEMR 879
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 41.9 bits (99), Expect = 0.001
Identities = 29/117 (24%), Positives = 56/117 (47%), Gaps = 4/117 (3%)
Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELA---KVDQEIREMDV-EIN 168
LE+ L+ K + E + + +I + +K+ AL E+A ++ E+ D E+
Sbjct: 151 GLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELE 210
Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 225
K R L ++S +K+L ++ L E+ A EA ++L ++A+ K + E
Sbjct: 211 KARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREE 267
Score = 33.1 bits (76), Expect = 0.84
Identities = 35/189 (18%), Positives = 78/189 (41%), Gaps = 12/189 (6%)
Query: 127 KIERRKEKAENILREKKKEQGALNRELAKVDQEIRE-MDVEI-NKKRPSLIKSKE----- 179
I ++ I E E L RE +I+ MD + K + ++SK+
Sbjct: 81 YISEGRDLFREIEEETLAENPPLFREYFSASPDIKLLMDNQFQLVKTHARLQSKKTWYEW 140
Query: 180 RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
R+ ++ +++L +++ E NKD+ + + +RK+ + + +L
Sbjct: 141 RMQLLEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIA-----SLR 195
Query: 240 SAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVR 299
D L K+++ + + K++ +K L E++Q + I L + +++
Sbjct: 196 QLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELL 255
Query: 300 KRKAEYERQ 308
+ AE E+
Sbjct: 256 EEIAEAEKI 264
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 41.6 bits (98), Expect = 0.003
Identities = 80/415 (19%), Positives = 169/415 (40%), Gaps = 65/415 (15%)
Query: 66 KGVVAERKEAKIEKDEAEKYQ---RIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKK 122
E ++ + E+E Q ++ +E V K+ E I+ELE+EL+ ++
Sbjct: 2 LERQKRELENQLYRKESELSQLSSKLEDEQAL--VAQLQKKIKELEARIRELEEELEAER 59
Query: 123 GEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD------VEINKKRPSLIK 176
K E+ + L+REL ++ + + E +E+NKKR
Sbjct: 60 AARAKAEKARAD--------------LSRELEELSERLEEAGGATAAQIELNKKR----- 100
Query: 177 SKERVSHIQKKLASAKKSLVEVRQANEA-HNKDIADLETQLADVRKRK--AEYERQSIPG 233
+ ++ ++K L A E H I +L Q+ ++K+K AE E+ +
Sbjct: 101 -EAELAKLRKDLEEANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQA 159
Query: 234 RDINLESAQDVEINKKRPSLIKSK---ERVSHIQKKLASAKKSLVEVRQA-------NEA 283
+L + D K + K+K ++S +Q KL ++ L ++ N
Sbjct: 160 EVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSENSD 219
Query: 284 HNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQL 343
+ + + E Q++++ K K++ E +Q+ E E ++ + QL
Sbjct: 220 LTRQLEEAEAQVSNLSKLKSQLE-------------SQLEEAKRSLEEESRERANLQAQL 266
Query: 344 DTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQK--------RIDKLEDHIRQN 395
+ + +++L+ E + + + ++ K E+++ + R ++LE+ ++
Sbjct: 267 RQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELKKKL 326
Query: 396 EASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKK 450
+ + ++ E N+ S + LQ ELE + EL A E +++K
Sbjct: 327 NQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKN 381
Score = 37.7 bits (88), Expect = 0.040
Identities = 76/359 (21%), Positives = 155/359 (43%), Gaps = 57/359 (15%)
Query: 108 ETDIKELEDELD-KKKGEVEKIER--------RKEKAENILREKKKEQGALN-------- 150
E + E E+E + +K IE K KAE R KKK +G +N
Sbjct: 506 ERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAE-ASRLKKKLEGDINELEIALDH 564
Query: 151 ---------RELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQA 201
+ + K Q+++E+ ++ +++ + ++E+++ +++ + + L E+R A
Sbjct: 565 ANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERRATALEAELEELRSA 624
Query: 202 NEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVS 261
E A+ +++AE E R L + I +KR K + ++
Sbjct: 625 LEQ------------AERARKQAETELAEASERVNELTAQNSSLIAQKR----KLEGELA 668
Query: 262 HIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ 321
+Q L A E++ A E K AD ++R+ + + + + + +
Sbjct: 669 ALQSDLDEAVN---ELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKE 725
Query: 322 MTE-YTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEM-- 378
+ +A A K K++Q+L+ RE + + LD E R+ +TQ ++K +
Sbjct: 726 LQVRLDEAEAAALKGGKKMIQKLEARVRELEAE---LDGEQRRHAETQKNLRKMERRVKE 782
Query: 379 -----EEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIE 432
EE +K +++L+D + + +A +K K+ EE + ++ ++ Q+ELE E
Sbjct: 783 LQFQVEEDKKNLERLQDLVDKLQAKIKTYKRQLEEAEEVAQINLSKYRKAQRELEDAEE 841
Score = 33.5 bits (77), Expect = 0.76
Identities = 63/364 (17%), Positives = 155/364 (42%), Gaps = 37/364 (10%)
Query: 71 ERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIER 130
E+K+ +K AE +++ E + + + T++ L++EL++ K +VE + R
Sbjct: 376 EKKQKNFDKILAEWKRKVDELQAELDTAQREARNL--STELFRLKNELEELKDQVEALRR 433
Query: 131 RKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLAS 190
+ ++ + + + G R + ++++ R ++ E ++ + +L +++ + + K
Sbjct: 434 ENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALELEESK--- 490
Query: 191 AKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKR 250
++ VE+ Q + +A+ E + + RK N + A + +
Sbjct: 491 VLRAQVELSQIRSEIERRLAEKEEEFENTRK---------------NHQRAIE-SLQATL 534
Query: 251 PSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSI 310
+ K K S ++KKL + E+ A + NK A+ + + +++ E + Q
Sbjct: 535 EAEAKGKAEASRLKKKLEG---DINELEIALDHANKANAEAQKNVKKYQQQVKELQTQ-- 589
Query: 311 PGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNE 370
+ E + A A +RA + +L+ + + L+ R + Q + E
Sbjct: 590 ----VEEEQRAREDAREQLAVAERRATALEAELEEL-------RSALEQAERARKQAETE 638
Query: 371 IKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQV 430
+ + + E + L R+ E L + +E +++ +++ R ++ Q + ++
Sbjct: 639 LAEASERVNELTAQNSSLIAQKRKLEGELAALQSDLDEAVNELKAAEERAKKAQADAARL 698
Query: 431 IEEL 434
EEL
Sbjct: 699 AEEL 702
Score = 32.7 bits (75), Expect = 1.3
Identities = 66/347 (19%), Positives = 145/347 (41%), Gaps = 61/347 (17%)
Query: 109 TDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEIN 168
+++++L+ EL++ ++E++++ + IL E K++ L EL +E R + E+
Sbjct: 356 SELEDLQIELERANAAASELEKKQKNFDKILAEWKRKVDELQAELDTAQREARNLSTELF 415
Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 228
+ + L + K++V +R+ N+ +I DL QL + + E E+
Sbjct: 416 RLKNELEELKDQVE--------------ALRRENKNLQDEIHDLTDQLGEGGRNVHELEK 461
Query: 229 QSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDI 288
+ E ++ + +L +++ + + K ++ VE+ Q + +
Sbjct: 462 ---------ARRRLEAEKDELQAALEEAEAALELEESK---VLRAQVELSQIRSEIERRL 509
Query: 289 ADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINR 348
A+ E + + RK R I A + KAEA++ K+ +
Sbjct: 510 AEKEEEFENTRKNHQ---------RAIESLQATLEAEAKGKAEASRLKKKLEGDI----- 555
Query: 349 EQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEE 408
NEL + N+ EAQK + K + +++ + +++ ++ +E+
Sbjct: 556 ----------NELEIALDHANKAN------AEAQKNVKKYQQQVKELQTQVEEEQRARED 599
Query: 409 LNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVE 455
+ ++ R L+ ELE++ L A E R++ + EL E
Sbjct: 600 AREQLAVAERRATALEAELEELRSALEQA-----ERARKQAETELAE 641
>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
Provisional.
Length = 1109
Score = 41.6 bits (98), Expect = 0.003
Identities = 73/344 (21%), Positives = 148/344 (43%), Gaps = 63/344 (18%)
Query: 128 IERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKR--PSLIKSKERVSHIQ 185
+E RK E K+ Q ++ K+ E+R+ ++N +R P + ++
Sbjct: 57 LEERKGSLERA----KQYQQVID-NFPKLSAELRQ---QLNNERDEPRSVPPNMSTDALE 108
Query: 186 KKLASAKKSLVEV-RQANEAHNK--DIADLETQL----ADVRKRKAEYER--QSIPGRDI 236
+++ L+E RQA + ++ +I+D +QL + R++ E ER Q++ +
Sbjct: 109 QEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNT 168
Query: 237 NLESAQDVEINKKRPSLIKSKERVSHIQKKLASA--KKSL----VEVRQANEAH-NKDIA 289
L AQ + + +L K V ++ SA ++ L E+ + + +
Sbjct: 169 PLAQAQLTALQAESAAL---KALVDELELAQLSANNRQELARLRSELAKKRSQQLDAYLQ 225
Query: 290 DLETQLADVRKRKAE--------YERQS--IP-------------GRDINLESAQMTEYT 326
L QL R+R+AE QS +P + +N ++ +M
Sbjct: 226 ALRNQLNSQRQREAERALESTELLAEQSGDLPKSIVAQFKINRELSQALNQQAQRMDLIA 285
Query: 327 NLKAEATKRAGKILQQLDTINREQK---GDQDKLDNELRQQVQTQNEIKK-KRHEMEEAQ 382
+ + +A + ++ Q L+T+ REQ G + L LR QV E+ K ++ + E AQ
Sbjct: 286 SQQRQAASQTLQVRQALNTL-REQSQWLGVSNALGEALRAQVARLPEMPKPQQLDTEMAQ 344
Query: 383 KRIDKL--EDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQ 424
R+ +L ED + + +++++ + + +NR+ + Q
Sbjct: 345 LRVQRLRYEDLLNK----QPQLRQIRQADGQPLTAEQNRILDAQ 384
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 41.1 bits (97), Expect = 0.003
Identities = 48/258 (18%), Positives = 96/258 (37%), Gaps = 53/258 (20%)
Query: 78 EKDEAEKYQRIREEIVAKEVEHQLFKL--YHNETDIKELEDELDKKKGEVEKIERRKEKA 135
E+ E+ +++R ++ L KL Y + + E + K E I+ +E+
Sbjct: 38 EELSNERLRKLRSL--LTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEEL 95
Query: 136 ENILREKKKEQGALNRELAKVDQEIREMDVEINKKRP--------SLIKSKE-------R 180
E I +E K+ L E+++++ EI+E++ EI + P SL+ +
Sbjct: 96 EKIEKEIKE----LEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGT 151
Query: 181 VSHIQKKLASAKKSLVEVRQANEAHNKD---IADLETQLADVRK--RKAEYERQSIPGRD 235
V + + + + V + + L+ +V + +K +ER +
Sbjct: 152 VPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELE--- 208
Query: 236 INLESAQDVEINKKRPS--LIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET 293
+ PS + + KE + I+K+ S + L E+ + E
Sbjct: 209 -----------EEGTPSELIREIKEELEEIEKERESLLEELKELAKKYL---------EE 248
Query: 294 QLADVRKRKAEYERQSIP 311
LA + E ER
Sbjct: 249 LLALYEYLEIELERAEAL 266
Score = 38.0 bits (89), Expect = 0.026
Identities = 45/291 (15%), Positives = 105/291 (36%), Gaps = 52/291 (17%)
Query: 188 LASAKKSLVEV-RQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEI 246
L S K ++E + H I DL+ +L++ R RK + L S
Sbjct: 14 LKSYKDEVLEALHELGVVH---IEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKL- 69
Query: 247 NKKRPSLIKSKERVSHIQKKLASAKKSLVEV-RQANEAHNKDIADLETQLADVRKRKAEY 305
K S +++ + ++ L ++ ++ E + I++LE ++ ++ +
Sbjct: 70 -NPLREEKKKVSVKS-LEELIKDVEEELEKIEKEIKELEEE-ISELENEIKELEQEIERL 126
Query: 306 ERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQV 365
E P + +L+ + + + + G + +DKL+ +
Sbjct: 127 E----PWGNFDLDLSLLLGFKYVSV----FVGTV-------------PEDKLEELKLESD 165
Query: 366 QTQNEIKKKRHEME-----EAQKRIDKLEDHIRQNEAS---LKDNKKLKEELNSDVGSSK 417
E ++ D++E+ +++ L++ E + K
Sbjct: 166 VENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIRE----IK 221
Query: 418 NRVQELQKELEQVIEELGDAKTDKHEDTRR---------KKKQELVENFKK 459
++E++KE E ++EEL + K+ + ++ E + F K
Sbjct: 222 EELEEIEKERESLLEELKE-LAKKYLEELLALYEYLEIELERAEALSKFLK 271
Score = 31.1 bits (71), Expect = 3.5
Identities = 42/263 (15%), Positives = 99/263 (37%), Gaps = 49/263 (18%)
Query: 721 EMNSVEDIVFRDFCKSIGVSTIRQYEE---------AELRSQQERQKICQDKDTKKNVAR 771
E+ V D + + +R+ +LRS + +++ K +V
Sbjct: 27 ELGVVH---IEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKS 83
Query: 772 WERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVD---AMDEEIGKA 828
E + D EEEL + EK E+ E +LEN + + ++ D ++
Sbjct: 84 LEELIKDVEEELEKI----EKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLL 139
Query: 829 RRE------VGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMNDIVLPMLRVQKY 882
VG++ +D K ++E+ + + + ++ +V
Sbjct: 140 LGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVV------VV--------V 185
Query: 883 DRKLAKSIQEMTSRL--QTIQAPNLR-AMEKLEHAKENLMKTNEEFENARKRAKKAKANF 939
++L+ ++E +L + ++ E + KE L + +E E+ + K+
Sbjct: 186 LKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKE----- 240
Query: 940 DRIKKERYDKFTRCFEHVSNEID 962
+ K+ ++ +E++ E++
Sbjct: 241 --LAKKYLEELLALYEYLEIELE 261
Score = 30.3 bits (69), Expect = 6.5
Identities = 17/77 (22%), Positives = 26/77 (33%), Gaps = 21/77 (27%)
Query: 630 AQKEKLSEELREAMKKSR-------KESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAK 682
K E ++ + E EL ++ IK LE +I++
Sbjct: 66 LPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELE--------------EEISE 111
Query: 683 LEAEIDALNARADATEP 699
LE EI L + EP
Sbjct: 112 LENEIKELEQEIERLEP 128
Score = 30.3 bits (69), Expect = 7.0
Identities = 20/120 (16%), Positives = 43/120 (35%), Gaps = 15/120 (12%)
Query: 52 IQAEEETNMSYLKKKGVVAERKEAKIEK-----DEAEKYQRIREEIVAKEVEHQLFKLYH 106
+ ++ + V K E + EE+ K++ + +L
Sbjct: 152 VPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEEL--KKLGFERLELE- 208
Query: 107 NETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
+ + + + K E+E+IE+ +E L+E K+ E E E+++E
Sbjct: 209 ---EEGTPSELIREIKEELEEIEKERESLLEELKELAKK----YLEELLALYEYLEIELE 261
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 40.4 bits (95), Expect = 0.003
Identities = 23/147 (15%), Positives = 59/147 (40%), Gaps = 26/147 (17%)
Query: 323 TEYTNLKAE---ATKRAGKILQQLDTINREQKGDQDKLD--------NELRQQVQTQNEI 371
+ L E ++ + L+ + + D KL+ N++R ++ E
Sbjct: 27 LDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKE- 85
Query: 372 KKKRHEMEEAQKRIDKLEDHIRQNEASLKD-NKKLKEELNSDVGSSKNRVQELQKELEQV 430
E+E+ ++RI++L+ + Q + L + +L++ S + ++ ++ + +L +
Sbjct: 86 -----EIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNAL 140
Query: 431 IEELGDAKTDKHEDTRRKKKQELVENF 457
+ R +EL + F
Sbjct: 141 --------HSLLAEKRSFLCRELAKLF 159
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 41.2 bits (97), Expect = 0.003
Identities = 59/318 (18%), Positives = 128/318 (40%), Gaps = 17/318 (5%)
Query: 116 DELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLI 175
D +D+ G++ + RR+ K + E AL EL + ++ ++ EI R L
Sbjct: 182 DLIDRLAGDLTNVLRRR-KKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELE 240
Query: 176 KSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIP-GR 234
+++ + ++KK S L E R+ E + + ++E R + E +P
Sbjct: 241 EAQRSLESLEKKFRSEGGDLFEEREQLE---RQLKEIEAARKANRAQLRELAADPLPLLL 297
Query: 235 DINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQ 294
NL + ++ K+ S +++ ++++ +SL ++ H K+IA +
Sbjct: 298 IPNLLDSTKAQLQKEEQSQ-QNQLTQEELEERDKELLESLPKL-ALPAEHVKEIAAELAE 355
Query: 295 LADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQ 354
+ +E + ++T+ L + + QL RE + +
Sbjct: 356 IDKPATTDSEIPHRLSGS--------ELTQLEVLIQQVKRELQDAKSQLLKELRELEEEL 407
Query: 355 DKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVG 414
++D ++ + ++ +I + E+ EAQ + + E I + L+ K+ E L +
Sbjct: 408 AEVDKKI-STIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLD 466
Query: 415 SSKNRVQELQKELEQVIE 432
+ ELE+ I
Sbjct: 467 EKTKQKIN-AFELERAIT 483
Score = 35.8 bits (83), Expect = 0.15
Identities = 50/270 (18%), Positives = 107/270 (39%), Gaps = 13/270 (4%)
Query: 49 AEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNE 108
+E+ E E K + + E + E +EA++ E+ E + E
Sbjct: 209 SEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEEREQLE 268
Query: 109 TDIKELEDELDKKKGEVEKIERR-------KEKAENILREKKKEQGALNRELAKVDQEIR 161
+KE+E + ++ ++ ++ + +KE+ + +L + E+
Sbjct: 269 RQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQE--ELE 326
Query: 162 EMDVEINKKRPSLIKSKERVSHIQKKLA-SAKKSLVEVRQANEAHNKDIADLETQLADVR 220
E D E+ + P L E V I +LA K + + + ++ LE + V+
Sbjct: 327 ERDKELLESLPKLALPAEHVKEIAAELAEIDKPATTDSEIPHRLSGSELTQLEVLIQQVK 386
Query: 221 KRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQA 280
+ + + Q + E +V+ K+ S I S+E+++ + ++L A+ L
Sbjct: 387 RELQDAKSQLLKELRELEEELAEVD---KKISTIPSEEQIAQLLEELGEAQNELFRSEAE 443
Query: 281 NEAHNKDIADLETQLADVRKRKAEYERQSI 310
E + + L+ + +RK E +Q I
Sbjct: 444 IEELLRQLETLKEAIEALRKTLDEKTKQKI 473
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 41.2 bits (97), Expect = 0.003
Identities = 43/247 (17%), Positives = 89/247 (36%), Gaps = 33/247 (13%)
Query: 78 EKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAEN 137
K+ + +I + ++ +K LE L EVEK+E + E
Sbjct: 43 LKELEKLKVAEVAQISLSSLLSEVLDYL---RSVKGLEGRLFILPEEVEKLEAELKSLEE 99
Query: 138 ILREKKKEQGALNRELAKVDQEIREMDVEINKKRP---------------SLIKSKE--- 179
+++ +K + K+++ + E+D E+ SL++ +
Sbjct: 100 VIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELEPLAYLDFDLSLLRGLKFLL 159
Query: 180 -RVSHIQKKLASAKKSLVEVRQANEAHNKD------IADLETQLADVRK--RKAEYERQS 230
R+ ++++ A ++E A N + +A L V K + +E
Sbjct: 160 VRLGLVRREKLEALVGVIEDEVALYGENVEASVVIVVAHGAEDLDKVSKILNELGFELYE 219
Query: 231 IPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKS-LVEVRQANEAHNKDIA 289
+P D E+ + + E + + LA L+ VR+ E K +
Sbjct: 220 VPEFDGGPSELIS-ELEEVIAEIQDELESLRSELEALAEKIAEELLAVREILEI-EKALG 277
Query: 290 DLETQLA 296
D+ ++LA
Sbjct: 278 DVLSKLA 284
Score = 33.9 bits (78), Expect = 0.48
Identities = 41/287 (14%), Positives = 98/287 (34%), Gaps = 39/287 (13%)
Query: 192 KKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRP 251
L E+ H +D+ + E L ++ K K Q ++ ++ ++ +
Sbjct: 19 DPVLAELHDFGLVHLEDLEEGEKGLKELEKLKVAEVAQ----ISLSSLLSEVLDYLRSVK 74
Query: 252 SLIKS----KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 307
L E V ++ +L S ++ + + + + LE +L+++ + + E
Sbjct: 75 GLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLED 134
Query: 308 QSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQT 367
LE + Y + + +L +L + RE+ + +V
Sbjct: 135 L--------LEELEPLAYLDFDLSLLRGLKFLLVRLGLVRREKLEALVG---VIEDEVAL 183
Query: 368 QNEIKKKRHEM-----EEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRV-Q 421
E + + E ++ K+ + + + + + S++ S V
Sbjct: 184 YGENVEASVVIVVAHGAEDLDKVSKILNELGFELYEVPE----FDGGPSELISELEEVIA 239
Query: 422 ELQKELEQVIEELGDAKTDKHED---------TRRKKKQELVENFKK 459
E+Q ELE + EL +K + K +++ +
Sbjct: 240 EIQDELESLRSELEALA-EKIAEELLAVREILEIEKALGDVLSKLAR 285
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 40.7 bits (95), Expect = 0.005
Identities = 58/269 (21%), Positives = 106/269 (39%), Gaps = 53/269 (19%)
Query: 200 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ-DVEINKKRPSLIKSK- 257
A E +DI L+ +L R+ ++E R I S + D+ KK ++++K
Sbjct: 406 SAIERLEQDIKKLQAELQQARQNESEL-RNQISLLTSLERSLKSDLGQLKKENDMLQTKL 464
Query: 258 -ERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 316
VS QK K+S+ + + ++ + E QLA+ +KRK E E
Sbjct: 465 NSMVSAKQKD----KQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEE---------- 510
Query: 317 LESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQT-QNEIKKKR 375
E RA +RE+ + L+Q Q + EIKK
Sbjct: 511 --------------ETAARAAAQAAA----SREECAES------LKQAKQDLEMEIKKLE 546
Query: 376 HEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELE------- 428
H+++ ++ LE ++ ++++K E L S + + +++ L+ L
Sbjct: 547 HDLKLKEEECRMLEKEAQELR-KYQESEKETEVLMSALQAMQDKNLMLENSLSAETRLKL 605
Query: 429 QVIEELGDAKTDKH--EDTRRKKKQELVE 455
+ LGD K K++ E+V+
Sbjct: 606 DLFSALGDVKRQLEIAHGAIYKREAEIVD 634
Score = 30.3 bits (68), Expect = 6.1
Identities = 45/221 (20%), Positives = 94/221 (42%), Gaps = 17/221 (7%)
Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
E DIK+L+ EL + + ++ + ++ R K + G L +E + ++ M
Sbjct: 412 EQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAK 471
Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYE 227
K + S+ ++R+ +A+K L E ++ + + A Q A R+ AE
Sbjct: 472 QKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESL 531
Query: 228 RQSIPGRDINLESAQ-DVEINKKRPSLIKSKERVSHIQKKLASAKKS------LVEVRQA 280
+Q+ +D+ +E + + ++ K ++ ++K S K++ L ++
Sbjct: 532 KQAK--QDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDK 589
Query: 281 NEAHNKDIA-------DLETQLADVRKRKAEYERQSIPGRD 314
N ++ DL + L DV KR+ E +I R+
Sbjct: 590 NLMLENSLSAETRLKLDLFSALGDV-KRQLEIAHGAIYKRE 629
>gnl|CDD|213239 cd03272, ABC_SMC3_euk, ATP-binding cassette domain of eukaryotic SMC3
proteins. The structural maintenance of chromosomes
(SMC) proteins are large (approximately 110 to 170 kDa),
and each is arranged into five recognizable domains.
Amino-acid sequence homology of SMC proteins between
species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code), which
by mutational studies has been shown to be essential in
several proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B' motif,
and a motif with homology to the signature sequence of
the ATP-binding cassette (ABC) family of ATPases. The
sequence homology within the carboxy-terminal domain is
relatively high within the SMC1-SMC4 group, whereas SMC5
and SMC6 show some divergence in both of these sequences.
In eukaryotic cells, the proteins are found as
heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
and SMC5 with SMC6 (formerly known as Rad18).
Length = 243
Score = 39.6 bits (93), Expect = 0.005
Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 995 HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
+ PAPF++ DEIDAALD VA+ I + D Q I + + E AD G+
Sbjct: 176 QKCDPAPFYLFDEIDAALDAQYRTAVANMI-KELSDGAQFITTTFRPELLEVADKFYGV 233
Score = 33.4 bits (77), Expect = 0.47
Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 11/53 (20%)
Query: 13 FKSYKGKFSIGPL-KKFTAVIGPNGSG----------ALKDDYERLKAEMIQA 54
FKSYK + I P K V+G NGSG L D+Y L+ E QA
Sbjct: 9 FKSYKDQTVIEPFSPKHNVVVGRNGSGKSNFFAAIRFVLSDEYTHLREEQRQA 61
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 38.7 bits (90), Expect = 0.008
Identities = 43/208 (20%), Positives = 85/208 (40%), Gaps = 26/208 (12%)
Query: 8 IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG----------ALKDDYERLKAEMIQAEEE 57
+ + NF S+K I K T + GPNGSG AL RLK + +
Sbjct: 2 LTIKNFGSFKD-LEIDFSKGLTLIYGPNGSGKTTILDAIRWALYGKTSRLKKSKGRGIVK 60
Query: 58 TNMSYLKKKGVVAERKEAKIE--KDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELE 115
++ K +G E E K + + + K+ ++ K +I EL+
Sbjct: 61 GDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQ 120
Query: 116 DELDKKKGEVEKIER--------RKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
+ +D+ ++I R E+ + +EKK+ L +EL +++ E ++ +
Sbjct: 121 EFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLL 180
Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSL 195
+K + K+ + ++++L + L
Sbjct: 181 EEK-----EKKKELEELKEELEELLEEL 203
Score = 32.1 bits (73), Expect = 0.95
Identities = 30/157 (19%), Positives = 65/157 (41%), Gaps = 8/157 (5%)
Query: 278 RQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAG 337
+ + +I E + + E + + R I + ++
Sbjct: 55 GRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIV 114
Query: 338 KILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
+I + + I+ K D++ L L + E+K KR E +E + ++K + + +
Sbjct: 115 EIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKD 174
Query: 398 SLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
L+ + KE+ K ++EL++ELE+++EEL
Sbjct: 175 LLEKLLEEKEK--------KKELEELKEELEELLEEL 203
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 38.5 bits (90), Expect = 0.008
Identities = 20/117 (17%), Positives = 49/117 (41%), Gaps = 15/117 (12%)
Query: 341 QQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK 400
++L + +E Q++L + Q ++ + + E +E ++ + LE + + + L
Sbjct: 66 ERLPELQQELAELQEELA-------ELQEQLAELQQENQELKQELSTLEAELERLQKELA 118
Query: 401 DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENF 457
K+L +EL++EL ++ +E + + R ++ E F
Sbjct: 119 RIKQLSANAIE----LDEENRELREELAELKQENEALEAEN----ERLQENEQRRWF 167
Score = 32.3 bits (74), Expect = 0.95
Identities = 14/82 (17%), Positives = 43/82 (52%)
Query: 375 RHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
R + E Q+ + +L++ + + + L + ++ +EL ++ + + ++ LQKEL ++ +
Sbjct: 65 RERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLS 124
Query: 435 GDAKTDKHEDTRRKKKQELVEN 456
+A E+ +++ ++
Sbjct: 125 ANAIELDEENRELREELAELKQ 146
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 39.7 bits (93), Expect = 0.009
Identities = 34/239 (14%), Positives = 83/239 (34%), Gaps = 45/239 (18%)
Query: 258 ERVSHIQKKLASAKKSLVEVRQAN-EAHNKD-IADLETQLADVRKRKAEYERQSIPGRDI 315
RV L S L ++ E +++ + + + ++ AEY RQ+
Sbjct: 152 CRVVVAPSLLESNSSRLSFAQEWRLEWNSQGLEDYRKRKRQRLDQKAAEYLRQAAQAGTT 211
Query: 316 NLESAQ----------------------MTEYTNLKAEATKRAGKILQQLDTINREQKGD 353
SA + A + + + +E++ +
Sbjct: 212 GSSSADDLADLLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMDTEEDRTKEREAE 271
Query: 354 QDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKE------ 407
+ L ++ + + E+ + + K++++ +RQ L++ K+K+
Sbjct: 272 LEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELL 331
Query: 408 --------ELNSDVGSSKNRVQELQKELEQV-------IEELGDAKTDKHEDTRRKKKQ 451
+L + V +S R+ EL ++ E L + +K ++T+R+ +
Sbjct: 332 PDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDE 390
Score = 32.4 bits (74), Expect = 1.6
Identities = 27/168 (16%), Positives = 67/168 (39%), Gaps = 6/168 (3%)
Query: 69 VAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFK--LYHNETDIKELEDELDKKKGEVE 126
E K+ + + ++E+I E + + + IK++ +EL + E E
Sbjct: 256 AEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENE 315
Query: 127 KIE---RRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSH 183
++E + K++ +L + + L + + + E+ + R LI +
Sbjct: 316 ELEEEYKIKKRTVELLPDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKE 375
Query: 184 IQK-KLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQS 230
+ K ++ L E+++ + ++L+T+ ++ EYE
Sbjct: 376 KNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQLLDEYENAP 423
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 39.7 bits (93), Expect = 0.009
Identities = 32/220 (14%), Positives = 75/220 (34%), Gaps = 23/220 (10%)
Query: 52 IQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDI 111
++ E + ++ + + +A++EKD KY + + + + DI
Sbjct: 742 LKKLYEFAYTLEEEIQYLEDLLKARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDI 801
Query: 112 KELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE----- 166
+ + K++ ++ I +L+ + GA +++ I+E +
Sbjct: 802 DFELELIGKQEINIDYI-------LELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKK 854
Query: 167 ------INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVR 220
I K + +++ AKK E A + + + E D
Sbjct: 855 KLIERLIEAINQYRAKKLDTAEKLEELYILAKK---EEEFKQFAEEEGLNEEELAFYDDL 911
Query: 221 KRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERV 260
G ++ + A++ + +K K+KE V
Sbjct: 912 ALNGGKL--PENGTELVEKLAKEKSLREKNKDDWKAKEEV 949
Score = 35.5 bits (82), Expect = 0.21
Identities = 50/310 (16%), Positives = 99/310 (31%), Gaps = 19/310 (6%)
Query: 102 FKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIR 161
KLY NE + E E IE EK E++L E KEQ L + +R
Sbjct: 660 LKLYSNEGEEDLKEALE-------EAIEELVEKLEDVLNE-FKEQSEDIGYLEDAGELLR 711
Query: 162 EMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
+ E+ K + + ++ + + A +L ++ + ++I LE L +
Sbjct: 712 DFAPELEIKVE---RGNKILAKLFGRFLKAFNALKKLYEFAYTLEEEIQYLEDLLKARLE 768
Query: 222 RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQAN 281
+ + + + + +D E K L + I K+ + L ++ N
Sbjct: 769 KDYQSKYLDLIDNLNDAREIKDKES--KAIDLDDIDFELELIGKQEINIDYILELLQTFN 826
Query: 282 EAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQ 341
+ + + E + + E+ + + + K T + L
Sbjct: 827 DKNGAYESLKE----LIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELY 882
Query: 342 QL--DTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASL 399
L +Q +++ L+ E K E E +R+
Sbjct: 883 ILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDD 942
Query: 400 KDNKKLKEEL 409
K+ E
Sbjct: 943 WKAKEEVEAK 952
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 39.6 bits (92), Expect = 0.010
Identities = 71/432 (16%), Positives = 169/432 (39%), Gaps = 44/432 (10%)
Query: 43 DYERLKAEMIQAEEETN--MSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQ 100
+ E LK E+ + + + M+ +K ++E + KDEA + +E++ + E+Q
Sbjct: 11 ENELLKKELERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKEQLRVTQEENQ 70
Query: 101 LFKL--------YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRE 152
+L ++ ++ L+ +L+K E ++ ++ EK+E + E+ E
Sbjct: 71 HAQLTVQALQEELKSQRELNRLQQKLEKSGNETGEL-QQLEKSEGQFSRIQAERERQAME 129
Query: 153 LAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADL 212
L + + ++ + + + + +L E + + + L S S + +E I +
Sbjct: 130 LFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSER-LGRIREA 188
Query: 213 ETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKK 272
E Q + + + E++ + R+ + Q + K + K + K+ +K
Sbjct: 189 EDQQSHLEVLLEQKEKEHMMLREEIHQKLQMERDDAKTEASQKLIDE---KDTKIKEFEK 245
Query: 273 SLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEA 332
L + A+ E R +E +R N ++ + A
Sbjct: 246 MLEK------------AENEIYRLQSRCDTSEADR--------NRLDKEVEAERSALAAM 285
Query: 333 TKRAGKILQQLDTINREQKGDQDKL------DNELRQQVQ-TQNEIKKKRHEMEEAQKRI 385
+ + Q+L E G Q +L D+++RQ + + ++ + E Q +
Sbjct: 286 KAKCDRAAQELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEV 345
Query: 386 DKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELG--DAKTDKHE 443
D L + + +L + + + +++++ E+ +L K + +
Sbjct: 346 DALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQ 405
Query: 444 DTRRKKKQELVE 455
+T R+K++ L E
Sbjct: 406 ETFRRKERRLKE 417
Score = 35.8 bits (82), Expect = 0.16
Identities = 74/376 (19%), Positives = 141/376 (37%), Gaps = 33/376 (8%)
Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
+ D+ E E + EV+ + E+ N L +K A E A EI +M
Sbjct: 328 KEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRY 387
Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYE 227
K L ++++ ++Q+ ++ L E ++ + D +T L + K AE E
Sbjct: 388 EKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDTNT-DTALEKLEKALAEKE 446
Query: 228 R--QSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ------ 279
R + + + E + E + KE V ++Q KL+ + L +++
Sbjct: 447 RIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLA 506
Query: 280 ---------------ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTE 324
E + LE +L +R +R S + A
Sbjct: 507 SNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRSRADSAG 566
Query: 325 YTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR 384
N R K Q+ D E +L+ R + Q + + R E E +K
Sbjct: 567 ARNEVDRLLDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFE--RKS 624
Query: 385 IDKLEDHIRQN-----EASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGD--A 437
+ LE+ R E +L+ + ++ + ++ VQ+L+ +L+Q+ E A
Sbjct: 625 AELLEEAERLEKSEAEEETLRQSTQIGHAQAAAHNHIEHHVQKLESDLKQLRAEREQLVA 684
Query: 438 KTDKHEDTRRKKKQEL 453
+ +K + + + +QEL
Sbjct: 685 QLEKSQQSLMEFQQEL 700
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 39.1 bits (92), Expect = 0.013
Identities = 34/123 (27%), Positives = 62/123 (50%), Gaps = 21/123 (17%)
Query: 66 KGVVAE-RKEAKIEKDEA-----EKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELD 119
K ++ E +KEA+ K EA E+ ++R E+ +E++ + +L E + + E+ LD
Sbjct: 35 KRIIEEAKKEAETLKKEALLEAKEEVHKLRAEL-ERELKERRNELQRLERRLLQREETLD 93
Query: 120 KKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKE 179
+K ++K E EK E L K+K +D++ E++ I ++R L E
Sbjct: 94 RKMESLDKKEENLEKKEKELSNKEKN----------LDEKEEELEELIAEQREEL----E 139
Query: 180 RVS 182
R+S
Sbjct: 140 RIS 142
Score = 34.5 bits (80), Expect = 0.30
Identities = 29/136 (21%), Positives = 59/136 (43%), Gaps = 14/136 (10%)
Query: 322 MTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEME-- 379
+ Y K A K+ G + I E K + + L E ++ + E+ K R E+E
Sbjct: 13 IIGYLLRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEA--LLEAKEEVHKLRAELERE 70
Query: 380 --EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDA 437
E + + +LE + Q E +L + ++ ++ + + +K L++ EEL
Sbjct: 71 LKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEEL--- 127
Query: 438 KTDKHEDTRRKKKQEL 453
E+ ++++EL
Sbjct: 128 -----EELIAEQREEL 138
Score = 32.2 bits (74), Expect = 1.9
Identities = 26/123 (21%), Positives = 60/123 (48%), Gaps = 13/123 (10%)
Query: 121 KKGEVEKIERRKEKAENILREKKKEQGALNREL------------AKVDQEIREMDVEIN 168
K+ +K+ +E A+ I+ E KKE L +E A++++E++E E+
Sbjct: 20 KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQ 79
Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD-VRKRKAEYE 227
+ L++ +E + + L +++L + + K++ + E +L + + +++ E E
Sbjct: 80 RLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELE 139
Query: 228 RQS 230
R S
Sbjct: 140 RIS 142
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 39.3 bits (92), Expect = 0.014
Identities = 21/146 (14%), Positives = 51/146 (34%), Gaps = 6/146 (4%)
Query: 312 GRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQ----KGDQDKLDNELRQQVQT 367
I + + +++ N R +++ Q+ E+ + ++++ +
Sbjct: 781 AEKILIITEILSDGINNSD-INDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELI 839
Query: 368 QNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKEL 427
E K + E++ I + E+ I+ + + + K + + L K +
Sbjct: 840 NEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKND-PKKLNKLIIAKDVLIKLV 898
Query: 428 EQVIEELGDAKTDKHEDTRRKKKQEL 453
E D KT K + +K E
Sbjct: 899 ISSDEIKQDEKTTKKKKKDLEKTDEE 924
>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5. Members of this
family of proteins are involved in maintaining Golgi
structure. They stimulate the formation of Golgi stacks
and ribbons, and are involved in intra-Golgi retrograde
transport. Two main interactions have been
characterized: one with RAB1A that has been activated by
GTP-binding and another with isoform CASP of CUTL1.
Length = 509
Score = 38.3 bits (89), Expect = 0.021
Identities = 57/280 (20%), Positives = 103/280 (36%), Gaps = 35/280 (12%)
Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 228
+ S K+ S L+S L + + +DLE +LA ++ R E
Sbjct: 70 SRVSSPSSKKDGTSR---SLSSQVDDLASAVSS-----QSSSDLEDELAALKIRLQEA-A 120
Query: 229 QSIPGRDINLESAQ-----DVEINKKRPSLIKSKERVSHIQKKLASAKKSLV----EVRQ 279
Q + LE + D+ + L + R +Q KL ++ V
Sbjct: 121 QELRELKSELEDLRSERSRDLSDEESIKRLQRGAVRS--LQDKLLLLDAAVKRHDGNVIT 178
Query: 280 ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKI 339
A K LE ++ +R+ R E Q+ + + E+ K+
Sbjct: 179 AVLIFLKRTLKLEILFQELAERQTA-LRHLSKHLQEEGELQQLLKLLRAEGESEKQE--- 234
Query: 340 LQQLDTINREQKGDQDKLDNELRQQV--------QTQNEIKKKRHEMEEAQKRIDKLEDH 391
LQQ ++K N L++ Q E+++ +HE E Q+ I KLE
Sbjct: 235 LQQYRQKAHRILQSKEKRINFLKEGCLFEGLDSSTAQIELEELKHESEHVQEEITKLEGQ 294
Query: 392 IRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVI 431
I Q + +D + E + + S + + +EL +++ I
Sbjct: 295 IIQLRSEAQD---REAEASGEAESFRKQPRELSQQIAPQI 331
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 38.5 bits (90), Expect = 0.022
Identities = 42/205 (20%), Positives = 73/205 (35%), Gaps = 38/205 (18%)
Query: 76 KIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKA 135
K D E+ + EI KEVE L L +++E + LD++K +++ +
Sbjct: 67 KEFLDLEEEILDLEAEI--KEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSEL 124
Query: 136 ENILREKKKEQGA-------------------LNRELAKV-DQEIREMDVEI----NKKR 171
N+ + K +GA REL + IR+ ++E KK
Sbjct: 125 SNLDIDFKYLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKKT 184
Query: 172 PSLI--KSKERVSHIQKKLASAKKSLVEV-------RQANEAHNKDIADLETQLADVRKR 222
+I KE + ++K L S L +V + NK I +L+ L
Sbjct: 185 VFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESH 244
Query: 223 KAEY---ERQSIPGRDINLESAQDV 244
+ + D + + V
Sbjct: 245 LEKVLVKIADELLAWDEQVSKEKAV 269
Score = 34.7 bits (80), Expect = 0.32
Identities = 48/265 (18%), Positives = 99/265 (37%), Gaps = 43/265 (16%)
Query: 192 KKSLVEVRQANEAHNKDIADLE-TQLADVRKRKAEYERQ----SIPGRDINLESAQDVEI 246
+ LV++R NE D++ + + +VR R E ER+ + + +
Sbjct: 6 ELGLVQIRDLNE----DVSAFQRKFVNEVR-RCDEVERKLRKLESKIKKLGIPLKDTGGK 60
Query: 247 NKKRPSLI--KSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
PS +E + ++ ++ +++L + K+I +LE L + + K+
Sbjct: 61 PDVPPSKEFLDLEEEILDLEAEIKEVEENLESLE-------KEINELEEWLNVLDEEKSF 113
Query: 305 YER-----QSIPGRDINLESAQMTEYTNLKAEA----TKRAGKILQQLDTINREQKGDQD 355
+ + DI+ + + E L A ++ ++L R Q
Sbjct: 114 LDENLEELSELSNLDIDFKYLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIR-QA 172
Query: 356 KLDNELRQQVQTQNEIKKKRH----EMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
+++ L + KK +E ++ K+ D + + + + EL S
Sbjct: 173 EIEEPL-------EDPKKTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELIS 225
Query: 412 DVGSSKNRVQELQKELEQVIEELGD 436
V R++ELQ+ LEQ L
Sbjct: 226 KV---NKRIEELQRVLEQTESHLEK 247
Score = 29.7 bits (67), Expect = 9.7
Identities = 23/138 (16%), Positives = 53/138 (38%), Gaps = 18/138 (13%)
Query: 50 EMIQAEEETNMSYLKKKGVVAERKEAKIEK------DEAEKYQRIREEI--VAKEVEHQL 101
+ ++ E + + GV+ K E+ + I E + K V
Sbjct: 132 KYLRGAEGLKLGF--VAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKKTVFIIF 189
Query: 102 FKLYHNETDIKELEDELDKKKGEV--------EKIERRKEKAENILREKKKEQGALNREL 153
F + +K++ D + +V E I + ++ E + R ++ + L + L
Sbjct: 190 FVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESHLEKVL 249
Query: 154 AKVDQEIREMDVEINKKR 171
K+ E+ D +++K++
Sbjct: 250 VKIADELLAWDEQVSKEK 267
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 37.3 bits (87), Expect = 0.022
Identities = 26/130 (20%), Positives = 57/130 (43%), Gaps = 10/130 (7%)
Query: 341 QQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK 400
++ I K ++KL V+ + E++ E++ A++RI+ LED + + ++
Sbjct: 66 SEIQEIRERIKRAEEKLSA-----VKDERELRALNIEIQIAKERINSLEDELAELMEEIE 120
Query: 401 DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKA 460
+K E+L + + + E + LE+ + E+ + K++EL E
Sbjct: 121 KLEKEIEDLKERLERLEKNLAEAEARLEEEVAEI-----REEGQELSSKREELKEKLDPE 175
Query: 461 YSGVYDRMIN 470
Y+R+
Sbjct: 176 LLSEYERIRK 185
Score = 35.8 bits (83), Expect = 0.070
Identities = 23/108 (21%), Positives = 59/108 (54%), Gaps = 2/108 (1%)
Query: 124 EVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSH 183
E +++E R ++ L++ K E ALN+ L ++ E+ +++ ++++ + + +ER+
Sbjct: 18 EKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKR 77
Query: 184 IQKKLASAKKS--LVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
++KL++ K L + + + I LE +LA++ + + E++
Sbjct: 78 AEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKE 125
Score = 34.6 bits (80), Expect = 0.18
Identities = 40/208 (19%), Positives = 94/208 (45%), Gaps = 35/208 (16%)
Query: 243 DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRK 302
D+E ++ P + + ++ + + +L + K+L + E ++ LE+++ ++R+R
Sbjct: 16 DLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERI 75
Query: 303 AEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELR 362
+AE A K ++L +N E + ++++++
Sbjct: 76 K-------------------------RAEEKLSAVKDERELRALNIEIQIAKERINSLED 110
Query: 363 QQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQE 422
+ + EI+K E+E+ ++R+++LE ++ + EA L++ E QE
Sbjct: 111 ELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIRE----------EGQE 160
Query: 423 LQKELEQVIEELGDAKTDKHEDTRRKKK 450
L + E++ E+L ++E R+ KK
Sbjct: 161 LSSKREELKEKLDPELLSEYERIRKNKK 188
Score = 32.7 bits (75), Expect = 0.89
Identities = 38/205 (18%), Positives = 74/205 (36%), Gaps = 29/205 (14%)
Query: 679 QIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIG 738
I KL+ E D L R +K +A + A + + E+ +E+ V +
Sbjct: 11 AIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQL------ 64
Query: 739 VSTIRQYEEAELRSQQERQKICQDKDTKKNVAR--------WERAVSDDEEELARAQGAE 790
+ E E+R + +R + + R + ++ E+ELA
Sbjct: 65 -----ESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEI 119
Query: 791 EKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLE 850
EKL E+ ++LE + + ++ EV I ++ Q L+
Sbjct: 120 EKLEKEIEDLKERLERLEKNLAEAEARLEE----------EVAEIREEGQELSSKREELK 169
Query: 851 SKLEMKKSERHDILMNCKMNDIVLP 875
KL+ + ++ + K V+P
Sbjct: 170 EKLDPELLSEYERIRKNKKGVGVVP 194
Score = 30.8 bits (70), Expect = 3.2
Identities = 30/127 (23%), Positives = 54/127 (42%), Gaps = 10/127 (7%)
Query: 611 LDLARKAKRWDDKEMGNLKAQKEKLSEEL----------REAMKKSRKESELNTVQSTIK 660
L+ KA + E+ +L+ Q +L E+ E + + E EL + I+
Sbjct: 40 LEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQ 99
Query: 661 GLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKE 720
+ R+N +L +I KLE EI+ L R + E + EA + I + +
Sbjct: 100 IAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQ 159
Query: 721 EMNSVED 727
E++S +
Sbjct: 160 ELSSKRE 166
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 36.8 bits (85), Expect = 0.039
Identities = 27/104 (25%), Positives = 47/104 (45%)
Query: 110 DIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINK 169
+I+ DEL+K K E++ E E E L+E KE L E + +E+R++ +IN
Sbjct: 5 EIRNKTDELEKLKDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQIHADIND 64
Query: 170 KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLE 213
+ +SK + K+A ++++ K LE
Sbjct: 65 METEIKQSKSELERRMGKIARMHGEYNPLKESINEMRKLELGLE 108
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 37.6 bits (87), Expect = 0.040
Identities = 64/453 (14%), Positives = 155/453 (34%), Gaps = 54/453 (11%)
Query: 48 KAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRI---REEIVAKEVEH----- 99
+ + E L K +++ R++++ KE++H
Sbjct: 177 QYTQLALMEFAKKKSLHGK-AELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREAL 235
Query: 100 -QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRE------ 152
Q + + T +E ++E KK+ ++++ R E+ ++ Q +NR
Sbjct: 236 QQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAPL 295
Query: 153 ------LAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLAS--AKKSLVEVRQANEA 204
+ +++Q+ + + E+ K S K + + K+ +S ++ L++ + E
Sbjct: 296 AAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEI 355
Query: 205 HNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKE------ 258
H +D ++ T + ++ ++ Q I + + + I +E
Sbjct: 356 HIRDAHEVATSIREISCQQHT-LTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDT 414
Query: 259 ---RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDI 315
+Q +LA AKK E++Q Q + K + QS+ R+
Sbjct: 415 RTSAFRDLQGQLAHAKKQ-QELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQ 473
Query: 316 NLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKG-----------------DQDKLD 358
L + + L +L G ++
Sbjct: 474 QL--QTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQ 531
Query: 359 NELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKN 418
+ Q + + H++ +K+ L++ +++ + S + D+ + +N
Sbjct: 532 RGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQN 591
Query: 419 RVQELQKELEQVIEELGDAKTDKHEDTRRKKKQ 451
LQ E++ E ++H R+ + +
Sbjct: 592 ITVRLQDLTEKLSEAEDMLACEQHALLRKLQPE 624
Score = 31.5 bits (71), Expect = 3.5
Identities = 53/402 (13%), Positives = 126/402 (31%), Gaps = 28/402 (6%)
Query: 46 RLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLY 105
+A+ I E ++ M K K A K ++ ++ R E + +
Sbjct: 306 EQQAQRIHTELQSKMRSRAK----LLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAH 361
Query: 106 HNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDV 165
T I+E+ + + ++++K L+ KE L RE A +D
Sbjct: 362 EVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRD 421
Query: 166 EINKK-------RPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 218
+ ++ + I K + ++++ ++ + E QL
Sbjct: 422 LQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLK----EREQQLQT 477
Query: 219 VRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
+ + R + ++ P + + ++ L
Sbjct: 478 KEQIHLQETR-------KKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRM 530
Query: 279 QANE----AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATK 334
Q E D+ QL RK++A + Q + Q + +
Sbjct: 531 QRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQ 590
Query: 335 RAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ 394
LQ L E + + L +++Q + +++ R +++ + + +
Sbjct: 591 NITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHA 650
Query: 395 NEASLKDNKKLKEELNSDV--GSSKNRVQELQKELEQVIEEL 434
+ +L + + L+ V Q ++++ E+L
Sbjct: 651 LQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQL 692
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 37.7 bits (87), Expect = 0.040
Identities = 66/357 (18%), Positives = 148/357 (41%), Gaps = 50/357 (14%)
Query: 136 ENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKK---LASAK 192
+N L ++E+ L ++ ++ ++ +M I + +L S E+ ++++ S+
Sbjct: 142 QNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSS 201
Query: 193 KSLVEVRQANEA----HNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQD----- 243
+S+ R A A + + I +LE + + ++K E +R +DI E +
Sbjct: 202 ESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELK 261
Query: 244 -------VEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 296
++ + I + + + L + ++ L +Q +++A L
Sbjct: 262 QRLKKMTIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELA----SLG 317
Query: 297 DVRKRKAEYERQSIPGRDINLESAQMT-EYTNLKAEATKRAGKILQQLDTINREQKGDQD 355
+R QS LESAQM+ + + L + + Q+ +T+ + + D+D
Sbjct: 318 SLRDHTIAELHQS------RLESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADKD 371
Query: 356 KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGS 415
++ + ++ + ++++R + E+ + + +D N L +N++ EL S +
Sbjct: 372 RIQKLSAELLKLEEWLQEERSQREKLEVELGIEKD---CNRVQLSENRRELSELRSAL-- 426
Query: 416 SKNRVQELQKELEQVIEELG---------DAKTDKHEDTRRKKKQELVENFKKAYSG 463
+ LQKE EQ+ EE + + DK D + + E K+Y
Sbjct: 427 -----RVLQKEKEQLQEEKQELLDYIRVLELRLDKEADEKWAEDAATCE-DAKSYLH 477
Score = 35.0 bits (80), Expect = 0.22
Identities = 41/249 (16%), Positives = 107/249 (42%), Gaps = 22/249 (8%)
Query: 46 RLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRI--REEIVAKEVEHQLFK 103
L E ++ + + L+ + K ++E E+++ E ++ E L +
Sbjct: 159 FLALEGDVSDMRSRIEQLETALRHSTEKTEELE----EQHKEAQSSSESMSAERNALLAQ 214
Query: 104 LYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREM 163
N+ I ELE +++ + ++K++ + +L + + L R ++ Q +++M
Sbjct: 215 RAENQQRILELEQ-------DIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKM 267
Query: 164 DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET----QLADV 219
++ + I + + + L + ++ L +Q +++A L + +A++
Sbjct: 268 TIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAEL 327
Query: 220 RKRKAEYERQSIPGRDINLESAQDV-EINKKRPSLIKS----KERVSHIQKKLASAKKSL 274
+ + E + S+ +NL + + ++R +L +S K+R+ + +L ++ L
Sbjct: 328 HQSRLESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKLSAELLKLEEWL 387
Query: 275 VEVRQANEA 283
E R E
Sbjct: 388 QEERSQREK 396
Score = 33.8 bits (77), Expect = 0.54
Identities = 39/186 (20%), Positives = 82/186 (44%), Gaps = 21/186 (11%)
Query: 263 IQKKLASAKKSLVEVRQANEAHNKDIAD-------LETQLADVRKRKAEYERQSIPGRDI 315
+Q +L ++ ++ + A D++D LET L ++ E E Q ++
Sbjct: 141 LQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQH---KEA 197
Query: 316 NLESAQMTEYTNL----KAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEI 371
S M+ N +AE +R ++ Q + T+ ++K + D++ Q ++ + E
Sbjct: 198 QSSSESMSAERNALLAQRAENQQRILELEQDIQTLT-QKKQENDRVLEGT-QDIEAELER 255
Query: 372 KKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVI 431
K E ++R+ K+ R E D + E+L+ D+ + + R++ Q++ +
Sbjct: 256 MKG-----ELKQRLKKMTIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLLG 310
Query: 432 EELGDA 437
EEL
Sbjct: 311 EELASL 316
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 37.4 bits (87), Expect = 0.044
Identities = 33/174 (18%), Positives = 72/174 (41%), Gaps = 24/174 (13%)
Query: 74 EAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELE--DELDKKKGE------- 124
+ K E+D+ ++ E+ + KE++ KL E +++ELE E ++KGE
Sbjct: 272 DEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQ 331
Query: 125 -VEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSH 183
+E+ + A+ E+ K + ++ ++ Q + ++ + +L + +
Sbjct: 332 LIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKE 391
Query: 184 IQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD---------VRKRKAEYER 228
SAK +L K I ++ +L + RK+K +E+
Sbjct: 392 AIAYYESAKTAL-----EKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 37.7 bits (87), Expect = 0.045
Identities = 78/420 (18%), Positives = 192/420 (45%), Gaps = 43/420 (10%)
Query: 110 DIKELEDELDKKKGEVEKIERRKEKAENILR------EKKKEQGALNRELAKVDQEIREM 163
DIK L+ ++D +E+I+++ E + ++ E ++ N + +++++I +
Sbjct: 1119 DIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPEEIEKKIENI 1178
Query: 164 DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRK 223
+I+KK+ + K+ ++ I ++ K SL EV+ N ++ K++ L + D K+K
Sbjct: 1179 VTKIDKKKNIYDEIKKLLNEIA-EIEKDKTSLEEVKGINLSYGKNLGKLFLEKIDEEKKK 1237
Query: 224 AEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ---A 280
+E+ +++ +L+ EI +K P + I+ ++ + S + +
Sbjct: 1238 SEHMIKAMEAYIEDLD-----EIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKDHHII 1292
Query: 281 NEAHNKDIADL-------------ETQLADVRKRKAEYERQSIP-GRDINLESAQMTE-Y 325
++ H+++I+D+ E+ + D++K + + DINL ++ Y
Sbjct: 1293 SKKHDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLNEIANIY 1352
Query: 326 TNLKAEATKR-AGKILQQLDTINREQKGDQDKLDN--ELRQQVQTQNEIKKKRHEME--- 379
LK K+ ++ + I K +D+LD +L ++++ +++ + ++E
Sbjct: 1353 NILKLNKIKKIIDEVKEYTKEIEENNKNIKDELDKSEKLIKKIKDDINLEECKSKIESTL 1412
Query: 380 ------EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
E K+I +L++HI E+++ K +E N +V ++ + + +++
Sbjct: 1413 DDKDIDECIKKIKELKNHILSEESNIDTYFKNADENNENVLLLFKNIEMADNKSQHILKI 1472
Query: 434 LGDAKTDKHEDTRRKKKQELVENFK-KAYSGVYDRMINMCHPVHKRYNVAITKVLGKYME 492
D T+ H+ + K+ + ++ K + + I + ++Y +T++L KY
Sbjct: 1473 KKDNATNDHDFNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSA 1532
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have
LPXTG C-terminal anchoring motifs and a substantial
number have the KxYKxGKxW putative sorting signal at the
N-terminus. The tetracycline resistance plasmid pCF10 in
Enterococcus faecalis promotes conjugal plasmid transfer
in response to sex pheromones, but PgrA/Sec10 encoded by
that plasmid, a member of this family, specifically
inhibits the ability of cells to receive homologous
plasmids. The phenomenon is called surface exclusion.
Length = 356
Score = 36.6 bits (85), Expect = 0.063
Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 15/113 (13%)
Query: 184 IQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQD 243
+Q LA+A+ L + A + +T A + A +++ L +AQ
Sbjct: 259 LQAALATAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKE--------LANAQA 310
Query: 244 VEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 296
+ + +L ++ +++ + +LA AK++L N D+A + L
Sbjct: 311 QALQTAQNNLATAQAALANAEARLAKAKEAL-------ANLNADLAKKQAALD 356
Score = 30.8 bits (70), Expect = 3.4
Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 12/107 (11%)
Query: 252 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIP 311
+L ++ ++ Q L +A+ +L + A A +A + +LA+ + + + +
Sbjct: 262 ALATAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQTAQN--- 318
Query: 312 GRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLD 358
NL +AQ A A R K + L +N + Q LD
Sbjct: 319 ----NLATAQ-----AALANAEARLAKAKEALANLNADLAKKQAALD 356
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 36.5 bits (84), Expect = 0.067
Identities = 39/254 (15%), Positives = 87/254 (34%), Gaps = 34/254 (13%)
Query: 157 DQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQL 216
++ +R +++ RP L + + Q + +A+ R + ++ + +L
Sbjct: 66 NRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQEL 125
Query: 217 ADVRKRKA----EYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKK 272
A R+ A E R + +D+ E +R L + + QK+L ++
Sbjct: 126 AAARQNLAKAQQELARLTKQAQDLQTRLKTLAE---QRRQLEAQAQSLQASQKQLQASAT 182
Query: 273 SLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEA 332
L + + I + + R E A A
Sbjct: 183 QLKSQVLDLKLRSAQIEQ--------EAQNLATRANAAQART--------EELARRAAAA 226
Query: 333 TKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHI 392
+ A I QQ D ++ + +Q +I+++ +++ + +LE +
Sbjct: 227 QQTAQAI-QQRDA----------QISQKAQQIAARAEQIRERERQLQRLETAQARLEQEV 275
Query: 393 RQNEASLKDNKKLK 406
Q EA + +L+
Sbjct: 276 AQLEAYYQAYVRLR 289
Score = 35.8 bits (82), Expect = 0.13
Identities = 31/178 (17%), Positives = 65/178 (36%), Gaps = 23/178 (12%)
Query: 280 ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKI 339
N + L+ +R + E L +AQ K A
Sbjct: 65 LNRNLRSGVFQLDDIRPQLRALRTE------------LGTAQGE-----KRAAETEREAA 107
Query: 340 LQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASL 399
+L +E++ + +L + + Q E+ + + ++ Q R+ L + RQ EA
Sbjct: 108 RSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQA 167
Query: 400 KDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENF 457
+ + +++L + K++V +L+ Q+ +E ++ TR Q E
Sbjct: 168 QSLQASQKQLQASATQLKSQVLDLKLRSAQIEQE------AQNLATRANAAQARTEEL 219
Score = 35.4 bits (81), Expect = 0.16
Identities = 39/224 (17%), Positives = 87/224 (38%), Gaps = 21/224 (9%)
Query: 630 AQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDA 689
Q + + +LR +EL T Q + E +R +LQ + + + E+ A
Sbjct: 74 FQLDDIRPQLRA------LRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAA 127
Query: 690 LN---ARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYE 746
A+A ++ + R T++ ++ ++ + + + +T + +
Sbjct: 128 ARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQ 187
Query: 747 EAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLEN 806
+L+ + + I Q+ +N+A A EELAR A ++ A ++ ++
Sbjct: 188 VLDLKLRSAQ--IEQEA---QNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQ 242
Query: 807 MRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLE 850
K Q + A E+I + R++ + ++ LE
Sbjct: 243 -------KAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLE 279
Score = 33.1 bits (75), Expect = 0.90
Identities = 34/224 (15%), Positives = 77/224 (34%), Gaps = 34/224 (15%)
Query: 39 ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEA---------------E 83
AL+ + + E AE E + + + ER+ + E A +
Sbjct: 85 ALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTK 144
Query: 84 KYQRIREEIVA-----KEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENI 138
+ Q ++ + +++E Q L ++ ++ +L + +++ + E+
Sbjct: 145 QAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQN 204
Query: 139 LREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEV 198
L + A ELA+ ++ I +R + I QK A ++
Sbjct: 205 LATRANAAQARTEELARRAAAAQQTAQAIQ-QRDAQIS--------QKAQQIAARAEQIR 255
Query: 199 RQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ 242
+ + LET A + + A+ E + ++A
Sbjct: 256 ERERQLQ-----RLETAQARLEQEVAQLEAYYQAYVRLRQQAAA 294
Score = 33.1 bits (75), Expect = 1.0
Identities = 32/225 (14%), Positives = 86/225 (38%), Gaps = 22/225 (9%)
Query: 245 EINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
+++ RP L + + Q + +A+ R + ++ + +LA R+ A+
Sbjct: 75 QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAK 134
Query: 305 YERQSIPGRDINLESAQMTEY-----TNLKAEATKRAGKILQQLDTINREQK---GDQDK 356
++ E A++T+ T LK A +R ++ Q ++ QK +
Sbjct: 135 AQQ----------ELARLTKQAQDLQTRLKTLAEQRR-QLEAQAQSLQASQKQLQASATQ 183
Query: 357 LDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSS 416
L +++ +I+++ + + + + A+ ++ + +
Sbjct: 184 LKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAA---QQTAQAIQQRDAQI 240
Query: 417 KNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAY 461
+ Q++ EQ+ E + + R +++ +E + +AY
Sbjct: 241 SQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEAYYQAY 285
Score = 30.8 bits (69), Expect = 4.9
Identities = 14/157 (8%), Positives = 64/157 (40%), Gaps = 1/157 (0%)
Query: 37 SGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKE 96
A + + + + E+ + ++ + K + +R++ + + + + + + + A +
Sbjct: 125 LAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQ-SLQASQKQLQASATQ 183
Query: 97 VEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKV 156
++ Q+ L I++ L + + + ++ + + ++++
Sbjct: 184 LKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQK 243
Query: 157 DQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKK 193
Q+I +I ++ L + + + +++++A +
Sbjct: 244 AQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEA 280
Score = 30.0 bits (67), Expect = 7.9
Identities = 24/200 (12%), Positives = 71/200 (35%), Gaps = 31/200 (15%)
Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
+ + + E E + + E++K + +E L ++ +ELA++ ++ +++ +
Sbjct: 94 QGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRL 153
Query: 168 NK---KRPSLIKSKE---------------------RVSHIQKKLASAKKSLVEVRQANE 203
+R L + + ++ ++L A +
Sbjct: 154 KTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQ 213
Query: 204 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKE----R 259
A +++A ++ +++ + A E ++R ++ E R
Sbjct: 214 ARTEELARRAAAA---QQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQAR 270
Query: 260 VSHIQKKLASAKKSLVEVRQ 279
+ +L + ++ V +RQ
Sbjct: 271 LEQEVAQLEAYYQAYVRLRQ 290
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 35.5 bits (82), Expect = 0.072
Identities = 19/103 (18%), Positives = 55/103 (53%), Gaps = 2/103 (1%)
Query: 360 ELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNR 419
++ + Q E+ + + ++ E Q+ ++ LE+ I + E+ L+D K+ + L + S + R
Sbjct: 70 RPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEER 129
Query: 420 VQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYS 462
++ L++ ++++ +EL + + ++ +E +E ++
Sbjct: 130 LESLEESIKELAKELRE--LRQDLREEVEELREELERLQENLQ 170
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 36.5 bits (85), Expect = 0.074
Identities = 32/207 (15%), Positives = 80/207 (38%), Gaps = 25/207 (12%)
Query: 748 AELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENM 807
AEL SQ R + VAR RA +D + + + +++
Sbjct: 84 AELESQVLRLEA--------EVARL-RAEADSQAAIEFPDDLLSAEDPAVPELIKGQQSL 134
Query: 808 RATRLTKKQA-VDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMN 866
+R + +A ++ + +I + E+ + +QA ++ + +LE ++ + L+
Sbjct: 135 FESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLV- 193
Query: 867 CKMNDIVLPMLRVQKYDRKLAKSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEF- 925
L + + +R+ A++ E+ ++ +++ + + + F
Sbjct: 194 --------SRLELLELERERAEAQGELGRLEAELEV----LKRQIDELQLERQQIEQTFR 241
Query: 926 ENARKRAKKAKANFDRIKKERYDKFTR 952
E + +A+A + +ER +K
Sbjct: 242 EEVLEELTEAQARLAEL-RERLNKARD 267
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 36.8 bits (85), Expect = 0.074
Identities = 79/383 (20%), Positives = 153/383 (39%), Gaps = 58/383 (15%)
Query: 55 EEETNMSYLKKKGVVA-ERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKE 113
+E + + V+ +K + D + E I A + E Q T+ K+
Sbjct: 70 DENGQLENTSLRTVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQ--------TNSKD 121
Query: 114 LEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPS 173
E D + ++ + + EK NIL + AL L K+ E + +IN
Sbjct: 122 GEQLSDFQLEDLVGMIQNAEK--NILLLNQARLQALED-LEKILTEKEALQGKINILEMR 178
Query: 174 LIKSKERVSHIQKKLASAKKSLVEV--RQANEAHNKDIADLETQLADVRKRKAEYERQSI 231
L ++ R+ KLA+ +K VE+ Q + N+ + T+ V E +
Sbjct: 179 LSETDARI-----KLAAQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKELDVL-- 231
Query: 232 PGRDINLESAQDVEINKKRPSLIK-SKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD 290
++ N+ D++ K + ++ERV ++K+ + SL E+ +D++
Sbjct: 232 --KEENMLLKDDIQFLKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSK 289
Query: 291 LETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQ 350
L D K E NL+ +L AT + K LD N++
Sbjct: 290 LSPLQYDCWWEKVE-----------NLQ--------DLLDRATNQVEKAALVLDQ-NQDL 329
Query: 351 KGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELN 410
+ DKL+ L++ N K +++E Q+++ LE+ ++ ++ E++
Sbjct: 330 RDKVDKLEASLKEA----NVSKFSSYKVELLQQKLKLLEERLQASD----------HEIH 375
Query: 411 SDVGSSKNRVQELQKELEQVIEE 433
S + + ++E Q L ++ EE
Sbjct: 376 SYIQLYQESIKEFQDTLSKLKEE 398
>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
nucleotide-binding domain. ABC transporters are a large
family of proteins involved in the transport of a wide
variety of different compounds, like sugars, ions,
peptides, and more complex organic molecules. The
nucleotide-binding domain shows the highest similarity
between all members of the family. ABC transporters are a
subset of nucleotide hydrolases that contain a signature
motif, Q-loop, and H-loop/switch region, in addition to,
the Walker A motif/P-loop and Walker B motif commonly
found in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 157
Score = 34.5 bits (80), Expect = 0.10
Identities = 10/60 (16%), Positives = 24/60 (40%), Gaps = 1/60 (1%)
Query: 998 HPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHA-DSLVGICPG 1056
+LDE + LD + ++ + ++ I+++ E A D ++ + G
Sbjct: 97 LNPDLLLLDEPTSGLDPASRERLLELLRELAEEGRTVIIVTHDPELAELAADRVIVLKDG 156
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 35.7 bits (83), Expect = 0.12
Identities = 23/123 (18%), Positives = 47/123 (38%), Gaps = 16/123 (13%)
Query: 339 ILQQLDTINREQKGD-QDKLDNELRQQVQT-QNEIKKKRHEMEEAQKRIDKLEDHIRQNE 396
I L I +E++ Q ++ R ++ EI + E++EAQ ++ Q E
Sbjct: 110 IDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQ-EAQCE 168
Query: 397 ASLKDN----------KKLKEELN---SDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
A K+ +E+L+ + + + K R+ +LE L +
Sbjct: 169 AEGTGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEAQKAALERNRQAAVA 228
Query: 444 DTR 446
+ +
Sbjct: 229 EKQ 231
Score = 34.5 bits (80), Expect = 0.25
Identities = 25/148 (16%), Positives = 52/148 (35%), Gaps = 22/148 (14%)
Query: 92 IVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNR 151
++++ +E ++F+ KE++ L + + E+ + + R K AL
Sbjct: 96 VISEPLELKIFE--------KEIDTVLLEIQ--QEEQAAAQAQVAAGFRPKIA---ALTA 142
Query: 152 ELAKVDQEIREMDVEINKKRPSLIKSKERVSHI---------QKKLASAKKSLVEVRQAN 202
E+A + EI E E+N E ++K + +
Sbjct: 143 EIAALQAEIDEAQAEVNAAYQEAQCEAEGTGGTGVAGKGPVYKEKREKLDAAQARLETLK 202
Query: 203 EAHNKDIADLETQLADVRKRKAEYERQS 230
+ IA LE Q A + + + +
Sbjct: 203 ARLDAAIAQLEAQKAALERNRQAAVAEK 230
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 36.2 bits (84), Expect = 0.12
Identities = 34/270 (12%), Positives = 89/270 (32%), Gaps = 18/270 (6%)
Query: 111 IKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKK 170
+++L EL+KK+ E+EK++ +++ E L++ ++++ + EI K+
Sbjct: 1104 VEKLNAELEKKEKELEKLKNT--TPKDMWLED------LDKFEEALEEQEEVEEKEIAKE 1155
Query: 171 RPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQS 230
+ + K + KL K + ++ ++ R E +
Sbjct: 1156 Q----RLKSKTKGKASKLRK-PKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLD 1210
Query: 231 IPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD 290
+ S+ + + + K V ++ K ++ KS + + +
Sbjct: 1211 DKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGK 1270
Query: 291 LETQLADVRKRKA---EYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQL--DT 345
+ V + ++ + + + T+ K A ++
Sbjct: 1271 PKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKK 1330
Query: 346 INREQKGDQDKLDNELRQQVQTQNEIKKKR 375
R++K Q + +KK+
Sbjct: 1331 TARKKKSKTRVKQASASQSSRLLRRPRKKK 1360
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 36.0 bits (84), Expect = 0.12
Identities = 27/112 (24%), Positives = 57/112 (50%), Gaps = 6/112 (5%)
Query: 352 GDQDKLDNELRQQVQTQN-EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNK-KLKEEL 409
D++KL NEL ++ E+++K E E K +KL++ + + + L++ + KL EE
Sbjct: 513 EDKEKL-NELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA 571
Query: 410 NSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAY 461
+ + ++E +KE +++I+EL + + + + E + KA
Sbjct: 572 EKEA---QQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKAN 620
Score = 34.4 bits (80), Expect = 0.43
Identities = 19/98 (19%), Positives = 43/98 (43%), Gaps = 13/98 (13%)
Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQ--------GALNRELAKVDQE 159
E +E E L + + E++E +KEK + + +E +E ++ +E
Sbjct: 533 EQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKE 592
Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 197
+R++ K + +K+ E + +K+L A + +
Sbjct: 593 LRQL----QKGGYASVKAHE-LIEARKRLNKANEKKEK 625
Score = 32.1 bits (74), Expect = 2.1
Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 23/125 (18%)
Query: 39 ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
AL + E+LK E+ + +E+ E ++ +E+ E E Q I+E AK+
Sbjct: 541 ALLKEAEKLKEELEEKKEKLQ-----------EEEDKLLEEAEKEAQQAIKE---AKKEA 586
Query: 99 HQLFKLYHNETDIKELEDELDKKKGEVEKIERRK--EKAENILREKKKEQGALNRELAKV 156
++ K ++++L+ E IE RK KA +KKK+Q EL KV
Sbjct: 587 DEIIK------ELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEEL-KV 639
Query: 157 DQEIR 161
E++
Sbjct: 640 GDEVK 644
Score = 31.7 bits (73), Expect = 2.3
Identities = 25/95 (26%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 379 EEAQKRIDKLEDHIRQNE--ASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGD 436
EEA+K I ED + NE ASL++ ++ E+ + + ++L++ELE+ E+L
Sbjct: 505 EEAKKLIG--EDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKL-Q 561
Query: 437 AKTDKHEDTRRKKKQELVENFKKAYSGVYDRMINM 471
+ DK + K+ Q+ ++ KK + + +
Sbjct: 562 EEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQL 596
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 35.4 bits (82), Expect = 0.17
Identities = 23/95 (24%), Positives = 47/95 (49%), Gaps = 7/95 (7%)
Query: 44 YERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFK 103
YE + E + + + LK+ EA++ ++E E ++ E+I KE + +
Sbjct: 284 YENYRTEKLSGLKNSGEPSLKEI------HEARLNEEERELKKKFTEKIREKEKRLEELE 337
Query: 104 LYHNETDIKELEDELDKKKGEVEKIERRKEKAENI 138
E KEL +L++ + ++E +E+R EK ++
Sbjct: 338 QNLIEE-RKELNSKLEEIQKKLEDLEKRLEKLKSN 371
Score = 31.1 bits (71), Expect = 3.5
Identities = 18/83 (21%), Positives = 41/83 (49%), Gaps = 4/83 (4%)
Query: 317 LESAQMTEYTNLKAEATKRAGKILQQLDTINREQ-KGDQDKLDNELRQQVQTQNEIKKKR 375
L + + +LK R + ++L E+ + + +L+ EL Q + E K+
Sbjct: 292 LSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLE-ELEQNLIE--ERKELN 348
Query: 376 HEMEEAQKRIDKLEDHIRQNEAS 398
++EE QK+++ LE + + +++
Sbjct: 349 SKLEEIQKKLEDLEKRLEKLKSN 371
>gnl|CDD|205733 pfam13555, AAA_29, P-loop containing region of AAA domain.
Length = 60
Score = 31.8 bits (73), Expect = 0.17
Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 3/35 (8%)
Query: 5 LQYIEVDNFKSYKGK-FSIGPLKKFTAVIGPNGSG 38
L +++ N+ ++ G I P K T + GP+GSG
Sbjct: 1 LTRLQLVNWGTFDGHTIPIDP--KGTLLTGPSGSG 33
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 35.2 bits (81), Expect = 0.18
Identities = 29/116 (25%), Positives = 55/116 (47%), Gaps = 6/116 (5%)
Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRP 172
E + E KK + E +KE E +K + + +E QE+++++ + ++
Sbjct: 52 ERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERAT 111
Query: 173 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 228
SL + E +S +K L S ++SL + + + + + LE Q +KAE ER
Sbjct: 112 SLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQ------KKAELER 161
Score = 32.1 bits (73), Expect = 2.0
Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 5/94 (5%)
Query: 70 AERKEAKIE-KDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKI 128
A +KE +E K+EA KY REEI +E + + +L E+ + E LD+K +
Sbjct: 68 ALKKELLLEAKEEARKY---REEI-EQEFKSERQELKQIESRLTERATSLDRKDENLSSK 123
Query: 129 ERRKEKAENILREKKKEQGALNRELAKVDQEIRE 162
E+ E E L +K K ++ K++++ +
Sbjct: 124 EKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKA 157
>gnl|CDD|223047 PHA03332, PHA03332, membrane glycoprotein; Provisional.
Length = 1328
Score = 35.7 bits (82), Expect = 0.19
Identities = 21/108 (19%), Positives = 41/108 (37%), Gaps = 12/108 (11%)
Query: 323 TEYTNLKAEATKRAGKILQQLDTINREQKGDQD---KLDNELRQQVQT-QNEIKKKRHEM 378
T Y N +AT ++ ++ +N D KL + + + T N I+
Sbjct: 880 TLYVNQLLQATAATAEMASKIGGLNARVDKTSDVITKLGDTIAKISATLDNNIR------ 933
Query: 379 EEAQKRIDKLEDHIRQNEASLKDN-KKLKEELNSDVGSSKNRVQELQK 425
R+ LED + ++ N L +L ++ R++E+
Sbjct: 934 -AVNGRVSDLEDQVNLRFLAVATNFNTLATQLKELGTTTNERIEEVMA 980
Score = 31.5 bits (71), Expect = 3.5
Identities = 12/67 (17%), Positives = 30/67 (44%), Gaps = 3/67 (4%)
Query: 363 QQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQE 422
Q +Q + ++ R+DK D I + ++ K+ L++++ + RV +
Sbjct: 885 QLLQATAATAEMASKIGGLNARVDKTSDVITKLGDTI---AKISATLDNNIRAVNGRVSD 941
Query: 423 LQKELEQ 429
L+ ++
Sbjct: 942 LEDQVNL 948
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 35.6 bits (82), Expect = 0.19
Identities = 26/115 (22%), Positives = 57/115 (49%), Gaps = 6/115 (5%)
Query: 338 KILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
++++L + +E + + L+ L++Q + + E++++ E++E ++ KLE EA
Sbjct: 515 VLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNK-KLELEKEAQEA 573
Query: 398 SLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQE 452
K LK+E+ S + K + KE + IE+L K K + ++ +
Sbjct: 574 L----KALKKEVESIIRELKEKKIHKAKE-IKSIEDLVKLKETKQKIPQKPTNFQ 623
Score = 32.9 bits (75), Expect = 1.2
Identities = 25/135 (18%), Positives = 63/135 (46%), Gaps = 5/135 (3%)
Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKK-R 171
E+ ++K +++E++ E E +L+E++K + L +E+ ++ + R +E+ K+ +
Sbjct: 512 EINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQ 571
Query: 172 PSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSI 231
+L K+ V I ++L K + ++ E D+ L+ + ++ ++ I
Sbjct: 572 EALKALKKEVESIIRELKEKKIHKAKEIKSIE----DLVKLKETKQKIPQKPTNFQADKI 627
Query: 232 PGRDINLESAQDVEI 246
+ Q +I
Sbjct: 628 GDKVRIRYFGQKGKI 642
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 35.1 bits (81), Expect = 0.22
Identities = 28/133 (21%), Positives = 64/133 (48%), Gaps = 17/133 (12%)
Query: 334 KRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEME----------EAQK 383
K GK +Q TI +D+++ +++ + E K+KR +++ +A+K
Sbjct: 486 KGTGK--EQSITIQGASTLPKDEVERMVKEAEKNAAEDKEKREKIDLKNQAESLCYQAEK 543
Query: 384 RIDKLEDHIRQNE-ASLKDN-KKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDK 441
++ +L+D I + + +++ KKL++ L +D + ++ L +EL++ + E+G
Sbjct: 544 QLKELKDKISEEKKEKIENLIKKLRQALQND---NYESIKSLLEELQKALMEIGKEVYSS 600
Query: 442 HEDTRRKKKQELV 454
T + V
Sbjct: 601 TSTTDPASNDDDV 613
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 33.7 bits (78), Expect = 0.23
Identities = 16/63 (25%), Positives = 32/63 (50%), Gaps = 3/63 (4%)
Query: 888 KSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEFENARKRAKKAKANFDRIKKERY 947
+ +E+ + ++ E++E K+ L++ EFEN RKR ++ + + KK
Sbjct: 4 EEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTERER---EEAKKYAI 60
Query: 948 DKF 950
+KF
Sbjct: 61 EKF 63
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 34.9 bits (81), Expect = 0.24
Identities = 47/238 (19%), Positives = 83/238 (34%), Gaps = 28/238 (11%)
Query: 124 EVEKIERR---KEKAENILREKKKE--QGALNRELAKVDQEIREMDVEINKKRPSLIKSK 178
E EK R K+ AE K K+ AL R AK I K + +
Sbjct: 463 EREKAAREARHKKAAEA-RAAKDKDAVAAALARVKAKKAAA----TQPIVIKAGARPDNS 517
Query: 179 ERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINL 238
++ + + A A+ E + A A K A + +A + +KA
Sbjct: 518 AVIAAREARKAQARARQAEKQAAAAADPKK-AAVAAAIARAKAKKAA------------Q 564
Query: 239 ESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 298
++A + P ++ + K A+ + + E + + A + +A
Sbjct: 565 QAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARA 624
Query: 299 RKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDK 356
+ +KAE + + P ++ A + A A +A K QQ E+ D K
Sbjct: 625 KAKKAEQQANAEPEEPVDPRKAAVA-----AAIARAKARKAAQQQANAEPEEAEDPKK 677
>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the AAA
superfamily.
Length = 320
Score = 34.7 bits (80), Expect = 0.25
Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 9/148 (6%)
Query: 5 LQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG------ALKDDYERLKAEMIQAEEET 58
L+ I++ NFKS K I + +IG NGSG AL + E+ K +
Sbjct: 1 LKKIKIKNFKSIKNLEEISLNEGINVIIGENGSGKTTILEALDNLKEKKKIYIS---ISD 57
Query: 59 NMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDEL 118
N + + K+ + + K+ KI I+ + + + L + ++ D EL
Sbjct: 58 NKADIDKENLNKKSKKIKISIIIELSSNEIKSFSIKFYLNYYLEIINNDVEDEIELLLFS 117
Query: 119 DKKKGEVEKIERRKEKAENILREKKKEQ 146
+ ++KI + EK + I KKK+
Sbjct: 118 KNLEYILKKILDKIEKFDEIKGAKKKDF 145
>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
Length = 508
Score = 34.8 bits (80), Expect = 0.26
Identities = 37/148 (25%), Positives = 64/148 (43%), Gaps = 13/148 (8%)
Query: 699 PKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQ---QE 755
PK +A+ A+ AR +T++R+ E+ D ++ + E A L++ E
Sbjct: 180 PKAEAL-AAAPARAETLAREAEDAARAAD-------EAKTAAAAAAREAAPLKASLRKLE 231
Query: 756 RQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKK 815
R K D + K+ A A + +E ARA+ ++K A + A +L+ +A K+
Sbjct: 232 RAKARADAELKR--ADKALAAAKTDEAKARAEERQQKAAQQAAEAATQLDTAKADAEAKR 289
Query: 816 QAVDAMDEEIGKARREVGSIAKDIQAAQ 843
A A E A + AK A+
Sbjct: 290 AAAAATKEAAKAAAAKKAETAKAATDAK 317
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 34.9 bits (80), Expect = 0.27
Identities = 69/362 (19%), Positives = 152/362 (41%), Gaps = 49/362 (13%)
Query: 103 KLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIRE 162
+L +++ + +LE+EL +K+ E ++ + E+ + L E + + N EL + + E
Sbjct: 62 ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEE 121
Query: 163 MDVEINKKRPSLIKSKERVSHIQKKL-ASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
++ + + +++E ++ KL A + K E + + I+ ++ L++++
Sbjct: 122 LEKKAENE---AAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQC 178
Query: 222 RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKK---SLVEVR 278
R D E+ L + +E++ QK+LA A+K SL +
Sbjct: 179 RAQN----------------ADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQ 222
Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGK 338
++ ++ I LE +L YE+ + + + + Q+ E A + K
Sbjct: 223 ASSADNSVKIKHLEEEL-------KRYEQDAEVVKSMKEQLLQIPELERELAALREENRK 275
Query: 339 ILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEAS 398
+ R K D + L EL +++ + E+ ++++ LE + E
Sbjct: 276 L--------RSMKEDNELLKEEL-------EDLQSRLERFEKMREKLADLELEKEKLENE 320
Query: 399 LKDNKKLKEELNSDVGSS---KNRVQELQKELEQVIEELGDAKTD-KHEDTRRKKKQELV 454
LK K L +++ ++ + R+ LQ E Q+ E+ G + K +T ++ Q
Sbjct: 321 LKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQLER 380
Query: 455 EN 456
+
Sbjct: 381 QK 382
Score = 34.9 bits (80), Expect = 0.27
Identities = 71/409 (17%), Positives = 170/409 (41%), Gaps = 36/409 (8%)
Query: 53 QAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIK 112
+A E + L K + AE + + EK++ K + + ++ + + +T++K
Sbjct: 129 EAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELK 188
Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKV---DQEIR--EMDVEI 167
LE EL++ + ++E+ ++ +AE L+ EQ + K+ ++E++ E D E+
Sbjct: 189 LLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEV 248
Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA---------- 217
K + + L + L +++ NE +++ DL+++L
Sbjct: 249 VKSMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLA 308
Query: 218 --DVRKRKAEYERQS----IPGRDINLESAQDV--EINKKRPSLIKSKERVSHIQKKLAS 269
++ K K E E +S + +NL + D+ I + ++ KE+ I
Sbjct: 309 DLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQ 368
Query: 270 AKKSLVEVRQANEAHNKDIADLETQL----ADVRK--RKAEYERQSIPGRDINLES---- 319
+ +L +++ + +I +L+ +L A VR+ R+ + G L S
Sbjct: 369 LETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKE 428
Query: 320 AQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEME 379
T + + + A ++Q++ + + + +L+ ++ QQ N ++ E++
Sbjct: 429 LTETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKDRNNTLET---EIK 485
Query: 380 EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELE 428
++++ E + + + + E L + + L+ +LE
Sbjct: 486 LLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLE 534
Score = 33.0 bits (75), Expect = 1.2
Identities = 80/397 (20%), Positives = 154/397 (38%), Gaps = 51/397 (12%)
Query: 40 LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
K +L+ E++Q E E + ++ ERK + + ++ + R E V +
Sbjct: 66 SKSKLIQLENELMQKELEHKRAQIE-----LERKASTLAENYERELDRNLELEVRLKALE 120
Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
+L K NE E E +L K K + E ++ + EK + + K+ + +L+++
Sbjct: 121 ELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESIS-RIKNDLSEMQCR 179
Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKK---SLVEVRQANEAHNKDIADLETQL 216
+ D E+ L + +E++ QK+LA A+K SL + ++ ++ I LE +L
Sbjct: 180 AQNADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEEL 239
Query: 217 ADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 276
QD E+ K KE++ I + E
Sbjct: 240 KRY---------------------EQDAEVVK------SMKEQLLQIPELERELAALREE 272
Query: 277 VRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRA 336
R+ +D L+ +L D++ R +E+ D+ LE ++ +
Sbjct: 273 NRKLRSMK-EDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENELKSWKSLLQDI 331
Query: 337 GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNE 396
G L+ D ++ R+ V QNE + + + +LE ++Q +
Sbjct: 332 GLNLRTPDDLS--------------RRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQ 377
Query: 397 ASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
+ EL + + K V+ LQ+ L V +E
Sbjct: 378 LERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKE 414
>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
family.
Length = 308
Score = 34.6 bits (80), Expect = 0.27
Identities = 28/123 (22%), Positives = 58/123 (47%), Gaps = 22/123 (17%)
Query: 622 DKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQ---STIKGLEIRLNYSRQDLQNTKS 678
D E+ L+ + E+L EEL E +KK R+E +++++ + + G I + TKS
Sbjct: 147 DPELDELREKLEELEEELEELLKKEREELGIDSLKLGYNKVHGYYIEV---------TKS 197
Query: 679 QIAKLEAEIDALNARADA---TEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCK 735
+ K+ + ++ + T P++K +E + KEE+ +E + R+ +
Sbjct: 198 EAKKVPKDFIRRSSLKNTERFTTPELKELENELLEA-------KEEIERLEKEILRELLE 250
Query: 736 SIG 738
+
Sbjct: 251 KVL 253
>gnl|CDD|234013 TIGR02785, addA_Gpos, helicase-exonuclease AddAB, AddA subunit,
Firmicutes type. AddAB, also called RexAB, substitutes
for RecBCD in several bacterial lineages. These DNA
recombination proteins act before synapse and are
particularly important for DNA repair of double-stranded
breaks by homologous recombination. The term AddAB is
used broadly, with AddA homologous between the
Firmicutes (as modeled here) and the
alphaproteobacteria, while the partner AddB proteins
show no strong homology across the two groups of species
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 1230
Score = 35.1 bits (81), Expect = 0.29
Identities = 31/140 (22%), Positives = 52/140 (37%), Gaps = 16/140 (11%)
Query: 317 LESAQMT--EYTNLKAEATKRAGK------ILQQLDTINREQKGDQDKLDNELRQQVQTQ 368
LE Q L A + LQ +D + +E D NELR+ V
Sbjct: 219 LEGLQEKLQRALELFMAEDGLAPRLENFQLDLQNIDELIQESLAQADW--NELRKAVAAF 276
Query: 369 NEI----KKKRHEMEEAQKRIDKLEDHIRQNEASLKDN--KKLKEELNSDVGSSKNRVQE 422
K E + + DKL + ++ LK + + +E+ + K V+
Sbjct: 277 KFKNLKAAKGDEEDADLLEEADKLREEAKKQLEKLKTDYFTRSEEDHLRIMQEMKPVVKT 336
Query: 423 LQKELEQVIEELGDAKTDKH 442
L + ++ IE G K +K+
Sbjct: 337 LVQLVKDFIERFGAEKREKN 356
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 32.7 bits (75), Expect = 0.40
Identities = 22/111 (19%), Positives = 52/111 (46%), Gaps = 4/111 (3%)
Query: 320 AQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEME 379
A++ K +A ++ K+ + L+ Q K + EL + + E++ R ++
Sbjct: 10 AELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLN 69
Query: 380 EAQKRIDKLEDHIRQNEASLKDNKKL----KEELNSDVGSSKNRVQELQKE 426
E +K I +L+ +A L + ++ K+ L ++ + R++EL ++
Sbjct: 70 ELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNEQ 120
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 33.9 bits (78), Expect = 0.41
Identities = 52/302 (17%), Positives = 123/302 (40%), Gaps = 22/302 (7%)
Query: 109 TDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEIN 168
+ + DEL+ K+ ++++ ++ + LR++ E EL +E+RE E+
Sbjct: 2 LAMLDKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELR 61
Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 228
++R + + EV++ E ++ A L+ + R+ K +
Sbjct: 62 EERDEINE--------------------EVQELKEKRDEINAKLQELRKEYRELKEKRNE 101
Query: 229 QSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDI 288
++ GR I + + KK+ + + + E + +K+ +K L + ++A E + K
Sbjct: 102 FNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLK 161
Query: 289 ADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINR 348
+K + +E+ + +M + E K A ++ ++ +++
Sbjct: 162 ELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSK 221
Query: 349 EQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA--SLKDNKKLK 406
+ ++ N + + + +IK R + + A++R + E R E K +KL
Sbjct: 222 KIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEKLT 281
Query: 407 EE 408
E
Sbjct: 282 TE 283
Score = 32.3 bits (74), Expect = 1.1
Identities = 42/228 (18%), Positives = 92/228 (40%), Gaps = 23/228 (10%)
Query: 255 KSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRD 314
K +E Q+ + EV++ E ++ A L+ + R+ K + ++ GR
Sbjct: 49 KVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRS 108
Query: 315 IN--------LESAQMTEYTNL---------------KAEATKRAGKILQQLDTINREQK 351
I LE Q T + E K+A + ++L + E
Sbjct: 109 IKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEID 168
Query: 352 GDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
+ K + + NE ++ EM + + D+L + + K +EL+
Sbjct: 169 ELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHE 228
Query: 412 DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKK 459
+ + +N ++EL+K+++ + + AK + + +++ +E+ E FK+
Sbjct: 229 EFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKR 276
>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
Length = 329
Score = 33.3 bits (77), Expect = 0.60
Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 9/94 (9%)
Query: 342 QLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKD 401
+ I +E + +DK E + + E+ KK E+ E I KL I+ + A LK+
Sbjct: 5 YISEILKEYEKRRDK---EELELKNRKEEVYKKIPEIAEIDDEIAKLG--IKLSRAILKN 59
Query: 402 NKKLKEELNSDVGSSKNRVQELQKELEQVIEELG 435
K +E L K ++ +L+ + +++ G
Sbjct: 60 PDKKEETLK----ELKEKITDLRVKKAELLVSNG 89
Score = 32.2 bits (74), Expect = 1.3
Identities = 18/78 (23%), Positives = 43/78 (55%), Gaps = 4/78 (5%)
Query: 124 EVEKI-ERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVS 182
E+ K E+R++K E L+ +K+E E+A++D EI ++ +++++ +++K+ ++
Sbjct: 8 EILKEYEKRRDKEELELKNRKEEVYKKIPEIAEIDDEIAKLGIKLSR---AILKNPDKKE 64
Query: 183 HIQKKLASAKKSLVEVRQ 200
K+L L +
Sbjct: 65 ETLKELKEKITDLRVKKA 82
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 33.1 bits (76), Expect = 0.71
Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 355 DKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKL-EDHIRQNEASLKDNKKLKEELNSDV 413
++L+ ++++ + + I KK+ E + + I KL E+ + E + K K+L EE +S
Sbjct: 34 ERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKSSWF 93
Query: 414 GSSKN 418
S
Sbjct: 94 PSKDP 98
>gnl|CDD|224045 COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport
systems, ATPase components [Inorganic ion transport and
metabolism / Coenzyme metabolism].
Length = 258
Score = 32.9 bits (76), Expect = 0.75
Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 9/39 (23%)
Query: 8 IEVDNFK-SYKGK-------FSIGPLKKFTAVIGPNGSG 38
+EV+N Y GK FSI P + T ++GPNGSG
Sbjct: 3 LEVENLSFGYGGKPILDDLSFSI-PKGEITGILGPNGSG 40
>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
Length = 141
Score = 32.0 bits (73), Expect = 0.75
Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 670 RQDLQNTK---SQIAKLEAEIDAL--NARADATEPKIKAIEASMTARGDTISRKKEEMNS 724
+ DL+ K S ++++E EI+ + NAR +A + + KAI + I KK E+ S
Sbjct: 44 KNDLEKVKTNSSDVSEIEHEIETILKNAREEANKIRQKAIAKAKEEAEKKIEAKKAELES 103
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 33.1 bits (76), Expect = 0.76
Identities = 25/97 (25%), Positives = 35/97 (36%), Gaps = 4/97 (4%)
Query: 636 SEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARAD 695
+ELREA+ + KE T + E + + K +I KL+AEID L D
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEE----KAEYDALKEEIDKLDAEIDRLEELLD 56
Query: 696 ATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRD 732
E K A + K E + D
Sbjct: 57 ELEAKPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGD 93
Score = 31.2 bits (71), Expect = 2.8
Identities = 17/92 (18%), Positives = 39/92 (42%), Gaps = 3/92 (3%)
Query: 378 MEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDA 437
++E ++ + +L +R+ + EE ++ + K + +L E+++ +EEL D
Sbjct: 1 LKELREALAELAKELRKLTE--DEKLAEAEEEKAEYDALKEEIDKLDAEIDR-LEELLDE 57
Query: 438 KTDKHEDTRRKKKQELVENFKKAYSGVYDRMI 469
K + E E KA + + +
Sbjct: 58 LEAKPAASGEGGGGEEEEEEAKAEAAEFRAYL 89
Score = 29.6 bits (67), Expect = 9.0
Identities = 21/116 (18%), Positives = 39/116 (33%), Gaps = 20/116 (17%)
Query: 289 ADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINR 348
L ++RK + + + EY LK E K +I
Sbjct: 6 EALAELAKELRKLTED--------EKLAEAEEEKAEYDALKEEIDKLDAEI--------- 48
Query: 349 EQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKK 404
++L +EL + E E EEA+ + ++R + +L + +K
Sbjct: 49 ---DRLEELLDELEAKPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERK 101
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 33.1 bits (76), Expect = 0.97
Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
Query: 380 EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAK 438
E +KR+ KL I +NEA +N++L++ S + VQ +EL + IE+L +
Sbjct: 70 ELRKRLAKL---ISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSER 125
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 33.0 bits (76), Expect = 1.0
Identities = 28/157 (17%), Positives = 63/157 (40%), Gaps = 25/157 (15%)
Query: 15 SYKGKFSIGPLKKFTAVIGPNGSGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKE 74
+ + + ++ P V+ + ++ +RL+A N +K + E +
Sbjct: 736 NLRAEMNLSPSAPLKVVLVGS-----EELEDRLEA---------NEDDIKGLANLEELEI 781
Query: 75 AKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERR--- 131
+ +E E+ L L ++ LE EL+K + E+++IE++
Sbjct: 782 LSPDPEEPPVEVTAVVGGA--ELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKLSN 839
Query: 132 -----KEKAENILREKKKEQGALNRELAKVDQEIREM 163
K E + +EK+K +LAK+++ + +
Sbjct: 840 EGFVAKAPEEVVEKEKEKLAEYQ-VKLAKLEERLAVL 875
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 32.6 bits (75), Expect = 1.0
Identities = 39/166 (23%), Positives = 63/166 (37%), Gaps = 30/166 (18%)
Query: 635 LSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARA 694
LS E+ S K+S L+ + S I L L+ RQ Q+ + +A L A + A A
Sbjct: 44 LSREI------SGKDSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSAAEAER 97
Query: 695 DATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQ- 753
+ + + + A +E++S + + R + V + Q A LR Q
Sbjct: 98 SRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQ---VELLNQQIAA-LRRQL 153
Query: 754 --------------QERQKICQDKDTKKNVARWERAVSDDEEELAR 785
+E Q D + NVA +R +EL R
Sbjct: 154 AALEAALDASEKRDRESQAKIADLGRRLNVALAQRV-----QELNR 194
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 32.8 bits (74), Expect = 1.1
Identities = 77/402 (19%), Positives = 176/402 (43%), Gaps = 50/402 (12%)
Query: 79 KDEAEKYQRIREEIVA------KEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRK 132
K++ EK Q + EE K+V L ++ E +K+L L++ + + ++E +
Sbjct: 218 KEDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKT 277
Query: 133 EKAENILREKKKEQGALNRELAKVD---------QEIREMDVEINKKRPSLIKSKERVSH 183
+ + L+E +++ L +EL + Q+ E D++I K + ++
Sbjct: 278 KLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQM 337
Query: 184 IQKKLASAKKSLVE-------------VRQANEAHNKDIADLETQLADVRKRKAEYERQS 230
+ A A S V +R + K+ L+ +++K+ +E E +
Sbjct: 338 EELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMT 397
Query: 231 IPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD 290
+ +E + +I + L+ K++ I ++L ++ L+ + QA E K+I D
Sbjct: 398 KFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQARE---KEIHD 454
Query: 291 LETQLA-----------DVRKRKAEYERQSIPGRDINLESAQMT-EYTNLKAEATKRAGK 338
LE QL +V K E E++ + ++ ++ E L EA+ +
Sbjct: 455 LEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLE 514
Query: 339 ILQQLDTINREQKGDQ---DKLDNELRQQVQTQNEIKKKRHEM----EEAQKRIDKLEDH 391
+ + + I +K ++ +++N +++ ++E++ R E +E + ++DK E++
Sbjct: 515 LKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEEN 574
Query: 392 IRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
R E + +K + L + + K +++ K +E++ +E
Sbjct: 575 ARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQE 616
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 32.9 bits (75), Expect = 1.1
Identities = 43/268 (16%), Positives = 89/268 (33%), Gaps = 34/268 (12%)
Query: 148 ALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNK 207
+N+ L K+ QE E+D K+ + K E +LA + L R A
Sbjct: 182 QINQLLEKLKQERNEIDEA-EKEYATYHKLLESRRAEHARLAELRSELRADRDHIRALRD 240
Query: 208 DIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKL 267
+ + + E++ R+ +D +++ ++R +H+QK
Sbjct: 241 AVELW-----PRLQEWKQLEQELTRRREELATFPRD--------GVLRLEKREAHLQKTE 287
Query: 268 ASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTN 327
A LV + + + ++ I E L + + + S
Sbjct: 288 AEIDALLVRLAELKDLASQLIPAKEAVLQALVRLHQQ-------------LSEIKASAFE 334
Query: 328 LKAEATKRAGKILQQLDT----INREQKGDQDKLDNELRQ---QVQTQNEIKKKRHEMEE 380
L + + + E+ D L+ LR Q + + ++E
Sbjct: 335 LTETLAGIEADLRDKEEAAGNGFEAERVHDLRSLECMLRYQSSQRELKQTEAAYCKRLDE 394
Query: 381 AQKRIDKLEDHIRQNEASLKDNKKLKEE 408
+ D+ E+ RQ A ++ + +E
Sbjct: 395 KRLFEDEAEEEARQRLADDEEEVRAGDE 422
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 32.8 bits (75), Expect = 1.1
Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 15/83 (18%)
Query: 356 KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGS 415
LD E RQ ++ E++ +K +KL + A LK K+ EL ++
Sbjct: 37 ALDKEWRQ----------RQFELDSLRKEFNKLNKEV----AKLKIAKEDATELIAETKE 82
Query: 416 SKNRVQELQKELEQVIEELGDAK 438
K + E + E+++ L DAK
Sbjct: 83 LKKEITEKEAEVQEAKAAL-DAK 104
>gnl|CDD|147765 pfam05791, Bacillus_HBL, Bacillus haemolytic enterotoxin (HBL).
This family consists of several Bacillus haemolytic
enterotoxins (HblC, HblD, HblA, NheA, and NheB) which
can cause food poisoning in humans.
Length = 181
Score = 31.9 bits (73), Expect = 1.1
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 402 NKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRR 447
++K KE L + + +Q QKE++Q++EEL D K +DTR
Sbjct: 98 DQKDKETLKEGLTKLQGEIQTNQKEVDQLLEELKDFKNKLAKDTRS 143
>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
transferase subunit alpha; Provisional.
Length = 762
Score = 32.9 bits (75), Expect = 1.1
Identities = 65/361 (18%), Positives = 156/361 (43%), Gaps = 53/361 (14%)
Query: 107 NETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
+T ++ELE E++K K ++ K + K + ALN + K+ +EI D+E
Sbjct: 427 VKTPVRELEGEVEKLKEQILKAKESSSKPSEL---------ALNEMIEKLKKEI---DLE 474
Query: 167 INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEY 226
+ ++ +Q++L + ++ + ++ + + + + K K E+
Sbjct: 475 YTEAVIAM--------GLQERLENLREEFSKANSQDQLMHPVLME------KIEKLKDEF 520
Query: 227 -ERQSIPGRDINLESAQDV--EINKKRPSLIKSKERVSHIQKKLASAKKSLV---EVRQA 280
+R S ++L+ D+ E ++ + +L + K + ++ ++ K ++ E+++
Sbjct: 521 NKRLSRAPNYLSLKYKLDMLNEFSRAK-ALSEKKSKAEKLKAEINKKFKEVMDRPEIKEK 579
Query: 281 NEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKIL 340
EA ++A D E+ ++I LE A + + L+ + K
Sbjct: 580 MEALKAEVASSGASSGDELD-DDLKEKVEKMKKEIELELAGVLKSMGLEVIGVTKKNKDT 638
Query: 341 QQLDTINREQKGDQDKLDNELRQQVQT-------QNEIKKKRHEMEEAQK--------RI 385
+ T + + L+ E+ ++++ +++I+ + E+ +A K +I
Sbjct: 639 AE-QTPPPNLQEKIESLNEEINKKIERVIRSSDLKSKIELLKLEVAKASKTPDVTEKEKI 697
Query: 386 DKLEDHIRQNEASLKDNKKLK---EELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKH 442
+ LE I+Q A ++ +LK EEL +++ +++ E L+ ++ D+K D
Sbjct: 698 EALEQQIKQKIAEALNSSELKEKFEELEAELAAARETAAESNGSLKNDDDKEEDSKEDGS 757
Query: 443 E 443
Sbjct: 758 R 758
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 32.6 bits (75), Expect = 1.2
Identities = 22/98 (22%), Positives = 47/98 (47%), Gaps = 4/98 (4%)
Query: 368 QNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDN-KKLKEELNSDVGSSKNRVQELQKE 426
E+K++ E+EE + +LE+ + + + LK KK + +L + ++ E +++
Sbjct: 329 FPELKEELKELEE---ELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEK 385
Query: 427 LEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSGV 464
L + +EEL + + E+ + + K Y GV
Sbjct: 386 LSEELEELEEELKELKEELESLYSEGKISVNKTIYPGV 423
Score = 31.5 bits (72), Expect = 2.6
Identities = 16/86 (18%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
Query: 148 ALNRELAKVDQEIREMDVEINK--KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAH 205
L EL ++++E++E++ E+ K K + K R +K +K + + E
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEK----LLETKEKL 386
Query: 206 NKDIADLETQLADVRKRKAEYERQSI 231
++++ +LE +L ++++ +
Sbjct: 387 SEELEELEEELKELKEELESLYSEGK 412
Score = 30.7 bits (70), Expect = 5.0
Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 3/93 (3%)
Query: 78 EKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAEN 137
+ E+ + + EE+ E E + K + K +K+ ++EK+ KEK
Sbjct: 329 FPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSE 388
Query: 138 ILREKKKEQGALNRELAKVDQEIREMDVEINKK 170
L E ++E L EL + E + + +NK
Sbjct: 389 ELEELEEELKELKEELESLYSEGK---ISVNKT 418
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 32.7 bits (75), Expect = 1.2
Identities = 16/81 (19%), Positives = 32/81 (39%), Gaps = 13/81 (16%)
Query: 628 LKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEI 687
K + +L E+ E E + + + Q + ++A+ EAEI
Sbjct: 280 TKREIAQLEEQKEEEGSAKNGGPERGEIANPV-------------YQQLQIELAEAEAEI 326
Query: 688 DALNARADATEPKIKAIEASM 708
+L AR +I+ +E+ +
Sbjct: 327 ASLEARVAELTARIERLESLL 347
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 32.7 bits (74), Expect = 1.2
Identities = 67/345 (19%), Positives = 134/345 (38%), Gaps = 19/345 (5%)
Query: 49 AEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQ---RIREEIVAKEVEHQLFKLY 105
A + +AEE S + +K V KE K+ E+ EE +
Sbjct: 146 AALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRE 205
Query: 106 HNETDIKELEDELDKKKGEVE-KIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
E ++KE E EL++ K +++ + + K + E +++ ++ +A V +E+ E
Sbjct: 206 TYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAK 265
Query: 165 VEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKA 224
+ K + V ++++L KK L E+R+ + + LE +L + K
Sbjct: 266 ANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEKE 325
Query: 225 EYERQSIPGRDINLESAQDVEINKK------RPSLIKSKERVSHIQKKLASAKKSLVEVR 278
R+ + + L+ A + R L K KE + L +A+ L
Sbjct: 326 SKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAELRLEAAL 385
Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE--SAQMTEYTNLKAEATKRA 336
+ EA A ++ +++ + + S ++LE + + A K+
Sbjct: 386 KEAEAAKAAEALALAEIKALQESEESAKADSPRKITLSLEEYEELSKKAEEAEELAEKKV 445
Query: 337 GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEA 381
L Q++ + KL+ + EI++K+ +EEA
Sbjct: 446 AAALAQVEEAKESENESLKKLE-------EASKEIEEKKAALEEA 483
>gnl|CDD|221514 pfam12297, EVC2_like, Ellis van Creveld protein 2 like protein.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 571 and 1310 amino
acids in length. There are two conserved sequence
motifs: LPA and ELH. EVC2 is implicated in Ellis van
Creveld chondrodysplastic dwarfism in humans. Mutations
in this protein can give rise to this congenital
condition. LIMBIN is a protein which shares around 80%
sequence homology with EVC2 and it is implicated in a
similar condition in bovine chondrodysplastic dwarfism.
Length = 429
Score = 32.5 bits (74), Expect = 1.2
Identities = 35/186 (18%), Positives = 68/186 (36%), Gaps = 46/186 (24%)
Query: 44 YERLKAEMIQAEEETN----------MSYLKK-KGVVAERKEAKIEKDEAEKYQRIREEI 92
++R AEM QAEE S L G+ E + + + E + + ++
Sbjct: 238 HQREMAEMEQAEELLKRAPEREAVECSSLLDTLHGLEQEHLQRSLLLQQEEDFAKAHRQL 297
Query: 93 VAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRE 152
+ +L ++ + +K K E + E+A+++L+E K Q +
Sbjct: 298 AVFQ-RVELHSIFFTQ--LKSA----------TFKGELKPEEAKSLLQEYSKIQETIE-- 342
Query: 153 LAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD-IAD 211
MD +R H+ ++ A ++ LV+ Q+ E+ +
Sbjct: 343 --------ELMDFFQASQR----------YHLSERFAQ-REYLVKALQSQESRVSGLLNT 383
Query: 212 LETQLA 217
TQL
Sbjct: 384 AATQLT 389
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 32.9 bits (75), Expect = 1.2
Identities = 18/97 (18%), Positives = 35/97 (36%), Gaps = 4/97 (4%)
Query: 109 TDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEIN 168
E + + + + E+A LR ++ GA+ E+ E R
Sbjct: 882 ARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAMVDEIRARLAETRAALASGG 941
Query: 169 KKRPSLIKSKERVSH----IQKKLASAKKSLVEVRQA 201
++ P L ++ ++K A A +L E +A
Sbjct: 942 RELPRLAEALATAEEARGRAEEKRAEADATLDERAEA 978
Score = 32.5 bits (74), Expect = 1.5
Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 10/107 (9%)
Query: 123 GEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVS 182
+ R +AE+ RE ++ E + +R ++ + V
Sbjct: 875 TRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVG----------AMVD 924
Query: 183 HIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
I+ +LA + +L + + +A E +++AE +
Sbjct: 925 EIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADAT 971
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 32.4 bits (74), Expect = 1.3
Identities = 52/277 (18%), Positives = 95/277 (34%), Gaps = 69/277 (24%)
Query: 173 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD------LETQLADVRKRKAEY 226
S + S+ K A +L + A++ + A L+ +L ++R R E
Sbjct: 155 SRVIELSYTSNDPKLAAKLANALAQAYLADQLEAQLEAFRRASDSLDERLEELRARLQEA 214
Query: 227 ERQ----------SIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 276
E Q + R L Q +N + L ++ R++ + +LAS + L
Sbjct: 215 EAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQ---LQSARARLAQAEARLASLLQLLPL 271
Query: 277 VRQAN----EAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEA 332
R+A + I DL Q A VR++ A+
Sbjct: 272 GREAAALREVLESPTIQDLRQQYAQVRQQIADLST------------------------- 306
Query: 333 TKRAGKILQQLDTINREQKGDQDKLDNELRQ-QVQTQNEIKKKRHEMEEAQKRIDKLEDH 391
G QL + + + ++ ELRQ NE+
Sbjct: 307 --ELGAKHPQLVALEAQLAELRQQIAAELRQILASLPNELAL------------------ 346
Query: 392 IRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELE 428
+ Q EA+L+ + S + + +++EL++E E
Sbjct: 347 LEQQEAALEKELAQLKGRLSKLPKLQVQLRELEREAE 383
>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
permease components [Defense mechanisms].
Length = 567
Score = 32.4 bits (74), Expect = 1.3
Identities = 13/69 (18%), Positives = 27/69 (39%), Gaps = 1/69 (1%)
Query: 991 TIVSHRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSL 1050
I P +LDE +ALD + + + T++I+ + +AD +
Sbjct: 475 AIARALLRNPPILILDEATSALDTETEALIQDALKKLLKGR-TTLIIAHRLSTIKNADRI 533
Query: 1051 VGICPGSVT 1059
+ + G +
Sbjct: 534 IVLDNGRIV 542
Score = 30.1 bits (68), Expect = 7.4
Identities = 14/77 (18%), Positives = 29/77 (37%), Gaps = 6/77 (7%)
Query: 872 IVLPMLRVQKYDRKLAKSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEFENARKR 931
+ L + + + RKL++ ++E L N R +E L + E+ E R
Sbjct: 165 LALVLSLLARKSRKLSRRVREALGEL------NARLLESLSGIRVIKAFGAEDRELKRFE 218
Query: 932 AKKAKANFDRIKKERYD 948
+ ++ R +
Sbjct: 219 EANEELRRANLRASRLE 235
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 32.5 bits (74), Expect = 1.4
Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 9/91 (9%)
Query: 75 AKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEK 134
++EKD E E+ + KE +L + + E+ L K+ E ++ ++
Sbjct: 364 ERLEKDVEEG-----EKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQD 418
Query: 135 AENILREKKKEQGA----LNRELAKVDQEIR 161
EN L+E KE+ L E ++V++
Sbjct: 419 EENFLKELSKEEKELLEKLKMEASEVEKLFG 449
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 32.3 bits (74), Expect = 1.4
Identities = 23/103 (22%), Positives = 43/103 (41%), Gaps = 10/103 (9%)
Query: 338 KILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
+ D + K +D +L + + +E KK E+EE Q ++NE
Sbjct: 6 LLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQA---------KRNEL 56
Query: 398 SLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTD 440
S K K K + + K ++EL++EL ++ L + +
Sbjct: 57 S-KQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAE 98
>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF. Sequences
in this family of proteins are members of the chain
length determinant family (pfam02706) which includes the
wzc protein from E.coli. This family of proteins are
homologous to the EpsF protein of the methanolan
biosynthesis operon of Methylobacillus species strain
12S. The distribution of this protein appears to be
restricted to a subset of exopolysaccharide operons
containing a syntenic grouping of genes including a
variant of the EpsH exosortase protein. Exosortase has
been proposed to be involved in the targetting and
processing of proteins containing the PEP-CTERM domain
to the exopolysaccharide layer.
Length = 444
Score = 32.5 bits (74), Expect = 1.4
Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 12/87 (13%)
Query: 673 LQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRD 732
+QN K+ IA+ E+++ L+ R P+ K +A + + S + +
Sbjct: 256 IQNLKTDIARAESKLAELSQRLGPNHPQYKRAQAEI-----------NSLKSQLNAEIKK 304
Query: 733 FCKSIG-VSTIRQYEEAELRSQQERQK 758
S+G S I + EAELR E QK
Sbjct: 305 VTSSVGTNSRILKQREAELREALENQK 331
>gnl|CDD|181567 PRK08861, PRK08861, cystathionine gamma-synthase; Provisional.
Length = 388
Score = 32.2 bits (73), Expect = 1.5
Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 9/88 (10%)
Query: 978 ENPEEPLTYRVSTTIVSHRYHPA-PFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIV 1036
E+ ++ L Y + ++V YHP+ P EI A + G + S+ + + L+ V
Sbjct: 258 ESAQQILAYLQTQSLVGTIYHPSLPEHPGHEI-AKKQQSGFGSMLSFEFAGSFEQLKVFV 316
Query: 1037 ISLKEEFFSHADSLVG----IC-PGSVT 1059
+L FS A+SL G IC P S+T
Sbjct: 317 KAL--ALFSLAESLGGVESLICHPASMT 342
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 32.3 bits (73), Expect = 1.6
Identities = 51/396 (12%), Positives = 141/396 (35%), Gaps = 25/396 (6%)
Query: 80 DEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENIL 139
+ Y+ +E+ E + + + ++K D L +K E KI ++ +
Sbjct: 246 KLEDNYEPSEQELKLG-FEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKW 304
Query: 140 REKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 199
R K + + + Q+ +E ++ K + + +E + +Q + K L +
Sbjct: 305 RALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQG 364
Query: 200 QANEAHNKDIADLETQLADVRKRKAEYE--RQSIPGRDINLESAQDVEINKKRPSLIKSK 257
+ E + E ++ K + + +S+ R + + + K
Sbjct: 365 ISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFK-SLEKTLRQYDSLI 423
Query: 258 ERVSHIQKKLASAKK--SLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDI 315
+ ++ + ++ SL + I + + + + ++ + I
Sbjct: 424 QNITRSRSQIGHNVNDSSLKINIEQLFPKGSGINESIKKSILELNDEIQERIKTEENKSI 483
Query: 316 NLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKR 375
LE + + E T+ K+ +L N ++ + + E+ +R
Sbjct: 484 TLEED-IKNLKHDINELTQILEKLELELSEAN--------SKFELSKE--ENERELVAQR 532
Query: 376 HEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELG 435
E+E+ +K ++ L ++ S+ D ++L + + K + + ++ + + +
Sbjct: 533 IEIEKLEKELNDLN---LLSKTSILDAEQLVQSTEIKLDELKVDLNRKRYKIHKQVIHVI 589
Query: 436 DAKTDKHEDTRRKKKQELVENFKKAYSGVYDRMINM 471
D + + Q +E+ + V + + N+
Sbjct: 590 DITSKFKINI-----QSSLEDLENELGKVIEELRNL 620
Score = 31.9 bits (72), Expect = 2.5
Identities = 24/84 (28%), Positives = 42/84 (50%), Gaps = 11/84 (13%)
Query: 623 KEMGNLKAQKEKLSEELREAMKKSRK-----------ESELNTVQSTIKGLEIRLNYSRQ 671
++ NLK Q + L E+++EAMK S+K +S+ N ++ + ++ +
Sbjct: 271 TDIANLKTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPG 330
Query: 672 DLQNTKSQIAKLEAEIDALNARAD 695
L+ KS+I E EI AL + D
Sbjct: 331 KLEKLKSEIELKEEEIKALQSNID 354
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 32.6 bits (75), Expect = 1.6
Identities = 71/419 (16%), Positives = 159/419 (37%), Gaps = 74/419 (17%)
Query: 84 KYQRIREEIV-AKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKA------- 135
+Y R E + + + + E + EL + +V+K++R +
Sbjct: 769 RYSRFPEVPLFGRAAREKRIEQLRAERE--ELAERYATLSFDVQKLQRLHQAFSRFIGSH 826
Query: 136 ---------ENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQK 186
E LR+ + + L R LA + + ++ R L ++KE +S + +
Sbjct: 827 LAVAFEADPEAELRQLNRRRVELERALADHESQEQQQ-------RSQLEQAKEGLSALNR 879
Query: 187 KLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEI 246
L +L+ E + ++ QL + + K R + ++
Sbjct: 880 LLPRL--NLLAD----ETLADRVEEIREQLDEAEEAK----------RFVQQHGNALAQL 923
Query: 247 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE-- 304
L E+ +++ A+++ + +Q A L +V +R+A
Sbjct: 924 EPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFA-----------LTEVVQRRAHFS 972
Query: 305 YERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQ-KGDQDKLDNELRQ 363
YE ++A+M + E ++ + +Q T REQ + Q +L +
Sbjct: 973 YE-----------DAAEMLAKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQV 1021
Query: 364 QVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQEL 423
++ KR ++E ++ + L + + + + + ++EL++ + ++++R +L
Sbjct: 1022 LASLKSSYDAKRQMLQELKQELQDLG--VPADSGAEERARARRDELHARLSANRSRRNQL 1079
Query: 424 QKELEQVIEELGDAKTDKHEDTRRKKKQ--ELVENFKKAYSGVYDRMIN--MCHPVHKR 478
+K+L E D T K R + E V N K + V + + + +H+R
Sbjct: 1080 EKQL-TFCEAEMDNLTKKLRKLERDYHEMREQVVNAKAGWCAVLRLVKDNGVERRLHRR 1137
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 32.3 bits (73), Expect = 1.6
Identities = 28/140 (20%), Positives = 64/140 (45%), Gaps = 5/140 (3%)
Query: 109 TDIKELEDELDKKKGEVEKIE--RRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
TD+KE E + D K+ + K E +++ A+ ++ Q +++ +V Q+ +E
Sbjct: 201 TDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNL 260
Query: 167 INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA-DVRKRKAE 225
S K ++V+ QK+ +K+ +E+++ +E K L + + + E
Sbjct: 261 PKPADTSSPKEDKQVAENQKR--EIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKE 318
Query: 226 YERQSIPGRDINLESAQDVE 245
E + + + A+D++
Sbjct: 319 AEDKELEAQKKREPVAEDLQ 338
Score = 31.9 bits (72), Expect = 2.0
Identities = 29/133 (21%), Positives = 50/133 (37%), Gaps = 11/133 (8%)
Query: 343 LDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDN 402
D RE + D + + + Q + K + + + AQ DK D +RQ ++
Sbjct: 201 TDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQ---KQQEA 257
Query: 403 KKLKEELNSDVGSSKNRVQE--------LQKELEQVIEELGDAKTDKHEDTRRKKKQELV 454
K L + ++ +V E Q E+++ EE AK K D +++ K
Sbjct: 258 KNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEK 317
Query: 455 ENFKKAYSGVYDR 467
E K R
Sbjct: 318 EAEDKELEAQKKR 330
Score = 31.5 bits (71), Expect = 2.8
Identities = 28/145 (19%), Positives = 58/145 (40%), Gaps = 11/145 (7%)
Query: 64 KKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKG 123
+KGV R +++ E+++ + R + + +E++ + + +D DK++
Sbjct: 190 NEKGVNFRRDMTDLKERESQEDAK-RAQQLKEELDKKQIDADKAQQKADFAQDNADKQRD 248
Query: 124 EVEKIERRKEKAENILR-----EKKKEQGALNRELAKVDQEIREMDVEINKKRPSLI--- 175
EV + ++ + E K+ RE+ K EI++ D E K +
Sbjct: 249 EVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDL 308
Query: 176 --KSKERVSHIQKKLASAKKSLVEV 198
+SK + K A+K V
Sbjct: 309 KQESKASEKEAEDKELEAQKKREPV 333
>gnl|CDD|235840 PRK06599, PRK06599, DNA topoisomerase I; Validated.
Length = 675
Score = 32.5 bits (75), Expect = 1.6
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 17/80 (21%)
Query: 69 VAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLF--KLYH------------NETDIKEL 114
+ ER E +IE ++Y I ++ + F KL NE K L
Sbjct: 178 ICER-EDEIEAFIPQEYWTIEADLATSN--GEPFTAKLVEVNGKKLEKFSITNEEQAKAL 234
Query: 115 EDELDKKKGEVEKIERRKEK 134
L+ + V+KIE+++ K
Sbjct: 235 VKALEGQAYTVDKIEKKERK 254
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 32.2 bits (73), Expect = 1.6
Identities = 29/123 (23%), Positives = 59/123 (47%), Gaps = 12/123 (9%)
Query: 45 ERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKL 104
ERL +E+ AE+E + K +++ + +E E +E+ KE E L +L
Sbjct: 32 ERLDSELRDAEKERDTY----KQYLSKLESQNVEISNYEALDSELDEL-KKEEERLLDEL 86
Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
+ELE E D GE+ +++ KE+ EN + +E +R +++ ++ ++
Sbjct: 87 -------EELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLE 139
Query: 165 VEI 167
++
Sbjct: 140 LQY 142
>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain. These
proteins contain several 22 residue repeats which form a
pair of alpha helices. This family includes:
Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
E.
Length = 191
Score = 31.4 bits (72), Expect = 1.7
Identities = 28/149 (18%), Positives = 61/149 (40%), Gaps = 5/149 (3%)
Query: 322 MTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQ-NEIKKK-RHEME 379
+ +++A ++ ++ E + ELR+++ E+++K E
Sbjct: 44 QKDLEDVRARLQPYLDELKAKVGQNLEELRQRLAPYAEELRKRLNRDAEELRRKLAPYAE 103
Query: 380 EAQKRIDKLEDHIRQNEASLKDN--KKLKEELNSDVGSSKNRVQELQKELEQVIEELGDA 437
E + R+ + + +R + +KL E L S +E + +L + +EEL +
Sbjct: 104 ELRDRLRQNVEALRARLGPYVEELRQKLAERLEELKESVGPYAEEYKAQLSEQVEELREK 163
Query: 438 KTDKHEDTRRKKKQELVENFKKAYSGVYD 466
ED R K + E+ KK + + +
Sbjct: 164 LEPYAEDLREKLEP-YAEDLKKQLAQLLE 191
>gnl|CDD|218839 pfam05983, Med7, MED7 protein. This family consists of several
eukaryotic proteins which are homologues of the yeast
MED7 protein. Activation of gene transcription in
metazoans is a multi-step process that is triggered by
factors that recognise transcriptional enhancer sites in
DNA. These factors work with co-activators such as MED7
to direct transcriptional initiation by the RNA
polymerase II apparatus.
Length = 161
Score = 31.1 bits (71), Expect = 1.8
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILR 140
E+ I +E++LD+K+ E+E I + E+ +L
Sbjct: 129 ESLIMLMEEQLDEKREEIEAIRKVLEEVREVLE 161
>gnl|CDD|226844 COG4420, COG4420, Predicted membrane protein [Function unknown].
Length = 191
Score = 31.2 bits (71), Expect = 1.9
Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 9/87 (10%)
Query: 150 NRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDI 209
NR+ A+ D+ E+D ++N +K+++ V+ + +KL + L VR + + +
Sbjct: 112 NRQ-AERDRLRAELDYQVN------LKAEQEVAALHEKLDELRLDLGYVRDELDDLRELL 164
Query: 210 ADLETQLADVRKRKA--EYERQSIPGR 234
A++E +LAD + + S P R
Sbjct: 165 AEIEPELADEEALRRAVRGGQHSKPER 191
Score = 30.5 bits (69), Expect = 3.1
Identities = 18/78 (23%), Positives = 37/78 (47%), Gaps = 8/78 (10%)
Query: 238 LESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 297
L + D ++N +K+++ V+ + +KL + L VR + + +A++E +LAD
Sbjct: 120 LRAELDYQVN------LKAEQEVAALHEKLDELRLDLGYVRDELDDLRELLAEIEPELAD 173
Query: 298 VRKRKA--EYERQSIPGR 313
+ + S P R
Sbjct: 174 EEALRRAVRGGQHSKPER 191
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 31.9 bits (72), Expect = 2.1
Identities = 43/254 (16%), Positives = 88/254 (34%), Gaps = 44/254 (17%)
Query: 215 QLADVRKRKAEYERQSIPGRD-INLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKS 273
Q D R + +R+ + R L S D ++ R ++ R K+ +
Sbjct: 586 QTGDPTTRWEKDDRRKLGDRSTYRLGSTNDAKVETLRETVKAMLSREDFYMIKIMRQQGE 645
Query: 274 LVEVRQANEAHNKDIA------DLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTN 327
+++++ A A DL + + + +A ER + DI + A +
Sbjct: 646 YIKLQEQANALAHIQALNFASIDLPSAQRQIAELQARLERLTHTQSDIAIAKAALDAAQT 705
Query: 328 L--------KAEAT---------KRAGKILQQLDTINREQKGDQDK-------LDNELRQ 363
+ E T KRA + +++ +I ++ + Q
Sbjct: 706 RQKVLERQYQQEVTECAGLKKDLKRAAMLSRKVHSIAKQGMTGALQALGAAHFPQVAPEQ 765
Query: 364 QVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ-NEASLKDNKKLKEELNS----------D 412
+I++ H + QKRID + +R+ E + K+E + D
Sbjct: 766 HDDIV-DIERIEHR-RQLQKRIDAVNARLRRLREEIIGRMSDAKKEDTAALSEVGAELDD 823
Query: 413 VGSSKNRVQELQKE 426
+ R+Q L ++
Sbjct: 824 IPEYLARLQTLTED 837
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 31.9 bits (73), Expect = 2.2
Identities = 34/143 (23%), Positives = 69/143 (48%), Gaps = 16/143 (11%)
Query: 117 ELDKKKGEVEKIERRKEKAENILREK-KKEQGALNRE-LAKVDQEIREMDVEINKKRPSL 174
E+D K E+++++RR + E I RE KKE+ ++E L +++E+ E++ E
Sbjct: 399 EIDSKPEELDELDRRIIQLE-IEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQW 457
Query: 175 IKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGR 234
K + IQ+ K+ + +VR E ++ DL + AE + +P
Sbjct: 458 KAEKAAIQGIQQ----IKEEIEQVRLELEQAERE-GDLA--------KAAELQYGKLPEL 504
Query: 235 DINLESAQDVEINKKRPSLIKSK 257
+ L++A+ + +P L++ +
Sbjct: 505 EKRLQAAEAKLGEETKPRLLREE 527
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 31.1 bits (71), Expect = 2.2
Identities = 22/91 (24%), Positives = 51/91 (56%), Gaps = 3/91 (3%)
Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
+ K + +E+ +K +++L+ I + +A ++ KK +EE + +++L+KEL++
Sbjct: 63 ALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREE-TEERTELLEELKQLEKELKK 121
Query: 430 VIEELGDAKTDKHEDTRRKKKQELVENFKKA 460
+ EL + K++ R +K +E + K+A
Sbjct: 122 LKAELEKYE--KNDPERIEKLKEETKVAKEA 150
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 31.5 bits (72), Expect = 2.2
Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 3/57 (5%)
Query: 628 LKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLE 684
L KEKL + L+E M K +K EL ++ LE ++ ++IA+ E
Sbjct: 206 LDRAKEKLKKLLQEIMIKVKKLEELEE---ELQELESKIEDLTNKKSELNTEIAEAE 259
Score = 31.1 bits (71), Expect = 2.9
Identities = 27/124 (21%), Positives = 60/124 (48%), Gaps = 10/124 (8%)
Query: 310 IPGRDINLESAQMTEYTNLKAEATK---RAGKILQQLDTINREQKGDQDKLDNELRQQVQ 366
G D NLE + +Y L E K+ + D + E +L++EL
Sbjct: 146 KEGLDENLEGLK-EDYKLLMKELELLNSIKPKLRDRKDALEEEL-RQLKQLEDELEDCDP 203
Query: 367 T-----QNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQ 421
T + ++KK E+ K++++LE+ +++ E+ ++D K ELN+++ ++ +++
Sbjct: 204 TELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLE 263
Query: 422 ELQK 425
+ +
Sbjct: 264 QCRG 267
Score = 30.8 bits (70), Expect = 3.6
Identities = 26/136 (19%), Positives = 63/136 (46%), Gaps = 7/136 (5%)
Query: 315 INLESAQMTEYTNLKAEAT------KRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQ 368
++ + + + L+A+ K + + LD K D L EL +
Sbjct: 115 MDKQFQLVKTFARLEAKKMWYEWRMKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIK 174
Query: 369 NEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLK-EELNSDVGSSKNRVQELQKEL 427
+++ ++ +EE +++ +LED + + + D K K ++L ++ +++EL++EL
Sbjct: 175 PKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEEL 234
Query: 428 EQVIEELGDAKTDKHE 443
+++ ++ D K E
Sbjct: 235 QELESKIEDLTNKKSE 250
Score = 30.4 bits (69), Expect = 5.1
Identities = 16/107 (14%), Positives = 52/107 (48%), Gaps = 9/107 (8%)
Query: 110 DIKELEDELDKKKGEVEKIERRKE--------KAENILREKKKEQGALNRELAKVDQEIR 161
+++ +D L+++ +++++E E +A+ L++ +E ++L ++++E++
Sbjct: 176 KLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQ 235
Query: 162 EMDVEINKKRPSLIKSKERVSHIQKKLASAKK-SLVEVRQANEAHNK 207
E++ +I + ++ +KKL + + E+ + E
Sbjct: 236 ELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFKEIEKLKEQLKL 282
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 30.3 bits (69), Expect = 2.3
Identities = 26/105 (24%), Positives = 52/105 (49%), Gaps = 7/105 (6%)
Query: 204 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHI 263
++D+ E+ +RK +AE ER I Q +E ++ +L+++KER +
Sbjct: 44 QRDRDLEFRESLEETLRKLEAEIERLQN---TIERLKTQ-LEDLERELALLQAKER--QL 97
Query: 264 QKKLASAKKSLVEVRQANEAHNKDIADLETQLA-DVRKRKAEYER 307
+KKL + ++ L ++ + I +TQ +++KR E E+
Sbjct: 98 EKKLKTLEQKLKNEKEEVQRLKNIIQQRKTQYNHELKKRDREIEK 142
Score = 30.3 bits (69), Expect = 2.3
Identities = 27/110 (24%), Positives = 54/110 (49%), Gaps = 9/110 (8%)
Query: 283 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKA---EATKRAGKI 339
++D+ E+ +RK +AE ER I Q+ + A ++ K
Sbjct: 44 QRDRDLEFRESLEETLRKLEAEIERLQN---TIERLKTQLEDLERELALLQAKERQLEKK 100
Query: 340 LQQLDTINREQKGDQDKLDNELRQQVQTQ--NEIKKKRHEMEEAQKRIDK 387
L+ L+ + +K + +L N + QQ +TQ +E+KK+ E+E+ ++R+ +
Sbjct: 101 LKTLEQKLKNEKEEVQRLKNII-QQRKTQYNHELKKRDREIEKLKERLLQ 149
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 31.7 bits (72), Expect = 2.3
Identities = 10/59 (16%), Positives = 22/59 (37%), Gaps = 7/59 (11%)
Query: 649 ESELNTVQSTIKGLEIRLNYSRQDLQ-------NTKSQIAKLEAEIDALNARADATEPK 700
+ +Q T ++ + R++L + + +I KL + AL + A
Sbjct: 68 QHATTEMQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGAN 126
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 31.9 bits (72), Expect = 2.4
Identities = 76/399 (19%), Positives = 159/399 (39%), Gaps = 44/399 (11%)
Query: 55 EEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIRE--EIVAKEVEHQLFKLYHNETDIK 112
E E++M+ + K VV+E + + E++ + + + + E ++ + +L+ NE D+
Sbjct: 3859 ELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWD-EPNEEDLL 3917
Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRP 172
E E + +++ + + ++ +N E K Q K D+E DV I+ +
Sbjct: 3918 ETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQ-------EKEDEEEMSDDVGIDDE-- 3968
Query: 173 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAH-----NKDIADLETQLADVRKRKAEYE 227
I+ + ++ Q + L E + +E + D+ D++ + AD K +A+ E
Sbjct: 3969 --IQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAE 4026
Query: 228 RQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 287
+ P +D E +E N I+ + + + E Q NE +D
Sbjct: 4027 KDE-PMQD---EDP--LEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTED 4080
Query: 288 IADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRA---GKILQQLD 344
+ +L ++ E Q+I + T AEA + G + + +
Sbjct: 4081 GVKSDEEL----EQGEVPEDQAIDN-----HPKMDAKSTFASAEADEENTDKGIVGENEE 4131
Query: 345 TINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIR--QNEASLKDN 402
+ D E Q Q + + M EA ++ L DH+R Q + +
Sbjct: 4132 LGEEDGVRGNGTADGEFEQ---VQEDTSTPKEAMSEADRQYQSLGDHLREWQQANRIHEW 4188
Query: 403 KKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDK 441
+ L E + ++++ E+ ++ LG+A+ D+
Sbjct: 4189 EDLTES--QSQAFDDSEFMHVKEDEEEDLQALGNAEKDQ 4225
>gnl|CDD|224034 COG1109, {ManB}, Phosphomannomutase [Carbohydrate transport and
metabolism].
Length = 464
Score = 31.5 bits (72), Expect = 2.6
Identities = 17/113 (15%), Positives = 35/113 (30%), Gaps = 24/113 (21%)
Query: 623 KEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAK 682
K +++ E E ++K +E TI G+++ L
Sbjct: 374 KYPQSVEINVRVTDEGKAEVLEKLLEELREAKKVDTIDGVKVEL---------------- 417
Query: 683 LEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCK 735
+ + R TEP I+ + +E + ++V R+
Sbjct: 418 --EDGGRVLVRPSGTEPLIR-----VYVEAKDEELAEELAEEIAELV-REALG 462
>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
Length = 1213
Score = 31.8 bits (72), Expect = 2.8
Identities = 36/228 (15%), Positives = 79/228 (34%), Gaps = 23/228 (10%)
Query: 77 IEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAE 136
+ + ++ I ++ E K +D L KG+ E + EK +
Sbjct: 16 NDTIAVKNINVLKSSIKDSTQFWKM-----LEKQQKLTKDGLSASKGKYEGLSEAMEKQK 70
Query: 137 NILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLV 196
+ K+E +NR Q ++ E N + + + KL S
Sbjct: 71 KAYEDLKQEVKEVNRAT----QASKKAYQEYNA----------QYTQAENKLRSLSGQFG 116
Query: 197 EVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKS 256
+ K+I L+ ++ + K A + + + N D ++ R S +
Sbjct: 117 VASEQLMLQQKEIQRLQYAISTLNKSMA--AQARLLEQTGNKFGTADAKVVGLRESFGRQ 174
Query: 257 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
E ++ ++ + K + +EA K L +L +++ + +
Sbjct: 175 TEALN--KQLERTKKVADALTYVLDEAQQKLSQALSARLERLQESRTQ 220
>gnl|CDD|224046 COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component
[Inorganic ion transport and metabolism].
Length = 254
Score = 31.0 bits (71), Expect = 2.9
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 9/40 (22%)
Query: 7 YIEVDNF-KSYKGK-------FSIGPLKKFTAVIGPNGSG 38
IEV+N SY + S+ + TA+IGPNG+G
Sbjct: 4 MIEVENLTVSYGNRPVLEDISLSVEK-GEITALIGPNGAG 42
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 31.1 bits (71), Expect = 3.0
Identities = 17/71 (23%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
+ + + EL+++L+K + ++ K+E + EK N ++ K + L ++LA +++ I E
Sbjct: 238 LNFKEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAE--LEQQLASLEKRIDEAK 295
Query: 165 VEINKKRPSLI 175
I K +
Sbjct: 296 ELIAKYGNEVP 306
>gnl|CDD|217963 pfam04203, Sortase, Sortase family. The founder member of this
family is S.aureus sortase, a transpeptidase that
attaches surface proteins by the threonine of an LPXTG
motif to the cell wall.
Length = 124
Score = 29.9 bits (68), Expect = 3.1
Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 3/43 (6%)
Query: 990 TTIVSHRYHPAPFFVLDEI---DAALDNTNIGKVASYIVTKTQ 1029
T I HR F LD++ D T GK +Y VT +
Sbjct: 41 TVIAGHRGGGTMFSDLDKLKKGDKIYVTTLDGKTYTYRVTSVE 83
>gnl|CDD|234806 PRK00635, PRK00635, excinuclease ABC subunit A; Provisional.
Length = 1809
Score = 31.3 bits (71), Expect = 3.4
Identities = 21/77 (27%), Positives = 31/77 (40%), Gaps = 3/77 (3%)
Query: 984 LTYRVSTTIVSHRYHPA---PFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLK 1040
L+ +++ I Y P F+LDEI +LDN + + T I I
Sbjct: 1702 LSEKIAIKIAKFLYLPPKHPTLFLLDEIATSLDNQQKSALLVQLRTLVSLGHSVIYIDHD 1761
Query: 1041 EEFFSHADSLVGICPGS 1057
AD L+ + PGS
Sbjct: 1762 PALLKQADYLIEMGPGS 1778
>gnl|CDD|224300 COG1382, GimC, Prefoldin, chaperonin cofactor [Posttranslational
modification, protein turnover, chaperones].
Length = 119
Score = 29.6 bits (67), Expect = 3.5
Identities = 29/112 (25%), Positives = 55/112 (49%), Gaps = 8/112 (7%)
Query: 45 ERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKL 104
+ A++ Q +++ L+K+ + A+ KE + +E EK + V K+V + L K
Sbjct: 9 QAQLAQLQQLQQQLQKVILQKQQLEAQLKEIEKALEELEKLDE--DAPVYKKVGNLLVK- 65
Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKE-QGALNRELAK 155
+E DEL+++K +E + EK E L+E+ +E Q + + L
Sbjct: 66 ----VSKEEAVDELEERKETLELRIKTLEKQEEKLQERLEELQSEIQKALGD 113
>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component. Nup88 can be divided
into two structural domains; the N-terminal two-thirds
of the protein has no obvious structural motifs but is
the region for binding to Nup98, one of the components
of the nuclear pore. the C-terminal end is a predicted
coiled-coil domain. Nup88 is overexpressed in tumour
cells.
Length = 717
Score = 31.4 bits (71), Expect = 3.5
Identities = 33/192 (17%), Positives = 73/192 (38%), Gaps = 29/192 (15%)
Query: 53 QAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIK 112
+A + YL K + E + +++ + +K +++ + + +
Sbjct: 543 RATQVFREQYLLKHDLAREEFQRRVKLLQLQKEKQLED----------IQDCREERKSLS 592
Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE---INK 169
E ++L EK E K E ++ K+ + N +L + R+M E INK
Sbjct: 593 ERAEKL------AEKFEEAKYNQELLVNRCKRLLQSANSQLPVLSDSERDMSKELQRINK 646
Query: 170 KRPSLI----KSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 225
+ L + K++ ++ + +AS + E +K I ++ +L +
Sbjct: 647 QLQHLANGIKQVKKKKNYQRYHMASQESPKKSSYTLPEKQHKTITEILKELGE------H 700
Query: 226 YERQSIPGRDIN 237
+R + IN
Sbjct: 701 IDRMIKQIKRIN 712
>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
Provisional.
Length = 638
Score = 31.3 bits (71), Expect = 3.5
Identities = 33/129 (25%), Positives = 61/129 (47%), Gaps = 17/129 (13%)
Query: 181 VSHIQKKLASAKKSLVEVRQAN-EAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
VSH + L S L V E + D+ D + L+DV+K++ + ++ N
Sbjct: 483 VSHDRHLLRSTTDDLYLVHDGKVEPFDGDLEDYQQWLSDVQKQEN---QTDEAPKENNAN 539
Query: 240 SAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV- 298
SAQ + K+R + ++++ + ++K++A +K + E N +A E +L D
Sbjct: 540 SAQARKDQKRREAELRTQTQ--PLRKEIARLEKEM-------EKLNAQLAQAEEKLGDSE 590
Query: 299 ---RKRKAE 304
+ RKAE
Sbjct: 591 LYDQSRKAE 599
Score = 30.5 bits (69), Expect = 6.3
Identities = 45/189 (23%), Positives = 74/189 (39%), Gaps = 52/189 (27%)
Query: 260 VSHIQKKLASAKKSLVEVRQAN-EAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 318
VSH + L S L V E + D+ D + L+DV+K++ + ++ N
Sbjct: 483 VSHDRHLLRSTTDDLYLVHDGKVEPFDGDLEDYQQWLSDVQKQEN---QTDEAPKENNAN 539
Query: 319 SAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEM 378
SAQ + DQ + + ELR Q Q R E+
Sbjct: 540 SAQ----------------------------ARKDQKRREAELRTQTQPL------RKEI 565
Query: 379 EEAQKRIDKLEDHIRQNEASLKDN----KKLKEELN---SDVGSSKNRVQE-------LQ 424
+K ++KL + Q E L D+ + K EL S+K+ ++E Q
Sbjct: 566 ARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQ 625
Query: 425 KELEQVIEE 433
++LEQ++ E
Sbjct: 626 EQLEQMLLE 634
>gnl|CDD|224031 COG1106, COG1106, Predicted ATPases [General function prediction
only].
Length = 371
Score = 31.0 bits (70), Expect = 3.6
Identities = 9/34 (26%), Positives = 19/34 (55%), Gaps = 1/34 (2%)
Query: 5 LQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
++ ++ NFKS++ + + K + G NG+G
Sbjct: 2 IKSFKIKNFKSFR-ELELEDFGKINIIYGANGAG 34
>gnl|CDD|222036 pfam13304, AAA_21, AAA domain.
Length = 256
Score = 30.5 bits (68), Expect = 3.7
Identities = 28/185 (15%), Positives = 56/185 (30%), Gaps = 12/185 (6%)
Query: 28 FTAVIGPNGSG------------ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEA 75
+IGPNGSG L L + + + + + E
Sbjct: 1 LNVIIGPNGSGKSNLLKALALLLLLLSLGLTLDRGLNVGIKLLPFLLDENEIEIPLEFEI 60
Query: 76 KIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKA 135
+ + +Y+ E +E L++ E + E E + + EK +
Sbjct: 61 EEFLIDGIRYRYGFELDKEDILEELLYEYRKGEELLFERERSKESFEKSPEKKRELRGLR 120
Query: 136 ENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSL 195
E +L L ++ I I +L + ++ + K+L L
Sbjct: 121 EVLLLLNLSLSSFLLLASLEILLSILLPFSFILGNLRNLRNIELKLLDLVKRLFLESDLL 180
Query: 196 VEVRQ 200
++
Sbjct: 181 RLLKL 185
>gnl|CDD|213205 cd03238, ABC_UvrA, ATP-binding cassette domain of the excision repair
protein UvrA. Nucleotide excision repair in eubacteria
is a process that repairs DNA damage by the removal of a
12-13-mer oligonucleotide containing the lesion.
Recognition and cleavage of the damaged DNA is a
multistep ATP-dependent reaction that requires the UvrA,
UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases,
with UvrA having two ATP binding sites, which have the
characteristic signature of the family of ABC proteins,
and UvrB having one ATP binding site that is structurally
related to that of helicases.
Length = 176
Score = 30.0 bits (68), Expect = 4.0
Identities = 15/55 (27%), Positives = 23/55 (41%)
Query: 1003 FVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGICPGS 1057
F+LDE L +I ++ I I+I + S AD ++ PGS
Sbjct: 111 FILDEPSTGLHQQDINQLLEVIKGLIDLGNTVILIEHNLDVLSSADWIIDFGPGS 165
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 31.0 bits (70), Expect = 4.0
Identities = 25/124 (20%), Positives = 53/124 (42%), Gaps = 13/124 (10%)
Query: 346 INREQKGDQD-KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKK 404
I + +L+++++ + + KK ++++ K+ +K +D + LK KK
Sbjct: 23 IKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKK 82
Query: 405 LKEELNSD-----VGSSKNRVQELQKELEQVIE-------ELGDAKTDKHEDTRRKKKQE 452
+K++L+ D SKN ++ L + E K ++ T +KKK+
Sbjct: 83 IKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142
Query: 453 LVEN 456
L
Sbjct: 143 LSSK 146
Score = 30.2 bits (68), Expect = 6.8
Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 3/93 (3%)
Query: 104 LYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNREL--AKVDQEIR 161
L + + + K ++LDKK + K + K+ + L++KKK + L+ +
Sbjct: 43 LLNLDINKKL-HEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKN 101
Query: 162 EMDVEINKKRPSLIKSKERVSHIQKKLASAKKS 194
SL++ + + KK + KS
Sbjct: 102 NSKQFAGPLAISLMRKPKPKTEKLKKKITVNKS 134
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 30.8 bits (70), Expect = 4.1
Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 11/95 (11%)
Query: 317 LESAQMTEYTNLKAEATKRAG--------KILQQLDTINREQKGDQDKLDNELRQQVQTQ 368
L + Q E + EA K+ KI ++ D + +E K D+ K+ + +Q
Sbjct: 187 LPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKI-KSAKNDLQND 245
Query: 369 -NEIKKKRHEMEEA-QKRIDKLEDHIRQNEASLKD 401
++K E+++A Q + +LE+ LK+
Sbjct: 246 KKQLKADLAELKKAPQNDLKRLENKYAIKSGDLKN 280
Score = 30.8 bits (70), Expect = 5.0
Identities = 24/113 (21%), Positives = 47/113 (41%), Gaps = 8/113 (7%)
Query: 111 IKELEDELDKKKGEVEKIERRKEKAENILREKKK--EQGALNRELAKVDQEIREMDVEIN 168
L+ E K E+IE+ + + +++KK E K + I++ D+
Sbjct: 152 RALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDI--- 208
Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
K L K KE ++K+ + K+ + + D L+ LA+++K
Sbjct: 209 KNPLELQKIKEEFDKLKKEGKADKQKIKSAKND---LQNDKKQLKADLAELKK 258
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 31.1 bits (70), Expect = 4.1
Identities = 51/307 (16%), Positives = 113/307 (36%), Gaps = 18/307 (5%)
Query: 128 IERRKEKAENILREKKKEQGALNRELAKVD-QEIREMDVEINKKRPSLIKSKERVSHIQK 186
+ + IL + + E +L R D E M + ++ ++I+ + +
Sbjct: 448 EKHIISEISEILEDIEAELDSLERSSNGDDENEEVAMVGSLVEESSAVIERSQELEGAVS 507
Query: 187 KLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEI 246
+++ L + ADLE + + Q +D+++E++ EI
Sbjct: 508 RISEFTSVLEHEVTVCQDLLDGKADLEKFIQEFSLTLEWVVNQEKSLQDVSVEAS---EI 564
Query: 247 NKKRPSL-IKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEY 305
K L KE S + K A +L E+ +L D+ + K +
Sbjct: 565 KKNFLGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKENLDGELSKCKDDLEESKNKL 624
Query: 306 ERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQ-------DKLD 358
+ ++ E E +L K + + L+ ++ + + L+
Sbjct: 625 QETEKKLEELKSELDASQESNSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLE 684
Query: 359 NELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKN 418
+EL ++ Q E+ K E+EE K+E ++ D ++ K + ++ ++
Sbjct: 685 DELEKERQNHQELIAKCRELEE------KIERAEQEENMQKLDEEEQKIKQEKEIAAAAE 738
Query: 419 RVQELQK 425
++ E Q+
Sbjct: 739 KLAECQE 745
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 30.7 bits (70), Expect = 4.3
Identities = 13/43 (30%), Positives = 23/43 (53%)
Query: 369 NEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
+ R E EE +K + +LE +++ E +N +L+E LN
Sbjct: 69 ASLFDLREENEELKKELLELESRLQELEQLEAENARLRELLNL 111
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 30.2 bits (69), Expect = 4.3
Identities = 18/73 (24%), Positives = 49/73 (67%)
Query: 361 LRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRV 420
+ + + + E+K++R+E++ +KR+ + E+ + + + SL+ ++ EE ++ + + ++
Sbjct: 63 HKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQL 122
Query: 421 QELQKELEQVIEE 433
+E ++ELE++IEE
Sbjct: 123 EEKEEELEELIEE 135
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 30.5 bits (69), Expect = 4.4
Identities = 30/221 (13%), Positives = 83/221 (37%), Gaps = 28/221 (12%)
Query: 240 SAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA--- 296
QD ++++ + + + + ++ + + E+++ + +I L+ ++A
Sbjct: 34 QNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELK 93
Query: 297 -DVRKRKAEYERQSIPGRDINLESAQM----TEYTNLKAEATKRAGKILQQLDTINREQK 351
++ +R+ +++ + Q+ T Y ++ + K ++ ++ I+
Sbjct: 94 ENIVERQELLKKR--------ARAMQVNGTATSYIDVILNS-KSFSDLISRVTAISVIVD 144
Query: 352 GDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
D+ L QQ + + +++K+ +E+ + L + E L K E N+
Sbjct: 145 ADK----KILEQQKEDKKSLEEKQAALEDKLE---TLVALQNELETQLNSLNSQKAEKNA 197
Query: 412 DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQE 452
+ + + E L + K KQE
Sbjct: 198 LIAALAAKEASALGE----KAALEEQKALAEAAAAEAAKQE 234
>gnl|CDD|233070 TIGR00644, recJ, single-stranded-DNA-specific exonuclease RecJ.
All proteins in this family are 5'-3' single-strand DNA
exonucleases. These proteins are used in some aspects of
mismatch repair, recombination, and recombinational
repair [DNA metabolism, DNA replication, recombination,
and repair].
Length = 539
Score = 30.7 bits (70), Expect = 4.4
Identities = 9/47 (19%), Positives = 22/47 (46%)
Query: 328 LKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKK 374
L + K+A ++ ++LD +N E+K + ++ E + +
Sbjct: 279 LLTDDPKQARQLAEELDELNEERKKIEQEVLAEAEKIAEANLLPNDS 325
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 29.1 bits (66), Expect = 4.5
Identities = 20/101 (19%), Positives = 49/101 (48%), Gaps = 3/101 (2%)
Query: 129 ERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKL 188
R E+ + L K++E L + ++E+ + + E+ + SLIK + + + K
Sbjct: 6 RREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQE---SLIKFDKFLKENEAKR 62
Query: 189 ASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
A+K E ++ + ++I +L+ +L +++ + E +
Sbjct: 63 RRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEK 103
>gnl|CDD|234240 TIGR03514, GldB_lipo, gliding motility-associated lipoprotein GldB.
RN RM PMID:10648514 RT Cloning and characterization of
the Flavobacterium johnsoniae gliding-motility genes
gldB and gldC. RA Hunnicutt DW, McBride MJ RL J
Bacteriol. 2000 Feb;182(4):911-8.
Length = 319
Score = 30.4 bits (69), Expect = 4.6
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 103 KLYHNETDIKELEDELDKKKGEVEKIERRKEKA 135
KL + + + EL E++ K + EKIE+ A
Sbjct: 73 KLVYKDPALNELYTEVEAKFPDFEKIEKDLTAA 105
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 30.8 bits (69), Expect = 4.7
Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 7/66 (10%)
Query: 623 KEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAK 682
K++ ++ A+ EKL + M+K + S +K E RL + D +T S+I +
Sbjct: 371 KKVADMTAEIEKLKKRHARRMEKFNR-------TSALKDAEQRLRDAAADPADTGSEIWR 423
Query: 683 LEAEID 688
LE ++
Sbjct: 424 LEGRLE 429
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 30.8 bits (70), Expect = 4.7
Identities = 30/151 (19%), Positives = 57/151 (37%), Gaps = 1/151 (0%)
Query: 333 TKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHI 392
K A +ILQ+ E+ EL + + Q ++ R E EE Q+ ++L
Sbjct: 32 AKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQR-QEARREREELQREEERLVQKE 90
Query: 393 RQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQE 452
Q +A + L+ +L + R EL++ +Q+ EL E R+ +
Sbjct: 91 EQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKL 150
Query: 453 LVENFKKAYSGVYDRMINMCHPVHKRYNVAI 483
L ++ + ++ +R I
Sbjct: 151 LDAELEEEKAQRVKKIEEEADLEAERKAQNI 181
Score = 30.4 bits (69), Expect = 5.6
Identities = 21/106 (19%), Positives = 47/106 (44%), Gaps = 4/106 (3%)
Query: 616 KAKRWDDKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQN 675
K ++ KE + + +K +EE EA KE + R++LQ
Sbjct: 26 KKRQRLAKEAERILQEAQKEAEEKLEAALLEAKEL----LLRERNQQRQEARREREELQR 81
Query: 676 TKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEE 721
+ ++ + E ++DA + D E +++ E +++AR + +++
Sbjct: 82 EEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQ 127
Score = 30.4 bits (69), Expect = 6.1
Identities = 19/136 (13%), Positives = 49/136 (36%)
Query: 296 ADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQD 355
A+ + A E + + R+ N + + +R + +QLD + ++
Sbjct: 46 AEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLEN 105
Query: 356 KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGS 415
+L+ + + E+++ +++ R+ L + + +L+EE V
Sbjct: 106 QLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKK 165
Query: 416 SKNRVQELQKELEQVI 431
+ + Q I
Sbjct: 166 IEEEADLEAERKAQNI 181
Score = 29.7 bits (67), Expect = 9.3
Identities = 23/144 (15%), Positives = 61/144 (42%), Gaps = 6/144 (4%)
Query: 729 VFRDFCKSIGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQG 788
+ + + I Q + E + E + + + + + + EEL R +
Sbjct: 25 LKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEE 84
Query: 789 AEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDI------QAA 842
+ ++ A A+KL+N+ +++A+ A + E+ + +++ + + QA
Sbjct: 85 RLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQAR 144
Query: 843 QKSCVNLESKLEMKKSERHDILMN 866
+ L+++LE +K++R +
Sbjct: 145 KLLLKLLDAELEEEKAQRVKKIEE 168
>gnl|CDD|235239 PRK04172, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Provisional.
Length = 489
Score = 30.6 bits (70), Expect = 4.8
Identities = 26/125 (20%), Positives = 46/125 (36%), Gaps = 19/125 (15%)
Query: 60 MSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELD 119
+ L +KG K I K +A + E+ + E ++ +
Sbjct: 108 LGNLARKGWAKIEKGKVILKPDAYE--DPEEKALKALAE----GDKEELSEEDLKVLKEL 161
Query: 120 KKKGEVEKIERR------KEKAENILR---EKKKEQGALNRELAKV----DQEIREMDVE 166
KK+ VE+ ER + +L+ E K+E L EL K ++E R +V+
Sbjct: 162 KKRKLVEEKERTERSVELTDAGLELLKEGIELKEEITQLTPELLKSGEWKEKEFRPYNVK 221
Query: 167 INKKR 171
+
Sbjct: 222 APPPK 226
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 30.4 bits (70), Expect = 4.9
Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 7/60 (11%)
Query: 380 EAQKRIDKLEDHIRQNEAS-----LKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
E Q +++L+ +N S K + E L ++V K ++ L+ EL+++ EL
Sbjct: 39 ELQTELEELQ--AERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAEL 96
Score = 30.4 bits (70), Expect = 5.1
Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 20/87 (22%)
Query: 610 SLDLARKAKRWDDKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYS 669
LD R+ + E+ L+A++ LS+E+ +A +K L
Sbjct: 32 ELDEERRELQ---TELEELQAERNALSKEIGQAKRKGEDAEAL----------------- 71
Query: 670 RQDLQNTKSQIAKLEAEIDALNARADA 696
+++ K +I LEAE+D L A +
Sbjct: 72 IAEVKELKEEIKALEAELDELEAELEE 98
>gnl|CDD|238252 cd00446, GrpE, GrpE is the adenine nucleotide exchange factor of
DnaK (Hsp70)-type ATPases. The GrpE dimer binds to the
ATPase domain of Hsp70 catalyzing the dissociation of
ADP, which enables rebinding of ATP, one step in the
Hsp70 reaction cycle in protein folding. In eukaryotes,
only the mitochondrial Hsp70, not the cytosolic form, is
GrpE dependent.
Length = 137
Score = 29.4 bits (67), Expect = 5.0
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 911 LEHAKENLMKTNEEFENARKRAKKAKAN 938
LE K+ L++ EFEN RKR ++ +
Sbjct: 1 LEELKDKLLRALAEFENYRKRTEREREE 28
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 30.5 bits (69), Expect = 5.2
Identities = 76/375 (20%), Positives = 151/375 (40%), Gaps = 59/375 (15%)
Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGAL---NRELAKVDQEIREMDVEINK 169
L+ +L++ + E ++E ++ E +KE L N EL + E + + EI+
Sbjct: 243 LLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQALKDEIDV 302
Query: 170 KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
R +S ++ ++ ++ + KK L +D+ DL Q+ + +R A Y +
Sbjct: 303 LR----ESSDKAKKLEAQVETYKKKL-----------EDLNDLRRQVKLLEERNAMYMQN 347
Query: 230 SIPGRDINLESAQDV-EINKKRPSLIKSKERVSHIQKKLAS----AKKSLVEVRQANEAH 284
++ +++ + N R L K +V + KL+ A K E ++ E
Sbjct: 348 -------TVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKL 400
Query: 285 NKDIADLETQLA--DVRKRKAEYER--QSIPGRDINLESAQMTEYTNLKAE----ATKRA 336
+ E LA D + E R Q+ + ++ NL AE +
Sbjct: 401 EALQKEKERLLAERDSLRETNEELRCGQAQQDQLTQADAGTSPSGDNLAAELLPSEYREK 460
Query: 337 GKILQQLDTINREQKGDQDK---------LD------NELRQQVQTQNE-IKKKRHEMEE 380
LQ + + R + + L+ NEL Q++ N+ I + + ++E+
Sbjct: 461 LIRLQHENKMLRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVED 520
Query: 381 AQKRIDKLEDHIRQNEASLKDNKKLKEELN--SDVGSSKNRVQELQKELEQVIEELGDAK 438
QK L++ ++E S KL+E L + + +E +ELE ++ K
Sbjct: 521 LQK---ALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRK 577
Query: 439 TDKHEDTRRKKKQEL 453
+ E +KK +++
Sbjct: 578 IAELEAALQKKDEDM 592
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 29.6 bits (67), Expect = 5.2
Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 1/87 (1%)
Query: 357 LDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKD-NKKLKEELNSDVGS 415
L EL+Q + EI+++ E+E+ + + +LE + + E ++ +K L L
Sbjct: 36 LAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLHPVLRLLESE 95
Query: 416 SKNRVQELQKELEQVIEELGDAKTDKH 442
+ LQ L ++ E D
Sbjct: 96 VLEENELLQDSLLELSERNFSPNLDPE 122
Score = 29.2 bits (66), Expect = 6.7
Identities = 23/112 (20%), Positives = 48/112 (42%), Gaps = 5/112 (4%)
Query: 89 REEIVAKEVEHQLFKLYHN--ETDIKELEDELDKKKGEVEKI---ERRKEKAENILREKK 143
EEI +E E + Y E + K LE E +++ + + + EN L +
Sbjct: 47 EEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLHPVLRLLESEVLEENELLQDS 106
Query: 144 KEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSL 195
+ + +D E+ ++ ++NK SL + E+++ + + L +L
Sbjct: 107 LLELSERNFSPNLDPELLDLLKQLNKHLESLQGNLEQLAGLVEALERLYAAL 158
>gnl|CDD|214699 smart00503, SynN, Syntaxin N-terminal domain. Three-helix domain
that (in Sso1p) slows the rate of its reaction with the
SNAP-25 homologue Sec9p.
Length = 117
Score = 28.9 bits (65), Expect = 5.4
Identities = 13/77 (16%), Positives = 37/77 (48%)
Query: 375 RHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
R +++ + + +L+ + +K+L+E+L + K +E++ +L+++ +E
Sbjct: 14 RANIQKISQNVAELQKLHEELLTPPDADKELREKLERLIDDIKRLAKEIRAKLKELEKEN 73
Query: 435 GDAKTDKHEDTRRKKKQ 451
+ + R +K Q
Sbjct: 74 LENRASGSASDRTRKAQ 90
>gnl|CDD|213244 cd03277, ABC_SMC5_euk, ATP-binding cassette domain of eukaryotic SMC5
proteins. The structural maintenance of chromosomes
(SMC) proteins are large (approximately 110 to 170 kDa),
and each is arranged into five recognizable domains.
Amino-acid sequence homology of SMC proteins between
species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code), which
by mutational studies has been shown to be essential in
several proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B' motif,
and a motif with homology to the signature sequence of
the ATP-binding cassette (ABC) family of ATPases. The
sequence homology within the carboxy-terminal domain is
relatively high within the SMC1-SMC4 group, whereas SMC5
and SMC6 show some divergence in both of these sequences.
In eukaryotic cells, the proteins are found as
heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
and SMC5 with SMC6 (formerly known as Rad18).
Length = 213
Score = 29.9 bits (68), Expect = 5.4
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 999 PAPFFVLDEIDAALDNTNIGKVASYIV 1025
PF V+DEI+ +D TN KV +V
Sbjct: 148 RCPFRVVDEINQGMDPTNERKVFDMLV 174
Score = 29.1 bits (66), Expect = 8.9
Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 1/31 (3%)
Query: 8 IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
I+++NF +Y + P +IGPNGSG
Sbjct: 6 IKLENFVTY-DETEFRPGPSLNMIIGPNGSG 35
>gnl|CDD|203795 pfam07889, DUF1664, Protein of unknown function (DUF1664). The
members of this family are hypothetical plant proteins
of unknown function. The region featured in this family
is approximately 100 amino acids long.
Length = 125
Score = 28.9 bits (65), Expect = 5.9
Identities = 18/90 (20%), Positives = 49/90 (54%), Gaps = 6/90 (6%)
Query: 765 TKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEE 824
TK+N++ AV+ ++L + E +A + + +++N+ +K+ ++ +E
Sbjct: 36 TKRNMSD---AVASVGKQL---EKVSESIAAAKKHLSQRIDNLDDKLDEQKEISESTRDE 89
Query: 825 IGKARREVGSIAKDIQAAQKSCVNLESKLE 854
+ + R ++ +I +D+++ ++ LE KL+
Sbjct: 90 VTEIREDLSNIGEDVKSVHQAVEGLEGKLD 119
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome partitioning].
Length = 1480
Score = 30.6 bits (69), Expect = 5.9
Identities = 43/260 (16%), Positives = 107/260 (41%), Gaps = 43/260 (16%)
Query: 210 ADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKS------------- 256
AD E ++ + R+ E ER + N + + K+ + +
Sbjct: 832 ADPEAEIRQLNSRRNELERALSNHENDNQQQRIQFDQAKEGVTALNRLIPQLNLLADESL 891
Query: 257 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 316
+RV I+++L A+++ ++Q H ++ LE + ++ ++E+
Sbjct: 892 ADRVEEIRERLDEAQEAARFIQQ----HGNTLSKLEPIASVLQSDPEQFEQLK------- 940
Query: 317 LESAQMTEYTNLKAEATKRA---GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKK 373
+ ++ +A ++A +++Q+ + + +++L +++
Sbjct: 941 ---EDYAQAQQMQRQARQQAFALTEVVQRRAHFSYSDSAEMLSENSDLNEKL-------- 989
Query: 374 KRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
R +E+A+ + + +RQ++A L ++ L S + K + ELQ+EL ++
Sbjct: 990 -RQRLEQAEAERTRAREQLRQHQAQLSQYNQVLASLKSSYDTKKELLNELQQEL----QD 1044
Query: 434 LGDAKTDKHEDTRRKKKQEL 453
+G E+ R ++ EL
Sbjct: 1045 IGVRADSGAEERARIRRDEL 1064
>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase. The valyl-tRNA synthetase
(ValS) is a class I amino acyl-tRNA ligase and is
particularly closely related to the isoleucyl tRNA
synthetase [Protein synthesis, tRNA aminoacylation].
Length = 861
Score = 30.4 bits (69), Expect = 6.0
Identities = 28/144 (19%), Positives = 52/144 (36%), Gaps = 25/144 (17%)
Query: 8 IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSGALKDDYERLKAEMIQAEEETNMSYLKKKG 67
+ + N K+ PLK + N +K
Sbjct: 734 VSIRNLKAESNIPPNAPLKV----------------LLIYTEAETAERLKLNAVDIKG-- 775
Query: 68 VVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEK 127
E ++ ++ E + + E + E+ + L + ++ L+ +LDK+K EV +
Sbjct: 776 -AINFSEVEVVIEKPEVTEAVVELVPGFEIIIPVKGLINKAKELARLQKQLDKEKKEVIR 834
Query: 128 IERRKE------KAENILREKKKE 145
IE + E KA + EK+KE
Sbjct: 835 IEGKLENEGFVKKAPKEVIEKEKE 858
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 30.2 bits (68), Expect = 6.2
Identities = 42/184 (22%), Positives = 69/184 (37%), Gaps = 19/184 (10%)
Query: 129 ERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKL 188
+++ + R+KK EQ A E + ++ R+ ++E K E+ K
Sbjct: 58 QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQ----AAKQ 113
Query: 189 ASAKKSLVE---VRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVE 245
A K+ E +QA EA K A+ E + + K++AE E + A+
Sbjct: 114 AEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAK-----------AKAAA 162
Query: 246 INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANE-AHNKDIADLETQLADVRKRKAE 304
KK+ + K K K A AK E + E A K A+ + A
Sbjct: 163 EAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAA 222
Query: 305 YERQ 308
E +
Sbjct: 223 AEAE 226
>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
coiled-coil domain [Function unknown].
Length = 231
Score = 29.8 bits (67), Expect = 6.2
Identities = 24/95 (25%), Positives = 48/95 (50%), Gaps = 4/95 (4%)
Query: 362 RQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQ 421
QQ+ T+ ++++ R E E+ QK + + +R+ +A+ +D K L+ + ++K +
Sbjct: 39 WQQLATKQDVEELRKETEQRQKELADEKLEVRKQKATKEDLKLLQRFQEEEFRATKEDI- 97
Query: 422 ELQKELEQVIEELGDAKTDKHEDTRRKKKQELVEN 456
K LE +I LG ED R+ +E++
Sbjct: 98 ---KRLETIITGLGARWGILSEDAFRQGVREILRE 129
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 30.5 bits (70), Expect = 6.3
Identities = 36/175 (20%), Positives = 72/175 (41%), Gaps = 36/175 (20%)
Query: 71 ERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDI--------KELEDE----L 118
E ++ +IE++ EK + EE+ +E + + F L + L +E L
Sbjct: 114 EARKEEIEEEFQEKREEAFEEL-EEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEEFEAL 172
Query: 119 DKKKGEV--EKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIK 176
+++ E EKI+ +E+ + ILR+ ++ + +L ++D+E+
Sbjct: 173 PEEEREELEEKIDELEEELQEILRQLRELEREAREKLRELDREV---------------- 216
Query: 177 SKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD-----LETQLADVRKRKAEY 226
+ V + +L L +V +A +DI + LE A+ AE
Sbjct: 217 ALFAVGPLLDELKEKYADLPKVLAYLDAVQEDILENLDLFLEEDEAEAAASGAED 271
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 30.7 bits (70), Expect = 6.3
Identities = 18/87 (20%), Positives = 38/87 (43%), Gaps = 9/87 (10%)
Query: 366 QTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQK 425
Q E+ + ++E + + + + L + L EL + + QEL+
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAEL-------EEKQQELEA 198
Query: 426 ELEQVIEELGDAKTDKHEDTRRKKKQE 452
+LEQ+ E+ A+T + +RK+ +
Sbjct: 199 QLEQLQEK--AAETSQERKQKRKEITD 223
>gnl|CDD|217686 pfam03713, DUF305, Domain of unknown function (DUF305). Domain
found in small family of bacterial secreted proteins
with no known function. Also found in Paramecium
bursaria chlorella virus 1. This domain is short and
found in one or two copies. The domain has a conserved
HH motif that may be functionally important. This domain
belongs to the ferritin superfamily. It contains two
sequence similar repeats each of which is composed of
two alpha helices.
Length = 150
Score = 29.2 bits (66), Expect = 6.4
Identities = 11/30 (36%), Positives = 21/30 (70%), Gaps = 1/30 (3%)
Query: 816 QAVDAMDEEIGKARR-EVGSIAKDIQAAQK 844
QAV+ + + +++ EV ++A+DI AAQ+
Sbjct: 14 QAVEMAELALERSQDPEVRALAQDIIAAQQ 43
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 30.4 bits (69), Expect = 6.6
Identities = 17/87 (19%), Positives = 42/87 (48%), Gaps = 8/87 (9%)
Query: 356 KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDK-LEDHIRQNEASLKDNKKLKEELNSDVG 414
LD + R+ V+ +N E E ++K L++ ++ +E + ++ +L +
Sbjct: 500 ALDKKFRELVEARN-------EAESLIYSLEKALKEIVKVSEEEKEKIEEAITDLEEALE 552
Query: 415 SSKNRVQELQKELEQVIEELGDAKTDK 441
K ++ +EL++V ++L + K +
Sbjct: 553 GEKEEIKAKIEELQEVTQKLAEKKYQQ 579
Score = 30.0 bits (68), Expect = 9.1
Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 3/85 (3%)
Query: 111 IKELEDELDKKKGEVEKIERR--KEKAENILREKKKEQGALNRE-LAKVDQEIREMDVEI 167
+++ E K E +E R E L + KE ++ E K+++ I +++ +
Sbjct: 492 VEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEEKEKIEEAITDLEEAL 551
Query: 168 NKKRPSLIKSKERVSHIQKKLASAK 192
++ + E + + +KLA K
Sbjct: 552 EGEKEEIKAKIEELQEVTQKLAEKK 576
>gnl|CDD|214017 cd12924, iSH2_PIK3R1, Inter-Src homology 2 (iSH2) helical domain of
Class IA Phosphoinositide 3-kinase Regulatory subunit 1,
PIK3R1, also called p85alpha. PI3Ks catalyze the
transfer of the gamma-phosphoryl group from ATP to the
3-hydroxyl of the inositol ring of
D-myo-phosphatidylinositol (PtdIns) or its derivatives.
They play an important role in a variety of fundamental
cellular processes, including cell motility, the Ras
pathway, vesicle trafficking and secretion, immune cell
activation and apoptosis. They are classified according
to their substrate specificity, regulation, and domain
structure. Class IA PI3Ks are heterodimers of a p110
catalytic (C) subunit and a p85-related regulatory (R)
subunit. The R subunit down-regulates PI3K basal
activity, stabilizes the C subunit, and plays a role in
the activation downstream of tyrosine kinases. All R
subunits contain two SH2 domains that flank an
intervening helical domain (iSH2), which binds to the
N-terminal adaptor-binding domain (ABD) of the catalytic
subunit. In addition, p85alpha, also called PIK3R1,
contains N-terminal SH3 and GAP domains. p85alpha carry
functions independent of its PI3K regulatory role. It
can independently stimulate signaling pathways involved
in cytoskeletal rearrangements. Insulin-sensitive
tissues express splice variants of the PIK3R1 gene,
p50alpha and p55alpha, which may play important roles in
insulin signaling during lipid and glucose metabolism.
Mice deficient with PIK3R1 die perinatally, indicating
its importance in development.
Length = 161
Score = 29.3 bits (65), Expect = 6.7
Identities = 36/146 (24%), Positives = 66/146 (45%), Gaps = 13/146 (8%)
Query: 43 DYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLF 102
+Y+RL E + +E M K+ + A + KI +++ + +R +E + K F
Sbjct: 23 EYDRLYEEYTRTSQEIQM---KRTAIEAFNETIKIFEEQCQTQERYSKEYIEK------F 73
Query: 103 KLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIRE 162
K NE +I+ + +K K + +I + + L E K+Q A RE+ K I+
Sbjct: 74 KREGNEKEIQRIMHNYEKLKSRISEIVDSRRR----LEEDLKKQAAEYREIDKRMNSIKP 129
Query: 163 MDVEINKKRPSLIKSKERVSHIQKKL 188
+++ K R + + QKKL
Sbjct: 130 DLIQLRKTRDQYLMWLTQKGVRQKKL 155
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 30.4 bits (69), Expect = 6.7
Identities = 32/191 (16%), Positives = 81/191 (42%), Gaps = 17/191 (8%)
Query: 83 EKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREK 142
+ +R++E++V +L E ++ ++++++ + + R+ +A+N++ E
Sbjct: 255 SRLERLKEQLVENSELLTQLELDEAEEELGLIQEKIE----SLYDLLEREVEAKNVVEEN 310
Query: 143 KKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSH---IQKKLASAKKSLVEVR 199
L L K + + EI + + S ++ + +K+L + L E+
Sbjct: 311 LPI---LPDYLEKAKENNEHLKEEIERVKESYRLAETELGSVRKFEKELKELESVLDEIL 367
Query: 200 QANEAHNK-------DIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPS 252
+ EA ++ ++E L D+ + + + R LE+ +++E K +
Sbjct: 368 ENIEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKSKLH 427
Query: 253 LIKSKERVSHI 263
IK S++
Sbjct: 428 EIKRYMEKSNL 438
>gnl|CDD|217892 pfam04100, Vps53_N, Vps53-like, N-terminal. Vps53 complexes with
Vps52 and Vps54 to form a multi- subunit complex
involved in regulating membrane trafficking events.
Length = 375
Score = 30.1 bits (68), Expect = 6.9
Identities = 17/75 (22%), Positives = 45/75 (60%), Gaps = 1/75 (1%)
Query: 340 LQQLDTINREQKGDQDKLDNELRQQVQTQ-NEIKKKRHEMEEAQKRIDKLEDHIRQNEAS 398
L +D++ ++ + + +LD ++ V+ Q N +K + ++EAQK I++L I++ ++
Sbjct: 20 LTNIDSLIQKIQEEIRRLDANIKTVVRGQANVGEKGKEALDEAQKAIEELFQKIKEIKSK 79
Query: 399 LKDNKKLKEELNSDV 413
+ +++ +E+ D+
Sbjct: 80 AESTEEMVKEITRDI 94
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 30.3 bits (69), Expect = 7.1
Identities = 19/65 (29%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
E +K E+EE Q ++L ++ +LK + EEL ++V K +++EL+ L++
Sbjct: 37 ERRKLLRELEELQAERNEL---SKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDE 93
Query: 430 VIEEL 434
+ EL
Sbjct: 94 LEAEL 98
>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component
of iron-siderophores, vitamin B12 and hemin
transporters and related proteins. ABC transporters,
involved in the uptake of siderophores, heme, and
vitamin B12, are widely conserved in bacteria and
archaea. Only very few species lack representatives of
the siderophore family transporters. The E. coli BtuCD
protein is an ABC transporter mediating vitamin B12
uptake. The two ATP-binding cassettes (BtuD) are in
close contact with each other, as are the two
membrane-spanning subunits (BtuC); this arrangement is
distinct from that observed for the E. coli lipid
flippase MsbA. The BtuC subunits provide 20
transmembrane helices grouped around a translocation
pathway that is closed to the cytoplasm by a gate
region, whereas the dimer arrangement of the BtuD
subunits resembles the ATP-bound form of the Rad50 DNA
repair enzyme. A prominent cytoplasmic loop of BtuC
forms the contact region with the ATP-binding cassette
and represent a conserved motif among the ABC
transporters.
Length = 180
Score = 29.3 bits (67), Expect = 7.1
Identities = 8/30 (26%), Positives = 14/30 (46%), Gaps = 6/30 (20%)
Query: 15 SYKGKFSIGPLK------KFTAVIGPNGSG 38
Y G+ + L + ++GPNG+G
Sbjct: 8 GYGGRTVLDDLSLSIEAGEIVGILGPNGAG 37
>gnl|CDD|213207 cd03240, ABC_Rad50, ATP-binding cassette domain of Rad50. The
catalytic domains of Rad50 are similar to the ATP-binding
cassette of ABC transporters, but are not associated with
membrane-spanning domains. The conserved ATP-binding
motifs common to Rad50 and the ABC transporter family
include the Walker A and Walker B motifs, the Q loop, a
histidine residue in the switch region, a D-loop, and a
conserved LSGG sequence. This conserved sequence, LSGG,
is the most specific and characteristic motif of this
family and is thus known as the ABC signature sequence.
Length = 204
Score = 29.5 bits (67), Expect = 7.1
Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 1001 PFFVLDEIDAALDNTNIGKVASYIVT--KTQDSLQTIVISLKEEFFSHADS 1049
LDE LD NI + + I+ K+Q + Q IVI+ EE AD
Sbjct: 141 GILALDEPTTNLDEENIEESLAEIIEERKSQKNFQLIVITHDEELVDAADH 191
>gnl|CDD|213186 cd03219, ABC_Mj1267_LivG_branched, ATP-binding cassette component
of branched chain amino acids transport system. The
Mj1267/LivG ABC transporter subfamily is involved in
the transport of the hydrophobic amino acids leucine,
isoleucine and valine. MJ1267 is a branched-chain amino
acid transporter with 29% similarity to both the LivF
and LivG components of the E. coli branched-chain amino
acid transporter. MJ1267 contains an insertion from
residues 114 to 123 characteristic of LivG
(Leucine-Isoleucine-Valine) homologs. The
branched-chain amino acid transporter from E. coli
comprises a heterodimer of ABCs (LivF and LivG), a
heterodimer of six-helix TM domains (LivM and LivH),
and one of two alternative soluble periplasmic
substrate binding proteins (LivK or LivJ).
Length = 236
Score = 29.7 bits (68), Expect = 7.6
Identities = 9/19 (47%), Positives = 13/19 (68%), Gaps = 1/19 (5%)
Query: 20 FSIGPLKKFTAVIGPNGSG 38
FS+ P + +IGPNG+G
Sbjct: 21 FSVRP-GEIHGLIGPNGAG 38
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 30.3 bits (68), Expect = 7.7
Identities = 22/111 (19%), Positives = 44/111 (39%), Gaps = 3/111 (2%)
Query: 331 EATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLED 390
E + G++ +QL + D+ L + E + K +E + + LE+
Sbjct: 2106 EYGRGMGELKEQLSNTVVILGVKEKNADDALSGTPGERLEREVKSVFVEAPRDMLFLLEE 2165
Query: 391 HIRQNEASLKDNKKLKEE--LNSDVGSSKNRVQELQKELEQVIEELGDAKT 439
+R+ + S+ K K+ L + + K + L +E+ I GD
Sbjct: 2166 EVRKRKGSVMKFKSSKKPAVLEAVLFVYKIKKASL-REIRSFIRPPGDLCI 2215
>gnl|CDD|218703 pfam05700, BCAS2, Breast carcinoma amplified sequence 2 (BCAS2).
This family consists of several eukaryotic sequences of
unknown function. The mammalian members of this family
are annotated as breast carcinoma amplified sequence 2
(BCAS2) proteins. BCAS2 is a putative spliceosome
associated protein.
Length = 221
Score = 29.4 bits (66), Expect = 8.2
Identities = 16/76 (21%), Positives = 39/76 (51%)
Query: 326 TNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRI 385
L K ++ ++++ +NR++K Q++ +LR Q ++ K E+E A +
Sbjct: 142 EALLKRLEKELAELKEEIEEVNRQRKYSQEEAGEKLRSLEQRWVDLVSKNLEIEVACAEL 201
Query: 386 DKLEDHIRQNEASLKD 401
++ + +++ +A LK
Sbjct: 202 EQEIEQLKRKKAELKW 217
>gnl|CDD|213202 cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the
metal-type transporters. This family includes
transporters involved in the uptake of various metallic
cations such as iron, manganese, and zinc. The ATPases
of this group of transporters are very similar to
members of iron-siderophore uptake family suggesting
that they share a common ancestor. The best
characterized metal-type ABC transporters are the
YfeABCD system of Y. pestis, the SitABCD system of
Salmonella enterica serovar Typhimurium, and the
SitABCD transporter of Shigella flexneri. Moreover
other uncharacterized homologs of these metal-type
transporters are mainly found in pathogens like
Haemophilus or enteroinvasive E. coli isolates.
Length = 213
Score = 29.4 bits (67), Expect = 8.2
Identities = 11/36 (30%), Positives = 20/36 (55%), Gaps = 11/36 (30%)
Query: 3 PILQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
P+L+ + +F+ G+F A++GPNG+G
Sbjct: 13 PVLEDV---SFEVKPGEF--------LAIVGPNGAG 37
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 29.7 bits (67), Expect = 8.2
Identities = 26/126 (20%), Positives = 60/126 (47%), Gaps = 6/126 (4%)
Query: 83 EKYQRIREEIVAKEVEHQ------LFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAE 136
E+ +R ++E+ + + + +L E ELE++L +++ E + +E++ ++ E
Sbjct: 1 EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60
Query: 137 NILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLV 196
R ++E A E +++ E+ E E+ K K + +Q++L A+++
Sbjct: 61 EENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHE 120
Query: 197 EVRQAN 202
RQ
Sbjct: 121 RARQEL 126
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 29.1 bits (66), Expect = 8.3
Identities = 25/81 (30%), Positives = 47/81 (58%), Gaps = 9/81 (11%)
Query: 351 KGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELN 410
+ +Q K +EL Q EIKK E EE +KR+ +LE + E ++ ++++E+ +
Sbjct: 117 QAEQGK--SELEQ------EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRH 168
Query: 411 SD-VGSSKNRVQELQKELEQV 430
+D + K + Q+L+ +LEQ+
Sbjct: 169 ADEIAFLKKQNQQLKSQLEQI 189
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 28.3 bits (64), Expect = 8.4
Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 16/75 (21%)
Query: 633 EKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQ----------DLQNTKSQIAK 682
E+LS LR R E EL +++ + LE + +RQ +L+ K +I +
Sbjct: 20 ERLSSTLR------RLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEE 73
Query: 683 LEAEIDALNARADAT 697
LE E++ L R + T
Sbjct: 74 LEKELEDLEQRYETT 88
>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein. A number of members of this
family are annotated as being transmembrane proteins
induced by tumour necrosis factor alpha, but no
literature was found to support this.
Length = 330
Score = 29.7 bits (67), Expect = 8.4
Identities = 22/98 (22%), Positives = 50/98 (51%), Gaps = 5/98 (5%)
Query: 377 EMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ----VIE 432
E +E +K +L++ R + L++ +KL+E+ S + + R+++L L++ +
Sbjct: 5 EWKELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLKKLKKSLTP 64
Query: 433 ELGDAKTDKHEDTRRKKKQEL-VENFKKAYSGVYDRMI 469
E + E + +K Q +E+F +G+Y R++
Sbjct: 65 EDSELVEQLEEQIKERKNQFFDMESFLPKKNGLYLRLV 102
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 30.1 bits (68), Expect = 8.5
Identities = 20/78 (25%), Positives = 36/78 (46%), Gaps = 5/78 (6%)
Query: 64 KKKGVVAERKEAKIE-----KDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDEL 118
KKKG + ++ IE ++E E + E ++ + L + E +I+E E
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564
Query: 119 DKKKGEVEKIERRKEKAE 136
D KK + +K + +AE
Sbjct: 1565 DIKKRKNKKQYKSNTEAE 1582
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 29.8 bits (67), Expect = 8.6
Identities = 40/192 (20%), Positives = 80/192 (41%), Gaps = 11/192 (5%)
Query: 119 DKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKK-----RPS 173
++K E+RK+K + E +++Q A L ++++E R E K+ + +
Sbjct: 69 QQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKE-RLAAQEQKKQAEEAAKQA 127
Query: 174 LIKSKERVSHIQKKLASAK-KSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIP 232
+K K+ K A+AK K+ E ++A A K A+ + + +KA E +
Sbjct: 128 ALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAK--- 184
Query: 233 GRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLE 292
+ E+A K+ + ++K++ + KK A+A+ + A EA
Sbjct: 185 -KKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAA 243
Query: 293 TQLADVRKRKAE 304
+ A+
Sbjct: 244 AKAAEKAAAAKA 255
>gnl|CDD|221270 pfam11855, DUF3375, Protein of unknown function (DUF3375). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria. Proteins in this
family are typically between 479 to 499 amino acids in
length.
Length = 471
Score = 30.0 bits (68), Expect = 8.6
Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 6/50 (12%)
Query: 649 ESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATE 698
ES L TV ++ L + ++++A+LE EID L+A D E
Sbjct: 118 ESRLLTVFELLRQLAEGTE------PDPEARLAELEREIDELDAEIDRLE 161
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 30.1 bits (69), Expect = 8.7
Identities = 15/61 (24%), Positives = 31/61 (50%), Gaps = 9/61 (14%)
Query: 111 IKELEDELDKKKGEVEKIERR--------KEKAENILREKKKEQGALNRELAKVDQEIRE 162
+ LE EL K + E+E++E++ K E + +E++K +LAK+ + +
Sbjct: 813 LARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEE-KLAKLKERLAR 871
Query: 163 M 163
+
Sbjct: 872 L 872
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 30.1 bits (69), Expect = 8.7
Identities = 19/67 (28%), Positives = 31/67 (46%), Gaps = 9/67 (13%)
Query: 384 RI-----DKLEDHIRQNEASLKD----NKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
RI + +++ + EA LK+ K EL V + ++EL+KELEQ+ +L
Sbjct: 683 RIEAVTGEAALEYLNEQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQLKAKL 742
Query: 435 GDAKTDK 441
A
Sbjct: 743 AAAAAGD 749
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 28.8 bits (65), Expect = 8.7
Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 3/89 (3%)
Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILRE--KKKEQGALNRELAKVDQEIREMDV 165
E + K+L+ EL KK+ E++K E++ +K L E +K +Q L ++ ++ Q+ +
Sbjct: 41 EKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQ 100
Query: 166 EINKKRPSLIKS-KERVSHIQKKLASAKK 193
E+ +K+ L++ +++ K++A K
Sbjct: 101 ELQQKQQELLQPIYDKIDKAIKEVAKEKG 129
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 29.8 bits (67), Expect = 9.0
Identities = 64/290 (22%), Positives = 117/290 (40%), Gaps = 51/290 (17%)
Query: 213 ETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVS--HIQKKLASA 270
+ + + KR E E+ + NL+ + K +L ++ V+ +K
Sbjct: 1106 DALIENTSKRITELEKAKLA----NLDV-----VKKAISNLNNLQQEVTLLRNEKIRMHT 1156
Query: 271 KKSLVEVRQANEAHNKDIADLETQLAD-----VRKRKAEYERQSIPGRDINLES--AQMT 323
V+ E K + ++T+LAD V K+ + E++ P L+S A+
Sbjct: 1157 GTDKVDFSDI-EKLEKQLQVIDTKLADAYLLEVTKQISALEKE-KPKNQSELKSKIAKFF 1214
Query: 324 EYTN----LKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQ------------T 367
+ T L+ E K+ G LD D DKL L+ Q +
Sbjct: 1215 DTTADIEVLRNERIKKHGSSKDPLDL------SDLDKLSGNLQGVNQSLVSILITTIRSS 1268
Query: 368 QNEIKKKRHEMEEAQ-----KRIDKLEDHIRQNEASLK---DNKKLKEELNSDVGSSKNR 419
N++K K EM+E + + + KLE + +++ + K D KLK+ L + +
Sbjct: 1269 LNQMKPKTFEMQEKEIQQNFELLAKLEKTLDKSDTAEKLREDIPKLKDLLIAKQKAYPEM 1328
Query: 420 VQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSGVYDRMI 469
VQ LQ + E +I +L + H+D + + L E ++ G ++
Sbjct: 1329 VQ-LQYKSEALITQLRELCQAHHDDLAKTRTARLQELDRQDREGGITGIV 1377
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 29.9 bits (67), Expect = 9.0
Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 20/101 (19%)
Query: 74 EAKIEK--DEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKEL---EDELDKKKGEVEKI 128
+ KI K ++ E + IRE I ++ E KL N+ + +L K KG +K+
Sbjct: 74 QTKIRKLTEQDEGLRNIRESIEERQKESDQLKL-QNQKLVNQLGHARGVFIKTKGRYQKL 132
Query: 129 E----RRKEKAENI----------LREKKKEQGALNRELAK 155
E KE+ + + EK +E LNRELA+
Sbjct: 133 EELSRHLKEENQCLQIQLDALVQECNEKIEENQELNRELAE 173
>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968). This
family of proteins has no known function.
Length = 192
Score = 29.3 bits (66), Expect = 9.1
Identities = 24/100 (24%), Positives = 49/100 (49%), Gaps = 10/100 (10%)
Query: 612 DLARKAKRWDDKEM--GNLKAQKEKLSEELREAMKK-SRKESELNTVQSTIKGLEIRLNY 668
D AR+ + D E+ L+AQK +L ++ + + R +++L Q+ + + R
Sbjct: 92 DFARQTAQLADVEIRRAQLEAQKAQLERQIAASEARAERLQADLQLAQAQEQQVAARQQQ 151
Query: 669 SRQDL-------QNTKSQIAKLEAEIDALNARADATEPKI 701
+RQ+ Q ++Q+ KL+ +I L A+ + P +
Sbjct: 152 ARQEAVALEAQRQAAQAQLRKLQRQIRQLQAQQNEPIPHL 191
>gnl|CDD|238140 cd00228, eu-GS, Eukaryotic Glutathione Synthetase (eu-GS);
catalyses the production of glutathione from
gamma-glutamylcysteine and glycine in an ATP-dependent
manner. Belongs to the ATP-grasp superfamily.
Length = 471
Score = 29.6 bits (67), Expect = 9.2
Identities = 26/112 (23%), Positives = 44/112 (39%), Gaps = 30/112 (26%)
Query: 574 PEDAMKVAYDIEPQHR--YDAVALDGTFYQKSGIMSGGSLDLARKAKRWDDKEMGNL--- 628
PE + A +++P D ++ DG F Q+S L+ K D+ L
Sbjct: 53 PEALFEQAVEVQPDFNELVDRISQDGKFLQQS---------LSSTKKV--DEFTSRLLDI 101
Query: 629 --KAQKEKLSEELR------------EAMKKSRKESELNTVQSTIKGLEIRL 666
K +E + +R + K K+ E+NT+ S+ GL R+
Sbjct: 102 HKKVLEENKKQPVRLGIFRSDYMFDEKDGTKGLKQIEINTISSSFGGLSSRV 153
>gnl|CDD|235878 PRK06851, PRK06851, hypothetical protein; Provisional.
Length = 367
Score = 29.5 bits (67), Expect = 9.2
Identities = 25/106 (23%), Positives = 45/106 (42%), Gaps = 20/106 (18%)
Query: 909 EKLEHAKENLMKTNEE----FENARKRAKKAKA--------NFDRIKKERYDKFTRCFEH 956
+KL KE ++K NEE F+ A + +A A + + + ++ T +
Sbjct: 115 DKLRKHKEEILKINEEISRCFQRAYEYLNEALAIHDEWEKIYIENMDFAKANELT---DE 171
Query: 957 VSNEI-DGAGSESVLPRP---FLGPENPEEPLTYRVS-TTIVSHRY 997
+ E+ GA + + FLG P+ + + S T V +RY
Sbjct: 172 LIQELFKGAPGKISKGKVRHLFLGAITPKGAVDFVPSLTEGVKNRY 217
>gnl|CDD|219955 pfam08657, DASH_Spc34, DASH complex subunit Spc34. The DASH
complex is a ~10 subunit microtubule-binding complex
that is transferred to the kinetochore prior to mitosis.
In Saccharomyces cerevisiae DASH forms both rings and
spiral structures on microtubules in vitro. Components
of the DASH complex, including Dam1, Duo1, Spc34, Dad1
and Ask1, are essential and connect the centromere to
the plus end of spindle microtubules.
Length = 255
Score = 29.4 bits (66), Expect = 9.7
Identities = 23/127 (18%), Positives = 54/127 (42%), Gaps = 8/127 (6%)
Query: 53 QAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAK--------EVEHQLFKL 104
+A+ E S + +E +E+ E R EE+ ++E +
Sbjct: 129 KADSEIKQSAVALVLGSVLTEEITVEEFNVEVLLRGAEELCDVYPLPGAREKIETLRDRY 188
Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
I ELE E+ ++K ++E + + E ++++ ++ + K ++EIRE++
Sbjct: 189 NELSESIAELEAEIAEQKRQLEIMNISSSSPSSDTSESEEDEIDIDELIRKEEEEIRELE 248
Query: 165 VEINKKR 171
++ +
Sbjct: 249 RQLRQLE 255
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.128 0.341
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 53,777,412
Number of extensions: 5548088
Number of successful extensions: 14130
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10875
Number of HSP's successfully gapped: 1697
Length of query: 1070
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 963
Effective length of database: 6,191,724
Effective search space: 5962630212
Effective search space used: 5962630212
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 64 (28.5 bits)