RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9513
(1491 letters)
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 422 bits (1087), Expect = e-125
Identities = 366/1374 (26%), Positives = 681/1374 (49%), Gaps = 126/1374 (9%)
Query: 2 ALLDQLHIMGIRNFPADKN--RVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGA 59
A ++ I+G+R+F + +++ F PLT++VG NG GKTTIIEC+K+ T ++P G
Sbjct: 1 AKFLKMSILGVRSFGIEDKDKQIIDFFSPLTILVGPNGAGKTTIIECLKYICTGDFPPG- 59
Query: 60 SSGKNFVHDPRIGKKDEASASSGKNFVHDPRIGKKDETHAIVKLQCTCENNDTVCVVRSL 119
+ G FVHDP++ ++ + A ++LQ N + VVRS+
Sbjct: 60 --------------------TKGNTFVHDPKVAQETDVRAQIRLQFRDVNGEECAVVRSM 99
Query: 120 LLSNKNGKDNCATRDTTISRKIFATGVQKNLGCLQQESVLEMCNLIGVSKAILNNVIFCH 179
+ + K K T + I+R V + C + + EM + +GVSKA+LNNVIFCH
Sbjct: 100 VCTQKTKKTEFKTLEGVITRYKHGEKVSLSSKCAEIDR--EMISHLGVSKAVLNNVIFCH 157
Query: 180 QENSSWPLDEGKKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKK 239
QE+S+WPL EGK +K+ FDEIF AT+Y KALE+++ R +++ E + + YK+
Sbjct: 158 QEDSNWPLSEGKALKQKFDEIFSATRYIKALETLRQVRQTQGQKVQEHQMELKYLKQYKE 217
Query: 240 EADSKKQLIYNNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTE 299
+A + I + + + S E + + E + P+ +L ++ + + + + K+
Sbjct: 218 KACEIRDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSR 277
Query: 300 RDMIQESCNELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQ 359
+ +++ +ELE ++++F G +L + + + EK EL + +R + +E +
Sbjct: 278 KKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRL 337
Query: 360 SHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIK 419
+ E ++ G+L+ + H++ + + +LA L LD + ++ + +
Sbjct: 338 LNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHT 397
Query: 420 MSQTTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLT 479
+ + L E KQ + + + EK L I+ K+ +E +++L
Sbjct: 398 LVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELK 457
Query: 480 NVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDEL 539
VI ++ ++ S + L +L + E+ + K+ + LK E+++ ++ +L+ +L
Sbjct: 458 FVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKL 517
Query: 540 CVIDAEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNS 599
+D E+ L T +++ L K K I +K RH L ++ K
Sbjct: 518 RKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDE---LTSLLGYFPNKKQ 574
Query: 600 LDKALSSITFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGS 659
L+ L S + +IN+ ++ + K L +LE N ++ + L ++ L+ D++ V GS
Sbjct: 575 LEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGS 634
Query: 660 KPFEDELDRVTLELKREQEEVSMMTSTQYLFNSYIGKL-EENEPRCPLCTRFFESDYSVP 718
+ E +L+R+ E+++ ++ +M+ +++ +I +L +EN+ CP+C R F+++ +
Sbjct: 635 QDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQTEAELQ 694
Query: 719 GLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLI 778
++ L++K++ P++ + ++ + + K++ + L P ++I+ L++ +IP LR+KL
Sbjct: 695 EFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQ 754
Query: 779 ELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESK 838
++ ++ K ++++ +T L T +E++A D+T++++ EL ++R++ +Q +K
Sbjct: 755 KVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAK 814
Query: 839 ISGMRSTGVDLD----QVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDI 894
G DLD QV +++EK++EL+T SKIE + + E++Q L+ + N++
Sbjct: 815 -----LQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNEL 869
Query: 895 HSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDAL- 953
S++L + + E++ EL E+++ + +P+ET L Q E + L
Sbjct: 870 KSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELI 929
Query: 954 -KKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKE 1012
KE K ++ K+ D ++++ + +I N + G L + +N + E
Sbjct: 930 SSKETSNKKAQD--KVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLE 987
Query: 1013 DIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELK-----LSEI 1067
+ + + + Q I Q ++E L++NLTL +++ + ++ EELK + ++
Sbjct: 988 ECEKHQEKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQM 1047
Query: 1068 MISDLTKYHHTLE--------NCVIKYHSQKMRSINRLIRE--------------YWTRI 1105
+ + + H LE N V+ QK + Y +
Sbjct: 1048 QVLQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELREPQFRDAEEKYREMM 1107
Query: 1106 YQLKLSEIMISDLTKYHHTLENCVIKYHSQKMRSINRLIREYWTRIYQGNDIDYISIAAD 1165
++ +E++ DL Y+ TL+ ++K+HS KM IN++IR+ W Y+G DI+YI I +D
Sbjct: 1108 IVMRTTELVNKDLDIYYKTLDQAIMKFHSMKMEEINKIIRDLWRSTYRGQDIEYIEIRSD 1167
Query: 1166 VG---TGSEKRRTYNYRVVQKKNGIEQDMRNRCSAGQRVLACLIIRLFISDQKNFQLIVI 1222
+ S+KRR YNYRVV K DMR RCSAGQ+VLA LIIRL
Sbjct: 1168 ADENVSASDKRRNYNYRVVMLKGDTALDMRGRCSAGQKVLASLIIRL------------- 1214
Query: 1223 THDEEFIENLTAIDRAYVVRIVRDHKALAETFSRNCGIFALDEPTTNLD----------- 1271
ALAETF NCGI ALDEPTTNLD
Sbjct: 1215 --------------------------ALAETFCLNCGIIALDEPTTNLDRENIESLAHAL 1248
Query: 1272 ---IKNASDQKNFQLIVITHDEEFIENLTAIDRA-YVVRIVRDHKGLSDIHLRS 1321
IK+ S Q+NFQL+VITHDE+F+E L + R+ ++ S+I S
Sbjct: 1249 VEIIKSRSQQRNFQLLVITHDEDFVELLGRSEYVEKFYRLKKNEDQCSEIVKCS 1302
>gnl|CDD|213207 cd03240, ABC_Rad50, ATP-binding cassette domain of Rad50. The
catalytic domains of Rad50 are similar to the ATP-binding
cassette of ABC transporters, but are not associated with
membrane-spanning domains. The conserved ATP-binding
motifs common to Rad50 and the ABC transporter family
include the Walker A and Walker B motifs, the Q loop, a
histidine residue in the switch region, a D-loop, and a
conserved LSGG sequence. This conserved sequence, LSGG,
is the most specific and characteristic motif of this
family and is thus known as the ABC signature sequence.
Length = 204
Score = 97.3 bits (243), Expect = 9e-23
Identities = 55/137 (40%), Positives = 65/137 (47%), Gaps = 55/137 (40%)
Query: 1190 DMRNRCSAGQRVLACLIIRLFISDQKNFQLIVITHDEEFIENLTAIDRAYVVRIVRDHKA 1249
DMR RCS G++VLA LIIRL A
Sbjct: 111 DMRGRCSGGEKVLASLIIRL---------------------------------------A 131
Query: 1250 LAETFSRNCGIFALDEPTTNLD-----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
LAETF NCGI ALDEPTTNLD I+ QKNFQLIVITHDEE ++ A
Sbjct: 132 LAETFGSNCGILALDEPTTNLDEENIEESLAEIIEERKSQKNFQLIVITHDEELVD---A 188
Query: 1299 IDRAYVVRIVRDHKGLS 1315
D ++ R+ +D + S
Sbjct: 189 AD--HIYRVEKDGRQKS 203
Score = 86.5 bits (215), Expect = 6e-19
Identities = 58/184 (31%), Positives = 75/184 (40%), Gaps = 75/184 (40%)
Query: 4 LDQLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGK 63
+D+L I IR+F + + F PLTLIVG+NG GKTTIIE +K+ALT E P + G
Sbjct: 1 IDKLSIRNIRSF--HERSEIEFFSPLTLIVGQNGAGKTTIIEALKYALTGELPPNSKGG- 57
Query: 64 NFVHDPRIGKKDEASASSGKNFVHDPRIGKKDETHAIVKLQCTCENNDTVCVVRSLLLSN 123
HDP++ ++ E A VKL N
Sbjct: 58 ----------------------AHDPKLIREGEVRAQVKLA----------------FEN 79
Query: 124 KNGKDNCATRDTTISRKIFATGVQKNLGCLQQESVLEMCNLIGVSKAILNNVIFCHQENS 183
NG + TI+R + AIL NVIFCHQ S
Sbjct: 80 ANG------KKYTITRSL----------------------------AILENVIFCHQGES 105
Query: 184 SWPL 187
+WPL
Sbjct: 106 NWPL 109
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 98.7 bits (246), Expect = 1e-20
Identities = 161/845 (19%), Positives = 308/845 (36%), Gaps = 119/845 (14%)
Query: 4 LDQLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGK 63
+ +L + R+F D + F + LIVG NG GK++I++ I FAL
Sbjct: 3 ILRLRLKNFRSF-KDIDIEKLFDSGIFLIVGPNGAGKSSILDAITFALY----------- 50
Query: 64 NFVHDPRIGKKDEASASSGKNFVHDPRIGKKDETHAIVKLQCTCENNDTVCVVRSLLLSN 123
GK A S + + E A V+L+ N + R
Sbjct: 51 --------GKTPRLGAFSLDDLIR------AGEKSASVELEFE-VNGKKYRIEREF---- 91
Query: 124 KNGKDNCATRDTTISRKIFATGVQKNLGCLQQESVLEMC-NLIGVSKAILNNVIFCHQEN 182
+ G+ I K V E L+G+ K ++ Q
Sbjct: 92 RRGRGQSTGSLQIIEVDGERIADGKK-------DVNEKIEELLGLDKDTFTRSVYLPQGE 144
Query: 183 SSWPLDE-GKKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEA 241
L K+ KEI DE+F KY K E +K + +I E++ L ++
Sbjct: 145 FDAFLKSKPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDL 204
Query: 242 -----DSKKQLIYNNTQKRDQSFEELHN-IEESMKPINEKLVQLTEKERNMSVMSTQYQT 295
+ K+L + +Q EEL IE + + E + E + +
Sbjct: 205 LEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESL 264
Query: 296 KKTERDMIQESCNELESSIKQLFSGDK--AELQSKLNLFKINLDEKCSELENQERLKSQY 353
+ + +E ELE +++L + EL+ ++ + L+ + LE E L +
Sbjct: 265 ELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKL 324
Query: 354 IQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEE 413
E++ + + + KL KLE + E + + L L L
Sbjct: 325 KSLEER----LEKLEEKLEKLESELEELAEEKNELAKLLEERLKEL-------------- 366
Query: 414 GEGLIKMSQTTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEG 473
++ L +++ + + I L E EL + ++ ++++E
Sbjct: 367 ------------EERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEE 414
Query: 474 NKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQL---KNEIEAWIR 530
+K+L + ++ E+ + L+ +L I +L+ + + + + E
Sbjct: 415 LEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKE 474
Query: 531 QRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDM 590
E EL ++ E+S + E L+ E ++ L+E +
Sbjct: 475 LLELYELELEELEEELSREK-------EEAELREEIEELEKELRELEEELIEL------L 521
Query: 591 IPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELM 650
EE K L++ L + N ++E KEK + + L D+ + L EL+
Sbjct: 522 ELEEALKEELEEKLEKL---ENLLEELEELKEK---LQLQQLKEELRQLEDRLQELKELL 575
Query: 651 DRMELVLGSKPFEDELDRVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRF 710
+ + L+ K +EL ELK++ +E+ S +LEE L
Sbjct: 576 EELRLLRTRKEELEELRERLKELKKKLKELEERLS----------QLEELLQSLELSEAE 625
Query: 711 FESDYSVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDI 770
E + +L+++++++ Q ++ L + + ++EL+ I + I
Sbjct: 626 NELE----EAEEELESELEKLNLQAELEELLQAALEELEEKVEELE---AEIRRE-LQRI 677
Query: 771 PSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQR 830
+ +L E E + + + EL++L+ LE K L +L + EL
Sbjct: 678 ENEE-QLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELE 736
Query: 831 ELERQ 835
+LE+
Sbjct: 737 KLEKA 741
Score = 93.7 bits (233), Expect = 4e-19
Identities = 139/794 (17%), Positives = 277/794 (34%), Gaps = 123/794 (15%)
Query: 559 IKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITF--DINRIQE 616
IK K + E ++ L E + L + + E + + +I ++E
Sbjct: 180 IKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEE 239
Query: 617 DINAKEKHL-------YTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRV 669
+ E+ L S + L+ ++ L EL +E + +EL+R
Sbjct: 240 RLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELERE 299
Query: 670 TLELKREQEEVSM----MTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLK 725
EL+ E E + + S +LE+ E + E L +
Sbjct: 300 IEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEE------LAEEKN 353
Query: 726 TKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVI 785
K + E+ + +++L K+ E E ++ ++ L + L E++E +
Sbjct: 354 ELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELE 413
Query: 786 ETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQ-------RELERQESK 838
E + EL++L+ LE + + K L + I EL +EL + K
Sbjct: 414 ELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEK 473
Query: 839 -ISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSK 897
+ + ++ + +++++ EL ++E L E ++ L+ ++
Sbjct: 474 ELLELYELELEELEEELSREKEEAELREEIEELEKELRELEE--ELIELLELEEALKEEL 531
Query: 898 QLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIE---TQLNLAQSELDALK 954
+ L+ LE+ ELE + + +L++L ++ +E +L EL L
Sbjct: 532 EEK-------LEKLENLLEELEELKE--KLQLQQLKEELRQLEDRLQELKELLEEL-RLL 581
Query: 955 KEHKKKLNEEGAKIQDYTKQLEEVKRIKLEI--------LNYTKRGTLTQLAALRESVQK 1006
+ K++L E ++++ K+L+E++ ++ L+ + L ++K
Sbjct: 582 RTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEK 641
Query: 1007 LNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLSE 1066
LN + E + E ++ + A E ++N L EK E + +L EEL+
Sbjct: 642 LNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLR 701
Query: 1067 IMISDLTKYHHTLENCVIKYHSQKMRSINRLIREYWTRIYQLKLSEIMISDLTKYHHTLE 1126
+ +L K +E + + + E +L+ E + L + L
Sbjct: 702 EELEELLKKLGEIEQLIEE--------LESRKAELEELKKELEKLEKALELLEE----LR 749
Query: 1127 NCVIKYHSQKM---RSINRLIREYWTRIYQGNDIDYISIAADVGTGSEKRRTYNYRVVQK 1183
+ K + + ++ E + + + Y + + VV
Sbjct: 750 EKLGKAGLRADILRNLLAQIEAEANEILSKLSLNRYDLRRLTI---RKDGNGGLVVVV-- 804
Query: 1184 KNGIEQDMRNRCSAGQRVLACLIIRLFISDQKNFQLIVITHDEEFIENLTAIDRAYVVRI 1243
+G E S G+R LA L +RL
Sbjct: 805 YDGGEVRPIKTLSGGERFLASLALRL---------------------------------- 830
Query: 1244 VRDHKALAETFSRN--CGIFALDEPTTNLD---------IKNASDQKNFQLIVITHDEEF 1292
AL++ + LDEP LD I Q+I+I+H EE
Sbjct: 831 -----ALSDLLQGRARLELLFLDEPFGTLDEERLEKLAEILEELLSDGRQIIIISHVEEL 885
Query: 1293 IENLTAIDRAYVVR 1306
E D V+
Sbjct: 886 KE---RADVRIRVK 896
Score = 64.4 bits (157), Expect = 4e-10
Identities = 118/646 (18%), Positives = 256/646 (39%), Gaps = 55/646 (8%)
Query: 247 LIYNNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQES 306
+ + ++R + +EL +E+ + L E + + + + +E E
Sbjct: 148 FLKSKPKERKEILDELFGLEK----YEKLSELLKEVIKEAKAKIEELEGQLSELLEDIED 203
Query: 307 CNE-LESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHIN 365
E LE +K+L ++ + + + + ++ L E K + + + + +
Sbjct: 204 LLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIES 263
Query: 366 EAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTI 425
L E + ++L + L+ K+ L + + + E L++ + +
Sbjct: 264 LELEALKIREEELRELERLLEELEEKIERLEE--LEREIEELEEELEGLRALLEELEELL 321
Query: 426 DKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQI 485
+K S + LE+ + ++E+ L EK EL ++ +++E ++L
Sbjct: 322 EKLKSLEERLEK-LEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEK----- 375
Query: 486 NEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAE 545
E+ ++ L+ L+ + + E+ +LS +E + ELE EL ++ E
Sbjct: 376 -ELEKALERLKQLEEAIQELKEELAELSA---------ALEEIQEELEELEKELEELERE 425
Query: 546 ISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALS 605
+ L+ + + +++ N+ ESK I L ++ + +PEE+ K L+
Sbjct: 426 LEELEEE---IKKLEEQINQLESKELMIAELAGAGEKC-PVCGQELPEEHEKELLEL--- 478
Query: 606 SITFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDE 665
++ ++E+++ +EK L + K LR+ + L EL++ E ++E
Sbjct: 479 -YELELEELEEELS-REKEEAELREEIEELEKELRELEEELIELLELEEA------LKEE 530
Query: 666 LDRVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLK 725
L+ +L+ EE + +L + E R + + L+ K
Sbjct: 531 LEEKLEKLENLLEE-LEELKEKLQLQQLKEELRQLEDR----LQELKELLEELRLLRTRK 585
Query: 726 TKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVI 785
+++E+ E+ K + +L ++ L+EL E L ELEE
Sbjct: 586 EELEELRERLKELKKKLKELEERLSQLEELLQSLE------------LSEAENELEEAEE 633
Query: 786 ETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRST 845
E + EL+KL E + + L+ + L+ IR E+ E K+ +
Sbjct: 634 ELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQL 693
Query: 846 GVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQK 891
+L+Q+ + +E +L IE ++R E + L+K +
Sbjct: 694 EEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLE 739
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 68.2 bits (167), Expect = 3e-11
Identities = 155/877 (17%), Positives = 346/877 (39%), Gaps = 138/877 (15%)
Query: 187 LDEGKKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQ 246
L E +K E +E + ++ + LE ++ + + KEI E+K+ + ++E + ++
Sbjct: 234 LKELRKELEELEE--ELSRLEEELEELQEELEEAEKEIEELKSELE---ELREELEELQE 288
Query: 247 LIYNNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQES 306
+ ++ ++ E+ + E ++ + +L +L E+ + + + ER+ + E
Sbjct: 289 ELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE 348
Query: 307 CNELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQ----ERLKSQYIQEEKQSHT 362
+L + +++ K EL+ KL+ L+E L + E ++ E ++
Sbjct: 349 LEQLLAELEEA----KEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKR 404
Query: 363 HINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQ 422
I + +L +L E K+ L+ +L L Q EE ++ +
Sbjct: 405 EIESLEERLERLSERLEDLKEELKELEAELEEL------------QTELEELNEELEELE 452
Query: 423 TTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVI 482
+++ +K LER ++ + Q LE ++ S + +++
Sbjct: 453 EQLEELRDRLKELERELAELQEELQ-----------RLEKELSSLEARLD---------- 491
Query: 483 TQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQ---------LKNEIEAWIRQRN 533
E SQ VL+ + + +++ + + L N ++A +
Sbjct: 492 RLEAEQRASQGVRAVLEALESGLPGVYGPVAELIKVKEKYETALEAALGNRLQAVV---- 547
Query: 534 ELEDELCVIDAEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIP- 592
+E+E A I L+ A L K + +L + L D+I
Sbjct: 548 -VENEEVAKKA-IEFLKENKAGRATFLPLDRIKPLR----SLKSDAAPGFLGLASDLIDF 601
Query: 593 EENFKNSLDKALSSI----TFDINRIQEDINAKEKHLYTLEANV----------SNSSKT 638
+ ++ ++ L + R + + TL+ ++ S + ++
Sbjct: 602 DPKYEPAVRFVLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSITGGSRNKRS 661
Query: 639 LRDQKRTLAELMDRMELVLGSKPF-EDELDRVTLELKREQEEVSMMTSTQYLFNSYIGKL 697
QKR L EL + + + E+EL + EL+ ++ + + + +L
Sbjct: 662 SLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEEL 721
Query: 698 EENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKP 757
+ +S L+ +++E+ E+ + +++L ++ SL+E
Sbjct: 722 KRELAALEEELEQLQSRLEE------LEEELEELEEELEELQERLEELEEELESLEE--- 772
Query: 758 VYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTL 817
+ L+ ++ ELEE + EL++L+ LE + +
Sbjct: 773 -----------ALAKLKEEIEELEEKRQALQEELEELEEELEEAERR------------- 808
Query: 818 LDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRL 877
LD REL +L++ ER E +I ++ + +E + +L+ ++E + L
Sbjct: 809 LDALERELESLEQRRERLEQEIE----------ELEEEIEELEEKLDELEEELEELEKEL 858
Query: 878 NSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVA 937
E+L+ L+ +K ++ + LK LE+ K ELE ++EL EL ++
Sbjct: 859 EELKEELEELEAEKEELEDE----------LKELEEEKEELEEELRELESELAELKEEIE 908
Query: 938 PIETQLNLAQSELDALKKEHKKKLNE-EGAKIQDYTKQLE-EVKRIKLEI--LNYTKRGT 993
+ +L +++L+ L+ E + E E +LE E++R++ EI L
Sbjct: 909 KLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRA 968
Query: 994 LTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQ 1030
+ + + E ++L ++ED+ + I E+++
Sbjct: 969 IEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDK 1005
Score = 62.4 bits (152), Expect = 2e-09
Identities = 75/343 (21%), Positives = 150/343 (43%), Gaps = 52/343 (15%)
Query: 716 SVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRS 775
S +LK +E+ E + ++L + L+ L+ + LR
Sbjct: 661 SSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLE-----------DLLEELRR 709
Query: 776 KLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIREL----NTLQRE 831
+L ELE + E K EL L+ LE LQ L L++ + EL LQ
Sbjct: 710 QLEELERQLEELKRELAALEEELE----------QLQSRLEELEEELEELEEELEELQER 759
Query: 832 LERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQK 891
LE E ++ + L + + + +EK+ L + ++E + L +L +L+++
Sbjct: 760 LEELEEELESLEEALAKLKEEIEELEEKRQAL---QEELEELEEELEEAERRLDALEREL 816
Query: 892 NDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELD 951
S + + ++ LE+ ELE + ELEEL +++ ++ +L ++E +
Sbjct: 817 E---SLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE 873
Query: 952 ALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRK 1011
L+ E K+ E+ ++ ++L E++ +LA L+E ++KL +R
Sbjct: 874 ELEDELKELEEEK----EELEEELRELES---------------ELAELKEEIEKLRERL 914
Query: 1012 EDIIAKRGVCERTINEINQSIANQSLE--EIDLKNNLTLLEKK 1052
E++ AK E + E+ + + + + E +L+ + LE++
Sbjct: 915 EELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEE 957
Score = 61.3 bits (149), Expect = 4e-09
Identities = 63/334 (18%), Positives = 148/334 (44%), Gaps = 26/334 (7%)
Query: 722 NKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDT----DIPSLRSKL 777
+ + ++ + + + +++L +Q + + + + +L+ + LR +L
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKEL 241
Query: 778 IELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQES 837
ELEE + + EL++L+ LE EK L+ +L L + + EL EL+ +
Sbjct: 242 EELEEELSRLEEELEELQEELEE---AEKEIEELKSELEELREELEELQEELLELKEEIE 298
Query: 838 KISGMRSTGVD-LDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHS 896
++ G S + L+++ + +E + L + KIE+ + L L+ L++ ++
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEE 358
Query: 897 KQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKE 956
+ ++ + L ELE + + EL EL ++A I +L + E+++L++
Sbjct: 359 AKEELE------EKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEER 412
Query: 957 HKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIA 1016
++ ++ + E++ ++ T+L L E +++L ++ E++
Sbjct: 413 LERLSERLEDLKEELKELEAELEELQ------------TELEELNEELEELEEQLEELRD 460
Query: 1017 KRGVCERTINEINQSIANQSLEEIDLKNNLTLLE 1050
+ ER + E+ + + E L+ L LE
Sbjct: 461 RLKELERELAELQEELQRLEKELSSLEARLDRLE 494
Score = 60.5 bits (147), Expect = 6e-09
Identities = 73/328 (22%), Positives = 141/328 (42%), Gaps = 33/328 (10%)
Query: 802 KTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTG------------VDL 849
+ KE+ L+ ++N+ L L ELE+Q K+ ++L
Sbjct: 172 ERKEEAERKLER----TEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL 227
Query: 850 DQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQ----KNDIHSKQLTVQGGA 905
+LA+ KE + EL ++ + L E+L+ +K+ K+++ + ++
Sbjct: 228 ALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQ 287
Query: 906 GMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEG 965
L L++ ELEG S+ + LEEL ++ +E +L + +++ALK+E L E
Sbjct: 288 EELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEE----LEERE 343
Query: 966 AKIQDYTKQLEEVKRIK--LEILNYTKRGTL-TQLAALRESVQKLNQRKEDIIAKRGVCE 1022
+++ + L E++ K LE L ALRE + +L +I + +
Sbjct: 344 TLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELK 403
Query: 1023 RTINEINQSIANQSLEEIDLKNNLT-----LLEKKEAVAKLNEELKLSEIMISDLTKYHH 1077
R I + + + S DLK L L E + + +LNEEL+ E + +L
Sbjct: 404 REIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLK 463
Query: 1078 TLENCVIKYHSQKMRSINRLIREYWTRI 1105
LE + + +++ + + + R+
Sbjct: 464 ELERELAELQE-ELQRLEKELSSLEARL 490
Score = 57.0 bits (138), Expect = 8e-08
Identities = 68/351 (19%), Positives = 156/351 (44%), Gaps = 33/351 (9%)
Query: 254 KRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQ------YQTKKTERDMIQESC 307
+++++ +L EE+++ + + L E E+ + + Q YQ K E ++ +
Sbjct: 173 RKEEAERKLERTEENLERLEDLL---EELEKQLEKLERQAEKAERYQELKAELRELELAL 229
Query: 308 --NELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHIN 365
+L+ K+L EL+ +L+ + L+E ELE E+ + E ++ +
Sbjct: 230 LLAKLKELRKEL-----EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELE 284
Query: 366 EAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTI 425
E Q +L +L+ + E + L+ +L L + L + + EE + I+ + +
Sbjct: 285 ELQEELLELKEEIEELEGEISLLRERLEELENEL-----EELEERLEELKEKIEALKEEL 339
Query: 426 DKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQ---QIEGNKKDLTNVI 482
++ + ++ LE+ ++ E K+ L ELE ++ ++ ++E ++ N +
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNEL 399
Query: 483 TQIN-EVNQSQSTLQVLQTKLNRVNSEIDQLSKSL-----DPDQLKNEIEAWIRQRNELE 536
++ E+ + L+ L +L + E+ +L L + ++L E+E Q EL
Sbjct: 400 EELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELR 459
Query: 537 DELCVIDAEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLL 587
D L ++ E++ LQ + L ++ + E++L + + +L
Sbjct: 460 DRLKELERELAELQEE---LQRLEKELSSLEARLDRLEAEQRASQGVRAVL 507
Score = 53.2 bits (128), Expect = 1e-06
Identities = 69/362 (19%), Positives = 152/362 (41%), Gaps = 31/362 (8%)
Query: 617 DINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKRE 676
++ E+ L LEA + + L+ K L L D +E + +L+ + +L+
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEEL------RRQLEELERQLEEL 721
Query: 677 QEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTN 736
+ E++ + S + +LEE E + + +L+ +++ + E
Sbjct: 722 KRELAALEEELEQLQSRLEELEEEL------EELEEELEELQERLEELEEELESLEEALA 775
Query: 737 NKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKT 796
K I++L +++++LQE E ++ + + +L +L LE+ + E+++L+
Sbjct: 776 KLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEE 835
Query: 797 ALETPKTKEKTALSLQGDLT-LLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQ 855
+E + K +L L++ EL L+ E E E ++ + +L+ +
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE---EE 892
Query: 856 QKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDR- 914
+E ++EL + +IE + RL KL+ L+ + ++ + L + + LE
Sbjct: 893 LRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEE-LEEEYEDTLETELEREI 951
Query: 915 ---KCELEGMDSV----------YQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKL 961
+ E+E + V + EEL + +E ++ L KE +++
Sbjct: 952 ERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERF 1011
Query: 962 NE 963
E
Sbjct: 1012 KE 1013
Score = 49.7 bits (119), Expect = 1e-05
Identities = 68/382 (17%), Positives = 165/382 (43%), Gaps = 45/382 (11%)
Query: 456 EKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKS 515
E ELE ++ + Q+E ++ E+ ++ L+ L+ L + ++++L +
Sbjct: 668 ELKELEEELAELEAQLEKLEE----------ELKSLKNELRSLEDLLEELRRQLEELERQ 717
Query: 516 LDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITL-AEIKSLKNRKESKLADIN 574
L ++LK E+ A + +L+ L ++ E+ L+ + L ++ L+ ES +
Sbjct: 718 L--EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA 775
Query: 575 LLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYTLEANVSN 634
LKE + E + +L + L + ++ + ++A E+ L +LE
Sbjct: 776 KLKEEIEEL----------EEKRQALQEELEELEEELEEAERRLDALERELESLEQRRER 825
Query: 635 SSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKREQEEVSMMTSTQYLFNSYI 694
+ + + + + EL ++++ E+EL+ + EL+ +EE + +
Sbjct: 826 LEQEIEELEEEIEELEEKLD------ELEEELEELEKELEELKEE---LEELEAEKEELE 876
Query: 695 GKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQ- 753
+L+E + + + +L++++ E+ E+ + +++L + L+
Sbjct: 877 DELKEL--------EEEKEELE--EELRELESELAELKEEIEKLRERLEELEAKLERLEV 926
Query: 754 ELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQG 813
EL + E + + + + + + IE E IE G + A+E + E+ L+
Sbjct: 927 ELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLR--AIEEYEEVEERYEELKS 984
Query: 814 DLTLLDQNIRELNTLQRELERQ 835
L++ +L + EL+++
Sbjct: 985 QREDLEEAKEKLLEVIEELDKE 1006
Score = 49.3 bits (118), Expect = 2e-05
Identities = 68/353 (19%), Positives = 146/353 (41%), Gaps = 37/353 (10%)
Query: 557 AEIKSLKNRKESKLADINLLKERHDRAFHLLFDM---IPEENFKNS-LDKALSSITFDIN 612
E+ L+ + E ++ LK LL ++ + E + L + L+++ ++
Sbjct: 674 EELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELE 733
Query: 613 RIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLE 672
++Q + E+ L LE + + L + + L L + + + ++E++ + +
Sbjct: 734 QLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKL------KEEIEELEEK 787
Query: 673 LKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGL----VNKLKTKI 728
+ QEE +LEE E R R ES + +L+ +I
Sbjct: 788 RQALQEE----------LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEI 837
Query: 729 KEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETK 788
+E+ E+ + + +++L K+ L+E E + + ++ L + ELEE + E +
Sbjct: 838 EELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELE 897
Query: 789 GELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNT----------LQRELERQESK 838
EL +LK +E + + + + L + + E L+RE+ER E +
Sbjct: 898 SELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEE 957
Query: 839 ISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQK 891
I + V+L + + +E + +S+ E + E ++ L K+K
Sbjct: 958 IEALGP--VNLR-AIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEK 1007
Score = 48.6 bits (116), Expect = 3e-05
Identities = 74/396 (18%), Positives = 149/396 (37%), Gaps = 71/396 (17%)
Query: 207 NKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNIE 266
L S++ + LR+++ E+ +++ + K+ + ++ +Q L +E
Sbjct: 694 KNELRSLEDLLEELRRQLEEL----------ERQLEELKRELAALEEELEQLQSRLEELE 743
Query: 267 ESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDKAELQ 326
E ++ + E+L +L E+ + + + E ELE + L EL+
Sbjct: 744 EELEELEEELEELQERLEELEEELESLEEALAKLKEEIE---ELEEKRQALQEE-LEELE 799
Query: 327 SKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLND 386
+L + LD ELE+ E+ + + QE I E + ++ +LE + ++ +
Sbjct: 800 EELEEAERRLDALERELESLEQRRERLEQE-------IEELEEEIEELEEKLDELEEELE 852
Query: 387 TLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNENTK 446
L+ +L L + L
Sbjct: 853 ELEKELEELKEEL--------------------------------------------EEL 868
Query: 447 QAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVN 506
+AE L E ELE + + ++++ + +L + +I ++ + L+ +L
Sbjct: 869 EAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVEL 928
Query: 507 SEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAE-ISILQAQNITLAEIKS---- 561
E+++ + D L+ E+E I + E + L ++ I + E+KS
Sbjct: 929 PELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQRED 988
Query: 562 LKNRKESKLADINLLKERHDRAFHLLFDMIPEENFK 597
L+ KE L I L + F FD I ENF
Sbjct: 989 LEEAKEKLLEVIEELDKEKRERFKETFDKI-NENFS 1023
Score = 46.6 bits (111), Expect = 1e-04
Identities = 70/404 (17%), Positives = 154/404 (38%), Gaps = 79/404 (19%)
Query: 490 QSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISIL 549
+++ L+ + L R+ +++L K L+ L+ RQ + E + AE+ L
Sbjct: 176 EAERKLERTEENLERLEDLLEELEKQLEK--LE-------RQAEKAERYQ-ELKAELREL 225
Query: 550 QAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITF 609
+ + LA++K L+ E +++ L+E + +
Sbjct: 226 ELA-LLLAKLKELRKELEELEEELSRLEEELE------------------------ELQE 260
Query: 610 DINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRV 669
++ +++I + L L + + L + K + EL E +
Sbjct: 261 ELEEAEKEIEELKSELEELREELEELQEELLELKEEIEEL---------------EGEIS 305
Query: 670 TLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIK 729
L + E+ E + +LEE + K++ +
Sbjct: 306 LLRERLEELENEL--------EELEERLEELK--------------------EKIEALKE 337
Query: 730 EIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKG 789
E+ E+ + L + + + +EL+ +++ + +LR +L ELE + E +
Sbjct: 338 ELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRN 397
Query: 790 ELKKLKTALE-TPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVD 848
EL++LK +E + E+ + L+ L + EL LQ ELE ++ + +
Sbjct: 398 ELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457
Query: 849 LDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKN 892
L L + + + EL ++E + L + ++L++ Q+
Sbjct: 458 LRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQ 501
Score = 46.6 bits (111), Expect = 1e-04
Identities = 58/347 (16%), Positives = 149/347 (42%), Gaps = 33/347 (9%)
Query: 191 KKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYN 250
+++ + D + + + LE ++ Q + L++E+ ++ + + +E + + + +
Sbjct: 695 NELRSLEDLLEELRR---QLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEE 751
Query: 251 N----TQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTK---------- 296
++ ++ EEL ++EE++ + E++ +L EK + + + + +
Sbjct: 752 ELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDA 811
Query: 297 -KTERDMIQESCNELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQ 355
+ E + +++ LE I++L + EL+ KL+ + L+E ELE + +
Sbjct: 812 LERELESLEQRRERLEQEIEEL-EEEIEELEEKLDELEEELEELEKELEELKEELEELEA 870
Query: 356 EEKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGE 415
E+++ + E + + +LE + + LK ++ L + L + AK + E
Sbjct: 871 EKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLE-ELEAKLERLEVELP 929
Query: 416 GLIKM-----SQTTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVE-LESKIKSFKQ 469
L + T + +I+ LE +N +E+ E +E + + K
Sbjct: 930 ELEEELEEEYEDTLETELEREIERLEEEIEALGP-----VNLRAIEEYEEVEERYEELKS 984
Query: 470 QIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSL 516
Q E ++ ++ I E++ + + + +++N ++ K L
Sbjct: 985 QREDLEEAKEKLLEVIEELD--KEKRERFKETFDKINENFSEIFKEL 1029
Score = 46.6 bits (111), Expect = 1e-04
Identities = 34/156 (21%), Positives = 71/156 (45%), Gaps = 6/156 (3%)
Query: 925 YQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLE 984
Y+ EE RK+ E L + L+ L+K+ +KL + K + Y + E++ ++L
Sbjct: 170 YKERKEEAERKLERTEENLERLEDLLEELEKQ-LEKLERQAEKAERYQELKAELRELELA 228
Query: 985 ILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIAN-----QSLEE 1039
+L + +L L E + +L + E++ + E+ I E+ + + L+E
Sbjct: 229 LLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQE 288
Query: 1040 IDLKNNLTLLEKKEAVAKLNEELKLSEIMISDLTKY 1075
L+ + E + ++ L E L+ E + +L +
Sbjct: 289 ELLELKEEIEELEGEISLLRERLEELENELEELEER 324
Score = 35.1 bits (81), Expect = 0.39
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 22 VVRFQRPLTLIVGENGCGKTTIIECIKFAL 51
+ F T IVG NG GK+ I++ I+F L
Sbjct: 19 EINFSPGFTAIVGPNGSGKSNIVDAIRFVL 48
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 66.6 bits (163), Expect = 9e-11
Identities = 60/316 (18%), Positives = 141/316 (44%), Gaps = 26/316 (8%)
Query: 766 QDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDL-TLLDQNIRE 824
+ +I L K+ ELEE + E + L +L+ LE + + + +L + ++
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKD 734
Query: 825 LNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELN-----------TFRSKIESG 873
L L+ E+E+ E +I+ + +L+ + + +E+ E ++IE
Sbjct: 735 LARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794
Query: 874 QTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELG 933
+ L + E L L+ + ++ + ++ L+SLE R E + ++EEL
Sbjct: 795 KEELKALREALDELRAELTLLNEEAANLRER---LESLERRIAATERRLEDLEEQIEELS 851
Query: 934 RKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGT 993
+ + ++ + ++ L+ E + LNE + ++ + R +LE L+ R
Sbjct: 852 EDIESLAAEIEELEELIEELESELEALLNE-----RASLEEALALLRSELEELSEELREL 906
Query: 994 LTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQ---SLEEIDLKNNLTLLE 1050
++ + LR +++L ++ + + E I+ + + ++ + +LEE + N +
Sbjct: 907 ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDD 966
Query: 1051 KKEA---VAKLNEELK 1063
++EA + +L ++K
Sbjct: 967 EEEARRRLKRLENKIK 982
Score = 58.5 bits (142), Expect = 3e-08
Identities = 67/304 (22%), Positives = 128/304 (42%), Gaps = 36/304 (11%)
Query: 769 DIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIREL--- 825
+ LR +L EL+E + E + EL++L L+ L+ L L + EL
Sbjct: 233 RLEELREELEELQEELKEAEEELEELTAELQE----------LEEKLEELRLEVSELEEE 282
Query: 826 -NTLQRELERQESKISGM----RSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSH 880
LQ+EL ++IS + + L + Q +E + +L SK++ L
Sbjct: 283 IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAEL 342
Query: 881 NEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIE 940
EKL+ L+++ + L+ LE ELE + +LE L KVA +E
Sbjct: 343 EEKLEELKEELESL----------EAELEELEAELEELESRLEELEEQLETLRSKVAQLE 392
Query: 941 TQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAAL 1000
Q+ +E++ L + ++L + ++Q ++L + ++ + +L L
Sbjct: 393 LQIASLNNEIERL-EARLERLEDRRERLQQEIEELLK------KLEEAELKELQAELEEL 445
Query: 1001 RESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEK-KEAVAKLN 1059
E +++L + E + + E Q++ E L+ L LE+ +E + +
Sbjct: 446 EEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFS 505
Query: 1060 EELK 1063
E +K
Sbjct: 506 EGVK 509
Score = 56.2 bits (136), Expect = 1e-07
Identities = 69/370 (18%), Positives = 136/370 (36%), Gaps = 63/370 (17%)
Query: 309 ELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQ 368
E I++L EL+ K+ + L E ELE E Q +E ++ I+ +
Sbjct: 674 ERRREIEEL-EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732
Query: 369 MKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKY 428
L +LE + E ++ L +L L
Sbjct: 733 KDLARLEAEVEQLEERIAQLSKELTELEA------------------------------- 761
Query: 429 LSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEV 488
+ + E E E++I+ + QIE K++L + ++E
Sbjct: 762 -------------EIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDE- 807
Query: 489 NQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISI 548
L+ +L +N E L + L + L+ I A R+ +LE+++ + +I
Sbjct: 808 ---------LRAELTLLNEEAANLRERL--ESLERRIAATERRLEDLEEQIEELSEDIES 856
Query: 549 LQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLL-FDMIPEENFKNSLDKALSSI 607
L A+ + E++ L ES+L + + + A LL ++ L+ S +
Sbjct: 857 LAAE---IEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSEL 913
Query: 608 TFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRME--LVLGSKPFEDE 665
++ ++E + E L LE + N + L ++ E + +E + +
Sbjct: 914 RRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRR 973
Query: 666 LDRVTLELKR 675
L R+ ++K
Sbjct: 974 LKRLENKIKE 983
Score = 53.5 bits (129), Expect = 8e-07
Identities = 79/441 (17%), Positives = 155/441 (35%), Gaps = 87/441 (19%)
Query: 610 DINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGS-KPFEDELDR 668
I ++E I EK L L + + L ++ L EL ++ + E E+++
Sbjct: 685 KIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQ 744
Query: 669 VTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKI 728
+ + + +E++ + + + + EE + +I
Sbjct: 745 LEERIAQLSKELTELEAEIEELEERLEEAEEELAEA--------------------EAEI 784
Query: 729 KEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETK 788
+E+ Q K + L + L+ + L + + +LR +L LE + T+
Sbjct: 785 EELEAQIEQLKEELKALREALDELRA-------ELTLLNEEAANLRERLESLERRIAATE 837
Query: 789 GELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVD 848
L+ L E+ L D+ L I EL L ELE +
Sbjct: 838 RRLEDL----------EEQIEELSEDIESLAAEIEELEELIEELESE------------- 874
Query: 849 LDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKL----QSLQKQKNDIHSKQLTVQGG 904
L+ +L ++ + L RS++E L K + L++ + + +L ++G
Sbjct: 875 LEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGL 934
Query: 905 AGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEE 964
+ +L++R E Y LEE IE A+ L L+ +
Sbjct: 935 EVRIDNLQERLSEE------YSLTLEEAEALENKIEDDEEEARRRLKRLENK-------- 980
Query: 965 GAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERT 1024
++E+ + L + + L+E L +KED+ + E
Sbjct: 981 ----------IKELGPVNLA--------AIEEYEELKERYDFLTAQKEDLTEAKETLEEA 1022
Query: 1025 INEINQSIANQSLEEIDLKNN 1045
I EI++ + + D N
Sbjct: 1023 IEEIDREARERFKDTFDQVNE 1043
Score = 45.4 bits (108), Expect = 3e-04
Identities = 44/190 (23%), Positives = 80/190 (42%), Gaps = 8/190 (4%)
Query: 926 QTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEI 985
+ ++EEL K+A +E L + EL+ L++E ++ E + + +++ R++ E+
Sbjct: 683 EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV 742
Query: 986 LNYTKRGTLTQ--LAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLK 1043
+R L L +++L +R E+ + E I E+ I E L+
Sbjct: 743 EQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR 802
Query: 1044 NNLTLLEK-----KEAVAKLNEELKLSEIMISDLTKYHHTLENCVIKYHSQKMRSINRLI 1098
L L E A L E L+ E I+ + LE I+ S+ + S+ I
Sbjct: 803 EALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEE-QIEELSEDIESLAAEI 861
Query: 1099 REYWTRIYQL 1108
E I +L
Sbjct: 862 EELEELIEEL 871
Score = 44.7 bits (106), Expect = 5e-04
Identities = 66/343 (19%), Positives = 136/343 (39%), Gaps = 42/343 (12%)
Query: 264 NIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDKA 323
IEE + I E ++ E E+ ++ + + + + E + +++ EL I L D A
Sbjct: 678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISAL-RKDLA 736
Query: 324 ELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKK 383
L++++ + + + EL E + + +++ + EA+ ++ +LE E K+
Sbjct: 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKE 796
Query: 384 LNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNE 443
L+ L+ L +++ T E + ++ I ER D E
Sbjct: 797 ELKALREALDEL----------RAELTLLNEEAANL--RERLESLERRIAATERRLEDLE 844
Query: 444 NTK---QAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQT 500
+I +L E ELE I+ + ++E + ++ + + S L+ L
Sbjct: 845 EQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLR---SELEELSE 901
Query: 501 KLNRVNSEIDQLSKSLDPDQLK--------NEIEAWIRQ-----RNELEDELCVIDAEIS 547
+L + S+ +L + L+ + K +E I E L +A +
Sbjct: 902 ELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALEN 961
Query: 548 ILQAQNITL-AEIKSLKNRKESKLADINL--------LKERHD 581
++ +K L+N+ + L +NL LKER+D
Sbjct: 962 KIEDDEEEARRRLKRLENKIKE-LGPVNLAAIEEYEELKERYD 1003
Score = 43.5 bits (103), Expect = 0.001
Identities = 33/168 (19%), Positives = 70/168 (41%), Gaps = 17/168 (10%)
Query: 925 YQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLE 984
+ E+EEL K+ +E ++ + L L+KE ++ ++LE++++ E
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKEL-----------EELEEELEQLRKELEE 723
Query: 985 ILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKN 1044
+ + LA L V++L +R + + E I E+ + + E + +
Sbjct: 724 L-SRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
Query: 1045 NLTLLEKKEAVAKLNEELKLSEIMISDLTKYHHTLENCVIKYHSQKMR 1092
+ LE + + +L EELK + +L L + + + R
Sbjct: 783 EIEELEAQ--IEQLKEELKALREALDELRAELTLLN---EEAANLRER 825
Score = 43.1 bits (102), Expect = 0.001
Identities = 66/337 (19%), Positives = 143/337 (42%), Gaps = 27/337 (8%)
Query: 253 QKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELES 312
++R ++ +L E++ + + L E + S + Q +K ER +E EL
Sbjct: 172 ERRKETERKLERTRENLDRLEDIL-----NELERQLKSLERQAEKAER--YKELKAELRE 224
Query: 313 SIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLG 372
L EL+ +L + L E ELE E ++ + E ++++
Sbjct: 225 LELALLVLRLEELREELEELQEELKEAEEELEELTA-------ELQELEEKLEELRLEVS 277
Query: 373 KLERDEETHKKLNDTLKTKLNNLADTLCL--DTTAKSQYTPEEGEGLIKMSQTTIDKYLS 430
+LE + E +K L +++ L + + A + EE E ++ ++ +D+
Sbjct: 278 ELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAE 337
Query: 431 DIKILERTFSDNENTKQ---AEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINE 487
++ LE + + + AE+ L E ELES+++ ++Q+E + V +
Sbjct: 338 ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSK---VAQLELQ 394
Query: 488 VNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEIS 547
+ + ++ L+ +L R+ ++L + + + LK EA + EL+ EL ++ E+
Sbjct: 395 IASLNNEIERLEARLERLEDRRERLQQEI-EELLKKLEEA---ELKELQAELEELEEELE 450
Query: 548 ILQAQNITL-AEIKSLKNRKESKLADINLLKERHDRA 583
LQ + L ++ L+ E ++ + +
Sbjct: 451 ELQEELERLEEALEELREELEEAEQALDAAERELAQL 487
Score = 41.6 bits (98), Expect = 0.004
Identities = 74/386 (19%), Positives = 157/386 (40%), Gaps = 73/386 (18%)
Query: 499 QTKLNRVNSEIDQLSKSLDPDQLKNEI-EAWIRQRNELEDELCVIDAEISI--LQAQNIT 555
+ L+R+ +++L + L + + E E + + EL + ++ + + L+
Sbjct: 185 RENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRE----LELALLVLRLEELREE 240
Query: 556 LAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPE-ENFKNSLDKALSSITFDINRI 614
L E++ E +L ++ + + L + E E L K L ++ +I+R+
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300
Query: 615 QEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELK 674
++ + L LE + L + + L EL + + + E++L+ + EL+
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAEL------EEKLEELKEELE 354
Query: 675 REQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQ 734
+ E+ + + +LEE L+++++E+ EQ
Sbjct: 355 SLEAELEELEA----------ELEE------------------------LESRLEELEEQ 380
Query: 735 TNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKL 794
++ + QL Q SL +I L ++L LE+ + E+++L
Sbjct: 381 LETLRSKVAQLELQIASLNN--------------EIERLEARLERLEDRRERLQQEIEEL 426
Query: 795 KTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLA 854
LE + KE LQ +L L++ EL LQ ELER E + +R +L++
Sbjct: 427 LKKLEEAELKE-----LQAELEELEE---ELEELQEELERLEEALEELRE---ELEEAEQ 475
Query: 855 QQKEKKNELNTFRSKIESGQTRLNSH 880
+ EL +++++S + +
Sbjct: 476 ALDAAERELAQLQARLDSLERLQENL 501
Score = 40.0 bits (94), Expect = 0.011
Identities = 48/255 (18%), Positives = 99/255 (38%), Gaps = 58/255 (22%)
Query: 877 LNSHNEKLQSLQKQ----------KNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQ 926
LN +L+SL++Q K ++ +L + L+ L + EL Q
Sbjct: 195 LNELERQLKSLERQAEKAERYKELKAELRELELALL--VLRLEELREELEEL-------Q 245
Query: 927 TELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEIL 986
EL+E ++ + +L + +L+ L+ E +E +I++ K+L +
Sbjct: 246 EELKEAEEELEELTAELQELEEKLEELRLEV----SELEEEIEELQKELYALANE----- 296
Query: 987 NYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNL 1046
+ +L Q+K+ + + ER + E+ + + +L L
Sbjct: 297 -----------------ISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEEL 339
Query: 1047 TLLEKK------------EAVAKLNEELKLSEIMISDLTKYHHTLENCVIKYHSQKMRSI 1094
LE+K + +L EL+ E + +L + TL + V + ++ S+
Sbjct: 340 AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQ-LELQIASL 398
Query: 1095 NRLIREYWTRIYQLK 1109
N I R+ +L+
Sbjct: 399 NNEIERLEARLERLE 413
Score = 37.0 bits (86), Expect = 0.088
Identities = 36/198 (18%), Positives = 76/198 (38%), Gaps = 23/198 (11%)
Query: 194 KEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQ 253
EI + + LE++ +R L + + +++ + +E +SK+ + +
Sbjct: 859 AEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELE 918
Query: 254 KRDQSFEELHNIEESMK-PINEKLVQLTEK-ERNMSVMSTQYQTKKTERDMIQESCNELE 311
+ + +L E ++ I+ +L+E+ + + + + + LE
Sbjct: 919 ELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLE 978
Query: 312 SSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKL 371
+ IK+L G +NL +E E LK +Y Q + EA+ L
Sbjct: 979 NKIKEL--GP------------VNLA----AIEEYEELKERYDFLTAQKED-LTEAKETL 1019
Query: 372 GKL--ERDEETHKKLNDT 387
+ E D E ++ DT
Sbjct: 1020 EEAIEEIDREARERFKDT 1037
Score = 36.2 bits (84), Expect = 0.15
Identities = 22/61 (36%), Positives = 34/61 (55%), Gaps = 8/61 (13%)
Query: 4 LDQLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGK 63
L +L + G ++F AD + F + +T IVG NGCGK+ I++ I++ L G S K
Sbjct: 2 LKKLELAGFKSF-ADPT-TINFDKGITGIVGPNGCGKSNIVDAIRWVL------GEQSAK 53
Query: 64 N 64
Sbjct: 54 A 54
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 63.9 bits (156), Expect = 5e-10
Identities = 122/613 (19%), Positives = 256/613 (41%), Gaps = 105/613 (17%)
Query: 449 EINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSE 508
EI I + + ++ ++ I+ +K+L V+ +INE + ++L + E
Sbjct: 173 EIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINE----------ISSELPELREE 222
Query: 509 IDQLSKSL-DPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKE 567
+++L K + + ++LK EIE ++ LE ++ +I L+ + + E+K E
Sbjct: 223 LEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEER---IEELKKEIEELE 279
Query: 568 SKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYT 627
K+ ++ LKE+ + L + ++K LS + +IN I+E I E+
Sbjct: 280 EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEE---- 335
Query: 628 LEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDR------------------- 668
E + K L++ ++ L EL +R EL +K ++EL+R
Sbjct: 336 KEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEE 395
Query: 669 -------VTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLV 721
+ E+ + + + I +L++ + +CP+C R ++ L+
Sbjct: 396 LEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHR-KELL 454
Query: 722 NKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELE 781
+ ++K I ++ K ++ K ++ L+EL+ V + +L KL EL
Sbjct: 455 EEYTAELKRIEKE---LKEIEEKERKLRKELRELEKVLKKESEL---------IKLKELA 502
Query: 782 ENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISG 841
E + E + +LKK +L L++ E L+ +L + + +I
Sbjct: 503 EQLKELEEKLKKY-------------------NLEELEKKAEEYEKLKEKLIKLKGEIKS 543
Query: 842 MRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTV 901
L ++ EK EL +++E +KL L+++ ++ + +
Sbjct: 544 -----------LKKELEKLEELKKKLAELE----------KKLDELEEELAELLKELEEL 582
Query: 902 QGGAGMLKSLEDRKCELEGMDSVY------QTELEELGRKVAPIETQLNLAQSELDALKK 955
G ++ LE+R ELE + Y + ELE +++ +E +L+ A EL +K
Sbjct: 583 --GFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEK 640
Query: 956 EHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDII 1015
++ E + Y+++ E R + L+ G +L L + +++ + E +
Sbjct: 641 RLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLK 700
Query: 1016 AKRGVCERTINEI 1028
+ E+ E+
Sbjct: 701 EELEEREKAKKEL 713
Score = 57.4 bits (139), Expect = 6e-08
Identities = 169/887 (19%), Positives = 330/887 (37%), Gaps = 200/887 (22%)
Query: 12 IRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRI 71
I+NF + K+ VV F + LI+G+NG GK++I+E I L + P+
Sbjct: 8 IKNFRSHKSSVVEFDDGINLIIGQNGSGKSSILEAILVGLYWGHGS----------KPKG 57
Query: 72 GKKDEASASSGKNFVHDPRIGKKDETHAIVKLQCTCENNDTVC-VVRSL-----LLSNKN 125
KKD+ + RIG +++ E N +VRS L +
Sbjct: 58 LKKDDFT-----------RIGGSG-----TEIELKFEKNGRKYRIVRSFNRGESYLKYLD 101
Query: 126 GKDNCATRDTTISRKIFATGVQKNLGCLQQESVLEMCNLIGVSKAILNNVIFCHQENSSW 185
G + D+++ + E + + + N I+ Q
Sbjct: 102 GSEVLEEGDSSVREWV--------------ERL--------IPYHVFLNAIYIRQGEIDA 139
Query: 186 PLDEGKKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKK 245
L+ + +++ +I Y A +++ +++ I ++ + T N ++ K+
Sbjct: 140 ILESDESREKVVRQILGLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKE 199
Query: 246 QLIYNNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQE 305
+EL + + I+ +L +L E E + +++
Sbjct: 200 --------------KELEEVLREINEISSELPELRE-----------------ELEKLEK 228
Query: 306 SCNELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHIN 365
ELE +K+ + EL L K L+EK ELE + I+E K+ +
Sbjct: 229 EVKELE-ELKEEIEELEKEL-ESLEGSKRKLEEKIRELE-------ERIEELKKEIEELE 279
Query: 366 EAQMKLGKLERDEETHKKLN---DTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQ 422
E +L +L+ E + KL+ + +L + L
Sbjct: 280 EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL----------------------- 316
Query: 423 TTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVI 482
+ +I +E + E K+ + L + ELE +++ +++ E
Sbjct: 317 ---SRLEEEINGIEERIKELEE-KEERLEELKKKLKELEKRLEELEERHE---------- 362
Query: 483 TQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVI 542
E + L+ L+ +L L P++L+ E+E + + E+E+E+ I
Sbjct: 363 -LYEEAKAKKEELERLKKRL-----------TGLTPEKLEKELEELEKAKEEIEEEISKI 410
Query: 543 DAEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDR--AFHLLFDMIPEENFKNSL 600
A I LK + I LK+ + +
Sbjct: 411 TARIG-------------ELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEY 457
Query: 601 DKALSSITFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSK 660
L I ++ I+E K L LE + S+ ++ + LAE + +E L
Sbjct: 458 TAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK--LKELAEQLKELEEKLKKY 515
Query: 661 PFEDELDRVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGL 720
E EL++ E ++ +E++ + + KLEE + +
Sbjct: 516 NLE-ELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKK----------------- 557
Query: 721 VNKLKTKIKEIPEQTNNKKTHIDQLCKQ-----QRSLQELKPVYENIMKLQDTD------ 769
+ +L+ K+ E+ E+ +++L + + L+EL+P Y ++L+D +
Sbjct: 558 LAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELERE 617
Query: 770 ---IPSLRSKLIELEENVIETKGELKKLKTALETPKTK--EKTALSLQGDLTLLDQNIRE 824
+ L +L + E + ET+ L++L+ LE + K E+ L+ + L RE
Sbjct: 618 EKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELS---RE 674
Query: 825 LNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIE 871
L L+ ELE E + ++ T L + L ++++ K EL +E
Sbjct: 675 LAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALE 721
Score = 55.1 bits (133), Expect = 2e-07
Identities = 81/370 (21%), Positives = 157/370 (42%), Gaps = 42/370 (11%)
Query: 722 NKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELE 781
L IKEI + + I + + ++E + E +++ +I + S+L EL
Sbjct: 165 KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLR----EINEISSELPELR 220
Query: 782 ENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISG 841
E + + + E+K+L+ E + EK SL+G L++ IREL EL+++ ++
Sbjct: 221 EELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEE 280
Query: 842 MRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTV 901
+L + + + + ++ + RL+ E++ ++++ ++ K+ +
Sbjct: 281 KVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL 340
Query: 902 QGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKL 961
+ LK LE R ELE EE K EL+ LKK
Sbjct: 341 EELKKKLKELEKRLEELEE----RHELYEEAKAKKE-----------ELERLKKR----- 380
Query: 962 NEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRG-- 1019
G + K+LEE+++ K EI + A + E +++ + K+ I +
Sbjct: 381 -LTGLTPEKLEKELEELEKAKEEI----EEEISKITARIGELKKEIKELKKAIEELKKAK 435
Query: 1020 ----VCERTINEINQS--IANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLSEIMI---S 1070
VC R + E ++ + + E ++ L +E+KE KL +EL+ E ++ S
Sbjct: 436 GKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKE--RKLRKELRELEKVLKKES 493
Query: 1071 DLTKYHHTLE 1080
+L K E
Sbjct: 494 ELIKLKELAE 503
Score = 48.1 bits (115), Expect = 4e-05
Identities = 88/398 (22%), Positives = 183/398 (45%), Gaps = 45/398 (11%)
Query: 716 SVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPV---YENIMKLQD----- 767
S+ G KL+ KI+E+ E+ K I++L ++ + L+ELK Y + + +
Sbjct: 249 SLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDE 308
Query: 768 -TDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELN 826
+I S+L E + E EL++ + LE K K K L+ L L++
Sbjct: 309 LREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLK---ELEKRLEELEERHELYE 365
Query: 827 T---LQRELERQESKISGMRSTGV--DLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHN 881
+ ELER + +++G+ + +L+++ ++E + E++ ++I + +
Sbjct: 366 EAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK 425
Query: 882 EKLQSLQKQK----------NDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEE 931
+ ++ L+K K + H K+L + A LK +E E+E + + EL E
Sbjct: 426 KAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKELRE 484
Query: 932 LGRKVAPIETQLNLAQ--SELDALKKEHKK-KLNEEGAKIQDYTKQLEEVKRIKLEILNY 988
L + + + L + +L L+++ KK L E K ++Y K E++ ++K EI +
Sbjct: 485 LEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSL 544
Query: 989 TKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIAN------QSLEE--- 1039
K L +L L++ + +L ++ +++ + + + E+ + LE
Sbjct: 545 KKE--LEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYN 602
Query: 1040 --IDLKNNLTLLE-KKEAVAKLNEELKLSEIMISDLTK 1074
++LK+ LE +++ + KL EEL + +++ K
Sbjct: 603 EYLELKDAEKELEREEKELKKLEEELDKAFEELAETEK 640
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 59.1 bits (143), Expect = 2e-09
Identities = 41/223 (18%), Positives = 84/223 (37%), Gaps = 24/223 (10%)
Query: 6 QLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNF 65
+L I +F K+ + F + LTLI G NG GKTTI++ I++AL +
Sbjct: 1 KLTIKNFGSF---KDLEIDFSKGLTLIYGPNGSGKTTILDAIRWALYGKTS--------- 48
Query: 66 VHDPRIGKKDEASASSGKNFVHDPRIGKKDETHAIVKLQCTCENNDTVCVVRSLLLSNKN 125
K K + + K +T+ + + + S L+ K
Sbjct: 49 ------RLKKSKGRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKK 102
Query: 126 GKDNCATRDTTISRKIFATGVQKNLGCLQQESVLEMCNLIGVSKAILNNVIFCHQENSSW 185
GK + + + + L + +L + +G + ++
Sbjct: 103 GKKVKKSILEIVEIDELQEFIDELLK--SDKEILPLLLYLGQERLEELKFKRKEKKERLE 160
Query: 186 P----LDEGKKVKEIFDEIFDATKYNKALESIKIQRDRLRKEI 224
L+E + K++ +++ + + K LE +K + + L +E+
Sbjct: 161 ELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEEL 203
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 57.8 bits (140), Expect = 4e-08
Identities = 51/256 (19%), Positives = 113/256 (44%), Gaps = 20/256 (7%)
Query: 412 EEGEGLIKMSQTTIDKYLSDIKILERTFSDNENTKQ---AEINALIVEKVELESKIKSFK 468
+ GE + I + ++I LER+ ++ E + + L E +L ++I+ +
Sbjct: 283 DLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342
Query: 469 QQIEGNKKDLTNVITQI----NEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNE 524
++IE +K + + E+ ++ L+ + + E+ + L+ +LK E
Sbjct: 343 REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLE--KLKRE 400
Query: 525 IEAWIRQRNELEDELCVIDAEISILQAQ-NITLAEIKSLKNRKESKLADINLLKERHDRA 583
I R+ + L++EL + E++ L A A+I L+ KE K +I K++ +
Sbjct: 401 INELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEI---KKQEWKL 457
Query: 584 FHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQK 643
L D+ E + L + + +R++++++ ++ L EA S + +R +
Sbjct: 458 EQLAADLSKYE-------QELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGR 510
Query: 644 RTLAELMDRMELVLGS 659
L ++ V G+
Sbjct: 511 AVEEVLKASIQGVHGT 526
Score = 52.8 bits (127), Expect = 2e-06
Identities = 68/334 (20%), Positives = 134/334 (40%), Gaps = 48/334 (14%)
Query: 747 KQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEK 806
K++++L+EL+ V ENI +L D I R +L L E + L + E
Sbjct: 171 KKEKALEELEEVEENIERL-DLIIDEKRQQLERLRRER-EKAERYQALLKEKREYEGYEL 228
Query: 807 TALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEK-----KN 861
+ + R+L +L+ ELE+ +IS + +++Q+L + +K +
Sbjct: 229 -LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEE 287
Query: 862 ELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGM 921
E + KI L + L+ +K L+ E+R +LE
Sbjct: 288 EQLRVKEKIGE----LEAEIASLERSIAEKER-------------ELEDAEERLAKLEAE 330
Query: 922 DSVYQTELEELGRK--------------VAPIETQLNLAQSELDALKKEH---KKKLNEE 964
E+EEL R+ A ++ +L ++EL+ + KE + +L +
Sbjct: 331 IDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDY 390
Query: 965 GAKIQDYTKQLEEVKRIKLEILNYTKRGTL------TQLAALRESVQKLNQRKEDIIAKR 1018
K++ +++ E+KR + +R + +A + + +L + KED +
Sbjct: 391 REKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEI 450
Query: 1019 GVCERTINEINQSIANQSLEEIDLKNNLTLLEKK 1052
E + ++ ++ E DLK +EK+
Sbjct: 451 KKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKE 484
Score = 51.6 bits (124), Expect = 3e-06
Identities = 61/332 (18%), Positives = 126/332 (37%), Gaps = 58/332 (17%)
Query: 618 INAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKREQ 677
A E+ +E +++ + L ++EL R+E + E L+ + ++K
Sbjct: 232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEI------EQLLEELNKKIKDLG 285
Query: 678 EEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNN 737
EE + + IG+LE E + + + ++ ++ + +
Sbjct: 286 EEEQLRVKEK------IGELEAEIASLERSIAEKERE------LEDAEERLAKLEAEIDK 333
Query: 738 KKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTA 797
I++L ++ ++ + ++ LR++L E+++ ET+ ELK +
Sbjct: 334 LLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREK 393
Query: 798 LETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQK 857
LE + RE+N L+REL+R + ++ + DL+ +A +
Sbjct: 394 LE--------------------KLKREINELKRELDRLQEELQRLSEELADLNAAIAGIE 433
Query: 858 EKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCE 917
K NEL + E + KL+ L + + L D K E
Sbjct: 434 AKINELEE---EKEDKALEIKKQEWKLEQLAADLSKYEQE-------------LYDLKEE 477
Query: 918 LEGMDSVYQTELEELGRKVAPIETQLNLAQSE 949
+ + EL +L R++A E Q ++
Sbjct: 478 YD----RVEKELSKLQRELAEAEAQARASEER 505
Score = 50.5 bits (121), Expect = 8e-06
Identities = 63/262 (24%), Positives = 119/262 (45%), Gaps = 20/262 (7%)
Query: 810 SLQGDLTLLDQNIRE----LNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNT 865
L+ +L+ L +R L+ L +EL KI + L+Q + KE+ EL
Sbjct: 685 GLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEE 744
Query: 866 FRSKIESGQTRLNSHNEKLQS-LQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSV 924
S +E + S ++L++ +++ + D+H + L LE R + +
Sbjct: 745 DLSSLEQEIENVKSELKELEARIEELEEDLHKLEEA-------LNDLEARLSH-SRIPEI 796
Query: 925 YQTELEELGRKVAPIETQLNLAQSELDALKKEHK---KKLNEEGAKIQDYTKQLEEVKRI 981
Q EL +L +V+ IE +L + +L+ L E + K++ E + D +Q++ +++
Sbjct: 797 -QAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEK- 854
Query: 982 KLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEID 1041
++E LN K +L L +++ L R D+ +R E + E+ + I + +I+
Sbjct: 855 EIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKI-EELEAQIE 913
Query: 1042 LKNNLTLLEKKEAVAKLNEELK 1063
K L E K + L EEL
Sbjct: 914 KKRKR-LSELKAKLEALEEELS 934
Score = 48.9 bits (117), Expect = 2e-05
Identities = 71/380 (18%), Positives = 156/380 (41%), Gaps = 60/380 (15%)
Query: 610 DINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLG----SKPFEDE 665
+++ +Q ++ E L L +S++S+ + + ++ + +L E + +
Sbjct: 689 ELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSS 748
Query: 666 LDRVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLK 725
L++ +K E +E+ + I +LEE+ ++KL+
Sbjct: 749 LEQEIENVKSELKEL----------EARIEELEED--------------------LHKLE 778
Query: 726 TKIKEI-----PEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIEL 780
+ ++ + + + +L ++ ++ E + + L ++ EL
Sbjct: 779 EALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQEL 838
Query: 781 EENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKIS 840
+E I+ K ++K ++ +E +L G L++ + EL R+LE + +
Sbjct: 839 QEQRIDLKEQIKSIEKEIE----------NLNGKKEELEEELEELEAALRDLESRLGDLK 888
Query: 841 GMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLT 900
R +L+ L + + K EL +IE + RL+ KL++L+++ ++I +
Sbjct: 889 KERD---ELEAQLRELERKIEELEA---QIEKKRKRLSELKAKLEALEEELSEIEDPKGE 942
Query: 901 VQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKK 960
+ SLED + EL+ + + E+ L + LD LK E + K
Sbjct: 943 DEEIPEEELSLEDVQAELQRV----EEEIRALEPVNMLAIQEYEEVLKRLDELK-EKRAK 997
Query: 961 LNEEGAKIQDYTKQLEEVKR 980
L EE I + ++ E+ KR
Sbjct: 998 LEEERKAILERIEEYEKKKR 1017
Score = 47.8 bits (114), Expect = 6e-05
Identities = 65/311 (20%), Positives = 130/311 (41%), Gaps = 20/311 (6%)
Query: 323 AELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHK 382
L+ +L K L SEL E + QE + I E + ++ +LE++EE K
Sbjct: 677 QRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLK 736
Query: 383 KLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDN 442
+ + L+ L++L + KS+ E I+ + + K + LE S +
Sbjct: 737 ERLEELEEDLSSLEQEI---ENVKSELKELEAR--IEELEEDLHKLEEALNDLEARLSHS 791
Query: 443 E-NTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTK 501
QAE++ L E +E++++ +Q++ N+ L + E+ + Q L+ +
Sbjct: 792 RIPEIQAELSKLEEEVSRIEARLREIEQKL--NRLTLEKEYLE-KEIQELQEQRIDLKEQ 848
Query: 502 LNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKS 561
+ + EI+ L+ ++L+ E+E +LE L + E L+AQ ++
Sbjct: 849 IKSIEKEIENLNGKK--EELEEELEELEAALRDLESRLGDLKKERDELEAQ------LRE 900
Query: 562 LKNRKESKLADINLLKERHDR---AFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDI 618
L+ + E A I ++R L + + E D+ + + +Q ++
Sbjct: 901 LERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAEL 960
Query: 619 NAKEKHLYTLE 629
E+ + LE
Sbjct: 961 QRVEEEIRALE 971
Score = 47.0 bits (112), Expect = 9e-05
Identities = 64/350 (18%), Positives = 137/350 (39%), Gaps = 63/350 (18%)
Query: 207 NKALESIKIQRDRLRKEIPEIKAHYQATLNYK----KEADSKKQLIYNNTQKRDQSFEEL 262
KA + +++ E E+ +A K ++ S ++ + T++ + + L
Sbjct: 208 EKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRL 267
Query: 263 HNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDK 322
IE+ ++ +N+K+ L E+E+ ++E ELE+ I L
Sbjct: 268 EEIEQLLEELNKKIKDLGEEEQL----------------RVKEKIGELEAEIASL----- 306
Query: 323 AELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHK 382
+ ++ EK ELE+ E ++ E I++ ++ +LER+ E +
Sbjct: 307 ----------ERSIAEKERELEDAEERLAKLEAE-------IDKLLAEIEELEREIEEER 349
Query: 383 KLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDN 442
K D L + L + L + EE + K T D+ + LE+
Sbjct: 350 KRRDKLTEEYAELKEEL-----EDLRAELEEVD---KEFAETRDELKDYREKLEKL-KRE 400
Query: 443 ENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKL 502
N + E++ L E L ++ I G + + N+ + + ++
Sbjct: 401 INELKRELDRLQEELQRLSEELADLNAAIAGIEAKI----------NELEEEKEDKALEI 450
Query: 503 NRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQ 552
+ +++QL+ L + + E+ + + +E EL + E++ +AQ
Sbjct: 451 KKQEWKLEQLAADL--SKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ 498
Score = 44.7 bits (106), Expect = 4e-04
Identities = 71/391 (18%), Positives = 143/391 (36%), Gaps = 49/391 (12%)
Query: 462 SKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQL 521
+ + F ++ E ++L V I + + + +L R+ E ++ + +
Sbjct: 163 AGVAEFDRKKEKALEELEEVEENIERL---DLIIDEKRQQLERLRREREKAERYQALLKE 219
Query: 522 KNEIEAWI--RQRNELEDELCVIDAEISILQAQNITL-AEIKSLKNRKESKLADINLLKE 578
K E E + +++ LE + I+ +++ L+ + L EI L+ R E + L +
Sbjct: 220 KREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNK 279
Query: 579 RHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYTLEANVSNSSKT 638
+ E + + + + + +I ++ I KE+ L E ++
Sbjct: 280 KIKDL---------GEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAE 330
Query: 639 LRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKREQEEVSMMTSTQYLFNSYIGKLE 698
+ + EL +E D++T E +EE + +LE
Sbjct: 331 IDKLLAEIEELEREIEEERK------RRDKLTEEYAELKEE--LEDLRA--------ELE 374
Query: 699 ENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPV 758
E + F E ++LK +++ + +L + Q LQ L
Sbjct: 375 EVD------KEFAE-------TRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEE 421
Query: 759 YENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALET-PKTKEKTALSLQGDLTL 817
++ + I + +K+ ELEE + E+KK + LE K L
Sbjct: 422 LADL----NAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEE 477
Query: 818 LDQNIRELNTLQRELERQESKISGMRSTGVD 848
D+ +EL+ LQREL E++
Sbjct: 478 YDRVEKELSKLQRELAEAEAQARASEERVRG 508
Score = 44.7 bits (106), Expect = 4e-04
Identities = 64/342 (18%), Positives = 152/342 (44%), Gaps = 29/342 (8%)
Query: 210 LESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHN-IEES 268
LE +K + L+ E+ I+ +A K I ++ +Q E+L +EE
Sbjct: 683 LEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEEL 742
Query: 269 MKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDKAELQSK 328
+ ++ ++ + + + + + + + ++E+ N+LE+ + + SK
Sbjct: 743 EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSK 802
Query: 329 LNLFKINLDEKCSELE---NQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLN 385
L ++ + E+E N+ L+ +Y+++E Q E Q + L+ ++ +K
Sbjct: 803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQ------ELQEQRIDLKEQIKSIEKEI 856
Query: 386 DTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNENT 445
+ L K L + L + + + E + + D+ + ++ LER
Sbjct: 857 ENLNGKKEELEEEL-----EELEAALRDLESRLGDLKKERDELEAQLRELER-------- 903
Query: 446 KQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRV 505
K E+ A I +K + S++K+ + +E ++ + + E+ + + +L+ +Q +L RV
Sbjct: 904 KIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRV 963
Query: 506 NSEIDQLSKSLDPDQLK--NEIEAWIRQRNELEDELCVIDAE 545
EI ++L+P + E E +++ +EL+++ ++ E
Sbjct: 964 EEEI----RALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEE 1001
Score = 43.1 bits (102), Expect = 0.001
Identities = 77/389 (19%), Positives = 161/389 (41%), Gaps = 54/389 (13%)
Query: 501 KLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQ-NITLAEI 559
+L R+ ++ L + L L++E+ + +EL EL +I ++ + E
Sbjct: 675 ELQRLRERLEGLKRELS--SLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEE 732
Query: 560 KSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDIN 619
+ LK R E D++ L++ EN K+ L + + I ++ED++
Sbjct: 733 EKLKERLEELEEDLSSLEQE-------------IENVKSELKELEA----RIEELEEDLH 775
Query: 620 AKEKHLYTLEANVSNSS-KTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKREQE 678
E+ L LEA +S+S ++ + L E + R+E L E +L+R+TLE + ++
Sbjct: 776 KLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLRE--IEQKLNRLTLEKEYLEK 833
Query: 679 EVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNK 738
E+ + + I +E+ + L K +E+ E+
Sbjct: 834 EIQELQEQRIDLKEQIKSIEKE--------------------IENLNGKKEELEEELEEL 873
Query: 739 KTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTAL 798
+ + L + L++ +L+ + L K+ ELE + + + L +LK L
Sbjct: 874 EAALRDLESRLGDLKK------ERDELE-AQLRELERKIEELEAQIEKKRKRLSELKAKL 926
Query: 799 ETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGV----DLDQVLA 854
E + + +G+ + + L +Q EL+R E +I + + + ++VL
Sbjct: 927 EALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLK 986
Query: 855 QQKEKKNELNTFRSKIESGQTRLNSHNEK 883
+ E K + + ++ R+ + +K
Sbjct: 987 RLDELKEKRAKLEEERKAILERIEEYEKK 1015
Score = 41.6 bits (98), Expect = 0.003
Identities = 41/259 (15%), Positives = 104/259 (40%), Gaps = 14/259 (5%)
Query: 856 QKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRK 915
+ + EL R ++E + L+S L++ +N + + + + +E
Sbjct: 669 SRSEPAELQRLRERLEGLKRELSS---LQSELRRIENRLDELSQELSDASRKIGEIEKEI 725
Query: 916 CELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQL 975
+LE + + LEEL ++ +E ++ +SEL L+ + E + + L
Sbjct: 726 EQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR----IEELEEDLHKLEEAL 781
Query: 976 EEVKRI----KLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQS 1031
+++ ++ + +++ + ++++ Q+ + ++ E+ I E+ +
Sbjct: 782 NDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQ 841
Query: 1032 IANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLSEIMISDLTKYHHTLENCVIKYHSQKM 1091
+ + ++ + L K +L EEL+ E + DL L+ ++
Sbjct: 842 RIDLKEQIKSIEKEIENLNGK--KEELEEELEELEAALRDLESRLGDLKK-ERDELEAQL 898
Query: 1092 RSINRLIREYWTRIYQLKL 1110
R + R I E +I + +
Sbjct: 899 RELERKIEELEAQIEKKRK 917
Score = 41.2 bits (97), Expect = 0.005
Identities = 37/216 (17%), Positives = 94/216 (43%), Gaps = 18/216 (8%)
Query: 191 KKVKEIFDEIFDATK----YNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQ 246
++ E+ E+ DA++ K +E ++ + ++L++ + E++ + ++E ++ K
Sbjct: 702 NRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSL---EQEIENVKS 758
Query: 247 LIYNNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQES 306
+ + ++ E+LH +EE++ + +L E + + E I+
Sbjct: 759 ELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQ-----AELSKLEEEVSRIEAR 813
Query: 307 CNELESSI------KQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQS 360
E+E + K+ + ELQ + K + E+EN K + +E ++
Sbjct: 814 LREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEEL 873
Query: 361 HTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLA 396
+ + + +LG L+++ + + L+ K+ L
Sbjct: 874 EAALRDLESRLGDLKKERDELEAQLRELERKIEELE 909
Score = 35.8 bits (83), Expect = 0.23
Identities = 38/200 (19%), Positives = 79/200 (39%), Gaps = 17/200 (8%)
Query: 928 ELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILN 987
ELEE+ + ++ ++ + +L+ L++E +K + Q K+ E + L
Sbjct: 178 ELEEVEENIERLDLIIDEKRQQLERLRREREKAE-----RYQALLKEKRE---YEGYELL 229
Query: 988 YTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLT 1047
K Q A+ + L + E + + E+ + EI Q + + ++I
Sbjct: 230 KEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLL-EELNKKIKDLGEEE 288
Query: 1048 LLEKKEAVAKLNEELKLSEIMISDLTKYHHTLENCVIKYHSQ------KMRSINRLIREY 1101
L KE + +L E+ E I++ + E + K ++ ++ + R I E
Sbjct: 289 QLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEE 348
Query: 1102 WTRIYQLKLSEIMISDLTKY 1121
R + KL+E +
Sbjct: 349 --RKRRDKLTEEYAELKEEL 366
Score = 30.8 bits (70), Expect = 7.5
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 19 KNRVVRFQRPLTLIVGENGCGKTTIIECIKFAL 51
K +V+ F + T+I G NG GK+ I + I FAL
Sbjct: 15 KKKVIPFSKGFTVISGPNGSGKSNIGDAILFAL 47
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 56.4 bits (136), Expect = 1e-07
Identities = 169/865 (19%), Positives = 320/865 (36%), Gaps = 183/865 (21%)
Query: 12 IRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRI 71
++NF + + + F + +I G+NG GK++I++ I+FAL + +
Sbjct: 8 LKNFLSHDDSEIEFDTGINIITGKNGAGKSSIVDAIRFALFTD---------------KR 52
Query: 72 GKKDEASASSGKNFVHDPRIGKKDETHAIVKLQCTCENNDTVCVVRSLLLSNKNGKDNCA 131
+K E GKN + V+L+ + RS+ K +
Sbjct: 53 TEKIEDMIKKGKNNLE-------------VELEFRIGG-HVYQIRRSIERRGKG-----S 93
Query: 132 TRDTTISR--KIFATGVQKNLGCLQQESVLEMCNLIGVSKAILNNVIFCHQ-ENSSWPLD 188
R+ I + I A G +++ N++G+SK + N IF Q E S
Sbjct: 94 RREAYIKKDGSIIAEGFDDTTKYIEK-------NILGISKDVFLNSIFVGQGEMDSLISG 146
Query: 189 EGKKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLI 248
+ + K+I DEI + + + +K D LR EI + ++Y +E L
Sbjct: 147 DPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEI--------SNIDYLEEKLKSSNLE 198
Query: 249 YNNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCN 308
N +K+ E+ H+I L E ER +S +Y + + ++ + N
Sbjct: 199 LENIKKQIADDEKSHSI------------TLKEIER----LSIEYNNAMDDYNNLKSALN 242
Query: 309 ELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQ 368
EL S E +K++Y E K + + ++
Sbjct: 243 ELSS---------------------------------LEDMKNRYESEIKTAESDLSMEL 269
Query: 369 MKLGKLERDEETHKKL-NDTLKTKLNNLAD--TLCLDTTAKSQYTPEEGEGLIKMSQTTI 425
K + EE H K+ ND + N + D D K Q ++ I
Sbjct: 270 EKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQ--------ILSNIDAEI 321
Query: 426 DKYLSDIKILERTFSDNEN-----TKQAEINALIVEKVELESK-------IKSFKQQIEG 473
+KY + IK L D + ++ ++N I+E E I+S K++IE
Sbjct: 322 NKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEE 381
Query: 474 NKKDLTNVITQINEVNQSQS-TLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQR 532
K++ + I+E+ + Q ++ +LN +N ++ +S + L I A
Sbjct: 382 YSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKV--SSLNQRIRALRENL 439
Query: 533 NELEDELCVIDAEISILQAQNITLAEIKSL---------KNRKESKLADI-NLLKERHDR 582
+EL + +++ + S+ TL E KS K+R E K+ +I +K+ ++
Sbjct: 440 DELSRNMEMLNGQ-SVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEK 498
Query: 583 AFHL--LFDMIPEENFKNSL--DKALSSITFDINRIQEDINA-KEKHLYTLEANVSNSSK 637
L + + E S+ + S D+ I+ IN K+KH E S
Sbjct: 499 IVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSL 558
Query: 638 TLRD--QKRT----LAELMDRMELVLGSKPFEDELDRVTLELKREQEEVSMMTSTQYLFN 691
L D KRT ++ +++ + ++ R QE + +
Sbjct: 559 KLEDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYID 618
Query: 692 SYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRS 751
I ++E N L K EI E K I++L + +
Sbjct: 619 KSIREIENE--------------------ANNLNNKYNEIQE----NKILIEKLRGKIDN 654
Query: 752 LQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSL 811
++ ++I+ + S++ ++E+N+ +++ L K ++ + +
Sbjct: 655 YKKQIAEIDSIIPDLKE----ITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRT- 709
Query: 812 QGDLTLLDQNIRELNTLQRELERQE 836
+ L I ++N LE +
Sbjct: 710 --RINELSDRINDIN---ETLESMK 729
Score = 50.7 bits (121), Expect = 6e-06
Identities = 106/615 (17%), Positives = 234/615 (38%), Gaps = 90/615 (14%)
Query: 446 KQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRV 505
Q E+++LI KI +I +++ + I+ + S + L+ KL
Sbjct: 136 GQGEMDSLISGDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSS 195
Query: 506 NSEIDQLSKSLDPDQ-----LKNEIEAWIRQRNELEDELCVIDAEISILQAQ-------N 553
N E++ + K + D+ EIE + N D+ + + ++ L +
Sbjct: 196 NLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYE 255
Query: 554 ITLAEIKSLKNRKESKLADINLLKERHDRAF--------HLLFDMIPEENFKNSLDKALS 605
+ +S + + K L+ERH + + + D +N + + LS
Sbjct: 256 SEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILS 315
Query: 606 SITFDIN-----------------------RIQEDINAKEKHLYTLEAN-------VSNS 635
+I +IN +D+N + L E + + +
Sbjct: 316 NIDAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESL 375
Query: 636 SKTLRDQKRTLAELMDRMELVLGSKPFE-DELDRVTLELKREQEEVSMMTSTQYLFNSYI 694
K + + + + + + +L + + D + + E+ + +++S S+ N I
Sbjct: 376 KKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSS---LNQRI 432
Query: 695 GKLEENEPRCPLCTRFFESDYSVPGLVNKLKT-KIKEIPEQTNNKKTHIDQ-LCKQQRSL 752
L EN P L K I N KK+ +++ + + + +
Sbjct: 433 RALRENLDELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEV 492
Query: 753 QELKPVYENIMKLQDTDIPSLRSKLIELEENVIETK-GELKKLKTALETPKTKEKTALSL 811
+++ ++ K ++ + S E N IE+ +L+ +K + K K +
Sbjct: 493 KDIDEKIVDLKKRKEY-LESEEINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEI 551
Query: 812 QGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSK-- 869
+ L + +L++ + + IS +D++ ++ E K +LN S+
Sbjct: 552 KNRYKSLK--LEDLDSKRTSWLNALAVISL-----IDIETNRSRSNEIKKQLNDLESRLQ 604
Query: 870 -IESGQTRLNSHNEK--------LQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEG 920
IE G S+ +K +L + N+I ++ ++ + +++ K ++
Sbjct: 605 EIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEK---LRGKIDNYKKQIAE 661
Query: 921 MDSVYQTELEELGRKVAPIETQLNLAQSELDALK----------KEHKKKLNEEGAKIQD 970
+DS+ +L+E+ ++ IE L ++ LD K + + ++NE +I D
Sbjct: 662 IDSI-IPDLKEITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRIND 720
Query: 971 YTKQLEEVKRIKLEI 985
+ LE +K+IK I
Sbjct: 721 INETLESMKKIKKAI 735
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 50.7 bits (121), Expect = 6e-06
Identities = 109/837 (13%), Positives = 257/837 (30%), Gaps = 62/837 (7%)
Query: 207 NKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNIE 266
KALE +++ +E + Y ++ D ++L+ + ++ + S +EL E
Sbjct: 208 KKALEYYQLKEKLELEEENLLYLDYLKLN--EERIDLLQELLRDEQEEIESSKQELEKEE 265
Query: 267 ESMKPINEKLVQLTEKERNMSVMSTQYQTKKTE--RDMIQ-ESCNELESSIKQLFSGDKA 323
E + + ++ + ++++ ++ E ++++ E + + +
Sbjct: 266 EILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELK 325
Query: 324 ELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKK 383
+L+ +L K ++E EL+ E + +EE+Q + KLE+ EE
Sbjct: 326 KLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQL-------EKLQEKLEQLEEELLA 378
Query: 384 LNDTLKTKLNNLADTL-----CLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERT 438
+L++ A + K E + K I
Sbjct: 379 KKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEE 438
Query: 439 FSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVL 498
+ + K E + ++ K K ++ E K+ V
Sbjct: 439 SLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLLLRQKLEE 498
Query: 499 QTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQ-----N 553
++ E + +L D + I + + +L + IS+
Sbjct: 499 ASQKESKAREGLAVLLALIKDGVGLRIISAAGRGGDLGVAVANYKVAISVAVIVEVSAVA 558
Query: 554 ITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENF----KNSLDKALSSITF 609
+ E + L L + E L+KA
Sbjct: 559 DEVDERQKLVRALTELPLPARTFVLLDPLLKLPLKSIAVLEIDPALNLAQLNKATLEADE 618
Query: 610 DINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRV 669
D R + + T + + ++ + +L E + + S +
Sbjct: 619 DDKRAKVVEGILKDTELTKLLESAKAKESGLRKGVSLEEGLAEKSELKASLSELTKELLA 678
Query: 670 TLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLC------------TRFFESDYSV 717
EL+ + E + + I K E+ + + +
Sbjct: 679 EQELQEKAESE-LAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINE 737
Query: 718 PGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKL 777
+ + K K KE E+ + K ++ K + SL+E + E + KL
Sbjct: 738 ELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKL 797
Query: 778 IELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQES 837
EE + + ELK+ LE + + ++ + L + + + E +E
Sbjct: 798 KAQEEELRALEEELKEEAELLEEEQLLIEQEEKIK-EEELEELALELKEEQKLEKLAEEE 856
Query: 838 KISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSK 897
+ KE++ E + ++ES + + ++L+ ++ N + K
Sbjct: 857 LERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEK 916
Query: 898 QLTVQGGAGMLKSLEDRKCELEGMDSVY----------------------QTELEELGRK 935
+ ++ + + + EELG
Sbjct: 917 ENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNV 976
Query: 936 VAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRG 992
+ + + + + ++ E+ +++ ++ + + LE+ RG
Sbjct: 977 NLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRG 1033
Score = 47.3 bits (112), Expect = 7e-05
Identities = 117/833 (14%), Positives = 271/833 (32%), Gaps = 44/833 (5%)
Query: 210 LESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNN-TQKRDQSFEELHNIEES 268
LE +K+Q +L+++ + +YQ + E ++ L Y ++R +EL E+
Sbjct: 193 LEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQE 252
Query: 269 MKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDKAELQSK 328
E Q EKE + E ++ E +K L ++ EL+S+
Sbjct: 253 ---EIESSKQELEKEEEIL------AQVLKENKEEEKEKKLQEEELKLLAK-EEEELKSE 302
Query: 329 LNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTL 388
L + + +L+ E+ + +E K+ I E + +L +LE E ++ + L
Sbjct: 303 LLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQL 362
Query: 389 KTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYL------SDIKILERTFSDN 442
+ L K + + K +++ E+
Sbjct: 363 EKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLL 422
Query: 443 ENTKQAEINALIVEKVELESKIKSFKQQ----IEGNKKDLTNVITQINEVNQSQSTLQVL 498
+ K+ E+ + + LE+K ++ + K L + + + + T V
Sbjct: 423 KEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVK 482
Query: 499 QTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAE 558
+ + +L ++ + E A + + L +I A
Sbjct: 483 LLEQLELLLLRQKLEEASQKESKAREGLAVLLALIKDGVGLRIISAAGRGGDLGVAVANY 542
Query: 559 ---IKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKA--LSSITFDINR 613
I + S +AD +++ RA L + L K S +I+
Sbjct: 543 KVAISVAVIVEVSAVADEVDERQKLVRALTELPLPARTFVLLDPLLKLPLKSIAVLEIDP 602
Query: 614 IQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLEL 673
+ L E + + L +L++ + L+ E
Sbjct: 603 ALNLAQLNKATLEADEDDKRAKVVEGILKDTELTKLLESAKAKESGLRKGVSLEEGLAEK 662
Query: 674 KREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPE 733
+ +S +T K E + + K + + E
Sbjct: 663 SELKASLSELTKELLAEQELQEKAESEL----------AKNEILRRQEEIKKKEQRIKEE 712
Query: 734 QTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDI--PSLRSKLIELEENVIETKGEL 791
K + L + + Q+ +++ + + +S+L + EE +++ L
Sbjct: 713 LKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSL 772
Query: 792 KKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQ 851
K+ + A E KT++ + + + + + E + + + + +
Sbjct: 773 KEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIK 832
Query: 852 VLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSL 911
++ + + + + E + Q+ + ++L Q L+S
Sbjct: 833 EEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESK 892
Query: 912 EDRKCELEGMDSVYQT---ELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKI 968
E+++ E + LEE ++ + + + ++ +E + +E K
Sbjct: 893 EEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKE 952
Query: 969 QDYTKQLEEVKRI---KLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKR 1018
+D ++ EE + E L + + E K +KE + ++
Sbjct: 953 EDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEK 1005
Score = 40.7 bits (95), Expect = 0.007
Identities = 85/685 (12%), Positives = 216/685 (31%), Gaps = 33/685 (4%)
Query: 188 DEGKKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQL 247
+ K KE+ E + + K L+ ++I+R+ +E +++ + ++E +KK+L
Sbjct: 323 ELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKL 382
Query: 248 IYNNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESC 307
+ EE ++ + E + L E+ ++ + + + + ++ES
Sbjct: 383 ESERLSSAAKLKEEELELKN--EEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEES- 439
Query: 308 NELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEA 367
+ G E + +L + L + EL+ E L + + K E
Sbjct: 440 -------LETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLL-KETKLVKLLEQL--EL 489
Query: 368 QMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDK 427
+ KLE + K + L L + D + L + G +
Sbjct: 490 LLLRQKLEEASQKESKAREGLAVLLALIKDGVGLRIISA--------AGRGGDLGVAVAN 541
Query: 428 YLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINE 487
Y I + + + + E + + + L + E
Sbjct: 542 YKVAISVA--VIVEVSAVADEVDERQKLVRALTELPLPARTFVLLDPLLKLPLKSIAVLE 599
Query: 488 VNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEIS 547
++ + + Q+ + L + E
Sbjct: 600 IDPALNLAQLNKATLEADEDDKRAKVVEGILKD----------TELTKLLESAKAKESGL 649
Query: 548 ILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSI 607
KS S+L L ++ + + K I
Sbjct: 650 RKGVSLEEGLAEKSELKASLSELTKELLAEQELQEKAESELAKNEILRRQEEIKKKEQRI 709
Query: 608 TFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELD 667
++ +++ + + N L +QK E + + + E++ +
Sbjct: 710 KEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSE 769
Query: 668 RVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTK 727
E + +EE K +E E R E++ + + +
Sbjct: 770 LSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEE 829
Query: 728 IKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIET 787
+ E ++ ++ + +EL+ + E I K + L+ + +E ++ E
Sbjct: 830 KIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDEL 889
Query: 788 KGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGV 847
+ + +K K + + + + L+ +++ I E + + E + ++ +
Sbjct: 890 ESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEK 949
Query: 848 DLDQVLAQQKEKKNELNTFRSKIES 872
+ ++ +++E++N+ +
Sbjct: 950 EKEEDNKEEEEERNKRLLLAKEELG 974
>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
nucleotide-binding domain. ABC transporters are a large
family of proteins involved in the transport of a wide
variety of different compounds, like sugars, ions,
peptides, and more complex organic molecules. The
nucleotide-binding domain shows the highest similarity
between all members of the family. ABC transporters are a
subset of nucleotide hydrolases that contain a signature
motif, Q-loop, and H-loop/switch region, in addition to,
the Walker A motif/P-loop and Walker B motif commonly
found in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 157
Score = 46.9 bits (112), Expect = 9e-06
Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 11/67 (16%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASD---------QKNFQLIVITHDEEFIENLTAI 1299
ALA N + LDEPT+ LD + ++ +I++THD E E A
Sbjct: 90 ALARALLLNPDLLLLDEPTSGLDPASRERLLELLRELAEEGRTVIIVTHDPELAEL--AA 147
Query: 1300 DRAYVVR 1306
DR V++
Sbjct: 148 DRVIVLK 154
Score = 34.9 bits (81), Expect = 0.12
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 29 LTLIVGENGCGKTTIIECI 47
+ +VG NG GK+T++ I
Sbjct: 27 IVALVGPNGSGKSTLLRAI 45
>gnl|CDD|213192 cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding
cassette component of cobalt transport system. Domain I
of the ABC component of a cobalt transport family found
in bacteria, archaea, and eukaryota. The transition metal
cobalt is an essential component of many enzymes and must
be transported into cells in appropriate amounts when
needed. This ABC transport system of the CbiMNQO family
is involved in cobalt transport in association with the
cobalamin (vitamin B12) biosynthetic pathways. Most of
cobalt (Cbi) transport systems possess a separate CbiN
component, the cobalt-binding periplasmic protein, and
they are encoded by the conserved gene cluster cbiMNQO.
Both the CbiM and CbiQ proteins are integral cytoplasmic
membrane proteins, and the CbiO protein has the linker
peptide and the Walker A and B motifs commonly found in
the ATPase components of the ABC-type transport systems.
Length = 211
Score = 47.8 bits (115), Expect = 1e-05
Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 11/67 (16%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASD---------QKNFQLIVITHDEEFIENLTAI 1299
A+A + + I LDEPT LD + + +I++THD + + L
Sbjct: 144 AIAGVLAMDPDILLLDEPTAGLDPAGRRELLELLKKLKAEGKTIIIVTHDLDLLLEL--A 201
Query: 1300 DRAYVVR 1306
DR V+
Sbjct: 202 DRVIVLE 208
Score = 34.4 bits (80), Expect = 0.28
Identities = 9/18 (50%), Positives = 13/18 (72%)
Query: 30 TLIVGENGCGKTTIIECI 47
LIVG NG GK+T++ +
Sbjct: 30 VLIVGPNGSGKSTLLRLL 47
Score = 30.5 bits (70), Expect = 4.8
Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 5/46 (10%)
Query: 1197 AGQRVLACLIIRLFISDQKNFQLIVITHDEEFIENLTAIDRAYVVR 1242
AG+R L L+ +L + +I++THD + + L DR V+
Sbjct: 168 AGRRELLELLKKL---KAEGKTIIIVTHDLDLLLEL--ADRVIVLE 208
>gnl|CDD|213245 cd03278, ABC_SMC_barmotin, ATP-binding cassette domain of
barmotin, a member of the SMC protein family. Barmotin
is a tight junction-associated protein expressed in rat
epithelial cells which is thought to have an important
regulatory role in tight junction barrier function.
Barmotin belongs to the SMC protein family. SMC
proteins are large (approximately 110 to 170 kDa), and
each is arranged into five recognizable domains.
Amino-acid sequence homology of SMC proteins between
species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code),
which by mutational studies has been shown to be
essential in several proteins. The carboxy-terminal
domain contains a sequence (the DA-box) that resembles
a 'Walker B' motif, and a motif with homology to the
signature sequence of the ATP-binding cassette (ABC)
family of ATPases. The sequence homology within the
carboxy-terminal domain is relatively high within the
SMC1-SMC4 group, whereas SMC5 and SMC6 show some
divergence in both of these sequences. In eukaryotic
cells, the proteins are found as heterodimers of SMC1
paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
(formerly known as Rad18).
Length = 197
Score = 46.7 bits (112), Expect = 2e-05
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 8/61 (13%)
Query: 4 LDQLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGK 63
L +L + G ++F ADK + F LT IVG NG GK+ II+ I++ L G S K
Sbjct: 1 LKKLELKGFKSF-ADKTTI-PFPPGLTAIVGPNGSGKSNIIDAIRWVL------GEQSAK 52
Query: 64 N 64
+
Sbjct: 53 S 53
>gnl|CDD|213188 cd03221, ABCF_EF-3, ATP-binding cassette domain of elongation factor
3, subfamily F. Elongation factor 3 (EF-3) is a
cytosolic protein required by fungal ribosomes for in
vitro protein synthesis and for in vivo growth. EF-3
stimulates the binding of the EF-1: GTP: aa-tRNA ternary
complex to the ribosomal A site by facilitated release of
the deacylated tRNA from the E site. The reaction
requires ATP hydrolysis. EF-3 contains two ATP nucleotide
binding sequence (NBS) motifs. NBSI is sufficient for the
intrinsic ATPase activity. NBSII is essential for the
ribosome-stimulated functions.
Length = 144
Score = 45.1 bits (108), Expect = 3e-05
Identities = 16/56 (28%), Positives = 31/56 (55%), Gaps = 8/56 (14%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASDQ-----KNFQ--LIVITHDEEFIENLT 1297
ALA+ N + LDEPT +LD++ + + K + +I+++HD F++ +
Sbjct: 80 ALAKLLLENPNLLLLDEPTNHLDLE-SIEALEEALKEYPGTVILVSHDRYFLDQVA 134
Score = 30.1 bits (69), Expect = 4.5
Identities = 8/16 (50%), Positives = 13/16 (81%)
Query: 32 IVGENGCGKTTIIECI 47
+VG NG GK+T+++ I
Sbjct: 31 LVGRNGAGKSTLLKLI 46
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 47.9 bits (114), Expect = 4e-05
Identities = 47/204 (23%), Positives = 92/204 (45%), Gaps = 11/204 (5%)
Query: 778 IELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQES 837
E E +I + E K L+ E + + T L LL+ R N L+ +L
Sbjct: 455 SEYREKLIRLQHENKMLRLGQEGSENERITELQ-----QLLEDANRRNNELETQLRLANQ 509
Query: 838 KISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSK 897
+I ++ DL + L +Q K + + +SK+E +L+ NE+LQ ++Q ++
Sbjct: 510 RILELQQQVEDLQKALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPD 569
Query: 898 QLT--VQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKK 955
Q + A + +L+ + ++ M+ Y+ +E+ + ++ + N A E+ LKK
Sbjct: 570 QDQNLSRKIAELEAALQKKDEDMRAMEERYKKYVEKAREVIKTLDPKQNPASPEIQLLKK 629
Query: 956 EHKKKLNEEGAKIQDYTKQLEEVK 979
+ L E +I+ + E+ K
Sbjct: 630 Q----LTERDKRIRHLESEYEKAK 649
Score = 32.1 bits (73), Expect = 2.8
Identities = 43/186 (23%), Positives = 88/186 (47%), Gaps = 11/186 (5%)
Query: 463 KIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLK 522
+I +Q +E + + TQ+ NQ LQ L + + +Q SKS D LK
Sbjct: 482 RITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQK--ALQEQGSKSEDSSLLK 539
Query: 523 NEIEAWIRQRNELEDELCVIDAEISILQA-QNITLA-EIKSLKNRKESKLADINLLKERH 580
+++E + Q +E +EL +I L+ Q+ L+ +I L+ + K D+ ++ER+
Sbjct: 540 SKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDMRAMEERY 599
Query: 581 DRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYTLEANVSNSSKTLR 640
+ ++I +LD + + +I +++ + ++K + LE+ +K +R
Sbjct: 600 KKYVEKAREVI------KTLDPKQNPASPEIQLLKKQLTERDKRIRHLESEYE-KAKPMR 652
Query: 641 DQKRTL 646
+Q+ L
Sbjct: 653 EQEEKL 658
Score = 31.7 bits (72), Expect = 3.4
Identities = 69/344 (20%), Positives = 148/344 (43%), Gaps = 66/344 (19%)
Query: 748 QQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKT 807
++ L EL+ + + L + +L+ ++ L E + + KKL+ +ET K K
Sbjct: 272 LEKELAELQHRNDELTSLAA-ESQALKDEIDVLRE----SSDKAKKLEAQVETYKKK--- 323
Query: 808 ALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFR 867
+ +LN L+R+++ E + + V L++ L + + +L T++
Sbjct: 324 --------------LEDLNDLRRQVKLLEERNAMYMQNTVQLEEELKKANAARGQLETYK 369
Query: 868 SKIESGQTRLNSHN--------------EKLQSLQKQKNDIHSKQLTVQGGAGMLKSLED 913
+++ +L+ + EKL++LQK+K + +++ +++ L+ +
Sbjct: 370 RQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKEKERLLAERDSLRETNEELRCGQA 429
Query: 914 RKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKK-KLNEEGA---KIQ 969
++ +L D+ + L ++ P E + L + L+ E+K +L +EG+ +I
Sbjct: 430 QQDQLTQADAGTSPSGDNLAAELLPSEYREKLIR-----LQHENKMLRLGQEGSENERIT 484
Query: 970 DYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEIN 1029
+ + LE+ N TQL + + +L Q+ ED + +
Sbjct: 485 ELQQLLEDA--------NRRNNELETQLRLANQRILELQQQVED-----------LQKAL 525
Query: 1030 QSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLSEIMISDLT 1073
Q ++S + LK+ L E E + + NEEL+ I +L
Sbjct: 526 QEQGSKSEDSSLLKSKLE--EHLEQLHEANEELQKKREQIEELE 567
Score = 31.3 bits (71), Expect = 4.6
Identities = 60/275 (21%), Positives = 109/275 (39%), Gaps = 35/275 (12%)
Query: 208 KALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNIEE 267
K E + +RD LR+ E++ QA + +AD+ +N EL E
Sbjct: 405 KEKERLLAERDSLRETNEELRCG-QAQQDQLTQADAGTSPSGDNLAA------ELLPSE- 456
Query: 268 SMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQ------ESCNELESSIKQLFSGD 321
EKL++L + E M + + + ++ Q NELE+ ++ L +
Sbjct: 457 ----YREKLIRL-QHENKMLRLGQEGSENERITELQQLLEDANRRNNELETQLR-LANQR 510
Query: 322 KAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETH 381
ELQ ++ + L E+ S+ E+ LKS+ + +Q H E Q K ++ EE
Sbjct: 511 ILELQQQVEDLQKALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQI---EELE 567
Query: 382 KKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSD 441
+ L K+ L L K + ++ + KY+ + + +T
Sbjct: 568 PDQDQNLSRKIAELEAAL----QKKDED--------MRAMEERYKKYVEKAREVIKTLDP 615
Query: 442 NENTKQAEINALIVEKVELESKIKSFKQQIEGNKK 476
+N EI L + E + +I+ + + E K
Sbjct: 616 KQNPASPEIQLLKKQLTERDKRIRHLESEYEKAKP 650
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 47.3 bits (113), Expect = 6e-05
Identities = 81/464 (17%), Positives = 170/464 (36%), Gaps = 107/464 (23%)
Query: 190 GKKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIY 249
G++ EI +I + K L+ ++ + ++L EI +K+ + ++
Sbjct: 87 GEENIEIEAQI---EELKKELKKLEEKIEQLEAEI-------------EKKEEELEKAKN 130
Query: 250 NNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNE 309
K + + ++ ++E L L K+ + + ++ S E
Sbjct: 131 KFLDKAWKKLAKKYD-----SNLSEALKGLNYKKNFKEKLLKELKSVILN-ASSLLSLEE 184
Query: 310 LESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQM 369
L++ IK LFS +K EL + L L I+ DE +E E L+ I I+E
Sbjct: 185 LKAKIKTLFSSNKPEL-ALLTLSVIDFDE----IEQAEILEKSIIGSSD---VPISELIN 236
Query: 370 KLG---------KLERDEET----HKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEG 416
LG + + +T + + + K +L D + + E+ E
Sbjct: 237 NLGNSDWVKEGLEYHEEGDTCPFCQQTITEERKAELEAHFD----EEYQELI---EQLEE 289
Query: 417 LIKMSQTTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKK 476
LI ++ I+K L +++ + T +N K ++ L LE ++ Q++E K
Sbjct: 290 LIDKYESHIEKALEELESILDTEKENSEFKL-DVEELKALLEALEEILEKNLQKLEEKLK 348
Query: 477 DLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQ--LKNEIEAWIRQRNE 534
D + I + + ++ + +N + E ++ +L ++ K ++ +
Sbjct: 349 DPSTSIE-LESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVAE-- 405
Query: 535 LEDELCVIDAEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEE 594
+ +I Q +
Sbjct: 406 -------LKEDIDAYQKE------------------------------------------ 416
Query: 595 NFKNSLDKALSSITFDINRIQEDINAKEKHLYTLEANVSNSSKT 638
K L+KA++S+ +I +++ +I A EK + LE ++N T
Sbjct: 417 --KKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQLTNIEPT 458
Score = 35.0 bits (81), Expect = 0.40
Identities = 40/188 (21%), Positives = 80/188 (42%), Gaps = 22/188 (11%)
Query: 775 SKLIELEENVIET-KGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELE 833
+LIE E +I+ + ++K LE+ EK + D+ L + L + +
Sbjct: 281 QELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNL 340
Query: 834 RQ-ESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKN 892
++ E K+ ST ++L+ + + + ++ I HNEK+ +L+K+KN
Sbjct: 341 QKLEEKLKDP-STSIELESITDLIESINDIIDAINELIRE-------HNEKIDNLKKEKN 392
Query: 893 DIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDA 952
K L + + YQ E + L + + +E ++ ++E+ A
Sbjct: 393 KAKKK----------LWLHLVAEL--KEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKA 440
Query: 953 LKKEHKKK 960
L+KE K+
Sbjct: 441 LEKEIKEL 448
Score = 31.5 bits (72), Expect = 4.1
Identities = 35/212 (16%), Positives = 89/212 (41%), Gaps = 38/212 (17%)
Query: 304 QESCNELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELEN---QERLKSQYIQEEKQS 360
+E ELE+ + + +L+ ++ ++ ++++ ELE+ E+ S++ + ++
Sbjct: 266 EERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEEL 325
Query: 361 HTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKM 420
+ + L K + L+ KL + + ++ L++ E +I
Sbjct: 326 KALLEALEEILEKNLQ----------KLEEKLKDPSTSIELESITDL---IESINDIIDA 372
Query: 421 SQTTIDK----------------------YLSDIKILERTFSDNENTKQAEINALIVEKV 458
I + ++++K + + + IN+L E
Sbjct: 373 INELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIK 432
Query: 459 ELESKIKSFKQQIEGNKKDLTNVITQINEVNQ 490
+LE++IK+ +++I+ +K LTN+ +E+N+
Sbjct: 433 QLEAEIKALEKEIKELEKQLTNIEPTADEINK 464
Score = 30.7 bits (70), Expect = 6.9
Identities = 74/377 (19%), Positives = 135/377 (35%), Gaps = 46/377 (12%)
Query: 760 ENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALET---PKTKEKTALSLQGDLT 816
I +L+ ++ L K+ +LE + + + EL+K K K +K +L L
Sbjct: 95 AQIEELKK-ELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALK 153
Query: 817 LLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTR 876
L+ L +EL+ S+ + L+++ A+ K + + +
Sbjct: 154 GLNYKKNFKEKLLKELKSVILN----ASSLLSLEELKAKIKTLFSSNKPELALLTLSVID 209
Query: 877 LNSHNEK--LQSLQKQKNDIHSKQLT--------VQGGAGMLKSL-----------EDRK 915
+ + L+ +D+ +L V+ G + E+RK
Sbjct: 210 FDEIEQAEILEKSIIGSSDVPISELINNLGNSDWVKEGLEYHEEGDTCPFCQQTITEERK 269
Query: 916 CELEGM-DSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQ 974
ELE D YQ +E+L + E+ + A EL+++ +K+ +E +++
Sbjct: 270 AELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDT-EKENSEFKLDVEELKAL 328
Query: 975 LEEVKRI--KLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSI 1032
LE ++ I K K + L + + I A I E N+ I
Sbjct: 329 LEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDA----INELIREHNEKI 384
Query: 1033 ANQSLEEIDLKNNLTLLEKKEAVA---KLNEELKLSEIMISDLTKYHHTLENCV------ 1083
N E+ K L L E +E K E I+ L K LE +
Sbjct: 385 DNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKE 444
Query: 1084 IKYHSQKMRSINRLIRE 1100
IK +++ +I E
Sbjct: 445 IKELEKQLTNIEPTADE 461
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 47.3 bits (113), Expect = 7e-05
Identities = 104/537 (19%), Positives = 221/537 (41%), Gaps = 74/537 (13%)
Query: 448 AEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNS 507
+N L E EL+ +I+ +++Q E ++ + E + + L+ L+ ++ +
Sbjct: 206 ERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRE 265
Query: 508 EIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKE 567
I + + + ++L E+ + ELE+E + AE + A + + ++
Sbjct: 266 TIAETER--EREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRD 323
Query: 568 SKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYT 627
+L D L + R H + S+ D + ++E +
Sbjct: 324 EELRD-RLEECRVAAQAHN---------------EEAESLREDADDLEERAEELREEAAE 367
Query: 628 LEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFED---ELDRVT--LELKREQEE--V 680
LE+ + + + + D++ + EL + +E + + F D +L LE RE+ +
Sbjct: 368 LESELEEAREAVEDRREEIEELEEEIEEL--RERFGDAPVDLGNAEDFLEELREERDELR 425
Query: 681 SMMTSTQYLFNSYIGKLEENEP-----RCPLCTRFFESDYSVPGL------VNKLKTKIK 729
+ + ++EE E +CP C + E V + V +L+ +++
Sbjct: 426 EREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELE 485
Query: 730 EIPEQTNNKKTHIDQ---LCKQQRSLQELKPVYENIMKL---QDTDIPSLRSKLIELEEN 783
++ E+ + +++ L + + ++ L+ E++ +L + I R + EL E
Sbjct: 486 DLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRER 545
Query: 784 V--IETKGELKK---LKTALETPKTKEKTALSLQGDLTLLDQNIRELN---TLQRELERQ 835
+E + E K+ + E + +E+ A L L L + I L TL +
Sbjct: 546 AAELEAEAEEKREAAAEAEEEAEEAREEVA-ELNSKLAELKERIESLERIRTLLAAIADA 604
Query: 836 ESKISGMRSTGVDLDQV-------LAQQKEKKNEL---------NTFRSKIESGQTRLNS 879
E +I +R L ++ LA+++E+K EL R E + L
Sbjct: 605 EDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQ 664
Query: 880 HNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEG----MDSVYQTELEEL 932
EKL L+++++D+ ++ V+ L+ L +R+ LE ++++Y E EEL
Sbjct: 665 VEEKLDELREERDDLQAEIGAVENELEELEELRERREALENRVEALEALYD-EAEEL 720
Score = 38.9 bits (91), Expect = 0.027
Identities = 59/297 (19%), Positives = 125/297 (42%), Gaps = 42/297 (14%)
Query: 824 ELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTF-------RSKIESGQTR 876
EL L E+ER E + R T + D+VL + +E++ EL T R I +
Sbjct: 214 ELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETERE 273
Query: 877 LNSHNEKLQSLQKQKNDIHSKQLTVQGGAGM----LKSLEDRKCELEGMDSVYQTELEE- 931
E+++ L+++ ++ ++ + AG+ +++E R+ ELE D + LEE
Sbjct: 274 REELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEEC 333
Query: 932 -------------LGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEV 978
L +E + + E L+ E L E ++D +++EE+
Sbjct: 334 RVAAQAHNEEAESLREDADDLEERAEELREEAAELESE----LEEAREAVEDRREEIEEL 389
Query: 979 KRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIA-NQSL 1037
+ ++E L L + +++L + ++++ + E T+ + + ++L
Sbjct: 390 EE-EIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEAL 448
Query: 1038 EEI--------DLKNN---LTLLEKKEAVAKLNEELKLSEIMISDLTKYHHTLENCV 1083
E ++ + T+ E +E V +L EL+ E + ++ + E+ V
Sbjct: 449 LEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLV 505
>gnl|CDD|189014 cd09607, M3B_PepF_2, Peptidase family M3B Oligopeptidase F (PepF).
Peptidase family M3B Oligopeptidase F (PepF;
Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
includes oligoendopeptidase F from Lactococcus lactis.
This enzyme hydrolyzes peptides containing between 7 and
17 amino acids with fairly broad specificity. The PepF
gene is duplicated in L. lactis on the plasmid that
bears it, while a shortened second copy is found in
Bacillus subtilis. Most bacterial PepFs are cytoplasmic
endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus
lactis has been identified in Bacillus subtilis as an
inhibitor of sporulation initiation when over expressed
from a multicopy plasmid.
Length = 581
Score = 46.4 bits (111), Expect = 1e-04
Identities = 30/132 (22%), Positives = 49/132 (37%), Gaps = 12/132 (9%)
Query: 498 LQTKLNRVNSEIDQLSKSLDPDQLKNE-----IEAWIRQRNELEDELCVIDAEISILQAQ 552
Q L ++ ID+L LD +E +EA I EL + L + A +S L +
Sbjct: 15 FQEDLKKLKQLIDELKSLLDALLPSDEDAVETLEALITLLQELLELLRTLGAFVSCLLSA 74
Query: 553 NITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSL-DKALSSITFDI 611
+ T E ++L ++ A + L + EE+ L D L F +
Sbjct: 75 DTTDEEAQALLSQLNQLSASLEQLLTL----LDKKLAALSEEDLDALLADPELKEHAFFL 130
Query: 612 NRIQEDINAKEK 623
+ AK
Sbjct: 131 EERRR--QAKHL 140
>gnl|CDD|213194 cd03227, ABC_Class2, ATP-binding cassette domain of
non-transporter proteins. ABC-type Class 2 contains
systems involved in cellular processes other than
transport. These families are characterized by the fact
that the ABC subunit is made up of duplicated, fused
ABC modules (ABC2). No known transmembrane proteins or
domains are associated with these proteins.
Length = 162
Score = 43.1 bits (102), Expect = 2e-04
Identities = 17/53 (32%), Positives = 23/53 (43%), Gaps = 2/53 (3%)
Query: 12 IRNFP--ADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSG 62
+ FP N V + LT+I G NG GK+TI++ I AL
Sbjct: 4 LGRFPSYFVPNDVTFGEGSLTIITGPNGSGKSTILDAIGLALGGAQSATRRRS 56
Score = 41.2 bits (97), Expect = 0.001
Identities = 17/60 (28%), Positives = 24/60 (40%), Gaps = 11/60 (18%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASD---------QKNFQLIVITHDEEFIENLTAI 1299
ALA + ++ LDE LD ++ K Q+IVITH E E +
Sbjct: 93 ALASL--KPRPLYILDEIDRGLDPRDGQALAEAILEHLVKGAQVIVITHLPELAELADKL 150
Score = 35.4 bits (82), Expect = 0.081
Identities = 14/50 (28%), Positives = 19/50 (38%), Gaps = 8/50 (16%)
Query: 1186 GIEQDMRNRCSAGQRVLACLIIRLFISDQKNFQLIVITHDEEFIENLTAI 1235
D R+ + LA I+ K Q+IVITH E E +
Sbjct: 109 DRGLDPRDG-----QALAEAILEHL---VKGAQVIVITHLPELAELADKL 150
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 44.6 bits (106), Expect = 4e-04
Identities = 40/216 (18%), Positives = 78/216 (36%), Gaps = 19/216 (8%)
Query: 604 LSSITFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAEL-----MDRMELVLG 658
+ ++ I+ IQ+ I K++ + +++ L E + EL
Sbjct: 183 IQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDE 242
Query: 659 SKPFEDELDRVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVP 718
+++ + L + + + S F I E+ CP CT+ P
Sbjct: 243 LLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGG-VCPTCTQQISEG---P 298
Query: 719 GLVNKLKTKIKEIPEQTNNKKTHIDQLCK-------QQRSLQELKPVYENI---MKLQDT 768
+ K+K K+KE+ T ID+L + Q + L ELK +
Sbjct: 299 DRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVD 358
Query: 769 DIPSLRSKLIELEENVIETKGELKKLKTALETPKTK 804
+++ + EL+ ++ EL KL+ L+
Sbjct: 359 KAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKT 394
Score = 41.2 bits (97), Expect = 0.004
Identities = 49/238 (20%), Positives = 99/238 (41%), Gaps = 33/238 (13%)
Query: 714 DYSVPGLVNKL-KTKIKEIPEQT---NNKKTHI-DQLCKQQRSLQELKPVY-ENIMKLQD 767
D SV ++KL K KI+E+ +Q + K HI Q+ ++++E + ENI + Q+
Sbjct: 161 DISVLSEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQN 220
Query: 768 -------------TDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGD 814
+I L +L+ L ++ + L KL TA K+K + Q
Sbjct: 221 KYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQ---FQKV 277
Query: 815 LTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQ 874
+ + ++ T +++ +I+ ++ +L L + +EL +
Sbjct: 278 IKMYEKG-GVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDEL----------E 326
Query: 875 TRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEEL 932
++ NE+ + L + KN I + + ++ K ++ EL+ EL +L
Sbjct: 327 EIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKL 384
Score = 36.5 bits (85), Expect = 0.12
Identities = 43/236 (18%), Positives = 93/236 (39%), Gaps = 22/236 (9%)
Query: 857 KEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQ-KNDIHSKQLTVQGGAGMLKSLEDRK 915
+E ++ T KI+ Q ++ ++N+ ++ +K+ +I KQ K+++
Sbjct: 177 RELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEI 236
Query: 916 CELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQL 975
EL ++E+ + + T +S+++ +K K + E+G T+Q+
Sbjct: 237 EELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIK--MYEKGGVCPTCTQQI 294
Query: 976 EEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQ 1035
E +I + K L L ++ +L + ++ + + + E+ I+
Sbjct: 295 SEGPDRITKIKDKLKELQ-HSLEKLDTAIDELEEIMDEFNEQS----KKLLELKNKISTN 349
Query: 1036 SLEEIDLKNNLTLLEK------------KEAVAKLNEELKLSEIMISDLT--KYHH 1077
I L + ++ E +AKL +EL S+L KYH
Sbjct: 350 KQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKYHR 405
Score = 35.4 bits (82), Expect = 0.27
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 30 TLIVGENGCGKTTIIECIKFALTN 53
TLI G+NG GK+T++E + FAL
Sbjct: 30 TLITGKNGAGKSTMLEALTFALFG 53
Score = 32.3 bits (74), Expect = 2.3
Identities = 45/254 (17%), Positives = 93/254 (36%), Gaps = 43/254 (16%)
Query: 346 QERLKSQ--YIQE-EKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLD 402
Q+++K+ I+E K++ +I Q K +L + +T K + L +L NL
Sbjct: 194 QQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLV------ 247
Query: 403 TTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNE---NTKQAEINALIVEKVE 459
P + + I + + + + + Q I E +
Sbjct: 248 ---MDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQ-----ISEGPD 299
Query: 460 LESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPD 519
+KIK ++++ + + L T I+E+ + KL + ++I +SL
Sbjct: 300 RITKIKDKLKELQHSLEKLD---TAIDELEEIMDEFNEQSKKLLELKNKISTNKQSL--- 353
Query: 520 QLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLA-EIKSLKNRKE---SKLADINL 575
I ++ + + A I LQA+ + A E+ L++ + +++
Sbjct: 354 ---------ITLVDKAKK----VKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVK 400
Query: 576 LKERHDRAFHLLFD 589
K LL D
Sbjct: 401 EKYHRGIVTDLLKD 414
>gnl|CDD|213193 cd03226, ABC_cobalt_CbiO_domain2, Second domain of the ATP-binding
cassette component of cobalt transport system. Domain II
of the ABC component of a cobalt transport family found
in bacteria, archaea, and eukaryota. The transition metal
cobalt is an essential component of many enzymes and must
be transported into cells in appropriate amounts when
needed. The CbiMNQO family ABC transport system is
involved in cobalt transport in association with the
cobalamin (vitamin B12) biosynthetic pathways. Most
cobalt (Cbi) transport systems possess a separate CbiN
component, the cobalt-binding periplasmic protein, and
they are encoded by the conserved gene cluster cbiMNQO.
Both the CbiM and CbiQ proteins are integral cytoplasmic
membrane proteins, and the CbiO protein has the linker
peptide and the Walker A and B motifs commonly found in
the ATPase components of the ABC-type transport systems.
Length = 205
Score = 43.0 bits (102), Expect = 4e-04
Identities = 18/55 (32%), Positives = 25/55 (45%), Gaps = 11/55 (20%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFI 1293
A+A + DEPT+ LD I+ + Q +IVITHD EF+
Sbjct: 136 AIAAALLSGKDLLIFDEPTSGLDYKNMERVGELIRELAAQGK-AVIVITHDYEFL 189
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 44.3 bits (105), Expect = 5e-04
Identities = 54/293 (18%), Positives = 102/293 (34%), Gaps = 34/293 (11%)
Query: 721 VNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIEL 780
+ K+ +I+++ E + +L L + + + LR +L L
Sbjct: 241 IQKVAPEIEKLQEDFEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTL 300
Query: 781 E----ENVIETKGELKKLKTALETPKTK------------EKTALSLQGDLTLLDQNIRE 824
E E E EL L +++ + LQ DL L E
Sbjct: 301 EDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSE 360
Query: 825 LNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKI--ESGQTRLNSHNE 882
L ++ L+ K ++ L Q + +Q E+ E N R E + + E
Sbjct: 361 LEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEE 420
Query: 883 KLQSL--------QKQKNDIHSKQLTVQGGAGMLKSL-------EDRKCELEGMDSVYQT 927
LQ+L + K + + ++ ++ G LK + +LE D +
Sbjct: 421 DLQALESQLRQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEK 480
Query: 928 ELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKR 980
EE + A +E QL +L + E + L ++ + L+E++
Sbjct: 481 AQEEQEQAEANVE-QLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELEL 532
Score = 38.9 bits (91), Expect = 0.022
Identities = 57/293 (19%), Positives = 114/293 (38%), Gaps = 36/293 (12%)
Query: 789 GELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVD 848
E++KL+ E + E L G+L + +R E QE +
Sbjct: 246 PEIEKLQEDFEQLLSLELRLQHLHGEL---------VADEERLAEEQEERQEAKNRLRQQ 296
Query: 849 LDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGML 908
L + Q KE ++ELN L++ N KL + + + + ++ +
Sbjct: 297 LRTLEDQLKEARDELNQ----------ELSAANAKLAADRSELELLEDQKGAFED----- 341
Query: 909 KSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKI 968
+E + +L+ + S ++ELEE+ ++ + + Q + + LK++ K++L + K
Sbjct: 342 ADIEQLQADLDQLPS-IRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKN 400
Query: 969 QDYT-KQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINE 1027
+ EE R K I L L + + Q+L K + + E +
Sbjct: 401 NERLAAIREEKDRQKAAI-----EEDLQALES--QLRQQLEAGKLEFNEEEYELELRLGR 453
Query: 1028 INQSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLSEIMISDLTKYHHTLE 1080
+ Q + + + + L + + EA+ K EE + +E + L L
Sbjct: 454 LKQRLDSATATP-EELEQLEIND--EALEKAQEEQEQAEANVEQLQSELRQLR 503
Score = 32.0 bits (73), Expect = 2.9
Identities = 51/330 (15%), Positives = 111/330 (33%), Gaps = 61/330 (18%)
Query: 219 RLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNIEESMKPINEKLVQ 278
RL+ E+ A + ++E K + + + +E +E + ++ +
Sbjct: 264 RLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEA--RDELNQELSAANAK 321
Query: 279 LTEKERNMSVMSTQYQ--------TKKTERDM---IQESCNELESSIKQLFSGDKAELQS 327
L + ++ Q + + D I+ E+E+ + +G ++Q
Sbjct: 322 LAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLD-ALTGKHQDVQR 380
Query: 328 KLNLFKINLDEKCSEL--ENQERLKSQYIQEEKQSHTHINEAQMKLGKL-ERDEETHKKL 384
K K + E+ +N ERL + ++++Q + Q +L ++ E +
Sbjct: 381 KYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEF 440
Query: 385 NDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNEN 444
N+ L +A TPEE D+ L
Sbjct: 441 NEEEYELELRLGRLKQRLDSA--TATPEE-----LEQLEINDEAL--------------E 479
Query: 445 TKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNR 504
Q E +L+S+++ +++ + ++ LQ + +L +
Sbjct: 480 KAQEEQEQAEANVEQLQSELRQLRKRRD-----------------EALEALQRAERRLLQ 522
Query: 505 VNSEIDQLSKSLDPDQ------LKNEIEAW 528
+ +D+L L P L+NE W
Sbjct: 523 LRQALDELELQLSPQAGSLLHFLRNEAPGW 552
>gnl|CDD|224048 COG1123, COG1123, ATPase components of various ABC-type transport
systems, contain duplicated ATPase [General function
prediction only].
Length = 539
Score = 43.3 bits (103), Expect = 9e-04
Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 12/54 (22%)
Query: 1263 LDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTAIDRAYVVR 1306
DEPTT LD +K+ + ++ ITHD + L DR V+
Sbjct: 178 ADEPTTALDVTTQAQILDLLKDLQRELGMAVLFITHDLGVVAELA--DRVVVMY 229
Score = 30.2 bits (69), Expect = 8.9
Identities = 7/12 (58%), Positives = 11/12 (91%)
Query: 32 IVGENGCGKTTI 43
+VGE+G GK+T+
Sbjct: 322 LVGESGSGKSTL 333
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists of
several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 43.4 bits (102), Expect = 0.001
Identities = 67/270 (24%), Positives = 118/270 (43%), Gaps = 48/270 (17%)
Query: 756 KPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDL 815
K V E++ LQ ++ +SKLI+LE +++ + E K+ + LE
Sbjct: 49 KQVEESMTLLQRAELIRSKSKLIQLENELMQKELEHKRAQIELE---------------- 92
Query: 816 TLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQT 875
R+ +TL ER+ + + V L + +K+ +NE + + +
Sbjct: 93 -------RKASTLAENYERELDRNLELE---VRLKALEELEKKAENEAAEAEEEAKLLKD 142
Query: 876 RLNSHNEKLQSLQKQ------------KNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDS 923
+L++ + KLQ+ ++ KND+ Q Q LK LE EL
Sbjct: 143 KLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELEEL----- 197
Query: 924 VYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKL 983
+ +LEE +++A E +L SE A ++ K+ +++ Y + E VK +K
Sbjct: 198 --REQLEECQKELAEAEKKLQSLTSE-QASSADNSVKIKHLEEELKRYEQDAEVVKSMKE 254
Query: 984 EILNYTKRGTLTQLAALRESVQKLNQRKED 1013
++L + +LAALRE +KL KED
Sbjct: 255 QLLQIPE--LERELAALREENRKLRSMKED 282
Score = 34.1 bits (78), Expect = 0.74
Identities = 92/633 (14%), Positives = 230/633 (36%), Gaps = 77/633 (12%)
Query: 236 NYKKEADSKKQLIYNNTQKRDQS-FEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQ 294
+ +K+ + L+ R +S +L N + +++ E++ +
Sbjct: 46 SLQKQVEESMTLLQRAELIRSKSKLIQLENELMQKELEHKRAQIELERKAS--------- 96
Query: 295 TKKTERDMIQESCNELESSIKQL------FSGDKAELQSKLNLFKINLDEKCSELENQER 348
T + + ELE +K L + AE + + L K LD + +L+N+
Sbjct: 97 TLAENYERELDRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNE-- 154
Query: 349 LKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQ 408
K ++E K+S + I ++ ++ +T KL ++ +L + + +
Sbjct: 155 -KEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELAEAEK 213
Query: 409 YTPE---------EGEGLIKMSQTTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVE 459
+ IK + + +Y D ++++ +I L E
Sbjct: 214 KLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQ-----LLQIPELERELAA 268
Query: 460 LESKIKSFKQQIEGN---KKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSL 516
L + + + E N K++L ++ +++ + + L L+ + ++ +E+ + KSL
Sbjct: 269 LREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENEL-KSWKSL 327
Query: 517 DPDQLKNEI--EAWIRQRNELEDELCVIDAEISILQAQNITLAE-IKSLKNRKESKLADI 573
D N + R+ L++E + + + + L ++ L+ ++ +++I
Sbjct: 328 LQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQLERQKAVSEI 387
Query: 574 NLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYTLEANVS 633
LK++ + + L + L+ +T + + ++ +N+ +K L +
Sbjct: 388 LELKKKLEA----------LKALVRRLQRRLTLVTKERDGLRAILNSYDKELTETSVSGQ 437
Query: 634 NSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKREQEEVSMMTSTQYLFNSY 693
+ + ++ E+++ + + E+ ++
Sbjct: 438 LMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKDRNNTLETEIKLLKE-------- 489
Query: 694 IGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQ 753
+L NE N L+ KI+ + + + + L + L
Sbjct: 490 --QLVSNERLLSFVKE----------ATNALRLKIETLERERDRLRQEKSLLEMKLEHLC 537
Query: 754 ELKPVYENIMK-LQDTDIPSLRSKLIELE--ENVIETKGELKKLKTALETPKTKEKTALS 810
+ K L + P+ ++ I + E + +LK+ ALE K++
Sbjct: 538 LQGDYNASRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQALEEGKSQPGDLEK 597
Query: 811 LQGDLTLLDQNIRELNTLQRELERQESKISGMR 843
G + +E+ L++++E E K ++
Sbjct: 598 AVGSH----ISSKEIAQLKKQVESAEKKNQRLK 626
Score = 33.4 bits (76), Expect = 1.2
Identities = 52/310 (16%), Positives = 116/310 (37%), Gaps = 13/310 (4%)
Query: 768 TDIPSLRSKLIELEENVIETKGELKKLKTALETPKTK---EKTALSLQGDLTLLDQNIRE 824
+ +L + E E + E K LK L+ K EK + ++ +
Sbjct: 114 VRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIK-ND 172
Query: 825 LNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKL 884
L+ +Q + ++++ + S +L + L + +++ E + S Q ++ K+
Sbjct: 173 LSEMQCRAQNADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKI 232
Query: 885 QSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLN 944
+ L+++ V+ L + + + EL + +L + ++ +L
Sbjct: 233 KHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALRE-ENRKLRSMKEDNELLKEELE 291
Query: 945 LAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESV 1004
QS L E +K+ E+ A ++ ++LE + +L T R V
Sbjct: 292 DLQSRL-----ERFEKMREKLADLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIV 346
Query: 1005 QKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELKL 1064
N E++ K + + Q L+ K +LE K+ + L ++
Sbjct: 347 VLQN---EELQLKEKNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRR 403
Query: 1065 SEIMISDLTK 1074
+ ++ +TK
Sbjct: 404 LQRRLTLVTK 413
>gnl|CDD|224047 COG1122, CbiO, ABC-type cobalt transport system, ATPase component
[Inorganic ion transport and metabolism].
Length = 235
Score = 41.9 bits (99), Expect = 0.001
Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 12/68 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
A+A + I LDEPT LD +K ++ +I++THD E +
Sbjct: 148 AIAGVLAMGPEILLLDEPTAGLDPKGRRELLELLKKLKEEGGKTIIIVTHDLELVLEY-- 205
Query: 1299 IDRAYVVR 1306
DR V+
Sbjct: 206 ADRVVVLD 213
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 40.0 bits (94), Expect = 0.001
Identities = 33/120 (27%), Positives = 52/120 (43%), Gaps = 13/120 (10%)
Query: 447 QAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQS-----------QSTL 495
+ E ++L E L K + +++I+ ++DL NE Q L
Sbjct: 2 EEEKSSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEEL 61
Query: 496 QVLQTKLNRVNSEIDQLSKSLDP--DQLKNEIEAWIRQRNELEDELCVIDAEISILQAQN 553
Q L+ +LN + EI QL + +L E+W Q+ LEDEL ++ I L QN
Sbjct: 62 QALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNEQN 121
Score = 29.2 bits (66), Expect = 7.1
Identities = 24/112 (21%), Positives = 50/112 (44%), Gaps = 4/112 (3%)
Query: 720 LVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTD---IPSLRSK 776
L +LK +++ + + + L +Q E + YE + D + +LR +
Sbjct: 8 LQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQ 67
Query: 777 LIELEENVIETKGELKKLKTALETPKTKEKTA-LSLQGDLTLLDQNIRELNT 827
L EL++ + + K E + + L + + L+ +L+ L++ I ELN
Sbjct: 68 LNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNE 119
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 41.2 bits (97), Expect = 0.002
Identities = 40/157 (25%), Positives = 68/157 (43%), Gaps = 18/157 (11%)
Query: 440 SDNENTKQA-EINALIVEKVELESKIKSF-------KQQIEGNKKDLTNVITQI----NE 487
N N K I L +EK LE +IK K ++E K L + ++ N+
Sbjct: 1 MMNNNLKSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQ 60
Query: 488 VNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEIS 547
V+Q +S +Q ++ ++ R ++ + + L EI+ + N LEDEL + EI
Sbjct: 61 VSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIE 120
Query: 548 ILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAF 584
L+ EI+ LK R E ++ + R +
Sbjct: 121 KLE------KEIEDLKERLERLEKNLAEAEARLEEEV 151
Score = 39.3 bits (92), Expect = 0.010
Identities = 22/129 (17%), Positives = 59/129 (45%), Gaps = 3/129 (2%)
Query: 423 TTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVI 482
I K L K + + E+ L + +LES+I+ +++I+ ++ L+ V
Sbjct: 27 KEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVK 86
Query: 483 TQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLD--PDQLKNEIEAWIRQRNELEDELC 540
+ E+ +Q+ + ++N + E+ +L + ++ ++++ E R L +
Sbjct: 87 DE-RELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEA 145
Query: 541 VIDAEISIL 549
++ E++ +
Sbjct: 146 RLEEEVAEI 154
>gnl|CDD|213246 cd03279, ABC_sbcCD, ATP-binding cassette domain of sbcCD. SbcCD
and other Mre11/Rad50 (MR) complexes are implicated in
the metabolism of DNA ends. They cleave ends sealed by
hairpin structures and are thought to play a role in
removing protein bound to DNA termini.
Length = 213
Score = 40.7 bits (96), Expect = 0.002
Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 5/45 (11%)
Query: 12 IRNF-PADKNRVVRFQ----RPLTLIVGENGCGKTTIIECIKFAL 51
++NF P + +V+ F L LI G G GK+TI++ I +AL
Sbjct: 8 LKNFGPFREEQVIDFTGLDNNGLFLICGPTGAGKSTILDAITYAL 52
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
chromosome partitioning].
Length = 420
Score = 42.0 bits (99), Expect = 0.002
Identities = 39/235 (16%), Positives = 86/235 (36%), Gaps = 30/235 (12%)
Query: 823 RELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNE 882
++L +Q+E+ E KI + ++ Q K + E+ + +++ L +
Sbjct: 38 KQLKQIQKEIAALEKKIREQQDQ---RAKLEKQLKSLETEIASLEAQLIETADDLKKLRK 94
Query: 883 KLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQ 942
++ L + N LE ++ E + L+ GR P
Sbjct: 95 QIADLNARLNA-----------------LEVQEREQRRRLAEQLAALQRSGRNPPPALLV 137
Query: 943 LNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRE 1002
L + +I L+++ ++ EI ++ LT L
Sbjct: 138 SPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIA--AEQAELTTL----L 191
Query: 1003 SVQKLNQRKEDI-IAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKKEAVA 1056
S Q+ Q K + +R ++T+ ++N ++ + +L+ N + L+ + A A
Sbjct: 192 SEQRAQQAKLAQLLEER---KKTLAQLNSELSADQKKLEELRANESRLKNEIASA 243
Score = 37.4 bits (87), Expect = 0.053
Identities = 36/214 (16%), Positives = 82/214 (38%), Gaps = 42/214 (19%)
Query: 861 NELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEG 920
+L + +I + + ++ ++ L+KQ LKSLE LE
Sbjct: 38 KQLKQIQKEIAALEKKIREQQDQRAKLEKQ-----------------LKSLETEIASLEA 80
Query: 921 MDSVYQTELEELGRKVAPIETQLNLAQSE-----------LDALKKEHKKKLNEEGAKIQ 969
+L++L +++A + +LN + + L AL++ + +
Sbjct: 81 QLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGR----NPPPALL 136
Query: 970 DYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEIN 1029
+ + R+ + ++ AL+ ++++L + +I A++ ++E
Sbjct: 137 VSPEDAQRSVRLAI-YYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQR 195
Query: 1030 QSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELK 1063
A + L E+K+ +A+LN EL
Sbjct: 196 AQQAKLAQ---------LLEERKKTLAQLNSELS 220
Score = 35.1 bits (81), Expect = 0.32
Identities = 45/270 (16%), Positives = 86/270 (31%), Gaps = 47/270 (17%)
Query: 469 QQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSL-----DPDQLKN 523
+Q++ +K++ ++ + Q L+ +L + +EI L L D +L+
Sbjct: 38 KQLKQIQKEIA---ALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRK 94
Query: 524 EIEAWIRQRNELEDELC--------VIDAEISILQAQNITLAEIKSLKNRKESKLADIN- 574
+I + N LE + + A +N A + S ++ + S I
Sbjct: 95 QIADLNARLNALEVQEREQRRRLAEQLAA--LQRSGRNPPPALLVSPEDAQRSVRLAIYY 152
Query: 575 --LLKERHDRAFHLLFD---------MIPEENFKNSLDKALSSITFDINRIQEDINAKEK 623
L R +R L I E + L LS ++ + + ++K
Sbjct: 153 GALNPARAERIDALKATLKQLAAVRAEIAAE--QAELTTLLSEQRAQQAKLAQLLEERKK 210
Query: 624 HLYTLEANVS---NSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKREQEE- 679
L L + +S + LR + L + E R E
Sbjct: 211 TLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEAK 270
Query: 680 -----------VSMMTSTQYLFNSYIGKLE 698
M+ S+ F + G+L
Sbjct: 271 RTGETYKPTAPEKMLISSTGGFGALRGQLA 300
Score = 31.6 bits (72), Expect = 3.5
Identities = 34/194 (17%), Positives = 75/194 (38%), Gaps = 29/194 (14%)
Query: 723 KLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEE 782
L+ KI+E +Q + + L + SL+ + +K I L ++L LE
Sbjct: 49 ALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEV 108
Query: 783 NVIETKGELKKLKTAL---------------ETPKTKEKTALSLQ----GDLTLLD---Q 820
E + L + AL E + + A+ +D
Sbjct: 109 QEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKA 168
Query: 821 NIRELNTLQRELERQESKISGMRSTGVD----LDQVLAQQKEKKNELNTFRSKIESGQTR 876
+++L ++ E+ ++++++ + S L Q+L ++K+ +LN S++ + Q +
Sbjct: 169 TLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLN---SELSADQKK 225
Query: 877 LNSHNEKLQSLQKQ 890
L L+ +
Sbjct: 226 LEELRANESRLKNE 239
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 41.9 bits (98), Expect = 0.003
Identities = 75/389 (19%), Positives = 163/389 (41%), Gaps = 46/389 (11%)
Query: 246 QLIYNNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQT-------KKT 298
+I + + L+ + I++K+ L EK R + S +Y+ K
Sbjct: 267 HIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQ 326
Query: 299 ER----DMIQESCNELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYI 354
E + ++ E IK L S + EL +L I+ ++ EL NQER +
Sbjct: 327 EWPGKLEKLKSEIELKEEEIKALQS-NIDELHKQLRKQGISTEQ--FELMNQERE--KLT 381
Query: 355 QEEKQSHTHINEAQMKLGKLERD-EETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEE 413
+E + + ++ + + + + K L TL+ + ++L + T ++SQ
Sbjct: 382 RELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLR-QYDSLIQNI---TRSRSQIGHNV 437
Query: 414 GEGLIKMSQTTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEG 473
+ +K++ + S I + E+N I E+++ E ++
Sbjct: 438 NDSSLKINIEQLFPKGSGINESIKKSIL-------ELNDEIQERIKTEENKSITLEEDIK 490
Query: 474 NKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRN 533
N K N +TQI E + + + + +L++ +E + +++ ++ ++L+ E+
Sbjct: 491 NLKHDINELTQILEKLELELSEANSKFELSKEENERELVAQRIEIEKLEKEL-----NDL 545
Query: 534 ELEDELCVIDAEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPE 593
L + ++DAE ++Q+ I L E+K NRK K+ H + H++
Sbjct: 546 NLLSKTSILDAE-QLVQSTEIKLDELKVDLNRKRYKI---------HKQVIHVIDITS-- 593
Query: 594 ENFKNSLDKALSSITFDINRIQEDINAKE 622
FK ++ +L + ++ ++ E++ E
Sbjct: 594 -KFKINIQSSLEDLENELGKVIEELRNLE 621
Score = 39.6 bits (92), Expect = 0.014
Identities = 50/271 (18%), Positives = 96/271 (35%), Gaps = 37/271 (13%)
Query: 412 EEGEGLIKMSQTTIDKYLSDIKILE---RTFSDNENTKQAEINA--------------LI 454
+ + L + Q K IK L R + N + +NA L
Sbjct: 278 TQNDNLYEKIQE-AMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLK 336
Query: 455 VEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSK 514
E E +IK+ + I+ K L + + L +L+++N + D+L+K
Sbjct: 337 SEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTK 396
Query: 515 SLDPDQLK-----NEIEAWIRQRNELEDELCVIDAEI------SILQAQNITLAEIKSLK 563
S+ +L+ +E +RQ + L + ++I S L+ L S
Sbjct: 397 SVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLFPKGSGI 456
Query: 564 NRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEK 623
N K I L + EEN +L++ + ++ DIN + + + E
Sbjct: 457 NESIKKS--ILELNDEIQERIK------TEENKSITLEEDIKNLKHDINELTQILEKLEL 508
Query: 624 HLYTLEANVSNSSKTLRDQKRTLAELMDRME 654
L + S + + ++++E
Sbjct: 509 ELSEANSKFELSKEENERELVAQRIEIEKLE 539
Score = 38.0 bits (88), Expect = 0.036
Identities = 71/324 (21%), Positives = 144/324 (44%), Gaps = 22/324 (6%)
Query: 751 SLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALS 810
S QELK +E + + +TDI +L+++ L E + E +K+KT E + + +
Sbjct: 254 SEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNK 313
Query: 811 LQGDLTLLDQNIRE----LNTLQRELERQESKISGMRSTGVDLDQVLAQQK---EKKNEL 863
+ + + Q +E L L+ E+E +E +I ++S +L + L +Q E+ +
Sbjct: 314 YENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELM 373
Query: 864 NTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDS 923
N R K+ ++N ++KL + S++L QG KSLE + + +
Sbjct: 374 NQEREKLTRELDKINIQSDKLTK------SVKSRKLEAQGIF---KSLEKTLRQYDSLIQ 424
Query: 924 VYQTELEELGRKVAPIETQLNLAQS--ELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRI 981
++G V ++N+ Q + + + KK + E +IQ+ K EE K I
Sbjct: 425 NITRSRSQIGHNVNDSSLKINIEQLFPKGSGINESIKKSILELNDEIQERIKT-EENKSI 483
Query: 982 KLEILNYTKRGTLTQLAALRESVQ-KLNQRKEDIIAKRGVCERTIN--EINQSIANQSLE 1038
LE + + +L + E ++ +L++ + ER + I + L
Sbjct: 484 TLEEDIKNLKHDINELTQILEKLELELSEANSKFELSKEENERELVAQRIEIEKLEKELN 543
Query: 1039 EIDLKNNLTLLEKKEAVAKLNEEL 1062
+++L + ++L+ ++ V +L
Sbjct: 544 DLNLLSKTSILDAEQLVQSTEIKL 567
Score = 33.0 bits (75), Expect = 1.5
Identities = 64/322 (19%), Positives = 136/322 (42%), Gaps = 50/322 (15%)
Query: 759 YENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLL 818
Y++ +KL+D PS + + E+ V ++ LKT + K + A+ + + L
Sbjct: 241 YKSFLKLEDNYEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTL 300
Query: 819 DQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLN 878
+ R L+ + + E+ ++ M+ + +E +L +S+IE +
Sbjct: 301 REKWRA---LKSDSNKYENYVNAMKQ----------KSQEWPGKLEKLKSEIELKE---- 343
Query: 879 SHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAP 938
E++++LQ +++H KQL QG S E E M+ E E+L R++
Sbjct: 344 ---EEIKALQSNIDELH-KQLRKQG-----ISTEQ----FELMNQ----EREKLTRELDK 386
Query: 939 IETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLA 998
I Q + + + K E + ++ Y ++ + R + +I + +
Sbjct: 387 INIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVND------S 440
Query: 999 ALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLE----KKEA 1054
+L+ ++++L + I +++I E+N I + E ++TL E K
Sbjct: 441 SLKINIEQLFPKGSGI---NESIKKSILELNDEIQERIKTE--ENKSITLEEDIKNLKHD 495
Query: 1055 VAKLNEEL-KLSEIMISDLTKY 1075
+ +L + L KL + +K+
Sbjct: 496 INELTQILEKLELELSEANSKF 517
>gnl|CDD|234033 TIGR02857, CydD, thiol reductant ABC exporter, CydD subunit. The
gene pair cydCD encodes an ABC-family transporter in
which each gene contains an N-terminal membrane-spanning
domain (pfam00664) and a C-terminal ATP-binding domain
(pfam00005). In E. coli these genes were discovered as
mutants which caused the terminal heme-copper oxidase
complex cytochrome bd to fail to assemble. Recent work
has shown that the transporter is involved in export of
redox-active thiol compounds such as cysteine and
glutathione. The linkage to assembly of the cytochrome bd
complex is further supported by the conserved operon
structure found outside the gammaproteobacteria (cydABCD)
containing both the transporter and oxidase genes
components. The genes used as the seed members for this
model are all either found in the gammproteobacterial
context or the CydABCD context. All members of this
family scoring above trusted at the time of its creation
were from genomes which encode a cytochrome bd complex.
Unfortunately, the gene symbol nomenclature adopted based
on this operon in B. subtilis assigns cydC to the third
gene in the operon where this gene is actually homologous
to the E. coli cydD gene. We have chosen to name all
homologs in this family in accordance with the precedence
of publication of the E. coli name, CydD.
Length = 529
Score = 41.9 bits (99), Expect = 0.003
Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 8/56 (14%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASD--------QKNFQLIVITHDEEFIENL 1296
ALA F R+ + LDEPT +LD + ++ + ++++TH
Sbjct: 468 ALARAFLRDAPLLLLDEPTAHLDAETEAEVLEALRALAQGRTVLLVTHRLALAALA 523
>gnl|CDD|227321 COG4988, CydD, ABC-type transport system involved in cytochrome bd
biosynthesis, ATPase and permease components [Energy
production and conversion / Posttranslational
modification, protein turnover, chaperones].
Length = 559
Score = 41.9 bits (99), Expect = 0.003
Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 11/65 (16%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNA-----SDQKNFQ---LIVITHDEEFIENLTAID 1300
ALA + LDEPT +LD + + Q+ + ++VITH E+ D
Sbjct: 466 ALARALLSPASLLLLDEPTAHLDAETEQIILQALQELAKQKTVLVITHRL---EDAADAD 522
Query: 1301 RAYVV 1305
R V+
Sbjct: 523 RIVVL 527
>gnl|CDD|213212 cd03245, ABCC_bacteriocin_exporters, ATP-binding cassette domain of
bacteriocin exporters, subfamily C. Many non-lantibiotic
bacteriocins of lactic acid bacteria are produced as
precursors which have N-terminal leader peptides that
share similarities in amino acid sequence and contain a
conserved processing site of two glycine residues in
positions -1 and -2. A dedicated ATP-binding cassette
(ABC) transporter is responsible for the proteolytic
cleavage of the leader peptides and subsequent
translocation of the bacteriocins across the cytoplasmic
membrane.
Length = 220
Score = 40.3 bits (95), Expect = 0.004
Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 18/75 (24%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASDQKNFQ----------LIVITHDEEFIENLTA 1298
ALA + I LDEPT+ +D S+++ + LI+ITH L
Sbjct: 150 ALARALLNDPPILLLDEPTSAMD--MNSEERLKERLRQLLGDKTLIIITHRPSL---LDL 204
Query: 1299 IDRAYVV---RIVRD 1310
+DR V+ RIV D
Sbjct: 205 VDRIIVMDSGRIVAD 219
>gnl|CDD|224045 COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport
systems, ATPase components [Inorganic ion transport and
metabolism / Coenzyme metabolism].
Length = 258
Score = 40.6 bits (96), Expect = 0.004
Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 14/52 (26%)
Query: 1248 KALAETFSRNCGIFALDEPTTNLDIKNA----------SDQKNFQLIVITHD 1289
+ALA + I LDEPT++LDI + + +K ++++ HD
Sbjct: 151 RALA----QETPILLLDEPTSHLDIAHQIEVLELLRDLNREKGLTVVMVLHD 198
Score = 32.5 bits (75), Expect = 1.2
Identities = 9/19 (47%), Positives = 16/19 (84%)
Query: 29 LTLIVGENGCGKTTIIECI 47
+T I+G NG GK+T+++C+
Sbjct: 30 ITGILGPNGSGKSTLLKCL 48
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 40.7 bits (96), Expect = 0.004
Identities = 36/160 (22%), Positives = 69/160 (43%), Gaps = 31/160 (19%)
Query: 823 RELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNE 882
RE++ L+R S+I+ +L +L+ +++ +L + + + + L++
Sbjct: 46 REISGKDSALDRLNSQIA-------ELADLLSLERQGNQDL---QDSVANLRASLSAAEA 95
Query: 883 KLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEG-MDSVYQT------ELEELGRK 935
+ LQ AG + E R EL +DS Q ++E L ++
Sbjct: 96 ERSRLQALLA----------ELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQ 145
Query: 936 VAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQL 975
+A + QL ++ LDA +K + E AKI D ++L
Sbjct: 146 IAALRRQLAALEAALDASEKRDR----ESQAKIADLGRRL 181
>gnl|CDD|205733 pfam13555, AAA_29, P-loop containing region of AAA domain.
Length = 60
Score = 36.4 bits (85), Expect = 0.005
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 20 NRVVRFQRPLTLIVGENGCGKTTIIECIKFALT 52
+ TL+ G +G GK+T+I+ I+ L
Sbjct: 14 GHTIPIDPKGTLLTGPSGSGKSTLIDAIQTLLV 46
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 41.4 bits (97), Expect = 0.005
Identities = 119/651 (18%), Positives = 250/651 (38%), Gaps = 73/651 (11%)
Query: 342 ELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCL 401
E E + L++ + QE+ ++ +KL K++ D ++K L D L+ + L + +
Sbjct: 1988 EYEELKDLRNSFNQEKAETLN-----NLKLNKIKEDFNSYKNLLDELEKSVKTLKASENI 2042
Query: 402 DTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSD-NENTKQAEINALIVEKVEL 460
+++ T I T I+ +I+ + + + + + EI+ +
Sbjct: 2043 KKIVENKKTS------IDAINTNIEDIEKEIESINPSLDELLKKGHKIEISRYTSIIDNV 2096
Query: 461 ESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQ 520
++KI + + I +K +Q L ++ N + +I L++ D Q
Sbjct: 2097 QTKISNDSKNINDIEKK-------------AQIYLAYIKNNYNSIKKDISTLNEYFDEKQ 2143
Query: 521 LKNEIEAWIRQRNELEDELC--VIDAE--ISILQAQNITLAE---IKSLKNRKESKLADI 573
+ N I I + N+L EL V ++E I ++ + I + E + +L+N +
Sbjct: 2144 VSNYILTNIDKANKLSSELSEAVTNSEEIIENIKKEIIEINENTEMNTLENTADKLKELY 2203
Query: 574 NLLKERHD--RAFHLLFDMIPEENFKNSLDK-----------------ALSSITFDINRI 614
LK++ + + + I + +NS +K L IN I
Sbjct: 2204 ENLKKKKNIINNIYKKINFIKLQEIENSSEKYNDISKLFNNVVETQKKKLLDNKNKINNI 2263
Query: 615 QEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELK 674
++ IN KEK L +SS TL + +T E+ D ++ +G D ELK
Sbjct: 2264 KDKINDKEKEL-----INVDSSFTL-ESIKTFNEIYDDIKSNIGD--LYKLEDTNNDELK 2315
Query: 675 REQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQ 734
+ + + +T N+ I L+ + + E + + K K KI + E+
Sbjct: 2316 KVKLYIENITHLLNRINTLINDLDNYQDENYGKDKNIELNNENNSYIIKTKEKINNLKEE 2375
Query: 735 TNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKL 794
+ +I + + +K NI K +T I S + +E + + + L ++
Sbjct: 2376 FSKLLKNIKR-NNTLCNNNNIKDFISNIGKSVET-IKQRFSSNLPEKEKLHQIEENLNEI 2433
Query: 795 KTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLA 854
K + K L +D + N KI + ++ V +
Sbjct: 2434 KNIMNETKRISNVDAFTNKILQDIDNEKNKENNN-----MNAEKIDDL------IENVTS 2482
Query: 855 QQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLT-VQGGAGMLKSLED 913
++ K+EL + + + + N+ SL + N+ ++ V ++ LE
Sbjct: 2483 HNEKIKSELLIINDALRRVKEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELES 2542
Query: 914 RKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEE 964
+L + S E++EL + + + + +S + + E + + +
Sbjct: 2543 HVSKLNELLSYIDNEIKELENEKLKLLEKAKIEESRKERERIESETQEDNT 2593
Score = 40.2 bits (94), Expect = 0.011
Identities = 159/921 (17%), Positives = 346/921 (37%), Gaps = 120/921 (13%)
Query: 214 KIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNIEESMKPIN 273
++++D L+K+ E HY ++K + L +K + + +K
Sbjct: 336 QVKKDELKKKYFESAKHYA---SFKFSLEMLSMLDSLLIKKEKILNNLFNKLFGDLKEKI 392
Query: 274 EKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDKAELQSKLNLFK 333
E L+ + ++Y + + ++ ++ Q + ++ K
Sbjct: 393 ETLL-------DSEYFISKYTNIISLSEHTLKAAEDVLKENSQKIADYALYSNLEIIEIK 445
Query: 334 INLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLER----DEETHKKLN---- 385
DEK +EL+ I K + I + + E+ D +K++
Sbjct: 446 KKYDEKINELKKSINQLKTLISIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQ 505
Query: 386 --DTLKTKLNNLADTLCLDTTAKSQYTPEEG-----EGLIKMSQTTIDKYLSDIKILERT 438
+++K K N + + + Y EG EGLI++ I YL I+ L
Sbjct: 506 IINSIKEKNNIVNNNF---KNIEDYYITIEGLKNEIEGLIEL----IKYYLQSIETLI-- 556
Query: 439 FSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQ----SQST 494
+E K++ N + + +E + K I N ++ N+I QI E+ ++
Sbjct: 557 --KDEKLKRSMKNDIKNKIKYIEENVDHIKDIISLN-DEIDNIIQQIEELINEALFNKEK 613
Query: 495 LQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNI 554
+ L I D +L +E+ ++ L E ++ + N
Sbjct: 614 FINEKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHKYLYHE---AKSKEDLQTLLNT 670
Query: 555 TLAEIKSLKNRKESKLADI-NLLKERHDRAFHLLFDMIPE------ENFKNSLDKALSSI 607
+ E + L+ K + +I LK+ L ++I + ++ NSL++
Sbjct: 671 SKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKKQLNNIEQDISNSLNQYTIKY 730
Query: 608 TFDINRIQEDINAKEKHLYTLEANVSNSSK----TLRDQKRTLAELMDRMELVLGSKPFE 663
D+ E+ +E+ L + + N L + + L + + E L K
Sbjct: 731 N-DLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDGKNTYEEFLQYKDTI 789
Query: 664 DELDR-VTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVN 722
+ ++ ++ +E +N I KLE + ++D + L+
Sbjct: 790 LNKENKISNDINILKENKKNNQDLLNSYNILIQKLEAHTE---------KNDEELKQLLQ 840
Query: 723 KLKT-----KIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKL 777
K T +KE+ ++ N +D + K ++ + I L S +K
Sbjct: 841 KFPTEDENLNLKELEKEFNENNQIVDNIIKD---IENMNKNINIIKTLNIAINRSNSNKQ 897
Query: 778 IELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQES 837
L E+++ K +LK K + L LL+ +E ++++L ++
Sbjct: 898 --LVEHLLNNKIDLKN-KLEQHMKIINTDNIIQKNEKLNLLNNLNKEKEKIEKQLS--DT 952
Query: 838 KISGMR---------------STGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNE 882
KI+ ++ + + L + ++K+E F+S+I+ N N+
Sbjct: 953 KINNLKMQIEKTLEYYDKSKENINGNDGTHLEKLDKEKDEWEHFKSEIDKLNVNYNILNK 1012
Query: 883 KLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQ 942
K+ L K+++D + K ++++ E+E Y + LE++ K++
Sbjct: 1013 KIDDLIKKQHDDII--------ELIDKLIKEKGKEIEEKVDQYISLLEKMKTKLSSFHFN 1064
Query: 943 LNLAQSELDALKKEHKKKLNEEGA----KIQDYTKQLEEVKRIKLEILNYTKRGTLTQLA 998
+++ + + + KE K L E+ KI + +L E+K E + +
Sbjct: 1065 IDIKKYK-NPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADK------- 1116
Query: 999 ALRESVQKLNQRKEDIIAKRGVCERTINEIN----QSIANQSLEEIDLKNNLTLLEKKEA 1054
+ + N++K+ + E+T+ E+ + I + EI+++ L++
Sbjct: 1117 EKNKQTEHYNKKKKSLEKIYKQMEKTLKELENMNLEDITLNEVNEIEIEYERILID--HI 1174
Query: 1055 VAKLNEELKLSEIMISDLTKY 1075
V ++N E K S+ ++ ++ Y
Sbjct: 1175 VEQINNEAKKSKTIMEEIESY 1195
Score = 37.5 bits (87), Expect = 0.075
Identities = 154/847 (18%), Positives = 336/847 (39%), Gaps = 91/847 (10%)
Query: 191 KKVKEIFDEIFDATKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYN 250
K ++E D I D N +++I Q + L E K + ++ D ++++ Y
Sbjct: 574 KYIEENVDHIKDIISLNDEIDNIIQQIEELINEALFNKEKFI-----NEKNDLQEKVKYI 628
Query: 251 NTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQE-SCNE 309
+ +EL + ++ L + + ++ + T K E + ++ +
Sbjct: 629 LNKFYKGDLQELLDELSHFLDDHKYLYHEAKSKEDLQTLLN---TSKNEYEKLEFMKSDN 685
Query: 310 LESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKS--QYIQEEKQSHTHINEA 367
+++ IK L K ELQ+ L+L K N+ +K Q+ S QY + + I E
Sbjct: 686 IDNIIKNL----KKELQNLLSL-KENIIKKQLNNIEQDISNSLNQYTIKYNDLKSSIEEY 740
Query: 368 QMKLGKLERDEETHKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTT--- 424
+ + KLE + + L L ++ + L K ++ +
Sbjct: 741 KEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDGKNTYEEFLQYKDTILNKENKISNDI 800
Query: 425 ---------IDKYLSDIKILERTFSDNENTKQAEINAL---IVEKVELESKIKSFKQQIE 472
L+ IL + + E+ L + E +K +++
Sbjct: 801 NILKENKKNNQDLLNSYNILIQKLEAHTEKNDEELKQLLQKFPTEDE-NLNLKELEKEFN 859
Query: 473 GNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSE---IDQLSKSLDPDQLKNEIEAWI 529
N + + N+I I +N++ + ++ L +NR NS ++ L + LKN++E +
Sbjct: 860 ENNQIVDNIIKDIENMNKNINIIKTLNIAINRSNSNKQLVEHLLN--NKIDLKNKLEQHM 917
Query: 530 RQRNELEDELCVIDAEISILQAQNITLAEI-KSLKNRKESKL-ADINLLKERHDRAF--- 584
+ N D + + ++++L N +I K L + K + L I E +D++
Sbjct: 918 KIIN--TDNIIQKNEKLNLLNNLNKEKEKIEKQLSDTKINNLKMQIEKTLEYYDKSKENI 975
Query: 585 ---------HLLFDMIPEENFKNSLDKALSSITFDINRIQEDINA-KEKHLYTLEANVSN 634
L + E+FK+ +DK + +I + I + + ++ +
Sbjct: 976 NGNDGTHLEKLDKEKDEWEHFKSEIDKLNVNYNILNKKIDDLIKKQHDDIIELIDKLIKE 1035
Query: 635 SSKTLRDQKRTLAELMDRMELVLGSKPFEDEL-----DRVTLELKREQEEVSMMTSTQYL 689
K + ++ L+++M+ L S F ++ ++ E+K +E+V +
Sbjct: 1036 KGKEIEEKVDQYISLLEKMKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEAL------ 1089
Query: 690 FNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQ 749
+ K++EN+ + E V + + E N KK ++++ KQ
Sbjct: 1090 ----LKKIDENK------NKLIEIKNKSHEHVVNADKEKNKQTEHYNKKKKSLEKIYKQM 1139
Query: 750 RS-LQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTA 808
L+EL+ + + L + + + + I ++ V + E KK KT +E ++ +K
Sbjct: 1140 EKTLKELENMNLEDITLNEVNEIEIEYERILIDHIVEQINNEAKKSKTIMEEIESYKKDI 1199
Query: 809 LSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRS 868
++ +++ + L T + ++ S + + A+ + + +T
Sbjct: 1200 DQVKKNMS--KERNDHLTTFEYNAYYDKATASYENIEELTTE---AKGLKGEANRSTNVD 1254
Query: 869 KIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTE 928
+++ + ++ S+ LQ + K+ N + + ++ L S++ K E ++S + E
Sbjct: 1255 ELKEIKLQVFSY---LQQVIKENNKMENALHEIKNMYEFLISIDSEKILKEILNSTKKAE 1311
Query: 929 LEELGRKVAPIETQLNLAQSELDALK-KEHKKKL--NEEGAKIQDYTKQLEEVKR----I 981
K +T + Q E + KEHK K+ + E +I D K++E++K
Sbjct: 1312 EFSNDAKKELEKTDNLIKQVEAKIEQAKEHKNKIYGSLEDKQIDDEIKKIEQIKEEISNK 1371
Query: 982 KLEILNY 988
+ EI Y
Sbjct: 1372 RKEINKY 1378
>gnl|CDD|213196 cd03229, ABC_Class3, ATP-binding cassette domain of the binding
protein-dependent transport systems. This class is
comprised of all BPD (Binding Protein Dependent) systems
that are largely represented in archaea and eubacteria
and are primarily involved in scavenging solutes from the
environment. ABC transporters are a large family of
proteins involved in the transport of a wide variety of
different compounds, like sugars, ions, peptides, and
more complex organic molecules. The nucleotide binding
domain shows the highest similarity between all members
of the family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 178
Score = 38.7 bits (91), Expect = 0.007
Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 12/68 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
ALA + + + LDEPT+ LD +K+ Q ++++THD + L
Sbjct: 110 ALARALAMDPDVLLLDEPTSALDPITRREVRALLKSLQAQLGITVVLVTHDLDEAARLA- 168
Query: 1299 IDRAYVVR 1306
DR V+R
Sbjct: 169 -DRVVVLR 175
Score = 31.8 bits (73), Expect = 1.6
Identities = 7/18 (38%), Positives = 13/18 (72%)
Query: 30 TLIVGENGCGKTTIIECI 47
++G +G GK+T++ CI
Sbjct: 29 VALLGPSGSGKSTLLRCI 46
>gnl|CDD|226425 COG3910, COG3910, Predicted ATPase [General function prediction
only].
Length = 233
Score = 39.0 bits (91), Expect = 0.011
Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 14 NFPADKNRVVR--FQRPLTLIVGENGCGKTTIIECI 47
+ PA ++ R F+ P+T I GENG GK+T++E I
Sbjct: 22 SLPAFRHLEERLEFRAPITFITGENGSGKSTLLEAI 57
>gnl|CDD|184491 PRK14079, recF, recombination protein F; Provisional.
Length = 349
Score = 39.4 bits (92), Expect = 0.012
Identities = 19/53 (35%), Positives = 27/53 (50%)
Query: 6 QLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQG 58
+L + N+ + F +T +VGEN GKT ++E I ALT E P G
Sbjct: 2 RLLSLRQLNYRNLAPPTLAFPPGVTAVVGENAAGKTNLLEAIYLALTGELPNG 54
>gnl|CDD|203007 pfam04423, Rad50_zn_hook, Rad50 zinc hook motif. The Mre11 complex
(Mre11 Rad50 Nbs1) is central to chromosomal maintenance
and functions in homologous recombination, telomere
maintenance and sister chromatid association. The Rad50
coiled-coil region contains a dimer interface at the
apex of the coiled coils in which pairs of conserved
Cys-X-X-Cys motifs form interlocking hooks that bind one
Zn ion. This alignment includes the zinc hook motif and
a short stretch of coiled-coil on either side.
Length = 54
Score = 34.6 bits (80), Expect = 0.019
Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 682 MMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTN 736
+ S +N I +L+E + CP+C R + + L+ +LK++++ +PE+
Sbjct: 1 ELKSETEEYNKAIEELKEAKGCCPVCGRPLDEE-HKQELIKELKSELRRLPEELK 54
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of the
C-terminal region.
Length = 546
Score = 38.8 bits (90), Expect = 0.020
Identities = 71/310 (22%), Positives = 123/310 (39%), Gaps = 39/310 (12%)
Query: 817 LLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTR 876
L+ RE N L ++ E +S MRS L+ L EK EL + + Q+
Sbjct: 144 QLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEEL---EEQHKEAQSS 200
Query: 877 LNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKV 936
S + + +L Q+ + + + LE +++ + Q L
Sbjct: 201 SESMSAERNALLAQRAENQQR----------ILELEQ---DIQTLTQKKQENDRVLEGTQ 247
Query: 937 APIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQ 996
++EL+ +K E K++L + + +D EE +RI L++ N L
Sbjct: 248 D--------IEAELERMKGELKQRLKKMTIQRRD-----EETERIDLQLENEQLHEDLRT 294
Query: 997 LAALRE-SVQKLNQRKEDIIAKRGVCERTINEINQS---IANQSLEEIDLKNNLTLLEKK 1052
L E S QK E++ + + + TI E++QS A SL+ L NL L E +
Sbjct: 295 LQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQMSLQLSQL--NLALKEGQ 352
Query: 1053 EAVAK----LNEELKLSEIMISDLTKYHHTLENCVIKYHSQKMRSINRLIREYWTRIYQL 1108
A+ L + + + I L+ LE + + SQ+ + L E QL
Sbjct: 353 SQWAQERETLRQSAEADKDRIQKLSAELLKLEEWLQEERSQREKLEVELGIEKDCNRVQL 412
Query: 1109 KLSEIMISDL 1118
+ +S+L
Sbjct: 413 SENRRELSEL 422
Score = 33.4 bits (76), Expect = 0.97
Identities = 54/294 (18%), Positives = 117/294 (39%), Gaps = 32/294 (10%)
Query: 614 IQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPF----EDELDRV 669
++ D++ + LE + +S++ + + E E + + E +
Sbjct: 162 LEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQR 221
Query: 670 TLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTK-- 727
LEL ++++ +T + + + ++ E E + L +LK
Sbjct: 222 ILEL---EQDIQTLTQKKQENDRVLEGTQDIE---------AELERMKGELKQRLKKMTI 269
Query: 728 -IKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYEN---IMKLQDTDIPSLRSKLI-ELEE 782
++ + + + +QL + R+LQE + ++ + + SLR I EL +
Sbjct: 270 QRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQ 329
Query: 783 NVIET---KGELKKLKTALETPK---TKEKTAL--SLQGDLTLLDQNIRELNTLQRELER 834
+ +E+ +L +L AL+ + +E+ L S + D + + EL L+ L+
Sbjct: 330 SRLESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKLSAELLKLEEWLQE 389
Query: 835 QESKISGMRST-GVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSL 887
+ S+ + G++ D Q E + EL+ RS + Q E+ Q L
Sbjct: 390 ERSQREKLEVELGIEKDCNRVQLSENRRELSELRSALRVLQKEKEQLQEEKQEL 443
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular medium
across both inner and outer membranes. The secretion
signal is found in the C-terminus of the transported
protein. This model represents the adaptor protein
between the ATP-binding cassette (ABC) protein of the
inner membrane and the outer membrane protein, and is
called the membrane fusion protein. This model selects a
subfamily closely related to HlyD; it is defined narrowly
and excludes, for example, colicin V secretion protein
CvaA and multidrug efflux proteins [Protein fate, Protein
and peptide secretion and trafficking].
Length = 423
Score = 38.5 bits (90), Expect = 0.023
Identities = 20/120 (16%), Positives = 48/120 (40%), Gaps = 10/120 (8%)
Query: 917 ELEGMDS---VYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTK 973
+LE + + + EL L ++ + QL + EL+A +K +K L + +
Sbjct: 145 QLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVSR----LELLE 200
Query: 974 QLEEVKRIKLEILNYTKR--GTLTQLAALRESVQKL-NQRKEDIIAKRGVCERTINEINQ 1030
E + E+ Q+ L+ Q++ +E+++ + + + E+ +
Sbjct: 201 LERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEVLEELTEAQARLAELRE 260
Score = 37.3 bits (87), Expect = 0.066
Identities = 36/163 (22%), Positives = 63/163 (38%), Gaps = 21/163 (12%)
Query: 752 LQELKPVYENIMKLQDTDIPS----LRSKLIELEENVIETKGELKKLKTALETPKT---- 803
L P ++K Q + S LR++L + + + + EL L+ L+ +
Sbjct: 117 LSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEV 176
Query: 804 --KEKTALS------LQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQ 855
+E A L L LL+ RE Q EL R E+++ ++ +L Q
Sbjct: 177 ISEELEARRKLKEKGLVSRLELLELE-RERAEAQGELGRLEAELEVLKRQIDELQLERQQ 235
Query: 856 QKEKKNELNTFRSKIESGQTRLNSHNEKLQSL--QKQKNDIHS 896
++ E ++ Q RL E+L + Q+ I S
Sbjct: 236 IEQTFRE--EVLEELTEAQARLAELRERLNKARDRLQRLIIRS 276
Score = 31.9 bits (73), Expect = 2.5
Identities = 26/134 (19%), Positives = 53/134 (39%), Gaps = 24/134 (17%)
Query: 412 EEGEGLIKMSQTTIDKYLSDIKILE---RTFSDNENTKQAEIN--ALIVEKVELESKIKS 466
+ G+ L+++ T + +D LE +AE + A I +L S
Sbjct: 66 KAGQVLVELDATDV---EADAAELESQVLRLEAEVARLRAEADSQAAIEFPDDLLSAEDP 122
Query: 467 -FKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEI 525
+ I+G + + +S L+ +L + ++I QL L L+ ++
Sbjct: 123 AVPELIKG-------------QQSLFESRKSTLRAQLELILAQIKQLEAEL--AGLQAQL 167
Query: 526 EAWIRQRNELEDEL 539
+A +Q + +EL
Sbjct: 168 QALRQQLEVISEEL 181
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 38.7 bits (91), Expect = 0.029
Identities = 61/333 (18%), Positives = 121/333 (36%), Gaps = 86/333 (25%)
Query: 727 KIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIE 786
+ + K L + L I + Q + L+ +L + + +
Sbjct: 47 ALNKQKLLEAEDKLVQQDLEQTLALLD-------KIDR-QKEETEQLKQQLAQAPAKLRQ 98
Query: 787 TKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTG 846
+ EL+ LK + + + LSL R+LE +
Sbjct: 99 AQAELEALKDDNDEETRETLSTLSL------------------RQLESR----------- 129
Query: 847 VDLDQVLAQQKEKKNELNTFRSKIESGQTR-------LNSHNEKLQSLQKQKNDIHSKQL 899
L Q L Q + +N+L + S++ S QT+ L +++++LQ ++
Sbjct: 130 --LAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKG------ 181
Query: 900 TVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSEL---DALKKE 956
GG + S Q E A + Q +L + L L+
Sbjct: 182 GKVGGKALRPSQRVL----------LQAEQ-------ALLNAQNDLQRKSLEGNTQLQDL 224
Query: 957 HKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLT-QLAALRESVQKLNQRKED-I 1014
+K+ + A+IQ +LE ++ E +N +KR TL+ + +S + + + + +
Sbjct: 225 LQKQRDYLTARIQ----RLEHQLQLLQEAIN-SKRLTLSEKTVQEAQSQDEAARIQANPL 279
Query: 1015 IAKRGVCERTINEINQSIANQSLEEIDLKNNLT 1047
+A+ EIN ++ + L+ + N LT
Sbjct: 280 VAQE-------LEINLQLSQRLLKATEKLNTLT 305
Score = 33.7 bits (78), Expect = 0.89
Identities = 47/249 (18%), Positives = 88/249 (35%), Gaps = 48/249 (19%)
Query: 341 SELENQ-ERLKSQYIQEEKQ--SHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLAD 397
++++ Q + L Q + E + + + L K++R K+ + LK +L
Sbjct: 39 ADVQAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDR----QKEETEQLKQQLAQAPA 94
Query: 398 TLCLDTTA----KSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNENTKQAEINAL 453
L K E E L +S ++ L+ D Q ++
Sbjct: 95 KLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQT-------LDQLQNAQNDLA-- 145
Query: 454 IVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQ------------SQSTLQVLQTK 501
E S++ S + Q E + L ++ ++ S +LQ +
Sbjct: 146 -----EYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAE 200
Query: 502 LNRVNSEIDQLSKSL-DPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIK 560
+N++ D KSL QL++ ++ QR+ L A I L+ Q L E
Sbjct: 201 QALLNAQNDLQRKSLEGNTQLQDLLQK---QRDYLT-------ARIQRLEHQLQLLQEAI 250
Query: 561 SLKNRKESK 569
+ K S+
Sbjct: 251 NSKRLTLSE 259
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 37.6 bits (88), Expect = 0.037
Identities = 26/122 (21%), Positives = 61/122 (50%), Gaps = 3/122 (2%)
Query: 942 QLNLAQSELDALKKEHKKKLNEEGAKIQD-YTKQLEEVKRIKLEILNYTKRGTLTQLAAL 1000
L EL+A +E ++K + E +++ Y +++ E+++ +L+ L + ++ L
Sbjct: 22 FLEQQNKELEAKIEELRQKKSAEPSRLYSLYEQEIRELRK-QLDELTNERARLQLEIDNL 80
Query: 1001 RESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNL-TLLEKKEAVAKLN 1059
R + + ++ ED + R E I + + + +L +DL+ + +L E+ + K +
Sbjct: 81 RLAAEDFREKYEDELNLRQSAENDIVGLRKDLDEATLARVDLEMKIESLQEELAFLKKNH 140
Query: 1060 EE 1061
EE
Sbjct: 141 EE 142
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 38.2 bits (89), Expect = 0.043
Identities = 66/327 (20%), Positives = 123/327 (37%), Gaps = 29/327 (8%)
Query: 774 RSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELE 833
++L+EL ++E + K +L+ + AL D+ L Q +
Sbjct: 661 LAELLELARQLLEEAEKQAARKASLDERLRDAERALEEA--EERHDEAQSALEAWQEQWY 718
Query: 834 R--QESKISGMRSTGVDLDQV--LAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQK 889
E+ + G S LD + L KEK + R +I + + L E++++L +
Sbjct: 719 DALLEAGLGGRASPAGALDALELLQNIKEKLQAADDLRQRIAAMERDLARFEEEVEALAE 778
Query: 890 QKNDI---HSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLA 946
T + LK D E + E+EE ++V+ L+ A
Sbjct: 779 AVAPEMLGTPADETARALKQRLKRARDTAAAAEKLAE----EIEEAEKEVSEAAAALDEA 834
Query: 947 QSELDALKKEHKKKLNEEG-AKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESV- 1004
++ L AL + + EE A ++ E KRI + G L AL
Sbjct: 835 EARLTALLRAARCTTIEELLAAVERSDTYRELRKRIAALERTLVRAGGGLSLEALVAEAA 894
Query: 1005 -QKLNQRKEDI--IAKRGV-CERTINEINQSI--ANQSLEEIDLKNNLTLL--EKKEAVA 1056
++ + +A+ E +NE+ Q + A Q L +D + L E++ +A
Sbjct: 895 ALDPDELPARLEELARDIEELEEELNELAQEVGAAKQELARMDGGSTAAELEAERESLLA 954
Query: 1057 KLNE------ELKLSEIMISDLTKYHH 1077
+L + EL L+ ++ + +
Sbjct: 955 QLRDLAERYLELALASRLLRKAIERYR 981
Score = 36.3 bits (84), Expect = 0.16
Identities = 48/282 (17%), Positives = 103/282 (36%), Gaps = 36/282 (12%)
Query: 728 IKEIPEQTNNKKTHIDQLCKQQRS-LQELKPVYENIMKLQDTDI-PSLRSKLI-ELEENV 784
+ E+ + + Q + + L V + + K D P R I + +
Sbjct: 100 LVRGGEEILEAQGELGQYLFSAAAGVGSLGSVRDELEKEADELWKPRGRKPEINVALKEL 159
Query: 785 IETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRS 844
E + E+++++ T K K + +L L + +R+L ++ LER + +
Sbjct: 160 KELEAEIREVQLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQRLERLRRLLPLLAE 219
Query: 845 TGVDLDQVLAQQKEKKNELNTF-----------RSKIESGQTRLNSHNEKLQSLQKQKND 893
+ L QQ E+ R+++ + + L E+L++LQ + ++
Sbjct: 220 R-----KALEQQLAALGEVIDLPPDAVERYEEARAELRAARRNLELLTERLEALQAELDE 274
Query: 894 IH-SKQLTVQGGAGMLKSLEDRKCE----------LEGMDSVYQTELEELGRKVAPIETQ 942
I ++L Q A +++L ++ E EG + + L ++ P +
Sbjct: 275 ISLDEELLAQ--AAAIEALHQQRGEYRNAEQDLPDREGEIANAREAAAALLAQIGPDADE 332
Query: 943 LNLAQSELDALKKEHKKKLNEEGAKIQDY----TKQLEEVKR 980
+ KE +L + + LEE +R
Sbjct: 333 EAVESLRPSLAAKETVTELEKRKEALDQALKSARDALEERER 374
Score = 32.0 bits (73), Expect = 3.3
Identities = 49/248 (19%), Positives = 92/248 (37%), Gaps = 33/248 (13%)
Query: 449 EINALIVEKVELESKIKS----------FKQQIEGNKKDLTNVITQINEVNQSQSTLQVL 498
EIN + E ELE++I+ + ++ +++L N+ ++ ++ + + L+ L
Sbjct: 151 EINVALKELKELEAEIREVQLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQRLERL 210
Query: 499 QTKLNRVNSEIDQLSKSLD-----PDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQN 553
+ L + E L + L D + +E + R EL ++ L+A
Sbjct: 211 RRLLPLLA-ERKALEQQLAALGEVIDLPPDAVERYEEARAELRAARRNLELLTERLEALQ 269
Query: 554 ITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINR 613
L EI SL ++ A I L ++ + D+ +I
Sbjct: 270 AELDEI-SLDEELLAQAAAIEALHQQRGEYRNAEQDL--------------PDREGEIAN 314
Query: 614 IQEDINAKEKHL-YTLEANVSNSSKTLRDQKRTLAELMDRME-LVLGSKPFEDELDRVTL 671
+E A + + S + K T+ EL R E L K D L+
Sbjct: 315 AREAAAALLAQIGPDADEEAVESLRPSLAAKETVTELEKRKEALDQALKSARDALEERER 374
Query: 672 ELKREQEE 679
ELK+ + +
Sbjct: 375 ELKQVRAQ 382
Score = 30.9 bits (70), Expect = 6.8
Identities = 15/88 (17%), Positives = 40/88 (45%), Gaps = 6/88 (6%)
Query: 948 SELDALKKEHK---KKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESV 1004
E++ KE K ++ E K + + ++ + + E+ N K L QL ++ +
Sbjct: 150 PEINVALKELKELEAEIREVQLKTRTWKDLVKALDEAEEELANLRKE--LRQLEKEKQRL 207
Query: 1005 QKLNQRKEDIIAKRGVCERTINEINQSI 1032
++L + + ++ + E+ + + + I
Sbjct: 208 ERLRRLLPLLAERKAL-EQQLAALGEVI 234
Score = 30.5 bits (69), Expect = 9.0
Identities = 42/239 (17%), Positives = 82/239 (34%), Gaps = 32/239 (13%)
Query: 722 NKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYE--NIMKLQDTDIPSLRSKLIE 779
+L+ +I+E+ +T K + L + + L L+ K + +L
Sbjct: 160 KELEAEIREVQLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQR-------LERLRR 212
Query: 780 LEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKI 839
L + E K ++L E + + EL +R LE ++
Sbjct: 213 LLPLLAERKALEQQLAALGEVIDLPPDAVERYE-------EARAELRAARRNLELLTERL 265
Query: 840 SGMRS--TGVDLDQVLAQQKEKKNELNTFRSKIESGQTRL-------NSHNEKLQSLQKQ 890
+++ + LD+ L Q L+ R + + + L + E +L Q
Sbjct: 266 EALQAELDEISLDEELLAQAAAIEALHQQRGEYRNAEQDLPDREGEIANAREAAAALLAQ 325
Query: 891 ------KNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQL 943
+ + S + + LE RK L+ + LEE R++ + QL
Sbjct: 326 IGPDADEEAVESLR-PSLAAKETVTELEKRKEALDQALKSARDALEERERELKQVRAQL 383
>gnl|CDD|213222 cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the
transporters involved in export of lipoprotein and
macrolide, and cell division protein. This family is
comprised of MJ0796 ATP-binding cassette,
macrolide-specific ABC-type efflux carrier (MacAB), and
proteins involved in cell division (FtsE), and release of
lipoproteins from the cytoplasmic membrane (LolCDE). They
are clustered together phylogenetically. MacAB is an
exporter that confers resistance to macrolides, while the
LolCDE system is not a transporter at all. An FtsE null
mutants showed filamentous growth and appeared viable on
high salt medium only, indicating a role for FtsE in cell
division and/or salt transport. The LolCDE complex
catalyzes the release of lipoproteins from the
cytoplasmic membrane prior to their targeting to the
outer membrane.
Length = 218
Score = 37.1 bits (87), Expect = 0.044
Identities = 20/70 (28%), Positives = 27/70 (38%), Gaps = 23/70 (32%)
Query: 1260 IFALDEPTTNLDIKNAS----------DQKNFQLIVITHDEEFIENLTAIDRAYVVRIVR 1309
I A DEPT NLD + + ++V+THD E E Y RI+
Sbjct: 162 ILA-DEPTGNLDSETGKEVMELLRELNKEAGTTIVVVTHDPELAE--------YADRIIE 212
Query: 1310 DHKGLSDIHL 1319
L D +
Sbjct: 213 ----LRDGKI 218
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 36.5 bits (85), Expect = 0.048
Identities = 19/92 (20%), Positives = 45/92 (48%), Gaps = 2/92 (2%)
Query: 447 QAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVN 506
Q E+ L E EL+ ++ +Q+ + K++L+ + ++ + + + ++ L ++
Sbjct: 72 QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELD 131
Query: 507 SEIDQLSKSLDPDQLKNEIEAWIRQRNELEDE 538
E +L + L +LK E EA + L++
Sbjct: 132 EENRELREEL--AELKQENEALEAENERLQEN 161
Score = 35.4 bits (82), Expect = 0.13
Identities = 23/100 (23%), Positives = 47/100 (47%), Gaps = 4/100 (4%)
Query: 460 LESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDP- 518
++ +Q++ +++L + Q+ E+ Q L+ +L+ + +E+++L K L
Sbjct: 64 ARERLPELQQELAELQEELAELQEQLAELQQENQELK---QELSTLEAELERLQKELARI 120
Query: 519 DQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAE 558
QL + EL +EL + E L+A+N L E
Sbjct: 121 KQLSANAIELDEENRELREELAELKQENEALEAENERLQE 160
Score = 32.3 bits (74), Expect = 1.1
Identities = 21/107 (19%), Positives = 43/107 (40%), Gaps = 13/107 (12%)
Query: 770 IPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQ 829
+P L+ +L EL+E + E + +L +L+ + K + T + L I++L+
Sbjct: 68 LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANA 127
Query: 830 RELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTR 876
EL+ + ++ + E K E ++ E Q
Sbjct: 128 IELDEE-------------NRELREELAELKQENEALEAENERLQEN 161
Score = 31.9 bits (73), Expect = 1.6
Identities = 20/105 (19%), Positives = 49/105 (46%), Gaps = 5/105 (4%)
Query: 290 STQYQTKKTERDMIQESCNELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERL 349
+ + E +QE EL+ + +L + EL+ +L+ + L+ EL ++L
Sbjct: 65 RERLPELQQELAELQEELAELQEQLAEL-QQENQELKQELSTLEAELERLQKELARIKQL 123
Query: 350 KSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNN 394
+ I+ ++++ E + +L +L+++ E + N+ L+
Sbjct: 124 SANAIELDEEN----RELREELAELKQENEALEAENERLQENEQR 164
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 37.7 bits (88), Expect = 0.050
Identities = 57/271 (21%), Positives = 107/271 (39%), Gaps = 29/271 (10%)
Query: 717 VPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSK 776
+P L+ +L+T E+P Q + K + ++ L+ + ++ ++ +
Sbjct: 216 IPSLLAELQT---ELPGQLQDLKAGYRDMKEEGYHLEHVN--IDSRLERLKEQLVENSEL 270
Query: 777 LIELEENVIETK-----GELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNT-LQR 830
L +LE + E + +++ L LE + L + +E N L+
Sbjct: 271 LTQLELDEAEEELGLIQEKIESLYDLLEREVEAKNVVEENLPILPDYLEKAKENNEHLKE 330
Query: 831 ELERQESKISGMRSTGVDLDQVLAQQKEKK---NELNTFRSKIESGQTRLNSHNEKLQSL 887
E+ER + R +L V +KE K + L+ IE+ + + + L+ +
Sbjct: 331 EIERVKES---YRLAETELGSVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEI 387
Query: 888 QKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQ 947
+K DI +Q VQ L SL + E +++L E+ R + +
Sbjct: 388 EKALTDIEDEQEKVQEH---LTSLRKDELEARENLERLKSKLHEIKRYM---------EK 435
Query: 948 SELDALKKEHKKKLNEEGAKIQDYTKQLEEV 978
S L L + G +IQD K+L EV
Sbjct: 436 SNLPGLPETFLSLFFTAGHEIQDLMKELSEV 466
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile protein
found in all eukaryote cell types. This family consists
of the coiled-coil myosin heavy chain tail region. The
coiled-coil is composed of the tail from two molecules of
myosin. These can then assemble into the macromolecular
thick filament. The coiled-coil region provides the
structural backbone the thick filament.
Length = 859
Score = 37.7 bits (88), Expect = 0.055
Identities = 71/302 (23%), Positives = 137/302 (45%), Gaps = 42/302 (13%)
Query: 790 ELKKLKTALETPKTKEKTALSLQGDLTLL----DQNIRELNTLQRELERQESKISGMRST 845
EL KL+ LE + L + L L I EL+ +L++Q++K +S
Sbjct: 103 ELAKLRKDLE------EANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQ 156
Query: 846 GV-DLDQVLAQ-QKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQG 903
++D +LAQ + K +LN K + +++L+ KL LQ+Q ND+ S++ +Q
Sbjct: 157 LQAEVDDLLAQLDQIAKAKLNA-EKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQS 215
Query: 904 GAG-MLKSLEDRKCELEGM---DSVYQTELEEL-------GRKVAPIETQLNLAQSELDA 952
+ + LE+ + ++ + S +++LEE R+ A ++ QL + +LD+
Sbjct: 216 ENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDS 275
Query: 953 L----------KKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRE 1002
L K E +++L++ A+IQ + + E ++ E L K+ +++ L E
Sbjct: 276 LREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELKKKLNQKISELEE 335
Query: 1003 SVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKK-EAVAKLNEE 1061
+ + N K E+T + + + + +E + LEKK + K+ E
Sbjct: 336 AAEAANA-------KCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKNFDKILAE 388
Query: 1062 LK 1063
K
Sbjct: 389 WK 390
Score = 33.9 bits (78), Expect = 0.93
Identities = 127/625 (20%), Positives = 253/625 (40%), Gaps = 116/625 (18%)
Query: 466 SFKQQIEGNKKDLTNVITQINEVNQS-QSTLQVLQTKLNRVNSEIDQLSKSLDPD-QLKN 523
K+++E + ++Q++ + Q+ + LQ K+ + + I +L + L+ + +
Sbjct: 4 RQKRELENQLYRKESELSQLSSKLEDEQALVAQLQKKIKELEARIRELEEELEAERAARA 63
Query: 524 EIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKESKLAD----------- 572
+ E + R +L EL + L+ A L ++E++LA
Sbjct: 64 KAE---KARADLSREL---EELSERLEEAGGATAAQIELNKKREAELAKLRKDLEEANLQ 117
Query: 573 ----INLLKERHDRAFHLLFDMIPEEN-FKNSLDKALSSITFDINRIQEDIN-------A 620
+ L+++H A + L + I + K +K S + +++ + ++
Sbjct: 118 HEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQIAKAKLN 177
Query: 621 KEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKREQEEV 680
EK LE+ +S L + +R L +L + + E +T +L+ + +V
Sbjct: 178 AEKKAKQLESQLSELQVKLDELQRQLNDLTSQ------KSRLQSENSDLTRQLEEAEAQV 231
Query: 681 SMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNKKT 740
S ++ + +LEE + +R L+ +++++
Sbjct: 232 SNLSKLKS---QLESQLEEAKRSLEEESR----------ERANLQAQLRQLEHD------ 272
Query: 741 HIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKL--------IELEENVIETKGELK 792
+D L +Q E K E + + +I RSK ELEE + ++
Sbjct: 273 -LDSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELKKKLNQKIS 331
Query: 793 KLKTALETPKTK----EKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVD 848
+L+ A E K EKT LQ EL LQ ELER + S + +
Sbjct: 332 ELEEAAEAANAKCDSLEKTKSRLQS----------ELEDLQIELERANAAASELEKKQKN 381
Query: 849 LDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGML 908
D++LA+ K K +EL ++++++ Q + + +L L KN++ + V+
Sbjct: 382 FDKILAEWKRKVDEL---QAELDTAQREARNLSTELFRL---KNELEELKDQVEALRREN 435
Query: 909 KSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKI 968
K+L+D +L +L E GR V +L A+ L+A K E + L E A +
Sbjct: 436 KNLQDEIHDLTD-------QLGEGGRNV----HELEKARRRLEAEKDELQAALEEAEAAL 484
Query: 969 QDYTKQLEEVKRIKLEILNYTKRGTLTQLAALR-ESVQKLNQRKEDIIAKRGVCERTINE 1027
+ EE K ++ ++ +L+ +R E ++L +++E+ R +R I
Sbjct: 485 EL-----EESKVLRAQV----------ELSQIRSEIERRLAEKEEEFENTRKNHQRAIES 529
Query: 1028 INQSIANQSLEEIDLKNNLTLLEKK 1052
+ ++ E K + L+KK
Sbjct: 530 LQATLE----AEAKGKAEASRLKKK 550
Score = 31.2 bits (71), Expect = 5.4
Identities = 66/334 (19%), Positives = 137/334 (41%), Gaps = 38/334 (11%)
Query: 272 INEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDKAELQSKLNL 331
++E V+L E +R ++ +++Q ++E + E E+ + L S K++L+S+L
Sbjct: 189 LSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNL-SKLKSQLESQLEE 247
Query: 332 FKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTK 391
K +L+E+ E N + Q + + E +LER ++K
Sbjct: 248 AKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRSK 307
Query: 392 LNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNENTK---QA 448
+ + ++ EE E L K I + + E TK Q+
Sbjct: 308 FES-------EGALRA----EELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQS 356
Query: 449 EINALIVE-------KVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTK 501
E+ L +E ELE K K+F + + K+ + + E++ +Q + L T+
Sbjct: 357 ELEDLQIELERANAAASELEKKQKNFDKILAEWKRK-VDELQA--ELDTAQREARNLSTE 413
Query: 502 LNRVNSEIDQL------------SKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISIL 549
L R+ +E+++L + + L +++ R +ELE ++AE L
Sbjct: 414 LFRLKNELEELKDQVEALRRENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDEL 473
Query: 550 QAQNITL-AEIKSLKNRKESKLADINLLKERHDR 582
QA A ++ +++ +++ ++ +R
Sbjct: 474 QAALEEAEAALELEESKVLRAQVELSQIRSEIER 507
>gnl|CDD|213242 cd03275, ABC_SMC1_euk, ATP-binding cassette domain of eukaryotic
SMC1 proteins. The structural maintenance of
chromosomes (SMC) proteins are large (approximately 110
to 170 kDa), and each is arranged into five
recognizable domains. Amino-acid sequence homology of
SMC proteins between species is largely confined to the
amino- and carboxy-terminal globular domains. The
amino-terminal domain contains a 'Walker A'
nucleotide-binding domain (GxxGxGKS/T, in the
single-letter amino-acid code), which by mutational
studies has been shown to be essential in several
proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B'
motif, and a motif with homology to the signature
sequence of the ATP-binding cassette (ABC) family of
ATPases. The sequence homology within the
carboxy-terminal domain is relatively high within the
SMC1-SMC4 group, whereas SMC5 and SMC6 show some
divergence in both of these sequences. In eukaryotic
cells, the proteins are found as heterodimers of SMC1
paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
(formerly known as Rad18).
Length = 247
Score = 36.8 bits (86), Expect = 0.060
Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 14/90 (15%)
Query: 7 LHIMGIRNFPADK-NRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSG--- 62
L + + NF + K V+ T I+G NG GK+ +++ I F L G S
Sbjct: 1 LKRLELENFKSYKGRHVIGPFDRFTCIIGPNGSGKSNLMDAISFVL------GEKSSHLR 54
Query: 63 ----KNFVHDPRIGKKDEASASSGKNFVHD 88
K+ ++ R+GK D SA + D
Sbjct: 55 SKNLKDLIYRARVGKPDSNSAYVTAVYEDD 84
>gnl|CDD|226359 COG3839, MalK, ABC-type sugar transport systems, ATPase components
[Carbohydrate transport and metabolism].
Length = 338
Score = 36.9 bits (86), Expect = 0.064
Identities = 25/75 (33%), Positives = 33/75 (44%), Gaps = 17/75 (22%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
ALA R +F LDEP +NLD IK ++ I +THD +E +T
Sbjct: 143 ALARALVRKPKVFLLDEPLSNLDAKLRVLMRSEIKKLHERLGTTTIYVTHD--QVEAMTL 200
Query: 1299 IDRAYVVRIVRDHKG 1313
D RIV + G
Sbjct: 201 AD-----RIVVMNDG 210
Score = 31.5 bits (72), Expect = 3.6
Identities = 7/16 (43%), Positives = 13/16 (81%)
Query: 32 IVGENGCGKTTIIECI 47
++G +GCGK+T++ I
Sbjct: 34 LLGPSGCGKSTLLRMI 49
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels of
palindromic sequences in DNA. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1042
Score = 37.6 bits (87), Expect = 0.067
Identities = 102/780 (13%), Positives = 238/780 (30%), Gaps = 84/780 (10%)
Query: 342 ELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNNLADTLCL 401
L+ +L ++K H ++ L + DT + L L
Sbjct: 174 PLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTP---CMPDTYHERKQVLEKELKH 230
Query: 402 DTTA---------------KSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNENTK 446
A ++Q + + L+K + I++ + +LE T +
Sbjct: 231 LREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRAR 290
Query: 447 QAEINALIVEKV-ELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRV 505
+A A ++ V ++E + + +++ + ++ + + QS+++ + L +
Sbjct: 291 KAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTL 350
Query: 506 NSEIDQLSKSLDPDQLKNE-----------IEAWIRQRNELEDELCVIDAEISILQAQNI 554
+S+ + + + E I +Q+ L +L + E+ ILQ +
Sbjct: 351 HSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQA 410
Query: 555 T----LAEIKSLKNRKESKLADINLLKERHDR----AFHLLFDMIPEENFKNSLDKALSS 606
T + + L+ + L + + E+ ++L
Sbjct: 411 TIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKE 470
Query: 607 ITFDINRIQEDINAKEKHLYTLEANVSNSSKTL-RDQKRTLAELMDRMELVLGSKPFEDE 665
+ E I+ +E + + + + + P
Sbjct: 471 REQQLQTK-EQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRR 529
Query: 666 LDRVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPR-----CPLCTRFFESDYSVPGL 720
+ R + + + L+E L S +P L
Sbjct: 530 MQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNL 589
Query: 721 VNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIEL 780
N + + + C+Q L++L+P + D+ L +
Sbjct: 590 QNITVRLQDLTEKLSEAEDM---LACEQHALLRKLQP------EQDLQDV-RLHLQQCSQ 639
Query: 781 EENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRE-------LE 833
E + T L L+ L + +E LL L +Q E E
Sbjct: 640 ELALKLT--ALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKE 697
Query: 834 RQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQK-- 891
+ +R +++ + E +N ++ S + + R ++ N+ L+ L Q
Sbjct: 698 MLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAA---REDALNQSLKELMHQART 754
Query: 892 ------NDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNL 945
+ V L E++ + + + + L A I ++
Sbjct: 755 VLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPS 814
Query: 946 AQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQ 1005
+ L+ + ++ + +++++ + L E+ L+ QLA L +
Sbjct: 815 DEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEE-----CSKQLAQLTQEQA 869
Query: 1006 KLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLS 1065
K+ Q + + + + + + L EI L N+ L + E
Sbjct: 870 KIIQLSDKLNGINQI----KIQFDGDALIKFLHEITLYANVRLANQSEGRFHGRYADSHV 925
Score = 35.7 bits (82), Expect = 0.21
Identities = 149/984 (15%), Positives = 313/984 (31%), Gaps = 148/984 (15%)
Query: 28 PLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRIGKKDEASASSGKNFVH 87
P+ LI G+ G GKTT+++ I +AL + P+ S
Sbjct: 27 PIFLICGKTGAGKTTLLDAITYALYGKLPR-------------------RSEVIRSLNSL 67
Query: 88 DPRIGKKDETHAIVKLQCTCENNDTVCVVRSLLLSNKNGKDNCATRDTTISRKIFATGVQ 147
+ L V R+ L ++ + + + +
Sbjct: 68 YAAPSEAAFAELEFSL-----GTKIYRVHRT-LRCTRSHRKT----EQPEQLYLEQKKGR 117
Query: 148 KNLGCLQQESVLE-MCNLIGVSKAILNNVIFCHQ-ENSSWPLDEGKKVKEIFDEIFDATK 205
+ ++ E + +L+ + V+ Q E + + + K+ KE+ +F +
Sbjct: 118 GRILAAKKSETEEVIHDLLKLDYKTFTRVVLLPQGEFAQFLKAKSKEKKELLMNLFPLDQ 177
Query: 206 YNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNI 265
Y + ++ L + + Q D E +
Sbjct: 178 YTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTP-------------CMPDTYHERKQVL 224
Query: 266 EESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDK--A 323
E+ +K + E L Q + + TQ + + E+ Q+ +L + I++L + +
Sbjct: 225 EKELKHLREALQQTQQSHAYL----TQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLE 280
Query: 324 ELQSKLNLFK-----INLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDE 378
E Q ++N + + +++E Q + +Q + +S + + K +
Sbjct: 281 ETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSI 340
Query: 379 ETHKKLNDTLKTKLNNLADTLCLDTT---AKSQYTPEEGEGLIKMSQTTIDKYLSDIKIL 435
E ++L TL ++ ++ D + T+ Q Q T
Sbjct: 341 EEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCK 400
Query: 436 ERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTL 495
E E QA I+ +L+ ++ K+Q E ++ I Q +
Sbjct: 401 ELDILQRE---QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLE 457
Query: 496 QVLQTKLNRVNSEIDQLSKSLDPDQLK-NEIEAWIRQR-NELEDELCVIDAEISILQAQN 553
++ + + E +Q ++ + L+ +A + R EL++E C +
Sbjct: 458 KIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPAR 517
Query: 554 ITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPE----ENFKNSLDKALSSITF 609
+ L R + L+ + +H L + + + ++ S +T
Sbjct: 518 QDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQ 577
Query: 610 DINRIQEDINAKEKHLYTLEANVSNSSK-------TLRDQKRTLAELMDRMELVLGSKPF 662
NR +EDI + L+ S+ R L D ++ L +
Sbjct: 578 CDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQC 637
Query: 663 EDELDRVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVN 722
EL L Q + +E L R +
Sbjct: 638 SQELALKLTALHALQLTL----------------TQERVREHALSIRVLPKEL------- 674
Query: 723 KLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEE 782
L +Q +LQ+++ E + ++ + ++ L ELE
Sbjct: 675 ----------------------LASRQLALQKMQSEKEQLTYWKEM-LAQCQTLLRELET 711
Query: 783 NVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGM 842
++ E E +++ A + + L L+Q+++EL R + + ++
Sbjct: 712 HIEEYDREFNEIENASSSLGSD------LAAREDALNQSLKELMHQARTVLKARTEAHFN 765
Query: 843 RSTGVDLD-QVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTV 901
+ V Q A+ E+ F E L + ++ Q+ +D L
Sbjct: 766 NNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIG--QEIPSDEDILNLQC 823
Query: 902 QGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKL 961
+ + R LEE + I QL + +L
Sbjct: 824 ETLVQEEEQFLSR--------------LEEKSATLGEITHQL-----LKYEECSKQLAQL 864
Query: 962 NEEGAKIQDYTKQLEEVKRIKLEI 985
+E AKI + +L + +IK++
Sbjct: 865 TQEQAKIIQLSDKLNGINQIKIQF 888
>gnl|CDD|213224 cd03257, ABC_NikE_OppD_transporters, ATP-binding cassette domain of
nickel/oligopeptides specific transporters. The ABC
transporter subfamily specific for the transport of
dipeptides, oligopeptides (OppD), and nickel (NikDE). The
NikABCDE system of E. coli belongs to this family and is
composed of the periplasmic binding protein NikA, two
integral membrane components (NikB and NikC), and two
ATPase (NikD and NikE). The NikABCDE transporter is
synthesized under anaerobic conditions to meet the
increased demand for nickel resulting from hydrogenase
synthesis. The molecular mechanism of nickel uptake in
many bacteria and most archaea is not known. Many other
members of this ABC family are also involved in the
uptake of dipeptides and oligopeptides. The oligopeptide
transport system (Opp) is a five-component ABC transport
composed of a membrane-anchored substrate binding
proteins (SRP), OppA, two transmembrane proteins, OppB
and OppC, and two ATP-binding domains, OppD and OppF.
Length = 228
Score = 36.3 bits (85), Expect = 0.073
Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 12/52 (23%)
Query: 1263 LDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTAIDRAYV 1304
DEPT+ LD +K ++ L+ ITHD + + DR V
Sbjct: 169 ADEPTSALDVSVQAQILDLLKKLQEELGLTLLFITHDLGVVAKIA--DRVAV 218
Score = 32.1 bits (74), Expect = 1.6
Identities = 8/16 (50%), Positives = 12/16 (75%)
Query: 32 IVGENGCGKTTIIECI 47
+VGE+G GK+T+ I
Sbjct: 36 LVGESGSGKSTLARAI 51
>gnl|CDD|234189 TIGR03375, type_I_sec_LssB, type I secretion system ATPase, LssB
family. Type I protein secretion is a system in some
Gram-negative bacteria to export proteins (often
proteases) across both inner and outer membranes to the
extracellular medium. This is one of three proteins of
the type I secretion apparatus. Targeted proteins are not
cleaved at the N-terminus, but rather carry signals
located toward the extreme C-terminus to direct type I
secretion. This model is related to models TIGR01842 and
TIGR01846, and to bacteriocin ABC transporters that
cleave their substrates during export [Protein fate,
Protein and peptide secretion and trafficking, Cellular
processes, Pathogenesis].
Length = 694
Score = 37.2 bits (87), Expect = 0.079
Identities = 33/121 (27%), Positives = 46/121 (38%), Gaps = 43/121 (35%)
Query: 1225 DEEFIENLTAIDRAYVVRIVRDHK----------------------ALAETFSRNCGIFA 1262
DEE + A + A V VR H ALA R+ I
Sbjct: 568 DEEILR---AAELAGVTEFVRRHPDGLDMQIGERGRSLSGGQRQAVALARALLRDPPILL 624
Query: 1263 LDEPTTNLDIKNASDQKNFQ----------LIVITHDEEFIENLTAIDRAYVV---RIVR 1309
LDEPT+ +D N S+++ L+++TH L +DR V+ RIV
Sbjct: 625 LDEPTSAMD--NRSEERFKDRLKRWLAGKTLVLVTHRTSL---LDLVDRIIVMDNGRIVA 679
Query: 1310 D 1310
D
Sbjct: 680 D 680
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 37.1 bits (87), Expect = 0.087
Identities = 31/128 (24%), Positives = 55/128 (42%), Gaps = 16/128 (12%)
Query: 946 AQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQ 1005
A+ E +A+KKE + EE I + E+ R + L ++ L + L ++
Sbjct: 47 AKKEAEAIKKEALLEAKEE---IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLE 103
Query: 1006 KLNQRKEDIIAKRGVCERTINEINQSIA---------NQSLEEIDLKNNLTLLE-KKEAV 1055
L +R+E++ K E+ E+ + Q LE I + LT E K+ +
Sbjct: 104 LLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERI---SGLTAEEAKEILL 160
Query: 1056 AKLNEELK 1063
K+ EE +
Sbjct: 161 EKVEEEAR 168
Score = 30.5 bits (70), Expect = 8.6
Identities = 33/127 (25%), Positives = 54/127 (42%), Gaps = 26/127 (20%)
Query: 858 EKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCE 917
E K E++ R++ E E+ LQK + + K+ + L+ LE R+ E
Sbjct: 61 EAKEEIHKLRNEFE------KELRERRNELQKLEKRLLQKEENLDR---KLELLEKREEE 111
Query: 918 LEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEE 977
LE + + + +EL +K + EL+ L +E ++L I T EE
Sbjct: 112 LEKKEKELEQKQQELEKK-----------EEELEELIEEQLQELER----ISGLTA--EE 154
Query: 978 VKRIKLE 984
K I LE
Sbjct: 155 AKEILLE 161
>gnl|CDD|224052 COG1127, Ttg2A, ABC-type transport system involved in resistance to
organic solvents, ATPase component [Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 263
Score = 36.0 bits (84), Expect = 0.098
Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 12/67 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
ALA + + + LDEPT+ LD I+ +D +I++THD LT
Sbjct: 155 ALARAIALDPELLFLDEPTSGLDPISAGVIDELIRELNDALGLTVIMVTHD--LDSLLTI 212
Query: 1299 IDRAYVV 1305
DR V+
Sbjct: 213 ADRVAVL 219
>gnl|CDD|224059 COG1136, SalX, ABC-type antimicrobial peptide transport system,
ATPase component [Defense mechanisms].
Length = 226
Score = 35.9 bits (84), Expect = 0.100
Identities = 18/53 (33%), Positives = 25/53 (47%), Gaps = 10/53 (18%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNA----------SDQKNFQLIVITHDEE 1291
A+A N I DEPT NLD K A + ++ +I++THD E
Sbjct: 152 AIARALINNPKIILADEPTGNLDSKTAKEVLELLRELNKERGKTIIMVTHDPE 204
>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component of
iron-siderophores, vitamin B12 and hemin transporters and
related proteins. ABC transporters, involved in the
uptake of siderophores, heme, and vitamin B12, are widely
conserved in bacteria and archaea. Only very few species
lack representatives of the siderophore family
transporters. The E. coli BtuCD protein is an ABC
transporter mediating vitamin B12 uptake. The two
ATP-binding cassettes (BtuD) are in close contact with
each other, as are the two membrane-spanning subunits
(BtuC); this arrangement is distinct from that observed
for the E. coli lipid flippase MsbA. The BtuC subunits
provide 20 transmembrane helices grouped around a
translocation pathway that is closed to the cytoplasm by
a gate region, whereas the dimer arrangement of the BtuD
subunits resembles the ATP-bound form of the Rad50 DNA
repair enzyme. A prominent cytoplasmic loop of BtuC forms
the contact region with the ATP-binding cassette and
represent a conserved motif among the ABC transporters.
Length = 180
Score = 35.5 bits (83), Expect = 0.10
Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 17/59 (28%)
Query: 1241 VRIVRDHKALAETFSRNCGIFALDEPTTNLDIKNA----------SDQKNFQLIVITHD 1289
V + R ALA + I LDEPT++LDI + + ++ ++++ HD
Sbjct: 106 VLLAR---ALA----QEPPILLLDEPTSHLDIAHQIELLELLRRLARERGKTVVMVLHD 157
Score = 31.3 bits (72), Expect = 2.4
Identities = 7/19 (36%), Positives = 14/19 (73%)
Query: 29 LTLIVGENGCGKTTIIECI 47
+ I+G NG GK+T+++ +
Sbjct: 27 IVGILGPNGAGKSTLLKTL 45
>gnl|CDD|213228 cd03261, ABC_Org_Solvent_Resistant, ATP-binding cassette transport
system involved in resistant to organic solvents. ABC
transporters are a large family of proteins involved in
the transport of a wide variety of different compounds,
like sugars, ions, peptides, and more complex organic
molecules. The nucleotide binding domain shows the
highest similarity between all members of the family. ABC
transporters are a subset of nucleotide hydrolases that
contain a signature motif, Q-loop, and H-loop/switch
region, in addition to, the Walker A motif/P-loop and
Walker B motif commonly found in a number of ATP- and
GTP-binding and hydrolyzing proteins.
Length = 235
Score = 35.6 bits (83), Expect = 0.12
Identities = 16/59 (27%), Positives = 24/59 (40%), Gaps = 15/59 (25%)
Query: 1263 LDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTAIDRAYVV---RIV 1308
DEPT LD I++ + I++THD + DR V+ +IV
Sbjct: 160 YDEPTAGLDPIASGVIDDLIRSLKKELGLTSIMVTHDLDTA--FAIADRIAVLYDGKIV 216
Score = 30.9 bits (71), Expect = 4.4
Identities = 7/18 (38%), Positives = 12/18 (66%)
Query: 30 TLIVGENGCGKTTIIECI 47
I+G +G GK+T++ I
Sbjct: 29 LAIIGPSGSGKSTLLRLI 46
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 36.5 bits (85), Expect = 0.13
Identities = 15/41 (36%), Positives = 26/41 (63%), Gaps = 6/41 (14%)
Query: 1263 LDEPTTNLDIKNAS---DQ-KNFQ--LIVITHDEEFIENLT 1297
LDEPT +LD+++ D K + +IV++HD F++N+
Sbjct: 177 LDEPTNHLDLESIEWLEDYLKRYPGTVIVVSHDRYFLDNVA 217
Score = 35.3 bits (82), Expect = 0.24
Identities = 15/57 (26%), Positives = 30/57 (52%), Gaps = 10/57 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDI------KNASDQKNFQ--LIVITHDEEFIENLT 1297
LA+ + + LDEPT +LDI + A +F+ +++++HD F++ +
Sbjct: 449 LLAKLLLQPPNLLLLDEPTNHLDIESLEALEEA--LLDFEGTVLLVSHDRYFLDRVA 503
>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component. Nup88 can be divided into
two structural domains; the N-terminal two-thirds of the
protein has no obvious structural motifs but is the
region for binding to Nup98, one of the components of the
nuclear pore. the C-terminal end is a predicted
coiled-coil domain. Nup88 is overexpressed in tumour
cells.
Length = 717
Score = 36.4 bits (84), Expect = 0.13
Identities = 25/118 (21%), Positives = 49/118 (41%), Gaps = 6/118 (5%)
Query: 937 APIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTK--RGTL 994
P E L L ++++ K + + Q K L+ K +LE + + R +L
Sbjct: 532 PPPEECLQLLSRATQVFREQYLLKHDLAREEFQRRVKLLQLQKEKQLEDIQDCREERKSL 591
Query: 995 TQLA-ALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEK 1051
++ A L E ++ +E ++ + C+R + N + S E D+ L + K
Sbjct: 592 SERAEKLAEKFEEAKYNQELLVNR---CKRLLQSANSQLPVLSDSERDMSKELQRINK 646
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 36.2 bits (84), Expect = 0.13
Identities = 21/94 (22%), Positives = 49/94 (52%), Gaps = 4/94 (4%)
Query: 747 KQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEK 806
++ +E+ + I KL++T + L + EL+ + E K E++KL++ LE + + +
Sbjct: 409 TEEEERREITVYEKRIKKLEET-VERLEEENSELKRELEELKREIEKLESELERFRREVR 467
Query: 807 TALSLQGDLTLLDQNIRELNTLQRELERQESKIS 840
+ ++ D R + L++ELE ++ ++
Sbjct: 468 DKVRKDREIRARD---RRIERLEKELEEKKKRVE 498
Score = 33.5 bits (77), Expect = 0.95
Identities = 40/198 (20%), Positives = 78/198 (39%), Gaps = 24/198 (12%)
Query: 494 TLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEI----------EAWIRQRNELEDELCVID 543
+ KL +V ++ +L D +++K + + E E +
Sbjct: 353 AYLAYKPKLEKVERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEE 412
Query: 544 AEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKA 603
I + IK L+ E + + LK + L I E ++ L++
Sbjct: 413 ERREITVYEK----RIKKLEETVERLEEENSELKRELEE----LKREI--EKLESELERF 462
Query: 604 LSSITFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAEL--MDRMELVLGSKP 661
+ + R +I A+++ + LE + K + + +R LAEL M ++EL P
Sbjct: 463 RREVRDKV-RKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTP 521
Query: 662 FEDELDRVTLELKREQEE 679
+ ++++TLE E EE
Sbjct: 522 VK-VVEKLTLEAIEEAEE 538
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 36.1 bits (83), Expect = 0.15
Identities = 51/268 (19%), Positives = 106/268 (39%), Gaps = 21/268 (7%)
Query: 733 EQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQD--TDIPSLRSKLIELEE-------- 782
++ + + Q ++ L++L + LQ + + S+ S L ELEE
Sbjct: 212 KEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKANLEKA 271
Query: 783 --------NVIET-KGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELE 833
N++E+ K EL++ K LE + KE A L + + +
Sbjct: 272 AEEEKSLRNLVESLKQELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEKESKAREK 331
Query: 834 RQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKND 893
E ++ + ++ + + + EL + + E + L + +L++ K+
Sbjct: 332 MVEI-PLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAELRLEAALKEAEA 390
Query: 894 IH-SKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDA 952
++ L + + +S E K + ++ E EEL +K E + A
Sbjct: 391 AKAAEALALAEIKALQESEESAKADSPRKITLSLEEYEELSKKAEEAEELAEKKVAAALA 450
Query: 953 LKKEHKKKLNEEGAKIQDYTKQLEEVKR 980
+E K+ NE K+++ +K++EE K
Sbjct: 451 QVEEAKESENESLKKLEEASKEIEEKKA 478
Score = 32.7 bits (74), Expect = 1.8
Identities = 72/375 (19%), Positives = 151/375 (40%), Gaps = 22/375 (5%)
Query: 609 FDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDR 668
++ QE+ N +K L E K L KRT+ +L ++E + + +
Sbjct: 36 TELMLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAEKEE----QQAK 91
Query: 669 VTLELKREQEEVSMMTSTQYLFNSYIGKLEE-NEPRCPLCTRFFESDYSVPGLVNKLKTK 727
EL + + E + YI E + + L E D V LK
Sbjct: 92 QDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDAALKRA 151
Query: 728 IKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIET 787
+ I N+K +++L K E+ + E++ + + + ++ E +
Sbjct: 152 EEAICASKVNEKK-VEELTK------EIIAMKESLERAHAAHLEAEEERIGAALEKDQDR 204
Query: 788 KGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGV 847
+ K+LK A + + ++ L + DL L + EL +LQ+E+ S + S
Sbjct: 205 ETYEKELKEAEKELERLKQ-DLDPEKDLEKLAEASAELESLQKEISIMAS----VASVLK 259
Query: 848 DLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGM 907
+L++ A ++ E + R+ +ES + L ++L+ L++++ + +++
Sbjct: 260 ELEEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAEEAASSLEAELNR 319
Query: 908 LKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKE---HKKKLNEE 964
KS ++ K + ++ +L++ + + A+ EL LK+E K L
Sbjct: 320 TKSEKESKAREKMVEI--PLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETA 377
Query: 965 GAKIQDYTKQLEEVK 979
+++ K+ E K
Sbjct: 378 ELRLEAALKEAEAAK 392
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
the myosin-like domain [Function unknown].
Length = 499
Score = 36.2 bits (83), Expect = 0.16
Identities = 44/244 (18%), Positives = 93/244 (38%), Gaps = 37/244 (15%)
Query: 841 GMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLT 900
+RS LD + Q + + EL T + + + +T + +LQ ++++ +
Sbjct: 68 NLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQ---- 123
Query: 901 VQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKK 960
EL +EL R + Q Q+ L L E +++
Sbjct: 124 ----------------ELAAARQNLAKAQQELAR----LTKQAQDLQTRLKTL-AEQRRQ 162
Query: 961 LNEEGAKIQDYTKQLE-EVKRIKLEILNYTKRGTLTQ-----LAALRESVQKLNQRKEDI 1014
L + +Q KQL+ ++K ++L+ R + LA + Q R E++
Sbjct: 163 LEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQ---ARTEEL 219
Query: 1015 IAKRGVCERTINEINQ--SIANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLSEIMISDL 1072
+ ++T I Q + +Q ++I + + E++ + +L E ++ L
Sbjct: 220 ARRAAAAQQTAQAIQQRDAQISQKAQQIAARAE-QIRERERQLQRLETAQARLEQEVAQL 278
Query: 1073 TKYH 1076
Y+
Sbjct: 279 EAYY 282
Score = 34.6 bits (79), Expect = 0.46
Identities = 43/243 (17%), Positives = 93/243 (38%), Gaps = 20/243 (8%)
Query: 777 LIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIR-ELNTLQRELERQ 835
L L N+ +L ++ L +T+ TA + + R EL ++E E
Sbjct: 62 LFLLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAV 121
Query: 836 ESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIH 895
+++ R Q LA+ ++ +L T + + +L + + LQ+ QKQ
Sbjct: 122 RQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQ----- 176
Query: 896 SKQLTVQGGAGMLKS----LEDRKCELEGMDSVYQTE---LEELGRKVAPIETQLNLAQS 948
+Q A LKS L+ R ++E T + ++A
Sbjct: 177 -----LQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQ 231
Query: 949 ELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLN 1008
+ + +K + A+ + ++ +++R++ ++ + QL A ++ +L
Sbjct: 232 AIQQRDAQISQKAQQIAARAEQIRERERQLQRLETA-QARLEQ-EVAQLEAYYQAYVRLR 289
Query: 1009 QRK 1011
Q+
Sbjct: 290 QQA 292
Score = 31.1 bits (70), Expect = 4.9
Identities = 24/125 (19%), Positives = 59/125 (47%), Gaps = 8/125 (6%)
Query: 206 YNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQK----RDQSFEE 261
N+ L S Q D +R ++ ++ K+ A+++++ + QK R+ +E
Sbjct: 65 LNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQE 124
Query: 262 LHNIEESMKPINEKLVQLTEKERN----MSVMSTQYQTKKTERDMIQESCNELESSIKQL 317
L +++ ++L +LT++ ++ + ++ Q + + + +Q S +L++S QL
Sbjct: 125 LAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQL 184
Query: 318 FSGDK 322
S
Sbjct: 185 KSQVL 189
Score = 31.1 bits (70), Expect = 5.4
Identities = 29/162 (17%), Positives = 61/162 (37%), Gaps = 5/162 (3%)
Query: 744 QLCKQQRSLQ-ELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPK 802
+L KQ + LQ LK + E +L + SL++ +L+ + + K ++ LK +
Sbjct: 141 RLTKQAQDLQTRLKTLAEQRRQL-EAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIE 199
Query: 803 TKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNE 862
+ + + EL ++ I + Q +A + E+ E
Sbjct: 200 QEAQNLATRANAAQART---EELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRE 256
Query: 863 LNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGG 904
++E+ Q RL +L++ + + + Q G
Sbjct: 257 RERQLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAATQRG 298
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 35.9 bits (83), Expect = 0.17
Identities = 47/255 (18%), Positives = 90/255 (35%), Gaps = 53/255 (20%)
Query: 742 IDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETP 801
+D ++ + +Y+ +K R +L + ++ E L L+ LE
Sbjct: 150 LDTFAGANEKVKAYRELYQAWLKA--------RQQLKDRQQKEQELAQRLDFLQFQLEE- 200
Query: 802 KTKEKTALSLQGDLTLLD--QNIRELNTLQRELERQESKISGMRSTGVDLD--------- 850
E+ L D L Q + L L+ + + + G
Sbjct: 201 --LEEADLQPGEDEALEAEQQRLSNLEKLRELSQNALAALRGDVDVQEGSLLEGLGEAQL 258
Query: 851 ------------------QVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKN 892
L + +E EL + ++E RLN E+L +
Sbjct: 259 ALASVIDGSLRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERLAQI----- 313
Query: 893 DIHSKQLTVQGGA---GMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSE 949
K+L + GA +L+ E K EL+ +D + LE L +V +E +L+ A
Sbjct: 314 ----KRLKRKYGASVEEVLEYAEKIKEELDQLDDSDE-SLEALEEEVDKLEEELDKAAVA 368
Query: 950 LDALKKEHKKKLNEE 964
L ++++ ++L +
Sbjct: 369 LSLIRRKAAERLAKR 383
Score = 32.0 bits (73), Expect = 2.5
Identities = 15/38 (39%), Positives = 22/38 (57%)
Query: 12 IRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKF 49
I NF + V F+R LT++ GE G GK+ II+ +
Sbjct: 7 INNFALIRVLTVEFERGLTVLTGETGAGKSMIIDALSL 44
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 36.2 bits (84), Expect = 0.17
Identities = 60/334 (17%), Positives = 114/334 (34%), Gaps = 47/334 (14%)
Query: 338 EKCSELENQERLKSQYIQEEKQSHT-----HINEAQMKLGKLERDEETHKKLNDTLKTKL 392
EK L N +RL + + ++ I+ L + R + + +
Sbjct: 156 EKIEALANPDRL-ASLL--KEAIEVLLGLDLIDRLAGDLTNVLRRRKKSELPSS------ 206
Query: 393 NNLADTLCLDTTAKSQYTPEEG-EGLIKMSQTTIDKYLSDIKILERTFSDNENTKQAEIN 451
L++ L+ K Q E I + +++ ++ LE+ F E
Sbjct: 207 -ILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEERE 265
Query: 452 ALIVEKVELESKIKSFKQQIEG----------NKKDLTNVITQINEVNQSQSTLQVLQTK 501
L + E+E+ K+ + Q+ L + Q+ + QSQ +
Sbjct: 266 QLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQEEL 325
Query: 502 LNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISI-LQAQNITLAEIK 560
R +L +SL L E I D+ D+EI L +T E+
Sbjct: 326 EERDK----ELLESLPKLALPAEHVKEIAAELAEIDKPATTDSEIPHRLSGSELTQLEVL 381
Query: 561 SLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINA 620
+ ++E + A LLKE + L +DK +S+I +I + +
Sbjct: 382 IQQVKRELQDAKSQLLKELRELEEEL-----------AEVDKKISTIP-SEEQIAQLLEE 429
Query: 621 KEKHLYTLEANVSNSSKTLRDQKRTLAELMDRME 654
+ + + S + + R L L + +E
Sbjct: 430 LGEA----QNELFRSEAEIEELLRQLETLKEAIE 459
Score = 35.0 bits (81), Expect = 0.33
Identities = 50/275 (18%), Positives = 109/275 (39%), Gaps = 36/275 (13%)
Query: 424 TIDKYLSDIKILER--TFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNV 481
ID+ D+ + R S+ ++ +EI AL E E K + Q+I +
Sbjct: 183 LIDRLAGDLTNVLRRRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLR------ 236
Query: 482 ITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCV 541
NE+ ++Q +L+ L+ K ++ + + L+ EIEA R +L
Sbjct: 237 ----NELEEAQRSLESLEKKFRSEGGDLFEEREQLERQLK--EIEA---ARKANRAQLRE 287
Query: 542 IDAE-ISILQAQNITLAEIKSLKNRKESKLADI--NLLKERHDRAFHLLFDMIPEENFKN 598
+ A+ + +L N+ + L+ ++S+ + L+ER L P+
Sbjct: 288 LAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQEELEERDKELLESL----PKLALPA 343
Query: 599 SLDKALSSITFDI---NRIQ---------EDINAKEKHLYTLEANVSNSSKTLRDQKRTL 646
K +++ +I ++ E + ++ + ++ L + R L
Sbjct: 344 EHVKEIAAELAEIDKPATTDSEIPHRLSGSELTQLEVLIQQVKRELQDAKSQLLKELREL 403
Query: 647 AELMDRMELVLGSKPFEDELDRVTLELKREQEEVS 681
E + ++ + + P E+++ ++ EL Q E+
Sbjct: 404 EEELAEVDKKISTIPSEEQIAQLLEELGEAQNELF 438
Score = 33.9 bits (78), Expect = 0.70
Identities = 23/110 (20%), Positives = 41/110 (37%), Gaps = 18/110 (16%)
Query: 23 VRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRIGKKDEASASSG 82
+P+ LI G NG GKTT+++ I+ AL + + G G + + +
Sbjct: 24 PSSPKPIILIGGLNGAGKTTLLDAIQLALYGKRALCSGRGNKSYEQYLRGLINRQAGKTN 83
Query: 83 KNFVHDPRIGKKDETHAIVKLQCTCENNDTVC---VVRSLLLSNKNGKDN 129
A + L + +VRS ++NK+ K+
Sbjct: 84 P---------------ASITLTFSVVEGGKRHEYTLVRSWHINNKDVKEK 118
Score = 33.1 bits (76), Expect = 1.4
Identities = 62/302 (20%), Positives = 103/302 (34%), Gaps = 79/302 (26%)
Query: 811 LQGDLTLLDQ----------NIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKK 860
L GDLT + + + E+ L+ EL+ Q K + LAQ
Sbjct: 187 LAGDLTNVLRRRKKSELPSSILSEIEALEAELKEQSEKY-----------EDLAQ----- 230
Query: 861 NELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKS----LEDRKC 916
E+ R+++E Q L S +K +S G L LE +
Sbjct: 231 -EIAHLRNELEEAQRSLESLEKKFRS-----------------EGGDLFEEREQLERQLK 272
Query: 917 ELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLN-------------- 962
E+E + +L EL P+ NL S L+KE + + N
Sbjct: 273 EIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQEELEERDKEL 332
Query: 963 -----------EEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRK 1011
E +I ++++ EI + LTQL L + V++ Q
Sbjct: 333 LESLPKLALPAEHVKEIAAELAEIDKPATTDSEIPHRLSGSELTQLEVLIQQVKRELQDA 392
Query: 1012 EDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLSEIMISD 1071
+ + K E E + ++ + I + + L E + + EL SE I +
Sbjct: 393 KSQLLK----ELRELEEELAEVDKKISTIPSEEQIAQL--LEELGEAQNELFRSEAEIEE 446
Query: 1072 LT 1073
L
Sbjct: 447 LL 448
Score = 32.7 bits (75), Expect = 1.9
Identities = 37/174 (21%), Positives = 67/174 (38%), Gaps = 22/174 (12%)
Query: 352 QYIQEEKQSHTHINEAQMKLGKLERDEETHKKLND--TLKTKLNNLADTLCLDTTAKSQY 409
+Q+E+QS Q +L ERD+E + L + +A L K
Sbjct: 307 AQLQKEEQSQ-QNQLTQEEL--EERDKELLESLPKLALPAEHVKEIAAELA--EIDKPAT 361
Query: 410 TPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQ 469
T S+ S++ LE + Q + L+ E ELE ++ +
Sbjct: 362 TD---------SEIPHRLSGSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDK 412
Query: 470 QIEGN------KKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLD 517
+I + L + NE+ +S++ ++ L +L + I+ L K+LD
Sbjct: 413 KISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLD 466
>gnl|CDD|224054 COG1131, CcmA, ABC-type multidrug transport system, ATPase component
[Defense mechanisms].
Length = 293
Score = 35.4 bits (82), Expect = 0.18
Identities = 15/68 (22%), Positives = 29/68 (42%), Gaps = 12/68 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
++A + + LDEPT+ LD ++ + + +++ TH E E L
Sbjct: 146 SIALALLHDPELLILDEPTSGLDPESRREIWELLRELAKEGGVTILLSTHILEEAEEL-- 203
Query: 1299 IDRAYVVR 1306
DR ++
Sbjct: 204 CDRVIILN 211
Score = 30.7 bits (70), Expect = 6.2
Identities = 7/19 (36%), Positives = 14/19 (73%)
Query: 29 LTLIVGENGCGKTTIIECI 47
+ ++G NG GKTT+++ +
Sbjct: 33 IFGLLGPNGAGKTTLLKIL 51
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 36.2 bits (84), Expect = 0.18
Identities = 66/402 (16%), Positives = 132/402 (32%), Gaps = 61/402 (15%)
Query: 749 QRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTA 808
+R LQELK + I SL+ +ELE +IE K L + KT
Sbjct: 881 ERQLQELK--------IDVKSISSLKLVNLELESEIIELKKSLSSD----LIENLEFKTE 928
Query: 809 LSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRS 868
L L + + +LE S + + N+L+ S
Sbjct: 929 LI-----ARLKKLLNN-----IDLEEGPSI--------------EYVKLPELNKLHEVES 964
Query: 869 KI-ESGQTR---LNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSV 924
K+ E+ + L ++ K +++ + + + + +L++ +L+ + V
Sbjct: 965 KLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELP-V 1023
Query: 925 YQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLE 984
EL+ + ++ T+L L LK + N+ A+ + + E +
Sbjct: 1024 EVAELQSASKIISSESTEL-SILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQ 1082
Query: 985 ILNY-TKRGTLTQLAALRESVQKLN----QRKEDIIAKRGVCERTINEINQSI--ANQSL 1037
+ + L + V N I + + + EI++ + +L
Sbjct: 1083 LYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEISKFLSQLVNTL 1142
Query: 1038 EEIDLKNNLTLLEKKEAVAKLNEELKLSEI----MISDLTKYHHTLENCVIKYHSQ---- 1089
E + K ++ LE + N E S + + S+
Sbjct: 1143 EPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKSKLSSSEVNDL 1202
Query: 1090 --KMRSINRLIREYWTRIYQLK--LSEIMISDLTKYHHTLEN 1127
++ ++ I W R +LK +SE + N
Sbjct: 1203 KNELIALFSKIFSGWPRGDKLKKLISEGWVPTEYSTSLKGFN 1244
Score = 31.6 bits (72), Expect = 4.6
Identities = 66/387 (17%), Positives = 131/387 (33%), Gaps = 77/387 (19%)
Query: 208 KALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHN-IE 266
+ L S+ R R + I +K +K+L + E L
Sbjct: 800 QPLLSLLGSRKEYRSYLACII-------KLQKTIKREKKLRETEEVEFSLKAEVLIQKFG 852
Query: 267 ESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQES-CNELESSIKQLFSGDKAEL 325
S+K +K L +KE + + + ER + + + SS+K + ++E+
Sbjct: 853 RSLKA--KKRFSLLKKETIYLQ--SAQRVELAERQLQELKIDVKSISSLKLVNLELESEI 908
Query: 326 QSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLN 385
++ +EN E K++ I K+ +N ++ G + +LN
Sbjct: 909 ------IELKKSLSSDLIENLE-FKTELIARLKK---LLNNIDLEEG-PSIEYVKLPELN 957
Query: 386 DTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNENT 445
KL+ E E +K + + L IL R
Sbjct: 958 -----KLH-------------------EVESKLKETSEEYEDLLKKSTILVR-------- 985
Query: 446 KQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRV 505
E + S++K+FK+++ K + ++ + + LQ+ +
Sbjct: 986 ----------EGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKII 1035
Query: 506 NSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELC-------VIDAEISILQAQNITLAE 558
+SE +LS +LK + Q L + D ++ L++ L
Sbjct: 1036 SSESTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQLESTENLLKT 1095
Query: 559 IKSLKNRKESKLADINLLKERHDRAFH 585
I K+ ++ + NL+K + F
Sbjct: 1096 INV----KDLEVTNRNLVKPANVLQFI 1118
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 36.1 bits (84), Expect = 0.19
Identities = 56/282 (19%), Positives = 113/282 (40%), Gaps = 42/282 (14%)
Query: 722 NKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIP----SLRSKL 777
+L+ + + Q +++ ++Q + +L L P + L D + +R +L
Sbjct: 847 VELERALADHESQEQQQRSQLEQAKEGLSALNRLLP---RLNLLADETLADRVEEIREQL 903
Query: 778 IELEEN--VIETKG----ELKKLKTALETPKTK----EKTALSLQGDLTLLDQNIRELNT 827
E EE ++ G +L+ + + L++ + ++ Q Q L
Sbjct: 904 DEAEEAKRFVQQHGNALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTE 963
Query: 828 L-QRELERQESKISGMRSTGVDL-DQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQ 885
+ QR + M + DL +++ + ++ + E R ++ Q +L +N+ L
Sbjct: 964 VVQRRAHFSYEDAAEMLAKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLA 1023
Query: 886 SLQ-----KQKNDIHSKQ----LTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKV 936
SL+ K++ KQ L V +G + R+ EL S ++ +L
Sbjct: 1024 SLKSSYDAKRQMLQELKQELQDLGVPADSGAEERARARRDELHARLSANRSRRNQL---- 1079
Query: 937 APIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEV 978
E QL ++E+D L K+ +KL DY + E+V
Sbjct: 1080 ---EKQLTFCEAEMDNLTKK-LRKLER------DYHEMREQV 1111
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 34.7 bits (80), Expect = 0.20
Identities = 32/129 (24%), Positives = 60/129 (46%), Gaps = 6/129 (4%)
Query: 745 LCKQQRSLQELKPVYENIMKLQDTD--IPSLRSKLIELEENVIETKGELKKLKTALETPK 802
L K + + E +L + + L+ +L +LEE + E + EL+ LK L+ +
Sbjct: 61 LLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLR 120
Query: 803 TKEKTALSLQGDLTLLDQNIRELNTLQRELERQE-SKISGMRSTGVDLDQVLAQQKEKKN 861
K SL+ L L+++I+EL REL + ++ +R L + L + +E
Sbjct: 121 ELLK---SLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQ 177
Query: 862 ELNTFRSKI 870
EL + ++
Sbjct: 178 ELQSLLEQL 186
>gnl|CDD|237894 PRK15064, PRK15064, ABC transporter ATP-binding protein; Provisional.
Length = 530
Score = 35.6 bits (83), Expect = 0.20
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 6/49 (12%)
Query: 1250 LAETFSRNCGIFALDEPTTNLDI------KNASDQKNFQLIVITHDEEF 1292
LA+ N I LDEPT NLDI ++ +++N +I+I+HD F
Sbjct: 166 LAQALFSNPDILLLDEPTNNLDINTIRWLEDVLNERNSTMIIISHDRHF 214
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 34.8 bits (80), Expect = 0.20
Identities = 26/121 (21%), Positives = 55/121 (45%), Gaps = 8/121 (6%)
Query: 840 SGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQS-LQKQKNDIHSKQ 898
G + +++Q LA+ +E+ EL+ R ++ LN E+L+ +Q+ + I +
Sbjct: 63 PGSDAAIAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAELR 122
Query: 899 LTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHK 958
+ ++ L + E E + Q EL + +LN + +L L+KE++
Sbjct: 123 SEITSLETEIRDLREELQEKEKDNETLQDEL-------ISLNIELNALEEKLRKLQKENQ 175
Query: 959 K 959
+
Sbjct: 176 E 176
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 35.4 bits (82), Expect = 0.21
Identities = 22/163 (13%), Positives = 64/163 (39%), Gaps = 31/163 (19%)
Query: 849 LDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGML 908
L + ++ LN+ + K+ + R ++ E+L+ L++ ++++
Sbjct: 156 LKEDYKLLMKELELLNSIKPKL---RDRKDALEEELRQLKQLEDELEDC----------- 201
Query: 909 KSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKI 968
E + + +L++L +++ +L + EL L+ K+ + K
Sbjct: 202 DPTELDR---------AKEKLKKLLQEIMIKVKKLEELEEELQELES----KIEDLTNKK 248
Query: 969 QDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRK 1011
+ ++ E ++ + + T ++ L+E ++ L
Sbjct: 249 SELNTEIAEAEKKLEQCRGF----TFKEIEKLKEQLKLLQSLT 287
Score = 32.3 bits (74), Expect = 2.0
Identities = 31/121 (25%), Positives = 58/121 (47%), Gaps = 13/121 (10%)
Query: 752 LQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSL 811
L+ LK Y+ +MK + + S++ KL + ++ + E +LK+L+ LE
Sbjct: 153 LEGLKEDYKLLMK-ELELLNSIKPKLRDRKDALEEELRQLKQLEDELE------------ 199
Query: 812 QGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIE 871
D T LD+ +L L +E+ + K+ + +L+ + KK+ELNT ++ E
Sbjct: 200 DCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAE 259
Query: 872 S 872
Sbjct: 260 K 260
Score = 31.9 bits (73), Expect = 2.3
Identities = 22/104 (21%), Positives = 49/104 (47%), Gaps = 4/104 (3%)
Query: 471 IEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIR 530
+EG K+D ++ ++ +N S L+ + + + E+ QL + D + + E R
Sbjct: 153 LEGLKEDYKLLMKELELLN---SIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELD-R 208
Query: 531 QRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKESKLADIN 574
+ +L+ L I ++ L+ L E++S +K +++N
Sbjct: 209 AKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELN 252
>gnl|CDD|236243 PRK08327, PRK08327, acetolactate synthase catalytic subunit;
Validated.
Length = 569
Score = 35.7 bits (83), Expect = 0.22
Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 12/72 (16%)
Query: 24 RFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKN------FVHDPRIGKKDEA 77
R+ P+ ++V NG K A+ YP+G ++ K F P K EA
Sbjct: 474 RYGLPVLVVVFNNGGWLAV-----KEAVLEVYPEGYAARKGTFPGTDFDPRPDFAKIAEA 528
Query: 78 SASSGKNFVHDP 89
G+ V DP
Sbjct: 529 FGGYGER-VEDP 539
>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
activities. AAA - ATPases associated with a variety of
cellular activities. This profile/alignment only
detects a fraction of this vast family. The poorly
conserved N-terminal helix is missing from the
alignment.
Length = 148
Score = 33.9 bits (77), Expect = 0.24
Identities = 9/18 (50%), Positives = 11/18 (61%)
Query: 30 TLIVGENGCGKTTIIECI 47
LIVG G GKTT+ +
Sbjct: 5 ILIVGPPGSGKTTLARAL 22
>gnl|CDD|236870 PRK11174, PRK11174, cysteine/glutathione ABC transporter
membrane/ATP-binding component; Reviewed.
Length = 588
Score = 35.6 bits (83), Expect = 0.24
Identities = 19/68 (27%), Positives = 34/68 (50%), Gaps = 15/68 (22%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
ALA + C + LDEPT +LD + AS ++ +++TH +E+L
Sbjct: 495 ALARALLQPCQLLLLDEPTASLDAHSEQLVMQALNAASRRQT--TLMVTHQ---LEDLAQ 549
Query: 1299 IDRAYVVR 1306
D+ +V++
Sbjct: 550 WDQIWVMQ 557
>gnl|CDD|222036 pfam13304, AAA_21, AAA domain.
Length = 256
Score = 34.7 bits (79), Expect = 0.25
Identities = 7/21 (33%), Positives = 14/21 (66%)
Query: 29 LTLIVGENGCGKTTIIECIKF 49
L +I+G NG GK+ +++ +
Sbjct: 1 LNVIIGPNGSGKSNLLKALAL 21
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869). This
family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 35.7 bits (82), Expect = 0.25
Identities = 56/299 (18%), Positives = 111/299 (37%), Gaps = 18/299 (6%)
Query: 732 PEQTNNKKTHIDQLCKQQRSLQE-LKPVYENIMKLQDTDIPSLRS--KLIELEENVIETK 788
+ K + +L ++ KL D L+S KL+ + + E
Sbjct: 397 ESSADGSKVSSPKCSDSSVALTGPVEHEDNLESKLSDRLPEVLQSVLKLVMEKHIISEIS 456
Query: 789 GELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVD 848
L+ ++ L++ + ++ ++ + E + + + E +S +
Sbjct: 457 EILEDIEAELDS-LERSSNGDDENEEVAMVGSLVEESSAVIERSQELEGAVSRISEFTSV 515
Query: 849 LDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGML 908
L+ + ++ + I+ L + +SLQ D+ + ++ L
Sbjct: 516 LEHEVTVCQDLLDGKADLEKFIQEFSLTLEWVVNQEKSLQ----DVSVEASEIKKNFLGL 571
Query: 909 KSLEDR-KCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAK 967
KS E E +V + LEEL L + LD + K L E K
Sbjct: 572 KSSEKEINSPDEVKGAVCISTLEEL--------ETLKSEKENLDGELSKCKDDLEESKNK 623
Query: 968 IQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTIN 1026
+Q+ K+LEE+K +L+ + TQL + ES + L R +D+ A+ + I+
Sbjct: 624 LQETEKKLEELKS-ELDASQESNSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKIS 681
>gnl|CDD|213195 cd03228, ABCC_MRP_Like, ATP-binding cassette domain of multidrug
resistance protein-like transporters. The MRP (Multidrug
Resistance Protein)-like transporters are involved in
drug, peptide, and lipid export. They belong to the
subfamily C of the ATP-binding cassette (ABC) superfamily
of transport proteins. The ABCC subfamily contains
transporters with a diverse functional spectrum that
includes ion transport, cell surface receptor, and toxin
secretion activities. The MRP-like family, similar to all
ABC proteins, have a common four-domain core structure
constituted by two membrane-spanning domains, each
composed of six transmembrane (TM) helices, and two
nucleotide-binding domains (NBD). ABC transporters are a
subset of nucleotide hydrolases that contain a signature
motif, Q-loop, and H-loop/switch region, in addition to,
the Walker A motif/P-loop and Walker B motif commonly
found in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 171
Score = 33.9 bits (79), Expect = 0.27
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 11/66 (16%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASD-QKNFQ-------LIVITHDEEFIENLTAID 1300
A+A R+ I LDE T+ LD + + + + +IVI H + + D
Sbjct: 106 AIARALLRDPPILILDEATSALDPETEALILEALRALAKGKTVIVIAHR---LSTIRDAD 162
Query: 1301 RAYVVR 1306
R V+
Sbjct: 163 RIIVLD 168
Score = 31.2 bits (72), Expect = 2.5
Identities = 7/18 (38%), Positives = 13/18 (72%)
Query: 30 TLIVGENGCGKTTIIECI 47
IVG +G GK+T+++ +
Sbjct: 31 VAIVGPSGSGKSTLLKLL 48
>gnl|CDD|183016 PRK11176, PRK11176, lipid transporter ATP-binding/permease protein;
Provisional.
Length = 582
Score = 35.4 bits (82), Expect = 0.27
Identities = 22/59 (37%), Positives = 27/59 (45%), Gaps = 8/59 (13%)
Query: 1245 RDHKALAETFSRNCGIFALDEPTTNLD------IKNASD--QKNFQLIVITHDEEFIEN 1295
R A+A R+ I LDE T+ LD I+ A D QKN +VI H IE
Sbjct: 486 RQRIAIARALLRDSPILILDEATSALDTESERAIQAALDELQKNRTSLVIAHRLSTIEK 544
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 35.5 bits (81), Expect = 0.27
Identities = 95/494 (19%), Positives = 195/494 (39%), Gaps = 67/494 (13%)
Query: 483 TQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVI 542
Q+ E+N++++ + T+ +++L ++ KNE D+L +I
Sbjct: 335 AQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNE------------DQLKII 382
Query: 543 DAEISILQAQNITLAEIKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDK 602
E LQ ++ L E+ KN KE +L ++ + ++ LL + E L
Sbjct: 383 TME---LQKKSSELEEMTKFKNNKEVELEELKKILAEDEK---LLDEKKQFEKIAEELKG 436
Query: 603 ALSSITFDINRIQEDINAKEKHLYTLEANVSNSSKTLRDQKRTLA-ELMDRMELV----- 656
+ F + +++I+ E L ++ + + K + D K L E + +EL
Sbjct: 437 KEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDK 496
Query: 657 --LGSKPFEDELDRVTLELKREQEEVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESD 714
L +K E +TLELK+ QE++ + I LEE E
Sbjct: 497 LLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMN----------- 545
Query: 715 YSVPGLVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLR 774
L ++L++ +E ++ + K +D+ + RS++ E MK+ + +L+
Sbjct: 546 -----LRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLK 600
Query: 775 SKLIELEENVIETKGELKKLKTALETPKTK----EKTALSLQGDLTLLDQNIREL-NTLQ 829
++ +N+ E E K LK + E L+ +L Q E+ + Q
Sbjct: 601 KQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQ 660
Query: 830 RELE----RQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQ 885
+E+E +E + + D+ + QKE + KI + H +
Sbjct: 661 KEIEDKKISEEKLLEEVEKAKAIADEAVKLQKEIDKRC---QHKIAEMVALMEKHKHQYD 717
Query: 886 SLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNL 945
+ ++++ L ++++ E + + EL + ++ ++ QL +
Sbjct: 718 KIIEERDS-------------ELGLYKNKEQEQSSAKAALEIELSNIKAELLSLKKQLEI 764
Query: 946 AQSELDALKKEHKK 959
+ E + LK E K+
Sbjct: 765 EKEEKEKLKMEAKE 778
>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I. coiled-coil structure. It
has been suggested that the domains I and II from laminin
A, B1 and B2 may come together to form a triple helical
coiled-coil structure.
Length = 263
Score = 34.7 bits (80), Expect = 0.31
Identities = 43/243 (17%), Positives = 90/243 (37%), Gaps = 20/243 (8%)
Query: 805 EKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELN 864
E L G L+ + R+L L++ L + + +Q LA+ ++ +
Sbjct: 26 ENLTQQLSGRLSPISATKRQLEDLEKALSKLAQDA---ENLQKKANQALAKAQKVNAAIE 82
Query: 865 TFRSKIESGQTRLNSH-------NEKLQSLQKQKNDIHSKQL--TVQGGAGMLKSLEDRK 915
++ + + EKL L + + L + ML + +R
Sbjct: 83 RTLGHAKTLAEAIKNLRDNIKEILEKLAGLGENAFALPGSALSRMLAEAQRMLGEIRER- 141
Query: 916 CELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQL 975
+ + + EL+ + ++T Q E + L + + LNE AK+ D + L
Sbjct: 142 -DFQKQLQNAEAELKAAEELLNRVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLRELL 200
Query: 976 EEV--KRIKLEILNYTKRGTLTQLAALRESVQKLNQRKEDIIAKRG----VCERTINEIN 1029
+E K E LN + L + ++ V +L + E+ +A + ++
Sbjct: 201 DEAQAKTRDAERLNLANQKRLREFQEKKQEVSELKNQAEEHLATGRDSLDQANLLLQLMD 260
Query: 1030 QSI 1032
Q++
Sbjct: 261 QAL 263
>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
permease components [Defense mechanisms].
Length = 567
Score = 35.1 bits (81), Expect = 0.32
Identities = 26/108 (24%), Positives = 42/108 (38%), Gaps = 25/108 (23%)
Query: 1208 RLFISDQKNFQLIVITHDEEFIENL-----TAIDRAYV-------VRIVRDHKALAETFS 1255
R +D++ + + + + EFI NL T + V R+ A+A
Sbjct: 427 RPDATDEEIEEALKLANAHEFIANLPDGYDTIVGERGVNLSGGQRQRL-----AIARALL 481
Query: 1256 RNCGIFALDEPTTNLDIKNASD--------QKNFQLIVITHDEEFIEN 1295
RN I LDE T+ LD + + K ++I H I+N
Sbjct: 482 RNPPILILDEATSALDTETEALIQDALKKLLKGRTTLIIAHRLSTIKN 529
>gnl|CDD|131266 TIGR02211, LolD_lipo_ex, lipoprotein releasing system, ATP-binding
protein. This model represents LolD, a member of the ABC
transporter family (pfam00005). LolD is involved in
localization of lipoproteins in some bacteria. It works
with a transmembrane protein LolC, which in some species
is a paralogous pair LolC and LolE. Depending on whether
the residue immediately following the new, modified
N-terminal Cys residue, the nascent lipoprotein may be
carried further by LolA and LolB to the outer membrane,
or remain at the inner membrane. The top scoring proteins
excluded by this model include homologs from the archaeal
genus Methanosarcina [Protein fate, Protein and peptide
secretion and trafficking].
Length = 221
Score = 34.2 bits (79), Expect = 0.32
Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 10/62 (16%)
Query: 1245 RDHKALAETFSRNCGIFALDEPTTNLDIKNASD----------QKNFQLIVITHDEEFIE 1294
R A+A + DEPT NLD NA + N +V+THD E +
Sbjct: 147 RQRVAIARALVNQPSLVLADEPTGNLDNNNAKIIFDLMLELNRELNTSFLVVTHDLELAK 206
Query: 1295 NL 1296
L
Sbjct: 207 KL 208
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 35.2 bits (82), Expect = 0.32
Identities = 43/232 (18%), Positives = 94/232 (40%), Gaps = 34/232 (14%)
Query: 456 EKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKS 515
+ +++E +I+ K+QI+ N L ++E + K + IDQL
Sbjct: 250 DHLDIEKEIQDLKEQIDENLALLEE--LDLDEAEE----------KNEEIQERIDQLY-- 295
Query: 516 LDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITL-AEIKSLKNR---KESKLA 571
D L+ E++A R +E + + + QN L EI +K ES+L
Sbjct: 296 ---DILEREVKA----RKYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELE 348
Query: 572 DI-NLLKERH--DRAFHLLFDMIPEEN-----FKNSLDKALSSITFDINRIQEDINAKEK 623
+ L K+ ++ + + + I E+ + L++ L + +I + QE ++ +
Sbjct: 349 SVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLE-EIEKEQEKLSEMLQ 407
Query: 624 HLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKR 675
L E + R++ + +++ L + + + V+ E++
Sbjct: 408 GLRKDELEAREKLERYRNKLHEIKRYLEKSNLPGLPEDYLEMFFEVSDEIEA 459
Score = 31.3 bits (72), Expect = 4.5
Identities = 71/357 (19%), Positives = 143/357 (40%), Gaps = 63/357 (17%)
Query: 257 QSFEELHN-----IEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDM--------- 302
+ FEE + S+ I E+L + E +++ + E +
Sbjct: 64 EKFEEWRQKWDEIVTNSLPDIEEQLFEAEELNDKFRFRKAKHEINEIESLLDLIEEDIEQ 123
Query: 303 IQESCNELESS----------IKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQ 352
I E EL S +K L+ + L + F LDE +LEN E SQ
Sbjct: 124 ILEELQELLESEEKNREEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLENLEEEFSQ 183
Query: 353 YIQEEK-----QSHTHINEAQMKLGKLERDEET----HKKLNDTLKTKLNNLADTLCLDT 403
+++ + ++ +++ + +L LE+ E K+L L +L L +
Sbjct: 184 FVELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTELPDQLQELKAG-YREL 242
Query: 404 TAKSQYTPEEG-EGLIKMSQTTIDKYLSDIKILERTFSDNENTK-QAEINAL--IVEKVE 459
+ + E I+ + ID+ L+ ++ L+ ++ +N + Q I+ L I+E+ E
Sbjct: 243 VEEGYHLDHLDIEKEIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQLYDILER-E 301
Query: 460 LESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKS--LD 517
++++ + +E N L + + E N + L+ EID++ +S L+
Sbjct: 302 VKAR-----KYVEKNSDTLPDFLEHAKEQN------KELKE-------EIDRVKQSYTLN 343
Query: 518 PDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKESKLADIN 574
+L++ + +Q LE + I I+ Q I +E++ +L +I
Sbjct: 344 ESELES-VRQLEKQLESLEKQYDEITERIA---EQEIAYSELQEELEEILKQLEEIE 396
Score = 30.6 bits (70), Expect = 8.1
Identities = 68/337 (20%), Positives = 136/337 (40%), Gaps = 48/337 (14%)
Query: 768 TDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDL--TLLDQN---- 821
+I + S L +EE++ + EL++L + E + + + L +L +LL
Sbjct: 105 HEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANRFSFG 164
Query: 822 --IRELNTLQRELERQESKISGMRSTG--VDLDQVLAQQKEKKNELNTFRSKIESGQTRL 877
+ EL LE + S+ + +G V+ ++L Q +E+ L +I L
Sbjct: 165 PALDELEKQLENLEEEFSQFVELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKEL 224
Query: 878 NSHNEKLQSLQKQKNDIHS--KQLTVQGGA----GMLKSLEDRKCELEG-MDSVYQTELE 930
L Q ++ + ++L +G + K ++D K +++ + + + +L+
Sbjct: 225 Q------TELPDQLQELKAGYRELVEEGYHLDHLDIEKEIQDLKEQIDENLALLEELDLD 278
Query: 931 ELGRKVAPIETQLN----LAQSELDALKKEHKKKLNEEGAKIQDY-TKQLEEVKRIKLEI 985
E K I+ +++ + + E+ A +K + + + D+ E+ K +K EI
Sbjct: 279 EAEEKNEEIQERIDQLYDILEREVKA-----RKYVEKNSDTLPDFLEHAKEQNKELKEEI 333
Query: 986 ----LNYT--------KRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIA 1033
+YT R QL +L + ++ +R + + + EI + +
Sbjct: 334 DRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLE 393
Query: 1034 NQSLEEIDLKNNLTLLEKKEAVA--KLNE-ELKLSEI 1067
E+ L L L K E A KL KL EI
Sbjct: 394 EIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEI 430
>gnl|CDD|213226 cd03259, ABC_Carb_Solutes_like, ATP-binding cassette domain of the
carbohydrate and solute transporters-like. This family
is comprised of proteins involved in the transport of
apparently unrelated solutes and proteins specific for
di- and oligosaccharides and polyols. ABC transporters
are a large family of proteins involved in the transport
of a wide variety of different compounds, like sugars,
ions, peptides and more complex organic molecules. The
nucleotide-binding domain shows the highest similarity
between all members of the family. ABC transporters are a
subset of nucleotide hydrolases that contain a signature
motif, Q-loop, and H-loop/switch region, in addition to,
the Walker A motif/P-loop and Walker B motif commonly
found in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 213
Score = 34.0 bits (79), Expect = 0.32
Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 12/68 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASD--------QKNFQLIVI--THDEEFIENLTA 1298
ALA +R + LDEP + LD K + Q+ + I THD+E E L
Sbjct: 140 ALARALAREPSLLLLDEPLSALDAKLREELREELKELQRELGITTIYVTHDQE--EALAL 197
Query: 1299 IDRAYVVR 1306
DR V+
Sbjct: 198 ADRIAVMN 205
Score = 31.0 bits (71), Expect = 3.2
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 29 LTLIVGENGCGKTTIIECI 47
++G +GCGKTT++ I
Sbjct: 28 FLALLGPSGCGKTTLLRLI 46
>gnl|CDD|213209 cd03242, ABC_RecF, ATP-binding cassette domain of RecF. RecF is
a recombinational DNA repair ATPase that maintains
replication in the presence of DNA damage. When
replication is prematurely disrupted by DNA damage,
several recF pathway gene products play critical roles
processing the arrested replication fork, allowing it
to resume and complete its task. This CD represents the
nucleotide binding domain of RecF. RecF belongs to a
large superfamily of ABC transporters involved in the
transport of a wide variety of different compounds
including sugars, ions, peptides, and more complex
organic molecules. The nucleotide binding domain shows
the highest similarity between all members of the
family. ABC transporters are a subset of nucleotide
hydrolases with a signature motif, Q-loop, and
H-loop/switch region, in addition to, the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 270
Score = 34.6 bits (80), Expect = 0.33
Identities = 16/49 (32%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 4 LDQLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALT 52
L L + RN+ + + F+ +T++VGEN GKT ++E I T
Sbjct: 1 LKSLELRNFRNYAELE---LEFEPGVTVLVGENAQGKTNLLEAISLLAT 46
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 34.9 bits (81), Expect = 0.33
Identities = 49/284 (17%), Positives = 94/284 (33%), Gaps = 51/284 (17%)
Query: 363 HINEAQMKLGKLERDEETHKKLNDTLKT--KLNNLADTLCLDTTAKSQYTPEEGEGLIKM 420
HI + + + ER + L + KL + L K + + + E LIK
Sbjct: 32 HIEDLK-EELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKD 90
Query: 421 SQTTIDKYLSDIKILERTFSDNENTK---QAEINALIV---------------------- 455
+ ++K +IK LE S+ EN + EI L
Sbjct: 91 VEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVG 150
Query: 456 ----EKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQ 511
+K+E + + + + S + L+ ++ E +
Sbjct: 151 TVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELK----KLGFERLE 206
Query: 512 LSKSLDPDQ----LKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKE 567
L + P + +K E+E ++R L +EL + + EI+ +
Sbjct: 207 LEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEAL 266
Query: 568 SKLADINLLKERHDRAFHLL-FDMIPEENFKNSLDKALSSITFD 610
SK + D+ F + + +PE+ K L + + T
Sbjct: 267 SKFL-------KTDKTFAIEGW--VPEDRVKK-LKELIDKATGG 300
Score = 34.1 bits (79), Expect = 0.73
Identities = 43/247 (17%), Positives = 90/247 (36%), Gaps = 33/247 (13%)
Query: 764 KLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTK------------EKTALSL 811
+L + + LRS L +L E + + + L KL E K E+ +
Sbjct: 39 ELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKI 98
Query: 812 QGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIE 871
+ ++ L++ I EL +ELE++ ++ + +DL L +
Sbjct: 99 EKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDL----------SLLLGFKYVSVF 148
Query: 872 SGQTRLNSHNEKLQSLQKQ-----KNDIHSKQLTVQGGAGMLKSLED--RKCELEGMDSV 924
G + E + D + V + +E+ +K E ++
Sbjct: 149 VGTVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELE 208
Query: 925 YQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLE 984
+ EL R+ I+ +L + E ++L +E K+ + ++ + LE + E
Sbjct: 209 EEGTPSELIRE---IKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELE-RAE 264
Query: 985 ILNYTKR 991
L+ +
Sbjct: 265 ALSKFLK 271
>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968). This
family of proteins has no known function.
Length = 192
Score = 33.9 bits (78), Expect = 0.34
Identities = 16/75 (21%), Positives = 38/75 (50%), Gaps = 10/75 (13%)
Query: 828 LQRELERQESKISGMRSTGVDLD-------QVLAQQKEKKNELNTFRSKIESGQTRLNSH 880
L+R++ E++ +++ DL QV A+Q++ + E ++ ++ Q +L
Sbjct: 117 LERQIAASEARAERLQA---DLQLAQAQEQQVAARQQQARQEAVALEAQRQAAQAQLRKL 173
Query: 881 NEKLQSLQKQKNDIH 895
+++ LQ Q+N+
Sbjct: 174 QRQIRQLQAQQNEPI 188
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number of
Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 35.0 bits (81), Expect = 0.34
Identities = 36/164 (21%), Positives = 64/164 (39%), Gaps = 30/164 (18%)
Query: 908 LKSLEDRKCELEGMDS-VYQTELEELGRKVAPIETQLNLAQSELDALKK----------- 955
LK+ + + Y +E+ E ++ +LN A ++ DALK+
Sbjct: 184 LKAFKQENGGILPDQEGDYYSEISEAQEELEAARLELNEAIAQRDALKRQLGGEEPVLLA 243
Query: 956 EHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKR-----GTLTQLAALRESVQKLNQR 1010
+E +I+ KQL+ ++ L YT + T ++A L E Q+
Sbjct: 244 GSSVANSELDGRIEALEKQLDALR------LRYTDKHPDVIATKREIAQLEE------QK 291
Query: 1011 KEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKKEA 1054
+E+ AK G ER N +E + + + LE + A
Sbjct: 292 EEEGSAKNGGPERGEI-ANPVYQQLQIELAEAEAEIASLEARVA 334
Score = 32.3 bits (74), Expect = 2.3
Identities = 31/154 (20%), Positives = 57/154 (37%), Gaps = 26/154 (16%)
Query: 444 NTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLN 503
+ Q E+ A +E E ++ + K+Q+ G + L + N + ++ L+ +L+
Sbjct: 207 SEAQEELEAARLELNEAIAQRDALKRQLGGEEPVLLAGSSVANSELDGR--IEALEKQLD 264
Query: 504 --------------RVNSEIDQLSKSLDPD-QLKNEIEAWIRQRNELEDELCVIDAEISI 548
EI QL + + + KN N + +L +I +
Sbjct: 265 ALRLRYTDKHPDVIATKREIAQLEEQKEEEGSAKNGGPERGEIANPVYQQL-----QIEL 319
Query: 549 LQAQNITLAEIKSLKNRKESKLADINLLKERHDR 582
+A AEI SL+ R A I L+
Sbjct: 320 AEA----EAEIASLEARVAELTARIERLESLLRT 349
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 34.6 bits (80), Expect = 0.39
Identities = 27/120 (22%), Positives = 46/120 (38%), Gaps = 15/120 (12%)
Query: 776 KLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQ 835
L + + K +L+K AL K + + L +L L D EL ++EL
Sbjct: 165 LLNKDLNLINSIKPKLRKKLQAL---KEEIASLRQLADELNLCDPL--ELEKARQELRSL 219
Query: 836 ESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIH 895
KIS R Q +E + EL IE+ + + E++ +K + +
Sbjct: 220 SVKISEKRK----------QLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECR 269
Score = 33.1 bits (76), Expect = 1.2
Identities = 31/163 (19%), Positives = 63/163 (38%), Gaps = 32/163 (19%)
Query: 849 LDQVLAQQKEKKNELNTFRSKIES----GQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGG 904
L++ L K + LN + I S + +L + E++ SL++ ++++
Sbjct: 152 LEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPL---- 207
Query: 905 AGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEE 964
ELE + EL L K++ QL Q EL E +
Sbjct: 208 ------------ELEKA----RQELRSLSVKISEKRKQLEELQQELQ----ELTIAIEAL 247
Query: 965 GAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKL 1007
K + +++ E ++I+ E + + +++ L+ V L
Sbjct: 248 TNKKSELLEEIAEAEKIREEC----RGWSAKEISKLKAKVSLL 286
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 34.6 bits (79), Expect = 0.42
Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 774 RSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDL 815
R + EL+E +IET +KKLK E+ KE A + + ++
Sbjct: 468 RKRGGELKEELIETLKLVKKLKEEQES-LAKELEATAHKSEI 508
>gnl|CDD|226361 COG3842, PotA, ABC-type spermidine/putrescine transport systems,
ATPase components [Amino acid transport and metabolism].
Length = 352
Score = 34.6 bits (80), Expect = 0.44
Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 12/68 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
ALA + LDEP + LD +K + + +THD+E E L
Sbjct: 146 ALARALVPEPKVLLLDEPLSALDAKLREQMRKELKELQRELGITFVYVTHDQE--EALAM 203
Query: 1299 IDRAYVVR 1306
DR V+
Sbjct: 204 SDRIAVMN 211
Score = 30.7 bits (70), Expect = 6.2
Identities = 8/16 (50%), Positives = 13/16 (81%)
Query: 32 IVGENGCGKTTIIECI 47
++G +GCGKTT++ I
Sbjct: 36 LLGPSGCGKTTLLRMI 51
>gnl|CDD|213208 cd03241, ABC_RecN, ATP-binding cassette domain of RecN. RecN
ATPase involved in DNA repair; similar to ABC
(ATP-binding cassette) transporter nucleotide-binding
domain; ABC transporters are a large family of proteins
involved in the transport of a wide variety of
different compounds including sugars, ions, peptides,
and more complex organic molecules. The nucleotide
binding domain shows the highest similarity between all
members of the family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 276
Score = 34.1 bits (79), Expect = 0.46
Identities = 15/64 (23%), Positives = 30/64 (46%), Gaps = 9/64 (14%)
Query: 12 IRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRI 71
I+NF + + F+ LT++ GE G GK+ +++ + L G + + + R
Sbjct: 6 IKNFALIEELELDFEEGLTVLTGETGAGKSILLDALSLLL------GGRASADLI---RS 56
Query: 72 GKKD 75
G +
Sbjct: 57 GAEK 60
>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the
AAA superfamily.
Length = 320
Score = 34.3 bits (79), Expect = 0.46
Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 12 IRNFPADKNRV-VRFQRPLTLIVGENGCGKTTIIECIKFALTNE 54
I+NF + KN + + +I+GENG GKTTI+E + +
Sbjct: 6 IKNFKSIKNLEEISLNEGINVIIGENGSGKTTILEALDNLKEKK 49
>gnl|CDD|224046 COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component
[Inorganic ion transport and metabolism].
Length = 254
Score = 33.7 bits (78), Expect = 0.50
Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 9/49 (18%)
Query: 1250 LAETFSRNCGIFALDEPTTNLDIKNASD---------QKNFQLIVITHD 1289
LA ++N + LDEP T +D+ + Q+ ++++THD
Sbjct: 150 LARALAQNPDLLLLDEPFTGVDVAGQKEIYDLLKELRQEGKTVLMVTHD 198
Score = 31.4 bits (72), Expect = 3.1
Identities = 8/19 (42%), Positives = 15/19 (78%)
Query: 29 LTLIVGENGCGKTTIIECI 47
+T ++G NG GK+T+++ I
Sbjct: 32 ITALIGPNGAGKSTLLKAI 50
>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF. Sequences in
this family of proteins are members of the chain length
determinant family (pfam02706) which includes the wzc
protein from E.coli. This family of proteins are
homologous to the EpsF protein of the methanolan
biosynthesis operon of Methylobacillus species strain
12S. The distribution of this protein appears to be
restricted to a subset of exopolysaccharide operons
containing a syntenic grouping of genes including a
variant of the EpsH exosortase protein. Exosortase has
been proposed to be involved in the targetting and
processing of proteins containing the PEP-CTERM domain to
the exopolysaccharide layer.
Length = 444
Score = 34.4 bits (79), Expect = 0.52
Identities = 49/239 (20%), Positives = 94/239 (39%), Gaps = 48/239 (20%)
Query: 824 ELNTLQRELERQESKISG-MRSTG-VDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHN 881
++ L+ +L R +SK+S + G V D+ L ++ + NEL+
Sbjct: 179 QIAALREDLARAQSKLSAYQQEKGIVSSDERLDVERARLNELS----------------- 221
Query: 882 EKLQSLQKQKNDIHSKQLTVQGGAGM--------LKSLEDRKCELEGMDSVYQTELEELG 933
+L + Q Q D SK+ G + +++L+ E +L EL
Sbjct: 222 AQLVAAQAQVMDASSKEGGSSGKDALPEVIANPIIQNLKTDIARAES-------KLAELS 274
Query: 934 RKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGT 993
+++ P Q AQ+E+++LK + LN E K+ + + +
Sbjct: 275 QRLGPNHPQYKRAQAEINSLKSQ----LNAEIKKVTSSVGTNSRILKQREA-------EL 323
Query: 994 LTQLAALRESVQKLN-QRKEDIIAKRGV--CERTINEINQSIANQSLEEIDLKNNLTLL 1049
L + V +LN QR E + +R V +R + Q +E + ++++L
Sbjct: 324 REALENQKAKVLELNRQRDEMSVLQRDVENAQRAYDAAMQRYTQTRIEAQSNQTDISIL 382
>gnl|CDD|132313 TIGR03269, met_CoM_red_A2, methyl coenzyme M reductase system,
component A2. The enzyme that catalyzes the final step
in methanogenesis, methyl coenzyme M reductase, contains
alpha, beta, and gamma chains. In older literature, the
complex of alpha, beta, and gamma chains was termed
component C, while this single chain protein was termed
methyl coenzyme M reductase system component A2 [Energy
metabolism, Methanogenesis].
Length = 520
Score = 34.4 bits (79), Expect = 0.53
Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 12/72 (16%)
Query: 1245 RDHKALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIE 1294
R ALA+ + I LDEPT +D I A ++ I+++HD +F+
Sbjct: 433 RHRVALAQVLIKEPRIVILDEPTGTMDPITKVDVTHSILKAREEMEQTFIIVSHDMDFVL 492
Query: 1295 NLTAIDRAYVVR 1306
++ DRA ++R
Sbjct: 493 DVC--DRAALMR 502
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 33.6 bits (77), Expect = 0.56
Identities = 33/181 (18%), Positives = 72/181 (39%), Gaps = 10/181 (5%)
Query: 804 KEKTALSLQ--GDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKN 861
KE+ S DL ++ +++ L+ EL E ++ + G L + E+
Sbjct: 19 KEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEE-- 76
Query: 862 ELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGM 921
+ ++E R E + +++ + Q + + + LE+++ L
Sbjct: 77 ----IQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASE 132
Query: 922 DSVYQ-TELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQ-LEEVK 979
D +EEL +K +E +L + L +L + ++ L E + ++ LEE+
Sbjct: 133 DLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELN 192
Query: 980 R 980
Sbjct: 193 E 193
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 34.4 bits (80), Expect = 0.60
Identities = 36/218 (16%), Positives = 77/218 (35%), Gaps = 26/218 (11%)
Query: 329 LNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTL 388
L+ I D + LE+ E K ++ K+ H+ + E K + L
Sbjct: 11 LDQINIPDDLQSEALESAEIEKVVVDKKSKKWEFHLKFPNIL------PIEDFKLFKEKL 64
Query: 389 KTKLNNLAD--TLCLDTTAKSQYTPEEGEG----LIKMSQTTIDKYLSDIKILERTFSDN 442
K +++AD +T E +I+ ++ + S +K + N
Sbjct: 65 KQSFSHIADIKVTFSIEVENITFTEELLLDYWNEIIEKAKKNSPLFKSLLKKQKVEVEGN 124
Query: 443 ENT--KQAEINALIVEKVELESKIKSFKQ----------QIEGNKKDLTNVITQINEVNQ 490
+ EI ++K L IK +++ +I+ +K++L Q E ++
Sbjct: 125 KLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDE 184
Query: 491 S--QSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIE 526
+ L+ ++ + P Q+ +I+
Sbjct: 185 KLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKID 222
>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the
AAA superfamily.
Length = 154
Score = 32.6 bits (74), Expect = 0.63
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 20 NRVVRFQRPLTLIVGENGCGKTTIIECIKFAL 51
R P L+ G +G GKT+++ + L
Sbjct: 17 RRARSGGPPSVLLTGPSGTGKTSLLRELLEGL 48
>gnl|CDD|223521 COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport
system, ATPase component [Amino acid transport and
metabolism / Inorganic ion transport and metabolism].
Length = 316
Score = 33.7 bits (78), Expect = 0.64
Identities = 21/58 (36%), Positives = 27/58 (46%), Gaps = 15/58 (25%)
Query: 1264 DEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTAIDRAYVV---RIV 1308
DEPTT LD +K +K LI+ITHD + + DR V+ RIV
Sbjct: 178 DEPTTALDVTVQAQILDLLKELQREKGTALILITHDLGVVAEIA--DRVAVMYAGRIV 233
>gnl|CDD|213206 cd03239, ABC_SMC_head, The SMC head domain belongs to the
ATP-binding cassette superfamily. The structural
maintenance of chromosomes (SMC) proteins are essential
for successful chromosome transmission during
replication and segregation of the genome in all
organisms. SMCs are generally present as single
proteins in bacteria, and as at least six distinct
proteins in eukaryotes. The proteins range in size from
approximately 110 to 170 kDa, and each has five
distinct domains: amino- and carboxy-terminal globular
domains, which contain sequences characteristic of
ATPases, two coiled-coil regions separating the
terminal domains , and a central flexible hinge. SMC
proteins function together with other proteins in a
range of chromosomal transactions, including chromosome
condensation, sister-chromatid cohesion, recombination,
DNA repair, and epigenetic silencing of gene
expression.
Length = 178
Score = 33.1 bits (76), Expect = 0.67
Identities = 15/41 (36%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 12 IRNFPA-DKNRVVRFQRPLTLIVGENGCGKTTIIECIKFAL 51
++NF + VV IVG NG GK+ I++ I F L
Sbjct: 6 LKNFKSYRDETVVGGSNSFNAIVGPNGSGKSNIVDAICFVL 46
>gnl|CDD|213205 cd03238, ABC_UvrA, ATP-binding cassette domain of the excision repair
protein UvrA. Nucleotide excision repair in eubacteria
is a process that repairs DNA damage by the removal of a
12-13-mer oligonucleotide containing the lesion.
Recognition and cleavage of the damaged DNA is a
multistep ATP-dependent reaction that requires the UvrA,
UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases,
with UvrA having two ATP binding sites, which have the
characteristic signature of the family of ABC proteins,
and UvrB having one ATP binding site that is structurally
related to that of helicases.
Length = 176
Score = 32.7 bits (75), Expect = 0.70
Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 11/45 (24%)
Query: 1260 IFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIE 1294
+F LDEP+T L IK D N +I+I H+ + +
Sbjct: 110 LFILDEPSTGLHQQDINQLLEVIKGLIDLGN-TVILIEHNLDVLS 153
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 33.5 bits (77), Expect = 0.71
Identities = 52/270 (19%), Positives = 107/270 (39%), Gaps = 27/270 (10%)
Query: 810 SLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSK 869
L+ L + I EL + EL ++ S+++ R D++ A+ +E + + R +
Sbjct: 10 ELELKRKQLKEEIEELKEKRDELRKEASELAEKR------DELNAKVRELREKAQELREE 63
Query: 870 -------IESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMD 922
++ + + + N KLQ L+K+ ++ K+ G +KSLE LE
Sbjct: 64 RDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQ 123
Query: 923 SVYQTELE---ELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQL---- 975
E EL +K+ + +L A+ E +KL E A+I + K+
Sbjct: 124 QTSVLTPEEERELVQKIKELRKELEDAKK-----ALEENEKLKELKAEIDELKKKAREIH 178
Query: 976 EEVKRIKLEILNYTKRGTLTQLAA--LRESVQKLNQRKEDIIAKRGVCERTINEINQSIA 1033
E+++ + E Y + A LR+ +L++ ++ K + +
Sbjct: 179 EKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELR 238
Query: 1034 NQSLEEIDLKNNLTLLEKKEAVAKLNEELK 1063
+ L+ +++E +L E +
Sbjct: 239 ELEKKIKALRAKEKAAKRREKREELKERAE 268
Score = 32.7 bits (75), Expect = 1.2
Identities = 62/286 (21%), Positives = 118/286 (41%), Gaps = 28/286 (9%)
Query: 720 LVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIE 779
+++L+ K K++ E+ K D+L K+ L E + ++ LR + E
Sbjct: 7 KLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDE 66
Query: 780 LEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKI 839
+ E V E K + ++ L+ K L+ + R + +L+RE+ER E K
Sbjct: 67 INEEVQELKEKRDEINAKLQE---LRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQ 123
Query: 840 SGMRSTGVDLDQVLAQ-------------QKEKKNELNTFRSKIESGQTRLNSHNEKLQS 886
T + +++ + E+ +L +++I+ + + +EK+Q
Sbjct: 124 QTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQE 183
Query: 887 LQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLA 946
L + + H + M+K E+ EL E EL +K+ + +
Sbjct: 184 LANEAQEYHEE---------MIKLFEEAD-ELRKEADELHEEFVELSKKIDELHEEFRNL 233
Query: 947 QSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRG 992
Q+EL L+K+ K +E A + ++ EE+K EI KRG
Sbjct: 234 QNELRELEKKIKALRAKEKAAKR--REKREELKERAEEIYEKFKRG 277
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in this
family are typically between 366 and 597 amino acids in
length. There is a single completely conserved residue R
that may be functionally important.
Length = 297
Score = 33.4 bits (77), Expect = 0.73
Identities = 26/154 (16%), Positives = 55/154 (35%), Gaps = 16/154 (10%)
Query: 848 DLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGM 907
++D VL + +++ E ++++ +G + +I + Q +
Sbjct: 109 EIDTVLLEIQQE--EQAAAQAQVAAGFR---------PKIAALTAEIAALQAEIDEAQAE 157
Query: 908 LKSLED-RKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGA 966
+ + +CE EG T + G +L+ AQ+ L+ LK + + A
Sbjct: 158 VNAAYQEAQCEAEG---TGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEA 214
Query: 967 KIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAAL 1000
+ + + K G L +L AL
Sbjct: 215 QKAALERNRQAAVAEKQAR-IEANDGLLARLEAL 247
>gnl|CDD|197258 cd09161, PLDc_PaCLS_like_2, Putative catalytic domain, repeat 2, of
Pseudomonas aeruginosa cardiolipin synthase and similar
proteins. Putative catalytic domain, repeat 2, of
Pseudomonas aeruginosa cardiolipin (CL) synthase (PaCLS)
and similar proteins. Although PaCLS and similar proteins
have not been functionally characterized, members in this
subfamily show high sequence homology to bacterial CL
synthases, which catalyze the reversible phosphatidyl
group transfer between two phosphatidylglycerol molecules
to form CL and glycerol. Moreover, PaCLS and other
members of this subfamily contain two HKD motifs
(H-x-K-x(4)-D, where x represents any amino acid residue)
that characterizes the phospholipase D (PLD) superfamily.
The two motifs may be part of the active site and may be
involved in phosphatidyl group transfer.
Length = 176
Score = 32.6 bits (75), Expect = 0.73
Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 17/82 (20%)
Query: 1267 TTNLDIKNASDQKNFQLIVITHDEEFIENLTAI---DRAYVVRIVRDHKGLSDIHLRSLL 1323
T NLD N S + NF++ + D F + + A+ D A + ++ R L
Sbjct: 109 TANLD--NRSFRLNFEITALVADPGFAQEVEAMLEADFAASREVTAA-----ELANRPLW 161
Query: 1324 SSRLLSRNSKLTLYNTIIKPVL 1345
RL +R ++L P+L
Sbjct: 162 -FRLGARVARLF------APIL 176
>gnl|CDD|213197 cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the drug
resistance transporter and related proteins, subfamily A.
This family of ATP-binding proteins belongs to a
multi-subunit transporter involved in drug resistance
(BcrA and DrrA), nodulation, lipid transport, and
lantibiotic immunity. In bacteria and archaea, these
transporters usually include an ATP-binding protein and
one or two integral membrane proteins. Eukaryotic systems
of the ABCA subfamily display ABC domains that are quite
similar to this family. The ATP-binding domain shows the
highest similarity between all members of the ABC
transporter family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 173
Score = 32.8 bits (76), Expect = 0.74
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 11/53 (20%)
Query: 1263 LDEPTTNLDIKNASDQKNF--QL-------IVITHDEEFIENLTAIDRAYVVR 1306
LDEPT+ LD ++ + +L ++ +H E E L DR ++
Sbjct: 119 LDEPTSGLDPESRREFWELLRELKKEGKTILLSSHILEEAERLC--DRVAILN 169
Score = 32.4 bits (75), Expect = 1.1
Identities = 9/16 (56%), Positives = 13/16 (81%)
Query: 32 IVGENGCGKTTIIECI 47
++G NG GKTT+I+ I
Sbjct: 31 LLGPNGAGKTTLIKII 46
>gnl|CDD|182904 PRK11020, PRK11020, hypothetical protein; Provisional.
Length = 118
Score = 31.5 bits (72), Expect = 0.91
Identities = 21/66 (31%), Positives = 31/66 (46%), Gaps = 5/66 (7%)
Query: 788 KGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELER----QESKISGMR 843
K E+K+L L+ + K A SL+GD Q +E TL+ E+ R Q K+S
Sbjct: 4 KNEIKRLSDRLDAIRHKL-AAASLRGDAEKYAQFEKEKATLEAEIARLKEVQSQKLSKEA 62
Query: 844 STGVDL 849
+ L
Sbjct: 63 QKLMKL 68
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 33.3 bits (76), Expect = 0.92
Identities = 42/168 (25%), Positives = 74/168 (44%), Gaps = 31/168 (18%)
Query: 724 LKTKIKEIPEQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEEN 783
L+TKI+++ EQ + + + ++Q+ +L KLQ+ L ++L
Sbjct: 73 LQTKIRKLTEQDEGLRNIRESIEERQKESDQL--------KLQNQ---KLVNQLGHARGV 121
Query: 784 VIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMR 843
I+TKG +KL+ L +E L +Q LD ++E N ++ E QE
Sbjct: 122 FIKTKGRYQKLEE-LSRHLKEENQCLQIQ-----LDALVQECN--EKIEENQE------- 166
Query: 844 STGVDLDQVLAQQKEKKNELN-TF---RSKIESGQTRLNSHNEKLQSL 887
+L + LA Q+E +E TF + ++ Q + K+Q L
Sbjct: 167 -LNRELAETLAYQQELNDEYQATFVEQHNMLDKRQAYIGKLEAKVQDL 213
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 32.3 bits (74), Expect = 0.98
Identities = 17/81 (20%), Positives = 40/81 (49%), Gaps = 9/81 (11%)
Query: 486 NEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAE 545
E+ + + +Q+L+ + + E+ L+ L ++L+ EI EL+ E+ I+ +
Sbjct: 79 MEIEELREEVQLLKQDCSTLEIELKSLTSDLTTEELQEEI-------QELKKEVREIEEK 131
Query: 546 ISILQAQN--ITLAEIKSLKN 564
+ L+ +T E++ +K
Sbjct: 132 LESLEEGWKPVTPEEMEKVKK 152
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 33.2 bits (76), Expect = 1.0
Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 2/56 (3%)
Query: 928 ELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKL 983
EL EL K+A E Q +L+ + + +KK E +I+ ++ EE K+ +
Sbjct: 4 ELTELISKLA--EKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKSGI 57
>gnl|CDD|213213 cd03246, ABCC_Protease_Secretion, ATP-binding cassette domain of
PrtD, subfamily C. This family represents the ABC
component of the protease secretion system PrtD, a 60-kDa
integral membrane protein sharing 37% identity with HlyB,
the ABC component of the alpha-hemolysin secretion
pathway, in the C-terminal domain. They export
degradative enzymes by using a type I protein secretion
system and lack an N-terminal signal peptide, but contain
a C-terminal secretion signal. The Type I secretion
apparatus is made up of three components, an ABC
transporter, a membrane fusion protein (MFP), and an
outer membrane protein (OMP). For the HlyA transporter
complex, HlyB (ABC transporter) and HlyD (MFP) reside in
the inner membrane of E. coli. The OMP component is TolC,
which is thought to interact with the MFP to form a
continuous channel across the periplasm from the
cytoplasm to the exterior. HlyB belongs to the family of
ABC transporters, which are ubiquitous, ATP-dependent
transmembrane pumps or channels. The spectrum of
transport substrates ranges from inorganic ions,
nutrients such as amino acids, sugars, or peptides,
hydrophobic drugs, to large polypeptides, such as HlyA.
Length = 173
Score = 32.2 bits (74), Expect = 1.1
Identities = 20/67 (29%), Positives = 26/67 (38%), Gaps = 12/67 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKN---------ASDQKNFQLIVITHDEEFIENLTAI 1299
LA N I LDEP ++LD++ A IVI H E L +
Sbjct: 106 GLARALYGNPRILVLDEPNSHLDVEGERALNQAIAALKAAGATRIVIAHR---PETLASA 162
Query: 1300 DRAYVVR 1306
DR V+
Sbjct: 163 DRILVLE 169
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 33.2 bits (76), Expect = 1.2
Identities = 41/235 (17%), Positives = 83/235 (35%), Gaps = 17/235 (7%)
Query: 343 LENQERLKSQYIQEEKQSHT-----HINEAQMKL---GKLERDEETHKKLNDTLKTKLNN 394
L+ +S+ I+ S+ + A + +LE E ++ +D+L +L
Sbjct: 147 LKVLRAGRSRVIELSYTSNDPKLAAKLANALAQAYLADQLEAQLEAFRRASDSLDERLEE 206
Query: 395 LADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIK---ILERTFSDNENTKQAEIN 451
L L + + G Q ++ LS + R + A +
Sbjct: 207 LRARLQEAEAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQLQSARARLAQAEARLASLL 266
Query: 452 ALIVEKVELESKIKSF-KQQIEGNKKDLTNVITQINEVNQ----SQSTLQVLQTKLNRVN 506
L+ E + + I+ ++ V QI +++ L L+ +L +
Sbjct: 267 QLLPLGREAAALREVLESPTIQDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAELR 326
Query: 507 SEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKS 561
+I + + L NE+ +Q LE EL + +S L + L E++
Sbjct: 327 QQIAAELRQIL-ASLPNELALLEQQEAALEKELAQLKGRLSKLPKLQVQLRELER 380
Score = 31.6 bits (72), Expect = 3.8
Identities = 43/254 (16%), Positives = 90/254 (35%), Gaps = 50/254 (19%)
Query: 772 SLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRE 831
+ R L+E + E + L++ + +E + + L +Q + LNT +
Sbjct: 192 AFRRASDSLDERLEELRARLQEAEAQVEDFRAQH-GLTDAARGQLLSEQQLSALNTQLQS 250
Query: 832 LERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQK 891
+ ++ LA + E+ R + +Q L++Q
Sbjct: 251 ARARLAQAE----------ARLASLLQLLPL------GREAAALREVLESPTIQDLRQQY 294
Query: 892 NDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPI--------ETQL 943
+ + A + L + +L +++ +L EL +++A +L
Sbjct: 295 AQVRQQI------ADLSTELGAKHPQLVALEA----QLAELRQQIAAELRQILASLPNEL 344
Query: 944 NLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRES 1003
L + + AL+KE +L +K+ QL E++R + E R L
Sbjct: 345 ALLEQQEAALEKE-LAQLKGRLSKLPKLQVQLRELER-EAE----AARSLYETL------ 392
Query: 1004 VQKLNQRKEDIIAK 1017
L + +E I +
Sbjct: 393 ---LQRYQELSIQE 403
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 33.3 bits (76), Expect = 1.2
Identities = 13/78 (16%), Positives = 33/78 (42%), Gaps = 13/78 (16%)
Query: 477 DLTNVITQI--NEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNE 534
D+T V+ ++V Q +T + Q+ K + +++ E++ +QR +
Sbjct: 53 DMTGVVDTTFDDKVRQHATTEMQVTAA---------QMQKQYE--EIRRELDVLNKQRGD 101
Query: 535 LEDELCVIDAEISILQAQ 552
+ + + + + L Q
Sbjct: 102 DQRRIEKLGQDNAALAEQ 119
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 33.3 bits (76), Expect = 1.2
Identities = 45/285 (15%), Positives = 92/285 (32%), Gaps = 36/285 (12%)
Query: 430 SDIKILERTFSDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVN 489
SD K LE F +N + E + + + +E + + Q E
Sbjct: 396 SDTKNLEEGFETKDNKNKNSSFINKTE--NILTNSPLKDELLEKTTEIINIENPQEFEFG 453
Query: 490 Q------SQSTLQVLQTKLNRVNSEID---QLSKSLDPDQLKNEIEAWIRQRNELEDELC 540
Q S Q+ + + E D S S +P+ E NE D
Sbjct: 454 QIGNDIISTEIAQLDENQNLIDTGEFDLENNFSNSFNPENGNKIDENI----NETFDT-S 508
Query: 541 VIDAEISILQAQNITLAEIKSLKNRKESKLADIN--LLKERHDRAFHLLFDMIPEENFKN 598
I A +S + N K++ L + + + + + + K+
Sbjct: 509 TISANLSENKTNF------AQSFNNKDTNLINSEIPIDLIKDTITINNSQKNVKKNGNKD 562
Query: 599 SL--DKALSSITFDINRI-QEDINAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMEL 655
L ++ ++ I I Q + K+ N + + +A + +++
Sbjct: 563 YLSVEEVINLIMLAIKFHSQNQVEYKKLV---QNWNKNLPLFEYDVEFMEIAHFLKDLKI 619
Query: 656 VLGSKPF------EDELDRVTLELKREQEEVSMMTSTQYLFNSYI 694
+ S F DE+D + ++L + +V+ + +F
Sbjct: 620 LASSDNFILFSSKRDEIDELIIKLNKNNYKVNFQNFLEKIFGGPK 664
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have
LPXTG C-terminal anchoring motifs and a substantial
number have the KxYKxGKxW putative sorting signal at the
N-terminus. The tetracycline resistance plasmid pCF10 in
Enterococcus faecalis promotes conjugal plasmid transfer
in response to sex pheromones, but PgrA/Sec10 encoded by
that plasmid, a member of this family, specifically
inhibits the ability of cells to receive homologous
plasmids. The phenomenon is called surface exclusion.
Length = 356
Score = 32.8 bits (75), Expect = 1.4
Identities = 25/128 (19%), Positives = 40/128 (31%), Gaps = 15/128 (11%)
Query: 763 MKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNI 822
K T IP+ + L L+ + + +L +TAL T + TA Q
Sbjct: 242 NKFDKTPIPNTPNSLKALQAALATAQADLAAAQTALNTAQAALTTA---QTAYAAAQAA- 297
Query: 823 RELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNE 882
L T Q+EL L + L +++ + L + N
Sbjct: 298 --LATAQKELANA---------QAQALQTAQNNLATAQAALANAEARLAKAKEALANLNA 346
Query: 883 KLQSLQKQ 890
L Q
Sbjct: 347 DLAKKQAA 354
>gnl|CDD|234401 TIGR03925, T7SS_EccC_b, type VII secretion protein EccCb. This
model represents the C-terminal domain or EccCb subunit
of the type VII secretion protein EccC as found in the
Actinobacteria. Type VII secretion is defined more
broadly as including secretion systems for ESAT-6-like
proteins in the Firmicutes as well as in the
Actinobacteria, but this family does not show close
homologs in the Firmicutes [Protein fate, Protein and
peptide secretion and trafficking].
Length = 566
Score = 33.0 bits (76), Expect = 1.4
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 4/44 (9%)
Query: 28 PLTLIVGENGCGKTTIIECIKFALTNEY-PQGASSGKNFVHDPR 70
P L+ G++ GKTT++ I +T Y PQ A + + D R
Sbjct: 364 PHLLVFGDSESGKTTLLRAIARGITRRYSPQQA---RLVLVDYR 404
>gnl|CDD|213204 cd03237, ABC_RNaseL_inhibitor_domain2, The ATP-binding cassette
domain 2 of RNase L inhibitor. The ABC ATPase, RNase L
inhibitor (RLI), is a key enzyme in ribosomal biogenesis,
formation of translation preinitiation complexes, and
assembly of HIV capsids. RLI's are not transport proteins
and thus cluster with a group of soluble proteins that
lack the transmembrane components commonly found in other
members of the family. Structurally, RLI's have an
N-terminal Fe-S domain and two nucleotide-binding domains
which are arranged to form two composite active sites in
their interface cleft. RLI is one of the most conserved
enzymes between archaea and eukaryotes with a sequence
identity of more than 48%. The high degree of
evolutionary conservation suggests that RLI performs a
central role in archaeal and eukaryotic physiology.
Length = 246
Score = 32.4 bits (74), Expect = 1.4
Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 17/80 (21%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASDQKNFQLIVITHDEEFIENLTAIDRAYVVRIV 1308
A+A S++ I+ LDEP+ LD +Q+ VI F EN A+VV
Sbjct: 125 AIAACLSKDADIYLLDEPSAYLD----VEQRLMASKVIRR---FAENNEKT--AFVV--- 172
Query: 1309 RDHKGLSDIHLRSLLSSRLL 1328
+H DI + L+ RL+
Sbjct: 173 -EH----DIIMIDYLADRLI 187
>gnl|CDD|220640 pfam10234, Cluap1, Clusterin-associated protein-1. This protein is
conserved from worms to humans. The protein of 413 amino
acids contains a central coiled-coil domain, possibly
the region that binds to clusterin. Cluap1 expression is
highest in the nucleus and gradually increases during
late S to G2/M phases of the cell cycle and returns to
the basal level in the G0/G1 phases. In addition, it is
upregulated in colon cancer tissues compared to
corresponding non-cancerous mucosa. It thus plays a
crucial role in the life of the cell.
Length = 269
Score = 32.3 bits (74), Expect = 1.5
Identities = 26/119 (21%), Positives = 52/119 (43%), Gaps = 12/119 (10%)
Query: 375 ERDEETHKKLND-TLKTKLNNL--ADTLCLDTTAKSQ--YTPEEGEGLIKMSQTTIDKYL 429
+EE + L L +K+++L A L + T + Y E ++ S+
Sbjct: 100 GDEEENTRSLAKFDLGSKISDLKAARQLASEITERGAALYDLLGKELDLRESRQKALSRP 159
Query: 430 SDIKILERTF-------SDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNV 481
++ +E+ + QAE++ L ++ LE+KI+ KQ++E +K L +
Sbjct: 160 LELAEVEKALKEAIKNLAARLQQLQAELDNLKSDEANLEAKIERKKQELERTQKRLQAL 218
>gnl|CDD|184037 PRK13409, PRK13409, putative ATPase RIL; Provisional.
Length = 590
Score = 32.9 bits (76), Expect = 1.5
Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 8/49 (16%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIK---NASD-----QKNFQLIVITHD 1289
A+A R+ + DEPT+ LDI+ N + + ++V+ HD
Sbjct: 222 AIAAALLRDADFYFFDEPTSYLDIRQRLNVARLIRELAEGKYVLVVEHD 270
>gnl|CDD|237422 PRK13548, hmuV, hemin importer ATP-binding subunit; Provisional.
Length = 258
Score = 32.4 bits (75), Expect = 1.6
Identities = 20/61 (32%), Positives = 29/61 (47%), Gaps = 17/61 (27%)
Query: 1263 LDEPTTNLDI----------KNASDQKNFQLIVITHDEEFIENLTAIDRAYVVRIVRDHK 1312
LDEPT+ LD+ + + ++ +IV+ HD NL A Y RIV H+
Sbjct: 164 LDEPTSALDLAHQHHVLRLARQLAHERGLAVIVVLHD----LNLAA---RYADRIVLLHQ 216
Query: 1313 G 1313
G
Sbjct: 217 G 217
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 32.8 bits (75), Expect = 1.6
Identities = 40/228 (17%), Positives = 83/228 (36%), Gaps = 40/228 (17%)
Query: 948 SELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQKL 1007
S ++K++ +L + +++ D L+ +E ++ +L+ QK
Sbjct: 128 SAPSSIKEQKSSELKKVDSQLPDPRALLKGEDLKTVETAEEIEK-------SLKAMQQKW 180
Query: 1008 NQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEKKEAVAKLNEELKLSEI 1067
+RK+D+ K+ + E + + +++++ D+KN L L + KE KL +E K +
Sbjct: 181 KKRKKDLPNKQ-----DLEEYKKRL--EAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQ 233
Query: 1068 MI----SDLTKYHHTLENCVIKYHSQKMRSINRLIREYWTRIYQLKLSEIMISDLTKYHH 1123
I +DL L+ + + + RL Y +K + +
Sbjct: 234 KIKSAKNDLQNDKKQLKADLAELKKAPQNDLKRL-----ENKYAIK--SGDLKNFA---V 283
Query: 1124 TLENCVIKYHSQKMRSINRLIREYWT------------RIYQGNDIDY 1159
L I+ + QK + G D D+
Sbjct: 284 DLFGPEIRKYLQKFLKYYDQAEPLLNKSKKEPKEEAVETLANGEDYDF 331
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is involved
in determining the site of synaptic vesicle fusion. The
C-terminus is a PDZ-binding motif that binds directly to
RIM (a small G protein Rab-3A effector). The family also
contains four coiled-coil domains.
Length = 774
Score = 33.1 bits (75), Expect = 1.6
Identities = 140/844 (16%), Positives = 302/844 (35%), Gaps = 136/844 (16%)
Query: 204 TKYNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELH 263
T ++ L+ + R I K + T +E + QL Q+ +S EL+
Sbjct: 35 TFWSPELKRERALRKDEAARITVWKEQLRVT----QEENQHAQLTVQALQEELKSQRELN 90
Query: 264 NIEESMKPIN---EKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSG 320
+++ ++ +L QL + E S + + + + E +++++ +L+ +++L
Sbjct: 91 RLQQKLEKSGNETGELQQLEKSEGQFSRIQAERERQAMELFLLRDTLEKLQLRMERL--- 147
Query: 321 DKAELQSKLNLFKINLDEKCSELEN-QERLKSQYIQEEKQSHTHINEAQMKLGKLERDEE 379
+ LD + E++ E L+S+ + + + +LG++ E+
Sbjct: 148 ------------QQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSE-----RLGRIREAED 190
Query: 380 THKKLNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTF 439
L L+ K L + K Q ++ + + SQ ID+ + IK E+
Sbjct: 191 QQSHLEVLLEQK-EKEHMMLREEIHQKLQMERDDAK--TEASQKLIDEKDTKIKEFEKML 247
Query: 440 SDNENTKQAEINALIVEKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQ 499
EN EI L E+ +++E + S L ++
Sbjct: 248 EKAEN----EIYRLQSRCDTSEADRNRLDKEVEAER-----------------SALAAMK 286
Query: 500 TKLNRVNSEIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITL-AE 558
K +R E+ + L Q + E A Q +++ L + +++ + + L E
Sbjct: 287 AKCDRAAQELSRKKTELLGLQTELETLA--NQDSDMRQHLDKLKEDLTRAEQEKAILQTE 344
Query: 559 IKSLKNRKESKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDI 618
+ +L+ E K + + + ++ +I +++
Sbjct: 345 VDALRYELERKHNTLTKKTASLQ-----------------AAQEEKATYAGEIEDMRDRY 387
Query: 619 NAKEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKREQE 678
E+ L L+ + N +T R ++R L E +R+ + + + + T K E+
Sbjct: 388 EKTERKLRVLQKKIENLQETFRRKERRLKEEKERL------RSLQTDTNTDTALEKLEKA 441
Query: 679 EVSMMTSTQYLFNSYIGKLEENEPRCPLCTRFFESDYSVPGLVNKLKTKIKEIPEQTNNK 738
I +L+E R + Y K + +++ E+ N
Sbjct: 442 LAEKER--------IIERLKEQRDR--------DERYEQEEF-ETYKKEFEDLKEEVQNL 484
Query: 739 KTHIDQLCKQQRSLQELKPVYENIMKLQDTDIPSLRSKLIELEENVIETKGELKKLKTAL 798
+ +L +++ L+ LK + Q L IELE+ + + K+LK
Sbjct: 485 QL---KLSERELQLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLR 541
Query: 799 ETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKE 858
P++ ++ + G + N + R L+R L++ ++ +
Sbjct: 542 ANPESADRGSAVDAGTSRSRADSAGARNEVDRLLDR--------------LEKAEQERDD 587
Query: 859 KKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCEL 918
+ E ++E Q L EK ++ + + ++ L ++ E+ +
Sbjct: 588 TEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAELLEEAERLEKSEAEEETLRQS 647
Query: 919 EGMDSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYT------ 972
+ + V +E+ L ++E + L + +K Q+
Sbjct: 648 TQIGHAQAAAHNHIEHHVQKLESDLKQLRAEREQLVAQLEKSQQSLMEFQQELNAAEAER 707
Query: 973 -KQLEEVKRIK-----------------LEILNYTKRGTLTQLAALRESVQKLNQRKEDI 1014
K LEE + +K LE+ K+ T ++A LR KL ++ +D
Sbjct: 708 RKHLEENRELKQEALLAAISEKDANIALLELSASKKKKTQEEVALLRREKDKLVEQLKDQ 767
Query: 1015 IAKR 1018
KR
Sbjct: 768 TQKR 771
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 32.5 bits (74), Expect = 1.7
Identities = 34/173 (19%), Positives = 68/173 (39%), Gaps = 17/173 (9%)
Query: 779 ELEENVIETKGELKKLKTALE-----TPKTKEKTALSLQGDLTLLDQNIRELNTLQRELE 833
EL E + E+ +L +L+ LE T + KE Q L++ I L L
Sbjct: 93 ELRERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALS 152
Query: 834 RQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKND 893
R+ K G+ L + LA K + +R + E + + + +
Sbjct: 153 REVGKQRGL------LSRGLATFKRDRILQQQWREEQEKYDAADKARAIYALQTKADERN 206
Query: 894 IHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLA 946
+ + ++ L + ++ ELE +++ ++E ++ +E +LNL
Sbjct: 207 LETVLQSLSQADFQLAGVAEK--ELETVEA----RIKEARYEIEELENKLNLN 253
>gnl|CDD|168237 PRK05772, PRK05772, translation initiation factor IF-2B subunit
alpha; Provisional.
Length = 363
Score = 32.4 bits (74), Expect = 1.7
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 10/101 (9%)
Query: 815 LTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQ 874
L L++ N++ L+ REL R ++ + R T V+L ++ LN ++ +ESG
Sbjct: 74 LALIENNVKTLDDAIRELTRAKTILDSARPTAVNLVWATSRM------LNKAKNTVESGN 127
Query: 875 TRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGM--LKSLED 913
+ S NE ++ L+ + I ++ + G+ L+ L D
Sbjct: 128 AK--SVNELIELLKVEAKKIFEEEYDAEIQMGLYGLEKLND 166
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 32.5 bits (74), Expect = 1.7
Identities = 23/105 (21%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 281 EKERNMSVMSTQYQTKKTERDMIQESCNE---LESSIKQLFSGDKAEL---QSKLNLFKI 334
E++ + + ++K +++++ E+ E I+Q F ++ EL +S+L
Sbjct: 52 ERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERAT 111
Query: 335 NLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEE 379
+LD K L ++E+ Q HI+E + ++ KLE ++
Sbjct: 112 SLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKK 156
>gnl|CDD|225183 COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an
N-terminal double-glycine peptidase domain [Defense
mechanisms].
Length = 709
Score = 32.6 bits (75), Expect = 1.8
Identities = 21/73 (28%), Positives = 27/73 (36%), Gaps = 14/73 (19%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD------IKNASDQKNFQ--LIVITHDEEFIENLTAID 1300
ALA I LDE T+ LD I Q +I+I H I + D
Sbjct: 619 ALARALLSKPKILLLDEATSALDPETEAIILQNLLQILQGRTVIIIAHRLSTIRS---AD 675
Query: 1301 RAYVV---RIVRD 1310
R V+ +IV
Sbjct: 676 RIIVLDQGKIVEQ 688
>gnl|CDD|130262 TIGR01194, cyc_pep_trnsptr, cyclic peptide transporter. This model
describes cyclic peptide transporter in bacteria.
Bacteria have elaborate pathways for the production of
toxins and secondary metabolites. Many such compounds,
including syringomycin and pyoverdine are synthesized on
non-ribosomal templates consisting of a multienzyme
complex. On several occasions the proteins of the
complex and transporter protein are present on the same
operon. Often times these compounds cross the biological
membrane by specific transporters. Syringomycin is an
amphipathic, cylclic lipodepsipeptide when inserted into
host causes formation of channels, permeable to variety
of cations. On the other hand, pyoverdine is a cyclic
octa-peptidyl dihydroxyquinoline, which is efficient in
sequestering iron for uptake [Transport and binding
proteins, Amino acids, peptides and amines, Transport
and binding proteins, Other].
Length = 555
Score = 32.6 bits (74), Expect = 1.8
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 26 QRPLTLIVGENGCGKTTIIE 45
Q + IVGENGCGK+T+ +
Sbjct: 367 QGDIVFIVGENGCGKSTLAK 386
>gnl|CDD|224116 COG1195, RecF, Recombinational DNA repair ATPase (RecF pathway)
[DNA replication, recombination, and repair].
Length = 363
Score = 32.2 bits (74), Expect = 1.9
Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 4/52 (7%)
Query: 1 MALLDQLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALT 52
M LL L + RN+ + + ++VGENG GKT ++E I
Sbjct: 1 MYLL-SLLLRNFRNYAELD---LDLSPGVNVLVGENGQGKTNLLEAIYLLAL 48
>gnl|CDD|237648 PRK14250, PRK14250, phosphate ABC transporter ATP-binding protein;
Provisional.
Length = 241
Score = 32.1 bits (73), Expect = 1.9
Identities = 22/84 (26%), Positives = 39/84 (46%), Gaps = 17/84 (20%)
Query: 1225 DEEFIENLTAIDRAYVVRIVRD-------HKALAETFSRNCGIFALDEPTTNLD------ 1271
D E+ ++ +++ Y R V++ ++A T + N + LDEPT+ LD
Sbjct: 110 DVEYYLSIVGLNKEYATRDVKNLSGGEAQRVSIARTLANNPEVLLLDEPTSALDPTSTEI 169
Query: 1272 ----IKNASDQKNFQLIVITHDEE 1291
I ++ N +I ITH+ E
Sbjct: 170 IEELIVKLKNKMNLTVIWITHNME 193
>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein. This family
consists of proteins over 500 amino acids long in
Caenorhabditis elegans and several bacteria (Pseudomonas
aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
etc.). The function is unknown.
Length = 525
Score = 32.4 bits (74), Expect = 2.2
Identities = 23/132 (17%), Positives = 46/132 (34%), Gaps = 12/132 (9%)
Query: 927 TELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEIL 986
L EL +++ +E +L + DALK K ++D + L E + +
Sbjct: 71 ERLAELRKQIRELEAELRDLEDRGDALKA--LAKF------LEDIREGLTEPIKDSAKRN 122
Query: 987 NYTKRGTLTQLAALRESVQKLNQRKEDIIAKRGVCERTI----NEINQSIANQSLEEIDL 1042
+ +++L + + E+ + NE+N + +S +
Sbjct: 123 EPDLKEWFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNALLTGKSQRSHTV 182
Query: 1043 KNNLTLLEKKEA 1054
L E EA
Sbjct: 183 LVRLEAPEDAEA 194
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 32.2 bits (74), Expect = 2.2
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 4/72 (5%)
Query: 492 QSTLQVLQTKLNRVNSEIDQLSKSLDPD--QLKNEIEAWIRQRNELEDELCVIDAEI--S 547
L+ LQ + N ++ EI + K + D +L E++ + ELE L ++AE+
Sbjct: 42 LRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDELEAELDTL 101
Query: 548 ILQAQNITLAEI 559
+L NI +
Sbjct: 102 LLTIPNIPHESV 113
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 32.4 bits (74), Expect = 2.2
Identities = 64/341 (18%), Positives = 128/341 (37%), Gaps = 62/341 (18%)
Query: 508 EIDQLSKSLDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKE 567
+I L++ + Q K E +R+ +E+E +L ++++I I L +
Sbjct: 11 QIRDLNEDVSAFQRKFVNE--VRRCDEVERKLRKLESKIK---KLGIPLKDTGG--KPDV 63
Query: 568 SKLADINLLKERHDRAFHLLFDMIPEENFKNSLDKALSSITFDINRIQEDINAKEKHLYT 627
+ L+E + ++ E SL+K ++ + +N + E+ + +++L
Sbjct: 64 PPSKEFLDLEEEILDLEAEIKEV---EENLESLEKEINELEEWLNVLDEEKSFLDENLEE 120
Query: 628 LEANVSNSSKTLRDQKRTLAELMDRMELVLGSKP------FEDELDRVTLELKREQE-EV 680
L S S D K ++ V G FE EL R R+ E E
Sbjct: 121 L----SELSNLDIDFKYLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEE 176
Query: 681 SMMTSTQYLFNSYI-GKLEENEPRCPLCTRFFESDYSVP-------GLVNKLKTKIKEIP 732
+ + +F + GK + ++ + + F Y VP L++K+ +I+E+
Sbjct: 177 PLEDPKKTVFIIFFVGKEDLDKVK-KILDSFGFELYDVPETEGERSELISKVNKRIEELQ 235
Query: 733 EQTNNKKTHIDQLCKQQRSLQELKPVYENIMKLQD-----------------------TD 769
++H++++ + EL E + K + D
Sbjct: 236 RVLEQTESHLEKV--LVKIADELLAWDEQVSKEKAVYETLNLFNYDTKTLIAEGWVPAKD 293
Query: 770 IPSLRSKLIELEENVIETKGELKKLKTALET---PKTKEKT 807
+ L++ L EN E G + ++ +ET P T K
Sbjct: 294 LEKLKAAL----ENATEGSGSVPSIENDIETNEEPPTYLKN 330
Score = 31.2 bits (71), Expect = 5.2
Identities = 43/275 (15%), Positives = 89/275 (32%), Gaps = 64/275 (23%)
Query: 775 SKLIELEENVIETKGELKKLKTAL-----ETPKTKEKTALSLQGDLTLLDQNIRELNTLQ 829
+ E+E + + + ++KKL L + K L L+ ++ L+ I+E+
Sbjct: 31 RRCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENL 90
Query: 830 RELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQK 889
LE++ + + +E N L + L +L +L
Sbjct: 91 ESLEKE-----------------INELEEWLNVL---DEEKSFLDENLEEL-SELSNLDI 129
Query: 890 QKNDIHSKQ-LTVQGGAGM--------LKSLEDRKCELEGMDSVYQTELEELGRKVAPIE 940
+ + L + AG+ + R C + + LE+ + V I
Sbjct: 130 DFKYLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKKTVFIIF 189
Query: 941 TQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAAL 1000
+ KK L+ G ++ D + E
Sbjct: 190 FV-------GKEDLDKVKKILDSFGFELYDVPETEGE----------------------R 220
Query: 1001 RESVQKLNQRKEDIIAKRGVCERTINEINQSIANQ 1035
E + K+N+R E++ E + ++ IA++
Sbjct: 221 SELISKVNKRIEELQRVLEQTESHLEKVLVKIADE 255
>gnl|CDD|180871 PRK07191, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 456
Score = 32.1 bits (73), Expect = 2.3
Identities = 22/101 (21%), Positives = 44/101 (43%), Gaps = 12/101 (11%)
Query: 766 QDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIREL 825
Q D P +R + VIE+ + + ++K ++ Q D T ++++
Sbjct: 121 QLPDSPPMR-------QQVIESANAMALRFNNVNNFIVQQKKSIGQQRDAT-----VKQI 168
Query: 826 NTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTF 866
N+L R + KI RS G ++ +L Q+ + +L+
Sbjct: 169 NSLTRSIADYNQKILKNRSDGNNISDLLDQRDLQIKKLSGL 209
>gnl|CDD|224044 COG1119, ModF, ABC-type molybdenum transport system, ATPase
component/photorepair protein PhrA [Inorganic ion
transport and metabolism].
Length = 257
Score = 31.9 bits (73), Expect = 2.3
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 6/44 (13%)
Query: 32 IVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRIGKKD 75
IVG NG GKTT++ + T E+P + R GK +
Sbjct: 62 IVGPNGAGKTTLLSLL----TGEHPPSSGDVTLLGR--RFGKGE 99
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 32.2 bits (74), Expect = 2.4
Identities = 26/67 (38%), Positives = 33/67 (49%), Gaps = 10/67 (14%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKN----ASDQKNFQ--LIVITHDEEFIENL-TAI-- 1299
AL N + LDEPT +LDI+ K FQ +I I+HD FI N+ T I
Sbjct: 166 ALGRALVSNPDVLLLDEPTNHLDIETIEWLEGFLKTFQGSIIFISHDRSFIRNMATRIVD 225
Query: 1300 -DRAYVV 1305
DR +V
Sbjct: 226 LDRGKLV 232
Score = 30.7 bits (70), Expect = 7.6
Identities = 15/52 (28%), Positives = 30/52 (57%), Gaps = 6/52 (11%)
Query: 1250 LAETFSRNCGIFALDEPTTNLDIKN----ASDQKNFQ--LIVITHDEEFIEN 1295
LA F + + LDEPT +LD++ ++Q +++++HD +F++N
Sbjct: 451 LARLFLKPSNLLILDEPTNDLDVETLELLEELLDSYQGTVLLVSHDRQFVDN 502
>gnl|CDD|213202 cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the
metal-type transporters. This family includes
transporters involved in the uptake of various metallic
cations such as iron, manganese, and zinc. The ATPases of
this group of transporters are very similar to members of
iron-siderophore uptake family suggesting that they share
a common ancestor. The best characterized metal-type ABC
transporters are the YfeABCD system of Y. pestis, the
SitABCD system of Salmonella enterica serovar
Typhimurium, and the SitABCD transporter of Shigella
flexneri. Moreover other uncharacterized homologs of
these metal-type transporters are mainly found in
pathogens like Haemophilus or enteroinvasive E. coli
isolates.
Length = 213
Score = 31.3 bits (72), Expect = 2.6
Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 9/49 (18%)
Query: 1250 LAETFSRNCGIFALDEPTTNLDIKNASD---------QKNFQLIVITHD 1289
LA ++ + LDEP +D K D ++ ++V+THD
Sbjct: 143 LARALVQDPDLLLLDEPFAGVDPKTQEDIYELLRELRREGMTILVVTHD 191
Score = 29.8 bits (68), Expect = 9.1
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 29 LTLIVGENGCGKTTIIECI 47
IVG NG GK+T+++ I
Sbjct: 27 FLAIVGPNGAGKSTLLKAI 45
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 31.9 bits (73), Expect = 2.8
Identities = 23/103 (22%), Positives = 39/103 (37%), Gaps = 5/103 (4%)
Query: 777 LIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQE 836
L EL E + E EL+KL K A + + L + I +L+ LE
Sbjct: 1 LKELREALAELAKELRKLTE-----DEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELL 55
Query: 837 SKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNS 879
++ + + +++E K E FR+ + G L
Sbjct: 56 DELEAKPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAE 98
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 31.8 bits (73), Expect = 2.9
Identities = 45/209 (21%), Positives = 88/209 (42%), Gaps = 73/209 (34%)
Query: 781 EENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELNTLQRELERQESKIS 840
+E ++E K E+ KL+ LE REL + EL+R E ++
Sbjct: 50 KEALLEAKEEVHKLRAELE-----------------------RELKERRNELQRLERRL- 85
Query: 841 GMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLT 900
Q+E E+ ++ S ++K ++L+K++ ++ +K+
Sbjct: 86 --------------LQRE------------ETLDRKMESLDKKEENLEKKEKELSNKE-- 117
Query: 901 VQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQLNLAQSEL-DALKKEHKK 959
K+L++++ ELE + + + ELE + L Q E + L +E ++
Sbjct: 118 --------KNLDEKEEELEELIAEQREELERIS----------GLTQEEAKEILLEEVEE 159
Query: 960 KLNEEGAK-IQDYTKQL-EEVKRIKLEIL 986
+ E AK I++ ++ EE + EIL
Sbjct: 160 EARHEAAKLIKEIEEEAKEEADKKAKEIL 188
Score = 31.0 bits (71), Expect = 5.9
Identities = 27/144 (18%), Positives = 64/144 (44%), Gaps = 18/144 (12%)
Query: 946 AQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLTQLAALRESVQ 1005
A+ E+ L+ E +++L E ++Q ++L + + E L+ + L + +
Sbjct: 56 AKEEVHKLRAELERELKERRNELQRLERRLLQ----REETLDRKMESLDKKEENLEKKEK 111
Query: 1006 KLNQRKEDIIAKRGVCERTINEINQ---SIANQSLEEI------DLKNNLT-----LLEK 1051
+L+ +++++ K E I E + I+ + EE +++ L+++
Sbjct: 112 ELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVEEEARHEAAKLIKE 171
Query: 1052 KEAVAKLNEELKLSEIMISDLTKY 1075
E AK + K EI+ + + +Y
Sbjct: 172 IEEEAKEEADKKAKEILATAIQRY 195
>gnl|CDD|234608 PRK00064, recF, recombination protein F; Reviewed.
Length = 361
Score = 31.7 bits (73), Expect = 3.0
Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 3/44 (6%)
Query: 4 LDQLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECI 47
L +L + RN+ + + ++VGENG GKT ++E I
Sbjct: 3 LTRLSLTDFRNYEELD---LELSPGVNVLVGENGQGKTNLLEAI 43
>gnl|CDD|213183 cd03216, ABC_Carb_Monos_I, First domain of the ATP-binding
cassette component of monosaccharide transport system.
This family represents the domain I of the carbohydrate
uptake proteins that transport only monosaccharides
(Monos). The Carb_Monos family is involved in the
uptake of monosaccharides, such as pentoses (such as
xylose, arabinose, and ribose) and hexoses (such as
xylose, arabinose, and ribose), that cannot be broken
down to simple sugars by hydrolysis. Pentoses include
xylose, arabinose, and ribose. Important hexoses
include glucose, galactose, and fructose. In members of
the Carb_monos family, the single hydrophobic gene
product forms a homodimer while the ABC protein
represents a fusion of two nucleotide-binding domains.
However, it is assumed that two copies of the ABC
domains are present in the assembled transporter.
Length = 163
Score = 30.9 bits (71), Expect = 3.1
Identities = 7/16 (43%), Positives = 14/16 (87%)
Query: 32 IVGENGCGKTTIIECI 47
++GENG GK+T+++ +
Sbjct: 31 LLGENGAGKSTLMKIL 46
>gnl|CDD|227320 COG4987, CydC, ABC-type transport system involved in cytochrome bd
biosynthesis, fused ATPase and permease components
[Energy production and conversion / Posttranslational
modification, protein turnover, chaperones].
Length = 573
Score = 31.9 bits (73), Expect = 3.3
Identities = 14/54 (25%), Positives = 22/54 (40%), Gaps = 8/54 (14%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNASD--------QKNFQLIVITHDEEFIE 1294
ALA + ++ LDEPT LD + L+++TH +E
Sbjct: 484 ALARALLHDAPLWLLDEPTEGLDPITERQVLALLFEHAEGKTLLMVTHRLRGLE 537
>gnl|CDD|213243 cd03276, ABC_SMC6_euk, ATP-binding cassette domain of eukaryotic
SM6 proteins. The structural maintenance of chromosomes
(SMC) proteins are large (approximately 110 to 170 kDa),
and each is arranged into five recognizable domains.
Amino-acid sequence homology of SMC proteins between
species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code),
which by mutational studies has been shown to be
essential in several proteins. The carboxy-terminal
domain contains a sequence (the DA-box) that resembles a
'Walker B' motif, and a motif with homology to the
signature sequence of the ATP-binding cassette (ABC)
family of ATPases. The sequence homology within the
carboxy-terminal domain is relatively high within the
SMC1-SMC4 group, whereas SMC5 and SMC6 show some
divergence in both of these sequences. In eukaryotic
cells, the proteins are found as heterodimers of SMC1
paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
(formerly known as Rad18).
Length = 198
Score = 31.0 bits (71), Expect = 3.3
Identities = 32/127 (25%), Positives = 50/127 (39%), Gaps = 31/127 (24%)
Query: 13 RNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRIG 72
+NF ++ + F + IVG NG GK+ I+ + L G +S N
Sbjct: 7 KNFMCHRHLQIEFGPRVNFIVGNNGSGKSAILTALTIGLG-----GKASDTN-------- 53
Query: 73 KKDEASASSGKNFVHDPRIGKKDETHA--IVKLQCTCENNDTVCVV-----RSLLLSNKN 125
SS K+ + K E+ A V L+ + + +CV+ RS L SNK
Sbjct: 54 -----RGSSLKDLI------KDGESSAKITVTLKNQGLDANPLCVLSQDMARSFLTSNKA 102
Query: 126 GKDNCAT 132
+ T
Sbjct: 103 AVRDVKT 109
>gnl|CDD|226967 COG4608, AppF, ABC-type oligopeptide transport system, ATPase
component [Amino acid transport and metabolism].
Length = 268
Score = 31.4 bits (72), Expect = 3.4
Identities = 8/12 (66%), Positives = 12/12 (100%)
Query: 32 IVGENGCGKTTI 43
+VGE+GCGK+T+
Sbjct: 44 LVGESGCGKSTL 55
>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory
protein. This family represents an accessory protein
that works with the bacteriocin maturation and ABC
transport secretion protein described by TIGR01193
[Transport and binding proteins, Other].
Length = 457
Score = 31.6 bits (72), Expect = 3.4
Identities = 34/179 (18%), Positives = 61/179 (34%), Gaps = 21/179 (11%)
Query: 206 YNKALESIKIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNI 265
Y Q + L E + Q + A+ K + K DQ ++ +
Sbjct: 138 YRNLFNGYLAQVESLTSETQQQNDKSQTQN---EAAEKTKAQLDQQISKTDQKLQDYQAL 194
Query: 266 EESMK---------PINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQ 316
+ ++ P + ++ S + Q K T IQ+ ++L+ SI
Sbjct: 195 KNAISNGTKVANFNPYQSLYENYQAQLKSASDKDQKNQVKSTILATIQQQIDQLQKSIAS 254
Query: 317 LFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLE 375
+ KA L + S+ +LK Q + + KQ I + KL +LE
Sbjct: 255 -YQVQKAGLTKSTASNYAS-----SQNSKLAQLKEQQLAKVKQE---ITDLNQKLLELE 304
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 31.8 bits (73), Expect = 3.5
Identities = 37/235 (15%), Positives = 91/235 (38%), Gaps = 40/235 (17%)
Query: 456 EKVELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKS 515
+ +++E +++ K+QI+ N L ++E + + + ID L
Sbjct: 246 DHLDIEKELQDLKEQIDQNLALLEE--LDLDEAEE----------ENEEIEERIDTLY-- 291
Query: 516 LDPDQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITL-AEIKSLKNR---KESKLA 571
D L+ E++A + +E + + + + QN L E+ L+ E +L
Sbjct: 292 ---DILEKEVKA----KKFVEKNIDKLTDFLEHAREQNKQLLLELDRLQQSYTLNEDELE 344
Query: 572 DINLLKER---HDRAFHLLFDMIPE--------ENFKNSLDKALSSITFDINRIQEDINA 620
+ L+++ + + L + I E + ++K L +I + QE+++
Sbjct: 345 TVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLE----EIEKEQEELSE 400
Query: 621 KEKHLYTLEANVSNSSKTLRDQKRTLAELMDRMELVLGSKPFEDELDRVTLELKR 675
+ L E + R + + +++ L + + + V+ E++
Sbjct: 401 SLQGLRKDELEAREKLQEYRQKLHEIKRYLEKSNLPGLPEDYLEYFFVVSDEIEA 455
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in
the lipid bilayer. The MscS proteins are heptamers of
three transmembrane subunits with seven converging M3
domains, and this MscS_porin is towards the N-terminal
of the molecules. The high concentration of negative
charges at the extracellular entrance of the pore helps
select the cations for efflux.
Length = 239
Score = 31.1 bits (71), Expect = 3.7
Identities = 31/124 (25%), Positives = 55/124 (44%), Gaps = 10/124 (8%)
Query: 459 ELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDP 518
+LE ++ Q++ ++ L +Q+ E+ Q ++ R EI ++L P
Sbjct: 82 QLEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQQQLSEARRRLQEIRNRLQALSP 141
Query: 519 DQLKNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEIKSLKNRKESKLADINLLKE 578
+A QR L+ EL + A+I L+ E+ S NR+E +LLK+
Sbjct: 142 GG-TPLAQA---QRTLLQAELAALKAQIEELE------LELLSNNNRQELLRLQRDLLKK 191
Query: 579 RHDR 582
R +R
Sbjct: 192 RIER 195
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 31.7 bits (72), Expect = 3.7
Identities = 42/245 (17%), Positives = 96/245 (39%), Gaps = 13/245 (5%)
Query: 266 EESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDKAEL 325
+E +K +N + +L E N + + + + E E + ++ + + +L
Sbjct: 309 KEQLKTLNLEKQKLLSIEFNTNFLLNNFVSLINRLKASVNQVFEFEIYLYKIINANPDKL 368
Query: 326 QSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERDEETH--KK 383
++K I ++K + N + + ++ I ++S N+ + L + + + KK
Sbjct: 369 ENKKIEISIKPEKK--TINNSQIITTKKINIVEES----NKNSVHFDTLYKTKIFYHKKK 422
Query: 384 LNDTLKTKLNNLADTLCLDTTAKSQYTPEEGEGLIKMSQTTIDKYLSDIKILERTFSDNE 443
+N K + + L + KS+ P+ E L + + DI++ +
Sbjct: 423 INQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNMLQKFN 482
Query: 444 NTKQAEINALIVEKVELESKI---KSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQT 500
K E + ++ + S I G + +I IN+VN+S ++ +
Sbjct: 483 KFKNEESAENSFIDIIKQADLVLWVSPNALILGVEDL--GLIELINKVNESFEFIKTIFK 540
Query: 501 KLNRV 505
K N+
Sbjct: 541 KFNKN 545
>gnl|CDD|184588 PRK14243, PRK14243, phosphate transporter ATP-binding protein;
Provisional.
Length = 264
Score = 30.9 bits (70), Expect = 4.1
Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 10/58 (17%)
Query: 19 KNRVVRFQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGA-SSGKNFVHDPRIGKKD 75
KN++ F +G +GCGK+TI+ C F N+ G GK H + D
Sbjct: 35 KNQITAF-------IGPSGCGKSTILRC--FNRLNDLIPGFRVEGKVTFHGKNLYAPD 83
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 30.6 bits (70), Expect = 4.3
Identities = 37/194 (19%), Positives = 84/194 (43%), Gaps = 30/194 (15%)
Query: 763 MKLQDTDIPSLRSKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLD--Q 820
M+ + +I LR KLI L+ + + + +L++ + L L L+D Q
Sbjct: 1 MRAKRNEISELRLKLITLKHTLAKIQEKLEQKEE--------------LGEGLHLIDFEQ 46
Query: 821 NIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKN----ELNTFRSKIESGQTR 876
E L ++E + ++ ++ + +L KEK + EL + +++ + R
Sbjct: 47 LQIENQALNEKIEERNKELKRLKLKYTNTVHILTHLKEKLHMLEEELERLKKELKEKEER 106
Query: 877 LNSHNEKLQSLQKQKNDIHSK--QLTVQGGA----GMLKSLEDRKCELEGMDSVYQTELE 930
L ++L ++ +++ + + +L QGG +L + ++E + +E
Sbjct: 107 LAKLRKELYKVKLERDKLRKQNKKLRQQGGLLHVPALLYDYDKTVEKVEEK----RKSVE 162
Query: 931 ELGRKVAPIETQLN 944
+L RK +E ++
Sbjct: 163 KLRRKHKILEMRIK 176
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome
partitioning].
Length = 1480
Score = 31.8 bits (72), Expect = 4.3
Identities = 28/109 (25%), Positives = 46/109 (42%), Gaps = 17/109 (15%)
Query: 216 QRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNIEESMKPINEK 275
QR LR+E ++++ Q +L + Q Q +QS EE + + E
Sbjct: 579 QRMALRQEQEQLQSRIQ-SLMQRAPVWLAAQNALE--QLSEQSGEEF----TDSQDVTEY 631
Query: 276 LVQLTEKERNMSVMSTQYQTKKTERDMIQESCNELESSIKQLFSGDKAE 324
+ QL E+ER +V ERD + N L+ I++L +E
Sbjct: 632 MQQLLEREREATV----------ERDELGARKNALDEEIERLSQPGGSE 670
>gnl|CDD|182569 PRK10584, PRK10584, putative ABC transporter ATP-binding protein
YbbA; Provisional.
Length = 228
Score = 30.9 bits (70), Expect = 4.4
Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 10/53 (18%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNA--------SDQKNF--QLIVITHDEE 1291
ALA F+ + DEPT NLD + S + LI++THD +
Sbjct: 156 ALARAFNGRPDVLFADEPTGNLDRQTGDKIADLLFSLNREHGTTLILVTHDLQ 208
>gnl|CDD|213186 cd03219, ABC_Mj1267_LivG_branched, ATP-binding cassette component
of branched chain amino acids transport system. The
Mj1267/LivG ABC transporter subfamily is involved in
the transport of the hydrophobic amino acids leucine,
isoleucine and valine. MJ1267 is a branched-chain amino
acid transporter with 29% similarity to both the LivF
and LivG components of the E. coli branched-chain amino
acid transporter. MJ1267 contains an insertion from
residues 114 to 123 characteristic of LivG
(Leucine-Isoleucine-Valine) homologs. The
branched-chain amino acid transporter from E. coli
comprises a heterodimer of ABCs (LivF and LivG), a
heterodimer of six-helix TM domains (LivM and LivH),
and one of two alternative soluble periplasmic
substrate binding proteins (LivK or LivJ).
Length = 236
Score = 30.9 bits (71), Expect = 4.5
Identities = 10/35 (28%), Positives = 16/35 (45%), Gaps = 6/35 (17%)
Query: 29 LTLIVGENGCGKTTIIECIKFALTNEYPQGASSGK 63
+ ++G NG GKTT+ I + +SG
Sbjct: 28 IHGLIGPNGAGKTTLFNLI----SGFLR--PTSGS 56
>gnl|CDD|213264 cd03297, ABC_ModC_molybdenum_transporter, ATP-binding cassette
domain of the molybdenum transport system. ModC is an
ABC-type transporter and the ATPase component of a
molybdate transport system that also includes the
periplasmic binding protein ModA and the membrane
protein ModB. ABC transporters are a large family of
proteins involved in the transport of a wide variety of
different compounds, like sugars, ions, peptides and
more complex organic molecules. The nucleotide binding
domain shows the highest similarity between all members
of the family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 214
Score = 30.7 bits (70), Expect = 4.5
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 28 PLTLIVGENGCGKTTIIECI 47
+T I G +G GK+T++ CI
Sbjct: 24 EVTGIFGASGAGKSTLLRCI 43
>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain. These
proteins contain several 22 residue repeats which form a
pair of alpha helices. This family includes:
Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
E.
Length = 191
Score = 30.7 bits (70), Expect = 4.6
Identities = 32/182 (17%), Positives = 66/182 (36%), Gaps = 37/182 (20%)
Query: 816 TLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKN-ELNTFRSK----- 869
TLL ++ ELN+ EL+ Q + L + +E+ +L R++
Sbjct: 1 TLLSDSLDELNSYAEELQEQLGPYAQEFWA--RLSKETEALREELQKDLEDVRARLQPYL 58
Query: 870 ----------IESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELE 919
+E + RL + E+L+ + + ++L E
Sbjct: 59 DELKAKVGQNLEELRQRLAPYAEELRKRLNRDAEELRRKLAP---------------YAE 103
Query: 920 GMDSVYQTELEELGRKVAPIETQL-NLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEV 978
+ + +E L ++ P +L L+ LK+ E A++ +Q+EE+
Sbjct: 104 ELRDRLRQNVEALRARLGPYVEELRQKLAERLEELKESVGPYAEEYKAQLS---EQVEEL 160
Query: 979 KR 980
+
Sbjct: 161 RE 162
>gnl|CDD|213268 cd03301, ABC_MalK_N, The N-terminal ATPase domain of the maltose
transporter, MalK. ATP binding cassette (ABC) proteins
function from bacteria to human, mediating the
translocation of substances into and out of cells or
organelles. ABC transporters contain two
transmembrane-spanning domains (TMDs) or subunits and two
nucleotide binding domains (NBDs) or subunits that couple
transport to the hydrolysis of ATP. In the maltose
transport system, the periplasmic maltose binding protein
(MBP) stimulates the ATPase activity of the
membrane-associated transporter, which consists of two
transmembrane subunits, MalF and MalG, and two copies of
the ATP binding subunit, MalK, and becomes tightly bound
to the transporter in the catalytic transition state,
ensuring that maltose is passed to the transporter as ATP
is hydrolyzed.
Length = 213
Score = 30.7 bits (70), Expect = 4.7
Identities = 20/68 (29%), Positives = 30/68 (44%), Gaps = 12/68 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
AL R +F +DEP +NLD +K + I +THD+ +E +T
Sbjct: 140 ALGRAIVREPKVFLMDEPLSNLDAKLRVQMRAELKRLQQRLGTTTIYVTHDQ--VEAMTM 197
Query: 1299 IDRAYVVR 1306
DR V+
Sbjct: 198 ADRIAVMN 205
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 30.8 bits (71), Expect = 5.3
Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 14/71 (19%)
Query: 883 KLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGMDSVYQTELEELGRKVAPIETQ 942
+L+ LQ ++N + SK++ G K + L E++EL ++ +E +
Sbjct: 43 ELEELQAERNAL-SKEI------GQAKRKGEDAEAL-------IAEVKELKEEIKALEAE 88
Query: 943 LNLAQSELDAL 953
L+ ++EL+ L
Sbjct: 89 LDELEAELEEL 99
Score = 30.4 bits (70), Expect = 7.6
Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 5/86 (5%)
Query: 473 GNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQL-SKSLDPDQLKNEIEAWIRQ 531
G D+ ++ E + Q+ L+ LQ + N ++ EI Q K D + L E++ +++
Sbjct: 22 GFPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKE-LKE 80
Query: 532 R-NELEDELCVIDAEISILQAQ--NI 554
LE EL ++AE+ L + N+
Sbjct: 81 EIKALEAELDELEAELEELLLRIPNL 106
>gnl|CDD|218452 pfam05130, FlgN, FlgN protein. This family includes the FlgN protein
and export chaperone involved in flagellar synthesis.
Length = 140
Score = 29.7 bits (67), Expect = 5.4
Identities = 25/98 (25%), Positives = 41/98 (41%), Gaps = 3/98 (3%)
Query: 943 LNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLE--EVKRIKLEILNYTKRGTLTQLAAL 1000
L L + E +ALK + L E + Q+ QLE E +R +L G L
Sbjct: 17 LELLEEEQEALKAGDIEALEELTEEKQELLAQLEQLEEERQQLLAELGLSPGPAGLSELL 76
Query: 1001 RESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLE 1038
++L + +++ + +NE N + QSLE
Sbjct: 77 AAEDEELRELWQELKELLEEL-QELNERNGRLLEQSLE 113
>gnl|CDD|146624 pfam04089, BRICHOS, BRICHOS domain. The BRICHOS domain is about
100 amino acids long. It is found in a variety of
proteins implicated in dementia, respiratory distress
and cancer. Its exact function is unknown; roles that
have been proposed for it include (a) in targeting of
the protein to the secretory pathway, (b) intramolecular
chaperone-like function, and (c) assisting the
specialised intracellular protease processing system.
This C-terminal domain is embedded in the endoplasmic
reticulum lumen, and binds to the N-terminal,
transmembrane, SP_C, pfam08999, provided that it is in
non-helical conformation. Thus the Brichos domain of
proSP-C is a chaperone that induces alpha-helix
formation of an aggregation-prone TM region.
Length = 97
Score = 28.8 bits (65), Expect = 5.4
Identities = 19/69 (27%), Positives = 26/69 (37%), Gaps = 15/69 (21%)
Query: 762 IMKLQDTDIPSLRS-----KLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLT 816
IMK+ IPSL++ K +L TK + + P T DL+
Sbjct: 30 IMKMDKEVIPSLQALTELLKNKQLGGYGPGTKLVRELMVVVSTPPVT----------DLS 79
Query: 817 LLDQNIREL 825
L I EL
Sbjct: 80 FLGPFILEL 88
>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily). This
Pfam entry includes some of the AAA proteins not
detected by the pfam00004 model.
Length = 135
Score = 29.6 bits (67), Expect = 5.4
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 31 LIVGENGCGKTTIIECIKFALTN 53
L+VG G GK+ + E + AL+N
Sbjct: 3 LLVGPPGTGKSELAERLAAALSN 25
>gnl|CDD|213217 cd03250, ABCC_MRP_domain1, ATP-binding cassette domain 1 of
multidrug resistance-associated protein, subfamily C.
This subfamily is also known as MRP (multidrug
resistance-associated protein). Some of the MRP members
have five additional transmembrane segments in their
N-terminus, but the function of these additional
membrane-spanning domains is not clear. The MRP was
found in the multidrug-resisting lung cancer cell in
which p-glycoprotein was not overexpressed. MRP exports
glutathione by drug stimulation, as well as, certain
substrates in conjugated forms with anions, such as
glutathione, glucuronate, and sulfate.
Length = 204
Score = 30.5 bits (70), Expect = 5.5
Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 6/39 (15%)
Query: 29 LTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVH 67
L IVG G GK++++ AL E + SG V
Sbjct: 33 LVAIVGPVGSGKSSLLS----ALLGELEK--LSGSVSVP 65
>gnl|CDD|184593 PRK14258, PRK14258, phosphate ABC transporter ATP-binding
protein; Provisional.
Length = 261
Score = 30.8 bits (69), Expect = 5.5
Identities = 10/23 (43%), Positives = 18/23 (78%)
Query: 25 FQRPLTLIVGENGCGKTTIIECI 47
+Q +T I+G +GCGK+T ++C+
Sbjct: 31 YQSKVTAIIGPSGCGKSTFLKCL 53
>gnl|CDD|130344 TIGR01277, thiQ, thiamine ABC transporter, ATP-binding protein. This
model describes the energy-transducing ATPase subunit
ThiQ of the ThiBPQ thiamine (and thiamine pyrophosphate)
ABC transporter in several Proteobacteria. This protein
is found so far only in Proteobacteria, and is found in
complete genomes only if the ThiB and ThiP subunits are
also found [Transport and binding proteins, Other].
Length = 213
Score = 30.2 bits (68), Expect = 5.9
Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 12/71 (16%)
Query: 1245 RDHKALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIE 1294
R ALA R I LDEP + LD +K ++ L+++TH +
Sbjct: 134 RQRVALARCLVRPNPILLLDEPFSALDPLLREEMLALVKQLCSERQRTLLMVTHHLS--D 191
Query: 1295 NLTAIDRAYVV 1305
+ VV
Sbjct: 192 ARAIASQIAVV 202
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 31.0 bits (70), Expect = 6.0
Identities = 58/296 (19%), Positives = 107/296 (36%), Gaps = 37/296 (12%)
Query: 805 EKTALSLQGDLT--LLDQNIRELNTLQRELERQESKI-SGMRSTGVDLDQVLAQQKEKKN 861
E+ S GD+ L D N+ L + L + + S VD Q + + K
Sbjct: 85 ERNKGSAIGDVKVYLPDGNVGSKTLLSKLLGSIDRNTYEAIFSFDVDGLQEIHELNSKGI 144
Query: 862 ELNTFRSKIESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLEDRKCELEGM 921
+ SG + + + L+ L K+ + ++ +L+ L+ + E++
Sbjct: 145 L----GYLLFSGTSGSPASTKLLEVLNKEADSLYKPSGRNPQINQLLEKLKQERNEIDEA 200
Query: 922 DSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNE-EGAKIQDYTKQLEEVKR 980
+ Y T + L + A L +SEL A + + + E KQLE+
Sbjct: 201 EKEYATYHKLLESRRAEHARLAEL-RSELRADRDHIRALRDAVELWPRLQEWKQLEQELT 259
Query: 981 IKLE---------ILNYTKR------------GTLTQLAALRESVQKLNQRKEDIIAKRG 1019
+ E +L KR L +LA L++ +L KE ++
Sbjct: 260 RRREELATFPRDGVLRLEKREAHLQKTEAEIDALLVRLAELKDLASQLIPAKEAVLQALV 319
Query: 1020 VCERTINEINQSIANQSLEEI--DLKNNLTLLEKKEAVAKLNE---ELKLSEIMIS 1070
+ ++EI S L E ++ +L E+ E +L+ E M+
Sbjct: 320 RLHQQLSEIKAS--AFELTETLAGIEADLRDKEEAAGNGFEAERVHDLRSLECMLR 373
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 29.7 bits (67), Expect = 6.1
Identities = 26/98 (26%), Positives = 44/98 (44%), Gaps = 10/98 (10%)
Query: 465 KSFKQQIEGNK---KDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDPDQL 521
K +++ IE K K+L + Q + LQ L N L K + ++L
Sbjct: 57 KQYEEAIELAKKQRKELKREAGSLT----LQDVISFLQN-LKTTNPSDQALQK--ENERL 109
Query: 522 KNEIEAWIRQRNELEDELCVIDAEISILQAQNITLAEI 559
KN+ E+ ++ ELE EL + +S ++ TL +I
Sbjct: 110 KNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTLIDI 147
>gnl|CDD|218630 pfam05546, She9_MDM33, She9 / Mdm33 family. Members of this family
are mitochondrial inner membrane proteins with a role in
inner mitochondrial membrane organisation and biogenesis.
Length = 208
Score = 30.4 bits (69), Expect = 6.2
Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 15/95 (15%)
Query: 995 TQLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEEIDLKNNLTLLEK--- 1051
TQL A +E V++ + + IA+R +R +NE+ Q + S DL+ L
Sbjct: 46 TQLRAAKEEVREAKKAYSEAIARRSSSQREVNELLQRKHSWS--PTDLERFTELYRNDHT 103
Query: 1052 -KEAVAKLNEELKLSEI---------MISDLTKYH 1076
++A + E+LK +E S LT+YH
Sbjct: 104 NEQAEQEAKEKLKEAERKVEELQDQLTQSILTRYH 138
>gnl|CDD|213227 cd03260, ABC_PstB_phosphate_transporter, ATP-binding cassette
domain of the phosphate transport system. Phosphate
uptake is of fundamental importance in the cell
physiology of bacteria because phosphate is required as
a nutrient. The Pst system of E. coli comprises four
distinct subunits encoded by the pstS, pstA, pstB, and
pstC genes. The PstS protein is a phosphate-binding
protein located in the periplasmic space. PstA and PstC
are hydrophobic and they form the transmembrane portion
of the Pst system. PstB is the catalytic subunit, which
couples the energy of ATP hydrolysis to the import of
phosphate across cellular membranes through the Pst
system, often referred as ABC-protein. PstB belongs to
one of the largest superfamilies of proteins
characterized by a highly conserved adenosine
triphosphate (ATP) binding cassette (ABC), which is
also a nucleotide binding domain (NBD).
Length = 227
Score = 30.2 bits (69), Expect = 6.3
Identities = 7/18 (38%), Positives = 14/18 (77%)
Query: 30 TLIVGENGCGKTTIIECI 47
T ++G +GCGK+T++ +
Sbjct: 29 TALIGPSGCGKSTLLRLL 46
>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
This is the conserved N-terminal 350 residues of a
family of proteins of unknown function possibly
containing a coiled-coil domain.
Length = 308
Score = 30.7 bits (69), Expect = 6.4
Identities = 38/161 (23%), Positives = 68/161 (42%), Gaps = 10/161 (6%)
Query: 720 LVNKLKTKIKEIPEQTNNKKTHIDQLCKQQRSL-----QELKPVYENIMKLQDTDIPSLR 774
VNKL KI ++ T K+T ++QL +++ L QE + + + K D R
Sbjct: 136 QVNKLMRKIDKLEADTLAKQTSLEQLRREKVDLENTLEQEQEALVNRLWKRMDKLEAEKR 195
Query: 775 SKLIELEENVIETKGELKKLKTALETPKTKEKTALSLQGDLTLLDQNIRELN---TLQRE 831
+L++ V E A +T SL+ ++ L QN+ + T + E
Sbjct: 196 MLQEKLDQPVSEPPSPRDIFTEAEDTVGGIASHITSLRSEVRRLRQNLAKSEADYTEKME 255
Query: 832 LERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIES 872
+E + R + L + L ++ E++ L S+ ES
Sbjct: 256 QYAKEER--QTREENIRLQRKLLREVERREALCRQLSESES 294
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 30.6 bits (69), Expect = 6.5
Identities = 25/124 (20%), Positives = 49/124 (39%), Gaps = 16/124 (12%)
Query: 289 MSTQYQTKKTERDMIQESCNELESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQER 348
+ ++ + + ERD ++ ++LES ++ + L S+L+ K + ELE E+
Sbjct: 34 LDSELRDAEKERDTYKQYLSKLESQNVEI--SNYEALDSELDELKKEEERLLDELEELEK 91
Query: 349 L--------------KSQYIQEEKQSHTHINEAQMKLGKLERDEETHKKLNDTLKTKLNN 394
K Q EE Q N +LE + ++ + + +L+
Sbjct: 92 EDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSLNQLDK 151
Query: 395 LADT 398
L T
Sbjct: 152 LRKT 155
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 30.8 bits (69), Expect = 6.7
Identities = 41/239 (17%), Positives = 86/239 (35%), Gaps = 19/239 (7%)
Query: 250 NNTQKRDQSFEELHNIEESMKPINEKLVQLTEKERNMSVMSTQYQTKKTERDMIQESCNE 309
N T + ++ E E + + E +++N + + Q ++ E + +E +
Sbjct: 108 NETVEEEEKEES----REEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPK 163
Query: 310 LESSIKQLFSGDKAELQSKLNLFKINLDEKCSELENQERLKSQYIQEEKQSHTHINEAQM 369
S + +L+ N F E +E K + Q+E + +
Sbjct: 164 RGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKR 223
Query: 370 KLGKLERDEETHKKLNDTLKTKLNNLADTLCL---------DTTAKSQYTPEEGEGLIKM 420
+ + +EE ++ + K + L + K Q PE+G K
Sbjct: 224 EERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGLSEDKK 283
Query: 421 SQTTIDKYLSDIKILERTFSDNENTKQAEIN----ALIVEKVELESKIKSFKQQIEGNK 475
S +KI ER N + +++ + L+V K+ + +++ + IEG K
Sbjct: 284 PFKCFTPKGSSLKITERAEFLNRSIQKSSVKKSHPPLVVSKI--DDRLEQYTSAIEGTK 340
>gnl|CDD|182893 PRK11000, PRK11000, maltose/maltodextrin transporter ATP-binding
protein; Provisional.
Length = 369
Score = 30.8 bits (70), Expect = 6.7
Identities = 21/70 (30%), Positives = 32/70 (45%), Gaps = 12/70 (17%)
Query: 1245 RDHKALAETFSRNCGIFALDEPTTNLD--------IKNASDQKNFQ--LIVITHDEEFIE 1294
R A+ T +F LDEP +NLD I+ + K +I +THD+ +E
Sbjct: 139 RQRVAIGRTLVAEPSVFLLDEPLSNLDAALRVQMRIEISRLHKRLGRTMIYVTHDQ--VE 196
Query: 1295 NLTAIDRAYV 1304
+T D+ V
Sbjct: 197 AMTLADKIVV 206
>gnl|CDD|213241 cd03274, ABC_SMC4_euk, ATP-binding cassette domain of eukaryotic
SMC4 proteins. The structural maintenance of
chromosomes (SMC) proteins are large (approximately 110
to 170 kDa), and each is arranged into five
recognizable domains. Amino-acid sequence homology of
SMC proteins between species is largely confined to the
amino- and carboxy-terminal globular domains. The
amino-terminal domain contains a 'Walker A'
nucleotide-binding domain (GxxGxGKS/T, in the
single-letter amino-acid code), which by mutational
studies has been shown to be essential in several
proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B'
motif, and a motif with homology to the signature
sequence of the ATP-binding cassette (ABC) family of
ATPases. The sequence homology within the
carboxy-terminal domain is relatively high within the
SMC1-SMC4 group, whereas SMC5 and SMC6 show some
divergence in both of these sequences. In eukaryotic
cells, the proteins are found as heterodimers of SMC1
paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
(formerly known as Rad18).
Length = 212
Score = 30.0 bits (68), Expect = 6.8
Identities = 13/45 (28%), Positives = 24/45 (53%), Gaps = 2/45 (4%)
Query: 7 LHIMGIRNFP--ADKNRVVRFQRPLTLIVGENGCGKTTIIECIKF 49
+ + + NF A + + F + + IVG NG GK+ +I+ + F
Sbjct: 3 ITKLVLENFKSYAGEQVIGPFHKSFSAIVGPNGSGKSNVIDSMLF 47
>gnl|CDD|232861 TIGR00181, pepF, oligoendopeptidase F. This family represents the
oligoendopeptidase F clade of the family of larger M3 or
thimet (for thiol-dependent metallopeptidase)
oligopeptidase family. Lactococcus lactis PepF hydrolyzed
peptides of 7 and 17 amino acids with fairly broad
specificity. The homolog of lactococcal PepF in group B
Streptococcus was named PepB (PMID:8757883), with the
name difference reflecting a difference in species of
origin rather activity; substrate profiles were quite
similar. Differences in substrate specificity should be
expected in other species. The gene is duplicated in
Lactococcus lactis on the plasmid that bears it. A
shortened second copy is found in Bacillus subtilis
[Protein fate, Degradation of proteins, peptides, and
glycopeptides].
Length = 591
Score = 30.7 bits (70), Expect = 7.1
Identities = 16/86 (18%), Positives = 39/86 (45%)
Query: 922 DSVYQTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRI 981
D ++L + + + + E+ ++++ K+ ++ ++ DY + LEE+ R
Sbjct: 81 DPEANAISQKLSNLYTKVASATSFFEPEILEIEEKIIKEWLKDPEELADYKRALEEIFRD 140
Query: 982 KLEILNYTKRGTLTQLAALRESVQKL 1007
K IL+ L+ L+ + S +
Sbjct: 141 KPHILSEEVEKLLSALSEVFGSPSDI 166
>gnl|CDD|226592 COG4107, PhnK, ABC-type phosphonate transport system, ATPase
component [Inorganic ion transport and metabolism].
Length = 258
Score = 30.1 bits (68), Expect = 7.2
Identities = 13/21 (61%), Positives = 17/21 (80%)
Query: 32 IVGENGCGKTTIIECIKFALT 52
IVGE+G GKTT+++CI LT
Sbjct: 37 IVGESGSGKTTLLKCISGRLT 57
>gnl|CDD|224031 COG1106, COG1106, Predicted ATPases [General function prediction
only].
Length = 371
Score = 30.6 bits (69), Expect = 7.3
Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 3/46 (6%)
Query: 4 LDQLHIMGIRNFPADKNRVVRFQRPLTLIVGENGCGKTTIIECIKF 49
+ I ++F + + + +I G NG GK+ ++E + F
Sbjct: 2 IKSFKIKNFKSF---RELELEDFGKINIIYGANGAGKSNLLEALYF 44
>gnl|CDD|213230 cd03263, ABC_subfamily_A, ATP-binding cassette domain of the
lipid transporters, subfamily A. The ABCA subfamily
mediates the transport of a variety of lipid compounds.
Mutations of members of ABCA subfamily are associated
with human genetic diseases, such as, familial
high-density lipoprotein (HDL) deficiency, neonatal
surfactant deficiency, degenerative retinopathies, and
congenital keratinization disorders. The ABCA1 protein
is involved in disorders of cholesterol transport and
high-density lipoprotein (HDL) biosynthesis. The ABCA4
(ABCR) protein transports vitamin A derivatives in the
outer segments of photoreceptor cells, and therefore,
performs a crucial step in the visual cycle. The ABCA
genes are not present in yeast. However, evolutionary
studies of ABCA genes indicate that they arose as
transporters that subsequently duplicated and that
certain sets of ABCA genes were lost in different
eukaryotic lineages.
Length = 220
Score = 30.2 bits (69), Expect = 7.4
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 7/52 (13%)
Query: 32 IVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRIGK-KDEASASSG 82
++G NG GKTT ++ LT E +SG +++ I + A S G
Sbjct: 33 LLGHNGAGKTTTLKM----LTGELR--PTSGTAYINGYSIRTDRKAARQSLG 78
>gnl|CDD|226459 COG3950, COG3950, Predicted ATP-binding protein involved in
virulence [General function prediction only].
Length = 440
Score = 30.7 bits (69), Expect = 7.4
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 29 LTLIVGENGCGKTTIIECIKFAL 51
T+IVG NG GKTT+++ I+ AL
Sbjct: 26 TTIIVGPNGSGKTTVLDAIRNAL 48
>gnl|CDD|206711 cd04140, ARHI_like, A Ras homolog member I (ARHI). ARHI (A Ras
homolog member I) is a member of the Ras family with
several unique structural and functional properties.
ARHI is expressed in normal human ovarian and breast
tissue, but its expression is decreased or eliminated in
breast and ovarian cancer. ARHI contains an N-terminal
extension of 34 residues (human) that is required to
retain its tumor suppressive activity. Unlike most other
Ras family members, ARHI is maintained in the
constitutively active (GTP-bound) state in resting cells
and has modest GTPase activity. ARHI inhibits STAT3
(signal transducers and activators of transcription 3),
a latent transcription factor whose abnormal activation
plays a critical role in oncogenesis. Most Ras proteins
contain a lipid modification site at the C-terminus,
with a typical sequence motif CaaX, where a = an
aliphatic amino acid and X = any amino acid. Lipid
binding is essential for membrane attachment, a key
feature of most Ras proteins. Due to the presence of
truncated sequences in this CD, the lipid modification
site is not available for annotation.
Length = 165
Score = 29.4 bits (66), Expect = 7.8
Identities = 10/29 (34%), Positives = 20/29 (68%), Gaps = 1/29 (3%)
Query: 745 LCKQQRSLQELKPVYENIMKLQDTDIPSL 773
+ +Q SL+ELKP+YE I +++ ++ +
Sbjct: 82 ITSKQ-SLEELKPIYELICEIKGNNLEKI 109
>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
subunit A'' fusion protein; Provisional.
Length = 509
Score = 30.5 bits (69), Expect = 7.9
Identities = 21/104 (20%), Positives = 44/104 (42%), Gaps = 12/104 (11%)
Query: 936 VAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQLEEVKRIKLEILNYTKRGTLT 995
VA I T + +E+ + + ++K+ E + D + ++ +EI + R
Sbjct: 267 VADIITDI----AEMSVVVELDEEKMKERLIEYDDILAAISKLTFKTVEIDDGIIRLKPQ 322
Query: 996 QLAALRESVQKLNQRKEDIIAKRGVCERTINEINQSIANQSLEE 1039
Q S +KL E + + + + I I ++IA + +E
Sbjct: 323 Q-----PSFKKLYLLAEKV---KSLTIKGIKGIKRAIARKENDE 358
>gnl|CDD|172748 PRK14260, PRK14260, phosphate ABC transporter ATP-binding
protein; Provisional.
Length = 259
Score = 30.0 bits (67), Expect = 7.9
Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 25 FQRPLTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNF----VHDPRI 71
++ +T I+G +GCGK+T I+ + E P +F ++DPRI
Sbjct: 31 YRNKVTAIIGPSGCGKSTFIKTLNRISELEGPVKVEGVVDFFGQNIYDPRI 81
>gnl|CDD|181000 PRK07492, PRK07492, adenylosuccinate lyase; Provisional.
Length = 435
Score = 30.3 bits (69), Expect = 8.0
Identities = 13/35 (37%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 590 MIPEENFKNSLDKALSSITFDINRIQEDINAKEKH 624
+IP+E + +K FD+ RI E I A KH
Sbjct: 39 VIPKEAAETIWEKG-KDAEFDVARIDE-IEAVTKH 71
>gnl|CDD|224049 COG1124, DppF, ABC-type dipeptide/oligopeptide/nickel transport
system, ATPase component [Amino acid transport and
metabolism / Inorganic ion transport and metabolism].
Length = 252
Score = 29.9 bits (68), Expect = 8.0
Identities = 22/82 (26%), Positives = 35/82 (42%), Gaps = 21/82 (25%)
Query: 1263 LDEPTTNLDIKNASD--------QKNFQL--IVITHDEEFIENLTAIDRAYVV---RIVR 1309
LDEPT+ LD+ + +K L + I+HD +E++ DR V+ +IV
Sbjct: 165 LDEPTSALDVSVQAQILNLLLELKKERGLTYLFISHDLALVEHMC--DRIAVMDNGQIVE 222
Query: 1310 D------HKGLSDIHLRSLLSS 1325
S + R LL +
Sbjct: 223 IGPTEELLSHPSHPYTRELLEA 244
>gnl|CDD|216402 pfam01268, FTHFS, Formate--tetrahydrofolate ligase.
Length = 557
Score = 30.5 bits (70), Expect = 8.1
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 1392 DIHALSKQPNIVATMNTNRLRWLGHLMRIDKNRAVWK 1428
DIHA++ N++A N + G+ + ID R WK
Sbjct: 129 DIHAITAAHNLLAAAIDNHIYH-GNELDIDPRRITWK 164
>gnl|CDD|131721 TIGR02673, FtsE, cell division ATP-binding protein FtsE. This model
describes FtsE, a member of the ABC transporter
ATP-binding protein family. This protein, and its
permease partner FtsX, localize to the division site. In
a number of species, the ftsEX gene pair is located next
to FtsY, the signal recognition particle-docking protein
[Cellular processes, Cell division].
Length = 214
Score = 29.9 bits (68), Expect = 8.2
Identities = 16/74 (21%), Positives = 29/74 (39%), Gaps = 16/74 (21%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLDIKNA---------SDQKNFQLIVITHDEEFIENLTAI 1299
A+A + + DEPT NLD + +++ +IV THD ++ +
Sbjct: 147 AIARAIVNSPPLLLADEPTGNLDPDLSERILDLLKRLNKRGTTVIVATHDLSLVDRVAH- 205
Query: 1300 DRAYVVRIVRDHKG 1313
R++ G
Sbjct: 206 ------RVIILDDG 213
>gnl|CDD|183044 PRK11231, fecE, iron-dicitrate transporter ATP-binding subunit;
Provisional.
Length = 255
Score = 30.0 bits (68), Expect = 8.2
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 6/45 (13%)
Query: 29 LTLIVGENGCGKTTIIECIKFALTNEYPQGASSGKNFVHDPRIGK 73
+T ++G NGCGK+T+++C LT PQ SG F+ D I
Sbjct: 30 ITALIGPNGCGKSTLLKCFARLLT---PQ---SGTVFLGDKPISM 68
>gnl|CDD|227118 COG4778, PhnL, ABC-type phosphonate transport system, ATPase
component [Inorganic ion transport and metabolism].
Length = 235
Score = 30.1 bits (68), Expect = 8.4
Identities = 15/41 (36%), Positives = 19/41 (46%), Gaps = 9/41 (21%)
Query: 1260 IFALDEPTTNLDIKN---------ASDQKNFQLIVITHDEE 1291
I LDEPT +LD N + + L+ I HDEE
Sbjct: 173 ILLLDEPTASLDATNRAVVVELIREAKARGAALVGIFHDEE 213
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 30.6 bits (70), Expect = 8.4
Identities = 29/104 (27%), Positives = 45/104 (43%), Gaps = 16/104 (15%)
Query: 916 CELEGMDSVYQTELEELGRKVAPIETQLN------LAQSELDALKKEHKKKLNEEGAKI- 968
CE Y TELE L ++V +E +L + + KK+ KKK ++ K+
Sbjct: 358 CEQAAASPEYDTELEVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVP 417
Query: 969 --------QDYTKQ-LEEVKRIKLEILNYTKRGTLTQLAALRES 1003
++ T+Q LE +K + EIL K + A L S
Sbjct: 418 RGKIYKVLKEATRQDLELLKNVWGEILESLKAQRKSLRALLVNS 461
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 30.5 bits (69), Expect = 8.7
Identities = 34/169 (20%), Positives = 69/169 (40%), Gaps = 17/169 (10%)
Query: 214 KIQRDRLRKEIPEIKAHYQATLNYKKEADSKKQLIYNNTQKRDQSFEELHNIEESMKPIN 273
K +R RL ++ E + + S + + K + + + +
Sbjct: 188 KRKRQRLDQKAAE---YLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRFTRTEKVE 244
Query: 274 EKLVQLTEKERNMSVMSTQYQTKKTERDM--IQESCNELESSIKQLFSGDKAELQSKLNL 331
+ E + + + +TK+ E ++ +QE +ELESSI+++ S KA L SK+
Sbjct: 245 QAATAAAEVP-PAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKA-LASKIKQ 302
Query: 332 FKINLDEK---CSELENQERLKSQYIQEEKQSHTHINEAQMKLGKLERD 377
L ELE + ++K K++ + +A+ + KL+
Sbjct: 303 VNEELTTVRQENEELEEEYKIK-------KRTVELLPDAENNVAKLQAL 344
>gnl|CDD|182852 PRK10938, PRK10938, putative molybdenum transport ATP-binding
protein ModF; Provisional.
Length = 490
Score = 30.4 bits (69), Expect = 8.7
Identities = 14/29 (48%), Positives = 19/29 (65%), Gaps = 4/29 (13%)
Query: 32 IVGENGCGKTTIIECIKFALTNEYPQGAS 60
IVG NG GK+T++ I T ++PQG S
Sbjct: 291 IVGPNGAGKSTLLSLI----TGDHPQGYS 315
>gnl|CDD|211385 cd11299, O-FucT_plant, GDP-fucose protein O-fucosyltransferase,
plant specific subfamily. Some members of this
plant-specific family of O-fucosyltransferases have been
annotated as auxin-independent growth promotors. The
function of the protein seems unclear.
O-fucosyltransferase-like proteins are GDP-fucose
dependent enzymes with similarities to the family 1
glycosyltransferases (GT1). They are soluble ER proteins
that may be proteolytically cleaved from a
membrane-associated preprotein, and are involved in the
O-fucosylation of protein substrates, the core
fucosylation of growth factor receptors, and other
processes.
Length = 290
Score = 30.2 bits (69), Expect = 8.8
Identities = 19/88 (21%), Positives = 41/88 (46%), Gaps = 16/88 (18%)
Query: 162 CNLIGVSKAILNN--VIFCHQENSSWPLDEGKKVKEIFDEIFDATKYNKALES-IKIQRD 218
C+ + V++ +LN V+ +NS W D K F +I+D + K+L+ +++ +
Sbjct: 18 CDAVAVAR-LLNATLVLPELDKNSVWG-DSSK-----FGDIYDVDHFIKSLKDDVRVVKK 70
Query: 219 ------RLRKEIPEIKAHYQATLNYKKE 240
+ EI + +++ +Y E
Sbjct: 71 LPEELASKKPEITVKRVPSRSSPSYYLE 98
>gnl|CDD|225648 COG3106, COG3106, Predicted ATPase [General function prediction
only].
Length = 467
Score = 30.5 bits (69), Expect = 8.9
Identities = 10/67 (14%), Positives = 31/67 (46%), Gaps = 6/67 (8%)
Query: 1099 REYWTRIYQLKLSEIMISDLTKYHHTLENCVIKYHSQKMRSINRLIREYWTRI-YQGNDI 1157
R R++ ++ +++ + H T + H + + +L++ W ++G +
Sbjct: 320 RTLLRRLFSPRIDKLLFAATKADHVTHDQ-----HDNLVSLLRQLVQRAWQNAAFEGISM 374
Query: 1158 DYISIAA 1164
D +++A+
Sbjct: 375 DCLALAS 381
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 29.7 bits (67), Expect = 8.9
Identities = 32/112 (28%), Positives = 43/112 (38%), Gaps = 17/112 (15%)
Query: 760 ENIMKLQDTD----IPSLRSKLIELEE---NVIETKGELKKL-------KTALETPKTKE 805
E + KLQ D SL ELE+ NV + L KL LE K E
Sbjct: 57 EEVKKLQIDDSISEFESLSKCFSELEKHGFNVQALQSRLNKLLSLKDDQTKKLEERKGLE 116
Query: 806 KTALSLQGDLTLLDQNIRELNTLQRELERQESKISGMRSTGVDLDQVLAQQK 857
K + LD I EL EL+RQ + ++ D+ +A+ K
Sbjct: 117 KEIAEKEISRQELDSEIAELERKILELQRQAAL---LKEKKEAEDKEIARLK 165
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 30.1 bits (68), Expect = 9.1
Identities = 36/122 (29%), Positives = 58/122 (47%), Gaps = 21/122 (17%)
Query: 870 IESGQTRLNSHNEKLQSLQKQKNDIHSKQLTVQGGAGMLKSLE----DRKCELEGMDSVY 925
+ESG TR N +KL Q+++ + ++ G M K LE D K E+E ++ +
Sbjct: 100 VESGATR-NYEADKLDEEQEERVEKEREE--ELAGDAM-KKLENRTADSKREMEVLERL- 154
Query: 926 QTELEELGRKVAPIETQLNLAQSELDALKKEHKKKLNEEGAKIQDYTKQL------EEVK 979
EL+EL + A ++ S L+AL + KK+ EE + + K L EE +
Sbjct: 155 -EELKELQSRRADVDVN-----SMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDR 208
Query: 980 RI 981
R
Sbjct: 209 RR 210
>gnl|CDD|224053 COG1129, MglA, ABC-type sugar transport system, ATPase component
[Carbohydrate transport and metabolism].
Length = 500
Score = 30.2 bits (69), Expect = 9.2
Identities = 7/16 (43%), Positives = 14/16 (87%)
Query: 32 IVGENGCGKTTIIECI 47
++GENG GK+T+++ +
Sbjct: 39 LLGENGAGKSTLMKIL 54
>gnl|CDD|212011 cd11621, HR1_PKC-like_1_fungi, First Protein kinase C-related
kinase homology region 1 (HR1) Rho-binding domain of
fungal Protein Kinase C-like proteins. This subfamily
is composed of fungal PKC-like proteins including Pkc1p
from Saccharomyces cerevisiae, and Pck1p and Pck2p from
Schizosaccharomyces pombe. The yeast PKC-like proteins
play a critical role in regulating cell wall
biosynthesis and maintaining cell wall integrity. They
contain two HR1 domains, C2 and C1 domains, and a kinase
domain. This model characterizes the first HR1 domain.
HR1 domains are anti-parallel coiled-coil (ACC) domains
that bind small GTPases from the Rho family. The HR1
domains of Pck1p and Pck2p interact with GTP-bound Rho1p
and Rho2p.
Length = 72
Score = 27.7 bits (62), Expect = 9.4
Identities = 14/74 (18%), Positives = 30/74 (40%), Gaps = 8/74 (10%)
Query: 825 LNTLQRELERQESKISGMRSTGVDLDQVLAQQKEKKNELNTFRSKIESGQTRLNSHNEKL 884
+ + +++ER+ S I G ++ Q+ S I ++ ++ E+L
Sbjct: 7 IQDVYKKIERERSLIQGAKAMKKSTKNPEVIQR--------LNSNIRESRSNIDYLEERL 58
Query: 885 QSLQKQKNDIHSKQ 898
L +K + S Q
Sbjct: 59 NKLTLRKAGVSSGQ 72
>gnl|CDD|217899 pfam04108, APG17, Autophagy protein Apg17. Apg17 is required for
activating Apg1 protein kinases.
Length = 408
Score = 30.0 bits (68), Expect = 9.4
Identities = 17/81 (20%), Positives = 31/81 (38%), Gaps = 2/81 (2%)
Query: 459 ELESKIKSFKQQIEGNKKDLTNVITQINEVNQSQSTLQVLQTKLNRVNSEIDQLSKSLDP 518
L +K + +++ + L N++ Q + +S + L K + + Q S D
Sbjct: 126 LLRDNLKIYIKEVPAIRAQLDNILNQFDSDLRSLK--EQLINKRLKDVTLPFQSSFGADT 183
Query: 519 DQLKNEIEAWIRQRNELEDEL 539
I + N LE EL
Sbjct: 184 TSSNPLISTIPSELNSLEHEL 204
>gnl|CDD|213259 cd03292, ABC_FtsE_transporter, ATP-binding cassette domain of the
cell division transporter. FtsE is a hydrophilic
nucleotide-binding protein that binds FtsX to form a
heterodimeric ATP-binding cassette (ABC)-type transporter
that associates with the bacterial inner membrane. The
FtsE/X transporter is thought to be involved in cell
division and is important for assembly or stability of
the septal ring.
Length = 214
Score = 29.7 bits (67), Expect = 9.5
Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 9/56 (16%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD---------IKNASDQKNFQLIVITHDEEFIEN 1295
A+A + I DEPT NLD + ++ ++V TH +E ++
Sbjct: 146 AIARAIVNSPTILIADEPTGNLDPDTTWEIMNLLKKINKAGTTVVVATHAKELVDT 201
>gnl|CDD|213266 cd03299, ABC_ModC_like, ATP-binding cassette domain similar to the
molybdate transporter. Archaeal protein closely related
to ModC. ModC is an ABC-type transporter and the ATPase
component of a molybdate transport system that also
includes the periplasmic binding protein ModA and the
membrane protein ModB. ABC transporters are a large
family of proteins involved in the transport of a wide
variety of different compounds, like sugars, ions,
peptides, and more complex organic molecules. The
nucleotide binding domain shows the highest similarity
between all members of the family. ABC transporters are a
subset of nucleotide hydrolases that contain a signature
motif, Q-loop, and H-loop/switch region, in addition to,
the Walker A motif/P-loop and Walker B motif commonly
found in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 235
Score = 29.6 bits (67), Expect = 9.7
Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 12/68 (17%)
Query: 1249 ALAETFSRNCGIFALDEPTTNLD----------IKNASDQKNFQLIVITHDEEFIENLTA 1298
A+A N I LDEP + LD +K + ++ +THD F E
Sbjct: 139 AIARALVVNPKILLLDEPFSALDVRTKEKLREELKKIRKEFGVTVLHVTHD--FEEAWAL 196
Query: 1299 IDRAYVVR 1306
D+ ++
Sbjct: 197 ADKVAIML 204
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.130 0.362
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 73,138,964
Number of extensions: 7310442
Number of successful extensions: 10508
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9087
Number of HSP's successfully gapped: 968
Length of query: 1491
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1382
Effective length of database: 6,103,016
Effective search space: 8434368112
Effective search space used: 8434368112
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 66 (29.2 bits)