RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12680
(2567 letters)
>gnl|CDD|193250 pfam12774, AAA_6, Hydrolytic ATP binding site of dynein motor
region D1. the 380 kDa motor unit of dynein belongs to
the AAA class of chaperone-like ATPases. The core of the
380 kDa motor unit contains a concatenated chain of six
AAA modules, of which four correspond to the ATP binding
sites with P-loop signatures described previously, and
two are modules in which the P loop has been lost in
evolution. This particular family is the D1 unit of the
motor and contains the hydrolytic ATP binding site.
Length = 231
Score = 194 bits (494), Expect = 8e-56
Identities = 85/145 (58%), Positives = 105/145 (72%)
Query: 1 MGGAPEGPAGTGKTETTKDLAKAVAKLCIVFNCSDSMDYIGLGKFFKGLIATGAWACFDE 60
M GAP GPAGTGKTETTKDL +A+ + VFNCS+ MDY G +KGL TGAW CFDE
Sbjct: 33 MSGAPAGPAGTGKTETTKDLGRALGIMVYVFNCSEQMDYKSCGNIYKGLAQTGAWGCFDE 92
Query: 61 FNRIDVEVLSVVAQQILTIQRGVTLGEEEILFEGTILKLDRTCSVFITMNPGYAGRSELP 120
FNRI VEVLSVVA Q+ +Q + ++ F G + L + +FITMNPGYAGR+ELP
Sbjct: 93 FNRISVEVLSVVAVQVKCVQDAIRDKKQWFNFLGEEISLIPSVGIFITMNPGYAGRTELP 152
Query: 121 DNLKSLFRTVAMMVPDYALISEIIV 145
+NLK+LFR AM+VPD+ LI EI++
Sbjct: 153 ENLKALFRPCAMVVPDFELICEIML 177
Score = 71.2 bits (174), Expect = 3e-13
Identities = 32/63 (50%), Positives = 40/63 (63%), Gaps = 5/63 (7%)
Query: 1106 YRMLSEALQLIEEWSNQGGAPEGPAGTGKTETTKDLAKAVAKLCIVFNCSDSMDYIGLGK 1165
Y L+++L L GAP GPAGTGKTETTKDL +A+ + VFNCS+ MDY G
Sbjct: 22 YITLTQSLHLT-----MSGAPAGPAGTGKTETTKDLGRALGIMVYVFNCSEQMDYKSCGN 76
Query: 1166 FFK 1168
+K
Sbjct: 77 IYK 79
>gnl|CDD|193251 pfam12775, AAA_7, P-loop containing dynein motor region D3. the 380
kDa motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules, of
which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This particular family is the D3 and is an ATP binding
site.
Length = 272
Score = 183 bits (466), Expect = 2e-51
Identities = 90/285 (31%), Positives = 139/285 (48%), Gaps = 18/285 (6%)
Query: 1307 DMPVNQIIVTTEETLRNIALMKLLVTHQKPVMFIGPTGTGKSCYITVSILFIKNNSRLRS 1366
+MP+ +V T ET+R M LL+ Q+PVM +G GTGKS +
Sbjct: 6 EMPLQACLVHTSETIRVRYFMDLLMERQRPVMLVGNAGTGKSVLVGDK------------ 53
Query: 1367 GFQHFLLKELSIEQWQPLIMNFSAQTSANQTQDIIMSKLDKRRKGVYGPPLGKRCVVFVD 1426
L L + + + F+ T++ Q ++ L+K+ YGPP K+ V F+D
Sbjct: 54 ------LASLDPDAYLVKNVPFNYYTTSAMLQAVLEKPLEKKAGRNYGPPGTKKLVYFID 107
Query: 1427 DVNMPLKEEFGAQPPIEILRQWLDHWMWYDRKDVVAVKLIEIQLMCAMGPPSTGNTVTPR 1486
D+NMP + +G P ++RQ +D+ WYDR + ++ Q + M P + T+ R
Sbjct: 108 DMNMPEVDAYGTVQPHTLIRQHMDYGHWYDRSKLTLKEIHNCQYVSCMNPTAGSFTINSR 167
Query: 1487 FSRHFNQIVINKFDDDTMVTIFSKILLWHLDTRGFSKEFDPCIEQLVQATLHIFKESLLN 1546
RHF ++ D + TI++ IL HL S LV A L + ++
Sbjct: 168 LQRHFCVFALSFPGQDALSTIYNIILTQHLALGSVSAALQKSSPPLVDAALALHQKIATT 227
Query: 1547 LLPTPTKSHYLFNLRDFARVIQGVLLSVPEAIESLLSMKRLWVHE 1591
LPT K HY+FNLRD + + QG+L S E ++S + RLW+HE
Sbjct: 228 FLPTAIKFHYVFNLRDLSNIFQGLLFSSVECLKSPWDLIRLWLHE 272
Score = 91.9 bits (228), Expect = 6e-20
Identities = 41/132 (31%), Positives = 68/132 (51%)
Query: 900 FSAQTSANQTQDIIMSKLDKRRKGVYGPPLGKRCVVFVDDVNMPLKEEFGAQPPIEILRQ 959
F+ T++ Q ++ L+K+ YGPP K+ V F+DD+NMP + +G P ++RQ
Sbjct: 69 FNYYTTSAMLQAVLEKPLEKKAGRNYGPPGTKKLVYFIDDMNMPEVDAYGTVQPHTLIRQ 128
Query: 960 WLDHWMWYDRKDVVAVKLIEIQLMCAMGPPSTGNTVTPRFSRHFNQIVINKFDDDTMVTI 1019
+D+ WYDR + ++ Q + M P + T+ R RHF ++ D + TI
Sbjct: 129 HMDYGHWYDRSKLTLKEIHNCQYVSCMNPTAGSFTINSRLQRHFCVFALSFPGQDALSTI 188
Query: 1020 FSKILLWHLDTR 1031
++ IL HL
Sbjct: 189 YNIILTQHLALG 200
Score = 47.2 bits (112), Expect = 5e-05
Identities = 18/57 (31%), Positives = 25/57 (43%)
Query: 1697 FRGFSKEFDPCIEQLVQATLHIFKESLLNLLPTPTKSHYLFNLRDFARVIQYLTVTG 1753
S LV A L + ++ LPT K HY+FNLRD + + Q L +
Sbjct: 199 LGSVSAALQKSSPPLVDAALALHQKIATTFLPTAIKFHYVFNLRDLSNIFQGLLFSS 255
Score = 45.3 bits (107), Expect = 2e-04
Identities = 20/42 (47%), Positives = 27/42 (64%)
Query: 754 DMPVNQIIVTTEETLRNIALMKLLVTHQKPVMFIGPTGTGKS 795
+MP+ +V T ET+R M LL+ Q+PVM +G GTGKS
Sbjct: 6 EMPLQACLVHTSETIRVRYFMDLLMERQRPVMLVGNAGTGKS 47
>gnl|CDD|193253 pfam12777, MT, Microtubule-binding stalk of dynein motor. the 380
kDa motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules, of
which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This family is the region between D4 and D5 and is the
two predicted alpha-helical coiled coil segments that
form the stalk supporting the ATP-sensitive microtubule
binding component.
Length = 344
Score = 159 bits (404), Expect = 2e-42
Identities = 115/373 (30%), Positives = 183/373 (49%), Gaps = 36/373 (9%)
Query: 2093 RYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEAAIVAEGVKK 2152
R GL KL ASQ+ ++ ++ I+E+ + + E A+ I GV+
Sbjct: 2 RLENGLMKLQSTASQVDDLKAKLA------------IQEVELKQKNEDADKLIQVVGVET 49
Query: 2153 DEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAA 2212
E+V+ E+A+A Q VA V N++ Q +CEE L++A P L AA+ A
Sbjct: 50 -EKVSKEKAIADQEEVKVA--------VINKEVTEKQ---KDCEEDLAKAEPALLAAQEA 97
Query: 2213 LNTLTSNDITVVKTMKSPPDIVKLVMKAVCILKGVKSERVPDAGGQLVED-YWGPSKKLL 2271
LNTL N++T +K+ SPPD V V AV +L GG++ +D W +K ++
Sbjct: 98 LNTLNKNNLTELKSFGSPPDAVVNVTAAVMVLTA--------PGGKIPKDRSWKAAKVMM 149
Query: 2272 GDIK-FLEGLTNFNKDNVPAAVIKRLEDEFLSREDFDPEVVKKSSTAAEGLCKWVIAICK 2330
G + FL+ L NF+K+N+ +K ++ +L +FDPE ++ STAA GLC WVI I +
Sbjct: 150 GKVDTFLDSLINFDKENIHENCLKAIK-PYLGDPEFDPEFIRSKSTAAAGLCSWVINIVR 208
Query: 2331 YDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDL 2390
+ +V V PK++AL +A +L A L + + + L L + +
Sbjct: 209 FYEVYCDVEPKRQALEKANAELAAAQEKLARIKAKIAELNANLANLTAAFEKATADKLKC 268
Query: 2391 QDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTGDVLIAAGVVAYLGPFT 2450
Q E + + A L+G L E RW+ + + L GDVL+ V+Y+G FT
Sbjct: 269 QQEAEATNRTILLANRLVGGLASENVRWAEAVENFKSQEVTLCGDVLLITAFVSYVGYFT 328
Query: 2451 VAFRQQ-QVSTWV 2462
+R + W+
Sbjct: 329 KKYRNELMERFWI 341
>gnl|CDD|219818 pfam08393, DHC_N2, Dynein heavy chain, N-terminal region 2.
Dyneins are described as motor proteins of eukaryotic
cells, as they can convert energy derived from the
hydrolysis of ATP to force and movement along
cytoskeletal polymers, such as microtubules. This region
is found C-terminal to the dynein heavy chain N-terminal
region 1 (pfam08385) in many members of this family. No
functions seem to have been attributed specifically to
this region.
Length = 408
Score = 99.2 bits (248), Expect = 2e-21
Identities = 46/134 (34%), Positives = 77/134 (57%), Gaps = 5/134 (3%)
Query: 142 EIIVKKLAPYKQLYDNCIEYQEKHKMWAESIVGTFDPEQIEEETNEFYKNIYKLKLQFSH 201
+ + K+L P K+L+D E+ E + W + T DPE++E E +F K + KL
Sbjct: 4 QELEKELKPLKKLWDLYSEWSESIEEWKNTPWSTIDPEELESEIEKFLKQLRKLPKNLRQ 63
Query: 202 LPGPYGIASKVLDQVQEFKEHMPIIQTLGNPGLQLRHWEKISEVVGFPLSPEMEDFTLQR 261
Y +A ++ ++ +FK+ +P+I+ L NP L+ RHWE+I +++GF P+ EDFTL
Sbjct: 64 ----YEVAEELKKKIDDFKKSLPLIEDLKNPALKERHWEQIFKLLGFKDFPD-EDFTLGD 118
Query: 262 IFNYGLEEYVPKFE 275
+ + GL Y + E
Sbjct: 119 LLDLGLLNYEEEIE 132
Score = 45.7 bits (109), Expect = 2e-04
Identities = 16/36 (44%), Positives = 20/36 (55%)
Query: 276 IWKDIMKTVAADPRVMSVVEIPKLKENLQKCNGLLD 311
WK +MK DP V+ V IP L E L+K N L+
Sbjct: 259 EWKKLMKKANKDPNVLEVCNIPGLLEKLEKLNEQLE 294
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 101 bits (254), Expect = 4e-21
Identities = 129/745 (17%), Positives = 264/745 (35%), Gaps = 116/745 (15%)
Query: 1852 SITMGQLYGQFDAVSHEWSDGILAVSYRQFAMSQNENRKWLIFDG-PVDAIWIENMNSVL 1910
M +++G D ++ ++ D + R IF+ PV++ ++E+ N +L
Sbjct: 1861 PRNMREIFGHRDELTGDFRDSLKVQDLR--RNIHGGRECLFIFESIPVESSFLEDFNPLL 1918
Query: 1911 DDNKKLCLMSG-EIIALAPTTSLIFEPQDLEVASPATITDESMVEDLSNLLNSGEVPNIF 1969
D+N+ LCL SG E I + +FE LE + AT+T +V NL P +F
Sbjct: 1919 DNNRFLCLFSGNERIRIPENLRFVFESTSLEKDTEATLTRVFLVYMEENL------PVVF 1972
Query: 1970 ASDEKAEICEKMGVIDRQKDKSMQTDGTMMALF-KFFVD--TPWPPDALLAVATRFLNEV 2026
+ C +D S+ AL + F+D W + + A N V
Sbjct: 1973 S------ACC-------SQDTSVLAGIRSPALKNRCFIDFKKLWDTEEMSQYA----NSV 2015
Query: 2027 ELSEAERQISIDMCQNFHVSTQ-------NLSDEFLVKT----------SRHVYVTPTSY 2069
E +S D + F ++ + +S+ F S S
Sbjct: 2016 E------TLSRDGGRVFFINGELGVGKGALISEVFGDDAVVIEGRGFEISMIEGSLGESK 2069
Query: 2070 LELI---STFKQLLKVKQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSA 2126
++ I + + EE+ G+ K ++ + ++E++ L V
Sbjct: 2070 IKFIGGLKVYDARCVIYIEELDCTNVNLVEGVRKYNEYGRGMGELKEQLSNTVVILGVK- 2128
Query: 2127 ANIKELMVTVEKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQAL 2186
EK + +A G + + EV + A + M + EE ++
Sbjct: 2129 ----------EKNADDALSGTPGERLEREVKSVFVEAPRDM------LFLLEEEVRKR-- 2170
Query: 2187 AAQAMKSECEEILSEAIPILEAAEAALNTLTSNDITVVKTMKSPPDIVKLVMKAVCILKG 2246
S + S+ +LEA ++ + +++ PP + + M+ VC L G
Sbjct: 2171 ----KGSVMKFKSSKKPAVLEAVLFVYKIKKAS-LREIRSFIRPPGDLCIEMEDVCDLLG 2225
Query: 2247 VKSERVPDAGGQLVEDYWGPSKKLLGDIKFLEGLTNFNKDNVPAAVIKR-LEDEFLSRED 2305
+++ ++G + L D F+ + + + +R E S
Sbjct: 2226 FEAK-----------IWFGEQQSLRRD-DFIRIIGKYPDEIEFDLEARRFREARECSDPS 2273
Query: 2306 FDPEVVKKSSTAAEGLCKWVIAICKYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQ 2365
F ++ ++S A L +W++ C KV ++ P +E ++ + + L L +
Sbjct: 2274 FTGSILNRASKACGPLKRWLVRECNRSKVLEVKIPLREEEKRIDGEAFLVEDRLTLGKGL 2333
Query: 2366 NRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKML 2425
+ + + ++ + ++D K R+ + L E W +
Sbjct: 2334 SSDLMTFKLRRRSYYSLDILRVHGKIADMDTVHKDVLRSIFVSEILINEDSEWGGVFSEV 2393
Query: 2426 NEKYYQLTGDVLIAAGVVAYLGPFTVAFRQQQVSTWVKEVAACNIVCTKDFQLGAVMGNP 2485
+ +L GD ++ + Y+G T+ F + + E I +K+F+
Sbjct: 2394 PKLMVELDGDGHPSSCLHPYIG--TLGFLCRAI-----EFGMSFIRISKEFRD-----KE 2441
Query: 2486 VEIRAWNIFGLPCD----------SFSIDNGIIIKNSRR-WPLMIDPQEQANKWIKNMEK 2534
+ R + G+ + ++N I K+ + ++ DP + + M
Sbjct: 2442 IRRRQFITEGVQKIEDFKEEACSTDYGLENSRIRKDLQDLTAVLNDPSSKIVTSQRQMYD 2501
Query: 2535 ANGLNVIRLSNTDYMRILEGAIGFG 2559
+ ++ L A G
Sbjct: 2502 EKKAILGSFREMEFAFGLSQARREG 2526
Score = 74.3 bits (182), Expect = 9e-13
Identities = 53/319 (16%), Positives = 114/319 (35%), Gaps = 23/319 (7%)
Query: 1308 MPVNQIIVTTEETLRNIALMKLLVTHQKPVMFIGPTGTGKSCYITVSILFIKNNSRLRSG 1367
M ++++ T +T + + + ++ GP G+GK + S
Sbjct: 1468 MLRKEVVIPTSDTGFVDSFSNEALNTLRSYIYCGPPGSGKE--------MLMCPSLRSEL 1519
Query: 1368 FQHFLLKELSIEQWQPLIMNFSAQTSANQTQDIIMSKLDKRRKGVYGPPLGKRCVVFVDD 1427
S P ++ + + ++ K P+ K V+F D+
Sbjct: 1520 ITEVKYFNFSTCTMTPSKLSVLERETEYYPNTGVVRLYPK--------PVVKDLVLFCDE 1571
Query: 1428 VNMPLKEEFGAQPPIEILRQWLDHWMWYDRKDVVAVKLIEIQLMCAMGPPST--GNTVTP 1485
+N+P E+ I LR ++ ++ V V + I L A P +
Sbjct: 1572 INLPYGFEYYPPTVIVFLRPLVERQGFWSSIAVSWVTICGIILYGACNPGTDEGRVKYYE 1631
Query: 1486 RFSRHFNQIVINKFDDDTMVTIFSKILLWHLDTRGFSKEFDPCIEQLVQATLHIFKESLL 1545
RF R + + ++ I+ +L + + EF+ E+ + A++ ++ S
Sbjct: 1632 RFIRKPVFVFCCYPELASLRNIYEAVL---MGSYLCFDEFNRLSEETMSASVELYLSSKD 1688
Query: 1546 NLLPTPTKSHYLFNLRDFARVIQGVLLSVPEAIES-LLSMKRLWVHEVLRVYYDRLVDSD 1604
+ +Y + R+ R ++ + I++ +S+ W E +R DRLV
Sbjct: 1689 K-TKFFLQMNYGYKPRELTRSLRAIFGYAETRIDTPDVSLIIDWYCEAIREKIDRLVQQK 1747
Query: 1605 DRLWLFNTLKYTVEKFLQE 1623
+ L + ++E
Sbjct: 1748 ESSTSRQDLYDFGLRAIRE 1766
Score = 58.1 bits (140), Expect = 7e-08
Identities = 23/161 (14%), Positives = 48/161 (29%), Gaps = 19/161 (11%)
Query: 9 AGTGKTETTKDLAKAVAKLCIVFNCSDSMDYIGLGKFFKGLIATGAWACFDEFNRIDVEV 68
T + L ++ D + + G I +E +D
Sbjct: 952 VDTENSRVYGMLVAGKGRIY------DGTEP--RSRIEAGPICEEERG-TEESALLDEIS 1002
Query: 69 LSVVAQQILTIQRGVTLGEEEI-LFEGTILKLDRTCSVFITMNPGYAGRSELPDNLKSLF 127
+++ + L + EE LK T V + +N R+ + + +
Sbjct: 1003 RTILVDEYL-NSDEFRMLEELNSAVVEHGLKSPST-PVEMIINE----RNIVLEIGRRAL 1056
Query: 128 RTVAMMVPDYALIS--EIIVKKLAPYKQLYDN-CIEYQEKH 165
+P A+ S E + +++ + D E E
Sbjct: 1057 DMFLSNIPFGAIKSRRESLDREIGAFNNEVDGIAREEDELM 1097
Score = 47.7 bits (113), Expect = 1e-04
Identities = 31/107 (28%), Positives = 55/107 (51%), Gaps = 4/107 (3%)
Query: 412 SITMGQLYGQFDAVSHEWSDGILAVSYRQFAMSQNENRKWLIFDG-PVDAIWIENMNSVL 470
M +++G D ++ ++ D + R IF+ PV++ ++E+ N +L
Sbjct: 1861 PRNMREIFGHRDELTGDFRDSLKVQDLR--RNIHGGRECLFIFESIPVESSFLEDFNPLL 1918
Query: 471 DDNKKLCLMSG-EIIALAPTTSLIFEPQDLEVASPATVSRCGMIYME 516
D+N+ LCL SG E I + +FE LE + AT++R ++YME
Sbjct: 1919 DNNRFLCLFSGNERIRIPENLRFVFESTSLEKDTEATLTRVFLVYME 1965
Score = 36.5 bits (84), Expect = 0.25
Identities = 27/226 (11%), Positives = 60/226 (26%), Gaps = 51/226 (22%)
Query: 289 RVMSVVEIPKLKENLQKCNGLLDMGITSDLFPGIELPETDYTILNKAVREVCEKANLQVT 348
+++S+ +P + + L++ LD ++ + + + + QV+
Sbjct: 1121 KILSITGLPLISDTLRERIDTLD---------------AEWDSFCRISESLKKYESQQVS 1165
Query: 349 PFFLEKIQQIYEMMIVRWGFMIVGLSFAGKTCAYRMLSEALQLIEEWGELGEHKVEIIVI 408
+ + + F K Y + L W H V
Sbjct: 1166 GLDVAQFVSFLRSVDTGAFHAEYFRVFLCKIKHYTDACDYL-----W-----HVKSPYVK 1215
Query: 409 NPKSITMGQLYGQFDAVSHEWSDGILAVSYRQFAMSQNENRKWLIFDGPVDAIWIENMNS 468
FDA E L + +++ + ++E +
Sbjct: 1216 KK----------YFDA-DMELRQFFLMFNREDMEARLADSKMEYEVER-----YVEKTKA 1259
Query: 469 VLDDNKKLCLMSGEIIALAPTTSLIFEPQDLEVASPATVSRCGMIY 514
+ K GE ++ + V RC + Y
Sbjct: 1260 EVSSLKLELSSVGE-------GQVVVSNLG---SIGDKVGRCLVEY 1295
Score = 36.1 bits (83), Expect = 0.35
Identities = 23/103 (22%), Positives = 42/103 (40%), Gaps = 2/103 (1%)
Query: 924 VYGPPLGKRCVVFVDDVNMPLKEEFGAQPPIEILRQWLDHWMWYDRKDVVAVKLIEIQLM 983
+Y P+ K V+F D++N+P E+ I LR ++ ++ V V + I L
Sbjct: 1556 LYPKPVVKDLVLFCDEINLPYGFEYYPPTVIVFLRPLVERQGFWSSIAVSWVTICGIILY 1615
Query: 984 CAMGPPST--GNTVTPRFSRHFNQIVINKFDDDTMVTIFSKIL 1024
A P + RF R + + ++ I+ +L
Sbjct: 1616 GACNPGTDEGRVKYYERFIRKPVFVFCCYPELASLRNIYEAVL 1658
Score = 35.0 bits (80), Expect = 0.68
Identities = 28/245 (11%), Positives = 62/245 (25%), Gaps = 40/245 (16%)
Query: 1767 TDYTILNKAVREVCEKANLQVTPFFLEKIQQIYEMMIVRWGFMIVGLSFAGKTCAYRMLS 1826
++ + + + + QV+ + + + F K Y
Sbjct: 1144 AEWDSFCRISESLKKYESQQVSGLDVAQFVSFLRSVDTGAFHAEYFRVFLCKIKHYTDAC 1203
Query: 1827 EALQLIEEWGELGEHKVEIIVINPKSITMGQLYGQFDAVSHEWSDGILAVSYRQFAMSQN 1886
+ L W H V FDA E L +
Sbjct: 1204 DYL-----W-----HVKSPYVKKK----------YFDA-DMELRQFFLMFNREDMEARLA 1242
Query: 1887 ENRKWLIFDGPVDAIWIENMNSVLDDNKKLCLMSGEIIALAPTTSLIFEPQDLEVASPAT 1946
+++ + ++E + + K GE ++ +
Sbjct: 1243 DSKMEYEVER-----YVEKTKAEVSSLKLELSSVGE-------GQVVVSNLG---SIGDK 1287
Query: 1947 ITDESMVEDLSNLLNSGEVPNIFASDEKAEICEKMGVIDRQKDKSMQTDGTMMAL----F 2002
+ + D + L++ V D K + E + ++ + D M
Sbjct: 1288 VGRCLVEYDSISRLSTKGVFLDELGDTKRYLDECLDFFSCFEEVQKEIDELSMVFCADAL 1347
Query: 2003 KFFVD 2007
+F D
Sbjct: 1348 RFSAD 1352
>gnl|CDD|193257 pfam12781, AAA_9, ATP-binding dynein motor region D5. The 380 kDa
motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules, of
which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This particular family is the D5 ATP-binding region of
the motor, but has lost its P-loop.
Length = 228
Score = 72.7 bits (178), Expect = 9e-14
Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 2487 EIRAWNIFGLPCDSFSIDNGIIIKNSRRWPLMIDPQEQANKWIKNMEKANGLNVIRLSNT 2546
++ WN GLP D S +N I+ N RWPLM+DPQ Q KWIKN + + L VIR+
Sbjct: 7 DVATWNNEGLPSDRMSTENATILGNCERWPLMVDPQLQGIKWIKN-KYGSDLRVIRIGQK 65
Query: 2547 DYMRILEGAIGFG 2559
Y+ ++E AI G
Sbjct: 66 GYLDVIEQAISEG 78
>gnl|CDD|193256 pfam12780, AAA_8, P-loop containing dynein motor region D4. The 380
kDa motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules, of
which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This particular family is the D4 ATP-binding region of
the motor.
Length = 268
Score = 73.1 bits (179), Expect = 1e-13
Identities = 50/175 (28%), Positives = 76/175 (43%), Gaps = 43/175 (24%)
Query: 1945 ATITDESMVEDLSNLLNSGEVPNIFASDEKAEICEKMGVIDRQKDKSMQTDGTMMALFKF 2004
A + DE + +++LL SGE+P++F DE I + R + KS+ + T +KF
Sbjct: 98 AQVADEQFLVLINDLLASGEIPDLFMDDEVENIISSV----RNEVKSLGLNDTRENCWKF 153
Query: 2005 FVD------------TP--------------------------WPPDALLAVATRFLNEV 2026
F+D +P WP DAL++V+ RFL E
Sbjct: 154 FIDRVRRQLKVILCFSPVGSVLRVRSRKFPAVVNCTAIDWFHEWPQDALVSVSARFLEET 213
Query: 2027 ELSEAERQISIDMCQNF-HVSTQNLSDEFLVKTSRHVYVTPTSYLELISTFKQLL 2080
E E + SI + H S +S +L R+ Y TP S+LE I ++ LL
Sbjct: 214 EGIPWEVKASISFFMAYVHTSVNEMSRVYLANERRYNYTTPKSFLEQIKLYQSLL 268
>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily). This Pfam
entry includes some of the AAA proteins not detected by
the pfam00004 model.
Length = 135
Score = 44.6 bits (106), Expect = 7e-05
Identities = 25/162 (15%), Positives = 50/162 (30%), Gaps = 33/162 (20%)
Query: 1336 PVMFIGPTGTGKSCYITVSILFIKNNSRLRSGFQHFLLKELS--IEQWQPLIMNFSAQTS 1393
V+ +GP GTGKS L + L+ + + + T+
Sbjct: 1 GVLLVGPPGTGKS----------------------ELAERLAAALSNRPVFYVQLTRDTT 38
Query: 1394 ANQTQDI--IMSKLDKRRKGVYGPPLGKRCVVFVDDVNMPLKEEFGAQ-PPIEILRQWLD 1450
+ I G + + +D++N + + ++ R L
Sbjct: 39 EEDLKGRRNIDPGGASWVDGPLVRAAREGEIAVLDEINRANPDVLNSLLSLLDERRLLLP 98
Query: 1451 HWMWYDRKDVVAVKLIEIQLMCAMGPPSTG-NTVTPRFSRHF 1491
+ ++V +L+ M P G N ++P F
Sbjct: 99 -----EGGELVKAAPDGFRLIATMNPLDRGLNELSPALRSRF 135
Score = 43.1 bits (102), Expect = 2e-04
Identities = 38/139 (27%), Positives = 55/139 (39%), Gaps = 27/139 (19%)
Query: 7 GPAGTGKTETTKDLAKAV--AKLCIVFNCSD--------SMDYIGLGKFFKGLIAT---- 52
GP GTGK+E + LA A+ + V D + G +
Sbjct: 6 GPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPLVRAAR 65
Query: 53 -GAWACFDEFNRIDVEVLSVVAQQILTIQRGVTLGEEEILFE--GTILKLDRTCSVFI-T 108
G A DE NR + +VL+ + L++ L E +L G ++K I T
Sbjct: 66 EGEIAVLDEINRANPDVLNSL----LSL-----LDERRLLLPEGGELVKAAPDGFRLIAT 116
Query: 109 MNPGYAGRSELPDNLKSLF 127
MNP G +EL L+S F
Sbjct: 117 MNPLDRGLNELSPALRSRF 135
Score = 37.3 bits (87), Expect = 0.022
Identities = 31/154 (20%), Positives = 53/154 (34%), Gaps = 29/154 (18%)
Query: 367 GFMIVGLSFAGKTCAYRMLSEALQLIEEWGELGEHKVEIIVINPKSITMGQLYGQ--FDA 424
G ++VG GK+ + + V + + + T L G+ D
Sbjct: 1 GVLLVGPPGTGKS---ELAERLAAALSNR------PVFYVQLT-RDTTEEDLKGRRNIDP 50
Query: 425 VSHEWSDGILAVSYRQFAMSQNENRKWLIFDGPVDAI---WIENMNSVLDDNKKLCLMSG 481
W DG L + R + ++ + ++ S+LD+ + L G
Sbjct: 51 GGASWVDGPLVRAAR---------EGEIAVLDEINRANPDVLNSLLSLLDERRLLLPEGG 101
Query: 482 EIIALAPT-TSLIFE----PQDLEVASPATVSRC 510
E++ AP LI + L SPA SR
Sbjct: 102 ELVKAAPDGFRLIATMNPLDRGLNELSPALRSRF 135
Score = 34.2 bits (79), Expect = 0.24
Identities = 17/127 (13%), Positives = 40/127 (31%), Gaps = 11/127 (8%)
Query: 883 FLLKELS--IEQWQPLIMNFSAQTSANQTQDI--IMSKLDKRRKGVYGPPLGKRCVVFVD 938
L + L+ + + + T+ + I G + + +D
Sbjct: 14 ELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPLVRAAREGEIAVLD 73
Query: 939 DVNMPLKEEFGAQ-PPIEILRQWLDHWMWYDRKDVVAVKLIEIQLMCAMGPPSTG-NTVT 996
++N + + ++ R L + ++V +L+ M P G N ++
Sbjct: 74 EINRANPDVLNSLLSLLDERRLLLP-----EGGELVKAAPDGFRLIATMNPLDRGLNELS 128
Query: 997 PRFSRHF 1003
P F
Sbjct: 129 PALRSRF 135
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 44.8 bits (106), Expect = 8e-04
Identities = 65/321 (20%), Positives = 141/321 (43%), Gaps = 42/321 (13%)
Query: 2097 GLEKLDKAASQIA----VMREEIEYLQPFLMVSAANIKELMVTVEKESAEAAIVAEG--V 2150
GLEK +K + + + +IE L+ L +I++L+ +E+E E + E
Sbjct: 165 GLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEE 224
Query: 2151 KKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEI----LSEAIPIL 2206
+++EE+ E + + + E K+ E + L ++++ E +I L E +L
Sbjct: 225 QEEEELEQEIEALEERLAELEE-EKERLEELKARLLEIESLELEALKIREEELRELERLL 283
Query: 2207 EAAEAALNTLTSNDITVVKTMKSPPDIVKLVMKAVCILKGVKSERVPDAGGQLVEDYWGP 2266
E E + L ++ + + + L+G++ +L E
Sbjct: 284 EELEEKIERLE--------------ELEREIEELEEELEGLR-----ALLEELEELL--- 321
Query: 2267 SKKLLGDIKFLEGLTNFNKDNVPAAVIKRLEDEFLSREDFDPEVVKKSSTAAEGLCKWVI 2326
+KL + LE L K + ++ L +E E +K+ E L K
Sbjct: 322 -EKLKSLEERLEKLE--EKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEK--- 375
Query: 2327 AICKYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKE 2386
+ +K + + +EA+++ +E+L +AL E QE+ ++++L++L+ L+ ++E
Sbjct: 376 ---ELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEE 432
Query: 2387 LKDLQDELDLCVKKKQRAEDL 2407
+K L+++++ K+ +L
Sbjct: 433 IKKLEEQINQLESKELMIAEL 453
Score = 35.5 bits (82), Expect = 0.46
Identities = 68/346 (19%), Positives = 143/346 (41%), Gaps = 29/346 (8%)
Query: 2077 KQLLKVKQE-EVLNGKN-RYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMV 2134
++L ++++E E L + LE+L++ ++ + E +E L+ L + ++EL
Sbjct: 291 ERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEEL-- 348
Query: 2135 TVEKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKS- 2193
E+++ A ++ E +K+ EE + + A K E A ++ A S
Sbjct: 349 -AEEKNELAKLLEERLKELEE----RLEELEKELEKALERLKQLEEAIQELKEELAELSA 403
Query: 2194 ---ECEEILSEAIPILEAAEAALNTLTSNDITVVKTMKSPPDIVKLVMKAVCILKGVKSE 2250
E +E L E LE E L L +K ++ + ++ + L G E
Sbjct: 404 ALEEIQEELEELEKELEELERELEELEEE----IKKLEEQINQLESKELMIAELAGAG-E 458
Query: 2251 RVPDAGGQLVEDYWGPSKKLL-GDIKFLEGLTNFNKDNVPA--------AVIKRLEDEFL 2301
+ P G +L E++ +L +++ LE + K+ ++ LE+E +
Sbjct: 459 KCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELI 518
Query: 2302 SREDFDPEVVKKSSTAAEGLCKWVIAICKYDKVAKIVGPKKEALRQAEEKLQLAMSALHE 2361
+ + + ++ E L + + + K+ + + + E + L E
Sbjct: 519 ELLELEEALKEELEEKLEKLENLLEELE--ELKEKLQLQQLKEELRQLEDRLQELKELLE 576
Query: 2362 KQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAEDL 2407
+ R +E+L++L+ L KK+LK+L++ L + Q E
Sbjct: 577 ELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELS 622
Score = 31.7 bits (72), Expect = 8.0
Identities = 18/81 (22%), Positives = 46/81 (56%), Gaps = 4/81 (4%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLD----AKKKELKDLQDELDL 2396
++E L Q E L+ ++ L E++E+ ++ +L ++++ +++EL++L+ L+
Sbjct: 226 EEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEE 285
Query: 2397 CVKKKQRAEDLIGKLGGEKER 2417
+K +R E+L ++ +E
Sbjct: 286 LEEKIERLEELEREIEELEEE 306
Score = 31.3 bits (71), Expect = 9.4
Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 4/88 (4%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
+ L + EEK++ E +E ++ L+ L+ ++LK L++ L+ K
Sbjct: 279 LERLLEELEEKIERLEELEREIEELEEELEGLRALLEE-LEELLEKLKSLEERLE---KL 334
Query: 2401 KQRAEDLIGKLGGEKERWSSTAKMLNEK 2428
+++ E L +L E + AK+L E+
Sbjct: 335 EEKLEKLESELEELAEEKNELAKLLEER 362
>gnl|CDD|223786 COG0714, COG0714, MoxR-like ATPases [General function prediction
only].
Length = 329
Score = 44.0 bits (104), Expect = 8e-04
Identities = 40/150 (26%), Positives = 57/150 (38%), Gaps = 31/150 (20%)
Query: 6 EGPAGTGKTETTKDLAKAVAKLCIVFNCSDSM---DYIG---------LGKFF---KGLI 50
EGP G GKT + LA+A+ + C+ + D +G F G +
Sbjct: 49 EGPPGVGKTLLARALARALGLPFVRIQCTPDLLPSDLLGTYAYAALLLEPGEFRFVPGPL 108
Query: 51 --ATGAWACFDEFNRIDVEVLSVVAQQILTI--QRGVTLGEEEILFEGTILKLDRTCSVF 106
A DE NR EV + +L +R VT+ T ++L V
Sbjct: 109 FAAVRVILLLDEINRAPPEVQNA----LLEALEERQVTVPGL------TTIRLPPPFIVI 158
Query: 107 ITMNPG-YAGRSELPDNLKSLFRTVAMMVP 135
T NPG Y G LP+ L F + + V
Sbjct: 159 ATQNPGEYEGTYPLPEALLDRF-LLRIYVD 187
Score = 35.5 bits (82), Expect = 0.30
Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 3/51 (5%)
Query: 745 LASAPAIPRDMPVNQIIVTTEETLRNIALMKLLVTHQKPVMFIGPTGTGKS 795
+ + +++V EE I L L + V+ GP G GK+
Sbjct: 10 RVAEILGKIRSELEKVVVGDEEV---IELALLALLAGGHVLLEGPPGVGKT 57
Score = 35.5 bits (82), Expect = 0.30
Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 3/51 (5%)
Query: 1298 LASAPAIPRDMPVNQIIVTTEETLRNIALMKLLVTHQKPVMFIGPTGTGKS 1348
+ + +++V EE I L L + V+ GP G GK+
Sbjct: 10 RVAEILGKIRSELEKVVVGDEEV---IELALLALLAGGHVLLEGPPGVGKT 57
Score = 31.3 bits (71), Expect = 7.5
Identities = 17/67 (25%), Positives = 26/67 (38%), Gaps = 11/67 (16%)
Query: 1127 EGPAGTGKTETTKDLAKAVAKLCIVFNCSDSM---DYIGLGKFFKMVDDAMEAYKKKKRE 1183
EGP G GKT + LA+A+ + C+ + D +G A A + E
Sbjct: 49 EGPPGVGKTLLARALARALGLPFVRIQCTPDLLPSDLLG--------TYAYAALLLEPGE 100
Query: 1184 EWVMDWP 1190
+ P
Sbjct: 101 FRFVPGP 107
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 43.5 bits (103), Expect = 0.002
Identities = 19/95 (20%), Positives = 46/95 (48%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
L++ EEKL+ + E +E+ +Q++L L N + +++ + L++ L ++
Sbjct: 259 TAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQL 318
Query: 2402 QRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTGDV 2436
+ E + +L + + + L EK +L ++
Sbjct: 319 EELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Score = 38.9 bits (91), Expect = 0.040
Identities = 18/104 (17%), Positives = 47/104 (45%), Gaps = 3/104 (2%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
+ E L + E+ + ++ + E+ ++ ++++L+ L A ++ L +L+ EL L ++
Sbjct: 762 EIEELEERLEEAEEELAEAEAEIEE---LEAQIEQLKEELKALREALDELRAELTLLNEE 818
Query: 2401 KQRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTGDVLIAAGVVA 2444
+ + L + L E+ +L+ D+ A +
Sbjct: 819 AANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIE 862
Score = 36.6 bits (85), Expect = 0.24
Identities = 35/157 (22%), Positives = 60/157 (38%), Gaps = 8/157 (5%)
Query: 2067 TSYLELISTFKQLLKVKQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSA 2126
E + ++ LK +EE+ EKL++ +++ + EEIE LQ L A
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALA 294
Query: 2127 ANIKELMVTVEKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAE----GVKKDEEVAN 2182
I L +K+ + +++ E Q ++ +++ E
Sbjct: 295 NEISRL--EQQKQILRERL--ANLERQLEELEAQLEELESKLDELAEELAELEEKLEELK 350
Query: 2183 EQALAAQAMKSECEEILSEAIPILEAAEAALNTLTSN 2219
E+ + +A E E L E LE E L TL S
Sbjct: 351 EELESLEAELEELEAELEELESRLEELEEQLETLRSK 387
Score = 34.6 bits (80), Expect = 0.90
Identities = 22/103 (21%), Positives = 53/103 (51%), Gaps = 4/103 (3%)
Query: 2332 DKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQ 2391
+ I+ ++E + + EEK++ + E ++ ++++L++L+ L+ +KEL++L
Sbjct: 667 KTNSSILERRRE-IEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELS 725
Query: 2392 DELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTG 2434
++ K R E + +L +ER + +K L E ++
Sbjct: 726 RQISALRKDLARLEAEVEQL---EERIAQLSKELTELEAEIEE 765
Score = 34.3 bits (79), Expect = 1.2
Identities = 22/95 (23%), Positives = 45/95 (47%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
+E L E +L+ + L E + + + E+L +L+ L+ K+EL+ L+ EL+ +
Sbjct: 308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAEL 367
Query: 2402 QRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTGDV 2436
+ E + +L + E S L + L ++
Sbjct: 368 EELESRLEELEEQLETLRSKVAQLELQIASLNNEI 402
Score = 33.9 bits (78), Expect = 1.7
Identities = 20/135 (14%), Positives = 43/135 (31%), Gaps = 14/135 (10%)
Query: 2084 QEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEA 2143
+E+ + E+L++A ++A EIE L+ +KE + + + E
Sbjct: 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEA----QIEQLKEELKALREALDEL 808
Query: 2144 AIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAI 2203
A ++ E +++ + + E E +
Sbjct: 809 ----------RAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLA 858
Query: 2204 PILEAAEAALNTLTS 2218
+E E + L S
Sbjct: 859 AEIEELEELIEELES 873
Score = 33.5 bits (77), Expect = 2.2
Identities = 22/100 (22%), Positives = 50/100 (50%), Gaps = 10/100 (10%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTL---DAKKKELKDLQDELDLCVK 2399
E + + E +L+ ++ +E +++ +L++L L ++K+ EL+ +EL
Sbjct: 866 ELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEEL----- 920
Query: 2400 KKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTGDVLIA 2439
+++ L +L G + R + + L+E+ Y LT + A
Sbjct: 921 -REKLAQLELRLEGLEVRIDNLQERLSEE-YSLTLEEAEA 958
Score = 33.1 bits (76), Expect = 2.3
Identities = 20/99 (20%), Positives = 52/99 (52%)
Query: 2330 KYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKD 2389
K D++A+ + +E L + +E+L+ + L E + + ++ +L++L+ L+ + ++
Sbjct: 331 KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQ 390
Query: 2390 LQDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEK 2428
L+ ++ + +R E + +L +ER + L +K
Sbjct: 391 LELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKK 429
Score = 32.0 bits (73), Expect = 5.5
Identities = 26/152 (17%), Positives = 56/152 (36%), Gaps = 2/152 (1%)
Query: 2069 YLELISTFKQLLKVKQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAAN 2128
E + ++ L + E+ E+L+ +IA +E L+ + + +
Sbjct: 794 LKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSED 853
Query: 2129 IKELMVTVEK-ESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALA 2187
I+ L +E+ E + +E E A+ + A +S E + ++ +
Sbjct: 854 IESLAAEIEELEELIEELESELEALLNERASLEEALALL-RSELEELSEELRELESKRSE 912
Query: 2188 AQAMKSECEEILSEAIPILEAAEAALNTLTSN 2219
+ E E L++ LE E ++ L
Sbjct: 913 LRRELEELREKLAQLELRLEGLEVRIDNLQER 944
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 41.6 bits (98), Expect = 0.008
Identities = 55/323 (17%), Positives = 130/323 (40%), Gaps = 33/323 (10%)
Query: 2099 EKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEAAIVAEGVKKDEEVAN 2158
++L++ +++ + EE+E LQ L + I+EL +E+ E + E + + +E
Sbjct: 239 KELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIE 298
Query: 2159 EQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAALNTLTS 2218
E ++ E ++ + E E+ + +E L E +LE E L L
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELE- 357
Query: 2219 NDITVVKTMKSPPDIVKLVMKAVCILKGVKSERVPDAGGQLVEDYWGPSKKLLGDIKFLE 2278
+ + + + +++ + L E + + +L ++ +++ L+
Sbjct: 358 ------EAKEELEEKLSALLEELEELFEALREELAELEAEL--------AEIRNELEELK 403
Query: 2279 GLTNFNKDNVPAAVIKRLEDEFLSREDFDPEVVKKSSTAAEGLCKWVIAICKYDKVAKIV 2338
I+ LE+ L R E +K+ E + + + +++ + +
Sbjct: 404 ------------REIESLEER-LERLSERLEDLKEELKELEAELEELQT--ELEELNEEL 448
Query: 2339 GPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCV 2398
+E L + ++L+ L E QE+ + ++++L L+ LD + E + Q +
Sbjct: 449 EELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLE 508
Query: 2399 KKKQRAEDLIG---KLGGEKERW 2418
+ + G +L KE++
Sbjct: 509 ALESGLPGVYGPVAELIKVKEKY 531
Score = 40.9 bits (96), Expect = 0.010
Identities = 17/73 (23%), Positives = 41/73 (56%)
Query: 2345 LRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRA 2404
L++ ++L+ L +E+ +QE+L++ + ++ K EL++L++EL+ ++
Sbjct: 234 LKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLEL 293
Query: 2405 EDLIGKLGGEKER 2417
++ I +L GE
Sbjct: 294 KEEIEELEGEISL 306
Score = 40.9 bits (96), Expect = 0.011
Identities = 18/89 (20%), Positives = 47/89 (52%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
+E L + E++L+ L E + + ++++L++L+ + ++EL++L+ EL ++
Sbjct: 848 EEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEI 907
Query: 2402 QRAEDLIGKLGGEKERWSSTAKMLNEKYY 2430
++ + + +L + ER L E+
Sbjct: 908 EKLRERLEELEAKLERLEVELPELEEELE 936
Score = 40.1 bits (94), Expect = 0.019
Identities = 20/85 (23%), Positives = 47/85 (55%), Gaps = 3/85 (3%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
+E L +AE++++ S L E +E+ +QE+L +L+ ++ + E+ L++ L+ +
Sbjct: 258 LQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENE 317
Query: 2401 KQRAEDLIGKLGGEKERWSSTAKML 2425
+ E+ + +L KE+ + + L
Sbjct: 318 LEELEERLEEL---KEKIEALKEEL 339
Score = 38.2 bits (89), Expect = 0.070
Identities = 21/87 (24%), Positives = 46/87 (52%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
+E + + EEKL L E +++ ++E+L++L+ + + ELK+L++E + ++
Sbjct: 834 EEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEEL 893
Query: 2402 QRAEDLIGKLGGEKERWSSTAKMLNEK 2428
+ E + +L E E+ + L K
Sbjct: 894 RELESELAELKEEIEKLRERLEELEAK 920
Score = 37.0 bits (86), Expect = 0.17
Identities = 21/102 (20%), Positives = 53/102 (51%), Gaps = 3/102 (2%)
Query: 2332 DKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQ 2391
++ + +E L E+L+ AL E+ + ++ +L +++N L+ K+E++ L+
Sbjct: 354 AELEEAKEELEEKLSALLEELEELFEALREELAE---LEAELAEIRNELEELKREIESLE 410
Query: 2392 DELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQLT 2433
+ L+ ++ + ++ + +L E E + + LNE+ +L
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELE 452
Score = 35.8 bits (83), Expect = 0.44
Identities = 19/82 (23%), Positives = 46/82 (56%)
Query: 2330 KYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKD 2389
+ ++ + + + L E++ + + E +E+ ++EKL +L+ L+ +KEL++
Sbjct: 801 ELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEE 860
Query: 2390 LQDELDLCVKKKQRAEDLIGKL 2411
L++EL+ +K+ ED + +L
Sbjct: 861 LKEELEELEAEKEELEDELKEL 882
Score = 33.5 bits (77), Expect = 1.7
Identities = 29/140 (20%), Positives = 53/140 (37%), Gaps = 6/140 (4%)
Query: 2077 KQLLKVKQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTV 2136
LL+ + EE+ N LE+L + + EE E L L + EL
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEEL---EQLLAELEEAK 360
Query: 2137 EKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECE 2196
E+ + + + E + +E + A+ +AE ++ + E + + +
Sbjct: 361 EELEEKLSALLEEL--EELFEALREELAELEAELAE-IRNELEELKREIESLEERLERLS 417
Query: 2197 EILSEAIPILEAAEAALNTL 2216
E L + L+ EA L L
Sbjct: 418 ERLEDLKEELKELEAELEEL 437
Score = 33.5 bits (77), Expect = 1.8
Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
+ E L + E+ + + AL + E +E+L++ L+ + +EL++ DEL+ +++
Sbjct: 794 ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853
Query: 2401 -KQRAEDLIGKLGGEKERWSSTAKMLNEKYYQL 2432
++ E+L +L + L E +
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEK 886
Score = 32.8 bits (75), Expect = 3.2
Identities = 16/95 (16%), Positives = 38/95 (40%), Gaps = 3/95 (3%)
Query: 2333 KVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQD 2392
+++ + +E + EE + + + ++E+ L+ L+ EL++ ++
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE-LEQLLAELEEAKE 361
Query: 2393 ELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNE 2427
EL+ K E+L +E + L E
Sbjct: 362 ELEE--KLSALLEELEELFEALREELAELEAELAE 394
Score = 32.4 bits (74), Expect = 4.4
Identities = 22/88 (25%), Positives = 46/88 (52%), Gaps = 7/88 (7%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
K L EE+L+ S L E +E+ ++E+L++LQ L+ ++EL+ L++ L
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA------ 775
Query: 2402 QRAEDLIGKLGGEKERWSSTAKMLNEKY 2429
+ ++ I +L +++ + L E+
Sbjct: 776 -KLKEEIEELEEKRQALQEELEELEEEL 802
Score = 31.2 bits (71), Expect = 9.3
Identities = 20/98 (20%), Positives = 44/98 (44%), Gaps = 8/98 (8%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKEL----KDLQDELDLC 2397
K LR E+ L+ L E + Q ++ +L L+ + ++L ++L++EL+
Sbjct: 694 KNELRSLEDLLEELRRQLEELERQLEELKRELAALE----EELEQLQSRLEELEEELEEL 749
Query: 2398 VKKKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTGD 2435
++ + ++ + +L E E L E+ +L
Sbjct: 750 EEELEELQERLEELEEELESLEEALAKLKEEIEELEEK 787
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein TolA;
Provisional.
Length = 387
Score = 40.6 bits (95), Expect = 0.010
Identities = 25/75 (33%), Positives = 33/75 (44%)
Query: 2138 KESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEE 2197
A+ AE KK A ++A A A K+ AE KK E A ++A A K+ E
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEA 223
Query: 2198 ILSEAIPILEAAEAA 2212
+ A EA AA
Sbjct: 224 KAAAAKAAAEAKAAA 238
Score = 36.7 bits (85), Expect = 0.14
Identities = 30/87 (34%), Positives = 35/87 (40%), Gaps = 6/87 (6%)
Query: 2126 AANIKELMVTVEKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQA 2185
A K K AEA A KK A ++A A A K+ AE KK E A
Sbjct: 136 EAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEA-AAK 194
Query: 2186 LAAQAMKSECEEILSEAIPILEAAEAA 2212
AA+A K E +A AAEA
Sbjct: 195 AAAEAKKKAEAEAKKKA-----AAEAK 216
Score = 36.7 bits (85), Expect = 0.14
Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 6/75 (8%)
Query: 2137 EKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECE 2196
+K++ EAA A K + A + AA A K+ AE KK E A ++A AA+A K
Sbjct: 131 QKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKA-AAEAKKKAEA 189
Query: 2197 EILSEAIPILEAAEA 2211
E ++A AAEA
Sbjct: 190 EAAAKA-----AAEA 199
Score = 33.2 bits (76), Expect = 1.8
Identities = 20/80 (25%), Positives = 28/80 (35%)
Query: 2139 ESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEI 2198
A A AE KK E A ++A A K+ AE + A E AA+ +
Sbjct: 189 AEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAE 248
Query: 2199 LSEAIPILEAAEAALNTLTS 2218
+ A + L S
Sbjct: 249 KAAAAKAAAEVDDLFGGLDS 268
Score = 31.7 bits (72), Expect = 5.0
Identities = 30/110 (27%), Positives = 42/110 (38%), Gaps = 12/110 (10%)
Query: 2103 KAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEAAIVAEGVKKDEEVANEQAL 2162
+AA Q + + E E L A K+ K++A AE + A +A
Sbjct: 96 QAAEQERLKQLEKERLA------AQEQKKQAEEAAKQAALKQKQAEEAAA-KAAAAAKAK 148
Query: 2163 AAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAA 2212
A K A KK A ++A A A K+ +EA EA AA
Sbjct: 149 AEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAA-----AEAKKKAEAEAAA 193
>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
activities. AAA - ATPases associated with a variety of
cellular activities. This profile/alignment only detects
a fraction of this vast family. The poorly conserved
N-terminal helix is missing from the alignment.
Length = 148
Score = 37.0 bits (85), Expect = 0.040
Identities = 24/136 (17%), Positives = 48/136 (35%), Gaps = 15/136 (11%)
Query: 7 GPAGTGKTETTKDLAKAVAKL---CIVFNCSDSMDYIGLGKFFKGLIATGAWACFDEFNR 63
GP G+GKT + LA+ + I + D ++ + + A + R
Sbjct: 9 GPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKKASGSGELRLR 68
Query: 64 IDVEVLSVVAQQIL---------TIQRGVTLGEEEILFEGTILKLDRTCSVFITMNPGYA 114
+ + + + +L ++ L E L +LK ++ +V +T N
Sbjct: 69 LALALARKLKPDVLILDEITSLLDAEQEALLLLLEELRLLLLLKSEKNLTVILTTND--- 125
Query: 115 GRSELPDNLKSLFRTV 130
+ P L+ F
Sbjct: 126 EKDLGPALLRRRFDRR 141
Score = 36.2 bits (83), Expect = 0.076
Identities = 24/162 (14%), Positives = 49/162 (30%), Gaps = 27/162 (16%)
Query: 1335 KPVMFIGPTGTGKSCYITVSILFIKNNSRLRSGFQHFL-----LKELSIEQWQPLIMNFS 1389
+ ++ +GP G+GK T + G ++ L+E+ + ++
Sbjct: 3 EVILIVGPPGSGK----TTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKK 58
Query: 1390 AQTSANQTQDIIMSKLDKRRKGVYGPPLGKRCVVFVDDVNMPLKEEFGAQPPIEILRQWL 1449
A S + ++ K + V+ +D++ L E A + + L
Sbjct: 59 ASGSGELRLRLALALARKLKP----------DVLILDEITSLLDAEQEALLLLLEELRLL 108
Query: 1450 DHWMWYDRKDVVAVKLIEIQLMCAMGPPSTGNTVTPRFSRHF 1491
V+ E L + RF R
Sbjct: 109 LLLKSEKNLTVILTTNDEKDLG--------PALLRRRFDRRI 142
Score = 32.3 bits (73), Expect = 1.3
Identities = 12/58 (20%), Positives = 23/58 (39%), Gaps = 3/58 (5%)
Query: 1128 GPAGTGKTETTKDLAKAVAKL---CIVFNCSDSMDYIGLGKFFKMVDDAMEAYKKKKR 1182
GP G+GKT + LA+ + I + D ++ + +V + + R
Sbjct: 9 GPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKKASGSGELR 66
Score = 31.6 bits (71), Expect = 2.4
Identities = 22/150 (14%), Positives = 51/150 (34%), Gaps = 25/150 (16%)
Query: 782 KPVMFIGPTGTGKSCYITVSILFIKNNSRLRSGFQHFL-----LKELSIEQWQPLIMNFS 836
+ ++ +GP G+GK T + G ++ L+E+ + ++
Sbjct: 3 EVILIVGPPGSGK----TTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKK 58
Query: 837 AQTSANQTQDIIMSKLDKRRKE-----EISTIVDIGNEENNNSRLRSGFQHFLLKELSIE 891
A S + ++ K + + EI++++D E L E ++
Sbjct: 59 ASGSGELRLRLALALARKLKPDVLILDEITSLLDAEQEALLLLLEELRLLLLLKSEKNL- 117
Query: 892 QWQPLIMNFSAQTSANQTQDIIMSKLDKRR 921
+I+ + + L +RR
Sbjct: 118 ---TVIL-------TTNDEKDLGPALLRRR 137
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set of
eukaryotic tropomyosins. Within the yeast Tmp1 and Tmp2,
biochemical and sequence analyses indicate that Tpm2p
spans four actin monomers along a filament, whereas Tpmlp
spans five. Despite its shorter length, Tpm2p can compete
with Tpm1p for binding to F-actin. Over-expression of
Tpm2p in vivo alters the axial budding of haploids to a
bipolar pattern, and this can be partially suppressed by
co-over-expression of Tpm1p. This suggests distinct
functions for the two tropomyosins, and indicates that
the ratio between them is important for correct
morphogenesis. The family also contains higher eukaryote
Tmp3 members.
Length = 143
Score = 36.4 bits (85), Expect = 0.046
Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 3/85 (3%)
Query: 2347 QAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAED 2406
+ EEKL+ EK+++ +Q+K Q+L+ ++ +++LK+ +++L+ K AE
Sbjct: 18 ELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEESEKLATNAEA 77
Query: 2407 LIGK---LGGEKERWSSTAKMLNEK 2428
L + L E E K EK
Sbjct: 78 LTRRIQLLEEELEESEKRLKETTEK 102
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 37.8 bits (89), Expect = 0.077
Identities = 17/65 (26%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
++ L++ E++L L K E +E+L+K + L+ K++EL+ ++EL+ + +
Sbjct: 80 RRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE-LIE 138
Query: 2401 KQRAE 2405
+Q E
Sbjct: 139 EQLQE 143
Score = 33.6 bits (78), Expect = 1.7
Identities = 21/82 (25%), Positives = 41/82 (50%), Gaps = 14/82 (17%)
Query: 2341 KKEALRQAEEK-LQLAMSALHEKQEQNRIVQEKLQKLQ----------NTLDAKKKELKD 2389
KKEAL +A+E+ +L E +E+ +Q+ ++L L+ +++EL+
Sbjct: 55 KKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEK 114
Query: 2390 LQDELDLCVKKKQRAEDLIGKL 2411
+ EL+ +K+Q E +L
Sbjct: 115 KEKELE---QKQQELEKKEEEL 133
Score = 32.8 bits (76), Expect = 2.9
Identities = 24/109 (22%), Positives = 48/109 (44%), Gaps = 20/109 (18%)
Query: 2326 IAICKYDKVAKIVGPKK--EALRQAEEKLQLA-MSALHEKQEQNRIVQEKLQKLQNTLDA 2382
I + V K + K EA +A+ L+ A A K+E +E++ KL+N +
Sbjct: 18 AVIGYF--VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK 75
Query: 2383 ----KKKELKDLQ-----------DELDLCVKKKQRAEDLIGKLGGEKE 2416
++ EL+ L+ +L+L K+++ E +L +++
Sbjct: 76 ELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQ 124
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 38.0 bits (89), Expect = 0.097
Identities = 17/54 (31%), Positives = 31/54 (57%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELD 2395
++ L QAE++ A L + Q Q + L L+++ DAK++ L++L+ EL
Sbjct: 991 RQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQ 1044
Score = 31.9 bits (73), Expect = 6.6
Identities = 14/54 (25%), Positives = 25/54 (46%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDL 2396
L + EE+L+ + E EQ + + + + +D K +L D Q LD+
Sbjct: 355 ADLEELEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKSQLADYQQALDV 408
Score = 31.5 bits (72), Expect = 9.5
Identities = 27/99 (27%), Positives = 40/99 (40%), Gaps = 29/99 (29%)
Query: 2331 YDKVAKIVGP---------KKEALRQAEEKLQLAMSA-------------LHEKQEQNRI 2368
Y V KI G +E LR+ E+ LA L ++Q R+
Sbjct: 479 YQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQLQQLRMRLSELEQRLRQQQRAERL 538
Query: 2369 VQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAEDL 2407
+ E ++L LD + EL+ LQ+EL+ R E L
Sbjct: 539 LAEFCKRLGKNLDD-EDELEQLQEELE------ARLESL 570
>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
This is the conserved N-terminal 350 residues of a family
of proteins of unknown function possibly containing a
coiled-coil domain.
Length = 308
Score = 36.9 bits (85), Expect = 0.11
Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDE-LDLCVK 2399
K LRQ EK+QL + E++ Q + K+ KL+ AK+ L+ L+ E +DL
Sbjct: 114 KLTQLRQ--EKIQLEQTLEQEQEYQVNKLMRKIDKLEADTLAKQTSLEQLRREKVDLENT 171
Query: 2400 KKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQ 2431
+Q E L+ +L ++ + +ML EK Q
Sbjct: 172 LEQEQEALVNRLWKRMDKLEAEKRMLQEKLDQ 203
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the autophagic
process and in the vacuolar protein sorting pathway.
Apg14p may be a component specifically required for the
function of Apg6/Vps30p through the autophagic pathway.
There are 17 auto-phagosomal component proteins which are
categorized into six functional units, one of which is
the AS-PI3K complex (Vps30/Atg6 and Atg14). The AS-PI3K
complex and the Atg2-Atg18 complex are essential for
nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol 3-phosphate
(PtdIns(3)P) at the pre-autophagosomal structure (PAS).
The localisation of this complex at the PAS is controlled
by Atg14. Autophagy mediates the cellular response to
nutrient deprivation, protein aggregation, and pathogen
invasion in humans, and malfunction of autophagy has been
implicated in multiple human diseases including cancer.
This effect seems to be mediated through direct
interaction of the human Atg14 with Beclin 1 in the human
phosphatidylinositol 3-kinase class III complex.
Length = 307
Score = 37.0 bits (86), Expect = 0.11
Identities = 16/97 (16%), Positives = 42/97 (43%), Gaps = 7/97 (7%)
Query: 2331 YDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDL 2390
KV + + + L + K+E+ ++ ++ +L+ ++ K++ +++L
Sbjct: 40 KQKVEEALEGATNEDGK--LAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEEL 97
Query: 2391 QDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNE 2427
+ L +++ +L EK R S K+ +E
Sbjct: 98 KRALA---QRRSDLSSASYQL--EKRRASQLEKLQDE 129
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 36.9 bits (85), Expect = 0.13
Identities = 36/142 (25%), Positives = 51/142 (35%), Gaps = 18/142 (12%)
Query: 2077 KQLLKVKQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTV 2136
QL + +QEE E+ KA + A E L+ +AA K+
Sbjct: 127 AQLEQKQQEE-----QARKAAAEQKKKAEAAKAKAAAEAAKLK-----AAAEAKKKAEEA 176
Query: 2137 EKESAEAAIVAEGV--KKDEEV----ANEQALAAQAMKSVAEGVKKDEEVANEQALAAQA 2190
K + EA AE KK E A E+A A K+ AE + A E+A A +
Sbjct: 177 AKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAA--AEEKAAAEKK 234
Query: 2191 MKSECEEILSEAIPILEAAEAA 2212
+ + A A A
Sbjct: 235 KAAAKAKADKAAAAAKAAERKA 256
Score = 35.7 bits (82), Expect = 0.30
Identities = 29/116 (25%), Positives = 37/116 (31%), Gaps = 7/116 (6%)
Query: 2098 LEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEA-AIVAEGVKKDEEV 2156
LEK A + EE E K+ K +AE K
Sbjct: 106 LEKERLKAQEQQKQAEEAEKQA------QLEQKQQEEQARKAAAEQKKKAEAAKAKAAAE 159
Query: 2157 ANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAA 2212
A + AA+A K E K EE + AA K+E E + EA A
Sbjct: 160 AAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKA 215
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 36.2 bits (84), Expect = 0.14
Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 6/92 (6%)
Query: 2328 ICKYDKVAKIVGPKK---EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKK 2384
I + ++ IV K E ++ ++ L+ +AL +K E +Q +L+ N+L+++K
Sbjct: 133 ISRVTAISVIVDADKKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQK 192
Query: 2385 KELKDLQDELDLCVKKKQRAEDLIGKLGGEKE 2416
E L L K+ A L +K
Sbjct: 193 AEKNALIAALAA---KEASALGEKAALEEQKA 221
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and C-terminal
domains of this protein are well conserved, but the
central hinge region is skewed in composition and highly
divergent [Cellular processes, Cell division, DNA
metabolism, Chromosome-associated proteins].
Length = 1164
Score = 37.4 bits (87), Expect = 0.15
Identities = 14/95 (14%), Positives = 41/95 (43%), Gaps = 6/95 (6%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
K+ L AEE+L + + + + ++ ++++ + D +E +L++EL+
Sbjct: 313 KERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELED---L 369
Query: 2401 KQRAEDLIGKLGGEKERWSSTAK---MLNEKYYQL 2432
+ E++ + ++ + L + +L
Sbjct: 370 RAELEEVDKEFAETRDELKDYREKLEKLKREINEL 404
Score = 34.3 bits (79), Expect = 1.3
Identities = 13/71 (18%), Positives = 38/71 (53%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
+ L A K+ + + +++ ++E+L++L+ L + ++E+++++ EL +
Sbjct: 707 LSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR 766
Query: 2401 KQRAEDLIGKL 2411
+ E+ + KL
Sbjct: 767 IEELEEDLHKL 777
Score = 33.9 bits (78), Expect = 1.6
Identities = 15/64 (23%), Positives = 34/64 (53%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
E + +E+L+ + L E ++ +++L+ + L+ K+E+ +L+ ELD ++
Sbjct: 356 TEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEEL 415
Query: 2402 QRAE 2405
QR
Sbjct: 416 QRLS 419
Score = 33.5 bits (77), Expect = 1.9
Identities = 59/330 (17%), Positives = 121/330 (36%), Gaps = 40/330 (12%)
Query: 2099 EKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEAAIVAEGVKKDEE--- 2155
K +KA ++ + E IE L L++ ++ + + +E +A +K+ E
Sbjct: 170 RKKEKALEELEEVEENIERLD--LIID--EKRQQLERLRREREKAERYQALLKEKREYEG 225
Query: 2156 ---VANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAA 2212
+ ++AL Q + +EE+ ++ EEI LE
Sbjct: 226 YELLKEKEALERQKEAIERQLASLEEELEKLTEEI-SELEKRLEEIEQL----LEELNKK 280
Query: 2213 LNTLTSNDITVVKTMKSPPDIVKL---VMKAVCILKGVKSERVPDAGGQLVEDYWGPSKK 2269
+ L + VK I +L + + K + DA +L + K
Sbjct: 281 IKDLGEEEQLRVKE-----KIGELEAEIASLERSIA-EKERELEDAEERLAKLE-AEIDK 333
Query: 2270 LLGDIKFLEG-LTNFNKDNVPAAVIKRLEDEFLSREDFDPEVVKKSSTAAEGLCKWVIAI 2328
LL +I+ LE + K + + + ED E+ + AE +
Sbjct: 334 LLAEIEELEREIEEERKRR--DKLTEEYAELKEELEDLRAELEEVDKEFAE----TRDEL 387
Query: 2329 CKYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELK 2388
Y + + + + L++ ++LQ + L E+ + ++ AK EL+
Sbjct: 388 KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLN---AAIAGIE----AKINELE 440
Query: 2389 DLQDELDLCVKKK-QRAEDLIGKLGGEKER 2417
+ +++ L +KK+ + E L L ++
Sbjct: 441 EEKEDKALEIKKQEWKLEQLAADLSKYEQE 470
Score = 32.3 bits (74), Expect = 4.0
Identities = 20/105 (19%), Positives = 48/105 (45%), Gaps = 2/105 (1%)
Query: 2325 VIAICKYDKV-AKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAK 2383
+ DK+ A+I ++E + + + +L E +E+ ++ +L+++
Sbjct: 325 AKLEAEIDKLLAEIEELEREIEEERKRRDKL-TEEYAELKEELEDLRAELEEVDKEFAET 383
Query: 2384 KKELKDLQDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEK 2428
+ ELKD +++L+ ++ + + +L E +R S LN
Sbjct: 384 RDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAA 428
Score = 32.0 bits (73), Expect = 6.3
Identities = 30/144 (20%), Positives = 58/144 (40%), Gaps = 13/144 (9%)
Query: 2296 LEDEFLSREDFDPEVVKKSSTAAEGLCKWVIAICKYDKVAKIVGPKKEALRQAEEKLQLA 2355
LE LS PE+ + S E + + + + ++ + +KE L + ++LQ
Sbjct: 784 LEAR-LSHSRI-PEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQ 841
Query: 2356 MSALHEKQEQNRI-----------VQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRA 2404
L E+ + ++E+L++L+ L + L DL+ E D + +
Sbjct: 842 RIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLREL 901
Query: 2405 EDLIGKLGGEKERWSSTAKMLNEK 2428
E I +L + E+ L K
Sbjct: 902 ERKIEELEAQIEKKRKRLSELKAK 925
Score = 31.6 bits (72), Expect = 7.8
Identities = 18/70 (25%), Positives = 29/70 (41%), Gaps = 4/70 (5%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQK----LQNTLDAKKKELKDLQDELDL 2396
K L + +E L + K EQ K ++ L+ D +KEL LQ EL
Sbjct: 435 KINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAE 494
Query: 2397 CVKKKQRAED 2406
+ + +E+
Sbjct: 495 AEAQARASEE 504
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 37.3 bits (86), Expect = 0.16
Identities = 23/96 (23%), Positives = 37/96 (38%), Gaps = 13/96 (13%)
Query: 7 GPAGTGKTETTKDLAKAVAKLCIVFNC---SDSMDYIGLGKFFKGLIATGAWACFDEFNR 63
G G GKT + LA + +FNC +D MD LG++ +
Sbjct: 1857 GDTGVGKTSLLRFLASIFGQEMTLFNCNSDTDVMDL--LGQYEQMDNGRFESC------- 1907
Query: 64 IDVEVLSVVAQQILTIQRGVTLGEEEILFEGTILKL 99
D EV++ V + + + V E+ I +L
Sbjct: 1908 -DSEVVTSVKEGKMVVFAKVEFNEKSIFDRLNLLFE 1942
Score = 34.2 bits (78), Expect = 1.4
Identities = 39/145 (26%), Positives = 50/145 (34%), Gaps = 24/145 (16%)
Query: 6 EGPAGTGKTETTKDLAKAVAKLCIVFNCSDSMDYIGL----------GKF------FKGL 49
EG G GKT LA+ K I N S+ D L G+F F
Sbjct: 1549 EGSPGVGKTSLITALARKTGKKLIRINLSEQTDLCDLFGSDLPVEEGGEFRWMDAPFLHA 1608
Query: 50 IATGAWACFDEFNRIDVEVLSVVAQQILTIQRGVTLGEEEILFEGTILKLDRTCSVFITM 109
+ G W DE N VL + L +R + E + F + VF
Sbjct: 1609 MRDGGWVLLDEINLASQSVLEGL-NACLDHRREAYIPELDKTF-----DVHPNFRVFAAQ 1662
Query: 110 NPGYA--GRSELPDNLKSLFRTVAM 132
NP GR LP + + F V M
Sbjct: 1663 NPQDQGGGRKGLPKSFLNRFSVVKM 1687
Score = 33.0 bits (75), Expect = 2.8
Identities = 27/137 (19%), Positives = 46/137 (33%), Gaps = 19/137 (13%)
Query: 1759 PPENSFSKT-DYTILNK--AVREVCEKANLQVTPFFLEKIQQIYEMMI----VRWGFMIV 1811
P FS + + VR + L + L Q+ E ++ + W ++V
Sbjct: 1797 PRTIGFSLSSQCFKVGHSVTVRMKERRPRLDDSFVLLHSQLQVLESVMRCINMNWPLILV 1856
Query: 1812 GLSFAGKTCAYRMLSEALQLIEEWGELGEHKVEIIVINPKSITMGQL--YGQFDAVSHEW 1869
G + GKT R L+ ++ + N + M L Y Q D E
Sbjct: 1857 GDTGVGKTSLLRFLASIFG----------QEMTLFNCNSDTDVMDLLGQYEQMDNGRFES 1906
Query: 1870 SDGILAVSYRQFAMSQN 1886
D + S ++ M
Sbjct: 1907 CDSEVVTSVKEGKMVVF 1923
Score = 32.3 bits (73), Expect = 4.4
Identities = 35/139 (25%), Positives = 54/139 (38%), Gaps = 23/139 (16%)
Query: 6 EGPAGTGKTETTKDLAKAVAKLCIVFNC---SDSMDYIGL------GKF-FK-----GLI 50
+GP +GKT LA+ + N +D +YIG G FK +
Sbjct: 894 QGPTSSGKTSMILYLARETGHKFVRINNHEHTDLQEYIGTYVTDDDGSLSFKEGVLVEAL 953
Query: 51 ATGAWACFDEFNRIDVEVLSVVAQQILTIQRGVTLGEEEILFEGTILKLDRTCSVFITMN 110
G W DE N +VL + ++L R + + E + + + +F T N
Sbjct: 954 RRGYWIVLDELNLAPTDVLEAL-NRLLDDNRELFIPETQEV-----VVPHPNFRLFATQN 1007
Query: 111 P--GYAGRSELPDNLKSLF 127
P GY GR L ++ F
Sbjct: 1008 PPGGYGGRKGLSRAFRNRF 1026
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner membrane
complex of itself with TolQ and TolR to the outer
membrane complex of TolB and OprL (also called Pal). Most
of the length of the protein consists of low-complexity
sequence that may differ in both length and composition
from one species to another, complicating efforts to
discriminate TolA (the most divergent gene in the tol-pal
system) from paralogs such as TonB. Selection of members
of the seed alignment and criteria for setting scoring
cutoffs are based largely conserved operon struction.
//The Tol-Pal complex is required for maintaining outer
membrane integrity. Also involved in transport (uptake)
of colicins and filamentous DNA, and implicated in
pathogenesis. Transport is energized by the proton motive
force. TolA is an inner membrane protein that interacts
with periplasmic TolB and with outer membrane porins
ompC, phoE and lamB [Transport and binding proteins,
Other, Cellular processes, Pathogenesis].
Length = 346
Score = 36.4 bits (84), Expect = 0.19
Identities = 29/76 (38%), Positives = 37/76 (48%), Gaps = 1/76 (1%)
Query: 2137 EKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECE 2196
+ AE E K+ EE A +A AA+A K AE KK E A +A A K+E
Sbjct: 136 AEAEAEKKAKEEAKKQAEEEAKAKA-AAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEA 194
Query: 2197 EILSEAIPILEAAEAA 2212
+ +EA AAEAA
Sbjct: 195 KAKAEAAKAKAAAEAA 210
Score = 35.6 bits (82), Expect = 0.36
Identities = 25/76 (32%), Positives = 33/76 (43%), Gaps = 7/76 (9%)
Query: 2137 EKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECE 2196
+ +AA AE K E +QA A+A K AE K E E+ +A K E
Sbjct: 97 RAAAEKAAKQAEQAAKQAEEKQKQAEEAKA-KQAAEAKAK-AEAEAEKKAKEEAKKQAEE 154
Query: 2197 EILSEAIPILEAAEAA 2212
E ++A AAEA
Sbjct: 155 EAKAKA-----AAEAK 165
Score = 35.2 bits (81), Expect = 0.46
Identities = 17/60 (28%), Positives = 25/60 (41%)
Query: 2137 EKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECE 2196
+K++AEA AE K + A +A A +A K A +A A A + E
Sbjct: 165 KKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAE 224
Score = 35.2 bits (81), Expect = 0.48
Identities = 29/85 (34%), Positives = 36/85 (42%), Gaps = 7/85 (8%)
Query: 2138 KESAEAAI-VAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECE 2196
K +AEA AE KK E A +A A K+ AE K E A +A A A K+E E
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKA--KAKAEEAKAKAEAAKAKAAAEAAAKAEAE 216
Query: 2197 EILSEAIPILEAAEAALNTLTSNDI 2221
+ A AE + DI
Sbjct: 217 AAAAAAA----EAERKADEAELGDI 237
Score = 33.7 bits (77), Expect = 1.2
Identities = 28/114 (24%), Positives = 41/114 (35%), Gaps = 6/114 (5%)
Query: 2100 KLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEA-AIVAEGVKKDEEVAN 2158
+ +AA Q E A + +K++ EA A A K E
Sbjct: 81 EKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEA 140
Query: 2159 EQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAA 2212
E+ +A K AE K + A + AA+A K E ++A EA A
Sbjct: 141 EKKAKEEAKKQ-AEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA----EAKAKA 189
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome partitioning].
Length = 1480
Score = 36.8 bits (85), Expect = 0.23
Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDEL-DLCVKK 2400
++ L QAE + A L + Q Q + L L+++ D KK+ L +LQ EL D+ V+
Sbjct: 990 RQRLEQAEAERTRAREQLRQHQAQLSQYNQVLASLKSSYDTKKELLNELQQELQDIGVRA 1049
Query: 2401 KQRAED 2406
AE+
Sbjct: 1050 DSGAEE 1055
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are RNase
Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in turnover
of yitJ riboswitch [Transcription, Degradation of RNA].
Length = 514
Score = 36.4 bits (85), Expect = 0.23
Identities = 18/65 (27%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
++ L++ E +L L K E +E L+K + L K+K L + ++EL+ +
Sbjct: 74 RRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELI-A 132
Query: 2401 KQRAE 2405
+QR E
Sbjct: 133 EQREE 137
Score = 32.6 bits (75), Expect = 3.4
Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 21/98 (21%)
Query: 2341 KKEALRQAEEKLQLAMSALH---------EKQEQNRIVQ--EKLQKLQNTLDAKK----- 2384
KKEAL +A+E++ + L ++ + R++Q E L + +LD K+
Sbjct: 49 KKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEK 108
Query: 2385 --KELKDLQDELDLCVKKKQRAEDLIGKLGGEKERWSS 2420
KEL + + LD +K++ E+LI + E ER S
Sbjct: 109 KEKELSNKEKNLD---EKEEELEELIAEQREELERISG 143
>gnl|CDD|182116 PRK09856, PRK09856, fructoselysine 3-epimerase; Provisional.
Length = 275
Score = 35.6 bits (82), Expect = 0.27
Identities = 27/128 (21%), Positives = 46/128 (35%), Gaps = 12/128 (9%)
Query: 116 RSELPDNLKSLFRTVAMMVPDYALISEIIVKKLAPYKQLYDNCIEYQEKHKMWAESI--- 172
R E D +K M Y LIS L P ++ E + +AE+I
Sbjct: 85 RRESLDMIKLAMDMAKEMNAGYTLISAAHAGYLTPPNVIWGRLAENLSELCEYAENIGMD 144
Query: 173 --VGTFDPEQIEEETNEFYKNIYKLKLQFSHLPGPYGIASKVLDQVQEFKEHMPIIQTLG 230
+ P E+N N + + +P P + ++D + + P++
Sbjct: 145 LILEPLTP----YESNVV-CNANDVLHALALVPSPRLFS--MVDICAPYVQAEPVMSYFD 197
Query: 231 NPGLQLRH 238
G +LRH
Sbjct: 198 KLGDKLRH 205
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile protein
found in all eukaryote cell types. This family consists
of the coiled-coil myosin heavy chain tail region. The
coiled-coil is composed of the tail from two molecules of
myosin. These can then assemble into the macromolecular
thick filament. The coiled-coil region provides the
structural backbone the thick filament.
Length = 859
Score = 36.2 bits (84), Expect = 0.28
Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 5/75 (6%)
Query: 2343 EALRQAEEK---LQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVK 2399
AL E K Q+ +S + + E R + EK ++ +NT ++ ++ LQ L+ K
Sbjct: 482 AALELEESKVLRAQVELSQI--RSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAK 539
Query: 2400 KKQRAEDLIGKLGGE 2414
K A L KL G+
Sbjct: 540 GKAEASRLKKKLEGD 554
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 35.4 bits (82), Expect = 0.29
Identities = 24/94 (25%), Positives = 49/94 (52%), Gaps = 6/94 (6%)
Query: 2341 KKEALRQAEEKLQLAMSALHE--KQEQNRI--VQEKLQKLQNTLDAKK--KELKDLQDEL 2394
+ EAL +A E L++ + L Q ++ I ++E++++ + L A K +EL+ L E+
Sbjct: 39 ELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEI 98
Query: 2395 DLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEK 2428
+ ++ ED + +L E E+ + L E+
Sbjct: 99 QIAKERINSLEDELAELMEEIEKLEKEIEDLKER 132
Score = 33.9 bits (78), Expect = 0.88
Identities = 19/96 (19%), Positives = 49/96 (51%), Gaps = 5/96 (5%)
Query: 2326 IAICKYDK----VAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLD 2381
+AI K D + + ++AL++A+ +L+ AL + + ++ ++ +L++ +
Sbjct: 10 LAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQ 69
Query: 2382 AKKKELKDLQDELDLCVKKKQRAEDLIGKLGGEKER 2417
++ +K +++L VK ++ L ++ KER
Sbjct: 70 EIRERIKRAEEKLS-AVKDERELRALNIEIQIAKER 104
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 36.3 bits (83), Expect = 0.32
Identities = 42/136 (30%), Positives = 55/136 (40%), Gaps = 16/136 (11%)
Query: 2083 KQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAE 2142
K EE N + ++ A + A ++ E A EL EK+ A+
Sbjct: 1244 KAEEERNNEEIRKFEEARMAHFARRQAAIKAE----------EARKADELKKAEEKKKAD 1293
Query: 2143 AAIVAEGVKKDEEV---ANEQALAAQAMKSVAEGVKKDEEV---ANEQALAAQAMKSECE 2196
A AE KK +E A E A +A K E KK + A E AA+A K+E E
Sbjct: 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
Query: 2197 EILSEAIPILEAAEAA 2212
EA E AEAA
Sbjct: 1354 AAADEAEAAEEKAEAA 1369
Score = 36.3 bits (83), Expect = 0.33
Identities = 59/271 (21%), Positives = 104/271 (38%), Gaps = 16/271 (5%)
Query: 2131 ELMVTVEKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQA 2190
E E + A+ A AE KK +E+ + L K AE KK EE N A+
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585
Query: 2191 MKSECEEILSEAIPILEAAEAALNTLTSNDITVVKTMKSPPDIVKLVMKAVCILKGVKSE 2250
K + EE E + L E + + K + K V LK ++E
Sbjct: 1586 AK-KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644
Query: 2251 RVPDAGGQLVEDYWGP------SKKLLGDIKFLEGLTNFNKDNVPAAVIKRLEDEFLSRE 2304
A + +KK D K E +D ++ E L +E
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED-------EKKAAEALKKE 1697
Query: 2305 DFDPEVVKKSSTAAEGLCKWVIAICKYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQE 2364
+ + ++ K + K ++ KI +EA ++AEE + A A +++E
Sbjct: 1698 AEEAKKAEELKKKEAEEKKKAEELKKAEEENKI--KAEEAKKEAEEDKKKAEEAKKDEEE 1755
Query: 2365 QNRIVQEKLQKLQNTLDAKKKELKDLQDELD 2395
+ +I K ++ + + +K++ +++ELD
Sbjct: 1756 KKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786
Score = 34.3 bits (78), Expect = 1.1
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 4/129 (3%)
Query: 2083 KQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAE 2142
K EE K + DK ++ A EE E + A + ++K+ AE
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713
Query: 2143 AAIVAEGVKKDEEV----ANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEI 2198
AE +KK EE A E A+ K AE KKDEE + A + + + EEI
Sbjct: 1714 EKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773
Query: 2199 LSEAIPILE 2207
E ++E
Sbjct: 1774 RKEKEAVIE 1782
Score = 34.3 bits (78), Expect = 1.2
Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 15/125 (12%)
Query: 2077 KQLLKVKQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTV 2136
K+ + + EEV+ EK KA + E ++ + A K+ + +
Sbjct: 1587 KKAEEARIEEVMKLYEE-----EKKMKAEE---AKKAEEAKIKAEELKKAEEEKKKVEQL 1638
Query: 2137 EKESAEAAIVAEGVKKDEEV----ANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMK 2192
+K+ AE AE +KK EE A E+A A+ K AE KK EE ++ AA+A+K
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE---DEKKAAEALK 1695
Query: 2193 SECEE 2197
E EE
Sbjct: 1696 KEAEE 1700
Score = 32.8 bits (74), Expect = 3.1
Identities = 73/303 (24%), Positives = 111/303 (36%), Gaps = 54/303 (17%)
Query: 2130 KELMVTVEKESAEAAIVAEGVKKDEEVAN-EQALAAQAMKSVAEGVKKDEEVANEQALAA 2188
+EL + AEAA AE +K EE E A A+A+K E KKD E
Sbjct: 1191 EELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEE-AKKDAE--------- 1240
Query: 2189 QAMKSECEEILSEAIPILEAAEAALNTLTSNDITVVKTMKSPPDIVKLVMKAVCILKGVK 2248
+A K+E EE +E I E A A I + K+ D +K K
Sbjct: 1241 EAKKAE-EERNNEEIRKFEEARMAHFARRQAAIKAEEARKA--DELK------------K 1285
Query: 2249 SERVPDAGGQLVEDYWGPSKKLLGDIKFLEGLTNFNKDNVPAAVIKRLEDEFLSREDFDP 2308
+E A K + K + ++ A K+ +E + D
Sbjct: 1286 AEEKKKA----------DEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKAD--- 1332
Query: 2309 EVVKKSSTAAEGLCKWVIAICKYDKVAKI-VGPKKEALRQAEEKLQLAMSALHEKQEQNR 2367
KK+ A K + AK + AEEK + A E +++
Sbjct: 1333 AAKKKAEEAK-----------KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
Query: 2368 IVQEKLQKLQNTLDAKKK--ELKDLQDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKML 2425
++K ++ + +AKKK E K DEL K++A++ K EK++ K
Sbjct: 1382 AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA-KKKAEEKKKADEAKKKA 1440
Query: 2426 NEK 2428
E
Sbjct: 1441 EEA 1443
Score = 32.4 bits (73), Expect = 4.8
Identities = 32/124 (25%), Positives = 50/124 (40%), Gaps = 12/124 (9%)
Query: 2083 KQEEVLNGKNRYTVGLEKLDKAASQIAVMREEI-----EYLQPFLMVSAANIKELMVTVE 2137
K EE+ + + K + +A+ + E E +M K++
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612
Query: 2138 KESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQAL----AAQAMKS 2193
K++ EA I AE +KK EE E+ Q K AE KK EE+ + AA+ K
Sbjct: 1613 KKAEEAKIKAEELKKAEE---EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669
Query: 2194 ECEE 2197
E+
Sbjct: 1670 AEED 1673
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation, interacts
with SMC proteins [Cell division and chromosome
partitioning].
Length = 622
Score = 35.7 bits (82), Expect = 0.40
Identities = 27/110 (24%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 2330 KYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKD 2389
K + K + K AL+ K + ++A+ +K ++ KL+KL++ ++ K++E+K
Sbjct: 292 KISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEW---PGKLEKLKSEIELKEEEIKA 348
Query: 2390 LQ---DELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTGDV 2436
LQ DEL ++K+ + + + E+E+ + +N + +LT V
Sbjct: 349 LQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSV 398
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 35.6 bits (82), Expect = 0.41
Identities = 43/268 (16%), Positives = 88/268 (32%), Gaps = 42/268 (15%)
Query: 2010 WPPDALLAVATRFLNEVELSEAERQISIDMCQNF--HVSTQNLSD-EFLVKTSRH-VYVT 2065
A ++ L+E E +++ + + L++ + ++ V
Sbjct: 253 EQQPAGKTAEQTREVKIILAETEAEVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQ 312
Query: 2066 PTSYLELISTFKQLLKVKQEEVLNGKNRYTVGLEKLDKAASQIAVMREEIEYLQPFLMVS 2125
LE L++ ++E L + R +A + +
Sbjct: 313 HAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQEE---------------AKA 357
Query: 2126 AANIKELMVTVEKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQA 2185
AANI E + + + E A E ++ E+ A A A + V E+A
Sbjct: 358 AANIAEAIGAQAEAAVETARETEEAERAEQAALVAAAEAAEQEQV--------EIAVRAE 409
Query: 2186 LAAQAMKSECEEILSEAIPILEAAEAALNTL---------TSNDITVVK---TMKSPPDI 2233
A +++ EI +EA I E +A + +++ P++
Sbjct: 410 AAKAEAEAQAAEIKAEAEAIREKGKAEAEAKRALAEAIQVLGDAAAAELFKALVQALPEV 469
Query: 2234 VKLVMKAVCILKGVKSERVPDAGGQLVE 2261
+ + +K + SE+V GG
Sbjct: 470 AEEAAQP---MKNIDSEKVRVIGGANGG 494
>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57. The
CLD or centrosome localisation domain of Cep57 is found
at the N-terminus, and lies approximately between
residues 58 and 239. This region lies within the first
alpha-helical coiled-coil segment of Cep57, and localises
to the centrosome internally to gamma-tubulin, suggesting
that it is either on both centrioles or on a centromatrix
component. This N-terminal region can also multimerise
with the N-terminus of other Cep57 molecules. The
C-terminal part, Family Cep57_MT_bd, pfam06657, is the
microtubule-binding region of Cep57.
Length = 178
Score = 34.0 bits (78), Expect = 0.55
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 2348 AEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKK 2384
AE K+Q L E++ Q ++VQ+K +LQ L+ +
Sbjct: 139 AETKIQQLEEKLQEEEHQRKLVQDKAAELQTGLETNR 175
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
chromosome partitioning].
Length = 420
Score = 34.7 bits (80), Expect = 0.68
Identities = 32/137 (23%), Positives = 50/137 (36%), Gaps = 21/137 (15%)
Query: 2080 LKVKQEEVLNGKNRYTVGL-----------EKLDKAASQIAVMREEIEY----LQPFLMV 2124
L V E+ R + + L Q+A +R EI L L
Sbjct: 135 LLVSPEDAQR-SVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSE 193
Query: 2125 SAANIKEL-MVTVEKESAEAA----IVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEE 2179
A +L + E++ A + A+ K +E ANE L + + A K E
Sbjct: 194 QRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREA 253
Query: 2180 VANEQALAAQAMKSECE 2196
A +A AA+A +E +
Sbjct: 254 AAAAEAAAARARAAEAK 270
Score = 31.2 bits (71), Expect = 8.6
Identities = 17/87 (19%), Positives = 30/87 (34%), Gaps = 9/87 (10%)
Query: 2346 RQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQN---TLDAKKKELKDLQDELDLCVKKKQ 2402
Q ++ +LA L E+++ + +L Q L A + LK+ + K +
Sbjct: 193 EQRAQQAKLAQ-LLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAR 251
Query: 2403 RAEDL-----IGKLGGEKERWSSTAKM 2424
A E +R T K
Sbjct: 252 EAAAAAEAAAARARAAEAKRTGETYKP 278
>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
cellular Activities) superfamily represents an ancient
group of ATPases belonging to the ASCE (for additional
strand, catalytic E) division of the P-loop NTPase fold.
The ASCE division also includes ABC, RecA-like,
VirD4-like, PilT-like, and SF1/2 helicases. Members of
the AAA+ ATPases function as molecular chaperons, ATPase
subunits of proteases, helicases, or nucleic-acid
stimulated ATPases. The AAA+ proteins contain several
distinct features in addition to the conserved
alpha-beta-alpha core domain structure and the Walker A
and B motifs of the P-loop NTPases.
Length = 151
Score = 33.3 bits (76), Expect = 0.71
Identities = 20/103 (19%), Positives = 33/103 (32%), Gaps = 21/103 (20%)
Query: 7 GPAGTGKTETTKDLAKAVAKLC-------IVFNCSDSM--------DYIGLGKFFKGLIA 51
GP GTGKT LA+A+A + N SD + L + L
Sbjct: 26 GPPGTGKTT----LARAIANELFRPGAPFLYLNASDLLEGLVVAELFGHFLVRLLFELAE 81
Query: 52 TGAWAC--FDEFNRIDVEVLSVVAQQILTIQRGVTLGEEEILF 92
DE + + + + + + T+ E +
Sbjct: 82 KAKPGVLFIDEIDSLSRGAQNALLRVLETLNDLRIDRENVRVI 124
Score = 32.9 bits (75), Expect = 0.87
Identities = 16/62 (25%), Positives = 24/62 (38%), Gaps = 12/62 (19%)
Query: 1128 GPAGTGKTETTKDLAKAVAKLC-------IVFNCSDSM-DYIGLGKFFKMVDDAMEAYKK 1179
GP GTGKT LA+A+A + N SD + + F + + +
Sbjct: 26 GPPGTGKTT----LARAIANELFRPGAPFLYLNASDLLEGLVVAELFGHFLVRLLFELAE 81
Query: 1180 KK 1181
K
Sbjct: 82 KA 83
Score = 31.7 bits (72), Expect = 2.3
Identities = 19/111 (17%), Positives = 36/111 (32%), Gaps = 20/111 (18%)
Query: 1319 ETLRNIALMKLLVTHQKPVMFIGPTGTGKSCYITVSILFIKNNSRLRSGFQHFLLKELSI 1378
E L + K ++ GP GTGK+ ++ + EL
Sbjct: 4 EEAIEALREALELPPPKNLLLYGPPGTGKT-----TLA-------------RAIANELFR 45
Query: 1379 EQWQPLIMNFSAQTSANQTQDIIMSKLDKRRKGVYGPPLGKRCVVFVDDVN 1429
L +N S ++ L + + K V+F+D+++
Sbjct: 46 PGAPFLYLNASDLLEGLVVAELFGHFLVRLLFEL--AEKAKPGVLFIDEID 94
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 33.7 bits (77), Expect = 0.71
Identities = 13/53 (24%), Positives = 28/53 (52%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELD 2395
+ L Q ++Q + E + + ++ +++ L+ L K+K+ + LQDEL
Sbjct: 102 DELEQLRREIQQLEKTIAELRSEITSLETEIRDLREELQEKEKDNETLQDELI 154
Score = 30.6 bits (69), Expect = 8.9
Identities = 20/64 (31%), Positives = 34/64 (53%), Gaps = 5/64 (7%)
Query: 2349 EEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCV-----KKKQR 2403
E +++ L EK++ N +Q++L L L+A +++L+ LQ E V KK Q
Sbjct: 129 ETEIRDLREELQEKEKDNETLQDELISLNIELNALEEKLRKLQKENQELVERWMAKKGQE 188
Query: 2404 AEDL 2407
AE +
Sbjct: 189 AEAM 192
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases, possibly
by mediating the dimerisation of kinase domains or by
targeting these kinases to the nuclear pore complex. MLP1
and MLP2 are involved in the process of telomere length
regulation, where they are thought to interact with
proteins such as Tel1p and modulate their activity.
Length = 132
Score = 33.0 bits (76), Expect = 0.71
Identities = 18/72 (25%), Positives = 37/72 (51%), Gaps = 9/72 (12%)
Query: 2342 KEALRQAE--EKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKK----ELKDLQDELD 2395
+E ++ AE E+LQ L+E +++ ++ + + Q L ++ + K L+DEL
Sbjct: 49 RELVKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELS 108
Query: 2396 LCVKKKQRAEDL 2407
++R E+L
Sbjct: 109 EL---EKRIEEL 117
>gnl|CDD|193185 pfam12709, Kinetocho_Slk19, Central kinetochore-associated. This is
a family of proteins integrally involved in the central
kinetochore. Slk19 is a yeast member and it may play an
important role in the timing of nuclear migration. It may
also participate, directly or indirectly, in the
maintenance of centromeric tensile strength during
mitotic stagnation, for instance during activation of
checkpoint controls, when cells need to preserve nuclear
integrity until cell cycle progression can be resumed.
Length = 87
Score = 31.9 bits (73), Expect = 0.79
Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 13/72 (18%)
Query: 2346 RQAEEKLQLAMSALH----EKQEQNRIV---------QEKLQKLQNTLDAKKKELKDLQD 2392
++ ++LQL L+ K EQ + Q+KL +L+ A K+E++ L+
Sbjct: 11 QETNKRLQLLAEDLYVQYSSKHEQKVKMLKKGYEAKYQKKLDELELENKALKQEIEQLKK 70
Query: 2393 ELDLCVKKKQRA 2404
+LD K+KQ
Sbjct: 71 QLDTERKEKQEL 82
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 34.7 bits (81), Expect = 0.80
Identities = 17/68 (25%), Positives = 31/68 (45%), Gaps = 8/68 (11%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQ 2402
E L + E L+ + L K + + E+++ L L +KEL+ L+ +L
Sbjct: 694 EYLNEQEALLKELAALLKAKPSE---LPERVEALLEELKELEKELEQLKAKL-----AAA 745
Query: 2403 RAEDLIGK 2410
A DL+ +
Sbjct: 746 AAGDLLAQ 753
>gnl|CDD|218152 pfam04568, IATP, Mitochondrial ATPase inhibitor, IATP. ATP synthase
inhibitor prevents the enzyme from switching to ATP
hydrolysis during collapse of the electrochemical
gradient, for example during oxygen deprivation ATP
synthase inhibitor forms a one to one complex with the F1
ATPase, possibly by binding at the alpha-beta interface.
It is thought to inhibit ATP synthesis by preventing the
release of ATP. The minimum inhibitory region for bovine
inhibitor is from residues 39 to 72. The inhibitor has
two oligomeric states, dimer (the active state) and
tetramer. At low pH, the inhibitor forms a dimer via
antiparallel coiled coil interactions between the C
terminal regions of two monomers. At high pH, the
inhibitor forms tetramers and higher oligomers by coiled
coil interactions involving the N terminus and inhibitory
region, thus preventing the inhibitory activity.
Length = 90
Score = 32.0 bits (73), Expect = 0.82
Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 14/55 (25%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELD 2395
K+EA A+E+ +K+ +E+L L+ L+ KKE+++L+ +D
Sbjct: 50 KREA---AQEEE-----YFRQKE------KEQLAALKEKLEEHKKEIEELEKHID 90
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 33.4 bits (77), Expect = 0.88
Identities = 13/56 (23%), Positives = 30/56 (53%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDL 2396
+ E L++ +++ + E E+NR ++E+L +L+ +A + E + LQ+
Sbjct: 109 ELERLQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164
Score = 33.4 bits (77), Expect = 1.0
Identities = 20/68 (29%), Positives = 34/68 (50%)
Query: 2340 PKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVK 2399
+E L + +++L L E QEQ +Q++ Q+L+ L + EL+ LQ EL +
Sbjct: 63 SARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQ 122
Query: 2400 KKQRAEDL 2407
A +L
Sbjct: 123 LSANAIEL 130
Score = 31.9 bits (73), Expect = 2.8
Identities = 18/73 (24%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 2364 EQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAEDLIGKLGGEKE---RWSS 2420
E+ +Q++L +LQ L +++L +LQ E ++ E + +L E + S+
Sbjct: 66 ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSA 125
Query: 2421 TAKMLNEKYYQLT 2433
A L+E+ +L
Sbjct: 126 NAIELDEENRELR 138
>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
C-methyltransferase; Provisional.
Length = 390
Score = 34.3 bits (79), Expect = 0.91
Identities = 24/97 (24%), Positives = 40/97 (41%), Gaps = 13/97 (13%)
Query: 2308 PEVVKKSSTAAEGLCKWVIAICKYDKVAKIVG------PKKEALRQAEEKLQLA--MSAL 2359
P +K S GL +AI +A G K++A Q LA ++AL
Sbjct: 24 PVATEKKSKNRTGLVLSAVAIA----IALAAGAGLYYHGKQQAQNQTATNDALANQLTAL 79
Query: 2360 HEKQEQNRI-VQEKLQKLQNTLDAKKKELKDLQDELD 2395
+ QE + ++ L++ LD ++ L +LD
Sbjct: 80 QKAQESQKQELEGILKQQAKALDQANRQQAALAKQLD 116
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 34.3 bits (79), Expect = 0.95
Identities = 18/56 (32%), Positives = 28/56 (50%), Gaps = 4/56 (7%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIV----QEKLQKLQNTLDAKKKELKDLQDEL 2394
L+Q E+L + L EKQ Q + + Q + QKL+ + +KK L L+ L
Sbjct: 173 AELKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSL 228
>gnl|CDD|212580 cd11707, DHR2_DOCK1, Dock Homology Region 2, a GEF domain, of Class
A Dedicator of Cytokinesis 1. Dock1, also called
Dock180, is an atypical guanine nucleotide exchange
factor (GEF) that lacks the conventional Dbl homology
(DH) domain. As a GEF, it activates small GTPases by
exchanging bound GDP for free GTP. Dock1 interacts with
the scaffold protein Elmo and the resulting complex
functions upstream of Rac in many biological events
including phagocytosis of apoptotic cells, cell
migration and invasion. In the nervous system, it
mediates attractive responses to netrin-1 and thus,
plays a role in axon outgrowth and pathfinding. DOCK
proteins are divided into four classes (A-D) based on
sequence similarity and domain architecture; class A
includes Dock1, 2 and 5. All DOCKs contain two homology
domains: the DHR-1 (Dock homology region-1), also called
CZH1 (CED-5, Dock180, and MBC-zizimin homology 1), and
DHR-2 (also called CZH2 or Docker). The DHR-1 domain
binds phosphatidylinositol-3,4,5-triphosphate. This
alignment model represents the DHR-2 domain of Dock1,
which contains the catalytic GEF activity for Rac and/or
Cdc42. Class A DOCKs, like Dock1, are specific GEFs for
Rac and they contain an SH3 domain at the N-terminal
region and a PxxP motif at the C-terminus.
Length = 400
Score = 34.2 bits (78), Expect = 0.96
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 14/57 (24%)
Query: 153 QLYDNCIEYQEKHKMWAESIV-----------GTFDPEQIEE---ETNEFYKNIYKL 195
QLY I Y +K KMW E+I FD EQ+ E + +FY+NI K+
Sbjct: 62 QLYQEIIHYFDKGKMWEEAIALGKELAEQYENEMFDYEQLSELLKKQAQFYENIVKV 118
>gnl|CDD|224142 COG1221, PspF, Transcriptional regulators containing an AAA-type
ATPase domain and a DNA-binding domain [Transcription /
Signal transduction mechanisms].
Length = 403
Score = 34.3 bits (79), Expect = 0.98
Identities = 10/21 (47%), Positives = 11/21 (52%)
Query: 775 KLLVTHQKPVMFIGPTGTGKS 795
K PV+ IG TGTGK
Sbjct: 95 KAYAPSGLPVLIIGETGTGKE 115
Score = 34.3 bits (79), Expect = 0.98
Identities = 10/21 (47%), Positives = 11/21 (52%)
Query: 1328 KLLVTHQKPVMFIGPTGTGKS 1348
K PV+ IG TGTGK
Sbjct: 95 KAYAPSGLPVLIIGETGTGKE 115
>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis
sensory transducer). Thought to undergo reversible
methylation in response to attractants or repellants
during bacterial chemotaxis.
Length = 262
Score = 33.8 bits (78), Expect = 0.99
Identities = 17/77 (22%), Positives = 35/77 (45%), Gaps = 1/77 (1%)
Query: 2125 SAANIKELMVTVEKESAEAAIVAEGVKKDEEVANEQA-LAAQAMKSVAEGVKKDEEVANE 2183
SA I+ L+ +++E+ EA E + E E A++ + + V++ ++ E
Sbjct: 145 SAKEIESLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGDALEEIVDSVEEIADLVQE 204
Query: 2184 QALAAQAMKSECEEILS 2200
A A + EE+ +
Sbjct: 205 IAAATDEQAAGSEEVNA 221
>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
(TACC). This family contains the proteins TACC 1, 2 and
3 the genes for which are found concentrated in the
centrosomes of eukaryotic and may play a conserved role
in organising centrosomal microtubules. The human TACC
proteins have been linked to cancer and TACC2 has been
identified as a possible tumour suppressor (AZU-1). The
functional homologue (Alp7) in Schizosaccharomyces pombe
has been shown to be required for organisation of bipolar
spindles.
Length = 207
Score = 33.2 bits (76), Expect = 1.1
Identities = 28/105 (26%), Positives = 46/105 (43%), Gaps = 30/105 (28%)
Query: 2325 VIAICKYDKVAKIVGPKKEALR------QAEEKLQLAMSALHEK-----------QEQNR 2367
+ C + + ++ KKE R AEEKL++A + + Q R
Sbjct: 115 TLKKCAQEYLDRL---KKEEQRYQALKAHAEEKLEIANEEIAQVRSKAKAETAALQASLR 171
Query: 2368 IVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAEDLIGKLG 2412
Q K+Q L+ TL+ K KE ++L +C ++LI K+G
Sbjct: 172 KEQMKVQSLEETLEQKNKENEEL---TKIC-------DELISKMG 206
>gnl|CDD|215787 pfam00206, Lyase_1, Lyase.
Length = 312
Score = 33.9 bits (78), Expect = 1.1
Identities = 21/92 (22%), Positives = 38/92 (41%), Gaps = 3/92 (3%)
Query: 2133 MVTVEKESAEAAIVAEGVKKDEEVANEQALAAQAMKSVAEGVKKDEE--VANEQALAAQA 2190
++ E AA+ K + + E A +A+ VAE K D+ + Q + A
Sbjct: 23 RLSEEDIKGLAALKKAAAKANVLLKEEAAAIIKALDEVAEEGKADDAFPLKVIQEGSGTA 82
Query: 2191 MKSECEEILSEAIPILEAAEAALNT-LTSNDI 2221
+ E++ E + L ++T +SND
Sbjct: 83 VNMNLNEVIGELLGQLVHPNDKVHTGQSSNDQ 114
>gnl|CDD|213373 cd12083, DD_cGKI, Dimerization/Docking domain of Cyclic GMP-dependent
Protein Kinase I. Cyclic GMP-dependent Protein Kinase I
(PKG1 or cGKI) is a Serine/Threonine Kinase (STK),
catalyzing the transfer of the gamma-phosphoryl group
from ATP to serine/threonine residues on protein
substrates. cGKI exists as two splice variants,
cGKI-alpha and cGKI-beta. They contain an N-terminal
regulatory domain containing a dimerization/docking
region and an autoinhibitory pseudosubstrate region, two
cGMP-binding domains, and a C-terminal catalytic domain.
Binding of cGMP to both binding sites releases the
inhibition of the catalytic center by the pseudosubstrate
region, allowing autophosphorylation and activation of
the kinase. cGKI is a soluble protein expressed in all
smooth muscles, platelets, cerebellum, and kidney. It is
also expressed at lower concentrations in other tissues.
It is involved in the regulation of smooth muscle tone,
smooth cell proliferation, and platelet activation. The
dimerization/docking (D/D) domain is a leucine/isoleucine
zipper that mediates both homodimerization and
interaction with isotype-specific G-kinase-anchoring
proteins (GKAPs). The D/D domain of the two variants
(alpha and beta) differ, allowing their targeting to
different subcellular compartments and intracellular
substrates.
Length = 48
Score = 30.2 bits (69), Expect = 1.1
Identities = 13/40 (32%), Positives = 25/40 (62%)
Query: 2356 MSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELD 2395
L EK E+ R E++++L+ L K +E+++L+ +LD
Sbjct: 1 SGLLEEKTEELRKKDERIRELEQELQEKDEEIQELRSQLD 40
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus spores
are protected by a protein shell consisting of over 50
different polypeptides, known as the coat. This family of
proteins has an important morphogenetic role in coat
assembly, it is involved in the assembly of at least 5
different coat proteins including CotB, CotG, CotS, CotSA
and CotW. It is likely to act at a late stage of coat
assembly.
Length = 185
Score = 32.9 bits (75), Expect = 1.3
Identities = 20/81 (24%), Positives = 33/81 (40%), Gaps = 2/81 (2%)
Query: 2119 QPFLMVSAANIKELMVTVEKESAEAA--IVAEGVKKDEEVANEQALAAQAMKSVAEGVKK 2176
QP L + AN+++ + + + E E VK EE + A+ K E ++
Sbjct: 17 QPKLEEAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPERE 76
Query: 2177 DEEVANEQALAAQAMKSECEE 2197
D E+ AQ + E E
Sbjct: 77 DIAEQQEKEEIAQEEEKEEEA 97
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil domain
of unknwon function.
Length = 126
Score = 31.8 bits (73), Expect = 1.3
Identities = 12/58 (20%), Positives = 33/58 (56%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVK 2399
+ L++ E K + A E+++ + +E++++L+ L+ K E++ L+++L+
Sbjct: 52 DKFLKENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQP 109
Score = 30.3 bits (69), Expect = 5.4
Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 3/75 (4%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQ 2402
AL E+ + L +++E+ +++K ++LQ +L K LK+ + + KK +
Sbjct: 14 LALDAKREEFERREELLKQREEE---LEKKEEELQESLIKFDKFLKENEAKRRRAEKKAE 70
Query: 2403 RAEDLIGKLGGEKER 2417
+ L + E +
Sbjct: 71 EEKKLRKEKEEEIKE 85
>gnl|CDD|190576 pfam03245, Phage_lysis, Bacteriophage lysis protein. This protein is
involved in host lysis. This family is not considered to
be a peptidase according to the MEROPs database.
Length = 125
Score = 32.0 bits (73), Expect = 1.4
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 9/56 (16%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAK-KKELKDLQDELD 2395
K +A +L+ A + + + Q + R V LDAK KEL D + E D
Sbjct: 5 YKRQRDKATSELRAANATIEDMQARQRDV--------AALDAKHTKELADAKAEND 52
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of proteins
contain a band 4.1 domain (pfam00373), at their amino
terminus. This family represents the rest of these
proteins.
Length = 244
Score = 33.2 bits (76), Expect = 1.4
Identities = 17/71 (23%), Positives = 34/71 (47%)
Query: 2347 QAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAED 2406
+AE + Q + + +E R Q++L++ + T +++LK ++E L KK E+
Sbjct: 2 EAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEE 61
Query: 2407 LIGKLGGEKER 2417
+L E
Sbjct: 62 ENRRLEEEAAA 72
Score = 31.3 bits (71), Expect = 5.7
Identities = 13/65 (20%), Positives = 26/65 (40%), Gaps = 9/65 (13%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
++EA EE+ +L + ++ KL+ + K+ E + LQ EL ++
Sbjct: 67 EEEAAASEEERERLE--------AEVDEATAEVAKLEEEREKKEAETRQLQQELR-EAQE 117
Query: 2401 KQRAE 2405
Sbjct: 118 AHERA 122
Score = 30.9 bits (70), Expect = 7.9
Identities = 18/98 (18%), Positives = 36/98 (36%), Gaps = 15/98 (15%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELK-----------DL 2390
++ L + EE L EK +Q + L+K + L+ + + L+ L
Sbjct: 25 QKELEEYEETALE----LEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERL 80
Query: 2391 QDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEK 2428
+ E+D + + E+ K E + + E
Sbjct: 81 EAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEA 118
>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and
secretion / Signal transduction mechanisms].
Length = 408
Score = 33.4 bits (76), Expect = 1.5
Identities = 23/99 (23%), Positives = 40/99 (40%), Gaps = 8/99 (8%)
Query: 2125 SAANIKELMVTVEKESAEAA--------IVAEGVKKDEEVANEQALAAQAMKSVAEGVKK 2176
SA I L+ ++ E+A+A V+EGVK EE + A A++ V++ + +
Sbjct: 290 SAKEIGLLIEEIQNEAADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEEVSQLISE 349
Query: 2177 DEEVANEQALAAQAMKSECEEILSEAIPILEAAEAALNT 2215
EQ + + + EE+ A E
Sbjct: 350 IAAATEEQTAVLEEINASIEELDDVTQENAAAVEELAAA 388
>gnl|CDD|223703 COG0630, VirB11, Type IV secretory pathway, VirB11 components, and
related ATPases involved in archaeal flagella
biosynthesis [Cell motility and secretion /
Intracellular trafficking and secretion].
Length = 312
Score = 33.1 bits (76), Expect = 1.6
Identities = 19/94 (20%), Positives = 35/94 (37%), Gaps = 21/94 (22%)
Query: 719 VPPQGTVYDYRFIKEGKGRWKPWS-DDLASAPAIPRDMPVNQIIVTTEETLRNIALMKLL 777
V P G+ + R K +P + +DL I + A + L
Sbjct: 100 VSPNGSSFTIR-----KFSDEPITPEDLIEYGTISPEQ---------------AAYLWLA 139
Query: 778 VTHQKPVMFIGPTGTGKSCYITVSILFIKNNSRL 811
+ +K ++ G T +GK+ + + FI R+
Sbjct: 140 IEARKSIIICGGTASGKTTLLNALLDFIPPEERI 173
Score = 32.3 bits (74), Expect = 2.9
Identities = 10/41 (24%), Positives = 20/41 (48%)
Query: 1324 IALMKLLVTHQKPVMFIGPTGTGKSCYITVSILFIKNNSRL 1364
A + L + +K ++ G T +GK+ + + FI R+
Sbjct: 133 AAYLWLAIEARKSIIICGGTASGKTTLLNALLDFIPPEERI 173
>gnl|CDD|202846 pfam03993, DUF349, Domain of Unknown Function (DUF349). This domain
is found singly or as up to five tandem repeats in a
small set of bacterial proteins. There are two or three
alpha-helices, and possibly a beta-strand.
Length = 77
Score = 30.6 bits (70), Expect = 1.6
Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 6/58 (10%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKL----QKLQNTLDAKK--KELKDLQDE 2393
K A E+ + L E++ +N +E L + L + D K + L++LQDE
Sbjct: 8 KAARDAFFERRNAFFAELDEERAENLEKKEALIEEAEALAESTDWKAATERLRELQDE 65
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 33.0 bits (76), Expect = 1.6
Identities = 32/124 (25%), Positives = 70/124 (56%), Gaps = 20/124 (16%)
Query: 2330 KYDKVA-KIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTL-------D 2381
KY++VA K+V + + L +AEE+ + A S + E +E+ ++V L+ L+ + D
Sbjct: 114 KYEEVARKLVVVEGD-LERAEERAEAAESKIVELEEELKVVGNNLKSLEVSEEKASQRED 172
Query: 2382 AKKKELKDLQDEL----------DLCVKKKQR-AEDLIGKLGGEKERWSSTAKMLNEKYY 2430
+ +++++DL ++L + V+K ++ + L +L EKE++ + ++ L++
Sbjct: 173 SYEEKIRDLTEKLKEAETRAEFAERSVQKLEKEVDRLEDELLAEKEKYKAISEELDQTLA 232
Query: 2431 QLTG 2434
+LTG
Sbjct: 233 ELTG 236
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together with
Snu17p and a newly identified factor, Pml1p/Ylr016c, form
a novel trimeric complex. called The RES complex,
pre-mRNA retention and splicing complex. Subunits of this
complex are not essential for viability of yeasts but
they are required for efficient splicing in vitro and in
vivo. Furthermore, inactivation of this complex causes
pre-mRNA leakage from the nucleus. Bud13 contains a
unique, phylogenetically conserved C-terminal region of
unknown function.
Length = 141
Score = 31.9 bits (73), Expect = 1.6
Identities = 16/66 (24%), Positives = 37/66 (56%), Gaps = 6/66 (9%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
K+E R+ E++ + + +++ + R E+L+K +N A+ + +D +EL K+
Sbjct: 25 KEEKERKEEKEKEWGKGLVQKEEREKR--LEELEKAKNKPLARYADDEDYDEEL----KE 78
Query: 2401 KQRAED 2406
++R +D
Sbjct: 79 QERWDD 84
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an obligate
intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification of
the inclusion membrane by a set of C. trachomatis-encoded
proteins collectively designated Incs. One of these Incs,
IncA, is functionally associated with the homotypic
fusion of inclusions. This family probably includes
members of the wider Inc family rather than just IncA.
Length = 186
Score = 32.4 bits (74), Expect = 1.6
Identities = 18/71 (25%), Positives = 36/71 (50%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
K +KL+ L E ++Q +QE+L+ L+ + + EL+DL+++L L +
Sbjct: 63 KAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLREL 122
Query: 2401 KQRAEDLIGKL 2411
+ E+ + L
Sbjct: 123 LKSLEERLESL 133
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 32.4 bits (74), Expect = 1.7
Identities = 26/79 (32%), Positives = 37/79 (46%), Gaps = 6/79 (7%)
Query: 2137 EKESAEAAIVAEGVKKDEEVANEQALAAQ--AMKSVAEGV---KKDEEVANEQALAAQAM 2191
EKE A + + K ++ A + AL A+ ++ AE V K E A AA A
Sbjct: 109 EKEGKVNAK-KDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAE 167
Query: 2192 KSECEEILSEAIPILEAAE 2210
+ E EE +E P E+AE
Sbjct: 168 EEEAEEAPAEEAPAEESAE 186
>gnl|CDD|221983 pfam13207, AAA_17, AAA domain.
Length = 114
Score = 31.5 bits (71), Expect = 1.7
Identities = 21/94 (22%), Positives = 36/94 (38%), Gaps = 14/94 (14%)
Query: 1128 GPAGTGKTETTKDLAKAVAKL--CIVFNCSDSMDYIGLGKFFKMVDDAMEAYKKKKRE-- 1183
GP G+GK+ LAK +A+ V + D + GL + D ++ + E
Sbjct: 6 GPPGSGKST----LAKKLAEKLGIPVISLDDLLREEGLAELDDGELDDIDIDLELLEEIL 61
Query: 1184 ------EWVMDWPGQTVLCVGSAYWTADVHKAIA 1211
EWV+D ++ L + V +
Sbjct: 62 DELAKQEWVIDGVRESTLELRLEEADLVVFLDLP 95
>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats.
Length = 101
Score = 31.1 bits (71), Expect = 1.8
Identities = 19/89 (21%), Positives = 39/89 (43%), Gaps = 2/89 (2%)
Query: 2342 KEALRQAEEKLQLAMSA-LHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
E EEK QL S L + E + +K + + L+A ++ ++ L + + +++
Sbjct: 8 DELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEE 67
Query: 2401 KQRAEDLIG-KLGGEKERWSSTAKMLNEK 2428
+ I +L ERW ++ E+
Sbjct: 68 GHPDAEEIEERLEELNERWEELKELAEER 96
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina
virus topoisomerase, Variola virus topoisomerase, Shope
fibroma virus topoisomeras.
Length = 391
Score = 33.1 bits (76), Expect = 1.8
Identities = 18/71 (25%), Positives = 36/71 (50%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
K +AL+ ++L+ + + R ++ K ++ LDA+ KE K + + + K+
Sbjct: 285 KIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQ 344
Query: 2401 KQRAEDLIGKL 2411
+R E+ I KL
Sbjct: 345 IERLEERIEKL 355
>gnl|CDD|215649 pfam00004, AAA, ATPase family associated with various cellular
activities (AAA). AAA family proteins often perform
chaperone-like functions that assist in the assembly,
operation, or disassembly of protein complexes.
Length = 131
Score = 31.8 bits (73), Expect = 1.9
Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 9/42 (21%)
Query: 7 GPAGTGKTETTKDLAKAVAKLC----IVFNCSDSMD-YIGLG 43
GP GTGKT LAKAVAK I + S+ + Y+G
Sbjct: 5 GPPGTGKTT----LAKAVAKELGAPFIEISGSELVSKYVGES 42
Score = 31.8 bits (73), Expect = 1.9
Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 9/42 (21%)
Query: 1128 GPAGTGKTETTKDLAKAVAKLC----IVFNCSDSMD-YIGLG 1164
GP GTGKT LAKAVAK I + S+ + Y+G
Sbjct: 5 GPPGTGKTT----LAKAVAKELGAPFIEISGSELVSKYVGES 42
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part of
a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 33.5 bits (77), Expect = 2.0
Identities = 23/104 (22%), Positives = 45/104 (43%), Gaps = 7/104 (6%)
Query: 2293 IKRLEDEFLSREDFDPEVVKKSSTAAEGLCKWVIAICKYDKVAKIVGPKKEALRQAEEKL 2352
I+ L E E+ D E ++ S G + +K A+ KKE + +EK
Sbjct: 394 IEELRRELEGEEESDEEENEEPSKKNVG--RRKFGPENGEKEAESKKLKKENKNEFKEKK 451
Query: 2353 QLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDL 2396
+ E +E + K++K+ N L + ++ + ++E +L
Sbjct: 452 ESDEEEELEDEE-----EAKVEKVANKLLKRSEKAQKEEEEEEL 490
>gnl|CDD|206779 cd11386, MCP_signal, Methyl-accepting chemotaxis protein (MCP),
signaling domain. Methyl-accepting chemotaxis proteins
(MCPs or chemotaxis receptors) are an integral part of
the transmembrane protein complex that controls bacterial
chemotaxis, together with the histidine kinase CheA, the
receptor-coupling protein CheW, receptor-modification
enzymes, and localized phosphatases. MCPs contain a four
helix trans membrane region, an N-terminal periplasmic
ligand binding domain, and a C-terminal HAMP domain
followed by a cytoplasmic signaling domain. This
C-terminal signaling domain dimerizes into a four-helix
bundle and interacts with CheA through the adaptor
protein CheW.
Length = 200
Score = 32.2 bits (74), Expect = 2.1
Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 12/84 (14%)
Query: 2125 SAANIKELMVTVEKESAEAA--------IVAEGVKKDEEVANEQALA--AQAMKSVAEGV 2174
+A I+EL+ +++++ EA V EGV+ EE +A +++ VA+G+
Sbjct: 118 AAKEIEELIEEIQEQTEEAVEAMEETSEEVEEGVELVEETG--RAFEEIVASVEEVADGI 175
Query: 2175 KKDEEVANEQALAAQAMKSECEEI 2198
++ EQ+ + Q + + EEI
Sbjct: 176 QEISAATQEQSASTQEIAAAVEEI 199
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three proteins,
Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin
binding (TnT). The troponin complex regulates Ca++
induced muscle contraction. This family includes troponin
T and troponin I. Troponin I binds to actin and troponin
T binds to tropomyosin.
Length = 131
Score = 31.4 bits (72), Expect = 2.1
Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 16/81 (19%)
Query: 2341 KKEALRQAEEKLQL------AMSALHEKQEQNRIVQEKLQKLQNTLDAKK-----KELKD 2389
+ E ++ EEK + + + E + +KL + LD ++ K K
Sbjct: 17 EFEQRKKEEEKEKYLAERCPPLRLSLSRAELQELC-KKLHARIDRLDEERYDIEEKVAKK 75
Query: 2390 LQDELDLCVKKKQRAEDLIGK 2410
++ DL K++ DL GK
Sbjct: 76 DKEIEDL----KKKVNDLRGK 92
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 33.3 bits (77), Expect = 2.1
Identities = 15/32 (46%), Positives = 16/32 (50%), Gaps = 7/32 (21%)
Query: 7 GPAGTGKTETTKDLAKAVAKLCIVFNCSDSMD 38
GP GTGKT K AKAV NC + D
Sbjct: 45 GPRGTGKTSAAKIFAKAV-------NCLNPPD 69
Score = 33.3 bits (77), Expect = 2.1
Identities = 15/32 (46%), Positives = 16/32 (50%), Gaps = 7/32 (21%)
Query: 1128 GPAGTGKTETTKDLAKAVAKLCIVFNCSDSMD 1159
GP GTGKT K AKAV NC + D
Sbjct: 45 GPRGTGKTSAAKIFAKAV-------NCLNPPD 69
>gnl|CDD|222698 pfam14346, DUF4398, Domain of unknown function (DUF4398). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 127 and 269
amino acids in length.
Length = 105
Score = 30.8 bits (70), Expect = 2.2
Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 9/71 (12%)
Query: 2340 PKKEALRQAEEKLQLAMSALHEKQ--EQNRIVQEKLQKLQNTLDAKKKELKD-LQDELDL 2396
P L+ A EKL A +AL E + E R+ ++ + A+ K Q L
Sbjct: 39 PPYVELKLAREKLAQAKAALDEGKYEEARRLAEQAEADAR---LAEAKARAAKAQAAL-- 93
Query: 2397 CVKKKQRAEDL 2407
+ K E L
Sbjct: 94 -EEAKDSIERL 103
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have LPXTG
C-terminal anchoring motifs and a substantial number have
the KxYKxGKxW putative sorting signal at the N-terminus.
The tetracycline resistance plasmid pCF10 in Enterococcus
faecalis promotes conjugal plasmid transfer in response
to sex pheromones, but PgrA/Sec10 encoded by that
plasmid, a member of this family, specifically inhibits
the ability of cells to receive homologous plasmids. The
phenomenon is called surface exclusion.
Length = 356
Score = 32.8 bits (75), Expect = 2.3
Identities = 13/48 (27%), Positives = 22/48 (45%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELK 2388
+ +AL+ A+ L A +AL + + +E L L L K+ L
Sbjct: 309 QAQALQTAQNNLATAQAALANAEARLAKAKEALANLNADLAKKQAALD 356
Score = 32.0 bits (73), Expect = 4.7
Identities = 15/60 (25%), Positives = 23/60 (38%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
+ AL A+ L A +AL+ Q Q Q L +KEL + Q + +
Sbjct: 260 QAALATAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQTAQNN 319
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is involved
in determining the site of synaptic vesicle fusion. The
C-terminus is a PDZ-binding motif that binds directly to
RIM (a small G protein Rab-3A effector). The family also
contains four coiled-coil domains.
Length = 774
Score = 33.1 bits (75), Expect = 2.4
Identities = 16/70 (22%), Positives = 37/70 (52%), Gaps = 3/70 (4%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
K+ +E++Q L E++ Q +++E++ KL + ++ +L+ EL+ K +
Sbjct: 471 KKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQLKQRSDLERAHIELE---KIR 527
Query: 2402 QRAEDLIGKL 2411
++ E L +L
Sbjct: 528 EKHEKLEKEL 537
Score = 31.1 bits (70), Expect = 8.8
Identities = 19/64 (29%), Positives = 36/64 (56%), Gaps = 6/64 (9%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKK 2400
K+E + +E ++ + +EQ R+ QE+ Q Q T+ A ++ELK Q EL+ +K
Sbjct: 42 KRERALRKDEAARITVW-----KEQLRVTQEENQHAQLTVQALQEELK-SQRELNRLQQK 95
Query: 2401 KQRA 2404
+++
Sbjct: 96 LEKS 99
Score = 31.1 bits (70), Expect = 9.0
Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 16/109 (14%)
Query: 2303 REDFDPEV-VKKSSTAAEGLCKWVIAICKYDKVAKIVGPKKEALRQAEEKLQLAMSALHE 2361
R D EV ++S+ AA K D+ A+ + KK L + +L+
Sbjct: 268 RNRLDKEVEAERSALAA--------MKAKCDRAAQELSRKKTELLGLQTELE-------T 312
Query: 2362 KQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAEDLIGK 2410
Q+ +++ L KL+ L ++E LQ E+D + +R + + K
Sbjct: 313 LANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTK 361
Score = 31.1 bits (70), Expect = 9.1
Identities = 27/102 (26%), Positives = 39/102 (38%), Gaps = 30/102 (29%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLC----- 2397
LR EKLQL M ++LQ TLDA+ +E+K L + L
Sbjct: 131 FLLRDTLEKLQLRM-----------------ERLQQTLDARDEEIKKLLEMLQSKGLSAK 173
Query: 2398 --------VKKKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQ 2431
+ + + AED L E+ ML E+ +Q
Sbjct: 174 AEEEDSERLGRIREAEDQQSHLEVLLEQKEKEHMMLREEIHQ 215
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 33.1 bits (76), Expect = 2.4
Identities = 30/121 (24%), Positives = 54/121 (44%), Gaps = 21/121 (17%)
Query: 2102 DKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEAAIVAEGV----KKDEEVA 2157
++ A ++ +RE +E L +L+ + A+A V +DEE+
Sbjct: 275 EELAEEVRDLRERLEEL-------EEERDDLLAEAGLDDADAEAVEARREELEDRDEELR 327
Query: 2158 N---EQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAALN 2214
+ E +AAQA AE +++D + E+ A+ ++ E E+ SE LE A A+
Sbjct: 328 DRLEECRVAAQAHNEEAESLREDADDLEER---AEELREEAAELESE----LEEAREAVE 380
Query: 2215 T 2215
Sbjct: 381 D 381
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This family
of bacterial proteins has no known function. The proteins
are in the region of 500-600 amino acid residues in
length.
Length = 450
Score = 33.0 bits (76), Expect = 2.4
Identities = 17/69 (24%), Positives = 35/69 (50%), Gaps = 5/69 (7%)
Query: 2369 VQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQ-----RAEDLIGKLGGEKERWSSTAK 2423
++E+L++L+ L ++EL+ ++ L KK + + + KL KE+ S +
Sbjct: 332 LKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELE 391
Query: 2424 MLNEKYYQL 2432
L E+ +L
Sbjct: 392 ELEEELKEL 400
>gnl|CDD|151045 pfam10482, CtIP_N, Tumour-suppressor protein CtIP N-terminal domain.
CtIP is predominantly a nuclear protein that complexes
with both BRCA1 and the BRCA1-associated RING domain
protein (BARD1). At the protein level, CtIP expression
varies with cell cycle progression in a pattern identical
to that of BRCA1. Thus, the steady-state levels of CtIP
polypeptides, which remain low in resting cells and G1
cycling cells, increase dramatically as Dividing cells
traverse the G1/S boundary. CtIP can potentially modulate
the functions ascribed to BRCA1 in transcriptional
regulation, DNA repair, and/or cell cycle checkpoint
control. This N-terminal domain carries a coiled-coil
region and is essential for homodimerisation of the
protein. The C-terminal domain is family pfam08573.
Length = 120
Score = 31.0 bits (70), Expect = 2.4
Identities = 25/102 (24%), Positives = 49/102 (48%), Gaps = 12/102 (11%)
Query: 2335 AKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDA--------KKKE 2386
AK+ KKE A+ +L+ S + +EQ +++QE ++ L++ L A ++
Sbjct: 21 AKLAKLKKERCLDAQ-RLEEFFSKNQQLREQQKVLQENIRVLEDRLRAGLCDRCTVTEEL 79
Query: 2387 LKDLQDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEK 2428
K Q EL+ Q+ +I +L E + + + L+E+
Sbjct: 80 AKKKQQELE---NNHQQNLQVISELTNEMKTLTEENRRLSEE 118
>gnl|CDD|236828 PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit;
Provisional.
Length = 758
Score = 33.3 bits (76), Expect = 2.5
Identities = 20/65 (30%), Positives = 31/65 (47%), Gaps = 6/65 (9%)
Query: 1101 GKTCAYRMLSEALQLIEEWSNQGGAP------EGPAGTGKTETTKDLAKAVAKLCIVFNC 1154
G+ A L+EA+++ P GP G GKTE T L+KA+ + F+
Sbjct: 462 GQDKAIEALTEAIKMSRAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDM 521
Query: 1155 SDSMD 1159
S+ M+
Sbjct: 522 SEYME 526
Score = 31.0 bits (70), Expect = 9.7
Identities = 14/32 (43%), Positives = 20/32 (62%)
Query: 7 GPAGTGKTETTKDLAKAVAKLCIVFNCSDSMD 38
GP G GKTE T L+KA+ + F+ S+ M+
Sbjct: 495 GPTGVGKTEVTVQLSKALGIELLRFDMSEYME 526
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 33.1 bits (76), Expect = 2.5
Identities = 17/87 (19%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKK 2401
+E +++ E++L+ + ++E + ++E+L+KL +KE+K+L++ + + +
Sbjct: 192 EELIKEKEKELEEVLREINEISSELPELREELEKL-------EKEVKELEELKEEIEELE 244
Query: 2402 QRAEDLIGKLGGEKERWSSTAKMLNEK 2428
+ E L G +E+ + + E
Sbjct: 245 KELESLEGSKRKLEEKIRELEERIEEL 271
Score = 32.7 bits (75), Expect = 3.4
Identities = 29/139 (20%), Positives = 60/139 (43%), Gaps = 4/139 (2%)
Query: 2293 IKRLEDEFLSREDFDPEVVKKSSTAAEGLCKWVIAICKYDKVAKIVGPKKEALRQAEEKL 2352
+K +E++ +F E + + + L + I ++ K + K+E L + ++KL
Sbjct: 288 LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKL 347
Query: 2353 QLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKK----ELKDLQDELDLCVKKKQRAEDLI 2408
+ L E +E++ + +E K + KK+ + L+ EL+ K K+ E+ I
Sbjct: 348 KELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407
Query: 2409 GKLGGEKERWSSTAKMLNE 2427
K+ K L +
Sbjct: 408 SKITARIGELKKEIKELKK 426
>gnl|CDD|218292 pfam04851, ResIII, Type III restriction enzyme, res subunit.
Length = 100
Score = 30.6 bits (70), Expect = 2.7
Identities = 15/57 (26%), Positives = 26/57 (45%), Gaps = 11/57 (19%)
Query: 781 QKPVMFIGPTGTGKSCYITVSILFIKNNSRLRSGFQHFLL----KELSIEQWQPLIM 833
+K + + TG+GK+ + I RL G + L K+L +EQ +I+
Sbjct: 18 KKRGLIVMATGSGKT---LTAAALIA---RLAKGKKKVLFVVPRKDL-LEQALVIII 67
Score = 30.6 bits (70), Expect = 2.7
Identities = 15/57 (26%), Positives = 26/57 (45%), Gaps = 11/57 (19%)
Query: 1334 QKPVMFIGPTGTGKSCYITVSILFIKNNSRLRSGFQHFLL----KELSIEQWQPLIM 1386
+K + + TG+GK+ + I RL G + L K+L +EQ +I+
Sbjct: 18 KKRGLIVMATGSGKT---LTAAALIA---RLAKGKKKVLFVVPRKDL-LEQALVIII 67
>gnl|CDD|153293 cd07609, BAR_SIP3_fungi, The Bin/Amphiphysin/Rvs (BAR) domain of
fungal Snf1p-interacting protein 3. BAR domains are
dimerization, lipid binding and curvature sensing
modules found in many different proteins with diverse
functions including organelle biogenesis, membrane
trafficking or remodeling, and cell division and
migration. This group is composed of mostly
uncharacterized fungal proteins with similarity to
Saccharomyces cerevisiae Snf1p-interacting protein 3
(SIP3). These proteins contain an N-terminal BAR domain
followed by a Pleckstrin Homology (PH) domain. SIP3
interacts with SNF1 protein kinase and activates
transcription when anchored to DNA. It may function in
the SNF1 pathway. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 214
Score = 32.3 bits (74), Expect = 2.7
Identities = 16/69 (23%), Positives = 27/69 (39%), Gaps = 6/69 (8%)
Query: 138 ALISEIIVKKLAPYKQLYDNCIEYQEKH-KMWAE--SIVGTFDPEQIEEETNEFY---KN 191
+ + + PYK+L N YQ K+ M A + T +P + E+ + + K
Sbjct: 92 DPLRSFVKSDIRPYKELRKNFEYYQRKYDSMLARYVAQSKTKEPSSLREDAFQLFEARKA 151
Query: 192 IYKLKLQFS 200
K L
Sbjct: 152 YLKASLDLV 160
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7 plays a
role in protein translation. Deletions of the TMA7 gene
results in altered protein synthesis rates.
Length = 63
Score = 29.4 bits (66), Expect = 2.8
Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 6/39 (15%)
Query: 2380 LDAKKKELKDLQDELDLCVKKKQRAE-----DLIGKLGG 2413
L A KK+ K+L DE DL K+KQ+ E +L K G
Sbjct: 11 LKAPKKQKKEL-DEEDLAFKQKQKEEAKALKELAAKAKG 48
>gnl|CDD|191033 pfam04576, Zein-binding, Zein-binding. This domain binds to zein
proteins, pfam01559. Zein proteins are seed storage
proteins.
Length = 94
Score = 30.4 bits (69), Expect = 2.9
Identities = 17/54 (31%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Query: 2345 LRQAEEKLQLAMSALHEK---QEQNRIVQEKLQKLQNTLDAKKKELKDLQDELD 2395
LR EEK L M A + +E+ QE+++ L++ L +++E+ L+ EL+
Sbjct: 41 LRLQEEKAALEMEARQYQRMAEEKMEYDQEEIELLEDLLYEREREIAALEKELE 94
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 32.7 bits (74), Expect = 2.9
Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 8/64 (12%)
Query: 2327 AICKYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQN---TLDAK 2383
AIC DK + + G RQAEE+L+ A + ++ ++ +E LQK+Q LD
Sbjct: 177 AICSKDKESLLAGK-----RQAEEQLEACGKARELQHQEQQLAEENLQKVQALCIPLDKD 231
Query: 2384 KKEL 2387
K E
Sbjct: 232 KFEA 235
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes. This
domain is about 200 amino acids in length. This domain is
found associated with pfam00013, pfam01966. This domain
has a single completely conserved residue A that may be
functionally important.
Length = 201
Score = 31.7 bits (73), Expect = 3.1
Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 21/97 (21%)
Query: 2341 KKEALRQAEEKLQLAMSAL--------HEKQEQ-NRIVQ---------EKLQKLQNTLDA 2382
KKEAL +A+E++ + +E Q Q R++Q E L+K + +L+
Sbjct: 51 KKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEE 110
Query: 2383 KKKELKDLQDELDLCVKKKQRAEDLIGKLGGEKERWS 2419
K+KEL Q +L+ +K++ E+LI + E ER S
Sbjct: 111 KEKELAARQQQLE---EKEEELEELIEEQQQELERIS 144
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 31.8 bits (72), Expect = 3.1
Identities = 13/57 (22%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 2341 KKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQK--LQNTLDAKKKELKDLQDELD 2395
K ++ +E+L+ L E +++ ++++ L++ + L+ K+EL++L +EL+
Sbjct: 148 LKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 32.5 bits (75), Expect = 3.4
Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 9/92 (9%)
Query: 2346 RQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKEL--------KDLQDELDLC 2397
R+ E+K + A + L E ++ ++EK +KLQ D +E K+ + E D
Sbjct: 530 RELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEI 589
Query: 2398 VKKKQRAEDLIGKLGGEKERWSSTAKMLNEKY 2429
+K+ ++ + E K LN+
Sbjct: 590 IKELRQLQKGGYASVKAHEL-IEARKRLNKAN 620
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 32.4 bits (74), Expect = 3.4
Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 2332 DKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQ 2391
+ + + +++ R+ E+LQ L +K+EQ EKL L+N L+ ++K L +
Sbjct: 59 ELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARE 118
Query: 2392 DELDLCVKKKQRAEDL 2407
EL+ +KQ +L
Sbjct: 119 LELEE--LEKQLDNEL 132
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 31.9 bits (73), Expect = 3.5
Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 12/99 (12%)
Query: 2098 LEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEAAIVAEGVKKDEEVA 2157
L+K D+A+S I ++ +IE Q +EL +E+ AE EG +K E
Sbjct: 84 LKKTDEASSIIEKLQMQIEREQ------EEWEEELERLIEEAKAEGY--EEGYEKGREEG 135
Query: 2158 NE--QALAAQAMKSVAEGVKKDEEV--ANEQALAAQAMK 2192
E + L Q K + + +KK EE+ ++E+ + A+
Sbjct: 136 LEEVRELIEQLNKIIEKLIKKREEILESSEEEIVELALD 174
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes MND1
from S. cerevisiae. The mnd1 protein forms a complex with
hop2 to promote homologous chromosome pairing and meiotic
double-strand break repair.
Length = 188
Score = 31.4 bits (72), Expect = 4.1
Identities = 10/71 (14%), Positives = 35/71 (49%), Gaps = 5/71 (7%)
Query: 2329 CKYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQ----NTLDAKK 2384
+ ++ + K+ + EE+ +L + L + +++ + ++ +L+K + ++ K
Sbjct: 83 QRIAELQAQIEKLKKGREETEERTEL-LEELKQLEKELKKLKAELEKYEKNDPERIEKLK 141
Query: 2385 KELKDLQDELD 2395
+E K ++ +
Sbjct: 142 EETKVAKEAAN 152
>gnl|CDD|219189 pfam06818, Fez1, Fez1. This family represents the eukaryotic Fez1
protein. Fez1 contains a leucine-zipper region with
similarity to the DNA-binding domain of the
cAMP-responsive activating-transcription factor 5. There
is evidence that Fez1 inhibits cancer cell growth through
regulation of mitosis, and that its alterations result in
abnormal cell growth. Note that some family members
contain more than one copy of this region.
Length = 200
Score = 31.4 bits (71), Expect = 4.2
Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 3/63 (4%)
Query: 2357 SALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAEDLIGKLGGEKE 2416
+ L E + Q R +E+LQ+LQ+ L K EL+ ++EL +KK AE L K G +
Sbjct: 38 AQLREARAQLRESEEQLQELQDALRTKTLELEVCENELQ---RKKSEAELLREKAGRLEA 94
Query: 2417 RWS 2419
+
Sbjct: 95 ELA 97
>gnl|CDD|222000 pfam13238, AAA_18, AAA domain.
Length = 128
Score = 30.5 bits (69), Expect = 4.4
Identities = 12/57 (21%), Positives = 22/57 (38%), Gaps = 4/57 (7%)
Query: 1128 GPAGTGKTETTKDLAKA----VAKLCIVFNCSDSMDYIGLGKFFKMVDDAMEAYKKK 1180
G G+GKT K+LA+ + L +D + ++ +D + K
Sbjct: 5 GTPGSGKTTLAKELAERLGDVLRDLAKENGLVLELDEEITDESKRLDEDKLAKLLDK 61
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
the myosin-like domain [Function unknown].
Length = 499
Score = 32.3 bits (73), Expect = 4.4
Identities = 18/121 (14%), Positives = 46/121 (38%), Gaps = 3/121 (2%)
Query: 2316 TAAEGLCKWVIAICKYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQK 2375
L + + D + + + L A+ + + A + + + + +++ +
Sbjct: 64 LLNRNLRS---GVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREA 120
Query: 2376 LQNTLDAKKKELKDLQDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEKYYQLTGD 2435
++ L A ++ L Q EL K+ Q + + L ++ + + A+ L QL
Sbjct: 121 VRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQAS 180
Query: 2436 V 2436
Sbjct: 181 A 181
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 30.6 bits (70), Expect = 4.5
Identities = 15/67 (22%), Positives = 34/67 (50%), Gaps = 2/67 (2%)
Query: 2330 KYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRI--VQEKLQKLQNTLDAKKKEL 2387
++ K + ++ L++ +EKLQ + L E + + +Q+K+Q+ Q +++L
Sbjct: 26 EFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDL 85
Query: 2388 KDLQDEL 2394
+ Q E
Sbjct: 86 QKRQQEE 92
Score = 29.9 bits (68), Expect = 8.3
Identities = 15/73 (20%), Positives = 36/73 (49%), Gaps = 2/73 (2%)
Query: 2333 KVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQD 2392
++ K ++ L + E++LQ L +++ + + +K + L K +E + Q
Sbjct: 22 QLEKEFKKRQAELEKLEKELQKLKEKL--QKDAATLSEAAREKKEKELQKKVQEFQRKQQ 79
Query: 2393 ELDLCVKKKQRAE 2405
+L ++K+Q+ E
Sbjct: 80 KLQQDLQKRQQEE 92
>gnl|CDD|218559 pfam05335, DUF745, Protein of unknown function (DUF745). This family
consists of several uncharacterized Drosophila
melanogaster proteins of unknown function.
Length = 188
Score = 31.1 bits (71), Expect = 4.6
Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 4/93 (4%)
Query: 2130 KELMVTVEKESAEAAIVAEGVKKD----EEVANEQALAAQAMKSVAEGVKKDEEVANEQA 2185
++L+ +E+E EA V E + E A AAQ ++ +K + A
Sbjct: 66 QQLVEQLEQEVREAEAVVEEESQSLQQSEANATAAQAAAQQAQTQVNTLKALLKNAQANL 125
Query: 2186 LAAQAMKSECEEILSEAIPILEAAEAALNTLTS 2218
+ + S ++ L+E +LEAA+ + L
Sbjct: 126 ENIEQVASGAQQELAEKTQLLEAAKRRVEVLER 158
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is a
small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis. In
most cases, it resides in potential 5' UTRs of homologues
of the yjdF gene whose function is unknown. However, in
Streptococcus thermophilus, a yjdF RNA motif is
associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound that
structurally resembles the ligand bound by other yjdF
RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 30.3 bits (69), Expect = 4.8
Identities = 18/74 (24%), Positives = 34/74 (45%), Gaps = 14/74 (18%)
Query: 2336 KIVGPKKEALRQAEEKLQ------LAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKD 2389
K + PK+ RQA ++++ A AL + E+N+ ++K K + + ++K
Sbjct: 63 KKINPKR-LQRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQLK 121
Query: 2390 LQDELDLCVKKKQR 2403
Q KKK +
Sbjct: 122 QQ-------KKKAK 128
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 30.7 bits (70), Expect = 4.8
Identities = 11/44 (25%), Positives = 24/44 (54%)
Query: 2352 LQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELD 2395
+Q +S + + ++++ +KLQ L K+KEL+ + +L
Sbjct: 23 VQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQ 66
Score = 30.3 bits (69), Expect = 8.2
Identities = 20/67 (29%), Positives = 39/67 (58%), Gaps = 2/67 (2%)
Query: 2330 KYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRI--VQEKLQKLQNTLDAKKKEL 2387
++ K+ + K++ L++ E+KLQ + L E+ + + +Q+K Q+LQ A ++EL
Sbjct: 43 EFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQEL 102
Query: 2388 KDLQDEL 2394
+ Q EL
Sbjct: 103 QQKQQEL 109
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 31.9 bits (73), Expect = 4.9
Identities = 13/53 (24%), Positives = 30/53 (56%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDEL 2394
L +A ++L+ + EK++Q +Q++LQ+L ++A + +L +E+
Sbjct: 206 PLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEI 258
>gnl|CDD|224300 COG1382, GimC, Prefoldin, chaperonin cofactor [Posttranslational
modification, protein turnover, chaperones].
Length = 119
Score = 30.4 bits (69), Expect = 4.9
Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 12/61 (19%)
Query: 2332 DKVAKIVGP------KKEALRQAEEK---LQLAMSALHEKQEQNRIVQEKLQKLQNTLDA 2382
V K VG K+EA+ + EE+ L+L + L EKQE +QE+L++LQ+ +
Sbjct: 53 APVYKKVGNLLVKVSKEEAVDELEERKETLELRIKTL-EKQE--EKLQERLEELQSEIQK 109
Query: 2383 K 2383
Sbjct: 110 A 110
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 31.6 bits (72), Expect = 4.9
Identities = 20/76 (26%), Positives = 37/76 (48%)
Query: 2332 DKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQ 2391
K A+++ L EEKL+ + L E ++ + +++ L+KLQ + +KE LQ
Sbjct: 4 KKAARLLNQSLSKLDTPEEKLETLIKRLAELLKEMKRLEKDLKKLQKRGEQLQKEKDKLQ 63
Query: 2392 DELDLCVKKKQRAEDL 2407
E + + + E L
Sbjct: 64 SEHSKAILARSKLESL 79
>gnl|CDD|204187 pfam09278, MerR-DNA-bind, MerR, DNA binding. Members of this family
of DNA-binding domains are predominantly found in the
prokaryotic transcriptional regulator MerR. They adopt a
structure consisting of a core of three alpha helices,
with an architecture that is similar to that of the
'winged helix' fold.
Length = 65
Score = 29.0 bits (66), Expect = 5.0
Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 4/46 (8%)
Query: 2352 LQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLC 2397
L L + +++EKL +L +A+ EL+ L+ ELD
Sbjct: 24 LALYDDPGRPCADVRALLREKLAEL----EARIAELQALRAELDAL 65
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo kinase
and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 30.8 bits (70), Expect = 5.0
Identities = 18/65 (27%), Positives = 33/65 (50%), Gaps = 8/65 (12%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKE-LKDLQDELDLCVKK 2400
E L + +E+ + M A E+QEQ Q++ ++ + +E L+ LQ+E K+
Sbjct: 83 VEKLPRFQEQEKKRMKAEKEEQEQKH--QKQEREFLAKQEENLEEALQQLQNE-----KR 135
Query: 2401 KQRAE 2405
+ AE
Sbjct: 136 HELAE 140
>gnl|CDD|218651 pfam05600, DUF773, Protein of unknown function (DUF773). This family
contains several eukaryotic sequences which are thought
to be CDK5 activator-binding proteins, however, the
function of this family is unknown.
Length = 506
Score = 32.0 bits (73), Expect = 5.3
Identities = 20/70 (28%), Positives = 34/70 (48%), Gaps = 4/70 (5%)
Query: 2343 EALRQ---AEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVK 2399
E L Q E+KL+ + EK+++ R Q L+ + L K +EL+ E D+ +
Sbjct: 438 ETLEQKLKQEDKLKEKRELMTEKRQEAREEQSSLEPKLDLLIEKTRELQK-LIEADISKR 496
Query: 2400 KKQRAEDLIG 2409
R +L+G
Sbjct: 497 YNGRPVNLMG 506
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 31.5 bits (72), Expect = 5.5
Identities = 12/50 (24%), Positives = 30/50 (60%)
Query: 2345 LRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDEL 2394
L +A+EKL+ + + K ++ ++E+LQ+L++ ++ + +L E+
Sbjct: 206 LDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEI 255
>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO. This family
contains the bacterial type III secretion protein YscO,
which is approximately 150 residues long. YscO has been
shown to be required for high-level expression and
secretion of the anti-host proteins V antigen and Yops in
Yersinia pestis.
Length = 152
Score = 30.5 bits (69), Expect = 5.5
Identities = 13/55 (23%), Positives = 25/55 (45%), Gaps = 4/55 (7%)
Query: 2342 KEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDL 2396
K+ L ++L+ A L E ++ QEK +L A+ + + +EL+
Sbjct: 94 KQRLEAERQRLRQARQQLQEARKA----QEKFAELARQEQAEAQAQRQYLEELEQ 144
>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
family.
Length = 308
Score = 31.5 bits (72), Expect = 5.8
Identities = 23/85 (27%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 2371 EKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAE-----DLIGKLGGEKERWSSTAKML 2425
+ L+NT ELK+L++EL ++ +R E +L+ K+ E + A+ L
Sbjct: 207 IRRSSLKNTERFTTPELKELENELLEAKEEIERLEKEILRELLEKVLEYLEELRALAEAL 266
Query: 2426 NEKYYQLTGDVLIAAGVVAYLGPFT 2450
E L DVL++ +A G +
Sbjct: 267 AE----L--DVLLSLATLAAEGNYV 285
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827). This
family consists of several plant proteins of unknown
function. Several sequences in this family are described
as being "myosin heavy chain-like".
Length = 484
Score = 31.9 bits (72), Expect = 5.9
Identities = 37/172 (21%), Positives = 65/172 (37%), Gaps = 29/172 (16%)
Query: 2066 PTSYLELISTFKQLLKVKQEEVLNGKNRYTVGLE------KLDKAASQIAVMREEIEYLQ 2119
P LE ++ L+ Q+E+ + +V E L+KAA + +R +E L+
Sbjct: 227 PEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLK 286
Query: 2120 -------------------PFLMVSAANIKELMVTVEKESAEAAIVAEGVKKDEEVANEQ 2160
S+ + EKES + E K ++ ++E
Sbjct: 287 QELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSE- 345
Query: 2161 ALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAA 2212
A+ + AE +++ E+A +A E L A+ EAA+AA
Sbjct: 346 ---AEEARKEAEAAREELRKLKEEAEQTKAALETAELRLEAALKEAEAAKAA 394
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 31.9 bits (72), Expect = 5.9
Identities = 16/85 (18%), Positives = 36/85 (42%), Gaps = 2/85 (2%)
Query: 2345 LRQAEEKLQLAMSAL-HEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDEL-DLCVKKKQ 2402
L EK Q L K+ N+ Q+K+ ++ + +KD+++++ D +
Sbjct: 914 LETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLK 973
Query: 2403 RAEDLIGKLGGEKERWSSTAKMLNE 2427
+ E + + + E + +NE
Sbjct: 974 QKETELNTVNAQLEECEKHQEKINE 998
>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
secretion / Intracellular trafficking and secretion /
Posttranslational modification, protein turnover,
chaperones].
Length = 148
Score = 30.4 bits (69), Expect = 6.0
Identities = 15/78 (19%), Positives = 38/78 (48%), Gaps = 8/78 (10%)
Query: 2331 YDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDL 2390
+++ + G + E + EKL+ +SA Q + Q+ + +L+ +D ++ +L L
Sbjct: 36 EEQLKMLSGYRNEYEQNLNEKLKSGVSA-----AQWQNYQQFISQLEVAIDQQQSQLSKL 90
Query: 2391 QDELDLCVK---KKQRAE 2405
+ +++ + +KQ
Sbjct: 91 RKQVEQKREIWQEKQIEL 108
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 31.5 bits (72), Expect = 7.0
Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 3/35 (8%)
Query: 7 GPAGTGKTETTKDLAKAV---AKLCIVFNCSDSMD 38
GP G GKTE K LA+A+ + I + S+ M+
Sbjct: 528 GPTGVGKTELAKALAEALFGDEQALIRIDMSEYME 562
Score = 31.5 bits (72), Expect = 7.0
Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 3/35 (8%)
Query: 1128 GPAGTGKTETTKDLAKAV---AKLCIVFNCSDSMD 1159
GP G GKTE K LA+A+ + I + S+ M+
Sbjct: 528 GPTGVGKTELAKALAEALFGDEQALIRIDMSEYME 562
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 31.3 bits (71), Expect = 7.0
Identities = 16/65 (24%), Positives = 30/65 (46%)
Query: 2370 QEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQRAEDLIGKLGGEKERWSSTAKMLNEKY 2429
Q ++Q T +K+ ++++ ELD+ K++ + I KLG + + K L
Sbjct: 68 QHATTEMQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGANP 127
Query: 2430 YQLTG 2434
TG
Sbjct: 128 VTATG 132
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 30.9 bits (70), Expect = 7.6
Identities = 35/165 (21%), Positives = 65/165 (39%), Gaps = 26/165 (15%)
Query: 2098 LEKLDKAASQIAVMREEIEYLQPFLMVSAANIKELMVTVEKESAEAAIVAEGVKKDEEVA 2157
+ + + ++A +E E + AA I +E+ A+AA + E ++++
Sbjct: 25 VTDVPETEQELADEEQEFEQILEKAEAEAAQI------IEQAEAQAAAIREQIEQERAQW 78
Query: 2158 NEQ--ALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEEILSEAIPILEAAEAALNT 2215
E+ L +A EG + A QA ++A+ S + + EA I+E A+
Sbjct: 79 EEERERLIQEAK---QEGYE-----AGFQAGESEAL-SIYQSTIDEANAIVEEAKDDYEE 129
Query: 2216 LTSNDITVVKTMKSPPDIVKLVMKAVCILKGVKSERVPDAGGQLV 2260
K + P I++L + GV +A LV
Sbjct: 130 ---------KIESAQPLIIELACALAEKVIGVSLAEDKNAFQALV 165
>gnl|CDD|182657 PRK10698, PRK10698, phage shock protein PspA; Provisional.
Length = 222
Score = 30.9 bits (70), Expect = 7.8
Identities = 19/54 (35%), Positives = 33/54 (61%), Gaps = 5/54 (9%)
Query: 2344 ALRQAEEKLQLAMSALHEKQEQNRIV---QEKLQKLQNTLDAKKKELKDLQDEL 2394
ALR+ EK LA +AL EKQ+ ++ + ++ + TL KKE+ +L+++L
Sbjct: 78 ALRK--EKEDLARAALIEKQKLTDLIATLEHEVTLVDETLARMKKEIGELENKL 129
>gnl|CDD|214361 CHL00095, clpC, Clp protease ATP binding subunit.
Length = 821
Score = 31.6 bits (72), Expect = 8.1
Identities = 11/16 (68%), Positives = 11/16 (68%)
Query: 7 GPAGTGKTETTKDLAK 22
GP G GKTE TK LA
Sbjct: 546 GPTGVGKTELTKALAS 561
Score = 31.6 bits (72), Expect = 8.1
Identities = 11/16 (68%), Positives = 11/16 (68%)
Query: 1128 GPAGTGKTETTKDLAK 1143
GP G GKTE TK LA
Sbjct: 546 GPTGVGKTELTKALAS 561
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 30.7 bits (70), Expect = 8.4
Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 18/90 (20%)
Query: 2332 DKVAKIVGPKK---EALRQAEEKLQLAMSA----------LHEKQEQNRIVQEKLQKLQN 2378
+ + K P+K +A+R E +L A A L K E+ + EKL++
Sbjct: 17 ELLDKAEDPEKMLEQAIRDMESELAKARQALAQAIARQKQLERKLEEAQARAEKLEE--K 74
Query: 2379 TLDAKKKELKDLQDELDLCVKKKQRAEDLI 2408
A + +DL E +++KQ EDL
Sbjct: 75 AELALQAGNEDLAREA---LEEKQSLEDLA 101
>gnl|CDD|184804 PRK14720, PRK14720, transcript cleavage factor/unknown domain fusion
protein; Provisional.
Length = 906
Score = 31.3 bits (71), Expect = 8.5
Identities = 35/166 (21%), Positives = 61/166 (36%), Gaps = 37/166 (22%)
Query: 2206 LEAAEAALNTLTSNDITVVKTMKS-----------PPDIVKLVMK---AVCILKGVKSER 2251
++ A AL L + I V+K ++ P +K ++K LK +KSE
Sbjct: 344 IQMAITALKPLKRDHIWVLKAVEPKEELVDKFKKDIPWALKTIIKSYDNRMDLKDIKSEL 403
Query: 2252 VPDAGGQLV-EDYWGPSKKLLGDIKFLEGLTNFNKDNV--------PAAVIKRLEDEFLS 2302
VP Q + +K++L + P + ++L D+F +
Sbjct: 404 VPSLLTQSEWSKWSTKAKQILKK----NPNFGMDPKKKDEYVYRERPINLSEKLSDKFKA 459
Query: 2303 REDF----------DPEVVKKSSTAAEGLCKWVIAICKYDKVAKIV 2338
+DF V S A E L ++I +++KV V
Sbjct: 460 EKDFNKRLEIAMEYCKGAVDDSDIAEEMLEYFLIFYNEFEKVDDPV 505
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family is
found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 30.0 bits (68), Expect = 8.7
Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 9/78 (11%)
Query: 2344 ALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQR 2403
+L Q ++ +L E + ++++LQNT++ K +L+DL+ EL L ++ K+R
Sbjct: 40 SLLQQRDRDLEFRESLEETLRKLE---AEIERLQNTIERLKTQLEDLERELAL-LQAKER 95
Query: 2404 A-----EDLIGKLGGEKE 2416
+ L KL EKE
Sbjct: 96 QLEKKLKTLEQKLKNEKE 113
>gnl|CDD|182379 PRK10325, PRK10325, heat shock protein GrpE; Provisional.
Length = 197
Score = 30.4 bits (68), Expect = 8.8
Identities = 23/83 (27%), Positives = 43/83 (51%), Gaps = 6/83 (7%)
Query: 2137 EKESAEAAIVAEGVK-KDEEVANEQALAAQAMKSVAEGV-KKDEEVANEQALAAQ----A 2190
E E+ E AE V +DE++AN +A A+A +G+ + E+ N + A
Sbjct: 24 EIEAVEPEASAEQVDPRDEKIANLEAQLAEAQTRERDGILRVKAEMENLRRRTELDIEKA 83
Query: 2191 MKSECEEILSEAIPILEAAEAAL 2213
K E+ ++E +P++++ + AL
Sbjct: 84 HKFALEKFINELLPVIDSLDRAL 106
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 30.9 bits (70), Expect = 9.2
Identities = 19/66 (28%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKEL-KDLQDELDLCVKKK 2401
E ++Q E LQ ++ L + N ++L+ L+ LD KEL K LQ+ ++ KK+
Sbjct: 86 EQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKR 145
Query: 2402 QRAEDL 2407
+ ++
Sbjct: 146 ENNKNE 151
>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I. coiled-coil structure. It
has been suggested that the domains I and II from laminin
A, B1 and B2 may come together to form a triple helical
coiled-coil structure.
Length = 263
Score = 30.8 bits (70), Expect = 9.2
Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 3/65 (4%)
Query: 2343 EALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKDLQDELDLCVKKKQ 2402
L+ AEE L + QE+N + E L+ + L+ + +L DL++ LD K +
Sbjct: 152 AELKAAEELLNRVQTWFQTPQEENEGLFEALR---DDLNEYEAKLSDLRELLDEAQAKTR 208
Query: 2403 RAEDL 2407
AE L
Sbjct: 209 DAERL 213
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 31.2 bits (71), Expect = 9.8
Identities = 18/106 (16%), Positives = 48/106 (45%), Gaps = 10/106 (9%)
Query: 2330 KYDKVAKIVGPKKEALRQAEEKLQLAMSALHEKQEQNRIVQEKLQKLQNTLDAKKKELKD 2389
++ + + ++ +++ EE ++ +E+N ++ +L++L+ ++ + EL+
Sbjct: 409 TEEEERREITVYEKRIKKLEETVE-------RLEEENSELKRELEELKREIEKLESELER 461
Query: 2390 LQDELDLCVKKKQRAEDL---IGKLGGEKERWSSTAKMLNEKYYQL 2432
+ E+ V+K + I +L E E + L K +L
Sbjct: 462 FRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAEL 507
>gnl|CDD|223714 COG0641, AslB, Arylsulfatase regulator (Fe-S oxidoreductase) [General
function prediction only].
Length = 378
Score = 30.9 bits (70), Expect = 9.9
Identities = 10/66 (15%), Positives = 24/66 (36%), Gaps = 1/66 (1%)
Query: 938 DDVNMPLKEEFGAQPPIEILRQWLDHWMWYDRKDVVAVKLIEIQLMCAMGPPSTGNTVTP 997
+ + L+ A+ + L D W+ +D + ++ + L +GPP + +
Sbjct: 197 NRGDSLLEFSVTAEEYGQFLIAIFDEWVRHDVGRIF-IQNFDQLLKAWLGPPGSLCIFSE 255
Query: 998 RFSRHF 1003
Sbjct: 256 TCGDEL 261
Score = 30.9 bits (70), Expect = 9.9
Identities = 10/66 (15%), Positives = 24/66 (36%), Gaps = 1/66 (1%)
Query: 1426 DDVNMPLKEEFGAQPPIEILRQWLDHWMWYDRKDVVAVKLIEIQLMCAMGPPSTGNTVTP 1485
+ + L+ A+ + L D W+ +D + ++ + L +GPP + +
Sbjct: 197 NRGDSLLEFSVTAEEYGQFLIAIFDEWVRHDVGRIF-IQNFDQLLKAWLGPPGSLCIFSE 255
Query: 1486 RFSRHF 1491
Sbjct: 256 TCGDEL 261
>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
homologs [Posttranslational modification, protein
turnover, chaperones].
Length = 291
Score = 30.9 bits (70), Expect = 10.0
Identities = 37/166 (22%), Positives = 62/166 (37%), Gaps = 31/166 (18%)
Query: 2085 EEVLNGKNRYTVG--LEKLDKAAS--QIAVMREEIEYLQPFLMVSAANIKELM--VTVEK 2138
+E+L + E LD+AA I V+ EI+ + P ++ M +
Sbjct: 132 DELLTERRAEINAKIREILDEAADPWGIKVVDVEIKDIDP-----PEEVQAAMEKQMAAE 186
Query: 2139 ESAEAAIV-AEGVKKDEEVANEQALAAQAMKSVAEGVKKDEEVANEQALAAQAMKSECEE 2197
A I+ AEG + + E A+A +AE + E +A +A AA +
Sbjct: 187 RDKRAEILEAEGEAQAAILRAEGE--AEAAIILAEAEAEAEVIARAEADAA--------K 236
Query: 2198 ILSEAIPILEAA---------EAALNTLTSNDITVVKTMKSPPDIV 2234
I++ A+ AA E L + + VV S +
Sbjct: 237 IIAAALREAPAAPQALAQRYLEELLEIALAGNSKVVVVPNSAGGAI 282
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.136 0.408
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 133,496,602
Number of extensions: 13648776
Number of successful extensions: 16800
Number of sequences better than 10.0: 1
Number of HSP's gapped: 16527
Number of HSP's successfully gapped: 341
Length of query: 2567
Length of database: 10,937,602
Length adjustment: 113
Effective length of query: 2454
Effective length of database: 5,925,600
Effective search space: 14541422400
Effective search space used: 14541422400
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 68 (30.0 bits)