RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6936
(1032 letters)
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 134 bits (340), Expect = 7e-36
Identities = 84/213 (39%), Positives = 137/213 (64%), Gaps = 2/213 (0%)
Query: 48 RFQYFKRDADELESWIYEKLQAASDESYKET-TNLQAKIQKHQAFEAEVAAHSNAIVVLD 106
+ Q F RDADELE+W+ EK + S Y + +++A ++KH+A EAE+AAH + L+
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 107 NTGKEMINQNHFASEVIRKRLEELHRLWELLLSKLAEKGMKLQQALVLVQFIRQCDEVMF 166
G+++I + H +E I++RLEEL++ WE L E+ +L++AL L QF R D++
Sbjct: 61 ELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQ 120
Query: 167 WINDKEAFVTADEFGADLEHVEVLQRKFDEFQKDMASQEYRVTEVNQLADKLVLEGHPER 226
W+ +KEA + +++ G DLE VE L +K E ++++ + E R+ +N+LA++L+ EGHP+
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180
Query: 227 ETIVRRK-EELSEAWMRLKQLALMRQEKLFGAH 258
+ + K EEL+E W L +LA RQ+KL A
Sbjct: 181 DEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
Score = 134 bits (340), Expect = 7e-36
Identities = 78/211 (36%), Positives = 126/211 (59%), Gaps = 1/211 (0%)
Query: 261 QRFNRDADETVAWIAEKDAVLSSDEYGRDLASVQTLQRKHEGVERDLAALEDKVSTLGAE 320
Q+F RDADE AW++EK+ +LSS +YG DL SV+ L +KHE +E +LAA E++V L
Sbjct: 3 QQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNEL 62
Query: 321 ADRLCGIHSDHGDQIQAKRAEIVEYWERLTGKAKGRKQKLDESYYLHRFLADYRDLISWV 380
++L ++IQ + E+ + WE L A+ R+Q+L+E+ L +F D DL W+
Sbjct: 63 GEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWL 122
Query: 381 SDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTADAGQSLLDRNHYAADD 440
+ +A +++++L KD+ E LL++H+E + E++A E + + + LL+ H AD+
Sbjct: 123 EEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADE 182
Query: 441 -IREKLRNLENDKTHLLTLWEERRILYEQCM 470
I EKL L LL L EER+ E+ +
Sbjct: 183 EIEEKLEELNERWEELLELAEERQKKLEEAL 213
Score = 134 bits (340), Expect = 9e-36
Identities = 82/212 (38%), Positives = 139/212 (65%), Gaps = 1/212 (0%)
Query: 154 LVQFIRQCDEVMFWINDKEAFVTADEFGADLEHVEVLQRKFDEFQKDMASQEYRVTEVNQ 213
L QF+R DE+ W+++KE +++ ++G DLE VE L +K + + ++A+ E RV +N+
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61
Query: 214 LADKLVLEGHPERETIVRRKEELSEAWMRLKQLALMRQEKLFGAHEIQRFNRDADETVAW 273
L ++L+ EGHP+ E I R EEL++ W L++LA R+++L A ++Q+F RDAD+ W
Sbjct: 62 LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQW 121
Query: 274 IAEKDAVLSSDEYGRDLASVQTLQRKHEGVERDLAALEDKVSTLGAEADRL-CGIHSDHG 332
+ EK+A L+S++ G+DL SV+ L +KH+ +E +L A E ++ +L A+ L H D
Sbjct: 122 LEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDAD 181
Query: 333 DQIQAKRAEIVEYWERLTGKAKGRKQKLDESY 364
++I+ K E+ E WE L A+ R++KL+E+
Sbjct: 182 EEIEEKLEELNERWEELLELAEERQKKLEEAL 213
Score = 129 bits (325), Expect = 8e-34
Identities = 75/202 (37%), Positives = 121/202 (59%), Gaps = 2/202 (0%)
Query: 833 KFQQFERDCDETKSWVTEKLKFANDDNYL-DPTNLNGKVQKHQNFEQELNANKTRMDEIT 891
K QQF RD DE ++W++EK + + +Y D ++ ++KH+ E EL A++ R++ +
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 892 STGQELLEANHYASDKIRQRMEQIVHLWESLATATEKKGNKLQEASQQQGFNRTIEDIEL 951
G++L+E H +++I++R+E++ WE L E++ +L+EA Q F R +D+E
Sbjct: 61 ELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQ 120
Query: 952 WLSEIEGQLMSEDYGKDLTSVQNLQKKHALLEADVASHLDRIESVKAATEQFLERGHFDA 1011
WL E E L SED GKDL SV+ L KKH LE ++ +H R++S+ E+ LE GH DA
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180
Query: 1012 -DNIKAKQAAMCERYENLQRPI 1032
+ I+ K + ER+E L
Sbjct: 181 DEEIEEKLEELNERWEELLELA 202
Score = 127 bits (320), Expect = 4e-33
Identities = 75/212 (35%), Positives = 131/212 (61%), Gaps = 1/212 (0%)
Query: 366 LHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTAD 425
L +FL D +L +W+S+ + ++S+ + D+ EALL++H+ + E+ A E+ +
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61
Query: 426 AGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQCMDLQLFYRDTEQADTW 485
G+ L++ H A++I+E+L L L L EERR E+ +DLQ F+RD + + W
Sbjct: 62 LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQW 121
Query: 486 MAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDEFATKLIEGQHYAAD 545
+ ++EA L++EDLG L+SVE L+KKH++ E+ L A E ++K+L+E A +L+E H AD
Sbjct: 122 LEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDAD 181
Query: 546 D-VAQRRDLLLKRRAGLLEKSARRRAILEDSY 576
+ + ++ + L +R LLE + R+ LE++
Sbjct: 182 EEIEEKLEELNERWEELLELAEERQKKLEEAL 213
Score = 124 bits (313), Expect = 3e-32
Identities = 82/243 (33%), Positives = 132/243 (54%), Gaps = 31/243 (12%)
Query: 471 DLQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALD 530
LQ F RD ++ + W++++E LS+ D GD L+SVEAL+KKHE E LAA EE+++AL+
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 531 EFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILEDSYKFQQFEEIVEYWER 590
E +LIE H A+++ +R + L +R WE
Sbjct: 61 ELGEQLIEEGHPDAEEIQERLEELNQR------------------------------WEE 90
Query: 591 LTGKAKGRKQKLDESYYLHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQE 650
L A+ R+Q+L+E+ L +F D DL W+ + +A +++++L KD+ E LL++H+E
Sbjct: 91 LRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKE 150
Query: 651 HKGEIDAREDSFRGTADAGQSLLDRNHYAADD-IREKLRNLENDKTHLLTLWEERRILYE 709
+ E++A E + + + LL+ H AD+ I EKL L LL L EER+ E
Sbjct: 151 LEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKKLE 210
Query: 710 QCM 712
+ +
Sbjct: 211 EAL 213
Score = 122 bits (309), Expect = 1e-31
Identities = 77/226 (34%), Positives = 129/226 (57%), Gaps = 16/226 (7%)
Query: 713 DLQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALD 772
LQ F RD ++ + W++++E LS+ D GD L+SVEAL+KKHE E LAA EE+++AL+
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 773 DLLKVNTMQIKALDEFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILEDSY 832
+L +LIE H A+++ +R + L +R L E + RR LE++
Sbjct: 61 EL--------------GEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEAL 106
Query: 833 KFQQFERDCDETKSWVTEKLKFANDDNYL-DPTNLNGKVQKHQNFEQELNANKTRMDEIT 891
QQF RD D+ + W+ EK ++ D ++ ++KH+ E+EL A++ R+ +
Sbjct: 107 DLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLN 166
Query: 892 STGQELLEANHYAS-DKIRQRMEQIVHLWESLATATEKKGNKLQEA 936
+ELLE H + ++I +++E++ WE L E++ KL+EA
Sbjct: 167 ELAEELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKKLEEA 212
Score = 117 bits (296), Expect = 7e-30
Identities = 75/226 (33%), Positives = 132/226 (58%), Gaps = 15/226 (6%)
Query: 608 LHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTAD 667
L +FL D +L +W+S+ + ++S+ + D+ EALL++H+ + E+ A E+ +
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61
Query: 668 AGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQCMDLQLFYRDTEQADTW 727
G+ L++ H A++I+E+L L L L EERR E+ +DLQ F+RD + + W
Sbjct: 62 LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQW 121
Query: 728 MAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDDLLKVNTMQIKALDE 787
+ ++EA L++EDLG L+SVE L+KKH++ E+ L A E ++K+L++L
Sbjct: 122 LEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNEL------------- 168
Query: 788 FATKLIEGQHYAADD-VAQRRDLLLKRRAGLLEKSARRRAILEDSY 832
A +L+E H AD+ + ++ + L +R LLE + R+ LE++
Sbjct: 169 -AEELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
Score = 78.3 bits (193), Expect = 4e-16
Identities = 49/140 (35%), Positives = 84/140 (60%), Gaps = 2/140 (1%)
Query: 15 ANDIQERREQVLNRYADFKSEARSKREKLEDSRRFQYFKRDADELESWIYEKLQAASDES 74
A +IQER E++ R+ + + A +R++LE++ Q F RDAD+LE W+ EK A + E
Sbjct: 74 AEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASED 133
Query: 75 Y-KETTNLQAKIQKHQAFEAEVAAHSNAIVVLDNTGKEMINQNHFAS-EVIRKRLEELHR 132
K+ +++ ++KH+ E E+ AH + L+ +E++ + H + E I ++LEEL+
Sbjct: 134 LGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNE 193
Query: 133 LWELLLSKLAEKGMKLQQAL 152
WE LL E+ KL++AL
Sbjct: 194 RWEELLELAEERQKKLEEAL 213
>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat. Spectrin repeat-domains are
found in several proteins involved in cytoskeletal
structure. These include spectrin, alpha-actinin and
dystrophin. The sequence repeat used in this family is
taken from the structural repeat in reference. The
spectrin domain- repeat forms a three helix bundle. The
second helix is interrupted by proline in some
sequences. The repeats are defined by a characteristic
tryptophan (W) residue at position 17 in helix A and a
leucine (L) at 2 residues from the carboxyl end of helix
C. Although the domain occurs in ultiple repeats along
sequences, the domains are actually stable on their own
- ie they act, biophysically, like domains rather than
repeats that along function when aggregated.
Length = 105
Score = 86.2 bits (214), Expect = 4e-20
Identities = 45/104 (43%), Positives = 73/104 (70%)
Query: 471 DLQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALD 530
LQ F+RD + ++W+ ++EA LS+ED G L+SV+AL+KKH+ E LAA +++++AL+
Sbjct: 2 LLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALN 61
Query: 531 EFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILED 574
E A KLI HYA++++ +R + L +R LLE +A R+ LE+
Sbjct: 62 ELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 80.8 bits (200), Expect = 3e-18
Identities = 47/105 (44%), Positives = 70/105 (66%)
Query: 258 HEIQRFNRDADETVAWIAEKDAVLSSDEYGRDLASVQTLQRKHEGVERDLAALEDKVSTL 317
+Q+F RDAD+ +WI EK+A+LSS++YG+DL SVQ L +KH+ +E +LAA +D+V L
Sbjct: 1 LLLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEAL 60
Query: 318 GAEADRLCGIHSDHGDQIQAKRAEIVEYWERLTGKAKGRKQKLDE 362
A++L ++IQ + E+ E WE+L A RKQKL+E
Sbjct: 61 NELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 78.9 bits (195), Expect = 1e-17
Identities = 42/101 (41%), Positives = 67/101 (66%)
Query: 154 LVQFIRQCDEVMFWINDKEAFVTADEFGADLEHVEVLQRKFDEFQKDMASQEYRVTEVNQ 213
L QF R D++ WI +KEA ++++++G DLE V+ L +K + ++A+ + RV +N+
Sbjct: 3 LQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNE 62
Query: 214 LADKLVLEGHPERETIVRRKEELSEAWMRLKQLALMRQEKL 254
LA+KL+ EGH E I R EEL+E W +L +LA R++KL
Sbjct: 63 LAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKL 103
Score = 78.1 bits (193), Expect = 3e-17
Identities = 33/105 (31%), Positives = 62/105 (59%), Gaps = 1/105 (0%)
Query: 832 YKFQQFERDCDETKSWVTEKLKFANDDNYL-DPTNLNGKVQKHQNFEQELNANKTRMDEI 890
QQF RD D+ +SW+ EK + ++Y D ++ ++KH+ E EL A++ R++ +
Sbjct: 1 LLLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEAL 60
Query: 891 TSTGQELLEANHYASDKIRQRMEQIVHLWESLATATEKKGNKLQE 935
++L+ HYAS++I++R+E++ WE L ++ KL+E
Sbjct: 61 NELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 76.6 bits (189), Expect = 7e-17
Identities = 45/118 (38%), Positives = 73/118 (61%), Gaps = 14/118 (11%)
Query: 713 DLQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALD 772
LQ F+RD + ++W+ ++EA LS+ED G L+SV+AL+KKH+ E LAA ++
Sbjct: 2 LLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQD------ 55
Query: 773 DLLKVNTMQIKALDEFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILED 830
+++AL+E A KLI HYA++++ +R + L +R LLE +A R+ LE+
Sbjct: 56 --------RVEALNELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 75.0 bits (185), Expect = 3e-16
Identities = 48/105 (45%), Positives = 70/105 (66%), Gaps = 1/105 (0%)
Query: 47 RRFQYFKRDADELESWIYEKLQAASDESY-KETTNLQAKIQKHQAFEAEVAAHSNAIVVL 105
Q F RDAD+LESWI EK S E Y K+ ++QA ++KH+A EAE+AAH + + L
Sbjct: 1 LLLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEAL 60
Query: 106 DNTGKEMINQNHFASEVIRKRLEELHRLWELLLSKLAEKGMKLQQ 150
+ +++I + H+ASE I++RLEEL+ WE LL AE+ KL++
Sbjct: 61 NELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 73.5 bits (181), Expect = 9e-16
Identities = 40/95 (42%), Positives = 58/95 (61%)
Query: 938 QQQGFNRTIEDIELWLSEIEGQLMSEDYGKDLTSVQNLQKKHALLEADVASHLDRIESVK 997
Q F R +D+E W+ E E L SEDYGKDL SVQ L KKH LEA++A+H DR+E++
Sbjct: 2 LLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALN 61
Query: 998 AATEQFLERGHFDADNIKAKQAAMCERYENLQRPI 1032
E+ + GH+ ++ I+ + + ER+E L
Sbjct: 62 ELAEKLIAEGHYASEEIQERLEELNERWEQLLELA 96
Score = 67.7 bits (166), Expect = 1e-13
Identities = 32/103 (31%), Positives = 59/103 (57%)
Query: 366 LHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTAD 425
L +F D DL SW+ + +A++S+++ KD+ +ALL++H+ + E+ A +D +
Sbjct: 3 LQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNE 62
Query: 426 AGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQ 468
+ L+ HYA+++I+E+L L LL L ER+ E+
Sbjct: 63 LAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 67.7 bits (166), Expect = 1e-13
Identities = 32/103 (31%), Positives = 59/103 (57%)
Query: 608 LHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTAD 667
L +F D DL SW+ + +A++S+++ KD+ +ALL++H+ + E+ A +D +
Sbjct: 3 LQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNE 62
Query: 668 AGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQ 710
+ L+ HYA+++I+E+L L LL L ER+ E+
Sbjct: 63 LAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats.
Length = 101
Score = 81.6 bits (202), Expect = 1e-18
Identities = 44/101 (43%), Positives = 66/101 (65%)
Query: 473 QLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDEF 532
Q F RD ++ + W+ ++E L++EDLG L+SVEAL+KKHE FE L A EE+++AL+E
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 533 ATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILE 573
+LIE H A+++ +R + L +R L E + RR LE
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 80.1 bits (198), Expect = 4e-18
Identities = 40/101 (39%), Positives = 63/101 (62%)
Query: 261 QRFNRDADETVAWIAEKDAVLSSDEYGRDLASVQTLQRKHEGVERDLAALEDKVSTLGAE 320
Q+F RDADE AW+ EK+ +L+S++ G+DL SV+ L +KHE E +L A E++V L
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 321 ADRLCGIHSDHGDQIQAKRAEIVEYWERLTGKAKGRKQKLD 361
++L ++I+ + E+ E WE L A+ R+QKL+
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 79.7 bits (197), Expect = 7e-18
Identities = 42/99 (42%), Positives = 66/99 (66%)
Query: 156 QFIRQCDEVMFWINDKEAFVTADEFGADLEHVEVLQRKFDEFQKDMASQEYRVTEVNQLA 215
QF+R DE+ W+ +KE + +++ G DLE VE L +K + F+ ++ + E RV +N+L
Sbjct: 2 QFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELG 61
Query: 216 DKLVLEGHPERETIVRRKEELSEAWMRLKQLALMRQEKL 254
++L+ EGHP+ E I R EEL+E W LK+LA R++KL
Sbjct: 62 EQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 77.0 bits (190), Expect = 6e-17
Identities = 39/91 (42%), Positives = 57/91 (62%)
Query: 940 QGFNRTIEDIELWLSEIEGQLMSEDYGKDLTSVQNLQKKHALLEADVASHLDRIESVKAA 999
Q F R +++E WL E E L SED GKDL SV+ L KKH EA++ +H +R+E++
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 1000 TEQFLERGHFDADNIKAKQAAMCERYENLQR 1030
EQ +E GH DA+ I+ + + ER+E L+
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKE 91
Score = 76.6 bits (189), Expect = 7e-17
Identities = 32/101 (31%), Positives = 61/101 (60%), Gaps = 1/101 (0%)
Query: 835 QQFERDCDETKSWVTEKLKFANDDNY-LDPTNLNGKVQKHQNFEQELNANKTRMDEITST 893
QQF RD DE ++W+ EK + ++ D ++ ++KH+ FE EL A++ R++ +
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 894 GQELLEANHYASDKIRQRMEQIVHLWESLATATEKKGNKLQ 934
G++L+E H +++I +R+E++ WE L E++ KL+
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 75.1 bits (185), Expect = 3e-16
Identities = 42/101 (41%), Positives = 64/101 (63%), Gaps = 1/101 (0%)
Query: 50 QYFKRDADELESWIYEKLQAASDESY-KETTNLQAKIQKHQAFEAEVAAHSNAIVVLDNT 108
Q F RDADELE+W+ EK Q + E K+ +++A ++KH+AFEAE+ AH + L+
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 109 GKEMINQNHFASEVIRKRLEELHRLWELLLSKLAEKGMKLQ 149
G+++I + H +E I +RLEEL+ WE L E+ KL+
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 72.7 bits (179), Expect = 2e-15
Identities = 44/115 (38%), Positives = 67/115 (58%), Gaps = 14/115 (12%)
Query: 715 QLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDDL 774
Q F RD ++ + W+ ++E L++EDLG L+SVEAL+KKHE FE L A EE+++AL++L
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 775 LKVNTMQIKALDEFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILE 829
+LIE H A+++ +R + L +R L E + RR LE
Sbjct: 61 --------------GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 71.2 bits (175), Expect = 5e-15
Identities = 31/101 (30%), Positives = 58/101 (57%)
Query: 367 HRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTADA 426
+FL D +L +W+ + + ++++++L KD+ EALL++H+ + E++A E+ +
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 427 GQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYE 467
G+ L++ H A++I E+L L L L EERR E
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 71.2 bits (175), Expect = 5e-15
Identities = 31/101 (30%), Positives = 58/101 (57%)
Query: 609 HRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTADA 668
+FL D +L +W+ + + ++++++L KD+ EALL++H+ + E++A E+ +
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 669 GQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYE 709
G+ L++ H A++I E+L L L L EERR E
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 36.6 bits (85), Expect = 0.089
Identities = 138/884 (15%), Positives = 300/884 (33%), Gaps = 121/884 (13%)
Query: 168 INDKEAFVTADEFGADLEHVEVLQRKFDEFQKDMASQEYRVTEVNQLADKLVLEGHPERE 227
+ + E + E +E LQ + E ++++ + E+ + ++L LE E E
Sbjct: 222 LRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLE-VSELE 280
Query: 228 TIVRRKEELSEAWMRLKQL-ALMRQEKLFGAHEIQRFNRDADETVAWIAE---------K 277
EEL + L + + Q+K + R +E A + E +
Sbjct: 281 ---EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAE 337
Query: 278 DAVLSSDEYGRDLASVQTLQRKHEGVERDLAALEDKVSTLGAEADRLCGIHSDHGDQIQA 337
+ ++ +++L+ + E +E +L LE ++ L + + L + QI +
Sbjct: 338 ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIAS 397
Query: 338 KRAEIVEYWERLTGKAKGRKQKLDESYYLHRFLADYRDLISWVSDMKAIISADELAKDVA 397
EI RL + R+++L + + +L EL ++
Sbjct: 398 LNNEIERLEARLER-LEDRRERLQQEIEELLKKLEEAEL-------------KELQAELE 443
Query: 398 GAEALLERHQEHKGEIDAREDSFRGTADAGQSLLDRNHYAADDIREKLRNLENDKTHLLT 457
E LE QE ++ + R + + LD ++ +L +LE + +L
Sbjct: 444 ELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG 503
Query: 458 LWEERRILYEQCMDLQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEK 517
E + L + L L + LI E +E
Sbjct: 504 FSEGVKAL--------------------------LKNQSGLSGILGVLSELISVDEGYEA 537
Query: 518 SLAAQEEKIKALDEFATKLI-EGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILEDSY 576
++ A AL ++ E + A +A + L R L S + I +
Sbjct: 538 AIEA------ALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDR 591
Query: 577 KFQQFEEIVEYWERLTGKAKGRKQKLDESYYLHRFLAD-----------YRDLISWVSDM 625
+ + E + K + +K + D R V+
Sbjct: 592 EILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLD 651
Query: 626 KAIISADELA--KDVAGAEALLERHQE---HKGEIDAREDSFRGTADAGQSLLDRNHYAA 680
++ + ++LER +E + +I+ E+ A L
Sbjct: 652 GDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELE 711
Query: 681 DDIREKLRNLENDKTHLLTLWEERRILYEQCMDLQLFYRDTEQADTWMAKQEAFLSNEDL 740
+++ + + LE + L ++ L + L+ + T + + L E L
Sbjct: 712 EELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEEL-EERL 770
Query: 741 GDSLDSVEALIKKHEDFEKSLAAQEEKIKALDDLLKVNTMQIKALDEFATKLIEGQHYAA 800
++ + + + E+ E + +E++KAL + L ++ L+E A L E
Sbjct: 771 EEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERL---- 826
Query: 801 DDVAQRRDLLLKRRAGLLEKSARRRA--ILEDSYKFQQFERDCDETKSWVTEKLKFANDD 858
+ +RR +RR LE+ + I + + ++ E +E +S + L
Sbjct: 827 -ESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLN----- 880
Query: 859 NYLDPTNLNGKVQKHQNFEQELNANKTRMDEITSTGQELLEANHYASDKIRQRMEQIVHL 918
+ + E+ L ++ ++E++ +EL ++ + E++ L
Sbjct: 881 -------------ERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQL 927
Query: 919 WESLATATEKKGNKLQEASQQQGFNRTIEDIELWLSEIEGQLMSEDYGKDLTSVQNLQKK 978
L +++ + Q+ + E + + + K + +++
Sbjct: 928 ELRLEGL------EVRIDNLQERLS------EEY--SLTLEEAEALENKIEDDEEEARRR 973
Query: 979 HALLEADVAS----HLDRIESVKAATEQFLERGHFDADNIKAKQ 1018
LE + +L IE + E++ D +AK+
Sbjct: 974 LKRLENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKE 1017
Score = 35.8 bits (83), Expect = 0.13
Identities = 45/283 (15%), Positives = 106/283 (37%), Gaps = 13/283 (4%)
Query: 76 KETTNLQAKIQKHQAFEAEVAAHSNAIVVLDNTGKEMINQNHFASEVIRKRLEELHRLWE 135
+E L+ KI++ E ++A A+ L +E+ + + + L
Sbjct: 677 REIEELEEKIEE---LEEKIAELEKALAELRKELEELEEELEQLRKELE-ELSRQISALR 732
Query: 136 LLLSKLAEKGMKLQQALVLVQFIRQCDEVMFWINDKEAFVTADEFGADLEHVEVLQRKFD 195
L++L + +L++ + + E ++ +E +E L+ + +
Sbjct: 733 KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIE 792
Query: 196 EFQKDMASQEYRVTEVNQLADKLVLEGHPERETIVRRKEELSEAWMRL----KQLALMRQ 251
+ ++++ + + E+ L E RE + + ++ RL +Q+ + +
Sbjct: 793 QLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852
Query: 252 EKLFGAHEIQRFNRDADETVAWIAEKDAVLSSDEYGRDLASVQTLQRKHEGVERDLAALE 311
+ A EI+ +E E + +E ++ L+ + E + +L LE
Sbjct: 853 DIESLAAEIEELEELIEE-----LESELEALLNERASLEEALALLRSELEELSEELRELE 907
Query: 312 DKVSTLGAEADRLCGIHSDHGDQIQAKRAEIVEYWERLTGKAK 354
K S L E + L + +++ I ERL+ +
Sbjct: 908 SKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYS 950
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 35.8 bits (83), Expect = 0.14
Identities = 59/338 (17%), Positives = 127/338 (37%), Gaps = 38/338 (11%)
Query: 10 KILETANDIQERREQVLNRYADFKSEARSKREKLEDSR----RFQYFKRDADELESWIYE 65
+ E A ER E+ L R D E + EKLE R+Q K + ELE +
Sbjct: 172 ERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLL 231
Query: 66 KLQAASDESYKETTNLQAKIQK-HQAFEAEVAAHSNAIVVLDNTGKEMINQNHFASEVIR 124
+ +E +++++ + + E+ I L K + + +R
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEEL----KSELEE-------LR 280
Query: 125 KRLEELHRLWELLLSKLAEKGMKLQQALVLVQFIRQCDEVMFWINDKEAFVTADEFGADL 184
+ LEEL L ++ E ++ ++ + N+ E E
Sbjct: 281 EELEELQEELLELKEEIEELEGEISLLRERLEELE---------NELEELEERLE--ELK 329
Query: 185 EHVEVLQRKFDEFQKDMASQEYRVTEVNQLADKLVLEGHPERETIVRRKEELSEAWMRLK 244
E +E L+ + +E + + E + E+ + ++L + E + E L E L+
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELE 389
Query: 245 QLALMRQEKLFGAHEIQRFNRDADETVAWIAEKDAVLSSDEYGRDLASVQTLQRKHEGVE 304
+ +L E++R +E + ++E+ L ++ L+ + E ++
Sbjct: 390 AELAEIRNEL---EELKREIESLEERLERLSERLEDL--------KEELKELEAELEELQ 438
Query: 305 RDLAALEDKVSTLGAEADRLCGIHSDHGDQIQAKRAEI 342
+L L +++ L + + L + ++ + E+
Sbjct: 439 TELEELNEELEELEEQLEELRDRLKELERELAELQEEL 476
Score = 34.3 bits (79), Expect = 0.43
Identities = 61/353 (17%), Positives = 134/353 (37%), Gaps = 26/353 (7%)
Query: 565 SARRRAILEDSYKFQQFEEIVEYWERLTGKAKGRKQKLDESYYLHRFLADYRDLISWVSD 624
S +R+ L + ++ EE + E K + + L L +L + +
Sbjct: 656 SRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNE--LRSLEDLLEELRRQLEE 713
Query: 625 MKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTADAGQSLLDRNHYAADDIR 684
++ +EL +++A E LE+ Q E++ + + Q L+ + +
Sbjct: 714 LE--RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLE 771
Query: 685 EKLRNLENDKTHLLT----LWEERRILYEQCMDLQLFYRDTEQADTWMAKQEAFLSNE-- 738
E L L+ + L L EE L E+ + + E+ + ++ L E
Sbjct: 772 EALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831
Query: 739 -----------DLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDDLLKVNTMQIKALDE 787
L + + +E L K+ E+ ++ L E + + L+D LK + + L+
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE- 890
Query: 788 FATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILEDSYKFQQFERDCDETKSW 847
+ + ++ + + L +R L K R L + + + E +
Sbjct: 891 ---EELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETEL 947
Query: 848 VTEKLKFANDDNYLDPTNLNGKVQKHQNFEQELNANKTRMDEITSTGQELLEA 900
E + + L P NL +++++ E+ K++ +++ ++LLE
Sbjct: 948 EREIERLEEEIEALGPVNLRA-IEEYEEVEERYEELKSQREDLEEAKEKLLEV 999
Score = 33.9 bits (78), Expect = 0.49
Identities = 57/339 (16%), Positives = 122/339 (35%), Gaps = 26/339 (7%)
Query: 225 ERETIVRRKEELSEAWMRLKQLALMRQEKLF----GAHEIQRFNRDADETVAWIAEKDAV 280
E + + E+L E LK ++ L E++R + +A + E+
Sbjct: 675 ELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQ 734
Query: 281 LSSD------EYGRDLASVQTLQRKHEGVERDLAALEDKVSTLGAEADRLCGIHSDHGDQ 334
L S E ++ LQ + E +E +L +LE+ ++ L E + L ++
Sbjct: 735 LQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEE 794
Query: 335 IQAKRAEIVEYWERLTGKAKGRKQKLDESYYLHRFLAD-YRDLISWVSDMKAIISA-DEL 392
++ E+ E RL + + L + + + ++ + + +EL
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854
Query: 393 AKDVAGAEALLERHQEHKGEIDAREDSFRGTADAGQSLLDRNHYAADDIREKLRNLENDK 452
K++ + LE + K E++ + + L +++E++ L
Sbjct: 855 EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERL 914
Query: 453 THLLT----LWEERRILYEQCMDLQLFYRDTEQADTWMAKQEAFLSN---------EDLG 499
L L E L E+ + +TE + + E + E+
Sbjct: 915 EELEAKLERLEVELPELEEELEEEYEDTLETELERE-IERLEEEIEALGPVNLRAIEEYE 973
Query: 500 DSLDSVEALIKKHEDFEKSLAAQEEKIKALDEFATKLIE 538
+ + E L + ED E++ E I+ LD+ + +
Sbjct: 974 EVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFK 1012
>gnl|CDD|220980 pfam11102, DUF2886, Protein of unknown function (DUF2886). Some
members in this bacterial family of proteins are
annotated as YjbF however the function is unknown.
Length = 197
Score = 33.5 bits (77), Expect = 0.34
Identities = 16/70 (22%), Positives = 25/70 (35%), Gaps = 9/70 (12%)
Query: 149 QQALVLVQFIRQCDEVMFWIN-DKEAFVTAD-------EFGADLEHVEVLQRKFDEFQKD 200
QA V++ F Q + + W++ DK VT + G DL E +
Sbjct: 48 PQAFVVLAFAEQNGDQLKWVSADKAMLVTRNGRLVKTRGLGNDLMAAENSPLL-ALLRAG 106
Query: 201 MASQEYRVTE 210
Q +R
Sbjct: 107 RTGQAWRRHY 116
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 34.4 bits (79), Expect = 0.39
Identities = 43/216 (19%), Positives = 76/216 (35%), Gaps = 15/216 (6%)
Query: 366 LHRFLADYRDLISWVSD------MKAIISADELAKDVAGAEALLERHQEHKGEIDAREDS 419
L FL YR + A D+L++D+ A LE +E + E+DAR ++
Sbjct: 249 LRNFLQRYRRYARTMLRRRATRLRSAQTQYDQLSRDLGRARDELETAREEERELDARTEA 308
Query: 420 FRGTADAGQSLLD--RNHYAADDIREKLRNLENDKTHLLTLWEE-RRILYEQCMDLQLFY 476
ADA ++ L+ + A D E L D L + R+ + E L+
Sbjct: 309 LEREADALRTRLEALQGSPAYQDAEE-LERARADAEALQAAAADARQAIREAESRLEEER 367
Query: 477 RDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDEFATKL 536
R ++ + E L + + A + E A ++++ L +
Sbjct: 368 RRLDEEAGRLDDAERELRAAREQLARAAERAGLSPAHTAEPDAALAAQELQELGALDARR 427
Query: 537 IEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAIL 572
AD V +R + + + R A
Sbjct: 428 -----QDADRVIAQRSEQVALLRRRDDVADRAEATH 458
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and C-terminal
domains of this protein are well conserved, but the
central hinge region is skewed in composition and highly
divergent [Cellular processes, Cell division, DNA
metabolism, Chromosome-associated proteins].
Length = 1164
Score = 33.9 bits (78), Expect = 0.58
Identities = 60/374 (16%), Positives = 137/374 (36%), Gaps = 55/374 (14%)
Query: 642 EALLERHQEHKGEIDAREDSFRGTADAGQSLLDRNHYAADDIREKLRNLENDKTHLLTLW 701
A L+R +E + S + ++ LD D K+ +E + L
Sbjct: 673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE--- 729
Query: 702 EERRILYEQCMDLQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSL 761
+E L E+ +L+ EQ +E + + ++ E +
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQE----------------------IENVKSELKELEARI 767
Query: 762 AAQEEKIKALDDLLKVNTMQIKALDEFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKS 821
EE + L++ +N ++ + + I+ + ++ R + L+ L +
Sbjct: 768 EELEEDLHKLEE--ALNDLEARLSHSRIPE-IQAELSKLEEEVSRIEARLREIEQKLNRL 824
Query: 822 ARRRAILEDSYK-FQQFERDCDETKSWVTEKLKFANDDNYLDPTNLNGKVQKHQNFEQEL 880
+ LE + Q+ D E + +++ ++ L ++++ + ++L
Sbjct: 825 TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI----ENLNGKKEELEEELEELEAALRDL 880
Query: 881 NANKTRMDEITSTGQELLEANHYASDKIRQRMEQIVHLWESLATATEKKGNKLQEASQQQ 940
+ R+ ++ EL KI + QI + L+ K +E S+ +
Sbjct: 881 ES---RLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIE 937
Query: 941 GFNR----------TIEDIELWLSEIEGQLMS---------EDYGKDLTSVQNLQKKHAL 981
++ED++ L +E ++ + ++Y + L + L++K A
Sbjct: 938 DPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAK 997
Query: 982 LEADVASHLDRIES 995
LE + + L+RIE
Sbjct: 998 LEEERKAILERIEE 1011
Score = 32.7 bits (75), Expect = 1.3
Identities = 52/331 (15%), Positives = 118/331 (35%), Gaps = 44/331 (13%)
Query: 34 SEARSKREKLEDSR----RFQYFKRDADELESWI----YEKLQAASDESYKETTNLQAKI 85
E R + E+L R R+Q ++ E E + E L+ + ++ +L+ ++
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEEL 253
Query: 86 QKHQAFEAEVAAHSNAI-VVLDNTGKEMINQNHFASEVIRKRLEELHRLWELLLSKLAEK 144
+K +E+ I +L+ K++ + +++++ EL L +AEK
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313
Query: 145 GMKLQQALVLVQFIRQCDEVMFWINDKEAFVTADEFGADLEHVEVLQRKFDEFQKDMASQ 204
+L+ A ++ E L +E L+R+ +E +K
Sbjct: 314 ERELEDA------------------EERLAKLEAEIDKLLAEIEELEREIEEERKRRDKL 355
Query: 205 EYRVTEVNQLADKLVLEGHPERETIVRRKEELSEAWMRLKQLALMRQEKLFGA-HEIQRF 263
E+ + E E + EE+ + + + +EKL EI
Sbjct: 356 TEEYAELKE-----------ELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINEL 404
Query: 264 NRDADETVAWIAEKDAVLSSDEYGRDLASVQTLQRKHEGVERDLAALEDKVSTLGAEADR 323
R+ D + L+ A++ ++ K +E + ++ + ++
Sbjct: 405 KRELDRLQEELQRLSEELADLN-----AAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ 459
Query: 324 LCGIHSDHGDQIQAKRAEIVEYWERLTGKAK 354
L S + ++ + E + L+ +
Sbjct: 460 LAADLSKYEQELYDLKEEYDRVEKELSKLQR 490
Score = 32.0 bits (73), Expect = 2.0
Identities = 60/365 (16%), Positives = 129/365 (35%), Gaps = 41/365 (11%)
Query: 192 RKFDEFQKDMASQEYRVTEVNQLADKLVLE-GHPERETIVRRKEELSEAWMRLKQLALMR 250
+ +E ++++ + + E Q ++L E ER + +++ E + LK+ +
Sbjct: 177 EELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALE 236
Query: 251 QEKLFGAHEIQR-------FNRDADETVAWIAEKDAVLSS-DEYGRDLASVQ--TLQRKH 300
++K ++ + E + E + +L ++ +DL + ++ K
Sbjct: 237 RQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKI 296
Query: 301 EGVERDLAALEDKVSTLGAEADRLCGIHSDHGDQIQAKRAEIVEYWERLTGKAKGRKQKL 360
+E ++A+LE ++ E + + +I AEI E + + K R +
Sbjct: 297 GELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356
Query: 361 DESYYLHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSF 420
+E L L D R + V D + + DEL E L E K E+D ++
Sbjct: 357 EEYAELKEELEDLRAELEEV-DKEFAETRDELKDYREKLEKLKREINELKRELDRLQE-- 413
Query: 421 RGTADAGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQCMDLQLFYRDTE 480
L R D+ + +E L EE ++ +
Sbjct: 414 ---------ELQRLSEELADLNAAIAGIEAKINEL----EEE--------------KEDK 446
Query: 481 QADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDEFATKLIEGQ 540
+ + + DL + L ++++ EK L+ + ++ + A E
Sbjct: 447 ALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERV 506
Query: 541 HYAAD 545
Sbjct: 507 RGGRA 511
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 33.9 bits (77), Expect = 0.65
Identities = 61/352 (17%), Positives = 125/352 (35%), Gaps = 47/352 (13%)
Query: 658 REDSFRGTADAGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQCMDLQLF 717
R D G A QS++D ++R KL+ + D L EE+ L
Sbjct: 724 RRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLG-------- 775
Query: 718 YRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDDLLKV 777
T M ++E S + + +E + +D E+ +A Q K++ D V
Sbjct: 776 --------TIMPEEE---SAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDRTV 824
Query: 778 NTMQIKALD------------EFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRR 825
+ + + E KLI+ Q + + + L + + RR+
Sbjct: 825 QQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQ 884
Query: 826 AILED----SYKFQQFERDCDETKSWVTEKLKFANDDNYLDPTNLNGKVQKHQNFEQELN 881
E S + Q R+ + K + F D ++ K ++ + ++N
Sbjct: 885 QFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVN 944
Query: 882 ANKTRMDEITSTGQELLEANHYASDKIRQRMEQIVHLWESLATATEKKGNKLQEASQQQG 941
K ++ I +++ N K ++ TE Q ++
Sbjct: 945 DIKEKVKNIHGYMKDIE--NKIQDGKDDYLKQK----------ETELNTVNAQLEECEKH 992
Query: 942 FNRTIEDIELWLSEIEGQLMSEDYGKDLTSVQNLQKKHALLEADVASHLDRI 993
+ ED+ L +I+ Q + E + +D +++ + + +E ++ HL +
Sbjct: 993 QEKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEM 1044
Score = 31.9 bits (72), Expect = 2.1
Identities = 44/219 (20%), Positives = 80/219 (36%), Gaps = 26/219 (11%)
Query: 416 REDSFRGTADAGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQCMD---- 471
R D G A QS++D ++R KL+ + D L EE+ L M
Sbjct: 724 RRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEES 783
Query: 472 ----------LQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAA 521
++ F + + + +A+Q A L DL +V+ + ++ ++ + L
Sbjct: 784 AKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLD---RTVQQVNQEKQEKQHELDT 840
Query: 522 QEEKIKALDEFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILEDSYKFQQF 581
KI E KLI+ Q + + + L + + RR+ E Q
Sbjct: 841 VVSKI----ELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEE-----QLV 891
Query: 582 EEIVEYWERLTGKAKGRKQKLDESYYLHRFLADYRDLIS 620
E E + ++Q +L + + +LIS
Sbjct: 892 ELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELIS 930
>gnl|CDD|219134 pfam06683, DUF1184, Protein of unknown function (DUF1184). This
family contains a number of hypothetical proteins of
unknown function from Arabidopsis thaliana.
Length = 191
Score = 32.1 bits (72), Expect = 0.85
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 4/68 (5%)
Query: 867 NGKVQKHQNFEQELNANKTRMDEITSTGQELLEANHYASDKIRQRMEQIVHLWESLATAT 926
NGKV K FE N +++E + +++ EAN +A ++I+ I+HLW+SL AT
Sbjct: 121 NGKVVKRSLFEH-CNQELKKLEERLRSAKDVSEANGFAREEIKS---NILHLWKSLFEAT 176
Query: 927 EKKGNKLQ 934
+ N +
Sbjct: 177 PEVRNSNK 184
>gnl|CDD|180814 PRK07057, sdhA, succinate dehydrogenase flavoprotein subunit;
Reviewed.
Length = 591
Score = 32.5 bits (74), Expect = 1.3
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 4/90 (4%)
Query: 258 HEIQRFNRDA-DETVAWIAEKDAVLSSDEYGRDLA-SVQTLQRKHEGVERDLAALEDKVS 315
HE + DA D ++A +A+ D +S EY +D+A ++ +KH GV R A L++ V
Sbjct: 430 HEHKPLPADAADFSLARLAKLDKS-TSGEYAQDVANDIRATMQKHAGVFRTQALLDEGVE 488
Query: 316 TLGAEADRLCGIH-SDHGDQIQAKRAEIVE 344
+ A A+R+ IH D R E +E
Sbjct: 489 QILALAERVENIHLKDKSKVFNTARVEALE 518
>gnl|CDD|227289 COG4953, PbpC, Membrane carboxypeptidase/penicillin-binding protein
PbpC [Cell envelope biogenesis, outer membrane].
Length = 733
Score = 32.4 bits (74), Expect = 1.4
Identities = 24/101 (23%), Positives = 39/101 (38%), Gaps = 9/101 (8%)
Query: 18 IQERREQVLNRYADFK--SEARSKREKLED-SRRFQYFKRDADELESWIYEKLQAASDES 74
+ R +VL R A SE +KR +LE R + A + Q +
Sbjct: 209 AEAARNRVLTRMAVQGVWSEEEAKRARLEPVPNRRRQLPFLAPHFAFRL--LNQYPKGDI 266
Query: 75 YKET--TNLQAKIQK--HQAFEAEVAAHSNAIVVLDNTGKE 111
T NLQ ++++ +S A++V+DN E
Sbjct: 267 IVTTLDANLQRRLEEQARDYASRLGPRNSAAVLVVDNRTME 307
>gnl|CDD|227510 COG5183, SSM4, Protein involved in mRNA turnover and stability [RNA
processing and modification].
Length = 1175
Score = 32.6 bits (74), Expect = 1.5
Identities = 23/103 (22%), Positives = 47/103 (45%), Gaps = 13/103 (12%)
Query: 782 IKALDEFATKLIE--GQHYAADDVAQR--RDLLLKRRAGLLEKSARRRAILEDSYKFQQF 837
+ D ++ E G+ + D+ +R R +L+ RA +LE +R A LE + QQ
Sbjct: 189 VNREDCLLNQIFEKFGERLSDKDLMRRLRRRMLMNPRA-ILESISRESAQLERNTARQQG 247
Query: 838 ERDCDETKSWVTEKLKFANDDNYLDPTNLNGKVQK---HQNFE 877
E + + ++ +++N ++P + N + +N E
Sbjct: 248 EHARENGRDLSSD-----SNNNVINPVSDNVPSRDMNDSRNVE 285
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 32.3 bits (74), Expect = 1.7
Identities = 90/500 (18%), Positives = 180/500 (36%), Gaps = 85/500 (17%)
Query: 336 QAKRAEIVEYWERLTGKAKGRKQKLDESYYLHRFLADYRDLISWVSDMKAIISADELAKD 395
+++ AE+ E ER + + ++ DE L ++ + + ++A I ++L +
Sbjct: 212 ESELAELDEEIERYEEQREQARETRDE---ADEVLEEHEERREELETLEAEI--EDLRET 266
Query: 396 VAGAEALLERHQEHKGEIDAREDSFRGTADAGQSLLDRNHYAADDIREKLRNLENDKTHL 455
+A E E E ++ R + D + + A+ + + LE+
Sbjct: 267 IAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDE-- 324
Query: 456 LTLWEERRILYEQCMDLQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDF 515
E R L E + Q E+A++ +++A E + + L + E+
Sbjct: 325 ----ELRDRLEECRVAAQAH---NEEAES--LREDADDLEERAEELREEAAELESELEEA 375
Query: 516 EKSLAAQEEKIKALDEFATKLIEGQHYAA---DDVAQRRDLLLKRRAGLLEKSARRRAIL 572
+++ + E+I+ L+E +L E A + + L + R L E+ A A L
Sbjct: 376 REAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATL 435
Query: 573 EDSYKFQQFEEIVEYWERL--TGKAKGRKQKLDESYYLHRFLADYRDLISWVSDMKAIIS 630
+ E VE E L GK Q ++ S ++ + + R+ + +
Sbjct: 436 RTA------RERVEEAEALLEAGKCPECGQPVEGSPHVET-IEEDRERVEELEAELE--- 485
Query: 631 ADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTADAGQSLLDRNHYAADDIREKLRNL 690
+L ++V E LER ++ E + R + + + L+ ++ RE+ L
Sbjct: 486 --DLEEEVEEVEERLERAEDLV-EAEDRIERLEERREDLEELIAERRETIEEKRERAEEL 542
Query: 691 ENDKTHLLTLWEERRILYEQCMDLQLFYRDTEQADTWMAKQEAFLSNEDLGDSLDSVEAL 750
L EE+R EA
Sbjct: 543 RERAAELEAEAEEKR------------------------------------------EAA 560
Query: 751 IKKHEDFEKSLAAQEEKIKALDDLLKVNTMQIKALDEFATKLIEGQHYAADDVAQRRDLL 810
+ E+ E++ E++ L+ L +I++L+ T L D + L
Sbjct: 561 AEAEEEAEEAR----EEVAELNSKLAELKERIESLERIRTLLAA-----IADAEDEIERL 611
Query: 811 LKRRAGLLEKSARRRAILED 830
++R L E + RR L +
Sbjct: 612 REKREALAELNDERRERLAE 631
Score = 32.3 bits (74), Expect = 1.8
Identities = 56/254 (22%), Positives = 99/254 (38%), Gaps = 17/254 (6%)
Query: 221 EGHPERETIVRRKEELSEAWMRLKQLALMRQ--EKLFGAHEIQRFNRDADETVAWIAEKD 278
E E E + R E+L EA R+++L R+ E+L + +
Sbjct: 489 EEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERR-ETIEEKRERAEELRERAA 547
Query: 279 AVLSSDEYGRDLASVQTLQRKHEGVERDLAALEDKVSTLGAEADRLCGIHSDHGDQIQAK 338
+ + E R+ A + + E ++A L K++ L + L I + + A
Sbjct: 548 ELEAEAEEKREAA--AEAEEEAEEAREEVAELNSKLAELKERIESLERIRT-----LLAA 600
Query: 339 RAEIVEYWERLTGKAKGRKQKLDESYYLHRFLADYRDLISWVSDMKAIISADELAKDVAG 398
A+ + ERL K + + DE LA+ R+ + +E +D
Sbjct: 601 IADAEDEIERLREKREALAELNDER---RERLAEKRERKRELEAEFDEARIEEAREDKER 657
Query: 399 AEALLERHQEHKGEIDAREDSFRGTADAGQSLLDRNHYAADDIREKLRNLENDKTHLLTL 458
AE LE+ +E E+ D + A ++ L+ +++RE+ LEN L L
Sbjct: 658 AEEYLEQVEEKLDELREERDDLQAEIGAVENELEE----LEELRERREALENRVEALEAL 713
Query: 459 WEERRILYEQCMDL 472
++E L DL
Sbjct: 714 YDEAEELESMYGDL 727
Score = 31.2 bits (71), Expect = 3.8
Identities = 43/229 (18%), Positives = 84/229 (36%), Gaps = 23/229 (10%)
Query: 496 EDLGDSLDSVEALIKKHEDFEKSLAAQEEKIKALDEFATKL----------IEGQHYAAD 545
E L + + +E LI + + ++ + E+ + L E A +L A+
Sbjct: 512 ERLEERREDLEELIAERRE---TIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAE 568
Query: 546 DVAQRRDLLLKRRAGLLEKSARRRAILEDSYKFQQFEEIVEYWERLTGKAKGRKQKLDES 605
+ + L + A L E+ I I + + + + R+ + +
Sbjct: 569 EAREEVAELNSKLAELKERIESLERIRT------LLAAIADAEDEIERLREKREALAELN 622
Query: 606 YYLHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGT 665
LA+ R+ + +E +D AE LE+ +E E+ D +
Sbjct: 623 DERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAE 682
Query: 666 ADAGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQCMDL 714
A ++ L+ +++RE+ LEN L L++E L DL
Sbjct: 683 IGAVENELEE----LEELRERREALENRVEALEALYDEAEELESMYGDL 727
Score = 30.4 bits (69), Expect = 7.0
Identities = 37/192 (19%), Positives = 69/192 (35%), Gaps = 15/192 (7%)
Query: 171 KEAFVTADEFGADLEHVEVLQRKFDEFQKDMASQEYRVTEVNQLADKLVLEGHPERETIV 230
++ D E E L+ E ++ + + R E+ + A +L E +RE
Sbjct: 502 EDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAA 561
Query: 231 RRKEELSEAWMRLKQLALMRQEKLFGAHEIQRFNRDADETVAWIAEKDAVLSSDEYGRDL 290
+EE EA + +L E I+ R A +D + E L
Sbjct: 562 EAEEEAEEAREEVAELNSKLAEL---KERIESLERIRTLLAAIADAEDEIERLREKREAL 618
Query: 291 ASVQTLQRKHEGVERDLAALEDKVSTLGAEADRLCGIHSDHGDQIQAKRAEIVEYWERLT 350
A + +R+ LA ++ L AE D ++ + + EY E++
Sbjct: 619 AELNDERRER------LAEKRERKRELEAEFD------EARIEEAREDKERAEEYLEQVE 666
Query: 351 GKAKGRKQKLDE 362
K +++ D+
Sbjct: 667 EKLDELREERDD 678
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 31.6 bits (72), Expect = 2.7
Identities = 110/670 (16%), Positives = 229/670 (34%), Gaps = 96/670 (14%)
Query: 175 VTADEFGADLEHVEVLQRKFDEFQKDMASQEYRVT-EVNQLADKLVLEGHPERETIVRRK 233
E AD E + Q + E + + Q + ++ + D+L E + +
Sbjct: 267 HLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADR 326
Query: 234 EELSEAWMRLKQLALMRQEKLFGAHEIQRFNRDADETVAWIAEKDAVLSSDEYGRDLASV 293
EL L Q+ F +I++ D D+ + +E + V
Sbjct: 327 SELE---------LLEDQKGAFEDADIEQLQADLDQLPSIRSELEEV--EARLDALTGKH 375
Query: 294 QTLQRKHEGV--------ERDLAALEDKVSTLGAEADRLCGIHSDHGDQIQAKRAEIVEY 345
Q +QRK+E + ERDL ++++ + E DR + + +QA +++ +
Sbjct: 376 QDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQ---KAAIEEDLQALESQLRQQ 432
Query: 346 WERLTGKAKGRKQKLDESYYLHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLER 405
E + + +L+ + D A + +EL + + LE+
Sbjct: 433 LEAGKLEFNEEEYELELRLGRLKQRLD-----------SATATPEELE-QLEINDEALEK 480
Query: 406 HQE-------HKGEIDAREDSFRGTADAGQSLLDRNHYAADDIREKLRNLEND---KTHL 455
QE + ++ + R D L R +R+ L LE +
Sbjct: 481 AQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQAGS 540
Query: 456 L--------TLWEER--RILYEQCM---DLQLFYRDTEQADTWMAKQEAFLSNEDLGDSL 502
L WEE +++ + + DL + +DT L D+ D
Sbjct: 541 LLHFLRNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLD-LQRLDVPDYA 599
Query: 503 DSVEALIKKHEDFEKSLAAQEEKIKALDEFA-------TKLIEGQHYAADDVAQRRDLLL 555
+ L ++ + E++L + K K +E + + A + Q R L
Sbjct: 600 ANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDL- 658
Query: 556 KRRAGLLEKSARRRAILEDSYKFQQFEEIVEYWERLTGKAKGRKQKLDESYYLHRFLADY 615
+R ++S + + L + + QQ E + +L + K ++ +L D+
Sbjct: 659 -QRLQNEQQSLKDKLELAIAERKQQAETQL---RQLDAQLKQLLEQQQA--FLEALKDDF 712
Query: 616 RDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDAREDSFRGTADAGQSLLDR 675
R+L + ++ + + + A+ + K + + + + L
Sbjct: 713 RELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYD------RELASL 766
Query: 676 NHYAADDIREKLRNLENDKTHLLTLWEERRILYEQCMDLQLFYRDTEQADTWMAKQEAFL 735
D ++ L+ L T E + + + + F + +TW+ +
Sbjct: 767 -----DVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQ-----ETWLHRDSLRE 816
Query: 736 SNEDLGDSLDSVEALIKKHEDFEKSLAAQEE----KIKALDDLLKVNTMQIKALDEFATK 791
+L L +E E+ ++ L + + K L+ K Q+ LDE
Sbjct: 817 ERPNLAIQLRELE---SSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRG 873
Query: 792 LIEGQHYAAD 801
L + A+
Sbjct: 874 LRDEMRQLAE 883
>gnl|CDD|115718 pfam07082, DUF1350, Protein of unknown function (DUF1350). This
family consists of several hypothetical proteins from
both cyanobacteria and plants. Members of this family
are typically around 250 residues in length. The
function of this family is unknown but the species
distribution indicates that the family may be involved
in photosynthesis.
Length = 250
Score = 30.2 bits (68), Expect = 5.1
Identities = 16/57 (28%), Positives = 28/57 (49%), Gaps = 5/57 (8%)
Query: 531 EFATKLIEGQHYAADDVAQRRDLLLKRRAGLLEKSARRRAILEDSYKFQQFEEIVEY 587
EF E H + RR+LL+K + ++++AR R+IL +F ++V
Sbjct: 145 EFTPSPTETNHLIQERYPVRRNLLIKFQNDDIDQTARLRSILR-----AKFGDMVTA 196
>gnl|CDD|236202 PRK08249, PRK08249, cystathionine gamma-synthase; Provisional.
Length = 398
Score = 30.1 bits (68), Expect = 7.0
Identities = 23/66 (34%), Positives = 31/66 (46%), Gaps = 6/66 (9%)
Query: 55 DADELESWIYEKLQAASDESYKET-TNLQAKIQKHQAFEAEVAAHSNAIVVLDNTGKEMI 113
D +++E ++ D Y ET TN KI + A A A+VV+DNT I
Sbjct: 138 DHEQIE----AEIAKGCDLLYLETPTNPTLKIVDIERL-AAAAKKVGALVVVDNTFATPI 192
Query: 114 NQNHFA 119
NQN A
Sbjct: 193 NQNPLA 198
>gnl|CDD|221671 pfam12623, Hen1_L, RNA repair, ligase-Pnkp-associating, region of
Hen1. This domain is the N-terminal region of the
bacterial Hen1 protein. This protein forms stable
heterotetramer with Pnkp. The heterotetramer was able to
repair transfer RNAs cleaved by ribotoxins in vitro.
This domain provides the ligase activity of the
heterotetramer.
Length = 245
Score = 29.5 bits (67), Expect = 7.2
Identities = 12/24 (50%), Positives = 13/24 (54%), Gaps = 6/24 (25%)
Query: 215 ADKLV------LEGHPERETIVRR 232
DKL+ L HPERE I RR
Sbjct: 203 VDKLLRAGEGWLAEHPERELITRR 226
>gnl|CDD|146925 pfam04529, Herpes_U59, Herpesvirus U59 protein. The proteins in
this family have no known function. Cytomegalovirus UL88
is also a member of this family.
Length = 365
Score = 29.7 bits (67), Expect = 8.2
Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 16/79 (20%)
Query: 543 AADDVAQRRDLLLKRRAGLLEKSARRRAILEDSYKFQQF-----------EEIVEYWERL 591
D + LL+ L +A +R +L D + + F E+ E E +
Sbjct: 193 EVDISPRSERELLQHCVEL---AASKRLLLLDGPRLENFFLAQVCLYGLGED--ELAEEM 247
Query: 592 TGKAKGRKQKLDESYYLHR 610
G GR + + ++ LHR
Sbjct: 248 LGMLAGRPEDMSAAFLLHR 266
>gnl|CDD|223935 COG1003, GcvP, Glycine cleavage system protein P
(pyridoxal-binding), C-terminal domain [Amino acid
transport and metabolism].
Length = 496
Score = 30.0 bits (68), Expect = 8.6
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 933 LQEASQQQGFNRTIEDIELWLSEIEG 958
Q Q QG+ I +++ WL EI G
Sbjct: 98 FQPEEQVQGYLELIYELQEWLKEITG 123
>gnl|CDD|223224 COG0146, HyuB, N-methylhydantoinase B/acetone carboxylase, alpha
subunit [Amino acid transport and metabolism / Secondary
metabolites biosynthesis, transport, and catabolism].
Length = 563
Score = 29.9 bits (68), Expect = 9.0
Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 3/52 (5%)
Query: 355 GRKQKLDESYYLHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERH 406
+ + E L L + R + D+KA I+A K L++ +
Sbjct: 156 VKNGEFREDI-LRILLRNVRTPDYNIGDLKAQIAA--NLKGRRRVRELIDEY 204
Score = 29.9 bits (68), Expect = 9.0
Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 3/52 (5%)
Query: 597 GRKQKLDESYYLHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERH 648
+ + E L L + R + D+KA I+A K L++ +
Sbjct: 156 VKNGEFREDI-LRILLRNVRTPDYNIGDLKAQIAA--NLKGRRRVRELIDEY 204
>gnl|CDD|218133 pfam04533, Herpes_U44, Herpes virus U44 protein. This is a family
of proteins from dsDNA beta-herpesvirinae and
gamma-herpesvirinae viruses. The function is not known,
and the proteins are named variously as U44, BSRF1,
UL71, and M71. The family BSRF1 has been merged into
this.
Length = 211
Score = 29.3 bits (66), Expect = 9.5
Identities = 14/41 (34%), Positives = 20/41 (48%), Gaps = 7/41 (17%)
Query: 245 QLALMRQEKLFGAHEIQRFNRDADETVAWIAEKDAVLSSDE 285
+ALM EKL A ++ D + AW E D +L S+
Sbjct: 141 TVALMEMEKLATAIKM-------DNSKAWATEIDPILHSET 174
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 29.7 bits (67), Expect = 9.7
Identities = 51/318 (16%), Positives = 96/318 (30%), Gaps = 34/318 (10%)
Query: 500 DSLDSVEALIKKHEDF------------EKSLAAQEEKIKALDEFATKLIEGQHYAADDV 547
D EAL K + + E+++ EK++ + + E Y D
Sbjct: 648 DFRGLKEALKKALKLYSNEGEEDLKEALEEAIEELVEKLEDVLNEFKEQSEDIGYLEDAG 707
Query: 548 AQRRDLLLKRRAGLLEKSARRRAILEDSYK----FQQFEEIV-EYWERLTGKAKGRKQKL 602
RD + + + + K ++ E E + K +L
Sbjct: 708 ELLRDFAPELEIKVERGNKILAKLFGRFLKAFNALKKLYEFAYTLEEEIQYLEDLLKARL 767
Query: 603 DESYYLHRFLADYRDLISWVSDMKAIISADELAKDVAGAEALLERHQEHKGEIDARE--- 659
++ Y + Y DLI ++D + I + A D+ + LE K EI+
Sbjct: 768 EKDY-----QSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELE--LIGKQEINIDYILE 820
Query: 660 --DSFRGTADAGQSLLDRNHYAADDIREKLRNLENDKTHLLTLWEERRILYEQCMDLQLF 717
+F A +SL + I+E + +L K + L E + +D
Sbjct: 821 LLQTFNDKNGAYESLKELIE---RIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEK 877
Query: 718 YRDTEQADTWMAKQEAFLSNEDLGDSLDSVEALIKKHEDFEKSLAAQEEKIK-ALDDLLK 776
+ + + F E L + + E K A + L+
Sbjct: 878 LEELYILAKKEEEFKQFAEEEGLNEEEL-AFYDDLALNGGKLPENGTELVEKLAKEKSLR 936
Query: 777 VNTMQIKALDEFATKLIE 794
E ++
Sbjct: 937 EKNKDDWKAKEEVEAKLK 954
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.131 0.364
Gapped
Lambda K H
0.267 0.0722 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 53,859,132
Number of extensions: 5603788
Number of successful extensions: 7870
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7598
Number of HSP's successfully gapped: 292
Length of query: 1032
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 925
Effective length of database: 6,191,724
Effective search space: 5727344700
Effective search space used: 5727344700
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (28.4 bits)