RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13767
(801 letters)
>gnl|CDD|111117 pfam02187, GAS2, Growth-Arrest-Specific Protein 2 Domain.
Length = 73
Score = 135 bits (342), Expect = 4e-38
Identities = 50/70 (71%), Positives = 57/70 (81%)
Query: 727 EKIHDEVKRLVQLCTCRQKFRVFQVGEGKYRFGDSQKLRLVRILRSTVMVRVGGGWVALD 786
+K+ DEVKR+V+ C C KF V QV EGKYR GDSQ LRLVRILRS VMVRVGGGW LD
Sbjct: 1 DKLDDEVKRIVEDCKCPDKFCVEQVSEGKYRVGDSQILRLVRILRSHVMVRVGGGWETLD 60
Query: 787 EFLIKNDPCR 796
E+L+K+DPCR
Sbjct: 61 EYLLKHDPCR 70
>gnl|CDD|128539 smart00243, GAS2, Growth-Arrest-Specific Protein 2 Domain.
GROWTH-ARREST-SPECIFIC PROTEIN 2 Domain.
Length = 73
Score = 135 bits (341), Expect = 6e-38
Identities = 51/70 (72%), Positives = 60/70 (85%)
Query: 727 EKIHDEVKRLVQLCTCRQKFRVFQVGEGKYRFGDSQKLRLVRILRSTVMVRVGGGWVALD 786
+KI DEVKR+V+ C C KF+V ++ EGKYRFGDSQ LRLVRILRSTVMVRVGGGW LD
Sbjct: 1 DKIDDEVKRIVEDCKCPTKFQVEKISEGKYRFGDSQILRLVRILRSTVMVRVGGGWETLD 60
Query: 787 EFLIKNDPCR 796
E+L+K+DPCR
Sbjct: 61 EYLLKHDPCR 70
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 86.0 bits (213), Expect = 5e-19
Identities = 45/259 (17%), Positives = 107/259 (41%), Gaps = 47/259 (18%)
Query: 90 AKQFQDKLTGILDWLDKSEKKIKDMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELT 149
+QF + WL + E+ + + D E ++ +++H+AL E+ + L
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDY-GDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 150 DIASSLMGLVGEDEAAGVADKLQDTADRYGALVEASDNLGQYAFLYNQLILSPRFSSVTD 209
++ L+ D +
Sbjct: 61 ELGEQLIEEGHPD--------------------------------------------AEE 76
Query: 210 IKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPP 269
I+++LE LN W E+++ +R + LEEAL L + F + + L + + L S++
Sbjct: 77 IQERLEELNQRWEELRELAEERRQRLEEALDLQQFFR-DADDLEQWLEEKEAALASEDLG 135
Query: 270 AVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDLDSA 329
+ +++++ KE++ E++ +P ++ ++L++ E+++ +E+L+
Sbjct: 136 K-DLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNER 194
Query: 330 WDNVTALFAKREENLIHAM 348
W+ + L +R++ L A+
Sbjct: 195 WEELLELAEERQKKLEEAL 213
Score = 77.5 bits (191), Expect = 5e-16
Identities = 47/251 (18%), Positives = 104/251 (41%), Gaps = 42/251 (16%)
Query: 241 LAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQC 300
++F + + A L + ++ L+S + + ++++ + ++AE+ + VE
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGD-DLESVEALLKKHEALEAELAAHEERVEAL 59
Query: 301 RASGQKLMKICGEPDKPEVKKHIEDLDSAWDNVTALFAKREENLIHAMEKAMEFHETLQR 360
G++L++ G PD E+++ +E+L+ W+ + L +R + L A++ F +
Sbjct: 60 NELGEQLIEE-GHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADD- 117
Query: 361 KGEQGTITALFAKREENLIHAMEKAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPEDD 420
E + E A+ + L +D
Sbjct: 118 --------------LEQWLEEKEAALASED-------------------------LGKDL 138
Query: 421 QEARTQLAEHEKFLRELAEKEIEKDATIGLAQRILVKSHPDGATVIKHWITIIQSRWEEV 480
+ L +H++ EL E + LA+ +L + HPD I+ + + RWEE+
Sbjct: 139 ESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNERWEEL 198
Query: 481 SSWAKQREERL 491
A++R+++L
Sbjct: 199 LELAEERQKKL 209
Score = 76.3 bits (188), Expect = 1e-15
Identities = 56/231 (24%), Positives = 99/231 (42%), Gaps = 21/231 (9%)
Query: 392 LQQNRDDCKKADCNADAVQTFVNS--LPEDDQEARTQLAEHEKFLRELAEKEIEKDATIG 449
LQQ D + + + ++S +D + L +HE ELA E +A
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61
Query: 450 LAQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRSLQDLDSLLEELL 509
L ++ L++ A I+ + + RWEE+ A++R +RL L Q ++L
Sbjct: 62 LGEQ-LIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDA-DDLE 119
Query: 510 EWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEFMEATSKRQHEVDSVRASPSREKLND 569
+WL + E+ L +E L D+ +VE L+++HKE E + + S+
Sbjct: 120 QWLEEKEAAL---ASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNEL-------- 168
Query: 570 NLPHYGPRFPPKGSKGAEPQFRNPRCRLLWDTWRNVWLLAWERQRRLQERL 620
+G A+ + + L + W + LA ERQ++L+E L
Sbjct: 169 -----AEELLEEGHPDADEEIEE-KLEELNERWEELLELAEERQKKLEEAL 213
Score = 72.1 bits (177), Expect = 4e-14
Identities = 42/183 (22%), Positives = 94/183 (51%), Gaps = 3/183 (1%)
Query: 8 PSADYKVVKAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQLNELMN 67
D + V+A L++ + L+ LA + + +L ++G ++ + + I+ +L EL
Sbjct: 28 YGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLI-EEGHPDAEEIQERLEELNQ 86
Query: 68 RFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIKDMELIPTDEEKIQQR 127
R++ L E A +R LE+A+ + + F+D + WL++ E + +L D E +++
Sbjct: 87 RWEELRELAEERRQRLEEALDLQQFFRDADD-LEQWLEEKEAALASEDL-GKDLESVEEL 144
Query: 128 IREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADKLQDTADRYGALVEASDN 187
+++H L +E+ +P L ++A L+ D + +KL++ +R+ L+E ++
Sbjct: 145 LKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNERWEELLELAEE 204
Query: 188 LGQ 190
+
Sbjct: 205 RQK 207
Score = 47.1 bits (112), Expect = 1e-05
Identities = 23/86 (26%), Positives = 42/86 (48%)
Query: 2 VANQKPPSADYKVVKAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQ 61
+ D + V+ L++ K L++ L + + SL ++ E+ P + IE +
Sbjct: 128 ALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEK 187
Query: 62 LNELMNRFDNLNEGASQRMDALEQAM 87
L EL R++ L E A +R LE+A+
Sbjct: 188 LEELNERWEELLELAEERQKKLEEAL 213
>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats.
Length = 101
Score = 53.1 bits (128), Expect = 9e-09
Identities = 21/102 (20%), Positives = 53/102 (51%), Gaps = 2/102 (1%)
Query: 243 EKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRA 302
++F + + A L + + L S++ + ++++ + +AE++ + VE
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGK-DLESVEALLKKHEAFEAELEAHEERVEALNE 59
Query: 303 SGQKLMKICGEPDKPEVKKHIEDLDSAWDNVTALFAKREENL 344
G++L++ G PD E+++ +E+L+ W+ + L +R + L
Sbjct: 60 LGEQLIEE-GHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 46.6 bits (111), Expect = 2e-06
Identities = 19/99 (19%), Positives = 47/99 (47%), Gaps = 2/99 (2%)
Query: 92 QFQDKLTGILDWLDKSEKKIKDMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDI 151
QF + WL++ E+ + + + D E ++ +++H+A E+ + L ++
Sbjct: 2 QFLRDADELEAWLEEKEQLLASED-LGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 152 ASSLMGLVGEDEAAGVADKLQDTADRYGALVEASDNLGQ 190
L+ G +A + ++L++ +R+ L E ++ Q
Sbjct: 61 GEQLIEE-GHPDAEEIEERLEELNERWEELKELAEERRQ 98
Score = 45.0 bits (107), Expect = 8e-06
Identities = 25/101 (24%), Positives = 46/101 (45%), Gaps = 3/101 (2%)
Query: 393 QQNRDDCKKADCNADAVQTFVNS--LPEDDQEARTQLAEHEKFLRELAEKEIEKDATIGL 450
QQ D + + + + + S L +D + L +HE F EL E +A L
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 451 AQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQREERL 491
++ L++ A I+ + + RWEE+ A++R ++L
Sbjct: 61 GEQ-LIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 41.5 bits (98), Expect = 1e-04
Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 2 VANQKPPSADYKVVKAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQ 61
+ + D + V+A L++ + + L + + +L ++G ++ + + IE +
Sbjct: 20 LLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLI-EEGHPDAEEIEER 78
Query: 62 LNELMNRFDNLNEGASQRMDALE 84
L EL R++ L E A +R LE
Sbjct: 79 LEELNERWEELKELAEERRQKLE 101
Score = 36.9 bits (86), Expect = 0.006
Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 3/62 (4%)
Query: 499 QDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEFMEATSKRQHEVDSV 558
Q +EL WL + E L +E L D+ +VE L+++H+ F + V+++
Sbjct: 1 QQFLRDADELEAWLEEKEQLL---ASEDLGKDLESVEALLKKHEAFEAELEAHEERVEAL 57
Query: 559 RA 560
Sbjct: 58 NE 59
>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat. Spectrin repeat-domains are
found in several proteins involved in cytoskeletal
structure. These include spectrin, alpha-actinin and
dystrophin. The sequence repeat used in this family is
taken from the structural repeat in reference. The
spectrin domain- repeat forms a three helix bundle. The
second helix is interrupted by proline in some
sequences. The repeats are defined by a characteristic
tryptophan (W) residue at position 17 in helix A and a
leucine (L) at 2 residues from the carboxyl end of helix
C. Although the domain occurs in ultiple repeats along
sequences, the domains are actually stable on their own
- ie they act, biophysically, like domains rather than
repeats that along function when aggregated.
Length = 105
Score = 42.3 bits (100), Expect = 6e-05
Identities = 21/104 (20%), Positives = 54/104 (51%), Gaps = 2/104 (1%)
Query: 241 LAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQC 300
L ++F+ + + + + + + L+S++ + +++Q K ++AE+ + VE
Sbjct: 2 LLQQFFRDADDLESWIEEKEALLSSEDYGK-DLESVQALLKKHKALEAELAAHQDRVEAL 60
Query: 301 RASGQKLMKICGEPDKPEVKKHIEDLDSAWDNVTALFAKREENL 344
+KL+ G E+++ +E+L+ W+ + L A+R++ L
Sbjct: 61 NELAEKLIAE-GHYASEEIQERLEELNERWEQLLELAAERKQKL 103
Score = 38.8 bits (91), Expect = 0.001
Identities = 18/95 (18%), Positives = 42/95 (44%), Gaps = 2/95 (2%)
Query: 92 QFQDKLTGILDWLDKSEKKIKDMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDI 151
QF + W+++ E + + D E +Q +++H AL E+ + L ++
Sbjct: 5 QFFRDADDLESWIEEKEALLSSEDY-GKDLESVQALLKKHKALEAELAAHQDRVEALNEL 63
Query: 152 ASSLMGLVGEDEAAGVADKLQDTADRYGALVEASD 186
A L+ + ++L++ +R+ L+E +
Sbjct: 64 AEKLIAEGHYASEE-IQERLEELNERWEQLLELAA 97
Score = 37.3 bits (87), Expect = 0.004
Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 416 LPEDDQEARTQLAEHEKFLRELAEKEIEKDATIGLAQRILVKSHPDGATVIKHWITIIQS 475
+D + + L +H+ ELA + +A LA++ L+ + I+ + +
Sbjct: 29 YGKDLESVQALLKKHKALEAELAAHQDRVEALNELAEK-LIAEGHYASEEIQERLEELNE 87
Query: 476 RWEEVSSWAKQREERLRN 493
RWE++ A +R+++L
Sbjct: 88 RWEQLLELAAERKQKLEE 105
Score = 35.8 bits (83), Expect = 0.013
Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
Query: 3 ANQKPPSADYKVVKAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQL 62
+ + D + V+A L++ K L+ LA Q + +L ++ ++ A A + I+ +L
Sbjct: 24 LSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELAEKLIAEGHYAS-EEIQERL 82
Query: 63 NELMNRFDNLNEGASQRMDALEQ 85
EL R++ L E A++R LE+
Sbjct: 83 EELNERWEQLLELAAERKQKLEE 105
Score = 30.7 bits (70), Expect = 0.85
Identities = 17/80 (21%), Positives = 34/80 (42%), Gaps = 5/80 (6%)
Query: 498 LQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEFMEATSKRQHEVDS 557
LQ ++L W+ + E+ L + E D+ +V+ L+++HK + Q V++
Sbjct: 3 LQQFFRDADDLESWIEEKEALLSS---EDYGKDLESVQALLKKHKALEAELAAHQDRVEA 59
Query: 558 VRASPSREKLNDNLPHYGPR 577
+ EKL +
Sbjct: 60 LNE--LAEKLIAEGHYASEE 77
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 45.5 bits (108), Expect = 1e-04
Identities = 85/478 (17%), Positives = 182/478 (38%), Gaps = 45/478 (9%)
Query: 53 AERKAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIK 112
E +A+E +L EL + L E ++ ++ + K +++L + L++ E+KI+
Sbjct: 232 QEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIE 291
Query: 113 DMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADKLQ 172
+++ RE + L +E+ + EL ++ L L E +KL+
Sbjct: 292 R----------LEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKL--EEKLE 339
Query: 173 DTADRYGALVEASDNLGQYAFLYNQLILSPRFSSVTDIKKKLERLNGLWNEVQKATNDRG 232
L E + L + +L R ++++LE L + +
Sbjct: 340 KLESELEELAEEKNELAK--------LLEERLKE---LEERLEELEKELEKALERLKQLE 388
Query: 233 RSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQ 292
+++E + + L+ + L +L+ L E + +++ + +K+++ +I+Q
Sbjct: 389 EAIQELKEELAELSAALEEIQEELEELEKELEELE------RELEELEEEIKKLEEQINQ 442
Query: 293 TKPEVEQCR--ASGQKLMKICGEPDKPEVKKHIEDLDSAWDNVTALFAKREENLIHAMEK 350
+ + A + +CG+ E +K + +L RE+ E+
Sbjct: 443 LESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREE 502
Query: 351 AMEFHETLQR-KGEQGTITALFAKREENLIHAMEKAMEFHETLQQNRDDCKKADCNADAV 409
E + L+ + E + L +E L +EK E L++ ++ +
Sbjct: 503 IEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQL----- 557
Query: 410 QTFVNSLPEDDQEARTQLAEHEKFL--RELAEKEIEKDATIGLAQRILVKSHPDGATVIK 467
+ + L + QE + L E +E E+ E+ + + L + +++
Sbjct: 558 KEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQ 617
Query: 468 HWITIIQSRWEEVSSWAKQREERLRNHLRSLQDLDSLLEELLEWLAKCESHLLNLEAE 525
+ E ++ EE L + L L L E L L + E + LEAE
Sbjct: 618 S--LELSEAENE----LEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAE 669
Score = 44.8 bits (106), Expect = 2e-04
Identities = 77/521 (14%), Positives = 193/521 (37%), Gaps = 88/521 (16%)
Query: 49 NADPAERKAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSE 108
+ P ERK I +L L + + + + +++A A ++ + +L+ +L+ ++
Sbjct: 150 KSKPKERKEILDELFGL----EKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLL 205
Query: 109 KKIKDMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVA 168
+ +++ E K + I+E +E L + E+ + E+ A +
Sbjct: 206 EALEEE----LKELKKLEEIQEE--QEEEELEQ-----EIEAL---------EERLAELE 245
Query: 169 DKLQDTADRYGALVEASDNLGQYAFLYNQLILSPRFSSVTDIKKKLERLNGLWNEVQKAT 228
++ + + L+E + + + + ++LERL E +
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEEL------------RELERLLEELEEKIERL 293
Query: 229 NDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKA 288
+ R +EE E + L+ L +L + L S E + ++++ L+
Sbjct: 294 EELEREIEELEEELEGLRALLEE----LEELLEKLKSLEE---RLEKLEEKLEKLESELE 346
Query: 289 EIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDLDSAWDNVTALFAKREENLIHAM 348
E+ + K E+ + K E+++ +E+L+ + + EE +
Sbjct: 347 ELAEEKNELAKLLEERLK-----------ELEERLEELEKELEKALERLKQLEEAIQELK 395
Query: 349 EKAMEFHETLQRKGEQGTITALFAKREENLIHAMEKAMEFHETLQQNRDDCKKADCNADA 408
E+ E + E + +E+ + E L++ ++ ++ +
Sbjct: 396 EELAE-----------------LSAALEEIQEELEELEKELEELERELEELEEEIKKLEE 438
Query: 409 -------VQTFVNSLPEDDQEARTQLAEHEKFLRELAEKEIEKDATIGLAQRILVKSHPD 461
+ + L ++ E + + + E + L + + +
Sbjct: 439 QINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEE-LEEELSREKE-- 495
Query: 462 GATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRSLQDLDSLLEELLEWLAKCESHLLN 521
++ I ++ E+ ++ E L ++L+ LE+L L + E
Sbjct: 496 -EAELREEIEELEKELREL---EEELIELLELEEALKEELEEKLEKLENLLEELEELKEK 551
Query: 522 LEAEPLPDDIPTVERLIEEHKEFMEA---TSKRQHEVDSVR 559
L+ + L +++ +E ++E KE +E R+ E++ +R
Sbjct: 552 LQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELR 592
Score = 35.9 bits (83), Expect = 0.094
Identities = 45/253 (17%), Positives = 98/253 (38%), Gaps = 23/253 (9%)
Query: 59 ERQLNELMNRFDNLNEGASQRMDALEQA-MAVAKQFQDKLTGILDWLDKSEKKIKDMELI 117
L + L E + + LE+ K +L L L+ +++K++
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577
Query: 118 PTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADKLQDTADR 177
++ + E L + + K EL + S L L+ E + ++L++ +
Sbjct: 578 LRLLRTRKEELEE---LRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEE 634
Query: 178 YGALVEASDNLGQYAFLYNQLILSPRFSSVTDIKKKLERLNGLWNEVQKATNDRGRSLEE 237
+ +E + + L +++ ++++K+E L + + LEE
Sbjct: 635 LESELEKLNLQAELEEL--------LQAALEELEEKVEELEAEIRRELQRIENE-EQLEE 685
Query: 238 ALALAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEV 297
L E+ EL+ + L +L L I+Q L+ KAE+++ K E+
Sbjct: 686 KLEELEQLEEELEQLREELEELLKKLGE----------IEQLIEELESRKAELEELKKEL 735
Query: 298 EQCRASGQKLMKI 310
E+ + + L ++
Sbjct: 736 EKLEKALELLEEL 748
Score = 29.7 bits (67), Expect = 8.5
Identities = 20/103 (19%), Positives = 49/103 (47%), Gaps = 4/103 (3%)
Query: 53 AERKAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIK 112
A + +E ++ EL + + + LE+ + +Q +++L + + L++ KK+
Sbjct: 654 AALEELEEKVEELEAEIRRELQ-RIENEEQLEEKLEELEQLEEELEQLREELEELLKKLG 712
Query: 113 DMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSL 155
++E E+++ R E + L KE+ + + L ++ L
Sbjct: 713 EIE---QLIEELESRKAELEELKKELEKLEKALELLEELREKL 752
>gnl|CDD|222177 pfam13499, EF_hand_5, EF-hand domain pair.
Length = 60
Score = 37.3 bits (87), Expect = 0.001
Identities = 15/61 (24%), Positives = 28/61 (45%), Gaps = 5/61 (8%)
Query: 657 RKMDKNNDGLIPREDFVDGIIKTKF----ETSKLEMGAVADMFDHDYNPGLIDWKEFIAA 712
+ +DK+ DG I E+ + E + + A + D D + G I ++EF+ A
Sbjct: 1 KLLDKDGDGYIDVEELRKLLKALGLKLTDEEVEELIEADFNEIDKDGD-GRISFEEFLEA 59
Query: 713 L 713
+
Sbjct: 60 M 60
Score = 30.4 bits (69), Expect = 0.45
Identities = 10/44 (22%), Positives = 23/44 (52%), Gaps = 6/44 (13%)
Query: 639 KRFLRFMNHKKSR------LTDLFRKMDKNNDGLIPREDFVDGI 676
++ L+ + K + + F ++DK+ DG I E+F++ +
Sbjct: 17 RKLLKALGLKLTDEEVEELIEADFNEIDKDGDGRISFEEFLEAM 60
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 42.0 bits (99), Expect = 0.002
Identities = 39/261 (14%), Positives = 97/261 (37%), Gaps = 30/261 (11%)
Query: 207 VTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNL-NS 265
+ +++K+LE L +++K + R + + +E++ + + L L
Sbjct: 700 LAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTEL 759
Query: 266 QEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICG------------- 312
+ + +++ + L E +AEI++ + ++EQ + + L +
Sbjct: 760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA 819
Query: 313 ----------EPDKPEVKKHIEDLDSAWDNVTALFAKREENLIHAMEKAMEFHETLQRKG 362
E ++ +EDL+ + ++ E+L +E+ E E L+ +
Sbjct: 820 ANLRERLESLERRIAATERRLEDLEEQIEELSE----DIESLAAEIEELEELIEELESEL 875
Query: 363 EQGTITALFAKREENLIHAMEKAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPEDDQE 422
E + A EE L + E E L++ + + ++ + L +
Sbjct: 876 EA--LLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEG 933
Query: 423 ARTQLAEHEKFLRELAEKEIE 443
++ ++ L E +E
Sbjct: 934 LEVRIDNLQERLSEEYSLTLE 954
Score = 38.5 bits (90), Expect = 0.016
Identities = 60/311 (19%), Positives = 132/311 (42%), Gaps = 35/311 (11%)
Query: 58 IERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIKDME-L 116
+ RQ++ L L Q + + Q + + ++ + + L+++E+++ + E
Sbjct: 724 LSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE 783
Query: 117 IPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADKLQDTAD 176
I E +I+Q E AL + + + + T L + A++L + + ++ T
Sbjct: 784 IEELEAQIEQLKEELKALREALDELRAELTLLNEEAANL-----RERLESLERRIAATER 838
Query: 177 RYGALVEASDNLGQYAFLYNQLILSPRFSSVTDIKKKLERLNGLWNEVQKATNDRGRSLE 236
R L E + L + + L+ + ++ ++LE E++ N+R SLE
Sbjct: 839 RLEDLEEQIEELSE-----DIESLAAEIEELEELIEELES------ELEALLNER-ASLE 886
Query: 237 EALALAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPE 296
EALAL SEL+ + LR+L+ + + +++ + L +++ ++ +
Sbjct: 887 EALALLR---SELEELSEELRELESKRSELR------RELEELREKLAQLELRLEGLEVR 937
Query: 297 VEQCRASGQKLMKICGEPDKPEVKKHIEDLDSAWDNVTALFAKREE----NLIHAMEKAM 352
++ + + + E + K +D + A + L K +E NL A+E
Sbjct: 938 IDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLA-AIE--- 993
Query: 353 EFHETLQRKGE 363
E+ E +R
Sbjct: 994 EYEELKERYDF 1004
Score = 33.1 bits (76), Expect = 0.65
Identities = 60/361 (16%), Positives = 122/361 (33%), Gaps = 96/361 (26%)
Query: 207 VTDIKKKLER-LNGLWNEVQKAT-----NDRGRSLEEALALAEKFWSELQSVMATLRDLQ 260
+ DI +LER L L + +KA R LE AL + L +L+
Sbjct: 191 LEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVLR------------LEELR 238
Query: 261 DNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVK 320
+ L + + +++ + L+E+ AE+ + + ++E+ R E++
Sbjct: 239 EELEELQ------EELKEAEEELEELTAELQELEEKLEELRLE------------VSELE 280
Query: 321 KHIEDLDSAWDNVTALFAKREENLIHAMEKAMEFHETLQRKGEQGTITALFAKREENLIH 380
+ IE+L + ++ E+ E+ L+ Q + L +K +E
Sbjct: 281 EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQ--LEELESKLDELAEE 338
Query: 381 AMEKAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQLAEHEKFLRELAEK 440
E + E L++ + + + L + +E ++L E E+ L L K
Sbjct: 339 LAELEEKL-EELKEELESLEAE----------LEELEAELEELESRLEELEEQLETLRSK 387
Query: 441 EIEKDATIGLAQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRS--- 497
+ + + L N +
Sbjct: 388 VAQ----------------------------------------LELQIASLNNEIERLEA 407
Query: 498 -LQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEE---HKEFMEATSKRQH 553
L+ L+ E L + + + L E + L ++ +E +EE E +E +
Sbjct: 408 RLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELR 467
Query: 554 E 554
E
Sbjct: 468 E 468
Score = 29.6 bits (67), Expect = 8.7
Identities = 45/273 (16%), Positives = 110/273 (40%), Gaps = 20/273 (7%)
Query: 91 KQFQDKLTGILDWLDKSEKKIKDME-LIPTDEEKIQQRIREHDALHKEILRKKPDFTELT 149
++ +++L + + L ++E++++++ + EEK+++ E L +EI + + L
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALA 294
Query: 150 DIASSLMGLVGEDEAAGVADKLQDTADRYGALVEASDNLGQYAFLYNQLILSPRFSSVTD 209
+ S L E + + ++L + + L + L + + +
Sbjct: 295 NEISRL-----EQQKQILRERLANLERQLEELEAQLEELESKLDELAE--------ELAE 341
Query: 210 IKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPP 269
+++KLE L E ++ LE L E EL+ + TLR L Q
Sbjct: 342 LEEKLEELK----EELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQI-- 395
Query: 270 AVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDLDSA 329
A I++ + L+ ++ ++ + E+E+ ++ + + E+++ +E+L
Sbjct: 396 ASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455
Query: 330 WDNVTALFAKREENLIHAMEKAMEFHETLQRKG 362
+ + + E L A + L +
Sbjct: 456 LERLEEALEELREELEEAEQALDAAERELAQLQ 488
>gnl|CDD|238008 cd00051, EFh, EF-hand, calcium binding motif; A diverse superfamily
of calcium sensors and calcium signal modulators; most
examples in this alignment model have 2 active canonical
EF hands. Ca2+ binding induces a conformational change
in the EF-hand motif, leading to the activation or
inactivation of target proteins. EF-hands tend to occur
in pairs or higher copy numbers.
Length = 63
Score = 36.4 bits (85), Expect = 0.003
Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 1/64 (1%)
Query: 651 RLTDLFRKMDKNNDGLIPREDFVDGIIKTKFETSKLEMGAVADMFDHDYNPGLIDWKEFI 710
L + FR DK+ DG I ++ + S+ E+ + D D + G ID++EF+
Sbjct: 1 ELREAFRLFDKDGDGTISADELKAALKSLGEGLSEEEIDEMIREVDKDGD-GKIDFEEFL 59
Query: 711 AALR 714
+
Sbjct: 60 ELMA 63
Score = 27.5 bits (62), Expect = 4.0
Identities = 8/29 (27%), Positives = 19/29 (65%)
Query: 648 KKSRLTDLFRKMDKNNDGLIPREDFVDGI 676
+ + ++ R++DK+ DG I E+F++ +
Sbjct: 34 SEEEIDEMIREVDKDGDGKIDFEEFLELM 62
>gnl|CDD|227455 COG5126, FRQ1, Ca2+-binding protein (EF-Hand superfamily) [Signal
transduction mechanisms / Cytoskeleton / Cell division
and chromosome partitioning / General function
prediction only].
Length = 160
Score = 37.7 bits (88), Expect = 0.008
Identities = 21/92 (22%), Positives = 38/92 (41%), Gaps = 17/92 (18%)
Query: 639 KRFLRFMNHKKSRLTDLFRKMDKNNDGLIPREDFVD-----GIIKTKFETSKLEMGAVAD 693
F + + L + F+ D+++DGLI R + G ++ E +KL
Sbjct: 9 LTFTQLTEEQIQELKEAFQLFDRDSDGLIDRNELGKILRSLGFNPSEAEINKL------- 61
Query: 694 MFDHDYNPGLIDWKEFIAAL-----RPDWEEK 720
+ D +D+ EF+ + R D EE+
Sbjct: 62 FEEIDAGNETVDFPEFLTVMSVKLKRGDKEEE 93
Score = 33.8 bits (78), Expect = 0.13
Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 9/79 (11%)
Query: 641 FLRFMNHKKSR------LTDLFRKMDKNNDGLIPREDFVDGIIKTK-FETSKLEMGAVAD 693
FL M+ K R L + F+ DK++DG I + ++K+ S E+ +
Sbjct: 77 FLTVMSVKLKRGDKEEELREAFKLFDKDHDGYISIGELRR-VLKSLGERLSDEEVEKLLK 135
Query: 694 MFDHDYNPGLIDWKEFIAA 712
+D D + G ID++EF
Sbjct: 136 EYDEDGD-GEIDYEEFKKL 153
>gnl|CDD|236270 PRK08471, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 613
Score = 38.9 bits (91), Expect = 0.010
Identities = 23/93 (24%), Positives = 40/93 (43%), Gaps = 6/93 (6%)
Query: 178 YGALVEASDNLGQYAFLYNQL-ILSPRFSSVTD--IKKKLERLNGLWNEVQKATNDRGRS 234
Y +AS L + ++ L S F + D I K L+ WN+ A+N + +
Sbjct: 75 YSRYKKASTELEYTDYEFSTLQEASQYFPDLDDTGILKDLQDYFNAWNDF--ASNPKDSA 132
Query: 235 LEEALA-LAEKFWSELQSVMATLRDLQDNLNSQ 266
++ALA E + ++ L LQ +N +
Sbjct: 133 QKQALAQKTETLTNNIKDTRERLDTLQKKVNEE 165
>gnl|CDD|238009 cd00052, EH, Eps15 homology domain; found in proteins implicated in
endocytosis, vesicle transport, and signal transduction.
The alignment contains a pair of EF-hand motifs,
typically one of them is canonical and binds to Ca2+,
while the other may not bind to Ca2+. A hydrophobic
binding pocket is formed by residues from both EF-hand
motifs. The EH domain binds to proteins containing NPF
(class I), [WF]W or SWG (class II), or H[TS]F (class
III) sequence motifs.
Length = 67
Score = 33.3 bits (77), Expect = 0.038
Identities = 17/64 (26%), Positives = 29/64 (45%), Gaps = 9/64 (14%)
Query: 654 DLFRKMDKNNDGLIPREDFVDGIIKTKFETSKL---EMGAVADMFDHDYNPGLIDWKEFI 710
+FR +D + DGLI ++ + S L + + D+ D D G +D +EF
Sbjct: 3 QIFRSLDPDGDGLISGDE-----ARPFLGKSGLPRSVLAQIWDLADTD-KDGKLDKEEFA 56
Query: 711 AALR 714
A+
Sbjct: 57 IAMH 60
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 37.2 bits (86), Expect = 0.039
Identities = 96/550 (17%), Positives = 205/550 (37%), Gaps = 73/550 (13%)
Query: 51 DPAERKAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKK 110
DPA+RK I L+E++ +++LE+ K D L + +D E+K
Sbjct: 147 DPAQRKKI---LDEILE------------INSLERNYDKLKDVIDMLRAEISNIDYLEEK 191
Query: 111 IKDMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADK 170
+K L + E I+++I + + H L++ + + A A
Sbjct: 192 LKSSNL---ELENIKKQIADDEKSHSITLKEIERLSIEYNNAMD--DYNNLKSALNELSS 246
Query: 171 LQDTADRYGALVEASDNLGQYAFLYNQLI--LSPRFSSVTDIK--KKLERLNGLWNEVQK 226
L+D +RY + ++ +++ N L R + + K +N +
Sbjct: 247 LEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYK-- 304
Query: 227 ATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKE- 285
ND + ++ L+ + ++ +++ L LQ + N + Q L+
Sbjct: 305 --NDIE-NKKQILSNIDAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGY 361
Query: 286 -------------IKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDLDSAWDN 332
+K +I++ +E+ A +++KI E D +KK + +++ +
Sbjct: 362 EMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKI-QEIDPDAIKKELNEINVKLQD 420
Query: 333 VTALFAKREENLIHAMEKAMEFHETLQRKGEQGTI----TALFAKREENLI-HAMEKAME 387
+++ + + + E E ++ Q T L ++ ++I H EK
Sbjct: 421 ISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSR 480
Query: 388 FHETLQQNRDDCKKADCNADAVQTFVNSLPEDD-----------QEARTQLAEHEKFLRE 436
E +++ + K D ++ L ++ + AR L + + + E
Sbjct: 481 LEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINE 540
Query: 437 LAEKEIEKDATI----GLAQRILVKSHPD--GATVIKHWITI--IQSRWEEVSSWAKQRE 488
L +K + + L L A + I I +SR E+ E
Sbjct: 541 LKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLE 600
Query: 489 ERLRNHLRSLQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEFMEAT 548
RL+ D S ++ + + + E+ NL ++I + LIE+ + ++
Sbjct: 601 SRLQEIEIGFPDDKSYID---KSIREIENEANNL--NNKYNEIQENKILIEKLRGKIDNY 655
Query: 549 SKRQHEVDSV 558
K+ E+DS+
Sbjct: 656 KKQIAEIDSI 665
>gnl|CDD|233227 TIGR00996, Mtu_fam_mce, virulence factor Mce family protein.
Members of this paralogous family are found as six
tandem homologous proteins in the same orientation per
cassette, in four separate cassettes in Mycobacterium
tuberculosis. The six members of each cassette represent
six subfamilies. One subfamily includes the protein mce
(mycobacterial cell entry), a virulence protein required
for invasion of non-phagocytic cells [Cellular
processes, Pathogenesis].
Length = 291
Score = 36.1 bits (84), Expect = 0.060
Identities = 30/149 (20%), Positives = 49/149 (32%), Gaps = 23/149 (15%)
Query: 204 FSSVTDIKKKL--ERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQD 261
S+T + L E+LN + N + +A +G L L + + L + + L D
Sbjct: 136 LGSLTRLLNGLDPEKLNAILNALAEALAGQGPQLRNLLDGLAQLTAALNARDGDIGALID 195
Query: 262 NLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKK 321
NLN ++ A + Q + +D Q L
Sbjct: 196 NLN----RVLDVLADRSDQ-----LDRLLDNLATLTAQLADRDDALDDA----------- 235
Query: 322 HIEDLDSAWDNVTALFAKREENLIHAMEK 350
+ L A V L A+ NL A+
Sbjct: 236 -LAALSGASAQVRDLLAENRPNLPQALAN 263
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 36.2 bits (84), Expect = 0.088
Identities = 50/322 (15%), Positives = 129/322 (40%), Gaps = 38/322 (11%)
Query: 16 KAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQLNELMNRFDNLNEG 75
++ L +++ LK++ + + L ++ E+ + + E +++E L EL + + L
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKN--ELRSLEDLLEELRRQLEELERQ 717
Query: 76 ASQRMDALEQAMAVAKQFQDKLT-----------GILDWLDKSEKKIKDMELIPTDEEKI 124
+ L +Q Q +L + + ++ E+ +++E + K+
Sbjct: 718 LEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKL 777
Query: 125 QQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADKLQDTADRYGALVEA 184
++ I E + + + EL ++ L E + +L+ R L +
Sbjct: 778 KEEIEELEEKRQAL------QEELEELEEELEEA--ERRLDALERELESLEQRRERLEQE 829
Query: 185 SDNLGQYAFLYNQLILSPRFSSVTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEK 244
+ L + + + + +++K+LE L E++ LE+ L E+
Sbjct: 830 IEELEEEIEELEEKLDELE-EELEELEKELEELKEELEELEAEK----EELEDELKELEE 884
Query: 245 FWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASG 304
EL+ + L L + I++ + L+E++A++++ + +
Sbjct: 885 EKEELEEELRELESELAELKEE---------IEKLRERLEELEAKLERLE---VELPELE 932
Query: 305 QKLMKICGEPDKPEVKKHIEDL 326
++L + + + E+++ IE L
Sbjct: 933 EELEEEYEDTLETELEREIERL 954
Score = 35.5 bits (82), Expect = 0.12
Identities = 63/365 (17%), Positives = 146/365 (40%), Gaps = 50/365 (13%)
Query: 200 LSPRFSSVTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDL 259
L + +K +L L L E+++ + R LEE EL+ + + L +L
Sbjct: 683 LEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEEL 742
Query: 260 QDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEV 319
++ L E + +++ Q L+E++ E++ + + + + ++L E + +
Sbjct: 743 EEELEELE------EELEELQERLEELEEELESLEEALAKLKEEIEEL-----EEKRQAL 791
Query: 320 KKHIEDLDSAWDNVTALFAKREENLIHAMEKAMEFHETLQRKGEQGTITALFAKREENLI 379
++ +E+L+ + E + A+E+ +E E + + EQ I + EE +
Sbjct: 792 QEELEELEEE--------LEEAERRLDALERELESLEQRRERLEQ-EI----EELEEEIE 838
Query: 380 HAMEKAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQLAEHEKFLRE--- 436
EK E E L++ + ++ + ++ L ++ +E + E E+ LRE
Sbjct: 839 ELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELES 898
Query: 437 --------LAEKEIEKDATIGLAQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQRE 488
+ + + +R+ V+ + + + +++ E ++
Sbjct: 899 ELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEI 958
Query: 489 ERLRN-HLRSLQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEE-----HK 542
E L +LR++++ + + E E L + E L + + +IEE +
Sbjct: 959 EALGPVNLRAIEEYEEVEERYEE---------LKSQREDLEEAKEKLLEVIEELDKEKRE 1009
Query: 543 EFMEA 547
F E
Sbjct: 1010 RFKET 1014
Score = 35.1 bits (81), Expect = 0.17
Identities = 57/322 (17%), Positives = 129/322 (40%), Gaps = 32/322 (9%)
Query: 54 ERKAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDK------LTGILDWLDKS 107
E + + E + R ++L E ++++ LE+ A+++Q+ L L
Sbjct: 176 EAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLK 235
Query: 108 EKKIK------DMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGE 161
E + + ++ + + E++Q+ + E + KEI K + EL + L +
Sbjct: 236 ELRKELEELEEELSRLEEELEELQEELEEAE---KEIEELKSELEELREELEEL-----Q 287
Query: 162 DEAAGVADKLQDTADRYGALVEASDNLGQYAFLYNQLILSPRFSSVTDIKKKLERLNGLW 221
+E + +++++ L E + L + L + +K++LE L
Sbjct: 288 EELLELKEEIEELEGEISLLRERLEELEN-ELEELEERLEELKEKIEALKEELEERETLL 346
Query: 222 NEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQY 281
E+++ + + EE EL+ + LR+ L ++ I+ +
Sbjct: 347 EELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAEL------AEIRNE-- 398
Query: 282 ALKEIKAEIDQTKPEVEQCRASGQKLMK--ICGEPDKPEVKKHIEDLDSAWDNVTALFAK 339
L+E+K EI+ + +E+ + L + E + E++ +E+L+ + + +
Sbjct: 399 -LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457
Query: 340 REENLIHAMEKAMEFHETLQRK 361
+ L + E E LQR
Sbjct: 458 LRDRLKELERELAELQEELQRL 479
Score = 31.6 bits (72), Expect = 2.1
Identities = 67/360 (18%), Positives = 127/360 (35%), Gaps = 94/360 (26%)
Query: 210 IKKKLERLNGLWNEVQKATNDRGRSLEEALALAEK--FWSELQSVMATLRDLQDNLNSQE 267
++K+LE+L E Q +R + L+ L E ++L+ + L +L++ L+ E
Sbjct: 198 LEKQLEKL-----ERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLE 252
Query: 268 PPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDLD 327
+ +++ Q L+E + EI++ K E+E E+++ +E+L
Sbjct: 253 ------EELEELQEELEEAEKEIEELKSELE-------------------ELREELEELQ 287
Query: 328 SAWDNVTALFAKREENLIHAMEKAMEFHETLQRKGEQGTITALFAKREENLIHAMEKAME 387
+ + E + E+ E L+ +R E L +E E
Sbjct: 288 EELLELKEEIEELEGEISLLRERLEELENELEEL----------EERLEELKEKIEALKE 337
Query: 388 FHETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQLAEHEKFLRELAEKEIEKDAT 447
+L E E L EL + E +
Sbjct: 338 --------------------------------------ELEERETLLEELEQLLAELEEA 359
Query: 448 IGLAQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRSL----QDLDS 503
+ L + +++ + + E + + +RN L L + L+
Sbjct: 360 KEELEEKL-------SALLEELEELFE-ALREELAELEAELAEIRNELEELKREIESLEE 411
Query: 504 LLEELLEWLAKCESHLLNLEAEP--LPDDIPTVERLIEEHKEFMEATSKRQHEVDSVRAS 561
LE L E L + L LEAE L ++ + +EE +E +E R E++ A
Sbjct: 412 RLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAE 471
Score = 31.6 bits (72), Expect = 2.2
Identities = 50/297 (16%), Positives = 126/297 (42%), Gaps = 32/297 (10%)
Query: 274 KAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDLDSAWDNV 333
+ + + + L++++ E+ K E+ ++L + E+++ +E+L +
Sbjct: 674 EELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQ-----LEELERQLEELKRELAAL 728
Query: 334 TAL---FAKREENLIHAMEKAMEFHETLQRKGE--QGTITALFAKREENLIHAMEKAMEF 388
R E L +E+ E E LQ + E + + +L + L +E+ E
Sbjct: 729 EEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLE-EALAKLKEEIEELEEK 787
Query: 389 HETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQLAEHEKFLRELAEKEIEKDATI 448
+ LQ+ ++ ++ + + +++L + + + E+ + EL E+ E + +
Sbjct: 788 RQALQEELEELEEE---LEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKL 844
Query: 449 GLAQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRSLQDLDSLLEEL 508
+ L + ++ + ++ EE+ + ++ E+ L+ ++L+ L EL
Sbjct: 845 DELEEELEE--------LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELREL 896
Query: 509 LEWLAKCES---------HLLNLEAEPLPDDIPTVERLI-EEHKEFMEATSKRQHEV 555
LA+ + L + E L ++P +E + EE+++ +E +R+ E
Sbjct: 897 ESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIER 953
Score = 29.7 bits (67), Expect = 7.6
Identities = 49/253 (19%), Positives = 103/253 (40%), Gaps = 24/253 (9%)
Query: 317 PEVKKHIEDLDSAWDNVTALFAKREENLIHA---MEKAMEFHETLQRKGEQGTITALFAK 373
K+ +++L+ + A K EE L + + E L+R+ E+ K
Sbjct: 663 LAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELK 722
Query: 374 REENLIHAMEKAMEFH-----ETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQLA 428
RE + + ++ E L++ ++ ++ + ++ + SL E + + ++
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIE 782
Query: 429 EHEKFLRELAEKEIEKDATIGLAQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQRE 488
E E+ + L E+ E + + A+R L ++ + +E+ ++ E
Sbjct: 783 ELEEKRQALQEELEELEEELEEAERRLDA----LERELESLEQRRERLEQEI----EELE 834
Query: 489 ERLRNHLRSLQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEFMEAT 548
E + L +L+ LEEL + L + + L LEAE E L +E KE E
Sbjct: 835 EEIEELEEKLDELEEELEELEKELEELKEELEELEAE--------KEELEDELKELEEEK 886
Query: 549 SKRQHEVDSVRAS 561
+ + E+ + +
Sbjct: 887 EELEEELRELESE 899
>gnl|CDD|222407 pfam13833, EF_hand_6, EF-hand domain pair.
Length = 53
Score = 32.1 bits (74), Expect = 0.091
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 1/51 (1%)
Query: 664 DGLIPREDFVDGIIKTKFETSKLEMGAVADMFDHDYNPGLIDWKEFIAALR 714
GLI RE+ + S+ E+ + FD D G I ++EF L+
Sbjct: 2 KGLITREELKRALALLGISLSEEEVDILFREFDTD-GDGKISFEEFCVLLQ 51
Score = 27.5 bits (62), Expect = 3.4
Identities = 10/39 (25%), Positives = 16/39 (41%), Gaps = 5/39 (12%)
Query: 641 FLRFMNH-----KKSRLTDLFRKMDKNNDGLIPREDFVD 674
R + + + LFR+ D + DG I E+F
Sbjct: 10 LKRALALLGISLSEEEVDILFREFDTDGDGKISFEEFCV 48
>gnl|CDD|200946 pfam00036, efhand, EF hand. The EF-hands can be divided into two
classes: signaling proteins and buffering/transport
proteins. The first group is the largest and includes
the most well-known members of the family such as
calmodulin, troponin C and S100B. These proteins
typically undergo a calcium-dependent conformational
change which opens a target binding site. The latter
group is represented by calbindin D9k and do not undergo
calcium dependent conformational changes.
Length = 29
Score = 30.9 bits (71), Expect = 0.12
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 651 RLTDLFRKMDKNNDGLIPREDFVDGIIKT 679
L + F++ DK+ DG I E+F + + K
Sbjct: 1 ELKEAFKEFDKDGDGKISFEEFKELLKKL 29
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 35.1 bits (82), Expect = 0.16
Identities = 20/70 (28%), Positives = 30/70 (42%), Gaps = 8/70 (11%)
Query: 485 KQREERLRNHLRSLQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEF 544
Q+EE L L L+ + LE+ + L + + L E E +E LIEE +
Sbjct: 92 LQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE--------LEELIEEQLQE 143
Query: 545 MEATSKRQHE 554
+E S E
Sbjct: 144 LERISGLTAE 153
>gnl|CDD|205583 pfam13405, EF_hand_4, EF-hand domain.
Length = 30
Score = 30.3 bits (69), Expect = 0.18
Identities = 8/26 (30%), Positives = 13/26 (50%)
Query: 651 RLTDLFRKMDKNNDGLIPREDFVDGI 676
L + F+ DK+ DG I E+ +
Sbjct: 1 ELREAFKLFDKDGDGYISAEELRKAL 26
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 32.9 bits (76), Expect = 0.21
Identities = 19/95 (20%), Positives = 38/95 (40%), Gaps = 3/95 (3%)
Query: 9 SADYKVVKAQLQ-EQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQLNELMN 67
S K + QL+ E K + L + + L + + AA A R+ E++L + +
Sbjct: 13 SPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQ 72
Query: 68 RFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILD 102
F + Q + +Q ++ DK+ +
Sbjct: 73 EFQRKQQKLQQDLQKRQQEE--LQKILDKINKAIK 105
>gnl|CDD|217674 pfam03690, UPF0160, Uncharacterized protein family (UPF0160). This
family of proteins contains a large number of metal
binding residues. The patterns are suggestive of a
phosphoesterase function. The conserved DHH motif may
mean this family is related to pfam01368.
Length = 315
Score = 34.1 bits (79), Expect = 0.22
Identities = 26/99 (26%), Positives = 42/99 (42%), Gaps = 17/99 (17%)
Query: 150 DIASSLMGLVGEDEAAGVADKLQDTADRYGALVE---ASDNLGQYAFLYNQLILSPRFSS 206
+I L+ L+ E++ + DK+ Y + VE A DN G +Q P++
Sbjct: 93 EIIRELLKLLDEEDVELLYDKV------YKSFVEEIDAIDN-G-----VSQYDGEPKYKI 140
Query: 207 VTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKF 245
T + ++ RLN WNE D +A+ LA
Sbjct: 141 NTSLSARVGRLNPTWNEPDPEDEDAA--FLKAVELAGSE 177
>gnl|CDD|197492 smart00054, EFh, EF-hand, calcium binding motif. EF-hands are
calcium-binding motifs that occur at least in pairs.
Links between disease states and genes encoding
EF-hands, particularly the S100 subclass, are emerging.
Each motif consists of a 12 residue loop flanked on
either side by a 12 residue alpha-helix. EF-hands
undergo a conformational change unpon binding calcium
ions.
Length = 29
Score = 30.0 bits (69), Expect = 0.25
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 652 LTDLFRKMDKNNDGLIPREDFVDGIIKT 679
L + FR DK+ DG I E+F D +
Sbjct: 2 LKEAFRLFDKDGDGKIDFEEFKDLLKAL 29
>gnl|CDD|224380 COG1463, Ttg2C, ABC-type transport system involved in resistance to
organic solvents, periplasmic component [Secondary
metabolites biosynthesis, transport, and catabolism].
Length = 359
Score = 34.0 bits (78), Expect = 0.28
Identities = 28/156 (17%), Positives = 45/156 (28%), Gaps = 23/156 (14%)
Query: 197 QLILSPRFSSVTDIKKKL--ERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMA 254
+ L + + L +RLN + NE A G L L +F L +
Sbjct: 138 PIDLEDLLGDLLLLLGGLDPDRLNAILNEAAAALAGTGPQLNALLDNLAQFTDALNARDG 197
Query: 255 TLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEP 314
+ L NLN + + A ++ +D A L
Sbjct: 198 DIGALIANLN---------QLLDSLAAASDQLDRLLDNLATLTAALAARRDALDDA---- 244
Query: 315 DKPEVKKHIEDLDSAWDNVTALFAKREENLIHAMEK 350
+ L + V L A+ NL A+
Sbjct: 245 --------LAALSALAATVNDLLAENRPNLNQALAN 272
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 33.0 bits (76), Expect = 0.29
Identities = 20/95 (21%), Positives = 38/95 (40%), Gaps = 3/95 (3%)
Query: 9 SADYKVVKAQLQ-EQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQLNELMN 67
S K + QL+ E K L+ L ++ + Q + AA RKA +++L +
Sbjct: 30 SPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQ 89
Query: 68 RFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILD 102
+ A Q + +Q + + DK+ +
Sbjct: 90 ELQQKQQAAQQELQQKQQ--ELLQPIYDKIDKAIK 122
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 34.3 bits (79), Expect = 0.34
Identities = 23/84 (27%), Positives = 41/84 (48%)
Query: 474 QSRWEEVSSWAKQREERLRNHLRSLQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPT 533
+ R EE+ E+ + N L++L++ +EEL E L K E L +LEA IP
Sbjct: 736 KERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPE 795
Query: 534 VERLIEEHKEFMEATSKRQHEVDS 557
++ + + +E + R E++
Sbjct: 796 IQAELSKLEEEVSRIEARLREIEQ 819
Score = 34.3 bits (79), Expect = 0.35
Identities = 25/149 (16%), Positives = 55/149 (36%), Gaps = 9/149 (6%)
Query: 10 ADYKVVKAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPA--ERKAIERQLNELMN 67
+ + + +L E+ A+ + + L EV E K +L +L
Sbjct: 346 EEERKRRDKLTEE------YAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKR 399
Query: 68 RFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIKDMEL-IPTDEEKIQQ 126
+ L + + L++ + GI +++ E++ +D L I E K++Q
Sbjct: 400 EINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ 459
Query: 127 RIREHDALHKEILRKKPDFTELTDIASSL 155
+ +E+ K ++ + S L
Sbjct: 460 LAADLSKYEQELYDLKEEYDRVEKELSKL 488
Score = 33.9 bits (78), Expect = 0.37
Identities = 47/272 (17%), Positives = 107/272 (39%), Gaps = 38/272 (13%)
Query: 53 AERKAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIK 112
E R++ E+ + L + + + LE+ + ++ + L + E +I+
Sbjct: 709 QELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIE 768
Query: 113 DMELI-------------PTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLV 159
++E +I + E L +E+ R + E+ + L
Sbjct: 769 ELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEK 828
Query: 160 G--EDEAAGVADKLQDTADRYGALVEASDNL-----------GQYAFLYNQLILSPRFSS 206
E E + ++ D ++ ++ + +NL + L S
Sbjct: 829 EYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLE-----SR 883
Query: 207 VTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQ 266
+ D+KK+ + L E+++ + LE + K SEL+ A L L++ L+
Sbjct: 884 LGDLKKERDELEAQLRELERKIEE----LEAQIEKKRKRLSELK---AKLEALEEELSEI 936
Query: 267 EPPAVEPKAIQQQQYALKEIKAEIDQTKPEVE 298
E P E + I +++ +L++++AE+ + + E+
Sbjct: 937 EDPKGEDEEIPEEELSLEDVQAELQRVEEEIR 968
Score = 30.8 bits (70), Expect = 4.1
Identities = 56/351 (15%), Positives = 125/351 (35%), Gaps = 85/351 (24%)
Query: 209 DIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEP 268
++ +L R+ +E+ + +D R + E EK +L+ L++ + L
Sbjct: 692 SLQSELRRIENRLDELSQELSDASRKIGEI----EKEIEQLEQEEEKLKERLEELEED-- 745
Query: 269 PAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDL-- 326
+ + ++ +K+E+ + + +E+ D ++++ + DL
Sbjct: 746 -------LSSLEQEIENVKSELKELEARIEELEE------------DLHKLEEALNDLEA 786
Query: 327 ---DSAWDNVTALFAKREENLIHAMEKAMEFHETLQRKGEQGTITALFAKREENLIHAME 383
S + A +K EE + + E + L R +E L E
Sbjct: 787 RLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRL----------TLEKEYL----E 832
Query: 384 KAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQLAEHEKFLRELAE--KE 441
K ++ LQ+ R D K+ +++ + +L +E +L E E LR+L +
Sbjct: 833 KEIQ---ELQEQRIDLKE---QIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886
Query: 442 IEKDATIGLAQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRSLQDL 501
++K+ + E +++ E L + +
Sbjct: 887 LKKE----------------------------RDELEAQLRELERKIEELEAQIEKKRKR 918
Query: 502 DSLLEELLEWLAKCESHLLNLEAEPLPD-----DIPTVERLIEEHKEFMEA 547
S L+ LE L + S + + + E + V+ ++ +E + A
Sbjct: 919 LSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRA 969
>gnl|CDD|224553 COG1638, DctP, TRAP-type C4-dicarboxylate transport system,
periplasmic component [Carbohydrate transport and
metabolism].
Length = 332
Score = 33.4 bits (77), Expect = 0.41
Identities = 14/67 (20%), Positives = 31/67 (46%), Gaps = 6/67 (8%)
Query: 81 DALEQAMAVAKQFQDKLTGILDWLDKSEKKIKD--MELIPTDEEKIQQRIREHDALHKEI 138
L +A A + Q KL L+ D+ +K+K+ +E++ D + + + ++ E
Sbjct: 256 TILLEAAKEAAEEQRKLVEELE--DELLEKLKEAGVEVVEPDAAEAFREAAK--PVYDEF 311
Query: 139 LRKKPDF 145
+K +
Sbjct: 312 AKKAGEI 318
>gnl|CDD|238170 cd00272, Chemokine_CC, Chemokine_CC: 1 of 4 subgroup designations
based on the arrangement of the two N-terminal cysteine
residues; includes a number of secreted growth factors
and interferons involved in mitogenic, chemotactic, and
inflammatory activity; some members (e.g. 2HCC) contain
an additional disulfide bond which is thought to
compensate for the highly conserved Trp missing in
these; chemotatic for monocytes, macrophages,
eosinophils, basophils, and T cells, but not
neutrophils; exist as monomers and dimers, but are
believed to be functional as monomers; found only in
vertebrates and a few viruses; a subgroup of CC,
identified by an N-terminal DCCL motif (Exodus-1,
Exodus-2, and Exodus-3), has been shown to inhibit
specific types of human cancer cell growth in a mouse
model. See CDs: Chemokine (cd00169) for the general
alignment of chemokines, or Chemokine_CXC (cd00273),
Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for
the additional chemokine subgroups, and
Chemokine_CC_DCCL for the DCCL subgroup of this CD.
Length = 57
Score = 29.9 bits (68), Expect = 0.47
Identities = 5/20 (25%), Positives = 14/20 (70%)
Query: 308 MKICGEPDKPEVKKHIEDLD 327
++C +P + V+++++ LD
Sbjct: 38 REVCADPKQKWVQRYMKLLD 57
>gnl|CDD|197477 smart00027, EH, Eps15 homology domain. Pair of EF hand motifs that
recognise proteins containing Asn-Pro-Phe (NPF)
sequences.
Length = 96
Score = 31.1 bits (71), Expect = 0.50
Identities = 21/70 (30%), Positives = 31/70 (44%), Gaps = 11/70 (15%)
Query: 649 KSRLTDLFRKMDKNNDGLIPREDFVDG-IIKTKFETSKL---EMGAVADMFDHDYNPGLI 704
K++ +FR +DKN DG V G K S L + + ++ D D N G +
Sbjct: 9 KAKYEQIFRSLDKNQDG------TVTGAQAKPILLKSGLPQTLLAKIWNLADID-NDGEL 61
Query: 705 DWKEFIAALR 714
D EF A+
Sbjct: 62 DKDEFALAMH 71
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 33.5 bits (76), Expect = 0.57
Identities = 75/394 (19%), Positives = 157/394 (39%), Gaps = 73/394 (18%)
Query: 56 KAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGIL-DWLDKSEKKIKDM 114
+ + + + N+ + L E ++ +D ++++ + K ++DK L D +++ +K KD
Sbjct: 927 ENTKESIEKFHNKQNILKEILNKNIDTIKESNLIEKSYKDKFDNTLIDKINELDKAFKDA 986
Query: 115 ELIPTDEEKIQQRIREHDALHKEILRKKPDFTEL--TDIASSLMGLVGEDEAAGVADKLQ 172
L +++A + E+++ D + + L E E A D Q
Sbjct: 987 SL------------NDYEAKNNELIKYFNDLKANLGKNKENMLYHQFDEKEKA-TNDIEQ 1033
Query: 173 DTADR---------------YGALVEASDNLGQYAFLYNQLILSPRFSSVTD---IKKKL 214
D Y + E +G+ L N+ IL ++T+ IK+KL
Sbjct: 1034 KIEDANKNIPNIEIAIHTSIYNIIDEIEKEIGKNIELLNKEILEEAEINITNFNEIKEKL 1093
Query: 215 ERLN----------GLWNEVQKATND----------RGRSLEEALALAEKFWSELQSVMA 254
+ N +E+ K +D ++LEE +E + E+++ +
Sbjct: 1094 KHYNFDDFGKEENIKYADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQIN 1153
Query: 255 TLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEP 314
L D+ D S + +P+ I+++ + I +ID+ K ++ + ++ +I E
Sbjct: 1154 DLEDVADKAISND----DPEEIEKKI---ENIVTKIDKKKNIYDEIKKLLNEIAEI--EK 1204
Query: 315 DKPEVKKHIEDLDSAWDNVTALF-------AKREENLIHAMEKAMEFHETLQRKG---EQ 364
DK +++ S N+ LF K+ E++I AME +E + ++ K E
Sbjct: 1205 DKTSLEEVKGINLSYGKNLGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIEN 1264
Query: 365 GTITALFAKREENLIHAMEKAMEFHETLQQNRDD 398
+ K E + + H + + D+
Sbjct: 1265 EMGIEMDIKAEMETFNISHDDDKDHHIISKKHDE 1298
>gnl|CDD|213859 TIGR03767, P_acnes_RR, metallophosphoesterase, PPA1498 family.
This model describes a small collection of probable
metallophosphoresterases, related to pfam00149 but with
long inserts separating some of the shared motifs such
that the homology is apparent only through multiple
sequence alignment. Members of this protein family, in
general, have a Sec-independent TAT (twin-arginine
translocation) signal sequence, N-terminal to the region
modeled by This model. Members include YP_056203.1 from
Propionibacterium acnes KPA171202.
Length = 496
Score = 32.9 bits (75), Expect = 0.63
Identities = 16/60 (26%), Positives = 27/60 (45%), Gaps = 7/60 (11%)
Query: 20 QEQ-KFLKKMLADRQHSMSSLFQ------MGNEVAANADPAERKAIERQLNELMNRFDNL 72
Q Q K++K L ++ LF M NE+ DP E++ + +L L+ N+
Sbjct: 322 QTQFKWIKDTLRASSDTLFVLFSHHTSWSMVNELTDPVDPGEKRHLGTELVSLLLEHPNV 381
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 33.2 bits (76), Expect = 0.67
Identities = 64/307 (20%), Positives = 117/307 (38%), Gaps = 36/307 (11%)
Query: 139 LRKKPDFTELTDIASSLMGLVGEDEAAGVADKL------QDTADRYGALVEASDNLGQYA 192
L KKPD L+D + + + +DE VAD L +D +R AL A N Q
Sbjct: 197 LSKKPDEGVLSDALTEALPPLDDDELTDVADALEQLDEYRDELERLEALERALRNFLQRY 256
Query: 193 FLYNQLILSPRFSSVTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSV 252
Y + T ++++ RL + + + D GR+ + L A + EL +
Sbjct: 257 RRYAR----------TMLRRRATRLRSAQTQYDQLSRDLGRA-RDELETAREEERELDAR 305
Query: 253 MATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKIC- 311
L D L ++ A Q + E+++ + + E +A+ +
Sbjct: 306 TEALEREADALRTRLEALQGSPAYQDAE--------ELERARADAEALQAAAADARQAIR 357
Query: 312 -GEPDKPEVKKHIEDLDS-AWDNVTALFAKREENLIHAMEKAMEFHET-----LQRKGEQ 364
E E ++ +++ D L A RE+ A + T E
Sbjct: 358 EAESRLEEERRRLDEEAGRLDDAERELRAAREQLARAAERAGLSPAHTAEPDAALAAQEL 417
Query: 365 GTITALFAKREENLIHAMEKAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPE--DDQE 422
+ AL A+R++ +++ E L++ D +A+ A + + L E +
Sbjct: 418 QELGALDARRQDADRVIAQRS-EQVALLRRRDDVADRAEATHAAARARRDELDEEAEQAA 476
Query: 423 ARTQLAE 429
AR +LA+
Sbjct: 477 ARAELAD 483
>gnl|CDD|180401 PRK06105, PRK06105, aminotransferase; Provisional.
Length = 460
Score = 33.0 bits (76), Expect = 0.69
Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 2/42 (4%)
Query: 459 HPDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRSLQD 500
HP A V + II+ R ++ A +R RL+ LR+L D
Sbjct: 328 HPVAAAVALENLAIIEER--DLVGNAAERGARLQARLRALAD 367
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 31.9 bits (73), Expect = 1.3
Identities = 30/159 (18%), Positives = 59/159 (37%), Gaps = 34/159 (21%)
Query: 45 EVAANADPAERKAIERQLNELMNRFDNLN-----------------EGASQRMDALEQAM 87
++ + DPA + I+ Q + +R L G ++ AL Q +
Sbjct: 115 DLLSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQL 174
Query: 88 AVAKQ--------FQDKLTGILDWLDKSEKKIKDMEL-IPTDEEKIQQRIREHDALHKEI 138
V + + L L+ L+ E++ + + + E +++ R+ D L E
Sbjct: 175 EVISEELEARRKLKEKGLVSRLELLE-LERERAEAQGELGRLEAELEVLKRQIDELQLER 233
Query: 139 LRKKPDF-----TELTDIASSLMGLVGEDEAAGVADKLQ 172
+ + F ELT+ + L L + D+LQ
Sbjct: 234 QQIEQTFREEVLEELTEAQARLAEL--RERLNKARDRLQ 270
>gnl|CDD|235502 PRK05560, PRK05560, DNA gyrase subunit A; Validated.
Length = 805
Score = 31.5 bits (73), Expect = 1.9
Identities = 25/87 (28%), Positives = 36/87 (41%), Gaps = 23/87 (26%)
Query: 50 ADPAERKAIERQLNELMNRFDNLNEGASQRMDALEQAMA-VAKQFQDKLTG-----ILDW 103
PAE K LM RF L+E QA A + + Q +LTG I D
Sbjct: 396 PTPAEAKE------GLMERFG-LSE---------IQAQAILDMRLQ-RLTGLERDKIEDE 438
Query: 104 LDKSEKKIKDMELIPTDEEKIQQRIRE 130
+ I D++ I E++ + I+E
Sbjct: 439 YKELLALIADLKDILASPERLLEIIKE 465
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 31.3 bits (71), Expect = 2.2
Identities = 42/176 (23%), Positives = 71/176 (40%), Gaps = 50/176 (28%)
Query: 155 LMGLVGEDEAAGVADKLQDTADRYG---ALVE-ASDNLGQYAFLYNQLILSPRFSSVTDI 210
L G+ GE A + A RYG ++E A G++ N LI ++ +
Sbjct: 476 LKGIPGESYAF-------EIAQRYGIPHFIIEQAKTFYGEFKEEINVLI-----EKLSAL 523
Query: 211 KKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPPA 270
+K+LE+ N LE+ L EK EL+ M L++ + N
Sbjct: 524 EKELEQKN--------------EHLEKLLKEQEKLKKELEQEMEELKERERNK------- 562
Query: 271 VEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDL 326
+ + ++ Q ALK +K E++ E+++ + K + K IEDL
Sbjct: 563 -KLELEKEAQEALKALKKEVESIIRELKE------------KKIHKAKEIKSIEDL 605
>gnl|CDD|225425 COG2870, RfaE, ADP-heptose synthase, bifunctional sugar
kinase/adenylyltransferase [Cell envelope biogenesis,
outer membrane].
Length = 467
Score = 31.1 bits (71), Expect = 2.3
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 713 LRPDWEEKKPNTESEKIHDEVKRLVQ 738
LR D+EEK P + K+ +++K ++
Sbjct: 117 LRLDFEEKFPIEDENKLLEKIKNALK 142
>gnl|CDD|205383 pfam13202, EF_hand_3, EF hand.
Length = 25
Score = 27.2 bits (62), Expect = 2.4
Identities = 10/19 (52%), Positives = 13/19 (68%)
Query: 654 DLFRKMDKNNDGLIPREDF 672
DLFR+ D N DG I +E+
Sbjct: 3 DLFRQFDTNGDGKISKEEL 21
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 31.1 bits (70), Expect = 2.4
Identities = 46/253 (18%), Positives = 106/253 (41%), Gaps = 22/253 (8%)
Query: 56 KAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIKDME 115
+A++ N+ N + + + + + LE+ + + ++++ + +D+ K+++
Sbjct: 305 RALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQG 364
Query: 116 LIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADKLQDTA 175
+ E + Q E + L +E+ + +LT S + EA G+ L+ T
Sbjct: 365 ISTEQFELMNQ---EREKLTRELDKINIQSDKLTKSVKSR-----KLEAQGIFKSLEKTL 416
Query: 176 DRYGALVEA--------SDNLGQYAFLYNQLILSPRFSSVTD-IKKKLERLNGLWNEVQK 226
+Y +L++ N+ + N L P+ S + + IKK + LN E K
Sbjct: 417 RQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLFPKGSGINESIKKSILELNDEIQERIK 476
Query: 227 ATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEI 286
++ +LEE + + +EL ++ L+ L+ K +++ L
Sbjct: 477 TEENKSITLEEDIKNLKHDINELTQIL---EKLELELSEANSKFELSKEENERE--LVAQ 531
Query: 287 KAEIDQTKPEVEQ 299
+ EI++ + E+
Sbjct: 532 RIEIEKLEKELND 544
>gnl|CDD|237558 PRK13920, PRK13920, putative anti-sigmaE protein; Provisional.
Length = 206
Score = 30.6 bits (69), Expect = 2.5
Identities = 10/38 (26%), Positives = 17/38 (44%)
Query: 233 RSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPPA 270
+E AL + W+EL+++ L L + L P
Sbjct: 15 ARVEAALEAYPELWAELRALQEALAALAEGLPPVPVPP 52
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 31.2 bits (71), Expect = 2.5
Identities = 50/249 (20%), Positives = 97/249 (38%), Gaps = 56/249 (22%)
Query: 323 IEDLDSAWDNVTAL-------------FAKREEN---LIHAMEKAMEFHETLQRKGEQGT 366
++D ++A+ N+ + F KR EN LI EK E E L+ E
Sbjct: 157 LDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEK--ELEEVLREINE--- 211
Query: 367 ITALFAKREENLIHAMEKAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQ 426
+ L +EK + + L++ +++ ++ + ++++ L E +E +
Sbjct: 212 ----ISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEER 267
Query: 427 LAEHEKFLRELAEK-----EIEKDATIGLAQRILVKSHPDGATVIKHWITIIQSRWEEVS 481
+ E +K + EL EK E+++ A L + + +
Sbjct: 268 IEELKKEIEELEEKVKELKELKEKAE---EYIKLSE---------------FYEEYLDEL 309
Query: 482 SWAKQREERLRNHLRSLQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEH 541
++R RL + +++ LEE E L + + L LE +E L E H
Sbjct: 310 REIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKR--------LEELEERH 361
Query: 542 KEFMEATSK 550
+ + EA +K
Sbjct: 362 ELYEEAKAK 370
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 30.7 bits (70), Expect = 3.3
Identities = 34/198 (17%), Positives = 75/198 (37%), Gaps = 33/198 (16%)
Query: 91 KQFQDKLTGILDWLDKSEKKIKD-MELIPTDEEKIQQRIREHDALHKEILRKKPDFTELT 149
++ ++L ++D + +K + +E I E++ + + + L + L
Sbjct: 281 QELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALL-------EALE 333
Query: 150 DIASSLMGLVGE-----------DEAAGVADKLQDTADRYGALV----EASDNLGQYAFL 194
+I + + E + + + + D D L+ E DNL +
Sbjct: 334 EILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEK-- 391
Query: 195 YNQLILSPRFSSVTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMA 254
N+ V ++K+ ++ ++KA N SLE+ + E L+
Sbjct: 392 -NKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAIN----SLEKEIKQLEAEIKALE---K 443
Query: 255 TLRDLQDNLNSQEPPAVE 272
+++L+ L + EP A E
Sbjct: 444 EIKELEKQLTNIEPTADE 461
>gnl|CDD|173906 cd00812, LeuRS_core, catalytic core domain of leucyl-tRNA
synthetases. Leucyl tRNA synthetase (LeuRS) catalytic
core domain. This class I enzyme is a monomer which
aminoacylates the 2'-OH of the nucleotide at the 3' of
the appropriate tRNA. The core domain is based on the
Rossman fold and is responsible for the ATP-dependent
formation of the enzyme bound aminoacyl-adenylate. It
contains the characteristic class I HIGH and KMSKS
motifs, which are involved in ATP binding. In Aquifex
aeolicus, the gene encoding LeuRS is split in two, just
before the KMSKS motif. Consequently, LeuRS is a
heterodimer, which likely superimposes with the LeuRS
monomer found in most other organisms. LeuRS has an
insertion in the core domain, which is subject to both
deletions and rearrangements and thus differs between
prokaryotic LeuRS and archaeal/eukaryotic LeuRS. This
editing region hydrolyzes mischarged cognate tRNAs and
thus prevents the incorporation of chemically similar
amino acids.
Length = 314
Score = 30.7 bits (70), Expect = 3.4
Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 3/67 (4%)
Query: 274 KAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPD-KPEVKKHIEDLDSAWDN 332
K Q L E K + + V C+ Q +K E + K ++ K +E LD +
Sbjct: 101 KFTQWLFLKLYE-KGLAYKKEAPVNWCKLLDQWFLKY-SETEWKEKLLKDLEKLDGWPEE 158
Query: 333 VTALFAK 339
V A+
Sbjct: 159 VRAMQEN 165
>gnl|CDD|182623 PRK10657, PRK10657, isoaspartyl dipeptidase; Provisional.
Length = 388
Score = 30.6 bits (70), Expect = 3.4
Identities = 14/45 (31%), Positives = 18/45 (40%), Gaps = 9/45 (20%)
Query: 202 PRFSSVTDIKKKLERL-----NGLWNEVQKATNDRGRSLEEALAL 241
P+F D L L L EV++ D G LE+AL
Sbjct: 291 PKF----DEDGNLVGLGVGSVESLLEEVRELVKDEGLPLEDALKP 331
>gnl|CDD|197570 smart00199, SCY, Intercrine alpha family (small cytokine C-X-C)
(chemokine CXC). Family of cytokines involved in
cell-specific chemotaxis, mediation of cell growth, and
the inflammatory response.
Length = 59
Score = 27.6 bits (62), Expect = 3.5
Identities = 6/18 (33%), Positives = 12/18 (66%)
Query: 309 KICGEPDKPEVKKHIEDL 326
K+C +P V+++I+ L
Sbjct: 42 KVCADPKAKWVQRYIKKL 59
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 30.8 bits (70), Expect = 3.6
Identities = 67/347 (19%), Positives = 136/347 (39%), Gaps = 62/347 (17%)
Query: 201 SPRFSSVTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQ 260
SP ++ + ++++E L ++++ + LE A L E + + L + +
Sbjct: 464 SPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVE-----AEDRIERLEERR 518
Query: 261 DNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVK 320
++L +E A + I++++ +E++ + + E E+ R + +
Sbjct: 519 EDL--EELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAE------------ 564
Query: 321 KHIEDLDSAWDNVTALFAKREEN--LIHAMEKAMEFHETLQRKGEQGTITALFAKREENL 378
E+ + A + V L +K E I ++E+ + ++ I L KRE
Sbjct: 565 ---EEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDE--IERLREKRE--- 616
Query: 379 IHAMEKAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQLAEHEKFLRELA 438
E E E L + R+ +K + A+ + + ED + A L + E+ L EL
Sbjct: 617 -ALAELNDERRERLAEKRE--RKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELR 673
Query: 439 EKEIEKDATIGLAQRILVKSHPDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRSL 498
E+ + A IG + L + E LR +L
Sbjct: 674 EERDDLQAEIGAVENEL------------------------------EELEELRERREAL 703
Query: 499 QDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEFM 545
++ LE L + + ES +L AE ++ T+ER++ E + +
Sbjct: 704 ENRVEALEALYDEAEELESMYGDLRAELRQRNVETLERMLNETFDLV 750
Score = 30.8 bits (70), Expect = 4.1
Identities = 70/352 (19%), Positives = 131/352 (37%), Gaps = 75/352 (21%)
Query: 213 KLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRD-LQDNLNSQEPPAV 271
+ E L L E++ + E LAE+ +L+ + L + D L
Sbjct: 249 RREELETLEAEIEDLRETIAETEREREELAEEV-RDLRERLEELEEERDDLLAEAGLDDA 307
Query: 272 EPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLMKICGEPDKPEVKKHIEDLDSAWD 331
+ +A++ ++ +E++ ++ + +E+CR + + H E+ +S +
Sbjct: 308 DAEAVEARR---EELEDRDEELRDRLEECRVA---------------AQAHNEEAESLRE 349
Query: 332 NVTALFAKREENLIHAMEKAMEFHETLQRKGEQGTITALFAKREENLIHAMEKAMEFHET 391
+ L EE E+A E L+ E R E + E E
Sbjct: 350 DADDL----EERAEELREEAAELESELEEAREA------VEDRREEI-------EELEEE 392
Query: 392 LQQNRDDCKKADCNADAVQTFVNSLPEDDQEARTQLAEHEKFLRELAEKEIEKDATIGLA 451
+++ R+ A + + F+ L E+ E R + AE E LR A + +E+ A
Sbjct: 393 IEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRT-ARERVEE------A 445
Query: 452 QRIL-----------VKSHPDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRS-LQ 499
+ +L V+ P T I+ E V + E+ L ++
Sbjct: 446 EALLEAGKCPECGQPVEGSPHVET--------IEEDRERVEELEAELED-----LEEEVE 492
Query: 500 DLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEFMEATSKR 551
+++ LE E L + E E L + +E LI E +E +E +R
Sbjct: 493 EVEERLERA-EDLVEAED-----RIERLEERREDLEELIAERRETIEEKRER 538
Score = 29.6 bits (67), Expect = 9.1
Identities = 107/525 (20%), Positives = 206/525 (39%), Gaps = 56/525 (10%)
Query: 50 ADPAER-KAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSE 108
D ER +E +L EL + E Q + ++A V ++ +++ + + E
Sbjct: 202 KDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREEL----ETLE 257
Query: 109 KKIKDMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAA--G 166
+I+D+ E I + RE + L +E+ + EL + L+ G D+A
Sbjct: 258 AEIEDLR------ETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEA 311
Query: 167 VADKLQDTADRYGALVEASDNLGQYAFLYNQLILSPRFSSVTDIKKKLERLNGLWNEVQK 226
V + ++ DR L + + A +N+ S R D++++ E L E++
Sbjct: 312 VEARREELEDRDEELRDRLEECRVAAQAHNEEAESLR-EDADDLEERAEELREEAAELES 370
Query: 227 ATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEI 286
+ ++E+ E+ E++ + D +L + E + +++++ L+E
Sbjct: 371 ELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAED---FLEELREERDELRER 427
Query: 287 KAEIDQTKPEVEQCRASGQKLMKI-----CGEPDKPEVKKHIEDLDSAWDNVTALFAKRE 341
+AE++ T + + L++ CG+P + H+E ++ + V L A+ E
Sbjct: 428 EAELEATLRTARERVEEAEALLEAGKCPECGQPVEGS--PHVETIEEDRERVEELEAELE 485
Query: 342 --ENLIHAMEKAMEFHETLQRKGEQGTITALFAKREENLIHAMEKAMEFHETLQQNRDDC 399
E + +E+ +E E L ++ + EE E E ET+++ R+
Sbjct: 486 DLEEEVEEVEERLERAEDLVEAEDR------IERLEERREDLEELIAERRETIEEKRERA 539
Query: 400 KKADCNADAVQTFVNSLPEDDQEARTQLAEHEKFLRELAEKEIEKDATIGLAQRILVKSH 459
++ A ++ E EA + E + + EL K E I +RI
Sbjct: 540 EELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERI----- 594
Query: 460 PDGATVIKHWITIIQSRWEEVSSWAKQREERLRNHLRSLQDLDSLLEELL---------- 509
I I+ E+ + A+ +ER R L ++ LE
Sbjct: 595 RTLLAAIADAEDEIERLREKREALAELNDER-RERLAEKRERKRELEAEFDEARIEEARE 653
Query: 510 ------EWLAKCESHLLNLEAE--PLPDDIPTVERLIEEHKEFME 546
E+L + E L L E L +I VE +EE +E E
Sbjct: 654 DKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEELRE 698
>gnl|CDD|220979 pfam11101, DUF2884, Protein of unknown function (DUF2884). Some
members in this bacterial family of proteins are
annotated as YggN which currently has no known function.
Length = 228
Score = 30.0 bits (68), Expect = 3.6
Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 11/93 (11%)
Query: 21 EQKFLKKMLADRQHSMSSLFQ-MGNEVAANADPAERKAIERQLNELMNRFDNLNEGASQR 79
+++F + + Q S+ S+ +G+E+ ++ L R D L + +
Sbjct: 133 DEEFEQAIEQLVQQSIGSILIALGSEMMSSEG-------GNNLTAFEQRMDGLGQEIEEE 185
Query: 80 MDALEQAMAVAKQFQDKLTGILDWLDKSEKKIK 112
M++ QA+ + D+L L+ LD+ E++++
Sbjct: 186 MESQSQAL---EAKADELCDRLEGLDEQEQQLQ 215
>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
Length = 857
Score = 30.6 bits (69), Expect = 3.9
Identities = 28/95 (29%), Positives = 41/95 (43%), Gaps = 21/95 (22%)
Query: 210 IKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSELQSVMATLRDLQDNLNSQEPP 269
I+ KLE+ L E +A+ R L E L+ E+ +SEL+
Sbjct: 420 IQLKLEQ-QALMKESDEASKKRLDMLNEELSDKERQYSELEE------------------ 460
Query: 270 AVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASG 304
E KA + + IKAE++Q K +EQ R G
Sbjct: 461 --EWKAEKASLSGTQTIKAELEQAKIAIEQARRVG 493
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 30.4 bits (69), Expect = 4.4
Identities = 38/209 (18%), Positives = 79/209 (37%), Gaps = 43/209 (20%)
Query: 106 KSEKKIKDMELIPTDEEKIQQRIREHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAA 165
++E K + T + IQ+ + + L +E+L++ + ++ S L+
Sbjct: 96 EAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQELVSIFNDLI------ 149
Query: 166 GVADKLQDTADRYGALVE-ASDNLGQYAFLYNQLILSPRFSSVTDIKKKLERLNGLWNEV 224
+ K + D +L+ A + L Q + + ++K + E L +
Sbjct: 150 -DSIKEDNLKDDLESLIASAKEELDQLS------------KKLAELKAEEEEE--LERAL 194
Query: 225 QKATNDRGRSLEEAL---------ALAEKFWSELQSVMATLR-----DLQDNLNSQEPP- 269
++ + LEE L AL ++ E + LR L+ L Q
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAH 254
Query: 270 ------AVEPKAIQQQQYALKEIKAEIDQ 292
+ +AI+ Q+ KEIK ++++
Sbjct: 255 EQKLKNELALQAIELQREFNKEIKEKVEE 283
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 30.4 bits (69), Expect = 5.0
Identities = 16/129 (12%), Positives = 42/129 (32%), Gaps = 4/129 (3%)
Query: 15 VKAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQLNELMNRFDNLNE 74
++ + + LKK+L + + N ++ E +
Sbjct: 923 LEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTI 982
Query: 75 GASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIKDMELIPTDEEKIQQRIREHDAL 134
+ A + K F+ +L + ++ K ++ +P + ++Q + +
Sbjct: 983 LVREGNKANSE----LKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKIISSE 1038
Query: 135 HKEILRKKP 143
E+ KP
Sbjct: 1039 STELSILKP 1047
>gnl|CDD|222707 pfam14357, DUF4404, Domain of unknown function (DUF4404). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved residues (P and G)
that may be functionally important.
Length = 85
Score = 27.6 bits (62), Expect = 5.7
Identities = 14/56 (25%), Positives = 18/56 (32%)
Query: 489 ERLRNHLRSLQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEF 544
E+L L LD LL L L E +D +RL + F
Sbjct: 7 EQLHEELEQTDSLDEETRALLSELIDDIEAQLAEEDATEAEDESLADRLNLAVERF 62
>gnl|CDD|130403 TIGR01336, psaB, photosystem I core protein PsaB. The core
proteins of photosystem I are PsaA and PsaB, homologous
integral membrane proteins that form a heterodimer. The
heterodimer binds the electron-donating chlorophyll
dimer P700, as well as chlorophyll, phylloquinone, and
4FE-4S electron acceptors. This model describes PsaB
only [Energy metabolism, Photosynthesis].
Length = 734
Score = 29.9 bits (67), Expect = 6.1
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 473 IQSRWEEVSSWAKQREERLRNHLRSLQDLDSL 504
+Q +++ SW K E RL +HL L + SL
Sbjct: 156 LQPKFKPSLSWFKNAESRLNHHLSGLFGVSSL 187
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 30.2 bits (68), Expect = 6.7
Identities = 22/113 (19%), Positives = 45/113 (39%), Gaps = 2/113 (1%)
Query: 34 HSMSSLFQMGNEVAANADPAERKAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQF 93
++S+ + N V N E I + L N + L E + ++E + K
Sbjct: 505 QIINSIKEKNNIVNNNFKNIEDYYI--TIEGLKNEIEGLIELIKYYLQSIETLIKDEKLK 562
Query: 94 QDKLTGILDWLDKSEKKIKDMELIPTDEEKIQQRIREHDALHKEILRKKPDFT 146
+ I + + E+ + ++ I + ++I I++ + L E L K F
Sbjct: 563 RSMKNDIKNKIKYIEENVDHIKDIISLNDEIDNIIQQIEELINEALFNKEKFI 615
>gnl|CDD|237103 PRK12447, PRK12447, histidinol dehydrogenase; Reviewed.
Length = 426
Score = 29.5 bits (67), Expect = 7.2
Identities = 39/123 (31%), Positives = 49/123 (39%), Gaps = 32/123 (26%)
Query: 153 SSLMGLVGEDEAAG------VADKLQDTADRYGALVEASDNLGQYAFLYNQLILSPRFSS 206
L G VG D AG +AD DTAD LV A+D LGQ N SP
Sbjct: 214 RQLFGRVGIDLFAGPTETLVIAD---DTAD--PELV-ATDLLGQAEHGPN----SPAVL- 262
Query: 207 VTDIKK----KLERLNGLWNEV------QKATNDRG-----RSLEEALALAEKFWSELQS 251
+T +K L + L + A D G LEE +A A+++ SE
Sbjct: 263 ITTSRKLAEEVLAEIERLLAILPTADVASAAWRDYGEVILCDDLEEMVAEADRYASEHVQ 322
Query: 252 VMA 254
VM
Sbjct: 323 VMT 325
>gnl|CDD|176995 CHL00054, psaB, photosystem I P700 chlorophyll a apoprotein A2.
Length = 734
Score = 29.7 bits (67), Expect = 7.3
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 473 IQSRWEEVSSWAKQREERLRNHLRSLQDLDSL 504
+Q +W+ SW K E RL +HL L + SL
Sbjct: 157 LQPKWKPSVSWFKNAESRLNHHLSGLFGVSSL 188
>gnl|CDD|149800 pfam08854, DUF1824, Domain of unknown function (DUF1824). This
uncharacterized family of proteins are principally
found in cyanobacteria.
Length = 125
Score = 28.3 bits (64), Expect = 7.5
Identities = 14/54 (25%), Positives = 26/54 (48%), Gaps = 4/54 (7%)
Query: 45 EVAANADPAERKAIERQLNELMNRFDNLNEGASQRM-DALEQAMAVAKQFQDKL 97
+A PA+RK + ++L LM+ D G M + E+A+ + ++ L
Sbjct: 11 RIAPQLSPAQRKKLRQELVALMDNADWFTIGI---MAPSAEEAINALRSYEAAL 61
>gnl|CDD|227204 COG4867, COG4867, Uncharacterized protein with a von Willebrand
factor type A (vWA) domain [General function prediction
only].
Length = 652
Score = 29.6 bits (66), Expect = 7.7
Identities = 17/65 (26%), Positives = 26/65 (40%)
Query: 33 QHSMSSLFQMGNEVAANADPAERKAIERQLNELMNRFDNLNEGASQRMDALEQAMAVAKQ 92
Q S + F MG A AD AE + + QL++ +L+ A Q D +
Sbjct: 292 QFSGDNPFGMGEGTQALADIAELEQLAEQLSQSYPGDVDLDALARQLGDQAAVDARTLAE 351
Query: 93 FQDKL 97
+ L
Sbjct: 352 LERAL 356
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 29.4 bits (66), Expect = 8.6
Identities = 28/197 (14%), Positives = 59/197 (29%), Gaps = 22/197 (11%)
Query: 14 VVKAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPAERKAIERQLNELMNRFDNLN 73
++ +LQE L ++ + + E ++A+E + L L+
Sbjct: 93 ELRERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALS 152
Query: 74 EGASQRMDALEQAMAVAKQFQDKLTGILDWLDKSEKKIKDMELIPTDEEKIQQRIREHDA 133
++ L + K IL + E++ D +Q + E +
Sbjct: 153 REVGKQRGLLSR-----GLATFKRDRILQQQWREEQEKYDAADKARAIYALQTKADERNL 207
Query: 134 LHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADKLQDTADRYGALVEASDNLGQYAF 193
+ DF + V E E V ++++ L +
Sbjct: 208 ETVLQSLSQADFQ---------LAGVAEKELETVEARIKEARYEIEEL--------ENKL 250
Query: 194 LYNQLILSPRFSSVTDI 210
N I+S V ++
Sbjct: 251 NLNTRIVSQHSGRVIEL 267
>gnl|CDD|184793 PRK14696, tynA, tyramine oxidase; Provisional.
Length = 721
Score = 29.4 bits (66), Expect = 9.0
Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 1/37 (2%)
Query: 527 LPDDIPTVERLIEEHKEFMEATSKRQ-HEVDSVRASP 562
L DD +V+ +I +EF A KR +V V +P
Sbjct: 182 LLDDFASVQNIINNSEEFAAALKKRGITDVKKVITTP 218
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 29.3 bits (66), Expect = 9.1
Identities = 95/569 (16%), Positives = 178/569 (31%), Gaps = 85/569 (14%)
Query: 16 KAQLQEQKFLKKMLADRQHSMSSLFQMGNEVAANADPAER-KAIERQLNELMNRFDNLNE 74
K +L+ + L +LA+R+ L +G + D ER + +L + L E
Sbjct: 204 KQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELRAARRNLELLTE 263
Query: 75 GASQRMDALEQAMAV------AKQFQDKLTGILDWLDKSEKKIKDMELIPTDEEKIQQRI 128
R++AL+ + + + + +D+ E +I
Sbjct: 264 ----RLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNAEQDLPDR---EGEIANAR 316
Query: 129 REHDALHKEILRKKPDFTELTDIASSLMGLVGEDEAAGVADKLQDTADRYGALVEASDNL 188
AL +I PD E + S L + + + R AL +A +
Sbjct: 317 EAAAALLAQI---GPDADE-EAVESLRPSLA-------AKETVTELEKRKEALDQALKSA 365
Query: 189 GQYAFLYNQLILSPRFSSVTDIKKKLERLNGLWNEVQKATNDRGRSLEEALALAEKFWSE 248
++ + +++L+++ + T L ALA A +
Sbjct: 366 RD---------------ALEERERELKQVRAQLAALP--TVTVPEPLRAALADALRAGDI 408
Query: 249 LQSVMATLRDLQDNLNSQEPPAVEPKAIQQQQYALKEIKAEIDQTKPEVEQCRASGQKLM 308
++ A +++ AV + + Q L + ++++ Q +
Sbjct: 409 DAALAALEQEV----------AVAKRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFL 458
Query: 309 KICGEPDKPEVKKHIEDLDSAWDNVTALFAKREENLIHAMEKAM-----EFHETLQRKGE 363
K + E+ + + E L H E R+
Sbjct: 459 K-----EAEEIMQAKRQKRDRLLELEEDLETLELQLRHLDAAGAVPTEEEVAAARARRDA 513
Query: 364 QGTITALFAKREENLIHAMEKAMEFHETLQQNRDDCKKADCNADAVQTFVNSLPEDDQEA 423
E A E+A+E + L A V + SL + ++ A
Sbjct: 514 LWQDIRRGYALLEPTASAYEEAVESADQLADRLLR------EAQLVGE-LQSLRQQEEAA 566
Query: 424 RTQLAEHEKFLRELAEKEIEKDATIGLAQRILVKSHPDGATVIKHWITIIQSRWEEVSSW 483
R +L + EK L L P ++ W+ + E+V ++
Sbjct: 567 RRRLEQLEKELEVLELALAALREAWQAQWAAA--GLPLTPAEMEDWLAERATAREQVRAY 624
Query: 484 AKQREE---------RLRNHLRSL---QDLDSLLEELLEWLAKCESHLLNLEAEPLPDDI 531
K R E RLR LR+ + EEL E L LL EAE
Sbjct: 625 FKARAELDALLDRRARLRAALRAALKAVAIVLPGEELAELLELAR-QLLE-EAEKQAARK 682
Query: 532 PTVERLIEEHKEFMEATSKRQHEVDSVRA 560
+++ + + + +E +R E S
Sbjct: 683 ASLDERLRDAERALEEAEERHDEAQSALE 711
>gnl|CDD|190992 pfam04447, DUF550, Protein of unknown function (DUF550). This
family is found in a range of Proteobacteria and a few
P-22 dsDNA virus particles. The function is currently
not known.
Length = 101
Score = 27.4 bits (61), Expect = 9.5
Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 5/44 (11%)
Query: 600 DTWRNVWLLAWERQRRL----QERLNYLIE-LEKVKNFSWDDWR 638
W +V LAW+ QRR ++ ++ ++ L K + W DWR
Sbjct: 42 SEWADVQFLAWDGQRRAGISDEQIIDAIVAKLAKNEARQWPDWR 85
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 28.7 bits (65), Expect = 9.7
Identities = 22/69 (31%), Positives = 31/69 (44%), Gaps = 8/69 (11%)
Query: 486 QREERLRNHLRSLQDLDSLLEELLEWLAKCESHLLNLEAEPLPDDIPTVERLIEEHKEFM 545
Q+EE L SL+ + LEE + LA + L E E +E LIEE ++ +
Sbjct: 89 QKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEE--------LEELIEEQQQEL 140
Query: 546 EATSKRQHE 554
E S E
Sbjct: 141 ERISGLTAE 149
>gnl|CDD|234949 PRK01388, PRK01388, arginine deiminase; Provisional.
Length = 406
Score = 29.0 bits (66), Expect = 9.9
Identities = 12/27 (44%), Positives = 15/27 (55%)
Query: 529 DDIPTVERLIEEHKEFMEATSKRQHEV 555
DD+P VER +EH F + R EV
Sbjct: 41 DDVPWVERAQKEHDAFAQTLRDRGVEV 67
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.132 0.386
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,935,431
Number of extensions: 4269812
Number of successful extensions: 6882
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6659
Number of HSP's successfully gapped: 280
Length of query: 801
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 696
Effective length of database: 6,280,432
Effective search space: 4371180672
Effective search space used: 4371180672
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (27.9 bits)