RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7893
(691 letters)
>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
Length = 444
Score = 131 bits (330), Expect = 1e-32
Identities = 103/433 (23%), Positives = 169/433 (39%), Gaps = 71/433 (16%)
Query: 29 PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKKDGETQ----VGNDISNIEAV- 83
IV DNG+ + G+A + P +F +++ + R E + + VGN+ N
Sbjct: 8 TIVIDNGSGTTKAGFAGNDTPTTVFPSIVGRRRDEDSVMEDTEEKDTYVGNEAQNDRDNS 67
Query: 84 RFQLKTQFDKNVVTHYDIQEQIFDYAFS---HLSINTEGNVNHPIVLTEPFLNPNYSRSL 140
+L+ + ++ ++D EQI+DY F L + E HP++LTEP LNP +R
Sbjct: 68 LLELRYPIENGIILNWDAMEQIWDYTFFNKGDLLPSPEE---HPLLLTEPPLNPPSNREK 124
Query: 141 MSELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCG 200
++ELLFE VP++ I ++ S Y +G++I G
Sbjct: 125 ITELLFETLNVPALYLAIQAVLS-LYASGSSDE-------------------TGLVIDSG 164
Query: 201 YQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINS------------I 248
THVIPV++G + KRID+GG + +L KLL+ KYP I
Sbjct: 165 DSVTHVIPVVDGIVLPKAVKRIDIGGRDITDYLKKLLREKYPPSRGYNLKSELVEYSSEI 224
Query: 249 TPSRSEELL---WDYGFVATDYREHLRKWLDAEFYDSNVVKVQL-PYAVPVPNLTTEQQK 304
EE+ + +V+ D E + + S QL ++
Sbjct: 225 VNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETSIAKE---SKEL 281
Query: 305 DRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINN 364
+E+ + K E L E ++ L E +I E + E + N+
Sbjct: 282 PDGEEI--EFGNEERFKAPEILFKPELPISGLEEAGKIDESKQELVAENYEISPTNLGND 339
Query: 365 IQDLNKSINQ-LQQKIEKTKAKIIAYNN-----GEDLTEEPKAKLSKEI----------- 407
I L + + Q +Q E + Y+N G +L KE+
Sbjct: 340 IAGLPELVYQSIQICDEDVRKS--LYSNIVLTGGTSKIPGFAERLQKELTSLAPSIWKVS 397
Query: 408 AVPESEAEFKAWL 420
+P + AWL
Sbjct: 398 VIPPPDPSLDAWL 410
Score = 76.7 bits (189), Expect = 1e-14
Identities = 46/190 (24%), Positives = 82/190 (43%), Gaps = 2/190 (1%)
Query: 461 ELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQ 520
E +E+ + + + K + A ++ + E + E + PE +
Sbjct: 248 EEFEEEEEKPAEKSTESTFQLSKETS-IAKESKELPDGEEIEFGNEERFKA-PEILFKPE 305
Query: 521 EQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANN 580
+EA ++ + + P+ LG+ AG+ E + + + + +SL +N
Sbjct: 306 LPISGLEEAGKIDESKQELVAENYEISPTNLGNDIAGLPELVYQSIQICDEDVRKSLYSN 365
Query: 581 IFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFA 640
I +TG ++PGF ERL K+L P SV +P LDAW GA + E F
Sbjct: 366 IVLTGGTSKIPGFAERLQKELTSLAPSIWKVSVIPPPDPSLDAWLGASILASLETFQQLW 425
Query: 641 VTQSDYQEKG 650
+T+ +Y+E G
Sbjct: 426 ITKEEYEEHG 435
>gnl|CDD|214592 smart00268, ACTIN, Actin. ACTIN subfamily of
ACTIN/mreB/sugarkinase/Hsp70 superfamily.
Length = 373
Score = 123 bits (311), Expect = 1e-30
Identities = 70/291 (24%), Positives = 125/291 (42%), Gaps = 60/291 (20%)
Query: 29 PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKK--DGETQVGNDISNIEAVRFQ 86
IV DNG+ + G+A + P ++F +++ +P+ +G + VG++ +
Sbjct: 3 AIVIDNGSGTIKAGFAGEDFPQVVFPSIVGRPKDGKGMVGDAKDIFVGDEAQEKRGG-LE 61
Query: 87 LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 145
LK + +V ++D E+I+DY F + L + E HP++LTEP +NP +R + E++
Sbjct: 62 LKYPIENGIVENWDDMEKIWDYTFFNELRVEPE---EHPVLLTEPPMNPKSNREKILEIM 118
Query: 146 FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGYQCTH 205
FE + P++ I ++ S +G +G++I G TH
Sbjct: 119 FETFNFPALYIAIQAVLSLYASG----------------------RTTGLVIDSGDGVTH 156
Query: 206 VIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFVAT 265
V+PV++G + KRID+ G + +L +LL + NS
Sbjct: 157 VVPVVDGYVLPHAIKRIDIAGRDITDYLKELLS-ERGYQFNSS----------------- 198
Query: 266 DYREHLRKWLDAEFYDSNVVKVQLPYAVPVPNLTTEQQKDRRKELAKKLVE 316
AEF +K +L Y V + E + R + KL +
Sbjct: 199 -----------AEFEIVREIKEKLCY-VA-EDFEKEMKLARESSESSKLEK 236
Score = 86.5 bits (215), Expect = 3e-18
Identities = 39/146 (26%), Positives = 64/146 (43%)
Query: 505 EEILRQHDPEFTSLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNF 564
E+ ++ S E+ + N + +G ER PE LF P ++G Q GI E +
Sbjct: 219 EKEMKLARESSESSKLEKTYELPDGNTIKVGNERFRIPEILFSPELIGLEQKGIHELVYE 278
Query: 565 VLNSYPQHIAQSLANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAW 624
+ + + L NI ++G +PGF ERL K+L + P + V + W
Sbjct: 279 SIQKCDIDVRKDLYENIVLSGGSTLIPGFGERLEKELKQLAPKKLKVKVIAPPERKYSVW 338
Query: 625 SGARQFSLSENFHDFAVTQSDYQEKG 650
G + F D +T+ +Y+E G
Sbjct: 339 LGGSILASLSTFEDMWITKKEYEESG 364
>gnl|CDD|200935 pfam00022, Actin, Actin.
Length = 367
Score = 99.2 bits (248), Expect = 2e-22
Identities = 51/222 (22%), Positives = 100/222 (45%), Gaps = 41/222 (18%)
Query: 29 PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKKD---GETQVGNDISNIEAVRF 85
+V DNG+ + G+A + P + +++ +PR G+ + + +E
Sbjct: 5 ALVIDNGSGTTKAGFAGEDAPRAVIPSVVGRPRGRGVMVKYYVGDEAL-SKRPGLE---- 59
Query: 86 QLKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSEL 144
++ + +V ++D E+I+++ F L ++ E HP++LTEP LNP +R +E+
Sbjct: 60 -VRYPIEDGIVENWDAMEKIWEHTFFEELRVDPE---EHPLLLTEPPLNPPANREKATEI 115
Query: 145 LFECYQVPSVCYGID---SLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGY 201
+FE + VP++ S ++ G++ +G+++ G
Sbjct: 116 MFETFGVPALYLAKQAVLSAYAS------GRT-------------------TGLVVDSGA 150
Query: 202 QCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPS 243
T V+PV G + +R DL G + +L KLL + S
Sbjct: 151 GVTSVVPVYEGYVLQKAIRRSDLAGDDLTDYLRKLLSSRTYS 192
Score = 84.2 bits (209), Expect = 2e-17
Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 1/118 (0%)
Query: 534 IGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANNIFVTGSLCQLPGF 593
+G ER PE LF PS++GS AGI E + +N+ + SL NI VTG PGF
Sbjct: 241 LGNERFRVPEILFNPSLIGSESAGIPELIYDSINACDVDLRPSLLANIVVTGGTTLFPGF 300
Query: 594 VERLNKDLLENRPFQSHFSVSLAEN-PELDAWSGARQFSLSENFHDFAVTQSDYQEKG 650
ERL K+L + P + N + AW G + F V++ +Y+E G
Sbjct: 301 TERLEKELAQLAPSGVKVKIIAPPNERKYSAWIGGSILASLGTFQQMWVSKQEYEEHG 358
>gnl|CDD|240225 PTZ00004, PTZ00004, actin-2; Provisional.
Length = 378
Score = 93.7 bits (233), Expect = 2e-20
Identities = 61/272 (22%), Positives = 122/272 (44%), Gaps = 46/272 (16%)
Query: 30 IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKER---GKKDGETQVGNDISNIEAVRFQ 86
V DNG+ + G+A + P +F +++ +P+ G ++ + VG++ + +
Sbjct: 9 AVVDNGSGMVKAGFAGDDAPRCVFPSIVGRPKNPGIMVGMEEKDCYVGDEAQDKRGI-LT 67
Query: 87 LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 145
LK + +VT++D E+I+ + F + L + E HP++LTE LNP +R M++++
Sbjct: 68 LKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPE---EHPVLLTEAPLNPKANREKMTQIM 124
Query: 146 FECYQVPSVCYGID---SLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGYQ 202
FE + VP++ I SL++ SG +G+++ G
Sbjct: 125 FETHNVPAMYVAIQAVLSLYA---------SGRT----------------TGIVLDSGDG 159
Query: 203 CTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWD--- 259
+H +P+ G R+D+ G + ++ K+L + T + +E++ D
Sbjct: 160 VSHTVPIYEGYSLPHAIHRLDVAGRDLTEYMMKIL----HERGTTFTTTAEKEIVRDIKE 215
Query: 260 -YGFVATDYREHLRKWLDAEFYDSNVVKVQLP 290
++A D+ E + A D +LP
Sbjct: 216 KLCYIALDFDEEMGN--SAGSSDKYEESYELP 245
Score = 60.9 bits (148), Expect = 8e-10
Identities = 37/122 (30%), Positives = 50/122 (40%), Gaps = 1/122 (0%)
Query: 530 NQLHIGVERMCGPECLFQPSMLGSIQA-GISETLNFVLNSYPQHIAQSLANNIFVTGSLC 588
+ +G ER PE LFQPS++G + GI E +N I + L NI ++G
Sbjct: 248 TIITVGSERFRCPEALFQPSLIGKEEPPGIHELTFQSINKCDIDIRKDLYGNIVLSGGTT 307
Query: 589 QLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQE 648
G ERL K+L P V + W G S F VT+ +Y E
Sbjct: 308 MYRGLPERLTKELTTLAPSTMKIKVVAPPERKYSVWIGGSILSSLPTFQQMWVTKEEYDE 367
Query: 649 KG 650
G
Sbjct: 368 SG 369
>gnl|CDD|240426 PTZ00466, PTZ00466, actin-like protein; Provisional.
Length = 380
Score = 86.9 bits (215), Expect = 3e-18
Identities = 61/226 (26%), Positives = 109/226 (48%), Gaps = 34/226 (15%)
Query: 18 EYLDELRDSSLPIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKER---GKKDGETQVG 74
EY +L S+ PI+ DNG + G+A + PNL+F + + +P+ +R G +G VG
Sbjct: 4 EYAKQLY-SNQPIIIDNGTGYIKAGFAGEDVPNLVFPSYVGRPKYKRVMAGAVEGNIFVG 62
Query: 75 NDISNIEAVRFQLKTQFDKN--VVTHYDIQEQIFDYAFSHLSINTEGNVNHPIVLTEPFL 132
N E R LK + N ++ +++ E I+ + ++ + IN+E HP++LTE L
Sbjct: 63 N---KAEEYRGLLKVTYPINHGIIENWNDMENIWIHVYNSMKINSE---EHPVLLTEAPL 116
Query: 133 NPNYSRSLMSELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQ 192
NP ++ ++E+ FE + VP++ I ++ S + SCG
Sbjct: 117 NPQKNKEKIAEVFFETFNVPALFISIQAILS------------LYSCGK----------T 154
Query: 193 SGVIISCGYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQ 238
+G ++ CG H + + G + R D+ G + +L LL+
Sbjct: 155 NGTVLDCGDGVCHCVSIYEGYSITNTITRTDVAGRDITTYLGYLLR 200
Score = 46.9 bits (111), Expect = 2e-05
Identities = 29/123 (23%), Positives = 56/123 (45%)
Query: 528 EANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANNIFVTGSL 587
+ +Q+ IG ER PE LF PS+LG G+SE + + + ++L ++I ++G
Sbjct: 249 DGSQILIGSERYRAPEVLFNPSILGLEYLGLSELIVTSITRADMDLRRTLYSHIVLSGGT 308
Query: 588 CQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQ 647
GF +RL ++ + P +S + + G + F +++ ++
Sbjct: 309 TMFHGFGDRLLNEIRKFAPKDITIRISAPPERKFSTFIGGSILASLATFKKIWISKQEFD 368
Query: 648 EKG 650
E G
Sbjct: 369 EYG 371
>gnl|CDD|173506 PTZ00281, PTZ00281, actin; Provisional.
Length = 376
Score = 84.0 bits (207), Expect = 3e-17
Identities = 57/249 (22%), Positives = 115/249 (46%), Gaps = 34/249 (13%)
Query: 30 IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKE-----RGKKDGETQVGNDISNIEAVR 84
+V DNG+ C+ G+A + P +F +++ +PR G+KD + VG++ + +
Sbjct: 9 LVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHTGVMVGMGQKD--SYVGDEAQSKRGI- 65
Query: 85 FQLKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSE 143
LK + +VT++D E+I+ + F + L + E HP++LTE LNP +R M++
Sbjct: 66 LTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPE---EHPVLLTEAPLNPKANREKMTQ 122
Query: 144 LLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGYQC 203
++FE + P++ I ++ S +G +G+++ G
Sbjct: 123 IMFETFNTPAMYVAIQAVLSLYASG----------------------RTTGIVMDSGDGV 160
Query: 204 THVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFV 263
+H +P+ G R+DL G + ++ K+L + S + ++ +V
Sbjct: 161 SHTVPIYEGYALPHAILRLDLAGRDLTDYMMKILTERGYSFTTTAEREIVRDIKEKLAYV 220
Query: 264 ATDYREHLR 272
A D+ ++
Sbjct: 221 ALDFEAEMQ 229
Score = 61.3 bits (148), Expect = 7e-10
Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 6/137 (4%)
Query: 516 TSLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQ 575
++L + EL + + IG ER PE LFQPS LG AGI ET + I +
Sbjct: 235 SALEKSYELP--DGQVITIGNERFRCPEALFQPSFLGMESAGIHETTYNSIMKCDVDIRK 292
Query: 576 SLANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPE--LDAWSGARQFSLS 633
L N+ ++G PG +R+NK+L P S + + PE W G +
Sbjct: 293 DLYGNVVLSGGTTMFPGIADRMNKELTALAP--STMKIKIIAPPERKYSVWIGGSILASL 350
Query: 634 ENFHDFAVTQSDYQEKG 650
F +++ +Y E G
Sbjct: 351 STFQQMWISKEEYDESG 367
>gnl|CDD|240343 PTZ00280, PTZ00280, Actin-related protein 3; Provisional.
Length = 414
Score = 66.7 bits (163), Expect = 1e-11
Identities = 65/295 (22%), Positives = 121/295 (41%), Gaps = 50/295 (16%)
Query: 26 SSLPI-VFDNGAWCCRVGWASCEKPNLIFKNLIA------KPRKERGKKDGETQVGNDIS 78
S+LP+ V DNG ++G+A +P I LIA + R ++G +D + +G++
Sbjct: 2 STLPVVVIDNGTGYTKMGYAGNTEPTYIIPTLIADNSKQSRRRSKKGFEDLDFYIGDEAL 61
Query: 79 NIEAVRFQLKTQFDKNVVTHYDIQEQIFDYA-FSHLSINTEGNVNHPIVLTEPFLNPNYS 137
+ + L +V +D+ E+ ++ F +L E H +LTEP +NP +
Sbjct: 62 A-ASKSYTLTYPMKHGIVEDWDLMEKFWEQCIFKYLRCEPE---EHYFILTEPPMNPPEN 117
Query: 138 RSLMSELLFECYQVPSVCYGID---SLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSG 194
R +E++FE + V + + +L + + + G +G
Sbjct: 118 REYTAEIMFETFNVKGLYIAVQAVLALRASWTSKKAKELG---------------GTLTG 162
Query: 195 VIISCGYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSE 254
+I G THVIPV++G + S K I L G + + + ++L+ + ++
Sbjct: 163 TVIDSGDGVTHVIPVVDGYVIGSSIKHIPLAGRDITNFIQQMLRERGEPIPAEDILLLAQ 222
Query: 255 ELLWDYGFVATDYREHLRKWLDAEF--YDSNVVKVQLPYAVPVPNLTTEQQKDRR 307
+ Y +VA D + EF YDS+ P + +
Sbjct: 223 RIKEKYCYVAPD--------IAKEFEKYDSD----------PKNHFKKYTAVNSV 259
Score = 42.0 bits (99), Expect = 7e-04
Identities = 33/153 (21%), Positives = 56/153 (36%), Gaps = 18/153 (11%)
Query: 515 FTSLNQEQELSPKEANQLHIGVERMCGPECLFQPSML-GSIQAGISETLNFVLNSYPQHI 573
F ++ K+ + +G ER GPE F P + + E ++ + S P
Sbjct: 250 FKKYTAVNSVT-KKPYTVDVGYERFLGPEMFFHPEIFSSEWTTPLPEVVDDAIQSCPIDC 308
Query: 574 AQSLANNIFVTGSLCQLPGFVERLNKD--------------LLENRPFQSHFSVSLAENP 619
+ L NI ++G GF +RL +D L + V++ +P
Sbjct: 309 RRPLYKNIVLSGGSTMFKGFDKRLQRDVRKRVDRRLKKAEELSGGKLKPIPIDVNVVSHP 368
Query: 620 --ELDAWSGARQFSLSENFHDFAVTQSDYQEKG 650
W G + S F T+++Y E G
Sbjct: 369 RQRYAVWYGGSMLASSPEFEKVCHTKAEYDEYG 401
>gnl|CDD|185631 PTZ00452, PTZ00452, actin; Provisional.
Length = 375
Score = 63.2 bits (153), Expect = 1e-10
Identities = 65/290 (22%), Positives = 122/290 (42%), Gaps = 46/290 (15%)
Query: 30 IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRK---ERGKKDGETQVGNDISNIEAVRFQ 86
+V DNG+ C++G A + P F ++ + ++ + E VG + V
Sbjct: 8 VVIDNGSGYCKIGIAGDDAPTSCFPAIVGRSKQNDGIFSTFNKEYYVGEEAQAKRGV-LA 66
Query: 87 LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 145
+K ++ +D E I+ +AF + L ++ E + P+ +T+ +N ++R M++++
Sbjct: 67 IKEPIQNGIINSWDDIEIIWHHAFYNELCMSPE---DQPVFMTDAPMNSKFNRERMTQIM 123
Query: 146 FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGYQCTH 205
FE + P + +++ S +G G+++ G TH
Sbjct: 124 FETFNTPCLYISNEAVLSLYTSG----------------------KTIGLVVDSGEGVTH 161
Query: 206 VIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQ-LKYP---SHINSITPSRSEELLWDYG 261
+PV G +I+L G +L ++LQ L Y H I + E L
Sbjct: 162 CVPVFEGHQIPQAITKINLAGRLCTDYLTQILQELGYSLTEPHQRIIVKNIKERLC---- 217
Query: 262 FVATDYREHLRKWLDAEFYDSNVVKVQLPYAVPVPN-LTTEQQKDRRKEL 310
+ A D ++ R + ++ DS PY +P N LT + QK R E+
Sbjct: 218 YTALDPQDEKRIYKESNSQDS-------PYKLPDGNILTIKSQKFRCSEI 260
Score = 46.7 bits (110), Expect = 3e-05
Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 2/135 (1%)
Query: 517 SLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQS 576
S +Q+ + N L I ++ E LFQP ++G AGI + + Q
Sbjct: 233 SNSQDSPYKLPDGNILTIKSQKFRCSEILFQPKLIGLEVAGIHHLAYSSIKKCDLDLRQE 292
Query: 577 LANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAW-SGARQFSLSEN 635
L NI ++G PG RL+ +L P Q V+ + AW G+ Q +LS
Sbjct: 293 LCRNIVLSGGTTLFPGIANRLSNELTNLVPSQLKIQVAAPPDRRFSAWIGGSIQCTLSTQ 352
Query: 636 FHDFAVTQSDYQEKG 650
+ + + +Y E+G
Sbjct: 353 QPQW-IKRQEYDEQG 366
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 46.2 bits (110), Expect = 5e-05
Identities = 43/263 (16%), Positives = 95/263 (36%), Gaps = 34/263 (12%)
Query: 305 DRRKELAK-------------KLVEMNAKKREERLVDDERHLNELLELREIV-ELTPSDH 350
DR+KE A L+ +++ ERL + L + E +
Sbjct: 169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYEL 228
Query: 351 SHAREAF---KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNG-EDLTEEPKAKLSKE 406
+EA K + L + + +L ++I + + ++ E+L ++ K +E
Sbjct: 229 LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEE 288
Query: 407 IAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISE----- 461
K + E + + A + +++ +D +R E +L++E
Sbjct: 289 QL------RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342
Query: 462 --LARKEKRDDDFGMR-DEDWDVYKVINKDAG--DTDSEEEQERLIELEEILRQHDPEFT 516
+ + KR D E + + + + D + E ++ L + E L + E
Sbjct: 343 REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402
Query: 517 SLNQEQELSPKEANQLHIGVERM 539
L +E + +E +L + +
Sbjct: 403 ELKRELDRLQEELQRLSEELADL 425
Score = 42.0 bits (99), Expect = 0.001
Identities = 49/253 (19%), Positives = 99/253 (39%), Gaps = 29/253 (11%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
E+ + EL K+L E+ E + E L ++E + ++ + +
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313
Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEP--KAKLSKEIA---------- 408
++D + + +L+ +I+K A+I ++ EE + KL++E A
Sbjct: 314 E-RELEDAEERLAKLEAEIDKLLAEIEELE--REIEEERKRRDKLTEEYAELKEELEDLR 370
Query: 409 --VPESEAEFKAWLIETKKKRAYI---IDKKNARKQRRQDLAKRRTAAAQERMRLISELA 463
+ E + EF E K R + + N K+ L + ++E L + +A
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430
Query: 464 RKEKRDDDFGMRDEDWDVYKVINKDAGD-----TDSEEEQERLIELEEILRQHDPEFTSL 518
E + ++ E D I K D + ++ L +L+E + + E + L
Sbjct: 431 GIEAKINELEEEKE--DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKL 488
Query: 519 NQEQELSPKEANQ 531
++EL+ EA
Sbjct: 489 --QRELAEAEAQA 499
Score = 36.2 bits (84), Expect = 0.065
Identities = 45/248 (18%), Positives = 92/248 (37%), Gaps = 24/248 (9%)
Query: 302 QQKDRRKELAKKLVEMNAKKREERLVDD----ERHLNELLEL-----REIVELTPSDHSH 352
+ + L ++L + K+ L + E L+EL + R+I E+
Sbjct: 671 SEPAELQRLRERLEGL--KRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQL 728
Query: 353 AREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPES 412
+E K +++L + ++ L+Q+IE K+++ + EE KL + + E+
Sbjct: 729 EQEEEKLKER--LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEA 786
Query: 413 EAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDF 472
+ +++ + R + R R+ L E KE ++
Sbjct: 787 RLSHSRIPEIQAELSK--LEE----EVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQE 840
Query: 473 GMRDEDWDVYKVINK---DAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEA 529
D + + + EE +E L ELE LR + L +E++ +
Sbjct: 841 QRI--DLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898
Query: 530 NQLHIGVE 537
+L +E
Sbjct: 899 RELERKIE 906
Score = 34.3 bits (79), Expect = 0.31
Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 6/105 (5%)
Query: 304 KDRRKELAKKLVEMNAKKRE--ERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 361
K++ K + K++ +N KK E E L E L +L + + + +
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEEL---EELEAALRDLESRLGDLKKERDELEAQLRELE 902
Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKE 406
I++L I + ++++ + KAK+ A E+PK + +
Sbjct: 903 -RKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEI 946
Score = 31.2 bits (71), Expect = 2.2
Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 5/123 (4%)
Query: 299 TTEQQKDRRKELAKKLVEMNAKKREE-RLVDDERHLN-ELLELREIVELTPSDHSHAREA 356
T ++ KD R++L K E+N KRE RL ++ + L+ EL +L + + + E
Sbjct: 383 TRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEE 442
Query: 357 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 416
+ + I+ + QL + K + ++ D E+ +KL +E+A E+EA+
Sbjct: 443 KEDKAL-EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA--EAEAQA 499
Query: 417 KAW 419
+A
Sbjct: 500 RAS 502
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 43.2 bits (102), Expect = 5e-04
Identities = 39/237 (16%), Positives = 87/237 (36%), Gaps = 26/237 (10%)
Query: 297 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHL-NELLELREIVELTPSDHSHARE 355
L + ++ R+ K ++ E+L L EL EL E +E E
Sbjct: 706 ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765
Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
+ L++ + K K +I +E +L +E+ E +
Sbjct: 766 E---------------LESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLD 810
Query: 416 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 475
+E+ ++R ++++ + + + + +E + + + + K + + +
Sbjct: 811 ALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE-ELE 869
Query: 476 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQL 532
E ++ + +E +E ELEE LR+ + E L +E E + +L
Sbjct: 870 AEKEELEDEL---------KELEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917
Score = 38.9 bits (91), Expect = 0.009
Identities = 39/212 (18%), Positives = 87/212 (41%), Gaps = 12/212 (5%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSD---HSHAREAF 357
+++ + +E + L E K +E + + E L E E +E + A E
Sbjct: 757 QERLEELEEELESLEEA-LAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815
Query: 358 KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFK 417
+ L + I +L+++IE+ + K+ + E+ +L +E+ E EAE +
Sbjct: 816 LESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE--ELEAEKE 873
Query: 418 AWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDE 477
+E + K ++K ++ ++L +E +L L E + + +
Sbjct: 874 E--LEDELKELE--EEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELP 929
Query: 478 DWDVYKVINKDAGDTDSEEEQERLIELEEILR 509
++ + + ++ DT E + + LEE +
Sbjct: 930 --ELEEELEEEYEDTLETELEREIERLEEEIE 959
Score = 37.0 bits (86), Expect = 0.042
Identities = 22/136 (16%), Positives = 51/136 (37%), Gaps = 9/136 (6%)
Query: 301 EQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVELTPSDHSHARE 355
E+ + ELA+ E+ KRE ERL L +L E + +E + E
Sbjct: 383 EELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELE 442
Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
+++L + + +L+ ++++ + ++ E+ + L + E+E
Sbjct: 443 ELNE----ELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQR 498
Query: 416 FKAWLIETKKKRAYII 431
+ + +
Sbjct: 499 ASQGVRAVLEALESGL 514
Score = 33.1 bits (76), Expect = 0.63
Identities = 42/229 (18%), Positives = 84/229 (36%), Gaps = 48/229 (20%)
Query: 304 KDRRKELAKKL--VEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 361
K+R++E +KL E N ++ E+ L + E+ L +L E E + RE ++
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALL 230
Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 421
+ +++L K + +L++++ + + ++ +E + KEI
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEEL-------QEELEEAEKEI-------------- 269
Query: 422 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDV 481
K ++L + +E + L E+ E R
Sbjct: 270 -------------EELKSELEELREELEELQEELLELKEEIEELEGEISLLRER------ 310
Query: 482 YKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSL-NQEQELSPKEA 529
+ + + EE +ERL EL+E + E +EL A
Sbjct: 311 -----LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354
Score = 30.1 bits (68), Expect = 5.6
Identities = 27/172 (15%), Positives = 69/172 (40%), Gaps = 23/172 (13%)
Query: 365 IQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETK 424
+ +L + +L+++++ K ++ + + L E + E + E + E A E +
Sbjct: 676 LAELEAQLEKLEEELKSLKNELRSLE--DLLEELRRQLEELERQLEELKRELAALEEELE 733
Query: 425 KKRAYI--IDKKNARKQRRQDLAKRRTAAAQERMR-LISELARKEKRDDDFGMRDEDWDV 481
+ ++ + ++++ + + + R +E + L LA+ ++ ++
Sbjct: 734 QLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE-------- 785
Query: 482 YKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQLH 533
+ QE L ELEE L + + +L +E E + +L
Sbjct: 786 ----------EKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLE 827
>gnl|CDD|212657 cd00012, NBD_sugar-kinase_HSP70_actin, Nucleotide-Binding Domain of
the sugar kinase/HSP70/actin superfamily. This
superfamily includes the actin family, the HSP70 family
of molecular chaperones and nucleotide exchange factors,
the ROK (repressor, ORF, kinase) family, the hexokinase
family, the FGGY family (which includes glycerol kinase
and similar carbohydrate kinases such as rhamnulokinase
and xylulokinase), the exopolyphosphatase/guanosine
pentaphosphate phosphohydrolase/nucleoside triphosphate
diphosphohydrolase family, propionate kinase/acetate
kinase family, glycerol dehydratase reactivase,
2-hydroxyglutaryl-CoA dehydratase component A,
N-acetylglucosamine kinase, butyrate kinase 2,
Escherichia coli YeaZ and similar glycoproteases, the
cell shape-determining protein MreB, the plasmid DNA
segregation factor ParM, cell cycle proteins FtsA, Pili
assembly protein PilM, ethanolamine utilization protein
EutJ, and similar proteins. The nucleotide-binding site
residues are conserved; the nucleotide sits in a deep
cleft formed between the two lobes of the
nucleotide-binding domain (NBD). Substrate binding to
superfamily members is associated with closure of this
catalytic site cleft. The functional activities of
several members of the superfamily, including
hexokinases, actin, and HSP70s, are modulated by
allosteric effectors, which may act on the cleft
closure.
Length = 185
Score = 40.6 bits (95), Expect = 9e-04
Identities = 28/185 (15%), Positives = 50/185 (27%), Gaps = 34/185 (18%)
Query: 72 QVGNDISNIEAVRFQLKTQFDKNVVTHYDIQEQIFDYAFSHLSINTEGNVNHPIVLTEPF 131
V + I VT D E+ + + + +TEP
Sbjct: 13 GVADLDGEILPEEIVPTPVGRPGAVTDLDELEEALRELLKEALRQLKSEI-DAVGITEPG 71
Query: 132 LNPNYSRSLMS---------ELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQ 182
P +R ++ L E V D++ + G G+
Sbjct: 72 GVPKENREVIILPNLLLIPLALALEDLGGVPVAVVNDAVAAALAEGLFGK---------- 121
Query: 183 VYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQL--K 240
E + +++ G T + V +G K G + L +L L
Sbjct: 122 -------EEDTVLVVDLGTGTTGIAIVEDG-----KGGVGAAGELGIAEALAAVLNLLDG 169
Query: 241 YPSHI 245
P +
Sbjct: 170 LPDAV 174
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 41.3 bits (97), Expect = 0.002
Identities = 37/188 (19%), Positives = 74/188 (39%), Gaps = 23/188 (12%)
Query: 297 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDD---------------ERHLNELLE-LR 340
L +Q K+ ++L +L E+ E RL+ ++ L EL E L
Sbjct: 551 KLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLS 610
Query: 341 EIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPK 400
++ EL S E ++L + +L + E + A E+ EE +
Sbjct: 611 QLEELLQSLELSEAENEL---EEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667
Query: 401 AKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLIS 460
A++ +E+ E+E + + E ++ + ++ ++ ++L K+ Q L S
Sbjct: 668 AEIRRELQRIENEEQLE----EKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELES 723
Query: 461 ELARKEKR 468
A E+
Sbjct: 724 RKAELEEL 731
Score = 29.3 bits (66), Expect = 9.7
Identities = 47/265 (17%), Positives = 103/265 (38%), Gaps = 42/265 (15%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
+ +++ +EL + L E +++ ERL + ER + EL E E + A
Sbjct: 270 KIREEELRELERLLEE--LEEKIERLEELEREIEELEEELEGLR--------ALLEELEE 319
Query: 361 GINNIQDLNKSINQLQQKIEKTKAKI----IAYNNGEDLTEEPKAKLSKEIAVPESEAE- 415
+ ++ L + + +L++K+EK ++++ N L EE +L + + E E E
Sbjct: 320 LLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEK 379
Query: 416 -------FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
+ + E K++ A + ++ ++L K +E L E+ + E++
Sbjct: 380 ALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQ 439
Query: 469 DDDFGMRDEDW------------------DVYKVINKDAGDTDSE--EEQERLIELEEIL 508
+ ++ + ++ + + + E EE+ + E L
Sbjct: 440 INQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAEL 499
Query: 509 RQHDPEFTSLNQEQELSPKEANQLH 533
R+ E +E E E +L
Sbjct: 500 REEIEELEKELRELEEELIELLELE 524
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 40.0 bits (94), Expect = 0.005
Identities = 44/234 (18%), Positives = 93/234 (39%), Gaps = 29/234 (12%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERH--LNELLELREIVELTPSDHSHAREAFK 358
E+ ++ +KEL E++ +++++++ + +L EL +E S E
Sbjct: 281 EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELA 340
Query: 359 SMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKA 418
+ +++L + + L+ ++E+ +A++ + + EE L ++A E +
Sbjct: 341 ELE-EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIA--- 396
Query: 419 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 478
+ + + R + L RR QE L+ +L E + E+
Sbjct: 397 ---SLNNEIERL-------EARLERLEDRRERLQQEIEELLKKLEEAEL--KELQAELEE 444
Query: 479 WDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQL 532
+ + +EE ERL E E LR+ E Q + + +E QL
Sbjct: 445 --------LEEELEELQEELERLEEALEELRE---ELEEAEQALDAAERELAQL 487
Score = 39.7 bits (93), Expect = 0.007
Identities = 48/243 (19%), Positives = 107/243 (44%), Gaps = 15/243 (6%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELR-EIVELTPSDHSHAREAFKS 359
E+ ++ +EL ++L E + E E +L ELR E+ EL + ++ +
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEE-KLEELRLEVSELE-EEIEELQKELYA 292
Query: 360 MGI------NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE 413
+ Q L + + L++++E+ +A++ + D E A+L +++ E +
Sbjct: 293 LANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE--ELK 350
Query: 414 AEFKAWLIETKKKRAYIIDKKNARKQRRQDL--AKRRTAAAQERMRLI-SELARKEKRDD 470
E ++ E ++ A + + ++ ++ + L + + A + ++ + +E+ R E R +
Sbjct: 351 EELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410
Query: 471 DFGMRDEDWD-VYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEA 529
R E + + K + + +E Q L ELEE L + E L + E +E
Sbjct: 411 RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL 470
Query: 530 NQL 532
+
Sbjct: 471 EEA 473
Score = 38.5 bits (90), Expect = 0.015
Identities = 44/236 (18%), Positives = 84/236 (35%), Gaps = 36/236 (15%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
Q EL ++ E+ + E E E+ EL +E + REA +
Sbjct: 750 AQLSKELTELEAEIEELEERLEEAEEELAE-AEAEIEELEAQIEQLKEELKALREALDEL 808
Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIA------------ 408
+ LN+ L++++E + +I A + EE +LS++I
Sbjct: 809 R-AELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEEL 867
Query: 409 VPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
+ E E+E +A L E + + ++L++ +R L EL ++
Sbjct: 868 IEELESELEALLNERASLEEAL----ALLRSELEELSEELRELESKRSELRRELEELREK 923
Query: 469 DDDFGMR------------DEDWDVYKVI------NKDAGDTDSEEEQERLIELEE 506
+R + + Y + ++ + D EE + RL LE
Sbjct: 924 LAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLEN 979
Score = 34.3 bits (79), Expect = 0.27
Identities = 43/242 (17%), Positives = 92/242 (38%), Gaps = 60/242 (24%)
Query: 304 KDRRKELAKKLV--EMNAKKREERLVDDERHLNEL----------LELREIVELTPSDHS 351
K+RRKE +KL N + E+ L + ER L L EL+ EL
Sbjct: 171 KERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKA--EL------ 222
Query: 352 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPE 411
RE ++ + +++L + + +LQ+++++ EE +L+ E+ E
Sbjct: 223 --RELELALLVLRLEELREELEELQEELKE--------------AEEELEELTAELQELE 266
Query: 412 SEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDD 471
+ +E + ++++ Q+ + +++ +++ E + +
Sbjct: 267 EK-------LEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRE---RLANLER 316
Query: 472 FGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQ 531
+ + EE + +L EL E L + + + L +E E E +
Sbjct: 317 --------------QLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEE 362
Query: 532 LH 533
L
Sbjct: 363 LE 364
Score = 33.9 bits (78), Expect = 0.36
Identities = 49/247 (19%), Positives = 89/247 (36%), Gaps = 18/247 (7%)
Query: 296 PNLTTEQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVELTPSDH 350
N + +++ +EL +K+ E+ K E L + L E LE S
Sbjct: 668 TNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQ 727
Query: 351 SHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVP 410
A + ++ L + I QL +++ + +A+I + EE A+ EI
Sbjct: 728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE-- 785
Query: 411 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 470
E EA+ + E K R +D+ A L + + L +A E+R +
Sbjct: 786 ELEAQIEQLKEELKALRE-ALDELRAELTL---LNEEAANLRERLESLERRIAATERRLE 841
Query: 471 DF-----GMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELS 525
D + ++ + I + + EE + L L + L E E
Sbjct: 842 DLEEQIEELSEDIESLAAEIEE--LEELIEELESELEALLNERASLEEALALLRSELEEL 899
Query: 526 PKEANQL 532
+E +L
Sbjct: 900 SEELREL 906
Score = 33.5 bits (77), Expect = 0.49
Identities = 32/213 (15%), Positives = 77/213 (36%), Gaps = 22/213 (10%)
Query: 334 NELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGE 393
E+ EL E +E + +A + +L K + +L++++E+ + ++ +
Sbjct: 677 REIEELEEKIEELEEKIAELEKA--------LAELRKELEELEEELEQLRKELEELSRQI 728
Query: 394 DLTEEPKAKLSKEIAVPESE--------AEFKAWLIETKKKRAYIIDKKNARKQRRQDLA 445
+ A+L E+ E E +A + E +++ ++ + ++L
Sbjct: 729 SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
Query: 446 KRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDT-DSEEEQERLIEL 504
+ +E L L + + + + + + RL +L
Sbjct: 789 AQIEQLKEELKALREALDELRAE-----LTLLNEEAANLRERLESLERRIAATERRLEDL 843
Query: 505 EEILRQHDPEFTSLNQEQELSPKEANQLHIGVE 537
EE + + + SL E E + +L +E
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELE 876
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 39.0 bits (92), Expect = 0.008
Identities = 40/143 (27%), Positives = 62/143 (43%), Gaps = 25/143 (17%)
Query: 301 EQQKDRRKELAKKLVEMNAK--KREERLVDDERHLN---ELLELRE-IVELTPSDHSHAR 354
E+ R E K+L E + K E+RL+ E +L+ ELLE RE +E + +
Sbjct: 64 EEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123
Query: 355 EAFKSMGINNIQDLNKSINQLQQKIEK----TK--AKIIAYNNGEDLTEEPKAKLSKEIA 408
+ + ++L + I + Q++E+ T AK I E+ A L KEI
Sbjct: 124 QELEKK----EEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEI- 178
Query: 409 VPESEAEFKAWLIETKKKRAYII 431
E EA+ E K+A I
Sbjct: 179 --EEEAK------EEADKKAKEI 193
Score = 31.3 bits (72), Expect = 1.7
Identities = 27/143 (18%), Positives = 58/143 (40%), Gaps = 22/143 (15%)
Query: 368 LNKSINQLQQKIEKTKAKII---AYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETK 424
+ K I + + K + +AK I A E + +E + +EI +E E E +
Sbjct: 24 VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK-----ELR 78
Query: 425 KKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKV 484
++R + + Q+ ++L ++ + L + E++ + ++E+ +
Sbjct: 79 ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELE---- 134
Query: 485 INKDAGDTDSEEEQERLIELEEI 507
E +E+L ELE I
Sbjct: 135 ----------ELIEEQLQELERI 147
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 36.9 bits (86), Expect = 0.017
Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 15/94 (15%)
Query: 376 QQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK- 434
+++EK K K A ED++E KAK K++ KA + K+K+ Y++ KK
Sbjct: 125 LKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYK-------KAKKKKKKRKKTYVVAKKG 177
Query: 435 NARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
N K R K + RM +K+ R
Sbjct: 178 NKGKAGRPKGVKGKYKVVDPRM-------KKDLR 204
>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
transduction. This family is a four helix bundle that
operates as a ubiquitous sensory module in prokaryotic
signal-transduction. The 4HB_MCP is always found between
two predicted transmembrane helices indicating that it
detects only extracellular signals. In many cases the
domain is associated with a cytoplasmic HAMP domain
suggesting that most proteins carrying the bundle might
share the mechanism of transmembrane signalling which is
well-characterized in E coli chemoreceptors.
Length = 181
Score = 36.4 bits (85), Expect = 0.023
Identities = 27/104 (25%), Positives = 39/104 (37%), Gaps = 27/104 (25%)
Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 421
+ +I++L I++L +K EKT LTEE K EFK L
Sbjct: 81 LKDIEELRAEIDKLLKKYEKTI-----------LTEEEKKLF----------NEFKEQLK 119
Query: 422 ETKKKRAYIID--KKN----ARKQRRQDLAKRRTAAAQERMRLI 459
+K R ++D K A +L R A + LI
Sbjct: 120 AYRKVRNKVLDLAKAGKNDEAYALYLTELEPARDAVIEALDELI 163
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 36.9 bits (85), Expect = 0.039
Identities = 30/249 (12%), Positives = 85/249 (34%), Gaps = 21/249 (8%)
Query: 300 TEQQKDRRKELAKK------LVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHA 353
+ +++ +++ L K L E+ E +L + + L +
Sbjct: 166 SREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEE 225
Query: 354 REAFKSMG---INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVP 410
+ I L + + Q++IE +K ++ + + KE +
Sbjct: 226 NLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQ 285
Query: 411 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQ---ERMRLISELARKEK 467
E E + A E K ++++ + + +++ + ++ + E KE
Sbjct: 286 EEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKEL 345
Query: 468 RDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPK 527
++ ++ + ++ + + +++ EE+L + E L+ +L +
Sbjct: 346 ---------KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEE 396
Query: 528 EANQLHIGV 536
E +
Sbjct: 397 ELELKNEEE 405
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 35.4 bits (82), Expect = 0.082
Identities = 21/113 (18%), Positives = 42/113 (37%), Gaps = 23/113 (20%)
Query: 302 QQKDRRKELAKKLVEMNA-----KKREERLVDD-----------ERHLNELLEL-REIVE 344
+ ++ L + L + K E L E L +L +L E+ +
Sbjct: 141 LLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELED 200
Query: 345 LTPSDHSHAREAFKSM------GINNIQDLNKSINQLQQKIEKTKAKIIAYNN 391
P++ A+E K + + +++L + + +L+ KIE K N
Sbjct: 201 CDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT 253
>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
Length = 156
Score = 34.0 bits (79), Expect = 0.089
Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 12/93 (12%)
Query: 374 QLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDK 433
+ + +E+ + KI + L +AK E+A + EA+ L E + + A II+
Sbjct: 28 PIMKALEERQKKIA-----DGLAAAERAKKELELAQAKYEAQ----LAEARAEAAEIIE- 77
Query: 434 KNARKQRRQDLAKRRTAAAQERMRLISELARKE 466
A+K+ Q + + + A E R+ A+ E
Sbjct: 78 -QAKKRAAQIIEEAKAEAEAEAARIK-AQAQAE 108
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 35.4 bits (82), Expect = 0.11
Identities = 55/216 (25%), Positives = 99/216 (45%), Gaps = 33/216 (15%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
E++++R +EL KKL E+ +KR E L + E +E K +
Sbjct: 334 EEKEERLEELKKKLKEL--EKRLEELEERHELYEEAKAKKE----------ELERLKKRL 381
Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE-AEFKAW 419
+ L K + +++EK K +I EE +K++ I + E E K
Sbjct: 382 TGLTPEKLEKEL----EELEKAKEEI----------EEEISKITARIGELKKEIKELKKA 427
Query: 420 LIETKK-KRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 478
+ E KK K + + ++ R++L + TA E R+ EL E+++ +R E
Sbjct: 428 IEELKKAKGKCPVCGRELTEEHRKELLEEYTA---ELKRIEKELKEIEEKERK--LRKEL 482
Query: 479 WDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPE 514
++ KV+ K++ +E E+L ELEE L++++ E
Sbjct: 483 RELEKVLKKESELIKLKELAEQLKELEEKLKKYNLE 518
Score = 33.9 bits (78), Expect = 0.39
Identities = 38/169 (22%), Positives = 79/169 (46%), Gaps = 20/169 (11%)
Query: 308 KELAKKLVEMNAKKREERLVDDERHLNELL----ELREIVELTPSDHSHARE-----AFK 358
E + +E + +K EE++ + E + EL EL E V+ A E F
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFY 302
Query: 359 SMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKA 418
++ ++++ K +++L+++I + +I ++L EE + +L +E+ E E +
Sbjct: 303 EEYLDELREIEKRLSRLEEEINGIEERI------KEL-EEKEERL-EELKKKLKELEKR- 353
Query: 419 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 467
+E ++R + ++ A+K+ + L KR T E++ E K K
Sbjct: 354 --LEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAK 400
Score = 32.7 bits (75), Expect = 0.75
Identities = 52/227 (22%), Positives = 95/227 (41%), Gaps = 36/227 (15%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDE-----RHLNELLELREIVELTPSDHSHARE 355
E K++ KEL + L E+N E + +E + + EL EL+E +E +E
Sbjct: 193 ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIE------ELEKE 246
Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
G + + L + I +L+++IE+ K +I E+L E K K KE+ + +AE
Sbjct: 247 LESLEG--SKRKLEEKIRELEERIEELKKEI------EEL--EEKVKELKEL---KEKAE 293
Query: 416 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 475
L E ++ +D+ ++R L + I EL KE+R ++ +
Sbjct: 294 EYIKLSEFYEE---YLDELREIEKRLSRLEEE----INGIEERIKELEEKEERLEELKKK 346
Query: 476 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQ 522
+ ++ + + E E +E L + T L E+
Sbjct: 347 LK-----ELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 35.4 bits (82), Expect = 0.12
Identities = 53/269 (19%), Positives = 97/269 (36%), Gaps = 49/269 (18%)
Query: 298 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLE-LREIVELTPSDHSHAREA 356
L EQQ + K + + ++ E +L + L +LLE + +E D R
Sbjct: 661 LQNEQQSLKDK--LELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTE 718
Query: 357 FKSMGINNIQDLNKSINQLQQKIEK--------TKAKIIAYNN---GEDLTEEPKAKLSK 405
+ +L+ + QL IE K Y+ D+ +L +
Sbjct: 719 RLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKR 778
Query: 406 EI----------AVPESEA-EFKAWLIETKKKRAYIIDKKNARKQR-----------RQD 443
+I AV E E++A++ ET R + +++ + +Q+
Sbjct: 779 QIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQE 838
Query: 444 LAKRRTAAAQERMRLISELARKEKRDDDF-----GMRDEDWDVYKVINKDAGDTDSEEEQ 498
L + R +L E EK+ D G+RDE + + + ++ + +
Sbjct: 839 LTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDE----MRQLAELKEPANANQAE 894
Query: 499 ----ERLIELEEILRQHDPEFTSLNQEQE 523
ERL +LEE R+ L + E
Sbjct: 895 GSISERLDQLEEFKRKRKRLSGDLKKFLE 923
Score = 29.7 bits (67), Expect = 7.8
Identities = 51/257 (19%), Positives = 95/257 (36%), Gaps = 37/257 (14%)
Query: 300 TEQQKDRRKELAKKLVEMNAK----KREERLVDD--ERHLNELLELR----EIVELTPSD 349
+Q K+ R EL ++L NAK + E L++D + +E + + S+
Sbjct: 301 EDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSE 360
Query: 350 HSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNN-GEDLTEEPKAKLSKEIA 408
++ QD+ + +L+QKI++ + + NN E K + I
Sbjct: 361 LEEVEARLDAL-TGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIE 419
Query: 409 VPES--EAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKE 466
E++ + L K + +++ + R L +R +A EL + E
Sbjct: 420 EDLQALESQLRQQLEAGKLEFN---EEEYELELRLGRLKQRLDSATATP----EELEQLE 472
Query: 467 KRDDDFGMRDEDWDVYKVINKDAG------DTDSEEEQERL----IELEEILRQHDPEFT 516
D+ E+ + + + +E E L L + LRQ E
Sbjct: 473 INDEALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQ-LRQALDEL- 530
Query: 517 SLNQEQELSPKEANQLH 533
E +LSP+ + LH
Sbjct: 531 ----ELQLSPQAGSLLH 543
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 33.4 bits (77), Expect = 0.14
Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 11/107 (10%)
Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE-FKAWL 420
I + + L K +L+++IE+ + K+ Y E+ L K + V + AE KA
Sbjct: 31 IKDYEALYKENEELKEEIERLEEKLEEYKELEE-------TLQKTLVVAQETAEEVKA-- 81
Query: 421 IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 467
+K+ II + A+ +R + A + +R +
Sbjct: 82 -NAQKEAELIIKEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFR 127
>gnl|CDD|234792 PRK00558, uvrC, excinuclease ABC subunit C; Validated.
Length = 598
Score = 34.7 bits (81), Expect = 0.16
Identities = 22/96 (22%), Positives = 39/96 (40%), Gaps = 20/96 (20%)
Query: 277 AEFYDSNVV--KVQLPYAVPVPNLTTEQQKDRRKELA-------KKLVEM---NAK-KRE 323
+FY + ++ +P+ + L R+ ++ K+L+E+ NA+ E
Sbjct: 298 GQFYLQKSIPKEILVPHELDDEELAEALLAGRKVKIRVPQRGDKKELLELAEKNAREALE 357
Query: 324 ERLVDDERHLNELLELREIVEL--TPS-----DHSH 352
+L D L EL E++ L P D SH
Sbjct: 358 RKLSDQSTQQQALEELAELLGLPEPPYRIECFDISH 393
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 34.6 bits (80), Expect = 0.19
Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 3/133 (2%)
Query: 378 KIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNAR 437
K + + + + A S A A KK+ + +
Sbjct: 46 KKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105
Query: 438 KQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYK---VINKDAGDTDS 494
+++ L K + + ++++ + DDD + D+D D ++D D D
Sbjct: 106 AEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDV 165
Query: 495 EEEQERLIELEEI 507
++E E E +E+
Sbjct: 166 DDEDEEKKEAKEL 178
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 34.3 bits (79), Expect = 0.21
Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 21/117 (17%)
Query: 366 QDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE-AEFKAWLIET- 423
+K + Q+Q++I + KI ++ +AKL K++ E+E A +A LIET
Sbjct: 34 AADDKQLKQIQKEIAALEKKI-------REQQDQRAKLEKQLKSLETEIASLEAQLIETA 86
Query: 424 ------KKKRAYIIDKKNA----RKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 470
+K+ A + + NA +++R+ LA+ AA +R A +D
Sbjct: 87 DDLKKLRKQIADLNARLNALEVQEREQRRRLAE--QLAALQRSGRNPPPALLVSPED 141
Score = 32.4 bits (74), Expect = 0.73
Identities = 29/175 (16%), Positives = 62/175 (35%), Gaps = 10/175 (5%)
Query: 302 QQKDRRKELAKKLVEMNA--KKREERLVDDERHLNELLELREIVELTPS----DHSHARE 355
+ D K+L K++ ++NA E + + R L E L + P +
Sbjct: 84 ETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQ 143
Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
+ I LN + + ++ T ++ A ++ E + +A+
Sbjct: 144 RSVRLAIY-YGALNPARAERIDALKATLKQLAAV--RAEIAAEQAELTTLLSEQRAQQAK 200
Query: 416 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARK-EKRD 469
L E KK A + + +A +++ ++L + E + A+ E
Sbjct: 201 LAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAA 255
Score = 30.5 bits (69), Expect = 3.0
Identities = 33/189 (17%), Positives = 66/189 (34%), Gaps = 21/189 (11%)
Query: 292 AVPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHS 351
AV + + K++ K++ A +++ R D+R +L + + +E +
Sbjct: 25 AVLAAAFSAAADDKQLKQIQKEI---AALEKKIREQQDQR--AKLEKQLKSLETEIASLE 79
Query: 352 HAREAFKSMGIN----NIQDLNKSINQLQQKIEKTK---AKIIAYNNGEDLTEEPKAKLS 404
+ I DLN +N L+ + + + A+ +A P +S
Sbjct: 80 AQLIETA-DDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVS 138
Query: 405 KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRL---ISE 461
PE I ++ +A K + LA R A E+ L +SE
Sbjct: 139 -----PEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSE 193
Query: 462 LARKEKRDD 470
++ +
Sbjct: 194 QRAQQAKLA 202
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 34.4 bits (80), Expect = 0.22
Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 26/132 (19%)
Query: 305 DRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINN 364
+ R E+ K+ E +K EERL E L L ++ ++E+ I
Sbjct: 315 EHRLEVYKRRKEYLLEKLEERLHILEGLLKALNKIDFVIEV----------------IRG 358
Query: 365 IQDLNKSINQLQQKIEKTKAKIIA----YNNGEDLTEEPKAKLSKEIAVPESEAEFKAWL 420
DL K+ +L +++ + +A + LT+E KL KEI E E E
Sbjct: 359 SIDLKKAKKELIEELSEIQADYLLDMRLRR----LTKEEIEKLEKEIE--ELEKEIAELE 412
Query: 421 IETKKKRAYIID 432
++ I
Sbjct: 413 KILASEKKLWIK 424
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 34.2 bits (79), Expect = 0.24
Identities = 30/177 (16%), Positives = 69/177 (38%), Gaps = 20/177 (11%)
Query: 293 VPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHL-NELLEL-------REIVE 344
+P + +E+ + + E K + LV+ L ELL+ +E+V
Sbjct: 84 IPRQSGLSEEVAEAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQELVS 143
Query: 345 LTPS--DHSHAREAFKSMG------INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLT 396
+ D + + L+K + +L+ + E+ + + E L
Sbjct: 144 IFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELL- 202
Query: 397 EEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQ 453
+ + +L + E+ E K +E ++++ + +K ++ RQ+L ++ A Q
Sbjct: 203 SKLEEELLARLESKEAALE-KQLRLEFEREKEEL--RKKYEEKLRQELERQAEAHEQ 256
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth
in archaea or eukaryotes, but is found in mitochondria
or chloroplasts of some eukaryotic species.
Length = 179
Score = 33.0 bits (76), Expect = 0.29
Identities = 18/96 (18%), Positives = 41/96 (42%), Gaps = 13/96 (13%)
Query: 293 VPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSH 352
V +P LT E RRKEL K ++ + + +++ +L + E++ +
Sbjct: 96 VNLPPLTEE----RRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKK 151
Query: 353 AREAFKSMGINNIQDLNK-SINQLQQKIEKTKAKII 387
A E +Q L I ++ + ++ + +++
Sbjct: 152 AEED--------LQKLTDEYIKKIDELLKSKEKELL 179
>gnl|CDD|151681 pfam11239, DUF3040, Protein of unknown function (DUF3040). Some
members in this family of proteins with unknown function
are annotated as membrane proteins however this cannot
be confirmed.
Length = 82
Score = 31.0 bits (71), Expect = 0.34
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 494 SEEEQERLIELEEILRQHDPEFTS 517
SE EQ RL E+E L DP+F +
Sbjct: 4 SEHEQRRLEEIERALYAEDPKFAA 27
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 32.3 bits (74), Expect = 0.37
Identities = 24/99 (24%), Positives = 44/99 (44%), Gaps = 19/99 (19%)
Query: 368 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKR 427
+ K++++ Q KI +DL E + K + + E E E L E +++
Sbjct: 31 ILKALDERQAKIA------------DDLAEAERLKEEAQALLAEYEQE----LEEAREQA 74
Query: 428 AYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKE 466
+ II+ A+K+ Q + + A +E R+ E A E
Sbjct: 75 SEIIE--QAKKEAEQIAEEIKAEAEEELERIK-EAAEAE 110
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 33.5 bits (76), Expect = 0.38
Identities = 46/228 (20%), Positives = 80/228 (35%), Gaps = 32/228 (14%)
Query: 302 QQKDRRKELAKKLVEMNAKKREERLVDDERHL--NELLELREIVELTPSDHSHAREAFKS 359
Q D R +L E+ + E+R + ER +EL + R+ E + + AR+
Sbjct: 75 QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAK 134
Query: 360 MGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAW 419
Q+L + Q Q + K L + ++ + + + S + K+
Sbjct: 135 A----QQELARLTKQAQDLQTRLKTLAEQR---RQLEAQAQSLQASQKQLQASATQLKSQ 187
Query: 420 LIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDW 479
+++ K + A I +Q Q+LA R AA L A ++ RD
Sbjct: 188 VLDLKLRSAQI-------EQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDA-- 238
Query: 480 DVYKVINKD----AGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQE 523
++ K A E + +L LE L QE
Sbjct: 239 ---QISQKAQQIAARAEQIRERERQLQRLET-------AQARLEQEVA 276
>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I. coiled-coil structure. It
has been suggested that the domains I and II from
laminin A, B1 and B2 may come together to form a triple
helical coiled-coil structure.
Length = 263
Score = 33.1 bits (76), Expect = 0.41
Identities = 34/167 (20%), Positives = 67/167 (40%), Gaps = 31/167 (18%)
Query: 292 AVPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELL----ELREIVELTP 347
A+P P ++ ++L+ +L ++A KR +L D E+ L++L L++
Sbjct: 14 ALPAPYKILYNIENLTQQLSGRLSPISATKR--QLEDLEKALSKLAQDAENLQKKANQAL 71
Query: 348 SDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEI 407
+ A + + + + L ++I L+ I++ K+ GE+ P + LS+ +
Sbjct: 72 AKAQKVNAAIERT-LGHAKTLAEAIKNLRDNIKEILEKLAGL--GENAFALPGSALSRML 128
Query: 408 AVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQE 454
A EA+ I R +D K+ A E
Sbjct: 129 A----EAQRMLGEI------------------RERDFQKQLQNAEAE 153
>gnl|CDD|217630 pfam03592, Terminase_2, Terminase small subunit. Packaging of
double-stranded viral DNA concatemers requires
interaction of the prohead with virus DNA. This process
is mediated by a phage-encoded DNA recognition and
terminase protein. The terminase enzymes described so
far, which are hetero-oligomers composed of a small and
a large subunit, do not have a significant level of
sequence homology. The small terminase subunit is
thought to form a nucleoprotein structure that helps to
position the terminase large subunit at the packaging
initiation site.
Length = 141
Score = 31.9 bits (73), Expect = 0.43
Identities = 26/115 (22%), Positives = 40/115 (34%), Gaps = 24/115 (20%)
Query: 424 KKKRAYIIDKKNARK--------QRRQDLAKRRTAAAQERMRLISELAR---KEKRDDDF 472
KK A I + K + ++ KR A E + ++E+AR E
Sbjct: 28 SKKTARSIGSELLTKPKIKAYIDELMKEREKRTIITADEVLEELTEIARGDETELVLVGT 87
Query: 473 GMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFT-SLNQEQELSP 526
+ + KV KD RL LE + + H FT + E E+
Sbjct: 88 KEGEVEKIEKKVKLKD-----------RLKALELLGK-HYGLFTDKVEVEGEVKK 130
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 33.4 bits (77), Expect = 0.43
Identities = 23/123 (18%), Positives = 51/123 (41%), Gaps = 19/123 (15%)
Query: 394 DLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK--NARKQRRQDLAKRRTAA 451
DL +E A++ EI PE E + L + + ++ + ++ +K + + +
Sbjct: 384 DLLDEAGARVRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRI 443
Query: 452 AQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAG---DTDSEEEQERLIELEEIL 508
+ L +E+ D+ +V+ + G E+E+E+L+ LE L
Sbjct: 444 PELEKELEAEVDED--------------DIAEVVARWTGIPVAKLLEDEKEKLLNLERRL 489
Query: 509 RQH 511
++
Sbjct: 490 KKR 492
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 33.0 bits (75), Expect = 0.44
Identities = 36/132 (27%), Positives = 63/132 (47%), Gaps = 9/132 (6%)
Query: 276 DAEFYDSNVVKVQLPYAVPVPNLTTEQQ-------KDRRKELAKKLVEMNAKKREERLVD 328
D E D +V+ +L A+ L T Q+ K+ +EL +KL E+ +K EE L +
Sbjct: 99 DVEHIDVELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEK-EELLKE 157
Query: 329 DERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIA 388
E E E++E ++ ++S E K + + DL K ++L+ +E + ++I+
Sbjct: 158 LEELEAEYEEVQERLKRLEVENSRLEEMLKKL-PGEVYDLKKRWDELEPGVELPEEELIS 216
Query: 389 YNNGEDLTEEPK 400
E L PK
Sbjct: 217 DLVKETLNLAPK 228
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 30.6 bits (70), Expect = 0.49
Identities = 8/36 (22%), Positives = 17/36 (47%)
Query: 363 NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEE 398
I L + +L+ + E+ +A++ + D EE
Sbjct: 24 QEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEE 59
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 31.1 bits (71), Expect = 0.63
Identities = 22/103 (21%), Positives = 42/103 (40%), Gaps = 22/103 (21%)
Query: 301 EQQKDRRKELAKKLVEMNAK-KREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 359
E++++ +K A+KL +M+A+ K E L E+ L EL EL + A++
Sbjct: 25 EKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELK----AEAKKMLSE 80
Query: 360 MGIN-----------------NIQDLNKSINQLQQKIEKTKAK 385
G+ N++ K ++ + K + K
Sbjct: 81 KGLPVELLDLVVGEDAEETKANVKAFKKLFDKAVEAGVKERLK 123
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 33.2 bits (75), Expect = 0.66
Identities = 48/225 (21%), Positives = 85/225 (37%), Gaps = 19/225 (8%)
Query: 298 LTTEQQKDRRKELAKKLVEM----NAKKREERLVDDERHLNELLELREIVELTPSDHSHA 353
E+ K + KK E AKK+ E +D++ +EL + + A
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAE---EDKKKADELKKAAAAKKKADEAKKKA 1427
Query: 354 REAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE 413
E K+ + K ++ ++K E+ K A E+ + +AK E A E
Sbjct: 1428 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADE 1487
Query: 414 AEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARK-------- 465
A+ KA E KK+A K K++ + K A + + E A+K
Sbjct: 1488 AKKKA---EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE-AKKADEAKKAE 1543
Query: 466 EKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQ 510
EK+ D + E+ + K +EE++ + E ++
Sbjct: 1544 EKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588
Score = 30.5 bits (68), Expect = 4.2
Identities = 44/218 (20%), Positives = 87/218 (39%), Gaps = 32/218 (14%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
E++K + E AKK E K E + ++E+ E L+ +E E A E K+
Sbjct: 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEK-----KKAEELKKAE 1656
Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWL 420
N I+ + +K E+ K K E+ + + + A+ + E
Sbjct: 1657 EENKIKA-----AEEAKKAEEDKKK------AEEAKKAEEDEKKAAEALKKEAEE----- 1700
Query: 421 IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRD----DDFGMRD 476
K+A + KK A ++++ + K+ A +E E ++ + D ++ +
Sbjct: 1701 ----AKKAEELKKKEAEEKKKAEELKK---AEEENKIKAEEAKKEAEEDKKKAEEAKKDE 1753
Query: 477 EDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPE 514
E+ + K+ E +E+ +EE L + D +
Sbjct: 1754 EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEK 1791
Score = 30.5 bits (68), Expect = 4.3
Identities = 44/202 (21%), Positives = 82/202 (40%), Gaps = 15/202 (7%)
Query: 302 QQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 361
+Q + E A+K E+ KK EE+ DE E E ++ E A EA K+
Sbjct: 1268 RQAAIKAEEARKADEL--KKAEEKKKADEAKKAE--EKKKADEAK----KKAEEAKKA-- 1317
Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 421
+ K + ++K + K K + + + E E +AE
Sbjct: 1318 ----DEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Query: 422 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDV 481
E KK+A KK A ++++ D AK++ +++ + + A +K+ D+ + E+
Sbjct: 1374 EEAKKKADAA-KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Query: 482 YKVINKDAGDTDSEEEQERLIE 503
K A + +E ++ E
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAE 1454
>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
component [Carbohydrate transport and metabolism].
Length = 472
Score = 32.7 bits (75), Expect = 0.67
Identities = 24/114 (21%), Positives = 49/114 (42%), Gaps = 14/114 (12%)
Query: 298 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIV-----ELTPSDH-- 350
+ + +D + K+ EM + + + N+L +++I+ + S
Sbjct: 337 TSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKKIIFACDAGMGSSAMGA 396
Query: 351 SHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLS 404
S R+ K+ G+N+I N +IN L A ++ + +DLT+ K K+
Sbjct: 397 SILRKKLKNAGLNDISVTNYAINNL-----PADADLVITH--KDLTDRAKRKVP 443
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 32.7 bits (75), Expect = 0.71
Identities = 37/181 (20%), Positives = 78/181 (43%), Gaps = 22/181 (12%)
Query: 318 NAKKRE-ERLVDDERHLNELLELRE-------IVELTPSDHSHAREAFKSMGINNI--QD 367
NA + + ++DD L +L E RE VE SD + + K+ I +D
Sbjct: 145 NATPSDRQDMIDDLLQLGKLEEYRERASDARLGVERVLSDQRGSLDQLKAQ-IEEKEEKD 203
Query: 368 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLS----------KEIAVPESE-AEF 416
L++ +N L+ ++ + +I Y + E + + +E+ E+E +
Sbjct: 204 LHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDL 263
Query: 417 KAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRD 476
+ + ET+++R + ++ ++R ++L + R E ++ E R ++ RD
Sbjct: 264 RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRD 323
Query: 477 E 477
E
Sbjct: 324 E 324
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 32.6 bits (75), Expect = 0.75
Identities = 31/147 (21%), Positives = 63/147 (42%), Gaps = 24/147 (16%)
Query: 297 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 356
NL E L K+V + +++ E+RL +E + L L E+ L+P E
Sbjct: 736 NLRAEMNLSPSAPL--KVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPEEPPVEV 793
Query: 357 FKSMGI--------------NNIQDLNKSINQLQQKIEKTKAKIIAYNNGED-LTEEPKA 401
+G + L K + +L+++I++ + K+ + E + + P+
Sbjct: 794 TAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKL----SNEGFVAKAPEE 849
Query: 402 KLSKEIAVPESEAEFKAWLIETKKKRA 428
+ KE E AE++ L + +++ A
Sbjct: 850 VVEKEK---EKLAEYQVKLAKLEERLA 873
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 31.9 bits (73), Expect = 0.76
Identities = 25/98 (25%), Positives = 42/98 (42%), Gaps = 11/98 (11%)
Query: 299 TTEQQKDRRKELAKKLVEM--NAKKREER-----LVDDERHLNELLELREIVELTPSDHS 351
Q R+K+L +KL E A+K EE+ +E E LE ++ +E
Sbjct: 46 ALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLAKALE 105
Query: 352 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAY 389
+ + ++ L K + L+QKI + +AK A
Sbjct: 106 AELQQAEEQ----VEKLKKQLAALEQKIAELRAKKEAL 139
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 31.7 bits (72), Expect = 0.93
Identities = 33/149 (22%), Positives = 58/149 (38%), Gaps = 20/149 (13%)
Query: 368 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKR 427
++ I + I + II D+ E + +L+ E E E + + +
Sbjct: 1 MSNIIKSTKSVIPAQERGIIPLRQVTDVPET-EQELADEEQEFEQILE------KAEAEA 53
Query: 428 AYIIDKKNAR-KQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGM-RDEDWDVYKVI 485
A II++ A+ R+ + + R +ER RLI E A++E + F E +Y+
Sbjct: 54 AQIIEQAEAQAAAIREQIEQERAQWEEERERLIQE-AKQEGYEAGFQAGESEALSIYQST 112
Query: 486 NKDAGDTDS----------EEEQERLIEL 504
+A E Q +IEL
Sbjct: 113 IDEANAIVEEAKDDYEEKIESAQPLIIEL 141
>gnl|CDD|237358 PRK13351, PRK13351, elongation factor G; Reviewed.
Length = 687
Score = 32.2 bits (74), Expect = 1.1
Identities = 22/86 (25%), Positives = 30/86 (34%), Gaps = 5/86 (5%)
Query: 483 KVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQLHIGVERMCGP 542
V + EE +E +L E L + D E L E E E QL +
Sbjct: 192 TVEEGPIPEELLEEVEEAREKLIEALAEFDDELLELYLEGEELSAE--QLRAPLREGTR- 248
Query: 543 ECLFQPSMLGSIQ--AGISETLNFVL 566
P + GS GI L+ V+
Sbjct: 249 SGHLVPVLFGSALKNIGIEPLLDAVV 274
>gnl|CDD|239776 cd04243, AAK_AK-HSDH-like, AAK_AK-HSDH-like: Amino Acid Kinase
Superfamily (AAK), AK-HSDH-like; this family includes
the N-terminal catalytic domain of aspartokinase (AK) of
the bifunctional enzyme AK- homoserine dehydrogenase
(HSDH). These aspartokinases are found in such bacteria
as E. coli (AKI-HSDHI, ThrA and AKII-HSDHII, MetL) and
in higher plants (Z. mays AK-HSDH). AK and HSDH are the
first and third enzymes in the biosynthetic pathway of
the aspartate family of amino acids. AK catalyzes the
phosphorylation of Asp to P-aspartyl phosphate. HSDH
catalyzes the NADPH-dependent conversion of Asp
3-semialdehyde to homoserine. ThrA and MetL are involved
in threonine and methionine biosynthesis, respectively.
In E. coli, ThrA is subject to allosteric regulation by
the end product L-threonine and the native enzyme is
reported to be tetrameric. As with bacteria, plant AK
and HSDH are feedback inhibited by pathway end products.
Maize AK-HSDH is a Thr-sensitive 180-kD enzyme.
Arabidopsis AK-HSDH is an alanine-activated,
threonine-sensitive enzyme whose ACT domains, located
C-terminal to the AK catalytic domain, were shown to be
involved in allosteric activation. Also included in this
CD is the catalytic domain of the aspartokinase (AK) of
the lysine-sensitive aspartokinase isoenzyme AKIII, a
monofunctional class enzyme (LysC) found in some
bacteria such as E. coli. In E. coli, LysC is reported
to be a homodimer of 50 kD subunits. Also included in
this CD is the catalytic domain of aspartokinase (AK)
of the bifunctional enzyme AK - DAP decarboxylase
(DapDC) found in some bacteria. DapDC, which is the lysA
gene product, catalyzes the decarboxylation of DAP to
lysine.
Length = 293
Score = 31.8 bits (73), Expect = 1.1
Identities = 21/87 (24%), Positives = 41/87 (47%), Gaps = 8/87 (9%)
Query: 309 ELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDL 368
LA+ + + ERHL+ + EL L+ + A S+ + ++DL
Sbjct: 48 ALAELAASGDDAQAIVLQEIRERHLDLIKEL-----LSGESAAELLAALDSL-LERLKDL 101
Query: 369 NKSINQLQQKIEKTKAKIIAYNNGEDL 395
+ I L + +KT+A+++++ GE L
Sbjct: 102 LEGIRLLGELSDKTRAEVLSF--GELL 126
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 31.9 bits (73), Expect = 1.2
Identities = 22/147 (14%), Positives = 47/147 (31%), Gaps = 15/147 (10%)
Query: 297 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 356
+ DR +EL +++E ++ + + E L EL I E +
Sbjct: 794 GINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQ--------KN 845
Query: 357 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 416
K + ++ ++ ++Q E N + +PK IA
Sbjct: 846 LKQEIKLELSEIQEAEEEIQNINEN-------KNEFVEFKNDPKKLNKLIIAKDVLIKLV 898
Query: 417 KAWLIETKKKRAYIIDKKNARKQRRQD 443
+ + ++ KK+ K +
Sbjct: 899 ISSDEIKQDEKTTKKKKKDLEKTDEEA 925
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 31.8 bits (73), Expect = 1.4
Identities = 34/130 (26%), Positives = 61/130 (46%), Gaps = 14/130 (10%)
Query: 411 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 470
+E K +IE KK A + KK A + ++++ K R +E +EL R E+R
Sbjct: 29 SAEELAKR-IIEEAKKEAETL-KKEALLEAKEEVHKLRAELERELKERRNELQRLERRLL 86
Query: 471 DFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQE------- 523
R+E D K+ + D + + E++++ L E+ L + + E L EQ
Sbjct: 87 Q---REETLD-RKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERIS 142
Query: 524 -LSPKEANQL 532
L+ +EA ++
Sbjct: 143 GLTQEEAKEI 152
>gnl|CDD|234275 TIGR03599, YloV, DAK2 domain fusion protein YloV. This model
describes a protein family that contains an N-terminal
DAK2 domain (pfam02734), so named because of similarity
to the dihydroxyacetone kinase family family. The
GTP-binding protein CgtA (a member of the obg family) is
a bacterial GTPase associated with ribosome biogenesis,
and it has a characteristic extension (TIGR03595) in
certain lineages. This protein family described here was
found, by the method of partial phylognetic profiling,
to have a phylogenetic distribution strongly correlated
to that of TIGR03595. This correlation implies some form
of functional coupling.
Length = 530
Score = 31.8 bits (73), Expect = 1.5
Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 6/47 (12%)
Query: 353 AREAFKSMGINNI----QDLNKSINQLQQKIEKTKAK--IIAYNNGE 393
E FKS+G + + Q +N S + + IEK AK I+ NN
Sbjct: 326 IAELFKSLGADVVIEGGQTMNPSTEDILKAIEKVNAKNVIVLPNNKN 372
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 31.4 bits (72), Expect = 1.6
Identities = 28/174 (16%), Positives = 70/174 (40%), Gaps = 13/174 (7%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
++K+ R+E L E ++ + R + E L E ++ EI+E + + +
Sbjct: 63 RKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREK 122
Query: 361 GINNIQDLNKSINQLQQKIEKTKA-------KIIAYNNGEDLTEEPKAKLSKEIAVPESE 413
+++++ + ++ E+ K KI+ Y + EE + +E +
Sbjct: 123 QKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAERRER-----K 177
Query: 414 AEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 467
E + + + ++ D++ + R DL + +ER + E ++ +
Sbjct: 178 EEKEREVARLRAQQEEAEDEREELDELRADLYQEE-YERKERQKEKEEAEKRRR 230
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 31.2 bits (71), Expect = 1.6
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 422 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
E ++KR I + K A QRR DL+ + R + +L + KR
Sbjct: 86 EIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKR 132
Score = 29.3 bits (66), Expect = 7.7
Identities = 30/135 (22%), Positives = 61/135 (45%), Gaps = 13/135 (9%)
Query: 333 LNELLELREIVE--LTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYN 390
L E EL++ VE L + + + A + + + +NQ++ +I + K +I
Sbjct: 33 LLENEELKQKVEEALEGATNEDGKLA-ADLLKLEVARKKERLNQIRARISQLKEEIEQ-- 89
Query: 391 NGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK-NARKQRRQDLAKRRT 449
E +L + +A S+ ++ + +K+RA ++K + K+ R L +
Sbjct: 90 -----KRERIEELKRALAQRRSDLSSASY--QLEKRRASQLEKLQDEIKRTRSKLNALHS 142
Query: 450 AAAQERMRLISELAR 464
A++R L ELA+
Sbjct: 143 LLAEKRSFLCRELAK 157
>gnl|CDD|237301 PRK13200, psaA, photosystem I P700 chlorophyll a apoprotein A1;
Provisional.
Length = 766
Score = 31.3 bits (71), Expect = 1.9
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 4/48 (8%)
Query: 610 HFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQEKGGEFFRVH 657
HF +LA+ P+ W ++L + HDF SD +E + F H
Sbjct: 33 HFDRTLAKGPKTTTWI----WNLHADAHDFDTHTSDLEEVSRKIFSAH 76
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 30.8 bits (70), Expect = 2.2
Identities = 22/90 (24%), Positives = 32/90 (35%), Gaps = 10/90 (11%)
Query: 364 NIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIET 423
+L +N L + + A I A E KA L ++ A+ E+ A E
Sbjct: 177 LQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAALEEQKALAEAAAA------EA 230
Query: 424 KKKRAYIIDKKNARKQRRQDLAKRRTAAAQ 453
K+ A A Q + L TAA
Sbjct: 231 AKQEA----AAKAAAQEQAALQAAATAAQP 256
Score = 29.7 bits (67), Expect = 5.1
Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 7/50 (14%)
Query: 363 NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPES 412
N I+ L+ I ++Q KI++ + +I D ++ KL KEIA +
Sbjct: 52 NEIESLDNQIEEIQSKIDELQKEI-------DQSKAEIKKLQKEIAELKE 94
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 31.4 bits (71), Expect = 2.3
Identities = 57/234 (24%), Positives = 95/234 (40%), Gaps = 54/234 (23%)
Query: 307 RKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQ 366
R EL +K E +D E+ E EL+E +++ H+ E + + +IQ
Sbjct: 1071 RTELDASRALAEKQKHE---LDTEKRCAE--ELKEAMQMAMEGHARMLEQYADLEEKHIQ 1125
Query: 367 DLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE------AEFKAWL 420
L + ++Q+ I+ K KA + ES+ AE A
Sbjct: 1126 LLARH-RRIQEGIDDVK----------------KAAARAGVRGAESKFINALAAEISALK 1168
Query: 421 IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISE---LARKEKRDDDFGMRDE 477
+E +K+R Y+ D+ + + + +D A+ AA + +RL L +KR D E
Sbjct: 1169 VEREKERRYLRDENKSLQAQLRDTAEAVQAAGELLVRLKEAEEALTVAQKRAMD--AEQE 1226
Query: 478 DWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELS--PKEA 529
+ YK I+K + R+H+ E ++LNQ S PKEA
Sbjct: 1227 AAEAYKQIDK-------------------LKRKHENEISTLNQLVAESRLPKEA 1261
>gnl|CDD|212668 cd10225, MreB_like, MreB and similar proteins. MreB is a bacterial
protein which assembles into filaments resembling those
of eukaryotic F-actin. It is involved in determining the
shape of rod-like bacterial cells, by assembling into
large fibrous spirals beneath the cell membrane. MreB
has also been implicated in chromosome segregation;
specifically MreB is thought to bind to and segregate
the replication origin of bacterial chromosomes.
Length = 320
Score = 30.9 bits (71), Expect = 2.4
Identities = 11/47 (23%), Positives = 18/47 (38%), Gaps = 8/47 (17%)
Query: 577 LANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDA 623
L I +TG L G E ++++ V +AE+P
Sbjct: 270 LDRGIVLTGGGALLRGLDELISEET--------GLPVRVAEDPLTCV 308
>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor. The ribosome recycling
factor (RRF / ribosome release factor) dissociates the
ribosome from the mRNA after termination of translation,
and is essential bacterial growth. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis.
Length = 165
Score = 30.2 bits (69), Expect = 2.4
Identities = 27/94 (28%), Positives = 41/94 (43%), Gaps = 21/94 (22%)
Query: 293 VPVPNLTTEQQKDRRKELAKKLVEM--NAKK--REERLVDDERHLNELLELREIVELTPS 348
+P+P LT E RRKEL K+ ++ AK R R R N+ L+ E +
Sbjct: 82 IPIPPLTEE----RRKELVKQAKKLAEEAKVAIRNIR-----RDANDKLKKLEKDKEISE 132
Query: 349 DHSHAREAFKSMGINNIQDL-NKSINQLQQKIEK 381
D E IQ L +K I ++ + ++K
Sbjct: 133 DEVKRAEK-------EIQKLTDKYIKKIDELLKK 159
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.0 bits (70), Expect = 2.4
Identities = 43/184 (23%), Positives = 70/184 (38%), Gaps = 36/184 (19%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
+ + +E KKL + + ++R + R EL + A +A K
Sbjct: 60 KPAAKKEQERQKKLEQQAEEAEKQRAAEQAR----QKELEQRAA--------AEKAAK-- 105
Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKL-------SKEIAVPESE 413
+ K + Q++ E+ KAK A E KAK +KE A ++E
Sbjct: 106 ---QAEQAAKQAEEKQKQAEEAKAKQAA---------EAKAKAEAEAEKKAKEEAKKQAE 153
Query: 414 AEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFG 473
E KA KK+A KK + + + AK + A E + +E A K K +
Sbjct: 154 EEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK--AKAEEAKAKAEAA-KAKAAAEAA 210
Query: 474 MRDE 477
+ E
Sbjct: 211 AKAE 214
>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus
AldA1 (SACOL0154) aldehyde dehydrogenase-like.
Uncharacterized aldehyde dehydrogenase from
Staphylococcus aureus (AldA1, locus SACOL0154) and other
similar sequences are present in this CD.
Length = 475
Score = 30.5 bits (69), Expect = 3.0
Identities = 24/105 (22%), Positives = 50/105 (47%), Gaps = 15/105 (14%)
Query: 339 LREIVELTPSDHSHA----REAFKSMGINNIQDLNKSINQLQQKIEKTK---AKIIAYNN 391
L EI + T +D A +EAFK+ + + +N++ I++ K A + +N
Sbjct: 29 LSEITDATDADVDRAVKAAQEAFKTWRKTTVAERANILNKIADIIDENKELLAMVETLDN 88
Query: 392 GEDLTEEPKAKLSKEIAVPESEAEFK--AWLIETKKKRAYIIDKK 434
G+ + E ++ + +P + F+ A +I ++ A +ID+
Sbjct: 89 GKPIRE------TRAVDIPLAADHFRYFAGVIRAEEGSANMIDED 127
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 30.8 bits (69), Expect = 3.2
Identities = 26/122 (21%), Positives = 45/122 (36%), Gaps = 10/122 (8%)
Query: 392 GEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAA 451
D P+ + ++ PE EF + + I+K R RR + R +A
Sbjct: 342 TGDSDHGPEKPQATKLE-PEVADEFADRVAKKVADMTAEIEKLKKRHARRMEKFNRTSAL 400
Query: 452 AQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEE---EQERLIELEEIL 508
RL A D G E W + + D D S + E + ++++I+
Sbjct: 401 KDAEQRLRDAAADPA----DTG--SEIWRLEGRLEIDTEDDSSRQGPREHKAKYKVDDIV 454
Query: 509 RQ 510
R+
Sbjct: 455 RE 456
>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
Length = 417
Score = 30.6 bits (69), Expect = 3.2
Identities = 21/81 (25%), Positives = 38/81 (46%), Gaps = 5/81 (6%)
Query: 298 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAF 357
L +E R+ +L + M + RE+ L ER + + R +E+ A E
Sbjct: 182 LESELVSRRKNDLKQ----MYEEDREDYLGKLERDITKFFVDRGFLEIKSPILIPA-EYI 236
Query: 358 KSMGINNIQDLNKSINQLQQK 378
+ MGI+N +L+K I ++ +
Sbjct: 237 ERMGIDNDTELSKQIFRVDKN 257
>gnl|CDD|222317 pfam13684, Dak1_2, Dihydroxyacetone kinase family. This is the
kinase domain of the dihydroxyacetone kinase family.
Length = 309
Score = 30.2 bits (69), Expect = 3.4
Identities = 29/94 (30%), Positives = 45/94 (47%), Gaps = 9/94 (9%)
Query: 323 EERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNI----QDLNKSINQLQQK 378
EE L +E + L+ +V + P + E FKS+G++ + Q +N S +
Sbjct: 76 EEILEKEEEDEPKELKDYGVVAVAPGEG--IAELFKSLGVDVVIEGGQTMNPSTEDILAA 133
Query: 379 IEKTKAK--IIAYNNGED-LTEEPKAKLSKEIAV 409
IEK AK I+ NN L E A+L+K+ V
Sbjct: 134 IEKVNAKNVIVLPNNKNIILAAEQAAELAKDKNV 167
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 29.9 bits (67), Expect = 3.5
Identities = 24/131 (18%), Positives = 39/131 (29%), Gaps = 7/131 (5%)
Query: 398 EPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMR 457
K K + E VP EAE A R + K++ Q L ++ + QE
Sbjct: 66 GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDG 125
Query: 458 LISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTS 517
E + D D D + EE E +EE + +
Sbjct: 126 PGKEP--QLDEDKFLLAEDSD-----DRQETLEAGKVHEETEDSYHVEETASEQYKQDMK 178
Query: 518 LNQEQELSPKE 528
++ +
Sbjct: 179 EKASEQENEDS 189
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 30.1 bits (68), Expect = 3.6
Identities = 11/65 (16%), Positives = 24/65 (36%), Gaps = 5/65 (7%)
Query: 473 GMRDEDWDVYKVIN-----KDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPK 527
+ D D + + K D ++ QE L E + L+ + + + E + S +
Sbjct: 338 NLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEE 397
Query: 528 EANQL 532
+
Sbjct: 398 AEAKA 402
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 30.0 bits (68), Expect = 3.7
Identities = 29/159 (18%), Positives = 66/159 (41%), Gaps = 8/159 (5%)
Query: 300 TEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 359
+Q E ++LV+ K+ + L D ++ L E +L+E+ ARE +
Sbjct: 122 KQQTSVLTPEEERELVQ-KIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEK 180
Query: 360 MG--INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFK 417
+ N Q+ ++ + +L ++ ++ + + + E+ E K + E +
Sbjct: 181 IQELANEAQEYHEEMIKLFEEADELRKEADELH--EEFVELSKKIDELHEEFRNLQNELR 238
Query: 418 AWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERM 456
+ K RA +K A+++ +++ K R E+
Sbjct: 239 ELEKKIKALRAK---EKAAKRREKREELKERAEEIYEKF 274
Score = 29.6 bits (67), Expect = 4.6
Identities = 32/178 (17%), Positives = 68/178 (38%), Gaps = 21/178 (11%)
Query: 300 TEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 359
E+ K++R EL K+ E+ K+ E LN ++RE+ E E +
Sbjct: 22 IEELKEKRDELRKEASELAEKRDE---------LNA--KVRELREKAQELREERDEINEE 70
Query: 360 MGI--NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFK 417
+ ++N + +L+++ + K K +N G L +EI E K
Sbjct: 71 VQELKEKRDEINAKLQELRKEYRELKEKRNEFNLG----GRSIKSLEREI----ERLEKK 122
Query: 418 AWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 475
+ + +K ++ + AK+ ++ L +E+ +K+ + +
Sbjct: 123 QQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEK 180
>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein. This family
consists of the p50 and variable adherence-associated
antigen (Vaa) adhesins from Mycoplasma hominis. M.
hominis is a mycoplasma associated with human urogenital
diseases, pneumonia, and septic arthritis. An adhesin is
a cell surface molecule that mediates adhesion to other
cells or to the surrounding surface or substrate. The
Vaa antigen is a 50-kDa surface lipoprotein that has
four tandem repetitive DNA sequences encoding a periodic
peptide structure, and is highly immunogenic in the
human host. p50 is also a 50-kDa lipoprotein, having
three repeats A,B and C, that may be a tetramer of
191-kDa in its native environment.
Length = 353
Score = 30.1 bits (67), Expect = 3.8
Identities = 24/96 (25%), Positives = 42/96 (43%), Gaps = 6/96 (6%)
Query: 295 VPNLTTEQQK--DRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSH 352
V N E++ D+ ++AK+L E NAK + + + ++ NE EL + V T ++
Sbjct: 99 VENAKNEKKAIDDKNAQIAKELAERNAKI-QSNIEELKKINNEAFELSKTVNKTIAEVEK 157
Query: 353 A---REAFKSMGINNIQDLNKSINQLQQKIEKTKAK 385
+ FK N DL Q+ + T +
Sbjct: 158 KFKIPKDFKEQLENFADDLLDKSRQIDEFTTVTSTQ 193
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 29.2 bits (66), Expect = 3.8
Identities = 16/79 (20%), Positives = 35/79 (44%), Gaps = 4/79 (5%)
Query: 421 IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWD 480
+ K++ I ++K R++R+Q+L K+ +ER + L + ++D D +
Sbjct: 42 AKEKEREERIEERKRIREERKQELEKQL----KERKEALKLLEEENDDEEDAETEDTEDV 97
Query: 481 VYKVINKDAGDTDSEEEQE 499
T ++ E+E
Sbjct: 98 EDDEWEGFPEPTVTDYEEE 116
>gnl|CDD|130402 TIGR01335, psaA, photosystem I core protein PsaA. The core
proteins of photosystem I are PsaA and PsaB, homologous
integral membrane proteins that form a heterodimer. The
heterodimer binds the electron-donating chlorophyll
dimer P700, as well as chlorophyll, phylloquinone, and
4FE-4S electron acceptors. This model describes PsaA
only [Energy metabolism, Photosynthesis].
Length = 752
Score = 30.3 bits (68), Expect = 4.2
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 610 HFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQEKGGEFFRVH 657
HFS +LA+ P+ W ++L + HDF SD ++ + F H
Sbjct: 34 HFSRTLAKGPKTTTWI----WNLHADAHDFDSHTSDLEDISRKIFSAH 77
>gnl|CDD|176459 cd03586, PolY_Pol_IV_kappa, DNA Polymerase IV/Kappa. Pol IV, also
known as Pol kappa, DinB, and Dpo4, is a translesion
synthesis (TLS) polymerase. Translesion synthesis is a
process that allows the bypass of a variety of DNA
lesions. TLS polymerases lack proofreading activity and
have low fidelity and low processivity. They use
damaged DNA as templates and insert nucleotides opposite
the lesions. Known primarily as Pol IV in prokaryotes
and Pol kappa in eukaryotes, this polymerase has a
propensity for generating frameshift mutations. The
eukaryotic Pol kappa differs from Pol IV and Dpo4 by an
N-terminal extension of ~75 residues known as the
"N-clasp" region. The structure of Pol kappa shows DNA
that is almost totally encircled by Pol kappa, with the
N-clasp region augmenting the interactions between DNA
and the polymerase. Pol kappa is more resistant than Pol
eta and Pol iota to bulky guanine adducts and is
efficient at catalyzing the incorporation of dCTP.
Bacterial pol IV has a higher error rate than other
Y-family polymerases.
Length = 334
Score = 30.2 bits (69), Expect = 4.2
Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 8/89 (8%)
Query: 355 EAFKSMGINNIQDL-NKSINQLQQKIEKTKAKIIAYNNGED---LTEEPKAK-LSKEIAV 409
E K +GI I DL + L++ K+ ++ G D + + + K + E
Sbjct: 185 EKLKELGIKTIGDLAKLDVELLKKLFGKSGRRLYELARGIDNRPVEPDRERKSIGVERTF 244
Query: 410 PE---SEAEFKAWLIETKKKRAYIIDKKN 435
E E L+E ++ A + K+
Sbjct: 245 SEDLTDPEELLEELLELAEELAERLRKRG 273
>gnl|CDD|233252 TIGR01056, topB, DNA topoisomerase III, bacteria and conjugative
plasmid. This model describes topoisomerase III from
bacteria and its equivalents encoded on plasmids. The
gene is designated topB if found in the bacterial
chromosome, traE on conjugative plasmid RP4, etc. These
enzymes are involved in the control of DNA topology. DNA
topoisomerase III belongs to the type I topoisomerases,
which are ATP-independent [DNA metabolism, DNA
replication, recombination, and repair].
Length = 660
Score = 30.2 bits (68), Expect = 4.3
Identities = 27/108 (25%), Positives = 46/108 (42%), Gaps = 14/108 (12%)
Query: 390 NNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRR-------- 441
N + ++ K KE +EA +A +IE KR +I KKN +
Sbjct: 495 NPAAFVQDKGLKKTLKETKGLGTEAT-RADIIENLFKRGFIQKKKNKIYITKNGKLLCLL 553
Query: 442 --QDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINK 487
+ L K A E+ ++ ++ EK DDDF + + + + IN+
Sbjct: 554 LPELLTKPDLTAQWEQY--LNGISAGEKDDDDF-INTINEMIKQTINE 598
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 29.5 bits (67), Expect = 4.3
Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 15/98 (15%)
Query: 347 PSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEP------K 400
PS + + ++ L K + +L+Q+I + +A+I G + TEE
Sbjct: 60 PSQALNKLK-------TRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEEL 112
Query: 401 AKLSKEIAVPESE-AEFKAWLIET-KKKRAYIIDKKNA 436
+L KE+ ++E +++ E +K + K A
Sbjct: 113 KQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAKEA 150
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 30.4 bits (68), Expect = 4.3
Identities = 44/239 (18%), Positives = 95/239 (39%), Gaps = 27/239 (11%)
Query: 302 QQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 361
Q+K + +E+N K +++ + +HL + +L + R+ F+
Sbjct: 832 QEKQHELDTVVSKIELNRKLIQDQ-QEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQL 890
Query: 362 ---INNIQDLNKSINQLQQKI---EKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
+Q L + I +++ E K D E+ + SKE + +++ +
Sbjct: 891 VELSTEVQSLIREIKDAKEQDSPLETFLEK--------DQQEKEELISSKETSNKKAQDK 942
Query: 416 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 475
+ K Y+ D +N + + D K++ E + ++L EK +
Sbjct: 943 VNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKE---TELNTVNAQLEECEKHQEKI--- 996
Query: 476 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQLHI 534
+ED + D D+++ QER ++ LR+ + E + +E + KE Q+ +
Sbjct: 997 NED------MRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQV 1049
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 30.4 bits (69), Expect = 4.4
Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 20/99 (20%)
Query: 315 VEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQ 374
+E+N KKRE EL +LR+ +E H A + + I +L++ I Q
Sbjct: 94 IELN-KKREA----------ELAKLRKDLEEANLQHEEALATLRKKHQDAINELSEQIEQ 142
Query: 375 LQ---QKIEKTKAKIIAYNNGEDLT----EEPKAKLSKE 406
LQ K EK K+++ A +DL + KAKL+ E
Sbjct: 143 LQKQKAKAEKEKSQLQAEV--DDLLAQLDQIAKAKLNAE 179
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 29.6 bits (67), Expect = 4.4
Identities = 19/105 (18%), Positives = 45/105 (42%), Gaps = 4/105 (3%)
Query: 363 NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIE 422
++ LNK++ L+ ++E + ++ + E + ++++ + E E +A IE
Sbjct: 38 AELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIE 97
Query: 423 TKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 467
+ ++ N+ + +L + +E L L R EK
Sbjct: 98 IQI----AKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEK 138
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 29.3 bits (66), Expect = 5.1
Identities = 19/102 (18%), Positives = 41/102 (40%), Gaps = 10/102 (9%)
Query: 363 NNIQDLNKSINQL---QQKIEKTK---AKIIAYNNGEDLTEEPKAKLSKEIAVPESEA-- 414
N++ L +++L +K +K K A I A + E++ P + +P
Sbjct: 109 LNVKRLKAYLSKLVLFPRKGKKPKKGFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPK 168
Query: 415 EFKAWLI--ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQE 454
KA +I E + AY ++ + +++ A ++
Sbjct: 169 REKARVITKEERAFNAYRTLRQAKLNAKFVGKKEKKKQAKEK 210
>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein. This family includes FlxA from
E. coli. The expression of FlxA is regulated by the FliA
sigma factor, a transcription factor specific for class
3 flagellar operons. However FlxA is not required for
flagellar function or formation.
Length = 106
Score = 28.2 bits (63), Expect = 5.2
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 372 INQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEI 407
I LQ++I+ ++ ++ EDLT E K K + I
Sbjct: 21 IKSLQKQIQSLTKQLKELSSSEDLTAEEKQKQQQLI 56
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 29.9 bits (67), Expect = 5.6
Identities = 17/122 (13%), Positives = 39/122 (31%), Gaps = 9/122 (7%)
Query: 405 KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELAR 464
+E P E ++ +K + + R + + +++ S
Sbjct: 763 EEFEYPRMYEEASG---KSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVE 819
Query: 465 KEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQEL 524
E R+E K I+ A +++ ++ + IL Q + S ++
Sbjct: 820 TEGERCTIKQREE-----KGIDAPAILNVKKKKPYKVDNMTRILPQQSR-YISFIKDDRF 873
Query: 525 SP 526
P
Sbjct: 874 VP 875
>gnl|CDD|180481 PRK06231, PRK06231, F0F1 ATP synthase subunit B; Validated.
Length = 205
Score = 29.0 bits (65), Expect = 5.7
Identities = 20/88 (22%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 300 TEQQKDRRKELAKKLVEM--NAKKREERLVDD--ERHLNELLELREIVELTPSDHSHARE 355
T++ ++RKEL + + K++ ++L+++ +RH N L + +EI++ + +
Sbjct: 73 TQRFLNKRKELIEAEINQANELKQQAQQLLENAKQRHENALAQAKEIIDQANYEALQLKS 132
Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTK 383
+ + N I Q +Q+IEK +
Sbjct: 133 ELEK---EANRQANLIIFQARQEIEKER 157
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 29.7 bits (67), Expect = 6.0
Identities = 22/146 (15%), Positives = 48/146 (32%), Gaps = 14/146 (9%)
Query: 363 NNIQDLNKSINQLQQKIEKTKAKIIAYN-NGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 421
++ L + + + ++E+ + GE L A L + E +
Sbjct: 293 KELKKLENKLEKQEDELEELEKAAEELRQKGELLY----ANLQL---IEEGLKSVRLADF 345
Query: 422 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDV 481
++ +DK + Q K+ ++ L +L+ ++ +
Sbjct: 346 YGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAK----- 400
Query: 482 YKVINKDAGDTDSEEEQERLIELEEI 507
+ K G EE +E LIE +
Sbjct: 401 -TALEKAEGKKAIEEIREELIEEGLL 425
>gnl|CDD|199850 cd03868, M14_CPD_I, Peptidase M14 carboxypeptidase subfamily
N/E-like; Carboxypeptidase D, domain I subgroup. The
first carboxypeptidase (CP)-like domain of
Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD
differs from all other metallocarboxypeptidases in that
it contains multiple CP-like domains. CPD belongs to the
N/E-like subfamily of the M14 family of
metallocarboxypeptidases (MCPs).The M14 family are
zinc-binding CPs which hydrolyze single, C-terminal
amino acids from polypeptide chains, and have a
recognition site for the free C-terminal carboxyl group,
which is a key determinant of specificity. CPD is a
single-chain protein containing a signal peptide, three
tandem repeats of CP-like domains separated by short
bridge regions, followed by a transmembrane domain, and
a C-terminal cytosolic tail. The first two CP-like
domains of CPD contain all of the essential active site
and substrate-binding residues, the third CP-like domain
lacks critical residues necessary for enzymatic activity
and is inactive towards standard CP substrates. Domain I
is optimally active at pH 6.3-7.5 and prefers substrates
with C-terminal Arg, whereas domain II is active at pH
5.0-6.5 and prefers substrates with C-terminal Lys. This
Domain I family contains two contiguous surface
cysteines that may become palmitoylated and target the
enzyme to membranes, thus regulating intracellular
trafficking. CPD functions in the processing of proteins
that transit the secretory pathway, and is present in
all vertebrates as well as Drosophila. It is broadly
distributed in all tissue types. Within cells, CPD is
present in the trans Golgi network and immature
secretory vesicles, but is excluded from mature
vesicles. It is thought to play a role in the processing
of proteins that are initially processed by furin or
related endopeptidases present in the trans Golgi
network, such as growth factors and receptors. CPD is
implicated in the pathogenesis of lupus erythematosus
(LE), it is regulated by TGF-beta in various cell types
of murine and human origin and is significantly
down-regulated in CD14 positive cells isolated from
patients with LE. As down-regulation of CPD leads to
down-modulation of TGF-beta, CPD may have a role in a
positive feedback loop. In D. melanogaster, the CPD
variant 1B short (DmCPD1Bs) is necessary and sufficient
for viability of the fruit fly.
Length = 293
Score = 29.2 bits (66), Expect = 6.1
Identities = 21/68 (30%), Positives = 24/68 (35%), Gaps = 12/68 (17%)
Query: 587 LCQLPGFVER---LNKDLLENRPFQSHFSVSLAEN----PELDA---WSGARQFSLSENF 636
C G+ R N DL NR F F PE A W + F LS N
Sbjct: 116 DCSCGGYGGRGNANNVDL--NRNFPDQFEGKHVRAQERQPETLAMINWIRSNPFVLSANL 173
Query: 637 HDFAVTQS 644
H +V S
Sbjct: 174 HGGSVVAS 181
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 29.7 bits (67), Expect = 6.2
Identities = 21/106 (19%), Positives = 39/106 (36%), Gaps = 21/106 (19%)
Query: 298 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELRE------IVELTPSDHS 351
L ++ D +EL K++ + ER + L L EL
Sbjct: 854 LAAVERSDTYRELRKRIAAL------ERTLVRAGGGLSLEALVAEAAALDPDELPARLEE 907
Query: 352 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTE 397
AR+ I++L + +N+L Q++ K ++ + G E
Sbjct: 908 LARD---------IEELEEELNELAQEVGAAKQELARMDGGSTAAE 944
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 29.3 bits (66), Expect = 6.4
Identities = 30/164 (18%), Positives = 64/164 (39%), Gaps = 15/164 (9%)
Query: 290 PYAVPVPNLTTEQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVE 344
P+A L+ Q+K R+ EL K+ E A++ + R+ +ER + ++ +
Sbjct: 48 PHAKKAFELSKMQEKTRQAELEAKIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQ 107
Query: 345 LTPSDHSHAREAF----KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPK 400
AR+ + + N + L + ++ +A T E +
Sbjct: 108 RAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEE 167
Query: 401 AKLSKEIAVPESEAEFKAWLIETKKK-----RAYIIDKKNARKQ 439
A+L +E + EAE + + +++ R + K N ++
Sbjct: 168 AELERENIRAKIEAEARG-RAKEERENEDINREMLKLKANEERE 210
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 27.8 bits (62), Expect = 6.5
Identities = 26/84 (30%), Positives = 36/84 (42%), Gaps = 21/84 (25%)
Query: 399 PKAKLS---KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQER 455
PK+KL+ K + + KA E K D+K A +QRR K R AA +E+
Sbjct: 23 PKSKLTSWEKRMEKRLEQQAIKAREKELK-------DEKEAERQRRIQAIKERRAAKEEK 75
Query: 456 MRL-----------ISELARKEKR 468
R + L R+EKR
Sbjct: 76 ERYEKMAAKMHAKKVERLKRREKR 99
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 29.5 bits (67), Expect = 6.6
Identities = 17/92 (18%), Positives = 43/92 (46%), Gaps = 5/92 (5%)
Query: 318 NAKKREERLVDDERHLNE-LLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQ 376
K+ + L ++ + L E L +++++++ P AR ++ L ++ +L
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPK---KARGQLPPEKREQLEKLLETKEKLS 387
Query: 377 QKIEKTKAKI-IAYNNGEDLTEEPKAKLSKEI 407
+++E+ + ++ E L E K ++K I
Sbjct: 388 EELEELEEELKELKEELESLYSEGKISVNKTI 419
>gnl|CDD|178850 PRK00083, frr, ribosome recycling factor; Reviewed.
Length = 185
Score = 28.9 bits (66), Expect = 6.6
Identities = 11/22 (50%), Positives = 15/22 (68%), Gaps = 4/22 (18%)
Query: 293 VPVPNLTTEQQKDRRKELAKKL 314
+P+P LT E RRKEL K++
Sbjct: 100 LPIPPLTEE----RRKELVKQV 117
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 29.7 bits (67), Expect = 6.8
Identities = 25/130 (19%), Positives = 52/130 (40%), Gaps = 13/130 (10%)
Query: 419 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 478
L+ KKR + + Q Q A+ + AA + + R ++R + R+E
Sbjct: 20 VLVVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREEL 79
Query: 479 WDVYKVINKDAGDTDSEEE-----QERLIELEEILRQHDPEFTSLNQEQE--------LS 525
+ + + D+ E + +L E E+ L + E L ++ + L+
Sbjct: 80 QREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLT 139
Query: 526 PKEANQLHIG 535
P++A +L +
Sbjct: 140 PEQARKLLLK 149
>gnl|CDD|212669 cd10227, ParM_like, Plasmid segregation protein ParM and similar
proteins. ParM is a plasmid-encoded bacterial homolog
of actin, which polymerizes into filaments similar to
F-actin, and plays a vital role in plasmid segregation.
ParM filaments segregate plasmids paired at midcell into
the individual daughter cells. This subfamily also
contains Thermoplasma acidophilum Ta0583, an active
ATPase at physiological temperatures, which has a
propensity to form filaments.
Length = 312
Score = 29.3 bits (66), Expect = 7.1
Identities = 16/92 (17%), Positives = 36/92 (39%), Gaps = 3/92 (3%)
Query: 190 EGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSIT 249
+ + ++I G T V+ NG S + ++LG + + K L +Y ++
Sbjct: 165 KDKKVLVIDIGGGTTDVVVFDNGKPVESSSGSLELGVSDLYEAIAKELNKEYGI---DLS 221
Query: 250 PSRSEELLWDYGFVATDYREHLRKWLDAEFYD 281
EE+L + E + + ++ +
Sbjct: 222 DEEIEEILRNGKIKNYGKEEDITEIIEEAAEE 253
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 28.9 bits (65), Expect = 7.3
Identities = 26/149 (17%), Positives = 43/149 (28%), Gaps = 18/149 (12%)
Query: 389 YNNGEDLTEEPKAKL-SKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKR 447
E + +A+L E A + A + + K + +D KR
Sbjct: 49 GTEDELRKRDLRAELEEAERAHKSKKENKLA-IEDADKSTNLDASNEGDEDDDEEDEIKR 107
Query: 448 RTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEI 507
+ R + A D D+D D +E L ELE+I
Sbjct: 108 KRIEEDAR----NSDADDSDSSSDSDSSDDD---------SDDDDSEDETAALLRELEKI 154
Query: 508 LRQHDPEFTSL---NQEQELSPKEANQLH 533
++ E +E +E L
Sbjct: 155 KKERAEEKEREEEEKAAEEEKAREEEILT 183
>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b. This model
describes the F1/F0 ATP synthase b subunit in bacteria
only. Scoring just below the trusted cutoff are the
N-terminal domains of Mycobacterial b/delta fusion
proteins and a subunit from an archaeon, Methanosarcina
barkeri, in which the ATP synthase homolog differs in
architecture and is not experimentally confirmed. This
model helps resolve b from the related b' subunit.
Within the family is an example from a
sodium-translocating rather than proton-translocating
ATP synthase [Energy metabolism, ATP-proton motive force
interconversion].
Length = 147
Score = 28.5 bits (64), Expect = 7.5
Identities = 26/99 (26%), Positives = 49/99 (49%), Gaps = 16/99 (16%)
Query: 378 KIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNAR 437
K +T+ K IA + L +AK KE A+ + +A+ L E K + II+ NA
Sbjct: 22 KAIETRQKKIA----DGLASAERAK--KEAALAQKKAQVI--LKEAKDEAQEIIE--NAN 71
Query: 438 KQRRQDLAKRRTAAAQERMRLIS------ELARKEKRDD 470
K+ + L + + A +ER ++ + E +++ R++
Sbjct: 72 KRGSEILEEAKAEAREEREKIKAQARAEIEAEKEQAREE 110
>gnl|CDD|219911 pfam08572, PRP3, pre-mRNA processing factor 3 (PRP3). Pre-mRNA
processing factor 3 (PRP3) is a U4/U6-associated
splicing factor. The human PRP3 has been implicated in
autosomal retinitis pigmentosa.
Length = 222
Score = 28.9 bits (65), Expect = 7.6
Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 15/81 (18%)
Query: 442 QDLAKRRTAAAQERMR-LISELARKEKRDDDFGMRD--------------EDWDVYKVIN 486
Q + R A E ISE A+ + + E WD+ + +
Sbjct: 32 QQAEELRREAKLEEEEARISEEAQNAGLKSATDLAEKIPKKELPDDVPDIEWWDMPILDD 91
Query: 487 KDAGDTDSEEEQERLIELEEI 507
D D + E +++ E+ I
Sbjct: 92 PDYNDLEDESDEDIDEEMSSI 112
>gnl|CDD|189049 cd09879, PIN_AF0591, PIN domain of Archaeoglobus fulgidus AF0591
protein and other similar archaeal homologs. PIN (PilT
N terminus) domain of Archaeoglobus fulgidus AF0591
protein and other similar uncharacterized archaeal
homologs are included in this family. These PIN domains
are structural homologs of flap endonuclease-1
(FEN1)-like PIN domains, but lack the extensive
arch/clamp region and the H3TH (helix-3-turn-helix)
domain, an atypical helix-hairpin-helix-2-like region,
seen in FEN1-like PIN domains. PIN domains within this
subgroup contain four highly conserved acidic residues
(putative metal-binding, active site residues) which
cluster at the C-terminal end of the beta-sheet and form
a negatively charged pocket near the center of the
molecule.
Length = 117
Score = 27.9 bits (63), Expect = 7.9
Identities = 9/31 (29%), Positives = 13/31 (41%)
Query: 382 TKAKIIAYNNGEDLTEEPKAKLSKEIAVPES 412
T A + G D+ EE + E VP +
Sbjct: 5 TNALMYPVQFGVDIFEELERLGYPEFVVPSA 35
>gnl|CDD|153336 cd07652, F-BAR_Rgd1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Saccharomyces cerevisiae
Rho GTPase activating protein Rgd1 and similar proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization.
Saccharomyces cerevisiae Rgd1 is a GTPase activating
protein (GAP) with activity towards Rho3p and Rho4p,
which are involved in bud growth and cytokinesis,
respectively. At low pH, S. cerevisiae Rgd1 is required
for cell survival and the activation of the protein
kinase C pathway, which is important in cell integrity
and the maintenance of cell shape. It contains an
N-terminal F-BAR domain and a C-terminal Rho GAP domain.
The F-BAR domain of S. cerevisiae Rgd1 binds to
phosphoinositides and plays an important role in the
localization of the protein to the bud tip/neck during
the cell cycle. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 234
Score = 28.9 bits (65), Expect = 8.0
Identities = 41/177 (23%), Positives = 63/177 (35%), Gaps = 54/177 (30%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
E+ D AK L EM+ +EL L + VE +R++ K
Sbjct: 75 EKLADNGLRFAKALNEMS---------------DELSSLAKTVE-------KSRKSIKET 112
Query: 361 GI---NNIQDLNKSINQLQQKIEKTKAKIIAYNNGED-----LTEEPKAKL------SKE 406
G +QD + EK KA+ Y++ D T +P KL +K
Sbjct: 113 GKRAEKKVQDA-------EAAAEKAKAR---YDSLADDLERVKTGDPGKKLKFGLKGNKS 162
Query: 407 IAVPESEAEFKAWLIET--KKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISE 461
A E E K + K A+ R++ L++ R A ++ LI E
Sbjct: 163 AAQHEDELLRKVQAADQDYASK------VNAAQALRQELLSRHRPEAVKDLFDLILE 213
>gnl|CDD|221268 pfam11853, DUF3373, Protein of unknown function (DUF3373). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria. Proteins in this
family are typically between 472 to 574 amino acids in
length.
Length = 485
Score = 29.4 bits (66), Expect = 8.2
Identities = 9/43 (20%), Positives = 20/43 (46%)
Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEE 398
+ + DL + I L++++ + KA++ D TE+
Sbjct: 17 SLPAAAAAADIDLLQKIEALKKELAELKAQLKDLQKRVDKTEK 59
>gnl|CDD|151391 pfam10944, DUF2630, Protein of unknown function (DUF2630). This
bacterial family of proteins have no known function.
Length = 81
Score = 26.9 bits (60), Expect = 8.7
Identities = 16/58 (27%), Positives = 23/58 (39%), Gaps = 14/58 (24%)
Query: 453 QERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQ 510
Q+ + I+EL +E +R + G+ D EE RL LE L Q
Sbjct: 4 QDILARINELVAEEHE-----LRAK---------LQQGEIDESEEHARLARLEVALDQ 47
>gnl|CDD|221178 pfam11709, Mit_ribos_Mrp51, Mitochondrial ribosomal protein
subunit. This family is the mitochondrial ribosomal
small-subunit protein Mrp51. Its function is not
entirely clear, but deletion of the MRP51 gene
completely blocked mitochondrial gene expression.
Length = 302
Score = 28.9 bits (65), Expect = 8.8
Identities = 21/103 (20%), Positives = 36/103 (34%), Gaps = 13/103 (12%)
Query: 331 RHL--NELLELREIVELTP-SDHSHAREAFKSMGI------NNIQDLNKSINQLQQKIEK 381
RH+ N+L L + + H R F+ +G+ L +S +
Sbjct: 64 RHISVNDLDTLEHMTDFESAGGHQWTRLRFQELGLPVSLPPPESNPLFESASDKTDNSPP 123
Query: 382 TKAKIIAYNNGEDLT-EEPKAKLSKEIAVPESEAEFKAWLIET 423
+ + + +T E L K V EF+ WL+E
Sbjct: 124 LSSLLNSGPWLAGMTPGEFSEYLKK---VSPLRPEFRKWLLEK 163
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 29.2 bits (65), Expect = 9.2
Identities = 48/226 (21%), Positives = 85/226 (37%), Gaps = 28/226 (12%)
Query: 269 EHLRKWLDAEFYDSNVVKVQLPYAVPVPNLTTEQQKDRRKELAKKLVEM------NAKKR 322
E R+ ++ + K+QL +Q D R E KKL+EM +AK
Sbjct: 122 ERERQAMELFLLRDTLEKLQLRME------RLQQTLDARDEEIKKLLEMLQSKGLSAKAE 175
Query: 323 EERLVDDER---------HLNELLELREIVELTPSDHSH-----AREAFKSMGINN-IQD 367
EE R HL LLE +E + + H R+ K+ I +
Sbjct: 176 EEDSERLGRIREAEDQQSHLEVLLEQKEKEHMMLREEIHQKLQMERDDAKTEASQKLIDE 235
Query: 368 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF-KAWLIETKKK 426
+ I + ++ +EK + +I + D +E + +L KE+ S KA ++
Sbjct: 236 KDTKIKEFEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQE 295
Query: 427 RAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDF 472
+ + + + LA + + Q +L +L R E+
Sbjct: 296 LSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAIL 341
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 28.4 bits (64), Expect = 9.3
Identities = 39/206 (18%), Positives = 82/206 (39%), Gaps = 55/206 (26%)
Query: 301 EQQKDRRKELAKKLVEM-NAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 359
EQ+K R+K+ K+ + +A+K E V +E + + + P +++ F
Sbjct: 16 EQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKS-------KKKAAPIENAEGNIVFSK 68
Query: 360 MGINNIQDLNKS-------------INQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKE 406
+ + + K QL +K+E K K+ E+L E+ A++ ++
Sbjct: 69 VEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKL------EELDEDKAAEIEEK 122
Query: 407 I-----------------------AVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQD 443
A+ E + K E K+++ + KK R+++R++
Sbjct: 123 EKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREE 182
Query: 444 -LAKRRTAAAQERMRLISELARKEKR 468
L KR+ ++ + A+K+ R
Sbjct: 183 NLKKRKDDKKNKKKK----KAKKKGR 204
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 28.8 bits (65), Expect = 9.6
Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 13/76 (17%)
Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
E Q+ +E K+L+E +RE+ L + ER +LE + + EA
Sbjct: 235 EAQERSYQEHVKQLIEKMEAEREKLLAEQER----MLEHKLQEQEELLKEGFKTEA---- 286
Query: 361 GINNIQDLNKSINQLQ 376
+ L K I L+
Sbjct: 287 -----ESLQKEIQDLK 297
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 29.4 bits (66), Expect = 9.7
Identities = 29/156 (18%), Positives = 57/156 (36%), Gaps = 16/156 (10%)
Query: 299 TTEQQKDRRKELAKKLVEMNAKKR--EERLVDDERHLNELLELREIVELTPSDHSHAREA 356
Q D K + E +E+ +ELL + + + +
Sbjct: 2452 KILQDIDNEKNKENNNMNAEKIDDLIENVTSHNEKIKSELLIINDALRRVKEKKDEMNKL 2511
Query: 357 FKSMGINN----------IQDLNKSINQLQQKIEKTKAKIIAYNNGE--DLTEEPKAKLS 404
F S+ NN + + IN+L+ + K +++Y + E +L E K KL
Sbjct: 2512 FNSLTENNNNNNNSAKNIVDNSTYIINELESHVSKLNE-LLSYIDNEIKELENE-KLKLL 2569
Query: 405 KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQR 440
++ + ES E + ET++ +++R
Sbjct: 2570 EKAKIEESRKERERIESETQEDNTDEEQINRQQQER 2605
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 28.0 bits (63), Expect = 9.7
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 436 ARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
KQ ++ +K + +ER R + + +K K
Sbjct: 97 RNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.134 0.396
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,836,871
Number of extensions: 3596620
Number of successful extensions: 5299
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5022
Number of HSP's successfully gapped: 289
Length of query: 691
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 587
Effective length of database: 6,324,786
Effective search space: 3712649382
Effective search space used: 3712649382
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (27.5 bits)