RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14063
(531 letters)
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 135 bits (343), Expect = 1e-33
Identities = 99/378 (26%), Positives = 177/378 (46%), Gaps = 11/378 (2%)
Query: 7 KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLG 66
+ EE E + ++ +++ LE E+ LE E + + E E ++ L
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLE-ELENELEELEERLEELKEKIEALK 336
Query: 67 EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVF-SAK 125
E + + + +EL L +++ +E+E S L E+ + + + A+
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALL--EELEELFEALREELAELEAELAE 394
Query: 126 TRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESE 185
R E + K E+E + +++L DL+ L++++ L E +T EEL + +E +
Sbjct: 395 IRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQ 454
Query: 186 IAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLIDFVMQLK--SENRV 243
+ E R RL +L RE A+L +++++L +E+S + + R V + E+ +
Sbjct: 455 LEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGL 514
Query: 244 SGILGRLGDLGGIDQKYDIAVSTACGA-LNYIVTETVEAGEAVIAAVKRQNVGRVNVIPL 302
G+ G + +L + +KY+ A+ A G L +V E E + I +K GR +PL
Sbjct: 515 PGVYGPVAELIKVKEKYETALEAALGNRLQAVVVENEEVAKKAIEFLKENKAGRATFLPL 574
Query: 303 DKMQQYHSQCYDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQDLNQAKRIGYS 362
D+++ S D P + DLI D K A F +TLV DL QA+R+
Sbjct: 575 DRIKPLRSLKSD--AAPGFLGLASDLIDF-DPKYEPAVRFVLGDTLVVDDLEQARRLARK 631
Query: 363 -GGGYRMVTLEGAIIEPS 379
YR+VTL+G ++EPS
Sbjct: 632 LRIKYRIVTLDGDLVEPS 649
Score = 58.2 bits (141), Expect = 7e-09
Identities = 62/286 (21%), Positives = 124/286 (43%), Gaps = 9/286 (3%)
Query: 10 EERTEKLTRVQL----VETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNL 65
E E L R++ +E L+ LE + KA + EL+ ++ + + K L
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKEL 241
Query: 66 GEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAK 125
E E ++ ++E+EL E +++ E+E KSEL+ + ++ +E + + +
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEE--IEE 299
Query: 126 TRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESE 185
E + + LE + + ++L +L+ +E ++ L ER+TL EEL + +E
Sbjct: 300 LEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEA 359
Query: 186 IAESRARLADLTREEAKLLDQV-EKLAREVSEKRESMQTSRSNNRLIDFVMQLKSENRVS 244
E +L+ L E +L + + E+LA +E E R I+ + +
Sbjct: 360 KEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLS-E 418
Query: 245 GILGRLGDLGGIDQKYDIAVSTACGALNYIVTETVEAGEAVIAAVK 290
+ +L ++ + + + T LN + E E E + +K
Sbjct: 419 RLEDLKEELKELEAELE-ELQTELEELNEELEELEEQLEELRDRLK 463
Score = 53.9 bits (130), Expect = 2e-07
Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Query: 415 TLVAQDLNQAKRIGYS-GGGYRMVTLEGAIIEPSGTMSGGGSNPIRGLMGRKATVSTD-- 471
TLV DL QA+R+ YR+VTL+G ++EPSG+++GG N L ++ +
Sbjct: 616 TLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSITGGSRNKRSSLAQKRELKELEEE 675
Query: 472 -TSLVKDLEVKEKRLASLETELRILSQQKMEVETQL 506
L LE E+ L SL+ ELR L E+ QL
Sbjct: 676 LAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711
Score = 45.1 bits (107), Expect = 9e-05
Identities = 53/233 (22%), Positives = 112/233 (48%), Gaps = 10/233 (4%)
Query: 2 EVLYGKYDEERTEKLTRVQLVETDLKALEPELRKAVNFL-ELENCVQRKHNEIYQYERYV 60
E L + + E +++ ++ +L ALE EL + + L ELE ++ E+ + + +
Sbjct: 701 EDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERL 760
Query: 61 NMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWS 120
E E +++ +E+ L E I++ ++ ++ + EL+ +E++ +E+
Sbjct: 761 E-----ELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815
Query: 121 VFSAKTRFECDIAKSELEIFLSTQSKETK-KLADLEANLEKVQTTLTERKTLCEELTTRV 179
+ S + R E + E+E L + +E + KL +LE LE+++ L E K EEL
Sbjct: 816 LESLEQRRE--RLEQEIE-ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEK 872
Query: 180 PAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLID 232
+E E+ E +L E +L ++ +L E+ + RE ++ + ++
Sbjct: 873 EELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLE 925
Score = 42.8 bits (101), Expect = 4e-04
Identities = 43/226 (19%), Positives = 96/226 (42%), Gaps = 19/226 (8%)
Query: 8 YDEERTEKLTRVQLVETDLKALE---PELRKAVNFLELENCVQRKHNEIYQYERYVNMKN 64
+EE E ++ ++ L+ LE L +A+ L+ E E + +R +
Sbjct: 742 LEEELEELEEELEELQERLEELEEELESLEEALAKLKEEI-------EELEEKRQALQEE 794
Query: 65 LGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSA 124
L E E ++++ E+ L A ++ E + E++ +E++ +E +
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEE--LEEKL-------D 845
Query: 125 KTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMES 184
+ E + + ELE + + +LE L++++ E + EL + + ++
Sbjct: 846 ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKE 905
Query: 185 EIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRL 230
EI + R RL +L + +L ++ +L E+ E+ E + +
Sbjct: 906 EIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREI 951
Score = 37.8 bits (88), Expect = 0.018
Identities = 24/89 (26%), Positives = 47/89 (52%)
Query: 134 KSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARL 193
+++LE +L LE LE+++ L E + EEL + A+E E+ + ++RL
Sbjct: 680 EAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRL 739
Query: 194 ADLTREEAKLLDQVEKLAREVSEKRESMQ 222
+L E +L +++E+L + E E ++
Sbjct: 740 EELEEELEELEEELEELQERLEELEEELE 768
Score = 36.2 bits (84), Expect = 0.054
Identities = 28/156 (17%), Positives = 78/156 (50%), Gaps = 5/156 (3%)
Query: 67 EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
+++ + ++ E++L E++++ D +E + +L+ +E++ +++ +++ + +
Sbjct: 169 KYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAE----RYQ-ELKAELR 223
Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
E + ++L+ + ++L+ LE LE++Q L E + EEL + + + E+
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283
Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRESMQ 222
E + L +L E +L ++ L + E ++
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELE 319
Score = 34.7 bits (80), Expect = 0.16
Identities = 21/78 (26%), Positives = 40/78 (51%)
Query: 148 TKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQV 207
+L LE L+ ++ L + L EEL ++ +E ++ E + LA L E +L ++
Sbjct: 680 EAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRL 739
Query: 208 EKLAREVSEKRESMQTSR 225
E+L E+ E E ++ +
Sbjct: 740 EELEEELEELEEELEELQ 757
Score = 30.1 bits (68), Expect = 4.1
Identities = 50/242 (20%), Positives = 97/242 (40%), Gaps = 36/242 (14%)
Query: 5 YGKYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQ-YERYVNMK 63
+ + E+L ++ + + L + + LE +R EI + E ++
Sbjct: 784 LEEKRQALQEELEELEEELEEAERRLDALERELESLEQR--RERLEQEIEELEEEIEELE 841
Query: 64 N-LGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVF 122
L E E +++++E+EL E +++ E E + ELK +E++ +E +
Sbjct: 842 EKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELE---- 897
Query: 123 SAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTER--KTLCEELTTRVP 180
S + +I K + + KL LE L +++ L E TL EL +
Sbjct: 898 SELAELKEEIEKLRERL-----EELEAKLERLEVELPELEEELEEEYEDTLETELEREIE 952
Query: 181 AMESEIA---------------------ESRARLADLTREEAKLLDQVEKLAREVSEKRE 219
+E EI E +++ DL + KLL+ +E+L +E E+ +
Sbjct: 953 RLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFK 1012
Query: 220 SM 221
Sbjct: 1013 ET 1014
>gnl|CDD|214944 smart00968, SMC_hinge, SMC proteins Flexible Hinge Domain. This
entry represents the hinge region of the SMC (Structural
Maintenance of Chromosomes) family of proteins. The
hinge region is responsible for formation of the DNA
interacting dimer. It is also possible that the precise
structure of it is an essential determinant of the
specificity of the DNA-protein interaction.
Length = 120
Score = 111 bits (279), Expect = 3e-29
Identities = 39/121 (32%), Positives = 60/121 (49%), Gaps = 6/121 (4%)
Query: 244 SGILGRLGDLGGIDQKYDIAVSTACGA-LNYIVTETVEAGEAVIAAVKRQNVGRVNVIPL 302
G+LGR+ DL +D KY+ A+ A G L +V +T E + I +K+ +GR +PL
Sbjct: 1 PGVLGRVADLISVDPKYETALEAALGGRLQAVVVDTEETAKKAIEFLKKNRLGRATFLPL 60
Query: 303 DKMQQYHSQC----YDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQDLNQAKR 358
DK++ P V IDL++ D ++R A + TLV DL A+R
Sbjct: 61 DKIKPRSPAGSKLREALLPEPGFVGPAIDLVEY-DPELRPALEYLLGNTLVVDDLETARR 119
Query: 359 I 359
+
Sbjct: 120 L 120
>gnl|CDD|219044 pfam06470, SMC_hinge, SMC proteins Flexible Hinge Domain. This
family represents the hinge region of the SMC
(Structural Maintenance of Chromosomes) family of
proteins. The hinge region is responsible for formation
of the DNA interacting dimer. It is also possible that
the precise structure of it is an essential determinant
of the specificity of the DNA-protein interaction.
Length = 117
Score = 109 bits (275), Expect = 1e-28
Identities = 41/117 (35%), Positives = 61/117 (52%), Gaps = 2/117 (1%)
Query: 244 SGILGRLGDLGGIDQKYDIAVSTACG-ALNYIVTETVEAGEAVIAAVKRQNVGRVNVIPL 302
G+ GRL DL +D+ Y+ AV A G L IV + E +A I +K+ N+GR +PL
Sbjct: 2 KGVYGRLADLIEVDEGYEKAVEAALGGRLQAIVVDDEETAKAAIEFLKKNNLGRATFLPL 61
Query: 303 DKMQQYHSQCYDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQDLNQAKRI 359
D+++ P LIDL++ DEK+R A + TLV DL++A +
Sbjct: 62 DRIKP-RPLDDSPREIPGGATPLIDLVEYPDEKLRKALRYLLGNTLVVDDLDEALEL 117
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 113 bits (285), Expect = 3e-26
Identities = 90/370 (24%), Positives = 168/370 (45%), Gaps = 21/370 (5%)
Query: 22 VETDLKALEPELRKAV-NFLELENCVQRKHNEIYQYERYVNMKNLGEHE---TKVQQMEQ 77
V+ + LE E+ + E E ++ + + E ++ K L E E ++++ +
Sbjct: 292 VKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEID-KLLAEIEELEREIEEERK 350
Query: 78 ELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAKSEL 137
E + +E+E ++EL+ V+K+ E+ +R + K + E + K EL
Sbjct: 351 RRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKL--EKLKREINELKREL 408
Query: 138 EIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRA------ 191
+ + +++LADL A + ++ + E + E+ + E ++ + A
Sbjct: 409 DRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468
Query: 192 -RLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLIDFVMQLKSENRVSGILGRL 250
L DL E ++ ++ KL RE++E + S R V ++ + + G+ G +
Sbjct: 469 QELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKAS-IQGVHGTV 527
Query: 251 GDLGGIDQKYDIAVSTACGA-LNYIVTETVEAGEAVIAAVKRQNVGRVNVIPLDKMQQYH 309
LG + ++Y A+ A G LN +V E + I +KR+ GR +PL+KM+
Sbjct: 528 AQLGSVGERYATAIEVAAGNRLNNVVVEDDAVAKEAIELLKRRKAGRATFLPLNKMRDER 587
Query: 310 SQCYDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQDLNQAKRIGYSGGGYRMV 369
+ +DL++ D K AF + +TLV +D+ A+R+ G YRMV
Sbjct: 588 RDL-SILSEDGVIGFAVDLVEF-DPKYEPAFKYVFGDTLVVEDIEAARRL---MGKYRMV 642
Query: 370 TLEGAIIEPS 379
TLEG + E S
Sbjct: 643 TLEGELFEKS 652
Score = 50.8 bits (122), Expect = 2e-06
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 10/93 (10%)
Query: 415 TLVAQDLNQAKRIGYSGGGYRMVTLEGAIIEPSGTMSGGGSNPIRGLMGRKATVSTDTSL 474
TLV +D+ A+R+ G YRMVTLEG + E SG M+ GGS RG + S
Sbjct: 623 TLVVEDIEAARRLM---GKYRMVTLEGELFEKSGAMT-GGSRAPRGGILFSR------SE 672
Query: 475 VKDLEVKEKRLASLETELRILSQQKMEVETQLN 507
+L+ +RL L+ EL L + +E +L+
Sbjct: 673 PAELQRLRERLEGLKRELSSLQSELRRIENRLD 705
Score = 42.7 bits (101), Expect = 5e-04
Identities = 50/221 (22%), Positives = 100/221 (45%), Gaps = 33/221 (14%)
Query: 10 EERTEKLTRVQLVE----TDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNL 65
EE E + R+ L+ L+ L E KA + L E +YE Y +K
Sbjct: 180 EEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLK-------EKREYEGYELLKEK 232
Query: 66 GEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAK 125
E + + +E++L + E ++K T+E+ + L+ +E+ +E N
Sbjct: 233 EALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELN-------------- 278
Query: 126 TRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESE 185
+ + E E Q + +K+ +LEA + ++ ++ E++ E+ R+ +E+E
Sbjct: 279 KKIK---DLGEEE-----QLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAE 330
Query: 186 IAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRS 226
I + A + +L RE + + +KL E +E +E ++ R+
Sbjct: 331 IDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRA 371
Score = 39.7 bits (93), Expect = 0.004
Identities = 31/144 (21%), Positives = 63/144 (43%), Gaps = 13/144 (9%)
Query: 76 EQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAKS 135
Q L LE +K+ E+ S +SEL+ +E + E S S K +
Sbjct: 676 LQRLRERLEGLKR---ELSSLQSELRRIENRLDEL-----SQELSDASRKIG----EIEK 723
Query: 136 ELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLAD 195
E+E + K ++L +LE +L ++ + K+ +EL R+ +E ++ + L D
Sbjct: 724 EIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALND 783
Query: 196 LTREEA-KLLDQVEKLAREVSEKR 218
L + + +++ ++ E+
Sbjct: 784 LEARLSHSRIPEIQAELSKLEEEV 807
Score = 37.7 bits (88), Expect = 0.017
Identities = 54/258 (20%), Positives = 96/258 (37%), Gaps = 38/258 (14%)
Query: 18 RVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLGEHETKVQQMEQ 77
+Q + L+ L+ EL + L+ E + E + +GE E +++Q+EQ
Sbjct: 675 ELQRLRERLEGLKREL----SSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQ 730
Query: 78 ELTANLESIK-------KCTDEMESAKSELKTVEKK-----------GKESNVIGSKFRW 119
E E ++ E+E+ KSELK +E + + N + ++
Sbjct: 731 EEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790
Query: 120 SVFSAKTRFECDIAK--SELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLC----- 172
S + + S +E L ++ +L + LEK L E++
Sbjct: 791 SRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIK 850
Query: 173 ------EELTTRVPAMESEIAESRARLADLTREEAKL---LDQVEKLAREVSEKRESMQT 223
E L + +E E+ E A L DL L D++E RE+ K E ++
Sbjct: 851 SIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEA 910
Query: 224 SRSNNRLIDFVMQLKSEN 241
R ++ K E
Sbjct: 911 QIEKKRKRLSELKAKLEA 928
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 77.0 bits (190), Expect = 1e-14
Identities = 115/575 (20%), Positives = 231/575 (40%), Gaps = 64/575 (11%)
Query: 7 KYDEERTEKLTRVQLVETDLKALEP---ELRKAVNFLELE-NCVQR---KHNEIYQYERY 59
KY E R E +++ +L LE EL + + LE + +R E+ + E
Sbjct: 169 KYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELA 228
Query: 60 VNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKK----GKESNVIGS 115
+ + L E +++++++EL E +++ T E++ + +L+ + + +E +
Sbjct: 229 LLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQK 288
Query: 116 KFRWSVFSAKTRFECDI--AKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCE 173
+ +++ + +R E + L + +L +LE+ L+++ L E + E
Sbjct: 289 EL-YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE 347
Query: 174 ELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESM------------ 221
EL + ++E+E+ E A L +L +L +Q+E L +V++ +
Sbjct: 348 ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEA 407
Query: 222 ---QTSRSNNRLIDFVMQL---KSENRVSGILGRLGDLGGIDQKYDIAVSTACGALNYIV 275
+ RL + +L E + + L +L ++ + AL +
Sbjct: 408 RLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELR 467
Query: 276 TETVEAGEAVIAAVKRQNVGRVNVIPLDKMQQYHSQCYDKYRTPEN--------VPRLID 327
E EA +A+ AA + + + L+++Q+ + + + L +
Sbjct: 468 EELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSE 527
Query: 328 LIQVQDEKIRLAFYFATRETL---VAQDLNQAKRIGYS---GGGYRMVTLEGAIIEPSET 381
LI V DE A A L V ++LN AK+ R+ L I+ +E
Sbjct: 528 LISV-DEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEI 586
Query: 382 LVAQDLNQAKRIGYSGGGYRMVTLEGAI---IEP--SETLVAQDLNQAKRI-GYSGGGYR 435
G+ G +V + + + LV DL+ A + GYR
Sbjct: 587 QGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYR 646
Query: 436 MVTLEGAIIEPSGTMSGGGSNPIRGLMGRKATVSTDTSLVKDLEVK-----------EKR 484
+VTL+G ++ P G ++GG + ++ R+ + +++LE K K
Sbjct: 647 IVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKE 706
Query: 485 LASLETELRILSQQKMEVETQLNCTGNELKYKKQE 519
L LE EL L ++ E+ Q++ +L + E
Sbjct: 707 LEELEEELEQLRKELEELSRQISALRKDLARLEAE 741
Score = 40.8 bits (96), Expect = 0.002
Identities = 44/221 (19%), Positives = 97/221 (43%), Gaps = 8/221 (3%)
Query: 10 EERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNM--KNLGE 67
EE EK+ ++ +L+ ELRK ELE +++ E+ + R ++ K+L
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRKELEELSRQISALRKDLAR 737
Query: 68 HETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTR 127
E +V+Q+E+ + + + + E+E + L+ E++ E+ + + + +
Sbjct: 738 LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI--EQLK 795
Query: 128 FECDIAKSELEIFLSTQSKETK-KLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
E + L+ L + + A+L LE ++ + + E+L ++ + +I
Sbjct: 796 EELKALREALDE-LRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854
Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSN 227
A + +L +L ++E L E + E++ RS
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 39.8 bits (94), Expect = 0.003
Identities = 40/166 (24%), Positives = 70/166 (42%), Gaps = 29/166 (17%)
Query: 59 YVNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFR 118
Y K + E K+++ E+E LE KK E E+ K E +E K +
Sbjct: 22 YFVRKKIAE--AKIKEAEEEAKRILEEAKK---EAEAIKKE-ALLEAKEE---------- 65
Query: 119 WSVFSAKTRFECDIAKSELEIFL---------STQSKETKKLADLEANLEKVQTTLTERK 169
+ + FE ++ + E+ ++ + L E LEK + L +++
Sbjct: 66 --IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123
Query: 170 TLCEELTTRVPAMESEIAESRARLADLTREEAK--LLDQVEKLARE 213
E+ + + E + R++ LT EEAK LL++VE+ AR
Sbjct: 124 QELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARH 169
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 38.9 bits (91), Expect = 0.007
Identities = 42/219 (19%), Positives = 92/219 (42%), Gaps = 28/219 (12%)
Query: 8 YDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLGE 67
Y E +R++ ++ L L + LEL+ + + I E+ ++ +L E
Sbjct: 246 YHLEHVNIDSRLERLKEQLVENSELLTQ----LELDE-AEEELGLIQ--EKIESLYDLLE 298
Query: 68 HETKVQQMEQELTANL-ESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
E + + + +E L + ++K + E K E++ V++ + + R K
Sbjct: 299 REVEAKNVVEENLPILPDYLEKAKENNEHLKEEIERVKESYRLAETELGSVR------KF 352
Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
E +S L+ L + ++L+ NLE+++ LT+ +E E
Sbjct: 353 EKELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALTD--------------IEDEQ 398
Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSR 225
+ + L L ++E + + +E+L ++ E + M+ S
Sbjct: 399 EKVQEHLTSLRKDELEARENLERLKSKLHEIKRYMEKSN 437
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 37.7 bits (88), Expect = 0.015
Identities = 35/187 (18%), Positives = 65/187 (34%), Gaps = 19/187 (10%)
Query: 65 LGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVF-- 122
L E Q +E + E + E + E +E + +E+ R +
Sbjct: 330 LEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEEL 389
Query: 123 -----SAKTRFE-----CDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLC 172
+ RF A+ LE + + ++ A+LEA L + + E + L
Sbjct: 390 EEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALL 449
Query: 173 EE---LTTRVPAMESEIA----ESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSR 225
E P S E R R+ +L E L ++VE++ + + ++
Sbjct: 450 EAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAED 509
Query: 226 SNNRLID 232
RL +
Sbjct: 510 RIERLEE 516
Score = 33.5 bits (77), Expect = 0.35
Identities = 26/99 (26%), Positives = 40/99 (40%), Gaps = 4/99 (4%)
Query: 129 ECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAE 188
E ELE L +E ++E LE+ L E + E L R +E IAE
Sbjct: 472 EDRERVEELEAELEDLEEEV---EEVEERLERA-EDLVEAEDRIERLEERREDLEELIAE 527
Query: 189 SRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSN 227
R + + +L ++ +L E EKRE+ +
Sbjct: 528 RRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEE 566
Score = 30.0 bits (68), Expect = 4.3
Identities = 31/164 (18%), Positives = 63/164 (38%), Gaps = 12/164 (7%)
Query: 65 LGEHETKVQQ---MEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSV 121
L EHE + ++ +E E+ E+I + E E E++ + ++ +E + +
Sbjct: 243 LEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEE---LEEE----- 294
Query: 122 FSAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPA 181
E + ++ E + + + + +L LE+ + E L
Sbjct: 295 -RDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADD 353
Query: 182 MESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSR 225
+E E R A+L E + + VE E+ E E ++ R
Sbjct: 354 LEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELR 397
Score = 28.9 bits (65), Expect = 8.1
Identities = 25/109 (22%), Positives = 45/109 (41%), Gaps = 14/109 (12%)
Query: 133 AKSELEIFLSTQSKETKKLADLEANLEKVQTTLT----ERKTLCEELTTRVPAMESEIAE 188
+ E + L + ++L LEA +E ++ T+ ER+ L EE+ +E E
Sbjct: 235 TRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEE 294
Query: 189 SRARLADLTREEA----------KLLDQVEKLAREVSEKRESMQTSRSN 227
LA+ ++A +L D+ E+L + E R + Q
Sbjct: 295 RDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEE 343
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 37.4 bits (87), Expect = 0.022
Identities = 48/240 (20%), Positives = 92/240 (38%), Gaps = 34/240 (14%)
Query: 2 EVLYGKYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVN 61
E L + E RTE+L + Q+VE +L +L A+ Q K
Sbjct: 706 EALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAAR----TQAKA----------- 750
Query: 62 MKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSV 121
L E + + + L + ++K+ ++E ++ ++ + + E +
Sbjct: 751 --RLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVR------EYRA 802
Query: 122 FSAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPA 181
F +T D + E L +LE++ E++Q LT L ++ R
Sbjct: 803 FMQETWLHRDSLREERPNLAIQ-------LRELESSAEELQQELT---RLIKDTKLRRKK 852
Query: 182 MESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLIDFVMQLKSEN 241
+E E +L L L D++ +LA E+ E + Q S + +D + + K +
Sbjct: 853 LEQERKALEKQLDQLDELLRGLRDEMRQLA-ELKEPANANQAEGSISERLDQLEEFKRKR 911
Score = 30.4 bits (69), Expect = 3.0
Identities = 28/178 (15%), Positives = 71/178 (39%), Gaps = 26/178 (14%)
Query: 68 HETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSK----FRWSVFS 123
ET+++Q++ +L LE + + ++ EL E+ K V G + S
Sbjct: 683 AETQLRQLDAQLKQLLEQQQAFLEALKDDFREL-RTERLAKWQVVEGELDNQLAQLSAAI 741
Query: 124 AKTRFECDIAKSELE-----------IFLSTQSKETKKLADLEANLEKV----------Q 162
R + EL+ + +T + +++ +LE +E++ +
Sbjct: 742 EAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYR 801
Query: 163 TTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRES 220
+ E + L P + ++ E + +L +E +L+ + +++ ++R++
Sbjct: 802 AFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKA 859
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 36.9 bits (85), Expect = 0.029
Identities = 43/230 (18%), Positives = 80/230 (34%), Gaps = 29/230 (12%)
Query: 10 EERTEKLTRVQ-------------------LVETDLKALEPELRKAVNFLELENCVQRKH 50
+ER +KL L E KALE K LE EN + +
Sbjct: 173 KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDY 232
Query: 51 NEIYQYERYVNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKES 110
++ + + + L + + +++ +QEL E + + E + + E K E++ K
Sbjct: 233 LKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLL 292
Query: 111 NVIGSKFRWSVFSAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKT 170
+ + + E + + K+L LE L+K + + E +
Sbjct: 293 ---------AKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEK 343
Query: 171 LCEELTTRVPAMESEIAESRAR-LADLTREEAKLLDQVEKLAREVSEKRE 219
+EL + A E E + EE L + + R S +
Sbjct: 344 ELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKL 393
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 36.1 bits (84), Expect = 0.044
Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 8/78 (10%)
Query: 146 KETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARL---ADLTREEAK 202
++ + L E NLEK + L+ ++ + L + +E IAE R L + LT+EEAK
Sbjct: 94 RKMESLDKKEENLEKKEKELSNKE---KNLDEKEEELEELIAEQREELERISGLTQEEAK 150
Query: 203 --LLDQVEKLAREVSEKR 218
LL++VE+ AR + K
Sbjct: 151 EILLEEVEEEARHEAAKL 168
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 35.9 bits (83), Expect = 0.057
Identities = 36/216 (16%), Positives = 83/216 (38%), Gaps = 8/216 (3%)
Query: 7 KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLG 66
EE EKL +++ + +L+ L+ +L+ EL R E ++
Sbjct: 526 ALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRK 585
Query: 67 EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
E ++++ +EL L+ +++ ++E L+ E + + + +
Sbjct: 586 EELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAE-EELESELEKLNL 644
Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
+ E + L + +E + E + + L E+ E+L E E+
Sbjct: 645 QAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQL-------EEEL 697
Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRESMQ 222
+ R L +L ++ ++ +E+L +E E +
Sbjct: 698 EQLREELEELLKKLGEIEQLIEELESRKAELEELKK 733
Score = 31.7 bits (72), Expect = 1.3
Identities = 39/235 (16%), Positives = 91/235 (38%), Gaps = 26/235 (11%)
Query: 7 KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLG 66
E E L E E+++ ELE + + ER ++
Sbjct: 195 SELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQ----EIEALE--ERLAELEEEK 248
Query: 67 EHETKVQ-QMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAK 125
E +++ ++ + + LE++K +E+ + L+ +E+K +
Sbjct: 249 ERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLE-------------- 294
Query: 126 TRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESE 185
E + ELE L ++L +L L+ ++ L + + E+L + + + E
Sbjct: 295 ---ELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351
Query: 186 IAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLIDFVMQLKSE 240
E L + +E + L+++EK + E+ + ++ + L + + +L +
Sbjct: 352 KNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQ--ELKEELAELSAA 404
>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein.
Length = 326
Score = 35.3 bits (81), Expect = 0.071
Identities = 16/48 (33%), Positives = 29/48 (60%)
Query: 150 KLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLT 197
+L+DL A+L+ + T+L + K L T V A+++ + + LADL+
Sbjct: 57 RLSDLSADLQDMTTSLDDAKATLNGLITTVQALQTSVDSLSSELADLS 104
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 35.3 bits (82), Expect = 0.10
Identities = 30/165 (18%), Positives = 58/165 (35%), Gaps = 34/165 (20%)
Query: 77 QELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAK-- 134
+E E +K+ + + ++E+ + ++GS+ E DI
Sbjct: 718 EEAEKEFELLKEIISAIRNLRAEMNLSPSAPLKVVLVGSEELEDRL---EANEDDIKGLA 774
Query: 135 -----------------------SELEIFLSTQSK--ETKKLADLEANLEKVQTTL--TE 167
E+FL +LA LE LEK++ + E
Sbjct: 775 NLEELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIE 834
Query: 168 RKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAR 212
+K E + P E + + + +LA+ + AKL +++ L
Sbjct: 835 KKLSNEGFVAKAP--EEVVEKEKEKLAEYQVKLAKLEERLAVLKA 877
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 34.6 bits (79), Expect = 0.15
Identities = 31/191 (16%), Positives = 67/191 (35%), Gaps = 11/191 (5%)
Query: 38 NFLELENCVQRKHNEIYQYERYVNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAK 97
N+ E ++ + ++ NL + + QE + IK +
Sbjct: 250 NYEPSEQELKLGFEKFVH-IINTDIANLKTQNDNLYEKIQEAMKISQKIK----TLREKW 304
Query: 98 SELKTVEKK--GKESNVIGSKFRWSVFSAKTRFECDIAKSELEIFLSTQSKETKKLADLE 155
LK+ K + + W K + E ++ + E++ S + K+L
Sbjct: 305 RALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQG 364
Query: 156 ANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVS 215
+ E+ + ER E+LT + + + + + E + +EK R+
Sbjct: 365 ISTEQFELMNQER----EKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYD 420
Query: 216 EKRESMQTSRS 226
+++ SRS
Sbjct: 421 SLIQNITRSRS 431
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set
of eukaryotic tropomyosins. Within the yeast Tmp1 and
Tmp2, biochemical and sequence analyses indicate that
Tpm2p spans four actin monomers along a filament,
whereas Tpmlp spans five. Despite its shorter length,
Tpm2p can compete with Tpm1p for binding to F-actin.
Over-expression of Tpm2p in vivo alters the axial
budding of haploids to a bipolar pattern, and this can
be partially suppressed by co-over-expression of Tpm1p.
This suggests distinct functions for the two
tropomyosins, and indicates that the ratio between them
is important for correct morphogenesis. The family also
contains higher eukaryote Tmp3 members.
Length = 143
Score = 32.6 bits (75), Expect = 0.18
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 14/88 (15%)
Query: 148 TKKLADLEANLEKVQTTLTERKTLCEE----------LTTRVPAMESEIAESRARLADLT 197
KK LE +EK++ L E K EE LT R+ +E E+ ES RL + T
Sbjct: 41 QKKNQQLEEEVEKLEEQLKEAKEKLEESEKLATNAEALTRRIQLLEEELEESEKRLKETT 100
Query: 198 REEAKLLDQVEKLAREVSEKRESMQTSR 225
+ L + +K A E K ++++ R
Sbjct: 101 EK----LREADKKAEESERKVKALENER 124
>gnl|CDD|221303 pfam11902, DUF3422, Protein of unknown function (DUF3422). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria, archaea and
eukaryotes. Proteins in this family are typically
between 426 to 444 amino acids in length.
Length = 419
Score = 34.1 bits (79), Expect = 0.18
Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 5/56 (8%)
Query: 174 ELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNR 229
EL+ + +E +A +AD ++ +LL ++ +LA EV E++ + R
Sbjct: 212 ELSPELAELERRLAALTQAMADSEADDEELLSELTELAAEV----EALAA-ATTFR 262
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 33.7 bits (77), Expect = 0.29
Identities = 41/205 (20%), Positives = 85/205 (41%), Gaps = 9/205 (4%)
Query: 18 RVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLGEHETKVQQMEQ 77
R Q +T+LK LE EL + LE + + Q + ++ K++ +E+
Sbjct: 179 RAQNADTELKLLESELEELREQLEECQKELAEAEKKLQ-SLTSEQASSADNSVKIKHLEE 237
Query: 78 ELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAKSEL 137
EL + + ++S K +L + + +E + + R + +L
Sbjct: 238 ELKRYEQDA----EVVKSMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDL 293
Query: 138 EIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLT 197
+ L K +KLADLE EK++ L K+L +++ + + + R+ L
Sbjct: 294 QSRLERFEKMREKLADLELEKEKLENELKSWKSLLQDIGLNLRTPD----DLSRRIVVLQ 349
Query: 198 REEAKLLDQVEKLAREVSEKRESMQ 222
EE +L ++ ++ + ++Q
Sbjct: 350 NEELQLKEKNGSISSSAKQLETTLQ 374
Score = 29.9 bits (67), Expect = 4.4
Identities = 49/229 (21%), Positives = 88/229 (38%), Gaps = 20/229 (8%)
Query: 1 MEVLYGKYDEERTEKL-TRVQL--VETDLKALEPELRKAVNFLELENCVQRKHNEI---Y 54
+V +R E + ++ +L +E +L E E ++A LE + ++ E
Sbjct: 49 KQVEESMTLLQRAELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDR 108
Query: 55 QYERYVNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIG 114
E V +K L E E K + E + +K D ES K + + ++ + I
Sbjct: 109 NLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDA-ESLKLQNEKEDQLKEAKESIS 167
Query: 115 ------SKFRWSVFSAKT-----RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQT 163
S+ + +A T E + + +LE ++ KKL L +
Sbjct: 168 RIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSAD 227
Query: 164 TLTERKTLCEELTTRVP--AMESEIAESRARLADLTREEAKLLDQVEKL 210
+ K L EEL + + E ++ +L RE A L ++ KL
Sbjct: 228 NSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALREENRKL 276
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 33.5 bits (77), Expect = 0.30
Identities = 32/149 (21%), Positives = 63/149 (42%), Gaps = 14/149 (9%)
Query: 9 DEERTEKLTRVQLVETDLKALEPELRKAVNFLELE-NCVQRKHNEIYQYERYVNM-KNLG 66
E T+ +Q V+ +L+ + +L K + LE E V +K + I E+ + + LG
Sbjct: 372 GSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELG 431
Query: 67 EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
E + ++ + E E+ L ++ + +E+ + L K+ + + +
Sbjct: 432 EAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKINAFELERAITIA------ 485
Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLE 155
D AK L+ F + +KL LE
Sbjct: 486 ----DKAKKTLKEFREKLLE--RKLQQLE 508
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 32.5 bits (74), Expect = 0.37
Identities = 25/92 (27%), Positives = 45/92 (48%), Gaps = 1/92 (1%)
Query: 151 LADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKL 210
+A++E L K++ LTE EL R+ + E+ + R + L + A+L ++ L
Sbjct: 69 IAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAELRSEITSL 128
Query: 211 AREVSEKRESMQT-SRSNNRLIDFVMQLKSEN 241
E+ + RE +Q + N L D ++ L E
Sbjct: 129 ETEIRDLREELQEKEKDNETLQDELISLNIEL 160
>gnl|CDD|181676 PRK09165, PRK09165, replicative DNA helicase; Provisional.
Length = 497
Score = 33.3 bits (77), Expect = 0.37
Identities = 11/44 (25%), Positives = 27/44 (61%), Gaps = 3/44 (6%)
Query: 173 EELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSE 216
E+L TR+ + +SEI+ S+ R ++ E+ +++ ++E+ +
Sbjct: 273 EQLATRILSEQSEISSSKIRRGKISEED---FEKLVDASQELQK 313
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 33.1 bits (76), Expect = 0.40
Identities = 27/96 (28%), Positives = 48/96 (50%), Gaps = 12/96 (12%)
Query: 125 KTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTER-KTLCEELTTRVPAME 183
+TR E K+ELE QS++ L + +A +K++ ER KTL +E
Sbjct: 177 QTREELAAQKAELE---EKQSQQKTLLYEQQAQQQKLEQARNERKKTLT--------GLE 225
Query: 184 SEIAESRARLADLTREEAKLLDQVEKLAREVSEKRE 219
S + + + +L++L E++L D + + RE + E
Sbjct: 226 SSLQKDQQQLSELRANESRLRDSIARAEREAKARAE 261
>gnl|CDD|236042 PRK07525, PRK07525, sulfoacetaldehyde acetyltransferase; Validated.
Length = 588
Score = 33.0 bits (76), Expect = 0.41
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 263 AVSTACGALNYIVTETVEAGEAVIAAVKRQNVGRVNVI 300
++ A GA +V E G A+ A+ QN G+ VI
Sbjct: 520 GIAEAMGAEGVVVDTQEELGPALKRAIDAQNEGKTTVI 557
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.4 bits (74), Expect = 0.58
Identities = 24/141 (17%), Positives = 46/141 (32%), Gaps = 26/141 (18%)
Query: 73 QQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDI 132
+ L A L+ + E+ + ++EL T+ + +
Sbjct: 160 AERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQ---------------------- 197
Query: 133 AKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRAR 192
+++L + K LA L + L Q L E + L + + E+ A++R
Sbjct: 198 -QAKLAQ---LLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREA 253
Query: 193 LADLTREEAKLLDQVEKLARE 213
A A+ K E
Sbjct: 254 AAAAEAAAARARAAEAKRTGE 274
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 30.7 bits (70), Expect = 0.67
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 4/72 (5%)
Query: 141 LSTQSKETKK-LADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTRE 199
L Q E KK +A L+A E Q L+E + EE +E E++E R+ +L +
Sbjct: 64 LRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKM---LEDELSELEKRIEELNEQ 120
Query: 200 EAKLLDQVEKLA 211
L DQ+E L+
Sbjct: 121 NKLLHDQIELLS 132
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 31.5 bits (72), Expect = 0.77
Identities = 19/85 (22%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 158 LEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEK 217
L ++Q L E + EL ++ ++ E E + L+ L E +L ++ ++ +++S
Sbjct: 68 LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARI-KQLSA- 125
Query: 218 RESMQTSRSNNRLIDFVMQLKSENR 242
+++ N L + + +LK EN
Sbjct: 126 -NAIELDEENRELREELAELKQENE 149
Score = 28.8 bits (65), Expect = 5.0
Identities = 16/72 (22%), Positives = 36/72 (50%), Gaps = 4/72 (5%)
Query: 175 LTTRVPAMESEIAESRARLADLTREEAKL---LDQVEKLAREVSEKRESMQTSRSNNRLI 231
R+P ++ E+AE + LA+L + A+L ++++ + + E +Q + + +
Sbjct: 64 ARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQL 123
Query: 232 D-FVMQLKSENR 242
++L ENR
Sbjct: 124 SANAIELDEENR 135
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 31.4 bits (72), Expect = 0.79
Identities = 20/83 (24%), Positives = 45/83 (54%), Gaps = 8/83 (9%)
Query: 167 ERKTLCEELTTRVPAMESEIAESRARLADL------TREEAKLLDQVEKLAREVSEKRES 220
+ KT E+L + ++ IAE +A++ L T E +LL+++++L +E+ + +
Sbjct: 66 KLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAE 125
Query: 221 MQTSRSNNRLIDFVMQLKSENRV 243
++ N+ + + +LK E +V
Sbjct: 126 LEKYEKND--PERIEKLKEETKV 146
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 32.0 bits (73), Expect = 0.82
Identities = 19/99 (19%), Positives = 35/99 (35%), Gaps = 14/99 (14%)
Query: 158 LEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLT---REEAKLLDQVEKLAREV 214
L +++ L + K L + V + + + L L L ++V+KL E+
Sbjct: 303 LNEIEERLAQIKRLKRKYGASVEEVLEYAEKIKEELDQLDDSDESLEALEEEVDKLEEEL 362
Query: 215 SEKRESMQTSR--SNNRLIDFVMQLKSENRVSGILGRLG 251
+ ++ R + RL RV L L
Sbjct: 363 DKAAVALSLIRRKAAERL---------AKRVEQELKALA 392
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 31.6 bits (72), Expect = 1.1
Identities = 53/239 (22%), Positives = 97/239 (40%), Gaps = 32/239 (13%)
Query: 7 KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKH---NEIYQYERYVNMK 63
+Y E ++ +E + L ELR+ L+ E+ + + ++ + E +
Sbjct: 456 EYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKY 515
Query: 64 NLGEHETKVQQMEQ------ELTANLESIKKCTDEMESAKSELKTVEKKGKE-----SNV 112
NL E E K ++ E+ +L ++S+KK +++E K +L +EKK E + +
Sbjct: 516 NLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAEL 575
Query: 113 IGSKFRWSVFSAKTRFECDI---------------AKSELEIFLSTQSKETKKLADLEAN 157
+ + F + E + A+ ELE K ++L
Sbjct: 576 LK-ELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEE 634
Query: 158 LEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSE 216
L + + L E + EEL E E E R +L+RE A L ++E+L + E
Sbjct: 635 LAETEKRLEELRKELEEL--EKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREE 691
>gnl|CDD|223434 COG0357, GidB, Predicted S-adenosylmethionine-dependent
methyltransferase involved in bacterial cell division
[Cell envelope biogenesis, outer membrane].
Length = 215
Score = 31.1 bits (71), Expect = 1.2
Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 3/54 (5%)
Query: 226 SNNRLIDFVMQLKSE---NRVSGILGRLGDLGGIDQKYDIAVSTACGALNYIVT 276
S + I F+ ++K E V + GR + G ++YD+ S A +LN ++
Sbjct: 99 SLGKKIAFLREVKKELGLENVEIVHGRAEEFGQEKKQYDVVTSRAVASLNVLLE 152
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 29.8 bits (68), Expect = 1.4
Identities = 16/53 (30%), Positives = 26/53 (49%)
Query: 170 TLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQ 222
L E L++ + +E E+A + LA L E + ++ KL E E RE +
Sbjct: 17 QLVERLSSTLRRLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKK 69
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 31.2 bits (71), Expect = 1.5
Identities = 19/79 (24%), Positives = 28/79 (35%), Gaps = 2/79 (2%)
Query: 133 AKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAE-SRA 191
LE F + KE LE EKV + EE+ VP +I E
Sbjct: 448 ILKSLEEFKQSIVKEAALSI-LEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVDTY 506
Query: 192 RLADLTREEAKLLDQVEKL 210
L + + +++E L
Sbjct: 507 LLLEELGINEETYEKLEAL 525
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 30.2 bits (69), Expect = 1.7
Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 145 SKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAK-- 202
++ + L E +LE+ + L R+ EE + + E + R++ LT EEAK
Sbjct: 95 DRKDESLEKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQELERISGLTAEEAKEI 154
Query: 203 LLDQVEKLARE 213
LL++VE +
Sbjct: 155 LLEEVEAELKH 165
>gnl|CDD|183048 PRK11239, PRK11239, hypothetical protein; Provisional.
Length = 215
Score = 30.5 bits (69), Expect = 1.7
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 173 EELTTRVPAMESEIAESRARLADL 196
+L RV A+E E+AE + RL L
Sbjct: 186 GDLQARVEALEIEVAELKQRLDSL 209
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 30.7 bits (69), Expect = 2.1
Identities = 44/244 (18%), Positives = 96/244 (39%), Gaps = 36/244 (14%)
Query: 9 DEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLGEH 68
+ + ++ +R++ +ET L+ K E Q + ER + E+
Sbjct: 163 EGDVSDMRSRIEQLETALRHST---EKTEELEEQHKEAQSSSESMSA-ERNALLAQRAEN 218
Query: 69 ETKVQQMEQELTA----------NLESIKKCTDEMESAKSELK------TVEKKGKESNV 112
+ ++ ++EQ++ LE + E+E K ELK T++++ +E+
Sbjct: 219 QQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETER 278
Query: 113 IGSKFRWSVFSAKTRFECDIAKSELEIFLSTQSKET---KKLADLEANLEKVQTTLTERK 169
I + R + E S+Q K ++LA L + + L + +
Sbjct: 279 IDLQLENEQLHEDLR-------TLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSR 331
Query: 170 TLCEELTTRVPAMESEIAE-----SRARLADLTREEAKLLDQVEKLAREVSEKRESMQTS 224
+++ ++ + + E ++ R EA D+++KL+ E+ + E +Q
Sbjct: 332 LESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADK-DRIQKLSAELLKLEEWLQEE 390
Query: 225 RSNN 228
RS
Sbjct: 391 RSQR 394
>gnl|CDD|114591 pfam05873, Mt_ATP-synt_D, ATP synthase D chain, mitochondrial
(ATP5H). This family consists of several ATP synthase D
chain, mitochondrial (ATP5H) proteins. Subunit d has no
extensive hydrophobic sequences, and is not apparently
related to any subunit described in the simpler ATP
synthases in bacteria and chloroplasts.
Length = 160
Score = 29.8 bits (67), Expect = 2.2
Identities = 15/68 (22%), Positives = 28/68 (41%), Gaps = 12/68 (17%)
Query: 33 LRKAVNFLELENCVQRKHNEI---YQYERY---------VNMKNLGEHETKVQQMEQELT 80
+K V L + ++K+ + Y ++Y ++K + E++TK QE
Sbjct: 56 YKKNVPKAGLVDSFEKKYEALKVPYPEDKYTAQVDAEEKEDVKRIAEYKTKSADRIQEYE 115
Query: 81 ANLESIKK 88
LE K
Sbjct: 116 KELEKWKN 123
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 30.8 bits (69), Expect = 2.4
Identities = 41/222 (18%), Positives = 94/222 (42%), Gaps = 4/222 (1%)
Query: 7 KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENC-VQRKHNEIYQYERYV-NMKN 64
K E+ T +++T++ AL EL + N L + +Q E Y + +M++
Sbjct: 326 KLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRD 385
Query: 65 LGEH-ETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFS 123
E E K++ +++++ E+ ++ ++ K L++++ + +
Sbjct: 386 RYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEK 445
Query: 124 AKTRFECDIAKSELEIF-LSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAM 182
+ + E + K+ DL+ ++ +Q L+ER+ E L V +
Sbjct: 446 ERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKL 505
Query: 183 ESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTS 224
S + R+ L E K+ ++ EKL +E+ R + +++
Sbjct: 506 ASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPESA 547
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 30.6 bits (69), Expect = 2.5
Identities = 18/79 (22%), Positives = 40/79 (50%), Gaps = 3/79 (3%)
Query: 142 STQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEA 201
++ ++ + L+ E LE + +LT++ +E +V +E + R+A L++ EA
Sbjct: 111 TSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKAELERVAALSQAEA 170
Query: 202 K---LLDQVEKLAREVSEK 217
+ L + KL E++ +
Sbjct: 171 REIILAETENKLTHEIATR 189
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 30.3 bits (68), Expect = 2.6
Identities = 20/71 (28%), Positives = 39/71 (54%)
Query: 149 KKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVE 208
+ DL+ + E+++ L E + EEL + +E+E E + RL L E ++L + ++
Sbjct: 128 PEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLK 187
Query: 209 KLAREVSEKRE 219
KL EV + ++
Sbjct: 188 KLPGEVYDLKK 198
Score = 29.1 bits (65), Expect = 5.6
Identities = 42/217 (19%), Positives = 89/217 (41%), Gaps = 22/217 (10%)
Query: 4 LYGKYDEERTEKLTRVQLVET----DLKALEPELRKAVNFLELENCVQRKHNEIYQYER- 58
L K+ EE + ++ ++ D++ ++ EL + EL+N + R + Q
Sbjct: 74 LAEKFFEELRGMVGHIERMKIPIGHDVEHIDVELVRK----ELKNALVRAGLKTLQRVPE 129
Query: 59 YVNMKNLGEH-ETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKF 117
Y+++K E + K++++++E L+ +++ E E + LK +E + + K
Sbjct: 130 YMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKL 189
Query: 118 RWSVFSAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTT 177
V+ K R++ E+ + E + ++DL + E +
Sbjct: 190 PGEVYDLKKRWD--------ELEPGVELPEEELISDLVKETLNLAPKDIEG----QGYIY 237
Query: 178 RVPAMESEIAESRARLADLTREEAKLLDQVEKLAREV 214
E EI +A +RE+A ++ K A E
Sbjct: 238 AEDEKEVEILLGTVYIAAPSREDAVEELEIIKEAIEE 274
>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80. Spc24 is a component of
the evolutionarily conserved kinetochore-associated
Ndc80 complex and is involved in chromosome segregation.
Length = 117
Score = 28.8 bits (65), Expect = 2.7
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 149 KKLADLEANLEKVQTTLTERKTLCEEL 175
K+L DLE+ L +Q+ L E K EEL
Sbjct: 13 KELNDLESELNNLQSELEELKERLEEL 39
>gnl|CDD|218079 pfam04420, CHD5, CHD5-like protein. Members of this family are
probably coiled-coil proteins that are similar to the
CHD5 (Congenital heart disease 5) protein. In
Saccharomyces cerevisiae this protein localises to the
ER and is thought to play a homeostatic role.
Length = 161
Score = 29.6 bits (67), Expect = 2.7
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 191 ARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLI 231
A+ A L R+ KL D++EKL +E++ ++ + RL+
Sbjct: 66 AKWAKLNRKLDKLTDELEKLKKELTAQKSKFKWVLKKLRLL 106
>gnl|CDD|237625 PRK14147, PRK14147, heat shock protein GrpE; Provisional.
Length = 172
Score = 29.5 bits (66), Expect = 2.8
Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 165 LTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRE 219
L + + L V ++ SEIA +A D RE A L +Q +++AR+V + R+
Sbjct: 13 LAQNPPETDPLKAEVESLRSEIALVKA---DALRERADLENQRKRIARDVEQARK 64
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 30.6 bits (69), Expect = 2.9
Identities = 28/140 (20%), Positives = 55/140 (39%), Gaps = 34/140 (24%)
Query: 71 KVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFEC 130
++ ++E+ ANL+ +KK + + + E+ + + K R + K F
Sbjct: 1115 RITELEKAKLANLDVVKKAISNLNNLQQEVTLLRNE---------KIRMHTGTDKVDFS- 1164
Query: 131 DIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESR 190
DI K LE L+ + T L + E+T ++ A+E E ++
Sbjct: 1165 DIEK-------------------LEKQLQVIDTKLAD--AYLLEVTKQISALEKEKPKN- 1202
Query: 191 ARLADLTREEAKLLDQVEKL 210
++L + AK D +
Sbjct: 1203 --QSELKSKIAKFFDTTADI 1220
>gnl|CDD|190201 pfam02058, Guanylin, Guanylin precursor.
Length = 87
Score = 28.1 bits (63), Expect = 2.9
Identities = 8/28 (28%), Positives = 12/28 (42%)
Query: 72 VQQMEQELTANLESIKKCTDEMESAKSE 99
V + + +LES+KK D E
Sbjct: 1 VTVQDGGFSFSLESVKKLKDLQEQWAQS 28
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 29.9 bits (68), Expect = 3.5
Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 13/96 (13%)
Query: 151 LADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI-------AESRARLADLTREEAKL 203
L+ EA ++Q L E R + E+ A + A++ L ++ A L
Sbjct: 90 LSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQIAAL 149
Query: 204 LDQVEKL--AREVSEKRESMQTSRSNNRLIDFVMQL 237
Q+ L A + SEKR+ S ++ D +L
Sbjct: 150 RRQLAALEAALDASEKRD----RESQAKIADLGRRL 181
Score = 28.8 bits (65), Expect = 8.1
Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 181 AMESEIAESRARLADL-TREEAKLLDQVEKLAREVSE 216
A E ES+A++ADL R L +V++L R SE
Sbjct: 162 ASEKRDRESQAKIADLGRRLNVALAQRVQELNRYRSE 198
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 30.1 bits (69), Expect = 3.6
Identities = 17/65 (26%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
Query: 148 TKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQV 207
K+LA LE +E+V+ L+ + + P E + + R +LA+ + AKL +++
Sbjct: 817 EKELAKLEKEIERVEKKLSNEGFV-----AKAP--EEVVEKEREKLAEYEEKLAKLKERL 869
Query: 208 EKLAR 212
+L
Sbjct: 870 ARLKA 874
>gnl|CDD|132835 cd00147, cPLA2_like, Cytosolic phospholipase A2, catalytic domain;
hydrolyses arachidonyl phospholipids. Catalytic domain
of cytosolic phospholipase A2 (PLA2; EC 3.1.1.4)
hydrolyzes the sn-2-acyl ester bond of phospholipids to
release arachidonic acid. At the active site, cPLA2
contains a serine nucleophile through which the
catalytic mechanism is initiated. The active site is
partially covered by a solvent-accessible flexible lid.
cPLA2 displays interfacial activation as it exists in
both "closed lid" and "open lid" forms. Movement of the
cPLA2 lid possibly exposes a greater hydrophobic surface
and the active site. cPLA2 belongs to the alpha-beta
hydrolase family which is identified by a characteristic
nucleophile elbow with a consensus sequence of
Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu =
nucleophile). Calcium is required for cPLA2 to bind with
membranes or phospholipids. Group IV cPLA2 includes six
intercellular enzymes: cPLA2alpha, cPLA2beta,
cPLA2gamma, cPLA2delta, cPLA2epsilon, and cPLA2zeta.
Length = 438
Score = 29.9 bits (68), Expect = 3.6
Identities = 29/113 (25%), Positives = 46/113 (40%), Gaps = 22/113 (19%)
Query: 427 IGYSGGGYR-MVTLEGAIIEPSGTMSGG---GSNPIRGLMGRKATVST-----DTSLVKD 477
I SGGGYR M GA+ GG + GL G +++ D S KD
Sbjct: 46 ILGSGGGYRAMTGGAGAL---KALDEGGLLDCVTYLSGLSGSTWLMASLYSNPDWS-QKD 101
Query: 478 LEVKEKRLASLETELRILSQQKMEVET--QLNCTGNELKYKKQE-YDTCLIDV 527
L+ + L R + + + + + +L EL+ KK+ ++ L D
Sbjct: 102 LDEAIEWL------KRHVIKSPLLLFSPERLKYYAKELEEKKKAGFNVSLTDF 148
>gnl|CDD|240354 PTZ00310, PTZ00310, AMP deaminase; Provisional.
Length = 1453
Score = 30.2 bits (68), Expect = 3.7
Identities = 36/154 (23%), Positives = 57/154 (37%), Gaps = 24/154 (15%)
Query: 163 TTLTERKTLCEE----------LTTRVPAME----------SEIAESRARLADLTREEAK 202
TL E +CE+ L+T AME I RA++ L R++
Sbjct: 1295 KTLEEELAICEKIVDVRAETVLLSTHDQAMEVMLREMETTNERIVHLRAQVDSLRRQQRS 1354
Query: 203 LLDQVEKLAREVSEKRESMQTSRSNNRLIDF-VMQLKSENRVSGILGRLGDLGGIDQKYD 261
L VE+L E ++E + N I V +L+ + G+ G G +++
Sbjct: 1355 L---VERLTEEGIRRQEVSEQVAGENNSIALRVNRLRERLQSEGLTVGGGLYGRSNEEES 1411
Query: 262 IAVSTACGALNYIVTETVEAGEAVIAAVKRQNVG 295
A ST G + V+ AV +V
Sbjct: 1412 EASSTQQGETVKRLLRWKPMPPDVMRAVTVASVP 1445
>gnl|CDD|226055 COG3524, KpsE, Capsule polysaccharide export protein [Cell envelope
biogenesis, outer membrane].
Length = 372
Score = 29.8 bits (67), Expect = 3.9
Identities = 25/121 (20%), Positives = 54/121 (44%), Gaps = 18/121 (14%)
Query: 76 EQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFR--WSVFSAKTRFECDIA 133
+ E N S + D + A+ E++ E++ K+++ + +R VF K A
Sbjct: 164 QSEKLINQLSERARRDTVRFAEEEVQKAEERVKKASNDLTDYRIKNGVFDPK-------A 216
Query: 134 KSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARL 193
++E+++ L ++ LE L +Q L K++ ++P +++ I R +L
Sbjct: 217 QAEVQMSL---------VSKLEDELIVIQAQLDTVKSVMNPENPQIPGLKARIESLRKQL 267
Query: 194 A 194
Sbjct: 268 L 268
>gnl|CDD|215217 PLN02387, PLN02387, long-chain-fatty-acid-CoA ligase family
protein.
Length = 696
Score = 29.7 bits (67), Expect = 4.2
Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 15/69 (21%)
Query: 145 SKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLL 204
SK+ KKL D+ + LE V+ RV M+ E +S + L+ +
Sbjct: 186 SKQLKKLIDISSQLETVK---------------RVIYMDDEGVDSDSSLSGSSNWTVSSF 230
Query: 205 DQVEKLARE 213
+VEKL +E
Sbjct: 231 SEVEKLGKE 239
>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain.
Length = 372
Score = 29.6 bits (67), Expect = 4.3
Identities = 16/88 (18%), Positives = 32/88 (36%)
Query: 151 LADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKL 210
+ +LE L+K++ L V M+ + E+ +L L + ++ ++L
Sbjct: 285 VKELEKGLKKLERELELSALDEHPDDKFVEKMKEFLEEAEEKLDKLESLLKEAMELFKEL 344
Query: 211 AREVSEKRESMQTSRSNNRLIDFVMQLK 238
E + L DF+ K
Sbjct: 345 TEYFGEDPKETSPEEFFKILRDFLRMFK 372
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 29.0 bits (65), Expect = 5.2
Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 5/86 (5%)
Query: 149 KKLADLEANLEKVQTTLTERKTL-----CEELTTRVPAMESEIAESRARLADLTREEAKL 203
+ +LEA L + + L+ E L + A+E+E+A R+ L +L
Sbjct: 7 RDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQL 66
Query: 204 LDQVEKLAREVSEKRESMQTSRSNNR 229
+++ A E+ E+ E + R
Sbjct: 67 IEEGHPDAEEIQERLEELNQRWEELR 92
>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain. Mitosin or
centromere-associated protein-F (Cenp-F) is found bound
across the centromere as one of the proteins of the
outer layer of the kinetochore. Most of the
kinetochore/centromere functions appear to depend upon
binding of the C-terminal par to f the molecule, whereas
the N-terminal part, here, may be a cytoplasmic player
in controlling the function of microtubules and dynein.
Length = 288
Score = 29.0 bits (64), Expect = 5.4
Identities = 16/82 (19%), Positives = 39/82 (47%)
Query: 145 SKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLL 204
++ +K+ +LE L+K++ +R+ + L + + ++ + A+L RE +L+
Sbjct: 14 TRALQKIQELEGQLDKLKKEKQQRQFQLDSLEAALQKQKQKVENEKTEGANLKRENQRLM 73
Query: 205 DQVEKLAREVSEKRESMQTSRS 226
+ E L + + +Q S
Sbjct: 74 EICEHLEKTKQKISHELQVKES 95
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 29.3 bits (66), Expect = 5.5
Identities = 17/87 (19%), Positives = 40/87 (45%), Gaps = 10/87 (11%)
Query: 143 TQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAK 202
T+ +E +++ E ++K++ T+ + EL + ++ EI + + L RE
Sbjct: 409 TEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRD 468
Query: 203 ----------LLDQVEKLAREVSEKRE 219
++E+L +E+ EK++
Sbjct: 469 KVRKDREIRARDRRIERLEKELEEKKK 495
Score = 28.5 bits (64), Expect = 9.6
Identities = 30/154 (19%), Positives = 60/154 (38%), Gaps = 20/154 (12%)
Query: 67 EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
E E ++ +E+T + IKK + +E + E ++++ +E K + +
Sbjct: 405 EKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRR 464
Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
+ K +++ + +E+++ L E+K EEL E ++
Sbjct: 465 EVRDKVRKD-------------REIRARDRRIERLEKELEEKKKRVEEL-------ERKL 504
Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRES 220
AE R + + VEKL E E+ E
Sbjct: 505 AELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEE 538
>gnl|CDD|203593 pfam07200, Mod_r, Modifier of rudimentary (Mod(r)) protein. This
family represents a conserved region approximately 150
residues long within a number of eukaryotic proteins
that show homology with Drosophila melanogaster Modifier
of rudimentary (Mod(r)) proteins. The N-terminal half of
Mod(r) proteins is acidic, whereas the C-terminal half
is basic, and both of these regions are represented in
this family. Members of this family include the Vps37
subunit of the endosomal sorting complex ESCRT-I, a
complex involved in recruiting transport machinery for
protein sorting at the multivesicular body (MVB). The
yeast ESCRT-I complex consists of three proteins (Vps23,
Vps28 and Vps37). The mammalian homologue of Vps37
interacts with Tsg101 (Pfam: PF05743) through its mod(r)
domain and its function is essential for lysosomal
sorting of EGF receptors.
Length = 150
Score = 28.4 bits (64), Expect = 5.8
Identities = 13/63 (20%), Positives = 26/63 (41%)
Query: 158 LEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEK 217
++ + E L E ++ E +A+L D E KL ++ E+ +++ E
Sbjct: 29 VQDLNLEKDALLAQNESLAEENLEKEPQLEELKAQLRDKYEELKKLEEKYEEKKQQLDEL 88
Query: 218 RES 220
S
Sbjct: 89 SSS 91
>gnl|CDD|227251 COG4914, COG4914, Predicted nucleotidyltransferase [General
function prediction only].
Length = 190
Score = 28.6 bits (64), Expect = 5.9
Identities = 18/88 (20%), Positives = 31/88 (35%), Gaps = 3/88 (3%)
Query: 309 HSQCYDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQD-LNQAKRIGYSGGGYR 367
+D T PRL V+ E +R+ + V + L AK + G +
Sbjct: 66 EENGWDMGTTDHGTPRLE--AIVEGEAVRVDLLENILDFYVPPELLEDAKEVDIDGIKVK 123
Query: 368 MVTLEGAIIEPSETLVAQDLNQAKRIGY 395
+ LE I ++ +D K +
Sbjct: 124 SIGLEDLIALKAKAARKEDEEFLKEVAR 151
>gnl|CDD|185292 PRK15394, PRK15394,
4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol
deformylase ArnD; Provisional.
Length = 296
Score = 29.1 bits (66), Expect = 6.2
Identities = 11/30 (36%), Positives = 19/30 (63%), Gaps = 3/30 (10%)
Query: 314 DKYR-TPENVPRLIDLIQVQDEKIRLAFYF 342
D +R T E VPRL++++ I+ +F+F
Sbjct: 11 DTFRGTREGVPRLLEILSKHG--IQASFFF 38
>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil. Members of
this family are predominantly found in the
actin-bundling protein Cortexillin I from Dictyostelium
discoideum. They adopt a structure consisting of an
18-heptad-repeat alpha-helical coiled-coil, and are a
prerequisite for the assembly of Cortexillin I.
Length = 107
Score = 27.6 bits (61), Expect = 6.3
Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 134 KSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARL 193
K E +++++ KLA LE +LE + + E ++L + + ++E++ AE RL
Sbjct: 1 KEEKARLEASKNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQRL 60
Query: 194 ADLTREEAKLLDQVE--KLAREVSEKRES 220
+L + +LL +E KLA+ E R S
Sbjct: 61 RELQAKIDELLKNLELEKLAKMELESRLS 89
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 29.1 bits (66), Expect = 6.4
Identities = 18/71 (25%), Positives = 36/71 (50%), Gaps = 1/71 (1%)
Query: 154 LEANLEKVQTTLTERKTLCEELTTRVPAM-ESEIAESRARLADLTREEAKLLDQVEKLAR 212
LE ++++ + E + +R+ ++ E EI E R +L +LT E A+L +++ L
Sbjct: 23 LEQQNKELEAKIEELRQKKSAEPSRLYSLYEQEIRELRKQLDELTNERARLQLEIDNLRL 82
Query: 213 EVSEKRESMQT 223
+ RE +
Sbjct: 83 AAEDFREKYED 93
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 28.8 bits (65), Expect = 6.7
Identities = 27/147 (18%), Positives = 54/147 (36%), Gaps = 13/147 (8%)
Query: 65 LGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSA 124
L E+ +++ E+ L +L I ++ LK ++ +
Sbjct: 152 LEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLA--------DELNL 203
Query: 125 KTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMES 184
E + A+ EL S++ K+L +L+ L+++ + EL + E
Sbjct: 204 CDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEK 263
Query: 185 EIAESRARLADLTREE-AKLLDQVEKL 210
E R + +E +KL +V L
Sbjct: 264 IREECR----GWSAKEISKLKAKVSLL 286
>gnl|CDD|217325 pfam03028, Dynein_heavy, Dynein heavy chain and region D6 of dynein
motor. This family represents the C-terminal region of
dynein heavy chain. The chain also contains ATPase
activity and microtubule binding ability and acts as a
motor for the movement of organelles and vesicles along
microtubules. Dynein is also involved in cilia and
flagella movement. The dynein subunit consists of at
least two heavy chains and a number of intermediate and
light chains. The 380 kDa motor unit of dynein belongs
to the AAA class of chaperone-like ATPases. The core of
the 380 kDa motor unit contains a concatenated chain of
six AAA modules, of which four correspond to the ATP
binding sites with P-loop signatures described
previously, and two are modules in which the P loop has
been lost in evolution. This C-terminal domain carries
the D6 region of the dynein motor where the P-loop has
been lost in evolution but the general structure of a
potential ATP binding site appears to be retained.
Length = 706
Score = 29.2 bits (66), Expect = 7.3
Identities = 9/23 (39%), Positives = 15/23 (65%), Gaps = 1/23 (4%)
Query: 283 EAVIAAVKRQNVGRVNVIPLDKM 305
++ + A+ Q+V R N PLDK+
Sbjct: 573 QSFLTAI-MQSVARKNKWPLDKL 594
>gnl|CDD|233080 TIGR00665, DnaB, replicative DNA helicase. This model describes
the helicase DnaB, a homohexameric protein required for
DNA replication. The homohexamer can form a ring around
a single strand of DNA near a replication fork. An
intein of > 400 residues is found at a conserved
location in DnaB of Synechocystis PCC6803, Rhodothermus
marinus (both experimentally confirmed), and
Mycobacterium tuberculosis. The intein removes itself by
a self-splicing reaction. The seed alignment contains
inteins so that the model built from the seed alignment
will model a low cost at common intein insertion sites
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 434
Score = 28.9 bits (66), Expect = 8.3
Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 1/41 (2%)
Query: 173 EELTTRVPAMESEIAESRARLADLTREE-AKLLDQVEKLAR 212
E+L R+ + ES + + R L+ E+ KL KL+
Sbjct: 237 EQLVMRMLSSESRVDSQKLRTGKLSDEDWEKLTSAAGKLSE 277
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 28.8 bits (65), Expect = 8.3
Identities = 16/89 (17%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 131 DIAKSELEI---FLSTQSKETK-KLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
D + +++ + Q K+ +A + +++ K EELT + + +I
Sbjct: 191 DHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDI 250
Query: 187 AESRARLADLTREEAKLLDQVEKLAREVS 215
+ A L L AK+ ++E+ + +
Sbjct: 251 EDPSAALNKLNTAAAKIKSKIEQFQKVIK 279
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 28.4 bits (64), Expect = 8.3
Identities = 15/83 (18%), Positives = 40/83 (48%), Gaps = 4/83 (4%)
Query: 144 QSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKL 203
+ + +L + L+K+ + + EEL + +ES+I + + ++L E A+
Sbjct: 199 EDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEA 258
Query: 204 LDQVEKL----AREVSEKRESMQ 222
++E+ +E+ + +E ++
Sbjct: 259 EKKLEQCRGFTFKEIEKLKEQLK 281
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 28.9 bits (64), Expect = 9.5
Identities = 39/152 (25%), Positives = 70/152 (46%), Gaps = 13/152 (8%)
Query: 83 LESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAKSELEIFLS 142
LE +KK E E E K EK +E + K + +F + R E +I E+++
Sbjct: 407 LEELKKILAEDEKLLDEKKQFEKIAEE---LKGKEQELIFLLQAR-EKEIHDLEIQLTAI 462
Query: 143 TQSKE--TKKLADLEANLEKVQTTLTERKTLC-------EELTTRVPAMESEIAESRARL 193
S+E K++ DL+ LEK + E C +ELT M E+ + + +
Sbjct: 463 KTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDI 522
Query: 194 ADLTREEAKLLDQVEKLAREVSEKRESMQTSR 225
+ ++E ++L Q+E L + R+ +++ R
Sbjct: 523 INCKKQEERMLKQIENLEEKEMNLRDELESVR 554
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper.
Length = 65
Score = 26.0 bits (58), Expect = 9.5
Identities = 13/42 (30%), Positives = 22/42 (52%)
Query: 168 RKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEK 209
+K EEL +V +E+E + + L RE KL ++E+
Sbjct: 24 KKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSELEE 65
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.130 0.348
Gapped
Lambda K H
0.267 0.0771 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,717,717
Number of extensions: 2520742
Number of successful extensions: 3441
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3091
Number of HSP's successfully gapped: 336
Length of query: 531
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 430
Effective length of database: 6,457,848
Effective search space: 2776874640
Effective search space used: 2776874640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (27.2 bits)