RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17955
(547 letters)
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 43.2 bits (101), Expect = 3e-04
Identities = 57/333 (17%), Positives = 130/333 (39%), Gaps = 8/333 (2%)
Query: 63 EKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQEEMHEQMSGYKRMRREHQAA 122
KK E + + +K EA KK E + K++ + + E+ ++ A
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475
Query: 123 LLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLT 182
K EE K + + K+ K+ ++ + +K +E KKA E+
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535
Query: 183 KEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQ--DERRRAAGPSESIASSLTQGATE 240
+ + + +K E K +E K ++E+ K E ++ +E + A A + E
Sbjct: 1536 ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
Query: 241 ASRGRPDEARLCAD---RKIIRQRAGKELNKRQQQLEQAHSMLLRHH--EKTQELEYRQQ 295
+E + +K + E K+ ++ ++ L + EK + E ++
Sbjct: 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655
Query: 296 KSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVEC 355
+ + ++ + ++ + + E+ E E K AL+ + + L + + E
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK 1715
Query: 356 HHLKE-RLHYELEILMAYQSKNKMQAEAQQLED 387
+E + E + A ++K + + + ++ E+
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748
Score = 38.2 bits (88), Expect = 0.012
Identities = 76/440 (17%), Positives = 173/440 (39%), Gaps = 32/440 (7%)
Query: 63 EKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQEEMHEQMSGYKRM--RREHQ 120
E++ E I + + R A ++ KA + R EL + E ++ K+ +++
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305
Query: 121 AALLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKK 180
A K EE K + + K+ K+ ++ + E +K +E E A + E+K
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365
Query: 181 LTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERR----RAAGPSESIASSLTQ 236
+++ +K A KK E K + K++ +D+++ + A ++ A +
Sbjct: 1366 AEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKK 1425
Query: 237 GATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQK 296
A E + DEA+ A+ + E K+ ++ ++A + E + E + K
Sbjct: 1426 KAEEKKKA--DEAKKKAEEA----KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK--K 1477
Query: 297 SVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECH 356
++ ++ E + D ++ + KK + ++ K + DE+++ E
Sbjct: 1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK----KADEAKKAEEA 1533
Query: 357 HLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQERIR 416
+ E A + K +A++L+ ++A E K E + + E +
Sbjct: 1534 KKADEAKKAEEKKKADELK-----KAEELKKAEEKKKA-EEAKKAEEDKNMALRKAEEAK 1587
Query: 417 LLHERQERELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQQFL 476
E + E+ +E +M + +++ E ++ ++ + +Q
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMK--------AEEAKKAEEAKIKAEELKKAEEEKKKVEQLK 1639
Query: 477 GERQERIRLLHERQERELES 496
+ E + E ++ E E+
Sbjct: 1640 KKEAEEKKKAEELKKAEEEN 1659
Score = 34.3 bits (78), Expect = 0.18
Identities = 64/316 (20%), Positives = 123/316 (38%), Gaps = 15/316 (4%)
Query: 50 QKEHMQAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQEEMHEQM 109
KE +A T +E A++ K EA K +E K E R+ + E
Sbjct: 1083 AKEDNRADEATEEAFGKAEE--AKKTETGKAEEARKAEEAKKKAED-ARKAEEARKAEDA 1139
Query: 110 SGYKRMRREHQAALL----KLEEKCKIEMDQHKSLLDKEYETLLQQFSR---ELEKLTLK 162
+ R+ A + K E+ K E + K + R EL K
Sbjct: 1140 RKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDA 1199
Query: 163 HTSELEKKAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKKE--YKASKERWKRELSQDER 220
+E +KA+ E+ E + + +K EA K E KA +ER E+ + E
Sbjct: 1200 RKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEE 1259
Query: 221 RRAAGPSESIASSLTQGATEASRGRPDEARLCAD--RKIIRQRAGKELNKRQQQLEQAHS 278
R A + A+ + A +A + E + AD +K ++ E K+ ++ ++A
Sbjct: 1260 ARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADE 1319
Query: 279 MLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQ 338
+ E ++ + ++K+ + + + + + E E + E +K E K+
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKA-EAEAAADEAEAAEEKAEAAEKKKEEAKK 1378
Query: 339 QPKSLKIRLDESQEVE 354
+ + K + +E ++ +
Sbjct: 1379 KADAAKKKAEEKKKAD 1394
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 42.6 bits (100), Expect = 6e-04
Identities = 62/396 (15%), Positives = 136/396 (34%), Gaps = 35/396 (8%)
Query: 49 QQKEHMQAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKK----EYKASKERWKRELSQEE 104
Q + + + +E A RK E KK E A EL +E
Sbjct: 141 QGGKVEIIAMMKPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQE 200
Query: 105 MHEQMSGYKRMRREHQAALLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHT 164
+ + K + L+LEE+ + +D K +E LLQ+ R+ ++
Sbjct: 201 LKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLN--EERIDLLQELLRDEQEEIESSK 258
Query: 165 SELEKKAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERRRAA 224
ELEK+ ++ K+ +E ++ + ++ AK++E S+ D+ +
Sbjct: 259 QELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLK 318
Query: 225 GPSESIASSLTQGATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHH 284
+ + ++ KEL K ++++E+ L
Sbjct: 319 ESEKEL-----------------------------KKLEKELKKEKEEIEELEKELKELE 349
Query: 285 EKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLK 344
K + E +++ + + + E S +LK++ ++ K K
Sbjct: 350 IKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAK 409
Query: 345 IRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMET 404
+ L+ S++ E +E+ + +S Q + + ++ + + L +
Sbjct: 410 LLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELK 469
Query: 405 QQFLGERQERIRLLHERQERELESFDQESNRMGFRY 440
+ ++ ++ L E+ E L E
Sbjct: 470 KSEDLLKETKLVKLLEQLELLLLRQKLEEASQKESK 505
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 37.4 bits (87), Expect = 0.024
Identities = 55/343 (16%), Positives = 131/343 (38%), Gaps = 25/343 (7%)
Query: 162 KHTSELEKKAKLNTSCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERR 221
+ ELE++ + +KL +E+ + ++ R + + ++ + + + E + E
Sbjct: 667 RELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELA 726
Query: 222 RAAGPSESIASSLTQGATEASRGRPDEARLCADRKIIRQRAG---KELNKRQQQLEQAHS 278
E + S L + E + L + + + + L K ++++E+
Sbjct: 727 ALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEE 786
Query: 279 MLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALE--- 335
E+ +ELE ++ R D + R+ ++ +E E+ ++ LE
Sbjct: 787 KRQALQEELEELE--EELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKL 844
Query: 336 --LKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRR 393
L+++ + L+ L+E +E E+ E E+ + K +++ E ++LE
Sbjct: 845 DELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELES----EL 900
Query: 394 ALLEQKMDMETQQFLGERQERIRLLHERQERELESFDQESNRMGFRYRQILVSSQSRVEL 453
A L++++ E+ E + LE E + + ++ +E
Sbjct: 901 AELKEEI---------EKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETE--LER 949
Query: 454 EDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERELES 496
E +++ + E +ER L ++E E+
Sbjct: 950 EIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEA 992
Score = 29.7 bits (67), Expect = 5.0
Identities = 48/252 (19%), Positives = 105/252 (41%), Gaps = 20/252 (7%)
Query: 264 KELNKRQQQLEQAHSMLLRHHEKTQEL----------EYRQQKSVHSLREDQVTR----- 308
+EL K+ ++LE+ R+ E EL + ++ + E++++R
Sbjct: 196 EELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEEL 255
Query: 309 -QHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLD-ESQEVECHHLKERL-HYE 365
+ Q EL E + + EL++ + Q + L+++ + E E E L+ERL E
Sbjct: 256 EELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELE 315
Query: 366 LEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQ--QFLGERQERIRLLHERQE 423
E+ + +++ + + L++ + R LLE+ + + + E +E++ L E E
Sbjct: 316 NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE 375
Query: 424 RELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQQFLGERQERI 483
E+ +E + +I + + + R L ++++ ++ E
Sbjct: 376 ELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELE 435
Query: 484 RLLHERQERELE 495
L E +E E
Sbjct: 436 ELQTELEELNEE 447
>gnl|CDD|219320 pfam07172, GRP, Glycine rich protein family. This family of
proteins includes several glycine rich proteins as well
as two nodulins 16 and 24. The family also contains
proteins that are induced in response to various
stresses.
Length = 91
Score = 33.2 bits (76), Expect = 0.061
Identities = 10/28 (35%), Positives = 11/28 (39%)
Query: 9 RHQGAHHSGAGGGSNNVGGGGGGGAPNN 36
G + G GGG G GGGG
Sbjct: 47 GGGGGYGGGGGGGYGGGGYYGGGGGYGG 74
Score = 29.7 bits (67), Expect = 0.86
Identities = 10/24 (41%), Positives = 12/24 (50%)
Query: 9 RHQGAHHSGAGGGSNNVGGGGGGG 32
+ G + G GGG GGG GG
Sbjct: 59 GYGGGGYYGGGGGYGGGGGGYPGG 82
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.8 bits (83), Expect = 0.069
Identities = 42/299 (14%), Positives = 103/299 (34%), Gaps = 19/299 (6%)
Query: 142 DKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTKEIIARQDCDRKVFEAAKK 201
+E E L ++ EK+ L + K E++L + ++ R++
Sbjct: 676 RREIEELEEKIEELEEKIAELEK-ALAELRKELEELEEELEQLRKELEELSRQISAL--- 731
Query: 202 KEYKASKERWKRELSQDERRRAAGPSESIASSLTQGATEASRGRPDEARLCADRKIIRQR 261
++ + E ++ + + + + + + + A A+ +
Sbjct: 732 -RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE----- 785
Query: 262 AGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYM 321
EL + +QL++ L ++ + + +LRE + + + T +
Sbjct: 786 ---ELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842
Query: 322 HRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAE 381
+ E + L + + L+ ++E + L ER E + + ++ E
Sbjct: 843 LEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902
Query: 382 AQQLEDRVSVRRALLEQ------KMDMETQQFLGERQERIRLLHERQERELESFDQESN 434
++LE + S R LE+ ++++ + L E LE + N
Sbjct: 903 LRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALEN 961
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 35.1 bits (82), Expect = 0.095
Identities = 27/106 (25%), Positives = 50/106 (47%), Gaps = 14/106 (13%)
Query: 326 QELKKKHALELKQQPKSLKIRLD---ESQEVECHHLKERLHYELEILMAYQSKNKMQAEA 382
+ +KK+ LE K++ L+ + + E L++RL + E L ++
Sbjct: 52 EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENL------DRKLELL 105
Query: 383 QQLEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERELES 428
++ E+ + + LEQK QQ L +++E + L E Q +ELE
Sbjct: 106 EKREEELEKKEKELEQK-----QQELEKKEEELEELIEEQLQELER 146
Score = 32.4 bits (75), Expect = 0.62
Identities = 37/181 (20%), Positives = 71/181 (39%), Gaps = 31/181 (17%)
Query: 255 RKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLRE--DQVTRQHQT 312
+KI + + + ++ LE+A +E ++ +H LR ++ R+ +
Sbjct: 26 KKIAEAKIKEAEEEAKRILEEAK---KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82
Query: 313 ELTNQEDYMHRSEQEL--------KKKHALELKQQPKSLKIRLDESQEVECHHLKERLHY 364
EL E + + E+ L K++ LE K++ K + E +E E L E
Sbjct: 83 ELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142
Query: 365 ELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQER 424
ELE + EA + +L +K++ E + E I+ + E +
Sbjct: 143 ELERISGLT-----AEEA----------KEILLEKVEEEARH---EAAVLIKEIEEEAKE 184
Query: 425 E 425
E
Sbjct: 185 E 185
Score = 30.1 bits (69), Expect = 3.2
Identities = 26/124 (20%), Positives = 55/124 (44%), Gaps = 13/124 (10%)
Query: 64 KKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQEEMHEQMSGYKRMRREHQAAL 123
+KL K ++ +++ + E +K+E + K+ + E Q+E+ ++ + + E
Sbjct: 85 QKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ-- 142
Query: 124 LKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTK 183
+LE + ++ K + LL++ E E+E++AK K K
Sbjct: 143 -ELERISGLTAEEAKEI-------LLEKVEEEARHEAAVLIKEIEEEAKEEA---DKKAK 191
Query: 184 EIIA 187
EI+A
Sbjct: 192 EILA 195
>gnl|CDD|204270 pfam09579, Spore_YtfJ, Sporulation protein YtfJ (Spore_YtfJ).
Proteins in this family are encoded by bacterial
genomes if, and only if, the species is capable of
endospore formation. YtfJ was confirmed in spores of B.
subtilis; it appears to be expressed in the forespore
under control of SigF.
Length = 81
Score = 32.1 bits (74), Expect = 0.10
Identities = 11/24 (45%), Positives = 11/24 (45%), Gaps = 7/24 (29%)
Query: 17 GAGGGS-------NNVGGGGGGGA 33
GAGGG GGGGG G
Sbjct: 19 GAGGGEGKDKKGKTGFGGGGGAGV 42
>gnl|CDD|185612 PTZ00430, PTZ00430, glucose-6-phosphate isomerase; Provisional.
Length = 552
Score = 34.2 bits (79), Expect = 0.17
Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 10/90 (11%)
Query: 99 ELSQEEMHEQMSGYKR-MRREHQAALLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRE-L 156
+L + ++ + +++ H LLK EE+ K L KE++ + SR+ L
Sbjct: 2 DLESLKSYKNLLSLAEKLKKVHLRDLLKDEERNK--------SLIKEFKGVTLDLSRQRL 53
Query: 157 EKLTLKHTSELEKKAKLNTSCEKKLTKEII 186
++ TLK EL ++AKL + E I
Sbjct: 54 DEETLKLLIELAEEAKLKEKIKDMFNGEKI 83
>gnl|CDD|217958 pfam04197, Birna_RdRp, Birnavirus RNA dependent RNA polymerase
(VP1). Birnaviruses are dsRNA viruses. This family
corresponds to the RNA dependent RNA polymerase. This
protein is also known as VP1. All of the birnavirus VP1
proteins contain conserved RdRp motifs that reside in
the catalytic "palm" domain of all classes of
polymerases. However, the birnavirus RdRps lack the
highly conserved Gly-Asp-Asp (GDD) sequence, a component
of the proposed catalytic site of this enzyme family
that exists in the conserved motif VI of the palm domain
of other RdRps.
Length = 855
Score = 33.7 bits (77), Expect = 0.31
Identities = 37/158 (23%), Positives = 58/158 (36%), Gaps = 20/158 (12%)
Query: 149 LQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTKEIIARQ---DCDRKVFEAAKKKEYK 205
+ +F +LE L+LK + A LNT K + + +KV A K YK
Sbjct: 667 MSEFGEDLEGLSLKEPVTNQTLADLNTPLPSK--DPKVRKPTPRSGLKKVSNALKSGTYK 724
Query: 206 ASKERWKRELSQDERRRAAGPSESIASSLTQGATEASRGRPDEAR--LCADR----KIIR 259
K R+L+ + R ++ A A A++ +P + A R +
Sbjct: 725 NPKSAVGRKLTARAKSRINDDPQAEAL-----ADLATKLKPASSTVDNWASRTEELSVQL 779
Query: 260 QRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKS 297
Q K + QLE+ L E Q + KS
Sbjct: 780 QLLLKANTLAKSQLEETREAL----EVVQSDKIITGKS 813
>gnl|CDD|220915 pfam10961, DUF2763, Protein of unknown function (DUF2763). This
eukaryotic family of proteins has no known function.
Length = 91
Score = 30.8 bits (70), Expect = 0.37
Identities = 9/20 (45%), Positives = 11/20 (55%)
Query: 17 GAGGGSNNVGGGGGGGAPNN 36
G GG +G GGGG P+
Sbjct: 66 GPPGGGRRMGRIGGGGGPSR 85
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 32.9 bits (76), Expect = 0.43
Identities = 19/124 (15%), Positives = 55/124 (44%), Gaps = 10/124 (8%)
Query: 249 ARLCADRKIIRQRAGKELNKRQQQLEQAHSML------LRHHEKTQELEYRQQKSVHSLR 302
RL ++ + + + L +++Q+L+Q L ++ Q L+ QQ+ +L
Sbjct: 267 QRLQQLQQRLARAMRRRLEQKRQRLDQLARRLKFQSPERLLAQQQQRLDRLQQRLQRALE 326
Query: 303 EDQVTRQHQTELTNQEDYMHRSEQELK--KKHALELKQQ-PKSLKIRLDESQEVECHHLK 359
+ + E +Q ++ ++ ++ +L+Q+ ++++ +L ++ L
Sbjct: 327 RRLRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQLKRKRQ-RLEALA 385
Query: 360 ERLH 363
++L
Sbjct: 386 QQLE 389
Score = 29.0 bits (66), Expect = 6.4
Identities = 26/139 (18%), Positives = 54/139 (38%), Gaps = 14/139 (10%)
Query: 285 EKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPK--- 341
E Q L+ QQ+ ++R ++ + + R LK + L Q +
Sbjct: 264 ELLQRLQQLQQRLARAMRRRLEQKRQRLD-----QLARR----LKFQSPERLLAQQQQRL 314
Query: 342 -SLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKM 400
L+ RL + E K+RL + L + +++ Q+LE R + +++
Sbjct: 315 DRLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQL 374
Query: 401 DMETQQFLGERQERIRLLH 419
+ Q+ L +++ L
Sbjct: 375 KRKRQR-LEALAQQLEALS 392
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 32.7 bits (75), Expect = 0.49
Identities = 25/121 (20%), Positives = 51/121 (42%), Gaps = 7/121 (5%)
Query: 299 HSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHL 358
+ E + EL +E+ E+ LK+K L + + L RL+ + L
Sbjct: 167 SAKEELDQLSKKLAELKAEEE--EELERALKEKREELLSKLEEELLARLESKEAALEKQL 224
Query: 359 K---ERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGER--QE 413
+ ER EL + + +++ +A+ E ++ AL ++ E + + E+ +E
Sbjct: 225 RLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEE 284
Query: 414 R 414
R
Sbjct: 285 R 285
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 32.8 bits (75), Expect = 0.50
Identities = 79/425 (18%), Positives = 164/425 (38%), Gaps = 58/425 (13%)
Query: 85 KKKEYKASKERWKRELSQEEMHEQMSGYKRMRREHQAALLKLEEKCKIEMDQHKSLLDKE 144
++ + + EE+ E++ + + + L KLE + + E+ + K+ L K
Sbjct: 300 IEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELE-ELAEEKNELAKL 358
Query: 145 YETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTK--EIIAR-QDCDRKVFEAAKK 201
E L++ LE+L + LE+ +L + ++ + E+ A ++ ++ E K+
Sbjct: 359 LEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKE 418
Query: 202 KEYKASKERWKRELSQDERRRAAGPSESIASSLTQGATEASRGRPDEARLCADRKIIRQR 261
E +REL + E + + G ++ +C
Sbjct: 419 LEEL------ERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCG-------- 464
Query: 262 AGKELNKRQQQLEQAHSMLLRHHEKTQ--ELEYRQQKSVHSLREDQVTRQHQTELTNQED 319
Q+L + H L + + ELE + + + + EL E+
Sbjct: 465 ---------QELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEE 515
Query: 320 YMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQ 379
+ + + EL+++ + L+ L+E + LKE+L +++
Sbjct: 516 ELIELLELEEALKE-ELEEKLEKLENLLEELE-----ELKEKLQ-----------LQQLK 558
Query: 380 AEAQQLEDRVSVRRALLEQKMDMET-QQFLGERQERIRLLHERQERELESFDQESNRMGF 438
E +QLEDR+ + LLE+ + T ++ L E +ER++ L ++ + E Q +
Sbjct: 559 EELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELL-- 616
Query: 439 RYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERELESFD 498
QS E + A E + ++E E +E ++ E E ++E +
Sbjct: 617 ---------QSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667
Query: 499 QESNR 503
E R
Sbjct: 668 AEIRR 672
Score = 31.3 bits (71), Expect = 1.5
Identities = 44/246 (17%), Positives = 90/246 (36%), Gaps = 9/246 (3%)
Query: 113 KRMRREHQAALLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAK 172
+ + RE + A L+ E + E+++ L++E LL+ E+L K +
Sbjct: 488 EELSREKEEAELREEIE---ELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEE 544
Query: 173 LNTSCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERRRAAGPS---ES 229
L EK +++ + K+ + R ++E ++ R R +
Sbjct: 545 LEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKE 604
Query: 230 IASSLTQGATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQE 289
+ L+Q EA + + E Q +LE+ L E+ E
Sbjct: 605 LEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVE 664
Query: 290 --LEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQ-ELKKKHALELKQQPKSLKIR 346
+++ E+Q+ + + +E+ E+ E K E++Q + L+ R
Sbjct: 665 ELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESR 724
Query: 347 LDESQE 352
E +E
Sbjct: 725 KAELEE 730
>gnl|CDD|233909 TIGR02520, pilus_B_mal_scr, type IVB pilus formation outer membrane
protein, R64 PilN family. Several related protein
families encode outer membrane pore proteins for type II
secretion, type III secretion, and type IV pilus
formation. This protein family appears to encode a
secretin for pilus formation, although it is quite
different from PilQ. Members include the PilN
lipoprotein of the plasmid R64 thin pilus, a type IV
pilus. Scoring between the trusted and noise cutoffs are
examples of bundle-forming pilus B (bfpB) [Cell
envelope, Surface structures, Protein fate, Protein and
peptide secretion and trafficking].
Length = 497
Score = 32.6 bits (74), Expect = 0.56
Identities = 14/46 (30%), Positives = 26/46 (56%), Gaps = 1/46 (2%)
Query: 16 SGAGGGSNNVGGGGGGGAPNNFATIRTTSIVTKQQKEHMQAKLNTS 61
S AG GS++ GG G G+ + A +S+ Q+ +++ L++S
Sbjct: 178 STAGSGSSSSGGSGNSGSTQSTAVKLESSVHNDIQQS-IKSMLSSS 222
>gnl|CDD|221784 pfam12810, Gly_rich, Glycine rich protein. This family of proteins
is greatly expanded in Trichomonas vaginalis. The
proteins are composed of several glycine rich motifs
interspersed through the sequence. Although many
proteins have been annotated by similarity in the family
these annotations given the biased composition of the
sequences these are unlikely to be functionally
relevant.
Length = 248
Score = 31.8 bits (73), Expect = 0.74
Identities = 13/38 (34%), Positives = 14/38 (36%), Gaps = 12/38 (31%)
Query: 7 SSRHQGAHHSGA------------GGGSNNVGGGGGGG 32
SS G G GGG + GGGGGG
Sbjct: 156 SSGGNGGFGYGGSGNGGGGGGGYFGGGGGHYAGGGGGG 193
>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
Length = 1634
Score = 32.2 bits (73), Expect = 0.88
Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 1/53 (1%)
Query: 376 NKMQAEAQQLEDRVSVRRALLE-QKMDMETQQFLGERQERIRLLHERQERELE 427
++ Q LE V+R L E Q +E Q+ + ERQ R+ L + LE
Sbjct: 1578 KEVDQLQQSLERLAEVQRELAEGQVKVIEGQKQMAERQSRLSQLENKILGVLE 1630
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 31.9 bits (72), Expect = 1.1
Identities = 41/253 (16%), Positives = 90/253 (35%), Gaps = 30/253 (11%)
Query: 246 PDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLREDQ 305
P +RK + ++ K L + QQ +Q+H+ L + E +E +QQ +
Sbjct: 211 PCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIE 270
Query: 306 VTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHYE 365
R + L ++ + + K+ + + +R+H E
Sbjct: 271 ELRAQEAVLEETQE---------------RINRARKAAPLAAHIKAVTQIEQQAQRIHTE 315
Query: 366 LEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERE 425
L+ M ++K M+ R A ++Q+ +E Q+ L + +
Sbjct: 316 LQSKMRSRAKLLMK------------RAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEV 363
Query: 426 LESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQ---QFLGERQER 482
S + S + + I Q + L ++ L+ + + R +
Sbjct: 364 ATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQ 423
Query: 483 IRLLHERQERELE 495
+L H ++++EL+
Sbjct: 424 GQLAHAKKQQELQ 436
>gnl|CDD|130587 TIGR01524, ATPase-IIIB_Mg, magnesium-translocating P-type ATPase.
This model describes the magnesium translocating P-type
ATPase found in a limited number of bacterial species
and best described in Salmonella typhimurium, which
contains two isoforms. These transporters are active in
low external Mg2+ concentrations and pump the ion into
the cytoplasm. The magnesium ATPases have been
classified as type IIIB by a phylogenetic analysis
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 867
Score = 31.4 bits (71), Expect = 1.4
Identities = 23/102 (22%), Positives = 46/102 (45%), Gaps = 11/102 (10%)
Query: 415 IRLLHERQERELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRAL----LEQKMDM 470
+ L E R+ S ++ + + F + + +S V +E+R V R + +E+ + +
Sbjct: 391 LAKLDESAARQTASRWKKVDEIPFDFDRRRLS----VVVENRAEVTRLICKGAVEEMLTV 446
Query: 471 ETQQFLGERQERIRLLHERQERELESFDQESNRMGFSALAIA 512
T + + L E ++ EL+ E NR G +A+A
Sbjct: 447 CTHK---RFGGAVVTLSESEKSELQDMTAEMNRQGIRVIAVA 485
>gnl|CDD|236474 PRK09350, PRK09350, poxB regulator PoxA; Provisional.
Length = 306
Score = 30.7 bits (70), Expect = 1.7
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 312 TELTNQEDYMHRSEQELKKKHALELKQQP 340
ELT+ + R EQ+ +K+ A L QQP
Sbjct: 239 HELTDAREQRQRFEQDNRKRAARGLPQQP 267
>gnl|CDD|227244 COG4907, COG4907, Predicted membrane protein [Function unknown].
Length = 595
Score = 31.1 bits (70), Expect = 2.0
Identities = 11/26 (42%), Positives = 12/26 (46%)
Query: 7 SSRHQGAHHSGAGGGSNNVGGGGGGG 32
SSR + G G GGGGGG
Sbjct: 569 SSRRSSSSGGGGGFSGGGSGGGGGGA 594
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 28.8 bits (65), Expect = 2.0
Identities = 7/40 (17%), Positives = 12/40 (30%)
Query: 5 RDSSRHQGAHHSGAGGGSNNVGGGGGGGAPNNFATIRTTS 44
SS + S + + G ++ I T S
Sbjct: 25 SVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGTRITTES 64
>gnl|CDD|131941 TIGR02895, spore_sigI, RNA polymerase sigma-I factor. Members of
this sigma factor protein family are strictly limited to
endospore-forming species in the Firmicutes lineage of
bacteria, but are not universally present among such
species. Sigma-I was shown to be induced by heat shock
(PMID:11157964) in Bacillus subtilis and is suggested by
its phylogenetic profile to be connected to the program
of sporulation (PMID:16311624) [Transcription,
Transcription factors, Cellular processes, Sporulation
and germination].
Length = 218
Score = 29.7 bits (67), Expect = 2.9
Identities = 23/90 (25%), Positives = 39/90 (43%), Gaps = 9/90 (10%)
Query: 317 QEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHY--------ELEI 368
E+Y + E E ++ LE K+ K I E +V H R E E
Sbjct: 107 MEEYRNEIENENRRLEILEYKKLLKQFGIEFVELVKVSPKHRDTRKKAIKIAKVIVENEE 166
Query: 369 LMAYQSKNKMQAEAQQLEDRVSVRRALLEQ 398
L+ Y + K + +++E+RV + R +E+
Sbjct: 167 LLEYLIRKK-KLPIKEIEERVRISRKTIER 195
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 30.1 bits (67), Expect = 3.2
Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 14/83 (16%)
Query: 346 RLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQ 405
E E E LKERL + L Y+ + Q + Q K+ E Q
Sbjct: 363 TQAEKYEQEIARLKERLRVSVRRLEEYERRLLGQEQQMQ--------------KLLQEYQ 408
Query: 406 QFLGERQERIRLLHERQERELES 428
L + +ER+R E ++ +++S
Sbjct: 409 ARLEDSEERLRRQQEEKDSQMKS 431
Score = 29.3 bits (65), Expect = 6.9
Identities = 29/113 (25%), Positives = 50/113 (44%), Gaps = 11/113 (9%)
Query: 410 ERQERIRLLHERQERELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMD 469
E E + + + E ++QE R+ R R VS + E E R+ + + QK+
Sbjct: 349 EDIEYETEENREEGTQAEKYEQEIARLKERLR---VSVRRLEEYERRLLGQEQQM-QKLL 404
Query: 470 METQQFLGERQERIRLLHERQERELESFDQESNRMGFSALAIAEASKESYPDM 522
E Q L + +ER+R R+ E D + + +A+ E K+ + DM
Sbjct: 405 QEYQARLEDSEERLR-------RQQEEKDSQMKSIISRLMAVEEELKKDHADM 450
>gnl|CDD|235903 PRK06991, PRK06991, ferredoxin; Provisional.
Length = 270
Score = 29.8 bits (67), Expect = 3.4
Identities = 23/85 (27%), Positives = 37/85 (43%), Gaps = 17/85 (20%)
Query: 198 AAKKKEYKASKERWKRELSQDERR---RAAGPSESIASSLTQGATEASRGRPDEARLCAD 254
A + + A + R +RE E R RAA + + A+ + A A+ EA+
Sbjct: 153 DAARARHDARQARLRREREAAEARAAARAAASAAAAAAEASAAAAPAADD--AEAK---K 207
Query: 255 RKIIR---QRAGKELNKRQQQLEQA 276
R II +RA R+++ E A
Sbjct: 208 RAIIAAALERA------RKKKEELA 226
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 30.3 bits (68), Expect = 3.5
Identities = 54/321 (16%), Positives = 131/321 (40%), Gaps = 24/321 (7%)
Query: 66 LTKEIIARQDCDRKVFEAAKKKEYKASK-----ERWKRELSQEEMHEQ-MSGYKRMRREH 119
+ +I + + ++E ++ + K E+W+ S +E ++ K+ +E
Sbjct: 269 INTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEW 328
Query: 120 QAALLKLEEKCKIEMDQHKSLLDKE----YETLLQQFSRELEKLTLKHTSELEKKAKLNT 175
L KL+ + +++ ++ K+L + Q S E +L + +L ++
Sbjct: 329 PGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKIN 388
Query: 176 SCEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERRRAAGPSE---SIAS 232
KLTK + +R+ + +F++ +K + S+ + S +I
Sbjct: 389 IQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQ 448
Query: 233 SLTQGATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEY 292
+G+ + L I++R E NK E ++ +E TQ LE
Sbjct: 449 LFPKGSGINESIKKSILEL---NDEIQERIKTEENKSITLEEDIKNLKHDINELTQILEK 505
Query: 293 RQQKSVHSLREDQVTRQHQTELTNQEDY-MHRSEQELKKKH------ALELKQQPKSLKI 345
+ + + + +++++ + + + E+EL + L+ +Q +S +I
Sbjct: 506 LELELSEANSKFELSKEENERELVAQRIEIEKLEKELNDLNLLSKTSILDAEQLVQSTEI 565
Query: 346 RLDESQEVECHHLKERLHYEL 366
+LDE + V+ + + ++H ++
Sbjct: 566 KLDELK-VDLNRKRYKIHKQV 585
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 30.2 bits (68), Expect = 3.6
Identities = 33/130 (25%), Positives = 63/130 (48%), Gaps = 5/130 (3%)
Query: 300 SLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDE--SQEVECHH 357
S E Q E N+ + ++ L + LEL+QQ + L+ L E S+ +
Sbjct: 478 SENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSEDSSL 537
Query: 358 LKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQERIRL 417
LK +L LE L ++ ++Q + +Q+E+ + L +K+ E + L ++ E +R
Sbjct: 538 LKSKLEEHLEQLH--EANEELQKKREQIEELEPDQDQNLSRKIA-ELEAALQKKDEDMRA 594
Query: 418 LHERQERELE 427
+ ER ++ +E
Sbjct: 595 MEERYKKYVE 604
>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
family 4 of glycosyl hydrolases [Carbohydrate transport
and metabolism].
Length = 442
Score = 29.5 bits (67), Expect = 4.4
Identities = 11/50 (22%), Positives = 22/50 (44%), Gaps = 3/50 (6%)
Query: 310 HQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLD---ESQEVECH 356
+ T + M R ++ + ELK++P+ L+ R+ S E +
Sbjct: 275 IEKFGTRANEVMKREKELFELYKKPELKEKPEELEKRIGAGKYSSEYASN 324
>gnl|CDD|221737 pfam12726, SEN1_N, SEN1 N terminal. This domain is found at the N
terminal of the helicase SEN1. SEN1 is a Pol II
termination factor for noncoding RNA genes. The N
terminal of SEN1, unlike the C terminal, is not required
for growth.
Length = 724
Score = 30.0 bits (68), Expect = 4.5
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 123 LLKLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEK 169
L K K K E+ K+ +K + L + S+ LE+L+ SEL
Sbjct: 469 LEKFPPKGKDELTAEKTQFNKIVQQLTELISQILERLSDFDPSELSS 515
>gnl|CDD|161783 TIGR00237, xseA, exodeoxyribonuclease VII, large subunit. This
family consist of exodeoxyribonuclease VII, large
subunit XseA which catalyses exonucleolytic cleavage in
either the 5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. Exonuclease VII consists of
one large subunit and four small subunits [DNA
metabolism, Degradation of DNA].
Length = 432
Score = 29.8 bits (67), Expect = 4.6
Identities = 21/113 (18%), Positives = 44/113 (38%), Gaps = 1/113 (0%)
Query: 327 ELKKKHALELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLE 386
E+ + EL Q+ +RL + + H K RL + L +NK+ + Q E
Sbjct: 251 EIVSPNQDELLQRLDGFNVRLHRAFDTLLHQKKARLEQLVASLQRQHPQNKLALQQLQFE 310
Query: 387 DRVSVRRALLEQKMDMETQQFLGERQERIRLL-HERQERELESFDQESNRMGF 438
++A L ++++ Q+ + + + R +Q + R+
Sbjct: 311 KLEKRKQAALNKQLERTRQKKTRLTKRLTQTNPSPQILRAQTRTEQLNRRLNA 363
>gnl|CDD|223219 COG0141, HisD, Histidinol dehydrogenase [Amino acid transport and
metabolism].
Length = 425
Score = 29.5 bits (67), Expect = 4.7
Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 1/47 (2%)
Query: 398 QKMDMETQQFLGERQERIRLLHERQ-ERELESFDQESNRMGFRYRQI 443
Q++D E ++ L E I HE Q ++ + +G R+R I
Sbjct: 71 QRLDPEVKEALEVAAENIEAFHEAQLPKDWLVETEPGVVLGQRWRPI 117
>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit. This
family consist of exonuclease VII, large subunit
EC:3.1.11.6 This enzyme catalyzes exonucleolytic
cleavage in either 5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. This exonuclease VII enzyme
is composed of one large subunit and 4 small ones.
Length = 295
Score = 29.2 bits (66), Expect = 5.1
Identities = 23/118 (19%), Positives = 44/118 (37%), Gaps = 11/118 (9%)
Query: 332 HALELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSV 391
EL Q+ + L+ RL + + ++RL L + S+ ++ ++L +
Sbjct: 144 DRTELLQKLEGLEQRLSRALKNRLEKEQDRL-NLLRERLKSLSRRLLEQHEERLAE---- 198
Query: 392 RRALLEQKMDMETQQFLGERQERIRLLHERQERELESFDQESNRMGFRYRQILVSSQS 449
L ++ Q L +Q RL RELE N++ + + S
Sbjct: 199 ----LRDRLISSIQNLLSRKQS--RLERLILNRELEKNSLLENKLATANLTAQLKALS 250
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 29.3 bits (66), Expect = 5.8
Identities = 55/266 (20%), Positives = 101/266 (37%), Gaps = 17/266 (6%)
Query: 245 RPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLRED 304
RPDE R D K + Q +A L +K QEL R L E
Sbjct: 142 RPDEQRQLLDTFAGANEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRLDFLQFQLEEL 201
Query: 305 QVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQEVECHHLKERL-H 363
+ + +L ED +EQ+ + + +L++ ++ L +V+ L E L
Sbjct: 202 E-----EADLQPGEDEALEAEQQ-RLSNLEKLRELSQNALAALRGDVDVQEGSLLEGLGE 255
Query: 364 YELEILMAYQSK-NKMQAEAQQLEDRV-----SVRRALLEQKMDMETQQFLGERQERIRL 417
+L + ++ + V ++ L E + D E + ER +I+
Sbjct: 256 AQLALASVIDGSLRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERLAQIKR 315
Query: 418 LHERQERELESFDQESNRMGFRYRQILVSSQSRVELEDRVSVRRALLEQKMDMETQQFLG 477
L + +E + + ++ Q+ S +S LE+ V LE+++D
Sbjct: 316 LKRKYGASVEEVLEYAEKIKEELDQLDDSDESLEALEEEVDK----LEEELDKAAVALSL 371
Query: 478 ERQERIRLLHERQERELESFDQESNR 503
R++ L +R E+EL++ E
Sbjct: 372 IRRKAAERLAKRVEQELKALAMEKAE 397
>gnl|CDD|234722 PRK00310, rpsC, 30S ribosomal protein S3; Reviewed.
Length = 232
Score = 29.0 bits (66), Expect = 5.9
Identities = 11/13 (84%), Positives = 11/13 (84%)
Query: 382 AQQLEDRVSVRRA 394
AQQLE RVS RRA
Sbjct: 121 AQQLERRVSFRRA 133
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 28.1 bits (63), Expect = 6.0
Identities = 30/129 (23%), Positives = 60/129 (46%), Gaps = 6/129 (4%)
Query: 293 RQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALELKQQPKSLKIRLDESQE 352
Q K L Q+ ++H+ EL E R+ ++L+++ EL++ PK ++ Q+
Sbjct: 7 EQLKDRFFLERTQLLKRHEKELEQLERQQKRTIEKLEQRQTQELRRLPKRIRA----EQK 62
Query: 353 VECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALLEQKMDMETQQFLGERQ 412
KE L E + L Q K+ +Q + R+ + EQK + ++FL +++
Sbjct: 63 TRLKMFKESLKIEKKELK--QEVEKLPRFQEQEKKRMKAEKEEQEQKHQKQEREFLAKQE 120
Query: 413 ERIRLLHER 421
E + ++
Sbjct: 121 ENLEEALQQ 129
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.9 bits (65), Expect = 6.9
Identities = 25/121 (20%), Positives = 53/121 (43%)
Query: 177 CEKKLTKEIIARQDCDRKVFEAAKKKEYKASKERWKRELSQDERRRAAGPSESIASSLTQ 236
+KKL K + +Q +R+ + + +KE+++ Q ++ A + +
Sbjct: 100 RQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAE 159
Query: 237 GATEASRGRPDEARLCADRKIIRQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQK 296
A+ + G R + + ++ EL K +QQ ++ + +K QE E R+QK
Sbjct: 160 SASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219
Query: 297 S 297
+
Sbjct: 220 A 220
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 28.6 bits (64), Expect = 6.9
Identities = 12/61 (19%), Positives = 26/61 (42%)
Query: 259 RQRAGKELNKRQQQLEQAHSMLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQE 318
+QR +Q+ ++A R+ +K Q LE ++ + +E ++ +E
Sbjct: 38 QQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEARE 97
Query: 319 D 319
D
Sbjct: 98 D 98
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 29.3 bits (66), Expect = 7.1
Identities = 44/233 (18%), Positives = 79/233 (33%), Gaps = 16/233 (6%)
Query: 277 HSMLLRHHEKTQELEYRQQKSVHSLREDQVTRQHQTELTNQEDYMHRSEQELKKKHALEL 336
H L H E +E E ++ + + L+ DY+ + + L
Sbjct: 26 HDFGLVHLEDLEEGEKGLKELEKLKVAEVAQISLSSLLSEVLDYLRSVKGLEGRLFILP- 84
Query: 337 KQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQSKNKMQAEAQQLEDRVSVRRALL 396
++ + L+ L +EV K E + +++ E + LED + L
Sbjct: 85 -EEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELEPL- 142
Query: 397 EQKMDMETQQFLGERQERIRL-LHERQEREL--ESFDQESNRMGFRYRQ---ILVSSQSR 450
+D + G + +RL L R++ E + E G I+V+ +
Sbjct: 143 -AYLDFDLSLLRGLKFLLVRLGLVRREKLEALVGVIEDEVALYGENVEASVVIVVAHGA- 200
Query: 451 VELEDRVS-VRRALLEQKMDMETQQFLGERQERIRLLHERQE--RELESFDQE 500
E D+VS + L + ++ L E ELES E
Sbjct: 201 -EDLDKVSKILNELGFELYEVPEFDGGPSELISE-LEEVIAEIQDELESLRSE 251
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 29.0 bits (66), Expect = 7.4
Identities = 29/140 (20%), Positives = 58/140 (41%), Gaps = 17/140 (12%)
Query: 168 EKKAKLNTSCEKKLTKEIIARQDCDRKVFEA-AKKKEYKASKERWKRELSQDERRRAAGP 226
E K KLN E Q K EA A KE + KE + + + +
Sbjct: 513 EDKEKLNELIASLEELERELEQ----KAEEAEALLKEAEKLKEELEEKKEKLQEEE---- 564
Query: 227 SESIASSLTQGATEASRGRPDEARLCAD---RKIIRQRAGKELNKRQQQLEQAHSMLLRH 283
+ + + A +A + EA+ AD +++ + + G + + +L +A L +
Sbjct: 565 -DKLLEEAEKEAQQAIK----EAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKA 619
Query: 284 HEKTQELEYRQQKSVHSLRE 303
+EK ++ + +Q++ L+
Sbjct: 620 NEKKEKKKKKQKEKQEELKV 639
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 29.3 bits (65), Expect = 7.5
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 2/83 (2%)
Query: 448 QSRVELEDRVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERE-LESFDQES-NRMG 505
+ R+E +R R E+ +E ++ ER ER RL +R ER+ L+ ++E +R+
Sbjct: 457 KKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLE 516
Query: 506 FSALAIAEASKESYPDMDGSLSG 528
L A + M+ LS
Sbjct: 517 RDRLEKARRNSYFLKGMENGLSA 539
>gnl|CDD|216179 pfam00898, Orbi_VP2, Orbivirus outer capsid protein VP2. VP2 acts
as an anchor for VP1 and VP3. VP2 contains a
non-specific DNA and RNA binding domain in the
N-terminus.
Length = 946
Score = 28.9 bits (65), Expect = 8.4
Identities = 16/69 (23%), Positives = 29/69 (42%), Gaps = 6/69 (8%)
Query: 125 KLEEKCKIEMDQHKSLLDKEYETLLQQFSRELEKLTLKHTSELEKKAKLNTSCEKKLTKE 184
K E CKI + +LD E E ++ L+++ +L++K + + +
Sbjct: 260 KALELCKILSAIGRKMLDTEEE------PKDEMDLSVRFQFKLDEKFRRTDQERVNIFEV 313
Query: 185 IIARQDCDR 193
R D DR
Sbjct: 314 GGHRTDEDR 322
>gnl|CDD|236092 PRK07772, PRK07772, single-stranded DNA-binding protein;
Provisional.
Length = 186
Score = 28.1 bits (63), Expect = 8.6
Identities = 12/29 (41%), Positives = 12/29 (41%)
Query: 7 SSRHQGAHHSGAGGGSNNVGGGGGGGAPN 35
G G GGG GGGGGGG
Sbjct: 122 RGGGGGGGGGGFGGGGGGSGGGGGGGGGG 150
>gnl|CDD|233515 TIGR01659, sex-lethal, sex-lethal family splicing factor. This
model describes the sex-lethal family of splicing
factors found in Dipteran insects. The sex-lethal
phenotype, however, may be limited to the Melanogasters
and closely related species. In Drosophila the protein
acts as an inhibitor of splicing. This subfamily is most
closely related to the ELAV/HUD subfamily of splicing
factors (TIGR01661).
Length = 346
Score = 28.4 bits (63), Expect = 8.9
Identities = 9/25 (36%), Positives = 10/25 (40%)
Query: 12 GAHHSGAGGGSNNVGGGGGGGAPNN 36
G + G G N GG G P N
Sbjct: 288 GHGNMGNMGHGNMGMAGGSGMNPPN 312
>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
recombination, and repair].
Length = 440
Score = 28.8 bits (65), Expect = 9.3
Identities = 25/126 (19%), Positives = 49/126 (38%), Gaps = 18/126 (14%)
Query: 335 ELKQQPKSLKIRLDESQEVECHHLKERLHYELEILMAYQS-------KNKMQAEAQQLED 387
EL QQ L+ RL L+ L + + L + + + Q+L++
Sbjct: 265 ELLQQLDQLQRRLH-------RALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQRLDE 317
Query: 388 RVSVRRALLEQKMDMETQQFLGERQERIRLLHERQERELESFDQESNRMGFRYRQILVSS 447
R LE ++ ++ Q+ Q L+ + +R+ + Q R+ R+ L
Sbjct: 318 LAIRLRRALENQLALKKQRLERLTQR----LNPQIQRQQQRLQQLERRLDKALRRQLKRK 373
Query: 448 QSRVEL 453
+ R+E
Sbjct: 374 RERLEA 379
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.125 0.338
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,914,402
Number of extensions: 2679235
Number of successful extensions: 4394
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3903
Number of HSP's successfully gapped: 497
Length of query: 547
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 445
Effective length of database: 6,413,494
Effective search space: 2854004830
Effective search space used: 2854004830
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (27.4 bits)