RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14923
(180 letters)
>gnl|CDD|238686 cd01390, HMGB-UBF_HMG-box, HMGB-UBF_HMG-box, class II and III
members of the HMG-box superfamily of DNA-binding
proteins. These proteins bind the minor groove of DNA in
a non-sequence specific fashion and contain two or more
tandem HMG boxes. Class II members include non-histone
chromosomal proteins, HMG1 and HMG2, which bind to bent
or distorted DNA such as four-way DNA junctions,
synthetic DNA cruciforms, kinked cisplatin-modified DNA,
DNA bulges, cross-overs in supercoiled DNA, and can
cause looping of linear DNA. Class III members include
nucleolar and mitochondrial transcription factors, UBF
and mtTF1, which bind four-way DNA junctions.
Length = 66
Score = 58.0 bits (141), Expect = 4e-12
Identities = 27/61 (44%), Positives = 42/61 (68%)
Query: 49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEI 108
PK+PL+ YF F + R K+ + NP+ + EVTK++ E+WK++ E EKK+ E + K+KE
Sbjct: 1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKER 60
Query: 109 Y 109
Y
Sbjct: 61 Y 61
>gnl|CDD|238037 cd00084, HMG-box, High Mobility Group (HMG)-box is found in a
variety of eukaryotic chromosomal proteins and
transcription factors. HMGs bind to the minor groove of
DNA and have been classified by DNA binding preferences.
Two phylogenically distinct groups of Class I proteins
bind DNA in a sequence specific fashion and contain a
single HMG box. One group (SOX-TCF) includes
transcription factors, TCF-1, -3, -4; and also SRY and
LEF-1, which bind four-way DNA junctions and duplex DNA
targets. The second group (MATA) includes fungal mating
type gene products MC, MATA1 and Ste11. Class II and III
proteins (HMGB-UBF) bind DNA in a non-sequence specific
fashion and contain two or more tandem HMG boxes. Class
II members include non-histone chromosomal proteins,
HMG1 and HMG2, which bind to bent or distorted DNA such
as four-way DNA junctions, synthetic DNA cruciforms,
kinked cisplatin-modified DNA, DNA bulges, cross-overs
in supercoiled DNA, and can cause looping of linear DNA.
Class III members include nucleolar and mitochondrial
transcription factors, UBF and mtTF1, which bind
four-way DNA junctions.
Length = 66
Score = 56.1 bits (136), Expect = 2e-11
Identities = 24/66 (36%), Positives = 41/66 (62%)
Query: 49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEI 108
PK+PL+ YF F + R ++ NP + + E++K++ E WK + E EKK+ E + K+KE
Sbjct: 1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKER 60
Query: 109 YAQNVK 114
Y + +
Sbjct: 61 YEKEMP 66
>gnl|CDD|197700 smart00398, HMG, high mobility group.
Length = 70
Score = 51.5 bits (124), Expect = 1e-09
Identities = 24/69 (34%), Positives = 40/69 (57%)
Query: 48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKE 107
KPK+P++ + F + R KI NP++ E++K + E WK + E EK E + K+KE
Sbjct: 1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKE 60
Query: 108 IYAQNVKKY 116
Y + + +Y
Sbjct: 61 RYEEEMPEY 69
>gnl|CDD|189580 pfam00505, HMG_box, HMG (high mobility group) box.
Length = 69
Score = 51.1 bits (123), Expect = 2e-09
Identities = 26/68 (38%), Positives = 40/68 (58%)
Query: 49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEI 108
PK+PL+ +F F + R K+ NP +K E++K++ E+WK + E EKK E + KEK
Sbjct: 1 PKRPLSAFFLFSQEQRAKLKAENPGLKNAEISKILGEKWKNLSEEEKKPYEEKAEKEKAR 60
Query: 109 YAQNVKKY 116
Y + Y
Sbjct: 61 YEKAYPAY 68
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 47.2 bits (112), Expect = 6e-07
Identities = 32/110 (29%), Positives = 56/110 (50%), Gaps = 1/110 (0%)
Query: 45 IPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAK 104
P PK+PL+ YF + + R++I + NP + EV KL+SE+WK++ + EK+ E
Sbjct: 67 DPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANS 126
Query: 105 EKEIYAQNVKKYHDELTP-EAIEFLRKEKEVKKQKKEKREMKKLFRETGK 153
++E Y + ++Y+ +L I + + + K E K E K
Sbjct: 127 DRERYQREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETK 176
Score = 26.7 bits (59), Expect = 6.8
Identities = 20/62 (32%), Positives = 40/62 (64%)
Query: 48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKE 107
K P+ P+ + + +IR K+ +P+ ++E TK++S+ W ++ E++KK+ ++Y K KE
Sbjct: 143 PNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKE 202
Query: 108 IY 109
Y
Sbjct: 203 EY 204
>gnl|CDD|185511 PTZ00199, PTZ00199, high mobility group protein; Provisional.
Length = 94
Score = 35.2 bits (81), Expect = 0.003
Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 9/75 (12%)
Query: 46 PEKPKKPLTPYFKFQGQIRNKIMERNPNIK--IIEVTKLVSEEWKKIGEAEKKRLENE-- 101
P PK+ L+ Y F + R +I+ NP + + V K+V E W K+ E EK E +
Sbjct: 20 PNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQ 79
Query: 102 -----YAKEKEIYAQ 111
Y KEK YA+
Sbjct: 80 EDKVRYEKEKAEYAK 94
>gnl|CDD|204115 pfam09011, DUF1898, Domain of unknown function (DUF1898). This
domain is predominantly found in Maelstrom homolog
proteins. It has no known function.
Length = 69
Score = 34.3 bits (79), Expect = 0.004
Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 46 PEKPKKPLTPYFKFQGQIRNKIMERNPNI-KIIEVTKLVSEEWKKIGEAEKKRLENEYAK 104
P KPK YF F +R ++ P + + E +KL SE+WK + E EK++ E + +
Sbjct: 1 PNKPKAKRNAYFFFVQTMRPELKREGPQVPGVAEFSKLCSEKWKAMSEEEKEKYEEKARE 60
Query: 105 EKEIYAQ 111
+K+ Y +
Sbjct: 61 DKKRYDR 67
>gnl|CDD|238684 cd01388, SOX-TCF_HMG-box, SOX-TCF_HMG-box, class I member of the
HMG-box superfamily of DNA-binding proteins. These
proteins contain a single HMG box, and bind the minor
groove of DNA in a highly sequence-specific manner.
Members include SRY and its homologs in insects and
vertebrates, and transcription factor-like proteins,
TCF-1, -3, -4, and LEF-1. They appear to bind the minor
groove of the A/T C A A A G/C-motif.
Length = 72
Score = 31.1 bits (71), Expect = 0.051
Identities = 15/60 (25%), Positives = 32/60 (53%)
Query: 50 KKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIY 109
K+P+ + F + R K+++ P + ++K++ + WK + EK+ E K KE++
Sbjct: 3 KRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELH 62
>gnl|CDD|213349 cd12789, RasGAP_plexin_C1, Ras-GTPase Activating Domain of
plexin-C1. Plexins form a conserved family of
transmembrane receptors for semaphorins and may be the
ancestors of semaphorins. Plexins are divided into four
types (A-D) according to sequence similarity. Plexin-C1
has been identified as the receptor of semaphorin 7A,
which plays regulatory roles in both the immune and
nervous systems. Unlike other semaphorins which act as
repulsive guidance cues, Sema7A enhances central and
peripheral axon growth and is required for proper axon
tract formation during embryonic development. Plexin-C1
is a potential tumor suppressor for melanoma
progression. The expression of Plexin-C1 is diminished
or absent in human melanoma cell lines. Cofilin, an
actin-binding protein involved in cell migration, is a
downstream target of Sema7A and Plexin-C1 signaling.
Melanoma invasion and metastasis may be promoted through
the loss of Plexin-C1 inhibitory signaling on cofilin
activation. Plexins contain a C-terminal RasGAP domain,
which functions as an enhancer of the hydrolysis of GTP
that is bound to Ras-GTPases. Plexins display GAP
activity towards the Ras homolog Rap. Although the Rho
(Ras homolog) GTPases are most closely related to
members of the Ras family, RhoGAP and RasGAP show no
sequence homology at their amino acid level. RasGTPases
function as molecular switches in a large number of of
signaling pathways. When bound to GTP they are in the on
state and when bound to GDP they are in the off state.
The RasGAP domain speeds up the hydrolysis of GTP in
Ras-like proteins acting as a negative regulator.
Length = 393
Score = 33.3 bits (76), Expect = 0.051
Identities = 26/83 (31%), Positives = 45/83 (54%), Gaps = 8/83 (9%)
Query: 76 IIEVTKLVSEEWKKIGEAEKKRLENEYAKE---KEIYAQNVKKYHDELTPEAIEFLRKEK 132
I ++ L S E ++ E K+ ENE+ +E EIY + + KY DE+ + L +E+
Sbjct: 315 IRDLPPLSSSELEEFLTQESKKHENEFNEEVALMEIY-KYIVKYFDEI----LNKLERER 369
Query: 133 EVKKQKKEKREMKKLFRETGKPK 155
+++ +K+ +K LF E K K
Sbjct: 370 GLEEVQKQLLHVKALFDEKKKCK 392
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 33.1 bits (76), Expect = 0.077
Identities = 26/90 (28%), Positives = 40/90 (44%), Gaps = 18/90 (20%)
Query: 48 KPKKPLTP-YFKFQGQIRNK---------IMERNPNIKIIEVTKLVSEEWKKIGEAEKKR 97
+ +PL P Y F+G I I E+ P+ +E+T+L K E K+
Sbjct: 850 EEVRPLVPWYRGFKGTIERNPKGKFISKGIYEKVPD-GTVEITEL---PIGKWTEDYKEF 905
Query: 98 LENEYAKEKEIYAQNVKKYHDELTPEAIEF 127
LE+E KEK++ + Y D T + F
Sbjct: 906 LESETLKEKDV----IVDYRDYSTANTVHF 931
>gnl|CDD|237005 PRK11864, PRK11864, 2-ketoisovalerate ferredoxin oxidoreductase
subunit beta; Provisional.
Length = 300
Score = 31.2 bits (71), Expect = 0.25
Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 40 SQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKK 96
S+ L +K K P+ Y K QG+ ++ E IK ++ + + E W++I + KK
Sbjct: 248 SKTLLDKKKRK-PVEEYLKLQGRFKHLTEE---EIKGLQ--EEIDEMWEEIKKLAKK 298
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 30.0 bits (68), Expect = 0.45
Identities = 13/55 (23%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 96 KRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRE 150
+RL + + +N+K HD + +++++ +K +K E++K+FRE
Sbjct: 10 RRLGKDLHDAHYVCPKNLKAAHDRAV-AKYQAKKEKEKAEKMRKRYPELEKIFRE 63
>gnl|CDD|183346 PRK11865, PRK11865, pyruvate ferredoxin oxidoreductase subunit
beta; Provisional.
Length = 299
Score = 30.5 bits (69), Expect = 0.47
Identities = 16/56 (28%), Positives = 32/56 (57%), Gaps = 5/56 (8%)
Query: 39 TSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAE 94
T +PL + + +KP+ Y K QG+ ++ +I+I++ K + E+WK++G
Sbjct: 249 TYEPLHLDRRTRKPIEEYLKVQGRFKHL---TEEDIEILQ--KYIDEKWKELGLML 299
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.5 bits (68), Expect = 0.48
Identities = 19/73 (26%), Positives = 33/73 (45%)
Query: 84 SEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
+EE KK EA+KK E + A E + A+ KK D +A E + + K + + +
Sbjct: 1298 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Query: 144 MKKLFRETGKPKK 156
+ E + +
Sbjct: 1358 EAEAAEEKAEAAE 1370
Score = 30.5 bits (68), Expect = 0.57
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 1/85 (1%)
Query: 75 KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
K + K +EE KK EA+KK E++ ++ A KK DE +A E +K E
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE-KKKADEA 1436
Query: 135 KKQKKEKREMKKLFRETGKPKKPGN 159
KK+ +E ++ + ++ + KK
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEE 1461
Score = 30.1 bits (67), Expect = 0.75
Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 1/73 (1%)
Query: 84 SEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
E KK A+KK E + A E + A+ KK DEL A +K E KK+ +EK++
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAA-KKKADEAKKKAEEKKK 1432
Query: 144 MKKLFRETGKPKK 156
+ ++ + KK
Sbjct: 1433 ADEAKKKAEEAKK 1445
Score = 29.0 bits (64), Expect = 1.9
Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 2/82 (2%)
Query: 75 KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
K E K E+ KK E +K + A E + A+ KK DE +A E +K E
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK-ADEAKKKAEE-AKKADEA 1449
Query: 135 KKQKKEKREMKKLFRETGKPKK 156
KK+ +E ++ ++ ++ + KK
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKK 1471
Score = 28.6 bits (63), Expect = 2.1
Identities = 23/79 (29%), Positives = 40/79 (50%)
Query: 64 RNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPE 123
+ + ++ +I EV KL EE K E KK E + E+ A+ KK ++L +
Sbjct: 1582 KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641
Query: 124 AIEFLRKEKEVKKQKKEKR 142
E +K +E+KK ++E +
Sbjct: 1642 EAEEKKKAEELKKAEEENK 1660
Score = 28.2 bits (62), Expect = 3.4
Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQ 137
+ K +EE KK EA+KK E + A E + A+ KK DE +A E +K E KK
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK-ADEAKKKAEEAKKKADEAKKA 1505
Query: 138 KKEKREMKKLFRETGKPK 155
+ K++ + + K
Sbjct: 1506 AEAKKKADEAKKAEEAKK 1523
Score = 27.8 bits (61), Expect = 4.3
Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 2/82 (2%)
Query: 75 KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
K + K +EE KK EA+KK E + A E + A+ KK E + E +K E
Sbjct: 1418 KKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK--AEEAKKKAEEAKKADEA 1475
Query: 135 KKQKKEKREMKKLFRETGKPKK 156
KK+ +E ++ + ++ + KK
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKK 1497
Score = 26.6 bits (58), Expect = 8.7
Identities = 22/70 (31%), Positives = 34/70 (48%)
Query: 84 SEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
+EE KK EA+KK E + A E + A+ KK DE A + ++ K ++ +K +
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKAD 1525
Query: 144 MKKLFRETGK 153
K E K
Sbjct: 1526 EAKKAEEAKK 1535
>gnl|CDD|223087 COG0008, GlnS, Glutamyl- and glutaminyl-tRNA synthetases
[Translation, ribosomal structure and biogenesis].
Length = 472
Score = 30.0 bits (68), Expect = 0.65
Identities = 17/85 (20%), Positives = 29/85 (34%), Gaps = 2/85 (2%)
Query: 75 KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIE-FLRKEKE 133
+IE+ +L ++ E KRL N E + + T E I +++
Sbjct: 366 TLIEIERLDFFFFEDKEEVRLKRLANVIVAE-VLEKDAEGLITSDWTKENIIHWVKAVAR 424
Query: 134 VKKQKKEKREMKKLFRETGKPKKPG 158
+ K K TG +P
Sbjct: 425 LFGVKGPKLFPPLRVALTGGYVEPE 449
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 30.0 bits (68), Expect = 0.82
Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 3/87 (3%)
Query: 75 KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
++ E+ K + E K++ E E++ E AK K+ + +KK LTPE +E ++ +E+
Sbjct: 339 RLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLE--KELEEL 396
Query: 135 KKQKKE-KREMKKLFRETGKPKKPGNS 160
+K K+E + E+ K+ G+ KK
Sbjct: 397 EKAKEEIEEEISKITARIGELKKEIKE 423
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 29.5 bits (67), Expect = 1.1
Identities = 21/60 (35%), Positives = 31/60 (51%)
Query: 119 ELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKPKKPGNSFYLFIKDNIDKKEYEGKS 178
ELT E IEFL K+ K+ K+ E + RE K +K +F K+ +++E E K
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Score = 28.0 bits (63), Expect = 3.1
Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 11/92 (11%)
Query: 74 IKIIEVTKLVSEEWKKI-GEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEK 132
I+ + +K +++ KKI +AEKKR E + K+K+ +A K+ +E +EK
Sbjct: 399 IEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE----------EEK 448
Query: 133 EVKKQKKEKREMKKLFRETGKPKKPGNSFYLF 164
E K+++KE+ E + + + +K LF
Sbjct: 449 EKKEEEKEEEEEEAEEEKEEEEEKKKKQATLF 480
>gnl|CDD|235623 PRK05849, PRK05849, hypothetical protein; Provisional.
Length = 783
Score = 29.6 bits (67), Expect = 1.1
Identities = 16/66 (24%), Positives = 29/66 (43%), Gaps = 3/66 (4%)
Query: 109 YAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKPKKPGNSFYLFIKDN 168
Y ++ + Y D E+ +KE + +E + + KL E G F F+K+
Sbjct: 528 YDEDPELYFDWTKDESTV--DSDKEFELSPEELKALNKLLHEHGLNIDA-EEFLDFLKEA 584
Query: 169 IDKKEY 174
I+ +E
Sbjct: 585 IEGREL 590
>gnl|CDD|227217 COG4880, COG4880, Secreted protein containing C-terminal
beta-propeller domain distantly related to WD-40 repeats
[General function prediction only].
Length = 603
Score = 29.1 bits (65), Expect = 1.3
Identities = 15/59 (25%), Positives = 23/59 (38%), Gaps = 6/59 (10%)
Query: 59 FQGQIRNKIMERNP------NIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQ 111
G++R KI NP N K++E+ + GE + RL+ E Q
Sbjct: 288 LPGEVREKIDLTNPLEDLKNNAKVVEILTTAERVYISEGEGPRLRLKAAAENPGENLDQ 346
>gnl|CDD|238685 cd01389, MATA_HMG-box, MATA_HMG-box, class I member of the HMG-box
superfamily of DNA-binding proteins. These proteins
contain a single HMG box, and bind the minor groove of
DNA in a highly sequence-specific manner. Members
include the fungal mating type gene products MC, MATA1
and Ste11.
Length = 77
Score = 27.2 bits (61), Expect = 1.4
Identities = 10/60 (16%), Positives = 25/60 (41%)
Query: 48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKE 107
K +P + ++ ++ NP + E+++++ W+ K + +EKE
Sbjct: 1 KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKE 60
>gnl|CDD|224309 COG1391, GlnE, Glutamine synthetase adenylyltransferase
[Posttranslational modification, protein turnover,
chaperones / Signal transduction mechanisms].
Length = 963
Score = 28.9 bits (65), Expect = 1.8
Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 3/37 (8%)
Query: 123 EAI--EFLRKEKEVKKQKKEKREMK-KLFRETGKPKK 156
EAI E L ++ K K E EM+ K+ E G+P +
Sbjct: 813 EAIRAEILTYPRDGAKLKTEVEEMREKMRAELGRPPR 849
>gnl|CDD|192760 pfam11444, DUF2895, Protein of unknown function (DUF2895). This is
a bacterial family of uncharacterized proteins.
Length = 199
Score = 28.0 bits (63), Expect = 2.1
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 106 KEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
+E Y N+++ LTP FL+K+ E++ E R+
Sbjct: 82 EEDYGANIERLSPYLTPRCQAFLQKDYELRLLNGELRD 119
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 28.5 bits (65), Expect = 2.2
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 14/63 (22%)
Query: 92 EAEKKRLENEYAK-EKEIYAQNVKKYHDELTPEAIEFLRK------EKEVKKQKKEKREM 144
EAE RLE E AK EKEI + KK +E F+ K EKE +K + + ++
Sbjct: 810 EAELARLEKELAKLEKEI-ERVEKKLSNE------GFVAKAPEEVVEKEREKLAEYEEKL 862
Query: 145 KKL 147
KL
Sbjct: 863 AKL 865
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 28.6 bits (64), Expect = 2.2
Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 60 QGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE 119
+ Q+ N + E NI + + EE + + L+N + + E +QN KY++E
Sbjct: 860 KHQVDNMVTEYEGNIGDTQ-NSIRGEE--NTVKGQYSELQNHH--KTEALSQN-NKYNEE 913
Query: 120 LT-PEAIEFLRKEKEVKKQKKEKREMKK 146
+ E + +E+ K+ KE ++ K
Sbjct: 914 KSAQERMPGADSPEELMKRAKEYQDKHK 941
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 28.3 bits (63), Expect = 2.6
Identities = 18/105 (17%), Positives = 35/105 (33%), Gaps = 13/105 (12%)
Query: 60 QGQIRNKIMERNPNIKIIEVTKL---------VSEEW--KKIGEAEKKRLENEYAKEKEI 108
+ +R+K + V L S + K+ A KRL+ + E E
Sbjct: 343 EADLRDKEEAAGNGFEAERVHDLRSLECMLRYQSSQRELKQTEAAYCKRLDEKRLFEDEA 402
Query: 109 YAQNVKKYHDELTPE--AIEFLRKEKEVKKQKKEKREMKKLFRET 151
+ ++ D+ E ++ Q +K E+ L+
Sbjct: 403 EEEARQRLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNLYDRR 447
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 28.3 bits (64), Expect = 2.7
Identities = 12/53 (22%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 92 EAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFL-RKEKEVKKQKKEKRE 143
E ++ LE + A+E E + +K +EL + + ++K +++ +KE ++
Sbjct: 526 EELERELE-QKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ 577
>gnl|CDD|179174 PRK00941, PRK00941, acetyl-CoA decarbonylase/synthase complex
subunit alpha; Validated.
Length = 781
Score = 28.0 bits (63), Expect = 2.9
Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 1/43 (2%)
Query: 114 KKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKPKK 156
KKY EL + F+R E ++ KE+ +K L + KPKK
Sbjct: 724 KKYLGELPDDLELFVRTEADLPLAMKEEL-LKILEEKDWKPKK 765
>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
methyltransferase subunit [Defense mechanisms].
Length = 489
Score = 28.2 bits (63), Expect = 2.9
Identities = 17/79 (21%), Positives = 31/79 (39%), Gaps = 16/79 (20%)
Query: 88 KKIGEAEKKRLENEYAKEKEI----------------YAQNVKKYHDELTPEAIEFLRKE 131
K++ E +++ + Y + KEI Y N +Y E E I+
Sbjct: 401 KRLTEENIEKIVDTYREFKEIEGFSKSVSLEEIRENDYNLNPGRYVAEEEEEDIDLDELL 460
Query: 132 KEVKKQKKEKREMKKLFRE 150
+E+ + E E++K E
Sbjct: 461 QELSEIDAELAELEKELEE 479
>gnl|CDD|219279 pfam07044, DUF1329, Protein of unknown function (DUF1329). This
family consists of several hypothetical bacterial
proteins of around 475 residues in length. The majority
of family members are from Pseudomonas species but the
family also contains sequences from Shewanella
oneidensis and Thauera aromatica.
Length = 368
Score = 27.9 bits (63), Expect = 3.2
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 108 IYAQNVKKYHDELTPEAIEFLRK 130
I A NV +Y D+LTP L++
Sbjct: 2 ITAANVDQYKDKLTPGQKALLKR 24
>gnl|CDD|223529 COG0452, Dfp, Phosphopantothenoylcysteine synthetase/decarboxylase
[Coenzyme metabolism].
Length = 392
Score = 27.7 bits (62), Expect = 3.6
Identities = 13/73 (17%), Positives = 24/73 (32%), Gaps = 6/73 (8%)
Query: 107 EIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKPKKPG-NSFYLFI 165
A NV Y T E ++ L+ E V + + E+ + G +
Sbjct: 118 LAPAMNVIMYTHPATQENLQRLKSEG-VLFIEPIEGELA----DVGDGRLAEPEEIVEAA 172
Query: 166 KDNIDKKEYEGKS 178
+ + +GK
Sbjct: 173 LALLKTPDLKGKK 185
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 27.8 bits (61), Expect = 3.7
Identities = 20/96 (20%), Positives = 42/96 (43%), Gaps = 1/96 (1%)
Query: 77 IEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKK 136
IE + + ++ +K + + EN E E + +E + + +++ E+E
Sbjct: 589 IEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDY-ETEREENA 647
Query: 137 QKKEKREMKKLFRETGKPKKPGNSFYLFIKDNIDKK 172
+KKE+ E G P+K +Y K I+++
Sbjct: 648 RKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQ 683
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 27.8 bits (63), Expect = 3.7
Identities = 16/69 (23%), Positives = 41/69 (59%), Gaps = 2/69 (2%)
Query: 78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQ 137
E K + E ++ + EK+ L+ E ++++ + ++K +EL + E +K++E++K+
Sbjct: 72 EFEKELRERRNELQKLEKRLLQKEENLDRKL--ELLEKREEELEKKEKELEQKQQELEKK 129
Query: 138 KKEKREMKK 146
++E E+ +
Sbjct: 130 EEELEELIE 138
Score = 26.7 bits (60), Expect = 9.4
Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 8/68 (11%)
Query: 87 WKKIGEAEKKRLENEYA---KEKEIYAQNVKKYHDELTPEA-IEFLRKEKEVKKQKKEKR 142
KKI EA+ K E E +E + A+ +KK E EA E + E +K+ +E+R
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKK---EALLEAKEEIHKLRNEFEKELRERR 81
Query: 143 -EMKKLFR 149
E++KL +
Sbjct: 82 NELQKLEK 89
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 27.4 bits (61), Expect = 3.9
Identities = 15/57 (26%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
Query: 91 GEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKL 147
GEA + ++A E +V H + P +E LRK+++ KK++ + ++ + L
Sbjct: 127 GEALEFEELQKFAWEAAEKGGDV---HLQANPTKLELLRKKEKEKKEQLKIQKKQSL 180
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 27.7 bits (61), Expect = 4.1
Identities = 23/107 (21%), Positives = 48/107 (44%), Gaps = 1/107 (0%)
Query: 37 QQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKK 96
+ P I +P+KP+ P + + + ME + E K+V E+ ++ E E K
Sbjct: 118 PSSIAPPEIDPEPRKPIVPDLVLEEEEEVE-MEEDEEYYEKEPGKVVDEKSEEEEEEELK 176
Query: 97 RLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
+++ E + +VK A F +K ++ ++++K K+
Sbjct: 177 TMKDFIDLESQTKKPSVKDNGKSFWSAASVFSKKLQKWRQKQKLKKP 223
>gnl|CDD|106979 PHA00671, PHA00671, hypothetical protein.
Length = 135
Score = 26.9 bits (58), Expect = 4.2
Identities = 21/70 (30%), Positives = 28/70 (40%), Gaps = 7/70 (10%)
Query: 91 GEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRE 150
+AE RL EY K+ E Y Q K +L F R +KQ +R + +
Sbjct: 48 AKAESDRLTQEYQKQTEAYQQQAKIMQQQLLDSQQSFNRAN---QKQPNSQRALSAV--- 101
Query: 151 TGKPKKPGNS 160
G K G S
Sbjct: 102 -GSAAKSGQS 110
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 27.5 bits (61), Expect = 4.3
Identities = 17/76 (22%), Positives = 29/76 (38%)
Query: 75 KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
K E + +EE K AE K A++K + + A E +K E
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEA 171
Query: 135 KKQKKEKREMKKLFRE 150
KK+ + + + K +
Sbjct: 172 KKKAEAEAKAKAEAKA 187
>gnl|CDD|235772 PRK06290, PRK06290, aspartate aminotransferase; Provisional.
Length = 410
Score = 27.3 bits (61), Expect = 5.1
Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 10/51 (19%)
Query: 117 HDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETG-KPKKPGNSFYLFIK 166
H E+T EK +K + ++ K+ E G K + PG +FYL++K
Sbjct: 298 HPEIT---------EKIREKYSRRLDKLVKILNEVGFKAEMPGGTFYLYVK 339
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 27.2 bits (61), Expect = 5.4
Identities = 22/86 (25%), Positives = 34/86 (39%), Gaps = 14/86 (16%)
Query: 69 ERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAK-EKEIYAQNVKKYHDELTPEAIEF 127
+ + AE RLE E K EKEI ++K +L+ E F
Sbjct: 790 PVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEI--DRIEK---KLSNEG--F 842
Query: 128 LRK------EKEVKKQKKEKREMKKL 147
+ K EKE +K + + ++ KL
Sbjct: 843 VAKAPEEVVEKEKEKLAEYQVKLAKL 868
>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein. The frequency clock
protein, is the central component of the frq-based
circadian negative feedback loop, regulates various
aspects of the circadian clock in Neurospora crassa.
This protein has been shown to interact with itself via
a coiled-coil.
Length = 989
Score = 27.3 bits (60), Expect = 5.6
Identities = 13/53 (24%), Positives = 21/53 (39%)
Query: 3 LFKLTTGVFTGGNVFNLRSCLMPHLNSVTTAGLKQQTSQPLSIPEKPKKPLTP 55
L ++ TG +G NV +S L P +++ QQ P +P
Sbjct: 303 LEQIFTGKISGRNVRRTQSTLTPSVDAALPPVQAQQQEGTQMAPPQPPSNFIT 355
>gnl|CDD|216319 pfam01134, GIDA, Glucose inhibited division protein A.
Length = 391
Score = 27.1 bits (61), Expect = 5.8
Identities = 11/36 (30%), Positives = 19/36 (52%), Gaps = 3/36 (8%)
Query: 58 KFQGQIRNKIMERNPNIKII--EVTKLVSEEWKKIG 91
+ ++ + E +PN+ +I EVT L+ E K G
Sbjct: 96 LYSKEMTETL-ENHPNLTLIQGEVTDLIPENGKVKG 130
>gnl|CDD|238410 cd00795, NOS_oxygenase_euk, Nitric oxide synthase (NOS) eukaryotic
oxygenase domain. NOS produces nitric oxide (NO) by
catalyzing a five-electron heme-based oxidation of a
guanidine nitrogen of L-arginine to L-citrulline via two
successive monooxygenation reactions producing
N(omega)-hydroxy-L-arginine (NHA) as an intermediate. In
mammals, there are three distinct NOS isozymes: neuronal
(nNOS or NOS-1), cytokine-inducible (iNOS or NOS-2) and
endothelial (eNOS or NOS-3) . Nitric oxide synthases are
homodimers. In eukaryotes, each monomer has an
N-terminal oxygenase domain, which binds to the
substrate L-Arg, zinc, and to the cofactors heme and
5.6.7.8-(6R)-tetrahydrobiopterin (BH4) . Eukaryotic
NOS's also have a C-terminal electron supplying
reductase region, which is homologous to cytochrome P450
reductase and binds NADH, FAD and FMN.
Length = 412
Score = 26.9 bits (60), Expect = 6.4
Identities = 11/49 (22%), Positives = 18/49 (36%), Gaps = 6/49 (12%)
Query: 110 AQNVKKYHDELTPEAIEFLR----KEKEVKKQKKEKR--EMKKLFRETG 152
+ +EL P+A +F+ K + R E+ K TG
Sbjct: 45 RPRDGRPKEELLPQAKDFINQYYSSIKRSGSEAHLARLEEVTKEIEATG 93
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 26.8 bits (60), Expect = 6.6
Identities = 16/81 (19%), Positives = 40/81 (49%), Gaps = 10/81 (12%)
Query: 62 QIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELT 121
++ +++ K+ + + E+ K AE++R E K++E + + +L+
Sbjct: 252 KLSPEVLR-----KVDKTREEEEEKILK--AAEEERQEEAQEKKEEKKKEEREAKLAKLS 304
Query: 122 PEAIEFLRKEKEVKKQKKEKR 142
PE RK +E +++K+ ++
Sbjct: 305 PEEQ---RKLEEKERKKQARK 322
>gnl|CDD|114524 pfam05802, EspB, Enterobacterial EspB protein. EspB is a
type-III-secreted pore-forming protein of
enteropathogenic Escherichia coli (EPEC) which is
essential for EPEC pathogenesis. EspB is also found in
Citrobacter rodentium.
Length = 317
Score = 27.0 bits (59), Expect = 6.8
Identities = 13/42 (30%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 83 VSEEWK---KIGEAEKKRLENEYAKEKEIYAQNVKKYHDELT 121
+SE K +I E K +EN E+Y Q+V++ D++
Sbjct: 248 ISESVKSNHEINEQRAKSVENFQQGNLELYKQDVRRSQDDIA 289
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 26.9 bits (60), Expect = 7.0
Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 7/43 (16%)
Query: 92 EAEKKRLENEYAKEKEIYAQNVKKYHD-------ELTPEAIEF 127
EAE +L +Y K Y Q + + E+ +A+ F
Sbjct: 354 EAELTQLNRDYEVNKSNYEQLLTRRESAEVSKQMEVQDKAVSF 396
>gnl|CDD|188995 cd06456, M3A_DCP, Peptidase family M3 dipeptidyl carboxypeptidase
(DCP). Peptidase family M3 dipeptidyl carboxypeptidase
(DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This
metal-binding M3A family also includes oligopeptidase A
(OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off
the C-termini of various peptides and proteins, the
smallest substrate being N-blocked tripeptides and
unblocked tetrapeptides. DCP from E. coli is inhibited
by the anti-hypertensive drug captopril, an inhibitor of
the mammalian angiotensin converting enzyme (ACE, also
called peptidyl dipeptidase A). Oligopeptidase A (OpdA)
may play a specific role in the degradation of signal
peptides after they are released from precursor forms of
secreted proteins. It can also cleave N-acetyl-L-Ala.
Length = 654
Score = 27.0 bits (61), Expect = 7.1
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 121 TPEA-IEFLRK--EKEVKKQKKEKREMKKLFRETGKPKK 156
TPEA + FL EK + KKE E++ +E G +
Sbjct: 268 TPEAVLAFLEDLAEKAKPQAKKELAELQAFAKEEGGDDE 306
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 26.9 bits (60), Expect = 7.8
Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 14/67 (20%)
Query: 90 IGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFR 149
IG+ K+ L + KE Y + K +K +E K QK++K++ +K
Sbjct: 360 IGDELKEELNKRIEEIKEKYPKPPK--------------KKREEKKPQKRKKKKKRKKKG 405
Query: 150 ETGKPKK 156
+ K K
Sbjct: 406 KKRKKKG 412
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 26.9 bits (59), Expect = 7.9
Identities = 12/60 (20%), Positives = 24/60 (40%)
Query: 95 KKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKP 154
K+R E AK + + + K + + + KQ+ E R+ ++ + KP
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 26.8 bits (59), Expect = 8.1
Identities = 12/78 (15%), Positives = 32/78 (41%), Gaps = 2/78 (2%)
Query: 102 YAKEKEIYAQNVKKYHDELTPEAIEFLRKEK--EVKKQKKEKREMKKLFRETGKPKKPGN 159
K+ + ++ + + + + ++ E +VK Q++EK E + K
Sbjct: 109 LGKKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNE 168
Query: 160 SFYLFIKDNIDKKEYEGK 177
+ +N +K+++ K
Sbjct: 169 ERFKEYLENFKRKKFKRK 186
>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family. This is a
family of transcription elongation factors which
includes those referred to as Bex proteins as well as
those named TCEAL7. Bex1 was shown to be a novel link
between neurotrophin signalling, the cell cycle, and
neuronal differentiation, suggesting it might function
by coordinating internal cellular states with the
ability of cells to respond to external signals. TCEAL7
has been shown negatively to regulate the NF-kappaB
pathway, hence being important in ovarian cancer as it
one of the genes frequently downregulated in this
cancer. A closely related protein, TFIIS/TCEA, found in
pfam07500 is involved in transcription elongation and
transcript fidelity. TFIIS/TCEA promotes 3'
endoribonuclease activity of RNA polymerase II (pol II)
and allows pol II to bypass transcript pause or 'arrest'
during elongation process. It is thus possible that BEX
is also acting in this way.
Length = 97
Score = 25.4 bits (56), Expect = 8.7
Identities = 14/37 (37%), Positives = 20/37 (54%)
Query: 125 IEFLRKEKEVKKQKKEKREMKKLFRETGKPKKPGNSF 161
+E KE E K + + K E +K E G+ KKP +F
Sbjct: 1 MEKPCKENEGKPESEPKEEEEKRPLEEGEGKKPEGNF 37
>gnl|CDD|217318 pfam03011, PFEMP, PFEMP DBL domain. PfEMP1 (Plasmodium falciparum
erythrocyte membrane protein) has been identified as the
rosetting ligand of the malaria parasite P. falciparum.
Rosetting is the adhesion of infected erythrocytes with
uninfected erythrocytes in the vasculature of the
infected organ, and is associated with severe malaria.
PfEMP1 interacts with Complement Receptor One on
uninfected erythrocytes to form rosettes.
Length = 150
Score = 26.1 bits (58), Expect = 8.8
Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 8/87 (9%)
Query: 85 EEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREM 144
+EW+KI K+ +Y + NV + E I+ K+ KE +++
Sbjct: 52 KEWEKI----KEHFNKQYKIDDGRDEYNVLESFLEDLLFQIDI----KKAYGDAKELKKI 103
Query: 145 KKLFRETGKPKKPGNSFYLFIKDNIDK 171
K+L + G ++ KD ID
Sbjct: 104 KELLKCNGCNNSANSAKNGENKDAIDC 130
>gnl|CDD|214371 CHL00122, secA, preprotein translocase subunit SecA; Validated.
Length = 870
Score = 26.5 bits (59), Expect = 9.1
Identities = 13/48 (27%), Positives = 21/48 (43%)
Query: 100 NEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKL 147
E ++ K +L+ + EK K Q+KEK+ +KKL
Sbjct: 544 ENLKILNEASEISIPKNSYQLSLRFLYNELLEKYKKLQEKEKKIVKKL 591
>gnl|CDD|234941 PRK01315, PRK01315, putative inner membrane protein translocase
component YidC; Provisional.
Length = 329
Score = 26.2 bits (58), Expect = 9.6
Identities = 13/36 (36%), Positives = 23/36 (63%), Gaps = 6/36 (16%)
Query: 119 ELTPEAIEFLRKEKEVKKQKKEK--REMKKLFRETG 152
E+ P+ ++K +E K +E+ +EM KL++ETG
Sbjct: 73 EIQPK----MKKIQEKYKGDRERMSQEMMKLYKETG 104
>gnl|CDD|179310 PRK01622, PRK01622, OxaA-like protein precursor; Validated.
Length = 256
Score = 26.3 bits (58), Expect = 9.6
Identities = 8/25 (32%), Positives = 18/25 (72%)
Query: 128 LRKEKEVKKQKKEKREMKKLFRETG 152
L+ K+++KQK+ ++EM +L++
Sbjct: 107 LKVTKDLEKQKEYQKEMMELYKSGN 131
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.132 0.370
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,246,462
Number of extensions: 871060
Number of successful extensions: 2146
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1960
Number of HSP's successfully gapped: 357
Length of query: 180
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 89
Effective length of database: 6,901,388
Effective search space: 614223532
Effective search space used: 614223532
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (25.5 bits)