RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10054
(910 letters)
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 104 bits (261), Expect = 4e-25
Identities = 67/285 (23%), Positives = 112/285 (39%), Gaps = 73/285 (25%)
Query: 217 ILDEVKYRADYIRYQEAQRKQEEARVEKDRVAYAQIDWHDFVVVETVDYMPGEPGNFPPP 276
IL+ RA++ ++Q+ + +++E E++R+A+A IDW DFVVVET+++ + PPP
Sbjct: 19 ILERAYKRAEWEKHQKKEEQKKEEEEEEERIAFASIDWQDFVVVETIEFNDIDVLELPPP 78
Query: 277 TTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRG 336
T +E+ R L Q+ E + E+++M+
Sbjct: 79 LTLNELIYRSLEQKSKA----------LEEPDAAAAPEEEEEME---------------- 112
Query: 337 RNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEE 396
+E+I+E+ + EE+ K
Sbjct: 113 --------EEEEDIDEE--------------KKEESAKKSTPPIK--------------- 135
Query: 397 QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQE 456
E P LK + PS P K++ E IP + E
Sbjct: 136 -EMKIPAAGESRLKIRK-----DYVPSGNKPRAEKPKKKMIKCP---ITGELIPEDEFDE 186
Query: 457 HMRIGLLDPRWVEQRDKHLNEKINQETVYAPGQAIEASLKQLAER 501
HMRI LLDPR+ EQ+D++ K Q + +LK+LA++
Sbjct: 187 HMRILLLDPRYKEQKDQYEA-KNQQYASNLTTDDVYRNLKRLAKK 230
Score = 75.2 bits (185), Expect = 5e-15
Identities = 48/171 (28%), Positives = 77/171 (45%), Gaps = 31/171 (18%)
Query: 644 GNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSE 703
PPP T +E+ R L E+ + EE + PE+++ + EED E
Sbjct: 73 LELPPPLTLNELIYRSL--------EQKSKALEEPDAAAA---PEEEEEMEEEEEDIDEE 121
Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD-KVVVKKGYDPKQARPPVVK 762
+++E S P I+ +P A + ++ ++K Y P +P K
Sbjct: 122 KKEE-----------------SAKKSTPPIKEMKIPAAGESRLKIRKDYVPSGNKPRAEK 164
Query: 763 PSPTDEFLISPITGEKIPASKVQEHMRIGLLDPRWVEQRDKHLNEKINQET 813
P + + PITGE IP + EHMRI LLDPR+ EQ+D++ + +
Sbjct: 165 PK--KKMIKCPITGELIPEDEFDEHMRILLLDPRYKEQKDQYEAKNQQYAS 213
>gnl|CDD|197818 smart00648, SWAP, Suppressor-of-White-APricot splicing regulator.
domain present in regulators which are responsible for
pre-mRNA splicing processes.
Length = 54
Score = 90.0 bits (224), Expect = 4e-22
Identities = 30/54 (55%), Positives = 43/54 (79%)
Query: 31 RNIVDKTANFVARNGPEFESRIRQNEIGNPKFNFLSPGDPYHAYYQHRVKDIRE 84
+I+DKTA FVARNGPEFE+++ + E NP+F+FL P DPYHAYY+ ++ + R+
Sbjct: 1 LDIIDKTAQFVARNGPEFEAKLMERERNNPQFDFLKPNDPYHAYYRKKLAEYRQ 54
Score = 84.6 bits (210), Expect = 3e-20
Identities = 27/54 (50%), Positives = 35/54 (64%)
Query: 142 LDIVKLTAQFVARNGRGFLTSLMQKEQRNYQFDFLRPQHSLFQYFTKLLEQYTK 195
LDI+ TAQFVARNG F LM++E+ N QFDFL+P Y+ K L +Y +
Sbjct: 1 LDIIDKTAQFVARNGPEFEAKLMERERNNPQFDFLKPNDPYHAYYRKKLAEYRQ 54
>gnl|CDD|216714 pfam01805, Surp, Surp module. This domain is also known as the
SWAP domain. SWAP stands for
Suppressor-of-White-APricot. It has been suggested that
these domains may be RNA binding.
Length = 54
Score = 82.3 bits (204), Expect = 2e-19
Identities = 28/54 (51%), Positives = 38/54 (70%), Gaps = 1/54 (1%)
Query: 31 RNIVDKTANFVARNGPEFESRIRQNEIGNPKFNFL-SPGDPYHAYYQHRVKDIR 83
+I+DKTA FVARNGPEFE+ + + E NP+FNFL DPYH YY+ ++ +
Sbjct: 1 LDIIDKTAQFVARNGPEFEALLMEREQDNPQFNFLKPTNDPYHVYYRWKLAEYS 54
Score = 77.3 bits (191), Expect = 1e-17
Identities = 26/53 (49%), Positives = 33/53 (62%), Gaps = 1/53 (1%)
Query: 142 LDIVKLTAQFVARNGRGFLTSLMQKEQRNYQFDFLRPQ-HSLFQYFTKLLEQY 193
LDI+ TAQFVARNG F LM++EQ N QF+FL+P Y+ L +Y
Sbjct: 1 LDIIDKTAQFVARNGPEFEALLMEREQDNPQFNFLKPTNDPYHVYYRWKLAEY 53
>gnl|CDD|176395 cd01800, SF3a120_C, Ubiquitin-like domain of Mammalian splicing
factor SF3a_120. SF3a120_C Mammalian splicing factor
SF3a consists of three subunits of 60, 66, and 120 kDa
and functions early during pre-mRNA splicing by
converting the U2 snRNP to its active form. The 120kDa
subunit (SF3a120) has a carboxy-terminal ubiquitin-like
domain and two SWAP (suppressor-of-white-apricot)
domains, referred to collectively as the SURP module, at
its amino-terminus.
Length = 76
Score = 64.4 bits (157), Expect = 6e-13
Identities = 37/81 (45%), Positives = 45/81 (55%), Gaps = 18/81 (22%)
Query: 806 NEKINQETVYAPEFLLDPRWVEQRDKHLNEKINQET------------GLFFKDSNSLAY 853
K+N + + L DP V L KI++ET G+F KDSNSLAY
Sbjct: 2 EWKLNGQMLNFTLQLSDPVSV------LKVKIHEETGMPAGKQKLQYEGIFIKDSNSLAY 55
Query: 854 YNITPGTKVHLQLKERGGRKK 874
YN+ GT +HLQLKERGGRKK
Sbjct: 56 YNLANGTIIHLQLKERGGRKK 76
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 42.2 bits (99), Expect = 3e-04
Identities = 35/110 (31%), Positives = 46/110 (41%), Gaps = 21/110 (19%)
Query: 635 MPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQI 694
MPP K PP T DE +E D+ E++EEDEE E+ +D +
Sbjct: 1 MPPKGNAKKLKVRPPPTKDE-------EEDWDSQAEEVEEDEEEMED------WEDSLD- 46
Query: 695 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 744
EED +EE +E+ PS SQ +I PP PPA
Sbjct: 47 --EEDEEAEEVEEETAASSKAPSSS-----SKSSSQETISIPPTPPARRP 89
Score = 33.0 bits (75), Expect = 0.37
Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 5/77 (6%)
Query: 359 TTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPL 418
EE+ D++ EE + +E ++ + EED +EE +E+ PS
Sbjct: 18 KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSS-----SKS 72
Query: 419 PSQPSIQPPPLPPAPDK 435
SQ +I PP PPA
Sbjct: 73 SSQETISIPPTPPARRP 89
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 41.2 bits (97), Expect = 0.002
Identities = 27/139 (19%), Positives = 41/139 (29%), Gaps = 3/139 (2%)
Query: 669 EEDLEEDEEMEEEYGQRRPEDDDM-QIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPL 727
DEE E+ D+ D + S+S + P P +P+ P
Sbjct: 149 NPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPS 208
Query: 728 PSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPASKVQEH 787
S P + +P P P P + + T + P S
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATP 268
Query: 788 MRIG--LLDPRWVEQRDKH 804
IG LD + + KH
Sbjct: 269 APIGGITLDDDAIAKAQKH 287
Score = 34.3 bits (79), Expect = 0.24
Identities = 19/129 (14%), Positives = 30/129 (23%), Gaps = 2/129 (1%)
Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPP 407
E+ + + E + + + + PP P
Sbjct: 159 EDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSS 218
Query: 408 SLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQEHMRIG--LLDP 465
P P PP P V + + P S IG LD
Sbjct: 219 FQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDD 278
Query: 466 RWVEQRDKH 474
+ + KH
Sbjct: 279 DAIAKAQKH 287
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 40.0 bits (93), Expect = 0.004
Identities = 22/100 (22%), Positives = 36/100 (36%), Gaps = 13/100 (13%)
Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPP 407
+ D+ + E+++ +E+ +S + + + S P PPPPP
Sbjct: 121 PPPDVDEDALKEANVNETESSTDESDRSSHSHEVRSKSNFPMGPPSPWNPRFPPGPPPPP 180
Query: 408 --------SLKKVEPA-----PPLPSQPSIQPPPLPPAPD 434
P PP P P + PPP P +PD
Sbjct: 181 PGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPMSPD 220
Score = 29.2 bits (65), Expect = 9.6
Identities = 19/76 (25%), Positives = 28/76 (36%)
Query: 358 GTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPP 417
G +EEE + + +D + ++ E SS +E + S PP
Sbjct: 106 GYGNEEEQNLADLLTPPPDVDEDALKEANVNETESSTDESDRSSHSHEVRSKSNFPMGPP 165
Query: 418 LPSQPSIQPPPLPPAP 433
P P P P PP P
Sbjct: 166 SPWNPRFPPGPPPPPP 181
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 40.8 bits (95), Expect = 0.005
Identities = 35/148 (23%), Positives = 62/148 (41%), Gaps = 36/148 (24%)
Query: 289 QERMDNGEEDLEEDEEMEEEYGQ-RRPEDD--DMQIGSDEENDYYNARMRGRNANRGRGR 345
+E + EE E+ + +EE Q PED D D ++ + +A N ++G
Sbjct: 4068 EENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVG 4127
Query: 346 MQEEINEDD--------------MQEGTTSEEESDNEN--------------EEARKSKE 377
EE+ E+D +QE T++ +E+ +E ++A + E
Sbjct: 4128 ENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHLREWQQANRIHE 4187
Query: 378 AKDNTQVQDMEEDSSS-----EEEQEDI 400
+D T+ Q D S E+E+ED+
Sbjct: 4188 WEDLTESQSQAFDDSEFMHVKEDEEEDL 4215
Score = 40.4 bits (94), Expect = 0.005
Identities = 23/123 (18%), Positives = 50/123 (40%), Gaps = 19/123 (15%)
Query: 292 MDNGEEDLEEDE-EMEEEYGQRRPEDDDMQ--IGSDEENDYYNARMRGRNANRGRGRMQE 348
+++ + +EDE EM ++ G DD++Q I + + +
Sbjct: 3945 LEDKDRQEKEDEEEMSDDVGI----DDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000
Query: 349 EINED------DMQEGTTSEEESDNEN------EEARKSKEAKDNTQVQDMEEDSSSEEE 396
++++D DM+ ++EE+D E E+ + D QD D + ++E
Sbjct: 4001 DVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDE 4060
Query: 397 QED 399
+ +
Sbjct: 4061 KMN 4063
Score = 39.2 bits (91), Expect = 0.014
Identities = 29/134 (21%), Positives = 56/134 (41%), Gaps = 5/134 (3%)
Query: 288 MQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRM- 346
+ E D E + ++ E + + E+D +G+ E+ D + R +AN+ M
Sbjct: 4185 IHEWEDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEK-DQIKSIDRDESANQNPDSMN 4243
Query: 347 QEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQ--VQDMEEDSSSEEEQEDIPPPP 404
I ED+ E + + + + +++ E T+ + E E EDI
Sbjct: 4244 STNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFELSEDEDIEDEL 4303
Query: 405 PPPSLKKVEPAPPL 418
P ++ K+ PA P+
Sbjct: 4304 PDYNV-KITPAMPI 4316
Score = 38.8 bits (90), Expect = 0.017
Identities = 25/128 (19%), Positives = 44/128 (34%), Gaps = 33/128 (25%)
Query: 294 NGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINED 353
E+ D E E + D D ++ E+ +N ++ +E NE+
Sbjct: 3872 VVSENENSDSEEENQ-------DLDEEVNDIPEDL----------SNSLNEKLWDEPNEE 3914
Query: 354 DMQEGTTSEEESDNENEEARKSKEAKDNTQVQDME----------------EDSSSEEEQ 397
D+ E E N E+ + DN ++D + +D + Q
Sbjct: 3915 DLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQ 3974
Query: 398 EDIPPPPP 405
E+ PPP
Sbjct: 3975 ENNSQPPP 3982
Score = 33.8 bits (77), Expect = 0.56
Identities = 27/133 (20%), Positives = 51/133 (38%), Gaps = 30/133 (22%)
Query: 294 NGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINED 353
N E +EE++E ++EYG++ D D + Y+ + +EE E
Sbjct: 2924 NVSEQIEEEDEKDDEYGEQEKVSDKNDYLKDFID--YDDDGSISTESE----PEEEKREC 2977
Query: 354 DMQ-EGTTSEEESDNENEEARKSKEAKDNTQ-----------------------VQDMEE 389
+ +G EE ++++E+ K E + N + E
Sbjct: 2978 ESGFDGWNDYEEEEDDDEKNEKECELEQNAEILIEDEKIGSEGSLESGGEELVCSNLSEG 3037
Query: 390 DSSSEEEQEDIPP 402
+ SSE E++D+
Sbjct: 3038 NESSEIEEDDVES 3050
Score = 32.3 bits (73), Expect = 1.8
Identities = 24/104 (23%), Positives = 41/104 (39%), Gaps = 18/104 (17%)
Query: 296 EEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDM 355
E+ LEE+ ++E+ Q DD + D+E N + NE+
Sbjct: 4034 EDPLEENNTLDEDIQQ----DDFSDLAEDDEKM-----------NEDGFEENVQENEEST 4078
Query: 356 QEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
++G S+EE + +A DN D + +S E E+
Sbjct: 4079 EDGVKSDEELEQGEVP---EDQAIDNHPKMDAKSTFASAEADEE 4119
Score = 30.4 bits (68), Expect = 6.0
Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 7/88 (7%)
Query: 320 QIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDN---ENEEARKSK 376
+ ++ N G + G + +NEDD E +EE++ N +E +
Sbjct: 3801 DLPQEKSNSGELESGTGLGSGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESEAREL 3860
Query: 377 EAKDNTQVQDM----EEDSSSEEEQEDI 400
E+ N +D E+S SEEE +D+
Sbjct: 3861 ESDMNGVTKDSVVSENENSDSEEENQDL 3888
Score = 30.4 bits (68), Expect = 6.2
Identities = 16/50 (32%), Positives = 30/50 (60%), Gaps = 8/50 (16%)
Query: 665 MDNGEEDLEEDE-EMEEEYGQRRPEDDDMQIDMEEDSSS---EEEQEDIP 710
+++ + +EDE EM ++ G DD++Q D++E++S E E D+P
Sbjct: 3945 LEDKDRQEKEDEEEMSDDVGI----DDEIQPDIQENNSQPPPENEDLDLP 3990
Score = 30.0 bits (67), Expect = 7.3
Identities = 32/172 (18%), Positives = 58/172 (33%), Gaps = 14/172 (8%)
Query: 555 DEETAIGKKIGEEDTRKDDKATWDGHTSSVEAATRAARANITLEDQIHQIHKVKGLLPDE 614
+ T +G +G ED D + E T A ++++ + + G+ D
Sbjct: 3813 ESGTGLGSGVGAEDITNTLNEDDDLEELANEEDT-ANQSDLDESEARELESDMNGVTKDS 3871
Query: 615 EKEKIGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEE 674
+ S N L + + + N DE L++ + E+
Sbjct: 3872 VVSENENSDSEEENQDLDEEVNDIPEDLS-NSLNEKLWDEPNEEDLLETEQKSNEQSAAN 3930
Query: 675 DEE--MEEEYGQRRPEDDDMQIDMEE----------DSSSEEEQEDIPPPPP 714
+E + +E + ED D Q +E D + QE+ PPP
Sbjct: 3931 NESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPP 3982
Score = 30.0 bits (67), Expect = 8.7
Identities = 13/54 (24%), Positives = 25/54 (46%), Gaps = 1/54 (1%)
Query: 346 MQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
++E NE+D + +ES+ E+ + KD+ ++ DS E + D
Sbjct: 3837 LEELANEEDTAN-QSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLD 3889
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 39.8 bits (93), Expect = 0.008
Identities = 22/94 (23%), Positives = 31/94 (32%), Gaps = 5/94 (5%)
Query: 341 RGRGRMQEEINEDD--MQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
GR++ E+ E + EG E A ++ EA + E +S E
Sbjct: 23 SRIGRIERELRELRSLVAEGAAGPVAKAAEQMAAPEAAEAAP---LPAAAESIASPEVPP 79
Query: 399 DIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPA 432
+PP P P PS P P P
Sbjct: 80 PVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPV 113
Score = 36.3 bits (84), Expect = 0.075
Identities = 12/47 (25%), Positives = 14/47 (29%)
Query: 695 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPA 741
E +S E +PP P P PS P P P
Sbjct: 67 AAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPV 113
>gnl|CDD|130009 TIGR00934, 2a38euk, potassium uptake protein, Trk family. The
proteins of the Trk family are derived from
Gram-negative and Gram-positive bacteria, yeast and
wheat. The proteins of E. coli K12 TrkH and TrkG as well
as several yeast proteins have been functionally
characterized.The E. coli TrkH and TrkG proteins are
complexed to two peripheral membrane proteins, TrkA, an
NAD-binding protein, and TrkE, an ATP-binding protein.
This complex forms the potassium uptake system. This
family is specific for the eukaryotic Trk system
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 800
Score = 39.6 bits (92), Expect = 0.008
Identities = 22/159 (13%), Positives = 51/159 (32%), Gaps = 11/159 (6%)
Query: 285 RVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRG 344
+L +E D E + + +RR D + G + D + ++
Sbjct: 152 TLLQRELEDRPETGVAGRVTVPHGSAKRRDFQDKLFSGEFVKRDEPDQNSPDVKSDT--- 208
Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKE--------AKDNTQVQDMEEDSSSEEE 396
R E I++ + ++ D + E+ +S ++ + ++ SSE
Sbjct: 209 RADESISDLEFEKFAKRRGSRDVDPEDLYRSIMMLQGIHERIREKSSANSRSDERSSESI 268
Query: 397 QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 435
QE + P +++ + + K
Sbjct: 269 QEQVERRPSTSDIERNSQSLTRRYDDKSFDKAVRLRRSK 307
Score = 35.7 bits (82), Expect = 0.13
Identities = 16/135 (11%), Positives = 32/135 (23%), Gaps = 10/135 (7%)
Query: 612 PDEEKEKIGPKRSHAPNPPLPRSMPPMNKPKP--GNFPPPTTPDEVGARVLMQERMDNGE 669
++K G + + + G+R + E +
Sbjct: 181 DFQDKLFSGEFVKRDEPDQNSPDVKSDTRADESISDLEFEKFAKRRGSRDVDPEDLYRSI 240
Query: 670 EDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPS 729
L+ E E D+ SSE QE + P +++ +
Sbjct: 241 MMLQGIHERIREKSSANSRSDE--------RSSESIQEQVERRPSTSDIERNSQSLTRRY 292
Query: 730 QPSIQPPPLPPAPDK 744
+ K
Sbjct: 293 DDKSFDKAVRLRRSK 307
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 38.7 bits (90), Expect = 0.009
Identities = 19/54 (35%), Positives = 22/54 (40%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKP 763
PPPP + APP P P+ P P P P+ DP PP V P
Sbjct: 45 PPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDP 98
Score = 34.5 bits (79), Expect = 0.23
Identities = 17/36 (47%), Positives = 17/36 (47%)
Query: 399 DIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
D PPPPPPS PAP P P P P PD
Sbjct: 40 DPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPD 75
Score = 32.6 bits (74), Expect = 0.78
Identities = 16/58 (27%), Positives = 17/58 (29%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTD 767
P P PP P PP PPA DP PP +P D
Sbjct: 55 PAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPGRID 112
Score = 30.6 bits (69), Expect = 3.1
Identities = 10/33 (30%), Positives = 10/33 (30%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
P P PA P QP PP P
Sbjct: 53 AAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85
Score = 30.2 bits (68), Expect = 4.3
Identities = 14/51 (27%), Positives = 14/51 (27%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPA 451
P P PP P PP PPA DP PA
Sbjct: 55 PAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPA 105
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 36.6 bits (85), Expect = 0.016
Identities = 24/95 (25%), Positives = 35/95 (36%), Gaps = 8/95 (8%)
Query: 383 QVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGY 442
V + PPPPPP+ + PP + P P AP + V
Sbjct: 52 DVTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENTVTVAK 111
Query: 443 DPKQEK--------IPASKVQEHMRIGLLDPRWVE 469
DP+ K +PA V+ M+ LDP ++
Sbjct: 112 DPRYAKYFKMLKLGVPAQAVKNKMQSEGLDPNLLD 146
Score = 32.0 bits (73), Expect = 0.67
Identities = 22/72 (30%), Positives = 31/72 (43%), Gaps = 3/72 (4%)
Query: 687 PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQ-PSIQPPPLPPAPDKV 745
P +D+ + + PPPPPP + P PP P +P P PA + V
Sbjct: 48 PGLEDVTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENTV 107
Query: 746 VVKKGYDPKQAR 757
V K DP+ A+
Sbjct: 108 TVAK--DPRYAK 117
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 38.4 bits (89), Expect = 0.021
Identities = 28/134 (20%), Positives = 54/134 (40%), Gaps = 1/134 (0%)
Query: 278 TPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGR 337
T E +E D GE ED EM+ + G + + + + E++++
Sbjct: 764 TEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQA 823
Query: 338 NANRGRGRMQE-EINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEE 396
+ + E E+N ++ E E+ D E ++ + ++ EE+ EEE
Sbjct: 824 DDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEE 883
Query: 397 QEDIPPPPPPPSLK 410
+E+ P SL+
Sbjct: 884 EEEEEENEEPLSLE 897
Score = 33.0 bits (75), Expect = 0.88
Identities = 20/109 (18%), Positives = 40/109 (36%), Gaps = 5/109 (4%)
Query: 295 GEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDD 354
+ + E + E+E + + E + ++ + E + G G E+ E +
Sbjct: 689 RKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGE 748
Query: 355 MQEGTTSEEESDN-----ENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
+ E E D E E + KE +D ++Q E+ +E
Sbjct: 749 AEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGA 797
Score = 31.9 bits (72), Expect = 2.0
Identities = 27/137 (19%), Positives = 50/137 (36%), Gaps = 26/137 (18%)
Query: 289 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYY-------NARMRGRNANR 341
+E D GE + E E+E E ++ E + +E++ + M+G
Sbjct: 739 EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798
Query: 342 GRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQ---------------- 385
G+ + E + E E ++ E ++ E + N + Q
Sbjct: 799 GKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS 858
Query: 386 ---DMEEDSSSEEEQED 399
D EE+ EEE+E+
Sbjct: 859 DGGDSEEEEEEEEEEEE 875
Score = 30.0 bits (67), Expect = 6.7
Identities = 21/116 (18%), Positives = 35/116 (30%), Gaps = 17/116 (14%)
Query: 293 DNGEEDLEEDEEMEE----------EYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRG 342
D GE + E+ E E + + + + + E + G
Sbjct: 728 DEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGE 787
Query: 343 RGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
G M+ + EG + E + E E K + + D E EQE
Sbjct: 788 DGEMKGD-------EGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQE 836
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 37.6 bits (87), Expect = 0.029
Identities = 21/79 (26%), Positives = 32/79 (40%), Gaps = 9/79 (11%)
Query: 702 SEEEQEDIPPPPPPPSLKKVEPA-PPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPV 760
E EQ PP P + PA P+ + P P +K K P++ R P
Sbjct: 12 GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDK---PRRERKP- 67
Query: 761 VKPSPT---DEFLISPITG 776
KP+ ++F++ P G
Sbjct: 68 -KPASLWKLEDFVVEPQEG 85
Score = 32.6 bits (74), Expect = 1.0
Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 4/59 (6%)
Query: 393 SEEEQEDIPPPPPPPSLKKVEPA-PPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIP 450
E EQ PP P + PA P+ + P P +K K P++E+ P
Sbjct: 12 GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDK---PRRERKP 67
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 37.3 bits (87), Expect = 0.031
Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 5/83 (6%)
Query: 380 DNTQVQDMEEDSSSE---EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 436
+ V+ S S+ D P PP S P P+ S PP P +
Sbjct: 294 EANGVEPEPTGSVSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLAL 353
Query: 437 --VVKKGYDPKQEKIPASKVQEH 457
K+ D KQ+K + K++
Sbjct: 354 SPKRKREGDKKQKKKKSKKLKLT 376
Score = 37.3 bits (87), Expect = 0.035
Identities = 16/67 (23%), Positives = 20/67 (29%), Gaps = 2/67 (2%)
Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV--VVKKGYDPKQAR 757
D P PP S P P+ S PP P + K+ D KQ +
Sbjct: 308 DRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKK 367
Query: 758 PPVVKPS 764
K
Sbjct: 368 KKSKKLK 374
>gnl|CDD|215813 pfam00240, ubiquitin, Ubiquitin family. This family contains a
number of ubiquitin-like proteins: SUMO (smt3
homologue), Nedd8, Elongin B, Rub1, and Parkin. A number
of them are thought to carry a distinctive five-residue
motif termed the proteasome-interacting motif (PIM),
which may have a biologically significant role in
protein delivery to proteasomes and recruitment of
proteasomes to transcription sites.
Length = 69
Score = 33.7 bits (78), Expect = 0.036
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 12/51 (23%)
Query: 831 KHLNEKINQETG-------LFFK-----DSNSLAYYNITPGTKVHLQLKER 869
L EKI + G L F D +L+ Y I G+ +HL L+ R
Sbjct: 19 SELKEKIEDKEGIPVDQQRLIFSGKVLEDDTTLSEYGIQDGSTLHLVLRLR 69
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 37.6 bits (87), Expect = 0.037
Identities = 16/33 (48%), Positives = 17/33 (51%), Gaps = 1/33 (3%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
P PPPP + P PP P QP PPP P P
Sbjct: 2911 PQAPPPPQPQPQPPPPPQP-QPPPPPPPRPQPP 2942
Score = 37.6 bits (87), Expect = 0.038
Identities = 42/158 (26%), Positives = 57/158 (36%), Gaps = 17/158 (10%)
Query: 621 PKRSHAPNPPLPRSMPPMNKPKPGN---FPPPTTPDEVGA--RVLMQERMDNGEEDLEED 675
P +HAP+PP P P N+P P PPP P + A RV R +
Sbjct: 2618 PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQAS 2677
Query: 676 EEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLP-----SQ 730
+ RR + S ++ P P P +L P PP P +
Sbjct: 2678 SPPQ---RPRRRAARPTVGSLT--SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
Query: 731 PSIQPPPLPPAPDKVVVKKGYDPKQARPPVVK--PSPT 766
P++ P PPA G + ARPP P+P
Sbjct: 2733 PALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
Score = 36.8 bits (85), Expect = 0.072
Identities = 17/61 (27%), Positives = 21/61 (34%), Gaps = 1/61 (1%)
Query: 706 QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPA-PDKVVVKKGYDPKQARPPVVKPS 764
+ PP P + +P P P P QPPP PP P + P P
Sbjct: 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ 2961
Query: 765 P 765
P
Sbjct: 2962 P 2962
Score = 36.1 bits (83), Expect = 0.11
Identities = 13/36 (36%), Positives = 16/36 (44%)
Query: 397 QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPA 432
+ PP P + +P P P P QPPP PP
Sbjct: 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
Score = 36.1 bits (83), Expect = 0.12
Identities = 31/163 (19%), Positives = 45/163 (27%), Gaps = 23/163 (14%)
Query: 621 PKRSHAPNPPLPRSMPPMNKPKPGN-----------FPPPTTPDEVGARVLMQERMDNGE 669
P P P P S+P PG P P R L + +
Sbjct: 2836 PTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRST 2895
Query: 670 EDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPS 729
E + E +P+ + + Q PPPPPP + AP
Sbjct: 2896 ESFALPPDQPER--PPQPQAPPPPQPQPQPPPPPQPQ----PPPPPPPRPQPPLAPTTDP 2949
Query: 730 QPSIQPPPLPPAPDKVVVKKGYDP------KQARPPVVKPSPT 766
+ +P P P + G Q P P+ +
Sbjct: 2950 AGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASS 2992
Score = 34.9 bits (80), Expect = 0.27
Identities = 29/158 (18%), Positives = 37/158 (23%), Gaps = 3/158 (1%)
Query: 612 PDEEKEKIGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEED 671
PD + P+ PP P + P P P A
Sbjct: 2589 PDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPP 2648
Query: 672 LEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 731
E + RP + SS + P SL + PP P P
Sbjct: 2649 PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
Query: 732 SIQPPPLPPA---PDKVVVKKGYDPKQARPPVVKPSPT 766
P L A P + P P P
Sbjct: 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746
Score = 33.4 bits (76), Expect = 0.75
Identities = 11/55 (20%), Positives = 13/55 (23%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQ 455
S P P QP PPP P + P P +
Sbjct: 2890 AVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA 2944
Score = 33.0 bits (75), Expect = 1.0
Identities = 18/67 (26%), Positives = 20/67 (29%), Gaps = 10/67 (14%)
Query: 701 SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPV 760
S E +PP P + P PP P PP P P P RP
Sbjct: 2892 SRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPP----------PPPPRPQP 2941
Query: 761 VKPSPTD 767
TD
Sbjct: 2942 PLAPTTD 2948
Score = 32.2 bits (73), Expect = 1.6
Identities = 17/53 (32%), Positives = 20/53 (37%), Gaps = 2/53 (3%)
Query: 382 TQVQDMEEDSSSEEEQEDIPPPP--PPPSLKKVEPAPPLPSQPSIQPPPLPPA 432
T+ + D Q PPPP P +P PP P P QPP P
Sbjct: 2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
Score = 31.1 bits (70), Expect = 3.6
Identities = 18/65 (27%), Positives = 23/65 (35%), Gaps = 1/65 (1%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLP-PAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
P PP L + A S+ S+ P P P V+ P A P P PT
Sbjct: 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
Query: 769 FLISP 773
+P
Sbjct: 2835 QPTAP 2839
Score = 31.1 bits (70), Expect = 4.2
Identities = 28/126 (22%), Positives = 39/126 (30%), Gaps = 14/126 (11%)
Query: 629 PPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEE----------- 677
PP P + P ++ P P + V R+L R G E+L D+
Sbjct: 2503 PPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIR---GLEELASDDAGDPPPPLPPAA 2559
Query: 678 MEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPP 737
+ P E +S + D PP P + P P PP
Sbjct: 2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPP 2619
Query: 738 LPPAPD 743
APD
Sbjct: 2620 DTHAPD 2625
Score = 30.3 bits (68), Expect = 6.5
Identities = 21/72 (29%), Positives = 25/72 (34%), Gaps = 4/72 (5%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEF 769
P P P+ P P + P P PPA G + RP V S + E
Sbjct: 2742 PAVPAGPATPGGPARPARPPTTAGPPAPAPPA----APAAGPPRRLTRPAVASLSESRES 2797
Query: 770 LISPITGEKIPA 781
L SP PA
Sbjct: 2798 LPSPWDPADPPA 2809
Score = 30.3 bits (68), Expect = 7.3
Identities = 13/33 (39%), Positives = 15/33 (45%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
P P P +L P PP P+ P LP AP
Sbjct: 2707 TPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
Score = 30.3 bits (68), Expect = 7.4
Identities = 19/53 (35%), Positives = 23/53 (43%), Gaps = 5/53 (9%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP--DKVVVKKGYDPKQARPPV 760
PPPP PP+ PA P S P +P P P P + P+ ARP
Sbjct: 2551 PPPPLPPAAP---PAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRA 2600
Score = 29.9 bits (67), Expect = 7.9
Identities = 11/40 (27%), Positives = 14/40 (35%)
Query: 394 EEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
PPPPPP + AP + +P P P
Sbjct: 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
Score = 29.9 bits (67), Expect = 8.6
Identities = 19/83 (22%), Positives = 26/83 (31%), Gaps = 7/83 (8%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKG-------YDPKQARPPVVK 762
P P P PAP P+ P+ PP P + + +DP V+
Sbjct: 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA 2813
Query: 763 PSPTDEFLISPITGEKIPASKVQ 785
P+ SP P S
Sbjct: 2814 PAAALPPAASPAGPLPPPTSAQP 2836
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 36.8 bits (85), Expect = 0.050
Identities = 28/164 (17%), Positives = 49/164 (29%), Gaps = 11/164 (6%)
Query: 272 NFPPPTT-PDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYY 330
N PT+ P R DL +D+ E D + S
Sbjct: 265 NSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHS-------DPEPLPASLPPPPVR 317
Query: 331 NARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEED 390
R++ A + + + TS + ++K++ +
Sbjct: 318 RPRVKHPEAGKEEPDGARNAEAKEPAQPATSTSSK---GSSSAQNKDSGSTGPGSSLAAA 374
Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
SS E+ + PP + + P+P + S P PL D
Sbjct: 375 SSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLSD 418
Score = 33.7 bits (77), Expect = 0.43
Identities = 24/134 (17%), Positives = 38/134 (28%), Gaps = 21/134 (15%)
Query: 620 GPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEME 679
H+ PLP S+PP +P P +E +G + E E +
Sbjct: 296 DDGSEHSDPEPLPASLPPPPVRRPRVKHPEAGKEE-----------PDGARNAEAKEPAQ 344
Query: 680 EEYGQRRPEDDDMQIDMEEDSSS----------EEEQEDIPPPPPPPSLKKVEPAPPLPS 729
Q + E+ + PP + + P+P + S
Sbjct: 345 PATSTSSKGSSSAQNKDSGSTGPGSSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTS 404
Query: 730 QPSIQPPPLPPAPD 743
P PL D
Sbjct: 405 APEPPSIPLTYLSD 418
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 36.9 bits (86), Expect = 0.057
Identities = 26/131 (19%), Positives = 55/131 (41%), Gaps = 12/131 (9%)
Query: 289 QERMDNGEEDLEED--EEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRM 346
ER+ + LE + M E E+D + D ++++ + N G G+
Sbjct: 268 AERL----KKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDD----DDNFGLGQG 319
Query: 347 QEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPP 406
+E+ E++ EE+ D++ EE + + + + +D E++ S +E+ E+
Sbjct: 320 EEDEEEEEDGVDDEDEEDDDDDLEEEEE--DVDLSDEEEDEEDEDSDDEDDEEEEEEEKE 377
Query: 407 PSLKKVEPAPP 417
KK +
Sbjct: 378 KKKKKSAESTR 388
Score = 35.0 bits (81), Expect = 0.20
Identities = 32/138 (23%), Positives = 55/138 (39%), Gaps = 17/138 (12%)
Query: 285 RVLMQERMD--NGEEDLEEDEEMEEEYGQ----RRPEDDDMQIGSDEENDYYNARMRGRN 338
+ L ER+ GEE+ +E+EE +E DDD G + + G +
Sbjct: 272 KKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVD 331
Query: 339 ANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
EE ++DD++E + SD E +E + + +D D EE+ +E+++
Sbjct: 332 DE------DEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDED-----DEEEEEEEKEKKK 380
Query: 399 DIPPPPPPPSLKKVEPAP 416
L P P
Sbjct: 381 KKSAESTRSELPFTFPCP 398
Score = 35.0 bits (81), Expect = 0.22
Identities = 22/119 (18%), Positives = 46/119 (38%), Gaps = 31/119 (26%)
Query: 290 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEE 349
+ G+ + +E+EE + + +DDD +E+ D
Sbjct: 312 DNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVD-------------------LS 352
Query: 350 INEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPS 408
E+D ++ + +E+ + E EE ++ K+ K S+E + ++P P P
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKK------------SAESTRSELPFTFPCPK 399
Score = 31.1 bits (71), Expect = 3.7
Identities = 16/43 (37%), Positives = 25/43 (58%)
Query: 666 DNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQED 708
D EED ++D E EEE E++D + + +D EEE+E+
Sbjct: 332 DEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEE 374
Score = 30.7 bits (70), Expect = 4.8
Identities = 13/47 (27%), Positives = 24/47 (51%)
Query: 663 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDI 709
+ G+ + +E+EE + + +DDD + EED +E+ED
Sbjct: 312 DNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDE 358
Score = 30.4 bits (69), Expect = 5.7
Identities = 28/147 (19%), Positives = 50/147 (34%), Gaps = 44/147 (29%)
Query: 298 DLEEDEEMEEE---YGQ---RRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEIN 351
+LE+D+E E GQ +DDD++ D ++D +
Sbjct: 105 NLEDDDEDEFVLTHLGQSLSEIDKDDDVRDDDDFDDD--------------DLGDLASDD 150
Query: 352 EDDMQEGTTSEEESDNENEEARKSKE-----------------AKDNTQVQDMEEDSSSE 394
G +EE + E E +KSK+ K + +D+ E+ +
Sbjct: 151 RAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVIAKSKFYKAERQKAKEEDEDLREE--LD 208
Query: 395 EEQEDI-----PPPPPPPSLKKVEPAP 416
++ +D+ PPP E
Sbjct: 209 DDFKDLMSLLRTVKPPPKPPMTPEEKD 235
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 36.7 bits (85), Expect = 0.058
Identities = 31/129 (24%), Positives = 47/129 (36%), Gaps = 5/129 (3%)
Query: 290 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRG--RGRMQ 347
E +D+ + L E E E G E D Q +DE + + + R R R
Sbjct: 244 ESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRA 303
Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDN-TQVQDM-EEDSSSEEEQEDIPPPPP 405
+ +E + E EE +E++ D E+D S+EE E
Sbjct: 304 KPSDEPSLPESDIHEEI-PKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKK 362
Query: 406 PPSLKKVEP 414
P +KV P
Sbjct: 363 KPRKRKVNP 371
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 35.8 bits (83), Expect = 0.062
Identities = 14/68 (20%), Positives = 27/68 (39%), Gaps = 1/68 (1%)
Query: 351 NEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLK 410
E D++ + S++ +SK+ T + ++ SSSE D+ PP +
Sbjct: 110 GESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEE-E 168
Query: 411 KVEPAPPL 418
E +
Sbjct: 169 IAEEEEEV 176
Score = 30.8 bits (70), Expect = 2.6
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 9/68 (13%)
Query: 668 GEEDLE--EDEEMEEEYGQRRPEDD------DMQIDMEEDSSSEEEQEDIPPPPPPPSLK 719
GE DLE DE+ E E+ ++++ ++ SSSE D+ PP +
Sbjct: 110 GESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEE-E 168
Query: 720 KVEPAPPL 727
E +
Sbjct: 169 IAEEEEEV 176
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 36.2 bits (84), Expect = 0.068
Identities = 24/103 (23%), Positives = 48/103 (46%), Gaps = 12/103 (11%)
Query: 296 EEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDM 355
+ EE ++ E E G +DDD + + E D + G E I+ +
Sbjct: 77 KWKEEERKKKEAEQGLESDDDDDEEEEWEVEED---------EDSDDEG---EWIDVESD 124
Query: 356 QEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
+E +S+ E + E +EA K + + ++ + +E+ ++EEE+
Sbjct: 125 KEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEA 167
Score = 31.5 bits (72), Expect = 1.8
Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 2/48 (4%)
Query: 666 DNGEEDL--EEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPP 711
+ E + EDEE ++E ++ ED D ++ E++ + EE+E
Sbjct: 123 SDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 36.7 bits (84), Expect = 0.069
Identities = 34/172 (19%), Positives = 68/172 (39%), Gaps = 9/172 (5%)
Query: 232 EAQRKQEEARVEKDRVAYAQIDWHDFVVVETVDYMPGEPGNFPPPTTPDEVGARVLMQER 291
E +K EEA+++ + + A+ + VE + E E ++ E
Sbjct: 1610 EEAKKAEEAKIKAEELKKAE---EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
Query: 292 MDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRG----RMQ 347
EED ++ EE ++ + + + + E ++ + A + +
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAE--ALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724
Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
EE N+ +E EE + EEA+K +E K EE+ +EE +++
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776
Score = 30.9 bits (69), Expect = 4.8
Identities = 35/185 (18%), Positives = 77/185 (41%), Gaps = 9/185 (4%)
Query: 219 DEVKYRADYIRYQEAQRKQEEARVEKDRVAYAQIDWHDFVVVETVDYMPGEPGNFPPPTT 278
DE+K +A+ ++ E ++K EEA+ ++ A + E
Sbjct: 1549 DELK-KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKM 1607
Query: 279 PDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDM---QIGSDEENDYYNARMR 335
E A+ + ++ E E+E+ + E +++ ++ ++ EE + A
Sbjct: 1608 KAE-EAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
Query: 336 GRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAK--DNTQVQDMEEDSSS 393
+ A + + +E + ++ + E E EEA+K++E K + + + EE +
Sbjct: 1667 AKKAEEDKKKAEEAKKAE--EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724
Query: 394 EEEQE 398
EEE +
Sbjct: 1725 EEENK 1729
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 36.2 bits (84), Expect = 0.077
Identities = 12/54 (22%), Positives = 19/54 (35%)
Query: 370 EEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPS 423
+ AK + + ++ E P PP + K EPAP P+
Sbjct: 127 AAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPT 180
Score = 32.3 bits (74), Expect = 1.1
Identities = 10/46 (21%), Positives = 17/46 (36%)
Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
+ + ++ E+ + P P PP+ K P PP P
Sbjct: 136 KAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTP 181
Score = 31.6 bits (72), Expect = 2.3
Identities = 12/60 (20%), Positives = 17/60 (28%), Gaps = 4/60 (6%)
Query: 708 DIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTD 767
PP P + + P +P P P P A+PP P+
Sbjct: 122 GAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPT----PPAAAKPPEPAPAAKP 177
Score = 30.4 bits (69), Expect = 4.2
Identities = 11/62 (17%), Positives = 13/62 (20%), Gaps = 1/62 (1%)
Query: 702 SEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV-KKGYDPKQARPPV 760
P K P P P+ PP P+ K P P
Sbjct: 129 PAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARADPR 188
Query: 761 VK 762
Sbjct: 189 ET 190
Score = 30.0 bits (68), Expect = 6.4
Identities = 16/79 (20%), Positives = 25/79 (31%), Gaps = 8/79 (10%)
Query: 654 EVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPP 713
EVGA + +D G + + + ++ E P
Sbjct: 110 EVGAPLS---EIDTGGAPPAAAPAAAAAAKAEKTTPE-----KPKAAAPTPEPPAASKPT 161
Query: 714 PPPSLKKVEPAPPLPSQPS 732
PP + K EPAP P+
Sbjct: 162 PPAAAKPPEPAPAAKPPPT 180
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 36.6 bits (85), Expect = 0.080
Identities = 28/161 (17%), Positives = 46/161 (28%), Gaps = 23/161 (14%)
Query: 310 GQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNEN 369
+RR D+ EN N R R + + + + E + T E++
Sbjct: 618 NERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRR 677
Query: 370 EEARKSKEAKDNTQ----VQDMEEDSSSEEEQEDIPPPPPP------------------- 406
E R+ + K Q ++EE S E EQE+ P
Sbjct: 678 ERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEERVQQVQPRRKQRQLNQKVRIEQSVAE 737
Query: 407 PSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQE 447
++ V + P KV + E
Sbjct: 738 EAVAPVVEETVAAEPVVQEVPAPRTELVKVPLPVVAQTAPE 778
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 35.9 bits (83), Expect = 0.11
Identities = 16/71 (22%), Positives = 20/71 (28%), Gaps = 4/71 (5%)
Query: 701 SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPV 760
+E + PPP P P P P L A V K Y P
Sbjct: 398 PPKEPVRETATPPPVPP----RPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEE 453
Query: 761 VKPSPTDEFLI 771
K D ++
Sbjct: 454 EKALIADGDVL 464
Score = 34.4 bits (79), Expect = 0.27
Identities = 14/53 (26%), Positives = 16/53 (30%), Gaps = 4/53 (7%)
Query: 392 SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDP 444
+E + PPP P P P P L A V K Y P
Sbjct: 398 PPKEPVRETATPPPVPP----RPVAPPVPHTPESAPKLTRAAIPVDEKPKYTP 446
Score = 33.2 bits (76), Expect = 0.75
Identities = 14/72 (19%), Positives = 16/72 (22%), Gaps = 16/72 (22%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEF 769
P P P PP PP P PPV +
Sbjct: 383 PAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP------------VAPPVPHTPESAP- 429
Query: 770 LISPITGEKIPA 781
+T IP
Sbjct: 430 ---KLTRAAIPV 438
Score = 30.9 bits (70), Expect = 3.7
Identities = 9/55 (16%), Positives = 12/55 (21%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQ 455
P P P PP PP P K+ + +
Sbjct: 383 PAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIP 437
Score = 30.2 bits (68), Expect = 6.0
Identities = 16/61 (26%), Positives = 22/61 (36%), Gaps = 1/61 (1%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV-VVKKGYDPKQARPPVVKPSPTD 767
PP E A P P P PP+P P+ + + P +P P+P
Sbjct: 392 AAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPK 451
Query: 768 E 768
E
Sbjct: 452 E 452
>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
partitioning].
Length = 264
Score = 35.2 bits (81), Expect = 0.12
Identities = 33/163 (20%), Positives = 46/163 (28%), Gaps = 15/163 (9%)
Query: 601 IHQIHKVKGLLPDEEKEKIGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVL 660
H K +P + G PN P K P E A
Sbjct: 34 YFITHHKKAPIPFLSNQGTGSL---LPNKPEEVWSYI--KALEDRQIGVPQPTEPAAVKD 88
Query: 661 MQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKK 720
+ + LE+ E ++ + E + EEQ
Sbjct: 89 AERLTPEQRQLLEQMEVDQKAQPTQLGEQPEQA--------RIEEQPRTQSQKAQSQATT 140
Query: 721 VEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKP 763
V+ P P +P P+ PAP V+K PK P KP
Sbjct: 141 VQTQPVKPKPRPEKPQPVAPAPAPEPVEK--APKAEAAPPPKP 181
Score = 32.5 bits (74), Expect = 0.85
Identities = 26/143 (18%), Positives = 39/143 (27%), Gaps = 9/143 (6%)
Query: 605 HKVKGLLPDEEKEK---IGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLM 661
LLP++ +E I P P +P TP++ M
Sbjct: 50 QGTGSLLPNKPEEVWSYIKALEDRQIGVPQP------TEPAAVKDAERLTPEQRQLLEQM 103
Query: 662 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKV 721
+ L E E Q R + Q + + + P P P +
Sbjct: 104 EVDQKAQPTQLGEQPEQARIEEQPRTQSQKAQSQATTVQTQPVKPKPRPEKPQPVAPAPA 163
Query: 722 EPAPPLPSQPSIQPPPLPPAPDK 744
+ PPP P A D
Sbjct: 164 PEPVEKAPKAEAAPPPKPKAEDA 186
Score = 30.2 bits (68), Expect = 5.0
Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 5/92 (5%)
Query: 370 EEARKSKEAKDNTQVQDMEEDSSSEEEQ--EDIPPPPPPPSLKKVEPAPPLPSQPSI--- 424
E R+ E + Q + E+ E+ P + + P +P
Sbjct: 94 PEQRQLLEQMEVDQKAQPTQLGEQPEQARIEEQPRTQSQKAQSQATTVQTQPVKPKPRPE 153
Query: 425 QPPPLPPAPDKVVVKKGYDPKQEKIPASKVQE 456
+P P+ PAP V+K + P K ++
Sbjct: 154 KPQPVAPAPAPEPVEKAPKAEAAPPPKPKAED 185
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 36.0 bits (84), Expect = 0.12
Identities = 19/101 (18%), Positives = 31/101 (30%), Gaps = 5/101 (4%)
Query: 690 DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKK 749
D ++ P + PA P P+ P+ P PP P
Sbjct: 34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93
Query: 750 G--YDPKQARPPVVKPSPTDEFLISPITGEKIPASKVQEHM 788
P A + E ++P+ G A+ V ++M
Sbjct: 94 AAPAAPPAAAAAAAPAAAAVEDEVTPLRG---AAAAVAKNM 131
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 35.6 bits (83), Expect = 0.13
Identities = 23/111 (20%), Positives = 37/111 (33%), Gaps = 1/111 (0%)
Query: 289 QERMDNGEE-DLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQ 347
+R D E + E EE D + + + R G R R R +
Sbjct: 176 GDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDR 235
Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQE 398
+ DD +E + D E R+ + +D + D +E E E
Sbjct: 236 RDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPE 286
Score = 32.6 bits (75), Expect = 1.3
Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 1/66 (1%)
Query: 289 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQE 348
+ R + ++ E+ G R +D + + G R RGR G +
Sbjct: 227 RGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDR-RGRRGGDGGNEREP 285
Query: 349 EINEDD 354
E+ EDD
Sbjct: 286 ELREDD 291
Score = 30.6 bits (70), Expect = 4.5
Identities = 25/132 (18%), Positives = 41/132 (31%), Gaps = 12/132 (9%)
Query: 268 GEPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEEN 327
GE E G + + R D E EE+ + G R + + G
Sbjct: 134 GEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGE---- 189
Query: 328 DYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDM 387
RGR RGR + D ++G EE + + R + +D +
Sbjct: 190 -------RGRREERGRDGDDRD-RRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGD 241
Query: 388 EEDSSSEEEQED 399
+ + D
Sbjct: 242 DNREDRGDRDGD 253
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 34.0 bits (78), Expect = 0.14
Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 4/60 (6%)
Query: 365 SDNENEEARKSKEAKDNTQVQDMEE--DSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 422
SD+E EE +E+ D T+ +D E S E+++ ++P P K EP P PS+P
Sbjct: 94 SDDEEEE--TEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151
Score = 29.7 bits (67), Expect = 3.3
Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 26/102 (25%)
Query: 630 PLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPED 689
PL R++P M K + +E + +E+ E +EE +E Q P +
Sbjct: 76 PLVRNLPAMWK-------------------IFRELSSSDDEEEETEEESTDETEQEDPPE 116
Query: 690 DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 731
+ S E+++ ++P P K EP P PS+P
Sbjct: 117 -------TKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 34.8 bits (80), Expect = 0.14
Identities = 19/70 (27%), Positives = 26/70 (37%)
Query: 387 MEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQ 446
+ +E P P PP K P P +P+ P P P+K K PK
Sbjct: 44 LAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKP 103
Query: 447 EKIPASKVQE 456
+ P KV+
Sbjct: 104 KPKPKPKVKP 113
Score = 32.1 bits (73), Expect = 1.2
Identities = 19/90 (21%), Positives = 27/90 (30%), Gaps = 2/90 (2%)
Query: 696 MEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQ 755
+ +E P P PP K P P +P+ P P P+K K PK
Sbjct: 44 LAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKP--KPKP 101
Query: 756 ARPPVVKPSPTDEFLISPITGEKIPASKVQ 785
P KP + + +
Sbjct: 102 KPKPKPKPKVKPQPKPKKPPSKTAAKAPAA 131
Score = 30.1 bits (68), Expect = 5.0
Identities = 15/69 (21%), Positives = 21/69 (30%), Gaps = 1/69 (1%)
Query: 381 NTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQ-PPPLPPAPDKVVVK 439
Q + + + E PP P P K +P P + P P P KV +
Sbjct: 55 EPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQ 114
Query: 440 KGYDPKQEK 448
K
Sbjct: 115 PKPKKPPSK 123
>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
Length = 382
Score = 35.2 bits (82), Expect = 0.14
Identities = 21/80 (26%), Positives = 31/80 (38%), Gaps = 13/80 (16%)
Query: 690 DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKK 749
+ + EE+ EE+E P P+ + PAP Q Q PP P
Sbjct: 221 AILLGEEEEEEEEVEEEEAQASPAAEPATAQAAPAPK---QEQQQAPPQRQEP------- 270
Query: 750 GYDPKQARPPVVKPSPTDEF 769
K+A+P V+P+ F
Sbjct: 271 ---EKEAQPVNVQPAQFQSF 287
Score = 34.1 bits (79), Expect = 0.29
Identities = 13/53 (24%), Positives = 20/53 (37%)
Query: 385 QDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 437
+ EE+ EE+E P P+ + PAP Q + P + V
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQPV 277
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 33.8 bits (78), Expect = 0.15
Identities = 30/137 (21%), Positives = 56/137 (40%), Gaps = 22/137 (16%)
Query: 279 PDEVGARVLMQERMDNGEEDLEEDEE--------MEEEYGQRRPEDDDMQIGSDEENDY- 329
DE +++ +D+ +E+ EE+E+ ME +Y D + G D+E+DY
Sbjct: 1 EDEPDEEEELEDDIDDLDEEAEEEEDGEDLFGDNMERDYRANPELDRYDEDGLDDESDYE 60
Query: 330 ---------YNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKD 380
AR+ R+ R RM +DD + + + D + R+ ++ +
Sbjct: 61 ELSAAARRAAEARLNRRDRLLRRRRMPAAFLDDDDDD----DGDFDLTAQPRRRRRQYDE 116
Query: 381 NTQVQDMEEDSSSEEEQ 397
+ D + D EE
Sbjct: 117 DRDADDEDIDPLEEELS 133
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 35.4 bits (82), Expect = 0.17
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 5/48 (10%)
Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 438
+ + +E+ PPPPPPP E P Q A +++V+
Sbjct: 260 APPQPPEEEPPPPPPPP-----EDDDDPPEDEEEQDDAEDRALEEIVL 302
Score = 35.4 bits (82), Expect = 0.17
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 5/48 (10%)
Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 747
+ + +E+ PPPPPPP E P Q A +++V+
Sbjct: 260 APPQPPEEEPPPPPPPP-----EDDDDPPEDEEEQDDAEDRALEEIVL 302
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 34.4 bits (79), Expect = 0.23
Identities = 22/68 (32%), Positives = 26/68 (38%), Gaps = 5/68 (7%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEF 769
PPPP PA P +QP+ P P P PK + PP+ P F
Sbjct: 154 QPPPPHAMP----PASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKSPM-AGTF 208
Query: 770 LISPITGE 777
SP GE
Sbjct: 209 YRSPAPGE 216
Score = 31.0 bits (70), Expect = 2.6
Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 2/34 (5%)
Query: 402 PPPPPPSLKKVEPAP--PLPSQPSIQPPPLPPAP 433
PPPP ++ P P PS P+ PPP P +P
Sbjct: 154 QPPPPHAMPPASPPAAQPAPSAPASSPPPTPASP 187
Score = 30.2 bits (68), Expect = 5.0
Identities = 11/33 (33%), Positives = 13/33 (39%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
P PP S +PAP P+ P PP
Sbjct: 158 PHAMPPASPPAAQPAPSAPASSPPPTPASPPPA 190
Score = 29.8 bits (67), Expect = 5.6
Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 4/33 (12%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
PPPP PA P +QP+ P P P
Sbjct: 154 QPPPPHAMP----PASPPAAQPAPSAPASSPPP 182
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 34.6 bits (80), Expect = 0.25
Identities = 14/63 (22%), Positives = 22/63 (34%)
Query: 371 EARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLP 430
A + + + S+ + +Q P PPP P P P++PS PP
Sbjct: 373 APANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANA 432
Query: 431 PAP 433
Sbjct: 433 ANA 435
Score = 33.8 bits (78), Expect = 0.47
Identities = 14/57 (24%), Positives = 19/57 (33%)
Query: 377 EAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
A + + + PP P PS K + A PS P + P P P
Sbjct: 362 SAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVP 418
Score = 33.0 bits (76), Expect = 0.91
Identities = 14/89 (15%), Positives = 20/89 (22%), Gaps = 8/89 (8%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
I + PAP P+ P P + PP P
Sbjct: 365 ISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQA-------ATTPSPPPAKASPPIPV 417
Query: 769 FLISPITGEKIPASKVQEHMRIGLLDPRW 797
PA+ + L+ W
Sbjct: 418 PAEPTEPSPTPPANAANAPPSLN-LEELW 445
Score = 32.6 bits (75), Expect = 1.0
Identities = 13/46 (28%), Positives = 19/46 (41%)
Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
+ S+ + +Q P PPP P P P++PS PP
Sbjct: 390 IQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANA 435
Score = 32.2 bits (74), Expect = 1.5
Identities = 17/68 (25%), Positives = 21/68 (30%), Gaps = 5/68 (7%)
Query: 703 EEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVK 762
PP P PS K + A PS P + PP P + P P
Sbjct: 379 PAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKAS--PPIPVPAEPTE---PSPTPPANAA 433
Query: 763 PSPTDEFL 770
+P L
Sbjct: 434 NAPPSLNL 441
Score = 31.1 bits (71), Expect = 2.9
Identities = 17/69 (24%), Positives = 23/69 (33%)
Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPP 759
S++ + PPPP+ P P + P PPP P A P + PP
Sbjct: 518 SNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPP 577
Query: 760 VVKPSPTDE 768
P
Sbjct: 578 PPIPEEPTP 586
Score = 31.1 bits (71), Expect = 3.6
Identities = 20/78 (25%), Positives = 28/78 (35%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
PPPP S P PPLP + PPP PP P + + P P P
Sbjct: 522 KTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIP 581
Query: 769 FLISPITGEKIPASKVQE 786
+P + ++ +
Sbjct: 582 EEPTPSPTKDSSPEEIDK 599
Score = 29.9 bits (68), Expect = 6.7
Identities = 13/75 (17%), Positives = 21/75 (28%)
Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIP 450
+ Q+ PPP P P L + P P+ P P + + IP
Sbjct: 522 KTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIP 581
Query: 451 ASKVQEHMRIGLLDP 465
+ +
Sbjct: 582 EEPTPSPTKDSSPEE 596
Score = 29.5 bits (67), Expect = 8.8
Identities = 14/72 (19%), Positives = 19/72 (26%), Gaps = 4/72 (5%)
Query: 367 NENEEARKSKEAKDNTQVQDMEEDSSSEE---EQEDIPPPPPPPSLKKVEPAP-PLPSQP 422
+E A S A + + + + P PP K P P P
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTE 423
Query: 423 SIQPPPLPPAPD 434
PP A
Sbjct: 424 PSPTPPANAANA 435
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 34.6 bits (80), Expect = 0.27
Identities = 14/66 (21%), Positives = 28/66 (42%), Gaps = 12/66 (18%)
Query: 659 VLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSL 718
V +Q +D D++++EE + ++DDM DM++D E + P
Sbjct: 216 VEIQNELD----DIDDEEEERVLADEDDDDEDDM-FDMDDDDEEESD-------PEVERT 263
Query: 719 KKVEPA 724
++
Sbjct: 264 STIKEV 269
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 34.6 bits (80), Expect = 0.28
Identities = 21/116 (18%), Positives = 46/116 (39%), Gaps = 6/116 (5%)
Query: 288 MQERMDNGEEDLE----EDEEMEEEYGQRRPEDDDMQIGSDE-ENDYYNARMRGRNANRG 342
++E++ E+ L E EE+EEE Q R E +++ D
Sbjct: 689 LEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEER 748
Query: 343 RGRMQEEINEDDMQ-EGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQ 397
++ +E+ E + + E E E +++ + Q++ ++E+ + E
Sbjct: 749 IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA 804
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 33.8 bits (78), Expect = 0.31
Identities = 17/57 (29%), Positives = 23/57 (40%), Gaps = 3/57 (5%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSP 765
+ PP P + EPAP P +P P P + V+ PK PVV+
Sbjct: 90 VAPPNTPV---EPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143
Score = 32.3 bits (74), Expect = 0.74
Identities = 15/56 (26%), Positives = 20/56 (35%)
Query: 711 PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPT 766
PP P + P P +P +P P P KV P+ K +PT
Sbjct: 91 APPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPT 146
Score = 31.9 bits (73), Expect = 1.2
Identities = 17/65 (26%), Positives = 23/65 (35%), Gaps = 1/65 (1%)
Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKP 763
+ PSL APP + +P P+ P K V K PK + P
Sbjct: 71 AAEAVRAGDAAAPSLDPATVAPP-NTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPP 129
Query: 764 SPTDE 768
+P E
Sbjct: 130 APKPE 134
Score = 30.7 bits (70), Expect = 2.9
Identities = 14/74 (18%), Positives = 25/74 (33%), Gaps = 3/74 (4%)
Query: 698 EDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP---PLPPAPDKVVVKKGYDPK 754
E ++ D P P+ P P ++PP P+ K ++ +
Sbjct: 69 EGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAP 128
Query: 755 QARPPVVKPSPTDE 768
A P KP ++
Sbjct: 129 PAPKPEPKPVVEEK 142
Score = 30.0 bits (68), Expect = 5.2
Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 3/56 (5%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQ 455
+ PP P + EPAP P +P P P + V+ PK E P + +
Sbjct: 90 VAPPNTPV---EPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEK 142
Score = 29.2 bits (66), Expect = 8.7
Identities = 12/48 (25%), Positives = 16/48 (33%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQA 756
+ P P P K +P +P Q P K K + K A
Sbjct: 97 VEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAA 144
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 34.4 bits (79), Expect = 0.34
Identities = 15/97 (15%), Positives = 22/97 (22%)
Query: 668 GEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPL 727
DE E + + D + +S ++D PP + A
Sbjct: 11 RWNYDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETGSGGGVATST 70
Query: 728 PSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPS 764
P DK P V P
Sbjct: 71 IYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPG 107
Score = 32.4 bits (74), Expect = 1.1
Identities = 23/119 (19%), Positives = 39/119 (32%), Gaps = 14/119 (11%)
Query: 336 GRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEE 395
G RG+ + D+ S E S +EN +A + ++ E
Sbjct: 1 GPGKPRGQPGRWNYDSSDE------SPEGSRDENFDAERDDFLTPLGSTS-----EATSE 49
Query: 396 EQEDIPPPPPPPSLKKVEPAPP--LPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPAS 452
+ +D+ PP S V + +P P P PD + + P +P
Sbjct: 50 DDDDLYPPRETGSGGGVATSTIYTVPRPPR-GPEQTLDKPDSLPASRELPPGPTPVPPG 107
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 34.6 bits (79), Expect = 0.35
Identities = 18/31 (58%), Positives = 18/31 (58%), Gaps = 10/31 (32%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
PPPPPPP EP PSQP PPP PP
Sbjct: 10 PPPPPPPGF---EP----PSQP---PPPPPP 30
Score = 34.6 bits (79), Expect = 0.35
Identities = 18/31 (58%), Positives = 18/31 (58%), Gaps = 10/31 (32%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
PPPPPPP EP PSQP PPP PP
Sbjct: 10 PPPPPPPGF---EP----PSQP---PPPPPP 30
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 34.1 bits (78), Expect = 0.35
Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 3/103 (2%)
Query: 300 EEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGT 359
E EE +R P D + + + R R R E + D+ E
Sbjct: 319 EPIYTWEELLRERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDE-DEDD 377
Query: 360 TSEEESDNENEEARKSKEAKDNTQVQDME--EDSSSEEEQEDI 400
+ E+E D E E R+ +E K+ +++ +E + + E E I
Sbjct: 378 SDEDEVDYEKERKRRREEDKNFLRLKALELSKYAGVNERMEKI 420
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 34.3 bits (79), Expect = 0.36
Identities = 42/192 (21%), Positives = 73/192 (38%), Gaps = 18/192 (9%)
Query: 280 DEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGR-- 337
DE + +L + M E + EE + E + R E + + +EEN+ + + GR
Sbjct: 367 DEENSGLLSMKFMQRAEA--RKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRK 424
Query: 338 -----NANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARK-----SKEAKDNTQVQDM 387
+ ++ N+++ +E S+EE + E+EE K +K K + + Q
Sbjct: 425 FGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKE 484
Query: 388 EEDS-SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQ 446
EE+ EE S KK + S+ + A KV KK K+
Sbjct: 485 EEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKK---KKE 541
Query: 447 EKIPASKVQEHM 458
+ I
Sbjct: 542 KSIDLDDDLIDE 553
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 34.0 bits (78), Expect = 0.37
Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
Query: 352 EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKK 411
++D E S E SD+E E K KE + +++ M ED +EE E +P P +
Sbjct: 268 DEDEDEPKPSGERSDSEEETEEKEKEKRK--RLKKMMEDEDEDEEMEIVPESPVEEE-ES 324
Query: 412 VEPAPPLPSQPSIQPPPLPPAPD 434
EP PP + + + +PD
Sbjct: 325 EEPEPPPLPKKEEEKEEVTVSPD 347
Score = 32.1 bits (73), Expect = 1.3
Identities = 26/114 (22%), Positives = 44/114 (38%), Gaps = 16/114 (14%)
Query: 614 EEKEKIGPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLE 673
EE E+ KR ++ + + P P ER D+ EE E
Sbjct: 242 EESEEESGKRDVILEDESAEPTGL-DEDEDEDEPKP-----------SGERSDSEEETEE 289
Query: 674 EDEEMEEEYGQRRPEDDDMQIDMEED--SSSEEEQEDIPPPPPPPSLKKVEPAP 725
+++E + ++ +D+ + + E S EE+E P PPP K+ E
Sbjct: 290 KEKEKRKR--LKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEE 341
Score = 29.8 bits (67), Expect = 7.5
Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 3/97 (3%)
Query: 649 PTTPDEVGARVLMQERMDNGEEDLEEDE--EMEEEYGQRRPEDDDMQIDMEEDSSSEEEQ 706
++ A + ++ +E E + EEE ++ E M ED +EE
Sbjct: 252 DVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEM 311
Query: 707 EDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
E +P P + EP PP + + + +PD
Sbjct: 312 EIVPESPVEEE-ESEEPEPPPLPKKEEEKEEVTVSPD 347
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 33.4 bits (76), Expect = 0.39
Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 10/82 (12%)
Query: 626 APNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQR 685
A P + R + + + PP PD V + EE LEE + EEE +
Sbjct: 146 AEKPRVTRFNIVWDNDEDNDEAPPAQPD-VD---------NEEEERLEESDGREEEEDEE 195
Query: 686 RPEDDDMQIDMEEDSSSEEEQE 707
D + + E + EEE E
Sbjct: 196 VGSDSYGEGNRELNEEEEEEAE 217
Score = 30.3 bits (68), Expect = 4.5
Identities = 27/123 (21%), Positives = 48/123 (39%), Gaps = 25/123 (20%)
Query: 293 DNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINE 352
+ EE LEE + EEE ED+++ S E + E+NE
Sbjct: 176 NEEEERLEESDGREEE------EDEEVGSDSYGEGN-------------------RELNE 210
Query: 353 DDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKV 412
++ +E S++ D + E + + + + + E + E E E+ P P +
Sbjct: 211 EEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEE 270
Query: 413 EPA 415
PA
Sbjct: 271 SPA 273
>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 211
Score = 33.3 bits (76), Expect = 0.40
Identities = 22/75 (29%), Positives = 32/75 (42%), Gaps = 3/75 (4%)
Query: 690 DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKK 749
DDMQ+D D S +E P P P + A P+ S P+ P P P +P +
Sbjct: 75 DDMQLD-GIDKLSRDEPRRTEPQEPDPLDESPASAAPVASAPA--PAPSPQSPKPASRRA 131
Query: 750 GYDPKQARPPVVKPS 764
D ++ P + S
Sbjct: 132 SRDMRRIAPFGMNAS 146
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 34.0 bits (78), Expect = 0.43
Identities = 38/239 (15%), Positives = 63/239 (26%), Gaps = 41/239 (17%)
Query: 606 KVKGLLPDEEKEKIGPKRSHAPNP--PLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQE 663
+ +L EE E +R AP P + +P P+ FP P
Sbjct: 149 PAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPP----------- 197
Query: 664 RMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPS---LKK 720
E Y Q + + + + P PP P +
Sbjct: 198 -------------EQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQ 244
Query: 721 VEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIP 780
P L Q PP P + + P Q +P P + P
Sbjct: 245 QPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPP----PQ 300
Query: 781 ASKVQEHMRIGLLDPRWVEQRDKHLNEKINQETVYAPEFLLDPRWVEQRDKHLNEKINQ 839
++ ++ R + R++ + Q + E E + KI
Sbjct: 301 QPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQE--------EAKRAKRRHKIVS 351
>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
Length = 313
Score = 33.6 bits (77), Expect = 0.44
Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 6/91 (6%)
Query: 352 EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDME---EDSSSEEEQEDIPPPPPPPS 408
+D + G EEE E EEA +A+ Q + +E E+ EE E+ P
Sbjct: 90 DDPLFGGELEEEEDKFEQEEAPIPVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVP 149
Query: 409 LKKVEPAPPLPSQPSIQPPPLP-PAPDKVVV 438
E P ++ + P P P P+ V+
Sbjct: 150 --MAEVQPEEETEIEVDEPEEPKPEPELDVI 178
Score = 32.5 bits (74), Expect = 0.85
Identities = 23/99 (23%), Positives = 37/99 (37%), Gaps = 6/99 (6%)
Query: 673 EEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPP--PPPPPSLKKVEPAPPLPSQ 730
E D E + +D ++EE+ E++E P P +KVEP P
Sbjct: 75 EPDFGRENSFHDPLIDDPLFGGELEEEEDKFEQEEAPIPVQAQSQPQPEKVEPQVEEPRD 134
Query: 731 PSIQPPPLPPAPD----KVVVKKGYDPKQARPPVVKPSP 765
+ P P A +V ++ + + P KP P
Sbjct: 135 EEVLEEPEPVAAKVPMAEVQPEEETEIEVDEPEEPKPEP 173
Score = 31.7 bits (72), Expect = 1.6
Identities = 24/107 (22%), Positives = 39/107 (36%), Gaps = 6/107 (5%)
Query: 352 EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPP--PPPPPSL 409
EDD + +E D E + D ++EE+ E++E P P
Sbjct: 63 EDDFEIIRKERKEPDFGRENSFHDPLIDDPLFGGELEEEEDKFEQEEAPIPVQAQSQPQP 122
Query: 410 KKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQE 456
+KVEP P + P P A + + + E+ +V E
Sbjct: 123 EKVEPQVEEPRDEEVLEEPEPVAAKVPMA----EVQPEEETEIEVDE 165
>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
This family consists mainly of the potato leaf roll
virus readthrough protein. This is generated via a
readthrough of open reading frame 3 a coat protein
allowing transcription of open reading frame 5 to give
an extended coat protein with a large c-terminal
addition or read through domain. The readthrough protein
is thought to play a role in the circulative aphid
transmission of potato leaf roll virus. Also in the
family is open reading frame 6 from beet western yellows
virus and potato leaf roll virus both luteovirus and an
unknown protein from cucurbit aphid-borne yellows virus
a closterovirus.
Length = 460
Score = 33.5 bits (77), Expect = 0.46
Identities = 21/127 (16%), Positives = 39/127 (30%), Gaps = 10/127 (7%)
Query: 316 DDDMQIGS-----DEENDYYNA----RMRGRNANRGRGRMQEEINEDDMQEGTTSEEESD 366
D DM+ G DE + + R R + E + + E ++ +
Sbjct: 219 DRDMEFGLISVVLDEHLEASRSSNHVRRPPRPGHLLVVSTPELVLPPEPDEQSSERQTFK 278
Query: 367 NENEEARKSK-EAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQ 425
+ S E + V + +++ S + + D PP + E P P
Sbjct: 279 TPPQPESSSDAENGLVSLVDEDDKEEVSRDSESDAPPDDTDLTRALAEYEAAAPEVPDAA 338
Query: 426 PPPLPPA 432
L
Sbjct: 339 RTVLQGK 345
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 33.5 bits (77), Expect = 0.49
Identities = 25/143 (17%), Positives = 40/143 (27%), Gaps = 11/143 (7%)
Query: 613 DEEKEKIGPKRSHAPNP-PLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEED 671
E+ E +G R H N P +P P + P + + Q E
Sbjct: 57 VEDDEGVGEVRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPP----EAQ 112
Query: 672 LEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPP------PPPSLKKVEPAP 725
+ +P E + + P P P P+ + +PA
Sbjct: 113 VPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAE 172
Query: 726 PLPSQPSIQPPPLPPAPDKVVVK 748
P+ + P DK K
Sbjct: 173 PVAAPQPEPVAEPAPVMDKPKRK 195
Score = 33.1 bits (76), Expect = 0.61
Identities = 16/90 (17%), Positives = 29/90 (32%), Gaps = 5/90 (5%)
Query: 684 QRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
R+P + + + Q P P + E P P + P P P
Sbjct: 101 PRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSA 160
Query: 744 KVVVKKGYDPKQARP-----PVVKPSPTDE 768
++ + P + PV +P+P +
Sbjct: 161 PQPAQQAFQPAEPVAAPQPEPVAEPAPVMD 190
Score = 29.6 bits (67), Expect = 6.9
Identities = 20/159 (12%), Positives = 41/159 (25%), Gaps = 28/159 (17%)
Query: 296 EEDLEEDEEMEEEYGQRR---------PEDDDMQIGSDEENDYYNARMRGRNANRGRGRM 346
++ +ED E +E G+ R + ++ Y + +
Sbjct: 50 DDSYDEDVEDDEGVGEVRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPP 109
Query: 347 QEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP-- 404
+ ++ + Q E + + P P
Sbjct: 110 EAQVPPQHAPRPAQPAPQ-------------PVQQPAYQPQPEQPLQQPVSPQVAPAPQP 156
Query: 405 ----PPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVK 439
P P+ + +PA P+ + P DK K
Sbjct: 157 VHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKRK 195
Score = 29.6 bits (67), Expect = 8.7
Identities = 28/124 (22%), Positives = 46/124 (37%), Gaps = 10/124 (8%)
Query: 669 EEDLEEDEEMEEEYGQRR-----PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEP 723
++ +ED E +E G+ R + Q E +++ Q PP ++
Sbjct: 50 DDSYDEDVEDDEGVGEVRVHRVNHAPANAQ---EHEAARPSPQHQYQPPYASAQPRQPVQ 106
Query: 724 APPL-PSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPAS 782
PP P P P PAP + V + Y P+ +P SP P+ PA
Sbjct: 107 QPPEAQVPPQHAPRPAQPAP-QPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQ 165
Query: 783 KVQE 786
+ +
Sbjct: 166 QAFQ 169
Score = 29.2 bits (66), Expect = 8.8
Identities = 12/76 (15%), Positives = 24/76 (31%), Gaps = 2/76 (2%)
Query: 383 QVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPL-PSQPSIQPPPLPPAPDKVVVKKG 441
+ + + P P P+ + P P P + P P P ++
Sbjct: 108 PPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQA 167
Query: 442 YDPKQEKIPASKVQEH 457
+ P E + A + +
Sbjct: 168 FQP-AEPVAAPQPEPV 182
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 33.8 bits (77), Expect = 0.52
Identities = 19/109 (17%), Positives = 37/109 (33%), Gaps = 14/109 (12%)
Query: 298 DLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQE 357
++ E+ EE G +D+ E+ D + + ++ E + QE
Sbjct: 192 EMLRSMELAEEMGDDTESEDE------EDGD--------DDQPTENEQEEQGEGEGEGQE 237
Query: 358 GTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPP 406
G+ +E + E +E + Q +E E + P P
Sbjct: 238 GSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGARPARP 286
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 32.2 bits (73), Expect = 0.54
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
PPPPP P V P PPLP P + PPP P
Sbjct: 97 PPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPF 128
Score = 32.2 bits (73), Expect = 0.54
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
PPPPP P V P PPLP P + PPP P
Sbjct: 97 PPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPF 128
Score = 32.2 bits (73), Expect = 0.59
Identities = 20/39 (51%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 438
IPPPPP P + P PPLP I PPP PP P V
Sbjct: 72 IPPPPPLPGAAGIPPPPPLPGGAGIPPPP-PPLPGGAAV 109
Score = 32.2 bits (73), Expect = 0.59
Identities = 20/39 (51%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 747
IPPPPP P + P PPLP I PPP PP P V
Sbjct: 72 IPPPPPLPGAAGIPPPPPLPGGAGIPPPP-PPLPGGAAV 109
Score = 30.3 bits (68), Expect = 2.3
Identities = 17/32 (53%), Positives = 20/32 (62%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
IPPPPP P + + P PPLP +I PPP P
Sbjct: 48 IPPPPPLPGVAGIPPPPPLPGATAIPPPPPLP 79
Score = 30.3 bits (68), Expect = 2.3
Identities = 17/32 (53%), Positives = 20/32 (62%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
IPPPPP P + + P PPLP +I PPP P
Sbjct: 48 IPPPPPLPGVAGIPPPPPLPGATAIPPPPPLP 79
Score = 30.3 bits (68), Expect = 2.9
Identities = 17/32 (53%), Positives = 19/32 (59%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
IPPPPP P + P PPLP +I PPP P
Sbjct: 24 IPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLP 55
Score = 30.3 bits (68), Expect = 2.9
Identities = 17/32 (53%), Positives = 19/32 (59%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
IPPPPP P + P PPLP +I PPP P
Sbjct: 24 IPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLP 55
Score = 29.9 bits (67), Expect = 3.1
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
IPPPPP P + P PPLP I PPP P
Sbjct: 60 IPPPPPLPGATAIPPPPPLPGAAGIPPPPPLP 91
Score = 29.9 bits (67), Expect = 3.1
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
IPPPPP P + P PPLP I PPP P
Sbjct: 60 IPPPPPLPGATAIPPPPPLPGAAGIPPPPPLP 91
Score = 29.5 bits (66), Expect = 4.2
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
IPPPPP P + P PPLP I PPP P
Sbjct: 36 IPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLP 67
Score = 29.5 bits (66), Expect = 4.2
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
IPPPPP P + P PPLP I PPP P
Sbjct: 36 IPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLP 67
Score = 29.5 bits (66), Expect = 4.2
Identities = 14/32 (43%), Positives = 14/32 (43%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
PPPP P P PP P P I PPP
Sbjct: 110 PPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGM 141
Score = 29.1 bits (65), Expect = 6.2
Identities = 16/31 (51%), Positives = 17/31 (54%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
PPPPP P + P PPLP I PPP P
Sbjct: 1 PPPPPLPGGVGIPPPPPLPGGVCIPPPPPLP 31
Score = 29.1 bits (65), Expect = 6.2
Identities = 16/31 (51%), Positives = 17/31 (54%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
PPPPP P + P PPLP I PPP P
Sbjct: 1 PPPPPLPGGVGIPPPPPLPGGVCIPPPPPLP 31
Score = 28.4 bits (63), Expect = 9.8
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 400 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
IPPPPP P + P PPLP I PPP P
Sbjct: 12 IPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLP 43
Score = 28.4 bits (63), Expect = 9.8
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
IPPPPP P + P PPLP I PPP P
Sbjct: 12 IPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLP 43
>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
dihydrolipoamide succinyltransferase (E2 component).
This model describes the TCA cycle 2-oxoglutarate system
E2 component, dihydrolipoamide succinyltransferase. It
is closely related to the pyruvate dehydrogenase E2
component, dihydrolipoamide acetyltransferase. The seed
for this model includes mitochondrial and Gram-negative
bacterial forms. Mycobacterial candidates are highly
derived, differ in having and extra copy of the
lipoyl-binding domain at the N-terminus. They score
below the trusted cutoff, but above the noise cutoff and
above all examples of dihydrolipoamide acetyltransferase
[Energy metabolism, TCA cycle].
Length = 403
Score = 33.2 bits (76), Expect = 0.57
Identities = 11/72 (15%), Positives = 21/72 (29%), Gaps = 6/72 (8%)
Query: 388 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQE 447
E + ++ AP + P L PA ++ + G D
Sbjct: 76 EGNDATAAPPAKSGEEKEETPAASAAAAPTAAA----NRPSLSPAARRLAKEHGIDL--S 129
Query: 448 KIPASKVQEHMR 459
+P + V +
Sbjct: 130 AVPGTGVTGRVT 141
Score = 32.8 bits (75), Expect = 0.88
Identities = 13/91 (14%), Positives = 25/91 (27%), Gaps = 8/91 (8%)
Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQA 756
E + ++ AP + P L PA ++ + G D
Sbjct: 76 EGNDATAAPPAKSGEEKEETPAASAAAAPTAAA----NRPSLSPAARRLAKEHGIDLSAV 131
Query: 757 RPPVVKPSPTDE----FLISPITGEKIPASK 783
V T E +P + ++ +
Sbjct: 132 PGTGVTGRVTKEDIIKKTEAPASAQQPAPAA 162
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 33.4 bits (77), Expect = 0.58
Identities = 21/83 (25%), Positives = 33/83 (39%), Gaps = 6/83 (7%)
Query: 280 DEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMR--GR 337
DE ++ ++ ++D E DEE EE EDDD D+++D R
Sbjct: 112 DEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDD---DDDDDDDIATRERSLERR 168
Query: 338 NANRGRGRMQEEINEDDMQEGTT 360
R + E+ E +GT
Sbjct: 169 RRRREWEEKRAEL-EFYYYQGTY 190
Score = 31.1 bits (71), Expect = 3.4
Identities = 16/55 (29%), Positives = 26/55 (47%)
Query: 653 DEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQE 707
DE ++ ++ ++D E DEE EE EDDD D ++ ++ E E
Sbjct: 112 DEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLE 166
Score = 29.6 bits (67), Expect = 9.2
Identities = 13/47 (27%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
Query: 665 MDNGEEDLEEDEEMEEEYGQRR---PEDDDMQIDMEEDSSSEEEQED 708
D+G+ + E +E + R +DDD + D E++ SS+ E ++
Sbjct: 101 FDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDE 147
>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
Provisional.
Length = 464
Score = 33.4 bits (77), Expect = 0.58
Identities = 12/57 (21%), Positives = 17/57 (29%), Gaps = 3/57 (5%)
Query: 395 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPA 451
E D P + + + + P P AP + DP IPA
Sbjct: 82 ESASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADP---DIPA 135
Score = 33.0 bits (76), Expect = 0.68
Identities = 9/50 (18%), Positives = 14/50 (28%)
Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDP 753
E D P + + + + P P AP + DP
Sbjct: 82 ESASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADP 131
Score = 30.7 bits (70), Expect = 3.6
Identities = 8/44 (18%), Positives = 15/44 (34%)
Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
++ E P + + A P P+ P+ + PD
Sbjct: 89 AAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPD 132
Score = 30.7 bits (70), Expect = 3.6
Identities = 8/44 (18%), Positives = 15/44 (34%)
Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
++ E P + + A P P+ P+ + PD
Sbjct: 89 AAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPD 132
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 130
Score = 31.7 bits (72), Expect = 0.59
Identities = 12/40 (30%), Positives = 16/40 (40%), Gaps = 3/40 (7%)
Query: 396 EQEDI--PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
E E+I P P+ P P P++ S Q P P
Sbjct: 17 EMEEIGAPAQAAAPAQPASTPVPV-PTEASPQVEAQAPQP 55
Score = 31.7 bits (72), Expect = 0.59
Identities = 12/40 (30%), Positives = 16/40 (40%), Gaps = 3/40 (7%)
Query: 705 EQEDI--PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
E E+I P P+ P P P++ S Q P P
Sbjct: 17 EMEEIGAPAQAAAPAQPASTPVPV-PTEASPQVEAQAPQP 55
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 33.1 bits (76), Expect = 0.60
Identities = 25/108 (23%), Positives = 40/108 (37%), Gaps = 18/108 (16%)
Query: 657 ARVLMQERMDNGEE-DLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPP 715
R + M+ +E D E EE+ ++YG D+ Q M +E ++ +P P
Sbjct: 152 DRTRESQDMEENDEVDEVEVEEVPDDYGANLAVDEPEQSTMSR---PQEVKQSVPAQQAP 208
Query: 716 PSLKKVEPAPPLPSQPSIQ--------------PPPLPPAPDKVVVKK 749
P+ ++ P+ S S PP PAP K
Sbjct: 209 PNPQQPMPSASSESATSKSASTSRESSPQPQSPPPRRVPAPTVDSTAK 256
>gnl|CDD|214563 smart00213, UBQ, Ubiquitin homologues. Ubiquitin-mediated
proteolysis is involved in the regulated turnover of
proteins required for controlling cell cycle
progression.
Length = 72
Score = 30.3 bits (69), Expect = 0.61
Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 12/49 (24%)
Query: 831 KHLNEKINQET------------GLFFKDSNSLAYYNITPGTKVHLQLK 867
L EKI + T G +D +LA Y I G+ +HL L+
Sbjct: 24 SELKEKIAELTGIPPEQQRLIYKGKVLEDDRTLADYGIQDGSTIHLVLR 72
>gnl|CDD|234706 PRK00269, zipA, cell division protein ZipA; Reviewed.
Length = 293
Score = 32.8 bits (75), Expect = 0.61
Identities = 12/95 (12%), Positives = 31/95 (32%), Gaps = 6/95 (6%)
Query: 346 MQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPP 405
+ +++E D+ + E K+ + + + + ++ + + E P
Sbjct: 67 KEPQLDEHDLPSMSARPRER----RRDTKTAKQQKRGRGSEPQQGDLNLDLDEVEPALFS 122
Query: 406 PPSLKKVEPAPPLPSQPSIQPPP--LPPAPDKVVV 438
+ PP LPP + +V+
Sbjct: 123 DRDDDFTPDKRKSKGREPRIEPPKELPPVEEVLVI 157
>gnl|CDD|237541 PRK13881, PRK13881, conjugal transfer protein TrbI; Provisional.
Length = 472
Score = 32.8 bits (75), Expect = 0.77
Identities = 22/94 (23%), Positives = 37/94 (39%), Gaps = 7/94 (7%)
Query: 367 NENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPP--PPSLKKVEPAPPLPSQPSI 424
+N A+ KE NT + ++ + ++ I P P P + + PLP
Sbjct: 55 KQNAPAQGPKEKAGNTSM--FAKEIAGDQTGGLIEPASPLKVPEMPTGPASAPLPI-ARP 111
Query: 425 QPPPLPPAPDKVVVKKGY--DPKQEKIPASKVQE 456
P PP P G D + ++I +K+Q
Sbjct: 112 DNPDAPPTPPANPGNPGQVNDDEAQRIRMAKLQM 145
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 33.0 bits (75), Expect = 0.81
Identities = 26/124 (20%), Positives = 38/124 (30%), Gaps = 2/124 (1%)
Query: 621 PKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARV--LMQERMDNGEEDLEEDEEM 678
S + +P S + E +E EE+ EDEE
Sbjct: 413 GTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472
Query: 679 EEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPL 738
EEE +++M+ E D EE +ED + P S+QP
Sbjct: 473 EEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESP 532
Query: 739 PPAP 742
P
Sbjct: 533 QEEP 536
Score = 33.0 bits (75), Expect = 0.91
Identities = 37/200 (18%), Positives = 59/200 (29%), Gaps = 12/200 (6%)
Query: 229 RYQEAQRKQEEA----RVEKDRVAYAQIDWHDFVVVETVDYMPGEPGNFPPPTTPDEVGA 284
+Y Q EE R E++R + T P E
Sbjct: 390 KYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEE 449
Query: 285 RVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIG------SDEENDYYNARMRGRN 338
+E + E + EE E+ EEE + ++ D E +A R
Sbjct: 450 EEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSE 509
Query: 339 ANRGRGRMQEEIN-EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQ 397
+ + +Q + EE E+ +A E D + + E SS E
Sbjct: 510 MAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAE-ESPLSSHTEL 568
Query: 398 EDIPPPPPPPSLKKVEPAPP 417
E + P + PP
Sbjct: 569 EGVATPVETKISSSRKLPPP 588
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 32.2 bits (73), Expect = 0.81
Identities = 31/113 (27%), Positives = 45/113 (39%), Gaps = 7/113 (6%)
Query: 627 PNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRR 686
P P P S PP P PG+ PP T D + +Q +++ ++ E + G +
Sbjct: 54 PTTPHPSSQPPPCPPSPGH--PPQTNDTHEKDLALQPPPGGKKKEKKKKETEKPAQGGEK 111
Query: 687 PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLP 739
P D + E + E+ PPPP +VE P P P PL
Sbjct: 112 P-DQGPEAKGEGEGHEPEDPPPEDTPPPPGGEGEVEGGP----SPGPGPGPLD 159
>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
Provisional.
Length = 445
Score = 32.9 bits (75), Expect = 0.92
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 5/33 (15%)
Query: 399 DIPPPPPPPSL--KKVEP---APPLPSQPSIQP 426
D+PPPPPPP + K+VE P S+P+ P
Sbjct: 128 DVPPPPPPPPVVAKRVETPAVVAPRVSEPARNP 160
Score = 32.9 bits (75), Expect = 0.92
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 5/33 (15%)
Query: 708 DIPPPPPPPSL--KKVEP---APPLPSQPSIQP 735
D+PPPPPPP + K+VE P S+P+ P
Sbjct: 128 DVPPPPPPPPVVAKRVETPAVVAPRVSEPARNP 160
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 32.8 bits (75), Expect = 0.92
Identities = 31/205 (15%), Positives = 48/205 (23%), Gaps = 29/205 (14%)
Query: 586 AATRAARANITLEDQIHQIHKVKGLLPDEEKEKIGPKRSHAPNPPLPRSMPPMNKPKPGN 645
A RA+ +IT D + + E + P P + +
Sbjct: 744 ARARASAWDIT--DALFSNPSLVPAKLAEALALLEPAEPQRGAGSSP-PVRAEAAFRRPG 800
Query: 646 FPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEY-----GQRRPED----------- 689
+ P A +R E + P
Sbjct: 801 RLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAA 860
Query: 690 --DDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVV 747
+ E P PP A P P+ +P P P +
Sbjct: 861 AGGRARGKNGRRRPRPPEPRARPGAAAPPKAA---AAAPPAGAPAPRPRPAPRVKLGPMP 917
Query: 748 KKGYDPKQA--RPPV---VKPSPTD 767
G DP+ R P P+P+
Sbjct: 918 PGGPDPRGGFRRVPPGDLHTPAPSA 942
Score = 32.5 bits (74), Expect = 1.3
Identities = 21/145 (14%), Positives = 27/145 (18%), Gaps = 29/145 (20%)
Query: 626 APNPPLPRSMPPMNKPKPGN------FPPPTTPDEVGARVLMQERMDNGEEDLEEDEEME 679
+ + P S P P PP P DL E
Sbjct: 91 SLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSP---------APDLSEMLRPV 141
Query: 680 EEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLP 739
G P ++ + A PL S P
Sbjct: 142 GSPG---PPPAASPPAAGASPAAVAS-----DAASSR-----QAALPLSSPEETARAPSS 188
Query: 740 PAPDKVVVKKGYDPKQARPPVVKPS 764
P + RPP
Sbjct: 189 PPAE-PPPSTPPAAASPRPPRRSSP 212
Score = 31.3 bits (71), Expect = 3.2
Identities = 16/82 (19%), Positives = 17/82 (20%), Gaps = 4/82 (4%)
Query: 687 PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 746
+ S P PPPP PA P PS L P
Sbjct: 91 SLSTLAPASPAREGSPTPPGPSSPDPPPPTP----PPASPPPSPAPDLSEMLRPVGSPGP 146
Query: 747 VKKGYDPKQARPPVVKPSPTDE 768
P P S
Sbjct: 147 PPAASPPAAGASPAAVASDAAS 168
Score = 30.9 bits (70), Expect = 3.9
Identities = 17/59 (28%), Positives = 22/59 (37%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASKVQEHMR 459
PPP P P L ++ P P PP A V +Q +P S +E R
Sbjct: 126 PPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETAR 184
Score = 29.8 bits (67), Expect = 8.0
Identities = 16/81 (19%), Positives = 20/81 (24%)
Query: 353 DDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKV 412
D E T A +S+ + + E P P P
Sbjct: 62 CDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTP 121
Query: 413 EPAPPLPSQPSIQPPPLPPAP 433
PA P PS L P
Sbjct: 122 PPASPPPSPAPDLSEMLRPVG 142
Score = 29.8 bits (67), Expect = 9.6
Identities = 23/127 (18%), Positives = 31/127 (24%), Gaps = 9/127 (7%)
Query: 620 GPKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGA---RVLMQERMDNGEEDLEEDE 676
P R +P PP P S P P PPP+ ++ V
Sbjct: 99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158
Query: 677 EMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP 736
+ + E++ P PP PA P P P
Sbjct: 159 PAAVASDAASSRQAALPLSSPEET------ARAPSSPPAEPPPSTPPAAASPRPPRRSSP 212
Query: 737 PLPPAPD 743
A
Sbjct: 213 ISASASS 219
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 32.5 bits (74), Expect = 0.96
Identities = 20/90 (22%), Positives = 31/90 (34%), Gaps = 7/90 (7%)
Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP 404
RM+ + ++D E ++ + EA TQ + S + Q +
Sbjct: 42 RMKSKRDDDPYDE---VADDEGVGEVRVVRKNEAPQFTQEHEAARQSPQHQYQPEYASAQ 98
Query: 405 PPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
+ P PP S P P P PA D
Sbjct: 99 IKIPV----PQPPQISDPPAHPQPTQPALD 124
Score = 32.1 bits (73), Expect = 1.1
Identities = 25/126 (19%), Positives = 45/126 (35%), Gaps = 11/126 (8%)
Query: 662 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKV 721
+R D+ +++ +DE + E + +++ Q E +++ + Q P +K
Sbjct: 45 SKRDDDPYDEVADDEGVGEV--RVVRKNEAPQFTQEHEAARQSPQHQYQPEYASAQIKIP 102
Query: 722 EPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPA 781
P PP S P P P PA D+ + P + P+ P
Sbjct: 103 VPQPPQISDPPAHPQPTQPALDQ--------EQPPEEARQPVLPQEAPAPQPVHSAA-PQ 153
Query: 782 SKVQEH 787
VQ
Sbjct: 154 PAVQTV 159
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 32.9 bits (75), Expect = 1.0
Identities = 20/119 (16%), Positives = 27/119 (22%), Gaps = 3/119 (2%)
Query: 627 PNPPLPRS--MPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQ 684
P + P + P + R E D +
Sbjct: 159 PRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDW 218
Query: 685 RRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
RP D E + P PP V P P + QP P P +
Sbjct: 219 DRPRRDRTDRP-EPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGE 276
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 32.5 bits (74), Expect = 1.0
Identities = 21/119 (17%), Positives = 45/119 (37%), Gaps = 3/119 (2%)
Query: 285 RVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRG 344
R ++E + +E+ + E P+ D+ + DEE ++ R
Sbjct: 158 RQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEERR 217
Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPP 403
E + ++ + EE + RK +E ++N + +ED E+ ++ P
Sbjct: 218 AAVERRLAELVEMINWNLEERR---RDLRKEQELEENVERDSDDEDEHGEDSEDGETKP 273
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 32.6 bits (74), Expect = 1.1
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 2/33 (6%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
P P P + PAPP+P P IQ PP P
Sbjct: 228 PSPAQPSAPPASIPAPPIP--PVIQYVAPPPVP 258
Score = 32.6 bits (74), Expect = 1.1
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 2/33 (6%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
P P P + PAPP+P P IQ PP P
Sbjct: 228 PSPAQPSAPPASIPAPPIP--PVIQYVAPPPVP 258
>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein. This family consists of
Ebola and Marburg virus nucleoproteins. These proteins
are responsible for encapsidation of genomic RNA. It has
been found that nucleoprotein DNA vaccines can offer
protection from the virus.
Length = 717
Score = 32.6 bits (74), Expect = 1.1
Identities = 26/160 (16%), Positives = 47/160 (29%), Gaps = 15/160 (9%)
Query: 306 EEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEES 365
E G P Q G+ D N + + + QE T +EE
Sbjct: 493 RERQGIPEPPPGSHQPGNRASQDLNNNNQKQEDESTNPIGKTS----LRYQELTPVQEED 548
Query: 366 DNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQ 425
+ E++ + ++ ++ S E+ D+ PP + + E P+
Sbjct: 549 EPEDQ---TDDDDSSLPPLESDDDPGSDNEQGVDLTEVAPPAPVYRDEKEQDEIPHPAQN 605
Query: 426 PPPLPPAPDKVVVKKGYDPKQEKIPASKV--QEHMRIGLL 463
P D D + + E + LL
Sbjct: 606 P------QDPTGSIGNVDSDILRSNSKPSAPLEETYMHLL 639
Score = 30.7 bits (69), Expect = 4.4
Identities = 31/153 (20%), Positives = 46/153 (30%), Gaps = 17/153 (11%)
Query: 621 PKRSHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARV-----LMQERMDNGEEDLEED 675
+R P PP P + N DE + QE EED ED
Sbjct: 493 RERQGIPEPPPGSHQPGNRASQDLNNNNQKQEDESTNPIGKTSLRYQELTPVQEEDEPED 552
Query: 676 EEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQP 735
+ +++ E DD + S E+ D+ PP + + E P+ P
Sbjct: 553 QTDDDDSSLPPLESDD------DPGSDNEQGVDLTEVAPPAPVYRDEKEQDEIPHPAQNP 606
Query: 736 PPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
D D R +P +E
Sbjct: 607 ------QDPTGSIGNVDSDILRSNSKPSAPLEE 633
>gnl|CDD|220462 pfam09899, DUF2126, Putative amidoligase enzyme (DUF2126). Members
of this family of bacterial domains are predominantly
found in transglutaminase and transglutaminase-like
proteins. Their exact function is, as yet, unknown, but
they are likely to act as amidoligase enzymes Protein in
this family are found in conserved gene neighborhoods
encoding a glutamine amidotransferase-like thiol
peptidase (in proteobacteria) or an Aig2 family
cyclotransferase protein (in firmicutes).
Length = 817
Score = 32.6 bits (75), Expect = 1.3
Identities = 19/88 (21%), Positives = 25/88 (28%), Gaps = 21/88 (23%)
Query: 726 PLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPASKVQ 785
PL S P + P P + DP RPP+ PA
Sbjct: 260 PLDSLPWVPPADYPYIHPR-------DPFAPRPPLPPAQREQA-------AASAPAGAAG 305
Query: 786 EHMRIGLLDPRW-------VEQRDKHLN 806
+ +R + VE RD L
Sbjct: 306 QDVREQVPHAEGVVRTALCVEPRDGRLY 333
Score = 29.9 bits (68), Expect = 7.8
Identities = 15/64 (23%), Positives = 18/64 (28%), Gaps = 5/64 (7%)
Query: 699 DSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARP 758
DS D P P P PPLP Q PA + +
Sbjct: 262 DSLPWVPPADYPYIHPRDPF---APRPPLPPAQREQAAASAPA--GAAGQDVREQVPHAE 316
Query: 759 PVVK 762
VV+
Sbjct: 317 GVVR 320
>gnl|CDD|185411 PTZ00044, PTZ00044, ubiquitin; Provisional.
Length = 76
Score = 29.4 bits (66), Expect = 1.4
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 847 DSNSLAYYNITPGTKVHLQLKERGG 871
D L+ Y + PG+ +H+ L+ RGG
Sbjct: 52 DDLKLSDYKVVPGSTIHMVLQLRGG 76
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 32.4 bits (73), Expect = 1.4
Identities = 14/43 (32%), Positives = 24/43 (55%)
Query: 335 RGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKE 377
R + G R + E ++++ E ++SE S+NENE K +E
Sbjct: 549 REDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTDKGEE 591
>gnl|CDD|236511 PRK09429, mepA, penicillin-insensitive murein endopeptidase;
Reviewed.
Length = 275
Score = 31.8 bits (73), Expect = 1.4
Identities = 18/54 (33%), Positives = 19/54 (35%), Gaps = 16/54 (29%)
Query: 390 DSSSEEEQEDIPPPPPPP-----------SLKKVEPAPPLPSQPSIQPPPLPPA 432
S E+Q PPPP S EP P PPPLPPA
Sbjct: 219 GSPECEDQ-----APPPPGDGCGAELASWSDAPFEPPKPTTKPKPKPPPPLPPA 267
Score = 31.8 bits (73), Expect = 1.4
Identities = 18/54 (33%), Positives = 19/54 (35%), Gaps = 16/54 (29%)
Query: 699 DSSSEEEQEDIPPPPPPP-----------SLKKVEPAPPLPSQPSIQPPPLPPA 741
S E+Q PPPP S EP P PPPLPPA
Sbjct: 219 GSPECEDQ-----APPPPGDGCGAELASWSDAPFEPPKPTTKPKPKPPPPLPPA 267
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 32.3 bits (73), Expect = 1.5
Identities = 17/74 (22%), Positives = 24/74 (32%)
Query: 370 EEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPL 429
E A + A + + ++ PP P P K PA P P +
Sbjct: 209 EPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYV 268
Query: 430 PPAPDKVVVKKGYD 443
P K+ KG D
Sbjct: 269 TPLVRKLAKDKGVD 282
Score = 29.6 bits (66), Expect = 9.4
Identities = 14/43 (32%), Positives = 16/43 (37%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYD 752
PP P P K PA P P + P K+ KG D
Sbjct: 240 PPAPAPAPAKTAAPAAAAPVSSGDSGPYVTPLVRKLAKDKGVD 282
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is typically between 384 and 430 amino acids in length.
Length = 414
Score = 32.1 bits (73), Expect = 1.5
Identities = 23/118 (19%), Positives = 36/118 (30%), Gaps = 1/118 (0%)
Query: 649 PTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQED 708
+ EV LM+ + + R E + +E SS + E+
Sbjct: 195 TNSFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSS 254
Query: 709 IPPPPPPPSLKKVEPAPPLPSQPSI-QPPPLPPAPDKVVVKKGYDPKQARPPVVKPSP 765
P P K E APP+ + + P P + + P P SP
Sbjct: 255 APKPRETLDPKSPEKAPPIDTTEEELKSPEASPKESEEASARKRSPSLLSPSPKAESP 312
Score = 29.4 bits (66), Expect = 8.4
Identities = 11/72 (15%), Positives = 24/72 (33%), Gaps = 3/72 (4%)
Query: 368 ENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP 427
+ ++ K+K K + M+ + SS + + P P +K P +
Sbjct: 229 SSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSP---EKAPPIDTTEEELKSPEA 285
Query: 428 PLPPAPDKVVVK 439
+ + K
Sbjct: 286 SPKESEEASARK 297
>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein. Members of
this family confer resistance to the metalloid element
tellurium and its salts.
Length = 98
Score = 30.1 bits (68), Expect = 1.5
Identities = 9/24 (37%), Positives = 10/24 (41%)
Query: 413 EPAPPLPSQPSIQPPPLPPAPDKV 436
AP P P+ P PPA V
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAPPV 24
Score = 30.1 bits (68), Expect = 1.5
Identities = 9/24 (37%), Positives = 10/24 (41%)
Query: 722 EPAPPLPSQPSIQPPPLPPAPDKV 745
AP P P+ P PPA V
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAPPV 24
>gnl|CDD|227676 COG5386, COG5386, Cell surface protein [Cell envelope biogenesis,
outer membrane].
Length = 352
Score = 31.8 bits (72), Expect = 1.6
Identities = 12/58 (20%), Positives = 19/58 (32%), Gaps = 6/58 (10%)
Query: 689 DDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 746
D + I +E D++ ++ P A P + P P P KV
Sbjct: 168 DHEYTIRLEFDTAIDQAANAATPNDDKN------VAQPGTTTPKTPTPQTTPNKPKVE 219
Score = 29.5 bits (66), Expect = 8.4
Identities = 9/48 (18%), Positives = 14/48 (29%), Gaps = 6/48 (12%)
Query: 390 DSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 437
D++ ++ P A P + P P P KV
Sbjct: 178 DTAIDQAANAATPNDDKN------VAQPGTTTPKTPTPQTTPNKPKVE 219
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 32.3 bits (73), Expect = 1.6
Identities = 29/111 (26%), Positives = 40/111 (36%), Gaps = 21/111 (18%)
Query: 638 MNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDME 697
M+K +PG P +EV + +QE+ G D + + +
Sbjct: 1 MDKDRPGLPAPDDNIEEVPSTSGVQEQASGG--------------------DWENVLIED 40
Query: 698 EDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVK 748
DSSSEEE ED PP K+ QPP PP D +
Sbjct: 41 SDSSSEEEAEDAHLEPPQKGKKRKRVDDDAGGSAPAQPPC-PPQLDLPGRE 90
Score = 32.0 bits (72), Expect = 2.0
Identities = 23/78 (29%), Positives = 30/78 (38%), Gaps = 3/78 (3%)
Query: 362 EEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQ 421
EE + + S +N ++D DSSSEEE ED PP K+
Sbjct: 16 EEVPSTSGVQEQASGGDWENVLIED--SDSSSEEEAEDAHLEPPQKGKKRKRVDDDAGGS 73
Query: 422 PSIQPPPLPPAPDKVVVK 439
QPP PP D +
Sbjct: 74 APAQPPC-PPQLDLPGRE 90
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 31.6 bits (72), Expect = 1.6
Identities = 26/124 (20%), Positives = 41/124 (33%), Gaps = 10/124 (8%)
Query: 669 EEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEP----- 723
EE+ EEDE + + ++D + +EDS +EE D P P S +P
Sbjct: 185 EEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKPTSILK 244
Query: 724 -----APPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGEK 778
PS + + + K+A P SP+ S
Sbjct: 245 KSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTSQSPSSAEPTSESPQTA 304
Query: 779 IPAS 782
+S
Sbjct: 305 GNSS 308
Score = 30.9 bits (70), Expect = 2.9
Identities = 30/135 (22%), Positives = 54/135 (40%), Gaps = 8/135 (5%)
Query: 281 EVGA-RVLMQERMDNGEEDLEEDEEMEEEYGQR------RPEDDDMQIGSDEENDYYNAR 333
E GA R +++D +E+ E E EE G R D ++ E +
Sbjct: 101 ESGATRNYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKE- 159
Query: 334 MRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSS 393
++ R A+ M E + + +E EEE + + E +++ + D E+
Sbjct: 160 LQSRRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDD 219
Query: 394 EEEQEDIPPPPPPPS 408
EE+ ++ P P S
Sbjct: 220 EEDNDNTPSPKSGSS 234
>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
Length = 248
Score = 31.5 bits (71), Expect = 1.7
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 395 EEQEDIPPPPPPPSLKKVEPAPPLP 419
E PPPPPPP + EPAPP+
Sbjct: 113 ERAAVSPPPPPPPPPARAEPAPPVA 137
Score = 31.5 bits (71), Expect = 1.7
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLP 728
E PPPPPPP + EPAPP+
Sbjct: 113 ERAAVSPPPPPPPPPARAEPAPPVA 137
>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
[Intracellular trafficking and secretion].
Length = 360
Score = 31.7 bits (72), Expect = 1.7
Identities = 16/70 (22%), Positives = 29/70 (41%), Gaps = 12/70 (17%)
Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPA-PPLPSQPSIQPPP--------LPPAPDKVVVKKG 750
+ + E++ I PP ++++ P PPLP P + PP LP + +
Sbjct: 44 IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDLGAPVLPDQQVEEAKDQ- 102
Query: 751 YDPKQARPPV 760
P++ R
Sbjct: 103 --PRRLRAAE 110
Score = 30.9 bits (70), Expect = 2.8
Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 9/57 (15%)
Query: 391 SSSEEEQEDIPPPPPPPSLKKVEPA-PPLPSQPSIQPPP--------LPPAPDKVVV 438
+ + E++ I PP ++++ P PPLP P + PP LP +
Sbjct: 44 IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDLGAPVLPDQQVEEAK 100
>gnl|CDD|240521 cd13394, Syo1_like, Fungal symportin 1 (syo1) and similar proteins.
This family of eukaryotic proteins includes
Saccharomyces cerevisiae Ydl063c and Chaetomium
thermophilum Syo1, which mediate the co-import of two
ribosomal proteins, Rpl5 and Rpl11 (which both interact
with 5S rRNA) into the nucleus. Import precedes their
association with rRNA and subsequent ribosome assembly
in the nucleolus. The primary structure of syo1 is a
mixture of Armadillo- (ARM, N-terminal part of syo1) and
HEAT-repeats (C-terminal part of syo1).
Length = 597
Score = 31.9 bits (73), Expect = 1.8
Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 11/75 (14%)
Query: 669 EEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPP-----PPPSLKKVEP 723
E D ++DE ME+E + D+ + + E+D +SE ++ P L+ + P
Sbjct: 304 ELDKKDDEAMEDE------QSDEDEEEDEDDEASETLVPELSPLEISLDLLTALLELIAP 357
Query: 724 APPLPSQPSIQPPPL 738
P
Sbjct: 358 TPESVEPILPSSTIE 372
>gnl|CDD|237928 PRK15220, PRK15220, fimbrial chaperone protein; Provisional.
Length = 178
Score = 30.9 bits (70), Expect = 1.9
Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 4/42 (9%)
Query: 33 IVDKTANFVARNGPEFESRIRQNEIGNPKFNFLSPGDPYHAY 74
+V T F+ N I N++G + L+PG Y Y
Sbjct: 36 VVGTTCQFLGTNSAT----IELNQVGVDRLANLNPGQIYTGY 73
>gnl|CDD|223067 PHA03380, PHA03380, transactivating tegument protein VP16;
Provisional.
Length = 432
Score = 31.6 bits (72), Expect = 1.9
Identities = 12/49 (24%), Positives = 17/49 (34%), Gaps = 1/49 (2%)
Query: 694 IDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
+ M D + E + PP + A PS + PP PA
Sbjct: 1 MLMLVDDFIADFDESLLGPPESETAPAP-YATSAPSSAQLALPPPRPAS 48
Score = 31.6 bits (72), Expect = 2.3
Identities = 12/48 (25%), Positives = 16/48 (33%), Gaps = 1/48 (2%)
Query: 386 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
M D + E + PP + A PS + PP PA
Sbjct: 2 LMLVDDFIADFDESLLGPPESETAPAP-YATSAPSSAQLALPPPRPAS 48
Score = 30.8 bits (70), Expect = 3.3
Identities = 13/51 (25%), Positives = 18/51 (35%), Gaps = 2/51 (3%)
Query: 685 RRPEDDDMQIDMEED--SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSI 733
DD D +E E E P PS ++ PP P+ P+
Sbjct: 1 MLMLVDDFIADFDESLLGPPESETAPAPYATSAPSSAQLALPPPRPASPAA 51
>gnl|CDD|188533 TIGR04018, Bthiol_YpdA, putative bacillithiol system
oxidoreductase, YpdA family. Members of this protein
family, including YpdA from Bacillus subtilis, are
apparent oxidoreductases present only in species with an
active bacillithiol system. They have been suggested
actually to be thiol disulfide oxidoreductases (TDOR),
although the evidence is incomplete [Unknown function,
Enzymes of unknown specificity].
Length = 316
Score = 31.4 bits (72), Expect = 2.0
Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 14/42 (33%)
Query: 46 PEFESRIRQNEIGNPKFNFLSPGDPYHAYYQHRVKDIREGKV 87
P+ E+RI++ I AY+ RVK+I E V
Sbjct: 197 PDIENRIKEGSI--------------KAYFNSRVKEITEDSV 224
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 31.2 bits (71), Expect = 2.0
Identities = 15/44 (34%), Positives = 22/44 (50%), Gaps = 2/44 (4%)
Query: 356 QEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
+ T EE E +EA S++ +D +ED SEEE E+
Sbjct: 56 DQETAEIEEVKEEEKEAANSEDKEDKGD--AEKEDEESEEENEE 97
Score = 29.3 bits (66), Expect = 6.4
Identities = 9/46 (19%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 358 GTTSEEESDNENEEARKS-KEAKDNTQVQDMEEDSSSEEEQEDIPP 402
+ S++ + +E E + +E + +V++ E+++++ E++ED
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGD 83
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 31.8 bits (73), Expect = 2.0
Identities = 13/78 (16%), Positives = 20/78 (25%), Gaps = 8/78 (10%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV-VKKGYDPKQARPPVVKPSPTDE 768
P P P PP + PLP +++ ++ Q K P
Sbjct: 387 PTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEP--- 443
Query: 769 FLISPITGEKIPASKVQE 786
P + E
Sbjct: 444 ----AAASRARPVNSALE 457
Score = 31.0 bits (71), Expect = 3.0
Identities = 20/152 (13%), Positives = 35/152 (23%), Gaps = 10/152 (6%)
Query: 626 APNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQR 685
P + P P V + E + R
Sbjct: 370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPP---PASAPQQAPAVPLPETTSQLLAAR 426
Query: 686 RPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 745
Q+ + ++ ++ E P +E + PS P +
Sbjct: 427 ------QQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKA-PAKKEAY 479
Query: 746 VVKKGYDPKQARPPVVKPSPTDEFLISPITGE 777
K + + PV P + L T E
Sbjct: 480 RWKATNPVEVKKEPVATPKALKKALEHEKTPE 511
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 30.0 bits (68), Expect = 2.3
Identities = 28/102 (27%), Positives = 51/102 (50%), Gaps = 9/102 (8%)
Query: 289 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMR-GRNANRGRGRMQ 347
ER++ +E +++ E EE+ Q+R E + DEE R + + + +
Sbjct: 49 YERLELMDEKWKKETED-EEFQQKREE----KKRKDEEKTAKKRAKRQKKKQKKKKKKKA 103
Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEE 389
++ N+ + +EG+ S EES +E EE +E K V+ ME+
Sbjct: 104 KKGNKKEEKEGSKSSEESSDEEEE---GEEDKQEEPVEIMEK 142
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 31.6 bits (72), Expect = 2.4
Identities = 16/56 (28%), Positives = 20/56 (35%), Gaps = 3/56 (5%)
Query: 711 PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPT 766
P PP + PA P P+ PP P AP + P A P +P
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPA---AARPAPAPAPPAAAAPP 434
Score = 30.5 bits (69), Expect = 4.8
Identities = 12/51 (23%), Positives = 21/51 (41%)
Query: 383 QVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
+++ +E + + P P P+ APP+P +P PAP
Sbjct: 373 RLEALERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAP 423
Score = 30.1 bits (68), Expect = 6.5
Identities = 11/35 (31%), Positives = 13/35 (37%), Gaps = 1/35 (2%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 435
PP P PAP P+ P+ PP A
Sbjct: 408 VPPAAPARPAAARPAPA-PAPPAAAAPPARSADPA 441
Score = 30.1 bits (68), Expect = 6.5
Identities = 11/35 (31%), Positives = 13/35 (37%), Gaps = 1/35 (2%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 744
PP P PAP P+ P+ PP A
Sbjct: 408 VPPAAPARPAAARPAPA-PAPPAAAAPPARSADPA 441
>gnl|CDD|234048 TIGR02887, spore_ger_x_C, germination protein, Ger(x)C family.
Members of this protein family are restricted to
endospore-forming members of the Firmicutes lineage of
bacteria, including the genera Bacillus, Clostridium,
Thermoanaerobacter, Carboxydothermus, etc. Members are
nearly all predicted lipoproteins and belong to probable
transport operons, some of which have been characterized
as crucial to germination in response to alanine.
Members typically have been gene symbols gerKC, gerAC,
gerYC, etc [Transport and binding proteins, Amino acids,
peptides and amines, Cellular processes, Sporulation and
germination].
Length = 371
Score = 31.5 bits (72), Expect = 2.4
Identities = 15/89 (16%), Positives = 38/89 (42%), Gaps = 18/89 (20%)
Query: 468 VEQRDKHLNEKINQETVYAPGQAIEASLKQLAERRTDIFGVGDEETAIGKKIGEEDTRKD 527
++ + + KI+ + +E + + + + ++ + +K E++ K+
Sbjct: 261 IKVKGGKPHFKIHIK--------LEGRITESQDPKENLEK--PKNLKKIEKEAEKEIEKE 310
Query: 528 DKLFPGQAIEASLKQLAERRTDIFGVGDE 556
IE +K+L + + D G+GDE
Sbjct: 311 --------IEQLIKKLQKYKIDPLGLGDE 331
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 31.4 bits (71), Expect = 2.6
Identities = 16/60 (26%), Positives = 28/60 (46%)
Query: 377 EAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 436
E +D + +D + SS+ P PPPP ++VE P + +P P + D++
Sbjct: 190 EEEDIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSGDRI 249
Score = 30.2 bits (68), Expect = 5.9
Identities = 14/51 (27%), Positives = 23/51 (45%)
Query: 695 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 745
D + SS+ P PPPP ++VE P + +P P + D++
Sbjct: 199 DYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSGDRI 249
Score = 30.2 bits (68), Expect = 6.4
Identities = 14/55 (25%), Positives = 21/55 (38%)
Query: 688 EDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
E+D + + SSS P PPP ++ P +P P PP+
Sbjct: 191 EEDIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSS 245
Score = 29.8 bits (67), Expect = 8.2
Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 19/78 (24%)
Query: 704 EEQEDI-------PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQA 756
EE+EDI P P+ K +P+PP P + ++ P P P +A
Sbjct: 189 EEEEDIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEP------------KA 236
Query: 757 RPPVVKPSPTDEFLISPI 774
P PS D SP+
Sbjct: 237 SKPSAPPSSGDRIFASPL 254
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 31.4 bits (71), Expect = 2.6
Identities = 19/117 (16%), Positives = 28/117 (23%), Gaps = 8/117 (6%)
Query: 626 APNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQR 685
AP + P A ++D EE+ E+
Sbjct: 457 APAAAARPAAAGPRPVAAAAAAAPARAAPAAAPA-------PADDDPPPWEELPPEFASP 509
Query: 686 RPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
P D S + P +L A P P + P + P P
Sbjct: 510 APAQPDAAPAGWVAESIPDPA-TADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRP 565
Score = 30.2 bits (68), Expect = 6.5
Identities = 14/66 (21%), Positives = 18/66 (27%), Gaps = 7/66 (10%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQP-------PPLPPAPDKVVVKKGYDPKQARPPVVK 762
P P + A P + P+ P PP P + P A V
Sbjct: 464 PAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVA 523
Query: 763 PSPTDE 768
S D
Sbjct: 524 ESIPDP 529
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 31.3 bits (71), Expect = 2.7
Identities = 14/67 (20%), Positives = 18/67 (26%), Gaps = 9/67 (13%)
Query: 698 EDSSSEEEQEDIPPPPP-PPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQA 756
+S Q P P S + PP S P+ P P+
Sbjct: 396 AAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPP--------STAPQAV 447
Query: 757 RPPVVKP 763
RP K
Sbjct: 448 RPAQFKE 454
Score = 30.9 bits (70), Expect = 3.8
Identities = 20/70 (28%), Positives = 25/70 (35%), Gaps = 6/70 (8%)
Query: 389 EDSSSEEEQEDIPPPPP-PPSLKKVEPAPPLPSQPSIQPPPLPP---APDKVVVKKGYDP 444
+S Q P P S + PP S P+ P AP V + +
Sbjct: 396 AAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEE 455
Query: 445 KQEKIPASKV 454
K KIP SKV
Sbjct: 456 K--KIPVSKV 463
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 31.2 bits (71), Expect = 2.7
Identities = 11/86 (12%), Positives = 17/86 (19%), Gaps = 2/86 (2%)
Query: 711 PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFL 770
P + AP + PP P V P A
Sbjct: 393 PVAQAAAAPAPAAAPAAAASAPAAPPA-AAPPAPVAAPAAAAPAAAPAAAPAAVALAPAP 451
Query: 771 ISPITGEKIPASKVQEHMRIGLLDPR 796
+ + A V+ +
Sbjct: 452 PAQAA-PETVAIPVRVAPEPAVASAA 476
Score = 30.8 bits (70), Expect = 4.0
Identities = 15/86 (17%), Positives = 20/86 (23%), Gaps = 4/86 (4%)
Query: 711 PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSP-TDEF 769
P P + APP + P P PA P P+ E
Sbjct: 403 PAAAPAAAASAPAAPPAAAPP---APVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPET 459
Query: 770 LISPITGEKIPASKVQEHMRIGLLDP 795
+ P+ PA
Sbjct: 460 VAIPVRVAPEPAVASAAPAPAAAPAA 485
Score = 30.1 bits (68), Expect = 6.5
Identities = 12/59 (20%), Positives = 16/59 (27%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDE 768
PP P A P + ++ P PPA P V +P
Sbjct: 422 PPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPA 480
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 30.9 bits (70), Expect = 2.7
Identities = 16/73 (21%), Positives = 25/73 (34%), Gaps = 16/73 (21%)
Query: 705 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP-------------PLPPAPDKVVVKKGY 751
E+ +P P PPS ++ P P + + LPPAP
Sbjct: 138 EEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPPPPVV-- 195
Query: 752 DPKQARPPVVKPS 764
P+ + V + S
Sbjct: 196 -PQPSTMVVHRRS 207
Score = 30.5 bits (69), Expect = 3.0
Identities = 13/55 (23%), Positives = 19/55 (34%), Gaps = 13/55 (23%)
Query: 396 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP-------------PLPPAPDKVV 437
E+ +P P PPS ++ P P + + LPPAP
Sbjct: 138 EEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPPP 192
>gnl|CDD|178744 PLN03205, PLN03205, ATR interacting protein; Provisional.
Length = 652
Score = 31.3 bits (70), Expect = 2.9
Identities = 25/93 (26%), Positives = 37/93 (39%), Gaps = 4/93 (4%)
Query: 688 EDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLP--SQPSIQPPPLPPAPDKV 745
ED+D I + + E + D+ PPPSL PAPP+ + PS + P ++
Sbjct: 7 EDEDDFIKAIDIIEAIEAKADMYRLLPPPSLPTFLPAPPVSEMTTPSTKISSSLSHPMQL 66
Query: 746 VVKKGYDPK--QARPPVVKPSPTDEFLISPITG 776
G Q P + SP E + G
Sbjct: 67 QSSAGQKRDQIQVPDPFISYSPPRELSQRVVGG 99
>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
Length = 767
Score = 31.3 bits (71), Expect = 3.0
Identities = 14/51 (27%), Positives = 19/51 (37%)
Query: 386 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 436
D + + + Q PP P P V PAP S + P P A +
Sbjct: 45 DEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAPGAIGDL 95
Score = 31.3 bits (71), Expect = 3.0
Identities = 14/51 (27%), Positives = 19/51 (37%)
Query: 695 DMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKV 745
D + + + Q PP P P V PAP S + P P A +
Sbjct: 45 DEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAPGAIGDL 95
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.1 bits (70), Expect = 3.1
Identities = 15/70 (21%), Positives = 24/70 (34%), Gaps = 16/70 (22%)
Query: 689 DDDMQIDMEEDSSSEEEQEDIPPPPPP----------PSLK-KVEPAP---PLPSQ--PS 732
D + +++ + P P P + P+P P P Q P
Sbjct: 377 DGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPV 436
Query: 733 IQPPPLPPAP 742
+ PLPP+P
Sbjct: 437 ARSAPLPPSP 446
Score = 31.1 bits (70), Expect = 3.1
Identities = 14/61 (22%), Positives = 22/61 (36%), Gaps = 16/61 (26%)
Query: 389 EDSSSEEEQEDIPPPPPP----------PSLK-KVEPAP---PLPSQ--PSIQPPPLPPA 432
+++ + P P P + P+P P P Q P + PLPP+
Sbjct: 386 GSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPLPPS 445
Query: 433 P 433
P
Sbjct: 446 P 446
>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
protein.
Length = 314
Score = 31.0 bits (70), Expect = 3.1
Identities = 22/104 (21%), Positives = 34/104 (32%), Gaps = 11/104 (10%)
Query: 358 GTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPP------PPSLKK 411
G+ ++ S S K Q+ + E S D PS +
Sbjct: 64 GSKKKKTSKKSTANLASSSYQKIVIQL-NRENLKKSVPRPSDTTLSCANLSSLREPSPGQ 122
Query: 412 VEPAPP---LPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPAS 452
PAPP L + + P AP + G P++ + AS
Sbjct: 123 P-PAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSPRRVIVQAS 165
Score = 30.2 bits (68), Expect = 4.4
Identities = 19/64 (29%), Positives = 25/64 (39%), Gaps = 2/64 (3%)
Query: 687 PEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSL--KKVEPAPPLPSQPSIQPPPLPPAPDK 744
P D + SS E PP PPP L K P + PSI +P +
Sbjct: 100 PRPSDTTLSCANLSSLREPSPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSPRR 159
Query: 745 VVVK 748
V+V+
Sbjct: 160 VIVQ 163
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 31.2 bits (70), Expect = 3.4
Identities = 24/110 (21%), Positives = 51/110 (46%), Gaps = 4/110 (3%)
Query: 291 RMDNGEEDLEEDE-EMEEEYGQRRPEDDDMQIGSDEE-NDYYNARMR-GRNANRGRGRMQ 347
++DN + L+ + +ME++ + + + + G+DE+ ND Y+ R R R Q
Sbjct: 266 KLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQ 325
Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQ 397
E+ + + + ++E+++ E+ R +A D Q DS +
Sbjct: 326 RELEKLNKERRLLNQEKTELLVEQGRLQLQA-DRHQEHIRARDSLIQSLA 374
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 31.0 bits (70), Expect = 3.4
Identities = 9/49 (18%), Positives = 20/49 (40%)
Query: 401 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKI 449
P P + + + P +P + P P P+ V+ + ++K+
Sbjct: 398 AAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNAAAVRSMWSTVRDKV 446
Score = 29.8 bits (67), Expect = 8.7
Identities = 23/118 (19%), Positives = 45/118 (38%), Gaps = 3/118 (2%)
Query: 685 RRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDK 744
+R E + ++DM ++ P P + + + P +P + P P P+
Sbjct: 374 QRVERIETRLDMSIPANLLHNAPQ-AAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNA 432
Query: 745 VVVKKGYDPKQARPPVVKPSPTDEFLISPITGEKIPASKVQEHMRIGLLDPRWVEQRD 802
V+ + R V + S T E +++ T + + + L R EQR+
Sbjct: 433 AAVRSMW--STVRDKVRQRSRTTEVMLAGATVRALEGNTLVLTHESAPLARRLSEQRN 488
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 30.6 bits (69), Expect = 3.6
Identities = 20/91 (21%), Positives = 35/91 (38%), Gaps = 5/91 (5%)
Query: 701 SSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPV 760
++E+ P PP +P P P I PP PPA + V Q+ P
Sbjct: 50 GGRSKKENEPAPPSTMIATNTKPFHPAP----IDVPPDPPAAQEAVQPTAPPSAQSEPER 105
Query: 761 VKPSPTDEFLISPITGEKIPASKVQEHMRIG 791
+P P + + + +G++ + + H
Sbjct: 106 NEPRPEETPIFAYSSGDQGGSKRAG-HGDTD 135
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 30.7 bits (69), Expect = 3.8
Identities = 39/186 (20%), Positives = 62/186 (33%), Gaps = 34/186 (18%)
Query: 290 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEE 349
+ D + D +DE EE+Y D G+D E R E
Sbjct: 267 DDADEYDSDDGDDEGREEDY-----ISDSSASGNDPEE-------------REDKLSPEI 308
Query: 350 IN-EDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQ----DMEEDSSSEEEQED----- 399
+ Q+ + E E + EE SK+ K +++ +++D S + D
Sbjct: 309 PAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDID 368
Query: 400 ------IPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQEKIPASK 453
+ KK EP PS P P P K K+ + K PAS
Sbjct: 369 GEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASV 428
Query: 454 VQEHMR 459
+ ++
Sbjct: 429 PAKKLK 434
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 31.0 bits (70), Expect = 3.8
Identities = 24/106 (22%), Positives = 43/106 (40%), Gaps = 18/106 (16%)
Query: 298 DLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQE 357
D+ +M EE G DD ++ +DEE+ + ++ GR E+
Sbjct: 211 DMLGSMDMAEETG-----DDGIEEDADEEDGDDDQPDNNEDSEAGR--------EESEGS 257
Query: 358 GTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPP 403
+ E+E++ + E + + + EDS S+E ED P
Sbjct: 258 DESEEDEAEATDGEGEEGEM-----DAAEASEDSESDESDEDTETP 298
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 30.8 bits (69), Expect = 3.8
Identities = 15/56 (26%), Positives = 22/56 (39%)
Query: 349 EINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP 404
E N+D T SEE+ D+ EE KS +KD + + S+
Sbjct: 104 EQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDL 159
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 30.4 bits (69), Expect = 4.2
Identities = 16/82 (19%), Positives = 24/82 (29%), Gaps = 3/82 (3%)
Query: 690 DDMQIDMEEDSSSEEE---QEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 746
+ MQ DM + + E E P P + + P + QP P+ P
Sbjct: 89 EQMQADMRQQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTA 148
Query: 747 VKKGYDPKQARPPVVKPSPTDE 768
+ P R P
Sbjct: 149 PVQPQTPAPVRTQPAAPVTQAV 170
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.5 bits (69), Expect = 4.2
Identities = 13/42 (30%), Positives = 17/42 (40%), Gaps = 4/42 (9%)
Query: 394 EEEQEDIPPPPPPPSLKKVEPAPPLPSQ----PSIQPPPLPP 431
E + PPPPPPP + E P P Q + +P
Sbjct: 267 PEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPE 308
Score = 30.5 bits (69), Expect = 4.2
Identities = 13/42 (30%), Positives = 17/42 (40%), Gaps = 4/42 (9%)
Query: 703 EEEQEDIPPPPPPPSLKKVEPAPPLPSQ----PSIQPPPLPP 740
E + PPPPPPP + E P P Q + +P
Sbjct: 267 PEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPE 308
>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
Provisional.
Length = 653
Score = 30.7 bits (69), Expect = 4.2
Identities = 25/127 (19%), Positives = 37/127 (29%), Gaps = 16/127 (12%)
Query: 651 TPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIP 710
P + V + R EDL+ E Y ++ E DS SE E E
Sbjct: 21 IPSSLEDEVFLDPRPRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSETDSGSEAEAERGR 80
Query: 711 ---------PPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVV 761
P K + P ++ PA G PK + PP +
Sbjct: 81 RAGMGGRNKATKPSRRNKTTQCRPT---SLALATAATMPATPSS----GKSPKVSSPPSI 133
Query: 762 KPSPTDE 768
++
Sbjct: 134 PSLSEED 140
>gnl|CDD|152901 pfam12467, CMV_1a, Cucumber mosaic virus 1a protein. This domain
family is found in viruses, and is typically between 156
and 171 amino acids in length. The family is found in
association with pfam01443, pfam01660. 1a protein is the
major virulence factor of the cucumber mosaic virus
(CMV). The Ns strain of CMV causes necrotic lesions to
Nicotiana spp. while other strains cause systemic
mosaic. The determinant of the pathogenesis of these
different strains is the specific amino acid residue at
the 461 residue of the 1a protein.
Length = 175
Score = 29.8 bits (67), Expect = 4.3
Identities = 15/81 (18%), Positives = 23/81 (28%), Gaps = 2/81 (2%)
Query: 351 NEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLK 410
+ D+ + E +K K + Q + S E+ D P +K
Sbjct: 60 SAADVLKRAAWAVEDGKTLRAEKKKKLEEALQQPVQEDSVSEEFEDAPDAPSESVRDDVK 119
Query: 411 KVEPAPPLPSQPSIQPPPLPP 431
P P PP L
Sbjct: 120 PENPVVGQTQAPE--PPELKS 138
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 30.3 bits (68), Expect = 4.3
Identities = 14/58 (24%), Positives = 19/58 (32%), Gaps = 2/58 (3%)
Query: 710 PPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDP--KQARPPVVKPSP 765
PPP P ++ + P P PP AP + P Q+ PP
Sbjct: 142 QPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPS 199
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 30.5 bits (69), Expect = 4.3
Identities = 14/68 (20%), Positives = 22/68 (32%)
Query: 714 PPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISP 773
P + + +P P +++ P P G P P P+P LI+
Sbjct: 673 RPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIAS 732
Query: 774 ITGEKIPA 781
T P
Sbjct: 733 PTAPPEPE 740
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 30.6 bits (69), Expect = 4.4
Identities = 15/55 (27%), Positives = 20/55 (36%), Gaps = 2/55 (3%)
Query: 388 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGY 442
EED ++ + PP +V P P S P P P K + GY
Sbjct: 167 EEDVEISSDRHALIVPPSTGHGNRVHPMPF--PDSSASLQPRPMVPQKDLAVYGY 219
Score = 30.6 bits (69), Expect = 4.4
Identities = 15/55 (27%), Positives = 20/55 (36%), Gaps = 2/55 (3%)
Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGY 751
EED ++ + PP +V P P S P P P K + GY
Sbjct: 167 EEDVEISSDRHALIVPPSTGHGNRVHPMPF--PDSSASLQPRPMVPQKDLAVYGY 219
>gnl|CDD|215224 PLN02400, PLN02400, cellulose synthase.
Length = 1085
Score = 30.7 bits (69), Expect = 4.5
Identities = 28/108 (25%), Positives = 48/108 (44%), Gaps = 8/108 (7%)
Query: 340 NRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
++G R++ + +EDD+ ++ +NE A+ + +A+ Q +D+E SSS E +
Sbjct: 91 HKGSPRVEGDEDEDDV-------DDLENEFNYAQGNGKARHQWQGEDIELSSSSRHESQP 143
Query: 400 IPP-PPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQ 446
IP P ++ A P PL PA DP+Q
Sbjct: 144 IPLLTHGQPVSGEIPCATPDNQSVRTTSGPLGPAERNANSSPYIDPRQ 191
>gnl|CDD|237538 PRK13876, PRK13876, conjugal transfer coupling protein TraG;
Provisional.
Length = 663
Score = 30.7 bits (70), Expect = 4.6
Identities = 12/30 (40%), Positives = 12/30 (40%)
Query: 405 PPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 434
PPP L P P S P P PAP
Sbjct: 555 PPPDLAPPAGPPARPDDWSGLPIPAVPAPA 584
Score = 30.7 bits (70), Expect = 4.6
Identities = 12/30 (40%), Positives = 12/30 (40%)
Query: 714 PPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
PPP L P P S P P PAP
Sbjct: 555 PPPDLAPPAGPPARPDDWSGLPIPAVPAPA 584
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 30.1 bits (68), Expect = 4.6
Identities = 33/147 (22%), Positives = 50/147 (34%), Gaps = 23/147 (15%)
Query: 609 GLLPDEEKEKIGPKRSHAP---NPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERM 665
+ +E+E RS + P + PM + P+ P+ +Q
Sbjct: 60 AWVAADEEESYARFRSDTRPNRHEWKPSPLLPMKLSRQ-----PSVPNLRQEEPRLQSLK 114
Query: 666 DNGEEDLEEDEEMEEEYGQRRPEDDDMQI----DMEEDSSS----EEEQEDIPPPPPPPS 717
EE L++ +E+E R QI +E S+S ++ P S
Sbjct: 115 PPNEEALQKISALEDELAFLRA-----QIAKIVAAQEQSNSTTSDLLSSDESVPSSSTTS 169
Query: 718 LKKVEP--APPLPSQPSIQPPPLPPAP 742
P P L P PPP PP P
Sbjct: 170 FPISPPTEEPVLEVPPPPPPPPPPPPP 196
Score = 29.3 bits (66), Expect = 7.4
Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 5/37 (13%)
Query: 393 SEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPL 429
+EE ++PPPPPPP P PP Q S L
Sbjct: 176 TEEPVLEVPPPPPPPP-----PPPPPSLQQSTSAIDL 207
>gnl|CDD|182338 PRK10255, PRK10255, PTS system N-acetyl glucosamine specific
transporter subunits IIABC; Provisional.
Length = 648
Score = 30.6 bits (69), Expect = 4.7
Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 13/67 (19%)
Query: 718 LKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPTDEFLISPITGE 777
+KKV P+ + S + P AP A+P V + + L+SPITG+
Sbjct: 462 MKKVVARGPVAAA-SAEATPATAAP------------VAKPQAVPNAVSIAELVSPITGD 508
Query: 778 KIPASKV 784
+ +V
Sbjct: 509 VVALDQV 515
>gnl|CDD|216263 pfam01044, Vinculin, Vinculin family.
Length = 850
Score = 30.5 bits (69), Expect = 4.8
Identities = 15/35 (42%), Positives = 18/35 (51%)
Query: 388 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 422
E + SE E+ED P P PP + EP P P P
Sbjct: 620 ELPTDSEFEREDEPADPRPPPPVETEPPPRPPPPP 654
Score = 30.5 bits (69), Expect = 4.8
Identities = 15/35 (42%), Positives = 18/35 (51%)
Query: 697 EEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQP 731
E + SE E+ED P P PP + EP P P P
Sbjct: 620 ELPTDSEFEREDEPADPRPPPPVETEPPPRPPPPP 654
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 30.0 bits (68), Expect = 4.9
Identities = 19/76 (25%), Positives = 27/76 (35%), Gaps = 12/76 (15%)
Query: 703 EEEQEDIPPPPPPPSLKKVEPAP-PLPSQPSIQPPPLP---------PAPDKVVVKKGYD 752
+ + PPP + + EP P P+P P P +P P P VKK
Sbjct: 56 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKK--V 113
Query: 753 PKQARPPVVKPSPTDE 768
+Q + V P
Sbjct: 114 EEQPKREVKPVEPRPA 129
Score = 29.7 bits (67), Expect = 5.7
Identities = 20/67 (29%), Positives = 26/67 (38%), Gaps = 2/67 (2%)
Query: 693 QIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYD 752
Q E E E E IP PP + V P P +P +P P P + K+
Sbjct: 65 QPPPEPVVEPEPEPEPIPEPPKEAPV--VIPKPEPKPKPKPKPKPKPVKKVEEQPKREVK 122
Query: 753 PKQARPP 759
P + RP
Sbjct: 123 PVEPRPA 129
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 30.5 bits (69), Expect = 5.1
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 663 ERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQED 708
ER D EED EE+EE +E+ G + DD + + E+D S+ E D
Sbjct: 387 ERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFE-EDDVESKYEDSD 431
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 30.4 bits (68), Expect = 5.2
Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 5/53 (9%)
Query: 347 QEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
+EE++E + S+E ++E +S E+ ++ + E DSS EE+ ED
Sbjct: 935 EEEVSEYEASSDDESDETDEDE-----ESDESSEDLSEDESENDSSDEEDGED 982
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 30.4 bits (68), Expect = 5.2
Identities = 8/38 (21%), Positives = 13/38 (34%)
Query: 396 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
+Q + P P + + QP+ Q P P
Sbjct: 214 QQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQ 251
Score = 30.4 bits (68), Expect = 5.2
Identities = 8/38 (21%), Positives = 13/38 (34%)
Query: 705 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
+Q + P P + + QP+ Q P P
Sbjct: 214 QQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQ 251
Score = 30.0 bits (67), Expect = 6.4
Identities = 15/68 (22%), Positives = 24/68 (35%)
Query: 374 KSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
+ + A TQ ++ + Q P P + V+PA Q Q PP
Sbjct: 196 RHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQG 255
Query: 434 DKVVVKKG 441
K ++G
Sbjct: 256 HKRSREQG 263
>gnl|CDD|237410 PRK13524, PRK13524, outer membrane receptor FepA; Provisional.
Length = 744
Score = 30.4 bits (69), Expect = 5.2
Identities = 19/51 (37%), Positives = 23/51 (45%), Gaps = 14/51 (27%)
Query: 501 RRTDIFGVGDEETAI---GKKI-----------GEEDTRKDDKLFPGQAIE 537
R+ DI G+G E T I GK + GE DTR D P + IE
Sbjct: 97 RQIDIRGMGPENTLILIDGKPVSSRNSVRYGWRGERDTRGDTNWVPPEMIE 147
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 30.1 bits (68), Expect = 5.3
Identities = 20/74 (27%), Positives = 29/74 (39%), Gaps = 4/74 (5%)
Query: 300 EEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGT 359
ED ++ E QR E+ + E + A R R R + ED+ ++
Sbjct: 380 AEDAKVAELISQREEEEALQR----EAEERLEAEQAERAEEDARLRELYPLPEDEFEDED 435
Query: 360 TSEEESDNENEEAR 373
EE E EEAR
Sbjct: 436 ELEEAQPEEEEEAR 449
Score = 29.8 bits (67), Expect = 8.6
Identities = 13/61 (21%), Positives = 30/61 (49%)
Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP 404
++ E I++ + +E E E E E+A +++E ++ + ED +E++ + P
Sbjct: 384 KVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPE 443
Query: 405 P 405
Sbjct: 444 E 444
>gnl|CDD|233677 TIGR01995, PTS-II-ABC-beta, PTS system, beta-glucoside-specific
IIABC component. This model represents a family of PTS
enzyme II proteins in which all three domains are found
in the same polypeptide chain and which appear to have a
broad specificity for beta-glucosides including salicin
(beta-D-glucose-1-salicylate) and arbutin
(Hydroquinone-O-beta-D-glucopyranoside). These are
distinct from the closely related sucrose-specific and
trehalose-specific PTS transporters.
Length = 610
Score = 30.4 bits (69), Expect = 5.3
Identities = 9/36 (25%), Positives = 16/36 (44%)
Query: 752 DPKQARPPVVKPSPTDEFLISPITGEKIPASKVQEH 787
D + DE L +P+ GE +P ++V +
Sbjct: 447 DQEDVVTQEAAALSKDESLYAPVAGEMLPLNEVPDE 482
>gnl|CDD|236122 PRK07877, PRK07877, hypothetical protein; Provisional.
Length = 722
Score = 30.3 bits (69), Expect = 5.4
Identities = 14/53 (26%), Positives = 22/53 (41%), Gaps = 4/53 (7%)
Query: 694 IDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVV 746
+D++E ++ PPPPP + + E APP P A + V
Sbjct: 333 VDLDELLDRLDD----PPPPPSGAGWEAESAPPAPPADRAAEALPQDALEIVA 381
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 29.2 bits (66), Expect = 5.6
Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 12/55 (21%)
Query: 662 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPP 716
Q + + E++ ++DEE EEE EDD EED +E +E+ P
Sbjct: 104 QHLVASEEDESDDDEEDEEE------EDD------EEDDDEDESEEEESPVKKVK 146
>gnl|CDD|237940 PRK15313, PRK15313, autotransport protein MisL; Provisional.
Length = 955
Score = 30.5 bits (68), Expect = 5.7
Identities = 24/76 (31%), Positives = 30/76 (39%), Gaps = 15/76 (19%)
Query: 700 SSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPS-IQPPPLPPA-PDKVVVKKGYDPKQAR 757
+S E E I P P V+P P P P + P P+ P PD V+ P
Sbjct: 555 TSYIEPDEPIIPDP-------VDPVIPDPVIPDPVDPDPVDPVIPDPVI------PDPVD 601
Query: 758 PPVVKPSPTDEFLISP 773
P V P P D + P
Sbjct: 602 PDPVDPEPVDPVIPDP 617
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 28.5 bits (64), Expect = 5.8
Identities = 17/59 (28%), Positives = 23/59 (38%)
Query: 708 DIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPT 766
D PPP P V+ PP+ + + PP P AP P P+ P+P
Sbjct: 38 DDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAPA 96
Score = 28.5 bits (64), Expect = 7.1
Identities = 14/36 (38%), Positives = 15/36 (41%)
Query: 396 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 431
E PP P PP P PL + PPLPP
Sbjct: 68 EPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPP 103
Score = 28.5 bits (64), Expect = 7.1
Identities = 14/36 (38%), Positives = 15/36 (41%)
Query: 705 EQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPP 740
E PP P PP P PL + PPLPP
Sbjct: 68 EPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPP 103
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 30.4 bits (68), Expect = 5.9
Identities = 21/137 (15%), Positives = 36/137 (26%), Gaps = 9/137 (6%)
Query: 624 SHAPNPPLPRSMPPMNKPKPGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYG 683
+ + N+P P A Q+ E +
Sbjct: 379 GYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQ------PYYAPAPEQPAQQP 432
Query: 684 QRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPD 743
P + + ++ EEQ+ P ++ P QP P+ P
Sbjct: 433 YYAPAP---EQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPV 489
Query: 744 KVVVKKGYDPKQARPPV 760
+ K ARPP+
Sbjct: 490 VEPEPVVEETKPARPPL 506
Score = 30.4 bits (68), Expect = 5.9
Identities = 22/105 (20%), Positives = 40/105 (38%), Gaps = 1/105 (0%)
Query: 661 MQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKK 720
M+ +D+G + +E ++P Q + + + Q P P P +
Sbjct: 732 MKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQY 791
Query: 721 VEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSP 765
+P P+ QP Q P P AP + ++ P +P +P
Sbjct: 792 QQPQQPVAPQPQYQQPQQPVAP-QPQYQQPQQPVAPQPQYQQPQQ 835
>gnl|CDD|226435 COG3921, COG3921, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 300
Score = 29.8 bits (67), Expect = 6.3
Identities = 18/100 (18%), Positives = 29/100 (29%), Gaps = 10/100 (10%)
Query: 677 EMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPP 736
+ E+ E Q + ++ + E + P P + V P P +P P
Sbjct: 19 TVPEQDVMPGAEPVSGQANEQKRIAEEAHPQ--PVARPSSTDDPVTPTEGKPVRPKGLPI 76
Query: 737 PLPPAP--------DKVVVKKGYDPKQARPPVVKPSPTDE 768
P D DP P P+ +E
Sbjct: 77 LALAGPVGELGQPMDLPAPANPGDPLALPEPPSPPTKPEE 116
>gnl|CDD|234178 TIGR03354, VI_FHA, type VI secretion system FHA domain protein.
Members of this protein family are FHA
(forkhead-associated) domain-containing proteins that
are part of type VI secretion loci in a considerable
number of bacteria, most of which are known pathogens.
Species include Pseudomonas aeruginosa PAO1, Aeromonas
hydrophila, Yersinia pestis, Burkholderia mallei, etc
[Protein fate, Protein and peptide secretion and
trafficking, Cellular processes, Pathogenesis].
Length = 396
Score = 30.0 bits (68), Expect = 6.3
Identities = 11/51 (21%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 384 VQDMEEDSSSEEEQEDIPPP-PPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
++ ++++ S + +D+ P PP + A P+ ++P++ PP +P
Sbjct: 141 LKALDQEPLSAADLDDLSAPLFPPLDARLPAFAAPIDAEPTMVPPFVPLPA 191
Score = 29.6 bits (67), Expect = 7.0
Identities = 11/48 (22%), Positives = 25/48 (52%), Gaps = 1/48 (2%)
Query: 696 MEEDSSSEEEQEDIPPP-PPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
++++ S + +D+ P PP + A P+ ++P++ PP +P
Sbjct: 144 LDQEPLSAADLDDLSAPLFPPLDARLPAFAAPIDAEPTMVPPFVPLPA 191
>gnl|CDD|227464 COG5135, COG5135, Uncharacterized conserved protein [Function
unknown].
Length = 245
Score = 29.6 bits (66), Expect = 6.5
Identities = 18/68 (26%), Positives = 26/68 (38%), Gaps = 1/68 (1%)
Query: 361 SEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSLKKVEPAPPLPS 420
SE D +EE R D+ +++E D E+E + PPP K + P P
Sbjct: 125 SETNDDLTDEEIRTPINWDDDDDKRNIENDEHHEDEADFYPPPQLSRHQKSLYRK-PAPG 183
Query: 421 QPSIQPPP 428
Q
Sbjct: 184 QKLTSETS 191
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 30.2 bits (68), Expect = 6.7
Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 1/83 (1%)
Query: 289 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRG-RGRMQ 347
++ D E D +EDE E + E D DE+ D A ++ R +
Sbjct: 650 EDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALP 709
Query: 348 EEINEDDMQEGTTSEEESDNENE 370
+ +N D +G SE+E ++E
Sbjct: 710 KVLNLPDALDGGDSEDEEGMDDE 732
Score = 29.5 bits (66), Expect = 9.9
Identities = 12/48 (25%), Positives = 24/48 (50%)
Query: 661 MQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQED 708
+ E + E+DLEE ++ E+E + + + D E+ E+E +
Sbjct: 644 LFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDA 691
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 30.2 bits (68), Expect = 6.9
Identities = 18/63 (28%), Positives = 26/63 (41%)
Query: 662 QERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIDMEEDSSSEEEQEDIPPPPPPPSLKKV 721
+E N + + EEDEE E + D D ED +EE ++ P + K
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 722 EPA 724
E A
Sbjct: 61 EKA 63
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 29.3 bits (66), Expect = 7.5
Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 5/43 (11%)
Query: 287 LMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDY 329
L E +D +E EE+EE EEE + +DDD D+++DY
Sbjct: 163 LEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD-----DDDDDY 200
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 29.8 bits (67), Expect = 7.5
Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 359 TTSEEESDNENEEA-RKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPPSL 409
T S S E RKS E+ ++ + + SEEE+E P P S+
Sbjct: 72 TESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPDPIASI 123
>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 165 and 237 amino acids in length.
CD99L2 and CD99 are involved in trans-endothelial
migration of neutrophils in vitro and in the recruitment
of neutrophils into inflamed peritoneum.
Length = 154
Score = 29.0 bits (65), Expect = 7.5
Identities = 12/64 (18%), Positives = 16/64 (25%), Gaps = 7/64 (10%)
Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKP 763
++ + P P + KK P L D K PK P
Sbjct: 8 DDDPAMKPTKKPATPKK----PSKGDDDFDLEDALDGGNDGGAGPK---PKPRPQPDPGN 60
Query: 764 SPTD 767
D
Sbjct: 61 PGGD 64
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 29.6 bits (66), Expect = 7.6
Identities = 25/124 (20%), Positives = 48/124 (38%), Gaps = 11/124 (8%)
Query: 291 RMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNARMRGRNANRGRGRMQEEI 350
+ + + + +E +EE E++ + GS EEN+ + ++ R E
Sbjct: 134 KSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEG 193
Query: 351 NEDDM-----------QEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQED 399
+ + QE EE + EE RK E ++ + Q+ + S EEE++
Sbjct: 194 AQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKR 253
Query: 400 IPPP 403
Sbjct: 254 RLKE 257
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 29.6 bits (66), Expect = 8.1
Identities = 34/150 (22%), Positives = 44/150 (29%), Gaps = 37/150 (24%)
Query: 274 PPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDYYNAR 333
P P P G +V + + G E D E EE R E ++ G
Sbjct: 46 PAPPAPTTSGPQV--RAVAEQGHRQTESDTETAEE--SRHGEKEERGQGGPS-------- 93
Query: 334 MRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSS 393
G E + ++EE + + E SS
Sbjct: 94 ----------GSGSESVGSPTPSPSGSAEELASGLSPEN---------------TSGSSP 128
Query: 394 EEEQEDIPPPPPPPSLKKVEPAPPLPSQPS 423
E PPP PP EPAPP PS
Sbjct: 129 ESPASHSPPPSPPSHPGPHEPAPPESHNPS 158
>gnl|CDD|219005 pfam06387, Calcyon, D1 dopamine receptor-interacting protein
(calcyon). This family consists of several D1 dopamine
receptor-interacting (calcyon) proteins. D1/D5 dopamine
receptors in the basal ganglia, hippocampus, and
cerebral cortex modulate motor, reward, and cognitive
behaviour. D1-like dopamine receptors likely modulate
neocortical and hippocampal neuronal excitability and
synaptic function via Ca(2+) as well as cAMP-dependent
signaling. Defective calcyon proteins have been
implicated in both attention-deficit/hyperactivity
disorder (ADHD) and schizophrenia.
Length = 186
Score = 29.2 bits (65), Expect = 8.3
Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 4/48 (8%)
Query: 739 PPAPDKVVVKKG--YDPKQARPPVVKPSPTDEFLIS--PITGEKIPAS 782
P PDKVVVK Y P A + EF + P G ++P +
Sbjct: 38 PSFPDKVVVKTQTEYQPTSADKGKKRAPLIAEFTVCSHPEEGRRLPTA 85
>gnl|CDD|236658 PRK10153, PRK10153, DNA-binding transcriptional activator CadC;
Provisional.
Length = 517
Score = 29.6 bits (67), Expect = 8.7
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 4/41 (9%)
Query: 393 SEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 433
SEEE E+I PPP + A P ++PS PAP
Sbjct: 108 SEEEGEEILASSPPP----IPEAVPATAEPSESANAQFPAP 144
Score = 29.6 bits (67), Expect = 8.7
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 4/41 (9%)
Query: 702 SEEEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQPPPLPPAP 742
SEEE E+I PPP + A P ++PS PAP
Sbjct: 108 SEEEGEEILASSPPP----IPEAVPATAEPSESANAQFPAP 144
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 29.3 bits (66), Expect = 9.0
Identities = 18/86 (20%), Positives = 28/86 (32%)
Query: 348 EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPPPPP 407
E + +E E+ E + E T ++ E+ E + PP P P
Sbjct: 199 EPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVPKPY 258
Query: 408 SLKKVEPAPPLPSQPSIQPPPLPPAP 433
P+ PLP + P P
Sbjct: 259 ISPDEYPSAPLPPELPQLLQPSLVIP 284
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 29.3 bits (65), Expect = 9.0
Identities = 21/76 (27%), Positives = 35/76 (46%)
Query: 323 SDEENDYYNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNT 382
S N A + N N I +++ + GT EEE D + EE + +E +
Sbjct: 69 SACGNVASIANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVE 128
Query: 383 QVQDMEEDSSSEEEQE 398
+ D +EDS ++E+E
Sbjct: 129 EEYDDDEDSEKDDEKE 144
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 28.6 bits (64), Expect = 9.1
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 395 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQP-PPLPP 431
+ Q+ + P PP + + P PPLP +QP PPL P
Sbjct: 109 QPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLP 146
Score = 28.6 bits (64), Expect = 9.1
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 704 EEQEDIPPPPPPPSLKKVEPAPPLPSQPSIQP-PPLPP 740
+ Q+ + P PP + + P PPLP +QP PPL P
Sbjct: 109 QPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLP 146
>gnl|CDD|129265 TIGR00161, TIGR00161, TIGR00161 family protein. This model
represents one out of two closely related ortholgous
sets of proteins that, so far, are found only in the
Archaea. This ortholog set includes MJ0106 from
Methanococcus jannaschii and AF1251 from Archaeoglobus
fulgidus, but not MJ1210 or AF0525 [Hypothetical
proteins, Conserved].
Length = 238
Score = 29.0 bits (65), Expect = 9.2
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 4/41 (9%)
Query: 464 DPR----WVEQRDKHLNEKINQETVYAPGQAIEASLKQLAE 500
DPR VE +K LN ++ E + +AIE+ LK+LAE
Sbjct: 182 DPRAAASLVEVLNKMLNTNVDPEPLLKEAEAIESRLKKLAE 222
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 29.5 bits (66), Expect = 9.3
Identities = 27/129 (20%), Positives = 45/129 (34%), Gaps = 15/129 (11%)
Query: 270 PGNFPPPTTPDEVGARVLMQERMDNGEEDLEEDEEMEEEYGQRRPEDDDMQIGSDEENDY 329
P P P +E +++ ++ E+D E G R E D DY
Sbjct: 272 PNLMPLPGDYNERRGGIILDISLNEFEQDAVPLLVDLERNGNRIKELADF-------GDY 324
Query: 330 YNARMRGRNANRGRGRMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEE 389
Y QE + +E T +EES++E E E ++ +++
Sbjct: 325 Y----EDIFEVVEVVEKQE--GDVVTEEST--DEESEDEVEIDESVIEEVAEMELLEVQV 376
Query: 390 DSSSEEEQE 398
D +E E
Sbjct: 377 DDLAERLAE 385
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 28.3 bits (63), Expect = 9.4
Identities = 14/54 (25%), Positives = 17/54 (31%)
Query: 713 PPPPSLKKVEPAPPLPSQPSIQPPPLPPAPDKVVVKKGYDPKQARPPVVKPSPT 766
PP + + P PS + P Y P A PPV P T
Sbjct: 19 PPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPPGST 72
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 28.9 bits (65), Expect = 9.5
Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 10/71 (14%)
Query: 335 RGRNANRGRGRMQ------EEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDME 388
R R +N G RM+ E +E EEE D E E +E ++ D +
Sbjct: 4 RARRSNAGN-RMKKLLEEELEEDEFFWTYLLFEEEEDDEEFEI---EEEEEEEEVDSDFD 59
Query: 389 EDSSSEEEQED 399
+ E E +D
Sbjct: 60 DSEDDEPESDD 70
>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
topoisomerase IV), A subunit [DNA replication,
recombination, and repair].
Length = 804
Score = 29.5 bits (67), Expect = 9.5
Identities = 21/90 (23%), Positives = 39/90 (43%), Gaps = 12/90 (13%)
Query: 487 PGQAI-EASLKQLAERRTDIFGVGDEETAIGKKIG--EEDTRKDDKLFPGQAIEASLKQL 543
+AI + L++L + + E + K+I E+ +++L I+ L ++
Sbjct: 419 QAEAILDLRLRRLT--GLEEEKIEKELKELEKEIADLEKILASEERLL--DIIKKELLEI 474
Query: 544 A-----ERRTDIFGVGDEETAIGKKIGEED 568
ERRT+I ++E I EED
Sbjct: 475 KKKFGDERRTEIVEEEEDEIEDEDLIAEED 504
>gnl|CDD|119406 cd00225, API3, Ascaris pepsin inhibitor-3 (API3); protein inhibitor
that reversibly inhibits aspartic proteinase cathepsin
E, and gastric enzymes pepsin and gastricsin.
Length = 159
Score = 28.8 bits (64), Expect = 9.5
Identities = 12/62 (19%), Positives = 24/62 (38%), Gaps = 2/62 (3%)
Query: 349 EINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDS--SSEEEQEDIPPPPPP 406
E+ D+ QE E+ + EE +++ + + + +E +P P
Sbjct: 30 ELTPDEQQELAQYVEDVADYKEEVKQALKERQEGLKLRRAGKKKKAVTLAEEKLPKAPKK 89
Query: 407 PS 408
PS
Sbjct: 90 PS 91
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 29.5 bits (66), Expect = 9.8
Identities = 15/67 (22%), Positives = 27/67 (40%), Gaps = 9/67 (13%)
Query: 345 RMQEEINEDDMQEGTTSEEESDNENEEARKSKEAKDNTQVQDMEEDSSSEEEQEDIPPPP 404
R N D MQ E D+E ++ + E K+ +V D +E+ ++ +
Sbjct: 7 REANINNNDRMQ-------EKDDEKQDQKNRMELKE--KVLDKKEEVVTDNVDSPVKEQS 57
Query: 405 PPPSLKK 411
+LK
Sbjct: 58 SQENLKI 64
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.132 0.385
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 49,515,871
Number of extensions: 5175565
Number of successful extensions: 16100
Number of sequences better than 10.0: 1
Number of HSP's gapped: 12157
Number of HSP's successfully gapped: 1082
Length of query: 910
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 804
Effective length of database: 6,236,078
Effective search space: 5013806712
Effective search space used: 5013806712
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 64 (28.6 bits)