RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16142
(426 letters)
>gnl|CDD|233367 TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model represents one of several closely related clades
of the dihydrolipoamide acetyltransferase subunit of the
pyruvate dehydrogenase complex. It includes sequences
from mitochondria and from alpha and beta branches of
the proteobacteria, as well as from some other bacteria.
Sequences from Gram-positive bacteria are not included.
The non-enzymatic homolog protein X, which serves as an
E3 component binding protein, falls within the clade
phylogenetically but is rejected by its low score
[Energy metabolism, Pyruvate dehydrogenase].
Length = 436
Score = 353 bits (907), Expect = e-119
Identities = 175/368 (47%), Positives = 222/368 (60%), Gaps = 50/368 (13%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
+ +PALSPTM G + W KKEGDK+N GD++AEIETDKATM FE EEGYLAKILVP G
Sbjct: 2 ITMPALSPTMTTGNLAKWLKKEGDKVNPGDVIAEIETDKATMEFEAVEEGYLAKILVPEG 61
Query: 145 SKDVPIGKLVCIIVENESDVA-AFKDFK--DDAPPAAGASAPAPPPPKVAAAPPPPPPKA 201
+KDVP+ K + ++VE + DVA AFK++K A PA S AP P A P P P K
Sbjct: 62 TKDVPVNKPIAVLVEEKEDVADAFKNYKLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQ 121
Query: 202 APAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADL----- 256
+P PS P P G R++ASPLAK+LA EKG+DLS++ AGSG G I D+
Sbjct: 122 SPEPSS-PAPLSDKESGDRIFASPLAKKLAKEKGIDLSAV-AGSGPNGRIVKKDIESFVP 179
Query: 257 -----SKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQ---------- 301
+ A A +A P + G + D+P+S +R +IAKRLL+SKQ
Sbjct: 180 QSPASANQQAAATTPATYPAAAPVSTGSYEDVPLSNIRKIIAKRLLESKQTIPHYYVSIE 239
Query: 302 -----VIKLREQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVD 356
++ LR+++N + KLS+NDFIIKA+ALA R VPEANSSW D FIR Y +VD
Sbjct: 240 CNVDKLLALRKELNAMASEV-YKLSVNDFIIKASALALREVPEANSSWTDNFIRRYKNVD 298
Query: 357 VSVAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEG 416
+SVAV T GL TPIV +AD KGL I SN++K LA +A+
Sbjct: 299 ISVAVATPDGLITPIVRNADAKGLSTI-------------------SNEIKDLAKRARNN 339
Query: 417 KLQPHEFQ 424
KL+P EFQ
Sbjct: 340 KLKPEEFQ 347
Score = 70.2 bits (172), Expect = 4e-13
Identities = 29/40 (72%), Positives = 35/40 (87%)
Query: 39 KGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLV 78
+GGT +ISNLGMFGIK+F+AIINPPQACILAVG++ V
Sbjct: 347 QGGTFTISNLGMFGIKDFTAIINPPQACILAVGAVEDVAV 386
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 306 bits (785), Expect = 2e-99
Identities = 150/360 (41%), Positives = 208/360 (57%), Gaps = 44/360 (12%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
+ +P+LSPTM G I W KKEGDK++ G++L E+ETDKAT+ E EEGYLAKI+ G
Sbjct: 115 IGMPSLSPTMTEGNIARWLKKEGDKVSPGEVLCEVETDKATVEMECMEEGYLAKIVKGDG 174
Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAP--PAAGASAPAPPPPKVAAAPPPPPPKAA 202
+K++ +G+++ I VE E D+ FKD+K + PAA + P+PPPPK P
Sbjct: 175 AKEIKVGEVIAITVEEEEDIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEP 234
Query: 203 PAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSK--AS 260
A P+ P S G R++ASPLA++LA + + LSSI G+G G I AD+ AS
Sbjct: 235 KASKPSAPP----SSGDRIFASPLARKLAEDNNVPLSSI-KGTGPDGRIVKADIEDYLAS 289
Query: 261 KAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQVI---------------KL 305
AP + +TD+P + +R V A RLLQSKQ I L
Sbjct: 290 GGKGATAPPSTDSKAPALDYTDIPNTQIRKVTASRLLQSKQTIPHYYLTVDTRVDKLMAL 349
Query: 306 REQMNKALEKRGAK-LSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNTD 364
R Q+N E G K +S+ND +IKA ALA R+VP+ NSSW D +IR+YH+V+++VAV T+
Sbjct: 350 RSQLNSLQEASGGKKISVNDLVIKAAALALRKVPQCNSSWTDDYIRQYHNVNINVAVQTE 409
Query: 365 KGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEFQ 424
GL+ P+V DADKKGL I + +VK LA KA+E L+P +++
Sbjct: 410 NGLYVPVVKDADKKGLSTI-------------------AEEVKQLAQKARENSLKPEDYE 450
Score = 61.4 bits (149), Expect = 3e-10
Identities = 26/40 (65%), Positives = 32/40 (80%), Gaps = 1/40 (2%)
Query: 40 GGTVSISNLG-MFGIKNFSAIINPPQACILAVGSLSQRLV 78
GGT ++SNLG FGIK F AIINPPQ+ ILAVGS +R++
Sbjct: 451 GGTFTVSNLGGPFGIKQFCAIINPPQSAILAVGSAEKRVI 490
>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
subunit E2; Reviewed.
Length = 411
Score = 282 bits (723), Expect = 2e-91
Identities = 132/358 (36%), Positives = 181/358 (50%), Gaps = 60/358 (16%)
Query: 87 LPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
+P L M G IV W K GD + EG LAE+ETDKAT+ +P G +AK+LV G
Sbjct: 7 MPDLGEGMTEGEIVEWLVKVGDTVKEGQPLAEVETDKATVEIPSPVAGTVAKLLVEEGDV 66
Query: 147 DVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
VP+G ++ +I E AA A A APAP P AAA P AA AP+
Sbjct: 67 -VPVGSVIAVIEEEGEAEAA-------AAAEAAPEAPAPEPAPAAAAAAAAAPAAAAAPA 118
Query: 207 PTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVA 266
+ K ASP ++LA E G+DLS++ GSG G IT D+ A+ A A A
Sbjct: 119 APAAAAAK--------ASPAVRKLARELGVDLSTV-KGSGPGGRITKEDVEAAAAAAAPA 169
Query: 267 APSKSAKP-----TANGPFTDLPVSGVRGVIAKRLLQSKQVI---------------KLR 306
A + +A A +P+SG+R IAKR+++SK+ I LR
Sbjct: 170 AAAAAAAAAAPPAAAAEGEERVPLSGMRKAIAKRMVESKREIPHFTLTDEVDVTALLALR 229
Query: 307 EQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNTDKG 366
+Q+ K L++ DF+IKA ALA ++ PE N+SW D I V++ +AV TD G
Sbjct: 230 KQLKAIGVK----LTVTDFLIKAVALALKKFPELNASWDDDAIVLKKYVNIGIAVATDGG 285
Query: 367 LFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEFQ 424
L P++ DADKK L ++ + ++K LA KA+EGKL+P E Q
Sbjct: 286 LIVPVIRDADKKSLFEL-------------------AREIKDLAEKAREGKLKPEELQ 324
Score = 71.4 bits (176), Expect = 2e-13
Identities = 23/40 (57%), Positives = 30/40 (75%)
Query: 39 KGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLV 78
+GGT +ISNLGMFG F+ IINPP+ IL VG++ +R V
Sbjct: 324 QGGTFTISNLGMFGGDYFTPIINPPEVAILGVGAIVERPV 363
>gnl|CDD|223582 COG0508, AceF, Pyruvate/2-oxoglutarate dehydrogenase complex,
dihydrolipoamide acyltransferase (E2) component, and
related enzymes [Energy production and conversion].
Length = 404
Score = 239 bits (612), Expect = 5e-75
Identities = 132/361 (36%), Positives = 187/361 (51%), Gaps = 69/361 (19%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
+ +P L TM GTIV W KK GDK+ EGD+L E+ETDKATM P+ G LAKILV G
Sbjct: 5 IKMPDLGETMTEGTIVEWLKKVGDKVKEGDVLVEVETDKATMEVPAPDAGVLAKILVEEG 64
Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
VP+G ++ I E GA APA AA PP P AAPA
Sbjct: 65 DT-VPVGAVIARIEE------------------EGADAPA------AAEAPPEPAAAAPA 99
Query: 205 PSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADL----SKAS 260
+P S RV ASP +RLA E G+DLS + G+G G IT D+ ++ +
Sbjct: 100 SAPATAASAAAG---RVLASPAVRRLAREAGIDLSKV-KGTGPGGRITKKDVEAAVAEKA 155
Query: 261 KAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQ---------------VIKL 305
A A AP+ +A +A G +P+S +R IA+R+++SKQ ++ L
Sbjct: 156 AAAAAPAPAAAAPASAAGEEERVPMSRIRKAIAERMVESKQTIPHLTLFNEVDMTKLMAL 215
Query: 306 REQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSW--QDTFIREYHSVDVSVAVNT 363
R+++ + EK+G KL+ F++KA A ++ PE N+S I + V++ +AV+T
Sbjct: 216 RKKLKEEFEKKGVKLTFLSFLVKAVVKALKKFPEVNASIDGDGEEIVYHKYVNIGIAVDT 275
Query: 364 DKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEF 423
+GL P++ DADKK L +I + ++K LA KA++GKL P E
Sbjct: 276 PRGLVVPVIRDADKKSLAEI-------------------AKEIKDLAKKARDGKLTPEEM 316
Query: 424 Q 424
Q
Sbjct: 317 Q 317
Score = 60.1 bits (146), Expect = 7e-10
Identities = 27/53 (50%), Positives = 35/53 (66%)
Query: 38 KKGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALPAL 90
+GGT +ISNLGMFG F+ IINPPQ IL VG++ +R V G + V P +
Sbjct: 316 MQGGTFTISNLGMFGSLMFTPIINPPQVAILGVGAIEERPVVVGGEIVVRPMM 368
>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
Length = 547
Score = 182 bits (465), Expect = 5e-52
Identities = 109/363 (30%), Positives = 158/363 (43%), Gaps = 69/363 (19%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
++ W K GD + E L +ETDKATM +P G + +I V G K V +G L+ +
Sbjct: 133 VEVIEWLVKVGDTVEEDQSLITVETDKATMEIPSPVAGVVKEIKVKVGDK-VSVGSLLVV 191
Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
I + A A A+APA P AAA P P AA A P+ +
Sbjct: 192 I-------------EVAAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAA 238
Query: 217 GGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAG-AVAAPSKSAKPT 275
G +ASP +RLA E G+DLS + G+G G IT D+ K + AA + +A
Sbjct: 239 PGKAPHASPAVRRLARELGVDLSQV-KGTGKKGRITKEDVQAFVKGAMSAAAAAAAAAAA 297
Query: 276 ANGPFTDL-----------------PVSGVRGVIAKRLLQSKQVI--------------- 303
A G L P+S ++ + A L +S I
Sbjct: 298 AGGGGLGLLPWPKVDFSKFGEIETKPLSRIKKISAANLHRSWVTIPHVTQFDEADITDLE 357
Query: 304 KLREQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSWQD--TFIREYHSVDVSVAV 361
LR+Q+ K EK G KL++ F IKA A + P N+S + + ++ AV
Sbjct: 358 ALRKQLKKEAEKAGVKLTMLPFFIKAVVAALKEFPVFNASLDEDGDELTYKKYFNIGFAV 417
Query: 362 NTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPH 421
+T GL P++ D DKK L++I + E + LA KA++GKL+P
Sbjct: 418 DTPNGLVVPVIKDVDKKSLLEI----------------AREIAE---LAKKARDGKLKPD 458
Query: 422 EFQ 424
+ Q
Sbjct: 459 DMQ 461
Score = 73.7 bits (182), Expect = 4e-14
Identities = 36/110 (32%), Positives = 48/110 (43%), Gaps = 9/110 (8%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
++ W KEGD + E L +ETDKATM +P G + +I V G V +G L+
Sbjct: 16 VEVIEWLVKEGDTVEEDQPLVTVETDKATMEIPSPAAGVVKEIKVKVGDT-VSVGGLL-A 73
Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
++E AA A A A+APA A P AA A
Sbjct: 74 VIEAAGAAAA-------AAAPAAAAAPAAAAAAAPAPAAAAPAAAAAAAG 116
Score = 46.7 bits (112), Expect = 1e-05
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 39 KGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVA 86
+GG +IS+LG G F+ IIN P+ IL VG + V +G + V
Sbjct: 461 QGGCFTISSLGGIGGTAFTPIINAPEVAILGVGKSQMKPVWDGKEFVP 508
>gnl|CDD|235571 PRK05704, PRK05704, dihydrolipoamide succinyltransferase;
Validated.
Length = 407
Score = 175 bits (446), Expect = 2e-50
Identities = 108/360 (30%), Positives = 157/360 (43%), Gaps = 71/360 (19%)
Query: 88 PALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKD 147
P L ++ TI +W KK GD + ++L EIETDK + P G L++IL G
Sbjct: 8 PTLPESVTEATIATWHKKPGDAVKRDEVLVEIETDKVVLEVPAPAAGVLSEILAEEGD-T 66
Query: 148 VPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
V +G+++ I E AAGA+A A AAA P AA A
Sbjct: 67 VTVGQVLGRIDEGA---------------AAGAAAAAAAAAAAAAAAPAQAQAAAAAEQS 111
Query: 208 TPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAA 267
SP A++LAAE GLD S++ G+G G +T D+ A A A A
Sbjct: 112 NDA------------LSPAARKLAAENGLDASAV-KGTGKGGRVTKEDVLAALAAAAAAP 158
Query: 268 PSKSAKPTANGPFTD-------LPVSGVRGVIAKRLLQSKQ---------------VIKL 305
+ +A A P +P++ +R IA+RLL+++ V+ L
Sbjct: 159 AAPAAAAPAAAPAPLGARPEERVPMTRLRKTIAERLLEAQNTTAMLTTFNEVDMTPVMDL 218
Query: 306 REQMNKALEKR-GAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNTD 364
R+Q A EK+ G KL F +KA A +R PE N+S I ++ D+ +AV T
Sbjct: 219 RKQYKDAFEKKHGVKLGFMSFFVKAVVEALKRYPEVNASIDGDDIVYHNYYDIGIAVGTP 278
Query: 365 KGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEFQ 424
+GL P++ DAD+ +I + LA KA++GKL E
Sbjct: 279 RGLVVPVLRDADQLSFAEI-------------------EKKIAELAKKARDGKLSIEELT 319
Score = 41.7 bits (99), Expect = 5e-04
Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 4/56 (7%)
Query: 40 GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALP----ALS 91
GGT +I+N G+FG + IINPPQ+ IL + + +R V V P ALS
Sbjct: 320 GGTFTITNGGVFGSLMSTPIINPPQSAILGMHKIKERPVAVNGQIVIRPMMYLALS 375
>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
dihydrolipoamide succinyltransferase (E2 component).
This model describes the TCA cycle 2-oxoglutarate system
E2 component, dihydrolipoamide succinyltransferase. It
is closely related to the pyruvate dehydrogenase E2
component, dihydrolipoamide acetyltransferase. The seed
for this model includes mitochondrial and Gram-negative
bacterial forms. Mycobacterial candidates are highly
derived, differ in having and extra copy of the
lipoyl-binding domain at the N-terminus. They score
below the trusted cutoff, but above the noise cutoff and
above all examples of dihydrolipoamide acetyltransferase
[Energy metabolism, TCA cycle].
Length = 403
Score = 154 bits (391), Expect = 9e-43
Identities = 101/361 (27%), Positives = 160/361 (44%), Gaps = 69/361 (19%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
+ +P L+ ++ GT+ W KK GD + + + EIETDK + +P +G L +IL G
Sbjct: 3 IKVPELAESITEGTVAEWHKKVGDTVKRDENIVEIETDKVVLEVPSPADGVLQEILFKEG 62
Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
V G+++ I+ E +DA A A + AA+ P AA
Sbjct: 63 DT-VESGQVLAILEEG-----------NDATAAPPAKSGEEKEETPAASAAAAPTAAANR 110
Query: 205 PSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGA 264
PS SP A+RLA E G+DLS++ G+G+ G +T D+ K ++A A
Sbjct: 111 PS----------------LSPAARRLAKEHGIDLSAV-PGTGVTGRVTKEDIIKKTEAPA 153
Query: 265 VAA--PSKSAKPTANG---PFTDLPVSGVRGVIAKRLLQSKQ---------------VIK 304
A +A P + ++ +R IA+RL +++ V++
Sbjct: 154 SAQQPAPAAAAKAPANFTRPEERVKMTRLRQRIAERLKEAQNSTAMLTTFNEVDMSAVME 213
Query: 305 LREQMNKALEKR-GAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNT 363
LR++ + EK+ G KL F +KA A +R PE N+ I D+SVAV+T
Sbjct: 214 LRKRYKEEFEKKHGVKLGFMSFFVKAVVAALKRFPEVNAEIDGDDIVYKDYYDISVAVST 273
Query: 364 DKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEF 423
D+GL P+V +AD+ DI ++ L KA++GKL +
Sbjct: 274 DRGLVVPVVRNADRMSFADI-------------------EKEIADLGKKARDGKLTLEDM 314
Query: 424 Q 424
Sbjct: 315 T 315
Score = 38.2 bits (89), Expect = 0.007
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)
Query: 40 GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALP----ALS 91
GGT +I+N G+FG + IINPPQ+ IL + + +R V P ALS
Sbjct: 316 GGTFTITNGGVFGSLMSTPIINPPQSAILGMHGIKERPVAVNGQIEIRPMMYLALS 371
>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
Provisional.
Length = 464
Score = 138 bits (351), Expect = 2e-36
Identities = 57/128 (44%), Positives = 74/128 (57%), Gaps = 3/128 (2%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
+ +PALSPTME GT+ W KKEGDK+ GD++AEIETDKATM E +EG L KILVP G
Sbjct: 5 ILMPALSPTMEEGTLAKWLKKEGDKVKSGDVIAEIETDKATMEVEAVDEGTLGKILVPEG 64
Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
++ V + + +++E + D AA A+A AP AAA P AAPA
Sbjct: 65 TEGVKVNTPIAVLLEEGESAS---DAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPA 121
Query: 205 PSPTPVPS 212
V +
Sbjct: 122 APAAEVAA 129
>gnl|CDD|236999 PRK11854, aceF, pyruvate dehydrogenase dihydrolipoyltransacetylase;
Validated.
Length = 633
Score = 130 bits (329), Expect = 8e-33
Identities = 91/367 (24%), Positives = 144/367 (39%), Gaps = 77/367 (20%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
+ K GDK+ L +E DKA+M P G + +I V G K
Sbjct: 219 VEVTEVMVKVGDKVEAEQSLITVEGDKASMEVPAPFAGTVKEIKVNVGDK---------- 268
Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT- 215
V+ S + F+ A A AP + AA P APA +P K+
Sbjct: 269 -VKTGSLIMRFEV-------EGAAPAAAPAKQEAAAPAPAAAKAEAPAAAPAAKAEGKSE 320
Query: 216 --SGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAK 273
V+A+PL +RLA E G++L+ + G+G G I D+ K A + A
Sbjct: 321 FAENDAYVHATPLVRRLAREFGVNLAKVK-GTGRKGRILKEDVQAYVKDAVKRAEAAPAA 379
Query: 274 PTANGPFTDL-----------------PVSGVRGVIAKRLLQSKQVI------------- 303
A G L + ++ + L ++ +I
Sbjct: 380 AAAGGGGPGLLPWPKVDFSKFGEIEEVELGRIQKISGANLHRNWVMIPHVTQFDKADITE 439
Query: 304 --KLREQMNKALEKR--GAKLSINDFIIKATALASRRVPEANSSW-QDTFIREYHS-VDV 357
R+Q N EKR G K++ FI+KA A A ++P NSS +D V++
Sbjct: 440 LEAFRKQQNAEAEKRKLGVKITPLVFIMKAVAAALEQMPRFNSSLSEDGQRLTLKKYVNI 499
Query: 358 SVAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGK 417
+AV+T GL P+ D +KKG++++ S ++ ++ KA++GK
Sbjct: 500 GIAVDTPNGLVVPVFKDVNKKGIIEL-------------------SRELMDISKKARDGK 540
Query: 418 LQPHEFQ 424
L + Q
Sbjct: 541 LTAGDMQ 547
Score = 58.9 bits (143), Expect = 2e-09
Identities = 31/106 (29%), Positives = 43/106 (40%), Gaps = 16/106 (15%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
+ K GDK+ L +E DKA+M +P+ G + +I V G K V G L+ I
Sbjct: 15 VEVTEILVKVGDKVEAEQSLITVEGDKASMEVPSPQAGVVKEIKVKVGDK-VETGALIMI 73
Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
++ A +APA K AAP P AA
Sbjct: 74 F---------------ESADGAADAAPAQAEEKKEAAPAAAPAAAA 104
Score = 51.9 bits (125), Expect = 4e-07
Identities = 30/97 (30%), Positives = 39/97 (40%), Gaps = 8/97 (8%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
+ K GD + L +E DKA+M P G + +I V G K V G L+ +
Sbjct: 118 VEVTEILVKVGDTVEAEQSLITVEGDKASMEVPAPFAGTVKEIKVNVGDK-VSTGSLIMV 176
Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAA 193
AA AP AA A+APA P A
Sbjct: 177 FEVAGEAPAA-------APAAAEAAAPAAAPAAAAGV 206
Score = 38.8 bits (91), Expect = 0.006
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 40 GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREG 81
GG +IS++G G +F+ I+N P+ IL V + V G
Sbjct: 548 GGCFTISSIGGLGTTHFTPIVNAPEVAILGVSKSAMEPVWNG 589
>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model describes a subset of pyruvate dehydrogenase
complex dihydrolipoamide acetyltransferase specifically
close by both phylogenetic and per cent identity (UPGMA)
trees. Members of this set include two or three copies
of the lipoyl-binding domain. E. coli AceF is a member
of this model, while mitochondrial and some other
bacterial forms belong to a separate model [Energy
metabolism, Pyruvate dehydrogenase].
Length = 546
Score = 129 bits (325), Expect = 1e-32
Identities = 97/367 (26%), Positives = 160/367 (43%), Gaps = 71/367 (19%)
Query: 95 EMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLV 154
E T++ K GD ++ L +E+DKA+M P G + + V G VP G L+
Sbjct: 128 EKVTVIEVLVKVGDTVSADQSLITLESDKASMEVPAPASGVVKSVKVKVGDS-VPTGDLI 186
Query: 155 CIIVENESDVAAFKDFKDDAPPAAGASAPAPPP---PKVAAAPPPPPPKAAPAPSPTPVP 211
+ S P A A A A P P P P AA A +P P
Sbjct: 187 LTLSVAGS-----------TPATAPAPASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQ 235
Query: 212 SQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKS 271
+ + +A+P +RLA E G+DLS++ G+G+ G I D+ + K +V A + +
Sbjct: 236 AGTQNPAKVDHAAPAVRRLAREFGVDLSAV-KGTGIKGRILREDVQRFVKEPSVRAQAAA 294
Query: 272 AKPTANGP---------------FTDLPVSGVRGVIAKRLLQSKQVI------------- 303
A P ++ +S +R + L ++ +I
Sbjct: 295 ASAAGGAPGALPWPNVDFSKFGEVEEVDMSRIRKISGANLTRNWTMIPHVTHFDKADITE 354
Query: 304 --KLREQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSW----QDTFIREYHSVDV 357
R+Q N A+EK G KL++ ++KA A A ++ P+ N+S + +++Y V++
Sbjct: 355 MEAFRKQQNAAVEKEGVKLTVLHILMKAVAAALKKFPKFNASLDLGGEQLILKKY--VNI 412
Query: 358 SVAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGK 417
VAV+T GL P++ D D+KG+ ++ + ++ LA KA++GK
Sbjct: 413 GVAVDTPNGLLVPVIKDVDRKGITEL-------------------ALELSDLAKKARDGK 453
Query: 418 LQPHEFQ 424
L P E Q
Sbjct: 454 LTPDEMQ 460
Score = 49.9 bits (119), Expect = 2e-06
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 8/112 (7%)
Query: 95 EMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLV 154
E G ++ K GDK+ G L +E+DKA+M + G + +I V G +P+G ++
Sbjct: 12 EEGEVIEVLVKPGDKVEAGQSLITLESDKASMEVPSSAAGIIKEIKVKVGDT-LPVGGVI 70
Query: 155 CIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
+ E + A + K +A PA A APAP AA P A +
Sbjct: 71 ATL-EVGAGAQAQAEAKKEAAPAPTAGAPAP------AAQAQAAPAAGQSSG 115
Score = 32.9 bits (75), Expect = 0.35
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 39 KGGTVSISNLGMFGIKNFSAIINPPQACILAV 70
+G +IS+LG G F+ I+N P+ IL V
Sbjct: 460 QGACFTISSLGGIGGTAFTPIVNAPEVAILGV 491
>gnl|CDD|215782 pfam00198, 2-oxoacid_dh, 2-oxoacid dehydrogenases acyltransferase
(catalytic domain). These proteins contain one to three
copies of a lipoyl binding domain followed by the
catalytic domain.
Length = 212
Score = 120 bits (303), Expect = 6e-32
Identities = 42/124 (33%), Positives = 70/124 (56%), Gaps = 20/124 (16%)
Query: 302 VIKLREQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSWQ-DTFIREYHSVDVSVA 360
++ LRE++ + +K G KL+ DF++KA ALA + PE N+SW D I V++ +A
Sbjct: 22 LLALREKLKEDAKKEGLKLTFLDFLVKAVALALKEFPELNASWDGDAEIVYKKYVNIGIA 81
Query: 361 VNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQP 420
V T GL P++ +AD+K L++I + ++K LA +A++GKL+P
Sbjct: 82 VATPDGLIVPVIRNADRKSLLEI-------------------AKEIKDLAERARDGKLKP 122
Query: 421 HEFQ 424
+ Q
Sbjct: 123 EDLQ 126
Score = 71.0 bits (175), Expect = 3e-14
Identities = 24/44 (54%), Positives = 31/44 (70%)
Query: 39 KGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGN 82
+GGT +ISNLGMFG+ F+ IINPPQ IL VG++ +R V
Sbjct: 126 QGGTFTISNLGMFGVTFFTPIINPPQVAILGVGAIRKRPVVVDG 169
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 124 bits (314), Expect = 9e-32
Identities = 79/352 (22%), Positives = 127/352 (36%), Gaps = 92/352 (26%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
+ +P + ++ GT+V W KK GD + E +++ IETDK ++ P G + KI G
Sbjct: 47 IKVPTMGDSISEGTVVEWKKKVGDYVKEDEVICIIETDKVSVDIRAPASGVITKIFAEEG 106
Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAP--APPPPKVAAAPPPPPPKAA 202
V +G + I APPAA +A A P P
Sbjct: 107 D-TVEVGAPLSEIDTGG------------APPAAAPAAAAAAKAEKTTPEKPKAAAPTPE 153
Query: 203 PAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKA 262
P + P P K +
Sbjct: 154 PPAASKPTPPAAA------------------------------------------KPPEP 171
Query: 263 GAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQ---------------VIKLRE 307
A P + A+ T +P+S +R IA+RL S+ +++LR+
Sbjct: 172 APAAKPPPTPVARADPRETRVPMSRMRQRIAERLKASQNTCAMLTTFNECDMSALMELRK 231
Query: 308 QMNKALEKR-GAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNTDKG 366
+ +K+ G KL +KA+ +A +++P N+ I + VD+SVAV T G
Sbjct: 232 EYKDDFQKKHGVKLGFMSAFVKASTIALKKMPIVNAYIDGDEIVYRNYVDISVAVATPTG 291
Query: 367 LFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKL 418
L P++ + + K +I + L LA KA+ KL
Sbjct: 292 LVVPVIRNCENKSFAEI---EKELAD----------------LAEKARNNKL 324
Score = 44.3 bits (105), Expect = 9e-05
Identities = 23/49 (46%), Positives = 32/49 (65%)
Query: 40 GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALP 88
GGT +ISN G+FG + IINPPQ+ IL + ++ +R V GN+ V P
Sbjct: 331 GGTFTISNGGVFGSLMGTPIINPPQSAILGMHAIKKRPVVVGNEIVIRP 379
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 125 bits (315), Expect = 3e-31
Identities = 102/378 (26%), Positives = 164/378 (43%), Gaps = 61/378 (16%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
V +P L ++ GT+ SW K GD + + L E+ TDK +P G L +I P
Sbjct: 129 VKMPELGESVTEGTVTSWLKAVGDTVEVDEPLLEVSTDKVDTEIPSPVAGTLLEIRAPE- 187
Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPP---PKVAAAPPPPPPKA 201
V +G ++ II + + A ++ P+ S PAP P AA PP P A
Sbjct: 188 DDTVEVGTVLAII-GDANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPA 246
Query: 202 APAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASK 261
+ + +SG + Y +PL ++LA +KG+DLS++ G+G+ G I D+ A+K
Sbjct: 247 PAKTAAPAAAAPVSSGDSGPYVTPLVRKLAKDKGVDLSTV-KGTGVGGRIRKQDVLAAAK 305
Query: 262 AG-----AVAAPSKSAKPTANGPFTDLP-------------VSGVRGVIAKRLLQSKQ-- 301
A A AAP+ +A P A ++ +R + A + ++S Q
Sbjct: 306 AAEEARAAAAAPAAAAAPAAPAAAAKPAEPDTAKLRGTTQKMNRIRQITADKTIESLQTS 365
Query: 302 -------------VIKLREQMNKA-LEKRGAKLSINDFIIKATALASRRVPEANSSW-QD 346
V LR + LEK G L+ F ++A A + P N+S+ +
Sbjct: 366 AQLTQVHEVDMTRVAALRARAKNDFLEKNGVNLTFLPFFVQAVTEALKAHPNVNASYNAE 425
Query: 347 TFIREYH-SVDVSVAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESND 405
T YH V +AV+T +GL P++ +A L + +
Sbjct: 426 TKEVTYHDVEHVGIAVDTPRGLLVPVIHNAGDLSLPGL-------------------AKA 466
Query: 406 VKTLAAKAKEGKLQPHEF 423
+ LAA+A++ KL+P E
Sbjct: 467 INDLAARARDNKLKPDEL 484
Score = 65.0 bits (158), Expect = 3e-11
Identities = 44/142 (30%), Positives = 60/142 (42%), Gaps = 22/142 (15%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
V +PAL ++ GT+ SW K GD + + L E+ TDK +P G L +I P
Sbjct: 5 VKMPALGESVTEGTVTSWLKAVGDTVEADEPLLEVSTDKVDTEIPSPAAGVLLEIRAPE- 63
Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAP- 203
V +G ++ II E P S PAP P+ AAP P P AP
Sbjct: 64 DDTVEVGGVLAIIGE----------------PGEAGSEPAPAAPEPEAAPEPEAPAPAPT 107
Query: 204 ----APSPTPVPSQKTSGGTRV 221
AP+P + + T V
Sbjct: 108 PAAEAPAPAAPQAGGSGEATEV 129
Score = 32.7 bits (74), Expect = 0.46
Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 4/52 (7%)
Query: 40 GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLV----REGNDRVAL 87
GGT +I+N+G G + I+NPPQA IL G++ +R +G + +A+
Sbjct: 486 GGTFTITNIGSGGALFDTPILNPPQAAILGTGAIVKRPRVIKDEDGGESIAI 537
>gnl|CDD|133458 cd06849, lipoyl_domain, Lipoyl domain of the dihydrolipoyl
acyltransferase component (E2) of 2-oxo acid
dehydrogenases. 2-oxo acid dehydrogenase multienzyme
complexes, like pyruvate dehydrogenase (PDH),
2-oxoglutarate dehydrogenase (OGDH) and branched-chain
2-oxo acid dehydrogenase (BCDH), contain at least three
different enzymes, 2-oxo acid dehydrogenase (E1),
dihydrolipoyl acyltransferase (E2) and dihydrolipoamide
dehydrogenase (E3) and play a key role in redox
regulation. E2, the central component of the complex,
catalyzes the transfer of the acyl group of CoA from E1
to E3 via reductive acetylation of a lipoyl group
covalently attached to a lysine residue.
Length = 74
Score = 102 bits (256), Expect = 7e-27
Identities = 38/71 (53%), Positives = 49/71 (69%), Gaps = 1/71 (1%)
Query: 87 LPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
+P L +M GTIV W KEGD + EGD+LAE+ETDKAT+ E P G LAKILV G
Sbjct: 5 MPDLGESMTEGTIVEWLVKEGDSVEEGDVLAEVETDKATVEVEAPAAGVLAKILVEEGDT 64
Query: 147 DVPIGKLVCII 157
VP+G+++ +I
Sbjct: 65 -VPVGQVIAVI 74
>gnl|CDD|133456 cd06663, Biotinyl_lipoyl_domains, Biotinyl_lipoyl_domains are
present in biotin-dependent carboxylases/decarboxylases,
the dihydrolipoyl acyltransferase component (E2) of
2-oxo acid dehydrogenases, and the H-protein of the
glycine cleavage system (GCS). These domains transport
CO2, acyl, or methylamine, respectively, between
components of the complex/protein via a biotinyl or
lipoyl group, which is covalently attached to a highly
conserved lysine residue.
Length = 73
Score = 75.2 bits (185), Expect = 4e-17
Identities = 31/73 (42%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
+ +P L+ + GT+V W KK GDK+ +GD+LAEIE KAT E P+ G + K+LV G
Sbjct: 2 ILIPDLAQHLGDGTVVKWLKKVGDKVKKGDVLAEIEAMKATSDVEAPKSGTVKKVLVKEG 61
Query: 145 SKDVPIGKLVCII 157
+K V + I
Sbjct: 62 TK-VEGDTPLVKI 73
>gnl|CDD|184875 PRK14875, PRK14875, acetoin dehydrogenase E2 subunit
dihydrolipoyllysine-residue acetyltransferase;
Provisional.
Length = 371
Score = 78.1 bits (193), Expect = 7e-16
Identities = 31/110 (28%), Positives = 51/110 (46%), Gaps = 8/110 (7%)
Query: 84 RVALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPA 143
+ +P +M G + W +EGD++ +GD L ++ETDK T E P G L + +
Sbjct: 4 PITMPKWGLSMTEGKVAGWLVQEGDEVEKGDELLDVETDKITNEVEAPAAGTLRRQVAQE 63
Query: 144 GSKDVPIGKLVCIIVENE-SD------VAAFKDFKDDAPPAAGASAPAPP 186
G +P+G L+ ++ + E SD +A F + PAP
Sbjct: 64 GE-TLPVGALLAVVADAEVSDAEIDAFIAPFARRFAPEGIDEEDAGPAPR 112
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 78.3 bits (192), Expect = 1e-15
Identities = 89/349 (25%), Positives = 137/349 (39%), Gaps = 89/349 (25%)
Query: 70 VGSLSQRLVR----EGNDRV--ALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDK 123
V S QR VR E D V +P + ++ GT+ ++ KK G+++ + +A+IETDK
Sbjct: 73 VSSTLQRWVRPFSSESGDTVEAVVPHMGESITDGTLATFLKKPGERVQADEAIAQIETDK 132
Query: 124 ATMGFETPEEGYLAKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAP 183
T+ +P G + + LV G P G V II ++E AA P
Sbjct: 133 VTIDIASPASGVIQEFLVKEGDTVEP-GTKVAIISKSED--------------AASQVTP 177
Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGA 243
+ P+ P PP + P +V ++P+A++
Sbjct: 178 SQKIPETTDPKPSPPAEDKQKP--------------KVESAPVAEK-------------- 209
Query: 244 GSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTD----LPVSGVRGVIAKRLLQS 299
KA + P K + P + +P++ +R +A RL S
Sbjct: 210 ----------------PKAPSSPPPPKQSAKEPQLPPKERERRVPMTRLRKRVATRLKDS 253
Query: 300 KQ---------------VIKLREQMNKA-LEKRGAKLSINDFIIKATALASRRVPEANS- 342
+ ++KLR Q A EK G KL + IKA A + P N+
Sbjct: 254 QNTFALLTTFNEVDMTNLMKLRSQYKDAFYEKHGVKLGLMSGFIKAAVSALQHQPVVNAV 313
Query: 343 -SWQDTFIREYHSVDVSVAVNTDKGLFTPIVFDADKKGLVDISNDSRSL 390
D R+Y VD+S+AV T KGL P++ ADK +I L
Sbjct: 314 IDGDDIIYRDY--VDISIAVGTSKGLVVPVIRGADKMNFAEIEKTINGL 360
Score = 36.3 bits (83), Expect = 0.027
Identities = 19/51 (37%), Positives = 30/51 (58%)
Query: 40 GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALPAL 90
GG+ ++SN G++G + IINPPQ+ IL + S+ R + G V P +
Sbjct: 376 GGSFTVSNGGVYGSLISTPIINPPQSAILGMHSIVSRPMVVGGSVVPRPMM 426
>gnl|CDD|201182 pfam00364, Biotin_lipoyl, Biotin-requiring enzyme. This family
covers two Prosite entries, the conserved lysine residue
binds biotin in one group and lipoic acid in the other.
Note that the HMM does not currently recognise the
Glycine cleavage system H proteins.
Length = 73
Score = 63.0 bits (154), Expect = 8e-13
Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 2/73 (2%)
Query: 85 VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
+ P + +++ GT W K GDK+ G +L E+E K M P G + +ILV G
Sbjct: 3 IKSPMIGESVKEGT-AEWLVKVGDKVKAGQVLCEVEAMKMEMEIPAPVAGVVKEILVKEG 61
Query: 145 SKDVPIGKLVCII 157
V +G + I
Sbjct: 62 DT-VEVGDPLAKI 73
>gnl|CDD|184847 PRK14843, PRK14843, dihydrolipoamide acetyltransferase;
Provisional.
Length = 347
Score = 67.2 bits (164), Expect = 3e-12
Identities = 58/230 (25%), Positives = 102/230 (44%), Gaps = 49/230 (21%)
Query: 224 SPLAKRLAAEKGLDLSSIGAGSGLFGSITSAD----LSKASKAGAVAAPSKSAK------ 273
SPLAKR+A E + I G+G G I D L + + ++ +P++ K
Sbjct: 52 SPLAKRIALEHNIAWQEI-QGTGHRGKIMKKDVLALLPENIENDSIKSPAQIEKVEEVPD 110
Query: 274 -PTANGPFTDLPVSGVRGVIAKRLLQS---------------KQVIKLREQ-MNKALEKR 316
T G +P++ +R VIA+R+++S +++ LR++ + +E
Sbjct: 111 NVTPYGEIERIPMTPMRKVIAQRMVESYLTAPTFTLNYEVDMTEMLALRKKVLEPIMEAT 170
Query: 317 GAKLSINDFIIKATALASRRVPEANSSWQD--TFIREYHSVDVSVAVNTDKGLFTPIVFD 374
G K ++ D + A + P N+S + I ++ V++++AV D GL TP+V++
Sbjct: 171 GKKTTVTDLLSLAVVKTLMKHPYINASLTEDGKTIITHNYVNLAMAVGMDNGLMTPVVYN 230
Query: 375 ADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEFQ 424
A+K L SE K + + +GKL P E Q
Sbjct: 231 AEKMSL-------------------SELVVAFKDVIGRTLDGKLAPSELQ 261
Score = 46.4 bits (110), Expect = 2e-05
Identities = 24/62 (38%), Positives = 35/62 (56%)
Query: 33 LAGLRKKGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALPALSP 92
LA + T +ISNLGMFG+++F IIN P + IL V S ++ V + V P +S
Sbjct: 255 LAPSELQNSTFTISNLGMFGVQSFGPIINQPNSAILGVSSTIEKPVVVNGEIVIRPIMSL 314
Query: 93 TM 94
+
Sbjct: 315 GL 316
Score = 36.4 bits (84), Expect = 0.023
Identities = 45/170 (26%), Positives = 74/170 (43%), Gaps = 18/170 (10%)
Query: 223 ASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA---NGP 279
A+P A++LA + G++L + +GSG G + D+ + K V S AK A N
Sbjct: 8 ATPAARKLADDLGINLYDV-SGSGANGRVHKEDV-ETYKDTNVVRISPLAKRIALEHNIA 65
Query: 280 FTDLPVSGVRGVIAKR----LL---QSKQVIKLREQMNKALEKRGAKLSINDF-IIKATA 331
+ ++ +G RG I K+ LL IK Q+ K E + I T
Sbjct: 66 WQEIQGTGHRGKIMKKDVLALLPENIENDSIKSPAQIEKVEEVPDNVTPYGEIERIPMTP 125
Query: 332 L---ASRRVPEANSSWQDTFIREYHSVDVSVAVNTDKGLFTPIVFDADKK 378
+ ++R+ E+ + TF Y VD++ + K + PI+ KK
Sbjct: 126 MRKVIAQRMVESYLT-APTFTLNY-EVDMTEMLALRKKVLEPIMEATGKK 173
>gnl|CDD|215289 PLN02528, PLN02528, 2-oxoisovalerate dehydrogenase E2 component.
Length = 416
Score = 67.4 bits (165), Expect = 3e-12
Identities = 67/320 (20%), Positives = 130/320 (40%), Gaps = 63/320 (19%)
Query: 99 IVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGK-LVCII 157
++ W KEGD++ E L E+++DKAT+ + +G +A+I G V +G+ L+ I+
Sbjct: 15 LLRWFVKEGDQVEEFQPLCEVQSDKATIEITSRYKGKVAQINFSPGDI-VKVGETLLKIM 73
Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
VE+ + + P + ++ S
Sbjct: 74 VEDSQHLRS---------------------------DSLLLPTDSSNIVSLAESDERGSN 106
Query: 218 GTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSAD-LSKASKAGAVAAPS------- 269
+ V ++P + LA + G+DL+ I G+G G + D L A++ G V S
Sbjct: 107 LSGVLSTPAVRHLAKQYGIDLNDI-LGTGKDGRVLKEDVLKYAAQKGVVKDSSSAEEATI 165
Query: 270 ---------KSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQV--------------IKLR 306
S + +P+ G + + K + + +V ++L+
Sbjct: 166 AEQEEFSTSVSTPTEQSYEDKTIPLRGFQRAMVKTMTAAAKVPHFHYVEEINVDALVELK 225
Query: 307 EQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSW-QDTF-IREYHSVDVSVAVNTD 364
+ K + F+IK+ ++A + P NS + ++T IR S ++ VA+ T+
Sbjct: 226 ASFQENNTDPTVKHTFLPFLIKSLSMALSKYPLLNSCFNEETSEIRLKGSHNIGVAMATE 285
Query: 365 KGLFTPIVFDADKKGLVDIS 384
GL P + + L++I+
Sbjct: 286 HGLVVPNIKNVQSLSLLEIT 305
Score = 42.0 bits (99), Expect = 4e-04
Identities = 19/50 (38%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query: 40 GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALPA 89
GGT+++SN+G G K S ++N P+ I+A+G + Q++ R +D PA
Sbjct: 327 GGTITLSNIGAIGGKFGSPVLNLPEVAIIALGRI-QKVPRFVDDGNVYPA 375
>gnl|CDD|237002 PRK11857, PRK11857, dihydrolipoamide acetyltransferase; Reviewed.
Length = 306
Score = 61.3 bits (149), Expect = 2e-10
Identities = 62/246 (25%), Positives = 107/246 (43%), Gaps = 68/246 (27%)
Query: 220 RVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADL--------SKASKAGAVAAPS-- 269
++ A+P+A+ LA + G+D+S + GSG G I + D+ S + A A + S
Sbjct: 1 KILATPIARALAKKLGIDISLL-KGSGRDGKILAEDVENFIKSLKSAPTPAEAASVSSAQ 59
Query: 270 ---KSAKPTANGPFTDL---PVSGVRGVIAKRLLQSKQVI---------------KLREQ 308
K+A P A P + V+ +R IA+ + S + LR+
Sbjct: 60 QAAKTAAPAAAPPKLEGKREKVAPIRKAIARAMTNSWSNVAYVNLVNEIDMTKLWDLRKS 119
Query: 309 MNKALEK-RGAKLSINDFIIKATALASRRVP-------EANSS--WQDTFIREYHSVDVS 358
+ + K G KL+ FI KA +A + P EA S + DT +++
Sbjct: 120 VKDPVLKTEGVKLTFLPFIAKAILIALKEFPIFAAKYDEATSELVYPDT-------LNLG 172
Query: 359 VAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKL 418
+AV+T+ GL P++ +A K +V+I + ++ LA A+E K+
Sbjct: 173 IAVDTEAGLMVPVIKNAQKLSIVEI-------------------AKEISRLAKAARERKI 213
Query: 419 QPHEFQ 424
+P E +
Sbjct: 214 KPDEMK 219
Score = 29.4 bits (66), Expect = 3.5
Identities = 10/47 (21%), Positives = 13/47 (27%), Gaps = 3/47 (6%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
DV ++F A + A AA P A P
Sbjct: 33 AEDV---ENFIKSLKSAPTPAEAASVSSAQQAAKTAAPAAAPPKLEG 76
>gnl|CDD|202412 pfam02817, E3_binding, e3 binding domain. This family represents a
small domain of the E2 subunit of 2-oxo-acid
dehydrogenases responsible for the binding of the E3
subunit.
Length = 39
Score = 50.5 bits (122), Expect = 8e-09
Identities = 21/39 (53%), Positives = 28/39 (71%), Gaps = 1/39 (2%)
Query: 220 RVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSK 258
RV+ASP A++LAAEKG+DLS + G+G G IT D+
Sbjct: 2 RVFASPAARKLAAEKGIDLSQV-KGTGPGGRITKEDVLA 39
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 49.1 bits (118), Expect = 3e-06
Identities = 25/64 (39%), Positives = 28/64 (43%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
A AA ASAPA P A A P P P AA AP+ P P+ + A P A A
Sbjct: 47 AAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAA 106
Query: 232 AEKG 235
A
Sbjct: 107 APAA 110
Score = 46.0 bits (110), Expect = 3e-05
Identities = 18/58 (31%), Positives = 22/58 (37%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
AP A+APA PP AAA P A PA + P+ A +A
Sbjct: 71 APAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRGAAAAVA 128
Score = 43.0 bits (102), Expect = 3e-04
Identities = 20/73 (27%), Positives = 26/73 (35%), Gaps = 5/73 (6%)
Query: 167 FKDFKDDAPPAAGASAP-----APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
F D+ + A A+A A P AA P P AP + P K +
Sbjct: 33 FADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA 92
Query: 222 YASPLAKRLAAEK 234
A+P A AA
Sbjct: 93 AAAPAAPPAAAAA 105
Score = 42.2 bits (100), Expect = 5e-04
Identities = 14/45 (31%), Positives = 16/45 (35%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
A A PA AA PP AA AP+ V + T
Sbjct: 74 PPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118
Score = 41.8 bits (99), Expect = 6e-04
Identities = 14/60 (23%), Positives = 19/60 (31%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
AP AA A+ P A P PA + + A+P A + E
Sbjct: 57 APAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116
Score = 41.8 bits (99), Expect = 6e-04
Identities = 14/46 (30%), Positives = 15/46 (32%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
AP A A A AAPP AAPA + G
Sbjct: 77 AAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRG 122
Score = 41.4 bits (98), Expect = 9e-04
Identities = 12/57 (21%), Positives = 15/57 (26%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
A A + P AA P P AA A P + + L
Sbjct: 64 AKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPL 120
Score = 40.6 bits (96), Expect = 0.001
Identities = 43/197 (21%), Positives = 66/197 (33%), Gaps = 48/197 (24%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
++++F D P + A+ A AAA P AA AP+ P P+ +
Sbjct: 27 PSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAA-PAPAPPAAAAPAAPPK 85
Query: 225 PLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
P A A+ A A P+ +A +
Sbjct: 86 PAAA-----------------------------AAAAAAPAAPPAAAAAAAPAAAAVEDE 116
Query: 285 VSGVRGV---IAKRLLQSKQV----------IKL----REQMNKALEK-RGAKLSINDFI 326
V+ +RG +AK + S +V KL R +N L++ RG K+S I
Sbjct: 117 VTPLRGAAAAVAKNMDASLEVPTATSVRAVPAKLLIDNRIVINNHLKRTRGGKVSFTHLI 176
Query: 327 IKATALASRRVPEANSS 343
A A + P N
Sbjct: 177 GYALVQALKAFPNMNRH 193
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 47.0 bits (112), Expect = 1e-05
Identities = 22/61 (36%), Positives = 25/61 (40%), Gaps = 1/61 (1%)
Query: 174 APPAAGASAPAPPPPKVAA-APPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
A P A A A AP AA AP P AA AP+ P + A+P A AA
Sbjct: 382 ARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAA 441
Query: 233 E 233
Sbjct: 442 P 442
Score = 45.9 bits (109), Expect = 3e-05
Identities = 18/51 (35%), Positives = 22/51 (43%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
AA A APA P A+AP PP A PAP P + + A+
Sbjct: 394 VAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
Score = 42.4 bits (100), Expect = 4e-04
Identities = 31/123 (25%), Positives = 43/123 (34%), Gaps = 10/123 (8%)
Query: 166 AFK-DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
AFK +A A PA P AA P AAPAP+ P + ++ A+
Sbjct: 363 AFKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAP-AAAASAPAAPPAAA 421
Query: 225 PLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
P A A ++ A A ++ A A AAP A P P +
Sbjct: 422 PPAPVAAPAAAAPAAAPAAA--------PAAVALAPAPPAQAAPETVAIPVRVAPEPAVA 473
Query: 285 VSG 287
+
Sbjct: 474 SAA 476
Score = 38.9 bits (91), Expect = 0.004
Identities = 15/47 (31%), Positives = 17/47 (36%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
APPA A P +VA P APA +P T G
Sbjct: 448 APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
Score = 35.5 bits (82), Expect = 0.061
Identities = 20/68 (29%), Positives = 27/68 (39%), Gaps = 5/68 (7%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV-----PSQKTSGGTRVYASPLA 227
AP AA +A P A P PP +AAP PV P+ ++ A A
Sbjct: 427 AAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAA 486
Query: 228 KRLAAEKG 235
+ E+G
Sbjct: 487 RLTPTEEG 494
>gnl|CDD|133459 cd06850, biotinyl_domain, The biotinyl-domain or biotin carboxyl
carrier protein (BCCP) domain is present in all
biotin-dependent enzymes, such as acetyl-CoA
carboxylase, pyruvate carboxylase, propionyl-CoA
carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA
carboxylase, oxaloacetate decarboxylase,
methylmalonyl-CoA decarboxylase, transcarboxylase and
urea amidolyase. This domain functions in transferring
CO2 from one subsite to another, allowing carboxylation,
decarboxylation, or transcarboxylation. During this
process, biotin is covalently attached to a specific
lysine.
Length = 67
Score = 40.1 bits (95), Expect = 9e-05
Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 1/61 (1%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
GT+V KEGDK+ G LA +E K P G + +ILV G + V G+L+ +
Sbjct: 8 GTVVKVLVKEGDKVEAGQPLAVLEAMKMENEVTAPVAGVVKEILVKEGDQ-VEAGQLLVV 66
Query: 157 I 157
I
Sbjct: 67 I 67
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 44.2 bits (105), Expect = 1e-04
Identities = 17/40 (42%), Positives = 20/40 (50%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
A PAA + A P AA P P+ APAP+P P P
Sbjct: 404 AAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPS 443
Score = 38.8 bits (91), Expect = 0.005
Identities = 15/39 (38%), Positives = 18/39 (46%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
A A +A AP A A P P AAPA + P P+
Sbjct: 388 AGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPA 426
Score = 38.4 bits (90), Expect = 0.006
Identities = 16/52 (30%), Positives = 19/52 (36%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
AP A +APA P A P P A A P P Q G + +
Sbjct: 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAA 735
Score = 37.7 bits (88), Expect = 0.011
Identities = 18/46 (39%), Positives = 20/46 (43%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
AP A A APA P AA P PP APA +P + G
Sbjct: 460 AAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
Score = 37.7 bits (88), Expect = 0.012
Identities = 20/41 (48%), Positives = 20/41 (48%), Gaps = 3/41 (7%)
Query: 174 APPAAGASAPAPPPPKVAAAPP---PPPPKAAPAPSPTPVP 211
AP A A APAP PP A P P P A APS P P
Sbjct: 428 APQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAP 468
Score = 37.3 bits (87), Expect = 0.017
Identities = 18/61 (29%), Positives = 22/61 (36%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
A PAA APAP P A P P APA +P + + A + A
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPA 734
Query: 233 E 233
Sbjct: 735 A 735
Score = 36.9 bits (86), Expect = 0.019
Identities = 15/42 (35%), Positives = 17/42 (40%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
A + APAP A P P P AAPAP+ P
Sbjct: 457 PPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498
Score = 36.9 bits (86), Expect = 0.020
Identities = 17/55 (30%), Positives = 20/55 (36%), Gaps = 1/55 (1%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
AP A A AP PPP AAP+ P P P+ +P A
Sbjct: 433 PAPAPAPAPPSPAGNAPAGGAPSPPPA-AAPSAQPAPAPAAAPEPTAAPAPAPPA 486
Score = 36.9 bits (86), Expect = 0.022
Identities = 14/39 (35%), Positives = 16/39 (41%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
AP AA A APA P A P P P + +P
Sbjct: 415 AAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGA 453
Score = 36.9 bits (86), Expect = 0.023
Identities = 16/42 (38%), Positives = 18/42 (42%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
A A PAP P AA P + APAP+ TP Q
Sbjct: 672 KAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQ 713
Score = 36.1 bits (84), Expect = 0.037
Identities = 13/39 (33%), Positives = 14/39 (35%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
A +APAP PP A P AAPA
Sbjct: 470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
Score = 36.1 bits (84), Expect = 0.042
Identities = 14/57 (24%), Positives = 21/57 (36%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
+ A + PA P A P P+ AP+P P + + A+P A
Sbjct: 446 GNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAA 502
Score = 35.7 bits (83), Expect = 0.053
Identities = 21/103 (20%), Positives = 30/103 (29%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
PPA +S P + AA P P A P++ ++ A+P
Sbjct: 599 GPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVA 658
Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
G A A A AAP+ +A
Sbjct: 659 VPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701
Score = 35.7 bits (83), Expect = 0.055
Identities = 14/54 (25%), Positives = 16/54 (29%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
A PA +A AP A AP P+ P P S G
Sbjct: 623 APAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676
Score = 34.6 bits (80), Expect = 0.11
Identities = 37/170 (21%), Positives = 53/170 (31%), Gaps = 6/170 (3%)
Query: 116 LAEIETDKATMGFETPEEGYL------AKILVPAGSKDVPIGKLVCIIVENESDVAAFKD 169
+ + D +GF T A++LV A ++++ V +V A +
Sbjct: 540 VLGVRGDTLVLGFSTGGLARRFASPGNAEVLVTALAEELGGDWQVEAVVGPAPGAAGGEG 599
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
A A P P AAP P P A A + +
Sbjct: 600 PPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
Query: 230 LAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
A G D AG + A A A A AAP++ A A P
Sbjct: 660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP 709
Score = 33.8 bits (78), Expect = 0.18
Identities = 20/55 (36%), Positives = 23/55 (41%), Gaps = 1/55 (1%)
Query: 172 DDAPPAAGASAPA-PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
D P AG +APA PPP AAP P A P+P P + A P
Sbjct: 667 DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721
Score = 33.0 bits (76), Expect = 0.35
Identities = 23/97 (23%), Positives = 29/97 (29%), Gaps = 1/97 (1%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
AP A A A A P A PP A P+ P P A+ L
Sbjct: 685 PAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP-PQAAQGASAPSPAADDPVPLPP 743
Query: 233 EKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPS 269
E GA + A + + A + PS
Sbjct: 744 EPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
Score = 32.7 bits (75), Expect = 0.45
Identities = 16/40 (40%), Positives = 16/40 (40%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
DD P AGA A PPPP A A P PS
Sbjct: 746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785
Score = 32.3 bits (74), Expect = 0.56
Identities = 12/42 (28%), Positives = 14/42 (33%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
P AA A AP P AA P P A ++
Sbjct: 474 PEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRER 515
Score = 31.9 bits (73), Expect = 0.70
Identities = 14/40 (35%), Positives = 16/40 (40%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P A A P PP P AAAP PP + P+
Sbjct: 748 PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAE 787
Score = 31.1 bits (71), Expect = 1.2
Identities = 9/46 (19%), Positives = 13/46 (28%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
PA A+ A PPP + AP+ +
Sbjct: 760 PPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVA 805
Score = 31.1 bits (71), Expect = 1.2
Identities = 11/40 (27%), Positives = 13/40 (32%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
D PP + PPPP A P P+P
Sbjct: 746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785
Score = 30.7 bits (70), Expect = 1.9
Identities = 12/43 (27%), Positives = 14/43 (32%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
P A AP P AP P AA P P ++
Sbjct: 743 PEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785
Score = 30.7 bits (70), Expect = 2.0
Identities = 14/62 (22%), Positives = 19/62 (30%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
PP A APA P PP + + + V L + L A
Sbjct: 756 AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEEELGA 815
Query: 233 EK 234
+K
Sbjct: 816 KK 817
Score = 30.3 bits (69), Expect = 2.1
Identities = 12/59 (20%), Positives = 17/59 (28%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
A A A A P PAP+P + + + +P A A
Sbjct: 654 PKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712
Score = 30.0 bits (68), Expect = 3.1
Identities = 19/80 (23%), Positives = 24/80 (30%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
K A P A P AAP PPP APA P + A+P A +
Sbjct: 655 KHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714
Query: 231 AAEKGLDLSSIGAGSGLFGS 250
+ S +
Sbjct: 715 DDPAAQPPQAAQGASAPSPA 734
Score = 30.0 bits (68), Expect = 3.2
Identities = 11/43 (25%), Positives = 11/43 (25%)
Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
D D A P P P AA PPP
Sbjct: 746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
Score = 30.0 bits (68), Expect = 3.3
Identities = 16/42 (38%), Positives = 17/42 (40%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
D P P P A PPPP P A AP+ P PS
Sbjct: 737 DPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
Score = 29.6 bits (67), Expect = 3.5
Identities = 24/107 (22%), Positives = 28/107 (26%), Gaps = 3/107 (2%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
AP A A APA A A P P A P P P AA
Sbjct: 619 AAPAAPAAPAPAGAAAAPAEASAAPAPGVAA-PEHHPKHVAVPDASDGGDGWPAKAGGAA 677
Query: 233 EKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
+ A +A A A ++ P A P
Sbjct: 678 PAAPPPAPAPAAPA--APAGAAPAQPAPAPAATPPAGQADDPAAQPP 722
Score = 29.6 bits (67), Expect = 3.7
Identities = 16/56 (28%), Positives = 17/56 (30%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
AP A +APA P P AA PP A P PL
Sbjct: 687 PAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP 742
Score = 28.8 bits (65), Expect = 6.8
Identities = 13/44 (29%), Positives = 14/44 (31%)
Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
AP P AAAPPP PP + PS
Sbjct: 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPS 792
Score = 28.4 bits (64), Expect = 8.1
Identities = 13/61 (21%), Positives = 18/61 (29%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
+ A AP A + PAP P A P A P + + +
Sbjct: 679 AAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDP 738
Query: 221 V 221
V
Sbjct: 739 V 739
Score = 28.4 bits (64), Expect = 8.7
Identities = 14/42 (33%), Positives = 17/42 (40%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
D A A+ A P A P P PP+ P P P+Q
Sbjct: 716 DPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQ 757
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 43.7 bits (103), Expect = 1e-04
Identities = 31/132 (23%), Positives = 41/132 (31%), Gaps = 6/132 (4%)
Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPP---PPPKAAPAPSPTPVPSQKTSGGTR 220
+AA + P A A APAP AAA P P P AA A + + +
Sbjct: 433 LAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPA 492
Query: 221 VYASPLAKRLA---AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
P + L A + SI + A AP+ +A P
Sbjct: 493 DDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPR 552
Query: 278 GPFTDLPVSGVR 289
PV R
Sbjct: 553 AAAATEPVVAPR 564
Score = 43.3 bits (102), Expect = 2e-04
Identities = 31/109 (28%), Positives = 41/109 (37%), Gaps = 5/109 (4%)
Query: 173 DAPPAAGASAPAPPPPKVAAAP-----PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
A PA A+AP P AAAP P P AAPA +P + + SP
Sbjct: 371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430
Query: 228 KRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
+ LAA + G + +A + A A A P +A A
Sbjct: 431 EALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA 479
Score = 35.2 bits (81), Expect = 0.074
Identities = 25/113 (22%), Positives = 35/113 (30%), Gaps = 10/113 (8%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
A A A A VAAAP P + ++ G +P A A
Sbjct: 401 APPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAA 460
Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
A A G A + A+ A A A + + P+ +LP
Sbjct: 461 A----------ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELP 503
Score = 33.3 bits (76), Expect = 0.25
Identities = 12/45 (26%), Positives = 15/45 (33%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
P A AA P P AA P P P + ++ G
Sbjct: 529 PATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGL 573
Score = 32.2 bits (73), Expect = 0.68
Identities = 13/53 (24%), Positives = 19/53 (35%)
Query: 163 DVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
+ A+ + DA PA + P P APA +P P + T
Sbjct: 505 EFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAAT 557
Score = 30.6 bits (69), Expect = 1.8
Identities = 16/62 (25%), Positives = 23/62 (37%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
DDA + A P P+ AAA P P S + +P + A + LA
Sbjct: 535 DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLPVRGLA 594
Query: 232 AE 233
+
Sbjct: 595 QQ 596
Score = 29.5 bits (66), Expect = 4.2
Identities = 11/48 (22%), Positives = 14/48 (29%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
+ F+ AP A A AP P PP +A
Sbjct: 529 PATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 43.0 bits (101), Expect = 2e-04
Identities = 17/38 (44%), Positives = 18/38 (47%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
AA A APA PPP A PP P P +P P P
Sbjct: 48 PSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85
Score = 40.6 bits (95), Expect = 0.001
Identities = 23/59 (38%), Positives = 25/59 (42%), Gaps = 5/59 (8%)
Query: 174 APPAAGASAPAPPPPK-----VAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
A PA + PAPPPP A AP PPP PA P P P + A P A
Sbjct: 32 ALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNA 90
Score = 38.3 bits (89), Expect = 0.005
Identities = 17/38 (44%), Positives = 18/38 (47%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
PP +AP P P A PPPPP AP P PV
Sbjct: 60 PPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVD 97
Score = 38.0 bits (88), Expect = 0.006
Identities = 17/37 (45%), Positives = 18/37 (48%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
D P + A P AA PPPPPP A PAP P
Sbjct: 40 DPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDD 76
Score = 36.8 bits (85), Expect = 0.016
Identities = 17/60 (28%), Positives = 22/60 (36%), Gaps = 1/60 (1%)
Query: 174 APPAAGASAPAPPPPKVAAAP-PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
PP+ A+APAP P P PP P+ + P P + P A A
Sbjct: 46 PPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPA 105
Score = 34.9 bits (80), Expect = 0.065
Identities = 21/47 (44%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPP--PKAAPAPSPTPVPSQKTSGG 218
AP +A PPPP APPPPP P A P P+P P GG
Sbjct: 71 APQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPGRIDNAVGG 117
Score = 34.1 bits (78), Expect = 0.11
Identities = 19/49 (38%), Positives = 20/49 (40%), Gaps = 5/49 (10%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPP-----PKAAPAPSPTPVPSQKTSG 217
PP A+ PAP P AAPPPPP P P P P G
Sbjct: 61 PPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPG 109
Score = 31.8 bits (72), Expect = 0.56
Identities = 17/39 (43%), Positives = 17/39 (43%), Gaps = 5/39 (12%)
Query: 178 AGASAPAPPPPKVAAA-----PPPPPPKAAPAPSPTPVP 211
AGASA A P A A PPPP AA P P
Sbjct: 24 AGASAVAIALPATANADPAPPPPPPSTAAAAPAPAAPPP 62
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 43.8 bits (103), Expect = 2e-04
Identities = 28/111 (25%), Positives = 37/111 (33%), Gaps = 2/111 (1%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
PP A+ A P A APP P A P T+ G A P A
Sbjct: 2721 LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP 2780
Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
+ L ++ + S S+ S + A +A P A P LP
Sbjct: 2781 RRLTRPAVASLSESRESLPSP--WDPADPPAAVLAPAAALPPAASPAGPLP 2829
Score = 41.5 bits (97), Expect = 0.001
Identities = 21/60 (35%), Positives = 24/60 (40%), Gaps = 3/60 (5%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT---PVPSQKTSGGTRVYASPLAKR 229
D P A A A A PP A P PPP A P P P P GG+ + +R
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRR 2865
Score = 38.8 bits (90), Expect = 0.006
Identities = 17/64 (26%), Positives = 22/64 (34%), Gaps = 7/64 (10%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
DD G + P+P PP P P P PSP+P ++ P R
Sbjct: 2603 DDRGDPRGPAPPSPLPP----DTHAPDP---PPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
Query: 232 AEKG 235
G
Sbjct: 2656 PAPG 2659
Score = 37.6 bits (87), Expect = 0.013
Identities = 18/47 (38%), Positives = 23/47 (48%), Gaps = 1/47 (2%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPP-PKAAPAPSPTPVPSQKTSGG 218
APP PPPP+ PPPPP P+ AP+ P + + SG
Sbjct: 2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGA 2958
Score = 36.5 bits (84), Expect = 0.033
Identities = 27/106 (25%), Positives = 36/106 (33%), Gaps = 6/106 (5%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIG 242
P PP A PP P P+ P P P P P PLA G S
Sbjct: 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP-----QPPLAPT-TDPAGAGEPSGA 2958
Query: 243 AGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGV 288
G++ ++ APS+ A ++ P T +S V
Sbjct: 2959 VPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004
Score = 36.5 bits (84), Expect = 0.035
Identities = 27/112 (24%), Positives = 33/112 (29%), Gaps = 8/112 (7%)
Query: 176 PAAGASAPAPPPP----KVAAAPPPPPPKAAPAPSP----TPVPSQKTSGGTRVYASPLA 227
P A P A PPPPPP PAP TP+P + A P A
Sbjct: 2679 PPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAA 2738
Query: 228 KRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
A + G T+ + A A A P + A
Sbjct: 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
Score = 35.7 bits (82), Expect = 0.049
Identities = 17/36 (47%), Positives = 20/36 (55%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
A A PPPP AAPP P ++ P P P P PS+
Sbjct: 2544 ASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSE 2579
Score = 35.7 bits (82), Expect = 0.052
Identities = 14/43 (32%), Positives = 15/43 (34%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
PP P PPPP P P A A P+ Q G
Sbjct: 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965
Score = 35.7 bits (82), Expect = 0.060
Identities = 19/67 (28%), Positives = 24/67 (35%), Gaps = 5/67 (7%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT-----PVPSQKTSGGTRVYASPLAKRL 230
AS P P P V A+ PPPP P+ P P ++ +P
Sbjct: 409 APVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDD 468
Query: 231 AAEKGLD 237
A K LD
Sbjct: 469 ATRKALD 475
Score = 35.3 bits (81), Expect = 0.080
Identities = 29/108 (26%), Positives = 39/108 (36%), Gaps = 4/108 (3%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
APPA + PA P A P PP A P+P P + R+ +A
Sbjct: 2738 APAPPAV-PAGPATP---GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
+ + L A +A L A+ P SA+PTA P
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
Score = 33.8 bits (77), Expect = 0.20
Identities = 15/58 (25%), Positives = 18/58 (31%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
P P P P P PPP P P P P+ +G + L A
Sbjct: 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGA 2966
Score = 33.8 bits (77), Expect = 0.21
Identities = 23/114 (20%), Positives = 31/114 (27%), Gaps = 11/114 (9%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
A PAA A P P P P A P P P + G + A +E
Sbjct: 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
Query: 234 KGLDLSS-----------IGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
L S + + L + + A + AP P
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
Score = 33.0 bits (75), Expect = 0.39
Identities = 14/39 (35%), Positives = 15/39 (38%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
PP G A P A PPPPP+AA P
Sbjct: 257 PPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPD 295
Score = 32.2 bits (73), Expect = 0.64
Identities = 15/47 (31%), Positives = 17/47 (36%), Gaps = 1/47 (2%)
Query: 174 APPAAGASAPAPPPPKVAAAP-PPPPPKAAPAPSPTPVPSQKTSGGT 219
A G P P A+ P P P P A AP P P G+
Sbjct: 396 ARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGS 442
Score = 31.4 bits (71), Expect = 1.0
Identities = 21/75 (28%), Positives = 25/75 (33%), Gaps = 20/75 (26%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPP--------------------KAAPAPSPTPVPSQK 214
PPAA + P PPP PPPPP + P+ SP P+
Sbjct: 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878
Query: 215 TSGGTRVYASPLAKR 229
R A P R
Sbjct: 2879 ARPPVRRLARPAVSR 2893
Score = 31.4 bits (71), Expect = 1.0
Identities = 18/76 (23%), Positives = 23/76 (30%), Gaps = 15/76 (19%)
Query: 169 DFKDDAPPAAGASAPAPPPP---------------KVAAAPPPPPPKAAPAPSPTPVPSQ 213
D + APP+ P P PPP P+ PAP P +
Sbjct: 2607 DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR 2666
Query: 214 KTSGGTRVYASPLAKR 229
G AS +R
Sbjct: 2667 ARRLGRAAQASSPPQR 2682
Score = 30.3 bits (68), Expect = 2.3
Identities = 17/52 (32%), Positives = 21/52 (40%), Gaps = 1/52 (1%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
P GA P P P P + APA S P+ S + +AS LA
Sbjct: 2962 PWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSS-WASSLA 3012
Score = 30.3 bits (68), Expect = 2.5
Identities = 18/55 (32%), Positives = 19/55 (34%), Gaps = 14/55 (25%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPP--------------PKAAPAPSPTPVPS 212
D P A PPPP AAAP P A PAP P P+
Sbjct: 265 ADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA 319
Score = 29.9 bits (67), Expect = 3.7
Identities = 30/118 (25%), Positives = 42/118 (35%), Gaps = 7/118 (5%)
Query: 174 APPAAGASAPAPPPPKVAAAP-----PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAK 228
P ++ P P P +A P P PPP+ P T TR L +
Sbjct: 420 PTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRE 479
Query: 229 RLAAE-KGLDLSSI-GAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
R E G DL+ + G G++ +A+ A VA S+ PF P
Sbjct: 480 RRPPEPPGADLAELLGRHPDTAGTVVRLAAREAAIAREVAECSRLTINALRSPFPASP 537
Score = 29.1 bits (65), Expect = 6.7
Identities = 16/45 (35%), Positives = 20/45 (44%), Gaps = 1/45 (2%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPP-PKAAPAPSPTPVPSQKTSGG 218
P + PA P P P P P P +AP P TP+PS +
Sbjct: 399 PGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSD 443
Score = 28.8 bits (64), Expect = 8.6
Identities = 16/65 (24%), Positives = 21/65 (32%), Gaps = 7/65 (10%)
Query: 171 KDDAPPAAGASAPAPPPPKV-----AAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVY--A 223
P AA + A PP V A AP P P P +++
Sbjct: 278 PPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVV 337
Query: 224 SPLAK 228
SPL +
Sbjct: 338 SPLPR 342
Score = 28.4 bits (63), Expect = 9.1
Identities = 12/43 (27%), Positives = 15/43 (34%), Gaps = 5/43 (11%)
Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
D D PA + P P PPPP +PA +
Sbjct: 2604 DRGDPRGPAPPSPLPPDTHA-----PDPPPPSPSPAANEPDPH 2641
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 40.5 bits (95), Expect = 3e-04
Identities = 21/59 (35%), Positives = 23/59 (38%), Gaps = 5/59 (8%)
Query: 168 KDFKDDAPPAAGASA-----PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
+D P ASA P PPPP A A PPPP+A P P P V
Sbjct: 51 EDVTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENTVTV 109
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 42.5 bits (101), Expect = 3e-04
Identities = 22/101 (21%), Positives = 32/101 (31%), Gaps = 14/101 (13%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
+ PP + A A + A PP A P P P Q + S L LA
Sbjct: 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQL---LA 424
Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSA 272
A + L + + K+ A A A ++
Sbjct: 425 ARQQLQRAQ-----------GATKAKKSEPAAASRARPVNS 454
Score = 37.5 bits (88), Expect = 0.011
Identities = 16/58 (27%), Positives = 20/58 (34%)
Query: 156 IIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
+ E E + AA +A APP PP P+ APA SQ
Sbjct: 364 PLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ 421
Score = 33.3 bits (77), Expect = 0.27
Identities = 12/47 (25%), Positives = 13/47 (27%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
F AP P P +A P A P VP S
Sbjct: 359 FHPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPAS 405
Score = 31.0 bits (71), Expect = 1.3
Identities = 17/67 (25%), Positives = 25/67 (37%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
K P AA + P + A+ P P AP+ K + V P+A
Sbjct: 439 KKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPK 498
Query: 231 AAEKGLD 237
A +K L+
Sbjct: 499 ALKKALE 505
Score = 28.7 bits (65), Expect = 7.9
Identities = 13/61 (21%), Positives = 17/61 (27%), Gaps = 3/61 (4%)
Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPP---PPPPKAAPAPSPTPVPSQK 214
+E + V + AP A P P P K A TP + K
Sbjct: 456 LERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAAK 515
Query: 215 T 215
Sbjct: 516 L 516
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 41.5 bits (96), Expect = 6e-04
Identities = 40/118 (33%), Positives = 50/118 (42%), Gaps = 6/118 (5%)
Query: 165 AAFKDFKDDAPPAAGASAP--APPPPKVAAAPPP----PPPKAAPAPSPTPVPSQKTSGG 218
AA K APPA A+AP A P AAAPP PP KAA P+ P K +
Sbjct: 225 AAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAP 284
Query: 219 TRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
A+P AK AA + A + + + A A A P+K+A P A
Sbjct: 285 PAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPA 342
Score = 39.9 bits (92), Expect = 0.002
Identities = 35/112 (31%), Positives = 46/112 (41%), Gaps = 6/112 (5%)
Query: 171 KDDAPPAAGASAP--APPPPKVAAAPP----PPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
K A PA A+ P A P AAA P PP KAA P+ P K + A+
Sbjct: 224 KAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAA 283
Query: 225 PLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
P AK A + A + + + + A A A A P+K+A P A
Sbjct: 284 PPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPA 335
Score = 38.0 bits (87), Expect = 0.007
Identities = 38/116 (32%), Positives = 49/116 (42%), Gaps = 10/116 (8%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAP-----PP-----PPPKAAPAPSPTPVPSQKTSGGTR 220
K A P+ SA A P K AAAP PP P KAA AP+ P K +
Sbjct: 206 KKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPA 265
Query: 221 VYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
A+P AK A A + + + + A+ A A AAP+K+A P A
Sbjct: 266 KAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPA 321
Score = 37.6 bits (86), Expect = 0.009
Identities = 32/88 (36%), Positives = 39/88 (44%), Gaps = 6/88 (6%)
Query: 165 AAFKDFKDDAPPAAGASAPA---PPPPKVAAAPP---PPPPKAAPAPSPTPVPSQKTSGG 218
AA K APPA A+ PA PP K AAAP P KAA AP+ P K +
Sbjct: 267 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAP 326
Query: 219 TRVYASPLAKRLAAEKGLDLSSIGAGSG 246
A+P AK A + +G +G
Sbjct: 327 PAKAATPPAKAAAPPAKAAAAPVGKKAG 354
Score = 36.9 bits (84), Expect = 0.016
Identities = 37/110 (33%), Positives = 44/110 (40%), Gaps = 6/110 (5%)
Query: 165 AAFKDFKDDAPPAAGASAP--APPPPKVAAAPPP----PPPKAAPAPSPTPVPSQKTSGG 218
AA K A PA A+ P A PP AAAPP PP KAA P+ P K +
Sbjct: 239 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAA 298
Query: 219 TRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAP 268
A+ AK AA A + + T + A A A AAP
Sbjct: 299 PAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAAP 348
Score = 31.5 bits (70), Expect = 0.91
Identities = 29/100 (29%), Positives = 37/100 (37%), Gaps = 2/100 (2%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGL 236
AA A AP K A A P KAA AP+ P K + A+ AK A
Sbjct: 203 AAAKKAAAPSGKKSAKAAAPA--KAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKA 260
Query: 237 DLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
A + + + A A A A P+K+A A
Sbjct: 261 AAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPA 300
Score = 29.5 bits (65), Expect = 3.2
Identities = 22/70 (31%), Positives = 31/70 (44%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
AA K A PA A+APA A A PP AAP P++ + + A+
Sbjct: 288 AAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAA 347
Query: 225 PLAKRLAAEK 234
P+ K+ +K
Sbjct: 348 PVGKKAGGKK 357
>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 484
Score = 41.5 bits (98), Expect = 6e-04
Identities = 30/117 (25%), Positives = 41/117 (35%), Gaps = 8/117 (6%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
+SD+AA APPAA +APA P AAAP AAPA P P + T
Sbjct: 45 DSDLAAL------APPAA--AAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTA 96
Query: 221 VYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
+A+ A + + L++A+ A A
Sbjct: 97 QREQLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVA 153
Score = 29.5 bits (67), Expect = 3.7
Identities = 25/137 (18%), Positives = 47/137 (34%), Gaps = 8/137 (5%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
A + +A APP AA PPP A A S P+ + + + AKRL A+
Sbjct: 42 ALADSDLAALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPA----PWLVEHAKRLTAQ 97
Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIA 293
+ ++ A + + + + ++ A ++ A + +A
Sbjct: 98 REQLVARAAAPA----APEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVA 153
Query: 294 KRLLQSKQVIKLREQMN 310
+ + I M
Sbjct: 154 DAVKARIERIVNDTVMQ 170
Score = 28.8 bits (65), Expect = 5.8
Identities = 7/35 (20%), Positives = 9/35 (25%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
+ A A A P AA A +
Sbjct: 121 ERAAAENAARRLARAAAAAPRPRVPADAAAAVADA 155
>gnl|CDD|181304 PRK08225, PRK08225, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 70
Score = 37.5 bits (87), Expect = 8e-04
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 9/50 (18%)
Query: 78 VREGNDRVALPALSPTMEM-------GTIVSWAKKEGDKLNEGDLLAEIE 120
V EG D V L S ME+ GT+ +EGD +NEGD+L EIE
Sbjct: 23 VEEGQDVVILE--SMKMEIPIVAEEAGTVKKINVQEGDFVNEGDVLLEIE 70
>gnl|CDD|236712 PRK10547, PRK10547, chemotaxis protein CheA; Provisional.
Length = 670
Score = 41.3 bits (97), Expect = 0.001
Identities = 16/85 (18%), Positives = 27/85 (31%), Gaps = 5/85 (5%)
Query: 136 LAKILVPAGSKDVPIGKLVCIIVENES----DVAAFKDFKDDAPPAAGASAPAPPPPKVA 191
L L + ++D I ++C ++E + A K + +P P V
Sbjct: 196 LEATLPGSVAED-DITAVLCFVIEADQITFETAVAAPQEKAEETTEVVEVSPKISVPPVL 254
Query: 192 AAPPPPPPKAAPAPSPTPVPSQKTS 216
P T S+ TS
Sbjct: 255 KLAAEQAPAGRVEREKTARSSESTS 279
Score = 28.2 bits (63), Expect = 9.5
Identities = 8/54 (14%), Positives = 13/54 (24%), Gaps = 2/54 (3%)
Query: 182 APAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKG 235
A P+ A + +P S PV ++ A
Sbjct: 225 ETAVAAPQEKAEETTEVVEVSPKISVPPVLKLAAEQAPA--GRVEREKTARSSE 276
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 40.2 bits (93), Expect = 0.002
Identities = 15/38 (39%), Positives = 17/38 (44%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
P GA+AP P P AA PP P A P+P
Sbjct: 100 PGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAPIPD 137
Score = 32.1 bits (72), Expect = 0.60
Identities = 14/34 (41%), Positives = 14/34 (41%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
AA APAP P A P P AP P V
Sbjct: 106 AAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAV 139
Score = 31.7 bits (71), Expect = 0.70
Identities = 17/37 (45%), Positives = 18/37 (48%), Gaps = 3/37 (8%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAP-SPTP 209
AP A A APA PP AA P AAP P + P
Sbjct: 107 APVPAPAPAPAAAPP--AAPAPAADTPAAPIPDAVQP 141
Score = 31.7 bits (71), Expect = 0.76
Identities = 13/35 (37%), Positives = 14/35 (40%), Gaps = 3/35 (8%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
AP A A AP P AA P P A P+
Sbjct: 111 APAPAPAAAPPAAPAP---AADTPAAPIPDAVQPA 142
Score = 31.3 bits (70), Expect = 1.1
Identities = 18/50 (36%), Positives = 20/50 (40%), Gaps = 6/50 (12%)
Query: 172 DDAPPAAGASAPAPPPPKV-----AAAP-PPPPPKAAPAPSPTPVPSQKT 215
D P A P P + AAAP P P P A AP P P+ T
Sbjct: 81 DLGPQVGSFLAQLPAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADT 130
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 130
Score = 38.3 bits (89), Expect = 0.002
Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 1/48 (2%)
Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
GA A A P + A+ P P P +A+P P + G SP+
Sbjct: 22 GAPAQAAAPAQPASTPVPVPTEASPQVEAQA-PQPAAAAGADAMPSPM 68
Score = 34.8 bits (80), Expect = 0.023
Identities = 16/42 (38%), Positives = 19/42 (45%), Gaps = 6/42 (14%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPK-AAPA-----PSPTP 209
A PA AS P P P + + P+ AA A PSP P
Sbjct: 28 AAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSPMP 69
Score = 31.3 bits (71), Expect = 0.36
Identities = 11/35 (31%), Positives = 15/35 (42%)
Query: 193 APPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
AP A PA +P PVP++ + P A
Sbjct: 23 APAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAA 57
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 40.2 bits (94), Expect = 0.002
Identities = 16/70 (22%), Positives = 26/70 (37%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
P + P P PP+ A P P P++AP + +P + T ++
Sbjct: 400 KEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKALIA 459
Query: 234 KGLDLSSIGA 243
G L + A
Sbjct: 460 DGDVLEQLEA 469
Score = 39.8 bits (93), Expect = 0.002
Identities = 16/58 (27%), Positives = 18/58 (31%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
P A A+A PP V PPP P P P + TR K
Sbjct: 388 RPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYT 445
Score = 37.5 bits (87), Expect = 0.013
Identities = 19/60 (31%), Positives = 22/60 (36%), Gaps = 4/60 (6%)
Query: 171 KDDAPPAAGASAPAPPP--PKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT--RVYASPL 226
AP A +A AP P P A + P AA P PV T R A P+
Sbjct: 362 PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPV 421
Score = 32.9 bits (75), Expect = 0.35
Identities = 16/56 (28%), Positives = 17/56 (30%), Gaps = 3/56 (5%)
Query: 175 PPAAGASAPA-PPPPKVAAAPPPPPPKAAPA--PSPTPVPSQKTSGGTRVYASPLA 227
P AP+ P AA PP P A P P P T A L
Sbjct: 377 SPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLT 432
Score = 32.9 bits (75), Expect = 0.37
Identities = 15/51 (29%), Positives = 17/51 (33%), Gaps = 5/51 (9%)
Query: 174 APPAAGASAP-----APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
P A ++ P A PPK PP P P PVP S
Sbjct: 380 RPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPK 430
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 39.4 bits (92), Expect = 0.002
Identities = 14/42 (33%), Positives = 16/42 (38%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
PP A PA PP A P SP P + K+S
Sbjct: 155 PPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSS 196
Score = 38.7 bits (90), Expect = 0.003
Identities = 14/39 (35%), Positives = 18/39 (46%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
PPA PPP + A PP A AP+ +P P+
Sbjct: 145 PPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPT 183
Score = 36.0 bits (83), Expect = 0.026
Identities = 15/40 (37%), Positives = 17/40 (42%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
A P A AP+ P P PPP AP S P+ S
Sbjct: 163 PASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKS 202
Score = 31.7 bits (72), Expect = 0.62
Identities = 14/51 (27%), Positives = 16/51 (31%)
Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
PPP V PPPP PA P P+ + A K
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPK 194
Score = 28.3 bits (63), Expect = 6.7
Identities = 16/46 (34%), Positives = 18/46 (39%), Gaps = 3/46 (6%)
Query: 183 PAPPPPKVAAAPPPPP---PKAAPAPSPTPVPSQKTSGGTRVYASP 225
P PP P V PPPP P + PA P P + T P
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPP 189
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 39.8 bits (93), Expect = 0.003
Identities = 14/42 (33%), Positives = 19/42 (45%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P AA A++P+P AA P P PA +P V +
Sbjct: 392 PSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPA 433
Score = 38.2 bits (89), Expect = 0.007
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
AP + A+A +P P + +AA P P++A P+ TP
Sbjct: 389 APQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPT 426
Score = 35.5 bits (82), Expect = 0.059
Identities = 10/42 (23%), Positives = 15/42 (35%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
+P + A+A P PP + P+ PV T
Sbjct: 401 SPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPST 442
Score = 34.4 bits (79), Expect = 0.13
Identities = 19/67 (28%), Positives = 21/67 (31%), Gaps = 5/67 (7%)
Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP-----TPVPSQKTSGG 218
+A DDA G P AA P P AA +PSP PS S
Sbjct: 359 LAQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSAT 418
Query: 219 TRVYASP 225
P
Sbjct: 419 QPAGTPP 425
Score = 33.6 bits (77), Expect = 0.20
Identities = 15/45 (33%), Positives = 17/45 (37%), Gaps = 3/45 (6%)
Query: 174 APPAAGASAPAPPP---PKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
AP +A A PP AA P PP A A P +K
Sbjct: 413 APQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKK 457
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 39.2 bits (91), Expect = 0.003
Identities = 16/61 (26%), Positives = 19/61 (31%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
PP S PPP + P PP P P P P + + G A
Sbjct: 255 PPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRD 314
Query: 235 G 235
G
Sbjct: 315 G 315
Score = 30.7 bits (69), Expect = 1.4
Identities = 20/71 (28%), Positives = 24/71 (33%), Gaps = 8/71 (11%)
Query: 142 PAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPP------PKVAAAPP 195
P +D G+ E E A +D P G PAP P + A
Sbjct: 289 PGPVEDGAPGRETG--GEGEGPEPAGRDGAAGGEPKPGPPRPAPDADRPEGWPSLEAITF 346
Query: 196 PPPPKAAPAPS 206
PPP A PA
Sbjct: 347 PPPTPATPAVP 357
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 39.4 bits (92), Expect = 0.003
Identities = 20/113 (17%), Positives = 30/113 (26%), Gaps = 14/113 (12%)
Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYA 223
A A P AAA AP +P P +
Sbjct: 390 AVGASAVPAVTAVTGAAGAALAPK---AAAAAAATRAEAPPAAPAPPATADRGDDAADGD 446
Query: 224 SPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
+P+ + A D D + +G+ +AP+ A P A
Sbjct: 447 APVPAKANARASADSRCD-----------ERDAQPPADSGSASAPASDAPPDA 488
Score = 36.4 bits (84), Expect = 0.028
Identities = 20/104 (19%), Positives = 28/104 (26%), Gaps = 5/104 (4%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
AP AA A+A AA PP + P R A A+
Sbjct: 411 APKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPA-KANARASADSRCDERDAQ 469
Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
D S A + D + A A + + +
Sbjct: 470 PPADSGSASAPAS----DAPPDAAFEPAPRAAAPSAATPAAVPD 509
Score = 35.2 bits (81), Expect = 0.072
Identities = 20/96 (20%), Positives = 32/96 (33%), Gaps = 3/96 (3%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
D+ + P +A PP AA P+P + A A A
Sbjct: 459 ADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPA---A 515
Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAA 267
A + ++ + T A + A++AG AA
Sbjct: 516 ASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551
Score = 34.8 bits (80), Expect = 0.092
Identities = 22/101 (21%), Positives = 34/101 (33%), Gaps = 4/101 (3%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKG 235
G + P +VA A P P +AA A + VP+ G A+ A A
Sbjct: 362 VTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTG----AAGAALAPKAAAA 417
Query: 236 LDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
+ A +AD + G P+K+ +
Sbjct: 418 AAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARAS 458
Score = 34.8 bits (80), Expect = 0.11
Identities = 18/107 (16%), Positives = 27/107 (25%), Gaps = 3/107 (2%)
Query: 173 DAPPAAGASAPAP---PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
APPA P A A + P+ S +P
Sbjct: 430 PAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA 489
Query: 230 LAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
S + + + A S+ A A P+ A+P
Sbjct: 490 FEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPT 536
Score = 33.7 bits (77), Expect = 0.20
Identities = 15/61 (24%), Positives = 18/61 (29%)
Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
E A A APA + A A PP A P+P +G
Sbjct: 488 AAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAG 547
Query: 218 G 218
G
Sbjct: 548 G 548
Score = 32.1 bits (73), Expect = 0.72
Identities = 15/58 (25%), Positives = 18/58 (31%), Gaps = 4/58 (6%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
D AA A+ PA P A P P PTP T A+ +
Sbjct: 566 DRGARAAAAAKPAAAPA----AAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQA 619
Score = 31.0 bits (70), Expect = 1.6
Identities = 16/54 (29%), Positives = 18/54 (33%), Gaps = 3/54 (5%)
Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
GA A A P AA P PK A VP+ + T A R
Sbjct: 568 GARAAAAAKP---AAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQ 618
>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
Length = 94
Score = 36.3 bits (83), Expect = 0.003
Identities = 25/75 (33%), Positives = 30/75 (40%)
Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGA 243
P P A PPPPP AP P PT P + R S ++ G I
Sbjct: 12 PPAPLPPAPVPPPPPAPPAPVPEPTVKPVNAEAPKIREAQSKRGQKQQKAGGTASLRIPK 71
Query: 244 GSGLFGSITSADLSK 258
SG GSI++ SK
Sbjct: 72 QSGTQGSISAGGTSK 86
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 39.3 bits (92), Expect = 0.003
Identities = 20/44 (45%), Positives = 21/44 (47%), Gaps = 5/44 (11%)
Query: 175 PPAAGASAPAPP--PPKVAAAPPPPPPKAAPA---PSPTPVPSQ 213
A GA PA P PP AA P PP A PA P+P P P
Sbjct: 386 SAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPA 429
Score = 38.6 bits (90), Expect = 0.005
Identities = 20/51 (39%), Positives = 23/51 (45%), Gaps = 4/51 (7%)
Query: 174 APPAAGASAPAPPPPKVAA----APPPPPPKAAPAPSPTPVPSQKTSGGTR 220
APP A A P P A AP P PP AA P+ + P+ S G R
Sbjct: 399 APPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPAAAASAGDR 449
Score = 35.5 bits (82), Expect = 0.047
Identities = 17/46 (36%), Positives = 20/46 (43%), Gaps = 3/46 (6%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPP---PPKAAPAPSPTPVPSQKTS 216
P A A PA PP AAP P P APAP P +++
Sbjct: 393 TPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSA 438
Score = 31.3 bits (71), Expect = 1.1
Identities = 12/33 (36%), Positives = 13/33 (39%)
Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
AP PP A P P AAP P+ P
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAP 413
Score = 29.7 bits (67), Expect = 3.7
Identities = 11/51 (21%), Positives = 13/51 (25%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
GA AP P P A AP P + + A
Sbjct: 378 ERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPP 428
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 39.1 bits (91), Expect = 0.004
Identities = 13/50 (26%), Positives = 17/50 (34%), Gaps = 1/50 (2%)
Query: 172 DDAPPAAGASAPAP-PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
+ PA +PA P P A P KAA + P + R
Sbjct: 13 EAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPR 62
Score = 34.1 bits (78), Expect = 0.13
Identities = 14/50 (28%), Positives = 15/50 (30%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
AA A AA A+ A P A P K P P K
Sbjct: 25 AAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWK 74
Score = 32.2 bits (73), Expect = 0.52
Identities = 12/37 (32%), Positives = 14/37 (37%)
Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
G + P P AA P PPP A A T +
Sbjct: 12 GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAA 48
Score = 32.2 bits (73), Expect = 0.54
Identities = 16/35 (45%), Positives = 17/35 (48%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
A PAP PP AAAP PPPP AP+
Sbjct: 12 GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAP 46
Score = 28.7 bits (64), Expect = 6.0
Identities = 14/50 (28%), Positives = 19/50 (38%), Gaps = 1/50 (2%)
Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
+ + A A PA A+APA P+ A P P+ P P
Sbjct: 23 SPAAAPAPPPPAKTAAPATKAAAPAAAAPR-AEKPKKDKPRRERKPKPAS 71
Score = 28.7 bits (64), Expect = 7.0
Identities = 14/56 (25%), Positives = 19/56 (33%), Gaps = 5/56 (8%)
Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
G P P PP P A P P + T A+P A++ +K
Sbjct: 10 GKGEAEQPAP-----APPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDK 60
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 38.1 bits (89), Expect = 0.005
Identities = 20/64 (31%), Positives = 24/64 (37%), Gaps = 1/64 (1%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPP-PPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
D A P+ + APP V P P PPK P P P P + P K +
Sbjct: 79 DAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPV 138
Query: 231 AAEK 234
EK
Sbjct: 139 VEEK 142
Score = 33.4 bits (77), Expect = 0.14
Identities = 11/42 (26%), Positives = 12/42 (28%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
AP P P AP P P PV +K
Sbjct: 102 APVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143
Score = 32.3 bits (74), Expect = 0.31
Identities = 20/59 (33%), Positives = 23/59 (38%), Gaps = 4/59 (6%)
Query: 174 APPAAGASAPAPPPPKVAAAPPP----PPPKAAPAPSPTPVPSQKTSGGTRVYASPLAK 228
A A A+AP+ P VA P P P P P P P K +V A P K
Sbjct: 74 AVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPK 132
Score = 30.7 bits (70), Expect = 1.2
Identities = 10/29 (34%), Positives = 12/29 (41%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P P P + APP P P+ P P
Sbjct: 117 PKPKPQQKVEAPPAPKPEPKPVVEEKAAP 145
>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 155
Score = 36.7 bits (86), Expect = 0.007
Identities = 17/63 (26%), Positives = 23/63 (36%), Gaps = 10/63 (15%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG--------GTRVY 222
+ A P A + A P AA AAPA +P + + G GT Y
Sbjct: 34 RAAAAPVAPVAQQAAAAPVAAAPAAAAAAAAAPAAAPAA-AAAEAEGHVVTSPMVGT-FY 91
Query: 223 ASP 225
+P
Sbjct: 92 RAP 94
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 38.0 bits (88), Expect = 0.007
Identities = 14/51 (27%), Positives = 21/51 (41%), Gaps = 6/51 (11%)
Query: 174 APPAAGASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVPSQKTSGG 218
+ P S PA P ++AAP P ++A A +P + S G
Sbjct: 93 SAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSSG 143
Score = 37.6 bits (87), Expect = 0.009
Identities = 14/46 (30%), Positives = 18/46 (39%), Gaps = 3/46 (6%)
Query: 174 APPAAGASAPAP---PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P A SAP P PP A+ P A SP+ + T+
Sbjct: 86 PPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATA 131
Score = 36.0 bits (83), Expect = 0.034
Identities = 17/62 (27%), Positives = 22/62 (35%), Gaps = 1/62 (1%)
Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDL 238
G AP P A P P PP A+PA + P+ + S A +
Sbjct: 82 GKVAPPAATPTSAPTPTPSPP-ASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVP 140
Query: 239 SS 240
SS
Sbjct: 141 SS 142
Score = 31.8 bits (72), Expect = 0.75
Identities = 10/28 (35%), Positives = 14/28 (50%)
Query: 189 KVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
APP P +AP P+P+P S +
Sbjct: 81 TGKVAPPAATPTSAPTPTPSPPASPASG 108
>gnl|CDD|234994 PRK01973, PRK01973, septum formation inhibitor; Reviewed.
Length = 271
Score = 37.4 bits (87), Expect = 0.009
Identities = 26/68 (38%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPP---PKAAPAPSPTPVPSQKT 215
+D AA + AP AA A+ A P AAAP PPP P+A A S T V +
Sbjct: 111 AKAADEAAAAAAEAAAPAAAAAAEAAAAAPAAAAAPEPPPAPAPEAVAAQSQTLVIDRPL 170
Query: 216 SGGTRVYA 223
G ++YA
Sbjct: 171 RSGQQIYA 178
Score = 37.0 bits (86), Expect = 0.014
Identities = 20/59 (33%), Positives = 22/59 (37%), Gaps = 3/59 (5%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP---SQKTSGGTRVYASPL 226
K AA A+ A P AA P AA AP P P P + T V PL
Sbjct: 112 KAADEAAAAAAEAAAPAAAAAAEAAAAAPAAAAAPEPPPAPAPEAVAAQSQTLVIDRPL 170
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 37.6 bits (88), Expect = 0.011
Identities = 15/43 (34%), Positives = 17/43 (39%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P P+ P K AA P PPP A P P P + S
Sbjct: 383 PSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPS 425
Score = 34.6 bits (80), Expect = 0.12
Identities = 14/42 (33%), Positives = 18/42 (42%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P+ + P PK A P P A A P PVP++ T
Sbjct: 382 NPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTE 423
Score = 33.4 bits (77), Expect = 0.21
Identities = 13/52 (25%), Positives = 17/52 (32%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
+ A P PP A PP P P A P P P + + +
Sbjct: 518 SNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQ 569
Score = 32.6 bits (75), Expect = 0.42
Identities = 7/42 (16%), Positives = 9/42 (21%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P P P PA + P+P
Sbjct: 380 APNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEP 421
Score = 32.6 bits (75), Expect = 0.46
Identities = 8/43 (18%), Positives = 12/43 (27%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P+A + A P A PP P+ +
Sbjct: 390 IQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANA 432
Score = 32.2 bits (74), Expect = 0.52
Identities = 24/135 (17%), Positives = 39/135 (28%), Gaps = 20/135 (14%)
Query: 86 ALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIE-----TDKATMGFETPEEGYLAKIL 140
L L+ + + I+ W + L E + + + E G L
Sbjct: 315 ELCKLAKQINLERILQWQQH----------LKGSEYQLKNSTQPRLWLEVTLLGLLPSAF 364
Query: 141 VPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPK 200
+ + + + P +A P PP A+PP P P
Sbjct: 365 ISEIANASAPANPTPAPNPSPPPAPI-----QPSAPKTKQAATTPSPPPAKASPPIPVPA 419
Query: 201 AAPAPSPTPVPSQKT 215
PSPTP +
Sbjct: 420 EPTEPSPTPPANAAN 434
Score = 32.2 bits (74), Expect = 0.54
Identities = 16/46 (34%), Positives = 17/46 (36%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
A A P P A PP P A AP PTP P T+
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQA 562
Score = 32.2 bits (74), Expect = 0.58
Identities = 15/35 (42%), Positives = 17/35 (48%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
+PP A AS P P P + P PP AA AP
Sbjct: 405 SPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSL 439
Score = 32.2 bits (74), Expect = 0.59
Identities = 15/44 (34%), Positives = 19/44 (43%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
PP A ++ P A + PPPP P PSPT S +
Sbjct: 553 PPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSPEE 596
Score = 31.1 bits (71), Expect = 1.4
Identities = 11/37 (29%), Positives = 14/37 (37%)
Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
G+++ P PPPP P P PT T
Sbjct: 515 GSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPT 551
Score = 30.7 bits (70), Expect = 1.6
Identities = 11/39 (28%), Positives = 13/39 (33%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
A + A A PP V A P P P + P
Sbjct: 401 ATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSL 439
Score = 30.3 bits (69), Expect = 2.3
Identities = 11/37 (29%), Positives = 17/37 (45%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
A+ P+PPP K + P P P+P+P +
Sbjct: 401 ATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPP 437
Score = 29.9 bits (68), Expect = 3.0
Identities = 13/44 (29%), Positives = 14/44 (31%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
A PAP P A P PK A + P K S
Sbjct: 369 ANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKAS 412
Score = 29.5 bits (67), Expect = 4.0
Identities = 16/42 (38%), Positives = 16/42 (38%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
PPA P P PPPPPP A A S P S
Sbjct: 533 PPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADS 574
Score = 29.2 bits (66), Expect = 5.6
Identities = 7/42 (16%), Positives = 10/42 (23%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
+ PA P + P P P + P
Sbjct: 398 KQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSL 439
Score = 28.4 bits (64), Expect = 9.7
Identities = 17/63 (26%), Positives = 19/63 (30%), Gaps = 8/63 (12%)
Query: 180 ASAPAPPPPKVA--------AAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
AS A PP P P P AP P+P P P T + A A
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSP 576
Query: 232 AEK 234
Sbjct: 577 PPP 579
>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
Length = 767
Score = 37.9 bits (88), Expect = 0.011
Identities = 24/96 (25%), Positives = 36/96 (37%), Gaps = 7/96 (7%)
Query: 156 IIVENES-----DVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
+++ N S ++ A D A A+ + P P + AA P P AA +P V
Sbjct: 27 LVLSNRSVAEGVEIVAMLDEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAV 86
Query: 211 PSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSG 246
P+ G R + R E+ L AG
Sbjct: 87 PAPGAIGDLR--GELQSMRGMLERQLAGLLWAAGEV 120
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 36.0 bits (83), Expect = 0.012
Identities = 17/40 (42%), Positives = 17/40 (42%), Gaps = 3/40 (7%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P S PA PP A A P P AAP PS P P
Sbjct: 101 PAVPTPPPTSTPAVPP---APAAAVPAPAAAPPPSDPPQP 137
Score = 35.2 bits (81), Expect = 0.020
Identities = 16/43 (37%), Positives = 18/43 (41%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
+ A A PPP A PP P A PAP+ P PS
Sbjct: 93 PPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPP 135
Score = 30.2 bits (68), Expect = 0.97
Identities = 17/54 (31%), Positives = 20/54 (37%), Gaps = 3/54 (5%)
Query: 166 AFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP---TPVPSQKTS 216
A K PPA P A A P PPP + PA P VP+ +
Sbjct: 75 ARKILAPLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAA 128
Score = 28.6 bits (64), Expect = 3.7
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKA 201
A P A A+A P + PP PP+A
Sbjct: 113 AVPPAPAAAVPAPAAAPPPSDPPQPPRA 140
Score = 28.3 bits (63), Expect = 5.0
Identities = 11/32 (34%), Positives = 12/32 (37%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
P + A PAP A A PPP P
Sbjct: 107 PPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPP 138
Score = 27.9 bits (62), Expect = 5.8
Identities = 14/45 (31%), Positives = 16/45 (35%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
P PP V P A P P PT P+ + V A A
Sbjct: 84 PPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAA 128
Score = 27.5 bits (61), Expect = 9.1
Identities = 13/31 (41%), Positives = 13/31 (41%), Gaps = 1/31 (3%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPP-PKAAP 203
P A A A P P A P PP P AP
Sbjct: 111 TPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 36.0 bits (83), Expect = 0.013
Identities = 13/39 (33%), Positives = 16/39 (41%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
D + PAP P + P P AAPAP+P
Sbjct: 44 DLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASA 82
Score = 34.5 bits (79), Expect = 0.042
Identities = 13/36 (36%), Positives = 14/36 (38%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
A P P P A P P P A AP+P P
Sbjct: 46 SAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPAS 81
Score = 29.8 bits (67), Expect = 1.5
Identities = 9/37 (24%), Positives = 12/37 (32%), Gaps = 1/37 (2%)
Query: 177 AAGASAPAPPPPKVAAAP-PPPPPKAAPAPSPTPVPS 212
A G + P P P P AP+P +
Sbjct: 38 AKGLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAA 74
>gnl|CDD|236507 PRK09424, pntA, NAD(P) transhydrogenase subunit alpha; Provisional.
Length = 509
Score = 37.5 bits (88), Expect = 0.013
Identities = 9/31 (29%), Positives = 11/31 (35%)
Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
PPP +V+AAP A S
Sbjct: 370 PPPIQVSAAPAAAAAAPAAKEEEKKPASPWR 400
Score = 31.3 bits (72), Expect = 0.96
Identities = 9/29 (31%), Positives = 11/29 (37%)
Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
PPPP +A P A A P+
Sbjct: 369 PPPPIQVSAAPAAAAAAPAAKEEEKKPAS 397
Score = 30.6 bits (70), Expect = 2.0
Identities = 10/33 (30%), Positives = 13/33 (39%), Gaps = 4/33 (12%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
PP ++APA AAP + PA
Sbjct: 370 PPPIQVSAAPAAAA----AAPAAKEEEKKPASP 398
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 35.6 bits (82), Expect = 0.013
Identities = 15/42 (35%), Positives = 16/42 (38%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
PP + P P P A PPPP A P P P P
Sbjct: 77 FPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPG 118
Score = 28.7 bits (64), Expect = 2.9
Identities = 12/37 (32%), Positives = 12/37 (32%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
P PA PP A PPP P P P
Sbjct: 87 PPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDP 123
>gnl|CDD|223585 COG0511, AccB, Biotin carboxyl carrier protein [Lipid metabolism].
Length = 140
Score = 35.4 bits (82), Expect = 0.016
Identities = 17/49 (34%), Positives = 23/49 (46%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGS 145
GT+ + GD + G LA IE K E P +G + +ILV G
Sbjct: 79 GTVYKPFVEVGDTVKAGQTLAIIEAMKMENEIEAPADGVVKEILVKNGD 127
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 36.8 bits (85), Expect = 0.021
Identities = 12/37 (32%), Positives = 15/37 (40%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
PA P +AA P P + AP P P P+
Sbjct: 388 PANLLHNAPQAAPAPSAAAPEPKHQPAPEPRPVLAPT 424
Score = 35.2 bits (81), Expect = 0.073
Identities = 25/89 (28%), Positives = 37/89 (41%)
Query: 133 EGYLAKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAA 192
E A++L+P+ S +E D++ + +AP AA A + A P PK
Sbjct: 354 EVVCARLLLPSASDAESALLQRVERIETRLDMSIPANLLHNAPQAAPAPSAAAPEPKHQP 413
Query: 193 APPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
AP P P A S P + S + V
Sbjct: 414 APEPRPVLAPTPASGEPNAAAVRSMWSTV 442
Score = 29.5 bits (66), Expect = 3.7
Identities = 26/133 (19%), Positives = 39/133 (29%), Gaps = 27/133 (20%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVY----------- 222
A P P P +A P P AA S K +R
Sbjct: 405 AAPEPKHQPAPEPRPVLAPTPASGEPNAAAVRSMWSTVRDKVRQRSRTTEVMLAGATVRA 464
Query: 223 -----------ASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKS 271
++PLA+RL+ ++ D+ + L + + G AA +
Sbjct: 465 LEGNTLVLTHESAPLARRLSEQRNADVLA----EALK-DALGVNWRVRCETGKPAAAASP 519
Query: 272 AKPTANGPFTDLP 284
A AN P
Sbjct: 520 AGGGANAPPAKPV 532
>gnl|CDD|223968 COG1038, PycA, Pyruvate carboxylase [Energy production and
conversion].
Length = 1149
Score = 36.9 bits (86), Expect = 0.022
Identities = 18/50 (36%), Positives = 28/50 (56%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
G +V K+GDK+ +GD+LA IE K P +G + ++LV G +
Sbjct: 1088 GVVVEVKVKKGDKVKKGDVLAVIEAMKMETTISAPFDGTVKEVLVKDGDQ 1137
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 36.8 bits (85), Expect = 0.022
Identities = 20/79 (25%), Positives = 26/79 (32%), Gaps = 12/79 (15%)
Query: 176 PAAGASAPAP-----PPPKVAAAPPPPPPKAAPAPSPTPV------PSQKTSGGTRVYAS 224
P G A AP P A PP PP P + P P + + +
Sbjct: 897 PGTGRVATAPGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAELRET 956
Query: 225 PLAKRLAAEKGLDLSSIGA 243
R A G D ++ GA
Sbjct: 957 LADLRSAQRSG-DFTAYGA 974
Score = 30.6 bits (69), Expect = 1.7
Identities = 14/38 (36%), Positives = 15/38 (39%), Gaps = 2/38 (5%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
P GA PAPP P P AAP P P+
Sbjct: 913 APPPGAGPPAPPQA--VPPPRTTQPPAAPPRGPDVPPA 948
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 36.6 bits (85), Expect = 0.022
Identities = 11/40 (27%), Positives = 13/40 (32%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
+ + P AAAP PPP A P P
Sbjct: 281 RMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRG 320
Score = 34.7 bits (80), Expect = 0.11
Identities = 16/61 (26%), Positives = 20/61 (32%), Gaps = 7/61 (11%)
Query: 156 IIVENESDVAAFKDFKDDAPPAAGASAPAP-------PPPKVAAAPPPPPPKAAPAPSPT 208
I + D + F A AA + AP + P AAPAP P
Sbjct: 248 ITLTGAGDERIVRAFLTWADAAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPN 307
Query: 209 P 209
P
Sbjct: 308 P 308
Score = 30.5 bits (69), Expect = 1.9
Identities = 10/32 (31%), Positives = 11/32 (34%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
P AA A+ PP P PP A
Sbjct: 294 QPAAAAAAPAPPPNPPATPPEPPARRGRGSAA 325
Score = 29.7 bits (67), Expect = 4.0
Identities = 14/43 (32%), Positives = 17/43 (39%)
Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
S + D K A A+A PPP A PP PP +
Sbjct: 278 APSRMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRG 320
Score = 28.9 bits (65), Expect = 5.5
Identities = 12/35 (34%), Positives = 13/35 (37%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
A+A AP PP A PP PP S
Sbjct: 290 SVTSQPAAAAAAPAPPPNPPATPPEPPARRGRGSA 324
Score = 28.9 bits (65), Expect = 6.6
Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 10/82 (12%)
Query: 164 VAAFKDFKDD-APPAAGASAPAPPP---------PKVAAAPPPPPPKAAPAPSPTPVPSQ 213
V AF + D A PA A AP+ + AAA P P P +P P++
Sbjct: 259 VRAFLTWADAAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPAR 318
Query: 214 KTSGGTRVYASPLAKRLAAEKG 235
+ G ++ + AA G
Sbjct: 319 RGRGSAALWFVVVVLACAAAVG 340
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 35.9 bits (83), Expect = 0.022
Identities = 11/43 (25%), Positives = 14/43 (32%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
P A P P +V P PP P P P ++
Sbjct: 123 ADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEP 165
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 34.7 bits (80), Expect = 0.023
Identities = 18/58 (31%), Positives = 23/58 (39%), Gaps = 1/58 (1%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
D PP A PPP + AAPPP P P P+ P P + + P +
Sbjct: 51 VDTPPPVSAVWVLPPPSE-PAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRPQ 107
Score = 33.9 bits (78), Expect = 0.038
Identities = 16/53 (30%), Positives = 19/53 (35%), Gaps = 11/53 (20%)
Query: 171 KDDAPPAAGASAPAPP-----------PPKVAAAPPPPPPKAAPAPSPTPVPS 212
+PP + PP PP V+A PPP AP P P P
Sbjct: 27 SCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPP 79
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 36.5 bits (84), Expect = 0.030
Identities = 21/82 (25%), Positives = 27/82 (32%), Gaps = 5/82 (6%)
Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPP-----PKAAPAPSPTPVPSQKTSGG 218
A D K AP P P AP P P +AP P P ++
Sbjct: 384 PAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPLP 443
Query: 219 TRVYASPLAKRLAAEKGLDLSS 240
AS + + G+DL S
Sbjct: 444 PSPQASAPRNVASGKPGVDLGS 465
Score = 33.0 bits (75), Expect = 0.30
Identities = 22/112 (19%), Positives = 33/112 (29%), Gaps = 8/112 (7%)
Query: 174 APPAAGA----SAPAPPPPKVAAAPPPPPPKAA---PAPSPTPVPSQKTSGGTRVYASPL 226
PA AP P P+ P P P+A PA P+P + V S
Sbjct: 382 PSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAP 441
Query: 227 AKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANG 278
+ G GS ++ G+ P +++ A
Sbjct: 442 LPPSPQASAPRNVASGKPGVDLGSWQGK-FMNFTRNGSRKQPVQASSSDAAQ 492
>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597). This
family of proteins is found in bacteria, eukaryotes and
viruses. Proteins in this family are typically between
126 and 281 amino acids in length. The function of this
domain is unknown. The structure of this domain has been
found to contain five helices with a long flexible loop
between helices one and two.
Length = 124
Score = 34.2 bits (79), Expect = 0.035
Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 7/60 (11%)
Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDL 238
G +A A P P A A P AA AP+ TP + V + + LAA+KG L
Sbjct: 11 GEAAAAAPAPAAAPATAPAAA-AAAAPAATPPAAAS------VDVAAVLDALAAKKGQKL 63
Score = 32.7 bits (75), Expect = 0.12
Identities = 9/32 (28%), Positives = 10/32 (31%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
A A+APAP A A P
Sbjct: 12 EAAAAAPAPAAAPATAPAAAAAAAPAATPPAA 43
Score = 32.3 bits (74), Expect = 0.17
Identities = 14/34 (41%), Positives = 17/34 (50%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
+ A A A+APA P AAA P P AA +
Sbjct: 12 EAAAAAPAPAAAPATAPAAAAAAAPAATPPAAAS 45
Score = 31.9 bits (73), Expect = 0.18
Identities = 14/38 (36%), Positives = 16/38 (42%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
A F + AP A A A AP AA PP A+
Sbjct: 8 AIFGEAAAAAPAPAAAPATAPAAAAAAAPAATPPAAAS 45
Score = 30.4 bits (69), Expect = 0.66
Identities = 11/36 (30%), Positives = 13/36 (36%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
F + A A +A P AAA P A A
Sbjct: 10 FGEAAAAAPAPAAAPATAPAAAAAAAPAATPPAAAS 45
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 35.8 bits (82), Expect = 0.035
Identities = 24/107 (22%), Positives = 38/107 (35%), Gaps = 1/107 (0%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLS 239
+A P ++ P A P+P+ P S T AS +A A D++
Sbjct: 301 VAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATT-TQASAVALSSAGVLPSDVT 359
Query: 240 SIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVS 286
G + + S + + + TANGP T LP +
Sbjct: 360 LPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAA 406
>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP). This entry
represents proteins encoded by genes which are always
found in type III secretion operons, although their
function in the processes of secretion and virulence is
unclear. Hpa stands for Hrp-associated gene, where Hrp
stands for hypersensitivity response and virulence. see
also PMID:18584024.
Length = 185
Score = 35.1 bits (81), Expect = 0.036
Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 1/39 (2%)
Query: 172 DDAPPAAGASAPAPP-PPKVAAAPPPPPPKAAPAPSPTP 209
A A PAPP P + A P P+ APAP P
Sbjct: 21 RRAARAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPP 59
Score = 32.0 bits (73), Expect = 0.30
Identities = 13/38 (34%), Positives = 15/38 (39%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
PA A P P+ A APP PP P P+
Sbjct: 35 PGPAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPLA 72
Score = 29.7 bits (67), Expect = 2.1
Identities = 13/40 (32%), Positives = 14/40 (35%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
APP A P + AP PP P A P P
Sbjct: 33 APPGPAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPLA 72
Score = 29.0 bits (65), Expect = 3.4
Identities = 11/39 (28%), Positives = 14/39 (35%), Gaps = 6/39 (15%)
Query: 180 ASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVPS 212
+A A PP A P P + AP+P P
Sbjct: 22 RAARAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPPR 60
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 35.0 bits (81), Expect = 0.044
Identities = 12/59 (20%), Positives = 18/59 (30%), Gaps = 7/59 (11%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
E+D+ A + A+AP V + P P P P + G
Sbjct: 107 EADLLALME-------EEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKKSGM 158
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 36.0 bits (83), Expect = 0.048
Identities = 21/76 (27%), Positives = 24/76 (31%), Gaps = 9/76 (11%)
Query: 163 DVAAFKDFKDDAPPAAGASAPAPPPP-------KVAAAPP--PPPPKAAPAPSPTPVPSQ 213
+ A A A P P PP AA P P P AAPAP+ PS
Sbjct: 58 EAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVEPSL 117
Query: 214 KTSGGTRVYASPLAKR 229
+ L R
Sbjct: 118 AANPFAAAIEWLLGGR 133
Score = 30.2 bits (68), Expect = 2.5
Identities = 21/76 (27%), Positives = 24/76 (31%), Gaps = 13/76 (17%)
Query: 173 DAPPAAGASAPAPPPPKVA-----AAPPPPPPKA--------APAPSPTPVPSQKTSGGT 219
+ A A+ AP P PPP PP A PS P PS +
Sbjct: 52 EQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAE 111
Query: 220 RVYASPLAKRLAAEKG 235
V S A AA
Sbjct: 112 PVEPSLAANPFAAAIE 127
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 35.6 bits (83), Expect = 0.048
Identities = 17/70 (24%), Positives = 23/70 (32%), Gaps = 5/70 (7%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
PP G + AP A P PA + VP A+P A A +
Sbjct: 400 GPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVP-----DAAAAAAAPPAPAAAPQ 454
Query: 234 KGLDLSSIGA 243
+ L+S
Sbjct: 455 PAVRLNSFED 464
Score = 29.9 bits (68), Expect = 3.1
Identities = 13/62 (20%), Positives = 15/62 (24%), Gaps = 7/62 (11%)
Query: 174 APPAAGASAPAPPPPK-------VAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
AP P AAAP P AA A P + P
Sbjct: 397 GGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPA 456
Query: 227 AK 228
+
Sbjct: 457 VR 458
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 35.9 bits (83), Expect = 0.051
Identities = 19/94 (20%), Positives = 28/94 (29%), Gaps = 5/94 (5%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
P + + +P+P P A P P K P+ TR A+ A +
Sbjct: 343 PGPSPSRSPSPSRPP-PPADPSSPRKRPRPSRAPSSPAASAGRPTR----RRARAAVAGR 397
Query: 235 GLDLSSIGAGSGLFGSITSADLSKASKAGAVAAP 268
+ G + D AS A P
Sbjct: 398 ARRRDATGRFPAGRPRPSPLDAGAASGAFYARYP 431
Score = 34.4 bits (79), Expect = 0.14
Identities = 22/116 (18%), Positives = 36/116 (31%), Gaps = 9/116 (7%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
+ + GA+ P P + +P PPP A P+ + A +R A
Sbjct: 331 SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRA 390
Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSG 287
+ +G F + + + A + S A P L SG
Sbjct: 391 RAAVAGRARRRDATGRFP-------AGRPRPSPLDAGAASGAFYARYP--LLTPSG 437
Score = 33.6 bits (77), Expect = 0.22
Identities = 14/60 (23%), Positives = 19/60 (31%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
PP +PPP P +P P P P + V + + R AA
Sbjct: 114 PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAA 173
Score = 33.2 bits (76), Expect = 0.27
Identities = 19/122 (15%), Positives = 29/122 (23%), Gaps = 1/122 (0%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
A PAA AS A P AP+ P P +
Sbjct: 157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRR-SSPISA 215
Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIA 293
+ S + S+ S +S++ ++ P LP
Sbjct: 216 SASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW 275
Query: 294 KR 295
Sbjct: 276 NG 277
Score = 33.2 bits (76), Expect = 0.28
Identities = 26/125 (20%), Positives = 41/125 (32%), Gaps = 15/125 (12%)
Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
+ S +A ++ +P G S+P PPPP A PPP P + PV S
Sbjct: 91 SLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAA 150
Query: 220 RVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
A+ + A + +S + + A + S+ P P
Sbjct: 151 SPPAAGASPAAVAS---------------DAASSRQAALPLSSPEETARAPSSPPAEPPP 195
Query: 280 FTDLP 284
T
Sbjct: 196 STPPA 200
Score = 32.5 bits (74), Expect = 0.56
Identities = 18/65 (27%), Positives = 27/65 (41%), Gaps = 2/65 (3%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPP--PKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
AP + A P PP A+ PP P +A A SP P P + + +S +
Sbjct: 184 RAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSE 243
Query: 231 AAEKG 235
++ G
Sbjct: 244 SSGCG 248
Score = 31.7 bits (72), Expect = 0.91
Identities = 15/47 (31%), Positives = 18/47 (38%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
A K+ + P + P P AAA PP AP P P P
Sbjct: 865 ARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRV 911
Score = 30.9 bits (70), Expect = 1.6
Identities = 17/44 (38%), Positives = 18/44 (40%), Gaps = 1/44 (2%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
P A A APP AAAPP P P P+P GG
Sbjct: 878 EPRARPGAAAPPKAA-AAAPPAGAPAPRPRPAPRVKLGPMPPGG 920
Score = 30.5 bits (69), Expect = 2.2
Identities = 20/69 (28%), Positives = 24/69 (34%), Gaps = 1/69 (1%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
A P + A P A PPP P AA +P P P S S A + A
Sbjct: 171 QAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP-PRRSSPISASASSPAPAPGRSAA 229
Query: 232 AEKGLDLSS 240
+ G S
Sbjct: 230 DDAGASSSD 238
Score = 30.1 bits (68), Expect = 2.8
Identities = 11/49 (22%), Positives = 16/49 (32%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
P P A PP AA P+ P P + + ++ P
Sbjct: 870 GRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPP 918
Score = 29.8 bits (67), Expect = 3.3
Identities = 22/132 (16%), Positives = 40/132 (30%), Gaps = 10/132 (7%)
Query: 168 KDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPK----------AAPAPSPTPVPSQKTSG 217
+ PPAA + P ++A+ P P A+ + S + S G
Sbjct: 191 AEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWG 250
Query: 218 GTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
P + + +S G S+ S ++ + + S + P +
Sbjct: 251 PENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPS 310
Query: 278 GPFTDLPVSGVR 289
P S R
Sbjct: 311 SPRASSSSSSSR 322
Score = 29.0 bits (65), Expect = 5.9
Identities = 16/121 (13%), Positives = 37/121 (30%), Gaps = 7/121 (5%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
P++ + +P P P P+P+ + +S +R S + ++ +
Sbjct: 277 GPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSR--ESSSSSTSSSSE 334
Query: 235 GLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAK 294
+++ G S S + S +P + + S R +
Sbjct: 335 SSRGAAVSPGPSP-----SRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389
Query: 295 R 295
Sbjct: 390 A 390
Score = 28.6 bits (64), Expect = 7.6
Identities = 12/38 (31%), Positives = 13/38 (34%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
A P A+A P P P K P P P P
Sbjct: 886 AAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923
Score = 28.6 bits (64), Expect = 7.8
Identities = 17/71 (23%), Positives = 22/71 (30%), Gaps = 8/71 (11%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVPSQKTS--GGTRVY 222
PPA +S P P A + P P +A A + T R
Sbjct: 354 SRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPR 413
Query: 223 ASPLAKRLAAE 233
SPL A+
Sbjct: 414 PSPLDAGAASG 424
Score = 28.6 bits (64), Expect = 8.4
Identities = 15/62 (24%), Positives = 19/62 (30%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
PP A PPKAA A P P+ + RV P+
Sbjct: 864 RARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923
Query: 234 KG 235
+G
Sbjct: 924 RG 925
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 35.8 bits (82), Expect = 0.052
Identities = 17/35 (48%), Positives = 18/35 (51%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
PPAA AP P PP A P P+ AP P P P
Sbjct: 766 PPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPP 800
Score = 35.0 bits (80), Expect = 0.084
Identities = 25/61 (40%), Positives = 29/61 (47%), Gaps = 8/61 (13%)
Query: 174 APPAAGASAPAPPPPKVAAAPPP-PPPKAAPAP------SPTPVPSQKTSGGTRVYASPL 226
A P A A A PP AP P PPP+A PAP +PTP P +G T + P
Sbjct: 754 ARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQP-PPQAGPTSMQLMPR 812
Query: 227 A 227
A
Sbjct: 813 A 813
Score = 32.7 bits (74), Expect = 0.45
Identities = 22/61 (36%), Positives = 25/61 (40%), Gaps = 9/61 (14%)
Query: 174 APPAAGASAPAPPP--------PKVAAAPPPPPPKAAP-APSPTPVPSQKTSGGTRVYAS 224
A P A A A PP P AA PP AAP AP+P P P + R +
Sbjct: 734 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGA 793
Query: 225 P 225
P
Sbjct: 794 P 794
Score = 31.6 bits (71), Expect = 0.93
Identities = 13/60 (21%), Positives = 17/60 (28%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
PP A P P A P A P+ P ++ + P A A
Sbjct: 697 PPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARP 756
Score = 28.9 bits (64), Expect = 6.3
Identities = 17/64 (26%), Positives = 19/64 (29%), Gaps = 10/64 (15%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAP--------APSPTPVPSQKTSGGTRVYASP 225
P A A P AA PP AAP AP P+ A+P
Sbjct: 704 MRPPAAPPGRAQRP--AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAP 761
Query: 226 LAKR 229
R
Sbjct: 762 GRAR 765
Score = 28.9 bits (64), Expect = 7.4
Identities = 15/61 (24%), Positives = 20/61 (32%), Gaps = 1/61 (1%)
Query: 175 PPAAGASAPAPPPPKVAAA-PPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
P GA+ P PPP P P+ P +Q+ + T P A A
Sbjct: 676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRAR 735
Query: 234 K 234
Sbjct: 736 P 736
Score = 28.5 bits (63), Expect = 9.5
Identities = 15/56 (26%), Positives = 20/56 (35%), Gaps = 2/56 (3%)
Query: 174 APPAAGASAPAPPPPKV--AAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
A A A+ A PP A PP P A P+ P ++ + P A
Sbjct: 714 AQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAA 769
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 35.3 bits (81), Expect = 0.062
Identities = 16/33 (48%), Positives = 16/33 (48%), Gaps = 3/33 (9%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
P A AS PAPP P V PPP P P P
Sbjct: 233 PSAPPASIPAPPIPPVIQYVAPPPV---PPPQP 262
Score = 34.1 bits (78), Expect = 0.15
Identities = 14/40 (35%), Positives = 16/40 (40%), Gaps = 2/40 (5%)
Query: 176 PAAGASAPAPPPPKVAAAPPPP--PPKAAPAPSPTPVPSQ 213
P +PA P A+ P PP P AP P P P
Sbjct: 223 PFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQP 262
>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
Length = 292
Score = 34.7 bits (79), Expect = 0.066
Identities = 19/76 (25%), Positives = 24/76 (31%), Gaps = 2/76 (2%)
Query: 148 VPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
VP D A + D+AP S P + A P P S
Sbjct: 126 VPEATAEAAAKAGPQDDEADGEQADEAPAHNPESVPTRAARETTEANRPTA--TPPQSSS 183
Query: 208 TPVPSQKTSGGTRVYA 223
V ++ G RVYA
Sbjct: 184 ALVITKPLRSGQRVYA 199
Score = 30.0 bits (67), Expect = 2.1
Identities = 30/128 (23%), Positives = 39/128 (30%), Gaps = 21/128 (16%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAP---SPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
GA P +++ P AP TPVP + P A AA K
Sbjct: 88 QGARDAGLVPVELSTPVARAPQVIDTAPPNDVATPVP-----------SVPEATAEAAAK 136
Query: 235 GLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAK 294
G AD + A +V + AN P P S VI K
Sbjct: 137 AGPQDDEADGE-------QADEAPAHNPESVPTRAARETTEANRPTATPPQSSSALVITK 189
Query: 295 RLLQSKQV 302
L ++V
Sbjct: 190 PLRSGQRV 197
Score = 28.5 bits (63), Expect = 6.6
Identities = 12/52 (23%), Positives = 17/52 (32%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
P A PP A P P P+A + P + G + +P
Sbjct: 102 TPVARAPQVIDTAPPNDVATPVPSVPEATAEAAAKAGPQDDEADGEQADEAP 153
>gnl|CDD|235658 PRK05972, ligD, ATP-dependent DNA ligase; Reviewed.
Length = 860
Score = 35.3 bits (82), Expect = 0.072
Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 5/58 (8%)
Query: 165 AAFKDFKDDAPPAA-GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
A FK ++D P A PAPP A A P P AA A + ++ G R+
Sbjct: 521 AVFKGLREDKPAREVVAERPAPP----ATAEPAAPAAAAAAAATAAAAAKAEVAGVRI 574
Score = 32.6 bits (75), Expect = 0.41
Identities = 14/70 (20%), Positives = 20/70 (28%)
Query: 167 FKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
K + A P P VA+ A +P P + K + G A
Sbjct: 158 IKHRDEFARPGDEYDVLEALPDSVASGRTMAAIAAGKGRAPKPFMTPKGNAGLAAAARAA 217
Query: 227 AKRLAAEKGL 236
A A +
Sbjct: 218 AAAAAKKAKK 227
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 34.7 bits (79), Expect = 0.095
Identities = 14/37 (37%), Positives = 15/37 (40%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P AA A + PP P A PP P AA P
Sbjct: 185 PGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQP 221
Score = 29.7 bits (66), Expect = 4.3
Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 3/68 (4%)
Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
E++SD +A + P S PP +A + PPP P A P + + +
Sbjct: 161 ESDSDSSAQQQLLQ---PQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQ 217
Query: 219 TRVYASPL 226
+ +
Sbjct: 218 PQQPSPLS 225
Score = 29.3 bits (65), Expect = 5.5
Identities = 13/45 (28%), Positives = 18/45 (40%), Gaps = 8/45 (17%)
Query: 174 APPAAGASAPAPPPPKVAAAPP--------PPPPKAAPAPSPTPV 210
+ P PP ++ P PPPP +P+P PT V
Sbjct: 495 SSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEPTVV 539
Score = 28.9 bits (64), Expect = 7.1
Identities = 11/49 (22%), Positives = 16/49 (32%), Gaps = 4/49 (8%)
Query: 175 PPAAGASAPAPPP----PKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
P + S P P P + P P + S P + +SG
Sbjct: 439 PHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQ 487
>gnl|CDD|236851 PRK11114, PRK11114, cellulose synthase regulator protein;
Provisional.
Length = 756
Score = 34.5 bits (80), Expect = 0.10
Identities = 14/43 (32%), Positives = 16/43 (37%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P A A+APA P AA P AA AP +
Sbjct: 18 FPAFATAAAPATQPLANAAGQVMPAAPAAAAPVVAQAAPSRDV 60
Score = 33.8 bits (78), Expect = 0.18
Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 1/51 (1%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
A PA A+A AP +A A P A A +P + + T +A
Sbjct: 17 AFPAF-ATAAAPATQPLANAAGQVMPAAPAAAAPVVAQAAPSRDVTLTFAQ 66
>gnl|CDD|237263 PRK12999, PRK12999, pyruvate carboxylase; Reviewed.
Length = 1146
Score = 34.7 bits (81), Expect = 0.11
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 8/54 (14%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFET----PEEGYLAKILVPAGSK 146
G++V+ KEGD++ GD LA IE M ET P +G + ++LV AG +
Sbjct: 1085 GSVVTVLVKEGDEVKAGDPLAVIE----AMKMETTITAPVDGTVKRVLVKAGDQ 1134
Score = 30.1 bits (69), Expect = 2.5
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIE 120
GT+ K GD++ GDLL E+E
Sbjct: 1122 GTVKRVLVKAGDQVEAGDLLVELE 1145
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 34.4 bits (79), Expect = 0.12
Identities = 12/37 (32%), Positives = 13/37 (35%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P G PPP AAA P P + A P
Sbjct: 702 GPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPE 738
Score = 32.8 bits (75), Expect = 0.40
Identities = 32/191 (16%), Positives = 52/191 (27%), Gaps = 38/191 (19%)
Query: 41 GTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGND-RVALPALSPTMEMGT- 98
G V++ G+ + ++ P+ V R R R A M +
Sbjct: 581 GNVTVRVTRPTGLASGASNAEDPE-----VRRARTRGARALAQARTFGRATVGEMIISGF 635
Query: 99 ----IVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLV 154
+ + + ++ E M T +A+ P+
Sbjct: 636 PPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARG--------DPVR--- 684
Query: 155 CIIVENESDVAAFKDFKDDAPPAAGASAP------APPPPKVAAAPPPPPPKAAPAPSPT 208
AP A P PPPP + AA P P ++ SPT
Sbjct: 685 ----------PTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPT 734
Query: 209 PVPSQKTSGGT 219
P + G
Sbjct: 735 APPEPEPPGAE 745
Score = 30.2 bits (68), Expect = 2.3
Identities = 10/40 (25%), Positives = 13/40 (32%), Gaps = 1/40 (2%)
Query: 175 PPAAGASAPAPP-PPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
PP A+A P + A+P PP P
Sbjct: 713 PPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQADGAEN 752
Score = 30.2 bits (68), Expect = 2.6
Identities = 16/61 (26%), Positives = 21/61 (34%), Gaps = 7/61 (11%)
Query: 175 PPAAGASAPAPPPPKV-------AAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
P A A+ AP P+ PPPP AA P+P + P A
Sbjct: 685 PTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGA 744
Query: 228 K 228
+
Sbjct: 745 E 745
Score = 29.8 bits (67), Expect = 3.3
Identities = 9/54 (16%), Positives = 13/54 (24%), Gaps = 10/54 (18%)
Query: 182 APAPPPPKVAAAPPPPPPKA----------APAPSPTPVPSQKTSGGTRVYASP 225
P P AA P P+ P P G + + +
Sbjct: 681 DPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPT 734
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 32.4 bits (74), Expect = 0.12
Identities = 12/37 (32%), Positives = 12/37 (32%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
P A APPP P PP P P P
Sbjct: 55 HLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPP 91
Score = 30.8 bits (70), Expect = 0.39
Identities = 15/52 (28%), Positives = 17/52 (32%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
A P + S PPP P P AP P P VY +P
Sbjct: 37 AQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTP 88
Score = 28.5 bits (64), Expect = 2.3
Identities = 14/38 (36%), Positives = 15/38 (39%), Gaps = 1/38 (2%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
APP G P P P P PPPP + P P
Sbjct: 67 APPPYGPPPPYYPAPP-GVYPTPPPPNSGYMADPQEPP 103
Score = 27.4 bits (61), Expect = 6.0
Identities = 12/46 (26%), Positives = 12/46 (26%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
P GA P P PPP P P P T
Sbjct: 46 YYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPP 91
Score = 27.0 bits (60), Expect = 8.8
Identities = 11/50 (22%), Positives = 12/50 (24%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
PP P P A PPP P P Y +
Sbjct: 48 PPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMA 97
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 32.5 bits (74), Expect = 0.13
Identities = 19/43 (44%), Positives = 23/43 (53%), Gaps = 7/43 (16%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV-PSQKTSGGT 219
+G+SAPAPP P P P A PAP+ V S +SGG
Sbjct: 12 SGSSAPAPPAG------PGPGPNAPPAPAAPGVDSSAGSSGGE 48
Score = 27.1 bits (60), Expect = 9.7
Identities = 13/45 (28%), Positives = 16/45 (35%), Gaps = 3/45 (6%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
+ A AP P APP P AAP + S +G
Sbjct: 9 GVSSGSSAPAPPAGPGPGPNAPPAP---AAPGVDSSAGSSGGEAG 50
>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
Length = 328
Score = 34.1 bits (78), Expect = 0.13
Identities = 16/74 (21%), Positives = 27/74 (36%)
Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSS 240
+ P VA P P A A + P + + +P+ A+ ++
Sbjct: 170 ATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAA 229
Query: 241 IGAGSGLFGSITSA 254
G +G GS+ SA
Sbjct: 230 GGKTAGNVGSLKSA 243
Score = 29.1 bits (65), Expect = 4.8
Identities = 16/47 (34%), Positives = 19/47 (40%), Gaps = 3/47 (6%)
Query: 174 APPAAGASAPAP---PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
A P A APA P PK A P + + TP KT+G
Sbjct: 189 AAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAG 235
Score = 28.4 bits (63), Expect = 6.9
Identities = 8/45 (17%), Positives = 12/45 (26%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
A A P + P AP + +P + T
Sbjct: 186 TEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAG 230
>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
This family represents the C-terminal region of
merozoite surface protein 1 (MSP1) which are found in a
number of Plasmodium species. MSP-1 is a 200-kDa protein
expressed on the surface of the P. vivax merozoite.
MSP-1 of Plasmodium species is synthesised as a
high-molecular-weight precursor and then processed into
several fragments. At the time of red cell invasion by
the merozoite, only the 19-kDa C-terminal fragment
(MSP-119), which contains two epidermal growth
factor-like domains, remains on the surface. Antibodies
against MSP-119 inhibit merozoite entry into red cells,
and immunisation with MSP-119 protects monkeys from
challenging infections. Hence, MSP-119 is considered a
promising vaccine candidate.
Length = 574
Score = 34.1 bits (78), Expect = 0.13
Identities = 11/55 (20%), Positives = 17/55 (30%), Gaps = 11/55 (20%)
Query: 166 AFKDFKDDAP-----PAAGASAPAPPPPKVAAAPPPPPPKAAPA------PSPTP 209
A + +K+ P A + PP A +P P + PT
Sbjct: 250 ALEQYKELLPKGTTQEAKVTTVVTPPQADAAPSPLSVRPAGSSGSASGSTQIPTS 304
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 32.5 bits (74), Expect = 0.14
Identities = 14/47 (29%), Positives = 15/47 (31%), Gaps = 6/47 (12%)
Query: 173 DAPPA------AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
DAPPA P A+AP P P P V Q
Sbjct: 3 DAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQ 49
Score = 31.7 bits (72), Expect = 0.31
Identities = 12/49 (24%), Positives = 13/49 (26%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYA 223
P A A AP P P P S P+ VY
Sbjct: 20 PQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYP 68
>gnl|CDD|234178 TIGR03354, VI_FHA, type VI secretion system FHA domain protein.
Members of this protein family are FHA
(forkhead-associated) domain-containing proteins that
are part of type VI secretion loci in a considerable
number of bacteria, most of which are known pathogens.
Species include Pseudomonas aeruginosa PAO1, Aeromonas
hydrophila, Yersinia pestis, Burkholderia mallei, etc
[Protein fate, Protein and peptide secretion and
trafficking, Cellular processes, Pathogenesis].
Length = 396
Score = 33.9 bits (78), Expect = 0.15
Identities = 18/67 (26%), Positives = 23/67 (34%), Gaps = 3/67 (4%)
Query: 158 VENESDVAAFKDFKD---DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
++ E AA D P A A A P PP P AP P+P P
Sbjct: 144 LDQEPLSAADLDDLSAPLFPPLDARLPAFAAPIDAEPTMVPPFVPLPAPEPAPAPASQAP 203
Query: 215 TSGGTRV 221
+S +
Sbjct: 204 SSDAVAL 210
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 33.9 bits (77), Expect = 0.15
Identities = 30/141 (21%), Positives = 45/141 (31%), Gaps = 7/141 (4%)
Query: 137 AKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPP 196
A +L P + + N++ V++ K ++ PP +APA P P P
Sbjct: 321 AAVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLP-QAAPAVVKPGPMEIPTP 379
Query: 197 PPPKAAPAPSPTPVPS--QKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLF----GS 250
PP PS P T ASP K + + ++
Sbjct: 380 VPPPGLAIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGALDDTLAWKEPSKED 439
Query: 251 ITSADLSKASKAGAVAAPSKS 271
TS D + G AA
Sbjct: 440 QTSEDGKMLAIMGEAAAALAL 460
>gnl|CDD|130302 TIGR01235, pyruv_carbox, pyruvate carboxylase. This enzyme plays a
role in gluconeogensis but not glycolysis [Energy
metabolism, Glycolysis/gluconeogenesis].
Length = 1143
Score = 34.0 bits (78), Expect = 0.16
Identities = 15/50 (30%), Positives = 26/50 (52%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
G I+ G +N+GD L +E K + P++G + ++LV AG +
Sbjct: 1083 GVIIEVKVSSGQAVNKGDPLVVLEAMKMETAIQAPKDGTIKEVLVKAGEQ 1132
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 33.5 bits (77), Expect = 0.17
Identities = 12/39 (30%), Positives = 13/39 (33%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
PA AP P + A P P P P P P
Sbjct: 149 QVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAP 187
Score = 32.7 bits (75), Expect = 0.32
Identities = 9/42 (21%), Positives = 14/42 (33%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
+ P S P P+ + P P +A P P +
Sbjct: 139 EQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPE 180
Score = 32.7 bits (75), Expect = 0.33
Identities = 12/42 (28%), Positives = 14/42 (33%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
P SAP P A P P+ P P PV +
Sbjct: 150 VAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDK 191
Score = 30.0 bits (68), Expect = 2.3
Identities = 17/51 (33%), Positives = 19/51 (37%), Gaps = 5/51 (9%)
Query: 175 PPAAGASAPAPPPPKVAAAPP-----PPPPKAAPAPSPTPVPSQKTSGGTR 220
P + APAP P A P P P AAP P P P+ R
Sbjct: 144 QPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKR 194
Score = 30.0 bits (68), Expect = 2.3
Identities = 10/47 (21%), Positives = 11/47 (23%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
A + PA A PA P P P P
Sbjct: 148 PQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKR 194
Score = 28.8 bits (65), Expect = 4.9
Identities = 14/50 (28%), Positives = 19/50 (38%), Gaps = 1/50 (2%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA-PSPTPVPSQ 213
AA + P ++ P P + A PP PA P+P PV
Sbjct: 83 AARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQP 132
Score = 28.8 bits (65), Expect = 5.4
Identities = 11/46 (23%), Positives = 13/46 (28%), Gaps = 1/46 (2%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
P A P P P P P P P P+Q+
Sbjct: 123 QPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVH-SAPQPAQQA 167
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 33.8 bits (77), Expect = 0.17
Identities = 10/47 (21%), Positives = 13/47 (27%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
P+ S P+ P + P P TP P G
Sbjct: 222 PTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAP 268
Score = 29.9 bits (67), Expect = 2.8
Identities = 13/62 (20%), Positives = 18/62 (29%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
AP A P P+ P + T PS + A+P A
Sbjct: 194 APRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEA 253
Query: 234 KG 235
+G
Sbjct: 254 EG 255
Score = 29.2 bits (65), Expect = 5.0
Identities = 8/48 (16%), Positives = 15/48 (31%), Gaps = 3/48 (6%)
Query: 175 PPAAGASAP--APPPPKVAAAPPPPPPKAAP-APSPTPVPSQKTSGGT 219
P ++P P P + P P + +P + + G
Sbjct: 210 PTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTP 257
>gnl|CDD|139048 PRK12538, PRK12538, RNA polymerase sigma factor; Provisional.
Length = 233
Score = 33.3 bits (76), Expect = 0.17
Identities = 13/32 (40%), Positives = 13/32 (40%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
PA A A P P VA AP P P P
Sbjct: 10 PAEAAPATTAPSPIVAEAPDAASVSIVPDPPP 41
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 33.7 bits (78), Expect = 0.19
Identities = 33/165 (20%), Positives = 53/165 (32%), Gaps = 9/165 (5%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
A A+A A K AA P +AA A + ++ A A+R A+
Sbjct: 58 AEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAK 117
Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIA 293
K +A K K G + + P + +
Sbjct: 118 KAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREV- 176
Query: 294 KRLLQSKQVIKLREQMN-------KALEKRGAKLSINDFIIKATA 331
+ ++ V +L E+M K L K G +IN + + TA
Sbjct: 177 -VIPETITVAELAEKMAVKAAEVIKKLFKLGVMATINQSLDQETA 220
>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal.
Length = 313
Score = 33.3 bits (76), Expect = 0.20
Identities = 13/38 (34%), Positives = 16/38 (42%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
+ P AA SA +APPPPPP P+
Sbjct: 209 WSKKGPVAAAKSALPAVSSSAPSAPPPPPPPPPPSVPT 246
Score = 28.6 bits (64), Expect = 5.9
Identities = 11/54 (20%), Positives = 19/54 (35%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
+A P V+++ P PP P P P+ + + + AE
Sbjct: 215 VAAAKSALPAVSSSAPSAPPPPPPPPPPSVPTISNSVESASSDSKGGRGAVFAE 268
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 33.1 bits (75), Expect = 0.22
Identities = 19/66 (28%), Positives = 24/66 (36%), Gaps = 7/66 (10%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS-----PTPVPSQKTSGGTRVYA--SPLA 227
P + AP PP P P A PS P P+ T T+V SP
Sbjct: 33 PSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETPQSPTT 92
Query: 228 KRLAAE 233
K++ E
Sbjct: 93 KQVPTE 98
>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein. Members of
this family confer resistance to the metalloid element
tellurium and its salts.
Length = 98
Score = 31.2 bits (71), Expect = 0.22
Identities = 12/22 (54%), Positives = 12/22 (54%)
Query: 182 APAPPPPKVAAAPPPPPPKAAP 203
AP PP A P P PP AAP
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAP 22
Score = 30.8 bits (70), Expect = 0.32
Identities = 12/22 (54%), Positives = 13/22 (59%)
Query: 191 AAAPPPPPPKAAPAPSPTPVPS 212
AAP PPP A PAP+P P
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAP 22
Score = 29.7 bits (67), Expect = 0.73
Identities = 12/23 (52%), Positives = 12/23 (52%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPP 198
PAA PAP PP A P PP
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAPP 23
Score = 29.3 bits (66), Expect = 1.3
Identities = 11/30 (36%), Positives = 13/30 (43%)
Query: 187 PPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P P P PP AP P+ PV K +
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAPPVSLSKIT 30
Score = 28.9 bits (65), Expect = 1.3
Identities = 12/23 (52%), Positives = 12/23 (52%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAP 205
PA P P A APP P P A P
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAPP 23
Score = 27.0 bits (60), Expect = 8.3
Identities = 9/23 (39%), Positives = 9/23 (39%)
Query: 174 APPAAGASAPAPPPPKVAAAPPP 196
APAPP P A PP
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAPP 23
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 32.9 bits (75), Expect = 0.22
Identities = 14/45 (31%), Positives = 17/45 (37%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
P AA P PP + A P P P P P P+ +G
Sbjct: 106 VPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKPVAEPAAAPTGQ 150
Score = 29.9 bits (67), Expect = 2.1
Identities = 12/37 (32%), Positives = 12/37 (32%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
A A P P P P K P P P PV
Sbjct: 104 AGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKPV 140
Score = 29.5 bits (66), Expect = 2.7
Identities = 11/41 (26%), Positives = 12/41 (29%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
AG A P V PP P+P P P
Sbjct: 101 STPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKPVA 141
Score = 28.7 bits (64), Expect = 5.6
Identities = 17/65 (26%), Positives = 22/65 (33%), Gaps = 2/65 (3%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT--RVYASPLAKR 229
D AA + P P + P P AA P P P Q +G + P K
Sbjct: 80 DAGDAAAASVDPQPVAQPPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKP 139
Query: 230 LAAEK 234
+A
Sbjct: 140 VAEPA 144
>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
This family consists mainly of the potato leaf roll
virus readthrough protein. This is generated via a
readthrough of open reading frame 3 a coat protein
allowing transcription of open reading frame 5 to give
an extended coat protein with a large c-terminal
addition or read through domain. The readthrough protein
is thought to play a role in the circulative aphid
transmission of potato leaf roll virus. Also in the
family is open reading frame 6 from beet western yellows
virus and potato leaf roll virus both luteovirus and an
unknown protein from cucurbit aphid-borne yellows virus
a closterovirus.
Length = 460
Score = 33.5 bits (77), Expect = 0.22
Identities = 13/29 (44%), Positives = 16/29 (55%), Gaps = 4/29 (13%)
Query: 186 PPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
P PK P PPPP AP+P P P++
Sbjct: 5 PGPKPGPTPTPPPP----APTPEPTPAKH 29
Score = 32.4 bits (74), Expect = 0.44
Identities = 9/24 (37%), Positives = 10/24 (41%)
Query: 184 APPPPKVAAAPPPPPPKAAPAPSP 207
P P PPPP P P P+
Sbjct: 5 PGPKPGPTPTPPPPAPTPEPTPAK 28
>gnl|CDD|234730 PRK00339, minC, septum formation inhibitor; Reviewed.
Length = 249
Score = 32.8 bits (75), Expect = 0.23
Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 2/51 (3%)
Query: 175 PPAAGASAPAPPPPKVA--AAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYA 223
PP+ P P P A P PA PT V + GG ++YA
Sbjct: 108 PPSGARERPLEPSPAEAEPEPAKAPEKPPEPAVRPTRVITTPVRGGQQIYA 158
>gnl|CDD|133457 cd06848, GCS_H, Glycine cleavage H-protein. Glycine cleavage
H-proteins are part of the glycine cleavage system (GCS)
found in bacteria, archea and the mitochondria of
eukaryotes. GCS is a multienzyme complex consisting of 4
different components (P-, H-, T- and L-proteins) which
catalyzes the oxidative cleavage of glycine. The
H-protein shuttles the methylamine group of glycine from
the P-protein (glycine dehydrogenase) to the T-protein
(aminomethyltransferase) via a lipoyl group, attached to
a completely conserved lysine residue.
Length = 96
Score = 31.0 bits (71), Expect = 0.24
Identities = 11/41 (26%), Positives = 19/41 (46%), Gaps = 1/41 (2%)
Query: 95 EMGTIVS-WAKKEGDKLNEGDLLAEIETDKATMGFETPEEG 134
+G IV + G ++ +GD +E+ KA +P G
Sbjct: 27 LLGDIVFVELPEVGTEVKKGDPFGSVESVKAASDLYSPVSG 67
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 33.1 bits (76), Expect = 0.25
Identities = 13/61 (21%), Positives = 21/61 (34%), Gaps = 4/61 (6%)
Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPP----KVAAAPPPPPPKAAPAPSPTPVPSQKT 215
D D P ++ P PP + + PP P + + +P P P T
Sbjct: 173 PGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPT 232
Query: 216 S 216
+
Sbjct: 233 N 233
Score = 32.3 bits (74), Expect = 0.37
Identities = 21/114 (18%), Positives = 34/114 (29%), Gaps = 16/114 (14%)
Query: 112 EGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFK 171
E + T+ + A D P + V +F
Sbjct: 157 EDEDADVATTNSDNSFPGEDADP------ASASPSDPP---------SSSPGVPSFPSPP 201
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
+D + +S P P P + PPP P++ PSP P P+ +
Sbjct: 202 EDPSSPSDSSLP-PAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPL 254
Score = 31.6 bits (72), Expect = 0.74
Identities = 16/54 (29%), Positives = 20/54 (37%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
D + P A +S + PP +P P P PA P P Q T P
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPP 262
Score = 29.7 bits (67), Expect = 2.9
Identities = 17/54 (31%), Positives = 19/54 (35%), Gaps = 6/54 (11%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPP------PPKAAPAPSPTPVPSQKTSGGT 219
D PP+ + PPP AA PPPP A P P G T
Sbjct: 222 DTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275
>gnl|CDD|179538 PRK03100, PRK03100, sec-independent translocase; Provisional.
Length = 136
Score = 31.9 bits (73), Expect = 0.26
Identities = 8/37 (21%), Positives = 10/37 (27%), Gaps = 2/37 (5%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
D + +P APP P A P
Sbjct: 96 NFDQTGNGRPTPSSPDAA--GPAPPDATPLAPGERPP 130
Score = 27.3 bits (61), Expect = 8.2
Identities = 6/40 (15%), Positives = 8/40 (20%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
D G ++P P A P
Sbjct: 86 LDGDDSLFTGNFDQTGNGRPTPSSPDAAGPAPPDATPLAP 125
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 33.3 bits (76), Expect = 0.27
Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 3/50 (6%)
Query: 174 APPAAGASAPAPPP---PKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
AP AA +A AP P P+ P P P ++A AP + R
Sbjct: 380 APTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAGLALR 429
Score = 31.7 bits (72), Expect = 0.70
Identities = 12/57 (21%), Positives = 19/57 (33%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
+ + + SAP A P P+ P+P P S + A+P
Sbjct: 369 RLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAG 425
Score = 29.0 bits (65), Expect = 5.0
Identities = 11/69 (15%), Positives = 15/69 (21%), Gaps = 4/69 (5%)
Query: 176 PAAGASAPAPPPPKVAAAP----PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
PA +V + AP P P P + A
Sbjct: 360 PADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAP 419
Query: 232 AEKGLDLSS 240
A L+
Sbjct: 420 AAAPAGLAL 428
Score = 28.3 bits (63), Expect = 8.5
Identities = 9/35 (25%), Positives = 9/35 (25%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
D P G AP P A P A
Sbjct: 394 DFDPRPRGPPAPEPARSAEAPPLVAPAAAPAGLAL 428
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 33.2 bits (75), Expect = 0.27
Identities = 36/142 (25%), Positives = 52/142 (36%), Gaps = 22/142 (15%)
Query: 148 VPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS- 206
+PI +L + D A AP A + AP P +AA P AP P+
Sbjct: 26 IPIPELHTSAATQKPDPAP-------APHQAASRAPDPAVAPTSAASRKPDLAQAPTPAA 78
Query: 207 ---PTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAG 263
P P +R +A +LAA D + + TSA + + A
Sbjct: 79 SEKFDPAP-APHQAASRAPDPAVAPQLAAAPKPDAAE---------AFTSAAQAHEAPAD 128
Query: 264 A-VAAPSKSAKPTANGPFTDLP 284
A +A SK P A+ + P
Sbjct: 129 AGTSAASKKPDPAAHTQHSPPP 150
>gnl|CDD|221079 pfam11321, DUF3123, Protein of unknown function (DUF3123). This
eukaryotic family of proteins has no known function.
Length = 113
Score = 31.5 bits (71), Expect = 0.27
Identities = 16/43 (37%), Positives = 17/43 (39%)
Query: 166 AFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
A KD K PAA A A AA + A AP PT
Sbjct: 55 ARKDVKLIGAPAAAAPAVTSSTATDAAVRRARSRRNAAAPRPT 97
>gnl|CDD|205711 pfam13533, Biotin_lipoyl_2, Biotin-lipoyl like.
Length = 50
Score = 29.8 bits (68), Expect = 0.28
Identities = 8/27 (29%), Positives = 18/27 (66%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDK 123
G +V+ KEG + +GD+L +++++
Sbjct: 11 GRVVAVNVKEGQSVKKGDVLFTLDSEE 37
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 31.2 bits (71), Expect = 0.29
Identities = 10/36 (27%), Positives = 13/36 (36%)
Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P PP + PP P+ P SP + S
Sbjct: 31 PPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFS 66
Score = 28.9 bits (65), Expect = 1.6
Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 2/45 (4%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
PP PP P+ P P + P + + K S GT
Sbjct: 36 PPQQQQGHFMPPQPQ--PYPKQSPQQQQPPQFSSFLSQFKNSDGT 78
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 33.1 bits (76), Expect = 0.29
Identities = 13/32 (40%), Positives = 13/32 (40%), Gaps = 3/32 (9%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
D PPA PA P V AA P PA
Sbjct: 536 DVPPA---PTPAEPAAPVVAAAPKAAAATPPA 564
Score = 32.7 bits (75), Expect = 0.42
Identities = 13/39 (33%), Positives = 16/39 (41%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
P A + P PP A P P AAP + P+Q
Sbjct: 527 PALATFAMPDVPPAPTPAEPAAPVVAAAPKAAAATPPAQ 565
Score = 29.6 bits (67), Expect = 3.6
Identities = 10/40 (25%), Positives = 10/40 (25%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
A AP VA P A P P V
Sbjct: 874 PVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVE 913
>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding. The F-actin binding
domain forms a compact bundle of four antiparallel
alpha-helices, which are arranged in a left-handed
topology. Binding of F-actin to the F-actin binding
domain may result in cytoplasmic retention and
subcellular distribution of the protein, as well as
possible inhibition of protein function.
Length = 179
Score = 32.0 bits (72), Expect = 0.31
Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 1/103 (0%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLS 239
PA P P+ A P PP P PS +P PS K + GT+ ++ ++ L +
Sbjct: 6 PVPPAVPKPQSTAKPVGTPPSPVPLPSTSPSPS-KMANGTQPSSAAFIPLISTRVSLRKT 64
Query: 240 SIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTD 282
G IT + +++A +A S + ++ +
Sbjct: 65 RQPPERIASGKITKGVVLDSTEALCLAISRNSEQMASHSAVLE 107
>gnl|CDD|182725 PRK10781, rcsF, outer membrane lipoprotein; Reviewed.
Length = 133
Score = 31.6 bits (72), Expect = 0.32
Identities = 10/30 (33%), Positives = 11/30 (36%)
Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
S P + A PP P AP PV
Sbjct: 20 SRSPVEPVQSTAPPPKAEPAKPKAPRAAPV 49
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 32.7 bits (74), Expect = 0.33
Identities = 10/38 (26%), Positives = 13/38 (34%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
PA P + P P + +P P Q G R
Sbjct: 221 PAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKR 258
Score = 30.8 bits (69), Expect = 1.3
Identities = 9/43 (20%), Positives = 12/43 (27%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
P P P + A P P P P + + G
Sbjct: 221 PAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSREQG 263
Score = 28.5 bits (63), Expect = 8.4
Identities = 8/39 (20%), Positives = 11/39 (28%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
A P + P P + A P+P Q
Sbjct: 210 QNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQ 248
Score = 28.5 bits (63), Expect = 8.4
Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 2/32 (6%)
Query: 176 PAAGASAPAPPPPKVAAAP--PPPPPKAAPAP 205
PA G PA +A+ PP PP+ APAP
Sbjct: 152 PAGGTYIPAGGTYILASGTYIPPNPPREAPAP 183
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 32.3 bits (74), Expect = 0.34
Identities = 16/46 (34%), Positives = 17/46 (36%), Gaps = 3/46 (6%)
Query: 172 DDAPPAAGASAPAP---PPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
D PP A P P P P+ P PP P P P P K
Sbjct: 57 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPK 102
Score = 31.2 bits (71), Expect = 0.90
Identities = 16/34 (47%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 183 PAPPPPKVAAAP-PPPPPKAAPAPSPTPVPSQKT 215
P P PPK A P P PK P P P P P +K
Sbjct: 80 PIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKV 113
Score = 30.8 bits (70), Expect = 1.3
Identities = 13/33 (39%), Positives = 13/33 (39%), Gaps = 4/33 (12%)
Query: 183 PAPPPP----KVAAAPPPPPPKAAPAPSPTPVP 211
PAP P VA A PP P P P P
Sbjct: 42 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEP 74
Score = 29.7 bits (67), Expect = 2.6
Identities = 15/44 (34%), Positives = 16/44 (36%), Gaps = 6/44 (13%)
Query: 174 APPAAGASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVP 211
AP PPP+ P P PPK AP P P P
Sbjct: 54 APADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEP 97
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 32.8 bits (75), Expect = 0.35
Identities = 18/68 (26%), Positives = 23/68 (33%), Gaps = 7/68 (10%)
Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
++S VA F D +P A P P P PPP +A A T G+
Sbjct: 40 SQSGVAEF--PWDPSPQA-----PPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGS 92
Query: 220 RVYASPLA 227
P
Sbjct: 93 PAANPPAT 100
>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein. A novel
genetic system characterized by six major proteins,
included a ParB homolog and a ThiF homolog, is
designated PRTRC, or ParB-Related,ThiF-Related Cassette.
It is often found on plasmids. This protein family the
member related to ParB, and is designated PRTRC system
ParB family protein.
Length = 554
Score = 32.8 bits (75), Expect = 0.35
Identities = 22/83 (26%), Positives = 30/83 (36%), Gaps = 9/83 (10%)
Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPP----PPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
N + VAA K + A AA A P K + AA + VPS +
Sbjct: 314 NTTKVAARKKAERAAAAAAQKPAAPAAGPGTPAKEKSPAETATSGAAKPAAKKAVPSSQP 373
Query: 216 SGG-----TRVYASPLAKRLAAE 233
S +V+ LA+ LA
Sbjct: 374 SNRVKDYREKVWRKALARELALN 396
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 32.7 bits (75), Expect = 0.36
Identities = 13/42 (30%), Positives = 16/42 (38%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
D + A+ PAP P P A AP+P P Q
Sbjct: 165 DTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQ 206
Score = 30.4 bits (69), Expect = 2.0
Identities = 9/48 (18%), Positives = 13/48 (27%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
+ PA + P+ A P A +P P G
Sbjct: 188 NSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAA 235
Score = 30.0 bits (68), Expect = 2.0
Identities = 15/65 (23%), Positives = 17/65 (26%), Gaps = 4/65 (6%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPP----KAAPAPSPTPVPSQKTSGGTRVYASPLA 227
AP A + P A P P A APS V + T A
Sbjct: 175 TPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234
Query: 228 KRLAA 232
L
Sbjct: 235 APLPT 239
Score = 29.6 bits (67), Expect = 2.8
Identities = 17/91 (18%), Positives = 23/91 (25%), Gaps = 6/91 (6%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLS 239
+ PA P A P PA + P P+ V S AA
Sbjct: 169 TTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAAT------ 222
Query: 240 SIGAGSGLFGSITSADLSKASKAGAVAAPSK 270
A +A + A P+
Sbjct: 223 PAPAAPATPDGAAPLPTDQAGVSTPAADPNA 253
Score = 29.6 bits (67), Expect = 3.2
Identities = 10/42 (23%), Positives = 11/42 (26%), Gaps = 1/42 (2%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
PAA A A P P + PSQ
Sbjct: 174 TTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVA-PSQ 214
Score = 29.6 bits (67), Expect = 3.3
Identities = 10/43 (23%), Positives = 13/43 (30%), Gaps = 3/43 (6%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP---TPVPSQ 213
PA A P A P A +P P+P+
Sbjct: 198 PAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTD 240
Score = 28.8 bits (65), Expect = 5.8
Identities = 11/56 (19%), Positives = 15/56 (26%), Gaps = 1/56 (1%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
P A P + A P P A V + + T +P A
Sbjct: 173 ATTPAPAAPVDTTPTNSQTPAVATAPAP-AVDPQQNAVVAPSQANVDTAATPAPAA 227
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 32.3 bits (74), Expect = 0.41
Identities = 12/49 (24%), Positives = 15/49 (30%), Gaps = 5/49 (10%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAA-----PAPSPTPVPSQKTSG 217
PA + P PP P K A PA S P ++
Sbjct: 188 KVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASAR 236
Score = 31.6 bits (72), Expect = 0.80
Identities = 8/46 (17%), Positives = 9/46 (19%), Gaps = 1/46 (2%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS-QKTSGG 218
P PA K A PP P +
Sbjct: 199 PPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASARALSSAPAS 244
Score = 28.5 bits (64), Expect = 6.2
Identities = 7/39 (17%), Positives = 9/39 (23%), Gaps = 3/39 (7%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
AP PP P PA + +
Sbjct: 182 APSKGAKVPATAETHPT---PPQKPATKKPAVNHHKTAT 217
>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
Length = 290
Score = 32.3 bits (73), Expect = 0.41
Identities = 16/83 (19%), Positives = 29/83 (34%), Gaps = 4/83 (4%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKG 235
P A A P P + A P + + +P P + + ++P ++ A
Sbjct: 93 PHAQPEASGPGPAR--GARGPAGSRGRGRRAESPSP--RDPPNPKGASAPRGRKSACADS 148
Query: 236 LDLSSIGAGSGLFGSITSADLSK 258
L A + T A L++
Sbjct: 149 AALLDAPAPAAPKRQKTPAGLAR 171
>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region.
Length = 57
Score = 29.4 bits (66), Expect = 0.44
Identities = 13/34 (38%), Positives = 15/34 (44%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
PP S P P V +PPPP K+ P P
Sbjct: 17 PPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYV 50
Score = 27.4 bits (61), Expect = 1.9
Identities = 11/33 (33%), Positives = 13/33 (39%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
PP +P PP PPPP +P P
Sbjct: 16 PPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPK 48
Score = 27.0 bits (60), Expect = 2.7
Identities = 17/51 (33%), Positives = 20/51 (39%), Gaps = 2/51 (3%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
PP PPP + PPPP K+ SP P + VY SP
Sbjct: 6 PPPPVKQYSPPPPYYYKS--PPPPVKSPVYKSPPPPVYKSPPPPKYVYKSP 54
Score = 26.3 bits (58), Expect = 5.4
Identities = 14/38 (36%), Positives = 17/38 (44%), Gaps = 1/38 (2%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
+K PP +PPPP V +PPPP P P
Sbjct: 21 YKSPPPPVKSPVYKSPPPP-VYKSPPPPKYVYKSPPPP 57
Score = 25.9 bits (57), Expect = 8.1
Identities = 17/41 (41%), Positives = 17/41 (41%), Gaps = 1/41 (2%)
Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
PPP V PPPP P P P K S VY SP
Sbjct: 5 SPPPPVKQYSPPPPYYYKSPPPPVKSPVYK-SPPPPVYKSP 44
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
region.
Length = 295
Score = 32.0 bits (72), Expect = 0.47
Identities = 20/46 (43%), Positives = 21/46 (45%), Gaps = 3/46 (6%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPV---PSQKTSGGTRVYASP 225
P P APPPPPP A P P P P P+ T GG SP
Sbjct: 217 PNHFLPTPMLAPPPPPPMARPVPLPMPDTKPPTTSTEGGATSPTSP 262
>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
protein.
Length = 314
Score = 32.2 bits (73), Expect = 0.47
Identities = 12/49 (24%), Positives = 17/49 (34%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
+ PAPPP ++ P APS + + V AS
Sbjct: 117 EPSPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSPRRVIVQAS 165
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 32.6 bits (74), Expect = 0.47
Identities = 14/45 (31%), Positives = 15/45 (33%), Gaps = 7/45 (15%)
Query: 174 APPAAGASAPAPPPPKV-------AAAPPPPPPKAAPAPSPTPVP 211
P PPP A A P PP PAP P P+
Sbjct: 161 GPRPVPTPPGGTPPPDDDEGDEAGAPATPAPPLHPPPAPHPHPIA 205
Score = 29.9 bits (67), Expect = 2.8
Identities = 13/37 (35%), Positives = 13/37 (35%), Gaps = 5/37 (13%)
Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
DD AGA A PP PPP P P
Sbjct: 174 PPDDDEGDEAGAPATPAPP-----LHPPPAPHPHPIA 205
>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
Length = 248
Score = 31.8 bits (72), Expect = 0.49
Identities = 14/34 (41%), Positives = 17/34 (50%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAP 203
F+ AA + P PPPP A P PP + AP
Sbjct: 108 FRAQYERAAVSPPPPPPPPPARAEPAPPVARPAP 141
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 31.7 bits (72), Expect = 0.49
Identities = 14/46 (30%), Positives = 17/46 (36%), Gaps = 2/46 (4%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
P+ G++ P P P P K A P S T GG V
Sbjct: 157 PSDGSATPNTSAPPTPGNPAAQPEKPAETPKGN--GSSFTFGGLTV 200
>gnl|CDD|183757 PRK12800, fliF, flagellar MS-ring protein; Reviewed.
Length = 574
Score = 32.3 bits (73), Expect = 0.54
Identities = 15/48 (31%), Positives = 22/48 (45%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVY 222
PP A +++P PP AA P P A + P++ + TR Y
Sbjct: 317 PPGATSNSPGQPPAPAAAGAPGTPAAANGQAAAAAAPTESSKSATRNY 364
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 31.7 bits (72), Expect = 0.55
Identities = 14/33 (42%), Positives = 14/33 (42%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
P PP PK P P K P P P P P K
Sbjct: 79 PTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111
Score = 29.4 bits (66), Expect = 3.4
Identities = 17/104 (16%), Positives = 27/104 (25%), Gaps = 6/104 (5%)
Query: 175 PPAAGASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAK 228
P P PP P P P P P K +V P K
Sbjct: 59 EPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPK 118
Query: 229 RLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSA 272
+ ++ + S SA + + + + + A
Sbjct: 119 KPPSKTAAKAPAAPNQPARPPSAASASGAATGPSASYLSGLRRA 162
>gnl|CDD|236658 PRK10153, PRK10153, DNA-binding transcriptional activator CadC;
Provisional.
Length = 517
Score = 32.3 bits (74), Expect = 0.55
Identities = 16/60 (26%), Positives = 16/60 (26%), Gaps = 4/60 (6%)
Query: 152 KLVC--IIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
KLV I E P A A P A P PP A SP
Sbjct: 99 KLVAPVIWYSEEEGEEILAS--SPPPIPEAVPATAEPSESANAQFPAPPLTRAVCQSPAK 156
Score = 31.2 bits (71), Expect = 1.1
Identities = 12/37 (32%), Positives = 15/37 (40%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
AS+P P P V A P A P+P + S
Sbjct: 117 ASSPPPIPEAVPATAEPSESANAQFPAPPLTRAVCQS 153
>gnl|CDD|234797 PRK00575, tatA, twin arginine translocase protein A; Provisional.
Length = 92
Score = 30.1 bits (68), Expect = 0.55
Identities = 9/49 (18%), Positives = 14/49 (28%)
Query: 168 KDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
K+ + D A A+A P P A S + +
Sbjct: 44 KEMQSDNKAEASAAAAQAPYQVATPTPVQSQRVDPAAASGQDSTEARPA 92
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 32.0 bits (73), Expect = 0.58
Identities = 11/44 (25%), Positives = 15/44 (34%), Gaps = 2/44 (4%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
A G + P P ++ P P AA + S SG
Sbjct: 248 TLAPGVLEGSEPTPTAPSSAPATAPAAAAPQAAATSSS--ASGN 289
Score = 30.4 bits (69), Expect = 1.7
Identities = 15/58 (25%), Positives = 20/58 (34%), Gaps = 5/58 (8%)
Query: 172 DDAPPAAGAS-----APAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
+D A S AP P V P P + AP+ P + + T AS
Sbjct: 230 EDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAPSSAPATAPAAAAPQAAATSSSAS 287
>gnl|CDD|223010 PHA03215, PHA03215, nuclear protein UL24; Provisional.
Length = 262
Score = 31.8 bits (72), Expect = 0.60
Identities = 15/42 (35%), Positives = 15/42 (35%), Gaps = 2/42 (4%)
Query: 179 GASAPAPP--PPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
GA P P AA PP P AAP P GG
Sbjct: 203 GARRPRSRRRPRGGAAGPPVPAEPAAPERPPADPTPTSARGG 244
Score = 28.3 bits (63), Expect = 7.9
Identities = 11/32 (34%), Positives = 12/32 (37%), Gaps = 3/32 (9%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
P A P P P AAP PP P +
Sbjct: 213 PRGGAAGPPVPAEP---AAPERPPADPTPTSA 241
>gnl|CDD|236511 PRK09429, mepA, penicillin-insensitive murein endopeptidase;
Reviewed.
Length = 275
Score = 31.8 bits (73), Expect = 0.61
Identities = 16/44 (36%), Positives = 17/44 (38%), Gaps = 13/44 (29%)
Query: 171 KDDAPPAAG-------AS------APAPPPPKVAAAPPPPPPKA 201
+D APP G AS P P K PPPP P A
Sbjct: 224 EDQAPPPPGDGCGAELASWSDAPFEPPKPTTKPKPKPPPPLPPA 267
>gnl|CDD|221020 pfam11191, DUF2782, Protein of unknown function (DUF2782). This is
a bacterial family of proteins whose function is
unknown.
Length = 104
Score = 30.0 bits (68), Expect = 0.65
Identities = 10/30 (33%), Positives = 10/30 (33%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAP 203
AP A A A P A P P P
Sbjct: 10 APALAAADAATDDDPIPLPAIPTTPEDKEP 39
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 31.1 bits (70), Expect = 0.69
Identities = 14/33 (42%), Positives = 15/33 (45%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
P G P PP P PPPPP +P P P
Sbjct: 116 PGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPP 148
Score = 30.7 bits (69), Expect = 0.91
Identities = 18/41 (43%), Positives = 20/41 (48%), Gaps = 3/41 (7%)
Query: 174 APPAAGASAPAPPPP---KVAAAPPPPPPKAAPAPSPTPVP 211
PP G + PPPP A PPPP P A P P P+P
Sbjct: 27 PPPLPGGTGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLP 67
Score = 29.9 bits (67), Expect = 1.5
Identities = 18/37 (48%), Positives = 19/37 (51%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P AG P P P A PPPP P AA P P P+P
Sbjct: 55 PGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLP 91
Score = 29.5 bits (66), Expect = 1.8
Identities = 14/37 (37%), Positives = 15/37 (40%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P P P P PPPP P A P P P+P
Sbjct: 19 PGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLP 55
Score = 29.1 bits (65), Expect = 2.5
Identities = 21/41 (51%), Positives = 23/41 (56%), Gaps = 3/41 (7%)
Query: 174 APPAAGASAPAPPPP--KVAAAPPPPP-PKAAPAPSPTPVP 211
PP G +A PPPP VA PPPPP P A P P P+P
Sbjct: 39 PPPLPGGAAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLP 79
Score = 29.1 bits (65), Expect = 2.6
Identities = 15/36 (41%), Positives = 17/36 (47%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
PP GA PPPP + + PPPP PA P
Sbjct: 125 PPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAPVLP 160
Score = 29.1 bits (65), Expect = 2.8
Identities = 18/37 (48%), Positives = 18/37 (48%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P AG P PP P AA PPPPP P P P P
Sbjct: 91 PGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPP 127
Score = 28.8 bits (64), Expect = 3.8
Identities = 17/36 (47%), Positives = 17/36 (47%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P A P PP P A PPPPPP A P P P
Sbjct: 79 PGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPPP 114
Score = 27.6 bits (61), Expect = 8.6
Identities = 15/37 (40%), Positives = 16/37 (43%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
P G P P P V PPPP P P P P+P
Sbjct: 7 PGGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLP 43
>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus. This family includes the
NUT protein. The gene encoding for NUT protein (Nuclear
Testis protein) is found fused to BRD3 or BRD4 genes, in
some aggressive types of carcinoma, due to chromosomal
translocations. Proteins of the BRD family contain two
bromodomains that bind transcriptionally active
chromatin through associations with acetylated histones
H3 and H4. Such proteins are crucial for the regulation
of cell cycle progression. On the other hand, little is
known about NUT protein. NUT is known to have a Nuclear
Export Sequence (NES) as well as a Nuclear Localization
Signal (NLS), both located towards the C-terminal end of
the protein. A fused NUT-GFP protein showed either
cytoplasmic or nuclear localization, suggesting that it
is subject to nuclear/cytoplasmic shuttling. Consistent
with this possibility, treatment with leptomycin B an
inhibitor of CRM1-dependent nuclear export resulted in
re-distribution of NUT-GFP to the nucleus. Inspection of
NUT revealed a C-terminal sequence similar to known
nuclear export sequences (NES) which are often regulated
by phosphorylation.
Length = 328
Score = 31.5 bits (71), Expect = 0.71
Identities = 15/52 (28%), Positives = 15/52 (28%), Gaps = 5/52 (9%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPT-----PVPSQKTSGGTRVYAS 224
G P PPPP P A P P P QK S G
Sbjct: 151 LGLPLPPPPPPVAQLVPIVSLENAWPGPQGATGEGGPAAIQKPSPGDYSSKP 202
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 31.6 bits (72), Expect = 0.72
Identities = 13/40 (32%), Positives = 17/40 (42%)
Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
G S P PPPP ++ PPP A S ++ G
Sbjct: 208 GGSGPGPPPPGFKSSFPPPYGPGAGPSSGYGSGGTRSGQG 247
>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631). The
members of this family are sequences derived from a
group of hypothetical proteins expressed by certain
bacterial species. The region concerned is approximately
440 amino acid residues in length.
Length = 729
Score = 31.9 bits (73), Expect = 0.75
Identities = 13/44 (29%), Positives = 18/44 (40%)
Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
+ AF+ + A A AP P+VA P+ A AP
Sbjct: 595 LEAFQAPPELAAQLTALPAVAPAEPRVAEDIVAVAPEEARAPRL 638
>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
Length = 400
Score = 31.7 bits (72), Expect = 0.76
Identities = 14/61 (22%), Positives = 18/61 (29%), Gaps = 5/61 (8%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
+ A SA A + P A A +P P T+ A P K A
Sbjct: 244 SEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAA-----AEPAPKEAAKA 298
Query: 234 K 234
Sbjct: 299 A 299
Score = 31.3 bits (71), Expect = 0.94
Identities = 15/53 (28%), Positives = 20/53 (37%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
E++ A A A AP P AA P P +AA A + P+
Sbjct: 255 ETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAAAAAKPAL 307
Score = 30.5 bits (69), Expect = 1.6
Identities = 25/113 (22%), Positives = 32/113 (28%), Gaps = 9/113 (7%)
Query: 162 SDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
+ A +D D G P P + AAP PP AA S +
Sbjct: 208 ASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPVPPSE-AARPKSADAETNAALKTPATA 266
Query: 222 YASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKP 274
+ A E + A A A A A A P+ KP
Sbjct: 267 PKAAAKNAKAPEAQPVSGTAAA--------EPAPKEAAKAAAAAAKPALEDKP 311
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 31.4 bits (71), Expect = 0.77
Identities = 13/56 (23%), Positives = 17/56 (30%)
Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
E + A +AP + AP P P P + P P P K
Sbjct: 126 EQPPEEARQPVLPQEAPAPQPVHSAAPQPAVQTVQPAVPEQQVQPEEVVEPAPEVK 181
>gnl|CDD|218839 pfam05983, Med7, MED7 protein. This family consists of several
eukaryotic proteins which are homologues of the yeast
MED7 protein. Activation of gene transcription in
metazoans is a multi-step process that is triggered by
factors that recognise transcriptional enhancer sites in
DNA. These factors work with co-activators such as MED7
to direct transcriptional initiation by the RNA
polymerase II apparatus.
Length = 161
Score = 30.7 bits (70), Expect = 0.79
Identities = 15/62 (24%), Positives = 23/62 (37%), Gaps = 17/62 (27%)
Query: 181 SAPAPPPP----------KVAAAPPPPPPKA-------APAPSPTPVPSQKTSGGTRVYA 223
S PPPP ++ PPPPPP +PS ++ G ++Y
Sbjct: 4 SLYPPPPPYYKLFTDENLELRFLPPPPPPTEGSYRVFGETYSLEDKLPSLESQGIEQLYP 63
Query: 224 SP 225
+
Sbjct: 64 TS 65
>gnl|CDD|180801 PRK07033, PRK07033, hypothetical protein; Provisional.
Length = 427
Score = 31.6 bits (72), Expect = 0.79
Identities = 14/48 (29%), Positives = 15/48 (31%)
Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
N S PP G PA P A P P AA +P
Sbjct: 1 NTSSDPFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGRGAASGLNP 48
Score = 30.0 bits (68), Expect = 2.6
Identities = 13/46 (28%), Positives = 15/46 (32%), Gaps = 3/46 (6%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
P +AG+ PP P P P A AP SG
Sbjct: 4 SDPFSAGSGGFVPPNPG-DRTPAAAPA--AGAPFQPRPGRGAASGL 46
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is found
to the N-terminus of the pfam00170 transcription factor
domain. It is between 150 and 200 amino acids in length.
The N-terminal half is rather rich in proline residues
and has been termed the PRD (proline rich domain),
whereas the C-terminal half is more polar and has been
called the MFMR (multifunctional mosaic region). It has
been suggested that this family is composed of three
sub-families called A, B and C, classified according to
motif composition. It has been suggested that some of
these motifs may be involved in mediating
protein-protein interactions. The MFMR region contains a
nuclear localisation signal in bZIP opaque and GBF-2.
The MFMR also contains a transregulatory activity in
TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
signals.
Length = 189
Score = 31.0 bits (70), Expect = 0.84
Identities = 29/96 (30%), Positives = 39/96 (40%), Gaps = 9/96 (9%)
Query: 186 PPPKVAAAPP------PPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL-AAEKGLDL 238
PPP A PP P P + SP +PS + G T + AK +KG
Sbjct: 75 PPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPGSTPLSMETDAKSSDNKDKGSIK 134
Query: 239 SSIG--AGSGLFGSITSADLSKASKAGAVAAPSKSA 272
S G GL S + + KAS + A S+S+
Sbjct: 135 KSKGSDGSLGLAMSGKNGESGKASGSSANGGSSQSS 170
>gnl|CDD|177677 PLN00045, PLN00045, photosystem I reaction center subunit IV;
Provisional.
Length = 101
Score = 29.6 bits (66), Expect = 0.86
Identities = 13/36 (36%), Positives = 17/36 (47%)
Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPP 199
V A +D + +A + A A P AA P PPP
Sbjct: 2 VRAAEDAEPATSSSAASPAAAAAPAAAPAAKPKPPP 37
Score = 28.8 bits (64), Expect = 1.8
Identities = 10/33 (30%), Positives = 13/33 (39%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
A A++ + P AAAP P P P
Sbjct: 8 AEPATSSSAASPAAAAAPAAAPAAKPKPPPIGP 40
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 31.2 bits (71), Expect = 0.87
Identities = 7/43 (16%), Positives = 11/43 (25%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P +AP P P P +P ++
Sbjct: 140 VTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVEAEKEKE 182
Score = 30.0 bits (68), Expect = 2.3
Identities = 10/41 (24%), Positives = 13/41 (31%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
P A + P P P P P AP + P +
Sbjct: 129 QPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQA 169
Score = 28.5 bits (64), Expect = 5.6
Identities = 11/45 (24%), Positives = 13/45 (28%), Gaps = 4/45 (8%)
Query: 174 APPAAGASAPAPPPPKVAA----APPPPPPKAAPAPSPTPVPSQK 214
P P PP + A P P AP V + K
Sbjct: 130 PPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPK 174
>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
Length = 466
Score = 31.4 bits (71), Expect = 0.88
Identities = 15/52 (28%), Positives = 21/52 (40%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
AGA AP + AP P +A P TP P++ T + P +
Sbjct: 21 AGAGAPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGP 72
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 31.5 bits (72), Expect = 0.89
Identities = 15/37 (40%), Positives = 15/37 (40%), Gaps = 2/37 (5%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
FK D P P P AA PPPP AA A
Sbjct: 11 FKWD--PRPAVPPPPPRLLPPAAVPPPPLAAAAAARK 45
>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
PatA/PatG family. This model describes a protease
domain associated with the maturation of various members
of the cyanobactin family of ribosomally produced,
heavily modified bioactive metabolites. Members include
the PatA protein and C-terminal domain of the PatG
protein of Prochloron didemni, TenA and a region of TenG
from Nostoc spongiaeforme var. tenue, etc.
Length = 602
Score = 31.6 bits (72), Expect = 0.89
Identities = 17/63 (26%), Positives = 20/63 (31%), Gaps = 9/63 (14%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAA---------PAPSPTPVPSQKTSGGTRVYAS 224
P A+ PAP VAA P AA P P + S G +
Sbjct: 261 VIPPGRAAEPAPVSIPVAAPGEGATPAAAQIELSAGVLPNAISPATPVRPASNGVTPSQA 320
Query: 225 PLA 227
P A
Sbjct: 321 PSA 323
>gnl|CDD|237545 PRK13888, PRK13888, conjugal transfer protein TrbN; Provisional.
Length = 206
Score = 31.0 bits (70), Expect = 0.90
Identities = 11/49 (22%), Positives = 13/49 (26%), Gaps = 2/49 (4%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA--PSPTPVPSQKTSGG 218
D A S P+ P A P P AP + G
Sbjct: 146 DVTKAGATPSTPSQPATVAQRATPAARPAPAPKQAAPAAAAARSLSVAG 194
>gnl|CDD|233980 TIGR02712, urea_carbox, urea carboxylase. Members of this family are
ATP-dependent urea carboxylase, including characterized
members from Oleomonas sagaranensis (alpha class
Proteobacterium) and yeasts such as Saccharomyces
cerevisiae. The allophanate hydrolase domain of the yeast
enzyme is not included in this model and is represented
by an adjacent gene in Oleomonas sagaranensis. The fusion
of urea carboxylase and allophanate hydrolase is
designated urea amidolyase. The enzyme from Oleomonas
sagaranensis was shown to be highly active on acetamide
and formamide as well as urea [Central intermediary
metabolism, Nitrogen metabolism].
Length = 1201
Score = 31.5 bits (72), Expect = 0.96
Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 106 EGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCII 157
GD++ G L +E K M P G + KIL G V G +V ++
Sbjct: 1150 VGDRVEAGQPLVILEAMKMEMPVSAPVAGKVTKILCQPGDM-VDAGDIVAVL 1200
>gnl|CDD|234012 TIGR02784, addA_alphas, double-strand break repair helicase AddA,
alphaproteobacterial type. AddAB, also called RexAB,
substitutes for RecBCD in several bacterial lineages.
These DNA recombination proteins act before synapse and
are particularly important for DNA repair of
double-stranded breaks by homologous recombination. The
term AddAB is used broadly, with AddA homologous between
the alphaproteobacteria (as modeled here) and the
Firmicutes, while the partner AddB proteins show no
strong homology across the two groups of species [DNA
metabolism, DNA replication, recombination, and repair].
Length = 1135
Score = 31.6 bits (72), Expect = 0.99
Identities = 13/66 (19%), Positives = 17/66 (25%), Gaps = 4/66 (6%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP----TPVPSQKTSGGTRVYASPLAKR 229
P A + P AP P P +P + S R A
Sbjct: 918 LPVEAAQTDTLEALPDWLRAPAPAEPALPRPLAPSGLGGAIDSALPGEAVRSSLGTQAPA 977
Query: 230 LAAEKG 235
A +G
Sbjct: 978 FALLRG 983
>gnl|CDD|217047 pfam02455, Hex_IIIa, Hexon-associated protein (IIIa). The major
capsid protein of the adenovirus strain is also known as
a hexon. This is a family of hexon-associated proteins
(protein IIIa).
Length = 491
Score = 31.2 bits (71), Expect = 0.99
Identities = 19/84 (22%), Positives = 30/84 (35%), Gaps = 7/84 (8%)
Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
++ A + K +A S APPP A P P P + S
Sbjct: 339 DDVDSPAELRLLKPEAGEEEDDSLFAPPPS--VAGPSSPAPSLSRYSSVPRSAFDGRLAD 396
Query: 219 TRVYAS-----PLAKRLAAEKGLD 237
+R+ + PLA++ A L+
Sbjct: 397 SRLPGAGDLLAPLAEKNAYNNPLE 420
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 31.5 bits (71), Expect = 1.0
Identities = 15/32 (46%), Positives = 15/32 (46%), Gaps = 5/32 (15%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
AS P PP PPPPPP P P P P
Sbjct: 2 ASLPPGNPP-----PPPPPPGFEPPSQPPPPP 28
Score = 31.5 bits (71), Expect = 1.2
Identities = 14/30 (46%), Positives = 15/30 (50%), Gaps = 8/30 (26%)
Query: 183 PAPPPPKVAAAPPPPP---PKAAPAPSPTP 209
P PPP PPPPP P + P P P P
Sbjct: 6 PGNPPP-----PPPPPGFEPPSQPPPPPPP 30
Score = 31.1 bits (70), Expect = 1.4
Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 4/27 (14%)
Query: 175 PPAAGASAPAPPPP--KVAAAPPPPPP 199
PP G P PPPP + + PPPPPP
Sbjct: 5 PP--GNPPPPPPPPGFEPPSQPPPPPP 29
Score = 30.3 bits (68), Expect = 2.4
Identities = 16/35 (45%), Positives = 17/35 (48%), Gaps = 8/35 (22%)
Query: 185 PPPPKVAAAPPPP---PPKAAPAPSPTPVPSQKTS 216
PPPP PPPP PP P P P V +K S
Sbjct: 9 PPPP-----PPPPGFEPPSQPPPPPPPGVNVKKRS 38
>gnl|CDD|225044 COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate
transport and metabolism].
Length = 399
Score = 31.2 bits (71), Expect = 1.2
Identities = 10/43 (23%), Positives = 14/43 (32%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
A + P AP P P+ P P V + + G
Sbjct: 23 GAFLLAQLQPATTAPQLAPTPKGPRLPPTPFAKGVSVEVVAQG 65
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 31.2 bits (71), Expect = 1.2
Identities = 10/33 (30%), Positives = 12/33 (36%)
Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
PP + P PPPK P P + S
Sbjct: 361 PPTKRRLRLLPVPPPKVQALPLTALAPLVRHSP 393
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 31.2 bits (70), Expect = 1.2
Identities = 23/85 (27%), Positives = 30/85 (35%), Gaps = 16/85 (18%)
Query: 172 DDAPPAAGASAPAPPP--PKVAAAPPPPPPKAAP--------------APSPTPVPSQKT 215
D PP A APP P+V A P P AP P+ P Q
Sbjct: 559 DRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPP 618
Query: 216 SGGTRVYASPLAKRLAAEKGLDLSS 240
S R A + + E+ L+ S+
Sbjct: 619 SSAPRDMAPSVVRMFLRERLLEQST 643
>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
protease mycosin. Members of this family are
subtilisin-related serine proteases, found strictly in
the Actinobacteria and associated with type VII
secretion operons. The designation mycosin is used for
members from Mycobacterium [Protein fate, Protein and
peptide secretion and trafficking, Protein fate, Protein
modification and repair].
Length = 350
Score = 30.8 bits (70), Expect = 1.2
Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 1/36 (2%)
Query: 172 DDAPPAAGASAPAPPPPK-VAAAPPPPPPKAAPAPS 206
+ P PAP P + VAA PPPPP P
Sbjct: 285 ELPPEDGRPLRPAPAPARPVAAPAPPPPPDDTPRGR 320
Score = 30.4 bits (69), Expect = 2.0
Identities = 11/34 (32%), Positives = 12/34 (35%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
+ PP G P P A P PPP P
Sbjct: 284 GELPPEDGRPLRPAPAPARPVAAPAPPPPPDDTP 317
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 30.6 bits (69), Expect = 1.3
Identities = 19/108 (17%), Positives = 32/108 (29%), Gaps = 7/108 (6%)
Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPP----PKAAPAPSPTPVPSQK 214
+ ++ A D A P A A +APP A P ++
Sbjct: 79 GDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARD 138
Query: 215 --TSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKAS 260
T T A+P +R + + +S + + L A
Sbjct: 139 GPTPDPTAQPATP-DERRSPRQRPPVSGEPPTPSTPDAHVAGTLQAAR 185
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 30.6 bits (68), Expect = 1.3
Identities = 17/47 (36%), Positives = 18/47 (38%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
A PA A AP P A P P P PA +PT P R
Sbjct: 89 AAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPAR 135
Score = 30.6 bits (68), Expect = 1.6
Identities = 20/53 (37%), Positives = 23/53 (43%), Gaps = 10/53 (18%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPP----------PKAAPAPSPTPVPSQKTS 216
AP A+AP PPP V AP P P A P P+P P P+ K
Sbjct: 113 APACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAPAAKPI 165
>gnl|CDD|218735 pfam05760, IER, Immediate early response protein (IER). This
family consists of several eukaryotic immediate early
response (IER) 2 and 5 proteins. The role of IER5 is
unclear although it play an important role in mediating
the cellular response to mitogenic signals. Again,
little is known about the function of IER2 although it
is thought to play a role in mediating the cellular
responses to a variety of extracellular signals.
Length = 272
Score = 30.6 bits (69), Expect = 1.4
Identities = 22/114 (19%), Positives = 29/114 (25%), Gaps = 10/114 (8%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
PP AP PP A P P + P P P+ + + +
Sbjct: 68 PPEQQPGAPRLHPPSGAEPPLPARAASPEEEQPAPEPAFYRPSVSEGFPA---------S 118
Query: 235 GLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGV 288
G + GSG T DL V +P V
Sbjct: 119 GERPDATSTGSGQCSQTTVLDLDTHV-VTTVESPYLHQDCPCQAEDGQPAPPAV 171
>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
Length = 423
Score = 30.8 bits (70), Expect = 1.4
Identities = 11/71 (15%), Positives = 15/71 (21%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
D PP AA P P P +++ A+
Sbjct: 190 VEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIADAKKAA 249
Query: 225 PLAKRLAAEKG 235
A K
Sbjct: 250 TGEPAANAAKA 260
>gnl|CDD|217214 pfam02749, QRPTase_N, Quinolinate phosphoribosyl transferase,
N-terminal domain. Quinolinate phosphoribosyl
transferase (QPRTase) or nicotinate-nucleotide
pyrophosphorylase EC:2.4.2.19 is involved in the de novo
synthesis of NAD in both prokaryotes and eukaryotes. It
catalyzes the reaction of quinolinic acid with
5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence
of Mg2+ to give rise to nicotinic acid mononucleotide
(NaMN), pyrophosphate and carbon dioxide. The QA
substrate is bound between the C-terminal domain of one
subunit, and the N-terminal domain of the other. The
N-terminal domain has an alpha/beta hammerhead fold.
Length = 88
Score = 28.6 bits (65), Expect = 1.4
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 100 VSWAKKEGDKLNEGDLLAEIE 120
V W K+G+++ GD++ EIE
Sbjct: 47 VEWLVKDGERVEAGDVILEIE 67
>gnl|CDD|236449 PRK09282, PRK09282, pyruvate carboxylase subunit B; Validated.
Length = 592
Score = 31.0 bits (71), Expect = 1.4
Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 89 ALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
A++ M GT+V KEGDK+ GD + +E K + P +G + +ILV G
Sbjct: 524 AVTSPMP-GTVVKVKVKEGDKVKAGDTVLVLEAMKMENEIQAPVDGTVKEILVKEG 578
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 31.2 bits (70), Expect = 1.4
Identities = 17/81 (20%), Positives = 27/81 (33%), Gaps = 2/81 (2%)
Query: 207 PTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVA 266
P P S GG P + + +D+ S L + + + +
Sbjct: 467 PEPDVSCVEKGGLLEKIEPEPSLIIGDSDIDMG--HTDSHLLDQFNTGGTTFQTASTPAP 524
Query: 267 APSKSAKPTANGPFTDLPVSG 287
S A PTA G + P +G
Sbjct: 525 NASGQATPTATGVASPQPPAG 545
Score = 29.2 bits (65), Expect = 5.6
Identities = 25/89 (28%), Positives = 32/89 (35%), Gaps = 7/89 (7%)
Query: 175 PPAAGASAPAPPPPKVAAAPPP---PPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
P AG S PP P A P A A P K G R AS A + A
Sbjct: 206 PQMAGQSMYQPPGPYPNAMVGRQPFYPQPGAVAGPPKRRGGHKAPRGHR--ASQAAGQAA 263
Query: 232 AEKG-LDLSSIGAGSGLFGSITSADLSKA 259
+ L+ + G L +T ++SK
Sbjct: 264 SLDFALEDEDVSRGD-LMDLLTPREVSKM 291
>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
Length = 1250
Score = 31.2 bits (70), Expect = 1.4
Identities = 22/75 (29%), Positives = 32/75 (42%), Gaps = 4/75 (5%)
Query: 143 AGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
AGS V +G +VE+ +D + ++ PP+ P P P P P P
Sbjct: 890 AGSGYVNMGAYDYTLVEDNND--WYLRSQEVTPPSPPDPDPTPDPD--PTPDPDPTPDPE 945
Query: 203 PAPSPTPVPSQKTSG 217
P P+ PV + K G
Sbjct: 946 PTPAYQPVLNAKVGG 960
>gnl|CDD|218950 pfam06236, MelC1, Tyrosinase co-factor MelC1. This family consists
of several tyrosinase co-factor MELC1 proteins from a
number of Streptomyces species. The melanin operon
(melC) of Streptomyces antibioticus contains two genes,
melC1 and melC2 (apotyrosinase). It is thought that
MelC1 forms a transient binary complex with the
downstream apotyrosinase MelC2 to facilitate the
incorporation of copper ion and the secretion of
tyrosinase indicating that MelC1 is a chaperone for the
apotyrosinase MelC2.
Length = 124
Score = 29.5 bits (66), Expect = 1.5
Identities = 15/51 (29%), Positives = 18/51 (35%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
AAG +A AA P P AP P + G R+ P A
Sbjct: 10 AAGLAAGGEAAAAPDAAAHPGPSTGRGAPGGAPASFDEVYKGRRIQGGPSA 60
>gnl|CDD|216534 pfam01499, Herpes_UL25, Herpesvirus UL25 family. The herpesvirus
UL25 gene product is a virion component involved in
virus penetration and capsid assembly. The product of
the UL25 gene is required for packaging but not cleavage
of replicated viral DNA. This family includes a number
of herpesvirus proteins: EHV-1 36, EBV BVRF1, HCMV UL77,
ILTV ORF2, and VZV gene 34.
Length = 538
Score = 30.8 bits (70), Expect = 1.5
Identities = 9/37 (24%), Positives = 11/37 (29%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
P +A P AA P +AAP
Sbjct: 88 DPLDLLETAARAAEPADAAQPDATAGEAAPGEGVGGT 124
>gnl|CDD|225401 COG2845, COG2845, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 354
Score = 30.5 bits (69), Expect = 1.6
Identities = 12/40 (30%), Positives = 17/40 (42%), Gaps = 1/40 (2%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPP-PPPPKAAPAPSPT 208
F+ PP P+ P+ +AA P P A P+ T
Sbjct: 69 FRQGFPPPIAKVGPSLDMPRASAASEVAPRPGATAEPATT 108
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 30.5 bits (69), Expect = 1.6
Identities = 12/46 (26%), Positives = 18/46 (39%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
+ + P ++ S P PP + PPPP P P P +
Sbjct: 153 SDLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPPPPPPPPPPPSL 198
>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
partitioning].
Length = 264
Score = 30.2 bits (68), Expect = 1.6
Identities = 11/44 (25%), Positives = 15/44 (34%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
+ + A + P P P APAP+P PV
Sbjct: 128 RTQSQKAQSQATTVQTQPVKPKPRPEKPQPVAPAPAPEPVEKAP 171
Score = 28.7 bits (64), Expect = 6.1
Identities = 19/59 (32%), Positives = 24/59 (40%), Gaps = 2/59 (3%)
Query: 175 PPAAGASAPAPPPPKVA--AAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
P A APAP P + A A PPP A + T Q + T A + +LA
Sbjct: 154 KPQPVAPAPAPEPVEKAPKAEAAPPPKPKAEDAAETRYMLQCGAFKTAEQAESVRAQLA 212
>gnl|CDD|236098 PRK07789, PRK07789, acetolactate synthase 1 catalytic subunit;
Validated.
Length = 612
Score = 30.7 bits (70), Expect = 1.7
Identities = 15/33 (45%), Positives = 18/33 (54%)
Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
P PP AAA PP AAPA P V ++ +G
Sbjct: 1 PSPPTPAAAASAAPPPAAPAARPRIVAPERMTG 33
Score = 29.6 bits (67), Expect = 3.7
Identities = 12/32 (37%), Positives = 13/32 (40%)
Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
PP AAA PPP A A P + T
Sbjct: 1 PSPPTPAAAASAAPPPAAPAARPRIVAPERMT 32
>gnl|CDD|237873 PRK14969, PRK14969, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 527
Score = 30.5 bits (69), Expect = 1.7
Identities = 16/62 (25%), Positives = 19/62 (30%), Gaps = 7/62 (11%)
Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP-------VPSQKTSGGTRV 221
+ A A GA V A P P A P+ + V K SG R
Sbjct: 362 EQDSGAGRAPGAVPGVNRTRAVPTAAPAAPAAATPSDRQSFNGNWRALVDQLKLSGLARE 421
Query: 222 YA 223
A
Sbjct: 422 LA 423
>gnl|CDD|223583 COG0509, GcvH, Glycine cleavage system H protein (lipoate-binding)
[Amino acid transport and metabolism].
Length = 131
Score = 29.2 bits (66), Expect = 1.7
Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 3/60 (5%)
Query: 78 VREGNDRVALPALSPTM--EMGTIVSW-AKKEGDKLNEGDLLAEIETDKATMGFETPEEG 134
VR D A ++ ++G IV + G ++ G+ LA +E+ KA P G
Sbjct: 18 VRVEGDGTATVGITDYAQDQLGDIVFVELPEVGAEVKAGESLAVVESVKAASDVYAPVSG 77
>gnl|CDD|235709 PRK06132, PRK06132, hypothetical protein; Provisional.
Length = 359
Score = 30.4 bits (69), Expect = 1.8
Identities = 14/39 (35%), Positives = 15/39 (38%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
AP A AS A A AP P A A +PT
Sbjct: 187 PKAAPAPAPASDAAANGRPAAPAPAPAEAAWADADAPTG 225
>gnl|CDD|236365 PRK09041, motB, flagellar motor protein MotB; Validated.
Length = 317
Score = 30.3 bits (69), Expect = 1.9
Identities = 13/59 (22%), Positives = 20/59 (33%), Gaps = 7/59 (11%)
Query: 149 PIGKLVCIIVEN-------ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPK 200
P+ + + I+V N + A P + APA P A P P+
Sbjct: 259 PVNRRISIVVLNKKAEEAILHENAESVALSAQNEPVSALEAPAAAPAASVPAAPAAEPR 317
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 30.4 bits (68), Expect = 1.9
Identities = 14/55 (25%), Positives = 17/55 (30%), Gaps = 4/55 (7%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAP---SPTPVP-SQKTSGGTRVYASPL 226
P A AP PP+ P P +P+P P S A P
Sbjct: 226 PVATNRGEAPSPPRTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPD 280
>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 755
Score = 30.7 bits (69), Expect = 1.9
Identities = 18/46 (39%), Positives = 24/46 (52%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
+P +A ++ P+ P P AAA PPPP A P P P+ G T
Sbjct: 145 SPKSALSAEPSHPVPAPAAASAPPPPPRAARPVRQPAPAPAAPGDT 190
>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486). This
family is made up of members from various Burkholderia
spp. The function is unknown.
Length = 245
Score = 29.9 bits (67), Expect = 2.0
Identities = 32/157 (20%), Positives = 49/157 (31%), Gaps = 12/157 (7%)
Query: 142 PAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAP-PPPKVAAAPPPPPPK 200
P P+ + ++ AA +A A P P P V A P
Sbjct: 34 PHDDAAEPVLTDQIVPGAEQAASAAPVHAAREATADPEFVAVEPVPTPHVPAVALPGDTD 93
Query: 201 AAPAPSPTP-VPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKA 259
A P P V +++ + S LA ++ AG + D +
Sbjct: 94 APAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPAGATAADAGD------AAPDATPP 147
Query: 260 SKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRL 296
+ A + A +A TDL IA+RL
Sbjct: 148 AAGDASPPAAAQAAASAAAALTDLDAQ----HIAERL 180
>gnl|CDD|223067 PHA03380, PHA03380, transactivating tegument protein VP16;
Provisional.
Length = 432
Score = 30.4 bits (69), Expect = 2.1
Identities = 8/43 (18%), Positives = 12/43 (27%), Gaps = 3/43 (6%)
Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPP---PPPPKAAPA 204
DF + + P A + PPP+ A
Sbjct: 7 DFIADFDESLLGPPESETAPAPYATSAPSSAQLALPPPRPASP 49
>gnl|CDD|233273 TIGR01108, oadA, oxaloacetate decarboxylase alpha subunit. This
model describes the bacterial oxaloacetate decarboxylase
alpha subunit and its equivalents in archaea. The
oxaloacetate decarboxylase Na+ pump is the paradigm of
the family of Na+ transport decarboxylases that present
in bacteria and archaea. It a multi subunit enzyme
consisting of a peripheral alpha-subunit and integral
membrane subunits beta and gamma. The energy released by
the decarboxylation reaction of oxaloacetate is coupled
to Na+ ion pumping across the membrane [Transport and
binding proteins, Cations and iron carrying compounds,
Energy metabolism, Other].
Length = 582
Score = 30.2 bits (68), Expect = 2.2
Identities = 15/50 (30%), Positives = 23/50 (46%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
G+IV EG + EG++L +E K + G + +ILV G
Sbjct: 526 GSIVKVKVSEGQTVAEGEVLLILEAMKMETEIKAAAAGTVREILVKVGDA 575
>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp).
Length = 305
Score = 30.1 bits (68), Expect = 2.2
Identities = 13/23 (56%), Positives = 14/23 (60%)
Query: 195 PPPPPKAAPAPSPTPVPSQKTSG 217
P P A P+PSPTPVPS G
Sbjct: 86 PSPTTVATPSPSPTPVPSSPAVG 108
>gnl|CDD|220732 pfam10401, IRF-3, Interferon-regulatory factor 3. This is the
interferon-regulatory factor 3 chain of the
hetero-dimeric structure which also contains the shorter
chain CREB-binding protein. These two subunits make up
the DRAF1 (double-stranded RNA-activated factor 1).
Viral dsRNA produced during viral transcription or
replication leads to the activation of DRAF1. The
DNA-binding specificity of DRAF1 correlates with
transcriptional induction of ISG
(interferon-alpha,beta-stimulated gene). IRF-3 preexists
in the cytoplasm of uninfected cells and translocates to
the nucleus following viral infection. Translocation of
IRF-3 is accompanied by an increase in serine and
threonine phosphorylation, and association with the CREB
coactivator occurs only after infection.
Length = 180
Score = 29.5 bits (67), Expect = 2.3
Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 10/69 (14%)
Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSS 240
S PP P PSP +P +K TR L K L E+GL L
Sbjct: 29 SYGPSPPQGHLCGPGTLEQ--VLFPSPEGIPDEKQREYTRR----LLKHL--ERGLLLEV 80
Query: 241 IGAGSGLFG 249
G G++
Sbjct: 81 --NGQGIYA 87
>gnl|CDD|217428 pfam03213, Pox_P35, Poxvirus P35 protein.
Length = 325
Score = 30.0 bits (68), Expect = 2.4
Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 5/76 (6%)
Query: 143 AGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
K +PI ++ ++ S+V D K A PPK + AP P P K
Sbjct: 3 PPDKKIPI-YVIPVVGRPPSEV--IPDLKSL-VIEADEVDDVKDPPKPSPAPAPLP-KPK 57
Query: 203 PAPSPTPVPSQKTSGG 218
P PSP+ Q SGG
Sbjct: 58 PDPSPSYKYYQWKSGG 73
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 30.3 bits (68), Expect = 2.4
Identities = 12/33 (36%), Positives = 14/33 (42%)
Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
S P P + PP PKAA S P P +
Sbjct: 560 SEPGTLPYGSSVQTPPDRPKAATKGSRKPSPRK 592
>gnl|CDD|197828 smart00674, CENPB, Putative DNA-binding domain in centromere
protein B, mouse jerky and transposases.
Length = 66
Score = 27.6 bits (62), Expect = 2.4
Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 10/60 (16%)
Query: 304 KLREQMNKAL-----EKRGAKLSINDFIIKATALASRRV-----PEANSSWQDTFIREYH 353
+L + KAL + + I+ I+ AL + +A+ W F + ++
Sbjct: 2 RLYPLLEKALYEWILRQEALGIPISGEDIREKALEILQRLGLENFKASKGWLTRFKKRHN 61
>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP. This family of
genes is always found in type III secretion operons,
althought its function in the processes of secretion and
virulence is unclear. Hpa stands for Hrp-associated
gene, where Hrp stands for hypersensitivity response and
virulence.
Length = 201
Score = 29.5 bits (66), Expect = 2.4
Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 13/55 (23%)
Query: 174 APPAAGASAPAPPP-------------PKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
A A A A P +A APPP P P P P T
Sbjct: 5 ARLARPAPADPARPARRRTPLAQLRRRDALAYAPPPRPEPPPPCDEDRPEPRADT 59
>gnl|CDD|146531 pfam03948, Ribosomal_L9_C, Ribosomal protein L9, C-terminal domain.
Length = 86
Score = 27.8 bits (63), Expect = 2.5
Identities = 14/42 (33%), Positives = 24/42 (57%), Gaps = 5/42 (11%)
Query: 227 AKRLAAE-KGLDLS---SIGAGSGLFGSITSADLSKA-SKAG 263
A+ LA + +GL ++ G LFGS+T+ D+++A G
Sbjct: 4 AEALAEKLEGLTVTIKAKAGEDGKLFGSVTTKDIAEALKAQG 45
>gnl|CDD|221009 pfam11162, DUF2946, Protein of unknown function (DUF2946). This
family of proteins has no known function.
Length = 119
Score = 28.6 bits (64), Expect = 2.6
Identities = 11/46 (23%), Positives = 14/46 (30%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
A PA P A P P P +P + + A P
Sbjct: 74 AHTPALPAALPLLLALVRLAAAVPPPPPASLPPRSRWPSAQPRAPP 119
>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
Provisional.
Length = 170
Score = 29.3 bits (66), Expect = 2.6
Identities = 8/43 (18%), Positives = 17/43 (39%), Gaps = 2/43 (4%)
Query: 172 DDAPPAAGASAPAPPPPKVAAA--PPPPPPKAAPAPSPTPVPS 212
+ +P ++ + + ++A P P PS VP+
Sbjct: 14 NSSPSSSSSGKSSNSSKSASSAVNPASVVAGYEPLPSFRDVPN 56
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 29.9 bits (67), Expect = 2.6
Identities = 21/127 (16%), Positives = 35/127 (27%), Gaps = 7/127 (5%)
Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
+D+A + P AS P PPP + P K P + +
Sbjct: 287 AGETADLADDDGSEHSDPEPLPASLP-PPPVRRPRVKHPEAGKEEPDGARNAEAKEPAQP 345
Query: 218 GTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
T + + + S G G + S+ L + S + +
Sbjct: 346 ATSTSSK------GSSSAQNKDSGSTGPGSSLAAASSFLEDDDFGSPPLDLTTSLRHMPS 399
Query: 278 GPFTDLP 284
T P
Sbjct: 400 PSVTSAP 406
Score = 28.3 bits (63), Expect = 9.1
Identities = 7/27 (25%), Positives = 8/27 (29%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTP 209
P P P +AP P P
Sbjct: 386 PPLDLTTSLRHMPSPSVTSAPEPPSIP 412
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 29.2 bits (64), Expect = 2.6
Identities = 15/88 (17%), Positives = 22/88 (25%), Gaps = 7/88 (7%)
Query: 124 ATMGFETPEEGYLAKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAP 183
A +GF + K + G G V N +D DD P P
Sbjct: 20 AGVGFAAAADEPADKGITKGGKGKGEKGTKVGADDTNGTD-------PDDEPEEEEEPEP 72
Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVP 211
+ + P + P
Sbjct: 73 EEEGEEEPEPEEEGEEEPEPEETGEEEP 100
>gnl|CDD|234404 TIGR03931, T7SS_Rv3446c, type VII secretion-associated protein,
Rv3446c family, C-terminal domain. Members of this
protein family occur as part of the ESX-4 cluster of
type VII secretion system (T7SS) proteins in
Mycobacterium tuberculosis and in similar T7SS clusters
in other Actinobacteria genera, including
Corynebacterium, Nocardia, Rhodococcus, and
Saccharopolyspora. This model describes the
better-conserved C-terminal region [Protein fate,
Protein and peptide secretion and trafficking].
Length = 172
Score = 29.4 bits (66), Expect = 2.8
Identities = 12/39 (30%), Positives = 13/39 (33%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
AP A A A S TP P G TR+
Sbjct: 2 GAPSGDTPKAKVRESSTAQATAESTTPAPPPPRPGVTRL 40
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 29.4 bits (66), Expect = 2.8
Identities = 18/107 (16%), Positives = 28/107 (26%), Gaps = 3/107 (2%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT-RVYASPLAKRLA 231
P A P + P PP PAP+P + + + A P AK +
Sbjct: 103 SEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPAKPRAK--S 160
Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANG 278
+ + + A A P+K
Sbjct: 161 PRPAAKAAPKPTETITAKKAKKTAAAPKPTADKTATPAKKTTKKKKT 207
Score = 28.6 bits (64), Expect = 4.5
Identities = 18/77 (23%), Positives = 23/77 (29%), Gaps = 2/77 (2%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP-TPVPSQK-TSGG 218
+ P A P AAA PKA P +P P+ K
Sbjct: 112 AEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPAKPRAKSPRPAAKAAPKP 171
Query: 219 TRVYASPLAKRLAAEKG 235
T + AK+ AA
Sbjct: 172 TETITAKKAKKTAAAPK 188
>gnl|CDD|216245 pfam01015, Ribosomal_S3Ae, Ribosomal S3Ae family.
Length = 195
Score = 29.4 bits (67), Expect = 2.8
Identities = 8/36 (22%), Positives = 19/36 (52%)
Query: 291 VIAKRLLQSKQVIKLREQMNKALEKRGAKLSINDFI 326
K+ Q+ Q +R++M + + K ++L +F+
Sbjct: 123 AFTKKRAQTSQKRAIRKKMTEIITKEASELDFKEFV 158
>gnl|CDD|165416 PHA03145, PHA03145, helicase-primase primase subunit; Provisional.
Length = 1058
Score = 30.0 bits (67), Expect = 2.9
Identities = 19/56 (33%), Positives = 24/56 (42%), Gaps = 3/56 (5%)
Query: 169 DFKDDAPPAAGASAPAPPPPKVA---AAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
D +DD P A A A + A A PP +AAPA +P + G RV
Sbjct: 707 DEEDDEDPDAEALAHGGVDEEAAYLDALHEQPPAEAAPAEPESPRCTCADKIGLRV 762
>gnl|CDD|233219 TIGR00976, /NonD, putative hydrolase, CocE/NonD family. This model
represents a protein subfamily that includes the cocaine
esterase CocE, several glutaryl-7-ACA acylases, and the
putative diester hydrolase NonD of Streptomyces griseus
(all hydrolases). This family shows extensive, low-level
similarity to a family of xaa-pro dipeptidyl-peptidases,
and local similarity by PSI-BLAST to many other
hydrolases [Unknown function, Enzymes of unknown
specificity].
Length = 550
Score = 29.8 bits (67), Expect = 3.0
Identities = 13/53 (24%), Positives = 16/53 (30%), Gaps = 1/53 (1%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
+ GA + APP P A P P PS S + R
Sbjct: 356 SGGALSTAPPSPT-EGADPYLYDPWDPVPSLGGQLSGAGGPAGPAKQDRIDDR 407
>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
Length = 226
Score = 29.5 bits (66), Expect = 3.0
Identities = 11/28 (39%), Positives = 12/28 (42%)
Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVP 211
A P PK A PPP + P P P
Sbjct: 187 ASPQPKSLATPPPVAAPSRRTPRPRRKP 214
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 29.6 bits (67), Expect = 3.0
Identities = 12/65 (18%), Positives = 24/65 (36%), Gaps = 4/65 (6%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT----SGGTRVYASPLAKR 229
P G+ + P ++P PP + + TP P+ + + R A ++
Sbjct: 299 EPEPTGSVSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRK 358
Query: 230 LAAEK 234
+K
Sbjct: 359 REGDK 363
>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
Provisional.
Length = 450
Score = 29.7 bits (67), Expect = 3.0
Identities = 16/65 (24%), Positives = 21/65 (32%), Gaps = 5/65 (7%)
Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPK--VAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
+ A +D A PAA A A AA P + +P + G
Sbjct: 1 SMEAPATEDAGRPAAPAASDDAAATDGAGANAAAPAAPAESDPSKKRPGVMLPLEV---G 57
Query: 219 TRVYA 223
TRV
Sbjct: 58 TRVMC 62
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 30.0 bits (67), Expect = 3.0
Identities = 18/47 (38%), Positives = 22/47 (46%), Gaps = 11/47 (23%)
Query: 174 APPAAGAS----APAPPPPKVAAAP-------PPPPPKAAPAPSPTP 209
PAAG + APA P P A +P PP PP++ P P P
Sbjct: 408 TAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAP 454
Score = 29.6 bits (66), Expect = 3.2
Identities = 31/125 (24%), Positives = 43/125 (34%), Gaps = 37/125 (29%)
Query: 176 PAAGASAPAPPPPKV--------------AAAPPPPPPKAAPAPSP--TPVPSQKTSGGT 219
PA+G +A P P +AAP P P AAP+P P G
Sbjct: 389 PASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGI 448
Query: 220 RVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGA---VAAPSKSAKPTA 276
+P + S + A S AS + A + PS +A+ T
Sbjct: 449 PPRPAP--------------RMPEAS----PVPGAPDSVASASDAPPTLGDPSDTAEHTP 490
Query: 277 NGPFT 281
+GP T
Sbjct: 491 SGPRT 495
Score = 29.6 bits (66), Expect = 3.6
Identities = 25/113 (22%), Positives = 34/113 (30%), Gaps = 4/113 (3%)
Query: 174 APPAAGASAPAPPPPKVAAAPPP---PPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
A +G++A P A P P P APA TP + + SP
Sbjct: 378 ASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWD 437
Query: 231 AAEKGLDLSSIGA-GSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTD 282
A S I + + + S A A AP P+ T
Sbjct: 438 DAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTP 490
>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
Provisional.
Length = 445
Score = 29.8 bits (67), Expect = 3.0
Identities = 18/51 (35%), Positives = 22/51 (43%), Gaps = 10/51 (19%)
Query: 173 DAPPAAGASAPAPPPPKVA--AAPPP--PPPKAAPAPSPTPVPSQKTSGGT 219
D PP P PPPP VA P P + PA +P S +T+G
Sbjct: 128 DVPP------PPPPPPVVAKRVETPAVVAPRVSEPARNPFKTESNRTTGVI 172
>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 702
Score = 30.0 bits (67), Expect = 3.0
Identities = 15/57 (26%), Positives = 22/57 (38%)
Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
+ + V+A + + + PA P P P P P P+ P P P P P
Sbjct: 389 AQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPQPN 445
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 29.5 bits (66), Expect = 3.1
Identities = 12/33 (36%), Positives = 13/33 (39%)
Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
S P PPP + A P PP P P Q
Sbjct: 126 SQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQ 158
Score = 28.8 bits (64), Expect = 5.7
Identities = 13/46 (28%), Positives = 16/46 (34%), Gaps = 2/46 (4%)
Query: 174 APPAAGASAPAPPPPKVAAAP--PPPPPKAAPAPSPTPVPSQKTSG 217
AP +G P + P P P A P PSQ+ G
Sbjct: 173 APQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYG 218
>gnl|CDD|234068 TIGR02946, acyl_WS_DGAT, acyltransferase, WS/DGAT/MGAT. This
bacteria-specific protein family includes a
characterized, homodimeric, broad specificity
acyltransferase from Acinetobacter sp. strain ADP1,
active as wax ester synthase, as acyl coenzyme
A:diacylglycerol acyltransferase, and as
acyl-CoA:monoacylglycerol acyltransferase [Unknown
function, Enzymes of unknown specificity].
Length = 446
Score = 29.6 bits (67), Expect = 3.2
Identities = 40/144 (27%), Positives = 59/144 (40%), Gaps = 26/144 (18%)
Query: 194 PPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITS 253
P PPP AP P P P SG S L +R+A S G + +
Sbjct: 150 DPDPPPLPAPPPPPQPSTRGLLSGALSGLPSALLRRVA-------------STAPGVVRA 196
Query: 254 ADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQVIKLREQMNKAL 313
A AV ++SA+P PFT P + + G I+++ + Q + L +
Sbjct: 197 A-------GRAVEGVARSARPAL--PFTA-PPTPLNGPISRKRRFAAQSLPL-ADVKAVA 245
Query: 314 EKRGAKLSINDFIIKATALASRRV 337
+ G +IND ++ A A A RR
Sbjct: 246 KAFGV--TINDVVLAAVAGALRRY 267
>gnl|CDD|204676 pfam11574, DUF3235, Protein of unknown function (DUF3235). Some
members in this family of proteins with unknown function
are annotated as RpfA however this cannot be confirmed.
Length = 90
Score = 27.7 bits (61), Expect = 3.2
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 184 APPPPKVAAAPPPPPPKAAPA 204
AP P + A P P P AAPA
Sbjct: 4 APTPRDLVANPKPAPAAAAPA 24
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 29.9 bits (67), Expect = 3.3
Identities = 20/86 (23%), Positives = 24/86 (27%), Gaps = 18/86 (20%)
Query: 159 ENESDVAAFKDFKDDAPPAA----GASAPAPPPPKVAAAPP-----------PPPPKAAP 203
+ V D P G+S P P VAAA P P A
Sbjct: 327 LAPTAVCTDAGLLPDTPLLPDAPEGSSDPVVP---VAAATPVDASLPDVRTGTAPTSLAN 383
Query: 204 APSPTPVPSQKTSGGTRVYASPLAKR 229
P +Q T T A+P
Sbjct: 384 VSHADPAVAQPTQAATLAGAAPKGAT 409
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 29.8 bits (67), Expect = 3.4
Identities = 26/116 (22%), Positives = 42/116 (36%), Gaps = 12/116 (10%)
Query: 159 ENESDVAAFKD-------FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
E++ D+ F D K P + P PK ++ P ++ P
Sbjct: 29 EDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRK 88
Query: 212 SQKTSGGTRVY-----ASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKA 262
S +++ + + E ++SIG L+GSITS S AS A
Sbjct: 89 SAESTRSSHPKSKAPSTESEEEEEPEETPDPIASIGGWWSLWGSITSTATSTASAA 144
>gnl|CDD|227111 COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit
[Lipid metabolism].
Length = 645
Score = 29.6 bits (67), Expect = 3.4
Identities = 20/48 (41%), Positives = 28/48 (58%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
GT+VS A KEG +++ GDLL +E K P +G +AK+ V G
Sbjct: 584 GTVVSVAVKEGQEVSAGDLLVVLEAMKMENTLRAPRDGVVAKLAVAEG 631
>gnl|CDD|225501 COG2951, MltB, Membrane-bound lytic murein transglycosylase B [Cell
envelope biogenesis, outer membrane].
Length = 343
Score = 29.6 bits (67), Expect = 3.4
Identities = 21/103 (20%), Positives = 31/103 (30%), Gaps = 9/103 (8%)
Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP--SPTPVPSQKT 215
++ + AA AA A P+V A P P P S + +
Sbjct: 45 LQGFAPEAAAAGISAAVLDAAFAG--VSYDPRVIARDRPQPEFRQPVAEFSARRISEARV 102
Query: 216 SGGTRVYASPLA--KRLAAEKGLDLSSIGAGSGL---FGSITS 253
G A A R G+ + A G+ FG +
Sbjct: 103 QQGRGFPAQYAAALARAERRYGVPAPILVAIWGMETGFGRVMG 145
>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is about 100
amino acids in length. This domain is found associated
with pfam00096. This domain has two conserved sequence
motifs: YPSPV and PSP.
Length = 100
Score = 27.8 bits (61), Expect = 3.5
Identities = 15/52 (28%), Positives = 23/52 (44%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
A AA S +P +++P P P+PVP+ +S + Y SP
Sbjct: 1 ADKAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSP 52
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 29.7 bits (66), Expect = 3.5
Identities = 17/83 (20%), Positives = 31/83 (37%), Gaps = 4/83 (4%)
Query: 162 SDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP----VPSQKTSG 217
S ++ F D+P A + +PP + + P P +P+ SP PS S
Sbjct: 576 SAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASHLGSPSTTPSS 635
Query: 218 GTRVYASPLAKRLAAEKGLDLSS 240
+ + E + ++S
Sbjct: 636 PESSIKVASTETASPESSIKVAS 658
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 29.6 bits (67), Expect = 3.6
Identities = 22/75 (29%), Positives = 25/75 (33%), Gaps = 11/75 (14%)
Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG----------TRVYASP-LAKRLA 231
PAPP P PPPPPP P Q + R P L RLA
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEIVLEAVRAALPPDLLARLA 318
Query: 232 AEKGLDLSSIGAGSG 246
A + G+G
Sbjct: 319 AGGARARARSAGGAG 333
>gnl|CDD|135173 PRK04654, PRK04654, sec-independent translocase; Provisional.
Length = 214
Score = 29.0 bits (64), Expect = 3.8
Identities = 17/57 (29%), Positives = 22/57 (38%), Gaps = 2/57 (3%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAA--PAPSPTPVPSQKTSGGTRVYASPLA 227
A PAP P A P P P PAP T VP+ + A+P++
Sbjct: 133 PGAGQAHTPVPAPAPVIAQAQPIAPAPHQTLVPAPHDTIVPAPHAAHLPSAPATPVS 189
Score = 28.6 bits (63), Expect = 4.6
Identities = 16/43 (37%), Positives = 18/43 (41%), Gaps = 3/43 (6%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKA---APAPSPTPVPS 212
AAGA A AP P +A APAP T VP+
Sbjct: 124 QVDAAAGAEPGAGQAHTPVPAPAPVIAQAQPIAPAPHQTLVPA 166
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 29.1 bits (65), Expect = 3.8
Identities = 13/48 (27%), Positives = 18/48 (37%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
A A P K A A A+PA +P P+ K + + S
Sbjct: 159 AVSEPAVEPKAAKTATATEAKVQTASPAQTPATPPAGKGAAASGQLKS 206
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 29.6 bits (66), Expect = 3.8
Identities = 13/55 (23%), Positives = 15/55 (27%), Gaps = 1/55 (1%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
D P P P + PP P+ P P S GT P
Sbjct: 379 ADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLV-SPYNPQSPGTSYGPEP 432
Score = 28.4 bits (63), Expect = 9.3
Identities = 22/90 (24%), Positives = 25/90 (27%), Gaps = 16/90 (17%)
Query: 142 PAGSKDVPIGKLVCIIVENESDVAAFKDF---------KDDAPPAAGASAPAPPPPKVAA 192
PA L + A A S PAP P A+
Sbjct: 568 PAFVHSFTSTTLAAAAGQGSDTAEALAGAIETLLTQASAQPAGL----SLPAPAVPVNAS 623
Query: 193 APPPPPPKAAPAPSPTPV---PSQKTSGGT 219
P PP AP P P PS +TS
Sbjct: 624 TPASTPPPLAPQEPPQPGTSAPSLETSLPQ 653
>gnl|CDD|236063 PRK07586, PRK07586, hypothetical protein; Validated.
Length = 514
Score = 29.4 bits (67), Expect = 3.8
Identities = 14/39 (35%), Positives = 16/39 (41%), Gaps = 4/39 (10%)
Query: 172 DDAPPA----AGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
+DA A A A P P +AA PP P A P
Sbjct: 302 EDAAAALEALADALGAKPAAPPLAAPARPPLPTGALTPE 340
>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
Length = 866
Score = 29.4 bits (66), Expect = 3.8
Identities = 15/39 (38%), Positives = 16/39 (41%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
+ PPA ASAPAP A P A P P P
Sbjct: 15 EATPPAPAASAPAPAAAAPAPVAAAAPAAAGPRPDDEPF 53
Score = 28.3 bits (63), Expect = 9.7
Identities = 14/40 (35%), Positives = 17/40 (42%)
Query: 167 FKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
F + P A+ PAP A A P P AA AP+
Sbjct: 3 FATDAKEEPAKEEATPPAPAASAPAPAAAAPAPVAAAAPA 42
>gnl|CDD|173798 cd05562, Peptidases_S53_like, Peptidase domain in the S53 family.
Members of the peptidase S53 (sedolisin) family include
endopeptidases and exopeptidases. The S53 family
contains a catalytic triad Glu/Asp/Ser with an
additional acidic residue Asp in the oxyanion hole,
similar to that of Asn in subtilisin. The stability of
these enzymes may be enhanced by calcium, some members
have been shown to bind up to 4 ions via binding sites
with different affinity. Some members of this clan
contain disulfide bonds. These enzymes can be intra- and
extracellular, some function at extreme temperatures and
pH values. Characterized sedolisins include Kumamolisin,
an extracellular calcium-dependent thermostable
endopeptidase from Bacillus. The enzyme is synthesized
with a 188 amino acid N-terminal preprotein region which
is cleaved after the extraction into the extracellular
space with low pH. One kumamolysin paralog,
kumamolisin-As, is believed to be a collagenase. TPP1 is
a serine protease that functions as a tripeptidyl
exopeptidase as well as an endopeptidase. Less is known
about PSCP from Pseudomonas which is thought to be an
aspartic proteinase.
Length = 275
Score = 29.2 bits (66), Expect = 3.8
Identities = 28/93 (30%), Positives = 36/93 (38%), Gaps = 12/93 (12%)
Query: 190 VAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR---LAAEKGLDLSSIGAGSG 246
V A P P+P PS G R +P ++ + A G++ + G G G
Sbjct: 151 VGAVDYGNTPAFGSDPAPGGTPSSFDPVGIR-LPTPEVRQKPDVTAPDGVNGTVDGDGDG 209
Query: 247 L---FGSITSADLSKASKAGAVAAPSKSAKPTA 276
FG TSA A A VAA SA P
Sbjct: 210 PPNFFG--TSA---AAPHAAGVAALVLSANPGL 237
>gnl|CDD|222843 PHA02030, PHA02030, hypothetical protein.
Length = 336
Score = 29.2 bits (65), Expect = 4.0
Identities = 15/63 (23%), Positives = 19/63 (30%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
A P A A + P V AA A P V A+P A + A
Sbjct: 268 AVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLPDVPQVAPVAAPAAPEVPAV 327
Query: 234 KGL 236
+
Sbjct: 328 PVV 330
Score = 28.4 bits (63), Expect = 7.9
Identities = 15/68 (22%), Positives = 19/68 (27%), Gaps = 1/68 (1%)
Query: 144 GSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAP 203
G +D+ I N V A A +A A + P P P
Sbjct: 249 GGEDLIIKPKSKAAGSNLPAVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLP 308
Query: 204 -APSPTPV 210
P PV
Sbjct: 309 DVPQVAPV 316
>gnl|CDD|220585 pfam10123, Mu-like_Pro, Mu-like prophage I protein. Members of
this family of proteins comprise various viral Mu-like
prophage I proteins.
Length = 326
Score = 29.3 bits (66), Expect = 4.0
Identities = 13/46 (28%), Positives = 14/46 (30%), Gaps = 5/46 (10%)
Query: 162 SDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
D AA K F APP A + PP A S
Sbjct: 268 QDPAALKAFLAKAPPVAALLG-----TQTGGKIPPGKDTGTAALSA 308
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 29.6 bits (66), Expect = 4.0
Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 6/48 (12%)
Query: 175 PPAAGASAPAP------PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
P S P+P PP V+ P PP P+PSP S +
Sbjct: 427 PQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQ 474
>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
Length = 732
Score = 29.5 bits (66), Expect = 4.1
Identities = 12/65 (18%), Positives = 20/65 (30%), Gaps = 4/65 (6%)
Query: 170 FKDDAP---PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
F+ D P + P P P + P + +K+ G V A +
Sbjct: 603 FEPDVPEVASGEDVTQAEQPQQPQQPQQPQQPQQPQ-QPVSPVINDKKSDAGVNVPAGGI 661
Query: 227 AKRLA 231
+ L
Sbjct: 662 EQELK 666
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 29.1 bits (65), Expect = 4.2
Identities = 11/38 (28%), Positives = 15/38 (39%), Gaps = 5/38 (13%)
Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
A + +P PP P PP+ P SP P +
Sbjct: 4 ARSRSPSPP-----RRPSPPRPTPPRSPDASPEETPPS 36
Score = 27.9 bits (62), Expect = 9.6
Identities = 17/41 (41%), Positives = 20/41 (48%), Gaps = 3/41 (7%)
Query: 174 APPAAGASAPAPPPPKV---AAAPPPPPPKAAPAPSPTPVP 211
A PAA P P V ++APP PP AP+ TP P
Sbjct: 50 AGPAAPRRRPRGCPAGVTFSSSAPPRPPLGLDDAPAATPPP 90
>gnl|CDD|234273 TIGR03593, yidC_nterm, membrane protein insertase, YidC/Oxa1
family, N-terminal domain. Essentially all bacteria
have a member of the YidC family, whose C-terminal
domain is modeled by TIGR03592. The two copies are found
in endospore-forming bacteria such as Bacillus subtilis
appear redundant during vegetative growth, although the
member designated spoIIIJ (stage III sporulation protein
J) has a distinct role in spore formation. YidC, its
mitochondrial homolog Oxa1, and its chloroplast homolog
direct insertion into the bacterial/organellar inner (or
only) membrane. This model describes an N-terminal
sequence region, including a large periplasmic domain
lacking in YidC members from Gram-positive species. The
multifunctional YidC protein acts both with and
independently of the Sec system [Protein fate, Protein
and peptide secretion and trafficking].
Length = 366
Score = 29.2 bits (66), Expect = 4.2
Identities = 10/46 (21%), Positives = 12/46 (26%)
Query: 182 APAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
P PP P AA A A P+ + A
Sbjct: 26 GPPPPKPPAAAQTASAQSATAAATPAAAPPAADAPAAAAAADAAAA 71
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.4 bits (66), Expect = 4.3
Identities = 20/131 (15%), Positives = 34/131 (25%), Gaps = 15/131 (11%)
Query: 152 KLVCIIVENESDVAAFKDFKDDAPPAAGASAPAP-------PPPKVAAAPPPPPPKAAPA 204
+ I N + A PP + +
Sbjct: 126 QAGPITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQ 185
Query: 205 PSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGA 264
+ ++KT G T V A+ L KR +K S F T K + +
Sbjct: 186 DTSKETTTEKTEGKTSVKAASL-KRNPPKKS------NIMSSFFKKKTKEKKEKKEASES 238
Query: 265 VAAPSKSAKPT 275
+S + +
Sbjct: 239 -TVKEESEEES 248
>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
Length = 746
Score = 29.5 bits (66), Expect = 4.4
Identities = 14/61 (22%), Positives = 23/61 (37%), Gaps = 3/61 (4%)
Query: 167 FKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
D P GAS+PA + AP PP A + + + G++ P+
Sbjct: 672 LGDLFSGLPSTVGASSPAFQAASGSQAPASPP---TAASAIEDLFANGMGSGSQTVPLPI 728
Query: 227 A 227
+
Sbjct: 729 S 729
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 29.3 bits (66), Expect = 4.7
Identities = 16/72 (22%), Positives = 26/72 (36%), Gaps = 13/72 (18%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPA---PSPTPVPSQKTSGGTRVYASP----- 225
A ++ +S+ + PP + P P A+ PS P+P + V P
Sbjct: 14 ASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQ 73
Query: 226 -----LAKRLAA 232
+R AA
Sbjct: 74 TPQLNPLQRAAA 85
Score = 28.5 bits (64), Expect = 8.8
Identities = 9/41 (21%), Positives = 13/41 (31%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
P+A + P + P PP P+ T Q
Sbjct: 36 RRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQTPQ 76
>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
Provisional.
Length = 172
Score = 28.4 bits (63), Expect = 4.9
Identities = 8/31 (25%), Positives = 10/31 (32%)
Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
SAP + A A P+P P
Sbjct: 131 SAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161
>gnl|CDD|189051 cd09881, PIN_VapC-FitB, PIN domain of ribonucleases (toxins), VapC
and FitB, of prokaryotic toxin/antitoxin operons,
Pyrococcus horikoshii protein PH0500, and other similar
bacterial and archaeal homologs. PIN (PilT N terminus)
domain-containing proteins of prokaryotic
toxin/antitoxin (TA) operons, such as, Mycobacterium
tuberculosis VapC of the VapBC (virulence associated
proteins) TA operon, and Neisseria gonorrhoeae FitB of
the FitAB (fast intracellular trafficking) TA operon, as
well as, the archaeal Pyrococcus horikoshii protein
PH0500 are included in this family. Toxins of TA operons
are believed to be involved in growth inhibition by
regulating translation and are nearly always
co-expressed with an antitoxin, a cognate protein
inhibitor, forming an inert protein complex.
Disassociation of the complex activates the ribonuclease
activity of the toxin. In N. gonorrhoeae, FitA and FitB
form a heterodimer: FitA is the DNA binding subunit and
FitB contains a ribonuclease activity that is blocked by
the presence of FitA. A tetramer of FitAB heterodimers
binds DNA from the fitAB upstream promoter region with
high affinity. This results in both sequestration of
FitAB and repression of fitAB transcription. It is
thought that FitAB release from the DNA and subsequent
dissociation both slows N. gonorrhoeae replication and
transcytosis by an as yet undefined mechanism. The
toxin M. tuberculosis VapC is a structural homolog of N.
gonorrhoeae FitB, but their antitoxin partners, VapB and
FitA, respectively, differ structurally. The M.
tuberculosis VapC-5 is proposed to be both an
endoribonuclease and an exoribonuclease that can act on
free RNA in a similar manner to the endo and exonuclease
flap endonuclease-1 (FEN1). VapC-like toxins are
structural homologs of FEN1-like PIN domains, but lack
the extensive arch/clamp region and the H3TH
(helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region, seen in FEN1-like PIN
domains. PIN domains within this group typically
contain three or four conserved acidic residues that
cluster at the C-terminal end of the beta-sheet and form
a negatively charged pocket near the center of the
molecule. These putative active site residues are
thought to bind Mg2+ and/or Mn2+ ions and be essential
for single-stranded ribonuclease activity. VapC-like PIN
domains are single domain proteins that form dimers and
dimerization configures the active sites in a groove
along the long-axis of the structure.
Length = 127
Score = 27.9 bits (63), Expect = 5.0
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 312 ALEKRGAKLSINDFIIKATALA 333
L ++G + +ND +I ATALA
Sbjct: 83 RLRRKGRPIGLNDALIAATALA 104
>gnl|CDD|225369 COG2812, DnaX, DNA polymerase III, gamma/tau subunits [DNA
replication, recombination, and repair].
Length = 515
Score = 29.3 bits (66), Expect = 5.0
Identities = 31/153 (20%), Positives = 44/153 (28%), Gaps = 10/153 (6%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
A V+ A P APA S P + T++ S A
Sbjct: 368 SPLLQRAPRLEQELVLKANVSLAAAPAQSPQAPALSKNPELKAREQLATKIKKSEPAPES 427
Query: 231 AAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRG 290
+ + S +G + A K A A K + P P
Sbjct: 428 SEPSLAPVRSF------WGELLDE--VTAQKLKAEAKVLKDSDPVLAKPGALKRAFEGEI 479
Query: 291 VIAKRLLQSKQVIKLREQ--MNKALEKRGAKLS 321
K +V + EQ N A+E+R AK
Sbjct: 480 TAKKLETGRVEVERGGEQLAANAAVEERAAKQP 512
>gnl|CDD|238286 cd00516, PRTase_typeII, Phosphoribosyltransferase (PRTase) type II;
This family contains two enzymes that play an important
role in NAD production by either allowing quinolinic
acid (QA) , quinolinate phosphoribosyl transferase
(QAPRTase), or nicotinic acid (NA), nicotinate
phosphoribosyltransferase (NAPRTase), to be used in the
synthesis of NAD. QAPRTase catalyses the reaction of
quinolinic acid (QA) with
5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence
of Mg2+ to produce nicotinic acid mononucleotide (NAMN),
pyrophosphate and carbon dioxide, an important step in
the de novo synthesis of NAD. NAPRTase catalyses a
similar reaction leading to NAMN and pyrophosphate,
using nicotinic acid an PPRP as substrates, used in the
NAD salvage pathway.
Length = 281
Score = 28.7 bits (65), Expect = 5.0
Identities = 10/36 (27%), Positives = 15/36 (41%), Gaps = 1/36 (2%)
Query: 97 GTIVSWAKKEGDKLNEGDLLAEIE-TDKATMGFETP 131
G +V A EG + G+ L IE + + E
Sbjct: 49 GPLVILAVPEGTVVEPGEPLLTIEGPARELLLLERV 84
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 28.7 bits (64), Expect = 5.0
Identities = 6/31 (19%), Positives = 7/31 (22%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
P G P + P P P
Sbjct: 122 HPGGGGPQQFRPGAGQPPGMQHMPAPALPPN 152
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 28.7 bits (65), Expect = 5.0
Identities = 10/40 (25%), Positives = 12/40 (30%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
++ A A P A P P P TPV
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161
>gnl|CDD|234822 PRK00720, tatA, twin arginine translocase protein A; Provisional.
Length = 78
Score = 26.9 bits (60), Expect = 5.2
Identities = 17/50 (34%), Positives = 17/50 (34%), Gaps = 8/50 (16%)
Query: 163 DVA-AFKDFK-----DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
DVA K FK DDAPPA K A P P K
Sbjct: 31 DVAKGIKSFKKGMADDDAPPAEPPEPVRTIDHK--ADEPVAPVKKDVGSK 78
>gnl|CDD|172536 PRK14042, PRK14042, pyruvate carboxylase subunit B; Provisional.
Length = 596
Score = 28.9 bits (64), Expect = 5.4
Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 8/69 (11%)
Query: 81 GNDRVALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKIL 140
G+ VA+P G+I++ GD++ G + IE K + P G +A+IL
Sbjct: 526 GDITVAIP--------GSIIAIHVSAGDEVKAGQAVLVIEAMKMETEIKAPANGVVAEIL 577
Query: 141 VPAGSKDVP 149
G K P
Sbjct: 578 CQKGDKVTP 586
>gnl|CDD|139164 PRK12700, flgH, flagellar basal body L-ring protein; Reviewed.
Length = 230
Score = 28.5 bits (63), Expect = 5.5
Identities = 35/147 (23%), Positives = 52/147 (35%), Gaps = 29/147 (19%)
Query: 177 AAGASAPAPPPPKVAA---APPPPPPKAAPAPSPTPVPSQKTSGGT-------------R 220
A A PP P V APPPPPP P+ P Q ++ G
Sbjct: 18 LAAGCAMIPPEPVVTGPLTAPPPPPP--QPSARPNGSIYQPSAYGNYPLFEDRRPRNVGD 75
Query: 221 VYASPLAKRLAAEKGLDLSS---------IGAGSGLFGSITSADLSKASKAGAVAAPSKS 271
+ L ++ A KG+ ++ + A + + L G A + +
Sbjct: 76 IVTIVLEEKTNAAKGVATNTSRDGSATLGVAAAPRFMDGVINDKLDTDISGGNTA--NGT 133
Query: 272 AKPTANGPFTDLPVSGVRGVIAKRLLQ 298
K +AN FT + V GV+ LQ
Sbjct: 134 GKSSANNTFTGTITTTVIGVLPNGNLQ 160
>gnl|CDD|236716 PRK10564, PRK10564, maltose regulon periplasmic protein;
Provisional.
Length = 303
Score = 28.8 bits (65), Expect = 5.5
Identities = 14/35 (40%), Positives = 16/35 (45%), Gaps = 1/35 (2%)
Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
P G+SAPAP AAP P A S P+
Sbjct: 219 PLFGSSAPAPVTVGNTAAPVAAPAPAPAKKS-EPM 252
Score = 28.5 bits (64), Expect = 7.1
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
P A AP AAP +P P P++K+
Sbjct: 219 PLFGSSAPAPVTVGNTAAPVAAPAPAPAKKS 249
>gnl|CDD|151322 pfam10873, DUF2668, Protein of unknown function (DUF2668). Members
in this family of proteins are annotated as Cysteine and
tyrosine-rich protein 1, however currently no function
is known.
Length = 154
Score = 27.9 bits (62), Expect = 5.6
Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 8/38 (21%)
Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
++ D PP P P P+ +A PPPP P S
Sbjct: 123 EYPTDLPP------PYSPAPQASAQRSPPPP--YPGNS 152
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 29.0 bits (65), Expect = 5.7
Identities = 10/89 (11%), Positives = 20/89 (22%), Gaps = 18/89 (20%)
Query: 160 NESDVAAFKDFKDDAPPA----AGASAPAPPPPKVAAAP--------------PPPPPKA 201
+ A +P + + P P P+
Sbjct: 75 VRYNQNAPGAPSVGPDSDLSQKTSTFSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEP 134
Query: 202 APAPSPTPVPSQKTSGGTRVYASPLAKRL 230
P +P P +T + + +L
Sbjct: 135 QPPQAPESQPQPQTPAQKMLSLEEVEAQL 163
>gnl|CDD|238802 cd01568, QPRTase_NadC, Quinolinate phosphoribosyl transferase
(QAPRTase or QPRTase), also called nicotinate-nucleotide
pyrophosphorylase, is involved in the de novo synthesis
of NAD in both prokaryotes and eukaryotes. It catalyses
the reaction of quinolinic acid (QA) with
5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence
of Mg2+ to produce nicotinic acid mononucleotide (NAMN),
pyrophosphate and carbon dioxide. QPRTase functions as a
homodimer with two active sites, each formed by the
C-terminal region of one subunit and the N-terminal
region of the other.
Length = 269
Score = 28.6 bits (65), Expect = 5.7
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 100 VSWAKKEGDKLNEGDLLAEIE 120
V W K+GD++ G +L E+E
Sbjct: 59 VEWLVKDGDRVEAGQVLLEVE 79
>gnl|CDD|234896 PRK01077, PRK01077, cobyrinic acid a,c-diamide synthase; Validated.
Length = 451
Score = 28.9 bits (66), Expect = 6.0
Identities = 13/32 (40%), Positives = 15/32 (46%)
Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
DA A +AP PPP P PP + A A
Sbjct: 220 DALLALARAAPPPPPAAAPPPPAPPGVRIAVA 251
>gnl|CDD|221294 pfam11891, DUF3411, Domain of unknown function (DUF3411). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 168 to 186 amino acids in length. This domain
has a conserved RYQ sequence motif.
Length = 179
Score = 28.1 bits (63), Expect = 6.1
Identities = 15/67 (22%), Positives = 23/67 (34%), Gaps = 11/67 (16%)
Query: 222 YASPLAKRLAA--EKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
+ L +R KG L+++G +GL G S L A KA +
Sbjct: 97 RSYSLQQRFGTVFYKGAKLAAVGFIAGLVGQAISNALMAARKA---------VDKNSEES 147
Query: 280 FTDLPVS 286
P+
Sbjct: 148 VKVPPLF 154
>gnl|CDD|234764 PRK00442, tatA, twin arginine translocase protein A; Provisional.
Length = 92
Score = 27.1 bits (60), Expect = 6.1
Identities = 14/46 (30%), Positives = 16/46 (34%), Gaps = 5/46 (10%)
Query: 168 KDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
K +D A PAPP P PP AP P + Q
Sbjct: 41 KAMNEDEKEDKPAEQPAPPAQ-----PVPPVQNTAPLNQPHTIDGQ 81
Score = 26.4 bits (58), Expect = 9.6
Identities = 10/34 (29%), Positives = 14/34 (41%)
Query: 166 AFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPP 199
A + + + PA + PA P P V P P
Sbjct: 42 AMNEDEKEDKPAEQPAPPAQPVPPVQNTAPLNQP 75
>gnl|CDD|233382 TIGR01372, soxA, sarcosine oxidase, alpha subunit family,
heterotetrameric form. This model describes the alpha
subunit of a family of known and putative
heterotetrameric sarcosine oxidases. Five operons of
such oxidases are found in Mesorhizobium loti and three
in Agrobacterium tumefaciens, a high enough copy number
to suggest that not all members are share the same
function. The model is designated as subfamily rather
than equivalog for this reason.Sarcosine oxidase
catalyzes the oxidative demethylation of sarcosine to
glycine. The reaction converts tetrahydrofolate to
5,10-methylene-tetrahydrofolate. The enzyme is known in
monomeric and heterotetrameric (alpha,beta,gamma,delta)
forms [Energy metabolism, Amino acids and amines].
Length = 985
Score = 28.9 bits (65), Expect = 6.1
Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 2/61 (3%)
Query: 216 SGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPT 275
GG + + +A L + GA +GLFG +A L+ + AGA AA + + T
Sbjct: 419 RGGKLAWDAAIAAFLPGDAVQGCILAGAANGLFG--LAAALADGAAAGAAAARAAGFEGT 476
Query: 276 A 276
A
Sbjct: 477 A 477
>gnl|CDD|222346 pfam13728, TraF, F plasmid transfer operon protein. TraF protein
undergoes proteolytic processing associated with export.
The 19 amino acids at the amino terminus of the
polypeptides appear to constitute a typical membrane
leader peptide - not included in this family, while the
remainder of the molecule is predicted to be primarily
hydrophilic in character. F plasmid TraF and TraH are
required for F pilus assembly and F plasmid transfer,
and they are both localised to the outer membrane in the
presence of the complete F transfer region, especially
TraV, the putative anchor.
Length = 215
Score = 28.3 bits (64), Expect = 6.2
Identities = 9/23 (39%), Positives = 10/23 (43%)
Query: 187 PPKVAAAPPPPPPKAAPAPSPTP 209
P P P AAPA SP+
Sbjct: 7 EPPEPKEPEKAPAPAAPAQSPSA 29
>gnl|CDD|172484 PRK13982, PRK13982, bifunctional
SbtC-like/phosphopantothenoylcysteine
decarboxylase/phosphopantothenate synthase; Provisional.
Length = 475
Score = 29.0 bits (65), Expect = 6.2
Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 6/69 (8%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP-LAKR 229
+D P G +A + P AA P + + T + GG Y + L +R
Sbjct: 37 RDATTPRHGPAASSAAPVSAAAPPAAREQASLASKRVTLI----IGGGIAAYKALDLIRR 92
Query: 230 LAAEKGLDL 238
L E+G +
Sbjct: 93 L-KERGAHV 100
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 28.3 bits (63), Expect = 6.2
Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 3/63 (4%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV-PSQKTSGGTRVYASPLAKRLAAEKG 235
+GA P + P P + +PSP S ++ G R +S A+ +EK
Sbjct: 69 YSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQSLPSTTGVRPQSS--AENANSEKR 126
Query: 236 LDL 238
L
Sbjct: 127 KLL 129
>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein. This domain is
found in the aflatoxin regulatory protein (AflR) which
is involved in the regulation of the biosynthesis of
aflatoxin in the fungal genus Aspergillus. It occurs
together with the fungal Zn(2)-Cys(6) binuclear cluster
domain (pfam00172).
Length = 275
Score = 28.8 bits (64), Expect = 6.2
Identities = 32/120 (26%), Positives = 47/120 (39%), Gaps = 16/120 (13%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAP-APSPTPVPSQKTSGG-----TRVYASPLAKRL 230
A S+ P PP + PP A P P+ T PS K S ++ S L+
Sbjct: 19 ANTTSSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPSPKFSHQSPPAEPELWGSILSPNA 78
Query: 231 AAEKGLDLSSIGAG----SGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVS 286
+ + DLSS+ + LF S++ + L + A A+ T DL VS
Sbjct: 79 SNQDQGDLSSLLSVNTDFGQLFASLSPSPLFDGNDA------DLHAEATGELSVADLEVS 132
>gnl|CDD|237081 PRK12372, PRK12372, ribonuclease III; Reviewed.
Length = 413
Score = 28.7 bits (64), Expect = 6.2
Identities = 13/64 (20%), Positives = 18/64 (28%)
Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
E A + P ++ A P A P AAP P + +
Sbjct: 345 AEAAPRAADKPAGQAADPASSSADKPGASADAAARTPARARDAAAPDADTPPGGASLAAA 404
Query: 218 GTRV 221
RV
Sbjct: 405 QARV 408
>gnl|CDD|232814 TIGR00078, nadC, nicotinate-nucleotide pyrophosphorylase. Synonym:
quinolinate phosphoribosyltransferase (decarboxylating)
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Pyridine nucleotides].
Length = 265
Score = 28.4 bits (64), Expect = 6.3
Identities = 10/26 (38%), Positives = 19/26 (73%)
Query: 95 EMGTIVSWAKKEGDKLNEGDLLAEIE 120
++G V W K+GD++ G+++AE+E
Sbjct: 51 QLGVQVEWLVKDGDRVEPGEVVAEVE 76
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 28.2 bits (63), Expect = 6.4
Identities = 18/69 (26%), Positives = 24/69 (34%), Gaps = 5/69 (7%)
Query: 157 IVENESDVAAFKDFKDDA-----PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
+ E +V F ++ DA P P P P +A P PP+ A P P
Sbjct: 105 VEEVRKEVEFFNNYLTDAKRPALPEIKPLQPPKPDPGGLAPGLPGYPPQTPQALMPYGQP 164
Query: 212 SQKTSGGTR 220
G R
Sbjct: 165 RPPMMGYGR 173
>gnl|CDD|215520 PLN02964, PLN02964, phosphatidylserine decarboxylase.
Length = 644
Score = 28.7 bits (64), Expect = 6.5
Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 2/31 (6%)
Query: 84 RVALPALSPTMEMGTIVSWAKKEGDKLNEGD 114
+VA A+ TM +G+I ++ KKEGD + +GD
Sbjct: 551 KVAFVAIGATM-VGSI-TFVKKEGDHVKKGD 579
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 28.7 bits (64), Expect = 7.0
Identities = 26/112 (23%), Positives = 41/112 (36%), Gaps = 7/112 (6%)
Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT-SGGTRVYASPLAKRLAAEKGLDLSSIGA 243
PP AA P P P P P ++ V PL+ A E DL +
Sbjct: 329 PPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYE---DLKPPTS 385
Query: 244 GSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKR 295
+ S+ + + AVA P++ + G +++P V AK+
Sbjct: 386 PI---PTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKK 434
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 28.1 bits (63), Expect = 7.1
Identities = 20/121 (16%), Positives = 30/121 (24%), Gaps = 19/121 (15%)
Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
F AP A+ PP A + P +Q G L
Sbjct: 87 FGGGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQPGPGSF------LGGA 140
Query: 230 LAAEKGLDLSSIGAGSGLFGSITS--------ADLSKASKAGAVAAPSKSAKPTANGPFT 281
G + G L + + A++ A G + A T +
Sbjct: 141 AQTAAG-----VAGGMLLGNGLENLFGGHSQPAEIVDAIGEGGDGSGPAPADDTGINNYG 195
Query: 282 D 282
D
Sbjct: 196 D 196
>gnl|CDD|223235 COG0157, NadC, Nicotinate-nucleotide pyrophosphorylase [Coenzyme
metabolism].
Length = 280
Score = 28.3 bits (64), Expect = 7.2
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 100 VSWAKKEGDKLNEGDLLAEIE 120
+ W K+GD++ GD+LAEIE
Sbjct: 66 IQWLVKDGDRVKPGDVLAEIE 86
>gnl|CDD|113156 pfam04375, HemX, HemX. This family consists of several bacterial
HemX proteins. The hemX gene is not essential for haem
synthesis in B. subtilis. HemX is a polytopic membrane
protein which by an unknown mechanism down-regulates the
level of HemA.
Length = 372
Score = 28.3 bits (63), Expect = 7.2
Identities = 11/47 (23%), Positives = 16/47 (34%), Gaps = 2/47 (4%)
Query: 192 AAPPPPPPKAAPAPS--PTPVPSQKTSGGTRVYASPLAKRLAAEKGL 236
++ P S P P+ K+ G T + A L L G
Sbjct: 1 SSVPAETASQTTVTSSPVAPGPAAKSGGATGLAALALLVALGLGAGG 47
>gnl|CDD|235553 PRK05672, dnaE2, error-prone DNA polymerase; Validated.
Length = 1046
Score = 28.7 bits (65), Expect = 7.3
Identities = 10/30 (33%), Positives = 11/30 (36%), Gaps = 1/30 (3%)
Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
AAGA A + AP A P S
Sbjct: 880 AAGA-AATERLDLLPGAPASEAAPALPGMS 908
>gnl|CDD|237530 PRK13843, PRK13843, conjugal transfer protein TraH; Provisional.
Length = 207
Score = 28.1 bits (63), Expect = 7.5
Identities = 17/73 (23%), Positives = 19/73 (26%), Gaps = 16/73 (21%)
Query: 173 DAPPAAGASAPAPPPPKVA-AAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
A P P A PP+ A S P G R+
Sbjct: 148 RAEDPGDVKLAEPEPGDSDEAEESSDPPENAEDASEPAEPDDPNKAGIRI---------- 197
Query: 232 AEKGLDLSSIGAG 244
DLS IGA
Sbjct: 198 -----DLSGIGAQ 205
>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 559
Score = 28.4 bits (63), Expect = 7.6
Identities = 12/39 (30%), Positives = 16/39 (41%)
Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
P + APA P + A+ P P A A + P Q
Sbjct: 117 PVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQ 155
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 28.6 bits (64), Expect = 7.6
Identities = 9/48 (18%), Positives = 14/48 (29%)
Query: 166 AFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
A+ ++ P A A + PP A S P +
Sbjct: 145 AYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQER 192
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 27.8 bits (62), Expect = 7.7
Identities = 13/63 (20%), Positives = 19/63 (30%), Gaps = 8/63 (12%)
Query: 155 CIIVENESDVAAFKDFKDDAPPAAGASAP--------APPPPKVAAAPPPPPPKAAPAPS 206
C+ +E ++ A + DA PAA + P A P P P
Sbjct: 36 CMTMEQANEKAKKLEQSSDAKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPE 95
Query: 207 PTP 209
Sbjct: 96 QKL 98
>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
Length = 987
Score = 28.8 bits (64), Expect = 7.7
Identities = 14/60 (23%), Positives = 17/60 (28%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
A A A P K AA AA P ++ LA + AE
Sbjct: 870 AFGGTVAVKAAKPAKKAAAKKVAAKTAAAKTPRKAAKKKAAPPAAGLKPSAALAAVIGAE 929
>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 233
Score = 28.3 bits (63), Expect = 7.8
Identities = 10/35 (28%), Positives = 14/35 (40%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
+ PA PP A PP + P+ P P+
Sbjct: 95 QEPAPPANAPPPKEPAAPPSWRSSPAGPTTQPSPA 129
>gnl|CDD|215430 PLN02800, PLN02800, imidazoleglycerol-phosphate dehydratase.
Length = 261
Score = 28.3 bits (63), Expect = 8.0
Identities = 6/51 (11%), Positives = 12/51 (23%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
P P + ++P P+ S + + A
Sbjct: 13 LRPKLSFIDLLPRRAAIVSSPSSSLPRFLRMESQLRQSISCAASSSSSNAL 63
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 26.8 bits (60), Expect = 8.1
Identities = 10/42 (23%), Positives = 14/42 (33%)
Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
VAA +D + A+AP AAA +
Sbjct: 42 VAALEDVNIEEAIKKAAAAPVAAAAAAAAAAAAAAAEEKKEE 83
>gnl|CDD|133414 cd04787, HTH_HMRTR_unk, Helix-Turn-Helix DNA binding domain of
putative Heavy Metal Resistance transcription
regulators. Putative helix-turn-helix (HTH) heavy metal
resistance transcription regulators (HMRTR), unknown
subgroup. Based on sequence similarity, these proteins
are predicted to function as transcription regulators
that mediate responses to heavy metal stress in
eubacteria. They belong to the MerR superfamily of
transcription regulators that promote transcription of
various stress regulons by reconfiguring the operator
sequence located between the -35 and -10 promoter
elements. A typical MerR regulator is comprised of two
distinct domains that harbor the regulatory
(effector-binding) site and the active (DNA-binding)
site. Their N-terminal domains are homologous and
contain a DNA-binding winged HTH motif, while the
C-terminal domains are often dissimilar and bind
specific coactivator molecules, such as, metal ions,
drugs, and organic substrates. This subgroup lacks one
of the conserved, metal-binding cysteines seen in the
MerR1 group.
Length = 133
Score = 27.3 bits (61), Expect = 8.5
Identities = 12/39 (30%), Positives = 24/39 (61%), Gaps = 4/39 (10%)
Query: 281 TDLPVSGVRGVIAKRLLQSKQVI----KLREQMNKALEK 315
+ P VR +I +RL ++++ I KLR++M +A+ +
Sbjct: 73 GESPCPMVRRLIEQRLAETERRIKELLKLRDRMQQAVSQ 111
>gnl|CDD|219262 pfam07001, BAT2_N, BAT2 N-terminus. This family represents the
N-terminus (approximately 200 residues) of the
proline-rich protein BAT2. BAT2 is similar to other
proteins with large proline-rich domains, such as some
nuclear proteins, collagens, elastin, and synapsin.
Length = 189
Score = 28.0 bits (62), Expect = 8.5
Identities = 15/57 (26%), Positives = 19/57 (33%), Gaps = 5/57 (8%)
Query: 162 SDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
SD + + P + SA +PPPP P P P V S K
Sbjct: 99 SDALPPQTPESQPPAVSQTSASSPPPP-----PQPATPLVPGGAKSWAVASAKPGAQ 150
>gnl|CDD|129483 TIGR00388, glyQ, glycyl-tRNA synthetase, tetrameric type, alpha
subunit. This tetrameric form of glycyl-tRNA synthetase
(2 alpha, 2 beta) is found in the majority of completed
eubacterial genomes, with the two genes fused in a few
species. A substantially different homodimeric form (not
recognized by this model) replaces this form in the
Archaea, animals, yeasts, and some eubacteria [Protein
synthesis, tRNA aminoacylation].
Length = 293
Score = 28.3 bits (63), Expect = 8.6
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 100 VSWAKKEGDKLNEGDLLAEIETDKATMGFET 130
+ W+ K GD+ + E +++T FET
Sbjct: 177 LEWSDGPLGKTTYGDVFHQNEVEQSTYNFET 207
>gnl|CDD|234700 PRK00249, flgH, flagellar basal body L-ring protein; Reviewed.
Length = 222
Score = 27.9 bits (63), Expect = 8.6
Identities = 12/41 (29%), Positives = 18/41 (43%), Gaps = 6/41 (14%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
AG ++ P P AAP +PTP+P+ +G
Sbjct: 8 AGCASIPRLPLV------GGAPAAAPVEAPTPLPNPAPNGS 42
>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
This model represents the MFP (membrane fusion protein)
component of the RND family of transporters. RND refers
to Resistance, Nodulation, and cell Division. It is, in
part, a subfamily of pfam00529 (Pfam release 7.5) but
hits substantial numbers of proteins missed by that
model. The related HlyD secretion protein, for which
pfam00529 is named, is outside the scope of this model.
Attributed functions imply outward transport. These
functions include nodulation, acriflavin resistance,
heavy metal efflux, and multidrug resistance proteins.
Most members of this family are found in Gram-negative
bacteria. The proposed function of MFP proteins is to
bring the inner and outer membranes together and enable
transport to the outside of the outer membrane. Note,
however, that a few members of this family are found in
Gram-positive bacteria, where there is no outer membrane
[Transport and binding proteins, Unknown substrate].
Length = 322
Score = 28.0 bits (63), Expect = 9.1
Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 10/56 (17%)
Query: 69 AVGSLSQRLVREGNDRVALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKA 124
GSL E D L A G I + +EG K+ +G +LA ++ D
Sbjct: 17 FPGSL------EAVDEADLAAEVA----GKITKISVREGQKVKKGQVLARLDDDDY 62
>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter. The Epithelial Na+
Channel (ENaC) Family (TC 1.A.06)The ENaC family
consists of sodium channels from animals and has no
recognizable homologues in other eukaryotes or bacteria.
The vertebrate ENaC proteins from epithelial cells
cluster tightly together on the phylogenetic tree:
voltage-insensitive ENaC homologues are also found in
the brain. Eleven sequenced C. elegans proteins,
including the degenerins, are distantly related to the
vertebrate proteins as well as to each other. At least
some ofthese proteins form part of a mechano-transducing
complex for touch sensitivity. Other members of the ENaC
family, the acid-sensing ion channels, ASIC1-3,are homo-
or hetero-oligomeric neuronal H+-gated channels that
mediate pain sensation in response to tissue acidosis.
The homologous Helix aspersa(FMRF-amide)-activated Na+
channel is the first peptide neurotransmitter-gated
ionotropic receptor to be sequenced.Mammalian ENaC is
important for the maintenance of Na+ balance and the
regulation of blood pressure. Three homologous ENaC
subunits, a, b and g, havebeen shown to assemble to form
the highly Na+-selective channel.This model is designed
from the vertebrate members of the ENaC family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 595
Score = 28.2 bits (63), Expect = 9.1
Identities = 8/32 (25%), Positives = 13/32 (40%)
Query: 167 FKDFKDDAPPAAGASAPAPPPPKVAAAPPPPP 198
+ D PP ++ P P ++ PPP
Sbjct: 559 PPSLQLDDPPTFPSALPLPHASGLSLPGTPPP 590
>gnl|CDD|219130 pfam06674, DUF1176, Protein of unknown function (DUF1176). This
family consists of several hypothetical bacterial
proteins of around 340 residues in length. Members of
this family contain six highly conserved cysteine
residues. The function of this family is unknown.
Length = 338
Score = 28.1 bits (63), Expect = 9.3
Identities = 13/37 (35%), Positives = 15/37 (40%), Gaps = 3/37 (8%)
Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
K P + S P PP V A P PP A P+
Sbjct: 158 KGTKPAS---SVPPAPPLPVIRAAPAPPAAAPLDPAE 191
>gnl|CDD|223064 PHA03375, PHA03375, hypothetical protein; Provisional.
Length = 844
Score = 28.3 bits (63), Expect = 9.4
Identities = 11/46 (23%), Positives = 16/46 (34%), Gaps = 2/46 (4%)
Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
P A S APPP P ++ + S + P T +
Sbjct: 376 RPRTARLS--APPPSAAPPVPRSGSSPSSLSSSSSSAPPPATDPSS 419
>gnl|CDD|219241 pfam06958, Pyocin_S, S-type Pyocin. This family represents a
conserved region approximately 180 residues long within
bacterial S-type pyocins. Pyocins are polypeptide toxins
produced by, and active against, bacteria. S-type
pyocins cause cell death by DNA breakdown due to
endonuclease activity.
Length = 139
Score = 27.3 bits (61), Expect = 9.5
Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 2/45 (4%)
Query: 172 DDAPPAAGASAPAPPPPKVAAAPP--PPPPKAAPAPSPTPVPSQK 214
P +S +P PP P P P +P P P +K
Sbjct: 74 ASPPGDIPSSTSSPAPPPDTPVITGTPITPVVDPVITPYPAPDEK 118
>gnl|CDD|215605 PLN03153, PLN03153, hypothetical protein; Provisional.
Length = 537
Score = 28.3 bits (63), Expect = 9.8
Identities = 21/67 (31%), Positives = 27/67 (40%), Gaps = 10/67 (14%)
Query: 195 PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSA 254
P PPP PS +PS R + +RL E L L+ I +FG S+
Sbjct: 83 PLPPP-----PSSPSLPSSLLLDHFRNRSLSEIERLKVEAELSLNHI-----MFGIAGSS 132
Query: 255 DLSKASK 261
L K K
Sbjct: 133 QLWKRRK 139
>gnl|CDD|215258 PLN02465, PLN02465, L-galactono-1,4-lactone dehydrogenase.
Length = 573
Score = 28.3 bits (63), Expect = 9.8
Identities = 14/55 (25%), Positives = 22/55 (40%), Gaps = 5/55 (9%)
Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
+GA+A P PPPPK P + ++GG A+ + + L
Sbjct: 1 SGAAAGNFAGP-----LAPPPPKPTQGPRSFARGALSSAGGEPASAATVRRYLGY 50
>gnl|CDD|183147 PRK11465, PRK11465, putative mechanosensitive channel protein;
Provisional.
Length = 741
Score = 28.2 bits (63), Expect = 9.9
Identities = 12/45 (26%), Positives = 15/45 (33%), Gaps = 2/45 (4%)
Query: 189 KVAAAPP--PPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
AA PP P P P Q + +R Y L+ R
Sbjct: 71 TAAATPPAEPVPKIVPPTLVEEKTVLQNVTEVSRHYGEALSARFG 115
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.131 0.376
Gapped
Lambda K H
0.267 0.0700 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,982,181
Number of extensions: 2202076
Number of successful extensions: 10721
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6507
Number of HSP's successfully gapped: 1009
Length of query: 426
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 326
Effective length of database: 6,502,202
Effective search space: 2119717852
Effective search space used: 2119717852
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 60 (26.9 bits)