RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy18020
(434 letters)
>gnl|CDD|203324 pfam05743, UEV, UEV domain. This family includes the eukaryotic
tumour susceptibility gene 101 protein (TSG101). Altered
transcripts of this gene have been detected in sporadic
breast cancers and many other human malignancies.
However, the involvement of this gene in neoplastic
transformation and tumorigenesis is still elusive.
TSG101 is required for normal cell function of embryonic
and adult tissues but that this gene is not a tumour
suppressor for sporadic forms of breast cancer. This
family is related to the ubiquitin conjugating enzymes.
Length = 119
Score = 111 bits (279), Expect = 1e-29
Identities = 36/76 (47%), Positives = 48/76 (63%)
Query: 5 SGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHTWTPQNSK 64
G Y+IP+ IWL DT+P PIC+V+PT DM I V+ +VD G++Y PYLH W +S
Sbjct: 44 RGNTYNIPILIWLPDTYPFSPPICFVKPTPDMVIKVNHHVDAQGRVYLPYLHNWNHPSSN 103
Query: 65 LEGLIKTLASAFSALP 80
L L++ LA F P
Sbjct: 104 LVDLVQELAQVFGEEP 119
Score = 66.5 bits (163), Expect = 1e-13
Identities = 22/51 (43%), Positives = 31/51 (60%)
Query: 101 IRPTADMRINVSQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALP 151
++PT DM I V+ +VD G++Y PYLH W +S L L++ LA F P
Sbjct: 69 VKPTPDMVIKVNHHVDAQGRVYLPYLHNWNHPSSNLVDLVQELAQVFGEEP 119
>gnl|CDD|117992 pfam09454, Vps23_core, Vps23 core domain. ESCRT complexes form the
main machinery driving protein sorting from endosomes to
lysosomes. The core domain of the Vps23 subunit of the
heterotrimeric ESCRT-I complex is a helical hairpin
sandwiched in a fan-like formation between two other
helical hairpins from Vps28 (pfam03997) and Vps37. Vps23
gives ESCRT-I complex its stability.
Length = 65
Score = 96.2 bits (240), Expect = 8e-25
Identities = 35/65 (53%), Positives = 45/65 (69%)
Query: 361 DEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFR 420
DEA+ P+ Q+L E+ A EDTIYY+ AL+RG IDLE YLK VR L+R+QF+ R
Sbjct: 1 DEAIVAEDPLLNQLLELVAEDHAIEDTIYYLERALKRGKIDLETYLKHVRELAREQFLKR 60
Query: 421 ALLFK 425
AL+ K
Sbjct: 61 ALIQK 65
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 48.2 bits (115), Expect = 8e-06
Identities = 26/122 (21%), Positives = 54/122 (44%), Gaps = 7/122 (5%)
Query: 276 SLKSAVEDKIRR--KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
SL+ E + ++ E+ + +E+++ L++ EL + +L E ++ EK ELE
Sbjct: 818 SLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELED 877
Query: 334 SISFLKEKETELDEILAVLT---EKTEVDVDEAVT--TTAPIYKQILNTFTEEAATEDTI 388
+ L+E++ EL+E L L + + ++++ + L E E
Sbjct: 878 ELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEE 937
Query: 389 YY 390
Y
Sbjct: 938 EY 939
Score = 47.8 bits (114), Expect = 8e-06
Identities = 35/162 (21%), Positives = 68/162 (41%), Gaps = 22/162 (13%)
Query: 271 EHIKMSLKSAVEDKIR-----RKMNEQKVQAQEEIEILKQTENELN-------QGKFKLA 318
+ ++ L+ E+ R ++ E+ +A++EIE LK EL + +L
Sbjct: 235 KELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELK 294
Query: 319 RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEV---------DVDEAVTTTAP 369
E ++ E + L + L+ + EL+E L L EK E + E +
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354
Query: 370 IYKQILNTFTEEAAT-EDTIYYMGEALRRGVIDLEVYLKQVR 410
++ E+ + + + + EALR + +LE L ++R
Sbjct: 355 ELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIR 396
Score = 39.3 bits (92), Expect = 0.004
Identities = 18/94 (19%), Positives = 43/94 (45%)
Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
++ +K + ++E++ L++ + EL + +L + +E +L + L+ K L+
Sbjct: 867 ELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEV 926
Query: 348 ILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE 381
L L E+ E + ++ + T + L E
Sbjct: 927 ELPELEEELEEEYEDTLETELEREIERLEEEIEA 960
Score = 39.3 bits (92), Expect = 0.004
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 12/99 (12%)
Query: 271 EHIKMSLKSAVEDKIRRKM-----NEQKVQAQEEIEILKQTENELNQGKFKLARMF---- 321
E ++ L+S E + K E++ QEE+E L++ E + L R
Sbjct: 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLE 820
Query: 322 ---ERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 357
ER+++E ELE I L+EK EL+E L L ++ E
Sbjct: 821 QRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELE 859
Score = 38.5 bits (90), Expect = 0.007
Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 3/78 (3%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID---KEKAELERSISFLK 339
+++ R++ E K + E L+Q ++ L + + +L + E ++ + ELE + L+
Sbjct: 712 EELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLE 771
Query: 340 EKETELDEILAVLTEKTE 357
E +L E + L EK +
Sbjct: 772 EALAKLKEEIEELEEKRQ 789
Score = 37.8 bits (88), Expect = 0.013
Identities = 30/149 (20%), Positives = 61/149 (40%), Gaps = 22/149 (14%)
Query: 281 VEDKIR------RKMNEQKVQAQEEIEILKQ--------TENELNQGKFKLARMFERIDK 326
+ED + K+ Q +A+ E+ + +L + + +L + E + +
Sbjct: 191 LEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSR 250
Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATED 386
+ ELE L+E E E++E+ + L E E +++E + ++I E
Sbjct: 251 LEEELEELQEELEEAEKEIEELKSELEELRE-ELEELQEELLELKEEI-------EELEG 302
Query: 387 TIYYMGEALRRGVIDLEVYLKQVRSLSRK 415
I + E L +LE +++ L K
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEK 331
Score = 37.0 bits (86), Expect = 0.019
Identities = 21/90 (23%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEI--EILKQTENELNQGKFKLARMFERIDKEK-- 328
++ L+ K ++ + E L++ E EL + + +L + ER+++ +
Sbjct: 707 LRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEE 766
Query: 329 -AELERSISFLKEKETELDEILAVLTEKTE 357
LE +++ LKE+ EL+E L E+ E
Sbjct: 767 LESLEEALAKLKEEIEELEEKRQALQEELE 796
Score = 36.6 bits (85), Expect = 0.025
Identities = 24/143 (16%), Positives = 57/143 (39%), Gaps = 6/143 (4%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
E ++ L+ E+ ++ E+ + +EE+E L++ +L + +L + + +E E
Sbjct: 740 EELEEELEELEEE--LEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEE 797
Query: 331 LERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYY 390
LE + + + L+ L E E + + ++I + E+ +
Sbjct: 798 LEEELEEAERRLDALEREL----ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853
Query: 391 MGEALRRGVIDLEVYLKQVRSLS 413
+ + L +LE + L
Sbjct: 854 LEKELEELKEELEELEAEKEELE 876
Score = 35.1 bits (81), Expect = 0.082
Identities = 28/160 (17%), Positives = 65/160 (40%), Gaps = 14/160 (8%)
Query: 270 DEHIKMSLK--SAVEDKIRRKMNEQKVQAQEEIEILKQ----TENELNQGKFKLARMFER 323
E ++ L ++++ K++ + +E E L++ E EL + + +L +
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKRE 405
Query: 324 IDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAA 383
I+ + LER L++ + EL E+ E +++E T + +++ +
Sbjct: 406 IESLEERLERLSERLEDLKEELKEL--------EAELEELQTELEELNEELEELEEQLEE 457
Query: 384 TEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALL 423
D + + L +L+ K++ SL + A
Sbjct: 458 LRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQ 497
Score = 33.5 bits (77), Expect = 0.26
Identities = 28/140 (20%), Positives = 61/140 (43%), Gaps = 7/140 (5%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
LK+ + + + + + ++E+E L++ + L + +L E +KE EL+ +
Sbjct: 217 ELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSEL 276
Query: 336 SFLKEKETELDEILAVLT---EKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMG 392
L+E+ EL E L L E+ E ++ + ++ ++ I
Sbjct: 277 EELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKI---- 332
Query: 393 EALRRGVIDLEVYLKQVRSL 412
EAL+ + + E L+++ L
Sbjct: 333 EALKEELEERETLLEELEQL 352
Score = 32.8 bits (75), Expect = 0.38
Identities = 21/97 (21%), Positives = 45/97 (46%), Gaps = 11/97 (11%)
Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKF-KLARMFERID 325
+ +E+ +L++ +E +I R +EEIE L N ++ ++ +E +
Sbjct: 934 ELEEEYE-DTLETELEREIER--------LEEEIEALGPV-NLRAIEEYEEVEERYEELK 983
Query: 326 KEKAELERSISFLKEKETELDEILAVLTEKTEVDVDE 362
++ +LE + L E ELD+ ++T ++E
Sbjct: 984 SQREDLEEAKEKLLEVIEELDKEKRERFKETFDKINE 1020
Score = 32.0 bits (73), Expect = 0.66
Identities = 33/143 (23%), Positives = 62/143 (43%), Gaps = 12/143 (8%)
Query: 283 DKIRRKMNE---QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK 339
D++ ++ E + + +EE+E L+ + EL +L E +++E ELE ++ LK
Sbjct: 845 DELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELK 904
Query: 340 EKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRR-G 398
E+ +L E L L K E E + ++ +T E E I + E + G
Sbjct: 905 EEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETEL--EREIERLEEEIEALG 962
Query: 399 VIDL------EVYLKQVRSLSRK 415
++L E ++ L +
Sbjct: 963 PVNLRAIEEYEEVEERYEELKSQ 985
Score = 30.8 bits (70), Expect = 1.9
Identities = 23/134 (17%), Positives = 50/134 (37%), Gaps = 24/134 (17%)
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAELERSISFLKEKE 342
RRK+ EE + + + + + KL R ER++ ELE+ + L+ +
Sbjct: 157 RRKL-------IEEAAGVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQA 209
Query: 343 TELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDL 402
+ + + E E+++ + ++ E E E L R +L
Sbjct: 210 EKAERYQELKAELRELELAL-------LLAKLKELRKELEELE-------EELSRLEEEL 255
Query: 403 EVYLKQVRSLSRKQ 416
E +++ ++
Sbjct: 256 EELQEELEEAEKEI 269
Score = 30.5 bits (69), Expect = 2.1
Identities = 18/79 (22%), Positives = 37/79 (46%), Gaps = 4/79 (5%)
Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338
++ED + + + + + ++E LK+ L + +L E +++E ELE + L
Sbjct: 698 RSLEDLLEELRRQLE-ELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEEL 756
Query: 339 KEKETELDEILAVLTEKTE 357
+E L+E+ L E
Sbjct: 757 QE---RLEELEEELESLEE 772
Score = 30.5 bits (69), Expect = 2.1
Identities = 29/140 (20%), Positives = 64/140 (45%), Gaps = 11/140 (7%)
Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERI---DKEKAELERSI 335
+A+E+++ + + + +EE+E L++ EL + +L + E + ++ A+L+ I
Sbjct: 726 AALEEELEQLQSRLEEL-EEELEELEEELEELQE---RLEELEEELESLEEALAKLKEEI 781
Query: 336 SFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395
L+EK L E L L E+ E EA + +++ + E I + E +
Sbjct: 782 EELEEKRQALQEELEELEEELE----EAERRLDALERELESLEQRRERLEQEIEELEEEI 837
Query: 396 RRGVIDLEVYLKQVRSLSRK 415
L+ +++ L ++
Sbjct: 838 EELEEKLDELEEELEELEKE 857
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 45.5 bits (109), Expect = 3e-05
Identities = 32/119 (26%), Positives = 55/119 (46%), Gaps = 4/119 (3%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
E IK +++I + NE + + +E L++ E L Q + L R E ++K + E
Sbjct: 52 EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE 111
Query: 331 LERSISFLKEKETELDEILAVLTEKTEVDVDE----AVTTTAPIYKQILNTFTEEAATE 385
LE+ L++K+ EL++ L E E + E + T + +L EEA E
Sbjct: 112 LEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHE 170
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 45.8 bits (108), Expect = 3e-05
Identities = 42/187 (22%), Positives = 66/187 (35%), Gaps = 24/187 (12%)
Query: 151 PPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNV---NTPYPTYPPSA----- 202
P ++ P NP + SN P Y +P SN N P PPS+
Sbjct: 422 PGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHH 481
Query: 203 ---YPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYP----PGGSTPATPYPVPT 255
+ Y AA P+ ++ P+ P A G P P GS P T
Sbjct: 482 SAYHAAYQHRAANQPAANL-PTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATT 540
Query: 256 PSPA--------PAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTE 307
P P + ++ + S++S+ + NE + +E +LK+T
Sbjct: 541 ADPNGIAKREDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTDKGEEIYSLLKKTI 600
Query: 308 NELNQGK 314
N ++ K
Sbjct: 601 NRIDMNK 607
Score = 44.7 bits (105), Expect = 1e-04
Identities = 37/154 (24%), Positives = 50/154 (32%), Gaps = 20/154 (12%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSN---VNTPYPTYPPSAYPGYNPNAAAAPSGSI 218
+NP YN SN PY P SN N PY P S P N + AP S
Sbjct: 418 GNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSA 477
Query: 219 --YPS-YPT----------PGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSG 265
+ S Y P A + + G++ P+ AP +
Sbjct: 478 KDHHSAYHAAYQHRAANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNA 537
Query: 266 GTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEE 299
T D + + ED N QK + +E
Sbjct: 538 ATTADP----NGIAKREDHPEGGTNRQKYEQSDE 567
Score = 36.2 bits (83), Expect = 0.034
Identities = 29/101 (28%), Positives = 40/101 (39%), Gaps = 12/101 (11%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
P +N +YN SNA + S N Y + P ++ PGYN + + P+
Sbjct: 383 PHSNASYNCAAYSNAAQSNAAQSNAGFS--NAGY-SNPGNSNPGYNNAPNSNTPYNNPPN 439
Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTP--ATPYPVPTPSPAP 260
TP + P P PP + P TPY S AP
Sbjct: 440 SNTPYSNP------PNS-NPPYSNLPYSNTPYSNAPLSNAP 473
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 45.9 bits (109), Expect = 3e-05
Identities = 19/94 (20%), Positives = 38/94 (40%), Gaps = 2/94 (2%)
Query: 266 GTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID 325
G E + L E ++ E+++ ++E L++ EL + +L +
Sbjct: 464 GQELPEEHEKELLELYELELEEL--EEELSREKEEAELREEIEELEKELRELEEELIELL 521
Query: 326 KEKAELERSISFLKEKETELDEILAVLTEKTEVD 359
+ + L+ + EK L E L L EK ++
Sbjct: 522 ELEEALKEELEEKLEKLENLLEELEELKEKLQLQ 555
Score = 42.8 bits (101), Expect = 3e-04
Identities = 28/146 (19%), Positives = 61/146 (41%), Gaps = 12/146 (8%)
Query: 269 TDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
E +++ E+++R + + +E+IE L++ E E+ + L E +
Sbjct: 260 EIESLELEALKIREEELRELERLLE-ELEEKIERLEELEREIEE----LEEELEGLRALL 314
Query: 329 AELERSISFLKEKETELDEI---LAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
ELE + LK E L+++ L L + E ++ E A + ++ L E
Sbjct: 315 EELEELLEKLKSLEERLEKLEEKLEKLESELE-ELAEEKNELAKLLEERLKELEER---L 370
Query: 386 DTIYYMGEALRRGVIDLEVYLKQVRS 411
+ + E + LE +++++
Sbjct: 371 EELEKELEKALERLKQLEEAIQELKE 396
Score = 39.7 bits (93), Expect = 0.003
Identities = 30/141 (21%), Positives = 63/141 (44%), Gaps = 6/141 (4%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
L++ +E+ K Q Q +EE+ L+ EL + +L + R ++ + EL +
Sbjct: 537 KLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELE-ELRERL 595
Query: 336 SFLKEKETELDEILAVL-TEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEA 394
LK+K EL+E L+ L +++ EA ++ E + + E
Sbjct: 596 KELKKKLKELEERLSQLEELLQSLELSEAENEL----EEAEEELESELEKLNLQAELEEL 651
Query: 395 LRRGVIDLEVYLKQVRSLSRK 415
L+ + +LE ++++ + R+
Sbjct: 652 LQAALEELEEKVEELEAEIRR 672
Score = 37.4 bits (87), Expect = 0.016
Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 10/91 (10%)
Query: 277 LKSAVEDKIRRKMNEQKVQA--QEEIEI-LKQTENELNQGKFKLARMFERIDKEKAE--- 330
L+S +E+ K K+ +E+E L++ E EL + +L ++ E I + K E
Sbjct: 341 LESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAE 400
Query: 331 ----LERSISFLKEKETELDEILAVLTEKTE 357
LE L+E E EL+E+ L E E
Sbjct: 401 LSAALEEIQEELEELEKELEELERELEELEE 431
Score = 37.0 bits (86), Expect = 0.018
Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 11/90 (12%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK-- 328
E L+ +E + E+ Q +E I+ LK+ EL + L + E +++ +
Sbjct: 367 EERLEELEKELEKAL-----ERLKQLEEAIQELKE---ELAELSAALEEIQEELEELEKE 418
Query: 329 -AELERSISFLKEKETELDEILAVLTEKTE 357
ELER + L+E+ +L+E + L K
Sbjct: 419 LEELERELEELEEEIKKLEEQINQLESKEL 448
Score = 35.9 bits (83), Expect = 0.048
Identities = 26/112 (23%), Positives = 52/112 (46%), Gaps = 22/112 (19%)
Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQE-------EIEILKQTENELNQGKFKLAR 319
+ E I+ L+ A+E++++ +++Q ++ EIE L++ EL + K +L
Sbjct: 195 SELLEDIEDLLE-ALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEE 253
Query: 320 MFERIDKEK--------------AELERSISFLKEKETELDEILAVLTEKTE 357
+ R+ + + ELER + L+EK L+E+ + E E
Sbjct: 254 LKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305
Score = 32.8 bits (75), Expect = 0.44
Identities = 16/80 (20%), Positives = 37/80 (46%), Gaps = 2/80 (2%)
Query: 277 LKSAVEDKIRRKMNEQKVQA--QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
L+ E K ++++ E ++I ++ EL + +L ER+++ + E+E
Sbjct: 244 LEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEEL 303
Query: 335 ISFLKEKETELDEILAVLTE 354
L+ L+E+ +L +
Sbjct: 304 EEELEGLRALLEELEELLEK 323
Score = 32.4 bits (74), Expect = 0.51
Identities = 28/143 (19%), Positives = 56/143 (39%), Gaps = 9/143 (6%)
Query: 275 MSLKSAVEDKIRRKM-NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
S +E+ ++ +E + + +E E L E+EL + + + E + ELE
Sbjct: 606 EERLSQLEELLQSLELSEAENELEEAEEEL---ESELEKLNLQAE-LEELLQAALEELEE 661
Query: 334 SISFLKEKETELD-EILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMG 392
+ +E E E+ E+ + E+ + E + +Q+ E I +
Sbjct: 662 KV---EELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLI 718
Query: 393 EALRRGVIDLEVYLKQVRSLSRK 415
E L +LE K++ L +
Sbjct: 719 EELESRKAELEELKKELEKLEKA 741
Score = 32.0 bits (73), Expect = 0.63
Identities = 41/166 (24%), Positives = 70/166 (42%), Gaps = 29/166 (17%)
Query: 271 EHIKMSLKSAVEDKIRRKM--NEQKVQAQEEIEILKQTENELNQGKFKLA---------- 318
E ++ ++ +E +IRR++ E + Q +E++E L+Q E EL Q + +L
Sbjct: 657 EELEEKVE-ELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIE 715
Query: 319 RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTF 378
++ E ++ KAELE L++ E L+ L L EK A I + +L
Sbjct: 716 QLIEELESRKAELEELKKELEKLEKALEL-LEELREKLGKAGLRAD-----ILRNLLAQI 769
Query: 379 TEEA----ATEDTIYYMGEAL-----RRGVIDLEVYLK-QVRSLSR 414
EA + Y L G + + VY +VR +
Sbjct: 770 EAEANEILSKLSLNRYDLRRLTIRKDGNGGLVVVVYDGGEVRPIKT 815
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 45.8 bits (109), Expect = 4e-05
Identities = 29/143 (20%), Positives = 60/143 (41%), Gaps = 10/143 (6%)
Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
DK + ++ +E++E LK+ NEL + +L +R+ +E A+L +I+ ++ K
Sbjct: 376 VDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAK 435
Query: 342 ETELDEILAVLTEKTEVDVDEAVTTTA---PIYKQILNTFTEEAATEDTIYYMGEALRRG 398
EL+E + + + A +++ + E E + L+R
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKEL----SKLQRE 491
Query: 399 VIDLEVYLKQVRSLSRKQFMFRA 421
+ + E Q R+ + RA
Sbjct: 492 LAEAE---AQARASEERVRGGRA 511
Score = 40.8 bits (96), Expect = 0.001
Identities = 26/128 (20%), Positives = 54/128 (42%), Gaps = 10/128 (7%)
Query: 284 KIRRKMNEQKVQA--------QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
K + K E++++ +EE+E L+ +L L + + ++ + ELER I
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKI 905
Query: 336 SFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395
L+ + + + L+ L K E +E ++ + EE + ED + + +
Sbjct: 906 EELEAQIEKKRKRLSELKAKLEALEEE-LSEIEDPKGEDEEIPEEELSLED-VQAELQRV 963
Query: 396 RRGVIDLE 403
+ LE
Sbjct: 964 EEEIRALE 971
Score = 37.7 bits (88), Expect = 0.012
Identities = 19/78 (24%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 276 SLKSAVEDKIRR--KMNEQKVQAQEEIEILK----QTENELNQGKFKLARMFERIDKEKA 329
SL+ ++ +K R E+ + + EI+ L + E E+ + + + ++ E + K
Sbjct: 305 SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKE 364
Query: 330 ELERSISFLKEKETELDE 347
ELE + L+E + E E
Sbjct: 365 ELEDLRAELEEVDKEFAE 382
Score = 37.7 bits (88), Expect = 0.014
Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 8/80 (10%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK- 341
+I +++E + + + + E E+ Q + + ++ ER+ ELE +S L+++
Sbjct: 698 RRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERL----EELEEDLSSLEQEI 753
Query: 342 ---ETELDEILAVLTEKTEV 358
++EL E+ A + E E
Sbjct: 754 ENVKSELKELEARIEELEED 773
Score = 37.4 bits (87), Expect = 0.019
Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 9/84 (10%)
Query: 276 SLKSAVED---KIRRKMNEQKVQAQEE---IEILKQTENELNQGKFKLA---RMFERIDK 326
L + D KI E + QEE E L++ E +L+ + ++ + ++
Sbjct: 706 ELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEA 765
Query: 327 EKAELERSISFLKEKETELDEILA 350
ELE + L+E +L+ L+
Sbjct: 766 RIEELEEDLHKLEEALNDLEARLS 789
Score = 35.8 bits (83), Expect = 0.051
Identities = 19/90 (21%), Positives = 46/90 (51%), Gaps = 4/90 (4%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
L++ + + R + E++ + ++ E L + E E+++ ++ + I++E+ ++
Sbjct: 298 ELEAEIA-SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356
Query: 336 SFLKEKETELDEILAVLTEKTEVDVDEAVT 365
E + EL+++ A E EVD + A T
Sbjct: 357 EEYAELKEELEDLRA---ELEEVDKEFAET 383
Score = 35.0 bits (81), Expect = 0.086
Identities = 31/159 (19%), Positives = 64/159 (40%), Gaps = 24/159 (15%)
Query: 279 SAVEDKIRRKMN------EQKVQAQEEIEILKQTE-NELNQGKFKLARMFERIDKEKAEL 331
+ D+ R+++ E+ + Q ++ ++ E EL + K L R E I+++ A L
Sbjct: 190 DLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASL 249
Query: 332 ERSISFLK-----------EKETELDEILAVLTEKTEVDVDEAVTTTAPIYK---QILNT 377
E + L E E L+E+ + + E +E + I + +I +
Sbjct: 250 EEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGE---EEQLRVKEKIGELEAEIASL 306
Query: 378 FTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQ 416
A E + E L + +++ L ++ L R+
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREI 345
Score = 34.3 bits (79), Expect = 0.14
Identities = 23/127 (18%), Positives = 57/127 (44%), Gaps = 6/127 (4%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
+++L+ +K +++ EQ++ +E+I+ +++ LN K +L E ++ +LE
Sbjct: 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLES 882
Query: 334 SISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAAT-EDTIYYMG 392
+ LK++ EL+ L L K ++E ++ L+ + E+ + +
Sbjct: 883 RLGDLKKERDELEAQLRELERK----IEEL-EAQIEKKRKRLSELKAKLEALEEELSEIE 937
Query: 393 EALRRGV 399
+
Sbjct: 938 DPKGEDE 944
Score = 33.9 bits (78), Expect = 0.17
Identities = 26/99 (26%), Positives = 45/99 (45%), Gaps = 14/99 (14%)
Query: 279 SAVEDKIRRKMNEQKVQAQEEIE-----------ILKQTENELNQGKFKLARMFERIDKE 327
+ KI+ E++++ +E+I + + E EL + +LA++ IDK
Sbjct: 275 EELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKL 334
Query: 328 KAE---LERSISFLKEKETELDEILAVLTEKTEVDVDEA 363
AE LER I +++ +L E A L E+ E E
Sbjct: 335 LAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEL 373
Score = 32.7 bits (75), Expect = 0.46
Identities = 17/82 (20%), Positives = 37/82 (45%), Gaps = 7/82 (8%)
Query: 283 DKIRRKMNEQKV---QAQEEIEILKQTEN----ELNQGKFKLARMFERIDKEKAELERSI 335
+++ R++ E++ + EE LK+ EL + + A + + + +LE+
Sbjct: 339 EELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLK 398
Query: 336 SFLKEKETELDEILAVLTEKTE 357
+ E + ELD + L +E
Sbjct: 399 REINELKRELDRLQEELQRLSE 420
Score = 31.6 bits (72), Expect = 0.95
Identities = 21/80 (26%), Positives = 36/80 (45%), Gaps = 8/80 (10%)
Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL-- 345
+ + Q E+ ++ +EL+Q +R I+KE +LE+ LKE+ EL
Sbjct: 685 GLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEE 744
Query: 346 ------DEILAVLTEKTEVD 359
EI V +E E++
Sbjct: 745 DLSSLEQEIENVKSELKELE 764
Score = 31.6 bits (72), Expect = 1.2
Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 12/88 (13%)
Query: 268 ITDEHIKMSLK-SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
+ +E ++ + +E K+ R E K ++EI+ L++ +L E+I
Sbjct: 803 LEEEVSRIEARLREIEQKLNRLTLE-KEYLEKEIQELQEQRIDLK----------EQIKS 851
Query: 327 EKAELERSISFLKEKETELDEILAVLTE 354
+ E+E +E E EL+E+ A L +
Sbjct: 852 IEKEIENLNGKKEELEEELEELEAALRD 879
Score = 30.0 bits (68), Expect = 3.0
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 312 QGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 357
+L R+ ER++ K EL S L+ E LDE+ L++ +
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR 716
Score = 29.3 bits (66), Expect = 5.5
Identities = 15/74 (20%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
+++R + E+ + E+ L+ + +L++ ++ E+E+ I L+++
Sbjct: 673 PAELQR-LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731
Query: 342 ETELDEILAVLTEK 355
E +L E L L E
Sbjct: 732 EEKLKERLEELEED 745
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 42.6 bits (101), Expect = 4e-05
Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 1/73 (1%)
Query: 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344
+RR E ++E+ L+ +E Q KL E + + K E+E L++ E
Sbjct: 26 LRRLEGE-LASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQR 84
Query: 345 LDEILAVLTEKTE 357
+ L +L EK+E
Sbjct: 85 YETTLELLGEKSE 97
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 43.9 bits (104), Expect = 9e-05
Identities = 29/133 (21%), Positives = 45/133 (33%), Gaps = 15/133 (11%)
Query: 160 QEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIY 219
++ T + N S P + + P++P +P+ ++ P
Sbjct: 159 EDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPP---A 215
Query: 220 PSYPTPGATPGAGGYSPYPLYPPGGSTPATPY---------PVPTPSPA---PAVTSGGT 267
PS P + P PPG + P P PTP A PA G T
Sbjct: 216 PSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275
Query: 268 ITDEHIKMSLKSA 280
+ D+ I + K A
Sbjct: 276 LDDDAIAKAQKHA 288
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 43.5 bits (103), Expect = 2e-04
Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 18/132 (13%)
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345
++ E+ + Q+E+ L + L Q K L ++++ ELE + L+ K EL
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335
Query: 346 DEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVY 405
E LA L EK E + + E A E L + +LE
Sbjct: 336 AEELAELEEKLEE--------LKEELESLEAELEELEAEL-------EELESRLEELE-- 378
Query: 406 LKQVRSLSRKQF 417
+Q+ +L K
Sbjct: 379 -EQLETLRSKVA 389
Score = 40.4 bits (95), Expect = 0.002
Identities = 22/149 (14%), Positives = 54/149 (36%), Gaps = 6/149 (4%)
Query: 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
+ + + ++ + + +E++E L+ +L L ER++ LE
Sbjct: 356 LEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDR 415
Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEA 394
L++ EI +L + E ++ E + +++ E E+ + + E
Sbjct: 416 RERLQQ------EIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
Query: 395 LRRGVIDLEVYLKQVRSLSRKQFMFRALL 423
L L+ +++ L + L
Sbjct: 470 LEEAEQALDAAERELAQLQARLDSLERLQ 498
Score = 38.1 bits (89), Expect = 0.009
Identities = 17/85 (20%), Positives = 36/85 (42%), Gaps = 6/85 (7%)
Query: 277 LKSAVEDKIRRKMN--EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
L+ +E E+ +A++ ++ ++ +L L R+ E ++
Sbjct: 452 LQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG----FSEG 507
Query: 335 ISFLKEKETELDEILAVLTEKTEVD 359
+ L + ++ L IL VL+E VD
Sbjct: 508 VKALLKNQSGLSGILGVLSELISVD 532
Score = 37.3 bits (87), Expect = 0.017
Identities = 24/131 (18%), Positives = 54/131 (41%), Gaps = 20/131 (15%)
Query: 276 SLKSAVEDKIRRKM---------NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
+L + + ++K Q + + ++E L+ +EL +LA + ++
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL---AEELAEL----EE 344
Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATED 386
+ EL+ + L+ + EL+ L L E +++E + T Q+ + A+ +
Sbjct: 345 KLEELKEELESLEAELEELEAELEEL-ESRLEELEEQLETLRSKVAQLEL---QIASLNN 400
Query: 387 TIYYMGEALRR 397
I + L R
Sbjct: 401 EIERLEARLER 411
Score = 36.2 bits (84), Expect = 0.038
Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 3/85 (3%)
Query: 276 SLKSAVEDKIRR--KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
SL+ + RR + EQ + E+IE L EL + +L E + E+A LE
Sbjct: 828 SLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEE 887
Query: 334 SISFLKEKETELD-EILAVLTEKTE 357
+++ L+ + EL E+ + ++++E
Sbjct: 888 ALALLRSELEELSEELRELESKRSE 912
Score = 34.3 bits (79), Expect = 0.16
Identities = 15/64 (23%), Positives = 31/64 (48%), Gaps = 7/64 (10%)
Query: 292 QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351
+ + +EE+E L++ E + E + E ELE + L+ + +EL+E +
Sbjct: 233 RLEELREELEELQEELKEAEE-------ELEELTAELQELEEKLEELRLEVSELEEEIEE 285
Query: 352 LTEK 355
L ++
Sbjct: 286 LQKE 289
Score = 31.2 bits (71), Expect = 1.4
Identities = 22/132 (16%), Positives = 42/132 (31%), Gaps = 26/132 (19%)
Query: 280 AVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK 339
+++ R+++ + +++ L+ +L ++ KE ELE
Sbjct: 719 KELEELSRQISALR----KDLARLEAEVE-------QLEERIAQLSKELTELE------- 760
Query: 340 EKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGV 399
+ EL+E L E+ E A I EE + LR +
Sbjct: 761 AEIEELEERLEEAEEELAEAEAEIEELEAQI-----EQLKEELKALREAL---DELRAEL 812
Query: 400 IDLEVYLKQVRS 411
L +R
Sbjct: 813 TLLNEEAANLRE 824
Score = 30.4 bits (69), Expect = 2.1
Identities = 18/106 (16%), Positives = 36/106 (33%), Gaps = 24/106 (22%)
Query: 276 SLKSAVEDKI---RRKMNEQKVQAQEEIEILKQTENELNQGKFKLA-------------- 318
+ E +I ++ + K + + E L + EL + A
Sbjct: 775 EELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIA 834
Query: 319 ---RMFERIDKEKAELERSISFL----KEKETELDEILAVLTEKTE 357
R E ++++ EL I L +E E ++E+ + L
Sbjct: 835 ATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLN 880
Score = 30.4 bits (69), Expect = 2.4
Identities = 31/151 (20%), Positives = 63/151 (41%), Gaps = 24/151 (15%)
Query: 281 VED---KIRRKMN---EQKVQAQEEIEI---LKQTE-----NELNQGKFKLARMFERIDK 326
+ED ++ R++ Q +A+ E+ L++ E L + + +L + E + +
Sbjct: 191 LEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKE 250
Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAAT-E 385
+ ELE + L+E E +L+E+ ++E ++E + ++ L E + E
Sbjct: 251 AEEELEELTAELQELEEKLEELRLEVSE-----LEEEIE----ELQKELYALANEISRLE 301
Query: 386 DTIYYMGEALRRGVIDLEVYLKQVRSLSRKQ 416
+ E L LE Q+ L K
Sbjct: 302 QQKQILRERLANLERQLEELEAQLEELESKL 332
Score = 30.4 bits (69), Expect = 2.6
Identities = 22/158 (13%), Positives = 56/158 (35%), Gaps = 20/158 (12%)
Query: 263 TSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFE 322
+ E + + A ++ + + + EE+ L+ +EL + +L
Sbjct: 866 ELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLA 925
Query: 323 RIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382
+++ L E +D + L+E+ + ++EA I +E
Sbjct: 926 QLELRLEGL----------EVRIDNLQERLSEEYSLTLEEAEALENKIE-------DDEE 968
Query: 383 ATEDTIYYMGEALRR-GVIDLEVYLKQVRSLS-RKQFM 418
+ + ++ G ++L +++ L R F+
Sbjct: 969 EARRRLKRLENKIKELGPVNLAA-IEEYEELKERYDFL 1005
>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein. This family
consists of the p50 and variable adherence-associated
antigen (Vaa) adhesins from Mycoplasma hominis. M.
hominis is a mycoplasma associated with human urogenital
diseases, pneumonia, and septic arthritis. An adhesin is
a cell surface molecule that mediates adhesion to other
cells or to the surrounding surface or substrate. The
Vaa antigen is a 50-kDa surface lipoprotein that has
four tandem repetitive DNA sequences encoding a periodic
peptide structure, and is highly immunogenic in the
human host. p50 is also a 50-kDa lipoprotein, having
three repeats A,B and C, that may be a tetramer of
191-kDa in its native environment.
Length = 353
Score = 42.8 bits (100), Expect = 2e-04
Identities = 33/136 (24%), Positives = 63/136 (46%), Gaps = 16/136 (11%)
Query: 260 PAVTSGGTITDEHIKMSLKSAVEDK---IRRKMNEQKVQAQEEIEILKQTENE------- 309
PA+ S + E+ K + K A++DK I +++ E+ + Q IE LK+ NE
Sbjct: 89 PAIISKLSAAVENAK-NEKKAIDDKNAQIAKELAERNAKIQSNIEELKKINNEAFELSKT 147
Query: 310 LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAP 369
+N+ ++ + F+ K +LE L +K ++DE V + + + E +
Sbjct: 148 VNKTIAEVEKKFKIPKDFKEQLENFADDLLDKSRQIDEFTTVTSTQEGFTLAELES---- 203
Query: 370 IYKQILNTFTEEAATE 385
+K+I T+ +E
Sbjct: 204 -FKEITTTWFNGMKSE 218
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 40.7 bits (96), Expect = 2e-04
Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 23/114 (20%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQG-----KF------KLARMFERIDKEKA-- 329
D R + ++ ++ E L++ E EL + KF K R ++ ++EK
Sbjct: 17 DAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLR 76
Query: 330 -ELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382
E E I LK + EL + EK E ++E Y++ L + E +
Sbjct: 77 KEKEEEIKELKAELEELKAEI----EKLEEKLEEYQ-----PYEEFLESVVERS 121
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 39.1 bits (91), Expect = 8e-04
Identities = 27/99 (27%), Positives = 32/99 (32%), Gaps = 2/99 (2%)
Query: 147 FSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGY 206
+ PP YS + P AS Y+PM P ++PP AY
Sbjct: 1 YPDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPI 60
Query: 207 NPNAAAAPSGSIYPSYPT--PGATPGAGGYSPYPLYPPG 243
P GS GA G G S P PPG
Sbjct: 61 GAPPPVYPPGSTVLVQGGFDAGARFGPGTGSSIPPPPPG 99
Score = 29.0 bits (65), Expect = 2.3
Identities = 16/65 (24%), Positives = 23/65 (35%), Gaps = 1/65 (1%)
Query: 195 YPTYPPSAYPGYNPNAAAAPSGSIYPSY-PTPGATPGAGGYSPYPLYPPGGSTPATPYPV 253
YP PP+ Y P+ P + + P PG + + P P YP+
Sbjct: 1 YPDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPI 60
Query: 254 PTPSP 258
P P
Sbjct: 61 GAPPP 65
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 41.2 bits (96), Expect = 8e-04
Identities = 34/126 (26%), Positives = 56/126 (44%), Gaps = 22/126 (17%)
Query: 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 357
EE+ +Q E N+ + K+ M + K+ +ELE F KE EL+E+ +L E +
Sbjct: 362 EELLRTEQQRLEKNEDQLKIITM--ELQKKSSELEEMTKFKNNKEVELEELKKILAEDEK 419
Query: 358 VDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEV-----------YL 406
+ +DE KQ E E + ++ +A + + DLE+ YL
Sbjct: 420 L-LDEK--------KQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYL 470
Query: 407 KQVRSL 412
K+V L
Sbjct: 471 KEVEDL 476
Score = 30.5 bits (68), Expect = 2.0
Identities = 28/98 (28%), Positives = 49/98 (50%), Gaps = 10/98 (10%)
Query: 268 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 327
IT++ KM + + ++ R K N Q +E+ ++ + EL + K L + E I
Sbjct: 249 ITEKENKMKDLTFLLEESRDKAN----QLEEKTKLQDENLKELIEKKDHLTKELEDI--- 301
Query: 328 KAELERSISFLKEKETELD---EILAVLTEKTEVDVDE 362
K L+RS+S K E +L + + LTE+ E ++E
Sbjct: 302 KMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEE 339
Score = 28.5 bits (63), Expect = 8.1
Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 5/65 (7%)
Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350
E+ + A EE+ + Q EN + FKL E+I + E ++ I+ +KE ++ +L
Sbjct: 193 EKMILAFEELRV--QAENARLEMHFKLKEDHEKIQHLEEEYKKEIN---DKEKQVSLLLI 247
Query: 351 VLTEK 355
+TEK
Sbjct: 248 QITEK 252
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 41.5 bits (98), Expect = 9e-04
Identities = 34/158 (21%), Positives = 49/158 (31%), Gaps = 15/158 (9%)
Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
A A +A ++ A+ AP P P P P A
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAA-PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAG 451
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTP-SPAPAV 262
+P AAAPS P P +P P P + PA P P P +PA
Sbjct: 452 GAPSPPPAAAPSAQ-----PAPAPAA-----APEPTAAPAPAPPAAPAPAAAPAAPAAPA 501
Query: 263 TSGGTITDEHIKMS---LKSAVEDKIRRKMNEQKVQAQ 297
G ++ + +AV + R+ +A
Sbjct: 502 APAGADDAATLRERWPEILAAVPKRSRKTWAILLPEAT 539
Score = 40.7 bits (96), Expect = 0.001
Identities = 19/97 (19%), Positives = 27/97 (27%), Gaps = 3/97 (3%)
Query: 170 GNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATP 229
+A A P + + P P+A A A + P P P
Sbjct: 385 LGVAGGAGAPAAAAPSAAAA---APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAP 441
Query: 230 GAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
+ + P A P P P+PA A
Sbjct: 442 PSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTA 478
Score = 35.7 bits (83), Expect = 0.057
Identities = 26/116 (22%), Positives = 37/116 (31%), Gaps = 10/116 (8%)
Query: 147 FSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQ-SYMPMPGSNVNTPYPTYPPSAYPG 205
S+ PP ++R A A A P + S P PG +P +
Sbjct: 604 ASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDAS 663
Query: 206 YNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
+ A +G P+ P P P A P + A P P P+ P
Sbjct: 664 DGGDGWPAKAGGAAPAAPPPAPAPAA---------PAAPAGAAPAQPAPAPAATPP 710
Score = 34.6 bits (80), Expect = 0.11
Identities = 16/75 (21%), Positives = 21/75 (28%), Gaps = 3/75 (4%)
Query: 185 PMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
P P P P A AAP+ P+ P P A + P
Sbjct: 592 PGAAGGEGPPAPASSGPPEEAARPAAPAAPAA---PAAPAPAGAAAAPAEASAAPAPGVA 648
Query: 245 STPATPYPVPTPSPA 259
+ P V P +
Sbjct: 649 APEHHPKHVAVPDAS 663
Score = 31.9 bits (73), Expect = 0.75
Identities = 21/101 (20%), Positives = 26/101 (25%), Gaps = 6/101 (5%)
Query: 173 ASNAPYPTQSYMPMPGSNVNTP----YPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGAT 228
A+ P P P P P + P A A S + P P
Sbjct: 594 AAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHH 653
Query: 229 PGAGGYSPYPLYPPGGSTPATPYPV--PTPSPAPAVTSGGT 267
P G A P P+PAPA +
Sbjct: 654 PKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPA 694
Score = 30.7 bits (70), Expect = 1.6
Identities = 23/89 (25%), Positives = 28/89 (31%), Gaps = 3/89 (3%)
Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG 232
A P P + P P + P PP A G + + AA P P P
Sbjct: 697 APAQPAPAPAATP-PAGQADDPAAQ-PPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGA 754
Query: 233 GYSPYP-LYPPGGSTPATPYPVPTPSPAP 260
P P P + PA P PS
Sbjct: 755 PAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
Score = 30.3 bits (69), Expect = 2.3
Identities = 20/98 (20%), Positives = 26/98 (26%), Gaps = 9/98 (9%)
Query: 176 APYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYS 235
P + P P P A + P A +
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPA 734
Query: 236 --------PYPLYPPGGS-TPATPYPVPTPSPAPAVTS 264
P P PP + PA P P P P+PA A +
Sbjct: 735 ADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAA 772
Score = 29.6 bits (67), Expect = 4.3
Identities = 14/70 (20%), Positives = 14/70 (20%), Gaps = 2/70 (2%)
Query: 200 PSAYPGYNPNAAAA--PSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPS 257
A G P AA P P A P A P P A P
Sbjct: 584 VEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAP 643
Query: 258 PAPAVTSGGT 267
Sbjct: 644 APGVAAPEHH 653
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 41.5 bits (97), Expect = 0.001
Identities = 32/131 (24%), Positives = 44/131 (33%), Gaps = 6/131 (4%)
Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPM---PGSNVNTPYPTYPP 200
A+A A P L ++ P G A A PT + P P + P
Sbjct: 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785
Query: 201 SAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG---GYSPYPLYPPGGSTPATPYPVPTPS 257
A + + + PS P P A SP PP S T P P
Sbjct: 2786 PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
Query: 258 PAPAVTSGGTI 268
P P++ GG++
Sbjct: 2846 PPPSLPLGGSV 2856
Score = 38.0 bits (88), Expect = 0.012
Identities = 27/104 (25%), Positives = 35/104 (33%), Gaps = 7/104 (6%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
A PT G++ S A P P P + PP A+P+ P+
Sbjct: 2686 RAARPTV-GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA----RQASPALPAAPA 2740
Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTPATPYP--VPTPSPAPAVT 263
P A P G P PP + P P P P P +T
Sbjct: 2741 PPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT 2784
Score = 37.2 bits (86), Expect = 0.022
Identities = 34/131 (25%), Positives = 44/131 (33%), Gaps = 9/131 (6%)
Query: 142 TLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPS 201
T A + PP + T P S P+ P + V P PP+
Sbjct: 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
Query: 202 AYP--GYNPNAAAAPSGSIYPSYPTPGATPGAGGYSP---YPLYPPGGS---TPATPYPV 253
A P P +A P+ P P P + P G +P PP S PA P
Sbjct: 2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARP 2881
Query: 254 PTPS-PAPAVT 263
P PAV+
Sbjct: 2882 PVRRLARPAVS 2892
Score = 36.5 bits (84), Expect = 0.030
Identities = 42/189 (22%), Positives = 53/189 (28%), Gaps = 4/189 (2%)
Query: 78 ALPPLYSSRTKQEPATNPTYNGNIRPTADMRINVSQNVDHTGKIYSPYLHTWTAQNSKLE 137
A P S T P P A + + SP L A +
Sbjct: 2687 AARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746
Query: 138 GLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPT 197
G A A PP + P P P S S + P
Sbjct: 2747 GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP-AVASLSESRESLPSPWDPA 2805
Query: 198 YPPSAYPGYNP--NAAAAPSGSI-YPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVP 254
PP+A AA+P+G + P+ P A P G P L G P
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRR 2865
Query: 255 TPSPAPAVT 263
PS +PA
Sbjct: 2866 PPSRSPAAK 2874
Score = 36.5 bits (84), Expect = 0.031
Identities = 26/112 (23%), Positives = 35/112 (31%), Gaps = 6/112 (5%)
Query: 161 EPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYP 220
EPA + + P + P + P P A PG P A P + P
Sbjct: 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP-ATPG-GPARPARPPTTAGP 2766
Query: 221 SYPTPGATPGAG---GYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTIT 269
P P A P AG + + S + P P P+ PA
Sbjct: 2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAA 2817
Score = 35.3 bits (81), Expect = 0.073
Identities = 37/132 (28%), Positives = 44/132 (33%), Gaps = 20/132 (15%)
Query: 128 TWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMP 187
TWT +S LE L A P R+ + AT P G + P P+P
Sbjct: 358 TWTPPSS-LEDLSAGRHHPKRASLPTRKRRSARHAAT-PFARGPGGDDQTRPAA---PVP 412
Query: 188 GSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP 247
S P P P A+A P P+ P P A PG P P P
Sbjct: 413 AS---VPTPAPTPVP-------ASAPPP----PATPLPSAEPG-SDDGPAPPPERQPPAP 457
Query: 248 ATPYPVPTPSPA 259
AT P A
Sbjct: 458 ATEPAPDDPDDA 469
Score = 34.5 bits (79), Expect = 0.13
Identities = 28/121 (23%), Positives = 34/121 (28%), Gaps = 18/121 (14%)
Query: 149 ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
ALPP R Q A P P Q P P P PP P P
Sbjct: 2899 ALPPDQPERPPQPQA-------------PPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP 2945
Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSP-----APAVT 263
A +G + P P G P + P+ P + P V+
Sbjct: 2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
Query: 264 S 264
S
Sbjct: 3006 S 3006
Score = 34.1 bits (78), Expect = 0.15
Identities = 23/91 (25%), Positives = 27/91 (29%), Gaps = 6/91 (6%)
Query: 177 PYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSP 236
P P + P +V P P PS P + P P P
Sbjct: 2554 PLPPAAPPAAPDRSVPPPRPAPRPS-----EPAVTSRARRPDAPPQSARPRAPVDDRGDP 2608
Query: 237 YPLYPPGGSTPATPYP-VPTPSPAPAVTSGG 266
PP P T P P PSP+PA
Sbjct: 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPD 2639
Score = 34.1 bits (78), Expect = 0.16
Identities = 18/70 (25%), Positives = 25/70 (35%), Gaps = 3/70 (4%)
Query: 193 TPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPY- 251
+ P + PG + P+ + S PTP TP P P P + P +
Sbjct: 388 ARHAATPFARGPG--GDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDG 445
Query: 252 PVPTPSPAPA 261
P P P P
Sbjct: 446 PAPPPERQPP 455
Score = 33.8 bits (77), Expect = 0.24
Identities = 31/126 (24%), Positives = 40/126 (31%), Gaps = 15/126 (11%)
Query: 148 SALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPM-----PGSNVNTPYPTYPPSA 202
+ALPP S P T+ P P +P+ PG +V P+ P+A
Sbjct: 2816 AALPPAASPAGPLPPPTSAQPTA--PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA 2873
Query: 203 YPGYNPN--------AAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVP 254
P A + S + P P P P P P P P P
Sbjct: 2874 KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPP 2933
Query: 255 TPSPAP 260
P P P
Sbjct: 2934 PPPPRP 2939
Score = 33.4 bits (76), Expect = 0.31
Identities = 34/133 (25%), Positives = 39/133 (29%), Gaps = 22/133 (16%)
Query: 151 PPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTP-------YPTYPPSAY 203
PP S PA N + P P PG V+ P S
Sbjct: 2626 PPPPSPS----PAANEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPP 2680
Query: 204 PGYNPNAAAAPSGSIY----PSYPTPGATPGAGGYSPYPLYPPG-----GSTPATPY-PV 253
AA GS+ P P P P PPG ++PA P P
Sbjct: 2681 QRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPA 2740
Query: 254 PTPSPAPAVTSGG 266
P PA T GG
Sbjct: 2741 PPAVPAGPATPGG 2753
Score = 33.0 bits (75), Expect = 0.38
Identities = 20/95 (21%), Positives = 28/95 (29%), Gaps = 6/95 (6%)
Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG 232
A + P P P + + P P P P+ A S + P P A P A
Sbjct: 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPAP------RPSEPAVTSRARRPDAPPQSARPRAP 2601
Query: 233 GYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGT 267
P +P P P P+ +
Sbjct: 2602 VDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
Score = 32.6 bits (74), Expect = 0.55
Identities = 22/68 (32%), Positives = 24/68 (35%), Gaps = 4/68 (5%)
Query: 199 PPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPY-PLYPPGGSTPATPYPVPTPS 257
PP P P A S P P P A A P P P + PATP P
Sbjct: 2701 PPP--PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP-GGPARP 2757
Query: 258 PAPAVTSG 265
P T+G
Sbjct: 2758 ARPPTTAG 2765
Score = 31.8 bits (72), Expect = 0.96
Identities = 30/129 (23%), Positives = 34/129 (26%), Gaps = 17/129 (13%)
Query: 147 FSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGY 206
PP S K P T+S+ P P P PP P
Sbjct: 2862 VRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQP 2921
Query: 207 NPNAAAAPSGSIYPSYPTPG----------ATPGAGGYSPYPLYPPGGSTPATPYPVP-- 254
P P P P P GAG S P G+ VP
Sbjct: 2922 QPPPPPQPQ----PPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRF 2977
Query: 255 -TPSPAPAV 262
P PAP+
Sbjct: 2978 RVPQPAPSR 2986
Score = 31.4 bits (71), Expect = 1.2
Identities = 21/118 (17%), Positives = 27/118 (22%), Gaps = 4/118 (3%)
Query: 151 PPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPY----PTYPPSAYPGY 206
PP + P G A+ P P P + P P P
Sbjct: 2705 PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA 2764
Query: 207 NPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTS 264
P A A P+ A P P V P+ A +
Sbjct: 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAA 2822
Score = 30.3 bits (68), Expect = 3.1
Identities = 23/97 (23%), Positives = 25/97 (25%), Gaps = 8/97 (8%)
Query: 173 ASNAPYPTQSYMPMPGSNVNTPY---PTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGAT- 228
A + P P P T P PP + P P P P T
Sbjct: 2563 APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTH 2622
Query: 229 ---PGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
P SP P P P P P PA
Sbjct: 2623 APDPPPPSPSPAANEPDPHPPPTVP-PPERPRDDPAP 2658
Score = 29.5 bits (66), Expect = 4.1
Identities = 19/92 (20%), Positives = 26/92 (28%), Gaps = 3/92 (3%)
Query: 174 SNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGG 233
S A P P P + P + ++ P+ P P P P P
Sbjct: 2870 SPAAKPAAPARP-PVRRLARPAVSRSTESFA-LPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
Query: 234 YSPYPLYPPGGSTPATPYPVPTPSPAPAVTSG 265
P P PP P P+ A +
Sbjct: 2928 -QPQPPPPPPPRPQPPLAPTTDPAGAGEPSGA 2958
Score = 29.5 bits (66), Expect = 4.9
Identities = 22/97 (22%), Positives = 28/97 (28%), Gaps = 10/97 (10%)
Query: 174 SNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGG 233
S +P + P + P P+ P A + S P A
Sbjct: 2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA-- 2687
Query: 234 YSPYPLYPPGGSTPAT---PYPVPTPSPAPAVTSGGT 267
P GS + P P PTP PAP T
Sbjct: 2688 -----ARPTVGSLTSLADPPPPPPTPEPAPHALVSAT 2719
Score = 29.1 bits (65), Expect = 6.9
Identities = 13/61 (21%), Positives = 15/61 (24%)
Query: 211 AAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITD 270
AA P P P P ++ P P PS P G
Sbjct: 391 AATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPP 450
Query: 271 E 271
E
Sbjct: 451 E 451
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 40.5 bits (95), Expect = 0.002
Identities = 29/146 (19%), Positives = 59/146 (40%), Gaps = 23/146 (15%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
E + ++ + R+ N+Q+ +A+ E E L++ E L Q + +L E++D + +
Sbjct: 47 EEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQ 106
Query: 331 LERSISFLKEKETELDE-------------------ILAVLTEKTEVDVDEAVTTTAPIY 371
LE L +E EL+E +L + + +++E
Sbjct: 107 LEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQ---RV 163
Query: 372 KQILNTFTEEAATEDTIYYMGEALRR 397
K+I E A + +A++R
Sbjct: 164 KKIEEEADLE-AERKAQNILAQAMQR 188
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 40.4 bits (94), Expect = 0.002
Identities = 37/160 (23%), Positives = 50/160 (31%), Gaps = 24/160 (15%)
Query: 179 PTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSP-- 236
P+ S P S PT PG AP+ + PS P + SP
Sbjct: 381 PSGSAAEGPASGGAATIPT------PGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRV 434
Query: 237 ----YPLYPPGGSTPATPYP-VPTPSPAP----AVTSGGTITDEHIKMSLKSAVEDKIRR 287
P PP P P P +P SP P +V S S + R
Sbjct: 435 PWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTPSGPR 494
Query: 288 KMN-------EQKVQAQEEIEILKQTENELNQGKFKLARM 320
+ + Q +L+Q E G+ +LA M
Sbjct: 495 TWDGFLEFCQGRNGQGGRLATVLRQATPEHADGRLRLATM 534
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 40.1 bits (94), Expect = 0.002
Identities = 27/98 (27%), Positives = 36/98 (36%), Gaps = 1/98 (1%)
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP-AV 262
P+AAAA S S S + P P P PV PS AP AV
Sbjct: 388 AAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAV 447
Query: 263 TSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEI 300
++ I +S S++ R + E+ QA I
Sbjct: 448 RPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNI 485
Score = 32.8 bits (75), Expect = 0.37
Identities = 19/104 (18%), Positives = 26/104 (25%), Gaps = 11/104 (10%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
P S A + S P + P+ P SA A P+ S+ P
Sbjct: 381 PVFTQPAAAPQPSAAAAASPS----PSQSSAAAQPSAPQSA----TQPAGTPPTVSVDPP 432
Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSG 265
P P P + P S P+
Sbjct: 433 AAVPVNPPSTA---PQAVRPAQFKEEKKIPVSKVSSLGPSTLRP 473
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 39.9 bits (93), Expect = 0.002
Identities = 20/75 (26%), Positives = 26/75 (34%), Gaps = 1/75 (1%)
Query: 191 VNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPA-T 249
P P PP+A AAAA + + P+ +P A P G PA
Sbjct: 394 AAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPA 453
Query: 250 PYPVPTPSPAPAVTS 264
P P + PA
Sbjct: 454 PAAAPAAAARPAAAG 468
Score = 34.1 bits (78), Expect = 0.16
Identities = 25/117 (21%), Positives = 35/117 (29%), Gaps = 7/117 (5%)
Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
A +A P ++ P P A+ A + + +
Sbjct: 384 QPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
PG P A AP + P A P A G P + PA P P+PA
Sbjct: 444 PGGAPAPAPAP-----AAAPAAAARPAAAGPRPVA--AAAAAAPARAAPAAAPAPAD 493
Score = 32.9 bits (75), Expect = 0.40
Identities = 25/141 (17%), Positives = 40/141 (28%), Gaps = 21/141 (14%)
Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASN-----APYPTQSYMPMPGSNVNTPYPTY 198
+A +A + ++ ++ PA AS AP P + P +
Sbjct: 410 PAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGP 469
Query: 199 PPSAYPGYNPNAAAAPSGSIYPSYPT-------PGATPGAGGYSPYPLYP---------P 242
P A A AAP+ + P+ P P P
Sbjct: 470 RPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDP 529
Query: 243 GGSTPATPYPVPTPSPAPAVT 263
+ P + P+PA A
Sbjct: 530 ATADPDDAFETLAPAPAAAPA 550
Score = 32.2 bits (73), Expect = 0.61
Identities = 19/84 (22%), Positives = 28/84 (33%), Gaps = 7/84 (8%)
Query: 187 PGSNVNTPYPTYPPSAY-------PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPL 239
P + P P+A P AA A + + + A P +P L
Sbjct: 374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAL 433
Query: 240 YPPGGSTPATPYPVPTPSPAPAVT 263
++ P P P+PAPA
Sbjct: 434 AAARQASARGPGGAPAPAPAPAAA 457
Score = 30.6 bits (69), Expect = 1.7
Identities = 24/121 (19%), Positives = 34/121 (28%), Gaps = 6/121 (4%)
Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
A+A +A + A P A+ A P + + + P A
Sbjct: 375 ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPAR-RSPAPEAL 433
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
+A P G+ P+ P P A P A G A APA
Sbjct: 434 AAARQASARGPGGAPAPA-PAPAAAPAAAARPA----AAGPRPVAAAAAAAPARAAPAAA 488
Query: 264 S 264
Sbjct: 489 P 489
Score = 29.5 bits (66), Expect = 4.4
Identities = 28/126 (22%), Positives = 39/126 (30%), Gaps = 4/126 (3%)
Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
A A + P + + A P A+ AP ++ + P P +
Sbjct: 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEF 506
Query: 204 PGYNP-NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP---ATPYPVPTPSPA 259
P AAP+G + S P P + P P A PV P P
Sbjct: 507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566
Query: 260 PAVTSG 265
A SG
Sbjct: 567 RASASG 572
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 38.4 bits (90), Expect = 0.003
Identities = 31/137 (22%), Positives = 56/137 (40%), Gaps = 30/137 (21%)
Query: 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA----ELERS--ISFLKEKET 343
++ + + +E LK+ EL Q +L +I+K K ER+ + LK+ E
Sbjct: 61 SQALNKLKTRLEKLKKELEELKQRIAELQ---AQIEKLKKGREETEERTELLEELKQLEK 117
Query: 344 ELDEILAVLTEKTEVD---VDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVI 400
EL ++ A L + + D +++ T + K+ N +T D I+
Sbjct: 118 ELKKLKAELEKYEKNDPERIEKLKEET-KVAKEAANRWT------DNIF----------- 159
Query: 401 DLEVYLKQVRSLSRKQF 417
L+ YL + L Q
Sbjct: 160 SLKSYLSKKFGLDEAQL 176
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 39.1 bits (91), Expect = 0.005
Identities = 27/127 (21%), Positives = 47/127 (37%), Gaps = 13/127 (10%)
Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
K + + Q + +E LKQ NE+++ + + A + ++ +AE R L E +EL
Sbjct: 175 KPSGRNPQINQLLEKLKQERNEIDEAEKEYATYHKLLESRRAEHAR----LAELRSELRA 230
Query: 348 ILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLK 407
+ + +Q L EE AT R GV+ LE
Sbjct: 231 DRDHIRALRDAVELWPRLQEWKQLEQELTRRREELATFP---------RDGVLRLEKREA 281
Query: 408 QVRSLSR 414
++
Sbjct: 282 HLQKTEA 288
Score = 29.0 bits (65), Expect = 5.6
Identities = 24/131 (18%), Positives = 49/131 (37%), Gaps = 9/131 (6%)
Query: 278 KSAVEDKIRR--KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
++ ++ +R +++E K A E E L E +L + FE E+ RS+
Sbjct: 312 EAVLQALVRLHQQLSEIKASAFELTETLAGIEADLRDKEEAAGNGFE---AERVHDLRSL 368
Query: 336 SFLKE---KETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMG 392
+ + EL + A ++ + +++ + E A ++
Sbjct: 369 ECMLRYQSSQRELKQTEAAYCKRLDEKRLFEDEAEEEARQRLADDEEEVRAGDEAREEKI 428
Query: 393 EALRRGVIDLE 403
A + VID E
Sbjct: 429 AANSQ-VIDKE 438
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 37.6 bits (87), Expect = 0.006
Identities = 25/113 (22%), Positives = 37/113 (32%), Gaps = 5/113 (4%)
Query: 166 PTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSY-PT 224
P + +P QS N ++ + P + + Y +
Sbjct: 59 PRSAMSGPMGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGA 118
Query: 225 PGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSL 277
P PG GG + PG P +P P+ P V I DE + MSL
Sbjct: 119 PAGHPGGGGPQQ---FRPGAGQPPGMQHMPAPALPPNVIDTDLI-DEEVLMSL 167
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 38.0 bits (88), Expect = 0.006
Identities = 24/78 (30%), Positives = 30/78 (38%), Gaps = 6/78 (7%)
Query: 184 MPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPG 243
+ +P + P P PP AAAAP+ + P P P A P P PP
Sbjct: 31 IALPATANADPAPPPPPP------STAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPP 84
Query: 244 GSTPATPYPVPTPSPAPA 261
+ P P P P AP
Sbjct: 85 PADPNAPPPPPVDPNAPP 102
Score = 30.2 bits (68), Expect = 1.8
Identities = 25/94 (26%), Positives = 31/94 (32%), Gaps = 2/94 (2%)
Query: 144 ASAFS-ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSA 202
ASA + ALP ++ P T A AP P P + P PP
Sbjct: 26 ASAVAIALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85
Query: 203 YPGYNPNAAAAPSGSIYPSYPTPGATPGA-GGYS 235
P + P P PG A GG+S
Sbjct: 86 ADPNAPPPPPVDPNAPPPPAPEPGRIDNAVGGFS 119
Score = 28.7 bits (64), Expect = 6.2
Identities = 23/87 (26%), Positives = 26/87 (29%), Gaps = 10/87 (11%)
Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG 232
A AP P P P PP+A P P+ A P P A P A
Sbjct: 39 ADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAP-------PPPPADPNA- 90
Query: 233 GYSPYPLYPPGGSTPATPYPVPTPSPA 259
P P P P P P +
Sbjct: 91 --PPPPPVDPNAPPPPAPEPGRIDNAV 115
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 38.0 bits (89), Expect = 0.007
Identities = 19/92 (20%), Positives = 40/92 (43%), Gaps = 8/92 (8%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
+ L+ +E++ +R+ E + + QE ++ + E + + + E+ K + E++
Sbjct: 73 RAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDE 132
Query: 334 SIS-------FLKEKETELDE-ILAVLTEKTE 357
KE+E E + IL EK E
Sbjct: 133 FNEERIERKEEEKEREREEELKILEYQREKAE 164
Score = 34.1 bits (79), Expect = 0.14
Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 2/73 (2%)
Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
DE + E K R+K E+ + + + + L++ E Q + K R+ E +E+A
Sbjct: 202 DELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREE--QIEEKEERLQEERAEEEA 259
Query: 330 ELERSISFLKEKE 342
E ER + E E
Sbjct: 260 ERERMLEKQAEDE 272
Score = 33.3 bits (77), Expect = 0.23
Identities = 19/93 (20%), Positives = 34/93 (36%), Gaps = 19/93 (20%)
Query: 282 EDKIRRKMNEQ------KVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
E KI E+ + + E + K+ E +L E + E+ EL+
Sbjct: 152 ELKILEYQREKAEREEEREAERRERKEEKEREVA------RLRAQQEEAEDEREELDELR 205
Query: 336 SFL-------KEKETELDEILAVLTEKTEVDVD 361
+ L KE++ E +E +K E+
Sbjct: 206 ADLYQEEYERKERQKEKEEAEKRRRQKQELQRA 238
Score = 32.6 bits (75), Expect = 0.37
Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 9/80 (11%)
Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFER--IDKEKAELERS 334
+ E++I RK ++ + +E E LK E + + + + R ER +EK
Sbjct: 130 IDEFNEERIERK---EEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVAR 186
Query: 335 ISFLKEKET----ELDEILA 350
+ +E+ ELDE+ A
Sbjct: 187 LRAQQEEAEDEREELDELRA 206
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 38.1 bits (88), Expect = 0.008
Identities = 42/150 (28%), Positives = 51/150 (34%), Gaps = 11/150 (7%)
Query: 123 SPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQS 182
+PY TWT + A + LP ++ T Q P PT + P Q
Sbjct: 660 TPYKPTWTQIGHI--PYQPSPTGANTMLPIQWAPGTMQPPPRAPT-PMRPPAAPPGRAQR 716
Query: 183 YMPMPGSN---VNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS-----YPTPGATPGAGGY 234
G P PP+A PG AAAP + P+ P A PGA
Sbjct: 717 PAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTP 776
Query: 235 SPYPLYPPGGSTPATPYPVPTPSPAPAVTS 264
P P PP P P P P TS
Sbjct: 777 QPPPQAPPAPQQRPRGAPTPQPPPQAGPTS 806
Score = 32.7 bits (74), Expect = 0.48
Identities = 26/103 (25%), Positives = 33/103 (32%), Gaps = 7/103 (6%)
Query: 165 NPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPT 224
P+ G M P P P PP+A PG AAA + P+
Sbjct: 675 QPSPTGANTMLPIQWAPGTMQPPPR---APTPMRPPAAPPGRAQRPAAATGRARPPAAAP 731
Query: 225 PGATPGAGGYSPYPLYPPGGSTPAT--PYPVPTPSPAPAVTSG 265
A P A +P PP + P P + PA G
Sbjct: 732 GRARPPAA--APGRARPPAAAPGRARPPAAAPGRARPPAAAPG 772
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 38.1 bits (89), Expect = 0.009
Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
E+K++ + E K +A EE L + E ++ + R+++E +E I L+EK
Sbjct: 279 EEKVKE-LKELKEKA-EEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK 336
Query: 342 ETELDEILAVLTE 354
E L+E+ L E
Sbjct: 337 EERLEELKKKLKE 349
Score = 36.6 bits (85), Expect = 0.026
Identities = 32/104 (30%), Positives = 58/104 (55%), Gaps = 13/104 (12%)
Query: 280 AVEDKIRRKMNEQKV-QAQEEIEILKQTENELNQGKFK-LARMFERID---KEKAELERS 334
+E+K++ K N +++ + EE E LK+ +L +G+ K L + E+++ K+ AELE+
Sbjct: 507 ELEEKLK-KYNLEELEKKAEEYEKLKEKLIKL-KGEIKSLKKELEKLEELKKKLAELEKK 564
Query: 335 ISFLKEKETELDEILAVLTE---KTEVDVDEAVTTTAPIYKQIL 375
L E E EL E+L L E ++ +++E + P Y + L
Sbjct: 565 ---LDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYL 605
Score = 36.6 bits (85), Expect = 0.028
Identities = 19/84 (22%), Positives = 38/84 (45%), Gaps = 9/84 (10%)
Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLA--------RMFERIDKEKAELER 333
++ +++ E + + E E K + EL + K +L + E ++K K E+E
Sbjct: 347 LKELEKRLEELEERH-ELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEE 405
Query: 334 SISFLKEKETELDEILAVLTEKTE 357
IS + + EL + + L + E
Sbjct: 406 EISKITARIGELKKEIKELKKAIE 429
Score = 36.2 bits (84), Expect = 0.035
Identities = 27/117 (23%), Positives = 55/117 (47%), Gaps = 8/117 (6%)
Query: 270 DEHIKMS-LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
+E+IK+S D++R + ++ + +EEI +++ EL + + +L E + K+
Sbjct: 293 EEYIKLSEFYEEYLDELRE-IEKRLSRLEEEINGIEERIKELEEKEERL----EELKKKL 347
Query: 329 AELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
ELE+ + L+E+ +E A E + T + K++ E+A E
Sbjct: 348 KELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEE--LEKAKEE 402
Score = 35.8 bits (83), Expect = 0.047
Identities = 18/84 (21%), Positives = 44/84 (52%), Gaps = 7/84 (8%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA---E 330
+++ S+ ++R ++ + ++E++ L++ + E+ + + +L + K + E
Sbjct: 208 EINEISSELPELREELEK----LEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRE 263
Query: 331 LERSISFLKEKETELDEILAVLTE 354
LE I LK++ EL+E + L E
Sbjct: 264 LEERIEELKKEIEELEEKVKELKE 287
Score = 34.7 bits (80), Expect = 0.11
Identities = 21/107 (19%), Positives = 47/107 (43%), Gaps = 12/107 (11%)
Query: 278 KSAVEDKIRRKMNEQKVQAQEEIEILKQTEN---ELNQGKFKLARMFERIDKEKAELERS 334
+ + ++ E + +E ++ L+ N EL + +L R + + K + EL+++
Sbjct: 572 LAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKA 631
Query: 335 ISFLKEKETELDEILAVLTE-KTEVDVDEAVTTTAPIYKQILNTFTE 380
L E E L+E+ L E + + +E Y+++ + E
Sbjct: 632 FEELAETEKRLEELRKELEELEKKYSEEE--------YEELREEYLE 670
Score = 33.5 bits (77), Expect = 0.25
Identities = 17/82 (20%), Positives = 45/82 (54%), Gaps = 1/82 (1%)
Query: 267 TITDEHIKMSLKSAVEDKIRRKMNE-QKVQAQEEIEILKQTENELNQGKFKLARMFERID 325
+ +++ +++R+++ E +K ++EE E L++ EL++ L E ++
Sbjct: 627 ELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELE 686
Query: 326 KEKAELERSISFLKEKETELDE 347
K + E+++++ LKE+ E ++
Sbjct: 687 KRREEIKKTLEKLKEELEEREK 708
Score = 32.3 bits (74), Expect = 0.64
Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
+K+ +++ E + + +EEIE L++ L K KL +++ EL++ I L+EK
Sbjct: 224 EKLEKEVKELE-ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKV 282
Query: 343 TELDEI 348
EL E+
Sbjct: 283 KELKEL 288
Score = 31.6 bits (72), Expect = 1.1
Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 4/74 (5%)
Query: 278 KSAVEDKIRRKMN-EQKVQAQEE--IEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
+E I+R N E+ ++ +E+ E+L++ NE++ +L E+++KE ELE
Sbjct: 178 IERLEKFIKRTENIEELIKEKEKELEEVLREI-NEISSELPELREELEKLEKEVKELEEL 236
Query: 335 ISFLKEKETELDEI 348
++E E EL+ +
Sbjct: 237 KEEIEELEKELESL 250
Score = 30.4 bits (69), Expect = 2.1
Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 5/91 (5%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID---KE 327
E+I+ L E ++ + E + E E+ ++ E L + +L + E I+ KE
Sbjct: 189 ENIE-ELIKEKEKELEEVLREINEISSELPELREELEK-LEKEVKELEELKEEIEELEKE 246
Query: 328 KAELERSISFLKEKETELDEILAVLTEKTEV 358
LE S L+EK EL+E + L ++ E
Sbjct: 247 LESLEGSKRKLEEKIRELEERIEELKKEIEE 277
Score = 30.0 bits (68), Expect = 2.8
Identities = 19/80 (23%), Positives = 40/80 (50%), Gaps = 13/80 (16%)
Query: 284 KIRRKMNEQKVQAQEEIE-ILKQTEN--EL--NQGKFKLARMFERIDK---EKAELERSI 335
++ +++ + E +E +K+TEN EL + K +L + I++ E EL +
Sbjct: 169 EVIKEIKRRI----ERLEKFIKRTENIEELIKEKEK-ELEEVLREINEISSELPELREEL 223
Query: 336 SFLKEKETELDEILAVLTEK 355
L+++ EL+E+ + E
Sbjct: 224 EKLEKEVKELEELKEEIEEL 243
Score = 29.3 bits (66), Expect = 5.2
Identities = 31/144 (21%), Positives = 62/144 (43%), Gaps = 18/144 (12%)
Query: 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
+ K +E +R + ++E+E L++ + E+ + K+ + KE EL+++
Sbjct: 368 KAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKA 427
Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE-AATEDTIYYMGE 393
I LK+ + + LTE+ K++L +T E E + + E
Sbjct: 428 IEELKKAKGKCPVCGRELTEEHR--------------KELLEEYTAELKRIEKELKEIEE 473
Query: 394 A---LRRGVIDLEVYLKQVRSLSR 414
LR+ + +LE LK+ L +
Sbjct: 474 KERKLRKELRELEKVLKKESELIK 497
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 37.1 bits (87), Expect = 0.009
Identities = 28/112 (25%), Positives = 52/112 (46%), Gaps = 18/112 (16%)
Query: 294 VQAQEEIEILK-QTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352
++A+EEI L+ + E EL + + +L R +R+ +++ L+R L++KE L+E L
Sbjct: 56 LEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEKEL 115
Query: 353 T------EKTEVDVDEAVTT-----------TAPIYKQILNTFTEEAATEDT 387
E+ E +++E + TA K+IL E +
Sbjct: 116 AARQQQLEEKEEELEELIEEQQQELERISGLTAEEAKEILLEEVEAELKHEA 167
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 37.6 bits (88), Expect = 0.010
Identities = 29/118 (24%), Positives = 49/118 (41%), Gaps = 19/118 (16%)
Query: 281 VEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE 340
++++ + E + + +E L++ E L Q + L R E +DK++ LE+ L
Sbjct: 56 AKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSN 115
Query: 341 KETELDEILAVLTEK-----------TEVDVDEAVTTTAPIYKQILNTFTEEAATEDT 387
KE LDE L E + + +EA K+IL EE A +
Sbjct: 116 KEKNLDEKEEELEELIAEQREELERISGLTQEEA--------KEILLEEVEEEARHEA 165
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 37.4 bits (87), Expect = 0.011
Identities = 28/155 (18%), Positives = 54/155 (34%), Gaps = 14/155 (9%)
Query: 218 IYPSYPTPGATPGAGGYSPYPLYPPGGSTP----ATPYPVPTPSPAPAVTSGGTITDE-- 271
+ S + +P + + + D
Sbjct: 269 LDSSKSLENTSEPGRLIAPQEACRNRIAEVVQEAVRKMESVAEEKMRMLKKARSALDACD 328
Query: 272 -HIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM----FERI-- 324
++ + E K+ R+ + ++ E I LKQ E ++ Q K AR +RI
Sbjct: 329 RELEEKAREVSELKMERQRKKPQIDELESIVRLKQAEADMFQLKADEARREAERLQRIAL 388
Query: 325 -DKEKAELERSISFLKEKETELDEILAVLTEKTEV 358
EK+E E + +LK + +E +E L E+ ++
Sbjct: 389 AKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKL 423
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 37.8 bits (87), Expect = 0.011
Identities = 24/88 (27%), Positives = 38/88 (43%), Gaps = 7/88 (7%)
Query: 186 MPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPG-- 243
+PG V Y P + G + + P+GS S T + + +P P G
Sbjct: 550 IPGKGV-----PYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHL 604
Query: 244 GSTPATPYPVPTPSPAPAVTSGGTITDE 271
GS PATP + +PS +P + G+ +
Sbjct: 605 GSPPATPSKIVSPSTSPPASHLGSPSTT 632
Score = 33.5 bits (76), Expect = 0.25
Identities = 23/88 (26%), Positives = 37/88 (42%), Gaps = 4/88 (4%)
Query: 177 PYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSP 236
PY + PGS +TP SA ++ +PS + PS P G+ +P
Sbjct: 556 PYIPGLFAGNPGSTNSTP----TGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATP 611
Query: 237 YPLYPPGGSTPATPYPVPTPSPAPAVTS 264
+ P S PA+ P+ +P+ +S
Sbjct: 612 SKIVSPSTSPPASHLGSPSTTPSSPESS 639
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 37.3 bits (87), Expect = 0.012
Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 1/76 (1%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQ-GKFKLARMFERIDKEKAELERSISFLKEK 341
+ I+ K+ ++ +EEI L+Q +ELN +L + + + ++ L+E
Sbjct: 174 NSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEEL 233
Query: 342 ETELDEILAVLTEKTE 357
+ EL E+ + T
Sbjct: 234 QQELQELTIAIEALTN 249
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 37.3 bits (87), Expect = 0.012
Identities = 22/111 (19%), Positives = 41/111 (36%), Gaps = 24/111 (21%)
Query: 269 TDEHIKMSLKSAVE--DKIRRKM--NEQKVQAQ---------EEIEILKQTENE--LNQG 313
E ++ + + A E K + +K+Q Q EE+E L + +G
Sbjct: 68 AFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEG 127
Query: 314 KFK-LARMFERIDKEKAELERSISFLKEK--------ETELDEILAVLTEK 355
K E + + +L + I L +K E E+ E+ + +K
Sbjct: 128 YEKGREEGLEEVRELIEQLNKIIEKLIKKREEILESSEEEIVELALDIAKK 178
Score = 28.8 bits (65), Expect = 5.2
Identities = 13/64 (20%), Positives = 28/64 (43%), Gaps = 4/64 (6%)
Query: 282 EDKIRRKMNEQKVQAQEE-IEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE 340
++ E A+EE E L + +E + + ++ +I++E+ E E + L E
Sbjct: 61 IEEAEEDAFEIVEAAEEEAKEELLKKTDEASS---IIEKLQMQIEREQEEWEEELERLIE 117
Query: 341 KETE 344
+
Sbjct: 118 EAKA 121
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 36.8 bits (85), Expect = 0.013
Identities = 21/99 (21%), Positives = 48/99 (48%), Gaps = 5/99 (5%)
Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
T+ M LK E+ ++ K+ E + + +E ++ L++ E E + + +L +R++
Sbjct: 123 TLQRVPEYMDLKEDYEE-LKEKLEELQKEKEELLKELEELEAEYEEVQERL----KRLEV 177
Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVT 365
E + LE + L + +L + L E+ +E ++
Sbjct: 178 ENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELIS 216
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 37.0 bits (86), Expect = 0.014
Identities = 30/146 (20%), Positives = 55/146 (37%), Gaps = 37/146 (25%)
Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIE---------------------ILKQ 305
E + +LK ++D++ N +E E +LK
Sbjct: 106 KELTEKFQSTLKD-IQDQMEEHSNPNHKLCEENEELREKLKELIEQYELREQHFEKLLKT 164
Query: 306 TENE--LNQGKFKLARMF-----ERIDKEKAEL---ERSISFLKEKETELDEILAVLTEK 355
+ E LN K + A+ E+ +EK L +S LKE E EL L + ++K
Sbjct: 165 KDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKETEKELRAQLNLYSDK 224
Query: 356 TEVDVDEAVTTTAPIYKQILNTFTEE 381
+ + + + + ++ TF +E
Sbjct: 225 FK-EFQDTLNKS----NEVFLTFKQE 245
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 37.5 bits (86), Expect = 0.015
Identities = 26/119 (21%), Positives = 36/119 (30%), Gaps = 9/119 (7%)
Query: 150 LPPLYSSRTKQ-EPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
LPP + T T+ T G +S P T P S + P P
Sbjct: 455 LPPASTGPTVSTADPTSGTPTGTTSSTLPEDTS-----PTSRTTSATPNATSPTPAVTTP 509
Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTP---SPAPAVTS 264
NA + + + TP G + PP G+T P SP +
Sbjct: 510 NATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNT 568
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 37.6 bits (88), Expect = 0.015
Identities = 19/63 (30%), Positives = 29/63 (46%)
Query: 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354
Q+E+ LKQ + K + + E +E LE + L+EK+ EL+ L L E
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE 205
Query: 355 KTE 357
K
Sbjct: 206 KAA 208
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 34.9 bits (81), Expect = 0.016
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQ--GKFKLARMFERIDKEKAELERSISFLKE 340
D++ +++E++ + Q E+E L+ NEL++ GK K + E + AE++ LK
Sbjct: 29 DELL-ELDEERRELQVELEELQAERNELSKEIGKAKKKK--EDAEALIAEVKELKDELKA 85
Query: 341 KETELDEILAVLTEK 355
E EL E+ A L +
Sbjct: 86 LEAELRELEAELDKL 100
>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
Length = 364
Score = 37.0 bits (87), Expect = 0.017
Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 17/107 (15%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
+++ +K +A+ E+E+L++ L +G K AR E + E+ +L +S+ L K
Sbjct: 145 ERLEKKAKGGDKEAKAELELLEKLLEHLEEG--KPARTLE-LTDEEEKLLKSLQLLTAK- 200
Query: 343 TELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIY 389
+L V +VDE Y + + E AA E
Sbjct: 201 ----PVLYV------ANVDEDDLADGNPYVKKVR---EIAAKEGAEV 234
>gnl|CDD|217372 pfam03105, SPX, SPX domain. We have named this region the SPX
domain after (SYG1, Pho81 and XPR1). This 180 residue
length domain is found at the amino terminus of a
variety of proteins. In the yeast protein SYG1, the
N-terminus directly binds to the G- protein beta subunit
and inhibits transduction of the mating pheromone
signal. This finding suggests that all the members of
this family are involved in G-protein associated signal
transduction. The N-termini of several proteins involved
in the regulation of phosphate transport, including the
putative phosphate level sensors PHO81 from
Saccharomyces cerevisiae and NUC-2 from Neurospora
crassa, are also members of this family. The SPX domain
of S. cerevisiae low-affinity phosphate transporters
Pho87 and Pho90 auto-regulates uptake and prevents
efflux. This SPX dependent inhibition is mediated by the
physical interaction with Spl2 NUC-2 contains several
ankyrin repeats pfam00023. Several members of this
family are annotated as XPR1 proteins: the xenotropic
and polytropic retrovirus receptor confers
susceptibility to infection with murine leukaemia
viruses (MLV). The similarity between SYG1, phosphate
regulators and XPR1 sequences has been previously noted,
as has the additional similarity to several predicted
proteins, of unknown function, from Drosophila
melanogaster, Arabidopsis thaliana, Caenorhabditis
elegans, Schizosaccharomyces pombe, and Saccharomyces
cerevisiae. In addition, given the similarities between
XPR1 and SYG1 and phosphate regulatory proteins, it has
been proposed that XPR1 might be involved in G-protein
associated signal transduction and may itself function
as a phosphate sensor.
Length = 176
Score = 35.9 bits (83), Expect = 0.019
Identities = 34/135 (25%), Positives = 57/135 (42%), Gaps = 20/135 (14%)
Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 336
+K K+ NE E +L + E E ++ +F FE +DKE LE+ S
Sbjct: 29 IKELTRAKLSLTTNED--GEDESSNLLNEEELEEDESEF-----FEALDKE---LEKVNS 78
Query: 337 FLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR 396
F KEKE EL E L L ++ E + T++ + E L+
Sbjct: 79 FYKEKEAELLERLEELEKQLEELEERRDETSSAKSSPSDKNLLKA----------FEELK 128
Query: 397 RGVIDLEVYLKQVRS 411
+ +++L L+ ++S
Sbjct: 129 KALLELYRELQLLKS 143
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 36.9 bits (85), Expect = 0.019
Identities = 22/80 (27%), Positives = 29/80 (36%), Gaps = 7/80 (8%)
Query: 181 QSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLY 240
Q + P PS P N + PS P+PG PG+ P
Sbjct: 395 QGGLGANPMQQGQPGMMSSPSPVPQVQTNQSMPQPPQ--PSVPSPG-GPGSQP----PQS 447
Query: 241 PPGGSTPATPYPVPTPSPAP 260
GG P+ P +P+PSP
Sbjct: 448 VSGGMIPSPPALMPSPSPQM 467
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 36.5 bits (85), Expect = 0.020
Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 1/62 (1%)
Query: 297 QEEIEILKQTENELNQGKFK-LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355
+EE+ LKQ E+EL L R E++ K E+ + L+E E EL E+ + + +
Sbjct: 185 EEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDL 244
Query: 356 TE 357
T
Sbjct: 245 TN 246
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 33.4 bits (77), Expect = 0.022
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 8/77 (10%)
Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338
S++E K+ K +EQ Q EE E L + E + N KL + ELE+ I+ L
Sbjct: 1 SSLEKKLAEK-DEQIAQLMEEGEKLSKKELKHNNTIKKL-------RAKNKELEKEIAEL 52
Query: 339 KEKETELDEILAVLTEK 355
K+K +L++ L L E+
Sbjct: 53 KKKLEKLEKELENLEER 69
Score = 28.4 bits (64), Expect = 1.5
Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 9/67 (13%)
Query: 282 EDKIRRKMNE-QKVQAQE-----EIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
+++I + M E +K+ +E I+ L+ EL + +L + E+++KE LE
Sbjct: 11 DEQIAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLE--- 67
Query: 336 SFLKEKE 342
LK E
Sbjct: 68 ERLKRAE 74
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 35.9 bits (83), Expect = 0.025
Identities = 18/90 (20%), Positives = 39/90 (43%), Gaps = 14/90 (15%)
Query: 271 EHIKMSLK--SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
E ++ + K E+ + E+ Q +EE ++L++ +EL + +L ++E+
Sbjct: 19 EDMRRAQKELEEYEETALE-LEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEER 77
Query: 329 -------AELERSISFLKE----KETELDE 347
E ++ L+E KE E +
Sbjct: 78 ERLEAEVDEATAEVAKLEEEREKKEAETRQ 107
Score = 28.2 bits (63), Expect = 8.0
Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 4/71 (5%)
Query: 291 EQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAEL---ERSISFLKEKETELD 346
E+ + Q+E+E ++Q E ++ + + +L E + + +L E L++K EL+
Sbjct: 1 EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60
Query: 347 EILAVLTEKTE 357
E L E+
Sbjct: 61 EENRRLEEEAA 71
Score = 28.2 bits (63), Expect = 8.4
Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 1/71 (1%)
Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE-L 345
R++ E+ ++EE E L+ +E KL E+ + E +L++ + +E
Sbjct: 64 RRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERAR 123
Query: 346 DEILAVLTEKT 356
E+L T
Sbjct: 124 QELLEAAAAPT 134
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 36.5 bits (85), Expect = 0.026
Identities = 18/107 (16%), Positives = 48/107 (44%), Gaps = 7/107 (6%)
Query: 263 TSGGTITDEHIKMSLKSAVE-----DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 317
+ GT T+ + + E ++ +++ E+ + ++ ++ L + Q +
Sbjct: 315 SPAGTKTEIEVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLP--KKARGQLPPEK 372
Query: 318 ARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAV 364
E++ + K +L + L+E+ EL E L L + ++ V++ +
Sbjct: 373 REQLEKLLETKEKLSEELEELEEELKELKEELESLYSEGKISVNKTI 419
>gnl|CDD|224312 COG1394, NtpD, Archaeal/vacuolar-type H+-ATPase subunit D [Energy
production and conversion].
Length = 211
Score = 35.3 bits (82), Expect = 0.035
Identities = 21/130 (16%), Positives = 54/130 (41%), Gaps = 11/130 (8%)
Query: 273 IKMSLKSAVED--KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
+ M ++ V++ ++R ++ ++ +A E + + E + + + KEK E
Sbjct: 36 LIMEFRAIVKEAKELREELEKELEEAYESLALASAAEGID-----AVEEIAL-VQKEKLE 89
Query: 331 LE---RSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDT 387
++ +I + EL E+ + + + ++++L E A E T
Sbjct: 90 VDVDVENIMGVVVPTFELVELTPPPYDLGILSTSAWLDEAIEKFEELLEKLIELAELETT 149
Query: 388 IYYMGEALRR 397
+ + E +R+
Sbjct: 150 LRLLLEEIRK 159
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 35.4 bits (82), Expect = 0.038
Identities = 21/100 (21%), Positives = 40/100 (40%), Gaps = 5/100 (5%)
Query: 282 EDKIRRKMN--EQKVQA---QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 336
+IR ++ E+K+ A + E+ L + L + +E +LE+ I
Sbjct: 68 IQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIE 127
Query: 337 FLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILN 376
LKE+ L++ LA + E +V E + +
Sbjct: 128 DLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREE 167
Score = 31.6 bits (72), Expect = 0.59
Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 12/72 (16%)
Query: 280 AVEDKIRRKMNEQKVQAQEEIEILK----QTENELNQGKFKLARMFERIDKEKAELERSI 335
++ED++ M E + + ++EIE LK + E L + + R+++E AE+
Sbjct: 107 SLEDELAELMEEIE-KLEKEIEDLKERLERLEKNLAEAE-------ARLEEEVAEIREEG 158
Query: 336 SFLKEKETELDE 347
L K EL E
Sbjct: 159 QELSSKREELKE 170
Score = 29.6 bits (67), Expect = 2.6
Identities = 26/147 (17%), Positives = 58/147 (39%), Gaps = 13/147 (8%)
Query: 277 LKSAVE-DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAELE 332
LKS + K+ + + + + +E + LK+ + EL L + E ++ + ++LE
Sbjct: 6 LKSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLE 65
Query: 333 RSISFLKEKETELDEILAVLT-----EKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDT 387
I ++E+ +E L+ + +++ A + ++ E E
Sbjct: 66 SEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKE 125
Query: 388 IYYMGEALRRGVIDLEVYLKQVRSLSR 414
I E L+ + LE L + +
Sbjct: 126 I----EDLKERLERLEKNLAEAEARLE 148
>gnl|CDD|235360 PRK05183, hscA, chaperone protein HscA; Provisional.
Length = 616
Score = 35.9 bits (84), Expect = 0.039
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 4/39 (10%)
Query: 269 TDEHI-KM---SLKSAVEDKIRRKMNEQKVQAQEEIEIL 303
TD+ I +M S+ A ED R + EQKV+A+ +E L
Sbjct: 505 TDDEIARMLKDSMSHAEEDMQARALAEQKVEAERVLEAL 543
>gnl|CDD|191179 pfam05053, Menin, Menin. MEN1, the gene responsible for multiple
endocrine neoplasia type 1, is a tumour suppressor gene
that encodes a protein called Menin which may be an
atypical GTPase stimulated by nm23.
Length = 618
Score = 36.1 bits (83), Expect = 0.040
Identities = 21/107 (19%), Positives = 37/107 (34%), Gaps = 15/107 (14%)
Query: 214 PSGSIYPSYPTP----GATPGAGGYSPYPLYPPGGST---------PATPYPVPTPSPAP 260
PSG P+ P + + +T T P+P S P
Sbjct: 492 PSGGESPNPELPANNNNSNSNNNNNNGADRKEAAATTGNATTTSNGSGTSVPLPVSSEPP 551
Query: 261 AVTSGGTITDEHIKMS-LKS-AVEDKIRRKMNEQKVQAQEEIEILKQ 305
G IT KM +K + +K+ + ++ AQ ++++ Q
Sbjct: 552 QHKEGPVITFYSEKMKGMKELLLAEKLNSSAIKLQLTAQSQVQMKVQ 598
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 35.7 bits (82), Expect = 0.040
Identities = 33/181 (18%), Positives = 63/181 (34%), Gaps = 8/181 (4%)
Query: 172 MASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAA--APSGSIYPSYPTPGATP 229
++S+ + S P + +P + P + + PS + +P
Sbjct: 25 LSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFGLSKQRPSSLSRGRLSSRFVSP 84
Query: 230 GAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKM 289
G S +T +T + SP+ + + + S+ S + D R K
Sbjct: 85 SRGSPSAAASLNGSLATASTS---GSSSPSRSRRTTSSDLSSGNGPSVLSFMADVKRGKK 141
Query: 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
K E+ L+ N L Q +F AR + +K E+ + + +EL +
Sbjct: 142 GPSK---IEDAHQLRLLYNRLLQWRFVNARAEAAMAVQKLNAEKQLFNAWLRISELRNSV 198
Query: 350 A 350
A
Sbjct: 199 A 199
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 33.6 bits (77), Expect = 0.043
Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 17/91 (18%)
Query: 282 EDKIRRKMNEQKV---QAQEEI-----EILKQTENELNQGKFKLARMFERIDKEKAELER 333
E++I + E K +A+EE EI+K+ E E ++ E+ KE E +
Sbjct: 16 EERIEKAKEEAKKIIRKAKEEAKKIEEEIIKKAEEEAQ-------KLIEKKKKEGEEEAK 68
Query: 334 SISFLKEKETELDEILAVLTEKTEVDVDEAV 364
I L+E E E++E+ E E V EA+
Sbjct: 69 KI--LEEGEKEIEELKVKAEENFETAVSEAI 97
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 35.7 bits (83), Expect = 0.045
Identities = 27/103 (26%), Positives = 50/103 (48%), Gaps = 15/103 (14%)
Query: 262 VTSGGTITDEHIKMSLKSA----VEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 317
+T+ +++E I+ +K A EDK R++ E + A + Q E L + KL
Sbjct: 492 ITASSGLSEEEIERMVKEAEANAEEDKKRKEEIEARNNAD---SLAYQAEKTLKEAGDKL 548
Query: 318 ARMFERIDKEKAELERSISFLKE--KETELDEILAVLTEKTEV 358
+EK ++E++++ LKE K +++EI A E +
Sbjct: 549 P------AEEKEKIEKAVAELKEALKGEDVEEIKAKTEELQQA 585
>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family. This family includes PspA
a protein that suppresses sigma54-dependent
transcription. The PspA protein, a negative regulator of
the Escherichia coli phage shock psp operon, is produced
when virulence factors are exported through secretins in
many Gram-negative pathogenic bacteria and its homologue
in plants, VIPP1, plays a critical role in thylakoid
biogenesis, essential for photosynthesis. Activation of
transcription by the enhancer-dependent bacterial
sigma(54) containing RNA polymerase occurs through ATP
hydrolysis-driven protein conformational changes enabled
by activator proteins that belong to the large AAA(+)
mechanochemical protein family. It has been shown that
PspA directly and specifically acts upon and binds to
the AAA(+) domain of the PspF transcription activator.
Length = 220
Score = 34.7 bits (80), Expect = 0.066
Identities = 19/115 (16%), Positives = 42/115 (36%), Gaps = 14/115 (12%)
Query: 266 GTITDEHIKMSLKSAVEDKIRRKM--NEQKVQAQE-EIEILKQTEN------ELNQ--GK 314
+++ + + +++R+++ E K+Q + + LK +N G
Sbjct: 98 KQAEALETQLTQQRSAVEQLRKQLAALETKIQQLKAKKTALKARLKAAKAQEAVNTSLGS 157
Query: 315 FKL---ARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTT 366
FERI+++ E E E + D + +++V E V
Sbjct: 158 ASTESATDSFERIEEKIEEREARADAAAELASAQDLDAKLEAAGIQMEVSEDVLA 212
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 35.2 bits (81), Expect = 0.069
Identities = 19/74 (25%), Positives = 33/74 (44%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
+ + + ++ + E K + IE L E EL Q L ++ + +K K ELE+
Sbjct: 491 RYGIPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQ 550
Query: 334 SISFLKEKETELDE 347
+ LKE+E
Sbjct: 551 EMEELKERERNKKL 564
Score = 29.0 bits (65), Expect = 6.8
Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 6/73 (8%)
Query: 291 EQKVQAQEEI-----EILKQTENELNQGKFKLARMFERIDKEKAELERSIS-FLKEKETE 344
+ E+ ++LK+ E + + ++ + ER +K ELE+ LK + E
Sbjct: 521 SALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580
Query: 345 LDEILAVLTEKTE 357
++ I+ L EK
Sbjct: 581 VESIIRELKEKKI 593
>gnl|CDD|218701 pfam05698, Trigger_C, Bacterial trigger factor protein (TF)
C-terminus. In the E. coli cytosol, a fraction of the
newly synthesised proteins requires the activity of
molecular chaperones for folding to the native state.
The major chaperones implicated in this folding process
are the ribosome-associated Trigger Factor (TF), and the
DnaK and GroEL chaperones with their respective
co-chaperones. Trigger Factor is an ATP-independent
chaperone and displays chaperone and
peptidyl-prolyl-cis-trans-isomerase (PPIase) activities
in vitro. It is composed of at least three domains, an
N-terminal domain which mediates association with the
large ribosomal subunit, a central substrate binding and
PPIase domain with homology to FKBP proteins, and a
C-terminal domain of unknown function. The positioning
of TF at the peptide exit channel, together with its
ability to interact with nascent chains as short as 57
residues renders TF a prime candidate for being the
first chaperone that binds to the nascent polypeptide
chains. This family represents the C-terminal region of
the protein.
Length = 162
Score = 33.8 bits (78), Expect = 0.070
Identities = 20/91 (21%), Positives = 42/91 (46%), Gaps = 10/91 (10%)
Query: 269 TDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
++E ++ K E +++ + +++ +E+IE+ T+ E+ + +LA +
Sbjct: 78 SEEELREEFKEEAEKRVKLGLILEEIAKEEKIEV---TDEEIKEEIEELASQYG------ 128
Query: 329 AELERSISFLKEKETELDEILAVLTEKTEVD 359
E E F K K +L + L E+ VD
Sbjct: 129 MEPEEVKEFYK-KNEQLSALKEDLLEEKAVD 158
>gnl|CDD|224802 COG1890, RPS1A, Ribosomal protein S3AE [Translation, ribosomal
structure and biogenesis].
Length = 214
Score = 34.2 bits (79), Expect = 0.071
Identities = 26/117 (22%), Positives = 43/117 (36%), Gaps = 27/117 (23%)
Query: 268 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 327
T K S K A IR+ M EI+++ +EL +F + RI E
Sbjct: 125 FTRRRAKTSQKRA----IRKIM----------FEIIEEKASELTFEEFVQELIPGRIAAE 170
Query: 328 KAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAAT 384
E + I L++ V K++V A + +L+ +EE +
Sbjct: 171 IEEAAKKIYPLRK----------VEIRKSKV---LKEPKEAEPEQAVLHGESEEVGS 214
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 33.9 bits (78), Expect = 0.085
Identities = 25/113 (22%), Positives = 49/113 (43%), Gaps = 13/113 (11%)
Query: 281 VEDKIRRKMNEQKVQAQEEIEILK-----QTENELNQ---GKFKLARMFERIDKEKAELE 332
+ + K E + +A+EE EI K + E ++ + K K A M ++I A+ E
Sbjct: 2 IRQEAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNE 61
Query: 333 RSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
+ L +E LD + E+ + + ++ YK +L ++A +
Sbjct: 62 ARLKVLNAREELLDSVF----EEAKERLAN-LSEDKDEYKDLLKDLIKQALVK 109
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 34.2 bits (79), Expect = 0.089
Identities = 27/100 (27%), Positives = 41/100 (41%), Gaps = 11/100 (11%)
Query: 270 DEHIKMSLKSAVEDKI-RRKMNEQKV-QAQEEIEIL-KQTENELNQGKFKLARMFERIDK 326
E K + A+ I R+K E+K+ +AQ E L ++ E L G LAR
Sbjct: 38 SELAK--ARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALE--- 92
Query: 327 EKAELERSISFLKEKETELDEILAVLT---EKTEVDVDEA 363
EK LE L+ + + +E + L E + E
Sbjct: 93 EKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAEL 132
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 34.3 bits (79), Expect = 0.093
Identities = 22/130 (16%), Positives = 29/130 (22%), Gaps = 25/130 (19%)
Query: 154 YSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAA 213
Y P P P S+ P GY +
Sbjct: 194 YGGGGGGGGGGGGGGGSG---PGPPP-----PGFKSS-FPPPYGPGAGPSSGYGSGGTRS 244
Query: 214 PSGSIYPSYPTPGATPGAGGY-----------SPYPLYPPGGST--PATPYPVPTPSPAP 260
G P + T GA GY S ++ + S +
Sbjct: 245 GQGGWGPGFWTGLGAGGALGYLFGSRRNNNSSYGRSYGSGSPSYSPSSSSNSSSSSSSSS 304
Query: 261 AVTS---GGT 267
TS GGT
Sbjct: 305 TRTSSGFGGT 314
>gnl|CDD|233673 TIGR01991, HscA, Fe-S protein assembly chaperone HscA. The Heat
Shock Cognate proteins HscA and HscB act together as
chaperones. HscA resembles DnaK but belongs in a
separate clade. The apparent function is to aid assembly
of iron-sulfur cluster proteins. Homologs from Buchnera
and Wolbachia are clearly in the same clade but are
highly derived and score lower than some examples of
DnaK [Protein fate, Protein folding and stabilization].
Length = 599
Score = 34.6 bits (80), Expect = 0.10
Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 4/43 (9%)
Query: 267 TITDEHI-KM---SLKSAVEDKIRRKMNEQKVQAQEEIEILKQ 305
++DE I +M S K A ED R + EQKV+A+ +E L+
Sbjct: 487 GLSDEEIERMLKDSFKHAEEDMYARALAEQKVEAERILEALQA 529
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 34.1 bits (78), Expect = 0.11
Identities = 20/86 (23%), Positives = 24/86 (27%), Gaps = 4/86 (4%)
Query: 188 GSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTP---GATPGAGGYSPYPLYPPGG 244
G+ N P P Y A G P G PGAGG +P GG
Sbjct: 2 GNMQNYPQAMNPQMGGGNYPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGG-PVGGG 60
Query: 245 STPATPYPVPTPSPAPAVTSGGTITD 270
+ P A + D
Sbjct: 61 GGGSGGPPGGGEVAGEAEDAMSEFDD 86
Score = 32.2 bits (73), Expect = 0.43
Identities = 25/108 (23%), Positives = 29/108 (26%), Gaps = 13/108 (12%)
Query: 170 GNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATP 229
GNM + YP M G N YP A GY P G GA
Sbjct: 2 GNMQN---YPQAMNPQMGGGN----YPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPG 54
Query: 230 GAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTS------GGTITDE 271
G G P G S +S TI+ +
Sbjct: 55 GPVGGGGGGSGGPPGGGEVAGEAEDAMSEFDDYSSSSIEEGDTTISAD 102
>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil. Members of
this family are predominantly found in the
actin-bundling protein Cortexillin I from Dictyostelium
discoideum. They adopt a structure consisting of an
18-heptad-repeat alpha-helical coiled-coil, and are a
prerequisite for the assembly of Cortexillin I.
Length = 107
Score = 32.2 bits (73), Expect = 0.12
Identities = 22/100 (22%), Positives = 47/100 (47%), Gaps = 4/100 (4%)
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345
+ ++ + + +E K ++ EL + K +L + ++ +KAE E+ L+E + ++
Sbjct: 11 KNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQ---RLRELQAKI 67
Query: 346 DEILAVLTEKTEVDVD-EAVTTTAPIYKQILNTFTEEAAT 384
DE+L L + ++ E+ + K IL EA
Sbjct: 68 DELLKNLELEKLAKMELESRLSKTEKDKAILELKLAEAKD 107
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 34.2 bits (79), Expect = 0.12
Identities = 26/105 (24%), Positives = 46/105 (43%), Gaps = 11/105 (10%)
Query: 283 DKIRRKMNEQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
D++ +K+ E K + +EE+E LK+ EL KL + A LE + L+++
Sbjct: 173 DQLSKKLAELKAEEEEELERALKEKREELLS---KLEE------ELLARLESKEAALEKQ 223
Query: 342 -ETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
E + L +K E + + + A ++Q L A E
Sbjct: 224 LRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIE 268
Score = 33.8 bits (78), Expect = 0.17
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 280 AVEDKIRRKMNE---QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 336
++++R+K E Q+++ Q E ++ +NEL +L R F + KEK E ER+
Sbjct: 230 REKEELRKKYEEKLRQELERQAEA-HEQKLKNELALQAIELQREFNKEIKEKVEEERN-G 287
Query: 337 FLKEKETELDEILAVLTE 354
L +L E+ + L
Sbjct: 288 RLA----KLAELNSRLKG 301
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 33.6 bits (77), Expect = 0.12
Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 5/97 (5%)
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345
+ + + +E L + + +L++ L E +++ AELE + LKE L
Sbjct: 64 APVQSVRPQKLEELQGELSELKQQLSE----LQEELEDLEERIAELESELEDLKEDLQLL 119
Query: 346 DEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382
E+L L E+ ++E++ A +++ EE
Sbjct: 120 RELLKSLEER-LESLEESIKELAKELRELRQDLREEV 155
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 32.2 bits (74), Expect = 0.12
Identities = 14/73 (19%), Positives = 32/73 (43%), Gaps = 8/73 (10%)
Query: 291 EQKVQAQEEIEILKQTENELNQGK------FKLARMFERIDKEKA--ELERSISFLKEKE 342
+Q Q + +++ ++ EL + + + DKE+ ELE L+++
Sbjct: 19 QQIKQLETQLKEIELVLEELELLDEDTKVYKLIGDVLVKQDKEEVKEELEERKETLEKEI 78
Query: 343 TELDEILAVLTEK 355
L++ L L ++
Sbjct: 79 KTLEKQLEKLEKE 91
>gnl|CDD|218167 pfam04599, Pox_G5, Poxvirus G5 protein. This protein has been
predicted to be related to the FEN-1 endonuclease.
Length = 426
Score = 34.2 bits (79), Expect = 0.12
Identities = 24/94 (25%), Positives = 40/94 (42%), Gaps = 16/94 (17%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
+ +K ++ +K + + + EEIE L+ L+ M+E I K +LE
Sbjct: 76 SIDIKESLREKRKNSLKNTSKRKTEEIENLEDAIKNLDVEDE----MYEEI---KTDLEL 128
Query: 334 SIS------FLKEK---ETELDEILAVLTEKTEV 358
I FL K + L+EIL+ L E +
Sbjct: 129 RIDKLSFHNFLANKSNLKITLEEILSSLKEDVTI 162
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 33.5 bits (77), Expect = 0.12
Identities = 22/104 (21%), Positives = 40/104 (38%), Gaps = 10/104 (9%)
Query: 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
MS K ++ + E+ + E E ++ E E + + I + +A+LE
Sbjct: 1 MSDKEQKTEEPDAEETEE--AEKSEEEEAEEEEPEEENELEEEQ---QEIAELEAQLEE- 54
Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDEAVTT-TAPIYKQILNT 377
LK+K L ++TE + +EA K +L
Sbjct: 55 ---LKDKYLRAQAEFENLRKRTEREREEAKKYAIEKFAKDLLPV 95
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 32.4 bits (74), Expect = 0.13
Identities = 25/77 (32%), Positives = 26/77 (33%), Gaps = 15/77 (19%)
Query: 183 YMPMPGSNVNTPYPTYPPSAYP---GYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPL 239
Y P P P Y A P G P AP G YP+ P P GY P
Sbjct: 47 YPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGV-YPTPPPPN-----SGYMADPQ 100
Query: 240 YPPGGSTPATPYPVPTP 256
PP PYP P
Sbjct: 101 EPP------PPYPGPPQ 111
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 34.3 bits (78), Expect = 0.13
Identities = 27/123 (21%), Positives = 33/123 (26%), Gaps = 12/123 (9%)
Query: 138 GLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPT 197
+ + A S+ S+ PA P+ TQS S V
Sbjct: 303 AVTPSSAVTQSSAITPSSAAIPS-PAVIPS---------SVTTQSATTTQASAVALSSAG 352
Query: 198 YPPS--AYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPT 255
PS PG AA P T + G P S PA P
Sbjct: 353 VLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIP 412
Query: 256 PSP 258
SP
Sbjct: 413 VSP 415
Score = 32.4 bits (73), Expect = 0.43
Identities = 22/93 (23%), Positives = 30/93 (32%), Gaps = 5/93 (5%)
Query: 158 TKQEPATNPTYNGNMA-SNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSG 216
P T + + S+A P+ + +P P P SA A A S
Sbjct: 296 HGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVI---PSSVTTQSATTTQAS-AVALSSA 351
Query: 217 SIYPSYPTPGATPGAGGYSPYPLYPPGGSTPAT 249
+ PS T T P + P ST T
Sbjct: 352 GVLPSDVTLPGTVALPAAEPVNMQPQPMSTTET 384
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 34.2 bits (79), Expect = 0.14
Identities = 20/84 (23%), Positives = 44/84 (52%), Gaps = 6/84 (7%)
Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS-- 334
L++ +++++K+N++ + +E E + L + K +L ++ + ELER+
Sbjct: 314 LRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQ---SELEDLQIELERANA 370
Query: 335 -ISFLKEKETELDEILAVLTEKTE 357
S L++K+ D+ILA K +
Sbjct: 371 AASELEKKQKNFDKILAEWKRKVD 394
Score = 32.3 bits (74), Expect = 0.57
Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 7/75 (9%)
Query: 269 TDEHIKM--SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
+E K L+ V DK++ K+ K Q +E E+ + ++ + + +L ER D
Sbjct: 787 VEEDKKNLERLQDLV-DKLQAKIKTYKRQLEEAEEVAQINLSKYRKAQRELEDAEERAD- 844
Query: 327 EKAELERSISFLKEK 341
ERS++ L+ K
Sbjct: 845 ---TAERSLNKLRAK 856
Score = 31.6 bits (72), Expect = 0.87
Identities = 21/74 (28%), Positives = 34/74 (45%), Gaps = 7/74 (9%)
Query: 275 MSLKSAVE---DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL 331
LK+ +E D++ E K Q+EI L E + +L + R++ EK EL
Sbjct: 415 FRLKNELEELKDQVEALRRENK-NLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDEL 473
Query: 332 ERSISFLKEKETEL 345
+ + L+E E L
Sbjct: 474 QAA---LEEAEAAL 484
Score = 30.0 bits (68), Expect = 3.4
Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 4/81 (4%)
Query: 280 AVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS-FL 338
+K +++ Q + Q +++ L++ N+L K +L + ++ E E +S
Sbjct: 176 LNAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLS 235
Query: 339 KEK---ETELDEILAVLTEKT 356
K K E++L+E L E++
Sbjct: 236 KLKSQLESQLEEAKRSLEEES 256
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 33.7 bits (78), Expect = 0.14
Identities = 28/139 (20%), Positives = 54/139 (38%), Gaps = 29/139 (20%)
Query: 288 KMNEQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL- 345
K +Q + EI +LK+ EN +L R+ +E A L++ I +++ L
Sbjct: 47 KWADQTQELLAEIRQLLKEIENLRVYND-QLQRLVANQQQEIASLQQQIEQIEKTRQGLV 105
Query: 346 ---DEILAVLTEKTEVDV------------------DEAVTTTAPIYKQILNTFTEEAAT 384
++ L + +D+ D A + A ++++L + EA
Sbjct: 106 PLMYRMIDGLEQFVALDLPFLKEERLARLARLRDLMDRADVSLAEKFRRVLEAYQIEADY 165
Query: 385 EDTIYYMGEALRRGVIDLE 403
TI + +G I L+
Sbjct: 166 GRTI-----EVYQGEITLD 179
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 34.1 bits (78), Expect = 0.15
Identities = 24/118 (20%), Positives = 46/118 (38%), Gaps = 10/118 (8%)
Query: 256 PSPAPAVTSGGTITD----EHIKMSLKSAVEDKIRRKMNEQ--KVQAQEEIEILKQTENE 309
TS + + + E + K EQ Q ++ + +++ E
Sbjct: 84 MELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAEKN 143
Query: 310 ---LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT-EKTEVDVDEA 363
LNQ + + E+I EK L+ I+ L+ + +E D + + EK V++ E
Sbjct: 144 ILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEE 201
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 34.1 bits (79), Expect = 0.15
Identities = 27/99 (27%), Positives = 50/99 (50%), Gaps = 15/99 (15%)
Query: 262 VTSGGTITDEHI-KM---SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 317
+T+ ++D+ I +M + + A EDK R++ E K +A+E + E L + KL
Sbjct: 495 ITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVY---SLEKSLKEEGDKL 551
Query: 318 ARMFERIDKEKAELERSISFLKE--KETELDEILAVLTE 354
+ +K ++E +I +LKE + + +EI A E
Sbjct: 552 P------EADKKKVEEAIEWLKEELEGEDKEEIEAKTEE 584
>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein. Members of
this family confer resistance to the metalloid element
tellurium and its salts.
Length = 98
Score = 31.6 bits (72), Expect = 0.15
Identities = 15/50 (30%), Positives = 20/50 (40%)
Query: 241 PPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMN 290
P PA P P P P+ P S T+T E +SL ++N
Sbjct: 4 PVPPPAPAPPAPAPPPAAPPVSLSKITLTKEGPSVSLTKQGGGFGTIRVN 53
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 33.5 bits (77), Expect = 0.16
Identities = 17/72 (23%), Positives = 38/72 (52%), Gaps = 9/72 (12%)
Query: 283 DKIRRK---MNEQKVQAQEEI----EILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
D++R++ ++E+ V+ ++I E + +NEL + + K+ + R ++ A+
Sbjct: 203 DELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKAL--RAKEKAAKRREKR 260
Query: 336 SFLKEKETELDE 347
LKE+ E+ E
Sbjct: 261 EELKERAEEIYE 272
Score = 32.3 bits (74), Expect = 0.45
Identities = 14/85 (16%), Positives = 31/85 (36%), Gaps = 2/85 (2%)
Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
+E +K D++++K E + QE ++ E+ + + + + D+
Sbjct: 157 NEKLKELKAEI--DELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHE 214
Query: 330 ELERSISFLKEKETELDEILAVLTE 354
E + E E + L E
Sbjct: 215 EFVELSKKIDELHEEFRNLQNELRE 239
Score = 31.6 bits (72), Expect = 0.68
Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
++ Q +EEIE LK+ +EL + + + + E+ D+ A++ +E E DEI
Sbjct: 13 LKRKQLKEEIEELKEKRDELRK---EASELAEKRDELNAKVRELREKAQELREERDEIN 68
Score = 30.4 bits (69), Expect = 1.6
Identities = 19/86 (22%), Positives = 34/86 (39%), Gaps = 11/86 (12%)
Query: 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE-- 332
L E ++ +K+ E ++E+E K+ E KL + ID+ K +
Sbjct: 126 SVLTPEEERELVQKIKE----LRKELEDAKKALEE----NEKLKELKAEIDELKKKAREI 177
Query: 333 -RSISFLKEKETELDEILAVLTEKTE 357
I L + E E + L E+ +
Sbjct: 178 HEKIQELANEAQEYHEEMIKLFEEAD 203
Score = 30.4 bits (69), Expect = 1.9
Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 10/76 (13%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID---KEKAELERSISF-- 337
D++ K+ E + +AQE E + E+ + K K + ++ KE EL+ +
Sbjct: 44 DELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFN 103
Query: 338 -----LKEKETELDEI 348
+K E E++ +
Sbjct: 104 LGGRSIKSLEREIERL 119
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 33.8 bits (77), Expect = 0.17
Identities = 22/109 (20%), Positives = 45/109 (41%), Gaps = 26/109 (23%)
Query: 270 DEHIKMSLKSAVEDKIRR------KMNEQKVQAQEEIEILKQTENEL------------- 310
D + + +A++ K + K+ + +EEI+ L+ +EL
Sbjct: 310 DSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQ 369
Query: 311 ----NQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355
NQ + KL R ++I+ + +L +S+ K ++ E I L +
Sbjct: 370 FELMNQEREKLTRELDKINIQSDKLTKSV---KSRKLEAQGIFKSLEKT 415
Score = 28.8 bits (64), Expect = 6.2
Identities = 16/86 (18%), Positives = 41/86 (47%), Gaps = 2/86 (2%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERI--DKEKAELERSISFLKE 340
DK+ + + +K++AQ + L++T + + + R +I + + L+ +I L
Sbjct: 392 DKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLFP 451
Query: 341 KETELDEILAVLTEKTEVDVDEAVTT 366
K + ++E + + ++ E + T
Sbjct: 452 KGSGINESIKKSILELNDEIQERIKT 477
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 33.7 bits (77), Expect = 0.17
Identities = 28/112 (25%), Positives = 36/112 (32%), Gaps = 14/112 (12%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYP---PSAYPGYNPNAAAAPSGSI 218
P+ N A + P+ +PY Y P P +PS S
Sbjct: 414 PSPGSASNVPEVEPAQVEAKKTRPL------SPYARYEDLKPPTSPSPTAPTGVSPSVSS 467
Query: 219 YPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPV-----PTPSPAPAVTSG 265
S P T A + PP P +PY V P SP+PA G
Sbjct: 468 TSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVG 519
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 33.7 bits (78), Expect = 0.17
Identities = 23/96 (23%), Positives = 44/96 (45%), Gaps = 13/96 (13%)
Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQ--------GKFKLARMFERIDKEKAE 330
+ DK+ +++E++ + E+E L+ NEL++ G+ + + + K +
Sbjct: 25 ALDVDKLL-ELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK 83
Query: 331 LERSISFLKEKETELDEILAVL----TEKTEVDVDE 362
L+ + L E E ELD +L + E V DE
Sbjct: 84 LKELEAALDELEAELDTLLLTIPNIPHESVPVGKDE 119
>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
Length = 111
Score = 31.8 bits (72), Expect = 0.18
Identities = 15/81 (18%), Positives = 32/81 (39%), Gaps = 7/81 (8%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE-RS 334
S+ + + + NE K + E K+TE E L + + + E E ++
Sbjct: 21 SIDKEIANIKNEQENEIKEAKSKYEEKAKKTEEES------LNMYNAALMEARKEAEKKA 74
Query: 335 ISFLKEKETELDEILAVLTEK 355
+ + + + E I ++E
Sbjct: 75 VEIINKAKQEASLIKLKISED 95
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 34.1 bits (79), Expect = 0.19
Identities = 15/58 (25%), Positives = 22/58 (37%), Gaps = 2/58 (3%)
Query: 205 GYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
Y P + AAP+ + + A+ A + P + PA P P PA A
Sbjct: 35 DYGPGSTAAPTAA--AAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAA 90
Score = 32.6 bits (75), Expect = 0.57
Identities = 15/63 (23%), Positives = 17/63 (26%)
Query: 199 PPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSP 258
P S AAAA + S + P A P P PA
Sbjct: 38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPA 97
Query: 259 APA 261
AP
Sbjct: 98 APP 100
Score = 31.0 bits (71), Expect = 1.3
Identities = 13/61 (21%), Positives = 20/61 (32%)
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
PG AA + + + A +P P P + A P P + A A
Sbjct: 38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPA 97
Query: 264 S 264
+
Sbjct: 98 A 98
Score = 29.9 bits (68), Expect = 3.4
Identities = 17/81 (20%), Positives = 27/81 (33%)
Query: 194 PYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPV 253
P T P+A AA+AP+ + P A +P P + A P
Sbjct: 38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPA 97
Query: 254 PTPSPAPAVTSGGTITDEHIK 274
P+ A A ++ +
Sbjct: 98 APPAAAAAAAPAAAAVEDEVT 118
>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b. This model
describes the F1/F0 ATP synthase b subunit in bacteria
only. Scoring just below the trusted cutoff are the
N-terminal domains of Mycobacterial b/delta fusion
proteins and a subunit from an archaeon, Methanosarcina
barkeri, in which the ATP synthase homolog differs in
architecture and is not experimentally confirmed. This
model helps resolve b from the related b' subunit.
Within the family is an example from a
sodium-translocating rather than proton-translocating
ATP synthase [Energy metabolism, ATP-proton motive force
interconversion].
Length = 147
Score = 32.4 bits (74), Expect = 0.20
Identities = 23/126 (18%), Positives = 47/126 (37%), Gaps = 22/126 (17%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
+ I L SA K +E K+ + L + K + + E +K +E
Sbjct: 29 KKIADGLASAERAK------------KEAALAQKKAQVILKEAKDEAQEIIENANKRGSE 76
Query: 331 LERSISFLKEKETELDEILA-----VLTEKTEV--DVDEAVTTTA-PIYKQILNTFTEEA 382
+ E E ++I A + EK + ++ + V + ++I+ ++
Sbjct: 77 ILEEA--KAEAREEREKIKAQARAEIEAEKEQAREELRKQVADLSVLGAEKIIERNIDKQ 134
Query: 383 ATEDTI 388
A +D I
Sbjct: 135 AQKDLI 140
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 33.6 bits (78), Expect = 0.20
Identities = 17/70 (24%), Positives = 36/70 (51%), Gaps = 4/70 (5%)
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISF-LKEKETE 344
R++ ++ +A+ ++ ++ + EL + K ++ E DK E E+ +KE + E
Sbjct: 529 ERELEQKAEEAEALLKEAEKLKEELEE---KKEKLQEEEDKLLEEAEKEAQQAIKEAKKE 585
Query: 345 LDEILAVLTE 354
DEI+ L +
Sbjct: 586 ADEIIKELRQ 595
Score = 32.5 bits (75), Expect = 0.45
Identities = 12/81 (14%), Positives = 34/81 (41%), Gaps = 8/81 (9%)
Query: 278 KSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISF 337
K ++++ + + E + +AQ+ I+ K+ +E+ + + + A ++
Sbjct: 557 KEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEI------IKELRQLQKGGYASVKAHE-- 608
Query: 338 LKEKETELDEILAVLTEKTEV 358
L E L++ +K +
Sbjct: 609 LIEARKRLNKANEKKEKKKKK 629
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 33.6 bits (77), Expect = 0.21
Identities = 38/189 (20%), Positives = 60/189 (31%), Gaps = 21/189 (11%)
Query: 181 QSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPL- 239
PM G+ PY P +N P S+ P+ PG G +P
Sbjct: 386 PMGSPMGGAMGQPPYYGQGPQQQ--FNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAV 443
Query: 240 -YPPGGSTPATPYPVPTPSPA-PAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQ 297
P + A P P P S D S S + +K+
Sbjct: 444 RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTAS-------QGGQNKKLAQV 496
Query: 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS--FLKEKETELDEILAVLTEK 355
+ + L + R+F ++ + L I+ L+ +EL +L E
Sbjct: 497 LASATPQMQKQVLGE------RLFPLVEAIEPALAAKITGMLLEMDNSELLHLLES-PEL 549
Query: 356 TEVDVDEAV 364
+ VDEA+
Sbjct: 550 LKSKVDEAL 558
>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
Length = 866
Score = 33.7 bits (77), Expect = 0.22
Identities = 18/69 (26%), Positives = 27/69 (39%), Gaps = 1/69 (1%)
Query: 235 SPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKV 294
+ P PA P P + APA +G DE K S ++ K ++
Sbjct: 16 ATPPAPAASAPAPAAAAPAPVAAAAPA-AAGPRPDDEPFKASDALHALVALKLKKRIDQI 74
Query: 295 QAQEEIEIL 303
+A + IE L
Sbjct: 75 EALDSIEDL 83
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 33.5 bits (77), Expect = 0.22
Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 12/96 (12%)
Query: 268 ITDEHIKMSLKSAVEDKIRR----KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFER 323
I E S + +RR N E+ + LK+ + L +L + ER
Sbjct: 678 IVKE-------SGIAAGVRRIEAVTGNAAIEYLNEQEDKLKELADILKVTPNELPKKIER 730
Query: 324 IDKEKAELERSISFLKEKETELDEILAVLTEKTEVD 359
+E L++ + LK K + +I + + ++
Sbjct: 731 FFEEWKALQKELESLKLKIADQ-KIDELKDKAETIN 765
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 33.5 bits (77), Expect = 0.24
Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 2/88 (2%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQ--EEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
SL S V D +R + EE+E L+ L + + +++ +LE
Sbjct: 61 SLLSEVLDYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEE 120
Query: 334 SISFLKEKETELDEILAVLTEKTEVDVD 361
+S L E+ +L+++L L +D D
Sbjct: 121 RLSELDEELEDLEDLLEELEPLAYLDFD 148
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 33.1 bits (75), Expect = 0.25
Identities = 25/130 (19%), Positives = 40/130 (30%), Gaps = 16/130 (12%)
Query: 155 SSRTKQEPA--TNPTYNGNMASNAPYPTQ-----------SYMPMPGSNVNTPYPTYPPS 201
+S+T P ++PTY + + TQ S G+ T P
Sbjct: 174 TSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQ 233
Query: 202 AYPGYNPNAAAAPSGSIYPSYPTPGATPGAGG--YSPYPLYPPGGSTPATPYPVPTPSPA 259
P + + PS + + G ++ PP S +P+ TP P
Sbjct: 234 TEPPPSQRGPSGSPQHP-PSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPT 292
Query: 260 PAVTSGGTIT 269
G T
Sbjct: 293 TKRQETGRPT 302
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 33.0 bits (75), Expect = 0.26
Identities = 25/120 (20%), Positives = 34/120 (28%), Gaps = 6/120 (5%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPG-SNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYP 220
A P G+ + P G ++V P T P+ +P PS
Sbjct: 172 LAAPPLGEGSADGSCDPALPLSAPRLGPADVFVP-ATPRPTPRTTASPETTPTPST---T 227
Query: 221 SYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSA 280
+ P P P TP P PTP A + T E + L
Sbjct: 228 TSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAP-PTPGGGEAPPANATPAPEASRYELTVT 286
>gnl|CDD|222968 PHA03065, PHA03065, Hypothetical protein; Provisional.
Length = 438
Score = 33.0 bits (76), Expect = 0.27
Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 16/92 (17%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
+K ++ +K R+ + +EEIE L+ L+ M+E I K +LE I
Sbjct: 79 PIKQSLREKRRKASKNTIKRKREEIEKLEDDIKNLDVDDE----MYEEI---KTDLELKI 131
Query: 336 S------FLKEK---ETELDEILAVLTEKTEV 358
FL + L+ LA L E E+
Sbjct: 132 DKLSFQLFLANSNNLKRLLESALARLGENVEI 163
>gnl|CDD|115367 pfam06705, SF-assemblin, SF-assemblin/beta giardin. This family
consists of several eukaryotic SF-assemblin and related
beta giardin proteins. During mitosis the
SF-assemblin-based cytoskeleton is reorganised; it
divides in prophase and is reduced to two dot-like
structures at each spindle pole in metaphase. During
anaphase, the two dots present at each pole are
connected again. In telophase there is an asymmetrical
outgrowth of new fibres. It has been suggested that
SF-assemblin is involved in re-establishing the
microtubular root system characteristic of interphase
cells after mitosis.
Length = 247
Score = 32.8 bits (75), Expect = 0.27
Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 6/66 (9%)
Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFK--LARMFERIDKEKAELERSISFLKEKETEL 345
K+NE EE KQ ++++ + K AR+ RID EK E +S L+ +E+
Sbjct: 129 KVNECVAAIDEERISRKQEQSQILK-KVAEDFARISARIDTEKNTREAEVSALR---SEI 184
Query: 346 DEILAV 351
E+
Sbjct: 185 HEVKGN 190
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 33.4 bits (77), Expect = 0.27
Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355
QE+ ++LK+ + L L ++ ER+ +E ELE+ + LK+K E+ +L+
Sbjct: 710 LQEQEDLLKEIASLLKVPPELLPKV-ERLLEELKELEKELERLKKKL-AAAELADLLSNA 767
Query: 356 TEVDV 360
++
Sbjct: 768 KAEEI 772
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 33.2 bits (76), Expect = 0.27
Identities = 37/168 (22%), Positives = 49/168 (29%), Gaps = 16/168 (9%)
Query: 131 AQNSKLEGLIKTLASAFSALPPLYSSRT---KQEPATNPTYNGNMASN-APYPTQSYMPM 186
+Q + LE L K + AFS L + RT Q P + AP P +
Sbjct: 319 SQIANLEALTKWV-KAFSQLD--FQLRTTSYGQLPLELAVIEALLVPVPAPQPAKPTAAA 375
Query: 187 PGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGST 246
P TP P+ P A N + P P P A P P +
Sbjct: 376 PSPVRPTPAPSTRPKAAAAANIPPKEPVRETATP--PPVPPRPVAPPVPHTPESAPKLTR 433
Query: 247 PATPYPVPTPSPAPA-------VTSGGTITDEHIKMSLKSAVEDKIRR 287
A P PA E ++ K + D R
Sbjct: 434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLEQLEAIWKQILRDVPPR 481
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 32.4 bits (74), Expect = 0.27
Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 4/90 (4%)
Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
K+ K + + ++ +Q+ +EE E K+ E + + + + E +K+K E E
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Query: 333 RSISFLKEKETELDEILAVLTEKTEVDVDE 362
R + +E E K+ V+E
Sbjct: 61 R----KEREEQARKEQEEYEKLKSSFVVEE 86
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 33.1 bits (76), Expect = 0.28
Identities = 24/105 (22%), Positives = 34/105 (32%), Gaps = 17/105 (16%)
Query: 166 PTYNGNMAS----NAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
P +G M + AP P +P+ S + + PT P G+ G + S
Sbjct: 198 PDLSGGMGTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGS 257
Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
PTP A A P + A S + SG
Sbjct: 258 EPTPTAPSSA----------PATAPAAAAPQAAATSSS---ASGN 289
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 33.5 bits (77), Expect = 0.30
Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 7/80 (8%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTE------NELNQGKFKLARMFERIDKE 327
M + S ++K+ K+N + + ++E+E LK T +L++ + L E +KE
Sbjct: 1093 SMPIWSLTKEKVE-KLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKE 1151
Query: 328 KAELERSISFLKEKETELDE 347
A+ +R S K K ++L +
Sbjct: 1152 IAKEQRLKSKTKGKASKLRK 1171
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 33.1 bits (75), Expect = 0.32
Identities = 19/85 (22%), Positives = 37/85 (43%), Gaps = 10/85 (11%)
Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
E + E+ I K +AQ+++ +K+ ++ + + +I K
Sbjct: 914 LETFLEKDQQEKEELISSK-ETSNKKAQDKVNDIKE---KVKNIHGYMKDIENKIQDGKD 969
Query: 330 ELERSISFLKEKETELDEILAVLTE 354
+ +LK+KETEL+ + A L E
Sbjct: 970 D------YLKQKETELNTVNAQLEE 988
Score = 32.7 bits (74), Expect = 0.44
Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344
IR ++ ++ Q + EI+K ENEL+ K +L + + K +L+ I LK ++ +
Sbjct: 222 IRDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEHNLSK-IMKLDNEIKALKSRKKQ 280
Query: 345 LDEILAVLTEKTE 357
+++ + L K E
Sbjct: 281 MEKDNSELELKME 293
Score = 31.9 bits (72), Expect = 0.85
Identities = 25/95 (26%), Positives = 45/95 (47%), Gaps = 9/95 (9%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAE 330
K LK D+IR + + + EIL++ + EL +L ++ +RI + E
Sbjct: 420 KERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQE 479
Query: 331 L---ERSISFLKEK---ETELDEILAVLTEKTEVD 359
L ER +S ++ ET E+ ++ EK ++D
Sbjct: 480 LRKAERELSKAEKNSLTETLKKEVKSLQNEKADLD 514
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 32.2 bits (74), Expect = 0.32
Identities = 19/105 (18%), Positives = 41/105 (39%), Gaps = 26/105 (24%)
Query: 267 TITDEHIKM--SLKSAVEDKIRRKMNEQKVQAQE-EIEILKQTENELNQGKFKLARMFER 323
IT ++++ SLK + + +++ + E E + E ++
Sbjct: 20 DITHNNLELIKSLKEEIAE-MKKNEEHNEKLMAEIAQENKRLVEP------------LKK 66
Query: 324 IDKEKAELERSISF----------LKEKETELDEILAVLTEKTEV 358
++E EL + + LK + EL++ L L ++EV
Sbjct: 67 AEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKELKNLKWESEV 111
>gnl|CDD|221787 pfam12815, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
The C-terminal domain of the transcription elongation
factor protein Spt5 is necessary for binding to Spt4 to
form the functional complex that regulates early
transcription elongation by RNA polymerase II. The
complex may be involved in pre-mRNA processing through
its association with mRNA capping enzymes. This CTD
domain carries a regular nonapeptide repeat that can be
present in up to 18 copies, as in S. pombe. The repeat
has a characteristic TPA motif.
Length = 112
Score = 31.3 bits (71), Expect = 0.33
Identities = 21/74 (28%), Positives = 22/74 (29%), Gaps = 9/74 (12%)
Query: 185 PMPGSNVNTP-YPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPG 243
P G TP Y P Y Y TPGA P +P P G
Sbjct: 46 PAFGDGSRTPAYGGESNGRTPAY-------GGSGSYLGASTPGAAPTPAKSAPTPGAWGG 98
Query: 244 GSTPAT-PYPVPTP 256
TP PTP
Sbjct: 99 APTPGYSADSAPTP 112
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 33.0 bits (75), Expect = 0.35
Identities = 35/179 (19%), Positives = 54/179 (30%), Gaps = 20/179 (11%)
Query: 158 TKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGS 217
T A + Q++ P P G P + A S
Sbjct: 350 TASLTAPSRVLAAAAKVAVIAAPQTH--------TGPADRQRPQRPDGI-PYSVPARS-P 399
Query: 218 IYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPY-PVPTPSPAPAVTS---GGTITDEHI 273
+ P P G SPY PG S P PVP P V + H
Sbjct: 400 MTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHP 459
Query: 274 KMSLKSAVEDKIRRK--MNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
+ + E K +R + E+ ++ + ++ LK+ + L + A I K
Sbjct: 460 Q---EHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHK-SEIKKIAES 514
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 33.0 bits (75), Expect = 0.35
Identities = 33/228 (14%), Positives = 52/228 (22%), Gaps = 17/228 (7%)
Query: 71 TLASAFSALPPLYSSRTKQEPATNPTYNGNIRPTADMRINVSQNVDHTGKIYSPYLHTW- 129
TL A S P + + +P+ G P A + G +
Sbjct: 548 TLIVALSLASPSTAGSPRPKPSLGKFVIGT-DPFAFANTVRLTDNMRGGNGVGSSVKPKG 606
Query: 130 TAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPT----YNGNMASNAPYPTQSYMP 185
+A + L G L A + SS + P
Sbjct: 607 SASSKPLTGPGSDLKPA-TLNGKTPSSSLVGAARNAGASSKVKIPSGLGGFTSPISLLES 665
Query: 186 MPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTP---------GATPGAGGYSP 236
+ + T P G + T TP G + P
Sbjct: 666 ALEDVLTSATSTPVKKNDPYLWDTNGEKAGGGTESASTTDVFQNFAGLNKKTPVGGPFQP 725
Query: 237 YPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDK 284
P + ++P P G D+ + L D
Sbjct: 726 KPPLSRALDSASSPGGSG-GKPGLDGVEGAKGKDDDVVGILSKIKGDS 772
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 33.1 bits (76), Expect = 0.35
Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 5/99 (5%)
Query: 262 VTSGGTITDEHI-KM---SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 317
+ + ++DE I +M + +A DK R++ E + +A+ I L++ E+ + +
Sbjct: 477 IKASSGLSDEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEE 536
Query: 318 ARMFER-IDKEKAELERSISFLKEKETELDEILAVLTEK 355
E I + LE +K K EL E+ L EK
Sbjct: 537 KEKIEEAITDLEEALEGEKEEIKAKIEELQEVTQKLAEK 575
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 32.8 bits (75), Expect = 0.36
Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 5/86 (5%)
Query: 276 SLKSAVEDKI-RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
L A+ + E++ +EE + E + KL ER+++E +EL+R
Sbjct: 389 PLAEALSKVKEEERPREKEGTEEEERREITVYEKRIK----KLEETVERLEEENSELKRE 444
Query: 335 ISFLKEKETELDEILAVLTEKTEVDV 360
+ LK + +L+ L + V
Sbjct: 445 LEELKREIEKLESELERFRREVRDKV 470
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 32.9 bits (76), Expect = 0.36
Identities = 13/90 (14%), Positives = 17/90 (18%)
Query: 172 MASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGA 231
P S T PT + A + P+ P P T
Sbjct: 363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQL 422
Query: 232 GGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
G + P P
Sbjct: 423 LAARQQLQRAQGATKAKKSEPAAASRARPV 452
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 31.7 bits (72), Expect = 0.37
Identities = 17/51 (33%), Positives = 18/51 (35%), Gaps = 7/51 (13%)
Query: 216 GSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
S PSYP P Y Y PP PA PY + P P G
Sbjct: 131 PSTAPSYPGPQ-------YQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQGP 174
Score = 29.0 bits (65), Expect = 3.6
Identities = 19/77 (24%), Positives = 23/77 (29%), Gaps = 14/77 (18%)
Query: 171 NMASNAPYPTQSY-MPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATP 229
M Q+ +P P S P+YP Y GY+P G P Y
Sbjct: 113 VMTRATSTTVQTTPLPQPPST----APSYPGPQYQGYHPMPPQ--PGMPAPPYSLQYP-- 164
Query: 230 GAGGYSPYPLYPPGGST 246
P L P G
Sbjct: 165 -----PPGLLQPQGPPP 176
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 32.3 bits (74), Expect = 0.43
Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 7/77 (9%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENE---LNQGKFKL--ARMFERIDKEK 328
K+S + ++++E + EE LK+ E + + + + E +
Sbjct: 291 KLSGLKNSGEPSLKEIHEAR--LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348
Query: 329 AELERSISFLKEKETEL 345
++LE L++ E L
Sbjct: 349 SKLEEIQKKLEDLEKRL 365
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 32.5 bits (74), Expect = 0.44
Identities = 22/97 (22%), Positives = 22/97 (22%), Gaps = 17/97 (17%)
Query: 179 PTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAA--PSGSIYPSYPT------------ 224
P P Y GY A P PT
Sbjct: 94 EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRP 153
Query: 225 -PGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
PGA P A G P PY P S AP
Sbjct: 154 EPGAWPRAADDYG-WQQQRLGFPPRAPYASPA-SYAP 188
Score = 30.2 bits (68), Expect = 2.7
Identities = 15/72 (20%), Positives = 16/72 (22%), Gaps = 5/72 (6%)
Query: 189 SNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPA 248
S P P P Y Y P A G P + PA
Sbjct: 91 SAGEPAPPPPHARRTSE--PELPR-PGRRPYEGYGGPRADDRPPGLPRQDQLPT--ARPA 145
Query: 249 TPYPVPTPSPAP 260
P P P
Sbjct: 146 YPAYQQRPEPGA 157
Score = 29.0 bits (65), Expect = 5.7
Identities = 28/158 (17%), Positives = 29/158 (18%), Gaps = 28/158 (17%)
Query: 137 EGLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYP 196
EG A P PA P Y P
Sbjct: 119 EGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRA 178
Query: 197 TYPPSAYPGYNPNAAAAPSGSIYPSYPT----------------------PGATPGAGGY 234
Y A P + P Y P PGAG
Sbjct: 179 PYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHV 238
Query: 235 ---SPYPLYPPGGSTPATPYPVPTP---SPAPAVTSGG 266
P P P P PAPA G
Sbjct: 239 HRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGE 276
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 32.6 bits (75), Expect = 0.46
Identities = 18/88 (20%), Positives = 39/88 (44%), Gaps = 18/88 (20%)
Query: 278 KSAVEDKIRRKMNEQKVQAQEEI---EILKQTENELNQGKFKLARMFERIDKEKAELERS 334
S ++ +K+ ++++ +EE E++++ + EL + I+KE+ L
Sbjct: 189 LSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEE-----------IEKERESLLEE 237
Query: 335 ISFLKEKETEL----DEILAVLTEKTEV 358
+ L +K E E L + E+ E
Sbjct: 238 LKELAKKYLEELLALYEYLEIELERAEA 265
Score = 31.4 bits (72), Expect = 1.1
Identities = 21/79 (26%), Positives = 40/79 (50%), Gaps = 3/79 (3%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
DK+R + + +E+ ++ ++ EL + + E+I+KE ELE IS L+ +
Sbjct: 60 DKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEE---ELEKIEKEIKELEEEISELENEI 116
Query: 343 TELDEILAVLTEKTEVDVD 361
EL++ + L D+D
Sbjct: 117 KELEQEIERLEPWGNFDLD 135
>gnl|CDD|107179 PHA02414, PHA02414, hypothetical protein.
Length = 111
Score = 30.6 bits (69), Expect = 0.46
Identities = 19/85 (22%), Positives = 35/85 (41%), Gaps = 2/85 (2%)
Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350
E K+Q E + K EL +L + +DK+ A S + + L+E ++
Sbjct: 17 EDKIQEGELTD--KGDNKELEVAVAELRDIVVSLDKDVAVNSEKQSHIYYQIERLEEKIS 74
Query: 351 VLTEKTEVDVDEAVTTTAPIYKQIL 375
L E + + E T ++ +L
Sbjct: 75 ALAESNKKEDTEKKDTVEKVFMIVL 99
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 32.3 bits (73), Expect = 0.48
Identities = 22/85 (25%), Positives = 41/85 (48%), Gaps = 18/85 (21%)
Query: 274 KMSLKSAVEDKIRRKMNEQKV---QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
++S+ ++V + +++ E K +A EE + L+ L Q +++EK E
Sbjct: 247 EISIMASV-ASVLKELEEAKANLEKAAEEEKSLRNLVESLKQ----------ELEEEKKE 295
Query: 331 LERSISFLKEKETELDEILAVLTEK 355
LE L+EKE E +E + L +
Sbjct: 296 LEE----LREKEGEAEEAASSLEAE 316
Score = 30.7 bits (69), Expect = 1.7
Identities = 22/93 (23%), Positives = 44/93 (47%), Gaps = 7/93 (7%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIE----ILKQTENELNQGKFKLARMFERIDKEKAEL 331
S++E ++ R +E++ +A+E++ L+Q +E + + + E + K K E
Sbjct: 308 EAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEA 367
Query: 332 ERSISFLKEKETELDEILAVLTEKTEVDVDEAV 364
E++ + L+ E L A L E EA+
Sbjct: 368 EQTKAALETAELRL---EAALKEAEAAKAAEAL 397
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 32.2 bits (73), Expect = 0.49
Identities = 30/157 (19%), Positives = 50/157 (31%), Gaps = 15/157 (9%)
Query: 149 ALPPLYSSRT--KQEPATNPTYNGNMA---SNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
+LPP+ S+ T ++ A + + +NA Q+ + V + PT P +
Sbjct: 77 SLPPISSTPTQGQEPLAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVA 136
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA-- 261
P N + A P+ P P + AT V T SPA
Sbjct: 137 PVRNASVPTAER----PAITRPVRAQAVS--EPAVEPKAAKTATATEAKVQTASPAQTPA 190
Query: 262 --VTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQA 296
G +K + S ++ + V
Sbjct: 191 TPPAGKGAAASGQLKSAPSSHYTLQLSASGSYDNVNG 227
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 31.1 bits (71), Expect = 0.50
Identities = 21/93 (22%), Positives = 44/93 (47%), Gaps = 14/93 (15%)
Query: 279 SAVEDKIRRKMNEQKVQAQE-----EIEILKQTEN---------ELNQGKFKLARMFERI 324
K++ + EQ A E E E++K E+ +LN+ K ++A++
Sbjct: 23 EEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLNELKKEIAQLKAEA 82
Query: 325 DKEKAELERSISFLKEKETELDEILAVLTEKTE 357
+ +AEL + +E++ L++ L+ L ++ E
Sbjct: 83 ESAQAELSEAEESWEEQKKMLEDELSELEKRIE 115
Score = 30.3 bits (69), Expect = 0.70
Identities = 14/57 (24%), Positives = 28/57 (49%), Gaps = 4/57 (7%)
Query: 288 KMNEQKVQAQEEIEILK----QTENELNQGKFKLARMFERIDKEKAELERSISFLKE 340
+ +Q + ++EI LK + EL++ + + ++ E +ELE+ I L E
Sbjct: 63 ALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNE 119
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 32.7 bits (74), Expect = 0.50
Identities = 21/97 (21%), Positives = 36/97 (37%), Gaps = 22/97 (22%)
Query: 280 AVEDKIRRKMNEQK------VQAQEEIEILKQ-----------TENELNQGKFKLARMFE 322
A + I K + K +A+ EI L+ + E+ + LA M
Sbjct: 228 ASQKLIDEKDTKIKEFEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKA 287
Query: 323 RIDKEKAELERSISFLKEKETELDEILAVLTEKTEVD 359
+ D+ EL R + L +TEL+ + + D
Sbjct: 288 KCDRAAQELSRKKTELLGLQTELETL-----ANQDSD 319
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 32.6 bits (75), Expect = 0.50
Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 13/90 (14%)
Query: 273 IKMSLKSAVE--DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
I+M + S E D++ R++ +Q + E E LK+ ++E ++ R+ E ++KE AE
Sbjct: 396 IRMEIDSKPEELDELDRRI----IQLEIEREALKKEKDEASK-----ERL-EDLEKELAE 445
Query: 331 LERSISFLKEK-ETELDEILAVLTEKTEVD 359
LE + L+E+ + E I + K E++
Sbjct: 446 LEEEYADLEEQWKAEKAAIQGIQQIKEEIE 475
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 31.0 bits (70), Expect = 0.50
Identities = 26/68 (38%), Positives = 28/68 (41%), Gaps = 3/68 (4%)
Query: 193 TPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYP 252
TPYP PP + P Y PN+ P P PG PG P PPG P YP
Sbjct: 57 TPYPPSPPVSDPRYYPNSNYFPPPPGSTPVPPPGPQPGYNPADYPP--PPGAVPPPQNYP 114
Query: 253 VPTPSPAP 260
P P P
Sbjct: 115 YP-PGPGQ 121
Score = 30.2 bits (68), Expect = 0.84
Identities = 24/62 (38%), Positives = 29/62 (46%), Gaps = 5/62 (8%)
Query: 178 YPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTP-GATPGAGGYSP 236
YP +Y P P + TP P PP PGYNP P G++ P P PG Y+P
Sbjct: 71 YPNSNYFPPPPGS--TPVP--PPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAP 126
Query: 237 YP 238
P
Sbjct: 127 RP 128
>gnl|CDD|177114 MTH00039, COX3, cytochrome c oxidase subunit III; Validated.
Length = 260
Score = 32.0 bits (73), Expect = 0.51
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
+PP G P P+ VP + A ++SG TIT H
Sbjct: 115 WPPTGINPINPFLVPLLNTAVLLSSGVTITWSH 147
>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
Length = 663
Score = 32.5 bits (74), Expect = 0.51
Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 7/86 (8%)
Query: 262 VTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEIL-KQTENELNQGKFKLARM 320
+ S G ++DE I+ +K A E K + + ++ V A+ E E L E +L+ K K++
Sbjct: 535 IQSSGGLSDEEIEKMVKEAEEYKEQDEKKKELVDAKNEAETLIYSVEKQLSDLKDKIS-- 592
Query: 321 FERIDKEKAELERSISFLKEKETELD 346
D +K EL++ I+ L+ + D
Sbjct: 593 ----DADKDELKQKITKLRSTLSSED 614
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 31.8 bits (72), Expect = 0.53
Identities = 25/124 (20%), Positives = 52/124 (41%), Gaps = 9/124 (7%)
Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQE----EIEILKQTENELNQGKFKLARMF- 321
T + K+ L+ E + K +KV+ EI+ L++ +EL + ++ +
Sbjct: 79 TFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLL 138
Query: 322 ----ERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNT 377
ER+++ K + + L+E E EL+E+ ++ ++ K+ L
Sbjct: 139 YLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEE 198
Query: 378 FTEE 381
EE
Sbjct: 199 LLEE 202
>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
Length = 3095
Score = 32.6 bits (74), Expect = 0.54
Identities = 17/112 (15%), Positives = 28/112 (25%), Gaps = 6/112 (5%)
Query: 209 NAAAAPSGSIYPSYPTPGATPGAGG-YSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGT 267
A P YP+ T G G Y P T P
Sbjct: 2634 EDRAVPENVEYPALATQGDQDGFVSKVGSAMKYKPPNVTAMAQ-----SIVQPDTQKSSI 2688
Query: 268 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLAR 319
I+++ + R+K N + + + E+ + R
Sbjct: 2689 KKKTDIRLAGYRTMPADNRQKGNGLRHKPKTGRSHTTIKNVEVYPSPPRWTR 2740
>gnl|CDD|237695 PRK14376, PRK14376, hypothetical protein; Provisional.
Length = 176
Score = 31.5 bits (71), Expect = 0.55
Identities = 13/36 (36%), Positives = 13/36 (36%), Gaps = 2/36 (5%)
Query: 231 AGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
AGGY P P PG S P V P G
Sbjct: 102 AGGYDPVPGEAPGTS--VGPGTVRKPRGRTRTAGGP 135
Score = 28.4 bits (63), Expect = 5.0
Identities = 15/60 (25%), Positives = 20/60 (33%), Gaps = 5/60 (8%)
Query: 200 PSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPA 259
P + GY+P AP S PG G + P G TP+ P+
Sbjct: 99 PLSAGGYDPVPGEAPGTS-----VGPGTVRKPRGRTRTAGGPVGAVDAGTPHRSRRPTGG 153
>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
Length = 238
Score = 31.6 bits (72), Expect = 0.55
Identities = 21/66 (31%), Positives = 32/66 (48%), Gaps = 9/66 (13%)
Query: 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFE----RIDKEKAELERSISFLKEKETEL 345
+ Q ++E+E LKQ ELN ++A F+ R +E+ +L L+ K L
Sbjct: 66 AARLAQLEQELESLKQELEELNSQYMRIAADFDNFRKRTSREQEDLR-----LQLKCNTL 120
Query: 346 DEILAV 351
EIL V
Sbjct: 121 SEILPV 126
>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein. This family includes the
hamartin protein which is thought to function as a
tumour suppressor. The hamartin protein interacts with
the tuberin protein pfam03542. Tuberous sclerosis
complex (TSC) is an autosomal dominant disorder and is
characterized by the presence of hamartomas in many
organs, such as brain, skin, heart, lung, and kidney. It
is caused by mutation either TSC1 or TSC2 tumour
suppressor gene. TSC1 encodes a protein, hamartin,
containing two coiled-coil regions, which have been
shown to mediate binding to tuberin. The TSC2 gene codes
for tuberin pfam03542. These two proteins function
within the same pathway(s) regulating cell cycle, cell
growth, adhesion, and vesicular trafficking.
Length = 667
Score = 32.2 bits (73), Expect = 0.56
Identities = 33/191 (17%), Positives = 57/191 (29%), Gaps = 24/191 (12%)
Query: 148 SALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPY----PTYPPSAY 203
S+ P S R P+ + N + + + S P + + PS+
Sbjct: 279 SSAYPNSSPRQALPPSISLPQNTSTSGSLHSAQTSRRPNTTFDKAASSGTKDSLWSPSSL 338
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
G A PS G +P SP L T + P P+ +
Sbjct: 339 CGM----ATPPSS--------IGMSPLILSLSPSHLSGRAPGTTGSGKGEPASESTPSTS 386
Query: 264 SGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEI---EILKQTENELNQGKFKLARM 320
+ I ++ + RK Q + ++ E L+ E G A
Sbjct: 387 PPPPGLADDIVRAIFATSSRSAPRKEELQNESSFPKLVRQENLQNIEKSAEGGILDAA-- 444
Query: 321 FERIDKEKAEL 331
+ +E +L
Sbjct: 445 ---VTEELLKL 452
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 32.2 bits (73), Expect = 0.57
Identities = 20/67 (29%), Positives = 23/67 (34%), Gaps = 7/67 (10%)
Query: 208 PNAAAAPSGSIYPSYPTPGATPG-------AGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
P AP GS P P ATP G +P L + PA P + A
Sbjct: 343 PLLPDAPEGSSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHADPAVAQPTQAATLAG 402
Query: 261 AVTSGGT 267
A G T
Sbjct: 403 AAPKGAT 409
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 31.7 bits (73), Expect = 0.59
Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338
+++ +K +A+EE+E+L++ + L +G K AR E D+E L +S+ L
Sbjct: 141 ERLEKKAKSGDKEAKEELELLEKIKEHLEEG--KPARTLELTDEEIKIL-KSLQLL 193
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.4 bits (73), Expect = 0.60
Identities = 29/124 (23%), Positives = 44/124 (35%), Gaps = 10/124 (8%)
Query: 149 ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNT---PYPTYPPSAYPG 205
AL PL S T P+ +P M + P + P + + T+P S
Sbjct: 386 ALKPLSSLPTHHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHS 445
Query: 206 YNPNAAAA-----PSG--SIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSP 258
P + A G +I P P +TP A + PPG + P++ P
Sbjct: 446 GPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPP 505
Query: 259 APAV 262
P +
Sbjct: 506 LPPI 509
Score = 30.0 bits (67), Expect = 2.7
Identities = 39/153 (25%), Positives = 51/153 (33%), Gaps = 33/153 (21%)
Query: 148 SALPPLYSSRTKQEPATNPTYNGNMASNA-PYPTQSYMPMPGSNVNTPYPT--------- 197
LPP S + P T P S+ P Q P P N P P
Sbjct: 335 QPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLP 394
Query: 198 --YPPSAYP----------GYNPNAAAAPSGSIYPSYPTPGAT-PGAGGY-----SPYPL 239
+PPSA+P A P + S P +T P +G + SP+
Sbjct: 395 THHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQ 454
Query: 240 YP--PGGSTPATPYPVP---TPSPAPAVTSGGT 267
+P GG P P TP+ P +SG
Sbjct: 455 HPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQ 487
>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
complex, found in both eukaryotes and archaea, that
binds and stabilizes newly synthesized polypeptides
allowing them to fold correctly. The complex contains
two alpha and four beta subunits, the two subunits being
evolutionarily related. In archaea, there is usually
only one gene for each subunit while in eukaryotes there
two or more paralogous genes encoding each subunit
adding heterogeneity to the structure of the hexamer.
The structure of the complex consists of a double beta
barrel assembly with six protruding coiled-coils.
Length = 129
Score = 30.8 bits (70), Expect = 0.60
Identities = 23/99 (23%), Positives = 43/99 (43%), Gaps = 16/99 (16%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMF---ERIDKEKA- 329
++ K+ ++ E + +A+E +E LK+ E E A +F E D +K
Sbjct: 14 QLEALQQQLQKLEAQLTEYE-KAKETLETLKKAEEEKELLVPLGAGLFVKAEVKDDDKVL 72
Query: 330 -----------ELERSISFLKEKETELDEILAVLTEKTE 357
LE +I FLK++ L++ + L ++ E
Sbjct: 73 VDLGTGVYVEKSLEEAIEFLKKRLETLEKQIEKLEKQLE 111
Score = 30.0 bits (68), Expect = 1.2
Identities = 20/103 (19%), Positives = 37/103 (35%), Gaps = 18/103 (17%)
Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 346
+++ Q Q Q+++E L+Q +L +K K LE +EKE +
Sbjct: 2 QELAAQLQQLQQQLEALQQQLQKLEA-------QLTEYEKAKETLETLKKAEEEKELLVP 54
Query: 347 EILAVLTEKTEVDVDEAVTTTAPIYKQI-LNTFTE---EAATE 385
+ + D D+ + + + E E A E
Sbjct: 55 LGAGLFVKAEVKDDDK-------VLVDLGTGVYVEKSLEEAIE 90
>gnl|CDD|218545 pfam05300, DUF737, Protein of unknown function (DUF737). This
family consists of several uncharacterized mammalian
proteins of unknown function.
Length = 187
Score = 31.3 bits (71), Expect = 0.62
Identities = 33/182 (18%), Positives = 59/182 (32%), Gaps = 42/182 (23%)
Query: 220 PSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKS 279
PS P+ P P + + + +
Sbjct: 26 PSQPSSKPQPSP------GFGPSSVPESELRRLIAGELKGALEDAKRPSEETAGGLQSSE 79
Query: 280 AVEDKIRRKMNEQKVQAQEEIEILKQTENE----------------LNQGKFKLARMFER 323
ED ++R EQ QEE+ + + E E Q + K + +
Sbjct: 80 VKEDLLKRYEQEQ-AIVQEELARIAKREREAAEEQLSRAVLREKASAEQERQKAKHLARQ 138
Query: 324 IDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAA 383
+++++AEL+R +F KE+ LA L EK + YK + +AA
Sbjct: 139 LEEKEAELKRLDAFYKEQ-------LARLEEK-----------NSEFYKLTSEQY-HKAA 179
Query: 384 TE 385
++
Sbjct: 180 SK 181
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 32.1 bits (73), Expect = 0.63
Identities = 9/53 (16%), Positives = 12/53 (22%)
Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
A + P P L S + P T + AP
Sbjct: 340 LALGGAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPL 392
Score = 31.3 bits (71), Expect = 1.1
Identities = 16/81 (19%), Positives = 20/81 (24%)
Query: 185 PMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
P G P P N S P+ A + P P PP
Sbjct: 346 PSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAP 405
Query: 245 STPATPYPVPTPSPAPAVTSG 265
+ P +PA A
Sbjct: 406 EPARSAEAPPLVAPAAAPAGL 426
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.0 bits (73), Expect = 0.65
Identities = 14/62 (22%), Positives = 27/62 (43%)
Query: 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 343
+++ + + E L +E + KLA++ E K A+L +S ++K
Sbjct: 168 ATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLE 227
Query: 344 EL 345
EL
Sbjct: 228 EL 229
Score = 29.7 bits (67), Expect = 3.3
Identities = 32/151 (21%), Positives = 61/151 (40%), Gaps = 33/151 (21%)
Query: 279 SAVEDKIRRKMNE-QKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAELERS 334
+A+E KIR + ++ K++ Q LK E E+ + +L +++ K+ A+L
Sbjct: 48 AALEKKIREQQDQRAKLEKQ-----LKSLETEIASLEAQLIETADDLKKLRKQIADLNAR 102
Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNT------FTEEAATED-- 386
++ L+ +E E LA A + + N E+A
Sbjct: 103 LNALEVQEREQRRRLAEQ--------------LAALQRSGRNPPPALLVSPEDAQRSVRL 148
Query: 387 TIYY--MGEALRRGVIDLEVYLKQVRSLSRK 415
IYY + A + L+ LKQ+ ++ +
Sbjct: 149 AIYYGALNPARAERIDALKATLKQLAAVRAE 179
Score = 29.3 bits (66), Expect = 3.8
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 8/60 (13%)
Query: 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDE 362
LKQ + E+ + K+ ++A+LE+ + K ETE+ + A L E T D+ +
Sbjct: 40 LKQIQKEIAALEKKIRE----QQDQRAKLEKQL---KSLETEIASLEAQLIE-TADDLKK 91
Score = 29.3 bits (66), Expect = 4.9
Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 10/76 (13%)
Query: 273 IKMSLKSAVEDKIRRKMNEQKVQA--QEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
+ +S + A + +R + + E I+ LK T +L + ++A E+AE
Sbjct: 135 LLVSPEDA-QRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAA-------EQAE 186
Query: 331 LERSISFLKEKETELD 346
L +S + ++ +L
Sbjct: 187 LTTLLSEQRAQQAKLA 202
Score = 28.1 bits (63), Expect = 10.0
Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 3/60 (5%)
Query: 289 MNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI 348
++Q Q Q+EI L++ E + KL + + ++ E A LE L E +L ++
Sbjct: 36 DDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQ---LIETADDLKKL 92
>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80. Spc24 is a component of
the evolutionarily conserved kinetochore-associated
Ndc80 complex and is involved in chromosome segregation.
Length = 117
Score = 30.4 bits (69), Expect = 0.66
Identities = 20/64 (31%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
Query: 309 ELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTA 368
EL KF+LA + ++ ++EL S L+E + L+E+ E EVD DE +
Sbjct: 3 ELENEKFRLA---KELNDLESELNNLQSELEELKERLEELEEQEVEGDEVDEDETPVDDS 59
Query: 369 PIYK 372
+ K
Sbjct: 60 NVLK 63
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 31.7 bits (72), Expect = 0.66
Identities = 27/112 (24%), Positives = 43/112 (38%), Gaps = 27/112 (24%)
Query: 286 RRKMNEQKVQAQEEI-----EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK- 339
+R+ NE+ ++ QEE + + TE E+ + M +E+AELER K
Sbjct: 127 QRRQNEELLKMQEESVLRQEAMRRATEEEILE-------MRRETIEEEAELERENIRAKI 179
Query: 340 ----------EKETELDEILAVLTEKTEVD---VDEAVTTTAPIYKQILNTF 378
E+E E D +L K + V E++ TT
Sbjct: 180 EAEARGRAKEERENE-DINREMLKLKANEERETVLESIKTTFSHIGGGFRAL 230
>gnl|CDD|203570 pfam07058, Myosin_HC-like, Myosin II heavy chain-like. This family
represents a conserved region within a number of myosin
II heavy chain-like proteins that seem to be specific to
Arabidopsis thaliana.
Length = 351
Score = 31.9 bits (72), Expect = 0.66
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 13/68 (19%)
Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKF----------KLARMFERIDKEKAEL 331
E+KI KM+ QKV E+E L Q+ EL + R F+ +++E+ L
Sbjct: 22 ENKILDKMHRQKVA---EVEKLSQSVRELEEAVLAGGAAANAVRDYQRKFQEMNEERKTL 78
Query: 332 ERSISFLK 339
ER ++ K
Sbjct: 79 ERELARAK 86
>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
family. Included in this family of heterogeneous
ribonucleoproteins are PTB (polypyrimidine tract binding
protein ) and hnRNP-L. These proteins contain four RNA
recognition motifs (rrm: pfam00067).
Length = 481
Score = 32.1 bits (73), Expect = 0.67
Identities = 18/79 (22%), Positives = 22/79 (27%), Gaps = 6/79 (7%)
Query: 196 PTYPPSAYPGYNPNAAAAPSGSIYPSY--PTPGATPGAGGYS-PYPLYPPGGSTP---AT 249
P + PS + Y P AGG P PP P A
Sbjct: 199 PGLDQTHRQRQPALLGQHPSSYGHDGYSSHGGPLAPLAGGDRMGPPHGPPSRYRPAYEAA 258
Query: 250 PYPVPTPSPAPAVTSGGTI 268
P S PA G++
Sbjct: 259 PLAPAISSYGPAGGGPGSV 277
>gnl|CDD|185320 PRK15422, PRK15422, septal ring assembly protein ZapB; Provisional.
Length = 79
Score = 29.7 bits (66), Expect = 0.68
Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 291 EQKVQ-AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
E KVQ A + I +L+ EL + L++ + ++ ELER + LKE++ E L
Sbjct: 10 EAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNGWQERL 69
Query: 350 AVLTEKTE 357
L + E
Sbjct: 70 QALLGRME 77
>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
Members of this protein family resemble the widely
distributed ATP-dependent protease La, also called Lon
and LonA. It resembles even more closely LonB, which is
a LonA paralog found in genomes if and only if the
species is capable of endospore formation (as in
Bacillus subtilis, Clostridium tetani, and select other
members of the Firmicutes) and expressed specifically in
the forespore compartment. Members of this family are
restricted to a subset of spore-forming species, and are
very likely to participate in the program of endospore
formation. We propose the designation LonC [Protein
fate, Degradation of proteins, peptides, and
glycopeptides, Cellular processes, Sporulation and
germination].
Length = 615
Score = 32.1 bits (73), Expect = 0.68
Identities = 24/97 (24%), Positives = 48/97 (49%), Gaps = 7/97 (7%)
Query: 253 VPTPSPAPAV--TSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEI--EILKQTEN 308
VP P P + + I D + ++++ +E K+ +M E++ + EEI ++LK+ +
Sbjct: 60 VPDPRELPEILEDTEDHIADILARRTVENRIERKVETRMQERQNKYLEEIRLQVLKEEKG 119
Query: 309 ELNQGKFKLARMFERIDKEKAELERSI-SFLKEKETE 344
N K E+++K+K L +S S L+ +
Sbjct: 120 PENSSTLKKLERLEKLEKKK--LHKSAQSLLRPRAFS 154
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is found
to the N-terminus of the pfam00170 transcription factor
domain. It is between 150 and 200 amino acids in length.
The N-terminal half is rather rich in proline residues
and has been termed the PRD (proline rich domain),
whereas the C-terminal half is more polar and has been
called the MFMR (multifunctional mosaic region). It has
been suggested that this family is composed of three
sub-families called A, B and C, classified according to
motif composition. It has been suggested that some of
these motifs may be involved in mediating
protein-protein interactions. The MFMR region contains a
nuclear localisation signal in bZIP opaque and GBF-2.
The MFMR also contains a transregulatory activity in
TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
signals.
Length = 189
Score = 31.4 bits (71), Expect = 0.68
Identities = 27/85 (31%), Positives = 34/85 (40%), Gaps = 13/85 (15%)
Query: 198 YPPSAYPGY-NPNAAAAPSGSIYPSYPTPGATPGAGGYSPYP-LYPPGG----------S 245
Y P P Y N + A++P Y P P G PY +YPPGG S
Sbjct: 38 YGPRPPPPYFNSSVASSPQPHPYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHPSMPPGS 97
Query: 246 TPATPYPVPTPSPAPAVTSGGTITD 270
P +PY +P+ P T TD
Sbjct: 98 HPFSPYAMPSAEV-PGSTPLSMETD 121
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 32.2 bits (73), Expect = 0.69
Identities = 31/116 (26%), Positives = 41/116 (35%), Gaps = 13/116 (11%)
Query: 233 GYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQ 292
G S + GG+ +P P S SL+ VE+ M
Sbjct: 3 GESNTTVMADGGNQSILRSDLPKFLSQPLEGSHHLGVSTSALSSLQKQVEES----MTLL 58
Query: 293 KVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK-ETELD 346
+ + L Q ENEL Q + +R + ELER S L E E ELD
Sbjct: 59 QRAELIRSKSKLIQLENELMQKE----LEHKR---AQIELERKASTLAENYERELD 107
>gnl|CDD|184943 PRK14981, PRK14981, DNA-directed RNA polymerase subunit F;
Provisional.
Length = 112
Score = 30.2 bits (69), Expect = 0.69
Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 8/73 (10%)
Query: 284 KIRRKMNEQKVQAQEEIEILKQ--TENELN--QGK-FKLARMFERIDKEKAELERSISFL 338
++ + E+ + E EIL + E EL+ + F ++D E A E + L
Sbjct: 2 IGKKILEEEYITIAEAKEILSEIEEERELSYELRRTLDYLNRFSKLDPEDA--EELVEEL 59
Query: 339 KEKETELDEILAV 351
E E ++ E AV
Sbjct: 60 LELE-KMKEKTAV 71
>gnl|CDD|191033 pfam04576, Zein-binding, Zein-binding. This domain binds to zein
proteins, pfam01559. Zein proteins are seed storage
proteins.
Length = 94
Score = 30.0 bits (68), Expect = 0.70
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 22/75 (29%)
Query: 269 TDEHIKMSL-----KSAVE---DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM 320
E + M L K+A+E + +R E+ QEEIE+L+ +
Sbjct: 33 ASEAMAMILRLQEEKAALEMEARQYQRMAEEKMEYDQEEIELLEDL-------------L 79
Query: 321 FERIDKEKAELERSI 335
+ER ++E A LE+ +
Sbjct: 80 YER-EREIAALEKEL 93
>gnl|CDD|191331 pfam05645, RNA_pol_Rpc82, RNA polymerase III subunit RPC82. This
family consists of several DNA-directed RNA polymerase
III polypeptides which are related to the Saccharomyces
cerevisiae RPC82 protein. RNA polymerase C (III)
promotes the transcription of tRNA and 5S RNA genes. In
Saccharomyces cerevisiae, the enzyme is composed of 15
subunits, ranging from 160 to about 10 kDa.
Length = 215
Score = 31.2 bits (71), Expect = 0.70
Identities = 28/136 (20%), Positives = 50/136 (36%), Gaps = 24/136 (17%)
Query: 252 PVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELN 311
VP+P P + +EH++ + +S+ +I+R+ + E ++ + E L
Sbjct: 14 RVPSPHFRPIGDLWNQLYEEHLRSTPRSSTTSEIKRRAEAKVKAKTELRDLRESPEILLK 73
Query: 312 QGKFKLARMFERIDKEKAE------------LERSISFLKEKETELDEILAVLTEKTEVD 359
LA + +K LER L+ DE L L +
Sbjct: 74 GLGRSLADDTDNPTGKKRTKILDPGLVWRVNLERFHKHLR------DEALVDLAKSR--- 124
Query: 360 VDEAVTTTAPIYKQIL 375
+ TTA +Y+ L
Sbjct: 125 IGS---TTAEVYRAAL 137
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 31.0 bits (70), Expect = 0.71
Identities = 15/35 (42%), Positives = 19/35 (54%), Gaps = 4/35 (11%)
Query: 230 GAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTS 264
GAGG P ST + YPVP+PS +P+ S
Sbjct: 71 GAGGSYSVPSGD--TSTYS--YPVPSPSYSPSPGS 101
>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding. The F-actin binding
domain forms a compact bundle of four antiparallel
alpha-helices, which are arranged in a left-handed
topology. Binding of F-actin to the F-actin binding
domain may result in cytoplasmic retention and
subcellular distribution of the protein, as well as
possible inhibition of protein function.
Length = 179
Score = 31.2 bits (70), Expect = 0.71
Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 6/45 (13%)
Query: 223 PTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGT 267
P P A P + P G TP +P P+P+ SP+P+ + GT
Sbjct: 6 PVPPAVPKPQSTAK----PVG--TPPSPVPLPSTSPSPSKMANGT 44
>gnl|CDD|212016 cd11626, HR1_ROCK, Protein kinase C-related kinase homology region
1 (HR1) Rho-binding domain of Rho-associated coiled-coil
containing protein kinase. ROCK is also referred to as
Rho-associated kinase or simply as Rho kinase. It is a
serine/threonine protein kinase that is activated via
interaction with Rho GTPases and is involved in many
cellular functions including contraction, adhesion,
migration, motility, proliferation, and apoptosis. ROCKs
are the best-described effectors of RhoA. There are two
isoforms, ROCK1 and ROCK2, which may be functionally
redundant in some systems, but exhibit different tissue
distributions. Both isoforms are ubiquitously expressed
in most tissues, but ROCK2 is more prominent in brain
and skeletal muscle while ROCK1 is more pronounced in
the liver, testes, and kidney. Studies in knockout mice
result in different phenotypes, suggesting that the two
isoforms do not compensate for each other during
embryonic development. ROCK contains an N-terminal
extension, a catalytic kinase domain, and a long
C-terminal extension, which contains a Rho-binding HR1
domain and a pleckstrin homology (PH) domain. ROCK is
auto-inhibited by HR1 and PH domains interacting with
the catalytic domain. HR1 domains are anti-parallel
coiled-coil (ACC) domains that bind small GTPases from
the Rho family.
Length = 66
Score = 29.2 bits (65), Expect = 0.72
Identities = 14/68 (20%), Positives = 33/68 (48%), Gaps = 4/68 (5%)
Query: 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344
++ + E + +A E E + EN++ L E + K+K E +++ ++ + +
Sbjct: 3 LQHRQQEYQRKADMEAEKRRNVENDVAA----LKDQLEDLKKQKQESQKAEEKARQLQKQ 58
Query: 345 LDEILAVL 352
L+E +L
Sbjct: 59 LEEANRLL 66
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 31.1 bits (71), Expect = 0.73
Identities = 23/81 (28%), Positives = 35/81 (43%), Gaps = 4/81 (4%)
Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
++ E+ + QE++ L+Q EL Q L ER+ KE A +++ L ELDE
Sbjct: 77 ELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQ----LSANAIELDE 132
Query: 348 ILAVLTEKTEVDVDEAVTTTA 368
L E+ E A
Sbjct: 133 ENRELREELAELKQENEALEA 153
>gnl|CDD|214562 smart00212, UBCc, Ubiquitin-conjugating enzyme E2, catalytic domain
homologues. Proteins destined for proteasome-mediated
degradation may be ubiquitinated. Ubiquitination follows
conjugation of ubiquitin to a conserved cysteine residue
of UBC homologues. This pathway functions in regulating
many fundamental processes required for cell
viability.TSG101 is one of several UBC homologues that
lacks this active site cysteine.
Length = 145
Score = 30.7 bits (70), Expect = 0.78
Identities = 21/82 (25%), Positives = 31/82 (37%), Gaps = 10/82 (12%)
Query: 2 TNLSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLH--TWT 59
T G V+ + I + +P P I NVD +G+I L W+
Sbjct: 40 TPYEGGVFKL--TIEFPEDYPFKPPKVKFITK----IY-HPNVDSSGEICLDILKQEKWS 92
Query: 60 PQNSKLEGLIKTLASAFSALPP 81
P LE ++ +L S S P
Sbjct: 93 P-ALTLETVLLSLQSLLSEPNP 113
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 31.9 bits (72), Expect = 0.79
Identities = 19/98 (19%), Positives = 49/98 (50%), Gaps = 7/98 (7%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
E+ ++ K +E++ ++ K+ +E I++L++ + + ++ + ++KE+
Sbjct: 212 EYYQLKEKLELEEENLLYLDYLKLN-EERIDLLQELLRDEQE---EIESSKQELEKEEEI 267
Query: 331 LERSISFLKEKETE---LDEILAVLTEKTEVDVDEAVT 365
L + + KE+E E +E L +L ++ E E +
Sbjct: 268 LAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLK 305
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 31.8 bits (72), Expect = 0.80
Identities = 27/91 (29%), Positives = 39/91 (42%), Gaps = 15/91 (16%)
Query: 282 EDKIRRKMNEQK-----VQAQ----EEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
E+K R+K E+ QA E E LKQ + +L KL + ++E LE
Sbjct: 501 EEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLE 560
Query: 333 RSISFLK-----EKETE-LDEILAVLTEKTE 357
+ L+ EKETE L L + +K
Sbjct: 561 KEAQELRKYQESEKETEVLMSALQAMQDKNL 591
>gnl|CDD|227507 COG5180, PBP1, Protein interacting with poly(A)-binding protein
[RNA processing and modification].
Length = 654
Score = 31.6 bits (71), Expect = 0.81
Identities = 24/93 (25%), Positives = 32/93 (34%), Gaps = 11/93 (11%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP----NAAAAPSGS 217
P NP NM +P M P ++ N + + Y P PS
Sbjct: 510 PGMNPAMGMNMGGMMGFP----MGGPSASPNPMMNGFAAGSMGMYMPFQPQPMFYHPSPQ 565
Query: 218 IYPSYPTPGATPGAGGYSPYPLYPPGGSTPATP 250
+ P + GA G G SP + P G A P
Sbjct: 566 MMPVMGSNGAEEGGGNISP---HVPAGFMAAGP 595
>gnl|CDD|206350 pfam14182, YgaB, YgaB-like protein. The YgaB-like protein family
includes the B. subtilis YgaB protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are
approximately 90 amino acids in length.
Length = 79
Score = 29.2 bits (66), Expect = 0.86
Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 12/61 (19%)
Query: 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKT 356
Q E+E ++ E EL ++KAELE + EL EI + ++T
Sbjct: 20 QSELERCQEIEKEL------------LELEKKAELESIQQEISRMRKELKEIQELFEKQT 67
Query: 357 E 357
E
Sbjct: 68 E 68
>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
(DUF2317). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 541
Score = 31.8 bits (73), Expect = 0.86
Identities = 17/81 (20%), Positives = 34/81 (41%), Gaps = 6/81 (7%)
Query: 281 VEDKIRRKMNEQKVQAQEEIEILKQTENELNQ---GKFKLARMFERIDKEKAELERSISF 337
+E K+ +K+ K+ E ++ L + + + + ++ KAELE
Sbjct: 384 IEPKVEKKL--DKLGLSIE-DVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKR 440
Query: 338 LKEKETELDEILAVLTEKTEV 358
L+E+ ++D L E E
Sbjct: 441 LREEAAKIDPSLEGALEANEA 461
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 31.2 bits (70), Expect = 0.87
Identities = 26/98 (26%), Positives = 29/98 (29%), Gaps = 6/98 (6%)
Query: 150 LPPLYSSRTKQ-EPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
LPP + PA PT N P Q PG+ PT P P
Sbjct: 8 LPPGLAGLVANVPPAAAPTPQPNPVIQPQAPVQP--GQPGAPQQLAIPTQQPQPVPTSAM 65
Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGST 246
P+ P P A P AG P L P
Sbjct: 66 TPHVVQQA---PAQPAPAAPPAAGAALPEALEVPPPPA 100
Score = 28.9 bits (64), Expect = 5.0
Identities = 19/72 (26%), Positives = 23/72 (31%), Gaps = 10/72 (13%)
Query: 199 PPSAYPGYNPNAAAAPSGSIYPSYP-------TPGATPGAGGYSPY-PLYPPGGSTPATP 250
PP+A P PN P + P P P P S P PA P
Sbjct: 20 PPAAAPTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHVVQQA--PAQP 77
Query: 251 YPVPTPSPAPAV 262
P P+ A+
Sbjct: 78 APAAPPAAGAAL 89
>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p. In Saccharomyces
cerevisiae, Sec2p is a GDP/GTP exchange factor for
Sec4p, which is required for vesicular transport at the
post-Golgi stage of yeast secretion.
Length = 90
Score = 29.2 bits (66), Expect = 0.90
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 12/80 (15%)
Query: 280 AVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLA-RMFERIDKEKAELE----RS 334
A E K R + ++K + ++E+E +L F A +M +E+ +E +
Sbjct: 4 AEEKKKRLRAEKEKKKLEQEVE-------DLTASLFDEANKMVADARREREAVEIKNEKL 56
Query: 335 ISFLKEKETELDEILAVLTE 354
LKEKET LD + A L E
Sbjct: 57 EEQLKEKETLLDSLQAQLKE 76
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 31.0 bits (70), Expect = 0.91
Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 15/83 (18%)
Query: 283 DKIRRKMNEQKVQAQEEIE-----------ILKQTENELNQGKFKLARMFERIDKEKAEL 331
DK ++ E+K+Q EE+ +KQ+++EL + K+ARM E L
Sbjct: 38 DKEMEELLEEKMQHVEELRQIHADINDMETEIKQSKSELERRMGKIARMHG----EYNPL 93
Query: 332 ERSISFLKEKETELDEILAVLTE 354
+ SI+ +++ E L+E+ + E
Sbjct: 94 KESINEMRKLELGLEELPQLHEE 116
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 31.4 bits (71), Expect = 0.93
Identities = 18/98 (18%), Positives = 28/98 (28%), Gaps = 9/98 (9%)
Query: 181 QSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGAT----PGAGGYSP 236
S + + P PS +P + + + Y + A+ P P
Sbjct: 134 ASLQAARKEGLPSDSPAPAPSPRGRPHPASGSGRLSASYLQDLSTSASECIDPSV--VFP 191
Query: 237 YPL---YPPGGSTPATPYPVPTPSPAPAVTSGGTITDE 271
YPL TP P + +SG E
Sbjct: 192 YPLNERSKSSKVASPTPRLGLRTPPNSSSSSGSDSESE 229
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 31.8 bits (72), Expect = 0.93
Identities = 14/81 (17%), Positives = 38/81 (46%), Gaps = 7/81 (8%)
Query: 282 EDKIRR---KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA-ELERSISF 337
E+K NE+K + +E+I ++ ++++ L + E ++ E+ + +
Sbjct: 464 EEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNK 523
Query: 338 LKEKETELDEI---LAVLTEK 355
++ +L++I + L +K
Sbjct: 524 IESARADLEDIKIKINELKDK 544
>gnl|CDD|191136 pfam04949, Transcrip_act, Transcriptional activator. This family
of proteins may act as a transcriptional activator. It
plays a role in stress response in plants.
Length = 160
Score = 30.6 bits (69), Expect = 0.98
Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 11/92 (11%)
Query: 275 MSLKSAVEDKIRRKMNE--QKVQAQ-----EEIEILKQTENEL----NQGKFKLARMFER 323
+SL A E++I R+ E ++VQAQ EE + L + EL + + +++ + +R
Sbjct: 33 LSLFKAKEEEIERRKMEVRERVQAQLGRVEEETKRLAEIREELEGMADPMRKEVSMIRKR 92
Query: 324 IDKEKAELERSISFLKEKETELDEILAVLTEK 355
ID EL+ ++KE E E L EK
Sbjct: 93 IDSVNRELKPLGQTCQKKEKEYKEALEAYNEK 124
>gnl|CDD|177188 MTH00130, COX3, cytochrome c oxidase subunit III; Provisional.
Length = 261
Score = 30.9 bits (70), Expect = 1.0
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 5/45 (11%)
Query: 228 TPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
TP GG +PP G T P+ VP + A + SG T+T H
Sbjct: 109 TPELGG-----CWPPTGITTLDPFEVPLLNTAVLLASGVTVTWAH 148
>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
Length = 1112
Score = 31.5 bits (72), Expect = 1.0
Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 9/77 (11%)
Query: 277 LKSAVEDKIRRKMNE-----QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL 331
LK DK R + E +K++A+ E+ + +EL K + I + A++
Sbjct: 5 LKKLFGDKSERDLKEIQPIVEKIKAEYP-ELEALSNDELRA---KTDEFKQYIKEAVADI 60
Query: 332 ERSISFLKEKETELDEI 348
+ I LK + E +I
Sbjct: 61 DAKIEELKAEAIESLDI 77
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 31.2 bits (71), Expect = 1.1
Identities = 23/87 (26%), Positives = 38/87 (43%), Gaps = 12/87 (13%)
Query: 290 NEQKVQAQEEIEILKQTENELN--------QGKFKLARMFERIDKEKAELERSISFLKEK 341
++++ + EIE L+ NEL+ Q K K+ + + + + K EL + LK
Sbjct: 36 DDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKAL 95
Query: 342 ETELDEILA----VLTEKTEVDVDEAV 364
E EL + L + E V DE
Sbjct: 96 EAELQDKLLSIPNIPHESVPVGKDEED 122
>gnl|CDD|220001 pfam08748, DUF1789, Domain of unknown function (DUF1789). Putative
uncharacterized domain found in phage-related conserved
hypothetical protein from Bordetella.
Length = 125
Score = 29.7 bits (67), Expect = 1.1
Identities = 17/85 (20%), Positives = 27/85 (31%), Gaps = 14/85 (16%)
Query: 315 FKLARMFERIDKEKAELERSISFLK--------EKETELDEILAVLTEKTEVDVDEAVTT 366
FKL R+ E A++E + + +K E D L E + +
Sbjct: 14 FKLPVKIPRLGGEDAKVEFTFKHMDRKELAELYDKWGESDAKLGEKAAAEEAFTAKQIDL 73
Query: 367 TAPIYKQIL------NTFTEEAATE 385
A + K I F +E
Sbjct: 74 DAELIKAIAVGWDLDEEFNDENVEL 98
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 29.1 bits (66), Expect = 1.1
Identities = 14/59 (23%), Positives = 24/59 (40%), Gaps = 5/59 (8%)
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE-KAELERSISFLKEKET 343
++N++ Q E+ LK EL L + I++ ++EL K ET
Sbjct: 19 YYQLNQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERARSELG----LSKPGET 73
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 31.1 bits (71), Expect = 1.2
Identities = 12/60 (20%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER-SISFLKEKETELDEIL 349
EQ++ +++ E + E + + + ++ E+++ E+ +L L+ K E +E+L
Sbjct: 219 EQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELL 278
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 31.4 bits (71), Expect = 1.2
Identities = 22/112 (19%), Positives = 31/112 (27%), Gaps = 5/112 (4%)
Query: 188 GSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP 247
G P A P AAAA S P+ GA + P +
Sbjct: 365 GGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGA---ALAP--KAAAAAA 419
Query: 248 ATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEE 299
AT P +PAP T+ + + + + AQ
Sbjct: 420 ATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPP 471
Score = 30.6 bits (69), Expect = 1.9
Identities = 21/113 (18%), Positives = 33/113 (29%), Gaps = 6/113 (5%)
Query: 155 SSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAP 214
+ P A + + P + + P PP A A
Sbjct: 441 DAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDA-----AFEPAPR 495
Query: 215 SGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPA-PAVTSGG 266
+ + + P A + P + PA PTP+ A PA +GG
Sbjct: 496 AAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGG 548
Score = 29.0 bits (65), Expect = 5.4
Identities = 16/87 (18%), Positives = 22/87 (25%), Gaps = 10/87 (11%)
Query: 176 APYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPN-AAAAPSGSIYPSYPTPGATPGAGGY 234
P P G++ + G AAA + + P P A
Sbjct: 380 VPAPGARAAAAVGASAVPAVT--AVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATAD 437
Query: 235 SPYPLYPPGGSTPATPYPVPTPSPAPA 261
G PVP + A A
Sbjct: 438 R-------GDDAADGDAPVPAKANARA 457
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 30.7 bits (70), Expect = 1.2
Identities = 20/90 (22%), Positives = 40/90 (44%), Gaps = 19/90 (21%)
Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 336
L +A+E K+ E + A E K EN + E++++ +A+L+ +
Sbjct: 59 LATALE-----KLEEAEKAADESERGRKVLENRSLKDD-------EKMEQLEAQLKEAKE 106
Query: 337 FLKEKETELDEI---LAVLTEKTEVDVDEA 363
+E + + +E+ L V E D++ A
Sbjct: 107 IAEEADRKYEEVARKLVV----VEGDLERA 132
Score = 29.9 bits (68), Expect = 2.1
Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 16/68 (23%)
Query: 303 LKQTENELNQGKFKLARMFERIDK-----EKAELE-----RSISFLKEKETELD---EIL 349
+KQ E EL++ + +L E++++ EKAE E R I L+E +L+ E L
Sbjct: 3 IKQLEEELDKAEERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEE---DLERSEERL 59
Query: 350 AVLTEKTE 357
A EK E
Sbjct: 60 ATALEKLE 67
Score = 29.9 bits (68), Expect = 2.3
Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
K E+ +AQE+ L++ E + + ++A + RI + +LERS L +L+E
Sbjct: 12 KAEERLEEAQEK---LEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLATALEKLEE 68
>gnl|CDD|205306 pfam13125, DUF3958, Protein of unknown function (DUF3958). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 120 amino acids in length.
There are two conserved sequence motifs: RLF and TWH.
Length = 99
Score = 29.0 bits (65), Expect = 1.3
Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 19/76 (25%)
Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERI------DKEKAEL-----------ER 333
E++ + Q I+ ++ E + ++ K + R+F RI DKE + ER
Sbjct: 3 EEQDRNQSAIQTQERAEADFHELKNRSNRLFNRILETWHGDKEMSLFFMNIRQEAQQIER 62
Query: 334 SISFL--KEKETELDE 347
++F +KET L E
Sbjct: 63 KLTFELENQKETLLKE 78
>gnl|CDD|184060 PRK13453, PRK13453, F0F1 ATP synthase subunit B; Provisional.
Length = 173
Score = 30.3 bits (68), Expect = 1.3
Identities = 24/115 (20%), Positives = 57/115 (49%), Gaps = 18/115 (15%)
Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE-K 328
+E K LK ++++++ + + KVQA+ +Q E +++ + M E E
Sbjct: 72 EEENKQKLKET-QEEVQKILEDAKVQAR------QQQEQIIHEANVRANGMIETAQSEIN 124
Query: 329 AELERSISFLKEKETELDEILAVLTEKTEV-DVDEAVTTTAPIYKQILNTFTEEA 382
++ ER+I+ + + +EL ++A + E+ + D+ K +++ + +EA
Sbjct: 125 SQKERAIADINNQVSELSVLIASKVLRKEISEQDQ---------KALVDKYLKEA 170
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 30.9 bits (70), Expect = 1.4
Identities = 17/52 (32%), Positives = 18/52 (34%)
Query: 210 AAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
A A PS + P A P P P P A P P P P A A
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAA 432
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 30.7 bits (69), Expect = 1.4
Identities = 26/121 (21%), Positives = 31/121 (25%), Gaps = 11/121 (9%)
Query: 148 SALPPLYSSRTKQEPATNPT-YNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGY 206
S LPP Q+P Y + P P Q P P
Sbjct: 100 SQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQ-----PQPPPQVPQQQQYQ 154
Query: 207 NPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYP---VPTPSPAPAVT 263
+P P P A +G Y Y P P P P P + A
Sbjct: 155 SP--PQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPP 212
Query: 264 S 264
S
Sbjct: 213 S 213
Score = 28.4 bits (63), Expect = 7.2
Identities = 25/100 (25%), Positives = 33/100 (33%), Gaps = 7/100 (7%)
Query: 155 SSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPY-PTYPPSAYPGYNPNAAAA 213
S + P +P G S P S Y + PP + + A+ A
Sbjct: 213 SQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPPSKGNHGSVASYA 272
Query: 214 PSGSI------YPSYPTPGATPGAGGYSPYPLYPPGGSTP 247
P GS YPS P P A S P+ G +P
Sbjct: 273 PQGSSQSYSTAYPSLPAATVLPQALPMSSAPMSGGGSGSP 312
Score = 28.0 bits (62), Expect = 9.4
Identities = 24/114 (21%), Positives = 33/114 (28%), Gaps = 7/114 (6%)
Query: 149 ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
ALPP + ++ Q S AP Q +P V + P P Y P
Sbjct: 72 ALPPAPAPQSPQPDQQQQ-------SQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPP 124
Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
+ P + P P P Y P P P +P +
Sbjct: 125 PSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSG 178
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 30.9 bits (70), Expect = 1.4
Identities = 20/78 (25%), Positives = 41/78 (52%), Gaps = 11/78 (14%)
Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
+K + K+A EDK +R+ + K QA+ + Q E +L + K K++ +++K ++E
Sbjct: 511 VKEAEKNAAEDKEKREKIDLKNQAE---SLCYQAEKQLKELKDKIS------EEKKEKIE 561
Query: 333 RSISFLKE--KETELDEI 348
I L++ + + I
Sbjct: 562 NLIKKLRQALQNDNYESI 579
>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
Length = 214
Score = 30.4 bits (68), Expect = 1.4
Identities = 20/56 (35%), Positives = 25/56 (44%), Gaps = 8/56 (14%)
Query: 212 AAPSGSIYPSYPT-PGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPA-PAVTSG 265
+AP+ + P + PG+ P P PGG PA P P P P P PA S
Sbjct: 156 SAPA--VRPGDLSGPGSRPT---SGPVAGGLPGGGAPAPPTP-PRPGPTDPASESD 205
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 29.8 bits (67), Expect = 1.4
Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 2/36 (5%)
Query: 236 PYPLYPPGGSTPATPYPV--PTPSPAPAVTSGGTIT 269
P P P + P+ P PV P+PAPA +T
Sbjct: 53 PTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVT 88
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 31.2 bits (70), Expect = 1.4
Identities = 26/108 (24%), Positives = 31/108 (28%), Gaps = 21/108 (19%)
Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTY-----PPSAYPGYNPNAAAAPSGSIYPSYPTP-G 226
+ APY PYP Y P + Y GY A S YP Y P G
Sbjct: 743 SHQAPYSGHEEPQAQ----QAPYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGYAGPWG 798
Query: 227 ATPGAGGY----SPYPLYPPGGST-----PATPYPVPT--PSPAPAVT 263
Y + + YP G P P+ P S
Sbjct: 799 LRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAGHGQD 846
>gnl|CDD|177161 MTH00099, COX3, cytochrome c oxidase subunit III; Validated.
Length = 261
Score = 30.5 bits (69), Expect = 1.5
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 5/45 (11%)
Query: 228 TPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
TP GG +PP G TP P VP + + + SG +IT H
Sbjct: 109 TPELGGC-----WPPTGITPLNPLEVPLLNTSVLLASGVSITWAH 148
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 30.6 bits (68), Expect = 1.5
Identities = 21/77 (27%), Positives = 24/77 (31%), Gaps = 2/77 (2%)
Query: 185 PMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
P A P A A + P+ P P P P P PP
Sbjct: 87 ACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPA--RPAPACPPST 144
Query: 245 STPATPYPVPTPSPAPA 261
P+PTP PAPA
Sbjct: 145 RQCPPAPPLPTPKPAPA 161
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 30.9 bits (70), Expect = 1.6
Identities = 30/114 (26%), Positives = 39/114 (34%), Gaps = 18/114 (15%)
Query: 160 QEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIY 219
E N A+ P P Q PG P P P ++P A P+
Sbjct: 117 AEQVKALGANPVTATGEPVP-QMPASPPGPE-GEPQPGNTPVSFPPQGSVAVPPPTA--- 171
Query: 220 PSYPTPGAT-PGAGGYSPYPL----------YPPGGSTPATPYPVPTPSPAPAV 262
YP G T P Y P+ Y G P+ PY +P+ S A A+
Sbjct: 172 -FYPGNGVTPPPQVTYQSVPVPNRIQRKTFTYNEGKKGPSLPY-IPSGSFAKAM 223
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.8 bits (70), Expect = 1.6
Identities = 32/148 (21%), Positives = 62/148 (41%), Gaps = 15/148 (10%)
Query: 278 KSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL--ERSI 335
+ +++DK+ + E+K QA+ ++ L +L Q + E + + EL ER
Sbjct: 665 QQSLKDKLELAIAERKQQAETQLRQLDA---QLKQLLEQQQAFLEALKDDFRELRTERL- 720
Query: 336 SFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395
+ + E ELD LA L+ E +A + KQ +T+ + L
Sbjct: 721 AKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTV----KEL 776
Query: 396 RRGVIDLEVYLKQVRS-----LSRKQFM 418
+R + +LE ++++ + FM
Sbjct: 777 KRQIEELETTIERIAVRRPEVREYRAFM 804
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 30.8 bits (70), Expect = 1.7
Identities = 18/89 (20%), Positives = 34/89 (38%), Gaps = 14/89 (15%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE---LERSISFL- 338
DK M A+E+ + L++ ++L KL + K+ + + FL
Sbjct: 269 DKFIEVMKPFLKAAKEKYDKLQKDLSDLKTRFEKLVEYYGEDPKDTSPEEFFKDFNEFLK 328
Query: 339 ----------KEKETELDEILAVLTEKTE 357
K++E E + ++ E TE
Sbjct: 329 EFSKAAEENIKKEEEEEERRKKLVKETTE 357
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 30.5 bits (69), Expect = 1.7
Identities = 19/119 (15%), Positives = 28/119 (23%), Gaps = 6/119 (5%)
Query: 148 SALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYN 207
A + + + P + + S P P+ + N N
Sbjct: 282 IACSSPQAEPEQGVQSYVPQSSDSRPSCFPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPK 341
Query: 208 PNAAAAPSGSIYPSYP-TPGATPGAGGYSP-----YPLYPPGGSTPATPYPVPTPSPAP 260
AA S + P P G SP + A P PA
Sbjct: 342 QEGGAAVSQNHDGGTVRAPRGRPSGSGQSPPSNSPLLSSLADTPSGAAHQPASLLPPAV 400
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 30.7 bits (70), Expect = 1.8
Identities = 17/73 (23%), Positives = 34/73 (46%), Gaps = 11/73 (15%)
Query: 273 IKMSLKSAVEDKIRRK-----MNEQKVQAQEEIEILKQTENELNQGKFKL------ARMF 321
++ + VE+K+ + + EQ ++E+ I K ++EL + K KL +
Sbjct: 207 LQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVK 266
Query: 322 ERIDKEKAELERS 334
+ I+KE +L
Sbjct: 267 KVIEKELTKLSLL 279
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 30.8 bits (69), Expect = 1.8
Identities = 28/160 (17%), Positives = 54/160 (33%), Gaps = 12/160 (7%)
Query: 196 PTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPT 255
P P +++ S + S + G S G ++ + V
Sbjct: 132 PGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGKASASVAGIVGA 191
Query: 256 PSPAPAVTSGGTITDEHI----------KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQ 305
AP E + K L+ +E++I N+ A+EE L +
Sbjct: 192 DEEAPPAPKNTLTPLEELYETNVNLFALKHPLEK-LEEEIDILKNDGDKVAEEEEFELDE 250
Query: 306 TENELNQGKFK-LARMFERIDKEKAELERSISFLKEKETE 344
E + K + L ++ D+EK + + I F++ +
Sbjct: 251 EHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKD 290
>gnl|CDD|147949 pfam06068, TIP49, TIP49 C-terminus. This family consists of the
C-terminal region of several eukaryotic and archaeal
RuvB-like 1 (Pontin or TIP49a) and RuvB-like 2 (Reptin
or TIP49b) proteins. The N-terminal domain contains the
pfam00004 domain. In zebrafish, the liebeskummer (lik)
mutation, causes development of hyperplastic embryonic
hearts. lik encodes Reptin, a component of a
DNA-stimulated ATPase complex. Beta-catenin and Pontin,
a DNA-stimulated ATPase that is often part of complexes
with Reptin, are in the same genetic pathways. The
Reptin/Pontin ratio serves to regulate heart growth
during development, at least in part via the
beta-catenin pathway. TBP-interacting protein 49 (TIP49)
was originally identified as a TBP-binding protein, and
two related proteins are encoded by individual genes,
tip49a and b. Although the function of this gene family
has not been elucidated, they are supposed to play a
critical role in nuclear events because they interact
with various kinds of nuclear factors and have DNA
helicase activities.TIP49a has been suggested to act as
an autoantigen in some patients with autoimmune
diseases.
Length = 395
Score = 30.4 bits (69), Expect = 1.8
Identities = 31/133 (23%), Positives = 47/133 (35%), Gaps = 28/133 (21%)
Query: 249 TPYPVPTP-SPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTE 307
P S G IT + KM + KI ++ ++KVQA + I I
Sbjct: 126 EIEEAENPLSGYGKTIKHGKITLKTTKMEKTLKLGPKIYEQLQKEKVQAGDVIYI----- 180
Query: 308 NELNQGKF-KLARMFERIDKEKAELERSIS-----FLKEKET-------ELDEI------ 348
+ N GK KL R F R E + K KE ++D
Sbjct: 181 -DKNTGKVKKLGRSFARATDFDLEATEFVPCPKGEVHKRKEVVQTVTLHDIDVANARPQG 239
Query: 349 -LAVLT-EKTEVD 359
L++ + +K E+
Sbjct: 240 ILSLFSPKKGEIT 252
>gnl|CDD|215457 PLN02852, PLN02852, ferredoxin-NADP+ reductase.
Length = 491
Score = 30.4 bits (69), Expect = 1.8
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 321 FERIDKEKAELERSISFLKEKETELDEILAVLTEK 355
+E+ID + E R+ +EK T ++E+L EK
Sbjct: 457 WEKIDSAEKEAGRARGKPREKITSIEEMLKAANEK 491
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 30.8 bits (70), Expect = 1.8
Identities = 16/90 (17%), Positives = 39/90 (43%)
Query: 268 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 327
I D+ ++ D ++ E + E+ + L + +N+++ K L + ++ K
Sbjct: 304 IKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKV 363
Query: 328 KAELERSISFLKEKETELDEILAVLTEKTE 357
KA +E + + EL ++ L + +
Sbjct: 364 KAAIEELQAEFVDNAEELAKLQDELDKIVK 393
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 30.0 bits (68), Expect = 1.8
Identities = 23/117 (19%), Positives = 50/117 (42%), Gaps = 17/117 (14%)
Query: 282 EDKIRRKMNEQKVQAQEEIEILKQT-------ENELNQGKFKLARM---FE------RID 325
E+ + ++ E+ V+ + E E+L + E +L++ + + R+ FE + +
Sbjct: 18 EEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYLRLQADFENYKRRIQKE 77
Query: 326 KEKAELERSISFLKEKETELDEI-LAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE 381
E AE R+ S + LD A+ E + + +++Q+L +E
Sbjct: 78 NEAAEKYRAQSLASDLLPALDNFERALQIEADDEQTKSLLKGVEMVHRQLLEALKKE 134
>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein. This
is the P3 protein section of the Potyviridae
polyproteins. The function is not known except that the
protein is essential to viral survival.
Length = 445
Score = 30.6 bits (70), Expect = 1.9
Identities = 14/83 (16%), Positives = 37/83 (44%), Gaps = 12/83 (14%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
+L S + + + K + + ++E + +++ ++ +++ + E +
Sbjct: 264 TLVSIINEHRKLKREAAEEKEEKEEKEIRKLY------------LYSKLEGKLPTEEEFL 311
Query: 336 SFLKEKETELDEILAVLTEKTEV 358
+++EK EL E L E+ EV
Sbjct: 312 EYIEEKNPELLETAEELIEEEEV 334
>gnl|CDD|129394 TIGR00293, TIGR00293, prefoldin, archaeal alpha subunit/eukaryotic
subunit 5. Members of this protein family, rich in
coiled coil regions, are molecular chaperones in the
class of the prefoldin (GimC) alpha subunit. Prefoldin
is a hexamer of two alpha and four beta subunits. This
protein appears universal in the archaea but is
restricted to Aquifex aeolicus among bacteria so far.
Eukaryotes have several related proteins; only prefoldin
subunit 5, which appeared the most similar to archaeal
prefoldin alpha, is included in this model. This model
finds a set of small proteins from the Archaea and from
Aquifex aeolicus that may represent two orthologous
groups. The proteins are predicted to be mostly coiled
coil [Protein fate, Protein folding and stabilization].
Length = 126
Score = 29.2 bits (66), Expect = 1.9
Identities = 16/86 (18%), Positives = 31/86 (36%), Gaps = 25/86 (29%)
Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE-KETEL 345
+++ + Q+++E L+ + AELE +I L++ K E
Sbjct: 2 QQLAAELQILQQQVESLQAQ--------------IAALRALIAELETAIETLEDLKGAEG 47
Query: 346 DEIL----------AVLTEKTEVDVD 361
E L A + + +V V
Sbjct: 48 KETLVPVGAGSFVKAKVKDTDKVLVS 73
>gnl|CDD|219444 pfam07516, SecA_SW, SecA Wing and Scaffold domain. SecA protein
binds to the plasma membrane where it interacts with
proOmpA to support translocation of proOmpA through the
membrane. SecA protein achieves this translocation, in
association with SecY protein, in an ATP dependent
manner. This family is composed of two C-terminal alpha
helical subdomains: the wing and scaffold subdomains.
Length = 213
Score = 29.8 bits (68), Expect = 1.9
Identities = 19/86 (22%), Positives = 32/86 (37%), Gaps = 12/86 (13%)
Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
E I ++ ++D + E+ + ++E L E EL + L + + E
Sbjct: 57 KEIILEMIEDVIDDIVEEYAPEKSYPEEWDLEGL---EEELRE-LLGLDLDIDEEELEGL 112
Query: 330 ELERSISFL--------KEKETELDE 347
E L +EKE EL E
Sbjct: 113 TEEELKERLIEAAKEAYEEKEAELGE 138
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 30.5 bits (69), Expect = 1.9
Identities = 24/101 (23%), Positives = 30/101 (29%)
Query: 161 EPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYP 220
E A G A + P P PG+ P+ A+ A GS P
Sbjct: 49 ELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTP 108
Query: 221 SYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
P+ P P P PV +P P PA
Sbjct: 109 PGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPA 149
Score = 29.4 bits (66), Expect = 5.1
Identities = 15/84 (17%), Positives = 17/84 (20%), Gaps = 3/84 (3%)
Query: 176 APYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYS 235
P P S + P G N P P A P +
Sbjct: 838 PPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPE---PRARPGAAAPPKAAAA 894
Query: 236 PYPLYPPGGSTPATPYPVPTPSPA 259
P P P P P
Sbjct: 895 APPAGAPAPRPRPAPRVKLGPMPP 918
Score = 29.0 bits (65), Expect = 6.3
Identities = 23/77 (29%), Positives = 27/77 (35%), Gaps = 9/77 (11%)
Query: 187 PGSNVNTPYPTYPPSA--YPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
PG+ P S+ P A +G P P P A PGA PP
Sbjct: 839 PGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAA-------PPKA 891
Query: 245 STPATPYPVPTPSPAPA 261
+ A P P P P PA
Sbjct: 892 AAAAPPAGAPAPRPRPA 908
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 30.5 bits (69), Expect = 1.9
Identities = 17/69 (24%), Positives = 24/69 (34%), Gaps = 5/69 (7%)
Query: 196 PTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPT 255
P P +A P P A AA + + P+ A + P P A P
Sbjct: 381 PARPEAAAPAAAPVAQAAAAPA--PAAAPAAAASAP---AAPPAAAPPAPVAAPAAAAPA 435
Query: 256 PSPAPAVTS 264
+PA A +
Sbjct: 436 AAPAAAPAA 444
Score = 28.5 bits (64), Expect = 8.5
Identities = 22/110 (20%), Positives = 30/110 (27%), Gaps = 2/110 (1%)
Query: 151 PPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNA 210
P A A A + P P P P+A A
Sbjct: 380 TPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPA 439
Query: 211 AAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
AA + ++ P+ P A P + P A P P P+ A
Sbjct: 440 AAPAAVALAPAPPAQAAPETVA--IPVRVAPEPAVASAAPAPAAAPAAAR 487
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 30.7 bits (69), Expect = 2.0
Identities = 23/95 (24%), Positives = 30/95 (31%), Gaps = 8/95 (8%)
Query: 184 MPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPG 243
+PG+ P P P + + A + S + P P G SP P
Sbjct: 180 GNIPGAI--QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPP 237
Query: 244 GSTPATP----YPVPTPSPAPAVTSGGTITDEHIK 274
S PA P P P P I HI+
Sbjct: 238 ASIPAPPIPPVIQYVAPPPVPPPQPIIPIQ--HIR 270
>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein. This is a family
of proteins which regulate checkpoint kinases. In
Schizosaccharomyces pombe this protein is called Rad26
and in Saccharomyces cerevisiae it is called LCD1.
Length = 648
Score = 30.5 bits (69), Expect = 2.0
Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 6/50 (12%)
Query: 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
+R K++ + Q QEE K NEL + K + + K K EL+
Sbjct: 2 LRDKLDMLQQQKQEERNKQKSRVNEL-KEKHD-----QELQKLKQELQSL 45
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 30.2 bits (68), Expect = 2.0
Identities = 28/82 (34%), Positives = 33/82 (40%), Gaps = 3/82 (3%)
Query: 185 PMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
P P + V P P + P P A APS PTP + P A +P +PP
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAK--APKSSHPPLK 201
Query: 245 STPA-TPYPVPTPSPAPAVTSG 265
S A T Y P P P V G
Sbjct: 202 SPMAGTFYRSPAPGEPPFVKVG 223
>gnl|CDD|129241 TIGR00135, gatC, aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase,
C subunit. Archaea, organelles, and many bacteria
charge Gln-tRNA by first misacylating it with Glu and
then amidating Glu to Gln. This small protein is part of
the amidotransferase heterotrimer and appears to be
important to the stability of the amidase subunit encode
by gatA, but its function may not be required in every
organism that expresses gatA and gatB. The seed
alignment for This model does not include any eukaryotic
sequence and is not guaranteed to find eukaryotic
examples, although it does find some. Saccharomyces
cerevisiae, which expresses the amidotransferase for
mitochondrial protein translation, seems to lack a gatC
ortholog. This model has been revised to remove the
candidate sequence from Methanococcus jannaschii, now
part of a related model [Protein synthesis, tRNA
aminoacylation].
Length = 93
Score = 28.4 bits (64), Expect = 2.0
Identities = 17/56 (30%), Positives = 24/56 (42%), Gaps = 7/56 (12%)
Query: 338 LKEKE-----TELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTI 388
L E+E +LDEIL + + EVD + T P+ I N E+ E
Sbjct: 17 LSEEEAESFAGDLDEILDYVEQLNEVDTENVEPMTHPLE--ISNVLREDEPEEPLS 70
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 30.3 bits (69), Expect = 2.1
Identities = 14/68 (20%), Positives = 16/68 (23%), Gaps = 5/68 (7%)
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
P P G PGA A P G A VP + A A
Sbjct: 392 PSPGGGGGGPPGGG-----GAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAP 446
Query: 264 SGGTITDE 271
+
Sbjct: 447 PAPAAAPQ 454
Score = 30.3 bits (69), Expect = 2.5
Identities = 9/43 (20%), Positives = 11/43 (25%)
Query: 220 PSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
P P G GG + A PA A+
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAAL 433
>gnl|CDD|218232 pfam04731, Caudal_act, Caudal like protein activation region. This
family consists of the amino termini of proteins
belonging to the caudal-related homeobox protein family.
This region is thought to mediate transcription
activation. The level of activation caused by mouse Cdx2
is affected by phosphorylation at serine 60 via the
mitogen-activated protein kinase pathway. Caudal family
proteins are involved in the transcriptional regulation
of multiple genes expressed in the intestinal
epithelium, and are important in differentiation and
maintenance of the intestinal epithelial lining. Caudal
proteins always have a homeobox DNA binding domain
(pfam00046).
Length = 135
Score = 29.3 bits (66), Expect = 2.1
Identities = 24/95 (25%), Positives = 31/95 (32%), Gaps = 18/95 (18%)
Query: 179 PTQSYMPMPGSNVNTPYPTYPP-SAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPY 237
Q+++ P YP Y PG N + PSG+ Y P +S Y
Sbjct: 14 GAQNFVSAP------QYPDYGGYHHVPGMNLDPHGQPSGAWGSPYGPPRED-----WSAY 62
Query: 238 PLYPPGGSTPAT------PYPVPTPSPAPAVTSGG 266
P +T AT Y P SP G
Sbjct: 63 GPGPGPSATAATGSPGQMAYSPPDYSPVHPPGPGP 97
Score = 29.0 bits (65), Expect = 2.7
Identities = 29/110 (26%), Positives = 35/110 (31%), Gaps = 26/110 (23%)
Query: 171 NMASNAPYPT-QSYMPMPGSNVNTP-------YPTYPP-----SAYPGYNPNAAAAPSGS 217
N S YP Y +PG N++ Y P SAY +A A +GS
Sbjct: 17 NFVSAPQYPDYGGYHHVPGMNLDPHGQPSGAWGSPYGPPREDWSAYGPGPGPSATAATGS 76
Query: 218 IYPSYPTPGATPGAGGYSP--YPLYPPGGSTPATPYPVPTPSPAPAVTSG 265
PG YSP Y P G P P + S S
Sbjct: 77 -----------PGQMAYSPPDYSPVHPPGPGPGLLQPADSGSVEQLSPSA 115
>gnl|CDD|237255 PRK12900, secA, preprotein translocase subunit SecA; Reviewed.
Length = 1025
Score = 30.5 bits (69), Expect = 2.1
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 14/101 (13%)
Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
I+ SL S +D +R K E K + + +E ++Q + +L + KL I E+AE
Sbjct: 30 IQASLASLSDDALREKGMELKSRVRGALEPIEQKKKDLEK---KLDN--PDISLEEAE-- 82
Query: 333 RSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQ 373
SI+ ELD + E T ++E + T + K+
Sbjct: 83 -SIN------EELDTLAKEYEEATAAALEEILPETFALVKE 116
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 30.1 bits (68), Expect = 2.1
Identities = 22/78 (28%), Positives = 35/78 (44%), Gaps = 7/78 (8%)
Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER-- 333
+ K+ E+K Q EIE L E+ K+ + + ID+ KAE+++
Sbjct: 31 DKIQNQDSKLSELQKEKK-NIQNEIESLDNQIEEIQS---KIDELQKEIDQSKAEIKKLQ 86
Query: 334 -SISFLKEKETELDEILA 350
I+ LKE E E+L
Sbjct: 87 KEIAELKENIVERQELLK 104
Score = 28.1 bits (63), Expect = 7.5
Identities = 16/75 (21%), Positives = 31/75 (41%), Gaps = 4/75 (5%)
Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFE----RIDKEKAELERSISFLKEKE 342
+K+ EQ+ + ++ +E + + + L E ++ +KAE I+ L KE
Sbjct: 147 KKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKE 206
Query: 343 TELDEILAVLTEKTE 357
A L E+
Sbjct: 207 ASALGEKAALEEQKA 221
>gnl|CDD|177146 MTH00075, COX3, cytochrome c oxidase subunit III; Provisional.
Length = 261
Score = 30.1 bits (68), Expect = 2.1
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
+PP G TP P+ VP + A + SG T+T H
Sbjct: 116 WPPTGITPLDPFEVPLLNTAVLLASGVTVTWAH 148
>gnl|CDD|177101 MTH00009, COX3, cytochrome c oxidase subunit III; Validated.
Length = 259
Score = 29.8 bits (67), Expect = 2.2
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
+PP G P P+ VP + A + SG T+T H
Sbjct: 114 WPPTGIEPLNPFSVPLLNTAVLLASGVTVTWAH 146
>gnl|CDD|215026 smart01104, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
The C-terminal domain of the transcription elongation
factor protein Spt5 is necessary for binding to Spt4 to
form the functional complex that regulates early
transcription elongation by RNA polymerase II. The
complex may be involved in pre-mRNA processing through
its association with mRNA capping enzymes. This CTD
domain carries a regular nonapeptide repeat that can be
present in up to 18 copies, as in S. pombe. The repeat
has a characteristic TPA motif.
Length = 121
Score = 29.0 bits (65), Expect = 2.2
Identities = 22/110 (20%), Positives = 29/110 (26%), Gaps = 3/110 (2%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
+ P + A + GS TP S P + +
Sbjct: 11 GSKTPAWGSRTPGTAAGGAPTARGGSGSR--TPAWGGAGSRTPAWGGAGPTGSRTPAWGG 68
Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSP-APAVTSGGTITD 270
G G S + P G TP TPS PA GG +
Sbjct: 69 ASAWGNKSSEGSASSWAAGPGGAYGAPTPGYGGTPSAYGPATPGGGAMAG 118
>gnl|CDD|179381 PRK02193, truB, tRNA pseudouridine synthase B; Provisional.
Length = 279
Score = 30.1 bits (68), Expect = 2.3
Identities = 10/46 (21%), Positives = 18/46 (39%), Gaps = 1/46 (2%)
Query: 112 SQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSR 157
S D G+I LE + L + +PP++S++
Sbjct: 74 STTYDSEGQII-NVSQNIKVTKENLEEALNNLVGSQKQVPPVFSAK 118
Score = 29.0 bits (65), Expect = 4.3
Identities = 10/46 (21%), Positives = 18/46 (39%), Gaps = 1/46 (2%)
Query: 41 SQNVDHTGKIYSPYLHTWTPQNSKLEGLIKTLASAFSALPPLYSSR 86
S D G+I LE + L + +PP++S++
Sbjct: 74 STTYDSEGQII-NVSQNIKVTKENLEEALNNLVGSQKQVPPVFSAK 118
>gnl|CDD|220373 pfam09738, DUF2051, Double stranded RNA binding protein (DUF2051).
This is a novel protein identified as interacting with
the leucine-rich repeat domain of human flightless-I,
FliI protein.
Length = 294
Score = 29.9 bits (67), Expect = 2.3
Identities = 31/133 (23%), Positives = 61/133 (45%), Gaps = 26/133 (19%)
Query: 259 APAVTSGGTITDEHIKMSLKSAVEDKIRRKM-------NEQKVQAQEEIEILKQTENELN 311
+ +V + +I + +K SL VE+K+++ M NE+ E +++LK EL
Sbjct: 69 SSSVDTEASIRE--LKDSL-IEVEEKVKKAMVSNAQLDNEKTNLMYE-VDLLKDKLEELE 124
Query: 312 QGKFKLARMFERIDKEKAELERSI-------SFLKEKETELDEILA-----VLTEKTEVD 359
+ +L R FE ++ L+R+I LKE+ + D+++ ++ E +
Sbjct: 125 ESLAELQREFEEKTRDLEALKRAIDILQHQLKELKEQLRQRDQLIEEHGLVLVPEVPNGE 184
Query: 360 VDEA---VTTTAP 369
+ + TAP
Sbjct: 185 GSDEVNNASVTAP 197
>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus. This family represents
the C-terminus of eukaryotic neogenin precursor
proteins, which contains several potential
phosphorylation sites. Neogenin is a member of the N-CAM
family of cell adhesion molecules (and therefore
contains multiple copies of pfam00047 and pfam00041) and
is closely related to the DCC tumour suppressor gene
product - these proteins may play an integral role in
regulating differentiation programmes and/or cell
migration events within many adult and embryonic
tissues.
Length = 295
Score = 29.9 bits (67), Expect = 2.3
Identities = 21/94 (22%), Positives = 26/94 (27%), Gaps = 15/94 (15%)
Query: 179 PTQSYMPMPGSN--VNTPYPTYPPSAYPGYNPNAAAAPSGSI-----YPSYP-----TPG 226
Q TP P Y P+A + A APS SI P++P P
Sbjct: 140 LYQHGDRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPA 199
Query: 227 ATPGAGGYSPYPLYPP---GGSTPATPYPVPTPS 257
P P + V T S
Sbjct: 200 LPASMSTIEPKLPSTPLLTQQGPTLPKHSVKTAS 233
>gnl|CDD|224307 COG1389, COG1389, DNA topoisomerase VI, subunit B [DNA replication,
recombination, and repair].
Length = 538
Score = 30.1 bits (68), Expect = 2.3
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 1/66 (1%)
Query: 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDV 360
EI + L + KL R +E E ++ + K E+ + LA + EK E D+
Sbjct: 442 EIENEIRLALMEVARKLKLYLSRKRREMEERKKRKTIEKYLP-EIAKKLAEILEKPEEDI 500
Query: 361 DEAVTT 366
E +
Sbjct: 501 REKLAK 506
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 29.9 bits (66), Expect = 2.4
Identities = 29/124 (23%), Positives = 36/124 (29%), Gaps = 1/124 (0%)
Query: 141 KTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPP 200
K A A +A P ++ + A P + A P P P P
Sbjct: 218 KAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAP 277
Query: 201 SAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
A P AAAP + A P +P P A P TP PA
Sbjct: 278 PAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATP-PAK 336
Query: 261 AVTS 264
A
Sbjct: 337 AAAP 340
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 30.1 bits (68), Expect = 2.4
Identities = 34/140 (24%), Positives = 58/140 (41%), Gaps = 12/140 (8%)
Query: 289 MNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI 348
+ K++A E+ E L+Q + + + LA FE+ DKE + S S E+ E E
Sbjct: 54 IIGDKMKAYEQ-EALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEPAE- 111
Query: 349 LAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQ 408
T + + + + L +F E T++ + L R V +LE L
Sbjct: 112 ----TAGSLRPIRSSA-AAYRDINRELGSFYEYPKTDEE-----KELLREVEELESRLAT 161
Query: 409 VRSLSRKQFMFRALLFKCYK 428
S + + AL+ K Y+
Sbjct: 162 EPSPAPELEEQLALMEKSYE 181
>gnl|CDD|187842 cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
RAMP superfamily protein; predicted Cas5 ortholog; also
known as CasA/Cse1 family.
Length = 210
Score = 29.8 bits (67), Expect = 2.4
Identities = 15/87 (17%), Positives = 25/87 (28%), Gaps = 6/87 (6%)
Query: 137 EGLIKTLASAF-SALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPY 195
E I A + + PTYN ++A + + N
Sbjct: 21 EPYIGNTALDYALGYSLGEHPYRYVDATQRPTYNEDLAPINDFGYVTPAKPVYRNTMLGI 80
Query: 196 PTYPPSAYPGYNPNAAAAPSGSIYPSY 222
Y + YP A + YP++
Sbjct: 81 WYYQTNNYP-----AEMVGTLYNYPNF 102
>gnl|CDD|220944 pfam11018, Cuticle_3, Pupal cuticle protein C1. Insect cuticles
are composite structures whose mechanical properties are
optimised for biological function. The major components
are the chitin filament system and the cuticular
proteins, and the cuticle's properties are determined
largely by the interactions between these two sets of
molecules. The proteins can be ordered by species.
Length = 164
Score = 29.2 bits (65), Expect = 2.4
Identities = 27/122 (22%), Positives = 35/122 (28%), Gaps = 17/122 (13%)
Query: 154 YSSRTKQE-PATNPTYN-GNMASNAPYPTQSYMPMPGSNVNTP--------YPTYPPSAY 203
YSS K + +N Y + Y + Y A
Sbjct: 33 YSSVRKSDTRISNNAYQPAYAKTAYAYAAPAVYAAAAPVYAAHAYAAPAVHYAAAAHYAA 92
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTP---SPAP 260
P Y A AAP+ + +Y A P + Y P A P SPA
Sbjct: 93 PAYAKYAYAAPAVTAKAAY----AAPAPVYKTAYAAAAPAVYAHAAPVVATATVAYSPAA 148
Query: 261 AV 262
AV
Sbjct: 149 AV 150
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 29.2 bits (66), Expect = 2.5
Identities = 16/82 (19%), Positives = 32/82 (39%), Gaps = 11/82 (13%)
Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAE--LER 333
+ ++ + ++ K A+ EIE ++ EL +LA AE L +
Sbjct: 88 IAEEIKAEAEEELERIKEAAEAEIEAEKERALEELRAEVAELAV-------AIAEKLLGK 140
Query: 334 SISFLKEKETELDEILAVLTEK 355
+ ++ +D +A L E
Sbjct: 141 KVDE-AAQKDLIDAFIAELGEN 161
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 30.1 bits (68), Expect = 2.5
Identities = 25/170 (14%), Positives = 44/170 (25%), Gaps = 17/170 (10%)
Query: 174 SNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGG 233
A +P Q P P + P + AP+ P P
Sbjct: 187 RQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQP 246
Query: 234 YSPYPLY--PPGGSTPA-------------TPYPVPTPSPAPAVTSGGTITDEHIKMSLK 278
P PP P P PTP P + L
Sbjct: 247 QFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLP 306
Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
+ + +++ + + Q + + +Q E + + R + + K
Sbjct: 307 LVQQPQGQQRGPQFREQLVQLSQ--QQREALSQEEAKRAKRRHKIVSLSK 354
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 29.3 bits (66), Expect = 2.6
Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 4/76 (5%)
Query: 291 EQKVQAQEE---IEILKQTENELNQGKFKLARMFERIDKEKAELERSIS-FLKEKETELD 346
++++QA E I+ + + +G E ++E+ + S+ +++ + D
Sbjct: 88 DEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDD 147
Query: 347 EILAVLTEKTEVDVDE 362
EI+ +L E EVD DE
Sbjct: 148 EIVEILIEDDEVDEDE 163
>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
Length = 535
Score = 29.9 bits (68), Expect = 2.7
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 7/67 (10%)
Query: 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI---LAVLTEKTE 357
EI K+ L + KL + R KE+ +++ +F K + EI LA + EK E
Sbjct: 439 EIEKEIRLALQEVARKLKKYLSRKRKEEEAKKKAKTFEK----YIPEIARKLAEILEKEE 494
Query: 358 VDVDEAV 364
D++E +
Sbjct: 495 PDIEEVL 501
>gnl|CDD|224729 COG1816, Add, Adenosine deaminase [Nucleotide transport and
metabolism].
Length = 345
Score = 29.6 bits (67), Expect = 3.0
Identities = 10/53 (18%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 37 RINVSQNVDHTGKIYSPYLHTWTPQNSKLEGLIKTLASAFSALPPLYSSRTKQ 89
+ NV + + PYLH T + ++ +++ L + F + +K
Sbjct: 92 EDAAADNVVYAEIRFDPYLH--TKRGLSVDTVVEGLIAGFRPAERDFGIHSKL 142
Score = 29.2 bits (66), Expect = 3.8
Identities = 10/53 (18%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 108 RINVSQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQ 160
+ NV + + PYLH T + ++ +++ L + F + +K
Sbjct: 92 EDAAADNVVYAEIRFDPYLH--TKRGLSVDTVVEGLIAGFRPAERDFGIHSKL 142
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 29.7 bits (67), Expect = 3.0
Identities = 22/120 (18%), Positives = 35/120 (29%), Gaps = 13/120 (10%)
Query: 200 PSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPA 259
P A AA A + P P P P P P
Sbjct: 124 PPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTP--PAAAKPPEPA---PAAKPP 178
Query: 260 PAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLAR 319
P + + + MS ++R+++ E+ +Q +L T NE + R
Sbjct: 179 PTPVARADPRETRVPMS-------RMRQRIAERLKASQNTCAMLT-TFNECDMSALMELR 230
>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family.
Length = 424
Score = 29.7 bits (67), Expect = 3.1
Identities = 10/58 (17%), Positives = 17/58 (29%), Gaps = 1/58 (1%)
Query: 191 VNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPA 248
+ + Y P + + S S Y S+ + + PY P P
Sbjct: 141 SSAQHLFYIIPGGPSWRTSIKPFSS-SHYGSHNSSAYSDHLRPLQPYSGELPPPLGPN 197
>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
[Transcription].
Length = 217
Score = 29.3 bits (66), Expect = 3.2
Identities = 22/105 (20%), Positives = 39/105 (37%), Gaps = 3/105 (2%)
Query: 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 343
+ R + ++ E I L E+N+ +L+R + K+ EL I E +
Sbjct: 104 ESERSEVSKLFKSPEGISELISRLLEINREIEELSRAQTELQKQLNELMDRIKEEIEDKD 163
Query: 344 ELDEILAV---LTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
V L + DV+E T +++L E+ E
Sbjct: 164 GDMTERIVLEYLLKNGAADVEETSRRTVLKIEEVLEILAEKGFVE 208
>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein. A novel
genetic system characterized by six major proteins,
included a ParB homolog and a ThiF homolog, is
designated PRTRC, or ParB-Related,ThiF-Related Cassette.
It is often found on plasmids. This protein family the
member related to ParB, and is designated PRTRC system
ParB family protein.
Length = 554
Score = 29.7 bits (67), Expect = 3.2
Identities = 16/54 (29%), Positives = 20/54 (37%), Gaps = 5/54 (9%)
Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
AAA P+ P G A SP G + PA VP+ P+ V
Sbjct: 329 AAAAQK-----PAAPAAGPGTPAKEKSPAETATSGAAKPAAKKAVPSSQPSNRV 377
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 30.0 bits (67), Expect = 3.3
Identities = 15/63 (23%), Positives = 23/63 (36%), Gaps = 7/63 (11%)
Query: 150 LPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPN 209
LPP + ++ +P NA Q + P PG+ P A G+ +
Sbjct: 204 LPPQMAGQSMYQPPGP-------YPNAMVGRQPFYPQPGAVAGPPKRRGGHKAPRGHRAS 256
Query: 210 AAA 212
AA
Sbjct: 257 QAA 259
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 28.6 bits (64), Expect = 3.3
Identities = 21/106 (19%), Positives = 36/106 (33%), Gaps = 30/106 (28%)
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345
RR EQK++ QE+ E E + KEK EL +E+ +
Sbjct: 36 RRAEIEQKLEEQEKQER-------------------EELRKEKREL------FEERRRKQ 70
Query: 346 DEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYM 391
E+ + + + + E + T T+ +YY
Sbjct: 71 LELRKLEQKMEDEKLQETWHEHNLALANFIRT-----KTKPHLYYR 111
>gnl|CDD|225471 COG2919, COG2919, Septum formation initiator [Cell division and
chromosome partitioning].
Length = 117
Score = 28.5 bits (64), Expect = 3.4
Identities = 6/26 (23%), Positives = 9/26 (34%)
Query: 287 RKMNEQKVQAQEEIEILKQTENELNQ 312
++ Q Q E+E L L
Sbjct: 53 LQLQRQIAAQQAELEKLSARNTALEA 78
>gnl|CDD|237463 PRK13668, PRK13668, hypothetical protein; Provisional.
Length = 267
Score = 29.2 bits (66), Expect = 3.4
Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 14/79 (17%)
Query: 320 MFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFT 379
M +++ K + + SF +EKET L + D + VT + P I+ +
Sbjct: 6 MRDKLKKRLSHPDWDFSFDREKET-----LRI----ERKDNKKGVTISLP---PIVAKWE 53
Query: 380 E--EAATEDTIYYMGEALR 396
E E A ++ +YY+ EAL
Sbjct: 54 EKKEKAIDEIVYYVEEALA 72
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 29.8 bits (68), Expect = 3.4
Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 23/93 (24%)
Query: 288 KMNEQKVQAQEEIEILKQT--------------ENELNQGKFKLARMFERIDKEKAELER 333
EQ Q E++ L+Q+ E +L +L ++++ + AE +
Sbjct: 317 HAREQNKQLLLELDRLQQSYTLNEDELETVRELEKQLE----ELEAQYDQLVERIAEKKV 372
Query: 334 SISFLKEKETELDEILAVLT--EKTEVDVDEAV 364
+ S L+E EL+EI L EK + ++ E++
Sbjct: 373 AYSELQE---ELEEIEKQLEEIEKEQEELSESL 402
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 29.7 bits (67), Expect = 3.5
Identities = 12/77 (15%), Positives = 36/77 (46%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
+KI + Q ++ ++ ++ L+Q + K + + + + EL L++ +
Sbjct: 251 EKISSREKHLNNQLEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEISEELEQVK 310
Query: 343 TELDEILAVLTEKTEVD 359
E++E A +++ + +
Sbjct: 311 QEMEERGASMSDGSPLV 327
>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
This is the conserved N-terminal 350 residues of a
family of proteins of unknown function possibly
containing a coiled-coil domain.
Length = 308
Score = 29.6 bits (66), Expect = 3.5
Identities = 22/98 (22%), Positives = 39/98 (39%), Gaps = 3/98 (3%)
Query: 252 PVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQ--EEIEILKQTENE 309
PV P + + T I + S + R + N K +A E++E + E +
Sbjct: 204 PVSEPPSPRDIFTEAEDTVGGIASHITSLRSEVRRLRQNLAKSEADYTEKMEQYAKEERQ 263
Query: 310 LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
+ +L R R + + L R +S E E+D+
Sbjct: 264 TREENIRLQRKLLREVERREALCRQLS-ESESSLEMDD 300
Score = 28.0 bits (62), Expect = 9.8
Identities = 29/119 (24%), Positives = 54/119 (45%), Gaps = 30/119 (25%)
Query: 292 QKVQA-QEEIEILKQT-ENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
+K+QA ++E E L E E L+R ++ +EK +LE+++ +E+E ++++++
Sbjct: 84 KKIQALKKEKETLAMNYEQEEEFLTNDLSRKLTQLRQEKIQLEQTLE--QEQEYQVNKLM 141
Query: 350 AVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQ 408
+ +K E D T+ E LRR +DLE L+Q
Sbjct: 142 RKI-DKLEADTLAKQTSL-------------------------EQLRREKVDLENTLEQ 174
>gnl|CDD|225502 COG2952, COG2952, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 183
Score = 29.0 bits (65), Expect = 3.6
Identities = 22/122 (18%), Positives = 53/122 (43%), Gaps = 8/122 (6%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
E +K + +E ++ E + Q + E+L++ +E+ + ++F + ++ A+
Sbjct: 32 ELLKELIIGILEKSVK---KENALDEQAK-ELLEENTDEIEFMRMDERQLFWMVKRKLAQ 87
Query: 331 LERSISFLKEKETEL-DEILAVLTEK--TEVDVDEAVTTTAPIYKQILNTFTEEAATEDT 387
E +E+ +L +IL + ++ V E + IYK I + E+
Sbjct: 88 EENFHLDWEERCNDLSHQILNKILDEDLIMFSVSENLIRNL-IYKSIDTYLKIYESIENE 146
Query: 388 IY 389
++
Sbjct: 147 VH 148
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 28.6 bits (64), Expect = 3.7
Identities = 15/55 (27%), Positives = 18/55 (32%), Gaps = 2/55 (3%)
Query: 206 YNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
P AP + P PT + A P P PA P P +P P
Sbjct: 79 LAPLTPPAPPEPVTP--PTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPP 131
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 28.5 bits (64), Expect = 3.7
Identities = 21/112 (18%), Positives = 39/112 (34%), Gaps = 18/112 (16%)
Query: 252 PVPTPSPAPAVTSGG------TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQ 305
SP + + K A E K+RR + K + E+ ++
Sbjct: 34 KRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKE-KLRRDKLKAKKEEAEK----EK 88
Query: 306 TENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 357
+ E + + +KE+AELE+ + K + E +I+ T
Sbjct: 89 EKEE------RFMKALAEAEKERAELEKKKAEAKLMKEE-KKIMFADTSSLS 133
>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
This family consists mainly of the potato leaf roll
virus readthrough protein. This is generated via a
readthrough of open reading frame 3 a coat protein
allowing transcription of open reading frame 5 to give
an extended coat protein with a large c-terminal
addition or read through domain. The readthrough protein
is thought to play a role in the circulative aphid
transmission of potato leaf roll virus. Also in the
family is open reading frame 6 from beet western yellows
virus and potato leaf roll virus both luteovirus and an
unknown protein from cucurbit aphid-borne yellows virus
a closterovirus.
Length = 460
Score = 29.7 bits (67), Expect = 3.8
Identities = 12/21 (57%), Positives = 12/21 (57%)
Query: 241 PPGGSTPATPYPVPTPSPAPA 261
P G TP P P PTP P PA
Sbjct: 7 PKPGPTPTPPPPAPTPEPTPA 27
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 29.6 bits (67), Expect = 3.8
Identities = 16/67 (23%), Positives = 34/67 (50%), Gaps = 7/67 (10%)
Query: 298 EEIEILKQTENELNQGKFKLARMFERID--KEKAELERSISFLKEKETELDEILAVLTEK 355
E +E L E EL + ++ + ER++ ++ E E I L+E+ +L+E++A +
Sbjct: 475 ERVEEL---EAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIA--ERR 529
Query: 356 TEVDVDE 362
++
Sbjct: 530 ETIEEKR 536
>gnl|CDD|177179 MTH00118, COX3, cytochrome c oxidase subunit III; Provisional.
Length = 261
Score = 29.1 bits (66), Expect = 3.8
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 5/45 (11%)
Query: 228 TPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
TP GG +PP G P P+ VP + A + SG T+T H
Sbjct: 109 TPELGG-----QWPPTGIKPLNPFEVPLLNTAVLLASGVTVTWAH 148
>gnl|CDD|214403 MTH00024, COX3, cytochrome c oxidase subunit III; Validated.
Length = 261
Score = 29.3 bits (66), Expect = 3.8
Identities = 13/33 (39%), Positives = 19/33 (57%)
Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
+PP G P P+ VP + A ++SG T+T H
Sbjct: 116 WPPQGINPLNPFSVPLLNTAVLLSSGATVTWAH 148
>gnl|CDD|217305 pfam02970, TBCA, Tubulin binding cofactor A.
Length = 91
Score = 27.6 bits (62), Expect = 4.1
Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 3/58 (5%)
Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI 348
E+ + LK+ E L + + L R+ K A+LE +L+E E L+E+
Sbjct: 31 EKLKAEGADEYDLKKQEEVLKETEMMLPDCKRRLQKAVADLE---EYLEEYEEGLEEL 85
>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
Length = 175
Score = 28.9 bits (65), Expect = 4.1
Identities = 17/94 (18%), Positives = 38/94 (40%), Gaps = 16/94 (17%)
Query: 271 EHIKMSLKSAVEDK-IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
E++K ++ + ++ + + Q + + + Q + +L + R+D
Sbjct: 47 EYVKSQIEEGQNMQDTQKPSSNDVEETQVNTIVQNISSVDFEQLEAQLNTITRRLD---- 102
Query: 330 ELER----------SISFLKEKETELDEILAVLT 353
ELER S L+ + E++E+L L
Sbjct: 103 ELERQLQQKADDVVSYQLLQHRR-EMEEMLERLQ 135
>gnl|CDD|184804 PRK14720, PRK14720, transcript cleavage factor/unknown domain
fusion protein; Provisional.
Length = 906
Score = 29.7 bits (67), Expect = 4.1
Identities = 16/58 (27%), Positives = 23/58 (39%), Gaps = 6/58 (10%)
Query: 340 EKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRR 397
K E ++ V + D D I K+ L T EE E I Y+ +A+ R
Sbjct: 127 AKLNENKKLKGVWERLVKADRDNPE-----IVKK-LATSYEEEDKEKAITYLKKAIYR 178
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 29.6 bits (67), Expect = 4.1
Identities = 16/82 (19%), Positives = 39/82 (47%)
Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
+ K K + + ++ E Q+E + L++ N L + +L + ++KE E
Sbjct: 388 KKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKE 447
Query: 331 LERSISFLKEKETELDEILAVL 352
LE+ ++ ++ E++++L
Sbjct: 448 LEKQLTNIEPTADEINKLLKAY 469
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 29.3 bits (65), Expect = 4.2
Identities = 14/65 (21%), Positives = 20/65 (30%), Gaps = 1/65 (1%)
Query: 179 PTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSY-PTPGATPGAGGYSPY 237
P + P N P + P +P P+ P + P G P P
Sbjct: 215 PRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPR 274
Query: 238 PLYPP 242
+PP
Sbjct: 275 AGFPP 279
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 29.1 bits (65), Expect = 4.3
Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
+E K +++K + NE + E+ E K +++ + + ++ KA
Sbjct: 166 EETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYD--EEDNPVEDSKA 223
Query: 330 ELERSISFLKEKETE 344
E E++ E
Sbjct: 224 IKEELAKEPVEEQQE 238
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 28.3 bits (64), Expect = 4.3
Identities = 19/66 (28%), Positives = 38/66 (57%), Gaps = 4/66 (6%)
Query: 283 DKIRRKMNEQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
KI ++ K AQ+++E K+ + EL + + +L ++ E++ K+ A L S + ++K
Sbjct: 7 QKILQESPAGK-AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATL--SEAAREKK 63
Query: 342 ETELDE 347
E EL +
Sbjct: 64 EKELQK 69
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 29.4 bits (66), Expect = 4.4
Identities = 20/109 (18%), Positives = 38/109 (34%), Gaps = 17/109 (15%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQ------TENELNQGKFKLARMFERIDKEKAELERSIS 336
+ R K EQ +A ++E + L + K + A + + + +A + S
Sbjct: 167 ELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLK-RRAELGSQQVQGEALELPNES 225
Query: 337 FLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
F++ K D EK D+ + K + + E E
Sbjct: 226 FIRSK-FPSDIN-----EK----ADKGFRGFDCLNKVPIFSIQIEGKIE 264
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 29.7 bits (66), Expect = 4.4
Identities = 22/114 (19%), Positives = 27/114 (23%), Gaps = 4/114 (3%)
Query: 149 ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
A P + + Q+P A+ P Y P P PY P P
Sbjct: 387 AQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYA-PAPEQPVAGN 445
Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
A S + P P P P P P V
Sbjct: 446 AWQAEEQQSTFA--PQSTYQTEQTYQQPAAQEPLYQQPQPVE-QQPVVEPEPVV 496
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 29.3 bits (66), Expect = 4.5
Identities = 15/94 (15%), Positives = 32/94 (34%), Gaps = 11/94 (11%)
Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
DE +S E + ++K +AQ + ++ K + +K+
Sbjct: 255 DESAWEGFESEYEPINKPVRPKRKTKAQRN-KEKRR----------KELEREAKEEKQLK 303
Query: 330 ELERSISFLKEKETELDEILAVLTEKTEVDVDEA 363
+ ++ LKE E+ + K E +
Sbjct: 304 KKLAQLARLKEIAKEVAQKEKARARKKEQRKERG 337
>gnl|CDD|203376 pfam06016, Reovirus_L2, Reovirus core-spike protein lambda-2 (L2).
This family consists of several Reovirus core-spike
protein lambda-2 (L2) sequences. The reovirus L2 genome
segment encodes the core spike protein lambda-2, which
mediates enzymatic reactions in 5' capping of the viral
plus-strand transcripts.
Length = 1290
Score = 29.4 bits (66), Expect = 4.5
Identities = 16/61 (26%), Positives = 22/61 (36%), Gaps = 3/61 (4%)
Query: 176 APYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYS 235
APY ++ + N + P G+ A S S+ PS PT PG
Sbjct: 1232 APYDSREWAVKESGNTFVILDSQPFVLPQGW---DVLALSTSVSPSRPTYIVPPGDYTLV 1288
Query: 236 P 236
P
Sbjct: 1289 P 1289
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 29.4 bits (67), Expect = 4.6
Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 15/89 (16%)
Query: 288 KMNEQKVQAQEEIEILKQ----TENEL-NQGKFK-----LARMFERIDKEKAELERSISF 337
EQ + +EEI+ +KQ E+EL + + + L + ++ I + AE E + S
Sbjct: 321 HAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSE 380
Query: 338 LKEKETELDEILAVLT--EKTEVDVDEAV 364
L+E EL+EIL L EK + + E +
Sbjct: 381 LQE---ELEEILKQLEEIEKEQEKLSEML 406
Score = 29.0 bits (66), Expect = 5.4
Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 11/76 (14%)
Query: 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE--KTEV 358
E L + E ++ +F+ K K E+ S L E ++++IL L E ++E
Sbjct: 86 EQLFEAEELNDKFRFR---------KAKHEINEIESLLDLIEEDIEQILEELQELLESEE 136
Query: 359 DVDEAVTTTAPIYKQI 374
E V +Y+++
Sbjct: 137 KNREEVEQLKDLYREL 152
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 28.6 bits (64), Expect = 4.6
Identities = 10/44 (22%), Positives = 19/44 (43%), Gaps = 1/44 (2%)
Query: 319 RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDE 362
R +R +E ELE+ +D++L+ L E++
Sbjct: 40 RREQR-KQEIEELEKERIMNDRLLERVDKLLSELKEESLDSSQA 82
>gnl|CDD|129690 TIGR00602, rad24, checkpoint protein rad24. All proteins in this
family for which functions are known are involved in DNA
damage tolerance (likely cell cycle checkpoints).This
family is based on the phylogenomic analysis of JA Eisen
(1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 637
Score = 29.2 bits (65), Expect = 4.7
Identities = 24/110 (21%), Positives = 35/110 (31%), Gaps = 4/110 (3%)
Query: 274 KMSLKS-AVEDKIRRKMNEQKVQAQEEIEILKQTENEL-NQGKFKLARMFERIDKEKAEL 331
+MS KS A K + K +EI+ L + L +R + +
Sbjct: 339 RMSTKSDAHASKSKIKGKHSSNNENQEIQALGGKDVSLFLFRALGKILYCKRATLNELDS 398
Query: 332 ERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE 381
R S L E D ++ E E+ T Q N F E
Sbjct: 399 PRLPSHLSELSR--DTLMVGPEEVVEMSHMPGDKTFNLYSHQNYNDFFVE 446
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 29.1 bits (65), Expect = 4.7
Identities = 19/53 (35%), Positives = 21/53 (39%)
Query: 208 PNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
PN+ AA + P P TP G P PATP P P PAP
Sbjct: 147 PNSTAARAVVFLVVGPRPVPTPPGGTPPPDDDEGDEAGAPATPAPPLHPPPAP 199
>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region. The region
described in this family is found towards the N-terminus
of various eukaryotic fork head/HNF-3-related
transcription factors (which contain the pfam00250
domain). These proteins play key roles in embryogenesis,
maintenance of differentiated cell states, and
tumorigenesis.
Length = 137
Score = 28.0 bits (62), Expect = 4.9
Identities = 18/85 (21%), Positives = 26/85 (30%), Gaps = 6/85 (7%)
Query: 170 GNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATP 229
+M S Y + ++ P A G + ++ S P Y G+
Sbjct: 19 YSMNSMNTYGPMNTSQGSANSSMNMSGYAGPGAMNGMSSSSMNGMS----PGYGGAGSPM 74
Query: 230 GAGGYSPYP--LYPPGGSTPATPYP 252
G G S L P G P P
Sbjct: 75 GMMGMSSMGTSLSPSGTMGAMGPMP 99
>gnl|CDD|214464 MTH00219, COX3, cytochrome c oxidase subunit III; Provisional.
Length = 262
Score = 28.6 bits (64), Expect = 5.3
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
+PP G P P+ VP + A + SG T+T H
Sbjct: 117 WPPTGINPLNPFQVPLLNTAVLLASGVTVTWAH 149
>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
Length = 1213
Score = 29.1 bits (65), Expect = 5.3
Identities = 25/104 (24%), Positives = 47/104 (45%), Gaps = 20/104 (19%)
Query: 276 SLKSAVEDKIRR-KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
LKS+++D + KM E++ K T++ L+ K K + E ++K+K E
Sbjct: 26 VLKSSIKDSTQFWKMLEKQ---------QKLTKDGLSASKGKYEGLSEAMEKQKKAYED- 75
Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDE--AVTTTAPIYKQILN 376
LK++ E++ T+ ++ E A T A + L+
Sbjct: 76 ---LKQEVKEVNR----ATQASKKAYQEYNAQYTQAENKLRSLS 112
>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus. This family includes the
NUT protein. The gene encoding for NUT protein (Nuclear
Testis protein) is found fused to BRD3 or BRD4 genes, in
some aggressive types of carcinoma, due to chromosomal
translocations. Proteins of the BRD family contain two
bromodomains that bind transcriptionally active
chromatin through associations with acetylated histones
H3 and H4. Such proteins are crucial for the regulation
of cell cycle progression. On the other hand, little is
known about NUT protein. NUT is known to have a Nuclear
Export Sequence (NES) as well as a Nuclear Localization
Signal (NLS), both located towards the C-terminal end of
the protein. A fused NUT-GFP protein showed either
cytoplasmic or nuclear localization, suggesting that it
is subject to nuclear/cytoplasmic shuttling. Consistent
with this possibility, treatment with leptomycin B an
inhibitor of CRM1-dependent nuclear export resulted in
re-distribution of NUT-GFP to the nucleus. Inspection of
NUT revealed a C-terminal sequence similar to known
nuclear export sequences (NES) which are often regulated
by phosphorylation.
Length = 328
Score = 28.8 bits (64), Expect = 5.3
Identities = 18/60 (30%), Positives = 23/60 (38%), Gaps = 1/60 (1%)
Query: 207 NPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
NP A+ +P ++ P P P PL G S P P + P VT GG
Sbjct: 10 NPGASLSPFAALPFLPPAPQPPDQPFWEPPPPLVTAGFS-PGNPLVLSALPSMPLVTEGG 68
>gnl|CDD|164623 MTH00052, COX3, cytochrome c oxidase subunit III; Provisional.
Length = 262
Score = 28.6 bits (64), Expect = 5.4
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 239 LYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
++PP G P P+ VP + A ++SG T+T H
Sbjct: 116 VWPPRGVDPLNPFSVPLLNTAVLLSSGATVTWAH 149
>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
This family represents the C-terminal region of
merozoite surface protein 1 (MSP1) which are found in a
number of Plasmodium species. MSP-1 is a 200-kDa protein
expressed on the surface of the P. vivax merozoite.
MSP-1 of Plasmodium species is synthesised as a
high-molecular-weight precursor and then processed into
several fragments. At the time of red cell invasion by
the merozoite, only the 19-kDa C-terminal fragment
(MSP-119), which contains two epidermal growth
factor-like domains, remains on the surface. Antibodies
against MSP-119 inhibit merozoite entry into red cells,
and immunisation with MSP-119 protects monkeys from
challenging infections. Hence, MSP-119 is considered a
promising vaccine candidate.
Length = 574
Score = 29.1 bits (65), Expect = 5.4
Identities = 13/71 (18%), Positives = 25/71 (35%), Gaps = 1/71 (1%)
Query: 214 PSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP-ATPYPVPTPSPAPAVTSGGTITDEH 272
P G+ + T TP +P PL + + P+ + G T+
Sbjct: 259 PKGTTQEAKVTTVVTPPQADAAPSPLSVRPAGSSGSASGSTQIPTSGSVLGPGAAATELQ 318
Query: 273 IKMSLKSAVED 283
+ L++ E+
Sbjct: 319 QVVQLQNYDEE 329
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 28.7 bits (64), Expect = 5.4
Identities = 19/74 (25%), Positives = 22/74 (29%), Gaps = 6/74 (8%)
Query: 199 PPSAYPGYNPNAAAAPSGS---IYPSYPTPGATP--GAGGYSPYPLYPPGGSTPATPYPV 253
PP P +P S PS P PGA P G P P G +
Sbjct: 11 PPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPAGVTFSS 70
Query: 254 PTPSPAPAVTSGGT 267
P P P +
Sbjct: 71 SAP-PRPPLGLDDA 83
>gnl|CDD|224333 COG1415, COG1415, Uncharacterized conserved protein [Function
unknown].
Length = 373
Score = 28.9 bits (65), Expect = 5.5
Identities = 4/31 (12%), Positives = 16/31 (51%)
Query: 385 EDTIYYMGEALRRGVIDLEVYLKQVRSLSRK 415
++ I ++ E + + + + L+ ++ L+
Sbjct: 334 DELIEFLEELVEKARLGRQEKLRALKRLAEL 364
>gnl|CDD|223091 COG0012, COG0012, Predicted GTPase, probable translation factor
[Translation, ribosomal structure and biogenesis].
Length = 372
Score = 29.1 bits (66), Expect = 5.5
Identities = 20/103 (19%), Positives = 40/103 (38%), Gaps = 15/103 (14%)
Query: 283 DKIRRKM---NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK 339
+++ ++ + + +EE+ +L + E L +GK + +E E S++ L
Sbjct: 146 ERLEKRAKAGKKLDKELKEELSLLGKLEEHLEEGKPARGLDLSKWSEEDLEALASLNLLT 205
Query: 340 EK------------ETELDEILAVLTEKTEVDVDEAVTTTAPI 370
K L+E + L E + E V +A I
Sbjct: 206 AKPMLYVANVSEDDLANLNEYVKRLKELAAKENAEVVPVSAAI 248
>gnl|CDD|173423 PTZ00133, PTZ00133, ADP-ribosylation factor; Provisional.
Length = 182
Score = 28.3 bits (63), Expect = 5.6
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 8/47 (17%)
Query: 322 ERIDKEKAELERSISFLKEKETEL-DEILAVLTEKTEVDVDEAVTTT 367
ERI + ELER +S E EL D +L V K D+ A++TT
Sbjct: 98 ERIGDAREELERMLS-----EDELRDAVLLVFANKQ--DLPNAMSTT 137
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 29.0 bits (65), Expect = 5.6
Identities = 16/65 (24%), Positives = 33/65 (50%), Gaps = 4/65 (6%)
Query: 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
+ +KV A E ++ E+ L + K A +RI + +AEL++ + L + E + +
Sbjct: 6 SHRKVAADGE----EREESLLQESASKEAYYLQRILELQAELKQLRAELSNVQAENERLS 61
Query: 350 AVLTE 354
++ E
Sbjct: 62 SLSQE 66
>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
Length = 1634
Score = 29.1 bits (65), Expect = 5.6
Identities = 19/101 (18%), Positives = 30/101 (29%), Gaps = 16/101 (15%)
Query: 79 LPPLYSSRTKQEPATNPT-------------YNGNIRPTADMRINVSQNVDHTGKIY--- 122
L PL E +P Y +R R N Q+ + I+
Sbjct: 528 LDPLKDDGVINEYNLDPQAAAVNAARRASTVYLLPLRDRITARRNELQDPQASTPIFPYP 587
Query: 123 SPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPA 163
+P T+T + + LA + P S + P
Sbjct: 588 APSNFTFTNITAGDVKYVAALAPGYDLTAPTDLSLVEYNPV 628
Score = 28.8 bits (64), Expect = 8.7
Identities = 16/48 (33%), Positives = 20/48 (41%), Gaps = 8/48 (16%)
Query: 206 YNPNAAAAPSGSIYPSYPTPGATPGAGGYSPY--PLYPPGGSTPATPY 251
YN + P+ I P+Y T G G PY L+ P G PY
Sbjct: 1035 YNCTELSNPTYGISPTYNTSGIKGETIG--PYCKELFNPQG----LPY 1076
>gnl|CDD|149453 pfam08397, IMD, IRSp53/MIM homology domain. The N-terminal
predicted helical stretch of the insulin receptor
tyrosine kinase substrate p53 (IRSp53) is an
evolutionary conserved F-actin bundling domain involved
in filopodium formation. The domain has been named IMD
after the IRSp53 and missing in metastasis (MIM)
proteins in which it occurs. Filopodium-inducing IMD
activity is regulated by Cdc42 and Rac1 and is
SH3-independent.
Length = 218
Score = 28.6 bits (64), Expect = 5.7
Identities = 19/84 (22%), Positives = 32/84 (38%), Gaps = 10/84 (11%)
Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
K LK A+ED +++ ++ + Q E L + + +F +DK
Sbjct: 139 SKYQLKEALEDVNDKQLELEEFEKQGVREALLE-----EKRRF---CFL--VDKHCPVSN 188
Query: 333 RSISFLKEKETELDEILAVLTEKT 356
++F E T L L E T
Sbjct: 189 EELAFHGEAITLLQTKLPDWQELT 212
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 28.0 bits (63), Expect = 5.7
Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 5/63 (7%)
Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 346
K++ + + +EE+++LKQ + L L EL+ I LK++ E++
Sbjct: 75 NKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDL-----TTEELQEEIQELKKEVREIE 129
Query: 347 EIL 349
E L
Sbjct: 130 EKL 132
>gnl|CDD|223961 COG1030, NfeD, Membrane-bound serine protease (ClpP class)
[Posttranslational modification, protein turnover,
chaperones].
Length = 436
Score = 28.9 bits (65), Expect = 5.7
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 377 TFTEEAATEDTIYYMGEALRRGVIDL 402
T+ E TE+ EALR+GVIDL
Sbjct: 159 TWAERFVTENLSLTAEEALRQGVIDL 184
>gnl|CDD|221923 pfam13102, DUF3946, Protein of unknown function (DUF3946). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences. This family appears
related to the N-terminal domain of phage integrases.
Length = 140
Score = 28.0 bits (63), Expect = 5.8
Identities = 14/83 (16%), Positives = 34/83 (40%), Gaps = 8/83 (9%)
Query: 265 GGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERI 324
G E + + L +DK ++ N++ + E I + EL ++ ++
Sbjct: 23 NGKRKYEFLGLYLYPEPKDKAEKEQNKETLLLAE--AIRAKRILELQNEEYGFE---DKT 77
Query: 325 DKEKAELERSISFLKEKETELDE 347
+K K + + + KE + ++
Sbjct: 78 EKSKIDF---LDYFKEYIEKKEK 97
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 28.7 bits (64), Expect = 5.9
Identities = 17/86 (19%), Positives = 25/86 (29%), Gaps = 11/86 (12%)
Query: 191 VNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATP 250
V+ P A P P +A P+ S P+ P G + P +P
Sbjct: 74 VSKPKTGTGKVAPPAATPTSAPTPTPS-PPASPASGMSAA----------PASAVEEKSP 122
Query: 251 YPVPTPSPAPAVTSGGTITDEHIKMS 276
+ AP S + S
Sbjct: 123 SEESATATAPESPSTSVPSSGSDAAS 148
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 28.9 bits (65), Expect = 5.9
Identities = 32/132 (24%), Positives = 52/132 (39%), Gaps = 23/132 (17%)
Query: 288 KMNEQKVQAQEEIEILKQT----ENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 343
K E +EEIE +K++ E EL G + + +KE LKE E+
Sbjct: 320 KAKENNEHLKEEIERVKESYRLAETEL--GSVR------KFEKE----------LKELES 361
Query: 344 ELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLE 403
LDEIL E EV E I K + + E+ ++ + + + +LE
Sbjct: 362 VLDEILEN-IEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLE 420
Query: 404 VYLKQVRSLSRK 415
++ + R
Sbjct: 421 RLKSKLHEIKRY 432
>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
synthase PfaA. Members of the seed for this alignment
are involved in omega-3 polyunsaturated fatty acid
biosynthesis, such as the protein PfaA from the
eicosapentaenoic acid biosynthesis operon in
Photobacterium profundum strain SS9. PfaA is encoded
together with PfaB, PfaC, and PfaD, and the functions of
the individual polypeptides have not yet been described.
More distant homologs of PfaA, also included with the
reach of this model, appear to be involved in
polyketide-like biosynthetic mechanisms of
polyunsaturated fatty acid biosynthesis, an alternative
to the more familiar iterated mechanism of chain
extension and desaturation, and in most cases are encoded
near genes for homologs of PfaB, PfaC, and/or PfaD.
Length = 2582
Score = 29.2 bits (65), Expect = 6.0
Identities = 45/204 (22%), Positives = 71/204 (34%), Gaps = 21/204 (10%)
Query: 169 NGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGAT 228
+ N+ +P TQ+ P+ S V T P + A AAP S P A
Sbjct: 1113 DSNIVKLSPLATQA--PVIKSVV-TQAPVVQVTISV-----APAAPVLPAVVSPPVVSAA 1164
Query: 229 PGAGGYSPYPLYPPGG---STPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKI 285
P + + P + P PS A A ++ D SA++ +
Sbjct: 1165 PAQSVATAVAMAPVAEVPIAVPVQQSVDYMPSVAQAAAPQASVND--------SAIQQVM 1216
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL-ERSISFLKEKETE 344
+ E+ E +E+ E +L K + + + +L E + L E T
Sbjct: 1217 MEVVAEKTGYPTEMLELEMDMEADLGIDSIKRVEILGSVQEIINDLPELNPEDLAELRT- 1275
Query: 345 LDEILAVLTEKTEVDVDEAVTTTA 368
L EI+ + K V A TA
Sbjct: 1276 LGEIVNYMQSKVSAAVASAPVNTA 1299
>gnl|CDD|235179 PRK03947, PRK03947, prefoldin subunit alpha; Reviewed.
Length = 140
Score = 28.0 bits (63), Expect = 6.0
Identities = 8/28 (28%), Positives = 17/28 (60%)
Query: 328 KAELERSISFLKEKETELDEILAVLTEK 355
+ +L+ +I L +++ EL++ L L E
Sbjct: 89 EKDLDEAIEILDKRKEELEKALEKLEEA 116
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 28.9 bits (66), Expect = 6.1
Identities = 23/88 (26%), Positives = 43/88 (48%), Gaps = 15/88 (17%)
Query: 280 AVEDKIRRK-----------MNEQKVQAQEEIEILKQTENELNQ--GKFKLARMFERIDK 326
AV++ + ++ ++E++ + Q E+E L+ N L++ G+ K E +
Sbjct: 13 AVKEALAKRGFPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKG--EDAEA 70
Query: 327 EKAELERSISFLKEKETELDEILAVLTE 354
AE++ +K E ELDE+ A L E
Sbjct: 71 LIAEVKELKEEIKALEAELDELEAELEE 98
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 28.7 bits (64), Expect = 6.5
Identities = 35/179 (19%), Positives = 54/179 (30%), Gaps = 28/179 (15%)
Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG 232
N P T PMP + TP T S+ TP TP
Sbjct: 263 QKNTPAQT----PMPERSWQTPAQTPARRISTPMTEEI---------KSWQTPLQTPAMY 309
Query: 233 GYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQ 292
P T PS A++S E + K RR
Sbjct: 310 SSDYQAPKPEPIYTWEELLRERFPSDLFAISSLPDSDSE--ASDSGPTRKRKRRRVPPLP 367
Query: 293 KVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351
+ + E+ + + E + +E+ K + E ++ +FL+ K EL + V
Sbjct: 368 EYSSDEDEDDSDEDEVD-----------YEKERKRRREEDK--NFLRLKALELSKYAGV 413
>gnl|CDD|224417 COG1500, COG1500, Predicted exosome subunit [Translation, ribosomal
structure and biogenesis].
Length = 234
Score = 28.4 bits (64), Expect = 6.7
Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 17/89 (19%)
Query: 324 IDKEKAELERSISFLKEKETELDEILAVLT--------EK-TEVDVDEAVTTTAP--IYK 372
+D KA + + KE +L+E+LA T EK +E D+ +A TT P I +
Sbjct: 23 VDPNKAL-----EYREGKEVDLEEVLATETVFKDASKGEKASEEDLKKAFGTTDPDEIAE 77
Query: 373 QILNTFTEEAATEDTIYYMGEALRRGVID 401
+IL E T + M E +R +I+
Sbjct: 78 EILKK-GEIQLTAEQRREMLEEKKRQIIN 105
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 28.6 bits (63), Expect = 6.7
Identities = 13/41 (31%), Positives = 16/41 (39%)
Query: 220 PSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
P+ P+ GA P P P + PA P P AP
Sbjct: 94 PAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAP 134
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 28.7 bits (64), Expect = 6.7
Identities = 12/61 (19%), Positives = 25/61 (40%), Gaps = 7/61 (11%)
Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 346
E++ LK+ E L E ++KE +L+ + L+E++ +L+
Sbjct: 59 NVEISNYEALDSELDELKKEEERLLD-------ELEELEKEDDDLDGELVELQEEKEQLE 111
Query: 347 E 347
Sbjct: 112 N 112
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 28.9 bits (64), Expect = 6.9
Identities = 21/84 (25%), Positives = 32/84 (38%), Gaps = 4/84 (4%)
Query: 217 SIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPY---PVPTPSPAPAVTSGGTITDEHI 273
I P PG G SP P G +T TP P P + + T + + I
Sbjct: 258 VIEPGVGLPGVRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSI 317
Query: 274 KMSLKSAVEDK-IRRKMNEQKVQA 296
+ S + I +M E+++ A
Sbjct: 318 SGPIPSPGAPRAITGQMGEREMFA 341
>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 211
Score = 28.3 bits (63), Expect = 7.0
Identities = 13/62 (20%), Positives = 20/62 (32%), Gaps = 6/62 (9%)
Query: 210 AAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTIT 269
A+AAP S P+P + P P+ + +PA +
Sbjct: 106 ASAAPVASAPAPAPSPQSPK------PASRRASRDMRRIAPFGMNASAPAQEAAQASSDE 159
Query: 270 DE 271
DE
Sbjct: 160 DE 161
>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 27.4 bits (61), Expect = 7.2
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 8/73 (10%)
Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA---E 330
K L+ +E + R Q ++ Q+E E K+ ++EL + K A+ E+++ EKA E
Sbjct: 50 KPELQRVLEKRKRD----QVLKEQKEEEEAKRLQSELERELMKRAQKLEQLELEKAKEDE 105
Query: 331 LERSIS-FLKEKE 342
+ I FLK K
Sbjct: 106 EQEQIPEFLKVKA 118
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 28.7 bits (64), Expect = 7.3
Identities = 17/53 (32%), Positives = 19/53 (35%), Gaps = 3/53 (5%)
Query: 210 AAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
A G S P PGA P A P + PP + P P P AV
Sbjct: 901 RVATAPGGDAASAPPPGAGPPA---PPQAVPPPRTTQPPAAPPRGPDVPPAAV 950
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 28.7 bits (64), Expect = 7.8
Identities = 20/76 (26%), Positives = 26/76 (34%), Gaps = 3/76 (3%)
Query: 143 LASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSA 202
L SA A L + E + + N+ NAP + + P P P
Sbjct: 362 LPSASDAESALLQRVERIETRLDMSIPANLLHNAPQAAPAPSAAAPEPKHQPAPEPRPVL 421
Query: 203 YP---GYNPNAAAAPS 215
P PNAAA S
Sbjct: 422 APTPASGEPNAAAVRS 437
>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
Length = 292
Score = 28.5 bits (63), Expect = 7.8
Identities = 17/91 (18%), Positives = 24/91 (26%), Gaps = 9/91 (9%)
Query: 179 PTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYP 238
Q P ++V TP P+ P + A A P A P
Sbjct: 107 APQVIDTAPPNDVATPVPSVPEATAEA---AAKAGPQDDEADGEQADEAPAHNPESVPTR 163
Query: 239 LYPPGGSTPATPYPVPTPSPAPAVTSGGTIT 269
+ T + P +S IT
Sbjct: 164 ------AARETTEANRPTATPPQSSSALVIT 188
>gnl|CDD|235262 PRK04235, PRK04235, hypothetical protein; Provisional.
Length = 196
Score = 28.0 bits (63), Expect = 8.0
Identities = 11/32 (34%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
Query: 298 EEIEILKQTENE-LNQGKFKLARMFERIDKEK 328
E ++ L + NE L +GK +L R+ E +++ K
Sbjct: 163 EYLKFLVEVANEKLLRGKERLNRLEEELEELK 194
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 28.5 bits (63), Expect = 8.4
Identities = 26/123 (21%), Positives = 38/123 (30%), Gaps = 21/123 (17%)
Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
A A +A+ + ++ PT GN A + ++ P P +A
Sbjct: 317 AVAGAAVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEA----------EEVPPLPQAA- 365
Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
A G + PTP PG P + PPG P P SP +
Sbjct: 366 ------PAVVKPGPM--EIPTPVPPPGLA--IPSLVAPPGLVAPTEINPSFLASPRKKMK 415
Query: 264 SGG 266
Sbjct: 416 REK 418
>gnl|CDD|233004 TIGR00513, accA, acetyl-CoA carboxylase, carboxyl transferase,
alpha subunit. The enzyme acetyl-CoA carboxylase
contains a biotin carboxyl carrier protein or domain, a
biotin carboxylase, and a carboxyl transferase. This
model represents the alpha chain of the carboxyl
transferase for cases in which the architecture of the
protein is as in E. coli, in which the
carboxyltransferase portion consists of two
non-identical subnits, alpha and beta [Fatty acid and
phospholipid metabolism, Biosynthesis].
Length = 316
Score = 28.2 bits (63), Expect = 8.6
Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 9/81 (11%)
Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
+ +R + ++ V EEIE L++ EL + F ++R +L R ++
Sbjct: 20 ESLRARSRDEDVDLSEEIERLEKRSVELTKKIFSNLGAWQR-----LQLAR----HPDRP 70
Query: 343 TELDEILAVLTEKTEVDVDEA 363
LD I + + E+ D A
Sbjct: 71 YTLDYIELIFDDFFELAGDRA 91
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 28.8 bits (64), Expect = 8.7
Identities = 24/130 (18%), Positives = 48/130 (36%), Gaps = 21/130 (16%)
Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
+ M+ + ED+ ++Q Q+ +I + E+ L I++
Sbjct: 4234 SANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGS-------INQ 4286
Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATED 386
+ E S ++ E EL + +T + +DEA + + N E +T+
Sbjct: 4287 SEKVFELSE--DEDIEDELPDYNVKIT--PAMPIDEA--------RDLWN--KHEDSTKQ 4332
Query: 387 TIYYMGEALR 396
+ E LR
Sbjct: 4333 LSIELCEQLR 4342
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 27.8 bits (62), Expect = 8.7
Identities = 14/56 (25%), Positives = 18/56 (32%), Gaps = 1/56 (1%)
Query: 210 AAAAPSGSIYPS-YPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTS 264
A A + S T P G +P PP PA P +P +S
Sbjct: 137 APAEYYSPKHSSSQGTSTTRPSDGSATPNTSAPPTPGNPAAQPEKPAETPKGNGSS 192
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 28.5 bits (63), Expect = 8.7
Identities = 20/100 (20%), Positives = 33/100 (33%), Gaps = 5/100 (5%)
Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNT-PYPTYPPSAYPGYNPNAAAAPSGSIYP 220
P + + A+ P P P P + P P P++ P+ A AP+ +
Sbjct: 25 NIPIPELHTSAATQKPDPA----PAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASE 80
Query: 221 SYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
+ A A +P P P + P + A
Sbjct: 81 KFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAA 120
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 28.6 bits (64), Expect = 8.9
Identities = 32/141 (22%), Positives = 48/141 (34%), Gaps = 13/141 (9%)
Query: 133 NSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVN 192
S + L+ L S S+ P SSR K A + T GN S+ T S S N
Sbjct: 92 TSGSQSLLMELPS--SSFNPSTSSRNKSNSALSSTQQGNANSSV---TLSSSTAS-SMFN 145
Query: 193 TPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGY---SPYPLYPPGGSTPAT 249
+ P P ++ +A SGS + + P ++ S +P
Sbjct: 146 SN---KLPLPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVVSSIKRFPYLTSLSPFF 202
Query: 250 PYPVPTPSPAPAVTSGGTITD 270
Y + S + S T
Sbjct: 203 NYLIDPSSDSA-TASADTSPS 222
>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 424
Score = 28.3 bits (64), Expect = 9.0
Identities = 24/121 (19%), Positives = 40/121 (33%), Gaps = 11/121 (9%)
Query: 251 YPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENE- 309
+ A+ E K K + + K + Q+E++ L+ E
Sbjct: 83 KRPSRTNSQDALLQAENALPEWKKELEKPSEPKEEEPKAAAESKVVQKELDELRDELKEL 142
Query: 310 ---LNQGKFKLARMFERIDKEKAELERSISFLKE---KETELDEILAVLTEKTEVDVDEA 363
L L ++ ERI E AEL + LK +++L +L E A
Sbjct: 143 KNLLEDQLSGLRQV-ERIPPEFAELYK---RLKRSGLSPEIAEKLLKLLLEHMPPRERTA 198
Query: 364 V 364
Sbjct: 199 W 199
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 28.4 bits (63), Expect = 9.1
Identities = 18/56 (32%), Positives = 28/56 (50%)
Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
R + Q+ + Q+ I L+Q L Q K + R+ E +AELER LK++
Sbjct: 208 RNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQR 263
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.130 0.382
Gapped
Lambda K H
0.267 0.0630 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,168,786
Number of extensions: 2143914
Number of successful extensions: 4324
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3476
Number of HSP's successfully gapped: 642
Length of query: 434
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 334
Effective length of database: 6,502,202
Effective search space: 2171735468
Effective search space used: 2171735468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 60 (27.1 bits)