RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15301
(947 letters)
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 51.3 bits (123), Expect = 2e-06
Identities = 21/41 (51%), Positives = 25/41 (60%)
Query: 204 RWKLHGYTRNPDPVLNPKKKIFETVAPDLKTNDSNQATYKG 244
           R    G+   PD VLNPKKK++ET+ PDL TN    A YK
Sbjct: 740 RVTFPGFEGEPDDVLNPKKKVWETLQPDLHTNSELVACYKD 780
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 45.5 bits (108), Expect = 1e-04
Identities = 25/171 (14%), Positives = 64/171 (37%), Gaps = 6/171 (3%)
Query: 711 DNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDE 770
+++ S+ + +SK SSK ++++ +++ + N D T S +S +
Sbjct: 34 STAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFST--IDSSTSDSNNI 91
Query: 771 GNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSDLEGTARKSS 830
+ I +T + + + ++ +Q+ +SD+ + + ++
Sbjct: 92 IDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDY----EQPRNSEKSTN 147
Query: 831 DLEGTARKSNDLEQSAQKSSDLEGTAQKSSDLQGTAQKSSDLDGTARKSSD 881
D + S + Q S + QK+ T +S+ + K +
Sbjct: 148 DSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQ 198
Score = 39.0 bits (91), Expect = 0.012
Identities = 30/236 (12%), Positives = 78/236 (33%), Gaps = 13/236 (5%)
Query: 587 TSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIE 646
T+ S K D ET + S++ ++ ++ +ND+ T +
Sbjct: 35 TAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSN----- 89
Query: 647 DRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKS 706
++ + N ++ D + N ++ + ++ S
Sbjct: 90 ----NIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFN---LNSDISDYEQPRNS 142
Query: 707 TNILDNKGSSNKSSEAKRPNS-SSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDK 765
++ ++ SS ++ SSK + + + + +++ ++ P T +
Sbjct: 143 EKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQS 202
Query: 766 ESQDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSD 821
SQ + ++ +KD S ++ + S+ QK + K+
Sbjct: 203 NSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTET 258
Score = 34.8 bits (80), Expect = 0.23
Identities = 44/251 (17%), Positives = 92/251 (36%), Gaps = 30/251 (11%)
Query: 522 VTPSKTGGESTSNYQVKQTSND---EKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNN 578
V P+ T + ++ K ++ + TSKD +++ K + + +N
Sbjct: 16 VLPTLTSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKAD--NNNTSNQDN 73
Query: 579 KECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKR 638
+ S S SN D K +P I + +D + T
Sbjct: 74 NDKKFSTIDS--STSDSNNIIDFIYKNLPQTNINQLLTK-----NKYDDNYSLTTLIQNL 126
Query: 639 KRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTN 698
+ DI D + S N+ + +K + Q+ S+ + ++
Sbjct: 127 FNLNSDISD--------YEQPRNSEKSTNDSNKNSDSSIKNDTDTQS-SKQDKADNQ--- 174
Query: 699 VLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETS 758
+ S N + +SNK + +P ++ NS ++++ S K N+ S ++++
Sbjct: 175 ---KAPSSNNTKPS--TSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQ-SMSDSA 228
Query: 759 ITSESDKESQD 769
+ S D+ S+D
Sbjct: 229 LDSILDQYSED 239
Score = 32.1 bits (73), Expect = 1.7
Identities = 44/291 (15%), Positives = 97/291 (33%), Gaps = 18/291 (6%)
Query: 510 KERKTNSTSSFEVTPSKTGGESTS-NYQVKQTSNDEKYYEKKRVSTSKDEVNENGRKKRR 568
+K+N + + T SK ++ + N + ++ + S S + ++ + +
Sbjct: 42 DSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQ 101
Query: 569 ISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDE 628
+ + +T NK T+ ++ + S+ ++ ++E N + +S
Sbjct: 102 TNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSE--KSTNDSNKNSDSSIKN 159
Query: 629 PTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESE 688
T T+ + K + + S +Q++ + A T ++
Sbjct: 160 DTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSN-----SQPASDDTANQ 214
Query: 689 NREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVK 748
D ++ D +ILD S + K S SK + ++ + N
Sbjct: 215 KSSSKDN-QSMSDSA--LDSILDQY-SEDAKKTQKDYASQSKKDKTETSNTKNPQLPTQD 270
Query: 749 SNKD------SPTETSITSESDKESQDEGNPIVHKPEETKDKNVRKSSDIE 793
K S S + S E P + +++ NV S D
Sbjct: 271 ELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDSGSFNVVDSKDTR 321
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 42.6 bits (100), Expect = 0.001
Identities = 60/297 (20%), Positives = 101/297 (34%), Gaps = 30/297 (10%)
Query: 564 RKKRRISTSNEITNN-------KECSERLPTSNLEVSQSNKDGDSETKRVPSNEIL--NE 614
RK+R + NE KE + L ++ E S+++ G + TK + + N
Sbjct: 181 RKQREVQKENEFYMQLLQQALPKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNH 240
Query: 615 ENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVL-----GLDDADESGNELNED 669
SK T E + KD+ + I L + N N+
Sbjct: 241 SLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKS 300
Query: 670 QTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSS 729
+ S + S + + N ++ S+ S+ P+SS
Sbjct: 301 RPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSV 360
Query: 730 KPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKNVRKS 789
N SK S SS S KD+ ++ + + E K
Sbjct: 361 SDNESKQKRASKSSSGARDSKKDASGMSA------------NGTVENCIPENKISTPSAI 408
Query: 790 SDIEGTGQK-SSDLQDTAQKSSDLEGTARKSSDLEGTARKSSDLEGTARKSNDLEQS 845
+E +K ++LQ Q S+L + LE + + SDL G +K ND+ Q+
Sbjct: 409 ERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLK--SDL-GQLKKENDMLQT 462
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 37.4 bits (86), Expect = 0.046
Identities = 46/304 (15%), Positives = 103/304 (33%), Gaps = 34/304 (11%)
Query: 493 EESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTSNDEKYYEKKRV 552
EE +++ A E E ++K E K + ++ E K+
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK-----------AAEALKKEAEEAKKA 1704
Query: 553 STSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEIL 612
K + E +K + + E K ++ K+ + + K+ +
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIK------------AEEAKKEAEEDKKKAEEAKKD 1752
Query: 613 NEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLD-------DADESGNE 665
EE K L ++ + K I E++++ E+ + +D D + E
Sbjct: 1753 EEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIE 1812
Query: 666 LNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRP 725
++ D KE+ +S +E+ D L+ D NK + N K
Sbjct: 1813 GGKEGNLVINDSKEM----EDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEA 1868
Query: 726 NSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKN 785
+ + + + + +EE E + ++ E I + + ++ +E ++
Sbjct: 1869 DFNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLDKDEYIKRD 1928
Query: 786 VRKS 789
++
Sbjct: 1929 AEET 1932
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 37.3 bits (86), Expect = 0.050
Identities = 62/356 (17%), Positives = 122/356 (34%), Gaps = 36/356 (10%)
Query: 465 SETPKKVNL-----DKVTNSTHS---KENGRVNNFREESDRVNTAREQCSEEGKERKTNS 516
E KK NL +++ N+TH N R + +D V T + +E ++S
Sbjct: 67 KEEKKKKNLNLPDYNQIQNNTHDFYIDLNERSKTPIKNNDNVTTPIKANKKEKHNLDSSS 126
Query: 517 TSSFEVTPSKTGGES--TSNYQVKQTSNDEKYYEKKRVSTSKDEVN----------ENGR 564
+SS + + S TS Y S K+ K VN E
Sbjct: 127 SSSISSSLTNISFFSSPTSIYSCLSNSLSSKHSPKVIKENQSTHVNISSDNSPRNKEISN 186
Query: 565 KKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLST 624
K+ + T+ T + R P N ++N + D ++ + +++ + +
Sbjct: 187 KQLKKQTNVTHTTCYDKMRRSP-RNTSTIKNNTN-DKNKEKNKEKDKNIKKDRDGDKQTK 244
Query: 625 SNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQ 684
N E ++ + + RIL ++ + S L D + HQ
Sbjct: 245 RNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQ 304
Query: 685 TESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESS 744
+ N++ R + S+N ++ SS+ + R ++ + + K +
Sbjct: 305 ----KKAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTTT---- 356
Query: 745 SLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSS 800
K + + K +Q +P+ H K ++ S+ +G G
Sbjct: 357 ------KHTNNTKNNKYNKTKTTQKFNHPLRHHATINKRSSMLPMSEQKGRGASEK 406
>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
Provisional.
Length = 1466
Score = 36.9 bits (85), Expect = 0.065
Identities = 36/156 (23%), Positives = 55/156 (35%), Gaps = 17/156 (10%)
Query: 613 NEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTS 672
N E + ++PT+ K+N K DD + + N N +
Sbjct: 654 NRERGSTVDVDIIGEDPTKDNKENNNKN--------------NKDDNNNNNNNNNNKINN 699
Query: 673 EAMDLKEVAEHQTESENREIV--DKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSK 730
+ E H +N+ + N KS+N ++K S KSS K
Sbjct: 700 AGSYIIEQGTHDALMKNKNGIYYTMINNQKVSSKKSSNNDNDKDSDMKSSAYKDSERGYD 759
Query: 731 PNSSKYN-EESNESSSLVKSNKDSPTETSITSESDK 765
P+ N + NES+S KS K S S + K
Sbjct: 760 PDEMNGNSKHENESASNKKSCKMSDENASENNAGGK 795
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 36.2 bits (84), Expect = 0.094
Identities = 25/236 (10%), Positives = 63/236 (26%), Gaps = 24/236 (10%)
Query: 436 SLSETPKKSVILYKCPGEDFLVQPAVISLSETPK--KVNLDKVTNSTHSKENGRVNNFRE 493
E KK K + + E K +K +NS+ S + +
Sbjct: 1175 KKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTK 1234
Query: 494 ESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTSNDEKYYEKKRVS 553
+ + ++ SK G + N K + S
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEG----------KPKNAPKRVSAVQYS 1284
Query: 554 TSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILN 613
+G S+ + + L+ + ++ + K+ +
Sbjct: 1285 PPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQA 1344
Query: 614 EENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNED 669
+ R L RK+ + + ++ + + ++ ++ ++D
Sbjct: 1345 SASQSSRLLR------------RPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 35.4 bits (82), Expect = 0.14
Identities = 32/181 (17%), Positives = 60/181 (33%), Gaps = 8/181 (4%)
Query: 490 NFREESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTSNDEKYYEK 549
EE + + G+ + E K E+ + ++ K+ S++E+ E
Sbjct: 406 ESDEEENEEPSK----KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELED 461
Query: 550 KRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSN 609
+ + + N+ ++ + E E + V +S K DS+ K
Sbjct: 462 EEEAKVEKVANKLLKRSEKAQKEEEEEELDE-ENPWLKTTSSVGKSAKKQDSKKKSSSKL 520
Query: 610 EILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNED 669
+ + SK + EK L D ED I+ V +D D+
Sbjct: 521 DKAANKISKAAVKVKKKK---KKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFK 577
Query: 670 Q 670
Q
Sbjct: 578 Q 578
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 34.7 bits (80), Expect = 0.25
Identities = 16/85 (18%), Positives = 34/85 (40%), Gaps = 2/85 (2%)
Query: 569 ISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDE 628
+S T ++E ++ TS + + +K+ +S+ K + E+ K T
Sbjct: 34 LSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
Query: 629 --PTRTEKDNKRKRILEDIEDRIEE 651
P +++K K+ + ED
Sbjct: 94 KTPKKSKKTKKKPPKPKPNEDVDNA 118
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 34.7 bits (80), Expect = 0.31
Identities = 40/249 (16%), Positives = 85/249 (34%), Gaps = 19/249 (7%)
Query: 491 FREESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTS---NDEKYY 547
E + + + E +E + E+ E + ++ N+ +
Sbjct: 262 LEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEEL 321
Query: 548 EKKRVSTSKDEVNENGRKKRRISTSNEITNN-KECSERLPTSNLEVSQSNKDGDSETKRV 606
E++ + + R + E+ E E ++S ++ + + +
Sbjct: 322 EERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEAL 381
Query: 607 PSN---------EILNEENSKKRRLSTSNDEPTRT-EKDNKRKRILEDIEDRIEEVVLGL 656
EI NE KR + + + R E+ K L+++E +EE+ L
Sbjct: 382 REELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTEL 441
Query: 657 DDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSS 716
++ +E EL E LKE E E E+ ++ + + LD +
Sbjct: 442 EELNEELEELEEQLEELRDRLKE-----LERELAELQEELQRLEKELSSLEARLDRLEAE 496
Query: 717 NKSSEAKRP 725
++S+ R
Sbjct: 497 QRASQGVRA 505
Score = 30.5 bits (69), Expect = 5.2
Identities = 36/273 (13%), Positives = 89/273 (32%), Gaps = 19/273 (6%)
Query: 525 SKTGGESTSNYQVKQTSNDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSER 584
S TGG + Q ++ E+ ++ + ++ + + + + + R
Sbjct: 651 SITGGSRNKRSSLAQKRELKELEEE--LAELEAQLEKLEEELKSLKNELRSLEDLLEELR 708
Query: 585 LPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILED 644
LE + ++ + + L +E LE+
Sbjct: 709 RQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEE-------------LEE 755
Query: 645 IEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESEN--REIVDKCTNVLDR 702
+++R+EE+ L+ +E+ +L E+ + + E E E E + +
Sbjct: 756 LQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815
Query: 703 VDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESS--SLVKSNKDSPTETSIT 760
++ + + E + K + + E E L + ++ E
Sbjct: 816 LESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEEL 875
Query: 761 SESDKESQDEGNPIVHKPEETKDKNVRKSSDIE 793
+ KE ++E + + E + + +IE
Sbjct: 876 EDELKELEEEKEELEEELRELESELAELKEEIE 908
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 34.0 bits (78), Expect = 0.35
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 7/81 (8%)
Query: 710 LDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNE-------SSSLVKSNKDSPTETSITSE 762
LD S K+ K+P +S+P + + ++S++ SS + E++ +S
Sbjct: 38 LDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSH 97
Query: 763 SDKESQDEGNPIVHKPEETKD 783
++ + +PEET D
Sbjct: 98 PKSKAPSTESEEEEEPEETPD 118
Score = 30.1 bits (68), Expect = 5.6
Identities = 17/70 (24%), Positives = 30/70 (42%), Gaps = 6/70 (8%)
Query: 705 KSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESD 764
N S+K +E+ +S KP + + ES SS +P+ SE +
Sbjct: 57 SRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSH---PKSKAPSTE---SEEE 110
Query: 765 KESQDEGNPI 774
+E ++ +PI
Sbjct: 111 EEPEETPDPI 120
>gnl|CDD|109608 pfam00558, Vpu, Vpu protein. The Vpu protein contains an
N-terminal transmembrane spanning region and a
C-terminal cytoplasmic region. The HIV-1 Vpu protein
stimulates virus production by enhancing the release of
viral particles from infected cells. The VPU protein
binds specifically to CD4.
Length = 81
Score = 31.2 bits (71), Expect = 0.47
Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 6/47 (12%)
Query: 631 RTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDL 677
           R  +  K++R +  +  RI E       A++SGNE N D+  E  DL
Sbjct: 28  REYRKIKKQREILRLIKRIRER------AEDSGNESNGDEEEELADL 68
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 33.5 bits (77), Expect = 0.73
Identities = 29/133 (21%), Positives = 51/133 (38%), Gaps = 19/133 (14%)
Query: 580 ECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKRK 639
E +RL + N++ +S + E EE ++ S E R +++R+
Sbjct: 325 ELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRRE 384
Query: 640 RILEDIEDRIEEVVLGLDDAD--------------ESGNELNEDQTSEAMDLKEVAEHQT 685
I E++E+ IEE+ DA E +EL E + L+ E
Sbjct: 385 EI-EELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARE--R 441
Query: 686 ESENREIVD--KC 696
E +++ KC
Sbjct: 442 VEEAEALLEAGKC 454
>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB. Isd proteins are
iron-regulated surface proteins found in Bacillus,
Staphylococcus and Listeria species and are responsible
for heme scavenging from hemoproteins. The IsdB protein
is only observed in Staphylococcus and consists of an
N-terminal hydrophobic signal sequence, a pair of tandem
NEAT (NEAr Transporter, pfam05031) domains which confers
the ability to bind heme and a C-terminal sortase
processing signal which targets the protein to the cell
wall. IsdB is believed to make a direct contact with
methemoglobin facilitating transfer of heme to IsdB. The
heme is then transferred to other cell wall-bound NEAT
domain proteins such as IsdA and IsdC.
Length = 644
Score = 33.4 bits (75), Expect = 0.77
Identities = 41/175 (23%), Positives = 71/175 (40%), Gaps = 10/175 (5%)
Query: 692 IVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNK 751
IVDK DK+ + N + + P + SKP + +ES + S NK
Sbjct: 447 IVDKEAFTKANADKTNK---KEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNK 503
Query: 752 DSPT---ETSITSESDKESQDEGNPI---VHKPEETKDKNVRKSSDIEGTGQKSSDLQDT 805
SP+ E +SES K+ P V T K V + ++ SS+
Sbjct: 504 QSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKD 563
Query: 806 AQKSSDLEGTARKSSDLEGTARKSSDLEGTARKSNDLEQSAQKSSDLEGTAQKSS 860
++S A+ S+ L+ K+++ +G + N+ K+ L T ++S+
Sbjct: 564 VVQTSAGSSEAKDSAPLQKANIKNTN-DGHTQSQNNKNTQENKAKSLPQTGEESN 617
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 33.3 bits (76), Expect = 0.79
Identities = 32/210 (15%), Positives = 73/210 (34%), Gaps = 15/210 (7%)
Query: 590 LEVSQSNKDGDSETKRVPSNEILNEE---NSKKRRLSTSNDEPTRTEKDNKRKRILEDIE 646
L+ + + +G S S + +E N + + + N+RK + +D +
Sbjct: 34 LQSTVEDIEGPSAVDERTSGVLRDEGKHANILYNSILCNQKK--HASFLNQRKSLDDDDD 91
Query: 647 DRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQ----TESENREIVDKCTNVLDR 702
D + L DD D++GN + + +++ L E+ + T++ N + D +
Sbjct: 92 DEFDF--LYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSN 149
Query: 703 VDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSE 762
+ + S+ + ++ YN ++ + K S +
Sbjct: 150 SQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLN 209
Query: 763 SDKESQDEGNPIVHKPEETKDKNVRKSSDI 792
D+G+P +DK +
Sbjct: 210 DSDGQGDDGDP----ESAEEDKAASNTRAA 235
>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
transduction mechanisms].
Length = 612
Score = 33.0 bits (75), Expect = 0.81
Identities = 34/234 (14%), Positives = 63/234 (26%), Gaps = 22/234 (9%)
Query: 531 STSNYQVKQTS--NDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLPTS 588
+S + Q S N +Y +VS ++ ++ E
Sbjct: 32 DSSLCGLNQASDANGNEYSPNNKVSKKDTFSDQL--------------HDALSKEFTLER 77
Query: 589 NLEVSQSNKDGDSETKRVPSN---EILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDI 645
+ Q NK + S EI + + N +
Sbjct: 78 ERDRLQLNKRKYQAIRLQTSTPIVEIFKNNKDAVDPPNHTRSSGNNLSNANVKTLSAPVG 137
Query: 646 EDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDK 705
E LD ++ E + Q E ++ + + + D
Sbjct: 138 EHSRSNNPPNLDQNLDTEPESSISQWGELQLNPSGKTLSSQPSRKPTSENPKSESDNSKL 197
Query: 706 STNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSI 759
T+ N +KS KR S+ + YN + S + S ++ I
Sbjct: 198 PTS--VNSPLPDKSL-LKRTLSNFWAERNSYNWKPLVYPSCPSEHIFSDSDVII 248
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 32.9 bits (75), Expect = 1.0
Identities = 35/166 (21%), Positives = 66/166 (39%), Gaps = 16/166 (9%)
Query: 568 RISTSNEIT---NNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLST 624
IS S+E T N + L S+ ++ + DS + N+ L+E+ KK +
Sbjct: 6 NISISSENTIDLKNPLFIKNLKHSSYKIRLESDIKDS-LLNLDINKKLHEKLDKKNKKFN 64
Query: 625 SNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQ 684
D+ ++K +++ + I++ D+ + + A+ L + +
Sbjct: 65 KTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGP---LAISLMRKPKPK 121
Query: 685 TESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSK 730
TE ++I V+KSTN K S+K K N+ K
Sbjct: 122 TEKLKKKIT---------VNKSTNKKKKKVLSSKDELIKYDNNKPK 158
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 32.7 bits (74), Expect = 1.3
Identities = 39/252 (15%), Positives = 88/252 (34%), Gaps = 11/252 (4%)
Query: 555 SKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNE 614
+ E +E + + + +N + + + + + + + N+DG E V NE E
Sbjct: 4023 ADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDD-EKMNEDGFEE--NVQENEESTE 4079
Query: 615 ENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEA 674
+ K + P DN K D + ++ D+ NE+ E
Sbjct: 4080 DGVKSDEELEQGEVPEDQAIDNHPK---MDAKSTFASAEADEENTDKGIVGENEE-LGEE 4135
Query: 675 MDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKS-SEAKRPNSSSKPNS 733
++ E E ++ + + + + ++ + G + +A R +
Sbjct: 4136 DGVRGNGTADGEFE--QVQEDTSTPKEAMSEADRQYQSLGDHLREWQQANRIHEWEDLTE 4193
Query: 734 SKYNEESNESSSLVKSNKDSPTETSITSESDK-ESQDEGNPIVHKPEETKDKNVRKSSDI 792
S+ + VK +++ + +E D+ +S D P+ N+ +
Sbjct: 4194 SQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQNPDSMNSTNIAEDEAD 4253
Query: 793 EGTGQKSSDLQD 804
E ++ D QD
Sbjct: 4254 EVGDKQLQDGQD 4265
Score = 32.3 bits (73), Expect = 1.7
Identities = 46/232 (19%), Positives = 84/232 (36%), Gaps = 22/232 (9%)
Query: 646 EDRIEEVVLGLDDADES-GNELNEDQTSEAMD--LKEVAEHQTESENREIVDKCTNVLDR 702
E EE D DES EL D D + E +E EN+++ ++ V D
Sbjct: 3839 ELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE---VNDI 3895
Query: 703 VDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSE 762
+ +N L+ K + E S N ES+ S + + +
Sbjct: 3896 PEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKED 3955
Query: 763 SDKESQDEGNPIVHKPEETKD--KNVRKSSDI-----------EGTGQKSSDLQDTAQKS 809
++ S D G +P+ ++ + ++ D+ EG K SDL+D ++
Sbjct: 3956 EEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEA 4015
Query: 810 SDLEGTARKSSDLEGTARKSSDLEGTARKSNDLEQSAQKSSDLEGTAQKSSD 861
+D ++ +D E + + D + SDL +K ++
Sbjct: 4016 ADEN---KEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNE 4064
Score = 32.3 bits (73), Expect = 1.9
Identities = 50/298 (16%), Positives = 103/298 (34%), Gaps = 39/298 (13%)
Query: 493 EESDRVNTAREQCSEEGKERKTNSTSSFEVTPSKTGGESTSNYQVKQTSNDEKYYEKKRV 552
EES ++ E+G+ + + + +K+ S + + + D+ +
Sbjct: 4075 EESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASA---EADEENTDKGIVGENEE 4131
Query: 553 STSKDEVNENGRKKRRISTSNE-ITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEI 611
+D V NG E + KE Q GD + +N I
Sbjct: 4132 LGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSE------ADRQYQSLGDHLREWQQANRI 4185
Query: 612 LNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNE-LNEDQ 670
E + ++ D++ + ED E+ ++ + D +S + + +Q
Sbjct: 4186 HEWE--------DLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQ 4237
Query: 671 TSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSK 730
++M+ +A E E E+ DK ++ +I D K ++ E P
Sbjct: 4238 NPDSMNSTNIA----EDEADEVGDK------QLQDGQDISDIK----QTGEDTLPTEFGS 4283
Query: 731 PNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEE-TKDKNVR 787
N E S + +D + ++ + DE + +K E+ TK ++
Sbjct: 4284 IN----QSEKVFELSEDEDIEDELPDYNV-KITPAMPIDEARDLWNKHEDSTKQLSIE 4336
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 32.0 bits (73), Expect = 1.4
Identities = 37/176 (21%), Positives = 64/176 (36%), Gaps = 18/176 (10%)
Query: 597 KDGDSETKRVPSNEILNEENSKKRRLST-SNDEPTRTEKDNKRKRILEDIEDRIEEVVLG 655
D E + + E L E S++ + S E + + + E+ E I+ + G
Sbjct: 142 ADSKREMEVLERLEELKELQSRRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFG 201
Query: 656 LDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGS 715
+T E + + + + E+ + + K T+IL K +
Sbjct: 202 P-------------ETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKPTSILK-KSA 247
Query: 716 SNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEG 771
+ +S S +K NS + + SSLV K +P TS S S E E
Sbjct: 248 AKRSEAPSS--SKAKKNSRGIPKPRDALSSLVVRKKAAPESTS-QSPSSAEPTSES 300
>gnl|CDD|236195 PRK08238, PRK08238, hypothetical protein; Validated.
Length = 479
Score = 32.2 bits (74), Expect = 1.4
Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 8/45 (17%)
Query: 145 RKSSRPISTGPSSLSLGFGAAAF---AGFVLA-----GFSGLVLV 181
           RK  RP ++G   +  G  AA     AG  LA      F  ++L
Sbjct: 259 RKRRRPFASGALPIPFGLAAAPLLLLAGLALALALGPAFLLVLLA 303
>gnl|CDD|178752 PLN03213, PLN03213, repressor of silencing 3; Provisional.
Length = 759
Score = 32.1 bits (72), Expect = 1.5
Identities = 65/357 (18%), Positives = 130/357 (36%), Gaps = 54/357 (15%)
Query: 469 KKVNLDKVTNSTHSKENGRVNNFREESDRVNT------AREQCSEEGKERKTNSTSSFEV 522
+K L++ T S GR N S + T R++ S+ +E+K S EV
Sbjct: 245 RKSILNEKTPSEEGYSEGRKGNHIHPSKKRQTISLEESGRQESSQAIREKKKPS----EV 300
Query: 523 TPSKTGGESTSNYQVKQTSNDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECS 582
P K+ E + ++Q S D ++ + + + +N +N+ S
Sbjct: 301 VPDKSLDEPSRTKDLEQ---------------SIDNISWSQKSSWKSLMANGNSNDFSVS 345
Query: 583 ERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKKRRLST-------SNDEPTRTEKD 635
LP + +++ +PS E L E+ +KR ST D+ R + D
Sbjct: 346 SFLPGVGSSKAVQPAPRNTDLAELPSRENLKEKTKRKRVTSTIMAEDLPVPDDIKRDDSD 405
Query: 636 NKRKRILEDIEDRIEEVVLGLDDADESGNE-------------------LNEDQTSEAMD 676
I D +E AD++ N+ + +D ++M
Sbjct: 406 TMADDIERDDSAAVEYYTACESMADDAANDSVAERDDSDAVECDTAIDSMADDTAIDSMA 465
Query: 677 LKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKY 736
++ ES++ + V+ T + D + + + + E S +++
Sbjct: 466 DDAASDAVAESDDGDAVENDTAIDSMADDTASNSMAESDDGDNVEDDTAIDSMADDTAND 525
Query: 737 NEESNESSSLVKSNKDSPTET---SITSESDKESQDEGNPIVHKPEETKDKNVRKSS 790
+ S++S SL + D+ + + ++ +S D + + +D N K S
Sbjct: 526 DVGSDDSESLADTVIDTSVDAVPLDFVANTEGDSGDGKSNVEKHENGAEDLNAEKES 582
>gnl|CDD|215386 PLN02727, PLN02727, NAD kinase.
Length = 986
Score = 32.2 bits (73), Expect = 1.8
Identities = 22/103 (21%), Positives = 38/103 (36%), Gaps = 2/103 (1%)
Query: 708 NILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKES 767
+ +D+ S ++ E R N +S +S+E + SN S S S S S
Sbjct: 510 SKIDDTDSISRLVETGRSNGLVSEKNSSPKYQSSEFDNGKSSNGSS--FASDGSLSVASS 567
Query: 768 QDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSS 810
GNP + + + + S + ++S A S
Sbjct: 568 ITNGNPSNNGASSSTVSDNLERSVASVSVRESQRSNGKASLGS 610
>gnl|CDD|227452 COG5123, TOA2, Transcription initiation factor IIA, gamma subunit
[Transcription].
Length = 113
Score = 29.9 bits (67), Expect = 1.9
Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 4/74 (5%)
Query: 222 KKIFET----VAPDLKTNDSNQATYKGKVWTVENVEGFVTSQRVNPLNNFSENLECGRDK 277
+ ET V L + ++ T+KGK+ T +G T N + + + +
Sbjct: 36 MHVLETFDKSVPEVLNKSVTSSLTFKGKLDTYNFCDGVWTFITKNFVMSLNNEYFNAGEV 95
Query: 278 SQVSQKIDIVADKS 291
+ V+ K IVA
Sbjct: 96 NIVACKARIVACAG 109
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 32.0 bits (72), Expect = 2.1
Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 4/87 (4%)
Query: 703 VDKSTNILDNKGSSNKSSEAKRPNSSSKPNS---SKYNEESNESSSLVKSNKDSPTETSI 759
K N+G SS + +S+K S SK+ E ++ S S K S + I
Sbjct: 129 AGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGKASASVAGI 188
Query: 760 TSESDKESQDEGNPIVHKPEETKDKNV 786
+D+E+ + EE + NV
Sbjct: 189 VG-ADEEAPPAPKNTLTPLEELYETNV 214
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 31.4 bits (71), Expect = 2.9
Identities = 31/149 (20%), Positives = 55/149 (36%), Gaps = 6/149 (4%)
Query: 715 SSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPI 774
S N A N +SS S+ S S S DS +E+S + + E +P
Sbjct: 379 SRNTPPSAPSSNPEPAASSS--GSSSSSSGSESSSGSDSESESSSSDSEENEPPRTASPE 436
Query: 775 VHKPEETK---DKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSDLEGTARKSSD 831
P K D + K + + + +S ++ + EG + S K
Sbjct: 437 PEPPSTNKWQLDNWLNKVNPHKVSPAESVSSNPPIKQPMEKEGKVKSSGSQYHPESKEPP 496
Query: 832 LEGTARKSNDLEQSAQKSSDLEGTAQKSS 860
+ ++++ ++AQK + QKS
Sbjct: 497 PKSSSKEKRRP-RTAQKGPESGRGKQKSP 524
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 31.6 bits (72), Expect = 3.0
Identities = 33/255 (12%), Positives = 84/255 (32%), Gaps = 22/255 (8%)
Query: 540 TSNDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDG 599
K Y + ++ + K+ + E +E L ++ K
Sbjct: 805 LLGSRKEY-RSYLACIIK--LQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRF 861
Query: 600 DSETKRVPSNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDA 659
K + +R+L + D K L+ + +E ++ L
Sbjct: 862 SLLKKETIYLQSAQRVELAERQLQ-------ELKIDVKSISSLKLVNLELESEIIEL-KK 913
Query: 660 DESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKS 719
S + + + + +A + N ++ + + ++ + + + + ++
Sbjct: 914 SLSSDLIENLE----FKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKET 969
Query: 720 SEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETS----ITSESDKESQDEGNPIV 775
SE + + K S+ E N+++S +K+ K E S ES K+ ++ +
Sbjct: 970 SE-EYEDLLKK--STILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVA 1026
Query: 776 HKPEETKDKNVRKSS 790
+K + +
Sbjct: 1027 ELQSASKIISSESTE 1041
>gnl|CDD|153001 pfam12567, CD45, Leukocyte receptor CD45. This family of proteins
is found in eukaryotes. Proteins in this family are
typically between 77 and 1130 amino acids in length. The
family is found in association with pfam00041. CD45
plays a critical role in T-cell receptor (TCR)-mediated
signaling. CD45 interacts with SKAP55 which is a
transcriptional activator of IL-2.
Length = 61
Score = 28.3 bits (63), Expect = 3.1
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 543 DEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSER 584
           D  Y +  ++ T+K  VNEN +      T+NE+ N  EC +R
Sbjct: 3   DYLYDKSTKLFTAKLNVNENVKCGNNDCTNNELHNLPECEQR 44
>gnl|CDD|211905 TIGR04170, RNR_1b_NrdE, ribonucleoside-diphosphate reductase, class
1b, alpha subunit. Members of this family are NrdE, the
alpha subunit of class 1b ribonucleotide reductase. This
form uses a dimanganese moiety associated with a
tyrosine radical to reduce the cellular requirement for
iron.
Length = 698
Score = 30.9 bits (70), Expect = 3.5
Identities = 14/30 (46%), Positives = 16/30 (53%)
Query: 632 TEKDNKRKRILEDIEDRIEEVVLGLDDADE 661
             K N  KR LE  EDR+  V L L + DE
Sbjct: 99  ALKTNDGKRYLERYEDRVCMVALTLANGDE 128
>gnl|CDD|185118 PRK15196, PRK15196, secreted effector protein PipB2; Provisional.
Length = 350
Score = 30.7 bits (69), Expect = 3.8
Identities = 32/155 (20%), Positives = 65/155 (41%), Gaps = 13/155 (8%)
Query: 788 KSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSDLEGTARKSSDLEGTARKSNDLEQSAQ 847
K + + G + +L D ++LEG +D EG K ++L GT+ ++ + +
Sbjct: 155 KGAVLTGANLTAENLCDADLSGANLEGAVLFMADCEGANFKGANLSGTSLGDSNFKNACL 214
Query: 848 KSSDLEGTAQKSSDLQGTAQKSSDLDGTAR-----KSSDLDRYNIGIEDFLFGSMSEEQT 902
+ S + G ++L G + + L G + +++D N+ + MS
Sbjct: 215 EDSIMCGATLDHANLTGANLQHASLLGCSMIECNCSGANMDHTNLSGATLIRADMSGATL 274
Query: 903 RMLKVYSKVLTLADLVRVLQVPINPFNQQKVSFLS 937
+ + + ++ A L R N +K SF+S
Sbjct: 275 QGATIMAAIMEGAVLTRA--------NLRKASFIS 301
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 30.5 bits (68), Expect = 5.1
Identities = 39/184 (21%), Positives = 79/184 (42%), Gaps = 12/184 (6%)
Query: 608 SNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELN 667
S I +E N + L N+ K++ I ED E+ +EE DA+E+ E
Sbjct: 914 SGNIAHEINLINKELKNQNENVPEHLKEHAEANIEEDAEENVEE------DAEENVEENV 967
Query: 668 EDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKST--NILDNKGSSNKSSEAKRP 725
E+ E +++E E E E V++ NV + V+++ N+ +N + + + +
Sbjct: 968 EENVEE--NVEENVEENVEENVEENVEE--NVEENVEENIEENVEENVEENIEENVEEYD 1023
Query: 726 NSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKN 785
+ + E E+ ++ N + E +I ++ ++ I EE ++N
Sbjct: 1024 EENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEEN 1083
Query: 786 VRKS 789
V ++
Sbjct: 1084 VEEN 1087
>gnl|CDD|236179 PRK08188, PRK08188, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 714
Score = 30.4 bits (69), Expect = 5.5
Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 3/38 (7%)
Query: 639 KRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMD 676
           KR LE  EDR+  V L L   DE    L      E +
Sbjct: 120 KRYLERFEDRVCMVALTLAQGDE---TLARQLVDEIIS 154
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 30.0 bits (67), Expect = 5.9
Identities = 37/153 (24%), Positives = 63/153 (41%), Gaps = 20/153 (13%)
Query: 544 EKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLPTSNLEVS----QSNKDG 599
+ + ++ S+ E R+KR+ S S + C E L + + EV + + D
Sbjct: 104 PQMFGEEAPSSQDGEQPGPSRRKRQPSMSETMPLYTLCKEDLESMDKEVDDILGEGSDDS 163
Query: 600 DSETKRVPSNEILNEENSKKRRL----------------STSNDEPTRTEKDNKRKRILE 643
DSE K+ P NE E+ + R+ S ++ R + +KRK E
Sbjct: 164 DSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKRKLNEE 223
Query: 644 DIEDRIEEVVLGLDDADESGNELNEDQTSEAMD 676
D ED E + DE G+ D+ + A++
Sbjct: 224 DEEDAESESSFESSNEDEEGSSSEADEMAAALE 256
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.4 bits (68), Expect = 6.2
Identities = 35/162 (21%), Positives = 72/162 (44%), Gaps = 6/162 (3%)
Query: 712 NKGS-SNKSSEAKRPNSSSKPNSSKYNEE---SNESSSLVKSNKDSPTETSITSESDKES 767
++GS S S K+ +S +S NE+ S +S S + ++ T + +K+
Sbjct: 10 SRGSMSTLRSGRKKQTASPDGRASPTNEDQRSSGRNSPSAASTSSNDSKAESTKKPNKKI 69
Query: 768 QDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSSDLEGTAR 827
++E + + ++K + + E K S Q+ ++ +S EG + E +
Sbjct: 70 KEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDS 129
Query: 828 KSSDLEGTARKSNDLEQSAQKSS-DLEGTAQKSSDLQGTAQK 868
+S + EG++ D++Q + SS + SD +AQ+
Sbjct: 130 RSVNEEGSS-DPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQ 170
>gnl|CDD|185214 PRK15314, PRK15314, outer membrane protein RatB; Provisional.
Length = 2435
Score = 30.2 bits (67), Expect = 6.8
Identities = 21/86 (24%), Positives = 38/86 (44%), Gaps = 3/86 (3%)
Query: 531 STSNYQVKQTSNDEKYYEKKRVSTSKDEVNENGRKKRRISTSNEITNNKECSERLP---T 587
+T + V TS DE+Y+ + ST+ D ++ + R + + + C ++ P T
Sbjct: 408 TTLGWPVGLTSGDEEYWSSSQGSTATDHISIDMRSRALTQMPDATQSLVSCVDKAPPAVT 467
Query: 588 SNLEVSQSNKDGDSETKRVPSNEILN 613
L +S N D +V E +N
Sbjct: 468 PKLVISADNFDSTVNAAKVKVGEEIN 493
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 30.1 bits (67), Expect = 7.0
Identities = 45/261 (17%), Positives = 94/261 (36%), Gaps = 30/261 (11%)
Query: 558 EVNENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRV----------- 606
E++ GRK RR T I + L + +VS S+++ D +
Sbjct: 401 EIDNVGRKTRRQPTGKAIAEETSREDELSFDDSDVSTSDENEDVDFTGKKGAINNEDESD 460
Query: 607 ------PSNEILNEENSKKRRLSTSNDEPTRTEKDNKRKRILEDI-------EDRIEEVV 653
S+ +E R + ++ + + I + E+ IEE
Sbjct: 461 NEEVAFDSDSQFDESEGNLRWKEGLASKLAYSQSGKRGRNIQKIFYDESLSPEECIEE-Y 519
Query: 654 LGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTNVLDRVDKSTNILDNK 713
G + + +D+ + D+ +VA S + ++++ L + S L ++
Sbjct: 520 KGESAKSSESDLVVQDEPEDFFDVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKSR 579
Query: 714 GSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETSITSESDKESQDEGNP 773
+ + ++ + K N E E N+ + S + ++ES DE +
Sbjct: 580 FQKDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVD- 638
Query: 774 IVHKPEETKDKNVRKSSDIEG 794
E +++N RK ++ G
Sbjct: 639 ----YETEREENARKKEELRG 655
>gnl|CDD|227554 COG5229, LOC7, Chromosome condensation complex Condensin, subunit H
[Chromatin structure and dynamics / Cell division and
chromosome partitioning].
Length = 662
Score = 30.2 bits (67), Expect = 7.2
Identities = 52/284 (18%), Positives = 95/284 (33%), Gaps = 7/284 (2%)
Query: 560 NENGRKKRRISTSNEITNNKECSERLPTSNLEVSQSNKDGDSETKRVPSNEILNEENSKK 619
+E G K ++T N + + P + E K E + + + ++++
Sbjct: 147 DEGGAKSLLLNTLNIDNTKRLYFDSSPIKDTENVGQGKLQRKEEELIERDSLVDDLMIDS 206
Query: 620 RRLSTSNDEPTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQ-TSEAMDLK 678
+ L S+++ T N +ED I LG+D + + T D K
Sbjct: 207 QSLGISSNDST----VNDSVISAPSMEDEILG--LGMDFIKVLRMSVCRIEGTVIVDDFK 260
Query: 679 EVAEHQTESENREIVDKCTNVLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNE 738
+ + + + D V++ DN+ S + + ++ + NE
Sbjct: 261 MSMDFPAGLLISPSISEDKEIRDIVEEPAVDNDNEVSDSDGFDMGEHAGLFSGDNFELNE 320
Query: 739 ESNESSSLVKSNKDSPTETSITSESDKESQDEGNPIVHKPEETKDKNVRKSSDIEGTGQK 798
SS+ S S E + D H + K K V K SD+ +
Sbjct: 321 ILPPSSNQNPSTTGSIFEKDVYKYFDFSYFKNWAGPEHWKVQAKKKRVNKESDLLEETRT 380
Query: 799 SSDLQDTAQKSSDLEGTARKSSDLEGTARKSSDLEGTARKSNDL 842
+ T KS D G ++ ++ D EG N L
Sbjct: 381 TITTDKTDDKSMDTSGKQKQKEASFIDFTETVDYEGMLEPGNTL 424
>gnl|CDD|178320 PLN02718, PLN02718, Probable galacturonosyltransferase.
Length = 603
Score = 29.9 bits (67), Expect = 7.2
Identities = 35/154 (22%), Positives = 58/154 (37%), Gaps = 20/154 (12%)
Query: 726 NSSSKPNSSKYNEESNESSS-------LVKSNKDSPTETSITSESDKESQDEGNPIVHK- 777
NS NSS N +S ES + +T+++S + + + HK
Sbjct: 83 NSVVSYNSSDKNNDSLESEVDGGNNHKPKEEQAQVSQKTTVSSSEEVQISARDIQLNHKT 142
Query: 778 ----PEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARKSS------DLEGTAR 827
P +KN R T +K +++D ++ A S +L +
Sbjct: 143 QFNPPTVKHEKNTRVQPRRA-TDEKVKEIRDKIIQAKAYLNLAPPGSNSQLVKELRLRTK 201
Query: 828 KSSDLEGTARKSNDLEQSA-QKSSDLEGTAQKSS 860
+ G A K DL +SA Q+ +E T K+S
Sbjct: 202 ELERAVGDATKDKDLSKSALQRMKSMEVTLYKAS 235
>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
Length = 188
Score = 29.2 bits (66), Expect = 7.9
Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 16/82 (19%)
Query: 591 EVSQSNKDGDSETKRVPSN------EILNE---------ENSKKRRLSTSNDEPTRTEKD 635
S+ + D E + + + EIL + E +++ LS++ E R E+
Sbjct: 17 RASEIRAEADEEAEEIIAEAEADAEEILEDREAEAEREIEQLREQELSSAKLEAKR-ERL 75
Query: 636 NKRKRILEDIEDRIEEVVLGLD 657
N RK +LED+ +++E+ + LD
Sbjct: 76 NARKEVLEDVRNQVEDEIASLD 97
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 29.9 bits (67), Expect = 8.1
Identities = 46/274 (16%), Positives = 89/274 (32%), Gaps = 19/274 (6%)
Query: 580 ECSERLPTSNLEVSQSNKDGDSETKRVPSNE-ILNEENSKKRRLSTSNDEPTRTEKDNKR 638
+++ S K DS E N E+ RL K
Sbjct: 392 AMLPPESSADGSKVSSPKCSDSSVALTGPVEHEDNLESKLSDRLPEVLQSVL---KLVME 448
Query: 639 KRILEDIEDRIEEVVLGLDDADESGNELNEDQTSEAMDLKEVAEHQTESENREIVDKCTN 698
K I+ +I + +E++ L D N D +E EVA + E V + +
Sbjct: 449 KHIISEISEILEDIEAEL---DSLERSSNGDDENE-----EVAMVGSLVEESSAVIERSQ 500
Query: 699 VLDRVDKSTNILDNKGSSNKSSEAKRPNSSSKPNSSKYNEESNESSSLVKSNKDSPTETS 758
L + I + + K + K+ +E + + V + E S
Sbjct: 501 EL--EGAVSRISEFTSVLEHEVTVCQDLLDGKADLEKFIQEFSLTLEWVVNQ-----EKS 553
Query: 759 ITSESDKESQDEGNPIVHKPEETKDKNVRKSSDIEGTGQKSSDLQDTAQKSSDLEGTARK 818
+ S + S+ + N + K E + + + ++K + ++
Sbjct: 554 LQDVSVEASEIKKNFLGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKENLDGELSKC 613
Query: 819 SSDLEGTARKSSDLEGTARKSNDLEQSAQKSSDL 852
DLE + K + E + ++Q+S+ L
Sbjct: 614 KDDLEESKNKLQETEKKLEELKSELDASQESNSL 647
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 28.4 bits (64), Expect = 8.3
Identities = 13/45 (28%), Positives = 25/45 (55%)
Query: 629 PTRTEKDNKRKRILEDIEDRIEEVVLGLDDADESGNELNEDQTSE 673
P + D K+K+ ++ +D I++ +L DD DE + +D+ E
Sbjct: 82 PLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEE 126
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.306 0.124 0.337
Gapped
Lambda K H
0.267 0.0742 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 44,294,553
Number of extensions: 4177453
Number of successful extensions: 3525
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3377
Number of HSP's successfully gapped: 173
Length of query: 947
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 841
Effective length of database: 6,236,078
Effective search space: 5244541598
Effective search space used: 5244541598
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 64 (28.4 bits)