RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10178
(655 letters)
>gnl|CDD|152466 pfam12031, DUF3518, Domain of unknown function (DUF3518). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is about 260
amino acids in length. This domain is found associated
with pfam01388.
Length = 257
Score = 361 bits (929), Expect = e-122
Identities = 136/264 (51%), Positives = 178/264 (67%), Gaps = 10/264 (3%)
Query: 212 DNIARRCIALSNILRNLTFVPGNELEFSKNSTFLRLLGRLILAYHEHGLRATKTRNYDKD 271
D++A+RCI +SNI+RNL+FVPGN++E S++ L +LG+L+L +HEH R R Y ++
Sbjct: 1 DSLAKRCICVSNIVRNLSFVPGNDIEMSRHPGLLLILGKLLLLHHEHPERKPAPRTYQRE 60
Query: 272 DD-DWTDCCSSLSPQAESEWWWDYMENIRENVLVSLANISGHIDLDQYPEDISRPILDGL 330
++ D CS + EWWWD +E +REN LV+LANISG +DL YPE I PILDGL
Sbjct: 61 EEEDDGLSCS------KDEWWWDCLEVLRENTLVTLANISGQLDLSFYPESICLPILDGL 114
Query: 331 LHWAVCPAAQGQDPFPSSSCN--LSPQRLALEALCKLCVTDNNVDLVIATPPYARLERLT 388
LHW VCP+A+ QDPFP++ N LSPQRL LEALCKL + DNNVDL++ATPP++RLE+L
Sbjct: 115 LHWMVCPSAEAQDPFPTAGPNSVLSPQRLVLEALCKLSIQDNNVDLILATPPFSRLEKLF 174
Query: 389 YVLTRFLCKNEDQVLREFGVNLLHYLAAADSAMSRTIALQSPCISLLVAFIEQAEQNALG 448
L R L ++QV RE V LL LA DS +R IALQ CI LV F+E A A
Sbjct: 175 ATLVRHLGDRKNQVCREMAVVLLSNLAQGDSTAARAIALQKGCIGNLVGFLEDAVAMAQY 234
Query: 449 VANQHGIGALRDNPDSMGTSLDML 472
+QH + +PD TS+DM+
Sbjct: 235 QQSQHSL-LHMGSPDFEPTSVDMM 257
Score = 31.7 bits (72), Expect = 1.1
Identities = 9/31 (29%), Positives = 17/31 (54%), Gaps = 1/31 (3%)
Query: 566 SLDMLRRVANQHGIGALRDNPDSMGTSLDML 596
++ M + +QH + +PD TS+DM+
Sbjct: 228 AVAMAQYQQSQHSL-LHMGSPDFEPTSVDMM 257
Score = 30.2 bits (68), Expect = 3.4
Identities = 8/24 (33%), Positives = 13/24 (54%), Gaps = 1/24 (4%)
Query: 547 VANQHGIGALRDNPDSMGTSLDML 570
+QH + +PD TS+DM+
Sbjct: 235 QQSQHSL-LHMGSPDFEPTSVDMM 257
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 38.4 bits (89), Expect = 0.004
Identities = 20/58 (34%), Positives = 23/58 (39%)
Query: 8 PPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMNVGP 65
PP+P +G P + PPPP PP P PG PPP P M P
Sbjct: 88 PPLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPP 145
Score = 32.2 bits (73), Expect = 0.41
Identities = 23/58 (39%), Positives = 27/58 (46%), Gaps = 1/58 (1%)
Query: 8 PPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMNVGP 65
PP+P A+G + P + G PPPP L G PP P PG PPP P P
Sbjct: 76 PPLPGAAG-IPPPPPLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFPGAP 132
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This
family contains bacterial fibronectin-attachment
proteins (FAP). Family members are rich in alanine and
proline, are approximately 300 long, and seem to be
restricted to mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 36.8 bits (85), Expect = 0.030
Identities = 18/57 (31%), Positives = 23/57 (40%)
Query: 3 ATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGP 59
A A PP P ++ P + P P + Q + N PP P P PPP P
Sbjct: 39 ADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPP 95
Score = 33.7 bits (77), Expect = 0.26
Identities = 22/106 (20%), Positives = 26/106 (24%), Gaps = 18/106 (16%)
Query: 2 AATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDM 61
A PP P S A PPP P P PPP D
Sbjct: 37 ANADPAPPPPPPSTAAAAPAPA------APPPPPPPAAP--PAPQPDDPNAAPPPPPADP 88
Query: 62 NVGPKVKCISYYDETAPSKPNTIVNSDVHVVKSEDKANGMTTTKPE 107
N P P PN + ++ G + P
Sbjct: 89 NAPPP----------PPVDPNAPPPPAPEPGRIDNAVGGFSYVVPA 124
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 36.5 bits (84), Expect = 0.042
Identities = 18/70 (25%), Positives = 21/70 (30%), Gaps = 12/70 (17%)
Query: 6 AEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGP------------PTPG 53
PP P P PG P P + G + GP P PG
Sbjct: 265 EPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPG 324
Query: 54 GPPPGPDMNV 63
P P PD +
Sbjct: 325 PPRPAPDADR 334
Score = 32.3 bits (73), Expect = 0.75
Identities = 16/65 (24%), Positives = 24/65 (36%), Gaps = 6/65 (9%)
Query: 2 AATQAEPPVP-----EASGPVDPGASVDMEHGGPPPPQ-SLQNGEKNNKPPSGPPTPGGP 55
+ A P GPV+ GA G P+ + ++G +P GPP P
Sbjct: 272 GGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGPPRPAPD 331
Query: 56 PPGPD 60
P+
Sbjct: 332 ADRPE 336
Score = 30.0 bits (67), Expect = 4.1
Identities = 20/74 (27%), Positives = 26/74 (35%), Gaps = 7/74 (9%)
Query: 8 PPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMNVGPKV 67
PP E S +P + GG P P E KP GP G PG + +
Sbjct: 255 PPYFEESKGYEPPPAPS---GGSPAPPGDDRPEA--KPEPGPVEDG--APGRETGGEGEG 307
Query: 68 KCISYYDETAPSKP 81
+ D A +P
Sbjct: 308 PEPAGRDGAAGGEP 321
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 35.3 bits (81), Expect = 0.10
Identities = 34/152 (22%), Positives = 47/152 (30%), Gaps = 18/152 (11%)
Query: 20 GASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMNVGPKVKCISYYDETAPS 79
G+ ++ P L + + P G P PD +V VK E A
Sbjct: 287 GSEYVVDEALEREPAELAAAAVASAASAVGPVGPGEPNQPD-DVAEAVK-----AEVAEV 340
Query: 80 KPNTIVNSDVHVVKSEDKANGMTTTKPEETS-VKIENNKEGESTVKKENTEGEETVKTDN 138
S V V +G +T EETS IE + G+ G+
Sbjct: 341 TDEVAAESVVQV----ADRDGESTPAVEETSEADIEREQPGD-------LAGQAPAAHQV 389
Query: 139 TKEGEATVKLENGGEEPSATDKAEKETTEKPE 170
E + E A D+ E E EK
Sbjct: 390 DAEAASAAPEEPAALASEAHDETEPEVPEKAA 421
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 35.2 bits (81), Expect = 0.12
Identities = 21/67 (31%), Positives = 24/67 (35%), Gaps = 4/67 (5%)
Query: 2 AATQAEPPVPEASGPVDPGAS--VDMEHGGPPPPQSLQNGEK-NNKPPSGPPTPGGPPPG 58
AA Q P + P+ A E P PP Q GE + PS P P P
Sbjct: 50 AAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPA- 108
Query: 59 PDMNVGP 65
P V P
Sbjct: 109 PAEPVEP 115
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 34.8 bits (80), Expect = 0.14
Identities = 34/188 (18%), Positives = 63/188 (33%), Gaps = 15/188 (7%)
Query: 28 GGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPD-MNVGPKVKCISYYDETAPSKPNTIVN 86
G PP + +G PP V K + + T
Sbjct: 140 GVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGK 199
Query: 87 SDVHVVKSE----DKANGMTTTKPEETSVKIENNKEGESTVKKENTEGEETVKTDNTKEG 142
+ V + K+N M++ ++T K E + ESTVK+E+ E E
Sbjct: 200 TSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDES 259
Query: 143 EATVKLENGGEEPSATDKAEKETTEKPEVKFVPKIIDPAGNAKRRRLSEYEDESYCRDEA 202
L+ +E E+ +E+ + + KR+RL + ++ ++
Sbjct: 260 AEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKE--------KRKRLKKMMEDE--DEDE 309
Query: 203 SLYPVSDS 210
+ V +S
Sbjct: 310 EMEIVPES 317
Score = 33.3 bits (76), Expect = 0.46
Identities = 42/253 (16%), Positives = 72/253 (28%), Gaps = 57/253 (22%)
Query: 8 PPVPEASGPVDPGASVDMEHGGPPPP--------------QSLQNG---EKNNKPPSGPP 50
P P AS + P A+ PP Q EK S
Sbjct: 145 PVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKA 204
Query: 51 TPG--GPPPGPD-MNVGPKVKCISYYDETAPSKPNTIVNSDVH-----VVKSEDKANGMT 102
PP + M+ K K ++ S+ S+ V+ ++ A
Sbjct: 205 ASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTG 264
Query: 103 TTKPEETSVKIENNK------EGESTVKKENTEGEETVKTDNTKEGEATVKLENGGEEPS 156
+ E+ + + E E K++ ++ ++ ++ E V E P
Sbjct: 265 LDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIV-----PESPV 319
Query: 157 ATDKAEKETTEKPEVKFVPK--IIDPAGNAKRRR-------------------LSEYEDE 195
+++E+ K K + +RR YE E
Sbjct: 320 EEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWE 379
Query: 196 SYCRDEASLYPVS 208
S+ DEA P
Sbjct: 380 SFSEDEAEPPPTK 392
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 33.9 bits (78), Expect = 0.16
Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 1/90 (1%)
Query: 93 KSEDKANGMTTTKPEETSVKIENNKEGESTVKKENTEGEETVKTDNTKEGEATVKLENGG 152
D+A + + +E + +KE E+ + ++ + + EN
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEE-EKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 153 EEPSATDKAEKETTEKPEVKFVPKIIDPAG 182
E+ ++D+ EKET EK E +I +P+
Sbjct: 98 EDEESSDENEKETEEKTESNVEKEITNPSW 127
Score = 33.9 bits (78), Expect = 0.19
Identities = 15/83 (18%), Positives = 30/83 (36%), Gaps = 5/83 (6%)
Query: 93 KSEDKANGMTTTKPEETSVKIENNKEGESTVKKENTEGEETVKTDNTKEGEATVKLENGG 152
+ + + K EE +KE + +KE+ E EE + ++ + + K
Sbjct: 54 SDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEK 113
Query: 153 EEPSATDKAEKETTEKPEVKFVP 175
E + ++ P K V
Sbjct: 114 TESN-----VEKEITNPSWKPVG 131
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 33.3 bits (76), Expect = 0.19
Identities = 17/56 (30%), Positives = 20/56 (35%), Gaps = 4/56 (7%)
Query: 4 TQAEPPVPEASGPVDPGASVDMEHGGPPPPQS--LQNGEKNNKPPSGPPTPGGPPP 57
P P ++ P PG + P PPQ PP G P GPPP
Sbjct: 123 QTTPLPQPPSTAPSYPGPQY--QGYHPMPPQPGMPAPPYSLQYPPPGLLQPQGPPP 176
>gnl|CDD|218902 pfam06121, DUF959, Domain of Unknown Function (DUF959). This
N-terminal domain is not expressed in the 'Short'
isoform of Collagen A.
Length = 202
Score = 33.6 bits (76), Expect = 0.20
Identities = 18/75 (24%), Positives = 27/75 (36%), Gaps = 16/75 (21%)
Query: 9 PVPEASGPVDPGASVDMEHGGP---------------PPPQSLQNGEKNNKPPSGPPTPG 53
VP+AS PV S H P P + ++ G++N P + TP
Sbjct: 40 AVPQASTPVQSTESTTT-HVVPRPGETEESTTPASSEEPKEIVEKGKQNVVPGTVATTPT 98
Query: 54 GPPPGPDMNVGPKVK 68
P D+ P +
Sbjct: 99 VTPVAMDVASSPDLS 113
>gnl|CDD|181214 PRK08058, PRK08058, DNA polymerase III subunit delta'; Validated.
Length = 329
Score = 34.1 bits (79), Expect = 0.21
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 452 QHGIGALRDNPDSMGTSLDMLRRTARTLLHLSRHPDNKPLFLQQEQRL--LALVMSQILD 509
Q L D LD+L R LL+L +++ +F +Q++ L LAL SQ
Sbjct: 236 QEKWMPLFKEKDQQQLGLDLLLLIYRDLLYLQLGEEDRLVFREQKEMLQQLALSYSQ--- 292
Query: 510 QQVAAIISRVLYQTRPDLYQN 530
QQ+ A + +L + + L N
Sbjct: 293 QQIVAALELIL-EAKRRLNSN 312
Score = 34.1 bits (79), Expect = 0.21
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 576 QHGIGALRDNPDSMGTSLDMLRRTARTLLHLSRHPDNKPLFLQQEQRL--LALVMSQILD 633
Q L D LD+L R LL+L +++ +F +Q++ L LAL SQ
Sbjct: 236 QEKWMPLFKEKDQQQLGLDLLLLIYRDLLYLQLGEEDRLVFREQKEMLQQLALSYSQ--- 292
Query: 634 QQVAAIISRVLYQTRPDLYQN 654
QQ+ A + +L + + L N
Sbjct: 293 QQIVAALELIL-EAKRRLNSN 312
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 32.0 bits (73), Expect = 0.23
Identities = 12/60 (20%), Positives = 12/60 (20%), Gaps = 3/60 (5%)
Query: 1 MAATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPD 60
A EP P P PP N P PPP
Sbjct: 51 GAYVHLEPLPA-YGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGY--MADPQEPPPPYP 107
Score = 30.8 bits (70), Expect = 0.65
Identities = 17/61 (27%), Positives = 23/61 (37%), Gaps = 8/61 (13%)
Query: 2 AATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDM 61
A +A+P E+ PGA V +E P P + PP P P P P +
Sbjct: 33 RAQRAQPVSRESGYYPPPGAYVHLE---PLPA-----YGQYAAPPPYGPPPPYYPAPPGV 84
Query: 62 N 62
Sbjct: 85 Y 85
Score = 29.7 bits (67), Expect = 1.6
Identities = 13/60 (21%), Positives = 14/60 (23%), Gaps = 5/60 (8%)
Query: 3 ATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSG-----PPTPGGPPP 57
A P+P P P PP N PP GPP
Sbjct: 52 AYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPGPPQ 111
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 33.8 bits (77), Expect = 0.33
Identities = 22/104 (21%), Positives = 39/104 (37%), Gaps = 4/104 (3%)
Query: 93 KSEDKANGMTTTKPEETSVKIENNKEGESTVKKENTEGEETVKTDNTKEGEATVKLENGG 152
A+ T++ E+ V +E E + EE +K + K + K E+
Sbjct: 259 NLPKPAD--TSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDL--KQESKA 314
Query: 153 EEPSATDKAEKETTEKPEVKFVPKIIDPAGNAKRRRLSEYEDES 196
E A DK + ++ V + P A+ L+E +S
Sbjct: 315 SEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNEDAIDS 358
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 33.1 bits (75), Expect = 0.54
Identities = 23/99 (23%), Positives = 30/99 (30%), Gaps = 10/99 (10%)
Query: 1 MAATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSL--QNGEKNNKPPSGPPTPGGPPPG 58
+ A + P P V P PPQ G ++PP P P
Sbjct: 398 LGANPMQQGQPGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPP 457
Query: 59 -------PDMNVGPKVKCISYYDETAPSKP-NTIVNSDV 89
P M+ P + D +P P NT S V
Sbjct: 458 ALMPSPSPQMSQSPASQRTIQQDMVSPGGPLNTPGQSSV 496
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 32.0 bits (72), Expect = 0.90
Identities = 27/155 (17%), Positives = 47/155 (30%), Gaps = 16/155 (10%)
Query: 26 EHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMNVGPKVKCISYYDETAPSKPNTIV 85
E P++LQ N+ +G PP + Y E ++ I
Sbjct: 50 ESKQANEPETLQP---KNQTENGETAADLPPKPEERWS---------YIEELEAREVLIN 97
Query: 86 N----SDVHVVKSEDKANGMTTTKPEETSVKIENNKEGESTVKKENTEGEETVKTDNTKE 141
+ S+ V+ + E+ + ++ +T E T E K K+
Sbjct: 98 DPEEPSNGGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKK 157
Query: 142 GEATVKLENGGEEPSATDKAEKETTEKPEVKFVPK 176
+ E A K K + K +PK
Sbjct: 158 PQKARTAEAQKTPVETEKIASKVKEAKQKQKALPK 192
>gnl|CDD|178752 PLN03213, PLN03213, repressor of silencing 3; Provisional.
Length = 759
Score = 32.1 bits (72), Expect = 0.97
Identities = 25/84 (29%), Positives = 33/84 (39%)
Query: 74 DETAPSKPNTIVNSDVHVVKSEDKANGMTTTKPEETSVKIENNKEGESTVKKENTEGEET 133
D S + N DV SE A+ + T + + N EG+S K N E E
Sbjct: 512 DTAIDSMADDTANDDVGSDDSESLADTVIDTSVDAVPLDFVANTEGDSGDGKSNVEKHEN 571
Query: 134 VKTDNTKEGEATVKLENGGEEPSA 157
D E E+ V EN +E A
Sbjct: 572 GAEDLNAEKESLVVKENVVDEEEA 595
>gnl|CDD|236504 PRK09418, PRK09418, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 780
Score = 32.0 bits (72), Expect = 1.3
Identities = 24/85 (28%), Positives = 31/85 (36%), Gaps = 5/85 (5%)
Query: 4 TQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMNV 63
T E P +G D G E+ PP NGE PP+G GG P P +
Sbjct: 662 TGGENPTTPPTGEGDNG-----ENPTTPPTGEGNNGENPTTPPTGEGNNGGNPTTPSTDE 716
Query: 64 GPKVKCISYYDETAPSKPNTIVNSD 88
G + SK T V+ +
Sbjct: 717 GNNAGSGQTTTDNQNSKETTTVSEN 741
>gnl|CDD|227725 COG5438, COG5438, Predicted multitransmembrane protein [Function
unknown].
Length = 385
Score = 31.6 bits (72), Expect = 1.3
Identities = 23/92 (25%), Positives = 33/92 (35%), Gaps = 1/92 (1%)
Query: 67 VKCISYYDETAPSKPNTIVNSDVHVVKSEDKANGMTTTKPEETSVKIENNKEGESTVKKE 126
VK I+ AP T V + + + + + VKI N + VK
Sbjct: 31 VKFIAESAYNAPKGEITDVKIVSNNSEDKSSNKNRQVLQKQSIKVKILNGEYKGELVKIN 90
Query: 127 NTEGEETVKTDNTKEGEATVKLENGGEEPSAT 158
NT E NT+ G+ + E G PS
Sbjct: 91 NTVNESQAHNINTRVGDVVLI-EISGNTPSYY 121
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 31.8 bits (72), Expect = 1.6
Identities = 18/60 (30%), Positives = 20/60 (33%), Gaps = 15/60 (25%)
Query: 2 AATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPP-GPD 60
A P +A+ PGA GPP P PP P PP GPD
Sbjct: 899 TGRVATAPGGDAASAPPPGA-------GPPAPPQAV-------PPPRTTQPPAAPPRGPD 944
>gnl|CDD|225213 COG2334, COG2334, Putative homoserine kinase type II (protein
kinase fold) [General function prediction only].
Length = 331
Score = 31.2 bits (71), Expect = 1.7
Identities = 12/48 (25%), Positives = 18/48 (37%)
Query: 228 LTFVPGNELEFSKNSTFLRLLGRLILAYHEHGLRATKTRNYDKDDDDW 275
++PG LE N+ L LGR++ H R +W
Sbjct: 107 FEYLPGRPLERDDNAEQLEELGRMLGRLHLALRGFPFERPNALRRLEW 154
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.8 bits (70), Expect = 2.5
Identities = 15/84 (17%), Positives = 30/84 (35%), Gaps = 1/84 (1%)
Query: 94 SEDKANGMTTTKPEETSVKIENNKEGESTVKKE-NTEGEETVKTDNTKEGEATVKLENGG 152
S +N + +K ++ E NK ++ KK+ + K + K+ + + +
Sbjct: 25 SYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84
Query: 153 EEPSATDKAEKETTEKPEVKFVPK 176
E + K K PK
Sbjct: 85 SEGETKLGFKTPKKSKKTKKKPPK 108
Score = 30.5 bits (69), Expect = 2.9
Identities = 20/98 (20%), Positives = 38/98 (38%), Gaps = 17/98 (17%)
Query: 91 VVKSEDKANGMTTTKPEETSVKIENNKEGESTVKKENTEGEET--VKTDNTKEGEATVKL 148
++ + + T TS K + ++ + KK++ + ++ K + EGE +
Sbjct: 38 ILSTFSEEENKVAT----TSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
Query: 149 ENGGEEPSATDKAEKETTEKPEVKFVPKIIDPAGNAKR 186
T K K+T +KP PK + NA
Sbjct: 94 --------KTPKKSKKTKKKPP---KPKPNEDVDNAFN 120
>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
Members of this family belong to the collagen
superfamily. Collagens are generally extracellular
structural proteins involved in formation of connective
tissue structure. The alignment contains 20 copies of
the G-X-Y repeat that forms a triple helix. The first
position of the repeat is glycine, the second and third
positions can be any residue but are frequently proline
and hydroxyproline. Collagens are post translationally
modified by proline hydroxylase to form the
hydroxyproline residues. Defective hydroxylation is the
cause of scurvy. Some members of the collagen
superfamily are not involved in connective tissue
structure but share the same triple helical structure.
Length = 60
Score = 27.8 bits (63), Expect = 2.5
Identities = 17/40 (42%), Positives = 17/40 (42%), Gaps = 6/40 (15%)
Query: 29 GPPPPQSLQNGEKNNKPPSGPPTPGGPP--PGPDMNVGPK 66
GPP P G P GPP P GPP PGP G
Sbjct: 22 GPPGPP----GPPGPPGPPGPPGPPGPPGPPGPPGAPGAP 57
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 31.1 bits (70), Expect = 2.9
Identities = 20/59 (33%), Positives = 22/59 (37%), Gaps = 15/59 (25%)
Query: 11 PEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPG--PDMNVGPKV 67
EA P GA+ D GGPP P + PP P P PD VG V
Sbjct: 2484 AEARFPFAAGAAPDPGGGGPPDPDA-------------PPAPSRLAPAILPDEPVGEPV 2529
Score = 30.3 bits (68), Expect = 4.4
Identities = 16/56 (28%), Positives = 19/56 (33%), Gaps = 10/56 (17%)
Query: 4 TQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGP 59
PP PEA+ P A PP + P + P P PPP P
Sbjct: 275 ATGPPPPPEAAAPNGAAA----------PPDGVWGAALAGAPLALPAPPDPPPPAP 320
Score = 29.9 bits (67), Expect = 6.6
Identities = 18/58 (31%), Positives = 24/58 (41%), Gaps = 2/58 (3%)
Query: 2 AATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGP 59
+ +++ +P P DP A+V PP S P S PT PPPGP
Sbjct: 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP--PPTSAQPTAPPPPPGP 2845
Score = 29.5 bits (66), Expect = 8.1
Identities = 15/59 (25%), Positives = 20/59 (33%), Gaps = 1/59 (1%)
Query: 3 ATQAEPPVPEASGPVDPGASVDMEHGGPPP-PQSLQNGEKNNKPPSGPPTPGGPPPGPD 60
T+ A+ P G D + P P S+ P S PP P P P +
Sbjct: 381 PTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAE 439
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 30.4 bits (68), Expect = 3.8
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 32 PPQSLQNGEKNNKPPSGPPTPGGPPPGP 59
P ++ +K+++PP GP G PP P
Sbjct: 497 APIEEEDSDKHDEPPEGPEASGLPPKAP 524
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 30.2 bits (68), Expect = 4.1
Identities = 18/89 (20%), Positives = 24/89 (26%), Gaps = 8/89 (8%)
Query: 3 ATQAEPPVPEASGPVDPGASVDMEHGGP--PPPQSLQNGEKNNKPPSGPPTPGGPPPGPD 60
A P P+ + P S P P + +P TP PP P
Sbjct: 358 ALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPV 417
Query: 61 MNVGPKVKCISYYDETAPSKPNTIVNSDV 89
P E+AP + D
Sbjct: 418 APPVPHTP------ESAPKLTRAAIPVDE 440
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 29.8 bits (67), Expect = 4.1
Identities = 27/94 (28%), Positives = 32/94 (34%), Gaps = 27/94 (28%)
Query: 8 PPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMNVG-PK 66
PP P A P P A+ P P S PPTP PPP P
Sbjct: 155 PPPPHAMPPASPPAAQPA----PSAP-----------ASSPPPTPASPPPAKAPKSSHPP 199
Query: 67 VKCI---SYYDETAPSKPNTIVNSDVHVVKSEDK 97
+K ++Y AP +P VK DK
Sbjct: 200 LKSPMAGTFYRSPAPGEP--------PFVKVGDK 225
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human
Gammaherpesvirus that infects and establishes latency
in B lymphocytes in vivo. The latent membrane protein 2
(LMP2) gene is expressed in latently infected B cells
and encodes two protein isoforms, LMP2A and LMP2B, that
are identical except for an additional N-terminal 119
aa cytoplasmic domain which is present in the LMP2A
isoform. LMP2A is thought to play a key role in either
the establishment or the maintenance of latency and/or
the reactivation of productive infection from the
latent state. The significance of LMP2B and its role in
pathogenesis remain unclear.
Length = 489
Score = 30.2 bits (68), Expect = 4.3
Identities = 14/55 (25%), Positives = 19/55 (34%), Gaps = 10/55 (18%)
Query: 21 ASVDMEHGGPPPPQSL-----QNGEKNNKPPSG----PPTPGGP-PPGPDMNVGP 65
S++M+ G P+S G+ N PS PG P P D
Sbjct: 2 VSLEMQPLGAGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHR 56
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 29.9 bits (67), Expect = 4.4
Identities = 21/90 (23%), Positives = 30/90 (33%), Gaps = 12/90 (13%)
Query: 4 TQAEPPVPEAS--GPVDPG----------ASVDMEHGGPPPPQSLQNGEKNNKPPSGPPT 51
T A PP+ E S G DP A V + P P++ + E P +
Sbjct: 171 TLAAPPLGEGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSP 230
Query: 52 PGGPPPGPDMNVGPKVKCISYYDETAPSKP 81
P P P + + E P+ P
Sbjct: 231 PSTTIPAPSTTIAAPQAGTTPEAEGTPAPP 260
>gnl|CDD|226193 COG3667, PcoB, Uncharacterized protein involved in copper
resistance [Inorganic ion transport and metabolism].
Length = 321
Score = 29.8 bits (67), Expect = 4.6
Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 2/57 (3%)
Query: 1 MAATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPP 57
A ++ P A+ +D GA M+H PPP + E++ P S PP
Sbjct: 53 HAHSKM-PGPEMAAPQMDHGAMPHMDH-APPPIPTQHAAERSRSPASAAARVAAFPP 107
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 30.0 bits (67), Expect = 4.9
Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 1/65 (1%)
Query: 3 ATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMN 62
A AEP EA P + G+ + P + + P+ P P +
Sbjct: 205 AAPAEPAEEEAPAPSEAGSEPAPD-PAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGD 263
Query: 63 VGPKV 67
GP V
Sbjct: 264 SGPYV 268
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 30.0 bits (68), Expect = 5.3
Identities = 10/62 (16%), Positives = 14/62 (22%), Gaps = 3/62 (4%)
Query: 1 MAATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSL---QNGEKNNKPPSGPPTPGGPPP 57
A P P +G G + P Q + P+ P P
Sbjct: 432 APAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPA 491
Query: 58 GP 59
Sbjct: 492 AA 493
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 30.0 bits (67), Expect = 5.3
Identities = 24/90 (26%), Positives = 33/90 (36%), Gaps = 10/90 (11%)
Query: 84 IVNSDVHVVKSEDKANGMTTTKP-----EETSVKIENNKEGESTVKK----ENTEGEETV 134
+VN D + K K T K + K E VKK +G E+
Sbjct: 99 VVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESE 158
Query: 135 KTDNTKEGEATVKLENGGEEPSATDKAEKE 164
+D +E E LEN EE +K + E
Sbjct: 159 VSD-VEESEFVTSLENESEEELDLEKDDGE 187
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 29.8 bits (67), Expect = 5.4
Identities = 32/172 (18%), Positives = 59/172 (34%), Gaps = 14/172 (8%)
Query: 66 KVKCISYYDETAPSK-PNTIVNSDVHVVKSEDKANGMTTTKPEETSVKIENNKEGESTVK 124
K S Y S ++++N D++ K +K + K +T ++ K K
Sbjct: 24 KNLKHSSYKIRLESDIKDSLLNLDINK-KLHEKLD-KKNKKFNKTDDLKDSKKTKLKQKK 81
Query: 125 KENTEGEETVKTDNTKEGEATVKLENGGEEPSATDKAEKETTEKPEVKFVPKIIDPAGNA 184
K + DN + + K G S K + +T + + V K
Sbjct: 82 KIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKST---NKK 138
Query: 185 KRRRLSEYEDESYCRDEASLYPVSDSQDNIARRCIALSNILRNLTFVPGNEL 236
K++ LS +DE Y + + + + L L +P E+
Sbjct: 139 KKKVLSS-------KDELIKYDNNKPKSISIHSPLTIQE-LSTLLCIPETEI 182
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 29.8 bits (67), Expect = 5.6
Identities = 14/52 (26%), Positives = 16/52 (30%), Gaps = 2/52 (3%)
Query: 8 PPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGP 59
P P GP G + P G P+ PP P PPG
Sbjct: 695 PQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEP--EPPGA 744
Score = 29.8 bits (67), Expect = 6.1
Identities = 12/59 (20%), Positives = 14/59 (23%), Gaps = 5/59 (8%)
Query: 12 EASGPVDPGASVDMEHGGPPPPQSLQNGEKNN-----KPPSGPPTPGGPPPGPDMNVGP 65
EA GP PG P + G+ P P P P
Sbjct: 653 EAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGGLP 711
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 29.8 bits (67), Expect = 5.7
Identities = 12/55 (21%), Positives = 15/55 (27%), Gaps = 2/55 (3%)
Query: 6 AEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPD 60
P SGP AS +G + P+ PP PP
Sbjct: 797 RRPGRLRRSGPAADAASRTASKRKSRSHT--PDGGSESSGPARPPGAAARPPPAR 849
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 28.1 bits (63), Expect = 5.9
Identities = 17/61 (27%), Positives = 17/61 (27%), Gaps = 21/61 (34%)
Query: 3 ATQAEPPVPE----ASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPG 58
A PP E P P GP P S P PP P PP
Sbjct: 59 AVWVLPPPSEPAAPPPDPEPPVP-------GPAGPPS----------PLAPPAPARKPPL 101
Query: 59 P 59
P
Sbjct: 102 P 102
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 29.5 bits (66), Expect = 6.1
Identities = 13/40 (32%), Positives = 16/40 (40%), Gaps = 2/40 (5%)
Query: 28 GGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMNVGPKV 67
GG PPP + E PP PPP P + +V
Sbjct: 170 GGTPPPDDDEGDEAGAPATPAPPLH--PPPAPHPHPIAEV 207
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 29.7 bits (66), Expect = 6.3
Identities = 18/79 (22%), Positives = 26/79 (32%), Gaps = 2/79 (2%)
Query: 3 ATQAEPPVPEASGPVDPGASVDMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPPGPDMN 62
Q P PV A E PQS E+ + P+ P +
Sbjct: 428 PAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQ 487
Query: 63 VGPKVKCISYYDETAPSKP 81
P V+ +ET P++P
Sbjct: 488 --PVVEPEPVVEETKPARP 504
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 29.7 bits (66), Expect = 6.4
Identities = 29/109 (26%), Positives = 48/109 (44%), Gaps = 18/109 (16%)
Query: 97 KANGMTTTKPEETSVKIENNKEGESTVKKENTEGEETVKTDNTKEGEATVKLENGGE-EP 155
N TT P K E++ EG + +K EE+V++ ++ EN E E
Sbjct: 534 GGNAATTADPN-GIAKREDHPEGGTNRQKYEQSDEESVESSSS---------ENSSENEN 583
Query: 156 SATDKAE------KETTEKPEVKFVPK-IIDPAGNAKRRRLSEYEDESY 197
TDK E K+T + ++ +P+ II+ K++ L +E Y
Sbjct: 584 EVTDKGEEIYSLLKKTINRIDMNKIPRPIINTQEKKKKKNLKVFETCKY 632
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 29.5 bits (67), Expect = 6.6
Identities = 12/69 (17%), Positives = 29/69 (42%)
Query: 104 TKPEETSVKIENNKEGESTVKKENTEGEETVKTDNTKEGEATVKLENGGEEPSATDKAEK 163
T ++ + KI+ E ++E + ++ K+ E + + EE ++ E
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 164 ETTEKPEVK 172
E ++ E +
Sbjct: 463 EEEKEEEEE 471
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 29.6 bits (66), Expect = 7.4
Identities = 27/82 (32%), Positives = 35/82 (42%), Gaps = 5/82 (6%)
Query: 96 DKANGMTTTKPEETSVKIENNKEGESTV-KKENTEGE---ETVKTDNTKEGEATVKLENG 151
+ +G + ET K EN EGE +K EGE E + D+ E EA G
Sbjct: 660 GEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEG 719
Query: 152 GEEPSAT-DKAEKETTEKPEVK 172
E T D+ E ET E+ E
Sbjct: 720 ETEAEGTEDEGEIETGEEGEEV 741
>gnl|CDD|150310 pfam09602, PhaP_Bmeg, Polyhydroxyalkanoic acid inclusion protein
(PhaP_Bmeg). This entry describes a protein found in
polyhydroxyalkanoic acid (PHA) gene regions and
incorporated into PHA inclusions in Bacillus cereus and
Bacillus megaterium. The role of the protein may include
amino acid storage.
Length = 168
Score = 28.6 bits (63), Expect = 7.9
Identities = 22/95 (23%), Positives = 39/95 (41%), Gaps = 2/95 (2%)
Query: 103 TTKPEETSVKI--ENNKEGESTVKKENTEGEETVKTDNTKEGEATVKLENGGEEPSATDK 160
TK +T + EN +G + E+ ++ +A KL +E A DK
Sbjct: 2 ETKKYDTLIDAFWENWSKGLQLFADAGKQIEQLTLEALEQQQDALHKLTKAVDELEAEDK 61
Query: 161 AEKETTEKPEVKFVPKIIDPAGNAKRRRLSEYEDE 195
K E + V + AGNA ++ E++++
Sbjct: 62 QFKAEFEDLQQDTVENLRKTAGNALADQIEEWQEK 96
>gnl|CDD|220948 pfam11027, DUF2615, Protein of unknown function (DUF2615). This
small. approximately 100 residue, family is conserved
from worms to humans. It is cysteine-rich with a
characteristic FDxCEC sequence motif. The function is
not known.
Length = 103
Score = 27.4 bits (61), Expect = 8.5
Identities = 11/28 (39%), Positives = 14/28 (50%), Gaps = 2/28 (7%)
Query: 33 PQSLQNGEKNNKP--PSGPPTPGGPPPG 58
P SL+N +NKP + P PPP
Sbjct: 74 PSSLRNPSGDNKPRDNNSNDGPPPPPPA 101
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 28.9 bits (65), Expect = 8.7
Identities = 22/86 (25%), Positives = 32/86 (37%), Gaps = 6/86 (6%)
Query: 2 AATQAEPPVPEASGPVDPGASV----DMEHGGPPPPQSLQNGEKNNKPPSGPPTPGGPPP 57
+A+ ++PP P P + PP P S Q+ P PT PPP
Sbjct: 181 SASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQS--DTPPPSPESPTNPSPPP 238
Query: 58 GPDMNVGPKVKCISYYDETAPSKPNT 83
GP P V+ + P+ P+
Sbjct: 239 GPAAPPPPPVQQVPPLSTAKPTPPSA 264
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.131 0.381
Gapped
Lambda K H
0.267 0.0635 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,315,142
Number of extensions: 3298846
Number of successful extensions: 3286
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3114
Number of HSP's successfully gapped: 151
Length of query: 655
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 552
Effective length of database: 6,369,140
Effective search space: 3515765280
Effective search space used: 3515765280
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 62 (27.9 bits)