RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7576
(481 letters)
>gnl|CDD|217838 pfam04004, Leo1, Leo1-like protein. Members of this family are
part of the Paf1/RNA polymerase II complex. The Paf1
complex probably functions during the elongation phase
of transcription. The Leo1 subunit of the yeast
Paf1-complex binds RNA and contributes to complex
recruitment. The subunit acts by co-ordinating
co-transcriptional chromain modifications and helping
recruitment of mRNA 3prime-end processing factors.
Length = 312
Score = 227 bits (581), Expect = 5e-71
Identities = 111/297 (37%), Positives = 163/297 (54%), Gaps = 25/297 (8%)
Query: 130 EEGDGEGERRERDEDARSDASRRGSDR------EDDEDKENRTETGEDKEEEEPVETRID 183
+ +R+ + D RG + +D + + E + E+ V +
Sbjct: 1 DREQDSHRSNDRNSLSGDDDLSRGGNGDDGEINGEDPSQFSEIEEATETFPEKDVAVKFQ 60
Query: 184 VEIPKISCDL---GREMHFVKLPNFLSVETRPFDPETYEDEIEDEETL---DEEGRARLK 237
E+P + L F K+PNFLSVE +PFDPET+E E+ E+ + R R+K
Sbjct: 61 TEVPTQNFGLPAGSNPSFFAKIPNFLSVEPKPFDPETFEAEMTKEDAKDEHEAHWRQRIK 120
Query: 238 LKVENTIRWRDTFNKDGDLVKESNARLVKWSDGSLSLHLGSEIFDVYKQPLQGDHNHLYI 297
L+VENT+RWR+ +KDG KESNA++V+WSDGS SLH+G+EIFD+ QPL + NHL +
Sbjct: 121 LRVENTVRWRENKSKDGKTTKESNAQIVRWSDGSYSLHIGNEIFDIMHQPLSVNQNHLTV 180
Query: 298 RQ-GTGLQGQAVFRTKLSFRPHSTESFTHRKMTLSLADRSNKTTGIKILGQVGYDPDANR 356
T LQ Q V + KL+FRP ST SFTHRK+T +LA R K+ G+KI VG DP+ +
Sbjct: 181 SHEQTLLQPQGVIKKKLTFRPSSTNSFTHRKLTRNLATRQRKSGGVKIT-IVGKDPEHEK 239
Query: 357 YEKIKKEEEKLRQSLRKETKTKRTREK-----------GASSRLSAGYLEGDDDDDE 402
E+ KKEE+KLR R++ + K + ++ + Y E +D+ +
Sbjct: 240 REREKKEEQKLRARRRRQNREKMKNKPPNRPGHGSGSDSNVAKAATTYSEDEDEGSD 296
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 41.1 bits (96), Expect = 0.001
Identities = 26/136 (19%), Positives = 56/136 (41%), Gaps = 16/136 (11%)
Query: 56 DHLGRIEVAAD-QLQGQTEVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGDASD 114
+ G I+ D +++G +E + + D H + G+ +
Sbjct: 778 EDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQ-E 836
Query: 115 ISTDDDDDGK--LQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDK 172
++ ++ + K + V+ G +G E +E+ E++E++E E E++
Sbjct: 837 LNAENQGEAKQDEKGVDGGGGSDGGDSEEEEE------------EEEEEEEEEEEEEEEE 884
Query: 173 EEEEPVETRIDVEIPK 188
EEEE E + +E P+
Sbjct: 885 EEEEENEEPLSLEWPE 900
Score = 33.0 bits (75), Expect = 0.46
Identities = 36/172 (20%), Positives = 62/172 (36%), Gaps = 11/172 (6%)
Query: 119 DDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPV 178
+ +G+ ++ +E D +G + +A ++ E ED E ETGE+ EE E
Sbjct: 691 GEQEGEGEIEAKEADHKG-----ETEAEEVEHEGETEAEGTED-EGEIETGEEGEEVED- 743
Query: 179 ETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLDEEGRARLKL 238
E + E G K + E EI+ E + +G +
Sbjct: 744 EGEGEAEGKHEVETEGDR----KETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEG 799
Query: 239 KVENTIRWRDTFNKDGDLVKESNARLVKWSDGSLSLHLGSEIFDVYKQPLQG 290
KVE+ + + E+ A + D + L +E KQ +G
Sbjct: 800 KVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKG 851
Score = 31.9 bits (72), Expect = 0.93
Identities = 31/167 (18%), Positives = 57/167 (34%), Gaps = 6/167 (3%)
Query: 71 QTEVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGDASDISTDDDDDGKLQVVEE 130
E + E ++ ++ DH E + G + + ++G+ E
Sbjct: 686 PAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEG 745
Query: 131 EGDGEGER---RERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPVETRIDVEIP 187
EG+ EG+ E D ++ ++DED E + GED E + VE
Sbjct: 746 EGEAEGKHEVETEGDRKETEHEGETEAEGKEDED-EGEIQAGEDGEMKGDEGAEGKVEHE 804
Query: 188 KISCDLGREMHFVKLPNFLSVETRPFDPETYE--DEIEDEETLDEEG 232
+ ++ H + + E E + E DE+G
Sbjct: 805 GETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKG 851
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 37.5 bits (87), Expect = 0.016
Identities = 38/182 (20%), Positives = 57/182 (31%), Gaps = 45/182 (24%)
Query: 86 GEDLDHHLEALVALLLKLNAEALFGDASDISTDDDDD-----------GKLQVVEEEGDG 134
G+ ++ L + D ++ + DD G + + EE GD
Sbjct: 166 GDAPPDSAGKVLDLWRDEIEDKAGEDLDGLAAEIDDQQAFARVVRDMLGSMDMAEETGDD 225
Query: 135 EGERRERDEDARSDASRRGSDRE--------------------DDEDKENRTETGEDKEE 174
E +ED D D E D E +E + E E+
Sbjct: 226 GIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASED 285
Query: 175 EEPVETRIDVEIP-------KISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEET 227
E E+ D E P +L E+ + V TR FD +E+ DE
Sbjct: 286 SESDESDEDTETPGEDARPATPFTELMEEVDY-------KVFTREFDEIVLAEELCDEAE 338
Query: 228 LD 229
LD
Sbjct: 339 LD 340
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 34.6 bits (79), Expect = 0.15
Identities = 21/68 (30%), Positives = 31/68 (45%)
Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
S D+ D+ + +V E E + E E DED SD S ++ E+ + E GED +E
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986
Query: 176 EPVETRID 183
E
Sbjct: 987 ESKAAYDS 994
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 34.1 bits (78), Expect = 0.17
Identities = 33/120 (27%), Positives = 51/120 (42%), Gaps = 9/120 (7%)
Query: 111 DASDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGE 170
++ D DDD EE+G+GEGE +E S+A+ R S+ +E+ +
Sbjct: 208 ESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESE-SGEEEMVQSDQDDL 266
Query: 171 DKEEEEPVETRIDVEIP-KISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLD 229
E ++ ET + P + G E + V T FD +E+ DEE LD
Sbjct: 267 PDESDDDSETPGEGARPARPFTSTGGEPDY-------KVFTTAFDETVDAEELCDEEELD 319
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 33.8 bits (77), Expect = 0.26
Identities = 28/140 (20%), Positives = 49/140 (35%), Gaps = 30/140 (21%)
Query: 114 DISTDDDDDGKLQVVEEEGDGEGERR-----ERDEDARSDASRRGSDREDDEDKENRTE- 167
D+ + + G+L+ G G G D+D A+ + + D D+ E
Sbjct: 3801 DLPQEKSNSGELESGTGLGSGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESEAREL 3860
Query: 168 ----TGE---------DKEEEEPVETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFD 214
G + + E +D E+ I D L N S+ + +D
Sbjct: 3861 ESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPED---------LSN--SLNEKLWD 3909
Query: 215 PETYEDEIEDEETLDEEGRA 234
ED +E E+ +E+ A
Sbjct: 3910 EPNEEDLLETEQKSNEQSAA 3929
Score = 28.8 bits (64), Expect = 8.2
Identities = 20/86 (23%), Positives = 36/86 (41%), Gaps = 5/86 (5%)
Query: 114 DISTDDDDD--GKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGED 171
D S + + D + E+E D G+++ +D SD + G ED E + +
Sbjct: 4232 DESANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTG---EDTLPTEFGSINQSE 4288
Query: 172 KEEEEPVETRIDVEIPKISCDLGREM 197
K E + I+ E+P + + M
Sbjct: 4289 KVFELSEDEDIEDELPDYNVKITPAM 4314
>gnl|CDD|237115 PRK12489, PRK12489, anaerobic C4-dicarboxylate transporter;
Reviewed.
Length = 443
Score = 32.5 bits (75), Expect = 0.48
Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 149 ASRRGSDREDDEDKENRTETGEDKEE-EEPVETRIDVEIPK 188
+ RRG D + D + + R + E +E T +D ++PK
Sbjct: 192 SLRRGKDLDKDPEFQERLKDPEFREYVYGTTATLLDKKLPK 232
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 32.9 bits (75), Expect = 0.51
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Query: 129 EEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPVETRIDVEIPK 188
EEE E DE+ +A R + + D+D+ + ED+E++E E+ EI K
Sbjct: 1 EEEQVNTQANEEEDEE-ELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISK 59
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 32.7 bits (75), Expect = 0.51
Identities = 17/86 (19%), Positives = 37/86 (43%), Gaps = 15/86 (17%)
Query: 111 DASDISTDDDDDGKL---------------QVVEEEGDGEGERRERDEDARSDASRRGSD 155
+ S S DD DD + EE+G + + + D+D +
Sbjct: 293 EDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLS 352
Query: 156 REDDEDKENRTETGEDKEEEEPVETR 181
E++++++ ++ +D+EEEE + +
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEK 378
Score = 28.8 bits (65), Expect = 7.8
Identities = 29/122 (23%), Positives = 52/122 (42%), Gaps = 15/122 (12%)
Query: 124 KLQVVEEE------GDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEP 177
+L+ +E E G+ E + E D +D + +DD++ +++EEE+
Sbjct: 270 RLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDG 329
Query: 178 VETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLDEEGRARLK 237
V+ D + DL E V L + D E + + ED+E +EE + + K
Sbjct: 330 VD---DEDEEDDDDDLEEEEEDVDLSDEEE------DEEDEDSDDEDDEEEEEEEKEKKK 380
Query: 238 LK 239
K
Sbjct: 381 KK 382
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 32.6 bits (75), Expect = 0.59
Identities = 17/56 (30%), Positives = 26/56 (46%)
Query: 110 GDASDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENR 165
D D DD+ + + ++G+G G RR R R RRG D ++ + E R
Sbjct: 233 RDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELR 288
Score = 30.6 bits (70), Expect = 2.0
Identities = 18/67 (26%), Positives = 24/67 (35%), Gaps = 1/67 (1%)
Query: 110 GDASDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETG 169
G + D DD + E+ GD ER RD R RR R+ D
Sbjct: 191 GRREERGRDGDDRDRRDRREQ-GDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGD 249
Query: 170 EDKEEEE 176
D ++ E
Sbjct: 250 RDGDDGE 256
Score = 29.5 bits (67), Expect = 5.2
Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 2/66 (3%)
Query: 118 DDDDDGKLQVVEEEGDGE--GERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
+ G+ E D E ERDE R + ++ R E G D ++
Sbjct: 144 KAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDR 203
Query: 176 EPVETR 181
+ + R
Sbjct: 204 DRRDRR 209
>gnl|CDD|226950 COG4584, COG4584, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 278
Score = 31.7 bits (72), Expect = 0.65
Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 2/49 (4%)
Query: 231 EGRARLKLKVENTIR--WRDTFNKDGDLVKESNARLVKWSDGSLSLHLG 277
GR R K KVE + R F ++ N R + W + G
Sbjct: 221 PGRPRTKGKVERMVGYVARSFFLPGTLDLENLNRRALLWLADVANRRHG 269
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.8 bits (72), Expect = 0.84
Identities = 21/118 (17%), Positives = 46/118 (38%), Gaps = 6/118 (5%)
Query: 113 SDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDK 172
+ +++++ + + E+E + E E +E+ + GS+ E + E + E +
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADN--GSEEEMEGSSEGDGDGEEPE 500
Query: 173 EEEEPVETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLDE 230
E+ E + + R S + P PE+ + E EE+ +E
Sbjct: 501 EDAERRNSEMAGISRMSEGQQPRGSSVQP----ESPQEEPLQPESMDAESVGEESDEE 554
>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
terminus. The C terminus of the plasma membrane Nha1
antiporter plays an important role in the immediate cell
response to hypo-osmotic shock which prevents an
execessive loss of ions and water. This domain is found
with pfam00999.
Length = 430
Score = 31.3 bits (71), Expect = 1.3
Identities = 21/112 (18%), Positives = 43/112 (38%), Gaps = 3/112 (2%)
Query: 73 EVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGDASDISTDDDD---DGKLQVVE 129
+ ++ + ++D A + E D+S ++ + + L
Sbjct: 300 DFIKQIRNLDPKARREVDLETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSL 359
Query: 130 EEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPVETR 181
+ R+ R+ + S SR S R+ +++ R ++ D EEE P E R
Sbjct: 360 LGSEASERRKNREPSSSSPPSRPRSRRDSEDEDTERDDSDSDDEEETPAERR 411
>gnl|CDD|224595 COG1681, FlaB, Archaeal flagellins [Cell motility and secretion].
Length = 209
Score = 30.4 bits (69), Expect = 1.6
Identities = 7/24 (29%), Positives = 13/24 (54%)
Query: 339 TTGIKILGQVGYDPDANRYEKIKK 362
++GI+++G VGY + I
Sbjct: 51 SSGIEVVGAVGYAGNTTPGGTIDY 74
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 31.0 bits (70), Expect = 1.7
Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 3/64 (4%)
Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
S DDDDD + + E+ + + E + D S D D++ T+ D+ +
Sbjct: 85 SLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLEL---PDRDEDADTQANNDQTND 141
Query: 176 EPVE 179
+
Sbjct: 142 FDQD 145
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 30.6 bits (69), Expect = 1.7
Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 1/76 (1%)
Query: 113 SDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKE-NRTETGED 171
+ D + L+ E+ + + D SD + SD E+D+D++ + E E
Sbjct: 206 GLPDSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKEL 265
Query: 172 KEEEEPVETRIDVEIP 187
KE+EE ET + E
Sbjct: 266 KEDEEEEETEEEEEEE 281
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 30.6 bits (69), Expect = 1.7
Identities = 28/106 (26%), Positives = 40/106 (37%), Gaps = 19/106 (17%)
Query: 83 VVQGEDLDHHLEALVALLLKLNAEALFGDASDISTDDDDDG-KLQVVEEEGDGEGERRER 141
VV G H + ++AL D++ D DD +L E D G
Sbjct: 18 VVTGSTPTEHEQNMLAL--------------DVAARDGDDAIELAEPETSDDPYGNPDPF 63
Query: 142 DEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPVETRIDVEIP 187
ED R S+ ++E+ E GE+ EEEE E + E
Sbjct: 64 GEDDEGRIEVRISEDGEEEE----VEEGEEDEEEEGEEESEEFEPL 105
>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
subunit.
Length = 1044
Score = 30.7 bits (69), Expect = 2.1
Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 3/71 (4%)
Query: 123 GKLQVVEEEGDGEGERRERDEDARSDASRRG---SDREDDEDKENRTETGEDKEEEEPVE 179
GK E EG G E ++R + ++ +RG D DD D + E E +P+
Sbjct: 137 GKDLEAEREGYGNAEWKDRVDKWKTRQEKRGLVNKDDSDDGDDKGDEEEYLLAEARQPLW 196
Query: 180 TRIDVEIPKIS 190
++ + KI+
Sbjct: 197 RKVPIPSSKIN 207
>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor. This
product is encoded by astrovirus ORF2, one of the three
astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
undergoes an intracellular cleavage to form a 79kD
protein. Subsequently, extracellular trypsin cleavage
yields the three proteins forming the infectious virion.
Length = 787
Score = 30.5 bits (69), Expect = 2.1
Identities = 22/94 (23%), Positives = 29/94 (30%), Gaps = 15/94 (15%)
Query: 100 LLKLNAEALFGDASDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASR----RGSD 155
L + L G D D + + EE D E E ED + R S
Sbjct: 643 SLPQSRAELSGLEERAVEDAPDPYEDLISLEETDTED---ESTEDEDDELDRFDLHDSSG 699
Query: 156 REDDEDKENRTET--------GEDKEEEEPVETR 181
E ++D EN T G E + R
Sbjct: 700 SEPEDDDENNRVTLLSTLINQGMSVERATRITKR 733
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 29.7 bits (67), Expect = 2.3
Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 1/62 (1%)
Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRG-SDREDDEDKENRTETGEDKEE 174
S + DDG+ E D + E + +E D + SD E+++ R E D E
Sbjct: 3 SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA 62
Query: 175 EE 176
E
Sbjct: 63 EA 64
>gnl|CDD|181997 PRK09620, PRK09620, hypothetical protein; Provisional.
Length = 229
Score = 29.9 bits (67), Expect = 2.5
Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 4/55 (7%)
Query: 188 KISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLDEE---GRARLKLK 239
KIS D+ +HF K P L + + +DPET + E ++EE RA+ +++
Sbjct: 118 KISSDIAPIIHFQKAPKVLK-QIKQWDPETVLVGFKLESDVNEEELFERAKNRME 171
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.4 bits (68), Expect = 3.2
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 140 ERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPV 178
E D R + D ++DED ++ + E++EEEE +
Sbjct: 142 EEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEI 180
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 30.0 bits (67), Expect = 3.4
Identities = 14/34 (41%), Positives = 17/34 (50%), Gaps = 5/34 (14%)
Query: 109 FGDASDISTDDDDDGKLQVVEEEGDGEGERRERD 142
+GD SD + DD G Q +EEE E ER
Sbjct: 495 WGDESDSGIEHDDGGVGQAIEEE-----EEEERA 523
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 29.2 bits (66), Expect = 4.5
Identities = 18/67 (26%), Positives = 34/67 (50%), Gaps = 1/67 (1%)
Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
D DD+G+ VE + + E E DE+ + +A+++ + D+E E E ++EE
Sbjct: 109 DEDSDDEGEWIDVESDKEIESSDSE-DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEA 167
Query: 176 EPVETRI 182
E + +
Sbjct: 168 EAEKEKA 174
Score = 29.2 bits (66), Expect = 4.8
Identities = 36/143 (25%), Positives = 53/143 (37%), Gaps = 14/143 (9%)
Query: 62 EVAADQLQ----GQTEVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGD-ASDIS 116
EV + L+ G+ + + GE+ + LL K E A
Sbjct: 33 EVNPELLKKKDRGKPASMGLEAGEKKGYGEEEVVDGIPGLELLEKWKEEERKKKEAEQGL 92
Query: 117 TDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDK---------ENRTE 167
DDDD + + E E D + + D SD SD ED+E+K ++ E
Sbjct: 93 ESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEE 152
Query: 168 TGEDKEEEEPVETRIDVEIPKIS 190
E+ EEE E + E K S
Sbjct: 153 LSEEDEEEAAEEEEAEAEKEKAS 175
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 29.2 bits (65), Expect = 5.3
Identities = 24/120 (20%), Positives = 41/120 (34%), Gaps = 7/120 (5%)
Query: 62 EVAADQLQGQTEVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGDASDISTDDDD 121
E+AA + V + Q +D+ ++A VA S + D D
Sbjct: 302 ELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVA-----EVTDEVAAESVVQVADRD 356
Query: 122 DGKLQVVEEEGDGEGERRERDEDA--RSDASRRGSDREDDEDKENRTETGEDKEEEEPVE 179
VEE + + ER + + A A + ++ +E E +E EP
Sbjct: 357 GESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEV 416
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 29.2 bits (66), Expect = 5.7
Identities = 13/52 (25%), Positives = 26/52 (50%), Gaps = 5/52 (9%)
Query: 332 LADRSNKTTGIKILGQVGYDPDANRYEKIKKEEEKLRQSLRKETKTKRTREK 383
+ D+ + + K+ D + + EK +KE K ++ LRK K + ++K
Sbjct: 535 IEDKPDGPSVWKL-----DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKK 581
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 28.9 bits (65), Expect = 5.8
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 6/60 (10%)
Query: 118 DDDDDGKLQV-----VEEEGDGEGERRERDEDARSDASRR-GSDREDDEDKENRTETGED 171
+D+D+ + Q EE D E E + ED S++S GSD E DKE+ +++ +
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 29.1 bits (65), Expect = 6.3
Identities = 21/110 (19%), Positives = 36/110 (32%), Gaps = 10/110 (9%)
Query: 367 LRQSLRKE-TKTKRTREKGASSRLSAGYLEGDDDDDEGAI---------SLSAIKNKYNA 416
LRQ +R+E K K + + + D D+DE + S K
Sbjct: 157 LRQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEER 216
Query: 417 KKNPAGASGAAPSRNIYSSDEDASDLELRTKKHDKPKRVLQDSDDEDEKS 466
+ ++ +E DL + + +R D D+ E S
Sbjct: 217 RAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDS 266
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 28.8 bits (64), Expect = 6.6
Identities = 14/61 (22%), Positives = 23/61 (37%), Gaps = 8/61 (13%)
Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
S++ D K E D + D D D DD+D E+ +T + +E
Sbjct: 257 SSNKYGDFKGVNYGYESDDDEGSSSNDYDEEED--------GDDDDNEDNDDTNTNHGDE 308
Query: 176 E 176
+
Sbjct: 309 D 309
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 29.2 bits (66), Expect = 6.6
Identities = 13/64 (20%), Positives = 27/64 (42%), Gaps = 5/64 (7%)
Query: 118 DDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEP 177
D D + +LQ D + E D+D + +++ K E +D ++++
Sbjct: 103 DGDIEEELQDEPRYDDAYRDLEEDDDDDEES-----DEEDEESSKSEDDEDDDDDDDDDD 157
Query: 178 VETR 181
+ TR
Sbjct: 158 IATR 161
>gnl|CDD|132321 TIGR03278, methan_mark_10, methanogenesis marker radical SAM
protein. Members of this protein family, to date, are
found in a completed prokaryotic genome if and only if
the species is one of the archaeal methanogens. It is a
radical SAM enzyme by homology. The exact function is
unknown, but likely is linked to methanogenesis. In most
genomes, the member of this family is encoded by a gene
next to, and divergently transcribed from, the methyl
coenzyme M reductase operon.
Length = 404
Score = 28.9 bits (65), Expect = 6.7
Identities = 19/120 (15%), Positives = 42/120 (35%), Gaps = 27/120 (22%)
Query: 253 DGDLVKESNARLVKW-SDGSLSLHLGSEIFDVYKQPLQGDHNHLYIRQGTGLQGQAVFRT 311
DGD++ ++ A L W + + + + + L + + G
Sbjct: 181 DGDVLWKTCADLESWGAKALILMRFANT-----------EEQGLILGNAPIIPGI----- 224
Query: 312 KLSFRPHSTESFTHRKMTLSLADRSNKTTGIKILGQVGYDPDANRYEKIKKEEEKLRQSL 371
+PH+ F ++ ++K I++ G DP+ + KE + + L
Sbjct: 225 ----KPHTVSEFK------NIVRETHKEFPIRVTGTPLCDPETGAPFALAKEPNEYLEIL 274
>gnl|CDD|236198 PRK08243, PRK08243, 4-hydroxybenzoate 3-monooxygenase; Validated.
Length = 392
Score = 28.6 bits (65), Expect = 6.9
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 70 GQTEVVQDLEAARVVQGEDL 89
GQTEV +DL AAR+ G +
Sbjct: 101 GQTEVTRDLMAARLAAGGPI 120
>gnl|CDD|227847 COG5560, UBP12, Ubiquitin C-terminal hydrolase [Posttranslational
modification, protein turnover, chaperones].
Length = 823
Score = 29.1 bits (65), Expect = 7.5
Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 3/80 (3%)
Query: 169 GEDKEEEEPVETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETL 228
DK +E E + VE+ K DL E V+L S + ET E + + EE +
Sbjct: 574 IYDKLVKEFEELLVLVEMKKTDVDLVSEQ--VRLLREESSPSSWLKLET-EIDTKREEQV 630
Query: 229 DEEGRARLKLKVENTIRWRD 248
+EEG+ V + W +
Sbjct: 631 EEEGQMNFNDAVVISCEWEE 650
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 28.5 bits (63), Expect = 8.4
Identities = 12/59 (20%), Positives = 22/59 (37%)
Query: 118 DDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEE 176
++D E + +E+ ++ +A + + D E E ED E EE
Sbjct: 273 SEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEE 331
>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
Length = 639
Score = 28.7 bits (64), Expect = 8.6
Identities = 16/68 (23%), Positives = 34/68 (50%)
Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
+TDDD + + ++ + + + D+ ++ S+ S E E + +T E +E+
Sbjct: 105 ATDDDTHSQTDDIIKQVTQKAGQDDSDQQEKNSQSQSASQAESLEHVQQSAQTSEKVDEK 164
Query: 176 EPVETRID 183
EP+ T+ D
Sbjct: 165 EPLLTKTD 172
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 28.7 bits (64), Expect = 8.7
Identities = 12/37 (32%), Positives = 13/37 (35%)
Query: 138 RRERDEDARSDASRRGSDREDDEDKENRTETGEDKEE 174
RR RD D RR +R ED R D
Sbjct: 26 RRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRS 62
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 27.8 bits (62), Expect = 9.6
Identities = 17/71 (23%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
+DD + +++ V+EE ++++ ++ S+ E++E+ E ++ E KE E
Sbjct: 53 KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENE-KETE 111
Query: 176 EPVETRIDVEI 186
E E+ ++ EI
Sbjct: 112 EKTESNVEKEI 122
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.130 0.356
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,650,273
Number of extensions: 2498522
Number of successful extensions: 3815
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3255
Number of HSP's successfully gapped: 241
Length of query: 481
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 380
Effective length of database: 6,457,848
Effective search space: 2453982240
Effective search space used: 2453982240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 61 (27.3 bits)