RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6522
(1407 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 176 bits (449), Expect = 3e-49
Identities = 95/320 (29%), Positives = 148/320 (46%), Gaps = 37/320 (11%)
Query: 202 GHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDGKLLATLRGCSGEISDIAIDNRNILL 261
GH V CV F G+++ TG+ D IK+W G+LL TL+G +G + D+A L
Sbjct: 7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYL 66
Query: 262 AAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWKY 321
A+G+ DKTIR+W+L+T + L GH ++ V F P L+S+S D I W
Sbjct: 67 ASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG----RILSSSSRDKTIKVW-- 120
Query: 322 KLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPGGLFLATGSGDHHVRVYKMDGVDSP 381
D++ K + +R + +FSP G F+A+ S D ++++ +
Sbjct: 121 --DVETGKCLT--------TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRT-GKC 169
Query: 382 LGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQWRFIRLDMNTCLPHTKE 441
+ L H+ +V+S+ +S + +S S DGT +W + + T H
Sbjct: 170 VATL--TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKC------LGTLRGHENG 221
Query: 442 TSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESH 501
V VA+ + + + + I++WD G+ VQ L GHTN V L
Sbjct: 222 ----------VNSVAFSPDGYLLASG-SEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS 270
Query: 502 PFDSRVLLSAGHDGLIIIWD 521
P D + L S DG I IWD
Sbjct: 271 P-DGKRLASGSADGTIRIWD 289
Score = 142 bits (360), Expect = 2e-37
Identities = 83/326 (25%), Positives = 135/326 (41%), Gaps = 58/326 (17%)
Query: 238 LLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFC 297
L TL+G +G ++ +A LLA G+ D TI++W+L+T + L GH G + V
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60
Query: 298 PLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPG 357
YLAS S+D I W + + H +++ +FSP
Sbjct: 61 A----DGTYLASGSSDKTIRLWDLETGECVRTLTGHT------------SYVSSVAFSPD 104
Query: 358 GLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALI 417
G L++ S D ++V+ V++ + + H+D V+S+ +S S S+DGT +
Sbjct: 105 GRILSSSSRDKTIKVWD---VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
Query: 418 WYYKCSQWRFIRLDMNTCLP------HTKETSDETNKKIKVTMVAWDASDRWVITAINFN 471
W D+ T HT E V VA+ ++++ +
Sbjct: 162 W------------DLRTGKCVATLTGHTGE----------VNSVAFSPDGEKLLSSSSDG 199
Query: 472 FQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQRGYK 531
IK+WD G + L+GH N V + P D +L S DG I +WD+ + +
Sbjct: 200 -TIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVWDLRTGECVQ-- 255
Query: 532 WFHNIVEGQG-EGALFDGKWSPDGTT 556
G ++ WSPDG
Sbjct: 256 ------TLSGHTNSVTSLAWSPDGKR 275
Score = 133 bits (338), Expect = 2e-34
Identities = 71/293 (24%), Positives = 118/293 (40%), Gaps = 44/293 (15%)
Query: 190 KYSNLRQHYVNLGHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDGKLLATLRGCSGEI 249
GH V V G + +G+ D I+LW G+ + TL G + +
Sbjct: 37 DLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYV 96
Query: 250 SDIAIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNFCPLEVNGFNYLAS 309
S +A +L++ + DKTI++W+++T ++ L GH + V F P ++AS
Sbjct: 97 SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDG----TFVAS 152
Query: 310 TSTDGCIGFWKYKLDIDISKTV----FHEPVMFNERIRPGNAHILCSSFSPGGLFLATGS 365
+S DG I W D+ K V H + +FSP G L + S
Sbjct: 153 SSQDGTIKLW----DLRTGKCVATLTGHTGE------------VNSVAFSPDGEKLLSSS 196
Query: 366 GDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQW 425
D ++++ + LG L H + V+S+ +S SGS DGT +W
Sbjct: 197 SDGTIKLWDLST-GKCLGTLRG--HENGVNSVAFSPDGYLLASGSEDGTIRVW------- 246
Query: 426 RFIRLDMNTCLPHTKETSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWD 478
L C+ ++ VT +AW + + + + I+IWD
Sbjct: 247 ---DLRTGECVQTLSGHTNS------VTSLAWSPDGKRLASGS-ADGTIRIWD 289
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 124 bits (312), Expect = 8e-30
Identities = 98/433 (22%), Positives = 171/433 (39%), Gaps = 61/433 (14%)
Query: 143 LSRSLMISSHKTSLLVPNSVKVHNTVNVLRSREYSGPLSRFLM---ISSHKYSNLRQHYV 199
L + SS S L+ S + ++ S L+ S +
Sbjct: 1 LVDNSSTSSENKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSLL 60
Query: 200 NLGHLSAVFCVLFDKLGEVIVTGADDLLIKLWRARDG-KLLATLRGCSG----EISDIAI 254
GH ++ + F GE++++G+ D IKLW +G KL+ +L G +++ +
Sbjct: 61 LRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSP 120
Query: 255 DNRNILLAAGTVDKTIRIWNLQTLA-PISVLVGHQGIITGVNFCPLEVNGFNYLASTSTD 313
D +ILLA+ ++D T+++W+L T I L GH +T + F P +G + +S D
Sbjct: 121 DGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSP---DGKLLASGSSLD 177
Query: 314 GCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCS--------------------- 352
G I W + +S H + + P ++ S
Sbjct: 178 GTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRST 237
Query: 353 ----------SFSPGGLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHS 402
SFSP G LA+GS D +R++ + S L L HS V S+ +S
Sbjct: 238 LSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSG--HSSSVLSVAFSPD 295
Query: 403 HLRFVSGSRDGTALIWYYKCSQWRFIRLDMNTCLPHTKETSDETNKKIKVTMVAWDASDR 462
SGS DGT +W + + + V+ +++
Sbjct: 296 GKLLASGSSDGTVRLW--------------DLETGKLLSSLTLKGHEGPVSSLSFSPDGS 341
Query: 463 WVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDI 522
+++ + + I++WD G ++ L+GH+N + V S D RV+ S DG + +WD+
Sbjct: 342 LLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSV--SFSPDGRVVSSGSTDGTVRLWDL 399
Query: 523 LSSKQRGYKWFHN 535
+ H
Sbjct: 400 STGSLLRNLDGHT 412
Score = 89.4 bits (220), Expect = 3e-18
Identities = 61/289 (21%), Positives = 110/289 (38%), Gaps = 31/289 (10%)
Query: 295 NFCPLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSF 354
+ +N + L+ S++ I D +S + +R I +F
Sbjct: 19 SELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLS-----SLLLRGHEDSITSIAF 73
Query: 355 SPGGLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSH---SHLRFVSGSR 411
SP G L +GS D ++++ +D + + LE H V + S + + S S
Sbjct: 74 SPDGELLLSGSSDGTIKLWDLDNGEKLIKSLE-GLHDSSVSKLALSSPDGNSILLASSSL 132
Query: 412 DGTALIWYYKCSQWRFIRLDMNTCLPHTKETSDETNKKIKVTMVAWDASDRWVITAINFN 471
DGT +W + T H++ VT +A+ + + + + +
Sbjct: 133 DGTVKLW-----DLSTPGKLIRTLEGHSES----------VTSLAFSPDGKLLASGSSLD 177
Query: 472 FQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQRGYK 531
IK+WD G + L GHT+ V L P ++ S DG I +WD+ + K
Sbjct: 178 GTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTG-----K 232
Query: 532 WFHNIVEGQGEGALFDGKWSPDGTTCALTDSYGHLLVYGLGGFKRPQNI 580
+ + G + + +SPDG+ A S G + ++ L
Sbjct: 233 LLRSTLSGHSDSVV--SSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRT 279
Score = 88.6 bits (218), Expect = 5e-18
Identities = 81/374 (21%), Positives = 137/374 (36%), Gaps = 47/374 (12%)
Query: 132 NVLRSREYSGPLSRSLMISSHKTSLLVPNSVKVHNTVNVLRSREYSGPLSRFLMISSHKY 191
++ + S P + H S+ S+ +L S L + + +
Sbjct: 134 GTVKLWDLSTPGKLIRTLEGHSESV---TSLAFSPDGKLLAS---GSSLDGTIKLWDLRT 187
Query: 192 SNLRQHYVNLGHLSAVFCVLFDKLGE-VIVTGADDLLIKLWRARDGKLLATLRGCSGEIS 250
GH V + F G +I +G+ D I+LW GKLL + +
Sbjct: 188 GKPLS--TLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV 245
Query: 251 DIAIDNRNILLAAGTVDKTIRIWNLQTLA-PISVLVGHQGIITGVNFCPLEVNGFNYLAS 309
+ LLA+G+ D TIR+W+L++ + + L GH + V F P LAS
Sbjct: 246 VSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDG----KLLAS 301
Query: 310 TSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGNAHILCSSFSPGGLFLATG-SGDH 368
S+DG + W D++ K + + +E + SFSP G L +G S D
Sbjct: 302 GSSDGTVRLW----DLETGKLLSSLTLKGHEGP------VSSLSFSPDGSLLVSGGSDDG 351
Query: 369 HVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIWYYKCSQWRFI 428
+R++ + + E V S+ +S SGS DGT +W
Sbjct: 352 TIRLWDLRTGKPLKTL----EGHSNVLSVSFSPDGRVVSSGSTDGTVRLW---------- 397
Query: 429 RLDMNTCLPHTKETSDETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVL 488
L + +VT + + + + + + N I++WD L V
Sbjct: 398 ------DLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNT-IRLWDLKT-SLKSVS 449
Query: 489 KGHTNEVFVLESHP 502
+V +S
Sbjct: 450 FSPDGKVLASKSSD 463
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 44.6 bits (106), Expect = 3e-06
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 481 NGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWD 521
+G+L++ LKGHT V + P D + L S DG I +WD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
Score = 41.5 bits (98), Expect = 5e-05
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 235 DGKLLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWN 274
G+LL TL+G +G ++ +A LA+G+ D TI++W+
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 40.0 bits (94), Expect = 1e-04
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 384 ILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIW 418
+ ++ H+ V S+ +S SGS DGT +W
Sbjct: 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 38.1 bits (89), Expect = 9e-04
Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 4/44 (9%)
Query: 277 TLAPISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWK 320
+ + L GH G +T V F P +G YLAS S DG I W
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSP---DG-KYLASGSDDGTIKLWD 40
Score = 35.8 bits (83), Expect = 0.005
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 202 GHLSAVFCVLFDKLGEVIVTGADDLLIKLW 231
GH V V F G+ + +G+DD IKLW
Sbjct: 10 GHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 35.4 bits (82), Expect = 0.006
Identities = 9/29 (31%), Positives = 17/29 (58%)
Query: 346 NAHILCSSFSPGGLFLATGSGDHHVRVYK 374
+ +FSP G +LA+GS D ++++
Sbjct: 12 TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 43.9 bits (104), Expect = 7e-06
Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 482 GDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWD 521
G L++ LKGHT V + P D +L S DG + +WD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSP-DGNLLASGSDDGTVRVWD 39
Score = 41.2 bits (97), Expect = 5e-05
Identities = 17/39 (43%), Positives = 27/39 (69%)
Query: 236 GKLLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWN 274
GKLL TL+G +G ++ +A LLA+G+ D T+R+W+
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 40.8 bits (96), Expect = 9e-05
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 384 ILEVEEHSDKVDSIQWSHSHLRFVSGSRDGTALIW 418
+ ++ H+ V S+ +S SGS DGT +W
Sbjct: 4 LRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 35.8 bits (83), Expect = 0.004
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 202 GHLSAVFCVLFDKLGEVIVTGADDLLIKLW 231
GH V V F G ++ +G+DD +++W
Sbjct: 9 GHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 35.8 bits (83), Expect = 0.005
Identities = 16/41 (39%), Positives = 19/41 (46%), Gaps = 4/41 (9%)
Query: 280 PISVLVGHQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWK 320
+ L GH G +T V F P N LAS S DG + W
Sbjct: 3 LLRTLKGHTGPVTSVAFSPDG----NLLASGSDDGTVRVWD 39
Score = 34.3 bits (79), Expect = 0.017
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 346 NAHILCSSFSPGGLFLATGSGDHHVRVYK 374
+ +FSP G LA+GS D VRV+
Sbjct: 11 TGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This protein,
which interacts with both microtubules and TRAF3 (tumour
necrosis factor receptor-associated factor 3), is
conserved from worms to humans. The N-terminal region is
the microtubule binding domain and is well-conserved; the
C-terminal 100 residues, also well-conserved, constitute
the coiled-coil region which binds to TRAF3. The central
region of the protein is rich in lysine and glutamic acid
and carries KKE motifs which may also be necessary for
tubulin-binding, but this region is the least
well-conserved.
Length = 506
Score = 46.8 bits (111), Expect = 7e-05
Identities = 29/183 (15%), Positives = 65/183 (35%), Gaps = 15/183 (8%)
Query: 1215 QAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEG---EEEEESNDQSEEEEVE 1271
+ + G ++T+ + SG+ ++ + ++E + + +EE D+ +EE +
Sbjct: 79 RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Query: 1272 E--------NSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKR 1323
E +E++ EE R +R V R P ++ + + + ++
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQ 198
Query: 1324 TNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETESEEEES 1383
+ D + ++EE G R T + + SSE S
Sbjct: 199 AAREAVKG----KPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLK 254
Query: 1384 QAA 1386
+
Sbjct: 255 KPD 257
Score = 36.8 bits (85), Expect = 0.087
Identities = 28/173 (16%), Positives = 61/173 (35%), Gaps = 3/173 (1%)
Query: 1221 QTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
+ + ++G + K + + E E E E +EE + ++E+ +E ++ + +
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEE---KKKKKEKPKEEPKDRKPK 134
Query: 1281 EAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDS 1340
E +RP + K K+ P + VR R K+ ++ + +
Sbjct: 135 EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEE 194
Query: 1341 DSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRR 1393
++E + + + T S EE ++ Q S R
Sbjct: 195 KQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISR 247
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 44.1 bits (105), Expect = 5e-04
Identities = 36/242 (14%), Positives = 68/242 (28%), Gaps = 13/242 (5%)
Query: 1144 SGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYAM 1203
G ++ ++ + R R+ Q + + A
Sbjct: 60 GGGAAAAAATPAAPAAAARRAARAAAAARQA--------EQPAAEAAAAKAEAAPAARAA 111
Query: 1204 RARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESND 1263
A ++ + +A + + + + K G D E EEEE ++
Sbjct: 112 AAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDE 171
Query: 1264 QSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKR 1323
+ + E+ E E E R R + R R R
Sbjct: 172 RRRRGDREDRQAEAERGERGRREERGRDGD--DRDRRDRREQGDRREERGRRDGGDRRGR 229
Query: 1324 TNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARR---TRTKGQSYAVHSSETETESEE 1380
R + + + + + E G RR RR +G+ +E E E E
Sbjct: 230 RRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELRE 289
Query: 1381 EE 1382
++
Sbjct: 290 DD 291
Score = 38.3 bits (90), Expect = 0.033
Identities = 21/141 (14%), Positives = 37/141 (26%), Gaps = 4/141 (2%)
Query: 1133 EENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDV 1192
++E + E + R +A R R G + R + +
Sbjct: 152 PATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQ 211
Query: 1193 SDYDSGDRYAMRA-RQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESED 1251
D R+ ++ + + RG R + G RGR+ D
Sbjct: 212 GDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRR---FRD 268
Query: 1252 EVEGEEEEESNDQSEEEEVEE 1272
E E+ E
Sbjct: 269 RDRRGRRGGDGGNEREPELRE 289
Score = 34.9 bits (81), Expect = 0.32
Identities = 23/197 (11%), Positives = 44/197 (22%), Gaps = 15/197 (7%)
Query: 1202 AMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEES 1261
A++ + + A R R+A + E + E +
Sbjct: 54 AIKEARGGGAAAAAATPAAPAAAARRAARA--------AAAARQAEQPAAEAAAAKAEAA 105
Query: 1262 NDQSEEEEVE-ENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGR 1320
E + E +A RR A R + + R+
Sbjct: 106 PAARAAAAAAAEAASAPEAAQARERRERGEAA-RRGAARKAGEGGEQPATEARADAAERT 164
Query: 1321 AKRTNGRQSSSLLMTCSGDSD--SEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETES 1378
+ + E +EE R + Q E
Sbjct: 165 EEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQG---DRREERGRR 221
Query: 1379 EEEESQAAQVSFSRRGR 1395
+ + + + RR
Sbjct: 222 DGGDRRGRRRRRDRRDA 238
Score = 33.7 bits (78), Expect = 0.86
Identities = 15/149 (10%), Positives = 39/149 (26%), Gaps = 11/149 (7%)
Query: 1143 ESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYA 1202
+++ ++E S E + R R + + +D
Sbjct: 108 ARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEE 167
Query: 1203 MRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESN 1262
R + ++ + RG++ RGR + + E+ +
Sbjct: 168 ERDERRRRGDREDRQAEAE----RGERGRR-------EERGRDGDDRDRRDRREQGDRRE 216
Query: 1263 DQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
++ + + + R R
Sbjct: 217 ERGRRDGGDRRGRRRRRDRRDARGDDNRE 245
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 44.3 bits (105), Expect = 5e-04
Identities = 53/237 (22%), Positives = 90/237 (37%), Gaps = 20/237 (8%)
Query: 1141 SSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDR 1200
S + Q + R R+ PR+ NR + R R+N + ++ +
Sbjct: 581 SGGEETKPQEQPAPKAEAKPERQQDRRK--PRQNNRRDRNERRDTRDNRTRREGRENREE 638
Query: 1201 YAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEE 1260
R R+ Q Q+ + Q VT + R + + R R R+ DE ++E
Sbjct: 639 -NRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRR---RNDEKRQAQQEA 694
Query: 1261 SNDQSEEEEVEENSEEEEEEEAVPRRRP------VRRANVAKVKRAVPPAPDTSTSRLRS 1314
EE+ V+E +EE ++ PRR+ VR + P +T +
Sbjct: 695 KALNVEEQSVQETEQEERVQQVQPRRKQRQLNQKVRIEQSVAEEAVAPVVEETVAA--EP 752
Query: 1315 VRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQE--EEQYGSRRARRT----RTKGQ 1365
V Q A RT + ++ + E+N+ E + RR+RR+ R GQ
Sbjct: 753 VVQEVPAPRTELVKVPLPVVAQTAPEQDEENNAENRDNNGMPRRSRRSPRHLRVSGQ 809
Score = 39.6 bits (93), Expect = 0.013
Identities = 33/154 (21%), Positives = 57/154 (37%), Gaps = 13/154 (8%)
Query: 1258 EEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQ 1317
EE+ Q + E E +++ PR+ R N + R + +R + R
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRN 642
Query: 1318 NGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSY-------AVH 1370
+A++ S + + ++D Q++ R+ RR K Q+
Sbjct: 643 RRQAQQQTAETRESQQAEVTEKARTQD-EQQQAPRRERQRRRNDEKRQAQQEAKALNVEE 701
Query: 1371 SSETETESEEEESQAAQVSFSRRGRMRKPTAKLR 1404
S ETE EE Q RR + R+ K+R
Sbjct: 702 QSVQETEQEERVQQVQP----RRKQ-RQLNQKVR 730
Score = 30.8 bits (70), Expect = 7.5
Identities = 12/58 (20%), Positives = 17/58 (29%), Gaps = 1/58 (1%)
Query: 1249 SEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPD 1306
+E EV + + E + E E A N A PAP+
Sbjct: 971 AEPEVVAQPAAPVVAEVAAEVETVTAVEPEVAPAQV-PEATVEHNHATAPMTRAPAPE 1027
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 44.2 bits (104), Expect = 6e-04
Identities = 25/55 (45%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVA 1295
G G +SE+E EEEEE ++ EEEE EE EEEE EE + P R A
Sbjct: 855 GGGSDGGDSEEE---EEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQA 906
Score = 43.4 bits (102), Expect = 0.001
Identities = 20/40 (50%), Positives = 24/40 (60%)
Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+G D + EEEEE ++ EEEE EE EEEEEEE
Sbjct: 850 KGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 41.9 bits (98), Expect = 0.003
Identities = 18/47 (38%), Positives = 24/47 (51%)
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
K + +G + EE ++ EEEE EE EEEEEEE P+
Sbjct: 848 DEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894
Score = 40.7 bits (95), Expect = 0.006
Identities = 18/37 (48%), Positives = 22/37 (59%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+A + E V+G + D EEEE EE EEEEEEE
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEE 880
Score = 32.7 bits (74), Expect = 1.8
Identities = 17/68 (25%), Positives = 27/68 (39%)
Query: 1215 QAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENS 1274
+A G+ R ++ + G E E E +GE E E +E + +E
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696
Query: 1275 EEEEEEEA 1282
E E +EA
Sbjct: 697 GEIEAKEA 704
Score = 32.3 bits (73), Expect = 2.7
Identities = 28/141 (19%), Positives = 47/141 (33%), Gaps = 4/141 (2%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDT 1307
E E EGE EES ++E+E E E E E +P R+ D
Sbjct: 650 ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAE---RKGEQEGEGEIEAKEADH 706
Query: 1308 STSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARR-TRTKGQS 1366
++ G + + T + ED + E + R + +
Sbjct: 707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEH 766
Query: 1367 YAVHSSETETESEEEESQAAQ 1387
+E + + +E E QA +
Sbjct: 767 EGETEAEGKEDEDEGEIQAGE 787
Score = 30.7 bits (69), Expect = 6.9
Identities = 16/87 (18%), Positives = 30/87 (34%), Gaps = 1/87 (1%)
Query: 1194 DYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEV 1253
D GD + + + + + G ++ + +G E E
Sbjct: 629 DLSKGDVAEAEHTGERTGEEGERPTEAEGENGEES-GGEAEQEGETETKGENESEGEIPA 687
Query: 1254 EGEEEEESNDQSEEEEVEENSEEEEEE 1280
E + E+E + E +E + E E EE
Sbjct: 688 ERKGEQEGEGEIEAKEADHKGETEAEE 714
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This family
includes the radial spoke head proteins RSP4 and RSP6
from Chlamydomonas reinhardtii, and several eukaryotic
homologues, including mammalian RSHL1, the protein
product of a familial ciliary dyskinesia candidate gene.
Length = 481
Score = 42.0 bits (99), Expect = 0.002
Identities = 29/101 (28%), Positives = 43/101 (42%), Gaps = 16/101 (15%)
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRA 1300
GR + + E E+E+E EEEE EE EE E EE P P+ + A +
Sbjct: 337 GRCTWVNPEQKDEEEEQEDE-----EEEEEEEEPEEPEPEEGPPLLTPI--SEDAPLPND 389
Query: 1301 VPP---------APDTSTSRLRSVRQNGRAKRTNGRQSSSL 1332
P +P + + LRS R G NG++ ++
Sbjct: 390 DPAWTFRLSSSLSPKYAVAVLRSNRWPGAYTYANGKKFENI 430
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 41.4 bits (97), Expect = 0.003
Identities = 36/124 (29%), Positives = 53/124 (42%), Gaps = 18/124 (14%)
Query: 1180 SLGRQYRENPS-DVSDYDSG-DRYAMRA-------RQSKQVQSVQAGR-RGQTIVTSRTR 1229
SL R+ R N VS + +YAM+ R+ +Q + Q R + +
Sbjct: 368 SLARKLRSNREVAVSRLEEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASST 427
Query: 1230 GKQSYNKFSGRGRGRKAPESEDEVEGE--------EEEESNDQSEEEEVEENSEEEEEEE 1281
+S + S ++ E E+E E E EEEE D+ EEEEVE ++ EEE E
Sbjct: 428 SGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Query: 1282 AVPR 1285
Sbjct: 488 GSSE 491
Score = 32.2 bits (73), Expect = 2.6
Identities = 41/263 (15%), Positives = 80/263 (30%), Gaps = 49/263 (18%)
Query: 1129 SSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYREN 1188
+S + S S +S ES EE+ E+E+ + E
Sbjct: 424 ASSTSGESPSMASQESEEEESVEEEEEEEEE-----------------------EEEEEQ 460
Query: 1189 PSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRT----RGKQSYNKFSGRGRGR 1244
S+ + + + + + ++ G R S G+
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQ 520
Query: 1245 ------KAPESEDEVEGE----EEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANV 1294
PES E + + E ++S+EE + E S E PV
Sbjct: 521 QPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVETK-- 578
Query: 1295 AKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGS 1354
R +PP P + + +N A T+ ++ + S + ++ +
Sbjct: 579 ISSSRKLPPPPVS------TSLENDSATVTSTTRNGN----VSPHTPQDEQPPSGRKRKR 628
Query: 1355 RRARRTRTKGQSYAVHSSETETE 1377
+ + G Y H + +E +
Sbjct: 629 KEEVESEPLGNQYLRHHNGSEKD 651
Score = 30.7 bits (69), Expect = 6.7
Identities = 34/169 (20%), Positives = 50/169 (29%), Gaps = 15/169 (8%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEEN------SEEEEEEEAVPRRRPVRRANVAKVK 1298
+ E EDE E EE E N EE E EE+ E + R + +
Sbjct: 463 EEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQP 522
Query: 1299 RA---VPPAPDTSTSRLRSV--RQNGRAKRTNGRQSSSLL---MTCSGDSDSEDNSQEEE 1350
R P +P + S+ G S L G + +
Sbjct: 523 RGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVETKISSS 582
Query: 1351 QYGSRRARRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRRGRMRKP 1399
+ T + S A +S T + + + S R R RK
Sbjct: 583 RKLPPPPVSTSLENDS-ATVTSTTRNGNVSPHTPQDEQPPSGRKRKRKE 630
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family are
designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 40.1 bits (94), Expect = 0.005
Identities = 31/155 (20%), Positives = 53/155 (34%), Gaps = 16/155 (10%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVP--PAP 1305
E E+E E EE + D SE++E E + EEE E+E R ++ V P
Sbjct: 44 EIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKK 103
Query: 1306 DTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQ 1365
+ +S + + + S R++ R+ T
Sbjct: 104 KKDPTAAKSPKAAAPRPKKKSERISWA--------------PTLLDSPRRKSSRSSTVQN 149
Query: 1366 SYAVHSSETETESEEEESQAAQVSFSRRGRMRKPT 1400
A H E E ++ QA + + ++ T
Sbjct: 150 KEATHERLKEREIRRKKIQAKARKRKEKKKEKELT 184
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus spores
are protected by a protein shell consisting of over 50
different polypeptides, known as the coat. This family of
proteins has an important morphogenetic role in coat
assembly, it is involved in the assembly of at least 5
different coat proteins including CotB, CotG, CotS, CotSA
and CotW. It is likely to act at a late stage of coat
assembly.
Length = 185
Score = 38.3 bits (89), Expect = 0.010
Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVA 1295
PE ED E +E+EE + E+EE E+ +++E + R++P + N+
Sbjct: 70 AGEPEREDIAEQQEKEEIAQEEEKEEEAED-VKQQEVFSFKRKKPFKEMNLE 120
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with merozoites)
proteins. Variation among SPAM alleles is the result of
deletions and amino acid substitutions in non-repetitive
sequences within and flanking the alanine heptad-repeat
domain. Heptad repeats in which the a and d position
contain hydrophobic residues generate amphipathic
alpha-helices which give rise to helical bundles or
coiled-coil structures in proteins. SPAM is an example of
a P. falciparum antigen in which a repetitive sequence
has features characteristic of a well-defined structural
element.
Length = 164
Score = 37.5 bits (87), Expect = 0.018
Identities = 18/34 (52%), Positives = 24/34 (70%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E E+E + EE EE D +EEE+ E+ EEEEE+E
Sbjct: 52 EEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDE 85
Score = 36.0 bits (83), Expect = 0.046
Identities = 16/35 (45%), Positives = 23/35 (65%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ +DE E EE+EE ++ E+ E EE E+EEEE
Sbjct: 47 KQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEE 81
Score = 35.2 bits (81), Expect = 0.084
Identities = 16/37 (43%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 1248 ESEDEV-EGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E+ED E +E++E ++ +EEE+EE + E+EEE V
Sbjct: 39 ENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIV 75
Score = 32.5 bits (74), Expect = 0.71
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E E+E+E E+ E ++ E+E EE +EE+ +
Sbjct: 57 EDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVD 90
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 38.9 bits (90), Expect = 0.022
Identities = 48/226 (21%), Positives = 70/226 (30%), Gaps = 25/226 (11%)
Query: 1138 SDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDS 1197
S+ +G Q+ E D + + NR +L E SDYD
Sbjct: 294 SNKIIVNGQYEQTIREIFADRATKLELDLQTVFESNMNR--DTLDEYAPEGEDLRSDYDE 351
Query: 1198 GDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSY------------NKFSGRGRGRK 1245
Y GR + + +G Y + +
Sbjct: 352 DFEYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMS 411
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVP-PA 1304
A + +D E + EE + D+ + E EE EE + R R K R P A
Sbjct: 412 AIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDME--KEDREFPDEA 469
Query: 1305 PDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEE 1350
+ R K G ++ L TCS SD +D S EE
Sbjct: 470 ELQPSESAIE-----RYKEYRGLRN---LYTCSWRSDEKDPSFPEE 507
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 38.6 bits (90), Expect = 0.023
Identities = 16/44 (36%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVR 1290
P+ E+E E EE++E + ++ EE EEEE+EE + + V+
Sbjct: 36 PDEEEEEEKEEKKEEEE-KTTDKEEEVDEEEEKEEKKKKTKKVK 78
Score = 33.2 bits (76), Expect = 1.2
Identities = 17/51 (33%), Positives = 28/51 (54%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVK 1298
E E E E +EEE ++ E++E EE + ++EEE + ++ KVK
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVK 78
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 38.5 bits (89), Expect = 0.025
Identities = 19/78 (24%), Positives = 38/78 (48%)
Query: 445 ETNKKIKVTMVAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFD 504
E + K++ + W++ + + + NF +++WD LV +K H V+ ++ D
Sbjct: 528 ELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSAD 587
Query: 505 SRVLLSAGHDGLIIIWDI 522
+L S DG + +W I
Sbjct: 588 PTLLASGSDDGSVKLWSI 605
Score = 36.2 bits (83), Expect = 0.16
Identities = 59/253 (23%), Positives = 105/253 (41%), Gaps = 38/253 (15%)
Query: 180 LSRFLMISSHKY-SNLRQHYVNLGHLSAVFCVL-FDKLGEVIVTGADDLLIKLWRA---- 233
L ++L S + ++L+Q +L + S + C + FD+ GE T + IK++
Sbjct: 459 LCKYLSFSKLRVKADLKQG--DLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESII 516
Query: 234 RDGK-------LLATLRGCSGEISDIAIDNRNILLAAGTVDKTIRIWNLQTLAPISVLVG 286
+DG+ LA+ SG + I ++ +A+ + +++W++ ++ +
Sbjct: 517 KDGRDIHYPVVELASRSKLSGICWNSYIKSQ---VASSNFEGVVQVWDVARSQLVTEMKE 573
Query: 287 HQGIITGVNFCPLEVNGFNYLASTSTDGCIGFWKYKLDIDISKTVFHEPVMFNERIRPGN 346
H+ + +++ + LAS S DG + W + I
Sbjct: 574 HEKRVWSIDYSSADPT---LLASGSDDGSVKLWSINQGVSIGTI-------------KTK 617
Query: 347 AHILCSSF-SPGGLFLATGSGDHHVRVYKMDGVDSPLGILEVEEHSDKVDSIQWSHSHLR 405
A+I C F S G LA GS DH V Y + PL + HS V +++ S
Sbjct: 618 ANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTM--IGHSKTVSYVRFVDSS-T 674
Query: 406 FVSGSRDGTALIW 418
VS S D T +W
Sbjct: 675 LVSSSTDNTLKLW 687
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 38.7 bits (90), Expect = 0.026
Identities = 32/175 (18%), Positives = 55/175 (31%), Gaps = 38/175 (21%)
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVP 1302
R A + +S+D ++ VE + E +++ R+ A K A P
Sbjct: 1295 RRAAARKKPLASVSVISDSDDDDDDFAVEVSLAER------LKKKGGRKPAAANKKAAKP 1348
Query: 1303 PA-------------PDTSTSRLRSVRQNG----------RAKRTNGRQSSSLLMTCSGD 1339
PA T L+ G RA N + S L +
Sbjct: 1349 PAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNK 1408
Query: 1340 ----SDSEDNSQEEEQYG-----SRRARRTRTKGQSYAVHSSETETESEEEESQA 1385
S++ S E+ R +R K +Y + SE+E+ + +
Sbjct: 1409 ETESSENVSGSSSSEKDEIDVSAKPRPQRANRKQTTYVLSDSESESADDSDFDDD 1463
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 38.7 bits (90), Expect = 0.026
Identities = 16/34 (47%), Positives = 20/34 (58%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ED G EE E +D+SEE+E E E EE E
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGE 276
Score = 32.1 bits (73), Expect = 2.3
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E DE +G++++ N++ E EE+ +E EE
Sbjct: 229 EDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDE 264
Score = 31.7 bits (72), Expect = 3.2
Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVE--ENSEEEEEEE 1281
GR+ E DE E +E E ++ + EE E++ E SE+ E +E
Sbjct: 250 GREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDE 290
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family. Prothymosin
alpha and parathymosin are two ubiquitous small acidic
nuclear proteins that are thought to be involved in cell
cycle progression, proliferation, and cell
differentiation.
Length = 106
Score = 35.7 bits (82), Expect = 0.027
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
A E +DE+E EEE + +D+ EE E EE EEEEE
Sbjct: 44 AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEET 79
Score = 32.2 bits (73), Expect = 0.40
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
G+ A +E+E G +E + + EEE E++ EEE E E
Sbjct: 29 GKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEE 70
Score = 32.2 bits (73), Expect = 0.49
Identities = 13/40 (32%), Positives = 25/40 (62%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRR 1287
+ E+E EGEEEE ++ E + + E+EE++A +++
Sbjct: 61 DEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQ 100
Score = 31.5 bits (71), Expect = 0.84
Identities = 14/34 (41%), Positives = 17/34 (50%)
Query: 1249 SEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
E + E EEE ++EE E EEEE EE
Sbjct: 43 GAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEE 76
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 36.5 bits (85), Expect = 0.027
Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 8/61 (13%)
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRA 1300
G +K E E+E E E EE ++ +E +E+ + + E+ RR N K K
Sbjct: 95 GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREK--------RRENERKQKEI 146
Query: 1301 V 1301
+
Sbjct: 147 L 147
Score = 29.2 bits (66), Expect = 9.6
Identities = 13/46 (28%), Positives = 24/46 (52%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVR 1290
+ E E EVE +EEE D+ E+E+ + E+ E ++ ++
Sbjct: 103 EEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of vertebrates.
The function is not known.
Length = 172
Score = 36.8 bits (85), Expect = 0.033
Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 7/79 (8%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPD 1306
EDE E E+ E+S D EE+ E EE EE ++ + P
Sbjct: 30 EVEEDEEEMEDWEDSLD-------EEDEEAEEVEEETAASSKAPSSSSKSSSQETISIPP 82
Query: 1307 TSTSRLRSVRQNGRAKRTN 1325
T +R S R + + N
Sbjct: 83 TPPARRPSRRWDQTGRFPN 101
>gnl|CDD|99958 cd05529, Bromo_WDR9_I_like, Bromodomain; WDR9 repeat I_like
subfamily. WDR9 is a human gene located in the Down
Syndrome critical region-2 of chromosome 21. It encodes
for a nuclear protein containing WD40 repeats and two
bromodomains, which may function as a transcriptional
regulator involved in chromatin remodeling and play a
role in embryonic development. Bromodomains are 110 amino
acid long domains, that are found in many chromatin
associated proteins. Bromodomains can interact
specifically with acetylated lysine.
Length = 128
Score = 35.8 bits (83), Expect = 0.034
Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 11/54 (20%)
Query: 1086 LYHPRPDEWPQGD--------RDATCEKIITGLEEVMG---LSVAEPFLVPVDI 1128
LY+P EW D RD E++I+GL++++ L +AE F PVD+
Sbjct: 1 LYNPLSSEWELFDPGWEQPHIRDEERERLISGLDKLLLSLQLEIAEYFEYPVDL 54
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 37.2 bits (87), Expect = 0.038
Identities = 10/37 (27%), Positives = 21/37 (56%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
A + E E + D ++++ +E+ +E++EEEA
Sbjct: 35 AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71
Score = 34.5 bits (80), Expect = 0.25
Identities = 8/39 (20%), Positives = 18/39 (46%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVP 1284
A + E E + E+++ ++ E+E++EE
Sbjct: 34 AAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEAD 72
Score = 32.2 bits (74), Expect = 1.3
Identities = 8/36 (22%), Positives = 17/36 (47%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
P + ES E+ E +++ +E+E+E+
Sbjct: 32 PNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDD 67
Score = 31.0 bits (71), Expect = 3.7
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVP 1284
A + E E +EE+ +D ++E+ +E+ EEE + P
Sbjct: 39 ATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDP 77
Score = 30.6 bits (70), Expect = 4.2
Identities = 10/39 (25%), Positives = 18/39 (46%), Gaps = 2/39 (5%)
Query: 1246 APESEDEVEGEE--EEESNDQSEEEEVEENSEEEEEEEA 1282
A + E EE+ D +++E E+ +EEE +
Sbjct: 36 AAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLG 74
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain fusion
protein; Provisional.
Length = 131
Score = 35.7 bits (82), Expect = 0.046
Identities = 12/46 (26%), Positives = 22/46 (47%)
Query: 1237 FSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
++ GR K P + E EE+ +E+ +E +E + +EA
Sbjct: 52 WTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEA 97
Score = 33.0 bits (75), Expect = 0.35
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
A E+++E E +E + + E EE EEE+EE
Sbjct: 94 ADEADEEEETDEAVDETADEADAEAEEADEEEDEE 128
Score = 30.7 bits (69), Expect = 2.1
Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 5/42 (11%)
Query: 1246 APESEDEVEGEEEEESN-----DQSEEEEVEENSEEEEEEEA 1282
A E++ E +EEEE++ E + E ++EEE+EEA
Sbjct: 88 AAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEA 129
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 36.7 bits (85), Expect = 0.049
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E+++E EE E+ + EE E EE ++E
Sbjct: 28 KGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62
Score = 34.7 bits (80), Expect = 0.18
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
E + E E +EE + E+ E EE +EE E++
Sbjct: 14 EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDS 48
Score = 32.0 bits (73), Expect = 1.6
Identities = 9/34 (26%), Positives = 18/34 (52%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E++ EE+ E + +EE +E++ E E +
Sbjct: 23 NKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKI 56
Score = 31.6 bits (72), Expect = 2.3
Identities = 11/36 (30%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 1248 ESEDEVEGEEEE-ESNDQSEEEEVEENSEEEEEEEA 1282
+ E E +EE+ + E EE+E+ E+ EE+
Sbjct: 16 DCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEES 51
Score = 31.3 bits (71), Expect = 2.9
Identities = 11/34 (32%), Positives = 16/34 (47%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E + E + E E ++ E E E S E + EE
Sbjct: 25 EEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58
Score = 30.5 bits (69), Expect = 5.0
Identities = 7/35 (20%), Positives = 16/35 (45%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
+ EE+ ++++EE+ + + E EE
Sbjct: 6 KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIE 40
Score = 30.1 bits (68), Expect = 6.7
Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 7/41 (17%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEV-------EENSEEEEEEE 1281
E + E E E+EE + SEE +EN++ +EE +
Sbjct: 31 EEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENK 71
Score = 29.7 bits (67), Expect = 8.2
Identities = 13/34 (38%), Positives = 25/34 (73%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E+++E +G+EE+ ++ E+EE+ E+SEE E +
Sbjct: 22 ENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVK 55
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 37.7 bits (87), Expect = 0.053
Identities = 42/288 (14%), Positives = 95/288 (32%), Gaps = 26/288 (9%)
Query: 1125 PVDISSLSEEN---SHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRM---PPRRTNRVG 1178
I L++ SH D S S S+ S +D E+E+ + +
Sbjct: 85 KEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNES 144
Query: 1179 TSLGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFS 1238
S ++ ++ + + + Q+ + + ++ V + N+F
Sbjct: 145 ESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFL 204
Query: 1239 GRGRGRK--APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAK 1296
+ A ED+ E +++S D ++E+ +EE++EE
Sbjct: 205 EATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEE--------GNIEYED 256
Query: 1297 VKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRR 1356
+ + K +++ S + D +++ Q+EE+
Sbjct: 257 FFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAA 316
Query: 1357 ARRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRR-GRMRKPTAKL 1403
+ + E E + ES SF +R ++++ +L
Sbjct: 317 MDKVKL---------DEPVLEGVDLESPKELSSFEKRQAKLKQQIEQL 355
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction that
requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 35.8 bits (83), Expect = 0.054
Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
EDE + +EE+E EE++ E++ E+E EEE P ++
Sbjct: 111 EDESDDDEEDEE----EEDDEEDDDEDESEEEESPVKKVK 146
Score = 32.3 bits (74), Expect = 0.80
Identities = 14/38 (36%), Positives = 26/38 (68%)
Query: 1255 GEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRA 1292
EE+ES+D E+EE E++ E+++E+E+ PV++
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the maturation
of the 18S rRNA and for 40S ribosome production.
Length = 809
Score = 37.3 bits (87), Expect = 0.061
Identities = 13/34 (38%), Positives = 25/34 (73%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ +D E EE+ + +D+ E+EE E++ +E++EEE
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371
Score = 36.5 bits (85), Expect = 0.11
Identities = 13/35 (37%), Positives = 25/35 (71%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
+ +D++E EEE+ EE+E +E+S++E++EE
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371
Score = 35.0 bits (81), Expect = 0.36
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
E ED V+ E+EE+ +D EEEE + + +EEE+E
Sbjct: 324 EEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEE 359
Score = 34.6 bits (80), Expect = 0.48
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
+ E + ++ EEE+E ++ S++E+ EE EEE+E++
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKK 380
Score = 34.2 bits (79), Expect = 0.66
Identities = 16/45 (35%), Positives = 26/45 (57%)
Query: 1239 GRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
G G+G + E E++ +E+EE +D EEE E+ +EEE+
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEE 359
Score = 33.0 bits (76), Expect = 1.4
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E +D+ + EEEEE D S+EEE EE+ + ++E++
Sbjct: 335 EEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEE 370
Score = 31.1 bits (71), Expect = 5.8
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
+ ED+ + EEEE + +EE +E E+ ++E+
Sbjct: 334 DEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDD 368
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 37.7 bits (88), Expect = 0.062
Identities = 33/218 (15%), Positives = 75/218 (34%), Gaps = 2/218 (0%)
Query: 1062 PKEGKEATEPGGSLPVEPEEIQELLYHPRPDEWPQGDRDATCEKIITGLEEVMGLSVAEP 1121
PK K+ + S + ++ + R D + D + + E
Sbjct: 1172 PKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQ 1231
Query: 1122 FLVPVDISSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRR-TNRVGTS 1180
P S ++ ++SS S + + S +D E ++ R +
Sbjct: 1232 KTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKR 1291
Query: 1181 LGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGR 1240
+ S + + + + ++ ++ S+TR KQ+ S R
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSR 1351
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEE 1278
R + D ++++ D SE+E+ +E+ E+++
Sbjct: 1352 LLRRPRKKKSDSSSEDDDDSEVDDSEDED-DEDDEDDD 1388
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 37.2 bits (86), Expect = 0.070
Identities = 15/35 (42%), Positives = 23/35 (65%)
Query: 488 LKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDI 522
L+GHT +V ++ HP VL SAG D ++ +WD+
Sbjct: 121 LQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDV 155
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits in
yeasts and in human cells. Twelve of these are akin to
RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34
and Rpc82 form a cluster of enzyme-specific subunits that
contribute to transcription initiation in S.cerevisiae
and H.sapiens. There is evidence that these subunits are
anchored at or near the N-terminal Zn-fold of Rpc1,
itself prolonged by a highly conserved but RNA polymerase
III-specific domain.
Length = 221
Score = 36.3 bits (84), Expect = 0.078
Identities = 12/38 (31%), Positives = 26/38 (68%)
Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
K E+ED E +E++E ++ EEEE E+ +++++++
Sbjct: 160 LKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 33.2 bits (76), Expect = 0.72
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
K E E E EE+E+ ++ EEEE E+ ++++++
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD 195
Score = 31.3 bits (71), Expect = 3.2
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ E E E +EE+ D+ EEEE EE E+ ++++
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193
Score = 30.1 bits (68), Expect = 6.6
Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 9/62 (14%)
Query: 1228 TRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEE--------SNDQSEEEEVEENSEEEEE 1279
K + +KF + G E ED E E + E+E+ EE EEEEE
Sbjct: 126 AGKKLALSKFKRKV-GLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEE 184
Query: 1280 EE 1281
E+
Sbjct: 185 ED 186
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 34.3 bits (79), Expect = 0.082
Identities = 13/36 (36%), Positives = 20/36 (55%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
A + G E D++EEEE EE +EEE +++
Sbjct: 67 AGAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits. The
Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 34.1 bits (78), Expect = 0.086
Identities = 17/36 (47%), Positives = 20/36 (55%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
A ESE+ E EEEEE + E E + EE EEE
Sbjct: 57 AEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEI 92
Score = 32.9 bits (75), Expect = 0.20
Identities = 14/39 (35%), Positives = 20/39 (51%)
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ E DE E EEEEE ++ E ++ EEE E+
Sbjct: 56 EAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIED 94
Score = 31.8 bits (72), Expect = 0.60
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E+E E EEEE+ + + EE EE E+ E
Sbjct: 63 NDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96
Score = 29.1 bits (65), Expect = 5.0
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 1257 EEEESNDQSEEEEVEENSEEEEEEEA 1282
EE E ND+ EEEE EE E E E +
Sbjct: 58 EESEENDEEEEEEEEEEDEGEIEYVS 83
Score = 28.7 bits (64), Expect = 6.1
Identities = 15/37 (40%), Positives = 23/37 (62%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
KA E+E+ E +EEEE ++ E+E E ++EE E
Sbjct: 53 KALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELE 89
Score = 28.3 bits (63), Expect = 8.5
Identities = 15/34 (44%), Positives = 22/34 (64%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E E EEEEE ++ +E E+E S++EE EE +
Sbjct: 59 ESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEI 92
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved cysteine
residues. The function of this protein is unknown.
Length = 314
Score = 36.5 bits (84), Expect = 0.090
Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 4/86 (4%)
Query: 1201 YAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDE----VEGE 1256
+M R K + + R R + S NK+ ES+D+
Sbjct: 225 LSMSTRSYKFGRQTRGFRSDDDQGYGRYKQNLSSNKYGDFKGVNYGYESDDDEGSSSNDY 284
Query: 1257 EEEESNDQSEEEEVEENSEEEEEEEA 1282
+EEE D + E+ ++ + +E+
Sbjct: 285 DEEEDGDDDDNEDNDDTNTNHGDEDC 310
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts harbouring
premature signals for translation termination are
recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 35.4 bits (82), Expect = 0.090
Identities = 15/50 (30%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEE------EEEEAVPRRRPVRR 1291
+ E++ E EE+E ++ S+EEEV+ +E+ EEE+ R+
Sbjct: 9 DGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEV 58
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 36.6 bits (84), Expect = 0.098
Identities = 31/153 (20%), Positives = 55/153 (35%), Gaps = 8/153 (5%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRR---RPVRRANVAKVKRAVPPA 1304
ES +E E EE E +SE++ ++EE ++EE P +R ++ +
Sbjct: 118 ESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTH 177
Query: 1305 PDTSTSRLRSVRQNGRAKRTNGR-QSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTK 1363
T S A+ G+ + E+ ++ E+ + +
Sbjct: 178 KLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRR 237
Query: 1364 GQSYAVHSSETETE----SEEEESQAAQVSFSR 1392
Q A S E E EE E + A+ + R
Sbjct: 238 KQEEADRKSREEEEKRRLKEEIERRRAEAAEKR 270
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 36.0 bits (83), Expect = 0.10
Identities = 15/36 (41%), Positives = 22/36 (61%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E +D+ +G+ EE + +EEE E EEEEE+E
Sbjct: 250 EEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 36.4 bits (84), Expect = 0.11
Identities = 15/64 (23%), Positives = 20/64 (31%)
Query: 1218 RRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEE 1277
R G I G + F K E +EE ++ +E V E E
Sbjct: 310 RNGNRIKELADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEM 369
Query: 1278 EEEE 1281
E E
Sbjct: 370 ELLE 373
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone sialoprotein
(BSP) is a major structural protein of the bone matrix
that is specifically expressed by fully-differentiated
osteoblasts. The expression of bone sialoprotein (BSP) is
normally restricted to mineralised connective tissues of
bones and teeth where it has been associated with mineral
crystal formation. However, it has been found that
ectopic expression of BSP occurs in various lesions,
including oral and extraoral carcinomas, in which it has
been associated with the formation of microcrystalline
deposits and the metastasis of cancer cells to bone.
Length = 291
Score = 35.8 bits (82), Expect = 0.12
Identities = 36/149 (24%), Positives = 56/149 (37%), Gaps = 15/149 (10%)
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVP 1302
G + E + + EEE + S EEE E+S+ E+EEA + + A P
Sbjct: 47 GSDSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAENTTLSTVTLGYGGDATP 106
Query: 1303 PAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRT 1362
+ + L+ ++ G A + ++ S D D E+ +EEE+ T
Sbjct: 107 GTGNIGLAALQLPKKAGNAGKKATKEDES-------DEDEEEEEEEEEEEAEVEENEQGT 159
Query: 1363 KGQSYAV--------HSSETETESEEEES 1383
G S S E EEES
Sbjct: 160 NGTSTNSTEVDHGNGSSGGDNGEEGEEES 188
Score = 34.3 bits (78), Expect = 0.42
Identities = 35/138 (25%), Positives = 51/138 (36%), Gaps = 28/138 (20%)
Query: 1136 SHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDY 1195
S SDSS E+G S EE E+ TS + E+ D
Sbjct: 46 SGSDSSEENGDGDSSEEEGEEE---------------------TSNEEENNEDSDGNEDE 84
Query: 1196 DSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEG 1255
++ ++ + +V G G + G + G K EDE +
Sbjct: 85 EA-------EAENTTLSTVTLGYGGDATPGTGNIGLAALQLPKKAGNAGKKATKEDESDE 137
Query: 1256 EEEEESNDQSEEEEVEEN 1273
+EEEE ++ EE EVEEN
Sbjct: 138 DEEEEEEEEEEEAEVEEN 155
Score = 30.8 bits (69), Expect = 5.4
Identities = 53/255 (20%), Positives = 80/255 (31%), Gaps = 59/255 (23%)
Query: 1129 SSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYREN 1188
S SEEN DSS E G S+EE +ED + T+L
Sbjct: 48 SDSSEENGDGDSSEEEGEEETSNEE-ENNEDSDGNED------EEAEAENTTL------- 93
Query: 1189 PSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPE 1248
S V+ GD A + + ++Q ++ T+ +S E
Sbjct: 94 -STVTLGYGGD--ATPGTGNIGLAALQLPKKAGNAGKKATKEDES---------DEDEEE 141
Query: 1249 SEDEVEGEEEEESNDQS---------------------EEEEVEENSEEEEEEEAVPRRR 1287
E+E E E E E N+Q EE EE S E E E
Sbjct: 142 EEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVTEAEAEGTTVAG 201
Query: 1288 PVRRANVAKVKRAVPPAPDTST-----SRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDS 1342
P + + PP T ++ + G ++T + G +
Sbjct: 202 PTTTSPNGGFQPTTPPQEVYGTTDPPFGKVTTPEYQGEYEQTGANEY-------DGGYEI 254
Query: 1343 EDNSQEEEQYGSRRA 1357
++ E + S RA
Sbjct: 255 YESENGEPRGDSYRA 269
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 36.2 bits (84), Expect = 0.13
Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 16/88 (18%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPP-- 1303
PE E E+ +D E+EE ++++E+ EE V + V+ A+PP
Sbjct: 264 PPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEIV----------LEAVRAALPPDL 313
Query: 1304 ----APDTSTSRLRSVRQNGRAKRTNGR 1327
A + +R RS G A++ N R
Sbjct: 314 LARLAAGGARARARSAGGAGAAQKGNRR 341
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46. Homologues
of CDC45 have been identified in human, mouse and smut
fungus among others.
Length = 583
Score = 36.1 bits (84), Expect = 0.13
Identities = 11/44 (25%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
+ +D+ E +EE+E + +SE++E +++ +++++ R RR
Sbjct: 126 DDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIAT-RERSLERR 168
Score = 33.4 bits (77), Expect = 1.0
Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 11/92 (11%)
Query: 1104 EKIITGLEEVMGLSVAEPFLVPVDISSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRH 1163
I L++ A L D +++ SD E S S+ E+D +D+D
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDD-----DDDEESDEEDEESSKSEDDEDDDDDDDDDDI 158
Query: 1164 GKARRMPPRRTNRVGTSLGRQYRENPSDVSDY 1195
R RR R R++ E +++ Y
Sbjct: 159 ATRERSLERRRRR------REWEEKRAELEFY 184
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 35.0 bits (81), Expect = 0.14
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E ++VE E EE +++ E+EVEE + EEE E +
Sbjct: 3 EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAEL 38
Score = 33.4 bits (77), Expect = 0.43
Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 2/44 (4%)
Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPR 1285
+ + E +E E EE E D+ EEE VEE SE E +E +
Sbjct: 4 KNEQVEEEVEETEVEEAVE--DEVEEETVEEESEAELLDEEQAK 45
Score = 32.3 bits (74), Expect = 1.1
Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 3/40 (7%)
Query: 1248 ESEDEVEGEEEEESNDQSE---EEEVEENSEEEEEEEAVP 1284
E + EEE E + E +E EE EEE E E +
Sbjct: 1 MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLD 40
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this family
are typically between 146 and 237 amino acids in length.
There are two conserved sequence motifs: QYGP and PKLY.
Length = 155
Score = 34.3 bits (79), Expect = 0.15
Identities = 16/60 (26%), Positives = 28/60 (46%), Gaps = 1/60 (1%)
Query: 1254 EGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDT-STSRL 1312
+EEEE+ ++S +E +E+ E + E ++R V + K K P S +L
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKL 153
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 36.3 bits (84), Expect = 0.16
Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 7/60 (11%)
Query: 1238 SGRGRGRKAPESE------DEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
+G G E E + E+ +D ++ + EEEE+ RRR +R
Sbjct: 14 NGSGGLVGTDSGESRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGN-MRRRLIRT 72
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this family
was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 35.3 bits (81), Expect = 0.16
Identities = 18/40 (45%), Positives = 25/40 (62%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
E EV EEE+ + + EEEE EE E + E++ PR +PV
Sbjct: 1 ETEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPV 40
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 35.4 bits (81), Expect = 0.17
Identities = 16/39 (41%), Positives = 19/39 (48%)
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ E D E EE+EE S E E +EEEEEE
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEA 216
Score = 31.1 bits (70), Expect = 3.5
Identities = 18/53 (33%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEE--NSEEEEEEEAVPRRRPVRRANVAKVKRA 1300
ED V+ E E Q EEEE+EE + E E E V R N +
Sbjct: 223 EDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKK 275
Score = 30.3 bits (68), Expect = 6.0
Identities = 15/43 (34%), Positives = 21/43 (48%)
Query: 1239 GRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
G G E E+E EG ++ E E E +++ EEEE E
Sbjct: 202 GEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEME 244
Score = 30.3 bits (68), Expect = 6.7
Identities = 15/44 (34%), Positives = 19/44 (43%), Gaps = 2/44 (4%)
Query: 1241 GRGRKAPESEDEVEG--EEEEESNDQSEEEEVEENSEEEEEEEA 1282
+ E +E +G EEE+E E EEEEEEA
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEA 216
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region of
the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is dispensable
for binding to Spt4.
Length = 92
Score = 32.8 bits (75), Expect = 0.18
Identities = 11/42 (26%), Positives = 22/42 (52%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
+ E E ++EEE ++ E++ + + E+E +EA R
Sbjct: 3 DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHR 44
Score = 29.7 bits (67), Expect = 2.9
Identities = 18/44 (40%), Positives = 24/44 (54%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
E +DE E EEEEE + + +E E E E E++ RR RR
Sbjct: 7 EVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRR 50
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 35.8 bits (83), Expect = 0.19
Identities = 22/42 (52%), Positives = 23/42 (54%)
Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
R R R+ P S E E EEEEE EEEE EE E EEE
Sbjct: 292 RRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 34.2 bits (79), Expect = 0.49
Identities = 25/61 (40%), Positives = 28/61 (45%), Gaps = 7/61 (11%)
Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKR 1299
R R R+ P EEEEE + EEEE EE EE EE V RA VA+ R
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEV-------RATVAEAIR 343
Query: 1300 A 1300
Sbjct: 344 L 344
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 35.4 bits (81), Expect = 0.19
Identities = 13/47 (27%), Positives = 21/47 (44%), Gaps = 2/47 (4%)
Query: 1249 SEDEVEGEEEEESNDQS--EEEEVEENSEEEEEEEAVPRRRPVRRAN 1293
SE + E ++EE N ++ EE + + E EE+E R
Sbjct: 290 SEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHT 336
Score = 30.1 bits (67), Expect = 8.8
Identities = 11/35 (31%), Positives = 13/35 (37%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E EE + EE EE+ E EE E
Sbjct: 300 EVENAEAHKEEVQSDRPDEIGEEKEEDDENEENER 334
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 33.0 bits (76), Expect = 0.22
Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 5/37 (13%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
A + EE++ EEEE EE EE EEE A
Sbjct: 67 AAAAAAAAAAAEEKK-----EEEEEEEEKEESEEEAA 98
Score = 30.7 bits (70), Expect = 1.2
Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 5/35 (14%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
A + E ++EEE EEEE +E SEEE
Sbjct: 70 AAAAAAAAEEKKEEE-----EEEEEKEESEEEAAA 99
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity of
nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 35.1 bits (81), Expect = 0.23
Identities = 14/43 (32%), Positives = 24/43 (55%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
+ E E + +E+E ++ EE+ E +EEEE+EE + P
Sbjct: 35 TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKASP 77
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation domain.
The centromere protein B (CENP-B) dimerisation domain is
composed of two alpha-helices, which are folded into an
antiparallel configuration. Dimerisation of CENP-B is
mediated by this domain, in which monomers dimerise to
form a symmetrical, antiparallel, four-helix bundle
structure with a large hydrophobic patch in which 23
residues of one monomer form van der Waals contacts with
the other monomer. This CENP-B dimer configuration may be
suitable for capturing two distant CENP-B boxes during
centromeric heterochromatin formation.
Length = 101
Score = 32.8 bits (74), Expect = 0.23
Identities = 16/58 (27%), Positives = 31/58 (53%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAP 1305
E E++ + + +EE +D E+EE ++ ++E+++E A A VKR + P
Sbjct: 7 EGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSFGEAMAYFAMVKRYLTSFP 64
Score = 31.3 bits (70), Expect = 0.85
Identities = 10/29 (34%), Positives = 23/29 (79%)
Query: 1253 VEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+EGEE+ +S+ EE++ +E+ E+++E++
Sbjct: 6 LEGEEDSDSDSDEEEDDDDEDEEDDDEDD 34
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 35.3 bits (81), Expect = 0.23
Identities = 25/123 (20%), Positives = 37/123 (30%), Gaps = 10/123 (8%)
Query: 1162 RHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQ 1221
RHG R RR + P+ + SG + A Q + Q +
Sbjct: 25 RHGGTREQAGRRRGTAARAA------KPAPPAPTTSGPQVRAVAEQGHR----QTESDTE 74
Query: 1222 TIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
T SR K+ + G G ++ S EE E +S E
Sbjct: 75 TAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASH 134
Query: 1282 AVP 1284
+ P
Sbjct: 135 SPP 137
Score = 31.1 bits (70), Expect = 4.6
Identities = 9/40 (22%), Positives = 12/40 (30%)
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
G AP ++ S Q E+ E E E
Sbjct: 145 GPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSE 184
Score = 30.7 bits (69), Expect = 5.4
Identities = 17/91 (18%), Positives = 30/91 (32%), Gaps = 2/91 (2%)
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRA 1300
+ P+ E + +++ + ++ VE E E E P R + V
Sbjct: 202 PPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRSHSYTVVGWK 261
Query: 1301 VPPAPD-TSTSRLRSVRQNG-RAKRTNGRQS 1329
P LR R++ G+QS
Sbjct: 262 PSTRPGGVPKLCLRCTSHPSHRSRLPEGQQS 292
Score = 30.3 bits (68), Expect = 7.5
Identities = 6/42 (14%), Positives = 13/42 (30%)
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
+ P S + E+ E + E ++ + E
Sbjct: 157 PSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of the
RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 35.1 bits (81), Expect = 0.24
Identities = 21/69 (30%), Positives = 30/69 (43%), Gaps = 1/69 (1%)
Query: 1214 VQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEEN 1273
V+ Q V R + R R R P +EV+ +E+EE +S+E E EE
Sbjct: 332 VREHNNDQLNVKLRNPSTKESKMRDKR-RARLDPIDFEEVDEDEDEEEEQRSDEHEEEEG 390
Query: 1274 SEEEEEEEA 1282
+ EEE
Sbjct: 391 EDSEEEGSQ 399
Score = 33.6 bits (77), Expect = 0.88
Identities = 11/35 (31%), Positives = 15/35 (42%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
EDE E EE+ + EE E E + E+
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404
Score = 32.8 bits (75), Expect = 1.4
Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
E DE E EEEE+ +D+ EEEE E+SEEE +
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEE-GEDSEEEGSQSR 401
Score = 32.4 bits (74), Expect = 2.0
Identities = 13/75 (17%), Positives = 23/75 (30%), Gaps = 17/75 (22%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPD 1306
E +++ + EEE+ S++ EEE + E + E +
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGS-----------------SESSS 410
Query: 1307 TSTSRLRSVRQNGRA 1321
S S A
Sbjct: 411 DVGSDSESKADKESA 425
>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle.
Length = 519
Score = 35.3 bits (80), Expect = 0.24
Identities = 55/243 (22%), Positives = 82/243 (33%), Gaps = 36/243 (14%)
Query: 1170 PPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTR 1229
PP +T T + E S+ S DS M A ++ Q+ +G+ Q S
Sbjct: 4 PPGKTGPAATQAKAEKPEEDSESSSEDSDSEEEMPAAKNP-PQAKPSGKSPQVKAAS-AP 61
Query: 1230 GKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
K+S +G P + EE SEEE +SE E A P
Sbjct: 62 AKES----PQKGAPPVTPGKAGPAAAQAGEEEAKSSEEES---DSEGETPTAATLTTSPA 114
Query: 1290 RRANVAKVKR-----AVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSED 1344
+ + K + V P P + L ++ G A G+Q S + +SDS+
Sbjct: 115 QAKPLGKNSQVRPASTVTPGPSGKGANLPCPQKAGSAAVQVGKQEDSE-SSSEEESDSDG 173
Query: 1345 NSQEEEQYGSRRARRTR-----TKGQSYAV----------------HSSETETESEEEES 1383
+ S + + R KG SSE ++SEEE
Sbjct: 174 PGAPAQAKSSGKLLQARPASGPAKGPPQKAGPVATQVKAERGKEDSESSEESSDSEEEAP 233
Query: 1384 QAA 1386
A
Sbjct: 234 AAM 236
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 34.6 bits (80), Expect = 0.28
Identities = 13/47 (27%), Positives = 22/47 (46%)
Query: 1266 EEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRL 1312
+++E E+ E E+E++ ANV K+ + P TS L
Sbjct: 15 DDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFL 61
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 34.8 bits (80), Expect = 0.31
Identities = 19/39 (48%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
G P ED+ EG E ++ EEEEVEE E+EEEE
Sbjct: 58 GNPDPFGEDD-EGRIEVRISEDGEEEEVEEGEEDEEEEG 95
Score = 32.1 bits (73), Expect = 2.1
Identities = 14/34 (41%), Positives = 20/34 (58%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E + + EEE ++ EE+E EE EE EE E +
Sbjct: 72 EVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPL 105
Score = 31.7 bits (72), Expect = 2.4
Identities = 16/37 (43%), Positives = 18/37 (48%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVP 1284
E EV E+ E + E EE EE EEE EE P
Sbjct: 68 EGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEP 104
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.0 bits (81), Expect = 0.32
Identities = 37/216 (17%), Positives = 71/216 (32%), Gaps = 27/216 (12%)
Query: 1205 ARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKF---SGRGRGRKAPESEDEVEGEEEEES 1261
+ + + Q + + + + ++ + + R A S++ E E E E
Sbjct: 706 ELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEE 765
Query: 1262 NDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRA 1321
++ EE EE +E E E E + A + ++K + L +R
Sbjct: 766 LEERLEEAEEELAEAEAEIEEL-------EAQIEQLKEEL----KALREALDELR--AEL 812
Query: 1322 KRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETESEEE 1381
N ++ S + + E + +S A E E EE
Sbjct: 813 TLLNEEAANLRERLESLERRIAATERRLEDL-EEQIEELSEDIESLAAEIEELEELIEEL 871
Query: 1382 ESQAAQVSFSR----------RGRMRKPTAKLRGLE 1407
ES+ + R R + + + +LR LE
Sbjct: 872 ESELEALLNERASLEEALALLRSELEELSEELRELE 907
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part of
a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 35.0 bits (81), Expect = 0.33
Identities = 21/138 (15%), Positives = 49/138 (35%), Gaps = 3/138 (2%)
Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSR 1311
E + EE+ ++ S+ EE E+++E++ + R + K+K ++
Sbjct: 319 EGKSVSEEDEDEDSDSEEE---DEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLS 375
Query: 1312 LRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHS 1371
++ +++ K+ L + D + EE RR
Sbjct: 376 MKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAE 435
Query: 1372 SETETESEEEESQAAQVS 1389
S+ + + E + + S
Sbjct: 436 SKKLKKENKNEFKEKKES 453
Score = 33.5 bits (77), Expect = 1.1
Identities = 28/155 (18%), Positives = 48/155 (30%), Gaps = 26/155 (16%)
Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPP 1303
++A + E E EE + E EE + E EE + RR K
Sbjct: 380 QRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRR--------KFGPENGE 431
Query: 1304 APDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTK 1363
S + + + K + E+ EEE + A + +
Sbjct: 432 KEAESKKLKKENKNEFKEK--------------KESDEEEELEDEEEAKVEKVANKLLKR 477
Query: 1364 GQSYAVHSSETETESEEEESQAAQVSFSRRGRMRK 1398
+ + + E E E +E + S G+ K
Sbjct: 478 SEK----AQKEEEEEELDEENPWLKTTSSVGKSAK 508
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically binds
the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 32.4 bits (74), Expect = 0.34
Identities = 21/62 (33%), Positives = 24/62 (38%), Gaps = 10/62 (16%)
Query: 1230 GKQSYNKFSGRGRG----------RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEE 1279
G+ +K G GRG E +E E E EE EE E EEEE
Sbjct: 12 GRGGEDKREGAGRGNWGSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEV 71
Query: 1280 EE 1281
EE
Sbjct: 72 EE 73
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 35.1 bits (81), Expect = 0.35
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
R E EDE E EEE+E S+E +E EE++ E
Sbjct: 384 ANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVE 424
Score = 34.3 bits (79), Expect = 0.52
Identities = 12/48 (25%), Positives = 22/48 (45%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKV 1297
E + EEEE+ ++ EE+E E S+E ++E + +
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432
Score = 33.6 bits (77), Expect = 0.96
Identities = 17/45 (37%), Positives = 25/45 (55%)
Query: 1237 FSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
FS E +DE E +EEEE + +E +E+S++EE EE
Sbjct: 376 FSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEE 420
Score = 32.4 bits (74), Expect = 2.0
Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 6/49 (12%)
Query: 1245 KAPESE-----DEVEGEEEEESNDQSE-EEEVEENSEEEEEEEAVPRRR 1287
KA E+E D E E++N + + EEE +E EEEE+E+ P +
Sbjct: 363 KAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKE 411
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 34.6 bits (80), Expect = 0.35
Identities = 13/34 (38%), Positives = 24/34 (70%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
++E+E E ++ ES D+ EE+E EE E+++E +
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318
Score = 33.1 bits (76), Expect = 0.92
Identities = 10/34 (29%), Positives = 21/34 (61%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E +E E E++ ++ +EE+ +E EE+++E
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEG 317
Score = 32.7 bits (75), Expect = 1.3
Identities = 12/37 (32%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
KA E E+E + E E+ ++ E+E+ EE +++E ++
Sbjct: 285 KAEEEEEEDDYSESEDEDE--EDEDEEEEEDDDEGDK 319
Score = 31.9 bits (73), Expect = 2.0
Identities = 10/34 (29%), Positives = 22/34 (64%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
++E E E++ ++ EE++ E+ +E+EE+E
Sbjct: 275 DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDE 308
Score = 30.8 bits (70), Expect = 4.5
Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 5/44 (11%)
Query: 1243 GRKAPESEDEVEGEEEEESND-----QSEEEEVEENSEEEEEEE 1281
G + E+ +E EEEE D + E+EE E+ EEE+++E
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter subunit
G; Reviewed.
Length = 197
Score = 33.9 bits (77), Expect = 0.36
Identities = 29/102 (28%), Positives = 46/102 (45%), Gaps = 17/102 (16%)
Query: 1195 YDSGDRYAMRARQSKQVQSVQAGR---------RGQTIVTSRTRGKQSYNKFSGRGRGRK 1245
YD+G A+R R Q++SV+ R + I +R ++ + R K
Sbjct: 93 YDTGVPLAIRIRD--QLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEK 150
Query: 1246 APESEDEVEGEEE------EESNDQSEEEEVEENSEEEEEEE 1281
E ED+ E E E EE +D SE E +E++ E E ++
Sbjct: 151 IDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDD 192
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This family
consists of several uncharacterized plant proteins of
unknown function.
Length = 565
Score = 34.6 bits (79), Expect = 0.41
Identities = 19/34 (55%), Positives = 25/34 (73%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E E+EVE EE+EE ++ + V+E SEEEEEEE
Sbjct: 141 EEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 34.5 bits (79), Expect = 0.45
Identities = 13/41 (31%), Positives = 27/41 (65%)
Query: 487 VLKGHTNEVFVLESHPFDSRVLLSAGHDGLIIIWDILSSKQ 527
+LKGH ++ +++ +P + ++ S+G D + IWDI + K+
Sbjct: 120 ILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160
Score = 32.6 bits (74), Expect = 1.6
Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 19/83 (22%)
Query: 260 LLAAGTVDKTIRIWNL--------QTLAPISVLVGHQGIITGVNFCPLEVNGFNY--LAS 309
+LA+G+ D TIR+W + + P +L GH+ I+ +++ P+ NY + S
Sbjct: 89 ILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPM-----NYYIMCS 143
Query: 310 TSTDGCIGFWKYKLDIDISKTVF 332
+ D + W DI+ K F
Sbjct: 144 SGFDSFVNIW----DIENEKRAF 162
Score = 30.3 bits (68), Expect = 9.6
Identities = 20/68 (29%), Positives = 34/68 (50%), Gaps = 6/68 (8%)
Query: 455 VAWDASDRWVITAINFNFQIKIWDAFNGDLVQVLKGHTNEVFVLESHPFDSRVLLSAGHD 514
V W+ +I AI Q++ +++ LKGHT+ + L+ +P S +L S D
Sbjct: 43 VPWEVEGGGLIGAIRLENQMR-----KPPVIK-LKGHTSSILDLQFNPCFSEILASGSED 96
Query: 515 GLIIIWDI 522
I +W+I
Sbjct: 97 LTIRVWEI 104
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 34.6 bits (79), Expect = 0.46
Identities = 16/44 (36%), Positives = 24/44 (54%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
ES++ E E+ES + S +EE E+ +E E + A R RR
Sbjct: 957 ESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDSRPGKRR 1000
Score = 33.1 bits (75), Expect = 1.4
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)
Query: 1237 FSGRGRGRKAPESEDEV---EGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
F G ++ ESE+EV E ++ES++ E+EE +E+SE+ E+E+
Sbjct: 922 FLMVGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDES 970
Score = 31.9 bits (72), Expect = 2.9
Identities = 14/36 (38%), Positives = 23/36 (63%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
ES++ E EE +ES++ E+E E +S +EE+ E
Sbjct: 946 DDESDETDEDEESDESSEDLSEDESENDSSDEEDGE 981
>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
(NESP55). This family consists of several mammalian
neuroendocrine-specific golgi protein P55 (NESP55)
sequences. NESP55 is a novel member of the chromogranin
family and is a soluble, acidic, heat-stable secretory
protein that is expressed exclusively in endocrine and
nervous tissues, although less widely than chromogranins.
Length = 261
Score = 34.1 bits (77), Expect = 0.47
Identities = 24/118 (20%), Positives = 44/118 (37%), Gaps = 4/118 (3%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE----AVPRRRPVRRANVAKVKRAVPP 1303
E E ++E E E E+ ++E + E E E+ VP+ ++ ++
Sbjct: 121 EPESDIESETEFETEPETEPDTAPTTEPETEPEDEPGPVVPKGATFHQSLTERLHALKLQ 180
Query: 1304 APDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTR 1361
+ D S R Q + R L D + E+ +EEE+ R + +
Sbjct: 181 SADASPRRAPPSTQEPESAREGEEPERGPLDKDPRDPEEEEEEKEEEKQQPHRCKPKK 238
Score = 32.1 bits (72), Expect = 1.7
Identities = 27/81 (33%), Positives = 35/81 (43%), Gaps = 8/81 (9%)
Query: 1219 RGQTIVTSRTRGKQSYNKFSGRGRGRKAP----ESEDEVEGEEEEESNDQSEEEEVEENS 1274
+G T S T + S R+AP E E EGEE E + + EE
Sbjct: 162 KGATFHQSLTERLHALKLQSADASPRRAPPSTQEPESAREGEEPERGPLDKDPRDPEEEE 221
Query: 1275 EEEEEEEAVPRR----RPVRR 1291
EE+EEE+ P R +P RR
Sbjct: 222 EEKEEEKQQPHRCKPKKPARR 242
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 34.6 bits (79), Expect = 0.48
Identities = 18/71 (25%), Positives = 33/71 (46%), Gaps = 4/71 (5%)
Query: 1215 QAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEEN- 1273
+A ++ + R + + R + +DE E E++++ D EEEE EE
Sbjct: 121 KAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEI 180
Query: 1274 ---SEEEEEEE 1281
+E+EE+E
Sbjct: 181 KGFDDEDEEDE 191
Score = 31.9 bits (72), Expect = 3.4
Identities = 19/64 (29%), Positives = 32/64 (50%), Gaps = 1/64 (1%)
Query: 1222 TIVTSRTRGKQ-SYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
I+ R R + + S R + EDE E +++EE +++ EEEE E ++E+E
Sbjct: 129 MIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDE 188
Query: 1281 EAVP 1284
E
Sbjct: 189 EDEG 192
Score = 31.6 bits (71), Expect = 4.7
Identities = 12/49 (24%), Positives = 23/49 (46%)
Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
R R R E + + +D+ E+E+ ++ ++EEEEE +
Sbjct: 134 RRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKG 182
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 34.4 bits (79), Expect = 0.50
Identities = 25/112 (22%), Positives = 47/112 (41%), Gaps = 19/112 (16%)
Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENS----------------EEEEEEEAVPR 1285
R R+ PESEDE E E+E++ +D S++E+ +E+ +E++E
Sbjct: 130 RYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDED 189
Query: 1286 RRPVRRANVAKVKRAVPPA---PDTSTSRLRSVRQNGRAKRTNGRQSSSLLM 1334
++ + + K P D +L+ + K T+ + LL
Sbjct: 190 ADLSKKDVLEEPKMFKKPEEITWDDVFKKLKEIMSARGKKTTDRSEQIDLLE 241
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 32.7 bits (75), Expect = 0.50
Identities = 10/40 (25%), Positives = 24/40 (60%)
Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ +K D+ +++ +D +EE+++E +E+EE+E
Sbjct: 89 KKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 31.5 bits (72), Expect = 1.1
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
K + +D++ ++ +D EE+ EE+ E+EE+EE
Sbjct: 93 KFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria, where
L7/L12 forms either two or three homodimers and each
homodimer binds to the extended C-terminal helix of L10.
L7/L12 is attached to the ribosome through L10 and is the
only ribosomal protein that does not directly interact
with rRNA. Archaeal L12p is believed to function in a
similar fashion. However, hybrid ribosomes containing the
large subunit from E. coli with an archaeal stalk are
able to bind archaeal and eukaryotic elongation factors
but not bacterial elongation factors. In several
mesophilic and thermophilic archaeal species, the binding
of 23S rRNA to protein L11 and to the L10/L12p pentameric
complex was found to be temperature-dependent and
cooperative.
Length = 106
Score = 32.1 bits (73), Expect = 0.54
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 1256 EEEEESNDQSEEEEVEENSEEEEEEEA 1282
EE ++ EEE+ +E +EEEEEEA
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 31.3 bits (71), Expect = 1.0
Identities = 12/32 (37%), Positives = 20/32 (62%), Gaps = 5/32 (15%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEE 1277
A +E++ E +EEE+ ++EE +E EEE
Sbjct: 71 AAAAEEKAEEKEEEK-----KKEEEKEEEEEE 97
Score = 30.9 bits (70), Expect = 1.2
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 1256 EEEEESNDQSEEEEVEENSEEEEEEEA 1282
EE++ ++ EE++ EE EEEEEE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEAL 99
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 33.6 bits (77), Expect = 0.56
Identities = 15/34 (44%), Positives = 21/34 (61%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E ED+ +GEEEE + S E+ + +SEE E E
Sbjct: 232 EDEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265
Score = 33.6 bits (77), Expect = 0.57
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
A E DE E + E++ D+ + +E E++ + EEEE
Sbjct: 209 AEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEES 244
Score = 33.2 bits (76), Expect = 0.89
Identities = 12/36 (33%), Positives = 17/36 (47%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E +D+ E EE S+ SE+ + E E EA
Sbjct: 234 EDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAA 269
Score = 32.5 bits (74), Expect = 1.4
Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 5/42 (11%)
Query: 1246 APESEDEVEGEEE-----EESNDQSEEEEVEENSEEEEEEEA 1282
PES D + E+E +E +DQ EEEE + E+ +A
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDA 256
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with the
process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli synthesizes
cobalamin only when it is supplied with the precursor
cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with a
MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The other
two subunits are the P. denitrificans CobS (TIGR01650)
and CobN (pfam02514 CobN/Magnesium Chelatase) proteins.
To avoid potential confusion with the nonhomologous
Salmonella typhimurium/E.coli cobT gene product, the P.
denitrificans gene symbol is not used in the name of this
model [Biosynthesis of cofactors, prosthetic groups, and
carriers, Heme, porphyrin, and cobalamin].
Length = 600
Score = 34.1 bits (78), Expect = 0.57
Identities = 13/34 (38%), Positives = 17/34 (50%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E+E E +GE E E + S +E E E E E
Sbjct: 222 ENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255
Score = 32.6 bits (74), Expect = 1.5
Identities = 13/36 (36%), Positives = 18/36 (50%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
E E++ EGE E + +E E + E EEE V
Sbjct: 224 EQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMV 259
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in mouse
cells, independent of its ability to promote histone
deposition but dependent on its ability to interact with
HP1 - heterochromatin protein 1-rich heterochromatin
domains next to centromeres that are crucial for
chromosome segregation during mitosis. This HP1-CAF-1
interaction module functions as a built-in replication
control for heterochromatin, which, like a control
barrier, has an impact on S-phase progression in addition
to DNA-based checkpoints.
Length = 76
Score = 31.0 bits (71), Expect = 0.66
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 1249 SEDEVEGEEEEESNDQSEEEEVEENSEEEEE 1279
S+ E E EEE E + +EE+ EE+ +++ +
Sbjct: 44 SDAEWEEEEEGEDLESEDEEDEEEDDDDDMD 74
Score = 30.3 bits (69), Expect = 1.0
Identities = 11/30 (36%), Positives = 19/30 (63%), Gaps = 1/30 (3%)
Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ E EEEEE D E+E E+ E+++++
Sbjct: 45 DAEWEEEEEGEDLESEDE-EDEEEDDDDDM 73
>gnl|CDD|107161 PHA02264, PHA02264, hypothetical protein.
Length = 152
Score = 32.4 bits (73), Expect = 0.72
Identities = 8/68 (11%), Positives = 24/68 (35%), Gaps = 9/68 (13%)
Query: 927 YELSRLRSSPWGNLILQEPELVKVVGIKYEIAPPRMCCLKLAVMKPNGKLTNEFFNIKYH 986
Y + + ++++ P+++ V G L + + ++ ++N
Sbjct: 84 YNNTDYNGNIERRIVVKHPKVITVEGKP--------SYQFLTISLEDARVITVWYN-SVD 134
Query: 987 DMPDVIDF 994
D +D
Sbjct: 135 DTHRTLDL 142
>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
Length = 130
Score = 32.1 bits (73), Expect = 0.73
Identities = 16/41 (39%), Positives = 23/41 (56%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRR 1287
E E E EE+ ESN+ E + +E++EE+E V R R
Sbjct: 14 SEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQRPR 54
Score = 31.8 bits (72), Expect = 1.1
Identities = 9/31 (29%), Positives = 18/31 (58%)
Query: 1251 DEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
++++ E+E ++ EE EE+SE +E
Sbjct: 3 NDMDMGYEDEPSEPEIEEGAEEDSESNNNDE 33
>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK). This family
consists of several plant specific phytosulfokine
precursor proteins. Phytosulfokines, are active as either
a pentapeptide or a C-terminally truncated tetrapeptide.
These compounds were first isolated because of their
ability to stimulate cell division in somatic embryo
cultures of Asparagus officinalis.
Length = 74
Score = 30.8 bits (70), Expect = 0.75
Identities = 13/44 (29%), Positives = 19/44 (43%)
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRR 1286
R P + E + + +E+EV E E EEE + RR
Sbjct: 15 ARPLPTDTGHESSKIEAAESSKVDEDEVVSECEGEGEEECLMRR 58
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 34.0 bits (77), Expect = 0.77
Identities = 24/59 (40%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 1129 SSLSEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTN-RVGTSLGRQYR 1186
SS S S S SSS+S SS + +E +E ED R G +R +R R GR YR
Sbjct: 1256 SSSSSSCSSSSSSSDSSSSEEDGDEKNEKEDRERAGGGKRRGRQRLPIRDRVYRGRDYR 1314
>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
Length = 196
Score = 32.6 bits (74), Expect = 0.86
Identities = 14/42 (33%), Positives = 27/42 (64%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV 1289
E E E+ EEE++ N+ S E++E E+E+ ++ P+++ V
Sbjct: 3 EVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTV 44
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 34.0 bits (77), Expect = 0.86
Identities = 28/191 (14%), Positives = 77/191 (40%), Gaps = 5/191 (2%)
Query: 1200 RYAMRARQSKQVQSVQAGRRGQTIVTSRTRGK--QSYNKFSGRGRGRKAPESEDEVEGEE 1257
+ A A+++++ + ++ + + + + K ++ + + E + E
Sbjct: 1534 KKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEAR 1593
Query: 1258 EEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQ 1317
EE EEE+ + E ++ EEA + +++A + K+ V + +
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAE--EEKKKVEQLKKKEAEEKKKAEE 1651
Query: 1318 NGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETE 1377
+A+ N +++ D + +++ E+ ++A K A + E + +
Sbjct: 1652 LKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE-DEKKAAEALKKEAEEAKKAEELKKK 1710
Query: 1378 SEEEESQAAQV 1388
EE+ +A ++
Sbjct: 1711 EAEEKKKAEEL 1721
Score = 33.2 bits (75), Expect = 1.5
Identities = 24/165 (14%), Positives = 67/165 (40%), Gaps = 6/165 (3%)
Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPV-RRANVAKVKRAVP 1302
+KA E + E ++ EE E+++ EE + EE++ R+ ++A A+++ +
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599
Query: 1303 PAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRT 1362
+ + + K + + L + + +++E ++A +
Sbjct: 1600 LYEEEKKMKAEEAK-----KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKK 1654
Query: 1363 KGQSYAVHSSETETESEEEESQAAQVSFSRRGRMRKPTAKLRGLE 1407
+ + ++E ++EE++ +A + + + A + E
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699
Score = 31.3 bits (70), Expect = 5.6
Identities = 29/211 (13%), Positives = 92/211 (43%), Gaps = 9/211 (4%)
Query: 1202 AMRARQSKQVQSV----QAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEE 1257
A +A K+ + + ++ + + K + + R + A ++E+ + +E
Sbjct: 1223 AKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADE 1282
Query: 1258 EEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVK-RAVPPAPDTSTSRLRSVR 1316
+++ ++ + +E ++ E+++ +EA + ++A+ AK K D + + +
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK 1342
Query: 1317 QNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETET 1376
+ A + ++ +++ + +EE + ++A + K + + E +
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK---KKADAAKKKAEEKK-KADEAKK 1398
Query: 1377 ESEEEESQAAQVSFSRRGRMRKPTAKLRGLE 1407
++EE++ +A ++ + + + AK + E
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429
Score = 30.9 bits (69), Expect = 7.0
Identities = 31/158 (19%), Positives = 63/158 (39%), Gaps = 6/158 (3%)
Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEE--NSEEEEEEEAVPRRRPVRRANVAK---VK 1298
KA E++ G+ EE + +++ E+ +EE + E + R+A AK +
Sbjct: 1098 GKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIA 1157
Query: 1299 RAVPPAPDTSTSR-LRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRA 1357
R A +R ++ A++ + + L +E + EE+ + A
Sbjct: 1158 RKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEA 1217
Query: 1358 RRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRRGR 1395
R+ ++ AV +E + EE +A + + R
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIR 1255
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 33.3 bits (76), Expect = 0.92
Identities = 15/128 (11%), Positives = 30/128 (23%), Gaps = 9/128 (7%)
Query: 1183 RQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGR 1242
+ RE DR + R R+ + +S R ++ S + ++ R
Sbjct: 6 DREREKSRG----RDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSR 61
Query: 1243 GRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRA-NVAKVKRAV 1301
+ V + R+ +
Sbjct: 62 SP----RSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLWDIK 117
Query: 1302 PPAPDTST 1309
PP + T
Sbjct: 118 PPGYELVT 125
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 33.3 bits (76), Expect = 0.93
Identities = 15/59 (25%), Positives = 26/59 (44%)
Query: 1223 IVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
I T +GK + + K +D+ E E E+ + E E + + EEEE+ +
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSD 200
Score = 31.7 bits (72), Expect = 3.1
Identities = 15/45 (33%), Positives = 30/45 (66%)
Query: 1238 SGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
+ + + ++A ES D+ + EE E ++ EE E++S++EEEE++
Sbjct: 155 TKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDS 199
Score = 31.3 bits (71), Expect = 3.8
Identities = 10/38 (26%), Positives = 18/38 (47%)
Query: 1251 DEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
+ E +D+S+ E E S + EE + P ++P
Sbjct: 226 INYNEDTSESESDESDSEISESRSVSDSEESSPPSKKP 263
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S rRNAs,
demonstrating that 5'-end processing also has a redundant
pathway. Nop25 binds late pre-60S ribosomes, accompanying
them from the nucleolus to the nuclear periphery; and
there is evidence for both physical and functional links
between late 60S subunit processing and export.
Length = 134
Score = 31.9 bits (73), Expect = 0.97
Identities = 9/34 (26%), Positives = 15/34 (44%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E+E + EE+ E+ D + E+ E E
Sbjct: 78 LLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVT 111
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit of
the archaeal ribosome.
Length = 105
Score = 31.2 bits (71), Expect = 0.98
Identities = 19/37 (51%), Positives = 20/37 (54%), Gaps = 10/37 (27%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
A + E E EEEEE EEEE EEE EEEA
Sbjct: 70 AAAAAAEEEEEEEEE-----EEEE-----EEESEEEA 96
Score = 30.8 bits (70), Expect = 1.3
Identities = 17/32 (53%), Positives = 18/32 (56%), Gaps = 4/32 (12%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEE 1277
A + E EEEEE EEEE EE SEEE
Sbjct: 68 AAAAAAAAEEEEEEE----EEEEEEEEESEEE 95
Score = 29.6 bits (67), Expect = 3.0
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVE 1271
A E+E E EEEEE ++SEEE +
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMA 98
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 33.5 bits (76), Expect = 1.0
Identities = 32/150 (21%), Positives = 58/150 (38%), Gaps = 24/150 (16%)
Query: 699 SRANGESSAATEGTS-----PSGTAPTAQNKNFVKRLPLSQ-------HKIVVERINAFA 746
SR ES+ A + S PT K K+ + ++VV A
Sbjct: 48 SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107
Query: 747 EIESEEFIKESKKLFDSSTTTQPLVVKKEKKKKVVRQARTRAEYSEHIQDQYEDPDNLSL 806
+ + + +K +S+ V++EK +K VR+ R + E ++DQ + + +
Sbjct: 108 KESKKTPRRTRRKAAAASSD-----VEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDV 162
Query: 807 SGSE-------ESASDSDQSQVDVSDLCES 829
SE ES + D + D D+ +
Sbjct: 163 EESEFVTSLENESEEELDLEKDDGEDISHT 192
Score = 33.1 bits (75), Expect = 1.1
Identities = 26/139 (18%), Positives = 52/139 (37%), Gaps = 7/139 (5%)
Query: 1144 SGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGR-QYRENPSDVSDYDSGDRYA 1202
+GS + + DE+ +G + P R+ + T + E + S+
Sbjct: 46 AGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSEL------V 99
Query: 1203 MRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESN 1262
+ + +S + RR + + + + + RK + +++VE + E
Sbjct: 100 VNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEV 159
Query: 1263 DQSEEEEVEENSEEEEEEE 1281
EE E + E E EEE
Sbjct: 160 SDVEESEFVTSLENESEEE 178
Score = 30.8 bits (69), Expect = 6.1
Identities = 32/148 (21%), Positives = 50/148 (33%), Gaps = 26/148 (17%)
Query: 1227 RTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEE----------ESNDQSEEEEVEENSEE 1276
R R + + +G RK ES V+ E +E S ++ ++ V E SE
Sbjct: 35 RWRLQNKFLLAAGS---RKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEP 91
Query: 1277 EEE-------------EEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKR 1323
EE +E+ R RR A + + R V++
Sbjct: 92 LEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVE 151
Query: 1324 TNGRQSSSLLMTCSGDSDSEDNSQEEEQ 1351
G +S + S S +N EEE
Sbjct: 152 DQGSESEVSDVEESEFVTSLENESEEEL 179
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 33.8 bits (77), Expect = 1.0
Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 2/41 (4%)
Query: 1245 KAPESEDEVEGE-EEEESNDQSEEEEVEENSEEEEEEEAVP 1284
KA E +D E E EEE S+D ++E++ + +E + P
Sbjct: 3943 KALEDKDRQEKEDEEEMSDDVGIDDEIQPD-IQENNSQPPP 3982
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of AF4
(Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain. Lilliputian
represents a novel pair-rule gene that acts in
cytoskeleton regulation, segmentation and morphogenesis
in Drosophila.
Length = 1154
Score = 33.4 bits (76), Expect = 1.1
Identities = 17/32 (53%), Positives = 18/32 (56%), Gaps = 3/32 (9%)
Query: 1129 SSLSE--ENSHSDSSSESGSSSQSSEEDSEDE 1158
SS E +S SSS SGS S SS DSE E
Sbjct: 388 SSNPEPAASSSGSSSSSSGSES-SSGSDSESE 418
Score = 31.8 bits (72), Expect = 3.7
Identities = 43/209 (20%), Positives = 72/209 (34%), Gaps = 34/209 (16%)
Query: 1129 SSLSEENSHSDSSSESGSSSQSSEEDSED---------------------EDWTRHGKAR 1167
S S +S S+SSS S S S+SS DSE+ ++W
Sbjct: 398 SGSSSSSSGSESSSGSDSESESSSSDSEENEPPRTASPEPEPPSTNKWQLDNWLNKVNPH 457
Query: 1168 RMPPRRTNRVGTSLGRQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQTI--VT 1225
++ P + + + + SG +Y +++ S + RR +T
Sbjct: 458 KVSPAESVSSNPPIKQPME---KEGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGP 514
Query: 1226 SRTRGKQ-----SYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
RGKQ S R G+K P+ ++ +E E +
Sbjct: 515 ESGRGKQKSPAQSEAPPQRRTVGKKQPKKPEKASAGDERTGLRPESEPGTLPYGSSVQTP 574
Query: 1281 EAVPRRRPV--RRANVAK-VKRAVPPAPD 1306
P+ R+ + K K +VPPA +
Sbjct: 575 PDRPKAATKGSRKPSPRKEPKSSVPPAAE 603
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1 protein
homologues. SDA1 is a Saccharomyces cerevisiae protein
which is involved in the control of the actin
cytoskeleton. The protein is essential for cell viability
and is localised in the nucleus.
Length = 317
Score = 33.1 bits (76), Expect = 1.1
Identities = 12/43 (27%), Positives = 23/43 (53%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVR 1290
EDE +E E + +S++E +SE+EEE++ ++
Sbjct: 106 VEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKED 148
Score = 31.2 bits (71), Expect = 3.8
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 1254 EGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+G E ++ +D+ EE EVEE+ + ++E E
Sbjct: 90 QGLESDDDDDEEEEWEVEEDEDSDDEGE 117
Score = 31.2 bits (71), Expect = 4.4
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
E E+E + ++ D EE E+ E EEEEA
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEA 167
Score = 30.4 bits (69), Expect = 6.0
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRR 1286
ES D E EEE++ + +E+ +E EE+EEEA
Sbjct: 128 ESSDS-EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEE 165
Score = 30.0 bits (68), Expect = 9.4
Identities = 10/34 (29%), Positives = 16/34 (47%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ E E E + +D EEE E E+ ++E
Sbjct: 83 RKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This family
is a region of the Myb-Related Cdc5p/Cef1 proteins, in
fungi, and is part of the pre-mRNA splicing factor
complex.
Length = 363
Score = 33.1 bits (76), Expect = 1.2
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 13/77 (16%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEE--------EEEEEAVPRRRPVRRANVAKV 1297
P++E E+E EEEE + EEE+EE++ + EE +E R RR+ V +
Sbjct: 140 KPKNEFELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELR---RRSQV--I 194
Query: 1298 KRAVPPAPDTSTSRLRS 1314
+R +P LR
Sbjct: 195 QRNLPRPSVLDLIVLRP 211
>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675). The
members of this family are sequences derived from
hypothetical plant proteins of unknown function. One
member of this family is annotated as a putative
RNA-binding protein, but no evidence was found to
support this.
Length = 283
Score = 32.6 bits (74), Expect = 1.3
Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 9/87 (10%)
Query: 762 DSSTTTQPLVVKKEKKKKVVRQARTRAEY-SEHIQDQYEDPDNLSLSGSEESAS------ 814
TT+ P+ ++E +K+ Q+ R E + + +Y N S +A+
Sbjct: 58 LMRTTSLPVETEEEWRKRKEMQSLRRLEAKRKRSEKEYNGVSNGDDMDSINAANGGGSGR 117
Query: 815 --DSDQSQVDVSDLCESSSSSGGTSSS 839
+S + S + SSSS G +SS
Sbjct: 118 DKESSKGVETGSPIATSSSSGGPSSSG 144
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8 is
a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts with
LIN-35 Rb.Lin35 Rb is a product of the class B synMuv
gene lin-35 which silences genes required for vulval
specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 32.7 bits (75), Expect = 1.3
Identities = 27/115 (23%), Positives = 47/115 (40%), Gaps = 10/115 (8%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEEN------SEEEEEEEAVPRRRP--VRRANVAKVKRAV 1301
ED ++ E ND+ +E EVEE + +E E RP V+++ A+
Sbjct: 151 EDRTRESQDMEENDEVDEVEVEEVPDDYGANLAVDEPEQSTMSRPQEVKQSVPAQQAPPN 210
Query: 1302 PPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEE-EQYGSR 1355
P P S S + ++ R + Q S + + D++ +E EQ +
Sbjct: 211 PQQPMPSASSESATSKSASTSRESSPQPQS-PPPRRVPAPTVDSTAKEAEQITYQ 264
>gnl|CDD|224273 COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family)
[Replication, recombination, and repair].
Length = 248
Score = 32.3 bits (74), Expect = 1.4
Identities = 23/77 (29%), Positives = 31/77 (40%), Gaps = 5/77 (6%)
Query: 1247 PESEDEVEGEEEEESND--QSEEEEVEENSEEEEEEEAVPR-RRPV--RRANVAKVKRAV 1301
P+ E+E E EE EE D + EE E E E + RR V + K+K+
Sbjct: 76 PKEEEEAEDEELEEPRDELVARLEEYERYKEAAELLAELEEERRDVFSKIKPEIKIKKER 135
Query: 1302 PPAPDTSTSRLRSVRQN 1318
P + S L Q
Sbjct: 136 RPVEEISLIDLFRAYQK 152
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 33.3 bits (76), Expect = 1.4
Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKR 1299
E + E++EE +++ + EEV+E EEEE+ + N+ K+
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGF---YYELYEKVNIEANKK 325
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 32.9 bits (76), Expect = 1.4
Identities = 20/153 (13%), Positives = 38/153 (24%), Gaps = 10/153 (6%)
Query: 1255 GEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSRLRS 1314
G E + + EEEE + + E EE A A+ + A
Sbjct: 28 GIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAE 87
Query: 1315 VRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSET 1374
A ++ + + A++ + +
Sbjct: 88 AAAAAEAAARPAED----------EAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKA 137
Query: 1375 ETESEEEESQAAQVSFSRRGRMRKPTAKLRGLE 1407
+ + + R R RK K + E
Sbjct: 138 ARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTE 170
>gnl|CDD|147051 pfam04698, MOBP_C-Myrip, Myelin-associated oligodendrocytic basic
protein (MOBP). MOBP is abundantly expressed in central
nervous system myelin, and shares several characteristics
with myelin basic protein (MBP), in terms of regional
distribution and function. This family is the middle and
C-terminal regions of MOBP which has been shown to be
essential for normal arrangement of the radial component
in central nervous system myelin. Most member-proteins
carry a FVHE-PHD type zinc-finger at their N-terminus.
Length = 710
Score = 32.9 bits (74), Expect = 1.6
Identities = 54/260 (20%), Positives = 103/260 (39%), Gaps = 25/260 (9%)
Query: 1140 SSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDSGD 1199
+SS + + + + ED DW+ ++ PP + +L + + P+ D G
Sbjct: 234 ASSVASAYDEIGSDSEEDFDWSE-ALSKLCPPSQ------ALLKNPQPQPTQAQSSDQGP 286
Query: 1200 RYAM-RARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEE 1258
A + S +++ + + +SR G Q+ + R + + + GE +
Sbjct: 287 IAASPSSALSPNPEAMCSDSETSSAGSSREAGCQAKLSWLQRKAPKNPAAEKMPLHGELD 346
Query: 1259 EESNDQSEEEEVEENSEEEEEEEAVPRR-RPVRRANVAKVKRAVPPAPDTSTSRLRSVRQ 1317
N Q+ E ++S+ EE RR R RRA + P+ L S
Sbjct: 347 VNFNPQAAGGETSDSSDPEEALHTADRRARRWRRARL---------GPEEPNKELPSPGA 397
Query: 1318 NGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETE 1377
+ RA T Q S L ++D + +Q+ + + R + A+ SE ET
Sbjct: 398 HLRALDTA--QVSDDL----SETDISNEAQDPQSLTDSTEEKLRNRLYELAMKMSEKETS 451
Query: 1378 S-EEEESQAAQVSFSRRGRM 1396
S E++ES+ +++ +
Sbjct: 452 SGEDQESEPKAEPENQKESL 471
Score = 30.2 bits (67), Expect = 9.3
Identities = 31/142 (21%), Positives = 53/142 (37%), Gaps = 13/142 (9%)
Query: 1140 SSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRE-NPSDVSDYDSG 1198
+ E+ SS E + R + R+ P N+ S G R + + VSD S
Sbjct: 354 AGGETSDSSDPEEALHTADRRARRWRRARLGPEEPNKELPSPGAHLRALDTAQVSDDLSE 413
Query: 1199 DRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEE 1258
+ A+ + + + R R + K S K S ++ E E +
Sbjct: 414 TDISNEAQDPQSLTDSTEEKL-------RNRLYELAMKMS-----EKETSSGEDQESEPK 461
Query: 1259 EESNDQSEEEEVEENSEEEEEE 1280
E +Q E E+N++ +EE
Sbjct: 462 AEPENQKESLSSEDNNQGVQEE 483
>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein. S-antigens are heat stable
proteins that are found in the blood of individuals
infected with malaria.
Length = 94
Score = 30.3 bits (68), Expect = 1.6
Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEV-EENSEEEEEEEA 1282
G+G+K + ++E EGE ++E + SEE EEN E + E+
Sbjct: 51 NGKGQKYEDLQEEGEGENDDEEDSNSEESNNDEENGLIEGQGES 94
>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
Length = 177
Score = 31.3 bits (71), Expect = 2.0
Identities = 16/37 (43%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
K E E+EVE E EE+ ++ EE E SE E E
Sbjct: 6 KNEEVEEEVEATETEETVEEV-VEETPEKSELELANE 41
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein (DUF2052).
This entry is of sequences of two conserved domains
separated by a region of low complexity, spanning some
200 residues. The function is unknown.
Length = 178
Score = 31.2 bits (71), Expect = 2.1
Identities = 14/43 (32%), Positives = 23/43 (53%)
Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVP 1284
++ ES +E E + EEE+ ++ EE + EEEEE+
Sbjct: 75 EQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSY 117
>gnl|CDD|176394 cd01799, Hoil1_N, Ubiquitin-like domain of HOIL1. HOIL1_N HOIL-1
(heme-oxidized IRP2 ubiquitin ligase-1) is an E3
ubiquitin-protein ligase that recognizes heme-oxidized
IRP2 (iron regulatory protein2) and is thought to affect
the turnover of oxidatively damaged proteins. Hoil-1
has an amino-terminal ubiquitin-like domain as well as
an RBR signature consisting of two RING finger domains
separated by an IBR/DRIL domain.
Length = 75
Score = 29.4 bits (66), Expect = 2.2
Identities = 15/31 (48%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 628 YPPEVQRLVPGRENADDTELLPD--ITTNDD 656
+PP VQR V G+ A D E L I TN D
Sbjct: 38 FPPAVQRWVIGQRLARDQETLYSHGIRTNGD 68
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 31.6 bits (72), Expect = 2.2
Identities = 17/52 (32%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKV 1297
A SED+ + + E+ +++SEEE EE+ E +E E + +NV K
Sbjct: 72 AANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTE--SNVEKE 121
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes the
fifth essential DNA polymerase in yeast EC:2.7.7.7. Pol5p
is localised exclusively to the nucleolus and binds near
or at the enhancer region of rRNA-encoding DNA repeating
units.
Length = 784
Score = 32.5 bits (74), Expect = 2.2
Identities = 24/129 (18%), Positives = 40/129 (31%), Gaps = 26/129 (20%)
Query: 1223 IVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
++ + + F G E D+ E E E + +SE E E+ EE+E+E+
Sbjct: 631 VLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDD 690
Query: 1283 VPRRRPVRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDS 1342
A+ V P L V A G
Sbjct: 691 ------------AEANEGVVPIDKAVRRALPKVLNLPDAL--------------DGGDSE 724
Query: 1343 EDNSQEEEQ 1351
++ ++EQ
Sbjct: 725 DEEGMDDEQ 733
Score = 31.0 bits (70), Expect = 5.9
Identities = 28/129 (21%), Positives = 44/129 (34%), Gaps = 12/129 (9%)
Query: 1077 VEPEEIQELLYHPRPDEWPQGDRDATCEKIITGLEEVMGLSVAEPFLVPVDISSLSEENS 1136
V E +Q +L + DE +++ G EE E L D E +
Sbjct: 620 VTNEGLQLILDVLKADENKSR-----HQQLFEGEEE------DEDDLEETD-DDEDECEA 667
Query: 1137 HSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYD 1196
DS SES S + EED +++D + + + L + D D +
Sbjct: 668 IEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEE 727
Query: 1197 SGDRYAMRA 1205
D M
Sbjct: 728 GMDDEQMMR 736
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and is
typically between 384 and 430 amino acids in length.
Length = 414
Score = 32.1 bits (73), Expect = 2.2
Identities = 43/225 (19%), Positives = 68/225 (30%), Gaps = 62/225 (27%)
Query: 1136 SHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTS-------------LG 1182
SH S S S S+S+ + S+ ++ M PRR + +
Sbjct: 112 SHPRSRSASVSNSKDGDRPSD----LPPSPSKTMDPRRWSPTKATWLESALNKPESPKHK 167
Query: 1183 RQYRENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRG-----QTIVTSRTRGKQSYNKF 1237
Q + P D R RQS+ SV GR + RT S++K
Sbjct: 168 PQPPQQPEWKKDLS-------RLRQSR--ASVDLGRTNSFKEVTPVGLMRTPPPGSHSKS 218
Query: 1238 SGRGRGRKAPES---------EDEVEGEEEEE----------------------SNDQSE 1266
+ P S + + E + + E
Sbjct: 219 PSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAPPIDTTEE 278
Query: 1267 EEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTSR 1311
E + E S +E EE + +R P + K + P A + R
Sbjct: 279 ELKSPEASPKESEEASARKRSPSLLSPSPKAESPKPLASPGKSPR 323
>gnl|CDD|218218 pfam04702, Vicilin_N, Vicilin N terminal region. This region is
found in plant seed storage proteins, N-terminal to the
Cupin domain (pfam00190). In Macadamia integrifolia, this
region is processed into peptides of approximately 50
amino acids containing a C-X-X-X-C-(10-12)X-C-X-X-X-C
motif. These peptides exhibit antimicrobial activity in
vitro.
Length = 147
Score = 30.8 bits (69), Expect = 2.3
Identities = 24/135 (17%), Positives = 51/135 (37%), Gaps = 11/135 (8%)
Query: 1250 EDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTST 1309
E E G+ E++ ++ +E+ +EE++++ P RR + + T
Sbjct: 21 EQETRGQREQQQCERRCKEQY----KEEQQQQRQREEDPQRRYEQCQQRCQQHEPRHRPT 76
Query: 1310 SRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRAR---RTRTKGQS 1366
+ R R+ + ++ RQ C ++ +Q R R + +G+
Sbjct: 77 CQQRCERRYEKEQQQQQRQ----YQECQQRCQQQEQRPRRQQQCQRECREQYQEHGRGEG 132
Query: 1367 YAVHSSETETESEEE 1381
++ E SEE
Sbjct: 133 ENINQEGEEERSEEG 147
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 28.5 bits (64), Expect = 2.6
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 1258 EEESNDQSEEEEVEENSEEEEEEE 1281
EEE + EE E EE EE+ E +
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVK 45
Score = 27.7 bits (62), Expect = 4.5
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 1250 EDEVEGEEEEESNDQSEEE 1268
EDE E E+EEE +QSE +
Sbjct: 27 EDEEENEDEEEGEEQSEVK 45
Score = 27.3 bits (61), Expect = 5.4
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 1257 EEEESNDQSEEEEVEENSEEEEEE 1280
EEE+ D+ E E+ EE E+ E +
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVK 45
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is found
in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 31.7 bits (72), Expect = 2.6
Identities = 14/74 (18%), Positives = 28/74 (37%)
Query: 1207 QSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSE 1266
++ AG + + + ++A ES DE EEE DQ+E
Sbjct: 34 APSSSEASNAGSGSSEQKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAE 93
Query: 1267 EEEVEENSEEEEEE 1280
+ ++E+++
Sbjct: 94 AGKQLSQKQQEQQQ 107
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 31.8 bits (73), Expect = 2.7
Identities = 17/38 (44%), Positives = 17/38 (44%), Gaps = 5/38 (13%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
K S EEE EEEE EE EE EEEA
Sbjct: 289 KEVLSAQAQAAAAEEE-----EEEEEEEEEEEPSEEEA 321
Score = 30.6 bits (70), Expect = 6.5
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
++E++ ++ + EEE EE EEEEEE +
Sbjct: 282 DALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317
Score = 30.2 bits (69), Expect = 7.8
Identities = 17/37 (45%), Positives = 18/37 (48%), Gaps = 10/37 (27%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
A + E E EEEEE EEEE E EEE A
Sbjct: 296 AQAAAAEEEEEEEEE-----EEEE-----EPSEEEAA 322
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 31.6 bits (72), Expect = 2.9
Identities = 22/89 (24%), Positives = 37/89 (41%), Gaps = 5/89 (5%)
Query: 698 RSRANGESSAATEGTSPSGTAPTAQNKNFVKRLPLSQHKIVVERINAFAEIESEEFIKES 757
RSR++ SS+ + PS + ++N R S +V + A A++ + S
Sbjct: 155 RSRSSSRSSSRSNSRGPSRGSSRNNSRN---RNSSSPDDLVAAVLAALAKLGFGKQKSSS 211
Query: 758 KKLFDSSTTTQPLVVKKEKKKKVVRQART 786
KK + + KK+ K R RT
Sbjct: 212 KKPSRVTKKSAAEAAKKQLNKP--RWKRT 238
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 31.9 bits (73), Expect = 2.9
Identities = 9/50 (18%), Positives = 23/50 (46%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVA 1295
+ +D+ +E+EE + E E++ ++ + +E+ R R+
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206
>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
subunit E2; Reviewed.
Length = 411
Score = 31.7 bits (73), Expect = 2.9
Identities = 10/55 (18%), Positives = 16/55 (29%), Gaps = 1/55 (1%)
Query: 1251 DEVEGEEEEESNDQSEEEEVEENSEEEEEE-EAVPRRRPVRRANVAKVKRAVPPA 1304
+E E + + + E E + A P A A +A P
Sbjct: 77 EEEGEAEAAAAAEAAPEAPAPEPAPAAAAAAAAAPAAAAAPAAPAAAAAKASPAV 131
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 31.9 bits (72), Expect = 3.0
Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 10/81 (12%)
Query: 772 VKKEKKKKVVRQA-RTRAEYSEHIQDQYE---DPDNLSLSGSEESASDSDQSQVDVSDLC 827
VKK KKV + R + EYS+ ++E D D+ S ++ D
Sbjct: 22 VKKHAAKKVNKALDRAQDEYSQRSYSRFEDEDDDDDFPAPADGYSRGEAAD------DEE 75
Query: 828 ESSSSSGGTSSSDYSDWIHEG 848
E +SS T D D I+EG
Sbjct: 76 EGEASSDATEGHDEDDEIYEG 96
>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306). This
family of proteobacterial species proteins has no known
function.
Length = 115
Score = 30.0 bits (68), Expect = 3.3
Identities = 11/49 (22%), Positives = 14/49 (28%)
Query: 1237 FSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPR 1285
F R RK +E E + + EEEE E
Sbjct: 2 FLSRWSRRKLAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEE 50
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein (parp)
genes of Trypanosoma brucei encode a small family of
abundant surface proteins whose expression is restricted
to the procyclic form of the parasite. They are found at
two unlinked loci, parpA and parpB; transcription of both
loci is developmentally regulated.
Length = 145
Score = 30.3 bits (67), Expect = 3.4
Identities = 22/57 (38%), Positives = 26/57 (45%), Gaps = 1/57 (1%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPP 1303
E E E EGEEE E +++ EEE E E E E E P P A +K P
Sbjct: 78 EEPEPEEEGEEEPEP-EETGEEEPEPEPEPEPEPEPEPEPEPEPEPGAATLKSVALP 133
Score = 30.3 bits (67), Expect = 3.8
Identities = 27/70 (38%), Positives = 34/70 (48%), Gaps = 8/70 (11%)
Query: 1247 PESEDEVEGEEEEESNDQSEEE-EVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAP 1305
E E E EGEEE E ++ EEE E EE EEE E E P P + + P P
Sbjct: 68 EEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEP-------EPEPEPEPEP 120
Query: 1306 DTSTSRLRSV 1315
+ + L+SV
Sbjct: 121 EPGAATLKSV 130
>gnl|CDD|182033 PRK09697, PRK09697, protein secretion protein GspB; Provisional.
Length = 139
Score = 30.4 bits (68), Expect = 3.4
Identities = 23/83 (27%), Positives = 33/83 (39%), Gaps = 4/83 (4%)
Query: 1205 ARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQ 1264
ARQ + V+A T+ KQS + + + E E E E +
Sbjct: 51 ARQLWVLWIVKAEV---TVEAETPAFKQSTQHYFFKKQPLPVVE-SVEEEDEPGVAVENA 106
Query: 1265 SEEEEVEENSEEEEEEEAVPRRR 1287
E EEN+ EE +E+A R R
Sbjct: 107 PSSSEDEENTVEESDEKAGLRER 129
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 31.3 bits (72), Expect = 3.5
Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 5/40 (12%)
Query: 1241 GRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
GR + E+ +E E E EEE EEEE EE +EE E E
Sbjct: 224 GRQGRLAEAAEEEEEEAEEE-----EEEEEEEEAEEAEAE 258
>gnl|CDD|220023 pfam08801, Nucleoporin_N, Nup133 N terminal like. Nup133 is a
nucleoporin that is crucial for nuclear pore complex
(NPC) biogenesis. The N terminal forms a seven-bladed
beta propeller structure. This family now contains other
sized nucleoporins, including Nup155, Nup8, Nuo132,
Nup15 and Nup170.
Length = 411
Score = 31.7 bits (72), Expect = 3.5
Identities = 17/81 (20%), Positives = 28/81 (34%), Gaps = 11/81 (13%)
Query: 242 LRGCSGEISDIAIDN-RNILLAAGTVDKTIRIWNLQTLAPISVLVGHQGIITGVNF--CP 298
+ EI + +D+ R LL T TIR+W+L + ++ V+
Sbjct: 186 VPSSDREIVSLRVDSSRGRLLYTLTSKGTIRVWDLSP--------SGEKLVREVDIYEII 237
Query: 299 LEVNGFNYLASTSTDGCIGFW 319
L + S ST
Sbjct: 238 LRDLQESIPKSASTLSSKSLK 258
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim have
been identified in both mice and humans (mTim and hTIM,
respectively). Mammalian TIM is not the true orthologue
of Drosophila TIM, but is the likely orthologue of a fly
gene, timeout (also called tim-2). mTim has been shown to
be essential for embryonic development, but does not have
substantiated circadian function. Some family members
contain a SANT domain in this region.
Length = 507
Score = 31.6 bits (71), Expect = 3.6
Identities = 28/135 (20%), Positives = 47/135 (34%), Gaps = 7/135 (5%)
Query: 1251 DEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKVKRAVPPAPDTSTS 1310
+ EE+E + + E +EE E E R + +A + R+ +
Sbjct: 308 RAADDREEDECDQAVPLVPLTEENEEAMENEQFQR---LLKALGLRPPRSGQEGFWRIPA 364
Query: 1311 RLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVH 1370
+L S + RA +G + + + E E R+ R R G +
Sbjct: 365 KLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLA---- 420
Query: 1371 SSETETESEEEESQA 1385
S E E EEE+
Sbjct: 421 SPEEEALGEEEQKAP 435
>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
This family contains EBNA-3A, -3B, and -3C which are
latent infection nuclear proteins important for
Epstein-Barr virus (EBV)-induced B-cell immortalisation
and the immune response to EBV infection.
Length = 254
Score = 31.2 bits (71), Expect = 3.6
Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 2/53 (3%)
Query: 1239 GRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRR 1291
G+ G A + E + + E SE E+ E S++EE VPR PV+
Sbjct: 196 GQALGLSALRTPGEPDDDATVE--TSSESEDSESESDDEELPYIVPRMEPVQG 246
>gnl|CDD|226293 COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis, outer
membrane].
Length = 284
Score = 31.3 bits (71), Expect = 3.9
Identities = 13/40 (32%), Positives = 15/40 (37%), Gaps = 1/40 (2%)
Query: 1057 WDLEPPKEGKEATEPGGSLPVEPEEIQELLYHPRPDEWPQ 1096
EPP G EP P P Q +L P P W +
Sbjct: 246 SWFEPPPPGTTKPEPKAP-PPLPPSCQAVLDEPAPSSWLE 284
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair of
CXXC motifs, followed by a long region of low complexity
sequence in which roughly half the residues are Asp and
Glu, including multiple runs of five or more acidic
residues. The function of members of this family is
unknown.
Length = 104
Score = 29.6 bits (67), Expect = 3.9
Identities = 6/49 (12%), Positives = 22/49 (44%)
Query: 1233 SYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ R A + + E+EE+ +D +++ +++ +++ +
Sbjct: 35 EVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDL 83
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 30.5 bits (69), Expect = 4.1
Identities = 12/34 (35%), Positives = 25/34 (73%), Gaps = 1/34 (2%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E++DE E +++E E++EV+E+ +++E+EE
Sbjct: 138 ENDDEDE-DDDEIVEILIEDDEVDEDEDDDEDEE 170
Score = 29.3 bits (66), Expect = 8.8
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
E ED+ E E +D+ +E+E ++ EE+EE++
Sbjct: 142 EDEDDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit of
transcription factor IIF (TFIIF), which is essential for
accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 31.5 bits (71), Expect = 4.2
Identities = 29/144 (20%), Positives = 46/144 (31%), Gaps = 23/144 (15%)
Query: 1138 SDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSDVSDYDS 1197
D + S++ ED D K ++ + + + R D +YDS
Sbjct: 216 KDLEGDDEDDGDESDKGGEDGD-EEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDS 274
Query: 1198 GDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEE 1257
D GR I S SG + + E+ +
Sbjct: 275 DD-------------GDDEGREEDYISDSSA---------SGNDPEEREDKLSPEIPAKP 312
Query: 1258 EEESNDQSEEEEVEENSEEEEEEE 1281
E E ++ SEE E E+N EE +
Sbjct: 313 EIEQDEDSEESEEEKNEEEGGLSK 336
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3 subunit.
This is a family of proteins which are subunits of the
eukaryotic translation initiation factor 3 (eIF3). In
yeast it is called Hcr1. The Saccharomyces cerevisiae
protein eIF3j (HCR1) has been shown to be required for
processing of 20S pre-rRNA and binds to 18S rRNA and eIF3
subunits Rpg1p and Prt1p.
Length = 242
Score = 30.8 bits (70), Expect = 4.2
Identities = 11/33 (33%), Positives = 19/33 (57%), Gaps = 1/33 (3%)
Query: 1254 EGEEEEESNDQS-EEEEVEENSEEEEEEEAVPR 1285
+ E+E++ S +EEE EE EE+ + A +
Sbjct: 25 DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAK 57
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 31.1 bits (70), Expect = 4.4
Identities = 25/110 (22%), Positives = 39/110 (35%), Gaps = 18/110 (16%)
Query: 1053 NLSPWDLEPPKEGKEATEPGGSLPVEPEEIQELLYHPRPDEWPQG--DRDATCEKIITGL 1110
SP + P+EG+ A + ++ E H D+ P D
Sbjct: 402 RGSP--MAAPEEGRAAAAAAAA-NAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMD 458
Query: 1111 EEVMGLSVAEPFLVPVDISSLSEENSHSDSSSESGSSSQSSEEDSEDEDW 1160
E+ G D + E++S DSS ++S E DS D+ W
Sbjct: 459 EQESG-----------DERADGEDDS--DSSYSYSTTSSEDESDSADDSW 495
Score = 30.0 bits (67), Expect = 9.8
Identities = 12/44 (27%), Positives = 20/44 (45%)
Query: 1338 GDSDSEDNSQEEEQYGSRRARRTRTKGQSYAVHSSETETESEEE 1381
DSD +D ++E SY+ SSE E++S ++
Sbjct: 450 ADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADD 493
>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
selection and in elongation by RNA polymerase II
[Transcription].
Length = 521
Score = 31.2 bits (70), Expect = 4.4
Identities = 37/173 (21%), Positives = 61/173 (35%), Gaps = 20/173 (11%)
Query: 1129 SSLSEENSHSDSSSES-GSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQ--Y 1185
SS + N + D + ES G ++ SEE+ + GK + R T + R+
Sbjct: 37 SSDEDNNENVDYAEESGGEGNEKSEEEKFKNPYRLEGKFKDEADRAKIMAMTEIERESIL 96
Query: 1186 RENPSDVSDYDSGDRYAMRARQSKQVQSVQAGRRGQ--TIVTSRTRGKQSYNKFSGRGRG 1243
E ++S A+R Q + RR +TS K+ + R
Sbjct: 97 FEREEEISKLMERRELAIRMEQQHRSSGCTDTRRSTRYEPLTSAAEEKKKKLLELKKTRE 156
Query: 1244 RKA-----------PESE----DEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
R+ + +E E +EE EE VE+ S ++ E
Sbjct: 157 REERLYSERHIELQRFKDYKELEESEQGLQEEYTPSYAEEAVEDISRTDDFAE 209
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 31.3 bits (71), Expect = 4.8
Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 3/61 (4%)
Query: 1241 GRGRKAPESEDEVEG---EEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVRRANVAKV 1297
E E+E+E +S+D E E+ EE++EE P + + + A++
Sbjct: 6 NTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARL 65
Query: 1298 K 1298
K
Sbjct: 66 K 66
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 30.5 bits (69), Expect = 4.8
Identities = 12/41 (29%), Positives = 14/41 (34%)
Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
R E + E +EEEE EE EE E
Sbjct: 143 RAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEE 183
>gnl|CDD|180374 PRK06060, PRK06060, acyl-CoA synthetase; Validated.
Length = 705
Score = 31.2 bits (70), Expect = 5.0
Identities = 15/57 (26%), Positives = 25/57 (43%), Gaps = 10/57 (17%)
Query: 1069 TEPGGSLPVEPEEIQELLYHPRPDEWPQGDRDATCEKIITGLEEVMGLSVAEPFLVP 1125
TE G + V+P E++ L+ + +A E + + E G S + FLV
Sbjct: 390 TEVIGGVNVDPREVERLII----------EDEAVAEAAVVAVRESTGASTLQAFLVA 436
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 30.8 bits (69), Expect = 5.2
Identities = 24/104 (23%), Positives = 44/104 (42%), Gaps = 20/104 (19%)
Query: 1187 ENPSDVSDY-------DSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSG 1239
++ S+ DY G+ ++ +S V+ + RG Q
Sbjct: 54 QDLSEACDYIAIEEYSACGNVASIANDKSDNENDVELEGLNIIVKNEEERGTQ------- 106
Query: 1240 RGRGRKAPESEDEVEG-EEEEESNDQSEEEEVEENSEEEEEEEA 1282
K E +++VE EE EE EE + +E+SE+++E+E+
Sbjct: 107 -----KEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKES 145
>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
TFIIE alpha. TFIIE is compiled of two subunits, alpha
and beta. This family of proteins are the C terminal
domain of the alpha subunit of the protein which is the
largest subunit and contains several functional domains
which are important for basal transcription and cell
growth. The C terminal end of the protein binds directly
to the amino-terminal PH domain of p62/Tfb1 (of IIH)
which is involved in the recruitment of the general
transcription factor IIH to the transcription
preinitiation complex. P53 competes for the same binding
site as TFIIE alpha which shows their structural
similarity. Like p53, TFIIE alpha 336-439 can activate
transcription in vivo.
Length = 86
Score = 28.6 bits (64), Expect = 5.4
Identities = 12/36 (33%), Positives = 18/36 (50%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAV 1283
ES+D+ E EEE+ E++ EE E+E
Sbjct: 7 ESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVEDEPT 42
>gnl|CDD|224751 COG1838, FumA, Tartrate dehydratase beta subunit/Fumarate hydratase
class I, C-terminal domain [Energy production and
conversion].
Length = 184
Score = 30.0 bits (68), Expect = 5.9
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 8/62 (12%)
Query: 619 FLVDLNGTPYPPEVQRLVPGRENADDTELL--PDITTNDDGHVEIVEGSERNGERPVEVE 676
VDLN TP E+ +L G D L +T D H ++E +R E PV+++
Sbjct: 1 MKVDLN-TPLKEEIAKLKVG-----DVVYLSGKIVTGRDAAHKRLLEMLDRGEELPVDLK 54
Query: 677 GQ 678
G
Sbjct: 55 GH 56
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 30.9 bits (69), Expect = 6.0
Identities = 17/33 (51%), Positives = 21/33 (63%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEE 1280
E DE EE EE+ +++ EE VEEN EE EE
Sbjct: 1062 EEYDEENVEEIEENIEENIEENVEENVEENVEE 1094
>gnl|CDD|205277 pfam13096, ShortName, Family description.
Length = 177
Score = 29.9 bits (67), Expect = 6.1
Identities = 17/84 (20%), Positives = 31/84 (36%), Gaps = 14/84 (16%)
Query: 738 VVERINAFAEIESEEFIKESKKLFDSSTTTQPLVVKKEKKKKVVRQARTRAEYSE----- 792
+ E+ N + I E + D S V + E+ + ++ R+ +SE
Sbjct: 23 IQEKENLSRRVTDLNIIMEGVEFSDLSKF----VSRAEEDRDLLLFFRSLRSFSEWCEHR 78
Query: 793 -----HIQDQYEDPDNLSLSGSEE 811
H +++Y D L S E
Sbjct: 79 RETFEHFKEKYPDVVRLPEGVSGE 102
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 30.8 bits (69), Expect = 6.2
Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 5/81 (6%)
Query: 1255 GEEEEESNDQSEEEEVEENSEEEEEEEAV--PRRRPVRRANVAKVKRAVPPAPDTSTSRL 1312
GEE EE ++ E + E E+EEEE P P+ ++ + R A + R
Sbjct: 83 GEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPEVNPLDAEGLSGLAREACDALKKALLRH 142
Query: 1313 RSVRQNGRAKRT---NGRQSS 1330
R + Q R R NG Q S
Sbjct: 143 RFLWQRRRQARCEQHNGPQQS 163
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.4 bits (69), Expect = 6.3
Identities = 21/61 (34%), Positives = 28/61 (45%), Gaps = 5/61 (8%)
Query: 1245 KAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRPVR-----RANVAKVKR 1299
DE+ EES+D EEE +E++ E E E P +PVR +A K KR
Sbjct: 229 PEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKR 288
Query: 1300 A 1300
Sbjct: 289 R 289
>gnl|CDD|193000 pfam12357, PLD_C, Phospholipase D C terminal. This domain family
is found in eukaryotes, and is approximately 70 amino
acids in length. The family is found in association with
pfam00168, pfam00614. There is a conserved FPD sequence
motif. This family is the C terminal of phospholipase D.
PLD is a major plant lipid-degrading enzyme which is
involved in signal transduction.
Length = 74
Score = 28.2 bits (64), Expect = 6.3
Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 7/42 (16%)
Query: 605 DEVTNLPPHLMPPPFLVDLNG--TPYP-----PEVQRLVPGR 639
+EVT+LP HL+ P VD +G T P P+ V G
Sbjct: 31 EEVTDLPGHLLRYPVGVDRDGKVTELPGCEFFPDTGAKVLGA 72
>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727). This
domain family is found in bacteria and eukaryotes, and is
approximately 100 amino acids in length.
Length = 100
Score = 28.7 bits (65), Expect = 6.4
Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 2/41 (4%)
Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEE--NSEEEEEE 1280
RG EDE+ EEE+ +D+ EEEE E + E++E
Sbjct: 15 RGELPEPEEDEILELEEEDDDDEDEEEEYELLASFYHEDQE 55
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit; Provisional.
Length = 482
Score = 30.7 bits (70), Expect = 6.4
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 1242 RGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
+ +KA + + E EEEE+ + E+EE EE +EEE+EEE
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
Length = 903
Score = 31.0 bits (70), Expect = 6.5
Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 17/91 (18%)
Query: 710 EGTSPSGTAPTAQNKNFVKRLPLSQHKIVVERINAFAEIESEEFIKESKKLFDSS----- 764
EG S +G A A+N+ F LPL + +E+ IK +KK+F++S
Sbjct: 582 EGESAAGNAKQARNREFQAVLPLKGKILNIEK------------IKNNKKVFENSEIKLL 629
Query: 765 TTTQPLVVKKEKKKKVVRQARTRAEYSEHIQ 795
T+ L V ++ T+A E +Q
Sbjct: 630 ITSIGLSVNPVTWRQYDLSHGTKASKDESVQ 660
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.9 bits (71), Expect = 6.6
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
V+ EE+ E E ++ E+EEEEE
Sbjct: 173 FVDPNAEEDPAHVGSELEELDDDEDEEEEE 202
Score = 30.1 bits (69), Expect = 8.8
Identities = 13/46 (28%), Positives = 21/46 (45%)
Query: 1237 FSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
+G R R+ + + EE+ EE ++ EEEEE+E
Sbjct: 160 ENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDEN 205
Score = 30.1 bits (69), Expect = 9.8
Identities = 9/36 (25%), Positives = 16/36 (44%)
Query: 1247 PESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEA 1282
P +E++ E +E+E EE E ++ A
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAA 211
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein. Genome
duplication is precisely regulated by cyclin-dependent
kinases CDKs, which bring about the onset of S phase by
activating replication origins and then prevent
relicensing of origins until mitosis is completed. The
optimum sequence motif for CDK phosphorylation is
S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
potential phosphorylation sites. Drc1 is required for DNA
synthesis and S-M replication checkpoint control. Drc1
associates with Cdc2 and is phosphorylated at the onset
of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
replication by phosphorylating Drc1 and regulating its
association with Cut5. Sld2 and Sld3 represent the
minimal set of S-CDK substrates required for DNA
replication.
Length = 397
Score = 30.5 bits (69), Expect = 6.8
Identities = 29/158 (18%), Positives = 47/158 (29%), Gaps = 36/158 (22%)
Query: 1248 ESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRR----RPVRRANVAKVKRAVPP 1303
+ D + E EE+ EEEV ++ +E V ++ R RR + R V
Sbjct: 248 DELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKM----RPVRA 303
Query: 1304 APDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGSRRARRTRTK 1363
P S +SD + + ++ S
Sbjct: 304 KPSDEPS--------------------------LPESDIHEEIPKLDE-KSLSEFLGYMG 336
Query: 1364 GQSYAVHSSETETESEEEESQAAQVSFSRRGRMRKPTA 1401
G + E EE E + +V R R P +
Sbjct: 337 GIDEDDEDEDDEESKEEVEKK-QKVKKKPRKRKVNPVS 373
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1, of
polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 30.6 bits (69), Expect = 6.9
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 3/51 (5%)
Query: 1246 APESEDEVEGEEEEESNDQSEEEEVEENS---EEEEEEEAVPRRRPVRRAN 1293
A ES + E EEE D E+E E + E+E+E+E P
Sbjct: 235 ASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEE 285
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 30.5 bits (69), Expect = 7.0
Identities = 13/82 (15%), Positives = 32/82 (39%), Gaps = 13/82 (15%)
Query: 1202 AMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESEDEVEGEEEEES 1261
AM+ + + Q+ + RR + + S+D+ + + E
Sbjct: 222 AMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGD-------------GSDDDDDEDAIESD 268
Query: 1262 NDQSEEEEVEENSEEEEEEEAV 1283
D S+++ +E+ E+ + + V
Sbjct: 269 LDDSDDDVSDEDGEDLFDTDNV 290
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 29.9 bits (67), Expect = 7.5
Identities = 11/47 (23%), Positives = 22/47 (46%), Gaps = 1/47 (2%)
Query: 1239 GRGRGRKAPESEDEVEGEEE-EESNDQSEEEEVEENSEEEEEEEAVP 1284
G+ + +K E+E +G E+ ++ + E E E + E+ P
Sbjct: 92 GKKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPEDPPPEDTPPP 138
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 30.6 bits (69), Expect = 7.8
Identities = 22/147 (14%), Positives = 46/147 (31%), Gaps = 19/147 (12%)
Query: 1132 SEENSHSDSSSESGSSSQSSEEDSEDEDWTRHGKARRMPPRRTNRVGTSLGRQYRENPSD 1191
+E+ +++S S+ S + D D +A Q + D
Sbjct: 100 DDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN--------------DQTNDFDQD 145
Query: 1192 VSDYDSGDRYAMRARQSKQVQSVQAGRRGQTIVTSRTRGKQSYNKFSGRGRGRKAPESED 1251
S D+ +QS + S + + T T YN + +
Sbjct: 146 DSSNSQTDQ---GLKQSVNLSSAEKLIEEKKGQTENTFKF--YNFGNDGEEAAAKDGGKS 200
Query: 1252 EVEGEEEEESNDQSEEEEVEENSEEEE 1278
+ +D ++ E++EE++
Sbjct: 201 KSSDPGPLNDSDGQGDDGDPESAEEDK 227
>gnl|CDD|219675 pfam07972, Flavodoxin_NdrI, NrdI Flavodoxin like.
Length = 120
Score = 28.6 bits (65), Expect = 8.1
Identities = 9/35 (25%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 720 TAQNKNFVKRLPLSQHKIVVERINAFAEIESEEFI 754
T + FV++L L +I + I+ E++ E ++
Sbjct: 7 TGNTRRFVEKLGLPAQRIPIREIDEELEVD-EPYV 40
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 29.9 bits (67), Expect = 8.4
Identities = 25/129 (19%), Positives = 41/129 (31%), Gaps = 13/129 (10%)
Query: 1229 RGKQSYNKFSGRGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
K +K G G++ DE + E+S+D+ E E + EE E+ V
Sbjct: 113 LEKIVVSKQEEDGPGKE--PQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETAS 170
Query: 1289 VRRANVAKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQE 1348
+ K K + D+ ++ R D D EDN E
Sbjct: 171 EQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEE-----------DYDEEDNPVE 219
Query: 1349 EEQYGSRRA 1357
+ +
Sbjct: 220 DSKAIKEEL 228
Score = 29.9 bits (67), Expect = 8.7
Identities = 29/172 (16%), Positives = 55/172 (31%), Gaps = 13/172 (7%)
Query: 1242 RGRKAPESEDEVEGEEEEE----SNDQSEEEEVEENSEEEEEEEAVPRRRPVRRA---NV 1294
P E E EEE + Q EEEV+E + E+ V ++ +
Sbjct: 73 SEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQL 132
Query: 1295 AKVKRAVPPAPDTSTSRLRSVRQNGRAKRTNGRQSSSLLMTCSGDSDSEDNSQEEEQYGS 1354
+ K + D L + + + + + D + + QE E
Sbjct: 133 DEDKFLLAEDSDDRQETLEAGKVH-EETEDSYHVEETASEQYKQDMKEKASEQENEDSKE 191
Query: 1355 RRARRTRTKGQSYAVHSSETETESEEEESQAAQVSFSRRGRMRKPTAKLRGL 1406
+ RTK + + E +EE + S + + + K + +
Sbjct: 192 PVEKAERTKA-----ETDDVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQQE 238
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle pole
microtubules during metaphase. In Xenopus, it has been
shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 27.3 bits (61), Expect = 8.8
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 7/40 (17%)
Query: 1257 EEEESNDQSEEEEVEENSEEEEEEEA-------VPRRRPV 1289
EE+E ++E+EE E +EEEEE VP+ P+
Sbjct: 18 EEKEKALEAEKEEAEARQKEEEEEAIKQLRKELVPKANPM 57
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is a
family of uncharacterized proteins.
Length = 449
Score = 30.0 bits (67), Expect = 9.4
Identities = 13/42 (30%), Positives = 29/42 (69%)
Query: 1240 RGRGRKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEE 1281
R G++A E E E ++EE+ +D +E+E +++ +++E++E
Sbjct: 39 RKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDE 80
>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 181 to 207 amino acids in length. This domain has
two conserved sequence motifs: PIP and CEICG. The domain
carries a zinc-finger domain of the C2H2-type.
Length = 187
Score = 29.5 bits (67), Expect = 9.9
Identities = 14/45 (31%), Positives = 24/45 (53%)
Query: 1244 RKAPESEDEVEGEEEEESNDQSEEEEVEENSEEEEEEEAVPRRRP 1288
RK +E+E + +E S D SE+ +++ ++EEE P P
Sbjct: 27 RKQALTEEERQASADESSEDASEDGSDDDSDDDEEEPIYNPLNLP 71
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.132 0.390
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 71,547,693
Number of extensions: 7130460
Number of successful extensions: 10702
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8614
Number of HSP's successfully gapped: 489
Length of query: 1407
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1298
Effective length of database: 6,103,016
Effective search space: 7921714768
Effective search space used: 7921714768
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 65 (28.9 bits)