RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11693
(579 letters)
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 62.9 bits (153), Expect = 3e-10
Identities = 31/123 (25%), Positives = 41/123 (33%), Gaps = 13/123 (10%)
Query: 463 NIPLQHITPQVQLHQQQQQQPPPQPAH--IPPPQIQHIPPPIQHQPVQ---------NIH 511
P PQ QQ PPPQP PPQ Q PPP +
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLP 297
Query: 512 PSIQHIQQNLQQNIQQNLQQNIQQNIQ-HLLQQNMHVANSSHHMSQQRMHQLYAL-HHDG 569
P Q L Q Q + + L QQ + +R H++ +L ++G
Sbjct: 298 PPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQEEAKRAKRRHKIVSLSKYNG 357
Query: 570 YMT 572
MT
Sbjct: 358 LMT 360
Score = 54.8 bits (132), Expect = 9e-08
Identities = 23/120 (19%), Positives = 29/120 (24%), Gaps = 1/120 (0%)
Query: 461 NMNIPLQHITPQVQLHQQ-QQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQ 519
P Q PQ +Q PPQ Q +P P Q + P +
Sbjct: 182 QGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPP 241
Query: 520 NLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALHHDGYMTATASIMQ 579
LQQ L Q + QQ Q + G Q
Sbjct: 242 PLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQ 301
Score = 49.8 bits (119), Expect = 3e-06
Identities = 21/110 (19%), Positives = 26/110 (23%), Gaps = 8/110 (7%)
Query: 466 LQHITPQVQLHQQQQQQPPPQ----PAHIPPPQI---QHIPPPIQHQ-PVQNIHPSIQHI 517
L+ + Q+Q QQ Q P P P +PP Q Q PP P Q
Sbjct: 156 LEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQ 215
Query: 518 QQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALHH 567
Q Q Q + Q
Sbjct: 216 PQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQ 265
Score = 49.0 bits (117), Expect = 6e-06
Identities = 27/112 (24%), Positives = 30/112 (26%), Gaps = 17/112 (15%)
Query: 465 PLQHITPQVQLHQQQQQQ-----PPPQPAHIPPPQIQHI--------PPPIQHQPVQNIH 511
+ + PQ L Q PP P PP Q PPP Q Q
Sbjct: 210 HPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQP 269
Query: 512 PSIQHIQ--QNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
P Q QN Q Q QQ + QQR Q
Sbjct: 270 PQPQAQPPPQNQPTPHPGLPQG--QNAPLPPPQQPQLLPLVQQPQGQQRGPQ 319
Score = 37.1 bits (86), Expect = 0.029
Identities = 20/105 (19%), Positives = 26/105 (24%), Gaps = 17/105 (16%)
Query: 471 PQVQLHQQQQQQPPPQPAHIPP-------------PQIQHIP-PPIQHQPVQNIHPSIQH 516
P+ Q Q + QP PQ Q +P PP Q P
Sbjct: 132 PEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQ-AA 190
Query: 517 IQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
Q Q Q + + + Q A S Q
Sbjct: 191 FPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQ--APAQPPLP 233
Score = 29.7 bits (67), Expect = 5.2
Identities = 17/100 (17%), Positives = 24/100 (24%), Gaps = 20/100 (20%)
Query: 477 QQQQQQPPPQ----PAHIP-----------PPQIQHIPPPIQHQPVQNIHPSIQHIQQNL 521
Q + P +IP P+ + PP P Q Q +
Sbjct: 99 TFSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEPQPPQAPES-----QPQPQTPAQKM 153
Query: 522 QQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
+ Q +Q L Q V Q Q
Sbjct: 154 LSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQ 193
Score = 29.0 bits (65), Expect = 7.9
Identities = 8/67 (11%), Positives = 16/67 (23%), Gaps = 1/67 (1%)
Query: 457 SNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQH 516
+ + P Q Q PQ PQ + + Q + +
Sbjct: 283 TPHPGLPQGQNAPLPPPQQPQLLPLVQQPQG-QQRGPQFREQLVQLSQQQREALSQEEAK 341
Query: 517 IQQNLQQ 523
+ +
Sbjct: 342 RAKRRHK 348
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 51.2 bits (122), Expect = 1e-06
Identities = 26/99 (26%), Positives = 32/99 (32%), Gaps = 6/99 (6%)
Query: 467 QHITPQVQLHQQQQ---QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 523
Q P QQ Q Q P Q PP QP +QQ Q
Sbjct: 159 QSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQ--QPPQMGQPGMPGGGGQGQMQQQGQP 216
Query: 524 NIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQL 562
QQ +QQ +Q+ QQ M +Q +M Q
Sbjct: 217 GGQQQQNPQMQQQLQNQQQQQMD-QQQGPADAQAQMGQQ 254
Score = 47.3 bits (112), Expect = 2e-05
Identities = 18/99 (18%), Positives = 21/99 (21%), Gaps = 2/99 (2%)
Query: 466 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 525
Q QQ PPQ P Q +QQ LQ
Sbjct: 175 GQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQ 234
Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYA 564
QQ + Q M Q+M
Sbjct: 235 QQQMDQQQGPADAQAQMGQQQQGQGG--MQPQQMQGGQM 271
Score = 41.1 bits (96), Expect = 0.002
Identities = 22/93 (23%), Positives = 29/93 (31%), Gaps = 6/93 (6%)
Query: 471 PQVQLHQQQQ--QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQN 528
Q Q+ QQQQ PQ Q+ P Q P Q S + N Q +
Sbjct: 247 AQAQMGQQQQGQGGMQPQQMQGGQMQV----PMQQQPPQQQPQQSQLGMLPNQMQQMPGG 302
Query: 529 LQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
Q Q + Q+ V + Q M
Sbjct: 303 GQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSA 335
Score = 40.4 bits (94), Expect = 0.003
Identities = 22/92 (23%), Positives = 30/92 (32%), Gaps = 1/92 (1%)
Query: 467 QHITPQVQLHQQQ-QQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 525
Q QV + QQ QQQP + P Q+Q +P Q P Q + P Q Q
Sbjct: 267 QGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQ 326
Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQ 557
Q + ++ QQ
Sbjct: 327 AVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQ 358
Score = 39.6 bits (92), Expect = 0.005
Identities = 20/94 (21%), Positives = 32/94 (34%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
L + Q+Q Q P QP PP + +P Q + Q ++Q +N
Sbjct: 288 QLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRN 347
Query: 525 IQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQR 558
++ Q QQ Q H + + Q
Sbjct: 348 MRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGG 381
Score = 37.7 bits (87), Expect = 0.019
Identities = 20/97 (20%), Positives = 28/97 (28%), Gaps = 7/97 (7%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
P Q P + Q Q Q + +Q+Q Q + QQ +
Sbjct: 193 PPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMD------QQQGPAD 246
Query: 525 IQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
Q + Q Q + Q M M QQ Q
Sbjct: 247 AQAQMGQQQQGQ-GGMQPQQMQGGQMQVPMQQQPPQQ 282
Score = 33.4 bits (76), Expect = 0.35
Identities = 14/69 (20%), Positives = 19/69 (27%), Gaps = 4/69 (5%)
Query: 462 MNIPLQHITPQVQLHQQQQQQPPP--QPAHIPPPQI--QHIPPPIQHQPVQNIHPSIQHI 517
P+ + + Q Q P P PP + IP P P + S
Sbjct: 413 SPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPA 472
Query: 518 QQNLQQNIQ 526
Q Q
Sbjct: 473 SQRTIQQDM 481
Score = 33.4 bits (76), Expect = 0.39
Identities = 23/97 (23%), Positives = 30/97 (30%), Gaps = 7/97 (7%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAH-IPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 523
Q+ Q QL QQQQQ Q Q+ Q + Q +
Sbjct: 220 QQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQ----GGQMQVPM 275
Query: 524 NIQQNLQQNIQQN-IQHLLQQNMHVANSSHHMSQQRM 559
QQ QQ QQ+ + L Q + Q M
Sbjct: 276 Q-QQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPM 311
Score = 29.2 bits (65), Expect = 7.0
Identities = 19/109 (17%), Positives = 24/109 (22%), Gaps = 24/109 (22%)
Query: 472 QVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP-SIQHIQQNLQQNI----- 525
+V Q Q PQ Q +Q Q N Q
Sbjct: 137 RVGTMQPGGQAGGMMQQSSGQPQSQQ-----PNQMGPQQGQAQGQAGGMNQGQQGPVGQQ 191
Query: 526 -------------QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
Q Q QQN + + QQ+M Q
Sbjct: 192 QPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQ 240
Score = 29.2 bits (65), Expect = 8.1
Identities = 21/103 (20%), Positives = 26/103 (25%), Gaps = 1/103 (0%)
Query: 460 ANMNIPLQHITPQVQLHQQ-QQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
M Q Q QQQQ P + Q Q + Q Q Q
Sbjct: 199 PGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQ 258
Query: 519 QNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
+Q Q Q + Q QQ Q+M
Sbjct: 259 GGMQPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPG 301
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 44.3 bits (104), Expect = 2e-04
Identities = 25/82 (30%), Positives = 29/82 (35%), Gaps = 2/82 (2%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
P Q + PQ Q QQ QQP PQ P P QP Q + P Q+ Q
Sbjct: 768 PQQPVAPQPQ--YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA 825
Query: 525 IQQNLQQNIQQNIQHLLQQNMH 546
Q QQ Q +H
Sbjct: 826 PQPQYQQPQQPVAPQPQDTLLH 847
Score = 40.1 bits (93), Expect = 0.003
Identities = 25/98 (25%), Positives = 31/98 (31%), Gaps = 3/98 (3%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
P Q + PQ Q Q QQ P P Q + P Q+Q Q QQ Q
Sbjct: 755 PQQPVAPQQQYQQPQQPVAPQPQYQQPQ---QPVAPQPQYQQPQQPVAPQPQYQQPQQPV 811
Query: 525 IQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQL 562
Q Q QQ + Q + +H L
Sbjct: 812 APQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPL 849
Score = 38.9 bits (90), Expect = 0.008
Identities = 27/92 (29%), Positives = 36/92 (39%), Gaps = 6/92 (6%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQH-QPVQNIHPSIQHIQQN---LQQNI 525
TP V+ QQ QQ PQ + P Q + P Q+ QP Q + P Q+ Q Q
Sbjct: 746 TPIVEPVQQPQQPVAPQQQYQQPQQ--PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ 803
Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQ 557
Q QQ + Q+ Q + QQ
Sbjct: 804 YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ 835
Score = 33.5 bits (76), Expect = 0.37
Identities = 16/95 (16%), Positives = 25/95 (26%), Gaps = 2/95 (2%)
Query: 469 ITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQN 528
I P + + QQ Q P + P Q P + P + +Q QQ
Sbjct: 373 IAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQ- 431
Query: 529 LQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLY 563
Q + + Q +Q
Sbjct: 432 -PYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
Score = 33.1 bits (75), Expect = 0.54
Identities = 21/112 (18%), Positives = 35/112 (31%), Gaps = 5/112 (4%)
Query: 407 NKSFLQKTQLQNHLYTHSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVITSNSANMNIPL 466
N+ Q Q Q Y + +P A + Q + + N+
Sbjct: 394 NEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQ 453
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
PQ Q +Q QP + P P++ QPV P ++ +
Sbjct: 454 STFAPQST---YQTEQTYQQP--AAQEPLYQQPQPVEQQPVVEPEPVVEETK 500
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 42.8 bits (101), Expect = 3e-04
Identities = 23/127 (18%), Positives = 30/127 (23%), Gaps = 19/127 (14%)
Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPP------------QPAHIPPPQ---- 494
+ Q P Q Q Q P PA PP
Sbjct: 45 AEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALP 104
Query: 495 IQHIPPPIQHQPVQN---IHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSS 551
P IQ +P Q P + Q Q+ +QQ Q S
Sbjct: 105 AGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQ 164
Query: 552 HHMSQQR 558
+Q+
Sbjct: 165 QQSAQKN 171
Score = 36.3 bits (84), Expect = 0.040
Identities = 17/113 (15%), Positives = 22/113 (19%), Gaps = 19/113 (16%)
Query: 479 QQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNI----- 533
Q PP A +P P Q P Q
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAG---DQQQHNTPTGSPAANPPATFALPAGP 107
Query: 534 -QQNIQ----HLLQQNMHVANS-----SHHMSQQRMHQLYALHHDGYMTATAS 576
IQ L + V + S + L L Y +
Sbjct: 108 AGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQR-YGAPASG 159
Score = 33.9 bits (78), Expect = 0.20
Identities = 18/69 (26%), Positives = 22/69 (31%), Gaps = 4/69 (5%)
Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 510
V ++T N AN + P Q QQ QQ PA P Q +Q
Sbjct: 124 VPVMVTQNPANSPLD----QPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQ 179
Query: 511 HPSIQHIQQ 519
QQ
Sbjct: 180 PNGETPPQQ 188
Score = 32.0 bits (73), Expect = 0.75
Identities = 17/107 (15%), Positives = 28/107 (26%), Gaps = 7/107 (6%)
Query: 457 SNSANMNIPL-QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 515
+ +P+ P Q QQ Q P Q Q Q
Sbjct: 117 GQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDE--- 173
Query: 516 HIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQL 562
Q Q N + QQ + + + + + +QR+
Sbjct: 174 -SQLQQQPNGETPPQQ-TDGAGDDESEALVRLREADGTL-EQRIKGA 217
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 42.8 bits (100), Expect = 4e-04
Identities = 38/278 (13%), Positives = 71/278 (25%), Gaps = 30/278 (10%)
Query: 286 NSLFRCDQCGKAFSRKGHLVQHSFVHGGEKPFKCSYCPKAFSHKGHLNEHLYTHSENKCF 345
C C +FSR HL +H H GEKP +CSY S L
Sbjct: 31 PRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLE------------ 78
Query: 346 QCKYCPKTFTLRKHLKAHTNKHEGVLPHTCLECSKCFSTKSELNRHAQIHGGVKPFACTV 405
L +HL+ H N + + S S + + +
Sbjct: 79 ----------LSRHLRTHHNNPSDLNSKS---LPLSNSKASSSSLSSSSSNSNDNNLLSS 125
Query: 406 CNKSFLQKTQLQNHLYTHSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVITSNSANMNIP 465
+ + L + S ++ S V + S + N+
Sbjct: 126 HSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLS 185
Query: 466 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 525
L + + P ++ P Q+ + + Q +++
Sbjct: 186 LLISSNVSTSIPSSSENSPLSSSYSIPSSSSD-----QNLENSSSSLPLTTNSQLSPKSL 240
Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLY 563
++ + ++ SQ
Sbjct: 241 LSQSPSSLSSSDSSSSASESPRSSLPTASSQSSSPNES 278
Score = 37.0 bits (85), Expect = 0.023
Identities = 48/202 (23%), Positives = 67/202 (33%), Gaps = 19/202 (9%)
Query: 266 SKAFTQQYNLKHHLLL-HGDGNSLFRCDQCGKAFSRKGHLVQH--SFVHGGE--KPFKC- 319
A +Q + + + QC +FSR L +H S H GE KPF C
Sbjct: 266 PTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCP 325
Query: 320 -SYCPKAFSHKGHLNEHLYTHSENKCFQCKYCPKTFTLRKHLKA-------HTNKHEGVL 371
S C K FS L H+ H+ + K + L +
Sbjct: 326 YSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDK 385
Query: 372 PHTCLECSKCFSTKSELNR--HAQIHGGVKP--FACTVCNKSFLQKTQLQNHLYTHSKVK 427
L S + K + N H H +P C+KSF + L H H+
Sbjct: 386 KSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHA 445
Query: 428 PYNCGICAKSFVQRNTYIKHLE 449
P C I KSF + H +
Sbjct: 446 PLLCSIL-KSFRRDLDLSNHGK 466
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 40.5 bits (95), Expect = 5e-04
Identities = 19/56 (33%), Positives = 22/56 (39%), Gaps = 2/56 (3%)
Query: 465 PLQHITPQVQLHQQQ--QQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
P QH P + QQ Q QP P P Q Q PI P Q P + +Q
Sbjct: 84 PTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQ 139
Score = 37.5 bits (87), Expect = 0.007
Identities = 22/76 (28%), Positives = 23/76 (30%), Gaps = 11/76 (14%)
Query: 475 LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSI-----------QHIQQNLQQ 523
LH Q P HIP QPV P + QH Q NL Q
Sbjct: 35 LHHQIIPVSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQ 94
Query: 524 NIQQNLQQNIQQNIQH 539
QQ Q Q Q
Sbjct: 95 PAQQPFQPQPLQPPQP 110
Score = 33.2 bits (76), Expect = 0.16
Identities = 17/63 (26%), Positives = 22/63 (34%), Gaps = 8/63 (12%)
Query: 452 HGVITSNSANMNIPLQHITP------QVQLHQQQQQQPPPQPAHIPPPQIQHIPP--PIQ 503
H + + N+P P Q QQ Q PP P P +PP P+Q
Sbjct: 80 HSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQ 139
Query: 504 HQP 506
P
Sbjct: 140 PLP 142
Score = 33.2 bits (76), Expect = 0.20
Identities = 19/88 (21%), Positives = 26/88 (29%), Gaps = 11/88 (12%)
Query: 454 VITSNSANMNIPLQHITPQVQLHQQQQQQPP--PQPAHIPPPQIQH----IPPPIQH--- 504
V + + H P + Q Q P P P QH +P P Q
Sbjct: 42 VSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQ 101
Query: 505 -QPVQNIHPSIQHIQQNLQQNIQQNLQQ 531
QP+Q Q +Q + L
Sbjct: 102 PQPLQPP-QPQQPMQPQPPVHPIPPLPP 128
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 39.9 bits (93), Expect = 0.003
Identities = 27/82 (32%), Positives = 28/82 (34%), Gaps = 3/82 (3%)
Query: 461 NMNIPLQHITPQVQLHQQQQ---QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHI 517
P H P QQ Q QQP PQ PP Q PPP Q Q P Q
Sbjct: 89 QSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVP 148
Query: 518 QQNLQQNIQQNLQQNIQQNIQH 539
QQ Q+ Q Q Q
Sbjct: 149 QQQQYQSPPQQPQYQQNPPPQA 170
Score = 37.6 bits (87), Expect = 0.013
Identities = 24/86 (27%), Positives = 24/86 (27%), Gaps = 3/86 (3%)
Query: 461 NMNIPLQHITPQVQLHQQQQQ--QPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
P Q Q Q Q PPQ P Q P P Q P Q Q
Sbjct: 78 APQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQ 137
Query: 519 -QNLQQNIQQNLQQNIQQNIQHLLQQ 543
Q Q Q QQ Q Q Q
Sbjct: 138 AQQPQPPPQVPQQQQYQSPPQQPQYQ 163
Score = 34.5 bits (79), Expect = 0.14
Identities = 14/71 (19%), Positives = 19/71 (26%), Gaps = 3/71 (4%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIP--PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
Q P P PP Q+ P + Q VQ++ Q+
Sbjct: 66 LQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQY-PSQLPPQQVQSVPQQPTPQQEPYYPP 124
Query: 525 IQQNLQQNIQQ 535
Q QQ
Sbjct: 125 PSQPQPPPAQQ 135
Score = 34.1 bits (78), Expect = 0.18
Identities = 18/47 (38%), Positives = 19/47 (40%)
Query: 478 QQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
QQ Q PQP P Q Q+ PP Q Q QN P Q Q
Sbjct: 134 QQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY 180
Score = 33.0 bits (75), Expect = 0.46
Identities = 23/92 (25%), Positives = 30/92 (32%), Gaps = 7/92 (7%)
Query: 471 PQVQLHQQQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 529
QV H+ A +PP P Q P Q Q PS Q+ Q Q +Q
Sbjct: 54 EQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQS--QAPPSHQYPSQLPPQQVQSVP 111
Query: 530 QQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
QQ Q + + +QQ Q
Sbjct: 112 QQPTPQQEPYYPPP----SQPQPPPAQQPQAQ 139
Score = 32.6 bits (74), Expect = 0.51
Identities = 22/77 (28%), Positives = 27/77 (35%), Gaps = 4/77 (5%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIP--PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 522
+Q + Q Q+ PP QP P PQ Q PP Q Q Q Q Q
Sbjct: 106 QVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQY--Q 163
Query: 523 QNIQQNLQQNIQQNIQH 539
QN Q Q + +
Sbjct: 164 QNPPPQAQSAPQVSGLY 180
Score = 31.8 bits (72), Expect = 1.0
Identities = 18/57 (31%), Positives = 21/57 (36%), Gaps = 5/57 (8%)
Query: 462 MNIPLQHITPQVQLHQQQQQQ---PP--PQPAHIPPPQIQHIPPPIQHQPVQNIHPS 513
Q PQ QQQQ PP PQ PPPQ Q P P ++ +
Sbjct: 132 PAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQP 188
Score = 28.8 bits (64), Expect = 7.6
Identities = 13/49 (26%), Positives = 16/49 (32%), Gaps = 2/49 (4%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS 513
P Q Q QQ PPPQ P + P Q+ P+
Sbjct: 148 PQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY--PEESPYQPQSYPPN 194
>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
transition [Transcription / Cell division and chromosome
partitioning].
Length = 423
Score = 38.2 bits (88), Expect = 0.011
Identities = 22/77 (28%), Positives = 33/77 (42%), Gaps = 7/77 (9%)
Query: 209 NSVHKDVKKYNCE--ECDKYYLSQGGVGCVFLY-YLSQGELDSHVQTSHRGI----KPYV 261
KD K Y C C+K Y +Q G+ L+ + +Q ++ KPY
Sbjct: 341 MLKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYR 400
Query: 262 CEICSKAFTQQYNLKHH 278
CE+C K + LK+H
Sbjct: 401 CEVCDKRYKNLNGLKYH 417
Score = 38.2 bits (88), Expect = 0.012
Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 13/87 (14%)
Query: 256 GIKPYVCEI--CSKAFTQQYNLKHHLLLHGDGNSLFRCDQCGKAFSRKGHLVQHSFVHGG 313
KPY C + C+K + Q LK+H +LHG + + +
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYH-MLHG---------HQNQKLHENPSPEKMNIFSAK 395
Query: 314 EKPFKCSYCPKAFSHKGHLNEHLYTHS 340
+KP++C C K + + L H HS
Sbjct: 396 DKPYRCEVCDKRYKNLNGLKYHR-KHS 421
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 38.4 bits (89), Expect = 0.012
Identities = 17/45 (37%), Positives = 18/45 (40%), Gaps = 3/45 (6%)
Query: 471 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 515
P V + PP QP P PQ PPP Q QP P Q
Sbjct: 2889 PAVSRSTESFALPPDQPERPPQPQ---APPPPQPQPQPPPPPQPQ 2930
Score = 35.3 bits (81), Expect = 0.12
Identities = 14/32 (43%), Positives = 14/32 (43%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 501
PQ Q Q QPPP P PPP P P
Sbjct: 2910 QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
Score = 34.9 bits (80), Expect = 0.14
Identities = 15/44 (34%), Positives = 15/44 (34%), Gaps = 5/44 (11%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS 513
PQ Q Q QPPP P P P P P PS
Sbjct: 2918 QPQPQPPPPPQPQPPPPPPPRPQP-----PLAPTTDPAGAGEPS 2956
Score = 34.9 bits (80), Expect = 0.16
Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 1/38 (2%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQ-HIPPPIQHQP 506
+ + Q + PPQP PPPQ Q PPP Q QP
Sbjct: 2894 STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931
Score = 34.5 bits (79), Expect = 0.22
Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 1/37 (2%)
Query: 471 PQVQLHQQQQQQPPPQPAHIPPPQIQHIP-PPIQHQP 506
P Q + Q Q PP P P P P PP P
Sbjct: 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
Score = 32.6 bits (74), Expect = 0.77
Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 1/51 (1%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 515
L P+ Q P PQP PPPQ Q PPP +P + P+
Sbjct: 2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTD 2948
Score = 31.8 bits (72), Expect = 1.3
Identities = 8/37 (21%), Positives = 9/37 (24%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 501
P P Q PPP+P P
Sbjct: 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 37.3 bits (87), Expect = 0.015
Identities = 21/92 (22%), Positives = 34/92 (36%), Gaps = 7/92 (7%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
P P Q +Q Q P P P PQ H P Q Q P + + +
Sbjct: 126 PQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEP 185
Query: 525 IQQNLQQNIQQNIQHLLQQNMHVANSSHHMSQ 556
+ ++ + ++ M+VA +HH S+
Sbjct: 186 APVMDKPKRKEAV--IV---MNVA--AHHGSE 210
Score = 33.9 bits (78), Expect = 0.22
Identities = 19/76 (25%), Positives = 28/76 (36%), Gaps = 8/76 (10%)
Query: 468 HITPQVQLHQQQQQQPP-----PQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 522
+ Q Q QPP P+ PP+ +PP +P Q +Q Q Q
Sbjct: 79 QEHEAARPSPQHQYQPPYASAQPRQPVQQPPEA-QVPPQHAPRPAQPAPQPVQ--QPAYQ 135
Query: 523 QNIQQNLQQNIQQNIQ 538
+Q LQQ + +
Sbjct: 136 PQPEQPLQQPVSPQVA 151
Score = 28.5 bits (64), Expect = 9.7
Identities = 19/79 (24%), Positives = 26/79 (32%), Gaps = 8/79 (10%)
Query: 473 VQLHQQQQQQPPPQPAHIPPPQIQH-IPPPIQH-QPVQNIHPSIQ------HIQQNLQQN 524
V++H+ Q P QH PP QP Q + + H + Q
Sbjct: 66 VRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPA 125
Query: 525 IQQNLQQNIQQNIQHLLQQ 543
Q Q Q + LQQ
Sbjct: 126 PQPVQQPAYQPQPEQPLQQ 144
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 37.8 bits (87), Expect = 0.018
Identities = 9/52 (17%), Positives = 35/52 (67%), Gaps = 3/52 (5%)
Query: 494 QIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
Q +++P ++ NI + ++N++++ ++N+++N+++N++ +++N+
Sbjct: 931 QNENVPEHLKEHAEANIE---EDAEENVEEDAEENVEENVEENVEENVEENV 979
Score = 35.1 bits (80), Expect = 0.12
Identities = 16/112 (14%), Positives = 53/112 (47%), Gaps = 26/112 (23%)
Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 510
+ G I +N L++ V H ++ + A+I + ++ +N+
Sbjct: 913 LSGNIAHEINLINKELKNQNENVPEHLKEHAE-----ANIE----EDAEENVEEDAEENV 963
Query: 511 HPSIQ-----------------HIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
+++ ++++N+++N+++N+++NI++N++ +++N+
Sbjct: 964 EENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENI 1015
Score = 31.7 bits (71), Expect = 1.4
Identities = 9/43 (20%), Positives = 30/43 (69%)
Query: 496 QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQ 538
++I I+ +N+ +++ I++N+++N+++N ++N ++N +
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAE 1116
Score = 30.5 bits (68), Expect = 3.2
Identities = 9/38 (23%), Positives = 27/38 (71%)
Query: 508 QNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
+NI +I+ + + I++N+++NI++N++ +++N+
Sbjct: 1055 ENIEENIEEYDEENVEEIEENIEENIEENVEENVEENV 1092
Score = 30.5 bits (68), Expect = 3.4
Identities = 7/31 (22%), Positives = 28/31 (90%)
Query: 515 QHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
+++++N+++N+++N+++NI++N++ ++N+
Sbjct: 997 ENVEENIEENVEENVEENIEENVEEYDEENV 1027
Score = 30.1 bits (67), Expect = 4.3
Identities = 7/28 (25%), Positives = 26/28 (92%)
Query: 513 SIQHIQQNLQQNIQQNLQQNIQQNIQHL 540
+++ I++N+++NI++N+++N+++N++ +
Sbjct: 1068 NVEEIEENIEENIEENVEENVEENVEEI 1095
Score = 29.4 bits (65), Expect = 6.2
Identities = 8/45 (17%), Positives = 27/45 (60%)
Query: 508 QNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSH 552
+N+ +++++N+++N ++N ++N ++N + +N N +
Sbjct: 1090 ENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNEEY 1134
Score = 29.0 bits (64), Expect = 9.0
Identities = 8/56 (14%), Positives = 32/56 (57%)
Query: 494 QIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVAN 549
++ I I+ +N+ +++ + +++N+++N+++N ++N + ++N +
Sbjct: 1068 NVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYD 1123
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 37.6 bits (88), Expect = 0.018
Identities = 12/60 (20%), Positives = 18/60 (30%), Gaps = 1/60 (1%)
Query: 477 QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 536
QQ Q Q Q Q Q Q S+ QQ L+ + + + +
Sbjct: 625 QQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDELGR-QRGGLPGM 683
Score = 32.6 bits (75), Expect = 0.70
Identities = 12/69 (17%), Positives = 21/69 (30%), Gaps = 6/69 (8%)
Query: 477 QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 536
Q+ +QQ Q Q Q Q Q +L + QQ L+ + +
Sbjct: 618 QRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAER-QQALRDELGR- 675
Query: 537 IQHLLQQNM 545
+ +
Sbjct: 676 ----QRGGL 680
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 36.6 bits (85), Expect = 0.021
Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 1/53 (1%)
Query: 458 NSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIP-PPIQHQPVQN 509
N +P + Q Q QQQ QP + PP+ P P PV+
Sbjct: 108 NEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRT 160
Score = 36.6 bits (85), Expect = 0.021
Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 1/43 (2%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP 512
P+ + QQQ Q PA PQ PP PVQ P
Sbjct: 114 VPRSTVQIQQQAQQQQPPATTAQPQ-PVTPPRQTTAPVQPQTP 155
Score = 35.0 bits (81), Expect = 0.083
Identities = 11/43 (25%), Positives = 11/43 (25%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQN 509
Q P Q Q PP Q PQ PV
Sbjct: 126 QQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168
Score = 30.8 bits (70), Expect = 1.6
Identities = 30/108 (27%), Positives = 38/108 (35%), Gaps = 18/108 (16%)
Query: 440 QRNTYIKHLEIVH-GVIT----SNSANMNIPLQHITPQVQLHQQQQ---QQPPPQPAHIP 491
+R YIK LE GV T S +N P Q Q QL +Q Q +Q P Q + +P
Sbjct: 47 ERWRYIKELENRQIGVPTPTEPSAGGEVNSPTQLTDEQRQLLEQMQADMRQQPTQLSEVP 106
Query: 492 PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQH 539
Q Q ++Q QQ QQ Q
Sbjct: 107 ----------YNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPR 144
Score = 29.6 bits (67), Expect = 3.8
Identities = 10/44 (22%), Positives = 10/44 (22%)
Query: 463 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQP 506
I Q Q Q Q P P Q Q P
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPA 163
Score = 29.6 bits (67), Expect = 4.3
Identities = 9/39 (23%), Positives = 12/39 (30%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ 503
P + +Q P QP P + Q P Q
Sbjct: 130 PPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168
Score = 28.5 bits (64), Expect = 8.6
Identities = 9/37 (24%), Positives = 11/37 (29%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 501
P P+ Q Q P P P Q + P
Sbjct: 137 PQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAP 173
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 32.4 bits (74), Expect = 0.026
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 303 HLVQHSFVHGGEKPFKCSYCPKAFSH 328
+L +H H GEKP+KC C K+FS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 32.0 bits (73), Expect = 0.031
Identities = 13/26 (50%), Positives = 15/26 (57%)
Query: 387 ELNRHAQIHGGVKPFACTVCNKSFLQ 412
L RH + H G KP+ C VC KSF
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 31.6 bits (72), Expect = 0.054
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 416 LQNHLYTHSKVKPYNCGICAKSFVQ 440
L+ H+ TH+ KPY C +C KSF
Sbjct: 2 LRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 29.3 bits (66), Expect = 0.36
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 331 HLNEHLYTHSENKCFQCKYCPKTFT 355
+L H+ TH+ K ++C C K+F+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFS 25
Score = 28.9 bits (65), Expect = 0.39
Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 245 ELDSHVQTSHRGIKPYVCEICSKAFTQ 271
L H++T H G KPY C +C K+F+
Sbjct: 1 NLRRHMRT-HTGEKPYKCPVCGKSFSS 26
Score = 27.4 bits (61), Expect = 1.8
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 359 HLKAHTNKHEGVLPHTCLECSKCFST 384
+L+ H H G P+ C C K FS+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 26.2 bits (58), Expect = 4.1
Identities = 8/26 (30%), Positives = 10/26 (38%)
Query: 175 EFNEHIMKHEKEKLFSCNFCLKPFLR 200
H+ H EK + C C K F
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein. The frequency clock
protein, is the central component of the frq-based
circadian negative feedback loop, regulates various
aspects of the circadian clock in Neurospora crassa.
This protein has been shown to interact with itself via
a coiled-coil.
Length = 989
Score = 37.3 bits (86), Expect = 0.027
Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 5/65 (7%)
Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP----IQHQP 506
V ++ + +++ L + Q Q Q PP I P P I Q
Sbjct: 316 VRRTQSTLTPSVDAALPPVQAQQQEGTQMAPPQPP-SNFITNPFSSVPEPLREARILPQE 374
Query: 507 VQNIH 511
QN H
Sbjct: 375 GQNPH 379
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 36.2 bits (83), Expect = 0.034
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)
Query: 469 ITPQVQLHQQQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHP 512
I P +++ Q+ QP P PQ+Q P P Q I+P
Sbjct: 130 IDPTEEVNTQEPTQPAGVNVANNPQPQVQ---PQFGPNPQQRINP 171
Score = 30.4 bits (68), Expect = 2.1
Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 11/71 (15%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 526
+ + Q+Q +Q +Q Q I P + + P Q P+ ++ N Q +Q
Sbjct: 110 EQMAEQLQRISEQNEQ---QAIEIDPTEEVNTQEPTQ--------PAGVNVANNPQPQVQ 158
Query: 527 QNLQQNIQQNI 537
N QQ I
Sbjct: 159 PQFGPNPQQRI 169
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 36.2 bits (83), Expect = 0.040
Identities = 17/64 (26%), Positives = 20/64 (31%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
P P VQ QQ P PQ Q + P Q P Q Q + +
Sbjct: 201 PKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSR 260
Query: 525 IQQN 528
Q N
Sbjct: 261 EQGN 264
Score = 34.3 bits (78), Expect = 0.18
Identities = 18/74 (24%), Positives = 25/74 (33%), Gaps = 1/74 (1%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 526
+H Q Q P QP + P Q P Q P+ Q+ Q Q Q
Sbjct: 196 RHRHAPKPTQQPTVQNPAQQPT-VQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254
Query: 527 QNLQQNIQQNIQHL 540
+ + Q N + L
Sbjct: 255 GHKRSREQGNQEFL 268
Score = 33.9 bits (77), Expect = 0.23
Identities = 25/74 (33%), Positives = 28/74 (37%), Gaps = 14/74 (18%)
Query: 483 PPPQPAHIPPPQI------------QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 530
PP P P P + +H P P Q VQN P+ Q QN Q QQ Q
Sbjct: 173 PPNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQN--PAQQPTVQNPAQQPQQQPQ 230
Query: 531 QNIQQNIQHLLQQN 544
Q Q Q QN
Sbjct: 231 QQPVQPAQQPTPQN 244
Score = 33.9 bits (77), Expect = 0.26
Identities = 23/75 (30%), Positives = 29/75 (38%), Gaps = 3/75 (4%)
Query: 489 HIPPPQIQ-HIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHV 547
H P P Q + P Q VQN P+ Q QQ QQ +Q Q Q Q Q
Sbjct: 199 HAPKPTQQPTVQNPAQQPTVQN--PAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGH 256
Query: 548 ANSSHHMSQQRMHQL 562
S +Q+ + L
Sbjct: 257 KRSREQGNQEFLKML 271
Score = 29.6 bits (66), Expect = 4.4
Identities = 12/34 (35%), Positives = 13/34 (38%)
Query: 463 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQ 496
N Q Q Q QQP PQ PPQ +
Sbjct: 220 NPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTE 253
Score = 29.6 bits (66), Expect = 5.5
Identities = 16/44 (36%), Positives = 16/44 (36%), Gaps = 1/44 (2%)
Query: 463 NIPLQHITPQVQLHQQQQQQ-PPPQPAHIPPPQIQHIPPPIQHQ 505
N Q QQQ Q P QPA P PQ PP Q
Sbjct: 211 NPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254
>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
ABF1 is a sequence-specific DNA binding protein involved
in transcription activation, gene silencing and
initiation of DNA replication. ABF1 is known to remodel
chromatin, and it is proposed that it mediates its
effects on transcription and gene expression by
modifying local chromatin architecture. These functions
require a conserved stretch of 20 amino acids in the
C-terminal region of ABF1 (amino acids 639 to 662 S.
cerevisiae). The N-terminal two thirds of the protein
are necessary for DNA binding, and the N-terminus (amino
acids 9 to 91 in S. cerevisiae) is thought to contain a
novel zinc-finger motif which may stabilise the protein
structure.
Length = 508
Score = 36.3 bits (83), Expect = 0.047
Identities = 25/72 (34%), Positives = 33/72 (45%), Gaps = 5/72 (6%)
Query: 493 PQIQHIPPPIQHQPVQNIHPSI----QHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVA 548
P I HQ + HPSI QH Q+ Q+ Q+ QQ++QH Q HV
Sbjct: 281 PNAHQISGADHHQHQHHHHPSIPAHHQHQLPEGHQHDHQHHHQHQQQDLQHQEQHQQHVD 340
Query: 549 NSSHHMSQQRMH 560
N HH+ Q R+
Sbjct: 341 N-DHHVYQDRIG 351
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 35.9 bits (83), Expect = 0.054
Identities = 30/204 (14%), Positives = 50/204 (24%), Gaps = 42/204 (20%)
Query: 381 CFSTKSELNR-----HAQIHGGVKPFACTVCNKSFLQKTQLQNHLYTHSKVKPYNCGICA 435
CFS E NR H ++ GG KP + + ++ LQ+
Sbjct: 332 CFSNPEEANRAVTEMHGRMLGG-KPLYVALAQRKEQRRAHLQD----------------- 373
Query: 436 KSFVQRNTYIKHLEIVHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQI 495
Q ++ + + P QQQ P
Sbjct: 374 ----QFM------QLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP 423
Query: 496 QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNMHVANSSHHMS 555
+ P +P + QN Q+ Q + + S +S
Sbjct: 424 TPMGPGGPLRPNGLAPMNAVRAPSRNAQNA---AQKPPMQPVMYPPNYQ------SLPLS 474
Query: 556 QQRMHQLYALHHDGYMTATASIMQ 579
Q G A ++
Sbjct: 475 QDLPQPQSTASQGGQNKKLAQVLA 498
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 35.7 bits (82), Expect = 0.063
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 6/41 (14%)
Query: 482 QPPPQPAHIP----PPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
QP PA IP PP IQ++ PP P Q I P IQHI+
Sbjct: 232 QPSAPPASIPAPPIPPVIQYVAPP-PVPPPQPIIP-IQHIR 270
Score = 30.3 bits (68), Expect = 3.7
Identities = 12/40 (30%), Positives = 13/40 (32%), Gaps = 1/40 (2%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQH 504
P Q P + A P P Q I PIQH
Sbjct: 230 PAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQPI-IPIQH 268
>gnl|CDD|188742 cd09356, LIM2_TRIP6, The second LIM domain of Thyroid
receptor-interacting protein 6 (TRIP6). The second LIM
domain of Thyroid receptor-interacting protein 6
(TRIP6): TRIP6 is a member of the zyxin LIM protein
family and contains three LIM zinc-binding domains at
the C-terminal. TRIP6 protein localizes to focal
adhesion sites and along actin stress fibers.
Recruitment of this protein to the plasma membrane
occurs in a lysophosphatidic acid (LPA)-dependent
manner. TRIP6 recruits a number of molecules involved in
actin assembly, cell motility, survival and
transcriptional control. The function of TRIP6 in cell
motility is regulated by Src-dependent phosphorylation
at a Tyr residue. The phosphorylation activates the
coupling to the Crk SH2 domain, which is required for
the function of TRIP6 in promoting lysophosphatidic acid
(LPA)-induced cell migration. TRIP6 can shuttle to the
nucleus to serve as a coactivator of AP-1 and NF-kappaB
transcriptional factors. Moreover, TRIP6 can form a
ternary complex with the NHERF2 PDZ protein and LPA2
receptor to regulate LPA-induced activation of ERK and
AKT, rendering cells resistant to chemotherapy. Recent
evidence shows that TRIP6 antagonizes Fas-Induced
apoptosis by enhancing the antiapoptotic effect of LPA
in cells. As in other LIM domains, this domain family is
50-60 amino acids in size and shares two characteristic
zinc finger motifs. The two zinc fingers contain eight
conserved residues, mostly cysteines and histidines,
which coordinately bond to two zinc atoms. LIM domains
function as adaptors or scaffolds to support the
assembly of multimeric protein.
Length = 53
Score = 31.8 bits (72), Expect = 0.075
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 137 CELCYKMFSDKLLYEKHKMVHRQVFTCEVCMRSFN 171
C +C K +++L K H FTC VC RS +
Sbjct: 1 CSVCSKPIMERILRATGKAYHPHCFTCVVCHRSLD 35
>gnl|CDD|220871 pfam10759, DUF2587, Protein of unknown function (DUF2587). This is
a bacterial family of proteins with no known function.
Length = 168
Score = 34.3 bits (79), Expect = 0.083
Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 6/57 (10%)
Query: 452 HGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 508
HG+ T+ + Q + + QL Q +++ PP PP Q Q Q
Sbjct: 116 HGIQTA------LFAQQMAARAQLEQMRRRALPPGVGIAPPGQPQGARGGPPPGTGQ 166
>gnl|CDD|213398 cd12191, gal11_coact, gall11 coactivator domain. Gall11/MED15 acts
in the general regulation of GAL structural genes and is
required for full expression for several genes in this
pathway, including GALs 1,7, and 10 in Saccharomyces
cerevisiae. GAL11 function is dependent on GCN4
functionality and binds GCN4 in a degenerate manner with
multiple orientations found at the GCN4-Gal11 interface.
Length = 90
Score = 32.3 bits (74), Expect = 0.10
Identities = 26/89 (29%), Positives = 34/89 (38%), Gaps = 23/89 (25%)
Query: 471 PQVQLHQQQQQQPPPQPAH---------IPPP---QIQHIPPPI-----------QHQPV 507
Q Q QQQQQQ PQ IPP +I ++PP + Q+
Sbjct: 2 QQQQQPQQQQQQQMPQNPQLVNMMDNMPIPPQLLAKIPNLPPEVKTWRQLKAWLQQNNIP 61
Query: 508 QNIHPSIQHIQQNLQQNIQQNLQQNIQQN 536
I+ +QQ Q + Q QQ QQ
Sbjct: 62 PQAMEQIKEVQQTHFQLLMQRRQQQQQQQ 90
Score = 29.6 bits (67), Expect = 0.89
Identities = 24/97 (24%), Positives = 38/97 (39%), Gaps = 22/97 (22%)
Query: 478 QQQQQPPPQPAHIPPPQIQH---------IPPPIQHQPVQNIHPSI---QHIQQNLQQNI 525
QQQQ P Q PQ IPP + + + N+ P + + ++ LQQN
Sbjct: 1 PQQQQQPQQQQQQQMPQNPQLVNMMDNMPIPPQLLAK-IPNLPPEVKTWRQLKAWLQQN- 58
Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQL 562
N+ + I+ + Q H + Q+R Q
Sbjct: 59 --NIPPQAMEQIKEV--QQTHF----QLLMQRRQQQQ 87
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys. The
C2H2 zinc finger is composed of two short beta strands
followed by an alpha helix. The amino terminal part of
the helix binds the major groove in DNA binding zinc
fingers. The accepted consensus binding sequence for Sp1
is usually defined by the asymmetric hexanucleotide core
GGGCGG but this sequence does not include, among others,
the GAG (=CTC) repeat that constitutes a high-affinity
site for Sp1 binding to the wt1 promoter.
Length = 22
Score = 30.0 bits (68), Expect = 0.17
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 318 KCSYCPKAFSHKGHLNEHLYTH 339
KC C K+FS K +L HL TH
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 29.6 bits (67), Expect = 0.22
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 261 VCEICSKAFTQQYNLKHHLLLH 282
C C K+F+++ NLK HL H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 29.2 bits (66), Expect = 0.31
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 346 QCKYCPKTFTLRKHLKAHTNKH 367
+C C K+F+ + +LK H H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 28.5 bits (64), Expect = 0.52
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 374 TCLECSKCFSTKSELNRHAQIH 395
C +C K FS KS L RH + H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 27.3 bits (61), Expect = 1.6
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 290 RCDQCGKAFSRKGHLVQHSFVH 311
+C CGK+FSRK +L +H H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 26.5 bits (59), Expect = 3.1
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 403 CTVCNKSFLQKTQLQNHLYTH 423
C C KSF +K+ L+ HL TH
Sbjct: 2 CPDCGKSFSRKSNLKRHLRTH 22
Score = 25.8 bits (57), Expect = 5.8
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 108 TCEVCQQAFTRNADLKVHSMVH 129
C C ++F+R ++LK H H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
>gnl|CDD|225387 COG2831, FhaC, Hemolysin activation/secretion protein
[Intracellular trafficking and secretion].
Length = 554
Score = 34.2 bits (79), Expect = 0.17
Identities = 21/111 (18%), Positives = 41/111 (36%), Gaps = 9/111 (8%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ--HIQQ-NLQQNIQ 526
P +QQQ + + +P P ++ P P + I + L+ N
Sbjct: 27 NPAEGNIRQQQDERARREELLPAPDVRL-EQPDPAAPAVDAPDETPCFPINRIELEGNTL 85
Query: 527 QNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALHHDGYMTATASI 577
+ +Q + L + + +A+ + S + LY GY+T A +
Sbjct: 86 LDA-LELQAILAPYLGRCLGLADLNQLAS--ALTNLYI--DRGYVTTRALL 131
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 32.5 bits (74), Expect = 0.21
Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 1/38 (2%)
Query: 471 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 508
PQ Q+ Q P P +P Q+ + P H P+
Sbjct: 20 PQAQMPQASAPYPGPSMY-LPMAQVMAVGPQSSHPPMA 56
Score = 29.8 bits (67), Expect = 1.8
Identities = 11/60 (18%), Positives = 17/60 (28%), Gaps = 7/60 (11%)
Query: 471 PQVQLHQQQQQQPPP-----QPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ--HIQQNLQQ 523
+L+Q PP A P P + + Q+ HP + I
Sbjct: 7 AYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPV 66
Score = 29.0 bits (65), Expect = 3.2
Identities = 12/49 (24%), Positives = 16/49 (32%), Gaps = 2/49 (4%)
Query: 459 SANMN-IPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQ-HIPPPIQHQ 505
A M + P + L Q PQ +H P PPP+
Sbjct: 21 QAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPP 69
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 29.7 bits (67), Expect = 0.24
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 317 FKCSYCPKAFSHKGHLNEHLYTH 339
++C C K F K L EH+ TH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 27.8 bits (62), Expect = 1.1
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 373 HTCLECSKCFSTKSELNRHAQIH 395
+ C EC K F +KS L H + H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 27.8 bits (62), Expect = 1.1
Identities = 8/23 (34%), Positives = 11/23 (47%)
Query: 260 YVCEICSKAFTQQYNLKHHLLLH 282
Y C C K F + L+ H+ H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 27.8 bits (62), Expect = 1.1
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 401 FACTVCNKSFLQKTQLQNHLYTH 423
+ C C K F K+ L+ H+ TH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 27.4 bits (61), Expect = 1.3
Identities = 6/23 (26%), Positives = 10/23 (43%)
Query: 161 FTCEVCMRSFNDRKEFNEHIMKH 183
+ C C + F + EH+ H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.3 bits (58), Expect = 3.4
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 289 FRCDQCGKAFSRKGHLVQHSFVH 311
+RC +CGK F K L +H H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.3 bits (58), Expect = 3.8
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 345 FQCKYCPKTFTLRKHLKAHTNKH 367
++C C K F + L+ H H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
>gnl|CDD|227561 COG5236, COG5236, Uncharacterized conserved protein, contains RING
Zn-finger [General function prediction only].
Length = 493
Score = 33.8 bits (77), Expect = 0.26
Identities = 30/163 (18%), Positives = 54/163 (33%), Gaps = 34/163 (20%)
Query: 133 QLHKCELCYKMFSD---KLLYEKHKMV------HRQVFTCEVCMRSFNDRKEFNEHIMKH 183
KC + + ++ +++ F E+ + +H
Sbjct: 155 PKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFR---SSTLRDHKNGG 211
Query: 184 EKEKLFS----CNFCLKPFLRKDLLEAHVNSVHKDVKKYNCEECDKYYLSQGGVGCV-FL 238
+E+ F C FC F D L H H+ C CD VG + +
Sbjct: 212 LEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHE-----ACHICDM-------VGPIRYQ 259
Query: 239 YYLSQGELDSHVQTSHRGIKPYVCEICSKAFTQQYNLKHHLLL 281
Y+ S +L++H + +H Y C + + Y +H L
Sbjct: 260 YFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYHTEL 297
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 31.2 bits (71), Expect = 0.37
Identities = 17/57 (29%), Positives = 18/57 (31%), Gaps = 4/57 (7%)
Query: 461 NMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPP----PIQHQPVQNIHPS 513
P Q+ Q Q QQ PP P P Q H P P Q Q P
Sbjct: 7 TNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPP 63
Score = 29.3 bits (66), Expect = 1.7
Identities = 20/73 (27%), Positives = 24/73 (32%), Gaps = 8/73 (10%)
Query: 471 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 530
Q QQQQP Q P Q +PPP P Q + Q +Q+ Q
Sbjct: 7 TNQYPPQNQQQQPYQQ-----QPYHQQMPPPPYSPPQQ---QQGHFMPPQPQPYPKQSPQ 58
Query: 531 QNIQQNIQHLLQQ 543
Q L Q
Sbjct: 59 QQQPPQFSSFLSQ 71
>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
Provisional.
Length = 319
Score = 32.7 bits (74), Expect = 0.46
Identities = 22/91 (24%), Positives = 30/91 (32%), Gaps = 16/91 (17%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 526
Q + Q Q Q +QQ Q + Q Q PV Q +
Sbjct: 151 QTLQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPV--------------QAQPR 196
Query: 527 QNLQQNIQQNIQHLLQQNMHVANSSHHMSQQ 557
Q+ + QQ Q LLQ H ++ QQ
Sbjct: 197 QSKPASTQQPYQDLLQTPAH--TTAQSKPQQ 225
Score = 29.3 bits (65), Expect = 6.4
Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 2/60 (3%)
Query: 466 LQHITPQVQLHQQQQ-QQPPPQPAHIPPPQIQHI-PPPIQHQPVQNIHPSIQHIQQNLQQ 523
LQ QL +QQ+ Q Q + P+Q QP Q+ S Q Q+L Q
Sbjct: 153 LQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQ 212
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 32.9 bits (76), Expect = 0.55
Identities = 21/101 (20%), Positives = 44/101 (43%), Gaps = 13/101 (12%)
Query: 466 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 525
L+ +P+ L QQQQ+ + + +Q + + + Q +++ Q+
Sbjct: 298 LKFQSPERLLAQQQQR-------------LDRLQQRLQRALERRLRLAKQRLERLSQRLQ 344
Query: 526 QQNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALH 566
QQN Q+ I++ Q L Q + + +++ +L AL
Sbjct: 345 QQNPQRRIERAQQRLEQLEQRLRRAMRRQLKRKRQRLEALA 385
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 31.4 bits (72), Expect = 0.56
Identities = 10/45 (22%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 500 PPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ-QNIQQNIQHLLQQ 543
PP + P +I P + ++Q + I ++ + Q + ++I L ++
Sbjct: 23 PPADYTPYGDISPFLPELRQKHKARIAKDPEFQYLNEDIAWLKEE 67
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 32.0 bits (73), Expect = 0.65
Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 1/58 (1%)
Query: 451 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 508
VH VI + I + + P Q QPPP+P P P+ + IP P + PV
Sbjct: 35 VHQVIELPAPAQPISVTMVAPADL-EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 91
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 32.0 bits (73), Expect = 0.72
Identities = 10/47 (21%), Positives = 13/47 (27%)
Query: 483 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 529
P P+ P P PP Q P+ P+ L
Sbjct: 230 SPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITL 276
Score = 30.4 bits (69), Expect = 2.2
Identities = 15/58 (25%), Positives = 20/58 (34%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 522
Q TP PPP PA PPP +Q +PP +P + +
Sbjct: 218 SFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 32.2 bits (74), Expect = 0.85
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 68 QSKEKKAKKTKVPMVQVLEYVEQSKEKKAKKTKV 101
++K KKT P +VL + KEK+ +K KV
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412
>gnl|CDD|234905 PRK01122, PRK01122, potassium-transporting ATPase subunit B;
Provisional.
Length = 679
Score = 32.1 bits (74), Expect = 0.87
Identities = 13/27 (48%), Positives = 16/27 (59%), Gaps = 2/27 (7%)
Query: 9 EIYLLILLLTLTIVLVLPVVSVELLPM 35
EI L ILL LTI+ +L V + L P
Sbjct: 215 EIALTILLAGLTIIFLLVVAT--LPPF 239
>gnl|CDD|216066 pfam00695, vMSA, Major surface antigen from hepadnavirus.
Length = 364
Score = 31.9 bits (72), Expect = 0.88
Identities = 14/20 (70%), Positives = 16/20 (80%)
Query: 6 RRFEIYLLILLLTLTIVLVL 25
RRF I+LLILLL L +LVL
Sbjct: 223 RRFIIFLLILLLALGFLLVL 242
>gnl|CDD|235654 PRK05957, PRK05957, aspartate aminotransferase; Provisional.
Length = 389
Score = 32.0 bits (73), Expect = 0.90
Identities = 15/41 (36%), Positives = 19/41 (46%)
Query: 483 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 523
PPP A P ++Q VQ I P ++ I Q LQQ
Sbjct: 40 PPPPEAIEALNNFLANPENHKYQAVQGIPPLLEAITQKLQQ 80
>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
Length = 230
Score = 31.2 bits (70), Expect = 1.1
Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 5/43 (11%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQ---HIPPPIQHQP 506
H PQ + PPP P P P Q H PP+ QP
Sbjct: 40 HHPNPQED--PDKNPSPPPDPPLTPRPPAQPNGHNKPPVTKQP 80
>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
Provisional.
Length = 172
Score = 31.1 bits (70), Expect = 1.1
Identities = 12/36 (33%), Positives = 15/36 (41%)
Query: 458 NSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPP 493
+ N + P Q Q QQQ + P QPA P
Sbjct: 126 GNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161
>gnl|CDD|225126 COG2216, KdpB, High-affinity K+ transport system, ATPase chain B
[Inorganic ion transport and metabolism].
Length = 681
Score = 31.9 bits (73), Expect = 1.1
Identities = 12/27 (44%), Positives = 16/27 (59%), Gaps = 2/27 (7%)
Query: 9 EIYLLILLLTLTIVLVLPVVSVELLPM 35
EI L ILL LT++ +L V + L P
Sbjct: 215 EIALTILLSGLTLIFLLAVAT--LYPF 239
>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
Length = 332
Score = 31.6 bits (72), Expect = 1.2
Identities = 14/89 (15%), Positives = 30/89 (33%), Gaps = 13/89 (14%)
Query: 463 NIPLQHITPQVQLHQQQQQQP-----PPQPAHIPPPQ-IQHIPPPIQHQPVQNIHPSIQH 516
+P Q +Q H+ + QP I+ P P++ I+ S +
Sbjct: 104 TLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTIEETQSPNV--PIEGINSSSEQ 161
Query: 517 IQQNLQQNIQQNLQQNIQQNIQHLLQQNM 545
++ L + + I + H ++
Sbjct: 162 LRVELAELAAE-----IYSDASHRVELAK 185
Score = 30.5 bits (69), Expect = 2.1
Identities = 7/76 (9%), Positives = 20/76 (26%), Gaps = 4/76 (5%)
Query: 439 VQRNTYIKHLEIVHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHI 498
Q N +E +N + +V+L + + + + +
Sbjct: 133 TQTNVASMTIEETQSPNVPI-EGINSSSEQ--LRVELAELAAEIYSDASHRVELAK-NFM 188
Query: 499 PPPIQHQPVQNIHPSI 514
P + + +
Sbjct: 189 EPQAETEAQPEATTNF 204
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 31.6 bits (71), Expect = 1.3
Identities = 23/84 (27%), Positives = 25/84 (29%), Gaps = 4/84 (4%)
Query: 464 IPLQHITPQVQLHQQQQQQPP-PQPAHIPPPQIQHIPP---PIQHQPVQNIHPSIQHIQQ 519
+P H Q Q QQ QPP P H PP +Q PV HPS Q
Sbjct: 237 LPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQP 296
Query: 520 NLQQNIQQNLQQNIQQNIQHLLQQ 543
Q Q H
Sbjct: 297 FGLAQSQVPPLPLPSQAQPHSHTP 320
Score = 30.0 bits (67), Expect = 4.2
Identities = 15/50 (30%), Positives = 19/50 (38%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQH 516
H Q QQPP + P P + HI PP Q + S +H
Sbjct: 315 PHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKH 364
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 31.1 bits (70), Expect = 1.4
Identities = 13/69 (18%), Positives = 21/69 (30%), Gaps = 16/69 (23%)
Query: 38 SKEKKAKKTKVPMVQVADFVSGVWSLEYVEQSKEKKAKKTKVPMVQVLEYVEQSKEKKAK 97
++EKK K + K K K + + + K+ K+
Sbjct: 13 TEEKKPAAKKAT----------------TSKETAKTKKTAKTTSTKAAKKAAKVKKTKSV 56
Query: 98 KTKVPMVQV 106
KT V V
Sbjct: 57 KTTTKKVTV 65
Score = 28.8 bits (64), Expect = 7.0
Identities = 18/89 (20%), Positives = 27/89 (30%), Gaps = 13/89 (14%)
Query: 34 PMQQSKEKKAKKTKVPMVQVADFVSGVWSLEYVEQSKEKKAKKTKVPMVQVLEYVEQSKE 93
+ K K K + A + K+ K+ KT V V + K
Sbjct: 24 TTSKETAKTKKTAKTTSTKAAK---------KAAKVKKTKSVKTTTKKVTV----KFEKT 70
Query: 94 KKAKKTKVPMVQVFTCEVCQQAFTRNADL 122
+ KK V V V + F + L
Sbjct: 71 ESVKKESVAKKTVKKEAVSAEVFEASNKL 99
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.5 bits (72), Expect = 1.4
Identities = 11/70 (15%), Positives = 13/70 (18%), Gaps = 4/70 (5%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQ-IQHIPPPIQHQPVQNIHP--SIQHIQQNLQQNIQ 526
TP P P P P N
Sbjct: 378 TPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPP 437
Query: 527 Q-NLQQNIQQ 535
NL++ QQ
Sbjct: 438 SLNLEELWQQ 447
>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein. The Polycystin
Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
huge protein of 4303aas. Its repeated leucine-rich (LRR)
segment is found in many proteins. It contains 16
polycystic kidney disease (PKD) domains, one
LDL-receptor class A domain, one C-type lectin family
domain, and 16-18 putative TMSs in positions between
residues 2200 and 4100. Polycystin-L has been shown to
be a cation (Na+, K+ and Ca2+) channel that is activated
by Ca2+. Two members of the PCC family (polycystin 1 and
2) are mutated in autosomal dominant polycystic kidney
disease, and polycystin-L is deleted in mice with renal
and retinal defects. Note: this model is restricted to
the amino half for technical reasons.
Length = 2740
Score = 31.6 bits (71), Expect = 1.6
Identities = 13/44 (29%), Positives = 17/44 (38%)
Query: 475 LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 518
PPP A P ++HI P + P+Q H I Q
Sbjct: 153 ESLCSGPPPPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQ 196
>gnl|CDD|216927 pfam02203, TarH, Tar ligand binding domain homologue.
Length = 146
Score = 30.0 bits (68), Expect = 1.7
Identities = 6/29 (20%), Positives = 13/29 (44%)
Query: 530 QQNIQQNIQHLLQQNMHVANSSHHMSQQR 558
+ + + LQQ +A++ + Q R
Sbjct: 12 NDALDEVYTNSLQQQAALADAWVLLLQAR 40
>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF.
Sequences in this family of proteins are members of the
chain length determinant family (pfam02706) which
includes the wzc protein from E.coli. This family of
proteins are homologous to the EpsF protein of the
methanolan biosynthesis operon of Methylobacillus
species strain 12S. The distribution of this protein
appears to be restricted to a subset of
exopolysaccharide operons containing a syntenic
grouping of genes including a variant of the EpsH
exosortase protein. Exosortase has been proposed to be
involved in the targetting and processing of proteins
containing the PEP-CTERM domain to the
exopolysaccharide layer.
Length = 444
Score = 30.9 bits (70), Expect = 1.8
Identities = 19/60 (31%), Positives = 29/60 (48%), Gaps = 8/60 (13%)
Query: 1 MVVLGRRFEIYLLILLLTLTIVLVLPVVSVELLPMQQSKEKKAKKTKVPMVQVADFVSGV 60
+++L R+ I L LL+T+T V V LL KE A + V + D V+G+
Sbjct: 7 LLILKARYWIVLFTLLITVTTTAV-----VSLL---LPKEYTATTSVVLDYKGPDPVTGM 58
>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3. The
C.elegans Notch pathway, involved in the control of
growth, differentiation and patterning in animal
development, relies on either of the receptors GLP-1 or
LIN-12. Both these receptors promote signalling by the
recruitment of LAG-3 to target promoters, where it then
acts as a transcriptional activator. LAG-3 works as a
ternary complex together with the DNA binding protein,
LAG-1.
Length = 476
Score = 31.1 bits (69), Expect = 1.9
Identities = 27/95 (28%), Positives = 38/95 (40%), Gaps = 1/95 (1%)
Query: 478 QQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 536
Q P P HI Q Q+ +Q Q ++ Q QQ + Q+ QQ Q++ QQ
Sbjct: 310 QSAPPPAMNPQHIAQLAQQQNKMRLLQQQEMEMQRIEQQRQQQIMHQHQQQQQQEHQQQQ 369
Query: 537 IQHLLQQNMHVANSSHHMSQQRMHQLYALHHDGYM 571
+ QQ MH H M+ A H Y+
Sbjct: 370 MLLQQQQQMHQLQQHHQMNGGGQFATQAHQHAAYL 404
Score = 29.5 bits (65), Expect = 5.3
Identities = 26/104 (25%), Positives = 36/104 (34%), Gaps = 6/104 (5%)
Query: 473 VQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQP------VQNIHPSIQHIQQNLQQNIQ 526
+ HQQQQQQ Q + Q Q HQ H ++QQ +Q
Sbjct: 354 MHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQMNGGGQFATQAHQHAAYLQQMQHMRLQ 413
Query: 527 QNLQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQLYALHHDGY 570
+ +Q QQ H Q H + H + Y H G+
Sbjct: 414 EQIQHQQQQAQHHQQAQQQHQQPAQHGQMGYGIPNGYPAHMHGH 457
Score = 29.2 bits (64), Expect = 6.8
Identities = 21/93 (22%), Positives = 30/93 (32%)
Query: 469 ITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQN 528
+ + PQ A P QHI Q Q + + Q ++Q QQ
Sbjct: 293 EGETKKSPMEAGGDRMPQSAPPPAMNPQHIAQLAQQQNKMRLLQQQEMEMQRIEQQRQQQ 352
Query: 529 LQQNIQQNIQHLLQQNMHVANSSHHMSQQRMHQ 561
+ QQ Q QQ + M Q + H
Sbjct: 353 IMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHH 385
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 27.2 bits (60), Expect = 2.0
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 317 FKCSYCPKAFSHKGHLNEHLYTH 339
FKC C K+FS K L HL H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 26.8 bits (59), Expect = 2.4
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 345 FQCKYCPKTFTLRKHLKAHTNKH 367
F+C C K+F+ + LK H KH
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 25.7 bits (56), Expect = 6.5
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 401 FACTVCNKSFLQKTQLQNHLYTH 423
F C +C KSF K L+ HL H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 25.3 bits (55), Expect = 7.8
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 260 YVCEICSKAFTQQYNLKHHLLLH 282
+ C +C K+F+ + LK HL H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 25.3 bits (55), Expect = 8.0
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 161 FTCEVCMRSFNDRKEFNEHIMKH 183
F C +C +SF+ + H+ KH
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
>gnl|CDD|188739 cd09353, LIM2_Zyxin, The second LIM domain of Zyxin. The second
LIM domain of Zyxin: Zyxin exhibits three copies of the
LIM domain, an extensive proline-rich domain and a
nuclear export signal. Localized at sites of
cellsubstratum adhesion in fibroblasts, Zyxin interacts
with alpha-actinin, members of the cysteine-rich protein
(CRP) family, proteins that display Src homology 3 (SH3)
domains and Ena/VASP family members. Zyxin and its
partners have been implicated in the spatial control of
actin filament assembly as well as in pathways important
for cell differentiation. In addition to its functions
at focal adhesion plaques, recent work has shown that
zyxin moves from the sites of cell contacts to the
nucleus, where it directly participates in the
regulation of gene expression. As in other LIM domains,
this domain family is 50-60 amino acids in size and
shares two characteristic zinc finger motifs. The two
zinc fingers contain eight conserved residues, mostly
cysteines and histidines, which coordinately bond to two
zinc atoms. LIM domains function as adaptors o r
scaffolds to support the assembly of multimeric protein.
Length = 60
Score = 28.0 bits (62), Expect = 2.1
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 137 CELCYKMFSDKLLYEKHKMVHRQVFTCEVC 166
C +C + +D++L K H Q FTC VC
Sbjct: 1 CAVCDQKITDRMLKATGKSYHPQCFTCVVC 30
>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
Length = 459
Score = 30.7 bits (69), Expect = 2.1
Identities = 8/24 (33%), Positives = 10/24 (41%)
Query: 476 HQQQQQQPPPQPAHIPPPQIQHIP 499
QQQQ P Q I +Q +
Sbjct: 430 PHNQQQQEPLQNPDIYTKDLQRLT 453
>gnl|CDD|181567 PRK08861, PRK08861, cystathionine gamma-synthase; Provisional.
Length = 388
Score = 30.6 bits (69), Expect = 2.1
Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 13/86 (15%)
Query: 59 GVWSLEYVEQSKEKK-----AKKTKV--------PMVQVLEYVEQSKEKKAKKTKVPMVQ 105
G + +++V+QS AKK K+ P+V+V++ E ++ KA V +
Sbjct: 116 GDFKVQFVDQSDAAALDAALAKKPKLILLETPSNPLVRVVDIAELCQKAKAVGALVAVDN 175
Query: 106 VFTCEVCQQAFTRNADLKVHSMVHKI 131
F V Q+ AD +HS I
Sbjct: 176 TFLTPVLQKPLELGADFVIHSTTKYI 201
>gnl|CDD|188740 cd09354, LIM2_LPP, The second LIM domain of lipoma preferred
partner (LPP). The second LIM domain of lipoma
preferred partner (LPP): LPP is a member of the zyxin
LIM protein family and contains three LIM zinc-binding
domains at the C-terminal and proline-rich region at the
N-terminal. LPP initially identified as the most
frequent translocation partner of HMGA2 (High Mobility
Group A2) in a subgroup of benign tumors of adipose
tissue (lipomas). It was also shown to be rearranged in
a number of other soft tissues, as well as in a case of
acute monoblastic leukemia. In addition to its
involvement in tumors, LPP was inedited as a smooth
muscle restricted LIM protein that plays an important
role in SMC migration. LPP is localized at sites of cell
adhesion, cell-cell contacts and transiently in the
nucleus. In nucleus, it acts as a coactivator for the
ETS domain transcription factor PEA3. In addition to
PEA3, it interacts with alpha-actinin,vasodilator
stimulated phosphoprotein (VASP),Palladin, and Scrib.
The LIM domains are the main focal adhesion targeting
elements and that the proline- rich region, which
harbors binding sites for alpha-actinin and vasodilator-
stimulated phosphoprotein (VASP), has a weak targeting
capacity. As in other LIM domains, this domain family is
50-60 amino acids in size and shares two characteristic
zinc finger motifs. The two zinc fingers contain eight
conserved residues, mostly cysteines and histidines,
which coordinately bond to two zinc atoms. LIM domains
function as adaptors or scaffolds to support the
assembly of multimeric protein.
Length = 60
Score = 27.9 bits (62), Expect = 2.4
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 137 CELCYKMFSDKLLYEKHKMVHRQVFTCEVCMRSFN 171
C +C K D++L K H Q FTC VC +S +
Sbjct: 1 CSVCSKPILDRILRATGKPYHPQCFTCVVCGKSLD 35
>gnl|CDD|221266 pfam11851, DUF3371, Domain of unknown function (DUF3371). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 125 to 142 amino acids in length.
Length = 131
Score = 29.3 bits (66), Expect = 2.8
Identities = 11/66 (16%), Positives = 20/66 (30%), Gaps = 9/66 (13%)
Query: 451 VHGVITSNSANMNIPLQHITPQVQLH---QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPV 507
HG+ + L T + Q+ Q + P + P Q PP +
Sbjct: 5 AHGL------PVVASLGLCTADLMARVIKQEPQLEDNQCPQDLYPHSSQPDLPPPTTLDL 58
Query: 508 QNIHPS 513
+ +
Sbjct: 59 NDGTIT 64
>gnl|CDD|192241 pfam09280, XPC-binding, XPC-binding domain. Members of this family
adopt a structure consisting of four alpha helices,
arranged in an array. They bind specifically and
directly to the xeroderma pigmentosum group C protein
(XPC) to initiate nucleotide excision repair.
Length = 59
Score = 27.6 bits (62), Expect = 2.8
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 5/39 (12%)
Query: 511 HPSIQHIQQNLQQN---IQQNLQQNIQQNIQ--HLLQQN 544
+P Q ++Q +QQN + LQQ Q N Q L+QQN
Sbjct: 7 NPQFQQLRQMVQQNPELLPPLLQQLGQSNPQLLQLIQQN 45
>gnl|CDD|181818 PRK09393, ftrA, transcriptional activator FtrA; Provisional.
Length = 322
Score = 30.3 bits (69), Expect = 3.1
Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 4/39 (10%)
Query: 490 IPP----PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
+PP Q Q +P P+ + + P I ++ +L +
Sbjct: 196 VPPHRDGGQAQFVPRPVASRESDRLGPLIDWMRAHLAEP 234
>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
Length = 128
Score = 28.7 bits (64), Expect = 3.6
Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 3/53 (5%)
Query: 255 RGIKPYVCEICSKAFTQQYNLKHHLLLHGDGNSLFRCDQCGKAFSRKGHLVQH 307
+ + PYVC +C F+ +LK H+ C CGK F + H
Sbjct: 69 KAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV---CPVCGKEFRNTDSTLDH 118
Score = 28.7 bits (64), Expect = 4.1
Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 2/52 (3%)
Query: 161 FTCEVCMRSFNDRKEFNEHIMKHEKEKLFSCNFCLKPFLRKDLLEAHVNSVH 212
+ C +C+ F+ +HI E K+ C C K F D HV H
Sbjct: 74 YVCPLCLMPFSSSVSLKQHIRYTEHSKV--CPVCGKEFRNTDSTLDHVCKKH 123
Score = 28.3 bits (63), Expect = 5.8
Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 6/62 (9%)
Query: 398 VKPFACTVCNKSFLQKTQLQNHL-YT-HSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVI 455
V P+ C +C F L+ H+ YT HSKV C +C K F ++ + H+ H +
Sbjct: 71 VSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV----CPVCGKEFRNTDSTLDHVCKKHNIC 126
Query: 456 TS 457
S
Sbjct: 127 VS 128
>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
Provisional.
Length = 450
Score = 30.1 bits (68), Expect = 3.6
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 187 KLFSCNFCLKPFLRKDLLEAH 207
KL+ C FCLK RK+ L+ H
Sbjct: 197 KLYFCEFCLKFMKRKEQLQRH 217
>gnl|CDD|116585 pfam07975, C1_4, TFIIH C1-like domain. The carboxyl-terminal
region of TFIIH is essential for transcription activity.
This regions binds three zinc atoms through two
independent domain. The first contains a C4 zinc finger
motif, whereas the second is characterized by a
CX(2)CX(2-4)FCADCD motif. The solution structure of the
second C-terminal domain revealed homology with the
regulatory domain of protein kinase C (pfam00130).
Length = 48
Score = 27.0 bits (60), Expect = 3.8
Identities = 11/47 (23%), Positives = 17/47 (36%), Gaps = 6/47 (12%)
Query: 94 KKAKKTKVPMVQVFTCEVCQQAFTRNADLKVHSMVHKIAQLHKCELC 140
+K V+ C C+ F + D+ +H LH C C
Sbjct: 7 QKLPDGINKTSSVYRCPKCKNVFCLDCDVFIHE------SLHNCPGC 47
>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
Length = 84
Score = 28.0 bits (62), Expect = 3.9
Identities = 16/41 (39%), Positives = 18/41 (43%), Gaps = 7/41 (17%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 510
TP+ Q Q QQQ P Q I PQ P QH Q +
Sbjct: 51 TPEQQQQQPQQQIAPNQ---IEAPQ----PNFQQHYQGQQV 84
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 30.0 bits (67), Expect = 4.0
Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 5/61 (8%)
Query: 470 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ---HQPVQNIHPSIQHIQQNLQQNIQ 526
P Q QQ+P P PPPQ P +Q P+ Q ++Q L ++
Sbjct: 777 QPPPQAPPAPQQRPRGAPTPQPPPQA--GPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834
Query: 527 Q 527
+
Sbjct: 835 R 835
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 29.1 bits (65), Expect = 4.0
Identities = 25/91 (27%), Positives = 30/91 (32%), Gaps = 8/91 (8%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ---HQPVQNIHPSIQHIQQNLQQ 523
I P QQQQ P+ + P I PP Q P + + Q +LQ
Sbjct: 48 HPIIPLSPQMPQQQQSAHPK---LTPHHQLLILPPQQPMMPVPGHHPMVPMTGQQPHLQP 104
Query: 524 NIQQNLQQNIQQNIQHLLQQNMHVANSSHHM 554
Q LQ QN Q QQ H
Sbjct: 105 PAQHPLQPTYGQNPQP--QQPTHTQPPVQPQ 133
Score = 28.3 bits (63), Expect = 8.1
Identities = 16/62 (25%), Positives = 21/62 (33%), Gaps = 1/62 (1%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 524
P+ + + QQP QP P Q + P QP P +Q Q Q
Sbjct: 82 PMMPVPGHHPMVPMTGQQPHLQPPAQHPLQPTYGQNPQPQQPTHTQPP-VQPQQPADPQP 140
Query: 525 IQ 526
Q
Sbjct: 141 GQ 142
>gnl|CDD|212564 cd11674, lambda-1, inner capsid protein lambda-1 or VP3. The
reovirus inner capsid protein lambda-1 displays
nucleoside triphosphate phosphohydrolase (NTPase),
RNA-5'-triphosphatase (RTPase), and RNA helicase
activity and may play a role in the transcription of the
virus genome, the unwinding or reannealing of
double-stranded RNA during RNA synthesis. The RTPase
activity constitutes the first step in the capping of
RNA, resulting in a 5'-diphosphorylated RNA plus-strand.
lambda1 is an Orthoreovirus core protein, VP3 is the
homologous core protein in Aquareoviruses.
Length = 1166
Score = 30.1 bits (68), Expect = 4.1
Identities = 10/35 (28%), Positives = 11/35 (31%), Gaps = 1/35 (2%)
Query: 391 HAQIHGGVKPFACTVCNKSFLQKTQLQNHLYT-HS 424
+ C VC F QL H T HS
Sbjct: 64 KKATPINPSSYVCNVCMAEFSSMDQLAEHQRTTHS 98
>gnl|CDD|226535 COG4049, COG4049, Uncharacterized protein containing archaeal-type
C2H2 Zn-finger [General function prediction only].
Length = 65
Score = 27.3 bits (60), Expect = 4.3
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 284 DGNSLFRCDQCGKAFSRKGHLVQH 307
DG RC +CG F R+ ++H
Sbjct: 13 DGEEFLRCPRCGMVFRRRKDYIRH 36
>gnl|CDD|220093 pfam09030, Creb_binding, Creb binding. The Creb binding domain
assumes a structure comprising of three alpha-helices
which pack in a bundle, exposing a hydrophobic groove
between alpha-1 and alpha-3 within which complimentary
domains found in the protein 'activator for thyroid
hormone and retinoid receptors' (ACTR) can dock. Docking
of these domains is required for the recruitment of RNA
polymerase II and the basal transcription machinery.
Length = 104
Score = 28.1 bits (62), Expect = 5.0
Identities = 22/80 (27%), Positives = 34/80 (42%), Gaps = 2/80 (2%)
Query: 470 TPQVQLHQQQQQQPP-PQPAHIPPPQI-QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQ 527
PQ L QQQQ QP P+P Q P P QP + + Q+L + ++
Sbjct: 6 WPQGPLPQQQQMQPGMPRPVMQMVAQHAVAGPRPGLVQPGISRGIVSPNALQDLLRTLKS 65
Query: 528 NLQQNIQQNIQHLLQQNMHV 547
QQ + ++L+ N +
Sbjct: 66 PSSPQQQQQVLNILKSNPQL 85
>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein;
Provisional.
Length = 177
Score = 28.7 bits (64), Expect = 5.7
Identities = 7/28 (25%), Positives = 10/28 (35%)
Query: 465 PLQHITPQVQLHQQQQQQPPPQPAHIPP 492
P + QQ P P++ PP
Sbjct: 141 PQGGNQFSGGAQSRPQQSAPAAPSNEPP 168
>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
Length = 542
Score = 29.5 bits (67), Expect = 6.3
Identities = 6/18 (33%), Positives = 11/18 (61%)
Query: 518 QQNLQQNIQQNLQQNIQQ 535
Q +Q +++ LQQ I+
Sbjct: 235 QLKYKQQVEKRLQQRIES 252
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 29.4 bits (66), Expect = 6.5
Identities = 12/67 (17%), Positives = 18/67 (26%), Gaps = 3/67 (4%)
Query: 467 QHITPQVQLHQQQQQQPPPQPAHIP---PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 523
TP P P+ P P I V ++ PS Q +
Sbjct: 420 PAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAE 479
Query: 524 NIQQNLQ 530
N++
Sbjct: 480 QATGNIK 486
>gnl|CDD|182696 PRK10748, PRK10748, flavin mononucleotide phosphatase; Provisional.
Length = 238
Score = 28.5 bits (64), Expect = 9.5
Identities = 12/54 (22%), Positives = 25/54 (46%), Gaps = 3/54 (5%)
Query: 507 VQNIHPSIQHIQQNLQQNIQQNLQQ---NIQQNIQHLLQQNMHVANSSHHMSQQ 557
VQN HP+++ Q Q ++Q L++ I ++ + + A +S +
Sbjct: 39 VQNYHPALRSFQNEDLQRLRQALREAEPEIYHDVTRWRWRAIEQAMLDAGLSAE 92
>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
Provisional.
Length = 182
Score = 27.9 bits (62), Expect = 9.8
Identities = 12/51 (23%), Positives = 18/51 (35%), Gaps = 2/51 (3%)
Query: 455 ITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQ 505
+ N N P + P + Q Q Q A+ Q Q PP ++
Sbjct: 121 SFHENFNNNYPGNYNNPSQDPYMNQAQS-YNQNAYAKENQ-QAQPPKYENS 169
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.133 0.414
Gapped
Lambda K H
0.267 0.0704 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,445,280
Number of extensions: 2514350
Number of successful extensions: 5463
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4663
Number of HSP's successfully gapped: 286
Length of query: 579
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 477
Effective length of database: 6,413,494
Effective search space: 3059236638
Effective search space used: 3059236638
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (27.7 bits)