RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11675
(431 letters)
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 55.5 bits (134), Expect = 3e-08
Identities = 25/89 (28%), Positives = 29/89 (32%), Gaps = 8/89 (8%)
Query: 348 NIPLQHITPQVQLHQQQQQQPPPQPAH--IPPPQIQHIPPPIQH-----QPVQNIHPSIQ 400
P PQ QQ PPPQP PPQ Q PPP Q + +
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLP 297
Query: 401 -HIQQNLQQNIQQNLQQNIQQNIQHLLQQ 428
Q L +QQ Q + L Q
Sbjct: 298 PPQQPQLLPLVQQPQGQQRGPQFREQLVQ 326
Score = 53.2 bits (128), Expect = 2e-07
Identities = 25/92 (27%), Positives = 26/92 (28%), Gaps = 9/92 (9%)
Query: 349 IPLQHITPQVQLH-QQQQQQPPPQ---PAHIPPPQIQHIPPPIQHQ--PVQNIHPSIQHI 402
PL PQ QQ Q Q P PPQ Q PP Q Q P P
Sbjct: 230 PPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLP 289
Query: 403 QQN---LQQNIQQNLQQNIQQNIQHLLQQNMH 431
Q L Q L +QQ
Sbjct: 290 QGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFR 321
Score = 52.1 bits (125), Expect = 4e-07
Identities = 21/85 (24%), Positives = 24/85 (28%), Gaps = 1/85 (1%)
Query: 346 NMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS-IQHIQQ 404
P Q PQ +Q P P H Q Q P P Q P +
Sbjct: 182 QGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPP 241
Query: 405 NLQQNIQQNLQQNIQQNIQHLLQQN 429
LQQ L Q + QQ
Sbjct: 242 PLQQPQFPGLSQQMPPPPPQPPQQQ 266
Score = 48.2 bits (115), Expect = 5e-06
Identities = 20/82 (24%), Positives = 21/82 (25%), Gaps = 2/82 (2%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
P Q Q Q PP P +P P Q P Q QQ Q
Sbjct: 211 PEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPP 270
Query: 410 IQQNLQ--QNIQQNIQHLLQQN 429
Q QN L Q
Sbjct: 271 QPQAQPPPQNQPTPHPGLPQGQ 292
Score = 46.3 bits (110), Expect = 2e-05
Identities = 23/91 (25%), Positives = 29/91 (31%), Gaps = 13/91 (14%)
Query: 351 LQHITPQVQLHQQQQQQPPPQ----PAHIPPPQI---QHIPPPIQHQ-PVQNIHPSI--- 399
L+ + Q+Q QQ Q P P P +PP Q Q PP P
Sbjct: 156 LEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQ 215
Query: 400 --QHIQQNLQQNIQQNLQQNIQQNIQHLLQQ 428
Q + Q Q L + Q L Q
Sbjct: 216 PQQFLPAPSQAPAQPPLPPQLPQQPPPLQQP 246
Score = 46.3 bits (110), Expect = 2e-05
Identities = 15/96 (15%), Positives = 21/96 (21%), Gaps = 9/96 (9%)
Query: 337 HGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQH---------IPPPI 387
V+ P Q Q P + +PP +
Sbjct: 177 QQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQL 236
Query: 388 QHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQ 423
QP P + Q + Q QQ Q
Sbjct: 237 PQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQP 272
Score = 46.3 bits (110), Expect = 3e-05
Identities = 20/88 (22%), Positives = 20/88 (22%), Gaps = 9/88 (10%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQP-----AHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ- 403
Q Q P QP P Q PPP Q Q P Q
Sbjct: 218 QFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPP 277
Query: 404 -QNLQQNIQQNLQQNIQQNIQHLLQQNM 430
QN Q Q QQ
Sbjct: 278 PQNQPTPHPGLPQG--QNAPLPPPQQPQ 303
Score = 45.9 bits (109), Expect = 3e-05
Identities = 19/77 (24%), Positives = 25/77 (32%), Gaps = 7/77 (9%)
Query: 348 NIPLQHITPQVQ--LHQQQQQQPPPQPAHIPPPQ-----IQHIPPPIQHQPVQNIHPSIQ 400
+P P Q Q Q QPPPQ P P +PPP Q Q + +
Sbjct: 254 QMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQG 313
Query: 401 HIQQNLQQNIQQNLQQN 417
+ + L Q
Sbjct: 314 QQRGPQFREQLVQLSQQ 330
Score = 45.9 bits (109), Expect = 3e-05
Identities = 19/76 (25%), Positives = 22/76 (28%), Gaps = 7/76 (9%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPA-HIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
P+ QQ P PA PPQ+ PPP+Q QQ
Sbjct: 205 QPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFP------GLSQQMPPP 258
Query: 409 NIQQNLQQNIQQNIQH 424
Q QQ Q
Sbjct: 259 PPQPPQQQQQPPQPQA 274
Score = 45.1 bits (107), Expect = 5e-05
Identities = 20/86 (23%), Positives = 22/86 (25%), Gaps = 7/86 (8%)
Query: 350 PLQHITPQVQLHQQQQQQ-----PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQ 404
+ + PQ L Q PP P PP Q P Q P P Q Q
Sbjct: 210 HPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQP 269
Query: 405 NLQQNIQ--QNLQQNIQQNIQHLLQQ 428
Q QN Q
Sbjct: 270 PQPQAQPPPQNQPTPHPGLPQGQNAP 295
Score = 40.5 bits (95), Expect = 0.002
Identities = 13/76 (17%), Positives = 23/76 (30%), Gaps = 3/76 (3%)
Query: 349 IPLQHITPQVQLH---QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQN 405
P PQ Q Q Q P P P + + P Q + + Q
Sbjct: 271 QPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQ 330
Query: 406 LQQNIQQNLQQNIQQN 421
++ + Q + ++
Sbjct: 331 QREALSQEEAKRAKRR 346
Score = 35.9 bits (83), Expect = 0.046
Identities = 19/90 (21%), Positives = 24/90 (26%), Gaps = 14/90 (15%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPP-------------PQIQHIP-PPIQHQPVQNIHPSIQH 401
P+ Q Q + QP PQ Q +P PP Q P
Sbjct: 132 PEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAF 191
Query: 402 IQQNLQQNIQQNLQQNIQQNIQHLLQQNMH 431
QQ + Q Q QQ +
Sbjct: 192 PQQGPPEQPPGYPQPPQGHPEQVQPQQFLP 221
Score = 29.7 bits (67), Expect = 3.5
Identities = 15/61 (24%), Positives = 19/61 (31%), Gaps = 5/61 (8%)
Query: 368 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQ 427
P+P PP P Q QP + + Q +Q Q Q Q L Q
Sbjct: 128 TAPKPEPQPPQA-----PESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQ 182
Query: 428 Q 428
Sbjct: 183 G 183
Score = 29.0 bits (65), Expect = 5.4
Identities = 7/61 (11%), Positives = 14/61 (22%), Gaps = 2/61 (3%)
Query: 350 PLQHITPQVQLHQQQQQ--QPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
Q + QQ Q PQ + + Q + + +
Sbjct: 288 LPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQEEAKRAKRRH 347
Query: 408 Q 408
+
Sbjct: 348 K 348
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 46.9 bits (111), Expect = 1e-05
Identities = 23/83 (27%), Positives = 27/83 (32%), Gaps = 5/83 (6%)
Query: 352 QHITPQVQLHQQQQ---QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
Q P QQ Q Q P Q PP QP +QQ Q
Sbjct: 159 QSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQ--QPPQMGQPGMPGGGGQGQMQQQGQP 216
Query: 409 NIQQNLQQNIQQNIQHLLQQNMH 431
QQ +QQ +Q+ QQ M
Sbjct: 217 GGQQQQNPQMQQQLQNQQQQQMD 239
Score = 38.8 bits (90), Expect = 0.005
Identities = 15/67 (22%), Positives = 17/67 (25%)
Query: 351 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 410
Q QQ PPQ P Q +QQ LQ
Sbjct: 175 GQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQ 234
Query: 411 QQNLQQN 417
QQ + Q
Sbjct: 235 QQQMDQQ 241
Score = 38.4 bits (89), Expect = 0.008
Identities = 19/82 (23%), Positives = 29/82 (35%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
L + Q+Q Q P QP PP + +P Q + Q ++Q +N
Sbjct: 288 QLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRN 347
Query: 410 IQQNLQQNIQQNIQHLLQQNMH 431
++ Q QQ Q H
Sbjct: 348 MRGQQQTQQQQQQQGGNHPAAH 369
Score = 36.9 bits (85), Expect = 0.020
Identities = 15/80 (18%), Positives = 21/80 (26%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
P Q P + Q Q Q + +Q+Q Q + Q
Sbjct: 193 PPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMG 252
Query: 410 IQQNLQQNIQQNIQHLLQQN 429
QQ Q +Q Q
Sbjct: 253 QQQQGQGGMQPQQMQGGQMQ 272
Score = 36.9 bits (85), Expect = 0.024
Identities = 19/81 (23%), Positives = 25/81 (30%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
+Q + Q Q PPPQ P Q + + Q + + QQ
Sbjct: 295 QMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQT 354
Query: 410 IQQNLQQNIQQNIQHLLQQNM 430
QQ QQ H Q N
Sbjct: 355 QQQQQQQGGNHPAAHQQQMNQ 375
Score = 36.5 bits (84), Expect = 0.029
Identities = 26/89 (29%), Positives = 37/89 (41%), Gaps = 11/89 (12%)
Query: 352 QHITPQVQLHQQQ-QQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP----------SIQ 400
Q QV + QQ QQQP + P Q+Q +P Q P Q + P Q
Sbjct: 267 QGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQ 326
Query: 401 HIQQNLQQNIQQNLQQNIQQNIQHLLQQN 429
+QQ + QQ L+Q +N++ Q
Sbjct: 327 AVQQGVMSAGQQQLKQMKLRNMRGQQQTQ 355
Score = 35.0 bits (80), Expect = 0.082
Identities = 19/75 (25%), Positives = 25/75 (33%), Gaps = 6/75 (8%)
Query: 356 PQVQLHQQQQ--QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQN 413
Q Q+ QQQQ PQ Q+ P Q P Q S + N Q +
Sbjct: 247 AQAQMGQQQQGQGGMQPQQMQGGQMQV----PMQQQPPQQQPQQSQLGMLPNQMQQMPGG 302
Query: 414 LQQNIQQNIQHLLQQ 428
Q Q + Q+
Sbjct: 303 GQGGPGQPMGPPPQR 317
Score = 33.1 bits (75), Expect = 0.33
Identities = 14/69 (20%), Positives = 19/69 (27%), Gaps = 4/69 (5%)
Query: 347 MNIPLQHITPQVQLHQQQQQQPPP--QPAHIPPPQI--QHIPPPIQHQPVQNIHPSIQHI 402
P+ + + Q Q P P PP + IP P P + S
Sbjct: 413 SPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPA 472
Query: 403 QQNLQQNIQ 411
Q Q
Sbjct: 473 SQRTIQQDM 481
Score = 32.7 bits (74), Expect = 0.43
Identities = 19/83 (22%), Positives = 27/83 (32%), Gaps = 4/83 (4%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAH-IPPPQIQHIPPPIQHQPVQNIH---PSIQHIQQN 405
Q+ Q QL QQQQQ Q Q+ Q + + QQ
Sbjct: 220 QQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQGGQMQVPMQQQP 279
Query: 406 LQQNIQQNLQQNIQQNIQHLLQQ 428
QQ QQ+ + +Q +
Sbjct: 280 PQQQPQQSQLGMLPNQMQQMPGG 302
Score = 30.7 bits (69), Expect = 1.6
Identities = 21/89 (23%), Positives = 24/89 (26%), Gaps = 6/89 (6%)
Query: 348 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPP------PIQHQPVQNIHPSIQH 401
Q P Q Q Q Q Q Q P Q Q + P
Sbjct: 208 GQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQ 267
Query: 402 IQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
Q QQ QQ QQ+ +L M
Sbjct: 268 GGQMQVPMQQQPPQQQPQQSQLGMLPNQM 296
Score = 30.7 bits (69), Expect = 1.6
Identities = 18/74 (24%), Positives = 21/74 (28%), Gaps = 6/74 (8%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 415
Q QQQPP P Q QP Q +QQ +Q Q
Sbjct: 182 QGQQGP-VGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQ-----QQQNPQMQQQLQNQQQ 235
Query: 416 QNIQQNIQHLLQQN 429
Q + Q Q
Sbjct: 236 QQMDQQQGPADAQA 249
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 44.7 bits (105), Expect = 1e-04
Identities = 25/82 (30%), Positives = 29/82 (35%), Gaps = 2/82 (2%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
P Q + PQ Q QQ QQP PQ P P QP Q + P Q+ Q
Sbjct: 768 PQQPVAPQPQ--YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA 825
Query: 410 IQQNLQQNIQQNIQHLLQQNMH 431
Q QQ Q +H
Sbjct: 826 PQPQYQQPQQPVAPQPQDTLLH 847
Score = 40.5 bits (94), Expect = 0.002
Identities = 20/55 (36%), Positives = 23/55 (41%), Gaps = 7/55 (12%)
Query: 350 PLQHITPQVQLHQQQQQQPP------PQPAHIPPPQIQHIPPPIQHQPVQN-IHP 397
P Q + PQ Q Q QQ P PQ P PQ Q P+ QP +HP
Sbjct: 794 PQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHP 848
Score = 38.9 bits (90), Expect = 0.005
Identities = 22/69 (31%), Positives = 27/69 (39%), Gaps = 2/69 (2%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 414
TP V+ QQ QQ PQ + P Q + P Q+Q Q QQ Q Q
Sbjct: 746 TPIVEPVQQPQQPVAPQQQYQQPQQ--PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ 803
Query: 415 QQNIQQNIQ 423
Q QQ +
Sbjct: 804 YQQPQQPVA 812
Score = 37.8 bits (87), Expect = 0.014
Identities = 23/71 (32%), Positives = 28/71 (39%), Gaps = 8/71 (11%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQI-----QHIPPPIQH-QPVQNIHPSIQHIQQNLQQ 408
PQ + QQQ Q P QP P Q + P Q+ QP Q + P Q+ Q
Sbjct: 754 QPQQPVAPQQQYQQPQQPV--APQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 811
Query: 409 NIQQNLQQNIQ 419
Q QQ Q
Sbjct: 812 APQPQYQQPQQ 822
Score = 33.5 bits (76), Expect = 0.28
Identities = 21/112 (18%), Positives = 35/112 (31%), Gaps = 5/112 (4%)
Query: 292 NKSFLQKTQLQNHLYTHSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVITSNSANMNIPL 351
N+ Q Q Q Y + +P A + Q + + N+
Sbjct: 394 NEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQ 453
Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
PQ Q +Q QP + P P++ QPV P ++ +
Sbjct: 454 STFAPQST---YQTEQTYQQP--AAQEPLYQQPQPVEQQPVVEPEPVVEETK 500
Score = 30.1 bits (67), Expect = 2.6
Identities = 13/59 (22%), Positives = 19/59 (32%)
Query: 354 ITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQ 412
I P + + QQ Q P + P Q P + P + +Q QQ
Sbjct: 373 IAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQ 431
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 39.4 bits (92), Expect = 0.001
Identities = 19/56 (33%), Positives = 22/56 (39%), Gaps = 2/56 (3%)
Query: 350 PLQHITPQVQLHQQQ--QQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
P QH P + QQ Q QP P P Q Q PI P Q P + +Q
Sbjct: 84 PTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQ 139
Score = 36.3 bits (84), Expect = 0.012
Identities = 22/76 (28%), Positives = 23/76 (30%), Gaps = 11/76 (14%)
Query: 360 LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSI-----------QHIQQNLQQ 408
LH Q P HIP QPV P + QH Q NL Q
Sbjct: 35 LHHQIIPVSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQ 94
Query: 409 NIQQNLQQNIQQNIQH 424
QQ Q Q Q
Sbjct: 95 PAQQPFQPQPLQPPQP 110
Score = 32.1 bits (73), Expect = 0.29
Identities = 17/63 (26%), Positives = 22/63 (34%), Gaps = 8/63 (12%)
Query: 337 HGVITSNSANMNIPLQHITP------QVQLHQQQQQQPPPQPAHIPPPQIQHIPP--PIQ 388
H + + N+P P Q QQ Q PP P P +PP P+Q
Sbjct: 80 HSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQ 139
Query: 389 HQP 391
P
Sbjct: 140 PLP 142
Score = 31.7 bits (72), Expect = 0.43
Identities = 18/88 (20%), Positives = 24/88 (27%), Gaps = 11/88 (12%)
Query: 339 VITSNSANMNIPLQHITPQVQLHQQQQQQPP--PQPAHIP--------PPQIQHIPPPIQ 388
V + + H P + Q Q P P P P Q P Q
Sbjct: 42 VSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQ 101
Query: 389 HQPVQNIHPSIQHIQQNLQQNIQQNLQQ 416
QP+Q Q +Q + L
Sbjct: 102 PQPLQPP-QPQQPMQPQPPVHPIPPLPP 128
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 39.3 bits (92), Expect = 0.003
Identities = 17/97 (17%), Positives = 27/97 (27%), Gaps = 8/97 (8%)
Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 395
+ S N P P Q P Q + P + P
Sbjct: 84 QQHNTPTGSPAANPPATFALP-AGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQ 142
Query: 396 HPSIQHIQQNLQQN-------IQQNLQQNIQQNIQHL 425
++Q +QQ QQ+ Q+N + +Q
Sbjct: 143 QRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQ 179
Score = 37.8 bits (88), Expect = 0.008
Identities = 21/114 (18%), Positives = 26/114 (22%), Gaps = 19/114 (16%)
Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPP------------QPAHIPPPQ---- 379
+ Q P Q Q Q P PA PP
Sbjct: 45 AEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALP 104
Query: 380 IQHIPPPIQHQPVQN---IHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
P IQ +P Q P + Q Q+ +QQ Q
Sbjct: 105 AGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPAS 158
Score = 33.6 bits (77), Expect = 0.16
Identities = 18/69 (26%), Positives = 22/69 (31%), Gaps = 4/69 (5%)
Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 395
V ++T N AN + P Q QQ QQ PA P Q +Q
Sbjct: 124 VPVMVTQNPANSPLD----QPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQ 179
Query: 396 HPSIQHIQQ 404
QQ
Sbjct: 180 PNGETPPQQ 188
Score = 33.6 bits (77), Expect = 0.18
Identities = 19/97 (19%), Positives = 31/97 (31%), Gaps = 8/97 (8%)
Query: 338 GVITSNSANM-NIPLQHITPQVQLHQQQQQQPPP---QPAHIPPPQIQHIPPPIQHQPVQ 393
V + N N PL Q L Q QQ+ P Q ++ +Q QP
Sbjct: 123 QVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQPNG 182
Query: 394 NIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
P Q + + + +++ L Q+
Sbjct: 183 ETPPQ----QTDGAGDDESEALVRLREADGTLEQRIK 215
Score = 32.8 bits (75), Expect = 0.36
Identities = 21/99 (21%), Positives = 23/99 (23%), Gaps = 23/99 (23%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQ----------PVQNI---HPSIQHI 402
PQ Q QP PQP Q QH P P+ I
Sbjct: 53 PQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTI 112
Query: 403 QQNLQQNIQQNL----------QQNIQQNIQHLLQQNMH 431
Q Q + Q Q LQQ
Sbjct: 113 QTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQ 151
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 39.2 bits (91), Expect = 0.003
Identities = 27/82 (32%), Positives = 28/82 (34%), Gaps = 3/82 (3%)
Query: 346 NMNIPLQHITPQVQLHQQQQ---QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHI 402
P H P QQ Q QQP PQ PP Q PPP Q Q P Q
Sbjct: 89 QSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVP 148
Query: 403 QQNLQQNIQQNLQQNIQQNIQH 424
QQ Q+ Q Q Q
Sbjct: 149 QQQQYQSPPQQPQYQQNPPPQA 170
Score = 37.2 bits (86), Expect = 0.012
Identities = 17/51 (33%), Positives = 20/51 (39%), Gaps = 2/51 (3%)
Query: 350 PLQHITPQVQLHQQQQQQPP--PQPAHIPPPQIQHIPPPIQHQPVQNIHPS 398
Q P QQQ Q PP PQ PPPQ Q P P ++ +
Sbjct: 138 AQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQP 188
Score = 36.8 bits (85), Expect = 0.018
Identities = 24/86 (27%), Positives = 24/86 (27%), Gaps = 3/86 (3%)
Query: 346 NMNIPLQHITPQVQLHQQQQQ--QPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
P Q Q Q Q PPQ P Q P P Q P Q Q
Sbjct: 78 APQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQ 137
Query: 404 -QNLQQNIQQNLQQNIQQNIQHLLQQ 428
Q Q Q QQ Q Q Q
Sbjct: 138 AQQPQPPPQVPQQQQYQSPPQQPQYQ 163
Score = 33.4 bits (76), Expect = 0.19
Identities = 14/71 (19%), Positives = 19/71 (26%), Gaps = 3/71 (4%)
Query: 352 QHITPQVQLHQQQQQQPPPQPAHIP--PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
Q P P PP Q+ P + Q VQ++ Q+
Sbjct: 66 LQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQY-PSQLPPQQVQSVPQQPTPQQEPYYPP 124
Query: 410 IQQNLQQNIQQ 420
Q QQ
Sbjct: 125 PSQPQPPPAQQ 135
Score = 33.4 bits (76), Expect = 0.20
Identities = 18/47 (38%), Positives = 19/47 (40%)
Query: 363 QQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
QQ Q PQP P Q Q+ PP Q Q QN P Q Q
Sbjct: 134 QQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY 180
Score = 31.8 bits (72), Expect = 0.66
Identities = 22/77 (28%), Positives = 27/77 (35%), Gaps = 4/77 (5%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIP--PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
+Q + Q Q+ PP QP P PQ Q PP Q Q Q Q Q
Sbjct: 106 QVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQY--Q 163
Query: 408 QNIQQNLQQNIQQNIQH 424
QN Q Q + +
Sbjct: 164 QNPPPQAQSAPQVSGLY 180
Score = 31.4 bits (71), Expect = 0.78
Identities = 20/66 (30%), Positives = 24/66 (36%), Gaps = 3/66 (4%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 414
QV H+ A +PP P Q P Q Q PS Q+ Q Q +Q
Sbjct: 54 EQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQS--QAPPSHQYPSQLPPQQVQSVP 111
Query: 415 QQNIQQ 420
QQ Q
Sbjct: 112 QQPTPQ 117
Score = 28.4 bits (63), Expect = 8.8
Identities = 13/49 (26%), Positives = 16/49 (32%), Gaps = 2/49 (4%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS 398
P Q Q QQ PPPQ P + P Q+ P+
Sbjct: 148 PQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY--PEESPYQPQSYPPN 194
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 38.8 bits (90), Expect = 0.007
Identities = 17/45 (37%), Positives = 18/45 (40%), Gaps = 3/45 (6%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 400
P V + PP QP P PQ PPP Q QP P Q
Sbjct: 2889 PAVSRSTESFALPPDQPERPPQPQ---APPPPQPQPQPPPPPQPQ 2930
Score = 35.3 bits (81), Expect = 0.069
Identities = 14/32 (43%), Positives = 14/32 (43%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 386
PQ Q Q QPPP P PPP P P
Sbjct: 2910 QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
Score = 35.3 bits (81), Expect = 0.079
Identities = 15/44 (34%), Positives = 15/44 (34%), Gaps = 5/44 (11%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS 398
PQ Q Q QPPP P P P P P PS
Sbjct: 2918 QPQPQPPPPPQPQPPPPPPPRPQP-----PLAPTTDPAGAGEPS 2956
Score = 34.9 bits (80), Expect = 0.090
Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 1/38 (2%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQ-HIPPPIQHQP 391
+ + Q + PPQP PPPQ Q PPP Q QP
Sbjct: 2894 STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931
Score = 34.5 bits (79), Expect = 0.13
Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 1/37 (2%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIP-PPIQHQP 391
P Q + Q Q PP P P P P PP P
Sbjct: 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
Score = 32.6 bits (74), Expect = 0.45
Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 1/51 (1%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ 400
L P+ Q P PQP PPPQ Q PPP +P + P+
Sbjct: 2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTD 2948
Score = 31.8 bits (72), Expect = 0.80
Identities = 8/37 (21%), Positives = 9/37 (24%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 386
P P Q PPP+P P
Sbjct: 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
Score = 31.4 bits (71), Expect = 1.2
Identities = 9/38 (23%), Positives = 14/38 (36%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 393
P V+ + + +PP Q + P P P Q
Sbjct: 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQ 2918
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 38.3 bits (90), Expect = 0.008
Identities = 12/60 (20%), Positives = 18/60 (30%), Gaps = 1/60 (1%)
Query: 362 QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 421
QQ Q Q Q Q Q Q S+ QQ L+ + + + +
Sbjct: 625 QQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDELGR-QRGGLPGM 683
Score = 33.3 bits (77), Expect = 0.31
Identities = 12/69 (17%), Positives = 21/69 (30%), Gaps = 6/69 (8%)
Query: 362 QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 421
Q+ +QQ Q Q Q Q Q +L + QQ L+ + +
Sbjct: 618 QRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAER-QQALRDELGR- 675
Query: 422 IQHLLQQNM 430
+ +
Sbjct: 676 ----QRGGL 680
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 37.5 bits (86), Expect = 0.013
Identities = 9/52 (17%), Positives = 35/52 (67%), Gaps = 3/52 (5%)
Query: 379 QIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
Q +++P ++ NI + ++N++++ ++N+++N+++N++ +++N+
Sbjct: 931 QNENVPEHLKEHAEANIE---EDAEENVEEDAEENVEENVEENVEENVEENV 979
Score = 34.8 bits (79), Expect = 0.094
Identities = 16/112 (14%), Positives = 53/112 (47%), Gaps = 26/112 (23%)
Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 395
+ G I +N L++ V H ++ + A+I + ++ +N+
Sbjct: 913 LSGNIAHEINLINKELKNQNENVPEHLKEHAE-----ANIE----EDAEENVEEDAEENV 963
Query: 396 HPSIQ-----------------HIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
+++ ++++N+++N+++N+++NI++N++ +++N+
Sbjct: 964 EENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENI 1015
Score = 31.7 bits (71), Expect = 1.0
Identities = 9/43 (20%), Positives = 30/43 (69%)
Query: 381 QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQ 423
++I I+ +N+ +++ I++N+++N+++N ++N ++N +
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAE 1116
Score = 30.5 bits (68), Expect = 2.4
Identities = 9/38 (23%), Positives = 27/38 (71%)
Query: 393 QNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
+NI +I+ + + I++N+++NI++N++ +++N+
Sbjct: 1055 ENIEENIEEYDEENVEEIEENIEENIEENVEENVEENV 1092
Score = 30.1 bits (67), Expect = 2.5
Identities = 7/31 (22%), Positives = 28/31 (90%)
Query: 400 QHIQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
+++++N+++N+++N+++NI++N++ ++N+
Sbjct: 997 ENVEENIEENVEENVEENIEENVEEYDEENV 1027
Score = 30.1 bits (67), Expect = 3.2
Identities = 7/28 (25%), Positives = 26/28 (92%)
Query: 398 SIQHIQQNLQQNIQQNLQQNIQQNIQHL 425
+++ I++N+++NI++N+++N+++N++ +
Sbjct: 1068 NVEEIEENIEENIEENVEENVEENVEEI 1095
Score = 28.6 bits (63), Expect = 7.8
Identities = 7/37 (18%), Positives = 25/37 (67%)
Query: 393 QNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQN 429
+N+ +++++N+++N ++N ++N ++N + +N
Sbjct: 1090 ENVEEIEENVEENVEENAEENAEENAEENAEEYDDEN 1126
Score = 28.6 bits (63), Expect = 8.2
Identities = 8/51 (15%), Positives = 31/51 (60%)
Query: 379 QIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQN 429
++ I I+ +N+ +++ + +++N+++N+++N ++N + ++N
Sbjct: 1068 NVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 36.6 bits (85), Expect = 0.014
Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 1/53 (1%)
Query: 343 NSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIP-PPIQHQPVQN 394
N +P + Q Q QQQ QP + PP+ P P PV+
Sbjct: 108 NEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRT 160
Score = 36.6 bits (85), Expect = 0.014
Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 1/43 (2%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP 397
P+ + QQQ Q PA PQ PP PVQ P
Sbjct: 114 VPRSTVQIQQQAQQQQPPATTAQPQ-PVTPPRQTTAPVQPQTP 155
Score = 35.0 bits (81), Expect = 0.056
Identities = 11/43 (25%), Positives = 11/43 (25%)
Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQN 394
Q P Q Q PP Q PQ PV
Sbjct: 126 QQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168
Score = 33.5 bits (77), Expect = 0.15
Identities = 27/109 (24%), Positives = 37/109 (33%), Gaps = 16/109 (14%)
Query: 325 QRNTYIKHLEIVH-GVIT----SNSANMNIPLQHITPQVQLHQQQQ---QQPPPQPAHIP 376
+R YIK LE GV T S +N P Q Q QL +Q Q +Q P Q + +P
Sbjct: 47 ERWRYIKELENRQIGVPTPTEPSAGGEVNSPTQLTDEQRQLLEQMQADMRQQPTQLSEVP 106
Query: 377 -----PPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQ 420
P + +Q Q Q + +Q
Sbjct: 107 YNEQTPQVPR---STVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQP 152
Score = 29.6 bits (67), Expect = 2.6
Identities = 10/44 (22%), Positives = 10/44 (22%)
Query: 348 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQP 391
I Q Q Q Q P P Q Q P
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPA 163
Score = 29.6 bits (67), Expect = 3.0
Identities = 9/39 (23%), Positives = 12/39 (30%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ 388
P + +Q P QP P + Q P Q
Sbjct: 130 PPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168
Score = 28.5 bits (64), Expect = 6.0
Identities = 9/37 (24%), Positives = 11/37 (29%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP 386
P P+ Q Q P P P Q + P
Sbjct: 137 PQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAP 173
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 36.6 bits (85), Expect = 0.018
Identities = 15/59 (25%), Positives = 19/59 (32%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
P P Q +Q Q P P P PQ H P Q Q P + + +
Sbjct: 126 PQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAE 184
Score = 33.9 bits (78), Expect = 0.13
Identities = 19/76 (25%), Positives = 28/76 (36%), Gaps = 8/76 (10%)
Query: 353 HITPQVQLHQQQQQQPP-----PQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
+ Q Q QPP P+ PP+ +PP +P Q +Q Q Q
Sbjct: 79 QEHEAARPSPQHQYQPPYASAQPRQPVQQPPEA-QVPPQHAPRPAQPAPQPVQ--QPAYQ 135
Query: 408 QNIQQNLQQNIQQNIQ 423
+Q LQQ + +
Sbjct: 136 PQPEQPLQQPVSPQVA 151
Score = 31.2 bits (71), Expect = 1.0
Identities = 12/58 (20%), Positives = 18/58 (31%), Gaps = 8/58 (13%)
Query: 350 PLQHITPQVQ---LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQ 404
Q ++PQV QP Q P Q +PV P + ++
Sbjct: 142 LQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPV-----AAPQPEPVAEPAPVMDKPKR 194
Score = 30.4 bits (69), Expect = 1.8
Identities = 21/59 (35%), Positives = 25/59 (42%), Gaps = 2/59 (3%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
P QH Q Q QQP QP P Q Q + P + P Q +H + Q QQ Q
Sbjct: 114 PPQHAPRPAQPAPQPVQQPAYQPQPEQPLQ-QPVSPQVAPAP-QPVHSAPQPAQQAFQP 170
Score = 28.8 bits (65), Expect = 5.9
Identities = 19/79 (24%), Positives = 26/79 (32%), Gaps = 8/79 (10%)
Query: 358 VQLHQQQQQQPPPQPAHIPPPQIQH-IPPPIQH-QPVQNIHPSIQ------HIQQNLQQN 409
V++H+ Q P QH PP QP Q + + H + Q
Sbjct: 66 VRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPA 125
Query: 410 IQQNLQQNIQQNIQHLLQQ 428
Q Q Q + LQQ
Sbjct: 126 PQPVQQPAYQPQPEQPLQQ 144
>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein. The frequency clock
protein, is the central component of the frq-based
circadian negative feedback loop, regulates various
aspects of the circadian clock in Neurospora crassa.
This protein has been shown to interact with itself via
a coiled-coil.
Length = 989
Score = 36.9 bits (85), Expect = 0.021
Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 5/65 (7%)
Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPP----IQHQP 391
V ++ + +++ L + Q Q Q PP I P P I Q
Sbjct: 316 VRRTQSTLTPSVDAALPPVQAQQQEGTQMAPPQPP-SNFITNPFSSVPEPLREARILPQE 374
Query: 392 VQNIH 396
QN H
Sbjct: 375 GQNPH 379
>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
transition [Transcription / Cell division and chromosome
partitioning].
Length = 423
Score = 36.6 bits (84), Expect = 0.021
Identities = 22/77 (28%), Positives = 29/77 (37%), Gaps = 20/77 (25%)
Query: 107 NSVHKDVKKYNCE--ECDKYYLSQGELDSHVQTSHRGI------------------KPYV 146
KD K Y C C+K Y +Q L H+ H+ KPY
Sbjct: 341 MLKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYR 400
Query: 147 CEICSKAFTQQYNLKHH 163
CE+C K + LK+H
Sbjct: 401 CEVCDKRYKNLNGLKYH 417
Score = 36.6 bits (84), Expect = 0.022
Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 13/87 (14%)
Query: 141 GIKPYVCEI--CSKAFTQQYNLKHHLLLHGDGNSLFRCDQCGKAFSRKGHLVQHSFVHGG 198
KPY C + C+K + Q LK+H +LHG + + +
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYH-MLHG---------HQNQKLHENPSPEKMNIFSAK 395
Query: 199 EKPFKCSYCPKAFSHKGHLNEHLYTHS 225
+KP++C C K + + L H HS
Sbjct: 396 DKPYRCEVCDKRYKNLNGLKYHR-KHS 421
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 36.2 bits (83), Expect = 0.030
Identities = 17/64 (26%), Positives = 20/64 (31%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
P P VQ QQ P PQ Q + P Q P Q Q + +
Sbjct: 201 PKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSR 260
Query: 410 IQQN 413
Q N
Sbjct: 261 EQGN 264
Score = 33.9 bits (77), Expect = 0.14
Identities = 18/74 (24%), Positives = 25/74 (33%), Gaps = 1/74 (1%)
Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 411
+H Q Q P QP + P Q P Q P+ Q+ Q Q Q
Sbjct: 196 RHRHAPKPTQQPTVQNPAQQPT-VQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254
Query: 412 QNLQQNIQQNIQHL 425
+ + Q N + L
Sbjct: 255 GHKRSREQGNQEFL 268
Score = 33.5 bits (76), Expect = 0.19
Identities = 25/74 (33%), Positives = 28/74 (37%), Gaps = 14/74 (18%)
Query: 368 PPPQPAHIPPPQI------------QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 415
PP P P P + +H P P Q VQN P+ Q QN Q QQ Q
Sbjct: 173 PPNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQN--PAQQPTVQNPAQQPQQQPQ 230
Query: 416 QNIQQNIQHLLQQN 429
Q Q Q QN
Sbjct: 231 QQPVQPAQQPTPQN 244
Score = 33.5 bits (76), Expect = 0.22
Identities = 20/56 (35%), Positives = 23/56 (41%), Gaps = 3/56 (5%)
Query: 374 HIPPPQIQ-HIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQ 428
H P P Q + P Q VQN P+ Q QQ QQ +Q Q Q Q Q
Sbjct: 199 HAPKPTQQPTVQNPAQQPTVQN--PAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQT 252
Score = 29.6 bits (66), Expect = 3.5
Identities = 12/34 (35%), Positives = 13/34 (38%)
Query: 348 NIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQ 381
N Q Q Q QQP PQ PPQ +
Sbjct: 220 NPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTE 253
Score = 29.3 bits (65), Expect = 4.4
Identities = 16/44 (36%), Positives = 16/44 (36%), Gaps = 1/44 (2%)
Query: 348 NIPLQHITPQVQLHQQQQQQ-PPPQPAHIPPPQIQHIPPPIQHQ 390
N Q QQQ Q P QPA P PQ PP Q
Sbjct: 211 NPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 35.8 bits (82), Expect = 0.034
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)
Query: 354 ITPQVQLHQQQQQQPPPQPAHIPP-PQIQHIPPPIQHQPVQNIHP 397
I P +++ Q+ QP P PQ+Q P P Q I+P
Sbjct: 130 IDPTEEVNTQEPTQPAGVNVANNPQPQVQ---PQFGPNPQQRINP 171
Score = 30.0 bits (67), Expect = 2.1
Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 11/71 (15%)
Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 411
+ + Q+Q +Q +Q Q I P + + P Q P+ ++ N Q +Q
Sbjct: 110 EQMAEQLQRISEQNEQ---QAIEIDPTEEVNTQEPTQ--------PAGVNVANNPQPQVQ 158
Query: 412 QNLQQNIQQNI 422
N QQ I
Sbjct: 159 PQFGPNPQQRI 169
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 35.9 bits (83), Expect = 0.041
Identities = 29/164 (17%), Positives = 45/164 (27%), Gaps = 20/164 (12%)
Query: 266 CFSTKSELNR-----HAQIHGGVKPFACTVCNKSFLQKTQLQNHLYTHSKV---KPYNCG 317
CFS E NR H ++ GG KP + + ++ LQ+ + + +
Sbjct: 332 CFSNPEEANRAVTEMHGRMLGG-KPLYVALAQRKEQRRAHLQD-QFMQLQPRMRQLPMGS 389
Query: 318 ICAKSFVQRNTYIKHLEIVHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPA-HIP 376
+ Q Y + TP + P A P
Sbjct: 390 PMGGAMGQPPYYGQG---PQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP 446
Query: 377 PPQIQ-----HIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 415
Q P+ + P P Q + Q Q Q Q
Sbjct: 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQP-QSTASQGGQ 489
>gnl|CDD|220871 pfam10759, DUF2587, Protein of unknown function (DUF2587). This is
a bacterial family of proteins with no known function.
Length = 168
Score = 34.7 bits (80), Expect = 0.042
Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 6/57 (10%)
Query: 337 HGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 393
HG+ T+ + Q + + QL Q +++ PP PP Q Q Q
Sbjct: 116 HGIQTA------LFAQQMAARAQLEQMRRRALPPGVGIAPPGQPQGARGGPPPGTGQ 166
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 30.8 bits (70), Expect = 0.064
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 188 HLVQHSFVHGGEKPFKCSYCPKAFSH 213
+L +H H GEKP+KC C K+FS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 30.8 bits (70), Expect = 0.076
Identities = 13/26 (50%), Positives = 15/26 (57%)
Query: 272 ELNRHAQIHGGVKPFACTVCNKSFLQ 297
L RH + H G KP+ C VC KSF
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 30.1 bits (68), Expect = 0.13
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 301 LQNHLYTHSKVKPYNCGICAKSFVQ 325
L+ H+ TH+ KPY C +C KSF
Sbjct: 2 LRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 27.7 bits (62), Expect = 0.78
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 216 HLNEHLYTHSENKCFQCKYCPKTFT 240
+L H+ TH+ K ++C C K+F+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFS 25
Score = 27.7 bits (62), Expect = 1.0
Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 130 ELDSHVQTSHRGIKPYVCEICSKAFTQ 156
L H++T H G KPY C +C K+F+
Sbjct: 1 NLRRHMRT-HTGEKPYKCPVCGKSFSS 26
Score = 26.2 bits (58), Expect = 3.5
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 244 HLKAHTNKHEGVLPHTCLECSKCFST 269
+L+ H H G P+ C C K FS+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 25.0 bits (55), Expect = 7.9
Identities = 8/26 (30%), Positives = 10/26 (38%)
Query: 73 EFNEHIMKHEKEKLFSCNFCLKPFLR 98
H+ H EK + C C K F
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 35.3 bits (81), Expect = 0.070
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 6/41 (14%)
Query: 367 QPPPQPAHIP----PPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
QP PA IP PP IQ++ PP P Q I P IQHI+
Sbjct: 232 QPSAPPASIPAPPIPPVIQYVAPP-PVPPPQPIIP-IQHIR 270
Score = 29.5 bits (66), Expect = 3.6
Identities = 12/40 (30%), Positives = 13/40 (32%), Gaps = 1/40 (2%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQH 389
P Q P + A P P Q I PIQH
Sbjct: 230 PAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQPI-IPIQH 268
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 32.5 bits (74), Expect = 0.14
Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 1/38 (2%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 393
PQ Q+ Q P P +P Q+ + P H P+
Sbjct: 20 PQAQMPQASAPYPGPSMY-LPMAQVMAVGPQSSHPPMA 56
Score = 29.8 bits (67), Expect = 1.2
Identities = 11/60 (18%), Positives = 17/60 (28%), Gaps = 7/60 (11%)
Query: 356 PQVQLHQQQQQQPPP-----QPAHIPPPQIQHIPPPIQHQPVQNIHPSIQ--HIQQNLQQ 408
+L+Q PP A P P + + Q+ HP + I
Sbjct: 7 AYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPV 66
Score = 29.0 bits (65), Expect = 2.3
Identities = 12/49 (24%), Positives = 16/49 (32%), Gaps = 2/49 (4%)
Query: 344 SANMN-IPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQ-HIPPPIQHQ 390
A M + P + L Q PQ +H P PPP+
Sbjct: 21 QAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPP 69
>gnl|CDD|213398 cd12191, gal11_coact, gall11 coactivator domain. Gall11/MED15 acts
in the general regulation of GAL structural genes and is
required for full expression for several genes in this
pathway, including GALs 1,7, and 10 in Saccharomyces
cerevisiae. GAL11 function is dependent on GCN4
functionality and binds GCN4 in a degenerate manner with
multiple orientations found at the GCN4-Gal11 interface.
Length = 90
Score = 31.6 bits (72), Expect = 0.15
Identities = 26/89 (29%), Positives = 34/89 (38%), Gaps = 23/89 (25%)
Query: 356 PQVQLHQQQQQQPPPQPAH---------IPPP---QIQHIPPPI-----------QHQPV 392
Q Q QQQQQQ PQ IPP +I ++PP + Q+
Sbjct: 2 QQQQQPQQQQQQQMPQNPQLVNMMDNMPIPPQLLAKIPNLPPEVKTWRQLKAWLQQNNIP 61
Query: 393 QNIHPSIQHIQQNLQQNIQQNLQQNIQQN 421
I+ +QQ Q + Q QQ QQ
Sbjct: 62 PQAMEQIKEVQQTHFQLLMQRRQQQQQQQ 90
>gnl|CDD|188742 cd09356, LIM2_TRIP6, The second LIM domain of Thyroid
receptor-interacting protein 6 (TRIP6). The second LIM
domain of Thyroid receptor-interacting protein 6
(TRIP6): TRIP6 is a member of the zyxin LIM protein
family and contains three LIM zinc-binding domains at
the C-terminal. TRIP6 protein localizes to focal
adhesion sites and along actin stress fibers.
Recruitment of this protein to the plasma membrane
occurs in a lysophosphatidic acid (LPA)-dependent
manner. TRIP6 recruits a number of molecules involved
in actin assembly, cell motility, survival and
transcriptional control. The function of TRIP6 in cell
motility is regulated by Src-dependent phosphorylation
at a Tyr residue. The phosphorylation activates the
coupling to the Crk SH2 domain, which is required for
the function of TRIP6 in promoting lysophosphatidic
acid (LPA)-induced cell migration. TRIP6 can shuttle to
the nucleus to serve as a coactivator of AP-1 and
NF-kappaB transcriptional factors. Moreover, TRIP6 can
form a ternary complex with the NHERF2 PDZ protein and
LPA2 receptor to regulate LPA-induced activation of ERK
and AKT, rendering cells resistant to chemotherapy.
Recent evidence shows that TRIP6 antagonizes
Fas-Induced apoptosis by enhancing the antiapoptotic
effect of LPA in cells. As in other LIM domains, this
domain family is 50-60 amino acids in size and shares
two characteristic zinc finger motifs. The two zinc
fingers contain eight conserved residues, mostly
cysteines and histidines, which coordinately bond to
two zinc atoms. LIM domains function as adaptors or
scaffolds to support the assembly of multimeric
protein.
Length = 53
Score = 30.6 bits (69), Expect = 0.15
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 35 CELCYKMFSDKLLYEKHKMVHRQVFTCEVCMRSFN 69
C +C K +++L K H FTC VC RS +
Sbjct: 1 CSVCSKPIMERILRATGKAYHPHCFTCVVCHRSLD 35
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 33.5 bits (77), Expect = 0.23
Identities = 14/96 (14%), Positives = 30/96 (31%), Gaps = 11/96 (11%)
Query: 342 SNSANMNIP-LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPS-- 398
L+ +Q ++Q +PA + + IQ + N S
Sbjct: 17 RELLTAESAQLEAALQLLQEAVNSKRQEEAEPA---AEEAELQAELIQQELAINDQLSQA 73
Query: 399 ----IQHIQQNLQQNIQ-QNLQQNIQQNIQHLLQQN 429
+ + + Q NL + Q+ + + +Q
Sbjct: 74 LNQQTERLNALASDDRQLANLLLQLLQSSRTIREQI 109
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 31.8 bits (73), Expect = 0.27
Identities = 10/45 (22%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 385 PPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ-QNIQQNIQHLLQQ 428
PP + P +I P + ++Q + I ++ + Q + ++I L ++
Sbjct: 23 PPADYTPYGDISPFLPELRQKHKARIAKDPEFQYLNEDIAWLKEE 67
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 33.1 bits (75), Expect = 0.28
Identities = 42/164 (25%), Positives = 58/164 (35%), Gaps = 17/164 (10%)
Query: 200 KPFKCSYCPKAFSHKGHLNEHLYTHS----ENKCFQCKY--CPKTFTLRKHLKAHTNKHE 253
P K C +FS L HL + + K F C Y C K F+ LK H H
Sbjct: 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347
Query: 254 GVLPHTC--LECSKCFSTKSELNRHAQIHG-----GVKPFACTV--CNKSFLQKTQLQNH 304
+ P L S FS + K C ++F + + L H
Sbjct: 348 SISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLH 407
Query: 305 LYTHSKVKPYNC--GICAKSFVQRNTYIKHLEIVHGVITSNSAN 346
+ TH +PYNC C+KSF + I H +I +
Sbjct: 408 IITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSI 451
Score = 33.1 bits (75), Expect = 0.31
Identities = 36/261 (13%), Positives = 67/261 (25%), Gaps = 30/261 (11%)
Query: 171 NSLFRCDQCGKAFSRKGHLVQHSFVHGGEKPFKCSYCPKAFSHKGHLNEHLYTHSENKCF 230
C C +FSR HL +H H GEKP +CSY S L
Sbjct: 31 PRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLE------------ 78
Query: 231 QCKYCPKTFTLRKHLKAHTNKHEGVLPHTCLECSKCFSTKSELNRHAQIHGGVKPFACTV 290
L +HL+ H N + + S S + + +
Sbjct: 79 ----------LSRHLRTHHNNPSDLNSKS---LPLSNSKASSSSLSSSSSNSNDNNLLSS 125
Query: 291 CNKSFLQKTQLQNHLYTHSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVITSNSANMNIP 350
+ + L + S ++ S V + S + N+
Sbjct: 126 HSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLS 185
Query: 351 LQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNI 410
L + + P ++ P Q+ + + Q +++
Sbjct: 186 LLISSNVSTSIPSSSENSPLSSSYSIPSSSSD-----QNLENSSSSLPLTTNSQLSPKSL 240
Query: 411 QQNLQQNIQQNIQHLLQQNMH 431
++ +
Sbjct: 241 LSQSPSSLSSSDSSSSASESP 261
Score = 30.4 bits (68), Expect = 1.9
Identities = 48/202 (23%), Positives = 67/202 (33%), Gaps = 19/202 (9%)
Query: 151 SKAFTQQYNLKHHLLL-HGDGNSLFRCDQCGKAFSRKGHLVQH--SFVHGGE--KPFKC- 204
A +Q + + + QC +FSR L +H S H GE KPF C
Sbjct: 266 PTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCP 325
Query: 205 -SYCPKAFSHKGHLNEHLYTHSENKCFQCKYCPKTFTLRKHLKA-------HTNKHEGVL 256
S C K FS L H+ H+ + K + L +
Sbjct: 326 YSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDK 385
Query: 257 PHTCLECSKCFSTKSELNR--HAQIHGGVKP--FACTVCNKSFLQKTQLQNHLYTHSKVK 312
L S + K + N H H +P C+KSF + L H H+
Sbjct: 386 KSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHA 445
Query: 313 PYNCGICAKSFVQRNTYIKHLE 334
P C I KSF + H +
Sbjct: 446 PLLCSIL-KSFRRDLDLSNHGK 466
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 32.7 bits (75), Expect = 0.28
Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 1/58 (1%)
Query: 336 VHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQ 393
VH VI + I + + P Q QPPP+P P P+ + IP P + PV
Sbjct: 35 VHQVIELPAPAQPISVTMVAPADL-EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 91
>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
Provisional.
Length = 319
Score = 32.3 bits (73), Expect = 0.47
Identities = 20/80 (25%), Positives = 26/80 (32%), Gaps = 14/80 (17%)
Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 411
Q + Q Q Q +QQ Q + Q Q PV Q +
Sbjct: 151 QTLQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPV--------------QAQPR 196
Query: 412 QNLQQNIQQNIQHLLQQNMH 431
Q+ + QQ Q LLQ H
Sbjct: 197 QSKPASTQQPYQDLLQTPAH 216
Score = 29.3 bits (65), Expect = 4.6
Identities = 20/72 (27%), Positives = 27/72 (37%), Gaps = 2/72 (2%)
Query: 351 LQHITPQVQLHQQQQ-QQPPPQPAHIPPPQIQHI-PPPIQHQPVQNIHPSIQHIQQNLQQ 408
LQ QL +QQ+ Q Q + P+Q QP Q+ S Q Q+L Q
Sbjct: 153 LQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQ 212
Query: 409 NIQQNLQQNIQQ 420
Q+ Q
Sbjct: 213 TPAHTTAQSKPQ 224
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 32.3 bits (74), Expect = 0.48
Identities = 10/47 (21%), Positives = 13/47 (27%)
Query: 368 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNL 414
P P+ P P PP Q P+ P+ L
Sbjct: 230 SPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITL 276
Score = 30.8 bits (70), Expect = 1.5
Identities = 15/58 (25%), Positives = 20/58 (34%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
Q TP PPP PA PPP +Q +PP +P + +
Sbjct: 218 SFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys. The
C2H2 zinc finger is composed of two short beta strands
followed by an alpha helix. The amino terminal part of
the helix binds the major groove in DNA binding zinc
fingers. The accepted consensus binding sequence for Sp1
is usually defined by the asymmetric hexanucleotide core
GGGCGG but this sequence does not include, among others,
the GAG (=CTC) repeat that constitutes a high-affinity
site for Sp1 binding to the wt1 promoter.
Length = 22
Score = 28.5 bits (64), Expect = 0.50
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 203 KCSYCPKAFSHKGHLNEHLYTH 224
KC C K+FS K +L HL TH
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 28.1 bits (63), Expect = 0.64
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 146 VCEICSKAFTQQYNLKHHLLLH 167
C C K+F+++ NLK HL H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 27.7 bits (62), Expect = 0.88
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 231 QCKYCPKTFTLRKHLKAHTNKH 252
+C C K+F+ + +LK H H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 26.9 bits (60), Expect = 1.4
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 259 TCLECSKCFSTKSELNRHAQIH 280
C +C K FS KS L RH + H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 25.8 bits (57), Expect = 4.2
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 175 RCDQCGKAFSRKGHLVQHSFVH 196
+C CGK+FSRK +L +H H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 25.0 bits (55), Expect = 7.5
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 288 CTVCNKSFLQKTQLQNHLYTH 308
C C KSF +K+ L+ HL TH
Sbjct: 2 CPDCGKSFSRKSNLKRHLRTH 22
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 30.5 bits (69), Expect = 0.50
Identities = 17/57 (29%), Positives = 18/57 (31%), Gaps = 4/57 (7%)
Query: 346 NMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPP----PIQHQPVQNIHPS 398
P Q+ Q Q QQ PP P P Q H P P Q Q P
Sbjct: 7 TNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPP 63
Score = 28.5 bits (64), Expect = 2.5
Identities = 20/73 (27%), Positives = 24/73 (32%), Gaps = 8/73 (10%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQ 415
Q QQQQP Q P Q +PPP P Q + Q +Q+ Q
Sbjct: 7 TNQYPPQNQQQQPYQQ-----QPYHQQMPPPPYSPPQQ---QQGHFMPPQPQPYPKQSPQ 58
Query: 416 QNIQQNIQHLLQQ 428
Q L Q
Sbjct: 59 QQQPPQFSSFLSQ 71
>gnl|CDD|235654 PRK05957, PRK05957, aspartate aminotransferase; Provisional.
Length = 389
Score = 32.0 bits (73), Expect = 0.61
Identities = 15/41 (36%), Positives = 19/41 (46%)
Query: 368 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
PPP A P ++Q VQ I P ++ I Q LQQ
Sbjct: 40 PPPPEAIEALNNFLANPENHKYQAVQGIPPLLEAITQKLQQ 80
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 28.2 bits (63), Expect = 0.67
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 202 FKCSYCPKAFSHKGHLNEHLYTH 224
++C C K F K L EH+ TH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.3 bits (58), Expect = 2.7
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 258 HTCLECSKCFSTKSELNRHAQIH 280
+ C EC K F +KS L H + H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.3 bits (58), Expect = 2.7
Identities = 8/23 (34%), Positives = 11/23 (47%)
Query: 145 YVCEICSKAFTQQYNLKHHLLLH 167
Y C C K F + L+ H+ H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.3 bits (58), Expect = 2.8
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 286 FACTVCNKSFLQKTQLQNHLYTH 308
+ C C K F K+ L+ H+ TH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.3 bits (58), Expect = 3.2
Identities = 6/23 (26%), Positives = 10/23 (43%)
Query: 59 FTCEVCMRSFNDRKEFNEHIMKH 81
+ C C + F + EH+ H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 25.1 bits (55), Expect = 7.9
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 174 FRCDQCGKAFSRKGHLVQHSFVH 196
+RC +CGK F K L +H H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 24.7 bits (54), Expect = 8.8
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 230 FQCKYCPKTFTLRKHLKAHTNKH 252
++C C K F + L+ H H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
Provisional.
Length = 172
Score = 30.7 bits (69), Expect = 0.82
Identities = 12/36 (33%), Positives = 15/36 (41%)
Query: 343 NSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPP 378
+ N + P Q Q QQQ + P QPA P
Sbjct: 126 GNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161
>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
Length = 332
Score = 31.2 bits (71), Expect = 1.1
Identities = 14/89 (15%), Positives = 30/89 (33%), Gaps = 13/89 (14%)
Query: 348 NIPLQHITPQVQLHQQQQQQP-----PPQPAHIPPPQ-IQHIPPPIQHQPVQNIHPSIQH 401
+P Q +Q H+ + QP I+ P P++ I+ S +
Sbjct: 104 TLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTIEETQSPNV--PIEGINSSSEQ 161
Query: 402 IQQNLQQNIQQNLQQNIQQNIQHLLQQNM 430
++ L + + I + H ++
Sbjct: 162 LRVELAELAAE-----IYSDASHRVELAK 185
Score = 30.5 bits (69), Expect = 1.8
Identities = 7/76 (9%), Positives = 20/76 (26%), Gaps = 4/76 (5%)
Query: 324 VQRNTYIKHLEIVHGVITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHI 383
Q N +E +N + +V+L + + + + +
Sbjct: 133 TQTNVASMTIEETQSPNVPI-EGINSSSEQ--LRVELAELAAEIYSDASHRVELAK-NFM 188
Query: 384 PPPIQHQPVQNIHPSI 399
P + + +
Sbjct: 189 EPQAETEAQPEATTNF 204
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.5 bits (72), Expect = 1.1
Identities = 11/70 (15%), Positives = 13/70 (18%), Gaps = 4/70 (5%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQ-IQHIPPPIQHQPVQNIHP--SIQHIQQNLQQNIQ 411
TP P P P P N
Sbjct: 378 TPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPP 437
Query: 412 Q-NLQQNIQQ 420
NL++ QQ
Sbjct: 438 SLNLEELWQQ 447
>gnl|CDD|114474 pfam05750, Rubella_Capsid, Rubella capsid protein. Rubella virus
is an enveloped positive-strand RNA virus of the family
Togaviridae. Virions are composed of three structural
proteins: a capsid and two membrane-spanning
glycoproteins, E2 and E1. During virus assembly, the
capsid interacts with genomic RNA to form nucleocapsids.
It has been discovered that capsid phosphorylation
serves to negatively regulate binding of viral genomic
RNA. This may delay the initiation of nucleocapsid
assembly until sufficient amounts of virus glycoproteins
accumulate at the budding site and/or prevent
non-specific binding to cellular RNA when levels of
genomic RNA are low. It follows that at a late stage in
replication, the capsid may undergo dephosphorylation
before nucleocapsid assembly occurs.
Length = 300
Score = 31.0 bits (69), Expect = 1.2
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
Query: 356 PQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ 388
P + + + Q P P+P+ PP Q Q PP +Q
Sbjct: 80 PPEERQESRSQTPAPKPSRAPPQQPQ--PPRMQ 110
>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
Length = 94
Score = 29.3 bits (65), Expect = 1.2
Identities = 14/44 (31%), Positives = 16/44 (36%)
Query: 368 PPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQ 411
PP P PPP P +PV P I+ Q Q Q
Sbjct: 16 LPPAPVPPPPPAPPAPVPEPTVKPVNAEAPKIREAQSKRGQKQQ 59
>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein. The Polycystin
Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
huge protein of 4303aas. Its repeated leucine-rich (LRR)
segment is found in many proteins. It contains 16
polycystic kidney disease (PKD) domains, one
LDL-receptor class A domain, one C-type lectin family
domain, and 16-18 putative TMSs in positions between
residues 2200 and 4100. Polycystin-L has been shown to
be a cation (Na+, K+ and Ca2+) channel that is activated
by Ca2+. Two members of the PCC family (polycystin 1 and
2) are mutated in autosomal dominant polycystic kidney
disease, and polycystin-L is deleted in mice with renal
and retinal defects. Note: this model is restricted to
the amino half for technical reasons.
Length = 2740
Score = 31.6 bits (71), Expect = 1.2
Identities = 13/44 (29%), Positives = 17/44 (38%)
Query: 360 LHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQ 403
PPP A P ++HI P + P+Q H I Q
Sbjct: 153 ESLCSGPPPPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQ 196
>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
Length = 230
Score = 30.5 bits (68), Expect = 1.3
Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 5/43 (11%)
Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQ---HIPPPIQHQP 391
H PQ + PPP P P P Q H PP+ QP
Sbjct: 40 HHPNPQED--PDKNPSPPPDPPLTPRPPAQPNGHNKPPVTKQP 80
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.8 bits (69), Expect = 1.7
Identities = 23/84 (27%), Positives = 25/84 (29%), Gaps = 4/84 (4%)
Query: 349 IPLQHITPQVQLHQQQQQQPP-PQPAHIPPPQIQHIPP---PIQHQPVQNIHPSIQHIQQ 404
+P H Q Q QQ QPP P H PP +Q PV HPS Q
Sbjct: 237 LPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQP 296
Query: 405 NLQQNIQQNLQQNIQQNIQHLLQQ 428
Q Q H
Sbjct: 297 FGLAQSQVPPLPLPSQAQPHSHTP 320
Score = 29.7 bits (66), Expect = 4.4
Identities = 15/50 (30%), Positives = 19/50 (38%)
Query: 352 QHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQH 401
H Q QQPP + P P + HI PP Q + S +H
Sbjct: 315 PHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKH 364
>gnl|CDD|181818 PRK09393, ftrA, transcriptional activator FtrA; Provisional.
Length = 322
Score = 30.3 bits (69), Expect = 1.9
Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 4/39 (10%)
Query: 375 IPP----PQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQN 409
+PP Q Q +P P+ + + P I ++ +L +
Sbjct: 196 VPPHRDGGQAQFVPRPVASRESDRLGPLIDWMRAHLAEP 234
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 30.2 bits (69), Expect = 2.1
Identities = 15/91 (16%), Positives = 38/91 (41%), Gaps = 14/91 (15%)
Query: 351 LQHITPQVQLHQQQQ-------------QQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHP 397
L+ +P+ L QQQQ ++ Q + + ++
Sbjct: 298 LKFQSPERLLAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQRRIERAQQ 357
Query: 398 SIQHIQQNLQQNIQQNLQQNIQQNIQHLLQQ 428
++ ++Q L++ +++ L++ +Q ++ L QQ
Sbjct: 358 RLEQLEQRLRRAMRRQLKRK-RQRLEALAQQ 387
>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
Length = 459
Score = 30.3 bits (68), Expect = 2.2
Identities = 8/24 (33%), Positives = 10/24 (41%)
Query: 361 HQQQQQQPPPQPAHIPPPQIQHIP 384
QQQQ P Q I +Q +
Sbjct: 430 PHNQQQQEPLQNPDIYTKDLQRLT 453
>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
Length = 542
Score = 30.2 bits (69), Expect = 2.4
Identities = 6/18 (33%), Positives = 11/18 (61%)
Query: 403 QQNLQQNIQQNLQQNIQQ 420
Q +Q +++ LQQ I+
Sbjct: 235 QLKYKQQVEKRLQQRIES 252
Score = 29.5 bits (67), Expect = 3.6
Identities = 6/21 (28%), Positives = 13/21 (61%)
Query: 407 QQNIQQNLQQNIQQNIQHLLQ 427
Q +Q +++ +QQ I+ +L
Sbjct: 235 QLKYKQQVEKRLQQRIESILA 255
>gnl|CDD|221266 pfam11851, DUF3371, Domain of unknown function (DUF3371). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 125 to 142 amino acids in length.
Length = 131
Score = 28.9 bits (65), Expect = 2.5
Identities = 11/66 (16%), Positives = 20/66 (30%), Gaps = 9/66 (13%)
Query: 336 VHGVITSNSANMNIPLQHITPQVQLH---QQQQQQPPPQPAHIPPPQIQHIPPPIQHQPV 392
HG+ + L T + Q+ Q + P + P Q PP +
Sbjct: 5 AHGL------PVVASLGLCTADLMARVIKQEPQLEDNQCPQDLYPHSSQPDLPPPTTLDL 58
Query: 393 QNIHPS 398
+ +
Sbjct: 59 NDGTIT 64
>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
Provisional.
Length = 450
Score = 30.1 bits (68), Expect = 2.5
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 85 KLFSCNFCLKPFLRKDLLEAH 105
KL+ C FCLK RK+ L+ H
Sbjct: 197 KLYFCEFCLKFMKRKEQLQRH 217
>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631). The
members of this family are sequences derived from a
group of hypothetical proteins expressed by certain
bacterial species. The region concerned is approximately
440 amino acid residues in length.
Length = 729
Score = 29.9 bits (68), Expect = 2.7
Identities = 13/72 (18%), Positives = 26/72 (36%), Gaps = 5/72 (6%)
Query: 357 QVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQ 416
Q L Q + P P P+ + + S+Q QQ + +
Sbjct: 266 QGLLAQARSSSAAPAAPAGQAPGSA---QPLSTRELLAALSSLQ--QQAPALALGEPADF 320
Query: 417 NIQQNIQHLLQQ 428
++++ +Q LL +
Sbjct: 321 DLRELLQQLLSR 332
>gnl|CDD|212564 cd11674, lambda-1, inner capsid protein lambda-1 or VP3. The
reovirus inner capsid protein lambda-1 displays
nucleoside triphosphate phosphohydrolase (NTPase),
RNA-5'-triphosphatase (RTPase), and RNA helicase
activity and may play a role in the transcription of the
virus genome, the unwinding or reannealing of
double-stranded RNA during RNA synthesis. The RTPase
activity constitutes the first step in the capping of
RNA, resulting in a 5'-diphosphorylated RNA plus-strand.
lambda1 is an Orthoreovirus core protein, VP3 is the
homologous core protein in Aquareoviruses.
Length = 1166
Score = 30.1 bits (68), Expect = 2.7
Identities = 10/35 (28%), Positives = 11/35 (31%), Gaps = 1/35 (2%)
Query: 276 HAQIHGGVKPFACTVCNKSFLQKTQLQNHLYT-HS 309
+ C VC F QL H T HS
Sbjct: 64 KKATPINPSSYVCNVCMAEFSSMDQLAEHQRTTHS 98
>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
Length = 84
Score = 28.0 bits (62), Expect = 2.8
Identities = 16/41 (39%), Positives = 18/41 (43%), Gaps = 7/41 (17%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNI 395
TP+ Q Q QQQ P Q I PQ P QH Q +
Sbjct: 51 TPEQQQQQPQQQIAPNQ---IEAPQ----PNFQQHYQGQQV 84
>gnl|CDD|192241 pfam09280, XPC-binding, XPC-binding domain. Members of this family
adopt a structure consisting of four alpha helices,
arranged in an array. They bind specifically and
directly to the xeroderma pigmentosum group C protein
(XPC) to initiate nucleotide excision repair.
Length = 59
Score = 27.2 bits (61), Expect = 3.2
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 5/39 (12%)
Query: 396 HPSIQHIQQNLQQN---IQQNLQQNIQQNIQ--HLLQQN 429
+P Q ++Q +QQN + LQQ Q N Q L+QQN
Sbjct: 7 NPQFQQLRQMVQQNPELLPPLLQQLGQSNPQLLQLIQQN 45
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 30.0 bits (67), Expect = 3.2
Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 5/61 (8%)
Query: 355 TPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQ---HQPVQNIHPSIQHIQQNLQQNIQ 411
P Q QQ+P P PPPQ P +Q P+ Q ++Q L ++
Sbjct: 777 QPPPQAPPAPQQRPRGAPTPQPPPQA--GPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834
Query: 412 Q 412
+
Sbjct: 835 R 835
>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein;
Provisional.
Length = 177
Score = 29.0 bits (65), Expect = 3.5
Identities = 7/28 (25%), Positives = 10/28 (35%)
Query: 350 PLQHITPQVQLHQQQQQQPPPQPAHIPP 377
P + QQ P P++ PP
Sbjct: 141 PQGGNQFSGGAQSRPQQSAPAAPSNEPP 168
>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
Length = 128
Score = 28.3 bits (63), Expect = 3.6
Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 3/53 (5%)
Query: 140 RGIKPYVCEICSKAFTQQYNLKHHLLLHGDGNSLFRCDQCGKAFSRKGHLVQH 192
+ + PYVC +C F+ +LK H+ C CGK F + H
Sbjct: 69 KAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV---CPVCGKEFRNTDSTLDH 118
Score = 28.3 bits (63), Expect = 4.1
Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 2/52 (3%)
Query: 59 FTCEVCMRSFNDRKEFNEHIMKHEKEKLFSCNFCLKPFLRKDLLEAHVNSVH 110
+ C +C+ F+ +HI E K+ C C K F D HV H
Sbjct: 74 YVCPLCLMPFSSSVSLKQHIRYTEHSKV--CPVCGKEFRNTDSTLDHVCKKH 123
Score = 27.5 bits (61), Expect = 6.3
Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 6/62 (9%)
Query: 283 VKPFACTVCNKSFLQKTQLQNHL-YT-HSKVKPYNCGICAKSFVQRNTYIKHLEIVHGVI 340
V P+ C +C F L+ H+ YT HSKV C +C K F ++ + H+ H +
Sbjct: 71 VSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV----CPVCGKEFRNTDSTLDHVCKKHNIC 126
Query: 341 TS 342
S
Sbjct: 127 VS 128
>gnl|CDD|226535 COG4049, COG4049, Uncharacterized protein containing archaeal-type
C2H2 Zn-finger [General function prediction only].
Length = 65
Score = 27.3 bits (60), Expect = 3.6
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 169 DGNSLFRCDQCGKAFSRKGHLVQH 192
DG RC +CG F R+ ++H
Sbjct: 13 DGEEFLRCPRCGMVFRRRKDYIRH 36
>gnl|CDD|188739 cd09353, LIM2_Zyxin, The second LIM domain of Zyxin. The second
LIM domain of Zyxin: Zyxin exhibits three copies of the
LIM domain, an extensive proline-rich domain and a
nuclear export signal. Localized at sites of
cellsubstratum adhesion in fibroblasts, Zyxin interacts
with alpha-actinin, members of the cysteine-rich
protein (CRP) family, proteins that display Src
homology 3 (SH3) domains and Ena/VASP family members.
Zyxin and its partners have been implicated in the
spatial control of actin filament assembly as well as
in pathways important for cell differentiation. In
addition to its functions at focal adhesion plaques,
recent work has shown that zyxin moves from the sites
of cell contacts to the nucleus, where it directly
participates in the regulation of gene expression. As
in other LIM domains, this domain family is 50-60 amino
acids in size and shares two characteristic zinc finger
motifs. The two zinc fingers contain eight conserved
residues, mostly cysteines and histidines, which
coordinately bond to two zinc atoms. LIM domains
function as adaptors o r scaffolds to support the
assembly of multimeric protein.
Length = 60
Score = 26.8 bits (59), Expect = 3.7
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 35 CELCYKMFSDKLLYEKHKMVHRQVFTCEVC 64
C +C + +D++L K H Q FTC VC
Sbjct: 1 CAVCDQKITDRMLKATGKSYHPQCFTCVVC 30
>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
ABF1 is a sequence-specific DNA binding protein involved
in transcription activation, gene silencing and
initiation of DNA replication. ABF1 is known to remodel
chromatin, and it is proposed that it mediates its
effects on transcription and gene expression by
modifying local chromatin architecture. These functions
require a conserved stretch of 20 amino acids in the
C-terminal region of ABF1 (amino acids 639 to 662 S.
cerevisiae). The N-terminal two thirds of the protein
are necessary for DNA binding, and the N-terminus (amino
acids 9 to 91 in S. cerevisiae) is thought to contain a
novel zinc-finger motif which may stabilise the protein
structure.
Length = 508
Score = 29.7 bits (66), Expect = 3.8
Identities = 23/84 (27%), Positives = 30/84 (35%), Gaps = 13/84 (15%)
Query: 349 IPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQQ 408
+P H HQ Q P PAH QHQ + QH Q+ QQ
Sbjct: 280 LPNAHQISGADHHQHQHHHHPSIPAHH------------QHQLPEGHQHDHQHHHQHQQQ 327
Query: 409 NIQ-QNLQQNIQQNIQHLLQQNMH 431
++Q Q Q N H+ Q +
Sbjct: 328 DLQHQEQHQQHVDNDHHVYQDRIG 351
>gnl|CDD|234764 PRK00442, tatA, twin arginine translocase protein A; Provisional.
Length = 92
Score = 27.5 bits (61), Expect = 4.0
Identities = 9/28 (32%), Positives = 12/28 (42%)
Query: 369 PPQPAHIPPPQIQHIPPPIQHQPVQNIH 396
+PA P P Q +PP P+ H
Sbjct: 49 EDKPAEQPAPPAQPVPPVQNTAPLNQPH 76
>gnl|CDD|188740 cd09354, LIM2_LPP, The second LIM domain of lipoma preferred
partner (LPP). The second LIM domain of lipoma
preferred partner (LPP): LPP is a member of the zyxin
LIM protein family and contains three LIM zinc-binding
domains at the C-terminal and proline-rich region at
the N-terminal. LPP initially identified as the most
frequent translocation partner of HMGA2 (High Mobility
Group A2) in a subgroup of benign tumors of adipose
tissue (lipomas). It was also shown to be rearranged in
a number of other soft tissues, as well as in a case of
acute monoblastic leukemia. In addition to its
involvement in tumors, LPP was inedited as a smooth
muscle restricted LIM protein that plays an important
role in SMC migration. LPP is localized at sites of
cell adhesion, cell-cell contacts and transiently in
the nucleus. In nucleus, it acts as a coactivator for
the ETS domain transcription factor PEA3. In addition
to PEA3, it interacts with alpha-actinin,vasodilator
stimulated phosphoprotein (VASP),Palladin, and Scrib.
The LIM domains are the main focal adhesion targeting
elements and that the proline- rich region, which
harbors binding sites for alpha-actinin and
vasodilator- stimulated phosphoprotein (VASP), has a
weak targeting capacity. As in other LIM domains, this
domain family is 50-60 amino acids in size and shares
two characteristic zinc finger motifs. The two zinc
fingers contain eight conserved residues, mostly
cysteines and histidines, which coordinately bond to
two zinc atoms. LIM domains function as adaptors or
scaffolds to support the assembly of multimeric
protein.
Length = 60
Score = 26.7 bits (59), Expect = 4.1
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 35 CELCYKMFSDKLLYEKHKMVHRQVFTCEVCMRSFN 69
C +C K D++L K H Q FTC VC +S +
Sbjct: 1 CSVCSKPILDRILRATGKPYHPQCFTCVVCGKSLD 35
>gnl|CDD|220093 pfam09030, Creb_binding, Creb binding. The Creb binding domain
assumes a structure comprising of three alpha-helices
which pack in a bundle, exposing a hydrophobic groove
between alpha-1 and alpha-3 within which complimentary
domains found in the protein 'activator for thyroid
hormone and retinoid receptors' (ACTR) can dock. Docking
of these domains is required for the recruitment of RNA
polymerase II and the basal transcription machinery.
Length = 104
Score = 27.7 bits (61), Expect = 4.7
Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 2/79 (2%)
Query: 355 TPQVQLHQQQQQQPP-PQPAHIPPPQI-QHIPPPIQHQPVQNIHPSIQHIQQNLQQNIQQ 412
PQ L QQQQ QP P+P Q P P QP + + Q+L + ++
Sbjct: 6 WPQGPLPQQQQMQPGMPRPVMQMVAQHAVAGPRPGLVQPGISRGIVSPNALQDLLRTLKS 65
Query: 413 NLQQNIQQNIQHLLQQNMH 431
QQ + ++L+ N
Sbjct: 66 PSSPQQQQQVLNILKSNPQ 84
>gnl|CDD|219971 pfam08690, GET2, GET complex subunit GET2. This family corresponds
to the GET complex subunit GET2. The GET complex is
involved in the retrieval of ER resident proteins from
the Golgi.
Length = 298
Score = 29.0 bits (65), Expect = 5.1
Identities = 16/60 (26%), Positives = 22/60 (36%), Gaps = 7/60 (11%)
Query: 369 PPQPAHIP-PPQIQHI------PPPIQHQPVQNIHPSIQHIQQNLQQNIQQNLQQNIQQN 421
AH P+IQ I P P +NI P ++ QQ + Q N N +
Sbjct: 56 STTSAHDQSTPEIQDILEAIDPPKDESESPAENIDPEVEMFQQLAKMQQQGNGSDNPPAD 115
>gnl|CDD|201555 pfam01023, S_100, S-100/ICaBP type calcium binding domain. The
S-100 domain is a subfamily of the EF-hand calcium
binding proteins.
Length = 44
Score = 26.0 bits (58), Expect = 5.5
Identities = 10/36 (27%), Positives = 19/36 (52%)
Query: 102 LEAHVNSVHKDVKKYNCEECDKYYLSQGELDSHVQT 137
LE + ++ +Y+ +E DK LS+ EL ++
Sbjct: 1 LEKAIETIINVFHQYSGKEGDKDTLSKKELKELLEK 36
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 25.3 bits (55), Expect = 6.3
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 202 FKCSYCPKAFSHKGHLNEHLYTH 224
FKC C K+FS K L HL H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 25.3 bits (55), Expect = 7.5
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 230 FQCKYCPKTFTLRKHLKAHTNKH 252
F+C C K+F+ + LK H KH
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
>gnl|CDD|218161 pfam04589, RFX1_trans_act, RFX1 transcription activation region.
The RFX family is a family of winged-helix DNA binding
proteins. RFX1 is a regulatory factor essential for
expression of MHC class II genes. This region is to
found N terminal to the RFX DNA binding region
(pfam02257) in some mammalian RFX proteins, and is
thought to activate transcription when associated with
DNA. Deletion analysis has identified the region 233-351
in human RFX1 as being required for maximal activation.
Length = 150
Score = 28.0 bits (62), Expect = 6.4
Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 2/66 (3%)
Query: 349 IPLQ-HITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ 407
+ LQ + Q + QQ+ + +Q +P Q Q + +QH+
Sbjct: 14 VALQTSVPAQAPVPASQQRSVVQATSQTKGGPVQQLPVHRVQQVPQQV-QQVQHVYPAQV 72
Query: 408 QNIQQN 413
Q ++
Sbjct: 73 QYVEGG 78
>gnl|CDD|237481 PRK13723, PRK13723, conjugal transfer pilus assembly protein TraH;
Provisional.
Length = 451
Score = 28.6 bits (64), Expect = 6.6
Identities = 5/42 (11%), Positives = 19/42 (45%), Gaps = 5/42 (11%)
Query: 392 VQNIHPSIQHIQQNLQQ-----NIQQNLQQNIQQNIQHLLQQ 428
+ ++ ++ Q+ + +QQ+ + + + ++ QQ
Sbjct: 393 MDHLRENLNQAQRQIAAFQSQVQVQQDALLVVDRQMSYMRQQ 434
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.7 bits (65), Expect = 6.8
Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 2/30 (6%)
Query: 403 QQNLQQNIQQNLQQ--NIQQNIQHLLQQNM 430
Q NL N+Q N QQ ++ ++Q L Q +
Sbjct: 426 QLNLAINLQLNQQQLLSVSDSLQSTLTQQI 455
>gnl|CDD|217789 pfam03915, AIP3, Actin interacting protein 3.
Length = 424
Score = 28.8 bits (65), Expect = 6.8
Identities = 14/75 (18%), Positives = 27/75 (36%), Gaps = 2/75 (2%)
Query: 354 ITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQPVQNIHPSIQHIQQNLQ--QNIQ 411
I ++ QQ + +PA PPP +Q +++ L +
Sbjct: 108 ILQRLSELQQNTAKEVARPAAAPPPSSLPNSSSSPDSINPAGLKELQSLRRELAVLRQTH 167
Query: 412 QNLQQNIQQNIQHLL 426
+ + IQ +I +L
Sbjct: 168 SDFKSEIQASISSIL 182
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 28.7 bits (64), Expect = 7.1
Identities = 9/23 (39%), Positives = 10/23 (43%)
Query: 349 IPLQHITPQVQLHQQQQQQPPPQ 371
I L + PQ Q Q QQ Q
Sbjct: 300 IQLNFVIPQQAQQQGQGQQQQAQ 322
>gnl|CDD|182696 PRK10748, PRK10748, flavin mononucleotide phosphatase; Provisional.
Length = 238
Score = 28.2 bits (63), Expect = 7.6
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 392 VQNIHPSIQHIQQNLQQNIQQNLQQ 416
VQN HP+++ Q Q ++Q L++
Sbjct: 39 VQNYHPALRSFQNEDLQRLRQALRE 63
>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
Provisional.
Length = 182
Score = 27.9 bits (62), Expect = 8.9
Identities = 12/51 (23%), Positives = 18/51 (35%), Gaps = 2/51 (3%)
Query: 340 ITSNSANMNIPLQHITPQVQLHQQQQQQPPPQPAHIPPPQIQHIPPPIQHQ 390
+ N N P + P + Q Q Q A+ Q Q PP ++
Sbjct: 121 SFHENFNNNYPGNYNNPSQDPYMNQAQS-YNQNAYAKENQ-QAQPPKYENS 169
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.134 0.433
Gapped
Lambda K H
0.267 0.0704 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,374,955
Number of extensions: 1848299
Number of successful extensions: 5416
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4650
Number of HSP's successfully gapped: 346
Length of query: 431
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 331
Effective length of database: 6,502,202
Effective search space: 2152228862
Effective search space used: 2152228862
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 60 (26.9 bits)