RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6088
(289 letters)
>gnl|CDD|197673 smart00352, POU, Found in Pit-Oct-Unc transcription factors.
Length = 75
Score = 144 bits (365), Expect = 3e-44
Identities = 59/75 (78%), Positives = 63/75 (84%)
Query: 148 DETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNM 207
D+ TD ELE FAK FKQRRIKLGFTQ DVGLA+G LYG FSQTTI RFEAL LSFKNM
Sbjct: 1 DDDTDPRELEAFAKQFKQRRIKLGFTQADVGLALGALYGPAFSQTTICRFEALQLSFKNM 60
Query: 208 CKLKPLLQKWLSDAD 222
CKLKPLLQKWL +A+
Sbjct: 61 CKLKPLLQKWLEEAE 75
>gnl|CDD|189427 pfam00157, Pou, Pou domain - N-terminal to homeobox domain.
Length = 75
Score = 140 bits (356), Expect = 9e-43
Identities = 60/75 (80%), Positives = 67/75 (89%)
Query: 148 DETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNM 207
D+TTDL+ELEQFAK FKQRRIKLG+TQ DVGLA+G LYG FSQTTI RFEAL LSFKNM
Sbjct: 1 DDTTDLDELEQFAKEFKQRRIKLGYTQADVGLALGALYGPVFSQTTICRFEALQLSFKNM 60
Query: 208 CKLKPLLQKWLSDAD 222
CKLKPLL+KWL +A+
Sbjct: 61 CKLKPLLEKWLEEAE 75
>gnl|CDD|200956 pfam00046, Homeobox, Homeobox domain.
Length = 57
Score = 41.7 bits (99), Expect = 1e-05
Identities = 15/39 (38%), Positives = 21/39 (53%)
Query: 245 RKKRTSIETSVRVALEKAFLQNPKPTSEEISVLADNLCM 283
R+KRT+ LEK F +N P++EE LA L +
Sbjct: 1 RRKRTTFTPEQLEELEKEFEKNRYPSAEEREELAKKLGL 39
>gnl|CDD|197696 smart00389, HOX, Homeodomain. DNA-binding factors that are
involved in the transcriptional regulation of key
developmental processes.
Length = 57
Score = 37.6 bits (88), Expect = 3e-04
Identities = 17/38 (44%), Positives = 21/38 (55%)
Query: 244 RRKKRTSIETSVRVALEKAFLQNPKPTSEEISVLADNL 281
+R+KRTS LEK F +NP P+ EE LA L
Sbjct: 1 KRRKRTSFTPEQLEELEKEFQKNPYPSREEREELAKKL 38
>gnl|CDD|238039 cd00086, homeodomain, Homeodomain; DNA binding domains involved in
the transcriptional regulation of key eukaryotic
developmental processes; may bind to DNA as monomers or
as homo- and/or heterodimers, in a sequence-specific
manner.
Length = 59
Score = 36.1 bits (84), Expect = 0.001
Identities = 16/37 (43%), Positives = 19/37 (51%)
Query: 245 RKKRTSIETSVRVALEKAFLQNPKPTSEEISVLADNL 281
R+KRT LEK F +NP P+ EE LA L
Sbjct: 1 RRKRTRFTPEQLEELEKEFEKNPYPSREEREELAKEL 37
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 37.5 bits (88), Expect = 0.007
Identities = 22/108 (20%), Positives = 42/108 (38%), Gaps = 8/108 (7%)
Query: 14 NHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSS-AANS 72
+ L + L L+ QQ L + Q LQ L++ + + S +
Sbjct: 292 DQLARRLKFQSPERL----LAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQQQN 347
Query: 73 PTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKL---SPQS 117
P +AQ L+ Q+ +QL++ +++ Q+L SP +
Sbjct: 348 PQRRIERAQQRLEQLEQRLRRAMRRQLKRKRQRLEALAQQLEALSPLA 395
Score = 31.3 bits (72), Expect = 0.56
Identities = 23/114 (20%), Positives = 43/114 (37%), Gaps = 17/114 (14%)
Query: 36 LQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQ----VQQA 91
L Q LQQ Q L +++ + ++ A P+ Q Q +QQ
Sbjct: 265 LLQRLQQLQQRLARAMRRRL----EQKRQRLDQLARRLKFQSPERLLAQQQQRLDRLQQR 320
Query: 92 VAQAAQ-QLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSNIRVPAHHIKQTPARL 144
+ +A + +L+ +++ Q+L Q+ P I ++Q RL
Sbjct: 321 LQRALERRLRLAKQRLERLSQRLQQQN--------PQRRIERAQQRLEQLEQRL 366
Score = 30.9 bits (71), Expect = 0.71
Identities = 18/79 (22%), Positives = 32/79 (40%), Gaps = 2/79 (2%)
Query: 32 ELSTLQQTL--QQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQ 89
L L + L Q + L Q Q+ Q Q+ + L +Q Q Q
Sbjct: 290 RLDQLARRLKFQSPERLLAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQ 349
Query: 90 QAVAQAAQQLQQLQKQSHH 108
+ + +A Q+L+QL+++
Sbjct: 350 RRIERAQQRLEQLEQRLRR 368
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 36.9 bits (86), Expect = 0.010
Identities = 22/98 (22%), Positives = 39/98 (39%), Gaps = 10/98 (10%)
Query: 30 PSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQ 89
P + L LQQ +L+ QL+ ++ SQ A + A L +++
Sbjct: 137 PEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQEL------VALEGLAAELE 190
Query: 90 QAVAQAAQQLQQLQKQ----SHHHHQKLSPQSQQSGNT 123
+ + QL+QLQ++ S QK + Q+
Sbjct: 191 EKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKR 228
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 36.7 bits (85), Expect = 0.014
Identities = 20/130 (15%), Positives = 26/130 (20%), Gaps = 11/130 (8%)
Query: 12 LQNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAAN 71
+ P L P +L LQQ Q + Q
Sbjct: 219 FLPAPSQAPAQPPL----PPQLPQQPPPLQQP----QFPGLSQQMPPPPPQPPQQQQQPP 270
Query: 72 SPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSNIR 131
P PP Q + Q +Q PQ QQ G
Sbjct: 271 QPQAQPPPQN---QPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQL 327
Query: 132 VPAHHIKQTP 141
+
Sbjct: 328 SQQQREALSQ 337
Score = 36.3 bits (84), Expect = 0.016
Identities = 19/104 (18%), Positives = 27/104 (25%), Gaps = 8/104 (7%)
Query: 41 QQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQ 100
+ Q +Q Q + + + P LP Q Q Q Q
Sbjct: 201 PGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPP 260
Query: 101 QLQKQSHHHHQKLSPQSQQSGNTTPPSSNIRVPAHHIKQTPARL 144
Q Q+ PQ Q PP + + A L
Sbjct: 261 QPP-----QQQQQPPQPQAQP---PPQNQPTPHPGLPQGQNAPL 296
Score = 35.5 bits (82), Expect = 0.030
Identities = 26/117 (22%), Positives = 33/117 (28%), Gaps = 6/117 (5%)
Query: 8 SQLLLQNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASS 67
SQ Q LP P QQ Q Q Q QN +
Sbjct: 224 SQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPT 283
Query: 68 S------AANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQ 118
N+P P Q Q Q Q + Q +QL + S + LS +
Sbjct: 284 PHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQEEA 340
Score = 34.4 bits (79), Expect = 0.078
Identities = 20/111 (18%), Positives = 32/111 (28%), Gaps = 3/111 (2%)
Query: 32 ELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQA 91
++ +L++ Q Q QA Q Q AA G P Q + Q
Sbjct: 152 KMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHP 211
Query: 92 VAQAAQQLQQLQKQS---HHHHQKLSPQSQQSGNTTPPSSNIRVPAHHIKQ 139
QQ Q+ +L Q P + ++P +
Sbjct: 212 EQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQP 262
Score = 32.4 bits (74), Expect = 0.31
Identities = 21/115 (18%), Positives = 24/115 (20%), Gaps = 9/115 (7%)
Query: 37 QQTLQQHQHSLQAQLQQFVLFQSASQQ---NASSSAANSPTGLPPQAQFFLQNQVQQAVA 93
Q QQ + Q L QQ +PP Q Q Q
Sbjct: 213 QVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQP 272
Query: 94 QAAQQLQQLQKQSHHHHQ------KLSPQSQQSGNTTPPSSNIRVPAHHIKQTPA 142
QA Q Q Q Q P R P +
Sbjct: 273 QAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQL 327
Score = 28.2 bits (63), Expect = 6.8
Identities = 16/100 (16%), Positives = 29/100 (29%)
Query: 29 SPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQV 88
+ S + + ++ Q + + + A S AQ L +
Sbjct: 99 TFSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEE 158
Query: 89 QQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSS 128
+A Q QQ QL + + P Q + P
Sbjct: 159 VEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPE 198
>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3. Aip3p/Bud6p is a
regulator of cell and cytoskeletal polarity in
Saccharomyces cerevisiae that was previously identified
as an actin-interacting protein. Actin-interacting
protein 3 (Aip3p) localizes at the cell cortex where
cytoskeleton assembly must be achieved to execute
polarized cell growth, and deletion of AIP3 causes gross
defects in cell and cytoskeletal polarity. Aip3p
localization is mediated by the secretory pathway,
mutations in early- or late-acting components of the
secretory apparatus lead to Aip3p mislocalization.
Length = 426
Score = 36.2 bits (84), Expect = 0.018
Identities = 23/107 (21%), Positives = 43/107 (40%), Gaps = 10/107 (9%)
Query: 21 PLPNLSSLSPSELSTLQQTLQQHQHSLQAQ---LQQFVLFQSASQQNASSSAANSPTGL- 76
L + E+ TLQ L + + +L++Q +Q+ Q S N + AA+ L
Sbjct: 75 ELDEVKKHIDDEIDTLQNELDEVKQALESQREAIQRLKERQQNSAANIARPAASPSPVLA 134
Query: 77 PPQAQFFLQNQVQQAVAQAAQQLQQLQ------KQSHHHHQKLSPQS 117
+ L N + + +L+ LQ +Q+H+ +S
Sbjct: 135 SSSSAISLANNPDKLNKEQRAELKSLQRELAVLRQTHNSFFTEIKES 181
>gnl|CDD|222221 pfam13560, HTH_31, Helix-turn-helix domain. This domain is a
helix-turn-helix domain that probably binds to DNA.
Length = 63
Score = 31.7 bits (73), Expect = 0.042
Identities = 10/41 (24%), Positives = 20/41 (48%), Gaps = 6/41 (14%)
Query: 158 QFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFE 198
+ ++ R + G TQ ++ +G S++T+SR E
Sbjct: 1 ELGARLRRLRERAGLTQAELAERLG------VSRSTLSRIE 35
>gnl|CDD|238045 cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins.
Prokaryotic DNA binding proteins belonging to the
xenobiotic response element family of transcriptional
regulators.
Length = 58
Score = 31.0 bits (71), Expect = 0.076
Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 8/53 (15%)
Query: 160 AKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEA--LNLSFKNMCKL 210
+ K+ R + G TQ ++ +G S++TISR E N S + + KL
Sbjct: 1 GERLKELRKEKGLTQEELAEKLG------VSRSTISRIENGKRNPSLETLEKL 47
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 34.3 bits (78), Expect = 0.090
Identities = 34/146 (23%), Positives = 54/146 (36%), Gaps = 15/146 (10%)
Query: 29 SPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQV 88
P + QQ QQ Q + Q Q Q + Q P PQ Q Q
Sbjct: 754 QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQP-QYQQPQQPVAPQPQYQQPQQPVA 812
Query: 89 QQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQS---------GNTTP---PSSNIRVPAHH 136
Q Q QQ Q Q Q ++PQ Q + G++ P P++ + P+
Sbjct: 813 PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPL--PSLD 870
Query: 137 IKQTPARLLDPDETTDLEELEQFAKT 162
+ P ++P +T LE++ + +
Sbjct: 871 LLTPPPSEVEPVDTFALEQMARLVEA 896
Score = 32.4 bits (73), Expect = 0.33
Identities = 26/105 (24%), Positives = 31/105 (29%), Gaps = 6/105 (5%)
Query: 23 PNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQF 82
P+ +P Q Q QQ V Q QQ P PQ Q
Sbjct: 740 PHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQ------PQQPVAPQPQYQQ 793
Query: 83 FLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPS 127
Q Q Q QQ Q Q Q ++PQ Q P +
Sbjct: 794 PQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA 838
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 34.0 bits (78), Expect = 0.097
Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 38 QTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQ 97
L+Q + +L+ Q QQ + + Q + + P GL Q+QQ +AQ AQ
Sbjct: 780 TQLEQLKQNLENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTVT---VEQIQQELAQLAQ 836
Query: 98 QLQQ 101
QL++
Sbjct: 837 QLRE 840
Score = 32.5 bits (74), Expect = 0.29
Identities = 28/93 (30%), Positives = 33/93 (35%), Gaps = 7/93 (7%)
Query: 31 SELSTLQQTLQQHQHSLQAQLQQF--VLFQSASQQNASSSAA----NSPTGLPPQAQFFL 84
S+L TLQQ LQ QF L S + AA + T L Q L
Sbjct: 731 SQLQTLQQQDVLEAQRLQKAQAQFDTALQASVFDDQQAFLAALLDEETLTQLEQLKQN-L 789
Query: 85 QNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQS 117
+NQ QQA Q Q L + H L
Sbjct: 790 ENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTV 822
>gnl|CDD|205103 pfam12844, HTH_19, Helix-turn-helix domain. Members of this family
contains a DNA-binding helix-turn-helix domain.
Length = 64
Score = 30.2 bits (69), Expect = 0.14
Identities = 8/39 (20%), Positives = 16/39 (41%), Gaps = 6/39 (15%)
Query: 160 AKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFE 198
+ K+ R G +Q D+ S++ +S+ E
Sbjct: 1 GERLKELREAKGLSQEDLA------EKLGISKSYLSKIE 33
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 32.7 bits (74), Expect = 0.27
Identities = 24/120 (20%), Positives = 32/120 (26%), Gaps = 5/120 (4%)
Query: 2 GASPNASQLLLQNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSAS 61
G P+ + + S S + Q Q Q +
Sbjct: 129 GMGPHQMSRVGTMQPGGQAG-GMMQQSSGQPQSQQPNQMGPQQGQAQGQAGG----MNQG 183
Query: 62 QQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSG 121
QQ G P Q Q+QQ QQ Q Q Q +Q+ QQ G
Sbjct: 184 QQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQG 243
Score = 31.9 bits (72), Expect = 0.40
Identities = 23/72 (31%), Positives = 25/72 (34%), Gaps = 2/72 (2%)
Query: 37 QQTLQQHQHSLQAQLQQFVLFQS--ASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQ 94
QQ Q Q LQ Q QQ + Q A Q G+ PQ Q QV
Sbjct: 220 QQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQGGQMQVPMQQQP 279
Query: 95 AAQQLQQLQKQS 106
QQ QQ Q
Sbjct: 280 PQQQPQQSQLGM 291
Score = 29.6 bits (66), Expect = 2.2
Identities = 18/101 (17%), Positives = 27/101 (26%), Gaps = 4/101 (3%)
Query: 30 PSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQ----FFLQ 85
S+L L +QQ Q Q + G+ Q
Sbjct: 286 QSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKL 345
Query: 86 NQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPP 126
++ QQ QQ H Q+++ Q Q G
Sbjct: 346 RNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMVAL 386
Score = 27.7 bits (61), Expect = 8.1
Identities = 22/84 (26%), Positives = 24/84 (28%)
Query: 37 QQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAA 96
Q Q Q QQ Q Q P AQ + Q Q
Sbjct: 204 GGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQP 263
Query: 97 QQLQQLQKQSHHHHQKLSPQSQQS 120
QQ+Q Q Q Q Q QQS
Sbjct: 264 QQMQGGQMQVPMQQQPPQQQPQQS 287
>gnl|CDD|197775 smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins.
Length = 56
Score = 29.4 bits (67), Expect = 0.27
Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 6/35 (17%)
Query: 164 KQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFE 198
K+ R + G TQ ++ +G S++T+SR E
Sbjct: 3 KELREEKGLTQEELAEKLGV------SRSTLSRIE 31
>gnl|CDD|182828 PRK10907, PRK10907, intramembrane serine protease GlpG;
Provisional.
Length = 276
Score = 31.9 bits (73), Expect = 0.28
Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 23/71 (32%)
Query: 39 TLQQHQHS------------LQAQLQQFVL------FQSASQQNASSSAANSPTGLPPQA 80
T+QQH S ++A+L +F+ + +AS Q+ +++ GL +
Sbjct: 28 TIQQHNQSDIWLADESQAERVRAELARFLENPADPRYLAASWQSGHTNS-----GLRYRR 82
Query: 81 QFFLQNQVQQA 91
FL ++A
Sbjct: 83 FPFLATLRERA 93
>gnl|CDD|216879 pfam02112, PDEase_II, cAMP phosphodiesterases class-II.
Length = 323
Score = 32.1 bits (73), Expect = 0.33
Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 119 QSGNTTPPSSNIRVPAHHIKQTPARLLDPDETTDLEELEQFA 160
+ +T+ P + V H+K PA +P T LEEL Q
Sbjct: 268 LTTSTSQPLLGLNVIVTHVKSPPADPNNP-RDTILEELNQLN 308
>gnl|CDD|227863 COG5576, COG5576, Homeodomain-containing transcription factor
[Transcription].
Length = 156
Score = 31.3 bits (71), Expect = 0.34
Identities = 18/93 (19%), Positives = 32/93 (34%), Gaps = 8/93 (8%)
Query: 191 QTTISRFEALNLSFKNMCKLKPLLQKWLSDADSTTANPSCLSNPLTTPEAIGRRRKKRTS 250
+S E+ + K K + ++ + + +RR+ T
Sbjct: 6 MQPLSALESQRMPQIRTIKT----TKNKREVEAADSEMKLERKQDGSSPPKSKRRRT-TD 60
Query: 251 IETSVRVALEKAFLQNPKPTSEEISVLADNLCM 283
+ V LE+ F NP P+S L+ L M
Sbjct: 61 EQLMV---LEREFEINPYPSSITRIKLSLLLNM 90
>gnl|CDD|201759 pfam01381, HTH_3, Helix-turn-helix. This large family of DNA
binding helix-turn helix proteins includes Cro and CI.
Within the Neisseria gonorrhoeae phage associated
protein NGO0477, the full protein fold incorporates a
helix-turn-helix motif, but the function of this member
is unlikely to be that of a DNA-binding regulator, the
function of most other members, so is not necessarily
characteristic of the whole family.
Length = 55
Score = 28.3 bits (64), Expect = 0.59
Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 8/49 (16%)
Query: 164 KQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEA--LNLSFKNMCKL 210
K+ R +LG +Q ++ +G S++TIS+ E S + + KL
Sbjct: 2 KELREELGLSQEELAEKLGV------SRSTISKIENGKREPSLETLKKL 44
>gnl|CDD|234937 PRK01294, PRK01294, lipase chaperone; Provisional.
Length = 336
Score = 30.8 bits (70), Expect = 0.82
Identities = 29/96 (30%), Positives = 42/96 (43%), Gaps = 9/96 (9%)
Query: 12 LQNHQLASLPLPNLSSLSPSELSTLQQTL-QQHQHSLQAQLQQFVLFQSASQQNASSSAA 70
Q + L L + SLS ++ + L Q L+A LQ+ Q+ QQ A A+
Sbjct: 187 YQRYALERLRIAQDPSLSDAQKAARLAALEAQLPEDLRAALQESQRQQALLQQLAQLQAS 246
Query: 71 NSPTGLPPQAQFFLQNQVQQAV-AQAAQQLQQLQKQ 105
G PQ Q V +AAQ+L+QL +Q
Sbjct: 247 ----GASPQELR---LMRAQLVGPEAAQRLEQLDQQ 275
>gnl|CDD|132242 TIGR03198, pucE, xanthine dehydrogenase E subunit. This gene has
been characterized in B. subtilis as the Iron-sulfur
cluster binding-subunit of xanthine dehydrogenase
(pucE), acting in conjunction with pucC, the FAD-binding
subunit and pucD, the molybdopterin binding subunit. The
more common XDH complex (GenProp0640) includes the xdhA
gene as the Fe-S cluster binding component.
Length = 151
Score = 29.8 bits (67), Expect = 0.88
Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 1/27 (3%)
Query: 257 VALEKAFLQNPKPTSEEIS-VLADNLC 282
VAL+ F + P+P+ E++ L+ NLC
Sbjct: 107 VALKALFRETPQPSDEDMEEGLSGNLC 133
>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
SFB2/subunit SFB3 [Intracellular trafficking and
secretion].
Length = 861
Score = 30.9 bits (70), Expect = 0.97
Identities = 28/129 (21%), Positives = 44/129 (34%), Gaps = 17/129 (13%)
Query: 40 LQQHQHSLQAQLQQFV------LFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVA 93
+ QH+ + Q Q V +SA A ++ + G+PP LQ Q ++ +
Sbjct: 1 MSQHKKGVYPQAQSQVHTGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQID 60
Query: 94 QAAQQLQQLQKQS---------HHHHQKLSPQSQQSG--NTTPPSSNIRVPAHHIKQTPA 142
QAA + + QK S S T P +N VP + P
Sbjct: 61 QAATAMHNTGANNPAPSVMSPAFQSQQKFSSPYGGSMADGTAPKPTNPLVPVDLFEDQPP 120
Query: 143 RLLDPDETT 151
+ D
Sbjct: 121 PISDLFLPP 129
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 30.9 bits (70), Expect = 0.98
Identities = 19/114 (16%), Positives = 33/114 (28%), Gaps = 12/114 (10%)
Query: 58 QSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHH----HQKL 113
+ Q ++ +A +PTGL Q + +A+ L K++ QK
Sbjct: 10 TKTAVQPEATPSAGAPTGL--------QQSSESPTQRASHSLASEGKKNRKKMPKVFQKS 61
Query: 114 SPQSQQSGNTTPPSSNIRVPAHHIKQTPARLLDPDETTDLEELEQFAKTFKQRR 167
S Q + + R L D E + +R
Sbjct: 62 SAPRQIQAAPPQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPESSPSARRL 115
>gnl|CDD|213767 TIGR03070, couple_hipB, transcriptional regulator, y4mF family.
Members of this family belong to a clade of
helix-turn-helix DNA-binding proteins, among the larger
family pfam01381 (HTH_3; Helix-turn-helix). Members are
similar in sequence to the HipB protein of E. coli.
Genes for members of the seed alignment for this protein
family were found to be closely linked to genes encoding
proteins related to HipA. The HibBA operon appears to
have some features in common with toxin-antitoxin
post-segregational killing systems.
Length = 58
Score = 28.0 bits (63), Expect = 1.0
Identities = 9/26 (34%), Positives = 12/26 (46%)
Query: 157 EQFAKTFKQRRIKLGFTQGDVGLAMG 182
+Q + RR LG TQ D+ G
Sbjct: 1 KQIGILVRARRKALGLTQADLADLAG 26
>gnl|CDD|233592 TIGR01834, PHA_synth_III_E, poly(R)-hydroxyalkanoic acid synthase,
class III, PhaE subunit. This model represents the PhaE
subunit of the heterodimeric class (class III) of
polymerase for poly(R)-hydroxyalkanoic acids (PHAs),
carbon and energy storage polymers of many bacteria. The
most common PHA is polyhydroxybutyrate but about 150
different constituent hydroxyalkanoic acids (HAs) have
been identified in various species. This model must be
designated subfamily to indicate the heterogeneity of
PHAs [Cellular processes, Adaptations to atypical
conditions, Fatty acid and phospholipid metabolism,
Biosynthesis].
Length = 320
Score = 30.3 bits (68), Expect = 1.0
Identities = 22/90 (24%), Positives = 28/90 (31%), Gaps = 1/90 (1%)
Query: 18 ASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSA-ANSPTGL 76
A P P + S S QQ LQQ Q + QF + S Q A+ A L
Sbjct: 27 AQAPPPWMGSADQWFESLPQQALQQQQFYGELLKLQFEAWFSLWPQFANGFAPGAVQGYL 86
Query: 77 PPQAQFFLQNQVQQAVAQAAQQLQQLQKQS 106
Q QA+ Q +
Sbjct: 87 QLQTWIEQYEATTQALQGDMDGQGQCYIKE 116
>gnl|CDD|102369 PRK06424, PRK06424, transcription factor; Provisional.
Length = 144
Score = 29.5 bits (66), Expect = 1.1
Identities = 15/69 (21%), Positives = 27/69 (39%), Gaps = 5/69 (7%)
Query: 110 HQKLSPQSQQSGNTTPPSSNIRVPAHHI-KQTPARLLDPDETTDLEELEQFAKTFKQRRI 168
H K ++ P I VP + + + DL+ +E +A+ K R
Sbjct: 39 HNKFKE-VKEDIKVKLPEKKIIVPTYKKAYKKYKK---KASDEDLDIVEDYAELVKNARE 94
Query: 169 KLGFTQGDV 177
+L +Q D+
Sbjct: 95 RLSMSQADL 103
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 29.1 bits (66), Expect = 1.3
Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 6/82 (7%)
Query: 38 QTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQN------QVQQA 91
QQ + +L++ ++ +Q S S Q F+ Q QQ
Sbjct: 8 AEAQQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDEAIAQQQQE 67
Query: 92 VAQAAQQLQQLQKQSHHHHQKL 113
+ QA +Q++Q ++Q +Q+
Sbjct: 68 LEQAEKQVEQAREQWQEANQER 89
>gnl|CDD|225251 COG2376, DAK1, Dihydroxyacetone kinase [Carbohydrate transport and
metabolism].
Length = 323
Score = 29.9 bits (68), Expect = 1.4
Identities = 9/16 (56%), Positives = 9/16 (56%)
Query: 174 QGDVGLAMGKLYGNDF 189
G GLA G YG DF
Sbjct: 94 LGKGGLASGPNYGGDF 109
>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631). The
members of this family are sequences derived from a
group of hypothetical proteins expressed by certain
bacterial species. The region concerned is approximately
440 amino acid residues in length.
Length = 729
Score = 30.3 bits (69), Expect = 1.4
Identities = 22/115 (19%), Positives = 34/115 (29%), Gaps = 12/115 (10%)
Query: 18 ASLPLPNLSSLSPSELSTLQQTLQQHQHSLQA--QLQQFVLFQSASQQNASSSAANSPTG 75
S P S S + ++ Q + + A L SA+ + A S
Sbjct: 233 RSQAPPAASGPSTDDAASGQGGQESAEALFAALQGLLAQARSSSAAPAAPAGQAPGSAQP 292
Query: 76 LPPQAQFFLQNQVQQAVAQAA----------QQLQQLQKQSHHHHQKLSPQSQQS 120
L + + +QQ A + LQQL + K QQ
Sbjct: 293 LSTRELLAALSSLQQQAPALALGEPADFDLRELLQQLLSRLSSEDGKDRALGQQD 347
Score = 27.6 bits (62), Expect = 8.2
Identities = 23/102 (22%), Positives = 37/102 (36%), Gaps = 8/102 (7%)
Query: 59 SASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQ 118
SQ ++S ++ Q + +A+ A Q L + S +P Q
Sbjct: 232 PRSQAPPAASGPSTDDAASGQGG----QESAEALFAALQGLLAQARSS--SAAPAAPAGQ 285
Query: 119 QSGNTTPPSSN--IRVPAHHIKQTPARLLDPDETTDLEELEQ 158
G+ P S+ + + +Q PA L DL EL Q
Sbjct: 286 APGSAQPLSTRELLAALSSLQQQAPALALGEPADFDLRELLQ 327
>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
recombination, and repair].
Length = 440
Score = 29.5 bits (67), Expect = 1.9
Identities = 15/88 (17%), Positives = 34/88 (38%), Gaps = 7/88 (7%)
Query: 33 LSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAV 92
LS QQ L + L+ L+ + + + + + Q LQ ++
Sbjct: 308 LSEQQQRLDELAIRLRRALENQLALKKQRLERLTQRLNPQIQ----RQQQRLQQLERRLD 363
Query: 93 AQAAQQLQQLQKQSHHHHQKL---SPQS 117
+QL++ +++ ++L SP +
Sbjct: 364 KALRRQLKRKRERLEALVEQLESLSPLA 391
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 29.6 bits (66), Expect = 1.9
Identities = 19/79 (24%), Positives = 29/79 (36%), Gaps = 8/79 (10%)
Query: 41 QQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQ 100
+H+H+ + Q V Q+ +QQ + A Q Q VQ A Q
Sbjct: 195 HRHRHAPKPTQQPTV--QNPAQQPTVQNPA------QQPQQQPQQQPVQPAQQPTPQNPA 246
Query: 101 QLQKQSHHHHQKLSPQSQQ 119
Q Q+ H++ Q Q
Sbjct: 247 QQPPQTEQGHKRSREQGNQ 265
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 29.5 bits (66), Expect = 2.3
Identities = 27/140 (19%), Positives = 48/140 (34%), Gaps = 19/140 (13%)
Query: 38 QTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSP-------TGLPPQAQFFLQNQVQQ 90
T H+ +A + + S + + + T P ++F L +Q +
Sbjct: 476 VTSSGHEAVPKAVQSLLLGATNDSPIPSEFTILDRKSELSIERTVKPVSSEFGLPSQRED 535
Query: 91 AVAQAAQQLQQLQKQSHH------------HHQKLSPQSQQSGNTTPPSSNIRVPAHHIK 138
+A + Q LQ S H K + + S T S N +P+
Sbjct: 536 KLAIPTEGTQNLQGISDHPEQFEFQNELSPLETKNNESNLSSDAETEGSPNPEMPSWSSV 595
Query: 139 QTPARLLDPDETTDLEELEQ 158
P+ LD ETT+ ++
Sbjct: 596 TVPSEALDNYETTNPSNAKE 615
>gnl|CDD|99778 cd00322, FNR_like, Ferredoxin reductase (FNR), an FAD and NAD(P)
binding protein, was intially identified as a
chloroplast reductase activity, catalyzing the electron
transfer from reduced iron-sulfur protein ferredoxin to
NADP+ as the final step in the electron transport
mechanism of photosystem I. FNR transfers electrons from
reduced ferredoxin to FAD (forming FADH2 via a
semiquinone intermediate) and then transfers a hydride
ion to convert NADP+ to NADPH. FNR has since been shown
to utilize a variety of electron acceptors and donors
and has a variety of physiological functions including
nitrogen assimilation, dinitrogen fixation, steroid
hydroxylation, fatty acid metabolism, oxygenase
activity, and methane assimilation in many organisms.
FNR has an NAD(P)-binding sub-domain of the alpha/beta
class and a discrete (usually N-terminal) flavin
sub-domain which vary in orientation with respect to the
NAD(P) binding domain. The N-terminal moeity may contain
a flavin prosthetic group (as in flavoenzymes) or use
flavin as a substrate. Because flavins such as FAD can
exist in oxidized, semiquinone (one- electron reduced),
or fully reduced hydroquinone forms, FNR can interact
with one and 2 electron carriers. FNR has a strong
preference for NADP(H) vs NAD(H).
Length = 223
Score = 29.0 bits (65), Expect = 2.4
Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 4/57 (7%)
Query: 124 TPPSSNIRVPAHHIKQTPARLL----DPDETTDLEELEQFAKTFKQRRIKLGFTQGD 176
TP S +R A LL P + L+ELE+ AK R+ L ++
Sbjct: 110 TPFRSMLRHLAADKPGGEITLLYGARTPADLLFLDELEELAKEGPNFRLVLALSRES 166
>gnl|CDD|239723 cd03754, proteasome_alpha_type_6, proteasome_alpha_type_6. The 20S
proteasome, multisubunit proteolytic complex, is the
central enzyme of nonlysosomal protein degradation in
both the cytosol and nucleus. It is composed of 28
subunits arranged as four homoheptameric rings that
stack on top of one another forming an elongated
alpha-beta-beta-alpha cylinder with a central cavity.
The proteasome alpha and beta subunits are members of
the N-terminal nucleophile (Ntn)-hydrolase superfamily.
Their N-terminal threonine residues are exposed as a
nucleophile in peptide bond hydrolysis. Mammals have 7
alpha and 7 beta proteasome subunits while archaea have
one of each.
Length = 215
Score = 28.7 bits (65), Expect = 2.6
Identities = 18/84 (21%), Positives = 34/84 (40%), Gaps = 18/84 (21%)
Query: 138 KQTPARLLDPDETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRF 197
K+ P +L+DP T L + ++G M + + SQ +R+
Sbjct: 47 KKVPDKLIDPSTVTHLFRITD-----------------EIGCVMTGMIADSRSQVQRARY 89
Query: 198 EALNLSFKNMCKLKP-LLQKWLSD 220
EA +K ++ +L K ++D
Sbjct: 90 EAAEFKYKYGYEMPVDVLAKRIAD 113
>gnl|CDD|182947 PRK11073, glnL, nitrogen regulation protein NR(II); Provisional.
Length = 348
Score = 29.3 bits (66), Expect = 2.7
Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 3 ASPNASQLLLQNHQ-LASLPLPNLSSLSPSELSTLQQTLQQHQ 44
A+P A QLL Q+ + L PLP L S + ++++LQ Q
Sbjct: 32 ANPAAQQLLAQSSRKLFGTPLPELLSYFSLNIELMRESLQAGQ 74
>gnl|CDD|217789 pfam03915, AIP3, Actin interacting protein 3.
Length = 424
Score = 29.2 bits (66), Expect = 2.8
Identities = 18/92 (19%), Positives = 34/92 (36%), Gaps = 13/92 (14%)
Query: 32 ELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQA 91
++ +L+Q L Q ++ +L S QQN + A PP + +
Sbjct: 93 KIKSLKQALDDQQSTILQRL-------SELQQNTAKEVARPAAAPPPSSLPNSSSSPDSI 145
Query: 92 VAQAAQQLQQLQ------KQSHHHHQKLSPQS 117
++LQ L+ +Q+H + S
Sbjct: 146 NPAGLKELQSLRRELAVLRQTHSDFKSEIQAS 177
>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
dihydrolipoamide succinyltransferase (E2 component).
This model describes the TCA cycle 2-oxoglutarate system
E2 component, dihydrolipoamide succinyltransferase. It
is closely related to the pyruvate dehydrogenase E2
component, dihydrolipoamide acetyltransferase. The seed
for this model includes mitochondrial and Gram-negative
bacterial forms. Mycobacterial candidates are highly
derived, differ in having and extra copy of the
lipoyl-binding domain at the N-terminus. They score
below the trusted cutoff, but above the noise cutoff and
above all examples of dihydrolipoamide acetyltransferase
[Energy metabolism, TCA cycle].
Length = 403
Score = 28.9 bits (65), Expect = 3.4
Identities = 26/139 (18%), Positives = 46/139 (33%), Gaps = 29/139 (20%)
Query: 60 ASQQNASSSAANSPTGLPPQAQFFL-QNQVQQAVAQAAQQLQQLQKQS--HHHHQKLSPQ 116
A+ A+ +AA + L P A+ ++ + + ++ K+ S Q
Sbjct: 97 AASAAAAPTAAANRPSLSPAARRLAKEHGIDLSAVPGTGVTGRVTKEDIIKKTEAPASAQ 156
Query: 117 SQQSGNTTP-----PSSNIRVPAHHIKQTPA-RLLDPDETT------------DLEEL-- 156
RV ++Q A RL + +T + EL
Sbjct: 157 QPAPAAAAKAPANFTRPEERVKMTRLRQRIAERLKEAQNSTAMLTTFNEVDMSAVMELRK 216
Query: 157 ---EQFAKTFKQRRIKLGF 172
E+F K+ +KLGF
Sbjct: 217 RYKEEF---EKKHGVKLGF 232
>gnl|CDD|132237 TIGR03193, 4hydroxCoAred, 4-hydroxybenzoyl-CoA reductase, gamma
subunit. 4-hydroxybenzoyl-CoA reductase converts
4-hydroxybenzoyl-CoA to benzoyl-CoA, a common
intermediate in the degradation of aromatic compounds.
This protein family represents the gamma chain of this
three-subunit enzyme.
Length = 148
Score = 27.9 bits (62), Expect = 3.7
Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 257 VALEKAFLQNPKPTSEEI-SVLADNLC 282
+A E +NP P+ +EI + LA NLC
Sbjct: 106 MAAEALLRRNPSPSRDEIRAALAGNLC 132
>gnl|CDD|168934 PRK07394, PRK07394, hypothetical protein; Provisional.
Length = 342
Score = 28.3 bits (64), Expect = 4.2
Identities = 15/76 (19%), Positives = 34/76 (44%), Gaps = 10/76 (13%)
Query: 32 ELSTLQQTLQQHQHSLQAQ---LQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQV 88
+ ++ L+Q Q +L + L Q +++ + +++ G+ +A+ L +
Sbjct: 274 PWESTEEWLEQAQAALNGEPGPLTQALIWNGGFYLWRAGISSSLEEGIE-KAEELLNSG- 331
Query: 89 QQAVAQAAQQLQQLQK 104
+A Q+LQQL
Sbjct: 332 -----KALQKLQQLIA 342
>gnl|CDD|216112 pfam00773, RNB, RNB domain. This domain is the catalytic domain of
ribonuclease II.
Length = 322
Score = 28.4 bits (64), Expect = 4.8
Identities = 9/38 (23%), Positives = 17/38 (44%)
Query: 134 AHHIKQTPARLLDPDETTDLEELEQFAKTFKQRRIKLG 171
+ + + DLE L + AK +++R+K G
Sbjct: 127 VDELLEGKPSEREAPVADDLELLYELAKALRKKRLKRG 164
>gnl|CDD|221084 pfam11336, DUF3138, Protein of unknown function (DUF3138). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 514
Score = 28.3 bits (63), Expect = 4.9
Identities = 23/87 (26%), Positives = 38/87 (43%), Gaps = 7/87 (8%)
Query: 17 LASLPLPNLSSLSPSELSTLQ---QTLQQHQHSLQAQL--QQFVLFQSASQQNASSSAAN 71
+LP ++ S++ LQ LQQ + L+A L + A Q+A+++AA
Sbjct: 11 AGALPGAAAAASDASQIKALQAQLTALQQQVNELRAALAAKPAAAGGGAKIQSAAAAAAA 70
Query: 72 SPTGLPPQAQFFLQNQVQQAVAQAAQQ 98
+P+ A + V Q Q A Q
Sbjct: 71 APSSDAAAA--LTNDDVTQMREQIANQ 95
>gnl|CDD|147759 pfam05782, ECM1, Extracellular matrix protein 1 (ECM1). This
family consists of several eukaryotic extracellular
matrix protein 1 (ECM1) sequences. ECM1 has been shown
to regulate endochondral bone formation, stimulate the
proliferation of endothelial cells and induce
angiogenesis. Mutations in the ECM1 gene can cause
lipoid proteinosis, a disorder which causes generalised
thickening of skin, mucosae and certain viscera.
Classical features include beaded eyelid papules and
laryngeal infiltration leading to hoarseness.
Length = 419
Score = 28.4 bits (63), Expect = 5.4
Identities = 18/86 (20%), Positives = 28/86 (32%)
Query: 58 QSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQS 117
QS Q S P ++ AV Q A LQ+ ++ +
Sbjct: 76 QSEVQPPPSPEDIPVYEEDWPTFLNPNVDKAGPAVPQEAIPLQKELPPPQVPIEQKEVKP 135
Query: 118 QQSGNTTPPSSNIRVPAHHIKQTPAR 143
+ +PP PA H +Q +R
Sbjct: 136 APLADQSPPEPESWNPAQHCQQGRSR 161
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 28.4 bits (64), Expect = 6.2
Identities = 16/91 (17%), Positives = 27/91 (29%), Gaps = 5/91 (5%)
Query: 31 SELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQ 90
LS L+Q L+Q Q + L +F + + L V +
Sbjct: 520 MRLSELEQRLRQQQ-RAERLLAEFC--KRLGKNLDDEDELEQLQEELEARLESLSESVSE 576
Query: 91 AVAQAAQQLQQLQ--KQSHHHHQKLSPQSQQ 119
A + QQL+ + +P
Sbjct: 577 ARERRMALRQQLEQLQARIQRLAARAPAWLA 607
Score = 27.6 bits (62), Expect = 9.7
Identities = 20/89 (22%), Positives = 39/89 (43%), Gaps = 2/89 (2%)
Query: 30 PSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQ 89
P + L+Q QQ Q + + QQ Q+ A S ++ L + L +++
Sbjct: 934 PEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRAHFSYEDAAEMLAKNSD--LNEKLR 991
Query: 90 QAVAQAAQQLQQLQKQSHHHHQKLSPQSQ 118
Q + QA Q+ + ++Q +L+ +Q
Sbjct: 992 QRLEQAEQERTRAREQLRQAQAQLAQYNQ 1020
>gnl|CDD|234630 PRK00095, mutL, DNA mismatch repair protein; Reviewed.
Length = 617
Score = 27.9 bits (63), Expect = 6.7
Identities = 11/103 (10%), Positives = 30/103 (29%), Gaps = 2/103 (1%)
Query: 34 STLQQTLQQHQHSLQAQLQQFVL--FQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQA 91
+Q+ L Q A VL + + S + P + + +
Sbjct: 319 QAIQEALAQSGLIPAAAGANQVLEPAEPEPLPLQQTPLYASGSSPPASSPSSAPPEQSEE 378
Query: 92 VAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSNIRVPA 134
+ + + + + + + S ++ P ++ A
Sbjct: 379 SQEESSAEKNPLQPNASQSEAAAAASAEAAAAAPAAAPEPAEA 421
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have
LPXTG C-terminal anchoring motifs and a substantial
number have the KxYKxGKxW putative sorting signal at the
N-terminus. The tetracycline resistance plasmid pCF10 in
Enterococcus faecalis promotes conjugal plasmid transfer
in response to sex pheromones, but PgrA/Sec10 encoded by
that plasmid, a member of this family, specifically
inhibits the ability of cells to receive homologous
plasmids. The phenomenon is called surface exclusion.
Length = 356
Score = 27.8 bits (62), Expect = 7.1
Identities = 18/88 (20%), Positives = 28/88 (31%), Gaps = 7/88 (7%)
Query: 24 NLSSLSPSELSTLQQTLQQHQHSLQA------QLQQFVLFQSASQQNASSSAANSPTGLP 77
+P+ L LQ L Q L A Q + + A ++ A + L
Sbjct: 247 TPIPNTPNSLKALQAALATAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKELA 306
Query: 78 PQAQFFLQNQVQQAVAQAAQQLQQLQKQ 105
AQ Q +A A L + +
Sbjct: 307 -NAQAQALQTAQNNLATAQAALANAEAR 333
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 27.8 bits (62), Expect = 7.6
Identities = 25/172 (14%), Positives = 35/172 (20%)
Query: 13 QNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANS 72
+A LP P L L QQ ++ A + +
Sbjct: 55 APPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQT 114
Query: 73 PTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSNIRV 132
G Q + A + Q QQ Q Q N
Sbjct: 115 EPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDES 174
Query: 133 PAHHIKQTPARLLDPDETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKL 184
D D E ++ + G AM L
Sbjct: 175 QLQQQPNGETPPQQTDGAGDDESEALVRLREADGTLEQRIKGAEGGGAMKVL 226
>gnl|CDD|204415 pfam10212, TTKRSYEDQ, Predicted coiled-coil domain-containing
protein. This is the C-terminal 500 amino acids of a
family of proteins with a predicted coiled-coil domain
conserved from nematodes to humans. It carries a
characteristic TTKRSYEDQ sequence-motif. The function is
not known.
Length = 518
Score = 27.6 bits (61), Expect = 8.4
Identities = 20/101 (19%), Positives = 39/101 (38%), Gaps = 5/101 (4%)
Query: 20 LPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQF--VLFQSASQQNASSSAANSPTGLP 77
LP S E S L+ L +++F V + + + + + P GL
Sbjct: 103 LPYQLKSLEEECESSLCTAALRARNQELHNDMKKFTAVFEKLQTYVSLLALPSTKPDGLL 162
Query: 78 PQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQ 118
+ Q+ A+ +++L K H++QK S + +
Sbjct: 163 RTNYSTVFTQLAAALHGLHDAVKELSK---HYNQKASLEQE 200
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 27.5 bits (62), Expect = 8.6
Identities = 13/59 (22%), Positives = 20/59 (33%)
Query: 48 QAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQS 106
A Q +A + + P P QA + + A QQLQ+ Q +
Sbjct: 378 AASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGAT 436
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 27.6 bits (61), Expect = 9.0
Identities = 18/96 (18%), Positives = 38/96 (39%), Gaps = 4/96 (4%)
Query: 10 LLLQNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQA--QLQQFVLFQSASQQNASS 67
+ N + + P +PS + L+Q +++ Q LQ Q + + Q + +
Sbjct: 386 GMSANGTVENCI-PENKISTPSAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLER 444
Query: 68 SAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQ 103
S + L + LQ ++ V+ + Q +Q
Sbjct: 445 SLKSDLGQL-KKENDMLQTKLNSMVSAKQKDKQSMQ 479
>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory
protein. This family represents an accessory protein
that works with the bacteriocin maturation and ABC
transport secretion protein described by TIGR01193
[Transport and binding proteins, Other].
Length = 457
Score = 27.4 bits (61), Expect = 9.1
Identities = 25/110 (22%), Positives = 39/110 (35%), Gaps = 25/110 (22%)
Query: 34 STLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQF----------- 82
+ L Q +LQ + ++A + N P Q+ +
Sbjct: 171 EKTKAQLDQQISKTDQKLQDYQALKNAISNGTKVANFN-----PYQSLYENYQAQLKSAS 225
Query: 83 --FLQNQV-QQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSN 129
+NQV +A QQ+ QLQK S Q Q++G T +SN
Sbjct: 226 DKDQKNQVKSTILATIQQQIDQLQKSIA------SYQVQKAGLTKSTASN 269
>gnl|CDD|234137 TIGR03176, AllC, allantoate amidohydrolase. This enzyme catalyzes
the breakdown of allantoate , first to ureidoglycine by
hydrolysis and then decarboxylation of one of the two
equivalent ureido groups. Ureidoglycine then
spontaneously exchanges ammonia for water resulting in
ureidoglycolate. This enzyme is an alternative to
allantoicase (3.5.3.4) which releases urea [Central
intermediary metabolism, Nitrogen metabolism].
Length = 406
Score = 27.5 bits (61), Expect = 9.5
Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 143 RLLDPDETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTI 194
RLL E L +QF K + ++ F DVG G+L G +F + TI
Sbjct: 25 RLLYSPEW--LAAQQQFKKRMAESGLETRFD--DVGNLYGRLVGTEFPEETI 72
>gnl|CDD|224314 COG1396, HipB, Predicted transcriptional regulators
[Transcription].
Length = 120
Score = 26.5 bits (57), Expect = 9.9
Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 6/41 (14%)
Query: 158 QFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFE 198
+ K+ R K G +Q ++ +G S++TISR E
Sbjct: 1 SIGERLKELRKKKGLSQEELAERLG------VSRSTISRIE 35
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.124 0.341
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,951,064
Number of extensions: 1275146
Number of successful extensions: 2572
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2254
Number of HSP's successfully gapped: 275
Length of query: 289
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 193
Effective length of database: 6,679,618
Effective search space: 1289166274
Effective search space used: 1289166274
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.0 bits)