RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10349
(517 letters)
>gnl|CDD|235000 PRK02106, PRK02106, choline dehydrogenase; Validated.
Length = 560
Score = 337 bits (866), Expect = e-109
Identities = 185/547 (33%), Positives = 263/547 (48%), Gaps = 95/547 (17%)
Query: 1 GCVVTNRLTENPDWKVLLLEAGDEENA---LTDIPETAHY-LQFTKFNWNFTTEFQPGAC 56
GCV+ NRL+E+PD VLLLEAG + +P + LQ ++NW + TE +P
Sbjct: 17 GCVLANRLSEDPDVSVLLLEAGGPDYRWDFFIQMPAALAFPLQGKRYNWAYETEPEPH-- 74
Query: 57 RGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLE-AGNVGWGYKDVLPYFKKSE 115
+NN R P G+ +GG+S IN +Y RGN D+D W E G GW Y D LPYFKK+E
Sbjct: 75 --MNNRRMECPRGKVLGGSSSINGMVYIRGNAMDYDNWAELPGLEGWSYADCLPYFKKAE 132
Query: 116 DIDVPELKRSEYHGVGGYLNVDYS-PYKSKLMDAFLESAPEVGLNLT-DYNSPDGNVGFS 173
D E +Y G G L+V P + L AF+E+ + G T D N GF
Sbjct: 133 TRDGGE---DDYRGGDGPLSVTRGKPGTNPLFQAFVEAGVQAGYPRTDDLNGYQQE-GFG 188
Query: 174 RIQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKR 233
+ T+ GRR+SA++A+L P ++RPN ++ A ++L + KR GVE+ + +
Sbjct: 189 PMDRTVTNGRRWSAARAYLDPALKRPNLTIVTHALTDRILFEG--KRAVGVEYERGGGRE 246
Query: 234 VVYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADLQ-VGENLQEHPAFASLA 292
A++EV+LSAGA SP LL LSGIGP E L++ IPV+ DL VGENLQ+H L
Sbjct: 247 TARARREVILSAGAINSPQLLQLSGIGPAEHLKELGIPVVHDLPGVGENLQDH-----LE 301
Query: 293 FTVNQKVGL-VSDRIFSNLA---KETIKA---FTNQGWTTTLGCEGLGYVRTKYNNYPPG 345
+ + VS ++ L K I A FT G + E G++R++ G
Sbjct: 302 VYIQYECKQPVS--LYPALKWWNKPKIGAEWLFTGTGLGASNHFEAGGFIRSR-----AG 354
Query: 346 V--PDIEYIFVPASL-----AIEEEKG-----GSLLRKTMGIPDRTFKELFRDVKNKDAW 393
V P+I+Y F+P ++ + G G
Sbjct: 355 VDWPNIQYHFLPVAIRYDGSNAVKGHGFQAHVGP-------------------------- 388
Query: 394 SIWPMLMYPESRGYVRLKSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQK 453
M SRG V+LKSADP +P++ N+ D + I++ E+ A
Sbjct: 389 ------MRSPSRGSVKLKSADPRAHPSILFNYMSTEQDWREFRDAIRLTREIMAQPALDP 442
Query: 454 YKSKLSTRILPACKKHKYGSD---DYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPR 510
Y+ + I P +D D + VR+ +H TCKMG D AVVDP
Sbjct: 443 YRGR---EISPGADVQ---TDEEIDAF---VREHAETAYHPSCTCKMGT--DPMAVVDPE 491
Query: 511 LRVHGIK 517
RVHG++
Sbjct: 492 GRVHGVE 498
>gnl|CDD|225186 COG2303, BetA, Choline dehydrogenase and related flavoproteins
[Amino acid transport and metabolism].
Length = 542
Score = 298 bits (765), Expect = 5e-95
Identities = 170/529 (32%), Positives = 241/529 (45%), Gaps = 61/529 (11%)
Query: 1 GCVVTNRLTENPDWKVLLLEAGDEENALTDIPETAHY-------LQFTKFNWNFTTEFQP 53
G V+ RL++ VL+LEAG D + +++W F TE +P
Sbjct: 19 GSVLAARLSD-AGLSVLVLEAGGP-----DRRPLIQMPAAYAFLMNGPRYDWGFRTEPEP 72
Query: 54 GACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEA-GNVGWGYKDVLPYFK 112
L WP G+ +GG+S IN +Y RG+P DFD W + G GW Y DVLPYFK
Sbjct: 73 H----LRGRELAWPRGKVLGGSSSINGMVYVRGHPEDFDAWAQESGAPGWPYDDVLPYFK 128
Query: 113 KSED-IDVPELKRSEYHGVGGYLNVDYSPYKSKLMDAFLESAPEVGLNLT-DYNSPDGNV 170
++ED + V +HG GG L V + + AF+E+ ++G T D N D
Sbjct: 129 RAEDLLGVGGQDLRTWHGGGGPLPVSPPRSPNPIARAFIEAGEQLGFPTTPDPNGADQE- 187
Query: 171 GFSRIQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNN 230
GF TI GRR+SA++A+L+P ++RPN ++ ARV ++L++ + VE
Sbjct: 188 GFGPYCVTICNGRRWSAARAYLKPALKRPNLTLLTGARVRRILLEGDRAVGVEVEIGDGG 247
Query: 231 KKRVVYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADL-QVGENLQEHPAFA 289
A +EVVL+AGA SP LLLLSGIGP + L + I V+ L VG+NLQ+H
Sbjct: 248 TIETAVAAREVVLAAGAINSPKLLLLSGIGPADHLLEHGIDVVGRLPGVGQNLQDHLEIY 307
Query: 290 SLAFTVNQKVGLVSDRIFSNLAKETI-KAFTNQGWTTTLGCEGLGYVRTKYNNYPPG-VP 347
+AF + +FS L T G T EG G+VR+ P G P
Sbjct: 308 -VAFEATEPTNDSVLSLFSKLGIGADRYLLTRDGPGATNHFEG-GFVRSG----PAGEYP 361
Query: 348 DIEYIFVPASLAIEEEKGGSLLRKTMGIPDRTFKELFRDVKNKDAWSIWPMLMYPESRGY 407
D +Y F P LAI R + +++ M P+SRG
Sbjct: 362 DGQYHFAPLPLAI------------------------RAAGAEHGFTLHVGPMRPKSRGS 397
Query: 408 VRLKSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKYKSKLSTRILPACK 467
V L+S DP P + N+ D GI++ E+ A + P
Sbjct: 398 VTLRSPDPDNRPVIDPNYLSAEGDRAIFRAGIRLTREIIGQPALDAR---RKAELAPGP- 453
Query: 468 KHKYGSDDYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPRLRVHGI 516
+ +D+ +R L +H GTC+MG V DP LRVHG+
Sbjct: 454 --RVTTDEDISAAIRFLARTAYHPMGTCRMG-SDPAAVVDDPYLRVHGL 499
>gnl|CDD|233581 TIGR01810, betA, choline dehydrogenase. Choline dehydrogenase
catalyzes the conversion of exogenously supplied choline
into the intermediate glycine betaine aldehyde, as part
of a two-step oxidative reaction leading to the
formation of osmoprotectant betaine. This enzymatic
system can be found in both gram-positive and
gram-negative bacteria. As in Escherichia coli ,
Staphylococcus xylosus , and Sinorhizobium meliloti,
this enzyme is found associated in a transciptionally
co-induced gene cluster with betaine aldehyde
dehydrogenase, the second catalytic enzyme in this
reaction. Other gram-positive organisms have been shown
to employ a different enzymatic system, utlizing a
soluable choline oxidase or type III alcohol
dehydrogenase instead of choline dehydrogenase. This
enzyme is a member of the GMC oxidoreductase family
(pfam00732 and pfam05199), sharing a common
evoluntionary origin and enzymatic reaction with alcohol
dehydrogenase. Outgrouping from this model, Caulobacter
crescentus shares sequence homology with choline
dehydrogenase, yet other genes participating in this
enzymatic reaction have not currently been identified
[Cellular processes, Adaptations to atypical
conditions].
Length = 532
Score = 277 bits (711), Expect = 3e-87
Identities = 176/526 (33%), Positives = 245/526 (46%), Gaps = 54/526 (10%)
Query: 1 GCVVTNRLTENPDWKVLLLEAGDEE---NALTDIPETAHY-LQFTKFNWNFTTEFQPGAC 56
G V+ RL+E+ VL+LEAG + + L +P Y ++NW + TE +P
Sbjct: 11 GSVLAGRLSEDVSNSVLVLEAGGSDYPWDLLIQMPAALAYPAGNKRYNWIYETEPEPH-- 68
Query: 57 RGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEA-GNVGWGYKDVLPYFKKSE 115
+NN R G+ +GG+S IN IY RGNP D+++W + G W Y D LPY+K+ E
Sbjct: 69 --MNNRRVGHARGKVLGGSSSINGMIYQRGNPMDYEKWAKPEGMESWDYADCLPYYKRLE 126
Query: 116 DIDVPELKRSEYHGVGGYLNVDYSPYKSKLMDAFLESAPEVGLNLT-DYNSPDGNVGFSR 174
E Y G G + V P + L AF+E+ E G N T D N GF
Sbjct: 127 TTFGGE---KPYRGHDGPIKVRRGPADNPLFQAFIEAGVEAGYNKTPDVNGFRQE-GFGP 182
Query: 175 IQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKRV 234
+ T+ GRR SA++A+L P ++RPN V +A V K+ + R GVEF K +K
Sbjct: 183 MDSTVHNGRRVSAARAYLHPAMKRPNLEVQTRAFVTKINFE--GNRATGVEFKKGGRKEH 240
Query: 235 VYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADLQ-VGENLQEHPAFASLAF 293
A KEV+LSAGA SP LL LSGIG E L + I L VGENLQ+H +
Sbjct: 241 TEANKEVILSAGAINSPQLLQLSGIGDAEHLRELGIEPRIHLPGVGENLQDHLEV-YVQH 299
Query: 294 TVNQKVGLVSDRIFSNLAKETIK-AFTNQGWTTTLGCEGLGYVRTKYNNYPPGV--PDIE 350
Q V L + + F +G + EG G+VR+ V P+I+
Sbjct: 300 ACKQPVSLYPSLNWLKQPFIGAQWLFGRKGAGASNHFEGGGFVRSN-----DDVDYPNIQ 354
Query: 351 YIFVPASLAIEEEKGGSLLRKTMGIPDRTFKELFRDVKNKDAWSIWPMLMYPESRGYVRL 410
Y F+P ++ + G+ K G + + MY SRG+V++
Sbjct: 355 YHFLPVAIRYD----GTKAPKAHG------------------FQVHVGPMYSNSRGHVKI 392
Query: 411 KSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKYKSKLSTRILPACKKHK 470
KS DP P + N+ D E I++ E+ K A Y+ I P +
Sbjct: 393 KSKDPFEKPEIVFNYMSHEEDWREFREAIRVTREILKQKALDPYRGG---EISPGPEVQT 449
Query: 471 YGSDDYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPRLRVHGI 516
D + VR+ H CGTCKMGP D +VVDP RVHG+
Sbjct: 450 DEEIDEF---VRRHGETALHPCGTCKMGPASDEMSVVDPETRVHGM 492
>gnl|CDD|234421 TIGR03970, Rv0697, dehydrogenase, Rv0697 family. This model
describes a set of dehydrogenases belonging to the
glucose-methanol-choline oxidoreductase (GMC
oxidoreductase) family. Members of the present family
are restricted to Actinobacterial genome contexts
containing also members of families TIGR03962 and
TIGR03969 (the mycofactocin system), and are proposed to
be uniform in function.
Length = 487
Score = 220 bits (563), Expect = 6e-66
Identities = 152/526 (28%), Positives = 213/526 (40%), Gaps = 95/526 (18%)
Query: 1 GCVVTNRLTENPDWKVLLLEAGDEENALTDIP-ETAHYLQF-----TKFNWNFTTEFQPG 54
GCVV RL+E+P V +LEAG + +P + L+ + W + E G
Sbjct: 12 GCVVAARLSEDPSCTVTVLEAGPGYRDPSRLPAQLTDGLRLPIGPASPVVWRYGVELTDG 71
Query: 55 ACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEAGNVGWGYKDVLPYFKKS 114
R R GR +GG+ +N + R P DFD W GW + DVLP+F+
Sbjct: 72 PRRASAIVR-----GRVLGGSGAVNGGYFCRALPADFDAW---PIPGWSWDDVLPHFRAI 123
Query: 115 E-DIDVPELKRSEYHGVGGYLNVDYSPYKSKLMDAFLESAPEVGL-NLTDYNS--PDGNV 170
E D+D HG G + V + + AF+ +A G + D N PD
Sbjct: 124 ETDLDF----DGPLHGTAGPIPVRRTAELDGISAAFVAAALGAGFGWIADLNGSGPDAPT 179
Query: 171 GFSRIQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNN 230
G + + GRR S + A+L P ++RPN V RV+++L R GVE + +
Sbjct: 180 GVGAVPLNVDGGRRVSTAVAYLLPALKRPNLTVEADTRVVRILFSGT--RAVGVEVLGDG 237
Query: 231 KKRVVYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADLQVGENLQEHPAFAS 290
R + A + VVL AGA S HLLLLSGIGP EQL I V+ DL VG + +HP +
Sbjct: 238 GPRTLRADR-VVLCAGAVESAHLLLLSGIGPAEQLRAAGIAVVLDLPVGSDFVDHPEW-- 294
Query: 291 LAFTVNQKVGLVSDRIFSNLAKETIKAFTNQGWTTTLGCEGLGYVRTKYNNYPPGVPDIE 350
L Y ++ P P +E
Sbjct: 295 ----------------------------------------VLPYRWRPTHDRPDTSPVLE 314
Query: 351 YIFVPASLAIEEEKGGSLLRKTMGIPDRTFKELFRDVKNKDAWSIWPMLMYPESRGYVRL 410
+ A + I G F L +D + LM P SRG +RL
Sbjct: 315 TVLNTADIEIRPYTAG-------------FTALVPGS-PRDDPHLGVALMRPHSRGRIRL 360
Query: 411 KSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKYKSKLSTRILPACKKHK 470
SADP P ++ + D + G + EL + L + PA ++
Sbjct: 361 ASADPADPPRIEHRYDSSAADRAALRAGAALAHELLGSPE-------LGPLLEPAVRE-- 411
Query: 471 YGSDDYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPRLRVHGI 516
+ W R T Q H CG+C+MG D AVVD R RV G+
Sbjct: 412 --GEASW-VLARLATSQ--HLCGSCRMGGRDDPGAVVDARCRVRGV 452
>gnl|CDD|216089 pfam00732, GMC_oxred_N, GMC oxidoreductase. This family of
proteins bind FAD as a cofactor.
Length = 218
Score = 161 bits (410), Expect = 1e-46
Identities = 79/241 (32%), Positives = 106/241 (43%), Gaps = 38/241 (15%)
Query: 54 GACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRW-LEAGNVGWGYKDVLPYFK 112
G+ +N R PAG VGG S +N + R D W E G GWGY D LPY
Sbjct: 9 GSAGSVNGRRMILPAGSTVGGGSSVNWSACIRTPAAVLDEWASEFGLEGWGYDDYLPYMD 68
Query: 113 KSEDIDVPELKRSEYHGVGGYLNV-DYSPYKSKLMDAFLESAPEVGLNLTDY--NSPDG- 168
K E G L V +S L A L++A E+G + NS
Sbjct: 69 KVE----------------GPLGVTTKGIEESPLNQALLKAAEELGYPVEAVPRNSNGCH 112
Query: 169 NVGFSRIQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEF-- 226
GF + G + S ++ +LRP +ER N ++ A+ K++I R GVE
Sbjct: 113 YCGFCGLG--CPTGAKQSTARTWLRPALER-NLRILTGAKAEKIIILGRGGRAVGVEARD 169
Query: 227 MKNNKKRVVYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADLQVGENLQEHP 286
KR++ A KEVV++AGA +P LLL SG+G VG+NLQ HP
Sbjct: 170 GGGGIKRLITAAKEVVVAAGALNTPPLLLRSGLGKNPH------------PVGKNLQLHP 217
Query: 287 A 287
Sbjct: 218 V 218
>gnl|CDD|218493 pfam05199, GMC_oxred_C, GMC oxidoreductase. This domain found
associated with pfam00732.
Length = 139
Score = 115 bits (289), Expect = 2e-30
Identities = 37/116 (31%), Positives = 53/116 (45%), Gaps = 6/116 (5%)
Query: 402 PESRGYVRLKSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKYKSKLSTR 461
P SRG V L S+DP+ P + N+ DP DL + +++ + +L+
Sbjct: 1 PRSRGRVTLSSSDPLGLPVIDPNYLSDPADLAALRAALRLARRILAAAL-----VELTPG 55
Query: 462 ILPACKKHKYGSDDYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPRLRVHGIK 517
+ SDD +R + H GTC+MG D D VVDP LRVHG+
Sbjct: 56 PVEVVPGAAEASDDEILEYIRAAAVTSWHPMGTCRMGADPD-DGVVDPDLRVHGVD 110
>gnl|CDD|215420 PLN02785, PLN02785, Protein HOTHEAD.
Length = 587
Score = 79.5 bits (196), Expect = 1e-15
Identities = 87/316 (27%), Positives = 135/316 (42%), Gaps = 84/316 (26%)
Query: 1 GCVVTNRLTENPDWKVLLLEAGDE----------EN---ALTDIPETAHYLQFTKFNWNF 47
GC + L++N + VLLLE G EN L D T+ F
Sbjct: 67 GCPLAATLSQN--FSVLLLERGGVPFGNANVSFLENFHIGLADTSPTSASQAF------I 118
Query: 48 TTEFQPGACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEAGNVGWGYKDV 107
+T+ G+ N R R +GG + IN Y+R + + GW K V
Sbjct: 119 STD-------GVINAR-----ARVLGGGTCINAGFYSRASTRFIQK------AGWDAKLV 160
Query: 108 LPYFKKSEDIDVPELKRSEYHGVGGYLNVDYSPYKSKLMDAFLESAPEVGLNLTDYNSPD 167
+ E V K +P+++ L D+ LE + ++ +N
Sbjct: 161 NESYPWVERQIVHWPK--------------VAPWQAALRDSLLE------VGVSPFN--- 197
Query: 168 GNVGFS-------RIQGTI--QFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNT 218
GF+ ++ GTI +FGRR +A A L V+ A V K++ D +
Sbjct: 198 ---GFTYDHVYGTKVGGTIFDEFGRRHTA--AELLAAGNPNKLRVLLHATVQKIVFDTSG 252
Query: 219 K--RVFGVEFM--KNNKKRVVYAKK---EVVLSAGAFFSPHLLLLSGIGPREQLEQFNIP 271
K R GV F N+ + + E++LSAGA SP +LLLSGIGP+++L++ IP
Sbjct: 253 KRPRATGVIFKDENGNQHQAFLSNNKGSEIILSAGAIGSPQMLLLSGIGPKKELKKHKIP 312
Query: 272 -VLADLQVGENLQEHP 286
VL + VG+ + ++P
Sbjct: 313 VVLHNEHVGKGMADNP 328
Score = 49.0 bits (117), Expect = 5e-06
Identities = 19/54 (35%), Positives = 31/54 (57%)
Query: 401 YPESRGYVRLKSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKY 454
P S G++ L + + P+V N+F+ P DL R V GI+ + ++ KTN F +
Sbjct: 423 GPISTGHLSLINTNVDDNPSVTFNYFKHPQDLQRCVYGIRTIEKIVKTNHFTNF 476
>gnl|CDD|180854 PRK07121, PRK07121, hypothetical protein; Validated.
Length = 492
Score = 38.7 bits (91), Expect = 0.007
Identities = 18/72 (25%), Positives = 28/72 (38%), Gaps = 4/72 (5%)
Query: 177 GTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKRVVY 236
G G L + R ++++D RV GVE + + +
Sbjct: 171 GDSGGGAML---MDPLAKRAAALGVQIRYDTRATRLIVDD-DGRVVGVEARRYGETVAIR 226
Query: 237 AKKEVVLSAGAF 248
A+K VVL+AG F
Sbjct: 227 ARKGVVLAAGGF 238
>gnl|CDD|237532 PRK13858, PRK13858, type IV secretion system T-DNA border
endonuclease VirD1; Provisional.
Length = 147
Score = 31.3 bits (71), Expect = 0.56
Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 8/73 (10%)
Query: 130 VGGYLNVDYSPYKSKLMDAFLESAPEVGLNLTD----YNSPDGNVGFSRIQGT-IQFGRR 184
+GG+L +D + M+A L+S + N+ Y + ++ I FG+
Sbjct: 61 IGGFLEID--AETREKMEAILQSIGTLSSNIAALLSAYAE-NPRPDLEALRAERIAFGKE 117
Query: 185 FSASQAFLRPIVE 197
F+ LR I+
Sbjct: 118 FADLDGLLRSILS 130
>gnl|CDD|236010 PRK07395, PRK07395, L-aspartate oxidase; Provisional.
Length = 553
Score = 32.3 bits (74), Expect = 0.73
Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 1/55 (1%)
Query: 192 LRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKRVVYAKKEVVLSAG 246
+++RPN ++ +A L + ++P T R G+ + + + A V+L+ G
Sbjct: 141 TEQVLQRPNIEIISQALALSLWLEPETGRCQGISLLYQGQITWLRAGA-VILATG 194
>gnl|CDD|181196 PRK08010, PRK08010, pyridine nucleotide-disulfide oxidoreductase;
Provisional.
Length = 441
Score = 31.9 bits (72), Expect = 0.88
Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 2/58 (3%)
Query: 172 FSRIQGTIQFGRRFSAS-QAFLRPIVERPNFHVMKKAR-VLKVLIDPNTKRVFGVEFM 227
SR+ T + R A Q P+ P VM R VLK ++D T+R+ G +
Sbjct: 338 LSRVGMTEEQARESGADIQVVTLPVAAIPRARVMNDTRGVLKAIVDNKTQRILGASLL 395
>gnl|CDD|223108 COG0029, NadB, Aspartate oxidase [Coenzyme metabolism].
Length = 518
Score = 31.8 bits (73), Expect = 0.96
Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 1/58 (1%)
Query: 189 QAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKRVVYAKKEVVLSAG 246
A L+ + RPN V++ A L ++I+ V GV + N + + K VVL+ G
Sbjct: 137 TALLKKVRNRPNITVLEGAEALDLIIEDGIG-VAGVLVLNRNGELGTFRAKAVVLATG 193
>gnl|CDD|233583 TIGR01813, flavo_cyto_c, flavocytochrome c. This model describes a
family of redox proteins related to the succinate
dehydrogenases and fumarate reductases of E. coli,
mitochondria, and other well-characterized systems. A
member of this family from Shewanella frigidimarina
NCIMB400 is characterized as a water-soluble periplasmic
protein with four heme groups, a non-covalently bound
FAD, and essentially unidirectional fumarate reductase
activity. At least seven distinct members of this family
are found in Shewanella oneidensis, a species able to
use a wide variety of pathways for respiraton [Energy
metabolism, Electron transport].
Length = 439
Score = 31.2 bits (71), Expect = 1.3
Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 1/49 (2%)
Query: 207 ARVLKVLIDPNTKRVFGVEFMKNNKKRVVYAKKEVVLSAGAFFSPHLLL 255
++V ++ D + V GV K + A K VVL+ G F S ++
Sbjct: 151 SKVEDLIQD-DQGTVVGVVVKGKGKGIYIKAAKAVVLATGGFGSNKEMI 198
>gnl|CDD|215257 PLN02464, PLN02464, glycerol-3-phosphate dehydrogenase.
Length = 627
Score = 30.1 bits (68), Expect = 2.9
Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Query: 203 VMKKARVLKVLIDPNTKRVFGVEFMKN--NKKRVVYAKKEVVLSAGAF 248
V+ A V+ ++ D +T R+ G N K+ VYAK VV +AG F
Sbjct: 249 VLNYAEVVSLIKDESTGRIVGARVRDNLTGKEFDVYAKV-VVNAAGPF 295
>gnl|CDD|238970 cd02012, TPP_TK, Thiamine pyrophosphate (TPP) family, Transketolase
(TK) subfamily, TPP-binding module; TK catalyzes the
transfer of a two-carbon unit from ketose phosphates to
aldose phosphates. In heterotrophic organisms, TK
provides a link between glycolysis and the pentose
phosphate pathway and provides precursors for
nucleotide, aromatic amino acid and vitamin
biosynthesis. In addition, the enzyme plays a central
role in the Calvin cycle in plants. Typically, TKs are
homodimers. They require TPP and divalent cations, such
as magnesium ions, for activity.
Length = 255
Score = 29.4 bits (67), Expect = 3.7
Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 13/49 (26%)
Query: 242 VLSAG----AFFSPHLLLLSGIGPREQLEQFNIPVLADLQVGENLQEHP 286
VLS G A ++ +L L+G P E L+ F Q+G L HP
Sbjct: 53 VLSKGHASPALYA--VLALAGYLPEEDLKTFR-------QLGSRLPGHP 92
>gnl|CDD|234752 PRK00413, thrS, threonyl-tRNA synthetase; Reviewed.
Length = 638
Score = 29.7 bits (68), Expect = 5.0
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 7/40 (17%)
Query: 439 IKMVIELSKTNAFQKYKSKLSTRILPACKKHKY-GSDDYW 477
I +++++ K F+ Y+ KLSTR P K GSD+ W
Sbjct: 401 IDLILDVYKDFGFEDYEVKLSTR--PE----KRIGSDEMW 434
>gnl|CDD|147821 pfam05874, PBAN, Pheromone biosynthesis activating neuropeptide
(PBAN). This family consists of several moth pheromone
biosynthesis activating neuropeptide (PBAN) sequences.
Female moths produce and release species specific sex
pheromones to attract males for mating. Pheromone
biosynthesis is hormonally regulated by the Pheromone
Biosynthesis Activating Neuropeptide (PBAN) which is
biosynthesised in the subesophageal ganglion (SOG).
Length = 195
Score = 28.6 bits (64), Expect = 6.2
Identities = 30/97 (30%), Positives = 44/97 (45%), Gaps = 13/97 (13%)
Query: 54 GACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEAGNVGWGYKDVLPYFKK 113
GA RG +++R G +G S+ + T N F R LEA + Y D LPY++
Sbjct: 29 GADRGAHSDRGGLWFGPRLGKRSL---RLSTEDNRQAFLRLLEAADALKYYYDQLPYYEM 85
Query: 114 SED----------IDVPELKRSEYHGVGGYLNVDYSP 140
D I P+L RS + Y NV+++P
Sbjct: 86 QADEPETKVTKKVIFTPKLGRSLAYEDKSYENVEFTP 122
>gnl|CDD|214663 smart00437, TOP1Ac, Bacterial DNA topoisomerase I DNA-binding
domain. Bacterial DNA topoisomerase I and III,
Eukaryotic DNA topoisomeraes III, reverse gyrase alpha
subunit.
Length = 259
Score = 28.8 bits (65), Expect = 6.5
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 103 GYKDVLPYFKKSEDIDVPELKRSE 126
G+ V P KK E+I++P LK+ +
Sbjct: 167 GWLKVYPEEKKEEEIELPTLKKGD 190
>gnl|CDD|240837 cd12391, RRM1_SART3, RNA recognition motif 1 in squamous cell
carcinoma antigen recognized by T-cells 3 (SART3) and
similar proteins. This subfamily corresponds to the
RRM1 of SART3, also termed Tat-interacting protein of
110 kDa (Tip110), an RNA-binding protein expressed in
the nucleus of the majority of proliferating cells,
including normal cells and malignant cells, but not in
normal tissues except for the testes and fetal liver. It
is involved in the regulation of mRNA splicing probably
via its complex formation with RNA-binding protein with
a serine-rich domain (RNPS1), a pre-mRNA-splicing
factor. SART3 has also been identified as a nuclear
Tat-interacting protein that regulates Tat
transactivation activity through direct interaction and
functions as an important cellular factor for HIV-1 gene
expression and viral replication. In addition, SART3 is
required for U6 snRNP targeting to Cajal bodies. It
binds specifically and directly to the U6 snRNA,
interacts transiently with the U6 and U4/U6 snRNPs, and
promotes the reassembly of U4/U6 snRNPs after splicing
in vitro. SART3 contains an N-terminal
half-a-tetratricopeptide repeat (HAT)-rich domain, a
nuclearlocalization signal (NLS) domain, and two
C-terminal RNA recognition motifs (RRMs), also termed
RBDs (RNA binding domains) or RNPs (ribonucleoprotein
domains). .
Length = 72
Score = 26.5 bits (59), Expect = 7.1
Identities = 18/70 (25%), Positives = 27/70 (38%), Gaps = 8/70 (11%)
Query: 135 NVDYSPYKSKLMDAFLESAPEVGLNLTDYNSPDGNVGFSRIQGTIQFGRRFSASQAFLR- 193
N+DYS + +L F + G +TD G S+ ++F S +A
Sbjct: 6 NLDYSVPEDELRKLF----SKCG-EITDVRLVKNYKGKSKGYAYVEFENEESVQEALKLD 60
Query: 194 --PIVERPNF 201
I RP F
Sbjct: 61 RELIKGRPMF 70
>gnl|CDD|201191 pfam00378, ECH, Enoyl-CoA hydratase/isomerase family. This family
contains a diverse set of enzymes including: Enoyl-CoA
hydratase. Napthoate synthase. Carnitate racemase.
3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA
delta-isomerase.
Length = 245
Score = 28.4 bits (64), Expect = 7.1
Identities = 17/86 (19%), Positives = 34/86 (39%), Gaps = 5/86 (5%)
Query: 170 VGFSRIQGTIQFGRRFSASQAFLRPIVER--PNFHVMKKAR-VLKVLID-PNTKRVFGVE 225
+G S + GRR A +A +V++ P ++++A + + L D P
Sbjct: 146 IGVSAALEMLLTGRRIRAQEALKMGLVDKVVPEEQLVEEAIELAQRLADKPPLALAALKA 205
Query: 226 FM-KNNKKRVVYAKKEVVLSAGAFFS 250
M + + + + + A FS
Sbjct: 206 AMRAALEDALPEVRAQALRLYPAPFS 231
>gnl|CDD|179392 PRK02249, PRK02249, DNA primase large subunit; Validated.
Length = 343
Score = 28.4 bits (64), Expect = 8.6
Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 5/40 (12%)
Query: 273 LADLQVGENLQEHPAFASLAFTVNQKVGLVSDRI---FSN 309
L+ LQ GENL FA +F +N +G+ D I F N
Sbjct: 230 LSALQAGENLPHTARFAITSFLLN--IGMSVDEIVELFRN 267
>gnl|CDD|143442 cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate
dehydrogenase, RocA. Delta(1)-pyrroline-5-carboxylate
dehydrogenase (EC=1.5.1.12 ), RocA: a proline catabolic
enzyme of the aldehyde dehydrogenase (ALDH) protein
superfamily. The proline catabolic enzymes, proline
dehydrogenase and Delta(1)-pyrroline-5-carboxylate
dehydrogenase (P5CDH), catalyze the two-step oxidation
of proline to glutamate; P5CDH catalyzes the oxidation
of glutamate semialdehyde, utilizing NAD+ as the
electron acceptor. In some bacteria, the two enzymes are
fused into the bifunctional flavoenzyme, proline
utilization A (PutA). In this CD, monofunctional enzyme
sequences such as seen in the Bacillus subtilis RocA
P5CDH are also present. These enzymes play important
roles in cellular redox control, superoxide generation,
and apoptosis.
Length = 512
Score = 28.7 bits (65), Expect = 8.7
Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 3/36 (8%)
Query: 277 QVGENLQEHPAFASLAFTVNQKVGLVSDRIFSNLAK 312
+VG+ L EHP +AFT +++VGL RI+ AK
Sbjct: 235 EVGDYLVEHPDVRFIAFTGSREVGL---RIYERAAK 267
>gnl|CDD|189541 pfam00426, VP4_haemagglut, Outer Capsid protein VP4
(Hemagglutinin).
Length = 776
Score = 28.6 bits (64), Expect = 9.8
Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 4/38 (10%)
Query: 429 PLDLLRIVEGIKMVIELSK---TNAFQKYK-SKLSTRI 462
PLD+ + GIK I+ +K TN +K+K SKL+ +
Sbjct: 524 PLDMFSMFSGIKSTIDAAKSMATNVMKKFKKSKLANSV 561
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.138 0.427
Gapped
Lambda K H
0.267 0.0762 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,969,120
Number of extensions: 2657201
Number of successful extensions: 2190
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2154
Number of HSP's successfully gapped: 33
Length of query: 517
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 416
Effective length of database: 6,457,848
Effective search space: 2686464768
Effective search space used: 2686464768
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.2 bits)