RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10349
         (517 letters)



>gnl|CDD|235000 PRK02106, PRK02106, choline dehydrogenase; Validated.
          Length = 560

 Score =  337 bits (866), Expect = e-109
 Identities = 185/547 (33%), Positives = 263/547 (48%), Gaps = 95/547 (17%)

Query: 1   GCVVTNRLTENPDWKVLLLEAGDEENA---LTDIPETAHY-LQFTKFNWNFTTEFQPGAC 56
           GCV+ NRL+E+PD  VLLLEAG  +        +P    + LQ  ++NW + TE +P   
Sbjct: 17  GCVLANRLSEDPDVSVLLLEAGGPDYRWDFFIQMPAALAFPLQGKRYNWAYETEPEPH-- 74

Query: 57  RGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLE-AGNVGWGYKDVLPYFKKSE 115
             +NN R   P G+ +GG+S IN  +Y RGN  D+D W E  G  GW Y D LPYFKK+E
Sbjct: 75  --MNNRRMECPRGKVLGGSSSINGMVYIRGNAMDYDNWAELPGLEGWSYADCLPYFKKAE 132

Query: 116 DIDVPELKRSEYHGVGGYLNVDYS-PYKSKLMDAFLESAPEVGLNLT-DYNSPDGNVGFS 173
             D  E    +Y G  G L+V    P  + L  AF+E+  + G   T D N      GF 
Sbjct: 133 TRDGGE---DDYRGGDGPLSVTRGKPGTNPLFQAFVEAGVQAGYPRTDDLNGYQQE-GFG 188

Query: 174 RIQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKR 233
            +  T+  GRR+SA++A+L P ++RPN  ++  A   ++L +   KR  GVE+ +   + 
Sbjct: 189 PMDRTVTNGRRWSAARAYLDPALKRPNLTIVTHALTDRILFEG--KRAVGVEYERGGGRE 246

Query: 234 VVYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADLQ-VGENLQEHPAFASLA 292
              A++EV+LSAGA  SP LL LSGIGP E L++  IPV+ DL  VGENLQ+H     L 
Sbjct: 247 TARARREVILSAGAINSPQLLQLSGIGPAEHLKELGIPVVHDLPGVGENLQDH-----LE 301

Query: 293 FTVNQKVGL-VSDRIFSNLA---KETIKA---FTNQGWTTTLGCEGLGYVRTKYNNYPPG 345
             +  +    VS  ++  L    K  I A   FT  G   +   E  G++R++      G
Sbjct: 302 VYIQYECKQPVS--LYPALKWWNKPKIGAEWLFTGTGLGASNHFEAGGFIRSR-----AG 354

Query: 346 V--PDIEYIFVPASL-----AIEEEKG-----GSLLRKTMGIPDRTFKELFRDVKNKDAW 393
           V  P+I+Y F+P ++        +  G     G                           
Sbjct: 355 VDWPNIQYHFLPVAIRYDGSNAVKGHGFQAHVGP-------------------------- 388

Query: 394 SIWPMLMYPESRGYVRLKSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQK 453
                 M   SRG V+LKSADP  +P++  N+     D     + I++  E+    A   
Sbjct: 389 ------MRSPSRGSVKLKSADPRAHPSILFNYMSTEQDWREFRDAIRLTREIMAQPALDP 442

Query: 454 YKSKLSTRILPACKKHKYGSD---DYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPR 510
           Y+ +    I P        +D   D +   VR+     +H   TCKMG   D  AVVDP 
Sbjct: 443 YRGR---EISPGADVQ---TDEEIDAF---VREHAETAYHPSCTCKMGT--DPMAVVDPE 491

Query: 511 LRVHGIK 517
            RVHG++
Sbjct: 492 GRVHGVE 498


>gnl|CDD|225186 COG2303, BetA, Choline dehydrogenase and related flavoproteins
           [Amino acid transport and metabolism].
          Length = 542

 Score =  298 bits (765), Expect = 5e-95
 Identities = 170/529 (32%), Positives = 241/529 (45%), Gaps = 61/529 (11%)

Query: 1   GCVVTNRLTENPDWKVLLLEAGDEENALTDIPETAHY-------LQFTKFNWNFTTEFQP 53
           G V+  RL++     VL+LEAG       D              +   +++W F TE +P
Sbjct: 19  GSVLAARLSD-AGLSVLVLEAGGP-----DRRPLIQMPAAYAFLMNGPRYDWGFRTEPEP 72

Query: 54  GACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEA-GNVGWGYKDVLPYFK 112
                L      WP G+ +GG+S IN  +Y RG+P DFD W +  G  GW Y DVLPYFK
Sbjct: 73  H----LRGRELAWPRGKVLGGSSSINGMVYVRGHPEDFDAWAQESGAPGWPYDDVLPYFK 128

Query: 113 KSED-IDVPELKRSEYHGVGGYLNVDYSPYKSKLMDAFLESAPEVGLNLT-DYNSPDGNV 170
           ++ED + V       +HG GG L V      + +  AF+E+  ++G   T D N  D   
Sbjct: 129 RAEDLLGVGGQDLRTWHGGGGPLPVSPPRSPNPIARAFIEAGEQLGFPTTPDPNGADQE- 187

Query: 171 GFSRIQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNN 230
           GF     TI  GRR+SA++A+L+P ++RPN  ++  ARV ++L++ +      VE     
Sbjct: 188 GFGPYCVTICNGRRWSAARAYLKPALKRPNLTLLTGARVRRILLEGDRAVGVEVEIGDGG 247

Query: 231 KKRVVYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADL-QVGENLQEHPAFA 289
                 A +EVVL+AGA  SP LLLLSGIGP + L +  I V+  L  VG+NLQ+H    
Sbjct: 248 TIETAVAAREVVLAAGAINSPKLLLLSGIGPADHLLEHGIDVVGRLPGVGQNLQDHLEIY 307

Query: 290 SLAFTVNQKVGLVSDRIFSNLAKETI-KAFTNQGWTTTLGCEGLGYVRTKYNNYPPG-VP 347
            +AF   +        +FS L         T  G   T   EG G+VR+     P G  P
Sbjct: 308 -VAFEATEPTNDSVLSLFSKLGIGADRYLLTRDGPGATNHFEG-GFVRSG----PAGEYP 361

Query: 348 DIEYIFVPASLAIEEEKGGSLLRKTMGIPDRTFKELFRDVKNKDAWSIWPMLMYPESRGY 407
           D +Y F P  LAI                        R    +  +++    M P+SRG 
Sbjct: 362 DGQYHFAPLPLAI------------------------RAAGAEHGFTLHVGPMRPKSRGS 397

Query: 408 VRLKSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKYKSKLSTRILPACK 467
           V L+S DP   P +  N+     D      GI++  E+    A           + P   
Sbjct: 398 VTLRSPDPDNRPVIDPNYLSAEGDRAIFRAGIRLTREIIGQPALDAR---RKAELAPGP- 453

Query: 468 KHKYGSDDYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPRLRVHGI 516
             +  +D+     +R L    +H  GTC+MG       V DP LRVHG+
Sbjct: 454 --RVTTDEDISAAIRFLARTAYHPMGTCRMG-SDPAAVVDDPYLRVHGL 499


>gnl|CDD|233581 TIGR01810, betA, choline dehydrogenase.  Choline dehydrogenase
           catalyzes the conversion of exogenously supplied choline
           into the intermediate glycine betaine aldehyde, as part
           of a two-step oxidative reaction leading to the
           formation of osmoprotectant betaine. This enzymatic
           system can be found in both gram-positive and
           gram-negative bacteria. As in Escherichia coli ,
           Staphylococcus xylosus , and Sinorhizobium meliloti,
           this enzyme is found associated in a transciptionally
           co-induced gene cluster with betaine aldehyde
           dehydrogenase, the second catalytic enzyme in this
           reaction. Other gram-positive organisms have been shown
           to employ a different enzymatic system, utlizing a
           soluable choline oxidase or type III alcohol
           dehydrogenase instead of choline dehydrogenase. This
           enzyme is a member of the GMC oxidoreductase family
           (pfam00732 and pfam05199), sharing a common
           evoluntionary origin and enzymatic reaction with alcohol
           dehydrogenase. Outgrouping from this model, Caulobacter
           crescentus shares sequence homology with choline
           dehydrogenase, yet other genes participating in this
           enzymatic reaction have not currently been identified
           [Cellular processes, Adaptations to atypical
           conditions].
          Length = 532

 Score =  277 bits (711), Expect = 3e-87
 Identities = 176/526 (33%), Positives = 245/526 (46%), Gaps = 54/526 (10%)

Query: 1   GCVVTNRLTENPDWKVLLLEAGDEE---NALTDIPETAHY-LQFTKFNWNFTTEFQPGAC 56
           G V+  RL+E+    VL+LEAG  +   + L  +P    Y     ++NW + TE +P   
Sbjct: 11  GSVLAGRLSEDVSNSVLVLEAGGSDYPWDLLIQMPAALAYPAGNKRYNWIYETEPEPH-- 68

Query: 57  RGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEA-GNVGWGYKDVLPYFKKSE 115
             +NN R     G+ +GG+S IN  IY RGNP D+++W +  G   W Y D LPY+K+ E
Sbjct: 69  --MNNRRVGHARGKVLGGSSSINGMIYQRGNPMDYEKWAKPEGMESWDYADCLPYYKRLE 126

Query: 116 DIDVPELKRSEYHGVGGYLNVDYSPYKSKLMDAFLESAPEVGLNLT-DYNSPDGNVGFSR 174
                E     Y G  G + V   P  + L  AF+E+  E G N T D N      GF  
Sbjct: 127 TTFGGE---KPYRGHDGPIKVRRGPADNPLFQAFIEAGVEAGYNKTPDVNGFRQE-GFGP 182

Query: 175 IQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKRV 234
           +  T+  GRR SA++A+L P ++RPN  V  +A V K+  +    R  GVEF K  +K  
Sbjct: 183 MDSTVHNGRRVSAARAYLHPAMKRPNLEVQTRAFVTKINFE--GNRATGVEFKKGGRKEH 240

Query: 235 VYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADLQ-VGENLQEHPAFASLAF 293
             A KEV+LSAGA  SP LL LSGIG  E L +  I     L  VGENLQ+H     +  
Sbjct: 241 TEANKEVILSAGAINSPQLLQLSGIGDAEHLRELGIEPRIHLPGVGENLQDHLEV-YVQH 299

Query: 294 TVNQKVGLVSDRIFSNLAKETIK-AFTNQGWTTTLGCEGLGYVRTKYNNYPPGV--PDIE 350
              Q V L     +        +  F  +G   +   EG G+VR+        V  P+I+
Sbjct: 300 ACKQPVSLYPSLNWLKQPFIGAQWLFGRKGAGASNHFEGGGFVRSN-----DDVDYPNIQ 354

Query: 351 YIFVPASLAIEEEKGGSLLRKTMGIPDRTFKELFRDVKNKDAWSIWPMLMYPESRGYVRL 410
           Y F+P ++  +    G+   K  G                  + +    MY  SRG+V++
Sbjct: 355 YHFLPVAIRYD----GTKAPKAHG------------------FQVHVGPMYSNSRGHVKI 392

Query: 411 KSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKYKSKLSTRILPACKKHK 470
           KS DP   P +  N+     D     E I++  E+ K  A   Y+      I P  +   
Sbjct: 393 KSKDPFEKPEIVFNYMSHEEDWREFREAIRVTREILKQKALDPYRGG---EISPGPEVQT 449

Query: 471 YGSDDYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPRLRVHGI 516
               D +   VR+      H CGTCKMGP  D  +VVDP  RVHG+
Sbjct: 450 DEEIDEF---VRRHGETALHPCGTCKMGPASDEMSVVDPETRVHGM 492


>gnl|CDD|234421 TIGR03970, Rv0697, dehydrogenase, Rv0697 family.  This model
           describes a set of dehydrogenases belonging to the
           glucose-methanol-choline oxidoreductase (GMC
           oxidoreductase) family. Members of the present family
           are restricted to Actinobacterial genome contexts
           containing also members of families TIGR03962 and
           TIGR03969 (the mycofactocin system), and are proposed to
           be uniform in function.
          Length = 487

 Score =  220 bits (563), Expect = 6e-66
 Identities = 152/526 (28%), Positives = 213/526 (40%), Gaps = 95/526 (18%)

Query: 1   GCVVTNRLTENPDWKVLLLEAGDEENALTDIP-ETAHYLQF-----TKFNWNFTTEFQPG 54
           GCVV  RL+E+P   V +LEAG      + +P +    L+      +   W +  E   G
Sbjct: 12  GCVVAARLSEDPSCTVTVLEAGPGYRDPSRLPAQLTDGLRLPIGPASPVVWRYGVELTDG 71

Query: 55  ACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEAGNVGWGYKDVLPYFKKS 114
             R     R     GR +GG+  +N   + R  P DFD W      GW + DVLP+F+  
Sbjct: 72  PRRASAIVR-----GRVLGGSGAVNGGYFCRALPADFDAW---PIPGWSWDDVLPHFRAI 123

Query: 115 E-DIDVPELKRSEYHGVGGYLNVDYSPYKSKLMDAFLESAPEVGL-NLTDYNS--PDGNV 170
           E D+D         HG  G + V  +     +  AF+ +A   G   + D N   PD   
Sbjct: 124 ETDLDF----DGPLHGTAGPIPVRRTAELDGISAAFVAAALGAGFGWIADLNGSGPDAPT 179

Query: 171 GFSRIQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNN 230
           G   +   +  GRR S + A+L P ++RPN  V    RV+++L      R  GVE + + 
Sbjct: 180 GVGAVPLNVDGGRRVSTAVAYLLPALKRPNLTVEADTRVVRILFSGT--RAVGVEVLGDG 237

Query: 231 KKRVVYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADLQVGENLQEHPAFAS 290
             R + A + VVL AGA  S HLLLLSGIGP EQL    I V+ DL VG +  +HP +  
Sbjct: 238 GPRTLRADR-VVLCAGAVESAHLLLLSGIGPAEQLRAAGIAVVLDLPVGSDFVDHPEW-- 294

Query: 291 LAFTVNQKVGLVSDRIFSNLAKETIKAFTNQGWTTTLGCEGLGYVRTKYNNYPPGVPDIE 350
                                                    L Y     ++ P   P +E
Sbjct: 295 ----------------------------------------VLPYRWRPTHDRPDTSPVLE 314

Query: 351 YIFVPASLAIEEEKGGSLLRKTMGIPDRTFKELFRDVKNKDAWSIWPMLMYPESRGYVRL 410
            +   A + I     G             F  L      +D   +   LM P SRG +RL
Sbjct: 315 TVLNTADIEIRPYTAG-------------FTALVPGS-PRDDPHLGVALMRPHSRGRIRL 360

Query: 411 KSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKYKSKLSTRILPACKKHK 470
            SADP   P ++  +     D   +  G  +  EL  +         L   + PA ++  
Sbjct: 361 ASADPADPPRIEHRYDSSAADRAALRAGAALAHELLGSPE-------LGPLLEPAVRE-- 411

Query: 471 YGSDDYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPRLRVHGI 516
              +  W    R  T Q  H CG+C+MG   D  AVVD R RV G+
Sbjct: 412 --GEASW-VLARLATSQ--HLCGSCRMGGRDDPGAVVDARCRVRGV 452


>gnl|CDD|216089 pfam00732, GMC_oxred_N, GMC oxidoreductase.  This family of
           proteins bind FAD as a cofactor.
          Length = 218

 Score =  161 bits (410), Expect = 1e-46
 Identities = 79/241 (32%), Positives = 106/241 (43%), Gaps = 38/241 (15%)

Query: 54  GACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRW-LEAGNVGWGYKDVLPYFK 112
           G+   +N  R   PAG  VGG S +N +   R      D W  E G  GWGY D LPY  
Sbjct: 9   GSAGSVNGRRMILPAGSTVGGGSSVNWSACIRTPAAVLDEWASEFGLEGWGYDDYLPYMD 68

Query: 113 KSEDIDVPELKRSEYHGVGGYLNV-DYSPYKSKLMDAFLESAPEVGLNLTDY--NSPDG- 168
           K E                G L V      +S L  A L++A E+G  +     NS    
Sbjct: 69  KVE----------------GPLGVTTKGIEESPLNQALLKAAEELGYPVEAVPRNSNGCH 112

Query: 169 NVGFSRIQGTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEF-- 226
             GF  +      G + S ++ +LRP +ER N  ++  A+  K++I     R  GVE   
Sbjct: 113 YCGFCGLG--CPTGAKQSTARTWLRPALER-NLRILTGAKAEKIIILGRGGRAVGVEARD 169

Query: 227 MKNNKKRVVYAKKEVVLSAGAFFSPHLLLLSGIGPREQLEQFNIPVLADLQVGENLQEHP 286
                KR++ A KEVV++AGA  +P LLL SG+G                 VG+NLQ HP
Sbjct: 170 GGGGIKRLITAAKEVVVAAGALNTPPLLLRSGLGKNPH------------PVGKNLQLHP 217

Query: 287 A 287
            
Sbjct: 218 V 218


>gnl|CDD|218493 pfam05199, GMC_oxred_C, GMC oxidoreductase.  This domain found
           associated with pfam00732.
          Length = 139

 Score =  115 bits (289), Expect = 2e-30
 Identities = 37/116 (31%), Positives = 53/116 (45%), Gaps = 6/116 (5%)

Query: 402 PESRGYVRLKSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKYKSKLSTR 461
           P SRG V L S+DP+  P +  N+  DP DL  +   +++   +           +L+  
Sbjct: 1   PRSRGRVTLSSSDPLGLPVIDPNYLSDPADLAALRAALRLARRILAAAL-----VELTPG 55

Query: 462 ILPACKKHKYGSDDYWGCCVRQLTMQMHHQCGTCKMGPDGDRTAVVDPRLRVHGIK 517
            +         SDD     +R   +   H  GTC+MG D D   VVDP LRVHG+ 
Sbjct: 56  PVEVVPGAAEASDDEILEYIRAAAVTSWHPMGTCRMGADPD-DGVVDPDLRVHGVD 110


>gnl|CDD|215420 PLN02785, PLN02785, Protein HOTHEAD.
          Length = 587

 Score = 79.5 bits (196), Expect = 1e-15
 Identities = 87/316 (27%), Positives = 135/316 (42%), Gaps = 84/316 (26%)

Query: 1   GCVVTNRLTENPDWKVLLLEAGDE----------EN---ALTDIPETAHYLQFTKFNWNF 47
           GC +   L++N  + VLLLE G            EN    L D   T+    F       
Sbjct: 67  GCPLAATLSQN--FSVLLLERGGVPFGNANVSFLENFHIGLADTSPTSASQAF------I 118

Query: 48  TTEFQPGACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEAGNVGWGYKDV 107
           +T+       G+ N R      R +GG + IN   Y+R +     +       GW  K V
Sbjct: 119 STD-------GVINAR-----ARVLGGGTCINAGFYSRASTRFIQK------AGWDAKLV 160

Query: 108 LPYFKKSEDIDVPELKRSEYHGVGGYLNVDYSPYKSKLMDAFLESAPEVGLNLTDYNSPD 167
              +   E   V   K               +P+++ L D+ LE      + ++ +N   
Sbjct: 161 NESYPWVERQIVHWPK--------------VAPWQAALRDSLLE------VGVSPFN--- 197

Query: 168 GNVGFS-------RIQGTI--QFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNT 218
              GF+       ++ GTI  +FGRR +A  A L          V+  A V K++ D + 
Sbjct: 198 ---GFTYDHVYGTKVGGTIFDEFGRRHTA--AELLAAGNPNKLRVLLHATVQKIVFDTSG 252

Query: 219 K--RVFGVEFM--KNNKKRVVYAKK---EVVLSAGAFFSPHLLLLSGIGPREQLEQFNIP 271
           K  R  GV F     N+ +   +     E++LSAGA  SP +LLLSGIGP+++L++  IP
Sbjct: 253 KRPRATGVIFKDENGNQHQAFLSNNKGSEIILSAGAIGSPQMLLLSGIGPKKELKKHKIP 312

Query: 272 -VLADLQVGENLQEHP 286
            VL +  VG+ + ++P
Sbjct: 313 VVLHNEHVGKGMADNP 328



 Score = 49.0 bits (117), Expect = 5e-06
 Identities = 19/54 (35%), Positives = 31/54 (57%)

Query: 401 YPESRGYVRLKSADPMVYPAVQSNFFQDPLDLLRIVEGIKMVIELSKTNAFQKY 454
            P S G++ L + +    P+V  N+F+ P DL R V GI+ + ++ KTN F  +
Sbjct: 423 GPISTGHLSLINTNVDDNPSVTFNYFKHPQDLQRCVYGIRTIEKIVKTNHFTNF 476


>gnl|CDD|180854 PRK07121, PRK07121, hypothetical protein; Validated.
          Length = 492

 Score = 38.7 bits (91), Expect = 0.007
 Identities = 18/72 (25%), Positives = 28/72 (38%), Gaps = 4/72 (5%)

Query: 177 GTIQFGRRFSASQAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKRVVY 236
           G    G         L          +    R  ++++D    RV GVE  +  +   + 
Sbjct: 171 GDSGGGAML---MDPLAKRAAALGVQIRYDTRATRLIVDD-DGRVVGVEARRYGETVAIR 226

Query: 237 AKKEVVLSAGAF 248
           A+K VVL+AG F
Sbjct: 227 ARKGVVLAAGGF 238


>gnl|CDD|237532 PRK13858, PRK13858, type IV secretion system T-DNA border
           endonuclease VirD1; Provisional.
          Length = 147

 Score = 31.3 bits (71), Expect = 0.56
 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 8/73 (10%)

Query: 130 VGGYLNVDYSPYKSKLMDAFLESAPEVGLNLTD----YNSPDGNVGFSRIQGT-IQFGRR 184
           +GG+L +D      + M+A L+S   +  N+      Y   +       ++   I FG+ 
Sbjct: 61  IGGFLEID--AETREKMEAILQSIGTLSSNIAALLSAYAE-NPRPDLEALRAERIAFGKE 117

Query: 185 FSASQAFLRPIVE 197
           F+     LR I+ 
Sbjct: 118 FADLDGLLRSILS 130


>gnl|CDD|236010 PRK07395, PRK07395, L-aspartate oxidase; Provisional.
          Length = 553

 Score = 32.3 bits (74), Expect = 0.73
 Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 1/55 (1%)

Query: 192 LRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKRVVYAKKEVVLSAG 246
              +++RPN  ++ +A  L + ++P T R  G+  +   +   + A   V+L+ G
Sbjct: 141 TEQVLQRPNIEIISQALALSLWLEPETGRCQGISLLYQGQITWLRAGA-VILATG 194


>gnl|CDD|181196 PRK08010, PRK08010, pyridine nucleotide-disulfide oxidoreductase;
           Provisional.
          Length = 441

 Score = 31.9 bits (72), Expect = 0.88
 Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 2/58 (3%)

Query: 172 FSRIQGTIQFGRRFSAS-QAFLRPIVERPNFHVMKKAR-VLKVLIDPNTKRVFGVEFM 227
            SR+  T +  R   A  Q    P+   P   VM   R VLK ++D  T+R+ G   +
Sbjct: 338 LSRVGMTEEQARESGADIQVVTLPVAAIPRARVMNDTRGVLKAIVDNKTQRILGASLL 395


>gnl|CDD|223108 COG0029, NadB, Aspartate oxidase [Coenzyme metabolism].
          Length = 518

 Score = 31.8 bits (73), Expect = 0.96
 Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 1/58 (1%)

Query: 189 QAFLRPIVERPNFHVMKKARVLKVLIDPNTKRVFGVEFMKNNKKRVVYAKKEVVLSAG 246
            A L+ +  RPN  V++ A  L ++I+     V GV  +  N +   +  K VVL+ G
Sbjct: 137 TALLKKVRNRPNITVLEGAEALDLIIEDGIG-VAGVLVLNRNGELGTFRAKAVVLATG 193


>gnl|CDD|233583 TIGR01813, flavo_cyto_c, flavocytochrome c.  This model describes a
           family of redox proteins related to the succinate
           dehydrogenases and fumarate reductases of E. coli,
           mitochondria, and other well-characterized systems. A
           member of this family from Shewanella frigidimarina
           NCIMB400 is characterized as a water-soluble periplasmic
           protein with four heme groups, a non-covalently bound
           FAD, and essentially unidirectional fumarate reductase
           activity. At least seven distinct members of this family
           are found in Shewanella oneidensis, a species able to
           use a wide variety of pathways for respiraton [Energy
           metabolism, Electron transport].
          Length = 439

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 1/49 (2%)

Query: 207 ARVLKVLIDPNTKRVFGVEFMKNNKKRVVYAKKEVVLSAGAFFSPHLLL 255
           ++V  ++ D +   V GV      K   + A K VVL+ G F S   ++
Sbjct: 151 SKVEDLIQD-DQGTVVGVVVKGKGKGIYIKAAKAVVLATGGFGSNKEMI 198


>gnl|CDD|215257 PLN02464, PLN02464, glycerol-3-phosphate dehydrogenase.
          Length = 627

 Score = 30.1 bits (68), Expect = 2.9
 Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 3/48 (6%)

Query: 203 VMKKARVLKVLIDPNTKRVFGVEFMKN--NKKRVVYAKKEVVLSAGAF 248
           V+  A V+ ++ D +T R+ G     N   K+  VYAK  VV +AG F
Sbjct: 249 VLNYAEVVSLIKDESTGRIVGARVRDNLTGKEFDVYAKV-VVNAAGPF 295


>gnl|CDD|238970 cd02012, TPP_TK, Thiamine pyrophosphate (TPP) family, Transketolase
           (TK) subfamily, TPP-binding module; TK catalyzes the
           transfer of a two-carbon unit from ketose phosphates to
           aldose phosphates. In heterotrophic organisms, TK
           provides a link between glycolysis and the pentose
           phosphate pathway and provides precursors for
           nucleotide, aromatic amino acid and vitamin
           biosynthesis. In addition, the enzyme plays a central
           role in the Calvin cycle in plants. Typically, TKs are
           homodimers. They require TPP and divalent cations, such
           as magnesium ions, for activity.
          Length = 255

 Score = 29.4 bits (67), Expect = 3.7
 Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 242 VLSAG----AFFSPHLLLLSGIGPREQLEQFNIPVLADLQVGENLQEHP 286
           VLS G    A ++  +L L+G  P E L+ F        Q+G  L  HP
Sbjct: 53  VLSKGHASPALYA--VLALAGYLPEEDLKTFR-------QLGSRLPGHP 92


>gnl|CDD|234752 PRK00413, thrS, threonyl-tRNA synthetase; Reviewed.
          Length = 638

 Score = 29.7 bits (68), Expect = 5.0
 Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 7/40 (17%)

Query: 439 IKMVIELSKTNAFQKYKSKLSTRILPACKKHKY-GSDDYW 477
           I +++++ K   F+ Y+ KLSTR  P     K  GSD+ W
Sbjct: 401 IDLILDVYKDFGFEDYEVKLSTR--PE----KRIGSDEMW 434


>gnl|CDD|147821 pfam05874, PBAN, Pheromone biosynthesis activating neuropeptide
           (PBAN).  This family consists of several moth pheromone
           biosynthesis activating neuropeptide (PBAN) sequences.
           Female moths produce and release species specific sex
           pheromones to attract males for mating. Pheromone
           biosynthesis is hormonally regulated by the Pheromone
           Biosynthesis Activating Neuropeptide (PBAN) which is
           biosynthesised in the subesophageal ganglion (SOG).
          Length = 195

 Score = 28.6 bits (64), Expect = 6.2
 Identities = 30/97 (30%), Positives = 44/97 (45%), Gaps = 13/97 (13%)

Query: 54  GACRGLNNERCPWPAGRAVGGASVINNNIYTRGNPNDFDRWLEAGNVGWGYKDVLPYFKK 113
           GA RG +++R     G  +G  S+    + T  N   F R LEA +    Y D LPY++ 
Sbjct: 29  GADRGAHSDRGGLWFGPRLGKRSL---RLSTEDNRQAFLRLLEAADALKYYYDQLPYYEM 85

Query: 114 SED----------IDVPELKRSEYHGVGGYLNVDYSP 140
             D          I  P+L RS  +    Y NV+++P
Sbjct: 86  QADEPETKVTKKVIFTPKLGRSLAYEDKSYENVEFTP 122


>gnl|CDD|214663 smart00437, TOP1Ac, Bacterial DNA topoisomerase I DNA-binding
           domain.  Bacterial DNA topoisomerase I and III,
           Eukaryotic DNA topoisomeraes III, reverse gyrase alpha
           subunit.
          Length = 259

 Score = 28.8 bits (65), Expect = 6.5
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 103 GYKDVLPYFKKSEDIDVPELKRSE 126
           G+  V P  KK E+I++P LK+ +
Sbjct: 167 GWLKVYPEEKKEEEIELPTLKKGD 190


>gnl|CDD|240837 cd12391, RRM1_SART3, RNA recognition motif 1 in squamous cell
           carcinoma antigen recognized by T-cells 3 (SART3) and
           similar proteins.  This subfamily corresponds to the
           RRM1 of SART3, also termed Tat-interacting protein of
           110 kDa (Tip110), an RNA-binding protein expressed in
           the nucleus of the majority of proliferating cells,
           including normal cells and malignant cells, but not in
           normal tissues except for the testes and fetal liver. It
           is involved in the regulation of mRNA splicing probably
           via its complex formation with RNA-binding protein with
           a serine-rich domain (RNPS1), a pre-mRNA-splicing
           factor. SART3 has also been identified as a nuclear
           Tat-interacting protein that regulates Tat
           transactivation activity through direct interaction and
           functions as an important cellular factor for HIV-1 gene
           expression and viral replication. In addition, SART3 is
           required for U6 snRNP targeting to Cajal bodies. It
           binds specifically and directly to the U6 snRNA,
           interacts transiently with the U6 and U4/U6 snRNPs, and
           promotes the reassembly of U4/U6 snRNPs after splicing
           in vitro. SART3 contains an N-terminal
           half-a-tetratricopeptide repeat (HAT)-rich domain, a
           nuclearlocalization signal (NLS) domain, and two
           C-terminal RNA recognition motifs (RRMs), also termed
           RBDs (RNA binding domains) or RNPs (ribonucleoprotein
           domains). .
          Length = 72

 Score = 26.5 bits (59), Expect = 7.1
 Identities = 18/70 (25%), Positives = 27/70 (38%), Gaps = 8/70 (11%)

Query: 135 NVDYSPYKSKLMDAFLESAPEVGLNLTDYNSPDGNVGFSRIQGTIQFGRRFSASQAFLR- 193
           N+DYS  + +L   F     + G  +TD        G S+    ++F    S  +A    
Sbjct: 6   NLDYSVPEDELRKLF----SKCG-EITDVRLVKNYKGKSKGYAYVEFENEESVQEALKLD 60

Query: 194 --PIVERPNF 201
              I  RP F
Sbjct: 61  RELIKGRPMF 70


>gnl|CDD|201191 pfam00378, ECH, Enoyl-CoA hydratase/isomerase family.  This family
           contains a diverse set of enzymes including: Enoyl-CoA
           hydratase. Napthoate synthase. Carnitate racemase.
           3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA
           delta-isomerase.
          Length = 245

 Score = 28.4 bits (64), Expect = 7.1
 Identities = 17/86 (19%), Positives = 34/86 (39%), Gaps = 5/86 (5%)

Query: 170 VGFSRIQGTIQFGRRFSASQAFLRPIVER--PNFHVMKKAR-VLKVLID-PNTKRVFGVE 225
           +G S     +  GRR  A +A    +V++  P   ++++A  + + L D P         
Sbjct: 146 IGVSAALEMLLTGRRIRAQEALKMGLVDKVVPEEQLVEEAIELAQRLADKPPLALAALKA 205

Query: 226 FM-KNNKKRVVYAKKEVVLSAGAFFS 250
            M    +  +   + + +    A FS
Sbjct: 206 AMRAALEDALPEVRAQALRLYPAPFS 231


>gnl|CDD|179392 PRK02249, PRK02249, DNA primase large subunit; Validated.
          Length = 343

 Score = 28.4 bits (64), Expect = 8.6
 Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 5/40 (12%)

Query: 273 LADLQVGENLQEHPAFASLAFTVNQKVGLVSDRI---FSN 309
           L+ LQ GENL     FA  +F +N  +G+  D I   F N
Sbjct: 230 LSALQAGENLPHTARFAITSFLLN--IGMSVDEIVELFRN 267


>gnl|CDD|143442 cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, RocA.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (EC=1.5.1.12 ), RocA: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily. The proline catabolic enzymes, proline
           dehydrogenase and Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), catalyze the two-step oxidation
           of proline to glutamate; P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). In this CD, monofunctional enzyme
           sequences such as seen in the Bacillus subtilis RocA
           P5CDH are also present. These enzymes play important
           roles in cellular redox control, superoxide generation,
           and apoptosis.
          Length = 512

 Score = 28.7 bits (65), Expect = 8.7
 Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 3/36 (8%)

Query: 277 QVGENLQEHPAFASLAFTVNQKVGLVSDRIFSNLAK 312
           +VG+ L EHP    +AFT +++VGL   RI+   AK
Sbjct: 235 EVGDYLVEHPDVRFIAFTGSREVGL---RIYERAAK 267


>gnl|CDD|189541 pfam00426, VP4_haemagglut, Outer Capsid protein VP4
           (Hemagglutinin). 
          Length = 776

 Score = 28.6 bits (64), Expect = 9.8
 Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 4/38 (10%)

Query: 429 PLDLLRIVEGIKMVIELSK---TNAFQKYK-SKLSTRI 462
           PLD+  +  GIK  I+ +K   TN  +K+K SKL+  +
Sbjct: 524 PLDMFSMFSGIKSTIDAAKSMATNVMKKFKKSKLANSV 561


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.138    0.427 

Gapped
Lambda     K      H
   0.267   0.0762    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,969,120
Number of extensions: 2657201
Number of successful extensions: 2190
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2154
Number of HSP's successfully gapped: 33
Length of query: 517
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 416
Effective length of database: 6,457,848
Effective search space: 2686464768
Effective search space used: 2686464768
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.2 bits)