RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14277
         (217 letters)



>gnl|CDD|215689 pfam00067, p450, Cytochrome P450.  Cytochrome P450s are
           haem-thiolate proteins involved in the oxidative
           degradation of various compounds. They are particularly
           well known for their role in the degradation of
           environmental toxins and mutagens. They can be divided
           into 4 classes, according to the method by which
           electrons from NAD(P)H are delivered to the catalytic
           site. Sequence conservation is relatively low within the
           family - there are only 3 absolutely conserved residues
           - but their general topography and structural fold are
           highly conserved. The conserved core is composed of a
           coil termed the 'meander', a four-helix bundle, helices
           J and K, and two sets of beta-sheets. These constitute
           the haem-binding loop (with an absolutely conserved
           cysteine that serves as the 5th ligand for the haem
           iron), the proton-transfer groove and the absolutely
           conserved EXXR motif in helix K. While prokaryotic P450s
           are soluble proteins, most eukaryotic P450s are
           associated with microsomal membranes. their general
           enzymatic function is to catalyze regiospecific and
           stereospecific oxidation of non-activated hydrocarbons
           at physiological temperatures.
          Length = 461

 Score =  160 bits (407), Expect = 5e-47
 Identities = 62/156 (39%), Positives = 91/156 (58%), Gaps = 4/156 (2%)

Query: 55  MQEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDL 114
            +E+ + V  L   G + ++ST +  L+ELA + EVQ+K R E+ +V        TY+DL
Sbjct: 259 DEELRATVLELFFAGTDTTSSTLSWALYELAKHPEVQEKLREEIDEVI-GDKRSPTYDDL 317

Query: 115 AKMTYLEQVISEALRLYPLG-NGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATL 173
             M YL+ VI E LRL+P+    L RE T+D  IP   ++I KG LV +  YAL  D  +
Sbjct: 318 QNMPYLDAVIKETLRLHPVVPLLLPREVTKDTVIPG--YLIPKGTLVIVNLYALHRDPEV 375

Query: 174 WSDPLEFNPDRFAPENESKIVPGSYVPFGDGPRTCI 209
           + +P EF+P+RF  EN       +++PFG GPR C+
Sbjct: 376 FPNPEEFDPERFLDENGKFRKSFAFLPFGAGPRNCL 411


>gnl|CDD|225035 COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis,
           transport, and catabolism].
          Length = 411

 Score =  109 bits (275), Expect = 3e-28
 Identities = 41/154 (26%), Positives = 64/154 (41%), Gaps = 28/154 (18%)

Query: 56  QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLA 115
            E+   +  L+V GHE + +     L+ L  + +   K R E  +               
Sbjct: 235 DEIRDELITLLVAGHETTANALAWALYALLRHPDQLAKLRAEPDR--------------- 279

Query: 116 KMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWS 175
               LE V+ E LRLYP      R  T+D  +    + I  G +V +   A   D  ++ 
Sbjct: 280 --PLLEAVVEETLRLYPPVPLARRVATED--VELGGYRIPAGTVVLLSIGAANRDPEVFP 335

Query: 176 DPLEFNPDRFAPENESKIVPGSYVPFGDGPRTCI 209
           DP EF+P+RF           +++PFG GP  C+
Sbjct: 336 DPDEFDPERF---------NNAHLPFGGGPHRCL 360


>gnl|CDD|215164 PLN02290, PLN02290, cytokinin trans-hydroxylase.
          Length = 516

 Score =  101 bits (254), Expect = 4e-25
 Identities = 62/179 (34%), Positives = 82/179 (45%), Gaps = 16/179 (8%)

Query: 32  NNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQ 91
           N      +N  N+N         +Q ++         GHE +    T  L  LA N   Q
Sbjct: 300 NEMEKKRSNGFNLN---------LQLIMDECKTFFFAGHETTALLLTWTLMLLASNPTWQ 350

Query: 92  DKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSP 151
           DK R EV +V   GG   + + L+K+T L  VI+E+LRLYP    L R   +D  + D  
Sbjct: 351 DKVRAEVAEVC--GGETPSVDHLSKLTLLNMVINESLRLYPPATLLPRMAFEDIKLGDL- 407

Query: 152 HVIQKGVLVHIPTYALQTDATLW-SDPLEFNPDRFAPENESKIVPGSYVPFGDGPRTCI 209
             I KG+ + IP  A+     LW  D  EFNPDRFA           ++PF  GPR CI
Sbjct: 408 -HIPKGLSIWIPVLAIHHSEELWGKDANEFNPDRFA--GRPFAPGRHFIPFAAGPRNCI 463


>gnl|CDD|178524 PLN02936, PLN02936, epsilon-ring hydroxylase.
          Length = 489

 Score = 84.8 bits (210), Expect = 3e-19
 Identities = 48/149 (32%), Positives = 75/149 (50%), Gaps = 8/149 (5%)

Query: 65  LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVI 124
           ++V GHE + S  T  L+ L+ N E   KA+ E+ +V    G   TYED+ ++ YL + I
Sbjct: 286 MLVAGHETTGSVLTWTLYLLSKNPEALRKAQEELDRVL--QGRPPTYEDIKELKYLTRCI 343

Query: 125 SEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDR 184
           +E++RLYP    L R    +  +P   + +  G  + I  Y +     +W    EF P+R
Sbjct: 344 NESMRLYPHPPVLIRRAQVEDVLPGG-YKVNAGQDIMISVYNIHRSPEVWERAEEFVPER 402

Query: 185 FAPE----NESKIVPGSYVPFGDGPRTCI 209
           F  +    NE+      Y+PF  GPR C+
Sbjct: 403 FDLDGPVPNETN-TDFRYIPFSGGPRKCV 430


>gnl|CDD|215393 PLN02738, PLN02738, carotene beta-ring hydroxylase.
          Length = 633

 Score = 83.8 bits (207), Expect = 1e-18
 Identities = 48/148 (32%), Positives = 72/148 (48%), Gaps = 7/148 (4%)

Query: 65  LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVI 124
           +++ GHE S +  T   + L+    V  K + EV  V   G    T ED+ K+ Y  +VI
Sbjct: 399 MLIAGHETSAAVLTWTFYLLSKEPSVVAKLQEEVDSVL--GDRFPTIEDMKKLKYTTRVI 456

Query: 125 SEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDR 184
           +E+LRLYP    L R   ++  +   P  I++G  + I  + L      W D  +FNP+R
Sbjct: 457 NESLRLYPQPPVLIRRSLENDMLGGYP--IKRGEDIFISVWNLHRSPKHWDDAEKFNPER 514

Query: 185 F---APENESKIVPGSYVPFGDGPRTCI 209
           +    P         SY+PFG GPR C+
Sbjct: 515 WPLDGPNPNETNQNFSYLPFGGGPRKCV 542


>gnl|CDD|215171 PLN02302, PLN02302, ent-kaurenoic acid oxidase.
          Length = 490

 Score = 77.1 bits (190), Expect = 1e-16
 Identities = 47/143 (32%), Positives = 71/143 (49%), Gaps = 8/143 (5%)

Query: 69  GHENSTSTSTNVLHELAYNQEVQDKARREVQKVYK---EGGGKVTYEDLAKMTYLEQVIS 125
           GHE+S   +      L  + EV  KA+ E +++ K    G   +T +D+ KM YL QVI 
Sbjct: 299 GHESSGHLTMWATIFLQEHPEVLQKAKAEQEEIAKKRPPGQKGLTLKDVRKMEYLSQVID 358

Query: 126 EALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRF 185
           E LRL  +   +FRE   D  +  + + I KG  V      +  D  ++ +P EF+P R+
Sbjct: 359 ETLRLINISLTVFREAKTDVEV--NGYTIPKGWKVLAWFRQVHMDPEVYPNPKEFDPSRW 416

Query: 186 APENESKIVPGSYVPFGDGPRTC 208
                 K   G+++PFG G R C
Sbjct: 417 D-NYTPK--AGTFLPFGLGSRLC 436


>gnl|CDD|215371 PLN02687, PLN02687, flavonoid 3'-monooxygenase.
          Length = 517

 Score = 76.8 bits (189), Expect = 2e-16
 Identities = 54/156 (34%), Positives = 76/156 (48%), Gaps = 21/156 (13%)

Query: 65  LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGK-VTYEDLAKMTYLEQV 123
           L   G + ++ST    + EL  + ++  KA+ E+  V   G  + V+  DL ++TYL+ V
Sbjct: 305 LFTAGTDTTSSTVEWAIAELIRHPDILKKAQEELDAVV--GRDRLVSESDLPQLTYLQAV 362

Query: 124 ISEALRLYP-----LGNGLFRECTQD-YAIPDSPHVIQKGVLVHIPTYALQTDATLWSDP 177
           I E  RL+P     L      EC  + Y IP       KG  + +  +A+  D   W DP
Sbjct: 363 IKETFRLHPSTPLSLPRMAAEECEINGYHIP-------KGATLLVNVWAIARDPEQWPDP 415

Query: 178 LEFNPDRFAP--ENESKIVPGS---YVPFGDGPRTC 208
           LEF PDRF P  E+    V GS    +PFG G R C
Sbjct: 416 LEFRPDRFLPGGEHAGVDVKGSDFELIPFGAGRRIC 451


>gnl|CDD|215354 PLN02655, PLN02655, ent-kaurene oxidase.
          Length = 466

 Score = 75.5 bits (186), Expect = 5e-16
 Identities = 46/139 (33%), Positives = 63/139 (45%), Gaps = 15/139 (10%)

Query: 76  TSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLY---P 132
           T+   ++ELA N + Q++  RE+++V   G  +VT EDL  + YL  V  E LR Y   P
Sbjct: 281 TTEWAMYELAKNPDKQERLYREIREVC--GDERVTEEDLPNLPYLNAVFHETLRKYSPVP 338

Query: 133 LGNGLF-RECTQ--DYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPEN 189
           L    F  E T    Y IP        G  + I  Y    D   W +P E++P+RF  E 
Sbjct: 339 LLPPRFVHEDTTLGGYDIP-------AGTQIAINIYGCNMDKKRWENPEEWDPERFLGEK 391

Query: 190 ESKIVPGSYVPFGDGPRTC 208
                    + FG G R C
Sbjct: 392 YESADMYKTMAFGAGKRVC 410


>gnl|CDD|173595 PTZ00404, PTZ00404, cytochrome P450; Provisional.
          Length = 482

 Score = 75.1 bits (185), Expect = 8e-16
 Identities = 41/153 (26%), Positives = 70/153 (45%), Gaps = 7/153 (4%)

Query: 58  VVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKM 117
           +++ +    + G + S ++   ++  L    E+Q+KA  E++     G  KV   D    
Sbjct: 284 ILATILDFFLAGVDTSATSLEWMVLMLCNYPEIQEKAYNEIKSTVN-GRNKVLLSDRQST 342

Query: 118 TYLEQVISEALRLYPLGN-GLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSD 176
            Y   +I E LR  P+   GL R  + D  I    H I K   + I  Y+L  +   + +
Sbjct: 343 PYTVAIIKETLRYKPVSPFGLPRSTSNDIIIGGG-HFIPKDAQILINYYSLGRNEKYFEN 401

Query: 177 PLEFNPDRFAPENESKIVPGSYVPFGDGPRTCI 209
           P +F+P RF    +S     +++PF  GPR C+
Sbjct: 402 PEQFDPSRFL-NPDS---NDAFMPFSIGPRNCV 430


>gnl|CDD|215086 PLN00168, PLN00168, Cytochrome P450; Provisional.
          Length = 519

 Score = 72.7 bits (178), Expect = 5e-15
 Identities = 44/145 (30%), Positives = 72/145 (49%), Gaps = 10/145 (6%)

Query: 72  NSTSTSTN-VLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRL 130
           ++TST+   ++ EL  N  +Q K   E++    +   +V+ ED+ KM YL+ V+ E LR 
Sbjct: 320 DTTSTALQWIMAELVKNPSIQSKLHDEIKAKTGDDQEEVSEEDVHKMPYLKAVVLEGLRK 379

Query: 131 YPLGNGLF-RECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAP-- 187
           +P  + +   +  +D  +    ++I KG  V+     +  D   W  P+EF P+RF    
Sbjct: 380 HPPAHFVLPHKAAEDMEV--GGYLIPKGATVNFMVAEMGRDEREWERPMEFVPERFLAGG 437

Query: 188 ENESKIVPGS----YVPFGDGPRTC 208
           + E   V GS     +PFG G R C
Sbjct: 438 DGEGVDVTGSREIRMMPFGVGRRIC 462


>gnl|CDD|215583 PLN03112, PLN03112, cytochrome P450 family protein; Provisional.
          Length = 514

 Score = 71.0 bits (174), Expect = 2e-14
 Identities = 44/144 (30%), Positives = 62/144 (43%), Gaps = 13/144 (9%)

Query: 74  TSTSTN--VLHELAYNQEVQDKARREVQKVYKEGGGK-VTYEDLAKMTYLEQVISEALRL 130
           TS  TN   + E+  N  V  K + E+  V   G  + V   DL  + YL  V+ E  R+
Sbjct: 311 TSAVTNEWAMAEVIKNPRVLRKIQEELDSVV--GRNRMVQESDLVHLNYLRCVVRETFRM 368

Query: 131 YPLGNGLF-RECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPEN 189
           +P G  L   E  +   I  + + I     V I T+ L  +  +W D  EF P+R  P  
Sbjct: 369 HPAGPFLIPHESLRATTI--NGYYIPAKTRVFINTHGLGRNTKIWDDVEEFRPERHWPAE 426

Query: 190 ESK--IVPGS---YVPFGDGPRTC 208
            S+  I  G     +PF  G R C
Sbjct: 427 GSRVEISHGPDFKILPFSAGKRKC 450


>gnl|CDD|177725 PLN00110, PLN00110, flavonoid 3',5'-hydroxylase (F3'5'H);
           Provisional.
          Length = 504

 Score = 65.6 bits (160), Expect = 1e-12
 Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 22/180 (12%)

Query: 47  PNFLGVIRM-QEVVSGVFV-----------LIVGGHENSTSTSTNVLHELAYNQEVQDKA 94
           P+FL V+   QE  +G  +           L   G + S+S     L E+  N  +  +A
Sbjct: 267 PDFLDVVMANQENSTGEKLTLTNIKALLLNLFTAGTDTSSSVIEWSLAEMLKNPSILKRA 326

Query: 95  RREVQKVYKEGGGKVTYE-DLAKMTYLEQVISEALRLYP-LGNGLFRECTQDYAIPDSPH 152
             E+ +V   G  +   E DL K+ YL+ +  E+ R +P     L R  TQ  A   + +
Sbjct: 327 HEEMDQVI--GRNRRLVESDLPKLPYLQAICKESFRKHPSTPLNLPRVSTQ--ACEVNGY 382

Query: 153 VIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPENESKIVPG----SYVPFGDGPRTC 208
            I K   + +  +A+  D  +W +P EF P+RF  E  +KI P       +PFG G R C
Sbjct: 383 YIPKNTRLSVNIWAIGRDPDVWENPEEFRPERFLSEKNAKIDPRGNDFELIPFGAGRRIC 442


>gnl|CDD|215235 PLN02426, PLN02426, cytochrome P450, family 94, subfamily C
           protein.
          Length = 502

 Score = 65.5 bits (160), Expect = 2e-12
 Identities = 48/159 (30%), Positives = 70/159 (44%), Gaps = 17/159 (10%)

Query: 58  VVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKM 117
           VVS  F+L   G +   S  T+    L+ + EV    R E  +V        ++E++ +M
Sbjct: 298 VVS--FLL--AGRDTVASALTSFFWLLSKHPEVASAIREEADRVMGPNQEAASFEEMKEM 353

Query: 118 TYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLW-SD 176
            YL   + E++RL+P      +   +D  +PD   V  KG  V    YA+     +W  D
Sbjct: 354 HYLHAALYESMRLFPPVQFDSKFAAEDDVLPDGTFV-AKGTRVTYHPYAMGRMERIWGPD 412

Query: 177 PLEFNPDR------FAPENESKIVPGSYVPFGDGPRTCI 209
            LEF P+R      F PEN  K     Y  F  G R C+
Sbjct: 413 CLEFKPERWLKNGVFVPENPFK-----YPVFQAGLRVCL 446


>gnl|CDD|165828 PLN02183, PLN02183, ferulate 5-hydroxylase.
          Length = 516

 Score = 64.5 bits (157), Expect = 4e-12
 Identities = 48/154 (31%), Positives = 67/154 (43%), Gaps = 27/154 (17%)

Query: 68  GGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYE-DLAKMTYLEQVISE 126
           GG E   S     + EL  + E   + ++E+  V   G  +   E DL K+TYL+  + E
Sbjct: 315 GGTETVASAIEWAMAELMKSPEDLKRVQQELADVV--GLNRRVEESDLEKLTYLKCTLKE 372

Query: 127 ALRLYPLGNGLFRECTQD-----YAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFN 181
            LRL+P    L  E  +D     Y IP    V+       I  +A+  D   W DP  F 
Sbjct: 373 TLRLHPPIPLLLHETAEDAEVAGYFIPKRSRVM-------INAWAIGRDKNSWEDPDTFK 425

Query: 182 PDRF----APENESKIVPGS---YVPFGDGPRTC 208
           P RF     P+       GS   ++PFG G R+C
Sbjct: 426 PSRFLKPGVPD-----FKGSHFEFIPFGSGRRSC 454


>gnl|CDD|177826 PLN02169, PLN02169, fatty acid (omega-1)-hydroxylase/midchain
           alkane hydroxylase.
          Length = 500

 Score = 63.9 bits (155), Expect = 5e-12
 Identities = 44/151 (29%), Positives = 72/151 (47%), Gaps = 11/151 (7%)

Query: 62  VFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLE 121
           +F L++ G + ++S  T     L+ + +V  K R E+   +         EDL K+ YL 
Sbjct: 306 IFSLVLAGRDTTSSALTWFFWLLSKHPQVMAKIRHEINTKFDN-------EDLEKLVYLH 358

Query: 122 QVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLW-SDPLEF 180
             +SE++RLYP      +   +   +P S H +     + I  YAL    ++W  D L+F
Sbjct: 359 AALSESMRLYPPLPFNHKAPAKPDVLP-SGHKVDAESKIVICIYALGRMRSVWGEDALDF 417

Query: 181 NPDRFAPENESKIVPGSY--VPFGDGPRTCI 209
            P+R+  +N       SY  + F  GPRTC+
Sbjct: 418 KPERWISDNGGLRHEPSYKFMAFNSGPRTCL 448


>gnl|CDD|178373 PLN02774, PLN02774, brassinosteroid-6-oxidase.
          Length = 463

 Score = 63.6 bits (155), Expect = 5e-12
 Identities = 42/156 (26%), Positives = 76/156 (48%), Gaps = 8/156 (5%)

Query: 56  QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVY--KEGGGKVTYED 113
           +E++  +  ++  G+E  ++TS   +  L  + +   + R+E   +   K     + + D
Sbjct: 263 EEIIDQIITILYSGYETVSTTSMMAVKYLHDHPKALQELRKEHLAIRERKRPEDPIDWND 322

Query: 114 LAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATL 173
              M +   VI E  RL  + NG+ R+ TQD  +  + +VI KG  +++ T  +  D  L
Sbjct: 323 YKSMRFTRAVIFETSRLATIVNGVLRKTTQDMEL--NGYVIPKGWRIYVYTREINYDPFL 380

Query: 174 WSDPLEFNPDRFAPEN-ESKIVPGSYVPFGDGPRTC 208
           + DP+ FNP R+  ++ ES      +  FG G R C
Sbjct: 381 YPDPMTFNPWRWLDKSLESH---NYFFLFGGGTRLC 413


>gnl|CDD|215221 PLN02394, PLN02394, trans-cinnamate 4-monooxygenase.
          Length = 503

 Score = 62.4 bits (152), Expect = 2e-11
 Identities = 45/136 (33%), Positives = 64/136 (47%), Gaps = 19/136 (13%)

Query: 83  ELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLY-------PLGN 135
           EL  + E+Q K R E+  V   G  +VT  D  K+ YL+ V+ E LRL+       P  N
Sbjct: 319 ELVNHPEIQKKLRDELDTVLGPGN-QVTEPDTHKLPYLQAVVKETLRLHMAIPLLVPHMN 377

Query: 136 GLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPEN---ESK 192
            L       Y IP      +  +LV+   + L  +  LW +P EF P+RF  E    E+ 
Sbjct: 378 -LEDAKLGGYDIPA-----ESKILVN--AWWLANNPELWKNPEEFRPERFLEEEAKVEAN 429

Query: 193 IVPGSYVPFGDGPRTC 208
                ++PFG G R+C
Sbjct: 430 GNDFRFLPFGVGRRSC 445


>gnl|CDD|178550 PLN02966, PLN02966, cytochrome P450 83A1.
          Length = 502

 Score = 61.3 bits (148), Expect = 4e-11
 Identities = 45/148 (30%), Positives = 73/148 (49%), Gaps = 6/148 (4%)

Query: 65  LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGK-VTYEDLAKMTYLEQV 123
           ++V G + + +     +  L    +V  KA+ EV++  KE G   VT +D+  + Y   +
Sbjct: 297 IVVAGTDTAAAAVVWGMTYLMKYPQVLKKAQAEVREYMKEKGSTFVTEDDVKNLPYFRAL 356

Query: 124 ISEALRLYPLGNGLF-RECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWS-DPLEFN 181
           + E LR+ P+   L  R C QD  I  + + I  G  V++  +A+  D   W  +P EF 
Sbjct: 357 VKETLRIEPVIPLLIPRACIQDTKI--AGYDIPAGTTVNVNAWAVSRDEKEWGPNPDEFR 414

Query: 182 PDRF-APENESKIVPGSYVPFGDGPRTC 208
           P+RF   E + K     ++PFG G R C
Sbjct: 415 PERFLEKEVDFKGTDYEFIPFGSGRRMC 442


>gnl|CDD|178773 PLN03234, PLN03234, cytochrome P450 83B1; Provisional.
          Length = 499

 Score = 61.2 bits (148), Expect = 5e-11
 Identities = 47/158 (29%), Positives = 75/158 (47%), Gaps = 8/158 (5%)

Query: 56  QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLA 115
           + V + +  ++V G + + +     +  L    E   KA+ EV+ V  + G  V+ ED+ 
Sbjct: 287 ENVKAMILDIVVPGTDTAAAVVVWAMTYLIKYPEAMKKAQDEVRNVIGDKG-YVSEEDIP 345

Query: 116 KMTYLEQVISEALRLYPLGNGLF-RECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLW 174
            + YL+ VI E+LRL P+   L  RE   D  I    + I    ++ +  +A+  D   W
Sbjct: 346 NLPYLKAVIKESLRLEPVIPILLHRETIADAKI--GGYDIPAKTIIQVNAWAVSRDTAAW 403

Query: 175 SD-PLEFNPDRFAPENESKIVPGS---YVPFGDGPRTC 208
            D P EF P+RF  E++     G     +PFG G R C
Sbjct: 404 GDNPNEFIPERFMKEHKGVDFKGQDFELLPFGSGRRMC 441


>gnl|CDD|177847 PLN02196, PLN02196, abscisic acid 8'-hydroxylase.
          Length = 463

 Score = 59.6 bits (144), Expect = 2e-10
 Identities = 47/158 (29%), Positives = 80/158 (50%), Gaps = 14/158 (8%)

Query: 56  QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVY--KEGGGKVTYED 113
           +++   +  +I    + + S  T +L  LA N  V +    E   +   KE G  +T+ED
Sbjct: 263 EQIADNIIGVIFAARDTTASVLTWILKYLAENPSVLEAVTEEQMAIRKDKEEGESLTWED 322

Query: 114 LAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYA-LQTDAT 172
             KM    +VI E LR+  + +  FRE  +D  +    ++I KG  V +P +  +   A 
Sbjct: 323 TKKMPLTSRVIQETLRVASILSFTFREAVED--VEYEGYLIPKGWKV-LPLFRNIHHSAD 379

Query: 173 LWSDPLEFNPDRF--APENESKIVPGSYVPFGDGPRTC 208
           ++SDP +F+P RF  AP+      P +++PFG+G  +C
Sbjct: 380 IFSDPGKFDPSRFEVAPK------PNTFMPFGNGTHSC 411


>gnl|CDD|166628 PLN02987, PLN02987, Cytochrome P450, family 90, subfamily A.
          Length = 472

 Score = 58.8 bits (142), Expect = 2e-10
 Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 6/156 (3%)

Query: 56  QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKV--YKEGGGKVTYED 113
           +E+V  +  L+V G+E +++  T  +  L        + + E +K+   K     + + D
Sbjct: 266 EEIVDFLVALLVAGYETTSTIMTLAVKFLTETPLALAQLKEEHEKIRAMKSDSYSLEWSD 325

Query: 114 LAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATL 173
              M + + V++E LR+  +  G+FR    D  +    + I KG  V     A+  D   
Sbjct: 326 YKSMPFTQCVVNETLRVANIIGGIFRRAMTDIEVKG--YTIPKGWKVFASFRAVHLDHEY 383

Query: 174 WSDPLEFNPDRFAPENESKIVPGS-YVPFGDGPRTC 208
           + D   FNP R+   N    VP + + PFG GPR C
Sbjct: 384 FKDARTFNPWRWQ-SNSGTTVPSNVFTPFGGGPRLC 418


>gnl|CDD|215276 PLN02500, PLN02500, cytochrome P450 90B1.
          Length = 490

 Score = 58.7 bits (142), Expect = 3e-10
 Identities = 48/175 (27%), Positives = 76/175 (43%), Gaps = 35/175 (20%)

Query: 56  QEVVSGVFVLIVGGHENST----------STSTNVLHELAYNQEVQDKARREVQKVYKEG 105
           ++++  +  L+  GHE S+                + EL   +E  + AR + Q     G
Sbjct: 278 EQILDLILSLLFAGHETSSVAIALAIFFLQGCPKAVQEL--REEHLEIARAKKQS----G 331

Query: 106 GGKVTYEDLAKMTYLEQVISEALRLYPLGNGLFRECTQD-----YAIPDSPHVIQKGVLV 160
             ++ +ED  KM + + VI+E LRL  +   L R+  +D     Y IP    V+     V
Sbjct: 332 ESELNWEDYKKMEFTQCVINETLRLGNVVRFLHRKALKDVRYKGYDIPSGWKVLPVIAAV 391

Query: 161 HIPTYALQTDATLWSDPLEFNPDRFAPENESKIVPGS-------YVPFGDGPRTC 208
           H+       D++L+  P  FNP R+   N      GS       ++PFG GPR C
Sbjct: 392 HL-------DSSLYDQPQLFNPWRWQQNNNRGGSSGSSSATTNNFMPFGGGPRLC 439


>gnl|CDD|215627 PLN03195, PLN03195, fatty acid omega-hydroxylase; Provisional.
          Length = 516

 Score = 56.7 bits (137), Expect = 1e-09
 Identities = 41/168 (24%), Positives = 72/168 (42%), Gaps = 28/168 (16%)

Query: 66  IVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGK----------------- 108
           ++ G + + +T +  ++ +  N  V +K   E++ + KE   +                 
Sbjct: 301 VIAGRDTTATTLSWFVYMIMMNPHVAEKLYSELKALEKERAKEEDPEDSQSFNQRVTQFA 360

Query: 109 --VTYEDLAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYA 166
             +TY+ L K+ YL  VI+E LRLYP      +   +D  +PD    ++ G +V    Y+
Sbjct: 361 GLLTYDSLGKLQYLHAVITETLRLYPAVPQDPKGILEDDVLPDGT-KVKAGGMVTYVPYS 419

Query: 167 LQTDATLW-SDPLEFNPDRFAPE----NESKIVPGSYVPFGDGPRTCI 209
           +      W  D   F P+R+  +    N S   P  +  F  GPR C+
Sbjct: 420 MGRMEYNWGPDAASFKPERWIKDGVFQNAS---PFKFTAFQAGPRICL 464


>gnl|CDD|215350 PLN02648, PLN02648, allene oxide synthase.
          Length = 480

 Score = 53.4 bits (129), Expect = 2e-08
 Identities = 43/130 (33%), Positives = 64/130 (49%), Gaps = 18/130 (13%)

Query: 87  NQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLYP---LGNGLFRECTQ 143
            +E+Q +   EV+   K GGG VT+  L KM  ++ V+ EALR+ P      G  RE   
Sbjct: 303 GEELQARLAEEVRSAVKAGGGGVTFAALEKMPLVKSVVYEALRIEPPVPFQYGRARE--- 359

Query: 144 DYAIP--DSPHVIQKGVLVHIPTYALQ----TDATLWSDPLEFNPDRFAPENESKIVPGS 197
           D+ I   D+   I+KG ++    +  Q     D  ++  P EF PDRF  E   K++   
Sbjct: 360 DFVIESHDAAFEIKKGEML----FGYQPLVTRDPKVFDRPEEFVPDRFMGEEGEKLLK-- 413

Query: 198 YVPFGDGPRT 207
           YV + +G  T
Sbjct: 414 YVFWSNGRET 423


>gnl|CDD|166612 PLN02971, PLN02971, tryptophan N-hydroxylase.
          Length = 543

 Score = 53.1 bits (127), Expect = 2e-08
 Identities = 43/156 (27%), Positives = 68/156 (43%), Gaps = 23/156 (14%)

Query: 65  LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGK---VTYEDLAKMTYLE 121
           L++   +N ++     + E+    E+  KA  E+ +V     GK   V   D+ K+ Y++
Sbjct: 335 LVMAAPDNPSNAVEWAMAEMINKPEILHKAMEEIDRVV----GKERFVQESDIPKLNYVK 390

Query: 122 QVISEALRLYPLGN------GLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWS 175
            +I EA RL+P+         L       Y IP       KG  V +  Y L  +  +WS
Sbjct: 391 AIIREAFRLHPVAAFNLPHVALSDTTVAGYHIP-------KGSQVLLSRYGLGRNPKVWS 443

Query: 176 DPLEFNPDRFAPE-NESKIVPGS--YVPFGDGPRTC 208
           DPL F P+R   E +E  +      ++ F  G R C
Sbjct: 444 DPLSFKPERHLNECSEVTLTENDLRFISFSTGKRGC 479


>gnl|CDD|215600 PLN03141, PLN03141, 3-epi-6-deoxocathasterone 23-monooxygenase;
           Provisional.
          Length = 452

 Score = 51.3 bits (123), Expect = 8e-08
 Identities = 33/125 (26%), Positives = 60/125 (48%), Gaps = 7/125 (5%)

Query: 84  LAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLYPLGNGLFRECTQ 143
           +A  Q  ++  + +  K   + G  + + D   + + + VI+E LR+  + NG+ R+  +
Sbjct: 283 VALQQLTEENMKLKRLKA--DTGEPLYWTDYMSLPFTQNVITETLRMGNIINGVMRKAMK 340

Query: 144 DYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPENESKIVPGSYVPFGD 203
           D  I    ++I KG  V     ++  D   + +P +FNP R+    E  +   S+ PFG 
Sbjct: 341 DVEIKG--YLIPKGWCVLAYFRSVHLDEENYDNPYQFNPWRW---QEKDMNNSSFTPFGG 395

Query: 204 GPRTC 208
           G R C
Sbjct: 396 GQRLC 400


>gnl|CDD|178592 PLN03018, PLN03018, homomethionine N-hydroxylase.
          Length = 534

 Score = 42.3 bits (99), Expect = 1e-04
 Identities = 37/147 (25%), Positives = 64/147 (43%), Gaps = 12/147 (8%)

Query: 71  ENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYE-DLAKMTYLEQVISEALR 129
           +N  +     L E+  N E+  KA +E+ +V   G  ++  E D+  + YL+    E  R
Sbjct: 328 DNPANNMEWTLGEMLKNPEILRKALKELDEVV--GKDRLVQESDIPNLNYLKACCRETFR 385

Query: 130 LYPLGNGLFRECT-QDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRF--- 185
           ++P  + +      QD  +    + I KG  +H+    L  +  +W DPL + P+R    
Sbjct: 386 IHPSAHYVPPHVARQDTTL--GGYFIPKGSHIHVCRPGLGRNPKIWKDPLVYEPERHLQG 443

Query: 186 -APENESKIVPGS--YVPFGDGPRTCI 209
                E  +V     +V F  G R C+
Sbjct: 444 DGITKEVTLVETEMRFVSFSTGRRGCV 470


>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
          Provisional.
          Length = 522

 Score = 37.8 bits (88), Expect = 0.003
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 24 TTTTNNNNNNNNNNNNNKNNINSPN 48
           +  NN ++N NNNNNN N INS N
Sbjct: 8  LSVHNNADDNYNNNNNNNNQINSNN 32



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 22 TTTTTTNNNNNNNNNNNNNKNNINSPNFLG 51
           +     ++N NNNNNNNN+ N N+PN  G
Sbjct: 8  LSVHNNADDNYNNNNNNNNQINSNNPNNNG 37



 Score = 37.0 bits (86), Expect = 0.005
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 21 TTTTTTTNNNNNNNNNNNNNKNNINSPN 48
           +     ++N NNNNNNNN  N+ N  N
Sbjct: 8  LSVHNNADDNYNNNNNNNNQINSNNPNN 35



 Score = 35.9 bits (83), Expect = 0.012
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 20 TTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
           +      +N NNNNNNNN   +N  + N
Sbjct: 8  LSVHNNADDNYNNNNNNNNQINSNNPNNN 36



 Score = 35.9 bits (83), Expect = 0.012
 Identities = 13/32 (40%), Positives = 15/32 (46%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
           +         NNNNNNNN  N+N  N N  N
Sbjct: 8  LSVHNNADDNYNNNNNNNNQINSNNPNNNGNN 39



 Score = 32.4 bits (74), Expect = 0.15
 Identities = 12/32 (37%), Positives = 15/32 (46%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
          +          NNNNNNN  N+NN NN  +  
Sbjct: 9  SVHNNADDNYNNNNNNNNQINSNNPNNNGNNQ 40



 Score = 29.3 bits (66), Expect = 1.9
 Identities = 12/32 (37%), Positives = 13/32 (40%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
                     NNN  N+NN NNN NN  S  
Sbjct: 13 NADDNYNNNNNNNNQINSNNPNNNGNNQASKL 44


>gnl|CDD|219312 pfam07150, DUF1390, Protein of unknown function (DUF1390).  This
          family consists of several Paramecium bursaria
          chlorella virus 1 (PBCV-1) proteins of around 250
          residues in length. The function of this family is
          unknown.
          Length = 229

 Score = 35.1 bits (81), Expect = 0.016
 Identities = 12/29 (41%), Positives = 13/29 (44%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNIN 45
           TT +   T   NNN NN N   N  NI 
Sbjct: 53 ATTVSNINTGVVNNNVNNINKTINTKNIT 81



 Score = 30.4 bits (69), Expect = 0.55
 Identities = 10/29 (34%), Positives = 11/29 (37%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNIN 45
          T +   T    NN NN N   N     IN
Sbjct: 55 TVSNINTGVVNNNVNNINKTINTKNITIN 83



 Score = 29.7 bits (67), Expect = 1.0
 Identities = 10/42 (23%), Positives = 14/42 (33%)

Query: 19 TTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVS 60
            TT +       NNN NN N   N  +      +  +  V 
Sbjct: 52 LATTVSNINTGVVNNNVNNINKTINTKNITINLTVPDKSTVE 93


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
          short domain found in bacterial type II/III secretory
          system proteins. The architecture of these proteins
          suggest that this family may be functionally analogous
          to pfam03958.
          Length = 95

 Score = 32.7 bits (75), Expect = 0.029
 Identities = 6/30 (20%), Positives = 23/30 (76%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
          T+ T+ + +++ +N++++++N++N  + +S
Sbjct: 19 TSVTSGSVSSSGSNSSSSSSNSSNGGSSSS 48



 Score = 31.5 bits (72), Expect = 0.072
 Identities = 5/31 (16%), Positives = 22/31 (70%)

Query: 18 TTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
          T+ T+ + +++ +N++++++N++    +S +
Sbjct: 19 TSVTSGSVSSSGSNSSSSSSNSSNGGSSSSS 49



 Score = 26.9 bits (60), Expect = 2.9
 Identities = 2/30 (6%), Positives = 21/30 (70%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
          + + +++ + ++++++N++N  ++ ++ + 
Sbjct: 23 SGSVSSSGSNSSSSSSNSSNGGSSSSSSSG 52


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 34.0 bits (77), Expect = 0.049
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 4/36 (11%)

Query: 17  TTTTTTTTTTTNNNNNN----NNNNNNNKNNINSPN 48
             T T T T TNN N +    N N N N N   + N
Sbjct: 315 NKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSN 350



 Score = 33.2 bits (75), Expect = 0.080
 Identities = 12/27 (44%), Positives = 15/27 (55%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNN 43
           T  T T+T + N N N N+N N N N 
Sbjct: 325 TNNTNTSTPSKNTNTNTNSNTNTNSNT 351



 Score = 33.2 bits (75), Expect = 0.082
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
            T  T T+T + N N N N+N N  +N N+
Sbjct: 324 NTNNTNTSTPSKNTNTNTNSNTNTNSNTNA 353



 Score = 32.9 bits (74), Expect = 0.11
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
           T T+T +  T  N N+N N N+N   N  S N
Sbjct: 328 TNTSTPSKNTNTNTNSNTNTNSNTNANQGSSN 359



 Score = 32.5 bits (73), Expect = 0.17
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
            T+T +  T TN N+N N N+N N N  +S N
Sbjct: 329 NTSTPSKNTNTNTNSNTNTNSNTNANQGSSNN 360



 Score = 31.7 bits (71), Expect = 0.29
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
            T T  T T+T + N N N N+N   N N+
Sbjct: 322 NTNTNNTNTSTPSKNTNTNTNSNTNTNSNT 351



 Score = 31.7 bits (71), Expect = 0.30
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
           T T  T T+T + N N N N+N N N+  + N
Sbjct: 323 TNTNNTNTSTPSKNTNTNTNSNTNTNSNTNAN 354



 Score = 29.8 bits (66), Expect = 1.1
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 18  TTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
           +  T T T +N N N+N N N   +N NS
Sbjct: 334 SKNTNTNTNSNTNTNSNTNANQGSSNNNS 362



 Score = 29.0 bits (64), Expect = 2.2
 Identities = 11/37 (29%), Positives = 19/37 (51%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVI 53
           +  T T T +  N N+N N N  + NN ++ +   +I
Sbjct: 334 SKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSASAII 370



 Score = 28.6 bits (63), Expect = 3.1
 Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 6/37 (16%)

Query: 18  TTTTTTTTTTNNNNNNNNNNN------NNKNNINSPN 48
           +T T    T  N N N NN N      N   N NS  
Sbjct: 309 STNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNT 345


>gnl|CDD|151307 pfam10858, DUF2659, Protein of unknown function (DUF2659).  This
           bacterial family of proteins has no known function.
          Length = 220

 Score = 33.4 bits (76), Expect = 0.063
 Identities = 26/92 (28%), Positives = 35/92 (38%), Gaps = 11/92 (11%)

Query: 6   IFF-FYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNN---INSPNFLGVIRMQEVVSG 61
           IFF   L I+   +    T    NNNN      NN KN    + S +        E    
Sbjct: 19  IFFKKLLPIVIIISLIAITIMVINNNNKARQIANNQKNGDIFVKSVD-------LESTKN 71

Query: 62  VFVLIVGGHENSTSTSTNVLHELAYNQEVQDK 93
              L V   EN  S S   + E+A+ ++V  K
Sbjct: 72  NSELAVQTLENLVSNSNTKIQEIAHLEQVAIK 103


>gnl|CDD|214408 MTH00036, ATP8, ATP synthase F0 subunit 8; Validated.
          Length = 54

 Score = 30.0 bits (68), Expect = 0.11
 Identities = 10/33 (30%), Positives = 19/33 (57%)

Query: 11 LFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNN 43
          L  + FT        +TN +N+++N +++N NN
Sbjct: 19 LVFIVFTYLLLNNWPSTNASNSSSNTSSSNNNN 51



 Score = 30.0 bits (68), Expect = 0.13
 Identities = 8/34 (23%), Positives = 17/34 (50%)

Query: 6  IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNN 39
             F +F         +T  + +++N +++NNNN
Sbjct: 18 SLVFIVFTYLLLNNWPSTNASNSSSNTSSSNNNN 51



 Score = 26.2 bits (58), Expect = 2.9
 Identities = 9/42 (21%), Positives = 18/42 (42%)

Query: 5  GIFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
             F   + L F   T        + N +N+++N + +N N+
Sbjct: 10 LFNFLLAWSLVFIVFTYLLLNNWPSTNASNSSSNTSSSNNNN 51


>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
           Provisional.
          Length = 1466

 Score = 32.7 bits (74), Expect = 0.14
 Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 7/71 (9%)

Query: 24  TTTTNNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLIVGGHENSTSTSTNVLHE 83
           T     NNN NN ++NN NN N+ N +         +G +++  G H+        + + 
Sbjct: 671 TKDNKENNNKNNKDDNNNNNNNNNNKIN-------NAGSYIIEQGTHDALMKNKNGIYYT 723

Query: 84  LAYNQEVQDKA 94
           +  NQ+V  K 
Sbjct: 724 MINNQKVSSKK 734


>gnl|CDD|214445 MTH00166, ND6, NADH dehydrogenase subunit 6; Provisional.
          Length = 160

 Score = 30.9 bits (71), Expect = 0.25
 Identities = 9/54 (16%), Positives = 15/54 (27%), Gaps = 4/54 (7%)

Query: 1   MFTPGIFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNI----NSPNFL 50
                       ++           +  +N NNN N   N  ++    N P  L
Sbjct: 76  FKFSLKLLILFLLMLLLMLNNFIKNSEMSNFNNNFNFFENSFSLSKLYNFPTML 129


>gnl|CDD|177251 MTH00208, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 628

 Score = 31.9 bits (73), Expect = 0.26
 Identities = 17/59 (28%), Positives = 30/59 (50%)

Query: 8  FFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLI 66
          F  L I  F ++     +T+N +   NNNNN+  N  NS   + +++    +S + +LI
Sbjct: 17 FIILLISIFFSSNLPNISTSNASLGGNNNNNSIINYNNSNFSISLLKTLAFLSLINLLI 75



 Score = 29.2 bits (66), Expect = 1.9
 Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 6  IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVI 53
          IFF        T+  +      NN+  N NN+N + + + +  FL +I
Sbjct: 24 IFFSSNLPNISTSNASLGGNNNNNSIINYNNSNFSISLLKTLAFLSLI 71



 Score = 26.9 bits (60), Expect = 9.9
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 6  IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKN 42
          I  F+   L   +T+  +    NNNN+  N NN+N +
Sbjct: 22 ISIFFSSNLPNISTSNASLGGNNNNNSIINYNNSNFS 58


>gnl|CDD|178602 PLN03034, PLN03034, phosphoglycerate kinase; Provisional.
          Length = 481

 Score = 31.5 bits (71), Expect = 0.35
 Identities = 12/20 (60%), Positives = 15/20 (75%)

Query: 183 DRFAPENESKIVPGSYVPFG 202
           D+FAP+  SKIVP S +P G
Sbjct: 350 DKFAPDANSKIVPASAIPDG 369


>gnl|CDD|205936 pfam13763, DUF4167, Domain of unknown function (DUF4167). 
          Length = 80

 Score = 29.2 bits (66), Expect = 0.40
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 28 NNNNNNNNNNNNNKNNINSPN 48
          NNN NNN +  N +N +   N
Sbjct: 5  NNNRNNNRSGGNPRNRVFDSN 25



 Score = 28.0 bits (63), Expect = 0.98
 Identities = 9/18 (50%), Positives = 10/18 (55%)

Query: 28 NNNNNNNNNNNNNKNNIN 45
            NNNN NNN +  N  N
Sbjct: 2  GRNNNNRNNNRSGGNPRN 19



 Score = 26.9 bits (60), Expect = 2.8
 Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 30 NNNNNNNNNNNKNNINSPN 48
             NNNN NNN++  N  N
Sbjct: 1  RGRNNNNRNNNRSGGNPRN 19



 Score = 26.5 bits (59), Expect = 3.9
 Identities = 9/22 (40%), Positives = 11/22 (50%)

Query: 28 NNNNNNNNNNNNNKNNINSPNF 49
             NNNN NNN +  N  +  F
Sbjct: 1  RGRNNNNRNNNRSGGNPRNRVF 22


>gnl|CDD|150787 pfam10154, DUF2362, Uncharacterized conserved protein (DUF2362).
           This is a family of proteins conserved from nematodes to
           humans. The function is not known.
          Length = 501

 Score = 31.3 bits (71), Expect = 0.41
 Identities = 5/21 (23%), Positives = 9/21 (42%)

Query: 23  TTTTTNNNNNNNNNNNNNKNN 43
             + T+ + NN    +N   N
Sbjct: 150 GKSLTDQDVNNLAAQHNENQN 170



 Score = 29.4 bits (66), Expect = 1.7
 Identities = 7/23 (30%), Positives = 12/23 (52%)

Query: 22  TTTTTTNNNNNNNNNNNNNKNNI 44
             + T  + NN    +N N+N+I
Sbjct: 150 GKSLTDQDVNNLAAQHNENQNSI 172


>gnl|CDD|219426 pfam07489, Tir_receptor_C, Translocated intimin receptor (Tir)
          C-terminus.  Intimin and its translocated intimin
          receptor (Tir) are bacterial proteins that mediate
          adhesion between mammalian cells and attaching and
          effacing (A/E) pathogens. A unique and essential
          feature of A/E bacterial pathogens is the formation of
          actin-rich pedestals beneath the intimately adherent
          bacteria and localised destruction of the intestinal
          brush border. The bacterial outer membrane adhesin,
          intimin, is necessary for the production of the A/E
          lesion and diarrhoea. The A/E bacteria translocate
          their own receptor for intimin, Tir, into the membrane
          of mammalian cells using the type III secretion system.
          The translocated Tir triggers additional host
          signalling events and actin nucleation, which are
          essential for lesion formation. This family represents
          the Tir C-terminal domain which has been reported to
          bind uninfected host cells and beta-1 integrins
          although the role of intimin binding to integrins is
          unclear. This intimin C-terminal domain has also been
          shown to be sufficient for Tir recognition.
          Length = 222

 Score = 30.7 bits (69), Expect = 0.44
 Identities = 14/31 (45%), Positives = 17/31 (54%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSP 47
          TTTTTTTTTT     N   NN   + N ++ 
Sbjct: 61 TTTTTTTTTTHTTVENKPANNTPAQGNTDTS 91



 Score = 29.1 bits (65), Expect = 1.5
 Identities = 14/22 (63%), Positives = 15/22 (68%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNN 38
          TTTTTTTTTTT+    N   NN
Sbjct: 60 TTTTTTTTTTTHTTVENKPANN 81



 Score = 27.2 bits (60), Expect = 5.8
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 18 TTTTTTTTTTNNNNNNNNNNNN 39
          TTTTTTTTTT +    N   NN
Sbjct: 60 TTTTTTTTTTTHTTVENKPANN 81


>gnl|CDD|214442 MTH00160, ND2, NADH dehydrogenase subunit 2; Provisional.
          Length = 335

 Score = 30.6 bits (70), Expect = 0.50
 Identities = 6/48 (12%), Positives = 17/48 (35%)

Query: 6   IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVI 53
           I  ++   + +++          N     N  +   N ++  +  G+I
Sbjct: 280 ITLYFYLRISYSSLMLNYNKNKWNLKKKKNKFSLILNFLSFISLFGLI 327



 Score = 30.2 bits (69), Expect = 0.66
 Identities = 12/50 (24%), Positives = 17/50 (34%), Gaps = 7/50 (14%)

Query: 1   MFTPGIFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFL 50
           M +    +FYL I +       ++   N N N  N            NFL
Sbjct: 276 MSSLITLYFYLRISY-------SSLMLNYNKNKWNLKKKKNKFSLILNFL 318



 Score = 28.3 bits (64), Expect = 3.3
 Identities = 8/39 (20%), Positives = 10/39 (25%)

Query: 7   FFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNIN 45
           F  Y  I                N   N+NN + K    
Sbjct: 197 FLIYSIISLSIILLFNKFNIFFLNQLFNSNNKSKKIKFL 235


>gnl|CDD|165923 PLN02282, PLN02282, phosphoglycerate kinase.
          Length = 401

 Score = 30.7 bits (69), Expect = 0.52
 Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 13/72 (18%)

Query: 131 YPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPENE 190
           Y +G+ L  E   D A         KGV + +PT     D  +         D+FAP+  
Sbjct: 236 YSVGSSLVEEDKLDLATSLIEKAKAKGVSLLLPT-----DVVI--------ADKFAPDAN 282

Query: 191 SKIVPGSYVPFG 202
           SK+VP S +P G
Sbjct: 283 SKVVPASAIPDG 294


>gnl|CDD|205235 pfam13054, DUF3915, Protein of unknown function (DUF3915).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 120 amino acids in length.
          Length = 116

 Score = 29.1 bits (65), Expect = 0.84
 Identities = 8/27 (29%), Positives = 17/27 (62%)

Query: 29  NNNNNNNNNNNNKNNINSPNFLGVIRM 55
              ++ +N +++KNN N   F G++R+
Sbjct: 82  AMGDDKDNKDDDKNNQNRNKFTGILRV 108


>gnl|CDD|218869 pfam06043, Reo_P9, Reovirus P9-like family. 
          Length = 333

 Score = 29.8 bits (67), Expect = 0.91
 Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 3/42 (7%)

Query: 25 TTTNNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLI 66
          TTTNN NN N N  NN N     N     R   +  G++ L+
Sbjct: 18 TTTNNQNNFNTNTQNNSNT---ENRNSTKRKPILDDGIYHLL 56


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
           family consists of several special lobe-specific silk
           protein SSP160 sequences which appear to be specific to
           Chironomus (Midge) species.
          Length = 758

 Score = 30.1 bits (67), Expect = 1.00
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNI 44
           +TT + +TTTTN+  + N+ +++N + I
Sbjct: 295 STTNSNSTTTTNSTTSTNSTSSSNSSTI 322



 Score = 29.7 bits (66), Expect = 1.3
 Identities = 9/32 (28%), Positives = 20/32 (62%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
           + + +T+ +TTN+N+    N+  + N+ +S N
Sbjct: 287 SNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSN 318



 Score = 29.4 bits (65), Expect = 1.9
 Identities = 9/37 (24%), Positives = 19/37 (51%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVI 53
            +T+ +TT + +    N+  + N+ ++ NS    G I
Sbjct: 290 NSTSNSTTNSNSTTTTNSTTSTNSTSSSNSSTIAGCI 326



 Score = 28.2 bits (62), Expect = 3.8
 Identities = 9/32 (28%), Positives = 20/32 (62%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
            +T+ + +TT+NN+  ++N+     N+ +S N
Sbjct: 114 NSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSN 145



 Score = 27.0 bits (59), Expect = 10.0
 Identities = 8/30 (26%), Positives = 20/30 (66%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
            +TT+  +TT++N+    +N+ ++ N+ +S
Sbjct: 120 NSTTSNNSTTSSNSTTTTSNSTSSSNSTSS 149


>gnl|CDD|201496 pfam00895, ATP-synt_8, ATP synthase protein 8. 
          Length = 54

 Score = 27.3 bits (61), Expect = 1.1
 Identities = 7/42 (16%), Positives = 9/42 (21%)

Query: 6  IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSP 47
          IF      L        ++ T  N               N P
Sbjct: 12 IFLSSWLTLLIIIQLKISSHTPPNPPTKKTTLKKKTTPWNWP 53


>gnl|CDD|168775 PRK07034, PRK07034, hypothetical protein; Provisional.
          Length = 536

 Score = 29.0 bits (64), Expect = 2.0
 Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 28  NNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVS 60
           N NNNN N +NNN N  N+ N   V RM   +S
Sbjct: 95  NGNNNNPNGDNNNPNGSNN-NIARVQRMPHGIS 126


>gnl|CDD|239175 cd02774, MopB_Res-Cmplx1_Nad11-M, MopB_Res_Cmplx1_Nad11_M:
           Mitochondrial-encoded NADH-quinone
           oxidoreductase/respiratory complex I, the second domain
           of the Nad11/75-kDa subunit of some protists.
           NADH-quinone oxidoreductase is the first
           energy-transducting complex in the respiratory chain and
           functions as a redox pump that uses the redox energy to
           translocate H+ ions across the membrane, resulting in a
           significant contribution to energy production. The nad11
           gene codes for the largest (75-kDa) subunit of the
           mitochondrial NADH-quinone oxidoreductase, it
           constitutes the electron input part of the enzyme, or
           the so-called NADH dehydrogenase fragment. The Nad11
           subunit is made of two domains: the first contains three
           binding sites for FeS clusters (the fer2 domain), the
           second domain (this CD), is of unknown function or, as
           postulated, has lost an ancestral formate dehydrogenase
           activity that became redundant during the evolution of
           the complex I enzyme. Although only vestigial sequence
           evidence remains of a molybdopterin binding site, this
           protein domain belongs to the molybdopterin_binding
           (MopB) superfamily of proteins.
          Length = 366

 Score = 28.5 bits (64), Expect = 2.5
 Identities = 12/41 (29%), Positives = 21/41 (51%)

Query: 28  NNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLIVG 68
            N N+NN   NNN  N++  N+L    ++ +      L++G
Sbjct: 116 LNTNSNNFLENNNYFNLDLENYLFNNSLKNLDKSDLCLLIG 156


>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439).  This
          presumed domain is functionally uncharacterized. This
          domain is found in eukaryotes. This domain is typically
          between 46 to 94 amino acids in length. This domain is
          found associated with pfam01462, pfam00560.
          Length = 122

 Score = 27.6 bits (61), Expect = 2.6
 Identities = 11/11 (100%), Positives = 11/11 (100%)

Query: 17 TTTTTTTTTTT 27
          TTTTTTTTTTT
Sbjct: 48 TTTTTTTTTTT 58



 Score = 27.2 bits (60), Expect = 3.2
 Identities = 11/11 (100%), Positives = 11/11 (100%)

Query: 17 TTTTTTTTTTT 27
          TTTTTTTTTTT
Sbjct: 47 TTTTTTTTTTT 57


>gnl|CDD|224119 COG1198, PriA, Primosomal protein N' (replication factor Y) -
           superfamily II helicase [DNA replication, recombination,
           and repair].
          Length = 730

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 8/20 (40%), Positives = 10/20 (50%)

Query: 144 DYAIPDSPHVIQKGVLVHIP 163
           DY IP+     Q G  V +P
Sbjct: 17  DYLIPEGLEPDQPGSRVRVP 36


>gnl|CDD|177303 PHA00735, PHA00735, hypothetical protein.
          Length = 808

 Score = 28.7 bits (64), Expect = 3.1
 Identities = 16/32 (50%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 17  TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
           T  TT+TTTT N +  N N      NNI SPN
Sbjct: 695 TQPTTSTTTTVNTSTLNYNYTQPFSNNI-SPN 725


>gnl|CDD|235670 PRK06009, flgD, flagellar basal body rod modification protein;
          Reviewed.
          Length = 140

 Score = 27.4 bits (61), Expect = 3.4
 Identities = 11/34 (32%), Positives = 24/34 (70%)

Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFL 50
          TT+ T+++TT++ +  +N+ +  NK ++N  +FL
Sbjct: 6  TTSATSSSTTSSTSTASNSTSAANKASVNYDSFL 39


>gnl|CDD|234766 PRK00447, PRK00447, hypothetical protein; Provisional.
          Length = 144

 Score = 27.3 bits (61), Expect = 3.6
 Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 2/56 (3%)

Query: 48  NFLGVIRMQEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYK 103
           NF    + +    G + ++VG  E  T  S   LH L   + + D AR+ + K  +
Sbjct: 26  NFFDFEKFEAEDEGEYKILVG--EARTLKSLQKLHRLLRGERILDTARKYLMKGIE 79


>gnl|CDD|187593 cd05332, 11beta-HSD1_like_SDR_c, 11beta-hydroxysteroid
           dehydrogenase type 1 (11beta-HSD1)-like, classical (c)
           SDRs.  Human 11beta_HSD1 catalyzes the NADP(H)-dependent
           interconversion of cortisone and cortisol. This subgroup
           also includes human dehydrogenase/reductase SDR family
           member 7C (DHRS7C) and DHRS7B. These proteins have the
           GxxxGxG nucleotide binding motif and S-Y-K catalytic
           triad characteristic of the SDRs, but have an atypical
           C-terminal domain that contributes to homodimerization
           contacts. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold (alpha/beta folding
           pattern with a central beta-sheet), an NAD(P)(H)-binding
           region, and a structurally diverse C-terminal region.
           Classical SDRs are typically about 250 residues long,
           while extended SDRs are approximately 350 residues.
           Sequence identity between different SDR enzymes are
           typically in the 15-30% range, but the enzymes share the
           Rossmann fold NAD-binding motif and characteristic
           NAD-binding and catalytic sequence patterns. These
           enzymes catalyze a wide range of activities including
           the metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase (15-PGDH)
           numbering). In addition to the Tyr and Lys, there is
           often an upstream Ser (Ser-138, 15-PGDH numbering)
           and/or an Asn (Asn-107, 15-PGDH numbering) contributing
           to the active site; while substrate binding is in the
           C-terminal region, which determines specificity. The
           standard reaction mechanism is a 4-pro-S hydride
           transfer and proton relay involving the conserved Tyr
           and Lys, a water molecule stabilized by Asn, and
           nicotinamide. Extended SDRs have additional elements in
           the C-terminal region, and typically have a TGXXGXXG
           cofactor binding motif. Complex (multidomain) SDRs such
           as ketoreductase domains of fatty acid synthase have a
           GGXGXXG NAD(P)-binding motif and an altered active site
           motif (YXXXN). Fungal type ketoacyl reductases have a
           TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
           have lost catalytic activity and/or have an unusual
           NAD(P)-binding motif and missing or unusual active site
           residues. Reactions catalyzed within the SDR family
           include isomerization, decarboxylation, epimerization,
           C=N bond reduction, dehydratase activity,
           dehalogenation, Enoyl-CoA reduction, and
           carbonyl-alcohol oxidoreduction.
          Length = 257

 Score = 27.9 bits (63), Expect = 4.1
 Identities = 15/46 (32%), Positives = 24/46 (52%), Gaps = 8/46 (17%)

Query: 94  ARRE--VQKVYKE----GGGKVTY--EDLAKMTYLEQVISEALRLY 131
           ARRE  +++V  E    G         D++ +   EQV+ EAL+L+
Sbjct: 34  ARREERLEEVKSECLELGAPSPHVVPLDMSDLEDAEQVVEEALKLF 79


>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
          Length = 459

 Score = 28.0 bits (62), Expect = 4.2
 Identities = 8/23 (34%), Positives = 19/23 (82%)

Query: 25  TTTNNNNNNNNNNNNNKNNINSP 47
           +++++  NN+NNNN++ +N+ +P
Sbjct: 190 SSSSSKINNSNNNNHSNSNLMTP 212



 Score = 27.6 bits (61), Expect = 6.4
 Identities = 8/19 (42%), Positives = 17/19 (89%)

Query: 22  TTTTTTNNNNNNNNNNNNN 40
           +++++  NN+NNNN++N+N
Sbjct: 190 SSSSSKINNSNNNNHSNSN 208



 Score = 27.2 bits (60), Expect = 6.7
 Identities = 8/19 (42%), Positives = 17/19 (89%)

Query: 21  TTTTTTTNNNNNNNNNNNN 39
           +++++  NN+NNNN++N+N
Sbjct: 190 SSSSSKINNSNNNNHSNSN 208


>gnl|CDD|214411 MTH00044, ND4, NADH dehydrogenase subunit 4; Validated.
          Length = 458

 Score = 27.6 bits (62), Expect = 4.9
 Identities = 6/42 (14%), Positives = 13/42 (30%), Gaps = 7/42 (16%)

Query: 12  FILFFTTTTTTTTTT-------TNNNNNNNNNNNNNKNNINS 46
            + F      T              N+ +  +  +N + I+S
Sbjct: 392 ILSFPIIGLATVFGAIYSLLIYQLTNSGSFPSWISNVSPISS 433


>gnl|CDD|205311 pfam13130, DUF3952, Domain of unknown function (DUF3952).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria, and is approximately
           110 amino acids in length. There is a conserved VMSAS
           sequence motif.
          Length = 107

 Score = 26.6 bits (59), Expect = 5.1
 Identities = 8/29 (27%), Positives = 12/29 (41%)

Query: 14  LFFTTTTTTTTTTTNNNNNNNNNNNNNKN 42
             +  TT   TT  +N    + NNN  + 
Sbjct: 77  SLYGETTQEITTDIDNEKEKDKNNNYKEE 105


>gnl|CDD|181285 PRK08199, PRK08199, thiamine pyrophosphate protein; Validated.
          Length = 557

 Score = 27.5 bits (62), Expect = 5.6
 Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 4/43 (9%)

Query: 142 TQDYAIPDSPHVIQKGVLVHIPTYAL----QTDATLWSDPLEF 180
           TQ Y + D P   Q  V VH     L    + D  + +DP  F
Sbjct: 286 TQGYTLLDIPVPRQTLVHVHPDAEELGRVYRPDLAIVADPAAF 328


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 27.6 bits (62), Expect = 5.6
 Identities = 7/25 (28%), Positives = 15/25 (60%)

Query: 88  QEVQDKARREVQKVYKEGGGKVTYE 112
           +E+ +KA +EV    +E G +  ++
Sbjct: 279 EEMVEKATKEVDNAIREEGEQAAFD 303


>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
           protein.  [Transport and binding proteins, Other].
          Length = 1394

 Score = 27.8 bits (62), Expect = 6.1
 Identities = 11/17 (64%), Positives = 11/17 (64%)

Query: 3   TPGIFFFYLFILFFTTT 19
           T G FFFYL ILF  T 
Sbjct: 528 TAGRFFFYLLILFICTL 544


>gnl|CDD|227188 COG4851, CamS, Protein involved in sex pheromone biosynthesis
           [General function prediction only].
          Length = 382

 Score = 27.5 bits (61), Expect = 6.4
 Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 2/33 (6%)

Query: 23  TTTTTNNNNNNNNNNNNNKNNINS--PNFLGVI 53
           +TT  NN  N+N+N    K  I    PN+ GV+
Sbjct: 273 STTAANNARNDNDNFLRFKEEIQGYFPNYTGVV 305


>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
          Length = 143

 Score = 26.5 bits (59), Expect = 6.8
 Identities = 14/56 (25%), Positives = 23/56 (41%), Gaps = 9/56 (16%)

Query: 26 TTNNNNNNNNNNNNNKNNINSPNF----LGVIRMQEVVSG----VF-VLIVGGHEN 72
                   N   N K+NI    +    + + R+ +VV G     F  ++V G+EN
Sbjct: 1  MLFLLKKMYNKKKNRKSNIKENKWEERVIQIKRVSKVVKGGKKLSFRAIVVIGNEN 56


>gnl|CDD|224842 COG1931, COG1931, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 140

 Score = 26.6 bits (59), Expect = 7.2
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 71  ENSTSTSTNVLHELAYNQEVQDKARREVQK 100
           E  T  S   L EL   Q + D AR  ++K
Sbjct: 44  EGRTLKSLERLRELLRKQRILDTARMVLEK 73


>gnl|CDD|219019 pfam06420, Mgm101p, Mitochondrial genome maintenance MGM101.  The
          mgm101 gene was identified as essential for maintenance
          of the mitochondrial genome in Saccharomyces
          cerevisiae. Based on its DNA-binding activity, and
          experimental work with a temperature-sensitive mgm101
          mutant, it has been proposed that the mgm101 gene
          product performs an essential function in the repair of
          oxidatively damaged mitochondrial DNA.
          Length = 241

 Score = 26.8 bits (59), Expect = 8.7
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 15 FFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNF 49
          F +TT+    + T  NN NN+  N     I +  F
Sbjct: 4  FISTTSLRVPSNTKRNNGNNSKFNKYIKGIFNREF 38


>gnl|CDD|220296 pfam09580, Spore_YhcN_YlaJ, Sporulation lipoprotein YhcN/YlaJ
          (Spore_YhcN_YlaJ).  This entry contains YhcN and YlaJ,
          which are predicted lipoproteins that have been
          detected as spore proteins but not vegetative proteins
          in Bacillus subtilis. Both appear to be expressed under
          control of the RNA polymerase sigma-G factor. The
          YlaJ-like members of this family have a low-complexity,
          strongly acidic, 40-residue C-terminal domain.
          Length = 169

 Score = 26.5 bits (59), Expect = 9.0
 Identities = 13/32 (40%), Positives = 14/32 (43%)

Query: 20 TTTTTTTTNNNNNNNNNNNNNKNNINSPNFLG 51
          T     T N NN NN NNNN    +   N  G
Sbjct: 24 TDNDGNTNNVNNRNNRNNNNGYGRLTKNNVTG 55


>gnl|CDD|131944 TIGR02898, spore_YhcN_YlaJ, sporulation lipoprotein, YhcN/YlaJ
          family.  YhcN and YlaJ are predicted lipoproteins that
          have been detected as spore proteins but not vegetative
          proteins in Bacillus subtilis. Both appear to be
          expressed under control of the RNA polymerase sigma-G
          factor. The YlaJ-like members of this family have a
          low-complexity, strongly acidic 40-residue C-terminal
          domain that is not included in the seed alignment for
          this model. A portion of the low-complexity region
          between the lipoprotein signal sequence and the main
          conserved region of the protein family was also excised
          from the seed alignment [Cellular processes,
          Sporulation and germination].
          Length = 158

 Score = 26.2 bits (58), Expect = 9.7
 Identities = 13/43 (30%), Positives = 15/43 (34%)

Query: 6  IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
          I F  L +L   T  T        NNN    N  +KNN     
Sbjct: 1  ILFIILLLLLVLTGCTNAQKKNPPNNNVGETNVMSKNNNGMNT 43


>gnl|CDD|151365 pfam10918, DUF2718, Protein of unknown function (DUF2718).  This
           viral family of proteins has no known function.
          Length = 140

 Score = 26.1 bits (57), Expect = 9.7
 Identities = 7/33 (21%), Positives = 12/33 (36%), Gaps = 1/33 (3%)

Query: 28  NNNNNNNNNNNNNKNNINSPNFLGVI-RMQEVV 59
                 N N  N    +   +  G + +M E+V
Sbjct: 107 YCYPYPNKNAKNQPKRVEEKSEKGFLEKMVEMV 139


>gnl|CDD|219031 pfam06444, NADH_dehy_S2_C, NADH dehydrogenase subunit 2
          C-terminus.  This family consists of the C-terminal
          region specific to the eukaryotic NADH dehydrogenase
          subunit 2 protein and is found in conjunction with
          pfam00361.
          Length = 56

 Score = 24.8 bits (55), Expect = 9.8
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 8  FFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNN 40
          FFYL + + TT T    T+T+          N 
Sbjct: 2  FFYLRLSYSTTLTLPPNTSTSTTKWRLKPKKNT 34


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.135    0.397 

Gapped
Lambda     K      H
   0.267   0.0729    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,061,052
Number of extensions: 1018099
Number of successful extensions: 2823
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2410
Number of HSP's successfully gapped: 188
Length of query: 217
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 124
Effective length of database: 6,812,680
Effective search space: 844772320
Effective search space used: 844772320
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.7 bits)