RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17355
         (849 letters)



>gnl|CDD|222339 pfam13716, CRAL_TRIO_2, Divergent CRAL/TRIO domain.  This family
           includes divergent members of the CRAL-TRIO domain
           family. This family includes ECM25 that contains a
           divergent CRAL-TRIO domain identified by Gallego and
           colleagues.
          Length = 149

 Score =  100 bits (251), Expect = 1e-24
 Identities = 45/141 (31%), Positives = 76/141 (53%), Gaps = 11/141 (7%)

Query: 701 NAIIVFSACFLPHHSRADYHYVMDNLFFYVLHTLDQLIT-EDYVLVYLHGATSRHNMPTF 759
             ++VF A  LP     D     D L FY+L  L + +  + +V+V  H   +R+  P++
Sbjct: 13  RPVLVFDAKRLPASDLDDL----DRLLFYLLSILSEELKPKGFVVVIDHTGVTRNK-PSW 67

Query: 760 SWLKRCYQMIDRKLKKNLKQLYLVHATFWLKTII-VMSKPFISSKF-SKKIQFIMSLNEL 817
           SWLK+ Y+++ R   KNLK +Y+VH + +LK  +  + + F S K   KK+ ++ SL+EL
Sbjct: 68  SWLKKLYKLLPRAFPKNLKAVYIVHPSTFLKKFLKTLGRLFSSKKLFKKKVIYVSSLSEL 127

Query: 818 AERLPIEE--ASIPDKVRQYD 836
           +E +   +    +   V  YD
Sbjct: 128 SEYIDPTQLPLELGG-VLSYD 147


>gnl|CDD|221603 pfam12496, BNIP2, Bcl2-/adenovirus E1B nineteen kDa-interacting
           protein 2.  This domain family is found in eukaryotes,
           and is typically between 119 and 133 amino acids in
           length. There is a conserved HGGY sequence motif. This
           family is Bcl2-/adenovirus E1B nineteen kDa-interacting
           protein 2. It interacts with pro- and anti- apoptotic
           molecules in the cell.
          Length = 126

 Score = 97.1 bits (242), Expect = 9e-24
 Identities = 50/130 (38%), Positives = 71/130 (54%), Gaps = 25/130 (19%)

Query: 590 HQSRKKLL-----LSTDLFSGNLMEDDLASLSDE---DLDELLTPSNAD-DDLENSILEQ 640
           H+ RK+L      LS D   G+L+ DDL    D+   ++D+L TP ++D  D   + LE 
Sbjct: 1   HKRRKRLSAPNLNLSLDQSEGSLLSDDLLETPDDLDINVDDLDTPDDSDSLDFPGNELEW 60

Query: 641 LNLSDDDL------------EIQELSAKEEREEGRRWKICVVSGVEKRIDMKVIEPYKRV 688
               +DD              I + +A+EER +GR W+   +   E RIDMKVIEPYK+V
Sbjct: 61  ----EDDTPKAKAAESPPAGSIPQYTAEEERRDGRLWRTVRIGEQEHRIDMKVIEPYKKV 116

Query: 689 LSHGGYLAEG 698
           +SHGGY  +G
Sbjct: 117 ISHGGYYGDG 126


>gnl|CDD|238099 cd00170, SEC14, Sec14p-like lipid-binding domain. Found in
           secretory proteins, such as S. cerevisiae
           phosphatidylinositol transfer protein (Sec14p), and in
           lipid regulated proteins such as RhoGAPs, RhoGEFs and
           neurofibromin (NF1). SEC14 domain of Dbl is known to
           associate with G protein beta/gamma subunits.
          Length = 157

 Score = 83.2 bits (206), Expect = 2e-18
 Identities = 38/153 (24%), Positives = 71/153 (46%), Gaps = 14/153 (9%)

Query: 680 KVIEPYKRVLSHGGYLAEGCHNAIIVFSACFLPHHSRADYHYVMDNLFFYVLHTLDQLIT 739
           + ++   +V   GG   EG    +++  A               + L  Y+++TL++L+ 
Sbjct: 2   EELKELGKVGYLGGRDKEGRP--VLIIRAGNKDLSK----SLDSEELLRYLVYTLEKLLQ 55

Query: 740 EDYVLVY-------LHGATSRHNMPTFSWLKRCYQMIDRKLKKNLKQLYLVHATFWLKTI 792
           ED   V        L G +  H +P  S LK+  +++     + LK +Y+++  ++ K +
Sbjct: 56  EDDEQVEGFVVIIDLKGLSLSHLLPDPSLLKKILKILQDNYPERLKAVYIINPPWFFKVL 115

Query: 793 IVMSKPFISSKFSKKIQFIMS-LNELAERLPIE 824
             + KPF+S K  KKI F+ S   EL + +  E
Sbjct: 116 WKIVKPFLSEKTRKKIVFLGSDKEELLKYIDKE 148


>gnl|CDD|214706 smart00516, SEC14, Domain in homologues of a S. cerevisiae
           phosphatidylinositol transfer protein (Sec14p).  Domain
           in homologues of a S. cerevisiae phosphatidylinositol
           transfer protein (Sec14p) and in RhoGAPs, RhoGEFs and
           the RasGAP, neurofibromin (NF1). Lipid-binding domain.
           The SEC14 domain of Dbl is known to associate with G
           protein beta/gamma subunits.
          Length = 158

 Score = 75.4 bits (186), Expect = 6e-16
 Identities = 27/166 (16%), Positives = 72/166 (43%), Gaps = 19/166 (11%)

Query: 682 IEPYKRVLSHG-GYLAEGCHNAIIVFSACFLPHHSRADYHYVMDNLFFYVLHTLDQLI-- 738
           +E  K  +  G GY  +G    +++  A      S       ++ L  Y+++ L++++  
Sbjct: 2   LELLKAYIPGGRGYDKDGR--PVLIERAGRFDLKSVT-----LEELLRYLVYVLEKILQE 54

Query: 739 -TEDYVLVYLHGATSRHNMP----TFSWLKRCYQMIDRKLKKNLKQLYLVHATFWLKTII 793
             +   +           +       S L++  +++     + L ++Y+++  ++ + + 
Sbjct: 55  EKKTGGIEGFTVIFDLKGLSMSNPDLSVLRKILKILQDHYPERLGKVYIINPPWFFRVLW 114

Query: 794 VMSKPFISSKFSKKIQFIMSL--NELAERLPIEEASIPDKVRQYDK 837
            + KPF+  K  +KI+F+ +    EL E +  E+  +P+++     
Sbjct: 115 KIIKPFLDEKTREKIRFVGNDSKEELLEYIDKEQ--LPEELGGTLD 158


>gnl|CDD|216042 pfam00650, CRAL_TRIO, CRAL/TRIO domain. 
          Length = 152

 Score = 46.9 bits (112), Expect = 5e-06
 Identities = 23/107 (21%), Positives = 51/107 (47%), Gaps = 11/107 (10%)

Query: 729 YVLHTLDQLITEDY--------VLVYLHGAT-SRHNMPTFSWLKRCYQMIDRKLKKNLKQ 779
           +++  L++ +            V++ + G + S  +    S+LK+  +++     + L +
Sbjct: 37  FLVLVLERALLLLPEGQIEGITVIIDMKGLSLSNMDKWPISFLKKILKILQDNYPERLGK 96

Query: 780 LYLVHATFWLKTIIVMSKPFISSKFSKKIQFIMS--LNELAERLPIE 824
           + +V+A +    I  + KPF+  K  +KI F  +  L EL + +P E
Sbjct: 97  ILIVNAPWIFNVIWKLIKPFLDPKTREKIHFHKNSNLEELLKYIPPE 143


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 36.5 bits (84), Expect = 0.070
 Identities = 37/215 (17%), Positives = 73/215 (33%), Gaps = 22/215 (10%)

Query: 16  KPDQQMIKNHVNSYRRETNGEDNLKRSRDNSRSRDHGEVLDNDLAKTDQYVGGDKSLTNS 75
             ++Q+ K    ++   T   D ++RS  N+ +  +           D+    +K    +
Sbjct: 184 ISNKQLKKQTNVTH---TTCYDKMRRSPRNTSTIKNNT--------NDKNKEKNKEKDKN 232

Query: 76  FEPIIEGSQERKATKEK--LMNSSFEQIIEKNSTALNRDTQNNIGLVNNYHNLTPE---- 129
            +   +G ++ K   EK  + NS F+  I ++ T  N+  + N+        L       
Sbjct: 233 IKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSVLLKRKSQC 292

Query: 130 --LDSPDSDDLFDSHKVPSRRFNSHARSIENNNANCNTPENDVETVTELSNNNSDSTNIS 187
              DS    +     K    +   H  +  +N+ N +T  ++      +SN N  S    
Sbjct: 293 LRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKK 352

Query: 188 ANVIETNTP---VHMTEPSELNLNNNPTNHSVTQN 219
               +           +       N+P  H  T N
Sbjct: 353 QTTTKHTNNTKNNKYNKTKTTQKFNHPLRHHATIN 387


>gnl|CDD|224242 COG1323, COG1323, Predicted nucleotidyltransferase [General
           function prediction only].
          Length = 358

 Score = 32.3 bits (74), Expect = 1.1
 Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 2/83 (2%)

Query: 398 TNNISAESYLKKYDDINAQVRAYLNKTQSQDLHKTLQSAAEANG--ATSVRQSELEALNE 455
            NNI    Y+K    I   + AY  K +  D H          G  AT++R++      E
Sbjct: 161 PNNILGLEYVKAIYVIGYSIAAYTIKREGADYHDKFLFEERLEGASATAIRKAIFSGDLE 220

Query: 456 RSHRYLSTNAEDDISSALGASKV 478
           R    +     + +SS  G    
Sbjct: 221 RIANMVPKETLEILSSKRGRPLS 243


>gnl|CDD|140349 PTZ00328, PTZ00328, eukaryotic initiation factor 5a; Provisional.
          Length = 166

 Score = 31.2 bits (70), Expect = 1.1
 Identities = 28/110 (25%), Positives = 53/110 (48%), Gaps = 17/110 (15%)

Query: 588 GNHQSRKKLLLSTDLFSGNLMEDDLASLSDEDLDELLTPS----NADDDLENSILEQLNL 643
           G H   K  +++TD+F+GN +ED   S  + ++  + T +    +   + + S+   L+L
Sbjct: 52  GKHGHAKVSIVATDIFTGNRLEDQAPSTHNVEVPFVKTFTYSVLDIQPNEDPSLPAHLSL 111

Query: 644 SDDDLEIQ---------ELSA--KEEREEGRRWKICVVS--GVEKRIDMK 680
            DD+ E +          L+   KE+ + G+   + VVS  G E+ +  K
Sbjct: 112 MDDEGESREDLDMPPDAALATQIKEQFDSGKEVLVVVVSAMGTEQVLQTK 161


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 32.5 bits (74), Expect = 1.2
 Identities = 15/43 (34%), Positives = 19/43 (44%), Gaps = 2/43 (4%)

Query: 393 NASDFTNNISAESYLKKYDDINAQVRAYLNKTQSQDLHKTLQS 435
           NA    N +  E  LKK +    Q+  Y  K Q Q+LH  L  
Sbjct: 341 NAMYMQNTVQLEEELKKANAARGQLETY--KRQVQELHAKLSE 381


>gnl|CDD|132697 TIGR03658, IsdH_HarA, haptoglobin-binding heme uptake protein HarA.
            HarA is a heme-binding NEAT-domain (NEAr Transporter,
           pfam05031) protein which has been shown to bind to the
           haptoglobin-hemoglobin complex in order to extract heme
           from it. HarA has also been reported to bind hemoglobin
           directly. HarA (also known as IsdH) contains three NEAT
           domains as well as a sortase A C-terminal signal for
           localization to the cell wall. The heme bound at the
           third of these NEAT domains has been shown to be
           transferred to the IsdA protein also localized at the
           cell wall, presumably through an additional specific
           protein-protein interaction. Haptoglobin is a hemoglobin
           carrier protein involved in scavenging hemoglobin in the
           blood following red blood cell lysis and targetting it
           to the liver.
          Length = 895

 Score = 31.8 bits (71), Expect = 2.1
 Identities = 23/82 (28%), Positives = 38/82 (46%), Gaps = 7/82 (8%)

Query: 150 NSHARSIENNNANCNTPENDVETVTELSNNNSDSTNISANV---IETNTPVHMTEPS-EL 205
           N  + S  +N  N NT   +  T+   +N    +TN+S         N     ++P+ E 
Sbjct: 241 NDQSSSDASNQTNTNTSNQNTSTINNANNQPQATTNMSQPAQPKSSANADQASSQPAHET 300

Query: 206 NLN---NNPTNHSVTQNDLNRE 224
           N N   N+ TN S  Q+D+N++
Sbjct: 301 NSNGNTNDKTNESSNQSDVNQQ 322


>gnl|CDD|225489 COG2938, COG2938, Uncharacterized conserved protein [Function
           unknown].
          Length = 94

 Score = 29.2 bits (66), Expect = 2.6
 Identities = 17/52 (32%), Positives = 22/52 (42%), Gaps = 14/52 (26%)

Query: 597 LLSTDLFSGNLMEDDLASLSDEDLDELLTPSNADDDLENSILEQLNLSDDDL 648
           +   DL  G   E +  SLSDE+LDE              +LE     D+DL
Sbjct: 26  MRELDLILGPFAEKEFDSLSDEELDEFER-----------LLEC---EDNDL 63


>gnl|CDD|99731 cd06410, PB1_UP2, Uncharacterized protein 2. The PB1 domain is a
           modular domain mediating specific protein-protein
           interaction which play a role in many critical cell
           processes such as osteoclastogenesis, angiogenesis,
           early cardiovascular development, and cell polarity. A
           canonical PB1-PB1 interaction, which involves
           heterodimerization of two PB1 domains, is required for
           the formation of macromolecular signaling complexes
           ensuring specificity and fidelity during cellular
           signaling. The interaction between two PB1 domain
           depends on the type of PB1. There are three types of PB1
           domains: type I which contains an OPCA motif, acidic
           aminoacid cluster, type II which contains a basic
           cluster, and type I/II which contains both an OPCA motif
           and a basic cluster.  Interactions of PB1 domains with
           other protein domains have been described as
           noncanonical PB1-interactions.
          Length = 97

 Score = 29.1 bits (66), Expect = 2.7
 Identities = 14/25 (56%), Positives = 19/25 (76%), Gaps = 1/25 (4%)

Query: 615 LSDEDLDELLTPSNADDDLENSILE 639
           L DEDLD L++ SN D+DL+N + E
Sbjct: 58  LPDEDLDALISVSN-DEDLKNMMEE 81


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 30.7 bits (70), Expect = 4.0
 Identities = 31/155 (20%), Positives = 53/155 (34%), Gaps = 34/155 (21%)

Query: 609 EDDLASLSDEDLDELLTPSNADDDLENSILEQLNLSDDDLEIQELSAKEEREEGRRWKIC 668
           + +     DE+ DE    S+  +D E+         DDD +  +++ +E   E RR +  
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDED--------DDDDDDDDDIATRERSLERRRRR-- 171

Query: 669 VVSGVEKRIDMKVIEPYKRVLSHGGYLAEGCHNAIIVFS-ACFLPHHSRADYHYVMDNLF 727
                E       +E Y      G Y       A+ V+  A  L    + +     D L+
Sbjct: 172 ----REWEEKRAELEFY---YYQGTYYGTSS--ALTVYELASEL---GKDN----NDLLW 215

Query: 728 FYVLHTLDQLIT-----EDYVLVY--LHGATSRHN 755
             ++      +      E Y  +   L    SR N
Sbjct: 216 LAIVGLTSLYLHGKIDPERYETLVELLQDEVSRLN 250


>gnl|CDD|220296 pfam09580, Spore_YhcN_YlaJ, Sporulation lipoprotein YhcN/YlaJ
           (Spore_YhcN_YlaJ).  This entry contains YhcN and YlaJ,
           which are predicted lipoproteins that have been detected
           as spore proteins but not vegetative proteins in
           Bacillus subtilis. Both appear to be expressed under
           control of the RNA polymerase sigma-G factor. The
           YlaJ-like members of this family have a low-complexity,
           strongly acidic, 40-residue C-terminal domain.
          Length = 169

 Score = 29.6 bits (67), Expect = 4.4
 Identities = 12/84 (14%), Positives = 26/84 (30%), Gaps = 12/84 (14%)

Query: 161 ANCNTPENDVETVTELSNNNSDSTNISANVIETNTPVHMTEPSELNLNNNPTNHSVTQND 220
           A C T     +   +   +N  +TN   N    N             NN     +     
Sbjct: 7   AGCGTNNAADDDNRQNVTDNDGNTNNVNNRNNRNN------------NNGYGRLTKNNVT 54

Query: 221 LNREKKDRQEKDLSTKVSDLLEKV 244
                 + +  +++ K++  + K+
Sbjct: 55  GENGYNNIERGEVADKIARRVAKL 78


>gnl|CDD|222051 pfam13334, DUF4094, Domain of unknown function (DUF4094).  This
           domain is found in plant proteins that often carry a
           galactosyltransferase domain, pfam01762, at their
           C-terminus.
          Length = 89

 Score = 28.0 bits (63), Expect = 4.9
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 214 HSVTQNDLNREKKDRQEKDLSTKVSDLLEKVSSL 247
             +   D +R+KK+  +KD+  +VSD  E + SL
Sbjct: 39  LQLVSEDCDRKKKESDQKDIMGEVSDTHEAIQSL 72


>gnl|CDD|133110 cd06253, M14_ASTE_ASPA_like_3, Peptidase M14 Succinylglutamate
           desuccinylase (ASTE)/aspartoacylase (ASPA)-like;
           uncharacterized subgroup.  A functionally
           uncharacterized subgroup of the Succinylglutamate
           desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily
           which is part of the M14 family of
           metallocarboxypeptidases. ASTE catalyzes the fifth and
           last step in arginine catabolism by the arginine
           succinyltransferase pathway, and aspartoacylase (ASPA,
           also known as aminoacylase 2, and ACY-2; EC:3.5.1.15)
           cleaves N-acetyl L-aspartic acid (NAA) into aspartate
           and acetate. NAA is abundant in the brain, and
           hydrolysis of NAA by ASPA may help maintain white
           matter. ASPA is an NAA scavenger in other tissues.
           Mutations in the gene encoding ASPA cause Canavan
           disease (CD), a fatal progressive neurodegenerative
           disorder involving dysmyelination and spongiform
           degeneration of white matter in children. This enzyme
           binds zinc which is necessary for activity. Measurement
           of elevated NAA levels in urine is used in the diagnosis
           of CD.
          Length = 298

 Score = 29.9 bits (68), Expect = 5.9
 Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 1/66 (1%)

Query: 237 VSDLLEKVSSLNANLNPKVVFHTPQTTPLQTTPLKLKLNSLPVQSHPLEQHALDRTDQEY 296
                E +  L   LN  V++  P +T  + T L   L      +  +E     R D+EY
Sbjct: 135 YKTGSESLLPLAKFLNLDVIWIHPSSTVDEAT-LAHNLQVWGTPAFSVEMGVGMRIDKEY 193

Query: 297 CNEPLD 302
            N+ + 
Sbjct: 194 ANQIVS 199


>gnl|CDD|223696 COG0623, FabI, Enoyl-[acyl-carrier-protein].
          Length = 259

 Score = 29.5 bits (67), Expect = 6.4
 Identities = 12/17 (70%), Positives = 13/17 (76%), Gaps = 1/17 (5%)

Query: 354 IRSNA-SASPIRTLSPS 369
           IR NA SA PIRTL+ S
Sbjct: 181 IRVNAISAGPIRTLAAS 197


>gnl|CDD|220070 pfam08920, SF3b1, Splicing factor 3B subunit 1.  This family
           consists of several eukaryotic splicing factor 3B
           subunit 1 proteins, which associate with p14 through a
           C-terminus beta-strand that interacts with beta-3 of the
           p14 RNA recognition motif (RRM) beta-sheet, which is in
           turn connected to an alpha-helix by a loop that makes
           extensive contacts with both the shorter C-terminal
           helix and RRM of p14. This subunit is required for 'A'
           splicing complex assembly (formed by the stable binding
           of U2 snRNP to the branchpoint sequence in pre-mRNA) and
           'E' splicing complex assembly.
          Length = 142

 Score = 28.5 bits (64), Expect = 7.4
 Identities = 11/23 (47%), Positives = 16/23 (69%), Gaps = 1/23 (4%)

Query: 605 GNLMEDDLASLSDEDLDELLTPS 627
           G  +++    L+DE+LDELL PS
Sbjct: 75  GTEIDERNRPLTDEELDELL-PS 96


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 29.6 bits (67), Expect = 7.7
 Identities = 21/113 (18%), Positives = 40/113 (35%)

Query: 571 KNARKSVATSDSHSGAQGNHQSRKKLLLSTDLFSGNLMEDDLASLSDEDLDELLTPSNAD 630
           K   K    S   +  +        L    D+   N  +DD     D+DLD+     + D
Sbjct: 93  KKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDD 152

Query: 631 DDLENSILEQLNLSDDDLEIQELSAKEEREEGRRWKICVVSGVEKRIDMKVIE 683
           D+ ++   +  ++ D+D E +E    E+  +   +          R   K  +
Sbjct: 153 DEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAK 205


>gnl|CDD|233335 TIGR01271, CFTR_protein, cystic fibrosis transmembrane conductor
           regulator (CFTR).  The model describes the cystis
           fibrosis transmembrane conductor regulator (CFTR) in
           eukaryotes. The principal role of this protein is
           chloride ion conductance. The protein is predicted to
           consist of 12 transmembrane domains. Mutations or
           lesions in the genetic loci have been linked to the
           aetiology of asthma, bronchiectasis, chronic obstructive
           pulmonary disease etc. Disease-causing mutations have
           been studied by 36Cl efflux assays in vitro cell
           cultures and electrophysiology, all of which point to
           the impairment of chloride channel stability and not the
           biosynthetic processing per se [Transport and binding
           proteins, Anions].
          Length = 1490

 Score = 29.9 bits (67), Expect = 8.8
 Identities = 20/112 (17%), Positives = 39/112 (34%), Gaps = 14/112 (12%)

Query: 460 YLSTNAEDDISSALGASKVMYVSDDEDVSVYDTIVSQDCHGVKLVKNARKSVPPPGDNHS 519
                  +D        K   V +DE         +Q  HG++     R+SV     + +
Sbjct: 724 KAQATTIEDAVREPSERKFSLVPEDEQGEESLPRGNQYHHGLQHQAQRRQSVLQLMTHSN 783

Query: 520 GAQRNHQN-----RKKLLLST--------DLFSGNLMEDDCID-DEDVSVYD 557
             +   +      RKK  ++         D++S  L +D   +  E+++  D
Sbjct: 784 RGENRREQLQTSFRKKSSITQQNELASELDIYSRRLSKDSVYEISEEINEED 835


>gnl|CDD|235620 PRK05841, flgE, flagellar hook protein FlgE; Validated.
          Length = 603

 Score = 29.4 bits (66), Expect = 9.0
 Identities = 17/69 (24%), Positives = 25/69 (36%), Gaps = 16/69 (23%)

Query: 155 SIENNNANCNTP---ENDVETVTELSNN----NSDST---------NISANVIETNTPVH 198
           S+ NN AN NT     N  E  T  S++    N+ S            S+N I      +
Sbjct: 21  SLSNNIANVNTLGYRSNIPEFKTLFSSHLDALNAKSVVANDRNYGVTGSSNAISNKDGEY 80

Query: 199 MTEPSELNL 207
                E ++
Sbjct: 81  KPSDGEFHM 89


>gnl|CDD|187630 cd05372, ENR_SDR, Enoyl acyl carrier protein (ACP) reductase (ENR),
           divergent SDR.  This bacterial subgroup of ENRs includes
           Escherichia coli ENR. ENR catalyzes the
           NAD(P)H-dependent reduction of enoyl-ACP in the last
           step of fatty acid biosynthesis. De novo fatty acid
           biosynthesis is catalyzed by the fatty acid synthetase
           complex, through the serial addition of 2-carbon
           subunits. In bacteria and plants,ENR catalyzes one of
           six synthetic steps in this process. Oilseed rape ENR,
           and also apparently the NADH-specific form of
           Escherichia coli ENR, is tetrameric.  Although similar
           to the classical SDRs, this group does not have the
           canonical catalytic tetrad, nor does it have the typical
           Gly-rich NAD-binding pattern. Such so-called divergent
           SDRs have a GXXXXXSXA NAD-binding motif and a YXXMXXXK
           (or YXXXMXXXK) active site motif. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold (alpha/beta folding pattern with a central
           beta-sheet), an NAD(P)(H)-binding region, and a
           structurally diverse C-terminal region. Classical SDRs
           are typically about 250 residues long, while extended
           SDRs are approximately 350 residues. Sequence identity
           between different SDR enzymes are typically in the
           15-30% range, but the enzymes share the Rossmann fold
           NAD-binding motif and characteristic NAD-binding and
           catalytic sequence patterns. These enzymes catalyze a
           wide range of activities including the metabolism of
           steroids, cofactors, carbohydrates, lipids, aromatic
           compounds, and amino acids, and act in redox sensing.
           Classical SDRs have an TGXXX[AG]XG cofactor binding
           motif and a YXXXK active site motif, with the Tyr
           residue of the active site motif serving as a critical
           catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase (15-PGDH)
           numbering). In addition to the Tyr and Lys, there is
           often an upstream Ser (Ser-138, 15-PGDH numbering)
           and/or an Asn (Asn-107, 15-PGDH numbering) contributing
           to the active site; while substrate binding is in the
           C-terminal region, which determines specificity. The
           standard reaction mechanism is a 4-pro-S hydride
           transfer and proton relay involving the conserved Tyr
           and Lys, a water molecule stabilized by Asn, and
           nicotinamide. Extended SDRs have additional elements in
           the C-terminal region, and typically have a TGXXGXXG
           cofactor binding motif. Complex (multidomain) SDRs such
           as ketoreductase domains of fatty acid synthase have a
           GGXGXXG NAD(P)-binding motif and an altered active site
           motif (YXXXN). Fungal type ketoacyl reductases have a
           TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
           have lost catalytic activity and/or have an unusual
           NAD(P)-binding motif and missing or unusual active site
           residues. Reactions catalyzed within the SDR family
           include isomerization, decarboxylation, epimerization,
           C=N bond reduction, dehydratase activity,
           dehalogenation, Enoyl-CoA reduction, and
           carbonyl-alcohol oxidoreduction.
          Length = 250

 Score = 29.1 bits (66), Expect = 9.3
 Identities = 11/17 (64%), Positives = 13/17 (76%), Gaps = 1/17 (5%)

Query: 354 IRSNA-SASPIRTLSPS 369
           IR NA SA PI+TL+ S
Sbjct: 177 IRVNAISAGPIKTLAAS 193


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.127    0.356 

Gapped
Lambda     K      H
   0.267   0.0735    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 40,971,117
Number of extensions: 3934364
Number of successful extensions: 2884
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2864
Number of HSP's successfully gapped: 51
Length of query: 849
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 744
Effective length of database: 6,280,432
Effective search space: 4672641408
Effective search space used: 4672641408
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.0 bits)