RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8629
         (483 letters)



>gnl|CDD|206637 cd11304, Cadherin_repeat, Cadherin tandem repeat domain.  Cadherins
           are glycoproteins involved in Ca2+-mediated cell-cell
           adhesion. The cadherin repeat domains occur as tandem
           repeats in the extracellular regions, which are thought
           to mediate cell-cell contact when bound to calcium. They
           play numerous roles in cell fate, signalling,
           proliferation, differentiation, and migration; members
           include E-, N-, P-, T-, VE-, CNR-, proto-, and
           FAT-family cadherin, desmocollin, and desmoglein, a
           large variety of domain architectures with varying
           repeat copy numbers. Cadherin-repeat containing proteins
           exist as monomers, homodimers, or heterodimers.
          Length = 98

 Score = 98.9 bits (247), Expect = 4e-25
 Identities = 43/91 (47%), Positives = 52/91 (57%), Gaps = 2/91 (2%)

Query: 353 AEPGTSVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPV 412
           A PGT V  +SA D D G N  +TYS+    +D    F ID  +G IT    +D E    
Sbjct: 10  APPGTVVLTVSATDPDSGENGEVTYSIVSGNEDG--LFSIDPSTGEITTAKPLDREEQSS 67

Query: 413 PKLTVIATDNGVPALSSSATVLVTIHDVNDN 443
             LTV ATD G P LSS+ATV +T+ DVNDN
Sbjct: 68  YTLTVTATDGGGPPLSSTATVTITVLDVNDN 98



 Score = 87.4 bits (217), Expect = 4e-21
 Identities = 36/103 (34%), Positives = 47/103 (45%), Gaps = 7/103 (6%)

Query: 157 SIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQ 216
            +  PEN P        +A D D G      Y+IVSGN +  F +       G +     
Sbjct: 3   EVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPS---TGEITT--- 56

Query: 217 INGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQDVNDN 259
               LDRE    Y+L + A DGG PPL  +  V +T+ DVNDN
Sbjct: 57  -AKPLDREEQSSYTLTVTATDGGGPPLSSTATVTITVLDVNDN 98



 Score = 85.1 bits (211), Expect = 3e-20
 Identities = 34/72 (47%), Positives = 43/72 (59%), Gaps = 2/72 (2%)

Query: 268 YFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKA 327
           Y  +VPENA  GT +  V ATD D+G NGEV YSI     ++DGLF+ID  +G I   K 
Sbjct: 2   YEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSI--VSGNEDGLFSIDPSTGEITTAKP 59

Query: 328 LDFETKEVHDLV 339
           LD E +  + L 
Sbjct: 60  LDREEQSSYTLT 71



 Score = 50.4 bits (121), Expect = 5e-08
 Identities = 39/132 (29%), Positives = 49/132 (37%), Gaps = 43/132 (32%)

Query: 22  LEISEGVPVGTRVGFIG--DGFVGDSGPP----YLIVPVPGSAVDTDLSIEQNTGEIRTK 75
           + + E  P GT V  +   D    DSG      Y IV      +    SI+ +TGEI T 
Sbjct: 4   VSVPENAPPGTVVLTVSATDP---DSGENGEVTYSIVSGNEDGL---FSIDPSTGEITTA 57

Query: 76  VPLDRETRSSYSLVAIPLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENV 135
            PLDRE +SSY+L         V       P                      PLS    
Sbjct: 58  KPLDREEQSSYTLT--------VTATDGGGP----------------------PLSST-A 86

Query: 136 RVVVRVIDENDN 147
            V + V+D NDN
Sbjct: 87  TVTITVLDVNDN 98


>gnl|CDD|214520 smart00112, CA, Cadherin repeats.  Cadherins are glycoproteins
           involved in Ca2+-mediated cell-cell adhesion. Cadherin
           domains occur as repeats in the extracellular regions
           which are thought to mediate cell-cell contact when
           bound to calcium.
          Length = 81

 Score = 87.4 bits (217), Expect = 3e-21
 Identities = 39/83 (46%), Positives = 49/83 (59%), Gaps = 2/83 (2%)

Query: 363 SAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDN 422
           SA D D G N  +TYS+   + +    F ID ++G IT    +D E  P   LTV ATD 
Sbjct: 1   SATDADSGENGKVTYSI--LSGNDDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDG 58

Query: 423 GVPALSSSATVLVTIHDVNDNEP 445
           G P LSS+ATV +T+ DVNDN P
Sbjct: 59  GGPPLSSTATVTITVLDVNDNAP 81



 Score = 78.9 bits (195), Expect = 3e-18
 Identities = 35/87 (40%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 175 AKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIE 234
           A D D G      Y+I+SGN +  F +       G +         LDRE  PEY+L +E
Sbjct: 2   ATDADSGENGKVTYSILSGNDDGLFSIDPE---TGEITT----TKPLDREEQPEYTLTVE 54

Query: 235 ALDGGSPPLRGSMIVNVTIQDVNDNQP 261
           A DGG PPL  +  V +T+ DVNDN P
Sbjct: 55  ATDGGGPPLSSTATVTITVLDVNDNAP 81



 Score = 58.5 bits (142), Expect = 4e-11
 Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 2/53 (3%)

Query: 287 ATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALDFETKEVHDLV 339
           ATDAD+G NG+V YSI     + DGLF+ID ++G I   K LD E +  + L 
Sbjct: 2   ATDADSGENGKVTYSIL--SGNDDGLFSIDPETGEITTTKPLDREEQPEYTLT 52



 Score = 49.7 bits (119), Expect = 5e-08
 Identities = 23/52 (44%), Positives = 27/52 (51%), Gaps = 8/52 (15%)

Query: 105 EPGEIRTKVPLDRETRSSYSLVAI-------PLSGENVRVVVRVIDENDNAP 149
           E GEI T  PLDRE +  Y+L          PLS     V + V+D NDNAP
Sbjct: 31  ETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSST-ATVTITVLDVNDNAP 81


>gnl|CDD|215665 pfam00028, Cadherin, Cadherin domain. 
          Length = 92

 Score = 72.3 bits (178), Expect = 8e-16
 Identities = 35/91 (38%), Positives = 50/91 (54%), Gaps = 2/91 (2%)

Query: 348 SFDEIAEPGTSVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDC 407
           S  E A  GT V  ++A D D G N  I YS+         WF+ID  +G ++    +D 
Sbjct: 4   SVPENAPVGTEVLTVTATDADLGPNGRIFYSILGGGPGG--WFRIDPDTGDLSTTKPLDR 61

Query: 408 ETDPVPKLTVIATDNGVPALSSSATVLVTIH 438
           E+    +LTV+ATD+G P LSS+ TV +T+ 
Sbjct: 62  ESIGEYELTVLATDSGGPPLSSTTTVTITVL 92



 Score = 69.6 bits (171), Expect = 7e-15
 Identities = 34/99 (34%), Positives = 44/99 (44%), Gaps = 9/99 (9%)

Query: 157 SIEFPENTPRDVKRTLNAAKDRDLGIYNTQ-RYNIVSGNTNNAFRLSSHREKDGVLYLDL 215
           S   PEN P   +     A D DLG  N +  Y+I+ G     FR+       G L    
Sbjct: 2   SASVPENAPVGTEVLTVTATDADLG-PNGRIFYSILGGGPGGWFRID---PDTGDLST-- 55

Query: 216 QINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQ 254
                LDRE+I EY L + A D G PPL  +  V +T+ 
Sbjct: 56  --TKPLDRESIGEYELTVLATDSGGPPLSSTTTVTITVL 92



 Score = 66.6 bits (163), Expect = 8e-14
 Identities = 32/72 (44%), Positives = 40/72 (55%), Gaps = 2/72 (2%)

Query: 268 YFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKA 327
           Y A+VPENA VGT +  V ATDAD G NG + YSI        G F ID  +G +   K 
Sbjct: 1   YSASVPENAPVGTEVLTVTATDADLGPNGRIFYSI--LGGGPGGWFRIDPDTGDLSTTKP 58

Query: 328 LDFETKEVHDLV 339
           LD E+   ++L 
Sbjct: 59  LDRESIGEYELT 70



 Score = 41.5 bits (98), Expect = 6e-05
 Identities = 20/75 (26%), Positives = 29/75 (38%), Gaps = 12/75 (16%)

Query: 23 EISEGVPVGTRVGFIGDGFVGD------SGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKV 76
           + E  PVGT V  +      D          Y I+            I+ +TG++ T  
Sbjct: 4  SVPENAPVGTEVLTVT---ATDADLGPNGRIFYSILGGG---PGGWFRIDPDTGDLSTTK 57

Query: 77 PLDRETRSSYSLVAI 91
          PLDRE+   Y L  +
Sbjct: 58 PLDRESIGEYELTVL 72



 Score = 33.0 bits (76), Expect = 0.053
 Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 8/45 (17%)

Query: 105 EPGEIRTKVPLDRETRSSYSLVAI-------PLSGENVRVVVRVI 142
           + G++ T  PLDRE+   Y L  +       PLS     V + V+
Sbjct: 49  DTGDLSTTKPLDRESIGEYELTVLATDSGGPPLSST-TTVTITVL 92


>gnl|CDD|206635 cd00031, CA_like, Cadherin repeat-like domain.  Cadherins are
           glycoproteins involved in Ca2+-mediated cell-cell
           adhesion. The cadherin repeat domains occur as tandem
           repeats in the extracellular regions, which are thought
           to mediate cell-cell contact when bound to calcium. They
           play numerous roles in cell fate, signalling,
           proliferation, differentiation, and migration; members
           include E-, N-, P-, T-, VE-, CNR-, proto-, and
           FAT-family cadherin, desmocollin, and desmoglein, a
           large variety of domain architectures with varying
           repeat copy numbers. Cadherin-repeat containing proteins
           exist as monomers, homodimers, or heterodimers. This
           family also includes the cadherin-like repeats of
           extracellular alpha-dystroglycan.
          Length = 98

 Score = 33.9 bits (77), Expect = 0.035
 Identities = 16/60 (26%), Positives = 28/60 (46%)

Query: 384 DDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDN 443
           +  PSW   +  SG++     +D E   V  ++V A   G     +S+   + ++D NDN
Sbjct: 39  EALPSWLHWEPHSGILEGLEKLDREDKGVHYISVSAASLGANVPQTSSVFSIEVYDENDN 98



 Score = 28.5 bits (63), Expect = 2.4
 Identities = 22/100 (22%), Positives = 40/100 (40%), Gaps = 8/100 (8%)

Query: 271 TVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDK-DGLFNIDSKSGLIFVNKALD 329
           +  E  + G+    +  TD  A S       I+    +      + +  SG++   + LD
Sbjct: 5   SAVEGRSRGSFRVSI-PTDLIASSG--EIIKISAAGKEALPSWLHWEPHSGILEGLEKLD 61

Query: 330 FETKEVHDLVPRFAVEISSFDEIAEPGTSVFQISAFDQDE 369
            E K VH      +V  +S        +SVF I  +D+++
Sbjct: 62  REDKGVH----YISVSAASLGANVPQTSSVFSIEVYDEND 97


>gnl|CDD|214795 smart00736, CADG, Dystroglycan-type cadherin-like domains.
           Cadherin-homologous domains present in metazoan
           dystroglycans and alpha/epsilon sarcoglycans, yeast
           Axl2p and in a very large protein from magnetotactic
           bacteria. Likely to bind calcium ions.
          Length = 97

 Score = 32.3 bits (74), Expect = 0.097
 Identities = 26/86 (30%), Positives = 40/86 (46%), Gaps = 16/86 (18%)

Query: 363 SAFDQDEGINSVITYS--LRENTDDHPSWFQIDSKSGLIT-MKSNVDCETDPVPKLT--V 417
           +  D D      +TYS  L + +   PSW   DS +G ++   +N D     V  L+  V
Sbjct: 23  TFTDAD---GDTLTYSATLSDGSAL-PSWLSFDSDTGTLSGTPTNSD-----VGSLSLKV 73

Query: 418 IATDNGVPALSSSATVLVTIHDVNDN 443
            ATD+     S+S T  +T+ + ND 
Sbjct: 74  TATDSS--GASASDTFTITVVNTNDA 97


>gnl|CDD|238749 cd01472, vWA_collagen, von Willebrand factor (vWF) type A domain;
           equivalent to the I-domain of integrins.  This domain
           has a variety of functions including: intermolecular
           adhesion, cell migration, signalling, transcription, and
           DNA repair. In integrins these domains form heterodimers
           while in vWF it forms homodimers and multimers. There
           are different interaction surfaces of this domain as
           seen by its complexes with collagen with either integrin
           or human vWFA. In integrins collagen binding occurs via 
           the metal ion-dependent adhesion site (MIDAS) and
           involves three surface loops located on the upper
           surface of the molecule. In human vWFA, collagen binding
           is thought to occur on the bottom of the molecule and
           does not involve the vestigial MIDAS motif.
          Length = 164

 Score = 31.8 bits (73), Expect = 0.42
 Identities = 23/104 (22%), Positives = 38/104 (36%), Gaps = 20/104 (19%)

Query: 295 NGEVEYSINRRQSDKDGLFNIDS---KSGLIFVNKALDFETKE-------VHDLVPRFAV 344
           +   E+ +N  +S  D L  + +     G     KAL +  +          + VP+  V
Sbjct: 49  DPRTEFYLNTYRSKDDVLEAVKNLRYIGGGTNTGKALKYVRENLFTEASGSREGVPKVLV 108

Query: 345 EIS---SFDEIAEP-------GTSVFQISAFDQDEGINSVITYS 378
            I+   S D++ EP       G  VF +   + DE     I   
Sbjct: 109 VITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIASD 152


>gnl|CDD|131215 TIGR02160, PA_CoA_Oxy5, phenylacetate-CoA oxygenase/reductase, PaaK
           subunit.  Phenylacetate-CoA oxygenase is comprised of a
           five gene complex responsible for the hydroxylation of
           phenylacetate-CoA (PA-CoA) as the second catabolic step
           in phenylacetic acid (PA) degradation. Although the
           exact function of this enzyme has not been determined,
           it has been shown to be required for phenylacetic acid
           degradation and has been proposed to function in a
           multicomponent oxygenase acting on phenylacetate-CoA
           [Energy metabolism, Other].
          Length = 352

 Score = 32.5 bits (74), Expect = 0.49
 Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 11/68 (16%)

Query: 80  RETRSSYSL---VAIPLSGENVRVVSFYEPGEIRT-KVPLD-RETRSSYSLVAIPLSGEN 134
           R T  + ++   +   L+ +       + PG+  T +  +D  E R SYS+ + P  GE 
Sbjct: 11  RLTADAVAISFEIPDELAEDYR-----FAPGQHLTLRREVDGEELRRSYSICSAPAPGE- 64

Query: 135 VRVVVRVI 142
           +RV V+ I
Sbjct: 65  IRVAVKKI 72


>gnl|CDD|227219 COG4882, COG4882, Predicted aminopeptidase, Iap family [General
           function prediction only].
          Length = 486

 Score = 31.5 bits (71), Expect = 1.2
 Identities = 17/80 (21%), Positives = 29/80 (36%), Gaps = 2/80 (2%)

Query: 127 AIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQ 186
           A+     + R +V   D   +  + P+P      PEN  R  +         D  +  T 
Sbjct: 118 ALLFESRDPRRIVTGGDWGYSVSSSPTPIPVAVVPENYSRYAEEAGRVRLWVDACVERTY 177

Query: 187 RYNIV--SGNTNNAFRLSSH 204
            YN++   G  N    + +H
Sbjct: 178 DYNVIAVDGGENGVVLIGAH 197


>gnl|CDD|177701 PLN00072, PLN00072, 3-isopropylmalate isomerase/dehydratase small
           subunit; Provisional.
          Length = 246

 Score = 30.2 bits (68), Expect = 1.9
 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 4/62 (6%)

Query: 54  VPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVSFYEPGEIRTKV 113
           V G  +DTD  I      +    P + E   SY+L+ +P      R   F EPGE++TK 
Sbjct: 75  VVGDNIDTDQIIPAEYLTLVPSKPDEYEKLGSYALIGLPAF-YKTR---FVEPGEMKTKY 130

Query: 114 PL 115
            +
Sbjct: 131 SI 132


>gnl|CDD|175976 cd04009, C2B_Munc13-like, C2 domain second repeat in Munc13
           (mammalian uncoordinated)-like proteins.  C2-like
           domains are thought to be involved in phospholipid
           binding in a Ca2+ independent manner in both Unc13 and
           Munc13. Caenorabditis elegans Unc13 has a central domain
           with sequence similarity to PKC, which includes C1 and
           C2-related domains. Unc13 binds phorbol esters and DAG
           with high affinity in a phospholipid manner.  Mutations
           in Unc13 results in abnormal neuronal connections and
           impairment in cholinergic neurotransmission in the
           nematode.  Munc13 is the mammalian homolog which are
           expressed in the brain.  There are 3 isoforms (Munc13-1,
           -2, -3) and are thought to play a role in
           neurotransmitter release and are hypothesized to be
           high-affinity receptors for phorbol esters.  Unc13 and
           Munc13 contain both C1 and C2 domains.  There are two C2
           related domains present, one central and one at the
           carboxyl end.  Munc13-1 contains a third C2-like domain.
            Munc13 interacts with syntaxin, synaptobrevin, and
           synaptotagmin suggesting a role for these as scaffolding
           proteins. C2 domains fold into an 8-standed
           beta-sandwich that can adopt 2 structural arrangements:
           Type I and Type II, distinguished by a circular
           permutation involving their N- and C-terminal beta
           strands. Many C2 domains are Ca2+-dependent
           membrane-targeting modules that bind a wide variety of
           substances including bind phospholipids, inositol
           polyphosphates, and intracellular proteins.  Most C2
           domain proteins are either signal transduction enzymes
           that contain a single C2 domain, such as protein kinase
           C, or membrane trafficking proteins which contain at
           least two C2 domains, such as synaptotagmin 1.  However,
           there are a few exceptions to this including RIM
           isoforms and some splice variants of piccolo/aczonin and
           intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions. This cd contains the third C2
           repeat, C2C, and has a type-II topology.
          Length = 133

 Score = 29.5 bits (67), Expect = 1.9
 Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 13/89 (14%)

Query: 391 QIDSKSGLITMKSNVDCETDPVPKLTVIATD--NGVPALSSSATVLVTIHDVNDNEPIFD 448
           +I +   L+ + SN     DP  K+ ++       VP   +      T+       P+FD
Sbjct: 21  EILNARNLLPLDSNGSS--DPFVKVELLPRHLFPDVPTPKTQ-VKKKTLF------PLFD 71

Query: 449 QSFY-NVTIPENKPKGSCILKSIF-YDIF 475
           +SF  NV   +   +G+ +L ++  YD+ 
Sbjct: 72  ESFEFNVPPEQCSVEGALLLFTVKDYDLL 100


>gnl|CDD|198310 cd03201, GST_C_DHAR, C-terminal, alpha helical domain of
           Dehydroascorbate Reductase.  Glutathione S-transferase
           (GST) C-terminal domain family, Dehydroascorbate
           Reductase (DHAR) subfamily; composed of plant-specific
           DHARs, which are monomeric enzymes catalyzing the
           reduction of DHA into ascorbic acid (AsA) using
           glutathione as the reductant. DHAR allows plants to
           recycle oxidized AsA before it is lost. AsA serves as a
           cofactor of violaxanthin de-epoxidase in the xanthophyll
           cycle and as an antioxidant in the detoxification of
           reactive oxygen species. Because AsA is the major
           reductant in plants, DHAR serves to regulate their redox
           state. It has been suggested that a significant portion
           of DHAR activity is plastidic, acting to reduce the
           large amounts of ascorbate oxidized during hydrogen
           peroxide scavenging by ascorbate peroxidase. DHAR
           contains a conserved cysteine in its active site and in
           addition to its reductase activity, shows thiol
           transferase activity similar to glutaredoxins.
          Length = 121

 Score = 28.9 bits (65), Expect = 2.7
 Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 5/30 (16%)

Query: 270 ATVPENATVGTSIFQVFAT-----DADAGS 294
            T PE A+VG+ IF  F T     DA+ GS
Sbjct: 2   KTPPEFASVGSKIFSTFVTFLKSKDANDGS 31


>gnl|CDD|188875 cd09769, Luminal_IRE1, The Luminal domain, a dimerization domain,
           of the Serine/Threonine protein kinase,
           Inositol-requiring protein 1.  The Luminal domain is a
           dimerization domain present in Inositol-requiring
           protein 1 (IRE1), a serine/threonine protein kinase
           (STK) and a type I transmembrane protein that is
           localized in the endoplasmic reticulum (ER). IRE1, also
           called Endoplasmic reticulum (ER)-to-nucleus signaling
           protein (or ERN), is a kinase receptor that also
           contains an endoribonuclease domain in the cytoplasmic
           side. It plays roles in the signaling of the unfolded
           protein response (UPR), which is activated when protein
           misfolding is detected in the ER in order to decrease
           the synthesis of new proteins and increase the capacity
           of the ER to cope with the stress. IRE1 acts as an ER
           stress sensor and is the oldest and most conserved
           component of the UPR in eukaryotes. During ER stress,
           IRE1 dimerizes through its luminal domain and forms
           oligomers, allowing the kinase domain to undergo
           trans-autophosphorylation. This leads to a
           conformational change that stimulates its
           endoribonuclease activity and results in the cleavage of
           its mRNA substrate, HAC1 in yeast and Xbp1 in metazoans,
           promoting a splicing event that enables translation into
           a transcription factor which activates the UPR. Mammals
           contain two IRE1 proteins, IRE1alpha (or ERN1) and
           IRE1beta (or ERN2). IRE1alpha is expressed in all cells
           and tissues while IRE1beta is found only in intestinal
           epithelial cells.
          Length = 295

 Score = 29.2 bits (66), Expect = 4.3
 Identities = 26/102 (25%), Positives = 40/102 (39%), Gaps = 20/102 (19%)

Query: 361 QISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIAT 420
           Q S     +GI     Y+  + T    +W+ +D ++G               P+  V   
Sbjct: 80  QSSPCRSSDGI----LYTGSKQT----TWYTVDPRTGEKIQVLGSGGADSNCPESCVDPD 131

Query: 421 DNGVPALSSSATV-------LVTIHDVNDNEPIFDQSFYNVT 455
           D+     SSS+T+        VTI+D    EPI     +NVT
Sbjct: 132 DDEQSECSSSSTIYIGRTEYTVTIYDSKTREPI-----WNVT 168


>gnl|CDD|223949 COG1018, Hmp, Flavodoxin reductases (ferredoxin-NADPH reductases)
           family 1 [Energy production and conversion].
          Length = 266

 Score = 29.2 bits (66), Expect = 4.5
 Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 2/42 (4%)

Query: 104 YEPGE-IRTKVPLDRETRS-SYSLVAIPLSGENVRVVVRVID 143
           +EPG+ I   +P   E    +YSL + P      R+ V+  D
Sbjct: 35  FEPGQYITVGLPNGGEPLLRAYSLSSAPDEDSLYRISVKRED 76


>gnl|CDD|172654 PRK14166, PRK14166, bifunctional 5,10-methylene-tetrahydrofolate
           dehydrogenase/ 5,10-methylene-tetrahydrofolate
           cyclohydrolase; Provisional.
          Length = 282

 Score = 29.2 bits (65), Expect = 4.7
 Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 11/92 (11%)

Query: 176 KDRDLGIYNTQR-YNIVSGNTNNAFRLSSHREKDGVLYLDLQIN--------GVLDRETI 226
           K +DL +Y  Q    IV+    N  R  S   K+GV+ +D+ IN        G +D E +
Sbjct: 190 KTKDLSLYTRQADLIIVAAGCVNLLR--SDMVKEGVIVVDVGINRLESGKIVGDVDFEEV 247

Query: 227 PEYSLVIEALDGGSPPLRGSMIVNVTIQDVND 258
            + S  I  + GG  P+  +M++  T++   +
Sbjct: 248 SKKSSYITPVPGGVGPMTIAMLLENTVKSAKN 279


>gnl|CDD|178774 PLN03236, PLN03236, 4-alpha-glucanotransferase; Provisional.
          Length = 745

 Score = 29.3 bits (65), Expect = 5.7
 Identities = 28/151 (18%), Positives = 63/151 (41%), Gaps = 8/151 (5%)

Query: 267 RYFATVPENATVGTSIFQVFATDADA--GSNGEVEYS--INRRQSDKDGLFNIDSKSGLI 322
           ++F+ +  +  +G   F+++   A A  G  G    S  I + +    GL+++D      
Sbjct: 359 QFFSAIRIDHILG--FFRIWELPAHAKTGRLGRFRPSLPIRKDELAARGLWDLDRLCEPW 416

Query: 323 FVNKALDFETKE-VHDLVPRFAVEISSFDEIAEPGTSVFQI-SAFDQDEGINSVITYSLR 380
             ++ L+    +   +   RF  E  +  +       +++    FD ++ I +      R
Sbjct: 417 IQSEELEALFGDNDGEAAGRFFEETDAETKPDGTTRGLWKFRKEFDTEQAIFASEALKPR 476

Query: 381 ENTDDHPSWFQIDSKSGLITMKSNVDCETDP 411
           ++  DH +  Q + ++GL+ +  N     DP
Sbjct: 477 DDFPDHLNDEQEELRAGLMQLFQNRCLLRDP 507


>gnl|CDD|224089 COG1167, ARO8, Transcriptional regulators containing a DNA-binding
           HTH domain and an aminotransferase domain (MocR family)
           and their eukaryotic orthologs [Transcription / Amino
           acid transport and metabolism].
          Length = 459

 Score = 29.2 bits (66), Expect = 5.9
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 155 FMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLS 202
           F+ +E PE    D +  L AA ++ + +          G+  N  RLS
Sbjct: 388 FLWLELPE--GIDARELLAAALEKGVVVTPLGSAFSADGDPRNGLRLS 433


>gnl|CDD|233505 TIGR01646, vgr_GE, Rhs element Vgr protein.  This model represents
           the Vgr family of proteins, associated with some classes
           of Rhs elements. This model does not include a large
           octapeptide repeat region, VGXXXXXX, found in the Vgr of
           Rhs classes G and E.
          Length = 483

 Score = 28.6 bits (64), Expect = 7.7
 Identities = 11/62 (17%), Positives = 18/62 (29%), Gaps = 8/62 (12%)

Query: 320 GLIFVNKALDFETKEVHDLVPRFAVEISSFDEIAEPGTSVFQ--ISAFDQDEGINSVITY 377
            L   +  LD       D      +  ++  +       +F   I+ F      N    Y
Sbjct: 21  ILRSADADLDLAAMLGKDASLSLELPDAASTQ------RIFTGVIAGFSLGSTANGDARY 74

Query: 378 SL 379
           SL
Sbjct: 75  SL 76


>gnl|CDD|226991 COG4644, COG4644, Transposase and inactivated derivatives, TnpA
           family [DNA replication, recombination, and repair].
          Length = 323

 Score = 28.6 bits (64), Expect = 7.7
 Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 9/70 (12%)

Query: 93  LSGENV--------RVVSFYEPGEIRTKVPLDRETR-SSYSLVAIPLSGENVRVVVRVID 143
           L G NV        R + F + GEI       +E    + SL+   L   N + + + ++
Sbjct: 250 LEGLNVGESRNALNRAIFFGKGGEIYENYLEGQEDSALALSLLQNSLVYWNTKYIQQALE 309

Query: 144 ENDNAPTFPS 153
                   P 
Sbjct: 310 ILKEGRNTPE 319


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.135    0.387 

Gapped
Lambda     K      H
   0.267   0.0713    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,670,623
Number of extensions: 2423649
Number of successful extensions: 1957
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1941
Number of HSP's successfully gapped: 40
Length of query: 483
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 382
Effective length of database: 6,457,848
Effective search space: 2466897936
Effective search space used: 2466897936
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (27.3 bits)