High-resolution structure of a lytic polysaccharide monooxygenase from Hypocrea jecorina reveals a predicted linker as an integral part of the catalytic domain

For decades, the enzymes of the fungus Hypocrea jecorina have served as a model system for the breakdown of cellulose. Three-dimensional structures for almost all H. jecorina cellulose-degrading enzymes are available, except for HjLPMO9A, belonging to the AA9 family of lytic polysaccharide monooxygenases (LPMOs). These enzymes enhance the hydrolytic activity of cellulases and are essential for cost-efficient conversion of lignocellulosic biomass. Here, using structural and spectroscopic analyses, we found that native HjLPMO9A contains a catalytic domain and a family-1 carbohydrate-binding module (CBM1) connected via a linker sequence. A C terminally truncated variant of HjLPMO9A containing 21 residues of the predicted linker was expressed at levels sufficient for analysis. Here, using structural, spectroscopic, and biochemical analyses, we found that this truncated variant exhibited reduced binding to and activity on cellulose compared with the full-length enzyme. Importantly, a 0.95-Å resolution X-ray structure of truncated HjLPMO9A revealed that the linker forms an integral part of the catalytic domain structure, covering a hydrophobic patch on the catalytic AA9 module. We noted that the oxidized catalytic center contains a Cu(II) coordinated by two His ligands, one of which has a His-brace in which the His-1 terminal amine group also coordinates to a copper. The final equatorial position of the Cu(II) is occupied by a water-derived ligand. The spectroscopic characteristics of the truncated variant were not measurably different from those of full-length HjLPMO9A, indicating that the presence of the CBM1 module increases the affinity of HjLPMO9A for cellulose binding, but does not affect the active site.

Lignocellulosic biomass represents an alternative source for fuels and materials and has the potential to replace fossil fuels as we strive to become a sustainable and carbon neutral society. Cellulose is the major structural polysaccharide of plant cell walls and is the most abundant and readily accessible source of renewable organic carbon on the planet. The enzymatic conversion of cellulose to glucose provides a carbon source for the biomanufacturing of fuels and chemicals. For decades, the secreted enzymes of the fungus Hypocrea jecorina (anamorph Trichoderma reesei) have served as a model system for the breakdown of cellulose. Major cellulases of H. jecorina, such as cellobiohydrolases Cel6A and Cel7A, endoglucanases Cel5A and Cel7B, and ␤-glucosidase Cel3A have been studied in detail, including determination of their 3D structures (1)(2)(3)(4)(5).
Lytic polysaccharide monooxygenases (LPMOs) 3 boost the hydrolytic activity of cellulases (6 -10). It is now well established that LPMOs are copper monooxygenases that introduce chain breaks through the oxidation of C1, C4, or both carbons of a sugar ring, using electrons provided by an external donor, which can be an enzyme (e.g. cellobiose dehydrogenase) or a chemical compound with antioxidant activity such as ascorbic acid (11)(12)(13). The boosting effect of LPMOs on cellulases has been attributed to the LPMO-mediated creation of free chain breaks, which are subsequently targeted by hydrolytic cellulases (14). LPMOs are currently categorized in carbohydrate-active enzyme (CAZy) families auxiliary activity family 9 (AA9), AA10, AA11, and AA13 (15). The AA9 family comprises fungal LPMOs that have been shown to be active on cellulose, soluble oligosaccharides (16), and hemicellulosic substrates such as xyloglucan and xylan (17). The H. jecorina genome has 3 AA9 encoding genes; HjLPMO9A (protein ID 73643), Cel61B/ HjLPMO9B (protein ID 120961), and an AA9-like protein (protein ID 27554). HjLPMO9A and -B were shown to be significantly up-regulated along with other cellulases, when T. reesei QM6a and its hypercellulolytic mutant (Rut C30) were grown on corn stover and sawdust (18), indicating its importance in plant biomass decomposition. HjLPMO9B was the first fungal LPMO/AA9 enzyme structure to be published (19). HjLPMO9A was initially characterized as an endoglucanase EG4, due to a weak endoglucanase activity detected from the culture filtrate of an HjLPMO9A-expressing Saccharomyces cerevisiae strain (20). HjLPMO9A expression and endoglucanase activity were also reported in a subsequent study, where it was homologously overexpressed in T. reesei (21). HjLPMO9A expressed in Pichia pastoris was shown to oxidatively cleave phosphoric acid-swollen cellulose (PASC) and soluble oligosaccharides, yielding both C4-and C1-oxidized fragments (22). HjLPMO9A is a major component of enzyme mixtures that have been optimized for the degradation of pretreated biomass (23).
HjLPMO9A is a secreted modular protein of 323 amino acids, containing an AA9 domain (residues 1 to 220), a predicted linker region (residues 221 to 288), and a family 1 carbohydrate-binding module (CBM1). About 20% of AA9 enzymes occur as a catalytic domain C terminally coupled to a flexible linker followed by a CBM1 (PFAM database). CBM1 domains are compact cysteine knots, about 40 amino acid residues large, and have a flat surface with 2-3 aromatic residues that bind tightly to crystalline cellulose. They are associated with fungal enzymes and proteins involved in degradation of lignocellulose and appear most often with AA9 catalytic domains (PFAM database). Deletion of CBM1 modules from fungal (hydrolytic) cellulases reduce the ability of the enzyme to bind to crystalline cellulose although did not affect catalytic activity (24,25).
The linker sequences connecting the CBM1 and the catalytic module display low sequence complexity with an overabundance of serine and threonine residues (26). These serine and threonine residues are often O-glycosylated, where glycosylation has been shown to protect the linker against proteolytic degradation and to increase the binding affinity (27). Linker sequences show little conservation and by all accounts are intrinsically disordered proteins that do not contain structural elements (27). Removal of linker sequences and CBM1 modules has been necessary to obtain crystals of cellulases for structure determination by X-ray crystallography (1,4,5,28).
The role of CBM1 modules on the properties of AA9 enzymes has received little attention and most 3D structures are of AA9 LPMOs that natively do not contain a CBM1 or linker sequence (9, 19, 29 -33). Neurospora crassa LPMO9C contains both a linker and a CBM1, and crystallized when produced without the predicted linker and CBM1 (29). Although NcLPMO9C displays broad specificity for polymeric and oligomeric ␤(1-4)-glycans, the mutant enzyme lacking a CBM1 domain showed reduced binding affinity for phosphoric acidswollen cellulose and xyloglucan (29). LPMOs from Podospora anserina that occur natively with a CBM1 have been shown to release more oxidized products from cellulose than LPMOs without CBM1 (34). Interestingly, a 2016 study by Crouch et al. (35) showed that CBM2a and CBM3a modules had a measureable effect on activity and substrate specificity of AA10 cellulose-active LPMOs and were even found to modulate the mode of action of AA10 LPMOs.
Here we report the crystal structure of HjLPMO9A after removal of the CBM1 and part of the linker sequence. The structure was determined to 0.95-Å resolution and revealed a surprisingly ordered linker structure. The effects of the CBM1 module on substrate binding, activity on cellulose, and spectroscopic properties of the copper atom in the active site were determined in detail. This work expands our knowledge of the carbohydrate-active enzymatic toolkit of the industrially relevant fungus H. jecorina and its model system for the breakdown of cellulose.

Removal of CBM and expression of HjLMPO9A-⌬CBM
A significant fraction, ϳ17%, of the currently available AA9 sequences (PF03443 in Pfam, ϳ409 of 2459) have a CBM1 attached to the C terminus of the catalytic AA9 domain via a variable linker region (pfamseq database, EBI Pfam version 30.0, http://pfam.xfam.org/family/PF03443). 4 For other enzymes, such as cellobiohydrolase HjCel7A, the linker and CBM have been successfully removed by proteolytic cleavage. Structure determination of HjCel7A showed that the linker had been cleaved directly after the annotated catalytic domain (1). Genetic truncations of HjLPMO9A were designed to remove all or part of its linker. HjLPMO9B is a homolog from H. jecorina that lacks a linker and CBM and has a known structure (19). Fig. 1 shows the sequence alignment of HjLPMO9A and HjLPMO9B. HjLPMO9A was truncated after residue 229 based on this alignment, however, no expression was observed. Truncation after residue 231 resulted in very low expression levels of truncated protein (data not shown). Additional truncations were tested and a wild-type level of protein production was observed with 21 extra amino acid residues, i.e. when the protein was truncated after residue 252, denoted HjLPMO9A-gm⌬CBM. In parallel, truncated HjLPMO9A was produced enzymatically using papain, resulting in HjLPMO9A-⌬CBM. Both HjLPMO9A-gm⌬CBM and HjLPMO9A-⌬CBM crystallized. The structure of both HjLPMO9A-⌬CBM and HjLPMO9A-gm⌬CBM were determined by X-ray crystallography and the structure of these are described later in this paper.

Cellulose binding and activity of HjLPMO9A and HjLPMO9A-gm⌬CBM
Removal of the CBM1 module from HjLPMO9A reduced its affinity for cellulose. Fixed concentrations of HjLPMO9A or HjLPMO9A-gm⌬CBM were incubated with increasing amounts of Avicel and the amounts of unbound protein in the supernatant were determined ( Fig. 2A). No electron donor was present, which ensured the inactivity of the enzymes. For both enzymes, the data fit well to a Langmuir adsorption isotherm. HjLPMO9A binds to almost full saturation (0.97 Ϯ 0.01), whereas HjLPMO9A-gm⌬CBM does not (0.83 Ϯ 0.06). The binding constant of HjLPMO9A is 0.26 Ϯ 0.02% dry solids; for HjLPMO9A-gm⌬CBM it is 6.9 Ϯ 1.2% dry solids. Thus, the CBM1 module was found to contribute significantly to the affinity of HjLPMO9A for cellulose and its removal reduced the affinity for cellulose by more than 20-fold compared with the full-length enzyme. This is similar to what has been found for NcLPMO9C that was modified to lack its CBM (29).
Removal of the carbohydrate-binding module from HjLPMO9A also reduced activity on cellulose. Cellulose-degrading activity was determined using PASC at 0.25% dry solids. PASC is insoluble and forms an opaque suspension. The catalytic activity of HjLPMO9A solubilizes PASC, which was monitored by recording the OD420 (Fig. 2B). Only cellooligosaccharides up to a degree of polymerization of 8 are soluble in water (36). Enzymatic cuts that result in longer oligosaccharides are therefore not detected. It is likely they are formed since PASC is made from Avicel and the average degree of polymerization of Avicel is about 200 (37). For HjLPMO9A the maximum ⌬OD420 is 0.85 Ϯ 0.12, whereas it is 0.49 Ϯ 0.13 for HjLPMO9A-gm⌬CBM. The enzyme concentration with half of the maximum activity is not significantly different between HjLPMO9A and HjLPMO9A-gm⌬CBM, i.e. 4.8 Ϯ 1.6 and 5.9 Ϯ 3.9 M, respectively. Thus, deletion of the carbohydratebinding domain reduced the PASC-solubilizing activity of the HjLPMO9A truncated variant by about 50%. A similar reduction in activity was observed when the CBM1 module was removed from NcLPMO9C (29).

Activity profiles analyzed by HPAEC-PAD
Comparison of the activity profiles on reduced PASC of HjLPMO9A and HjLPMO9A-⌬CBM (Fig. 3) shows that the removal of the CBM does not seem to alter the reaction specificity of HjLPMO9A on PASC. HjLPMO9A produces both C1 and C4 oxidized oligosaccharides, as was also reported by Tanghe et al. (22). Ratios of integrated peak areas of HjLPMO9A versus HjLPMO9A-⌬CBM for C1-oxidized products compared with C4-oxidized products were constant, which further indicates that the reaction specificity is not altered.

Overall structure of H. jecorina LPMO9A
The HjLPMO9A-⌬CBM (containing 17 of the linker residues) crystallized in space group P2 1 with unit cell parameters of a ϭ 42.9 Å, b ϭ 61.6 Å, and c ϭ 47.8 with a ␤ angle of 112.1°. The asymmetric unit of the crystal contains one enzyme molecule and the calculated Matthews coefficient was 2.2. The structure was solved by molecular replacement using the structure of H. jecorina LPMO9B as a template (PDB 2vtc) and was refined at 0.95-Å resolution. The refined and deposited structure model exhibits crystallographic R and R-free values of 11.5 and 12.7% and contains a total of 2,655 non-hydrogen atoms, including 248 amino acid residues, 1 copper atom, 2 N-acetyl-D-glucosamine residues, 15 mannose residues, 2 sulfate ions, and 372 water molecules. The amino acid residues are numbered according to the mature protein with the signal peptide removed, beginning at His-1. Statistics of the diffraction data and structure refinement are summarized in Table 1.
The overall fold of the HjLPMO9A catalytic domain is a ␤-sandwich fold consisting of two ␤-sheets formed by eight ␤-strands and common to all AA9 enzymes (19,31

Biochemical characterization of LPMO9A from H. jecorina
residues near the N terminus compared with other AA9s, in the so-called L2 loop, place it as a Type-3 AA9 LPMO (32).
The copper atom at the catalytic center, modeled at 85% occupancy, is coordinated in one plane by three bonds to 2 histidine residues, His-1 and His-86. His-1 shows density of a methylation on the ⑀ 2 nitrogen atom (Fig. 4). There is also a water molecule 2.06 Å from the copper that could be modeled with only 40% occupancy. This indicates an ϳ50 -50 mixed state of Cu(II) and Cu(I) for the copper atom most likely due to photoreduction during X-ray data collection. Also near the active site, although not in the same plane and further from the copper, are the O -atom of the Tyr-174 side chain 2.8 Å from the copper, and one additional water molecule 2.4 Å from copper. A sulfate ion is also present at a distance of 3.9 Å from the copper. Two regions show increased temperature factors for the backbone, each ϳ20 Å from the catalytic center and located on opposite sides of the potential cellulose-binding surface: residues 23-29 on one side and residues 120 -126 and 211-217 on the other. The enzyme contain two cysteine bridges, both located on the same side of the ␤-sheets. The first, Cys-56 to Cys-177, is strictly conserved and located in the core of the enzyme and the second, between Cys-97 and Cys-101, is located in a loop near the enzyme surface.
In HjLPMO9A, the last residue of the typical AA9-fold is a glycine (Gly-227). A glycine is also the last residue of HjLPMO9B and TaLPMO9A (9). Interestingly, all five N. crassa LPMOs with known structures as well as PcLPMO9D have a conserved cysteine as the last residue forming a disulfide bridge. The sequence corresponding to the AA9-fold suggests that HjLPMO9A belongs to Type-3 AA9 LPMOs (11,32). However, the structured backbone of HjLPMO9A contains an additional 21 residues from the linker region in an extended loop conformation that wraps around the molecule and ends on the opposite side of the enzyme (Fig. 5, A and B). In Lentinus similis AA9A (LsAA9A; PDB 5acj), the C terminus is ϳ15 residues longer than the typical LPMO fold, but this native enzyme does not contain any CBM (30). What is striking about the C-terminal extension in HjLPMO9A is the presence of eight O-glyco-

Biochemical characterization of LPMO9A from H. jecorina
sylations on the additional 21 residues. Altogether there are 17 O-glycosylation sites of which 15 could be modeled and refined resulting in reliable temperature factors. All of these sites are located within the last 58 residues when mapped to the sequence. In addition, there are two N-glycosylations at Asn-59 and Asn-137, each with one N-acetylglucosamine attached, and situated on opposite sides of the potential cellulose-binding surface. One of these, the glycosylation at Asn-137 (Fig. 6), has the same approximate location as the N-linked glycosylations of the other three Type-3 AA9 LPMOs (HjLPMO9B (19), TaLPMO9A (9), and NcLPMO9M (32)). Interestingly, these glycans have the same spatial location as the insert region II in Type-2 LPMOs, which is the signature sequence of these strict C4 oxidizing LPMOs (11,32,38 (39,40). In both structures the GH18-fold leads into an extended loop that is highly O-glycosylated. In the H. jecorina glucoamylase, however, the structure of the complete linker and carbohydrate-binding module (CBM20) could be determined together with the catalytic domain revealing the relative orientation of the domains (40). The role of the structured part of the linker in HjLPMO9A may be similar in the sense that it could restrict the localization of the CBM to one side of the catalytic domain.
A superposition of HjLPMO9A-⌬CBM and HjLPMO9A-gm⌬CBM structures using LSQ Superpose, the algorithm encoded in Coot, gave root mean square deviations of 0.136 and 0.164 Å for the main chain atoms 1-248 and all atoms 1-248, respectively. The main difference between the two structures is the loss of dual conformations of the side chains of some of the residues and sugar molecules in the structure of HjLPMO9A-gm⌬CBM, which is most likely because of the difference in the resolution of the two structures.
Room temperature Abs, CD, and low temperature (5 K) MCD spectra were obtained on copper-loaded HjLPMO9A-⌬CBM samples. These spectra were fit simultaneously with Gaussian functions at the energies given in Table 3. Energies were allowed to shift on the order of a few hundred wavenumbers in MCD relative to Abs and CD due to the temperature

Biochemical characterization of LPMO9A from H. jecorina
difference, and the bands sharpened at low temperature in the MCD spectrum. The peak fits (Table 3) are shown as the dashed colored lines in Fig. 8. The low temperature MCD spectrum of HjLPMO9A-⌬CBM shows features that are intense relative to the Abs spectrum and can therefore be assigned as ligand field transitions (44). The MCD spectrum can be fit with a minimum of 3 peaks centered at ϳ12,900 cm Ϫ1 , ϳ13,900 cm Ϫ1 , and ϳ15,900 cm Ϫ1 . The derivative shape of the low temperature MCD spectrum, referred to as a pseudo-A term, is comprised of two opposite signed C-terminal features assigned to d xz/yz 3 d x2-y2 ligand field transitions, which can spin orbit couple. The CD spectrum shown in Fig. 8B contains a negative feature at ϳ13,900 cm Ϫ1 (blue dashed curve in Fig. 8B). The observation of this feature in the room temperature CD spectrum provides two important details. First, it indicates that there must be a corresponding peak at a comparable energy in the low temperature MCD spectrum of Fig. 8C. Second, as the d z2 3 d x2-y2 transition is magnetic dipole forbidden it should not have significant CD intensity; therefore, this peak is assigned as the d xy 3 d x2-y2 transition.
The spectra of Fig. 8 are similar to the ligand field spectra assigned in several tetragonal mononuclear copper sites including peptidylglycine ␣-hydroxylating monoxygenase and the [Cu(II)(1,2-dmIm) 4 ](BF 4 ) 2 model complex (42). These spectroscopic data were then used to correlate the active site of the wild-type enzyme with that of HjLPMO9A-⌬CBM. The spectra of both are shown as overlays in Fig. 9. The X-and Q-band EPR spectra were simulated and the Abs, CD, and MCD spectra were fit with identical parameters to those of HjLPMO9A-⌬CBM. Importantly, the spectra of wild type do not exhibit measurable differences from HjLPMO9A-⌬CBM, showing that they have equivalent ligand field transition energies, spin Hamiltonian parameters, and ground-state copper covalency. This finding shows that the removal of the carbohydrate-binding module does not affect the copper active site.

Conclusions
Lytic polysaccharide monooxygenases enhance the efficiency of enzyme mixtures for the breakdown of cellulosic substrates and are important for the economic production of biofuels. The major enzymes of the cellulose-degrading H. jecorina have been studied in great detail and 3D structures have been determined for most of them. HjLPMO9A is the major LPMO from this well studied cellulase-producing fungus. We removed the carbohydrate-binding module both post-translationally with hydrolysis by papain and at the genetic level. Although the activity of HjLPMO9A-⌬CBM was found to decrease relative to the full-length enzyme, the spectroscopic comparison reveals that this difference is not reflected in the Cu(II) active site. The ligand field spectroscopy presented is highly sensitive to the ligand environment around the Cu(II) and is not perturbed by the removal of the CBM. Given the spectroscopic observations and the data indicating no change in C-1 and C-4 specificity, the decreased activity of HjLPMO9A-⌬CBM is likely a result of a decrease in affinity for cellulose. The structure of HjLPMO9A presented here not only adds to the set of existing structures, but also highlights a previously unknown structural importance of what was previously identified as a linker region. We have shown that ϳ21 amino acid residues of the linker connecting the catalytic domain to the non-catalytic CBM should be retained for enzyme expression levels comparable with wild type, and that the crystal structures show an organized structure of an extensively glycosylated linker for a cellulose-active enzyme.

Generation of full-length and truncated proteins
H. jecorina LPMO9A (HjLPMO9A) full-length gene sequence (GenBank TM number CAA71999.1) was cloned, by PCR, from

Biochemical characterization of LPMO9A from H. jecorina
QM6a genomic DNA. Using Gateway cloning (Life Technologies), it was introduced into the pTTTpyr2 vector to produce the pTTTpyr2-HjLPMO9A plasmid (pTTTpyr2 is similar to the pTTTpyrG vector described before (PCT publication WO 2011/063308), except that the pyrG gene is replaced with the pyr2 gene). Using the pTTTpyr2-HjLPMO9A plasmid, HjLPMO9A-truncated genes were generated using primer combinations that truncated HjLPMO9A after residue 231 of the mature sequence, just after the catalytic domain and after residue 252, in the middle of the predicted linker (HjLPMO9A-gm⌬CBM) (Fig. 1) 7,713,725). After sealing the plate with an oxygen-permeable membrane, the plates were incubated at 28°C for 6 days, with shaking at 220 rpm (Infors incubator shaker). Protein samples were harvested by transferring the culture medium to a 96-well filter plate (Corning 3505) and collecting the filtrate under vacuum. Larger samples were prepared by fermentation in 1-liter DASGIP reactors (Eppendorf D76FB04MBPD) and centrifugation to generate cell-free supernatants.

Protein purification
Cell-free supernatants of H. jecorina cultures expressing HjLPMO9A were concentrated using a centrifugal concentrator with a 10-kDa cut-off (Vivascience, Littleton, MA). For crystallization experiments, 2 ml of the concentrated HjLPMO9A culture filtrate was loaded on an equilibrated (25 mM NaAc, pH 5.0) PD10 column (GE Healthcare) and eluted using the equilibration buffer following the instructions by the manufacturer. One ml of the eluted protein solution was directly applied to a Superdex 75 size exclusion purification column (GE Healthcare) equilibrated with 25 mM NaAc, pH 5.0.
In the subsequent step, the HjLPMO9A protein was treated with papain to remove the CBM. First, papain (Sigma) was activated in 0.1 M BisTris buffer (Sigma) at pH 6.5 containing 10 mM 2-mercaptoethanol (Merck Millipore) and 1 mM EDTA (Merck Millipore). Then, the activated papain was mixed with 3 mg/ml of HjLPMO9A in 0.1 M NaAc, pH 5.0, to a final papain concentration of 1:10 (papain:HjLPMO9A) and the mixture was incubated for 24 h at 37°C. The papain-treated protein solution was then concentrated to 2 ml and applied to a Superdex 75 column equilibrated with 25 mM Tris-HCl, pH 7.5, and 150 mM NaCl. The fractions corresponding to HjLPMO9A-⌬CBM were pooled and the buffer of the protein solution was changed to 25 mM Tris-HCl, pH 7.5, and 25 mM NaCl. The sample was concentrated to a final protein concentration of 30 mg/ml as estimated by measuring the absorbance of the protein solution at 280 nm on a NanoDrop1000 (ThermoScientific, Thermo Fischer Scientific Inc.) and using the calculated extinction coefficient of 54,360 M Ϫ1 cm Ϫ1 for the full-length protein and 48,150 M Ϫ1 cm Ϫ1 for the HjLPMO9A-⌬CBM. For biophysical characterization, Avicel binding assay, and PASC activity assay, protein purification was carried out in a different manner than described above for crystallization. Culture supernatants of H. jecorina expressing HjLPMO9A-and HjLPMO9A-truncated variants were starting materials for purification. Column purifications were performed using an Äkta Explorer FPLC system (GE Healthcare Biosciences). A 560-ml Sephadex G-25 desalting column (GE Healthcare Bio-Sciences) was equilibrated with 20 mM MES, pH 6, containing 25 mM sodium chloride. Samples were loaded and eluted iso- The absorbance and CD spectra in panels A and B were obtained by subtracting the corresponding spectra of the apoenzyme from the copper-loaded spectra. The low temperature MCD spectrum of panel C was obtained by subtracting the 0 T spectrum from the 7 T data.

Biochemical characterization of LPMO9A from H. jecorina
cratically at a flow rate of 5 ml/min. Fractions containing protein were pooled for hydrophobic interaction chromatography. Samples were mixed 1:1 with a saturated ammonium sulfate solution (ϳ3 M) in 20 mM MES, pH 6, incubated at room temperature for 15 min, and centrifuged at 4000 rpm for 10 min. Supernatants were loaded onto a 30-ml phenyl-Sepharose column (GE Healthcare Biosciences) equilibrated with 20 mM MES, pH 6, containing 1.8 M ammonium sulfate and eluted at 0.5 ml/min with a linearly decreasing gradient of ammonium sulfate. Fractions containing the target protein, as determined by SDS-PAGE, were pooled. Samples were concentrated using Sartorius Vivaspin concentrators with a 5-kDa membrane cutoff (Thermo Fisher Scientific, Waltham, MA). A Superdex 75 10/300 column (GE Healthcare) was equilibrated with 50 mM MES, pH 6, containing 100 mM sodium chloride. Samples with 1-ml volumes were filtered, loaded, and eluted isocratically at 0.6 ml/min. Fractions containing pure target protein as judged by SDS-PAGE were pooled for further study.
For the EPR experiment samples of HjLPMO9A and HjLPMO9A-⌬CBM were prepared with 10 -20% EPR active Cu/Fe present. Apoenzyme was generated by dialysis in Slide-A-Lyzer dialysis cassettes (10,000 MWCO; purchased from Thermo Scientific) against a solution of 50 mM EDTA, 50 mM MES buffer, pH 6.0. After a 2-day dialysis, the cassettes were buffer exchanged in 50 mM MES buffer, pH 6.0, overnight. The apoenzyme was extracted from the dialysis cassettes and concentrated in Corning Spin-X ultracentrifuge filters. Copper reconstitution was performed by adding a 5-fold excess of Cu(II) to enzyme from a 3.48 mM Cu(NO 3 ) 2 in 50 mM MES stock, pH 4.0. Excess copper was removed by dialysis against MES buffer, pH 6.0, and samples were reconcentrated. Biquinoline copper assays and EPR spin integration confirmed binding of a single Cu(II) ion per molecule. Samples for magnetic circular dichroism (MCD) were prepared in MES buffer, pH 6.0, and mixed with 50 -60% (v/v) glycerol to obtain high quality glasses. The samples were then injected into MCD cells, which consist of two quartz disks sealed with a 3-mm rubber spacer.

Protein crystallization, structure determination, and refinements
Initial protein crystallization screens were performed by sitting-drop vapor diffusion experiments of 0.3-l drops in a 96-well plate, prepared using a Mosquito crystallization robot (TPP Labtech, Cambridge, United Kingdom). Small needle-like crystals started to appear after 1 week incubation at 28°C, in an ammonium sulfate crystallization screen (Qiagen, Germany) by 1:1 mixing of 30 mg/ml of protein and crystal screen reservoir solution (1.6 M AmSO 4 and 0.1 M citric acid, pH 4.0). Larger crystals were obtained by introducing nuclei from crushed crystals into new drops by streak seeding. Seeding resulted in crystals that grew to optimum size after 1 week. Prior to data collection, crystals were frozen in liquid N 2 using the crystallization solution with 35% PEG 3350 and 30% m-PEG 2000 added as cryoprotectant. The crystals of the HjLPMO9A-⌬CBM X-ray diffraction data on a HjLPMO9A-⌬CBM crystal were collected with a Pilatus detector at wavelength 0.972425 Å at 100 K on beam line ID23-1 at the European Synchrotron Radiation Facility, Grenoble, France. The structure of HjLPMO9A-⌬CBM was determined by molecular replacement using the program PHASER (45), and using the structure of H. jecorina LPMO9B (PDB code 2vtc) as the search model. The best solution was obtained with one molecule in the asymmetric unit.
Purification, crystallization, and structure determination of HjLPMO9A-gm⌬CBM was carried out in a similar manner as for HjLPMO9A-⌬CBM. Two ml of the concentrated HjLPMO9A-gm⌬CBM culture filtrate was loaded on a PD10 column. One ml of eluted protein was loaded onto a Superdex 75 size exclusion purification column (GE Healthcare) equilibrated with 25 mM NaAc, pH 5.0. Fractions corresponding to HjLPMO9A-gm⌬CBM were pooled and concentrated to 27 mg/ml in 25 mM Tris-HCl, pH 7.5, and 10 mM NaCl. Crystals of HjLPMO9A-gm⌬CBM were grown using hanging drop-vapor diffusion crystallization experiments incubated at 20°C. The drops were prepared by mixing protein solution of 17 mg/ml with an equal amount of a well solution containing 1.6 M AmSO 4 and 0.1 M citric acid, pH 4.0. Prior to data collection, crystals were frozen in liquid N 2 using the crystallization solution with 35% PEG 3350 added as cryoprotectant. The crystals of the HjLPMO9A-gm⌬CBM enzyme belong to the space group P2 1 with unit cell parameters a ϭ 42.9, b ϭ 62.1, c ϭ 48.0 Å, and ␤ ϭ 111.8°. X-ray diffraction data from a HjLPMO9A-gm⌬CBM crystal was collected on beam line ID23, ESRF with a Pilatus detector at a wavelength 0.87257 Å. The structure of HjLPMO9A-gm⌬CBM was determined by molecular replacement using the structure of HjLPMO9A-⌬CBM as the search model.
X-ray diffraction data were processed using XDS (46,47). The integrated data were then scaled using the scaling program SCALA (Collaborative Computational Project Number 4, 1994). For cross-validation and R free calculations (48), 5% of the reflections was set aside. The program REFMAC5 (49) was used for structure model refinements, and manual model rebuilding was performed with Coot (50,51), with maximum likelihood A-weighted 2F obs Ϫ F calc electron density maps (50). Solvent molecules were automatically added using the automatic water picking function in the ARP/wARP package (52). Picked water molecules were selected or discarded manually by visual inspection of the 2F obs Ϫ F calc electron density maps. The copper atoms bound in the active site of the enzyme in the two HjLPMO9A structures were introduced at a final stage of the structure refinement. Statistics from data processing and structure refinement are summarized in Table 1. The coordinates for the two final structure models, and the structure factors, have been deposited in the Protein Data Bank with accession codes 5O2X and 5O2W, respectively. PyMOL (version 1.5) was used for analysis of the structures and figure preparations.

Avicel-binding assay
Avicel (PH-101, FMC BioPolymer)-binding assays were conducted as described before (53). Various concentrations of Avi-cel solids were incubated at room temperature overnight with 0.125 mg/ml of LPMO (full-length or truncated) protein. The Avicel was removed by centrifugation and the supernatant concentration of the LPMO was determined by UHPLC. The fraction of bound LPMO was calculated from the starting and remaining concentrations.

Phosphoric acid-swollen cellulose-activity assay
Relative activity of LPMO on PASC was assayed essentially as previously described (54). Purified LPMO enzymes (full-length and truncated) were serially diluted in 96-well microtiter plate wells. PASC (0.25% solids) was combined with ascorbic acid (1 mM), copper sulfate (100 M), and sodium acetate buffer (50 mM), pH 5, or acetate/BisTris buffer (120 mM), pH 7. Plates were sealed with foil and incubated at 50°C with shaking for 6 h. LPMO-dependent PASC solubilization was measured by following the reduction of light scattering optical density at 420 nm. Activity is expressed as the (negative of the) change in absorbance relative to an initial, untreated PASC sample value.

HPAEC-PAD product characterizations
Potential differences in enzymatic behavior of HjLPMO9A and HjLPMO9A-⌬CBM on reduced PASC were analyzed using high performance anion exchange chromatography (HPAEC). PASC was prepared and reduced according to Westereng et al. (7). The LPMO enzymes were preincubated with CuSO 4 corresponding to ϫ10 the enzyme concentration for 30 min at ambient temperature. HjLPMO9A and HjLPMO9A-⌬CBM were prepared as mentioned above. Phanerochaete chrysosporum LPMO9D was used as a C1-only oxidizer positive control and was produced and purified as described by Wu et al. (33). 0.2 ml of mixture containing (final concentrations) 0.1 mM ascorbic acid, 0.06 -0.08 mM enzyme (no enzyme was added to the negative control), and 0.2% PASC was prepared in a 2-ml Eppendorf tube and incubated for 16 h at 50°C and 1000 rpm in a Thermo mixer. After centrifugation, 19 l of the supernatant was analyzed on a Dionex ICS3000 (Dionex, Sunnyvale, CA) equipped with a pulsed amperimetric detector using the Standard Quad waveform for detection. Separation of oligosaccharides was performed using PA1 guard and analytical columns at 30°C as described by Westereng et al. (55) with the exception of an extra 3-min isocratic elution directly following the sample injection.

EPR, CD, and MCD characterizations
X-band EPR spectra were obtained with a Bruker EMX spectrometer, an ER 041 XG microwave bridge, and an ER4116DM cavity. A sample temperature of 77 K was maintained using a liquid nitrogen finger dewar. EPR settings were as follows: frequency Ϸ 9.6 GHz, power Ϸ 10 milliwatt, Modulation Amplitude ϭ 3.00 G. All spectra were averaged over 3 scans. Q-band spectra were obtained using an ER 051 QR microwave bridge, an ER 5106QT resonator, and an Oxford Instruments continuous-flow CF935 cryostat that held the sample at 77 K. Q-band EPR settings were: frequency Ϸ 34 GHz, power Ϸ 0.37 milliwatt, Modulation Amplitude ϭ 5.00 G. All spectra were averaged over 20 scans. EPR spin quantitation of the paramagnetic copper content was performed using a Biochemical characterization of LPMO9A from H. jecorina 0.945 mM AAS copper standard solution, Specpure (purchased from Alfa Aesar), in MES, pH 6.0, and 40% glycerol. EPR spectra were simulated using the SpinCount simulation software developed by Professor Michael Hendrich of Carnegie Mellon University (www.chem.cmu.edu/groups/hendrich/ facilities/index.html). 4 UV-visible absorption spectra were acquired on an Agilent 8453 diode array spectrophotometer, in the energy range 300 -1100 nm. Circular dichroism (CD) spectra (4°C) and magnetic CD (MCD) spectra (4 K) were measured on a Jasco 730 spectrapolarimeter with an S-20 photomultiplier tube or a liquid nitrogen-cooled InSb detector coupled via an 8-foot extended sample compartment to an Oxford Instruments SM4000 -7T magnet. Zero-field baseline effects were eliminated in MCD by subtracting the 0 T scan from the ϩ7 T scan. Simultaneous Gaussian fitting of the UV-visible, CD, and MCD data were done using PeakFit 4.0 (Jandel).