Full article: Numerical solution of a parameter estimation problem arising in Prompt-Gamma Neutron Activation Analysis

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

We study a parameter estimation problem which arises from measurements with QUANTOM, a measurement facility utilizing Prompt-Gamma Neutron Activation Analysis (PGNAA) to analyze elemental masses in 200-l nuclear waste drums. The presence of neutron flux measurements in QUANTOM offers additional information for elemental composition reconstruction, motivating an alternative to the standard iterative approach. Therefore, the measurement is modelled using approximate models for neutron and photon transport. We introduce a mathematical formulation for solving the resulting parameter estimation problem of QUANTOM measurements and employ an adjoint-based method to determine the elemental composition. Uniqueness of solutions in this context is studied, and the results offer novel insights for PGNAA applications.

KEYWORDS:

MATHEMATICS SUBJECT CLASSIFICATIONS:

1. Introduction

For the disposal of low level (LLW) and intermediate level (ILW) radioactive waste in Germany a final deep geological repository called Konrad is currently expected to go into operation around the year 2030. In the process of disposal of nuclear wastes in Konrad one of the major challenges are the compliance with waste acceptance criteria in order to assure a safe and sustainable storage of the waste, and especially the proof of compliance. Radiological waste acceptance criteria can be verified using passive measurements of the gamma emitting radionuclide inventory of waste drums [Citation1,Citation2] and correlation of the remaining radionuclides. For the waste acceptance criteria regarding the material (elemental) composition no metrological solution is applied as a standard so far requiring a recourse on existing documentation or visual inspection by opening waste drums. To overcome this situation, a measurement device called QUANTOM [Citation3,Citation4] has been developed that is capable of performing a non-destructive analysis of elemental masses of radioactive wastes in 200-l drums. The parameter estimation problem arising in the evaluation of the spatially resolved measurement data from QUANTOM is investigated in this work.

The determination of the elemental composition of radioactive waste drums with QUANTOM is based on Prompt-Gamma Neutron Activation Analysis (PGNAA). Free neutrons are produced by a built-in neutron generator, which is in fact a small linear accelerator where Deuterium is accelerated to a Deuterium target. The resulting neutrons are moderated to thermal energies by a massive surrounding graphite layer and interact with the material. When a single neutron is captured by an atom, it becomes energetically excited. In the process of de-excitation gamma quants at element-specific energies are emitted [Citation5]. This is the measurement signal resulting in a gamma spectrum ranging between 100 keV up to 12 MeV at max. Therefore, peaks in such a PGNAA spectrum can be correlated with an element, even specific isotopes. For the determination of the elemental composition from the gamma spectra the gamma and neutron transport in the complete measurement facility need to be considered, as shown in [Citation6].

Because of the large volume samples under consideration in QUANTOM (typically 200-l-drums in Germany), a spatially resolved measurement is carried out corresponding to a complete surface scan. This is achieved by lifting the drum into 4 different positions and rotating the drum so that it can be measured from 6 different angles for each position. With two gamma detectors, a total of 48 gamma spectra is recorded. The gamma spectra are evaluated jointly based on a model of evaluation which accounts for the different measurement positions and angles [Citation4]. In order to quantitatively evaluate the neutron flux in the measurement chamber, additional neutron detectors are included in QUANTOM. Gamma spectra are recorded using high purity Germanium detectors (HPGe), while the total neutron flux is measured using 3He counter tubes, as shown in Figure .

Figure 1. Cut-view of the QUANTOM measurement device (taken from [Citation4]).

The standard use of PGNAA comprises the analysis of small samples at research reactors [Citation5]. In the presence of a strong neutron flux and a small sample size, a constant neutron spectrum can be assumed in the sample. Then the model of evaluation reduces to a simple proportionality of element mass $m_{l}$ (index l indicating a specific element) and net photopeak count rate $P_{E_{γ, l}}$ which is obtained by evaluating the gamma spectrum. The proportionality constant contains the so-called photopeak-efficiency $ϵ_{E_{γ, l}}$ (fraction of gammas which are detected at full energy), $σ_{E_{γ, l}}$ , the so-called partial (n,γ)-cross section, total neutron flux in the sample Φ and the constant fraction of Avogadro constant $N_{A}$ and molar mass $M_{l}$ : (1) $P_{E_{γ, l}} = \frac{N_{A}}{M_{l}} \cdot m_{l} \cdot ϵ_{E_{γ, l}} \cdot σ_{E_{γ, l}} \cdot Φ .$ (1) Hence, element masses can be obtained by inverting the proportionality. The computation of neutron flux and the photopeak-efficiency will be discussed in detail in Section 2.

In more complex cases, where the dependence of the neutron flux on the elemental composition can not be neglected, PGNAA evaluations are solved using an iterative formulation. Hereby, the neutron flux for a given elemental composition is computed and, using Equation (Equation1(1) $P_{E_{γ, l}} = \frac{N_{A}}{M_{l}} \cdot m_{l} \cdot ϵ_{E_{γ, l}} \cdot σ_{E_{γ, l}} \cdot Φ .$ (1) ), the updated elemental composition is determined. The element masses in the $(n + 1)$ -th iteration are thus computed as follows: (2) $(m_{l})_{n + 1} = \frac{P_{E_{γ, l}} M_{l}}{N_{A} \cdot ϵ_{E_{γ, l}} \cdot σ_{E_{γ, l}} \cdot Φ [(m_{1})_{n}, \dots, (m_{L})_{n})]},$ (2) where L is the total number of elements. The iterative process terminates when the relative change of compositions between two iterations becomes neglectable. In [Citation7,Citation8] this approach has been studied for the evaluation of one single PGNAA measurement using a HPGe detector. Based on a relative formulation the authors were able to show the existence of a solution under mild assumptions. Nevertheless, the uniqueness of a solution cannot be guaranteed in this setting, although, in accompanying experimental studies ambiguous solutions did not occur. The setup of our paper differs in several aspects from the one considered in [Citation7,Citation8]. The sample volume in QUANTOM is larger than considered in standard literature before, hence spatial distribution of neutron flux and photopeak efficiencies need to be carefully represented in the simulation. Measurements for the neutron flux in QUANTOM present additional information regarding the reconstruction of elemental compositions. Furthermore, the problem formulation of [Citation8], where relative compositions are determined, cannot be applied to the more complex measurement task of QUANTOM. The model of evaluation in this work therefore aims at the determination of absolute quantities.

Hence, we present an alternative to the standard iterative approach which is motivated by the spatially resolved measurement and additional measurement equipment for the neutron flux. We consider a mathematical formulation of the evaluation of QUANTOM measurements as a parameter estimation problem. Based on this formulation, we make use of gradient based methods to solve for the elemental composition. We only consider the parameter estimation problem with respect to neutron transport model, the dependence of photon transport to element composition is much weaker and can therefore be neglected.

This problem is studied in a simplified setting which makes it possible to also address the question of uniqueness of a solution. The material is assumed to be homogeneous, so only one measurement position of the HPGe detector is considered. Both types of detectors, for gamma spectra and total neutron flux, are considered in this setting. Elemental compositions are represented by the vector ρ of all corresponding elemental densities $ρ_{l}$ , which are parameters of the neutron transport equation and therefore a natural candidate for the reconstruction. In this setting we are able to prove uniqueness of solutions to the parameter estimation problem. To our knowledge, this is the first mathematically rigorous result for PGNAA applications. Throughout this work we restrict the numerical setting to two space dimensions. This is sufficient to study and present all major effects arising in the problem setup. The problem setup is in parts comparable to Diffuse Optical Tomography (DOT), as a similar governing equation is used as an approximate model for particle transport. The connection between the two problems are discussed in the conclusive part of this paper.

This paper is structured as follows. In Section 2 we introduce the SP₁-approximation of the neutron transport equation and model functions for the detector response which, combined, represent the forward model. Section 3 comprises a formulation of the parameter estimation problem for QUANTOM in the form of a PDE-constrained optimization problem and a solution method via a formal Lagrange method. The key results of this work are contained in Section 4, where we prove uniqueness of solutions to the parameter estimation problem. In the following Section 5, we describe the computational implementation of the parameter estimation problem, based on an already existing neutron transport solver. Numerical results which support our findings are presented in Section 6, and we conclude the paper in Section 7 by discussing both the generalizability of our results and remaining open questions.

2. Forward model

As introduced in Section 1, two different kinds of detectors are used in QUANTOM to detect the total neutron flux and the energy resolved emitted gammas, respectively. For both detector types a model for the counting rates is required. Figure shows the abstract measurement setup. Neutrons are constantly emitted by a neutron source. As a result, a steady state neutron flux field is formed in the graphite and the sample, while the graphite moderator provides a thermal neutron spectrum both in the graphite and the sample. It is important to note here, that the neutron flux field depends on the elemental composition of the sample, since the neutrons directly interact with its atoms. Both the sample and graphite are activated by the free neutrons, resulting in element-specific gamma emissions. The detectors are placed in proximity to the sample in order to collect the related measurement signal. Activation of the graphite can be neglected since scattering is dominant in the moderator. Furthermore, detectors can be collimated to the sample to further decrease this effect.

Figure 2. Abstract sketch of the measurement setup, including the neutron source, sample and both types of detectors in a surrounding graphite moderator.

In order to evaluate the counting rates of the detectors, it is necessary to introduce models for the gamma and photon transport first. The transport of gammas and neutrons can be described by the linear Boltzmann equation [Citation9]. For both types of transport we make use of approximate models. For the neutron transport equation, the SP₁-equation, also known as neutron diffusion equation, is used as an approximation. The applicability of the SP₁-equation for this setup was shown in a previous publication [Citation6]. Note that in this work we apply the one-group SP₁-equation, since we can assume a thermal neutron flux. The equation then reads (3) $- \nabla \cdot (\frac{1}{3 Σ_{t} (x)} ∇ϕ (x)) + Σ_{a} (x) ϕ (x) = Q (x) .$ (3) with the Marshak boundary conditions (4) $\frac{1}{2} ϕ (x) + \frac{1}{3 Σ_{t} (x)} \frac{\partial}{\partial n} ϕ (x) = 0$ (4) where $ϕ (x)$ is the neutron flux and $Q (x)$ the source term while $Σ_{t} (x)$ and $Σ_{a} (x)$ are the total and absorption cross sections of the neutrons. It is important to note that the total and absorption cross sections within the sample can be expressed using the elemental densities $ρ_{l}$ as well as the total and absorption cross sections of the elements, $σ_{tl}$ and $σ_{al}$ , while these cross sections are independent of ρ anywhere else, leading to the representation: (5) $\begin{aligned} Σ_{t} (x) & = {\begin{cases} \sum_{l} σ_{tl} ρ_{l} & x \in V_{sam} \\ M_{t} (x) & else \end{cases} \\ Σ_{a} (x) & = {\begin{cases} \sum_{l} σ_{al} ρ_{l} & x \in V_{sam} \\ M_{a} (x) & else \end{cases} \end{aligned}$ (5) Here $M_{t} (x)$ and $M_{a} (x)$ are piecewise constant functions that represent the cross sections in the actual facility. The dependence of the cross sections on ρ in the sample leads to a non-linearity of this parameter estimation problem.

According to Equation (Equation1(1) $P_{E_{γ, l}} = \frac{N_{A}}{M_{l}} \cdot m_{l} \cdot ϵ_{E_{γ, l}} \cdot σ_{E_{γ, l}} \cdot Φ .$ (1) ), photon transport is relevant for the evaluation of a single peak as it determines the so-called photo-peak efficiency $ϵ_{E_{γ, l}}$ . This quantity represents the fraction of gammas which are detected at full energy. Consequently, scattered photons do not contribute to $ϵ_{E_{γ, l}}$ , allowing relatively simple models to calculate photo-peak efficiencies, rather than employing full photon transport simulations. The main effects which have to be considered are absorption, the geometry of the measurement setup and detector physics. Photopeak efficiency is a parameter dependent on gamma energy. Therefore, evaluating a measurement requires either point-wise values for all gamma energies considered or determining a (gamma-)energy-dependent calibration curve for a given measurement facility or setup, accounting for all these effects. This calibration curve provides photopeak efficiencies for any given gamma-energy and can be obtained experimentally, by measuring at given energies using calibration emitters and then interpolating between measured values. Alternatively, calibration curves can be determined using either analytical models or Monte Carlo simulations. As prompt gamma energies range between 50 keV up to 12 MeV, experimental calibration is of limited use, making modeling or simulation-based calibrations preferable. Optimal performance is typically achieved using Monte Carlo simulations with MCNP [Citation10] to account for the intricate physics within a HPGe detector. Further details on the calibration of gamma detectors can be found in [Citation11].

In principle, the photopeak efficiency also depends on the composition of the sample, which changes in every iteration of the evaluation (Equation (Equation2(2) $(m_{l})_{n + 1} = \frac{P_{E_{γ, l}} M_{l}}{N_{A} \cdot ϵ_{E_{γ, l}} \cdot σ_{E_{γ, l}} \cdot Φ [(m_{1})_{n}, \dots, (m_{L})_{n})]},$ (2) )) (as the current result of the measurement, not the physical composition!). However, the tabulated values in [Citation12] suggest that the mass attenuation coefficients for most of the materials relevant for the application are similar. Therefore, the sample's impact on the photo-peak efficiency is primarily governed by its average density, a known parameter that remains constant during the evaluation. Thus, we assume photo-peak efficiencies to be constant throughout the iterative evaluation.

In this work, we employ a simplistic model for the photopeak efficiencies in order to simplify notations and calculations, but all results remain valid when using experimentally obtained calibration curves, more sophisticated analytical models, or calibration curves based on extensive Monte Carlo simulations. The photopeak efficiency is given by (6) $ϵ_{E_{γ, l}} (x) = ϵ_{geom, E_{γ, l}} (x) \cdot ϵ_{att, E_{γ, l}} (x) \cdot ϵ_{\det, E_{γ, l}},$ (6) where $ϵ_{att, E_{γ, l}} (x)$ is the photon attenuation, $ϵ_{geom, E_{γ, l}} (x)$ is the geometric efficiency and $ϵ_{\det, E_{γ, l}}$ is the detector efficiency. It should be noted that the photopeak efficiency is dependent of the location x where the photon is emitted. Since there exists no closed analytical expression for the full photopeak efficiency, we use a model that captures its main features. The geometric efficiency in this setup is given by (7) $ϵ_{geom, E_{γ, l}} (x) = \frac{1}{π} \arctan (\frac{r_{\det} (x)}{d_{\det} (x)}),$ (7) where $r_{\det} (x)$ is the radius of the detector and $d_{\det} (x)$ the distance to the center of the detector. The photon attenuation is given by (8) $ϵ_{att, E_{γ, l}} (x) = e^{- μ \cdot d (x)},$ (8) where µ is the attenuation coefficient and d the traversed distance. Hereby the attenuation coefficient is mainly determined by the average density of the material, which can be easily determined in practice. The detector efficiency $ϵ_{\det, E_{γ, l}}$ can be interpreted as the probability for a photon entering the detector to contribute to the measured signal and has to be determined by photon transport simulations. For a given energy and detector, it is a constant. For simplicity it is set to one in the following.

Using both the neutron flux $ϕ (x)$ and the photopeak efficiency $ϵ_{E_{γ, l}}$ , we can define the counting rates of the detectors. The neutron detectors measure neutron counting rates given by (9) $R_{n} [ϕ] = \int_{V_{\det, n}} σ_{d} \cdot ϕ (x) d x,$ (9) where $σ_{d}$ is the so-called detector cross section. The integral is carried out over the volume of the nth detector $V_{\det, n}$ .

In contrast to the neutron detectors, the gamma detectors do resolve the energy of the measured photons. The resulting gamma spectrum can then be analyzed for element-specific photopeaks. Assuming a homogeneous sample, the counting rate for photons forming a photopeak at energy $E_{γ}$ specific for element l is given by (10) $P_{E_{γ, l}} [ρ, ϕ] = \int_{V_{sam}} ϵ_{E_{γ, l}} ρ_{l} σ_{E_{γ, l}} ϕ (x) d x,$ (10) where $σ_{E_{γ, l}}$ is the so-called partial (n,γ)-cross-section and $ρ_{l}$ is the density of the element l within the sample. Equation (Equation10(10) $P_{E_{γ, l}} [ρ, ϕ] = \int_{V_{sam}} ϵ_{E_{γ, l}} ρ_{l} σ_{E_{γ, l}} ϕ (x) d x,$ (10) ) is a generalization of Equation (Equation1(1) $P_{E_{γ, l}} = \frac{N_{A}}{M_{l}} \cdot m_{l} \cdot ϵ_{E_{γ, l}} \cdot σ_{E_{γ, l}} \cdot Φ .$ (1) ), accounting for the additional spatial variation of the parameters $ϵ_{E_{γ, l}}$ and ϕ. The integral in Equation (Equation10(10) $P_{E_{γ, l}} [ρ, ϕ] = \int_{V_{sam}} ϵ_{E_{γ, l}} ρ_{l} σ_{E_{γ, l}} ϕ (x) d x,$ (10) ) is carried out over the volume where the gamma photons are produced, the sample volume $V_{sam}$ .

3. Parameter estimation problem

The parameter estimation problem to be considered in this work is the following: given measurements from the neutron and gamma detectors, determine the elemental densities $ρ_{l}$ . These are themselves parameters of the neutron transport equation. In the following subsections we present a rigorous formulation and a corresponding solution method.

3.1. Formulation as PDE-constrained optimization problem

The parameter estimation problem can be formulated as a PDE-constrained optimization problem with SP₁-equation as a constraint. Therefore, it can be written as an optimal control problem [Citation13] with the vector of the material densities ρ as the control variable and the neutron flux ϕ as the state variable.

The forward model provides a map from parameter to data space. Given the measurements for the peak counting rates $\bar{P_{E_{γ, l}}}$ and the neutron counting rates $\bar{R_{n}}$ , an objective function including peak counting rates for all elements and the counting rates for all neutron detectors can be defined : (11) $J_{\det} [ρ, ϕ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ϕ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ϕ] - \bar{R_{n}})^{2} .$ (11) Here, $w_{γ}$ and $w_{d}$ are weights that determine the contribution of the different detectors to the objective function, while L is the number of elements within the sample and N the number of neutron detectors. The weight of the neutron detectors is set to $w_{d} = \frac{1}{2}$ , while the weight of the gamma detectors is set to $w_{γ} = \frac{3}{4}$ to ensure an equal weight of both terms despite the higher number of neutron detectors. In addition, we include a term punishing deviations from the known total mass $\bar{M}$ : (12) $J_{m} [ρ] = \frac{w_{m}}{{\bar{M}}^{2}} (c_{1} ρ_{1} + c_{2} ρ_{2} - \bar{M})^{2}$ (12) Here $c_{1}$ and $c_{2}$ are material-specific constants and $w_{m} = \frac{1}{2}$ . Since the total mass $\bar{M}$ can be measured with high accuracy, we assume in the following that the error on the total mass is negligible. This term effectively acts as a regularization. Its effect is investigated in Section 6.2.

The complete objective function is then given by (13) $J [ρ, ϕ] = J_{\det} [ρ, ϕ] + J_{m} [ρ] .$ (13) The parameter estimation problem can then be formulated as minimizing the objective function with the SP₁-equation as a constraint, so we want to solve the problem (14) $\begin{aligned} min_{ρ} J [ρ, ϕ] subjectto D (x) ϕ (x) & = Q (x) in D \\ and \frac{1}{2} ϕ (x) + \frac{1}{3 Σ_{t} (x)} \frac{\partial}{\partial n} ϕ (x) & = 0 on ∂D . \end{aligned}$ (14) The differential operator in Equation (Equation14(14) $\begin{aligned} min_{ρ} J [ρ, ϕ] subjectto D (x) ϕ (x) & = Q (x) in D \\ and \frac{1}{2} ϕ (x) + \frac{1}{3 Σ_{t} (x)} \frac{\partial}{\partial n} ϕ (x) & = 0 on ∂D . \end{aligned}$ (14) ) is defined by (15) $D (x) ϕ (x) = - \nabla \cdot (\frac{1}{3 Σ_{t} (x)} ∇ϕ (x)) + Σ_{a} (x) ϕ (x),$ (15) while D is the domain with corresponding boundary $∂D$ . Outside the sample, $Σ_{t}$ and $Σ_{a}$ are independent of ρ. Thus, the differential operator D depends on ρ on a part of the domain.

3.2. Solution via formal Lagrange method

In order to solve the problem posed in Equation (Equation14(14) $\begin{aligned} min_{ρ} J [ρ, ϕ] subjectto D (x) ϕ (x) & = Q (x) in D \\ and \frac{1}{2} ϕ (x) + \frac{1}{3 Σ_{t} (x)} \frac{\partial}{\partial n} ϕ (x) & = 0 on ∂D . \end{aligned}$ (14) ), we use a formal Lagrange method, from which we derive an expression for the gradient of the objective function, which in turn can be used for a gradient based optimization scheme. In the following, D is the computational domain with the corresponding boundary $∂D$ .

The Lagrangian for this problem is given by the objective function with a Lagrange multiplier $ϕ^{⋆}$ and the SP₁-equation as a constraint: (16) $L [ρ, ϕ, ϕ^{⋆}] = J [ρ, ϕ] - \int_{D} ϕ^{⋆} (x) (D (x) ϕ (x) - Q (x)) d x .$ (16) By formal integration by parts and using Equation (Equation4(4) $\frac{1}{2} ϕ (x) + \frac{1}{3 Σ_{t} (x)} \frac{\partial}{\partial n} ϕ (x) = 0$ (4) ), we obtain (17) $\begin{aligned} L [ρ, ϕ, ϕ^{⋆}] & = J [ρ, ϕ] - \int_{D} ϕ (x) (D (x) ϕ^{⋆} (x)) d x - \int_{D} Q (x) ϕ^{⋆} (x) d x \\ + \int_{\partial D} \frac{1}{2} ϕ (x) ϕ^{⋆} (x) + \frac{1}{3 Σ_{t} (x)} ϕ (x) \frac{\partial}{\partial n} ϕ^{⋆} (x) d s . \end{aligned}$ (17) Since the operator $D (x)$ is self-adjoint, i.e. $D (x) = D^{⋆} (x)$ , we can identify $ϕ^{⋆}$ with the adjoint flux. The boundary terms in Equation (Equation17(17) $\begin{aligned} L [ρ, ϕ, ϕ^{⋆}] & = J [ρ, ϕ] - \int_{D} ϕ (x) (D (x) ϕ^{⋆} (x)) d x - \int_{D} Q (x) ϕ^{⋆} (x) d x \\ + \int_{\partial D} \frac{1}{2} ϕ (x) ϕ^{⋆} (x) + \frac{1}{3 Σ_{t} (x)} ϕ (x) \frac{\partial}{\partial n} ϕ^{⋆} (x) d s . \end{aligned}$ (17) ) need to vanish. This is only the case if the relation (18) $\frac{1}{2} ϕ^{⋆} (x) + \frac{1}{3 Σ_{t} (x)} \frac{\partial}{\partial n} ϕ^{⋆} (x) = 0$ (18) is fulfilled on the boundary. The derivatives of the Lagrangian can be used within an iterative scheme to reconstruct the elemental densities ρ. Differentiation of Equation (Equation16(16) $L [ρ, ϕ, ϕ^{⋆}] = J [ρ, ϕ] - \int_{D} ϕ^{⋆} (x) (D (x) ϕ (x) - Q (x)) d x .$ (16) ) with respect to the adjoint flux $ϕ^{⋆}$ yields the weak form of the forward SP₁-equation, while the differentiation with respect to the flux ϕ yields the weak form of the adjoint SP₁-equation (19) $D^{⋆} (x) ϕ^{⋆} (x) = Q^{⋆} (x)$ (19) with the adjoint source (20) $\begin{aligned} Q^{⋆} = \frac{δJ}{δϕ} [ρ, ϕ] \\ = \sum_{l = 1}^{L} \frac{2 w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ϕ] - \bar{P_{E_{γ, l}}}) \int_{V_{sam}} ϵ_{E_{γ, l}} (x) ρ_{l} σ_{E_{γ, l}} d x \\ + \sum_{n = 1}^{N} \frac{2 w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ϕ] - \bar{R_{n}}) \int_{V_{\det, n}} σ_{d} d x . \end{aligned}$ (20) and the boundary condition defined in Equation (Equation18(18) $\frac{1}{2} ϕ^{⋆} (x) + \frac{1}{3 Σ_{t} (x)} \frac{\partial}{\partial n} ϕ^{⋆} (x) = 0$ (18) ). The derivative of Equation (Equation16(16) $L [ρ, ϕ, ϕ^{⋆}] = J [ρ, ϕ] - \int_{D} ϕ^{⋆} (x) (D (x) ϕ (x) - Q (x)) d x .$ (16) ) with respect to the density of the lth element is given by (21) $\begin{aligned} \frac{δL}{δ ρ_{l}} [ρ, ϕ, ϕ^{⋆}] & = \frac{σ_{t, l}}{\sum_{j} σ_{t, j} ρ_{j}} \int_{V_{sam}} ∇ϕ \cdot \nabla ϕ^{⋆} d x - σ_{a, l} \int_{V_{sam}} ϕ ϕ^{⋆} d x \\ + \frac{2 w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (\int_{V_{sam}} ϵ_{E_{γ, l}} σ_{E_{γ, l}} ρ_{l} ϕ d x - \bar{P_{E_{γ, l}}}) \int_{V_{sam}} ϵ_{E_{γ, l}} σ_{E_{γ, l}} ϕ d x \\ + \frac{2 w_{m} c_{l}}{{\bar{M}}^{2}} (c_{1} ρ_{1} + c_{2} ρ_{2} - \bar{M}) . \end{aligned}$ (21) Using the results for ϕ and $ϕ^{⋆}$ , the gradient of the Lagrangian with respect to the different elemental density $ρ_{l}$ is computed and can be used in a gradient-based optimization scheme.

4. Uniqueness

In this chapter, we investigate the uniqueness of the solution of the unperturbed parameter estimation problem, i.e. without noise in the measurement data. First, we introduce a change of variables to simplify the discussion. We then investigate the objective function to motivate our proof for the uniqueness result that we finally present.

4.1. Change of variables

Within this chapter, we introduce a change of variables [Citation14] to simplify the following discussion: (22) $ψ = γϕ with γ = \frac{1}{\sqrt{3 Σ_{t}}} .$ (22) By substituting Equation (Equation22(22) $ψ = γϕ with γ = \frac{1}{\sqrt{3 Σ_{t}}} .$ (22) ) into (Equation3(3) $- \nabla \cdot (\frac{1}{3 Σ_{t} (x)} ∇ϕ (x)) + Σ_{a} (x) ϕ (x) = Q (x) .$ (3) ) we obtain the Helmholtz-like equation (23) $\nabla^{2} ψ + ηψ = \frac{Q}{γ} with η = \frac{\nabla^{2} γ}{γ} + \frac{Σ_{a}}{γ^{2}} .$ (23) Since the cross sections as defined in Equation (Equation5(5) $\begin{aligned} Σ_{t} (x) & = {\begin{cases} \sum_{l} σ_{tl} ρ_{l} & x \in V_{sam} \\ M_{t} (x) & else \end{cases} \\ Σ_{a} (x) & = {\begin{cases} \sum_{l} σ_{al} ρ_{l} & x \in V_{sam} \\ M_{a} (x) & else \end{cases} \end{aligned}$ (5) ) are dependent on ρ only within the sample volume $V_{sam}$ , both the source term and the boundary conditions are independent of ρ. The parameter η in Equation (Equation23(23) $\nabla^{2} ψ + ηψ = \frac{Q}{γ} with η = \frac{\nabla^{2} γ}{γ} + \frac{Σ_{a}}{γ^{2}} .$ (23) ) is only varied within $V_{sam}$ and effectively reduces there to (24) $η = \frac{Σ_{a}}{γ^{2}} = 3 Σ_{a} Σ_{t}$ (24) due to the homogeneity of the sample. For the investigation of uniqueness, the term punishing deviations from the total mass M is not needed, so we set $w_{m}$ in Equation (Equation12(12) $J_{m} [ρ] = \frac{w_{m}}{{\bar{M}}^{2}} (c_{1} ρ_{1} + c_{2} ρ_{2} - \bar{M})^{2}$ (12) ) to zero. Using the definition of ψ in Equation (Equation22(22) $ψ = γϕ with γ = \frac{1}{\sqrt{3 Σ_{t}}} .$ (22) ), we then obtain a transformed cost functional (25) $J [ρ, ψ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ψ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ψ] - \bar{R_{n}})^{2} .$ (25) with neutron counting rates given by (26) $R_{n} [ψ] = \int_{V_{\det, n}} σ_{d} \frac{1}{γ} ψ (x) d x,$ (26) and gamma counting rates given by (27) $P_{E_{γ, l}} [ρ, ψ] = \int_{V_{sam}} ϵ_{E_{γ, l}} (x) ρ_{l} σ_{E_{γ, l}} \frac{1}{γ} ψ (x) d x .$ (27)

4.2. Objective function

Figure shows an exemplary objective function where the cross sections of the elements are set to $σ_{t 1} = σ_{t 2} = 1$ , $σ_{a 1} = 0.1$ and $σ_{a 2} = 0.5$ . The different terms in the objective function defined in Equation (Equation25(25) $J [ρ, ψ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ψ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ψ] - \bar{R_{n}})^{2} .$ (25) ) are displayed separately. Figure (a) and (b) show the objective function including only the first and the second photopeak, respectively. It can be seen that there exists no unique minimum in both figures, but instead a large ‘valley’ and thus many admissible values for ρ.

Figure 3. Subfigures (a)–(c) depict the different terms of the objective function after decomposition, subfigure (d) depicts the full objective function. The analytically computed minimum of the objective function of the neutron detectors is green. A logarithmic scale is used for all plots. (a) Objective function including the first Photopeak. (b) Objective function including the second Photopeak. (c) Objective function including all neutron detectors and (d) Full objective Function.

Since the neutron counting rates, as can be seen from Equations (Equation23(23) $\nabla^{2} ψ + ηψ = \frac{Q}{γ} with η = \frac{\nabla^{2} γ}{γ} + \frac{Σ_{a}}{γ^{2}} .$ (23) ) and (Equation26(26) $R_{n} [ψ] = \int_{V_{\det, n}} σ_{d} \frac{1}{γ} ψ (x) d x,$ (26) ), only depend on the parameter η, they can only be used to reconstruct this single parameter, independent of the number of detectors. Thus additional detectors can possibly reduce the measurement error, but cannot reconstruct additional parameters. Therefore, in Figure (c) the sum over the partial objective functions for all detectors is shown. It should be noted that all combinations of $Σ_{t}$ and $Σ_{a}$ that lead to the same parameter η give rise to the same solution ψ, since both the boundary conditions and the source term are independent of ρ. Thus the neutron detectors alone are not sufficient to uniquely reconstruct the material composition of the sample, although they might reconstruct the parameter η. In fact, for this specific problem one can derive an analytic expression that allows to compute the density of one material for a given density of the other material that leads to a desired parameter η. By inserting Equation (Equation5(5) $\begin{aligned} Σ_{t} (x) & = {\begin{cases} \sum_{l} σ_{tl} ρ_{l} & x \in V_{sam} \\ M_{t} (x) & else \end{cases} \\ Σ_{a} (x) & = {\begin{cases} \sum_{l} σ_{al} ρ_{l} & x \in V_{sam} \\ M_{a} (x) & else \end{cases} \end{aligned}$ (5) ) into (Equation24(24) $η = \frac{Σ_{a}}{γ^{2}} = 3 Σ_{a} Σ_{t}$ (24) ) for the case $x \in V_{sam}$ , we obtain a relation between $ρ_{2}$ and $ρ_{1}$ (28) $ρ_{2} = \frac{- (σ_{a 1} σ_{t 2} + σ_{a 2} σ_{t 1}) ρ_{1} + \sqrt{(σ_{a 1} σ_{t 2} + σ_{a 2} σ_{t 1})^{2} ρ_{1}^{2} - 4 (σ_{a 1} σ_{t 1} ρ_{1}^{2} - η / 3) σ_{a 2} σ_{t 2}}}{2 σ_{a 2} σ_{t 2}}$ (28) If the value of η belonging to the minimum is used, Equation (Equation28(28) $ρ_{2} = \frac{- (σ_{a 1} σ_{t 2} + σ_{a 2} σ_{t 1}) ρ_{1} + \sqrt{(σ_{a 1} σ_{t 2} + σ_{a 2} σ_{t 1})^{2} ρ_{1}^{2} - 4 (σ_{a 1} σ_{t 1} ρ_{1}^{2} - η / 3) σ_{a 2} σ_{t 2}}}{2 σ_{a 2} σ_{t 2}}$ (28) ) can be used to compute the expected position of the minimal values. It is depicted in Figure (c) as the green curve. It can be seen that the green curve lies within the valley of the numerical minimal values.

From Figure (a)–(c) it is apparent that neither the measurements of the different gamma lines nor the neutron flux measurements alone have a unique minimum. In combination however, we obtain an objective function with a minimum contained within a small region, well suitable for a reconstruction, as depicted in Figure (d).

4.3. Uniqueness of the parameter estimation problem

In the following, we prove our main result regarding the uniqueness of a solution to the considered parameter estimation problem. Throughout, we implicitly assume existence of a solution.

Theorem 4.1

Let $J [ρ, ψ]$ be the objective function as defined in Equation (Equation25(25) $J [ρ, ψ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ψ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ψ] - \bar{R_{n}})^{2} .$ (25) ), ψ being a solution of Equation (Equation23(23) $\nabla^{2} ψ + ηψ = \frac{Q}{γ} with η = \frac{\nabla^{2} γ}{γ} + \frac{Σ_{a}}{γ^{2}} .$ (23) ), whereas $ρ, Σ_{t}, Σ_{a} > 0$ . Then J has a unique minimum.

The proof of Theorem 4.1 relies on the following two lemmas, which are consequences of the weak and strong maximum principle, respectively.

Lemma 4.2

Let D be a bounded domain with smooth boundary $∂D$ . Let u be a weakly differentiable function satisfying a mild regularity assumption (cf. Appendix A for details) and (29) $- Δu (x) + η (x) u (x) = Q (x) in D$ (29) with bounded, non-negative η and Q that are strictly positive on a subset of D that is not a null set. Then u is strictly positive in D.

Lemma 4.3

Let D be a bounded domain with smooth boundary $∂D$ . Then the problem (30) $\begin{aligned} - Δu (x) + α (x) u (x) & = Q (x) in D \\ u (x) + β (x) \frac{\partial u}{\partial n} (x) & = 0 on ∂D \end{aligned}$ (30) has at most one solution u if $α (x), β (x) > 0$ .

The proofs of both lemmas are carried out in Appendices A and B. We can now present the proof of the main result.

Proof.

In the following, the different terms in the objective function (Equation (Equation25(25) $J [ρ, ψ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ψ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ψ] - \bar{R_{n}})^{2} .$ (25) )) are investigated separately. We define $J_{P, \min}$ as the set of values ρ that minimizes the first term in Equation (Equation25(25) $J [ρ, ψ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ψ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ψ] - \bar{R_{n}})^{2} .$ (25) ) (corresponding to the gamma detectors) and $J_{R, \min}$ as the set of values ρ the minimizes the second term in Equation (Equation25(25) $J [ρ, ψ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ψ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ψ] - \bar{R_{n}})^{2} .$ (25) ) (corresponding to the neutron detectors). Since both terms are positive, the intersection of $J_{P, \min}$ and $J_{R, \min}$ is then the set of values that minimizes the full objective function. The proof works as follows: showing that there exists a unique solution $ψ_{\min}$ that minimizes $J_{R, \min}$ using monotonicity arguments, we find explicit expressions for the elemental densities $ρ_{l}$ which are again unique.

Since the ψ is determined by the parameter η, the neutron counting rates defined in Equation (Equation26(26) $R_{n} [ψ] = \int_{V_{\det, n}} σ_{d} \frac{1}{γ} ψ (x) d x,$ (26) ) can be seen as implicitly dependent of the parameter η. In the following we show that ψ and thus the counting rates are strictly monotone in η. We assume in the following that we have two parameters with $\tilde{η} > η$ , and corresponding $\tilde{ψ}$ and ψ, such that Equation (Equation23(23) $\nabla^{2} ψ + ηψ = \frac{Q}{γ} with η = \frac{\nabla^{2} γ}{γ} + \frac{Σ_{a}}{γ^{2}} .$ (23) ) is satisfied: (31) $\begin{aligned} - \nabla \cdot ∇ψ + ηψ & = \frac{Q}{γ} \\ - \nabla \cdot \nabla \tilde{ψ} + \tilde{η} \tilde{ψ} & = \frac{Q}{γ} . \end{aligned}$ (31) By subtracting Equation (Equation31(31) $\begin{aligned} - \nabla \cdot ∇ψ + ηψ & = \frac{Q}{γ} \\ - \nabla \cdot \nabla \tilde{ψ} + \tilde{η} \tilde{ψ} & = \frac{Q}{γ} . \end{aligned}$ (31) ) and defining (32) $λ = ψ - \tilde{ψ} and δ = η - \tilde{η}$ (32) we obtain (33) $- \nabla \cdot ∇λ + ηψ - \tilde{η} \tilde{ψ} = 0,$ (33) which is equivalent to (34) $- \nabla \cdot ∇λ + \tilde{η} λ = - δψ .$ (34) By applying Lemma 4.2, we obtain the strict positivity of ψ and since $δ < 0$ , the strict positivity of λ follows directly with the same lemma. Since λ is strictly positive, we have $\tilde{ψ} < ψ$ for $\tilde{η} > η$ . The counting rate $R_{n} [ψ]$ defined in Equation (Equation26(26) $R_{n} [ψ] = \int_{V_{\det, n}} σ_{d} \frac{1}{γ} ψ (x) d x,$ (26) ) is a positive linear functional of ψ, thus we also obtain monotonicity of $R_{n} [ψ]$ . This ensures that there is just one $η_{\min}$ minimizing $J_{n}$ .

With Lemma 4.3 it follows directly that there is only one $ψ_{\min}$ that minimizes the second term in Equation (Equation25(25) $J [ρ, ψ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ψ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ψ] - \bar{R_{n}})^{2} .$ (25) ), so that $J_{R, \min} = {ρ | η (ρ) = η_{\min}}$ . Now we can compute the intersection of the sets of minima of both terms in Equation (Equation25(25) $J [ρ, ψ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ψ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ψ] - \bar{R_{n}})^{2} .$ (25) ). Inserting $ψ_{\min}$ and using the homogeneity of the sample and the linearity of the integral, Equation (Equation27(27) $P_{E_{γ, l}} [ρ, ψ] = \int_{V_{sam}} ϵ_{E_{γ, l}} (x) ρ_{l} σ_{E_{γ, l}} \frac{1}{γ} ψ (x) d x .$ (27) ) yields (35) $\bar{P_{E_{γ, l}}} = \frac{ρ_{l}}{γ} I_{E_{γ, l}} = ρ_{l} \sqrt{3 \sum_{l^{'}} σ_{t, l^{'}} ρ_{l^{'}}} I_{E_{γ, l}}$ (35) where (36) $I_{E_{γ, l}} = \int_{V_{sam}} ϵ_{E_{γ, l}} (x) σ_{E_{γ, l}} ψ_{\min} (x) d x .$ (36) From this system of L equations, where L is the number of elements, we can, using some algebra, derive linear relations for between the different elemental densities, which are valid for all pairs $l^{'}, l with 1 \leq l^{'}, l \leq L$ : (37) $ρ_{l^{'}} = \frac{\bar{P_{E_{γ, l^{'}}}} I_{E_{γ, l}}}{\bar{P_{E_{γ, l}}} I_{E_{γ, l^{'}}}} ρ_{l} .$ (37) By inserting them in Equation (Equation35(35) $\bar{P_{E_{γ, l}}} = \frac{ρ_{l}}{γ} I_{E_{γ, l}} = ρ_{l} \sqrt{3 \sum_{l^{'}} σ_{t, l^{'}} ρ_{l^{'}}} I_{E_{γ, l}}$ (35) ), we obtain for each element l the relation (38) $\bar{P_{E_{γ, l}}} = ρ_{l}^{3 / 2} \sqrt{3 \sum_{l^{'}} σ_{t, l^{'}} \frac{\bar{P_{E_{γ, l^{'}}}} I_{E_{γ, l}}}{\bar{P_{E_{γ, l}}} I_{E_{γ, l^{'}}}}} I_{E_{γ, l}} .$ (38) Since all expressions apart from $ρ_{l}$ are constant, it is apparent that there is exactly one value for each $ρ_{l}, 1 \leq l \leq L$ , that satisfies the equation. As a consequence, $J_{\min}$ contains just one point.

It should be noted that the argument would still be valid if a different model had been chosen for the gamma counting rates defined in Equation (Equation6(6) $ϵ_{E_{γ, l}} (x) = ϵ_{geom, E_{γ, l}} (x) \cdot ϵ_{att, E_{γ, l}} (x) \cdot ϵ_{\det, E_{γ, l}},$ (6) ). The generalization to heterogeneous media is discussed in the concluding Section 7.

5. Implementation

For the neutron transport computations, a neutron transport code named SPARC [Citation6] is used. We apply the finite element method for the spatial discretization of Equation (Equation3(3) $- \nabla \cdot (\frac{1}{3 Σ_{t} (x)} ∇ϕ (x)) + Σ_{a} (x) ϕ (x) = Q (x) .$ (3) ), to handle the geometric complexity of the underlying model. Furthermore, the finite element method is well suited for Poisson-like equations, as the SP₁-equation. The FEniCS library [Citation15,Citation16] with the automatic mesh generator Gmsh [Citation17] is used for the implementation. We use PETSc [Citation18] as linear algebra backend in conjunction with hypre [Citation19], a library of high performance preconditioners. For this work we use a combination of a gmres solver [Citation20] and an algebraic multigrid preconditioner [Citation21]. Since the SP₁-equation is self-adjoint, a modification of the source term is sufficient to also compute the adjoint neutron flux. The gradient of the Lagrangian can then be computed as detailed out in Section 3.

The model used for the following simulation studies is shown in Figure . It depicts a zoom into the relevant part of the computational domain. The remaining parts are filled solely with more moderating material. It depicts the moderator (green), the sample (yellow), the gamma detector (purple), the neutron detectors (red) and on the left side the source (in blue). Figure (a) shows the computational domain, while Figure (b) shows the Finite Element Mesh used for the reconstruction including approx. 84.000 Finite Elements.

Figure 4. Computational Model used in SPARC including the different regions of the computational domain (left side) and the Finite Element mesh used for the reconstruction (right side). Depicted are the moderator (green), the sample (yellow), the gamma detector (purple), the neutron detectors (red) and on the left side the source (in blue). (a) Computational Domain and (b) Computational Mesh used for Reconstruction.

6. Numerical results

In this section, we present numerical results which show the applicability of the herein developed theory. In Subsection 6.1, we first present a comparison of the gradient computed with the Optimal Control formalism as introduced in Subsection 3.2 and a Finite Difference approach in order to verify our implementation and the adjoint calculus in Section 3. Thereafter, we show exemplary minimizations for which the computed gradient is used within an L-BFGS-B scheme. In Subsection 6.2 we study both the stability of the reconstruction under given measurement errors as well as the applicability of our method for different materials. Throughout our simulation studies we have used fixed numerical grids, but the investigation of the effect of measurement noise of varying intensities in Subsection 6.2 shows that we are not running into problems related to the so-called inverse crime in accordance with [Citation22].

6.1. Verification of the implementation

In order to verify the implementation, the gradient of the objective function is computed with both Finite Differences and the Optimal Control formalism. Figure depicts an exemplary case which compares the gradient components along the line segment defined by $ρ_{2} = 1 - ρ_{1}$ , $ρ_{1} \in [0, 1]$ . In the upper subfigure the absolute value of the gradient components is taken so that a logarithmic scale can be applied for better visualization. The plot shows a very good agreement between the results for both $ρ_{1}$ and $ρ_{2}$ . In the lower subfigure, the difference between the gradients computed with both methods is depicted. The relative error is smaller than $0.1 %$ for both $ρ_{1}$ and $ρ_{2}$ .

Figure 5. Comparison of the partial derivatives of the objective function computed with Finite Differences and the Optimal Control Formalism along the line segment defined by $ρ_{2} = 1 - ρ_{1}$ , $ρ_{1} \in [0, 1]$ . (a) Comparison of the gradient components computed with Finite Differences and the Optimal Control Formalism and (b) Relative error of the gradient components computed with the Optimal Control formalism.

While Figure depicts the verification of the gradient itself, Figure depicts the gradient-based minimization process. The gradients computed with the Optimal Control formalism are used to carry out the minimization using the L-BFGS-B [Citation23] algorithm implemented in SciPy [Citation24]. Figure depicts the objective function as well as three exemplary minimizations. The minimizations are represented by the blue, cyan and green lines, while the objective function is represented by a contour plot. It should be noted that each colour in the contour plot represents an order of magnitude, so a major challenge here is to minimize a function which values vary over many orders of magnitude. For each case, the minimum (purple) is found within only a few iterations.

Figure 6. The logarithmised objective function with blue, cyan and green lines showing three exemplary minimizations using the L-BFGS-B algorithm.

6.2. Simulation studies

In order to investigate the stability of the reconstruction under measurement errors, a simulation study has been carried out with normally distributed errors. The standard deviations were as large as 5, 10, and 20% of the simulated counting rates. For each error level 50 Simulations have been carried out. The reconstructed values are depicted in Figure . The error-free reconstruction is depicted as the black point in the center, while the reconstructed values for ρ are depicted in red, blue, and green, depending on the error level. It can be seen that the reconstructed values for ρ are scattered in a wider area for larger error levels. Table shows the reconstruction errors for the different error levels. The errors are to be understood as mean absolute percentage errors. For all three cases the reconstruction errors are much smaller than the errors on the counting rates. At the same time the number of needed L-BFGS-B iterations is nearly independent from the error on the simulated counting rates.

In order to test the applicability of our method, four cases with different material properties have been investigated. The materials themselves are artificial, but using parameters that are realistic for the application. The first two cases are combinations of a highly scattering and a highly absorbing material in each case. The third and fourth cases investigate combinations of materials with similar neutronics properties, in one case two highly scattering materials and in the other case two highly absorbing materials. The upper half of Table shows the errors of the reconstruction for the four different cases. It can be seen that the reconstruction of ρ works well for all investigated cases. The error on the counting rates is $10 %$ , while the error on the reconstructed densities is typically 5– $6 %$ . In one case the error on $ρ_{2}$ is as low as $2.6 %$ . This seems to be an exception, however.

Figure 7. Reconstructed values of ρ for normally distributed measurements errors. 50 Simulations were carried out for each error level. The average errors on the reconstruction and the number of needed L-BFGS-B iterations can be found in Table .

Table 1. Average absolute errors on the reconstruction of $ρ_{1}$ / $ρ_{2}$ for three different error levels as well as the average number of needed L-BFGS-B iterations.

Display Table

Table 2. Average absolute errors on the reconstruction of $ρ_{1}$ / $ρ_{2}$ for four different material combinations as well as the average number of needed L-BFGS-B iterations.

Display Table

The regularizing effect of the term punishing deviations from the total mass introduced in Equation (Equation12(12) $J_{m} [ρ] = \frac{w_{m}}{{\bar{M}}^{2}} (c_{1} ρ_{1} + c_{2} ρ_{2} - \bar{M})^{2}$ (12) ) can be seen by comparing the upper and the lower half of Table . The lower half shows the reconstruction results with the weight of the term $w_{m}$ set to zero. The same initial and measurement values as before have been used. Without the regularization, the error of the reconstruction massively increases, from values between 2.6 and $6.1 %$ to values between 23.3 and $66.1 %$ . The average number of iterations also increased considerably, from approximately five to six to values between nine and eleven. The second test case is the sole exception. These results underline the importance of the regularization term in order to increase the accuracy of the proposed method.

7. Conclusion and discussion

In this work we have considered a new kind of parameter estimation problem related to PGNAA where two different types of detectors are used, one to record a gamma spectrum and additional detectors for the total neutron flux in direct proximity to the sample. This parameter estimation problem arises in a specific measurement facility, QUANTOM, where radioactive waste drums are analyzed for their elemental composition. We invert a combined neutron-photon transport model to obtain a vector of elemental densities. In a simplified setting considering a homogeneous sample and one-group neutron transport we proved uniqueness of a solution. To our knowledge, this is the first mathematically rigorous proof of uniqueness for a PGNAA problem.

To the current state of the art PGNAA spectra are evaluated in an iterative manner by solving a fixed point problem [Citation7,Citation8]. This naturally reflects the dependence of the elemental composition and the neutron flux distribution in the sample. The method proposed in this work offers an alternative. Formulating the parameter estimation problem as an optimization problem we make use of a gradient descent method to solve for the elemental composition as a parameter of the neutron transport model. As a side effect, our method yields the elemental composition as absolute quantities in grams or kilograms. Furthermore, this approach is especially useful when several measurements are combined, since all measurements can be combined in the objective function. For the measurements in QUANTOM the relative formulation in [Citation7,Citation8] does not apply. Hence, our method offers more flexibility to deal with a broader range of measurement tasks related to PGNAA. In situations where both methods may be applied (standard PGNAA with one measurement and one detector type) it is not obvious which methods yields the best performance in terms of convergence and computation times. This will be a topic of future research.

Furthermore, we see a close connection to the theory of Diffuse Optical Tomography (DOT). In both DOT and the PGNAA problems presented in this paper, the governing equation is a diffusion equation. The measurement of the flux alone is not sufficient to ensure a unique reconstruction. In DOT, it is not possible to reconstruct both the scattering and absorption coefficients from the photon flux measurements alone [Citation14]. Similarly, we have seen that the neutron flux is insufficient to reconstruct the elemental densities, as discussed in Section 4. Only the combination with the energy resolved measurements of the gamma detector, which yields measurement results for each element individually, ensures uniqueness. While in DOT no general uniqueness result exists due to the range of different measurables and unknowns [Citation25], it is still possible to obtain uniqueness results for certain scenarios using additional assumptions on the scattering and absorption coefficients that are to be reconstructed [Citation26]. For PGNAA comparable studies do not exist yet since this work represents the first result in this direction. Therefore, a complete study of possible necessary and sufficient conditions for existence and uniqueness of solutions should be a focus of future work.

The measurement results of the additional 3He neutron counters are included in the objective function (Equation (Equation11(11) $J_{\det} [ρ, ϕ] = \sum_{l = 1}^{L} \frac{w_{γ}}{{\bar{P_{E_{γ, l}}}}^{2}} (P_{E_{γ, l}} [ρ, ϕ] - \bar{P_{E_{γ, l}}})^{2} + \sum_{n = 1}^{N} \frac{w_{d}}{{\bar{R_{n}}}^{2}} (R_{n} [ϕ] - \bar{R_{n}})^{2} .$ (11) )). Monotonicity of the neutron flux with respect to the parameter η leads to monotonicity of the counting rates and this fact was used to prove the main uniqueness result Theorem 4.1. The proof works for any measurement device of the quantity ‘total neutron flux’, as long as its constituting functional, representing the measurement device, is linear and positive with respect to the neutron flux ϕ. This means, that in praxis devices like fission chambers are also adequate to support QUANTOM measurements. Furthermore, the positioning and the number of 3He detectors were not used in the proof. Nevertheless, they might influence both the accuracy and speed of evaluations and are worth to be further investigated in the future to optimize QUANTOM measurements.

The homogeneity of the sample is a key assumption for our proof of uniqueness in Section 4. This case is of practical importance for PGNAA, since most measurements are carried out with homogeneous samples, especially at research reactors [Citation5]. Monitoring of the total neutron flux is applied by monitor materials in this case. Hence, our results may be adapted also for this situation.

In the case of an inhomogeneous sample our results do not guarantee a unique solution. In the QUANTOM facility, this situation is covered by performing numerous measurements at different measurement positions, corresponding to a complete surface scan of the drum by the HPGe detector. Measurements of the total neutron flux by the 3He detectors are also performed in every measurement position. A joint evaluation of all measurement positions shall account for inhomogeneities in the drum. Nevertheless, the rigorous examination of uniqueness of a solution in this situation needs to be performed in the future. It should be noted that the gradient based method in introduced in Section 3 holds for the inhomogeneous case. Since more measurements are necessary, the objective function will include more terms. The method relies then on the assumption of homogeneity in discretized parts of the drum.

Acknowledgments

The authors would like to thank Framatome for the permission to use the cut-view of the QUANTOM facility (Figure ). We would also like to thank René van Geemert (Framatome) for helpful discussions on the topic of parameter estimation problems and neutron transport. We would also like to acknowledge the contribution of Jannick Wolters and Christopher Helmes in the development of the SPARC code, which was used to produce the numerical results for this paper.

Disclosure statement

The authors report there are no competing interests to declare.

Additional information

Funding

The work of A. J. was performed within the project RAPID, funded by the German Federal Ministry of Education and Research (BMBF) under the grant number 033RK094B. The work of A.J. was also funded by the European Regional Development Fund (ERDF) project ZEBRA, funded by the European Union and the federal state of North Rhine-Westphalia under the funding number EFRE-0800566. The work of K.K., when affiliated with AiNT, was performed in parts within the project QUANTOM, funded by the German Federal Ministry of Education and Research (BMBF) under the grant no. 15S9406B, and in project RAPID, funded by the German Federal Ministry of Education and Research (BMBF) under grant no. 033RK094A. The authors are solely responsible for the content of this publication.

References

Filß P. Specific activity of large-volume sources determined by a collimated external gamma detector. Kerntechnik. 1989;54(3):198–201. doi: 10.1515/kern-1989-540326
Web of Science ®Google Scholar
Filß P. Relation between the activity of a high-density waste drum and its gamma count rate measured with an unshielded Ge-detector. Appl Radiat Isot. 1995;48(8):805–812.
Google Scholar
Coquard L, Havenith A, Kettler J, et al. Non-destructive material characterization of radioactive waste packages with QUANTOM. In: WM2020 Conference; 2020 March; Phoenix, Arizona.
Google Scholar
Coquard L, Hummel J, Nordhardt G, et al. Non-destructive verification of materials in waste packages using QUANTOM. EPJ Nuclear Sci Technol. 2023;9(55 doi: 10.1051/epjn/2022043
Google Scholar
Molnar G. Handbook of prompt gamma activation analysis with neutron beams. New York: Springer; 2004. doi: 10.1007/978-0-387-23359-8
Google Scholar
Jesser A, Krycki K, Frank M. Partial Cross-Section calculations for PGNAA based on a deterministic neutron transport solver. Nucl Technol. 2022;208(7):1114–1123. doi: 10.1080/00295450.2021.2016018
Web of Science ®Google Scholar
Holloway J, Akkurt H. The fixed point formulation for large sample PGNAA – part 1: theory. Nucl Instrum Methods Phys Res A. 2004;522:529–544. doi: 10.1016/j.nima.2003.11.401
Web of Science ®Google Scholar
Akkurt H, Holloway J, Smith L. The fixed point formulation for large sample PGNAA – part 2: experimental demonstration. Nucl Instrum Methods Phys Res A. 2004;522:545–557. doi: 10.1016/j.nima.2003.11.400
Web of Science ®Google Scholar
Cercignani C. The boltzmann equation and its applications. Springer New York; 1988. doi: 10.1007/978-1-4612-1039-9
Google Scholar
Rising ME, Armstrong JC, Bolding SR, et al. MCNP® Code Version 6.3.0 Release Notes. Los Alamos (NM): Los Alamos National Laboratory; 2023. LA-UR-22-33103, Rev. 1.
Google Scholar
Gilmore G. Practical Gamma-Ray spectrometry. Wiley; 2008. doi: 10.1002/9780470861981
Google Scholar
Seltzer S. Tables of X-Ray mass attenuation coefficients and mass energy-absorption coefficients, NIST Standard Reference Database 126; 1995.
Google Scholar
Hinze M, Pinnau R, Ulbrich M, et al. Optimization with PDE constraints. Dordrecht: Springer; 2009. doi: 10.1007/978-1-4020-8839-1
Google Scholar
Arridge S, Lionheart W. Nonuniqueness in diffusion-based optical tomography. Opt Lett. 1998;23(11):882–884. doi: 10.1364/OL.23.000882
PubMed Web of Science ®Google Scholar
Alnæs M, Blechta J, Hake J, et al. The FEniCS project version 1.5. Archive Numer Softw. 2015;3(100):9–23.
Google Scholar
Logg A, Mardal K, Wells G. Automated solution of differential equations by the finite element method. Springer; 2012. doi: 10.1007/978-3-642-23099-8
Google Scholar
Geuzaine C, Remacle F. Gmsh: a three-dimensional finite element mesh generator with built-in pre- and post-processing facilities. Int J Numer Methods Eng. 2009;79(11):1309–1331. doi: 10.1002/nme.v79:11
Web of Science ®Google Scholar
Balay S, Abhyankar S, Adams M, et al. PETSc users manual. Argonne National Laboratory; 2020. ANL-95/11 – Revision 3.14.
Google Scholar
Falgout R, Meier Yang U. hypre: a library of high performance preconditioners. In: International Conference on Computational Science; 2002 April 21–24; Amsterdam, The Netherlands: Springer.
Google Scholar
Saad Y, Schultz M. GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems. Siam J Sci Stat Comput. 1986;7:856–869. doi: 10.1137/0907058
Web of Science ®Google Scholar
Ruge J, Stüben K. Algebraic multigrid. In: McCormick S, editor. Multigrid methods, volume 3 of Frontiers in applied mathematics. SIAM; 1987. p. 73–130. doi: 10.1137/1.9781611971057
Google Scholar
Wirgin A. The inverse crime; 2004. doi: 10.48550/arXiv.math-ph/0401050
Google Scholar
Zhu C, Byrd RH, Lu P, et al. Algorithm 778: L-BFGS-B: fortran subroutines for large-scale bound-constrained optimization. ACM Trans Math Softw. 1997;23:550–560. doi: 10.1145/279232.279236
Web of Science ®Google Scholar
Virtanen P, Gommers R, Oliphant TE, et al. SciPy 1.0: fundamental algorithms for scientific computing in python. Nat Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2
PubMed Web of Science ®Google Scholar
Gibson AP, Hebden JC, Arridge SR. Recent advances in diffuse optical imaging. Phys Med Bio. 2005 Feb;50(4):R1. R43doi: 10.1088/0031-9155/50/4/R01
PubMed Web of Science ®Google Scholar
Harrach B. On uniqueness in diffuse optical tomography. Inverse Probl. 2009 Apr;25(5):055010. doi: 10.1088/0266-5611/25/5/055010
Web of Science ®Google Scholar
Gilbarg D, Trudinger N. Elliptic partial differential equations of second order. Berlin: Springer; 2001. (Classics in mathematics).
Google Scholar

Appendix A.

Proof of Lemma 4.2

Let D be a bounded domain with smooth boundary

∂D

. Let

u \in W^{1, 2} (D)

satisfy

- Δu (x) + η (x) u (x) = Q (x) in D

with bounded, non-negative η and Q that are strictly positive on a subset of D that is not a null set. Then u is strictly positive in D.

Proof.

We carry out the proof by constructing a contradiction. Let us first define $u = - λ$ so that Equation (Equation29(29) $- Δu (x) + η (x) u (x) = Q (x) in D$ (29) ) can be rewritten as (A1) $L (x) u (x) = Q (x) in D .$ (A1) with the differential operator defined by (A2) $L (x) λ (x) = (Δ - η (x)) λ (x) .$ (A2) The coefficients of L are bounded and the partial differential equation is strictly elliptic. We also have $\int - ηv d x \leq 0$ for $v (x) \leq 0$ since $η > 0$ and $Lu \geq 0$ since Q is non-negative. Therefore we can apply the strong maximum principle (theorem 8.19 in [Citation27]).

If $sup_{D} u \geq 0$ , then according to the strong maximum principle u is constant in D and for $u \neq 0$ we have (A3) $\int_{D} ηv d x = 0 \forall v \geq 0, v \in C_{0}^{1} (D) .$ (A3) While u = 0 cannot satisfy Equation (EquationA1(A1) $L (x) u (x) = Q (x) in D .$ (A1) ) since Q>0 on a non-null subset of D, $u \neq 0$ cannot satisfy Equation (EquationA3(A3) $\int_{D} ηv d x = 0 \forall v \geq 0, v \in C_{0}^{1} (D) .$ (A3) ) since $η > 0$ for non-null subset of D. Therefore, we have $sup_{D} u < 0$ and thus $inf_{D} λ > 0$ .

Appendix B.

Proof of Lemma 4.3

Let D be a bounded domain with smooth boundary $∂D$ . Then the problem $\begin{aligned} - Δu (x) + α (x) u (x) & = Q (x) in D \\ u (x) + β (x) \frac{\partial u}{\partial n} (x) & = 0 on ∂D \end{aligned}$ has at most one solution u if $α, β > 0$ .

Proof.

Let $u_{1}$ and $u_{2}$ be solutions of Equation (Equation30(30) $\begin{aligned} - Δu (x) + α (x) u (x) & = Q (x) in D \\ u (x) + β (x) \frac{\partial u}{\partial n} (x) & = 0 on ∂D \end{aligned}$ (30) ) and $v = u_{1} - u_{2}$ . For $w = v ∇v$ , the divergence theorem (A4) $\int_{D} \nabla \cdot w d x = \int_{\partial D} w \cdot n d s$ (A4) yields (A5) $\int_{D} (∇v)^{2} + v Δv d x = \int_{\partial D} v \frac{\partial v}{\partial n} d s .$ (A5) Using $- Δv + αv = 0$ in D and inserting the boundary condition, we obtain (A6) $\int_{D} (∇v)^{2} + α v^{2} d x = \int_{\partial D} - β {(\frac{\partial v}{\partial n})}^{2} d s .$ (A6) Since the integral on the left side is non-negative and the integral on the right side non-positive, both integrands are identically zero. Therefore, $\frac{\partial v}{\partial n} = 0$ and from the boundary condition also v is identically zero on $∂D$ . From the weak maximum principle [Citation27] follows that v is identically zero on D.

Appendix C.

Material properties used in Section 6

Table C1. Material properties.

Display Table

Numerical solution of a parameter estimation problem arising in Prompt-Gamma Neutron Activation Analysis

Abstract

1. Introduction

2. Forward model

3. Parameter estimation problem