WENO3-NN: A maximum-order three-point data-driven weighted essentially non-oscillatory scheme

doi:10.1016/j.jcp.2021.110920

Journal of Computational Physics

Volume 452, 1 March 2022, 110920

https://doi.org/10.1016/j.jcp.2021.110920 Get rights and content

Highlights

•
A low-dissipation data-driven third-order WENO scheme (WENO3-NN) is proposed.
•
Additional loss on the reconstruction weights yields maximum-order convergence.
•
The network smoothness measure facilitates interpretation and generalization.
•
Very good performance on canonical test cases, including strong shock interactions.
•
Error magnitudes and wavenumber resolution comparable to or better than WENO5-JS.

Abstract

Neural networks have become more and more relevant for computational fluid dynamics. In recent works, neural network based weighted essentially non-oscillatory schemes have been developed. Challenges faced with such schemes are to ensure maximum-order convergence on narrow stencils and the ENO property. In this work, we use a neural network as a weighting function in the WENO scheme and address these shortcomings. Based on the input stencil, the neural network calculates a convex combination of local interpolation polynomials. We use a Galilean invariant embedding in the input layer and introduce an additional loss on the reconstruction weights, such that the WENO scheme inherently recognizes a smooth input function and achieves maximum-order convergence. The performance of the WENO3-NN scheme is demonstrated for one- and two-dimensional test cases, including strong shocks and shock-density wave interactions. The WENO3-NN scheme shows very good generalizability across all benchmark cases and different resolutions, and exhibits a performance similar to or better than the classical WENO5-JS scheme. By analyzing the approximate dispersion relation of the WENO3-NN scheme, we find that the neural network scheme learns a highly non-trivial dispersion-dissipation relation. Especially, data-driven schemes may introduce vanishing dissipation near the cutoff wavenumber which is counterintuitive to classical discretization-design principles.

Keywords

WENO

WENO-JS

WENO-Z

Neural network

Machine learning

Euler equations

1. Introduction

The solution of hyperbolic partial differential equations, e.g. the time-dependent Euler equations for compressible flows, requires numerical schemes capable of shock capturing and, at the same time, capable of resolving small-scale flow features. These seemingly contradictory demands, i.e. sufficient numerical dissipation around shock discontinuities and low-dissipation in smooth regions of the flow field, render the development of numerical schemes a long-standing challenge in computational fluid mechanics.

Weighted essentially non-oscillatory (WENO) [1] schemes are among the most popular choices to address aforementioned issues. The full stencil of a

-order WENO scheme can be subdivided into r stencils, each of which forms a local interpolation polynomial. WENO schemes use a solution-adaptive nonlinear convex combination of these low-order polynomials. The contribution of the low-order polynomials is determined by nonlinear coefficients in the convex combination, so called weights. These weights are calculated from local smoothness measures of the solution such that in smooth parts of the flow field a high-order approximation is achieved while interpolation across discontinuities is avoided and sufficient numerical dissipation is introduced to allow for sharp shock capturing. The low-order polynomial of a stencil which contains a discontinuity receives essentially zero weight (ENO property) so that oscillations are suppressed. In smooth parts of the solution, the low-order polynomials are combined such that the background central upwinding scheme of order

is recovered.

As shown by Henrick et al. [2], the classical WENO-JS scheme looses its full-order of accuracy at critical points. The proposed WENO-M scheme maps the classical WENO-JS weights such that the overall scheme recovers optimal-order of convergence. Borges et al. [3] proposed an improved WENO-Z scheme which assigns higher weights on less smooth stencils, for the first time, taking into account a global smoothness measure on the full stencil. In order to further decrease numerical dissipation, Hu et al. [4] propose an adaptive central-upwind 6-th-order WENO-CU6 scheme. WENO-CU6 has been further developed into an implicit LES model [5]. The family of targeted ENO (TENO) schemes by Fu et al. [6] is based on low-order polynomials with incrementally increasing stencil sizes. This allows for better treatment of multiple discontinuities while further minimizing the numerical dissipation.

In recent years, machine learning has become increasingly present in the field of fluid mechanics. Hybrid numerical schemes try to enhance classical numerical schemes by data-driven components. In the finite-volume method, increased interest has been placed on enhancing the cell face reconstruction process with data-driven algorithms, e.g. [7]. Stevens and Colonius [8] have proposed a WENO5-NN scheme in which a neural network maps the classical WENO5-JS weights in order to improve the solution. However, subsequent postprocessing of the new weights is necessary and the overall accuracy of the method is decreased to first-order.

In this work, we follow a different approach which is based on learning a complete smoothness measure that weights the standard ENO polynomials. This approach is similar to our previous work in [7], where we have used Harten polynomials as the basis of a data-driven finite-volume scheme for nonclassical shocks. By using ENO polynomials and a Galilean invariant input embedding for the neural network we incorporate prior knowledge and facilitate the machine learning task. This allows for very good generalization to unseen configurations and yields a highly interpretable neural network scheme. The resulting data-driven WENO scheme is trained offline on a set of canonical functions and later is applied to hyperbolic conservation laws. We show that by adaptive penalization of the deviation of the neural network output from the ideal weights (i.e. the weights that recover the background linear scheme), we can train a WENO-NN scheme which inherently achieves maximum-order convergence in smooth regions of the solution while maintaining the essentially non-oscillatory property at discontinuities. We focus on a third-order scheme due to its narrow stencil which is advantageous for practical applications. Comparing the weights of the proposed WENO3-NN scheme with the classical WENO3-JS scheme, the WENO3-NN maintains near-ideal weights over a large part of the solution field. One- and two-dimensional test cases show that WENO3-NN has very low dissipation and achieves a performance similar to the WENO5-JS scheme. An a posteriori analysis of the dispersion and dissipation characteristics of WENO3-NN schemes reveals that data-driven reconstruction schemes can generate a complex dispersion-dissipation relation which is counterintuitive to classical discretization-design principles. E.g. one variant of the WENO3-NN scheme introduces vanishing dissipation near the cutoff wavenumber.

The remainder of the paper is structured as follows. In Sec. 2, we review the classical WENO-JS scheme. In Sec. 3, we introduce the WENO3-NN method and describes the training procedure. Convergence behavior and characteristic behavior of the WENO3-NN scheme are discussed in detail. Section 4 presents applications to several 1D and 2D model problems, including the linear advection and Euler equations. Finally, we draw a conclusion and present an outlook in Sec. 5.

2. Review of WENO schemes

In this section we briefly review the classical third-order weighted essentially non-oscillatory scheme by Jiang and Shu (WENO3-JS) [1], [9] applied to hyperbolic conservation laws in a finite-volume framework. Without loss of generality, we consider the one dimensional scalar hyperbolic conservation law(1)

Application to multiple dimensions or systems of equations is handled via the so called dimension-by-dimension technique. We discretize the spatial domain into finite-volumes of uniform size Δx and denote by

the cell center and by

the cell faces of the i-th cell. The semi-discrete finite-volume formulation of Eq. (1) reads(2)

where(3)

is the cell average of

in cell i. According to the method of lines, Eq. (2) yields a system of ordinary differential equations (ODEs) which can be integrated in time by any ODE-solver, e.g. TVD Runge-Kutta schemes. The physical flux

in Eq. (2) is approximated by the numerical flux function

, so that in practice we solve(4)

where the numerical flux is a two-argument function of left and right cell face values(5)

The unknown cell face values

have to be reconstructed from the known cell average values

. In the following, we will only present the calculation of

. The computation of

is analogous. We will drop the superscript + for simplicity.

The classical third-order WENO3-JS scheme builds

as a convex combination of the two-point substencils

and

. Each substencil,

and

, implies a second-order approximation

of the cell face value

, so that the convex combination reads(6)

For third order, the interpolants are given as(7)

The classical WENO3-JS scheme finds the normalized nonlinear weights

in Eq. (6) as(8)

are unnormalized weights and

are the smoothness measures.

are the ideal weights which generate the third-order upwind-biased scheme for the values

. The parameter p enhances a scale separation of the smoothness parameters. Here we set

, and

avoids division by zero. The smoothness indicators

are calculated as(9)

The evaluation of the smoothness indicators in Eq. (9) gives(10)

We summarize two central ideas behind the WENO methodology: On the one hand, in smooth flow regions maximal accuracy (here, third order) is desirable. Therefore, in smooth parts of the solution the smoothness indicators

are similar to each other, thus the weights

approach the ideal weights

. On the other hand, the influence of a stencil

containing a discontinuity should be diminished in the interpolation process Eq. (6). The smoothness measure

of a discontinuous stencil is

, so that the corresponding weight

is relatively small (nearly zero) compared to smoother stencils. This guarantees that the interpolation process is essentially non-oscillatory (ENO property).

3. WENO3-NN

3.1. Neural network basics

Neural networks are parameterizable nonlinear compound functions that map any input x to an output

, where θ are free and learnable parameters. Deep neural networks (DNNs) consist of multiple hidden layers of units (so called neurons) between in- and output layer. They perform successive elementary nonlinear transformations to map x to y. The numerical values in each layer are called hidden-state activations.

In multilayer perceptrons (MLPs), neurons in adjacent layers are densely connected. The vector of activations

in layer l is computed from the activations of the previous layer

by first applying an affine linear transformation, followed by an element-wise nonlinearity

. The activation of the i-th neuron in layer l denoted as

is calculated as(11)

where

indicates the weight matrix linking layers

and l,

is the bias vector, and

indicates the number of neurons in layer

. The error between the network prediction y and the true output

is calculated by a suitable loss function

. Training a neural network means finding a set of parameters θ that approximately minimizes the selected loss function. Typically, the loss function is minimized via mini-batch gradient descent or the popular Adam optimizer [10].

3.2. WENO3-NN Architecture

In this section, we introduce the WENO3-NN architecture. The polynomial weights

from above are functions of the cell values in the full input stencil

with

, i.e.(12)

subject to

and

. Therefore, any function

subject to aforementioned restrictions can be regarded as a third-order WENO-type weighting function. Naturally, any artificial neural network with suitable in- and output space represents a parameterizable WENO weighting function(13)

The proposed WENO3-NN architecture is displayed in Fig. 1. First, the input values from the 3-point stencil

are passed to the so called Delta layer which calculates input features for the neural network. In this work, we use the following features(14)

(15)

where

. The four features

are essentially the normalized amplitudes of first and second-order derivatives of the local input field, and thus are expected to give a good measure of regularity of the underlying function. In line with classical WENO weighting functions, the features are designed to be Galilean invariant such that the overall weighting function is Galilean invariant as well. We note that the proper design of the input features injects appropriate prior knowledge into the neural network and eases generalization.

are then passed through a multilayer perceptron. Here, the network consists of an input layer, 3 hidden layers with 16 nodes each, and a softmax output layer. The softmax function is a natural choice for the output activation as it implies

and

. We use the swish activation function [11] in the hidden layers,

, where

is the sigmoid function. Upon proper training, we expect the softmax activation function to assign low weights to low-order polynomials that contain discontinuities. During numerical experimentation, we found that it is difficult for the network to output essentially zero weights due to the saturation of the softmax activation function. This is mainly a problem for test cases including very strong shocks such that spurious oscillations are no longer suppressed and negative densities or pressures might be reconstructed, e.g. the interacting blast wave test case in Sec. 4. Therefore, at test time, we pass the

weights through a so called ENO layer which restores the ENO property. This is essentially a sharp cutoff function followed by a renormalization (similarly to [6]), i.e.(16)

where

is the cutoff threshold. We use

, see A.4 for more details.

Upon proper training, the neural network inherently learns a smoothness measure and a weight function. Restricting the neural network as a weight function has several advantages: Firstly, we make use of the well defined ENO interpolants. Secondly, the network outputs are interpretable. Finally, we can identify a posteriori the analytical relation that the network has learned. Thus machine learning enables a new avenue of finding improved smoothness measures.

As mentioned above, suitable WENO weights should be close to the ideal weights in smooth flow regions while discontinuous stencils should be assigned effectively zero weights. In previous works, WENO neural networks schemes have been only trained on a reconstruction error, i.e. finding a suitable mapping such that the cell face values are reconstructed in a best possible way. Unsurprisingly, such schemes were not able to achieve full-order convergence upon mesh refinement. Ideally, we want the network to fall back to the ideal weights when the input stencil is smooth enough.

Therefore, we introduce the following loss function(17)

(18)

(19)

The summation of the loss functions is over a mini-batch consisting of

samples. The total loss

is composed of three individual parts: the cell face reconstruction loss

, the deviation from the ideal weights

, and an

-regularization loss on the neural network weights

to prevent overfitting. γ is a data dependent parameter that adaptively weights the two loss components

and

for each sample.

measures the well-resolvedness of the function in the given stencil. For a completely resolved function (e.g. a linear function)

, and we only require that the network tries to approximate the ideal reconstruction weights

. For a discontinuous function

, and the network has no a priori bias w.r.t. the ideal weights, only the reconstruction loss is penalized. For a function that is not well resolved on the given stencil,

, so that the network has to strike a balance between reconstruction loss and deviation from the ideal reconstruction weights. This a priori bias incentivizes the scheme to output the ideal weights

while providing the neural network with the liberty to deviate from

if the reconstruction loss is significantly reduced. The α exponent creates a scale separation mechanism. Increasing α or

pushes the neural network towards the ideal weights

. We highlight the influence of α in A.3. Note that

penalizes only the reconstruction,

would recover the background linear scheme.

In order to find a suitable γ function for the WENO3-NN scheme, we expand

in terms of a Taylor series around the cell center

(20)

where the right hand side is to be evaluated at the cell center

. We consider a function to be smooth in the neighborhood of

when the contribution of high-order derivatives is small compared to the contribution of their low-order counterparts. E.g., when only information on first and second-order derivatives is available, we consider a function as smooth if(21)

Therefore, for the third-order WENO3-NN scheme, we use(22)

where

is a small number to avoid division by zero.

3.3. Training

The training dataset is composed of canonical functions that mimic local solution features of hyperbolic conservation laws. Table 1 summarizes the training dataset. We use polynomials up to degree 3, jump discontinuities, sawtooth functions, and trigonometric functions. Polynomials and the tanh functions are evaluated on the domain

, while all other functions are evaluated on

. We use a discretization of

for the dataset. For jumps and sawtooth functions, only stencils that include a discontinuity are included in the dataset. During network training, we split the data between training and validation set with a validation split of 0.1. We train the network with the Adam optimizer [10] for 100 epochs. The learning rate is fixed at

, and we use a mini-batch size of

. During our hyperparameter study, we find two optional WENO3-NN schemes, one with

(in the following denoted as WENO3-NN1) and one with

(in the following denoted as WENO3-NN2), that perform very well over a number of different test cases. In the following, we will focus on these two WENO3-NN variants. Further details on the datasets and the training process are provided in A.1 and A.2, respectively.

Table 1. Training dataset. represents a uniform distribution, represents the Bernoulli distribution.

Function f(x)	Parameters	Number of samples
		4000 for each k
u_l(x < 0.5)+u_r(x > 0.5)		8000
(−1)^ax + δ(x > 0.5)		4000
		4000
		4000

3.4. Convergence behavior

In the following, we show that including an additional loss term

on the deviation from the ideal polynomial weights

restores full-order accuracy. To this end, we compare the convergence behavior of the WENO3-NN with (

) and without (

) the ideal weights penalty in Fig. 2. As test functions, we use

and

on the domain

. When no penalty on the deviation from the ideal weights is applied (

), the WENO3-NN scheme yields second-order convergence. The neural network has not learned to adapt the reconstruction in the asymptotic limit to recover the third-order central-upstream scheme. The overall accuracy degenerates to the accuracy of the low-order polynomials (in this case second-order polynomials). However, when a deviation from the ideal weights is penalized (

), the neural network learns to assemble the upstream linear background scheme for smooth inputs, and the WENO3-NN scheme achieves third-order convergence for the

test case. For the more complex

that involves smooth extrema and critical points, the WENO3-NN scheme achieves a convergence order of

which is in accordance with the WENO3-JS and WENO3-Z scheme. Note, that both WENO3-NN schemes show a drop in the

error in the coarse regime for

. This corroborates that the WENO-NN schemes presented here comply with our intention: they have the potential to improve on classical WENO methods for coarsely resolved flow features while maintaining full-order convergence upon mesh refinement.

3.5. Behavior at smooth extreme points and discontinuities

We analyze the behavior of WENO3-NN1 (

) and WENO3-NN2 (

) in smooth regions and near discontinuities by computing the weights

and

for the function(23)

Fig. 3 shows the weight

at all cell faces

(

is omitted for legibility). The black line represents the ideal weight

for smooth regions. Both WENO3-NN schemes are able to follow the ideal weight for most parts of the domain, and show smaller deviations than the WENO3-N scheme [12]. At both extreme points as well as at the jump discontinuity the WENO3-NN scheme degenerates and gives

, respectively. The analysis of the WENO3-NN weighting function underscores the high interpretability of the proposed framework. Compared to WENO3-NN1, the WENO3-NN2 scheme with

deviates slightly stronger from the ideal weights as a smaller α prioritizes the reconstruction error over the ideal weights error. We note that the WENO3-NN scheme has learned a weighting function which shows some qualitative similarities to the WENO-F3+ scheme [13], although the deviation from the ideal weight is much smaller for WENO3-NN.

Additionally, we are interested in the reconstruction behavior of the WENO3-NN scheme around saddle points. Fig. 4 shows the weight distribution

for the function(24)

While WENO3-JS and WENO3-Z degenerate at the saddle point (

), the WENO3-NN schemes are able to distinguish the saddle point, similar to the WENO-F3+ scheme, and adapt the reconstruction accordingly.

4. Results

In the following, we illustrate the WENO3-NN scheme by solving the linear advection equation and the Euler equations. The one-dimensional linear advection equation is given by(25)

The three-dimensional, compressible Euler equations are(26)

is the total energy per unit volume, where e is the internal energy per unit mass. Here, the internal energy is closed by the equation of state for an ideal gas,

with

if not stated otherwise.

We use Local Lax-Friedrichs (LLF) flux splitting (also called Rusanov method). The 3rd-order TVD Runge Kutta scheme is used to integrate the equations in time. Unless stated otherwise, all computations are carried out with

and performed on a uniform mesh. For the Euler equations, we apply the WENO-reconstruction on the characteristic variables [14]. Throughout this section, we compare the WENO3-NN scheme

(denoted as WENO3-NN1) and the WENO3-NN scheme with

(denoted as WENO3-NN2) with the third and fifth-order classical WENO-JS schemes (WENO3-JS and WENO5-JS, respectively), the third-order (improved) WENO3-Z scheme [15], the fifth-order WENO5-Z scheme [3], and the WENO3-N scheme [12]. Although it is well known that the classical WENO5-JS scheme is strongly dissipative and that many improved WENO5 schemes have been developed [2], [3], [16], the approximation quality of the fifth-order WENO5-JS scheme is still a good reference for any improved third-order WENO scheme due to its wider five-point stencil and the therein contained information on higher order derivatives. For all classical WENO schemes, we use the parameter

to avoid division by zero. Reference solutions labeled as “Exact” are obtained by the WENO5-JS scheme on a uniform mesh with

points.

4.1. Linear advection

We test the WENO3-NN scheme on the linear advection equation (25). We consider a discontinuous initial condition consisting of a Gaussian, a square, a triangle, and a semi-ellipse (GSTE), given by(27)

where

. The constants are

, and

. The domain is

and we apply periodic boundary conditions. The resolution is

. The initial condition is transported up to

Fig. 5 compares the results of the proposed WENO3-NN scheme with WENO3-JS/Z/N and WENO5-JS/Z. The results for WENO3-NN outperform the WENO3-JS/Z/N schemes, and are much closer to WENO5-JS. The comparatively low dissipation of the WENO3-NN scheme yields better results especially near spikes and discontinuities compared to WENO3-N. While WENO3-NN is not able to achieve the same performance as WENO5-JS/Z for functions with a local maximum (i.e. the Gaussian, the triangle, and the semi-ellipse), it is interesting to note that the diffusion of the square discontinuity is similar for WENO3-NN and WENO5-JS/Z. WENO3-NN2 achieves slightly better results than the WENO3-NN1 scheme.

Another standard test for WENO reconstruction schemes is the advection of a nonlinear discontinuity defined by the initial condition(28)

We advance the solution up to

. Fig. 6 shows the numerical solutions of the different WENO schemes. Compared with WENO3-N, the WENO3-NN schemes yield substantially better results and are able to capture the discontinuity better. Note that the WENO3-NN schemes also give better results in the smooth part of the flow field, especially near the smooth extreme points where they are nearly identical with WENO5-JS.

Finally, we consider the advection of the initial distribution

up to

. Fig. 7 shows the results of the WENO3-NN scheme at a resolution of

points. Note that this is a coarser resolution than the training dataset which is generated at

. In this test case, the behavior at the extreme points and the saddle point are of interest. As discussed earlier, the proposed WENO3-NN schemes are able to detect the saddle point and adapt the reconstruction accordingly. The results are remarkably better than other third-order WENO schemes. The low dissipation allows a good reconstruction at the extreme points.

Additionally, we provide the pointwise error distributions and integral

error norms for the linear advection tests in Appendix A.6.

4.2. Shock-tube problems

We test the WENO3-NN schemes on the shock-tube test problems: namely, the Sod problem [17], the Lax problem [18], and the 123 problem [19]. We use a resolution of

grid points for all three test problems. The initial condition of the Sod problem is(29)

and the final simulation time is

. Fig. 8 shows the solution at the final simulation time. The WENO3-NN schemes capture the contact discontinuity and the right-moving shock much sharper than WENO3-JS/Z/N, and give a nearly identical performance to WENO5-JS.

The initial condition of the Lax problem is(30)

and the final simulation time is

. Fig. 9 shows density and velocity distributions at the final time. The WENO3-NN schemes do not introduce any oscillations at the shock wave. WENO3-NN captures the contact discontinuity and shock very sharply. The results of the WENO3-NN schemes are better than the WENO5-JS scheme and are closer to the performance of WENO5-Z. The plateau value is exceptionally well captured by both WENO3-NN schemes. The WENO3-NN2 slightly outperforms WENO3-NN1. Due to reduced dissipation, a slight overshoot, similar in strength to WENO5-JS, in the velocity profile is visible for WENO3-NN2. Compared to WENO3-JS/Z/N, WENO3-NN needs considerably less points to resolve the discontinuities.

The initial condition of the 123 problem is(31)

and the final simulation is

. The 123 problem consists of two strong rarefaction waves and poses a demanding test for numerical schemes as the pressure of the intermediate state is close to zero. Fig. 10 provides a comparison between the established WENO schemes and WENO3-NN. WENO3-NN is able to handle the strong rarefaction-waves very well without introducing invalid pressure values. A detailed view of the head section of the rarefaction wave indicates that WENO3-NN clearly outperforms WENO3-N and WENO5-JS.

We provide the pointwise error distributions and integral

error norms for the shocktube tests in A.6.

4.3. Interacting blast waves

We consider the interacting blast wave test case from Woodward and Colella [20]. The initial condition is(32)

The computational domain is

and reflective boundary conditions are applied at

and

. The final simulation time is

. Fig. 11 shows the simulation results on a uniform mesh with

(top) and

(bottom) cells. From all three-point stencils, WENO3-NN provides the best approximation of the density profile. The difference is especially pronounced near the valley and at the right density peak. The quality of the WENO3-NN approximation at 400 points is similar to the WENO3-JS scheme at twice the resolution. Both WENO3-NN give very similar results over most of the domain, however WENO3-NN2 approximates the right density peak slightly better.

4.4. Shock-density wave interaction

We consider the shock-density interaction test case by Shu and Osher [21]. The initial condition is a Mach 3 shock running into a perturbed density field,(33)

The computational domain is

. The final simulation time is

. Fig. 12 shows the simulation results on a uniform mesh with

(top) and

(bottom) cells. WENO3-NN manages to resolve the density waves much better compared to the smeared solution of WENO3-JS/Z/N, indicating that smooth flow features are very well captured by WENO3-NN. The shock wave is well captured by all schemes. WENO5-Z provides the best performance among all investigated schemes. However, the WENO3-NN results are sharper compared to WENO3-JS and less points are needed to resolve the discontinuity. The solution of WENO3-NN at

is comparable to the WENO3-JS solution at twice the resolution. For

, we observe the WENO3-NN result approaching WENO5-JS.

4.5. Gresho vortex advection

We consider the unsteady Gresho vortex. A uniform flow with(34)

is superposed with a rotating vortex of radius R placed at the center

of the computational domain

. We use a uniform mesh with a resolution of

cells.

is the speed of sound and

is the Mach number. Periodic boundary conditions are applied. The numerical solution is calculated on a

grid and the final simulation time is

. The initial conditions of the vortex are given in terms of the radial distance from the vortex core

. The tangential velocity

and the pressure are given by(35)

and(36)

We choose

and

. The Gresho vortex is used to test the low Mach number behavior of numerical schemes. Here, we are mainly interested in the dissipative properties of the reconstruction schemes. Fig. 13 shows the pressure field at the final simulation time. The relatively strong dissipation of WENO3-JS and WENO3-Z have led to considerable deformations of the vortex. WENO3-N retains the overall vortex shape and provides an acceptable approximation, while the WENO5-JS solution approximates the vortex very well apart from small spurious disturbances. For the WENO3-NN schemes, the vortex structure looks reasonable and is only slightly smeared out when compared to the WENO5-JS solution. Compared to WENO3-NN2, the vortex in the WENO3-NN1 solution is approximated sharper. At the same time, the solution features more spurious noise due to low dissipation.

4.6. Double Mach reflection of a strong shock

We consider the double Mach reflection test of a strong shock from Woodward and Colella [20]. A right-moving Mach 10 shock is reflected from a wall. The initial condition is(37)

Reflective boundary conditions are applied at the lower wall, at the top boundary the flow variables are prescribed as to follow the exact evolution of the moving shock. We use a mesh resolution of

cells. The final simulation time is

. Fig. 14 shows density contours in a detailed view of the region

Both WENO3-NN schemes are able to resolve considerably finer structures than the WENO3-JS/Z/N schemes and show an overall sharper density field. This becomes especially visible in the jet region and at the primary slip line which shows the development of wave structures. Due to excess dissipation, the slip line appears rather smooth for the classical WENO3 schemes. Here, WENO3-NN1 resolves ever so slightly finer structures than WENO3-NN2. The WENO3-NN schemes show a slightly more pronounced post shock instability behind the incident shock wave when compared to WENO3-N or WENO5-JS. The overall solution structure looks very much similar to the WENO5-JS result. However, the WENO3-NN schemes manage to suppress some of the spurious oscillations present in the WENO5-JS solution.

4.7. Rayleigh-Taylor instability

The initial condition of the Rayleigh-Taylor instability is(38)

with the speed of sound

and the ratio of specific heats

. The computational domain is

. Reflective boundary conditions are applied for left and right boundary. Dirichlet boundary conditions are used for top and bottom boundary, i.e.

and

, respectively. The final simulation time is

Fig. 15 compares the solutions of all WENO schemes at a resolution of

corresponding to a grid spacing

. The WENO3-NN result has much finer structures than WENO3-JS and WENO3-Z. The slightly more spherical shapes in the WENO3-NN results indicate low dissipation. Between the two WENO3-NN schemes, WENO3-NN1 resolves very sharp and fine flow structures.

Fig. 16 compares the numerical solutions at a higher resolution of

corresponding to a grid spacing

. The vortical structures starting to form in the WENO3-NN solution are considerably smaller than the WENO3-JS/Z/N ones. While the WENO3-NN2 solution has qualitative similarity with the WENO3-Z solution, the density contours of WENO3-NN1 display very fine and detailed flow features. Interestingly, WENO3-NN1 has less prominent vortical structures compared to WENO3-NN2, and WENO5-JS does not show vortical structures at all at the current resolution level.

4.8. Approximate dispersion relation

The dissipation-dispersion relation provides further insight into the properties of nonlinear shock-capturing schemes. Following Pirozzoli's approximate dispersion relation [22], we analyze the properties of the WENO3-NN schemes and compare them with the background linear central upwind-schemes of corresponding order (CU1, CU3, CU5) as well as with third and fifth-order WENO-JS methods. Fig. 17 shows the ADR for WENO3-NN1 and -NN2. ξ denotes the wavenumber.

and

are the imaginary and real part of the modified wavenumber, respectively. Although the performance of both WENO3-NN schemes in above tests was very similar, the dispersion relations of both schemes differ quite drastically. The WENO3-NN2 scheme shows a more classical dissipation-dispersion behavior. Over the whole wave number range WENO3-NN2 is considerably less dissipative compared to WENO3-JS. Interestingly, WENO3-NN2 is even slightly less dissipative than the WENO5-JS scheme for higher wave numbers. The corresponding dissipation curve has a monotonic behavior and does not show the distinct valleys of WENO3-JS or WENO5-JS. The dispersive behavior of WENO3-NN2 shows qualitative similarities to the linear central-upwind schemes. The ADR of WENO3-NN1 shows a very intersting behavior. The dissipation is kept to a minimum over the whole wavenumber range. Especially, at the cutoff wavenumber zero dissipation is introduced. Here, WENO3-NN1 strikes a complex balance between dissipation and dispersion. We interpret these results such that underresolved wave packets near the cutoff wavenumber are maintained by the WENO3-NN1 scheme, but do not interact with the resolved scales, akin to soliton solutions. These findings could explain the good numerical results of WENO3-NN1 for the Gresho vortex test case and the much sharper flow structures in the Double Mach reflection test. While WENO3-NN1 introduces near zero dissipation at the cutoff wavenumber, this distinct behavior might at the same time impede the development of flow structures near the cutoff wavenumber, e.g. the vortical structures in the Rayleigh-Taylor test case.

4.9. Online computational cost

Despite the fact that WENO3-NN predominantly is intended to explore machine learning supported design of nonlinear discretization schemes we assess in this section its computational performance in comparison with established WENO schemes. For a realistic estimation of the online computational cost, we evaluate the computational efficiency of the proposed WENO3-NN schemes for the double Mach reflection test case from Sec. 4.6. We measure the average computational time required for advancing the state of one cell for one time step. Since we use the 3rd-order TVD Runge Kutta scheme for time integration, one time step corresponds to three flux evaluations and therefore to three cell face reconstructions. The simulations are performed at a resolution of

points and are integrated over 3700 time steps. The algorithm is implemented in Jax [23]. We use a Nvidia GTX2080 GPU for all simulations. Table 2 shows the absolute and normalized average wall clock times per cell for a computational step.

Table 2. Average computational performance of selected WENO schemes.

Scheme	GPU time per cell and time step (in ns)	Normalized cost
WENO3-JS	389	1.00
WENO3-Z	396	1.02
WENO3-N	411	1.06
WENO3-NN1	1676	4.31
WENO3-NN2	1743	4.48
WENO5-JS	561	1.44

Neural network evaluations at each stencil render WENO3-NN as clearly more expensive than classical schemes with analytically derived smoothness measures. However, we emphasize that neural network evaluations have not been optimized for computational efficiency nor have we optimized the size of the network. Keeping in mind the strongly improved results of the WENO3-NN schemes, we believe that with the rapid development of dedicated computational hardware for fast neural network evaluations (e.g. tensor processing units), data-driven reconstruction schemes will become a viable alternative to classical approaches.

5. Conclusion

Inspired by the recent success of machine learning enhanced numerical methods for computational fluid dynamics, we have proposed a neural-network-weighted essentially-non-oscillatory scheme. The characteristics of the WENO3-NN scheme have been extensively analyzed and a plethora of one- and two-dimensional benchmark test cases, involving strong shocks and shock-density interactions, has been performed. We demonstrate that data-driven training can lead us to new schemes with unexpected properties and significantly improved performance.

The WENO3-NN scheme utilizes a neural network as a parameterizable weighting function of low-order local interpolation polynomials. We have shown that embedding a priori knowledge, such as a Galilean invariant input layer and the reconstruction via Harten polynomials, into the design of the WENO3-NN scheme yields an interpretable and generalizable data-driven scheme. A newly introduced loss term on the network weights allows the WENO3-NN scheme to reach full-order convergence whereas earlier data-driven WENO schemes required a rescaling of the network output in order to guarantee convergence. The WENO3-NN scheme, although trained on a handful of one-dimensional elementary functions at a single resolution level, has shown very good performance at different resolution levels and in multidimensional problems. In fact, the WENO3-NN scheme outperforms the classical WENO3-JS/Z/N schemes across all benchmark problems and, in some test cases, has shown similarities to the performance of WENO5-JS. The approximate dispersion relation of the WENO3-NN schemes has revealed a very interesting behavior. While the WENO3-NN2 scheme features a rather classical, low-dissipation behavior over the whole wavenumber range, the ADR of the WENO3-NN1 scheme reveals a rather complex and unexpected dispersion and dissipation behavior. Especially, the WENO3-NN1 scheme introduces zero dissipation at the wavenumber cutoff. Machine learning might provide the necessary tools to further investigate such non-trivial dispersion-dissipation relations of numerical schemes. More generally, data-driven ansatze might have the potential to push the boundaries of conventional and long-standing concepts in the design of numerical schemes for fluid mechanics. The combination of such an analysis with an extension of the proposed WENO-NN methodology to higher order schemes seems like a promising avenue for the development of data-driven numerical schemes and is the subject of ongoing research. Finally, we want to point out that the high interpretability of the WENO3-NN weighting function may help to find analytical smoothness measures/weighting functions for higher order WENO schemes. The proposed framework could enable a new line of development of WENO schemes in which firstly a deep WENO-NN scheme is trained on a rich dataset and, then, a functional relation of the learned weighting scheme is identified (e.g. using sparse regression). Aforementioned issues motivate future work.

CRediT authorship contribution statement

Deniz A. Bezgin: Conceptualization, Formal analysis, Investigation, Methodology, Software, Writing – original draft. Steffen J. Schmidt: Conceptualization, Supervision, Writing – review & editing. Nikolaus A. Adams: Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This project has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (grant agreement No. 667483).

Appendix A.

A.1. Dataset

One has to take special care when generating finite-volume data (i.e. cell-averaged data) on a three-point stencil in order to avoid contradicting training samples. Especially, jump discontinuities and extreme points can introduce inconsistencies into the dataset. For example, consider the finite-volume representation of a jump discontinuity in the middle of cell i, see Fig. 18 (left). In the cell-averaged representation the discontinuous example and the smooth linear example look identical. However, the corresponding cell face values are different. The network, therefore, would get two different labels for the same input. We avoid this predicament in training samples by placing jumps on cell faces only.

The same holds true for extreme points (e.g. of a sine function) and jump discontinuities, see Fig. 18 (right). The finite-volume representation of the sine function is identical with a jump placed at

. To this end, for trigonometric functions, we exclude all stencils from the training dataset for which cell face values lie outside the cell averages in the corresponding stencil.

A.2. Model training

Table 3 shows the value ranges of the model hyperparameters. Each model is initialized with 5 sets of different random weights. For each set of model hyperparameters, we choose the model with lowest validation error that also passes all 1D benchmark problems. Fig. 19 shows the training and validation loss during the optimization process. The losses are fully converged at the end of the training process.

Table 3. Model hyperparameters.

Parameter	Value range
α	[0, 1E-2, 3E-2, 1E-1, 3E-1, 1]
β_d	[1E-2, 1E-1, 1]
β_W	1E-9
c_eno	2E-4

A.3. Influence of hyperparameters

The analysis of the hyperparameters on the model performance is very instructive. We evaluate the weight

for

while keeping either α or

fixed and varying the other parameter in the loss function Eq. (17). The left part of Fig. 20 shows that for fixed

decreasing α provides the WENO3-NN model with more liberty to deviate further from the ideal weights

. As mentioned earlier, α introduces a scale separation mechanism. α values closer to one smoothen out the input field and push the WENO3-NN outputs closer to the ideal weights over the whole domain. For α values closer to zero, network outputs deviate stronger from the ideal

. The right part of Fig. 20 shows that increasing

for fixed α pushes the network output more towards the ideal weights. For

deviates from the ideal weight only at the smooth extreme points. Both behaviors are in line with our understanding of the loss function Eq. (17).

A.4. Cutoff threshold of the ENO layer

WENO3-NN schemes detect discontinuities in the flow field very well and adjust the reconstruction accordingly. However, during numerical experimentation we found that WENO3-NN schemes without an ENO layer assign small non-zero weights to discontinuous stencils. For example, the left of Fig. 21 shows the output weights of WENO3-NN1 without ENO layer at the cell face

evaluated for a single jump discontinuity placed at

, i.e.(A.1)

We observe that WENO3-NN assigns a small, but non-zero weight of around

to the discontinuous stencils. During training the neural network learns to detect discontinuities and adapts the reconstruction accordingly. However, the neural network does not decrease the output weights below a certain threshold (here

) since the loss at such discontinuous stencils is already small compared to other training samples. Additionally, the saturation of the softmax output activation makes it difficult to output exact zeros and ones. In practice, this does not pose a problem for many flow applications. However in the vicinity of very strong shocks, e.g. the interacting blast wave test by Woodward and Colella in Sec. 4.3,

might already lead to the reconstruction of negative pressures or densities. To prevent the reconstruction of such inadmissible states, we postprocess the network output at test time. We pass the output weights

through a simple cutoff function, the so called ENO layer, which restores the ENO property.(A.2)

for discontinuities, we choose

as the threshold value for the ENO layer. Note, that the cutoff function can be easily implemented via a ReLU activation function with corresponding threshold, i.e.(A.3)

The weights

with active ENO layer are visualized in the right part of Fig. 21. The ENO layer only affects the numerical solution near discontinuities.

A.5. Convergence for the linear advection equation

We provide further details on the convergence behavior of the WENO3-NN scheme for the linear advection equation. We apply the WENO3-NN scheme to the linear advection of

on the domain

. The initial condition is integrated up to

. We choose a small enough time step (

) to exclude any errors by the time stepping scheme. Fig. 22 shows the grid convergence in

, and

errors for WENO3-JS/Z/N/NN1/NN2 and WENO5-JS/Z, respectively. WENO3-NN1 and -NN2 achieve lower absolute errors than WENO3-JS/Z/N and also show better convergence behavior.

A.6. Pointwise error for linear advection and shocktube tests

In the following, we provide the pointwise errors for the linear advection test cases from Sec. 4.1 and the shocktube tests from Sec. 4.2. Fig. 23 shows the error distributions for linear advection of the multiwave Eq. (27), the discontinuity Eq. (28), and

. The error distributions show that WENO3-NN schemes consistently give a lower error level near discontinuities and in smooth regions when compared to WENO3-JS/Z/N. The multiwave test case indicates that initial discontinuities are smeared out similarly for WENO3-NN and WENO5-JS. The error level of WENO3-NN2 is slightly lower than WENO3-NN1 for all linear advection test cases.

We also provide the pointwise density and velocity error plots for the Sod, Lax, and 123 shocktube problems, see Eqs. (29), (30), and (31). We compare the WENO approximations to the cell-averaged exact solution. Fig. 24 shows the error distributions for the Sod problem. For the density, WENO3-NN1 and -NN2 show considerably lower error levels than other three-point schemes. Around the head and tail of the rarefaction wave and also around the discontinuities, WENO3-NN schemes even outperform WENO5-JS. The error distributions for the Lax problem in Fig. 25 underline the improved shock-capturing capabilities of the WENO3-NN schemes. Near the contact discontinuity and the shock, WENO3-NN schemes outperform WENO3-JS/Z/N and WENO5-JS. Here, the error levels of WENO3-NN schemes are much closer to WENO5-Z. The 123 problem indicates that WENO3-NN schemes are able to resolve strong rarefactions very well, see Fig. 26. WENO3-NN schemes have the lowest overall error from all three-point stencil schemes. However, WENO3-NN schemes show a slightly larger error in the region between the two rarefaction waves. We note that at times, the pointwise errors plots are difficult to interpret. For example in the Sod or 123 tests, WENO3-JS shows lower error magnitudes than WENO5-JS in some regions but delivers much worse approximations in other regions. Therfore, we summarize the integral

errors in Table 4. The

error for the Euler equations is calculated as the sum of the

errors of density, velocity, and pressure, i.e.

. In the linear advection tests, WENO3-NN schemes clearly outperform WENO3-JS/Z/N and are much closer to the WENO5-JS results. WENO5-Z yields the best performance across all linear advection tests. For the nonlinear shocktube problems, WENO3-NN schemes consistently deliver lower error magnitudes than WENO3-JS/Z/N and even outperform WENO5-JS. For the Sod and Lax problems, WENO5-Z achieves the lowest error magnitudes among all the WENO schemes considered in this work. However in the 123 problem, WENO3-NN schemes deliver better approximations than WENO5-Z.

Table 4. L₁ errors of various WENO schemes for the linear advection equation and shocktube problems of the Euler equation. Linear advection tests include , the multiwave in Eq. (27), and the discontinuity in Eq. (28). The Sod, Lax, and 123 shocktube problems are defined by Eqs. (29), (30), and (31).

Empty Cell		GSTE	Disc.	Sod	Lax	123
WENO3-JS	1.180E-1	3.702E-1	8.260E-2	1.636E-2	3.949E-2	2.540E-2
WENO3-Z	5.552E-2	2.343E-1	5.902E-2	1.272E-2	2.931E-2	2.317E-2
WENO3-N	4.197E-2	1.969E-1	5.225E-2	1.188E-2	3.153E-2	2.255E-2
WENO3-NN1	2.661E-2	1.651E-1	4.355E-2	1.078E-2	2.527E-2	1.973E-2
WENO3-NN2	2.445E-2	1.609E-1	4.235E-2	1.035E-2	2.416E-2	1.967E-2
WENO5-JS	2.572E-3	9.964E-2	2.634E-2	1.106E-2	2.590E-2	2.393E-2
WENO5-Z	1.930E-3	7.731E-2	2.335E-2	8.113E-3	1.900E-2	2.282E- 2

References

[1]
G.S. Jiang, C.W. Shu
Efficient implementation of weighted ENO schemes
J. Comput. Phys., 126 (1) (1996), pp. 202-228, 10.1006/jcph.1996.0130
View PDF View article View in Scopus Google Scholar
[2]
A.K. Henrick, T.D. Aslam, J.M. Powers
Mapped weighted essentially non-oscillatory schemes: achieving optimal order near critical points
J. Comput. Phys., 207 (2) (2005), pp. 542-567, 10.1016/j.jcp.2005.01.023
http://linkinghub.elsevier.com/retrieve/pii/S0021999105000409
View PDF View article View in Scopus Google Scholar
[3]
R. Borges, M. Carmona, B. Costa, W.S. Don
An improved weighted essentially non-oscillatory scheme for hyperbolic conservation laws
J. Comput. Phys., 227 (6) (2008), pp. 3191-3211, 10.1016/j.jcp.2007.11.038
View PDF View article View in Scopus Google Scholar
[4]
X. Hu, Q. Wang, N. Adams
An adaptive central-upwind weighted essentially non-oscillatory scheme
J. Comput. Phys., 229 (23) (2010), pp. 8952-8965, 10.1016/j.jcp.2010.08.019
http://linkinghub.elsevier.com/retrieve/pii/S0021999110004560
View PDF View article View in Scopus Google Scholar
[5]
X. Hu, N. Adams
Scale separation for implicit large eddy simulation
J. Comput. Phys., 230 (19) (2011), pp. 7240-7249, 10.1016/j.jcp.2011.05.023
http://linkinghub.elsevier.com/retrieve/pii/S0021999111003342
View PDF View article View in Scopus Google Scholar
[6]
L. Fu, X.Y. Hu, N.A. Adams
A family of high-order targeted ENO schemes for compressible-fluid simulations
J. Comput. Phys., 305 (2016), pp. 333-359, 10.1016/j.jcp.2015.10.037
View PDF View article View in Scopus Google Scholar
[7]
D.A. Bezgin, S.J. Schmidt, N.A. Adams
A data-driven physics-informed finite-volume scheme for nonclassical undercompressive shocks
J. Comput. Phys., 437 (2021), Article 110324, 10.1016/j.jcp.2021.110324
https://linkinghub.elsevier.com/retrieve/pii/S0021999121002199
View PDF View article View in Scopus Google Scholar
[8]
B. Stevens, T. Colonius
Enhancement of shock-capturing methods via machine learning
Theor. Comput. Fluid Dyn., 34 (4) (2020), pp. 483-496, 10.1007/s00162-020-00531-1
arXiv:2002.02521
http://arxiv.org/abs/2002.02521
View in Scopus Google Scholar
[9]
C.-W. Shu
Essentially non-oscillatory and weighted essentially non-oscillatory schemes for hyperbolic conservation laws
Adv. Numer. Approx. Nonlin. Hyperbol. Eq., 97 (1998), pp. 325-432, 10.1007/bfb0096355
Google Scholar
[10]
D.P. Kingma, J.L. Ba
Adam: a method for stochastic optimization
3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, International Conference on Learning Representations, ICLR (2015)
arXiv:1412.6980
Google Scholar
[11]
P. Ramachandran, B. Zoph, Q.V. Le
Searching for activation functions
arXiv:1710.05941
Google Scholar
[12]
W. Xiaoshuai, Z. Yuxin
A high-resolution hybrid scheme for hyperbolic conservation laws
Int. J. Numer. Methods Fluids, 78 (3) (2015), pp. 162-187, 10.1002/fld
View in Scopus Google Scholar
[13]
N.R. Gande, A.A. Bhise
Modified third and fifth order WENO schemes for inviscid compressible flows
Numer. Algorithms (2020), 10.1007/s11075-020-01039-9
Google Scholar
[14]
E.F. Toro
Riemann Solvers and Numerical Methods for Fluid Dynamics: A Practical Introduction
(3rd edition), Springer Verlag (2009)
Google Scholar
[15]
W.S. Don, R. Borges
Accuracy of the weighted essentially non-oscillatory conservative finite difference schemes
J. Comput. Phys., 250 (2013), pp. 347-372, 10.1016/j.jcp.2013.05.018
http://www.sciencedirect.com/science/article/pii/S0021999113003501
View PDF View article View in Scopus Google Scholar
[16]
F. Acker, R.B. Borges, B. Costa
An improved WENO-Z scheme
J. Comput. Phys., 313 (January 2016), pp. 726-753, 10.1016/j.jcp.2016.01.038
View PDF View article View in Scopus Google Scholar
[17]
G.A. Sod
A survey of several finite difference methods for systems of nonlinear hyperbolic conservation laws
J. Comput. Phys., 27 (1) (1978), pp. 1-31, 10.1016/0021-9991(78)90023-2
View PDF View article View in Scopus Google Scholar
[18]
P.D. Lax
Weak solutions of nonlinear hyperbolic equations and their numerical computation
Commun. Pure Appl. Math., 7 (1) (1954), pp. 159-193, 10.1002/cpa.3160070112
View in Scopus Google Scholar
[19]
B. Einfeldt, C.D. Munz, P.L. Roe, B. Sjögreen
On Godunov-type methods near low densities
J. Comput. Phys., 92 (2) (1991), pp. 273-295, 10.1016/0021-9991(91)90211-3
https://ac.els-cdn.com/0021999191902113/1-s2.0-0021999191902113-main.pdf?_tid=287ef5dc-0d80-11e8-b258-00000aacb35f&acdnat=1518170652_513ed5eb86a2c3a005968ac90b3598bc
View PDF View article View in Scopus Google Scholar
[20]
P. Woodward, P. Colella
The numerical simulation of two-dimensional fluid flow with strong shocks
J. Comput. Phys., 54 (1) (1984), pp. 115-173
https://www.sciencedirect.com/science/article/pii/0021999184901426
View PDF View article View in Scopus Google Scholar
[21]
C.-W. Shu, S. Osher
Efficient Implementation of Essentially Non-oscillatory Shock-Capturing Schemes, II
Springer (1989), pp. 328-374
http://link.springer.com/chapter/10.1007/978-3-642-60543-7_14
Crossref Google Scholar
[22]
S. Pirozzoli
On the spectral properties of shock-capturing schemes
J. Comput. Phys., 219 (2) (2006), pp. 489-497, 10.1016/j.jcp.2006.07.009
View PDF View article View in Scopus Google Scholar
[23]
J. Bradbury, R. Frostig, P. Hawkins, M.J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. VanderPlas, S. Wanderman-Milne, Q. Zhang
JAX: composable transformations of Python+NumPy programs
http://github.com/google/jax (2018)
Google Scholar

Cited by (12)

JAX-Fluids 2.0: Towards HPC for differentiable CFD of compressible two-phase flows
2025, Computer Physics Communications
In our effort to facilitate machine learning-assisted computational fluid dynamics (CFD), we introduce the second iteration of JAX-Fluids. JAX-Fluids is a Python-based fully-differentiable CFD solver designed for compressible single- and two-phase flows. In this work, the first version is extended to incorporate high-performance computing (HPC) capabilities. We introduce a parallelization strategy utilizing JAX primitive operations that scales efficiently on GPU (up to 512 NVIDIA A100 graphics cards) and TPU (up to 1024 TPU v3 cores) HPC systems. We further demonstrate stable parallel computation of automatic differentiation gradients across extended integration trajectories. The new code version offers enhanced two-phase flow modeling capabilities. In particular, a five-equation diffuse-interface model is incorporated which complements the level-set sharp-interface model. Additional algorithmic improvements include positivity-preserving limiters for increased robustness, support for stretched Cartesian meshes, refactored I/O handling, comprehensive post-processing routines, and an updated list of state-of-the-art high-order numerical discretization schemes. We verify newly added numerical models by showcasing simulation results for single- and two-phase flows, including turbulent boundary layer and channel flows, air-helium shock bubble interactions, and air-water shock drop interactions.
PROGRAM SUMMARY
Program Title: JAX-Fluids
CPC Library link to program files: https://doi.org/10.17632/pzvkwn5s6p.2
Developer's repository link: https://github.com/tumaer/JAXFLUIDS
Licensing provisions: GPLv3
Programming language: Python, JAX
Supplementary material: Source code, example scripts, videos
Journal reference of previous version: D.A. Bezgin, A.B. Buhendwa, N.A. Adams, JAX-Fluids: A fully-differentiable high-order computational fluid dynamics solver for compressible two-phase flows, Computer Physics Communications 282 (2022) 108527.
Does the new version supersede the previous version?: Yes
Reasons for the new version: New features and updates of the CFD solver
Summary of revisions:
•
JAX primitives-based parallelization for GPU and TPU clusters
•
Automatic differentiation through distributed simulations
•
Diffuse-interface model for two-phase flows
•
Positivity-preserving interpolation and flux limiters
•
Support for stretched Cartesian meshes
•
Extended list of numerical discretization schemes
•
Performance improvements
•
Revised I/O handling
Nature of problem: The compressible Navier-Stokes equations describe continuum-scale fluid flows which may exhibit complex phenomena such as shock waves, material interfaces, and turbulence. The accurate numerical solution of fluid flows is computationally expensive and, therefore, requires high-performance computing (HPC) architectures. To this end, machine learning (ML), in particular differentiable programming, is continuously being explored as a tool to accelerate conventional computational fluid dynamics (CFD). With the second iteration of JAX-Fluids, we provide a comprehensive differentiable CFD code that scales efficiently on HPC systems, seamlessly integrates ML models, and accurately simulates complex flow physics with high-order low-dissipative numerical methods.
Solution method: JAX-Fluids is a finite-volume solver which uses high-order low-dissipative shock capturing schemes in combination with approximate Riemann solvers. Two-phase flows can be simulated using the sharp-interface level-set method or the diffuse-interface five-equation model. The code is written in Python and builds on the JAX library. The JAX backend allows the computation of automatic differentiation gradients. We use a homogenous domain decomposition ansatz to implement the parallelization. An object-oriented programming style and a modular design philosophy allow exchanging numerical schemes and integrating custom subroutines.
Additional comments including restrictions and unusual features: JAX-Fluids runs on CPUs, GPUs, and TPUs in single- and multi-device settings. JAX-Fluids requires open-source third-party Python libraries which are automatically installed. The solver has been tested on Linux and macOS operating systems.
A deep reinforcement learning framework for dynamic optimization of numerical schemes for compressible flow simulations
2023, Journal of Computational Physics
Citation Excerpt :
Data-driven methods based on artificial neural networks (ANNs) or Gaussian processes (GPs) gain significance in CFD due to their ability to learn complex, nonlinear relation from data. They have been integrated with the numerical solvers to produce physically consistent solutions [20–26]. Due to different characteristics of subgrid scales, traditional optimization strategies, such as those based on spectral properties [7] or based on evolutionary [15] as well as Bayesian optimization [24,25], may not be effective in finding the optimal solutions.
Marginal or under-resolved simulations of compressible flow configurations that often occur in practical applications classically are enabled by administering sufficient numerical dissipation to keep the simulation stable. Such measures, however, often are physically inconsistent due to non-selectively altering of dynamics across scales. Sustaining physically consistent large scale dynamics requires the numerical solution to effectively model non-resolved small scale dynamics. In this work, we propose a general deep-reinforcement-learning framework for devising an agent to interact with high-resolution scheme in order to balance dissipation and dispersion such that physically consistent modeling of non-resolved scales is achieved. A densely distributed reward function without involving labeled data is defined. The agent is trained on low-resolution uniform grids that capture the dominant flow structures. We demonstrate that it can be applied directly to high-resolution simulations without the need for retraining or fine-tuning, thereby, demonstrating significantly improved modeling performance compared to empirically designed high-resolution schemes. The proposed methodology opens a new path for self-adaptive numerical solutions whose truncation errors act as physically consistent model for unresolved scales of widely differing flow configurations.
JAX-Fluids: A fully-differentiable high-order computational fluid dynamics solver for compressible two-phase flows
2023, Computer Physics Communications
Citation Excerpt :
Upon proper training, they are then plugged into an existing CFD solver for evaluation of down-stream tasks. Examples include training of explicit subgrid scale models in large eddy simulations [33], interface reconstruction in multiphase flows [34,35], and cell face reconstruction in shock-capturing schemes [36,37]. Although the offline training of ML models is relatively easy, there are several drawbacks to this approach.
Physical systems are governed by partial differential equations (PDEs). The Navier-Stokes equations describe fluid flows and are representative of nonlinear physical systems with complex spatio-temporal interactions. Fluid flows are omnipresent in nature and engineering applications, and their accurate simulation is essential for providing insights into these processes. While PDEs are typically solved with numerical methods, the recent success of machine learning (ML) has shown that ML methods can provide novel avenues of finding solutions to PDEs. ML is becoming more and more present in computational fluid dynamics (CFD). However, up to this date, there does not exist a general-purpose ML-CFD package which provides 1) powerful state-of-the-art numerical methods, 2) seamless hybridization of ML with CFD, and 3) automatic differentiation (AD) capabilities. AD in particular is essential to ML-CFD research as it provides gradient information and enables optimization of preexisting and novel CFD models. In this work, we propose JAX-Fluids: a comprehensive fully-differentiable CFD Python solver for compressible two-phase flows. JAX-Fluids is intended for ML-supported CFD research. The framework allows the simulation of complex fluid dynamics with phenomena like three-dimensional turbulence, compressibility effects, and two-phase flows. Written entirely in JAX, it is straightforward to include existing ML models into the proposed framework. Furthermore, JAX-Fluids enables end-to-end optimization. I.e., ML models can be optimized with gradients that are backpropagated through the entire CFD algorithm, and therefore contain not only information of the underlying PDE but also of the applied numerical methods. We believe that a Python package like JAX-Fluids is crucial to facilitate research at the intersection of ML and CFD and may pave the way for an era of differentiable fluid dynamics.
Program title: JAX-Fluids
CPC Library link to program files: https://doi.org/10.17632/pzvkwn5s6p.1
Developer's repository link: https://github.com/tumaer/JAXFLUIDS
Code Ocean capsule: https://codeocean.com/capsule/6819679
Licensing provisions: GNU GPLv3
Programming language: Python
Supplementary material: Source code; Examples; Videos: Moving solid bodies, Taylor-Green vortex, Rising bubble, Shock-bubble interaction.
Nature of problem: The compressible Navier-Stokes equations describe continuum-scale fluid flows. These flows often involve highly complex flow phenomena such as shocks, material interfaces, and turbulence. The intrinsic nonlinear dynamics render the numerical simulation of the these equations challenging. Machine learning provides novel avenues for describing partial differential equations. Machine learning models rely on gradient information provided by automatic differentiation and are often implemented in Python. In contrast, existing high-performance computational fluid dynamics codes are typically written in Fortran or C++ and do not offer inherent automatic differentiation capabilities. These discrepancies hinder the advance of machine-learning-supported computational fluid dynamics. Up to this day, a general-purpose fully-differentiable computational fluid dynamics solver for compressible two-phase flows is missing.
Solution method: We introduce JAX-Fluids: a general-purpose three-dimensional fully-differentiable computational fluid dynamics solver for compressible two-phase flows. JAX-Fluids is a simulation framework intended for machine-learning-supported computational fluid dynamics research. Our framework is written entirely in JAX, a high-performance numerical computing library with automatic differentiation capabilities. We have used an object-oriented programming style and a modular design philosophy. This allows the straightforward exchange of numerical methods. We provide a wide variety of state-of-the-art high-order computational methods for compressible flows. The modularity of our framework additionally facilitates the integration of custom subroutines. We use the sharp-interface level-set method to model two-phase flows. The software package can easily be installed as a Python package. We have build the source code around the JAX NumPy API. This makes JAX-Fluids accessible and performant. JAX-Fluids runs on CPUs, GPUs, and TPUs. We use HDF5 in combination with XDMF for writing output quantities. The Python packages Haiku and Optax are used for implementation and training of machine learning methods.
Additional comments including restrictions and unusual features: JAX-Fluids relies on open-source third-party Python libraries. These are automatically installed. In the current version, JAX-Fluids only runs on a single accelerator (CPU/GPU/TPU). Future versions will include support for parallel execution. JAX-Fluids has been tested on Linux and macOS operating systems.
Weak baselines and reporting biases lead to overoptimism in machine learning for fluid-related partial differential equations
2024, Nature Machine Intelligence
Review of the High-Order TENO Schemes for Compressible Gas Dynamics and Turbulence
2023, Archives of Computational Methods in Engineering
BAYESIAN OPTIMIZATION ON FIFTH-ORDER TARGETED ENO SCHEME FOR COMPRESSIBLE FLOWS
2022, WCCM-APCOM 2022 - 15th World Congress on Computational Mechanics and 8th Asian Pacific Congress on Computational Mechanics: Pursuing the Infinite Potential of Computational Mechanics

View all citing articles on Scopus

View Abstract

[1] [1]
G.S. Jiang, C.W. Shu
Efficient implementation of weighted ENO schemes
J. Comput. Phys., 126 (1) (1996), pp. 202-228, 10.1006/jcph.1996.0130
View PDF View article View in Scopus Google Scholar

[2] [2]
A.K. Henrick, T.D. Aslam, J.M. Powers
Mapped weighted essentially non-oscillatory schemes: achieving optimal order near critical points
J. Comput. Phys., 207 (2) (2005), pp. 542-567, 10.1016/j.jcp.2005.01.023
http://linkinghub.elsevier.com/retrieve/pii/S0021999105000409
View PDF View article View in Scopus Google Scholar

[3] [3]
R. Borges, M. Carmona, B. Costa, W.S. Don
An improved weighted essentially non-oscillatory scheme for hyperbolic conservation laws
J. Comput. Phys., 227 (6) (2008), pp. 3191-3211, 10.1016/j.jcp.2007.11.038
View PDF View article View in Scopus Google Scholar

[4] [4]
X. Hu, Q. Wang, N. Adams
An adaptive central-upwind weighted essentially non-oscillatory scheme
J. Comput. Phys., 229 (23) (2010), pp. 8952-8965, 10.1016/j.jcp.2010.08.019
http://linkinghub.elsevier.com/retrieve/pii/S0021999110004560
View PDF View article View in Scopus Google Scholar

[5] [5]
X. Hu, N. Adams
Scale separation for implicit large eddy simulation
J. Comput. Phys., 230 (19) (2011), pp. 7240-7249, 10.1016/j.jcp.2011.05.023
http://linkinghub.elsevier.com/retrieve/pii/S0021999111003342
View PDF View article View in Scopus Google Scholar

[6] [6]
L. Fu, X.Y. Hu, N.A. Adams
A family of high-order targeted ENO schemes for compressible-fluid simulations
J. Comput. Phys., 305 (2016), pp. 333-359, 10.1016/j.jcp.2015.10.037
View PDF View article View in Scopus Google Scholar

[7] [7]
D.A. Bezgin, S.J. Schmidt, N.A. Adams
A data-driven physics-informed finite-volume scheme for nonclassical undercompressive shocks
J. Comput. Phys., 437 (2021), Article 110324, 10.1016/j.jcp.2021.110324
https://linkinghub.elsevier.com/retrieve/pii/S0021999121002199
View PDF View article View in Scopus Google Scholar

[8] [8]
B. Stevens, T. Colonius
Enhancement of shock-capturing methods via machine learning
Theor. Comput. Fluid Dyn., 34 (4) (2020), pp. 483-496, 10.1007/s00162-020-00531-1
arXiv:2002.02521
http://arxiv.org/abs/2002.02521
View in Scopus Google Scholar

[9] [9]
C.-W. Shu
Essentially non-oscillatory and weighted essentially non-oscillatory schemes for hyperbolic conservation laws
Adv. Numer. Approx. Nonlin. Hyperbol. Eq., 97 (1998), pp. 325-432, 10.1007/bfb0096355
Google Scholar

[10] [10]
D.P. Kingma, J.L. Ba
Adam: a method for stochastic optimization
3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, International Conference on Learning Representations, ICLR (2015)
arXiv:1412.6980
Google Scholar

[11] [11]
P. Ramachandran, B. Zoph, Q.V. Le
Searching for activation functions
arXiv:1710.05941
Google Scholar

[12] [12]
W. Xiaoshuai, Z. Yuxin
A high-resolution hybrid scheme for hyperbolic conservation laws
Int. J. Numer. Methods Fluids, 78 (3) (2015), pp. 162-187, 10.1002/fld
View in Scopus Google Scholar

[13] [13]
N.R. Gande, A.A. Bhise
Modified third and fifth order WENO schemes for inviscid compressible flows
Numer. Algorithms (2020), 10.1007/s11075-020-01039-9
Google Scholar

[14] [14]
E.F. Toro
Riemann Solvers and Numerical Methods for Fluid Dynamics: A Practical Introduction
(3rd edition), Springer Verlag (2009)
Google Scholar

[15] [15]
W.S. Don, R. Borges
Accuracy of the weighted essentially non-oscillatory conservative finite difference schemes
J. Comput. Phys., 250 (2013), pp. 347-372, 10.1016/j.jcp.2013.05.018
http://www.sciencedirect.com/science/article/pii/S0021999113003501
View PDF View article View in Scopus Google Scholar

[16] [16]
F. Acker, R.B. Borges, B. Costa
An improved WENO-Z scheme
J. Comput. Phys., 313 (January 2016), pp. 726-753, 10.1016/j.jcp.2016.01.038
View PDF View article View in Scopus Google Scholar

[17] [17]
G.A. Sod
A survey of several finite difference methods for systems of nonlinear hyperbolic conservation laws
J. Comput. Phys., 27 (1) (1978), pp. 1-31, 10.1016/0021-9991(78)90023-2
View PDF View article View in Scopus Google Scholar

[18] [18]
P.D. Lax
Weak solutions of nonlinear hyperbolic equations and their numerical computation
Commun. Pure Appl. Math., 7 (1) (1954), pp. 159-193, 10.1002/cpa.3160070112
View in Scopus Google Scholar

[19] [19]
B. Einfeldt, C.D. Munz, P.L. Roe, B. Sjögreen
On Godunov-type methods near low densities
J. Comput. Phys., 92 (2) (1991), pp. 273-295, 10.1016/0021-9991(91)90211-3
https://ac.els-cdn.com/0021999191902113/1-s2.0-0021999191902113-main.pdf?_tid=287ef5dc-0d80-11e8-b258-00000aacb35f&acdnat=1518170652_513ed5eb86a2c3a005968ac90b3598bc
View PDF View article View in Scopus Google Scholar

[20] [20]
P. Woodward, P. Colella
The numerical simulation of two-dimensional fluid flow with strong shocks
J. Comput. Phys., 54 (1) (1984), pp. 115-173
https://www.sciencedirect.com/science/article/pii/0021999184901426
View PDF View article View in Scopus Google Scholar

[21] [21]
C.-W. Shu, S. Osher
Efficient Implementation of Essentially Non-oscillatory Shock-Capturing Schemes, II
Springer (1989), pp. 328-374
http://link.springer.com/chapter/10.1007/978-3-642-60543-7_14
Crossref Google Scholar

[22] [22]
S. Pirozzoli
On the spectral properties of shock-capturing schemes
J. Comput. Phys., 219 (2) (2006), pp. 489-497, 10.1016/j.jcp.2006.07.009
View PDF View article View in Scopus Google Scholar

[23] [23]
J. Bradbury, R. Frostig, P. Hawkins, M.J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. VanderPlas, S. Wanderman-Milne, Q. Zhang
JAX: composable transformations of Python+NumPy programs
http://github.com/google/jax (2018)
Google Scholar