Energy-Based PDE Solutions with DNNs
b Institute of Structural Mechanics, Bauhaus-Universität Weimar, Marienstraße 15, 99423 Weimar
c Institute of Continuum Mechanics, Leibniz Universität Hannover, Appelstraße 11, 30167 Hannover, Germany
Abstract
Partial differential equations (PDEs) are fundamental to the mathematical modelling of many phenomena in science and engineering. Solving them is a crucial step towards a precise knowledge of the behaviour of natural and engineered systems. In general, analytical methods are not enough to solve PDEs that represent real systems to an acceptable degree, and one has to resort to discretization methods. For engineering problems, probably the best known option is the finite element method (FEM), although powerful alternatives such as mesh-free methods and isogeometric analysis (IGA) are also available. The fundamental idea is to approximate the solution of the PDE by means of functions specifically built to have some desirable properties. In this contribution, we explore deep neural networks (DNNs) as an option for approximation. They have shown impressive results in areas such as visual recognition. DNNs are regarded here as function approximation machines. There is great flexibility in defining their structure, and important advances in their architecture and in the efficiency of the algorithms that implement them make DNNs a very interesting alternative for approximating the solution of a PDE. We concentrate on applications of interest to computational mechanics. Most contributions exploring this possibility have adopted a collocation strategy. Here, instead, we concentrate on mechanical problems and analyze the energetic format of the PDE: the energy of a mechanical system seems to be the natural loss function for a machine learning method approaching a mechanical problem. As proofs of concept, we deal with several problems and explore the capabilities of the method for applications in engineering.
Keywords: Physics informed, Deep neural networks, Energy approach
∗
Corresponding author.
Email address: [Link]@[Link] (T. Rabczuk)
1. Introduction
Computational mechanics aims at solving mechanical problems using computer methods.
These mechanical problems can originate from the study of either natural or engineered
systems. In order to describe their behaviour in a precise manner, mathematical models
have to be devised. In engineering applications, these mathematical models are often based
on partial differential equations (PDEs). When realistic models are considered, one has to
resort to numerical methods to solve them. The idea is to look for an approximate solution
for the problem, in a finite-dimensional space. Then, the problem reduces to find a finite set
of parameters that define this approximate solution. Conventional ways to tackle the solution
of PDEs are the finite element method (FEM) [1], mesh-free methods [2], and isogeometric
analysis [3].
In recent times, the use of deep neural networks (DNNs) has led to outstanding achievements in several areas, such as visual recognition [4]. Feed-forward neural networks are devised to approximate target functions. The approximating functions depend on certain parameters (the weights and the biases) that have to be "learned" by means of a "training" process. It is therefore conceivable that neural networks may be used to approximate the solution of a PDE. This is the perspective that we adopt with respect to DNNs in this article: they are function approximation machines.
The success of DNNs in important learning tasks can be related to the development of
very powerful computational tools. Libraries such as Tensorflow [5] and PyTorch [6] provide
the building blocks to devise learning machines for very different problems. Interfaces that allow coding in readable languages such as Python, together with the availability of optimized numerical algorithms, lead to a near-mathematical notation, reminiscent of computing platforms like FEniCS [7].
The idea of using DNNs to solve PDEs has been pursued in several works [8]. In general, however, they have dealt with the strong form of the PDE, leading to an approach based on collocation [9]. The training process then relies on devising an objective function, the empirical loss function, whose minimization leads to the fulfilment of the governing equations. Moreover, the problems dealt with are generally not directly related to engineering applications.
A way to approach the solution of a PDE is to write it as a variational problem [10]. From
the mechanical point of view, the corresponding functional has the meaning of an energy.
Given the fact that the training process in machine learning can be regarded as a process of
minimizing the loss function, it seems natural to regard the energy of the system as a very
good candidate for this loss function. In addition, the near-mathematical syntax achieved by the platforms associated with TensorFlow and PyTorch implies a high degree of readability and ease of implementation of the PDE solver. Moreover, once the variational problem is approximated in a finite-dimensional space, it becomes an optimization problem, which is very convenient given that machine learning libraries are especially oriented towards optimization techniques.
It is worth mentioning that our approach aims at solving PDEs by means of DNNs as an approximation strategy. In that respect, what we propose is different from approaches such as [11], which use labeled data from numerical simulations (although it could be obtained from experiments, in principle) to help the solution of a boundary value problem in some specific aspect, where detailed knowledge of the phenomenon being modelled is lacking. For instance, [12] replaces the constitutive model with a data-driven model. In summary, they use machine learning to build a surrogate model, while we use it to build the approximation space.
In this work, we explore the possibility of a DNN-based solver for PDEs. The paper is organized as follows. In Section 2, we introduce the reader to the generalized problem setup. Section 3 provides a brief introduction to DNNs. The strategies for solving PDEs with DNNs based on the collocation method and the deep energy method are explained in Section 4. The implementation is explained in Section 5, using snippets from the code. Some representative applications in computational mechanics are tackled in Section 6 to explore the possibilities of this approach. Finally, Section 7 concludes the study by summarizing the key results of the present work.
which for sufficiently smooth u can be solved by a collocation-type method as detailed in
Section 4.
where u is constrained by Dirichlet boundary conditions. Here, we will assume that the total variational energy of the system, E, is such that there exists a unique solution of the problem defined in Eq. (6), and that it coincides with the solution of Eq. (1). In that case, Eq. (1) is called the Euler-Lagrange equation of the variational problem defined in Eq. (6).
The first step to go from the variational energy formulation to the corresponding strong
form is to find a stationary state of E by equating its first variation to zero:
δE[u, δu] = (d/dτ) E[u + τ δu] |_{τ=0} = 0,    (7)
which has to hold for any admissible δu (i.e., for any smooth enough function having homoge-
neous boundary conditions on the Dirichlet part of the boundary). The resulting expression
is the so-called weak form of the problem defined by Eq. (1) plus appropriate boundary
conditions. In mechanical problems, this corresponds to the principle of virtual work. The
weak form is the point of departure of important discretization methods such as the FEM.
A characteristic feature of the approach used in this contribution is that it deals directly with the minimization of the variational energy E, circumventing the need to explicitly derive the weak form from the energy of the system.
In order to illustrate this approach, let us again consider a linear elastic body. One can define the stored elastic strain energy, Ψ(ε), as:

Ψ(ε) = (1/2) ε : C : ε.    (8)
Notice that the Cauchy stress tensor, σ(ε), can be computed as follows:

σ(ε) = ∂Ψ/∂ε.    (9)
Then, for the case of zero body forces and homogeneous Neumann boundary conditions,
we can define the energy of the solid as
E[u] = ∫_Ω Ψ(ε(u)) dΩ.    (10)
Given an input x, an ANN would map it to an output up (x). The functional structure
of the ANN is defined by a set of parameters (which can be collected in a vector, p). Deep
Neural Networks are ANNs obtained by means of the composition of simple functions:
up (x) = AL (σ(AL−1 (σ(. . . σ(A1 (x)) . . .)))), (11)
where A_l (with l = 1, 2, . . . , L) are affine mappings and σ is the activation function, applied element-wise. It is important to note that the non-linearity of DNNs comes from the activation function. Due to the graphical representation of the composition, each of these functions is called a layer, where L denotes the number of layers of the network. So, loosely speaking, we say that a DNN is an ANN with more than one layer, barring the input and the output layers.
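To make the composition in Eq. (11) concrete, here is a minimal NumPy sketch of the forward pass; the tiny 1-2-1 network and its weights are illustrative choices, not values from the paper.

```python
import numpy as np

def relu(z):
    # element-wise activation function sigma
    return np.maximum(z, 0.0)

def dnn_forward(x, weights, biases):
    # Evaluate u_p(x) = A_L(sigma(A_{L-1}(... sigma(A_1(x)) ...))):
    # every hidden layer is an affine map followed by the activation,
    # the output layer is an affine map only.
    a = x
    for W, b in zip(weights[:-1], biases[:-1]):
        a = relu(a @ W + b)
    return a @ weights[-1] + biases[-1]

# Hypothetical 1-2-1 network; these weights happen to realize u_p(x) = |x|.
weights = [np.array([[1.0, -1.0]]), np.array([[1.0], [1.0]])]
biases = [np.zeros(2), np.zeros(1)]
u = dnn_forward(np.array([[2.0]]), weights, biases)   # u[0, 0] = 2.0
```

Note that even this two-layer ReLU network produces a non-smooth, piecewise-linear function, which is why the choice of activation matters for approximating PDE solutions.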
Each affine mapping Al can be defined by a (in general not square) matrix and a vector.
The elements of the matrix are called weights and the elements of the vector are called biases.
They together parametrize the function up (x), and can be grouped in a vector p. In the
following, we consider feed-forward fully-connected DNNs, where each neuron is connected
to all the neurons in the adjacent layers. Other types of neural networks can be considered
as well, such as convolutional neural networks [13], where the layers and the connections between them are organized in a hierarchical structure.
Once the architecture of the network has been decided, that is, once the number of layers,
the number of neurons per layer, and the activation functions have been frozen, defining
up (x) boils down to determining p. The process of determining these parameters (the
weights and the biases) is called training the network. Usually, when solving problems
with machine learning, this entails having a large amount of input and output data. In the
approach used here, data for training are generated from evaluations of the PDE at given points in the domain, as well as at certain points on the boundary. This issue is explained in more detail below.
4. Solution strategies
4.1. Collocation Methods
Collocation methods are based on the solution of the BVP in strong form. Given a set of
points {x1 , x2 , . . . , xn } in the domain Ω, the idea is to evaluate N [up ] at each point xi and
construct a loss function whose optimization tends to impose the condition N [up ](xi ) = 0.
For instance, loss functions based on the mean square error of N [up ](xi ) can be used.
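For illustration (a sketch, not the paper's implementation), such a mean-squared collocation loss can be written as follows; the operator N[u] = u'' + u and the trial u_p(x) = sin(x) are hypothetical choices.

```python
import numpy as np

def collocation_loss(residual, points):
    # Mean squared error of the strong-form residual N[u_p] at the
    # collocation points x_1, ..., x_n.
    r = np.array([residual(x) for x in points])
    return np.mean(r ** 2)

# Hypothetical operator N[u] = u'' + u with the trial u_p(x) = sin(x):
# the residual -sin(x) + sin(x) vanishes, so the loss is zero at the
# exact solution and positive otherwise.
residual = lambda x: -np.sin(x) + np.sin(x)
points = np.linspace(0.0, np.pi, 20)
loss = collocation_loss(residual, points)   # 0.0
```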
where up(xi) is the displacement approximation function evaluated at the integration point xi, and wi its corresponding weight. A key issue is the prescription of the boundary conditions. In a variational energy formulation, Neumann boundary conditions can in principle be imposed in a natural way. However, Dirichlet boundary conditions have to be imposed in a somewhat more involved way, since DNN approximation functions do not fulfill the Kronecker delta property. We will address these issues in Section 6.
In machine learning, first order methods are almost universal. In fact, one may say that most
optimization strategies in machine learning are variants of the so-called gradient descent
method. A typical iteration of this method is the following:
p_{k+1} = p_k − γ_k ∇_p ( Σ_i f_i(p_k) ),    (15)
where u_p^k is the DNN approximation of the unknown u parametrized by the vector p_k, which groups all the weights and biases at iteration k. In addition, γ_k stands for a parameter called the learning rate. Several variations of this algorithm can be considered. First of all, not all the
points are always considered. In fact, stochasticity in the selection of the points considered
in the finite sum is a general practice, resulting in the so-called stochastic gradient descent
(SGD) algorithm. Acceleration strategies are also very common. Another very common
variant implies the addition of a penalty term to avoid unbounded values of the parameters.
Sometimes, even some approximation of second order information is included, for example,
by means of the L-BFGS method, a quasi-Newton method.
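A toy sketch of one descent iteration follows; the quadratic per-point losses and their targets are hypothetical, chosen so that the minimizer of the summed loss is known in advance.

```python
import numpy as np

def gradient_descent_step(p, grad_f, batch, lr):
    # One descent iteration: p is updated against the summed gradient of the
    # per-point loss contributions f_i. Sampling `batch` randomly from all
    # points would give stochastic gradient descent (SGD).
    g = sum(grad_f(p, i) for i in batch)
    return p - lr * g

# Hypothetical per-point losses f_i(p) = (p - t_i)^2 / 2 with known targets,
# so the minimizer of the summed loss is the mean of the targets.
targets = np.array([1.0, 3.0])
grad_f = lambda p, i: p - targets[i]
p = 0.0
for _ in range(100):
    p = gradient_descent_step(p, grad_f, batch=[0, 1], lr=0.1)
# p has converged to the mean of the targets, 2.0
```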
One important issue related to the optimization problem is the structure of the search
space. Recent mathematical results describe the presence of highly unfavorable properties of
the topological structure of DNN spaces. This means that, although the space of functions
generated by a DNN can be very expressive, the structure of the search space may cause
difficulties for optimization algorithms [16]. However, some unexpected properties of gradient-descent-based algorithms seem to be able to overcome these difficulties [17], at least for some applications. This is an issue that is currently the object of intense research.
5. Implementation
In this section, we illustrate how the proposed DEM approach can be used for solving
problems in the field of continuum mechanics. The primary goal of the approach is to
obtain the field variables by solving the governing partial differential equation, which in turn requires computing the integral in Eq. (10). We do this by using machine learning
tools available in open-source libraries like TensorFlow and PyTorch. This allows us to use
collective knowledge accumulated in a very vibrant community.
We describe the typical steps that one would have to follow for a TensorFlow implemen-
tation. The idea is to illustrate the general procedure. Specific details for each application
are given below. The full source code will be published on GitHub or can be obtained by
contacting the authors.
First, the geometry is generated using uniformly spaced points in the whole domain to obtain the training points, Xf, and the prediction points, xpred. Before we start to train the
network, we initialize the weights of the network randomly from a Gaussian distribution
using the Xavier initialization technique [18]. The network is initialized in the following
manner:
def initialize_NN(self, layers):
    # 'layers' lists the number of neurons per layer, e.g. [2, 30, 30, 30, 2]
    weights = []
    biases = []
    num_layers = len(layers)
    for l in range(0, num_layers - 1):
        # Xavier initialization for the weights, zeros for the biases
        W = self.xavier_init(size=[layers[l], layers[l + 1]])
        b = tf.Variable(tf.zeros([1, layers[l + 1]]), dtype=tf.float32)
        weights.append(W)
        biases.append(b)
    return weights, biases
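The helper self.xavier_init used above is not shown in the snippet; a possible NumPy sketch of the Xavier (Glorot) initialization it refers to [18] is given below (in the actual TensorFlow code it would return a tf.Variable rather than a plain array).

```python
import numpy as np

def xavier_init(size):
    # Xavier (Glorot) initialization: zero-mean Gaussian with standard
    # deviation sqrt(2 / (fan_in + fan_out)), as proposed in [18].
    fan_in, fan_out = size
    std = np.sqrt(2.0 / (fan_in + fan_out))
    return np.random.normal(0.0, std, size=(fan_in, fan_out))

W = xavier_init([30, 30])   # one 30x30 hidden-layer weight matrix
```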
6. Applications
In this section, we explore the application of DEM to solve PDEs in various domains of continuum mechanics. The first subsection considers a one-dimensional problem, and the results obtained using DEM are compared to the available analytical solution. Later, we discuss the application of DEM to linear elasticity and hyperelasticity. In the latter part of the section, we solve PDEs for coupled problems: we address phase-field modeling of fracture, piezoelectricity and, lastly, the bending of a Kirchhoff plate.
The implementation has been carried out using the TensorFlow framework [19]. Details on the network architecture, such as the number of layers, the number of neurons in each layer, and the activation function used, have been provided with each example.
These results have been extended to more general Finite Element spaces using Sobolev norms
in [21]. Here we illustrate the relationship between FE approximations and DNNs by means
of a one-dimensional example, as done in [20].
Let us consider a discretization of the interval [0, l] in the real line: x0 < x1 < . . . < xn ,
where x0 = 0 and xn = l. Then, consider a piecewise linear FE approximation of a function
u(x) taking values on [0, l]:
u_h(x) = Σ_{i=0}^{n} u_i N_i(x),    (16)
where Ni (x) is a linear hat function with compact support in the subinterval [xi−1 , xi+1 ], such
that Ni (xi ) = 1. For i = 0 and i = n, Ni (x) has compact support in [x0 , x1 ] and [xn−1 , xn ],
respectively. Function N_i(x), for intermediate nodes, can be expressed in terms of rectified linear units (ReLU) as follows:
with
w_i = ∆_{i+1}/h_{i+1} − ∆_i/h_i,    (21)
Notice that uh is parametrized by the weights wi , which can be grouped in a vector w,
and by the position of the nodes xi (the biases of the hidden layer), which can be grouped
in the vector x_p. For fixed x_p, and prescribing u_0, the values of the elements of w are defined by the parameters u_i, i.e., the values of u_h evaluated at the nodes x_i.
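To illustrate this correspondence, the following NumPy sketch (an illustration, not code from the paper) evaluates a piecewise-linear FE interpolant as a one-hidden-layer ReLU network with the weights of Eq. (21); the nodes and nodal values are hypothetical.

```python
import numpy as np

relu = lambda z: np.maximum(z, 0.0)

def fe_interpolant_as_relu_net(x, nodes, values):
    # Piecewise-linear FE interpolant written as a one-hidden-layer ReLU
    # network: u_h(x) = u_0 + sum_i w_i ReLU(x - x_i), where the weights
    # w_i of Eq. (21) are differences of slopes of adjacent elements.
    slopes = np.diff(values) / np.diff(nodes)   # element slopes Delta_i / h_i
    w = np.diff(slopes, prepend=0.0)            # w_i = slope jump at node x_i
    return values[0] + sum(wi * relu(x - xi) for wi, xi in zip(w, nodes[:-1]))

# Hypothetical nodes and nodal values; the ReLU form interpolates them exactly.
nodes = np.array([0.0, 1.0, 2.5, 4.0])
values = np.array([0.0, 2.0, 1.0, 3.0])
u_mid = fe_interpolant_as_relu_net(2.5, nodes, values)   # 1.0
```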
Consider the approximation in Eq. (20). Then, if Ψ(ε) is the one-dimensional linear elastic energy density, we can define an energy

E(w, x_p) = ∫_0^l Ψ(ε(u_h(x; w, x_p))) dx,    (22)
in terms of the weights vector w and a vector containing the positions of the nodes, x_p. For fixed x_p, the problem would be equivalent to solving a standard FE problem (remember the relationship between w and the nodal FE unknowns). When x_p is not fixed, from the mathematical point of view, we would be solving an adaptive FE problem. However, since the problem is now non-linear (and probably non-convex), there may be issues related to the use of typical machine learning algorithms to optimize the loss function defined in Eq. (23).
Although this 1D problem is illustrative of the meaning of the parameters of a DNN approximation, it is also clear that, in general situations, assigning physical meaning to these parameters would be very difficult. This may be a consequence of the mesh-free character of DNN approximations for the solution of PDEs. This has several implications.
Collocation points, for instance, are not to be confused with nodes in a Finite Element mesh.
Points in a FE mesh are tightly related to the parameters of the approximation space, while
collocation points are the inputs for the training process in the DNN approximation.
6.2.1. Pressurized thick-cylinder
The first example considers the benchmark problem of a thick cylinder subjected to internal pressure under plane stress conditions [22]. Due to symmetry in the geometry and
boundary conditions, only a quarter section represented by an annulus is analyzed. The
cylinder is subjected to internal pressure applied on the inner circular edge. The geometrical
setup and the boundary conditions are shown in Fig. 1(a). In this example, we have considered E = 1 × 10⁵ and ν = 0.3. The analytical solution, in terms of the stress components,
is [23]:
σ_rr = (R_a² P)/(R_b² − R_a²) (1 − R_b²/r²),
σ_θθ = (R_a² P)/(R_b² − R_a²) (1 + R_b²/r²),
σ_rθ = 0,    (25)
u_rad = (R_a² P r)/(E (R_b² − R_a²)) [1 − ν + (R_b²/r²)(1 + ν)],
u_exact = u_rad cos θ,
v_exact = u_rad sin θ,

where R_a and R_b represent the inner and the outer radius of the cylinder, respectively, with R_a = 1 and R_b = 4. P is the pressure exerted along the inner circular edge. In Eq. (25), u_exact and v_exact are the analytical solutions of the displacement field along the x and y-axis, respectively.
The minimization problem reads as stated in Eq. (65). The Dirichlet and Neumann boundary
conditions for the thick cylinder under internal pressure are:
u(0, y) = v(x, 0) = 0,
tN,x (Ra , θ) = P u, (26)
tN,y (Ra , θ) = P v,
where u and v are the solutions of the elastic field in x and y-axis, respectively and tN (Ra , θ)
denotes the traction force which is expressed in terms of the applied internal pressure and
the displacement field. To find the solution of the displacement field, the trial solution is
defined as:
u = x û,
v = y v̂,    (27)
where û and v̂ are obtained from the neural network. The solutions u and v are chosen in such a way that they satisfy the Dirichlet boundary conditions by construction, as in [24, 25]. As a consequence, no component corresponding to the boundary loss is needed in the loss function. This significantly simplifies the objective function to be minimized. We have used a network with
3 hidden layers of 30 neurons each and Nx × Ny uniformly spaced points in the interior of the
annulus, with Nx = Ny = 80. The inner circular edge is discretized using NBound uniformly
spaced points to apply the internal pressure in terms of traction force, with NBound = 80.
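The effect of the trial solution in Eq. (27) can be sketched as follows; the stand-in "network outputs" below are arbitrary illustrative functions, not outputs of the trained model.

```python
import numpy as np

def trial_displacement(x, y, u_hat, v_hat):
    # Trial solution of Eq. (27): multiplying the raw network outputs by x
    # and y makes u(0, y) = v(x, 0) = 0 hold by construction.
    return x * u_hat(x, y), y * v_hat(x, y)

# Arbitrary stand-ins for the network outputs; the symmetry conditions
# on the axes hold exactly regardless of their values.
u_hat = lambda x, y: np.cos(x) + y
v_hat = lambda x, y: x * y + 1.0
u0, v0 = trial_displacement(0.0, 2.0, u_hat, v_hat)   # u0 = 0.0
u1, v1 = trial_displacement(3.0, 0.0, u_hat, v_hat)   # v1 = 0.0
```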
For the input layer and the first two hidden layers, we have considered the ReLU2 activation function, whereas for the last layer, connected to the output layer, a linear activation function has been used.

Figure 1: Setup and numerical results for the pressurized thick cylinder example: (a) geometrical setup and boundary conditions; (b) displacement in x-axis.
Figure 2: (a) Error in the displacement field in x-axis; (b) error in the displacement field in y-axis.
where V_exact corresponds to the results obtained using the analytical solution of the field variable V, and V_err = V_exact − V_pred, where V_pred is obtained from the neural network. Relative prediction errors of 0.5% and 3.7% have been observed in the displacement and the energy norms, respectively. Fig. 2 shows the errors in the displacement field.
Figure 3: Problem setup for the plate with a circular hole example
Figure 4: (a) Displacement in x-axis; (b) displacement in y-axis; (c) error in the displacement field in x-axis; (d) error in the displacement field in y-axis.
Figure 5: Geometrical setup and boundary conditions for the application of linear elasticity to three-dimensional structures: (a) hollow sphere under internal pressure; (b) cube with a spherical hole.
where û, v̂ and ŵ are obtained from the neural network. To obtain the displacement field,
we have used a network with 3 hidden layers of 50 neurons each and Nx × Ny × Nz uniformly
spaced points in the interior of the domain, Nx = Ny = 80 and Nz = 10. The boundary
face on which the internal pressure, in terms of traction force, is applied is discretized into
NBound points, with NBound = 1600. For the first two layers, we have considered the ReLU2 activation function, whereas for the last layer, a linear activation function has been considered. For a three-dimensional problem, the loss function is computed as
L_elastic = L_int − L_neu,

L_int = (V_Ω / (N_x N_y N_z)) Σ_{i=1}^{N_x×N_y×N_z} f(x_i),

L_neu = (V_∂Ω / N_Bound) Σ_{i=1}^{N_Bound} f_neu(x_i),    (37)

f(x) = (1/2) ε : C : ε,

f_neu(x) = t_{N,x}(x) u + t_{N,y}(x) v + t_{N,z}(x) w,
where L_int and L_neu denote the elastic strain energy loss and the Neumann boundary loss, respectively, and V_Ω is the volume of the analyzed domain. The predicted displacement field and the strain energy obtained using the DEM approach are shown in Fig. 6.
Similar to the previous examples, the L2 error is computed to check the accuracy of the solution. Prediction errors of 1.05% and 4.44% have been observed in u and in the strain energy, respectively.
Figure 6: Computed solution for the hollow sphere subjected to internal pressure: (a) displacement; (b) strain energy.
hole is analyzed as shown in Fig. 5(b). The analytical stresses in spherical coordinates for
this are [28]:
σ_rr = S cos²θ + S/(7 − 5ν) [ (a³/r³)(6 − 5(5 − ν) cos²θ) + (6a⁵/r⁵)(3 cos²θ − 1) ],

σ_φφ = 3S/(2(7 − 5ν)) [ (a³/r³)(5ν − 2 + 5(1 − 2ν) cos²θ) + (a⁵/r⁵)(1 − 5 cos²θ) ],

σ_θθ = S sin²θ + S/(2(7 − 5ν)) [ (a³/r³)(4 − 5ν + 5(1 − 2ν) cos²θ) + (3a⁵/r⁵)(3 − 7 cos²θ) ],    (38)

σ_rθ = S [ −1 + 1/(7 − 5ν) ( 12a⁵/r⁵ − 5a³(1 + ν)/r³ ) ] sin θ cos θ,
where S denotes the applied uniaxial tension and a is the radius of the spherical hole. The material properties considered for this example are E = 1 × 10³ and ν = 0.3. The Dirichlet
boundary conditions are:
u(x, y, 0) = v(x, y, 0) = w(x, y, 0) = 0, (39)
where u, v and w are the solutions of the elastic field in the x, y, and z-axis, respectively. To ensure that the boundary conditions are exactly satisfied, we use the same trial function for the displacement field as stated in Eq. (36). For obtaining the displacement, we have
used a fully connected neural network with 3 hidden layers of 50 neurons each. Similar to the previous example, the ReLU2 activation function has been used for the first two layers and a linear activation function for the last layer. The domain is discretized into NInt points in the
interior of the domain, with NInt = 32000. The boundary face on which the internal pressure
in terms of traction force is applied is discretized into NBound points, with NBound = 1600.
Figure 7: Computed solution for the cube with a spherical hole: (a) displacement; (b) strain energy.
6.3. Elastodynamics
To investigate the approximation properties of neural networks for time-dependent prob-
lems, we consider a wave propagation example. In one dimension, the governing (equilibrium)
equation is of the form:

∂²u/∂t² = c² ∂²u/∂x²,  for x ∈ Ω := (a, b) and t ∈ (0, T),    (41)
together with the initial conditions
u(x, 0) = u_0(x) and u_t(x, 0) = v_0(x)  for x ∈ Ω.    (42)
Figure 8: Comparison between the computed and exact solutions for the 1D wave propagation example: (a) computed and exact displacement; (b) computed and exact velocity.
corresponding to the equilibrium equation together with the initial and boundary conditions
as:
L(p) = Lresid (p) + Linit (p) + Lbnd (p), (47)
where
L_resid(p) = (1/N_int) Σ_{i=1}^{N_int} ( û_tt(x_i^int, t_i^int; p) − c² û_xx(x_i^int, t_i^int; p) )²,    (48)

L_init(p) = (1/N_init) Σ_{j=1}^{N_init} [ ( û(x_j^init, 0; p) − u_0(x_j^init) )² + ( û_t(x_j^init, 0; p) − v_0(x_j^init) )² ],    (49)

L_bnd(p) = (1/N_bnd) Σ_{k=1}^{N_bnd} [ ( û_x(0, t_k^bnd; p) − ū(t_k^bnd) )² + ( û(L, t_k^bnd; p) − g(t_k^bnd) )² ].    (50)
Here û(x, t; p) is the neural network approximation to u(x, t), determined by the parameters p obtained by minimizing the loss function. Moreover, (x_i^int, t_i^int) for i = 1, . . . , N_int are the interior collocation points, x_j^init with j = 1, . . . , N_init are the points in the space domain, and t_k^bnd, k = 1, . . . , N_bnd are the points in time, corresponding to the initial and boundary conditions, respectively.
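As a consistency check of the residual term in Eq. (48), the following sketch uses analytic second derivatives of an exact standing wave in place of the automatic differentiation of the network; the wave speed c = 2 and the particular solution are illustrative choices.

```python
import numpy as np

c = 2.0   # illustrative wave speed

# The standing wave u(x, t) = sin(pi x) cos(pi c t) solves u_tt = c^2 u_xx.
# Its analytic second derivatives stand in here for the automatic
# differentiation of the network approximation û.
u_tt = lambda x, t: -(np.pi * c) ** 2 * np.sin(np.pi * x) * np.cos(np.pi * c * t)
u_xx = lambda x, t: -np.pi ** 2 * np.sin(np.pi * x) * np.cos(np.pi * c * t)

def residual_loss(x_int, t_int):
    # L_resid of Eq. (48): mean squared PDE residual at the interior points.
    r = u_tt(x_int, t_int) - c ** 2 * u_xx(x_int, t_int)
    return np.mean(r ** 2)

x_int, t_int = np.meshgrid(np.linspace(0, 1, 20), np.linspace(0, 1, 20))
loss = residual_loss(x_int.ravel(), t_int.ravel())   # zero up to round-off
```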
The approximation obtained with N_int = 1992, N_init = 199, and N_bnd = 200 collocation points, as well as the errors in the computed displacement and velocity, are shown in Figure 8. The relative errors in the L2 norm for the displacement and velocity are 1.422382 · 10⁻³ and 5.954601 · 10⁻³, respectively.
6.4. Hyperelasticity
In the context of elastostatics at finite deformation (shown in Figure 9), the equilibrium equation, along with the boundary conditions in the initial configuration, is given as:
Equilibrium: ∇ · P + f_b = 0,    (51)
Dirichlet boundary: u = ū on ∂Ω_D,    (52)
Neumann boundary: P · n = t̄ on ∂Ω_N,    (53)

in which ū and t̄ are the prescribed values on the Dirichlet and the Neumann boundary, respectively. Therein, the boundaries have to fulfill ∂Ω_D ∪ ∂Ω_N = ∂Ω and ∂Ω_D ∩ ∂Ω_N = ∅.
The outward unit normal vector is denoted by n. The 1st Piola-Kirchhoff stress tensor P is related to its power conjugate F, the so-called deformation gradient tensor, by the constitutive equation P = ∂Ψ/∂F. The deformation gradient is defined as follows:
F = Grad ϕ(X), (54)
where ϕ denotes the mapping of material points in the initial configuration to the current
configuration. It is defined as:
ϕ(X) := x = X + u.
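A minimal sketch of Eq. (54) with ϕ(X) = X + u follows; the simple-shear displacement field is an illustrative choice, not an example from the paper.

```python
import numpy as np

def deformation_gradient(grad_u):
    # F = Grad(phi) = I + Grad(u), since phi(X) = X + u(X). In DEM, grad_u
    # would come from automatic differentiation of the network displacement.
    return np.eye(3) + grad_u

# Simple shear u = (gamma * X_2, 0, 0): an illustrative displacement field.
gamma = 0.3
grad_u = np.zeros((3, 3))
grad_u[0, 1] = gamma
F = deformation_gradient(grad_u)
J = np.linalg.det(F)   # J = 1: simple shear preserves volume
```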
Since the strain energy density Ψ describing the elastic energy stored in the body is explicitly
known, it is more convenient to find the possible deformations in a form of potential energy
Figure 9: Motion of body B.
where H is the set of admissible functions (trial functions). The method is named the
principle of minimum total potential energy.
Since our objective is to minimize the potential energy with the aid of neural networks, we need to cast the minimization as an optimization problem in the context of machine learning. Therefore, the definition of a loss function is essential. In this case, we exploit the potential energy as the loss function. It reads:
L(p) = ∫_Ω Ψ(ϕ(X; p)) dV − ∫_Ω f_b · ϕ(X; p) dV − ∫_{∂Ω_N} t̄ · ϕ(X; p) dA,    (57)
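In practice the loss in Eq. (57) is approximated by averaging over integration points, as in the earlier examples; the sketch below uses made-up sample values only to illustrate the bookkeeping of the three terms.

```python
import numpy as np

def potential_energy_loss(psi_vals, body_vals, trac_vals, vol, area):
    # Riemann-sum approximation of the three integrals in the loss:
    # stored energy minus the work of body forces and boundary tractions.
    internal = vol * np.mean(psi_vals)     # approximates the Psi integral
    body_work = vol * np.mean(body_vals)   # approximates the body-force work
    trac_work = area * np.mean(trac_vals)  # approximates the traction work
    return internal - body_work - trac_work

# Made-up sample values: internal = 3, body work = 1, traction work = 1.
loss = potential_energy_loss(np.array([2.0, 4.0]), np.array([1.0, 1.0]),
                             np.array([0.5]), vol=1.0, area=2.0)   # 1.0
```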
Figure 10: Geometry of the cuboid, with L = 1.25, W = 1, H = 1, and prescribed displacements u|Γ0 and u|Γ1.
With the prescribed boundary conditions, the displacement has the form

u_p(X) = (X_1/1.25) u|_{Γ1} + (X_1 − 1.25) X_1 û(X; p).    (63)
Setup. We generate 64000 equidistant grid points (N_1 = 40, N_2 = 40, N_3 = 40) over the entire domain as the training data for our network (Figure 11). A 5-layer network (3 − 30 − 30 − 30 − 3) is constructed, with 3 neurons in the input layer for the nodal coordinates and 3 neurons in the output layer for the unconstrained solution. Moreover, 30 neurons are used in each hidden layer. We use the tanh activation function to evaluate the neuron values in the hidden layers. The learning rate r = 0.5 is used in the L-BFGS optimizer. The neural network is trained for 50 steps.
Figure 11: The training point distribution of a 3D hyperelastic cuboid in DEM. The red points correspond to the first Dirichlet boundary condition, and a second set of points is used to enforce the second Dirichlet boundary condition. The black points are used to train the surface force. The blue points are used for the interior domain integration and the body force.
Result. Since analytical solutions are not available in this example, we use a finite element program to simulate the cuboid with a fine mesh to obtain the reference solution. We
first measure the displacements and stresses along the AB line depicted in Figure 12a. The results in Figure 13 generally reveal good agreement between the DEM solution and the reference solution. It is found that the deep energy method yields good solutions in terms of displacements. However, there is a small difference in terms of stress, because the outputs of our neural network are the displacements, and the gradient fields are obtained in a post-processing step based on the displacement field.
Figure 12: Measurement locations on the cuboid: (a) the AB line, with A(0.625, 1, 0.5) and B(0.625, 0, 0.5); (b) the CDEF plane.
Figure 13: The displacement and stress results of DEM measured at the AB line compared to the reference
solution.
We observe the surface displacement and surface stress at the CDEF plane as sketched
in Figure 12b. Figure 14 shows the predicted displacement in terms of magnitude and the
predicted stress in terms of the von Mises stress. Then we compare the DEM solution and the reference solution in terms of the L2 norm and the H1 seminorm in two setups. Setup 1: we use 8000 equidistant data points (N_1 = 20, N_2 = 20, N_3 = 20) to train our network, and the network is trained for 50 steps. Setup 2: we use 64000 equidistant data points (N_1 = 40, N_2 = 40, N_3 = 40), and the network is trained for 25 steps. The L2 norm and H1 seminorm of the reference solution are given by a FEniCS program with a fine mesh as ||u||_{L2}^{FE} = 0.13275 and ||u||_{H1}^{FE} = 0.51407. As shown in Table 2, DEM obtains good results in terms of both the L2 norm and the H1 seminorm.
Figure 14: Displacement and von Mises stress at the CDEF plane in DEM of a twisted Neo-Hookean 3D cuboid.
Data                  ||u||_L2    ||u||_H1
20×20×20 (50 steps)   0.12921     0.49929
40×40×40 (25 steps)   0.13210     0.51001

Table 2: The L2 norm and H1 seminorm of the DEM solution in the two setups. The corresponding norms of the reference solution are given by a FEniCS program with a fine mesh as ||u||_{L2}^{FE} = 0.13275 and ||u||_{H1}^{FE} = 0.51407.
For the next three sections, we apply DEM to problems involving more than one field. We start with the modeling of fracture using the phase-field approach.
associated with crack formation such as stress release are incorporated into the constitutive
model. A continuous scalar parameter (φ) is used to track the fracture pattern. The cracked
region is represented by φ = 1, while the undamaged portion is given by φ = 0. The phase-field approach aims to simultaneously solve for the displacement field and the fracture region by minimizing the total potential energy of the system, as postulated by the Griffith theory for brittle fracture [31].
In this section, physics-informed neural networks are developed to solve the governing partial
differential equations for fracture analysis using DEM. The neural networks are trained
to approximate the displacement and damage fields that satisfy the governing equations
describing the physical phenomena. Modeling fracture with the phase field method involves
solving for the vector-valued elastic field, u, and the scalar-valued phase field, φ. In DEM,
the solution is obtained by minimizing the total energy of the system, E [32]. The problem
statement can be written as:
Minimize: E = Ee + Ec,
subject to: u = ū on ∂ΩD,     (64)
where Ee is the stored elastic strain energy, Ec is the fracture energy and ū is the prescribed
displacement on the Dirichlet boundary, ∂ΩD. In Eq. (64), Ee and Ec are defined as:
Ee = ∫_Ω g(φ) Ψ0(ε) dΩ,
Ec = (Gc/(2l0)) ∫_Ω ( φ² + l0² |∇φ|² ) dΩ + ∫_Ω g(φ) H(x, t) dΩ.     (65)
[35]. In particular, we set

H(x, 0) = (B Gc/(2l0)) (1 − 2 d(x, l)/l0)   if d(x, l) ≤ l0/2,
H(x, 0) = 0                                 if d(x, l) > l0/2,     (69)

where B is a scalar parameter that controls the magnitude of the scalar history field and is
calculated as:

B = 1/(1 − φ)   for φ < 1.     (70)
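The initialization in Eq. (69) is straightforward to evaluate pointwise. The sketch below is a minimal numpy version for a horizontal crack segment, where d(x, l) is taken as the distance from a point to the segment; the crack geometry and the parameter values are illustrative assumptions, not those of a specific example in the text:

```python
import numpy as np

def history_init(points, y_crack, x_max, l0, B, Gc):
    """Strain-history field H(x, 0) of Eq. (69) for a horizontal crack
    segment running from (0, y_crack) to (x_max, y_crack)."""
    x, y = points[:, 0], points[:, 1]
    # distance d(x, l) from each point to the crack segment
    dx = np.maximum(x - x_max, 0.0)  # 0 if the point projects onto the segment
    d = np.sqrt(dx**2 + (y - y_crack)**2)
    H = B * Gc / (2.0 * l0) * (1.0 - 2.0 * d / l0)
    return np.where(d <= l0 / 2.0, H, 0.0)

# a point on the crack line gets the peak value; a distant point gets 0
pts = np.array([[0.2, 0.5], [0.2, 0.9]])
H = history_init(pts, y_crack=0.5, x_max=0.5, l0=0.0125, B=1e3, Gc=2.7e-3)
print(H)
```

The field decays linearly from its peak B Gc/(2l0) on the crack line to zero at distance l0/2, which localizes the initial damage in a band of width l0.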
We shall now present the results obtained using the physics-informed neural networks to
study phase field modeling of fracture. We have used a network with 3 hidden layers of
50 neurons each and NCol = 8000 uniformly spaced points within the domain. To find the
solution of φ in the domain [0, 50], the trial solution is defined as

φ(x) = − (x(x − 50)/625) [ (x − 25) φ̂ + 1 ],     (74)

where φ̂ is given by the neural network. The trial solution for φ is chosen in such a way that it
satisfies all the boundary conditions, as in [24, 25]. This ensures that the boundary conditions
are satisfied during the training of the network. In the collocation method, the mean squared
error of the function f at the collocation points is minimized. The loss function, LCol, for
the collocation method is defined as:

LCol = (1/NCol) ∑_{i=1}^{NCol} |f(x_i)|².     (75)
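The trial solution of Eq. (74) can be checked directly: for any network output φ̂ it returns 0 at x = 0 and x = 50 and 1 at x = 25. A minimal sketch, where the stand-in for φ̂ is an arbitrary function rather than a trained network:

```python
import numpy as np

def phi_trial(x, phi_hat):
    """Trial solution of Eq. (74): phi(0) = phi(50) = 0 and phi(25) = 1
    hold by construction, for any network output phi_hat."""
    return -x * (x - 50.0) / 625.0 * ((x - 25.0) * phi_hat(x) + 1.0)

phi_hat = lambda x: np.tanh(0.1 * x)   # arbitrary stand-in for the DNN output

x = np.array([0.0, 25.0, 50.0])
print(phi_trial(x, phi_hat))           # -> [0. 1. 0.]
```

Because the boundary values are enforced by the ansatz itself, no boundary term is needed in the loss of Eq. (75).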
For optimization, we use the Adam (adaptive moment estimation) optimizer followed by a quasi-
Newton method (L-BFGS). The boundary conditions are satisfied exactly, as can be seen
Figure 15: One-dimensional phase field model using the collocation method: (a) comparison of φexact and
φcomp; (b) convergence of the loss function with the Adam and L-BFGS optimizers.
in Fig. 15(a), where φcomp represents the value of φ obtained using the neural network and
φexact is obtained using Eq. (71). Fig. 15(b) shows the convergence of the loss function, first
with the Adam optimizer and later with the L-BFGS optimizer. To measure the accuracy
of the method, the relative L2 error, L2rel, is computed using the formula:

L2rel = sqrt( ∑_{i=1}^{Npred} (φcomp − φex)² dx ) / sqrt( ∑_{i=1}^{Npred} φex² dx ).     (76)
subject to the same boundary conditions as in Eq. (73). The form of the solution used for
DEM is as stated in Eq. (74). The loss function, LEner, for DEM is defined as

LEner = (L/NInt) ∑_{i=1}^{NInt} ( φ(x_i)² + l0² |∇φ(x_i)|² ).     (78)
Fig. 16(a) presents the solution of φ obtained using DEM, while Fig. 16(b) shows the con-
vergence of the loss function. For DEM, L2rel = 2.88%.
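The loss of Eq. (78) is a pointwise estimate of the one-dimensional energy integral ∫₀ᴸ (φ² + l0²|φ′|²) dx; the Gc/(2l0) prefactor of Eq. (65) does not affect the minimizer. The sketch below evaluates it for the standard exponential crack profile φ(x) = e^(−|x−25|/l0), for which the integral is approximately 2 l0; the value l0 = 1 is illustrative and the gradient is taken analytically rather than by automatic differentiation:

```python
import numpy as np

l0, L, N = 1.0, 50.0, 8000
x = np.linspace(0.0, L, N)

# standard 1D crack profile centred at x = 25 (illustrative l0 = 1)
phi  = np.exp(-np.abs(x - 25.0) / l0)
dphi = -np.sign(x - 25.0) / l0 * phi          # analytic derivative

# pointwise estimate of the energy integral, as in Eq. (78)
L_ener = L / N * np.sum(phi**2 + l0**2 * dphi**2)
print(L_ener)   # close to the analytic value 2 * l0 = 2.0
```

In the actual method the profile is produced by the network and dphi by automatic differentiation; this check only confirms that the discrete sum reproduces the known energy of the exact solution.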
Figure 16: One-dimensional phase field model using the energy method: (a) comparison of φexact and
φcomp; (b) convergence of the loss function with the Adam and L-BFGS optimizers.
of the problem are shown in Fig. 17(a). In this example, we consider l0 = 0.0125 and
Gc = 2.7 × 10−3 kN/mm. The material properties of the plate are expressed in terms of
the Lamé constants, λ = 121.15 kN/mm² and µ = 80.77 kN/mm². The domain is subdivided
into 3 sections along the y-axis: the crack zone, and the sections below and above the crack.
The crack zone lies between [(0.5 − 2l0), (0.5 + 2l0)] on the y-axis. The training grid is
shown in Fig. 17(b). The strain-history function is used to initialize the crack in the domain
as in Eq. (67). We have used a network with 3 hidden layers of 20 neurons each and Nx × Ny
uniformly spaced points for each section in the interior of the plate, with Nx = 300 and
Ny = 81. The problem statement is defined as:
Minimize: I(φ) = ∫_Ω f(φ(x)) dx,
where f(x) = (Gc/(2l0)) ( φ(x)² + l0² |∇φ(x)|² ) + g(φ(x)) H(x, 0).     (79)
The loss function, L0, is defined as:

Atc = Abc = (0.5 − 2l0),   Ac = 4l0,

L0 = (1/(Nx × Ny)) ( Atc ∑_{i=1}^{Nx×Ny} f(x_i^tc) + Ac ∑_{i=1}^{Nx×Ny} f(x_i^c) + Abc ∑_{i=1}^{Nx×Ny} f(x_i^bc) ),     (80)

where Atc, Abc and Ac denote the areas of the sections on top and bottom of the crack and the crack
zone, respectively. x_i^tc, x_i^c and x_i^bc in Eq. (80) represent points in the three subdomains in
the interior of the plate. To compare the performance of DEM, we also initialize the crack
using the collocation method. The collocation formulation minimizes the function f, defined by

f(x) = Gc l0 Δφ(x) − (Gc/l0) φ(x) − g′(φ(x)) H(x, 0)   in Ω,
with g′(φ(x)) = −2(1 − φ(x)),     (81)
subject to homogeneous Neumann-type boundary conditions on the entire boundary, defined
as:
∇φ · n = 0 on ∂Ω. (82)
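Both the energy loss of Eq. (80) and the collocation loss below weight each strip's sample mean by its area. A quick way to sanity check this weighting is to evaluate Eq. (80) with f ≡ 1, in which case L0 must reduce to the total area Atc + Ac + Abc = 1 of the unit plate; the sketch below does exactly that, using the grid sizes quoted in the text:

```python
import numpy as np

l0 = 0.0125
Atc = Abc = 0.5 - 2.0 * l0      # areas of the strips above and below the crack zone
Ac = 4.0 * l0                   # area of the crack zone
Nx, Ny = 300, 81

def weighted_loss(f, pts_tc, pts_c, pts_bc):
    """Area-weighted loss of Eq. (80)."""
    n = Nx * Ny
    return (Atc * np.sum(f(pts_tc)) +
            Ac  * np.sum(f(pts_c)) +
            Abc * np.sum(f(pts_bc))) / n

def grid(y0, y1):
    # uniform Nx-by-Ny grid in one horizontal strip of the unit plate
    x, y = np.meshgrid(np.linspace(0, 1, Nx), np.linspace(y0, y1, Ny))
    return np.column_stack([x.ravel(), y.ravel()])

pts_tc = grid(0.5 + 2 * l0, 1.0)
pts_c  = grid(0.5 - 2 * l0, 0.5 + 2 * l0)
pts_bc = grid(0.0, 0.5 - 2 * l0)

# with f = 1 the loss reduces to the total area of the plate
print(weighted_loss(lambda p: np.ones(len(p)), pts_tc, pts_c, pts_bc))
```

The same weights reappear unchanged when f is replaced by the energy density of Eq. (79) or the squared residual of Eq. (81).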
We have used a network with 5 hidden layers of 20 neurons each, Nx × Ny uniformly
spaced points for each section in the interior of the plate, and NBound uniformly spaced points
on each edge of the plate, with Nx = 300, Ny = 81 and NBound = 200. The loss function,
LCol, for the collocation method is defined as:
LCol = (1/(Nx × Ny)) ( Atc ∑_{i=1}^{Nx×Ny} |f(x_i^tc)|² + Ac ∑_{i=1}^{Nx×Ny} |f(x_i^c)|² + Abc ∑_{i=1}^{Nx×Ny} |f(x_i^bc)|² )
     + (1/NBound) ∑_{i=1}^{NBound} ( (∂φ(x_i^L)/∂x)² + (∂φ(x_i^R)/∂x)² + (∂φ(x_i^T)/∂y)² + (∂φ(x_i^B)/∂y)² ),     (83)
where x_i^L, x_i^R, x_i^T and x_i^B are the points on the left, right, top and bottom edges of the plate,
respectively. Fig. 18(a) and (b) show the initial crack in the plate using DEM and DCM, re-
spectively. Fig. 18(c) and (d) present the one-dimensional approximation plots, and Fig. 18(e)
and (f) show the convergence of the loss function for both methods.
In subsubsection 6.5.1, we observed that DEM performs better than DCM for the same
network parameters. From the results of the initialization of the crack, we can conclude that
DEM requires a smaller network to predict results accurately. Moreover, DCM needs an
explicit treatment of the Neumann boundary conditions even for traction-free boundaries,
which requires the use of more collocation points. The convergence of its loss function is also
slow, requiring 10 times more iterations. Hence, we use DEM to study the growth of the
crack in a plate under tensile loading.
To observe the propagation of the crack in the plate, we simultaneously solve the elastic field
and the phase field using a monolithic solver. The minimization problem reads as stated in
Eq. (65). The Dirichlet boundary conditions are:
u(0, y) = v(x, 0) = 0, v(x, 1) = ∆v, (84)
where u and v are the elastic field solutions in the x and y directions. The computation is
performed by applying constant displacement increments of ∆v = 1 × 10−3 mm. The model
for the Dirichlet problem is:

u = [x(1 − x)] û,
v = [y(y − 1)] v̂ + y ∆v,     (85)

where û and v̂ are obtained from the neural network. We have used a network with 5 hidden
layers of 50 neurons each. The loss function, LE, is defined as:
LE = LElas + LPF,

LElas = (1/(Nx × Ny)) ( Atc ∑_{i=1}^{Nx×Ny} fe(x_i^tc) + Ac ∑_{i=1}^{Nx×Ny} fe(x_i^c) + Abc ∑_{i=1}^{Nx×Ny} fe(x_i^bc) ),

LPF = (1/(Nx × Ny)) ( Atc ∑_{i=1}^{Nx×Ny} fc(x_i^tc) + Ac ∑_{i=1}^{Nx×Ny} fc(x_i^c) + Abc ∑_{i=1}^{Nx×Ny} fc(x_i^bc) ),     (86)

fe(x) = g(φ(x)) Ψ0(ε(x)),
fc(x) = (Gc/(2l0)) ( φ(x)² + l0² |∇φ(x)|² ) + g(φ(x)) H(x, t),
where LElas and LPF define the losses in the elastic strain energy and the fracture energy,
respectively. The scatter plots of the deformed configuration and the corresponding phase
field plots for the simulation are shown in Fig. 19 and Fig. 20, respectively.
In the next section, we apply DEM to an electro-mechanically coupled system.
6.6. Piezoelectricity
Electro-mechanical coupling arises in some crystal materials, where an electrical
polarization, P, is induced by a mechanical strain, or where an electric field can cause
deformation of the geometry. The linear dependence between the electric polarization and
the associated mechanical strain is referred to as the piezoelectric effect,
(a) Initial crack using DEM. (b) Initial crack using DCM.
(c) Comparison of φexact and φcomp using DEM. (d) Comparison of φexact and φcomp using DCM.
Figure 18: Initialization of crack in the plate using the strain-history function.
Figure 19: Scatter plots of the deformed configuration for prescribed displacement of (a) 1 × 10−3 , (b)
2 × 10−3 , (c) 3 × 10−3 , (d) 4 × 10−3 and (e) 5 × 10−3 .
Figure 20: Crack pattern for prescribed displacement of (a) 1 × 10−3 , (b) 2 × 10−3 , (c) 3 × 10−3 , (d) 4 × 10−3
and (e) 5 × 10−3 .
which is common in non-centrosymmetric crystals. The direct and the indirect piezoelectric
effects can be expressed in mathematical form as

P = p : ε,
σ = p · E,     (87)

where p refers to the third-order piezoelectric tensor, ε and σ are the second-order strain and
stress tensors, and E is the electric field vector with Ei = −θ,i, where θ is the electric potential.
It can be concluded that piezoelectricity is a cross coupling between the elastic and the
dielectric variables. Hence, the constitutive equations that identify the coupling between the
mechanical stress and strain and the related electric field and displacement are given by

D = e : ε + κ · E,
σ = C : ε − e · E,     (88)

where D refers to the electric displacement that causes the electrical polarization, while
e, κ and C are the piezoelectric, the dielectric and the elastic tensors, respectively.
Energy minimization is adopted in the DEM to obtain the solution of the field variables.
The problem statement reads as
Minimize: E = Ei + Wext,
subject to: u = ū on ∂ΩDu,
and θ = θ̄ on ∂ΩDθ,     (89)
and σ · n = t̄N on ∂ΩN,

where E represents the total energy of the system, Ei is the bulk internal energy, whose
density is Ψ, and Wext stands for the work of external forces. The internal energy density,
Ψ, is expressed as

Ψ = ½ ε : C : ε − E · e : ε − ½ E · κ · E.     (90)
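In Voigt notation, the standard electric-enthalpy form Ψ = ½ ε:C:ε − E·e:ε − ½ E·κ·E reduces to a few matrix products. The sketch below evaluates it for a 2D state with illustrative (not physical) tensor values; here C is a 3×3 elasticity matrix, the piezoelectric matrix e is 2×3 and the dielectric matrix kappa is 2×2:

```python
import numpy as np

def enthalpy_density(eps, E, C, e, kappa):
    """Electric enthalpy density of Eq. (90) in Voigt notation:
    Psi = 1/2 eps:C:eps - E.e:eps - 1/2 E.kappa.E."""
    return (0.5 * eps @ C @ eps
            - E @ e @ eps
            - 0.5 * E @ kappa @ E)

# illustrative 2D values (Voigt strain order [xx, yy, xy])
C = np.array([[2.0, 0.5, 0.0],
              [0.5, 2.0, 0.0],
              [0.0, 0.0, 1.0]])
e = np.array([[0.0, 0.0, 0.1],
              [0.2, 0.3, 0.0]])
kappa = np.eye(2) * 0.05

eps = np.array([1e-3, -2e-4, 5e-4])
E = np.array([0.0, 1.0])
print(enthalpy_density(eps, E, C, e, kappa))
```

In the DEM, this scalar density is evaluated at the integration points and summed to form the internal-energy part of the loss, with eps and E obtained from derivatives of the network outputs.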
We shall now present the example of a cantilever beam to show the application of DEM to
solve PDEs involving electromechanical coupling.
Figure 21: Schematic diagram showing the electrical and the mechanical boundary conditions for the can-
tilever nanobeam under study.
Figure 22: The predicted values of three outputs of the system using DEM: (a) applying mechanical loading,
and (b) applying electrical loading.
In the case of pure mechanical loading, a point load is applied as shown in Fig. 21. Due to
the electro-mechanical coupling, an electric potential, θ, is induced by the mechanical load.
Assuming a closed-circuit configuration, θ is fixed to zero on the bottom surface, while the
electrode placed on the top surface undergoes a difference in electric potential as a result of
the deformation. Fig. 22a shows the plots of the deformed shape and the electric potential
obtained using the DEM approach.
In the next step, we consider pure electrical loading. In this case, the electric potential
on the top surface is fixed to V = −100 MV. The deformed configuration is presented in
Fig. 22b. Both cases considered in this example demonstrate the ability of the DEM to
solve electro-mechanically coupled systems without the aid of classical numerical methods.
All the examples discussed previously involved second-order PDEs. To show the robustness
of DEM, we take up the study of a fourth-order PDE in the next section.
continuity poses significant challenges. This, however, can be readily handled by the pro-
posed deep collocation and deep energy methods, with a set of DNNs approximating the
transversal deflection, which prove effective in the bending analysis of Kirchhoff plates
of various geometries, even with cut-outs.
The governing equation of the Kirchhoff bending problem, expressed in terms of the transversal
deflection, is

∇²∇²w = ∇⁴w = p/D,     (91)

where ∇⁴(·) = ∂⁴/∂x⁴ + 2 ∂⁴/∂x²∂y² + ∂⁴/∂y⁴ is commonly referred to as the biharmonic
operator. Similar to the previous examples, the problem statement in strong form is equivalent
to the minimization of the total potential energy of the system, E, where
E = ∫∫_Ω ( ½ kᵀ M − q w ) dΩ − ∫_{S3} V̄n w dS + ∫_{S2+S3} M̄n (∂w/∂n) dS     (92)

under a uniformly distributed load. The problem is stated as:
w = argmin_{v ∈ H²(D)} E(v).     (93)
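Eq. (91) can be checked numerically: for w(x, y) = sin(πx) sin(πy) on the unit square, ∇⁴w = 4π⁴ w. The sketch below verifies this by applying the 5-point finite-difference Laplacian twice; it is a standalone check of the operator, not of the neural network:

```python
import numpy as np

n = 201
h = 1.0 / (n - 1)
x = np.linspace(0.0, 1.0, n)
X, Y = np.meshgrid(x, x, indexing="ij")
w = np.sin(np.pi * X) * np.sin(np.pi * Y)

def laplacian(u):
    """5-point finite-difference Laplacian on the interior of the grid."""
    return (u[2:, 1:-1] + u[:-2, 1:-1] + u[1:-1, 2:] + u[1:-1, :-2]
            - 4.0 * u[1:-1, 1:-1]) / h**2

biharm = laplacian(laplacian(w))            # biharmonic = Laplacian applied twice
expected = 4.0 * np.pi**4 * w[2:-2, 2:-2]   # analytic value of the biharmonic of w
err = np.max(np.abs(biharm - expected)) / np.max(np.abs(expected))
print(err)   # small O(h^2) discretization error
```

In the deep collocation method the fourth derivatives entering G(x; p) are instead obtained by automatic differentiation of the network output, which avoids any stencil error.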
Additionally, the boundary conditions of the Kirchhoff plate taken into consideration can be
classified into three types: simply-supported, clamped and free boundary conditions,
expressed as

∂Ω = Γ1 + Γ2 + Γ3.     (94)
To solve Kirchhoff bending problems with boundary constraints, the physical domain is
first discretized with randomly distributed collocation points denoted by x_Ω = (x1, . . . , xNΩ)ᵀ.
Another set of collocation points, denoted by x_Γ = (x1, . . . , xNΓ)ᵀ, is added to discretize the boundaries.
The transversal deflection, w, can thus be approximated with the aforementioned deep feed-
forward neural network w^h(x; p). The loss function is constructed to find the approximate
solution by minimizing either the mean squared error of the governing equation in residual
form or the total energy, along with the boundary conditions approximated by w^h(x; p).
Substituting w^h(x_Ω; p) into either Equation 91 or Equation 92, a physics-informed deep neural
network is obtained: G(x_Ω; p) for the strong form of the problem and Ep(x_Ω; p) for the energy
approach, respectively.
The boundary conditions can also be expressed by the physics-informed neural network
approximation w^h(x_Γ; p). On clamped boundaries Γ1, we have

w^h(x_Γ1; p) = w̃,   ∂w^h(x_Γ1; p)/∂n = p̃n.     (95)

On simply-supported boundaries Γ2,

w^h(x_Γ2; p) = w̃,   Mn(x_Γ2; p) = M̃n.     (96)

On free boundaries Γ3,

Mn(x_Γ3; p) = M̃n,   ∂Mns(x_Γ3; p)/∂s + Qn(x_Γ3; p) = q̃.     (97)
It should be noted that n and s here refer to the normal and tangent directions along each
boundary. It is clear that all the induced physics-informed neural networks in the Kirchhoff
bending analysis, G(x; p), E(x_Ω; p), Mn(x; p), Mns(x; p) and Qn(x; p), share the same
parameters as w^h(x; p). Considering the generated collocation points in the domain and
on the boundaries, they can all be learned by minimizing the mean squared error loss function:

L(p) = MSE = MSE_G + MSE_Γ1 + MSE_Γ2 + MSE_Γ3,     (98)
with

MSE_G = (1/NΩ) ∑_{i=1}^{NΩ} |G(x_Ω; p)|² = (1/NΩ) ∑_{i=1}^{NΩ} |∇⁴w^h(x_Ω; p) − p/D|²,

MSE_Γ1 = (1/NΓ1) ∑_{i=1}^{NΓ1} |w^h(x_Γ1; p) − w̃|² + (1/NΓ1) ∑_{i=1}^{NΓ1} |∂w^h(x_Γ1; p)/∂n − p̃n|²,

MSE_Γ2 = (1/NΓ2) ∑_{i=1}^{NΓ2} |w^h(x_Γ2; p) − w̃|² + (1/NΓ2) ∑_{i=1}^{NΓ2} |Mn(x_Γ2; p) − M̃n|²,

MSE_Γ3 = (1/NΓ3) ∑_{i=1}^{NΓ3} |Mn(x_Γ3; p) − M̃n|² + (1/NΓ3) ∑_{i=1}^{NΓ3} |∂Mns(x_Γ3; p)/∂s + Qn(x_Γ3; p) − q̃|²,     (99)
where x_Ω ∈ R^N and p ∈ R^K are the neural network parameters. If L(p) = 0, w^h(x; p) is
a solution for the transversal deflection. Our goal is to find a set of parameters p such that
the approximated deflection w^h(x; p) minimizes the loss L(p); if L(p) is very small, the
approximation w^h(x; p) very closely satisfies the governing equations and boundary
conditions, namely

p = argmin_{p̂ ∈ R^K} L(p̂).     (100)
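In practice, the parameter search of Eq. (100) is carried out by a gradient-based optimizer such as Adam or L-BFGS. As a toy stand-in, the sketch below runs plain gradient descent on a quadratic loss L(p) = ||Ap − b||², whose minimizer is known in closed form; A and b are arbitrary illustrative data, not quantities from the plate problem:

```python
import numpy as np

# toy least-squares loss L(p) = ||A p - b||^2 with known minimizer p* = [1, 1]
A = np.array([[1.0, 0.0],
              [0.0, 2.0],
              [1.0, 1.0]])
b = np.array([1.0, 2.0, 2.0])

p = np.zeros(2)
lr = 0.05
for _ in range(1000):
    grad = 2.0 * A.T @ (A @ p - b)   # analytic gradient of L(p)
    p -= lr * grad

print(p)   # converges to the closed-form argmin [1, 1]
```

For the actual networks the gradient with respect to p is supplied by automatic differentiation, but the structure of the iteration is the same.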
However, in order to solve the problem of a thin plate with the deep energy method, it is
necessary to construct the loss function in a different way:
Figure 23: (a) Collocation points in the square domain. (b) The relative error of transversal deflection
(×10⁻²) for one, two and three hidden layers with varying neurons per hidden layer.
For this numerical example, the arrangement of collocation points is depicted in Fig-
ure 23(a). Neural networks with different numbers of neurons and depths are applied in the
calculation to study the feasibility of deep collocation in solving Kirchhoff plate bending
problems. Table 1 lists the maximum deflection at the central point with varying hidden
layers and neurons. The results predicted with more hidden layers are clearly more accurate:
as the number of hidden layers and neurons grows, the maximum deflection approaches the
analytical series solution, even with only two hidden layers. The relative error shown in
Figure 23(b) better depicts the advantage of deep neural networks over shallow wide ones:
more hidden layers with more neurons flatten the relative error. Various contour plots are
shown in Figure 24 and compared with the analytical solution. It is
Table 1: Maximum deflection predicted by deep collocation method.
obvious that the deep collocation method predicts the deflection of the whole plate in good
agreement with the analytical solution.
Figure 24: (a) Predicted deflection contour, (b) deflection error contour, (c) predicted deflection and (d) analytical
deflection of the simply-supported plate on a Winkler foundation, with 3 hidden layers and 50 neurons.
Figure 25: (a) Collocation points in the circular domain. (b) The relative error of transversal deflection
(×10⁻⁴) for one, two and three hidden layers with varying neurons per hidden layer.
Table 2: Maximum deflection predicted by the deep collocation method for the clamped circular plate
(exact solution: 15.6250).

Clamped Circular Plate         Predicted Maximum Deflection
1 hidden layer,  30 neurons    15.5958
1 hidden layer,  40 neurons    15.5685
1 hidden layer,  50 neurons    15.6201
2 hidden layers, 30 neurons    15.6251
2 hidden layers, 40 neurons    15.6264
2 hidden layers, 50 neurons    15.6224
3 hidden layers, 30 neurons    15.6269
3 hidden layers, 40 neurons    15.6247
3 hidden layers, 50 neurons    15.6229
Figure 26: (a) Predicted deflection contour, (b) deflection error contour, (c) predicted deflection and (d) exact
deflection of the clamped circular plate with 3 hidden layers and 50 neurons.
where a and b are the side lengths of the plate. The analytical solution for this problem is given by

w = ( p0 / (π⁴ D (1/a² + 1/b²)²) ) sin(πx/a) sin(πy/b).     (106)
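The Navier solution above is easy to evaluate numerically for reference. The sketch below computes the peak deflection at the plate centre for the illustrative values a = b = 1, D = 1 and p0 = 1, where the formula reduces to w_max = 1/(4π⁴):

```python
import numpy as np

def w_navier(x, y, a=1.0, b=1.0, D=1.0, p0=1.0):
    """Analytical deflection of Eq. (106) for a simply-supported rectangular
    plate under the sinusoidal load p = p0 sin(pi x / a) sin(pi y / b)."""
    return (p0 / (np.pi**4 * D * (1.0 / a**2 + 1.0 / b**2) ** 2)
            * np.sin(np.pi * x / a) * np.sin(np.pi * y / b))

w_max = w_navier(0.5, 0.5)             # peak deflection at the plate centre
print(w_max, 1.0 / (4.0 * np.pi**4))   # the two values coincide
```

The deflection vanishes on the boundary by construction, which is how the reference errors of the simply-supported plate are evaluated against the network prediction.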
For the detailed configuration of this numerical example, a 31×31 randomly distributed
collocation point set, shown in Figure 23, is first generated in the plate domain. Based on this,
the Monte Carlo integration scheme is adopted in the calculation of the total potential energy.
For this problem with a sinusoidal load, the activation function was tailored accordingly as
f(x) = sin(πx²), which improves the accuracy of the results. A neural network with 50
neurons per hidden layer is deployed to study the relative error of the maximum deflection at
the central point and of the whole deflection field, using the proposed activation function and
the classical Tanh activation function. For the DNN, three hidden layers are deployed. For the
DNN with an autoencoder, two encoding layer configurations, [2, 1] ∗ H and [3] ∗ H, are
considered, with H the number of neurons in each encoding layer. It is clearly shown in
Figure 27 and Figure 28 that the numerical results obtained with the proposed activation
function outperform those of the classical Tanh activation function for both neural network schemes.
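The Monte Carlo scheme mentioned above approximates an integral over the plate by the domain area times the mean of the integrand at random points. As a self-contained check, the sketch below integrates the load term ∫∫ p0 sin(πx/a) sin(πy/b) dΩ over a unit plate, whose exact value is 4 p0 ab/π²; the sample size is an illustrative choice, much larger than the 31×31 set used in the text:

```python
import numpy as np

rng = np.random.default_rng(42)
a = b = 1.0
p0 = 1.0
n = 200_000

# uniform random collocation points in the plate domain
x = rng.uniform(0.0, a, n)
y = rng.uniform(0.0, b, n)

# Monte Carlo estimate: area times the mean of the integrand
integral = a * b * np.mean(p0 * np.sin(np.pi * x / a) * np.sin(np.pi * y / b))
exact = 4.0 * p0 * a * b / np.pi**2
print(integral, exact)   # the estimate approaches 4/pi^2 ≈ 0.405
```

In the deep energy method, the same area-times-mean estimator is applied to the strain energy density evaluated on the network output, so the loss inherits the O(n^(-1/2)) Monte Carlo error.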
Moreover, we carry on studying an accurate and efficient configuration of the proposed
DNN with an autoencoder, in the hope of better capturing the physical patterns of the
problem. For this numerical example, ten encoding layer types are studied, as shown in
Figure 29. We begin by studying the influence of different encoding layers with different
numbers of neurons per layer on the accuracy and efficiency for this problem. We can see
that more encoding layers and more neurons per layer yield more stable and accurate results.
Moreover, the corresponding computational costs of those ten schemes are listed in Figure 30.
It is clear that a two encoding layer autoencoder can meet both accuracy and efficiency ends.
As the number of neurons in each scheme increases, the numerical solution clearly converges
to the analytical solution.
Finally, the deep energy scheme is compared with an open-source IGA-FEM code [37] on this
problem, which is also based on the Kirchhoff plate theory. The accuracy and computational
time of both schemes are compared. For DEM, a single encoding layer with 30 neurons is
applied with the DNN, with increasing numbers of collocation points; the computational
time reported for DEM is the training time. IGA-FEM is clearly faster and more accurate,
but the performance of DEM is still within an acceptable range. Once the DNN with an
autoencoder is trained, it can be used very quickly to predict the deflection and stress in
the whole plate.
For the deep energy method configuration, an autoencoder is added to the deep neural net-
works to better capture the physical patterns of this problem. We also studied the optimal
configuration of this autoencoder with different layers and neuron numbers, which will be
beneficial to further applications of this deep energy method.
To better depict the deflection over the whole physical domain, the contour plot and the
contour error plot of the deflection predicted by two encoding layers with [150, 100] neurons
are shown in Figure 32, which agree well with the deflection obtained from the analytical
solution.
Figure 27: Relative error of (a) maximum deflection (×10⁻⁵) and (b) whole deflection (×10⁻³) predicted by
the Tanh and the proposed sin(πx²) activation functions of a DNN for the simply-supported plate.
Figure 28: Relative error of (a) maximum deflection and (b) whole deflection (×10⁻³) predicted by the Tanh
and the proposed sin(πx²) activation functions of a DNN with an autoencoder for the simply-supported plate.
Figure 29: Relative error of (a) maximum transversal deflection (×10⁻⁵) and (b) whole transversal deflection
(×10⁻³) predicted by different encoding layer configurations of a DNN with an autoencoder for the simply-
supported plate: one encoding layer with [1], [2] or [3] hidden layer neurons; two encoding layers with [2 1],
[3 1] or [3 2]; and three encoding layers with [3 1 1], [3 2 1], [3 2 2] or [3 3 2].