You're reading the documentation for a development version. For the latest stable documentation, please have a look at v5.1.X.

Neural Network Binding¶

The Neural Network (NN) binding model is a non-parametric, data-driven binding model that represents the equilibrium relation between pore-phase and solid-phase concentration using a trained feedforward neural network. The model predicts an equilibrium loading from a neural network trained on experimental or simulated adsorption data.

For each component \(i\) and bound state \(m\), an equilibrium loading

\[c^{s,\ast}_{i,m} = f_{i,m}(c^p)\]

is constructed from user-provided trained neural network weights and biases. The neural network is a feedforward architecture with exponential linear unit (ELU) activations and an arbitrary number of hidden layers \(N \geq 1\).

Network Architecture¶

The neural network predictor is defined by the following recurrence over \(N\) hidden layers:

\[\begin{split}h_0 &= x \\ h_l &= \text{ELU}(W_l \cdot h_{l-1} + b_l), \quad l = 1, \ldots, N \\ f(x) &= W_{N+1} \cdot h_N + b_{N+1}\end{split}\]

where \(W_l\) denote weight matrices, \(b_l\) denote bias vectors, and \(x\) is the input vector. The output layer applies no activation. The special cases are:

\[\begin{split}N=1: \quad f(x) &= W_2 \cdot \text{ELU}(W_1 \cdot x + b_1) + b_2 \\ N=2: \quad f(x) &= W_3 \cdot \text{ELU}(W_2 \cdot \text{ELU}(W_1 \cdot x + b_1) + b_2) + b_3\end{split}\]

The exponential linear unit (ELU) activation function is defined as:

\[\begin{split}\text{ELU}(z) = \begin{cases} z & \text{if } z \geq 0 \\ e^z - 1 & \text{if } z < 0 \end{cases}\end{split}\]

The input to the neural network is the normalized pore-phase concentration vector:

\[x = c^p \odot \alpha_{\text{norm}}\]

where \(\alpha_{\text{norm}}\) is the normalization factor specified by NORM_FACTOR, and \(\odot\) denotes element-wise multiplication.

The network output is scaled by a porosity factor and shifted by an offset:

\[c^{s,\ast}(c^p) = \beta_{\text{poros}} \cdot \left(f(c^p \odot \alpha_{\text{norm}}) - f(0)\right)\]

where \(\beta_{\text{poros}}\) is specified by POROSITY_FACTOR. The offset \(f(0)\) is computed once during configuration to ensure that the predicted loading is zero when the pore-phase concentration is zero (enforcing physically valid boundary conditions).

Normalization¶

The normalization factor \(\alpha_{\text{norm}}\) scales the liquid-phase concentrations before they are passed to the ANN. The ANN is trained and evaluated on these normalized concentrations rather than on the raw concentrations. Normalization factors should be chosen such that the normalized ANN inputs are typically of order 1. This generally improves training stability, prediction accuracy, and robustness of the nonlinear solver. Values that are excessively large or small should be avoided.

With mechanistic knowledge, it may be practical to choose normalization factors that reflect characteristic concentration scales of the respective bound state (e.g., equilibrium constants, affinity parameters, or other physically meaningful scaling factors). Without mechanistic knowledge, it may be practical to choose normalization factors purely for numerical conditioning. A common choice is to scale each input component to values of order unity, for example \(\alpha_{\text{norm}} = max(c^p)\)

Porosity factor¶

The porosity scaling factor \(\beta_{\text{poros}}\) scales this prediction to the physical solid-phase loading used by the binding model. Since equilibrium loadings are often reported on different reference volumes or masses, \(\beta_{\text{poros}}\) is intended to be used as a simple correction factor to account for differences in porosity or unit conversions between the training data and the simulation conditions.

Kinetic Form¶

The model is used in a kinetic linear-driving-force form. For each component \(i\) and bound state \(m\), the exchange term is based on the deviation of the current solid-phase loading \(c^s_{i,m}\) from the neural network-predicted equilibrium loading \(c^{s,\ast}_{i,m}\):

\[\frac{\partial c^s_{i,m}}{\partial t} = k^{\mathrm{kin}}_{i,m}\left(c^{s,\ast}_{i,m}(c^p) - c^s_{i,m}\right).\]

Equivalently, in residual form the implementation evaluates

\[r_{i,m} = k^{\mathrm{kin}}_{i,m}\left(c^s_{i,m} - c^{s,\ast}_{i,m}(c^p)\right).\]

Thus, the neural network provides the equilibrium target, while the kinetic constant \(k^{\mathrm{kin}}_{i,m}\) controls how fast the equilibrium state is approached. The kinetic parameter is configured through NN_KKIN.

Jacobian Computation¶

An analytical Jacobian is implemented. Denoting the pre-activation at hidden layer \(l\) as \(z_l = W_l \cdot h_{l-1} + b_l\), the gradient of the network output with respect to the input is given by the backpropagation chain rule:

\[\frac{\partial f}{\partial x} = W_1^\top \cdot \left[ \prod_{l=1}^{N} \text{diag}\!\left(\text{ELU}'(z_l)\right) \cdot W_{l+1}^\top \right]\]

where the product is taken left-to-right with increasing \(l\), and the ELU derivative is:

\[\begin{split}\text{ELU}'(z) = \begin{cases} 1 & \text{if } z \geq 0 \\ e^z & \text{if } z < 0 \end{cases}\end{split}\]

The chain rule accounts for input normalization and output scaling when computing the full Jacobian contribution to the binding residual.

For more information on model parameters required to define in CADET file format, see Neural Network Binding.

CADET-Core

Navigation

Releases

Branches