Integrating physical modeling with artificial intelligence for predicting fish survival zones in polluted rivers to maintain a sustainable aquaculture industry

Water quality prediction and management are crucial for ensuring the sustainability of water supplies. Contaminated water can harm humans and aquatic life. As the demand for seafood grows, the aquaculture industry faces several obstacles, including disease management, feeding optimization, water quality monitoring, and aquaculture area extraction. Recently, aquaculture systems have increasingly used AI techniques to successfully and sustainably handle these issues. However, traditional AI techniques such as random forest (RF) and multi-layer perceptron (MLP) among others frequently face data scarcity and poor physical consistency. This research bridges this gap by integrating physical sciences with AI algorithms through the solution of the two coupled pollution–aeration equations to generate a high-fidelity physics-derived dataset of 50,000 observations over an extended spatial domain ranging from 0 to 4. This dataset is then used to train a novel hybrid RF–MLP algorithm to identify fish-survival zones within a polluted river at a given time, while determining the minimum allowable water velocity and the upstream dissolved oxygen level required to maintain environmentally safe conditions along the entire river reach. The proposed algorithm employs a three-stage sequential residual learning logic, combining RF’s stable feature partitioning with MLP’s improved non-linear error correction. The algorithm’s performance was benchmarked against nine standalone AI algorithms using a comprehensive suite of metrics. The experiments demonstrated exceptional precision with a Correlation Coefficient (CC) of 0.9999999973, a Scatter Index (SI) of 0.00007326, a Willmott’s Index (WI) of 0.9999999986, a Test RMSE of 0.00012966, and a 0.9999999692. Beyond accuracy, the hybrid algorithm demonstrated superior computational efficiency, training in just 22.58 s—a 24.45-fold reduction compared to BiLSTM architectures. These results provide a robust tool for decision-makers to identify optimal river reaches for fish farms based on minimum water velocity and permissible dissolved oxygen transfer levels, bridging the gap between theoretical physics and industrial aquaculture management.

Aquaculture industry provides an abundance supply of fish resources for humanity. The aquaculture industry faces numerous challenges; particularly water quality (WQ) monitoring. Monitoring and predicting water quality is crucial for identifying pollution and environmental concerns, as well as for making educated decisions and managing water resources sustainably1. The discharge of industrial waste into water resources, such as rivers and lakes, has increased as industrial processes have expanded and innovated. Such discharge leads to the deterioration of water quality, which directly affects human lives, fish survival, and environmental health in general2. As the demand for fish as a food source increased, the aquaculture environment steadily developed to provide fish for human consumption. Every year, there are tons of fish mortalities due to water contamination. Fish mortality is an essential production and welfare metric in the aquaculture industry. To address the issue of fish mortality, water quality (WQ) monitoring is critical to make aquaculture production profitable and sustainable3,4. Several factors influence WQ, including fish density, feed quality, feeding interval, climate, and water quality parameters. Water temperature (WT), free ammonia (AMM), fecal coliform (TC), fecal coliform (FC), conductivity (COND), biochemical oxygen demand, water acidity/base level (pH), and dissolved oxygen (DO), among others, are critical water quality parameters in aquaculture. Meteorological conditions and the intricate interdependence relationships between different water quality parameters all have an impact on aquaculture water quality. As a result, changes in water quality parameters exhibit nonlinear properties, resulting in low prediction accuracy. According to5,6, accurate dissolved oxygen (DO) concentration prediction is an important aspect of water quality monitoring and assessment, as it provides essential information about the chemical, physical, and biological characteristics of water bodies, which is critical for ensuring the healthy growth of aquatic species. An adequate DO concentration (typically above 5.0 mg/L for most freshwater fish species) can support normal growth and development, whereas low DO concentrations (typically below 3.0 mg/L) may inhibit biological activity, leading to significant economic losses and even fish mortality7. Scholars worldwide have employed several methods, including the Markov model, Grey model, support vector regression (SVR), autoregressive integrated moving average (ARIMA), and seasonal autoregressive integrated moving average model (SARIMA), among others, to estimate DO concentrations8,9,10. Because the water quality parameters are nonlinear, dynamic, variable, and complicated, these techniques often struggle to capture the complex and nonlinear hydroclimatic processes and nonstationary patterns that dynamically vary over space and time11,12. These issues highlight the need for innovative technologies such as artificial intelligence (AI), including machine learning and deep learning, to improve the accuracy and interpretability of water quality prediction, as well as optimize commercial aquaculture practices, efficiency, productivity, and profitability, which is the primary goal of this research.

As the demand for fish as a food source grew, the aquaculture environment evolved to offer fish for human consumption. The aquaculture business has various obstacles, most notably water quality (WQ) monitoring. To ensure sustainability while optimizing efficiency, productivity, and profitability, the aquaculture industry has turned to AI techniques such as random forest (RF), multi-layer perceptron (MLP), Support Vector Machine (SVM), and Long short-term memory (LSTM), among others, as an aid in water quality monitoring and prediction13,14,15,16,17,18,19,20,21. Because AI techniques can efficiently solve complicated nonlinear problems, particularly those related to water quality monitoring and prediction, they have recently become a global trend in sustainable development research22. However, most AI algorithms require massive amounts of datasets to achieve higher accuracies, which are determined using costly and time-consuming lab and statistical analyses that require sample collection, transportation to labs, and a significant amount of time and calculation, all of which are inefficient. Integrating physical sciences with advanced AI algorithms has recently emerged as an effective way to address the training data shortage, increase model generalizability, and provide a robust solution for predicting water pollution concentration and management23,24,25,26,27,28,29. Physics-based knowledge is often presented in two ways: One is to use physics-based equations that typically establish a precise relationship between certain inputs and outputs. Another one is to employ numerical models used to simulate complex systems. Motivated by these considerations, we propose a new physics-guided hybrid AI algorithm that combines physics-based models, such as the two coupled pollution-aeration equations (PAEs), with AI algorithms for monitoring suitable areas for fish survival in a polluted river at a particular time along the river with the minimum permissible water velocity and the minimum permissible oxygen transfer from air to water. The advection-diffusion equation (ADE) is commonly used for predicting water pollution concentrations. The aeration equation, which is made up of two coupled non-linear partial differential equations (PDEs) and their associated conditions, is a strong instrument for imposing constraints on urban and farming practices. The coupling happens when oxygen combines with pollutants produced in industrial and agricultural areas, resulting in innocuous chemicals. The pollution-aeration model’s analytical and numerical solutions allow for the assessment of fish survival scenarios, which typically demand a dissolved oxygen concentration greater than 30% of saturated dissolved oxygen concentration30.

The primary contributions of this research are summarized as follows:

Advancement of Physics-AI Integration in Hydrology: Expanding on recent research by23,24,25,26,27,28,29, this study establishes a synergistic relationship between relevant physical sciences and AI. The methodology serves as a reproducible framework for water research, empowering researchers to bridge the gap between deterministic physical models and data-driven architectures in diverse environmental contexts.

Novel Heterogeneous Boosting physics-guided hybrid (RF-MLP) Algorithm: A novel physics-guided hybrid RF-MLP algorithm that integrates a physics-based model with AI algorithms to improve performance and accuracy for predicting both pollution concentration and dissolved oxygen, particularly when data scarcity and quality are the key factors. The proposed algorithm employs a three-stage sequential residual learning logic, combining RF’s stable feature partitioning with MLP’s improved non-linear error correction. Unlike existing hybrid AI implementations that rely on meta-heuristic optimization or simple ensemble, the novelty of the proposed RF-MLP lies in its heterogeneous boosting architecture. It specifically utilizes a sequential residual learning logic to bridge the gap between discrete tree-based partitioning and continuous physical transport phenomena.

Synthetic Data Generation via Coupled Equations: To address the common challenge of data scarcity and the high cost of field collection in water research, this study utilizes the solution of two coupled pollution and aeration equations to generate a high-fidelity dataset of 50,000 spatial observations. This approach ensures physical consistency and provides a robust foundation for training complex machine learning algorithms over an extended spatial domain ranging from 0 to 4.

Restoration of Physical Gradients: The algorithm successfully resolves the “step-like” artifacts typical of tree-based models. By using the MLP as a universal function approximator for residuals, the hybrid algorithm restores the smooth exponential curvature and continuous gradients required for physical fidelity to the coupled transport process.

Computational Performance Optimization: The research demonstrates a superior performance-per-second ratio. The proposed hybrid algorithm achieves an order-of-magnitude reduction in RMSE compared to deep learning peers like GRU and LSTM while requiring significantly less training time (22.58 s compared to 552.21 s for BiLSTM, representing an approximately 24.45-fold reduction in training duration).

Empirical Validation of Predictive Precision: Through exhaustive comparative analysis, the proposed hybrid algorithm achieved near-perfect metrics, including a Test \(\:{R}^{2}\) of 0.9999999692, a Correlation Coefficient (CC) of 0.9999999973, and a Willmott’s Index of Agreement (WI) of 0.9999999986, outperforming nine individual AI algorithms.

Sustainable Development Goals (SDG): It outperforms nine individual AI algorithms in terms of both predictive outcomes and training efficiency, assisting environmental decision-makers by contributing to Clean Water and Sanitation (SDG 6) and Good Health and Well-Being (SDG 3), among other goals.

Both human life and aquatic life are at risk due to chemical and biological pollution contaminating rivers worldwide. Contaminant transport through river water under spatially and temporally varying flow conditions is crucial for many environmental, biophysical, and ecological applications, including pollutant treatment and water quality management. The current study investigates the remediation of a polluted river using dissolved oxygen (DO) enrichment through controlled clean-water release, with the aim of monitoring environmentally safe zones for fish survival at a given time. In addition, an AI-based framework is employed to predict the coupled dynamics of pollutant concentration and dissolved oxygen, enabling efficient identification of suitable habitats and supporting water-management decisions that maintain dissolved oxygen above the ecological threshold for aquatic life. The solution of the two coupled pollution and aeration equations for flow in a river is utilized to achieve this goal. The coupled pollution–aeration equations governing river flow are solved using a finite difference scheme. In addition, an analytical solution is employed under simplified conditions to validate the numerical results. This combined analytical–numerical framework enables an accurate and reliable investigation of pollutant transport and dissolved oxygen dynamics in polluted rivers under realistic environmental conditions. For instance, the authors of31 used the advection-diffusion equation (ADE) to characterize the dynamics of river pollution and used a simulation study to derive a numerical solution. By splitting the river into two regions32, investigated an analytical solution for a one-dimensional ADE of the pollutant concentration. The authors of33 addressed the space–time-dependent ADE in two-dimensional conservative solute transport in a heterogeneous porous medium for different input sources analytically34. Investigated the use of clean water to remediate contamination in a river. The authors of35 offered analytical and numerical solutions for pollution concentration with exponentially and evenly increasing sources. The authors of36 examined the numerical treatment of the mathematical model for water contamination; they solved the generalized transport equation using an implicit centred difference scheme in space and a forward difference scheme in time. A grey differential model was employed by37 to develop a numerical simulation of river water contamination. For the mathematical model of the transport equation in a straightforward rectangular box domain38, offered a numerical solution. The ADE with variable coefficients and for anisotropic media was solved by39 using the boundary element approach. A numerical simulation of one-dimensional ADE utilizing an explicit centered difference scheme and a Crank-Nicolson scheme for specified initial and boundary data was reported by40. The authors of41,42 developed a mathematical model to help understand the behavior of the pollution-aeration process, its relationship to injected contaminants along a river, and its effect on fish survival. The suggested approach helps regulate industrial, agricultural, and urban behaviours and imposes restrictions as necessary. Finally43, carried out a comprehensive study in the atmospheric science community by comparing the deep neural network (DNN) with the finite difference techniques (FDM) to solve the 2D advection-diffusion equation to characterize the long-range transport of atmospheric pollutants. The results show that the DNN solver is the most accurate, with the mean absolute error and maximum absolute error of fluid concentration reduced by up to two orders of magnitude when compared to the FDM-based method, corresponding to relative error reductions of 95% and 97%, respectively. For measuring the generality of neural network layers across a continuously-parameterized set of tasks44, proposed a method based on the singular vector canonical correlation analysis (SVCCA). They demonstrate this method by investigating generality in neural networks trained to solve parameterized boundary value issues using the Poisson partial differential equation45. used the Laplace transformation technique to derive an analytical solution for the unsteady-state aeration model in its linear form. The authors of46 developed a numerical solution for the non-linear aeration model in two case studies. Studies on modeling the relationship between fish survival and population dynamics have been conducted47,48,49,50. Experimentally, the study of aeration systems is a rich and active research subject. Researchers address several aeration technologies and their usefulness in the context of design51,52,53,54.

Artificial intelligence (AI) approaches have recently been used more extensively in sustainable development research22 to describe solute movement in porous media, particularly in studies on river pollution and water quality prediction (e.g.20,55). Because of their inherent ability to manage enormous datasets, intricate variable interactions, and complex non-linear relationships, these approaches provide significant benefit over conventional statistical and mechanistic models. According to recent literature reviews14,15,16,17,18,19,20,21, building hybrid algorithms by combining different machine learning and deep learning algorithms has gained the attention of researchers, as it can overcome the limitations of individual algorithms and achieve higher performance and accuracy than individual ones. Some of the prior research in this area is described below.

A hybrid deep learning (DL) model, CNN-LSTM and CNN-GRU, is presented in56 for aquaculture water quality prediction (WQP) by combining convolutional neural networks (CNN) with long short-term memory (LSTM) and gated recurrent units (GRU). Similarly, the authors of17 developed a hybrid model for WQP that combines CNN, LSTM, and a self-attention mechanism.

For the objective of forecasting the river hydrodynamic changes57, presented a coupling framework of a hydrodynamic model and machine learning approach, named GRU-HD, for modeling lake-connected river systems. The framework is based on the integration of the Gated Recurrent Unit (GRU) with a 1-D HydroDynamic model (GRU-HD), taking into account the advantages of the interpretability of physical hydrodynamic modeling and the adaptability of machine learning.

Reference58 introduced hybrid models that integrate temporal pattern attention (TPA) mechanisms with advanced neural networks, including feed-forward neural networks (FFNNs) and long short-term memory networks (LSTMs), for predicting water quality in complex systems.

To predict pan evaporation (Ep), a critical factor in water resource management59, presented two hybrid models, called GA-RF and GA-MLP, for modelling Ep at a target station. These models use data from adjacent stations by combining two base models: Random Forest (RF) and Multi-Layer Perceptron (MLP), along with their optimized counterparts created through a Genetic Algorithm (GA).

The authors of60 presented a hybrid Wavelet-CNN-LSTM model for short-term water demand forecasting that incorporates the time-frequency decomposition features of Wavelet Multi-Resolution Analysis (MRA) into CNN-LSTM, an advanced deep learning model. Experimental results show that, in both single-step and multi-step prediction, the Wavelet-CNN-LSTM model outperforms four distinct deep learning models: ANN, Conv1D, LSTM, and a gated recurrent unit (GRU).

Reference61 proposed two hybrid decision tree-based machine learning models, called CEEMDAN-XGBoost and CEEMDAN-RF, for short-term water quality prediction. The two hybrid models’ basic components are random forest (RF) and extreme gradient boosting (XGBoost), each of which introduces a sophisticated method for denoising data: complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN).

The authors of62 studied hybrid models that combine machine learning (ML) with statistical techniques (such as principal component analysis (PCA)) to anticipate and classify water quality.

The water quality class (WQC), a unique class defined on the basis of the water quality index (WQI), and the water quality index (WQI), a single index to describe the overall quality of water, are estimated by63 using a number of supervised machine learning algorithms. According to the experimental results, the most effective methods for predicting the WQI are gradient boosting (learning rate of 0.1) and polynomial regression (degree of 2), with mean absolute errors (MAE) of 1.9642 and 2.7273, respectively. On the other hand, the WQC is most effectively classified by the multi-layer perceptron (MLP), which has a configuration of (3, 7) and an accuracy of 0.8507.

The authors of64 proposed a hybrid CNN-LSTM-based deep learning framework for predicting water level and quality. The authors of65 proposed a new hybrid deep learning model for air quality forecasting. One-dimensional Convolutional Neural Networks (1D-CNNs) and Bi-directional Long Short-term Memory networks (Bi-LSTM) serve as the foundation for the proposed model. The former extracts the local trend features and spatial correlation features, while the latter one is utilized to learn spatial-temporal connections.

While these models demonstrate the power of ensembling, they primarily focus on feature optimization for specific hydrological variables. Our work differs by addressing the structural limitations of tree-based algorithms—specifically the lack of gradient continuity—and using a physics-derived dataset to enforce strict physical consistency in a multi-stage sequential framework.

The transport of pollutants and dissolved oxygen (DO) in the river is modeled using a coupled one-dimensional advection–dispersion framework. The model accounts for advection, longitudinal dispersion, pollutant degradation, oxygen consumption, and atmospheric reaeration. The unsteady transport process is governed by a system of coupled one-dimensional advection–dispersion equations for the pollutant concentration \(\:\text{P}(x,\text{t})\) (kg km-3) and the dissolved oxygen concentration \(\:\text{X}(x,\text{t})\) (kg km-3), where t (day) denotes time. The coupled equations can be expressed as follows:

where, \(\:{D}_{p}\:\left({{km}^{2}\:day}^{-1}\right)\) is the diffusion coefficient of the pollutant concentration, \(\:{D}_{X}\:\left({{km}^{2}\:day}^{-1}\right)\) is the diffusion coefficient of DO, \(\:u\:\left({km\:day}^{-1}\right)\) is the flow velocity, \(\:{K}_{1}\:\left({day}^{-1}\right)\) is the degradation rate coefficient for pollution at a constant temperature, \(\:{K}_{2}\:\left({day}^{-1}\right)\) is the degradation rate coefficient for DO at a constant temperature, \(\:{K}_{3}\left({kg\:km}^{-3}\right)\) is the half-saturated oxygen demand concentration for pollution decay, \(\:\mu\:\left({kg\:km}^{-1}{day}^{-1}\right)\) is the rate of the injected pollutant discharge, \(\:\lambda\:\) is spatial decay parameter of the pollutant discharge source, \(\:\alpha\:\:\left({day}^{-1}\right)\) is the reaeration coefficient representing the oxygen transfer rate from the atmosphere to the water and \(\:S\left({kg\:km}^{-3}\right)\) is saturation concentration of dissolved oxygen in water.

In Eq. (1), the term \(\:\frac{\partial\:P}{\partial\:t}\:\)represents the temporal variation of the pollutant concentration. The diffusion term \(\:{D}_{p}\frac{{\partial\:}^{2}P}{\partial\:{x}^{2}}\) describes the longitudinal dispersion of pollutants caused by molecular diffusion and turbulent mixing along the river. The advection term \(\:-u\frac{\partial\:P}{\partial\:x}\:\) represents the downstream transport of pollutants due to the river flow velocity. The nonlinear reaction term \(\:-{K}_{1}\frac{X}{X+{K}_{3}}\:P\) models the degradation of pollutants through oxygen-dependent biochemical processes, where the degradation rate increases with dissolved oxygen availability. The source term \(\:{\upmu\:}\:\left(1-\text{exp}\left(-\lambda\:x\right)\right)\:\) represents the spatially distributed pollutant loading along the river, where the exponential function accounts for the gradual spatial decay of the pollutant discharge source.

Similarly, in Eq. (2), the term \(\:\frac{\partial\:X}{\partial\:t}\:\) denotes the temporal variation of dissolved oxygen concentration. The diffusion term \(\:{D}_{X}\frac{{\partial\:}^{2}X}{\partial\:{x}^{2}}\:\) represents the longitudinal dispersion of dissolved oxygen within the river. The advection term \(\:-u\frac{\partial\:X}{\partial\:x}\:\) describes the transport of dissolved oxygen by the flowing water. The nonlinear consumption term \(\:-{K}_{2}\frac{X}{X+{K}_{3}}P\) characterizes the depletion of dissolved oxygen due to biochemical pollutant degradation processes. Finally, the reaeration term \(\:\alpha\:\:\left(S-X\right)\:\) represents the atmospheric oxygen transfer from air to water, which drives the dissolved oxygen concentration toward the saturation level (S).

In this study, the river is assumed to be polluted as a result of previous industrial activities. To mitigate the existing pollution, clean oxygen-rich water is released at the upstream boundary (\(\:x=0\)). The river initially contains dissolved pollutants before the remediation process starts. Accordingly, the initial and boundary conditions associated with Eqs. (1)–(2) describe the initial spatial distributions of pollutant concentration and dissolved oxygen along the river. At the upstream boundary, the pollutant concentration is assumed to vanish due to the injection of clean water, while a prescribed dissolved oxygen concentration is maintained. The far-field boundary conditions assume negligible concentration gradients sufficiently far downstream. The corresponding initial and boundary conditions are given by:

where\(\:\:{P}_{0}\) \(\:(\text{k}\text{g}\:{km}^{-\:3}\)) and \(\:{X}_{0}\:\left(\text{k}\text{g}\:{km}^{-\:3}\right)\) are the initial concentrations for pollution and DO at \(\:x\to\:\infty\:\), \(\:{\text{P}}_{1}\left(\text{k}\text{g}\:{km}^{-\:3}\right)\) and\(\:\:{X}_{1}\left(\text{k}\text{g}\:{km}^{-\:3}\right)\) are constant. \(\:{K}_{4}\left(\text{k}\text{m}\right)\) is the initial pollutant-decay length and \(\:{K}_{5}\left(\text{k}\text{m}\right)\) is initial DO growth length. \(\:{X}_{2}\left(\text{k}\text{g}\:{km}^{-\:3}\right)\) represents the dissolved oxygen concentration of the clean water released at the upstream boundary \(\:x=0.\) This value is typically close to, or equal to, the saturation concentration \(\:S\:\), reflecting oxygen-rich inflow used for river flushing and remediation.

This study considers a natural river system that has been polluted as a result of previous industrial activities, leading to the presence of dissolved pollutants along the river before the implementation of any remediation measures. This pollution causes a deterioration in water quality and a reduction in dissolved oxygen levels, which adversely affect the river ecosystem and aquatic life. The problem does not aim to model an active industrial discharge; rather, it focuses on mitigating the impact of existing pollution through controlled environmental remediation strategies. To improve water quality, clean water is released into the river at the upstream location \(\:x=0\). The injected water is assumed to be free of pollutants and rich in dissolved oxygen, representing a controlled river-flushing and remediation process. The clean-water release starts at \(\:t=0\) and continues over time, allowing the river system to gradually recover in the downstream direction. As a result, pollutant concentrations decrease due to dilution, transport, and physical, chemical, and biological degradation processes, while dissolved oxygen levels increase due to direct oxygen supply in addition to atmospheric reaeration. Controlling the flow velocity at the clean-water injection point is a key element of the remediation strategy. The inlet velocity \(\:u\) reflects the amount of clean water introduced into the river and therefore directly influences pollutant transport, dispersion, degradation rates, and dissolved oxygen dynamics. Increasing the flow velocity enhances longitudinal mixing and accelerates downstream transport, while also improving oxygen availability. However, excessive clean-water release may lead to inefficient water use without proportional environmental benefits, highlighting the need to identify an optimal flow velocity. From an ecological perspective, the effectiveness of the remediation process is evaluated based on dissolved oxygen availability, which is a critical factor for aquatic life. It is well known that fish inhabit regions where the dissolved oxygen concentration exceeds 30% of the saturation concentration. Accordingly, this threshold is adopted as a criterion for defining environmentally safe conditions along the river. Ensuring that dissolved oxygen levels remain above this limit throughout the river reach is therefore a primary objective of the clean-water release strategy. By linking the flow velocity with the coupled dynamics of pollutant concentration and dissolved oxygen, the proposed framework enables the identification of river regions capable of sustaining aquatic life. Regions that satisfy the oxygen threshold are classified as environmentally safe habitats, which support the delineation of suitable fishing zones and the selection of appropriate locations for fish farming along the river. At the same time, the model allows for the determination of the minimum amount of clean water required to achieve these environmental objectives through appropriate control of the inlet velocity. This approach helps avoid unnecessary water release and supports efficient and sustainable water resource management, while balancing ecological protection with responsible water use.

The explicit finite difference method is applied to solve Eqs. (1) and (2) associated with the initial and boundary conditions (3–8). The central difference scheme was used for \(\:\frac{{\partial\:}^{2}P}{\partial\:{x}^{2}}\), \(\:\:\frac{{\partial\:}^{2}X}{\partial\:{x}^{2}}\), \(\:\:\frac{\partial\:P}{\partial\:x}\) and \(\:\frac{\partial\:X}{\partial\:x}\). The forward difference scheme was used for \(\:\frac{\partial\:P}{\partial\:t}\:\) and \(\:\frac{\partial\:X}{\partial\:t}\). With these substitutions, Eqs. (1) and (2) can be written as:

where i and j refer to the discrete step lengths \(\:\varDelta\:x\) and \(\:\varDelta\:t\) for the coordinate x and time t, respectively, and

The initial and boundary conditions for the two Eqs. (9) and (10) can be expressed in the finite difference form as

where\(\:\:{t}_{j}=j\:{\Delta\:}t\:\) and \(\:{x}_{i}=i\:{\Delta\:}x.\) \(\:N=\frac{{\:\:x}_{\infty\:}}{\varDelta\:x}\:\) is the grid dimension in the \(\:x\) direction and \(\:{x}_{\infty\:}\:\) is the distance from \(\:x=0\) in the direction of \(\:x\) at which \(\:\:\:\frac{\partial\:P}{\partial\:x}\to\:0\:\) and \(\:\frac{\partial\:X}{\partial\:x}\to\:0\).

To validate the accuracy of the explicit finite difference scheme, we consider a simplified version of Eq. (1) in which the reaction and source terms vanish. Setting \(\:{K}_{1}=0\) and \(\:{\upmu\:}=0\), and choosing the initial and boundary conditions as \(\:{P}_{0}=0\), the governing equation reduces to the homogeneous advection–diffusion equation:

Applying the Laplace transformation to Eq. (17) and using the inverse Laplace transformation, we obtain the analytical solution of the advection–dispersion Eq. (17), subject to the initial and boundary conditions (18)–(20), in terms of \(\:x\) and \(\:t\), as follows:

where erfc is the complementary error function. We have confirmed that Eq. (21) satisfies Eqs. (17–20).

This research adopts a systematic machine learning and deep learning framework for predicting pollutant concentration dynamics in a river system. The core of the methodology involves leveraging physically based simulation data to train and validate hybrid AI algorithm. As depicted in the methodology workflow (Fig. 1), the study follows a rigorous, multi-phase progression. The data management lifecycle—encompassing collection, pre-processing, artifact filtering, and exploratory analysis—is detailed in Sect. 4.1, while the specific partitioning of datasets for training, validation, and testing is discussed in Sect. 4.2. The benchmark baseline models used for comparison are introduced in Sect. 4.3. The architecture and logic of the proposed hybrid RF-MLP algorithm, alongside its algorithmic configuration and implementation details, are elaborated in Sect. 5. Performance benchmarking and the selection of evaluation metrics are established in Sect. 6, leading to the comprehensive result analysis and trade-off discussion provided in Sect. 7.

The dataset used to train and evaluate the AI algorithms was derived from the two coupled pollution and aeration equations. Thus, the proposed hybrid algorithm adheres to the physical consistency of the ADE process. The dataset contains the pollution concentration \(\:P\left(x,t\right)\) and dissolved oxygen concentration \(\:X\left(x,t\right)\) at 50,000 specific spatial positions (\(\:x\:\)) across various distinct speeds (\(\:u\)). Each observation represents a unique spatial point ranging from \(\:x=0.0\:\) to \(\:x=4.0\:\), with corresponding levels for speeds \(\:u\in\:\left\{\text{1,6},11\right\}\). This multi-output structure allows the models to learn the spatial distribution of the pollution plume and dissolved oxygen recovery under varying hydrodynamic regimes. The primary input feature for the predictive algorithms is the spatial position (\(\:x\:\)), while the target variables are the concentration values \(\:P\:\) and \(\:X\:\) across the specified speeds. Prior to model training, comprehensive data quality assessment and pre-processing steps were implemented to ensure data integrity and optimize algorithmic performance:

Integrity Check: The dataset dimensions and data types were inspected to confirm consistency across the 50,000 observations.

Handling Missing Values: A robust pre-processing pipeline was established using a SimpleImputer with a “mean” strategy to address any potential numerical instabilities, although the analytical source provides a continuous dataset.

Artifact Filtering: To ensure physical plausibility, any rows containing negative concentration values—which may arise as numerical artifacts during the calculation process—were excluded. The final cleaned dataset maintained a dimension of 50,000 observations with the input feature \(\:x\:\) and target outputs for \(\:P\:\) and \(\:X\:\) at three different flow velocities. (One input feature \(\:x\:\) and six target outputs).

Feature and Target Separation: The spatial variable \(\:x\:\) was isolated as the independent feature, while the concentration values \(\:P\:\) and \(\:X\:\) for each speed were defined as the dependent multi-output target vector.

Feature Scaling: To mitigate the impact of varying scales and accelerate the convergence of gradient-based algorithms, the data were normalized using StandardScalar. Thus, transforming the spatial features to have a zero mean and unit variance.

Exploratory data analysis (EDA) was performed to understand the structural and statistical characteristics of the concentrations across different hydrodynamic conditions. The results reveal strong spatial and hydrodynamic dependencies, which are critical for characterizing the non-linear transport processes captured by the ADE. The Histograms (Fig. 2) illustrate the frequency distribution of concentration values for each speed\(\:\:\:u\in\:\left\{\text{1,6},11\right\}\). For pollution concentration (\(\:P\:\)), the distributions are bimodal distributions, reflecting the distinct phases of plume rise and stabilization across the spatial domain. This pattern indicates that at most spatial positions (\(\:x\:\)), the concentration oscillates between initial low-level states and stabilized peak regions. Conversely, the dissolved oxygen (\(\:X\:\)) profiles demonstrate a depletion and sag pattern, where concentrations decrease significantly as the water moves farther from the upstream boundary, indicating a strong negative correlation with spatial position \(\:x\:\).

Visualizing histograms to inspect data distributions of pollutant concentrations (\(\:P\)) and dissolved oxygen (\(\:X\)) for each speed (\(\:u\)).

The Box Plots (Fig. 3) and Correlation Matrix (Fig. 4) further highlight the internal structure of the multi-output vector. The box plots reveal that for pollution concentration \(\:P\:\), the majority of data points fall within a narrow interquartile range (IQR). Notably, the data points plotted beyond the whiskers, particularly visible for \(\:{P}_{u=11}\), are not indicative of measurement noise or data errors. Since the input data is received from the solution of the coupled equations, these “statistical (not real) outliers” are physically significant, as they represent the high-concentration peaks and map the core of the pollutant plume moving through the spatial domain. In contrast, the dissolved oxygen \(\:X\:\) shows a much broader variability, reflecting its complex recovery curve across the spatial domain. The correlation matrix confirms an extremely high inter-variable correlation (\(\:\rho\:\approx\:0.99\:\)) between the concentrations at different speeds. Interestingly, spatial position \(\:x\:\)shows a strong negative correlation with \(\:{P}_{u=6}\) and a strong positive correlation with \(\:{X}_{u=11}\), quantifying the physical phenomena of pollutant attenuation and oxygen re-aeration as water moves downstream.

Box plots to detect potential outliers and assess variability.

Correlation matrix to analyze the relationships between pollutant and dissolved oxygen concentrations at different speeds.

The descriptive statistics for the input feature and target variables are summarized in Table 1. The spatial coordinate \(\:x\:\) is uniformly distributed across the domain [0, 4.0]. For the target concentration\(\:\:\:P\), both the median and the standard deviation increase monotonically with speed. For instance, at \(\:u=1\), the median concentration is approximately 0.41 with a standard deviation of 0.14. By \(\:u=11\), the median rises to 0.43 and the standard deviation to approximately 0.25. The 75th percentile also shifts from 0.55 (at \(\:\:u=1\)) to 0.60 (at \(\:u=11\)), confirming that higher flow velocities result in a broader spatial distribution of concentration levels across the extended spatial domain. These statistics confirm the complexity and nonlinearity of the modeled coupled system.

To prevent overfitting and ensure robust evaluation, the dataset was partitioned into three distinct subsets using a two-phase splitting strategy to ensure both robust hyperparameter tuning and unbiased performance evaluation:

Initial Split: An 80/20 split was first applied to separate the Testing Set (10,000 samples) from the initial training pool.

Secondary Split: The remaining training pool was further divided using an 80/20 ratio to create a Validation Set for hyperparameter tuning and early stopping, and a final Training Set.

The resulting final distribution consists of approximately 64% training data (32,000 samples), 16% validation data (8,000 samples), and 20% testing data (10,000 samples). A fixed random state was maintained throughout to ensure the reproducibility of the experimental results.

Before detailing the proposed hybrid algorithm, several standard machine and deep learning algorithms were implemented to establish performance benchmarks.

Linear Regression: Linear Regression is one of the simplest supervised learning models and assumes a linear relationship between the independent variables and the target variable. Despite its simplicity, it is frequently used as a baseline model for evaluating the performance of more complex machine learning algorithms.

Support Vector Machine (linear/RBF SVM): Support Vector Machines aim to find an optimal separating hyperplane between data points in the feature space. For nonlinear classification problems, kernel functions such as the Radial Basis Function (RBF) kernel are used to project data into a higher-dimensional space where linear separation becomes feasible. Linear SVM is commonly applied when data are linearly separable or when dealing with high-dimensional feature spaces.

Random Forest (RF): Random Forest (RF) is an ensemble algorithm that constructs multiple decision trees and aggregates their outputs to produce a more accurate and stable prediction66. Each tree is trained using bootstrap sampling, and at each split a random subset of features is considered. This randomness reduces correlation among trees, lowering variance and improving generalization. In classification, predictions are made via majority voting, while regression uses averaging of tree outputs. RF is widely used due to robustness to noise, ability to model nonlinear patterns, and strong performance with high‑dimensional data.

Multi-Layer Perceptron (MLP): A Multilayer Perceptron (MLP) is a feed‑forward neural network composed of an input layer, one or more hidden layers, and an output layer. Each neuron performs a weighted sum followed by a nonlinear activation, enabling the network to model complex relationships67. MLPs are trained using backpropagation, and their flexibility in architecture and activation functions makes them suitable for nonlinear prediction tasks. MLPs also have strong approximation capabilities.

Gradient Boosting: Gradient Boosting is an ensemble learning technique that builds a sequence of weak learners, typically decision trees, where each subsequent model is trained to reduce the residual errors of the previous one. It is the most recent algorithm employed in competitions. It uses an additive model that allows for the optimization of a differentiable loss function. The optimization process minimizes a loss function using gradient descent, leading to strong predictive performance, especially in nonlinear problems68.

Recurrent Neural Networks (GRU, LSTM, and BiLSTM): Recurrent Neural Networks (RNNs) are designed to model sequential and time-dependent data by maintaining internal memory of past inputs. Long Short-Term Memory (LSTM) networks address the vanishing gradient problem through gating mechanisms that regulate information flow69. Gated Recurrent Units (GRU) provides a simplified alternative to LSTM, reducing the number of parameters while maintaining competitive learning performance. Bidirectional LSTM (BiLSTM) processes sequences in both forward and backward directions, enabling the model to capture past and future temporal dependencies more effectively70.

Computational framework and models architecture

The proposed RF-MLP algorithm utilizes a heterogeneous boosting architecture that differs fundamentally from standard ensemble techniques in its underlying philosophy and structural execution. While standard stacking typically employs multiple base learners whose outputs are aggregated by a meta-learner, and standard boosting (e.g., AdaBoost or Gradient Boosting) relies on a sequence of homogeneous weak learners, our approach follows a three-stage sequential logic:

Stage 1 (Initial Mapping): The Random Forest base learner generates an initial prediction of the concentration values. This stage captures the primary non-linear trend of the advection-dispersion process. The choice of Random Forest (RF) as the primary layer is motivated by its inherent stability in handling high-dimensional environmental data and its ability to identify hierarchical “regimes” in spatial data. In fluvial transport, the pollutant profile is split into distinct regions (e.g., arrival, peak, and decay phases). RF’s decision-tree structure is uniquely suited to partition the feature space into these segments without requiring extensive manual feature engineering. RF establishes the global magnitude “backbone” of the plume, effectively reducing variance and identifying non-linear patterns that baseline linear models fail to capture.

Stage 2 (Residual Refinement Layer): While RF provides a robust magnitude estimate, its output is piecewise-constant, leading to “step-like” artifacts that violate the continuous physical gradients of diffusion. Thus, an MLP layer, unlike simple “correctors” that merely add a bias term, is then trained to predict the residuals (errors) of the RF. As a universal function approximator, the MLP excels at learning continuous relationships. Because the MLP’s target is the “unaccounted variance” rather than the raw data, it can focus specifically on refining the boundaries where the RF’s decision trees are less precise, restoring the smooth exponential curvature and high-frequency transitions required for physical fidelity to the coupled equations.

Stage 3 (Synthesis): The final output is the sum of the RF prediction and the MLP-corrected residual. The hybrid framework concludes after the MLP stage because additional layers introduce unnecessary complexity with negligible performance gains. As will be shown in the “results section”, the Hybrid RF + MLP (Residual) Corrected model already achieves a Willmott’s Index of 0.999999998635489. Further layers would provide the risk of overfitting to the training data and increase computational latency, which would undermine the model’s utility for real-time environmental monitoring.

This architecture ensures that the model does not merely perform ‘function approximation’ but internalizes the governing physics, resulting in superior physical fidelity compared to standalone or traditionally stacked models.

The hyperparameters for all machine learning and deep learning algorithms employed in this study were meticulously defined to ensure reliable performance and a fair comparison across different modeling algorithms. The final configurations for each model/algorithm are summarized in Table 2. These parameters were selected based on empirical experimentation and validation performance, aiming to achieve an optimal balance between predictive accuracy, generalization capability, and computational efficiency. For the Random Forest (RF) model, a total of 100 trees were utilized to construct the ensemble, providing stable predictions while minimizing model variance. A fixed random state of 42 was applied to ensure the reproducibility of all experimental results. The Multi-Layer Perceptron (MLP) was configured with three hidden layers consisting of 100, 100, and 50 neurons, respectively, enabling the network to approximate complex nonlinear mappings effectively. The Rectified Linear Unit (ReLU) activation function was used in the hidden layers to facilitate gradient flow, while the Adam optimizer was employed to manage the learning rate during training. For the sequential deep learning models, LSTM, BiLSTM, and GRU architectures were implemented with 50 hidden units. The tanh activation function was utilized within the recurrent layers to handle gated state transitions, and the models were trained using the Adam optimizer with a Mean Squared Error (MSE) loss function. To prevent overfitting and ensure effective generalization to the test set, we employed an Early Stopping callback with a patience of 15 epochs, monitoring the validation loss to restore the best performing weights.

The implementation was executed in a Python environment on the Google Colab platform. Traditional machine learning algorithms were implemented using the Scikit-learn library, with models such as Gradient Boosting and SVM wrapped in a MultiOutputRegressor to handle the simultaneous prediction of concentrations across multiple spatial locations. The deep learning architectures were developed using the TensorFlow/Keras framework, utilizing the Sequential API for model construction.

Prior to presenting the obtained results, we will outline the various measures utilized to evaluate the performance of the proposed algorithm in comparison to state-of-the-art algorithms. The evaluation of predictive models for river pollution necessitates a multifaceted statistical approach, as no single metric can fully characterize the nuances of model fit, variance explanation, and potential for generalization71. In all subsequent equations, \(\:n\:\) denotes the number of samples, \(\:{y}_{i}\) represents the observed analytical concentration, \(\:\widehat{{y}_{i}}\) is the predicted concentration, and \(\:\overline{y}\) is the mean of the observed values.

Absolute Error Metrics provide an assessment of the magnitude of error (deviation between predicted and observed pollution concentrations) in the same physical units as the pollutant concentration. Thus, absolute error metrics are crucial for understanding the model’s reliability in practical units. For regression, we used the following metrics:

Mean Absolute Error (MAE): Measures the average magnitude of errors without considering direction. It provides a robust ground-truth of expected error in the same units as the target variable.

Mean Squared Error (MSE): Penalizes larger errors quadratically. A lower MSE in the Hybrid model indicates a significant reduction in “outlier” predictions compared to traditional methods.

Root Mean Squared Error (RMSE): The square root of MSE, providing an error metric in the original units while remaining sensitive to large deviations.

MSE and RMSE penalize larger errors more heavily than MAE, making it a rigorous indicator of model stability in the presence of pollution spikes or outliers.

Correlation and Agreement metrics are dimensionless indices that evaluate how well the model captures the trend and variance of the pollutant plume:

Coefficient of Determination (\(\:{R}^{2}\)): Quantifies the proportion of the variance in the target variable that is predictable from the input features.

Correlation Coefficient (CC): Measures the strength and direction of the linear relationship between the analytical and predicted values.

Willmott’s Index of Agreement (WI): A standardized measure of the degree of model prediction error by comparing the accuracy of a model to a reference model based on the mean of the observed data. It varies between 0 and 1, where 1 indicates a perfect match:

Scattered Index (SI) is a normalized measure of the RMSE, used to compare the precision of the models relative to the mean of the observations to provide a measure of relative dispersion.

Training and prediction time: Given the requirements for potential real-time water quality monitoring or high frequency forecasting, the total CPU/GPU time required for model convergence (Training Time) and the latency for a single forward pass (Prediction Time) were recorded. These metrics were calculated for both validation and testing phases to ensure robustness and mitigate the risks of overfitting, which is a common concern in high-dimensional machine learning applications.

Table 3 summarizes the statistical comparative performance across all evaluated algorithms during the test phase.

The results demonstrate that the proposed hybrid (RF-MLP) model consistently outperforms all other architectures, achieving near-perfect metrics with a Test RMSE of 0.00012966 and an \(\:{R}^{2}\) of 0.9999999692. This high level of precision is indicative of the model’s ability to not only learn the primary structural patterns of the data via the RF base but also to refine these predictions by capturing the residual errors through an MLP corrector. While standalone ensemble models achieve high accuracy, the Hybrid model, by adding the MLP corrector, further reduces the error, representing a consistent refinement in predictive precision. This suggests that the residual term contains significant deterministic patterns—likely non-linear spatial dependencies that the tree-based splits of the RF could not fully resolve—which the MLP is adept at identifying. This dual-stage modeling respects the structural strengths of both algorithms: the RF provides a stable, low-variance baseline, while the MLP provides a high-capacity non-linear refinement. Deep learning algorithms like LSTM and GRU are frequently proposed as solutions for datasets with complex dependencies. Indeed, the results show that LSTM and GRU perform exceptionally well (\(\:{R}^{2}\approx\:0.999\)). However, they do not surpass the hybrid RF-MLP in this specific dataset spanning the expanded spatial domain [0, 4.0]. The progression of error increases as the analysis moves toward these deep learning architectures (Test RMSE = 0.003114 for GRU, and 0.003025 for LSTM), suggesting that while these models are adept at capturing dependencies, they may require more extensive hyperparameter tuning or larger datasets to match the precision of refined ensemble hybrids in this specific context. A possible explanation is that the hybrid model leverages the strengths of ensemble learning, which is often more data-efficient than the deep recurrent structures of LSTMs when the dataset size is finite. Furthermore, the residual correction strategy allows the model to explicitly target the ‘hardest’ part of the prediction task—the errors—rather than attempting to learn the entire signal at once. In contrast to the hybrid framework, linear models—specifically Linear Regression and Linear SVM—demonstrate significantly higher error rates and lower variance explanation, underscoring the inadequacy of linear approximations for environmental spatial snapshots that are inherently governed by non-linear physical interactions. A CC and \(\:{R}^{2}\) of nearly 1.0, as seen in the top-performing models, indicate that these architectures have achieved near-perfect mapping of the input-output relationships. Because the training, validation, and testing partitions are synthetically generated from a deterministic, noise-free physical model (specifically, the coupled advection-dispersion-aeration equations), the machine learning algorithms are emulating a mathematically closed, continuous system rather than fitting stochastic, observationally noisy real-world field measurements. Consequently, the near-unity values of \(\:{R}^{2}\) and CC reflect the high-capacity estimators’ absolute precision in emulating the idealized, noise-free physics of mass transport, rather than an expectation of identical statistical performance under unmodeled real-world environmental turbulence and sensor-level measurement errors. While CC and \(\:{R}^{2}\) are high for most of the compared models, the \(\:{R}^{2}\) degrades for the linear models. The Willmott Index (WI), or Index of Agreement, provides additional validation of model skill. For the top models—including Hybrid, RF, GRU, MLP, LSTM, BiLSTM, and GB—the WI is nearly 1.0, reflecting ‘exceptional accuracy’ and ‘strong agreement’ with spatially distributed pollution patterns (e.g., Hybrid WI = 0.9999999986). Even for the poorly performing linear algorithms, the agreement is noticeably lower, suggesting that they fail significantly in both absolute precision and capturing general directional trends. The compared models’ performance was evaluated based on the Scatter Index (SI) criteria, where an SI below 0.1 signifies ‘excellent’ model performance. Applying these thresholds, the Proposed Hybrid (RF-MLP), RF, GRU, LSTM, MLP, BiLSTM, and GB consistently achieved ‘excellent’ ratings with SI values significantly below the 0.1 threshold (e.g., Hybrid SI = 0.00007326). In stark contrast, the linear models yielded significantly higher SI values, placing them in lower performance categories. This disparity validates that for river pollution forecasting, the non-linear mapping capabilities of hybrid algorithms are an operational necessity for achieving reliable, high-fidelity concentration profiles.

In the context of real-time monitoring and autonomous early-warning systems, predictive fidelity must be strategically balanced against computational overhead. The deployment of pollution models in IoT-integrated sensor networks necessitates architectures that can be trained and executed within strict temporal windows, particularly as the sampling frequency of river data increases. The training and inference latencies recorded for the evaluated models reveal critical trade-offs in algorithmic efficiency as shown in Table 4.

The training duration serves as a proxy for the offline computational cost of model development. The deep recurrent architectures (LSTM, GRU, BiLSTM) exhibited the highest computational burden. Specifically, the GRU and BiLSTM required 453.19 s and 552.21 s, respectively, to reach convergence—representing an approximately 94.7-fold increase in training time for the BiLSTM compared to the standalone Random Forest model (5.83 s). This latency is a structural hallmark of recurrent networks, where the sequential processing of steps and the optimization of multi-gated weight matrices (input, forget, and output gates) impose a high cost on backpropagation through time (BPTT). While these models are highly effective at capturing dependencies, their training complexity poses a significant bottleneck for systems requiring frequent online retraining or deployment on edge devices with limited memory and processing power. The Proposed Hybrid RF-MLP algorithm occupies a computational “sweet spot,” with a total training time of 22.6 s. Given that this model achieved an order-of-magnitude reduction in RMSE compared to its deep learning peers (e.g., GRU at 0.003114 vs. Hybrid at 0.00012966), this represents an exceptionally high performance-per-second ratio. The marginal 16.75 s increase in training time over the standalone RF (5.83 s) is a justifiable engineering trade-off. It indicates that the MLP residual corrector, while significantly enhancing precision, remains a lightweight architectural addition compared to the massive parameter space of the deep BiLSTM.

For operational water quality management, inference latency (prediction time) can be more critical than training time. Once deployed, a model must ingest real-time spatial data and generate forecasts near-instantaneously to facilitate rapid emergency intervention. The MLP model demonstrated the highest inference efficiency with a prediction time of 0.1493 s for the entire test set. This speed is attributed to the feed-forward nature of the network, which, post-training, simplifies to a series of optimized matrix multiplications. In contrast, the Hybrid RF-MLP and standard RF models showed higher latencies (2.1275 s and 2.1182 s, respectively). This is due to the computational requirement of traversing a high-density forest of decision trees during the ensemble aggregation phase. However, even at approximately two to three seconds, these inference speeds are more than sufficient for sub-hourly or even minute-by-minute river monitoring. The most notable delays in this updated analysis occurred with the deep recurrent architectures, specifically the BiLSTM (2.9192 s) and LSTM (2.7704 s). For these models, the sequential processing logic and multi-gated weight matrices impose higher computational overhead compared to the proposed hybrid framework.

Table 5 summarizes the statistical comparative performance across all evaluated algorithms during validation phase.

A deeper examination of the validation vs. testing metrics provides critical insights into model generalization and the potential for overfitting. In this study, the performance of the Hybrid, RF, and deep learning models is remarkably consistent between the two phases. For example, the Hybrid RF-MLP algorithm maintains a near-identical error profile across both partitions, with a Validation RMSE of 0.00012442 and a Test RMSE of 0.00012966. Consistently low error rates across both datasets indicate a highly stable architecture with high generalization skills. This stability is often attributed to the ensemble nature of the Random Forest base, which uses bagging to reduce variance. By averaging the results of many uncorrelated decision trees, the model effectively filters out the random fluctuations present in individual sampling points. Similarly, the standalone RF model displays high stability with a Validation RMSE of 0.00013114 and a Test RMSE of 0.00013649, confirming that the high predictive precision is not a result of overfitting to the training or validation data, but rather a reflection of the models’ ability to capture the underlying physical dynamics of the coupled pollution-aeration equations.

The hybrid algorithm accurately captured the distinct spatial regimes of the plume, including the near-source peak, the dispersion zone, and the downstream depletion and sag phase. By refining the RF residuals, the MLP successfully eliminated step-like artifacts, resulting in smooth pollutant concentration curves and dissolved oxygen concentration curves that are fully consistent with the numerical results obtained using the explicit finite difference method as illustrated in Fig. 5 for the spatial domain\(\:\:0\:\le\:x\:\le\:4\:\). The spatial distributions of pollutant concentration \(\:P\left(x,t\right)\) and dissolved oxygen concentration \(\:X\left(x,t\right)\) were evaluated at flow velocities \(\:u\:=\:1,\:6\:\) and \(\:11\:\) along the river.

Predicted hybrid AI algorithm for the coupled water pollution and aeration concentrations vs. analytical solver actual data.

Beyond this specific case, the results highlight the broader potential of artificial intelligence to address coupled transport problems of increasing complexity. In realistic environmental systems, pollutant dynamics are often influenced by nonlinear interactions, spatial heterogeneity, and incomplete or noisy measurements, which make classical numerical approaches difficult to scale or computationally expensive. By learning directly from data while remaining guided by the underlying physical trends, hybrid AI models can effectively handle such complexities, improve predictive accuracy, and provide robust estimates even when the governing processes are difficult to model using traditional deterministic solvers. As a result, AI-based algorithms offer a practical and flexible tool for modeling and forecasting coupled pollutant transport and aeration dynamics in real-world river systems.

Regarding model interpretability, it is important to note that since the input is restricted to the spatial coordinate (x), traditional feature importance analyses are not applicable. Instead, the model’s interpretability is established through its physical consistency; the RF-MLP framework successfully captures the complex, non-linear spatial gradients inherent in the Advection-Diffusion Equation, ensuring that the AI-driven predictions remain grounded in the deterministic laws of mass transport and longitudinal dispersion.

To demonstrate the framework’s generalizability beyond the analytical generating equation, the hybrid RF-MLP model was evaluated against an independent validation dataset generated via a Numerical Solver. As illustrated in Fig. 6, the hybrid RF-MLP algorithm demonstrates exceptional agreement with the numerical target, accurately capturing the peak concentrations and arrival times. It is worth mentioning that the cross-solver validation datasets generated via analytical solver (Fig. 5) as well as numerical solver (Fig. 6) provide rigorous evidence that the proposed hybrid RF-MLP algorithm has successfully internalized the underlying physical laws of the coupled pollution–aeration equations, rather than merely performing function approximation of the analytical formula. However, the authors emphasize that this cross-solver comparison fundamentally serves as a rigorous mathematical verification of physical model emulation and mathematical consistency. While it confirms that the hybrid RF-MLP architecture successfully emulates the continuous spatial gradients of the governing transport equations, it does not constitute site-specific empirical validation. Natural river systems exhibit complex, stochastic irregularities, transient boundaries, and sensor-level measurement noise that are simplified by idealized, deterministic differential equations. This cross-solver verification is therefore presented as an essential mathematical baseline, establishing physical consistency before deployment in natural real-world environments.

Predicted hybrid AI algorithm for the coupled water pollution and aeration concentrations vs. numerical solver actual data.

In the numerical solution of Eqs. (9) and (10) using explicit finite difference method, the step length is assumed to be \(\:\varDelta\:x=0.05\:\left(km\right)\:\) and \(\:\varDelta\:t=0.002\:\left(day\right)\), for the ensuring of the stability of the finite difference scheme. The default parameter values 0 ≤ \(\:x\) ≤ 4, \(\:\text{t}=0.1\) (day), \(\:\mu\:=0.9\:\left(kg\:{km}^{-1}{day}^{-1}\right)\) \(\:{\text{D}}_{\text{P}}=\:{\text{D}}_{\text{X}}=0.3{(\text{k}\text{m}}^{2}{\text{d}\text{a}\text{y}}^{-1})\), \(\:u\) = 1\(\:\left({km\:day}^{-1}\right),\) \(\:{\:K}_{1}=0.2\) \(\:\left({day}^{-1}\right)\), \(\:{\text{K}}_{2}=1\left({day}^{-1}\right)\), \(\:{\text{K}}_{3}=0.3\left(\text{k}\text{g}{\:\text{k}\text{m}}^{-3}\right)\), \(\:{\:K}_{4}={\:K}_{5}=1\left(km\right)\), \(\:{\text{P}}_{\text{o}}=0.2,{\:\text{P}}_{1}=0.8\:\left(\text{k}\text{g}{\:\text{k}\text{m}}^{-3}\right)\),\(\:\:\:{\text{X}}_{\text{o}}=2.2\:\left(\text{k}\text{g}\:\text{k}{\text{m}}^{-3}\right)\), \(\:{\text{X}}_{1}=0.2\:\left(\text{k}\text{g}\:\text{k}{\text{m}}^{-3}\right)\),\(\:\:{\text{X}}_{2}=6.5\:\left(\text{k}\text{g}\:\text{k}{\text{m}}^{-3}\right)\), \(\:{\uplambda\:}=0.8{(\text{k}\text{m}}^{-1})\), \(\:{\upalpha\:}=0.01\:\left({\text{d}\text{a}\text{y}}^{-1}\right)\) and \(\:\text{S}=7\:\left(\text{k}\text{g}\:{\text{k}\text{m}}^{-3}\right)\).

Figure 5 illustrates the spatial distributions of the dissolved oxygen concentration \(\:X\left(x,t\right)\)and the pollutant concentration \(\:P\left(x,t\right)\) along the river at a fixed time for different values of the inlet flow velocity\(\:\:\:u\), representing different rates of clean-water release at the upstream boundary. At the upstream location\(\:\:x=0\), the dissolved oxygen concentration exhibits high values for all flow velocities due to the continuous injection of clean, oxygen-rich water. This reflects the implemented remediation strategy aimed at improving water quality in a river that was previously polluted as a result of past industrial activities. Since the injected water is free of pollutants, the pollutant concentration near the inlet remains very low, confirming the effectiveness of the clean-water flushing process in protecting the upstream region.

As the water flows downstream, the dissolved oxygen concentration gradually decreases due to its consumption by biochemical processes associated with the degradation of the existing pollutants, in addition to transport and dispersion effects. At the same time, the pollutant concentration initially increases, forming a characteristic downstream hump that represents the transport and redistribution of previously accumulated pollutants along the river. Further downstream, pollutant concentrations decrease due to dilution, dispersion, and degradation processes, indicating a gradual improvement in water quality.

The effect of the inlet flow velocity is clearly observed in both concentration profiles. For lower velocities, the residence time of water within the polluted reach is longer, which enhances oxygen consumption and results in a more rapid decline in dissolved oxygen levels. In contrast, higher flow velocities enhance longitudinal mixing and accelerate downstream transport, delaying the oxygen depletion and shifting the pollutant accumulation zone further downstream. Consequently, increasing the flow velocity improves the overall oxygen availability along the river while reducing localized pollutant buildup.

The horizontal dashed line represents 30% of the dissolved oxygen saturation concentration, which is adopted as the ecological threshold for sustaining aquatic life. The figure shows that, for the selected operating conditions, the dissolved oxygen concentration remains above this critical limit throughout the river reach for all considered flow velocities. This indicates that the applied clean-water release strategy is sufficient to maintain environmentally safe conditions along the river, allowing the ecosystem to support aquatic life.

From an environmental management perspective, the figure demonstrates how the proposed model can be used to assess the effectiveness of different remediation scenarios and to determine suitable flow velocities that ensure safe dissolved oxygen levels. By linking flow control with the coupled dynamics of pollutants and dissolved oxygen, the model provides a practical framework for identifying environmentally safe river sections that are suitable for fishing activities and fish farming, while also enabling the selection of the minimum clean-water release required to achieve these ecological objectives in a sustainable manner.

Figure 7 illustrates a comparison between the analytical solution given by Eq. (21) and the numerical solution of Eq. (9) computed using an explicit finite-difference method for \(\:{K}_{1}={\upmu\:}={P}_{1}=0.\) The profiles are plotted at three different times, \(\:\text{t}=0.02,\:\:0.04,\:0.06.\) The results show excellent agreement between the two approaches across all tested times: the numerical results closely follow the analytical solutions, with only negligible differences attributable to discretization errors. This consistency may confirm the accuracy and stability of the explicit finite-difference scheme. Based on this validation, all subsequent results presented in Figs. 7, 8 and 9 are obtained using the explicit finite-difference method applied to the coupled advection–dispersion Eqs. (9) and (10).

Figure 8 presents the temporal evolution of pollutant concentration \(\:P\left(x,t\right)\) and dissolved oxygen concentration \(\:X\left(x,t\right)\) along the river at three different times, \(\:\text{t}=0.02,\:0.06,\:0.1\). Near the upstream boundary \(\:x=0\), the dissolved oxygen concentration is high due to the injection of clean, oxygen-rich water. Moving downstream, the dissolved oxygen level gradually decreases as oxygen is consumed during pollutant degradation and transported by advection and dispersion. As time progresses, the influence of the injected clean water extends farther along the river, leading to a noticeable improvement in dissolved oxygen levels and a smoother spatial decay, which reflects the gradual recovery of the river system. In contrast, the pollutant concentration \(\:P\left(x,t\right)\:\)remains low near the upstream region because the injected water is free of pollutants. The concentration then increases to a maximum at a certain downstream distance as previously existing pollutants are transported and dispersed along the river. Farther downstream, the pollutant concentration decreases due to dilution and degradation processes. With increasing time, an overall reduction in pollutant concentration is observed throughout the river reach, indicating the effectiveness of the remediation strategy. Overall, the figure highlights the strong interaction between pollutant reduction and dissolved oxygen recovery, demonstrating a progressive improvement in water quality and the establishment of more favorable conditions for aquatic life.

Figure 9 illustrates the influence of the parameter \(\:\mu\:\) on the spatial distributions of the pollutant concentration \(\:P\left(x,t\right)\) and the dissolved oxygen concentration \(\:X\left(x,t\right)\) along the river for \(\:\mu\:=0,\:\:10,\:20\). It is observed from Fig. 8 that the dissolved oxygen curves are almost overlapping as the value of \(\:\mu\:\) increases, indicating that the effect of \(\:\mu\:\) on the spatial distribution of dissolved oxygen is relatively weak. This behavior can be attributed to the fact that the injected water at the upstream boundary contains a dissolved oxygen concentration close to saturation, which provides sufficient oxygen to compensate for the amount consumed during the water purification process, even with increasing \(\:\mu\:\). In addition, natural reaeration contributes to maintaining nearly uniform dissolved oxygen levels along the river, thereby reducing its sensitivity to variations in \(\:\mu\:\). In contrast, the pollutant concentration exhibits a clear response to increasing \(\:\:\mu\:\), with higher values observed along the river as \(\:\mu\:\) increases, particularly in the downstream region. This behavior reflects the direct role of \(\:\mu\:\) in enhancing the presence of pollutants within the river system, compared to its limited influence on dissolved oxygen. Overall, the figure confirms that the clean-water injection strategy ensures the stability of dissolved oxygen levels, while the pollutant concentration remains more sensitive to increases in \(\:\mu\:\).

Comparison between the analytical and numerical solutions P(x,t).

Figure 10 illustrates the influence of the parameter \(\:\alpha\:\) on the spatial distributions of the pollutant concentration \(\:P\left(x,t\right)\) and the dissolved oxygen concentration \(\:X\left(x,t\right)\) along the river for the values \(\:\alpha\:=1,\:\:3,\:\:5\). The figure illustrates the effect of the reaeration coefficient \(\:\alpha\:\) on the spatial distributions of the dissolved oxygen concentration \(\:X\left(x,t\right)\) and the pollutant concentration \(\:P\left(x,t\right)\:\)along the river. It is observed that increasing the value of \(\:\alpha\:\) leads to a noticeable increase in dissolved oxygen levels throughout the spatial domain, with higher \(\:X\left(x,t\right)\:\)curves corresponding to larger values of \(\:\alpha\:\), particularly in the downstream regions. This behavior reflects the effective role of reaeration in replenishing the oxygen consumed during the water purification process and in maintaining higher and more stable dissolved oxygen levels within the river. In contrast, the pollutant concentration \(\:P\left(x,t\right)\:\)shows very similar profiles for different values of\(\:\:\alpha\:\), indicating that the influence of reaeration on pollutant transport is relatively weak. This is because reaeration primarily affects the oxygen balance, while the pollutant dynamics are mainly governed by transport and degradation processes. Overall, the figure demonstrates that increasing \(\:\alpha\:\) directly improves the environmental conditions by enhancing dissolved oxygen availability, while having a negligible effect on the spatial distribution of pollutants.

The variation of P(x,T) and X(x,t) with t.

The variation of P(x,T) and X(x,t) with μ.

The variation of \(\:P\left(x,T\right)\:and\:X(x,t)\) with \(\:\alpha\:\).

Water quality prediction and monitoring are essential for mitigating fish mortality and ensuring that commercial aquaculture operations remain both environmentally sustainable and economically viable. This study has successfully addressed these challenges by developing a novel physics-guided hybrid RF-MLP (residual) algorithm, trained on a high-fidelity dataset of 50,000 observations derived from the coupled pollution-aeration transport equations. The core conclusions of this research are summarized as follows:

Methodological & Sustainability Contributions:

We established a reproducible framework integrating physical sciences with machine learning to identify optimal fish-survival zones (dissolved oxygen levels above 30% of saturation) along a polluted river reach.

By enabling decision-makers to determine the minimum allowable water velocity and upstream dissolved oxygen levels required to maintain safe conditions, this tool directly supports global sustainability initiatives, including United Nations Sustainable Development Goals for Clean Water and Sanitation (SDG 6) and Good Health and Well-Being (SDG 3).

Computational & Statistical Performance:

The proposed hybrid RF-MLP architecture proved to be the superior predictive model, achieving an exceptionally high accuracy with an RMSE of 0.00012966 and an \(\:{R}^{2}\) of 0.9999999692.

The sequential residual learning logic successfully combined the stable feature-partitioning of Random Forests (RF) with the continuous, non-linear error-correction of a Multilayer Perceptron (MLP), completely restoring the smooth physical gradients and eliminating the “step-like” artifacts typical of standalone tree-based models.

From an operational standpoint, the hybrid model offered an optimized performance-to-latency ratio, converging in just 22.58 s—representing a substantial 24.45 fold reduction in training duration compared to deep BiLSTM architectures (552.21 s) while maintaining superior accuracy over recurrent models (such as GRU and LSTM) and linear baselines.

Physical & Hydrological Insights:

The physical-based analysis demonstrated that injecting clean, oxygen-rich water at the upstream boundary (x = 0) effectively dilutes pollutant concentrations near the inlet and progressively improves downstream dissolved oxygen levels over time.

Controlling the inlet flow velocity u is a critical lever for remediation; higher velocities enhance longitudinal mixing, accelerate downstream transport, and successfully delay the onset of oxygen depletion along the river channel.

Parametric evaluations revealed that while pollutant concentration remains highly sensitive to variations in the discharge source parameter \(\:\mu\:\), the reaeration rate coefficient \(\:\alpha\:\) exerts a dominant control on maintaining favorable downstream dissolved oxygen levels, whereas its influence on pollutant transport remains negligible.

Close agreement between the analytical solutions and the explicit finite-difference numerical scheme verified the mathematical stability and physical consistency of the underlying dataset used to guide the machine learning models.

The following recommendations for water resource managers and aquaculture planners are presented in order to empower future researchers to replicate the technique in diverse contexts and expand on the findings offered in this study:

Optimize Water Resources via Threshold-Based Flushing: Environmental managers should utilize the predictive model to identify and apply the minimum allowable upstream flushing velocity (u). Our spatial simulations indicate that while higher velocities accelerate pollutant dilution and delay downstream oxygen depletion, discharging excess clean water yields diminishing ecological returns. Identifying this optimal threshold ensures responsible water use while maintaining dissolved oxygen (DO) levels safely above the critical 30% ecological threshold required for fish survival.

Implement Dynamic Upstream Flow Control: To protect sensitive downstream fish farming zones, managers should dynamically adjust the upstream inlet velocity (u) in response to seasonal agricultural or industrial runoff cycles. Higher injection velocities successfully shift the “oxygen sag” (depletion) zone further downstream, effectively moving localized pollution away from critical aquaculture enclosures.

Prioritize Physical Reaeration Infrastructure: For long-term habitat restoration, physical remediation efforts (such as deploying cascade aerators, step structures, or modifying riverbed morphology to promote localized turbulence) must be prioritized. Our parametric evaluations reveal that the atmospheric reaeration rate coefficient (\(\:\alpha\:\)) exerts dominant, sensitive control over downstream oxygen recovery, while its overall impact on the transport of the pollutants themselves is negligible. Enhancing natural reaeration is therefore the most direct and sustainable way to maintain safe DO levels downstream.

Impose Strict Regulations on Point Sources: Because downstream pollutant concentration is highly sensitive to changes in the discharge source intensity parameter (\(\:\mu\:\)), regulatory bodies must strictly control active municipal and industrial discharge rates upstream. While clean-water flushing and natural reaeration are effective at preserving dissolved oxygen levels, they cannot fully neutralize massive spikes in upstream pollutant inputs.

Deploy Hybrid Physics-Guided AI for IoT Telemetry: For real-time water quality monitoring and early-warning alert networks, we strongly recommend adopting hybrid architectures like our sequential RF-MLP residual model over standalone deep learning models. Standalone recurrent networks like BiLSTM are computationally heavy, requiring extensive training times (552.21 s). In contrast, our hybrid model achieves superior predictive accuracy with a substantial 24.45-fold reduction in training duration (converging in just 22.58 s), making it exceptionally well-suited for edge computing, low-power telemetry sensors, and rapid online retraining.

Future research will expand the dimensional and multivariable scope:

Multi-Dimensional Modeling: Extending the current 1D model to 2D and 3D scenarios. This expansion is essential to account for transverse and vertical mixing processes, which are critical in wider or deeper river reaches where pollutant concentrations (\(\:P\:\)) and dissolved oxygen levels (\(\:X\:\)) are not uniform across the cross-section.

Multivariate Indicators: Moving beyond the coupled forecasting of \(\:P\:\) and \(\:X\:\) to simultaneously predict additional intercorrelated parameters such as pH, temperature, and nutrient concentrations (Nitrogen and Phosphorus). This holistic approach better reflects the complex biochemical interactions within aquatic ecosystems.

Algorithmic Optimization: Exploring the use of Transformer-based architectures or Physics-Informed Neural Networks (PINNs). These advanced structures could further reduce the 16.76 s training overhead observed between the standalone Random Forest model (5.83 s) and the Hybrid model (22.58 s), while ensuring that the predictions remain strictly bounded by the physical laws of mass conservation.

Empirical validation and field-data benchmarking: Future research will focus on the empirical validation of the RF-MLP framework through benchmarking against real-world hydrological datasets. A rigorous comparative analysis will be conducted to evaluate the model’s predictive performance on field-observed pollutant concentrations versus the results derived from deterministic ADE-generated synthetic data. This transition is critical for assessing the framework’s generalizability and its resilience to the stochastic complexities—such as sensor noise and environmental turbulence—that characterize natural river systems but are typically absent in idealized synthetic environments.

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

Giri, S. Water quality prospective in twenty first century: status of water quality in major river basins, contemporary strategies and impediments: a review. Environ. Pollut. 271, 116332 (2021).

Article CAS PubMed Google Scholar

Jan, I. et al. Threats and consequences of untreated wastewater on freshwater environments. Editor(s): Gowhar Hamid Dar, Rouf Ahmad Bhat, Humaira Qadri, Khalid Rehman Hakeem. In Advances in Environmental Pollution Research, Microbial consortium and biotransformation for pollution decontamination, Elsevier, pp 1–26, ISBN 9780323918930. https://doi.org/10.1016/B978-0-323-91893-0.00009-2 (2022).

Rakesh, R. et al. MortCam: An Artificial Intelligence-aided fish mortality detection and alert system for recirculating aquaculture. Aquacult. Eng. 102 (102341). https://doi.org/10.1016/j.aquaeng.2023.102341 (2023).

Martos-Sitcha, J. A., Mancera, J. M., Prunet, P. & Magnoni, L. J. Editorial: Welfare and stressors in fish: Challenges facing aquaculture. Front. Physiol. 11, 162 (2020).

Article PubMed PubMed Central Google Scholar

Huang, J., Liu, S., Hassan, S. G., Xu, L. & Huang, C. A hybrid model for short-term dissolved oxygen content prediction. Comput. Electron. Agric. 186, 106216 (2021).

Yahya, A. et al. Water quality prediction model based support vector machine model for ungauged river catchment under dual scenarios. Water 11 (6), 1231 (2019).

Rahman, A., Dabrowski, J. & McCulloch, J. Dissolved oxygen prediction in prawn ponds from a group of one step predictors. Inf. Process. Agric. 2020 (7), 307–317 (2020).

Li, D., Sun, Y., Sun, J., Wang, X. & Zhang, X. An advanced approach for the precise prediction of water quality using a discrete hidden markov model. J. Hydrol. 2022 (609), 127659 (2022).

Parmezan, A. R. S., Souza, V. M. A. & Batista, G. E. A. P. A. Evaluation of statistical and machine learning models for time series prediction: Identifying the state-of-the-art and the best conditions for the use of each model. Inf. Sci. 484, 302–337 (2019).

Avila, R., Horn, B., Moriarty, E., Hodson, R. & Moltchanova, E. Evaluating statistical model performance in water quality prediction. J. Environ. Manag. 206, 910–919 (2018).

Rajaee, T., Khani, S. & Ravansalar, M. Artificial intelligence-based single and hybrid models for prediction of water quality in rivers: A review. Chemom Intell. Lab. Syst. 2020 (200), 103978 (2020).

Rui Wang, R. & Yu. R Physics-Guided Deep Learning for Dynamical Systems: A Survey. ACM Comput. Surv. 58, 31. https://doi.org/10.1145/3766887 (2025). 5, Article 136 (December 2025.

Liu, P., Wang, J., Sangaiah, A. K., Xie, Y. & Yin, X. Analysis and prediction of water quality using LSTM deep neural networks in IoT environment. Sustainability 11 (7), 2058 (2019).

Article ADS CAS Google Scholar

Aung, T., Razak, R. A. & Nor, A. R. B. M. Artificial intelligence methods used in various aquaculture applications: A systematic literature review. J. World Aquac Soc. 56 (1), 13107. https://doi.org/10.1111/jwas.13107 (2025).

Helaly, A. et al. Advancements in water quality prediction: a practical review of machine learning and deep learning approaches. Cluster Comput. 28, 598. https://doi.org/10.1007/s10586-025-05221-3 (2025).

Munoz-Alegria, J. A. et al. Bibliometric-systematic literature review (B-SLR) of machine learning-based water quality prediction: Trends, gaps, and future directions. Water 17, 2994. https://doi.org/10.3390/w17202994 (2025).

Zhu, Q., Yu, X., Zhao, L. & Xu, L. A Hybrid Deep Learning Model for Water Quality Prediction. IEEE Access. 13, 106406–106415 (2025).

Tripathy, K. P., Ashok, K. & Mishra Deep learning in hydrology and water resources disciplines: concepts, methods, applications, and research directions. J. Hydrol. 628 https://doi.org/10.1016/j.jhydrol.2023.130458 (2024).

Shedrawi, G. et al. Leveraging deep learning and computer vision technologies to enhance management of coastal fisheries in the pacific region. Sci. Rep. 14 (1), 20915. https://doi.org/10.1038/s41598-024-71763-y (2024).

Article ADS CAS PubMed PubMed Central Google Scholar

Sanja, C. et al. Application of machine learning in river water quality management: a review. Water Sci. Technol. 88, 9: 2297–2308. https://doi.org/10.2166/wst.2023.331 (2023).

Landone, E. Machine learning applications in river research: Trends, opportunities and challenges. Methods Ecol. Evol. 13 (11), 2603–2621. https://doi.org/10.1111/2041-210x.13992 (2022).

Gohri, C. et al. Artificial intelligence in sustainable development research. Nat. Sustain. 8, 970–978. https://doi.org/10.1038/s41893-025-01598-6 (2025).

Meng, C. et al. When physics meets machine learning: a survey of physics-informed machine learning. Mach. Learn. Comput. Sci. Eng. 20 https://doi.org/10.1007/s44379-025-00016-0 (2025).

Renny, S. D. et al. A Systematic Review of Physical Artificial Intelligence (Physical AI): Concepts, Applications, Challenges, and Future Directions. J. Artif. Intell. Eng. Appl. (JAIEA). 4 (3), 2246–2253. https://doi.org/10.59934/jaiea.v4i3.1101 (2025).

Jiao, L. et al. AI meets physics: a comprehensive survey. Artif. Intell. Rev. 57, 256. https://doi.org/10.1007/s10462-024-10874-4 (2024).

Azra, S., Mahdi, B. & SeyedEhsan Ndaaee, O. Machine Learning and Physics: A Survey of Integrated Models. ACM Comput. Surv. 56, 5. https://doi.org/10.1145/3611383 (2023).

Li, Y., Li, Z., Duan, Y. & Spulber, A. B. Physical artificial intelligence (PAI): the next-generation artificial intelligence. Front. Inform. Technol. Electron. Eng. 24 (8), 1231–1238. https://doi.org/10.1631/FITEE.2200675 (2023).

Muther, T. et al. Physical laws meet machine intelligence: current developments and future directions. Artif. Intell. Rev. 56, 6947–7013. https://doi.org/10.1007/s10462-022-10329-8 (2023).

Abd El-Sattar, H. K. et al. A new hybrid physics-guided AI algorithm for water quality prediction supported by variable coefficients advection-diffusion model. Discov. Artif. Intell. 6, 380. https://doi.org/10.1007/s44163-026-01248-6 (2026).

Pimpunchat, B. et al. Modelling river pollution and removal by aeration. In: (eds Oxley, L. & Kulasiri, D.) MODSIM 2007 international congress on modelling and simulation. land, water and environmental management: integrated systems for sustainability. Modelling and Simulation Society of Australia and New Zealand, 2431–2437 (2007).

Simon, T. & Koya, P. R. Modeling and numerical simulation of river pollution using diffusion-reaction equation. Am. J. Appl. Math. 3 (6), 335–340 (2015).

Wadi, A. S., Dimian, M. F. & Ibrahim, F. N. Analytical solutions for one-dimensional advection–dispersion equation of the pollutant concentration. J. Earth Syst. Sci. 123 (6), 1317–1324 (2014).

Yadav, R. R. & Kumar, L. K. Analytical solution of two-dimensional conservative solute transport in a heterogeneous porous medium for varying input point source. Environ. Earth Sci. 80, 327. https://doi.org/10.1007/s12665-021-09584-9 (2021).

Saleh, A., Ibrahim, F. N. & Hadhouda, M. K. Remediation of pollution in a river by releasing clean water. Inform. Sci. Lett. 11, 127–133 (2022). https://digitalcommons.aaru.edu.jo/isl/vol11/iss1/18

Manitcharoen, N. & Pimpunchatt, B. Analytical and Numerical Solutions of Pollution Concentration with Uniformly and Exponentially Increasing Forms of Sources. J. Appl. Math. https://doi.org/10.1155/2020/9504835 (2020).

Article MathSciNet Google Scholar

Agusto, F. B. & Bamingbola, O. M. Numerical Treatment of the Mathematical Models for Water Pollution. Res. J. Appl. Sci. 2 (5), 548–556. https://doi.org/10.3844/jmssp.2007.172.180 (2007).

Changjun, Z. & Shuwen, L. Numerical Simulation of River Water Pollution Using Grey Differential Model. J. Comput. Sci. 5 (9), 1417–1423. https://doi.org/10.4304/jcp.5.9.1417-1423 (2010).

Kusuma, J., Ribal, A. & Mahie, A. G. On FTCS Approach for Box Model of Three-Dimension Advection-Diffusion Equation. Int. J. Differ. Equations. 1, 1–9 (2018).

Suryani, S., Kusuma, J., Ilyas, N., Bahri, M. & Azis, M. I. A boundary element method solution to spatially variable coefficients diffusion convection equation of anisotropic media. J. Phys. Conf. Ser. 1341, (2019).

Andallah, L. S. & Khatun, M. R. Numerical solution of advection-diffusion equation using finite difference schemes. Bangladesh J. Sci. Ind. Res. 55 (1), 15–22. https://doi.org/10.3329/bjsir.v55i1.46728 (2020).

Rafat, P. B., Ibrahim, F. N. & Saleh, A. On determining conditions and suitable locations for fish survival by using the solution of the two coupled pollution and aeration equations. Sci. Rep. 13 (1). https://doi.org/10.1038/s41598-023-33368-9 (2023).

Saleh, A., Ibrahim, F. N. & Sharaf, M. A. M. Mathematical model for expanding suitable areas for fish survival in a polluted river by using the solution of the two coupled pollution and aeration equations. Model. Earth Syst. Environ. 10, 1803–1813. https://doi.org/10.1007/s40808-023-01868-2 (2024).

Ahmed, K. S. et al. Deep learning solver for solving advection–diffusion equation in comparison to finite difference methods. Commun. Nonlinear Sci. Numer. Simul. 115 https://doi.org/10.1016/j.cnsns.2022.106780 (2022).

Martin, M., Faisal, Q. & Hendrick, W. H. Neural networks trained to solve differential equations learn general representations. Adv. Neural Inf. Process. Syst. 31, 2018 (2018).

Ibrahim, F. N., Dimian, M. F. & Wadi, A. S. Remediation of pollution in a river by unsteady aeration with arbitrary initial and boundary conditions. J. Hydrol. 525, 793–797. https://doi.org/10.1016/j.jhydrol.2015.03.037 (2015).

Hadhouda, MKh & Hassan, Z. S. Mathematical model for unsteady remediation of river pollution by aeration. Inf. Sci. Lett. 11, 323–329. https://doi.org/10.18576/isl/110203 (2022).

Hallam, T. G., Lassiter, R. R. & Henson, S. M. Modeling fish population dynamics. Nonlinear Anal. 40 (00), 227–250. https://doi.org/10.1016/S0362-546X(00)85013-0 (2000).

Hallam, T. G. & Lika, K. Modeling the effects of toxicants on a fish population in a spatially heterogeneous environment: I. Behavior of the unstressed, spatial model. Nonlinear Anal. Theory Methods Appl. 30, 1699–1707. https://doi.org/10.1016/S0362- (1997). 546X(97) 00050 – 3.

Shukla, J. B., Dubey, B. & Freedman, H. I. Effect of changing habitat on survival of species. Ecol. Model. 87 (95), 205–216. https://doi.org/10.1016/0304- (1996).

Shukla, J. B., Misra, A. K. & Chandra, P. Mathematical modeling of the survival of a biological species in polluted water bodies. Differ. Equ Dyn. Syst. 15, 209–230 (2007).

Roy, S. M., Tanveer, M. & Machavaram, R. Applications of gravity aeration system in aquaculture—A systematic review. Aquacult. Int. 30, 1593–1621. https://doi.org/10.1007/s10499-022-00851-5 (2022a).

Roy, S. M. et al. Diversified aeration facilities for effective aquaculture systems—A comprehensive review. Aquacult. Int. 29, 1181–1217. https://doi.org/10.1007/s10499-021-00685-7 (2021).

Roy, S. M. et al. Optimizing the aeration performance of a perforated pooled circular stepped cascade aerator using hybrid ANNPSO technique. Inf. Process. Agric. 9, 533–546. https://doi.org/10.1016/j.inpa.2021.09.002 (2022).

Skouteris, G. et al. The use of pure oxygen for aeration in aerobic wastewater treatment: A review of its potential and limitations. Bioresour Technol. https://doi.org/10.1016/j.biortech.2020.123595 (2020).

Bhatt, D., Swain, M. & Yadav, D. Artificial intelligence based detection and control strategies for river water pollution: A comprehensive review. J. Contam. Hydrol. 271, 104541. https://doi.org/10.1016/j.jconhyd.2025.104541 (2025).

Haq., K. P. R. A. & Harigovindan, V. P. Water quality prediction for smart aquaculture using hybrid deep learning models. IEEE Access 10, 60078–60098 (2022).

Huang, S. et al. Coupling machine learning into hydrodynamic models to improve river modeling with complex boundary conditions. Water Resour. Res. 58, 10. https://doi.org/10.1029/2022WR032183 (2022).

Sadra, S. et al. Enhanced predictive modelling of dissolved oxygen concentrations in riverine systems using novel hybrid temporal pattern attention deep neural networks. Environ. Res. 263, Part 1. https://doi.org/10.1016/j.envres.2024.120015 (2024).

Shadkani, S. et al. Random Forest and multilayer perceptron hybrid models integrated with the genetic algorithm for predicting pan evaporation of target site using a limited set of neighboring reference station data. Earth Sci. Inf. 17 1261–1280. https://doi.org/10.1007/s12145-024-01237-2 (2024).

Zhengheng Pu, Jieru, Y. et al. A hybrid Wavelet-CNN-LSTM deep learning model for short term urban water demand forecasting. Front. Environ. Sci. Eng. 2023, 17 (2). https://doi.org/10.1007/s11783-023-1622-3 (2023).

Lu, H. & Ma, X. Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere 249, 126169. https://doi.org/10.1016/j.chemosphere.2020.126169 (2020).

Lokman, A., Ismail, W. Z. W. & Aziz, N. A. A. A review of water quality forecasting and classification using machine learning models and statistical analysis. Water 17, 2243. https://doi.org/10.3390/w17152243 (2025).

Ahmed, U. et al. Efficient water quality prediction using supervised machine learning. Water 11(11), 2210. https://doi.org/10.3390/w11112210 (2019).

Baek, S. S., Pyo, J. & Chun, J. A. Prediction of water level and water quality using a CNN-LSTM combined deep learning approach. Water 12(12), 3399. (2020).

Du, S., Li, T., Yang, Y. & Horng, S. J. Deep air quality forecasting using hybrid deep learning framework. IEEE Trans. Knowl. Data Eng. 33(6), 2412-2424. https://doi.org/10.1109/TKDE.2019.2954510 (2021).

Moon, J., Kim, Y., Son, M. & Hwang, E. Hybrid short-term load forecasting scheme using random forest and multilayer perceptron. Energies 11 (12), 3283 (2018).

Günther, F. & Fritsch, S. Neuralnet: Training of neural networks. R J. 2 (1), 30–38 (2010). DOI:10.32614/RJ- 2010-006.

Friedman, J. H. Stochastic gradient boosting. Comput. Stat. Data Anal. 38, 367–378 (2002).

Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9 (8), 1735–1780 (1997).

Schuster, M. & Paliwal, K. K. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45 (11), 2673–2681 (1997).

Mamat, N., Razali, M., Hamzah, F. B. & S. F. and Enhancement of water quality index prediction using support vector machine with sensitivity analysis. Front. Environ. Sci. 10, 1061835. https://doi.org/10.3389/fenvs.2022.1061835 (2023).

The authors are grateful to Ain Shams University (ASU) in Cairo, Egypt, for facilitating and sponsoring this study endeavor. We also like to express sincere appreciation to anonymous reviewers for their insightful comments.

Open access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooperation with The Egyptian Knowledge Bank (EKB). This work was supported by Ain Shams University (ASU), Cairo, Egypt.

Faculty of Science, Mathematics Department, Computer Science Division, Ain Shams University, Abbassia, 11566, Cairo, Egypt

Knowledge-Based Systems and Robotics Dept, Informatics Research Institute, City of Scientific Research and Technological Applications, New Borg El-Arab, Alexandria, Egypt

Department of Basic Science, Egyptian Academy for Engineering and Advanced Technology, Cairo, Egypt

Department of Artificial Intelligence, Faculty of Computers and Artificial Intelligence, Hurghada University, Hurghada, Egypt

Search author on:PubMed Google Scholar

H.K.: Conceptualization, Formal analysis, Investigation, Validation, Project administration, Methodology, Supervision and Writing the main manuscript A. S.: Statistical analysis, data collection, and Methodology M. S. & S. A.: Data curation, Formal analysis, Investigation and implementationAll authors designed and conducted the study.All authors reviewed the manuscript.

Correspondence to Hussein Karam Abd El-Sattar.

The authors declare no competing interests.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

El-Sattar, H.K.A., Elshambakey, M., Saleh, A. et al. Integrating physical modeling with artificial intelligence for predicting fish survival zones in polluted rivers to maintain a sustainable aquaculture industry. Sci Rep 16, 19009 (2026). https://doi.org/10.1038/s41598-026-57547-6

Version of record: 18 June 2026

DOI: https://doi.org/10.1038/s41598-026-57547-6

Integrating physical modeling with artificial intelligence for predicting fish survival zones in polluted rivers to maintain a sustainable aquaculture industry

Related Stories

US treasury chief urged Trump not to host ‘Mr Bean on crack’ Zelenskyy, book says

Iran says Strait of Hormuz will be closed over Israel attacks on Lebanon

FIFA World Cup: Germany takes on Ivory Coast at Toronto Stadium. Live updates here

Iran says it shut Strait of Hormuz over truce violations, but U.S. says it remains open

‘I’ve finally found God without all the extras’: behind the surge in people converting to Progressive Judaism

Youth struck by vehicle in North End collision

Italian tourist killed in Dominican Republic beach resort fire

Artist paints large Lionel Messi mural in Dallas