Predicting Bangladesh’s Harvest: A Price Forecasting Challenge

Author: Denis Avetisyan

A new dataset and comparative analysis reveals the difficulties of accurately forecasting agricultural commodity prices in a dynamic, developing market.

During a May-June 2025 forecast of sweet pumpkin retail prices, a BiLSTM model demonstrated superior performance with a Mean Absolute Error of 2.66, outperforming the Vanilla Transformer (MAE 4.17), while the Prophet model significantly overestimated prices, resulting in a substantial Mean Absolute Percentage Error of 74.56%.

This study benchmarks classical and deep learning time series models on a novel Bangladeshi market price dataset, finding limited benefit from learnable temporal embeddings and inherent challenges with step-function price dynamics.

Accurate short-term forecasting of agricultural commodity prices remains a persistent challenge, particularly in data-scarce environments like developing economies. This is addressed in ‘A Benchmark of Classical and Deep Learning Models for Agricultural Commodity Price Forecasting on A Novel Bangladeshi Market Price Dataset’, which introduces AgriPriceBD, a new daily retail price dataset for five key Bangladeshi commodities. Our analysis reveals fundamentally heterogeneous forecastability, with surprisingly strong performance from naïve persistence and a consistent failure of deep learning approaches-including Time2Vec enhanced Transformers-to outperform simpler models given the limited data. Can alternative architectures or data augmentation techniques unlock the potential of deep learning for agricultural price forecasting in contexts where historical data is constrained and price dynamics are inherently discrete?

Forecasting for Resilience: Understanding Agricultural Price Dynamics

The availability of sufficient, safe, and nutritious food for all – the very definition of global food security – is inextricably linked to the ability to anticipate future agricultural prices. Accurate price forecasting allows governments and aid organizations to proactively plan for potential shortfalls, ensuring resources are allocated efficiently to stabilize markets and prevent widespread hunger. Without reliable predictions, interventions become reactive rather than preventative, increasing the cost of assistance and diminishing its effectiveness. Furthermore, consistent price signals are crucial for farmers, enabling informed decisions about planting, harvesting, and storage, ultimately contributing to a more stable and resilient food system. The capacity to forecast these prices, therefore, is not merely an economic exercise, but a fundamental component of safeguarding global well-being and achieving sustainable food security for all populations.

Agricultural markets present a unique forecasting challenge due to their inherent complexity and dynamism. Unlike manufactured goods with relatively stable production chains, agricultural commodity prices are intensely sensitive to a confluence of unpredictable factors – weather patterns, geopolitical events, disease outbreaks, and even subtle shifts in consumer demand. Traditional statistical methods, often reliant on historical price data and linear relationships, frequently fail to capture these non-linear interactions and abrupt changes. Consequently, forecasts generated by these conventional approaches often exhibit significant errors, particularly when faced with unforeseen circumstances or ‘black swan’ events. This unreliability extends beyond mere economic inconvenience; inaccurate price predictions can disrupt supply chains, impede effective resource allocation by governments and aid organizations, and ultimately compromise the availability of affordable food for vulnerable populations.

Inaccurate agricultural price predictions create a ripple effect of hardship, particularly for vulnerable populations who allocate a significant portion of their income to food. These communities are least equipped to absorb sudden price increases or navigate market volatility stemming from unreliable forecasts. Research indicates that commodity price forecastability isn’t uniform; some commodities exhibit predictable patterns allowing for relatively accurate projections, while others remain stubbornly difficult to forecast. This heterogeneity complicates effective resource allocation, as interventions designed to stabilize prices or ensure access may prove ineffective for certain commodities, ultimately exacerbating food insecurity for those most in need and highlighting the critical importance of nuanced, commodity-specific forecasting approaches.

Despite achieving training convergence, the Informer model exhibits erratic predictions on the garlic test set and fails to learn coherent patterns across commodities-garlic shows 116% prediction variance, while chickpea and green chilli demonstrate much higher inflation at 4987% and 1108%, respectively-suggesting that ProbSparse attention requires a larger training dataset.

Constructing a Foundation: The AgriPriceBD Dataset

Agricultural price data is fundamentally structured as a time series, consisting of observations of price values recorded at successive points in time. This temporal ordering is critical, as it implies that consecutive data points are not independent; the price of a commodity in one period is highly likely to be correlated with its price in preceding periods. Consequently, standard statistical methods assuming independence may yield inaccurate results. Appropriate analytical techniques for time series data, such as autoregressive models, moving averages, or more complex state-space models, are necessary to account for this autocorrelation and effectively model price dynamics, forecast future prices, and identify underlying trends or seasonality. Ignoring the time-dependent nature of agricultural prices can lead to biased estimates and unreliable predictions.

A high-quality benchmark dataset is crucial for robust analysis of agricultural price trends and effective economic modeling. To address the limited availability of such data in Bangladesh, we introduce AgriPriceBD, a novel resource specifically constructed for this purpose. AgriPriceBD provides a standardized and accessible compilation of historical agricultural prices, enabling researchers and policymakers to conduct rigorous investigations into market dynamics, price volatility, and the impact of various factors on food security. The dataset’s quality is ensured through careful data cleaning and validation procedures, aiming to provide a reliable foundation for empirical research and informed decision-making within the Bangladeshi agricultural sector.

AgriPriceBD is a benchmark dataset constructed through an LLM-Assisted Pipeline designed to address data scarcity in agricultural price research. The pipeline systematically extracts data from PDF reports published by the Bangladesh government, a source previously difficult to access due to the unstructured format and manual extraction requirements. This automated process significantly reduces the time and effort needed to compile historical price data for key agricultural commodities, resulting in a resource containing granular price information at the district level. The resulting dataset provides researchers with a standardized and readily available source for analysis of agricultural market trends and price dynamics in Bangladesh.

Seasonal decomposition of garlic prices (BDT/kg) reveals a U-shaped trend from 2020 to late 2024, driven by import-supply disruptions, alongside a strong harvest cycle (<span class="katex-eq" data-katex-display="false">R/S</span> ratio of 0.93) which, despite its exploitable structure, offers limited gains due to training-scale constraints. — Seasonal decomposition of garlic prices (BDT/kg) reveals a U-shaped trend from 2020 to late 2024, driven by import-supply disruptions, alongside a strong harvest cycle ( $R/S$ ratio of 0.93) which, despite its exploitable structure, offers limited gains due to training-scale constraints.

Refining Prediction: Advanced Techniques for Price Forecasting

The forecasting methodologies examined encompass both established statistical techniques and contemporary machine learning models. Statistical approaches include Seasonal Autoregressive Integrated Moving Average (SARIMA) models, utilized for their ability to capture linear dependencies and seasonality within time series data. Machine learning methods investigated are Bi-directional Long Short-Term Memory (BiLSTM) networks, effective at modeling sequential data through recurrent neural networks, and Prophet, a decomposable time-series forecasting procedure particularly suited for business time-series with strong seasonal effects and trend changes. These models were selected to represent a diversity of approaches to time series prediction, ranging from traditional statistical modeling to more advanced deep learning architectures.

Prophet, a forecasting procedure designed for time series data with strong seasonality and trend components, can exhibit discrepancies when applied to agricultural commodities displaying Discrete Step-Function Dynamics. These dynamics are characterized by abrupt, discontinuous shifts in price levels, often resulting from policy changes, supply shocks, or significant alterations in demand. While Prophet effectively models continuous trends and seasonality, its underlying functional form may struggle to accurately capture and project these step changes, leading to forecast errors where the model attempts to smooth or interpolate across the observed price discontinuities. This limitation is particularly noticeable in markets where administrative controls or trade interventions create artificial price floors or ceilings, or when unexpected events drastically alter baseline supply conditions.

Informer is a deep learning model utilizing the Transformer architecture, specifically designed to address the limitations of recurrent neural networks (RNNs) in long time series forecasting. Traditional RNNs, including LSTMs and GRUs, often suffer from vanishing gradients and computational bottlenecks when processing lengthy sequences. The Transformer architecture, through its self-attention mechanism, allows Informer to weigh the importance of different time steps without being constrained by sequential processing, enabling it to capture long-range dependencies more effectively. Key to Informer’s efficiency is ProbSparse Self-Attention, which reduces the quadratic complexity of standard self-attention to linear complexity, and generative style decoder which drastically reduces the number of parameters. This allows for improved performance and scalability in modeling complex time series dependencies present in agricultural price data, potentially surpassing the accuracy of SARIMA, BiLSTM, and even Prophet in certain forecasting scenarios.

SARIMA outperformed all models in forecasting cucumber prices (May-June 2025) with a Mean Absolute Error of 8.97, while the T2V-Transformer showed a negligible improvement in RMSE over the Vanilla Transformer (<span class="katex-eq" data-katex-display="false">p=0.962</span>), and Prophet consistently underpredicted prices due to its smoothing assumptions. — SARIMA outperformed all models in forecasting cucumber prices (May-June 2025) with a Mean Absolute Error of 8.97, while the T2V-Transformer showed a negligible improvement in RMSE over the Vanilla Transformer ( $p=0.962$ ), and Prophet consistently underpredicted prices due to its smoothing assumptions.

Validating Accuracy: Rigorous Evaluation of Forecasting Performance

Prior to model training and evaluation, the Augmented Dickey-Fuller (ADF) test was implemented to verify the stationarity of the time series data representing agricultural prices. Stationarity is a crucial assumption for many time series forecasting models; non-stationary data exhibits trends or seasonality that can lead to spurious regressions and inaccurate predictions. The ADF test assesses this by examining whether a unit root is present in the time series. The null hypothesis of the ADF test is that a unit root exists, implying non-stationarity. The p-value generated by the test determines whether to reject this null hypothesis; a p-value less than a predetermined significance level (typically 0.05) indicates sufficient evidence to conclude that the time series is stationary. If the data was determined to be non-stationary, differencing was applied until stationarity was achieved, ensuring the validity of subsequent analyses.

The Diebold-Mariano (DM) test is a statistical test used to formally assess whether the forecast accuracy of one model is significantly different from another. Unlike simple visual comparisons or paired t-tests which assume normally distributed errors, the DM test is designed for comparing the predictive power of two potentially non-nested models and does not require the assumption of normally distributed forecast errors. The test functions by comparing the Mean Squared Forecast Errors (MSFE) of the two models; a statistically significant difference in MSFE, typically determined using a p-value threshold (e.g., p < 0.05), indicates that one model provides demonstrably superior forecasts. Implementation involves calculating the DM statistic and its associated p-value, allowing for a robust, objective determination of forecast superiority beyond what is observable through descriptive metrics alone.

Rigorous evaluation of forecasting methods for Bangladeshi agricultural prices revealed performance variations quantified by Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). MAE values ranged from 2.66 to 18.85 BDT/kg across all commodities analyzed, while RMSE ranged from 2.63 to 13.42 BDT/kg. Statistical comparison using the Diebold-Mariano test indicated significant differences (p < 0.001) in forecasting accuracy between the T2V-Transformer and Vanilla Transformer models for four out of the five commodities examined, suggesting a demonstrable advantage for the T2V-Transformer in predicting price fluctuations.

Green chilli prices (<span class="katex-eq" data-katex-display="false">BDT/kg</span>) exhibit primarily exogenous, shock-driven dynamics-dominated by events like border closures and monsoon disruptions-rather than predictable seasonality, as indicated by substantial residual amplitudes and a limited forecastability proxy despite an <span class="katex-eq" data-katex-display="false">R/S</span> ratio of 0.74. — Green chilli prices ( $BDT/kg$ ) exhibit primarily exogenous, shock-driven dynamics-dominated by events like border closures and monsoon disruptions-rather than predictable seasonality, as indicated by substantial residual amplitudes and a limited forecastability proxy despite an $R/S$ ratio of 0.74.

Looking Ahead: Implications for Food Security and Future Research

The capacity to accurately predict agricultural prices, increasingly enabled by sophisticated methodologies and comprehensive datasets such as AgriPriceBD, represents a crucial step towards bolstering global food security. Reliable forecasts empower stakeholders – from farmers and traders to policymakers and aid organizations – to make informed decisions regarding planting, harvesting, storage, and distribution. This proactive approach minimizes waste, optimizes resource allocation, and allows for timely interventions to mitigate the impacts of price volatility, particularly in vulnerable regions. By anticipating potential shortages or surpluses, these predictive models facilitate effective risk management strategies and contribute to a more stable and equitable food system, ultimately safeguarding access to affordable and nutritious food for a growing population.

The capacity to anticipate agricultural price fluctuations empowers stakeholders to move beyond reactive strategies and embrace proactive solutions for food security. Accurate forecasts facilitate the strategic allocation of resources – from optimizing storage and distribution networks to preemptively addressing potential shortages – minimizing waste and ensuring equitable access to essential commodities. Governments and policymakers benefit from this foresight, enabling the implementation of targeted interventions, such as adjusting import/export policies or providing financial support to vulnerable populations, before crises escalate. Furthermore, improved price predictability allows farmers and businesses to manage risk more effectively, encouraging investment in sustainable agricultural practices and bolstering the overall resilience of food systems against external shocks like climate change or geopolitical instability.

Advancing the precision of agricultural price forecasting necessitates a broadened analytical scope, moving beyond historical price data alone. Future studies should prioritize the integration of dynamic external factors – specifically, detailed weather patterns and complex market dynamics – to capture a more holistic view of commodity price formation. Analysis of the Residual-to-Seasonal (R/S) ratio, which varied from 0.70 to 1.32 across different commodities studied, reveals the inherent degree of predictable, periodic behavior within each price series; commodities with R/S ratios closer to one exhibit stronger, more consistent seasonal patterns that can be leveraged for improved forecasting. By capitalizing on these exploitable structures and incorporating a wider range of influencing variables, predictive models can become more resilient to unforeseen shocks and offer increasingly accurate insights for effective food security planning.

Analysis of daily retail prices from July 2020 to June 2025 reveals a strong positive correlation between garlic and chickpeas (<span class="katex-eq" data-katex-display="false">r=0.61</span>), likely due to shared import-based pricing, while cucumber exhibits weak correlations with green chilli and sweet pumpkin (<span class="katex-eq" data-katex-display="false">r=0.09</span> and <span class="katex-eq" data-katex-display="false">r=0.11</span>, respectively) but a moderate correlation with chickpeas (<span class="katex-eq" data-katex-display="false">r=0.44</span>). — Analysis of daily retail prices from July 2020 to June 2025 reveals a strong positive correlation between garlic and chickpeas ( $r=0.61$ ), likely due to shared import-based pricing, while cucumber exhibits weak correlations with green chilli and sweet pumpkin ( $r=0.09$ and $r=0.11$ , respectively) but a moderate correlation with chickpeas ( $r=0.44$ ).

The study reveals a certain austerity in the Bangladeshi agricultural market – discrete price steps resisting the smoothing tendencies of complex models. This echoes a sentiment articulated by Donald Knuth: “Premature optimization is the root of all evil.” The researchers did not force complexity onto a system where established methods, despite their limitations in capturing nuanced dynamics, remain viable. The pursuit of marginal gains through learnable temporal embeddings, like Time2Vec, proved unnecessary – an acknowledgement that simplicity, even with imperfect forecasting, holds inherent value. The core finding – standard techniques struggle with step-function market behaviors – suggests a focus on understanding fundamental market mechanics, rather than chasing algorithmic sophistication.

Further Refinements

The persistence of baseline performance despite the introduction of learnable temporal embeddings warrants scrutiny. The expectation that models could ‘learn’ the inherent rhythm of these markets proved optimistic; the data, it seems, resists such elegant capture. This is not necessarily a failure of the method, but rather an indictment of the assumption that increased model complexity universally yields increased predictive power. Unnecessary is violence against attention; the focus must shift from architectural innovation to a more fundamental understanding of the underlying dynamics.

The observed step-function behavior in commodity price series presents a particular challenge. Standard time-series forecasting techniques, predicated on smooth, continuous change, are ill-equipped to model such discrete transitions. Future work should explore methods explicitly designed for discontinuous data, potentially drawing from the fields of regime switching models or event-driven simulation. Density of meaning is the new minimalism; simplification, not expansion, may be the path forward.

Ultimately, the value of this benchmark lies not in identifying a ‘best’ model, but in clarifying the limitations of current approaches. The Bangladeshi market, with its unique characteristics, serves as a crucial test case. Further investigation into the interplay of seasonal factors, external shocks, and local trading practices is essential. The pursuit of accuracy must be tempered by a realistic appraisal of predictability itself.

Original article: https://arxiv.org/pdf/2604.06227.pdf

Contact the author: https://www.linkedin.com/in/avetisyan/