<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">SAPARS</journal-id>
<journal-title>Scientiarum: A Multidisciplinary Journal</journal-title>
<abbrev-journal-title abbrev-type="pubmed">SAPARS</abbrev-journal-title>
<issn pub-type="epub">0000-0000</issn>
<publisher>
<publisher-name>BOHR</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.54646/SAPARS.2025.26</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Forecasting agricultural production using ARIMAX model</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Mekha</surname> <given-names>R.</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Vijayakumar</surname> <given-names>M.</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
</contrib>
</contrib-group>
<aff><institution>Department of Statistics, Annamalai University</institution>, <addr-line>Cuddalore</addr-line>, <country>India</country></aff>
<author-notes>
<corresp id="c001">&#x002A;Correspondence: R. Mekha, <email>mekhastat@gmail.com</email> M. Vijayakumar, <email>mvkumar1975@gmail.com</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>13</day>
<month>11</month>
<year>2025</year>
</pub-date>
<volume>1</volume>
<issue>6</issue>
<fpage>1</fpage>
<lpage>6</lpage>
<history>
<date date-type="received">
<day>04</day>
<month>08</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>01</day>
<month>11</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2025 Mekha and Vijayakumar.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Mekha and Vijayakumar</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/"><p>&#x00A9; The Author(s). 2024 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.</p></license>
</permissions>
<abstract>
<p>This study explores the dynamic relationship between rainfall and agricultural production in Tamil Nadu, with a specific focus on maize, a staple crop critical to the region&#x2019;s food security and economy. Employing time series analysis, the research utilizes both the Autoregressive Integrated Moving Average (ARIMA) and ARIMA with exogenous variables (ARIMAX) models to understand and forecast maize production trends. Initially, an ARIMA (p,d,q) model is developed based on historical maize production data to identify and forecast future production trends. In parallel, an ARIMAX (p,d,q) model is constructed by integrating rainfall data as an exogenous variable, enabling the study to assess the impact of climatic factors particularly precipitation on maize productivity. This dual-model approach facilitates a comparative analysis of forecast accuracy. The study not only helps in understanding the underlying patterns in maize production but also emphasizes the significance of incorporating climatic variables into predictive modelling for more realistic forecasts. Such insights are useful for policymakers, agricultural planners, and farmers which helps them make decisions on crop planning, irrigation strategies, and risk management. Ultimately, this research contributes to the development of climate-resilient agricultural strategies in Tamil Nadu, where rainfall anomalies increasingly pose a challenge to stable food production.</p>
</abstract>
<kwd-group>
<kwd>maize production</kwd>
<kwd>ARIMA</kwd>
<kwd>ARIMAX</kwd>
<kwd>rainfall forecasting</kwd>
<kwd>time series analysis</kwd>
<kwd>agricultural forecasting</kwd>
<kwd>exogenous variables</kwd>
<kwd>Tamil Nadu agriculture</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="3"/>
<equation-count count="2"/>
<ref-count count="5"/>
<page-count count="6"/>
<word-count count="2313"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>Maize (<italic>Zea mays</italic>), widely known as corn, is one of the most significant cereal crops globally, serving as a cornerstone of food security, animal feed, and industrial raw material. Renowned for its adaptability, maize is cultivated in a broad range of climatic zones, from temperate to tropical regions, making it a globally favoured crop. It ranks just after rice and wheat in terms of importance and area under cultivation in India. The major maize-producing countries include the United States, China, India, Brazil, and Mexico, reflecting its global demand and versatility. In India, maize is grown across various agro-climatic regions and serves diverse purposes, such as human consumption, poultry and cattle feed, starch extraction, and ethanol production.</p>
<p>Tamil Nadu has witnessed remarkable growth in maize cultivation in recent years, owing to the crop&#x2019;s compatibility with both monsoon (kharif) and post-monsoon (rabi) seasons. Districts such as Perambalur, Namakkal, Salem, Dindigul, Theni, Coimbatore, Madurai, and Virudhunagar are key maize-growing regions in the state. Favorable soil conditions especially well-drained red and black soils and moderate rainfall ranging from 600 mm to 900 mm support maize cultivation. The crop is particularly sensitive to fluctuations in rainfall and irrigation availability, making it vulnerable to climate variability.</p>
<p>Maize farming in Tamil Nadu faces several challenges, including dependence on the southwest monsoon, limited irrigation coverage, pest infestations, soil nutrient depletion, and price volatility. Additionally, unsustainable farming practices may lead to long-term soil degradation and water stress. However, interventions such as the introduction of high-yielding drought-tolerant hybrids, improved agronomic techniques, government support schemes, and precision agriculture are gradually improving maize productivity and farmer resilience. Strengthening post-harvest infrastructure and ensuring stable market access can further enhance the economic viability of maize farming in Tamil Nadu.</p>
<sec id="S1.SS1">
<title>Data description</title>
<p>The dataset used in this paper includes maize production (in tonnes) and rainfall data of Tamil Nadu from 1965 to 2022 from the online portal of Department of Economics and Statistics. The maize production and rainfall data is recorded annually. The analysis is conducted using the R programming language, leveraging the auto.arima () function from the forecast package to select the optimal model parameters. By using the auto.arima () function the best Autoregressive Integrated Moving Average (ARIMA) model is automatically selected based on a given time series dataset.</p>
</sec>
</sec>
<sec id="S2">
<title>Methodology</title>
<sec id="S2.SS1">
<title>ARIMAX model</title>
<p>The Auto-Regressive Integrated Moving Average with Exogenous Variables (ARIMAX) model extends ARIMA by incorporating external variables, such as rainfall, to enhance prediction accuracy (<xref ref-type="bibr" rid="B1">1</xref>). This model accounts for additional predictors that influence the target variable, making it particularly useful in agricultural forecasting where climatic factors play a crucial role (<xref ref-type="bibr" rid="B2">2</xref>).</p>
<p>The ARIMAX model combines the following components</p>
<list list-type="simple">
<list-item>
<label>&#x2022;</label>
<p>Autoregressive Component (AR): Uses past values of the dependent variable to predict future values.</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>Integrated Component (I): Applies differencing to make the time series stationary.</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>Moving Average Component (MA): Accounts for past forecasting errors to refine predictions.</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>Exogenous Variables (X): Incorporates independent variables that can explain variations in the dependent variable.</p>
</list-item>
</list>
<p>It is generally expressed as:</p>
<disp-formula id="S2.Ex1"><mml:math id="M1">
<mml:mrow>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi mathvariant="normal">&#x03B1;</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B2;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mpadded>
</mml:mrow>
<mml:mo rspace="5.8pt">+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mpadded>
</mml:mrow>
<mml:mo rspace="5.8pt">+</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B2;</mml:mi>
<mml:mi>p</mml:mi>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S2.Ex2"><mml:math id="M3">
<mml:mrow>
<mml:mi mathvariant="normal">&#x2003;</mml:mi>
<mml:mo separator="true">&#x2003;&#x2005;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo rspace="5.8pt">+</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x03B3;</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mpadded>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">+</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">+</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B5;</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Where,</p>
<list list-type="simple">
<list-item>
<label>1.</label>
<p><italic>Y</italic><sub><italic>t</italic></sub> is the dependent variable.</p>
</list-item>
<list-item>
<label>2.</label>
<p>&#x03B1; is a constant.</p>
</list-item>
<list-item>
<label>3.</label>
<p><italic>Y</italic><sub><italic>t</italic>&#x2212;1</sub>, <italic>Y</italic><sub><italic>t</italic>&#x2212;2</sub>, &#x2026; <italic>Y</italic><sub><italic>t</italic>&#x2212;<italic>p</italic></sub> are the past values of the dependent variable.</p>
</list-item>
<list-item>
<label>4.</label>
<p>&#x03B3; is the coefficient of exogenous variable.</p>
</list-item>
<list-item>
<label>5.</label>
<p><italic>X</italic><sub><italic>t</italic></sub> are the exogenous variables. The trend in maize production in Tamil Nadu from 1965 to 2022 is presented in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
</list-item>
<list-item>
<label>6.</label>
<p>&#x03B5;<sub><italic>t</italic></sub> is the error term.</p>
</list-item>
</list>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Maize production from 1965 to 2022.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="sapars-2025-26-g001.tif"/>
</fig>
<p>The ARIMAX model is a powerful tool for time series forecasting when external factors play a significant role in influencing the behavior of the dependent variable. It combines the strengths of both time series analysis and regression models, allowing for more informed and accurate predictions that account for both historical patterns and external influences. This makes it particularly useful in applications such as economic forecasting, sales prediction, climate modeling and demand estimation.</p>
</sec>
<sec id="S2.SS2">
<title>ARIMA model</title>
<p>Time series forecasting plays an important role in various fields, including agriculture, finance, economics, and climate science. One of the most widely used statistical models for time series forecasting is the ARIMA model, which is effective in capturing temporal dependencies and patterns in historical data (<xref ref-type="bibr" rid="B3">3</xref>).</p>
<p>The ARIMA model is a combination of three components:</p>
<list list-type="simple">
<list-item>
<label>&#x2022;</label>
<p>Auto-Regressive (AR) Component: This captures the relationship between an observation and its previous values.</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>Integrated (I) Component: This represents the differencing required to make a non-stationary time series stationary.</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>Moving Average (MA) Component: This accounts for the dependency between an observation and past forecast errors.</p>
</list-item>
</list>
<p>Mathematically, an ARIMA (p, d, q) model is expressed as:</p>
<disp-formula id="S2.Ex3"><mml:math id="M5">
<mml:mrow>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x03B1;</mml:mi>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x2205;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x2205;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x2205;</mml:mi>
<mml:mi>p</mml:mi>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S2.Ex4"><mml:math id="M7">
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B8;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B5;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B8;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B5;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B8;</mml:mi>
<mml:mi>q</mml:mi>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B5;</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mi>q</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B5;</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Where,</p>
<list list-type="simple">
<list-item>
<label>1.</label>
<p><italic>Y<sub>t</sub></italic> is the dependent variable (e.g., maize production).</p>
</list-item>
<list-item>
<label>2.</label>
<p><italic>p</italic> is the number of AR terms.</p>
</list-item>
<list-item>
<label>3.</label>
<p><italic>d</italic> is the degree of differencing to make the series stationary.</p>
</list-item>
<list-item>
<label>4.</label>
<p><italic>q</italic> is the number of MA terms.</p>
</list-item>
<list-item>
<label>5.</label>
<p>&#x03B1; is a constant.</p>
</list-item>
<list-item>
<label>6.</label>
<p>&#x03B8;<sub>1</sub>, &#x03B8;<sub>2</sub>, &#x2026;, &#x03B8;<sub><italic>q</italic></sub> are the coefficients for the MA part of the model, which represent the relationship between the current value and past errors (lags).</p>
</list-item>
<list-item>
<label>7.</label>
<p>&#x03B5;<sub><italic>t</italic>&#x2212;1</sub>, &#x03B5;<sub><italic>t</italic>&#x2212;2</sub>, &#x03B5;<sub><italic>t</italic>&#x2212;<italic>q</italic></sub> are the past forecast errors (lags of the residuals).</p>
</list-item>
<list-item>
<label>8.</label>
<p>&#x03B5;<sub><italic>t</italic></sub> is the white noise (random error term).</p>
</list-item>
</list>
<p>In the agricultural domain, ARIMA models are extensively used for forecasting crop production, yield estimation, and analyzing the impact of climatic variables on agriculture. By leveraging historical production and rainfall data, ARIMA helps in predicting future trends and making data-driven decisions for sustainable agricultural planning (<xref ref-type="bibr" rid="B4">4</xref>).</p>
</sec>
</sec>
<sec id="S3">
<title>Results and discussion</title>
<p>Both ARIMAX and ARIMA models were applied to the maize production data. The statistical analysis is done using R software.</p>
<sec id="S3.SS1">
<title>Forecast using ARIMAX model</title>
<p>The ARIMAX model with ARIMA (0,1,1) residuals is identified as the best model, similar to findings reported in earlier agricultural forecasting studies (<xref ref-type="bibr" rid="B5">5</xref>). The ARIMAX model with rainfall as an exogenous variable shows that the moving average coefficient (ma1) is &#x2212;0.5335, the drift term is 50,281.88, and the rainfall coefficient (xreg) is 290.8014, indicating a positive relationship between rainfall and agricultural production.</p>
<p>The training set error measures, including a negative mean error (ME) of &#x2212;1,831.89 and a high Mean Absolute Percentage Error (MAPE) of 230.68%, indicate that the model tends to underestimate agricultural production and has significant prediction errors. However, the low autocorrelation of the residuals (ACF1 = &#x2212;0.017) suggests that the model effectively captured the temporal structure in the data. The model&#x2019;s Akaike Information Criterion (AIC) of 1,619.75 and Bayesian Information Criterion (BIC) of 1,627.92 suggest a reasonable fit, though there is room for improvement in predictive accuracy.</p>
<p>The maize production is forecasted for 20 years [2023&#x2013;2042] using ARIMAX model. The forecasted maize production is computed and presented in <xref ref-type="table" rid="T1">Table 1</xref> and <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Forecasted maize production using ARIMAX model for 20 years [2023&#x2013;2042].</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Year</td>
<td valign="top" align="center">Forecasted maize production (in 1,000 MT)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">2023</td>
<td valign="top" align="center">2,862,077</td>
</tr>
<tr>
<td valign="top" align="left">2024</td>
<td valign="top" align="center">2,912,359</td>
</tr>
<tr>
<td valign="top" align="left">2025</td>
<td valign="top" align="center">2,962,641</td>
</tr>
<tr>
<td valign="top" align="left">2026</td>
<td valign="top" align="center">3,012,923</td>
</tr>
<tr>
<td valign="top" align="left">2027</td>
<td valign="top" align="center">3,063,205</td>
</tr>
<tr>
<td valign="top" align="left">2028</td>
<td valign="top" align="center">3,113,487</td>
</tr>
<tr>
<td valign="top" align="left">2029</td>
<td valign="top" align="center">3,163,768</td>
</tr>
<tr>
<td valign="top" align="left">2030</td>
<td valign="top" align="center">3,214,050</td>
</tr>
<tr>
<td valign="top" align="left">2031</td>
<td valign="top" align="center">3,264,332</td>
</tr>
<tr>
<td valign="top" align="left">2032</td>
<td valign="top" align="center">3,314,614</td>
</tr>
<tr>
<td valign="top" align="left">2033</td>
<td valign="top" align="center">3,364,896</td>
</tr>
<tr>
<td valign="top" align="left">2034</td>
<td valign="top" align="center">3,415,178</td>
</tr>
<tr>
<td valign="top" align="left">2035</td>
<td valign="top" align="center">3,465,460</td>
</tr>
<tr>
<td valign="top" align="left">2036</td>
<td valign="top" align="center">3,515,742</td>
</tr>
<tr>
<td valign="top" align="left">2037</td>
<td valign="top" align="center">3,566,023</td>
</tr>
<tr>
<td valign="top" align="left">2038</td>
<td valign="top" align="center">3,616,305</td>
</tr>
<tr>
<td valign="top" align="left">2039</td>
<td valign="top" align="center">3,666,587</td>
</tr>
<tr>
<td valign="top" align="left">2040</td>
<td valign="top" align="center">3,716,869</td>
</tr>
<tr>
<td valign="top" align="left">2041</td>
<td valign="top" align="center">3,767,151</td>
</tr>
<tr>
<td valign="top" align="left">2042</td>
<td valign="top" align="center">3,817,433</td>
</tr>
</tbody>
</table></table-wrap>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>The forecasted production using ARIMAX model.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="sapars-2025-26-g002.tif"/>
</fig>
<p>The actual and ARIMAX predicted maize production is compared and presented in <xref ref-type="fig" rid="F3">Figure 3</xref>.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>The actual vs predicted maize production.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="sapars-2025-26-g003.tif"/>
</fig>
</sec>
<sec id="S3.SS2">
<title>Forecast using ARIMA model</title>
<p>The ARIMA (0,1,1) with drift is identified as the best model. The moving average coefficient of &#x2212;0.5409 and drift term of 51,426.08 are well-estimated. The model shows a slight negative bias with a ME of &#x2212;1,642.93. The low autocorrelation of the residuals (ACF1 = &#x2212;0.0283) suggests that the model has effectively captured the underlying structure of the data. The error metrics such as Root Mean Square Error (RMSE) (335,221) and Mean Absolute Error (MAE) (202,918) highlight areas for potential improvement, but overall, the model demonstrates a solid foundation for forecasting agricultural production. An AIC value of 1,619.47, indicates an efficient model selection.</p>
<p>The maize production is forecasted for 20 years [2023&#x2013;2042] using ARIMA model. The forecasted maize production is computed and presented in <xref ref-type="table" rid="T2">Table 2</xref> and <xref ref-type="fig" rid="F4">Figure 4</xref>.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Forecasted maize production using ARIMA model for 20 years [2023&#x2013;2042].</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Year</td>
<td valign="top" align="center">Forecasted maize production (in 1,000 MT)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">2023</td>
<td valign="top" align="center">2,929,800</td>
</tr>
<tr>
<td valign="top" align="left">2024</td>
<td valign="top" align="center">2,981,226</td>
</tr>
<tr>
<td valign="top" align="left">2025</td>
<td valign="top" align="center">3,032,652</td>
</tr>
<tr>
<td valign="top" align="left">2026</td>
<td valign="top" align="center">3,084,078</td>
</tr>
<tr>
<td valign="top" align="left">2027</td>
<td valign="top" align="center">3,135,504</td>
</tr>
<tr>
<td valign="top" align="left">2028</td>
<td valign="top" align="center">3,186,930</td>
</tr>
<tr>
<td valign="top" align="left">2029</td>
<td valign="top" align="center">3,238,356</td>
</tr>
<tr>
<td valign="top" align="left">2030</td>
<td valign="top" align="center">3,289,782</td>
</tr>
<tr>
<td valign="top" align="left">2031</td>
<td valign="top" align="center">3,341,208</td>
</tr>
<tr>
<td valign="top" align="left">2032</td>
<td valign="top" align="center">3,392,634</td>
</tr>
<tr>
<td valign="top" align="left">2033</td>
<td valign="top" align="center">3,444,061</td>
</tr>
<tr>
<td valign="top" align="left">2034</td>
<td valign="top" align="center">3,495,487</td>
</tr>
<tr>
<td valign="top" align="left">2035</td>
<td valign="top" align="center">3,546,913</td>
</tr>
<tr>
<td valign="top" align="left">2036</td>
<td valign="top" align="center">3,598,339</td>
</tr>
<tr>
<td valign="top" align="left">2037</td>
<td valign="top" align="center">3,649,765</td>
</tr>
<tr>
<td valign="top" align="left">2038</td>
<td valign="top" align="center">3,701,191</td>
</tr>
<tr>
<td valign="top" align="left">2039</td>
<td valign="top" align="center">3,752,617</td>
</tr>
<tr>
<td valign="top" align="left">2040</td>
<td valign="top" align="center">3,804,043</td>
</tr>
<tr>
<td valign="top" align="left">2041</td>
<td valign="top" align="center">3,855,469</td>
</tr>
<tr>
<td valign="top" align="left">2042</td>
<td valign="top" align="center">3,906,895</td>
</tr>
</tbody>
</table></table-wrap>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>The forecasted maize production using ARIMA model.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="sapars-2025-26-g004.tif"/>
</fig>
<p>The actual and the ARIMA predicted maize production is compared and presented in the <xref ref-type="fig" rid="F5">Figure 5</xref>.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>The actual vs ARIMA predicted maize production.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="sapars-2025-26-g005.tif"/>
</fig>
</sec>
<sec id="S3.SS3">
<title>Comparison between ARIMA and ARIMAX models</title>
<p>The error metrics of ARIMA and ARIMAX models are computed and given in <xref ref-type="table" rid="T3">Table 3</xref>.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>The error metrices of ARIMAX and ARIMA model.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Model</td>
<td valign="top" align="center">MAPE</td>
<td valign="top" align="center">MSE</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">ARIMAX</td>
<td valign="top" align="center">230.6836</td>
<td valign="top" align="center">109,061,149,061</td>
</tr>
<tr>
<td valign="top" align="left">ARIMA</td>
<td valign="top" align="center">241.0660</td>
<td valign="top" align="center">112,373,129,496</td>
</tr>
</tbody>
</table></table-wrap>
<p>The actual vs ARIMA and ARIMAX predicted maize production is compared and presented in <xref ref-type="fig" rid="F6">Figure 6</xref>.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>The actual vs ARIMA and ARIMAX predicted maize production.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="sapars-2025-26-g006.tif"/>
</fig>
<p>The ARIMAX model performs slightly better than the ARIMA model, as seen by the reduction in both MAPE and MSE. While the ARIMAX model improves on the ARIMA model by incorporating rainfall as an additional predictor.</p>
</sec>
</sec>
<sec id="S4" sec-type="conclusion">
<title>Conclusion</title>
<p>This study identifies the significance of using time series models for agricultural forecasting. The comparison between ARIMA and ARIMAX models explains about incorporating external variables like rainfall and shows how it improves the predictive accuracy. The automated model selection process using auto.arima () function provides a powerful and efficient approach to time series analysis, making it a valuable tool for researchers. This method simplified the analysis, allowing a focus on interpreting results and drawing meaningful conclusions. The results shows that the ARIMA model captures trends effectively but the ARIMAX model performs better by incorporating rainfall as an exogenous variable. This improvement is reflected in lower error values for ARIMAX compared to ARIMA, explaining the importance of exogenous factors in forecasting maize production.</p>
</sec>
</body>
<back>
<sec id="S5" sec-type="funding-information">
<title>Funding</title>
<p>The authors declare that no financial support was received for the research, authorship, and/or publication of this article.</p>
</sec>
<sec id="S6">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wangdi</surname> <given-names>K</given-names></name> <name><surname>Singhasivanon</surname> <given-names>P</given-names></name> <name><surname>Silawan</surname> <given-names>T</given-names></name> <name><surname>Lawpoolsri</surname> <given-names>S</given-names></name> <name><surname>White</surname> <given-names>NJ</given-names></name> <name><surname>Kaewkungwal</surname> <given-names>J</given-names></name></person-group>. <article-title>Development of temporal modelling for forecasting and prediction of malaria infections using time-series and ARIMAX analyses: a case study in endemic districts of Bhutan.</article-title> <source><italic>Malar J.</italic></source> (<year>2010</year>) <volume>9</volume>:<fpage>1</fpage>&#x2013;<lpage>9</lpage>.</citation></ref>
<ref id="B2"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yogarajah</surname> <given-names>B</given-names></name> <name><surname>Elankumaran</surname> <given-names>C</given-names></name> <name><surname>Vigneswaran</surname> <given-names>R</given-names></name></person-group>. <article-title>Application of ARIMAX model for forecasting paddy production in Trincomalee district in Sri Lanka.</article-title> <source><italic>Proceedings of the Third International Symposium</italic></source> (<year>2013</year>).</citation></ref>
<ref id="B3"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Box</surname> <given-names>GEP</given-names></name> <name><surname>Jenkins</surname> <given-names>GM</given-names></name> <name><surname>Reinsel</surname> <given-names>GC.</given-names></name></person-group> <source><italic>Time Series Analysis: Forecasting and Control.</italic></source> <publisher-loc>New York</publisher-loc>: <publisher-name>John Wiley &#x0026; Sons Inc</publisher-name> (<year>2008</year>).</citation></ref>
<ref id="B4"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gopinath</surname> <given-names>M</given-names></name> <name><surname>Kavithamani</surname> <given-names>M</given-names></name></person-group>. <article-title>Production of sugarcane forecasting using ARIMAX model.</article-title> <source><italic>Int J Innovat Technol Explor Eng.</italic></source> (<year>2019</year>) <volume>8</volume>(<issue>12s</issue>):<fpage>596</fpage>&#x2013;<lpage>600</lpage>.</citation></ref>
<ref id="B5"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stellamani</surname> <given-names>S</given-names></name> <name><surname>Vijayakumar</surname> <given-names>M</given-names></name> <name><surname>Philip Arokiadoss</surname> <given-names>A</given-names></name></person-group>. <article-title>Forecasting time series for paddy production.</article-title> <source><italic>Aryabhatta J Math Informat.</italic></source> (<year>2022</year>) <volume>14</volume>:<fpage>113</fpage>&#x2013;<lpage>20</lpage>.</citation></ref>
</ref-list>
</back>
</article>
