<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Archives Statistics - Open Forecasting</title>
	<atom:link href="https://openforecast.org/category/statistics/feed/" rel="self" type="application/rss+xml" />
	<link>https://openforecast.org/category/statistics/</link>
	<description>How to look into the future</description>
	<lastBuildDate>Wed, 07 Jan 2026 17:32:28 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://openforecast.org/wp-content/webpc-passthru.php?src=https://openforecast.org/wp-content/uploads/2015/08/cropped-usd-05-32x32.png&amp;nocache=1</url>
	<title>Archives Statistics - Open Forecasting</title>
	<link>https://openforecast.org/category/statistics/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Teaching Statistics and Descriptive Analytics in the world of AI</title>
		<link>https://openforecast.org/2026/01/07/teaching-statistics-and-descriptive-analytics-in-the-world-of-ai/</link>
					<comments>https://openforecast.org/2026/01/07/teaching-statistics-and-descriptive-analytics-in-the-world-of-ai/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Wed, 07 Jan 2026 17:32:28 +0000</pubDate>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[teaching]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=3946</guid>

					<description><![CDATA[<p>Teaching statistics as a flipped classroom with the help of AI? You heard that right! That’s exactly what I tried this year &#8211; and here are the results. Attached to this post is the student evaluation score for the module. Yes, the number of responses is quite low (only 50% of the cohort), but it [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2026/01/07/teaching-statistics-and-descriptive-analytics-in-the-world-of-ai/">Teaching Statistics and Descriptive Analytics in the world of AI</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Teaching statistics as a flipped classroom with the help of AI? You heard that right! That’s exactly what I tried this year &#8211; and here are the results.</p>
<p>Attached to this post is the student evaluation score for the module. Yes, the number of responses is quite low (only 50% of the cohort), but it should still give a sense of how students perceived Statistics and Descriptive Analytics. Of course, this reflects only their impression &#8211; coursework submissions are yet to come &#8211; but it’s still an encouraging sign that some things worked well.</p>
<p>I’ve taught this module since 2018, first with Dave Worthington and later with Alisa Yusupova. Normally, I focused on the second half, covering regression through lectures and workshops. But this year, I took on the full module and realised I didn’t want to teach probability theory and statistics in the traditional way &#8211; long monologues in lectures followed by awkward silence in workshops. That format, I believe, no longer works. After all, students can always ask their favourite LLM to explain concepts they don&#8217;t understand. And some don&#8217;t even do that &#8211; they just ask to solve problems without understanding them. So, what can be done in this brave new world?</p>
<p>I don’t yet have a definitive answer &#8211; only the results of an experiment.</p>
<p>Lectures. This year, I used Google Notebook ML to prepare lecture materials. I provided my existing notes, slides, and relevant texts, then asked it to produce podcasts on specific topics. This took more time than expected, as I had to review the generated content, adjust prompts, and refine focus areas, listening to the podcasts over and over again. Once ready, I uploaded the materials on Moodle and asked students to listen beforehand. In class, we skipped formal lectures and instead had whiteboard &#038; marker discussions. I asked questions, showed derivations, and encouraged debate. With a class of 26 students, it was possible to create much more interaction than in previous years.</p>
<p>Workshops. We still had problem-solving sessions, but I allowed (actually encouraged) students to use LLMs to solve tasks and explain why the solutions were correct. The aim was to emphasise reasoning and assumptions over simply obtaining the right number. This worked with mixed success, and I still need to think how it can be improved further.</p>
<p>Did it work overall?</p>
<p>I&#8217;m not entirely sure. Not all students engaged with the materials in advance, but those who did seemed to benefit and appreciated the approach. What I do know is that the &#8220;two-hour monologue while everyone tries not to fall asleep&#8221; format does not work any more. For universities, and for the (very!) expensive UK education, to remain relevant, we must innovate and rethink how we teach.</p>
<p>What would you change if you were teaching a technical subject at university in the era of AI? I’d love to hear your ideas.</p>
<p>Message <a href="https://openforecast.org/2026/01/07/teaching-statistics-and-descriptive-analytics-in-the-world-of-ai/">Teaching Statistics and Descriptive Analytics in the world of AI</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2026/01/07/teaching-statistics-and-descriptive-analytics-in-the-world-of-ai/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>On randomness and uncertainty</title>
		<link>https://openforecast.org/2025/04/28/on-randomness-and-uncertainty/</link>
					<comments>https://openforecast.org/2025/04/28/on-randomness-and-uncertainty/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Mon, 28 Apr 2025 11:05:29 +0000</pubDate>
				<category><![CDATA[Social media]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Theory of forecasting]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[theory]]></category>
		<category><![CDATA[uncertainty]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=3828</guid>

					<description><![CDATA[<p>Everything is random! Your data, your model, its parameter estimates, the forecasts it produces, and even the minimum of the loss function you used. There is no such thing as a &#8220;deterministic&#8221; forecast &#8211; everything is stochastic! Whenever you work with data, you are working with a sample from a population. In some cases, this [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2025/04/28/on-randomness-and-uncertainty/">On randomness and uncertainty</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Everything is random! Your data, your model, its parameter estimates, the forecasts it produces, and even the minimum of the loss function you used. There is no such thing as a &#8220;deterministic&#8221; forecast &#8211; everything is stochastic!</p>
<p>Whenever you work with data, you are working with a sample from a population. In some cases, this is more apparent than in others. In my statistics lectures, I typically give the following example. Consider that we are interested in the average height of students at the university. I could ask every student at the lecture to tell me their height, take the average, and get a number. Is this number random? Yes, indeed. Why? Because if a student who was late for the lecture comes in, I would need to recalculate the average, and the number would change. The average that I get depends on who specifically I have in the sample and how many observations I have. It will vary more in smaller samples and become more stable in larger ones. But this example gives you an idea about the inherent uncertainty of any estimates we deal with.</p>
<p>In time series, the situation is somewhat similar: you are dealing with a sample of values that you have observed up until a specific moment. If, for example, you want to forecast daily admissions in the emergency department of a hospital and apply a model, its forecast will change when a new day comes and a new cohort of patients arrives. This is because your sample changes, and you receive new information about the demand.</p>
<p>So, the parameter estimates of a model you use will change when you get a new observation (e.g., a new record of product sales). Yes, if you estimate the model properly (e.g., using Least Squares), the parameter estimates won’t change substantially, but they will change nonetheless. And this would affect point forecasts and any other statistics produced by your model. Your standard errors, p-values, conditional means, prediction intervals, error measures, model ranking &#8211; everything will change with a new observation. In fact, if you do model selection, the structure of the model might change as well. For example, in the case of ETS, you might switch from a model without a trend to one with a trend. So, every time you estimate anything on a sample of data, you should keep in mind that it is random and will change if your sample changes or gets updated.</p>
<p>Why is that important? Because we need to understand this inherent uncertainty, and ideally, we should somehow take it into account. In forecasting, this means you should not draw conclusions based on one application of a model to a dataset. At the very least, you should perform <a href="/adam/rollingOrigin.html">a rolling origin evaluation</a>. As Leonidas Tsaprounis says, &#8220;if you don&#8217;t roll the origin, you roll the dice&#8221;.</p>
<p>So, embrace the uncertainty and learn how to deal with it.</p>
<p>By the way, Kandrika Pritularga and I are holding a course on Demand Forecasting starting on 6th May. There is still time to <a href="https://online-payments.lancaster-university.co.uk/product-catalogue/courses/lancaster-university-management-school-lums/centre-for-marketing-analytics-forecasting-cmaf/demand-forecasting-principles-with-examples-in-r">sign up for it here</a>.</p>
<p>Message <a href="https://openforecast.org/2025/04/28/on-randomness-and-uncertainty/">On randomness and uncertainty</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2025/04/28/on-randomness-and-uncertainty/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Why do zeroes happen? A model-based view on demand classification</title>
		<link>https://openforecast.org/2025/03/20/why-do-zeroes-happen-a-model-based-view-on-demand-classification/</link>
					<comments>https://openforecast.org/2025/03/20/why-do-zeroes-happen-a-model-based-view-on-demand-classification/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Thu, 20 Mar 2025 15:32:52 +0000</pubDate>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[intermittent demand]]></category>
		<category><![CDATA[presentations]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=3804</guid>

					<description><![CDATA[<p>I presented our current work with Anna Sroginis during my visit of IÉSEG School of Management, Lille, France last week. It was great to see my colleague and friend Sarah Van der Auweraer, and I enjoyed the discussion we had with people in her group related to forecasting and intermittent demand. You can see details [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2025/03/20/why-do-zeroes-happen-a-model-based-view-on-demand-classification/">Why do zeroes happen? A model-based view on demand classification</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>I presented our current work with Anna Sroginis during my visit of <a href="https://www.ieseg.fr/">IÉSEG School of Management</a>, Lille, France last week. It was great to see my colleague and friend Sarah Van der Auweraer, and I enjoyed the discussion we had with people in her group related to forecasting and intermittent demand. You can see details of the event <a href="/events/why-do-zeroes-happen-a-model-based-view-on-demand-classification/">here</a> and find slides <a href="/wp-content/uploads/2025/03/2024-Lille-AID-Presentation.pdf">here</a>.</p>
<p>Message <a href="https://openforecast.org/2025/03/20/why-do-zeroes-happen-a-model-based-view-on-demand-classification/">Why do zeroes happen? A model-based view on demand classification</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2025/03/20/why-do-zeroes-happen-a-model-based-view-on-demand-classification/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Model vs Method &#8211; why should we care?</title>
		<link>https://openforecast.org/2025/02/04/model-vs-method-why-should-we-care/</link>
					<comments>https://openforecast.org/2025/02/04/model-vs-method-why-should-we-care/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Tue, 04 Feb 2025 12:14:44 +0000</pubDate>
				<category><![CDATA[Social media]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Theory of forecasting]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[theory]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=3771</guid>

					<description><![CDATA[<p>Image above depicts a fashion model making a presentation about a forecasting method. I like the forecast for the final period in that image&#8230; Over the last few years, I’ve seen phrases like &#8220;LightGBM model&#8221; or &#8220;Neural Network model&#8221; on LinkedIn many times, and the statistician in me shivers every time. So, I figured it’s [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2025/02/04/model-vs-method-why-should-we-care/">Model vs Method &#8211; why should we care?</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p><em>Image above depicts a fashion model making a presentation about a forecasting method. I like the forecast for the final period in that image&#8230;</em></p>
<p>Over the last few years, I’ve seen phrases like &#8220;LightGBM model&#8221; or &#8220;Neural Network model&#8221; on LinkedIn many times, and the statistician in me shivers every time. So, I figured it’s time to discuss the difference between a model and a method.</p>
<p>Some of you might remember that I wrote <a href="/2020/06/08/forecasting-method-vs-forecasting-model-what-s-difference/">a post on this topic</a> a few years ago. But it seems it is worth revisiting.</p>
<p>John Boylan and I came up with the following definitions in <a href="/2023/09/08/iets-state-space-model-for-intermittent-demand-forecasting/">our paper</a>:</p>
<ul>
<li>A forecasting model is a mathematical representation of a real phenomenon with a complete specification of distribution and parameters;</li>
<li>A forecasting method is a mathematical procedure that generates point and/or interval forecasts, with or without a forecasting model.</li>
</ul>
<p>If these sound too technical, here’s a simpler explanation:</p>
<ul>
<li>A forecasting method is a way of generating forecasts;</li>
<li>A forecasting model is a way to describe the assumed structure of a real phenomenon.</li>
</ul>
<p>The key difference? A method focuses on producing something specific (e.g., point forecasts) with minimal assumptions, while a model relies on assumptions but can do much more:</p>
<ol>
<li>Rigorous estimation. Models can be constructed in ways that ensure their estimates of parameters are efficient and consistent.</li>
<li>Model selection using information criteria. A powerful approach that saves computational time and typically produces reasonable forecasts.</li>
<li>Predictive distribution. Models can generate moments (mean, variance, skewness) and quantiles, capturing uncertainty around future values.</li>
<li>Confidence intervals for parameters. While not crucial for forecasting, this is useful in other areas to quantify uncertainty.</li>
<li>Extendibility. Additional variables and components can be easily incorporated in a model.</li>
</ol>
<p>All of this comes at a price of <a href="/2025/01/07/there-is-no-such-thing-as-assumption-free-approach/">making assumptions about the reality</a>. If the assumptions don’t hold, the model won’t perform well. It might still be useful, but the risk of error increases. For example, you can apply a Random Walk model to purely random data, but you shouldn’t expect it to work well.</p>
<h3>Examples</h3>
<ol>
<li>A forecasting method: Naïve, defined by the simple equation:<br />
\( F_t = A _{t-1} \)<br />
This method is easy to explain, hard to break, and provides point forecasts, but nothing more.</li>
<li>A forecasting model: Random Walk, which underlies the Naïve method:<br />
\( A_t = A_{t-1} + \epsilon_t \)<br />
where \( \epsilon_t \) follows some distribution with zero mean and fixed variance. The Random Walk model has all the properties described above.</li>
</ol>
<p>In some cases, you can derive models underlying the methods. In my opinion, this typically enhances the latter, making them more powerful due to the reasons explained above. What is interesting about this general connection is that if we can identify a model underlying a method, we can do much more with it.</p>
<p>For example, when estimating a quantile regression, we typically minimize a pinball loss function, which gives us a method for generating quantiles. However, if we estimate the same linear regression model using likelihood, assuming that the error term follows the <a href="https://doi.org/10.1093/biostatistics/kxj039">Asymmetric Laplace distribution</a>, we arrive at exactly the same parameter estimates as in quantile regression. But now, we also gain additional benefits, such as model selection, predictive distribution, and confidence intervals for parameters &#8211; features outlined in the previous post. In a way, these benefits come &#8220;for free&#8221;, although at the cost of making explicit assumptions about the model. That said, I’d argue that assumptions exist in quantile regression anyway &#8211; they’re just not stated explicitly.</p>
<p>And here we finally come to the ML approaches. According to the definitions we discussed earlier, Decision Trees, k-Nearest Neighbors, Artificial Neural Networks (ANNs) and other ML approaches are not forecasting models. They do not attempt to capture the underlying structure of the data. Instead, they focus on identifying nonlinear patterns via engineered features to produce point forecasts. In other words, they are methods, not models.</p>
<p>This doesn’t make them inferior. Their strength lies in their flexibility, precisely because they don&#8217;t impose strong assumptions. However, treating them as forecasting models can lead to potential issues.</p>
<p>For example, plugging LightGBM’s point forecasts into a probability distribution doesn’t magically turn it into a model. It simply makes it a method that now generates quantiles, but without a solid theoretical foundation for why a specific distribution is chosen or used in a particular way.</p>
<p>Another example is model selection using information criteria, which is meaningless for ML approaches. Why? Because information criteria rely on the assumption that the model is estimated in a specific way (e.g., via maximum likelihood estimation), ensuring parameter consistency and model identifiability. However, some ML methods, such as ANNs, are fundamentally unidentifiable, as different architectures can produce the same output. So, the information criteria become meaningless in this setting.</p>
<p>So next time you see the term model, take a moment to consider whether it’s used correctly and whether it actually means what the author thinks.</p>
<p>Message <a href="https://openforecast.org/2025/02/04/model-vs-method-why-should-we-care/">Model vs Method &#8211; why should we care?</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2025/02/04/model-vs-method-why-should-we-care/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>There is no such thing as an &#8220;assumption-free approach&#8221;</title>
		<link>https://openforecast.org/2025/01/07/there-is-no-such-thing-as-assumption-free-approach/</link>
					<comments>https://openforecast.org/2025/01/07/there-is-no-such-thing-as-assumption-free-approach/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Tue, 07 Jan 2025 10:50:55 +0000</pubDate>
				<category><![CDATA[Social media]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Theory of forecasting]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[theory]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=3761</guid>

					<description><![CDATA[<p>One thing that bothers me when I read posts on social media or papers in peer-reviewed journals is the claim that a proposed approach is &#8220;assumption-free.&#8221; In forecasting, this is never true. Such an approach is like a spherical unicorn in a vacuum (see image above). Here&#8217;s why. Every model is a simplification of reality, [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2025/01/07/there-is-no-such-thing-as-assumption-free-approach/">There is no such thing as an &#8220;assumption-free approach&#8221;</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>One thing that bothers me when I read posts on social media or papers in peer-reviewed journals is the claim that a proposed approach is &#8220;assumption-free.&#8221; In forecasting, this is never true. Such an approach is like a spherical unicorn in a vacuum (see image above). Here&#8217;s why.</p>
<p>Every model is a simplification of reality, meaning that it captures only a part of it. Simplifying implies that certain aspects of reality are irrelevant and can be ignored. For example, in forecasting, regardless of the approach used, we typically assume that the model captures the structure correctly, i.e. neither omitting important elements nor overfitting the data. Different approaches address this differently: statistical models do that explicitly, while a good ML approach seeks a balance between underfitting and overfitting, often in a non-linear way. When the structure is captured correctly, the forecast reflects the essential part of reality while ignoring small random fluctuations (see a post on <a href="/2024/08/13/structure-vs-noise-a-fundamental-concept-in-forecasting/">structure vs. noise</a>).</p>
<p>Depending on the assumptions we make, we can classify approaches as parametric, semiparametric, or nonparametric.</p>
<p>Parametric approaches assume that the model is correctly specified, its parameters are accurately estimated, and the chosen distribution is appropriate (often the normal one, though others can be used). In this case, we fully rely on the model. A classical example is the construction of conventional prediction intervals: the conditional expectation and variance are calculated and plugged into the normal distribution to derive the necessary quantiles for a specified confidence level. Specifically, in this case we assume that the model is correct, errors are uncorrelated and homoscedastic, and that they follow a normal distribution.</p>
<p>Semiparametric approaches relax some of these assumptions. For example, we might calculate statistics in a more robust manner or drop the assumption of a specific distribution. For example, instead of relying on textbook formulae, we could use in-sample multistep forecast errors to calculate conditional variances. This eliminates the need to assume uncorrelated and homoscedastic errors and allows for some flexibility in the model structure. However, in this example, we still rely on normality.</p>
<p>Nonparametric approaches avoid most of the above assumptions but come with their own hidden ones. For instance, the method proposed by <a href="https://doi.org/10.1287/mnsc.45.2.225">Taylor &#038; Bunn (1999)</a> for constructing prediction intervals fits quantile regressions to in-sample multistep forecast errors. This method does not assume a correct model, well-behaved residuals, or normality. However, it does assume the appropriateness of the chosen quantile regression function (Spoiler: they used polynomial regression, but my experiments suggest that a power function is a more robust alternative).</p>
<p>You might think that nonparametric approaches, with fewer assumptions, should always be preferred. But that&#8217;s not necessarily the case. It is &#8220;horses for courses&#8221;: you should select the approach that best fits your specific situation. For example, when working with small samples, introducing some assumptions might be necessary to get meaningful estimates. A nonparametric approach, while powerful, might require more data than you have available.</p>
<p>Finally, there is no such thing as a &#8220;best&#8221; method for every situation. As is often the case in forecasting, you need to try different approaches and choose the one that works best. Even then, remember that forecasting always rests on a fundamental assumption: the future will resemble the past. And no fancy method can guarantee that this assumption will hold.</p>
<p>Message <a href="https://openforecast.org/2025/01/07/there-is-no-such-thing-as-assumption-free-approach/">There is no such thing as an &#8220;assumption-free approach&#8221;</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2025/01/07/there-is-no-such-thing-as-assumption-free-approach/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Structure vs. Noise: A Fundamental Concept in Forecasting</title>
		<link>https://openforecast.org/2024/08/13/structure-vs-noise-a-fundamental-concept-in-forecasting/</link>
					<comments>https://openforecast.org/2024/08/13/structure-vs-noise-a-fundamental-concept-in-forecasting/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Tue, 13 Aug 2024 13:06:01 +0000</pubDate>
				<category><![CDATA[Social media]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Theory of forecasting]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[theory]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=3647</guid>

					<description><![CDATA[<p>One of the core ideas in statistics, which extends to many other fields including forecasting, is the concept of structure versus noise. You’ve probably heard of it, but it’s often overlooked by those without a strong quantitative background. So, let&#8217;s discuss. The core of the idea is that any data consists of two fundamental parts: [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2024/08/13/structure-vs-noise-a-fundamental-concept-in-forecasting/">Structure vs. Noise: A Fundamental Concept in Forecasting</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>One of the core ideas in statistics, which extends to many other fields including forecasting, is the concept of structure versus noise. You’ve probably heard of it, but it’s often overlooked by those without a strong quantitative background. So, let&#8217;s discuss.</p>
<p>The core of the idea is that any data consists of two fundamental parts:</p>
<ol>
<li>Structure, which can take various forms, and might include trend, seasonality, calendar effects, and the influence of external factors on demand (e.g., price changes, promotions etc).</li>
<li>Noise, which is inherently unpredictable.</li>
</ol>
<p>Structure can be captured using models or methods, and this is what produces the fitted values or point forecasts. Noise, on the other hand, is unpredictable &#8211; like not knowing exactly who will visit a store and when they’ll make a purchase.</p>
<p>For example, consider a local Lancaster pub that has a nice selection of beers. Their sales likely follow a pattern, such as higher sales on weekends or during special events like football matches. These patterns are the structure we can capture and forecast. However, the pub can&#8217;t anticipate when my friend Yves will visit me, and when we&#8217;ll go out for drinks. This element of uncertainty forms the noise &#8211; while it’s explainable from my perspective, it’s a mystery to the pub owner.</p>
<p>But as I said, the idea of structure vs. noise isn’t just relevant in demand forecasting; it applies in many other areas too. Take classification, for instance. When identifying mushrooms, you might not be able to tell for sure whether you’re looking at a Rosy Brittlegill or The Sickener without a microscope. While certain characteristics (like stem shape or cap colour) make up the structure, there’s always some randomness that can make one mushroom look like another. So, in classification, you can only say that it’s more likely that we have one type of mushroom rather than the other, and you need to consider the uncertainty around this choice (the modern approach to this is to use conformal prediction).</p>
<p>Furthermore, we as humans are very good at finding patterns in the noise. If you look at clouds and see a mushroom, it’s not a real mushroom, just a random arrangement of vapour. So when you work with the data, remember this feature and don’t fall into the trap of finding patterns that don’t actually exist. Be critical and avoid overfitting the noise.</p>
<p>As you can see, the concept of structure versus noise is fundamental and shows up in many contexts. In forecasting, our job all is to capture the structure somehow, filter out the noise so that we can then produce point forecasts (future structure) and prediction intervals (representing the size of uncertainty) to be able to make adequate decisions.</p>
<p>Message <a href="https://openforecast.org/2024/08/13/structure-vs-noise-a-fundamental-concept-in-forecasting/">Structure vs. Noise: A Fundamental Concept in Forecasting</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2024/08/13/structure-vs-noise-a-fundamental-concept-in-forecasting/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Complex-Valued Econometrics with Examples in R</title>
		<link>https://openforecast.org/2024/08/04/complex-valued-econometrics-with-examples-in-r/</link>
					<comments>https://openforecast.org/2024/08/04/complex-valued-econometrics-with-examples-in-r/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Sun, 04 Aug 2024 14:33:40 +0000</pubDate>
				<category><![CDATA[Complex-valued models]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[complex variables]]></category>
		<category><![CDATA[multivariate models]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[statistics]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=3632</guid>

					<description><![CDATA[<p>Back in 2022, my father asked me to help him in amending and editing a monograph he wrote on the topic of &#8220;Complex-Valued Econometrics&#8221;. The original book focused on dynamic models, but after looking through the material and a thorough discussion, we decided to write something more fundamental. The monograph is based on the research [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2024/08/04/complex-valued-econometrics-with-examples-in-r/">Complex-Valued Econometrics with Examples in R</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Back in 2022, my father asked me to help him in amending and editing a monograph he wrote on the topic of &#8220;Complex-Valued Econometrics&#8221;. The original book focused on dynamic models, but after looking through the material and a thorough discussion, we decided to write something more fundamental. The monograph is based on the research he has done over the years, working in Saint Petersburg. I developed an R package called &#8220;<a href="https://github.com/config-i1/complex">complex</a>&#8221; to support the book and then expanded the text with some derivations and examples of application. The result was then submitted to Springer and is <a href="https://doi.org/10.1007/978-3-031-62608-1">now finally published</a> in their &#8220;Contributions to Economics&#8221; series. Unfortunately, due to the agreement with the publisher, we cannot make the book freely available, but some of related materials can be found on a github repo, <a href="https://github.com/config-i1/complex-econometrics">here</a>.</p>
<p>We will receive royalties from selling this book, and we have decided to direct them to a charity to help Ukrainians (<a href="https://www.justgiving.com/campaign/ukraine-aid-help-now">this one</a>).</p>
<p>And here is how the cover of the book looks like:<br />
<div id="attachment_3638" style="width: 209px" class="wp-caption aligncenter"><a href="https://openforecast.org/wp-content/uploads/2024/08/title.webp"><img fetchpriority="high" decoding="async" aria-describedby="caption-attachment-3638" src="https://openforecast.org/wp-content/uploads/2024/08/title-199x300.webp" alt="Complex-Valued Econometrics with Examples in R" width="199" height="300" class="size-medium wp-image-3638" srcset="https://openforecast.org/wp-content/uploads/2024/08/title-199x300.webp 199w, https://openforecast.org/wp-content/uploads/2024/08/title-680x1024.webp 680w, https://openforecast.org/wp-content/uploads/2024/08/title-768x1157.webp 768w, https://openforecast.org/wp-content/uploads/2024/08/title.webp 827w" sizes="(max-width: 199px) 100vw, 199px" /></a><p id="caption-attachment-3638" class="wp-caption-text">Complex-Valued Econometrics with Examples in R</p></div>
<p><a href="https://doi.org/10.1007/978-3-031-62608-1">Svetunkov S., Svetunkov I. (2024). Complex-Valued Econometrics with Examples in R: Modelling, Regression and Applications. Springer Cham. 154 pages. DOI: 10.1007/978-3-031-62608-1</a></p>
<p>Message <a href="https://openforecast.org/2024/08/04/complex-valued-econometrics-with-examples-in-r/">Complex-Valued Econometrics with Examples in R</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2024/08/04/complex-valued-econometrics-with-examples-in-r/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>ISF2024: How to Bootstrap Time Series without Attracting Attention of Statisticians</title>
		<link>https://openforecast.org/2024/07/03/isf2024-how-to-bootstrap-time-series-without-attracting-attention-of-statisticians/</link>
					<comments>https://openforecast.org/2024/07/03/isf2024-how-to-bootstrap-time-series-without-attracting-attention-of-statisticians/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Wed, 03 Jul 2024 14:11:41 +0000</pubDate>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[conferences]]></category>
		<category><![CDATA[ISF]]></category>
		<category><![CDATA[presentations]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=3611</guid>

					<description><![CDATA[<p>On 1st July, I presented my ongoing work on time series bootstrap and its impact on prediction intervals at ISF2024 in Dijon, France. Abstract: Bootstrap is extensively used in statistics and machine learning for cross-sectional data to account for uncertainty about the data, model form, and parameter estimates. However, conventional methods may not be suitable [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2024/07/03/isf2024-how-to-bootstrap-time-series-without-attracting-attention-of-statisticians/">ISF2024: How to Bootstrap Time Series without Attracting Attention of Statisticians</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>On 1st July, I presented my ongoing work on time series bootstrap and its impact on prediction intervals at ISF2024 in Dijon, France.</p>
<p><strong>Abstract</strong>: Bootstrap is extensively used in statistics and machine learning for cross-sectional data to account for uncertainty about the data, model form, and parameter estimates. However, conventional methods may not be suitable for time series data due to autocorrelation and specific dynamic structures. Over the years, various approaches have been developed to address this issue. Some assume specific models (e.g., STL), while others are non-parametric (e.g., Maximum Entropy Bootstrap, MEB). However, the former can be overly restrictive, while the latter may not perform well in case of outliers and external drivers. To address these issues, we propose a non-parametric bootstrap approach inspired by MEB, which does not assume any structure in the data yet creates reasonable copies of existing time series of different nature. These copies can be utilised in bagged ETS/ARIMA or any other approach involving small sample uncertainty. We demonstrate how the proposed bootstrap works using real-time series examples and assess improvements it brings in terms of forecasting accuracy compared to conventional approaches.</p>
<p><a href="/wp-content/uploads/2024/07/2024-ISF-Svetunkov-Bootstrap.pdf">Here are the slides</a> of the presentation.</p>
<p>And here is me, trying not to attract attention of statisticians:</p>
<div id="attachment_3614" style="width: 310px" class="wp-caption aligncenter"><a href="https://openforecast.org/wp-content/webpc-passthru.php?src=https://openforecast.org/wp-content/uploads/2024/07/WhatsApp-Image-2024-07-01-at-14.53.03.jpeg&amp;nocache=1"><img decoding="async" aria-describedby="caption-attachment-3614" src="https://openforecast.org/wp-content/webpc-passthru.php?src=https://openforecast.org/wp-content/uploads/2024/07/WhatsApp-Image-2024-07-01-at-14.53.03-300x225.jpeg&amp;nocache=1" alt="How to attract attention of statisticians..." width="300" height="225" class="size-medium wp-image-3614" srcset="https://openforecast.org/wp-content/webpc-passthru.php?src=https://openforecast.org/wp-content/uploads/2024/07/WhatsApp-Image-2024-07-01-at-14.53.03-300x225.jpeg&amp;nocache=1 300w, https://openforecast.org/wp-content/webpc-passthru.php?src=https://openforecast.org/wp-content/uploads/2024/07/WhatsApp-Image-2024-07-01-at-14.53.03-1024x768.jpeg&amp;nocache=1 1024w, https://openforecast.org/wp-content/webpc-passthru.php?src=https://openforecast.org/wp-content/uploads/2024/07/WhatsApp-Image-2024-07-01-at-14.53.03-768x576.jpeg&amp;nocache=1 768w, https://openforecast.org/wp-content/webpc-passthru.php?src=https://openforecast.org/wp-content/uploads/2024/07/WhatsApp-Image-2024-07-01-at-14.53.03-1536x1152.jpeg&amp;nocache=1 1536w, https://openforecast.org/wp-content/webpc-passthru.php?src=https://openforecast.org/wp-content/uploads/2024/07/WhatsApp-Image-2024-07-01-at-14.53.03.jpeg&amp;nocache=1 2016w" sizes="(max-width: 300px) 100vw, 300px" /></a><p id="caption-attachment-3614" class="wp-caption-text">How to attract attention of statisticians&#8230;</p></div>
<p>Message <a href="https://openforecast.org/2024/07/03/isf2024-how-to-bootstrap-time-series-without-attracting-attention-of-statisticians/">ISF2024: How to Bootstrap Time Series without Attracting Attention of Statisticians</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2024/07/03/isf2024-how-to-bootstrap-time-series-without-attracting-attention-of-statisticians/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Statistical tests flowchart</title>
		<link>https://openforecast.org/2021/11/30/statistical-tests-flowchart/</link>
					<comments>https://openforecast.org/2021/11/30/statistical-tests-flowchart/#respond</comments>
		
		<dc:creator><![CDATA[Ivan Svetunkov]]></dc:creator>
		<pubDate>Tue, 30 Nov 2021 15:29:12 +0000</pubDate>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[statistics]]></category>
		<guid isPermaLink="false">https://openforecast.org/?p=2740</guid>

					<description><![CDATA[<p>In Lancaster University, I teach the module called &#8220;Statistics and Descriptive Analytics&#8221;, which is compulsory for master students of the programme &#8220;Business Analytics&#8220;. This year, the module has been delivered by Alisa Yusupova and me, and I have prepared a flowchart that should (hopefully) help students decide, which of the statistical tests to use in [&#8230;]</p>
<p>Message <a href="https://openforecast.org/2021/11/30/statistical-tests-flowchart/">Statistical tests flowchart</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>In Lancaster University, I teach the module called &#8220;Statistics and Descriptive Analytics&#8221;, which is compulsory for master students of the programme &#8220;<a href="https://www.lancaster.ac.uk/lums/our-departments/management-science/masters/msc-business-analytics/" rel="noopener" target="_blank">Business Analytics</a>&#8220;. This year, the module has been delivered by <a href="https://www.lancaster.ac.uk/lums/people/alisa-yusupova" rel="noopener" target="_blank">Alisa Yusupova</a> and me, and I have prepared a flowchart that should (hopefully) help students decide, which of the statistical tests to use in different situations. This is in no way a full diagram, it mainly focuses on tests for the mean / median and variance, but it should be useful for the students and those who use statistical tests in their work. Here it is:</p>
<div id="attachment_2743" style="width: 222px" class="wp-caption aligncenter"><a href="https://openforecast.org/wp-content/uploads/2021/11/Statistical-Tests-FlowChart-scaled.gif"><img decoding="async" aria-describedby="caption-attachment-2743" src="https://openforecast.org/wp-content/uploads/2021/11/Statistical-Tests-FlowChart-212x300.gif" alt="Statistical Tests Flowchart" width="212" height="300" class="size-medium wp-image-2743" srcset="https://openforecast.org/wp-content/uploads/2021/11/Statistical-Tests-FlowChart-212x300.gif 212w, https://openforecast.org/wp-content/uploads/2021/11/Statistical-Tests-FlowChart-723x1024.gif 723w, https://openforecast.org/wp-content/uploads/2021/11/Statistical-Tests-FlowChart-768x1088.gif 768w, https://openforecast.org/wp-content/uploads/2021/11/Statistical-Tests-FlowChart-1085x1536.gif 1085w, https://openforecast.org/wp-content/uploads/2021/11/Statistical-Tests-FlowChart-1446x2048.gif 1446w" sizes="(max-width: 212px) 100vw, 212px" /></a><p id="caption-attachment-2743" class="wp-caption-text">Start your journey from the green rhombus</p></div>
<p>And here is <a href="https://openforecast.org/wp-content/uploads/2021/11/Statistical-Tests-Flowchart.pdf">the same thing in pdf</a></p>
<p>Message <a href="https://openforecast.org/2021/11/30/statistical-tests-flowchart/">Statistical tests flowchart</a> first appeared on <a href="https://openforecast.org">Open Forecasting</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openforecast.org/2021/11/30/statistical-tests-flowchart/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
