ΑΙhub 01月06日
AI weather models can now beat the best traditional forecasts
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

谷歌DeepMind研究人员在《自然》杂志上发表论文,介绍了一种名为GenCast的新型机器学习天气预测模型。该模型采用扩散模型方法,类似于AI图像生成器,能够生成多个预测结果,捕捉大气复杂行为。与传统方法相比,GenCast所需的时间和计算资源更少。它通过从1979年至2018年的再分析数据中学习,预测温度、压力、湿度和风速等多个变量。该模型在单个张量处理器上仅需8分钟即可完成15天的预测,速度远超传统模型。虽然机器学习预测日益普及,但传统数值天气预测仍是基础,为机器学习模型提供初始条件和数据。

🌪️ GenCast使用扩散模型,通过生成多个天气预测来捕捉大气复杂性,每个预测更真实地反映自然界的复杂性,并通过平均多个预测来提高准确度。

⏱️ GenCast的预测速度显著快于传统方法,在单个处理器上仅用8分钟即可完成15天的预测,而训练模型仅需5天,并使用了32个张量处理器。

📊 GenCast的训练数据来源于1979年至2018年的再分析数据,包括温度、压力、湿度和风速等多个变量,并在全球范围内进行了细致的网格划分,以提高预测的准确性。

💡 机器学习天气预测系统虽然在不断发展,但仍依赖于传统的数值天气预测和再分析数据,后者为前者提供初始条件和数据支持,并不断微调机器学习模型。

NASA/GSFC, MODIS Rapid Response Team, Jacques Descloitres

By Vassili Kitsios, CSIRO

A new machine-learning weather prediction model called GenCast can outperform the best traditional forecasting systems in at least some situations, according to a paper by Google DeepMind researchers published last month in Nature.

Using a diffusion model approach similar to artificial intelligence (AI) image generators, the system generates multiple forecasts to capture the complex behaviour of the atmosphere. It does so with a fraction of the time and computing resources required for traditional approaches.

How weather forecasts work

The weather predictions we use in practice are produced by running multiple numerical simulations of the atmosphere.

Each simulation starts from a slightly different estimate of the current weather. This is because we don’t know exactly what the weather is at this instant everywhere in the world. To know that, we would need sensor measurements everywhere.

These numerical simulations use a model of the world’s atmosphere divided into a grid of three-dimensional blocks. By solving equations describing the fundamental physical laws of nature, the simulations predict what will happen in the atmosphere.

Known as general circulation models, these simulations need a lot of computing power. They are usually run at high-performance supercomputing facilities.

Machine-learning the weather

The past few years have seen an explosion in efforts to produce weather prediction models using machine learning. Typically, these approaches don’t incorporate our knowledge of the laws of nature the way general circulation models do.

Most of these models use some form of neural network to learn patterns in historical data and produce a single future forecast. However, this approach produces predictions that lose detail as they progress into the future, gradually becoming “smoother”. This smoothness is not what we see in real weather systems.

Researchers at Google’s DeepMind AI research lab have just published a paper in Nature describing their latest machine-learning model, GenCast.

GenCast mitigates this smoothing effect by generating an ensemble of multiple forecasts. Each individual forecast is less smooth, and better resembles the complexity observed in nature.

The best estimate of the actual future then comes from averaging the different forecasts. The size of the differences between the individual forecasts indicates how much uncertainty there is.

According to the GenCast paper, this probabilistic approach creates more accurate forecasts than the best numerical weather prediction system in the world – the one at the European Centre for Medium-Range Weather Forecasts.

Generative AI – for weather

GenCast is trained on what is called reanalysis data from the years 1979 to 2018. This data is produced by the kind of general circulation models we talked about earlier, which are additionally corrected to resemble actual historical weather observations to produce a more consistent picture of the world’s weather.

The GenCast model makes predictions of several variables such as temperature, pressure, humidity and wind speed at the surface and at 13 different heights, on a grid that divides the world up into 0.25-degree regions of latitude and longitude.

GenCast is what is called a “diffusion model”, similar to AI image generators. However, instead of taking text and producing an image, it takes the current state of the atmosphere and produces an estimate of what it will be like in 12 hours.

This works by first setting the values of the atmospheric variables 12 hours into the future as random noise. GenCast then uses a neural network to find structures in the noise that are compatible with the current and previous weather variables. An ensemble of multiple forecasts can be generated by starting with different random noise.

Forecasts are run out to 15 days, taking 8 minutes on a single processor called a tensor processor unit (TPU). This is significantly faster than a general circulation model. The training of the model took five days using 32 TPUs.

Machine-learning forecasts could become more widespread in the coming years as they become more efficient and reliable.

However, classical numerical weather prediction and reanalysed data will still be required. Not only are they needed to provide the initial conditions for the machine learning weather forecasts, they also produce the input data to continually fine-tune the machine learning models.

What about the climate?

Current machine learning weather forecasting systems are not appropriate for climate projections, for three reasons.

Firstly, to make weather predictions weeks into the future, you can assume that the ocean, land and sea ice won’t change. This is not the case for climate predictions over multiple decades.

Secondly, weather prediction is highly dependent on the details of the current weather. However, climate projections are concerned with the statistics of the climate decades into the future, for which today’s weather is irrelevant. Future carbon emissions are the greater determinant of the future state of the climate.

Thirdly, weather prediction is a “big data” problem. There are vast amounts of relevant observational data, which is what you need to train a complex machine learning model.

Climate projection is a “small data” problem, with relatively little available data. This is because the relevant physical phenomena (such as sea levels or climate drivers such as the El Niño–Southern Oscillation) evolve much more slowly than the weather.

There are ways to address these problems. One approach is to use our knowledge of physics to simplify our models, meaning they require less data for machine learning.

Another approach is to use physics-informed neural networks to try to fit the data and also satisfy the laws of nature. A third is to use physics to set “ground rules” for a system, then use machine learning to determine the specific model parameters.

Machine learning has a role to play in the future of both weather forecasting and climate projections. However, fundamental physics – fluid mechanics and thermodynamics – will continue to play a crucial role.

Vassili Kitsios, Senior Research Scientist, Climate Forecasting, CSIRO

This article is republished from The Conversation under a Creative Commons license. Read the original article.


Read the work in full

Probabilistic weather forecasting with machine learning, Ilan Price, Alvaro Sanchez-Gonzalez, Ferran Alet, Tom R. Andersson, Andrew El-Kadi, Dominic Masters, Timo Ewalds, Jacklynn Stott, Shakir Mohamed, Peter Battaglia, Remi Lam & Matthew Willson (2024).

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

GenCast 机器学习 天气预测 扩散模型 AI
相关文章