Arquivo da tag: Inteligência artifical

AI Is Changing the Way We Predict the Weather. It’s More Perilous Than We Think (Gizmodo)

AI forecast models offer some clear benefits over traditional physical models, but they are ill-equipped to handle the increasing volatility of a warming climate.

By Ellyn Lapointe

Published April 27, 2026, 6:00 am ET

Original article

 On November 12, 1970, the Bhola cyclone slammed into the coast of what was then East Pakistan. The storm brought maximum sustained wind speeds of 130 miles per hour (205 kilometers per hour) and a 35-foot (10.5-meter) storm surge, killing an estimated 300,000 to 500,000 people.

Today, the Bhola cyclone remains the deadliest tropical storm on record. But if it had struck a decade later, it might not have been so devastating. Weather forecasting changed dramatically in the 1970s as meteorologists adopted physics-based computer models that improved storm prediction. With the rise of AI, forecasting is evolving again—but this time, experts worry the new models may be less reliable when it comes to predicting unprecedented weather events.

Researchers are calling this the “gray swan” problem. Gray swan weather extremes are physically plausible but so rare that they are poorly represented in training datasets. The trouble is, climate change is leading to more first-of-their-kind weather extremes. Think: the 2021 Pacific Northwest heatwave. This event was so severe that it would have been virtually impossible without climate change.

Physical forecast models can simulate gray swan events like the Pacific Northwest heatwave, though they are labeled extremely rare. They can do that because they are built on the laws of physics. AI models are trained on past weather data, wherein gray swans are practically nonexistent.

“They fail on gray swans,” Pedram Hassanzadeh, an associate professor of geophysical sciences at the University of Chicago, told Gizmodo. He and his colleagues published a study last April that removed all Category 3 through 5 hurricanes from an AI model’s training dataset, then tested it on Category 5 storms. The results showed that AI models cannot accurately forecast previously unseen events, as this would require extrapolation.

“The concern isn’t occasional misses. It’s that AI models can miss silently, producing confident forecasts of unremarkable weather while a record-breaking event is unfolding,” Rose Yu, an associate professor of computer science and engineering at the University of California San Diego, told Gizmodo in an email.

“Other risks matter too,” she said. “AI models can violate conservation laws in subtle ways that don’t show up in standard metrics. When they bust a forecast, diagnosing why is harder. They depend on stable observing systems, which is a real concern given current pressure on satellite programs. And institutionally, if we consolidate around AI too quickly and let physics-based infrastructure atrophy, we lose the redundancy that currently catches AI’s failures.”

The case for AI forecasting

Despite these pitfalls, meteorologists are rapidly adopting AI forecast models, and it’s actually easy to understand why. They’re faster, cheaper, and require far less computational infrastructure than physical models. When it comes to predicting typical weather patterns and events (not gray swans), their accuracy is comparable and improving rapidly.

“The typical rate of progress for most state-of-the-art physical models has been something like a day more accurate per decade, which doesn’t sound like a lot, but that’s consequential,” Andrew Charlton-Perez, a professor of meteorology and head of the School of Mathematical, Physical, and Computational Sciences at the University of Reading, told Gizmodo.

“The rate of accuracy growth for machine learning models has vastly exceeded that,” he said. “They are now competitive, and two-three years ago, they were not even in the same ballpark.”

During the 2025 Atlantic hurricane season, for example, Google DeepMind’s model outperformed nearly every physical model on storm track and intensity. In fact, since 2023, leading AI models such as GraphCast, Pangu-Weather, and the ECMWF’s AIFS have matched or outperformed the best physical models on medium-range forecasting metrics, according to Yu.

AI models are proving especially valuable in parts of the world that lack traditional forecasting resources—regions that are often on the frontlines of climate change. Hassanzadeh co-directed an initiative that provided 38 million farmers across India with AI-based monsoon forecasts, giving them up to four weeks’ advance notice of the rainy season’s onset.

“​​A lot of countries were left behind in that first revolution of weather forecasting, because [traditional] weather forecasting requires a supercomputer, hundreds of millions of dollars, various fields, workforce, and experts,” Hassanzadeh explained. AI models, by comparison, are far more accessible to lower-income countries.

Filling the knowledge gaps

Still, rapidly adopting these models without addressing the risks would be dangerous, especially in parts of the world highly vulnerable to the impacts of climate change. Shruti Nath, a postdoctoral research associate at the University of Oxford, recently co-authored an editorial calling for more rigorous testing of AI forecast models before public agencies widely adopt them.

“There is still a lot of work to be done in understanding the limits of these models, alongside where they could supplement physical models and why,” she told Gizmodo in an email.

Nath’s editorial outlines a framework for testing AI forecast models that would deliberately withhold a designated set of “iconic” extreme events (like the Pacific Northwest heat wave, for example) from the training dataset. These events would be reserved solely for testing in order to assess the models’ ability to extrapolate unprecedented weather extremes, or gray swans.

Actually implementing this AI Retraining Without Iconic Events (AIRWIE) protocol “would require the meteorological community to agree on which high-impact events constitute a rigorous benchmark,” the editorial states. This would be a great undertaking, but Nath believes most researchers agree that there is an urgent need for this kind of testing.

“We need to be a bit more organized, however, in ensuring that proper protocols can be followed and that robust safeguards are put in place and maintained by the community,” Nath said. “This is difficult when things are in such a hype phase and no one wants to miss out on the bandwagon.”

Other researchers, like Hassanzadeh, are developing ways to teach AI forecast models to predict gray swans. He and his colleagues are investigating whether combining AI systems with “relevant sampling” methods—which allow them to generate samples of gray swan events—can improve the models’ ability to extrapolate unprecedented extremes.

Efforts to understand and address the limitations of AI forecasting will be critical, because there’s no turning back now. AI is already reshaping the way we predict the weather, and as the climate becomes increasingly volatile, meteorologists will need every tool in their arsenal to be sharp and reliable. Despite their current limitations, there is much to gain from continuing to push these systems forward and figuring out how to best integrate them with physical forecasting.

“The research agenda is about making AI models physically consistent, well-calibrated, and robust to distribution shift,” Yu said. “Abandoning this approach because of the gray swan problem means giving up the biggest improvement in forecasting in a generation.”

Projecting a robot’s intentions: New spin on virtual reality helps engineers read robots’ minds (Science Daily)

Date: October 29, 2014

Source: Massachusetts Institute of Technology

Summary: In a darkened, hangar-like space inside MIT’s Building 41, a small, Roomba-like robot is trying to make up its mind. Standing in its path is an obstacle — a human pedestrian who’s pacing back and forth. To get to the other side of the room, the robot has to first determine where the pedestrian is, then choose the optimal route to avoid a close encounter. As the robot considers its options, its “thoughts” are projected on the ground: A large pink dot appears to follow the pedestrian — a symbol of the robot’s perception of the pedestrian’s position in space.

A new spin on virtual reality helps engineers read robots’ minds. Credit: Video screenshot courtesy of Melanie Gonick/MIT

In a darkened, hangar-like space inside MIT’s Building 41, a small, Roomba-like robot is trying to make up its mind.

Standing in its path is an obstacle — a human pedestrian who’s pacing back and forth. To get to the other side of the room, the robot has to first determine where the pedestrian is, then choose the optimal route to avoid a close encounter.

As the robot considers its options, its “thoughts” are projected on the ground: A large pink dot appears to follow the pedestrian — a symbol of the robot’s perception of the pedestrian’s position in space. Lines, each representing a possible route for the robot to take, radiate across the room in meandering patterns and colors, with a green line signifying the optimal route. The lines and dots shift and adjust as the pedestrian and the robot move.

This new visualization system combines ceiling-mounted projectors with motion-capture technology and animation software to project a robot’s intentions in real time. The researchers have dubbed the system “measurable virtual reality (MVR) — a spin on conventional virtual reality that’s designed to visualize a robot’s “perceptions and understanding of the world,” says Ali-akbar Agha-mohammadi, a postdoc in MIT’s Aerospace Controls Lab.

“Normally, a robot may make some decision, but you can’t quite tell what’s going on in its mind — why it’s choosing a particular path,” Agha-mohammadi says. “But if you can see the robot’s plan projected on the ground, you can connect what it perceives with what it does to make sense of its actions.”

Agha-mohammadi says the system may help speed up the development of self-driving cars, package-delivering drones, and other autonomous, route-planning vehicles.

“As designers, when we can compare the robot’s perceptions with how it acts, we can find bugs in our code much faster,” Agha-mohammadi says. “For example, if we fly a quadrotor, and see something go wrong in its mind, we can terminate the code before it hits the wall, or breaks.”

The system was developed by Shayegan Omidshafiei, a graduate student, and Agha-mohammadi. They and their colleagues, including Jonathan How, a professor of aeronautics and astronautics, will present details of the visualization system at the American Institute of Aeronautics and Astronautics’ SciTech conference in January.

Seeing into the mind of a robot

The researchers initially conceived of the visualization system in response to feedback from visitors to their lab. During demonstrations of robotic missions, it was often difficult for people to understand why robots chose certain actions.

“Some of the decisions almost seemed random,” Omidshafiei recalls.

The team developed the system as a way to visually represent the robots’ decision-making process. The engineers mounted 18 motion-capture cameras on the ceiling to track multiple robotic vehicles simultaneously. They then developed computer software that visually renders “hidden” information, such as a robot’s possible routes, and its perception of an obstacle’s position. They projected this information on the ground in real time, as physical robots operated.

The researchers soon found that by projecting the robots’ intentions, they were able to spot problems in the underlying algorithms, and make improvements much faster than before.

“There are a lot of problems that pop up because of uncertainty in the real world, or hardware issues, and that’s where our system can significantly reduce the amount of effort spent by researchers to pinpoint the causes,” Omidshafiei says. “Traditionally, physical and simulation systems were disjointed. You would have to go to the lowest level of your code, break it down, and try to figure out where the issues were coming from. Now we have the capability to show low-level information in a physical manner, so you don’t have to go deep into your code, or restructure your vision of how your algorithm works. You could see applications where you might cut down a whole month of work into a few days.”

Bringing the outdoors in

The group has explored a few such applications using the visualization system. In one scenario, the team is looking into the role of drones in fighting forest fires. Such drones may one day be used both to survey and to squelch fires — first observing a fire’s effect on various types of vegetation, then identifying and putting out those fires that are most likely to spread.

To make fire-fighting drones a reality, the team is first testing the possibility virtually. In addition to projecting a drone’s intentions, the researchers can also project landscapes to simulate an outdoor environment. In test scenarios, the group has flown physical quadrotors over projections of forests, shown from an aerial perspective to simulate a drone’s view, as if it were flying over treetops. The researchers projected fire on various parts of the landscape, and directed quadrotors to take images of the terrain — images that could eventually be used to “teach” the robots to recognize signs of a particularly dangerous fire.

Going forward, Agha-mohammadi says, the team plans to use the system to test drone performance in package-delivery scenarios. Toward this end, the researchers will simulate urban environments by creating street-view projections of cities, similar to zoomed-in perspectives on Google Maps.

“Imagine we can project a bunch of apartments in Cambridge,” Agha-mohammadi says. “Depending on where the vehicle is, you can look at the environment from different angles, and what it sees will be quite similar to what it would see if it were flying in reality.”

Because the Federal Aviation Administration has placed restrictions on outdoor testing of quadrotors and other autonomous flying vehicles, Omidshafiei points out that testing such robots in a virtual environment may be the next best thing. In fact, the sky’s the limit as far as the types of virtual environments that the new system may project.

“With this system, you can design any environment you want, and can test and prototype your vehicles as if they’re fully outdoors, before you deploy them in the real world,” Omidshafiei says.

This work was supported by Boeing.

Video: http://www.youtube.com/watch?v=utM9zOYXgUY