Edge AI: The On-Device Frontier Beyond the Data Center

Recently, while driving past a data center under construction near Port Washington, Wisconsin, I was stunned. The scale of this facility is breathtaking: it is designed for 2.5 million square feet of data center space across 672 acres (with the potential to expand to 1,900 total acres). Upon completion, the plan calls for an estimated 50,000 to 150,000 AI servers drawing an estimated 900 MW to more than 1 GW of IT load. And this single project is just one of dozens going up around the country.

Presently, investors are enthralled by the enormity of the aggregate AI infrastructure buildout. Huge companies are making extensive investments in infrastructure, components, and software to stand up a complex inferencing ecosystem. Everything is supersized.

As the example above illustrates, the footprints themselves are gigantic, covering vast tracts of real estate and requiring substantial energy capacity to power them. Inside will sit thousands of server racks, each weighing as much as a small pickup truck or a male walrus. Even the silicon (once known as “MICROchips”) has subscribed to the bigger-is-better philosophy, with a high-profile, newly public company building a single system-on-chip about the size of an LP vinyl record.

All these assets, and the trillions of dollars spent on them, are being assembled to power large language models (LLMs). The operative word is large.

We marvel, of course, at the sheer scale of it all. But this is not where the story ends. As is often the case in technology, good things come in small packages. Another frontier for investors to consider is on the horizon, beyond the bulldozers: Edge AI.

In simple terms, Edge AI places workloads as close as possible to the “edge,” where data is created and actions are executed. It runs on any device built to perform a specific function and outfitted with onboard compute capacity. In practice, these devices ingest information from their operating environment, process it using artificial intelligence, and act on the results in real time.

Today’s massive data center buildout is largely about constructing the “brain,” the centralized intelligence responsible for training increasingly sophisticated AI models. The greater the computing power devoted to this infrastructure, the more advanced the intelligence frontier becomes.

While much of today’s AI investment is focused on centralized data centers and cloud infrastructure, Edge AI pushes intelligence outward onto dynamic endpoint devices operating at the edge of the network. The full potential of AI investment will only be realized when that centralized intelligence works in concert with highly capable Edge AI systems. Together, they enable a continuous, 360-degree machine learning ecosystem, where intelligence is not only trained centrally, but also deployed locally to deliver real-time insights, autonomous decision-making, and instantaneous action.

Edge AI: Coming Soon to a Device Near You

The best place to start this discussion is with the applications Edge AI will enable. We can identify many, but as with any transformative technology, the ultimate extensions are unknowable. In 2000, when the commercial internet was still young, was it possible to predict that 25 years later, 27% of married couples would have met on dating apps?

AI is enabling Edge 2.0 applications beyond today’s traditional robotics and Internet of Things (IoT). We introduce a few here, aware that the full range of opportunities is open-ended.

Robotics

Much of today’s AI discussion centers on software agents that help knowledge workers and consumers automate repetitive digital workflows. These applications are naturally well-suited to cloud-based computing resources. Automating physical tasks will expand that scope dramatically. Robots of all shapes, sizes, and capabilities are being developed and deployed in dynamic real-world operating environments, such as factory floors, warehouses, and logistics networks.

While industrial robots have long been capable of repetitive functions, such as material handling, welding, and assembly-line execution, the next generation aims to navigate imperfect operating environments that require real-time adaptation and decision-making. This opportunity is substantial given the more than half a billion manufacturing workers globally who today must augment physical processes with continuous human insights and on-the-fly adjustments to get the job done.