News

Startups

Artificial Intelligence

Americas

Gimlet Labs Raises $80 Million to Scale Its Heterogeneous AI Inference Cloud

Gimlet Labs has announced an $80 million Series A to accelerate its multi-silicon infrastructure designed for the high-speed demands of autonomous AI agents.

Gimlet Labs has announced an $80 million Series A to accelerate its multi-silicon infrastructure designed for the high-speed demands of autonomous AI agents.

NewDecoded

Published Mar 24, 2026

Mar 24, 2026

3 min read

Image by Gimet Labs

Gimlet Labs has closed an $80 million Series A funding round to address the growing compute challenges of the AI industry. The investment was led by Menlo Ventures and included participation from Eclipse, Factory, Prosperity7, and Triatomic. This capital injection brings the startup's total funding to $92 million just five months after its public launch, reflecting intense market interest in specialized AI infrastructure.

The company is building what it calls a multi-silicon inference cloud, specifically designed to run complex AI agents. Unlike traditional setups that rely solely on homogeneous GPU clusters, Gimlet's architecture orchestrates workloads across diverse hardware including CPUs and specialized SRAM accelerators. This approach has already produced 3-10X speedups on frontier models with over one trillion parameters while operating within the same power envelope.

Gimlet reports that its customer base has tripled in recent months, now including a top tier frontier AI lab and a major hyperscaler. These organizations are leveraging the platform to manage the high token throughput required by agentic workflows, such as coding assistants that ingest entire codebases. The platform's ability to run proprietary models on specialized hardware provides a significant performance edge in what the company calls the Inference Speed Wars.

Founded by the technical team behind Pixie Labs, Gimlet is also innovating at the physical layer of the data center. They are designing facilities to connect disparate accelerators over high-speed networks, solving complex thermal and plumbing issues inherent in mixed-architecture environments. The team, led by Zain Asgar, plans to use the new funding to expand their physical infrastructure and grow their engineering team to meet global demand.


Decoded Take

Decoded Take

Decoded Take

The Shift Toward Multi-Silicon Architecture

The transition from simple chatbots to autonomous agents marks a turning point where brute force GPU scaling reaches its economic and physical limits. By decoupling AI workloads from specific hardware, Gimlet Labs is pioneering a trend toward multi-silicon environments where software intelligence dictates hardware efficiency. This move signals that the future of AI will not be won by the largest chip clusters alone, but by the most sophisticated orchestration of specialized silicon designed for agentic tasks.

Share this article

Related Articles