Penguin Solutions Achieves NVIDIA AI Factory Specialisation to Support Enterprise and Sovereign AI Deployments

26 June 2026 | NEWS

Designation recognises Penguin’s expertise in delivering full-stack AI factory infrastructure, helping organisations accelerate AI adoption, optimise token economics and scale next-generation AI workloads.

Penguin Solutions, Inc. (Nasdaq: PENG), the AI Factory Platform Company, announced it has become an NVIDIA AI Factory Specialised Partner, joining a select group of NVIDIA Partner Network (NPN) solution providers. Penguin achieved this invitation-only NVIDIA AI Factory specialisation by completing NVIDIA’s comprehensive training, maintaining the relevant competencies, meeting solution requirements, and bringing proven experience in designing, building, deploying, and operating full-stack, NVIDIA-based AI factory infrastructure for enterprise and hyperscale customers.

The NVIDIA AI Factory Specialised Partner designation recognises Penguin Solutions’ expertise and capabilities in delivering enterprise-scale AI inferencing and training solutions that enable organisations’ agentic and AI workloads.

Penguin helps customers accelerate AI time-to-value and achieve superior token economics by delivering and optimising NVIDIA-accelerated AI factories. Penguin’s Full-Stack AI Factory Platform considers every layer of the AI environment, including the underlying power required to produce tokens; the accelerated NVIDIA processors that enable efficient computation; the specialised server, networking and storage infrastructure needed to orchestrate thousands of GPUs; and the various models and applications used to innovate and transform our world.

"For over a decade, we have worked closely with NVIDIA in delivering and operating AI factories and GPU-based solutions for leading hyperscalers, enterprises, sovereign AI, and neocloud providers," said Kash Shaikh, President and CEO of Penguin Solutions. “This designation validates Penguin's deep capabilities to design, build, deploy, and operate full-stack AI factories at scale. As demand for AI infrastructure continues to accelerate, our customers are realising meaningful business outcomes from their AI investments in competitive, fast-paced industries where success is increasingly determined by their ability to operationalise AI at scale with superior token economics.”

As organisations look to accelerate their AI initiatives, demand for AI infrastructure that can support large-scale inference and emerging agentic AI workloads continues to grow. They require AI factory platform solutions that deliver performance, scalability, efficiency, and operational reliability. For example, Deepgram recently worked with Penguin to deploy a production-ready AI inference platform supporting large-scale Speech-to-Text, Text-to-Speech, and Voice Agent applications and workloads.

Penguin also collaborated with NVIDIA and SK Telecom in the creation of the Haein AI Factory, one of Korea’s largest, most powerful, and award-winning GPU-as-a-Service clusters, defining a new model for collaboration, execution, and sovereign AI strategy.

Penguin's Full-Stack AI Factory Platform brings together innovative products, including ClusterWareAI, MemoryAI, ComputeAI, OriginAI®, and End-to-end Services to help customers design, deploy, and manage AI factory environments to accelerate AI adoption while simplifying deployment and reducing operational complexity.

The NVIDIA AI Factory Specialisation designation further validates Penguin’s strategy to become a leading Full Stack AI Factory Platform company for enterprises, sovereign AI initiatives, and neocloud providers.