TechnologyOpenAI and NVIDIA break record: 1.5 million tokens per second with open-licensed AI models
OpenAI and NVIDIA have joined forces to release two powerful artificial intelligence models with the GPT‑OSS‑120B and GPT‑OSS‑20B, marking a key moment in democratizing access to advanced AI. These models are designed to support a wide range of applications in content generation, reasoning, healthcare, industrial manufacturing, and more. Distributed under the Apache 2.0 license, the models are free for commercial and research use, providing developers, enterprises, governments, and startups with the tools to build transformative AI-based solutions.
The GPT‑OSS models are trained on NVIDIA H100 GPUs and optimized to run on NVIDIA’s global CUDA platform, which powers hundreds of millions of GPUs in the cloud, personal computers, and workstations worldwide. This strategic alignment ensures that developers around the globe can integrate these models into their existing infrastructure.
At the heart of this advancement is NVIDIA’s Blackwell architecture, purpose-built for high-throughput AI inference. The GB200 NVL72 mainframe achieves an unprecedented 1.5 million tokens per second when running the GPT‑OSS‑120B model, making it one of the most powerful inference platforms in the world. Blackwell brings innovations like NVFP44-bit precision, which enables extremely efficient execution with high precision, while significantly reducing power consumption and memory requirements, a giant step towards real-time use of models with trillions of parameters.
This collaboration also reflects a long-standing partnership between OpenAI and NVIDIA, dating back to 2016, when NVIDIA founder Jensen Huang personally delivered the first DGX-1 supercomputer to OpenAI’s headquarters in San Francisco. Since then, the two companies have collaborated on some of the world’s most ambitious AI training. Today’s release builds on this legacy, bringing cutting-edge AI capabilities to millions of developers globally, supported by an ecosystem of over 6.5 million developers in more than 250 countries.
Fundamentally, the release of GPT‑OSS represents a significant step towards making advanced AI more transparent, efficient, and accessible to everyone. With scalable infrastructure, open licensing, and broad hardware support, OpenAI and NVIDIA are not only accelerating the pace of innovation, but also building the foundation for the next industrial revolution driven by open, accountable, and high-performance AI.