Model Overview

Description:

The NVIDIA gpt-oss-120b Eagle3 model is the Eagle head for OpenAI’s gpt-oss-120b, an auto-regressive language model that uses a mixture-of-experts (MoE) architecture with 5 billion activated parameters and 120 billion total parameters. For more information, see the OpenAI gpt-oss-120b model card. The NVIDIA gpt-oss-120b Eagle3 model incorporates Eagle speculative decoding with TensorRT Model Optimizer.
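To give a sense of what speculative decoding does at inference time, the following is a minimal sketch of a generic greedy draft-and-verify loop. It is not the Eagle3 algorithm or any TensorRT Model Optimizer API: the Eagle head additionally conditions on the base model's internal features, and real systems verify all drafted tokens in a single batched forward pass. The names `speculative_step`, `generate`, `greedy`, `toy_target`, and `toy_draft` are hypothetical, and the two "models" are toy functions over integer tokens.

```python
from typing import Callable, List

# A "model" here is any function mapping a token prefix to its greedy next token.
NextToken = Callable[[List[int]], int]


def speculative_step(target: NextToken, draft: NextToken,
                     prefix: List[int], k: int) -> List[int]:
    """Run one draft-and-verify step and return the accepted tokens (at least one)."""
    # 1. The cheap draft model proposes k tokens autoregressively.
    proposed: List[int] = []
    ctx = list(prefix)
    for _ in range(k):
        token = draft(ctx)
        proposed.append(token)
        ctx.append(token)

    # 2. The target model checks each proposed position in order.
    #    (Sketch only: a real system scores all positions in one forward pass.)
    accepted: List[int] = []
    ctx = list(prefix)
    for token in proposed:
        expected = target(ctx)
        if expected != token:
            # First mismatch: keep the target's token and discard the rest.
            accepted.append(expected)
            return accepted
        accepted.append(token)
        ctx.append(token)

    # 3. Every draft token matched, so the target contributes one extra token.
    accepted.append(target(ctx))
    return accepted


def generate(target: NextToken, draft: NextToken,
             prompt: List[int], max_new_tokens: int, k: int = 4) -> List[int]:
    """Generate with speculative decoding; output matches plain greedy decoding."""
    out = list(prompt)
    while len(out) - len(prompt) < max_new_tokens:
        out.extend(speculative_step(target, draft, out, k))
    return out[:len(prompt) + max_new_tokens]


def greedy(target: NextToken, prompt: List[int], max_new_tokens: int) -> List[int]:
    """Baseline: one target call per generated token."""
    out = list(prompt)
    for _ in range(max_new_tokens):
        out.append(target(out))
    return out


# Toy stand-ins (assumptions for illustration only).
def toy_target(ctx: List[int]) -> int:
    return (ctx[-1] + 1) % 10


def toy_draft(ctx: List[int]) -> int:
    # Agrees with the target except after token 4, where it guesses wrong.
    return 0 if ctx[-1] == 4 else (ctx[-1] + 1) % 10
```

With greedy verification, the output is identical to decoding with the target model alone; the draft model only changes how many tokens are accepted per target step, which is where any speedup comes from.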

This model is ready for commercial/non-commercial use.

Note

nvidia/gpt-oss-120b-Eagle3-v2 is typically the better choice for use cases with context lengths under 8k tokens.

License/Terms of Use:

nvidia-open-model-license

Deployment Geography:

Global

Use Case:

Intended for developers designing AI agent systems, chatbots, retrieval-augmented generation (RAG) systems, and other AI-powered applications. Also suitable for typical instruction-following tasks.