Model Overview
Description:
The NVIDIA gpt-oss-120b Eagle3 model is the Eagle head for OpenAI’s gpt-oss-120b model, an auto-regressive language model that uses a mixture-of-experts (MoE) architecture with 5 billion activated parameters and 120 billion total parameters. For more information, please check here. The NVIDIA gpt-oss-120b Eagle3 model implements Eagle speculative decoding and was built with TensorRT Model Optimizer.
This model is ready for commercial/non-commercial use.
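To make the idea of speculative decoding mentioned in the description more concrete, the following is a minimal, generic sketch of a greedy draft-then-verify loop. It is not the Eagle3 algorithm or any NVIDIA API: the function names (`speculative_decode`, `greedy_decode`, `toy_target`, `toy_draft`) and the toy integer "models" are hypothetical and exist only for illustration. Real systems verify all drafted tokens in one batched forward pass of the target model, and EAGLE-style heads draft from the target model's internal features, both of which this toy omits.

```python
from typing import Callable, List

# A "model" here is just a function mapping a token context to the next token.
NextToken = Callable[[List[int]], int]


def greedy_decode(target_next: NextToken, prompt: List[int], max_new_tokens: int) -> List[int]:
    """Plain greedy decoding with the target model: one target call per token."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(target_next(tokens))
    return tokens


def speculative_decode(target_next: NextToken, draft_next: NextToken,
                       prompt: List[int], max_new_tokens: int, k: int = 4) -> List[int]:
    """Greedy speculative decoding: a cheap draft proposes up to k tokens,
    and the target keeps the longest prefix it agrees with."""
    tokens = list(prompt)
    limit = len(prompt) + max_new_tokens
    while len(tokens) < limit:
        # 1) Draft phase: propose up to k tokens with the cheap model.
        draft: List[int] = []
        for _ in range(min(k, limit - len(tokens))):
            draft.append(draft_next(tokens + draft))
        # 2) Verify phase: compare each proposal with the target's own choice.
        #    (Done position by position here; batched in real systems.)
        for i, proposed in enumerate(draft):
            expected = target_next(tokens + draft[:i])
            if proposed != expected:
                # Keep the agreed prefix, then take the target's token instead.
                tokens.extend(draft[:i])
                tokens.append(expected)
                break
        else:
            tokens.extend(draft)  # every drafted token was accepted
    return tokens


# Hypothetical toy models: the target counts upward modulo 10;
# the draft agrees except that it never predicts 7.
def toy_target(ctx: List[int]) -> int:
    return (ctx[-1] + 1) % 10


def toy_draft(ctx: List[int]) -> int:
    nxt = (ctx[-1] + 1) % 10
    return 0 if nxt == 7 else nxt


if __name__ == "__main__":
    print(speculative_decode(toy_target, toy_draft, [1], max_new_tokens=12))
```

In this greedy setting the output is identical to decoding with the target model alone; the draft only changes how many tokens can be accepted per verification step.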
Note
nvidia/gpt-oss-120b-Eagle3-v2 typically performs better for use cases with a context length of less than 8k.
License/Terms of Use:
Deployment Geography:
Global
Use Case:
Intended for developers designing AI agent systems, chatbots, RAG systems, and other AI-powered applications. Also suitable for typical instruction-following tasks.