NeMo

Jun 26, 2025
Run Google DeepMind’s Gemma 3n on NVIDIA Jetson and RTX
As of today, NVIDIA supports the general availability of Gemma 3n on NVIDIA RTX and Jetson. Gemma, previewed by Google DeepMind at Google I/O last month,...
4 MIN READ

Jun 25, 2025
Check Out Sovereign AI in Practice Through an NVIDIA Webinar
Join NVIDIA experts and leading European model builders on July 8 for a webinar on building and deploying multilingual large language models.
1 MIN READ

Jun 25, 2025
How to Streamline Complex LLM Workflows Using NVIDIA NeMo-Skills
A typical recipe for improving LLMs involves multiple stages: synthetic data generation (SDG), model training through supervised fine-tuning (SFT) or...
10 MIN READ

Jun 24, 2025
Upcoming Livestream: Beyond the Algorithm With NVIDIA
Join us on June 26 to learn how to distill cost-efficient models with the NVIDIA Data Flywheel Blueprint.
1 MIN READ

Jun 11, 2025
Chat With Your Enterprise Data Through Open-Source AI-Q NVIDIA Blueprint
Enterprise data is exploding—petabytes of emails, reports, Slack messages, and databases pile up faster than anyone can read. Employees are left searching for...
8 MIN READ

Jun 11, 2025
Build Efficient AI Agents Through Model Distillation With the NVIDIA Data Flywheel Blueprint
As enterprise adoption of agentic AI accelerates, teams face a growing challenge of scaling intelligent applications while managing inference costs. Large...
11 MIN READ

Jun 02, 2025
Scaling to Millions of Tokens with Efficient Long-Context LLM Training
The evolution of large language models (LLMs) has been marked by significant advancements in their ability to process and generate text. Among these...
7 MIN READ

May 30, 2025
NVIDIA Deep Learning Institute Offers Multilingual AI Training at GTC Paris
Large language models (LLMs) are capable of recognizing, summarizing, translating, predicting, and generating content. Yet even the most powerful LLMs face...
6 MIN READ

May 28, 2025
Spotlight: Build Scalable and Observable AI Ready for Production with Iguazio's MLRun and NVIDIA NIM
The collaboration between Iguazio (acquired by McKinsey) and NVIDIA empowers organizations to build production-grade AI solutions that are not only...
7 MIN READ

May 27, 2025
Advanced Optimization Strategies for LLM Training on NVIDIA Grace Hopper
In the previous post, Profiling LLM Training Workflows on NVIDIA Grace Hopper, we explored the importance of profiling large language model (LLM) training...
10 MIN READ

May 27, 2025
Profiling LLM Training Workflows on NVIDIA Grace Hopper
The rapid advancements in AI have resulted in an era of exponential growth in model sizes, particularly in the domain of large language models (LLMs). These...
12 MIN READ

May 14, 2025
Build Custom Reasoning Models with Advanced, Open Post-Training Datasets
Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from...
5 MIN READ

May 12, 2025
Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework
As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By...
6 MIN READ

May 09, 2025
Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research
Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...
11 MIN READ

May 08, 2025
Turbocharge LLM Training Across Long-Haul Data Center Networks with NVIDIA NeMo Framework
Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for...
6 MIN READ

Apr 23, 2025
Enhance Your AI Agent with Data Flywheels Using NVIDIA NeMo Microservices
Enterprise data is constantly changing. This presents significant challenges for maintaining AI system accuracy over time. As organizations increasingly rely on...
12 MIN READ