somusan's ckpt

Diffusion Policy Visuomotor Policy Learning Via Action Diffusion — Paper Explained

The problem statement Diffusion Policy solves is Visuomotor manipulation...

Posted on March 13, 2026

Introduction We all know diffusion models like DALL-E and Stable Diffusion for their ability to generate stunning images by iteratively removing noise. But what if we applied that exact same principle to robotic control? Diffusion Policy is a groundbreaking approach to visuomotor manipulation that adapts the DDPM architecture to solve imitation learning. Instead of converting a latent vector into an image, it learns to denoise a random sequence into a highly accurate “action chunk”—a trajectory of 7-DoF end-effector poses. By conditioning this denoising process on camera observations rather than text prompts, Diffusion Policy gracefully handles the multi-modal, non-Markovian nature of... [Read More]

Tags: VLA Vision Action Model Diffusion Policy

Best Project I've Worked on!

My work on Autonomous vehicle at IIIT Delhi

Posted on March 25, 2025

One of the best projects I worked on was developing and deploying an end-to-end Traffic Light Following ADAS feature on an autonomous vehicle at IIIT Delhi. Here is a demo: [Read More]

Tags: Portfolio