OVH became a data-driven business by developing an extensive data ingestion and processing pipeline utilizing Apache Flink, enabling it to manage over 200 databases and 10 million events per day. The company focused on reliable financial KPIs for investor accountability while transitioning to complex data transformations, ensuring efficient event order, and implementing checkpoints for monitoring. Future plans include automating processes, exploring Hive 3.0, and enhancing data transformation capabilities.