Skip to content

Conversation

@kimminsu38oo
Copy link
Contributor

@kimminsu38oo kimminsu38oo commented Nov 27, 2025

This PR adds operator-level profiling to the ggml-cpu backend.

Key Changes

  • Compile Option: Added GGML_CPU_OP_PROFILING to enable this feature.
  • Output: Saves operator execution times in ms to op_profiling.csv
  • Thread Safety: Implemented synchronization barriers to ensure accurate timing in multi-threaded environments.

Performance

  • Negligible runtime overhead.

Example Output
image

@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Nov 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant