Skip to content

Conversation

@remi-or
Copy link
Collaborator

@remi-or remi-or commented Oct 16, 2025

This PR:

  • adds throughput to the final pretty print at the end of the benchmark runs
  • adds the information of the output shape to the decoded text of the output (usefull to double check if throughput is right)
  • changes the behavior of the run_benchmarks.py script so that it will only run 3 configs unless instructed to do otherwise

This way, anyone can run a "quick"

python benchmark_v2/run_benchmarks.py --model-id "meta-llama/Meta-Llama-3-8B" -b 32 -s 128 -n 256

after changing the generate code or some model specific code to check perf did not take a big hit.
Thanks @SunMarc for the suggestion!

@remi-or remi-or requested review from McPatate and SunMarc October 16, 2025 14:19
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Member

@SunMarc SunMarc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot !

@remi-or remi-or merged commit f7c33ab into main Oct 16, 2025
14 checks passed
@remi-or remi-or deleted the small-bm-script branch October 16, 2025 15:25
ngazagna-qc pushed a commit to ngazagna-qc/transformers that referenced this pull request Oct 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants