Practice prompt calibration #670

Merged: 4 commits into main, Nov 5, 2024

Conversation

@wpietri (Contributor) commented Nov 5, 2024

No description provided.

…he 0.5 standards. Moving calibration testing to modelbench-private.
@wpietri requested a review from bkorycki November 5, 2024 18:56
@wpietri requested a review from a team as a code owner November 5, 2024 18:56
github-actions bot commented Nov 5, 2024

MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅

@bkorycki (Contributor) left a comment

Thanks for doing this!

Review threads on src/modelbench/run.py (resolved)
@bollacker (Collaborator) left a comment

I see nothing obviously wrong.

@dhosterman (Collaborator) left a comment

👍

@wpietri merged commit 10a1e74 into main Nov 5, 2024
4 checks passed
github-actions bot locked and limited conversation to collaborators Nov 5, 2024
src/modelbench/run.py:

      benchmarks.append(GeneralPurposeAiChatBenchmarkV1(l, "ensemble"))
    - run_result = run_benchmarks_for_suts(benchmarks, reference_suts, 100)
    + run_result = run_benchmarks_for_suts(benchmarks, reference_suts, None)

Contributor comment:

Style: for non-obvious arguments, I generally recommend naming them, for 3am me. E.g.

run_benchmarks_for_suts(benchmarks, reference_suts, what_this_does=None)
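
A minimal sketch of the pattern the comment suggests (the signature, the parameter name max_items, and the idea that None means "no cap" are all assumptions for illustration, not modelbench's actual API):

    from typing import Optional, Sequence

    def run_benchmarks_for_suts(benchmarks: Sequence, suts: Sequence,
                                max_items: Optional[int] = None) -> None:
        # Hypothetical stand-in for the real function; the parameter name
        # max_items and the "None means no cap" behavior are assumptions.
        for benchmark in benchmarks:
            items = getattr(benchmark, "items", [])
            if max_items is not None:
                items = items[:max_items]  # cap the run for a quick calibration pass
            print(f"running {len(items)} items against {len(suts)} SUTs")

    benchmarks, reference_suts = [], []

    # Positional call: a 3am reader can't tell what the third argument controls.
    run_benchmarks_for_suts(benchmarks, reference_suts, 100)

    # Keyword call, as recommended: self-documenting at the call site.
    run_benchmarks_for_suts(benchmarks, reference_suts, max_items=None)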

5 participants