Richard Feldman
908ef03502
Split out cron and non-cron unit evals ( #42472 )
...
Release Notes:
- N/A
---------
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de >
2025-11-11 13:45:48 -05:00
Richard Feldman
9e1e732db8
Use longer timeout on evals ( #42465 )
...
The GPT-5 ones in particular can take a long time!
Release Notes:
- N/A
---------
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de >
2025-11-11 16:37:20 +00:00
Richard Feldman
0268b17096
Add more secrets to eval workflows ( #42459 )
...
Release Notes:
- N/A
2025-11-11 16:07:57 +00:00
Conrad Irwin
359521e91d
Allow passing model_name to evals ( #42395 )
...
Release Notes:
- N/A
2025-11-10 23:00:52 +00:00
Conrad Irwin
c24f9e47b4
Try to download wasi-sdk ahead of time ( #42377 )
...
This hopefully resolves the lingering test failures on linux,
but also adds some logging just in case this isn't the problem...
Release Notes:
- N/A
---------
Co-authored-by: Ben Kunkle <ben@zed.dev >
2025-11-10 19:50:43 +00:00
Conrad Irwin
d075a56ee7
Fix merge conflict ( #41853 )
...
Closes #ISSUE
Release Notes:
- N/A
2025-11-03 13:41:39 -07:00
Ben Kunkle
2408f767f4
gh-workflow unit evals ( #41637 )
...
Closes #ISSUE
Release Notes:
- N/A *or* Added/Fixed/Improved ...
2025-11-01 22:45:44 -04:00