Actions: willccbb/verifiers

Showing runs from all workflows
389 workflow runs

detect when tool_calls is a list of JSON strings (#250)
Style #267: Commit 85ae8e4 pushed by willccbb
August 27, 2025 03:34 18s main
detect when tool_calls is a list of JSON strings (#250)
Test #210: Commit 85ae8e4 pushed by willccbb
August 27, 2025 03:34 44s main
Add MedAgentBench Envrionment
Test #208: Pull request #249 opened by Pranavb333
August 26, 2025 20:36 Action required Pranavb333:add-med_agent_bench
Add MedAgentBench Envrionment
Style #265: Pull request #249 opened by Pranavb333
August 26, 2025 20:36 Action required Pranavb333:add-med_agent_bench
Release version 0.1.3
Test #206: Commit 2106820 pushed by willccbb
August 26, 2025 11:55 50s main
Release version 0.1.3
Style #263: Commit 2106820 pushed by willccbb
August 26, 2025 11:55 10s main
revert version
Style #262: Commit 93b8b72 pushed by willccbb
August 26, 2025 09:40 12s main
fix saving dataset to HF, toolcall sanitizing (#246)
Test #205: Commit aef9f21 pushed by willccbb
August 26, 2025 08:14 46s main
fix saving dataset to HF, toolcall sanitizing (#246)
Style #261: Commit aef9f21 pushed by willccbb
August 26, 2025 08:14 9s main
August 26, 2025 07:23 9s
August 26, 2025 07:23 44s
Add sampling_args flag to vf-eval (#240)
Test #201: Commit 8e38e7f pushed by willccbb
August 26, 2025 03:44 41s main
Add sampling_args flag to vf-eval (#240)
Style #257: Commit 8e38e7f pushed by willccbb
August 26, 2025 03:44 9s main
Allow unsetting max_tokens in eval script (#241)
Test #199: Commit c054ff9 pushed by willccbb
August 26, 2025 03:28 42s main
Allow unsetting max_tokens in eval script (#241)
Style #255: Commit c054ff9 pushed by willccbb
August 26, 2025 03:28 12s main
MMLU example working, tui fixes (#243)
Style #254: Commit fcc0267 pushed by willccbb
August 26, 2025 03:23 9s main
MMLU example working, tui fixes (#243)
Test #198: Commit fcc0267 pushed by willccbb
August 26, 2025 03:23 50s main