Skip to content

Releases: OpenPipe/ART

v0.5.9

30 Jan 19:46
d0e2a5e

Choose a tag to compare

Release Highlights

What's Changed

  • Add tool support for RULER evaluation (#542)
  • Remove SkyPilot backend and docs (#541)
  • refactor: streamline AutoTrajectoryContext management (#540)
  • feat: group-level trajectory metadata (#539)
  • Correct Qwen3 30B model name and link (#538)
  • Document Qwen3 30B (#537)
  • Vendor tinker_cookbook under art package (#535)
  • feat: replace pyright with ty for type checking (#534)
  • feat(tinker): add TinkerNativeBackend (#532)
  • fix: align Tinker server port with client base_url (#531)
  • Fix model registration (#529)
  • fix: preserve tool-call context in tokenization (#527)
  • Fix OpenAI temperature handling (#526)
  • SFT preprocessing (#525)
  • Remove dev/playwright_agent directory (security + cleanup) (#524)
  • feat: add persistent state methods to Model (#522)
  • feat: Backend-First Training API (Phase 1) (#521)
  • Remove TorchTune service and related dev projects (#520)
  • feat: move trajectory logging from backend to frontend (#518)
  • feat: Replace pyright with ty for type checking (#517)
  • feat: Replace run_checks.sh with prek pre-commit hooks (#516)
  • Multi-checkpoint inference for pipelined training (RFC #513) (#515)
  • Use training_step for W&B x-axis to allow out-of-order logging (#512)
  • Update wandb and weave (#507)
  • Upgrade litellm, suppress bugs (#506)
  • bump: Release v0.5.7 (#505)

Full Changelog: v0.5.7...v0.5.9

v0.5.7

08 Jan 00:38
51528ca

Choose a tag to compare

Release Highlights

What's Changed

  • fix: Bump minimum openai version to 2.14.0 (#504)
  • Release v0.5.6 (#502)
  • fix: Pin vLLM to 0.13.0 (#501)

Full Changelog: v0.5.6...v0.5.7

v0.5.6

06 Jan 16:16
b4b9965

Choose a tag to compare

Release Highlights

What's Changed

  • fix: Pin vLLM to 0.13.0 (#501)
  • release: Bump version to 0.5.5 (#500)
  • feat: Add support for a LocalBackend Tinker model service (#499)

Full Changelog: v0.5.5...v0.5.6

v0.5.5

06 Jan 01:42
789e273

Choose a tag to compare

Release Highlights

What's Changed

  • feat: Add support for a LocalBackend Tinker model service (#499)
  • feat: Default loss to CISPO (#498)
  • refactor: Remove coupled Unsloth service, upgrade to vLLM 0.11+ (#497)
  • Deploy to model name + " (deployment)" (#496)
  • Support more LoRA models in WandbDeploymentConfig (#495)
  • Log metrics horizontally in W&B to simplify comparison to future runs (#494)
  • Return 0th checkpoint (#492)
  • Ensure bucket exists before pushing model weights (#491)
  • Release v0.5.4 (#488)
  • Revert "Update version to 0.5.4" (#487)
  • Update version to 0.5.4 (#485)

Full Changelog: v0.5.4...v0.5.5

v0.5.4

15 Dec 23:41
9c8a3d1

Choose a tag to compare

Release Highlights

This patch release adds checkpoint download functionality and includes fixes for SkyPilot and package configuration.

New Features

  • Download Checkpoint: Much simpler workflow for downloading checkpoints from training runs

Bug Fixes

  • Fixed --system flag for uv pip install on SkyPilot backend
  • Replaced deprecated uv.dev-dependencies with dependency-groups in package configuration

What's Changed

  • Add download checkpoint functionality (#464)
  • fix: add --system flag to uv pip install for SkyPilot backend (#462)
  • fix: replace deprecated uv.dev-dependencies with dependency-groups (#484)

Full Changelog: v0.5.3...v0.5.4

v0.5.3

24 Nov 16:14
08c4c29

Choose a tag to compare

Release Highlights

What's Changed

  • Release version 0.5.2 (#457)
  • Remove openpipe dependency (#456)
  • Add strip_logprobs utility function (#455)
  • fix: Handle RULER rewards when all trajectories are identical (#454)
  • Make copy.copy work for trajectories (#453)
  • Fix lint (#451)
  • feat: Add OpenEnv integration example (#445)
  • Release v0.5.1 (#442)

Full Changelog: v0.5.1...v0.5.3

v0.5.1

22 Oct 15:58
0036512

Choose a tag to compare

Release Highlights

What's Changed

  • Record auto_metrics on trajectory (#440)
  • Fix ruff formatting (#438)
  • Document Qwen3 14B instead of Qwen2.5 14B in non-tutorial examples (#437)
  • Small docs change (#433)
  • Serverless documentation (#432)
  • Retry GET and DELETE (#431)
  • Release v0.5.0 (#428)
  • Release v0.4.12 (#427)
  • feat: Add VLM support (#412)

Full Changelog: v0.5.0...v0.5.1

v0.5.0

07 Oct 05:57
fe5bd7d

Choose a tag to compare

Release Highlights

What's Changed

  • Release v0.4.12 (#427)

Full Changelog: v0.4.12...v0.5.0

v0.4.12

07 Oct 03:24
e3f7968

Choose a tag to compare

Release Highlights

What's Changed

  • Add client-side error capturing (#426)
  • Send wandb-artifact:/// prefix as part of inference name. (#425)
  • Report serverless metrics (#424)
  • feat: Add placeholder _log() implementation for ServerlessBackend (#422)
  • Upgrade SkyPilot to 0.10.3.post1 (#421)
  • fix: exit the loop in the monitor_openai_server (#418)
  • feat: Upgrade Unsloth (#408)
  • add playwright agent example to dev/ART (#402)
  • feat: Update SkyPilot to 0.10.2 (#393)
  • Update training details (#389)
  • LocalBackend._monitor_openai_server improvement (#388)
  • Add RULER scoring to LG integration example (#385)
  • Show open deep research tutorial (#384)
  • Minor open deep research doc edits (#383)
  • Add deep research tutorial (#382)
  • Show GSPO docs page (#381)
  • Add MCP•RL doc, remove GSPO (#380)
  • Add news item on LangGraph (#379)
  • gather_trajectory_groups: fused after_each callback to group await (#378)
  • Document RULER in README (#376)
  • Release v0.4.11 (#374)
  • Pin gql verion to fix (#372)
  • Update nb links (#371)
  • Allow run_checks.sh to succeed on mac (#370)
  • Add art.mcp package, release 0.4.10 (#369)

Full Changelog: v0.4.11...v0.4.12

v0.4.11

27 Aug 01:59
80bd0d2

Choose a tag to compare

Release Highlights

This patch release includes bug fixes, documentation improvements, and a new MCP package:

New Features

  • MCP Package: Added art.mcp package for Model Context Protocol integration

Bug Fixes

  • Fixed GraphQL dependency compatibility by pinning gql package to < 4
  • Fixed run_checks.sh script to succeed on macOS

Documentation & Examples

  • Updated notebook links and examples
  • Improved LangGraph integration documentation
  • Better documentation for wrap_rollout function

What's Changed

  • Add art.mcp package (#369)
  • Pin gql version to fix compatibility (#372)
  • Allow run_checks.sh to succeed on mac (#370)
  • Update nb links (#371)
  • Properly document wrap_rollout (#368)
  • Update LangGraph docs (#367)
  • Link to LangGraph (#365)
  • Update LangGraph integration doc (#363)

Full Changelog: v0.4.9...v0.4.11