Consolidate CLI RL logic by yatlas20098 · Pull Request #1602 · google/tunix

yatlas20098 · 2026-06-17T21:29:03Z

Consolidate RL CLI logic into a new base pipeline.

This CL introduces tunix/cli/base_rl_main.py with a BasePipeline abstract class to house common Reinforcement Learning (RL) CLI logic, refactored out of tunix/cli/grpo_rl_main.py. This change is being made to allow for code reuse when implementing a new PPO CLI.

Updates the GRPO CLI pipeline to inherit from BasePipeline and modifies GRPO CLI tests accordingly.

Change-Id: I88188941bef08525d96b5cdfd03146e198d3a0d1

Resolves #506889007

It's a good idea to open an issue first for discussion.

Reference

Colab Notebook

Checklist

I have added all the necessary unit tests for my change.
I have verified that my change does not break existing code and all unit tests pass.
I have added all appropriate doc-strings/documentation.
My PR is based on the latest changes of the main branch (if unsure, rebase the code).
I have signed the Contributor License Agreement.
I have followed Contribution Guidelines.

from GrpoPipeline. Update GrpoPipeline to inherit from BasePipeline. Change-Id: I88188941bef08525d96b5cdfd03146e198d3a0d1

Change-Id: I17bab9cb1a8a0c3fdb7d53a54b0825e47f4c689a

Change-Id: I23c889fd73524a9a3cc510c6da6d7f56be3b940c

Create BasePipeline in tunix/cli/base_rl_main.py to consolidate RL logic

7be2efa

from GrpoPipeline. Update GrpoPipeline to inherit from BasePipeline. Change-Id: I88188941bef08525d96b5cdfd03146e198d3a0d1

yatlas20098 requested a review from sizhit2 June 17, 2026 21:29

yatlas20098 requested review from abheesht17, hgao327, jiangyangmu, lc5211, s-noghabi, tianshub and wang2yn84 as code owners June 17, 2026 21:29

github-actions Bot assigned abheesht17 Jun 17, 2026

yatlas20098 temporarily deployed to testing June 17, 2026 21:29 — with GitHub Actions Inactive

Refactor CLI

461dbf0

Change-Id: I17bab9cb1a8a0c3fdb7d53a54b0825e47f4c689a

yatlas20098 had a problem deploying to testing June 18, 2026 20:16 — with GitHub Actions Error

base cli tests

5d70d58

Change-Id: I23c889fd73524a9a3cc510c6da6d7f56be3b940c

yatlas20098 had a problem deploying to testing June 18, 2026 20:22 — with GitHub Actions Error

yatlas20098 had a problem deploying to testing June 18, 2026 20:33 — with GitHub Actions Error

yatlas20098 force-pushed the yatlas-ppo-cli branch from a559cff to 5d70d58 Compare June 18, 2026 20:36

yatlas20098 temporarily deployed to testing June 18, 2026 20:36 — with GitHub Actions Inactive

yatlas20098 had a problem deploying to testing June 18, 2026 20:36 — with GitHub Actions Failure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consolidate CLI RL logic#1602

Consolidate CLI RL logic#1602
yatlas20098 wants to merge 3 commits into
mainfrom
yatlas-ppo-cli

yatlas20098 commented Jun 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yatlas20098 commented Jun 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants