Skip to content

Conversation

@github-actions
Copy link
Contributor

@github-actions github-actions bot commented Oct 9, 2025

🚀 Version bump to Version(__version__).major.Version(__version__).minor.Version(__version__).microrc1.dev0

pablo-garay and others added 28 commits August 19, 2025 11:22
* Remove ray deprecated dashboard-grpo-port arg

Signed-off-by: Charlie Truong <[email protected]>

* Fix nemo run ray cluster tests

Signed-off-by: Charlie Truong <[email protected]>

* Remove DASHBOARD_GRPC_PORT

Signed-off-by: Charlie Truong <[email protected]>

---------

Signed-off-by: Charlie Truong <[email protected]>
* add a grace for Jobs that may start in Unknown

Signed-off-by: Prekshi Vyas <[email protected]>

* add a grace for Jobs that may start in Unknown

Signed-off-by: Prekshi Vyas <[email protected]>

* add a grace for Jobs that may start in Unknown

Signed-off-by: Prekshi Vyas <[email protected]>

* fix linting

Signed-off-by: Prekshi Vyas <[email protected]>

* make the handling of Unknown job status better by polling

Signed-off-by: prekshivyas <[email protected]>

---------

Signed-off-by: Prekshi Vyas <[email protected]>
Signed-off-by: prekshivyas <[email protected]>
Co-authored-by: prekshivyas <[email protected]>
* add image pull secrets for lepton

Signed-off-by: Pablo Garay <[email protected]>

* update format

Signed-off-by: Pablo Garay <[email protected]>

---------

Signed-off-by: Pablo Garay <[email protected]>
Allow users to specify an existing node reservation with the
LeptonExecutor to be able to run on dedicated resources.

Signed-off-by: Robert Clark <[email protected]>
Signed-off-by: Romil Bhardwaj <[email protected]>
Signed-off-by: Romil Bhardwaj <[email protected]>
…c cloud sync (#335)

* fix: support for SkyPilot Storage configurations in file_mounts

- Modified SkypilotExecutor to handle both string paths and dict configs in file_mounts
- Dictionary configs are automatically converted to sky.Storage objects
- Enables automatic cloud storage mounting (GCS, S3, etc.) for outputs

This change allows users to specify cloud storage backends directly in
file_mounts, enabling automatic synchronization of training outputs to
cloud storage without manual rsync operations.

Signed-off-by: Andy Lee <[email protected]>

* refactor: Separate storage_mounts from file_mounts for cleaner API

Signed-off-by: Andy Lee <[email protected]>

* test: Add unit tests for storage_mounts functionality

- Test storage_mounts parameter initialization
- Test to_task() method with storage_mounts configurations
- Test combined file_mounts and storage_mounts usage
- Verify Storage.from_yaml_config() integration
- Ensure backward compatibility when storage_mounts is None

Signed-off-by: Andy Lee <[email protected]>

* fix tests

Signed-off-by: Hemil Desai <[email protected]>

---------

Signed-off-by: Andy Lee <[email protected]>
Signed-off-by: Hemil Desai <[email protected]>
Co-authored-by: Hemil Desai <[email protected]>
* Create SkypilotJobsExecutor to allow running managed jobs with Skypilot API

Signed-off-by: Rahim Dharssi <[email protected]>

* Remove unnecessary comments

Signed-off-by: Rahim Dharssi <[email protected]>

* fix lints

Signed-off-by: Rahim Dharssi <[email protected]>

* Add comment for suppressing import error

Signed-off-by: Rahim Dharssi <[email protected]>

* Write unit tests for _save_job_dir and _get_job_dirs

Signed-off-by: Rahim Dharssi <[email protected]>

* Fix lints

Signed-off-by: Rahim Dharssi <[email protected]>

---------

Signed-off-by: Rahim Dharssi <[email protected]>
* Refactor tar packaging logic for improved performance and simplicity

Signed-off-by: smajumdar <[email protected]>

* Clarify tar repacking logic to avoid issues with concatenating tar files

Signed-off-by: smajumdar <[email protected]>

* Remove redundant test for concatenating tar files on Linux

Signed-off-by: smajumdar <[email protected]>

* spell check fix

Signed-off-by: Hemil Desai <[email protected]>

---------

Signed-off-by: smajumdar <[email protected]>
Signed-off-by: Hemil Desai <[email protected]>
Co-authored-by: Hemil Desai <[email protected]>
* Fixing documentation layout

Signed-off-by: Andrew Schilling <[email protected]>

* documentation.md

Signed-off-by: Andrew Schilling <[email protected]>

* Removing live-server

Signed-off-by: Andrew Schilling <[email protected]>

* Correctin .vscode

Signed-off-by: Andrew Schilling <[email protected]>

---------

Signed-off-by: Andrew Schilling <[email protected]>
Signed-off-by: Andrew Schilling <[email protected]>
Signed-off-by: Pablo Garay <[email protected]>
* fix: Emit exit-code of docker runs

Signed-off-by: oliver könig <[email protected]>

* fix test

Signed-off-by: oliver könig <[email protected]>

* fixes

Signed-off-by: oliver könig <[email protected]>

* refactor

Signed-off-by: oliver könig <[email protected]>

* cleanup

Signed-off-by: oliver könig <[email protected]>

* add scheduler test

Signed-off-by: oliver könig <[email protected]>

* more scheduler tests

Signed-off-by: oliver könig <[email protected]>

* test executor

Signed-off-by: oliver könig <[email protected]>

* formatting

Signed-off-by: oliver könig <[email protected]>

---------

Signed-off-by: oliver könig <[email protected]>
Signed-off-by: Pablo Garay <[email protected]>
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: NeMo Bot <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
…_version__).minor.Version(__version__).microrc1.dev0` !

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@github-actions github-actions bot force-pushed the ci/bump-Version(__version__).major.Version(__version__).minor.Version(__version__).microrc1.dev0 branch from b4a3020 to 0ba0b24 Compare December 3, 2025 23:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.