Description
Hello RD-Agent Team,
Thank you for your great work on this promising project.
I would like to request a feature that allows the agent to resume from its last state if a task is interrupted.
Problem:
Currently, if a long-running task is interrupted for any reason (e.g., system reboot, network issue, or manual stop), it seems the only option is to restart the entire task from the beginning. This can be very time-consuming and inefficient, especially for complex R&D processes.
Proposed Solution:
It would be incredibly helpful if the agent could periodically save its state (create a checkpoint) to the --log_dir. When the agent is restarted, either with the same command or with a dedicated --resume flag, it could check the log directory for a saved state and continue the task from where it left off instead of starting over.
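To illustrate the idea, here is a minimal Python sketch of the checkpoint-and-resume pattern. It is not based on RD-Agent's actual internals: the file name `checkpoint.json`, the functions `save_checkpoint`, `load_checkpoint`, and `run_loop`, the shape of the state dictionary, and the `resume` parameter (standing in for the proposed --resume flag) are all hypothetical. The `stop_after` argument exists only to simulate an interruption.

```python
import json
import os
import tempfile
from pathlib import Path

CHECKPOINT_NAME = "checkpoint.json"  # hypothetical file name inside the log dir


def save_checkpoint(log_dir, state):
    """Write the state to <log_dir>/checkpoint.json atomically."""
    log_dir = Path(log_dir)
    log_dir.mkdir(parents=True, exist_ok=True)
    # Write to a temp file in the same directory, then rename it into place,
    # so an interruption mid-write never leaves a truncated checkpoint.
    fd, tmp_path = tempfile.mkstemp(dir=log_dir, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as f:
            json.dump(state, f)
            f.flush()
            os.fsync(f.fileno())
        os.replace(tmp_path, log_dir / CHECKPOINT_NAME)
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise


def load_checkpoint(log_dir):
    """Return the saved state, or None if no checkpoint exists."""
    path = Path(log_dir) / CHECKPOINT_NAME
    if not path.exists():
        return None
    with path.open() as f:
        return json.load(f)


def run_loop(log_dir, total_steps, resume=False, stop_after=None):
    """Run a toy multi-step task, checkpointing after every step."""
    state = load_checkpoint(log_dir) if resume else None
    if state is None:
        state = {"next_step": 0, "results": []}  # fresh start

    executed = 0
    while state["next_step"] < total_steps:
        if stop_after is not None and executed >= stop_after:
            break  # simulate an interruption (reboot, network issue, manual stop)
        step = state["next_step"]
        state["results"].append(step * step)  # placeholder for one unit of real work
        state["next_step"] = step + 1
        save_checkpoint(log_dir, state)
        executed += 1
    return state
```

In this sketch, calling `run_loop(log_dir, 5, stop_after=2)` and then `run_loop(log_dir, 5, resume=True)` completes only the remaining three steps on the second call. A real implementation would also need to decide what counts as resumable state for each loop step (e.g., prompts, intermediate artifacts, experiment results), which I have left out here.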
This feature would make the RD-Agent much more robust and practical for real-world, long-duration tasks.
Thank you for considering this request!