ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

Paper | Model | Real-World Data

About

ForeAct is a visual foresight planner that empowers VLAs with the ability to anticipate future observations, enabling more informed decision-making.
ForeAct is general and plug-and-play: state-of-the-art VLAs can seamlessly incorporate ForeAct without any architectural modification.
ForeAct is highly efficient, generating a high-fidelity 640 $\times$ 480 future observation in just 0.33s on a single H100 GPU.

Demo

Usage

Environment Setup

git clone https://github.com/mit-han-lab/foreact
cd foreact
bash environment_setup.sh foreact

Finetune

Download the pretrained weights and prepare your own real-world data (or use our processed real-world data). Update the relevant paths in configs/finetune.yaml, then launch:

bash scripts/run_finetune.sh

Inference

### CLI
python app_cli.py --checkpoint_path path/to/model --prompt "" --input_image path/to/image  --output_dir ./results

### Gradio
python app.py --checkpoint_path path/to/model

VLA Training

We provide examples regarding policy training in ./third-party/lerobot.

Acknowledgements

Thanks to metaquery, diffusers, lerobot for the wonderful open-source codebase.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
configs		configs
dataloaders		dataloaders
models		models
scripts		scripts
third-party/lerobot		third-party/lerobot
utils		utils
README.md		README.md
app.py		app.py
app_cli.py		app_cli.py
environment_setup.sh		environment_setup.sh
pipeline.py		pipeline.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

Paper | Model | Real-World Data

About

Demo

Usage

Environment Setup

Finetune

Inference

VLA Training

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

mit-han-lab/foreact

Folders and files

Latest commit

History

Repository files navigation

ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

Paper | Model | Real-World Data

About

Demo

Usage

Environment Setup

Finetune

Inference

VLA Training

Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages