- Masters in robotics at the University of Maryland, College Park.
- [email protected]
Pinned Loading
-
-
Minimal-GRPO
Minimal-GRPO PublicImplementation of Group Relative Policy Optimization (GRPO) to fine-tune Open Language Models like LlaMa-3.2, Qwen2 for Math Tasks.
Python 2
-
Informed-RRT-star
Informed-RRT-star Publicinformed RRT* for path planning in N dimensions
Jupyter Notebook 5
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.