Pinned Loading
-
Open-Llama
Open-Llama PublicThe complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
-
TITAN-RL
TITAN-RL PublicTITAN-RL is a distributed reinforcement learning framework that separates policy rollout, experience storage, and training into independent microservices. This design enables flexible scaling and e…
Python
-
huggingface/transformers
huggingface/transformers Public🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
baichuan-inc/Baichuan-7B
baichuan-inc/Baichuan-7B PublicA large-scale 7B pretraining language model developed by BaiChuan-Inc.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.