xrsrke/README.md

Hi there 👋

currently distributed training @ nous research. ex-nanotron @ huggingface. DMs open

https://phucnguyen.dev.

Best way to reach me: Discord (neuralink) or Twitter (@xariusrke)

Pinned

  1. huggingface/nanotron (Public)

    Minimalistic large language model 3D-parallelism training

    Python · 2.7k stars · 320 forks

  2. pipegoose (Public)

    Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

    Python · 87 stars · 20 forks

  3. instructGOOSE (Public)

    Implementation of Reinforcement Learning from Human Feedback (RLHF)

    Jupyter Notebook · 172 stars · 21 forks

  4. toolformer (Public)

    Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools

    Jupyter Notebook · 145 stars · 14 forks

  5. reinforcement-learning (Public)

    Jupyter Notebook · 10 stars · 1 fork