Organizations

@anyscale @NovaSky-AI

SumanthRH/README.md

Hi there 👋

  • 😄 I'm Sumanth, a software engineer at Anyscale, working on post-training. My primary interests are broadly in machine learning and systems engineering.
  • 🚀 I'm trying to understand generative models, and have worked on finetuning and in-context learning for language models. Addicted to compute 🤖
  • 💻 I'm currently working on SkyThought and SkyRL.
  • 🌱 I'm trying to learn what it takes to build machine learning systems in practice.
  • ✨ I have a blog: https://sumanthrh.com
  • 💬 Some samples of my writing:

Pinned

  1. NovaSky-AI/SkyRL

    SkyRL: A Modular Full-stack RL Library for LLMs

    Python · 1.7k stars · 289 forks

  2. NovaSky-AI/SkyThought

    Sky-T1: Train your own O1 preview model within $450

    Python · 3.4k stars · 342 forks

  3. tokenization

    A comprehensive deep dive into the world of tokens

    Python · 228 stars · 10 forks

  4. frankxwang/dpo-prefix-sharing

    DPO, but faster 🚀

    Python · 51 stars · 5 forks

  5. peft

    Forked from huggingface/peft

    Fork of 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. Our implementation of IA3, a new fine-tuning method, is now part of the official Hugging Face library!

    Python

  6. varun19299/deep-atrous-guided-filter

    Deep Atrous Guided Filter for Image Restoration in Under Display Cameras (UDC Challenge, ECCV 2020).

    Jupyter Notebook · 38 stars · 6 forks