wenzhaoabc

8 followers · 11 following

Tongji University
Shanghai, China
14:50 (UTC +08:00)
https://wenzhaoabc.github.io
https://wenzhaoabc.com

wenzhaoabc/README.md

Hi there 👋

🧑‍💻 About Me

I'm currently pursuing my Master's degree, with a focus on bridging the gap between theory and implementation. My interests:

🤖 LLM Post-training (RLHF / PPO / GRPO)
🧠 Reinforcement Learning for decision systems
🔎 Retrieval-Augmented Generation (RAG)
🛡️ Reliability / safety / stability

🛠 Skills

📫 Contacts

Pinned

llm-tap-rl (Public)

Reinforcement Learning for LLM

Jupyter Notebook · 38 stars · 1 fork

mmkg-rag (Public)

Enhancing Retrieval-Augmented Generation with Multi-Modal Knowledge Graph Integration

Python · 15 stars · 2 forks

verl-project/verl (Public)

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python · 22.2k stars · 4.2k forks

minillm (Public)

A lightweight implementation of LLMs with PyTorch and Transformers.

Python · 3 stars

tinyhttpd (Public)

tinyhttpd

C · 3 stars

o8oo8o/WebCurl (Public)

A minimalist, web-based API debugging tool

HTML · 597 stars · 91 forks