ChangyWen

Follow

👾

Focusing

ChangyWen

👾

Focusing

Follow

10 followers · 8 following

The University of Hong Kong
Hong Kong, China

Achievements

Achievements

Highlights

Pro

Pinned Loading

wolpertinger_ddpg wolpertinger_ddpg Public

Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.

Python 66 17
STCNet-for-Smoke-Detection STCNet-for-Smoke-Detection Public

STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection

Python 15 2
ReasoningBank ReasoningBank Public

ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

Python 1
TruthRL TruthRL Public

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Python 1
PolitiFact-scraping PolitiFact-scraping Public

PolitiFact Scarper

Python 1 1
A3C_RS_ML A3C_RS_ML Public

low_memory version

Python