Skip to content
View ChangyWen's full-sized avatar
👾
Focusing
👾
Focusing
  • The University of Hong Kong
  • Hong Kong, China

Highlights

  • Pro

Block or report ChangyWen

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. wolpertinger_ddpg wolpertinger_ddpg Public

    Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.

    Python 66 17

  2. STCNet-for-Smoke-Detection STCNet-for-Smoke-Detection Public

    STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection

    Python 15 2

  3. ReasoningBank ReasoningBank Public

    ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

    Python 1

  4. TruthRL TruthRL Public

    TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

    Python 1

  5. PolitiFact-scraping PolitiFact-scraping Public

    PolitiFact Scarper

    Python 1 1

  6. A3C_RS_ML A3C_RS_ML Public

    low_memory version

    Python