AI Reading Club

A lightweight repository for running an AI Reading Club on foundational papers in modern language models.

Positioning:

From papers to executable understanding.

The club reads foundational AI papers, discusses what they really mean, and connects them to modern LLM systems. Some sessions now pair with executable workshop artifacts in hghalebi/rust-ml, especially for Rust, typed tiny ML, and category-theory-inspired reconstruction. An additional companion resource is hghalebi/category_theory_transformer_rs, where we implement a tiny ML model in Rust from scratch through a category-theory lens.

Logistics

Cadence: one paper every two weeks
Format: 10-15 minute volunteer overview, followed by about 45 minutes of discussion
Joining (Discord): https://discord.gg/5rAMsuVXXp
Schedule: sessions/schedule-2026.md (started on 2026-03-11; confirmed history is tracked in docs/workshop-history.md; no sessions in August)

See:

docs/workshop-history.md (confirmed session and workshop archive)
docs/announcement-template.md (announcement template)
docs/why-read.md (motivation)
docs/organizer-tips.md (organiser tips)

Curriculum (14 Papers)

Module 1: Foundations and Architecture

Neural Machine Translation of Rare Words with Subword Units (2015)
Attention Is All You Need (2017)

Module 2: Interpretability (Inside the Black Box)

What Does BERT Look At? An Analysis of BERT's Attention (2019)
Attention is not Explanation (2019)
Transformer Feed-Forward Layers Are Key-Value Memories (2020)

Module 3: Generation and Decoding

The Curious Case of Neural Text Degeneration (2019)

Module 4: The Data Foundation

Datasheets for Datasets (2018)
Croissant: A Metadata Format for ML-Ready Datasets (2024)

Module 5: Efficiency and Scaling

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (2022)
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale (2022)

Module 6: Fine-Tuning and Alignment

LoRA: Low-Rank Adaptation of Large Language Models (2021)
QLoRA: Efficient Finetuning of Quantized LLMs (2023)
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning (2023)
LIMA: Less Is More for Alignment (2023)

Supplemental Papers

Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity (2021)

Detailed rationale and paper links are in curriculum/README.md.

How to Run Sessions

Create one GitHub issue per paper (use the "Paper Session" issue template).
Assign a discussion lead for each session; they prepare a short slide deck or document.
Add three guiding questions before the session so the discussion has a clear starting point.
If the mathematics is dense, focus on the abstract, introduction, diagrams, and conclusion.

Repository Layout

curriculum/: the ordered reading list + paper links
docs/: announcements and organiser guidance
sessions/: session notes and templates
docs/workshop-history.md: confirmed AI Reading Club and Rust/ML workshop history
sections/: workshop and implementation assets grouped by module, including BPE materials under sections/tokenization/
sections/tokenization/ch02/: BPE notebook walkthrough and assets
sections/tokenization/rust_bpe_tokenizer/: Rust BPE implementation used in the same module
sections/bert_attention_paper/: Rust walkthrough that reimplements the BERT attention-analysis paper with runnable step-by-step binaries
.github/: issue templates and PR template

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Reading Club

Logistics

Curriculum (14 Papers)

Module 1: Foundations and Architecture

Module 2: Interpretability (Inside the Black Box)

Module 3: Generation and Decoding

Module 4: The Data Foundation

Module 5: Efficiency and Scaling

Module 6: Fine-Tuning and Alignment

Supplemental Papers

How to Run Sessions

Repository Layout

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github		.github
assets		assets
curriculum		curriculum
docs		docs
sections		sections
sessions		sessions
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

AI Reading Club

Logistics

Curriculum (14 Papers)

Module 1: Foundations and Architecture

Module 2: Interpretability (Inside the Black Box)

Module 3: Generation and Decoding

Module 4: The Data Foundation

Module 5: Efficiency and Scaling

Module 6: Fine-Tuning and Alignment

Supplemental Papers

How to Run Sessions

Repository Layout

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages