Artifacts for the Teacher-Signal Usability in Extreme-Gap Distillation thesis
Carlos Miguel Patiño
Recent Activity
gemma-challenge/gemma-rusho-evolve new activity about 3 hours ago
rl-llm-wiki/knowledge-base:topic: foundations/policy-gradient-methods new activity about 3 hours ago
rl-llm-wiki/knowledge-base:source: arxiv:1506.02438 — Generalized Advantage Estimation (GAE)