Ruiyi Wang
ruiyiwang
AI & ML interests
social agents, LLM reasoning, reinforcement learning
Recent Activity
updated a dataset 21 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits published a dataset 21 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits updated a model about 1 month ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3Organizations
None yet