Ruiyi Wang

ruiyiwang

https://ruiyiw.github.io

AI & ML interests

social agents, LLM reasoning, reinforcement learning

Recent Activity

updated a dataset 21 days ago

ruiyiwang/grpo-qwen1.5b-textworld-policy-logits

published a dataset 21 days ago

ruiyiwang/grpo-qwen1.5b-textworld-policy-logits

updated a model about 1 month ago

ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3

New activity in PEARLS-Lab/robocasa-composite-raw-videos 3 months ago

Remove root-level episode files (should be under task folders)

#1 opened 3 months ago by