Polixir

company

http://polixir.ai/

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

xionghuichen authored a paper about 5 hours ago

Adversarial Counterfactual Environment Model Learning

xionghuichen authored a paper about 5 hours ago

NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios

xionghuichen authored a paper about 5 hours ago

Soft Adaptive Policy Optimization

View all activity

xionghuichen

authored 8 papers about 5 hours ago

Adversarial Counterfactual Environment Model Learning

Paper • 2206.04890 • Published Jun 10, 2022

NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios

Paper • 2503.19267 • Published Mar 25, 2025

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Paper • 2606.17030 • Published 8 days ago • 28

Qwen-RobotManip Technical Report: Alignment Unlocks Scale for Robotic Manipulation Foundation Models

Paper • 2606.17846 • Published 6 days ago

Qwen-RobotNav Technical Report: A Scalable Navigation Model Designed for an Agentic Navigation System

Paper • 2606.18112 • Published 5 days ago

xionghuichen

authored a paper 7 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 163

xionghuichen

authored a paper 11 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

xionghuichen

authored a paper about 1 year ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

xionghuichen

in polixirai/NeoRL2 about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

xionghuichen

updated a dataset about 1 year ago

polixirai/NeoRL2

Viewer • Updated May 19, 2025 • 981k • 462 • 2

xionghuichen

published a dataset about 1 year ago

polixirai/NeoRL2

Viewer • Updated May 19, 2025 • 981k • 462 • 2

xionghuichen

authored 5 papers about 1 year ago

Language Model Self-improvement by Reinforcement Learning Contemplation

Paper • 2305.14483 • Published May 23, 2023 • 1

AFlow: Automating Agentic Workflow Generation

Paper • 2410.10762 • Published Oct 14, 2024 • 2

Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems

Paper • 2305.04832 • Published May 3, 2023

A Survey on Model-based Reinforcement Learning

Paper • 2206.09328 • Published Jun 19, 2022

Offline Reinforcement Learning with Causal Structured World Models

Paper • 2206.01474 • Published Jun 3, 2022

AI & ML interests

Recent Activity

Team members 1

polixirai's activity

[bot] Conversion to Parquet