Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2603.16856

adavanced learning

Efficient Exploration at Scale

Paper • 2603.17378 • Published 12 days ago • 13
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57
Hyperagents

Paper • 2603.19461 • Published 11 days ago • 36

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57
Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published 13 days ago • 20

Continual Learning

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57

CoLLM: A Large Language Model for Composed Image Retrieval

Paper • 2503.19910 • Published Mar 25, 2025 • 15
Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15, 2025 • 83
OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 80
Dynamic Chunking Diffusion Transformer

Paper • 2603.06351 • Published 24 days ago • 14

FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use

Paper • 2603.08262 • Published 21 days ago • 42
On-Policy Context Distillation for Language Models

Paper • 2602.12275 • Published Feb 12 • 3
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57
Mixture-of-Depths Attention

Paper • 2603.15619 • Published 14 days ago • 79

about 6 hours ago

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 152
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published 14 days ago • 148
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published 19 days ago • 151
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published 24 days ago • 117

Self Supervision

Self-Supervised Prompt Optimization

Paper • 2502.06855 • Published Feb 7, 2025 • 18
Context Learning for Multi-Agent Discussion

Paper • 2602.02350 • Published Feb 2 • 4
XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 18 days ago • 32
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57

Self Improvement

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 69
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57

adavanced learning

Efficient Exploration at Scale

Paper • 2603.17378 • Published 12 days ago • 13
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57
Hyperagents

Paper • 2603.19461 • Published 11 days ago • 36

FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use

Paper • 2603.08262 • Published 21 days ago • 42
On-Policy Context Distillation for Language Models

Paper • 2602.12275 • Published Feb 12 • 3
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57
Mixture-of-Depths Attention

Paper • 2603.15619 • Published 14 days ago • 79

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57
Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published 13 days ago • 20

about 6 hours ago

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 152
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published 14 days ago • 148
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published 19 days ago • 151
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published 24 days ago • 117

Continual Learning

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57

Self Supervision

Self-Supervised Prompt Optimization

Paper • 2502.06855 • Published Feb 7, 2025 • 18
Context Learning for Multi-Agent Discussion

Paper • 2602.02350 • Published Feb 2 • 4
XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 18 days ago • 32
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57

CoLLM: A Large Language Model for Composed Image Retrieval

Paper • 2503.19910 • Published Mar 25, 2025 • 15
Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15, 2025 • 83
OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 80
Dynamic Chunking Diffusion Transformer

Paper • 2603.06351 • Published 24 days ago • 14

Self Improvement

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 69
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 13 days ago • 57

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs