Collections
Discover the best community collections!
Collections including paper arxiv:2603.16856
-
CoLLM: A Large Language Model for Composed Image Retrieval
Paper • 2503.19910 • Published • 15 -
Parallel Scaling Law for Language Models
Paper • 2505.10475 • Published • 83 -
OLMoE: Open Mixture-of-Experts Language Models
Paper • 2409.02060 • Published • 80 -
Dynamic Chunking Diffusion Transformer
Paper • 2603.06351 • Published • 14
-
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use
Paper • 2603.08262 • Published • 42 -
On-Policy Context Distillation for Language Models
Paper • 2602.12275 • Published • 3 -
Online Experiential Learning for Language Models
Paper • 2603.16856 • Published • 57 -
Mixture-of-Depths Attention
Paper • 2603.15619 • Published • 79
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 152 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 148 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 151 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 117
-
Self-Supervised Prompt Optimization
Paper • 2502.06855 • Published • 18 -
Context Learning for Multi-Agent Discussion
Paper • 2602.02350 • Published • 4 -
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
Paper • 2603.12056 • Published • 32 -
Online Experiential Learning for Language Models
Paper • 2603.16856 • Published • 57
-
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use
Paper • 2603.08262 • Published • 42 -
On-Policy Context Distillation for Language Models
Paper • 2602.12275 • Published • 3 -
Online Experiential Learning for Language Models
Paper • 2603.16856 • Published • 57 -
Mixture-of-Depths Attention
Paper • 2603.15619 • Published • 79
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 152 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 148 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 151 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 117
-
Self-Supervised Prompt Optimization
Paper • 2502.06855 • Published • 18 -
Context Learning for Multi-Agent Discussion
Paper • 2602.02350 • Published • 4 -
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
Paper • 2603.12056 • Published • 32 -
Online Experiential Learning for Language Models
Paper • 2603.16856 • Published • 57
-
CoLLM: A Large Language Model for Composed Image Retrieval
Paper • 2503.19910 • Published • 15 -
Parallel Scaling Law for Language Models
Paper • 2505.10475 • Published • 83 -
OLMoE: Open Mixture-of-Experts Language Models
Paper • 2409.02060 • Published • 80 -
Dynamic Chunking Diffusion Transformer
Paper • 2603.06351 • Published • 14