
Deep Dive into Open-Source RL for Large-Scale LLMs: DAPO
DAPO is an open-source RL framework that enhances LLM reasoning efficiency, achieving top-tier AIME
SmolDocling, a 256M VLM, enables efficient document conversion using DocTags to preserve structure while reducing
Chain of Draft (CoD) optimizes LLM efficiency by reducing verbosity while maintaining accuracy. It cuts
DeepSeek’s MLA reduces KV cache memory via low-rank compression and decoupled positional encoding, enabling efficient
DRAMA enhances dense retrieval by leveraging LLM-based data augmentation and pruning to create efficient, high-performance
AI co-scientists powered by Gemini 2.0 accelerate scientific discovery by generating and ranking hypotheses using
SWE-Lancer benchmarks AI models on 1,400+ real freelance software engineering tasks worth $1M, evaluating their
Step-Video-T2V, a cutting-edge text-to-video model with 30B parameters, enhances video quality using Video-VAE, Video-DPO, and
Nomic Embed Text V2 revolutionizes text embeddings with Mixture-of-Experts (MoE), enhancing efficiency, multilingual support, and
TAID enhances LLM distillation by dynamically interpolating student-teacher distributions, solving capacity gaps and mode collapse.