
Deep Dive into Open Source RL for Large Scale LLMs DAPO
DAPO is an open-source RL framework that enhances LLM reasoning efficiency, achieving top-tier AIME
DAPO is an open-source RL framework that enhances LLM reasoning efficiency, achieving top-tier AIME
SmolDocling, a 256M VLM, enables efficient document conversion using DocTags to preserve structure while reducing
Chain of Draft (CoD) optimizes LLM efficiency by reducing verbosity while maintaining accuracy. It cuts
Author(s): Abishek V Ashok
Author(s): Abhishek Kumar, Dr.Pradyut Sarkar
Author(s): Ashutosh Kothiwala, Aravind Chandramouli
Author(s): Harmeet Thukran, Neeti Kashyap
Author(s): Kejitan Dontas, Krishna Kumar Tiwari
Author: Mayukh Chowdhury
Author: Nagarjun Gururaj
Author(s): Prateek Khandelwal, Anuj Khandelwal, Snigdha Agarwal
Author(s); Rupesh Wadibhasme, Amit Nandi, Bhavesh Wadibhasme, Sandip Sawarkar
Author(s): Surya Pratap Singh, Harmeet Thukran
We noticed you're visiting from India. We've updated our prices to Indian rupee for your shopping convenience. Use United States (US) dollar instead. Dismiss