NEW POST
DeepSeek's LLMs made a big splash, but more interesting is their recent research papers. Shayan Mohanty writes an overview of them, outlining their three main arcs: efficiency, HPC Co-Design, and RL for emergent reasoning.
NEW POST
DeepSeek's LLMs made a big splash, but more interesting is their recent research papers. Shayan Mohanty writes an overview of them, outlining their three main arcs: efficiency, HPC Co-Design, and RL for emergent reasoning.
GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.
All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.