Hacker News Digest

Тег: #attention-mechanism

Постов: 1

Native Sparse Attention (aclanthology.org)

by CalmStorm • 01 августа 2025 г. в 19:48 • 139 points

ОригиналHN

#attention-mechanism#natural-language-processing#machine-learning

Комментарии (31)

Deep seek papers are a must to read for anyone who wants to understand how to make LLMs operate at hyper scale. All western labs hide their best results, or at most release summaries that are about as meaningful as the answers Cleo used to give on stack exchange: https://math.sta