January 13 — DeepSeek today released a new paper, "Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models".
On January 13, 2026, DeepSeek, in collaboration with Peking University, released the new paper "Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models", with founder Liang Wenfeng among the co-authors. The paper introduces the concept of conditional memory, using a scalable lookup structure to address inefficient knowledge retrieval in large language models. On the same day, the team ...
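The paper's mechanism is not detailed in this report. As a generic, illustrative sketch only (not DeepSeek's actual method; all names, shapes, and parameters here are hypothetical), a "conditional memory via lookup" layer can be pictured as a large key-value table from which each query activates only a few slots, so most memory parameters stay untouched per token:

```python
import numpy as np

def conditional_memory_lookup(query, keys, values, k=4):
    """Sparse memory read: score every key against the query, keep the
    top-k slots, and return a softmax-weighted mix of their values.
    Only k of the memory rows contribute per query, which is the sense
    in which lookup adds an axis of sparsity."""
    scores = keys @ query                       # (num_slots,) similarity scores
    topk = np.argpartition(scores, -k)[-k:]     # indices of the k best slots
    w = np.exp(scores[topk] - scores[topk].max())
    w /= w.sum()                                # softmax over the k winners only
    return w @ values[topk]                     # (dim,) retrieved memory vector

# Toy usage with a randomly initialized (hypothetical) memory table.
rng = np.random.default_rng(0)
keys = rng.standard_normal((1024, 64))
values = rng.standard_normal((1024, 64))
query = rng.standard_normal(64)
out = conditional_memory_lookup(query, keys, values, k=4)
print(out.shape)
```

The design intuition being illustrated: the table can grow (more slots) without growing per-token compute, since cost is dominated by the top-k selection rather than by reading every slot's value.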