PANews 1月21日消息,据量子位报道,DeepSeek在GitHub更新的FlashMLA代码中首次出现“MODEL1”名称,覆盖114个文件中28处提及,且与现有版本V32(DeepSeek-V3.2)并列,暗示MODEL1为下一代新架构模型。代码差异显示该模型在KV缓存布局、稀疏性处理及FP8解码等方面进行了优化,或将在春节前后正式发布。结合近期公开的mHC残差连接机制与Engram记忆模块,MODEL1有望整合多项自研创新。
DeepSeek新模型MODEL1代码曝光,疑为全新架构
Favorite
Share
Disclaimer: This article is copyrighted by the original author and does not represent MyToken’s views and positions. If you have any questions regarding content or copyright, please contact us.(www.mytokencap.com)contact
About MyToken:https://www.mytokencap.com/aboutusArticle Link:https://www.mytokencap.com/news/556093.html
More exciting content is available on
X(https://x.com/MyTokencap)or join the community to learn more:MyToken-English Telegram Group
(https://t.me/mytokenGroup)
X(https://x.com/MyTokencap)or join the community to learn more:MyToken-English Telegram Group
(https://t.me/mytokenGroup)
Previous:顾景辞:1.21比特币/以太坊操作策略附行情分析
Related Reading



Bitcoin Reclaims $63,500 As Traders Watch For Squeeze Toward $67,000
Bitcoin bulls are watching the $63,500 support zone as traders map a potential squeeze toward the $6...
NewsBTC2026-06-21 21:08:45

Saylor Says Strategy Added More Than 716,000 BTC Since 2022 Balance Sheet Stress
Michael Saylor says Strategy added more than 716,000 BTC after its 2022 balance sheet stress, pointi...
NewsBTC2026-06-21 20:30:41

JaredFromSubway MEV Bot Drained of $7.5M in Token Approval Trick
JaredFromSubway MEV bot lost $7.5M after being tricked into granting token approvals. Blockaid says ...
blockchainreporter2026-06-21 20:00:00