Kimi K2 is a large language model series developed by the Moonshot AI team.
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Trained with the Muon optimizer, Kimi K2 achieves exceptional performance across frontier knowledge, reasoning, and coding tasks while being meticulously optimized for agentic capabilities.
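The gap between 1T total and 32B activated parameters comes from MoE routing: a router selects a small subset of experts per token, so only those experts' weights participate in the forward pass. Below is a minimal sketch of top-k routing in PyTorch; the class name, dimensions, and expert design are illustrative, not Kimi K2's actual implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoELayer(nn.Module):
    """Minimal top-k mixture-of-experts layer (illustrative sizes, not Kimi K2's)."""

    def __init__(self, d_model=512, d_ff=1024, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        ])

    def forward(self, x):                         # x: (tokens, d_model)
        weights, idx = self.router(x).topk(self.k, dim=-1)
        weights = F.softmax(weights, dim=-1)      # normalize over the chosen experts
        out = torch.zeros_like(x)
        for slot in range(self.k):                # only k experts run per token
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out

layer = TopKMoELayer()
print(layer(torch.randn(4, 512)).shape)           # torch.Size([4, 512])
```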
Key Features
Large-Scale Training: Pre-trained a 1T-parameter MoE model on 15.5T tokens with zero training instability.
MuonClip Optimizer: Applies the Muon optimizer at an unprecedented scale, with novel optimization techniques developed to resolve instabilities during scale-up (see the sketch after this list).
Agentic Intelligence: Specifically designed for tool use, reasoning, and autonomous problem-solving.
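Muon itself is a published optimizer: on 2D weight matrices it replaces the elementwise Adam update with the momentum matrix approximately orthogonalized by a few Newton-Schulz iterations. The sketch below follows the public Muon reference implementation (coefficients included), not Moonshot's internal code; MuonClip's additional qk-clip safeguard, which rescales query/key projections to cap attention logits, is omitted here:

```python
import torch

def newton_schulz_orthogonalize(G, steps=5, eps=1e-7):
    """Approximately orthogonalize G via Newton-Schulz iteration.

    Coefficients follow the public Muon reference implementation;
    this is a sketch of the idea, not Moonshot's MuonClip code.
    """
    a, b, c = 3.4445, -4.7750, 2.0315
    X = G / (G.norm() + eps)              # bound the spectral norm by ~1
    transposed = X.shape[0] > X.shape[1]
    if transposed:                        # iterate on the wide orientation
        X = X.T
    for _ in range(steps):
        A = X @ X.T
        X = a * X + (b * A + c * A @ A) @ X
    return X.T if transposed else X

def muon_step(weight, grad, momentum_buf, lr=0.02, beta=0.95):
    """One simplified Muon update on a 2D weight matrix."""
    momentum_buf.mul_(beta).add_(grad)    # heavy-ball momentum
    weight.add_(newton_schulz_orthogonalize(momentum_buf), alpha=-lr)
```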
Model Variants
Kimi-K2-Base: The foundation model, a strong start for researchers and builders who want full control for fine-tuning and custom solutions.
Kimi-K2-Instruct: The post-trained model, best for drop-in, general-purpose chat and agentic experiences. It is a reflex-grade model that answers directly, without long thinking (see the usage sketch below).
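A minimal chat call against Kimi-K2-Instruct, assuming an OpenAI-compatible serving endpoint; the base_url, api_key, and exact model name are placeholders for whatever your deployment (e.g. vLLM, SGLang, or a hosted API) exposes:

```python
from openai import OpenAI

# base_url, api_key, and model name are placeholders; point them at
# your own Kimi-K2-Instruct deployment.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="Kimi-K2-Instruct",
    messages=[
        {"role": "system", "content": "You are Kimi, an AI assistant created by Moonshot AI."},
        {"role": "user", "content": "Briefly explain mixture-of-experts routing."},
    ],
    temperature=0.6,
)
print(response.choices[0].message.content)
```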
Model Summary

| Architecture | Mixture-of-Experts (MoE) |
|:---|:---|
| Total Parameters | 1T |
| Activated Parameters | 32B |
| Number of Layers (Dense layer included) | 61 |
| Attention Hidden Dimension | 7168 |
| MoE Hidden Dimension (per Expert) | 2048 |
| Number of Attention Heads | 64 |
| Number of Experts | 384 |
| Selected Experts per Token | 8 |
| Number of Shared Experts | 1 |
| Vocabulary Size | 160K |
| Context Length | 128K tokens |
| Attention Mechanism | MLA (Multi-head Latent Attention) |
| Activation Function | SwiGLU |
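As a rough sanity check, the 1T/32B split can be reconstructed from this table: each token activates 8 routed experts plus 1 shared expert out of 384, so only a small fraction of the expert weights run per token. A back-of-the-envelope sketch follows; the formulas are deliberate simplifications (all 61 layers treated as MoE, attention/MLA and embedding parameters ignored, which is where most of the remaining activated parameters live):

```python
# Back-of-the-envelope accounting from the table above. Deliberately
# simplified: treats all 61 layers as MoE and ignores attention and
# embedding weights, so the numbers are approximate.
d_model   = 7168          # attention hidden dimension
d_expert  = 2048          # MoE hidden dimension per expert
n_layers  = 61
n_experts = 384
k_routed  = 8             # selected experts per token
n_shared  = 1

# A SwiGLU expert has gate, up, and down projections.
expert_params = 3 * d_model * d_expert             # ~44M per expert

total_expert     = n_layers * n_experts * expert_params
activated_expert = n_layers * (k_routed + n_shared) * expert_params

print(f"expert params, total:     {total_expert / 1e12:.2f}T")    # ~1.03T
print(f"expert params, activated: {activated_expert / 1e9:.1f}B")  # ~24.2B
```

Expert weights alone land near the reported 1T total; attention, embeddings, and the dense first layer account for the gap between ~24B and the reported 32B activated parameters.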