Microsoft got tired of bleeding cash to OpenAI and decided to build their own models. They bought most of Inflection's team for like $650 million or something ridiculous, then burned through 15,000 H100s. At 30 grand each, that's almost half a billion just for the hardware.
The math is brutal. Every Copilot query was costing them a fortune through OpenAI's API. When you're hemorrhaging $2.9 billion per quarter on AI costs, building your own mediocre model starts looking smart. Sure, it ranks 13th on LMArena - behind everything that actually matters - but at least it's their mediocre model.
MAI-1-preview launched August 28th and immediately got destroyed by GPT-4, Claude, Gemini, and basically every other model worth using. Microsoft knew this would happen. They don't care about being the best; they care about not paying OpenAI $0.03 per 1K tokens anymore.
The Technical Reality
They used mixture-of-experts architecture, which sounds impressive until you realize everyone's been doing this since 2017. Different parts of the model activate for different tasks - it's more efficient than giant monolithic models, but it's not exactly groundbreaking.
Microsoft isn't innovating here. They copied the approach from a paper that's older than most interns at the company. MAI-1 has around 500 billion parameters compared to GPT-4's estimated 1.76 trillion. So they built something smaller and dumber and somehow expected people to get excited.
The GB200 clusters they keep bragging about? Just NVIDIA's latest overpriced hardware that everyone else is using too. Microsoft claims they "borrowed techniques from the open-source community" - at least they're admitting they copied everything instead of their usual embrace, extend, extinguish playbook.
Here's the thing: they didn't need the best model. They needed something "good enough" for Office users that wouldn't bankrupt them. MAI-1-preview achieves that goal - barely. It's like choosing the cheapest beer at the bar. Nobody expects it to taste good, but it'll get you drunk for less money.
I spent an hour trying to get MAI-1-preview to write decent Python code through the LMArena interface. The results were painful. It kept suggesting deprecated pandas stuff and had no clue about the SettingWithCopyWarning hell I was stuck in. Took me forever to figure out it was just wrong about everything.