The Bloatening: When AI Companies Forgot About the Little Guy
I used to get excited about model releases. A new tiny model would drop and I would immediately try to run it on my laptop that sounds like a jet engine. Now I scroll through announcements and see numbers that require a data center just to pronounce.
Take Qwen. I liked Qwen. Qwen had a 0.6B model. It was cute. It was manageable. It could run on hardware that did not require a second mortgage. Then came the next generation. Suddenly 0.6B became 0.8B. That is a thirty-three percent increase in bloat for one iteration. Meanwhile they are also releasing models with parameter counts that look like phone numbers.
The Creep of More
There is a pattern here. Start with something small and accessible. Build a community of developers who actually use your models on actual consumer hardware. Then quietly shift the goalposts. The new baseline is bigger. The new expectations require more VRAM. The new documentation assumes you have access to enterprise-grade infrastructure or at least a very understanding roommate with a powerful gaming rig.
I am not saying bigger models are bad. Sometimes you need the big guns. But when every release nudges the minimum requirements upward, the little guys get left behind. People like me, who train models with 100K parameters for fun and profit - mostly fun, definitely not profit - start to feel like we are showing up to a Formula 1 race on a tricycle.
Progress should not mean exclusion. Innovation should not require a hardware upgrade just to participate in the conversation.
My Tiny Rebellion
So I keep going small. I train models that fit in memory. I experiment with architectures that prioritize efficiency over scale. I accept that my models will sometimes respond to math questions with fish facts because they are tiny and confused and frankly adorable.
Maybe this is stubborn. Maybe I am the one who needs to adapt. But I believe there is value in accessibility. There is power in letting people experiment without needing a cloud budget. There is something deeply human about building tools that regular people can actually use.
Also my laptop is still making that jet engine noise and I cannot afford a wind tunnel.