Which AI Are You Actually Using, and Does It Matter?

There’s a thread doing the rounds comparing the major AI assistants, and it’s the usual mix of genuine insight and confident nonsense. But buried in there are a few observations that stuck with me.

Someone mentioned their mum now uses Gemini daily, gets answers in her own language, solves her own problems. Someone else’s mum has apparently made Claude her best friend. This is not the AI adoption story that gets written about in tech publications, but it might be the more important one. Quiet, practical, personal. Not productivity gains or enterprise integration. Just: my mum can get help now when she needs it.

Benchmarking Yourself Against the Machines

Someone on Reddit built a tool that lets you benchmark yourself against AI language models. Same tests, same scoring. You sit down, answer the questions, and find out what size model you approximate.

The post took off, mostly because the original poster was having an absolute blast in the comments, treating themselves like a product listing. Quantization options. Token pricing. VRAM requirements. The bit where someone asked if they’d fit on an 8GB graphics card and they replied that they’d had Coca-Cola and cheesecake before testing “for an extra pump” — that’s genuinely funny. The whole thread had the energy of someone who understood exactly what they’d made and leaned into it without overselling it.

Teaching AI to Play Poker (Sort Of): When LLMs Meet Game Strategy

I’ve been fascinated by a project that’s been making the rounds lately: BalatroBench, which essentially lets large language models play Balatro, that brilliant poker-inspired roguelike that took the gaming world by storm last year. The concept is simple but elegant — feed the LLM the game state as text, let it decide what to do, and watch it either triumph or faceplant spectacularly.

For those unfamiliar, Balatro is a poker-based roguelike where you build synergies between cards, jokers, and special effects to reach increasingly absurd score targets. It’s the kind of game that requires both strategic planning and tactical decision-making, which makes it a genuinely interesting test for AI reasoning capabilities.
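
The loop described above (game state in as text, a decision out) can be sketched roughly like this. This is only an illustration under stated assumptions, not BalatroBench's actual code: `GameState`, `render_state`, `parse_action`, `take_turn`, and the 'play 1 3 4' reply format are all hypothetical names invented here, and the model is just any function that maps a prompt string to a reply string.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class GameState:
    """Hypothetical, heavily simplified round snapshot (not BalatroBench's real schema)."""
    hand: List[str]
    jokers: List[str]
    score: int
    target: int
    hands_left: int


def render_state(state: GameState) -> str:
    """Turn the state into the plain-text prompt the model sees."""
    return (
        f"Hand: {', '.join(state.hand)}\n"
        f"Jokers: {', '.join(state.jokers) or 'none'}\n"
        f"Score: {state.score} / {state.target}\n"
        f"Hands left: {state.hands_left}\n"
        "Reply with 'play' or 'discard' followed by card positions, e.g. 'play 1 3 4'."
    )


def parse_action(reply: str, hand_size: int) -> Tuple[str, List[int]]:
    """Parse a reply such as 'play 1 3 4' into (verb, zero-based card indices).

    Anything malformed raises ValueError, so a bad reply is counted as a
    failure instead of being silently repaired.
    """
    parts = reply.strip().lower().split()
    if not parts or parts[0] not in ("play", "discard"):
        raise ValueError(f"unrecognised action: {reply!r}")
    indices = [int(p) - 1 for p in parts[1:]]  # int() raises ValueError on junk
    if not indices or len(set(indices)) != len(indices):
        raise ValueError(f"need at least one card, no repeats: {reply!r}")
    if any(i < 0 or i >= hand_size for i in indices):
        raise ValueError(f"card position out of range: {reply!r}")
    return parts[0], indices


def take_turn(state: GameState, model: Callable[[str], str]) -> Tuple[str, List[int]]:
    """One step of the loop: show the state, ask the model, parse its answer."""
    return parse_action(model(render_state(state)), len(state.hand))
```

Swapping in a real model only means replacing the `model` callable. Keeping the parser strict is a deliberate choice in this sketch, since part of what such a benchmark measures is whether the model can follow the output format at all.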

When AI Becomes a Propaganda Megaphone: The Problem With Unvetted Training Data

I’ve been watching the AI hype train for a couple of years now, equal parts fascinated and concerned. The technology is genuinely impressive in some ways, but there’s always been this nagging worry at the back of my mind about what happens when we hand over our critical thinking to machines that don’t actually think.

Recent research showing that ChatGPT, Gemini, DeepSeek, and Grok are serving up Russian propaganda about the Ukraine invasion feels like that worry manifesting in real time. It’s not surprising, but it’s deeply frustrating.

The Hidden Power of Tensor Offloading: Boosting Local LLM Performance

Running large language models locally has been a fascinating journey, especially for those of us who’ve been tinkering with these systems on consumer-grade hardware. Recently, I’ve discovered something quite remarkable about tensor offloading that’s completely changed how I approach running these models on my setup.

The traditional approach of offloading entire layers to manage VRAM constraints turns out to be rather inefficient. Instead, selectively offloading specific tensors, particularly the larger FFN (feed-forward network) tensors, to the CPU while keeping the attention tensors on the GPU can dramatically improve performance. We’re talking about potential speed improvements of 200% or more in some cases.
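
To make the idea concrete, here is a minimal sketch of a per-tensor placement plan. It is a toy planner, not how any particular runtime decides placement: the function name `plan_placement`, the GGUF-style tensor names such as `blk.0.ffn_up.weight`, and the sizes in the usage note are assumptions made up for illustration.

```python
import re
from typing import Dict, List, Tuple

# Assumption: tensor names follow a GGUF-like "blk.<layer>.<kind>.weight" scheme.
FFN_PATTERN = re.compile(r"\.ffn_(up|down|gate)\.")


def plan_placement(tensors: List[Tuple[str, int]], vram_budget_mib: int) -> Dict[str, str]:
    """Assign each (name, size_mib) tensor to 'GPU' or 'CPU'.

    Non-FFN tensors (attention, norms) are placed on the GPU first. FFN
    tensors then fill whatever budget remains, smallest first, and any
    that do not fit are sent to the CPU.
    """
    placement: Dict[str, str] = {}
    used = 0
    ffn: List[Tuple[str, int]] = []

    for name, size in tensors:
        if FFN_PATTERN.search(name):
            ffn.append((name, size))
        else:
            placement[name] = "GPU"
            used += size

    if used > vram_budget_mib:
        raise ValueError("non-FFN tensors alone exceed the VRAM budget")

    for name, size in sorted(ffn, key=lambda t: t[1]):
        if used + size <= vram_budget_mib:
            placement[name] = "GPU"
            used += size
        else:
            placement[name] = "CPU"
    return placement
```

With two hypothetical layers whose attention tensors are 100 MiB each and whose FFN tensors are 300 MiB each, a 550 MiB budget keeps both attention tensors and one FFN tensor on the GPU and pushes the remaining FFN tensors to the CPU. Whole-layer offloading under the same budget would have moved an attention tensor off the GPU as well.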