<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Ai-Benchmarks on Left 4 More</title><link>https://left4more.com/tags/ai-benchmarks/</link><description>Recent content in Ai-Benchmarks on Left 4 More</description><generator>Hugo</generator><language>en-au</language><lastBuildDate>Sat, 13 Jun 2026 03:15:15 +1000</lastBuildDate><atom:link href="https://left4more.com/tags/ai-benchmarks/index.xml" rel="self" type="application/rss+xml"/><item><title>Kimi K2.7: Coding AI That's Not Trying to Fool You</title><link>https://left4more.com/posts/kimi-k27-coding-ai-thats-not-trying-to-fool-you/</link><pubDate>Sat, 13 Jun 2026 03:15:15 +1000</pubDate><guid>https://left4more.com/posts/kimi-k27-coding-ai-thats-not-trying-to-fool-you/</guid><description>&lt;p>There&amp;rsquo;s a thing that happens in the AI space, reliably, almost rhythmically: a new model drops, the benchmarks are suspiciously curated, the blog post reads like it was written by a marketing department that just discovered the word &amp;ldquo;unprecedented,&amp;rdquo; and within 48 hours someone on Reddit has found the caveats buried in appendix C. Rinse, repeat.&lt;/p>
&lt;p>So when Moonshot AI put out Kimi K2.7 Code this week, I was half-expecting the usual. What I got was something a bit different, and I find myself cautiously impressed, not by the model itself, which I haven&amp;rsquo;t tested properly, but by the way it was presented.&lt;/p></description></item></channel></rss>