<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>LLM on C.CUI's Log</title><link>https://cuicaihao.github.io/tags/llm/</link><description>Recent content in LLM on C.CUI's Log</description><generator>Hugo</generator><language>en-AU</language><lastBuildDate>Thu, 04 Jun 2026 07:00:00 +1000</lastBuildDate><atom:link href="https://cuicaihao.github.io/tags/llm/index.xml" rel="self" type="application/rss+xml"/><item><title>No More 'Playing It By Ear': Why Harness Engineering is Becoming Key to AI Production Deployment?</title><link>https://cuicaihao.github.io/posts/2026-06-04-no-more-playing-it-by-ear-why-harness-engineering-is-becoming-key-to-ai-production-deployment/</link><pubDate>Thu, 04 Jun 2026 07:00:00 +1000</pubDate><guid>https://cuicaihao.github.io/posts/2026-06-04-no-more-playing-it-by-ear-why-harness-engineering-is-becoming-key-to-ai-production-deployment/</guid><description>Harness Engineering is emerging as a crucial paradigm for deploying AI into production environments, moving beyond unreliable prompt-based approaches. It conceptualizes large models as powerful &amp;ldquo;wild horses&amp;rdquo; and introduces a &amp;ldquo;rein system&amp;rdquo; comprising constraints, guidance, verification, and correction. This system enables AI Agents to accomplish tasks more stably, controllably, and reliably in complex, real-world business scenarios.</description></item><item><title>Deep Decoding: How Large Language Models 'See' Structure?</title><link>https://cuicaihao.github.io/posts/2026-05-03-how-llms-see-structure/</link><pubDate>Sun, 03 May 2026 09:05:02 +1000</pubDate><guid>https://cuicaihao.github.io/posts/2026-05-03-how-llms-see-structure/</guid><description>Explore the architectural differences between Native Multimodal, MoE, and Long Context LLMs, and how they perceive and process structural information in professional workflows.</description></item><item><title>The Great AI Dao Debate: A Xianxia Epic of Silicon Immortality</title><link>https://cuicaihao.github.io/posts/2026-05-01-ai-dao-debate-xianxia-epic/</link><pubDate>Fri, 01 May 2026 14:30:00 +1000</pubDate><guid>https://cuicaihao.github.io/posts/2026-05-01-ai-dao-debate-xianxia-epic/</guid><description>A deep dive into the 2026 global AI landscape through the lens of Xianxia (Chinese fantasy). This epic narrative explores the strategic rivalries between major &amp;lsquo;sects&amp;rsquo; like OpenAI, Meta, Google, Microsoft, Apple, DeepSeek, and Alibaba. It examines the fundamental &amp;lsquo;Dao&amp;rsquo; debates: Scaling Law vs. World Models, and Open-Source vs. Closed-Source, mapping the path toward AGI in an era of silicon immortality.</description></item><item><title>Breaking the Million-Token Barrier: 5 Impactful Takeaways from DeepSeek-V4</title><link>https://cuicaihao.github.io/posts/2026-04-30-breaking-million-token-barrier-deepseek-v4-takeaways/</link><pubDate>Thu, 30 Apr 2026 08:55:59 +1000</pubDate><guid>https://cuicaihao.github.io/posts/2026-04-30-breaking-million-token-barrier-deepseek-v4-takeaways/</guid><description>An analysis of DeepSeek-V4&amp;rsquo;s breakthroughs in context efficiency, architectural innovation, and stability, reducing KV cache by up to 93% and enabling million-token contexts.</description></item></channel></rss>