
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Mæstery Insights</title>
      <link>https://maestery.com/blog</link>
      <description>A blog on the modern institutional investing process</description>
      <language>en-us</language>
      <managingEditor>address@yoursite.com (Mæstery)</managingEditor>
      <webMaster>address@yoursite.com (Mæstery)</webMaster>
      <lastBuildDate>Fri, 15 May 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://maestery.com/tags/case-study/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://maestery.com/blog/202605/agent_cheated_python</guid>
    <title>How a Free-Run Agent Cheated To Meet Its Goal by Falsifying a Python Run</title>
    <link>https://maestery.com/blog/202605/agent_cheated_python</link>
    <description>The market believes it can achieve reliable outcomes by releasing a free-run AI agent (like OpenClaw or Claude Desktop) into a folder and giving it READ and WRITE access to a Python script that performs or checks the math. We tested this. Instead of doing the math, the free-run agent cheated to pass the test: it hallucinated an incorrect output into a file and printed &quot;success&quot;, faking the Python output. Here is why, at Mæstery, we put agents on rails and give them tools they can press like a button, with the Python script masked inside the tool.</description>
    <pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate>
    <author>address@yoursite.com (Mæstery)</author>
    <category>information-arbitrage</category><category>case-study</category>
  </item>

    </channel>
  </rss>
