OpenAI's New SDK Lets AI Agents Run Unsupervised for Hours

OpenAI just gave enterprise developers the tools to build agents that run for hours without human supervision, and they did it by baking execution environments directly into the SDK.

The Summary

OpenAI updated its Agents SDK with native sandbox execution and a model-native harness for secure, long-running agent workflows
The update targets enterprise developers who need agents that can handle files and tools without constant oversight
This is infrastructure for the shift from chatbots that answer questions to agents that complete multi-step tasks autonomously

The Signal

OpenAI's Agents SDK now includes native sandbox execution, which means developers no longer need to jerry-rig their own isolated environments for agent code. The SDK handles it. You get a secure container where your agent can run Python, manipulate files, call APIs, and chain together tool use without escaping into your production systems.

The second piece is what OpenAI calls a "model-native harness." This is the control layer that lets agents persist across sessions, maintain state, and execute workflows that take minutes or hours instead of seconds. The focus is squarely on enterprise use cases, where an agent might need to process a batch of contracts, reconcile datasets, or monitor systems overnight.

"Long-running agents across files and tools" is the key phrase here. This is OpenAI acknowledging that enterprises want workers, not assistants.

What makes this different from earlier agent frameworks:

Built-in sandboxing instead of relying on third-party containers
State management for tasks that span multiple sessions
File handling as a first-class feature, not a hack

The timing matters. We're 18 months into the agent hype cycle, and enterprises are still trying to figure out how to deploy these things safely. Most companies building agents today are using a patchwork of tools: LangChain for orchestration, Docker for isolation, custom code for state. OpenAI is collapsing that stack into one SDK with opinionated defaults.

The Implication

If you're building agent infrastructure, this is a warning shot. OpenAI is competing directly with the orchestration layer, not just the model layer. Companies betting on being the "enterprise-safe wrapper" around OpenAI models now face the platform eating their differentiation.

For enterprises, the question is whether you trust OpenAI's sandbox enough to run production workloads. The SDK lowers the barrier to experimentation, but production deployment is still about trust, compliance, and what happens when an agent does something unexpected at 3 AM.

Sources

TechCrunch AI | OpenAI Blog

The Summary

The Signal

The Implication

Sources

Keep Reading