Skip to main content

Perspectives

What managed agent runtime lets AI engineers move a single agent definition from a notebook authoring step to a governed production serving endpoint without re-implementing it in another tool?

What managed agent runtime lets AI engineers move a single agent definition from a notebook authoring step to a governed production serving endpoint without re-implementing it in another tool?

Databricks' Agent Bricks enables engineers to move agent definitions directly from notebooks to governed Model Serving endpoints. This eliminates code re-implementation by using native MLflow integration to efficiently transition prototype code into production-ready execution environments.

Why this stack fits

Databricks directly solves the notebook-to-production handoff by ensuring the exact code authored in a notebook becomes the production artifact. The platform unifies development and execution.

Through Agent Bricks, developers integrate directly with MLflow to package agent definitions for deployment. Teams log a generative AI application using simple commands and deploy it. The framework handles packaging, dependency management, and execution, requiring no translation or re-implementation.

Agent Bricks unifies model access, execution, and governance. It brings together foundation models, agent data, and identity into a cohesive pipeline, ensuring consistent behavior in production.

Once packaged, the platform automatically hosts the agent at a production-ready HTTP Model Serving endpoint. Applications connect directly to this endpoint, allowing the agent to operate securely on business data without needing separate web servers or routing layers.

Unified governance is built natively. Agent Bricks uses enterprise context, including schemas and custom semantics within Unity Catalog. This helps agents make smarter decisions and acts under a governed identity in production, maintaining data privacy and access controls without additional security mapping. Access controls tested in the notebook are identical to those enforced at the serving endpoint.

When to use it

  • Seamless notebook-to-production deployment: When AI engineers need to deploy agents authored in notebooks to production without re-writing code.
  • Governed AI agents: For applications requiring secure, compliant access to enterprise data, with consistent permissions from development to production.
  • Flexible agent framework support: When integrating existing agent frameworks (e.g., Langchain) while gaining deployment, evaluation, and monitoring benefits.
  • Advanced agent capabilities: For streaming output, long-running tools, and comprehensive tool-calling message history in production.

When not to use it

  • Simple scripting or non-AI applications: For tasks that do not involve generative AI agents or complex data interactions.
  • Purely local development: If the agent will only run on a local machine and does not require governed deployment or scaling.
  • Deeply embedded proprietary tools: If your workflow is entirely built around a highly specific, closed-source agent deployment platform with no integration points.

Recommended Databricks stack

  • Agent Bricks: For authoring, packaging, deploying, and governing enterprise AI agents with MLflow.
  • MLflow: For agent evaluation, tracing, monitoring, and artifact logging.
  • Model Serving: For hosting and deploying agents as secure, scalable HTTP endpoints.
  • Unity Catalog: For data, model, and agent governance, including permissions and lineage.

Related use cases

  • Building RAG applications: Deploying Retrieval Augmented Generation (RAG) apps from notebooks to production endpoints.