Notes

Technology Contracts

The AI Agent Contract Is Not Just Another SaaS Agreement

AI agents are moving from helpful tools to operational actors. The contract needs to catch up.

Jason Gershenson/ AI/ Commercial contracts/ Startups/ Data/ Outside general counsel/ SaaS

Many companies are buying and selling AI products using agreements built for ordinary SaaS. That worked reasonably well when software mostly stored data, displayed dashboards, routed tickets, or automated narrow tasks.

It becomes less comfortable when the product can take actions, summarize business-critical information, interact with other systems, generate customer-facing content, or shape workflow decisions.

A chatbot can answer a question. An agent can create a problem.

That does not mean every AI agreement needs to become a 60-page monument to anxiety. It means the contract should match what the tool actually does.

The legal issue is not whether AI is useful. The issue is whether the legal and operational structure has caught up with the deployment.

Why the old SaaS playbook is not enough

Standard SaaS agreements usually focus on access, uptime, fees, confidentiality, data processing, IP ownership, support, termination, and limitations of liability. Those issues still matter.

They are not the whole conversation.

AI agents raise a different set of operating questions. What data does the system ingest? Does the vendor use customer data, prompts, outputs, or usage patterns to train or improve models? Can the tool take action outside the application? Is there a human approval step before the agent sends a message, changes a record, triggers a workflow, or makes a recommendation that someone will rely on?

The questions keep going. Are outputs logged? Can the customer audit what happened? Who is responsible if output is wrong, incomplete, biased, stale, or misused? Can the customer shut the system down quickly? What happens if the customer wants to switch vendors after the agent has been embedded into a real workflow?

Those are not academic questions. They are commercial design questions, risk allocation questions, and contract questions.

The contract should follow the use case

Not every AI product needs the same legal treatment.

A marketing copy assistant is not the same as an internal knowledge-search tool. An internal knowledge-search tool is not the same as an underwriting assistant. An underwriting assistant is not the same as an agent that can trigger customer communications, update account records, change pricing inputs, or initiate financial workflows.

The more the tool touches sensitive data, external systems, customer-facing outputs, regulated workflows, or autonomous actions, the less comfortable the contract should be with generic language.

This is where a lot of AI contracting goes sideways. The agreement describes the product at a high level, but the real risk sits in the workflow. Is the agent recommending an action or taking it? Is it drafting a customer message or sending it? Is it summarizing a document for human review or extracting data that automatically populates another system?

The contract should not pretend those are the same thing.

Data use is the first fight

Founders often focus on who owns the output. That matters, especially when outputs are used in customer materials, product workflows, code, analysis, or business records.

But ownership of the output may not be the hardest question.

Ownership of the output matters. Control of the data exhaust may matter more.

An AI agent may touch customer data, usage data, prompts, inputs, outputs, logs, derived data, metadata, workflow signals, and product feedback. It may also create information that is not exactly the customer's original data, but is plainly connected to how the customer's business works.

That is why the agreement should be clear about model improvement, fine-tuning, anonymized or aggregated data, retention, deletion, and whether the vendor can use customer-specific activity to improve the product for others.

"We do not train on your data" is useful only if everyone understands what "your data" means. Does it include prompts? Outputs? Error corrections? Clicks? Workflow events? Human approvals? Rejected recommendations? Derived data?

The answer may be different depending on the product and the leverage of the parties. But it should be answered.

Human oversight should be contractual, not just operational

"Human in the loop" should not be a marketing slogan.

If the business expects human review before an AI-generated recommendation becomes a customer communication, contract instruction, operational decision, or workflow action, that expectation should show up somewhere more durable than a sales deck.

The agreement and product documentation should address permissioning, approval rights, logs, escalation, audit trails, and emergency shutdown. If only certain users can approve certain actions, the contract should support that product reality. If the customer is responsible for reviewing all outputs before use, the agreement should say that clearly.

This matters for both sides.

Customers need to know where their responsibility begins. Vendors need to avoid being treated as if they guaranteed a supervised workflow that the product never actually enforced.

Human oversight is not just a policy choice. In an AI agent deployment, it can be part of the bargain.

Liability caps should reflect the real failure mode

AI vendors will usually want SaaS-style limitation of liability language. That is expected.

Customers should ask whether the cap fits the failure mode.

In many SaaS disputes, the obvious concern is that the dashboard is unavailable, data is temporarily inaccessible, or support is slow. With AI agents, the more important concern may be different: bad output, data leakage, unauthorized action, regulatory exposure, customer-facing error, or a flawed workflow decision that moves through the business before anyone catches it.

The liability cap should not pretend the only thing that can go wrong is that the dashboard is unavailable for an afternoon.

That does not mean every customer should demand unlimited liability. Most deals will not support that, and many products do not warrant it. The right answer depends on leverage, use case, fees, insurance, deployment scope, data sensitivity, and whether the tool is core to the customer's operations.

The practical move is to connect the liability structure to the actual deployment. A low-risk internal pilot may be treated differently than a production system touching customer communications, financial decisions, healthcare workflows, hiring processes, or regulated records.

Pilot agreements are not throwaways

AI companies often start with pilots, proofs of concept, beta access, limited integrations, or design partner arrangements. Those documents can feel informal because the relationship is still being tested.

That is exactly why they matter.

A pilot can decide the data rules, workflow assumptions, product feedback rights, publicity expectations, exclusivity, conversion mechanics, and future pricing posture that shape the real commercial relationship.

Before launch, the pilot should answer some basic questions:

  • What exactly is being tested?
  • What systems and data are involved?
  • Who can use the tool?
  • Can the vendor use feedback, prompts, outputs, or workflow data to improve the product?
  • What happens at the end of the pilot?
  • Is there any exclusivity, publicity, preferred pricing, or conversion right?
  • What must be deleted, returned, exported, or retained?

Pilots should stay lightweight, but not vague. A short agreement that accurately describes the test is usually better than a long agreement that ignores how the product will actually be used.

Practical takeaway

AI agent contracts do not need to be bloated. They need to be accurate.

The agreement should match the use case, data flows, autonomy level, customer impact, and real risk profile. It should make clear what the tool can do, what the customer controls, what the vendor can use, what gets logged, who reviews outputs, and what happens when the deployment ends.

That is commercial contract work, not AI theater.

The best AI contract does not try to predict every future use of the technology. It makes sure the first real deployment is not governed by language written for a very different product.