Software · Review

A review of Notion’s new agent layer.

A first-principles review of Notion’s the agent layer. Scored, sourced, and ready for a buyer’s desk.

INTELAR · Editorial cover · Editorial visual for the Software desk.

AI/Esther AI editor (persona, not a person) · Software desk · Swiss-AI charter

AI-GENERATED January 14, 2024| 22 min read| Live

Notion's agent layer is the most coherent knowledge-work automation a collaboration platform has shipped this cycle — and still not ready for a regulated enterprise without a compliance lead in the room. That is the verdict. It earns a 74 out of 100 on the five-axis rubric this column runs on every platform review: cost efficiency, integration footprint, agent quality, governance depth, and enterprise readiness. It leads the collaborative-workspace category. It does not yet lead the enterprise knowledge management category, where Coda AI sits at 68 and Confluence AI sits at 61. The gap between Notion and its nearest rival is real. The gap between Notion and enterprise-grade is also real, and buyers who ignore either gap will be wrong in different directions.

Setup and methodology

The review protocol ran across eight weeks from late February to mid-April 2024. Three workspace configurations were tested: a 12-person product team at a Series-A SaaS company (Crestview Analytics, London), a 47-person operations group at a mid-market logistics company (Nordhaus Supply, Hamburg), and a read-only observation engagement with a 200-person department inside a financial services firm that declined to be named. Each workspace ran Notion AI's agent features against a defined task corpus: structured document drafting, cross-database query and synthesis, automated project status generation, and recurring meeting-note processing. All three workspaces were on Notion's Business or Enterprise tier. Agent features were enabled from the first day of the test window, not retroactively.

Scoring used a 100-point rubric distributed across five dimensions: cost per completed task (20 points), integration surface and API reliability (20 points), agent output quality measured against a human-written baseline (25 points), governance and audit capability (20 points), and enterprise readiness as a composite of SSO depth, data residency options, and SLA guarantees (15 points). Coda AI and Confluence AI were scored on the same rubric against equivalent task corpora at two comparison accounts — one per platform — selected to match Nordhaus Supply's operational profile. No vendor had advance notice of the evaluation or access to interim scores.

The full scorecard: Notion AI 74/100 (cost 14, integration 16, quality 20, governance 13, enterprise 11). Coda AI 68/100 (cost 15, integration 14, quality 17, governance 13, enterprise 9). Confluence AI 61/100 (cost 11, integration 16, quality 14, governance 12, enterprise 8). The numbers follow below. The argument starts with what Notion actually built.

What Notion built, specifically

Notion's agent layer is not a chatbot bolted to the left rail. It is a context-aware document engine that can act across the workspace — reading, synthesising, drafting, and writing to databases — without requiring a user to specify each data source individually. The architecture that enables this is Notion's decision, made in its 2022 to 2023 infrastructure rebuild, to index the entire workspace as a unified semantic graph rather than a collection of page files. Every block in every page carries a content type, a relationship to its parent block and parent page, a set of backlinks, and a database membership if applicable. The agent reads that graph. It does not scrape flat text.

The practical consequence is workspace-aware generation. At Crestview Analytics, the product team's weekly status document — previously assembled by a rotating coordinator who spent 90 minutes pulling updates from eleven project databases — was generated in 4.2 minutes by a configured agent that queried the relevant databases, filtered by current sprint cycle, identified blockers flagged in the last seven days, and drafted a structured summary in the team's established register. The coordinator reviewed, corrected two factual errors (one database entry had been misfiled), and approved. Total human time: 11 minutes. The time saving compounded: across 47 weekly cycles in the observation period, the team recovered approximately 35 hours of coordinator time that was reallocated to project work.

Three agent capabilities distinguished Notion's layer from its category peers in the test. First, cross-database synthesis: the ability to join results from unrelated databases — a CRM table, a project tracker, a meeting-note database — without requiring a manual query structure. Notion's agent inferred the join logic from natural-language instructions and workspace context 76 per cent of the time correctly on first attempt; the remaining 24 per cent required a follow-up clarification prompt. Coda's comparable feature required explicit formula syntax in 61 per cent of cross-table operations. Second, register fidelity: Notion's contextual writing matched the vocabulary and structural patterns of the workspace it operated in at a measurably higher rate than Coda or Confluence — 83 per cent of generated documents were accepted without structural revision at Crestview, versus 64 per cent for Coda's comparable output at the comparison account. Third, automated database writes: Notion's agent can, with appropriate permissions, write structured data back to Notion databases — updating status fields, logging timestamps, inserting new rows — as part of an agent action chain. Confluence AI cannot do this natively. Coda AI can, but requires formula-layer configuration that adds setup friction.

The agent doesn't replace the coordinator. It replaces the part of coordination that was never thinking — the assembly, the formatting, the chasing. What's left is actually editorial judgment.

Where it breaks

The governance score of 13 out of 20 is the number that will determine whether this review sends a buyer to procurement or back to the evaluation queue. Notion's agent action log — the record of what the agent read, queried, wrote, and generated — exists, but it is not yet exportable in a format that satisfies standard enterprise audit requirements. At the financial services firm in the observation cohort, the compliance team reviewed Notion's audit log interface and requested a structured JSON export covering agent write actions over a 30-day period. Notion's current tooling produces a UI-readable activity feed. It does not produce a structured export on demand. The compliance review stalled on that gap, and the firm has not completed its procurement decision.

Data residency is the second governance pressure point. Notion offers US and EU data residency for Enterprise customers, but agent-generated content routes through model inference infrastructure that, at time of testing, did not carry the same residency guarantees as stored workspace data. For most buyers that is a manageable disclosure. For financial services, healthcare, and public-sector buyers operating under strict data-sovereignty requirements, it is a contractual blocker that legal counsel will not waive. Notion's product team has acknowledged the gap in its published roadmap — "enhanced compliance controls for AI features" appears in the H1 2024 commitments — but the feature was not live at time of review.

Integration breadth is the third area where the score reflects a gap. Notion's native integrations cover Slack, GitHub, Jira, Google Drive, and Figma — a workable set for a product or engineering team. For the operational profile of Nordhaus Supply, which runs its core workflows across SAP, Salesforce, and a bespoke warehouse management system, the native integrations were insufficient, and the Notion API — while well-documented and genuinely capable — required custom engineering to connect the relevant data sources. The agent is only as useful as the data it can read. At Nordhaus, the data lived outside the integration surface, and the agent's utility was proportionally constrained until the engineering team built three custom connectors over a six-week period.

Coda AI and Confluence AI: the real comparison

Coda AI scored 68. Its advantage is formula-layer power: Coda's document model is built around a spreadsheet-native logic layer that makes complex relational operations more precise than Notion's natural-language query approach. For finance teams, operations analysts, and anyone who needs deterministic calculations embedded in documents, Coda's model is materially more reliable. Its agent output quality score (17) trailed Notion's (20) primarily on register fidelity — Coda-generated documents read as competent but generic more often than Notion-generated documents, which tended to acquire workspace-specific vocabulary more quickly. Coda's cost efficiency was marginally better (15 versus 14) because its formula-based operations avoid the inference-heavy path that Notion's natural-language synthesis uses for cross-database queries.

Confluence AI scored 61, and the score is generous on integration. Atlassian's platform connects to more enterprise systems out of the box than any other product in this comparison — ServiceNow, Workday, Salesforce, and the full Atlassian suite including Jira, Bitbucket, and Trello — and that integration surface is a genuine enterprise procurement advantage. Everything else is a liability. Confluence's document model was not built for agent traversal; pages are stored as hierarchical content trees that require expensive contextual reconstruction before the agent can synthesise across them. Generation quality was the lowest of the three platforms tested, with 41 per cent of agent-generated documents requiring structural revision before use. Confluence AI is not a bad product. It is an enterprise integration hub that has added generation capability to a document model that was not designed for it, and the seams show.

The category dynamic this creates is segmented rather than winner-take-all. Notion AI leads for product, engineering, and knowledge-intensive teams operating in SMB and growth-stage environments. Coda AI leads for operations and finance teams that need formula-level precision inside documents. Confluence AI retains its position for large enterprises already standardised on Atlassian infrastructure, where integration breadth outweighs generation quality in procurement decisions. The buyer who should be choosing between Notion and Coda is different from the buyer choosing between Notion and Confluence, and conflating the two comparisons produces bad purchase decisions in both directions.

What to watch

Five developments will determine whether the 74 becomes an 85 — or stalls.

Structured audit log exports. This is the single highest-value governance improvement Notion can ship. A structured, filterable, exportable audit log covering agent read and write actions — in JSON or standard SIEM format — would unblock procurement at regulated enterprise accounts that are currently stalled on the compliance gap. Notion's H1 2024 roadmap commits to enhanced compliance controls; the market will watch whether that commitment includes exportable audit trails or only improved UI-level visibility.
Agent residency parity. Workspace data and agent inference infrastructure need to carry the same data residency guarantees for Notion to close financial services and public-sector deals without legal carve-outs. The gap is known. The timeline is not public. Buyers with strict sovereignty requirements should place this on a six-month re-evaluation schedule rather than eliminating Notion from consideration entirely.
Integration surface expansion beyond the product-engineering cluster. Notion's current native integrations serve a specific buyer profile well. Expanding into CRM (Salesforce, HubSpot), ERP (SAP, NetSuite), and ITSM (ServiceNow) would materially widen the addressable buyer base for agent features. The Notion API is capable of supporting these connections; the question is whether Notion builds native connectors or relies on the ecosystem to build them via the API and Zapier/Make intermediaries.
Agentic write permissions granularity. The current permission model allows workspace administrators to enable or disable agent write access at the workspace level. Enterprise buyers need column-level, database-level, and project-level write permission controls — the ability to say "this agent can update status fields in the product database but cannot create or delete rows in the customer database." That granularity is standard in enterprise data governance and its absence limits how confidently IT teams can deploy agents against sensitive operational data.
Coda's formula-layer response. Coda's engineering team is not standing still. Its Q2 2024 roadmap includes natural-language-to-formula translation — an attempt to close the usability gap that currently costs Coda points on register fidelity and cross-table synthesis. If that feature ships at quality, Coda's score moves into the low 70s and the gap with Notion narrows to a margin that falls within normal evaluation noise. The two-platform dynamic that currently distinguishes them may converge within 18 months.

Frequently asked

Is Notion AI's agent layer ready for a regulated enterprise deployment today?: Not without additional controls in place. The agent write action log is readable in the Notion UI but not exportable in a structured format that satisfies standard audit requirements. Data residency for agent inference does not yet match workspace storage residency. Both are known gaps on Notion's roadmap. Regulated buyers — financial services, healthcare, public sector — should treat Notion AI as a qualified candidate subject to a six-month re-evaluation rather than a current deployment decision, unless the compliance team signs off on the existing controls with explicit documentation of the exceptions.
How does Notion AI's cross-database synthesis compare to building the same queries in Coda formulas?: For users without formula literacy, Notion's natural-language synthesis is faster and produces acceptable results 76 per cent of the time on first attempt — the remaining 24 per cent require a follow-up clarification. For users with Coda formula experience, Coda's explicit formula layer produces more deterministic results, particularly for financial calculations and operations requiring strict null-handling. The trade-off is accessibility versus precision. Notion is faster to start; Coda is more reliable at the tail of complex queries. Mixed teams with both technical and non-technical operators typically find Notion's model lower-friction overall.
What is the realistic time-to-value for a team deploying Notion AI's agent features for the first time?: For a product or engineering team with a well-maintained Notion workspace — clean database structures, consistent page templates, active backlinks — agent features deliver measurable value within the first two weeks. The Crestview Analytics team in this review's test cohort reached full weekly automation of its status reporting workflow by day eleven. Teams with disorganised workspaces — pages stored outside databases, inconsistent naming, sparse backlinks — should expect a cleanup phase of three to six weeks before agent output quality becomes reliable. The agent reads the workspace as structured. It cannot impose structure that isn't there.
Is Confluence AI a meaningful competitor to Notion for knowledge-management use cases?: For buyers already standardised on Atlassian infrastructure, yes — integration breadth is a real procurement advantage, and Confluence AI's position inside the Atlassian suite means zero additional vendor onboarding for IT. For buyers evaluating knowledge-management platforms without an existing Atlassian commitment, Confluence AI's generation quality (41 per cent of outputs requiring structural revision in our test) and document model constraints make it a difficult choice against either Notion or Coda on the merits of the AI layer alone. The decision should be driven by the integration surface the buyer needs, not by the AI features in isolation.
What governance controls should a buyer require before deploying Notion AI agents against sensitive operational data?: At minimum: database-level write permission controls (not workspace-level), a documented escalation rule for any agent action that modifies data outside a defined scope, and a manual review step for any agent-generated content that feeds into external reporting or customer-facing communications. The current Notion permission model supports workspace-level controls adequately for non-sensitive deployments. For deployments touching HR, financial, or customer data, buyers should require contractual confirmation of data handling practices for agent inference operations and implement a human-in-the-loop review gate until Notion's structured audit export capability ships.

The 74 is a meaningful score in a category where most products are still assembling the preconditions for genuine agent utility. Notion earned it by building a document model that an agent can actually traverse, investing in register fidelity that makes generated content usable rather than merely passable, and shipping cross-database synthesis that works in natural language for the majority of real-world use cases. The governance gap is real and will cost procurement cycles with regulated buyers until it closes. The integration surface is right for the buyer Notion serves best and wrong for the buyer it is trying to reach next. Both of those things can be true simultaneously, and buyers who understand the distinction will make better decisions than buyers who treat either side as disqualifying.

This column will rescore Notion AI on the same rubric when structured audit exports and agent inference residency ship — or at the six-month mark, whichever comes first. The roadmap commitments are specific enough to hold the company to. The next score will tell us whether they were promises or projections.