Glossary

Action item detection

Short answer

Action item detection is the automatic identification of commitments, tasks, and follow-ups in a conversation — capturing who agreed to do what, and by when — from natural speech rather than from a manually maintained list.

Last updated June 14, 2026

Action item detection is the automatic identification of commitments, tasks, and follow-ups in a conversation — capturing who agreed to do what, and by when — from natural speech rather than from a manually maintained list.

How it works

Action item detection runs on top of a transcript. As speech is converted to text — usually via streaming transcription with speaker diarization attributing each line to a person — a language model scans the conversation for the linguistic signals of a commitment: phrases like “I’ll send that over,” “can you take the deck,” “let’s circle back Friday,” or “we need to file by the 30th.” It then normalizes each into a structured item — an owner, an action, and (where stated) a due date — and discards the surrounding small talk.

The hard part isn’t spotting obvious assignments; it’s telling a real commitment apart from hypotheticals (“we could reach out to legal”) and rhetorical filler. Good detection leans on speaker attribution and context, which is why it’s far more reliable when it shares a pipeline with diarization and a running summary of the discussion.

Why it matters

Most meeting follow-through fails not because people disagree on the work, but because the work is never written down while it’s fresh. Action item detection turns the spoken decisions of a call into a list someone can actually act on — without a participant having to type notes instead of paying attention.

Real-time vs. after-the-meeting detection

Most meeting tools detect action items after the call, as part of a static recap you read later. The limitation is timing: by the time the list appears, the meeting is over and you can no longer correct a misattributed task or confirm a date out loud.

Canary detects and surfaces commitments during the meeting, inside its live rolling summary. Because Canary is a real-time, bot-free meeting summarizer that reads your computer’s system audio — no bot in the call, no plugin — the action items build up in the live view as they’re spoken. That means you can catch a task with your name on it the instant it lands, the same way the live summary lets you see what you missed when you’re suddenly called on or catch up after stepping away from a call. It’s one more layer of real-time meeting summarization and live situational awareness, rather than a report you open once the moment to act has passed.