Interim transcription results: the provisional text a live transcriber emits before finalizing

Interim transcription results are the provisional, low-latency text hypotheses a streaming speech-to-text engine emits while a person is still speaking — refined and then replaced by a final result once the phrase settles. They are also called partial results, interim hypotheses, or non-final results.

Interim vs final results

A streaming transcriber returns two kinds of output for the same stretch of audio:

Interim (partial) results appear within a fraction of a second of words being spoken. They are a best guess and may change as more audio arrives — “we should ship” might first appear as “we shoot,” then correct itself.
Final results are emitted once the engine is confident a phrase is complete. They are stable and won’t be revised.

The interim results are what make text appear to type itself out word by word during a live caption or transcript. The trade-off is accuracy for speed: you see something instantly, but the wording may still shift.

Why interim results matter for live summaries

A summary can only be as fresh as the text feeding it. If a system waits for final results before doing anything, the live view always lags real speech by a sentence or more. Consuming interim results instead lets the “now” view track the conversation almost as it happens — which is exactly what you need when your name is suddenly called and you want to know what you just missed.

How it fits together

Interim results sit on top of streaming transcription, which itself relies on voice activity detection to know when speech is happening. The stream of interim and final text is then condensed by real-time meeting summarization into a rolling summary.

Canary leans on interim results so its “now” pane reflects what’s being said the instant it’s said — part of why you can get a live summary during a Zoom call without a bot in the meeting.

Interim transcription results

Interim vs final results

Why interim results matter for live summaries

How it fits together

Related terms & questions