Scribe Agent & Digest Agent¶
Thinklio Built-in Agent Specification — New Additions Version 0.1 | March 2026
These two agents are not in the initial starter catalogue but are identified here as natural extensions of the agent set. Both address gaps that become apparent once the coordinator agents are considered.
1. Scribe Agent¶
1.1 Purpose¶
The Scribe Agent operates during meetings and conversations to produce a faithful, structured record. It handles transcription, extracts key information, and produces outputs that other agents (particularly the Meeting Agent and Taskmaster) can act on immediately.
It solves a persistent problem: the gap between what happens in a meeting and what gets recorded. Action items get forgotten. Decisions aren't documented. Context that seemed obvious in the room is invisible to people who weren't there. The Scribe Agent makes the meeting's output as structured and durable as anything produced before or after it.
1.2 Input Modes¶
The Scribe Agent accepts meeting content in several forms:
| Input mode | Description |
|---|---|
| Live audio | Real-time audio stream from a meeting or call (via browser integration or telephony connector) |
| Audio file | Uploaded recording (MP3, M4A, WAV, etc.) |
| Video file | Uploaded video with audio track |
| Third-party transcript | Import from Zoom, Teams, Google Meet, or similar (structured or plain text) |
| Manual notes | User pastes or types unstructured notes; Scribe structures them |
| Integration | Direct connection to video conferencing platform APIs |
Not all input modes will be available at launch. Audio file upload and third-party transcript import are the lowest-barrier starting points. Live audio requires platform-level audio stream handling.
1.3 UI Structure¶
Chat Tab: Interact with the Scribe Agent about a specific session: - "Summarise the key decisions from yesterday's board meeting" - "What action items came out of the planning session?" - "Who was responsible for the budget review task?" - "Generate a follow-up email for the client meeting"
Sessions Tab: Library of all recorded and processed sessions, searchable by date, participants, and topics. Each session shows its processing status and available outputs.
Outputs Tab: All structured outputs produced from sessions: summaries, transcripts, action item lists, and follow-up drafts.
1.4 Processing Pipeline¶
For each session, the Scribe Agent produces outputs in stages:
Input (audio / transcript / notes)
↓
Stage 1: Raw transcript (if audio input)
Speaker diarisation (who is speaking)
Timestamp alignment
↓
Stage 2: Structured transcript
Clean formatting
Speaker labels resolved to names (from participant list)
Timestamps normalised
↓
Stage 3: Intelligent extraction
Meeting summary (key topics, key points)
Decisions made (explicit and implied)
Action items (what, who, by when)
Open questions (raised but not resolved)
Key participants and their positions
↓
Stage 4: Downstream outputs
Formatted meeting notes (markdown)
Action items → Taskmaster (for each named assignee)
Meeting log → Rolodex (Interaction record for all participants)
Follow-up draft → Writer Agent (optional, on request)
1.5 Action Item Extraction¶
Action items are extracted as structured objects:
ActionItem
├── description string
├── assigned_to string (name from transcript)
├── person_id UUID | null (Rolodex match)
├── due_date date | null (inferred from transcript language)
├── source_quote string (the words in the transcript that indicate this action)
└── confidence float
Low-confidence action items are flagged for human review rather than automatically created as tasks. The user reviews and confirms before Taskmaster records are created.
1.6 Configuration¶
| Setting | Description |
|---|---|
| Default participant list | Participants to use for speaker resolution when not specified at session start |
| Action item confidence threshold | Below this, action items are flagged for review rather than auto-created |
| Auto-create tasks | Whether confirmed action items are automatically sent to Taskmaster |
| Auto-log interactions | Whether completed sessions are automatically logged in Rolodex |
| Transcript storage | How long raw transcripts are retained (privacy consideration) |
| Speaker diarisation | Whether to attempt speaker identification in audio input |
1.7 Privacy Considerations¶
Meeting transcription is sensitive. The Scribe Agent must handle participant data with care:
- Participants should be aware their voice is being recorded and processed
- Transcripts containing sensitive business information should have appropriate access controls
- Raw audio is not stored beyond the transcription step unless explicitly configured
- Transcript content is scoped to the user or team that initiated the session
- The agent will not share transcript content with agents operating in different scope contexts
1.8 Use Cases¶
UC-1: Post-meeting processing A user finishes a 60-minute strategy meeting. They upload the Zoom recording to the Scribe Agent. The agent transcribes it, identifies four speakers, extracts eight action items, three decisions, and two open questions. It produces a meeting summary and flags two action items with low confidence for review. The user confirms six action items → Taskmaster creates tasks. The meeting is logged in Rolodex against all four participants.
UC-2: Notes structuring A user has disorganised notes from a call: "spoke to James - he wants revised scope by Friday - budget still tbc - Sarah to check with finance - meeting again next thursday." They paste this into the Scribe Agent. The agent structures it: summary (one paragraph), two action items (James: revised scope by Friday; Sarah: check budget with finance), one open question (budget), one scheduled follow-up (next Thursday). Taskmaster tasks created automatically.
UC-3: Meeting Agent integration The Meeting Agent invokes the Scribe Agent post-meeting as part of its standard post-meeting workflow. The Scribe Agent processes the transcript, returns structured outputs to the Meeting Agent, which uses them to: create Taskmaster tasks, draft a follow-up email for approval, and log the Rolodex Interaction.
2. Digest Agent¶
2.1 Purpose¶
The Digest Agent is a scheduled monitor and synthesiser. It watches multiple information sources across a configured time window, filters for what's relevant, and produces a unified briefing delivered to the user on a schedule they control.
It is different from the Personal Assistant's morning briefing in scope and intent. The PA's briefing is about today — tasks, calendar, urgent messages. The Digest Agent's briefing is about the world beyond the user's immediate task list: news on topics they follow, updates from monitored sources, signals from their external environment.
It answers the question: "What do I need to know that I didn't go looking for?"
2.2 Source Types¶
The Digest Agent can monitor and draw from:
| Source type | Examples |
|---|---|
| News searches | Topics, keywords, organisations, people |
| Research feeds | Academic publications, preprint servers, on a topic |
| Web monitoring | Specific sites or pages for changes |
| Internal sources | Items in the Enquiry Agent queue, Taskmaster patterns, Rolodex interactions |
| Agent outputs | Completed Research Agent runs, new Knowledge Base additions |
| Scheduled Research Agent | A Research Agent instance running on the same schedule feeds directly into the Digest |
2.3 UI Structure¶
Chat Tab: Query the digest or configure it conversationally: - "Add a digest topic: Australian aged care policy" - "What came through on the Thinklio competitive landscape this week?" - "Change my digest to weekly on Monday mornings"
Digests Tab: Archive of all produced digests, browsable and searchable. Each digest links to its source items.
Sources Tab: Configuration of monitored sources. Each source shows: type, query or URL, last checked, and items found. Add, edit, and remove sources from here.
Schedule Tab: When the digest runs, what format it takes, where it is delivered (in-platform, email, or both), and notification preferences.
2.4 Digest Structure¶
A digest is a structured document, not a raw feed:
Digest
├── period string ("Week of 17–23 March 2026")
├── theme_summary string (one-paragraph synthesis of the most significant developments)
├── sections[]
│ ├── topic string
│ ├── items[]
│ │ ├── title string
│ │ ├── source string
│ │ ├── summary string (2–3 sentences)
│ │ ├── url string
│ │ └── date date
│ └── synthesis string (what the agent thinks is significant about this cluster)
└── flagged_items[] (items the agent has identified as particularly significant)
The agent does not just aggregate — it synthesises. The theme_summary and per-section synthesis fields contain the agent's interpretation of what the collected items mean, why they matter, and what connections it notices between them. Users can configure how much synthesis they want (from "just the links" to "full analysis").
2.5 Configuration¶
| Setting | Description |
|---|---|
| Topics | List of topics, keywords, or entities to monitor |
| Sources | Specific sources to include (in addition to general search) |
| Period | Lookback window for each digest (e.g. last 7 days) |
| Schedule | When the digest is produced (daily, weekly, custom) |
| Delivery | In-platform only, email, or both |
| Synthesis level | links_only, summary, full_analysis |
| Minimum items | Don't produce a digest if fewer than N items found (avoids empty digests) |
| Item limit | Maximum items per topic |
2.6 Relationship to Research Agent¶
The Digest Agent uses the Research Agent as a delegate for source retrieval. When the digest schedule fires:
- The Digest Agent reads its configured topics and sources
- It invokes Research Agent instances (one or more, possibly in parallel) with appropriate parameters
- It receives source lists back
- It filters, clusters, and synthesises
- It produces the structured Digest output
- It delivers according to the configured schedule
The Research Agent's update frequency feature (scheduling re-runs) can be used to pre-populate fresh source lists that the Digest Agent then draws from, rather than the Digest Agent triggering Research Agent runs inline.
2.7 Use Cases¶
UC-1: Weekly professional digest A user configures a weekly digest covering: Australian aged care regulation, AI in healthcare, and Perth commercial property. Every Monday morning they receive a structured briefing covering developments across all three topics from the previous week, with the Digest Agent's synthesis of what's significant.
UC-2: Competitive monitoring A business development manager configures a daily digest monitoring competitors and industry news. The digest flags when a competitor makes a significant announcement, surfaces relevant analyst commentary, and notes any client-relevant developments. It arrives in their inbox each morning before they start the day.
UC-3: Internal digest A team lead configures a digest that monitors internal sources: new Items in the support queue (volume and type), completed Research Agent runs (what new research has been produced), and Taskmaster patterns (overdue task trends). This gives them a weekly operational pulse without having to check each system manually.
3. Open Questions¶
Scribe Agent¶
- Live audio transcription requires real-time streaming infrastructure that is qualitatively different from file upload. Should live audio be deferred to a later phase, with file upload and transcript import as the initial scope?
- Speaker diarisation quality varies significantly by audio quality and number of speakers. How does the agent handle poor audio? A graceful fallback to "Speaker 1, Speaker 2" with user-resolved names is better than failing.
- Retention of raw audio and transcripts is a significant privacy and compliance consideration. Is a configurable retention window sufficient, or should raw audio deletion be mandatory and immediate after transcription?
Digest Agent¶
- The synthesis feature requires the agent to make editorial judgements about what matters. This is valuable but introduces potential for bias or missed significance. Should synthesis be opt-in, with raw aggregation as the default?
- Digest delivery via email means the Mail Agent is in the dependency chain for the Digest Agent. The Digest Agent should degrade gracefully if the Mail Agent is not configured — deliver in-platform only.
- How should the Digest Agent handle a week with very little matching content? A "nothing significant this week" digest is still useful. An empty digest is not. The
minimum_itemsconfig setting addresses this but needs a good default.
These agents are proposed additions to the starter catalogue. They should be reviewed for inclusion in the Phase 2 agent set.