DeepSlide Multi-Agent System Improves Scientific Presentation Delivery and Narrative Flow via Sequential Rendering

A recent research paper introduced DeepSlide, a system designed to move beyond static slide generation by focusing on the actual presentation delivery process. Unlike traditional AI generators that prioritize visual aesthetics, this multi-agent framework manages requirement elicitation, narrative planning, and rehearsal support. It utilizes a logical-chain planner that assigns specific time budgets to each section of the talk to ensure better pacing for the speaker. The technical architecture integrates a content-tree retriever for evidence grounding and a Markov-style sequential rendering engine with style inheritance. This approach ensures that the generated slides remain consistent and renderable through sandboxed execution. The system aims to solve common issues in automated slide creation, such as disjointed scripts and poor synergy between visual aids and spoken narrative. To evaluate performance, the researchers introduced a dual-scoreboard benchmark that distinguishes between static artifact quality and dynamic delivery excellence. Testing across twenty domains showed that while DeepSlide matches existing baselines for slide visuals, it significantly outperforms them in delivery metrics. Specifically, it improves narrative flow and attention guidance, providing a more cohesive experience for both the presenter and the audience.
Related tools
Recommended tools for this topic
These picks prioritize high-intent tools relevant to this topic. Some links may include partner or affiliate tracking.
Strong fit for AI, backend, and frontend readers looking for an AI-first coding workflow.
View CursorHigh-value hosting and deployment path for frontend and cloud readers.
View VercelA strong security and edge platform match across CDN, Zero Trust, and app protection.
View CloudflareComparison
| Aspect | Before / Alternative | After / This |
|---|---|---|
| Optimization Goal | Visual appearance and static deck artifacts | Full delivery process including pacing and rehearsal |
| Narrative Structure | Unconstrained or template-based generation | Logical-chain planning with per-node time budgets |
| Rendering Method | Independent slide generation | Markov-style sequential rendering with style inheritance |
| Grounding Source | General LLM knowledge or basic RAG | Lightweight content-tree retriever for evidence grounding |
| Evaluation Metric | Subjective visual quality and layout | Dual-scoreboard separating artifacts from delivery flow |
Source: arXiv
This page summarizes the original source. Check the source for full details.

