frontend Priority 4/5 5/19/2026, 11:05:47 AM

DeepSlide Multi-Agent System Improves Scientific Presentation Delivery and Narrative Flow via Sequential Rendering

A recent arXiv paper introduces DeepSlide, a multi-agent system that moves beyond static slide generation to support the delivery of a presentation. Where typical AI slide generators prioritize visual aesthetics, DeepSlide also handles requirement elicitation, narrative planning, and rehearsal support. Its logical-chain planner assigns a time budget to each section of the talk so the speaker can pace the presentation.

The architecture combines a content-tree retriever for evidence grounding with a Markov-style sequential rendering engine that uses style inheritance. Together with sandboxed execution, this keeps the generated slides consistent and renderable. The system targets common problems in automated slide creation, such as disjointed scripts and poor coordination between visual aids and the spoken narrative.

To evaluate performance, the researchers introduce a dual-scoreboard benchmark that scores static artifact quality separately from dynamic delivery quality. In tests across twenty domains, DeepSlide reportedly matches existing baselines on slide visuals and significantly outperforms them on delivery metrics, in particular narrative flow and attention guidance, giving a more cohesive experience for both presenter and audience.
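The two planning ideas in the summary, per-section time budgets and sequential rendering with style inheritance, can be illustrated with a small sketch. This is not DeepSlide's implementation: the `Section` class, the `weight` field, the proportional split, and the dictionary-based styles are all assumptions made for illustration, since the summary does not describe how the paper computes budgets or represents styles.

```python
from dataclasses import dataclass, field


@dataclass
class Section:
    """Hypothetical plan node: a title, a relative weight, and style overrides."""
    title: str
    weight: float
    style_override: dict = field(default_factory=dict)


def allocate_time(sections, total_seconds):
    """Split the talk duration across sections in proportion to their weights.

    Proportional allocation is an assumption; the paper may budget time differently.
    """
    total_weight = sum(s.weight for s in sections)
    return [total_seconds * s.weight / total_weight for s in sections]


def render_sequentially(sections, base_style):
    """Build slides in order, each inheriting the previous slide's style.

    The style of slide i depends only on the style of slide i-1 plus its own
    overrides, which is the first-order (Markov-style) dependency sketched here.
    """
    slides = []
    prev_style = dict(base_style)
    for section in sections:
        style = {**prev_style, **section.style_override}
        slides.append({"title": section.title, "style": style})
        prev_style = style
    return slides


# Illustrative 10-minute talk with three sections.
sections = [
    Section("Motivation", 1),
    Section("Method", 3, {"accent": "blue"}),
    Section("Results", 2),
]
budgets = allocate_time(sections, 600)  # [100.0, 300.0, 200.0]
slides = render_sequentially(sections, {"font": "sans", "accent": "gray"})
```

In this sketch the "Results" slide ends up with the blue accent even though it sets no override, because it inherits from the "Method" slide before it. That carry-over is what keeps a deck visually consistent when slides are generated one after another rather than independently.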

#arxiv #research #ai #agent

Comparison

Aspect | Before / Alternative | After / This
Optimization Goal | Visual appearance and static deck artifacts | Full delivery process including pacing and rehearsal
Narrative Structure | Unconstrained or template-based generation | Logical-chain planning with per-node time budgets
Rendering Method | Independent slide generation | Markov-style sequential rendering with style inheritance
Grounding Source | General LLM knowledge or basic RAG | Lightweight content-tree retriever for evidence grounding
Evaluation Metric | Subjective visual quality and layout | Dual-scoreboard separating artifacts from delivery flow

Source: arXiv

This page summarizes the original source. Check the source for full details.
