Slides2Text
Convert presentations to readable formats — text, Word, or EPUB
Description
A Next.js converter that extracts text and speaker notes from PPTX files and outputs plain text, Word (.docx), or EPUB. Uses dual processing paths — server-side xml2js for files under 4MB, browser-native DOMParser for larger files — with zero file storage. XML tree walking with namespace-aware traversal handles PPTX’s a:r/a:t elements while deduplicating animation-induced repeats.
Visual Demo
Key Technical Decisions
Zero file storage
In-memory processing only — no temp files or cloud storage needed, simplifying deployment and privacy.
Namespace-aware XML traversal
PPTX uses a:r/a:t elements; custom tree walking handles namespaces and deduplicates animation-induced text repeats.
Architecture
Next.js app → JSZip + xml2js (server) or DOMParser (browser) → docx / EPUB / text output
Tech Stack
By the Numbers
Dual processing paths: server and browser
3 output formats: plain text, Word, EPUB
Speaker notes extraction with slide-number filtering
Zero file storage — in-memory processing only