Back to Portfolio

Slides2Text

Web

Convert presentations to readable formats — text, Word, or EPUB

Description

A Next.js converter that extracts text and speaker notes from PPTX files and outputs plain text, Word (.docx), or EPUB. Uses dual processing paths — server-side xml2js for files under 4MB, browser-native DOMParser for larger files — with zero file storage. XML tree walking with namespace-aware traversal handles PPTX’s a:r/a:t elements while deduplicating animation-induced repeats.

Visual Demo

Key Technical Decisions

Zero file storage

In-memory processing only — no temp files or cloud storage needed, simplifying deployment and privacy.

Namespace-aware XML traversal

PPTX uses a:r/a:t elements; custom tree walking handles namespaces and deduplicates animation-induced text repeats.

Architecture

Next.js app → JSZip + xml2js (server) or DOMParser (browser) → docx / EPUB / text output

Tech Stack

Next.js React TypeScript Tailwind CSS Vercel

By the Numbers

Dual processing paths: server and browser

3 output formats: plain text, Word, EPUB

Speaker notes extraction with slide-number filtering

Zero file storage — in-memory processing only