PDF to Excel Converter - Extract Tables from PDF

Convert PDF tables to Excel (XLSX/CSV). Auto table detection with preview. 100% free, no server upload.

About This Tool

PDF to Excel extracts tables and structured data from PDF documents into downloadable XLSX spreadsheets. It automatically detects table boundaries, rows, and columns, making it the go-to tool for accountants, analysts, and anyone who needs to work with tabular PDF data in Excel or Google Sheets.

Drag & drop PDF or click to upload

Text-based PDFs only (not scanned/image PDFs)

How to Use

  1. Drag and drop a PDF file, or click to upload.
  2. The tool automatically detects and extracts tables from all pages.
  3. Review the extracted data in the preview table.
  4. Download as CSV or XLSX format.

Frequently Asked Questions

Can I convert scanned PDFs (image-based)?

No. This tool only supports text-based PDFs. Scanned PDFs require OCR (Optical Character Recognition), which is not currently supported.

Does it handle non-English characters correctly?

Yes. CSV files are saved with UTF-8 BOM encoding, and XLSX files automatically support Unicode characters including Korean, Chinese, Japanese, etc.

Can I convert multi-page PDFs?

Yes! Tables are automatically detected from all pages. When downloading XLSX, each page is saved as a separate sheet.

Is my PDF uploaded to a server?

No. Everything is processed in your browser. Your file never leaves your device. It works even offline.

Does it handle complex tables (merged cells, nested tables)?

Basic table structures are extracted well. Complex layouts with merged cells may produce imperfect results. Check the preview after extraction.

Related Tools

How It Works

The PDF file is loaded as an ArrayBuffer via the File API and parsed using pdf.js to extract text content with precise positional coordinates (x, y, width, height) for each text item. A table-detection algorithm groups nearby text items into rows and columns based on their alignment and spacing patterns. The identified table structure is then written into an XLSX file using a client-side spreadsheet library (such as SheetJS/xlsx). Cell values, column widths, and row groupings are mapped to Excel cells. The final XLSX file is generated as a binary Blob for download.

All PDF parsing, table detection, and Excel generation happen entirely in your browser using pdf.js, SheetJS, and the File API. Your documents never leave your device.

Share

Related Articles

84+

Tools available

97+

Blog articles

English & 한국어

Languages

Bookmark this page! We add new free tools every week.