Is there a tutorial on how to extract table from pdf or image for Apple Vision F... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		sumedh on Jan 3, 2024 \| parent \| context \| favorite \| on: How to do OCR on a Mac using the CLI or just Pytho... Is there a tutorial on how to extract table from pdf or image for Apple Vision Framework. I tried the two links in your post and it just extracts the text without maintaining the table structure. AWS textract provides sample python code to extract tables into csv which works great.

dkjaudyeqooe on Jan 3, 2024 | [–]

The best way I've found for extracting tables from PDFs in a well formatted way is Adobe's free online service:

https://www.adobe.com/acrobat/online/pdf-to-excel.html

mcbetz on Jan 3, 2024 | [–]

I had good repeated success extracting tables from PDFs using Camelot (Python, https://github.com/camelot-dev/camelot)

sumedh on Jan 3, 2024 | [–]

Thanks will check it out.

Have you compared it with Textract?

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact