Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is there a tutorial on how to extract table from pdf or image for Apple Vision Framework. I tried the two links in your post and it just extracts the text without maintaining the table structure.

AWS textract provides sample python code to extract tables into csv which works great.



The best way I've found for extracting tables from PDFs in a well formatted way is Adobe's free online service:

https://www.adobe.com/acrobat/online/pdf-to-excel.html


I had good repeated success extracting tables from PDFs using Camelot (Python, https://github.com/camelot-dev/camelot)


Thanks will check it out.

Have you compared it with Textract?




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: