Add initial implementation of PDF-to-Excel table extractor

- Create main script for extracting tables from PDF files and saving to Excel format.
- Add dependency checks for required libraries and Ghostscript.
- Implement functions for extracting tables and saving them to Excel.
- Update README with usage instructions and examples.
- Add devcontainer configuration for development environment.
- Include .gitignore to exclude PDF and Excel files from version control.
- Specify required packages in requirements.txt.
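
The dependency checks mentioned above could look roughly like the sketch below. It is not the code from this commit: the function names are hypothetical, the module names in the usage section are placeholders for whatever requirements.txt lists, and the Ghostscript executable names (`gs`, `gswin64c`, `gswin32c`) are the usual ones on Unix and Windows, assumed rather than taken from the script.

```python
import importlib.util
import shutil


def find_missing_modules(module_names):
    """Return the top-level module names that cannot be located for import."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in module_names
            if importlib.util.find_spec(name) is None]


def find_ghostscript(candidates=("gs", "gswin64c", "gswin32c")):
    """Return the path of the first Ghostscript executable on PATH, or None."""
    for exe in candidates:
        path = shutil.which(exe)
        if path:
            return path
    return None


if __name__ == "__main__":
    # Assumed module names for illustration only; use the ones from requirements.txt.
    missing = find_missing_modules(["pandas", "openpyxl"])
    if missing:
        print("Missing Python packages:", ", ".join(missing))
    if find_ghostscript() is None:
        print("Ghostscript executable not found on PATH")
```

Running the checks before any extraction starts lets the script exit early with a readable message instead of failing partway through a PDF.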

Author: Lukas Holzner
Date: 2025-12-02 09:22:42 +00:00
Commit: 6b18868ca6
5 changed files with 473 additions and 0 deletions

.gitignore

@@ -0,0 +1,2 @@
*.pdf
*.xlsx