PDFs are widely used for document storage and data sharing, yet extracting structured information, especially tables, can be challenging and time-consuming. Many organisations struggle to retrieve tabular data efficiently for further processing, often relying on manual methods, on third-party tools that don't always give enough flexibility, or on in-house tools to complete the extraction.
- Learn How to Build Microservices – By designing a modular system that processes PDFs, extracts tables, and serves structured data, I will explore the principles of service decomposition, scalability, and communication.
- Understand the Technology Choices – Implementing messaging, REST APIs, and worker queues will give hands-on experience with asynchronous processing, event-driven architecture, and the practical trade-offs in system design.
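
The asynchronous pattern named in the goals above can be illustrated with a minimal in-memory sketch. It is not the project's implementation: the class `JobQueueSketch`, its methods, and the `Status` values are all hypothetical, and a plain `BlockingQueue` stands in for whatever message broker the real services use. The point it shows is that submitting a job returns immediately with an id, while a worker processes the job later and a status lookup reports progress.

```java
import java.util.Map;
import java.util.UUID;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch: every name here is illustrative, not taken from the project.
class JobQueueSketch {
    enum Status { QUEUED, PROCESSING, DONE }

    // Stand-in for a real message broker.
    private final BlockingQueue<String> queue = new LinkedBlockingQueue<>();
    private final Map<String, Status> statuses = new ConcurrentHashMap<>();

    // Called by the API layer: records the job and returns at once with its id.
    String submit() {
        String jobId = UUID.randomUUID().toString();
        statuses.put(jobId, Status.QUEUED);
        queue.add(jobId);
        return jobId;
    }

    // What a status endpoint would look up; null if the id is unknown.
    Status status(String jobId) {
        return statuses.get(jobId);
    }

    // Called by a worker: takes one job off the queue and processes it.
    // Returns false if no job arrived within the timeout or the wait was interrupted.
    boolean processNext(long timeoutMillis) {
        String jobId;
        try {
            jobId = queue.poll(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        if (jobId == null) {
            return false;
        }
        statuses.put(jobId, Status.PROCESSING);
        // Placeholder for the real work, e.g. extracting tables from a PDF.
        statuses.put(jobId, Status.DONE);
        return true;
    }

    public static void main(String[] args) throws InterruptedException {
        JobQueueSketch service = new JobQueueSketch();
        String id = service.submit();
        System.out.println(service.status(id)); // QUEUED: no worker has run yet

        Thread worker = new Thread(() -> service.processNext(1000));
        worker.start();
        worker.join();
        System.out.println(service.status(id)); // DONE
    }
}
```

Decoupling submission from processing in this way is what lets the API respond quickly while slow PDF work happens on separate worker nodes.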

TODO
- Front-end.
- Finish the Express integration: proxying is complete and the service is fully working. The API still needs to expose a status endpoint.
- Change the worker service to allow processing of PDFs to images, and allow nodes to disable specific job types.
- Error endpoint for front-end clients.
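
One possible shape for the planned per-node job type switch is sketched below. This is only an assumption about how it might look: the `JobType` values and the `WorkerNodeSketch` class are hypothetical and do not come from the existing worker service. Each node keeps a set of enabled job types and checks it before accepting work.

```java
import java.util.EnumSet;
import java.util.Set;

// Hypothetical sketch: the job types and class name are illustrative assumptions.
class WorkerNodeSketch {
    enum JobType { TABLE_EXTRACTION, PDF_TO_IMAGE }

    // Every job type is enabled until the node is told otherwise.
    private final Set<JobType> enabled = EnumSet.allOf(JobType.class);

    // Stop this node from accepting the given job type.
    void disable(JobType type) {
        enabled.remove(type);
    }

    // True if this node may take a job of the given type.
    boolean accepts(JobType type) {
        return enabled.contains(type);
    }

    public static void main(String[] args) {
        WorkerNodeSketch node = new WorkerNodeSketch();
        node.disable(JobType.PDF_TO_IMAGE);
        System.out.println(node.accepts(JobType.TABLE_EXTRACTION)); // true
        System.out.println(node.accepts(JobType.PDF_TO_IMAGE));     // false
    }
}
```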

To build and run the services:
- Run clean compile install on the Main library, followed by the Worker library.
- Run clean compile package on the Worker-Management-Service and the Worker Service.
- Run docker compose up --build inside PDF-Microservices-File-Configurations/Job Service. This will start up the job service.
- To start the backend server, run docker compose up --build inside PDF-Microservices-Backend-Service.