Problem

PDFs are widely used for document storage and data sharing, yet extracting structured information from them, especially tables, can be challenging and time-consuming. Many organisations struggle to retrieve tabular data efficiently for further processing, often relying on manual methods or third-party tools that don't always offer enough flexibility, or on in-house tools to complete the extraction.

Motivation for building this project:

Learn How to Build Microservices – By designing a modular system that processes PDFs, extracts tables, and serves structured data, I will explore the principles of service decomposition, scalability, and communication.
Understand the Technology Choices – Implementing messaging, REST APIs, and worker queues will give hands-on experience with asynchronous processing, event-driven architecture, and the practical trade-offs in system design.

Project Planning

Network Arch

Client flow diagram

Communication

TODO

Front-end
Finish the Express integration. Proxying is complete and the service is fully working; the API still needs to provide a status API.
Change the worker service to support converting PDFs to images, and allow nodes to disable specific job types.
Add an error endpoint for front-end clients.
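
The TODO item about letting nodes disable specific job types could be approached along the following lines. This is only a minimal sketch under stated assumptions, not the project's implementation: the names WorkerNodeSketch, WorkerNode, JobType, TABLE_EXTRACTION and PDF_TO_IMAGE are all hypothetical and do not come from the repository.

```java
import java.util.Collections;
import java.util.EnumSet;
import java.util.Set;

public class WorkerNodeSketch {

    /** Hypothetical job types; the real project may name or model these differently. */
    enum JobType { TABLE_EXTRACTION, PDF_TO_IMAGE }

    /** Hypothetical worker node that refuses the job types disabled in its configuration. */
    static final class WorkerNode {
        private final Set<JobType> enabled;

        WorkerNode(Set<JobType> disabled) {
            // Start from every known job type and drop the ones disabled for this node.
            EnumSet<JobType> types = EnumSet.allOf(JobType.class);
            types.removeAll(disabled);
            this.enabled = Collections.unmodifiableSet(types);
        }

        /** Returns true if this node is allowed to pick up jobs of the given type. */
        boolean accepts(JobType type) {
            return enabled.contains(type);
        }
    }

    public static void main(String[] args) {
        // A node configured to skip PDF-to-image conversion but still extract tables.
        WorkerNode node = new WorkerNode(EnumSet.of(JobType.PDF_TO_IMAGE));
        for (JobType type : JobType.values()) {
            System.out.println(type + " accepted: " + node.accepts(type));
        }
    }
}
```

Keeping the check in one accepts method means the worker can consult it before taking a job from the queue, so a disabled job type stays available for other nodes rather than failing on this one.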

How to setup

Run clean compile install on the Main library, followed by the Worker library.
Run clean compile package on the Worker-Management-Service and the Worker Service.
Run docker compose up --build inside PDF-Microservices-File-Configurations/Job Service. This starts the job service.

To start the backend-server, run docker compose up --build inside the PDF-Microservices-Backend-Service.

Postman documentation

Go to documentation

zer0origin/PDF-Microservices
