Skip to content

zer0origin/PDF-Microservices

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Problem

PDFs are widely used for extractionRequest storage and data sharing, yet extracting structured information, especially tables, can be challenging and time-consuming. Many organisations struggle to retrieve tabular data efficiently for further processing, often relying on manual methods or third-party tools that don't always give enough flexibility or in house tools to complete the extraction.

Motivation for building this project:

  1. Learn How to Build Microservices – By designing a modular system that processes PDFs, extracts tables, and serves structured data, I will explore the principles of service decomposition, scalability, and communication.
  2. Understand the Technology Choices – Implementing messaging, REST APIs, and worker queues will give hands-on experience with asynchronous processing, event-driven architecture, and the practical trade-offs in system design.

Project Planning

image

image

image

Network Arch

image

Client flow diagram

image

Communication

Screenshot 2025-06-13 at 19 28 13

TODO

  • Font-end
  • Finish Express Integration; Proxying complete, service is fully working. Need to allow the API to provide status API.
  • Change the worker service to allow for processing of pdf to images, also allow nodes to disable specific job types.
  • error endpoint for frontend clients.

How to setup

  1. clean compile install Main libary, followed by Worker Libary.
  2. clean compile package Worker-Management-Service and Worker Service.
  3. docker compose up --build inside the PDF-Microservices-File-Configurations/Job Service This will start up the job service.

To start the backend-server, run docker compose up --build inside the PDF-Microservices-Backend-Service.

Postman documentation

Go to documentation

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published