Project

General

Profile

Edit Copy Actions

Feature #7241

open

Optimize PDF Processing Pipeline and Large File Uploads to AWS S3

Added by Kalyan Ravula about 23 hours ago. Updated about 16 hours ago.

Status:
In Progress
Priority:
Normal
Assignee:
Target version:
-
Start date:
01/30/2026
Due date:
% Done:

90%

Estimated time:
6:00 h
Spent time:

Description

Currently, a single API endpoint is responsible for handling multiple operations, including:

Downloading a PDF file temporarily from AWS S3

Reading and extracting data from the PDF using a Python script

Processing extracted data (e.g., BOM extraction, matching, persistence)

Handling additional unrelated events within the same request lifecycle

This design has led to high API response times, increased memory usage, and poor scalability.
Additionally, large file uploads to AWS S3 are not optimized, resulting in slow uploads and potential failures under load.


Add

Subtasks


Add

Related issues

Edit Copy Actions

Also available in: Atom PDF