-9 votes
0 answers
115 views

I'm trying to extract text from a video. I created cropped frames from the video using ffmpeg and saved them as PNGs. Now, I'm using those PNGs to extract the text using EasyOCR. At first, I tried ...
frkn
  • 7
-6 votes
0 answers
91 views

I am building a system where: an Android device captures an image using the camera; I use adb pull to transfer the image from /sdcard/DCIM/Camera; then I process it using pytesseract OCR in Python ...
Gurusai Gudi
Best practices
0 votes
3 replies
72 views

I have an on-prem OCR pipeline that returns extracted text inside a JSON blob. I parse the LLM response and call a local normalizer before returning the text to callers. Example call site: result = ...
agaonsindhe
Advice
0 votes
1 reply
94 views

I’m working on a project where I want to use AI / computer vision to read MEP (Mechanical, Electrical, Plumbing) drawings or floor plans. My goal is to: Detect rooms and extract their labels (e.g., “...
Narmeen Zafar
Advice
0 votes
0 replies
28 views

I’m architecting a document processing pipeline for a real-time workflow. I have 5 document types, but they require two completely different extraction strategies. The Document for Dynamic Form: This ...
JS3
  • 1,951