-9 votes
0 answers
115 views

I'm trying to extract text from a video. I created cropped frames from the video using ffmpeg and saved them as PNGs. Now, I'm using those PNGs to extract the text using EasyOCR. At first, I tried ...
frkn • 7
-6 votes
0 answers
91 views

I am building a system where: (1) an Android device captures an image using the camera; (2) I use adb pull to transfer the image from /sdcard/DCIM/Camera; (3) I process it using pytesseract OCR in Python ...
Gurusai Gudi
Best practices
0 votes
3 replies
72 views

I have an on-prem OCR pipeline that returns extracted text inside a JSON blob. I parse the LLM response and call a local normalizer before returning the text to callers. Example call site: result = ...
agaonsindhe
Advice
0 votes
0 replies
28 views

I’m architecting a document processing pipeline for a real-time workflow. I have 5 document types, but they require two completely different extraction strategies. The Document for Dynamic Form: This ...
JS3 • 1,951
Advice
0 votes
1 reply
94 views

I’m working on a project where I want to use AI / computer vision to read MEP (Mechanical, Electrical, Plumbing) drawings or floor plans. My goal is to: Detect rooms and extract their labels (e.g., “...
Narmeen Zafar
3 votes
0 answers
90 views

Problem: I am building an OCR-based engine to digitize chess scoresheets using Python and python-chess. I use a string-similarity measure (specifically Jaro-Winkler) to map recognized text (e.g., "...
TacerTV • 27
Best practices
0 votes
7 replies
107 views

I am working on an OCR project and need to create a dataset consisting of approximately 1247 pages from 6 books. I need to crop the images line by line and transcribe the text for training a model. ...
Muhammad Asif
Advice
1 vote
1 reply
79 views

How can I create an Android library similar to Google ML Kit Text Recognition that works fully offline, detects text from images, and returns both the recognized text and the bounding boxes? I want to ...
hrithik.k
0 votes
0 answers
72 views

var ocr = new IronTesseract();
ocr.Language = OcrLanguage.Bengali;
// Optimization for high-density forms (18 boxes)
ocr.Configuration.ReadBarCodes = false;
ocr.Configuration.PageSegmentationMode = ...
enamul haque
0 votes
0 answers
48 views

I'm encountering a segmentation fault when trying to initialize PaddleOCR on a Raspberry Pi 5 (8GB RAM) running Python 3.13 in a virtual environment. The error occurs during the model loading phase. ...
Ching On Chan
Advice
0 votes
1 reply
73 views

I work on toponymy in France and need to know, for each city, its department and the origin of the city's name. To do so, I have a PDF of a scanned book (700 pages) that contains this information. ...
L_O • 1
3 votes
2 answers
1k views

I'm trying to use PaddleOCR:
from paddleocr import PaddleOCR
from PIL import Image
# Initialize the OCR engine
ocr = PaddleOCR(use_textline_orientation=False, lang='es')
# Run OCR on an image path ...
Angel Panda
3 votes
0 answers
401 views

The Department of Justice has recently released Volumes 09 and 10 of the Epstein files. Among them is a PDF: https://www.justice.gov/epstein/files/DataSet%209/EFTA01012650.pdf This PDF contains ...
Maximilian Gerhardt
Advice
0 votes
3 replies
79 views

I am working on a Flutter app that needs to read a digital odometer's value from an image. The goal is to take an image of the odometer, crop it as closely as possible around the number, and then do text ...
schmru • 619
3 votes
0 answers
92 views

I have a Python problem regarding OCR of timestamp/GPS overlays in video files. The video files contain camera footage of seagrass meadows (screenshot). I would like to extract the date and time (left ...
Philipp • 31
