All Questions

0 votes
1 answer
183 views

How do I iterate through JSON files stored in different folders of a GCP bucket? Example: Bucket/Dict/Folder2/file.json, Bucket/Dict/Folder1/file.json

I have dumped JSON files from DOCAI to GCP, but each file is stored in its own folder, although they are all in the same bucket on Cloud Storage. I am not able to iterate through the JSON files stored ...
Vedant Patil
0 votes
1 answer
1k views

How to locally process a batch of files using Document AI with the Python client?

I'm trying to use the Document OCR processor from the Python console to locally process a large number of PDF documents (native and scanned) to extract the text and some metadata. The documents are ...
Vojta Partík
0 votes
3 answers
1k views

How can I split a PDF in Google Cloud Storage?

I have a single PDF and would like to create a separate PDF for each of its pages. How would I be able to do so without downloading anything locally? I know that Document AI has a file splitting ...
saladass4254
1 vote
2 answers
7k views

google.api_core.exceptions.InternalServerError: 500 Failed to process all the documents

I am getting this error when trying to implement Document OCR from Google Cloud in Python as explained here: https://cloud.google.com/document-ai/docs/ocr#documentai_process_document-python. When ...
MegaSpeed45