All Questions
4 questions
0 votes, 1 answer, 183 views
How do I iterate through JSON files stored in different folders of a GCP bucket? Example: Bucket/Dict/Folder2/file.json, Bucket/Dict/Folder1/file.json
I have dumped JSON files from Document AI to GCP, but each file is stored in its own folder, although they are all in the same bucket on Cloud Storage. I am not able to iterate through the JSON files stored ...
0 votes, 1 answer, 1k views
How to locally process a batch of files using Document AI with the Python client?
I'm trying to use the Document OCR processor from the Python console to locally process a large number of PDF documents (native and scanned) and extract the text and some metadata. The documents are ...
0 votes, 3 answers, 1k views
How can I split a PDF in Google Cloud Storage?
I have a single PDF and would like to create a separate PDF for each of its pages. How would I be able to do so without downloading anything locally? I know that Document AI has a file splitting ...
1 vote, 2 answers, 7k views
google.api_core.exceptions.InternalServerError: 500 Failed to process all the documents
I am getting this error when trying to implement Document OCR from Google Cloud in Python, as explained here: https://cloud.google.com/document-ai/docs/ocr#documentai_process_document-python.
When ...