Newest 'retrieval-augmented-generation' Questions

0 votes

0 answers

34 views

How to fetch specific file following the pattern for RAG in AWS bedrock?

I have created a knowledgebase in AWS and attached an S3 datasource to it. Now I want to perform query on specific files using RAG. When you create a datasource in AWS it creates serverless ...

Makarand

616

asked Apr 14 at 3:52

0 votes

1 answer

58 views

Error raised by bedrock service: when calling the InvokeModel operation: Malformed input request

ValueError: Error raised by bedrock service: An error occurred (ValidationException) when calling the InvokeModel operation: Malformed input request: #: required key [messages] not found, please ...

DIVYANSH TRIVEDI

1

asked Mar 30 at 18:53

0 votes

0 answers

31 views

KeyFrame detection in python

I'm building a RAG system for a platform where the primary content consists of videos and slides. My approach involves extracting keyframes from videos using OpenCV diff = cv2.absdiff(prev_image, ...

Daniel

11

asked Mar 24 at 15:40

0 votes

0 answers

52 views

How to expand context window based on metadata of the vector-store collection

I have a working RAG code, using Langchain and Milvus. Now I'd like to add the feature to look at the metadata of each of the extracted k documents, and do the following: find the paragraph_id of ...

ArieAI

514

asked Mar 2 at 11:30

0 votes

0 answers

413 views

BM25Retriever + ChromaDB Hybrid Search Optimization using LangChain

For those who have integrated the ChromaDB client with the Langchain framework, I am proposing the following approach to implement the Hybrid search (Vector Search + BM25Retriever): from ...

Diallo Francis Patrick

177

asked Mar 1 at 14:31

0 votes

0 answers

20 views

Index type 0x73726576 ("vers") not recognized

I have a chat bot app that I can run without any problem in my local environment. I can both run it locally on pycharm and I can run a docker container locally again. then I deploy it to koyeb using ...

mehmet

39

asked Feb 28 at 12:36

0 votes

0 answers

183 views

RAG on Mac (M3) with langchain (RetrievalQA): code runs indefinitely

I'm trying to run a RAG system on my mac M3-pro (18gb RAM) using langchain and `Llama-3.2-3B-Instruct` on a jupyter notebook (and the vector storage is Milvus). When I am invoking RetrievalQA....

ArieAI

514

asked Jan 13 at 10:32

0 votes

1 answer

809 views

ModuleNotFoundError: No module named 'huggingface_hub.inference._types'

I am running a RAG pipeline, with LlamaIndex and quantized LLama3-8B-Instruct. I just installed these libraries: !pip install --upgrade huggingface_hub !pip install --upgrade peft !pip install llama-...

Hoang Cuong Nguyen

441

asked Dec 21, 2024 at 4:34

1 vote

1 answer

83 views

Creating an index in PyMilvus 2.5.x does not actually index any rows

I am trying to create an index on text embeddings for a RAG system with Milvus 2.5.x as vector database in Python. I have already create the collections and populated them. My dataset size is quite ...

Liqs

197

asked Dec 17, 2024 at 14:10

0 votes

0 answers

40 views

How can I augment a JSON document when I have an application that is designed to use OpenAI to answer user questions about the document?

I am using an OpenAI API to make a local application that answers questions using a certain document as a main reference. I am using JavaScript where I integrated the OpenAI API, and then CSS and HTML ...

Emmanuel Mark Mones

1

asked Dec 3, 2024 at 11:30

0 votes

0 answers

8 views

IS there anyway to create a interaction between chatbot using langchain

I have created a chatbot using langchain and openai embeddings which reads pdf and breaks down into chuncks to retrieve answers . now i have multiple similar context in the pdf but it gives the top ...

Syed shabaz hyder

1

asked Nov 27, 2024 at 16:38

1 vote

0 answers

28 views

Llamaindex Bug: ToolInteractiveReflectionAgentWorker not doing corrective reflection

I tried exactly the code here line by line but with a different contents of the tool (shouldn't matter): https://docs.llamaindex.ai/en/stable/examples/agent/introspective_agent_toxicity_reduction/ ...

Burny

11

asked Oct 19, 2024 at 10:57

1 vote

0 answers

304 views

code walkthrough of chain syntax in langchain [duplicate]

I am following a RAG tutorial from: https://medium.com/@vndee.huynh/build-your-own-rag-and-run-it-locally-langchain-ollama-streamlit-181d42805895 In the tutorial there is a section that creates a ...

Null Salad

1,060

asked Oct 17, 2024 at 20:28

0 votes

1 answer

804 views

Getting Tokens Usage Metadata from Gemini LLM calls in LangChain RAG RunnableSequence

I would like to have the token utilisation of my RAG chain each time it is invoked. No matter what I do, I can't seem to find the right way to output the total tokens from the Gemini model I'm using. ...

Matheus Torquato

1,639

asked Sep 30, 2024 at 15:04

6 votes

0 answers

354 views

Best Approach to Evaluate a Graph RAG Pipeline Using Metrics?

I’ve developed a Graph RAG (Retrieval-Augmented Generation) pipeline that performs reasoning over a knowledge graph. Given a user query, the pipeline retrieves relevant nodes and relationships in the ...

LLM_Enthusiast

87

asked Aug 17, 2024 at 3:44

Collectives™ on Stack Overflow

How to fetch specific file following the pattern for RAG in AWS bedrock?

Error raised by bedrock service: when calling the InvokeModel operation: Malformed input request

KeyFrame detection in python

How to expand context window based on metadata of the vector-store collection

BM25Retriever + ChromaDB Hybrid Search Optimization using LangChain

Index type 0x73726576 ("vers") not recognized

RAG on Mac (M3) with langchain (RetrievalQA): code runs indefinitely

ModuleNotFoundError: No module named 'huggingface_hub.inference._types'

Creating an index in PyMilvus 2.5.x does not actually index any rows

How can I augment a JSON document when I have an application that is designed to use OpenAI to answer user questions about the document?

IS there anyway to create a interaction between chatbot using langchain

Llamaindex Bug: ToolInteractiveReflectionAgentWorker not doing corrective reflection

code walkthrough of chain syntax in langchain [duplicate]

Getting Tokens Usage Metadata from Gemini LLM calls in LangChain RAG RunnableSequence

Best Approach to Evaluate a Graph RAG Pipeline Using Metrics?

Hot Network Questions

Collectives™ on Stack Overflow

Related Tags