125 questions
0 votes, 0 answers, 34 views
How to fetch specific files matching a pattern for RAG in AWS Bedrock?
I have created a knowledge base in AWS and attached an S3 data source to it. Now I want to query specific files using RAG.
When you create a data source in AWS, it creates serverless ...
0 votes, 1 answer, 58 views
Error raised by bedrock service: when calling the InvokeModel operation: Malformed input request
ValueError: Error raised by bedrock service: An error occurred (ValidationException) when calling the InvokeModel operation: Malformed input request: #: required key [messages] not found, please ...
0 votes, 0 answers, 31 views
Keyframe detection in Python
I'm building a RAG system for a platform where the primary content consists of videos and slides. My approach involves extracting keyframes from videos using OpenCV:
diff = cv2.absdiff(prev_image, ...
0 votes, 0 answers, 52 views
How to expand context window based on metadata of the vector-store collection
I have working RAG code using LangChain and Milvus. Now I'd like to add a feature that looks at the metadata of each of the k retrieved documents and does the following:
find the paragraph_id of ...
0 votes, 0 answers, 413 views
BM25Retriever + ChromaDB Hybrid Search Optimization using LangChain
For those who have integrated the ChromaDB client with the LangChain framework, I am proposing the following approach to implement hybrid search (vector search + BM25Retriever):
from ...
0 votes, 0 answers, 20 views
Index type 0x73726576 ("vers") not recognized
I have a chatbot app that runs without any problem in my local environment.
I can run it locally in PyCharm, and I can also run it locally in a Docker container. Then I deploy it to Koyeb using ...
0 votes, 0 answers, 183 views
RAG on Mac (M3) with langchain (RetrievalQA): code runs indefinitely
I'm trying to run a RAG system on my Mac M3 Pro (18 GB RAM) using LangChain and `Llama-3.2-3B-Instruct` in a Jupyter notebook (the vector store is Milvus).
When I am invoking RetrievalQA....
0 votes, 1 answer, 809 views
ModuleNotFoundError: No module named 'huggingface_hub.inference._types'
I am running a RAG pipeline with LlamaIndex and a quantized Llama3-8B-Instruct. I just installed these libraries:
!pip install --upgrade huggingface_hub
!pip install --upgrade peft
!pip install llama-...
1 vote, 1 answer, 83 views
Creating an index in PyMilvus 2.5.x does not actually index any rows
I am trying to create an index on text embeddings for a RAG system with Milvus 2.5.x as the vector database in Python. I have already created the collections and populated them. My dataset size is quite ...
0 votes, 0 answers, 40 views
How can I augment a JSON document when I have an application that is designed to use OpenAI to answer user questions about the document?
I am using the OpenAI API to build a local application that answers questions using a certain document as its main reference. I am using JavaScript, where I integrated the OpenAI API, and then CSS and HTML ...
0 votes, 0 answers, 8 views
Is there any way to create an interaction with a chatbot using LangChain?
I have created a chatbot using LangChain and OpenAI embeddings that reads a PDF and breaks it down into chunks to retrieve answers. Now I have multiple similar contexts in the PDF, but it gives the top ...
1 vote, 0 answers, 28 views
LlamaIndex bug: ToolInteractiveReflectionAgentWorker not doing corrective reflection
I tried exactly the code here, line by line, but with different tool contents (which shouldn't matter):
https://docs.llamaindex.ai/en/stable/examples/agent/introspective_agent_toxicity_reduction/
...
1 vote, 0 answers, 304 views
Code walkthrough of chain syntax in LangChain [duplicate]
I am following a RAG tutorial from: https://medium.com/@vndee.huynh/build-your-own-rag-and-run-it-locally-langchain-ollama-streamlit-181d42805895
In the tutorial there is a section that creates a ...
0 votes, 1 answer, 804 views
Getting Tokens Usage Metadata from Gemini LLM calls in LangChain RAG RunnableSequence
I would like to have the token utilisation of my RAG chain each time it is invoked.
No matter what I do, I can't seem to find the right way to output the total tokens from the Gemini model I'm using.
...
6 votes, 0 answers, 354 views
Best Approach to Evaluate a Graph RAG Pipeline Using Metrics?
I’ve developed a Graph RAG (Retrieval-Augmented Generation) pipeline that performs reasoning over a knowledge graph. Given a user query, the pipeline retrieves relevant nodes and relationships in the ...