Skip to main content
0 votes
0 answers
34 views

How to fetch specific file following the pattern for RAG in AWS bedrock?

I have created a knowledgebase in AWS and attached an S3 datasource to it. Now I want to perform query on specific files using RAG. When you create a datasource in AWS it creates serverless ...
Makarand's user avatar
  • 616
0 votes
1 answer
58 views

Error raised by bedrock service: when calling the InvokeModel operation: Malformed input request

ValueError: Error raised by bedrock service: An error occurred (ValidationException) when calling the InvokeModel operation: Malformed input request: #: required key [messages] not found, please ...
DIVYANSH TRIVEDI's user avatar
0 votes
0 answers
31 views

KeyFrame detection in python

I'm building a RAG system for a platform where the primary content consists of videos and slides. My approach involves extracting keyframes from videos using OpenCV diff = cv2.absdiff(prev_image, ...
Daniel's user avatar
  • 11
0 votes
0 answers
52 views

How to expand context window based on metadata of the vector-store collection

I have a working RAG code, using Langchain and Milvus. Now I'd like to add the feature to look at the metadata of each of the extracted k documents, and do the following: find the paragraph_id of ...
ArieAI's user avatar
  • 514
0 votes
0 answers
413 views

BM25Retriever + ChromaDB Hybrid Search Optimization using LangChain

For those who have integrated the ChromaDB client with the Langchain framework, I am proposing the following approach to implement the Hybrid search (Vector Search + BM25Retriever): from ...
Diallo Francis Patrick's user avatar
0 votes
0 answers
20 views

Index type 0x73726576 ("vers") not recognized

I have a chat bot app that I can run without any problem in my local environment. I can both run it locally on pycharm and I can run a docker container locally again. then I deploy it to koyeb using ...
mehmet's user avatar
  • 39
0 votes
0 answers
183 views

RAG on Mac (M3) with langchain (RetrievalQA): code runs indefinitely

I'm trying to run a RAG system on my mac M3-pro (18gb RAM) using langchain and `Llama-3.2-3B-Instruct` on a jupyter notebook (and the vector storage is Milvus). When I am invoking RetrievalQA....
ArieAI's user avatar
  • 514
0 votes
1 answer
809 views

ModuleNotFoundError: No module named 'huggingface_hub.inference._types'

I am running a RAG pipeline, with LlamaIndex and quantized LLama3-8B-Instruct. I just installed these libraries: !pip install --upgrade huggingface_hub !pip install --upgrade peft !pip install llama-...
Hoang Cuong Nguyen's user avatar
1 vote
1 answer
83 views

Creating an index in PyMilvus 2.5.x does not actually index any rows

I am trying to create an index on text embeddings for a RAG system with Milvus 2.5.x as vector database in Python. I have already create the collections and populated them. My dataset size is quite ...
Liqs's user avatar
  • 197
0 votes
0 answers
40 views

How can I augment a JSON document when I have an application that is designed to use OpenAI to answer user questions about the document?

I am using an OpenAI API to make a local application that answers questions using a certain document as a main reference. I am using JavaScript where I integrated the OpenAI API, and then CSS and HTML ...
Emmanuel Mark Mones's user avatar
0 votes
0 answers
8 views

IS there anyway to create a interaction between chatbot using langchain

I have created a chatbot using langchain and openai embeddings which reads pdf and breaks down into chuncks to retrieve answers . now i have multiple similar context in the pdf but it gives the top ...
Syed shabaz hyder's user avatar
1 vote
0 answers
28 views

Llamaindex Bug: ToolInteractiveReflectionAgentWorker not doing corrective reflection

I tried exactly the code here line by line but with a different contents of the tool (shouldn't matter): https://docs.llamaindex.ai/en/stable/examples/agent/introspective_agent_toxicity_reduction/ ...
Burny's user avatar
  • 11
1 vote
0 answers
304 views

code walkthrough of chain syntax in langchain [duplicate]

I am following a RAG tutorial from: https://medium.com/@vndee.huynh/build-your-own-rag-and-run-it-locally-langchain-ollama-streamlit-181d42805895 In the tutorial there is a section that creates a ...
Null Salad's user avatar
  • 1,060
0 votes
1 answer
804 views

Getting Tokens Usage Metadata from Gemini LLM calls in LangChain RAG RunnableSequence

I would like to have the token utilisation of my RAG chain each time it is invoked. No matter what I do, I can't seem to find the right way to output the total tokens from the Gemini model I'm using. ...
Matheus Torquato's user avatar
6 votes
0 answers
354 views

Best Approach to Evaluate a Graph RAG Pipeline Using Metrics?

I’ve developed a Graph RAG (Retrieval-Augmented Generation) pipeline that performs reasoning over a knowledge graph. Given a user query, the pipeline retrieves relevant nodes and relationships in the ...
LLM_Enthusiast's user avatar

15 30 50 per page
1
2 3 4 5
9