Skip to main content
Advice
1 vote
1 replies
36 views

I’m building a tool that generates new mathematics exam problems using an internal database of past problems. My current setup uses a RAG pipeline, Pinecone as the vector database, and GPT-5 as the ...
Marc-Loïc Abena's user avatar
1 vote
0 answers
39 views

I am trying to create a simple vector index for conversation AI application where i want to use radis as long-term memory. i configured radis locally and created the index which ideally stores "...
Hari's user avatar
  • 11
0 votes
0 answers
28 views

Question: I'm building a memory-augmented AI system using RAG with persistent vector storage, but facing memory leaks and context contamination between sessions. Problem: Vector embeddings aren't ...
TensorMind's user avatar
0 votes
0 answers
207 views

I'm working with LangChain and trying to create a vector database from a text file containing book descriptions and ISBNs. Each line in the text file is a separate book description, so I split the ...
Gruncio's user avatar
  • 11
0 votes
1 answer
79 views

I'm trying to use MUVERA compression with Jina ColBERT v2 embeddings in Weaviate, following the official documentation. However, MUVERA compression is not being applied: I'm still getting raw multi-...
tat's user avatar
  • 351
0 votes
1 answer
42 views

I'm looking for advice on a vector search problem that goes against the grain of standard similarity searches. What I have: I'm using Genkit with a vector database (Firestore) that's populated with ...
TinyTiger's user avatar
  • 2,255
1 vote
1 answer
329 views

I'm using Qdrant in a cluster setup and want to define a payload_schema for a collection. I want to ensure that the structure of my payload is recognized and listed in the collection metadata. Here’s ...
Tim's user avatar
  • 47
0 votes
0 answers
43 views

I'm working on a RAG pipeline using a vector database to search over a Q&A dataset. I'm using embedding-based dense retrieval to fetch relevant answers to user queries. The issue I'm facing is ...
MojtabaMAleki02's user avatar
0 votes
0 answers
470 views

Context I am working on a semantic search application and using Qdrant to store three types of embeddings per document: Dense embeddings (from OpenAI) Sparse embeddings (from Qdrant/BM25) Rerank ...
Himanshu Gupta's user avatar
0 votes
0 answers
120 views

This one is weird to me as I did create the embeddings with text model 004. I persisted the database and had to zip it and upload it to hugging face utilizing git LFS. I planned on unzipping that ...
Ico's user avatar
  • 1
1 vote
0 answers
60 views

In Milvus, with an IVF Flat Index, I understand that nlist clusters are made on the whole index. Now If I make a query with highly selective metadata filtering, will Milvus choose nprobe clusters from ...
Goutham 's user avatar
0 votes
1 answer
66 views

I am using BAAI/bge-large-en-v1.5 model to embed and then store these embeddings in ChromaDB vector-store. These embeddings are in the memory and using HNSW indexing. Is there a way I can find out the ...
Jarvis's user avatar
  • 13
0 votes
2 answers
217 views

I tried to retrieve documents with similar content to later modify them, but when it came to updating, I realized I couldn't get the ID to update these documents. Here’s the function I was using to ...
DepressedChalk's user avatar
0 votes
1 answer
538 views

I am working on a Retrieval-Augmented Generation (RAG) pipeline and need to attach my documents to an Azure OpenAI Assistant. I have followed these steps: Processed Documents: Extracted text and ...
Kevin Scrimgeour's user avatar
0 votes
0 answers
37 views

I’m testing Milvus upsert functionality with 1,000 records. Here’s my workflow: Create a collection Insert data (1,000 records) Flush the data Create an HNSW index Observe the collection stats: ...
tmandyai's user avatar

15 30 50 per page
1
2 3 4 5
19