Questions tagged [natural-language-processing]
For questions related to natural language processing (NLP), which is concerned with the interactions between computers and human (or natural) languages, in particular how to create programs that process and analyze large amounts of natural language data.
22 questions from the last 365 days
1 vote · 0 answers · 12 views
Why do transformer models sometimes produce fluent but logically inconsistent answers even when retrieval provides the correct context?
I understand that transformer-based language models can generate highly fluent responses, and that retrieval-augmented generation (RAG) is often used to improve factual grounding by supplying relevant ...
0 votes · 1 answer · 26 views
Are there AI architectures where one model generates reasoning and another model verifies or monitors the reasoning process?
I am a student who has recently started learning about artificial intelligence and reasoning systems, so I apologize in advance if this question is already well known in the literature.
Many modern ...
1 vote · 2 answers · 86 views
Why do large language models hallucinate facts even when trained on large datasets?
Large language models such as GPT, LLaMA, and Claude are trained on massive datasets and can generate highly coherent text. However, they still frequently produce incorrect or fabricated information, ...
0 votes · 1 answer · 13 views
How to design similarity search for mechanical products
I have a website which shows different products related to machines and their parts.
There are 10,000 product pages, and I want to build functionality which shows similar product pages and ...
2 votes · 1 answer · 58 views
System Prompts & LLM [closed]
I’m currently building an AI chatbot that converts natural language user queries into accurate SQL queries, executes them on a database, and returns the results in a simple, readable format.
At the ...
4 votes · 1 answer · 146 views
What are the current state-of-the-art techniques for reducing hallucinations in large language models?
I’m studying how modern large language models (LLMs) generate factual and verifiable outputs. Despite improvements in training data quality and model alignment, hallucinations still occur.
My question ...
3 votes · 4 answers · 3k views
Can AI hallucination be regarded as a machine error?
Can AI hallucination be regarded as an error in which the model is incapable of sacrificing the linguistic quality of its response for its integrity, due to its discrete, all-or-none nature?
An ...
3 votes · 1 answer · 54 views
BERT [CLS] token captures context by attending to all other tokens -- isn't this true for vocab tokens too?
I've frequently seen it mentioned that the embedding of the [CLS] token from BERT can capture the context of the sequence because it "attends to all the other tokens". But BERT implements ...
2 votes · 2 answers · 212 views
If LLMs like OpenAI / DeepSeek / Gemini exist, why do we still need ML or NLP libraries, now and in the future?
I’m new to AI and NLP, and I’m trying to understand how different tools fit together.
Large Language Models (LLMs) like OpenAI, DeepSeek, or Gemini can already handle many NLP tasks: text ...
0 votes · 1 answer · 97 views
How do I manage a discussion between two characters in code?
I want to code (C++) a method allowing a character C1 to ask or request something from another character C2.
The answer of C2 will be environment related:
does it know the thing C1 is looking for?
...
0 votes · 0 answers · 49 views
Extracting services mentioned in short reports — rules vs ML?
I’m trying to identify which home services are present vs explicitly excluded in short, free-text reports. I also need to normalize synonyms (e.g., “pressure washing” → “power washing”).
Goal: decide ...
2 votes · 1 answer · 107 views
Are traditional NLP tasks solved well with modern transformer/LLM technologies?
Before the LLM explosion, the traditional NLP tasks, such as parsing, coreference resolution, translation to logical representation, and temporal and event-sequence resolution, all had approached a ...
0 votes · 0 answers · 71 views
How can I give context to the BLIP model when generating captions?
I'm using HuggingFace's 'blip-image-captioning-base' model for image captioning. I trained it on both existing and domain-specific datasets I created specifically for generating Turkish language ...
2 votes · 0 answers · 53 views
How can I integrate BERT Tokenizer into BLIP model for image captioning?
Lately, I've been working on generating alt text for images using the BLIP model. The model I use is "blip-image-captioning-base" from HuggingFace. However, to generate alt text in Turkish ...
0 votes · 0 answers · 52 views
How can a symbolic "traveler equation" help AI detect signal-carrying works across time?
In my personal project, "The Circular Vision: an equation to find signal‑carrying humans", I propose a symbolic framework to think about how some human works (songs, poems, scientific ...