Skip to main content

Questions tagged [natural-language-processing]

For questions related to natural language processing (NLP), which is concerned with the interactions between computers and human (or natural) languages, in particular how to create programs that process and analyze large amounts of natural language data.

1 vote
0 answers
12 views

I understand that transformer-based language models can generate highly fluent responses, and that retrieval-augmented generation (RAG) is often used to improve factual grounding by supplying relevant ...
Avalon Brooks's user avatar
0 votes
1 answer
26 views

I am a student who has recently started learning about artificial intelligence and reasoning systems, so I apologize in advance if this question is already well known in the literature. Many modern ...
Sagar P.'s user avatar
1 vote
2 answers
86 views

Large language models such as GPT, LLaMA, and Claude are trained on massive datasets and can generate highly coherent text. However, they still frequently produce incorrect or fabricated information, ...
Avalon Brooks's user avatar
0 votes
1 answer
13 views

I have a website which shows different products about machines and its different parts. There are 10000 of product pages, and want to build a functionality which shows similar product pages and ...
Learner's user avatar
2 votes
1 answer
58 views

I’m currently building an AI chatbot that converts natural language user queries into accurate SQL queries, executes them on a database, and returns the results in a simple, readable format. At the ...
Rahul Prasadh's user avatar
4 votes
1 answer
146 views

I’m studying how modern large language models (LLMs) generate factual and verifiable outputs. Despite improvements in training data quality and model alignment, hallucinations still occur. My question ...
Avalon Brooks's user avatar
3 votes
4 answers
3k views

Can AI hallucination be regarded as an error in which the model is incapable of sacrificing the quality of language of its response for its integrity, due to its discrete, all or none nature? An ...
Mohamed El Nawawy's user avatar
3 votes
1 answer
54 views

I've frequently seen it mentioned that the embedding of the [CLS] token from BERT can capture the context of the sequence because it "attends to all the other tokens". But BERT implements ...
The Hagen's user avatar
  • 133
2 votes
2 answers
212 views

I’m new to AI and NLP, and I’m trying to understand how different tools fit together. Large Language Models (LLMs) like OpenAI, DeepSeek, or Gemini can already handle many NLP tasks text ...
itsdevthen's user avatar
0 votes
1 answer
97 views

I want to code (C++) a method allowing a character C1 to ask or request something from another character C2. The answer of C2 will be environment related: does it knows the thing C1 is looking for? ...
philB's user avatar
  • 101
0 votes
0 answers
49 views

I’m trying to identify which home services are present vs explicitly excluded in short, free-text reports. I also need to normalize synonyms (e.g., “pressure washing” → “power washing”). Goal: decide ...
ulajci's user avatar
  • 1
2 votes
1 answer
107 views

Before the LLM explosion, the tradidional NLP tasks, such as parsing, coreference resolution, translation to logical representation, temporal and event sequence resolution -- all have approached a ...
horsh's user avatar
  • 121
0 votes
0 answers
71 views

I'm using HuggingFace's 'blip-image-captioning-base' model for image captioning. I trained it on both existing and domain-specific datasets I created specifically for generating Turkish language ...
Batuhan's user avatar
  • 21
2 votes
0 answers
53 views

Lately, I've been working on generating alt text for images using the BLIP model. The model I use is "blip-image-captioning-base" from HuggingFace. However, to generate alt text in Turkish ...
Batuhan's user avatar
  • 21
0 votes
0 answers
52 views

**In my personal project, "The Circular Vision: an equation to find signal‑carrying humans", I propose a symbolic framework to think about how some human works (songs, poems, scientific ...
Ramiro Goicoechea's user avatar

15 30 50 per page