Questions tagged [natural-language-processing]
Natural language processing draws knowledge from a diverse collection of fields including computer science, linguistics, and statistics in order to extract pertinent information from the spoken or written word.
42 questions
-2
votes
3
answers
209
views
Is it possible to distinguish between functional and non-functional software requirements with computer software? [closed]
I am trying to write a computer program that can distinguish functional from non-functional requirements written in Arabic, based on their linguistic structure, so I am looking for the things that ...
-2
votes
1
answer
169
views
Is there a way to encode binary into natural language?
I was wondering if there was a way to encode binary data, such as crypto addresses, into a natural language format, in a similar fashion to how what3words.com can encode locations into three easy-to-...
-1
votes
1
answer
59
views
Equal transformations on both indexed content and query content before a search is attempted
In search engine indexing, a body of text is often processed before it is indexed. A common example is stemming, where words are reduced to their root form (plurals are dropped, tense is normalized). ...
3
votes
1
answer
207
views
Designing the schema for a database of Spanish language words?
For a project I'm working on that will help people learn Spanish, I would like to create a standalone service to handle the retrieval of data about words. For this, I've captured and codified data ...
4
votes
2
answers
649
views
Database structure for word co-occurrence frequencies in a large corpus
I would like to store the frequencies with which words co-occur with each other over a variety of contexts in a large (> 1 billion tokens) text corpus. I need to store the word pair, the type of co-...
3
votes
0
answers
104
views
Software design strategy for a machine learning tool that outputs a subset of the text input (Information Extraction)?
Let's say I have thousands of PDFs, each about 30k words of conversational English. Each PDF contains the name or names of one or more people who snowboard. There are also many ...
3
votes
1
answer
94
views
What approaches can I take to figure out the "relevancy" of certain terms in a string?
I'm not even sure "relevancy" is the most accurate word, so I'll just describe the problem:
I'm building an app that needs to somehow parse product descriptions from a popular website (let's just say ...
0
votes
2
answers
99
views
Divergent Bodies of Truth, Relationships of Concepts
A recent endeavor of mine requires asking users how related two concepts are. Example: Kiwi and Fruit. Identical? Fairly Similar? Rather different? Unrelated?
It occurs to me that for certain ...
0
votes
1
answer
530
views
What is the best way to summarize a sentiment value for a paragraph of text based on the sentiment value for the sentences within it?
So I am using Stanford CoreNLP in my project. I have data which consists of reviews of products on a forum. I need to be able to assign a sentiment value to a given review. CoreNLP allows you to ...
1
vote
2
answers
352
views
Automatic summarization - whole text or paragraphs
I am writing a tool that will give users the ability to summarize text content on a webpage, by highlighting the text that they wish to get summarized.
So far, I've received results that I can work ...
3
votes
2
answers
690
views
How to build a class for comparing words in a lexical dictionary?
I want to create a class that stores words in a set so that I can check whether a word belongs to that set. I don't want to build every set every time I instantiate the class, so I'm using what ...
3
votes
2
answers
308
views
Is polymorphism appropriate for modeling natural language form (structure), or is there something better?
Let's take French and Japanese adjectives as simple examples.
French adjectives have gender (masculine or feminine) and number (singular or plural), whereas Japanese nouns have neither. However, ...
0
votes
1
answer
197
views
Which strategy to use in Client-Server messaging to trigger specific behavior in them?
I'm developing a virtual assistant to manage transactions in a business. I'm using an external service to handle natural language and get intents, actions, and parameters from my user requests.
The ...
2
votes
2
answers
2k
views
How to create an efficient implementation of a dictionary of synonyms?
How do I create a dictionary of synonyms that is efficient in terms of retrieving synonyms of a word?
Basically, I want to use some sort of data storage to store groups of words, and enable an end-...
1
vote
1
answer
751
views
What type of "spam filter" algorithm will tokenize characters for non-exact matches?
I was having a look at a basic example of spam filtering using a logistic regression algorithm and an answer posted on this Stack Overflow question: https://stackoverflow.com/questions/29291263/mllib-...