Skip to main content

Questions tagged [regex]

Regular expressions provide a declarative language to match patterns within strings. They are commonly used for string validation, parsing, and transformation. Since regular expressions are not fully standardized, all questions with this tag should also include a tag specifying the applicable programming language or tool. NOTE: Asking for HTML, JSON, etc. regexes tends to be met with negative reactions. If there is a parser for it, use that instead.

4 votes
0 answers
261 views

I have data in pandas as below: 123-543-2345 876|678|3469 304-762-2467 Trying to change all to this format: 123-543-2345 I ...
Alfred's user avatar
  • 49
1 vote
1 answer
56 views

I want to extract entities using 5 entity lists that I have gathered. The python lists contain around a million elements in each. The lists are Nationality, Organization, Occupation, Religion, Hobby. ...
Aryaman Shukla's user avatar
3 votes
1 answer
254 views

I am using regex to extract specific sections from resumes, such as key skills, summary, and work experience. The approach involves: First, I extract the text from the resume based on predefined ...
Arfa Ahsan's user avatar
0 votes
1 answer
45 views

How can I classify an E-commerce URL Page into the following categories, Cart Payment Product Page Checkout How can I achieve this with the url and page title in my hand? I have tried multiple ways ...
Loukik's user avatar
  • 41
2 votes
0 answers
169 views

The dateparser package fails to detect texts like the following and generate a date range 'last 2 weeks of 2020': Should return 18th December 2020 - 31st December 2020 'first three quarters of 2018': ...
Zing's user avatar
  • 21
1 vote
1 answer
121 views

R newbie here. I'm doing some text analysis using the package quanteda. Basically, what I'm trying to do is put all the words follow the regex pattern ...
user116883's user avatar
-3 votes
1 answer
705 views

Hi guys do you know how to change this regex, that it only looks for test :
Shalomi90's user avatar
1 vote
1 answer
346 views

Is there a direct python pandas method to match values of series and update different series with some string ? I couldn’t find any direct method of doing it. Here the match is to find a value in a ...
user3016638's user avatar
0 votes
1 answer
355 views

I am working on a classification problem where I would like to separate business emails from personal emails to analyse their behaviours separately. I am thinking about using regex but after looking ...
nilsinelabore's user avatar
1 vote
0 answers
176 views

I need to parse around 1.6k REGEX expressions such as the pair I am writing below. I have also around 7k documents (1/2 page long each in average) that need to be parsed according to the REGEX ...
Luisda's user avatar
  • 31
2 votes
1 answer
7k views

I have a column named "MATCH" in a dataframe and a list of patterns named "PATTERN". ...
vicky's user avatar
  • 121
1 vote
1 answer
115 views

I would like to know what would be the best approach to compare two texts and see the differences between them. For example: ...
Math's user avatar
  • 161
0 votes
2 answers
2k views

I have a data set like the following, and the first column contains the groupings. However, some are labelled slightly differently. I need to remove all characters following the punctuation used (...
NewtoR's user avatar
  • 1
2 votes
1 answer
385 views

I am using rule based logic to extract features from resume. Basically I am trying to find if the candidate switched the company in less than 1 year. So I have the code in place to find it using ...
Akash's user avatar
  • 235
1 vote
1 answer
189 views

Is the result of a search for a specific n-gram like sherlock+holmes equal to the result of a regex search for "sherlock holmes" in the same document corpus? So if i read about n-grams for certain ...
bartman99's user avatar

15 30 50 per page