Skip to main content
Advice
0 votes
4 replies
88 views

I am working with a university faculty salary dataset where the same person appears across many years, but their name strings are inconsistent. The dataset has about 8,000 unique people and years from ...
Mengyang Cao's user avatar
-1 votes
2 answers
167 views

I'd like to process some input queries in 3 possible ways: query: select * from People query: select * from People exclude addresses query: select * from People include department I have two regex1 ...
DayaMoon's user avatar
  • 364
0 votes
2 answers
87 views

I'm working with many tabular datasets (Excel, CSV) that contain inconsistent or messy column names due to typos, different naming conventions, spacing, punctuation, etc. I have a standard schema (as ...
Ste347789's user avatar
0 votes
1 answer
56 views

In SLES15 SP6 on x86_64 I'm using a bash script and expect-5.45.4 to do automated program testing. Basically I'm checking whether the program to test (./pwg.pl) outputs a specific string. Starting to ...
U. Windl's user avatar
  • 4,748
-2 votes
1 answer
116 views

I'm working with two datasets for German NUTS-3 level regions: A shapefile from Eurostat via the giscoR package: > library(giscoR) > nuts3_germany <- gisco_get_nuts(country = "Germany&...
Saïd Maanan's user avatar
4 votes
4 answers
169 views

Let's say I want to match any sequence of the hash sign # at the start of a string; so I'd want to match ## here: local mystr = "##First line\nSecond line\nThird line" ... and ### here: ...
sdbbs's user avatar
  • 5,948
2 votes
3 answers
123 views

I have a column in Pandas DataFrame(Names) with a large collection of names. I have another DataFrame(Title) text column and in between text, the names in Name frame are there. What would be the ...
Totura's user avatar
  • 167
2 votes
0 answers
88 views

This question is a little complicated, so I try to describe it through an example. First, we get a string foo, and put it into collection S. Then we get a string sample, and put it into S too. Next, ...
differentrain's user avatar
1 vote
1 answer
71 views

I have a database with three columns: name, occupation, and organization. In these columns, I have duplicates with slightly different names. For example, Anne Sue Frank and Anne S. Frank refer to the ...
Vitoria Sanchez's user avatar
0 votes
2 answers
86 views

savvy people, I will have participants of an event sign up where they, aside from their personal details, also provide a duo partners name or leave that blank. So, I will have two columns, ...
Lex Plantenga's user avatar
1 vote
3 answers
96 views

I have a large pandas DataFrames like below. import pandas as pd import numpy as np df = pd.DataFrame( [ ("1", "Dixon Street", "Auckland"), ("2&...
Totura's user avatar
  • 167
0 votes
1 answer
90 views

this seems like it should be an easy problem to solve, but I've been battling with it and cannot seem to find a solution. I have two dataframes of different sizes and different column names. I am ...
Rose_Trojan's user avatar
1 vote
1 answer
79 views

I'm trying to write a regex that matches every occurrence of some_function(...), but it should not match when it's part of an object method like my.some_function(...) or if it is a substring of ...
JVS's user avatar
  • 2,682
2 votes
2 answers
88 views

Do Kotlin's List/Array data structures have a findSublist method analogous to String.indexOf(CharSequence), that takes a List/Array/Sequence to match against the list?
tpdi's user avatar
  • 35.3k
1 vote
0 answers
78 views

What I'm trying to do is find and correct similar names in my database, like 'Patrick Maxwell' and 'Patrick Maxwel.' However, the issue I'm facing is that the best match for each name is often itself, ...
Kauan Randall Oliveira Ferreir's user avatar

15 30 50 per page
1
2 3 4 5
157