Questions tagged [management]

1 vote
1 answer
56 views

I have collected all my data for a study and need to run my analysis but have come unstuck (I should have planned better beforehand I know). I'm looking to see whether personality traits (five trait ...
Jay's user avatar
  • 11
4 votes
2 answers
118 views

This is more of an infrastructural question about data science. How would you manage data merging in your GitHub repository? As an example, as a data scientist I might be working on my branch and ...
Mattia Surricchio's user avatar
0 votes
1 answer
110 views

I have a few million records (small CSV/JSON files) from different sources, with about 50k added every day, all on my local host. Until now, I have been using a simple file structure to manage ...
Johnny's user avatar
  • 111
1 vote
0 answers
36 views

When working with someone whose background and skill level in data work may not be strong, how do you best make the argument for tidy data over "pretty" data? There are notes of what I want ...
Rob Creel's user avatar
  • 111
4 votes
2 answers
188 views

I'm in charge of a small data science team (3 data scientists, me included). We do our projects with at least one business person (PM) per project (we have 5 of these). We managed everything with ...
Diogo Santos's user avatar
3 votes
1 answer
729 views

When working on data sets, we sometimes want to keep track of multiple models with different architectures that work on the same data set, to which we have applied some transformations and preprocessing ...
Shrijit Basak's user avatar
1 vote
1 answer
58 views

Do you think it's normal for data science projects to have some amount of "what should we do" time? Or does it diminish with experience? By "what should we do" time I refer to time being spent on ...
mavavilj's user avatar
  • 426
2 votes
1 answer
159 views

Most of the time in Data Science projects is not spent in (performing) actual analytics but rather in other tasks, such as organizing data sources, collecting samples and preparing datasets, compiling ...
Gonzalo Garcia's user avatar
4 votes
1 answer
150 views

I am trying to find best practices for scaling data science teams, i.e., find an efficient workflow/methodology to divide work between Software Engineers and Researchers working on the same product. I’...
cyberjoac's user avatar
  • 141
0 votes
2 answers
56 views

After searching Google for quite some time, I could not find a suitable software/toolbox that can manage the training of neural networks. I thought of a program that combines visualization ...
Lukas's user avatar
  • 11
3 votes
1 answer
86 views

I have a series of scripts. Some are in R, some in Python, and others in SAS. I have built them in such a way that one script outputs a .csv file that the next script reads, and then that script outputs a ....
conv3d's user avatar
  • 205
2 votes
2 answers
65 views

When you have multiple methods to accomplish a task, how do you choose which one to implement?
DBinJP's user avatar
  • 123
2 votes
1 answer
249 views

Can anyone please tell me how clustering is used in data management? I was recently asked this question and was left totally stumped.
Paul B's user avatar
  • 21
1 vote
0 answers
81 views

I will soon deal with multiple projects in Python. Some of them have to run regularly (many times a day), they can take some time (many days), and they use/produce some data coming from and going to ...
Robin's user avatar
  • 1,347