Questions tagged [csv]
Comma-Separated Values are a list of plain text values delimited by commas, or a file containing one or more lists in that format.
105 questions
2
votes
0
answers
29
views
Looking for guidance on what to do with a small dataset to showcase for a potential employer
I'm preparing a portfolio project using my personal finance data (600 rows) to demonstrate skills for entry-level data analyst/internship positions. The dataset structure is:
...
5
votes
1
answer
156
views
Data for football matches
I am doing a project for my bachelor's thesis. I want to build an ML project in which I calculate the outcomes of a football match in terms of probabilities, for example, a 40% chance for the home team to ...
1
vote
0
answers
86
views
How to Design a JSON Schema for Mapping Survey Data from a CSV File with Complex Question Types?
I’m working with a large dataset containing survey responses stored in a CSV file with over 100 columns. I want to map this data to a JSON schema for better structure and downstream processing. ...
1
vote
2
answers
155
views
Merging Turbine and Forecast Data Diagonally
I am trying to combine my main dataset, which is hourly data collected from a turbine, with forecast data. The forecast data is collected by calling an API every hour to get a new .csv file which is ...
1
vote
1
answer
52
views
What are these 'code' columns in this CSV and what does the pandas pivot method do with them?
In a CSV discussed in the book HoML (Hands-On Machine Learning), what do the values in capital letters represent? Are these some sort of code? Also, it seems the pandas.pivot() method does something ...
0
votes
1
answer
359
views
Writing a Python dictionary to a CSV [closed]
I have a dictionary called dict_object
dict_object = class_object.__dict__
print(type(dict_object))
print(dict_object)
This code gave the following output:
...
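One possible way to write such a dictionary to a CSV file is sketched here. The `Example` class and its attributes are hypothetical stand-ins, since the actual contents of `dict_object` are elided above. The sketch uses `csv.DictWriter` from the standard library and treats the dictionary keys as the header row; it assumes all values are simple scalars.

```python
import csv
import io


class Example:
    """Hypothetical stand-in for the question's class_object."""

    def __init__(self):
        self.name = "sample"
        self.count = 3


def dict_to_csv(dict_object, handle):
    """Write one dictionary as a header row plus a single data row."""
    writer = csv.DictWriter(handle, fieldnames=list(dict_object))
    writer.writeheader()
    writer.writerow(dict_object)


class_object = Example()
dict_object = class_object.__dict__

# An in-memory buffer keeps the sketch self-contained; a real file opened with
# open("out.csv", "w", newline="") would work the same way.
buffer = io.StringIO()
dict_to_csv(dict_object, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

Nested objects or lists inside the dictionary would need flattening first, which this sketch does not attempt.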
4
votes
1
answer
222
views
Suitability of column store formats like Parquet for table data
I'm new to the data analytics world, but perhaps my question could help others; we all start somewhere. We have a process that extracts data from an SQL database and stores it in temp files as CSV ...
1
vote
2
answers
845
views
What is the source_column argument in the CSV loader?
In this tutorial, what is the purpose of the source_column argument? Does it act like a primary key in databases? Thanks in advance.
...
3
votes
2
answers
324
views
Neural network not learning at all
I am training an MLP on a tabular dataset, the pendigits dataset. The problem is that training loss and accuracy are more or less stable, while validation and test loss and accuracy are completely constant....
0
votes
1
answer
70
views
How to Manipulate a Dataset
I have a raw dataset of images that I got from Kaggle. It has already been classified, but I want to randomly split the information in an 80:20 ratio between train and test. The problem is that since all information ...
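A random 80:20 split can be sketched with the standard library alone. This is only an illustration under assumptions: the (file name, label) pairs below are made up, and the function name `split_train_test` is hypothetical. It assumes each image can be listed as one item before splitting.

```python
import random


def split_train_test(items, train_fraction=0.8, seed=0):
    """Shuffle a copy of items and cut it at the train_fraction mark."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical (file name, label) pairs standing in for the Kaggle images.
samples = [(f"img_{i}.png", i % 2) for i in range(10)]
train, test = split_train_test(samples)
print(len(train), len(test))
```

Fixing the seed makes the split reproducible. This sketch does not stratify by label, so class proportions may differ slightly between the two parts.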
1
vote
0
answers
47
views
Python train convolutional on numerical values shape issue
I want to train a convolutional neural network autoencoder on a CSV file which contains pixel neighborhood position values from an original 1024x1024 image. When I try to train it, I have the ...
2
votes
1
answer
3k
views
pandas doesn't recognize missing values in CSV [closed]
In the CSV there are missing values in many columns, but
the code shows there are no missing values.
How can I fix this?
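One common cause of this symptom is that missing entries are written as placeholder text that a parser treats as ordinary values. Whether that applies here is an assumption, since the question does not show the data. The sketch below uses only the standard library to count cells that look like placeholders in each column; the token set `PLACEHOLDERS` and the sample data are hypothetical.

```python
import csv
import io

# Hypothetical placeholder tokens; adjust to whatever the file actually uses.
PLACEHOLDERS = {"", "?", "-", "n/a", "none", "missing"}


def count_placeholders(handle):
    """Return {column: number of cells that look like a missing value}."""
    reader = csv.DictReader(handle)
    counts = {name: 0 for name in reader.fieldnames}
    for row in reader:
        for name in counts:
            value = row[name]
            # Short rows yield None for the columns they lack.
            if value is None or value.strip().lower() in PLACEHOLDERS:
                counts[name] += 1
    return counts


# Made-up sample data for illustration only.
sample = io.StringIO("age,city\n34,?\n?,Oslo\n,Rome\n")
missing_counts = count_placeholders(sample)
print(missing_counts)
```

Once the placeholder tokens are known, they can be passed to `pandas.read_csv` through its `na_values` argument so that pandas reads them as missing.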
6
votes
4
answers
1k
views
How can I observe my CSV files better? [closed]
I'm running a lot of experiments that give their output as CSV files. An experiment might be running for hours, with a new line being added to the CSV every 10 seconds.
Right now I'm opening these CSV ...
4
votes
4
answers
2k
views
Algorithm to determine whether the first row in a CSV is likely to be a header row or a data row
I have a fairly simple problem. I am trying to determine whether the first row in a CSV is likely to be a header row or a data row. Looking at a single column, the problem can be simplified to: I have a ...
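One simple heuristic, offered as a sketch rather than the asker's method: treat the first row as a header if at least one column is numeric in every later row but not in the first row. The function names below are hypothetical and the sample rows are made up. The standard library also ships its own heuristic as `csv.Sniffer().has_header`.

```python
import csv
import io


def is_number(text):
    """True if text parses as a float."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def first_row_looks_like_header(rows):
    """Guess 'header' when some column is numeric in every data row
    but not in the first row."""
    if len(rows) < 2:
        return False  # not enough evidence either way
    first, data = rows[0], rows[1:]
    for col, cell in enumerate(first):
        column = [row[col] for row in data if col < len(row)]
        if column and all(is_number(v) for v in column) and not is_number(cell):
            return True
    return False


with_header = list(csv.reader(io.StringIO("name,age\nalice,30\nbob,25\n")))
without_header = list(csv.reader(io.StringIO("carol,41\nalice,30\nbob,25\n")))
print(first_row_looks_like_header(with_header))
print(first_row_looks_like_header(without_header))
```

This rule cannot decide files whose columns are all text; for those, another signal such as string length or vocabulary overlap with later rows would be needed.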
0
votes
1
answer
207
views
read_csv, non-printing ascii delimiters, and multi index
I have some text data which uses the ASCII control characters 0x1e "record separator" and 0x1d "group separator"...