Let's say we have a dataset in csv. Let the data be representatively as follows. Let's assume that there are 1000 columns and 1000 rows in the csv that contains this data.
Let's say we use column A and B while performing regression and predict operations in the Keras library. I want to delete the duplicate data in A and leave only one. For example, if there are 5 of 1, only 4 will remain. At the same time, I want all 4 data deleted from duplicate data to be deleted from column B or any column X.
If we think of it as 2 different scenarios,
Duplicate data in column A is likewise deleted from column B or any other column.
The other scenario is to delete more than one, i.e. repetitive data in each column independently of each other.
The regression process needs to be performed using the keras module with the last remaining data.
Can you help with this?