How can I remove Nan from list Python/NumPy

Question

I have a list that countain values, one of the values I got is 'nan'

countries= [nan, 'USA', 'UK', 'France']

I tried to remove it, but I everytime get an error

cleanedList = [x for x in countries if (math.isnan(x) == True)]
TypeError: a float is required

When I tried this one :

cleanedList = cities[np.logical_not(np.isnan(countries))]
cleanedList = cities[~np.isnan(countries)]

TypeError: ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''

That looks like the string "nan", not an actual NaN value. — BrenBarn, Commented Jan 9, 2014 at 4:49
if condition == True is unnecessary, you can always just do if condition. — reem, Commented Jan 9, 2014 at 5:35
No solution provided so far are not satisfying. I have the same problem. Basically, it does not work for strings. Therefore in your case np.isnan('USA') will send the same error message. If I find some solution I will upload it. — Yohan Obadia, Commented Jan 26, 2017 at 12:52

score 218 · Accepted Answer · 2022-04-13 08:02:44Z

218

The question has changed, so too has the answer:

Strings can't be tested using math.isnan as this expects a float argument. In your countries list, you have floats and strings.

In your case the following should suffice:

cleanedList = [x for x in countries if str(x) != 'nan']

Old answer

In your countries list, the literal 'nan' is a string not the Python float nan which is equivalent to:

float('NaN')

In your case the following should suffice:

cleanedList = [x for x in countries if x != 'nan']

edited Apr 13, 2022 at 8:02

answered Jan 9, 2014 at 4:51

user764357

1

Logically, what you say is true. But it didn't work out with me.
– user3001937
Commented Jan 9, 2014 at 5:02
Then the problem is in another area, the array you gave is strings which math.isnan will naturall through errors with.
– user764357
Commented Jan 9, 2014 at 5:06
Yes ! when I print the output, I got this : [nan, 'USA', 'UK', 'France']
– user3001937
Commented Jan 9, 2014 at 5:07
1

@user3001937 I've updated the answer based on the new information
– user764357
Commented Jan 9, 2014 at 5:15
2

zhangxaochen: it is not a string, it is a float. Look carefully at the updated answer; Lego Stormtroopr's converting x to a string so you can compare it. nan always returns false for ==, even when compared to nan, so that's the easiest way to compare it.
– Free Monica Cellio
Commented Jan 9, 2014 at 6:30

| Show 4 more comments

vlmercado · Accepted Answer · 2022-04-13 08:03:53Z

65

Using your example where...

countries= [nan, 'USA', 'UK', 'France']

Since nan is not equal to nan (nan != nan) and countries[0] = nan, you should observe the following:

countries[0] == countries[0]
False

However,

countries[1] == countries[1]
True
countries[2] == countries[2]
True
countries[3] == countries[3]
True

Therefore, the following should work:

cleanedList = [x for x in countries if x == x]

edited Apr 13, 2022 at 8:03

user7864386

answered May 11, 2018 at 17:16

vlmercado

1,8982 gold badges18 silver badges20 bronze badges

5

This is the only answer that works when you have a float('nan') in a list of strings
– user2317421
Commented Jun 15, 2019 at 21:56

Add a comment |

Yohan Obadia · Accepted Answer · 2022-04-13 08:08:39Z

24

The problem comes from the fact that np.isnan() does not handle string values correctly. For example, if you do:

np.isnan("A")
TypeError: ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''

However the pandas version pd.isnull() works for numeric and string values:

import pandas as pd
pd.isnull("A")
> False

pd.isnull(3)
> False

pd.isnull(np.nan)
> True

pd.isnull(None)
> True

edited Apr 13, 2022 at 8:08

user7864386

answered Jan 26, 2017 at 13:03

Yohan Obadia

2,6822 gold badges26 silver badges32 bronze badges

Add a comment |

S.A. · Accepted Answer · 2019-06-06 10:17:39Z

17

import numpy as np

mylist = [3, 4, 5, np.nan]
l = [x for x in mylist if ~np.isnan(x)]

This should remove all NaN. Of course, I assume that it is not a string here but actual NaN (np.nan).

edited Jun 6, 2019 at 10:17

S.A.

2,1711 gold badge25 silver badges40 bronze badges

answered Mar 22, 2018 at 4:34

Ajay Shah

4145 silver badges10 bronze badges

4

This gives me error: TypeError: ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''
– Zak Keirn
Commented Jan 9, 2019 at 23:46
1

Why not simply: x[~ np.isnan(x)] ? No list comprehension needed in numpy. Of course, I assume x is a numpy array.
– bue
Commented May 29, 2019 at 3:18
I assumed x is not going to be a numpy array as the question suggested.
– Ajay Shah
Commented May 29, 2019 at 5:41
1

It will expect float. Won't work on lists with strings @ZakKeirn
– quick_silver009
Commented Aug 4, 2020 at 11:08

Add a comment |

Aaron England · Accepted Answer · 2022-04-13 08:07:41Z

13

I like to remove missing values from a list like this:

import pandas as pd
list_no_nan = [x for x in list_with_nan if pd.notnull(x)]

edited Apr 13, 2022 at 8:07

user7864386

answered Nov 4, 2019 at 15:38

Aaron England

1,2731 gold badge14 silver badges34 bronze badges

Add a comment |

zhangxaochen · Accepted Answer · 2014-01-09 06:03:26Z

7

use numpy fancy indexing:

In [29]: countries=np.asarray(countries)

In [30]: countries[countries!='nan']
Out[30]: 
array(['USA', 'UK', 'France'], 
      dtype='|S6')

answered Jan 9, 2014 at 6:03

zhangxaochen

34.1k15 gold badges82 silver badges115 bronze badges

Add a comment |

Beyran11 · Accepted Answer · 2018-08-26 12:47:28Z

6

if you check for the element type

type(countries[1])

the result will be <class float> so you can use the following code:

[i for i in countries if type(i) is not float]

answered Aug 26, 2018 at 12:47

Beyran11

611 silver badge1 bronze badge

Add a comment |

Zisis F · Accepted Answer · 2021-04-05 09:32:15Z

5

A way to directly remove the nan value is:

import numpy as np    
countries.remove(np.nan)

answered Apr 5, 2021 at 9:32

Zisis F

3625 silver badges13 bronze badges

1

Keep in mind, that if the list contains more than one matching the specified value, only the first one is deleted by remove().
– bpelhos
Commented Oct 20, 2021 at 13:04

Add a comment |

Sorin Dragan · Accepted Answer · 2020-02-21 18:48:54Z

4

Another way to do it would include using filter like this:

countries = list(filter(lambda x: str(x) != 'nan', countries))

answered Feb 21, 2020 at 18:48

Sorin Dragan

5404 silver badges9 bronze badges

Add a comment |

user7864386user7864386 · Accepted Answer · 2022-01-25 22:39:22Z

3

If you have a list of items of different types and you want to filter out NaN, you can do the following:

import math
lst = [1.1, 2, 'string', float('nan'), {'di':'ct'}, {'set'}, (3, 4), ['li', 5]]
filtered_lst = [x for x in lst if not (isinstance(x, float) and math.isnan(x))]

Output:

[1.1, 2, 'string', {'di': 'ct'}, {'set'}, (3, 4), ['li', 5]]

answered Jan 25, 2022 at 22:39

user7864386

Add a comment |

Serial · Accepted Answer · 2014-01-09 04:52:57Z

2

In your example 'nan' is a string so instead of using isnan() just check for the string

like this:

cleanedList = [x for x in countries if x != 'nan']

answered Jan 9, 2014 at 4:52

Serial

8,04314 gold badges55 silver badges74 bronze badges

Add a comment |

Angelo · Accepted Answer · 2021-07-20 14:14:35Z

In my opinion most of the solutions suggested do not take into account performance. Loop for and list comprehension are not valid solutions if your list has many values. The solution below is more efficient in terms of computational time and it doesn't assume your list has numbers or strings.

import numpy as np
import pandas as pd
list_var = [np.nan, 4, np.nan, 20,3, 'test']
df = pd.DataFrame({'list_values':list_var})
list_var2 = list(df['list_values'].dropna())
print("\n* list_var2 = {}".format(list_var2))

ListenSoftware Louise Ai Agent · Accepted Answer · 2021-06-01 16:42:47Z

0

exclude 0 from the range list

['ret'+str(x) for x in list(range(-120,241,5)) if (x!=0) ]

answered Jun 1, 2021 at 16:42

ListenSoftware Louise Ai Agent

4,2732 gold badges30 silver badges38 bronze badges

Add a comment |

GenDemo · Accepted Answer · 2024-01-15 04:57:03Z

0

I had a similar problem to solve, and strangely none of the suggested above worked (python 3.7.9):

but this one did:

df['colA'] = df['colA'].apply(lambda x: [item for item in x if not pd.isna(item)])

answered Jan 15, 2024 at 4:57

GenDemo

7611 gold badge10 silver badges27 bronze badges

Add a comment |

sparrow · Accepted Answer · 2016-07-25 22:31:21Z

-1

I noticed that Pandas for example will return 'nan' for blank values. Since it's not a string you need to convert it to one in order to match it. For example:

ulist = df.column1.unique() #create a list from a column with Pandas which 
for loc in ulist:
    loc = str(loc)   #here 'nan' is converted to a string to compare with if
    if loc != 'nan':
        print(loc)

answered Jul 25, 2016 at 22:31

sparrow

11.5k12 gold badges60 silver badges76 bronze badges

Add a comment |

Sayed · Accepted Answer · 2022-09-16 12:58:46Z

-3

import numpy as np
countries=[x for x in countries if x is not np.nan]

answered Sep 16, 2022 at 12:58

Sayed

52 bronze badges

4

Welcome to Stack Overflow. Code is a lot more helpful when it is accompanied by an explanation. SO is about learning, not providing snippets to blindly copy and paste. This is particularly important when answering old questions with existing answers (this question is nearly 9 years old, and has 15 answers). Please edit your answer and explain how it answers the specific question being asked, and how it improves upon what is already here. See How to Answer.
– Chris
Commented Sep 18, 2022 at 18:04
Sorry, I picked the wrong reason in review queue audit, the suggested edit should be rejected as "clearly conflicts with author intent" instead. It's author's responsibility to make answer helpful, other people should write their own answers.
– STerliakov
Commented Jan 21, 2023 at 20:05

Add a comment |

Collectives™ on Stack Overflow

How can I remove Nan from list Python/NumPy

16 Answers 16

Old answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

16 Answers 16

Old answer

Linked

Related