removing characters from list elements in python

Question

I am extracting content with the help of scrapy into an array. Each element has the unwanted characters ": " inside which I would like to remove as efficient as possible.

v = response.xpath('//div[@id="tab"]/text()').extract()
>>> v
['Marke:', 'Modell:']
>>> for i in v : re.sub(r'[^\w]', '', i)
... 
'Marke'
'Modell'

Now that seems to work, but how can I retain the result? In my code, v hasn't changed:

>>> v
['Marke:', 'Modell:']

Maximilian Burszley · Accepted Answer · 2018-11-26 17:41:39Z

3

You can solve this with a list comprehension:

>>> v = response.xpath('//div[@id="tab"]/text()').extract()
>>>
>>> import re
>>> v = [re.sub(r'[^\w]', '', i) for i in v]
>>> v
['Marke', 'Modell']

answered Nov 26, 2018 at 17:41

Maximilian Burszley

19.8k7 gold badges38 silver badges66 bronze badges

Add a comment |

Prune · Accepted Answer · 2018-11-26 18:13:01Z

1

I think that pulling in regex for this is a little overkill: use the string replace method:

v = ['Marke:', 'Modell:']
v = [str.replace(':', '') for str in v]
print(v)

Output:

['Marke', 'Modell']

answered Nov 26, 2018 at 18:13

Prune

78k14 gold badges63 silver badges83 bronze badges

Add a comment |

Collectives™ on Stack Overflow

removing characters from list elements in python

2 Answers 2

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Related