Regular Expressions, using search in Python

Question

I want to decode some hex in Python.

Part of the string is \xcd\xed\xb0\xb2:

    text = re.search(r'(\\x\w{2}){4}', rtf)

    unicodeText = text.decode('gb2312')

Error: '_sre.SRE_Match' object has no attribute 'decode'

Hope someone can help. Thanks.

Why do you need to use regex? Can't you decode the whole string?
– Germano, commented Sep 15, 2014 at 13:12

falsetru · Accepted Answer · 2014-09-15 13:22:36Z

re.search returns a Match object, not a matched string.

Use the group method to get the matched string.

>>> rtf = r'\xcd\xed\xb0\xb2'
>>> matched = re.search(r'(\\x\w{2}){4}', rtf)
>>> text = matched.group()
>>> text.decode('string-escape').decode('gb2312')
u'\u665a\u5b89'

# In Python 3.x
# >>> text.encode().decode('unicode-escape').encode('latin1').decode('gb2312')
# '晚安'
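As a minimal Python 3 sketch of the point above: `re.search` returns either a Match object or `None`, and only `group()` gives you the matched `str`. The pattern and the `rtf` value are the ones from the question; the `None` check is an addition for illustration.

```python
import re

rtf = r'\xcd\xed\xb0\xb2'  # raw string: literal backslashes, not bytes

# Same pattern as in the question.
matched = re.search(r'(\\x\w{2}){4}', rtf)

if matched is None:
    # re.search returns None when nothing matches, so guard before using it.
    text = None
else:
    # group() with no arguments returns the whole matched substring as a str.
    text = matched.group()
```

Calling `.decode` on `matched` itself fails because the Match object only describes where the match happened; it is not the matched text.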

BTW, you don't need a regular expression here; what you want is to convert the \xNN escapes:

Python 2.x:

>>> rtf = r'\xcd\xed\xb0\xb2'
>>> rtf.decode('string-escape').decode('gb2312')
u'\u665a\u5b89'
>>> print rtf.decode('string-escape').decode('gb2312')
晚安

Python 3.x:

>>> rtf = r'\xcd\xed\xb0\xb2'
>>> rtf.encode().decode('unicode-escape').encode('latin1').decode('gb2312')
'晚安'
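In case the Python 3 one-liner looks opaque, here is the same chain split into steps. This is only an unrolled sketch of the line above; the intermediate variable names are made up for illustration.

```python
rtf = r'\xcd\xed\xb0\xb2'

# 1. str -> bytes: the ASCII characters backslash, 'x', 'c', 'd', and so on.
escaped_bytes = rtf.encode()

# 2. Interpret the \xNN escapes; each one becomes a code point in U+0000..U+00FF.
code_points = escaped_bytes.decode('unicode-escape')

# 3. latin1 maps code points 0..255 to the same byte values,
#    which recovers the raw GB2312-encoded bytes.
raw_bytes = code_points.encode('latin1')

# 4. Decode the raw bytes with the actual encoding.
decoded = raw_bytes.decode('gb2312')
```

The `latin1` round trip in step 3 is needed because `unicode-escape` produces a `str`, while `gb2312` decoding needs `bytes`.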

edited Sep 15, 2014 at 13:22

answered Sep 15, 2014 at 13:09

falsetru


8 Comments

James Garner Over a year ago

That returns "'str' object has no attribute 'decode'"

falsetru Over a year ago

@JamesGarner, I just updated the answer. If you use Python 3.x, try the code in the comment.

James Garner Over a year ago

Perfect, thanks! I will mark correct as soon as I can

falsetru Over a year ago

@JamesGarner, I updated the answer again. In short, you don't need to use a regular expression.

James Garner Over a year ago

The reason I use the regex is that I have multiple languages. Should that not matter?
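For the mixed-content case raised in the last comment, one possible approach (a sketch, not something from the answer) is to let `re.sub` find each run of `\xNN` escapes and decode only that run. The helper name `decode_escaped_runs` and the sample string are hypothetical, and the caller is assumed to already know which encoding applies; the regex cannot detect the language by itself.

```python
import re

# One or more consecutive \xNN escapes, written literally in the text.
ESCAPE_RUN = re.compile(r'(?:\\x[0-9a-fA-F]{2})+')

def decode_escaped_runs(text, encoding):
    """Hypothetical helper: decode each run of \\xNN escapes with `encoding`."""
    def replace(match):
        # Strip the literal "\x" prefixes, leaving only the hex digits.
        hex_digits = match.group().replace('\\x', '')
        # Turn the hex digits into raw bytes, then decode them.
        return bytes.fromhex(hex_digits).decode(encoding)
    # Text outside the escape runs is left untouched.
    return ESCAPE_RUN.sub(replace, text)

sample = r'greeting: \xcd\xed\xb0\xb2!'  # made-up input for illustration
result = decode_escaped_runs(sample, 'gb2312')
```

If different runs use different encodings, the same pattern still isolates the runs, but you would need some external hint to choose the codec for each one.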
