Convert ASCII string encoded array into string?

Question

Input:

'0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'

Output:

Hello!

Current Solution:

    s = []
    birary_data = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'.replace(' ', '').split('0x')
    for c in birary_data:
        if len(c) > 1:
            s.append(bytes.fromhex(c).decode('utf-8', 'ignore'))
    print("".join(s))

Need help with:

Could anyone suggest a more elegant solution, please?

@MauriceMeyer, it feels like I'm doing extra and unnecessary steps. However, there could a very simple solution, like birary_data.decode('hex') — Gооd_Mаn
– Gооd_Mаn, Commented Mar 5, 2020 at 11:41

Shubham Sharma · Accepted Answer · 2020-03-05 11:50:19Z

4

Try this:

data = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'
string = "".join([chr(int(item, 16)) for item in data.split()])
print(string)

Output:

Hello!

answered Mar 5, 2020 at 11:50

Shubham Sharma

71.8k6 gold badges26 silver badges58 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Nullman Over a year ago

I always forget that int can take a base, this is a good solution

Jon Betts Over a year ago

Doesn't this have a leading null byte and something else before the last char?

Guy · Accepted Answer · 2020-03-05 12:08:51Z

4

Another option is to remove 3 characters (or less) length substrings, 0x and white spaces. bytes.fromhex can handle a string like '48656c6c6f8E21'

binary_data = '0X0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'
binary_data = re.sub(r'\b\w{3}\b|\s?0x', '', binary_data)
print(bytes.fromhex(binary_data).decode('utf-8', 'ignore'))

edited Mar 5, 2020 at 12:08

answered Mar 5, 2020 at 12:01

Guy

51.2k10 gold badges49 silver badges96 bronze badges

1 Comment

Nullman Over a year ago

why not change your pattern to r'\b\w{3}\b|\s?0x' to get '48656c6c6f8E21' directly?

Akhil Pathania · Accepted Answer · 2020-03-05 11:55:09Z

3

You can use the below one Here in the code i am first splitting the hex according to white-space and then iterating and joining the character i get.

a = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'
print(''.join(chr(int(i, 16)) for i in a.split()))

answered Mar 5, 2020 at 11:55

Akhil Pathania

7522 gold badges6 silver badges20 bronze badges

Comments

Jon Betts · Accepted Answer · 2020-03-05 12:06:21Z

The builtin bytes.fromhex() is very nearly all we need. There are however two problems we need to get around:

The null byte at the front
The invalid char in position 6 (0x8E)

import re

data = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'

string = bytes.fromhex(re.sub('0x(0 )?', '', data)).decode('utf-8', 'ignore')

The regex will take care of both stripping the null byte and formatting the string correctly for bytes.fromhex(). The ignore in the decode will skip the bad byte.

Joan Lara · Accepted Answer · 2020-03-05 12:02:40Z

2

birary_data = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'.replace('0x', '').split()
print(bytearray.fromhex(''.join(c for c in birary_data if len(c) > 1)).decode('utf-8', 'ignore'))

Output:

Hello!

edited Mar 5, 2020 at 12:02

answered Mar 5, 2020 at 11:36

Joan Lara

1,3978 silver badges15 bronze badges

Collectives™ on Stack Overflow

Convert ASCII string encoded array into string?

5 Answers 5

2 Comments

1 Comment

Comments

Comments

Comments

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

2 Comments

1 Comment

Comments

Comments

Comments

Related