Numpy - assign column data types (dtype) to existing array

Question

I have a given array:

array = [(u'Andrew', -3, 3, 100.032) (u'Bob', -4, 4, 103.323) (u'Joe', -5, 5, 154.324)]

It is generated by another process (which I cannot control) that reads a CSV table and outputs this NumPy array. I now need to assign dtypes to the columns so I can do further analysis.

How can I do this?

Thank you

what is this question -1?

– code base 5000, Commented Aug 27, 2014 at 9:48

jrjc · Accepted Answer · 2014-07-21 13:15:44Z

Is this what you need?

import numpy as np

# Build a structured array: one (field name, type) pair per column.
new_array = np.array(array, dtype=[("name", object),
                                   ("N1", int),
                                   ("N2", int),
                                   ("N3", float)])

Here "name", "N1", "N2" and "N3" are field names I chose; use whatever names suit your columns.

It gives:

array([(u'Andrew', -3, 3, 100.032), (u'Bob', -4, 4, 103.323),
       (u'Joe', -5, 5, 154.324)], 
      dtype=[('name', 'O'), ('N1', '<i8'), ('N2', '<i8'), ('N3', '<f8')])
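The call above expects a sequence of tuples. The question does not show the real dtype of the incoming array, so the following is only a sketch under the assumption that the upstream process returns a 2-D object array with one row per record (the variable names `raw` and `typed` are made up for illustration). Each row is converted to a tuple before the structured dtype is applied:

```python
import numpy as np

# Assumption: the upstream process returns a 2-D object array, one row per
# record. The question does not show the actual dtype.
raw = np.array([["Andrew", -3, 3, 100.032],
                ["Bob", -4, 4, 103.323],
                ["Joe", -5, 5, 154.324]], dtype=object)

# Field names as chosen in this answer.
dtype = [("name", object), ("N1", int), ("N2", int), ("N3", float)]

# A structured array is built from tuples, so convert each row first.
typed = np.array([tuple(row) for row in raw], dtype=dtype)

n1 = typed["N1"]              # integer column, selected by field name
n3_mean = typed["N3"].mean()  # float column supports numeric reductions
```

Once the fields are typed, each column can be pulled out by name and used like an ordinary 1-D array.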

You can sort on "N1", for instance (this sorts the array in place):

new_array.sort(order="N1")
new_array
array([(u'Joe', -5, 5, 154.324), (u'Bob', -4, 4, 103.323),
       (u'Andrew', -3, 3, 100.032)], 
      dtype=[('name', 'O'), ('N1', '<i8'), ('N2', '<i8'), ('N3', '<f8')])
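Because `sort` reorders the array in place, the original row order is lost. If that matters, one possible alternative is to index with `np.argsort` on a single field, which returns a reordered copy. This is a sketch, not part of the original answer; the names `by_n1` and `by_n3_desc` are illustrative:

```python
import numpy as np

rows = [("Andrew", -3, 3, 100.032), ("Bob", -4, 4, 103.323), ("Joe", -5, 5, 154.324)]
dtype = [("name", object), ("N1", int), ("N2", int), ("N3", float)]
new_array = np.array(rows, dtype=dtype)

# argsort on one column gives the row order; indexing with it returns a
# reordered copy and leaves new_array untouched.
by_n1 = new_array[np.argsort(new_array["N1"])]

# Reverse the index array to get descending order on another field.
by_n3_desc = new_array[np.argsort(new_array["N3"])[::-1]]
```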

Hope this helps.

Stefan · Accepted Answer · 2014-07-21 13:22:32Z


recarr = np.rec.fromrecords(array)

Optionally set field names:

recarr = np.rec.fromrecords(array, names="name, idata, idata2, fdata")
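As a rough illustration of what this returns, assuming the input is a list of tuples like the one in the question: `np.rec.fromrecords` infers a type for each column from the data, and the resulting record array exposes fields both as attributes and by key. The variables `total` and `largest` are only there for the example:

```python
import numpy as np

rows = [("Andrew", -3, 3, 100.032), ("Bob", -4, 4, 103.323), ("Joe", -5, 5, 154.324)]

recarr = np.rec.fromrecords(rows, names="name, idata, idata2, fdata")

# Column types are inferred from the data; inspect them rather than assuming.
print(recarr.dtype)

# Fields are available as attributes as well as by key.
total = recarr.idata.sum()
largest = recarr["fdata"].max()
```

If the inferred types are not the ones you want, the explicit structured dtype from the other answer gives you full control.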

edited Jul 21, 2014 at 13:22

answered Jul 21, 2014 at 13:17

