Extract Excel columns into a Python array

Question

I want to extract Excel columns (NOT rows) into a Python array of arrays. It has to be arrays, not dictionaries.

The excel file looks like this:

     A    B    C
1   123  534  576
2   456  745  345
3   234  765  285

I want to bring this into python in the following format:

[[123,534,576],[456,745,345],[234,765,285]]

How would I do this? Thank you

@wnnmaw I did and I also looked at numpy, but I don't know how to do columns. I am able to do only rows.
– user1681664, Commented Mar 20, 2014 at 18:52
@user1681664, what have you tried with xlrd to pull the columns?
– wnnmaw, Commented Mar 20, 2014 at 18:54
Surely columns would be: [[123, 456, 234], [534, 745, 765], [576, 345, 285]] - either way, using xlrd it's row_values or col_values - the documentation is fairly simple to follow...
– Jon Clements, Commented Mar 20, 2014 at 18:55

Guillaume Jacquenot · Accepted Answer · 2018-12-19 15:22:42Z

13

Here's a yet simpler approach:

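import xlrd

# Note: xlrd 2.0+ reads only .xls files; .xlsx needs an older xlrd or another library.
book = xlrd.open_workbook('your.xlsx')
sheet = book.sheet_by_name('example')
# One inner list per row, each holding that row's cell values.
data = [[sheet.cell_value(r, c) for c in range(sheet.ncols)] for r in range(sheet.nrows)]
print(data)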
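
If one list per column is wanted instead of one per row (the distinction raised in the comments under the question), xlrd's `Sheet.col_values` can be used in the same way. This is only a minimal sketch: `FakeSheet` is a hypothetical stand-in so the snippet runs without a workbook, and `columns_from_sheet` is an illustrative helper name. With a real file you would pass the object returned by `book.sheet_by_name(...)` instead.

```python
def columns_from_sheet(sheet):
    """Return one list per column, using xlrd-style ncols / col_values."""
    return [sheet.col_values(c) for c in range(sheet.ncols)]


class FakeSheet:
    """Hypothetical stand-in mimicking the two xlrd Sheet members used above."""

    def __init__(self, rows):
        self._rows = rows
        self.ncols = len(rows[0]) if rows else 0

    def col_values(self, colx):
        # Values of column `colx`, top to bottom.
        return [row[colx] for row in self._rows]


# Sample values from the question, standing in for a real worksheet.
sheet = FakeSheet([[123, 534, 576], [456, 745, 345], [234, 765, 285]])
columns = columns_from_sheet(sheet)
print(columns)
```

With a real worksheet, xlrd typically returns numeric cells as floats (for example 123.0), so an `int(...)` conversion may be needed.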

edited Dec 19, 2018 at 15:22

Guillaume Jacquenot


answered Apr 23, 2016 at 18:07

Imad Salimi


1 Comment

Zac1 Over a year ago

It'd be great if you can also add how to retrieve a particular cell value from data. Thanks.
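
On retrieving a particular cell: `data` is a list of rows, so `data[r][c]` is the cell at zero-based row `r` and column `c`. A small sketch, with the question's sample values typed in by hand in place of a real worksheet:

```python
# Sample values from the question, standing in for the list built from the sheet.
data = [[123, 534, 576], [456, 745, 345], [234, 765, 285]]

cell = data[1][2]                        # second row, third column (Excel cell C2)
second_row = data[1]                     # a whole row
third_column = [row[2] for row in data]  # a whole column

print(cell, second_row, third_column)
```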

Nick is tired · Accepted Answer · 2021-01-27 11:07:26Z

2

If you're following the comments above and looking into the xlrd package, can you try this and see if it works?

(based on what I found here: http://www.youlikeprogramming.com/2012/03/examples-reading-excel-xls-documents-using-pythons-xlrd/)

import xlrd

workbook = xlrd.open_workbook('my_workbook.xls')
worksheet = workbook.sheet_by_name('Sheet1')

# Collect every row (including the last one) as its own list of cell values.
row_array = []
for curr_row in range(worksheet.nrows):
    row_array.append(worksheet.row_values(curr_row))

print(row_array)

edited Jan 27, 2021 at 11:07

Nick is tired


answered Mar 20, 2014 at 19:09

David B.


2 Comments

Marichyasana Over a year ago

Does it work for Excel 2010 also? File type is "xlsx" or "xlsm"?

David B. Over a year ago

Good question. Not sure to be honest.

wwii · Accepted Answer · 2014-03-20 19:21:00Z

1

Use xlrd to load the data row-wise, then use zip to transpose it.

>>> a = [[1,2,3],[4,5,6],[7,8,9]]
>>> list(zip(*a))  # Python 3: zip returns an iterator, so wrap it in list()
[(1, 4, 7), (2, 5, 8), (3, 6, 9)]

Alternatively, load the data row-wise with xlrd, build a numpy array from it, then transpose it.

>>> import numpy
>>> a = [[1,2,3],[4,5,6],[7,8,9]]
>>> z = numpy.array(a)
>>> z
array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])
>>> z.transpose()
array([[1, 4, 7],
       [2, 5, 8],
       [3, 6, 9]])
>>>
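
Since the question asks for lists rather than tuples, each tuple produced by `zip` can be converted back to a list. A short sketch for Python 3, using the question's sample values in place of data loaded with xlrd:

```python
rows = [[123, 534, 576], [456, 745, 345], [234, 765, 285]]

# zip(*rows) groups the i-th element of every row; list() turns each tuple into a list.
columns = [list(col) for col in zip(*rows)]
print(columns)
```

Note that `zip` stops at the shortest row, so rows of unequal length would be silently truncated.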

edited Mar 20, 2014 at 19:21

answered Mar 20, 2014 at 19:04

wwii


Guillaume Jacquenot · Accepted Answer · 2018-12-19 15:23:01Z

0

I figured it out.

import csv

# Read every row of the CSV export into a list of lists.
with open("temp.csv", newline="") as f:
    arr = [row for row in csv.reader(f)]

print(arr[:22])  # adjust as needed
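
One thing to keep in mind with the CSV route: `csv.reader` returns every field as a string, so numeric columns need converting. A sketch, assuming the sheet was saved as CSV and contains only integer cells; `io.StringIO` stands in for the real `temp.csv` here:

```python
import csv
import io

# Stand-in for open("temp.csv", newline=""); contents mirror the question's sheet.
fake_file = io.StringIO("123,534,576\n456,745,345\n234,765,285\n")

# Convert each field from str to int while building the list of rows.
rows = [[int(value) for value in row] for row in csv.reader(fake_file)]
print(rows)
```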

edited Dec 19, 2018 at 15:23

Guillaume Jacquenot


answered Mar 21, 2014 at 3:48

user1681664


Guillaume Jacquenot · Accepted Answer · 2018-12-19 15:23:31Z

0

import csv

csv_path = "path/to/file.csv"  # placeholder: insert your file path here

array = []
with open(csv_path, newline="") as fin:
    reader = csv.reader(fin)
    for row in reader:
        # Keep the first three columns of each row.
        arr = [row[i] for i in range(3)]
        array.append(arr)

edited Dec 19, 2018 at 15:23

Guillaume Jacquenot


answered Mar 20, 2014 at 19:11

user3245033


Tushar Kale · Accepted Answer · 2020-12-04 12:04:36Z

0

import csv

result_array = []
# Use a context manager so the file is closed after reading.
with open("temp.csv", "r", newline="") as csv_file:
    for row_index, row in enumerate(csv.reader(csv_file)):
        if row_index != 0:  # skip the header row with the column names
            result_array.append(row)
print(result_array)

answered Dec 4, 2020 at 12:04

Tushar Kale


1 Comment

Elletlar Over a year ago

Hi Tushar. Could you please add an explanation to all your answers, even if it is only a brief one? Also, there are already a lot of answers to this question. Why do we need another one? There is no explanation of why this solution should be considered over the others. See How to Answer. Kind regards.
