Python Pandas: Get dataframe.value_counts() result as list

Question

I have a DataFrame and I want to get both the group names and the corresponding group counts as a list or NumPy array. However, when I convert the output to a matrix I only get the group counts, not the names, as in the example below:

  import pandas as pd

  df = pd.DataFrame({'a': [0.5, 0.4, 5, 0.4, 0.5, 0.6]})
  b = df['a'].value_counts()
  print(b)

output:

0.4    2
0.5    2
0.6    1
5.0    1
Name: a, dtype: int64

What I tried is print([b.as_matrix()]). Output:

[array([2, 2, 1, 1])]

In this case I lose the corresponding group names, which I also need. Thank you.

Arya McCarthy · Accepted Answer · 2017-05-28 21:03:11Z

11

Convert it to a dict:

bd = dict(b)
print(bd)
# {0.40000000000000002: 2, 0.5: 2, 0.59999999999999998: 1, 5.0: 1}

Don't worry about the long decimals. They're just a result of floating point representation; you still get what you expect from the dict.

bd[0.4]
# 2
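If two separate lists are more convenient than a dict, the same mapping can be split apart. A minimal sketch, assuming the example frame from the question; the names `names` and `counts` are only illustrative:

```python
import pandas as pd

df = pd.DataFrame({'a': [0.5, 0.4, 5, 0.4, 0.5, 0.6]})
b = df['a'].value_counts()

# Series.to_dict() maps each index label (group name) to its count.
bd = b.to_dict()

# Split the mapping into two parallel lists; dicts preserve insertion
# order, so names[i] still belongs to counts[i].
names = list(bd.keys())
counts = list(bd.values())
```

Here `names` holds the distinct values of column `a` and `counts` holds how often each one occurs.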

edited May 28, 2017 at 21:03

answered May 28, 2017 at 20:16

Arya McCarthy


Krissh · Accepted Answer · 2018-12-10 10:10:17Z

4

The simplest way, if only the counts are needed:

list(df['a'].value_counts())
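To keep the group names alongside the counts, the index can be converted the same way. This is a sketch on the question's example data; `names`, `counts` and `pairs` are illustrative names, not part of the original answer:

```python
import pandas as pd

df = pd.DataFrame({'a': [0.5, 0.4, 5, 0.4, 0.5, 0.6]})
b = df['a'].value_counts()

names = b.index.tolist()   # group names (the distinct values in column 'a')
counts = b.tolist()        # occurrences of each name, in the same order

# Pair them up if a single list of (name, count) tuples is preferred.
pairs = list(zip(names, counts))
```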

answered Dec 10, 2018 at 10:10

Krissh


Divakar · Accepted Answer · 2017-05-28 20:51:55Z

2

One approach with np.unique -

np.c_[np.unique(df.a, return_counts=1)]

Sample run -

In [270]: df
Out[270]: 
     a
0  0.5
1  0.4
2  5.0
3  0.4
4  0.5
5  0.6

In [271]: np.c_[np.unique(df.a, return_counts=1)]
Out[271]: 
array([[ 0.4,  2. ],
       [ 0.5,  2. ],
       [ 0.6,  1. ],
       [ 5. ,  1. ]])
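np.c_ stacks the values and the counts into a single float array, which is why the counts print as 2. rather than 2 above. If the integer counts matter, one option is to keep the two arrays separate. A minimal sketch, assuming the question's example data (the `pairs` name is illustrative):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'a': [0.5, 0.4, 5, 0.4, 0.5, 0.6]})

# np.unique returns the sorted distinct values and, with
# return_counts=True, a matching integer array of occurrences.
values, counts = np.unique(df['a'].to_numpy(), return_counts=True)

# Convert to plain Python types for a list of (name, count) tuples.
pairs = [(float(v), int(c)) for v, c in zip(values, counts)]
```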

We can zip the outputs from np.unique for list output (on Python 3, wrap zip in list() to materialize the result) -

In [283]: list(zip(*np.unique(df.a, return_counts=1)))
Out[283]: [(0.40000000000000002, 2), (0.5, 2), (0.59999999999999998, 1), (5.0, 1)]

Or use zip directly on the value_counts() output -

In [338]: b = df['a'].value_counts()

In [339]: list(zip(b.index, b.values))
Out[339]: [(0.40000000000000002, 2), (0.5, 2), (0.59999999999999998, 1), (5.0, 1)]

edited May 28, 2017 at 20:51

answered May 28, 2017 at 20:15

Divakar
