How to count the frequency of the elements in an unordered list? [duplicate]

Question

Given an unordered list of values like

a = [5, 1, 2, 2, 4, 3, 1, 2, 3, 1, 1, 5, 2]

How can I get the frequency of each value that appears in the list, like so?

# `a` has 4 instances of `1`, 4 of `2`, 2 of `3`, 1 of `4,` 2 of `5`
b = [4, 4, 2, 1, 2] # expected output

Does this answer your question? How do I count the occurrences of a list item? — Alireza75
– Alireza75, Commented Jul 28, 2022 at 4:08
@Alireza How does it answer this question? This linked question is about counting a single, specific item from a list. This question asks to get the count of all elements in a list — Tomerikoo
– Tomerikoo, Commented Jul 28, 2022 at 7:33
@Tomerikoo see the 'user52028778' answer and just use Counter.values() — Alireza75
– Alireza75, Commented Jul 28, 2022 at 7:43

Karl Knechtel · Accepted Answer · 2022-07-28 03:51:19Z

655

In Python 2.7 (or newer), you can use collections.Counter:

>>> import collections
>>> a = [5, 1, 2, 2, 4, 3, 1, 2, 3, 1, 1, 5, 2]
>>> counter = collections.Counter(a)
>>> counter
Counter({1: 4, 2: 4, 5: 2, 3: 2, 4: 1})
>>> counter.values()
dict_values([2, 4, 4, 1, 2])
>>> counter.keys()
dict_keys([5, 1, 2, 4, 3])
>>> counter.most_common(3)
[(1, 4), (2, 4), (5, 2)]
>>> dict(counter)
{5: 2, 1: 4, 2: 4, 4: 1, 3: 2}
>>> # Get the counts in order matching the original specification,
>>> # by iterating over keys in sorted order
>>> [counter[x] for x in sorted(counter.keys())]
[4, 4, 2, 1, 2]

If you are using Python 2.6 or older, you can download an implementation here.

edited Jul 28, 2022 at 3:51

Karl Knechtel

61.7k14 gold badges136 silver badges195 bronze badges

answered Jan 29, 2010 at 13:02

unutbu

888k199 gold badges1.9k silver badges1.7k bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

Srivatsan Over a year ago

@unutbu: What if I have three lists, a,b,c for which a and b remain the same, but c changes? How to count the the value of c for which a and c are same?

unutbu Over a year ago

@Srivatsan: I don't understand the situation. Please post a new question where you can elaborate.

Pavan Over a year ago

Is there a way to extract the dictionary {1:4, 2:4, 3:2, 5:2, 4:1} from the counter object ?

unutbu Over a year ago

@Pavan: collections.Counter is a subclass of dict. You can use it in the same way you would a normal dict. If you really want a dict, however, you could convert it to a dict using dict(counter).

user Over a year ago

Is there a way to count values if the list is a set of co-ordinates? Say a = [(0,0),(0,1),(0,2),(1,0),(0,1)...] I need to get the frequency in place, preferably in another list

Karl Knechtel · Accepted Answer · 2022-07-28 03:45:02Z

174

If the list is sorted, you can use groupby from the itertools standard library (if it isn't, you can just sort it first, although this takes O(n lg n) time):

from itertools import groupby

a = [5, 1, 2, 2, 4, 3, 1, 2, 3, 1, 1, 5, 2]
[len(list(group)) for key, group in groupby(sorted(a))]

Output:

[4, 4, 2, 1, 2]

edited Jul 28, 2022 at 3:45

Karl Knechtel

61.7k14 gold badges136 silver badges195 bronze badges

answered Jan 29, 2010 at 12:18

Nadia Alramli

116k39 gold badges176 silver badges152 bronze badges

11 Comments

Eli Bendersky Over a year ago

nice, using groupby. I wonder about its efficiency vs. the dict approach, though

Evan Over a year ago

The python groupby creates new groups when the value it sees changes. In this case 1,1,1,2,1,1,1] would return [3,1,3]. If you expected [6,1] then just be sure to sort the data before using groupby.

Martijn Pieters Over a year ago

@CristianCiupitu: sum(1 for _ in group).

buhtz Over a year ago

This is not a solution. The output doesn't tell what was counted.

Eric Pauley Over a year ago

[(key, len(list(group))) for key, group in groupby(a)] or {key: len(list(group)) for key, group in groupby(a)} @buhtz

|

Amjith · Accepted Answer · 2012-03-16 20:44:01Z

119

Python 2.7+ introduces Dictionary Comprehension. Building the dictionary from the list will get you the count as well as get rid of duplicates.

>>> a = [1,1,1,1,2,2,2,2,3,3,4,5,5]
>>> d = {x:a.count(x) for x in a}
>>> d
{1: 4, 2: 4, 3: 2, 4: 1, 5: 2}
>>> a, b = d.keys(), d.values()
>>> a
[1, 2, 3, 4, 5]
>>> b
[4, 4, 2, 1, 2]

answered Mar 16, 2012 at 20:44

Amjith

24.3k14 gold badges47 silver badges39 bronze badges

9 Comments

stenci Over a year ago

It's faster using a set: {x:a.count(x) for x in set(a)}

Martijn Pieters Over a year ago

This is hugely inefficient. a.count() does a full traverse for each element in a, making this a O(N^2) quadradic approach. collections.Counter() is much more efficient because it counts in linear time (O(N)). In numbers, that means this approach will execute 1 million steps for a list of length 1000, vs. just 1000 steps with Counter(), 10^12 steps where only 10^6 are needed by Counter for a million items in a list, etc.

Martijn Pieters Over a year ago

@stenci: sure, but the horror of using a.count() completely dwarfs the efficiency of having used a set there.

stenci Over a year ago

@MartijnPieters one more reason to use it fewer times :)

Nzbuu Over a year ago

@DylanYoung, that's what collections.Counter does but better.

|

Karl Knechtel · Accepted Answer · 2022-07-28 04:00:58Z

53

Count the number of appearances manually by iterating through the list and counting them up, using a collections.defaultdict to track what has been seen so far:

from collections import defaultdict

appearances = defaultdict(int)

for curr in a:
    appearances[curr] += 1

edited Jul 28, 2022 at 4:00

Karl Knechtel

61.7k14 gold badges136 silver badges195 bronze badges

answered Jan 29, 2010 at 12:16

Idan K

21k11 gold badges66 silver badges83 bronze badges

3 Comments

hughdbrown Over a year ago

+1 for collections.defaultdict. Also, in python 3.x, look up collections.Counter. It is the same as collections.defaultdict(int).

Cristian Ciupitu Over a year ago

@hughdbrown, actually Counter can use multiple numeric types including float or Decimal, not just int.

Karl Knechtel Over a year ago

collections.Counter does much more, and is a much more specialized tool, than collections.defaultdict with a numeric value type. It has extra convenience functions, and conceptually models the idea that the values represent counts rather than just being arbitrary numbers.

YOU · Accepted Answer · 2010-01-29 13:00:53Z

38

In Python 2.7+, you could use collections.Counter to count items

>>> a = [1,1,1,1,2,2,2,2,3,3,4,5,5]
>>>
>>> from collections import Counter
>>> c=Counter(a)
>>>
>>> c.values()
[4, 4, 2, 1, 2]
>>>
>>> c.keys()
[1, 2, 3, 4, 5]

answered Jan 29, 2010 at 13:00

YOU

125k34 gold badges192 silver badges222 bronze badges

2 Comments

Jonathan Ray Over a year ago

Counter is much slower than the default dict, and the default dict is much slower than manual use of a dict.

wsaleem Over a year ago

@JonathanRay, not anymore, stackoverflow.com/a/27802189/1382487.

Idan K · Accepted Answer · 2010-01-29 12:21:30Z

32

Counting the frequency of elements is probably best done with a dictionary:

b = {}
for item in a:
    b[item] = b.get(item, 0) + 1

To remove the duplicates, use a set:

a = list(set(a))

edited Jan 29, 2010 at 12:21

Idan K

21k11 gold badges66 silver badges83 bronze badges

answered Jan 29, 2010 at 12:16

lindelof

35.5k32 gold badges103 silver badges144 bronze badges

5 Comments

S.Lott Over a year ago

@phkahler: Mine would only a tiny bit better than this. It's hardly worth my posting a separate answer when this can be improved with a small change. The point of SO is to get to the best answers. I could simply edit this, but I prefer to allow the original author a chance to make their own improvements.

user1532172 Over a year ago

@S.Lott The code is much cleaner without having to import defaultdict.

DylanYoung Over a year ago

Why not preinitialize b: b = {k:0 for k in a}?

Nzbuu Over a year ago

@DylanYoung, because then you have to scan the list twice. And there's unlikely to be any benefit in Python: but check this for yourself.

DylanYoung Over a year ago

The benefit is clean code :) Could use a defaultdict too of course, then you don't have to iterate through a

Evgenii Pavlov · Accepted Answer · 2017-07-23 18:17:54Z

25

You can do this:

import numpy as np
a = [1,1,1,1,2,2,2,2,3,3,4,5,5]
np.unique(a, return_counts=True)

Output:

(array([1, 2, 3, 4, 5]), array([4, 4, 2, 1, 2], dtype=int64))

The first array is values, and the second array is the number of elements with these values.

So If you want to get just array with the numbers you should use this:

np.unique(a, return_counts=True)[1]

edited Jul 23, 2017 at 18:17

answered Jul 23, 2017 at 18:11

Evgenii Pavlov

2513 silver badges4 bronze badges

Comments

rbento · Accepted Answer · 2022-07-28 04:19:01Z

22

Here's another succint alternative using itertools.groupby which also works for unordered input:

from itertools import groupby

items = [5, 1, 1, 2, 2, 1, 1, 2, 2, 3, 4, 3, 5]

results = {value: len(list(freq)) for value, freq in groupby(sorted(items))}

results

format: {value: num_of_occurencies}

{1: 4, 2: 4, 3: 2, 4: 1, 5: 2}

edited Jul 28, 2022 at 4:19

answered Mar 31, 2019 at 2:17

rbento

11.9k3 gold badges68 silver badges68 bronze badges

Comments

user2757762 · Accepted Answer · 2016-01-30 18:35:39Z

10

I would simply use scipy.stats.itemfreq in the following manner:

from scipy.stats import itemfreq

a = [1,1,1,1,2,2,2,2,3,3,4,5,5]

freq = itemfreq(a)

a = freq[:,0]
b = freq[:,1]

you may check the documentation here: http://docs.scipy.org/doc/scipy-0.16.0/reference/generated/scipy.stats.itemfreq.html

answered Jan 30, 2016 at 18:35

user2757762

Comments

sheldonzy · Accepted Answer · 2017-12-27 22:19:27Z

9

from collections import Counter
a=["E","D","C","G","B","A","B","F","D","D","C","A","G","A","C","B","F","C","B"]

counter=Counter(a)

kk=[list(counter.keys()),list(counter.values())]

pd.DataFrame(np.array(kk).T, columns=['Letter','Count'])

edited Dec 27, 2017 at 22:19

sheldonzy

6,04911 gold badges58 silver badges98 bronze badges

answered Dec 27, 2017 at 19:39

Anirban Lahiri

1071 silver badge3 bronze badges

2 Comments

Rahul Gupta Over a year ago

While this code snippet may be the solution, including an explanation really helps to improve the quality of your post. Remember that you are answering the question for readers in the future, and those people might not know the reasons for your code suggestion

Anirban Lahiri Over a year ago

Yes will do that Rahul Gupta

Karl Knechtel · Accepted Answer · 2022-07-28 03:59:20Z

8

Suppose we have a list:

fruits = ['banana', 'banana', 'apple', 'banana']

We can find out how many of each fruit we have in the list like so:

import numpy as np    
(unique, counts) = np.unique(fruits, return_counts=True)
{x:y for x,y in zip(unique, counts)}

Result:

{'banana': 3, 'apple': 1}

edited Jul 28, 2022 at 3:59

Karl Knechtel

61.7k14 gold badges136 silver badges195 bronze badges

answered Sep 21, 2020 at 17:05

jobima

5,9501 gold badge23 silver badges18 bronze badges

Comments

lprsd · Accepted Answer · 2010-01-29 12:24:18Z

6

seta = set(a)
b = [a.count(el) for el in seta]
a = list(seta) #Only if you really want it.

edited Jan 29, 2010 at 12:24

answered Jan 29, 2010 at 12:18

lprsd

88.1k48 gold badges142 silver badges169 bronze badges

3 Comments

Idan K Over a year ago

using lists count is ridiculously expensive and uncalled for in this scenario.

Kritika Rajain Over a year ago

@IdanK why count is expensive?

DylanYoung Over a year ago

@KritikaRajain For each unique element in the list you iterate over the whole list to generate a count (quadratic in the number of unique elements in the list). Instead, you can iterate over the list once and count up the number of each unique element (linear in the size of the list). If your list has only one unique element, the result will be the same. Moreover, this approach requires an additional intermediate set.

Corey Richey · Accepted Answer · 2015-06-30 18:39:12Z

5

This answer is more explicit

a = [1,1,1,1,2,2,2,2,3,3,3,4,4]

d = {}
for item in a:
    if item in d:
        d[item] = d.get(item)+1
    else:
        d[item] = 1

for k,v in d.items():
    print(str(k)+':'+str(v))

# output
#1:4
#2:4
#3:3
#4:2

#remove dups
d = set(a)
print(d)
#{1, 2, 3, 4}

answered Jun 30, 2015 at 18:39

Corey Richey

671 silver badge1 bronze badge

2 Comments

Abdul Salam Over a year ago

Good work, simple solution to implement occurrence count in dictionary.

MisterMiyagi Over a year ago

There is no need to use d.get(item) after checking if item in d: – both will check the exact same thing. Either use d[item] = d[item]+1 inside the if, or remove the if and use the single case of d[item] = d.get(item, 0) + 1.

t3rse · Accepted Answer · 2010-01-29 12:10:59Z

3

For your first question, iterate the list and use a dictionary to keep track of an elements existsence.

For your second question, just use the set operator.

answered Jan 29, 2010 at 12:10

t3rse

10.1k11 gold badges62 silver badges85 bronze badges

1 Comment

Bruce Over a year ago

Can you please elaborate on the first answer

user2422819 · Accepted Answer · 2016-12-06 16:50:55Z

3

def frequencyDistribution(data):
    return {i: data.count(i) for i in data}   

print frequencyDistribution([1,2,3,4])

...

 {1: 1, 2: 1, 3: 1, 4: 1}   # originalNumber: count

answered Dec 6, 2016 at 16:50

user2422819

18715 bronze badges

Comments

jax · Accepted Answer · 2019-01-14 15:18:47Z

3

I am quite late, but this will also work, and will help others:

a = [1,1,1,1,2,2,2,2,3,3,4,5,5]
freq_list = []
a_l = list(set(a))

for x in a_l:
    freq_list.append(a.count(x))


print 'Freq',freq_list
print 'number',a_l

will produce this..

Freq  [4, 4, 2, 1, 2]
number[1, 2, 3, 4, 5]

edited Jan 14, 2019 at 15:18

answered Oct 26, 2018 at 15:17

jax

4,22710 gold badges45 silver badges74 bronze badges

Comments

d.b · Accepted Answer · 2022-07-28 14:42:26Z

3

a = [1,1,1,1,2,2,2,2,3,3,4,5,5]
counts = dict.fromkeys(a, 0)
for el in a: counts[el] += 1
print(counts)
# {1: 4, 2: 4, 3: 2, 4: 1, 5: 2}

edited Jul 28, 2022 at 14:42

answered Oct 19, 2021 at 4:00

d.b

32.6k6 gold badges46 silver badges91 bronze badges

Comments

Sai Kiran · Accepted Answer · 2019-04-09 12:55:29Z

1

a = [1,1,1,1,2,2,2,2,3,3,4,5,5]

# 1. Get counts and store in another list
output = []
for i in set(a):
    output.append(a.count(i))
print(output)

# 2. Remove duplicates using set constructor
a = list(set(a))
print(a)

Set collection does not allow duplicates, passing a list to the set() constructor will give an iterable of totally unique objects. count() function returns an integer count when an object that is in a list is passed. With that the unique objects are counted and each count value is stored by appending to an empty list output
list() constructor is used to convert the set(a) into list and referred by the same variable a

Output

D:\MLrec\venv\Scripts\python.exe D:/MLrec/listgroup.py
[4, 4, 2, 1, 2]
[1, 2, 3, 4, 5]

edited Apr 9, 2019 at 12:55

answered Apr 9, 2019 at 12:00

Sai Kiran

617 bronze badges

Comments

oshaiken · Accepted Answer · 2019-04-18 23:32:45Z

1

Simple solution using a dictionary.

def frequency(l):
     d = {}
     for i in l:
        if i in d.keys():
           d[i] += 1
        else:
           d[i] = 1

     for k, v in d.iteritems():
        if v ==max (d.values()):
           return k,d.keys()

print(frequency([10,10,10,10,20,20,20,20,40,40,50,50,30]))

answered Apr 18, 2019 at 23:32

oshaiken

2,6701 gold badge19 silver badges27 bronze badges

2 Comments

DylanYoung Over a year ago

max(d.values()) will not change in the last loop. Don't compute it in the loop, compute it before the loop.

MisterMiyagi Over a year ago

This returns the most common item plus all unique items, not the count/frequency of items.

amrutha · Accepted Answer · 2015-07-10 09:31:01Z

0

#!usr/bin/python
def frq(words):
    freq = {}
    for w in words:
            if w in freq:
                    freq[w] = freq.get(w)+1
            else:
                    freq[w] =1
    return freq

fp = open("poem","r")
list = fp.read()
fp.close()
input = list.split()
print input
d = frq(input)
print "frequency of input\n: "
print d
fp1 = open("output.txt","w+")
for k,v in d.items():
fp1.write(str(k)+':'+str(v)+"\n")
fp1.close()

answered Jul 10, 2015 at 9:31

amrutha

91 bronze badge

Comments

Veera Balla Deva · Accepted Answer · 2018-01-26 09:26:04Z

0

from collections import OrderedDict
a = [1,1,1,1,2,2,2,2,3,3,4,5,5]
def get_count(lists):
    dictionary = OrderedDict()
    for val in lists:
        dictionary.setdefault(val,[]).append(1)
    return [sum(val) for val in dictionary.values()]
print(get_count(a))
>>>[4, 4, 2, 1, 2]

To remove duplicates and Maintain order:

list(dict.fromkeys(get_count(a)))
>>>[4, 2, 1]

answered Jan 26, 2018 at 9:26

Veera Balla Deva

7886 silver badges20 bronze badges

Comments

roberto · Accepted Answer · 2018-01-27 09:37:28Z

0

i'm using Counter to generate a freq. dict from text file words in 1 line of code

def _fileIndex(fh):
''' create a dict using Counter of a
flat list of words (re.findall(re.compile(r"[a-zA-Z]+"), lines)) in (lines in file->for lines in fh)
'''
return Counter(
    [wrd.lower() for wrdList in
     [words for words in
      [re.findall(re.compile(r'[a-zA-Z]+'), lines) for lines in fh]]
     for wrd in wrdList])

answered Jan 27, 2018 at 9:37

roberto

5856 silver badges5 bronze badges

Comments

jferard · Accepted Answer · 2019-03-26 17:52:39Z

For the record, a functional answer:

>>> L = [1,1,1,1,2,2,2,2,3,3,4,5,5]
>>> import functools
>>> >>> functools.reduce(lambda acc, e: [v+(i==e) for i, v in enumerate(acc,1)] if e<=len(acc) else acc+[0 for _ in range(e-len(acc)-1)]+[1], L, [])
[4, 4, 2, 1, 2]

It's cleaner if you count zeroes too:

>>> functools.reduce(lambda acc, e: [v+(i==e) for i, v in enumerate(acc)] if e<len(acc) else acc+[0 for _ in range(e-len(acc))]+[1], L, [])
[0, 4, 4, 2, 1, 2]

An explanation:

we start with an empty acc list;
if the next element e of L is lower than the size of acc, we just update this element: v+(i==e) means v+1 if the index i of acc is the current element e, otherwise the previous value v;
if the next element e of L is greater or equals to the size of acc, we have to expand acc to host the new 1.

The elements do not have to be sorted (itertools.groupby). You'll get weird results if you have negative numbers.

Abhishek Poojary · Accepted Answer · 2019-09-29 16:38:05Z

0

Another approach of doing this, albeit by using a heavier but powerful library - NLTK.

import nltk

fdist = nltk.FreqDist(a)
fdist.values()
fdist.most_common()

answered Sep 29, 2019 at 16:38

Abhishek Poojary

81910 silver badges13 bronze badges

Comments

Abhishek Poojary · Accepted Answer · 2020-04-23 17:36:39Z

0

Found another way of doing this, using sets.

#ar is the list of elements
#convert ar to set to get unique elements
sock_set = set(ar)

#create dictionary of frequency of socks
sock_dict = {}

for sock in sock_set:
    sock_dict[sock] = ar.count(sock)

answered Apr 23, 2020 at 17:36

Abhishek Poojary

81910 silver badges13 bronze badges

Comments

Luigi Tiburzi · Accepted Answer · 2020-09-17 15:49:25Z

0

For an unordered list you should use:

[a.count(el) for el in set(a)]

The output is

[4, 4, 2, 1, 2]

answered Sep 17, 2020 at 15:49

Luigi Tiburzi

4,3457 gold badges35 silver badges43 bronze badges

1 Comment

MisterMiyagi Over a year ago

Note that sets do not preserve order. As a result, the positions in the list and thus the meaning of the contained counts are completely arbitrary wrt the actual items.

Reza Abtin · Accepted Answer · 2015-12-20 08:17:38Z

-1

Yet another solution with another algorithm without using collections:

def countFreq(A):
   n=len(A)
   count=[0]*n                     # Create a new list initialized with '0'
   for i in range(n):
      count[A[i]]+= 1              # increase occurrence for value A[i]
   return [x for x in count if x]  # return non-zero count

answered Dec 20, 2015 at 8:17

Reza Abtin

2153 silver badges8 bronze badges

Comments

chandan anand · Accepted Answer · 2017-09-03 18:43:08Z

-1

num=[3,2,3,5,5,3,7,6,4,6,7,2]
print ('\nelements are:\t',num)
count_dict={}
for elements in num:
    count_dict[elements]=num.count(elements)
print ('\nfrequency:\t',count_dict)

edited Sep 3, 2017 at 18:43

user6655984

answered Sep 3, 2017 at 18:40

chandan anand

71 bronze badge

2 Comments

Erik A Over a year ago

Please don't post code-only answers but clarify your code, especially when a question already has a valid answer.

ciskoh Over a year ago

this is one of the slowest way you can do it

Varun Shaandhesh · Accepted Answer · 2017-10-30 14:49:28Z

-1

You can use the in-built function provided in python

l.count(l[i])


  d=[]
  for i in range(len(l)):
        if l[i] not in d:
             d.append(l[i])
             print(l.count(l[i])

The above code automatically removes duplicates in a list and also prints the frequency of each element in original list and the list without duplicates.

Two birds for one shot ! X D

answered Oct 30, 2017 at 14:49

Varun Shaandhesh

771 silver badge10 bronze badges

Comments

Namrata Tolani · Accepted Answer · 2017-12-06 18:44:18Z

-1

This approach can be tried if you don't want to use any library and keep it simple and short!

a = [1,1,1,1,2,2,2,2,3,3,4,5,5]
marked = []
b = [(a.count(i), marked.append(i))[0] for i in a if i not in marked]
print(b)

o/p

[4, 4, 2, 1, 2]

answered Dec 6, 2017 at 18:44

Namrata Tolani

8619 silver badges13 bronze badges

Collectives™ on Stack Overflow

How to count the frequency of the elements in an unordered list? [duplicate]

32 Answers 32

5 Comments

11 Comments

9 Comments

3 Comments

2 Comments

5 Comments

Comments

Comments

Comments

2 Comments

Comments

3 Comments

2 Comments

1 Comment

Comments

Comments

Comments

Comments

2 Comments

Comments

Comments

Comments

Comments

Comments

Comments

1 Comment

Comments

2 Comments

Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

32 Answers 32

5 Comments

11 Comments

9 Comments

3 Comments

2 Comments

5 Comments

Comments

Comments

Comments

2 Comments

Comments

3 Comments

2 Comments

1 Comment

Comments

Comments

Comments

Comments

2 Comments

Comments

Comments

Comments

Comments

Comments

Comments

1 Comment

Comments

2 Comments

Comments

Comments

Linked

Related