Keep duplicates in a list in Python

Question

I know this is probably an easy answer but I can't figure it out. What is the best way in Python to keep the duplicates in a list:

x = [1,2,2,2,3,4,5,6,6,7]

The output should be:

[2,6]

I found this link: Find (and keep) duplicates of sublist in python, but I'm still relatively new to Python and I can't get it to work for a simple list.

Post the code you are having trouble with, otherwise this is a complete retread of that other question. — Steven Rumbalski, Commented Apr 4, 2013 at 13:28
@StevenRumbalski: Not precisely, the other question is also flattening nested lists at the same time. — MattH, Commented Apr 4, 2013 at 13:31

mgilson · Accepted Answer · 2013-04-04 13:55:51Z

16

I'd use a collections.Counter:

from collections import Counter
x = [1, 2, 2, 2, 3, 4, 5, 6, 6, 7]
counts = Counter(x)
output = [value for value, count in counts.items() if count > 1]

Here's another version which keeps the order of when the item was first duplicated that only assumes that the sequence passed in contains hashable items and it will work back to when set or yeild was introduced to the language (whenever that was).

def keep_dupes(iterable):
    seen = set()
    dupes = set()
    for x in iterable:
        if x in seen and x not in dupes:
            yield x
            dupes.add(x)
        else:
            seen.add(x)

print list(keep_dupes([1,2,2,2,3,4,5,6,6,7]))

edited Apr 4, 2013 at 13:55

answered Apr 4, 2013 at 13:27

mgilson

311k70 gold badges655 silver badges718 bronze badges

However you lose the order of the elements in the output.
– Jochen Ritzel
Commented Apr 4, 2013 at 13:30
Yep. There are a lot of situations where this isn't the best way to go. It also requires the input be hashable... But, it's O(n) even for un-sorted lists which is nice.
– mgilson
Commented Apr 4, 2013 at 13:32
The shortest ordered variant I can think of offhand is [k for k in OrderedDict.fromkeys(x) if counts[k] > 1].
– DSM
Commented Apr 4, 2013 at 13:33
1

@DSM -- Why do you need an OrderedDict there? why not just [k for k in x if counts[k] > 1]? Actually, that's better than what I have. I'll update...
– mgilson
Commented Apr 4, 2013 at 13:40
@mgilson: try it and see.. :^)
– DSM
Commented Apr 4, 2013 at 13:40

| Show 6 more comments

Jochen Ritzel · Accepted Answer · 2013-04-04 13:27:45Z

10

This is a short way to do it if the list is sorted already:

x = [1,2,2,2,3,4,5,6,6,7]

from itertools import groupby
print [key for key,group in groupby(x) if len(list(group)) > 1]

answered Apr 4, 2013 at 13:27

Jochen Ritzel

108k33 gold badges204 silver badges195 bronze badges

1

This will also work with python2.6 which is a problem with mine.
– mgilson
Commented Apr 4, 2013 at 13:29
@luchosrock: No, groupby groups consecutive elements
– Jochen Ritzel
Commented Apr 4, 2013 at 14:30

Add a comment |

Ivan · Accepted Answer · 2023-08-24 07:31:29Z

3

List Comprehension in combination with set() will do exactly what you want.

>>> list(set([i for i in x if x.count(i) > 1]))

[2, 6]

edited Aug 24, 2023 at 7:31

answered Oct 20, 2022 at 10:14

Ivan

631 gold badge1 silver badge6 bronze badges

Add a comment |

luchosrock · Accepted Answer · 2013-04-04 13:42:35Z

0

keepin' it simple:

array2 = []
aux = 0
aux2=0
for i in x:
    aux2 = i
    if(aux2==aux):
        array2.append(i)
    aux= i
list(set(array2))

That should work

edited Apr 4, 2013 at 13:42

answered Apr 4, 2013 at 13:34

luchosrock

70810 silver badges24 bronze badges

Won't that give [2,2,6]?
– DSM
Commented Apr 4, 2013 at 13:35
@DSM ahaha you're totally right, I edited my answer, Thanks :)
– luchosrock
Commented Apr 4, 2013 at 13:43

Add a comment |

L Ken · Accepted Answer · 2020-03-06 13:07:37Z

0

Not efficient but just to get the output, you could try:

import numpy as np

def check_for_repeat(check_list):
    repeated_list = []

    for idx in range(len(check_list)):
        elem = check_list[idx]
        check_list[idx] = None

        if elem in temp_list:
            repeated_list.append(elem)

    repeated_list = np.array(repeated_list)

    return list(np.unique(repeated_list))

edited Mar 6, 2020 at 13:07

answered Mar 6, 2020 at 12:45

L Ken

93 bronze badges

Add a comment |

Collectives™ on Stack Overflow

Keep duplicates in a list in Python

5 Answers 5

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

Linked

Related