How do I sort a list of objects based on an attribute of the objects?

Question

I have a list of Python objects that I want to sort by a specific attribute of each object:

[Tag(name="toe", count=10), Tag(name="leg", count=2), ...]

How do I sort the list by .count in descending order?

Dupe: stackoverflow.com/questions/157424/…, stackoverflow.com/questions/222752/…, stackoverflow.com/questions/327191/… — S.Lott, Dec 31, 2008 at 18:23
Sorting HOW TO for those who are looking for more info about sorting in Python. — Jeyekomon, May 30, 2018 at 8:48
apart from operator.attrgetter('attribute_name') you can also use functors as key like object_list.sort(key=my_sorting_functor('my_key')), leaving the implementation out intentionally. — vijay shanker, Apr 8, 2019 at 20:07

Mateen Ulhaq · Accepted Answer · 2023-02-27 21:52:12Z

1952

To sort the list in place:

orig_list.sort(key=lambda x: x.count, reverse=True)

To return a new list, use sorted:

new_list = sorted(orig_list, key=lambda x: x.count, reverse=True)

Explanation:

key=lambda x: x.count sorts by count.
reverse=True sorts in descending order.

More on sorting by keys.

edited Feb 27, 2023 at 21:52

Mateen Ulhaq

25.8k20 gold badges111 silver badges143 bronze badges

answered Dec 31, 2008 at 16:42

Kenan Banks

210k35 gold badges157 silver badges175 bronze badges

2

No problem. btw, if muhuk is right and it's a list of Django objects, you should consider his solution. However, for the general case of sorting objects, my solution is probably best practice.
– Kenan Banks
Dec 31, 2008 at 17:12
66

On large lists you will get better performance using operator.attrgetter('count') as your key. This is just an optimized (lower level) form of the lambda function in this answer.
– David Eyk
Dec 31, 2008 at 19:35
8

Thanks for the great answer. In case if it is a list of dictionaries and 'count' is one of its key then it needs to be changed like below : ut.sort(key=lambda x: x['count'], reverse=True)
– dganesh2002
Dec 8, 2016 at 21:20
2

I suppose it deserves the following update: if there is a need to sort by multiple fields, it could be achieved by consecutive calls to sort(), because python is using stable sort algorithm.
– uuu777
Feb 23, 2020 at 14:41
2

Thanks @KenanBanks, you were right. Annoyingly outlook was doing some weird things with calendar timezones so that some came through without the timezone details... no idea why!
– peetysmith
Apr 20, 2022 at 6:05

| Show 3 more comments

tzot · Accepted Answer · 2018-02-28 10:01:31Z

118

A way that can be fastest, especially if your list has a lot of records, is to use operator.attrgetter("count"). However, this might run on an pre-operator version of Python, so it would be nice to have a fallback mechanism. You might want to do the following, then:

try: import operator
except ImportError: keyfun= lambda x: x.count # use a lambda if no operator module
else: keyfun= operator.attrgetter("count") # use operator since it's faster than lambda

ut.sort(key=keyfun, reverse=True) # sort in-place

edited Feb 28, 2018 at 10:01

answered Dec 31, 2008 at 17:48

tzot

94.5k30 gold badges146 silver badges207 bronze badges

8

Here I would use the variable name "keyfun" instead of "cmpfun" to avoid confusion. The sort() method does accept a comparison function through the cmp= argument as well.
– akaihola
Jan 2, 2009 at 12:16
This doesn't seems to work if the object has dynamically added attributes, (if you've done self.__dict__ = {'some':'dict'} after the __init__ method). I don't know why it sould be different, though.
– tutuca
Jan 7, 2013 at 20:40
@tutuca: I've never replaced the instance __dict__. Note that "an object having dynamically added attributes" and "setting an object's __dict__ attribute" are almost orthogonal concepts. I'm saying that because your comment seems to imply that setting the __dict__ attribute is a requirement for dynamically adding attributes.
– tzot
Jan 9, 2013 at 23:14
1

@tzot: if I understand the use of operator.attrgetter, I could supply a function with any property name and return a sorted collection.
– IAbstract
Feb 24, 2016 at 18:16
1

For those looking for more info: wiki.python.org/moin/HowTo/Sorting#Operator_Module_Functions
– alekosot
Jan 20, 2017 at 15:01

| Show 4 more comments

Steven Rumbalski · Accepted Answer · 2012-11-15 14:17:06Z

99

Readers should notice that the key= method:

ut.sort(key=lambda x: x.count, reverse=True)

is many times faster than adding rich comparison operators to the objects. I was surprised to read this (page 485 of "Python in a Nutshell"). You can confirm this by running tests on this little program:

#!/usr/bin/env python
import random

class C:
    def __init__(self,count):
        self.count = count

    def __cmp__(self,other):
        return cmp(self.count,other.count)

longList = [C(random.random()) for i in xrange(1000000)] #about 6.1 secs
longList2 = longList[:]

longList.sort() #about 52 - 6.1 = 46 secs
longList2.sort(key = lambda c: c.count) #about 9 - 6.1 = 3 secs

My, very minimal, tests show the first sort is more than 10 times slower, but the book says it is only about 5 times slower in general. The reason they say is due to the highly optimizes sort algorithm used in python (timsort).

Still, its very odd that .sort(lambda) is faster than plain old .sort(). I hope they fix that.

edited Nov 15, 2012 at 14:17

Steven Rumbalski

45.3k9 gold badges89 silver badges121 bronze badges

answered Jun 12, 2009 at 19:54

Jose M Vidal

8,9946 gold badges44 silver badges51 bronze badges

4

Defining __cmp__ is equivalent to calling .sort(cmp=lambda), not .sort(key=lambda), so it isn't odd at all.
– tzot
Sep 9, 2019 at 7:46
@tzot is exactly right. The first sort has to compare objects against each other again and again. The second sort accesses each object only once to extract its count value, and then it performs a simple numerical sort which is highly optimized. A more fair comparison would be longList2.sort(cmp = cmp). I tried this out and it performed nearly the same as .sort(). (Also: note that the "cmp" sort parameter was removed in Python 3.)
– Bryan Roach
Oct 29, 2019 at 4:56
4

cmp was deprecated in Python 3: docs.python.org/3/howto/…
– neves
Feb 2, 2021 at 22:40

Add a comment |

jpp · Accepted Answer · 2018-02-11 11:35:22Z

84

Object-oriented approach

It's good practice to make object sorting logic, if applicable, a property of the class rather than incorporated in each instance the ordering is required.

This ensures consistency and removes the need for boilerplate code.

At a minimum, you should specify __eq__ and __lt__ operations for this to work. Then just use sorted(list_of_objects).

class Card(object):

    def __init__(self, rank, suit):
        self.rank = rank
        self.suit = suit

    def __eq__(self, other):
        return self.rank == other.rank and self.suit == other.suit

    def __lt__(self, other):
        return self.rank < other.rank

hand = [Card(10, 'H'), Card(2, 'h'), Card(12, 'h'), Card(13, 'h'), Card(14, 'h')]
hand_order = [c.rank for c in hand]  # [10, 2, 12, 13, 14]

hand_sorted = sorted(hand)
hand_sorted_order = [c.rank for c in hand_sorted]  # [2, 10, 12, 13, 14]

answered Feb 11, 2018 at 11:35

jpp

162k35 gold badges286 silver badges348 bronze badges

3

That's what I was looking for! Could you point us to some documentation that elaborates on why __eq__ and __lt__ are the minimum implementation requirements?
– FriendFX
Aug 7, 2019 at 0:23
4

@FriendFX, I believe it's implied by this: •The sort routines are guaranteed to use __lt__() when making comparisons between two objects...
– jpp
Aug 7, 2019 at 8:04
2

@FriendFX: See portingguide.readthedocs.io/en/latest/comparisons.html for Comparison and Sorting
– Cornel Masson
Feb 19, 2020 at 10:30
is there any way to automatically forward all special binary comparison methods to one attribute of the class instead of implementing __eq__, __lt__, __le__, __gt__, __ge__ and __ne__ and inside just forward to the attributes special function?
– j-hap
Jun 25, 2022 at 14:15
I just wrote my own decorator to accomplish what I wanted in the previous comment. it's really ugly.mbetter implement __eq__ and __lt__ and then use @functools.total_ordering to get the rest.
– j-hap
Jun 25, 2022 at 15:10

| Show 1 more comment

attrgetterattrgetter · Accepted Answer · 2008-12-31 19:00:33Z

47

from operator import attrgetter
ut.sort(key = attrgetter('count'), reverse = True)

answered Dec 31, 2008 at 19:00

attrgetter

Add a comment |

muhuk · Accepted Answer · 2008-12-31 17:10:00Z

18

It looks much like a list of Django ORM model instances.

Why not sort them on query like this:

ut = Tag.objects.order_by('-count')

answered Dec 31, 2008 at 17:10

muhuk

15.9k10 gold badges60 silver badges100 bronze badges

It is, but using django-tagging, so I was using a built-in for grabbing a Tag set by usage for a particular query set, like so: Tag.objects.usage_for_queryset(QuerySet, counts=True)
– Nick Sergeant
Dec 31, 2008 at 17:39

Add a comment |

Georgy · Accepted Answer · 2019-09-24 15:21:15Z

16

If the attribute you want to sort by is a property, then you can avoid importing operator.attrgetter and use the property's fget method instead.

For example, for a class Circle with a property radius we could sort a list of circles by radii as follows:

result = sorted(circles, key=Circle.radius.fget)

This is not the most well-known feature but often saves me a line with the import.

answered Sep 24, 2019 at 15:21

Georgy

13k7 gold badges65 silver badges74 bronze badges

I was surprised I couldn't just put Circle.radius. I don't understand why it doesn't work. Thanks for the pointer of using fget. I don't like defining lambdas if something else can do the job, and I don't like passing names of attributes as strings to attrgetter, it just feels wrong, to me.
– Peter Wood
Jul 5, 2023 at 10:46
1

@PeterWood The key needs to (1) be callable, (2) accept one argument, and (3) return a value which is sortable. sorted takes each item to be sorted, passes each of them in turn to the key, and sorts the values returned as proxies for the items themselves. Circle.radius doesn't work because property objects themselves aren't callable. But they do have an attribute called fget which is.
– ibonyun
Nov 9, 2023 at 0:46
@ibonyun thanks, that really helps clarify why
– Peter Wood
Nov 9, 2023 at 16:12

Add a comment |

rob · Accepted Answer · 2008-12-31 16:45:15Z

14

Add rich comparison operators to the object class, then use sort() method of the list.
See rich comparison in python.

Update: Although this method would work, I think solution from Triptych is better suited to your case because way simpler.

answered Dec 31, 2008 at 16:45

rob

37.3k2 gold badges58 silver badges66 bronze badges

Add a comment |

Furqan Ali · Accepted Answer · 2023-01-21 10:07:14Z

Also if someone wants to sort list that contains strings and numbers for e.g.

 eglist=[
     "some0thing3",
     "some0thing2",
     "some1thing2",
     "some1thing0",
     "some3thing10",
     "some3thing2",
     "some1thing1",
     "some0thing1"]

Then here is the code for that:

import re

def atoi(text):
    return int(text) if text.isdigit() else text

def natural_keys(text):
    return [ atoi(c) for c in re.split(r'(\d+)', text) ]

eglist=[
         "some0thing3",
         "some0thing2",
         "some1thing2",
         "some1thing0",
         "some3thing10",
         "some3thing2",
         "some1thing1",
         "some0thing1"
]

eglist.sort(key=natural_keys)
print(eglist)

cottontail · Accepted Answer · 2024-02-20 08:16:02Z

@Jose M Vidal's answer mentions something important: using rich comparisons (__lt__, __eq__ etc. as in @jpp's answer) makes sorting much slower than passing a key function (as in the accepted answer). However, they end up showing a test using __cmp__ method which doesn't exist in Python 3 (which tbf was indeed so much slower than passing key in Python 2).

I want to point out that even in Python 3.12.0, using rich comparisons as in @jpp's answer makes sorting much slower than passing a key function.

The results of a little timeit test is shown below where sorting using rich comparisons vs a lambda key function vs operator.attrgetter as key are compared. For a list with 10k items, it took about 18.8 ms when rich comparisons were used whereas it took 2.93 ms when a lambda key function was used and 2.47 ms when operator.attrgetter was used.

So, as @tzot mentioned, operator.attrgetter is faster than a lambda; however, using a key function in the first place instead of rich comparisons makes sorting over 5 times faster.

import timeit
import random
from operator import attrgetter

class Card(object):

    def __init__(self, rank):
        self.rank = rank

    def __eq__(self, other):
        return self.rank == other.rank

    def __lt__(self, other):
        return self.rank < other.rank

n = 100
random.seed(0)
lst = [Card(random.randrange(10000)) for _ in range(10000)]


min(timeit.repeat(lambda: sorted(lst), number=n))/n
# 0.018813106999732553

min(timeit.repeat(lambda: sorted(lst, key=lambda card: card.rank), number=n))/n
# 0.0029304770001908763

min(timeit.repeat(lambda: sorted(lst, key=attrgetter('rank')), number=n))/n
# 0.00247172600007616

Collectives™ on Stack Overflow

How do I sort a list of objects based on an attribute of the objects?

10 Answers 10

Not the answer you're looking for? Browse other questions tagged
python
list
sorting
reverse
or ask your own question.

Linked

Hot Network Questions

Collectives™ on Stack Overflow

10 Answers 10

Not the answer you're looking for? Browse other questions tagged pythonlistsortingreverse or ask your own question.

Linked

Related

Not the answer you're looking for? Browse other questions tagged
python
list
sorting
reverse
or ask your own question.