🎓 Doc Freemo :jpf: 🇳🇱: "holy hell python doesnt even have a built in red-…"

LucifarGundam @lucifargundam@qoto.org

@Ademan

> do you need one for reasons other than performance?

Yes and the performance difference would be very significant (so much so it is worth the extra development time)

But it also has the tendency to make code much simpler in many cases as well, which doesnt apply to me in this case, but also a valid reason for the need in any language.

> how much active maintenance does a basic data structure really need?

Probably not much but when I see something with very little time invested into it I worry it may be slow to fix bugs or not well flushed out. In this case I could be wrong (I am taking the risk and using it so we will see).

**LucifarGundam** @lucifargundam@qoto.org · Jan 15, 2021, 22:17

**LucifarGundam** @lucifargundam@qoto.org · Jan 15, 2021, 22:17

Jan 15, 2021, 22:17

@freemo
Because it's easier for people to learn and use in comparison to #c and #lisp ?

**Paul Ganssle** @pganssle@qoto.org · Jan 15, 2021, 23:14

**Paul Ganssle** @pganssle@qoto.org · Jan 15, 2021, 23:14

Jan 15, 2021, 23:14

@freemo I don't think I've ever needed such a thing, I suspect most people haven't, which is why there's nothing built-in.

You may find what you want by looking instead at how people solve similar problems in Python. Might be something out there that has a red-black tree or similar at its core but isn't advertised as such.

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 15, 2021, 23:22

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 15, 2021, 23:22

Jan 15, 2021, 23:22

@pganssle Depends what you mean by "need". Obviously you could always roll your own Treemap so in the strictest sense you never truly "need" it.

But presuming you are relatively active coder I can guarantee you that you have been in positions where having it would have either (or both) made your code simpler and/or more efficient.

Chances are you accomplished the same end result by either using a hashmap (ordinary dictionary) or a hashset, and just repeatedly sorting it by calling a sort function every time you needed a sorted version of the data (which you may have even done after every new insert in some extreme cases). Which means inefficient code (often significantly so) and more lines of code and more logic than you might need. Not to mention the potential for all sorts of issues on top of that if it is a multithreaded environment.

I have used and needed Tree maps more times than I can count though could have hacked around it easy enough if i had really wanted to.

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 15, 2021, 23:24

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 15, 2021, 23:24

Jan 15, 2021, 23:24

@pganssle And yes, Treemaps and Navigable maps in most languages are backed by balanced red-black trees underneath or something very similar. So if you've ever seen a sorted map/dict in a language it was likely a red-black tree.

**Paul Ganssle** @pganssle@qoto.org · Jan 16, 2021, 03:15

**Paul Ganssle** @pganssle@qoto.org · Jan 16, 2021, 03:15

Jan 16, 2021, 03:15

@freemo I am a very active coder, what I mean to say is that I have basically never been in a situation where my code was slow enough that it was a problem (and I do a lot of low-level library programming so that's often "slow at all") and benchmarks showed that dictionary hash map lookup is the bottleneck and switching implementations would fix it.

Python has a huge amount of overhead for most operations, so often the way to get high performance when you need it is to basically call an API that is a wrapper around an optimized implementation in a low-level language (e.g. numpy).

Admittedly, it may also be that the kind of programming I do is not likely to hit this problem, but I imagine the fact that there's *not* an obvious choice here means that it's either not a very common problem or this is an XY problem and you are overlooking more idiomatic solutions to your overall problem.

**Paul Ganssle** @pganssle@qoto.org · Jan 16, 2021, 03:19

**Paul Ganssle** @pganssle@qoto.org · Jan 16, 2021, 03:19

Jan 16, 2021, 03:19

@freemo I don't know the specifics of your situation so I may be way off base, but I often find that when people start using a new language or ecosystem they reach for the familiar idioms from their more comfortable language and don't realize that in this new language you are meant to structure things differently. I have been guilty of this myself many, many times.

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 16, 2021, 03:21

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 16, 2021, 03:21

Jan 16, 2021, 03:21

@pganssle Well having your code not be slow enough to force you to use it is, of course, not that uncommon. though in those cases your code was probably still significantly slower than it needed to be... But as i said it isnt **just** about code performance it is also about code complexity. Even if you never needed the performance improvement it would have forced your code in many situations to be more complex and bug-prone then it needed to be.

In terms of code elegance pretty much everyone has hit it if you ever needed a map where the keys had a sorting order (sorting it yourself every time you modify the map is added code and prone to issues).

In terms of performance you'd only need it if you needed a sorted-key map and that map was particularly large and needed to be re-sorted often.

**Paul Ganssle** @pganssle@qoto.org · Jan 16, 2021, 03:53

**Paul Ganssle** @pganssle@qoto.org · Jan 16, 2021, 03:53

Jan 16, 2021, 03:53

@freemo I mean, there's sortedcontainers, which is very popular and seems relatively well-maintained. It's implemented in pure python and I don't think it's a red-black tree, so if it's a performance bottleneck then you may need something else, but if you just want to simplify your code, then that should work.

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 16, 2021, 03:55

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 16, 2021, 03:55

Jan 16, 2021, 03:55

@pganssle in my case its a performance concern. I am basically implementing a pass-through array cache that remembers balues read from the underlying array-like object and stores it in a local array it needs to know what values it has seen though which makes it quite complicated as it would need to save ranges of seen values. Whole point of such a cahce is performance.

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 16, 2021, 04:06

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 16, 2021, 04:06

Jan 16, 2021, 04:06

@pganssle so the keys i a sorted containers implementation are required to be comparable and dont need to be hashable?

**Louis Couture 🏳️‍🌈** @louiscouture@qoto.org · Jan 17, 2021, 03:31

**Louis Couture 🏳️‍🌈** @louiscouture@qoto.org · Jan 17, 2021, 03:31

Jan 17, 2021, 03:31

@freemo i don’t think Python people do that kind of stuff

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 17, 2021, 03:44

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 17, 2021, 03:44

Jan 17, 2021, 03:44

@louiscouture What sort things? :)

**Louis Couture 🏳️‍🌈** @louiscouture@qoto.org · Jan 17, 2021, 03:46

**Louis Couture 🏳️‍🌈** @louiscouture@qoto.org · Jan 17, 2021, 03:46

Jan 17, 2021, 03:46

@freemo python is mostly for people who want to learn programming, and it’s mostly used for mathematics and machine learning / ai libraries
.

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 17, 2021, 03:48

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 17, 2021, 03:48

Jan 17, 2021, 03:48

@louiscouture while it is often a first choice in programming language for the noobs I wouldnt say its limited to that at all. It is often used for some pretty advanced math/AL/ML which is why it surprises me it lacks such fundamental features as a tree map considering they are pretty necessary to make a lot of those things if you want them to run efficiently.

**Louis Couture 🏳️‍🌈** @louiscouture@qoto.org · Jan 17, 2021, 03:53

**Louis Couture 🏳️‍🌈** @louiscouture@qoto.org · Jan 17, 2021, 03:53

Jan 17, 2021, 03:53

@freemo I’m still a student in software engineering and hasn’t really reached AI yet but doesn’t it make no sense to do some demanding calculations on a language that is really slow.

I get the appeal of interpreted languages as it can be run on multiple machines, but then why not use faster alternatives, like Java ?

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 17, 2021, 03:57

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 17, 2021, 03:57

Jan 17, 2021, 03:57

@louiscouture most mature languages will be close enough to as fast as any other not to matter in and of itself. It isnt that python isnt fast, its more that python doesnt make it easy or pleasant to make your code fast.. ITs multiprocessing element is shit and requires you to jump through hoops and many of its built-in libraries dont leverage multiple CPUs and obscure things away in such a way that it isnt always easy to build that in.. But all that said it can (and many examples of where it has been) made to do things as fast as any other language.

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 17, 2021, 23:39

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 17, 2021, 23:39

Jan 17, 2021, 23:39

@louiscouture
This is not the case. A lot of huge, non-math projects use Python.
@freemo

**Louis Couture 🏳️‍🌈** @louiscouture@qoto.org · Jan 18, 2021, 00:17

**Louis Couture 🏳️‍🌈** @louiscouture@qoto.org · Jan 18, 2021, 00:17

Jan 18, 2021, 00:17

@kirschwipfel @freemo like what

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 16:41

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 16:41

Jan 18, 2021, 16:41

@louiscouture @freemo
ERPnext, Odoo, Tryton (which itself is the base for other systems like GNU Health, Occhiolino), ansible, Slat, OpenStack, mailman, Plone, a lot of Web-Application (based on e.g. Django, Pyramid or Tornado (Facebooc)). BorgBackup, DockerCompose, GNUmed. Used a lot in InfoSec, e.g. in exploit development, detecting vulnerabilites, etc. e.g. GRR Rapid Response.
1/2

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 16:42

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 16:42

Jan 18, 2021, 16:42

@louiscouture @freemo

Reddit, spotify, Instagrem, Facebook, Google, NASA, CERN use it.
Many application use Python as scripting language, e.g. LibreOffice, Blender, GIMP, Inkscape, the GNU Debugger, ArcGIS.
Checkout more at https://www.python.org/about/quotes/ https://www.python.org/about/apps/ https://en.wikipedia.org/wiki/Python_(programming_language)#Uses https://github.com/pyinstaller/pyinstaller/wiki/Projects-Using-PyInstaller

2/2

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 16:52

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 16:52

Jan 18, 2021, 16:52

@louiscouture @freemo
I missed some introduction, making the message sound a bit harsh. Sorry.

Anyhow, as you can see Python is used for a huge variety of platforms and applications.

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 17:10

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 17:10

Jan 18, 2021, 17:10

@freemo
When I came from other programming languages to Python, I was missing trees, double-linked listes, sort-algos etc, too. But I quickly learned live being easier and me being more productive if I don't need to care about these.

Dicts are acceptable for most (of my) needs, being O(1) in average.
https://wiki.python.org/moin/TimeComplexity#dict

If I would dare for trees, I would checkout one of the many packages available https://pypi.org/search/?q=AVL+tree
(AVL being a real subset of red-black) or try numpy

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 18, 2021, 17:12

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 18, 2021, 17:12

Jan 18, 2021, 17:12

@kirschwipfel I fail to see how lacking fundemental tools that not just improve performance but reduce the complexity of your own code somehow makes life easier.

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 17:30

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 18, 2021, 17:30

Jan 18, 2021, 17:30

@freemo
Can't see how red-black trees improve performance here: dicts are O(1), for red-black trees it is O(log n).

Can't see how a tree reduces complexity of my code. In fact (beside performance) I don't want to care about the datastructure used. Just put elements in and get them out. Whether tree or dict: I would the interface expect to be the same.

Anyhow: Algorithms never belong to the language, but to the library.

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 18, 2021, 17:46

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 18, 2021, 17:46

Jan 18, 2021, 17:46

@kirschwipfel hashtrees are only O(1) when looking up a specific value, they are far less efficient when you wish to traverse the keys of a hashmap in sorted order or to ask "which key comes after key X in the sort order"

Hashmaps are, and should be, the prefered choice when they suit the task. I am not implying we should have tree maps and not hash maps, that would be just as harmful.. but a good programmer (one who wants to keep their code efficient and minimal in complexity) needs both and both are used in different situations, and each is significantly more performant in their respective applications.

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 21, 2021, 20:59

**Hartmut Goebel** @kirschwipfel@nerdculture.de · Jan 21, 2021, 20:59

Jan 21, 2021, 20:59

@freemo I agree their might be use-cases where a tree actually makes sense.

Please check out some of the many (AVL) tree packages at PyPi. I'm quite confident there are well-maintained ones.

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 21, 2021, 21:23

**🎓 Doc Freemo 🇳🇱** @freemo@qoto.org · Jan 21, 2021, 21:23

Jan 21, 2021, 21:23